A Step-by-Step Guide to Augmenting Digitized Historical Images with LAVIS, groundingDINO and Segment Anything
The project in its original state can be found here.
A WebUI for augmenting digitized historical images by generating captions, grounding the captions in the image, and segmenting their content.
- Simple WebUI
- Caption generation using BLIP and BLIP2 (see the captioning sketch after this list)
- Translation of captions to English using Helsinki-NLP (see the translation sketch below)
- Grounding of captions using 🦕 groundingDINO (see the grounding sketch below)
- Segmentation of images using Segment Anything and Agnostic segmentation (see the segmentation sketch below)
- Visualization of the results
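Captions are produced by a BLIP model loaded through LAVIS. The following is a minimal sketch, assuming the `blip_caption` model from the LAVIS model zoo and a local image file named `example.jpg`; the exact model name and options used by the WebUI may differ.

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load a BLIP captioning model and its matching image preprocessor from LAVIS
model, vis_processors, _ = load_model_and_preprocess(
    name="blip_caption", model_type="base_coco", is_eval=True, device=device
)

raw_image = Image.open("example.jpg").convert("RGB")
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)

# Generate a caption for the image
captions = model.generate({"image": image})
print(captions[0])
```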
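Translation relies on a Helsinki-NLP MarianMT checkpoint from the Hugging Face Hub. A minimal sketch, assuming French-to-English translation with the `Helsinki-NLP/opus-mt-fr-en` model (the checkpoint actually used by the WebUI may differ):

```python
from transformers import MarianMTModel, MarianTokenizer

# Assumed checkpoint: a French-to-English MarianMT model from Helsinki-NLP
model_name = "Helsinki-NLP/opus-mt-fr-en"
tokenizer = MarianTokenizer.from_pretrained(model_name)
model = MarianMTModel.from_pretrained(model_name)

caption_fr = "Un groupe de personnes devant la cathédrale."
batch = tokenizer([caption_fr], return_tensors="pt", padding=True)

# Translate and decode back to plain text
generated = model.generate(**batch)
caption_en = tokenizer.decode(generated[0], skip_special_tokens=True)
print(caption_en)
```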
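Grounding uses the inference helpers shipped with the groundingDINO repository. A minimal sketch, assuming a locally downloaded SwinT config and checkpoint (the paths, caption, and thresholds below are placeholders):

```python
import cv2
from groundingdino.util.inference import load_model, load_image, predict, annotate

# Placeholder paths: adjust to where the config and weights live in your setup
model = load_model(
    "groundingdino/config/GroundingDINO_SwinT_OGC.py",
    "weights/groundingdino_swint_ogc.pth",
)

image_source, image = load_image("example.jpg")

# Ground the (translated) caption in the image
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="a group of people in front of the cathedral",
    box_threshold=0.35,
    text_threshold=0.25,
)

annotated = annotate(image_source=image_source, boxes=boxes, logits=logits, phrases=phrases)
cv2.imwrite("annotated.jpg", annotated)
```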
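Segmentation passes the grounded boxes to Segment Anything's predictor. A minimal sketch, assuming the ViT-H SAM checkpoint and a single box prompt in pixel coordinates (the checkpoint path and box values are placeholders):

```python
import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# Assumed checkpoint: the ViT-H weights from the Segment Anything repository
sam = sam_model_registry["vit_h"](checkpoint="weights/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Use a bounding box from the grounding step (x0, y0, x1, y1 in pixels)
box = np.array([100, 150, 400, 500])
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
print(masks.shape, scores)
```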
- Clone the repository and its submodules
  ```bash
  git clone --recurse-submodules https://github.com/tgieruc/Heritage-in-the-digital-age.git
  ```
- Install the dependencies
  ```bash
  bash setup.sh
  ```
- Run the server
  ```bash
  python3 webui.py
  ```
Thanks to the following people and projects for their help and their work:
- The caption generation pipeline: Chenkai Wang
- The English to French translation model: MarianMT
- Captioning: LAVIS
- Phrase Grounding: GLIP, MDETR, groundingDINO
- The NLP model for ranking the expressions: DistilBERT
- One segmentation model was created using the Segmentation Models library
- The other segmentation models come from Segment Anything
You can reach me here 😊