- Quick Link to Production Branch
- Quick Link to Devel Branch
Babel was an attempt to develop a PDF document manager.
Actually it features:
a PDF browser that permit to navigate in the file system, display PDF documents and sort (move) them in a similar way than the Geeqie image viewer.
a GUI showing the PDF document metadatas, pages and corresponding text blocks.
tools to generate thumbnails, extract text from pages
a MuPDF binding using CFFI ( it need a dynamic library and API update )
a BibTeX parser
Lexique tool using British National Corpus
some experimental codes to extract metadata, text, and index them.
To go further, look at https://whoosh.readthedocs.io and https://www.elastic.co/products/elasticsearch
The documentation is available on the Babel Home Page.
Look at the installation section in the documentation.
Authors: Fabrice Salvaire
Started project in Python 2