Code for the automated download and OCR of FOIA files.
Updated Jun 19, 2022 - Python
This repository helps you scrape data from multiple websites. It identifies, downloads, and classifies the latest PDF files published on a website according to the user's requirements. This can be used to automate various operations involved in market research.
IEEE Xplore PDFs to JSON conversion utility
Analyzes resume data to classify candidates' resumes into categories using Python and ML models.
📑🧐 Python project for extracting text from resumes in .pdf, .doc and .docx formats based on the article by Omkar Pathak at https://omkarpathak.in/2018/12/18/writing-your-own-resume-parser
Extract tables from a PDF document, then crop and convert them to JPG files
An app that checks drawings in the "Kornit" drawing template
NLP model for extracting Chinese data from documents
[2023-01] A Python Flask API to extract metadata and text from PDF files. Asynchronous tasks are executed with a Celery queue and Redis workers. SQLite storage is managed by SQLAlchemy. Code is kept clean with Flake8 and isort, and coverage is tested with pytest-cov. See the documentation in the Readme.md and check the API contract with Swagger.
Splits a PDF file into separate documents and then uses natural language processing, machine learning models, and statistics to rank the documents by similarity to a single document.
A tool for extracting text (e.g., keywords, sentences) from PDFs | Supports CSV export | Based on pdfminer
This repository contains an AI Resume Analyzer that uses PDF parsing, database management, SQL-Python integration, and data extraction from PDFs. It offers skill recommendations and suggests videos and lectures for skill enhancement, aiming to improve resume quality and job prospects.
This tool searches for one or more keywords across a hierarchy of PDF files and generates an HTML report of the results.