AI-Powered Content Extractor is a command-line tool that extracts and summarizes content from PDF files and web pages using AI agents. It can process a single source or a batch of sources and generates a concise summary for each.
Features:
- Extract content from PDF files and web pages.
- Summarize the extracted content using AI.
- Support for batch processing of multiple sources.
- Customizable summary styles.
- Configurable AI model and API key.
Requirements:
- Python 3.9+
- Required dependencies (install using the command below)
pip install -r requirements.txt
python app.py extract "<URL_or_PDF_PATH>" -o <OUTPUT_PATH>
Example:
python app.py extract "https://example.com/article" -o summary.md
python app.py batch -f sources.txt -o summaries/
Where sources.txt contains a list of URLs or file paths (one per line).
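The one-entry-per-line format described above is simple to read programmatically. The sketch below is not taken from app.py; it only illustrates one way such a file could be parsed, and the function names (`classify_source`, `parse_sources`, `load_sources`) are hypothetical. It assumes that any entry starting with `http://` or `https://` is a web page and that everything else is a local file path.

```python
from pathlib import Path
from urllib.parse import urlparse


def classify_source(entry: str) -> str:
    """Return "url" for http(s) links and "file" for anything else (assumed rule)."""
    scheme = urlparse(entry).scheme.lower()
    return "url" if scheme in ("http", "https") else "file"


def parse_sources(text: str) -> list[tuple[str, str]]:
    """Turn the file contents into (kind, source) pairs, skipping blank lines."""
    sources = []
    for line in text.splitlines():
        entry = line.strip()
        if entry:
            sources.append((classify_source(entry), entry))
    return sources


def load_sources(path: str) -> list[tuple[str, str]]:
    """Read a sources file such as sources.txt from disk and parse it."""
    return parse_sources(Path(path).read_text(encoding="utf-8"))
```

For instance, a sources.txt holding `https://example.com/article` on one line and `reports/q1.pdf` on the next (the second path is made up for illustration) would yield one `"url"` entry and one `"file"` entry.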
You can set up your AI model and API key with the config command:
python app.py config -m <MODEL_ID> -k <API_KEY> -s detailed -l 500
Options:
- -m: AI model identifier.
- -k: API key for the AI service.
- -s: Summary style (concise, detailed, bullet-points, executive).
- -l: Maximum summary length.
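To make the option list concrete, here is a minimal sketch of how a `config` subcommand with these four flags could be declared using Python's standard `argparse` module. It is an illustration only: how app.py actually parses its arguments is not shown in this README, and the destination names (`model`, `api_key`, `style`, `max_length`) and the integer type for `-l` are assumptions. The unit of the maximum length (words, characters, or tokens) is not stated here either.

```python
import argparse

# Summary styles listed in the options above.
STYLES = ["concise", "detailed", "bullet-points", "executive"]


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with a single, hypothetical `config` subcommand."""
    parser = argparse.ArgumentParser(prog="app.py")
    subcommands = parser.add_subparsers(dest="command", required=True)

    config = subcommands.add_parser("config", help="Store model and summary settings")
    config.add_argument("-m", dest="model", help="AI model identifier")
    config.add_argument("-k", dest="api_key", help="API key for the AI service")
    config.add_argument("-s", dest="style", choices=STYLES, help="Summary style")
    config.add_argument("-l", dest="max_length", type=int, help="Maximum summary length")
    return parser


# Parse the same shape of command line as the config example above,
# with placeholder values for the model and key.
args = build_parser().parse_args(
    ["config", "-m", "my-model", "-k", "my-key", "-s", "detailed", "-l", "500"]
)
```

Restricting `-s` with `choices` means an unsupported style is rejected at parse time with a usage message instead of failing later during summarization.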
This project is licensed under the MIT License.
Feel free to contribute! Open an issue or submit a pull request with improvements or feature suggestions.
Happy extracting! 🚀