Skip to content

Extract and summarize from pdf files and web site url's using the power of AI Agents.

License

Notifications You must be signed in to change notification settings

berkayguzel06/Content_Summarizer_with_AI_Agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Powered Content Extractor

image

Overview

AI-Powered Content Extractor is a command-line tool that extracts and summarizes content from PDF files and web pages using the power of AI Agents. This tool allows you to process single or multiple sources efficiently and generate concise summaries.

Features

  • Extract content from PDF files and web pages.
  • Summarize the extracted content using AI.
  • Support for batch processing of multiple sources.
  • Customizable summary styles.
  • Configurable AI model and API key.

Installation

Prerequisites

  • Python 3.9+
  • Required dependencies (install using the command below)
pip install -r requirements.txt

Usage

Extract and Summarize a Single Source

python app.py extract "<URL_or_PDF_PATH>" -o <OUTPUT_PATH>

Example:

python app.py extract "https://example.com/article" -o summary.md

Extract and Summarize Multiple Sources

python app.py batch -f sources.txt -o summaries/

Where sources.txt contains a list of URLs or file paths (one per line).

Configuration

You can set up your AI model and API key with the config command:

python app.py config -m <MODEL_ID> -k <API_KEY> -s detailed -l 500

Options:

  • -m : AI model identifier.
  • -k : API key for the AI service.
  • -s : Summary style (concise, detailed, bullet-points, executive).
  • -l : Maximum summary length.

Demo

Windows.PowerShell.2025-03-02.21-17-38.mp4

License

This project is licensed under the MIT License.

Contribution

Feel free to contribute! Open an issue or submit a pull request with improvements or feature suggestions.

Author

Berkay Güzel

Happy extracting! 🚀

Releases

No releases published

Packages

No packages published

Languages