A document classifier that reads PDFs in a given directory and tags the document with the highest-occurring animal or plant keyword.

rizarae-p/scipaper_classifier

Animal Tagger: A document classifier that reads the PDFs in a given directory and tags each document with its most frequently occurring animal or plant keyword.

📋 Prerequisites

  1. Python: A working Python environment with pip available.
  2. Jupyter Notebook: Needed to open and run the demo notebook.

🚀 Getting Started

Installation

  1. Install Required Libraries:
    pip install spacy feedparser pymupdf frontend
    python -m spacy download en_core_web_sm
    
  2. Clone this repository:
    git clone https://github.com/rizarae-p/scipaper_classifier.git
    cd scipaper_classifier
    

Usage

  1. Start Jupyter Notebook from the repository directory.
  2. Open demo_notebook3.ipynb and run its cells.
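
For orientation, here is a minimal sketch of the tagging idea described above: extract the text of each PDF, count keyword occurrences, and keep the most frequent one. It is not the notebook's actual code. The keyword list and the names top_keyword, pdf_text, and tag_directory are hypothetical, and matching is a plain exact-word comparison (no spaCy lemmatization, so "bees" would not count toward "bee").

```python
import re
from collections import Counter
from pathlib import Path

# Hypothetical keyword list for illustration only; the vocabulary used by
# the notebook is not shown in this README.
KEYWORDS = {"dog", "cat", "bee", "oak", "fern"}


def top_keyword(text, keywords=KEYWORDS):
    """Return the keyword that occurs most often in text, or None if none occur."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(token for token in tokens if token in keywords)
    return counts.most_common(1)[0][0] if counts else None


def pdf_text(path):
    """Extract the plain text of every page of a PDF using PyMuPDF."""
    import fitz  # PyMuPDF; imported here so top_keyword works without it

    with fitz.open(str(path)) as doc:
        return "\n".join(page.get_text() for page in doc)


def tag_directory(directory):
    """Map each PDF file name in directory to its most frequent keyword."""
    return {
        pdf.name: top_keyword(pdf_text(pdf))
        for pdf in sorted(Path(directory).glob("*.pdf"))
    }
```

Calling tag_directory("papers") on a hypothetical folder named papers would return a dictionary mapping each PDF file name to its tag, with None for documents that contain none of the keywords.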

🤝 Contributing

Pull requests are welcome! 🙌 For major changes, please open an issue first to discuss the desired change.
