Text_Summarizer is a simple Python project designed to summarize long text articles using Natural Language Processing (NLP) techniques. This project utilizes the NLTK library to process and summarize the text, and Streamlit for the user interface.
- Text Summarization: Enter text manually and receive a concise summary.
- Interactive UI: Simple and user-friendly interface powered by Streamlit.
- NLP Processing: Uses NLTK for tokenization, stopwords removal, and frequency-based summarization.
To set up the Text_Summarizer project, follow these steps:
- Clone the Repository:
git clone https://github.com/yaminibhole/Text_Summarizer.git
- Install the Required Dependencies: pip install -r requirements.txt
To run the Text_Summarizer application, use the following command: streamlit run app.py
- Input Text: The user inputs the text they want to summarize into the provided text area.
- Text Preprocessing: The text is cleaned to remove unwanted characters and stopwords.
- Frequency Table Creation: A frequency table of words is created after tokenization and stopwords removal.
- Sentence Scoring: Each sentence is scored based on the words it contains and their frequencies.
- Summary Extraction: The top sentences with the highest scores are selected to form the summary.
- NLTK: Natural Language Toolkit for NLP processing.
- Streamlit: Framework for creating interactive web applications.