Skip to content

A NLP based text summarizer. Can summarize PDF documents and Wikipedia articles too.

Notifications You must be signed in to change notification settings

suyesha07/PySummarize

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PySummarize

A NLP based text summarizer. Can summarize PDF documents and Wikipedia articles too.


Uses NLTK for Python to enable tokenisation and core NLP features for Extractive Summarisation, and Hugging Face Transformers for Abstractive Summarisation, with Streamlit for front-end.

PDF Summariser

Uses Streamlit upload feature, and PDFPlumber to parse text in the PDF. Issues with academic papers which causes some text to become garbled. Works well on non-technical text.

Wikipedia Summariser

Uses BeautifulSoup to extract text from HTML before passing through the text summarisation engine.

Textbox Summariser

Basic textbox to allow for copy and paste entry of text for summarisation.

Installation Instructions

  1. Install requirements - pip install -r requirements.txt
  2. Run streamlit - streamlit run app.py

In the demo, you can test out extractive summarisation.

Live demo here: https://suyesha07-pysummarize-app-kwx0pp.streamlitapp.com/

About

A NLP based text summarizer. Can summarize PDF documents and Wikipedia articles too.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published