Skip to content

Latest commit

 

History

History
7 lines (6 loc) · 311 Bytes

README.md

File metadata and controls

7 lines (6 loc) · 311 Bytes

Topic-Modeling-Summarizer

PDF summarizer with Latent Dirichlet Allocation to save time when it comes to reading

To Do

  • Clean data more in research LDA model
  • possibly do separate analysis on tables in research
  • check distribution of tfidf to figure out what features can be tweaked/removed in research