curious-broccoli/text-analysis

text-analysis

reddit scraper, simple text summarizer

summarizer

  • for counting word frequency:

    • remove everything except letters
    • count the frequency of each word (tokenized, stemmed), skipping stopwords
    • normalize the frequency values
  • tokenize the text into sentences

  • turn each sentence into a list of stemmed words

  • for each sentence:

    • sum the frequency values of all its frequent words -> sentence score
    • CHANGES TO TRY:
      • score = score + 2 * (score / word count)
      • score = score / word count
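
A minimal sketch of the steps above, using only the Python standard library. It is not the repository's actual code. The tiny `STOPWORDS` set, the crude `stem` helper, the regex sentence splitter, normalizing by the highest count, and reading "word count" as the number of words in the sentence are all assumptions made for illustration; a real stemmer and stopword list (for example from NLTK) would likely replace them.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (assumption; a real list is much longer).
STOPWORDS = {"a", "an", "and", "are", "in", "is", "it", "of", "on", "the", "to"}


def stem(word):
    """Crude suffix stripper, a hypothetical stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Remove everything except letters, lowercase, split into words."""
    return re.sub(r"[^A-Za-z]+", " ", text).lower().split()


def word_frequencies(text):
    """Count stemmed non-stopwords, then normalize by the highest count
    (assumption: the outline only says "normalize")."""
    counts = Counter(stem(w) for w in tokenize(text) if w not in STOPWORDS)
    if not counts:
        return {}
    top = max(counts.values())
    return {word: n / top for word, n in counts.items()}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def score_sentence(sentence, freqs, variant="sum"):
    """Sum the frequency values of the sentence's stemmed words.

    variant="boosted": score + 2 * (score / word count)
    variant="average": score / word count
    """
    words = [stem(w) for w in tokenize(sentence)]
    if not words:
        return 0.0
    score = sum(freqs.get(w, 0.0) for w in words)
    if variant == "boosted":
        return score + 2 * (score / len(words))
    if variant == "average":
        return score / len(words)
    return score


def summarize(text, n=1, variant="sum"):
    """Return the n best-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freqs = word_frequencies(text)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: score_sentence(sentences[i], freqs, variant),
        reverse=True,
    )
    return [sentences[i] for i in sorted(ranked[:n])]


text = "Cats chase mice. Dogs chase cats and mice. The weather is nice."
print(summarize(text, n=1, variant="sum"))
print(summarize(text, n=1, variant="average"))
```

On this sample text the plain sum favours the longer second sentence, while both "changes to try" favour the shorter first one, because dividing by the word count penalizes sentences padded with low-frequency words.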
