Presentation: Google Slides
Project write-up: Google Docs
.
├── BBCNews # Scrapy spider for scraping BBCNews(unused)
├── Bloomberg # Scrapy spider for scraping Bloomberg
├── decision_tree # word2vec, decision tree training and analysis code
├── out # output of normalized text
├── text_normalization # regular expression and text normalization code
└── README.md
Read README.md in each folder for instruction on how to run the code