Approach:
- mined 337 topics
- for each topic, mined 100 documents
- counted the words in each document
- kept a running total of all words across the corpus
- kept an individual log of per-document word counts
- calculated probabilities in a Markov fashion
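The counting and probability steps above can be sketched as follows. This is a minimal illustration, not the actual implementation: the function names are hypothetical, whitespace tokenization is assumed, and "Markov fashion" is read as bigram transition probabilities P(next word | current word).

```python
from collections import Counter, defaultdict

def build_counts(documents):
    """Count words per document while keeping a running corpus total.

    `documents` is a list of raw text strings; the notes' corpus layout
    (337 topics x 100 documents each) would feed into this the same way.
    """
    total = Counter()    # running total of all words
    per_doc = []         # individual log of per-document word counts
    for doc in documents:
        counts = Counter(doc.lower().split())
        per_doc.append(counts)
        total.update(counts)
    return total, per_doc

def markov_probabilities(documents):
    """Estimate P(next | current) from adjacent word pairs (bigrams)."""
    transitions = defaultdict(Counter)
    for doc in documents:
        words = doc.lower().split()
        for cur, nxt in zip(words, words[1:]):
            transitions[cur][nxt] += 1
    return {
        cur: {nxt: n / sum(nexts.values()) for nxt, n in nexts.items()}
        for cur, nexts in transitions.items()
    }

# Toy corpus, purely for illustration.
docs = ["the cat sat", "the cat ran", "a dog ran"]
total, per_doc = build_counts(docs)
probs = markov_probabilities(docs)
print(total["the"])         # 2
print(probs["cat"]["sat"])  # 0.5
```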
Current Results:
- currently running tests (ostensibly working)
Usage:
- the main use case is disambiguating text computationally at large scale
- ideally, this module will serve as a hook for larger-scale purposes