Machine Learning models for large datasets
-
Updated
Jan 19, 2018 - Gnuplot
Machine Learning models for large datasets
Built a python pipeline to preprocess blog posts (lemmatization, coreference resolution, identify collocations, etc) and built an LDA topic model to flag irrelevant comments under those posts.
Add a description, image, and links to the lda-gibbs-sampling topic page so that developers can more easily learn about it.
To associate your repository with the lda-gibbs-sampling topic, visit your repo's landing page and select "manage topics."