sentence-clustering

The module exposes APIs for end to end sentence clutering.

Sentence clustering is done by first converting nlp-preprocessed sentences TF-IDF word vectors. Then reducing the dimensionality using SVD and then applying the HDB algorithm. The HDB algorithm is automatically tuned using a custom scoring function as described in https://towardsdatascience.com/how-to-cluster-in-high-dimensions-4ef693bacc6

The module also visualizes the sentences in 2 dimensions before and after clusterin using the TSNE algorithm.

Future score includes adding tags to each cluster to aid in analysis iof the clusters

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.idea		.idea
data		data
scripts		scripts
tests		tests
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

sentence-clustering

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

prashantnbangar/sentence-clustering

Folders and files

Latest commit

History

Repository files navigation

sentence-clustering

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages