This project processes and clusters text data using BERT embeddings, K-means, and dimensionality reduction. Visualizations include t-SNE plots and word clouds. Dataset and embeddings links are provided.
nlp
pytorch
transformer
pca-analysis
text-clustering
kmeans-analysis
bert-embeddings
wordcloud-visualization
textprocessing
t-snes
sihouette-score
-
Updated
Sep 2, 2024 - Jupyter Notebook