Skip to content

Latest commit

 

History

History
23 lines (17 loc) · 605 Bytes

README.md

File metadata and controls

23 lines (17 loc) · 605 Bytes

HSE 2022 Coursework

Educational Analysis: Curriculum Clustering (and Data Mining)

v1.0

main-notebooks folder:

  • Text processing: parsing, lemmatization, purification - ✅
  • Doc embeddings: TF-IDF, Word2Vec, Doc2Vec - ✅
  • Clustering: K-Means, Gaussian Mixture, Spectral - ✅
  • Inner metrics: Xie-Benni - ✅

v2.0

extend-notebooks folder:

  • Dataset extension - ✅
  • Labeling - ✅

main-notebooks folder:

  • External metrics: B-Cubed - ✅
  • Aglorithms improvement and estimation - ✅
  • Visualization - TODO