elegans-io/MHPC_ScalaSpark_Datasets

HPCC_Datasets

  • sentences.utf8.clean.txt: cleaned UTF-8 list of sentences derived from the machine-learning-databases collection at archive.ics.uci.edu
  • HPCC_W2V_REDUCED_TXT: reduced word2vec model in text form (derived from GoogleNews-vectors-negative300.bin.gz), containing only the words that occur in sentences.utf8.clean.txt
  • HPCC_W2V_REDUCED_MODEL: Spark MLlib serialized version of HPCC_W2V_REDUCED_TXT
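
The reduction described above (keeping only the vectors whose word occurs in the sentence list) could be sketched in plain Scala roughly as follows. This is an illustration, not the procedure actually used to build these files. It assumes one sentence per line with whitespace-separated tokens, and a text vector layout of `<word> <v1> ... <vN>` per line; neither layout is confirmed by this repository. The object name `ReduceW2V` and its methods are hypothetical.

```scala
// Sketch only: the file layouts below are assumptions, not confirmed by this repository.
object ReduceW2V {

  // Collect the distinct whitespace-separated tokens of all sentences.
  // Tokens are kept case-sensitive and are not normalised in any way.
  def vocabulary(sentences: Iterator[String]): Set[String] =
    sentences.flatMap(_.split("\\s+")).filter(_.nonEmpty).toSet

  // Parse one line of the assumed layout "<word> <v1> <v2> ... <vN>".
  // Returns None for lines with no numeric part or with unparsable numbers.
  def parseLine(line: String): Option[(String, Array[Float])] = {
    val parts = line.trim.split("\\s+")
    if (parts.length < 2) None
    else scala.util.Try(parts.head -> parts.tail.map(_.toFloat)).toOption
  }

  // Keep only the vectors whose word occurs in the vocabulary.
  // A "<count> <dim>" header line, if present, parses as a word but is
  // dropped here unless the count string itself is in the vocabulary.
  def reduce(vectors: Iterator[String], vocab: Set[String]): Map[String, Array[Float]] =
    vectors.flatMap(parseLine).filter { case (word, _) => vocab.contains(word) }.toMap
}
```

With real files, the two iterators could come from `scala.io.Source.fromFile(path, "UTF-8").getLines()`. Holding the result in a `Map` is only reasonable because the reduced vocabulary is small; for the full GoogleNews model a streaming write would be the safer choice.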
