Skip to content

MapReduce,Spark,ML-algorithms(ID3,NaiveBayes,CompleteIndex,VectorialIndex),MongoDB

Notifications You must be signed in to change notification settings

mizadri/big-data

Repository files navigation

Info

Implementation in python of different machine learning algorithms:

  • Classification tree (ID3).
  • Complete Index (data compression: variable-bytes, elias-gamma or elias-delta).
  • Vectorial Index.
  • Naive Bayes.

Some more data handling exercises in mongoDB, pySpark to analyse the feeling from different classics of literature, weather forecasting, parsing logs from a server and some more basic data wrangling exercises.

About

MapReduce,Spark,ML-algorithms(ID3,NaiveBayes,CompleteIndex,VectorialIndex),MongoDB

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published