Skip to content

Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"

License

Notifications You must be signed in to change notification settings

tekdogan/iccbdc-21

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

plot




This repository incorporates the material used in experiments for the paper Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification; which was submitted to 5th International Conference on Cloud and Big Data Computing ICCDBC'21, in Liverpool, UK. You can reach the paper via DOI link.

📁 spark

└ naive-bayes.py

    Script that implements Naive-Bayes Classifier using MLlib library of Spark.

📁 mapreduce

└ mahout-nb.sh

    Shell script to use Mahout's NB implementations.

└ convert-to-seq.py

    Script to convert datasets in libsvm format to sequential file format.

About

Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published