GitHub - tekdogan/iccbdc-21: Experiment files for ICCBDC'21 paper "Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification"

This repository incorporates the material used in experiments for the paper Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification; which was submitted to 5th International Conference on Cloud and Big Data Computing ICCDBC'21, in Liverpool, UK. You can reach the paper via DOI link.

📁 spark

└ naive-bayes.py

    Script that implements Naive-Bayes Classifier using MLlib library of Spark.

📁 mapreduce

└ mahout-nb.sh

    Shell script to use Mahout's NB implementations.

└ convert-to-seq.py

    Script to convert datasets in libsvm format to sequential file format.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
mapreduce		mapreduce
spark		spark
LICENSE		LICENSE
README.md		README.md
paper-pic.jpg		paper-pic.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📁 spark

└ naive-bayes.py

📁 mapreduce

└ mahout-nb.sh

└ convert-to-seq.py

About

Releases

Packages

Languages

License

tekdogan/iccbdc-21

Folders and files

Latest commit

History

Repository files navigation

📁 spark

└ naive-bayes.py

📁 mapreduce

└ mahout-nb.sh

└ convert-to-seq.py

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages