Skip to content
This repository has been archived by the owner on Jul 16, 2022. It is now read-only.

MapReduce design and implementation of a Bloom Filter creation algorithm. Cloud Computing @ University of Pisa

License

Notifications You must be signed in to change notification settings

edoardoruffoli/BloomFilter-MapReduce

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

88 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BloomFilter-MapReduce

Project developed for the Cloud Computing course of the Master of Artificial Intelligence and Data Engineering at the University of Pisa.

This project consists in the design and implementation of a Bloom Filter for IMDb datasets using MapReduce (Hadoop and Spark frameworks).

Repository

The repository is organized as follows:

  • dataset/ contains the IMDb dataset stored in film_ratings.txt
  • docs/ contains the report and the assignment
  • hadoop/ contains the Hadoop implementation and test
  • results/ contains testing results and analysis
  • spark/ contains the Spark implementation and test

Contributors

About

MapReduce design and implementation of a Bloom Filter creation algorithm. Cloud Computing @ University of Pisa

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •