This repo contains all supplementary materials for the Cloud Computing course taught at IUST in Fall 2022.
We designed a hands-on Hadoop assignment that consists of three phases:
- Installing three VMs on the local machine
- Setting up Hadoop and HDFS
- Implementing three Map-Reduce programs for big data processing
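
As a rough illustration of the Map-Reduce model used in the third phase, the following Python sketch simulates the map, shuffle, and reduce steps in memory. It is not one of the assignment's programs and does not use Hadoop; `map_reduce`, `word_mapper`, and `sum_reducer` are hypothetical names chosen for this example.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Simulate Map-Reduce in memory (illustrative only, no Hadoop involved)."""
    groups = defaultdict(list)
    # Map phase: each record may emit any number of (key, value) pairs.
    for record in records:
        for key, value in mapper(record):
            # Shuffle phase: collect all values that share a key.
            groups[key].append(value)
    # Reduce phase: collapse each key's values into a single result.
    return {key: reducer(key, values) for key, values in sorted(groups.items())}


def word_mapper(line):
    """Hypothetical mapper: emit (word, 1) for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def sum_reducer(key, values):
    """Hypothetical reducer: add up the counts emitted for one key."""
    return sum(values)


if __name__ == "__main__":
    lines = ["big data", "big clusters"]
    print(map_reduce(lines, word_mapper, sum_reducer))
    # prints {'big': 2, 'clusters': 1, 'data': 1}
```

On a real cluster the shuffle step is handled by the framework, so only the mapper and the reducer are written by hand.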
You can download the dataset from the following link. Each record contains these fields:
- tconst (string) – alphanumeric unique identifier of the title
- titleType (string) – the type/format of the title (e.g. movie, short, tvseries, tvepisode, video, etc.)
- primaryTitle (string) – the more popular title / the title used by the filmmakers on promotional materials at the point of release
- originalTitle (string) – original title, in the original language
- isAdult (boolean) – 0: non-adult title; 1: adult title
- startYear (YYYY) – represents the release year of a title. In the case of TV Series, it is the series start year
- endYear (YYYY) – TV Series end year. ‘\N’ for all other title types
- runtimeMinutes – primary runtime of the title, in minutes
- genres (string array) – includes up to three genres associated with the title
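
The field list above can be turned into a small record parser. The sketch below is one possible way to do it in Python and rests on assumptions that the list does not state: that each line is tab-separated with the nine fields in the order shown, that `\N` marks any missing value (the list only mentions it for endYear), and that genres are comma-separated. The `Title` class, `parse_title`, `genre_mapper`, and the sample row are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterator, List, Optional, Tuple

MISSING = r"\N"  # assumed marker for a missing value


@dataclass
class Title:
    tconst: str
    title_type: str
    primary_title: str
    original_title: str
    is_adult: bool
    start_year: Optional[int]
    end_year: Optional[int]
    runtime_minutes: Optional[int]
    genres: List[str]


def _int_or_none(field: str) -> Optional[int]:
    """Convert a numeric field, mapping the missing marker to None."""
    return None if field == MISSING else int(field)


def parse_title(line: str) -> Title:
    """Parse one line, assuming nine tab-separated fields in the listed order."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}")
    tconst, ttype, primary, original, adult, start, end, runtime, genres = fields
    return Title(
        tconst=tconst,
        title_type=ttype,
        primary_title=primary,
        original_title=original,
        is_adult=(adult == "1"),
        start_year=_int_or_none(start),
        end_year=_int_or_none(end),
        runtime_minutes=_int_or_none(runtime),
        genres=[] if genres == MISSING else genres.split(","),
    )


def genre_mapper(line: str) -> Iterator[Tuple[str, int]]:
    """Hypothetical mapper: emit (genre, 1) for each genre of a title."""
    for genre in parse_title(line).genres:
        yield genre, 1


if __name__ == "__main__":
    # Made-up sample row, not taken from the real dataset.
    sample = "tt0000000\tshort\tExample\tExample\t0\t1999\t\\N\t12\tComedy,Drama"
    print(parse_title(sample))
    print(list(genre_mapper(sample)))
```

A mapper like `genre_mapper` would pair with a summing reducer to count titles per genre; whether that matches any of the three assigned programs is not stated here.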
Introduction to Big Data and HDFS: Arian Boukani
What is Map-Reduce: Arian Boukani
VirtualBox - Windows, Mac (Intel) VM installation: M.Reza Dorudian
Multipass - Mac (Apple Silicon & Intel), Windows, and Linux VM installation: M.Sadra Haeri Asadi
Hadoop installation: M.Reza Dorudian
MapReduce Examples: S.Alireza Ezaz