Cloud Computing Project - Hadoop LogAnalyser

Overview:

This project covers setting up Hadoop in Docker, developing MapReduce programs, and using them to solve specific log-analysis problems on a Hadoop cluster. The report includes detailed configurations, scripts, and solutions for each task.

File Structure:

The attached zip file contains separate folders for each part of the project:

Part 1: Setting up Hadoop in Docker
- Dockerfile
- bootstrap.sh
- core-site.xml
- hdfs-site.xml
- mapred-site.xml
- yarn-site.xml
Part 2: MapReduce Programs
- ngrammapper.py
- ngramreducer.py
- input.txt
Part 3: Log Analysis
- Mapper and Reducer scripts for each problem

Setting up Hadoop in Docker:

Build a Docker image based on the provided Dockerfile.
Generate a public-private RSA key pair using the ssh-keygen command.
Start the core components of the Hadoop cluster (HDFS and YARN) using the start-dfs.sh and start-yarn.sh scripts.

MapReduce Programs:

Analyze log files using MapReduce programs.
Solve specific problems based on log data, such as counting hits to a website directory and calculating n-gram frequencies.
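
As an illustration of the n-gram frequency task, here is a minimal sketch of the map and reduce logic. It is not the code in ngrammapper.py or ngramreducer.py. The function names, the choice of word-level bigrams (n=2), lowercasing, and the sample sentences are all assumptions made for the example.

```python
from itertools import groupby


def ngram_map(lines, n=2):
    """Yield (ngram, 1) for every run of n consecutive words on a line.

    Word-level n-grams and lowercasing are assumptions of this sketch.
    """
    for line in lines:
        words = line.lower().split()
        for i in range(len(words) - n + 1):
            yield " ".join(words[i:i + n]), 1


def ngram_reduce(pairs):
    """Sum the counts per key. Expects pairs sorted by key, which is how
    Hadoop delivers mapper output to a reducer."""
    for key, group in groupby(pairs, key=lambda kv: kv[0]):
        yield key, sum(count for _, count in group)


if __name__ == "__main__":
    # Made-up sample input, for illustration only.
    sample = ["the quick brown fox", "the quick red fox"]
    # sorted() stands in for Hadoop's shuffle-and-sort phase.
    for ngram, count in ngram_reduce(sorted(ngram_map(sample))):
        print(f"{ngram}\t{count}")
```

In a Hadoop Streaming job, the same logic would typically be split into two scripts that read from standard input and write tab-separated key-value pairs to standard output.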

Instructions for Running the Project:

Ensure Docker is installed on your system.
Execute the provided scripts to set up Hadoop in Docker.
Run MapReduce programs using the provided commands for log analysis.
Refer to individual parts for detailed instructions and solutions.
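
Before running a job on the cluster, it can help to check the map and reduce logic locally on a few log lines. The sketch below does this for the directory hit-count problem. It assumes the logs are in Apache Common Log Format, and the function names, the /images/ directory, and the sample log lines are hypothetical, not taken from the project's scripts or data.

```python
import re
from collections import Counter

# Assumed Common Log Format request field, e.g. "GET /images/logo.gif HTTP/1.0"
REQUEST = re.compile(r'"(?:GET|POST|HEAD) (\S+)')


def map_hits(lines, directory="/images/"):
    """Emit (directory, 1) for each request whose path starts with directory."""
    for line in lines:
        match = REQUEST.search(line)
        if match and match.group(1).startswith(directory):
            yield directory, 1


def reduce_hits(pairs):
    """Total the counts per key (in memory, for a local check only)."""
    totals = Counter()
    for key, count in pairs:
        totals[key] += count
    return dict(totals)


if __name__ == "__main__":
    # Invented sample lines, for illustration only.
    sample = [
        '10.0.0.1 - - [01/Jan/2024:00:00:01 +0000] "GET /images/logo.gif HTTP/1.0" 200 512',
        '10.0.0.2 - - [01/Jan/2024:00:00:02 +0000] "GET /index.html HTTP/1.0" 200 1024',
        '10.0.0.1 - - [01/Jan/2024:00:00:03 +0000] "GET /images/banner.png HTTP/1.0" 200 2048',
    ]
    print(reduce_hits(map_hits(sample)))
```

Lines that do not match the assumed request pattern are skipped rather than raising an error, so malformed log entries do not stop the count.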

Project Submission:

For any inquiries or assistance related to this project, please contact:

Bhavana Devulapally
Shusrita Venugopal
Neha Navarkar

Feel free to reach out to us for further clarification or support. Thank you for exploring our project!
