Skip to content

The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.

Notifications You must be signed in to change notification settings

Mariam-iftikhar/BigData

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BigDataTechnologies

Key Highlights:

Hadoop: Implemented MapReduce jobs for large-scale data processing.

HBase: Developed and managed scalable, high-performance NoSQL databases.

Hive: Executed SQL-like queries for data warehousing and analytical tasks.

Spark: Built real-time and batch processing applications to extract valuable insights.

Explore the repository to see practical applications of these technologies and gain insights into big data solutions on AWS EMR.

About

The repository showcases a series of exercises and projects focused on big data processing using Hadoop, HBase, Hive, and Spark with Python. Hosted on AWS EMR, these projects demonstrate efficient data handling and processing techniques, leveraging the power of cloud computing to tackle complex data challenges.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published