Pair Programming — NumPy and Pandas Project

1) Overview

This project was developed as a pair programming exercise to practice data analysis using NumPy and Pandas.
The goal is to explore, clean, transform, and integrate datasets while applying core data analytics concepts, from exploratory analysis to visualization.

2) Repository structure

PAIR-PROGRAMMING-NUMPY-AND-PANDAS-PROJECT/
├─ EDA.ipynb                      # Exploratory Data Analysis
├─ Group-By-and-Apply.ipynb       # Aggregations and custom functions
├─ Nulls-management.ipynb         # Handling missing values
├─ Numpy.ipynb                     # NumPy fundamentals
├─ Pandas.ipynb                    # Pandas basics
├─ Merge-and-Data-Cleaning.ipynb   # Data merging and cleaning
├─ vis_world_data.ipynb            # Data visualization
├─ medallas.csv                    # Olympics medal dataset (input)
├─ world_data_full_apply.csv       # Processed world data (output)
└─ README.md

3) Learning objectives

NumPy
- Array creation, slicing, reshaping, broadcasting, and vectorized operations.
Pandas
- DataFrame and Series manipulation, indexing, and selection.
EDA (Exploratory Data Analysis)
- Descriptive statistics, distributions, correlations.
GroupBy & Apply
- Aggregations, transformations, and custom apply functions.
Null management
- Identifying, imputing, and dropping missing values.
Merging & Cleaning
- Joining multiple datasets and ensuring consistency.
Visualization
- Plotting insights with Matplotlib/Seaborn.

4) How to run

Open the notebooks in Jupyter Notebook or VS Code (Jupyter extension).
Run cells sequentially in each notebook.

5) Skills demonstrated

Efficient use of NumPy arrays for numerical computation.
Data wrangling and manipulation with Pandas.
Handling real-world data quality issues (nulls, duplicates).
Combining multiple datasets into a unified view.
Generating insights through exploratory data analysis and visualization.
Collaborative coding through pair programming practices (shared design, debugging, and review).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Pair Programming — NumPy and Pandas Project

1) Overview

2) Repository structure

3) Learning objectives

4) How to run

5) Skills demonstrated

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
EDA.ipynb		EDA.ipynb
Group-By-and-Apply.ipynb		Group-By-and-Apply.ipynb
Merge-and-Data-Cleaning.ipynb		Merge-and-Data-Cleaning.ipynb
Nulls-management.ipynb		Nulls-management.ipynb
Numpy.ipynb		Numpy.ipynb
Pandas.ipynb		Pandas.ipynb
README.md		README.md
medallas.csv		medallas.csv
vis_world_data.ipynb		vis_world_data.ipynb
world_data_full_apply.csv		world_data_full_apply.csv

ana-nobre/Pair-Programming-NumPy-and-Pandas-Project

Folders and files

Latest commit

History

Repository files navigation

Pair Programming — NumPy and Pandas Project

1) Overview

2) Repository structure

3) Learning objectives

4) How to run

5) Skills demonstrated

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages