Skip to content

A comprehensive portfolio of data-driven Python projects focusing on real-world datasets, exploratory analysis, machine learning, and storytelling with data. Each project demonstrates end-to-end analytical thinking using modern data science libraries and tools.

Notifications You must be signed in to change notification settings

joaquinrojash/Python-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🐍 Python Projects Portfolio

Welcome to my Python Projects Portfolio, a curated collection of data science and analytics projects designed to showcase my technical skills, problem-solving ability, and storytelling with data. Each project addresses real-world datasets and business questions using Python and industry-relevant tools.


🧰 Tools & Technologies Used

Throughout these projects, I have utilized a wide range of tools and libraries, including:

  • Languages: Python, SQL
  • Libraries: pandas, NumPy, matplotlib, seaborn, scikit-learn, plotly, statsmodels
  • Techniques: Data Cleaning, EDA, Machine Learning, Clustering, Predictive Modeling, Hypothesis Testing
  • Data Formats: CSV, Excel, TSV, Parquet
  • Other: Jupyter Notebooks, Git, Markdown

📁 Project Summaries

Each folder contains a standalone project with its own README.md, dataset(s), and Jupyter notebook(s). Here's a high-level overview:

1. 🔍 Analyzing Crime in Los Angeles

Analyze crime patterns across LA neighborhoods to uncover trends and hotspots.

Skills: Geospatial analysis, time series trends, data visualization.

2. 🛍️ Building a Retail Data Pipeline

Clean, aggregate, and analyze e-commerce data for Walmart to generate insights.

Skills: Data wrangling, pipeline design, customer segmentation.

3. 🏦 Cleaning Bank Marketing Campaign Data

Prepare and explore marketing data for a Portuguese bank to improve campaign outcomes.

Skills: Feature engineering, exploratory analysis, data cleaning.

4. 🐧 Clustering Antarctic Penguin Species

Apply unsupervised learning to cluster penguin species based on biological features.

Skills: Clustering (K-Means), EDA, classification-ready data prep.

5. 👥 Customer Analytics – Data Modeling Preparation

Explore customer behavior to prepare data for future predictive modeling.

Skills: Data preprocessing, segmentation, dimensionality reduction.

6. 🏙️ Exploring Airbnb Market Trends

Understand listing performance and pricing dynamics in NYC's Airbnb market.

Skills: Time-series, text analysis, multi-format data handling (CSV, TSV, Excel).

7. 📚 NYC Public School Test Result Analysis

Investigate disparities in standardized SAT test results across New York City schools.

Skills: Educational data analysis, socioeconomic impact assessment.

8. ⚽ FIFA 21 Data Cleaning

Clean and format messy video game data from FIFA 21 for analytical use.

Skills: Advanced cleaning, missing value imputation, data reshaping.

9. 🧪 Hypothesis Testing: Men's vs Women's Soccer

Statistically compare the performance of men's and women's national soccer teams.

Skills: Hypothesis testing, inferential statistics, visual storytelling.

10. 🎬 Investigating Netflix Movies

Dive into Netflix movie metadata to explore genres, trends, and viewer engagement.

Skills: EDA, categorical analysis, content trends.

11. 🚗 Modeling Car Insurance Claim Outcomes

Predict the likelihood of a customer filing a car insurance claim.

Skills: Binary classification, predictive modeling, ROC/AUC evaluation.

12. 🧾 Performing a Code Review

Review and improve a colleague’s notebook analyzing smartphone data.

Skills: Peer review, visualization improvements, best practices.

13. 📀 Predicting Movie Rental Durations

Build a predictive model to estimate movie rental durations using customer behavior.

Skills: Regression modeling, EDA, business use case simulation.

14. 🌾 Predictive Modeling for Agriculture

Use soil quality data to determine conditions for optimal crop yield.

Skills: Environmental data modeling, agriculture analytics, supervised learning.

15. 🏅 Visualizing Nobel Prize History

Analyze and visualize over a century of Nobel Prize winners to uncover global trends.

Skills: Historical data storytelling, visualization, gender/nationality distribution.


📈 Key Takeaways

  • Proven ability to analyze structured and unstructured data across various domains.
  • Strong focus on clean, reproducible code and storytelling with visualizations.
  • Applied end-to-end project lifecycle: from data collection to modeling and insights.

📬 Let's Connect!

If you're an employer, collaborator, or fellow data enthusiast, feel free to reach out. I'm always open to new opportunities and challenges.

📧 joaquinrojash@hotmail.com
🌐 LinkedIn
🐙 GitHub Profile


About

A comprehensive portfolio of data-driven Python projects focusing on real-world datasets, exploratory analysis, machine learning, and storytelling with data. Each project demonstrates end-to-end analytical thinking using modern data science libraries and tools.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published