Skip to content
#

data-science-pipeline

Here are 5 public repositories matching this topic...

Language: All
Filter by language

A Python wrapper for GNU parallel that naturally embeds shell code into Python scripts or Jupyter notebooks (no delimiter hell). Features parameter substitution, environment substitution, and cross-product generation to eliminate shell loops. Perfect for integrating Unix programs into Python environments for bioinformatics and data science.

  • Updated Jun 28, 2025
  • Python
Customer-Churn-Prediction

End-to-End data preprocessing and exploratory data analysis (EDA) techniques. It covers data cleaning, handling missing values, feature engineering, encoding categorical variables, scaling, and visualizations to understand patterns and insights in datasets

  • Updated Sep 24, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-science-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-science-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more