Organizations

@analytics-club-iitm

Stars

Data-Science

25 repositories

📝 An awesome Data Science repository to learn and apply for real world problems.

25,878 6,011 Updated Mar 4, 2025

ZenML 🙏: The bridge between ML and Ops. https://zenml.io.

Python 4,441 479 Updated Mar 5, 2025

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.

Python 856 314 Updated Mar 4, 2025

We will keep updating the paper list about machine learning + causal theory. We also discuss related papers internally between NExT++ (NUS) and LDS (USTC) every week.

512 78 Updated Mar 3, 2023

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,506 165 Updated Feb 26, 2025

Blogs on Machine Learning and Deep learning

110 11 Updated Dec 6, 2021

Auton Survival - an open source package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Events

Python 333 77 Updated Apr 4, 2024

Temporal Causal Discovery Framework (PyTorch): discovering causal relationships between time series

Jupyter Notebook 490 109 Updated Oct 1, 2021

The Machine Learning & Deep Learning Compendium was a list of references in my private & single document, which I curated in order to expand my knowledge; it is now an open knowledge-sharing projec…

2,068 227 Updated Dec 26, 2024

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,789 3,734 Updated Jul 18, 2024

🦉 Data Versioning and ML Experiments

Python 14,223 1,204 Updated Mar 3, 2025

A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features

Jupyter Notebook 649 139 Updated Jan 3, 2024

🌊 Online machine learning in Python

Python 5,219 560 Updated Mar 3, 2025

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 10,465 1,689 Updated Mar 5, 2025

Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristi…

Python 5,846 585 Updated Feb 18, 2025

Labs and demos for courses for GCP Training (http://cloud.google.com/training).

Jupyter Notebook 8,062 5,922 Updated Feb 20, 2025

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collec…

Jupyter Notebook 2,692 122 Updated Jan 10, 2025

A curated list of references for MLOps

12,940 1,928 Updated Nov 21, 2024

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,064 3,642 Updated Mar 5, 2025

Code repository for the O'Reilly publication "Building Machine Learning Pipelines" by Hannes Hapke & Catherine Nelson

Jupyter Notebook 590 252 Updated Mar 25, 2023

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

Jupyter Notebook 1,321 715 Updated May 1, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,574 2,758 Updated Mar 5, 2025

Curated list of quality open datasets

836 108 Updated Feb 12, 2025

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

Scala 94 53 Updated May 19, 2021

I upload my notes

1 Updated Dec 18, 2022