Skip to content
#

data-normalization

Here are 109 public repositories matching this topic...

The PyDI framework provides methods for end-to-end data integration. The framework covers all steps of the integration process, including schema matching, data translation, entity matching, and data fusion. The framework offers traditional string-based methods as well as modern LLM- and embedding-based techniques for these tasks.

  • Updated Sep 26, 2025
  • Python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

  • Updated Aug 19, 2023
  • Jupyter Notebook
Finding-Donors-for-Charity-using-Machine-Learning

Machine Learning Nano-degree Project : To help a charity organization identify people most likely to donate to their cause

  • Updated Oct 19, 2019
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-normalization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-normalization topic, visit your repo's landing page and select "manage topics."

Learn more