Skip to content
olehmberg edited this page Jun 23, 2017 · 16 revisions

Welcome to the WInte.r Wiki. The pages in this Wiki introduce the most important concepts and parts of the framework. In addition, code examples provide a guideline how to start a data integration project.

The WInte.r framework comprises a data model and methods for various data integration tasks. The general workflow of an end-to-end data integration process looks like the following.

  1. Load the data into dataset objects
  2. Apply schema matching to get correspondences between attributes
  3. Apply identity resolution to get correspondences between records
  4. Transform the data into a consolidated schema using the correspondences between the attributes
  5. Use the correspondences between the records to perform data fusion and create one consolidated dataset

Contents