Skip to content
olehmberg edited this page Jun 19, 2017 · 16 revisions

Welcome to the WInte.r Wiki. The pages in this Wiki introduce the most important concepts and parts of the framework. In addition, code examples provide a guideline how to start a data integration project.

The WInte.r framework comprises a model and implementations for various data integration tasks. The general workflow of an end-to-end data integration process looks like the following.

  • Load the data into dataset objects
  • Apply schema matching to get correspondences between attributes
  • Apply identity resolution to get correspondences between records
  • Transform the data into a consolidated schema using the correspondences between the attributes
  • Use the correspondences between the records to perform data fusion and create one consolidated dataset

Contents