Skip to content

Repository for schema matching data and source code, used for PRISMA

Notifications You must be signed in to change notification settings

HPI-Information-Systems/prisma

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PRISMA - Privacy-Preserving Schema Matcher

This Project was implemented using the Schematch repository. Please check the Wiki for further information on the repository.

Setup

This project requires Java JDK 17 and Maven >=3.9.2. We suggest developing and running the project in IntelliJ IDEA. For setup, IntelliJ should guide you to install the correct Java JDK and download the Maven dependencies for you. In case of problems with Maven dependencies, try to reload the project.

You can run the project out of the box, as it comes with data and default configurations. For a successful run, you should see this log line at the end of your console:

[INFO ] <timestamp> [main] de.uni_marburg.schematch.Main - Ending Schematch

(taken from wiki)

To install and use PRISMA, LEAPME, and EmbDI we provide a Docker Compose file. Build the corresponding images by running:

sudo docker compose --verbose -f sota.docker-compose.yml build

Additionally, a Pyro server needs to be started. The files can be found in src/main/resources/pyro. Download the required metanome-cli-1.1.0.jar and pyro-distro-1.0-SNAPSHOT-distro.jar from here and here and place it in the pyro directory. Install the required python dependencies using poetry.

Running PRISMA

To run PRISMA, LEAPME, and EmbDI execute:

sudo docker compose --verbose -f sota.docker-compose.yml up

Start the Pyro server, which is required for Data Profiling using python pyro_server.py.

To specify the to be used matchers and their settings, check out the first_line_matchers.yaml. The to be used datasets can be selected in datasets.yaml.

Finally, running Schematch's Main function (e.g., using IntelliJ IDEA) will execute all defined scenarios.

The run's result can be found in the /results folder.

About

Repository for schema matching data and source code, used for PRISMA

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 66.2%
  • Python 33.3%
  • Other 0.5%