Skip to content

Tool for generating RDFS/OWL ontologies and knowledge graphs. It helps to test AI pipelines by creating synthetic data with reliable knowledge structure and characteristic graph patterns. It also aims to advance the topic of generating realistic RDF knowledge graphs through parametric generation and subgraph matching techniques.

License

Notifications You must be signed in to change notification settings

Orange-OpenSource/pygraft-gen

 
 

Repository files navigation

PyGraft-gen

Generate synthetic RDFS/OWL ontologies and RDF Knowledge Graphs at scale.

PyGraft-gen uses stochastic generation to produce ontologies and Knowledge Graphs with reliable structure while respecting OWL constraints, making it ideal for testing AI pipelines, benchmarking graph algorithms, and research scenarios where real data is sensitive or unavailable.

It also aims to advance the topic of generating realistic RDF Knowledge Graphs through parametric generation.

PyGraft-Gen is a major evolution of PyGraft, originally developed by Nicolas Hubert and awarded Best Resource Paper at ESWC 2024.

Typical workflows are:

  • Generate a synthetic RDFS/OWL ontology from statistical parameters
  • Generate an RDF Knowledge Graph from a synthetic ontology
  • Generate an RDF Knowledge Graph from a user-provided ontology

pygraft-gen_framework

Repository Structure:

.
├── evaluation/   # Subgraph matching research (experimental)
├── docs/         # Documentation source
└── src/          # PyGraft-gen library

The evaluation/ directory contains ongoing research on subgraph matching patterns and is separate from the main library.

Installation

Requirements: Python 3.10+, Java (optional, for reasoning)

pip:

pip install pygraft-gen

uv:

uv add pygraft-gen

poetry:

poetry add pygraft-gen

Verify the installation:

pygraft --help

See the installation documentation for setup details and the quickstart for complete examples.

Documentation

See the official documentation for guides, API reference, and examples.

Contributing

Contributions welcome! See CONTRIBUTING.md for guidelines.

Copyright

Copyright (c) 2024-2025, Orange and Nicolas HUBERT. All rights reserved.

License

MIT-License.

Maintainer

About

Tool for generating RDFS/OWL ontologies and knowledge graphs. It helps to test AI pipelines by creating synthetic data with reliable knowledge structure and characteristic graph patterns. It also aims to advance the topic of generating realistic RDF knowledge graphs through parametric generation and subgraph matching techniques.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 92.9%
  • Makefile 5.6%
  • HTML 1.1%
  • Shell 0.4%