Skip to content

OGIT ontology details

plarem edited this page Aug 26, 2014 · 52 revisions

OGIT ontology details

Here we take a more technical view of OGIT.

Reading this will help you give a deeper understanding of the ontology and make it much easier to read the ontology documentation.

Reading this is a must if you want to contribute to the ontology.

Some terminology

The ontology defines

  • attributes
  • entities
  • verbs

Entities define the objects we are dealing with in OGIT. Any instance of an entitiy type can have attributes, which are predefined by attribute definitions from the ontology. Verb definitions declare which relationships are possible between instances of entity types.

Relation between and OGIT ontology and OGIT data

The actual data will be stored in GraphIT. GraphIT is based on a graph database and used the OGIT ontology to ensure that only valid data gets stored. We have the following relation between graph data and ontology definitions:

  • Each vertex in the graph must be a valid instance of an entity type defined by the ontology
  • All vertex properties must be valid w.r.t. the the attribute definitions from the ontology (what attribute validity means will be explained below)
  • All edges in the graph must have labels defined by some verb type in the ontology.
  • All triples defined by graph data must be valid according to the ontology (see below for more details)

Note: to prevent some confusion from now on we will use the terms attribute, entity, and verb whenever referring to elements from the ontology and property, vertex, and edge when talking about instance data.

OGIT ontology format description

The ontology is represented in YAML format. By convention we are maintaining an individual YAML file for each attribute, verb, and entity definition.

Attribute definitions

Defining an attribute in OGIT ontology will require a YAML stanza like this (the exact format can be found in Attribute.yaml.tpl):

- Attribute:
    id: http://www.purl.org/ogit/color
    name: color
    description: "can be used to store the color of objects"
    validation-type: ""
    validation-parameter: ""

The example contains the following details

Parameter Description
id a unique id for that ontology element (see "IDs of ontology elements" below)
name by convention this repeats the last part of ID. It is only used as display name.
description should describe the semantics of that attribute as clear as possible. This will be the main source of information if somebody needs to decide if that attribute can be re-used
validation-type, validation-parameter if an entity definition refers to that attribute definition and a vertex of that entity is to be created, then actual value of that attribute will be the validated according to the requirements defined by validation-type and validation-parameter. Empty values, as in our example, will skip any further validation. (see "Attribute Validation")

Entity definitions

Defining an entity in OGIT ontology will require a YAML stanza like this (the exact format can be found in Entity.yaml.tpl):

- Entity:
    id: http://www.purl.org/ogit/Tree
    name: Tree
    description: "This entity type models a tree"
    scope: SGO
    parent: http://www.purl.org/ogit/Factual

    attributes:
      any: false
      mandatory:
        - id: http://www.purl.org/ogit/type
          validation-type: ""
          validation-parameter: ""
      optional:
        - id: http://www.purl.org/ogit/color
          validation-type: ""
          validation-parameter: ""

The example contains the following details

Parameter Description
id a unique id for that ontology element (see "IDs of ontology elements" below)
name by convention this repeats the last part of ID. It is only used as display name.
description description of the purpose of that entity. This should be detailed enough to let others decide whether this entity can be re-used
scope either "SGO" (stating that this entity definition is considered as part of the core ontology) or "NTO" (meaning that this entity definition is part of some domain specific extension)
parent contains the id of another entity definition (see section about "Inheritance" below)
attributes used for the property validation of all instances (vertices) of that entity type:

'mandatory' defines all properties that must be present in a vertex to be valid. This corresponds to the Specific Node Required Attributes

'optional' defines all properties that can be present in a vertex and the semantics of which is well-defined. This corresponds to the Specific Node Best Practice Attributes

'any: true' ensures any vertex property with a name starting with '/' will be accepted. This corresponds to the Specific Node Free Attributes
'validation-type, validation-parameter' if an entity definition refers to that attribute definition and a vertex of
that entity is to be created, then actual value of that attribute will be the validated according to the requirements defined by validation-type and validation-parameter. Possible validation-types are 'regex', 'xml' and 'generator'. Empty values, as in our example, will skip any further validation. (see "Attribute Validation")

Verb definitions

Defining a verb in OGIT ontology will require a YAML stanza like this (the exact format can be found in Verb.yaml.tpl):

- Verb:
    id: http://www.purl.org/ogit/likes
    name: likes
    description: |
      expresses that the 'from' entity 'likes' the 'to' entity
    cardinality: 

    allowed:
     - from: http://www.purl.org/ogit/Person
       to: http://www.purl.org/ogit/Tree
     - from: http://www.purl.org/ogit/Animal
       to: http://www.purl.org/ogit/Person

The example contains the following details

Parameter Description
id a unique id for that ontology element (see "IDs of ontology elements" below)
name by convention this repeats the last part of ID. It is only used as display name.
description description of the semantics of this verb. This should be detailed enough to let others decide whether this verb can be re-used
cardinality possible values are: one2one, one2many, many2one, many2many. An empty value defaults to many2many. This is used by the GraphIT validator during edge creation. See https://github.com/thinkaurelius/titan/wiki/Type-Definition-Overview#cardinality-constraints for the specification of cardinality constraints
allowed This is used by GraphIT validator during edge creation. It is a list of ['from', 'to'] pairs. An edge of that verb type can be created if the pair ['entity type of source vertex', 'entity type of destination vertex'] is compatible with one of the ['from', 'to'] pairs. This comparison takes "inheritance" into account.

Attribute Validation

For most operations on a vertex ("create", "update", ...) attribute validation takes place. This consists of two steps. The basis for attribute validation will be entity definition of the type the vertex belongs to.

  • check the list of properties provided along with the vertex operation:
    • during "create" (and "replace") any attribute defined as "mandatory" must be present (an empty value is possible though). If it is present the value validation will take place.
    • if any attribute defined as "optional" is present the value validation will take place
    • if request contains properties neither defined by "mandatory" nor "optional" attributes then the 'any' switch is checked. If it is true and the property names start with a '/' it will be accepted (without any value validation). Otherwise the whole vertex data is considered invalid.
  • value validation will be based on validation-type, validation-parameter of the corresponding attribute definition. Empty values mean: no further validation.

IDs of ontology elements

IDs of ontology elements, e.g. http://www.purl.org/ogit/color must be unique across the whole ontology (i.e. across the union of all attributes, verbs, and entities).

To ensure uniqueness we register a suitable Persistent URL for each ontology element at www.purl.org.

Note: When using GraphIT REST API those IDs are always used with the prefix http://www.purl.org/. In our examples you would use ogit/color to address that attribute type within an application.

Naming conventions

to inherit or not to inherit?

Each entity definition contains a parent entity type. Using this the new entity type

  • "inherits" attribute declarations from ancestor types
  • "inherits" allowed verbs ('from' and 'to' end) from ancestor types. This is mainly for convenience: it presents ontology maintainers to repeat the same definitions again and again. Especially each entity will automatically have all the default attributes which are used by GraphIT for proper handling of vertex data (e.g. 'ogit/_id', 'ogit/_type', and so on).

Those parent/child relationships of entity definitions must form a tree (e.g. must not contain any loops)

Note: It is just to simplify the ontology definitions. It is not real inheritance as you know from OO design! For example: let's assume the ontology defines an entity 'ogit/Plant' and 'ogit/Tree' having the first one as parent type. Now let's assume you create a vertex A of type 'ogit/Plant' and a vertex B of type 'ogit/Tree' in GraphIT. If you then query for all vertices of type 'ogit/Plant' GraphIT will return A but not B!

Clone this wiki locally