This repository hosts a set of libraries and a command-line tool for automating parts of the onboarding workflow. It gives the user the ability to apply rule-based mapping automation, ingest multiple source files, review loadsheet consistency, and validate entity definitions against a pre-defined ontology (i.e., Google's Digital Buildings Ontology).
This repo contains the following critical pieces:
- A well-defined ontology (`./ontology`)
- A command line interface for dynamically building and checking loadsheets (`./programs/cli.py`)
- Associated support libraries for the command line interface (and for future enhancement):
- An ontology validator
- A loadsheet validator
- A handler class that sits atop all the relevant classes
- A rules engine for applying regular expression pattern matching
- A representations class set for converting the loadsheet into ontology-usable objects
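As a rough sketch of how these layers relate (the class names here are hypothetical, not the repo's actual API):

```python
# Hypothetical layering sketch -- names are illustrative only.
class RulesEngine:
    """Applies regex-based mapping rules to raw points."""

class LoadsheetValidator:
    """Checks loadsheet rows for consistency and completeness."""

class OntologyValidator:
    """Validates entity definitions against the ontology."""

class Representations:
    """Converts loadsheet rows into ontology-usable objects."""

class Handler:
    """Sits atop the other classes and coordinates the workflow."""
    def __init__(self) -> None:
        self.rules = RulesEngine()
        self.loadsheet_validator = LoadsheetValidator()
        self.ontology_validator = OntologyValidator()
        self.representations = Representations()
```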
This repo requires the following libraries to be installed prior to use:
- pyyaml (for parsing YAML documents)
- pyfiglet (for fancy CLI name)
- openpyxl (for Excel read/write)
- pandas (for loadsheet backend)
- ruamel.yaml (for round-trip YAML processing)
If not already installed, you can install the libraries by running `requirements.py` from your command line:
>>> python requirements.py
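Alternatively, the same dependencies can be installed directly with pip (these are the published package names):
>>> pip install pyyaml pyfiglet openpyxl pandas ruamel.yaml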
Start the Command Line Interface (LoadBoy2000):
- Run the program:
>>> python cli.py
Loadsheet process:
- Prepare the loadsheet
  - Obtain a point list (in XLSX or CSV format)
  - Format the point list to adhere to the loadsheet template sheet
  - Run the RULE ENGINE over the data (see the sketch after this list)
  - Manually review the unmapped points
- Validate the loadsheet
- Match to existing DBO types
- Create new types, as needed
- Apply types to the loadsheet
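The RULE ENGINE step is regex-driven. A minimal sketch of the idea follows; the rule shape shown is illustrative, not the actual format of `google_rules.json`:

```python
import re

# Illustrative rules: a regex over the raw point name mapped to a standard
# field name. The real rules file format may differ.
RULES = [
    (re.compile(r"(zn|zone).*(temp|tmp)", re.IGNORECASE), "zone_air_temperature_sensor"),
    (re.compile(r"(sa|supply).*(temp|tmp)", re.IGNORECASE), "supply_air_temperature_sensor"),
]

def map_point(raw_name: str) -> str | None:
    """Return the first matching standard field name, or None if unmapped."""
    for pattern, standard_field in RULES:
        if pattern.search(raw_name):
            return standard_field
    return None  # unmapped: left for manual review

print(map_point("VAV-101.ZN-TMP"))  # -> zone_air_temperature_sensor
```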
Example workflow:

1. Import the ontology:
>>> import ontology '../ontology/yaml/resources'
If successful, you should get CLI confirmation.
Manual (optional) unit tests:
- Add a fake field to the field list ('bacon_sensor') -- should return error
- Add a fake field with valid subfields ('supply_sensor') -- will NOT return an error.
- Add a new type with a fake field -- should return error
- Add duplicate fields to fake type -- should return error
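The 'bacon_sensor' and 'supply_sensor' cases above differ because field names are validated subfield by subfield: 'supply' and 'sensor' are defined subfields, while 'bacon' is not. A minimal sketch of that check, with a hypothetical hard-coded subfield set standing in for the ontology's YAML definitions:

```python
# Hypothetical subset of valid subfields; the real set comes from the
# ontology's YAML resources (parsed with pyyaml/ruamel.yaml).
VALID_SUBFIELDS = {"supply", "return", "zone", "air", "temperature", "sensor", "setpoint"}

def subfields_are_valid(field_name: str) -> bool:
    """A field passes only if every '_'-separated subfield is defined."""
    return all(part in VALID_SUBFIELDS for part in field_name.split("_"))

print(subfields_are_valid("bacon_sensor"))   # False -- 'bacon' is not a subfield
print(subfields_are_valid("supply_sensor"))  # True  -- both subfields exist
```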
2. Clean the raw loadsheet:
>>> clean '../loadsheet/Loadsheet_ALC.xlsx'
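In spirit, the clean step reshapes a raw point export toward the loadsheet template. A hedged pandas sketch of that kind of operation (the actual `clean` command's behavior and output columns may differ):

```python
import pandas as pd  # pandas uses openpyxl for .xlsx files

# Read the raw export, drop fully blank rows, and write a tidied copy.
raw = pd.read_excel("../loadsheet/Loadsheet_ALC.xlsx")
cleaned = raw.dropna(how="all").reset_index(drop=True)
cleaned.to_excel("../loadsheet/Loadsheet_ALC_Clean.xlsx", index=False)
```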
3. Import the cleaned loadsheet:
>>> import loadsheet '../loadsheet/Loadsheet_ALC.xlsx'
If successful, you should get CLI confirmation.
4. Normalize the loadsheet (AKA apply the ruleset):
>>> normalize '../resources/rules/google_rules.json'
If successful, you should get CLI confirmation.
5. Export to a new loadsheet for review:
>>> export excel '../loadsheet/Loadsheet_ALC_Normalized.xlsx'
Rules should have been applied. You should see a new file with the normalized columns (e.g., `required`, `assetName`, and `standardFieldName`) filled in.
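To gauge how much manual review remains, the exported file can also be spot-checked outside the CLI, e.g. with pandas (assuming the `standardFieldName` column described above):

```python
import pandas as pd

df = pd.read_excel("../loadsheet/Loadsheet_ALC_Normalized.xlsx")
# Rows the rule engine could not map are left without a standardFieldName.
unmapped = df[df["standardFieldName"].isna()]
print(f"{len(unmapped)} of {len(df)} points still need manual mapping")
```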
6. Perform a manual review and repeat steps 3, 4, and 5 as necessary.
7. Import and validate the finished loadsheet:
>>> import loadsheet '../loadsheet/Loadsheet_ALC_Final.xlsx'
>>> validate
Validation will fail for common errors:
- duplicate `standardFieldName` and `fullAssetPath` combinations
- an invalid `standardFieldName` (i.e., not defined in the referenced ontology, or misspelled)
- missing BACnet info (e.g., missing `objectId`)
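The duplicate check, for example, amounts to looking for repeated (standardFieldName, fullAssetPath) pairs, which you can reproduce with pandas before running `validate`:

```python
import pandas as pd

df = pd.read_excel("../loadsheet/Loadsheet_ALC_Final.xlsx")
# Each (standardFieldName, fullAssetPath) pair must be unique.
dupes = df[df.duplicated(subset=["standardFieldName", "fullAssetPath"], keep=False)]
if not dupes.empty:
    print(dupes[["fullAssetPath", "standardFieldName"]])
```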
8. When no validation errors are present, assets in the loadsheet can be matched to DBO entity types:
>>> match
9. Perform a review of type matches and assign each to a valid canonical type:
>>> review generalTypes
>>> review generalTypes VAV
>>> review generalTypes VAV 1
or
>>> review matches
10. Apply the matched types. Either review all matches made, using
>>> apply all
or auto-apply exact matches and review only the inexact ones, using
>>> apply close
11. Convert the normalized loadsheet to an ABEL spreadsheet:
>>> convert abel ./path/to/building/payload.csv
The following is a list of issues that need to be addressed before widespread use:
- Add rigorous typing to all methods
- Make the necessary fields in `handler.py` and `representations.py` private
- Increase the match success rate of the rules JSON (and potentially provide tooling or templates for users to create their own ruleset)
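For the first item, "rigorous typing" means annotations along these lines (a hypothetical function, not code from the repo):

```python
def map_field(raw_name: str, rules: dict[str, str]) -> str | None:
    """Hypothetical signature showing the intended level of annotation."""
    ...
```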