Skip to content

Register Transformer SK ingests records from a Kinesis stream (published by register_ingester_sk) and transforms them into BODS v0.2 records

Notifications You must be signed in to change notification settings

openownership/register-transformer-sk

Repository files navigation

Register Transformer SK

Register Transformer SK is a data transformer for the OpenOwnership Register project. It processes bulk data published to AWS S3, such as emitted from AWS Kinesis Data Firehose, converts them into the Beneficial Ownership Data Standard (BODS) format, and stores records in Elasticsearch. Optionally, it can also use AWS Kinesis for processing streamed data (rather than bulk data published to AWS S3), or for publishing newly-transformed records to a different stream.

The transformation schema is BODS 0.2.

Installation

Install and boot Register.

Configure your environment using the example file:

cp .env.example .env

Create the Elasticsearch indexes:

docker compose run transformer-sk create-indexes

Testing

Run the tests:

docker compose run transformer-sk test

Usage

To transform the bulk data from a prefix in AWS S3:

docker compose run transformer-sk transform-bulk raw_data/source=SK/year=2023/month=10/

About

Register Transformer SK ingests records from a Kinesis stream (published by register_ingester_sk) and transforms them into BODS v0.2 records

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published