Skip to content

Latest commit

 

History

History
43 lines (30 loc) · 1.33 KB

output.md

File metadata and controls

43 lines (30 loc) · 1.33 KB

Output Config

You can configure the connector to output to files or API.

Output to Local Files

File-based output is the preferred way as it enables decoupling between the connector and ingestion pipeline. By default, the connector will write to the directory ${pwd}/${CURRENT_TIMESTAMP}.

To write the extracted data to a specific location, add the following fragment to your config file:

output:
  file:
    # Location of the output directory, e.g. /tmp/metaphor
    directory: <output_directory>

    # (Optional) Maximum number of messages in each output file split. Default to 200.
    batch_size_count: <messages_per_file>

    # (Optional) Maximum size for each output file split. Default to 100 MB.
    batch_size_size: <size_in_bytes>

    # (Optional) IAM role to assume. Default using the current AWS credential.
    assume_role_arn: <iam_role_arn>

    # (Optional) Maximum number of query logs to store in one batch file. Default to 100.
    query_log_batch_size_count: <query_logs_per_file>

Output to S3

To write the output to a S3 bucket, you must also add the AWS region & credentials to the config:

output:
  file:
    directory: s3://<bucket>/<path>

    s3_auth_config:
      aws_access_key_id: <AWS_ACCESS_KEY_ID>
      aws_secret_access_key: <AWS_SECRET_ACCESS_KEY>
      region_name: <AWS_REGION>