Output Config

You can configure the connector to output to files or API.

Output to Local Files

File-based output is the preferred way as it enables decoupling between the connector and ingestion pipeline. By default, the connector will write to the directory ${pwd}/${CURRENT_TIMESTAMP}.

To write the extracted data to a specific location, add the following fragment to your config file:

output:
  file:
    # Location of the output directory, e.g. /tmp/metaphor
    directory: <output_directory>

    # (Optional) Maximum number of messages in each output file split. Default to 200.
    batch_size_count: <messages_per_file>

    # (Optional) Maximum size for each output file split. Default to 100 MB.
    batch_size_size: <size_in_bytes>

    # (Optional) IAM role to assume. Default using the current AWS credential.
    assume_role_arn: <iam_role_arn>

    # (Optional) Maximum number of query logs to store in one batch file. Default to 100.
    query_log_batch_size_count: <query_logs_per_file>

Output to S3

To write the output to a S3 bucket, you must also add the AWS region & credentials to the config:

output:
  file:
    directory: s3://<bucket>/<path>

    s3_auth_config:
      aws_access_key_id: <AWS_ACCESS_KEY_ID>
      aws_secret_access_key: <AWS_SECRET_ACCESS_KEY>
      region_name: <AWS_REGION>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

output.md

output.md

Output Config

Output to Local Files

Output to S3

Files

output.md

Latest commit

History

output.md

File metadata and controls

Output Config

Output to Local Files

Output to S3