Skip to content

Latest commit

 

History

History
38 lines (33 loc) · 1.69 KB

to-jsonlines-pr.md

File metadata and controls

38 lines (33 loc) · 1.69 KB

to-jsonlines-pr

  • domain(s): pairs
  • accepts: ldc.api.supervised.pairs.PairData

Writes prompt/output pairs in JsonLines-like JSON format.

usage: to-jsonlines-pr [-h] [-l {DEBUG,INFO,WARNING,ERROR,CRITICAL}]
                       [-N LOGGER_NAME] -o OUTPUT [--att_instruction ATT]
                       [--att_input ATT] [--att_output ATT] [--att_id ATT]
                       [-d NUM] [-b SIZE]

Writes prompt/output pairs in JsonLines-like JSON format.

optional arguments:
  -h, --help            show this help message and exit
  -l {DEBUG,INFO,WARNING,ERROR,CRITICAL}, --logging_level {DEBUG,INFO,WARNING,ERROR,CRITICAL}
                        The logging level to use. (default: WARN)
  -N LOGGER_NAME, --logger_name LOGGER_NAME
                        The custom name to use for the logger, uses the plugin
                        name by default (default: None)
  -o OUTPUT, --output OUTPUT
                        Path of the JsonLines file to write (directory when
                        processing multiple files) (default: None)
  --att_instruction ATT
                        The attribute for the instructions (default: None)
  --att_input ATT       The attribute for the inputs (default: None)
  --att_output ATT      The attribute for the outputs (default: None)
  --att_id ATT          The name of the attribute for the row IDs (uses 'id'
                        from meta-data) (default: None)
  -d NUM, --num_digits NUM
                        The number of digits to use for the filenames
                        (default: 6)
  -b SIZE, --buffer_size SIZE
                        The size of the record buffer when concatenating (to
                        improve I/O throughput) (default: 1000)