You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
At the end of the prepare pipeline, a couple of post-processing steps are performed on the merged TSV before we feed it to the classify pipeline. All of the scripts used in these steps support reading from stdin and writing to stdout except for fillna.bash
remove the first parameter from fillna.bash and make it read the TSV from stdin, instead
connect all of the post-processing steps together via pipes
this will allow us to save on file IO and wasted time compressing and uncompressing the file between steps
remove extra config params that nobody uses (like keepna, pure_numerics, and friends) - they just make things more complicated
mark extra files as temp
The text was updated successfully, but these errors were encountered:
At the end of the
prepare
pipeline, a couple of post-processing steps are performed on the merged TSV before we feed it to theclassify
pipeline. All of the scripts used in these steps support reading fromstdin
and writing tostdout
except forfillna.bash
fillna.bash
and make it read the TSV fromstdin
, insteadkeepna
,pure_numerics
, and friends) - they just make things more complicatedThe text was updated successfully, but these errors were encountered: