Skip to content

Commit

Permalink
usage updated
Browse files Browse the repository at this point in the history
  • Loading branch information
mikolmogorov committed Mar 3, 2020
1 parent 650befc commit 322541b
Show file tree
Hide file tree
Showing 3 changed files with 27 additions and 22 deletions.
32 changes: 18 additions & 14 deletions docs/USAGE.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,22 +17,25 @@ Table of Contents
## <a name="quickusage"></a> Quick usage

```
usage: flye (--pacbio-raw | --pacbio-corr | --nano-raw |
usage: flye (--pacbio-raw | --pacbio-corr | --pacbio-hifi | --nano-raw |
--nano-corr | --subassemblies) file1 [file_2 ...]
--genome-size SIZE --out-dir PATH
[--threads int] [--iterations int] [--min-overlap int]
[--meta] [--plasmids] [--trestle] [--polish-target]
[--debug] [--version] [--help] [--resume]
[--resume-from] [--stop-after]
[--keep-haplotypes] [--debug] [--version] [--help]
[--resume] [--resume-from] [--stop-after]
Assembly of long and error-prone reads
Assembly of long reads with repeat graphs
optional arguments:
-h, --help show this help message and exit
--pacbio-raw path [path ...]
PacBio raw reads
--pacbio-corr path [path ...]
PacBio corrected reads
--pacbio-hifi path [path ...]
PacBio HiFi reads
--nano-raw path [path ...]
ONT raw reads
--nano-corr path [path ...]
Expand All @@ -53,6 +56,7 @@ optional arguments:
set]
--plasmids rescue short unassembled plasmids
--meta metagenome / uneven coverage mode
--keep-haplotypes do not collapse alternative haplotypes
--trestle enable Trestle [disabled]
--polish-target path run polisher on the target sequence
--resume resume from the last completed stage
Expand All @@ -62,17 +66,17 @@ optional arguments:
stop after the specified stage completed
--debug enable debug output
-v, --version show program's version number and exit
```

Input reads can be in FASTA or FASTQ format, uncompressed
or compressed with `gz`. Currently, raw and corrected reads
from PacBio and ONT are supported. Expected error rates are
<30% for raw and <2% for corrected reads. Additionally, the
`--subassemblies` option performs a consensus assembly of multiple
or compressed with gz. Currently, PacBio (raw, corrected, HiFi)
and ONT reads (raw, corrected) are supported. Expected error rates are
<30% for raw, <3% for corrected, and <1% for HiFi. Note that Flye
was primarily developed to run on raw reads. Additionally, the
--subassemblies option performs a consensus assembly of multiple
sets of high-quality contigs. You may specify multiple
files with reads (separated by spaces). Mixing different read
types is not yet supported. The `--meta` option enables the mode
types is not yet supported. The --meta option enables the mode
for metagenome/uneven coverage assembly.

You must provide an estimate of the genome size as input,
Expand All @@ -83,11 +87,11 @@ should be provided.

To reduce memory consumption for large genome assemblies,
you can use a subset of the longest reads for initial disjointig
assembly by specifying `--asm-coverage` option. Typically,
30x coverage is enough to produce good disjointigs.
assembly by specifying --asm-coverage option. Typically,
40x coverage is enough to produce good disjointigs.

You can separately run Flye polisher on a target sequence
using `--polish-target` option.
You can run Flye polisher as a standalone tool using
--polish-target option.


## <a name="examples"></a> Examples
Expand Down
2 changes: 1 addition & 1 deletion flye/__build__.py
Original file line number Diff line number Diff line change
@@ -1 +1 @@
__build__ = 1581
__build__ = 1582
15 changes: 8 additions & 7 deletions flye/main.py
Original file line number Diff line number Diff line change
Expand Up @@ -608,9 +608,10 @@ def _usage():

def _epilog():
return ("Input reads can be in FASTA or FASTQ format, uncompressed\n"
"or compressed with gz. Currently, raw and corrected reads\n"
"from PacBio and ONT are supported. Expected error rates are\n"
"<30% for raw and <2% for corrected reads. Additionally, the\n"
"or compressed with gz. Currently, PacBio (raw, corrected, HiFi)\n"
"and ONT reads (raw, corrected) are supported. Expected error rates are\n"
"<30% for raw, <3% for corrected, and <1% for HiFi. Note that Flye\n"
"was primarily developed to run on raw reads. Additionally, the\n"
"--subassemblies option performs a consensus assembly of multiple\n"
"sets of high-quality contigs. You may specify multiple\n"
"files with reads (separated by spaces). Mixing different read\n"
Expand All @@ -624,9 +625,9 @@ def _epilog():
"To reduce memory consumption for large genome assemblies,\n"
"you can use a subset of the longest reads for initial disjointig\n"
"assembly by specifying --asm-coverage option. Typically,\n"
"30x coverage is enough to produce good disjointigs.\n\n"
"You can separately run Flye polisher on a target sequence \n"
"using --polish-target option.")
"40x coverage is enough to produce good disjointigs.\n\n"
"You can run Flye polisher as a standalone tool using\n"
"--polish-target option.")


def _version():
Expand Down Expand Up @@ -655,7 +656,7 @@ def check_int_range(value, min_val, max_val, require_odd=False):
return ival

parser = argparse.ArgumentParser \
(description="Assembly of long and error-prone reads",
(description="Assembly of long reads with repeat graphs",
formatter_class=argparse.RawDescriptionHelpFormatter,
usage=_usage(), epilog=_epilog())

Expand Down

0 comments on commit 322541b

Please sign in to comment.