Skip to content

Commit 15ae84e

Browse files
committed
2.3.5 + docs update
1 parent 95cf0eb commit 15ae84e

File tree

6 files changed

+46
-21
lines changed

6 files changed

+46
-21
lines changed

README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,9 @@
11
Flye assembler (successor of ABruijn)
22
=====================================
33

4-
### Version: 2.3.5b
4+
[![BioConda Install](https://img.shields.io/conda/dn/bioconda/flye.svg?style=flag&label=BioConda%20install)](https://anaconda.org/bioconda/flye)
5+
6+
### Version: 2.3.5
57

68
Flye is a de novo assembler for long and noisy reads, such as
79
those produced by PacBio and Oxford Nanopore Technologies.

docs/INSTALL.md

Lines changed: 14 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,25 +6,35 @@ Availability
66

77
Flye is available for Linux and MacOS platforms.
88

9+
Bioconda Releases
10+
-----------------
11+
12+
You can get the latest stable release through Bioconda:
13+
14+
conda install flye
15+
16+
Alternatively, you can get a release verson from the github "releases" page
17+
18+
919
Requirements
1020
------------
1121

1222
* C++ compiler with C++11 support (GCC 4.8+ / Clang 3.3+ / Apple Clang 5.0+)
1323
* GNU make
1424
* Python 2.7
1525
* Git
16-
* Basic OS development headers (zlib, etc.)
26+
* Core OS development headers (zlib, etc)
27+
1728

18-
Get the latest version (recommended)
19-
------------------------------------
29+
Get the latest source version
30+
-----------------------------
2031

2132
To get and compile the latest git version, run:
2233

2334
git clone https://github.com/fenderglass/Flye
2435
cd Flye
2536
python setup.py build
2637

27-
Alternatively, you can get a release verson from the "releases" page.
2838

2939
After building, Flye could be invoked with the following command:
3040

docs/NEWS.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,11 @@
1+
Flye 2.3.5 (7 August 2018)
2+
==========================
3+
* New solid kmer alignment implementation with improved specificity
4+
* Better corrected reads support
5+
* Minimum overlap is now selected within a wider range for better support of datasets with shorter read length
6+
* Assembly of large (human size) genomes is now faster
7+
* Various bugfixes and stability improvements
8+
19
Flye 2.3.4 (19 May 2018)
210
========================
311
* A fix for assemblies with low reads count

docs/USAGE.md

Lines changed: 19 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -58,7 +58,7 @@ from PacBio and ONT are supported. The expected error rates are
5858
<30% for raw and <2% for corrected reads. Additionally,
5959
```--subassemblies``` option performs a consensus assembly of multiple
6060
sets of high-quality contigs. You may specify multiple
61-
fles with reads (separated by spaces). Mixing different read
61+
files with reads (separated by spaces). Mixing different read
6262
types is not yet supported.
6363

6464
You must provide an estimate of the genome size as input,
@@ -117,8 +117,8 @@ ONT data than with PacBio data, especially in homopolymer regions.
117117

118118
### Error-corrected reads input
119119

120-
While Flye was designed for assembly of raw reads (and this is the recommended option),
121-
it also supports error-corrected PacBio/ONT reads as input (use the correpsonding option).
120+
While Flye was designed for assembly of raw reads (and this is the recommended way),
121+
it also supports error-corrected PacBio/ONT reads as input (use the ```corr``` option).
122122
The parameters are optimized for error rates <2%. If you are getting highly
123123
fragmented assembly - most likely error rates in your reads are higher. In this case,
124124
consider to assemble using the raw reads instead.
@@ -171,10 +171,15 @@ errors (due to improvements on how reads may align to the corrected assembly;
171171
especially for ONT datasets). If the parameter is set to 0, the polishing will
172172
not be performed.
173173

174-
### Resuming existing jobs
174+
### Starting from a particular assembly stage
175175

176-
Use --resume to resume a previous run of the assembler that may have terminated
177-
prematurely. The assembly will continue from the last previously completed step.
176+
Use ```--resume``` to resume a previous run of the assembler that may have terminated
177+
prematurely (using the same output directory).
178+
The assembly will continue from the last previously completed step.
179+
180+
You might also resume from a particular stage with ```--resume-from stage_name```,
181+
where ```stage_name``` is a choice of ```assembly, consensus, repeat, polishing```.
182+
For example, you might supply different sets of reads for different stages.
178183

179184
## <a name="graph"></a> Assembly graph
180185

@@ -256,16 +261,16 @@ for more detailed information. The assembly pipeline is organized as follows:
256261

257262
* Kmer counting / erroneous kmer pre-filtering
258263
* Solid kmer selection (kmers with sufficient frequency, which are unlikely to be erroneous)
259-
* Finding read overlaps based on the A-Bruijn graph
260-
* Detection of chimeric sequences
261-
* Contig assembly by read extension
264+
* Contig extension. The algorithm starts from a single read and extends it
265+
with a next overlapping read (overlaps are dynamically detected using the selected
266+
solid k-mers).
262267

263-
The resulting contig assembly is now simply a concatenation of read parts
264-
and is error-prone. Flye then aligns the reads on the draft contigs using minimap2 and
265-
calls a rough consensus. Afterwards, the algorithm performs additional repeat analysis
266-
as follows:
268+
Note that we do not attempt to resolve repeats at this stage, thus
269+
the reconstructed contigs might contain misassemblies.
270+
Flye then aligns the reads on these draft contigs using minimap2 and
271+
calls a consensus. Afterwards, Flye performs repeat analysis as follows:
267272

268-
* Repeat graph is reconstructed from the assembled sequence
273+
* Repeat graph is constructed from the (possibly misassembled) contigs
269274
* In this graph all repeats longer than minimum overlap are collapsed
270275
* The algorithm resolves repeats using the read information and graph structure
271276
* The unbranching paths in the graph are output as contigs

flye/__version__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = "2.3.5b"
1+
__version__ = "2.3.5"

flye/main.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -386,7 +386,7 @@ def _epilog():
386386
"<30% for raw and <2% for corrected reads. Additionally,\n"
387387
"--subassemblies option performs a consensus assembly of multiple\n"
388388
"sets of high-quality contigs. You may specify multiple\n"
389-
"fles with reads (separated by spaces). Mixing different read\n"
389+
"files with reads (separated by spaces). Mixing different read\n"
390390
"types is not yet supported.\n\n"
391391
"You must provide an estimate of the genome size as input,\n"
392392
"which is used for solid k-mers selection. The estimate could\n"

0 commit comments

Comments
 (0)