Commit 848a411

update
1 parent 5fc7ba4 commit 848a411

2 files changed: +36 -1 lines changed

README.md

Lines changed: 35 additions & 0 deletions
@@ -38,3 +38,38 @@ chmod +x polishTE
### Singularity

```
singularity pull library://tommasobarberis98/tealb/polishte
```

### Usage

#### Basic usage

```
polishTE -i seq.fasta -g ref.fasta [OPTIONS]
```

#### With singularity

```
singularity exec polishTE.sif polishTE -i seq.fasta -g ref.fasta
```
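As an illustration of how `[OPTIONS]` might be filled in, the sketch below assembles a call that combines several of the flags listed in the Options table and prints it for review instead of running it. The file names, the output directory `polishTE_out` and the thread count are placeholders chosen for this example, not values taken from the project.

```shell
# Placeholder inputs (assumptions for this sketch); replace them with your own files.
input="seq.fasta"
genome="ref.fasta"
outdir="polishTE_out"

# Store the command in the positional parameters so each argument stays one word.
set -- polishTE -i "$input" -g "$genome" -o "$outdir" -t 4 -f 1000 -e 1e-20 -m fast

# Dry run: print the command for review instead of executing it.
echo "$*"
```

Once `polishTE` is on the `PATH`, replacing the final `echo "$*"` with `"$@"` would run the assembled command.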

### Options

| Parameter | Description | Default |
| --------- | ----------- | ------- |
| **`-i, --input`** | FASTA file with the sequence to polish | None |
| **`-g, --genome`** | FASTA file with the reference | None |
| `-h, --help` | Show the help message | None |
| `-t, --threads` | Number of threads to use in the multiple alignment with `MAFFT` | 1 |
| `-o, --output` | Output directory | ./ |
| `-min, --min_length` | Minimum length of the `blastn` hits that will be recovered in the genome. If the option is set to 0, the program uses half the length of the TE as the minimum length; otherwise it uses the provided value | 0 |
| `-f, --flank` | Number of bases to extract at the flanking regions (5' and 3') of the hits | 1000 |
| `-e, --evalue` | E-value threshold for `blastn` | 1e-20 |
| `-l, --limit` | Maximum number of hits to take into account for the alignment. If the number of hits is less than this limit, subsampling is skipped. | 100 |
| `-k, --keep_longest` | Keep the \<k> % longest sequences among the `blastn` hits | 0.25 |
| `-ins, --max_ins_size` | Remove an insertion from the MSA if its size is less than \<ins> (range: [200-1000]) | 200 |
| `-m, --mode` | Speed mode: fast and less accurate (max 10 iterations) or slow and more accurate (max 100 iterations). When the maximum number of iterations is reached, the program is interrupted and it returns any sequence | fast |
| `-c, --min_cov` | Minimum coverage on boundaries to perform the extension | 3 |
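To make the `-min 0` default concrete, here is a minimal sketch of how "half the length of the TE" could be computed from a FASTA file. It is not taken from the polishTE source: the toy sequence is made up, and the use of integer division for rounding is an assumption.

```shell
# Toy TE sequence written to a temporary FASTA file (made-up data).
tmp=$(mktemp)
printf '>TE_example\nACGTACGTAC\nGTACGTACGT\n' > "$tmp"

# Sum the length of every non-header line to get the sequence length.
te_len=$(awk '!/^>/ { n += length($0) } END { print n + 0 }' "$tmp")

# Half of the TE length; integer division is an assumption about the rounding.
min_len=$((te_len / 2))

echo "TE length: $te_len, minimum hit length: $min_len"
rm -f "$tmp"
```

For this 20-base toy sequence the sketch reports a minimum hit length of 10. Passing an explicit value with `-min` replaces this default, as described in the table.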

polishTE

Lines changed: 1 addition & 1 deletion
@@ -58,7 +58,7 @@ function usage()
  will use the half of the length of the TE as minimum length, else
  it will use the provided value
  -f, --flank Number of bases to extract at the flanking regions of the hits
- (default: 100)
+ (default: 1000)
  -e, --evalue e-value for blastn (default: 1e-20)
  -l, --limit Maximum number of hits to take in account for the alignement
  (default: 100). If the number of hits is less then this limit, the
