Skip to content

seqtools

xieguigang edited this page Oct 18, 2016 · 1 revision

GCModeller [version 3.0.2456.4506]

Sequence operation utilities

Sequence search tools and sequence operation tools Sequence search tools and sequence operation tools Copyright © xie.guigang@gcmodeller.org 2014

Module AssemblyName: file:///D:/GCModeller/GCModeller/bin/seqtools.exe Root namespace: seqtools.Utilities


If you are having trouble debugging this Error, first read the best practices tutorial for helpful tips that address many common problems:

https://github.com/smrucc/gcmodeller/src/analysis/

The debugging facility Is helpful To figure out what's happening under the hood:

http://gcmodeller.org

If you're still stumped, you can try get help from author directly from E-mail:

xie.guigang@gcmodeller.org

All of the command that available in this program has been list below:

Generic function API list
Function API Info
/align
/check.attrs
/Clustal.Cut
/gwANI
/Loci.describ Testing
/logo * Drawing the sequence logo from the clustal alignment result.
/Mirrors.Context This function will convert the mirror data to the simple segment object data
/Mirrors.Context.Batch This function will convert the mirror data to the simple segment object data
/Mirrors.Group
/Mirrors.Group.Batch
/nw RunNeedlemanWunsch
/Palindrome.BatchTask
/Palindrome.Workflow
/Sigma
/SimpleSegment.AutoBuild
/SimpleSegment.Mirrors
/SimpleSegment.Mirrors.Batch
/SNP
/Time.Diffs
/Write.Seeds
-321 Polypeptide sequence 3 letters to 1 lettes sequence.
--align
--align.Self
-complement
--Drawing.ClustalW
-pattern_search Parsing the sequence segment from the sequence source using regular expression.
Repeats.Density
-reverse
rev-Repeats.Density
Search.Batch Batch search for repeats.
--translates Translates the ORF gene as protein sequence. If any error was output from the console, please using > operator dump the output to a log file for the analysis.
1. Fasta Sequence Tools

Tools command that works around the fasta format data.

Function API Info
/Compare.By.Locis
/Distinct Distinct fasta sequence by sequence content.
/Get.Locis
/Gff.Sites
/Merge Only search for 1 level folder, dit not search receve.
/Merge.Simple This tools just merge the fasta sequence into one larger file.
/Select.By_Locus
/Split
/subset
/To_Fasta Convert the sequence data in a excel annotation file into a fasta sequence file.
-segment
--segments
--Trim
2. Sequence Palindrome Features Analysis

Tools command that using for finding Palindrome sites.

Function API Info
/Mirror.Batch
/Mirror.Fuzzy
/Mirror.Fuzzy.Batch
/Mirror.Vector
/Mirrors.Nt.Trim
/Palindrome.Screen.MaxMatches
/Palindrome.Screen.MaxMatches.Batch
--Hairpinks
--Hairpinks.batch.task
--ImperfectsPalindrome.batch.Task
--Mirror.From.Fasta Mirror Palindrome, search from a fasta file.
--Mirror.From.NT Mirror Palindrome, and this function is for the debugging test
--Palindrome.batch.Task
--Palindrome.From.FASTA
--Palindrome.From.NT This function is just for debugger test, /nt parameter is the nucleotide sequence data as ATGCCCC
--Palindrome.Imperfects
--PerfectPalindrome.Filtering
--ToVector

CLI API list


1. /align

Prototype: seqtools.Utilities::Int32 Align2(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /align /query <query.fasta> /subject <subject.fasta> [/blosum <matrix.txt> /out <out.xml>]
Example
seqtools

2. /check.attrs

Prototype: seqtools.Utilities::Int32 CheckHeaders(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /check.attrs /in <in.fasta> /n <attrs.count> [/all]
Example
seqtools

3. /Clustal.Cut

Prototype: seqtools.Utilities::Int32 CutMlAlignment(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Clustal.Cut /in <in.fasta> [/left 0.1 /right 0.1 /out <out.fasta>]
Example
seqtools

4. /Compare.By.Locis

Prototype: seqtools.Utilities::Int32 CompareFile(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Compare.By.Locis /file1 <file1.fasta> /file2 </file2.fasta>
Example
seqtools

5. /Distinct

Distinct fasta sequence by sequence content. Prototype: seqtools.Utilities::Int32 Distinct(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Distinct /in <in.fasta> [/out <out.fasta> /by_Uid <uid_regexp>]
Example
seqtools

6. /Get.Locis

Prototype: seqtools.Utilities::Int32 GetSimpleSegments(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Get.Locis /in <locis.csv> /nt <genome.nt.fasta> [/out <outDIR>]
Example
seqtools

7. /Gff.Sites

Prototype: seqtools.Utilities::Int32 GffSites(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Gff.Sites /fna <genomic.fna> /gff <genome.gff> [/out <out.fasta>]
Example
seqtools

8. /gwANI

Prototype: seqtools.Utilities::Int32 gwANI(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /gwANI /in <in.fasta> [/fast /out <out.Csv>]
Example
seqtools

9. /Loci.describ

Testing Prototype: seqtools.Utilities::Int32 LociDescript(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Loci.describ /ptt <genome-context.ptt> [/test <loci:randomize> /complement /unstrand]
Example
seqtools
  • Drawing the sequence logo from the clustal alignment result. Prototype: seqtools.Utilities::Int32 SequenceLogo(Microsoft.VisualBasic.CommandLine.CommandLine)
Usage
seqtools /logo /in <clustal.fasta> [/out <out.png> /title ""]
Example
seqtools

Parameters information:

/in

The file path of the clustal output fasta file.

Example
[/out]

The output sequence logo image file path. default is the same name as the input fasta sequence file.

Example
[/title]

The display title on the sequence logo, default is using the fasta file name.

Example
Accepted Types
/in

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaFile Example:

[
    
]
/out
/title

11. /Merge

Only search for 1 level folder, dit not search receve. Prototype: seqtools.Utilities::Int32 Merge(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Merge /in <fasta.DIR> [/out <out.fasta> /trim /unique /ext <*.fasta> /brief]
Example
seqtools

12. /Merge.Simple

This tools just merge the fasta sequence into one larger file. Prototype: seqtools.Utilities::Int32 SimpleMerge(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Merge.Simple /in <DIR> [/exts <default:*.fasta,*.fa> /line.break 120 /out <out.fasta>]
Example
seqtools

13. /Mirror.Batch

Prototype: seqtools.Utilities::Int32 MirrorBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirror.Batch /nt <nt.fasta> [/out <out.csv> /mp /min <3> /max <20> /num_threads <-1>]
Example
seqtools

Parameters information:

/nt
Example
[/mp]

Calculation in the multiple process mode?

Example
[/out]
Example
Accepted Types
/nt

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaFile Example:

[
    
]
/mp

Decalre: System.Boolean Example:

true
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

14. /Mirror.Fuzzy

Prototype: seqtools.Utilities::Int32 FuzzyMirrors(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirror.Fuzzy /in <in.fasta> [/out <out.csv> /cut 0.6 /max-dist 6 /min 3 /max 20]
Example
seqtools

Parameters information:

/in
Example
[/out]
Example
Accepted Types
/in

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaToken Example:

{
    "Extension": {
        "DynamicHash": {
            "Properties": {
                
            },
            "source": [
                
            ]
        }
    },
    "SequenceData": "System.String",
    "Attributes": [
        "System.String"
    ]
}
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

15. /Mirror.Fuzzy.Batch

Prototype: seqtools.Utilities::Int32 FuzzyMirrorsBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirror.Fuzzy.Batch /in <in.fasta/DIR> [/out <out.DIR> /cut 0.6 /max-dist 6 /min 3 /max 20 /num_threads <-1>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

16. /Mirror.Vector

Prototype: seqtools.Utilities::Int32 MirrorsVector(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirror.Vector /in <inDIR> /size <genome.size> [/out out.txt]
Example
seqtools

17. /Mirrors.Context

This function will convert the mirror data to the simple segment object data Prototype: seqtools.Utilities::Int32 MirrorContext(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirrors.Context /in <mirrors.csv> /PTT <genome.ptt> [/trans /strand <+/-> /out <out.csv> /stranded /dist <500bp>]
Example
seqtools

Parameters information:

[/trans]

Enable this option will using genome_size minus loci location for the location correction, only works in reversed strand.

Example
Accepted Types
/trans

18. /Mirrors.Context.Batch

This function will convert the mirror data to the simple segment object data Prototype: seqtools.Utilities::Int32 MirrorContextBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirrors.Context.Batch /in <mirrors.csv.DIR> /PTT <genome.ptt.DIR> [/trans /strand <+/-> /out <out.csv> /stranded /dist <500bp> /num_threads -1]
Example
seqtools

Parameters information:

[/trans]

Enable this option will using genome_size minus loci location for the location correction, only works in reversed strand.

Example
Accepted Types
/trans

19. /Mirrors.Group

Prototype: seqtools.Utilities::Int32 MirrorGroups(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirrors.Group /in <mirrors.Csv> [/batch /fuzzy <-1> /out <out.DIR>]
Example
seqtools

Parameters information:

[/fuzzy]

-1 means group sequence by string equals compared, and value of 0-1 means using string fuzzy compare.

Example
Accepted Types
/fuzzy

20. /Mirrors.Group.Batch

Prototype: seqtools.Utilities::Int32 MirrorGroupsBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirrors.Group.Batch /in <mirrors.DIR> [/fuzzy <-1> /out <out.DIR> /num_threads <-1>]
Example
seqtools

21. /Mirrors.Nt.Trim

Prototype: seqtools.Utilities::Int32 TrimNtMirrors(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Mirrors.Nt.Trim /in <mirrors.Csv> [/out <out.Csv>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

22. /nw

RunNeedlemanWunsch Prototype: seqtools.Utilities::Int32 NW(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /nw /query <query.fasta> /subject <subject.fasta> [/out <out.txt>]
Example
seqtools

Parameters information:

/query
Example
/subject
Example
[/out]
Example
Accepted Types
/query

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaToken Example:

{
    "Extension": {
        "DynamicHash": {
            "Properties": {
                
            },
            "source": [
                
            ]
        }
    },
    "SequenceData": "System.String",
    "Attributes": [
        "System.String"
    ]
}
/subject

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaToken Example:

{
    "Extension": {
        "DynamicHash": {
            "Properties": {
                
            },
            "source": [
                
            ]
        }
    },
    "SequenceData": "System.String",
    "Attributes": [
        "System.String"
    ]
}
/out

Decalre: System.String Example:

"System.String"

23. /Palindrome.BatchTask

Prototype: seqtools.Utilities::Int32 PalindromeBatchTask(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Palindrome.BatchTask /in <in.DIR> [/num_threads 4 /min 3 /max 20 /min-appears 2 /cutoff <0.6> /Palindrome /max-dist <1000 (bp)> /partitions <-1> /out <out.DIR>]
Example
seqtools

Parameters information:

[/Palindrome]

Only search for Palindrome, not includes the repeats data.

Example
Accepted Types
/Palindrome

24. /Palindrome.Screen.MaxMatches

Prototype: seqtools.Utilities::Int32 FilteringMatches(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Palindrome.Screen.MaxMatches /in <in.csv> /min <min.max-matches> [/out <out.csv>]
Example
seqtools

25. /Palindrome.Screen.MaxMatches.Batch

Prototype: seqtools.Utilities::Int32 FilteringMatchesBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Palindrome.Screen.MaxMatches.Batch /in <inDIR> /min <min.max-matches> [/out <out.DIR> /num_threads <-1>]
Example
seqtools

26. /Palindrome.Workflow

Prototype: seqtools.Utilities::Int32 PalindromeWorkflow(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Palindrome.Workflow /in <in.fasta> [/batch /min-appears 2 /min 3 /max 20 /cutoff <0.6> /max-dist <1000 (bp)> /Palindrome /partitions <-1> /out <out.DIR>]
Example
seqtools

Parameters information:

/in

This is a single sequence fasta file.

Example
[/Palindrome]

Only search for Palindrome, not includes the repeats data.

Example
Accepted Types
/in
/Palindrome

27. /Select.By_Locus

Prototype: seqtools.Utilities::Int32 SelectByLocus(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Select.By_Locus /in <locus.txt> /fa <fasta/.inDIR> [/out <out.fasta>]
Example
seqtools

28. /Sigma

Prototype: seqtools.Utilities::Int32 Sigma(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Sigma /in <in.fasta> [/out <out.Csv> /simple /round <-1>]
Example
seqtools

29. /SimpleSegment.AutoBuild

Prototype: seqtools.Utilities::Int32 ConvertsAuto(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /SimpleSegment.AutoBuild /in <locis.csv> [/out <out.csv>]
Example
seqtools

Parameters information:

/in
Example
[/out]
Example
Accepted Types
/in

Decalre: Microsoft.VisualBasic.Data.csv.DocumentStream.File Example:

[
    
]
/out

Decalre: SMRUCC.genomics.SequenceModel.NucleotideModels.SimpleSegment Example:

{
    "Complement": "System.String",
    "Ends": 0,
    "ID": "System.String",
    "SequenceData": "System.String",
    "Start": 0,
    "Strand": "System.String"
}

30. /SimpleSegment.Mirrors

Prototype: seqtools.Utilities::Int32 ConvertMirrors(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /SimpleSegment.Mirrors /in <in.csv> [/out <out.csv>]
Example
seqtools

Parameters information:

/in
Example
[/out]
Example
Accepted Types
/in

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}
/out

Decalre: SMRUCC.genomics.SequenceModel.NucleotideModels.SimpleSegment Example:

{
    "Complement": "System.String",
    "Ends": 0,
    "ID": "System.String",
    "SequenceData": "System.String",
    "Start": 0,
    "Strand": "System.String"
}

31. /SimpleSegment.Mirrors.Batch

Prototype: seqtools.Utilities::Int32 ConvertMirrorsBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /SimpleSegment.Mirrors.Batch /in <in.DIR> [/out <out.DIR>]
Example
seqtools

32. /SNP

Prototype: seqtools.Utilities::Int32 SNP(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /SNP /in <nt.fasta> [/ref 0 /pure /monomorphic]
Example
seqtools

Parameters information:

/in
Example
[/ref]
Example
[/pure]
Example
Accepted Types
/in

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaFile Example:

[
    
]
/ref

Decalre: System.Int32 Example:

0
/pure

Decalre: System.Boolean Example:

true

33. /Split

Prototype: seqtools.Utilities::Int32 Split(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Split /in <in.fasta> [/n <4096> /out <outDIR>]
Example
seqtools

34. /subset

Prototype: seqtools.Utilities::Int32 SubSet(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /subset /lstID <lstID.txt> /fa <source.fasta>
Example
seqtools

35. /Time.Diffs

Prototype: seqtools.Utilities::Int32 TimeDiffs(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Time.Diffs /in <aln.fasta> [/out <out.csv>]
Example
seqtools

36. /To_Fasta

Convert the sequence data in a excel annotation file into a fasta sequence file. Prototype: seqtools.Utilities::Int32 ToFasta(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /To_Fasta /in <anno.csv> [/out <out.fasta> /attrs <gene;locus_tag;gi;location,...> /seq <Sequence>]
Example
seqtools

37. /Write.Seeds

Prototype: seqtools.Utilities::Int32 WriteSeeds(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools /Write.Seeds /out <out.dat> [/prot /max <20>]
Example
seqtools

38. -321

Polypeptide sequence 3 letters to 1 lettes sequence. Prototype: seqtools.Utilities::Int32 PolypeptideBriefs(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools -321 /in <sequence.txt> [/out <out.fasta>]
Example
seqtools

39. --align

Prototype: seqtools.Utilities::Int32 Align(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --align /query <query.fasta> /subject <subject.fasta> [/out <out.DIR> /cost <0.7>]
Example
seqtools

40. --align.Self

Prototype: seqtools.Utilities::Int32 AlignSelf(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --align.Self /query <query.fasta> /out <out.DIR> [/cost 0.75]
Example
seqtools

41. -complement

Prototype: seqtools.Utilities::Int32 Complement(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools -complement -i <input_fasta> [-o <output_fasta>]
Example
seqtools

42. --Drawing.ClustalW

Prototype: seqtools.Utilities::Int32 DrawClustalW(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Drawing.ClustalW /in <align.fasta> [/out <out.png> /dot.Size 10]
Example
seqtools

43. --Hairpinks

Prototype: seqtools.Utilities::Int32 Hairpinks(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Hairpinks /in <in.fasta> [/out <out.csv> /min <6> /max <7> /cutoff 3 /max-dist <35 (bp)>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.ImperfectPalindrome Example:

{
    "Distance": 0,
    "Evolr": "System.String",
    "Left": 0,
    "Matches": "System.String",
    "MaxMatch": 0,
    "Palindrome": "System.String",
    "Paloci": 0,
    "Score": 0,
    "Site": "System.String"
}

44. --Hairpinks.batch.task

Prototype: seqtools.Utilities::Int32 HairpinksBatch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Hairpinks.batch.task /in <in.fasta> [/out <outDIR> /min <6> /max <7> /cutoff <0.6> /max-dist <35 (bp)> /num_threads <-1>]
Example
seqtools

45. --ImperfectsPalindrome.batch.Task

Prototype: seqtools.Utilities::Int32 BatchSearchImperfectsPalindrome(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --ImperfectsPalindrome.batch.Task /in <in.fasta> /out <outDir> [/min <3> /max <20> /cutoff <0.6> /max-dist <1000 (bp)> /num_threads <-1>]
Example
seqtools

46. --Mirror.From.Fasta

Mirror Palindrome, search from a fasta file. Prototype: seqtools.Utilities::Int32 SearchMirrotFasta(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Mirror.From.Fasta /nt <nt-sequence.fasta> [/out <out.csv> /min <3> /max <20>]
Example
seqtools

Parameters information:

/nt

This fasta file should contains only just one sequence.

Example
Accepted Types
/nt

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaToken Example:

{
    "Extension": {
        "DynamicHash": {
            "Properties": {
                
            },
            "source": [
                
            ]
        }
    },
    "SequenceData": "System.String",
    "Attributes": [
        "System.String"
    ]
}

47. --Mirror.From.NT

Mirror Palindrome, and this function is for the debugging test Prototype: seqtools.Utilities::Int32 SearchMirrotNT(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Mirror.From.NT /nt <nt-sequence> /out <out.csv> [/min <3> /max <20>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

48. --Palindrome.batch.Task

Prototype: seqtools.Utilities::Int32 BatchSearchPalindrome(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Palindrome.batch.Task /in <in.fasta> /out <outDir> [/min <3> /max <20> /num_threads <-1>]
Example
seqtools

49. --Palindrome.From.FASTA

Prototype: seqtools.Utilities::Int32 SearchPalindromeFasta(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Palindrome.From.Fasta /nt <nt-sequence.fasta> [/out <out.csv> /min <3> /max <20>]
Example
seqtools

Parameters information:

/nt

Fasta sequence file, and this file should just contains only one sequence.

Example
[/out]
Example
Accepted Types
/nt

Decalre: SMRUCC.genomics.SequenceModel.FASTA.FastaToken Example:

{
    "Extension": {
        "DynamicHash": {
            "Properties": {
                
            },
            "source": [
                
            ]
        }
    },
    "SequenceData": "System.String",
    "Attributes": [
        "System.String"
    ]
}
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

50. --Palindrome.From.NT

This function is just for debugger test, /nt parameter is the nucleotide sequence data as ATGCCCC Prototype: seqtools.Utilities::Int32 SearchPalindromeNT(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Palindrome.From.NT /nt <nt-sequence> /out <out.csv> [/min <3> /max <20>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.PalindromeLoci Example:

{
    "Loci": "System.String",
    "MirrorSite": "System.String",
    "PalEnd": 0,
    "Palindrome": "System.String",
    "Start": 0
}

51. --Palindrome.Imperfects

Prototype: seqtools.Utilities::Int32 ImperfectPalindrome(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Palindrome.Imperfects /in <in.fasta> [/out <out.csv> /min <3> /max <20> /cutoff <0.6> /max-dist <1000 (bp)> /partitions <-1>]
Example
seqtools

Parsing the sequence segment from the sequence source using regular expression. Prototype: seqtools.Utilities::Int32 PatternSearchA(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools -pattern_search -i <file_name> -p <regex_pattern>[ -o <output_directory> -f <format:fsa/gbk>]
Example
seqtools -pattern_search -i ~/xcc8004.txt -p TTA{3}N{1,2} -f fsa

Parameters information:

-i

The sequence input data source file, it can be a fasta or genbank file.

Example
~/Desktop/xcc8004.txt
-p

This switch specific the regular expression pattern for search the sequence segment, for more detail information about the regular expression please read the user manual.

Example
N{1,5}TA
[-o]

Optional, this switch value specific the output directory for the result data, default is user Desktop folder.

Example
~/Documents/
[-f]

Optional, specific the input file format for the sequence reader, default value is FASTA sequence file. fsa - The input sequence data file is a FASTA format file; gbk - The input sequence data file is a NCBI genbank flat file.

Example
fsa
Accepted Types
-i
-p
-o
-f

53. --PerfectPalindrome.Filtering

Prototype: seqtools.Utilities::Int32 FilterPerfectPalindrome(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --PerfectPalindrome.Filtering /in <inDIR> [/min <8> /out <outDIR>]
Example
seqtools

Parameters information:

[/out]
Example
Accepted Types
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.ImperfectPalindrome Example:

{
    "Distance": 0,
    "Evolr": "System.String",
    "Left": 0,
    "Matches": "System.String",
    "MaxMatch": 0,
    "Palindrome": "System.String",
    "Paloci": 0,
    "Score": 0,
    "Site": "System.String"
}

54. Repeats.Density

Prototype: seqtools.Utilities::Int32 RepeatsDensity(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools Repeats.Density /dir <dir> /size <size> /ref <refName> [/out <out.csv> /cutoff <default:=0>]
Example
seqtools

55. -reverse

Prototype: seqtools.Utilities::Int32 Reverse(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools -reverse -i <input_fasta> [-o <output_fasta>]
Example
seqtools

56. rev-Repeats.Density

Prototype: seqtools.Utilities::Int32 revRepeatsDensity(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools rev-Repeats.Density /dir <dir> /size <size> /ref <refName> [/out <out.csv> /cutoff <default:=0>]
Example
seqtools

57. Search.Batch

Batch search for repeats. Prototype: seqtools.Utilities::Int32 BatchSearch(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools Search.Batch /aln <alignment.fasta> [/min 3 /max 20 /min-rep 2 /out <./>]
Example
seqtools

Parameters information:

/aln

The input fasta file should be the output of the clustal multiple alignment fasta output.

Example
[/out]
Example
Accepted Types
/aln
/out

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.RepeatsView Example:

{
    "Left": 0,
    "Locis": [
        0
    ],
    "SequenceData": "System.String"
}

Decalre: SMRUCC.genomics.Analysis.SequenceTools.SequencePatterns.Topologically.RevRepeatsView Example:

{
    "Left": 0,
    "Locis": [
        0
    ],
    "SequenceData": "System.String",
    "RevLocis": [
        0
    ],
    "RevSegment": "System.String"
}

58. -segment

Prototype: seqtools.Utilities::Int32 GetSegment(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools -segment /fasta <Fasta_Token> [-loci <loci>] [/left <left> /length <length> /right <right> [/reverse]] [/ptt <ptt> /geneID <gene_id> /dist <distance> /downstream] -o <saved> [-line.break 100]
Example
seqtools

59. --segments

Prototype: seqtools.Utilities::Int32 GetSegments(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --segments /regions <regions.csv> /fasta <nt.fasta> [/complement /reversed /brief-dump]
Example
seqtools

Parameters information:

[/reversed]

If the sequence is on the complement strand, reversed it after complement operation?

Example
[/complement]

If this Boolean switch is set on, then all of the reversed strand segment will be complemenet and reversed.

Example
[/brief-dump]

If this parameter is set up true, then only the locus_tag of the ORF gene will be dump to the fasta sequence.

Example
Accepted Types
/reversed
/complement
/brief-dump

60. --ToVector

Prototype: seqtools.Utilities::Int32 ToVector(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --ToVector /in <in.DIR> /min <4> /max <8> /out <out.txt> /size <genome.size>
Example
seqtools

61. --translates

Translates the ORF gene as protein sequence. If any error was output from the console, please using > operator dump the output to a log file for the analysis. Prototype: seqtools.Utilities::Int32 Translates(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --translates /orf <orf.fasta> [/transl_table 1 /force]
Example
seqtools

Parameters information:

/orf

ORF gene nt sequence should be completely complement and reversed as forwards strand if it is complement strand.

Example
[/force]

This force parameter will force the translation program ignore of the stop code and continute sequence translation.

Example
[/transl_table]

Available index value was described at http://www.ncbi.nlm.nih.gov/Taxonomy/taxonomyhome.html/index.cgi?chapter=tgencodes#SG25

Example
Accepted Types
/orf
/force
/transl_table

62. --Trim

Prototype: seqtools.Utilities::Int32 Trim(Microsoft.VisualBasic.CommandLine.CommandLine)

Usage
seqtools --Trim /in <in.fasta> [/case <u/l> /break <-1/int> /out <out.fasta> /brief]
Example
seqtools

Parameters information:

[/case]

Adjust the letter case of your sequence, l for lower case and u for upper case. Default value is upper case.

Example
[/break]

Adjust the sequence break when this program write the fasta sequence, default is -1 which means no break, write all sequence in one line.

Example
Accepted Types
/case
/break

New

Clone this wiki locally