Skip to content

Sequence designer for caDNAno. Returns generated sequences of caDNAno file, even if they contain multiple scaffolds or multiple scaffold breaks.

License

Notifications You must be signed in to change notification settings

SBlokhuizen/SequenceDesigner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

49 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sequence Designer for caDNAno

Overview

This program makes it possible to sequence scaffold and staple strands, even if the cadnano file contains multiple separate scaffolds.

Usage

The program can be run using the following command:

python3 seq_designer.py <cadnano json file> <scaffold file>

For example:

python3 seq_designer.py json_files/test_virtual.json scaffold_files/M13mp18 

Input

The program will require two inputs as arguments:

  • cadnano .json file
  • scaffold sequence file - this sequence will be assigned to the longest scaffold strand in the caDNAnojson file. The other scaffold sequences will be pseudorandomly generated.

Output

The program will generate three output files:

  • scaffolds.txt - contains the sequences of the scaffold strands. Moreover, it contains the start and end location, and the length of each scaffold.
  • staples.txt - contains the sequences of the staple strands. Moreover, it contains the start and end location, and the length of each staple.
  • visualized_sequence.txt - contains a nicely formatted visualization of the scaffold and staple sequence data, analogous to the visual representation in cadnano. This might be useful for checking the final results.

Example Output

Here is an example for the outputs using a small caDNAno file. (json_files/small_twobreak.json and M13mp18 scaffold, specifically).

image

scaffolds.txt

Start,End,Sequence,Length
1[6],0[5],GTGATGATT,9
0[6],1[7],AATGCTACTAC,11

staples.txt

Start,End,Sequence,Length
1[2],1[11],ATCACGTAGT,10
0[11],0[2],AGCATTAATC,10

visualized_sequence.txt

Scaffold 0    |--GATTAATGCT---------|
Staple 0      |--CTAATTACGA---------|

Staple 1      |--ATCACGTAGT---------|
Scaffold 1    |--TAGTGCATCA---------|

About

Sequence designer for caDNAno. Returns generated sequences of caDNAno file, even if they contain multiple scaffolds or multiple scaffold breaks.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages