Skip to content

Identifying DnaA boxes in the origin of replication of bacterial genomes. (A repository of Python solutions to the code challenges from Unit 1 Bioinformatics Algorithms. )

License

Notifications You must be signed in to change notification settings

ClarissaPereira/Finding-Ori

Repository files navigation

Finding-Ori

A repository of Python solutions to the code challenges from Unit 1 of Bioinformatics Algorithms (Stepik).

Before opening the code files, I recommend quickly reading about the biology background of this project to get a clearer sense of the purpose of each program/function.

The majority of the files in this repo are for a single function. Most of these small functions contribute to the larger Final DnaA Box Finder program.

The Salmonella enterica genome is included as test data. The final DnaA Box Finder identified the S. enterica DnaA box as TTATCCACA. An existing UniProt database entry confirms that the DnaA sequence is 5'-TTATC[CA]A[CA]A-3' - a successful match!

K-mer Searching Functions

Generators

Skew Functions

skewplotex

Mutations & Imperfect Pattern Matching Functions

About

Identifying DnaA boxes in the origin of replication of bacterial genomes. (A repository of Python solutions to the code challenges from Unit 1 Bioinformatics Algorithms. )

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages