New organism: Photinus pyralis #159

Closed
19 tasks done
mpoelchau opened this issue Dec 17, 2020 · 1 comment

mpoelchau commented Dec 17, 2020

NCBI RefSeq # for assembly: GCF_008802855.1

See https://gitlab.com/i5k_Workspace/workspace_roadmap/-/wikis/Adding-an-organism-CWL-update for full description of each task (requires gitlab login). We can use the genomics-workspace cwl workflow now, but it may need some refinement.

i5k-stage

Commands
Loading from /usr/local/i5k/media/blast/db/
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py addblast Photinus pyralis -t nucleotide Genome Assembly -f /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna -d Photinus pyralis genome assembly, Ppyr1.3
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py addblast Photinus pyralis -t nucleotide Transcript -f /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_rna_from_genomic.fna -d Photinus pyralis NCBI Annotation release 100, transcripts
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py addblast Photinus pyralis -t peptide Protein -f /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_translated_cds.faa -d Photinus pyralis NCBI Annotation release 100, translated CDS
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py addblast Photinus pyralis -t nucleotide Transcript -f /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_cds_from_genomic.fna -d Photinus pyralis NCBI Annotation release 100, CDS

Make blast DBs and populate. Only listing commands for the genome for now.
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna -m
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna -p
Make visible
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py blast_shown /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna --shown true

You need quotes around the description; otherwise the spaces are ignored when populating hmmer_hmmerdb.
(i5k) [i5k@i5k-stage-node1 ~]$ python manage.py addhmmer Photinus pyralis -f /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna -d "Photinus pyralis genome assembly, Ppyr 1.3"
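
The quoting requirement comes down to shell word splitting: without quotes, the shell hands the description to the command as several separate words. A minimal sketch of that effect follows. `count_args` is a hypothetical helper written only for this illustration and is not part of genomics-workspace; how `addhmmer` itself treats the extra words is not shown here.

```shell
# Hypothetical helper (not part of genomics-workspace): prints how many
# arguments the shell actually passed to it.
count_args() {
  echo "$#"
}

# Unquoted: the shell splits the description on spaces into separate words.
count_args Photinus pyralis genome assembly, Ppyr1.3                # prints 5

# Quoted: the whole description arrives as one argument, spaces intact.
count_args "Photinus pyralis genome assembly, Ppyr 1.3"             # prints 1
```

Single quotes work the same way for this purpose, since the description contains nothing the shell would expand.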

Adding to jbrowse
(i5k) [i5k@i5k-stage-node1 ~]$ time python manage.py addjbrowse /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna https://apollo.nal.usda.gov/apollo/Photinus%20pyralis/jbrowse/

i5k production

@mpoelchau

Slight updates on the genomics-workspace commands used for prod (see also NAL-i5K/genomics-workspace#321):

python manage.py addorganism Photinus pyralis

#Note - DON'T use the full path of the fasta file, just the file name, and run from within the manage.py directory. (Would be nice to make this less brittle: jbrowse derives the link from the title, which is copied from the file path. It would make more sense to take the file path and strip the directory, right?)
python manage.py addblast Photinus pyralis -t nucleotide Genome Assembly -f GCF_008802855.1_Ppyr1.3_genomic.fna -d 'Photinus pyralis genome assembly, Ppyr1.3'

python manage.py addblast Photinus pyralis -t nucleotide Transcript -f GCF_008802855.1_Ppyr1.3_rna_from_genomic-idupdate.fna -d Photinus pyralis NCBI Annotation release 100, transcripts

python manage.py addblast Photinus pyralis -t peptide Protein -f GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa -d Photinus pyralis NCBI Annotation release 100, translated CDS 

python manage.py addblast Photinus pyralis -t nucleotide Transcript -f GCF_008802855.1_Ppyr1.3_cds_from_genomic-idupdate.fna -d Photinus pyralis NCBI Annotation release 100, CDS
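
Until the title is derived more robustly, the file name required by the note above can be obtained from a full path with the standard `basename` utility. This is only a sketch of the "strip the directory" idea on the shell side, not a change to genomics-workspace; the variable names are made up for the example, and the path is the stage path used earlier in this issue.

```shell
# Full path as used on i5k-stage earlier in this issue.
fasta_path=/usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_genomic.fna

# basename strips the directory part and leaves only the file name.
fasta_name=$(basename "$fasta_path")

echo "$fasta_name"   # prints GCF_008802855.1_Ppyr1.3_genomic.fna
```

The resulting `$fasta_name` could then be passed to `-f` in place of a hand-typed file name.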

#
python manage.py blast_utility GCF_008802855.1_Ppyr1.3_genomic.fna -m 
python manage.py blast_utility GCF_008802855.1_Ppyr1.3_genomic.fna -p
python manage.py blast_shown GCF_008802855.1_Ppyr1.3_genomic.fna --shown true
python manage.py addjbrowse GCF_008802855.1_Ppyr1.3_genomic.fna https://apollo.nal.usda.gov/apollo/Photinus_pyralis/jbrowse/

python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_rna_from_genomic-idupdate.fna -m 
python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_rna_from_genomic-idupdate.fna -p
python manage.py blast_shown /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_rna_from_genomic-idupdate.fna --shown true

python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa -m 
python manage.py blast_utility /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa -p
python manage.py blast_shown /usr/local/i5k/media/blast/db/GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa --shown true

python manage.py blast_utility GCF_008802855.1_Ppyr1.3_cds_from_genomic-idupdate.fna -m 
python manage.py blast_utility GCF_008802855.1_Ppyr1.3_cds_from_genomic-idupdate.fna -p
python manage.py blast_shown GCF_008802855.1_Ppyr1.3_cds_from_genomic-idupdate.fna --shown true
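
The same three steps (`blast_utility -m`, `blast_utility -p`, `blast_shown --shown true`) repeat for every file above, so they could be generated in a loop. The sketch below is a dry run that only prints the commands, using the flags exactly as they appear in this comment. `print_blast_steps` is a hypothetical helper, and using bare file names for all four files is an assumption, since the commands above mix full paths and file names.

```shell
# Hypothetical dry-run helper: prints the three commands for each FASTA file
# instead of running them. To execute for real, replace each echo line with
# the command it prints and run from the manage.py directory.
print_blast_steps() {
  for fasta in "$@"; do
    echo "python manage.py blast_utility $fasta -m"
    echo "python manage.py blast_utility $fasta -p"
    echo "python manage.py blast_shown $fasta --shown true"
  done
}

# The four files loaded for Photinus pyralis in this comment.
print_blast_steps \
  GCF_008802855.1_Ppyr1.3_genomic.fna \
  GCF_008802855.1_Ppyr1.3_rna_from_genomic-idupdate.fna \
  GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa \
  GCF_008802855.1_Ppyr1.3_cds_from_genomic-idupdate.fna
```

Printing first makes it easy to check the file names before anything touches the database.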

#hmmer needs the protein fasta
python manage.py addhmmer Photinus pyralis -f GCF_008802855.1_Ppyr1.3_translated_cds-idupdate.faa -d 'Photinus pyralis NCBI Annotation release 100, translated CDS'
