Skip to content
This repository has been archived by the owner on Jan 31, 2020. It is now read-only.

Auxiliary Data Import

mkiwala-g edited this page Aug 11, 2014 · 20 revisions

Importing a New Human Reference Genome

$ URI='ftp://ftp.ncbi.nih.gov/genbank/genomes/Eukaryotes/vertebrates_mammals/Homo_sapiens/GRCh37/special_requests/GRCh37-lite.fa.gz'

$ genome taxon create                                         \
  --domain=Eukaryota                                          \
  --name=human                                                \
  --ncbi-taxon-id=9606                                        \
  --species-latin-name='Homo sapiens'

$ genome processing-profile create imported-reference-sequence --name=chromosome-fastas

$ wget $URI

$ gunzip GRCh37-list.fa.gz

$ genome model define imported-reference-sequence             \
  --fasta-file=$PWD/GRCh37-lite.fa                            \
  --processing-profile-id=2dc430f34746455b87b3dd179b3a193e    \
  --species-name=human                                        \
  --version=37-lite-test                                      \
  --prefix=GRC                                                \
  --assembly-name=GRCh37-lite                                 \
  --build-name=GRCh37-lite-build37                            \
  --sequence-uri=$URI

Creating a Modified Reference from a Previously Imported Reference Genome

Importing a New Version of dbSNP

Importing a New Version of Ensembl

Clone this wiki locally