Skip to content

Latest commit

 

History

History
36 lines (36 loc) · 1.43 KB

HG002_small_variant_calling_test_run.md

File metadata and controls

36 lines (36 loc) · 1.43 KB

Preparing the output of Guppy and Minimap2 to test PEPPER-Margin-DeepVariant pipeline for a subset of chromosomes for HG002

WORK_DIR=/data/hg002_pmdv
mkdir -p $WORK_DIR/
  • Create the configuration file (e.g. sample.config in the $PROJECT_DIR)
cp $PROJECT_DIR/sample.config $WORK_DIR/
CONFIG_PATH=$WORK_DIR/sample.config
  • Create a Google Storage Bucket with a unique name and add the configuration file to it e.g.
BUCKET=gs://urwgs_hg002_test_$(date +%s)
gsutil mb $BUCKET
sed -i "s|^BUCKET=.*$|BUCKET=${BUCKET}|g" $CONFIG_PATH
gsutil cp $CONFIG_PATH ${BUCKET}/sample.config
  • Transfer data and create the appropriate directory for testing
$PROJECT_DIR/simulation/simulate_alignment_output.sh
  • Add cron job for deleting instances based on the status from the instances
(crontab -u $USER -l; echo -e "*/3 * * * * bash -c $PROJECT_DIR/manage_instances/delete_instances_pmdv_wrapper.sh >> /data/logs/delete_instances_stdout.log 2>> /data/logs/delete_instances_stderr.log") | crontab -u $USER - 
  • Start 2 instances to run PEPPER-Margin-DeepVariant on chr2 and chr16,chr21
parallel -j 2 $PROJECT_DIR/manage_instances/pmdv_instance.sh ::: \
	pmdv-{1..2} :::+ \
	chr2 \
	chr16:chr21 ::: \
	${BUCKET}/sample.config
  • The generated output is transferred to $BUCKET/pmdv_${DV}_${DV_MODEL}_output.