Add parallel execution for extract_prescored.py #83

yangyxt · 2024-12-16T09:05:30Z

This can at most lead to 24 parallel subprocesses to extract the prescored variants.

I did this because I run into a situation where this step has been running for over 41 hours to extract prescored records for a VCF file with near 300k variant records.

yangyxt · 2024-12-19T23:58:20Z

I also refractor the esmSCore_inFrame and esmScore_frameshift script because I found them running over 30 hours to process a VCF file with 500k variants.

The most time consuming part is using list appending (append a single item on a big list is incredibly slow in python), so I switch them to use numpy array instead.

yangyxt added 12 commits December 16, 2024 11:17

add parallel prescore extraction

97f7602

close stdin if needed

b4e2c89

migrate to python 2.7

ceaa15c

add a log line to show the used thread number

3714d18

add logging lines

039e99c

update Snakefile and adding a thread argument for checkpoint process

dbf1ec6

fix a syntax error

3097ec9

remove the temp_dir clean part

e479ad3

optimize tabix indexing check

1abe15a

Refractor esmScore_inFrame_av.py to greatly improve the performance

00873b5

Refractor esmScore_frameshift_av.py to greatly improve the performance

daf241c

Refractor esmScore_inFrame_av.py to greatly improve the performance

5e99d6b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add parallel execution for extract_prescored.py #83

Add parallel execution for extract_prescored.py #83

Uh oh!

yangyxt commented Dec 16, 2024

Uh oh!

yangyxt commented Dec 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add parallel execution for extract_prescored.py #83

Are you sure you want to change the base?

Add parallel execution for extract_prescored.py #83

Uh oh!

Conversation

yangyxt commented Dec 16, 2024

Uh oh!

yangyxt commented Dec 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant