-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
limit / reduce disk space usage #37
Comments
Hi, Could I ask also what cheers, |
Hi, cheers, |
Thank you very much for the feedback and I apologize for the late reply. I am glad you got it to run. We implemented the logic to limit disk space usage in MMseqs2 (the library MetaEuk uses) and it was quite demanding in terms of the possible scenarios it had to cover. The behavior you describe strongly indicates something is not fully working there. I will open an issue for MMseqs2 and refer to this issue. I hope we can get to this in future versions. |
Expected Behavior
successfully run easy-predict on large chromosome-level genome assembly (within BUSCO)
Current Behavior
metaeuk runs, but runs out of disk space (5TB), even if I impose a --disk-space-limit of 3TB
Steps to Reproduce (for bugs)
Don't think there's a bug - just looking for a way to limit disk space usage. I have access to a server with 2x Intel(R) Xeon(R) CPU E5-2690 v4 @ 2.60GHz (14 Cores / 28 threads per CPU), with 1.5T RAM and atm 5TB of disk space.
Command (within BUSCO):
metaeuk easy-predict --threads 14 Neoceratodus_forsteri.fna run_vertebrata_odb10/metaeuk_output/refseq_db_rerun.faa run_vertebrata_odb10/metaeuk_output/rerun_results/Neoceratodus_forsteri.fna run_vertebrata_odb10/metaeuk_output/tmp --max-intron 130000 --max-seq-len 160000 --min-exon-aa 5 --max-overlap 5 --min-intron 1 --overlap 1 -s 6 --slice-search 1 --remove-tmp-files 1 --disk-space-limit 3000G --split-mode 0 --split-memory-limit 1500G
last few parameters from 'slice-search' onwards, were my attempts to limit/reduce disk space usage and limit RAM usage. The rest I can't control - this is BUSCO behaviour.
Context
Running metaeuk as part of the BUSCO pipeline (v5.2.1) on a publicly available large Eukaryote genome (Australian lungfish)
Your Environment
Include as many relevant details about the environment you experienced the bug in.
metaeuk Version: 4.a0f584d
The text was updated successfully, but these errors were encountered: