Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Weather could provide a method for comparing multiple samples at once #876

Open
PingWang7 opened this issue Aug 28, 2024 · 3 comments
Open

Comments

@PingWang7
Copy link

Kraken2 is an excellent tools to align microbial sequences.

While in each run, loading the database is a rather time-consuming process.

I wonder if future developers could provide a way to load the database only once, and align multiple samples at once.

@ChillarAnand
Copy link

There is --memory-mapping option for this purpose. You can read the kraken2 manual regarding this.

https://github.com/DerrickWood/kraken2/blob/master/docs/MANUAL.markdown#system-requirements

I have also written a detailed tutorial on --memory-mapping and other tips to improve classification speed.

https://avilpage.com/2024/07/mastering-kraken2-performance-optimisation.html

@PingWang7
Copy link
Author

@ChillarAnand Thanks a lot. It's really a good solution to use --memory-mapping option in kraken2 to speed up, however, sometimes it is unclear why using this parameter results in slower runtime compared to not using it.

@ChillarAnand
Copy link

It all depends on how many index pages are in RAM already. As mentioned in the article, you can use vmtouch to make sure you get consistent performance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants
@ChillarAnand @PingWang7 and others