-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Repcred runs out of memory #37
Comments
Is this in ipa1? I am using this command and I only get 4579 sequences. If found the repertoire_id using the gateway.
|
Sorry, that is on ipa3.ireceptor.org. Unfortunately on our old repositories our repertoire_id fields are not unique so this type of confusion can happen.
Also, unfortunately, on the Gateway there is no easy way to see which of the IPAs this repertoire is on. If the repertoire_id was unique, then you could search them all and it would only show up on one of them. This is an issue we need to address... |
And I run out of patience. It takes forever to run the chunk |
When I am running these jobs, repcred is reporting that it is downsampling:
So it is running out of memory either while down sampling, or maybe one of the analysis steps isn't down sampling??? The last job reported this before it was killed for exceeding memory limits.
|
Hmm, this failed with running with 15GB of memory, so maybe this is a bug of some kind. It seems odd that 2M works fine in 8GB but 4M fails on 15GB. I am re-running with 30GB to confirm. |
Looks like my job isn't getting memory allocated like I think it is... So ignore my comment about it failing with 15GB. I need to test still. |
It looks like 4M sequences requires about 12 GB of memory, which is why it failed at 8GB. If I run with 30GB it works fine and one of the job summary tools reports over 11GB of memory used. |
The largest repertoire in the ADC is 16M annotations, so this would presumably require a very large amount of memory if this scales linearly which at a very basic level it seems to be close to that based on my quick testing. |
I ran this on a Repertoire of 4M sequences. Would you expect this? Repcred was set to downsample??? This ran on a compute node with 8GB of memory, so this tells me that the job used more than that for a significant amount of time.
This is where the output got to:
See #35 for details of scalability/performance testing.
Failed
The text was updated successfully, but these errors were encountered: