-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stata 17 crashes when using iebaltab
with 4.3GB dataset
#368
Comments
Wow, you are really putting our code to the test. Fun! Here are my first reactions to what you have already tested:
Questions:
Suggestions:
Let me know what these comments make you think or what these suggestions teaches you. Happy to keep working with you until this is resolved. However, it might also be related to Stata (especially on Linux) where I would not be able to help with a solution. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello,
I am a Research Computing Facilitator at FASRC. Raul Duarte reached out to our support because he was running a Stata code with the function
iebaltab
on our cluster and the job was dying midway through computation. We troubleshot extensively without much progress, so we are reaching out to you for guidance. I will try to summarize the computational environment and what we have done so far.Unfortunately, because Raul’s data cannot be shared (because of a Data Use Agreement [DUA] signed), we cannot share the data, but we will try to explain as much as possible.
Computational environment
fasse_bigmem
partition: Intel Ice Lake chipset, 499 GB of RAM,/tmp
space is 172 GBfasse_ultramem
partition: Intel Ice Lake chipset, 2000 GB of RAM,/tmp
space is 396 GBAnalysis
Raul wrote a Do file that uses the iebaltab function to analyze a dataset that is 4.3GB:
Raul wrote:
His typical run was on
fasse_bigmem
(499 GB of RAM and 64 cores).Troubleshooting steps
max_memory
to slightly less than the total memory, he set it to 495 GB when the memory requested onfasse_bigmem
was 499 GB.top
to see cpu and memory usage and I also kept checking the disk usage of/tmp
with thedu
command. The core usage was almost at 100% for all 64 cores, memory was at about 5-6% (of 499 GB), and /tmp had about 4-5 GB usage. At about 1h, I could see each process dying and everything stalled.I am hoping that you have some guidance if Raul possibly ran into a bug or something on our end that we need to change.
Thank you for taking the time to read this. We will be happy to answer any questions.
Best,
Paula and Raul
The text was updated successfully, but these errors were encountered: