-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problems during model training. #192
Comments
Hi, this isn’t enough information to help you debug your training run.
Usually this indicates that the training data itself is numerically
unstable. I’d convert the tfrecords back to numpy arrays and verify their
integrity and the appropriateness of your loss function.
…On Fri, Apr 5, 2024 at 11:42 PM amingc3 ***@***.***> wrote:
I have trained the new two genomes, but during the training Pearsonr has
been NAN and r2 is inf. and the model automatically stops training at the
12th time. Can you please help me to solve this problem?
image.png (view on web)
<https://github.com/calico/basenji/assets/165865744/843ac042-4d5e-4bcd-8f85-da986580d74e>
—
Reply to this email directly, view it on GitHub
<#192>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AABKFEG7QKOTHW77HEYXCL3Y36KNFAVCNFSM6AAAAABF2GSJ3CVHI2DSMVQWIX3LMV43ASLTON2WKOZSGIZDSMJQHAYTONQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Thank you for your reply, I have looked at the existing tfr file and there are a lot of warnings during the generation of the tfr file that the corresponding fragment is not found in the cool file, I don't understand what the problem is, I hope you can help me to see if it's a parameter setting error! |
I think you're using Akita for Hi-C prediction. Is that right? Your sequence length 6,291,456 is much larger than anything we've tried before. I recommend starting with something like 1,048,576. If you share the dataset construction warnings, we might be able to help. |
The last portion seems to encounter a switch we made a few years ago to specify the one hot encoded nucleotides with a single integer, rather than a length 4 array of actual one hot encodings. You might follow the code around that to see if you can identify the problem. But beyond that, you're just not sharing a clear enough description of the steps you're taking for me to help. |
I have trained the new two genomes, but during the training Pearsonr has been NAN and r2 is inf. and the model automatically stops training at the 12th time. Can you please help me to solve this problem?
The text was updated successfully, but these errors were encountered: