Issue with some molecules #199
mearcstapa-gqz
started this conversation in
PCQM4M-LSC
Hi! Great questions! The dataset is what it is, so 1 and 3 could happen. 2 is probably a by-product of our data split. Unseen atoms are extremely rare in the test set (0.0008% of test atoms are unseen during training, and 0.01% of test molecules contain unseen atoms), so this should not affect the result much: even if you make a large 10 eV error on those molecules, the overall test MAE changes by only about 0.001 eV. Handling unseen atoms properly (e.g., making sure the prediction is not extreme, or using periodic-table information rather than atom ID as input features) might help very slightly. Hope this helps!
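To make the arithmetic and the two suggestions above concrete, here is a rough sketch. It is not code from the OGB repository: `mae_shift`, `clip_prediction`, `periodic_features`, the small `PERIOD_GROUP` table, and the clipping bounds are all hypothetical names and values chosen for illustration.

```python
# Illustrative sketch only; none of these helpers come from the OGB codebase.

def mae_shift(fraction_affected: float, abs_error_ev: float) -> float:
    """Increase in overall MAE if `fraction_affected` of the molecules
    each pick up an extra absolute error of `abs_error_ev`."""
    return fraction_affected * abs_error_ev


def clip_prediction(pred: float, lo: float, hi: float) -> float:
    """Keep a prediction inside [lo, hi], e.g. the target range seen in training."""
    return min(max(pred, lo), hi)


# (period, group) for a few common elements, keyed by atomic number.
# Hypothetical mini-table; a real model would cover the whole periodic table.
PERIOD_GROUP = {
    1: (1, 1),    # H
    6: (2, 14),   # C
    7: (2, 15),   # N
    8: (2, 16),   # O
    9: (2, 17),   # F
    14: (3, 14),  # Si
    15: (3, 15),  # P
    16: (3, 16),  # S
    17: (3, 17),  # Cl
    35: (4, 17),  # Br
}


def periodic_features(atomic_number: int) -> tuple:
    """Return (period, group) so an element unseen in training still shares
    features with its periodic-table neighbours. (0, 0) marks 'not in table'."""
    return PERIOD_GROUP.get(atomic_number, (0, 0))


if __name__ == "__main__":
    # 0.01% of test molecules, each with a 10 eV error.
    print(f"MAE shift: {mae_shift(0.01 / 100, 10.0):.4f} eV")
    # The bounds 0.0 and 15.0 are made-up placeholders, not dataset statistics.
    print(clip_prediction(42.0, 0.0, 15.0))
    print(periodic_features(35))
```

The first helper just reproduces the estimate above: 0.01% of molecules times a 10 eV error gives roughly 0.001 eV on the overall MAE. The other two are one possible way to keep predictions for unseen atoms from becoming extreme; whether they help in practice would need to be checked on the validation set.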
Hi, is it intended that