
Conversation

@orionarcher

Summary of Changes

This PR introduces support for the three high-level runners in TorchSim: optimize, integrate, and static. It attempts to provide an easy-to-use interface for quacc users that relies on minimal TorchSim imports. It also adopts the quacc nomenclature (relax_job, md_job, and static_job) and I/O conventions (input atoms, return dicts).

I initially implemented a TorchSim runner, but it ended up being just an extra unnecessary layer, so I decided recipes would be the best way forward.

Happy to take any feedback!

Requirements

Note: If you are an external contributor, you will see a comment from @buildbot-princeton. This is solely for the maintainers.

@buildbot-princeton
Collaborator

Can one of the admins verify this patch?

@orionarcher orionarcher mentioned this pull request Nov 4, 2025
@codecov

codecov bot commented Nov 5, 2025

Codecov Report

❌ Patch coverage is 87.15084% with 23 lines in your changes missing coverage. Please review.
✅ Project coverage is 97.62%. Comparing base (1e27489) to head (b723310).

Files with missing lines Patch % Lines
src/quacc/recipes/torchsim/_base.py 80.83% 23 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #3005      +/-   ##
==========================================
- Coverage   98.18%   97.62%   -0.57%     
==========================================
  Files          92       95       +3     
  Lines        3863     4042     +179     
==========================================
+ Hits         3793     3946     +153     
- Misses         70       96      +26     

☔ View full report in Codecov by Sentry.

Member

@Andrew-S-Rosen Andrew-S-Rosen left a comment


Hi @orionarcher! Thank you very much for this PR!

Overall, this is looking to be in relatively good shape. I have left some minor cleanup related comments below.

One larger question: by avoiding the use of runners entirely, there is no directory management. I assume there is some I/O being done? Everything will run in the current working directory at the moment, which would be problematic if there is I/O. You will see that the various runners have a setup and cleanup method that they call from the BaseRunner class to take care of file management. Is there I/O to worry about here, or is it all kept in-memory?

Another comment: some of the @job-decorated recipes take classes as arguments. If these are classes that are not instantiated, that's fine. But it looks like some of these arguments are instantiated classes (e.g. the autobatcher). This is problematic for workflow tools, especially those from the MP side, because it is not possible to (de)serialize an instantiated class. The workaround from MP tooling is, as you know, that such classes need to have as_dict() and from_dict() methods. It's okay if internally called functions take such classes, but the main @job-decorated function cannot unless they are monty serializable. Any thoughts about how to best address this?
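For reference, the (de)serialization contract being asked for looks roughly like this. AutoBatcherConfig below is a hypothetical stand-in for an instantiated autobatcher; the real TorchSim class and its constructor arguments will differ.

```python
class AutoBatcherConfig:
    """Hypothetical serializable stand-in for an instantiated autobatcher."""

    def __init__(self, max_memory_scaler: float = 0.8):
        self.max_memory_scaler = max_memory_scaler

    def as_dict(self) -> dict:
        # Round-trippable, monty-style representation a workflow engine can store
        return {
            "@module": self.__class__.__module__,
            "@class": self.__class__.__name__,
            "max_memory_scaler": self.max_memory_scaler,
        }

    @classmethod
    def from_dict(cls, d: dict) -> "AutoBatcherConfig":
        return cls(max_memory_scaler=d["max_memory_scaler"])


restored = AutoBatcherConfig.from_dict(AutoBatcherConfig(0.5).as_dict())
assert restored.max_memory_scaler == 0.5
```

In practice, inheriting from monty's MSONable gives this behavior for free when the constructor arguments are themselves serializable.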

Comment on lines 288 to 295
```python
model = model if isinstance(model, ModelInterface) else pick_model(*model)

state = ts.initialize_state(atoms, model.device, model.dtype)
if autobatcher:
    autobatcher = ts.runners._configure_batches_iterator(
        state, model, autobatcher=autobatcher
    )  # type: ignore
autobatcher_dict = _get_autobatcher_dict(autobatcher)
```
Member


If I understand correctly, this stuff is repeated through most of the recipes. To keep the recipes as minimal as possible, perhaps this is something that can be abstracted out into _base.py as a function?
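A minimal sketch of what such a shared helper in _base.py could look like. The stubs for pick_model, initialize_state, and the batching call are stand-ins for the TorchSim machinery; the real signatures will differ.

```python
def pick_model(name: str) -> dict:
    # Stand-in: the real helper resolves a model spec to a loaded model
    return {"name": name, "device": "cpu", "dtype": "float64"}


def initialize_state(atoms, device: str, dtype: str) -> dict:
    # Stand-in for ts.initialize_state(atoms, model.device, model.dtype)
    return {"atoms": atoms, "device": device, "dtype": dtype}


def setup_model_and_state(atoms, model, autobatcher=None):
    """Shared entry point so each recipe avoids repeating this block."""
    if not isinstance(model, dict):  # mimics the ModelInterface check
        model = pick_model(model)
    state = initialize_state(atoms, model["device"], model["dtype"])
    # Without an autobatcher, treat the whole state as a single batch
    batches = [state] if autobatcher is None else autobatcher(state, model)
    return model, state, batches


model, state, batches = setup_model_and_state(["H", "H"], "mace")
assert state["device"] == "cpu" and len(batches) == 1
```

Each recipe would then reduce to one call plus its runner-specific logic.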

Author


Done! Moved this all out to _base.py along with the type hints (some of which needed to be there anyway).

@orionarcher
Author

Hi Andrew, thanks for the comments!

Is there I/O to worry about here, or is it all kept in-memory?

Good catch. There are output files for these calculations. I'll add in calc_setup / calc_teardown calls to get the output files in the proper location. While it makes the code a little noisier, I still think it's preferable to having a whole extra set of classes and functions.

It's okay if internally called functions have such classes, but the main @job-decorated function cannot have them unless they are monty serializable. Any thoughts about how to best address this?

Very good point. I wasn't thinking about this, but it definitely needs a change. I am also working on an Atomate2 API where I have eliminated the "live" classes and replaced them with config dicts instead. I'll transfer that work over to this PR, which should solve the issue.

Will respond to your other comments as I address them!

@orionarcher
Author

I think I've addressed all the changes! Only the larger question remains:

One larger question: by avoiding the use of runners entirely, there is no directory management. I assume there is some I/O being done? Everything will run in the current working directory at the moment, which would be problematic if there is I/O. You will see that the various runners have a setup and cleanup method that they call from the BaseRunner class to take care of file management. Is there I/O to worry about here, or is it all kept in-memory?

There is no extraneous I/O here (i.e. input sets), but we do write out trajectory files as output. Users must provide names for the trajectories. If they use something like "file.traj", it will end up at "./file.traj"; of course, they could also specify "/exact/path/file.traj". In my mind, this is the correct behavior: workflow runners will write un-rooted paths to the current dir, and users still have control if they want to hard-code it. Does that make sense to you?

@Andrew-S-Rosen
Member

Andrew-S-Rosen commented Nov 21, 2025

Thanks, @orionarcher! I will review soon.

There is no extraneous I/O here (i.e. input sets), but we do write out trajectory files as output. Users must provide names for the trajectories. If they use something like "file.traj", it will end up at "./file.traj"; of course, they could also specify "/exact/path/file.traj". In my mind, this is the correct behavior: workflow runners will write un-rooted paths to the current dir, and users still have control if they want to hard-code it. Does that make sense to you?

Unfortunately, this will not work out because it requires the workflow runner to essentially cd into a new directory for every run, which is not what happens across the board. It is what happens with FireWorks but not for most other tools. In the more general case, we will have file.traj overwriting itself over and over. Everything related to paths must be handled by quacc and be an absolute path (to avoid multithreading issues where cd calls are a nightmare...), which is why the Runner classes inherit from the BaseRunner, which has a setup and cleanup method that automatically handles directory setup and cleanup. The added benefit of this approach is that it allows quacc to manage different directories for where calculations should be run vs. where they should be stored after the run.

I will try to provide some guidance about how to address this. I do not think it will be especially challenging. It probably means just creating a TorchSimRunner that inherits from BaseRunner with basically no extra features. We use the TorchSimRunner to set up the directory, pass the runtime directory path to the trajectory name like here, and don't allow the user to overwrite it. I'm happy to provide more direct suggestions next week!
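For concreteness, a minimal sketch of that suggestion, where BaseRunner below is a stub standing in for quacc's real BaseRunner (whose setup/cleanup handle the scratch and results directories described above):

```python
import tempfile
from pathlib import Path


class BaseRunner:
    """Stub for quacc's BaseRunner; the real setup/cleanup manage the
    run and storage directories."""

    def setup(self) -> None:
        self.tmpdir = Path(tempfile.mkdtemp())  # stand-in for the managed run dir

    def cleanup(self) -> None:
        pass  # the real method moves results to the final storage directory


class TorchSimRunner(BaseRunner):
    """Hypothetical runner with essentially no extra features."""

    def run(self, trajectory_name: str) -> Path:
        self.setup()
        # Root the user-supplied file name in the managed directory, so the
        # absolute path is controlled by quacc rather than the user.
        traj_path = self.tmpdir / trajectory_name
        traj_path.touch()  # the real recipe would write the trajectory here
        self.cleanup()
        return traj_path


path = TorchSimRunner().run("file.traj")
assert path.is_absolute() and path.name == "file.traj"
```

This keeps the user's chosen file name while guaranteeing every run writes to its own absolute directory, avoiding the file.traj collision described above.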

@orionarcher
Author

Unfortunately, this will not work out because it requires the workflow runner to essentially cd into a new directory for every run, which is not what happens across the board.

Got it! Understood. That was my (incorrect) expectation.

Yeah, I'm not sure I grok it, so a little more guidance would be helpful. I can create a runner if needed, or if there is a way to get away with just adding a copy_files arg to the optimize, integrate, and static functions, perhaps that could work too.
