fix: preparse_alignments_of3 and add tests #81

jandom · 2026-01-05T14:55:51Z

Summary

Looks like I introduced a bug during a refactor (what else is new!) in PR #77: I added some extra structure with the BaseModel, but the underlying code expects a dictionary. It'd be cleaner to make everything consume the BaseModel, but in the absence of tests, I just use the BaseModel for user-input validation and then dump to a dict.
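
A minimal sketch of the validate-then-dump pattern described above. A stdlib dataclass stands in for the BaseModel so the snippet has no third-party dependency; `PreparseSettings`, its fields, and `legacy_preparse` are hypothetical names, not the code touched by this PR.

```python
from dataclasses import asdict, dataclass


@dataclass
class PreparseSettings:
    """Hypothetical user-facing settings; stands in for the BaseModel."""

    alignments_directory: str
    max_seq_count: int = 1024

    def __post_init__(self):
        # Validate user input up front, before any work is done.
        if not self.alignments_directory:
            raise ValueError("alignments_directory must not be empty")
        if self.max_seq_count <= 0:
            raise ValueError("max_seq_count must be positive")


def legacy_preparse(settings: dict) -> str:
    """Hypothetical downstream code that still expects a plain dictionary."""
    return f"{settings['alignments_directory']}:{settings['max_seq_count']}"


# Validate with the structured object, then hand a dict to the older code.
settings = PreparseSettings(alignments_directory="alignments", max_seq_count=16)
result = legacy_preparse(asdict(settings))
```

If the BaseModel is pydantic's (an assumption; the description does not say), the dump step in pydantic v2 would be `model_dump()` rather than `asdict`.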

Changes

Fixed the bug
Added a smoke test to confirm files are being written out (and are not empty!)
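
For illustration, a smoke check of this kind could look roughly like the sketch below. The helper name `non_empty_outputs` and the placeholder file are assumptions; the actual test in this PR runs the script through a CLI runner instead of writing files by hand.

```python
import tempfile
from pathlib import Path


def non_empty_outputs(output_dir: Path, pattern: str = "*.npz") -> list[Path]:
    """Return files matching pattern, failing if none exist or any is empty."""
    files = sorted(output_dir.glob(pattern))
    assert files, f"no files matching {pattern} in {output_dir}"
    empty = [f.name for f in files if f.stat().st_size == 0]
    assert not empty, f"empty output files: {empty}"
    return files


with tempfile.TemporaryDirectory() as tmp:
    out = Path(tmp)
    # Stand-in for running the script: write one non-empty placeholder file.
    (out / "example.npz").write_bytes(b"placeholder")
    checked = [f.name for f in non_empty_outputs(out)]
```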

Related Issues

Testing

Other Notes

docs/source/precomputed_msa_how_to.md

jnwei

LGTM, some small nits regarding documentation on tests.

jnwei · 2026-01-20T11:03:07Z

openfold3/tests/scripts/data_preprocessing/test_preparse_alignments_of3.py

+
+        # Check that npz files were created for both chains
+        npz_files = list(tmp_path.glob("*.npz"))
+        assert len(npz_files) == 6, (


Could we add a little more context for why 6 is the expected number of npz files? Would it make sense to check for the list of filenames instead?

Yeah, I re-organized the tests a bit – by default the script processes the whole directory and that's not ideal (we were mixing loose alignment files with 2 PDB, which were the actual inputs)
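
A sketch of the filename-based check suggested above, as an alternative to asserting a bare count. The names in `EXPECTED_NPZ` are hypothetical; the real ones depend on the test data.

```python
import tempfile
from pathlib import Path

# Hypothetical expected outputs; the real names depend on the test data.
EXPECTED_NPZ = {"chain_a.npz", "chain_b.npz"}


def missing_and_unexpected(
    output_dir: Path, expected: set[str]
) -> tuple[set[str], set[str]]:
    """Compare the .npz names in output_dir against the expected set."""
    found = {p.name for p in output_dir.glob("*.npz")}
    return expected - found, found - expected


with tempfile.TemporaryDirectory() as tmp:
    out = Path(tmp)
    for name in EXPECTED_NPZ:
        (out / name).write_bytes(b"placeholder")
    # Non-npz files are ignored by the glob.
    (out / "notes.txt").write_text("not an npz file")
    missing, unexpected = missing_and_unexpected(out, EXPECTED_NPZ)
```

Comparing sets makes a failure message name the missing or extra file, which a count mismatch cannot do.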

jnwei · 2026-01-20T11:04:40Z

openfold3/tests/scripts/data_preprocessing/test_preparse_alignments_of3.py

+        return CliRunner()
+
+    def test_preparse_databases(self, cli_runner, tmp_path):
+        """Test preparsing alignments with a single database (uniref90_hits)."""


Could we add the expected input file type (e.g. .sto)?

It actually could be both a3m/sto – I'm hesitant to write documentation for the script in the test :P What's the intent here?

When reviewing, it isn't obvious to me what the inputs to this script are since, as you mention later, we have a combination of inputs and outputs in the testdata/alignments directory.

It would be helpful to leave a signpost, either in the test or in the test_dir, of the expected inputs to help future readers / maintainers.

Another option could be to rearrange the test directory to have the inputs / outputs separated.

Ah, makes sense now – so this is how this directory is organized

The two directories make sense. As for the loose files directly under alignments/, I have no idea what they are - they're not outputs. In fact, they don't seem to be used by anything. I'm going to do an exploratory 'rm'...

Added the docstring explaining what the script does to these inputs.

vinay-swamy

LGTM

jnwei · 2026-01-22T04:18:06Z

This should be good to merge after fixing a small spacing typo for directory paths.

jandom · 2026-01-22T07:48:49Z

@jnwei thanks for the fix, let's merge this bad boy

fix: preparse_alignments_of3 and add tests

7f805f5

jandom requested a review from vinay-swamy January 5, 2026 14:55

jandom self-assigned this Jan 5, 2026

run linter

9a8764f

jandom added the safe-to-test label (internal only; indicates PRs that are ready for automated CI testing) Jan 5, 2026

Remove overly custom fixture (gen-ai slop)

c70bb18

jandom marked this pull request as ready for review January 19, 2026 14:28

Merge branch 'main' into jandom/2026-01/fix/preparse_alignments_of3

bcbf2dc

jandom requested a review from jnwei January 19, 2026 14:56

vinay-swamy requested changes Jan 19, 2026

docs/source/precomputed_msa_how_to.md (resolved)

review: comments from Vinnie

e9bb2df

jandom requested a review from vinay-swamy January 20, 2026 10:45

jnwei reviewed Jan 20, 2026

vinay-swamy approved these changes Jan 20, 2026

review: change the test to be more useful

6ef0685

jandom added and removed the safe-to-test label Jan 20, 2026

Merge branch 'main' into jandom/2026-01/fix/preparse_alignments_of3

8773f86

jandom added and removed the safe-to-test label Jan 20, 2026

Merge branch 'main' into jandom/2026-01/fix/preparse_alignments_of3

c984345

jandom added and removed the safe-to-test label Jan 21, 2026

jandom requested a review from jnwei January 21, 2026 09:11

review: comment from Jennifer

1d81a94

jandom added and removed the safe-to-test label Jan 21, 2026

remove stray space that was added to TEST_DIR path

d71eac7

jnwei added and removed the safe-to-test label Jan 22, 2026

jandom merged commit 5c464e5 into main Jan 22, 2026
5 checks passed

jandom deleted the jandom/2026-01/fix/preparse_alignments_of3 branch January 22, 2026 07:49
