Support dummy sequence MSA #28

Merged
jnwei merged 7 commits into main from feature/no-msa-prediction on Nov 13, 2025

Conversation

haewonc (Contributor) commented Nov 6, 2025

Support a dummy main MSA consisting of only the query sequence.
Figure: Complex with and without dummy MSA (PDB: 3WBM)
no_vs_dummy
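
As a rough illustration of what a "dummy" main MSA means here, the sketch below writes an alignment file whose only row is the query itself. It assumes an A3M/FASTA-style layout; the helper name `write_dummy_msa`, the `query` header, and the output path are hypothetical and are not taken from this PR's diff.

```python
from pathlib import Path


def write_dummy_msa(sequence: str, out_path: Path, header: str = "query") -> Path:
    """Write a single-record A3M-style file that contains only the query.

    Hypothetical helper for illustration; the PR's real code may differ.
    """
    if not sequence:
        raise ValueError("query sequence must be non-empty")
    # Create the parent directory so the helper can be called per chain.
    out_path.parent.mkdir(parents=True, exist_ok=True)
    # One header line plus one sequence line gives an MSA of depth 1.
    out_path.write_text(f">{header}\n{sequence}\n")
    return out_path
```

An alignment written this way has depth 1, so downstream MSA features would carry no information beyond the query sequence.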

@haewonc haewonc requested review from gnikolenyi and jnwei November 6, 2025 16:22
for query_name, query in inference_query_set.queries.items():
    for chain in query.chains:
        if (
            chain.molecule_type == MoleculeType.PROTEIN
Collaborator

Instead of hard-coding the moltypes, could you make them depend on MSASettings.moltypes (it should be in projects.of3_all_atom.config.dataset_config_components)? We use it elsewhere to determine which molecule types are expected to have MSAs.
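
A minimal sketch of this suggestion, under stated assumptions: `MoleculeType`, `MSASettings`, and `Chain` below are simplified stand-ins defined only for this example, not the project's real classes, and the default `moltypes` value is a guess. The point is only that the filter reads the allowed types from the settings object instead of comparing against fixed enum members.

```python
from dataclasses import dataclass
from enum import Enum, auto


class MoleculeType(Enum):
    # Stand-in enum; the project's real MoleculeType may have other members.
    PROTEIN = auto()
    RNA = auto()
    DNA = auto()
    LIGAND = auto()


@dataclass
class MSASettings:
    # Stand-in for the real settings class; only `moltypes` is modeled,
    # and this default is an assumption.
    moltypes: tuple = (MoleculeType.PROTEIN, MoleculeType.RNA)


@dataclass
class Chain:
    chain_id: str
    molecule_type: MoleculeType


def chains_needing_msa(chains, settings):
    """Return the chains whose molecule type is configured to have an MSA."""
    allowed = set(settings.moltypes)
    return [chain for chain in chains if chain.molecule_type in allowed]
```

With this shape, changing which molecule types receive a dummy MSA becomes a configuration change rather than a code change.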

Contributor Author

Oh, moltypes is an attribute of MSASettings, not MSAComputationSettings.
So I think it's not accessible yet at the DataModule prepare_data level.
Is there any way to cleanly address this?
But I think it's okay to compute and save for both. Later, if moltypes doesn't include RNA or PROTEIN, this field will be ignored. What do you think?

gnikolenyi (Collaborator) left a comment


Thanks so much, Minji. Nice start; I added some suggestions.

gnikolenyi (Collaborator) left a comment


Sorry to spam. I just noticed that there is an issue with one of the unit tests; would you mind also taking a look at that?

jnwei (Contributor) commented Nov 7, 2025

This is great Minji!

Could you please add a test for feature generation using the dummy MSA file? I think you could do it by extending or copying this test: https://github.com/aqlaboratory/openfold-3/blob/main/openfold3/tests/test_colabfold_msa.py#L225
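
A test along these lines might look roughly like the sketch below. It does not use the repository's fixtures or the linked test; `parse_a3m` is a small hypothetical parser written for this example, and the A3M layout of the dummy file is an assumption. It checks only the core property: a dummy MSA parses to exactly one row, and that row equals the query.

```python
import tempfile
from pathlib import Path


def parse_a3m(path):
    """Return (header, aligned_sequence) records from an A3M-style file.

    Hypothetical helper: lowercase letters (A3M insertion columns) are
    dropped so every row has the query's length.
    """
    records = []
    header, parts = None, []
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(parts)))
            header, parts = line[1:], []
        else:
            parts.append("".join(ch for ch in line if not ch.islower()))
    if header is not None:
        records.append((header, "".join(parts)))
    return records


def test_dummy_msa_has_single_row():
    query = "MKTAYIAKQR"
    with tempfile.TemporaryDirectory() as tmp:
        msa_path = Path(tmp) / "dummy.a3m"
        msa_path.write_text(f">query\n{query}\n")
        records = parse_a3m(msa_path)
    # A dummy MSA has depth 1 and its only row is the query itself.
    assert len(records) == 1
    assert records[0] == ("query", query)
```

A real test would feed the dummy file through the project's feature pipeline and assert on the resulting feature shapes, which this sketch deliberately leaves out.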

@jnwei jnwei linked an issue Nov 7, 2025 that may be closed by this pull request
@jnwei jnwei merged commit b7940cb into main Nov 13, 2025
4 checks passed
@jnwei jnwei deleted the feature/no-msa-prediction branch November 13, 2025 07:32
Development

Successfully merging this pull request may close these issues.

single sequence prediction

3 participants