Skip to content

[BUG] planar sp2 C=C bond giving a 3D geometry #136

@jandom

Description

@jandom

Describe the bug

This is a write up for a bug that Deanna Wood (Vizcarra lab) have reported. This is a bug that relates to bad small molecule geometry, and impacts the latest p2 release (155k checkpoint), and probably everything else.

This is in fact a common failure mode for all AF3-like models

Here is the molecule in question

Image

To Reproduce

We tried a couple of smiles for this molecule

  • O=C1NC(=S)N(c2cccc(Br)c2)C(=O)/C1=C/c1ccco1 canonical smiles
  • BrC1=CC=CC(N(C(/C(C(N2)=O)=C/C3=CC=CO3)=O)C2=S)=C1 smiles
  • [H]\\C(C1=CC=CO1)=C1\\C(=O)NC(=S)N(C1=O)C1=CC(Br)=CC=C1 explicitly placed hydrogen

Here are the input JSONs

{
  "queries": {
    "diaph1_smifh2": {
      "chains": [
        {
          "molecule_type": "protein",
          "chain_ids": ["A"],
          "sequence": "GLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDRFENNELFAKLTLTFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFGVVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSESFSNLLEITLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKFPDELAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSFVKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGV"
        },
        {
          "molecule_type": "protein",
          "chain_ids": ["B"],
          "sequence": "GLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDRFENNELFAKLTLTFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFGVVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSESFSNLLEITLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKFPDELAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSFVKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGV"
        },
        {
          "molecule_type": "ligand",
          "chain_ids": ["C"],
          "smiles": "O=C1NC(=S)N(c2cccc(Br)c2)C(=O)/C1=C/c1ccco1"
        },
        {
          "molecule_type": "ligand",
          "chain_ids": ["D"],
          "smiles": "O=C1NC(=S)N(c2cccc(Br)c2)C(=O)/C1=C/c1ccco1"
        }
      ]
    }
  }
}

A custom runner YAML was provided but the low-mem example YAML yields tho same result

experiment_settings:
  cache_path: /home/jandom/.openfold3

model_update:
  presets:
    - predict
    - low_mem
    - pae_enabled
    
  custom:
    settings:
      memory:
        eval:
          use_deepspeed_evo_attention: false
          use_cueq_triangle_kernels: false

pl_trainer_args:
  accelerator: gpu
  devices: 1

Expected behavior

Atoms in the plane of the C=C bond should be planar.

Actual behavior

Image

diaph1_smifh2.pse.zip

Stack trace
N/A

Configuration (please complete the following information):

  • OS: [e.g. Ubuntu 20.04], Ubuntu
  • GPU used and memory available: DGX
  • Python version (if relevant):
  • Installation path (e.g. conda / pip / build from repo) pixi openfold3:openfold3-cuda13-pypi

Additional context

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingmodelRelated to the model definition

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions