Conversation
|
Do you know of an option where the MSA generation with mmseqs2 (or maybe with mmseqs2-GPU) runs locally? Having enough computational resources, and in situations with a closed system, this would be very helpful. |
Hi there, A fully offline, local MSA generation workflow would be a great complement to the server-based approach I implemented here. Regarding the usage of msa, Boltz accepts custom MSAs in two ways:
The MSA exporter implemented here writes CSV files in the same format that Boltz expects by default. I have used boltz like this and it seems to run perfectly. MSA files in A3M format are also supported and should work fine, though I’ve used them less often. Hope that helps! |
Background
A few users have been asking about generating MSAs, so they can prepare CSVs on a local system using the msa-server api, and then transfer them to an HPC cluster with limited internet (see issues #409, #447). I also found it annoying to deal with, so I added a msa subcommand to try to make this more convenient. It doesn't require all model weights to be downloaded, and just performs the msa generation part, making it more light-weight:
msasubcommandboltz msa …)Open question
Right now boltz1 still downloads a full
ccd.pklwhile boltz2 usesmols.tar. If the workflows for msa generation is identical, we could simplify to always use the smaller pickle file.I’m happy to adjust anything here, just let me know and I will try to improve it! If you like the changes I can also update the readme so this new functionality is briefly mentioned.