sto to json #112

Closed
wenhuihuang19791109 opened this issue Nov 24, 2024 · 2 comments
Labels
question Further information is requested

Comments

@wenhuihuang19791109

I have some precomputed receptor files in sto (Stockholm) format. How can I convert them into a JSON input file?

@Augustin-Zidek Augustin-Zidek added the question Further information is requested label Nov 25, 2024
@Augustin-Zidek
Collaborator

Augustin-Zidek commented Nov 25, 2024

You will need to do the following steps (which could be automated using e.g. Python):

  1. Convert all 4 files from Stockholm to a3m. You could use e.g. esl-reformat from the HMMER Suite to do that, or you could use our Stockholm to a3m function from https://github.com/google-deepmind/alphafold3/blob/main/src/alphafold3/data/parsers.py#L104.
  2. Crop, merge, and deduplicate all 4 a3m files into a single a3m, in the following order and with the following crop sizes: UniRef90 (10,000), Small BFD (5,000), MGnify (5,000). Cropping and deduplication are optional. We don't use PDB hits as MSA, so feel free to leave that one out. You can use the tooling from https://github.com/google-deepmind/alphafold3/blob/main/src/alphafold3/data/pipeline.py#L85 to achieve all of this. Alternatively, just concatenate the 4 a3m files if you don't care about cropping or duplicate sequences in the MSA.
  3. Take the resulting a3m and put it in the unpairedMsa field in the input JSON. Set pairedMsa to an empty string and templates to an empty list.
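
The steps above could be automated roughly as follows. This is a minimal, standard-library-only sketch and not the AlphaFold 3 implementation: the function names (`stockholm_to_a3m`, `parse_a3m`, `merge_a3ms`, `build_input_json`) are hypothetical, the first sequence of each file is assumed to be the query, and deduplication here compares sequences with lowercase insertions ignored. The surrounding JSON layout (`name`, `modelSeeds`, `sequences`, `dialect`, `version`, and the `pairedMsa` field) is an assumption about the input format and should be checked against the input documentation.

```python
import json
import string

_DROP_LOWERCASE = str.maketrans("", "", string.ascii_lowercase)


def stockholm_to_a3m(sto_text):
    """Converts Stockholm text to a3m, treating the first sequence as the query."""
    rows = {}  # name -> aligned row; blocks of an interleaved file are joined
    for line in sto_text.splitlines():
        parts = line.split()
        if len(parts) != 2 or parts[0].startswith(("#", "//")):
            continue  # skip blank lines, '#=G*' markup and the '//' terminator
        # Normalise '.' gaps to '-' and upper-case residues (a choice of this sketch).
        rows[parts[0]] = rows.get(parts[0], "") + parts[1].replace(".", "-").upper()
    if not rows:
        return ""
    query = next(iter(rows.values()))
    out = []
    for name, row in rows.items():
        # Columns where the query has a gap become lowercase insertions;
        # positions where both the query and the row have a gap are dropped.
        seq = "".join(
            res if q != "-" else res.lower()
            for q, res in zip(query, row)
            if q != "-" or res != "-"
        )
        out.append(f">{name}\n{seq}")
    return "\n".join(out) + "\n"


def parse_a3m(a3m_text):
    """Returns a list of (description, sequence) pairs from a3m text."""
    entries = []
    for line in a3m_text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            entries.append([line[1:], ""])
        elif line and entries:
            entries[-1][1] += line
    return [(desc, seq) for desc, seq in entries]


def merge_a3ms(a3m_texts, crop_sizes, deduplicate=True):
    """Crops each a3m to its size limit, then concatenates them in order."""
    seen = set()
    merged = []
    for text, crop in zip(a3m_texts, crop_sizes):
        for desc, seq in parse_a3m(text)[:crop]:
            key = seq.translate(_DROP_LOWERCASE)  # ignore insertions when comparing
            if deduplicate and key in seen:
                continue
            seen.add(key)
            merged.append((desc, seq))
    return merged


def build_input_json(name, chain_id, merged):
    """Builds an input JSON string for one protein chain (assumed layout)."""
    unpaired = "".join(f">{desc}\n{seq}\n" for desc, seq in merged)
    return json.dumps({
        "name": name,
        "modelSeeds": [1],  # arbitrary seed chosen for illustration
        "sequences": [{"protein": {
            "id": chain_id,
            "sequence": merged[0][1],  # the query is the first MSA entry
            "unpairedMsa": unpaired,
            "pairedMsa": "",
            "templates": [],
        }}],
        "dialect": "alphafold3",
        "version": 1,
    }, indent=2)
```

To use it, read the UniRef90, Small BFD and MGnify Stockholm files as text, pass each through `stockholm_to_a3m`, call `merge_a3ms` with the three results and crop sizes `[10000, 5000, 5000]`, and write the string returned by `build_input_json` to a `.json` file. Because every file starts with the same query, deduplication also drops the repeated query rows.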

Hope this helps!

@wenhuihuang19791109
Author

thanks a lot, that's very helpful!
