Guideline for locally generating MSAs #48

Open
sangyeon-hits opened this issue Nov 22, 2024 · 1 comment

sangyeon-hits commented Nov 22, 2024

Hello, I'd like to reproduce the model's performance while generating MSAs locally.

Following the white paper, I've set up colabfold_search and mmseqs2 as the search tools, with uniref30_2302 and colabfold_envdb_202108 as the databases.

But I don't know the exact commands and workflow for generating .a3m files the same way the authors did.
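
For reference, this is roughly the invocation I would try by default. It is only a sketch: it assumes the standard `colabfold_search` options (`--mmseqs`, `--db1`, `--db3`, `--use-env`, `--threads`) and the database names that the ColabFold setup script produces (`uniref30_2302_db`, `colabfold_envdb_202108_db`). The paths, the thread count, and the helper name `build_colabfold_search_cmd` are placeholders of mine, and I don't know whether any of this matches the authors' settings.

```python
from pathlib import Path


def build_colabfold_search_cmd(query_fasta, db_dir, out_dir,
                               mmseqs_bin="mmseqs",
                               uniref_db="uniref30_2302_db",          # assumed DB name
                               env_db="colabfold_envdb_202108_db",    # assumed DB name
                               threads=8):
    """Return the argv list for a colabfold_search run (nothing is executed)."""
    return [
        "colabfold_search",
        "--mmseqs", str(mmseqs_bin),   # path to the mmseqs2 binary
        "--db1", uniref_db,            # UniRef30 profile database
        "--db3", env_db,               # environmental database
        "--use-env", "1",              # also search the environmental database
        "--threads", str(threads),
        str(Path(query_fasta)),        # input FASTA
        str(Path(db_dir)),             # directory holding the databases
        str(Path(out_dir)),            # output directory for the .a3m files
    ]


# Hypothetical paths, for illustration only.
cmd = build_colabfold_search_cmd("query.fasta", "dbs", "msas")
print(" ".join(cmd))
# To actually run the search: subprocess.run(cmd, check=True)
```

If the authors used different sensitivity, filtering, or database settings, those are the values I'd like to know.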

In particular, as I understand it, the MSAs of different chains need to be paired using taxonomy. I'm not sure the current code does this, though, because I would expect a file containing the taxonomy annotations, which I don't see:

"We then assign taxonomy labels to all UniRef sequences using the taxonomy annotation provided by UniProt."
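
To make the question concrete, here is a minimal sketch of the kind of pairing I have in mind. It is a generic heuristic, not necessarily what the authors do: hits are grouped by taxonomy ID within each chain, and the i-th best hit of each chain is paired within every taxon that all chains share. The function name `pair_by_taxonomy` and the `tax_of` lookup table are hypothetical; the lookup would have to be built from the UniProt taxonomy annotations, which is the part I can't find.

```python
from collections import defaultdict


def pair_by_taxonomy(chain_hits, tax_of):
    """Pair hits across chains that share a taxonomy ID.

    chain_hits: list (one entry per chain) of lists of (seq_id, aligned_seq),
                each assumed to be sorted best hit first.
    tax_of:     dict mapping seq_id -> taxonomy ID (hypothetical lookup table).
    Returns a list of rows; each row holds one (seq_id, aligned_seq) per chain.
    """
    per_chain = []
    for hits in chain_hits:
        by_tax = defaultdict(list)
        for seq_id, seq in hits:
            tax = tax_of.get(seq_id)
            if tax is not None:  # hits without a taxonomy label cannot be paired
                by_tax[tax].append((seq_id, seq))
        per_chain.append(by_tax)

    # Only taxa present in every chain can contribute paired rows.
    shared = set(per_chain[0])
    for by_tax in per_chain[1:]:
        shared &= set(by_tax)

    paired = []
    for tax in sorted(shared):
        # Pair the i-th best hit of each chain within the same taxon.
        depth = min(len(by_tax[tax]) for by_tax in per_chain)
        for i in range(depth):
            paired.append([by_tax[tax][i] for by_tax in per_chain])
    return paired


# Toy data: made-up sequence IDs, alignments, and taxonomy IDs.
chain_a = [("A1", "MKV-"), ("A2", "MRV-"), ("A3", "MKI-")]
chain_b = [("B1", "GS-T"), ("B2", "GA-T")]
tax_of = {"A1": 9606, "A2": 10090, "A3": 9606, "B1": 9606, "B2": 562}

rows = pair_by_taxonomy([chain_a, chain_b], tax_of)
print(rows)  # [[('A1', 'MKV-'), ('B1', 'GS-T')]]
```

Here only taxon 9606 appears in both chains, and chain B has a single hit for it, so one paired row comes out. Whether the repository expects pairing along these lines, and where the taxonomy labels should come from, is what I'm asking.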

Issue #32 asked a similar question, along with one about the confidentiality of the mmseqs server, but that part seems to have been overlooked.

