-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Depth of auto-generated MSAs #39
Comments
Yeah that d be due to the server. Maybe try this (https://zhanggroup.org/DeepMSA/) and see if you're getting a much deeper MSA? Maybe your sequence just doesn't have many homologues? |
Thanks for the answer! Yes when using the DeepMSA I get a much deeper MSA. However, it is also deeper when using colabfold which should also use a MMSeq2 server if I'm not mistaken? |
Would you mind sharing your input config? I can take a look |
Is the query sequence a hetero-multimeric protein? If so, I had the same issue. Line 178 in e43f910
This might the reason why you have a shallow MSA... |
So should it do the multiples then? Does that match the benchmarking?
…On Thu, Nov 21, 2024 at 4:56 PM Lim Heo ***@***.***> wrote:
Is the query sequence a hetero-multimeric protein? If so, I had the same
issue.
In ColabFold, it queries MMseqs2 API twice: one for each chain and the
other for the "pair" mode.
https://github.com/sokrypton/ColabFold/blob/e2ca9e8f992cd65c986de5b64885d5572d8b8ad9/colabfold/batch.py#L817-L857
In contrast, the current implementation of Boltz, compute_msa, calls the
API only once for the "pair" mode.
https://github.com/jwohlwend/boltz/blob/e43f9101886ea6c290d6b1e0ada3796f0e798d88/src/boltz/main.py#L178
This might the reason why you have a shallow MSA...
—
Reply to this email directly, view it on GitHub
<#39 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAZDHRAN6EUFIP5T432MGG32BZJJVAVCNFSM6AAAAABSHCAH3GVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIOJSGQYTQOJRGI>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Looking into it, will report back |
@heol1 yes exactly it's a hetero multimer. When I run the chains individually I get much deeper MSAs |
@jwohlwend input fasta for reference:
|
I'm having similar issues. I tried this:
with the command:
and both MSAs for the individual, as well as the pair, are single sequences only. Using ColabFold, I get much deeper MSAs (and much better predictions). |
Just adding another voice to this - I'm also finding the auto-generated MSA to be very shallow (single sequence) using two proteins and the --use_msa_server flag. |
The MSAs that got generated for my predictions only contain a couple of sequences.
Is this due to limitations of the MMSeq2 sever or can this be adjusted?
The text was updated successfully, but these errors were encountered: