-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How motifs are defined in the structures of pathogenic STRs ? #45
Comments
Hi Mélanie, In the BED file the motifs are usually defined by the motif units in the hg38 assembly strand. Is the DAB1 locus by any chance transcribed in the negative strand relative to hg38? Becuase in this case the ATTTC locus would be the GAAAT unit shown above, with AAAAT possibly flanking the expansion. Best, |
Hi Mélanie. Just to add to Guilherme's reply, STRchive (from @hdashnow and the team) is a great resource for definitions of known pathogenic repeats. For example, here is the entry for DAB1: https://strchive.org/database/DAB1.html that specifies motifs in both reference and gene orientations. Best wishes, |
Ah, but of course, that seems obvious now. Mélanie |
By chance, do you have a tool that can tell whether a STR is pathogenic or not according to the given thresholds? |
You might try STRanger That is what we and some others do. It does benefit (and add extra info) from its own set of extra fields in those repeat definition files, that you can find over there-ish if you like them. We love Egor and his tools, but I don’t know if we morally speaking should encourage TRGT, or allow Stranger for in the long run though, as they have this weird partly-closed license excluding use with other chemistries. It adds a bit of complexity to pipelines etc and a bit of a bad taste. ExpansionHunter was cleaner that way. 😔 I hope it changes soon! |
Thank you @dnil, I have the impression that the repeats bed file is the key to obtaining good results. Depending on the version of the pathogenic_repeats.hg38.TRGT.bed file used, there is not the same definition of patterns, and I think that this can lead to false negatives. And I was happy to have found an STR expansion in my index case, but the TAAAA pattern is not the one that is pathogenic in the literature, but (TGGAA)*TAAAA. Mélanie |
Thank you for the feedback - I'll move this comment as an issue on the STRanger repo instead! |
Dear team,
Is there an error in the motifs included in the STR pathogen bed file?
For example, for the DAB1 gene, you have indicated the following motifs:
However, in several publications, the pathogenic motif is ATTTC. Here, with the structure defined in the bed file, this motif is never looked at, or do I not understand how TRGT works?
Please enlighten me on this subject, as many other genes do not have the pathogenic motif in their structure.
Thank you very much in advance for your interest in my message.
The best
Mélanie :)
The text was updated successfully, but these errors were encountered: