-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Heatmap: location for distance matrix, path for output figure #6
Comments
Could you please write down all of the commands that you have run, from the start, as well as where your data is stored? I think that if you've gotten results inside your Conda directory you've done something odd. |
Hi! Yes indeed! I overwrote the distance matrix script somehow. So I downloaded the source code and copied the distance matrix script in the miniconda folder again, and then it worked! Thus now I could generate the the distance matrix and it is in one of my folders. I used this command for it: (base) aimre@sisko:~/VarClust-0.2.3/bin$ varclust_distance_matrix /home5/aimre/Varclust_profiles/ /home5/aimre/Varclust_profiles/output_distance_matrix However for the heatmap I am getting a similar error message. I am sorry, probably it is something very basic that I am missing. So I really appreciate your help! (base) aimre@sisko:~/Varclust_profiles$ varclust_heatmap /home5/aimre/Varclust_profiles/output_distance_matrix /home5/aimre/Varclust_profiles/output_heatmap_figure |
I also add the information you asked previously: Command for creating profiles: I think the rest of the steps I have done is in my previous comment. Let me know if you need more information! |
Okay, strange. Do you have a way of sharing the vcf file you have, or at least a portion of it, so that I can test it on my end? |
Yes! I attached a few of my vcf files. They are all single sample vcf files. I appreciate your help! The file names contain '.sam' because for some reason the GATK pipeline thought that it was part of the sample name and left it like this in the heading of the vcf files. W1_1_S1_L003_.sam.vcf.gz |
Update: In the meantime I fixed the issue. I found the solution here: https://stackoverflow.com/questions/76812405/typeerror-stringmethods-rsplit-takes-from-1-to-2-positional-arguments-but-3-w So I just corrected str.split(': ', 1) to str.split(': ', n=1) and then it worked. However, I noticed that the generated figure is not informative, and it seems according to VarClust there is no detectable difference between my samples. Also, in the distance matrix all the comparisons received the same value (0.8333), which to me does not seem realistic. So, I opened my profile files, and it turned out they were empty, there is only a headline. I will try to figure out the reason of this, but if you have any idea, I would genuinely appreciate it! |
Sorry for the late response, I've been on vacation and away from the computer. Okay, it sounds like your VCFs are malformed somehow. The value 0.8333 is the expected value to get when you're comparing empty profiles with the similarity score and default parameters: Could you try passing the I can also see that your chromosomes are named with Roman numerals instead of normal numbers. I'm not sure this is supported by the underlying PyVCF package. |
Hi! Thank you for your answer! I added snpEff annotations but, when I wanted to build the profiles, I received error messages. Without snpEff annotations the command makes empty profiles. Can you send over one of your vcf.gz files that works for you for comparison? I tried the --method position_only argument like you see below, but it did not work either. Can you elaborate on the usage of this argument? varclust_create_profiles /home5/aimre/VarClust_data/ /home5/aimre/VarClust_data/snpEff_annotated/varclust_probe/probe_profiles/ --method position_only |
Could you try this file: https://github.com/fasterius/seqCAT/blob/master/inst/extdata/sample1.vcf.gz? The |
Hi! I tried to use the heatmap function, but it did not work. I am not sure where the distance matrix is supposed to be, but it was in my miniconda3 directory, which I think is weird. Could you also tell me what the path for the output figure should be? It seems it is not just a simple jpg file.
The text was updated successfully, but these errors were encountered: