Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pangenome_variants_to_treeWAS #91

Open
ingridvanw opened this issue Sep 16, 2024 · 3 comments
Open

pangenome_variants_to_treeWAS #91

ingridvanw opened this issue Sep 16, 2024 · 3 comments

Comments

@ingridvanw
Copy link

ingridvanw commented Sep 16, 2024

Hi guys,

A number of additional scripts are provided (in the scripts and tools directories) for converting or analysing the PIRATE outputs.
I am using the ones to convert and run treeWas (pangenome_variants_to_treeWAS.pl)

When running:
perl pangenome_variants_to_treeWAS.pl -i /results/ -o /output/ -m /metadata/phenotype_for_pirate.csv -tree /output/tree.parsimony.tre --treeWAS /scripts/

The PIRATE files are succesfully converted (accessory_alleles.treewas_input, accessory.treewas_input and core_alleles.treewas_input are created. However, the results after running treewas are empty.
It created three folders (for example accessory_alleles) with one log file, showing:

running
'/usr/lib/R/bin/R --no-echo --no-restore --file=/scripts/run_treeWAS.R --args /accessory_alleles.treeWAS_input /metadata/phenotype_for_pirate.csv /output/tree.parsimony.tre /accessory_alleles'

But the created .tab files in results are empty, like this:

SNP_locus allele_name family_name gene_name product pvalue score G1P1 G0P0 G1P0 G0P1 test variant_type

Someone any idea?
Thanks in advance!

@SionBayliss
Copy link
Owner

Hi,

That is very odd. So you are saying the input files contain your genes but that they are removed after running treewas? Do you genes names contain any odd or protected characters? Are you sure it isn't something with odd carriage return characters being incompatible. The perl scripts were developed for Linux so running dos2unix or unix2dos might resolve any issues if that is the case.

S

@ingridvanw
Copy link
Author

I don't see any odd or protected characters, no spaces.
accessory_alleles.treeWAS_input looks like:

Samples g00827_00002 g00827_00005 g00827_00006 g00827_00014 g00827_00022
SAB003 1 1 0 1 0
SAB005 1 1 0 1 0
SAB006 1 1 0 1 0
SAB007 1 1 0 1 0
SAB008 1 1 0 1 0
SAB009 1 1 0 1 0

Seems that the scripts sees no variants in accessory_alleles.treeWAS_input?

Best regards,
ingrid

@SionBayliss
Copy link
Owner

Hi Ingrid,

Have you successfully used similar, example or dummy data to treewas? Are you sure that any filters, such as for allelic frequency, are not filtering out any or all of the data that you are providing it? If the data looks suitable for input and that format clearly works with treewas, then the issue may lie with treewas and not with the conversion script.

All the best,
Sion

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants