GitHub - a-h-b/MuStMultiomics: Multiomics walktrough for Integrated Multi-omics Study (Heintz-Buschart et al. 2016)

This repository contains code used in the multiomic analyses of faecal microbiota from four families with several cases of T1DM ( MuSt ).

You can find the different scripts behind the links next to the bullet points below. The links in the sub-headings below lead to descriptions of the workflows which connect the different scripts.

to build a search data base for proteomics from predicted proteins and their variants:

rename4proteomics.pl
trypsinStartEndProdigal.pl (corrected version)
variant_annotateRepairedTabProdigal.pl (corrected version)
variants_annotateTab4StatsProdigal.pl (corrected version)
trypsinStartEnd.pl (old version)
variant_annotateRepairedTab.pl (old version)
variant_annotateRepairedTabProdigalStillWrong.pl (version to keep workflow)
variants_annotateTab4Stats.pl (old version)
variants_locateType.pl

to parse functional annotations of gene predictions (some including coverage):

to annotate phylogenetic marker genes with the taxonomy of the best hit from the mOTU database:

to parse taxonomy of MG-RAST annotations of genes:

to automatically cluster contigs based on nucleotide signature (BH-SNE maps), DNA coverage and essential genes:

to gather contig clusters by related phylogenetic marker genes in a phylogenetic tree:

to reconstruct a metabolic network from KOs and analyse it:

140630_MUST_NW.R
the above script needs file 150705_KOs_in_NW.tsv
runHeinz.sh
plotModules_omicLevels.R

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
140630_MUST_NW.R		140630_MUST_NW.R
150310_MUST_hmmBestAll.py		150310_MUST_hmmBestAll.py
150322_bestHmmReadParse.py		150322_bestHmmReadParse.py
150415_bestHmmAveCovParse.py		150415_bestHmmAveCovParse.py
150630_keggReadParse.py		150630_keggReadParse.py
150705_KOs_in_NW.tsv		150705_KOs_in_NW.tsv
150705_MUST_hmmParse.py		150705_MUST_hmmParse.py
150705_MUST_hmmParsePfam.py		150705_MUST_hmmParsePfam.py
150705_MUST_keggParseNW.py		150705_MUST_keggParseNW.py
150816_getPhyloMarkers.R		150816_getPhyloMarkers.R
150819_MUST_tree.R		150819_MUST_tree.R
150928_MUST_relatedClusterWSFromMongo.R		150928_MUST_relatedClusterWSFromMongo.R
150928_mongifyMust.py		150928_mongifyMust.py
151020_funOIMongoWS.R		151020_funOIMongoWS.R
MGRASTaccessions.tsv		MGRASTaccessions.tsv
MGRASTgeneLevelTax.R		MGRASTgeneLevelTax.R
README		README
README.md		README.md
annotate-phylogenetic-marker-genes.md		annotate-phylogenetic-marker-genes.md
autoCluster.R		autoCluster.R
automatic-clustering.md		automatic-clustering.md
calculateCoverageAndGaps2.pl		calculateCoverageAndGaps2.pl
calculating-coverage.md		calculating-coverage.md
consolidate_hmmscan_results.pl		consolidate_hmmscan_results.pl
consolidate_hmmscan_results_justKEGG.pl		consolidate_hmmscan_results_justKEGG.pl
eukaryoticGenesMongo.R		eukaryoticGenesMongo.R
fastaExtractCutRibosomal1000.pl		fastaExtractCutRibosomal1000.pl
fastaExtractWithCoordBase1.pl		fastaExtractWithCoordBase1.pl
fastaExtractWithCoordBase1_old.pl		fastaExtractWithCoordBase1_old.pl
fastaExtractrRNA.pl		fastaExtractrRNA.pl
fastaProteinExtractAddSampleCluster.pl		fastaProteinExtractAddSampleCluster.pl
figS11_DB.png		figS11_DB.png
functional-annotations.md		functional-annotations.md
getHitPhylogenyNew.R		getHitPhylogenyNew.R
getNWexprMongoAllSamples.R		getNWexprMongoAllSamples.R
getValidGenes1.R		getValidGenes1.R
getValidGenes2new.R		getValidGenes2new.R
ko2des_clean.txt		ko2des_clean.txt
makeWSvarAnnoCorrect.R		makeWSvarAnnoCorrect.R
mongo-database.md		mongo-database.md
parse_taxbrowser_MGRAST.py		parse_taxbrowser_MGRAST.py
phylogenetic-marker-genes-trees.md		phylogenetic-marker-genes-trees.md
plotModules_omicLevels.R		plotModules_omicLevels.R
proteomics-data-base.md		proteomics-data-base.md
reconstructed-KO-network.md		reconstructed-KO-network.md
rename4proteomics.pl		rename4proteomics.pl
runHeinz.sh		runHeinz.sh
taxonomic-MG-RAST-annotations.md		taxonomic-MG-RAST-annotations.md
testFastaExtract.pl		testFastaExtract.pl
trypsinStartEnd.pl		trypsinStartEnd.pl
trypsinStartEndProdigal.pl		trypsinStartEndProdigal.pl
variant_annotateRepairedTab.pl		variant_annotateRepairedTab.pl
variant_annotateRepairedTabProdigal.pl		variant_annotateRepairedTabProdigal.pl
variant_annotateRepairedTabProdigalStillWrong.pl		variant_annotateRepairedTabProdigalStillWrong.pl
variants_annotateTab4Stats.pl		variants_annotateTab4Stats.pl
variants_annotateTab4StatsProdigal.pl		variants_annotateTab4StatsProdigal.pl
variants_locateType.pl		variants_locateType.pl
virusGenesMongo.R		virusGenesMongo.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This repository contains code used in the multiomic analyses of faecal microbiota from four families with several cases of T1DM ( MuSt ).

to build a search data base for proteomics from predicted proteins and their variants:

to parse functional annotations of gene predictions (some including coverage):

to annotate phylogenetic marker genes with the taxonomy of the best hit from the mOTU database:

to parse taxonomy of MG-RAST annotations of genes:

to automatically cluster contigs based on nucleotide signature (BH-SNE maps), DNA coverage and essential genes:

to gather contig clusters by related phylogenetic marker genes in a phylogenetic tree:

to reconstruct a metabolic network from KOs and analyse it:

to feed a mongo database with all the data from MuSt and retrieve some of the data:

About

Releases

Packages

Languages

a-h-b/MuStMultiomics

Folders and files

Latest commit

History

Repository files navigation

This repository contains code used in the multiomic analyses of faecal microbiota from four families with several cases of T1DM ( MuSt ).

to build a search data base for proteomics from predicted proteins and their variants:

to parse functional annotations of gene predictions (some including coverage):

to annotate phylogenetic marker genes with the taxonomy of the best hit from the mOTU database:

to parse taxonomy of MG-RAST annotations of genes:

to automatically cluster contigs based on nucleotide signature (BH-SNE maps), DNA coverage and essential genes:

to gather contig clusters by related phylogenetic marker genes in a phylogenetic tree:

to reconstruct a metabolic network from KOs and analyse it:

to feed a mongo database with all the data from MuSt and retrieve some of the data:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages