GitHub - vineetbansal/dsprint-pipeline

dSPRINT

A machine learning framework predicting interaction sites in human protein domains

This repository provides companion code to

A. Etzion-Fuchs, D. Todd and M. Singh (2020) "dSPRINT: predicting DNA, RNA, ion, peptide and small molecule interaction sites within protein domains", Manuscript under review

This repository can be used as a computation pipeline, and uses Snakemake as the underlying engine.

Essentially, given a file input.hmm, with one or multiple domains which follow the syntax of a Pfam-A entry, the following computational graph of rules is run:

Output files are generated in the output folder, with the final result per-position ligand binding score generated in the file output/binding_scores.csv

        ligand_type     binding_score   domain  match_state
0       dna     0.9916359186172485      zf-C2H2 1
1       dna     0.9872528910636902      zf-C2H2 10
2       dna     0.997771143913269       zf-C2H2 11
3       dna     0.997983455657959       zf-C2H2 12
4       dna     0.9957016110420227      zf-C2H2 13
5       dna     0.9956439733505249      zf-C2H2 14

Read the Getting Started guide on how to run dSPRINT.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
docs		docs
dsprint		dsprint
logs		logs
output		output
pertinint-internal		pertinint-internal
run-hmmer		run-hmmer
scripts		scripts
snakefiles		snakefiles
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
Snakefile		Snakefile
config.json		config.json
config.yaml		config.yaml
environment.yml		environment.yml
input.hmm		input.hmm
python2.yaml		python2.yaml
requirements.txt		requirements.txt
run_script.py		run_script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dSPRINT

A machine learning framework predicting interaction sites in human protein domains

About

Releases

Packages

Contributors 2

Languages

vineetbansal/dsprint-pipeline

Folders and files

Latest commit

History

Repository files navigation

dSPRINT

A machine learning framework predicting interaction sites in human protein domains

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages