This is the standalone version of web-server http://topcons.net. This software package is supposed to be run on Ubuntu x64 system. It might also work on other Linux boxes but have not been tested.
If you are interested in running TOPCONS2 on other systems, please contact Arne Elofsson ([email protected])
TOPCONS2 is an updated version of the widely used TOPCONS for predicting membrane protein topologies using consensus prediction. It is faster yet more accurate than the old TOPCONS according to our solid benchmarking. Moreover, it predicts not only the trans-membrane helices, but also the location of signal peptide
The software is open source and licensed under the GPL license.
Tsirigos, K.D., Peters, C., Shu, N., Kall, L., Elofsson, A., 2015. The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Res. 43, W401-W407
-
Check out the software from the github by
git clone https://github.com/ElofssonLab/TOPCONS2
-
Download the database for TOPCONS2 from http://topcons.net/static/download/topcons2_database.zip and unzip it by
unzip topcons2_database.zip
-
Change to the folder 'topcons2_webserver' and create a soft link to the downloaded database
ln -s /path/to/the/downloaded/database database
You may need to update the BLAST database for uniref90 by the script install_blastdb.sh with the following command:
./install_blastdb.sh database
-
Install dependencies if not installed
- Cmake (for installation of modhmm, e.g. sudo apt-get install cmake)
- perl-Moose (e.g. sudo apt-get install perl-Moose)
- bioperl-1.6.924 (e.g. cpan > install CJFIELDS/BioPerl-1.6.924.tar.gz )
- biopython (e.g. sudo pip install biopython )
- IPC (e.g. cpan > install IPC::Run)
- kalign (e.g. sudo apt-get install kalign)
- hmmer3.0 (note that hmmscan should be compatible with the pfam database otherwise, you may encounter format incompatible problem http://hmmer.org/download.html)
- Gnuplot (e.g. sudo apt-get install gnuplot)
- Java (make sure that the command java is in the PATH, e.g. sudo apt-get install default-jre)
- convert from ImageMagick (e.g. sudo apt-get install imagemagick)
- xsltproc (e.g. sudo apt-get install xsltproc)
- gengetopt (e.g. sudo apt-get install gengetopt)
- awk, sort, head
Note that the commands hmmscan, kalign, gnuplot, convert, sort, awk and head should be in the PATH
-
Install modhmm
change to the folder
topcons2_webserver/predictors/source/modhmm
bash fresh_install.sh /path/to/topcons2_webserver/predictors
-
Test the topcons2 workflow
change to the folder
topcons2_webserver/test
and run the following commands../run_topcons2.sh one_seq.fasta -outpath rst_one_seq ../run_topcons2.sh multiple_seqs.fasta -outpath rst_multiple_seqs
The example results can be found in the folder
rst_one_seq
andrst_multiple_seqs
for the example fasta fileone_seq.fasta
andmultiple_seqs.fasta
respectively.
-
Description of the output results If the input is
one_seq.fasta
and the outpath isrst_one_seq
The tree view of all output files under the folderrst_one_seq
isone_seq ├── query.result.txt ├── query.result.txt.fa ├── query.result.txt.unfinished.fa ├── seq_0 │ ├── DG1.txt │ ├── dg.txt │ ├── Homology │ │ ├── query.fa.total_aligns │ │ └── query.top │ ├── nicetop.html │ ├── OCTOPUS │ │ ├── NN_PRF_FILES │ │ │ └── query.prf │ │ └── query.top │ ├── philius │ │ └── query.top │ ├── PolyPhobius │ │ └── query.top │ ├── SCAMPI_MSA │ │ └── query.top │ ├── seq.fa │ ├── SPOCTOPUS │ │ ├── NN_PRF_FILES │ │ │ └── query.nnprf │ │ └── query.top │ └── Topcons │ ├── reliability.final │ ├── reliability.txt │ ├── topcons.gnu │ ├── topcons.large.png │ ├── topcons.png │ ├── topcons.top │ ├── total_image.gnu │ ├── total_image.large.png │ └── total_image.png └── time.txt
-
The file
query.result.txt
contains all predictions for the query in text format similar as the example here -
The file
query.result.txt.fa
contains the consensus prediction for all sequences in the query in FASTA format. -
The file
query.result.txt.unfinished.fa
contains the sequences that are not successfully predicted by TOPCONS2 (if there are any) in FASTA format.
-
-
Download the database for TOPCONS2 from http://topcons.net/static/download/topcons2_database.zip and unzip it by
unzip topcons2_database.zip
and saved as
/data/topcons2_database
Update the BLAST database for uniref90 by the script install_blastdb.sh with the following command:
./install_blastdb.sh /data/topcons2_database/blast
-
Pull the Docker image by
docker pull nanjiang/topcons2
or you can also build the Docker image locally by
docker build -t topcons2 .
within the cloned folder, i.e.
TOPCONS2/
-
Run Docker container by
docker run -v /data/topcons2_database:/data/topcons2_database -it nanjiang/topcons2
-
Test run
cd /app/topcons2/test ../run_topcons2.sh one_seq.fasta -outpath rst1
The result will be available at
rst1/one_seq/seq_0
If you want to run TOPCONS2 Docker from outside of the container, suppose you want to output the result to the folder
/scratch
, then start the container bydocker run -e USER_ID=$(id -u $USER) -v /data/topcons2_database:/data/topcons2_database -v /scratch:/scratch -it --name topcons2 -d nanjiang/topcons2
Now you can test run the TOPCONS2 using the example sequence provided in the package by
docker exec --user user topcons2 script /dev/null -c "/app/topcons2/run_topcons2.sh /app/topcons2/test/one_seq.fasta -outpath /scratch/rst1"
To use you own sequence, you can copy your query sequence, e.g.
yourseq.fasta
to/scratch
and then run the following command in the shell terminaldocker exec --user user topcons2 script /dev/null -c "/app/topcons2/run_topcons2.sh /scratch/yourseq.fasta -outpath /scratch/rst_yourseq"
The prediction result will be output to
/scratch/rst_yourseq
given successful run.Note also that the output files are owned by your current user, i.e.
$USER
You can also run TOPCONS2 with Singularity
-
Database preparation: do the same as step 1 for when running with Docker.
-
Pull the Docker image by
singularity pull topcons2.img docker://nanjiang/topcons2
-
Suppose you have saved the database at
/data/topcons2_database
and you have a user writable folder/scratch
, then you can run TOPCONS2 with the following commandsingularity exec -B /data:/data -B /scratch:/scratch topcons2.img /app/topcons2/run_topcons2.sh /app/topcons2/test/one_seq.fasta -outpath /scratch/rst1
The prediction result will be output to
/scratch/rst1
given successful run.
If you only need to run OCTOPUS and SPOCTOPUS within the TOPCONS2 package, you
need to install the whole package using the procedure described above for
TOPCONS2. Then, use the script pfam_workflow_octopus.py
.
Examples: change to the folder 'topcons2_webserver/test' and run the following commands
../run_octopus.sh -outpath rst2 multiple_seqs.fasta
The result of predicted topologies in Fasta format can be found in
rst1/multiple_seqs.OCTOPUS.topfa
and rst1/multiple_seqs.SPOCTOPUS.topfa
If you do not need the individual output files nor the ANN output, you can run the commands with the "-remove-individual-files" flag, that is
../run_octopus.sh -outpath rst2 -RM multiple_seqs.fasta
Run OCTOPUS and SPOCTOPUS using Docker is similar to run the whole TOPCONS2 workflow.
start the container by (assume you have the write permission to the folder
/scratch
)
docker run -e USER_ID=$(id -u $USER) -v /data/topcons2_database:/data/topcons2_database -v /scratch:/scratch -it --name topcons2 -d nanjiang/topcons2
And make your test run by
docker exec --user user topcons2 script /dev/null -c "/app/topcons2/run_octopus.sh /app/topcons2/test/one_seq.fasta -outpath /scratch/rst2"