
Banana proteomes

Arnold Kuzniar edited this page Dec 5, 2016 · 1 revision

(Wild) banana proteomes:

Musa acuminata subsp. malaccensis (wild banana)
  Clade: Musa acuminata
  ID: MUSAM
  Taxon ID: 214687
  Release: Ensembl Plants 21; 6-DEC-2013
  Number of sequences: 36439
  Proteins in matrix: 34188

Musa acuminata (banana)
  Clade: Musa acuminata
  ID: MUSAC
  Taxon ID: 4641
  Release: Ensembl Plants 18; 8-APR-2013
  Number of sequences: 36439
  Proteins in matrix: 34187

Count the number of protein sequences:

cd OMA.1.0.5/DB
grep -c "^>" MUSA*.fa
MUSAC.fa:36439
MUSAM.fa:36439
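
The count above can be wrapped in a small helper that matches only header lines (lines starting with ">"), so that a ">" appearing inside a description is not counted twice. This is a minimal sketch: the function name count_fasta_records is hypothetical and not part of OMA, and it assumes plain, uncompressed FASTA files.

```shell
# Hypothetical helper (not part of OMA): print "<file>:<count>" for each
# FASTA file given, counting only lines that begin with ">".
count_fasta_records() {
    for f in "$@"; do
        # grep -c prints the number of matching lines (0 if none).
        printf '%s:%s\n' "$f" "$(grep -c '^>' "$f")"
    done
}
```

Run from OMA.1.0.5/DB, it could be called as `count_fasta_records MUSA*.fa`.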

Count the number of pairwise sequence similarities (Smith-Waterman raw scores) and PAM distances:

wc -l *.sim.graph
2200051 MUSAC-MUSAM.sim.graph
1082161 MUSAC-MUSAC.sim.graph
1082161 MUSAM-MUSAM.sim.graph

wc -l *.dist.graph
2200051 MUSAC-MUSAM.dist.graph
1082161 MUSAC-MUSAC.dist.graph
1082161 MUSAM-MUSAM.dist.graph
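
In the listings above, each .sim.graph file has the same line count as its .dist.graph counterpart. That consistency check can be scripted, as in the sketch below. The function name compare_graph_counts is hypothetical, and the sketch assumes that every *.sim.graph file has a sibling *.dist.graph file in the same directory.

```shell
# Hypothetical helper: for each <pair>.sim.graph, compare its line count
# with <pair>.dist.graph and print a tab-separated row:
# <pair>  <sim lines>  <dist lines>  OK|MISMATCH
compare_graph_counts() {
    for sim in "$@"; do
        pair="${sim%.sim.graph}"          # strip the suffix to get the pair name
        dist="${pair}.dist.graph"         # assumed sibling file
        # Arithmetic expansion strips any padding that wc may emit.
        n_sim=$(( $(wc -l < "$sim") ))
        n_dist=$(( $(wc -l < "$dist") ))
        if [ "$n_sim" -eq "$n_dist" ]; then
            status=OK
        else
            status=MISMATCH
        fi
        printf '%s\t%d\t%d\t%s\n' "$pair" "$n_sim" "$n_dist" "$status"
    done
}
```

It could be called as `compare_graph_counts *.sim.graph` in the directory that holds the graph files.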