|
|
Example for PHD outputSequence: prio_human (Swissprot)MAJOR PRION PROTEIN PRECURSOR (PRP)
OUTPUT
Publication to reference in reporting results:
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Rost, Burkhard; Sander, Chris: Prediction of protein structure
at better than 70% accuracy. J. Mol. Biol., 1993, 232, 584-599.
Rost, Burkhard; Sander, Chris: Combining evolutionary information
and neural networks to predict protein secondary structure.
Proteins, 1994, 19 (No. 1), 55-72.
Rost, Burkhard; Sander, Chris: Conservation and Prediction of
Solvent Accessibility in Protein Families. Proteins, 1994,
in press.
Some statistics:
~~~~~~~~~~~~~~~~
Percentage of amino acids:
+--------------+--------+--------+--------+--------+--------+
| AA: | G | P | S | Q | V |
| % of AA: | 17.8 | 6.7 | 5.9 | 5.9 | 5.5 |
+--------------+--------+--------+--------+--------+--------+
| AA: | Y | T | N | M | L |
| % of AA: | 5.1 | 5.1 | 4.7 | 4.7 | 4.7 |
+--------------+--------+--------+--------+--------+--------+
| AA: | R | K | H | A | W |
| % of AA: | 4.3 | 4.0 | 4.0 | 4.0 | 3.6 |
+--------------+--------+--------+--------+--------+--------+
| AA: | I | E | F | D | C |
| % of AA: | 3.6 | 3.6 | 2.8 | 2.4 | 1.6 |
+--------------+--------+--------+--------+--------+--------+
Percentage of secondary structure predicted:
+--------------+--------+--------+--------+
| SecStr: | H | E | L |
| % Predicted: | 10.7 | 18.2 | 71.1 |
+--------------+--------+--------+--------+
According to the following classes:
all-alpha: %H>45 and %E< 5; all-beta : %H<5 and %E>45
alpha-beta : %H>30 and %E>20; mixed: rest,
this means that the predicted class is: mixed class
PHD output for your protein:
~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Fri Jun 10 15:18:27 1994
For secondary structure prediction:
Jury on: 10 different architectures (version 5.94 ).
For solvent accessibility prediction:
Jury on: 5 different architectures (version 5.94 ).
Note: differently trained architectures, i.e., different versions can
result in different predictions.
About the protein:
------------------
HEADER prio_human.gcg
COMPND
SOURCE
AUTHOR
SEQLENGTH 253
NCHAIN 1 chain(s) in prio_human data set
NALIGN 48
(=number of aligned sequences in HSSP file)
Abbreviations:
--------------
secondary structure : H=helix, E=extended (sheet), blank=rest (loop)
accessibility :
3st: relative solvent accessibility (acc) in 3 states:
b = 0-9%, i = 9-36%, e = 36-100%.
10st: relative solvent acc. in 10 states
= n corresponds to a relative acc. of n*n %
AA: amino acid sequence
OBS: values for experimentally observed 3D structures
OBS sec: DSSP classification of secondary structure
OBS acc: DSSP estimat of relative solvent accessibility area
O_3 acc: observed relative accessibility in 3 states: B, I, E
Note: a blank is used for intermediate (i)
PHD: Profile network prediction HeiDelberg
PHD sec: prediction of secondary structure in three states
PHD acc: prediction of solvent accessibility in 10 states
P_3 acc: predicted relative accessibility in 3 states
Rel: Reliability index of prediction (0-9)
Rel sec: reliability of secondary structure prediction
Rel sec: reliability of accessibility prediction
detail:
prH sec: 'probability' for assigning helix
prE sec: 'probability' for assigning strand
prL:sec: 'probability' for assigning loop
Note: the 'probabilites' are scaled to the interval 0-9,
i.e. prH=5 means, that the signal at the first
output node is 0.5-0.6.
subset:
SUB: a subset of the prediction, for all residues with
a reliablity > n, with the following choices:
SUB sec: for secondary structure prediction n >= 5, which
corresponds to an expected accuracy > 82% (see
tables in header)
note: for this subset the following symbols are used:
L: is loop (for which above " " is used)
".": means that no prediction is made for this residue,
as the reliability is Rel < 5
SUB acc: for accessibility prediction n >= 4, which
corresponds to an expected correlation coefficient
> 0.69% (see tables in header)
Note: for this subset the following symbols are used:
"I": is intermediate (for which above " " is used)
".": means that no prediction is made for this residue,
as the reliability is Rel < 4
protein: prio length 253
....,....1....,....2....,....3....,....4....,....5....,....6
AA |MANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYPGQGSPGGNRYPPQGGGGWGQP|
PHD sec | EEEEEEEEEEE |
Rel sec |983211699899885157889998887788556886678877647677876567766654|
detail:
prH sec |001222000000000000000000111101222111111111221211111221111122
prE sec |002334788888886421000000000000000000000000000000000000000000
prL sec |985443100000012477888998888888767887778888767788877777777776
subset: SUB sec |LL....EEEEEEEEE.LLLLLLLLLLLLLLLLLLLLLLLLLLL.LLLLLLLLLLLLLLL.|
ACC:
3st: P_3 acc |eeebbbbbbbbbbbbbbeee eeeeeeeeb e bbe ebeb eebee eeeb b b b|
10st: PHD acc |967000000000000007665667666860565006556060566066558860505030
Rel acc |515413599998742104212224102410211002222021211021113311102200
subset: SUB acc |e.eb..bbbbbbbb...e.....e...e................................|
....,....7....,....8....,....9....,....10...,....11...,....1
AA |HGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQWNKPSKPKTNMKHMAGAAAAGA|
PHD sec | |
Rel sec |556543545687444567754445787553244453568899998789887566545676|
detail:
prH sec |221233222111222221123222111223433323220000000110011211222111
prE sec |000000000000000000000000000000000000000000000000000010000000
prL sec |777666666787666677766666788765566666678899998888887777767787
subset: SUB sec |LLLL..L.LLLL...LLLLL...LLLLLL.....L.LLLLLLLLLLLLLLLLLLL.LLLL|
ACC:
3st: P_3 acc | b b b b b b b e bbb b e bb b bbbe eeeeeeeeeee eebbbbbbbbb|
10st: PHD acc |505050505050305630005036500530500065366668766676366000000000
Rel acc |221223201010001101012301110103101122021124411141022103331222
ACC:
2...,....13...,....14...,....15...,....16...,....17...,....1
AA |VVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPNQVYYRPMDEYSNQNNFVHDCV|
PHD sec | EEEE EEEEEE|
Rel sec |567755311344445665677778778555587455899355432765678995554168|
detail:
prH sec |111122344322332111110011111111001210000000000112110000000000
prE sec |100000000010000012111000000122210122100367664101100002676578
prL sec |777766544566666776678878878766687666888522335776678997223421
subset: SUB sec |LLLLLL........LLLLLLLLLLLLLLLLLLL.LLLLL.EE...LLLLLLLLLEE..EE|
ACC:
3st: P_3 acc | bbb bb bbbeb beb bebb e eeebbeebb bbe bbb bbee eeeeebbbbbb|
10st: PHD acc |500050050006050605060057566600670055206500040066567666000000
Rel acc |113211521142021301020224222200241123321241130034233203342299
ACC:
8...,....19...,....20...,....21...,....22...,....23...,....2
AA |NITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYERESQAYYQRGSSMVLFSSPPV|
PHD sec |EEEEEEEEEEE HHHHHHHHHHHHHHHHHHHHHHHHHHH EEEEE E|
Rel sec |999988588653788762115688999999998698996699986179974688829967|
detail:
prH sec |000000000000000000136788999999998788897799987410000000000000
prE sec |999988688773110123320000000000000000000000001100016788840018
prL sec |000011311225888875432200000000000100002200001489973110059971
subset: SUB sec |EEEEEEEEEEE.LLLLL...HHHHHHHHHHHHHHHHHHHHHHHHH.LLLL.EEEE.LLLE|
ACC:
3st: P_3 acc |bbbbee bbeeeeeeee eeebbebbe bbebbbbbbbeeeeeb eeeebbbbbebbbb|
10st: PHD acc |000066300666667675766006007500700000006766605577860000080000
Rel acc |375942003323347172543014675499729930014933505286621233540006
subset: SUB acc |.bbbe........ee.e.ee...ebbeibbe.bb....ee..e.i.eee.....be...b|
4...,....25...,....26...,....27...,....28...,....29...,....3
AA |ILLISFLIFLIVG|
PHD sec |EEEE EEEE |
Rel sec |8885111124529
detail:
prH sec |0012344432100
prE sec |8886212456650
prL sec |0001443100139
subset: SUB sec |EEEE......E.L|
ACC:
3st: P_3 acc |bbbbbbbbbbb e|
10st: PHD acc |0000000000059
Rel acc |9999699896229
subset: SUB acc |bbbbbbbbbb..e|
|
Copyright © 2008 Burkhard Rost, CUBIC all rights reserved. Terms of Use | Privacy Policy | Contact Information