As shown below, amino-acids are grouped into three categories: DDDDD-oriented,
DUDUD-oriented,
and
others.
Amino-
acid |
Freq.
(times) |
D2
code
assignments (%)
DDDDD
("0") |
DUDUD
("A") |
UUDUU
("R") |
UUDUD
("Q") |
DUDUU
("B") |
Else
("G") |
|
VAL (V) |
118,030 |
54 |
28 |
4 |
5 |
2 |
7 |
ILE (I) |
92,900 |
49 |
33 |
4 |
5 |
3 |
7 |
SER (S) |
90,250 |
44 |
26 |
8 |
7 |
7 |
8 |
THR (T) |
89,111 |
49 |
23 |
8 |
5 |
6 |
8 |
PHE (F) |
62,974 |
45 |
30 |
6 |
5 |
6 |
8 |
TYR (Y) |
54,210 |
47 |
28 |
6 |
5 |
6 |
8 |
HIS (H) |
36,063 |
42 |
27 |
9 |
5 |
7 |
8 |
TRP (W) |
21,091 |
42 |
34 |
5 |
8 |
5 |
6 |
CYS (C) |
20,460 |
51 |
25 |
8 |
3 |
5 |
8 |
LEU (L) |
114,979 |
35 |
41 |
6 |
6 |
6 |
6 |
ALA (A) |
133,378 |
31 |
45 |
6 |
7 |
6 |
5 |
GLU (E) |
109,542 |
29 |
43 |
7 |
8 |
7 |
5 |
LYS (K) |
91,565 |
34 |
35 |
9 |
7 |
7 |
7 |
ARG (R) |
86,253 |
36 |
37 |
8 |
6 |
6 |
6 |
GLN (Q) |
58,778 |
32 |
40 |
8 |
5 |
8 |
7 |
MET(M) |
32,351 |
36 |
41 |
6 |
5 |
6 |
7 |
GLX (Z) |
46 |
28 |
52 |
13 |
4 |
0 |
0 |
GLY (G) |
120,167 |
43 |
14 |
30 |
4 |
3 |
5 |
ASP (D) |
92,614 |
35 |
26 |
16 |
6 |
10 |
7 |
PRO (P) |
71,132 |
55 |
8 |
10 |
23 |
1 |
2 |
ASN (N) |
65,487 |
35 |
24 |
19 |
3 |
11 |
8 |
ASX (B) |
28 |
43 |
36 |
14 |
0 |
4 |
0 |
Total |
1,591,409 |
40 |
31 |
10 |
6 |
6 |
6 |
[Note]
The
table is obtained by considering amino-acid
fragments extracted
from 2,750 proteins, one for each SCOP family (1.69 release).
[References]
- N.Morikawa, Number sequence representation of
protein structures based on the second derivative of a folded
tetrahedron sequence. (manuscript, 2006).