[HOME]
As shown below, amino-acids are grouped into three categories: DDDDD-oriented, DUDUD-oriented, and others.

Amino-
acid
Freq.
(times)
D2 code assignments (%)
DDDDD
("0")
DUDUD
("A")
UUDUU
("R")
UUDUD
("Q")
DUDUU
("B")
Else
("G")

VAL (V) 118,030 54 28 4 5 2 7
ILE (I) 92,900 49 33 4 5 3 7
SER (S) 90,250 44 26 8 7 7 8
THR (T) 89,111 49 23 8 5 6 8
PHE (F) 62,974 45 30 6 5 6 8
TYR (Y) 54,210 47 28 6 5 6 8
HIS (H) 36,063 42 27 9 5 7 8
TRP (W) 21,091 42 34 5 8 5 6
CYS (C) 20,460 51 25 8 3 5 8

LEU (L) 114,979 35 41 6 6 6 6
ALA (A) 133,378 31 45 6 7 6 5
GLU (E) 109,542 29 43 7 8 7 5
LYS (K) 91,565 34 35 9 7 7 7
ARG (R) 86,253 36 37 8 6 6 6
GLN (Q) 58,778 32 40 8 5 8 7
MET(M) 32,351 36 41 6 5 6 7
GLX (Z) 46 28 52 13 4 0 0

GLY (G) 120,167 43 14 30 4 3 5
ASP (D) 92,614 35 26 16 6 10 7
PRO (P) 71,132 55 8 10 23 1 2
ASN (N) 65,487 35 24 19 3 11 8
ASX (B) 28 43 36 14 0 4 0

Total 1,591,409 40 31 10 6 6 6


[Note] The table is obtained by considering amino-acid fragments extracted from 2,750 proteins, one for each SCOP family (1.69 release).

[References]
  1. N.Morikawa, Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence. (manuscript, 2006).