Amino acid dipepetide frequency for Glaciibacter sp. YIM 131861

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.986AlaAla: 19.986 ± 0.222
0.632AlaCys: 0.632 ± 0.027
8.582AlaAsp: 8.582 ± 0.096
7.45AlaGlu: 7.45 ± 0.103
3.973AlaPhe: 3.973 ± 0.065
11.985AlaGly: 11.985 ± 0.12
2.358AlaHis: 2.358 ± 0.048
6.555AlaIle: 6.555 ± 0.093
2.749AlaLys: 2.749 ± 0.064
13.641AlaLeu: 13.641 ± 0.145
2.627AlaMet: 2.627 ± 0.056
2.449AlaAsn: 2.449 ± 0.054
6.159AlaPro: 6.159 ± 0.102
3.563AlaGln: 3.563 ± 0.058
8.916AlaArg: 8.916 ± 0.126
8.561AlaSer: 8.561 ± 0.111
7.464AlaThr: 7.464 ± 0.106
11.823AlaVal: 11.823 ± 0.122
1.818AlaTrp: 1.818 ± 0.048
2.413AlaTyr: 2.413 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.031
0.037CysCys: 0.037 ± 0.006
0.326CysAsp: 0.326 ± 0.017
0.2CysGlu: 0.2 ± 0.014
0.166CysPhe: 0.166 ± 0.011
0.521CysGly: 0.521 ± 0.022
0.112CysHis: 0.112 ± 0.012
0.211CysIle: 0.211 ± 0.015
0.059CysLys: 0.059 ± 0.007
0.423CysLeu: 0.423 ± 0.022
0.073CysMet: 0.073 ± 0.009
0.095CysAsn: 0.095 ± 0.011
0.274CysPro: 0.274 ± 0.018
0.1CysGln: 0.1 ± 0.011
0.295CysArg: 0.295 ± 0.015
0.358CysSer: 0.358 ± 0.019
0.329CysThr: 0.329 ± 0.021
0.434CysVal: 0.434 ± 0.021
0.081CysTrp: 0.081 ± 0.008
0.1CysTyr: 0.1 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
9.658AspAla: 9.658 ± 0.108
0.215AspCys: 0.215 ± 0.017
4.563AspAsp: 4.563 ± 0.095
4.29AspGlu: 4.29 ± 0.074
1.751AspPhe: 1.751 ± 0.043
6.391AspGly: 6.391 ± 0.094
1.253AspHis: 1.253 ± 0.035
2.534AspIle: 2.534 ± 0.051
1.099AspLys: 1.099 ± 0.04
6.168AspLeu: 6.168 ± 0.082
0.784AspMet: 0.784 ± 0.026
1.019AspAsn: 1.019 ± 0.037
3.92AspPro: 3.92 ± 0.069
1.61AspGln: 1.61 ± 0.044
4.646AspArg: 4.646 ± 0.077
2.939AspSer: 2.939 ± 0.057
2.956AspThr: 2.956 ± 0.057
5.742AspVal: 5.742 ± 0.079
1.038AspTrp: 1.038 ± 0.029
1.323AspTyr: 1.323 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
6.304GluAla: 6.304 ± 0.085
0.21GluCys: 0.21 ± 0.016
2.446GluAsp: 2.446 ± 0.049
2.576GluGlu: 2.576 ± 0.065
1.655GluPhe: 1.655 ± 0.043
3.473GluGly: 3.473 ± 0.069
1.394GluHis: 1.394 ± 0.044
2.498GluIle: 2.498 ± 0.064
1.419GluLys: 1.419 ± 0.044
6.38GluLeu: 6.38 ± 0.105
0.907GluMet: 0.907 ± 0.032
1.218GluAsn: 1.218 ± 0.038
2.814GluPro: 2.814 ± 0.06
2.073GluGln: 2.073 ± 0.052
5.16GluArg: 5.16 ± 0.094
2.969GluSer: 2.969 ± 0.055
2.924GluThr: 2.924 ± 0.06
4.244GluVal: 4.244 ± 0.064
0.827GluTrp: 0.827 ± 0.032
1.054GluTyr: 1.054 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
4.248PheAla: 4.248 ± 0.068
0.171PheCys: 0.171 ± 0.015
2.452PheAsp: 2.452 ± 0.051
1.697PheGlu: 1.697 ± 0.039
1.102PhePhe: 1.102 ± 0.038
3.549PheGly: 3.549 ± 0.063
0.569PheHis: 0.569 ± 0.022
1.401PheIle: 1.401 ± 0.042
0.44PheLys: 0.44 ± 0.022
2.748PheLeu: 2.748 ± 0.063
0.441PheMet: 0.441 ± 0.024
0.683PheAsn: 0.683 ± 0.028
1.46PhePro: 1.46 ± 0.042
0.726PheGln: 0.726 ± 0.029
1.835PheArg: 1.835 ± 0.044
2.002PheSer: 2.002 ± 0.043
2.23PheThr: 2.23 ± 0.049
2.758PheVal: 2.758 ± 0.064
0.536PheTrp: 0.536 ± 0.023
0.7PheTyr: 0.7 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
10.689GlyAla: 10.689 ± 0.127
0.578GlyCys: 0.578 ± 0.026
5.064GlyAsp: 5.064 ± 0.072
4.639GlyGlu: 4.639 ± 0.073
3.373GlyPhe: 3.373 ± 0.062
7.892GlyGly: 7.892 ± 0.135
1.814GlyHis: 1.814 ± 0.045
4.868GlyIle: 4.868 ± 0.067
2.102GlyLys: 2.102 ± 0.058
8.646GlyLeu: 8.646 ± 0.096
1.947GlyMet: 1.947 ± 0.042
1.774GlyAsn: 1.774 ± 0.051
3.667GlyPro: 3.667 ± 0.068
2.43GlyGln: 2.43 ± 0.06
6.439GlyArg: 6.439 ± 0.082
5.988GlySer: 5.988 ± 0.092
5.378GlyThr: 5.378 ± 0.1
8.061GlyVal: 8.061 ± 0.102
1.703GlyTrp: 1.703 ± 0.046
2.272GlyTyr: 2.272 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
2.321HisAla: 2.321 ± 0.046
0.11HisCys: 0.11 ± 0.012
1.371HisAsp: 1.371 ± 0.036
1.057HisGlu: 1.057 ± 0.031
0.583HisPhe: 0.583 ± 0.025
1.846HisGly: 1.846 ± 0.043
0.533HisHis: 0.533 ± 0.026
0.77HisIle: 0.77 ± 0.027
0.266HisLys: 0.266 ± 0.016
1.996HisLeu: 1.996 ± 0.046
0.302HisMet: 0.302 ± 0.02
0.362HisAsn: 0.362 ± 0.017
1.567HisPro: 1.567 ± 0.045
0.496HisGln: 0.496 ± 0.02
1.547HisArg: 1.547 ± 0.039
1.024HisSer: 1.024 ± 0.035
0.915HisThr: 0.915 ± 0.029
1.598HisVal: 1.598 ± 0.039
0.273HisTrp: 0.273 ± 0.017
0.439HisTyr: 0.439 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
7.376IleAla: 7.376 ± 0.105
0.249IleCys: 0.249 ± 0.016
3.772IleAsp: 3.772 ± 0.067
2.734IleGlu: 2.734 ± 0.059
1.149IlePhe: 1.149 ± 0.037
4.99IleGly: 4.99 ± 0.072
0.741IleHis: 0.741 ± 0.028
2.148IleIle: 2.148 ± 0.06
0.828IleLys: 0.828 ± 0.034
3.761IleLeu: 3.761 ± 0.074
0.652IleMet: 0.652 ± 0.025
0.935IleAsn: 0.935 ± 0.035
2.501IlePro: 2.501 ± 0.048
0.95IleGln: 0.95 ± 0.033
3.041IleArg: 3.041 ± 0.055
2.484IleSer: 2.484 ± 0.051
3.056IleThr: 3.056 ± 0.066
4.918IleVal: 4.918 ± 0.08
0.534IleTrp: 0.534 ± 0.024
0.79IleTyr: 0.79 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
2.659LysAla: 2.659 ± 0.068
0.062LysCys: 0.062 ± 0.008
1.192LysAsp: 1.192 ± 0.045
0.9LysGlu: 0.9 ± 0.035
0.46LysPhe: 0.46 ± 0.022
1.558LysGly: 1.558 ± 0.052
0.433LysHis: 0.433 ± 0.025
0.864LysIle: 0.864 ± 0.035
0.782LysLys: 0.782 ± 0.033
1.942LysLeu: 1.942 ± 0.053
0.355LysMet: 0.355 ± 0.02
0.587LysAsn: 0.587 ± 0.026
1.269LysPro: 1.269 ± 0.042
0.744LysGln: 0.744 ± 0.029
1.514LysArg: 1.514 ± 0.042
1.135LysSer: 1.135 ± 0.039
1.425LysThr: 1.425 ± 0.044
1.666LysVal: 1.666 ± 0.045
0.262LysTrp: 0.262 ± 0.019
0.468LysTyr: 0.468 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
14.265LeuAla: 14.265 ± 0.139
0.495LeuCys: 0.495 ± 0.021
6.741LeuAsp: 6.741 ± 0.089
4.915LeuGlu: 4.915 ± 0.083
2.997LeuPhe: 2.997 ± 0.063
9.123LeuGly: 9.123 ± 0.098
1.859LeuHis: 1.859 ± 0.049
4.643LeuIle: 4.643 ± 0.086
1.783LeuLys: 1.783 ± 0.051
9.867LeuLeu: 9.867 ± 0.133
1.585LeuMet: 1.585 ± 0.047
1.742LeuAsn: 1.742 ± 0.049
5.355LeuPro: 5.355 ± 0.069
2.444LeuGln: 2.444 ± 0.049
7.181LeuArg: 7.181 ± 0.086
6.054LeuSer: 6.054 ± 0.076
6.09LeuThr: 6.09 ± 0.082
9.38LeuVal: 9.38 ± 0.112
1.241LeuTrp: 1.241 ± 0.036
1.659LeuTyr: 1.659 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.052
0.078MetCys: 0.078 ± 0.009
0.818MetAsp: 0.818 ± 0.026
0.641MetGlu: 0.641 ± 0.029
0.493MetPhe: 0.493 ± 0.026
1.267MetGly: 1.267 ± 0.04
0.36MetHis: 0.36 ± 0.02
0.819MetIle: 0.819 ± 0.031
0.45MetLys: 0.45 ± 0.024
1.827MetLeu: 1.827 ± 0.045
0.311MetMet: 0.311 ± 0.02
0.457MetAsn: 0.457 ± 0.024
1.204MetPro: 1.204 ± 0.036
0.559MetGln: 0.559 ± 0.023
1.402MetArg: 1.402 ± 0.042
1.47MetSer: 1.47 ± 0.042
1.614MetThr: 1.614 ± 0.042
1.382MetVal: 1.382 ± 0.039
0.186MetTrp: 0.186 ± 0.015
0.267MetTyr: 0.267 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.669AsnAla: 2.669 ± 0.057
0.131AsnCys: 0.131 ± 0.01
1.175AsnAsp: 1.175 ± 0.039
0.974AsnGlu: 0.974 ± 0.029
0.661AsnPhe: 0.661 ± 0.027
2.119AsnGly: 2.119 ± 0.066
0.374AsnHis: 0.374 ± 0.02
0.88AsnIle: 0.88 ± 0.03
0.416AsnLys: 0.416 ± 0.024
1.845AsnLeu: 1.845 ± 0.046
0.315AsnMet: 0.315 ± 0.021
0.52AsnAsn: 0.52 ± 0.025
1.529AsnPro: 1.529 ± 0.04
0.582AsnGln: 0.582 ± 0.027
1.307AsnArg: 1.307 ± 0.034
1.065AsnSer: 1.065 ± 0.036
1.218AsnThr: 1.218 ± 0.038
1.718AsnVal: 1.718 ± 0.047
0.309AsnTrp: 0.309 ± 0.019
0.49AsnTyr: 0.49 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
6.948ProAla: 6.948 ± 0.102
0.179ProCys: 0.179 ± 0.016
4.0ProAsp: 4.0 ± 0.066
3.346ProGlu: 3.346 ± 0.058
1.729ProPhe: 1.729 ± 0.042
4.798ProGly: 4.798 ± 0.072
1.047ProHis: 1.047 ± 0.031
2.241ProIle: 2.241 ± 0.045
1.03ProLys: 1.03 ± 0.038
4.847ProLeu: 4.847 ± 0.061
0.871ProMet: 0.871 ± 0.032
1.055ProAsn: 1.055 ± 0.036
2.278ProPro: 2.278 ± 0.064
1.434ProGln: 1.434 ± 0.043
3.227ProArg: 3.227 ± 0.066
3.515ProSer: 3.515 ± 0.068
3.566ProThr: 3.566 ± 0.07
4.836ProVal: 4.836 ± 0.068
0.876ProTrp: 0.876 ± 0.033
1.044ProTyr: 1.044 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
3.438GlnAla: 3.438 ± 0.061
0.104GlnCys: 0.104 ± 0.01
1.159GlnAsp: 1.159 ± 0.037
1.081GlnGlu: 1.081 ± 0.036
0.876GlnPhe: 0.876 ± 0.029
1.885GlnGly: 1.885 ± 0.048
0.564GlnHis: 0.564 ± 0.024
1.282GlnIle: 1.282 ± 0.034
0.711GlnLys: 0.711 ± 0.027
3.181GlnLeu: 3.181 ± 0.067
0.451GlnMet: 0.451 ± 0.023
0.64GlnAsn: 0.64 ± 0.027
1.537GlnPro: 1.537 ± 0.042
1.083GlnGln: 1.083 ± 0.036
2.399GlnArg: 2.399 ± 0.056
1.505GlnSer: 1.505 ± 0.038
1.552GlnThr: 1.552 ± 0.041
2.405GlnVal: 2.405 ± 0.05
0.461GlnTrp: 0.461 ± 0.021
0.635GlnTyr: 0.635 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
8.787ArgAla: 8.787 ± 0.124
0.301ArgCys: 0.301 ± 0.016
4.267ArgAsp: 4.267 ± 0.077
4.14ArgGlu: 4.14 ± 0.082
2.489ArgPhe: 2.489 ± 0.048
5.324ArgGly: 5.324 ± 0.075
1.55ArgHis: 1.55 ± 0.039
3.632ArgIle: 3.632 ± 0.062
1.242ArgLys: 1.242 ± 0.04
7.286ArgLeu: 7.286 ± 0.114
1.854ArgMet: 1.854 ± 0.04
1.375ArgAsn: 1.375 ± 0.031
3.71ArgPro: 3.71 ± 0.076
1.987ArgGln: 1.987 ± 0.052
6.561ArgArg: 6.561 ± 0.111
4.685ArgSer: 4.685 ± 0.077
4.105ArgThr: 4.105 ± 0.063
6.024ArgVal: 6.024 ± 0.091
1.16ArgTrp: 1.16 ± 0.036
1.636ArgTyr: 1.636 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
7.996SerAla: 7.996 ± 0.101
0.281SerCys: 0.281 ± 0.018
3.572SerAsp: 3.572 ± 0.068
2.779SerGlu: 2.779 ± 0.064
2.017SerPhe: 2.017 ± 0.046
6.353SerGly: 6.353 ± 0.088
1.062SerHis: 1.062 ± 0.034
2.943SerIle: 2.943 ± 0.055
1.238SerLys: 1.238 ± 0.044
5.528SerLeu: 5.528 ± 0.076
1.242SerMet: 1.242 ± 0.037
1.25SerAsn: 1.25 ± 0.039
3.297SerPro: 3.297 ± 0.055
1.452SerGln: 1.452 ± 0.037
4.145SerArg: 4.145 ± 0.075
4.123SerSer: 4.123 ± 0.087
4.134SerThr: 4.134 ± 0.079
5.575SerVal: 5.575 ± 0.082
0.967SerTrp: 0.967 ± 0.034
1.277SerTyr: 1.277 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
7.768ThrAla: 7.768 ± 0.102
0.305ThrCys: 0.305 ± 0.02
4.2ThrAsp: 4.2 ± 0.074
2.84ThrGlu: 2.84 ± 0.054
1.974ThrPhe: 1.974 ± 0.054
5.691ThrGly: 5.691 ± 0.088
1.046ThrHis: 1.046 ± 0.028
3.008ThrIle: 3.008 ± 0.062
1.224ThrLys: 1.224 ± 0.041
5.952ThrLeu: 5.952 ± 0.065
0.98ThrMet: 0.98 ± 0.032
1.286ThrAsn: 1.286 ± 0.047
3.869ThrPro: 3.869 ± 0.08
1.401ThrGln: 1.401 ± 0.037
3.801ThrArg: 3.801 ± 0.068
3.714ThrSer: 3.714 ± 0.066
3.956ThrThr: 3.956 ± 0.085
5.766ThrVal: 5.766 ± 0.097
0.889ThrTrp: 0.889 ± 0.032
1.126ThrTyr: 1.126 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
11.781ValAla: 11.781 ± 0.125
0.5ValCys: 0.5 ± 0.025
6.128ValAsp: 6.128 ± 0.085
4.651ValGlu: 4.651 ± 0.068
3.0ValPhe: 3.0 ± 0.062
7.493ValGly: 7.493 ± 0.092
1.681ValHis: 1.681 ± 0.046
4.795ValIle: 4.795 ± 0.076
1.716ValLys: 1.716 ± 0.054
9.394ValLeu: 9.394 ± 0.115
1.541ValMet: 1.541 ± 0.039
1.869ValAsn: 1.869 ± 0.049
4.641ValPro: 4.641 ± 0.082
2.174ValGln: 2.174 ± 0.054
5.843ValArg: 5.843 ± 0.081
5.384ValSer: 5.384 ± 0.081
5.783ValThr: 5.783 ± 0.098
9.003ValVal: 9.003 ± 0.123
1.166ValTrp: 1.166 ± 0.039
1.545ValTyr: 1.545 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
1.581TrpAla: 1.581 ± 0.041
0.081TrpCys: 0.081 ± 0.01
0.733TrpAsp: 0.733 ± 0.03
0.599TrpGlu: 0.599 ± 0.025
0.592TrpPhe: 0.592 ± 0.025
1.121TrpGly: 1.121 ± 0.033
0.303TrpHis: 0.303 ± 0.018
0.775TrpIle: 0.775 ± 0.032
0.349TrpLys: 0.349 ± 0.023
1.797TrpLeu: 1.797 ± 0.047
0.382TrpMet: 0.382 ± 0.02
0.484TrpAsn: 0.484 ± 0.024
0.765TrpPro: 0.765 ± 0.028
0.55TrpGln: 0.55 ± 0.027
1.214TrpArg: 1.214 ± 0.037
0.979TrpSer: 0.979 ± 0.029
0.912TrpThr: 0.912 ± 0.034
1.151TrpVal: 1.151 ± 0.037
0.371TrpTrp: 0.371 ± 0.023
0.318TrpTyr: 0.318 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.353TyrAla: 2.353 ± 0.045
0.114TyrCys: 0.114 ± 0.01
1.338TyrAsp: 1.338 ± 0.038
1.037TyrGlu: 1.037 ± 0.032
0.711TyrPhe: 0.711 ± 0.029
1.944TyrGly: 1.944 ± 0.047
0.321TyrHis: 0.321 ± 0.02
0.738TyrIle: 0.738 ± 0.028
0.377TyrLys: 0.377 ± 0.02
2.187TyrLeu: 2.187 ± 0.056
0.224TyrMet: 0.224 ± 0.014
0.499TyrAsn: 0.499 ± 0.027
1.051TyrPro: 1.051 ± 0.028
0.593TyrGln: 0.593 ± 0.024
1.642TyrArg: 1.642 ± 0.042
1.276TyrSer: 1.276 ± 0.04
1.19TyrThr: 1.19 ± 0.038
1.603TyrVal: 1.603 ± 0.04
0.357TyrTrp: 0.357 ± 0.02
0.471TyrTyr: 0.471 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3119 proteins (990533 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski