Amino acid dipeptide frequency for Pontimonas salivibrio

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
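
The counting described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes overlapping residue pairs counted within each protein (never across protein boundaries), one-letter residue codes instead of the three-letter codes used in the table, and the hypothetical helper names `dipeptide_permille` and `classify`.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < UNIFORM_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not real Pontimonas salivibrio sequences.
toy = ["MAAL", "MALX"]
freqs = dipeptide_permille(toy)
```

With real data, `classify(11.861)` (the AlaAla value in the table) gives `"over"` and `classify(0.518)` (AlaCys) gives `"under"`.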

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
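
As a worked example of this unit conversion, the snippet below converts the AlaAla entry from the table (11.861‰) to a fraction and a percentage, and compares it with the 2.5‰ uniform expectation. The function names are illustrative only.

```python
def permille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


ala_ala = 11.861          # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400    # 2.5 per mille if all 400 dipeptides were equally likely

fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
ratio = ala_ala / uniform  # how many times more frequent than the uniform expectation

print(f"fraction={fraction:.6f} percent={percent:.4f} ratio={ratio:.2f}")
```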

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.861 ± 0.187
AlaCys: 0.518 ± 0.032
AlaAsp: 6.123 ± 0.121
AlaGlu: 7.264 ± 0.132
AlaPhe: 3.295 ± 0.075
AlaGly: 9.837 ± 0.149
AlaHis: 2.592 ± 0.08
AlaIle: 5.727 ± 0.106
AlaLys: 3.572 ± 0.104
AlaLeu: 13.504 ± 0.18
AlaMet: 2.619 ± 0.081
AlaAsn: 2.293 ± 0.071
AlaPro: 4.925 ± 0.124
AlaGln: 4.051 ± 0.088
AlaArg: 6.564 ± 0.111
AlaSer: 6.72 ± 0.112
AlaThr: 6.86 ± 0.133
AlaVal: 9.178 ± 0.152
AlaTrp: 1.522 ± 0.062
AlaTyr: 2.038 ± 0.072
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.516 ± 0.031
CysCys: 0.045 ± 0.009
CysAsp: 0.327 ± 0.026
CysGlu: 0.368 ± 0.026
CysPhe: 0.163 ± 0.017
CysGly: 0.501 ± 0.034
CysHis: 0.134 ± 0.018
CysIle: 0.165 ± 0.017
CysLys: 0.069 ± 0.011
CysLeu: 0.41 ± 0.031
CysMet: 0.063 ± 0.012
CysAsn: 0.123 ± 0.016
CysPro: 0.299 ± 0.023
CysGln: 0.2 ± 0.019
CysArg: 0.275 ± 0.022
CysSer: 0.29 ± 0.024
CysThr: 0.277 ± 0.021
CysVal: 0.468 ± 0.035
CysTrp: 0.056 ± 0.009
CysTyr: 0.121 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.377 ± 0.12
AspCys: 0.258 ± 0.024
AspAsp: 3.873 ± 0.143
AspGlu: 4.053 ± 0.101
AspPhe: 2.024 ± 0.062
AspGly: 4.706 ± 0.121
AspHis: 1.424 ± 0.06
AspIle: 3.312 ± 0.082
AspLys: 1.357 ± 0.062
AspLeu: 5.406 ± 0.117
AspMet: 1.03 ± 0.046
AspAsn: 1.403 ± 0.054
AspPro: 3.494 ± 0.09
AspGln: 2.569 ± 0.067
AspArg: 3.555 ± 0.103
AspSer: 3.691 ± 0.097
AspThr: 3.572 ± 0.086
AspVal: 5.339 ± 0.116
AspTrp: 0.858 ± 0.041
AspTyr: 1.571 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.619 ± 0.148
GluCys: 0.34 ± 0.026
GluAsp: 3.097 ± 0.085
GluGlu: 3.828 ± 0.097
GluPhe: 1.944 ± 0.053
GluGly: 5.051 ± 0.105
GluHis: 1.675 ± 0.066
GluIle: 3.137 ± 0.083
GluLys: 2.089 ± 0.078
GluLeu: 6.685 ± 0.129
GluMet: 1.353 ± 0.054
GluAsn: 1.468 ± 0.054
GluPro: 3.11 ± 0.093
GluGln: 2.462 ± 0.079
GluArg: 4.611 ± 0.106
GluSer: 3.977 ± 0.091
GluThr: 3.065 ± 0.072
GluVal: 5.239 ± 0.1
GluTrp: 0.839 ± 0.044
GluTyr: 1.205 ± 0.056
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.718 ± 0.089
PheCys: 0.143 ± 0.019
PheAsp: 2.534 ± 0.069
PheGlu: 1.914 ± 0.063
PhePhe: 1.372 ± 0.064
PheGly: 3.399 ± 0.091
PheHis: 0.763 ± 0.038
PheIle: 1.641 ± 0.064
PheLys: 0.481 ± 0.035
PheLeu: 3.065 ± 0.086
PheMet: 0.553 ± 0.031
PheAsn: 0.705 ± 0.04
PhePro: 1.468 ± 0.05
PheGln: 1.049 ± 0.043
PheArg: 1.838 ± 0.065
PheSer: 2.384 ± 0.066
PheThr: 1.829 ± 0.065
PheVal: 3.035 ± 0.078
PheTrp: 0.472 ± 0.031
PheTyr: 0.735 ± 0.037
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.657 ± 0.161
GlyCys: 0.52 ± 0.036
GlyAsp: 4.868 ± 0.096
GlyGlu: 5.335 ± 0.105
GlyPhe: 3.447 ± 0.089
GlyGly: 7.381 ± 0.194
GlyHis: 2.098 ± 0.076
GlyIle: 4.816 ± 0.112
GlyLys: 2.549 ± 0.084
GlyLeu: 9.204 ± 0.169
GlyMet: 1.895 ± 0.057
GlyAsn: 1.91 ± 0.069
GlyPro: 3.683 ± 0.073
GlyGln: 3.093 ± 0.074
GlyArg: 4.866 ± 0.101
GlySer: 5.783 ± 0.115
GlyThr: 4.597 ± 0.108
GlyVal: 8.677 ± 0.14
GlyTrp: 1.377 ± 0.051
GlyTyr: 2.168 ± 0.069
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.189 ± 0.075
HisCys: 0.149 ± 0.015
HisAsp: 1.307 ± 0.052
HisGlu: 1.238 ± 0.052
HisPhe: 0.687 ± 0.036
HisGly: 1.829 ± 0.065
HisHis: 0.759 ± 0.041
HisIle: 1.077 ± 0.048
HisLys: 0.525 ± 0.033
HisLeu: 2.185 ± 0.065
HisMet: 0.395 ± 0.026
HisAsn: 0.555 ± 0.029
HisPro: 1.751 ± 0.062
HisGln: 0.951 ± 0.045
HisArg: 1.624 ± 0.06
HisSer: 1.643 ± 0.059
HisThr: 1.632 ± 0.061
HisVal: 1.558 ± 0.054
HisTrp: 0.364 ± 0.025
HisTyr: 0.618 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.518 ± 0.109
IleCys: 0.219 ± 0.022
IleAsp: 3.952 ± 0.097
IleGlu: 3.136 ± 0.079
IlePhe: 1.348 ± 0.06
IleGly: 4.27 ± 0.097
IleHis: 0.969 ± 0.041
IleIle: 2.343 ± 0.063
IleLys: 0.965 ± 0.049
IleLeu: 3.854 ± 0.089
IleMet: 0.679 ± 0.041
IleAsn: 1.285 ± 0.044
IlePro: 2.935 ± 0.078
IleGln: 1.411 ± 0.054
IleArg: 3.074 ± 0.08
IleSer: 3.204 ± 0.085
IleThr: 3.113 ± 0.085
IleVal: 4.663 ± 0.111
IleTrp: 0.544 ± 0.031
IleTyr: 0.847 ± 0.041
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.28 ± 0.088
LysCys: 0.089 ± 0.011
LysAsp: 1.574 ± 0.061
LysGlu: 1.513 ± 0.067
LysPhe: 0.564 ± 0.036
LysGly: 2.038 ± 0.072
LysHis: 0.499 ± 0.026
LysIle: 1.22 ± 0.052
LysLys: 1.48 ± 0.077
LysLeu: 2.057 ± 0.061
LysMet: 0.557 ± 0.035
LysAsn: 0.913 ± 0.041
LysPro: 1.585 ± 0.063
LysGln: 0.776 ± 0.038
LysArg: 1.99 ± 0.062
LysSer: 1.628 ± 0.058
LysThr: 1.879 ± 0.068
LysVal: 2.064 ± 0.076
LysTrp: 0.316 ± 0.025
LysTyr: 0.509 ± 0.032
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.654 ± 0.209
LeuCys: 0.509 ± 0.028
LeuAsp: 6.284 ± 0.129
LeuGlu: 6.369 ± 0.119
LeuPhe: 3.24 ± 0.11
LeuGly: 9.616 ± 0.168
LeuHis: 2.09 ± 0.07
LeuIle: 4.962 ± 0.104
LeuLys: 2.287 ± 0.075
LeuLeu: 9.414 ± 0.188
LeuMet: 1.918 ± 0.057
LeuAsn: 2.146 ± 0.061
LeuPro: 5.206 ± 0.105
LeuGln: 3.13 ± 0.073
LeuArg: 6.47 ± 0.128
LeuSer: 6.86 ± 0.118
LeuThr: 5.954 ± 0.116
LeuVal: 9.279 ± 0.161
LeuTrp: 1.485 ± 0.062
LeuTyr: 1.786 ± 0.064
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.723 ± 0.071
MetCys: 0.091 ± 0.013
MetAsp: 1.058 ± 0.044
MetGlu: 1.004 ± 0.045
MetPhe: 0.574 ± 0.035
MetGly: 1.615 ± 0.052
MetHis: 0.325 ± 0.021
MetIle: 0.917 ± 0.051
MetLys: 0.6 ± 0.036
MetLeu: 1.712 ± 0.057
MetMet: 0.427 ± 0.029
MetAsn: 0.566 ± 0.032
MetPro: 1.021 ± 0.042
MetGln: 0.472 ± 0.032
MetArg: 1.324 ± 0.046
MetSer: 1.739 ± 0.056
MetThr: 1.47 ± 0.05
MetVal: 2.011 ± 0.06
MetTrp: 0.265 ± 0.023
MetTyr: 0.278 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.345 ± 0.067
AsnCys: 0.111 ± 0.015
AsnAsp: 1.223 ± 0.049
AsnGlu: 1.16 ± 0.051
AsnPhe: 0.705 ± 0.035
AsnGly: 1.684 ± 0.064
AsnHis: 0.485 ± 0.03
AsnIle: 1.181 ± 0.049
AsnLys: 0.594 ± 0.034
AsnLeu: 2.348 ± 0.076
AsnMet: 0.462 ± 0.031
AsnAsn: 0.574 ± 0.036
AsnPro: 1.946 ± 0.057
AsnGln: 0.982 ± 0.041
AsnArg: 1.567 ± 0.051
AsnSer: 1.287 ± 0.056
AsnThr: 1.602 ± 0.058
AsnVal: 1.702 ± 0.061
AsnTrp: 0.382 ± 0.025
AsnTyr: 0.605 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.553 ± 0.117
ProCys: 0.186 ± 0.019
ProAsp: 3.607 ± 0.091
ProGlu: 4.689 ± 0.111
ProPhe: 1.559 ± 0.051
ProGly: 5.04 ± 0.102
ProHis: 1.381 ± 0.054
ProIle: 2.189 ± 0.055
ProLys: 1.37 ± 0.048
ProLeu: 4.979 ± 0.108
ProMet: 0.887 ± 0.043
ProAsn: 1.123 ± 0.05
ProPro: 2.3 ± 0.085
ProGln: 1.817 ± 0.053
ProArg: 2.749 ± 0.075
ProSer: 3.247 ± 0.082
ProThr: 3.163 ± 0.071
ProVal: 4.372 ± 0.092
ProTrp: 0.765 ± 0.037
ProTyr: 1.053 ± 0.049
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.309 ± 0.1
GlnCys: 0.167 ± 0.016
GlnAsp: 1.533 ± 0.055
GlnGlu: 1.953 ± 0.061
GlnPhe: 1.001 ± 0.043
GlnGly: 2.603 ± 0.068
GlnHis: 0.897 ± 0.043
GlnIle: 1.613 ± 0.057
GlnLys: 1.043 ± 0.05
GlnLeu: 3.555 ± 0.092
GlnMet: 0.852 ± 0.036
GlnAsn: 0.663 ± 0.038
GlnPro: 1.929 ± 0.069
GlnGln: 1.431 ± 0.059
GlnArg: 2.722 ± 0.077
GlnSer: 2.285 ± 0.065
GlnThr: 1.673 ± 0.062
GlnVal: 3.098 ± 0.068
GlnTrp: 0.737 ± 0.04
GlnTyr: 0.577 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.518 ± 0.116
ArgCys: 0.293 ± 0.023
ArgAsp: 3.657 ± 0.078
ArgGlu: 4.418 ± 0.101
ArgPhe: 2.35 ± 0.064
ArgGly: 5.133 ± 0.094
ArgHis: 1.515 ± 0.053
ArgIle: 3.158 ± 0.079
ArgLys: 1.667 ± 0.061
ArgLeu: 6.451 ± 0.13
ArgMet: 1.509 ± 0.047
ArgAsn: 1.415 ± 0.05
ArgPro: 2.87 ± 0.075
ArgGln: 2.313 ± 0.064
ArgArg: 4.52 ± 0.117
ArgSer: 3.915 ± 0.082
ArgThr: 3.124 ± 0.08
ArgVal: 5.577 ± 0.108
ArgTrp: 0.921 ± 0.045
ArgTyr: 1.526 ± 0.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.746 ± 0.116
SerCys: 0.308 ± 0.025
SerAsp: 3.912 ± 0.081
SerGlu: 3.965 ± 0.095
SerPhe: 2.239 ± 0.071
SerGly: 6.479 ± 0.126
SerHis: 1.574 ± 0.057
SerIle: 2.775 ± 0.077
SerLys: 1.545 ± 0.051
SerLeu: 6.583 ± 0.117
SerMet: 1.422 ± 0.052
SerAsn: 1.416 ± 0.063
SerPro: 3.533 ± 0.079
SerGln: 2.248 ± 0.071
SerArg: 3.988 ± 0.095
SerSer: 4.288 ± 0.12
SerThr: 3.874 ± 0.087
SerVal: 5.931 ± 0.123
SerTrp: 0.975 ± 0.044
SerTyr: 1.364 ± 0.047
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.716 ± 0.122
ThrCys: 0.212 ± 0.021
ThrAsp: 3.217 ± 0.086
ThrGlu: 3.355 ± 0.083
ThrPhe: 1.764 ± 0.065
ThrGly: 5.391 ± 0.114
ThrHis: 1.392 ± 0.055
ThrIle: 3.054 ± 0.078
ThrLys: 1.645 ± 0.063
ThrLeu: 6.719 ± 0.126
ThrMet: 1.125 ± 0.048
ThrAsn: 1.32 ± 0.05
ThrPro: 4.19 ± 0.101
ThrGln: 1.938 ± 0.064
ThrArg: 3.438 ± 0.081
ThrSer: 3.485 ± 0.077
ThrThr: 3.692 ± 0.105
ThrVal: 5.248 ± 0.102
ThrTrp: 0.774 ± 0.037
ThrTyr: 0.976 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.392 ± 0.16
ValCys: 0.466 ± 0.032
ValAsp: 5.785 ± 0.114
ValGlu: 5.38 ± 0.106
ValPhe: 3.317 ± 0.078
ValGly: 7.526 ± 0.132
ValHis: 1.758 ± 0.07
ValIle: 4.792 ± 0.086
ValLys: 1.983 ± 0.066
ValLeu: 9.234 ± 0.147
ValMet: 1.78 ± 0.053
ValAsn: 2.215 ± 0.066
ValPro: 3.904 ± 0.088
ValGln: 2.191 ± 0.069
ValArg: 4.925 ± 0.098
ValSer: 6.219 ± 0.107
ValThr: 5.577 ± 0.118
ValVal: 9.316 ± 0.158
ValTrp: 1.233 ± 0.052
ValTyr: 1.493 ± 0.053
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.442 ± 0.053
TrpCys: 0.093 ± 0.012
TrpAsp: 0.728 ± 0.036
TrpGlu: 0.741 ± 0.044
TrpPhe: 0.64 ± 0.038
TrpGly: 1.073 ± 0.057
TrpHis: 0.308 ± 0.025
TrpIle: 0.576 ± 0.034
TrpLys: 0.362 ± 0.027
TrpLeu: 1.895 ± 0.066
TrpMet: 0.373 ± 0.028
TrpAsn: 0.384 ± 0.024
TrpPro: 0.739 ± 0.04
TrpGln: 0.618 ± 0.034
TrpArg: 1.015 ± 0.046
TrpSer: 1.188 ± 0.05
TrpThr: 0.622 ± 0.036
TrpVal: 1.164 ± 0.057
TrpTrp: 0.512 ± 0.037
TrpTyr: 0.236 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.931 ± 0.064
TyrCys: 0.152 ± 0.019
TyrAsp: 1.212 ± 0.05
TyrGlu: 1.108 ± 0.044
TyrPhe: 0.865 ± 0.042
TyrGly: 1.752 ± 0.063
TyrHis: 0.397 ± 0.03
TyrIle: 0.676 ± 0.035
TyrLys: 0.351 ± 0.027
TyrLeu: 2.397 ± 0.07
TyrMet: 0.288 ± 0.021
TyrAsn: 0.468 ± 0.03
TyrPro: 1.144 ± 0.046
TyrGln: 0.939 ± 0.042
TyrArg: 1.736 ± 0.061
TyrSer: 1.335 ± 0.053
TyrThr: 1.077 ± 0.047
TyrVal: 1.565 ± 0.055
TyrTrp: 0.314 ± 0.026
TyrTyr: 0.444 ± 0.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1693 proteins (538662 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
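
A protein-level bootstrap of this kind might look like the sketch below. The page states only that 100 bootstrap replicates were drawn at the protein level, so the rest is an assumption: proteins are resampled with replacement, the frequency is recomputed per replicate, and the error is taken as the standard deviation across replicates. The helper names and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += int(seq[i:i + 2] == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Assumed error estimate: std. dev. of the frequency across resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not real Pontimonas salivibrio sequences.
toy = ["MAALAA", "MKVLAA", "MGGAAL", "MSTAVL"]
point = pair_permille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AA: {point:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps the residue pairs within each protein intact, so the estimate reflects protein-to-protein variation.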

You can download the above dipeptide statistics (among other stats for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski