Amino acid dipepetide frequency for Campylobacter phage CP20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.48AlaAla: 2.48 ± 0.361
0.846AlaCys: 0.846 ± 0.122
2.615AlaAsp: 2.615 ± 0.244
2.557AlaGlu: 2.557 ± 0.212
1.75AlaPhe: 1.75 ± 0.205
2.442AlaGly: 2.442 ± 0.252
0.596AlaHis: 0.596 ± 0.135
3.326AlaIle: 3.326 ± 0.218
4.211AlaLys: 4.211 ± 0.228
4.403AlaLeu: 4.403 ± 0.254
1.019AlaMet: 1.019 ± 0.109
3.557AlaAsn: 3.557 ± 0.308
1.154AlaPro: 1.154 ± 0.158
1.269AlaGln: 1.269 ± 0.189
1.596AlaArg: 1.596 ± 0.189
3.48AlaSer: 3.48 ± 0.342
2.038AlaThr: 2.038 ± 0.191
2.807AlaVal: 2.807 ± 0.208
0.346AlaTrp: 0.346 ± 0.074
1.769AlaTyr: 1.769 ± 0.196
0.0AlaXaa: 0.0 ± 0.0
Cys
0.423CysAla: 0.423 ± 0.082
0.288CysCys: 0.288 ± 0.087
0.961CysAsp: 0.961 ± 0.181
1.0CysGlu: 1.0 ± 0.147
0.865CysPhe: 0.865 ± 0.151
0.865CysGly: 0.865 ± 0.156
0.231CysHis: 0.231 ± 0.066
1.365CysIle: 1.365 ± 0.187
1.192CysLys: 1.192 ± 0.155
1.327CysLeu: 1.327 ± 0.17
0.365CysMet: 0.365 ± 0.081
1.269CysAsn: 1.269 ± 0.237
0.346CysPro: 0.346 ± 0.109
0.461CysGln: 0.461 ± 0.113
0.558CysArg: 0.558 ± 0.118
0.961CysSer: 0.961 ± 0.149
0.692CysThr: 0.692 ± 0.167
0.731CysVal: 0.731 ± 0.106
0.115CysTrp: 0.115 ± 0.049
0.711CysTyr: 0.711 ± 0.136
0.0CysXaa: 0.0 ± 0.0
Asp
2.999AspAla: 2.999 ± 0.194
1.288AspCys: 1.288 ± 0.245
3.365AspAsp: 3.365 ± 0.309
3.192AspGlu: 3.192 ± 0.254
5.364AspPhe: 5.364 ± 0.402
2.942AspGly: 2.942 ± 0.236
0.673AspHis: 0.673 ± 0.108
6.672AspIle: 6.672 ± 0.38
4.095AspLys: 4.095 ± 0.262
8.229AspLeu: 8.229 ± 0.449
0.846AspMet: 0.846 ± 0.124
4.653AspAsn: 4.653 ± 0.28
2.23AspPro: 2.23 ± 0.2
1.096AspGln: 1.096 ± 0.166
1.788AspArg: 1.788 ± 0.166
6.883AspSer: 6.883 ± 0.418
4.172AspThr: 4.172 ± 0.232
3.076AspVal: 3.076 ± 0.225
0.538AspTrp: 0.538 ± 0.099
4.576AspTyr: 4.576 ± 0.302
0.0AspXaa: 0.0 ± 0.0
Glu
4.038GluAla: 4.038 ± 0.305
0.923GluCys: 0.923 ± 0.155
3.288GluAsp: 3.288 ± 0.263
3.23GluGlu: 3.23 ± 0.242
4.172GluPhe: 4.172 ± 0.292
2.576GluGly: 2.576 ± 0.243
0.904GluHis: 0.904 ± 0.142
6.153GluIle: 6.153 ± 0.323
4.192GluLys: 4.192 ± 0.296
7.095GluLeu: 7.095 ± 0.42
1.0GluMet: 1.0 ± 0.121
4.749GluAsn: 4.749 ± 0.316
2.115GluPro: 2.115 ± 0.214
1.519GluGln: 1.519 ± 0.155
1.538GluArg: 1.538 ± 0.214
5.076GluSer: 5.076 ± 0.309
4.192GluThr: 4.192 ± 0.309
4.057GluVal: 4.057 ± 0.313
0.404GluTrp: 0.404 ± 0.08
3.807GluTyr: 3.807 ± 0.26
0.0GluXaa: 0.0 ± 0.0
Phe
1.577PheAla: 1.577 ± 0.162
0.692PheCys: 0.692 ± 0.122
4.115PheAsp: 4.115 ± 0.271
4.095PheGlu: 4.095 ± 0.293
1.98PhePhe: 1.98 ± 0.237
2.615PheGly: 2.615 ± 0.203
0.577PheHis: 0.577 ± 0.105
4.326PheIle: 4.326 ± 0.294
6.614PheLys: 6.614 ± 0.396
4.691PheLeu: 4.691 ± 0.311
0.942PheMet: 0.942 ± 0.119
5.018PheAsn: 5.018 ± 0.294
0.884PhePro: 0.884 ± 0.14
1.25PheGln: 1.25 ± 0.156
1.73PheArg: 1.73 ± 0.16
3.076PheSer: 3.076 ± 0.253
2.75PheThr: 2.75 ± 0.233
2.423PheVal: 2.423 ± 0.184
0.231PheTrp: 0.231 ± 0.071
2.576PheTyr: 2.576 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
2.576GlyAla: 2.576 ± 0.243
0.827GlyCys: 0.827 ± 0.155
3.096GlyAsp: 3.096 ± 0.234
2.384GlyGlu: 2.384 ± 0.256
2.634GlyPhe: 2.634 ± 0.255
1.961GlyGly: 1.961 ± 0.217
0.442GlyHis: 0.442 ± 0.092
4.211GlyIle: 4.211 ± 0.259
3.615GlyLys: 3.615 ± 0.265
4.192GlyLeu: 4.192 ± 0.284
0.961GlyMet: 0.961 ± 0.124
3.096GlyAsn: 3.096 ± 0.233
0.808GlyPro: 0.808 ± 0.109
1.231GlyGln: 1.231 ± 0.172
1.0GlyArg: 1.0 ± 0.126
4.057GlySer: 4.057 ± 0.306
2.807GlyThr: 2.807 ± 0.237
3.134GlyVal: 3.134 ± 0.249
0.25GlyTrp: 0.25 ± 0.065
2.307GlyTyr: 2.307 ± 0.215
0.0GlyXaa: 0.0 ± 0.0
His
0.269HisAla: 0.269 ± 0.061
0.365HisCys: 0.365 ± 0.098
0.577HisAsp: 0.577 ± 0.113
0.423HisGlu: 0.423 ± 0.088
0.846HisPhe: 0.846 ± 0.125
0.442HisGly: 0.442 ± 0.098
0.269HisHis: 0.269 ± 0.071
1.423HisIle: 1.423 ± 0.151
1.807HisLys: 1.807 ± 0.187
1.192HisLeu: 1.192 ± 0.142
0.154HisMet: 0.154 ± 0.067
1.173HisAsn: 1.173 ± 0.181
0.538HisPro: 0.538 ± 0.101
0.346HisGln: 0.346 ± 0.08
0.481HisArg: 0.481 ± 0.084
1.173HisSer: 1.173 ± 0.161
0.884HisThr: 0.884 ± 0.116
0.327HisVal: 0.327 ± 0.075
0.038HisTrp: 0.038 ± 0.028
0.692HisTyr: 0.692 ± 0.101
0.0HisXaa: 0.0 ± 0.0
Ile
3.403IleAla: 3.403 ± 0.278
1.25IleCys: 1.25 ± 0.148
7.383IleAsp: 7.383 ± 0.336
6.595IleGlu: 6.595 ± 0.392
4.23IlePhe: 4.23 ± 0.275
3.038IleGly: 3.038 ± 0.236
1.173IleHis: 1.173 ± 0.16
6.614IleIle: 6.614 ± 0.395
10.075IleLys: 10.075 ± 0.473
7.479IleLeu: 7.479 ± 0.421
1.327IleMet: 1.327 ± 0.161
7.979IleAsn: 7.979 ± 0.41
2.442IlePro: 2.442 ± 0.238
3.403IleGln: 3.403 ± 0.271
2.519IleArg: 2.519 ± 0.26
5.307IleSer: 5.307 ± 0.374
4.557IleThr: 4.557 ± 0.307
4.365IleVal: 4.365 ± 0.263
0.538IleTrp: 0.538 ± 0.101
3.326IleTyr: 3.326 ± 0.256
0.0IleXaa: 0.0 ± 0.0
Lys
4.038LysAla: 4.038 ± 0.275
1.211LysCys: 1.211 ± 0.199
7.71LysAsp: 7.71 ± 0.431
6.557LysGlu: 6.557 ± 0.381
4.48LysPhe: 4.48 ± 0.281
4.038LysGly: 4.038 ± 0.25
1.596LysHis: 1.596 ± 0.187
8.633LysIle: 8.633 ± 0.452
7.922LysLys: 7.922 ± 0.397
8.672LysLeu: 8.672 ± 0.452
1.711LysMet: 1.711 ± 0.189
7.287LysAsn: 7.287 ± 0.395
3.153LysPro: 3.153 ± 0.24
3.057LysGln: 3.057 ± 0.239
2.653LysArg: 2.653 ± 0.252
6.807LysSer: 6.807 ± 0.367
5.249LysThr: 5.249 ± 0.368
4.711LysVal: 4.711 ± 0.333
0.923LysTrp: 0.923 ± 0.123
6.287LysTyr: 6.287 ± 0.346
0.0LysXaa: 0.0 ± 0.0
Leu
3.942LeuAla: 3.942 ± 0.242
0.846LeuCys: 0.846 ± 0.094
7.364LeuAsp: 7.364 ± 0.373
6.73LeuGlu: 6.73 ± 0.38
3.403LeuPhe: 3.403 ± 0.25
4.076LeuGly: 4.076 ± 0.309
1.327LeuHis: 1.327 ± 0.173
6.883LeuIle: 6.883 ± 0.408
10.575LeuLys: 10.575 ± 0.442
5.614LeuLeu: 5.614 ± 0.323
1.519LeuMet: 1.519 ± 0.162
8.268LeuAsn: 8.268 ± 0.401
2.653LeuPro: 2.653 ± 0.203
2.153LeuGln: 2.153 ± 0.218
2.769LeuArg: 2.769 ± 0.201
6.46LeuSer: 6.46 ± 0.419
4.538LeuThr: 4.538 ± 0.277
4.23LeuVal: 4.23 ± 0.301
0.692LeuTrp: 0.692 ± 0.093
4.038LeuTyr: 4.038 ± 0.304
0.0LeuXaa: 0.0 ± 0.0
Met
0.884MetAla: 0.884 ± 0.122
0.269MetCys: 0.269 ± 0.07
1.038MetAsp: 1.038 ± 0.163
0.75MetGlu: 0.75 ± 0.119
1.038MetPhe: 1.038 ± 0.138
0.75MetGly: 0.75 ± 0.124
0.25MetHis: 0.25 ± 0.073
1.5MetIle: 1.5 ± 0.141
1.769MetLys: 1.769 ± 0.189
1.673MetLeu: 1.673 ± 0.159
0.212MetMet: 0.212 ± 0.062
1.73MetAsn: 1.73 ± 0.197
0.519MetPro: 0.519 ± 0.094
0.519MetGln: 0.519 ± 0.094
0.615MetArg: 0.615 ± 0.108
1.404MetSer: 1.404 ± 0.173
0.846MetThr: 0.846 ± 0.118
0.865MetVal: 0.865 ± 0.105
0.058MetTrp: 0.058 ± 0.035
0.884MetTyr: 0.884 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
3.23AsnAla: 3.23 ± 0.238
1.211AsnCys: 1.211 ± 0.215
4.538AsnAsp: 4.538 ± 0.348
5.288AsnGlu: 5.288 ± 0.257
4.634AsnPhe: 4.634 ± 0.347
3.672AsnGly: 3.672 ± 0.243
1.0AsnHis: 1.0 ± 0.13
8.402AsnIle: 8.402 ± 0.412
7.883AsnLys: 7.883 ± 0.428
6.845AsnLeu: 6.845 ± 0.344
1.481AsnMet: 1.481 ± 0.175
6.191AsnAsn: 6.191 ± 0.383
3.211AsnPro: 3.211 ± 0.241
2.596AsnGln: 2.596 ± 0.224
2.173AsnArg: 2.173 ± 0.215
4.249AsnSer: 4.249 ± 0.308
4.903AsnThr: 4.903 ± 0.307
4.461AsnVal: 4.461 ± 0.313
0.423AsnTrp: 0.423 ± 0.087
4.826AsnTyr: 4.826 ± 0.329
0.0AsnXaa: 0.0 ± 0.0
Pro
0.961ProAla: 0.961 ± 0.141
0.212ProCys: 0.212 ± 0.065
2.134ProAsp: 2.134 ± 0.199
2.826ProGlu: 2.826 ± 0.222
1.327ProPhe: 1.327 ± 0.185
1.884ProGly: 1.884 ± 0.197
0.308ProHis: 0.308 ± 0.082
2.057ProIle: 2.057 ± 0.188
2.769ProLys: 2.769 ± 0.206
2.288ProLeu: 2.288 ± 0.214
0.481ProMet: 0.481 ± 0.09
2.057ProAsn: 2.057 ± 0.197
0.365ProPro: 0.365 ± 0.081
0.635ProGln: 0.635 ± 0.112
1.058ProArg: 1.058 ± 0.165
2.288ProSer: 2.288 ± 0.2
1.442ProThr: 1.442 ± 0.163
1.654ProVal: 1.654 ± 0.174
0.231ProTrp: 0.231 ± 0.057
1.404ProTyr: 1.404 ± 0.161
0.0ProXaa: 0.0 ± 0.0
Gln
2.211GlnAla: 2.211 ± 0.224
0.385GlnCys: 0.385 ± 0.091
2.096GlnAsp: 2.096 ± 0.222
2.25GlnGlu: 2.25 ± 0.195
1.154GlnPhe: 1.154 ± 0.167
1.615GlnGly: 1.615 ± 0.207
0.404GlnHis: 0.404 ± 0.105
2.615GlnIle: 2.615 ± 0.23
2.307GlnLys: 2.307 ± 0.192
2.403GlnLeu: 2.403 ± 0.217
0.461GlnMet: 0.461 ± 0.085
2.346GlnAsn: 2.346 ± 0.251
0.519GlnPro: 0.519 ± 0.104
0.461GlnGln: 0.461 ± 0.093
0.961GlnArg: 0.961 ± 0.167
1.461GlnSer: 1.461 ± 0.172
1.654GlnThr: 1.654 ± 0.185
1.807GlnVal: 1.807 ± 0.21
0.231GlnTrp: 0.231 ± 0.063
1.634GlnTyr: 1.634 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
1.365ArgAla: 1.365 ± 0.157
0.404ArgCys: 0.404 ± 0.09
1.711ArgAsp: 1.711 ± 0.194
2.0ArgGlu: 2.0 ± 0.207
2.327ArgPhe: 2.327 ± 0.213
1.634ArgGly: 1.634 ± 0.209
0.5ArgHis: 0.5 ± 0.072
2.942ArgIle: 2.942 ± 0.2
2.173ArgLys: 2.173 ± 0.238
2.73ArgLeu: 2.73 ± 0.243
0.404ArgMet: 0.404 ± 0.072
2.115ArgAsn: 2.115 ± 0.186
0.711ArgPro: 0.711 ± 0.106
0.731ArgGln: 0.731 ± 0.126
0.923ArgArg: 0.923 ± 0.136
2.077ArgSer: 2.077 ± 0.21
1.904ArgThr: 1.904 ± 0.173
1.807ArgVal: 1.807 ± 0.179
0.192ArgTrp: 0.192 ± 0.058
1.634ArgTyr: 1.634 ± 0.199
0.0ArgXaa: 0.0 ± 0.0
Ser
3.038SerAla: 3.038 ± 0.317
0.711SerCys: 0.711 ± 0.115
4.365SerAsp: 4.365 ± 0.331
4.345SerGlu: 4.345 ± 0.334
3.499SerPhe: 3.499 ± 0.3
3.788SerGly: 3.788 ± 0.314
0.904SerHis: 0.904 ± 0.112
5.98SerIle: 5.98 ± 0.301
8.306SerLys: 8.306 ± 0.437
5.768SerLeu: 5.768 ± 0.329
1.538SerMet: 1.538 ± 0.195
5.614SerAsn: 5.614 ± 0.389
1.365SerPro: 1.365 ± 0.191
2.442SerGln: 2.442 ± 0.213
2.153SerArg: 2.153 ± 0.194
4.672SerSer: 4.672 ± 0.41
3.538SerThr: 3.538 ± 0.239
4.076SerVal: 4.076 ± 0.314
0.404SerTrp: 0.404 ± 0.096
2.999SerTyr: 2.999 ± 0.215
0.0SerXaa: 0.0 ± 0.0
Thr
2.327ThrAla: 2.327 ± 0.236
0.923ThrCys: 0.923 ± 0.204
3.634ThrAsp: 3.634 ± 0.271
3.922ThrGlu: 3.922 ± 0.309
2.711ThrPhe: 2.711 ± 0.223
2.807ThrGly: 2.807 ± 0.275
0.673ThrHis: 0.673 ± 0.104
4.384ThrIle: 4.384 ± 0.268
6.191ThrLys: 6.191 ± 0.42
4.307ThrLeu: 4.307 ± 0.256
1.038ThrMet: 1.038 ± 0.119
4.711ThrAsn: 4.711 ± 0.269
2.057ThrPro: 2.057 ± 0.182
2.153ThrGln: 2.153 ± 0.23
2.0ThrArg: 2.0 ± 0.213
3.192ThrSer: 3.192 ± 0.296
3.422ThrThr: 3.422 ± 0.318
2.519ThrVal: 2.519 ± 0.251
0.481ThrTrp: 0.481 ± 0.091
2.807ThrTyr: 2.807 ± 0.253
0.0ThrXaa: 0.0 ± 0.0
Val
2.25ValAla: 2.25 ± 0.24
0.769ValCys: 0.769 ± 0.144
4.384ValAsp: 4.384 ± 0.292
3.519ValGlu: 3.519 ± 0.242
2.442ValPhe: 2.442 ± 0.24
2.23ValGly: 2.23 ± 0.229
0.731ValHis: 0.731 ± 0.1
4.134ValIle: 4.134 ± 0.209
4.845ValLys: 4.845 ± 0.311
4.922ValLeu: 4.922 ± 0.333
0.942ValMet: 0.942 ± 0.144
4.038ValAsn: 4.038 ± 0.273
1.596ValPro: 1.596 ± 0.202
1.673ValGln: 1.673 ± 0.206
1.807ValArg: 1.807 ± 0.165
3.634ValSer: 3.634 ± 0.307
2.961ValThr: 2.961 ± 0.285
2.942ValVal: 2.942 ± 0.233
0.346ValTrp: 0.346 ± 0.079
2.961ValTyr: 2.961 ± 0.235
0.0ValXaa: 0.0 ± 0.0
Trp
0.288TrpAla: 0.288 ± 0.06
0.173TrpCys: 0.173 ± 0.057
0.558TrpAsp: 0.558 ± 0.11
0.423TrpGlu: 0.423 ± 0.07
0.327TrpPhe: 0.327 ± 0.09
0.327TrpGly: 0.327 ± 0.07
0.038TrpHis: 0.038 ± 0.026
0.481TrpIle: 0.481 ± 0.101
0.423TrpLys: 0.423 ± 0.092
0.519TrpLeu: 0.519 ± 0.091
0.25TrpMet: 0.25 ± 0.056
0.769TrpAsn: 0.769 ± 0.135
0.115TrpPro: 0.115 ± 0.039
0.327TrpGln: 0.327 ± 0.08
0.173TrpArg: 0.173 ± 0.063
0.5TrpSer: 0.5 ± 0.095
0.327TrpThr: 0.327 ± 0.083
0.442TrpVal: 0.442 ± 0.094
0.038TrpTrp: 0.038 ± 0.026
0.365TrpTyr: 0.365 ± 0.074
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.961TyrAla: 1.961 ± 0.23
1.134TyrCys: 1.134 ± 0.169
3.749TyrAsp: 3.749 ± 0.208
2.942TyrGlu: 2.942 ± 0.243
2.98TyrPhe: 2.98 ± 0.26
1.884TyrGly: 1.884 ± 0.181
0.827TyrHis: 0.827 ± 0.122
4.903TyrIle: 4.903 ± 0.325
5.73TyrLys: 5.73 ± 0.348
3.73TyrLeu: 3.73 ± 0.305
0.904TyrMet: 0.904 ± 0.115
4.615TyrAsn: 4.615 ± 0.345
1.596TyrPro: 1.596 ± 0.191
1.654TyrGln: 1.654 ± 0.179
1.788TyrArg: 1.788 ± 0.191
2.75TyrSer: 2.75 ± 0.225
3.384TyrThr: 3.384 ± 0.256
2.692TyrVal: 2.692 ± 0.253
0.385TyrTrp: 0.385 ± 0.088
2.557TyrTyr: 2.557 ± 0.208
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 202 proteins (52010 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski