Amino acid dipeptide frequency for Thermus phage phiYS40

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
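As a rough illustration of how such frequencies can be computed, the Python sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences and the normalisation by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code (the table itself uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Hypothetical helper: pairs are counted with overlap inside each protein
    and never span two proteins. Normalising by the total pair count is an
    assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_per_mille = 1000.0 / 400

# Toy sequences, not taken from the phiYS40 proteome.
toy = ["MKKLLEE", "MEELKK"]
freqs = dipeptide_per_mille(toy)
over = sorted(k for k, v in freqs.items() if v > uniform_per_mille)
print(over)
```

On a real proteome the uniform baseline is only a first approximation, since it ignores how common each individual amino acid is.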

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
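The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the three values are copied from the table, and the comparison against the uniform 2.5‰ baseline is only an illustration, not necessarily the criterion used for the colouring on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# Uniform expectation over 400 dipeptides, expressed in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

# Values copied from the table.
examples = {"GluGlu": 9.95, "AlaAla": 1.669, "CysTrp": 0.021}

for name, value in examples.items():
    ratio = value / UNIFORM_PER_MILLE  # >1 means above the uniform baseline
    print(name, per_mille_to_fraction(value), per_mille_to_percent(value), round(ratio, 3))
```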

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.669 ± 0.234
AlaCys: 0.188 ± 0.054
AlaAsp: 1.69 ± 0.159
AlaGlu: 2.899 ± 0.317
AlaPhe: 2.524 ± 0.239
AlaGly: 1.71 ± 0.225
AlaHis: 0.521 ± 0.106
AlaIle: 3.963 ± 0.289
AlaLys: 4.193 ± 0.298
AlaLeu: 5.152 ± 0.407
AlaMet: 0.667 ± 0.131
AlaAsn: 2.878 ± 0.203
AlaPro: 1.126 ± 0.152
AlaGln: 1.564 ± 0.162
AlaArg: 1.836 ± 0.226
AlaSer: 3.755 ± 0.338
AlaThr: 2.42 ± 0.297
AlaVal: 2.545 ± 0.22
AlaTrp: 0.313 ± 0.086
AlaTyr: 2.148 ± 0.2
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.271 ± 0.065
CysCys: 0.042 ± 0.031
CysAsp: 0.271 ± 0.081
CysGlu: 0.396 ± 0.096
CysPhe: 0.459 ± 0.125
CysGly: 0.417 ± 0.111
CysHis: 0.063 ± 0.034
CysIle: 0.292 ± 0.081
CysLys: 0.375 ± 0.095
CysLeu: 0.334 ± 0.097
CysMet: 0.042 ± 0.03
CysAsn: 0.292 ± 0.088
CysPro: 0.292 ± 0.096
CysGln: 0.146 ± 0.068
CysArg: 0.146 ± 0.056
CysSer: 0.647 ± 0.148
CysThr: 0.292 ± 0.077
CysVal: 0.25 ± 0.068
CysTrp: 0.021 ± 0.022
CysTyr: 0.334 ± 0.074
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.482 ± 0.226
AspCys: 0.417 ± 0.113
AspAsp: 3.129 ± 0.296
AspGlu: 5.235 ± 0.341
AspPhe: 4.672 ± 0.289
AspGly: 1.752 ± 0.176
AspHis: 0.355 ± 0.087
AspIle: 4.777 ± 0.327
AspLys: 4.318 ± 0.344
AspLeu: 5.924 ± 0.345
AspMet: 0.584 ± 0.112
AspAsn: 2.294 ± 0.203
AspPro: 1.231 ± 0.224
AspGln: 0.834 ± 0.12
AspArg: 1.773 ± 0.191
AspSer: 2.774 ± 0.251
AspThr: 2.774 ± 0.23
AspVal: 3.963 ± 0.307
AspTrp: 0.313 ± 0.075
AspTyr: 3.755 ± 0.263
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.026 ± 0.488
GluCys: 0.334 ± 0.092
GluAsp: 5.653 ± 0.411
GluGlu: 9.95 ± 0.871
GluPhe: 5.089 ± 0.347
GluGly: 3.442 ± 0.26
GluHis: 1.272 ± 0.177
GluIle: 7.426 ± 0.467
GluLys: 9.115 ± 0.584
GluLeu: 7.113 ± 0.376
GluMet: 1.418 ± 0.179
GluAsn: 6.466 ± 0.315
GluPro: 1.731 ± 0.204
GluGln: 2.002 ± 0.274
GluArg: 2.941 ± 0.268
GluSer: 4.297 ± 0.332
GluThr: 3.4 ± 0.273
GluVal: 5.715 ± 0.369
GluTrp: 0.292 ± 0.097
GluTyr: 3.901 ± 0.264
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.254 ± 0.237
PheCys: 0.521 ± 0.127
PheAsp: 3.942 ± 0.238
PheGlu: 5.215 ± 0.357
PhePhe: 4.255 ± 0.316
PheGly: 3.129 ± 0.267
PheHis: 0.667 ± 0.119
PheIle: 4.38 ± 0.322
PheLys: 4.756 ± 0.327
PheLeu: 6.737 ± 0.399
PheMet: 1.481 ± 0.201
PheAsn: 3.796 ± 0.291
PhePro: 2.274 ± 0.204
PheGln: 0.939 ± 0.126
PheArg: 2.44 ± 0.212
PheSer: 6.424 ± 0.399
PheThr: 2.628 ± 0.278
PheVal: 4.985 ± 0.317
PheTrp: 0.313 ± 0.081
PheTyr: 4.672 ± 0.367
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.294 ± 0.257
GlyCys: 0.229 ± 0.094
GlyAsp: 1.773 ± 0.212
GlyGlu: 3.129 ± 0.248
GlyPhe: 2.753 ± 0.214
GlyGly: 2.378 ± 0.27
GlyHis: 0.542 ± 0.111
GlyIle: 3.692 ± 0.279
GlyLys: 4.422 ± 0.313
GlyLeu: 3.942 ± 0.296
GlyMet: 0.959 ± 0.144
GlyAsn: 3.15 ± 0.283
GlyPro: 0.772 ± 0.124
GlyGln: 1.168 ± 0.16
GlyArg: 1.898 ± 0.246
GlySer: 3.15 ± 0.309
GlyThr: 2.211 ± 0.258
GlyVal: 3.066 ± 0.302
GlyTrp: 0.438 ± 0.096
GlyTyr: 2.253 ± 0.25
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.521 ± 0.11
HisCys: 0.063 ± 0.037
HisAsp: 0.417 ± 0.111
HisGlu: 0.73 ± 0.129
HisPhe: 0.834 ± 0.123
HisGly: 0.542 ± 0.116
HisHis: 0.167 ± 0.06
HisIle: 1.377 ± 0.165
HisLys: 1.398 ± 0.186
HisLeu: 1.523 ± 0.179
HisMet: 0.104 ± 0.05
HisAsn: 0.459 ± 0.091
HisPro: 0.667 ± 0.133
HisGln: 0.25 ± 0.078
HisArg: 0.396 ± 0.095
HisSer: 0.667 ± 0.123
HisThr: 0.542 ± 0.125
HisVal: 0.688 ± 0.133
HisTrp: 0.042 ± 0.03
HisTyr: 0.584 ± 0.107
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.4 ± 0.265
IleCys: 0.396 ± 0.09
IleAsp: 4.485 ± 0.249
IleGlu: 7.008 ± 0.399
IlePhe: 4.881 ± 0.312
IleGly: 2.92 ± 0.245
IleHis: 1.147 ± 0.151
IleIle: 4.86 ± 0.379
IleLys: 7.697 ± 0.513
IleLeu: 7.175 ± 0.407
IleMet: 1.106 ± 0.16
IleAsn: 5.194 ± 0.488
IlePro: 2.899 ± 0.24
IleGln: 1.898 ± 0.174
IleArg: 3.337 ± 0.286
IleSer: 7.28 ± 0.375
IleThr: 4.026 ± 0.263
IleVal: 4.505 ± 0.297
IleTrp: 0.334 ± 0.094
IleTyr: 4.985 ± 0.392
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.047 ± 0.305
LysCys: 0.188 ± 0.068
LysAsp: 5.319 ± 0.267
LysGlu: 10.262 ± 0.768
LysPhe: 5.736 ± 0.366
LysGly: 3.191 ± 0.296
LysHis: 1.064 ± 0.162
LysIle: 8.385 ± 0.579
LysLys: 8.448 ± 0.579
LysLeu: 7.572 ± 0.472
LysMet: 1.398 ± 0.184
LysAsn: 6.258 ± 0.454
LysPro: 2.19 ± 0.192
LysGln: 2.315 ± 0.219
LysArg: 3.984 ± 0.307
LysSer: 5.11 ± 0.348
LysThr: 4.693 ± 0.415
LysVal: 6.091 ± 0.358
LysTrp: 0.313 ± 0.07
LysTyr: 4.213 ± 0.343
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.359 ± 0.389
LeuCys: 0.563 ± 0.109
LeuAsp: 5.924 ± 0.4
LeuGlu: 8.469 ± 0.543
LeuPhe: 5.632 ± 0.36
LeuGly: 5.069 ± 0.335
LeuHis: 1.106 ± 0.142
LeuIle: 6.904 ± 0.343
LeuLys: 9.157 ± 0.505
LeuLeu: 9.115 ± 0.466
LeuMet: 1.564 ± 0.153
LeuAsn: 6.758 ± 0.484
LeuPro: 3.546 ± 0.284
LeuGln: 2.545 ± 0.271
LeuArg: 4.005 ± 0.323
LeuSer: 8.719 ± 0.461
LeuThr: 4.255 ± 0.345
LeuVal: 5.82 ± 0.336
LeuTrp: 0.417 ± 0.087
LeuTyr: 4.902 ± 0.338
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.521 ± 0.108
MetCys: 0.104 ± 0.054
MetAsp: 0.438 ± 0.103
MetGlu: 0.959 ± 0.155
MetPhe: 1.293 ± 0.179
MetGly: 0.626 ± 0.101
MetHis: 0.25 ± 0.056
MetIle: 1.168 ± 0.202
MetLys: 1.94 ± 0.211
MetLeu: 1.564 ± 0.174
MetMet: 0.271 ± 0.073
MetAsn: 0.855 ± 0.14
MetPro: 0.584 ± 0.106
MetGln: 0.501 ± 0.112
MetArg: 0.772 ± 0.113
MetSer: 1.252 ± 0.163
MetThr: 0.98 ± 0.149
MetVal: 0.647 ± 0.112
MetTrp: 0.104 ± 0.052
MetTyr: 0.667 ± 0.138
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.586 ± 0.259
AsnCys: 0.229 ± 0.087
AsnAsp: 3.087 ± 0.293
AsnGlu: 4.943 ± 0.395
AsnPhe: 4.693 ± 0.344
AsnGly: 2.566 ± 0.29
AsnHis: 0.626 ± 0.133
AsnIle: 5.736 ± 0.413
AsnLys: 5.069 ± 0.362
AsnLeu: 7.405 ± 0.415
AsnMet: 0.626 ± 0.119
AsnAsn: 3.984 ± 0.47
AsnPro: 2.753 ± 0.303
AsnGln: 1.794 ± 0.347
AsnArg: 2.211 ± 0.213
AsnSer: 4.902 ± 0.431
AsnThr: 3.713 ± 0.361
AsnVal: 3.233 ± 0.276
AsnTrp: 0.292 ± 0.086
AsnTyr: 3.546 ± 0.332
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.939 ± 0.168
ProCys: 0.104 ± 0.053
ProAsp: 1.523 ± 0.182
ProGlu: 2.399 ± 0.253
ProPhe: 1.94 ± 0.183
ProGly: 1.21 ± 0.161
ProHis: 0.438 ± 0.11
ProIle: 3.087 ± 0.269
ProLys: 2.753 ± 0.256
ProLeu: 2.628 ± 0.28
ProMet: 0.521 ± 0.118
ProAsn: 1.794 ± 0.221
ProPro: 1.21 ± 0.192
ProGln: 1.502 ± 0.178
ProArg: 0.98 ± 0.177
ProSer: 2.837 ± 0.241
ProThr: 1.564 ± 0.185
ProVal: 1.794 ± 0.234
ProTrp: 0.188 ± 0.059
ProTyr: 2.315 ± 0.222
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.398 ± 0.168
GlnCys: 0.063 ± 0.042
GlnAsp: 1.356 ± 0.176
GlnGlu: 2.545 ± 0.252
GlnPhe: 1.252 ± 0.164
GlnGly: 1.46 ± 0.247
GlnHis: 0.167 ± 0.06
GlnIle: 2.086 ± 0.209
GlnLys: 2.524 ± 0.335
GlnLeu: 2.274 ± 0.282
GlnMet: 0.459 ± 0.108
GlnAsn: 2.148 ± 0.282
GlnPro: 0.48 ± 0.09
GlnGln: 1.272 ± 0.247
GlnArg: 0.959 ± 0.173
GlnSer: 1.272 ± 0.144
GlnThr: 1.356 ± 0.152
GlnVal: 1.314 ± 0.2
GlnTrp: 0.125 ± 0.051
GlnTyr: 1.293 ± 0.18
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.439 ± 0.189
ArgCys: 0.25 ± 0.076
ArgAsp: 1.627 ± 0.18
ArgGlu: 3.108 ± 0.244
ArgPhe: 2.586 ± 0.208
ArgGly: 1.982 ± 0.223
ArgHis: 0.647 ± 0.121
ArgIle: 3.108 ± 0.272
ArgLys: 3.984 ± 0.378
ArgLeu: 3.546 ± 0.25
ArgMet: 0.647 ± 0.135
ArgAsn: 2.899 ± 0.296
ArgPro: 1.001 ± 0.134
ArgGln: 0.793 ± 0.153
ArgArg: 1.982 ± 0.236
ArgSer: 1.856 ± 0.175
ArgThr: 1.731 ± 0.24
ArgVal: 2.232 ± 0.205
ArgTrp: 0.209 ± 0.061
ArgTyr: 1.877 ± 0.224
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.129 ± 0.298
SerCys: 0.563 ± 0.131
SerAsp: 3.421 ± 0.302
SerGlu: 5.861 ± 0.437
SerPhe: 5.59 ± 0.338
SerGly: 3.88 ± 0.3
SerHis: 0.73 ± 0.118
SerIle: 5.861 ± 0.347
SerLys: 5.444 ± 0.346
SerLeu: 8.948 ± 0.519
SerMet: 1.293 ± 0.193
SerAsn: 4.005 ± 0.31
SerPro: 2.42 ± 0.215
SerGln: 2.169 ± 0.228
SerArg: 2.294 ± 0.228
SerSer: 6.237 ± 0.493
SerThr: 3.442 ± 0.278
SerVal: 5.173 ± 0.363
SerTrp: 0.438 ± 0.085
SerTyr: 4.005 ± 0.299
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.606 ± 0.23
ThrCys: 0.313 ± 0.089
ThrAsp: 2.065 ± 0.216
ThrGlu: 3.066 ± 0.295
ThrPhe: 3.671 ± 0.319
ThrGly: 2.086 ± 0.291
ThrHis: 0.688 ± 0.123
ThrIle: 3.963 ± 0.312
ThrLys: 4.088 ± 0.308
ThrLeu: 5.194 ± 0.347
ThrMet: 0.626 ± 0.119
ThrAsn: 3.421 ± 0.339
ThrPro: 2.211 ± 0.289
ThrGln: 1.314 ± 0.169
ThrArg: 1.335 ± 0.16
ThrSer: 4.401 ± 0.37
ThrThr: 3.191 ± 0.31
ThrVal: 2.002 ± 0.235
ThrTrp: 0.271 ± 0.072
ThrTyr: 2.315 ± 0.257
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.233 ± 0.279
ValCys: 0.396 ± 0.091
ValAsp: 4.151 ± 0.259
ValGlu: 5.319 ± 0.414
ValPhe: 4.735 ± 0.285
ValGly: 2.983 ± 0.275
ValHis: 0.647 ± 0.102
ValIle: 3.859 ± 0.301
ValLys: 5.528 ± 0.368
ValLeu: 6.216 ± 0.375
ValMet: 0.793 ± 0.149
ValAsn: 3.734 ± 0.293
ValPro: 2.378 ± 0.219
ValGln: 1.189 ± 0.132
ValArg: 1.836 ± 0.216
ValSer: 4.86 ± 0.303
ValThr: 1.836 ± 0.251
ValVal: 4.568 ± 0.332
ValTrp: 0.396 ± 0.095
ValTyr: 3.755 ± 0.265
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.209 ± 0.064
TrpCys: 0.021 ± 0.02
TrpAsp: 0.188 ± 0.07
TrpGlu: 0.605 ± 0.103
TrpPhe: 0.459 ± 0.093
TrpGly: 0.313 ± 0.081
TrpHis: 0.146 ± 0.051
TrpIle: 0.459 ± 0.086
TrpLys: 0.459 ± 0.102
TrpLeu: 0.334 ± 0.076
TrpMet: 0.125 ± 0.063
TrpAsn: 0.292 ± 0.073
TrpPro: 0.146 ± 0.049
TrpGln: 0.125 ± 0.057
TrpArg: 0.229 ± 0.066
TrpSer: 0.229 ± 0.074
TrpThr: 0.271 ± 0.082
TrpVal: 0.313 ± 0.083
TrpTrp: 0.021 ± 0.023
TrpTyr: 0.313 ± 0.087
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.378 ± 0.247
TyrCys: 0.396 ± 0.102
TyrAsp: 2.899 ± 0.237
TyrGlu: 3.483 ± 0.288
TyrPhe: 3.755 ± 0.299
TyrGly: 2.566 ± 0.295
TyrHis: 0.73 ± 0.127
TyrIle: 4.151 ± 0.359
TyrLys: 5.11 ± 0.421
TyrLeu: 6.132 ± 0.353
TyrMet: 0.793 ± 0.115
TyrAsn: 3.337 ± 0.254
TyrPro: 1.815 ± 0.219
TyrGln: 1.481 ± 0.192
TyrArg: 2.148 ± 0.206
TyrSer: 4.234 ± 0.278
TyrThr: 2.503 ± 0.286
TyrVal: 3.504 ± 0.273
TyrTrp: 0.417 ± 0.09
TyrTyr: 3.024 ± 0.298
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
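For readers who want to work with the listing above programmatically, the following Python sketch parses entries of the form "AlaAla: 1.669 ± 0.234" into a dictionary keyed by (first residue, second residue). The regular expression and the name `parse_cells` are assumptions of this example; the pattern also tolerates a numeric value glued in front of the dipeptide name and skips the bare row labels.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_cells(text):
    """Return {(first, second): (value_per_mille, error_per_mille)}."""
    table = {}
    for match in CELL.finditer(text):
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


# A few lines in the two layouts the listing may appear in.
sample = """Ala
1.669AlaAla: 1.669 ± 0.234
0.188AlaCys: 0.188 ± 0.054
LysGlu: 10.262 ± 0.768
"""

cells = parse_cells(sample)
# Dipeptide with the highest frequency among the parsed entries.
top = max(cells, key=lambda key: cells[key][0])
print(top, cells[top])
```

Applied to the full listing, the same dictionary can be sorted by value to rank the most and least frequent dipeptides.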
Statistics based on 170 proteins (47943 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
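One way such a protein-level bootstrap could look is sketched below in Python. The exact procedure used for this page is not described here, so everything in the sketch is an assumption: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of the 100 replicates, and the standard deviation of those replicates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the phiYS40 proteome.
toy = ["MKKLLEE", "MEELKK", "MKEEK"]
mean_ee, err_ee = bootstrap_error(toy, "EE")
print(mean_ee, err_ee)
```

Resampling whole proteins keeps the pairs from one protein together, so the spread reflects variation between proteins rather than between individual positions.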

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski