Amino acid dipeptide frequency for Vibrio phage phi50-12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
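The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names (`dipeptide_permille`, `classify`) are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein, and the expected value is the uniform 1/400 baseline mentioned in the text.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform baseline from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"
```

Calling `dipeptide_permille` on a list of protein sequences returns all 441 dipeptides, including those with zero counts, so the result maps directly onto a 21 x 21 table.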

All values are given in per mille (‰); multiply by 10⁻³ to convert them to fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.02 ± 0.822 | 0.837 ± 0.238 | 3.718 ± 0.392 | 4.416 ± 0.433 | 3.068 ± 0.342 | 4.555 ± 0.516 | 1.255 ± 0.241 | 5.252 ± 0.481 | 4.276 ± 0.449 | 6.368 ± 0.78 | 1.813 ± 0.25 | 4.044 ± 0.488 | 1.72 ± 0.255 | 2.789 ± 0.343 | 2.882 ± 0.409 | 4.23 ± 0.475 | 4.323 ± 0.64 | 4.694 ± 0.736 | 0.697 ± 0.165 | 2.975 ± 0.362 | 0.0 ± 0.0
Cys | 0.511 ± 0.181 | 0.093 ± 0.069 | 0.837 ± 0.262 | 0.744 ± 0.197 | 0.558 ± 0.163 | 0.837 ± 0.196 | 0.372 ± 0.131 | 0.744 ± 0.208 | 0.93 ± 0.223 | 1.023 ± 0.288 | 0.186 ± 0.099 | 0.186 ± 0.085 | 0.232 ± 0.102 | 0.372 ± 0.131 | 0.139 ± 0.08 | 0.511 ± 0.197 | 0.651 ± 0.195 | 0.837 ± 0.226 | 0.139 ± 0.076 | 0.418 ± 0.134 | 0.0 ± 0.0
Asp | 5.066 ± 0.678 | 0.697 ± 0.198 | 3.811 ± 0.487 | 4.323 ± 0.563 | 2.649 ± 0.28 | 4.369 ± 0.418 | 0.79 ± 0.211 | 4.09 ± 0.375 | 4.555 ± 0.48 | 5.02 ± 0.528 | 1.58 ± 0.289 | 3.811 ± 0.42 | 2.138 ± 0.452 | 1.487 ± 0.295 | 2.045 ± 0.255 | 4.183 ± 0.432 | 5.159 ± 0.508 | 3.579 ± 0.399 | 0.93 ± 0.178 | 2.928 ± 0.387 | 0.0 ± 0.0
Glu | 4.927 ± 0.564 | 0.604 ± 0.182 | 4.508 ± 0.407 | 5.763 ± 0.806 | 3.718 ± 0.348 | 4.044 ± 0.358 | 0.976 ± 0.248 | 4.137 ± 0.566 | 4.88 ± 0.572 | 7.994 ± 0.524 | 2.463 ± 0.327 | 3.254 ± 0.401 | 2.928 ± 0.521 | 3.858 ± 0.447 | 3.114 ± 0.452 | 4.648 ± 0.469 | 3.672 ± 0.351 | 5.763 ± 0.553 | 1.069 ± 0.193 | 2.37 ± 0.34 | 0.0 ± 0.0
Phe | 2.231 ± 0.31 | 0.325 ± 0.119 | 3.486 ± 0.32 | 2.928 ± 0.37 | 1.116 ± 0.254 | 2.928 ± 0.286 | 0.883 ± 0.161 | 2.463 ± 0.349 | 3.579 ± 0.443 | 2.324 ± 0.366 | 1.255 ± 0.196 | 2.789 ± 0.349 | 1.208 ± 0.275 | 1.348 ± 0.23 | 1.394 ± 0.232 | 1.859 ± 0.27 | 3.393 ± 0.305 | 2.603 ± 0.268 | 0.325 ± 0.118 | 1.673 ± 0.241 | 0.0 ± 0.0
Gly | 3.904 ± 0.467 | 0.697 ± 0.251 | 3.021 ± 0.323 | 4.09 ± 0.393 | 2.649 ± 0.41 | 3.951 ± 0.583 | 1.255 ± 0.246 | 4.044 ± 0.461 | 5.531 ± 0.457 | 5.159 ± 0.54 | 1.952 ± 0.272 | 3.765 ± 0.362 | 0.93 ± 0.186 | 1.859 ± 0.205 | 1.999 ± 0.339 | 3.997 ± 0.583 | 4.276 ± 0.466 | 3.997 ± 0.399 | 0.697 ± 0.182 | 3.254 ± 0.39 | 0.0 ± 0.0
His | 1.023 ± 0.193 | 0.279 ± 0.138 | 1.301 ± 0.235 | 1.162 ± 0.224 | 0.93 ± 0.208 | 1.255 ± 0.239 | 0.558 ± 0.193 | 0.883 ± 0.191 | 1.348 ± 0.307 | 1.673 ± 0.284 | 0.837 ± 0.205 | 1.069 ± 0.237 | 0.79 ± 0.239 | 0.697 ± 0.22 | 0.93 ± 0.175 | 0.976 ± 0.248 | 1.023 ± 0.269 | 1.208 ± 0.303 | 0.186 ± 0.108 | 0.93 ± 0.177 | 0.0 ± 0.0
Ile | 4.555 ± 0.482 | 0.79 ± 0.23 | 3.718 ± 0.358 | 4.323 ± 0.516 | 1.58 ± 0.316 | 3.161 ± 0.43 | 1.348 ± 0.316 | 3.951 ± 0.495 | 5.392 ± 0.751 | 4.508 ± 0.567 | 1.673 ± 0.272 | 4.508 ± 0.552 | 2.417 ± 0.326 | 2.556 ± 0.357 | 2.649 ± 0.359 | 3.858 ± 0.415 | 4.183 ± 0.52 | 3.858 ± 0.432 | 0.465 ± 0.136 | 1.999 ± 0.281 | 0.0 ± 0.0
Lys | 5.903 ± 0.692 | 0.511 ± 0.179 | 5.252 ± 0.588 | 7.297 ± 0.695 | 2.463 ± 0.378 | 4.834 ± 0.413 | 1.627 ± 0.316 | 3.532 ± 0.367 | 4.323 ± 0.492 | 5.996 ± 0.529 | 2.277 ± 0.355 | 3.486 ± 0.46 | 3.858 ± 0.554 | 3.3 ± 0.394 | 2.742 ± 0.349 | 3.718 ± 0.344 | 4.416 ± 0.396 | 5.485 ± 0.386 | 0.604 ± 0.16 | 3.254 ± 0.3 | 0.0 ± 0.0
Leu | 6.832 ± 0.635 | 0.837 ± 0.24 | 6.879 ± 0.44 | 6.507 ± 0.684 | 3.3 ± 0.38 | 4.694 ± 0.442 | 1.301 ± 0.284 | 5.485 ± 0.443 | 5.624 ± 0.635 | 5.67 ± 0.486 | 2.556 ± 0.376 | 5.624 ± 0.504 | 3.625 ± 0.42 | 2.742 ± 0.401 | 4.23 ± 0.375 | 5.67 ± 0.51 | 4.694 ± 0.407 | 5.438 ± 0.444 | 0.604 ± 0.17 | 2.185 ± 0.274 | 0.0 ± 0.0
Met | 1.627 ± 0.243 | 0.279 ± 0.114 | 1.487 ± 0.299 | 1.952 ± 0.298 | 0.93 ± 0.199 | 1.58 ± 0.274 | 0.465 ± 0.124 | 1.72 ± 0.339 | 2.928 ± 0.332 | 2.231 ± 0.447 | 0.325 ± 0.116 | 1.952 ± 0.278 | 0.883 ± 0.227 | 1.394 ± 0.209 | 0.883 ± 0.217 | 2.185 ± 0.281 | 2.37 ± 0.32 | 1.627 ± 0.295 | 0.186 ± 0.093 | 1.255 ± 0.263 | 0.0 ± 0.0
Asn | 4.369 ± 0.515 | 0.651 ± 0.217 | 2.742 ± 0.35 | 3.904 ± 0.322 | 1.952 ± 0.247 | 3.439 ± 0.402 | 1.999 ± 0.356 | 3.161 ± 0.482 | 5.066 ± 0.522 | 4.323 ± 0.419 | 1.58 ± 0.289 | 3.625 ± 0.428 | 2.975 ± 0.32 | 3.068 ± 0.456 | 1.999 ± 0.266 | 3.579 ± 0.407 | 3.672 ± 0.369 | 3.068 ± 0.334 | 0.511 ± 0.135 | 2.417 ± 0.352 | 0.0 ± 0.0
Pro | 1.627 ± 0.232 | 0.372 ± 0.137 | 2.696 ± 0.354 | 3.951 ± 0.555 | 1.441 ± 0.299 | 0.651 ± 0.134 | 0.511 ± 0.173 | 2.138 ± 0.357 | 2.789 ± 0.346 | 3.114 ± 0.456 | 1.301 ± 0.216 | 1.627 ± 0.243 | 0.744 ± 0.185 | 1.441 ± 0.236 | 1.255 ± 0.264 | 2.324 ± 0.419 | 2.603 ± 0.325 | 3.579 ± 0.312 | 0.372 ± 0.122 | 1.394 ± 0.271 | 0.0 ± 0.0
Gln | 3.951 ± 0.492 | 0.465 ± 0.151 | 2.277 ± 0.277 | 4.183 ± 0.555 | 2.277 ± 0.275 | 2.51 ± 0.31 | 0.558 ± 0.214 | 2.138 ± 0.314 | 1.766 ± 0.262 | 3.904 ± 0.564 | 1.255 ± 0.309 | 1.58 ± 0.368 | 1.069 ± 0.194 | 1.999 ± 0.326 | 1.301 ± 0.243 | 2.556 ± 0.335 | 2.185 ± 0.456 | 2.603 ± 0.338 | 0.465 ± 0.176 | 1.116 ± 0.19 | 0.0 ± 0.0
Arg | 2.045 ± 0.339 | 0.604 ± 0.192 | 2.277 ± 0.3 | 3.021 ± 0.384 | 1.162 ± 0.241 | 2.231 ± 0.314 | 0.372 ± 0.122 | 2.51 ± 0.328 | 4.044 ± 0.499 | 3.161 ± 0.358 | 1.162 ± 0.249 | 2.277 ± 0.3 | 1.069 ± 0.226 | 1.58 ± 0.223 | 1.58 ± 0.247 | 1.627 ± 0.271 | 1.906 ± 0.255 | 2.603 ± 0.273 | 0.418 ± 0.125 | 1.58 ± 0.219 | 0.0 ± 0.0
Ser | 3.765 ± 0.448 | 0.465 ± 0.167 | 3.951 ± 0.388 | 4.927 ± 0.482 | 2.649 ± 0.325 | 4.09 ± 0.51 | 1.116 ± 0.241 | 3.393 ± 0.434 | 5.252 ± 0.47 | 5.206 ± 0.546 | 1.952 ± 0.327 | 4.09 ± 0.479 | 2.324 ± 0.283 | 2.324 ± 0.535 | 1.673 ± 0.233 | 3.672 ± 0.362 | 3.858 ± 0.393 | 3.532 ± 0.43 | 0.372 ± 0.138 | 2.417 ± 0.424 | 0.0 ± 0.0
Thr | 4.276 ± 0.433 | 0.325 ± 0.111 | 3.904 ± 0.424 | 3.997 ± 0.627 | 2.649 ± 0.308 | 4.648 ± 0.421 | 1.162 ± 0.195 | 4.741 ± 0.576 | 4.88 ± 0.522 | 5.949 ± 0.472 | 1.301 ± 0.262 | 3.114 ± 0.317 | 2.928 ± 0.323 | 2.277 ± 0.38 | 1.673 ± 0.26 | 3.858 ± 0.445 | 4.09 ± 0.534 | 5.485 ± 0.541 | 0.418 ± 0.134 | 2.975 ± 0.336 | 0.0 ± 0.0
Val | 4.555 ± 0.552 | 0.837 ± 0.256 | 4.508 ± 0.515 | 4.23 ± 0.538 | 2.51 ± 0.393 | 3.811 ± 0.4 | 1.255 ± 0.238 | 3.858 ± 0.441 | 4.694 ± 0.47 | 5.717 ± 0.471 | 1.952 ± 0.305 | 4.276 ± 0.468 | 2.277 ± 0.379 | 3.207 ± 0.383 | 2.696 ± 0.333 | 4.416 ± 0.421 | 5.438 ± 0.535 | 3.811 ± 0.591 | 0.744 ± 0.211 | 2.51 ± 0.395 | 0.0 ± 0.0
Trp | 0.418 ± 0.167 | 0.093 ± 0.051 | 0.418 ± 0.174 | 0.651 ± 0.204 | 0.465 ± 0.14 | 0.418 ± 0.12 | 0.279 ± 0.114 | 0.558 ± 0.182 | 0.79 ± 0.195 | 1.58 ± 0.329 | 0.186 ± 0.102 | 0.511 ± 0.199 | 0.186 ± 0.091 | 0.418 ± 0.12 | 0.372 ± 0.101 | 0.651 ± 0.148 | 0.232 ± 0.092 | 0.697 ± 0.136 | 0.093 ± 0.061 | 0.558 ± 0.17 | 0.0 ± 0.0
Tyr | 2.324 ± 0.27 | 0.558 ± 0.157 | 2.417 ± 0.308 | 2.324 ± 0.344 | 2.045 ± 0.336 | 3.021 ± 0.344 | 0.976 ± 0.254 | 2.696 ± 0.473 | 2.231 ± 0.317 | 3.579 ± 0.406 | 0.604 ± 0.21 | 2.51 ± 0.335 | 1.627 ± 0.266 | 1.441 ± 0.245 | 1.72 ± 0.21 | 2.51 ± 0.302 | 2.51 ± 0.366 | 2.835 ± 0.346 | 0.279 ± 0.104 | 1.348 ± 0.328 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 101 proteins (21516 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
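Protein-level bootstrapping can be illustrated as follows. This is a minimal sketch, not the database's own code: the function names are hypothetical, the reported error is assumed here to be the standard deviation of the resampled frequencies (the page does not say which statistic is used), and dipeptides are assumed to be counted as overlapping pairs within each protein.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency (in per mille).

    Whole proteins are resampled with replacement, so the resampling
    unit is the protein, not the individual residue or dipeptide.
    Assumption: the error is the sample standard deviation of the
    frequencies obtained across resamples.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the correlations within each sequence intact, which is presumably why the error is estimated at the protein level.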

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski