Amino acid dipepetide frequency for Enterobacteria phage YYZ-2008

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.361AlaAla: 10.361 ± 1.664
1.074AlaCys: 1.074 ± 0.268
5.559AlaAsp: 5.559 ± 0.677
7.076AlaGlu: 7.076 ± 0.896
2.843AlaPhe: 2.843 ± 0.459
7.455AlaGly: 7.455 ± 0.712
1.579AlaHis: 1.579 ± 0.349
5.18AlaIle: 5.18 ± 0.629
3.917AlaLys: 3.917 ± 0.552
8.529AlaLeu: 8.529 ± 0.959
3.159AlaMet: 3.159 ± 0.382
3.159AlaAsn: 3.159 ± 0.491
2.401AlaPro: 2.401 ± 0.435
4.296AlaGln: 4.296 ± 0.456
6.318AlaArg: 6.318 ± 0.962
6.697AlaSer: 6.697 ± 0.967
4.106AlaThr: 4.106 ± 0.705
6.507AlaVal: 6.507 ± 0.955
2.022AlaTrp: 2.022 ± 0.288
1.958AlaTyr: 1.958 ± 0.369
0.063AlaXaa: 0.063 ± 0.072
Cys
1.011CysAla: 1.011 ± 0.304
0.569CysCys: 0.569 ± 0.185
0.884CysAsp: 0.884 ± 0.228
0.632CysGlu: 0.632 ± 0.229
0.569CysPhe: 0.569 ± 0.188
1.074CysGly: 1.074 ± 0.283
0.632CysHis: 0.632 ± 0.223
0.758CysIle: 0.758 ± 0.222
0.632CysLys: 0.632 ± 0.232
0.884CysLeu: 0.884 ± 0.239
0.569CysMet: 0.569 ± 0.19
0.442CysAsn: 0.442 ± 0.166
0.632CysPro: 0.632 ± 0.243
0.316CysGln: 0.316 ± 0.143
1.39CysArg: 1.39 ± 0.364
1.327CysSer: 1.327 ± 0.287
0.948CysThr: 0.948 ± 0.215
1.011CysVal: 1.011 ± 0.211
0.442CysTrp: 0.442 ± 0.166
0.379CysTyr: 0.379 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
4.738AspAla: 4.738 ± 0.635
0.505AspCys: 0.505 ± 0.178
3.791AspAsp: 3.791 ± 0.53
3.791AspGlu: 3.791 ± 0.549
1.643AspPhe: 1.643 ± 0.3
6.002AspGly: 6.002 ± 0.781
0.632AspHis: 0.632 ± 0.173
2.906AspIle: 2.906 ± 0.342
2.969AspLys: 2.969 ± 0.533
3.791AspLeu: 3.791 ± 0.604
1.39AspMet: 1.39 ± 0.34
2.717AspAsn: 2.717 ± 0.428
2.401AspPro: 2.401 ± 0.466
1.2AspGln: 1.2 ± 0.294
3.159AspArg: 3.159 ± 0.485
3.096AspSer: 3.096 ± 0.505
2.969AspThr: 2.969 ± 0.466
4.549AspVal: 4.549 ± 0.732
0.948AspTrp: 0.948 ± 0.256
1.958AspTyr: 1.958 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
6.76GluAla: 6.76 ± 0.853
1.264GluCys: 1.264 ± 0.313
2.906GluAsp: 2.906 ± 0.479
3.664GluGlu: 3.664 ± 0.653
2.59GluPhe: 2.59 ± 0.367
3.791GluGly: 3.791 ± 0.457
1.137GluHis: 1.137 ± 0.28
4.549GluIle: 4.549 ± 0.448
4.106GluLys: 4.106 ± 0.474
6.002GluLeu: 6.002 ± 0.745
2.211GluMet: 2.211 ± 0.327
2.906GluAsn: 2.906 ± 0.442
2.211GluPro: 2.211 ± 0.331
3.727GluGln: 3.727 ± 0.529
4.485GluArg: 4.485 ± 0.633
3.538GluSer: 3.538 ± 0.533
2.527GluThr: 2.527 ± 0.404
3.159GluVal: 3.159 ± 0.495
0.884GluTrp: 0.884 ± 0.231
1.895GluTyr: 1.895 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
2.085PheAla: 2.085 ± 0.438
0.632PheCys: 0.632 ± 0.191
1.39PheAsp: 1.39 ± 0.29
1.579PheGlu: 1.579 ± 0.315
0.948PhePhe: 0.948 ± 0.253
2.337PheGly: 2.337 ± 0.322
0.821PheHis: 0.821 ± 0.237
1.706PheIle: 1.706 ± 0.424
1.453PheLys: 1.453 ± 0.336
2.337PheLeu: 2.337 ± 0.407
1.011PheMet: 1.011 ± 0.246
1.643PheAsn: 1.643 ± 0.298
1.011PhePro: 1.011 ± 0.22
1.137PheGln: 1.137 ± 0.203
2.717PheArg: 2.717 ± 0.453
4.106PheSer: 4.106 ± 0.604
2.274PheThr: 2.274 ± 0.423
2.274PheVal: 2.274 ± 0.405
0.569PheTrp: 0.569 ± 0.194
1.074PheTyr: 1.074 ± 0.257
0.063PheXaa: 0.063 ± 0.062
Gly
5.623GlyAla: 5.623 ± 0.707
0.632GlyCys: 0.632 ± 0.207
4.422GlyAsp: 4.422 ± 0.558
5.37GlyGlu: 5.37 ± 0.671
2.653GlyPhe: 2.653 ± 0.353
5.433GlyGly: 5.433 ± 0.649
1.074GlyHis: 1.074 ± 0.271
4.612GlyIle: 4.612 ± 0.553
4.738GlyLys: 4.738 ± 0.539
5.244GlyLeu: 5.244 ± 0.653
3.159GlyMet: 3.159 ± 0.536
3.601GlyAsn: 3.601 ± 0.515
1.958GlyPro: 1.958 ± 0.689
3.032GlyGln: 3.032 ± 0.47
4.296GlyArg: 4.296 ± 0.594
4.675GlySer: 4.675 ± 0.547
3.348GlyThr: 3.348 ± 0.582
5.054GlyVal: 5.054 ± 0.589
1.453GlyTrp: 1.453 ± 0.291
2.022GlyTyr: 2.022 ± 0.377
0.0GlyXaa: 0.0 ± 0.0
His
1.39HisAla: 1.39 ± 0.338
0.569HisCys: 0.569 ± 0.201
1.011HisAsp: 1.011 ± 0.241
0.948HisGlu: 0.948 ± 0.227
0.821HisPhe: 0.821 ± 0.219
1.958HisGly: 1.958 ± 0.336
0.758HisHis: 0.758 ± 0.222
1.327HisIle: 1.327 ± 0.362
1.2HisLys: 1.2 ± 0.247
1.579HisLeu: 1.579 ± 0.346
0.253HisMet: 0.253 ± 0.171
0.442HisAsn: 0.442 ± 0.173
1.2HisPro: 1.2 ± 0.305
0.442HisGln: 0.442 ± 0.163
1.074HisArg: 1.074 ± 0.271
1.264HisSer: 1.264 ± 0.343
1.074HisThr: 1.074 ± 0.278
0.884HisVal: 0.884 ± 0.245
0.253HisTrp: 0.253 ± 0.111
0.758HisTyr: 0.758 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
5.117IleAla: 5.117 ± 0.65
0.884IleCys: 0.884 ± 0.236
3.411IleAsp: 3.411 ± 0.466
3.475IleGlu: 3.475 ± 0.555
0.758IlePhe: 0.758 ± 0.235
3.348IleGly: 3.348 ± 0.376
0.884IleHis: 0.884 ± 0.228
2.401IleIle: 2.401 ± 0.36
3.348IleLys: 3.348 ± 0.429
3.727IleLeu: 3.727 ± 0.599
1.2IleMet: 1.2 ± 0.255
2.527IleAsn: 2.527 ± 0.488
2.78IlePro: 2.78 ± 0.544
1.895IleGln: 1.895 ± 0.333
4.485IleArg: 4.485 ± 0.529
4.991IleSer: 4.991 ± 0.637
4.991IleThr: 4.991 ± 0.516
2.401IleVal: 2.401 ± 0.374
0.505IleTrp: 0.505 ± 0.208
1.264IleTyr: 1.264 ± 0.357
0.0IleXaa: 0.0 ± 0.0
Lys
5.433LysAla: 5.433 ± 0.553
0.505LysCys: 0.505 ± 0.2
2.464LysAsp: 2.464 ± 0.378
2.906LysGlu: 2.906 ± 0.43
1.137LysPhe: 1.137 ± 0.261
3.348LysGly: 3.348 ± 0.491
0.821LysHis: 0.821 ± 0.23
2.969LysIle: 2.969 ± 0.496
3.159LysLys: 3.159 ± 0.657
3.727LysLeu: 3.727 ± 0.496
1.643LysMet: 1.643 ± 0.37
2.59LysAsn: 2.59 ± 0.51
2.843LysPro: 2.843 ± 0.427
3.285LysGln: 3.285 ± 0.522
3.032LysArg: 3.032 ± 0.448
3.285LysSer: 3.285 ± 0.549
4.17LysThr: 4.17 ± 0.523
2.717LysVal: 2.717 ± 0.598
0.758LysTrp: 0.758 ± 0.245
1.074LysTyr: 1.074 ± 0.228
0.126LysXaa: 0.126 ± 0.142
Leu
8.971LeuAla: 8.971 ± 0.758
1.832LeuCys: 1.832 ± 0.413
3.791LeuAsp: 3.791 ± 0.51
4.17LeuGlu: 4.17 ± 0.577
2.59LeuPhe: 2.59 ± 0.426
4.549LeuGly: 4.549 ± 0.709
1.769LeuHis: 1.769 ± 0.35
4.422LeuIle: 4.422 ± 0.433
4.864LeuLys: 4.864 ± 0.656
5.812LeuLeu: 5.812 ± 0.695
2.59LeuMet: 2.59 ± 0.447
3.285LeuAsn: 3.285 ± 0.48
3.664LeuPro: 3.664 ± 0.484
3.348LeuGln: 3.348 ± 0.538
6.128LeuArg: 6.128 ± 0.561
5.875LeuSer: 5.875 ± 0.638
5.623LeuThr: 5.623 ± 0.718
4.675LeuVal: 4.675 ± 0.665
1.579LeuTrp: 1.579 ± 0.319
2.085LeuTyr: 2.085 ± 0.378
0.063LeuXaa: 0.063 ± 0.065
Met
2.906MetAla: 2.906 ± 0.414
0.19MetCys: 0.19 ± 0.116
1.516MetAsp: 1.516 ± 0.323
1.137MetGlu: 1.137 ± 0.314
0.821MetPhe: 0.821 ± 0.234
1.706MetGly: 1.706 ± 0.357
0.253MetHis: 0.253 ± 0.14
1.453MetIle: 1.453 ± 0.268
1.579MetLys: 1.579 ± 0.323
3.348MetLeu: 3.348 ± 0.501
0.505MetMet: 0.505 ± 0.192
1.39MetAsn: 1.39 ± 0.276
1.39MetPro: 1.39 ± 0.311
1.832MetGln: 1.832 ± 0.336
2.022MetArg: 2.022 ± 0.308
2.401MetSer: 2.401 ± 0.363
2.717MetThr: 2.717 ± 0.375
1.453MetVal: 1.453 ± 0.282
0.442MetTrp: 0.442 ± 0.145
0.253MetTyr: 0.253 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
5.623AsnAla: 5.623 ± 0.681
0.695AsnCys: 0.695 ± 0.195
2.022AsnAsp: 2.022 ± 0.322
2.843AsnGlu: 2.843 ± 0.522
0.948AsnPhe: 0.948 ± 0.297
3.917AsnGly: 3.917 ± 0.466
1.011AsnHis: 1.011 ± 0.231
2.274AsnIle: 2.274 ± 0.375
2.211AsnLys: 2.211 ± 0.404
2.843AsnLeu: 2.843 ± 0.42
0.821AsnMet: 0.821 ± 0.283
1.769AsnAsn: 1.769 ± 0.341
2.464AsnPro: 2.464 ± 0.313
1.643AsnGln: 1.643 ± 0.349
2.527AsnArg: 2.527 ± 0.415
2.527AsnSer: 2.527 ± 0.365
2.464AsnThr: 2.464 ± 0.505
2.085AsnVal: 2.085 ± 0.327
0.632AsnTrp: 0.632 ± 0.165
1.074AsnTyr: 1.074 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
3.664ProAla: 3.664 ± 0.567
0.379ProCys: 0.379 ± 0.146
3.98ProAsp: 3.98 ± 0.708
3.727ProGlu: 3.727 ± 0.42
1.706ProPhe: 1.706 ± 0.3
2.969ProGly: 2.969 ± 0.422
0.821ProHis: 0.821 ± 0.262
1.327ProIle: 1.327 ± 0.342
1.769ProLys: 1.769 ± 0.424
2.78ProLeu: 2.78 ± 0.32
1.011ProMet: 1.011 ± 0.29
1.453ProAsn: 1.453 ± 0.268
2.148ProPro: 2.148 ± 0.475
1.643ProGln: 1.643 ± 0.311
1.895ProArg: 1.895 ± 0.4
2.78ProSer: 2.78 ± 0.344
1.579ProThr: 1.579 ± 0.27
4.233ProVal: 4.233 ± 0.664
0.632ProTrp: 0.632 ± 0.182
1.137ProTyr: 1.137 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
3.601GlnAla: 3.601 ± 0.696
1.074GlnCys: 1.074 ± 0.238
1.895GlnAsp: 1.895 ± 0.352
2.906GlnGlu: 2.906 ± 0.456
1.643GlnPhe: 1.643 ± 0.29
2.843GlnGly: 2.843 ± 0.443
0.948GlnHis: 0.948 ± 0.226
3.032GlnIle: 3.032 ± 0.456
3.032GlnLys: 3.032 ± 0.452
3.538GlnLeu: 3.538 ± 0.53
1.39GlnMet: 1.39 ± 0.375
1.2GlnAsn: 1.2 ± 0.456
1.579GlnPro: 1.579 ± 0.267
2.337GlnGln: 2.337 ± 0.539
2.969GlnArg: 2.969 ± 0.499
2.969GlnSer: 2.969 ± 0.481
2.211GlnThr: 2.211 ± 0.416
2.401GlnVal: 2.401 ± 0.396
0.569GlnTrp: 0.569 ± 0.187
1.706GlnTyr: 1.706 ± 0.273
0.063GlnXaa: 0.063 ± 0.06
Arg
4.233ArgAla: 4.233 ± 0.737
0.821ArgCys: 0.821 ± 0.276
4.485ArgAsp: 4.485 ± 0.719
5.812ArgGlu: 5.812 ± 0.751
2.148ArgPhe: 2.148 ± 0.376
3.601ArgGly: 3.601 ± 0.513
1.832ArgHis: 1.832 ± 0.434
3.538ArgIle: 3.538 ± 0.588
3.538ArgLys: 3.538 ± 0.532
6.697ArgLeu: 6.697 ± 0.613
2.211ArgMet: 2.211 ± 0.333
3.727ArgAsn: 3.727 ± 0.46
2.274ArgPro: 2.274 ± 0.372
3.032ArgGln: 3.032 ± 0.464
6.191ArgArg: 6.191 ± 0.781
2.717ArgSer: 2.717 ± 0.402
3.538ArgThr: 3.538 ± 0.42
4.233ArgVal: 4.233 ± 0.537
1.074ArgTrp: 1.074 ± 0.289
1.895ArgTyr: 1.895 ± 0.376
0.0ArgXaa: 0.0 ± 0.0
Ser
7.771SerAla: 7.771 ± 1.092
1.074SerCys: 1.074 ± 0.28
3.159SerAsp: 3.159 ± 0.445
3.98SerGlu: 3.98 ± 0.514
2.022SerPhe: 2.022 ± 0.329
5.875SerGly: 5.875 ± 0.827
0.948SerHis: 0.948 ± 0.283
3.285SerIle: 3.285 ± 0.506
2.401SerLys: 2.401 ± 0.392
5.812SerLeu: 5.812 ± 0.68
2.464SerMet: 2.464 ± 0.388
2.906SerAsn: 2.906 ± 0.417
2.59SerPro: 2.59 ± 0.416
3.854SerGln: 3.854 ± 0.587
4.043SerArg: 4.043 ± 0.493
4.296SerSer: 4.296 ± 0.616
3.285SerThr: 3.285 ± 0.556
5.244SerVal: 5.244 ± 0.711
0.442SerTrp: 0.442 ± 0.161
1.579SerTyr: 1.579 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
5.749ThrAla: 5.749 ± 0.677
0.948ThrCys: 0.948 ± 0.229
3.032ThrAsp: 3.032 ± 0.505
3.98ThrGlu: 3.98 ± 0.661
2.653ThrPhe: 2.653 ± 0.504
5.496ThrGly: 5.496 ± 0.677
1.453ThrHis: 1.453 ± 0.323
3.285ThrIle: 3.285 ± 0.509
2.274ThrLys: 2.274 ± 0.407
6.065ThrLeu: 6.065 ± 0.778
0.948ThrMet: 0.948 ± 0.232
1.769ThrAsn: 1.769 ± 0.346
3.159ThrPro: 3.159 ± 0.562
2.148ThrGln: 2.148 ± 0.325
2.59ThrArg: 2.59 ± 0.409
3.348ThrSer: 3.348 ± 0.563
3.348ThrThr: 3.348 ± 0.518
4.549ThrVal: 4.549 ± 0.564
0.632ThrTrp: 0.632 ± 0.173
1.264ThrTyr: 1.264 ± 0.222
0.063ThrXaa: 0.063 ± 0.063
Val
5.686ValAla: 5.686 ± 0.652
0.884ValCys: 0.884 ± 0.291
2.906ValAsp: 2.906 ± 0.34
4.296ValGlu: 4.296 ± 0.408
2.527ValPhe: 2.527 ± 0.445
3.791ValGly: 3.791 ± 0.59
0.948ValHis: 0.948 ± 0.233
3.727ValIle: 3.727 ± 0.487
3.411ValLys: 3.411 ± 0.499
5.18ValLeu: 5.18 ± 0.607
1.579ValMet: 1.579 ± 0.33
3.664ValAsn: 3.664 ± 0.376
3.222ValPro: 3.222 ± 0.509
2.085ValGln: 2.085 ± 0.417
4.675ValArg: 4.675 ± 0.561
4.043ValSer: 4.043 ± 0.566
4.801ValThr: 4.801 ± 0.658
4.928ValVal: 4.928 ± 0.556
0.948ValTrp: 0.948 ± 0.289
1.832ValTyr: 1.832 ± 0.349
0.063ValXaa: 0.063 ± 0.072
Trp
0.948TrpAla: 0.948 ± 0.218
0.379TrpCys: 0.379 ± 0.156
0.695TrpAsp: 0.695 ± 0.27
0.569TrpGlu: 0.569 ± 0.161
0.758TrpPhe: 0.758 ± 0.241
1.011TrpGly: 1.011 ± 0.259
0.505TrpHis: 0.505 ± 0.174
0.632TrpIle: 0.632 ± 0.188
0.569TrpLys: 0.569 ± 0.203
1.706TrpLeu: 1.706 ± 0.404
0.505TrpMet: 0.505 ± 0.17
0.442TrpAsn: 0.442 ± 0.143
0.505TrpPro: 0.505 ± 0.17
0.948TrpGln: 0.948 ± 0.206
1.264TrpArg: 1.264 ± 0.24
1.011TrpSer: 1.011 ± 0.221
1.011TrpThr: 1.011 ± 0.243
1.264TrpVal: 1.264 ± 0.269
0.442TrpTrp: 0.442 ± 0.186
0.505TrpTyr: 0.505 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.59TyrAla: 2.59 ± 0.374
0.19TyrCys: 0.19 ± 0.101
1.769TyrAsp: 1.769 ± 0.363
1.453TyrGlu: 1.453 ± 0.353
1.39TyrPhe: 1.39 ± 0.292
1.895TyrGly: 1.895 ± 0.401
0.379TyrHis: 0.379 ± 0.133
1.011TyrIle: 1.011 ± 0.322
0.821TyrLys: 0.821 ± 0.209
1.958TyrLeu: 1.958 ± 0.346
0.632TyrMet: 0.632 ± 0.188
0.758TyrAsn: 0.758 ± 0.203
1.2TyrPro: 1.2 ± 0.322
1.643TyrGln: 1.643 ± 0.42
2.337TyrArg: 2.337 ± 0.336
1.958TyrSer: 1.958 ± 0.319
1.706TyrThr: 1.706 ± 0.322
1.579TyrVal: 1.579 ± 0.363
0.442TyrTrp: 0.442 ± 0.154
0.632TyrTyr: 0.632 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.063XaaAla: 0.063 ± 0.06
0.0XaaCys: 0.0 ± 0.0
0.063XaaAsp: 0.063 ± 0.063
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.126XaaLeu: 0.126 ± 0.096
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.063XaaGln: 0.063 ± 0.065
0.0XaaArg: 0.0 ± 0.0
0.126XaaSer: 0.126 ± 0.105
0.063XaaThr: 0.063 ± 0.071
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15830 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski