Amino acid dipepetide frequency for Proteus phage PM 93

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.688AlaAla: 2.688 ± 0.751
0.509AlaCys: 0.509 ± 0.222
4.868AlaAsp: 4.868 ± 0.703
5.449AlaGlu: 5.449 ± 0.581
2.833AlaPhe: 2.833 ± 0.441
6.393AlaGly: 6.393 ± 0.848
1.526AlaHis: 1.526 ± 0.328
4.795AlaIle: 4.795 ± 0.79
6.03AlaLys: 6.03 ± 0.844
5.812AlaLeu: 5.812 ± 0.592
3.051AlaMet: 3.051 ± 0.552
3.487AlaAsn: 3.487 ± 0.428
2.616AlaPro: 2.616 ± 0.428
4.722AlaGln: 4.722 ± 0.78
4.795AlaArg: 4.795 ± 0.628
5.594AlaSer: 5.594 ± 0.818
3.923AlaThr: 3.923 ± 0.58
4.722AlaVal: 4.722 ± 0.625
0.872AlaTrp: 0.872 ± 0.206
2.107AlaTyr: 2.107 ± 0.281
0.0AlaXaa: 0.0 ± 0.0
Cys
0.509CysAla: 0.509 ± 0.195
0.073CysCys: 0.073 ± 0.069
0.436CysAsp: 0.436 ± 0.178
0.944CysGlu: 0.944 ± 0.231
0.291CysPhe: 0.291 ± 0.144
0.727CysGly: 0.727 ± 0.25
0.218CysHis: 0.218 ± 0.154
0.436CysIle: 0.436 ± 0.194
0.799CysLys: 0.799 ± 0.272
1.09CysLeu: 1.09 ± 0.261
0.727CysMet: 0.727 ± 0.217
0.727CysAsn: 0.727 ± 0.254
0.509CysPro: 0.509 ± 0.217
0.218CysGln: 0.218 ± 0.127
0.727CysArg: 0.727 ± 0.242
0.509CysSer: 0.509 ± 0.178
0.363CysThr: 0.363 ± 0.189
0.799CysVal: 0.799 ± 0.244
0.218CysTrp: 0.218 ± 0.112
0.436CysTyr: 0.436 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
5.667AspAla: 5.667 ± 0.626
0.436AspCys: 0.436 ± 0.169
3.415AspAsp: 3.415 ± 0.588
5.231AspGlu: 5.231 ± 0.608
2.47AspPhe: 2.47 ± 0.491
5.086AspGly: 5.086 ± 0.669
0.944AspHis: 0.944 ± 0.22
4.287AspIle: 4.287 ± 0.552
4.505AspLys: 4.505 ± 0.585
5.231AspLeu: 5.231 ± 0.525
1.816AspMet: 1.816 ± 0.296
3.778AspAsn: 3.778 ± 0.53
1.598AspPro: 1.598 ± 0.333
0.727AspGln: 0.727 ± 0.21
2.252AspArg: 2.252 ± 0.265
3.487AspSer: 3.487 ± 0.521
4.141AspThr: 4.141 ± 0.54
4.577AspVal: 4.577 ± 0.596
1.09AspTrp: 1.09 ± 0.419
2.107AspTyr: 2.107 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
6.103GluAla: 6.103 ± 0.643
1.162GluCys: 1.162 ± 0.406
4.287GluAsp: 4.287 ± 0.397
5.885GluGlu: 5.885 ± 0.982
2.543GluPhe: 2.543 ± 0.358
5.231GluGly: 5.231 ± 0.629
1.671GluHis: 1.671 ± 0.412
3.197GluIle: 3.197 ± 0.432
4.214GluLys: 4.214 ± 0.658
6.829GluLeu: 6.829 ± 0.733
2.18GluMet: 2.18 ± 0.474
2.979GluAsn: 2.979 ± 0.508
1.889GluPro: 1.889 ± 0.392
4.069GluGln: 4.069 ± 0.585
3.778GluArg: 3.778 ± 0.594
2.543GluSer: 2.543 ± 0.406
2.979GluThr: 2.979 ± 0.456
6.393GluVal: 6.393 ± 0.74
1.38GluTrp: 1.38 ± 0.345
1.598GluTyr: 1.598 ± 0.37
0.0GluXaa: 0.0 ± 0.0
Phe
2.107PheAla: 2.107 ± 0.41
0.509PheCys: 0.509 ± 0.185
3.197PheAsp: 3.197 ± 0.624
2.616PheGlu: 2.616 ± 0.311
1.526PhePhe: 1.526 ± 0.316
2.833PheGly: 2.833 ± 0.534
0.436PheHis: 0.436 ± 0.171
2.761PheIle: 2.761 ± 0.292
3.705PheLys: 3.705 ± 0.624
3.56PheLeu: 3.56 ± 0.454
1.308PheMet: 1.308 ± 0.229
2.398PheAsn: 2.398 ± 0.416
1.308PhePro: 1.308 ± 0.266
1.09PheGln: 1.09 ± 0.275
1.816PheArg: 1.816 ± 0.303
2.688PheSer: 2.688 ± 0.81
1.889PheThr: 1.889 ± 0.361
2.034PheVal: 2.034 ± 0.397
0.073PheTrp: 0.073 ± 0.076
1.235PheTyr: 1.235 ± 0.391
0.0PheXaa: 0.0 ± 0.0
Gly
5.958GlyAla: 5.958 ± 0.64
1.38GlyCys: 1.38 ± 0.369
6.466GlyAsp: 6.466 ± 0.826
4.287GlyGlu: 4.287 ± 0.524
2.979GlyPhe: 2.979 ± 0.599
5.376GlyGly: 5.376 ± 0.708
1.598GlyHis: 1.598 ± 0.325
4.141GlyIle: 4.141 ± 0.481
7.12GlyLys: 7.12 ± 1.024
5.231GlyLeu: 5.231 ± 0.701
1.671GlyMet: 1.671 ± 0.356
3.415GlyAsn: 3.415 ± 0.6
0.0GlyPro: 0.0 ± 0.0
2.47GlyGln: 2.47 ± 0.355
3.051GlyArg: 3.051 ± 0.395
5.086GlySer: 5.086 ± 0.534
4.94GlyThr: 4.94 ± 0.628
4.214GlyVal: 4.214 ± 0.78
1.162GlyTrp: 1.162 ± 0.268
3.342GlyTyr: 3.342 ± 0.408
0.0GlyXaa: 0.0 ± 0.0
His
0.944HisAla: 0.944 ± 0.278
0.218HisCys: 0.218 ± 0.135
1.453HisAsp: 1.453 ± 0.331
1.162HisGlu: 1.162 ± 0.316
1.162HisPhe: 1.162 ± 0.231
1.671HisGly: 1.671 ± 0.468
0.436HisHis: 0.436 ± 0.189
1.162HisIle: 1.162 ± 0.227
1.38HisLys: 1.38 ± 0.367
2.688HisLeu: 2.688 ± 0.439
0.727HisMet: 0.727 ± 0.343
1.09HisAsn: 1.09 ± 0.357
0.291HisPro: 0.291 ± 0.12
0.654HisGln: 0.654 ± 0.218
1.017HisArg: 1.017 ± 0.313
0.872HisSer: 0.872 ± 0.236
1.017HisThr: 1.017 ± 0.256
1.017HisVal: 1.017 ± 0.286
0.145HisTrp: 0.145 ± 0.091
0.727HisTyr: 0.727 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
4.505IleAla: 4.505 ± 0.454
0.363IleCys: 0.363 ± 0.146
3.923IleAsp: 3.923 ± 0.517
4.214IleGlu: 4.214 ± 0.518
1.962IlePhe: 1.962 ± 0.452
4.868IleGly: 4.868 ± 0.552
1.526IleHis: 1.526 ± 0.317
4.722IleIle: 4.722 ± 0.684
4.505IleLys: 4.505 ± 0.737
3.778IleLeu: 3.778 ± 0.604
2.18IleMet: 2.18 ± 0.464
3.778IleAsn: 3.778 ± 0.487
2.252IlePro: 2.252 ± 0.445
2.107IleGln: 2.107 ± 0.369
2.979IleArg: 2.979 ± 0.441
3.269IleSer: 3.269 ± 0.371
3.269IleThr: 3.269 ± 0.51
3.051IleVal: 3.051 ± 0.466
0.654IleTrp: 0.654 ± 0.214
1.671IleTyr: 1.671 ± 0.584
0.0IleXaa: 0.0 ± 0.0
Lys
6.757LysAla: 6.757 ± 0.858
0.581LysCys: 0.581 ± 0.212
4.287LysAsp: 4.287 ± 0.693
5.885LysGlu: 5.885 ± 0.516
1.162LysPhe: 1.162 ± 0.361
4.868LysGly: 4.868 ± 0.628
1.162LysHis: 1.162 ± 0.342
3.56LysIle: 3.56 ± 0.463
3.851LysLys: 3.851 ± 0.686
6.393LysLeu: 6.393 ± 0.668
1.816LysMet: 1.816 ± 0.365
2.252LysAsn: 2.252 ± 0.371
3.269LysPro: 3.269 ± 0.478
2.833LysGln: 2.833 ± 0.425
3.705LysArg: 3.705 ± 0.36
4.069LysSer: 4.069 ± 0.5
3.124LysThr: 3.124 ± 0.503
6.03LysVal: 6.03 ± 0.758
1.017LysTrp: 1.017 ± 0.263
2.616LysTyr: 2.616 ± 0.352
0.0LysXaa: 0.0 ± 0.0
Leu
6.103LeuAla: 6.103 ± 0.992
0.872LeuCys: 0.872 ± 0.265
4.94LeuAsp: 4.94 ± 0.486
5.74LeuGlu: 5.74 ± 0.621
2.906LeuPhe: 2.906 ± 0.569
5.594LeuGly: 5.594 ± 0.551
1.235LeuHis: 1.235 ± 0.321
4.432LeuIle: 4.432 ± 0.63
5.74LeuLys: 5.74 ± 0.743
7.047LeuLeu: 7.047 ± 0.588
1.889LeuMet: 1.889 ± 0.371
4.505LeuAsn: 4.505 ± 0.525
3.633LeuPro: 3.633 ± 0.47
4.722LeuGln: 4.722 ± 0.756
3.851LeuArg: 3.851 ± 0.633
5.594LeuSer: 5.594 ± 0.679
4.141LeuThr: 4.141 ± 0.459
5.376LeuVal: 5.376 ± 0.719
0.944LeuTrp: 0.944 ± 0.227
3.051LeuTyr: 3.051 ± 0.468
0.0LeuXaa: 0.0 ± 0.0
Met
3.342MetAla: 3.342 ± 0.445
0.291MetCys: 0.291 ± 0.137
1.09MetAsp: 1.09 ± 0.352
1.889MetGlu: 1.889 ± 0.378
1.308MetPhe: 1.308 ± 0.283
1.962MetGly: 1.962 ± 0.375
0.581MetHis: 0.581 ± 0.146
1.816MetIle: 1.816 ± 0.461
2.18MetLys: 2.18 ± 0.419
3.197MetLeu: 3.197 ± 0.434
0.581MetMet: 0.581 ± 0.218
1.38MetAsn: 1.38 ± 0.254
0.799MetPro: 0.799 ± 0.182
1.453MetGln: 1.453 ± 0.333
2.034MetArg: 2.034 ± 0.494
2.688MetSer: 2.688 ± 0.393
1.526MetThr: 1.526 ± 0.245
1.671MetVal: 1.671 ± 0.317
0.436MetTrp: 0.436 ± 0.271
1.235MetTyr: 1.235 ± 0.371
0.0MetXaa: 0.0 ± 0.0
Asn
3.124AsnAla: 3.124 ± 0.407
0.872AsnCys: 0.872 ± 0.356
2.252AsnAsp: 2.252 ± 0.403
2.688AsnGlu: 2.688 ± 0.569
1.744AsnPhe: 1.744 ± 0.373
3.269AsnGly: 3.269 ± 0.441
0.944AsnHis: 0.944 ± 0.322
2.47AsnIle: 2.47 ± 0.403
3.124AsnLys: 3.124 ± 0.437
4.214AsnLeu: 4.214 ± 0.372
1.671AsnMet: 1.671 ± 0.291
2.688AsnAsn: 2.688 ± 0.512
2.107AsnPro: 2.107 ± 0.348
2.47AsnGln: 2.47 ± 0.399
3.342AsnArg: 3.342 ± 0.371
2.252AsnSer: 2.252 ± 0.502
2.761AsnThr: 2.761 ± 0.486
3.487AsnVal: 3.487 ± 0.502
0.727AsnTrp: 0.727 ± 0.225
1.744AsnTyr: 1.744 ± 0.294
0.0AsnXaa: 0.0 ± 0.0
Pro
2.979ProAla: 2.979 ± 0.471
0.436ProCys: 0.436 ± 0.19
2.398ProAsp: 2.398 ± 0.458
2.833ProGlu: 2.833 ± 0.418
1.526ProPhe: 1.526 ± 0.363
0.218ProGly: 0.218 ± 0.128
0.872ProHis: 0.872 ± 0.247
1.235ProIle: 1.235 ± 0.313
1.889ProLys: 1.889 ± 0.326
1.816ProLeu: 1.816 ± 0.341
1.017ProMet: 1.017 ± 0.235
1.671ProAsn: 1.671 ± 0.441
0.727ProPro: 0.727 ± 0.216
1.162ProGln: 1.162 ± 0.251
1.235ProArg: 1.235 ± 0.245
1.962ProSer: 1.962 ± 0.403
2.543ProThr: 2.543 ± 0.378
3.415ProVal: 3.415 ± 0.547
0.654ProTrp: 0.654 ± 0.21
1.671ProTyr: 1.671 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
3.851GlnAla: 3.851 ± 0.652
0.291GlnCys: 0.291 ± 0.146
2.18GlnAsp: 2.18 ± 0.339
3.342GlnGlu: 3.342 ± 0.506
1.744GlnPhe: 1.744 ± 0.357
4.287GlnGly: 4.287 ± 0.533
1.09GlnHis: 1.09 ± 0.213
2.325GlnIle: 2.325 ± 0.277
1.453GlnLys: 1.453 ± 0.304
3.705GlnLeu: 3.705 ± 0.515
1.671GlnMet: 1.671 ± 0.346
1.235GlnAsn: 1.235 ± 0.221
1.453GlnPro: 1.453 ± 0.277
2.252GlnGln: 2.252 ± 0.482
2.325GlnArg: 2.325 ± 0.441
2.18GlnSer: 2.18 ± 0.4
2.616GlnThr: 2.616 ± 0.409
2.107GlnVal: 2.107 ± 0.312
0.727GlnTrp: 0.727 ± 0.204
1.38GlnTyr: 1.38 ± 0.329
0.0GlnXaa: 0.0 ± 0.0
Arg
4.722ArgAla: 4.722 ± 0.892
0.436ArgCys: 0.436 ± 0.194
3.415ArgAsp: 3.415 ± 0.463
3.487ArgGlu: 3.487 ± 0.552
1.816ArgPhe: 1.816 ± 0.255
4.359ArgGly: 4.359 ± 0.684
1.38ArgHis: 1.38 ± 0.343
4.069ArgIle: 4.069 ± 0.619
2.906ArgLys: 2.906 ± 0.424
3.923ArgLeu: 3.923 ± 0.665
2.107ArgMet: 2.107 ± 0.343
2.107ArgAsn: 2.107 ± 0.45
1.526ArgPro: 1.526 ± 0.352
1.671ArgGln: 1.671 ± 0.293
2.979ArgArg: 2.979 ± 0.431
2.761ArgSer: 2.761 ± 0.483
2.107ArgThr: 2.107 ± 0.375
3.415ArgVal: 3.415 ± 0.512
0.799ArgTrp: 0.799 ± 0.237
2.325ArgTyr: 2.325 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
4.65SerAla: 4.65 ± 0.688
0.581SerCys: 0.581 ± 0.251
4.141SerAsp: 4.141 ± 0.497
3.56SerGlu: 3.56 ± 0.494
3.197SerPhe: 3.197 ± 0.716
4.795SerGly: 4.795 ± 0.478
0.727SerHis: 0.727 ± 0.254
4.287SerIle: 4.287 ± 0.461
3.633SerLys: 3.633 ± 0.471
5.086SerLeu: 5.086 ± 0.522
1.671SerMet: 1.671 ± 0.326
2.906SerAsn: 2.906 ± 0.47
1.889SerPro: 1.889 ± 0.373
2.616SerGln: 2.616 ± 0.476
3.56SerArg: 3.56 ± 0.519
3.487SerSer: 3.487 ± 0.728
2.906SerThr: 2.906 ± 0.514
3.633SerVal: 3.633 ± 0.483
1.162SerTrp: 1.162 ± 0.242
2.107SerTyr: 2.107 ± 0.322
0.0SerXaa: 0.0 ± 0.0
Thr
3.487ThrAla: 3.487 ± 0.49
0.436ThrCys: 0.436 ± 0.138
2.325ThrAsp: 2.325 ± 0.544
3.633ThrGlu: 3.633 ± 0.376
2.833ThrPhe: 2.833 ± 0.435
4.141ThrGly: 4.141 ± 0.495
1.162ThrHis: 1.162 ± 0.28
4.141ThrIle: 4.141 ± 0.687
4.141ThrLys: 4.141 ± 0.446
4.868ThrLeu: 4.868 ± 0.651
1.598ThrMet: 1.598 ± 0.32
1.816ThrAsn: 1.816 ± 0.346
2.034ThrPro: 2.034 ± 0.393
2.325ThrGln: 2.325 ± 0.359
2.688ThrArg: 2.688 ± 0.409
3.487ThrSer: 3.487 ± 0.561
2.833ThrThr: 2.833 ± 0.459
3.051ThrVal: 3.051 ± 0.563
0.291ThrTrp: 0.291 ± 0.16
1.816ThrTyr: 1.816 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
5.885ValAla: 5.885 ± 0.57
0.581ValCys: 0.581 ± 0.177
4.505ValAsp: 4.505 ± 0.49
4.722ValGlu: 4.722 ± 0.655
2.47ValPhe: 2.47 ± 0.424
4.577ValGly: 4.577 ± 0.685
1.308ValHis: 1.308 ± 0.4
3.342ValIle: 3.342 ± 0.539
4.65ValLys: 4.65 ± 0.467
4.069ValLeu: 4.069 ± 0.698
2.398ValMet: 2.398 ± 0.273
3.56ValAsn: 3.56 ± 0.612
2.398ValPro: 2.398 ± 0.351
2.761ValGln: 2.761 ± 0.52
4.069ValArg: 4.069 ± 0.551
4.722ValSer: 4.722 ± 0.537
3.197ValThr: 3.197 ± 0.523
4.795ValVal: 4.795 ± 0.631
0.799ValTrp: 0.799 ± 0.239
2.18ValTyr: 2.18 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
0.799TrpAla: 0.799 ± 0.218
0.363TrpCys: 0.363 ± 0.163
0.727TrpAsp: 0.727 ± 0.226
1.235TrpGlu: 1.235 ± 0.313
0.654TrpPhe: 0.654 ± 0.256
0.872TrpGly: 0.872 ± 0.227
0.509TrpHis: 0.509 ± 0.257
0.436TrpIle: 0.436 ± 0.146
1.162TrpLys: 1.162 ± 0.256
1.235TrpLeu: 1.235 ± 0.296
0.145TrpMet: 0.145 ± 0.098
0.581TrpAsn: 0.581 ± 0.198
0.799TrpPro: 0.799 ± 0.252
0.436TrpGln: 0.436 ± 0.175
0.654TrpArg: 0.654 ± 0.226
1.162TrpSer: 1.162 ± 0.391
0.363TrpThr: 0.363 ± 0.142
1.09TrpVal: 1.09 ± 0.307
0.218TrpTrp: 0.218 ± 0.127
0.218TrpTyr: 0.218 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.616TyrAla: 2.616 ± 0.326
0.363TyrCys: 0.363 ± 0.159
2.543TyrAsp: 2.543 ± 0.401
2.18TyrGlu: 2.18 ± 0.396
1.962TyrPhe: 1.962 ± 0.47
2.761TyrGly: 2.761 ± 0.498
0.509TyrHis: 0.509 ± 0.192
1.962TyrIle: 1.962 ± 0.335
2.252TyrLys: 2.252 ± 0.399
2.543TyrLeu: 2.543 ± 0.533
1.162TyrMet: 1.162 ± 0.281
1.598TyrAsn: 1.598 ± 0.295
1.162TyrPro: 1.162 ± 0.316
1.453TyrGln: 1.453 ± 0.307
1.453TyrArg: 1.453 ± 0.398
2.18TyrSer: 2.18 ± 0.383
2.47TyrThr: 2.47 ± 0.333
2.107TyrVal: 2.107 ± 0.334
0.291TyrTrp: 0.291 ± 0.164
0.654TyrTyr: 0.654 ± 0.215
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (13765 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski