Amino acid dipepetide frequency for Proteus phage pPM_01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.193AlaAla: 3.193 ± 0.603
0.627AlaCys: 0.627 ± 0.218
4.961AlaAsp: 4.961 ± 0.564
5.531AlaGlu: 5.531 ± 0.673
3.706AlaPhe: 3.706 ± 0.455
5.018AlaGly: 5.018 ± 0.459
1.482AlaHis: 1.482 ± 0.265
4.562AlaIle: 4.562 ± 0.613
5.873AlaLys: 5.873 ± 0.881
5.759AlaLeu: 5.759 ± 0.714
2.452AlaMet: 2.452 ± 0.394
3.649AlaAsn: 3.649 ± 0.566
2.68AlaPro: 2.68 ± 0.536
2.737AlaGln: 2.737 ± 0.547
3.706AlaArg: 3.706 ± 0.626
5.816AlaSer: 5.816 ± 0.696
4.505AlaThr: 4.505 ± 0.418
5.075AlaVal: 5.075 ± 0.609
1.654AlaTrp: 1.654 ± 0.265
2.566AlaTyr: 2.566 ± 0.398
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.175
0.0CysCys: 0.0 ± 0.0
0.855CysAsp: 0.855 ± 0.228
0.684CysGlu: 0.684 ± 0.212
0.513CysPhe: 0.513 ± 0.177
0.798CysGly: 0.798 ± 0.191
0.057CysHis: 0.057 ± 0.053
0.399CysIle: 0.399 ± 0.136
0.399CysLys: 0.399 ± 0.175
0.513CysLeu: 0.513 ± 0.183
0.285CysMet: 0.285 ± 0.137
0.456CysAsn: 0.456 ± 0.173
0.741CysPro: 0.741 ± 0.267
0.342CysGln: 0.342 ± 0.171
0.456CysArg: 0.456 ± 0.17
0.228CysSer: 0.228 ± 0.124
0.399CysThr: 0.399 ± 0.175
0.684CysVal: 0.684 ± 0.273
0.057CysTrp: 0.057 ± 0.057
0.399CysTyr: 0.399 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
4.676AspAla: 4.676 ± 0.486
0.57AspCys: 0.57 ± 0.191
4.276AspAsp: 4.276 ± 0.547
4.847AspGlu: 4.847 ± 0.51
2.395AspPhe: 2.395 ± 0.38
6.272AspGly: 6.272 ± 0.779
1.14AspHis: 1.14 ± 0.28
4.162AspIle: 4.162 ± 0.667
4.105AspLys: 4.105 ± 0.553
5.816AspLeu: 5.816 ± 0.66
2.11AspMet: 2.11 ± 0.404
3.421AspAsn: 3.421 ± 0.467
2.965AspPro: 2.965 ± 0.852
1.654AspGln: 1.654 ± 0.254
3.307AspArg: 3.307 ± 0.503
3.193AspSer: 3.193 ± 0.346
3.706AspThr: 3.706 ± 0.484
4.562AspVal: 4.562 ± 0.516
1.197AspTrp: 1.197 ± 0.251
1.939AspTyr: 1.939 ± 0.326
0.0AspXaa: 0.0 ± 0.0
Glu
5.417GluAla: 5.417 ± 0.703
0.57GluCys: 0.57 ± 0.188
4.39GluAsp: 4.39 ± 0.511
5.93GluGlu: 5.93 ± 0.728
2.395GluPhe: 2.395 ± 0.322
3.022GluGly: 3.022 ± 0.389
0.855GluHis: 0.855 ± 0.227
4.847GluIle: 4.847 ± 0.512
5.417GluLys: 5.417 ± 0.633
5.873GluLeu: 5.873 ± 0.579
1.882GluMet: 1.882 ± 0.292
3.25GluAsn: 3.25 ± 0.408
2.395GluPro: 2.395 ± 0.397
2.338GluGln: 2.338 ± 0.481
3.763GluArg: 3.763 ± 0.585
4.505GluSer: 4.505 ± 0.634
4.505GluThr: 4.505 ± 0.539
4.904GluVal: 4.904 ± 0.486
1.482GluTrp: 1.482 ± 0.29
2.395GluTyr: 2.395 ± 0.384
0.0GluXaa: 0.0 ± 0.0
Phe
2.68PheAla: 2.68 ± 0.363
0.684PheCys: 0.684 ± 0.168
3.25PheAsp: 3.25 ± 0.443
4.39PheGlu: 4.39 ± 0.433
1.882PhePhe: 1.882 ± 0.328
3.592PheGly: 3.592 ± 0.397
0.741PheHis: 0.741 ± 0.253
2.908PheIle: 2.908 ± 0.404
2.452PheLys: 2.452 ± 0.419
2.68PheLeu: 2.68 ± 0.402
1.026PheMet: 1.026 ± 0.257
1.825PheAsn: 1.825 ± 0.295
1.425PhePro: 1.425 ± 0.318
1.14PheGln: 1.14 ± 0.245
2.11PheArg: 2.11 ± 0.344
2.11PheSer: 2.11 ± 0.326
2.68PheThr: 2.68 ± 0.41
2.224PheVal: 2.224 ± 0.36
0.399PheTrp: 0.399 ± 0.156
1.311PheTyr: 1.311 ± 0.241
0.0PheXaa: 0.0 ± 0.0
Gly
4.79GlyAla: 4.79 ± 0.502
1.026GlyCys: 1.026 ± 0.273
4.904GlyAsp: 4.904 ± 0.567
5.189GlyGlu: 5.189 ± 0.475
3.364GlyPhe: 3.364 ± 0.461
5.702GlyGly: 5.702 ± 1.023
1.026GlyHis: 1.026 ± 0.293
4.447GlyIle: 4.447 ± 0.454
5.018GlyLys: 5.018 ± 0.764
4.676GlyLeu: 4.676 ± 0.529
1.939GlyMet: 1.939 ± 0.372
2.224GlyAsn: 2.224 ± 0.414
1.482GlyPro: 1.482 ± 0.245
3.25GlyGln: 3.25 ± 0.492
4.162GlyArg: 4.162 ± 0.469
4.562GlySer: 4.562 ± 0.539
4.105GlyThr: 4.105 ± 0.484
6.5GlyVal: 6.5 ± 0.5
1.026GlyTrp: 1.026 ± 0.359
2.737GlyTyr: 2.737 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
1.14HisAla: 1.14 ± 0.22
0.057HisCys: 0.057 ± 0.054
0.969HisAsp: 0.969 ± 0.246
0.798HisGlu: 0.798 ± 0.205
0.513HisPhe: 0.513 ± 0.141
1.14HisGly: 1.14 ± 0.287
0.285HisHis: 0.285 ± 0.126
0.969HisIle: 0.969 ± 0.24
0.855HisLys: 0.855 ± 0.25
1.425HisLeu: 1.425 ± 0.266
0.342HisMet: 0.342 ± 0.145
0.57HisAsn: 0.57 ± 0.144
0.912HisPro: 0.912 ± 0.227
0.285HisGln: 0.285 ± 0.107
1.311HisArg: 1.311 ± 0.26
0.969HisSer: 0.969 ± 0.178
1.026HisThr: 1.026 ± 0.258
1.254HisVal: 1.254 ± 0.243
0.114HisTrp: 0.114 ± 0.076
0.513HisTyr: 0.513 ± 0.219
0.0HisXaa: 0.0 ± 0.0
Ile
4.562IleAla: 4.562 ± 0.493
0.456IleCys: 0.456 ± 0.185
4.505IleAsp: 4.505 ± 0.567
4.733IleGlu: 4.733 ± 0.496
1.768IlePhe: 1.768 ± 0.345
3.763IleGly: 3.763 ± 0.505
1.482IleHis: 1.482 ± 0.221
3.649IleIle: 3.649 ± 0.435
4.847IleLys: 4.847 ± 0.544
3.763IleLeu: 3.763 ± 0.473
1.026IleMet: 1.026 ± 0.269
3.82IleAsn: 3.82 ± 0.569
2.851IlePro: 2.851 ± 0.35
1.654IleGln: 1.654 ± 0.279
3.364IleArg: 3.364 ± 0.463
4.676IleSer: 4.676 ± 0.424
3.763IleThr: 3.763 ± 0.405
3.25IleVal: 3.25 ± 0.382
0.399IleTrp: 0.399 ± 0.142
2.11IleTyr: 2.11 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
5.759LysAla: 5.759 ± 0.794
0.399LysCys: 0.399 ± 0.174
4.105LysAsp: 4.105 ± 0.499
3.763LysGlu: 3.763 ± 0.497
2.623LysPhe: 2.623 ± 0.354
3.592LysGly: 3.592 ± 0.451
1.14LysHis: 1.14 ± 0.282
3.136LysIle: 3.136 ± 0.503
4.105LysLys: 4.105 ± 0.563
5.132LysLeu: 5.132 ± 0.666
2.566LysMet: 2.566 ± 0.452
2.68LysAsn: 2.68 ± 0.328
3.136LysPro: 3.136 ± 0.495
1.939LysGln: 1.939 ± 0.386
3.478LysArg: 3.478 ± 0.591
4.333LysSer: 4.333 ± 0.53
3.877LysThr: 3.877 ± 0.437
3.934LysVal: 3.934 ± 0.563
0.912LysTrp: 0.912 ± 0.268
2.053LysTyr: 2.053 ± 0.407
0.0LysXaa: 0.0 ± 0.0
Leu
6.671LeuAla: 6.671 ± 0.793
0.798LeuCys: 0.798 ± 0.223
4.676LeuAsp: 4.676 ± 0.619
4.79LeuGlu: 4.79 ± 0.548
2.395LeuPhe: 2.395 ± 0.426
5.645LeuGly: 5.645 ± 0.713
0.741LeuHis: 0.741 ± 0.211
3.649LeuIle: 3.649 ± 0.461
4.904LeuLys: 4.904 ± 0.592
5.417LeuLeu: 5.417 ± 0.485
2.395LeuMet: 2.395 ± 0.38
3.934LeuAsn: 3.934 ± 0.455
3.478LeuPro: 3.478 ± 0.408
2.395LeuGln: 2.395 ± 0.388
5.246LeuArg: 5.246 ± 0.57
5.645LeuSer: 5.645 ± 0.634
5.303LeuThr: 5.303 ± 0.604
6.272LeuVal: 6.272 ± 0.467
0.399LeuTrp: 0.399 ± 0.154
1.996LeuTyr: 1.996 ± 0.27
0.0LeuXaa: 0.0 ± 0.0
Met
3.25MetAla: 3.25 ± 0.39
0.171MetCys: 0.171 ± 0.101
1.197MetAsp: 1.197 ± 0.305
1.825MetGlu: 1.825 ± 0.31
0.912MetPhe: 0.912 ± 0.236
1.425MetGly: 1.425 ± 0.305
0.285MetHis: 0.285 ± 0.108
2.053MetIle: 2.053 ± 0.305
1.368MetLys: 1.368 ± 0.262
2.395MetLeu: 2.395 ± 0.349
0.684MetMet: 0.684 ± 0.176
1.368MetAsn: 1.368 ± 0.259
1.368MetPro: 1.368 ± 0.286
0.513MetGln: 0.513 ± 0.229
1.654MetArg: 1.654 ± 0.281
1.711MetSer: 1.711 ± 0.403
1.711MetThr: 1.711 ± 0.303
2.11MetVal: 2.11 ± 0.359
0.342MetTrp: 0.342 ± 0.145
0.969MetTyr: 0.969 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
4.105AsnAla: 4.105 ± 0.721
0.285AsnCys: 0.285 ± 0.125
2.794AsnAsp: 2.794 ± 0.427
2.623AsnGlu: 2.623 ± 0.366
1.54AsnPhe: 1.54 ± 0.295
4.276AsnGly: 4.276 ± 0.459
0.57AsnHis: 0.57 ± 0.17
2.965AsnIle: 2.965 ± 0.443
2.566AsnLys: 2.566 ± 0.35
3.934AsnLeu: 3.934 ± 0.454
0.741AsnMet: 0.741 ± 0.229
2.395AsnAsn: 2.395 ± 0.434
2.965AsnPro: 2.965 ± 0.639
1.597AsnGln: 1.597 ± 0.356
2.395AsnArg: 2.395 ± 0.343
3.079AsnSer: 3.079 ± 0.586
3.82AsnThr: 3.82 ± 0.478
2.737AsnVal: 2.737 ± 0.383
0.513AsnTrp: 0.513 ± 0.153
1.54AsnTyr: 1.54 ± 0.357
0.0AsnXaa: 0.0 ± 0.0
Pro
2.509ProAla: 2.509 ± 0.37
0.342ProCys: 0.342 ± 0.159
3.25ProAsp: 3.25 ± 0.574
3.25ProGlu: 3.25 ± 0.447
1.597ProPhe: 1.597 ± 0.319
3.25ProGly: 3.25 ± 0.568
1.254ProHis: 1.254 ± 0.329
2.794ProIle: 2.794 ± 0.53
2.338ProLys: 2.338 ± 0.424
2.851ProLeu: 2.851 ± 0.405
1.482ProMet: 1.482 ± 0.316
2.167ProAsn: 2.167 ± 0.329
1.654ProPro: 1.654 ± 0.428
1.368ProGln: 1.368 ± 0.397
1.939ProArg: 1.939 ± 0.337
2.965ProSer: 2.965 ± 0.478
2.737ProThr: 2.737 ± 0.401
3.421ProVal: 3.421 ± 0.492
0.741ProTrp: 0.741 ± 0.249
1.711ProTyr: 1.711 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
2.908GlnAla: 2.908 ± 0.578
0.228GlnCys: 0.228 ± 0.115
1.597GlnAsp: 1.597 ± 0.34
1.711GlnGlu: 1.711 ± 0.292
1.597GlnPhe: 1.597 ± 0.305
1.825GlnGly: 1.825 ± 0.374
0.399GlnHis: 0.399 ± 0.116
2.338GlnIle: 2.338 ± 0.364
1.939GlnLys: 1.939 ± 0.454
3.307GlnLeu: 3.307 ± 0.449
0.798GlnMet: 0.798 ± 0.207
1.654GlnAsn: 1.654 ± 0.371
1.311GlnPro: 1.311 ± 0.355
1.482GlnGln: 1.482 ± 0.534
1.597GlnArg: 1.597 ± 0.344
2.167GlnSer: 2.167 ± 0.322
1.597GlnThr: 1.597 ± 0.294
2.509GlnVal: 2.509 ± 0.516
0.399GlnTrp: 0.399 ± 0.134
0.912GlnTyr: 0.912 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
3.307ArgAla: 3.307 ± 0.398
0.228ArgCys: 0.228 ± 0.125
3.535ArgAsp: 3.535 ± 0.398
3.763ArgGlu: 3.763 ± 0.628
2.623ArgPhe: 2.623 ± 0.341
4.447ArgGly: 4.447 ± 0.474
0.855ArgHis: 0.855 ± 0.213
4.048ArgIle: 4.048 ± 0.568
4.562ArgLys: 4.562 ± 0.654
4.276ArgLeu: 4.276 ± 0.547
1.254ArgMet: 1.254 ± 0.242
2.224ArgAsn: 2.224 ± 0.369
2.908ArgPro: 2.908 ± 0.44
2.11ArgGln: 2.11 ± 0.446
3.478ArgArg: 3.478 ± 0.525
2.794ArgSer: 2.794 ± 0.468
2.68ArgThr: 2.68 ± 0.334
4.276ArgVal: 4.276 ± 0.466
0.513ArgTrp: 0.513 ± 0.148
1.654ArgTyr: 1.654 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
6.329SerAla: 6.329 ± 0.908
0.627SerCys: 0.627 ± 0.253
4.39SerAsp: 4.39 ± 0.46
4.961SerGlu: 4.961 ± 0.56
3.364SerPhe: 3.364 ± 0.43
5.189SerGly: 5.189 ± 0.549
0.855SerHis: 0.855 ± 0.245
3.421SerIle: 3.421 ± 0.468
2.794SerLys: 2.794 ± 0.463
5.417SerLeu: 5.417 ± 0.679
2.281SerMet: 2.281 ± 0.36
3.364SerAsn: 3.364 ± 0.363
3.022SerPro: 3.022 ± 0.626
1.597SerGln: 1.597 ± 0.316
2.794SerArg: 2.794 ± 0.441
3.763SerSer: 3.763 ± 0.659
3.136SerThr: 3.136 ± 0.464
4.676SerVal: 4.676 ± 0.523
0.912SerTrp: 0.912 ± 0.23
2.167SerTyr: 2.167 ± 0.471
0.0SerXaa: 0.0 ± 0.0
Thr
4.447ThrAla: 4.447 ± 0.591
0.285ThrCys: 0.285 ± 0.12
3.82ThrAsp: 3.82 ± 0.545
3.364ThrGlu: 3.364 ± 0.356
2.509ThrPhe: 2.509 ± 0.393
4.276ThrGly: 4.276 ± 0.506
0.798ThrHis: 0.798 ± 0.284
3.478ThrIle: 3.478 ± 0.421
3.364ThrLys: 3.364 ± 0.504
4.676ThrLeu: 4.676 ± 0.411
1.482ThrMet: 1.482 ± 0.275
2.68ThrAsn: 2.68 ± 0.406
2.623ThrPro: 2.623 ± 0.406
1.825ThrGln: 1.825 ± 0.298
3.592ThrArg: 3.592 ± 0.541
3.535ThrSer: 3.535 ± 0.513
3.022ThrThr: 3.022 ± 0.543
5.987ThrVal: 5.987 ± 0.706
1.254ThrTrp: 1.254 ± 0.248
2.167ThrTyr: 2.167 ± 0.372
0.0ThrXaa: 0.0 ± 0.0
Val
6.101ValAla: 6.101 ± 0.452
0.855ValCys: 0.855 ± 0.249
5.189ValAsp: 5.189 ± 0.474
4.961ValGlu: 4.961 ± 0.534
3.478ValPhe: 3.478 ± 0.419
5.474ValGly: 5.474 ± 0.52
0.684ValHis: 0.684 ± 0.163
4.505ValIle: 4.505 ± 0.465
3.706ValLys: 3.706 ± 0.349
4.505ValLeu: 4.505 ± 0.528
1.54ValMet: 1.54 ± 0.257
3.592ValAsn: 3.592 ± 0.473
2.737ValPro: 2.737 ± 0.371
1.996ValGln: 1.996 ± 0.333
4.39ValArg: 4.39 ± 0.577
5.816ValSer: 5.816 ± 0.582
4.048ValThr: 4.048 ± 0.605
4.619ValVal: 4.619 ± 0.561
1.083ValTrp: 1.083 ± 0.3
2.851ValTyr: 2.851 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
0.855TrpAla: 0.855 ± 0.225
0.114TrpCys: 0.114 ± 0.086
1.311TrpAsp: 1.311 ± 0.298
1.197TrpGlu: 1.197 ± 0.279
0.741TrpPhe: 0.741 ± 0.236
0.969TrpGly: 0.969 ± 0.212
0.0TrpHis: 0.0 ± 0.0
0.513TrpIle: 0.513 ± 0.154
0.57TrpLys: 0.57 ± 0.192
1.425TrpLeu: 1.425 ± 0.334
0.171TrpMet: 0.171 ± 0.102
0.627TrpAsn: 0.627 ± 0.21
0.798TrpPro: 0.798 ± 0.232
0.684TrpGln: 0.684 ± 0.182
0.912TrpArg: 0.912 ± 0.261
0.741TrpSer: 0.741 ± 0.224
0.741TrpThr: 0.741 ± 0.216
0.855TrpVal: 0.855 ± 0.229
0.285TrpTrp: 0.285 ± 0.138
0.456TrpTyr: 0.456 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.338TyrAla: 2.338 ± 0.347
0.456TyrCys: 0.456 ± 0.193
2.338TyrAsp: 2.338 ± 0.357
1.825TyrGlu: 1.825 ± 0.318
1.654TyrPhe: 1.654 ± 0.265
2.452TyrGly: 2.452 ± 0.453
0.798TyrHis: 0.798 ± 0.184
1.597TyrIle: 1.597 ± 0.262
1.482TyrLys: 1.482 ± 0.299
2.908TyrLeu: 2.908 ± 0.37
0.798TyrMet: 0.798 ± 0.176
1.54TyrAsn: 1.54 ± 0.332
2.167TyrPro: 2.167 ± 0.485
1.368TyrGln: 1.368 ± 0.339
1.882TyrArg: 1.882 ± 0.375
2.452TyrSer: 2.452 ± 0.405
1.825TyrThr: 1.825 ± 0.333
2.281TyrVal: 2.281 ± 0.438
0.285TyrTrp: 0.285 ± 0.124
1.311TyrTyr: 1.311 ± 0.278
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (17539 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski