Amino acid dipepetide frequency for Staphylococcus phage phiRS7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.371AlaAla: 3.371 ± 1.133
0.3AlaCys: 0.3 ± 0.148
3.071AlaAsp: 3.071 ± 0.618
2.921AlaGlu: 2.921 ± 0.434
2.846AlaPhe: 2.846 ± 0.438
2.097AlaGly: 2.097 ± 0.534
0.974AlaHis: 0.974 ± 0.267
3.52AlaIle: 3.52 ± 0.432
6.516AlaLys: 6.516 ± 0.984
4.12AlaLeu: 4.12 ± 0.538
1.273AlaMet: 1.273 ± 0.352
2.996AlaAsn: 2.996 ± 0.449
1.573AlaPro: 1.573 ± 0.291
2.022AlaGln: 2.022 ± 0.479
2.472AlaArg: 2.472 ± 0.355
3.745AlaSer: 3.745 ± 0.499
3.895AlaThr: 3.895 ± 1.001
3.296AlaVal: 3.296 ± 0.634
1.124AlaTrp: 1.124 ± 0.344
2.097AlaTyr: 2.097 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.449CysAsp: 0.449 ± 0.224
0.449CysGlu: 0.449 ± 0.215
0.375CysPhe: 0.375 ± 0.198
0.449CysGly: 0.449 ± 0.267
0.225CysHis: 0.225 ± 0.132
0.225CysIle: 0.225 ± 0.14
0.375CysLys: 0.375 ± 0.156
0.824CysLeu: 0.824 ± 0.342
0.0CysMet: 0.0 ± 0.0
0.375CysAsn: 0.375 ± 0.171
0.225CysPro: 0.225 ± 0.161
0.225CysGln: 0.225 ± 0.135
0.15CysArg: 0.15 ± 0.113
0.3CysSer: 0.3 ± 0.153
0.3CysThr: 0.3 ± 0.155
0.449CysVal: 0.449 ± 0.237
0.0CysTrp: 0.0 ± 0.0
0.3CysTyr: 0.3 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
3.221AspAla: 3.221 ± 0.684
0.0AspCys: 0.0 ± 0.0
4.494AspAsp: 4.494 ± 0.793
5.992AspGlu: 5.992 ± 0.797
3.445AspPhe: 3.445 ± 0.495
3.895AspGly: 3.895 ± 0.523
0.974AspHis: 0.974 ± 0.334
4.794AspIle: 4.794 ± 0.744
6.142AspLys: 6.142 ± 0.525
4.943AspLeu: 4.943 ± 0.673
1.648AspMet: 1.648 ± 0.316
2.622AspAsn: 2.622 ± 0.536
2.097AspPro: 2.097 ± 0.377
1.648AspGln: 1.648 ± 0.344
1.573AspArg: 1.573 ± 0.383
3.146AspSer: 3.146 ± 0.491
3.371AspThr: 3.371 ± 0.548
3.895AspVal: 3.895 ± 0.494
0.749AspTrp: 0.749 ± 0.208
3.595AspTyr: 3.595 ± 0.497
0.0AspXaa: 0.0 ± 0.0
Glu
3.371GluAla: 3.371 ± 0.622
0.824GluCys: 0.824 ± 0.288
3.595GluAsp: 3.595 ± 0.671
6.142GluGlu: 6.142 ± 1.081
2.322GluPhe: 2.322 ± 0.511
4.419GluGly: 4.419 ± 0.523
0.899GluHis: 0.899 ± 0.228
5.917GluIle: 5.917 ± 0.817
5.393GluLys: 5.393 ± 1.107
6.591GluLeu: 6.591 ± 0.774
2.322GluMet: 2.322 ± 0.415
4.869GluAsn: 4.869 ± 0.717
2.097GluPro: 2.097 ± 0.552
5.093GluGln: 5.093 ± 1.001
2.921GluArg: 2.921 ± 0.505
4.344GluSer: 4.344 ± 0.698
3.97GluThr: 3.97 ± 0.58
5.018GluVal: 5.018 ± 0.927
0.824GluTrp: 0.824 ± 0.28
3.221GluTyr: 3.221 ± 0.563
0.0GluXaa: 0.0 ± 0.0
Phe
2.172PheAla: 2.172 ± 0.485
0.449PheCys: 0.449 ± 0.195
3.146PheAsp: 3.146 ± 0.487
3.221PheGlu: 3.221 ± 0.548
1.498PhePhe: 1.498 ± 0.47
2.696PheGly: 2.696 ± 0.508
0.225PheHis: 0.225 ± 0.143
4.12PheIle: 4.12 ± 0.56
3.445PheLys: 3.445 ± 0.502
3.296PheLeu: 3.296 ± 0.421
1.124PheMet: 1.124 ± 0.272
3.595PheAsn: 3.595 ± 0.571
1.049PhePro: 1.049 ± 0.226
1.723PheGln: 1.723 ± 0.304
1.723PheArg: 1.723 ± 0.299
2.547PheSer: 2.547 ± 0.381
2.097PheThr: 2.097 ± 0.421
2.921PheVal: 2.921 ± 0.536
0.375PheTrp: 0.375 ± 0.177
1.498PheTyr: 1.498 ± 0.448
0.0PheXaa: 0.0 ± 0.0
Gly
3.146GlyAla: 3.146 ± 0.549
0.15GlyCys: 0.15 ± 0.118
2.547GlyAsp: 2.547 ± 0.49
2.846GlyGlu: 2.846 ± 0.417
3.146GlyPhe: 3.146 ± 0.599
5.168GlyGly: 5.168 ± 0.925
0.974GlyHis: 0.974 ± 0.274
5.168GlyIle: 5.168 ± 0.82
5.168GlyLys: 5.168 ± 0.76
5.618GlyLeu: 5.618 ± 0.755
1.648GlyMet: 1.648 ± 0.483
4.12GlyAsn: 4.12 ± 0.552
1.049GlyPro: 1.049 ± 0.33
2.322GlyGln: 2.322 ± 0.431
2.547GlyArg: 2.547 ± 0.353
3.296GlySer: 3.296 ± 0.608
3.296GlyThr: 3.296 ± 0.593
3.745GlyVal: 3.745 ± 0.739
0.599GlyTrp: 0.599 ± 0.207
3.595GlyTyr: 3.595 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
0.674HisAla: 0.674 ± 0.192
0.15HisCys: 0.15 ± 0.114
0.824HisAsp: 0.824 ± 0.246
0.899HisGlu: 0.899 ± 0.288
1.423HisPhe: 1.423 ± 0.346
0.674HisGly: 0.674 ± 0.253
0.674HisHis: 0.674 ± 0.262
0.749HisIle: 0.749 ± 0.25
0.674HisLys: 0.674 ± 0.299
1.798HisLeu: 1.798 ± 0.354
0.824HisMet: 0.824 ± 0.243
1.198HisAsn: 1.198 ± 0.342
0.599HisPro: 0.599 ± 0.179
0.375HisGln: 0.375 ± 0.158
0.749HisArg: 0.749 ± 0.238
1.124HisSer: 1.124 ± 0.237
0.974HisThr: 0.974 ± 0.291
0.899HisVal: 0.899 ± 0.284
0.075HisTrp: 0.075 ± 0.073
0.974HisTyr: 0.974 ± 0.365
0.0HisXaa: 0.0 ± 0.0
Ile
4.045IleAla: 4.045 ± 0.524
0.599IleCys: 0.599 ± 0.207
5.318IleAsp: 5.318 ± 0.669
6.591IleGlu: 6.591 ± 0.921
2.547IlePhe: 2.547 ± 0.382
3.67IleGly: 3.67 ± 0.6
1.273IleHis: 1.273 ± 0.369
5.018IleIle: 5.018 ± 0.709
7.041IleLys: 7.041 ± 0.658
4.943IleLeu: 4.943 ± 0.832
1.348IleMet: 1.348 ± 0.323
5.543IleAsn: 5.543 ± 0.69
2.547IlePro: 2.547 ± 0.544
2.696IleGln: 2.696 ± 0.488
2.696IleArg: 2.696 ± 0.475
4.644IleSer: 4.644 ± 0.627
5.093IleThr: 5.093 ± 0.602
4.419IleVal: 4.419 ± 0.529
0.749IleTrp: 0.749 ± 0.315
2.696IleTyr: 2.696 ± 0.459
0.0IleXaa: 0.0 ± 0.0
Lys
4.869LysAla: 4.869 ± 0.763
0.449LysCys: 0.449 ± 0.219
5.618LysAsp: 5.618 ± 0.675
6.292LysGlu: 6.292 ± 0.789
2.322LysPhe: 2.322 ± 0.517
5.992LysGly: 5.992 ± 1.021
1.348LysHis: 1.348 ± 0.267
5.917LysIle: 5.917 ± 0.743
7.041LysLys: 7.041 ± 0.818
7.041LysLeu: 7.041 ± 0.806
2.247LysMet: 2.247 ± 0.386
7.19LysAsn: 7.19 ± 1.021
2.397LysPro: 2.397 ± 0.435
4.869LysGln: 4.869 ± 0.644
4.045LysArg: 4.045 ± 0.744
5.018LysSer: 5.018 ± 0.55
5.543LysThr: 5.543 ± 0.646
5.917LysVal: 5.917 ± 0.697
0.974LysTrp: 0.974 ± 0.363
3.296LysTyr: 3.296 ± 0.677
0.0LysXaa: 0.0 ± 0.0
Leu
4.194LeuAla: 4.194 ± 0.5
0.599LeuCys: 0.599 ± 0.228
4.943LeuAsp: 4.943 ± 0.933
5.692LeuGlu: 5.692 ± 0.846
3.146LeuPhe: 3.146 ± 0.422
4.419LeuGly: 4.419 ± 0.649
1.273LeuHis: 1.273 ± 0.371
7.041LeuIle: 7.041 ± 1.026
8.014LeuLys: 8.014 ± 0.874
6.217LeuLeu: 6.217 ± 0.914
1.498LeuMet: 1.498 ± 0.293
6.891LeuAsn: 6.891 ± 0.766
2.846LeuPro: 2.846 ± 0.427
3.82LeuGln: 3.82 ± 0.556
2.846LeuArg: 2.846 ± 0.474
5.543LeuSer: 5.543 ± 0.581
4.569LeuThr: 4.569 ± 0.603
3.445LeuVal: 3.445 ± 0.403
1.049LeuTrp: 1.049 ± 0.343
2.846LeuTyr: 2.846 ± 0.459
0.0LeuXaa: 0.0 ± 0.0
Met
2.247MetAla: 2.247 ± 0.447
0.075MetCys: 0.075 ± 0.074
1.423MetAsp: 1.423 ± 0.376
1.873MetGlu: 1.873 ± 0.395
0.974MetPhe: 0.974 ± 0.309
0.824MetGly: 0.824 ± 0.26
0.075MetHis: 0.075 ± 0.061
1.573MetIle: 1.573 ± 0.365
2.472MetLys: 2.472 ± 0.368
2.097MetLeu: 2.097 ± 0.471
0.824MetMet: 0.824 ± 0.265
2.022MetAsn: 2.022 ± 0.42
0.3MetPro: 0.3 ± 0.17
1.124MetGln: 1.124 ± 0.265
1.124MetArg: 1.124 ± 0.29
1.198MetSer: 1.198 ± 0.294
2.247MetThr: 2.247 ± 0.352
1.273MetVal: 1.273 ± 0.309
0.674MetTrp: 0.674 ± 0.251
0.674MetTyr: 0.674 ± 0.258
0.0MetXaa: 0.0 ± 0.0
Asn
3.67AsnAla: 3.67 ± 0.524
0.375AsnCys: 0.375 ± 0.197
5.168AsnAsp: 5.168 ± 0.657
4.869AsnGlu: 4.869 ± 0.555
2.996AsnPhe: 2.996 ± 0.437
5.842AsnGly: 5.842 ± 1.037
1.049AsnHis: 1.049 ± 0.294
4.644AsnIle: 4.644 ± 0.595
5.842AsnLys: 5.842 ± 0.61
5.168AsnLeu: 5.168 ± 0.658
1.273AsnMet: 1.273 ± 0.269
4.494AsnAsn: 4.494 ± 0.698
2.322AsnPro: 2.322 ± 0.407
1.648AsnGln: 1.648 ± 0.299
2.996AsnArg: 2.996 ± 0.532
4.644AsnSer: 4.644 ± 0.722
3.296AsnThr: 3.296 ± 0.513
3.745AsnVal: 3.745 ± 0.449
1.049AsnTrp: 1.049 ± 0.291
2.771AsnTyr: 2.771 ± 0.378
0.0AsnXaa: 0.0 ± 0.0
Pro
0.824ProAla: 0.824 ± 0.219
0.0ProCys: 0.0 ± 0.0
1.947ProAsp: 1.947 ± 0.504
2.996ProGlu: 2.996 ± 0.693
1.723ProPhe: 1.723 ± 0.305
0.899ProGly: 0.899 ± 0.247
0.674ProHis: 0.674 ± 0.225
2.322ProIle: 2.322 ± 0.431
1.947ProLys: 1.947 ± 0.464
1.723ProLeu: 1.723 ± 0.409
0.824ProMet: 0.824 ± 0.308
2.547ProAsn: 2.547 ± 0.536
0.599ProPro: 0.599 ± 0.197
0.824ProGln: 0.824 ± 0.263
0.974ProArg: 0.974 ± 0.309
1.798ProSer: 1.798 ± 0.362
2.097ProThr: 2.097 ± 0.526
2.022ProVal: 2.022 ± 0.449
0.075ProTrp: 0.075 ± 0.076
1.873ProTyr: 1.873 ± 0.378
0.0ProXaa: 0.0 ± 0.0
Gln
3.595GlnAla: 3.595 ± 0.5
0.15GlnCys: 0.15 ± 0.113
2.696GlnAsp: 2.696 ± 0.521
3.52GlnGlu: 3.52 ± 0.695
1.273GlnPhe: 1.273 ± 0.287
2.472GlnGly: 2.472 ± 0.508
0.899GlnHis: 0.899 ± 0.258
2.397GlnIle: 2.397 ± 0.283
2.247GlnLys: 2.247 ± 0.332
4.569GlnLeu: 4.569 ± 0.842
0.974GlnMet: 0.974 ± 0.352
2.622GlnAsn: 2.622 ± 0.459
1.423GlnPro: 1.423 ± 0.375
3.221GlnGln: 3.221 ± 0.621
2.247GlnArg: 2.247 ± 0.429
2.921GlnSer: 2.921 ± 0.523
1.798GlnThr: 1.798 ± 0.348
2.022GlnVal: 2.022 ± 0.351
0.3GlnTrp: 0.3 ± 0.174
1.873GlnTyr: 1.873 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
1.648ArgAla: 1.648 ± 0.323
0.15ArgCys: 0.15 ± 0.097
2.322ArgAsp: 2.322 ± 0.351
2.846ArgGlu: 2.846 ± 0.548
2.322ArgPhe: 2.322 ± 0.312
2.472ArgGly: 2.472 ± 0.463
0.749ArgHis: 0.749 ± 0.218
3.146ArgIle: 3.146 ± 0.627
4.269ArgLys: 4.269 ± 0.763
3.445ArgLeu: 3.445 ± 0.49
1.124ArgMet: 1.124 ± 0.311
2.921ArgAsn: 2.921 ± 0.565
1.049ArgPro: 1.049 ± 0.326
1.348ArgGln: 1.348 ± 0.352
2.022ArgArg: 2.022 ± 0.534
2.322ArgSer: 2.322 ± 0.307
2.172ArgThr: 2.172 ± 0.412
1.947ArgVal: 1.947 ± 0.478
0.375ArgTrp: 0.375 ± 0.149
2.022ArgTyr: 2.022 ± 0.467
0.0ArgXaa: 0.0 ± 0.0
Ser
3.146SerAla: 3.146 ± 0.643
0.375SerCys: 0.375 ± 0.219
4.194SerAsp: 4.194 ± 0.576
3.82SerGlu: 3.82 ± 0.678
3.445SerPhe: 3.445 ± 0.457
3.97SerGly: 3.97 ± 0.65
0.824SerHis: 0.824 ± 0.252
5.318SerIle: 5.318 ± 0.665
5.093SerLys: 5.093 ± 0.707
3.97SerLeu: 3.97 ± 0.745
1.947SerMet: 1.947 ± 0.313
3.745SerAsn: 3.745 ± 0.494
1.573SerPro: 1.573 ± 0.321
2.921SerGln: 2.921 ± 0.573
2.921SerArg: 2.921 ± 0.397
3.52SerSer: 3.52 ± 0.695
3.071SerThr: 3.071 ± 0.458
4.494SerVal: 4.494 ± 0.568
0.524SerTrp: 0.524 ± 0.169
2.996SerTyr: 2.996 ± 0.666
0.0SerXaa: 0.0 ± 0.0
Thr
3.745ThrAla: 3.745 ± 0.778
0.15ThrCys: 0.15 ± 0.093
3.52ThrAsp: 3.52 ± 0.445
3.82ThrGlu: 3.82 ± 0.578
2.846ThrPhe: 2.846 ± 0.491
4.419ThrGly: 4.419 ± 0.506
1.198ThrHis: 1.198 ± 0.314
4.269ThrIle: 4.269 ± 0.647
5.393ThrLys: 5.393 ± 0.646
5.468ThrLeu: 5.468 ± 0.6
1.049ThrMet: 1.049 ± 0.286
2.771ThrAsn: 2.771 ± 0.525
1.873ThrPro: 1.873 ± 0.384
2.622ThrGln: 2.622 ± 0.509
1.498ThrArg: 1.498 ± 0.298
3.67ThrSer: 3.67 ± 0.442
4.045ThrThr: 4.045 ± 0.863
3.82ThrVal: 3.82 ± 0.475
0.599ThrTrp: 0.599 ± 0.239
2.472ThrTyr: 2.472 ± 0.571
0.0ThrXaa: 0.0 ± 0.0
Val
4.045ValAla: 4.045 ± 0.665
0.3ValCys: 0.3 ± 0.163
3.97ValAsp: 3.97 ± 0.679
5.093ValGlu: 5.093 ± 0.867
2.397ValPhe: 2.397 ± 0.323
2.996ValGly: 2.996 ± 0.434
0.824ValHis: 0.824 ± 0.265
3.146ValIle: 3.146 ± 0.621
5.468ValLys: 5.468 ± 0.774
4.045ValLeu: 4.045 ± 0.663
1.348ValMet: 1.348 ± 0.311
3.895ValAsn: 3.895 ± 0.513
1.873ValPro: 1.873 ± 0.464
2.022ValGln: 2.022 ± 0.408
2.472ValArg: 2.472 ± 0.436
4.719ValSer: 4.719 ± 0.606
4.344ValThr: 4.344 ± 0.443
3.895ValVal: 3.895 ± 0.587
0.524ValTrp: 0.524 ± 0.203
2.547ValTyr: 2.547 ± 0.481
0.0ValXaa: 0.0 ± 0.0
Trp
0.449TrpAla: 0.449 ± 0.197
0.075TrpCys: 0.075 ± 0.076
0.599TrpAsp: 0.599 ± 0.188
0.449TrpGlu: 0.449 ± 0.165
0.225TrpPhe: 0.225 ± 0.128
0.3TrpGly: 0.3 ± 0.19
0.15TrpHis: 0.15 ± 0.104
0.974TrpIle: 0.974 ± 0.284
1.124TrpLys: 1.124 ± 0.316
1.423TrpLeu: 1.423 ± 0.332
0.375TrpMet: 0.375 ± 0.172
0.974TrpAsn: 0.974 ± 0.332
0.075TrpPro: 0.075 ± 0.074
0.375TrpGln: 0.375 ± 0.144
0.599TrpArg: 0.599 ± 0.202
0.974TrpSer: 0.974 ± 0.338
0.749TrpThr: 0.749 ± 0.256
0.599TrpVal: 0.599 ± 0.172
0.15TrpTrp: 0.15 ± 0.112
0.449TrpTyr: 0.449 ± 0.201
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.798TyrAla: 1.798 ± 0.379
0.524TyrCys: 0.524 ± 0.228
2.472TyrAsp: 2.472 ± 0.521
3.82TyrGlu: 3.82 ± 0.765
1.648TyrPhe: 1.648 ± 0.383
2.472TyrGly: 2.472 ± 0.462
1.049TyrHis: 1.049 ± 0.288
3.221TyrIle: 3.221 ± 0.613
4.719TyrLys: 4.719 ± 0.69
3.595TyrLeu: 3.595 ± 0.541
1.348TyrMet: 1.348 ± 0.347
2.472TyrAsn: 2.472 ± 0.516
1.124TyrPro: 1.124 ± 0.318
2.322TyrGln: 2.322 ± 0.469
2.097TyrArg: 2.097 ± 0.419
2.397TyrSer: 2.397 ± 0.405
2.322TyrThr: 2.322 ± 0.384
2.097TyrVal: 2.097 ± 0.457
0.3TyrTrp: 0.3 ± 0.141
2.097TyrTyr: 2.097 ± 0.452
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13352 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski