Amino acid dipepetide frequency for Streptococcus phage IPP8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.344AlaAla: 2.344 ± 0.673
0.323AlaCys: 0.323 ± 0.161
5.576AlaAsp: 5.576 ± 0.678
5.899AlaGlu: 5.899 ± 0.766
2.505AlaPhe: 2.505 ± 0.623
4.768AlaGly: 4.768 ± 0.959
0.727AlaHis: 0.727 ± 0.279
4.606AlaIle: 4.606 ± 0.967
6.627AlaLys: 6.627 ± 0.739
6.223AlaLeu: 6.223 ± 0.878
2.101AlaMet: 2.101 ± 0.422
4.364AlaAsn: 4.364 ± 0.771
1.778AlaPro: 1.778 ± 0.299
2.344AlaGln: 2.344 ± 0.568
3.233AlaArg: 3.233 ± 0.605
2.667AlaSer: 2.667 ± 0.671
5.334AlaThr: 5.334 ± 0.789
5.415AlaVal: 5.415 ± 0.837
1.374AlaTrp: 1.374 ± 0.411
1.535AlaTyr: 1.535 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.162CysAla: 0.162 ± 0.098
0.081CysCys: 0.081 ± 0.074
0.323CysAsp: 0.323 ± 0.133
0.485CysGlu: 0.485 ± 0.173
0.323CysPhe: 0.323 ± 0.211
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.485CysIle: 0.485 ± 0.28
0.566CysLys: 0.566 ± 0.187
0.404CysLeu: 0.404 ± 0.177
0.0CysMet: 0.0 ± 0.0
0.162CysAsn: 0.162 ± 0.149
0.242CysPro: 0.242 ± 0.152
0.242CysGln: 0.242 ± 0.139
0.242CysArg: 0.242 ± 0.12
0.323CysSer: 0.323 ± 0.173
0.162CysThr: 0.162 ± 0.122
0.081CysVal: 0.081 ± 0.085
0.242CysTrp: 0.242 ± 0.123
0.404CysTyr: 0.404 ± 0.188
0.0CysXaa: 0.0 ± 0.0
Asp
3.556AspAla: 3.556 ± 0.693
0.566AspCys: 0.566 ± 0.224
2.909AspAsp: 2.909 ± 0.705
5.011AspGlu: 5.011 ± 1.069
2.829AspPhe: 2.829 ± 0.469
4.768AspGly: 4.768 ± 0.592
0.485AspHis: 0.485 ± 0.226
5.576AspIle: 5.576 ± 0.696
5.334AspLys: 5.334 ± 0.766
5.172AspLeu: 5.172 ± 0.673
1.616AspMet: 1.616 ± 0.351
2.99AspAsn: 2.99 ± 0.486
1.778AspPro: 1.778 ± 0.406
1.859AspGln: 1.859 ± 0.409
2.424AspArg: 2.424 ± 0.48
3.313AspSer: 3.313 ± 0.571
3.313AspThr: 3.313 ± 0.465
3.394AspVal: 3.394 ± 0.483
1.535AspTrp: 1.535 ± 0.433
2.667AspTyr: 2.667 ± 0.444
0.0AspXaa: 0.0 ± 0.0
Glu
6.304GluAla: 6.304 ± 0.922
0.162GluCys: 0.162 ± 0.091
3.879GluAsp: 3.879 ± 0.681
6.627GluGlu: 6.627 ± 1.308
3.798GluPhe: 3.798 ± 0.716
3.556GluGly: 3.556 ± 0.488
1.374GluHis: 1.374 ± 0.403
5.899GluIle: 5.899 ± 0.537
7.193GluLys: 7.193 ± 1.243
9.213GluLeu: 9.213 ± 1.422
2.424GluMet: 2.424 ± 0.582
4.122GluAsn: 4.122 ± 0.482
1.455GluPro: 1.455 ± 0.441
2.909GluGln: 2.909 ± 0.683
3.879GluArg: 3.879 ± 0.568
4.768GluSer: 4.768 ± 0.595
4.687GluThr: 4.687 ± 0.701
4.768GluVal: 4.768 ± 0.608
1.131GluTrp: 1.131 ± 0.268
3.233GluTyr: 3.233 ± 0.472
0.0GluXaa: 0.0 ± 0.0
Phe
2.505PheAla: 2.505 ± 0.634
0.242PheCys: 0.242 ± 0.133
3.96PheAsp: 3.96 ± 0.472
3.96PheGlu: 3.96 ± 0.637
1.778PhePhe: 1.778 ± 0.407
2.344PheGly: 2.344 ± 0.64
0.242PheHis: 0.242 ± 0.153
2.182PheIle: 2.182 ± 0.436
3.313PheLys: 3.313 ± 0.572
2.909PheLeu: 2.909 ± 0.434
1.293PheMet: 1.293 ± 0.343
2.667PheAsn: 2.667 ± 0.621
0.566PhePro: 0.566 ± 0.242
1.051PheGln: 1.051 ± 0.298
1.616PheArg: 1.616 ± 0.257
3.313PheSer: 3.313 ± 0.606
2.263PheThr: 2.263 ± 0.411
1.455PheVal: 1.455 ± 0.321
0.647PheTrp: 0.647 ± 0.219
1.94PheTyr: 1.94 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
3.152GlyAla: 3.152 ± 0.549
0.162GlyCys: 0.162 ± 0.118
3.475GlyAsp: 3.475 ± 0.582
4.364GlyGlu: 4.364 ± 0.768
2.909GlyPhe: 2.909 ± 0.653
4.849GlyGly: 4.849 ± 1.134
0.808GlyHis: 0.808 ± 0.262
3.96GlyIle: 3.96 ± 0.672
5.495GlyLys: 5.495 ± 0.562
5.899GlyLeu: 5.899 ± 1.087
1.94GlyMet: 1.94 ± 0.353
3.475GlyAsn: 3.475 ± 0.465
0.97GlyPro: 0.97 ± 0.262
3.556GlyGln: 3.556 ± 0.51
3.556GlyArg: 3.556 ± 0.628
3.879GlySer: 3.879 ± 0.817
3.313GlyThr: 3.313 ± 0.598
4.122GlyVal: 4.122 ± 0.697
1.051GlyTrp: 1.051 ± 0.437
2.586GlyTyr: 2.586 ± 0.402
0.0GlyXaa: 0.0 ± 0.0
His
0.647HisAla: 0.647 ± 0.273
0.0HisCys: 0.0 ± 0.0
0.647HisAsp: 0.647 ± 0.301
1.293HisGlu: 1.293 ± 0.328
0.566HisPhe: 0.566 ± 0.229
0.97HisGly: 0.97 ± 0.293
0.323HisHis: 0.323 ± 0.195
0.727HisIle: 0.727 ± 0.31
0.727HisLys: 0.727 ± 0.299
1.374HisLeu: 1.374 ± 0.377
0.323HisMet: 0.323 ± 0.219
0.889HisAsn: 0.889 ± 0.245
0.647HisPro: 0.647 ± 0.22
0.566HisGln: 0.566 ± 0.248
0.485HisArg: 0.485 ± 0.193
1.697HisSer: 1.697 ± 0.506
0.647HisThr: 0.647 ± 0.242
0.727HisVal: 0.727 ± 0.259
0.081HisTrp: 0.081 ± 0.074
0.647HisTyr: 0.647 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
5.415IleAla: 5.415 ± 0.725
0.566IleCys: 0.566 ± 0.196
3.879IleAsp: 3.879 ± 0.648
6.465IleGlu: 6.465 ± 0.74
2.667IlePhe: 2.667 ± 0.582
4.526IleGly: 4.526 ± 0.866
0.485IleHis: 0.485 ± 0.226
3.152IleIle: 3.152 ± 0.517
6.384IleLys: 6.384 ± 0.601
4.445IleLeu: 4.445 ± 0.756
1.293IleMet: 1.293 ± 0.32
2.909IleAsn: 2.909 ± 0.461
1.535IlePro: 1.535 ± 0.355
2.667IleGln: 2.667 ± 0.371
3.071IleArg: 3.071 ± 0.563
4.768IleSer: 4.768 ± 0.799
4.202IleThr: 4.202 ± 0.569
3.475IleVal: 3.475 ± 0.673
0.485IleTrp: 0.485 ± 0.176
2.505IleTyr: 2.505 ± 0.769
0.0IleXaa: 0.0 ± 0.0
Lys
5.334LysAla: 5.334 ± 0.671
0.404LysCys: 0.404 ± 0.199
5.819LysAsp: 5.819 ± 0.66
7.677LysGlu: 7.677 ± 0.99
3.233LysPhe: 3.233 ± 0.647
4.526LysGly: 4.526 ± 0.684
1.535LysHis: 1.535 ± 0.312
5.495LysIle: 5.495 ± 0.755
8.324LysLys: 8.324 ± 1.218
7.758LysLeu: 7.758 ± 0.701
3.071LysMet: 3.071 ± 0.469
4.445LysAsn: 4.445 ± 0.658
2.667LysPro: 2.667 ± 0.556
3.556LysGln: 3.556 ± 0.648
3.556LysArg: 3.556 ± 0.391
4.526LysSer: 4.526 ± 0.464
5.819LysThr: 5.819 ± 0.596
5.738LysVal: 5.738 ± 0.74
1.051LysTrp: 1.051 ± 0.349
2.909LysTyr: 2.909 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
7.193LeuAla: 7.193 ± 0.824
0.485LeuCys: 0.485 ± 0.255
5.738LeuAsp: 5.738 ± 0.605
7.193LeuGlu: 7.193 ± 0.982
2.667LeuPhe: 2.667 ± 0.389
5.576LeuGly: 5.576 ± 1.242
0.97LeuHis: 0.97 ± 0.282
3.879LeuIle: 3.879 ± 0.506
7.354LeuLys: 7.354 ± 0.798
6.95LeuLeu: 6.95 ± 1.062
2.424LeuMet: 2.424 ± 0.447
2.909LeuAsn: 2.909 ± 0.604
3.071LeuPro: 3.071 ± 0.635
3.394LeuGln: 3.394 ± 0.655
3.879LeuArg: 3.879 ± 0.558
5.576LeuSer: 5.576 ± 0.975
5.899LeuThr: 5.899 ± 0.838
4.526LeuVal: 4.526 ± 0.665
0.808LeuTrp: 0.808 ± 0.201
2.586LeuTyr: 2.586 ± 0.313
0.0LeuXaa: 0.0 ± 0.0
Met
1.94MetAla: 1.94 ± 0.5
0.0MetCys: 0.0 ± 0.0
1.455MetAsp: 1.455 ± 0.321
2.344MetGlu: 2.344 ± 0.563
1.131MetPhe: 1.131 ± 0.271
1.051MetGly: 1.051 ± 0.41
0.242MetHis: 0.242 ± 0.157
1.94MetIle: 1.94 ± 0.511
2.02MetLys: 2.02 ± 0.413
1.859MetLeu: 1.859 ± 0.393
0.566MetMet: 0.566 ± 0.226
1.616MetAsn: 1.616 ± 0.458
1.212MetPro: 1.212 ± 0.364
0.647MetGln: 0.647 ± 0.243
1.374MetArg: 1.374 ± 0.364
1.616MetSer: 1.616 ± 0.366
1.778MetThr: 1.778 ± 0.402
1.94MetVal: 1.94 ± 0.363
0.323MetTrp: 0.323 ± 0.169
0.889MetTyr: 0.889 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
4.687AsnAla: 4.687 ± 0.815
0.242AsnCys: 0.242 ± 0.127
2.505AsnAsp: 2.505 ± 0.563
2.667AsnGlu: 2.667 ± 0.612
2.101AsnPhe: 2.101 ± 0.482
4.041AsnGly: 4.041 ± 0.715
0.97AsnHis: 0.97 ± 0.302
3.071AsnIle: 3.071 ± 0.471
4.526AsnLys: 4.526 ± 0.556
4.687AsnLeu: 4.687 ± 0.528
1.212AsnMet: 1.212 ± 0.382
2.182AsnAsn: 2.182 ± 0.517
1.778AsnPro: 1.778 ± 0.32
2.909AsnGln: 2.909 ± 0.601
2.748AsnArg: 2.748 ± 0.545
3.233AsnSer: 3.233 ± 0.669
2.344AsnThr: 2.344 ± 0.458
3.071AsnVal: 3.071 ± 0.415
0.889AsnTrp: 0.889 ± 0.251
1.778AsnTyr: 1.778 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
2.02ProAla: 2.02 ± 0.464
0.081ProCys: 0.081 ± 0.082
1.616ProAsp: 1.616 ± 0.304
3.475ProGlu: 3.475 ± 0.453
0.727ProPhe: 0.727 ± 0.294
1.212ProGly: 1.212 ± 0.238
0.323ProHis: 0.323 ± 0.139
2.101ProIle: 2.101 ± 0.574
2.99ProLys: 2.99 ± 0.463
1.455ProLeu: 1.455 ± 0.41
0.647ProMet: 0.647 ± 0.225
1.616ProAsn: 1.616 ± 0.529
0.485ProPro: 0.485 ± 0.234
0.97ProGln: 0.97 ± 0.31
1.293ProArg: 1.293 ± 0.293
1.616ProSer: 1.616 ± 0.416
0.889ProThr: 0.889 ± 0.287
2.101ProVal: 2.101 ± 0.419
0.404ProTrp: 0.404 ± 0.205
1.697ProTyr: 1.697 ± 0.477
0.0ProXaa: 0.0 ± 0.0
Gln
3.798GlnAla: 3.798 ± 0.542
0.081GlnCys: 0.081 ± 0.074
1.697GlnAsp: 1.697 ± 0.351
3.313GlnGlu: 3.313 ± 0.7
1.616GlnPhe: 1.616 ± 0.365
1.778GlnGly: 1.778 ± 0.419
0.566GlnHis: 0.566 ± 0.248
2.99GlnIle: 2.99 ± 0.573
3.879GlnLys: 3.879 ± 0.618
3.071GlnLeu: 3.071 ± 0.516
0.97GlnMet: 0.97 ± 0.229
1.697GlnAsn: 1.697 ± 0.36
1.131GlnPro: 1.131 ± 0.351
1.455GlnGln: 1.455 ± 0.397
2.02GlnArg: 2.02 ± 0.41
2.505GlnSer: 2.505 ± 0.371
2.909GlnThr: 2.909 ± 0.552
3.637GlnVal: 3.637 ± 0.51
0.485GlnTrp: 0.485 ± 0.168
1.051GlnTyr: 1.051 ± 0.35
0.0GlnXaa: 0.0 ± 0.0
Arg
3.071ArgAla: 3.071 ± 0.47
0.404ArgCys: 0.404 ± 0.182
2.748ArgAsp: 2.748 ± 0.543
2.909ArgGlu: 2.909 ± 0.479
1.535ArgPhe: 1.535 ± 0.382
2.02ArgGly: 2.02 ± 0.331
0.808ArgHis: 0.808 ± 0.336
2.667ArgIle: 2.667 ± 0.508
2.829ArgLys: 2.829 ± 0.717
4.202ArgLeu: 4.202 ± 0.7
2.182ArgMet: 2.182 ± 0.44
2.909ArgAsn: 2.909 ± 0.537
1.051ArgPro: 1.051 ± 0.243
2.586ArgGln: 2.586 ± 0.409
2.99ArgArg: 2.99 ± 0.695
2.586ArgSer: 2.586 ± 0.461
2.667ArgThr: 2.667 ± 0.555
2.667ArgVal: 2.667 ± 0.501
0.485ArgTrp: 0.485 ± 0.166
2.02ArgTyr: 2.02 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
4.606SerAla: 4.606 ± 1.044
0.162SerCys: 0.162 ± 0.131
3.717SerAsp: 3.717 ± 0.507
4.445SerGlu: 4.445 ± 0.601
1.778SerPhe: 1.778 ± 0.42
5.172SerGly: 5.172 ± 0.786
1.374SerHis: 1.374 ± 0.508
4.041SerIle: 4.041 ± 0.732
4.93SerLys: 4.93 ± 0.789
5.334SerLeu: 5.334 ± 0.593
1.212SerMet: 1.212 ± 0.357
3.071SerAsn: 3.071 ± 0.639
1.455SerPro: 1.455 ± 0.304
2.586SerGln: 2.586 ± 0.433
2.586SerArg: 2.586 ± 0.542
3.637SerSer: 3.637 ± 0.841
4.526SerThr: 4.526 ± 0.479
3.233SerVal: 3.233 ± 0.802
0.97SerTrp: 0.97 ± 0.306
2.667SerTyr: 2.667 ± 0.498
0.0SerXaa: 0.0 ± 0.0
Thr
4.687ThrAla: 4.687 ± 0.894
0.162ThrCys: 0.162 ± 0.131
3.717ThrAsp: 3.717 ± 0.514
4.526ThrGlu: 4.526 ± 0.592
3.313ThrPhe: 3.313 ± 0.56
4.526ThrGly: 4.526 ± 0.669
0.889ThrHis: 0.889 ± 0.306
5.253ThrIle: 5.253 ± 0.584
5.011ThrLys: 5.011 ± 0.697
4.283ThrLeu: 4.283 ± 0.705
0.808ThrMet: 0.808 ± 0.279
3.556ThrAsn: 3.556 ± 0.451
1.778ThrPro: 1.778 ± 0.539
2.748ThrGln: 2.748 ± 0.566
1.697ThrArg: 1.697 ± 0.289
3.96ThrSer: 3.96 ± 0.673
4.93ThrThr: 4.93 ± 0.831
4.041ThrVal: 4.041 ± 0.605
0.566ThrTrp: 0.566 ± 0.247
2.748ThrTyr: 2.748 ± 0.592
0.0ThrXaa: 0.0 ± 0.0
Val
5.495ValAla: 5.495 ± 0.657
0.242ValCys: 0.242 ± 0.132
3.96ValAsp: 3.96 ± 0.614
5.091ValGlu: 5.091 ± 0.594
2.02ValPhe: 2.02 ± 0.445
4.849ValGly: 4.849 ± 0.751
0.889ValHis: 0.889 ± 0.32
3.798ValIle: 3.798 ± 0.617
5.172ValLys: 5.172 ± 0.721
4.041ValLeu: 4.041 ± 0.61
0.97ValMet: 0.97 ± 0.31
3.717ValAsn: 3.717 ± 0.746
1.94ValPro: 1.94 ± 0.341
1.455ValGln: 1.455 ± 0.41
2.505ValArg: 2.505 ± 0.357
4.606ValSer: 4.606 ± 0.568
4.364ValThr: 4.364 ± 0.677
3.96ValVal: 3.96 ± 0.706
0.485ValTrp: 0.485 ± 0.18
2.344ValTyr: 2.344 ± 0.513
0.0ValXaa: 0.0 ± 0.0
Trp
1.293TrpAla: 1.293 ± 0.319
0.162TrpCys: 0.162 ± 0.1
0.97TrpAsp: 0.97 ± 0.328
1.051TrpGlu: 1.051 ± 0.362
1.051TrpPhe: 1.051 ± 0.443
0.727TrpGly: 0.727 ± 0.23
0.0TrpHis: 0.0 ± 0.0
0.485TrpIle: 0.485 ± 0.199
1.212TrpLys: 1.212 ± 0.341
0.647TrpLeu: 0.647 ± 0.279
0.323TrpMet: 0.323 ± 0.169
0.889TrpAsn: 0.889 ± 0.297
0.081TrpPro: 0.081 ± 0.082
1.051TrpGln: 1.051 ± 0.345
0.162TrpArg: 0.162 ± 0.117
0.323TrpSer: 0.323 ± 0.154
0.889TrpThr: 0.889 ± 0.27
1.293TrpVal: 1.293 ± 0.287
0.081TrpTrp: 0.081 ± 0.066
0.727TrpTyr: 0.727 ± 0.539
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.616TyrAla: 1.616 ± 0.318
0.404TyrCys: 0.404 ± 0.16
2.505TyrAsp: 2.505 ± 0.465
2.586TyrGlu: 2.586 ± 0.553
1.697TyrPhe: 1.697 ± 0.415
2.263TyrGly: 2.263 ± 0.448
0.97TyrHis: 0.97 ± 0.275
2.586TyrIle: 2.586 ± 0.587
3.556TyrLys: 3.556 ± 0.631
3.152TyrLeu: 3.152 ± 0.555
0.485TyrMet: 0.485 ± 0.271
1.455TyrAsn: 1.455 ± 0.367
2.101TyrPro: 2.101 ± 0.515
2.101TyrGln: 2.101 ± 0.414
2.02TyrArg: 2.02 ± 0.525
2.586TyrSer: 2.586 ± 0.547
2.263TyrThr: 2.263 ± 0.497
2.263TyrVal: 2.263 ± 0.566
0.323TyrTrp: 0.323 ± 0.158
1.455TyrTyr: 1.455 ± 0.49
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski