Amino acid dipeptide frequency for Streptococcus phage P7152

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
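A minimal sketch of how such a frequency could be computed is given here. It assumes overlapping windows of two residues counted within each protein and normalisation by the total number of dipeptides; the exact denominator used for the table is not stated on this page, so treat that choice as an assumption. The function name and the toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code (hypothetical, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MKKLA", "MKA"])
```

The pairs are ordered, so "MK" and "KM" are counted separately, which is why the table is not symmetric (AlaCys and CysAla have different values).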

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
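As a short worked example of the unit conversion and of the uniform expectation, the sketch below converts per mille values to fractions and compares them with 1/400. The helper names are hypothetical, and the page does not state the exact rule used for the red and blue marking, so the simple comparison against 2.5‰ is an assumption. The two example values (LysGlu and CysMet) are taken from the table.

```python
ALPHABET_SIZE = 20  # standard amino acids, Xaa excluded

# Uniform expectation: each of the 20 * 20 = 400 dipeptides equally likely.
expected_fraction = 1.0 / ALPHABET_SIZE ** 2      # 0.0025, i.e. 0.25 %
expected_per_mille = expected_fraction * 1000.0   # 2.5 per mille

def per_mille_to_fraction(value_per_mille):
    """Convert a table value (per mille) to a plain fraction."""
    return value_per_mille * 1e-3

def classify(value_per_mille, expected=expected_per_mille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

lys_glu = 7.289  # LysGlu, from the table
cys_met = 0.091  # CysMet, from the table
labels = (classify(lys_glu), classify(cys_met))
```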

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.642AlaAla: 2.642 ± 1.03
0.182AlaCys: 0.182 ± 0.136
4.555AlaAsp: 4.555 ± 0.591
3.098AlaGlu: 3.098 ± 0.482
1.822AlaPhe: 1.822 ± 0.543
3.644AlaGly: 3.644 ± 0.856
0.729AlaHis: 0.729 ± 0.255
5.011AlaIle: 5.011 ± 0.907
5.831AlaLys: 5.831 ± 0.968
6.195AlaLeu: 6.195 ± 0.885
1.64AlaMet: 1.64 ± 0.396
4.92AlaAsn: 4.92 ± 0.843
1.549AlaPro: 1.549 ± 0.307
2.733AlaGln: 2.733 ± 0.622
2.642AlaArg: 2.642 ± 0.407
4.738AlaSer: 4.738 ± 0.687
4.647AlaThr: 4.647 ± 0.835
3.644AlaVal: 3.644 ± 0.683
1.276AlaTrp: 1.276 ± 0.301
2.369AlaTyr: 2.369 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.273CysAla: 0.273 ± 0.138
0.0CysCys: 0.0 ± 0.0
0.82CysAsp: 0.82 ± 0.293
0.182CysGlu: 0.182 ± 0.128
0.547CysPhe: 0.547 ± 0.269
0.182CysGly: 0.182 ± 0.133
0.182CysHis: 0.182 ± 0.135
0.273CysIle: 0.273 ± 0.167
0.273CysLys: 0.273 ± 0.153
0.547CysLeu: 0.547 ± 0.3
0.091CysMet: 0.091 ± 0.09
0.364CysAsn: 0.364 ± 0.194
0.273CysPro: 0.273 ± 0.154
0.273CysGln: 0.273 ± 0.171
0.364CysArg: 0.364 ± 0.21
0.456CysSer: 0.456 ± 0.237
0.273CysThr: 0.273 ± 0.175
0.091CysVal: 0.091 ± 0.08
0.182CysTrp: 0.182 ± 0.135
0.182CysTyr: 0.182 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
3.735AspAla: 3.735 ± 0.701
0.364AspCys: 0.364 ± 0.189
4.1AspAsp: 4.1 ± 0.777
4.555AspGlu: 4.555 ± 0.642
3.098AspPhe: 3.098 ± 0.624
6.195AspGly: 6.195 ± 0.934
1.276AspHis: 1.276 ± 0.323
5.011AspIle: 5.011 ± 0.774
5.558AspLys: 5.558 ± 0.584
3.553AspLeu: 3.553 ± 0.896
2.369AspMet: 2.369 ± 0.424
4.009AspAsn: 4.009 ± 0.681
2.46AspPro: 2.46 ± 0.479
1.549AspGln: 1.549 ± 0.315
2.278AspArg: 2.278 ± 0.571
3.735AspSer: 3.735 ± 0.517
4.191AspThr: 4.191 ± 0.654
3.918AspVal: 3.918 ± 0.803
1.184AspTrp: 1.184 ± 0.267
3.189AspTyr: 3.189 ± 0.601
0.0AspXaa: 0.0 ± 0.0
Glu
4.282GluAla: 4.282 ± 0.609
0.273GluCys: 0.273 ± 0.148
3.189GluAsp: 3.189 ± 0.626
4.464GluGlu: 4.464 ± 0.758
2.915GluPhe: 2.915 ± 0.599
3.098GluGly: 3.098 ± 0.472
1.184GluHis: 1.184 ± 0.339
6.013GluIle: 6.013 ± 0.866
4.373GluLys: 4.373 ± 0.826
6.469GluLeu: 6.469 ± 0.766
2.278GluMet: 2.278 ± 0.52
4.373GluAsn: 4.373 ± 0.807
1.913GluPro: 1.913 ± 0.525
2.278GluGln: 2.278 ± 0.408
3.189GluArg: 3.189 ± 0.516
3.371GluSer: 3.371 ± 0.471
3.827GluThr: 3.827 ± 0.609
4.464GluVal: 4.464 ± 0.59
0.911GluTrp: 0.911 ± 0.312
3.28GluTyr: 3.28 ± 0.559
0.0GluXaa: 0.0 ± 0.0
Phe
3.189PheAla: 3.189 ± 0.532
0.182PheCys: 0.182 ± 0.132
3.553PheAsp: 3.553 ± 0.561
2.278PheGlu: 2.278 ± 0.437
2.095PhePhe: 2.095 ± 0.51
3.189PheGly: 3.189 ± 0.716
0.547PheHis: 0.547 ± 0.169
2.824PheIle: 2.824 ± 0.676
3.553PheLys: 3.553 ± 0.541
3.189PheLeu: 3.189 ± 0.604
0.364PheMet: 0.364 ± 0.189
3.735PheAsn: 3.735 ± 0.775
0.547PhePro: 0.547 ± 0.225
1.184PheGln: 1.184 ± 0.302
1.458PheArg: 1.458 ± 0.317
2.642PheSer: 2.642 ± 0.511
2.915PheThr: 2.915 ± 0.553
2.642PheVal: 2.642 ± 0.367
0.638PheTrp: 0.638 ± 0.239
1.822PheTyr: 1.822 ± 0.431
0.0PheXaa: 0.0 ± 0.0
Gly
3.007GlyAla: 3.007 ± 0.676
0.456GlyCys: 0.456 ± 0.184
4.464GlyAsp: 4.464 ± 0.524
4.191GlyGlu: 4.191 ± 0.67
3.462GlyPhe: 3.462 ± 0.456
4.464GlyGly: 4.464 ± 0.936
0.82GlyHis: 0.82 ± 0.25
5.193GlyIle: 5.193 ± 0.807
6.651GlyLys: 6.651 ± 0.794
5.466GlyLeu: 5.466 ± 0.849
1.184GlyMet: 1.184 ± 0.336
3.827GlyAsn: 3.827 ± 0.6
0.456GlyPro: 0.456 ± 0.196
3.007GlyGln: 3.007 ± 0.565
3.007GlyArg: 3.007 ± 0.565
4.464GlySer: 4.464 ± 0.803
3.918GlyThr: 3.918 ± 0.639
3.918GlyVal: 3.918 ± 0.744
1.367GlyTrp: 1.367 ± 0.299
2.824GlyTyr: 2.824 ± 0.518
0.0GlyXaa: 0.0 ± 0.0
His
0.364HisAla: 0.364 ± 0.213
0.182HisCys: 0.182 ± 0.143
0.911HisAsp: 0.911 ± 0.286
0.456HisGlu: 0.456 ± 0.239
0.729HisPhe: 0.729 ± 0.234
1.002HisGly: 1.002 ± 0.292
0.547HisHis: 0.547 ± 0.186
1.276HisIle: 1.276 ± 0.314
1.002HisLys: 1.002 ± 0.258
1.549HisLeu: 1.549 ± 0.295
0.456HisMet: 0.456 ± 0.204
0.638HisAsn: 0.638 ± 0.339
0.729HisPro: 0.729 ± 0.248
0.638HisGln: 0.638 ± 0.243
0.729HisArg: 0.729 ± 0.253
0.911HisSer: 0.911 ± 0.327
1.002HisThr: 1.002 ± 0.384
1.549HisVal: 1.549 ± 0.321
0.182HisTrp: 0.182 ± 0.149
0.911HisTyr: 0.911 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
5.284IleAla: 5.284 ± 0.961
0.456IleCys: 0.456 ± 0.219
5.922IleAsp: 5.922 ± 0.674
4.555IleGlu: 4.555 ± 0.755
1.913IlePhe: 1.913 ± 0.569
4.829IleGly: 4.829 ± 0.489
0.638IleHis: 0.638 ± 0.335
3.553IleIle: 3.553 ± 0.741
6.742IleLys: 6.742 ± 0.77
4.373IleLeu: 4.373 ± 0.79
2.095IleMet: 2.095 ± 0.545
4.009IleAsn: 4.009 ± 0.556
2.915IlePro: 2.915 ± 0.529
2.642IleGln: 2.642 ± 0.433
2.915IleArg: 2.915 ± 0.482
4.373IleSer: 4.373 ± 0.495
3.735IleThr: 3.735 ± 0.586
3.098IleVal: 3.098 ± 0.522
0.729IleTrp: 0.729 ± 0.209
2.369IleTyr: 2.369 ± 0.559
0.0IleXaa: 0.0 ± 0.0
Lys
5.74LysAla: 5.74 ± 0.577
0.456LysCys: 0.456 ± 0.23
4.464LysAsp: 4.464 ± 0.737
7.289LysGlu: 7.289 ± 0.875
4.009LysPhe: 4.009 ± 0.854
5.922LysGly: 5.922 ± 0.847
1.731LysHis: 1.731 ± 0.641
5.011LysIle: 5.011 ± 0.609
7.106LysLys: 7.106 ± 1.279
6.924LysLeu: 6.924 ± 0.887
1.64LysMet: 1.64 ± 0.562
4.282LysAsn: 4.282 ± 0.581
2.915LysPro: 2.915 ± 0.371
3.827LysGln: 3.827 ± 0.518
3.462LysArg: 3.462 ± 0.506
4.464LysSer: 4.464 ± 0.58
6.104LysThr: 6.104 ± 0.651
4.738LysVal: 4.738 ± 0.67
0.911LysTrp: 0.911 ± 0.285
3.007LysTyr: 3.007 ± 0.615
0.0LysXaa: 0.0 ± 0.0
Leu
6.469LeuAla: 6.469 ± 0.637
0.638LeuCys: 0.638 ± 0.297
6.104LeuAsp: 6.104 ± 0.699
6.469LeuGlu: 6.469 ± 1.058
3.098LeuPhe: 3.098 ± 0.377
5.102LeuGly: 5.102 ± 0.953
0.911LeuHis: 0.911 ± 0.324
4.1LeuIle: 4.1 ± 0.603
7.198LeuLys: 7.198 ± 0.7
5.193LeuLeu: 5.193 ± 0.806
2.642LeuMet: 2.642 ± 0.465
5.284LeuAsn: 5.284 ± 0.679
2.915LeuPro: 2.915 ± 0.476
3.007LeuGln: 3.007 ± 0.565
3.918LeuArg: 3.918 ± 0.776
4.555LeuSer: 4.555 ± 0.745
6.286LeuThr: 6.286 ± 0.844
4.009LeuVal: 4.009 ± 0.521
0.729LeuTrp: 0.729 ± 0.286
1.822LeuTyr: 1.822 ± 0.478
0.0LeuXaa: 0.0 ± 0.0
Met
2.278MetAla: 2.278 ± 0.46
0.091MetCys: 0.091 ± 0.081
0.911MetAsp: 0.911 ± 0.286
1.367MetGlu: 1.367 ± 0.404
1.367MetPhe: 1.367 ± 0.324
0.547MetGly: 0.547 ± 0.241
0.364MetHis: 0.364 ± 0.203
1.458MetIle: 1.458 ± 0.302
2.642MetLys: 2.642 ± 0.49
1.731MetLeu: 1.731 ± 0.292
0.364MetMet: 0.364 ± 0.233
1.184MetAsn: 1.184 ± 0.269
0.911MetPro: 0.911 ± 0.233
0.82MetGln: 0.82 ± 0.279
0.82MetArg: 0.82 ± 0.215
2.187MetSer: 2.187 ± 0.472
2.004MetThr: 2.004 ± 0.459
2.004MetVal: 2.004 ± 0.446
0.091MetTrp: 0.091 ± 0.08
0.82MetTyr: 0.82 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
4.373AsnAla: 4.373 ± 1.193
0.364AsnCys: 0.364 ± 0.158
3.644AsnAsp: 3.644 ± 0.541
4.1AsnGlu: 4.1 ± 0.669
2.551AsnPhe: 2.551 ± 0.509
6.378AsnGly: 6.378 ± 1.16
1.184AsnHis: 1.184 ± 0.351
4.373AsnIle: 4.373 ± 0.732
3.827AsnLys: 3.827 ± 0.473
5.102AsnLeu: 5.102 ± 0.641
1.276AsnMet: 1.276 ± 0.313
4.92AsnAsn: 4.92 ± 0.966
3.098AsnPro: 3.098 ± 0.549
2.915AsnGln: 2.915 ± 0.464
2.095AsnArg: 2.095 ± 0.553
3.189AsnSer: 3.189 ± 0.557
3.371AsnThr: 3.371 ± 0.524
3.644AsnVal: 3.644 ± 0.45
1.458AsnTrp: 1.458 ± 0.311
1.549AsnTyr: 1.549 ± 0.514
0.0AsnXaa: 0.0 ± 0.0
Pro
1.64ProAla: 1.64 ± 0.421
0.0ProCys: 0.0 ± 0.0
1.64ProAsp: 1.64 ± 0.517
2.733ProGlu: 2.733 ± 0.48
1.367ProPhe: 1.367 ± 0.32
0.911ProGly: 0.911 ± 0.263
0.456ProHis: 0.456 ± 0.189
1.458ProIle: 1.458 ± 0.338
3.371ProLys: 3.371 ± 0.597
2.551ProLeu: 2.551 ± 0.442
0.273ProMet: 0.273 ± 0.172
2.551ProAsn: 2.551 ± 0.46
0.638ProPro: 0.638 ± 0.348
1.093ProGln: 1.093 ± 0.322
0.638ProArg: 0.638 ± 0.242
3.007ProSer: 3.007 ± 0.498
2.095ProThr: 2.095 ± 0.404
1.731ProVal: 1.731 ± 0.417
0.547ProTrp: 0.547 ± 0.2
0.729ProTyr: 0.729 ± 0.307
0.0ProXaa: 0.0 ± 0.0
Gln
4.009GlnAla: 4.009 ± 0.615
0.273GlnCys: 0.273 ± 0.132
1.822GlnAsp: 1.822 ± 0.399
2.733GlnGlu: 2.733 ± 0.51
1.367GlnPhe: 1.367 ± 0.397
2.915GlnGly: 2.915 ± 0.619
0.456GlnHis: 0.456 ± 0.173
2.733GlnIle: 2.733 ± 0.543
3.462GlnLys: 3.462 ± 0.634
3.553GlnLeu: 3.553 ± 0.461
1.002GlnMet: 1.002 ± 0.311
2.642GlnAsn: 2.642 ± 0.489
0.364GlnPro: 0.364 ± 0.203
2.642GlnGln: 2.642 ± 0.676
1.64GlnArg: 1.64 ± 0.342
2.733GlnSer: 2.733 ± 0.468
2.187GlnThr: 2.187 ± 0.463
1.822GlnVal: 1.822 ± 0.534
0.456GlnTrp: 0.456 ± 0.194
1.913GlnTyr: 1.913 ± 0.427
0.0GlnXaa: 0.0 ± 0.0
Arg
2.278ArgAla: 2.278 ± 0.378
0.091ArgCys: 0.091 ± 0.092
2.369ArgAsp: 2.369 ± 0.519
2.46ArgGlu: 2.46 ± 0.526
2.187ArgPhe: 2.187 ± 0.529
2.733ArgGly: 2.733 ± 0.605
1.002ArgHis: 1.002 ± 0.291
3.371ArgIle: 3.371 ± 0.608
2.733ArgLys: 2.733 ± 0.484
3.553ArgLeu: 3.553 ± 0.651
1.184ArgMet: 1.184 ± 0.333
2.915ArgAsn: 2.915 ± 0.41
1.093ArgPro: 1.093 ± 0.301
2.095ArgGln: 2.095 ± 0.481
1.458ArgArg: 1.458 ± 0.363
1.822ArgSer: 1.822 ± 0.342
2.46ArgThr: 2.46 ± 0.542
2.915ArgVal: 2.915 ± 0.662
0.82ArgTrp: 0.82 ± 0.213
1.913ArgTyr: 1.913 ± 0.417
0.0ArgXaa: 0.0 ± 0.0
Ser
3.098SerAla: 3.098 ± 0.471
0.364SerCys: 0.364 ± 0.183
4.829SerAsp: 4.829 ± 0.596
3.189SerGlu: 3.189 ± 0.55
2.369SerPhe: 2.369 ± 0.413
4.282SerGly: 4.282 ± 0.599
0.638SerHis: 0.638 ± 0.231
4.464SerIle: 4.464 ± 0.521
5.193SerLys: 5.193 ± 0.702
5.284SerLeu: 5.284 ± 0.635
1.822SerMet: 1.822 ± 0.372
4.1SerAsn: 4.1 ± 0.613
1.64SerPro: 1.64 ± 0.366
2.642SerGln: 2.642 ± 0.424
2.733SerArg: 2.733 ± 0.696
3.371SerSer: 3.371 ± 0.606
4.1SerThr: 4.1 ± 0.652
5.193SerVal: 5.193 ± 0.779
0.729SerTrp: 0.729 ± 0.266
2.187SerTyr: 2.187 ± 0.601
0.0SerXaa: 0.0 ± 0.0
Thr
4.464ThrAla: 4.464 ± 0.901
0.364ThrCys: 0.364 ± 0.185
4.282ThrAsp: 4.282 ± 0.691
3.735ThrGlu: 3.735 ± 0.486
2.733ThrPhe: 2.733 ± 0.657
3.918ThrGly: 3.918 ± 0.564
1.458ThrHis: 1.458 ± 0.283
4.282ThrIle: 4.282 ± 0.78
5.74ThrLys: 5.74 ± 0.679
6.469ThrLeu: 6.469 ± 0.881
0.911ThrMet: 0.911 ± 0.274
3.553ThrAsn: 3.553 ± 0.468
1.731ThrPro: 1.731 ± 0.549
2.551ThrGln: 2.551 ± 0.481
1.913ThrArg: 1.913 ± 0.372
3.553ThrSer: 3.553 ± 0.61
3.827ThrThr: 3.827 ± 0.691
4.647ThrVal: 4.647 ± 0.583
1.184ThrTrp: 1.184 ± 0.374
3.462ThrTyr: 3.462 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
3.918ValAla: 3.918 ± 0.884
0.456ValCys: 0.456 ± 0.203
4.829ValAsp: 4.829 ± 0.619
4.373ValGlu: 4.373 ± 0.771
2.46ValPhe: 2.46 ± 0.475
4.282ValGly: 4.282 ± 0.52
0.547ValHis: 0.547 ± 0.19
4.282ValIle: 4.282 ± 0.587
5.375ValLys: 5.375 ± 0.684
4.009ValLeu: 4.009 ± 1.097
1.093ValMet: 1.093 ± 0.378
3.827ValAsn: 3.827 ± 0.746
1.731ValPro: 1.731 ± 0.361
1.64ValGln: 1.64 ± 0.348
2.278ValArg: 2.278 ± 0.487
4.647ValSer: 4.647 ± 0.659
5.193ValThr: 5.193 ± 0.745
4.1ValVal: 4.1 ± 0.736
1.093ValTrp: 1.093 ± 0.235
1.731ValTyr: 1.731 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.456TrpAla: 0.456 ± 0.177
0.091TrpCys: 0.091 ± 0.091
1.458TrpAsp: 1.458 ± 0.484
0.911TrpGlu: 0.911 ± 0.264
0.729TrpPhe: 0.729 ± 0.268
0.456TrpGly: 0.456 ± 0.148
0.364TrpHis: 0.364 ± 0.202
0.456TrpIle: 0.456 ± 0.215
0.729TrpLys: 0.729 ± 0.285
1.64TrpLeu: 1.64 ± 0.388
0.273TrpMet: 0.273 ± 0.146
0.638TrpAsn: 0.638 ± 0.223
0.273TrpPro: 0.273 ± 0.148
0.82TrpGln: 0.82 ± 0.255
0.911TrpArg: 0.911 ± 0.235
1.731TrpSer: 1.731 ± 0.613
1.002TrpThr: 1.002 ± 0.244
1.276TrpVal: 1.276 ± 0.235
0.182TrpTrp: 0.182 ± 0.118
0.456TrpTyr: 0.456 ± 0.272
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.369TyrAla: 2.369 ± 0.332
0.547TyrCys: 0.547 ± 0.296
2.46TyrAsp: 2.46 ± 0.523
3.007TyrGlu: 3.007 ± 0.632
1.731TyrPhe: 1.731 ± 0.314
2.095TyrGly: 2.095 ± 0.549
0.729TyrHis: 0.729 ± 0.235
2.004TyrIle: 2.004 ± 0.397
2.733TyrLys: 2.733 ± 0.487
3.098TyrLeu: 3.098 ± 0.497
0.911TyrMet: 0.911 ± 0.293
1.64TyrAsn: 1.64 ± 0.326
1.276TyrPro: 1.276 ± 0.371
2.369TyrGln: 2.369 ± 0.316
3.007TyrArg: 3.007 ± 0.58
2.278TyrSer: 2.278 ± 0.526
1.64TyrThr: 1.64 ± 0.397
2.46TyrVal: 2.46 ± 0.337
0.182TyrTrp: 0.182 ± 0.127
2.551TyrTyr: 2.551 ± 0.619
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10977 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
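The bootstrap mentioned in the note can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under assumptions; the page does not say which spread measure is used, so the sample standard deviation here is a guess, and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)

# Toy input in one-letter code (hypothetical, not the phage proteome).
toy = ["MKKLA", "MKA", "AAKK"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.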

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski