Amino acid dipeptide frequency for Streptococcus phage Javan316

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
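The calculation described above can be sketched in a few lines of Python. This is only an illustration: the function names, the toy sequences, the one-letter residue codes, and the choice to normalise by the total number of overlapping pairs are assumptions, not the procedure used by Proteome-pI.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Hypothetical toy sequences (one-letter codes), not taken from the phage proteome.
toy_proteins = ["MKKLA", "AKKE"]
freqs = dipeptide_permille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With such short toy input every observed pair is far above 2.5‰; on a real proteome the same comparison separates values such as AlaAla (4.639‰, overrepresented) from AlaCys (0.273‰, underrepresented).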

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
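As a small worked example of the unit conversion, the sketch below turns two values from the table into fractions and percentages so they can be compared with the 0.25% uniform expectation. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


# Values copied from the table: AlaAla and AlaCys.
for name, value in [("AlaAla", 4.639), ("AlaCys", 0.273)]:
    print(name, permille_to_fraction(value), permille_to_percent(value))
```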

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.639 ± 1.469
AlaCys: 0.273 ± 0.198
AlaAsp: 4.457 ± 0.616
AlaGlu: 6.368 ± 0.721
AlaPhe: 3.093 ± 0.671
AlaGly: 5.185 ± 1.565
AlaHis: 1.183 ± 0.32
AlaIle: 5.367 ± 0.868
AlaLys: 6.641 ± 0.959
AlaLeu: 6.095 ± 1.503
AlaMet: 2.82 ± 0.604
AlaAsn: 3.639 ± 0.637
AlaPro: 1.183 ± 0.265
AlaGln: 2.547 ± 0.582
AlaArg: 2.547 ± 0.567
AlaSer: 4.73 ± 0.896
AlaThr: 3.275 ± 0.87
AlaVal: 4.912 ± 1.204
AlaTrp: 0.273 ± 0.173
AlaTyr: 2.547 ± 0.443
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.364 ± 0.141
CysCys: 0.0 ± 0.0
CysAsp: 0.364 ± 0.204
CysGlu: 0.91 ± 0.403
CysPhe: 0.182 ± 0.13
CysGly: 0.819 ± 0.361
CysHis: 0.273 ± 0.157
CysIle: 0.182 ± 0.157
CysLys: 0.455 ± 0.2
CysLeu: 0.728 ± 0.236
CysMet: 0.0 ± 0.0
CysAsn: 0.091 ± 0.086
CysPro: 0.091 ± 0.103
CysGln: 0.091 ± 0.098
CysArg: 0.091 ± 0.086
CysSer: 0.182 ± 0.108
CysThr: 0.091 ± 0.082
CysVal: 0.273 ± 0.15
CysTrp: 0.0 ± 0.0
CysTyr: 0.091 ± 0.088
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.184 ± 0.497
AspCys: 0.364 ± 0.212
AspAsp: 4.366 ± 0.792
AspGlu: 4.912 ± 0.712
AspPhe: 3.457 ± 0.69
AspGly: 5.822 ± 0.892
AspHis: 0.819 ± 0.239
AspIle: 5.458 ± 0.674
AspLys: 4.457 ± 0.694
AspLeu: 6.095 ± 0.582
AspMet: 2.001 ± 0.376
AspAsn: 3.457 ± 0.417
AspPro: 1.365 ± 0.379
AspGln: 0.819 ± 0.246
AspArg: 2.183 ± 0.425
AspSer: 3.275 ± 0.556
AspThr: 3.912 ± 0.375
AspVal: 4.366 ± 0.565
AspTrp: 0.364 ± 0.176
AspTyr: 2.82 ± 0.548
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.185 ± 0.93
GluCys: 0.728 ± 0.284
GluAsp: 4.366 ± 0.747
GluGlu: 7.004 ± 1.17
GluPhe: 2.82 ± 0.536
GluGly: 4.003 ± 0.409
GluHis: 1.274 ± 0.426
GluIle: 5.549 ± 1.015
GluLys: 5.549 ± 0.843
GluLeu: 8.278 ± 1.054
GluMet: 2.092 ± 0.482
GluAsn: 4.184 ± 0.713
GluPro: 2.092 ± 0.572
GluGln: 5.185 ± 0.875
GluArg: 4.548 ± 0.845
GluSer: 3.912 ± 0.691
GluThr: 4.184 ± 0.537
GluVal: 5.185 ± 0.826
GluTrp: 0.728 ± 0.241
GluTyr: 2.183 ± 0.45
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.91 ± 0.601
PheCys: 0.182 ± 0.128
PheAsp: 4.184 ± 0.555
PheGlu: 4.094 ± 0.865
PhePhe: 2.092 ± 0.61
PheGly: 2.729 ± 0.535
PheHis: 0.364 ± 0.183
PheIle: 2.82 ± 0.586
PheLys: 2.911 ± 0.543
PheLeu: 2.729 ± 0.399
PheMet: 1.365 ± 0.396
PheAsn: 2.274 ± 0.496
PhePro: 0.546 ± 0.179
PheGln: 2.001 ± 0.391
PheArg: 2.274 ± 0.521
PheSer: 2.638 ± 0.417
PheThr: 1.819 ± 0.45
PheVal: 3.457 ± 0.516
PheTrp: 0.273 ± 0.167
PheTyr: 1.455 ± 0.351
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.367 ± 1.884
GlyCys: 0.364 ± 0.246
GlyAsp: 3.73 ± 0.575
GlyGlu: 3.275 ± 0.559
GlyPhe: 4.094 ± 0.622
GlyGly: 4.184 ± 0.69
GlyHis: 1.455 ± 0.365
GlyIle: 5.549 ± 1.035
GlyLys: 5.094 ± 0.785
GlyLeu: 6.186 ± 1.034
GlyMet: 1.274 ± 0.384
GlyAsn: 3.639 ± 0.655
GlyPro: 1.001 ± 0.362
GlyGln: 3.366 ± 0.651
GlyArg: 3.548 ± 0.568
GlySer: 4.457 ± 0.892
GlyThr: 2.729 ± 0.569
GlyVal: 4.275 ± 1.023
GlyTrp: 1.91 ± 0.63
GlyTyr: 3.366 ± 0.593
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.001 ± 0.23
HisCys: 0.091 ± 0.106
HisAsp: 1.092 ± 0.304
HisGlu: 1.183 ± 0.334
HisPhe: 0.91 ± 0.295
HisGly: 1.001 ± 0.282
HisHis: 0.182 ± 0.13
HisIle: 1.092 ± 0.473
HisLys: 1.092 ± 0.276
HisLeu: 1.092 ± 0.324
HisMet: 0.455 ± 0.212
HisAsn: 0.637 ± 0.318
HisPro: 0.91 ± 0.253
HisGln: 0.637 ± 0.231
HisArg: 0.819 ± 0.215
HisSer: 1.274 ± 0.302
HisThr: 0.819 ± 0.249
HisVal: 1.001 ± 0.315
HisTrp: 0.455 ± 0.204
HisTyr: 0.546 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.276 ± 0.919
IleCys: 0.364 ± 0.227
IleAsp: 5.003 ± 0.531
IleGlu: 5.094 ± 0.694
IlePhe: 1.546 ± 0.387
IleGly: 3.821 ± 0.758
IleHis: 0.819 ± 0.268
IleIle: 3.548 ± 0.679
IleLys: 6.277 ± 0.945
IleLeu: 5.549 ± 0.721
IleMet: 1.637 ± 0.331
IleAsn: 4.275 ± 0.661
IlePro: 2.274 ± 0.444
IleGln: 3.639 ± 0.699
IleArg: 2.547 ± 0.481
IleSer: 5.094 ± 0.94
IleThr: 4.003 ± 0.731
IleVal: 4.184 ± 0.502
IleTrp: 0.546 ± 0.275
IleTyr: 2.638 ± 0.382
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.55 ± 0.929
LysCys: 0.182 ± 0.126
LysAsp: 5.003 ± 0.613
LysGlu: 8.187 ± 1.468
LysPhe: 3.002 ± 0.519
LysGly: 5.367 ± 0.69
LysHis: 1.365 ± 0.3
LysIle: 5.367 ± 0.759
LysLys: 7.459 ± 1.221
LysLeu: 6.459 ± 0.799
LysMet: 1.455 ± 0.37
LysAsn: 4.366 ± 0.655
LysPro: 2.729 ± 0.564
LysGln: 3.275 ± 0.537
LysArg: 3.548 ± 0.707
LysSer: 4.821 ± 0.638
LysThr: 4.639 ± 0.704
LysVal: 3.73 ± 0.633
LysTrp: 1.001 ± 0.231
LysTyr: 2.274 ± 0.551
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.55 ± 0.89
LeuCys: 0.182 ± 0.124
LeuAsp: 6.368 ± 0.683
LeuGlu: 7.459 ± 1.149
LeuPhe: 2.638 ± 0.412
LeuGly: 7.095 ± 1.22
LeuHis: 0.91 ± 0.21
LeuIle: 5.185 ± 0.617
LeuLys: 7.459 ± 0.681
LeuLeu: 6.095 ± 0.8
LeuMet: 1.637 ± 0.505
LeuAsn: 4.366 ± 0.578
LeuPro: 2.456 ± 0.577
LeuGln: 3.366 ± 0.526
LeuArg: 4.003 ± 0.549
LeuSer: 6.55 ± 0.682
LeuThr: 4.73 ± 0.815
LeuVal: 4.548 ± 0.765
LeuTrp: 0.455 ± 0.234
LeuTyr: 2.547 ± 0.541
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.456 ± 0.635
MetCys: 0.273 ± 0.154
MetAsp: 2.365 ± 0.415
MetGlu: 1.274 ± 0.347
MetPhe: 0.637 ± 0.184
MetGly: 1.092 ± 0.286
MetHis: 0.637 ± 0.25
MetIle: 1.455 ± 0.387
MetLys: 1.637 ± 0.391
MetLeu: 1.91 ± 0.355
MetMet: 0.546 ± 0.306
MetAsn: 0.819 ± 0.261
MetPro: 0.455 ± 0.194
MetGln: 1.274 ± 0.296
MetArg: 1.274 ± 0.312
MetSer: 1.819 ± 0.456
MetThr: 2.547 ± 0.586
MetVal: 1.728 ± 0.624
MetTrp: 0.455 ± 0.188
MetTyr: 0.455 ± 0.195
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.275 ± 0.548
AsnCys: 0.182 ± 0.113
AsnAsp: 2.547 ± 0.482
AsnGlu: 4.003 ± 0.703
AsnPhe: 2.001 ± 0.396
AsnGly: 4.003 ± 0.745
AsnHis: 1.274 ± 0.349
AsnIle: 3.639 ± 0.75
AsnLys: 3.821 ± 0.569
AsnLeu: 4.821 ± 0.754
AsnMet: 1.455 ± 0.317
AsnAsn: 2.547 ± 0.433
AsnPro: 1.819 ± 0.48
AsnGln: 2.274 ± 0.61
AsnArg: 1.637 ± 0.427
AsnSer: 3.639 ± 0.559
AsnThr: 2.638 ± 0.457
AsnVal: 3.184 ± 0.46
AsnTrp: 0.819 ± 0.264
AsnTyr: 1.183 ± 0.297
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.637 ± 0.41
ProCys: 0.182 ± 0.145
ProAsp: 1.637 ± 0.364
ProGlu: 2.82 ± 0.659
ProPhe: 1.365 ± 0.388
ProGly: 1.637 ± 0.355
ProHis: 0.273 ± 0.175
ProIle: 1.819 ± 0.366
ProLys: 2.092 ± 0.415
ProLeu: 1.819 ± 0.472
ProMet: 0.546 ± 0.257
ProAsn: 1.274 ± 0.266
ProPro: 0.546 ± 0.229
ProGln: 1.274 ± 0.35
ProArg: 0.91 ± 0.291
ProSer: 1.183 ± 0.345
ProThr: 1.637 ± 0.412
ProVal: 2.001 ± 0.359
ProTrp: 0.455 ± 0.166
ProTyr: 1.546 ± 0.389
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.639 ± 0.579
GlnCys: 0.091 ± 0.085
GlnAsp: 2.001 ± 0.469
GlnGlu: 3.184 ± 0.511
GlnPhe: 2.183 ± 0.437
GlnGly: 2.274 ± 0.528
GlnHis: 0.637 ± 0.247
GlnIle: 3.184 ± 0.557
GlnLys: 2.911 ± 0.595
GlnLeu: 3.912 ± 0.515
GlnMet: 1.365 ± 0.349
GlnAsn: 2.001 ± 0.484
GlnPro: 0.637 ± 0.242
GlnGln: 1.365 ± 0.373
GlnArg: 1.637 ± 0.555
GlnSer: 3.73 ± 0.6
GlnThr: 2.092 ± 0.402
GlnVal: 2.456 ± 0.393
GlnTrp: 0.546 ± 0.248
GlnTyr: 1.365 ± 0.297
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.819 ± 0.541
ArgCys: 0.273 ± 0.167
ArgAsp: 2.092 ± 0.399
ArgGlu: 3.548 ± 0.663
ArgPhe: 1.455 ± 0.441
ArgGly: 3.366 ± 0.542
ArgHis: 0.91 ± 0.298
ArgIle: 2.365 ± 0.575
ArgLys: 3.821 ± 0.8
ArgLeu: 3.912 ± 0.472
ArgMet: 1.092 ± 0.349
ArgAsn: 2.001 ± 0.51
ArgPro: 1.183 ± 0.37
ArgGln: 1.183 ± 0.338
ArgArg: 1.91 ± 0.386
ArgSer: 2.547 ± 0.444
ArgThr: 2.456 ± 0.652
ArgVal: 3.002 ± 0.536
ArgTrp: 0.637 ± 0.249
ArgTyr: 1.91 ± 0.668
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.185 ± 1.592
SerCys: 0.273 ± 0.148
SerAsp: 4.275 ± 0.489
SerGlu: 4.639 ± 0.633
SerPhe: 2.729 ± 0.55
SerGly: 5.003 ± 0.767
SerHis: 1.001 ± 0.336
SerIle: 4.73 ± 0.506
SerLys: 5.276 ± 0.656
SerLeu: 5.458 ± 0.944
SerMet: 1.91 ± 0.594
SerAsn: 3.093 ± 0.586
SerPro: 2.092 ± 0.401
SerGln: 3.184 ± 0.38
SerArg: 1.819 ± 0.414
SerSer: 3.639 ± 0.628
SerThr: 4.275 ± 0.592
SerVal: 4.094 ± 0.788
SerTrp: 0.546 ± 0.2
SerTyr: 2.365 ± 0.55
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.003 ± 0.807
ThrCys: 0.0 ± 0.0
ThrAsp: 4.094 ± 0.617
ThrGlu: 4.003 ± 0.8
ThrPhe: 2.092 ± 0.484
ThrGly: 4.094 ± 0.777
ThrHis: 0.91 ± 0.3
ThrIle: 4.457 ± 0.59
ThrLys: 4.457 ± 0.793
ThrLeu: 4.094 ± 0.737
ThrMet: 1.455 ± 0.4
ThrAsn: 2.911 ± 0.548
ThrPro: 2.183 ± 0.516
ThrGln: 2.456 ± 0.404
ThrArg: 2.183 ± 0.565
ThrSer: 3.639 ± 0.723
ThrThr: 4.094 ± 0.64
ThrVal: 2.82 ± 0.656
ThrTrp: 0.637 ± 0.243
ThrTyr: 2.001 ± 0.392
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.73 ± 0.925
ValCys: 0.364 ± 0.174
ValAsp: 3.548 ± 0.54
ValGlu: 4.366 ± 0.585
ValPhe: 2.911 ± 0.465
ValGly: 4.821 ± 1.169
ValHis: 0.637 ± 0.295
ValIle: 3.821 ± 0.574
ValLys: 5.822 ± 0.802
ValLeu: 4.003 ± 0.48
ValMet: 1.274 ± 0.269
ValAsn: 3.093 ± 0.644
ValPro: 1.819 ± 0.345
ValGln: 1.546 ± 0.351
ValArg: 1.91 ± 0.449
ValSer: 5.276 ± 0.793
ValThr: 4.275 ± 0.583
ValVal: 4.184 ± 0.707
ValTrp: 1.001 ± 0.298
ValTyr: 2.638 ± 0.478
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.728 ± 0.248
TrpCys: 0.182 ± 0.117
TrpAsp: 0.637 ± 0.255
TrpGlu: 0.728 ± 0.255
TrpPhe: 0.819 ± 0.29
TrpGly: 0.364 ± 0.159
TrpHis: 0.091 ± 0.096
TrpIle: 0.728 ± 0.261
TrpLys: 1.183 ± 0.325
TrpLeu: 1.183 ± 0.344
TrpMet: 0.091 ± 0.082
TrpAsn: 0.455 ± 0.217
TrpPro: 0.273 ± 0.17
TrpGln: 0.637 ± 0.219
TrpArg: 0.637 ± 0.244
TrpSer: 0.728 ± 0.291
TrpThr: 0.91 ± 0.392
TrpVal: 0.546 ± 0.243
TrpTrp: 0.455 ± 0.261
TrpTyr: 0.637 ± 0.356
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.274 ± 0.445
TyrCys: 0.728 ± 0.36
TyrAsp: 2.365 ± 0.463
TyrGlu: 2.001 ± 0.437
TyrPhe: 2.092 ± 0.46
TyrGly: 2.183 ± 0.407
TyrHis: 1.274 ± 0.374
TyrIle: 1.819 ± 0.348
TyrLys: 3.366 ± 0.582
TyrLeu: 4.184 ± 0.869
TyrMet: 0.546 ± 0.21
TyrAsn: 1.728 ± 0.394
TyrPro: 1.001 ± 0.326
TyrGln: 1.092 ± 0.285
TyrArg: 1.455 ± 0.351
TyrSer: 2.456 ± 0.414
TyrThr: 1.455 ± 0.299
TyrVal: 2.092 ± 0.443
TyrTrp: 0.455 ± 0.203
TyrTyr: 1.637 ± 0.429
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10994 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
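A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The exact procedure used by Proteome-pI is not described here, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the real proteome.
toy_proteins = ["MKKLA", "AKKE", "GGSGG"]
print(round(pair_permille(toy_proteins, "KK"), 3),
      "±", round(bootstrap_error(toy_proteins, "KK"), 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the statistic depends on which proteins happen to be in the set.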

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski