Amino acid dipeptide frequency for Streptococcus phage IPP26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). Because this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
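As a rough illustration of the calculation described above, the sketch below counts overlapping dipeptides in a set of protein sequences and compares a frequency with the uniform expectation of 1/400 (2.5‰). The function names, the toy sequences, the choice to count pairs only within a protein, and the mapping of non-standard letters to Xaa are assumptions made for illustration; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted within each protein only (never across
    protein boundaries) and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, n_letters=20):
    """Label a frequency relative to the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 400 dipeptides
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
toy = ["MKKLLA", "MKKAE"]
freqs = dipeptide_per_mille(toy)
```

With the values reported in the table, `classify(9.386)` (LysLys) returns `"over"` and `classify(0.569)` (AlaCys) returns `"under"`. Passing `n_letters=21` switches the expectation to the 441-dipeptide case that includes Xaa.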

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
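The unit conversion can be sketched as follows. The helper names are hypothetical; the only value taken from this page is the LysLys frequency from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


lys_lys = 9.386  # LysLys frequency from the table, in per mille
print(per_mille_to_fraction(lys_lys))  # about 0.0094 as a fraction
print(per_mille_to_percent(lys_lys))   # about 0.94 percent
```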

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.413 ± 0.745
AlaCys: 0.569 ± 0.241
AlaAsp: 5.119 ± 0.627
AlaGlu: 5.973 ± 0.545
AlaPhe: 1.706 ± 0.43
AlaGly: 4.077 ± 0.665
AlaHis: 0.853 ± 0.266
AlaIle: 5.119 ± 0.619
AlaLys: 5.593 ± 0.665
AlaLeu: 5.025 ± 0.765
AlaMet: 1.517 ± 0.371
AlaAsn: 3.887 ± 0.622
AlaPro: 1.232 ± 0.337
AlaGln: 2.275 ± 0.439
AlaArg: 2.37 ± 0.513
AlaSer: 3.697 ± 0.591
AlaThr: 4.266 ± 0.563
AlaVal: 5.404 ± 0.903
AlaTrp: 0.664 ± 0.286
AlaTyr: 1.706 ± 0.33
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.095 ± 0.089
CysCys: 0.0 ± 0.0
CysAsp: 0.569 ± 0.219
CysGlu: 0.379 ± 0.171
CysPhe: 0.19 ± 0.175
CysGly: 0.095 ± 0.106
CysHis: 0.095 ± 0.095
CysIle: 0.0 ± 0.0
CysLys: 0.758 ± 0.283
CysLeu: 0.664 ± 0.236
CysMet: 0.0 ± 0.0
CysAsn: 0.284 ± 0.163
CysPro: 0.095 ± 0.083
CysGln: 0.19 ± 0.132
CysArg: 0.474 ± 0.232
CysSer: 0.379 ± 0.181
CysThr: 0.19 ± 0.139
CysVal: 0.19 ± 0.137
CysTrp: 0.095 ± 0.082
CysTyr: 0.284 ± 0.241
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.697 ± 0.678
AspCys: 0.379 ± 0.197
AspAsp: 4.456 ± 0.799
AspGlu: 4.835 ± 0.647
AspPhe: 3.413 ± 0.567
AspGly: 4.835 ± 0.766
AspHis: 0.853 ± 0.294
AspIle: 5.214 ± 0.935
AspLys: 6.257 ± 0.712
AspLeu: 5.973 ± 0.901
AspMet: 1.801 ± 0.332
AspAsn: 3.603 ± 0.632
AspPro: 2.275 ± 0.406
AspGln: 1.422 ± 0.32
AspArg: 2.56 ± 0.547
AspSer: 4.077 ± 0.521
AspThr: 3.129 ± 0.585
AspVal: 3.034 ± 0.59
AspTrp: 0.758 ± 0.279
AspTyr: 3.034 ± 0.528
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.119 ± 0.736
GluCys: 0.0 ± 0.0
GluAsp: 3.887 ± 0.604
GluGlu: 5.119 ± 0.987
GluPhe: 3.318 ± 0.489
GluGly: 4.171 ± 0.714
GluHis: 1.327 ± 0.371
GluIle: 5.878 ± 0.785
GluLys: 7.11 ± 1.033
GluLeu: 8.153 ± 0.976
GluMet: 1.801 ± 0.439
GluAsn: 3.508 ± 0.503
GluPro: 1.043 ± 0.323
GluGln: 2.939 ± 0.434
GluArg: 3.887 ± 0.751
GluSer: 4.645 ± 0.617
GluThr: 5.593 ± 0.696
GluVal: 5.878 ± 1.033
GluTrp: 1.232 ± 0.285
GluTyr: 3.413 ± 0.51
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.181 ± 0.475
PheCys: 0.284 ± 0.161
PheAsp: 3.318 ± 0.737
PheGlu: 3.887 ± 0.625
PhePhe: 2.749 ± 0.511
PheGly: 2.655 ± 0.671
PheHis: 0.284 ± 0.166
PheIle: 1.517 ± 0.51
PheLys: 3.129 ± 0.406
PheLeu: 3.223 ± 0.539
PheMet: 1.043 ± 0.241
PheAsn: 2.655 ± 0.48
PhePro: 0.569 ± 0.249
PheGln: 1.706 ± 0.394
PheArg: 1.612 ± 0.392
PheSer: 2.56 ± 0.512
PheThr: 1.991 ± 0.358
PheVal: 2.655 ± 0.417
PheTrp: 0.569 ± 0.253
PheTyr: 1.612 ± 0.45
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.413 ± 0.841
GlyCys: 0.284 ± 0.171
GlyAsp: 3.413 ± 0.685
GlyGlu: 3.603 ± 0.552
GlyPhe: 3.129 ± 0.524
GlyGly: 4.077 ± 0.826
GlyHis: 1.043 ± 0.249
GlyIle: 4.171 ± 0.86
GlyLys: 5.878 ± 0.796
GlyLeu: 4.74 ± 0.831
GlyMet: 1.706 ± 0.405
GlyAsn: 3.413 ± 0.609
GlyPro: 0.569 ± 0.228
GlyGln: 2.56 ± 0.443
GlyArg: 2.465 ± 0.435
GlySer: 2.465 ± 0.638
GlyThr: 2.844 ± 0.567
GlyVal: 3.413 ± 0.49
GlyTrp: 1.138 ± 0.525
GlyTyr: 3.223 ± 0.41
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.664 ± 0.228
HisCys: 0.0 ± 0.0
HisAsp: 1.043 ± 0.299
HisGlu: 1.232 ± 0.327
HisPhe: 0.664 ± 0.213
HisGly: 0.853 ± 0.31
HisHis: 0.19 ± 0.147
HisIle: 1.612 ± 0.441
HisLys: 0.948 ± 0.401
HisLeu: 1.138 ± 0.415
HisMet: 0.379 ± 0.194
HisAsn: 0.569 ± 0.212
HisPro: 0.664 ± 0.292
HisGln: 0.853 ± 0.244
HisArg: 0.284 ± 0.157
HisSer: 1.043 ± 0.472
HisThr: 1.327 ± 0.278
HisVal: 0.664 ± 0.266
HisTrp: 0.19 ± 0.121
HisTyr: 0.379 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.593 ± 0.506
IleCys: 0.379 ± 0.194
IleAsp: 5.404 ± 0.667
IleGlu: 7.11 ± 0.773
IlePhe: 1.896 ± 0.388
IleGly: 3.603 ± 0.55
IleHis: 1.043 ± 0.318
IleIle: 4.93 ± 0.813
IleLys: 7.774 ± 0.855
IleLeu: 5.214 ± 0.635
IleMet: 1.706 ± 0.397
IleAsn: 4.077 ± 0.725
IlePro: 2.275 ± 0.41
IleGln: 2.086 ± 0.408
IleArg: 2.465 ± 0.503
IleSer: 5.404 ± 0.83
IleThr: 4.171 ± 0.698
IleVal: 3.792 ± 0.545
IleTrp: 1.043 ± 0.411
IleTyr: 2.56 ± 0.561
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.542 ± 0.825
LysCys: 0.379 ± 0.145
LysAsp: 5.499 ± 0.793
LysGlu: 7.584 ± 0.852
LysPhe: 2.844 ± 0.555
LysGly: 4.266 ± 0.588
LysHis: 1.801 ± 0.408
LysIle: 7.205 ± 0.807
LysLys: 9.386 ± 1.177
LysLeu: 8.627 ± 0.955
LysMet: 2.749 ± 0.541
LysAsn: 3.603 ± 0.468
LysPro: 2.37 ± 0.476
LysGln: 3.697 ± 0.585
LysArg: 3.697 ± 0.565
LysSer: 5.878 ± 0.656
LysThr: 6.352 ± 0.913
LysVal: 5.404 ± 0.687
LysTrp: 1.232 ± 0.347
LysTyr: 3.318 ± 0.62
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.205 ± 0.652
LeuCys: 0.664 ± 0.358
LeuAsp: 7.964 ± 0.979
LeuGlu: 7.205 ± 0.843
LeuPhe: 3.129 ± 0.689
LeuGly: 4.077 ± 0.613
LeuHis: 0.758 ± 0.318
LeuIle: 5.783 ± 0.652
LeuLys: 8.438 ± 1.02
LeuLeu: 7.016 ± 0.721
LeuMet: 2.181 ± 0.449
LeuAsn: 3.887 ± 0.628
LeuPro: 3.223 ± 0.555
LeuGln: 3.223 ± 0.509
LeuArg: 3.223 ± 0.535
LeuSer: 5.973 ± 0.814
LeuThr: 5.688 ± 0.813
LeuVal: 5.119 ± 0.613
LeuTrp: 0.664 ± 0.241
LeuTyr: 2.181 ± 0.402
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.706 ± 0.413
MetCys: 0.095 ± 0.095
MetAsp: 1.706 ± 0.491
MetGlu: 1.991 ± 0.414
MetPhe: 0.569 ± 0.175
MetGly: 0.853 ± 0.308
MetHis: 0.095 ± 0.109
MetIle: 1.896 ± 0.495
MetLys: 1.706 ± 0.375
MetLeu: 2.275 ± 0.475
MetMet: 0.664 ± 0.238
MetAsn: 1.801 ± 0.414
MetPro: 0.853 ± 0.319
MetGln: 1.517 ± 0.42
MetArg: 1.706 ± 0.432
MetSer: 1.801 ± 0.393
MetThr: 2.086 ± 0.387
MetVal: 1.801 ± 0.35
MetTrp: 0.284 ± 0.168
MetTyr: 0.853 ± 0.257
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.171 ± 0.633
AsnCys: 0.474 ± 0.22
AsnAsp: 3.034 ± 0.506
AsnGlu: 4.74 ± 0.635
AsnPhe: 2.844 ± 0.489
AsnGly: 4.645 ± 0.593
AsnHis: 1.232 ± 0.319
AsnIle: 3.508 ± 0.506
AsnLys: 4.077 ± 0.492
AsnLeu: 5.404 ± 0.588
AsnMet: 1.043 ± 0.254
AsnAsn: 3.603 ± 0.65
AsnPro: 1.801 ± 0.473
AsnGln: 1.991 ± 0.447
AsnArg: 2.465 ± 0.57
AsnSer: 3.508 ± 0.49
AsnThr: 2.939 ± 0.54
AsnVal: 2.655 ± 0.449
AsnTrp: 0.853 ± 0.295
AsnTyr: 2.086 ± 0.333
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.612 ± 0.367
ProCys: 0.095 ± 0.108
ProAsp: 1.801 ± 0.476
ProGlu: 2.181 ± 0.474
ProPhe: 1.138 ± 0.379
ProGly: 1.327 ± 0.332
ProHis: 0.474 ± 0.189
ProIle: 2.749 ± 0.393
ProLys: 2.37 ± 0.451
ProLeu: 1.327 ± 0.33
ProMet: 0.284 ± 0.145
ProAsn: 1.517 ± 0.396
ProPro: 0.474 ± 0.274
ProGln: 0.664 ± 0.235
ProArg: 0.948 ± 0.293
ProSer: 2.086 ± 0.46
ProThr: 0.948 ± 0.293
ProVal: 1.706 ± 0.432
ProTrp: 0.0 ± 0.0
ProTyr: 1.517 ± 0.416
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.034 ± 0.415
GlnCys: 0.095 ± 0.113
GlnAsp: 1.327 ± 0.319
GlnGlu: 2.275 ± 0.557
GlnPhe: 1.612 ± 0.453
GlnGly: 1.422 ± 0.289
GlnHis: 0.474 ± 0.234
GlnIle: 2.56 ± 0.412
GlnLys: 4.266 ± 0.562
GlnLeu: 4.077 ± 0.61
GlnMet: 0.758 ± 0.235
GlnAsn: 2.275 ± 0.492
GlnPro: 1.232 ± 0.303
GlnGln: 1.232 ± 0.386
GlnArg: 1.801 ± 0.368
GlnSer: 2.275 ± 0.552
GlnThr: 2.465 ± 0.465
GlnVal: 3.129 ± 0.5
GlnTrp: 0.284 ± 0.156
GlnTyr: 1.517 ± 0.348
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.612 ± 0.47
ArgCys: 0.474 ± 0.214
ArgAsp: 1.801 ± 0.373
ArgGlu: 2.939 ± 0.588
ArgPhe: 1.232 ± 0.367
ArgGly: 2.275 ± 0.418
ArgHis: 0.664 ± 0.262
ArgIle: 2.465 ± 0.538
ArgLys: 4.171 ± 0.734
ArgLeu: 5.025 ± 0.958
ArgMet: 2.181 ± 0.522
ArgAsn: 3.034 ± 0.475
ArgPro: 0.853 ± 0.249
ArgGln: 1.801 ± 0.45
ArgArg: 2.086 ± 0.552
ArgSer: 2.086 ± 0.425
ArgThr: 3.223 ± 0.62
ArgVal: 1.706 ± 0.356
ArgTrp: 0.569 ± 0.276
ArgTyr: 1.138 ± 0.347
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.318 ± 0.652
SerCys: 0.19 ± 0.137
SerAsp: 3.887 ± 0.503
SerGlu: 4.74 ± 0.564
SerPhe: 2.275 ± 0.459
SerGly: 3.223 ± 0.554
SerHis: 0.948 ± 0.338
SerIle: 4.266 ± 0.647
SerLys: 5.593 ± 0.882
SerLeu: 6.542 ± 0.717
SerMet: 1.991 ± 0.437
SerAsn: 3.508 ± 0.675
SerPro: 1.138 ± 0.31
SerGln: 3.034 ± 0.605
SerArg: 2.086 ± 0.393
SerSer: 4.266 ± 0.66
SerThr: 3.603 ± 0.489
SerVal: 3.508 ± 0.597
SerTrp: 0.758 ± 0.237
SerTyr: 3.129 ± 0.469
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.456 ± 0.69
ThrCys: 0.19 ± 0.126
ThrAsp: 4.361 ± 0.493
ThrGlu: 4.74 ± 0.731
ThrPhe: 2.086 ± 0.385
ThrGly: 4.835 ± 1.061
ThrHis: 1.043 ± 0.35
ThrIle: 5.499 ± 0.706
ThrLys: 4.835 ± 0.636
ThrLeu: 4.74 ± 0.717
ThrMet: 1.706 ± 0.349
ThrAsn: 3.982 ± 0.55
ThrPro: 1.706 ± 0.464
ThrGln: 2.37 ± 0.496
ThrArg: 1.327 ± 0.316
ThrSer: 2.844 ± 0.47
ThrThr: 3.603 ± 0.637
ThrVal: 4.456 ± 0.617
ThrTrp: 0.474 ± 0.239
ThrTyr: 1.896 ± 0.452
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.171 ± 0.688
ValCys: 0.095 ± 0.092
ValAsp: 3.887 ± 0.489
ValGlu: 4.74 ± 0.761
ValPhe: 2.465 ± 0.329
ValGly: 4.266 ± 0.7
ValHis: 0.853 ± 0.242
ValIle: 5.119 ± 0.902
ValLys: 5.119 ± 0.65
ValLeu: 3.982 ± 0.571
ValMet: 1.232 ± 0.323
ValAsn: 4.645 ± 0.618
ValPro: 1.138 ± 0.286
ValGln: 1.896 ± 0.383
ValArg: 2.655 ± 0.374
ValSer: 4.266 ± 0.646
ValThr: 4.551 ± 0.572
ValVal: 4.077 ± 0.709
ValTrp: 0.284 ± 0.17
ValTyr: 1.612 ± 0.419
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.569 ± 0.218
TrpCys: 0.0 ± 0.0
TrpAsp: 0.569 ± 0.259
TrpGlu: 0.948 ± 0.279
TrpPhe: 0.758 ± 0.227
TrpGly: 0.284 ± 0.152
TrpHis: 0.095 ± 0.082
TrpIle: 0.284 ± 0.169
TrpLys: 1.138 ± 0.356
TrpLeu: 0.758 ± 0.296
TrpMet: 0.379 ± 0.18
TrpAsn: 0.853 ± 0.355
TrpPro: 0.19 ± 0.113
TrpGln: 0.758 ± 0.305
TrpArg: 0.569 ± 0.205
TrpSer: 0.853 ± 0.315
TrpThr: 0.853 ± 0.245
TrpVal: 0.569 ± 0.221
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.948 ± 0.629
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.896 ± 0.369
TyrCys: 0.379 ± 0.179
TyrAsp: 2.56 ± 0.468
TyrGlu: 1.801 ± 0.395
TyrPhe: 2.181 ± 0.416
TyrGly: 1.706 ± 0.512
TyrHis: 0.569 ± 0.218
TyrIle: 2.844 ± 0.591
TyrLys: 3.887 ± 0.606
TyrLeu: 3.697 ± 0.493
TyrMet: 1.043 ± 0.366
TyrAsn: 2.37 ± 0.493
TyrPro: 1.517 ± 0.381
TyrGln: 1.896 ± 0.393
TyrArg: 2.655 ± 0.633
TyrSer: 1.896 ± 0.569
TyrThr: 1.327 ± 0.331
TyrVal: 1.896 ± 0.482
TyrTrp: 0.284 ± 0.157
TyrTyr: 1.896 ± 0.557
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10549 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
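One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the sample standard deviation, and the fixed seed are assumptions for illustration; the page does not state the exact procedure used by Proteome-pI.

```python
import random
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))  # with replacement
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


# Toy input in one-letter code, not real phage proteins.
toy = ["MKKLLA", "MAEELK", "KKKKAE", "MLLSTV"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects protein-to-protein variation.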

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski