Amino acid dipepetide frequency for Streptococcus phage Javan69

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.567AlaAla: 4.567 ± 1.523
0.496AlaCys: 0.496 ± 0.188
3.872AlaAsp: 3.872 ± 0.573
5.064AlaGlu: 5.064 ± 0.877
3.376AlaPhe: 3.376 ± 0.454
4.071AlaGly: 4.071 ± 0.432
0.496AlaHis: 0.496 ± 0.199
6.553AlaIle: 6.553 ± 0.817
6.156AlaLys: 6.156 ± 0.728
6.255AlaLeu: 6.255 ± 0.862
1.39AlaMet: 1.39 ± 0.359
4.269AlaAsn: 4.269 ± 0.569
1.688AlaPro: 1.688 ± 0.398
3.078AlaGln: 3.078 ± 0.837
2.78AlaArg: 2.78 ± 0.389
5.461AlaSer: 5.461 ± 0.922
4.269AlaThr: 4.269 ± 0.525
3.475AlaVal: 3.475 ± 0.485
0.993AlaTrp: 0.993 ± 0.38
3.376AlaTyr: 3.376 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.298CysAla: 0.298 ± 0.214
0.099CysCys: 0.099 ± 0.104
0.397CysAsp: 0.397 ± 0.232
0.596CysGlu: 0.596 ± 0.198
0.298CysPhe: 0.298 ± 0.16
0.894CysGly: 0.894 ± 0.317
0.099CysHis: 0.099 ± 0.098
0.298CysIle: 0.298 ± 0.157
0.397CysLys: 0.397 ± 0.23
0.695CysLeu: 0.695 ± 0.267
0.099CysMet: 0.099 ± 0.098
0.099CysAsn: 0.099 ± 0.093
0.298CysPro: 0.298 ± 0.173
0.298CysGln: 0.298 ± 0.169
0.298CysArg: 0.298 ± 0.173
0.397CysSer: 0.397 ± 0.189
0.099CysThr: 0.099 ± 0.137
0.695CysVal: 0.695 ± 0.313
0.0CysTrp: 0.0 ± 0.0
0.695CysTyr: 0.695 ± 0.288
0.0CysXaa: 0.0 ± 0.0
Asp
3.872AspAla: 3.872 ± 0.593
0.298AspCys: 0.298 ± 0.172
2.78AspAsp: 2.78 ± 0.738
4.666AspGlu: 4.666 ± 0.759
3.078AspPhe: 3.078 ± 0.538
5.759AspGly: 5.759 ± 0.737
0.794AspHis: 0.794 ± 0.256
4.666AspIle: 4.666 ± 0.546
4.17AspLys: 4.17 ± 0.561
4.865AspLeu: 4.865 ± 0.87
1.986AspMet: 1.986 ± 0.41
2.681AspAsn: 2.681 ± 0.551
1.787AspPro: 1.787 ± 0.53
1.39AspGln: 1.39 ± 0.363
2.383AspArg: 2.383 ± 0.601
3.872AspSer: 3.872 ± 0.775
2.879AspThr: 2.879 ± 0.489
3.376AspVal: 3.376 ± 0.48
0.993AspTrp: 0.993 ± 0.296
2.383AspTyr: 2.383 ± 0.71
0.0AspXaa: 0.0 ± 0.0
Glu
5.064GluAla: 5.064 ± 0.52
0.397GluCys: 0.397 ± 0.24
4.071GluAsp: 4.071 ± 0.631
5.858GluGlu: 5.858 ± 1.018
2.184GluPhe: 2.184 ± 0.407
4.865GluGly: 4.865 ± 0.567
0.993GluHis: 0.993 ± 0.393
4.17GluIle: 4.17 ± 0.583
5.858GluLys: 5.858 ± 1.064
7.943GluLeu: 7.943 ± 1.018
2.681GluMet: 2.681 ± 0.735
4.17GluAsn: 4.17 ± 0.556
1.39GluPro: 1.39 ± 0.523
4.468GluGln: 4.468 ± 0.58
3.475GluArg: 3.475 ± 0.681
3.276GluSer: 3.276 ± 0.445
5.262GluThr: 5.262 ± 0.762
4.269GluVal: 4.269 ± 0.637
0.596GluTrp: 0.596 ± 0.212
1.191GluTyr: 1.191 ± 0.286
0.0GluXaa: 0.0 ± 0.0
Phe
2.681PheAla: 2.681 ± 0.38
0.496PheCys: 0.496 ± 0.205
3.276PheAsp: 3.276 ± 0.593
2.581PheGlu: 2.581 ± 0.586
1.489PhePhe: 1.489 ± 0.475
2.879PheGly: 2.879 ± 0.521
0.496PheHis: 0.496 ± 0.196
1.787PheIle: 1.787 ± 0.446
3.376PheLys: 3.376 ± 0.676
2.979PheLeu: 2.979 ± 0.608
0.794PheMet: 0.794 ± 0.212
2.482PheAsn: 2.482 ± 0.46
0.596PhePro: 0.596 ± 0.348
1.291PheGln: 1.291 ± 0.286
1.589PheArg: 1.589 ± 0.339
2.482PheSer: 2.482 ± 0.447
1.39PheThr: 1.39 ± 0.332
2.184PheVal: 2.184 ± 0.525
0.794PheTrp: 0.794 ± 0.291
2.085PheTyr: 2.085 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
3.475GlyAla: 3.475 ± 0.604
0.099GlyCys: 0.099 ± 0.104
4.567GlyAsp: 4.567 ± 0.615
3.475GlyGlu: 3.475 ± 0.45
2.085GlyPhe: 2.085 ± 0.346
4.369GlyGly: 4.369 ± 0.659
2.184GlyHis: 2.184 ± 0.491
5.759GlyIle: 5.759 ± 0.951
4.865GlyLys: 4.865 ± 0.773
6.255GlyLeu: 6.255 ± 0.835
1.688GlyMet: 1.688 ± 0.472
4.071GlyAsn: 4.071 ± 0.684
0.695GlyPro: 0.695 ± 0.239
3.276GlyGln: 3.276 ± 0.554
3.376GlyArg: 3.376 ± 0.6
4.567GlySer: 4.567 ± 0.74
4.468GlyThr: 4.468 ± 0.533
3.872GlyVal: 3.872 ± 0.617
0.695GlyTrp: 0.695 ± 0.23
3.078GlyTyr: 3.078 ± 0.618
0.0GlyXaa: 0.0 ± 0.0
His
1.092HisAla: 1.092 ± 0.287
0.099HisCys: 0.099 ± 0.099
0.894HisAsp: 0.894 ± 0.261
0.496HisGlu: 0.496 ± 0.162
0.894HisPhe: 0.894 ± 0.287
1.986HisGly: 1.986 ± 0.394
0.695HisHis: 0.695 ± 0.254
1.39HisIle: 1.39 ± 0.311
0.695HisLys: 0.695 ± 0.275
1.986HisLeu: 1.986 ± 0.382
0.397HisMet: 0.397 ± 0.221
0.794HisAsn: 0.794 ± 0.288
1.092HisPro: 1.092 ± 0.419
0.993HisGln: 0.993 ± 0.323
0.596HisArg: 0.596 ± 0.225
0.894HisSer: 0.894 ± 0.315
0.894HisThr: 0.894 ± 0.239
1.191HisVal: 1.191 ± 0.278
0.099HisTrp: 0.099 ± 0.117
0.695HisTyr: 0.695 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
5.064IleAla: 5.064 ± 0.471
0.695IleCys: 0.695 ± 0.245
5.064IleAsp: 5.064 ± 0.617
4.468IleGlu: 4.468 ± 0.61
1.489IlePhe: 1.489 ± 0.538
5.361IleGly: 5.361 ± 0.754
0.695IleHis: 0.695 ± 0.207
3.078IleIle: 3.078 ± 0.493
4.666IleLys: 4.666 ± 0.619
5.759IleLeu: 5.759 ± 0.825
0.496IleMet: 0.496 ± 0.286
2.879IleAsn: 2.879 ± 0.491
2.284IlePro: 2.284 ± 0.248
2.681IleGln: 2.681 ± 0.367
3.773IleArg: 3.773 ± 0.609
5.064IleSer: 5.064 ± 1.072
4.666IleThr: 4.666 ± 0.94
4.964IleVal: 4.964 ± 0.609
1.39IleTrp: 1.39 ± 0.483
1.589IleTyr: 1.589 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
6.056LysAla: 6.056 ± 0.622
0.298LysCys: 0.298 ± 0.211
3.574LysAsp: 3.574 ± 0.635
5.56LysGlu: 5.56 ± 0.597
1.787LysPhe: 1.787 ± 0.336
4.369LysGly: 4.369 ± 0.477
1.589LysHis: 1.589 ± 0.507
5.064LysIle: 5.064 ± 0.594
3.971LysLys: 3.971 ± 0.621
6.454LysLeu: 6.454 ± 0.566
1.986LysMet: 1.986 ± 0.39
3.177LysAsn: 3.177 ± 0.546
2.581LysPro: 2.581 ± 0.472
3.376LysGln: 3.376 ± 0.669
4.071LysArg: 4.071 ± 0.712
4.468LysSer: 4.468 ± 0.834
4.071LysThr: 4.071 ± 0.614
5.163LysVal: 5.163 ± 0.934
1.191LysTrp: 1.191 ± 0.382
1.787LysTyr: 1.787 ± 0.477
0.0LysXaa: 0.0 ± 0.0
Leu
6.95LeuAla: 6.95 ± 0.819
0.596LeuCys: 0.596 ± 0.285
5.262LeuAsp: 5.262 ± 0.694
7.546LeuGlu: 7.546 ± 0.96
2.681LeuPhe: 2.681 ± 0.545
4.766LeuGly: 4.766 ± 0.801
1.688LeuHis: 1.688 ± 0.38
4.964LeuIle: 4.964 ± 0.475
7.248LeuLys: 7.248 ± 0.765
6.156LeuLeu: 6.156 ± 1.068
2.184LeuMet: 2.184 ± 0.467
4.369LeuAsn: 4.369 ± 0.735
3.376LeuPro: 3.376 ± 0.605
3.376LeuGln: 3.376 ± 0.732
3.376LeuArg: 3.376 ± 0.669
7.347LeuSer: 7.347 ± 0.794
6.751LeuThr: 6.751 ± 0.817
7.049LeuVal: 7.049 ± 1.013
0.397LeuTrp: 0.397 ± 0.174
3.971LeuTyr: 3.971 ± 0.722
0.0LeuXaa: 0.0 ± 0.0
Met
1.787MetAla: 1.787 ± 0.404
0.099MetCys: 0.099 ± 0.099
2.085MetAsp: 2.085 ± 0.477
1.39MetGlu: 1.39 ± 0.456
0.794MetPhe: 0.794 ± 0.303
1.589MetGly: 1.589 ± 0.556
0.099MetHis: 0.099 ± 0.1
1.787MetIle: 1.787 ± 0.376
1.787MetLys: 1.787 ± 0.351
1.191MetLeu: 1.191 ± 0.354
0.794MetMet: 0.794 ± 0.295
0.695MetAsn: 0.695 ± 0.247
0.397MetPro: 0.397 ± 0.19
0.894MetGln: 0.894 ± 0.35
1.191MetArg: 1.191 ± 0.294
1.986MetSer: 1.986 ± 0.307
2.681MetThr: 2.681 ± 0.453
1.291MetVal: 1.291 ± 0.413
0.298MetTrp: 0.298 ± 0.185
0.397MetTyr: 0.397 ± 0.215
0.0MetXaa: 0.0 ± 0.0
Asn
4.766AsnAla: 4.766 ± 0.785
0.199AsnCys: 0.199 ± 0.127
2.78AsnAsp: 2.78 ± 0.459
3.574AsnGlu: 3.574 ± 0.596
2.284AsnPhe: 2.284 ± 0.438
4.964AsnGly: 4.964 ± 0.75
0.993AsnHis: 0.993 ± 0.283
2.284AsnIle: 2.284 ± 0.384
2.78AsnLys: 2.78 ± 0.539
4.865AsnLeu: 4.865 ± 0.8
0.993AsnMet: 0.993 ± 0.273
2.581AsnAsn: 2.581 ± 0.758
2.284AsnPro: 2.284 ± 0.439
2.184AsnGln: 2.184 ± 0.429
2.78AsnArg: 2.78 ± 0.726
2.78AsnSer: 2.78 ± 0.442
2.085AsnThr: 2.085 ± 0.604
2.284AsnVal: 2.284 ± 0.482
0.894AsnTrp: 0.894 ± 0.303
0.794AsnTyr: 0.794 ± 0.248
0.0AsnXaa: 0.0 ± 0.0
Pro
1.092ProAla: 1.092 ± 0.345
0.397ProCys: 0.397 ± 0.16
1.787ProAsp: 1.787 ± 0.43
1.886ProGlu: 1.886 ± 0.43
0.993ProPhe: 0.993 ± 0.29
0.993ProGly: 0.993 ± 0.364
0.894ProHis: 0.894 ± 0.25
1.886ProIle: 1.886 ± 0.474
2.482ProLys: 2.482 ± 0.568
3.376ProLeu: 3.376 ± 0.582
0.199ProMet: 0.199 ± 0.142
1.489ProAsn: 1.489 ± 0.427
0.993ProPro: 0.993 ± 0.357
0.695ProGln: 0.695 ± 0.34
1.986ProArg: 1.986 ± 0.376
2.284ProSer: 2.284 ± 0.54
2.581ProThr: 2.581 ± 0.697
2.383ProVal: 2.383 ± 0.571
0.496ProTrp: 0.496 ± 0.217
1.489ProTyr: 1.489 ± 0.377
0.0ProXaa: 0.0 ± 0.0
Gln
4.071GlnAla: 4.071 ± 0.638
0.199GlnCys: 0.199 ± 0.148
1.787GlnAsp: 1.787 ± 0.342
3.276GlnGlu: 3.276 ± 0.604
2.184GlnPhe: 2.184 ± 0.37
1.986GlnGly: 1.986 ± 0.489
0.496GlnHis: 0.496 ± 0.186
2.879GlnIle: 2.879 ± 0.475
2.581GlnLys: 2.581 ± 0.533
4.468GlnLeu: 4.468 ± 0.552
1.191GlnMet: 1.191 ± 0.348
1.688GlnAsn: 1.688 ± 0.427
1.688GlnPro: 1.688 ± 0.564
2.085GlnGln: 2.085 ± 0.466
1.688GlnArg: 1.688 ± 0.374
2.581GlnSer: 2.581 ± 0.603
3.475GlnThr: 3.475 ± 0.89
3.674GlnVal: 3.674 ± 0.502
0.794GlnTrp: 0.794 ± 0.349
0.596GlnTyr: 0.596 ± 0.287
0.0GlnXaa: 0.0 ± 0.0
Arg
2.581ArgAla: 2.581 ± 0.552
0.596ArgCys: 0.596 ± 0.274
2.383ArgAsp: 2.383 ± 0.524
3.276ArgGlu: 3.276 ± 0.459
1.787ArgPhe: 1.787 ± 0.441
2.681ArgGly: 2.681 ± 0.479
0.894ArgHis: 0.894 ± 0.358
2.78ArgIle: 2.78 ± 0.56
3.276ArgLys: 3.276 ± 0.681
4.666ArgLeu: 4.666 ± 0.513
0.993ArgMet: 0.993 ± 0.269
2.482ArgAsn: 2.482 ± 0.494
1.39ArgPro: 1.39 ± 0.448
2.879ArgGln: 2.879 ± 0.459
1.589ArgArg: 1.589 ± 0.452
2.184ArgSer: 2.184 ± 0.412
3.376ArgThr: 3.376 ± 0.746
3.376ArgVal: 3.376 ± 0.612
0.993ArgTrp: 0.993 ± 0.369
1.489ArgTyr: 1.489 ± 0.425
0.0ArgXaa: 0.0 ± 0.0
Ser
4.865SerAla: 4.865 ± 0.987
0.596SerCys: 0.596 ± 0.224
4.567SerAsp: 4.567 ± 0.691
4.468SerGlu: 4.468 ± 0.675
2.581SerPhe: 2.581 ± 0.743
3.971SerGly: 3.971 ± 0.57
1.291SerHis: 1.291 ± 0.38
5.064SerIle: 5.064 ± 0.714
4.071SerLys: 4.071 ± 0.542
5.659SerLeu: 5.659 ± 0.704
1.489SerMet: 1.489 ± 0.339
3.475SerAsn: 3.475 ± 0.826
2.383SerPro: 2.383 ± 0.426
3.177SerGln: 3.177 ± 0.614
2.581SerArg: 2.581 ± 0.486
5.262SerSer: 5.262 ± 1.153
4.964SerThr: 4.964 ± 0.777
4.17SerVal: 4.17 ± 0.642
1.39SerTrp: 1.39 ± 0.339
1.986SerTyr: 1.986 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
5.957ThrAla: 5.957 ± 0.585
0.199ThrCys: 0.199 ± 0.158
2.78ThrAsp: 2.78 ± 0.647
4.865ThrGlu: 4.865 ± 0.838
3.078ThrPhe: 3.078 ± 0.763
4.766ThrGly: 4.766 ± 0.538
0.993ThrHis: 0.993 ± 0.328
4.468ThrIle: 4.468 ± 0.916
4.567ThrLys: 4.567 ± 0.468
5.957ThrLeu: 5.957 ± 0.701
0.993ThrMet: 0.993 ± 0.345
2.78ThrAsn: 2.78 ± 0.516
2.085ThrPro: 2.085 ± 0.579
2.284ThrGln: 2.284 ± 0.622
1.986ThrArg: 1.986 ± 0.442
5.163ThrSer: 5.163 ± 0.868
5.957ThrThr: 5.957 ± 0.823
5.659ThrVal: 5.659 ± 0.733
0.993ThrTrp: 0.993 ± 0.314
2.581ThrTyr: 2.581 ± 0.546
0.0ThrXaa: 0.0 ± 0.0
Val
3.971ValAla: 3.971 ± 0.677
0.596ValCys: 0.596 ± 0.258
3.475ValAsp: 3.475 ± 0.851
5.56ValGlu: 5.56 ± 0.814
2.085ValPhe: 2.085 ± 0.493
3.574ValGly: 3.574 ± 0.677
0.993ValHis: 0.993 ± 0.23
4.468ValIle: 4.468 ± 0.603
5.163ValLys: 5.163 ± 0.783
6.354ValLeu: 6.354 ± 0.741
1.589ValMet: 1.589 ± 0.434
2.085ValAsn: 2.085 ± 0.38
2.085ValPro: 2.085 ± 0.433
2.482ValGln: 2.482 ± 0.501
3.674ValArg: 3.674 ± 0.66
4.468ValSer: 4.468 ± 0.795
5.262ValThr: 5.262 ± 0.615
3.574ValVal: 3.574 ± 0.609
0.993ValTrp: 0.993 ± 0.297
2.482ValTyr: 2.482 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
0.993TrpAla: 0.993 ± 0.329
0.199TrpCys: 0.199 ± 0.149
0.596TrpAsp: 0.596 ± 0.244
1.092TrpGlu: 1.092 ± 0.361
1.092TrpPhe: 1.092 ± 0.401
0.794TrpGly: 0.794 ± 0.237
0.397TrpHis: 0.397 ± 0.242
0.695TrpIle: 0.695 ± 0.327
0.695TrpLys: 0.695 ± 0.334
1.191TrpLeu: 1.191 ± 0.377
0.397TrpMet: 0.397 ± 0.172
1.39TrpAsn: 1.39 ± 0.325
0.199TrpPro: 0.199 ± 0.136
0.695TrpGln: 0.695 ± 0.303
0.695TrpArg: 0.695 ± 0.323
1.291TrpSer: 1.291 ± 0.416
0.894TrpThr: 0.894 ± 0.386
0.894TrpVal: 0.894 ± 0.317
0.199TrpTrp: 0.199 ± 0.115
0.199TrpTyr: 0.199 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.482TyrAla: 2.482 ± 0.42
0.397TyrCys: 0.397 ± 0.25
2.681TyrAsp: 2.681 ± 0.549
2.482TyrGlu: 2.482 ± 0.523
1.489TyrPhe: 1.489 ± 0.376
2.284TyrGly: 2.284 ± 0.497
1.092TyrHis: 1.092 ± 0.306
2.085TyrIle: 2.085 ± 0.429
1.787TyrLys: 1.787 ± 0.502
2.979TyrLeu: 2.979 ± 0.588
0.794TyrMet: 0.794 ± 0.302
1.489TyrAsn: 1.489 ± 0.362
1.191TyrPro: 1.191 ± 0.342
1.787TyrGln: 1.787 ± 0.406
1.787TyrArg: 1.787 ± 0.373
2.184TyrSer: 2.184 ± 0.478
2.085TyrThr: 2.085 ± 0.475
1.39TyrVal: 1.39 ± 0.404
0.397TyrTrp: 0.397 ± 0.174
0.794TyrTyr: 0.794 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (10073 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski