Amino acid dipeptide frequency for Streptococcus phage IPP5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
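The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the toy sequences, the use of one-letter residue codes, and the choice to normalise by the total number of overlapping pairs are all assumptions.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 400 equally likely dipeptides: 0.25%, i.e. 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

toy_proteins = ["MKKLA", "MKA"]  # hypothetical toy sequences, not from the phage proteome
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), "expected (uniform):", UNIFORM_EXPECTATION)
```

In the toy input there are six overlapping pairs in total, two of which are MK, so MK accounts for one third of all pairs.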

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
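As a small illustration of the units, the sketch below converts per mille values to fractions and compares them with the uniform expectation of 2.5‰ (0.25%). The helper names and the simple greater-than/less-than rule are assumptions; the exact criterion used for the red and blue marking is not stated on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(value_permille, expected_permille=2.5):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"

# Three values taken from the table: GluLeu, AlaAla and CysCys.
for name, value in [("GluLeu", 10.294), ("AlaAla", 2.353), ("CysCys", 0.0)]:
    print(name, permille_to_fraction(value), classify(value))
```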

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Format: FirstSecond: frequency ± error (‰)
Ala
AlaAla: 2.353 ± 0.782
AlaCys: 0.294 ± 0.133
AlaAsp: 4.804 ± 0.818
AlaGlu: 6.176 ± 0.724
AlaPhe: 2.353 ± 0.506
AlaGly: 3.627 ± 1.011
AlaHis: 0.49 ± 0.24
AlaIle: 5.392 ± 1.292
AlaLys: 5.392 ± 0.73
AlaLeu: 4.706 ± 1.05
AlaMet: 1.863 ± 0.462
AlaAsn: 3.039 ± 0.831
AlaPro: 2.157 ± 0.453
AlaGln: 2.941 ± 0.672
AlaArg: 3.039 ± 0.506
AlaSer: 3.824 ± 0.702
AlaThr: 4.902 ± 0.858
AlaVal: 4.804 ± 0.848
AlaTrp: 1.176 ± 0.358
AlaTyr: 2.451 ± 0.496
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.196 ± 0.093
CysCys: 0.0 ± 0.0
CysAsp: 0.196 ± 0.146
CysGlu: 0.784 ± 0.235
CysPhe: 0.294 ± 0.145
CysGly: 0.294 ± 0.137
CysHis: 0.294 ± 0.161
CysIle: 0.196 ± 0.149
CysLys: 0.588 ± 0.273
CysLeu: 0.294 ± 0.179
CysMet: 0.0 ± 0.0
CysAsn: 0.098 ± 0.107
CysPro: 0.196 ± 0.209
CysGln: 0.294 ± 0.158
CysArg: 0.196 ± 0.124
CysSer: 0.196 ± 0.188
CysThr: 0.392 ± 0.209
CysVal: 0.49 ± 0.256
CysTrp: 0.098 ± 0.076
CysTyr: 0.49 ± 0.193
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.333 ± 0.703
AspCys: 0.196 ± 0.13
AspAsp: 3.627 ± 0.754
AspGlu: 3.431 ± 0.921
AspPhe: 3.039 ± 0.581
AspGly: 4.216 ± 0.731
AspHis: 0.98 ± 0.329
AspIle: 4.216 ± 0.836
AspLys: 6.275 ± 1.164
AspLeu: 4.902 ± 0.514
AspMet: 1.471 ± 0.385
AspAsn: 3.039 ± 0.567
AspPro: 2.059 ± 0.55
AspGln: 1.275 ± 0.282
AspArg: 2.157 ± 0.449
AspSer: 3.627 ± 0.419
AspThr: 3.333 ± 0.517
AspVal: 4.314 ± 0.557
AspTrp: 1.176 ± 0.35
AspTyr: 3.627 ± 0.679
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.098 ± 1.314
GluCys: 0.686 ± 0.252
GluAsp: 3.627 ± 0.589
GluGlu: 7.549 ± 1.369
GluPhe: 2.843 ± 0.539
GluGly: 3.039 ± 0.517
GluHis: 1.176 ± 0.381
GluIle: 7.059 ± 0.78
GluLys: 7.745 ± 1.372
GluLeu: 10.294 ± 1.124
GluMet: 1.569 ± 0.396
GluAsn: 5.098 ± 0.893
GluPro: 1.863 ± 0.544
GluGln: 2.647 ± 0.726
GluArg: 3.725 ± 0.854
GluSer: 3.922 ± 0.603
GluThr: 5.0 ± 0.555
GluVal: 5.098 ± 0.846
GluTrp: 0.882 ± 0.314
GluTyr: 1.667 ± 0.404
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.353 ± 0.595
PheCys: 0.196 ± 0.143
PheAsp: 4.216 ± 0.717
PheGlu: 3.137 ± 0.485
PhePhe: 1.078 ± 0.399
PheGly: 1.961 ± 0.541
PheHis: 0.392 ± 0.213
PheIle: 2.157 ± 0.529
PheLys: 3.431 ± 0.586
PheLeu: 1.863 ± 0.525
PheMet: 1.373 ± 0.457
PheAsn: 2.745 ± 0.643
PhePro: 0.784 ± 0.333
PheGln: 1.471 ± 0.341
PheArg: 1.373 ± 0.391
PheSer: 2.745 ± 0.504
PheThr: 3.235 ± 0.469
PheVal: 1.275 ± 0.415
PheTrp: 0.49 ± 0.217
PheTyr: 1.569 ± 0.378
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.627 ± 0.92
GlyCys: 0.196 ± 0.124
GlyAsp: 2.745 ± 0.51
GlyGlu: 3.922 ± 0.458
GlyPhe: 2.745 ± 0.536
GlyGly: 3.333 ± 0.709
GlyHis: 0.588 ± 0.213
GlyIle: 4.902 ± 0.588
GlyLys: 5.0 ± 0.664
GlyLeu: 4.51 ± 0.821
GlyMet: 1.176 ± 0.341
GlyAsn: 3.529 ± 0.661
GlyPro: 0.686 ± 0.291
GlyGln: 2.843 ± 0.703
GlyArg: 2.941 ± 0.668
GlySer: 3.333 ± 0.786
GlyThr: 2.941 ± 0.568
GlyVal: 3.235 ± 0.404
GlyTrp: 1.569 ± 0.632
GlyTyr: 2.451 ± 0.438
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.784 ± 0.224
HisCys: 0.0 ± 0.0
HisAsp: 0.784 ± 0.3
HisGlu: 1.471 ± 0.356
HisPhe: 0.686 ± 0.317
HisGly: 0.686 ± 0.293
HisHis: 0.0 ± 0.0
HisIle: 1.667 ± 0.313
HisLys: 0.98 ± 0.355
HisLeu: 1.275 ± 0.422
HisMet: 0.196 ± 0.121
HisAsn: 0.98 ± 0.332
HisPro: 0.196 ± 0.113
HisGln: 0.196 ± 0.171
HisArg: 0.392 ± 0.21
HisSer: 1.275 ± 0.515
HisThr: 0.98 ± 0.411
HisVal: 0.784 ± 0.288
HisTrp: 0.196 ± 0.16
HisTyr: 0.882 ± 0.327
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.196 ± 0.855
IleCys: 0.686 ± 0.304
IleAsp: 4.608 ± 0.702
IleGlu: 6.275 ± 0.696
IlePhe: 2.255 ± 0.549
IleGly: 3.333 ± 0.686
IleHis: 0.686 ± 0.287
IleIle: 3.431 ± 0.743
IleLys: 5.686 ± 0.77
IleLeu: 5.392 ± 0.923
IleMet: 1.471 ± 0.377
IleAsn: 4.902 ± 0.772
IlePro: 1.863 ± 0.454
IleGln: 3.431 ± 0.491
IleArg: 3.137 ± 0.615
IleSer: 5.882 ± 1.218
IleThr: 4.216 ± 0.549
IleVal: 2.941 ± 0.526
IleTrp: 0.98 ± 0.26
IleTyr: 1.471 ± 0.576
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.98 ± 0.919
LysCys: 0.196 ± 0.13
LysAsp: 4.804 ± 0.687
LysGlu: 7.941 ± 1.139
LysPhe: 2.157 ± 0.367
LysGly: 3.922 ± 0.787
LysHis: 1.667 ± 0.334
LysIle: 6.176 ± 1.012
LysLys: 6.176 ± 1.072
LysLeu: 7.451 ± 1.172
LysMet: 3.235 ± 0.488
LysAsn: 4.216 ± 0.627
LysPro: 2.353 ± 0.511
LysGln: 4.412 ± 0.496
LysArg: 3.529 ± 0.588
LysSer: 5.392 ± 0.676
LysThr: 5.686 ± 0.78
LysVal: 4.706 ± 0.583
LysTrp: 0.784 ± 0.251
LysTyr: 4.412 ± 0.715
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.157 ± 0.801
LeuCys: 0.196 ± 0.174
LeuAsp: 7.157 ± 1.036
LeuGlu: 6.765 ± 1.051
LeuPhe: 3.137 ± 0.5
LeuGly: 5.686 ± 0.671
LeuHis: 0.686 ± 0.261
LeuIle: 5.294 ± 0.649
LeuLys: 7.157 ± 1.054
LeuLeu: 6.176 ± 0.877
LeuMet: 2.059 ± 0.676
LeuAsn: 5.098 ± 0.758
LeuPro: 2.549 ± 0.657
LeuGln: 2.941 ± 0.694
LeuArg: 4.608 ± 0.828
LeuSer: 6.176 ± 0.808
LeuThr: 5.882 ± 0.969
LeuVal: 3.627 ± 0.754
LeuTrp: 0.784 ± 0.344
LeuTyr: 3.333 ± 0.549
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.373 ± 0.33
MetCys: 0.294 ± 0.168
MetAsp: 0.882 ± 0.262
MetGlu: 2.549 ± 0.711
MetPhe: 0.098 ± 0.116
MetGly: 1.176 ± 0.304
MetHis: 0.0 ± 0.0
MetIle: 1.471 ± 0.429
MetLys: 2.941 ± 0.64
MetLeu: 1.569 ± 0.322
MetMet: 0.49 ± 0.242
MetAsn: 2.353 ± 0.641
MetPro: 0.784 ± 0.31
MetGln: 0.882 ± 0.302
MetArg: 0.98 ± 0.382
MetSer: 0.98 ± 0.4
MetThr: 1.961 ± 0.487
MetVal: 1.569 ± 0.336
MetTrp: 0.196 ± 0.139
MetTyr: 0.588 ± 0.233
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.725 ± 0.703
AsnCys: 0.294 ± 0.16
AsnAsp: 3.039 ± 0.616
AsnGlu: 4.51 ± 0.542
AsnPhe: 2.647 ± 0.512
AsnGly: 3.922 ± 0.802
AsnHis: 1.471 ± 0.492
AsnIle: 2.941 ± 0.615
AsnLys: 3.529 ± 0.632
AsnLeu: 5.588 ± 0.874
AsnMet: 1.176 ± 0.389
AsnAsn: 3.137 ± 0.51
AsnPro: 2.745 ± 0.494
AsnGln: 3.137 ± 0.76
AsnArg: 3.627 ± 0.475
AsnSer: 3.529 ± 0.812
AsnThr: 3.235 ± 0.605
AsnVal: 4.02 ± 0.473
AsnTrp: 0.98 ± 0.266
AsnTyr: 1.667 ± 0.466
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.471 ± 0.364
ProCys: 0.392 ± 0.235
ProAsp: 1.961 ± 0.36
ProGlu: 1.765 ± 0.46
ProPhe: 0.882 ± 0.327
ProGly: 0.784 ± 0.257
ProHis: 0.49 ± 0.193
ProIle: 1.373 ± 0.565
ProLys: 3.039 ± 0.474
ProLeu: 2.549 ± 0.569
ProMet: 0.392 ± 0.206
ProAsn: 1.176 ± 0.318
ProPro: 1.176 ± 0.702
ProGln: 1.961 ± 0.544
ProArg: 1.471 ± 0.473
ProSer: 1.667 ± 0.481
ProThr: 1.275 ± 0.348
ProVal: 1.961 ± 0.388
ProTrp: 0.196 ± 0.135
ProTyr: 1.275 ± 0.463
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.039 ± 0.573
GlnCys: 0.196 ± 0.143
GlnAsp: 1.765 ± 0.367
GlnGlu: 4.118 ± 0.697
GlnPhe: 1.471 ± 0.35
GlnGly: 2.255 ± 0.348
GlnHis: 0.196 ± 0.147
GlnIle: 3.039 ± 0.507
GlnLys: 4.118 ± 0.544
GlnLeu: 4.51 ± 0.732
GlnMet: 0.686 ± 0.222
GlnAsn: 2.353 ± 0.541
GlnPro: 1.275 ± 0.484
GlnGln: 1.569 ± 0.494
GlnArg: 2.451 ± 0.56
GlnSer: 3.039 ± 0.628
GlnThr: 2.059 ± 0.462
GlnVal: 3.039 ± 0.577
GlnTrp: 0.196 ± 0.129
GlnTyr: 0.686 ± 0.28
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.431 ± 0.469
ArgCys: 0.49 ± 0.179
ArgAsp: 1.765 ± 0.428
ArgGlu: 3.137 ± 0.744
ArgPhe: 2.059 ± 0.368
ArgGly: 1.667 ± 0.474
ArgHis: 0.98 ± 0.401
ArgIle: 3.039 ± 0.737
ArgLys: 3.725 ± 0.684
ArgLeu: 5.392 ± 0.732
ArgMet: 1.078 ± 0.379
ArgAsn: 2.843 ± 0.671
ArgPro: 1.373 ± 0.44
ArgGln: 2.843 ± 0.435
ArgArg: 2.647 ± 0.708
ArgSer: 2.647 ± 0.41
ArgThr: 3.824 ± 1.028
ArgVal: 2.745 ± 0.473
ArgTrp: 0.49 ± 0.193
ArgTyr: 2.157 ± 0.539
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.0 ± 0.885
SerCys: 0.196 ± 0.152
SerAsp: 3.824 ± 0.586
SerGlu: 3.725 ± 0.719
SerPhe: 2.647 ± 0.38
SerGly: 4.804 ± 0.934
SerHis: 0.686 ± 0.231
SerIle: 3.529 ± 0.565
SerLys: 4.412 ± 0.699
SerLeu: 5.098 ± 0.666
SerMet: 1.373 ± 0.53
SerAsn: 4.608 ± 0.715
SerPro: 1.275 ± 0.358
SerGln: 2.255 ± 0.46
SerArg: 3.529 ± 0.72
SerSer: 4.706 ± 0.81
SerThr: 5.294 ± 0.83
SerVal: 3.922 ± 1.177
SerTrp: 0.882 ± 0.314
SerTyr: 1.863 ± 0.408
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.902 ± 1.1
ThrCys: 0.196 ± 0.134
ThrAsp: 3.824 ± 0.519
ThrGlu: 3.725 ± 0.535
ThrPhe: 2.745 ± 0.834
ThrGly: 4.706 ± 0.946
ThrHis: 1.373 ± 0.426
ThrIle: 5.196 ± 0.676
ThrLys: 5.49 ± 0.461
ThrLeu: 5.98 ± 0.687
ThrMet: 0.588 ± 0.214
ThrAsn: 3.725 ± 0.422
ThrPro: 1.275 ± 0.487
ThrGln: 2.451 ± 0.63
ThrArg: 2.549 ± 0.553
ThrSer: 4.02 ± 0.731
ThrThr: 5.49 ± 0.992
ThrVal: 4.902 ± 0.733
ThrTrp: 0.882 ± 0.378
ThrTyr: 2.255 ± 0.44
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.725 ± 0.456
ValCys: 0.392 ± 0.189
ValAsp: 4.216 ± 0.582
ValGlu: 6.275 ± 0.608
ValPhe: 2.059 ± 0.722
ValGly: 4.02 ± 0.761
ValHis: 0.98 ± 0.309
ValIle: 3.137 ± 0.51
ValLys: 5.98 ± 0.861
ValLeu: 3.431 ± 0.642
ValMet: 1.275 ± 0.307
ValAsn: 3.333 ± 0.587
ValPro: 1.471 ± 0.345
ValGln: 2.059 ± 0.423
ValArg: 3.039 ± 0.604
ValSer: 3.431 ± 0.755
ValThr: 4.02 ± 0.794
ValVal: 3.824 ± 0.637
ValTrp: 0.784 ± 0.212
ValTyr: 2.059 ± 0.645
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.784 ± 0.312
TrpCys: 0.098 ± 0.09
TrpAsp: 0.196 ± 0.138
TrpGlu: 0.686 ± 0.227
TrpPhe: 0.588 ± 0.269
TrpGly: 1.078 ± 0.235
TrpHis: 0.294 ± 0.224
TrpIle: 1.275 ± 0.36
TrpLys: 0.686 ± 0.277
TrpLeu: 1.275 ± 0.409
TrpMet: 0.588 ± 0.257
TrpAsn: 1.275 ± 0.406
TrpPro: 0.098 ± 0.083
TrpGln: 0.49 ± 0.199
TrpArg: 0.588 ± 0.227
TrpSer: 0.882 ± 0.464
TrpThr: 0.98 ± 0.256
TrpVal: 0.588 ± 0.232
TrpTrp: 0.098 ± 0.076
TrpTyr: 0.784 ± 0.706
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.353 ± 0.505
TyrCys: 0.392 ± 0.166
TyrAsp: 2.255 ± 0.528
TyrGlu: 2.647 ± 0.519
TyrPhe: 2.059 ± 0.761
TyrGly: 1.765 ± 0.431
TyrHis: 0.98 ± 0.275
TyrIle: 2.353 ± 0.51
TyrLys: 2.843 ± 0.73
TyrLeu: 4.216 ± 0.694
TyrMet: 0.98 ± 0.373
TyrAsn: 1.373 ± 0.502
TyrPro: 1.078 ± 0.363
TyrGln: 1.961 ± 0.324
TyrArg: 2.353 ± 0.575
TyrSer: 2.353 ± 0.398
TyrThr: 1.569 ± 0.429
TyrVal: 1.667 ± 0.439
TyrTrp: 0.49 ± 0.222
TyrTyr: 1.569 ± 0.789
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (10201 amino acids)
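For further analysis, the entries listed above can be read into a dictionary. The sketch below is one possible way to do this; the names `ENTRY` and `parse_entries` are hypothetical, and the pattern only assumes that each entry contains a six-letter dipeptide name followed by "value ± error". It uses `search` rather than `match`, so any leading characters on a line are tolerated.

```python
import re

# Two three-letter residue codes, a colon, then "mean ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_entries(text):
    """Map (first residue, second residue) to (frequency, error), both in per mille."""
    table = {}
    for line in text.splitlines():
        m = ENTRY.search(line)
        if m:  # row labels such as "Ala" do not match and are skipped
            first, second, mean, err = m.groups()
            table[(first, second)] = (float(mean), float(err))
    return table

sample = """Ala
AlaAla: 2.353 ± 0.782
AlaCys: 0.294 ± 0.133
Glu
GluLeu: 10.294 ± 1.124"""

table = parse_entries(sample)
print(table[("Glu", "Leu")])
```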

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
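A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are assumptions; the database's exact procedure is not described here.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev) over replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)

toy_proteins = ["MKKLA", "MKA", "MALKK", "MEELK"]  # hypothetical toy sequences
mean, err = bootstrap_error(toy_proteins, "MK")
print(f"MK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.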

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski