Amino acid dipepetide frequency for Streptococcus satellite phage Javan72

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.137AlaAla: 1.137 ± 0.533
0.0AlaCys: 0.0 ± 0.0
2.274AlaAsp: 2.274 ± 1.046
5.117AlaGlu: 5.117 ± 1.397
3.411AlaPhe: 3.411 ± 1.114
3.695AlaGly: 3.695 ± 1.135
1.137AlaHis: 1.137 ± 0.516
7.391AlaIle: 7.391 ± 1.458
5.117AlaLys: 5.117 ± 1.106
5.969AlaLeu: 5.969 ± 1.52
2.274AlaMet: 2.274 ± 0.679
3.127AlaAsn: 3.127 ± 0.845
1.421AlaPro: 1.421 ± 0.661
2.274AlaGln: 2.274 ± 0.678
2.558AlaArg: 2.558 ± 0.943
2.843AlaSer: 2.843 ± 0.926
3.411AlaThr: 3.411 ± 0.826
2.843AlaVal: 2.843 ± 1.213
0.284AlaTrp: 0.284 ± 0.343
2.843AlaTyr: 2.843 ± 0.97
0.0AlaXaa: 0.0 ± 0.0
Cys
0.853CysAla: 0.853 ± 0.461
0.0CysCys: 0.0 ± 0.0
0.284CysAsp: 0.284 ± 0.335
0.284CysGlu: 0.284 ± 0.244
0.284CysPhe: 0.284 ± 0.254
0.569CysGly: 0.569 ± 0.435
0.284CysHis: 0.284 ± 0.335
0.569CysIle: 0.569 ± 0.41
1.421CysLys: 1.421 ± 0.475
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.569CysPro: 0.569 ± 0.35
0.284CysGln: 0.284 ± 0.311
0.284CysArg: 0.284 ± 0.274
1.421CysSer: 1.421 ± 0.581
0.284CysThr: 0.284 ± 0.343
0.284CysVal: 0.284 ± 0.226
0.0CysTrp: 0.0 ± 0.0
0.284CysTyr: 0.284 ± 0.335
0.0CysXaa: 0.0 ± 0.0
Asp
1.99AspAla: 1.99 ± 0.588
0.853AspCys: 0.853 ± 0.484
1.421AspAsp: 1.421 ± 0.615
2.843AspGlu: 2.843 ± 0.915
3.411AspPhe: 3.411 ± 1.059
1.137AspGly: 1.137 ± 0.602
0.284AspHis: 0.284 ± 0.3
5.969AspIle: 5.969 ± 1.303
5.685AspLys: 5.685 ± 1.337
5.685AspLeu: 5.685 ± 1.234
1.706AspMet: 1.706 ± 0.774
5.401AspAsn: 5.401 ± 1.563
0.853AspPro: 0.853 ± 0.584
1.137AspGln: 1.137 ± 0.542
2.843AspArg: 2.843 ± 0.936
3.695AspSer: 3.695 ± 0.935
1.706AspThr: 1.706 ± 0.756
1.706AspVal: 1.706 ± 0.874
0.853AspTrp: 0.853 ± 0.541
2.558AspTyr: 2.558 ± 1.14
0.0AspXaa: 0.0 ± 0.0
Glu
5.685GluAla: 5.685 ± 1.02
0.284GluCys: 0.284 ± 0.226
2.843GluAsp: 2.843 ± 0.845
6.254GluGlu: 6.254 ± 2.021
3.98GluPhe: 3.98 ± 1.31
1.99GluGly: 1.99 ± 0.837
0.853GluHis: 0.853 ± 0.426
5.401GluIle: 5.401 ± 1.524
10.517GluLys: 10.517 ± 1.118
7.391GluLeu: 7.391 ± 1.305
3.127GluMet: 3.127 ± 1.052
4.832GluAsn: 4.832 ± 1.516
0.569GluPro: 0.569 ± 0.391
5.401GluGln: 5.401 ± 1.497
3.411GluArg: 3.411 ± 1.833
3.127GluSer: 3.127 ± 0.984
3.695GluThr: 3.695 ± 0.917
2.843GluVal: 2.843 ± 1.126
0.853GluTrp: 0.853 ± 0.515
2.558GluTyr: 2.558 ± 0.893
0.0GluXaa: 0.0 ± 0.0
Phe
1.706PheAla: 1.706 ± 0.459
0.853PheCys: 0.853 ± 0.421
3.411PheAsp: 3.411 ± 0.962
2.274PheGlu: 2.274 ± 0.858
2.843PhePhe: 2.843 ± 0.573
2.558PheGly: 2.558 ± 0.59
0.569PheHis: 0.569 ± 0.418
3.98PheIle: 3.98 ± 0.9
3.98PheLys: 3.98 ± 0.708
4.832PheLeu: 4.832 ± 1.634
0.569PheMet: 0.569 ± 0.503
1.421PheAsn: 1.421 ± 0.805
2.274PhePro: 2.274 ± 0.812
1.421PheGln: 1.421 ± 0.588
2.558PheArg: 2.558 ± 0.913
2.843PheSer: 2.843 ± 0.93
2.558PheThr: 2.558 ± 0.729
1.706PheVal: 1.706 ± 0.642
0.853PheTrp: 0.853 ± 0.572
2.274PheTyr: 2.274 ± 0.732
0.0PheXaa: 0.0 ± 0.0
Gly
2.843GlyAla: 2.843 ± 0.712
0.284GlyCys: 0.284 ± 0.309
1.421GlyAsp: 1.421 ± 0.631
2.558GlyGlu: 2.558 ± 0.779
0.853GlyPhe: 0.853 ± 0.52
2.843GlyGly: 2.843 ± 1.185
0.284GlyHis: 0.284 ± 0.293
4.832GlyIle: 4.832 ± 1.105
3.695GlyLys: 3.695 ± 0.981
7.675GlyLeu: 7.675 ± 1.837
2.274GlyMet: 2.274 ± 0.978
2.558GlyAsn: 2.558 ± 0.802
0.0GlyPro: 0.0 ± 0.0
1.137GlyGln: 1.137 ± 0.737
2.558GlyArg: 2.558 ± 0.935
3.127GlySer: 3.127 ± 0.868
3.127GlyThr: 3.127 ± 1.061
1.99GlyVal: 1.99 ± 0.659
0.853GlyTrp: 0.853 ± 0.516
2.558GlyTyr: 2.558 ± 1.088
0.0GlyXaa: 0.0 ± 0.0
His
1.137HisAla: 1.137 ± 0.884
0.0HisCys: 0.0 ± 0.0
0.853HisAsp: 0.853 ± 0.578
0.284HisGlu: 0.284 ± 0.254
0.284HisPhe: 0.284 ± 0.307
1.706HisGly: 1.706 ± 0.794
1.137HisHis: 1.137 ± 0.936
1.421HisIle: 1.421 ± 0.539
1.137HisLys: 1.137 ± 0.589
2.843HisLeu: 2.843 ± 0.704
0.0HisMet: 0.0 ± 0.0
0.853HisAsn: 0.853 ± 0.478
0.569HisPro: 0.569 ± 0.389
0.284HisGln: 0.284 ± 0.254
0.0HisArg: 0.0 ± 0.0
0.853HisSer: 0.853 ± 0.521
0.853HisThr: 0.853 ± 0.371
0.569HisVal: 0.569 ± 0.275
0.0HisTrp: 0.0 ± 0.0
0.853HisTyr: 0.853 ± 0.449
0.0HisXaa: 0.0 ± 0.0
Ile
4.548IleAla: 4.548 ± 1.011
1.706IleCys: 1.706 ± 0.653
7.391IleAsp: 7.391 ± 1.401
5.685IleGlu: 5.685 ± 1.446
3.98IlePhe: 3.98 ± 0.974
1.137IleGly: 1.137 ± 0.688
1.99IleHis: 1.99 ± 0.855
5.401IleIle: 5.401 ± 0.794
5.401IleLys: 5.401 ± 1.237
9.949IleLeu: 9.949 ± 2.231
1.421IleMet: 1.421 ± 0.665
3.695IleAsn: 3.695 ± 1.339
3.127IlePro: 3.127 ± 0.969
3.98IleGln: 3.98 ± 1.054
6.254IleArg: 6.254 ± 1.064
4.548IleSer: 4.548 ± 0.993
3.98IleThr: 3.98 ± 1.1
1.421IleVal: 1.421 ± 0.709
1.99IleTrp: 1.99 ± 0.687
2.274IleTyr: 2.274 ± 0.646
0.0IleXaa: 0.0 ± 0.0
Lys
8.243LysAla: 8.243 ± 2.015
0.569LysCys: 0.569 ± 0.374
3.695LysAsp: 3.695 ± 1.132
9.665LysGlu: 9.665 ± 1.345
3.695LysPhe: 3.695 ± 0.821
3.127LysGly: 3.127 ± 0.826
1.137LysHis: 1.137 ± 0.734
5.401LysIle: 5.401 ± 1.14
8.243LysLys: 8.243 ± 1.922
9.949LysLeu: 9.949 ± 1.672
1.99LysMet: 1.99 ± 0.692
7.106LysAsn: 7.106 ± 1.041
2.558LysPro: 2.558 ± 0.722
4.264LysGln: 4.264 ± 1.185
3.98LysArg: 3.98 ± 1.175
3.695LysSer: 3.695 ± 0.934
6.538LysThr: 6.538 ± 1.367
4.264LysVal: 4.264 ± 1.13
0.284LysTrp: 0.284 ± 0.343
3.127LysTyr: 3.127 ± 1.316
0.0LysXaa: 0.0 ± 0.0
Leu
6.254LeuAla: 6.254 ± 1.777
0.853LeuCys: 0.853 ± 0.454
8.812LeuAsp: 8.812 ± 1.393
11.37LeuGlu: 11.37 ± 2.109
3.98LeuPhe: 3.98 ± 1.194
7.391LeuGly: 7.391 ± 1.876
1.137LeuHis: 1.137 ± 0.806
5.685LeuIle: 5.685 ± 1.476
11.939LeuLys: 11.939 ± 1.721
13.076LeuLeu: 13.076 ± 1.67
3.411LeuMet: 3.411 ± 0.999
5.401LeuAsn: 5.401 ± 1.065
3.411LeuPro: 3.411 ± 1.183
4.264LeuGln: 4.264 ± 0.865
3.127LeuArg: 3.127 ± 1.325
6.254LeuSer: 6.254 ± 1.42
5.117LeuThr: 5.117 ± 1.192
3.695LeuVal: 3.695 ± 0.997
0.853LeuTrp: 0.853 ± 0.684
4.832LeuTyr: 4.832 ± 0.95
0.0LeuXaa: 0.0 ± 0.0
Met
2.843MetAla: 2.843 ± 0.981
0.0MetCys: 0.0 ± 0.0
0.853MetAsp: 0.853 ± 0.652
2.558MetGlu: 2.558 ± 1.057
0.284MetPhe: 0.284 ± 0.3
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.706MetIle: 1.706 ± 0.662
2.558MetLys: 2.558 ± 0.966
2.558MetLeu: 2.558 ± 0.85
0.0MetMet: 0.0 ± 0.0
1.99MetAsn: 1.99 ± 1.01
0.853MetPro: 0.853 ± 0.508
0.284MetGln: 0.284 ± 0.309
1.137MetArg: 1.137 ± 0.662
0.853MetSer: 0.853 ± 0.672
2.274MetThr: 2.274 ± 0.724
2.558MetVal: 2.558 ± 0.777
0.284MetTrp: 0.284 ± 0.335
0.853MetTyr: 0.853 ± 0.537
0.0MetXaa: 0.0 ± 0.0
Asn
3.127AsnAla: 3.127 ± 0.814
0.0AsnCys: 0.0 ± 0.0
1.99AsnAsp: 1.99 ± 0.85
4.264AsnGlu: 4.264 ± 1.327
2.558AsnPhe: 2.558 ± 0.737
3.98AsnGly: 3.98 ± 0.814
1.421AsnHis: 1.421 ± 0.648
4.548AsnIle: 4.548 ± 0.959
4.548AsnLys: 4.548 ± 0.882
4.832AsnLeu: 4.832 ± 1.741
1.137AsnMet: 1.137 ± 0.636
6.538AsnAsn: 6.538 ± 1.302
1.706AsnPro: 1.706 ± 0.544
3.411AsnGln: 3.411 ± 0.661
4.548AsnArg: 4.548 ± 1.275
3.98AsnSer: 3.98 ± 1.188
3.695AsnThr: 3.695 ± 1.108
1.99AsnVal: 1.99 ± 0.84
1.706AsnTrp: 1.706 ± 0.751
2.843AsnTyr: 2.843 ± 0.764
0.0AsnXaa: 0.0 ± 0.0
Pro
1.99ProAla: 1.99 ± 0.634
0.0ProCys: 0.0 ± 0.0
1.706ProAsp: 1.706 ± 0.578
1.706ProGlu: 1.706 ± 0.769
1.421ProPhe: 1.421 ± 0.615
0.284ProGly: 0.284 ± 0.284
0.284ProHis: 0.284 ± 0.244
2.274ProIle: 2.274 ± 0.715
2.843ProLys: 2.843 ± 1.065
2.274ProLeu: 2.274 ± 0.965
0.0ProMet: 0.0 ± 0.0
0.853ProAsn: 0.853 ± 0.435
0.284ProPro: 0.284 ± 0.31
1.421ProGln: 1.421 ± 0.577
2.558ProArg: 2.558 ± 0.781
1.137ProSer: 1.137 ± 0.54
2.274ProThr: 2.274 ± 0.743
0.853ProVal: 0.853 ± 0.431
0.0ProTrp: 0.0 ± 0.0
0.569ProTyr: 0.569 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
4.264GlnAla: 4.264 ± 1.421
0.853GlnCys: 0.853 ± 0.472
2.558GlnAsp: 2.558 ± 0.989
3.695GlnGlu: 3.695 ± 0.889
1.137GlnPhe: 1.137 ± 0.493
2.558GlnGly: 2.558 ± 0.74
0.284GlnHis: 0.284 ± 0.335
3.127GlnIle: 3.127 ± 1.201
3.695GlnLys: 3.695 ± 0.669
6.822GlnLeu: 6.822 ± 1.875
0.569GlnMet: 0.569 ± 0.392
2.274GlnAsn: 2.274 ± 0.705
1.421GlnPro: 1.421 ± 0.839
3.411GlnGln: 3.411 ± 1.479
2.274GlnArg: 2.274 ± 0.65
1.99GlnSer: 1.99 ± 0.682
2.558GlnThr: 2.558 ± 0.68
2.843GlnVal: 2.843 ± 1.105
0.0GlnTrp: 0.0 ± 0.0
1.137GlnTyr: 1.137 ± 0.588
0.0GlnXaa: 0.0 ± 0.0
Arg
1.99ArgAla: 1.99 ± 0.925
0.569ArgCys: 0.569 ± 0.352
3.411ArgAsp: 3.411 ± 0.892
4.264ArgGlu: 4.264 ± 1.152
2.558ArgPhe: 2.558 ± 0.685
3.411ArgGly: 3.411 ± 1.326
0.569ArgHis: 0.569 ± 0.35
3.695ArgIle: 3.695 ± 1.136
2.843ArgLys: 2.843 ± 0.846
3.411ArgLeu: 3.411 ± 0.806
0.569ArgMet: 0.569 ± 0.444
3.127ArgAsn: 3.127 ± 0.853
1.706ArgPro: 1.706 ± 1.12
4.548ArgGln: 4.548 ± 1.376
2.274ArgArg: 2.274 ± 0.862
2.843ArgSer: 2.843 ± 0.894
1.137ArgThr: 1.137 ± 0.86
1.421ArgVal: 1.421 ± 0.59
0.569ArgTrp: 0.569 ± 0.395
4.264ArgTyr: 4.264 ± 0.907
0.0ArgXaa: 0.0 ± 0.0
Ser
2.843SerAla: 2.843 ± 0.871
0.284SerCys: 0.284 ± 0.284
4.548SerAsp: 4.548 ± 1.104
4.548SerGlu: 4.548 ± 1.202
1.99SerPhe: 1.99 ± 0.798
2.558SerGly: 2.558 ± 0.902
1.137SerHis: 1.137 ± 0.506
4.264SerIle: 4.264 ± 0.754
4.548SerLys: 4.548 ± 1.141
4.832SerLeu: 4.832 ± 0.968
1.137SerMet: 1.137 ± 0.579
2.274SerAsn: 2.274 ± 0.718
0.284SerPro: 0.284 ± 0.254
2.274SerGln: 2.274 ± 0.857
2.843SerArg: 2.843 ± 1.01
1.421SerSer: 1.421 ± 0.752
4.264SerThr: 4.264 ± 1.242
4.264SerVal: 4.264 ± 1.075
0.0SerTrp: 0.0 ± 0.0
2.558SerTyr: 2.558 ± 1.061
0.0SerXaa: 0.0 ± 0.0
Thr
3.127ThrAla: 3.127 ± 0.956
0.0ThrCys: 0.0 ± 0.0
1.99ThrAsp: 1.99 ± 0.836
2.274ThrGlu: 2.274 ± 0.643
3.127ThrPhe: 3.127 ± 1.236
4.548ThrGly: 4.548 ± 1.095
1.137ThrHis: 1.137 ± 0.599
5.117ThrIle: 5.117 ± 1.193
4.832ThrLys: 4.832 ± 1.063
7.391ThrLeu: 7.391 ± 1.096
1.99ThrMet: 1.99 ± 0.736
3.411ThrAsn: 3.411 ± 0.837
0.853ThrPro: 0.853 ± 0.444
3.411ThrGln: 3.411 ± 1.126
1.99ThrArg: 1.99 ± 0.845
1.706ThrSer: 1.706 ± 0.572
1.706ThrThr: 1.706 ± 0.868
3.98ThrVal: 3.98 ± 0.988
0.284ThrTrp: 0.284 ± 0.244
2.274ThrTyr: 2.274 ± 0.688
0.0ThrXaa: 0.0 ± 0.0
Val
1.706ValAla: 1.706 ± 0.902
0.569ValCys: 0.569 ± 0.359
0.853ValAsp: 0.853 ± 0.481
3.127ValGlu: 3.127 ± 1.21
1.706ValPhe: 1.706 ± 0.957
1.706ValGly: 1.706 ± 0.739
0.284ValHis: 0.284 ± 0.254
4.548ValIle: 4.548 ± 1.264
4.548ValLys: 4.548 ± 1.598
3.98ValLeu: 3.98 ± 0.808
1.137ValMet: 1.137 ± 0.511
4.264ValAsn: 4.264 ± 1.195
0.853ValPro: 0.853 ± 0.54
1.421ValGln: 1.421 ± 0.525
1.421ValArg: 1.421 ± 0.562
2.274ValSer: 2.274 ± 0.761
3.127ValThr: 3.127 ± 1.1
1.99ValVal: 1.99 ± 0.875
0.284ValTrp: 0.284 ± 0.3
3.411ValTyr: 3.411 ± 0.798
0.0ValXaa: 0.0 ± 0.0
Trp
1.706TrpAla: 1.706 ± 0.701
0.0TrpCys: 0.0 ± 0.0
0.569TrpAsp: 0.569 ± 0.352
1.137TrpGlu: 1.137 ± 0.599
0.284TrpPhe: 0.284 ± 0.335
0.569TrpGly: 0.569 ± 0.389
0.569TrpHis: 0.569 ± 0.386
0.284TrpIle: 0.284 ± 0.311
0.284TrpLys: 0.284 ± 0.3
1.706TrpLeu: 1.706 ± 0.743
0.284TrpMet: 0.284 ± 0.335
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.853TrpGln: 0.853 ± 0.376
0.0TrpArg: 0.0 ± 0.0
0.284TrpSer: 0.284 ± 0.335
0.569TrpThr: 0.569 ± 0.359
0.853TrpVal: 0.853 ± 0.5
0.0TrpTrp: 0.0 ± 0.0
0.569TrpTyr: 0.569 ± 0.372
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.137TyrAla: 1.137 ± 0.566
0.284TyrCys: 0.284 ± 0.335
0.853TyrAsp: 0.853 ± 0.508
1.706TyrGlu: 1.706 ± 0.709
3.98TyrPhe: 3.98 ± 1.325
1.706TyrGly: 1.706 ± 0.501
1.137TyrHis: 1.137 ± 0.505
4.264TyrIle: 4.264 ± 1.169
3.695TyrLys: 3.695 ± 1.135
5.685TyrLeu: 5.685 ± 1.462
0.569TyrMet: 0.569 ± 0.427
3.695TyrAsn: 3.695 ± 0.874
1.421TyrPro: 1.421 ± 0.493
1.99TyrGln: 1.99 ± 0.73
2.558TyrArg: 2.558 ± 0.817
3.98TyrSer: 3.98 ± 0.847
2.274TyrThr: 2.274 ± 0.981
1.137TyrVal: 1.137 ± 0.563
0.569TyrTrp: 0.569 ± 0.384
1.706TyrTyr: 1.706 ± 0.645
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3519 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski