Amino acid dipepetide frequency for Streptococcus phage Javan47

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.388AlaAla: 4.388 ± 1.551
0.163AlaCys: 0.163 ± 0.114
5.445AlaAsp: 5.445 ± 0.771
5.445AlaGlu: 5.445 ± 0.977
1.95AlaPhe: 1.95 ± 0.757
3.657AlaGly: 3.657 ± 0.865
0.731AlaHis: 0.731 ± 0.339
7.233AlaIle: 7.233 ± 1.096
6.258AlaLys: 6.258 ± 0.793
5.607AlaLeu: 5.607 ± 0.946
2.275AlaMet: 2.275 ± 0.927
6.176AlaAsn: 6.176 ± 1.235
1.869AlaPro: 1.869 ± 0.484
3.982AlaGln: 3.982 ± 0.752
3.007AlaArg: 3.007 ± 0.416
4.145AlaSer: 4.145 ± 1.012
4.714AlaThr: 4.714 ± 1.329
5.039AlaVal: 5.039 ± 0.668
0.894AlaTrp: 0.894 ± 0.238
2.275AlaTyr: 2.275 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
0.244CysAla: 0.244 ± 0.149
0.081CysCys: 0.081 ± 0.088
0.406CysAsp: 0.406 ± 0.237
0.488CysGlu: 0.488 ± 0.22
0.163CysPhe: 0.163 ± 0.117
0.325CysGly: 0.325 ± 0.2
0.244CysHis: 0.244 ± 0.137
0.244CysIle: 0.244 ± 0.15
0.325CysLys: 0.325 ± 0.168
0.081CysLeu: 0.081 ± 0.08
0.244CysMet: 0.244 ± 0.147
0.244CysAsn: 0.244 ± 0.123
0.163CysPro: 0.163 ± 0.108
0.163CysGln: 0.163 ± 0.11
0.569CysArg: 0.569 ± 0.244
0.406CysSer: 0.406 ± 0.168
0.0CysThr: 0.0 ± 0.0
0.406CysVal: 0.406 ± 0.165
0.0CysTrp: 0.0 ± 0.0
0.163CysTyr: 0.163 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
3.901AspAla: 3.901 ± 0.624
0.569AspCys: 0.569 ± 0.253
3.82AspAsp: 3.82 ± 0.789
5.201AspGlu: 5.201 ± 0.866
3.088AspPhe: 3.088 ± 0.46
3.982AspGly: 3.982 ± 0.959
0.65AspHis: 0.65 ± 0.34
4.063AspIle: 4.063 ± 0.554
4.795AspLys: 4.795 ± 0.747
5.689AspLeu: 5.689 ± 1.013
1.138AspMet: 1.138 ± 0.345
5.364AspAsn: 5.364 ± 0.885
1.544AspPro: 1.544 ± 0.378
1.544AspGln: 1.544 ± 0.313
2.763AspArg: 2.763 ± 0.592
3.495AspSer: 3.495 ± 0.598
3.088AspThr: 3.088 ± 0.642
2.601AspVal: 2.601 ± 0.569
0.813AspTrp: 0.813 ± 0.212
3.982AspTyr: 3.982 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
4.876GluAla: 4.876 ± 0.661
0.325GluCys: 0.325 ± 0.163
3.413GluAsp: 3.413 ± 0.643
5.039GluGlu: 5.039 ± 0.995
2.357GluPhe: 2.357 ± 0.418
3.576GluGly: 3.576 ± 0.53
0.813GluHis: 0.813 ± 0.244
4.957GluIle: 4.957 ± 0.784
6.501GluLys: 6.501 ± 1.169
8.208GluLeu: 8.208 ± 1.183
1.788GluMet: 1.788 ± 0.563
3.251GluAsn: 3.251 ± 0.564
1.625GluPro: 1.625 ± 0.371
3.657GluGln: 3.657 ± 0.742
4.307GluArg: 4.307 ± 0.913
3.495GluSer: 3.495 ± 0.578
3.332GluThr: 3.332 ± 0.533
4.145GluVal: 4.145 ± 0.568
0.975GluTrp: 0.975 ± 0.277
3.251GluTyr: 3.251 ± 0.555
0.0GluXaa: 0.0 ± 0.0
Phe
3.738PheAla: 3.738 ± 0.769
0.0PheCys: 0.0 ± 0.0
2.682PheAsp: 2.682 ± 0.494
2.357PheGlu: 2.357 ± 0.463
1.3PhePhe: 1.3 ± 0.445
2.844PheGly: 2.844 ± 0.485
0.325PheHis: 0.325 ± 0.176
2.275PheIle: 2.275 ± 0.417
3.332PheLys: 3.332 ± 0.569
2.113PheLeu: 2.113 ± 0.421
0.894PheMet: 0.894 ± 0.27
3.088PheAsn: 3.088 ± 0.523
0.894PhePro: 0.894 ± 0.332
0.894PheGln: 0.894 ± 0.245
1.219PheArg: 1.219 ± 0.265
2.438PheSer: 2.438 ± 0.437
1.869PheThr: 1.869 ± 0.385
2.275PheVal: 2.275 ± 0.521
0.325PheTrp: 0.325 ± 0.225
1.219PheTyr: 1.219 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
4.632GlyAla: 4.632 ± 0.969
0.488GlyCys: 0.488 ± 0.175
2.519GlyAsp: 2.519 ± 0.467
2.519GlyGlu: 2.519 ± 0.328
2.763GlyPhe: 2.763 ± 0.522
3.251GlyGly: 3.251 ± 0.552
0.65GlyHis: 0.65 ± 0.229
4.795GlyIle: 4.795 ± 0.93
6.42GlyLys: 6.42 ± 0.792
4.632GlyLeu: 4.632 ± 1.069
1.95GlyMet: 1.95 ± 0.766
2.763GlyAsn: 2.763 ± 0.513
0.975GlyPro: 0.975 ± 0.272
2.763GlyGln: 2.763 ± 0.421
2.357GlyArg: 2.357 ± 0.531
3.576GlySer: 3.576 ± 0.507
4.063GlyThr: 4.063 ± 0.764
3.82GlyVal: 3.82 ± 0.695
1.544GlyTrp: 1.544 ± 0.501
2.763GlyTyr: 2.763 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
0.406HisAla: 0.406 ± 0.159
0.163HisCys: 0.163 ± 0.127
0.244HisAsp: 0.244 ± 0.146
1.056HisGlu: 1.056 ± 0.342
0.65HisPhe: 0.65 ± 0.226
0.975HisGly: 0.975 ± 0.324
0.406HisHis: 0.406 ± 0.175
0.813HisIle: 0.813 ± 0.31
0.894HisLys: 0.894 ± 0.279
0.894HisLeu: 0.894 ± 0.291
0.163HisMet: 0.163 ± 0.125
0.65HisAsn: 0.65 ± 0.22
0.244HisPro: 0.244 ± 0.142
0.406HisGln: 0.406 ± 0.163
0.894HisArg: 0.894 ± 0.263
0.813HisSer: 0.813 ± 0.202
0.813HisThr: 0.813 ± 0.222
0.406HisVal: 0.406 ± 0.214
0.081HisTrp: 0.081 ± 0.085
0.569HisTyr: 0.569 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
6.989IleAla: 6.989 ± 0.798
0.569IleCys: 0.569 ± 0.192
6.42IleAsp: 6.42 ± 0.739
5.933IleGlu: 5.933 ± 0.854
2.194IlePhe: 2.194 ± 0.448
3.901IleGly: 3.901 ± 0.774
0.65IleHis: 0.65 ± 0.282
6.095IleIle: 6.095 ± 1.046
7.395IleLys: 7.395 ± 0.759
4.226IleLeu: 4.226 ± 0.657
1.788IleMet: 1.788 ± 0.403
5.201IleAsn: 5.201 ± 0.773
1.869IlePro: 1.869 ± 0.408
3.251IleGln: 3.251 ± 0.899
2.438IleArg: 2.438 ± 0.448
4.714IleSer: 4.714 ± 1.166
5.526IleThr: 5.526 ± 0.846
5.201IleVal: 5.201 ± 0.578
0.569IleTrp: 0.569 ± 0.179
2.357IleTyr: 2.357 ± 0.471
0.0IleXaa: 0.0 ± 0.0
Lys
6.42LysAla: 6.42 ± 0.705
0.244LysCys: 0.244 ± 0.15
4.388LysAsp: 4.388 ± 0.641
5.933LysGlu: 5.933 ± 0.945
2.194LysPhe: 2.194 ± 0.419
4.47LysGly: 4.47 ± 0.494
0.975LysHis: 0.975 ± 0.43
6.095LysIle: 6.095 ± 0.893
7.395LysLys: 7.395 ± 1.248
6.176LysLeu: 6.176 ± 0.824
2.519LysMet: 2.519 ± 0.453
5.12LysAsn: 5.12 ± 0.682
1.869LysPro: 1.869 ± 0.445
3.738LysGln: 3.738 ± 0.627
4.388LysArg: 4.388 ± 0.715
6.095LysSer: 6.095 ± 0.683
5.526LysThr: 5.526 ± 0.748
4.551LysVal: 4.551 ± 0.845
1.219LysTrp: 1.219 ± 0.319
2.763LysTyr: 2.763 ± 0.702
0.0LysXaa: 0.0 ± 0.0
Leu
5.282LeuAla: 5.282 ± 0.651
0.325LeuCys: 0.325 ± 0.197
6.176LeuAsp: 6.176 ± 0.857
6.826LeuGlu: 6.826 ± 1.061
2.601LeuPhe: 2.601 ± 0.453
5.364LeuGly: 5.364 ± 0.929
0.894LeuHis: 0.894 ± 0.241
6.014LeuIle: 6.014 ± 0.74
5.933LeuLys: 5.933 ± 1.074
5.607LeuLeu: 5.607 ± 0.796
1.463LeuMet: 1.463 ± 0.453
5.282LeuAsn: 5.282 ± 0.618
3.982LeuPro: 3.982 ± 0.555
3.495LeuGln: 3.495 ± 0.617
3.007LeuArg: 3.007 ± 0.64
5.933LeuSer: 5.933 ± 0.83
5.607LeuThr: 5.607 ± 0.896
5.445LeuVal: 5.445 ± 0.774
0.65LeuTrp: 0.65 ± 0.238
2.275LeuTyr: 2.275 ± 0.502
0.0LeuXaa: 0.0 ± 0.0
Met
2.763MetAla: 2.763 ± 0.884
0.163MetCys: 0.163 ± 0.118
1.869MetAsp: 1.869 ± 0.381
1.707MetGlu: 1.707 ± 0.441
0.569MetPhe: 0.569 ± 0.234
0.406MetGly: 0.406 ± 0.163
0.0MetHis: 0.0 ± 0.0
1.95MetIle: 1.95 ± 0.342
1.138MetLys: 1.138 ± 0.251
2.194MetLeu: 2.194 ± 0.314
0.244MetMet: 0.244 ± 0.16
1.056MetAsn: 1.056 ± 0.572
1.056MetPro: 1.056 ± 0.277
1.382MetGln: 1.382 ± 0.328
1.3MetArg: 1.3 ± 0.424
2.438MetSer: 2.438 ± 0.458
2.032MetThr: 2.032 ± 0.32
1.138MetVal: 1.138 ± 0.317
0.406MetTrp: 0.406 ± 0.132
0.244MetTyr: 0.244 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
4.632AsnAla: 4.632 ± 1.094
0.163AsnCys: 0.163 ± 0.115
3.007AsnAsp: 3.007 ± 0.569
3.738AsnGlu: 3.738 ± 0.744
2.275AsnPhe: 2.275 ± 0.414
4.388AsnGly: 4.388 ± 0.618
0.813AsnHis: 0.813 ± 0.294
5.851AsnIle: 5.851 ± 0.813
4.632AsnLys: 4.632 ± 0.671
5.933AsnLeu: 5.933 ± 0.805
1.95AsnMet: 1.95 ± 0.328
4.307AsnAsn: 4.307 ± 0.685
2.357AsnPro: 2.357 ± 0.537
3.007AsnGln: 3.007 ± 0.522
2.601AsnArg: 2.601 ± 0.413
3.657AsnSer: 3.657 ± 1.015
2.844AsnThr: 2.844 ± 0.465
4.063AsnVal: 4.063 ± 0.57
1.138AsnTrp: 1.138 ± 0.346
2.194AsnTyr: 2.194 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
2.357ProAla: 2.357 ± 0.417
0.081ProCys: 0.081 ± 0.08
1.625ProAsp: 1.625 ± 0.538
1.707ProGlu: 1.707 ± 0.446
0.975ProPhe: 0.975 ± 0.33
1.707ProGly: 1.707 ± 0.416
0.325ProHis: 0.325 ± 0.202
1.625ProIle: 1.625 ± 0.371
2.926ProLys: 2.926 ± 0.665
2.113ProLeu: 2.113 ± 0.403
0.163ProMet: 0.163 ± 0.137
1.625ProAsn: 1.625 ± 0.394
0.65ProPro: 0.65 ± 0.4
2.113ProGln: 2.113 ± 0.684
1.463ProArg: 1.463 ± 0.359
1.869ProSer: 1.869 ± 0.485
2.194ProThr: 2.194 ± 0.498
1.707ProVal: 1.707 ± 0.473
0.325ProTrp: 0.325 ± 0.147
0.65ProTyr: 0.65 ± 0.196
0.0ProXaa: 0.0 ± 0.0
Gln
4.063GlnAla: 4.063 ± 0.745
0.325GlnCys: 0.325 ± 0.154
1.3GlnAsp: 1.3 ± 0.259
3.657GlnGlu: 3.657 ± 0.68
1.788GlnPhe: 1.788 ± 0.443
2.438GlnGly: 2.438 ± 0.551
0.569GlnHis: 0.569 ± 0.272
3.738GlnIle: 3.738 ± 0.826
3.495GlnLys: 3.495 ± 0.593
5.526GlnLeu: 5.526 ± 1.022
0.975GlnMet: 0.975 ± 0.375
2.194GlnAsn: 2.194 ± 0.466
1.707GlnPro: 1.707 ± 0.506
2.763GlnGln: 2.763 ± 0.758
1.544GlnArg: 1.544 ± 0.415
2.519GlnSer: 2.519 ± 0.46
3.088GlnThr: 3.088 ± 0.62
1.869GlnVal: 1.869 ± 0.415
0.488GlnTrp: 0.488 ± 0.223
1.625GlnTyr: 1.625 ± 0.295
0.0GlnXaa: 0.0 ± 0.0
Arg
2.194ArgAla: 2.194 ± 0.522
0.325ArgCys: 0.325 ± 0.176
2.357ArgAsp: 2.357 ± 0.453
3.413ArgGlu: 3.413 ± 0.559
1.869ArgPhe: 1.869 ± 0.338
2.113ArgGly: 2.113 ± 0.483
0.813ArgHis: 0.813 ± 0.273
3.738ArgIle: 3.738 ± 0.477
3.088ArgLys: 3.088 ± 0.601
4.063ArgLeu: 4.063 ± 0.915
1.219ArgMet: 1.219 ± 0.27
2.763ArgAsn: 2.763 ± 0.571
1.056ArgPro: 1.056 ± 0.289
1.382ArgGln: 1.382 ± 0.32
1.382ArgArg: 1.382 ± 0.355
2.032ArgSer: 2.032 ± 0.405
2.113ArgThr: 2.113 ± 0.481
2.844ArgVal: 2.844 ± 0.534
0.731ArgTrp: 0.731 ± 0.229
1.869ArgTyr: 1.869 ± 0.422
0.0ArgXaa: 0.0 ± 0.0
Ser
5.201SerAla: 5.201 ± 2.447
0.406SerCys: 0.406 ± 0.228
3.901SerAsp: 3.901 ± 0.501
3.982SerGlu: 3.982 ± 0.515
2.926SerPhe: 2.926 ± 0.421
4.876SerGly: 4.876 ± 1.171
0.731SerHis: 0.731 ± 0.283
4.876SerIle: 4.876 ± 0.981
5.039SerLys: 5.039 ± 0.723
5.689SerLeu: 5.689 ± 0.815
1.707SerMet: 1.707 ± 0.311
3.413SerAsn: 3.413 ± 0.617
1.219SerPro: 1.219 ± 0.305
3.413SerGln: 3.413 ± 0.488
2.113SerArg: 2.113 ± 0.45
5.689SerSer: 5.689 ± 1.623
4.551SerThr: 4.551 ± 0.896
3.413SerVal: 3.413 ± 0.573
0.731SerTrp: 0.731 ± 0.291
1.625SerTyr: 1.625 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
5.282ThrAla: 5.282 ± 0.912
0.163ThrCys: 0.163 ± 0.128
4.145ThrAsp: 4.145 ± 0.666
3.088ThrGlu: 3.088 ± 0.545
2.682ThrPhe: 2.682 ± 0.596
4.388ThrGly: 4.388 ± 0.617
0.325ThrHis: 0.325 ± 0.179
5.364ThrIle: 5.364 ± 0.705
3.82ThrLys: 3.82 ± 0.497
5.445ThrLeu: 5.445 ± 0.779
1.544ThrMet: 1.544 ± 0.502
3.982ThrAsn: 3.982 ± 0.667
2.275ThrPro: 2.275 ± 0.611
2.763ThrGln: 2.763 ± 0.578
1.869ThrArg: 1.869 ± 0.288
4.876ThrSer: 4.876 ± 1.062
4.226ThrThr: 4.226 ± 0.697
4.795ThrVal: 4.795 ± 0.707
0.569ThrTrp: 0.569 ± 0.203
2.032ThrTyr: 2.032 ± 0.405
0.0ThrXaa: 0.0 ± 0.0
Val
4.47ValAla: 4.47 ± 0.658
0.244ValCys: 0.244 ± 0.156
5.12ValAsp: 5.12 ± 0.657
4.957ValGlu: 4.957 ± 0.902
1.95ValPhe: 1.95 ± 0.484
3.82ValGly: 3.82 ± 0.383
0.975ValHis: 0.975 ± 0.243
3.738ValIle: 3.738 ± 0.548
4.632ValLys: 4.632 ± 0.684
4.714ValLeu: 4.714 ± 0.771
1.138ValMet: 1.138 ± 0.332
4.307ValAsn: 4.307 ± 0.578
1.625ValPro: 1.625 ± 0.396
2.032ValGln: 2.032 ± 0.7
1.788ValArg: 1.788 ± 0.372
3.901ValSer: 3.901 ± 0.838
4.226ValThr: 4.226 ± 0.603
4.551ValVal: 4.551 ± 0.699
0.569ValTrp: 0.569 ± 0.202
1.382ValTyr: 1.382 ± 0.436
0.0ValXaa: 0.0 ± 0.0
Trp
0.975TrpAla: 0.975 ± 0.293
0.0TrpCys: 0.0 ± 0.0
0.975TrpAsp: 0.975 ± 0.364
1.219TrpGlu: 1.219 ± 0.277
0.488TrpPhe: 0.488 ± 0.202
0.894TrpGly: 0.894 ± 0.364
0.163TrpHis: 0.163 ± 0.106
1.138TrpIle: 1.138 ± 0.355
1.056TrpLys: 1.056 ± 0.242
0.731TrpLeu: 0.731 ± 0.226
0.163TrpMet: 0.163 ± 0.122
0.894TrpAsn: 0.894 ± 0.286
0.244TrpPro: 0.244 ± 0.124
0.488TrpGln: 0.488 ± 0.252
0.406TrpArg: 0.406 ± 0.178
0.813TrpSer: 0.813 ± 0.239
0.731TrpThr: 0.731 ± 0.326
0.569TrpVal: 0.569 ± 0.263
0.081TrpTrp: 0.081 ± 0.078
0.325TrpTyr: 0.325 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.438TyrAla: 2.438 ± 0.512
0.163TyrCys: 0.163 ± 0.117
2.275TyrAsp: 2.275 ± 0.547
1.625TyrGlu: 1.625 ± 0.34
1.3TyrPhe: 1.3 ± 0.272
1.869TyrGly: 1.869 ± 0.38
0.488TyrHis: 0.488 ± 0.194
2.682TyrIle: 2.682 ± 0.485
3.007TyrLys: 3.007 ± 0.574
2.519TyrLeu: 2.519 ± 0.423
0.65TyrMet: 0.65 ± 0.233
1.869TyrAsn: 1.869 ± 0.515
0.975TyrPro: 0.975 ± 0.286
2.438TyrGln: 2.438 ± 0.574
1.95TyrArg: 1.95 ± 0.489
2.601TyrSer: 2.601 ± 0.493
3.088TyrThr: 3.088 ± 0.637
1.3TyrVal: 1.3 ± 0.28
0.325TyrTrp: 0.325 ± 0.16
1.382TyrTyr: 1.382 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12306 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski