Amino acid dipeptide frequency for Haloterrigena jeotgali icosahedral virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
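As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the one-letter alphabet, the mapping of non-standard residues to `X` (Xaa), and the choice of denominator (total number of dipeptides counted) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids (assumption: the input
# sequences use one-letter codes, unlike the three-letter labels in the table).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAG", "AAC"]
freqs = dipeptide_permille(toy)
print(len(freqs), round(freqs["AA"], 3))
```

In the toy input, the pairs are MA, AA, AG, AA and AC, so AA accounts for 2 of 5 dipeptides. The page does not state which denominator it uses, so values computed this way may differ slightly from the table.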

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and all amino acids were equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
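The arithmetic behind the 0.25% expectation and the per mille units can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the simple comparison against the uniform expectation is only an assumption about how over- and underrepresentation might be decided; the page does not describe its exact colouring rule.

```python
import math

N_STANDARD = 20

# Uniform expectation over the 20 x 20 = 400 standard dipeptides.
expected_fraction = 1 / N_STANDARD**2           # 1/400
expected_percent = 100 * expected_fraction      # about 0.25 %
expected_permille = 1000 * expected_fraction    # about 2.5 per mille


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
ala_ala = 11.241
cys_cys = 0.234
print(classify(ala_ala), classify(cys_cys))
print(math.isclose(permille_to_fraction(ala_ala), 0.011241))
```

With these table values, AlaAla lies above the uniform expectation and CysCys below it.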

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.241 ± 3.102
AlaCys: 0.468 ± 0.303
AlaAsp: 5.621 ± 1.399
AlaGlu: 8.197 ± 1.701
AlaPhe: 2.81 ± 0.698
AlaGly: 7.494 ± 2.958
AlaHis: 1.874 ± 0.628
AlaIle: 3.747 ± 0.982
AlaLys: 1.639 ± 0.646
AlaLeu: 9.368 ± 1.603
AlaMet: 2.108 ± 0.863
AlaAsn: 3.044 ± 0.76
AlaPro: 3.981 ± 0.794
AlaGln: 3.279 ± 0.695
AlaArg: 5.621 ± 1.172
AlaSer: 6.557 ± 1.046
AlaThr: 7.963 ± 1.859
AlaVal: 6.557 ± 1.348
AlaTrp: 0.703 ± 0.372
AlaTyr: 1.171 ± 0.418
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.937 ± 0.639
CysCys: 0.234 ± 0.208
CysAsp: 0.703 ± 0.371
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.171 ± 0.509
CysHis: 0.234 ± 0.208
CysIle: 0.0 ± 0.0
CysLys: 0.234 ± 0.25
CysLeu: 0.234 ± 0.25
CysMet: 0.0 ± 0.0
CysAsn: 0.703 ± 0.374
CysPro: 1.874 ± 0.575
CysGln: 0.234 ± 0.239
CysArg: 0.703 ± 0.394
CysSer: 0.468 ± 0.314
CysThr: 0.234 ± 0.319
CysVal: 0.468 ± 0.469
CysTrp: 0.234 ± 0.206
CysTyr: 0.468 ± 0.327
CysXaa: 0.0 ± 0.0
Asp
AspAla: 10.539 ± 2.334
AspCys: 1.405 ± 0.695
AspAsp: 11.71 ± 2.097
AspGlu: 9.133 ± 1.588
AspPhe: 3.044 ± 0.699
AspGly: 9.836 ± 2.109
AspHis: 1.405 ± 0.619
AspIle: 3.044 ± 0.664
AspLys: 1.171 ± 0.697
AspLeu: 7.26 ± 1.049
AspMet: 1.874 ± 0.514
AspAsn: 0.937 ± 0.381
AspPro: 3.279 ± 0.888
AspGln: 1.639 ± 0.653
AspArg: 5.621 ± 1.249
AspSer: 6.323 ± 0.961
AspThr: 5.152 ± 1.072
AspVal: 6.089 ± 1.034
AspTrp: 2.108 ± 0.603
AspTyr: 2.576 ± 0.63
AspXaa: 0.0 ± 0.0
Glu
GluAla: 10.07 ± 1.939
GluCys: 0.0 ± 0.0
GluAsp: 7.963 ± 1.559
GluGlu: 7.728 ± 1.632
GluPhe: 1.639 ± 0.616
GluGly: 7.728 ± 1.297
GluHis: 1.171 ± 0.499
GluIle: 4.215 ± 1.278
GluLys: 2.342 ± 0.772
GluLeu: 4.215 ± 0.994
GluMet: 2.342 ± 0.82
GluAsn: 2.108 ± 0.59
GluPro: 4.215 ± 1.141
GluGln: 3.513 ± 1.015
GluArg: 9.368 ± 1.688
GluSer: 4.215 ± 1.003
GluThr: 4.684 ± 1.077
GluVal: 4.45 ± 1.273
GluTrp: 1.874 ± 0.683
GluTyr: 2.108 ± 0.816
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.576 ± 0.619
PheCys: 0.234 ± 0.25
PheAsp: 3.279 ± 0.85
PheGlu: 2.81 ± 0.872
PhePhe: 0.937 ± 0.558
PheGly: 2.81 ± 0.73
PheHis: 0.234 ± 0.319
PheIle: 0.234 ± 0.218
PheLys: 0.468 ± 0.359
PheLeu: 2.108 ± 0.488
PheMet: 0.468 ± 0.285
PheAsn: 1.639 ± 0.645
PhePro: 1.405 ± 0.568
PheGln: 0.937 ± 0.542
PheArg: 2.342 ± 0.604
PheSer: 0.468 ± 0.327
PheThr: 2.108 ± 0.679
PheVal: 2.81 ± 0.93
PheTrp: 0.0 ± 0.0
PheTyr: 0.234 ± 0.218
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.026 ± 2.351
GlyCys: 0.0 ± 0.0
GlyAsp: 8.899 ± 2.252
GlyGlu: 4.918 ± 0.904
GlyPhe: 3.044 ± 0.747
GlyGly: 8.899 ± 3.222
GlyHis: 1.405 ± 0.645
GlyIle: 4.918 ± 1.033
GlyLys: 2.576 ± 1.032
GlyLeu: 7.963 ± 1.257
GlyMet: 2.342 ± 0.71
GlyAsn: 3.981 ± 1.087
GlyPro: 2.576 ± 0.803
GlyGln: 2.576 ± 0.581
GlyArg: 4.918 ± 1.282
GlySer: 4.918 ± 0.991
GlyThr: 5.386 ± 1.225
GlyVal: 4.918 ± 1.013
GlyTrp: 1.639 ± 0.653
GlyTyr: 2.342 ± 0.932
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.703 ± 0.341
HisCys: 0.234 ± 0.25
HisAsp: 2.576 ± 0.852
HisGlu: 0.937 ± 0.442
HisPhe: 0.234 ± 0.218
HisGly: 1.171 ± 0.434
HisHis: 0.234 ± 0.208
HisIle: 0.468 ± 0.272
HisLys: 0.234 ± 0.239
HisLeu: 0.937 ± 0.427
HisMet: 0.0 ± 0.0
HisAsn: 1.171 ± 0.43
HisPro: 0.937 ± 0.392
HisGln: 0.0 ± 0.0
HisArg: 1.639 ± 0.512
HisSer: 0.468 ± 0.266
HisThr: 0.937 ± 0.379
HisVal: 2.108 ± 0.606
HisTrp: 0.0 ± 0.0
HisTyr: 0.468 ± 0.291
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.279 ± 0.752
IleCys: 0.234 ± 0.261
IleAsp: 5.621 ± 1.267
IleGlu: 5.152 ± 1.227
IlePhe: 0.937 ± 0.553
IleGly: 5.152 ± 0.988
IleHis: 0.468 ± 0.417
IleIle: 0.703 ± 0.449
IleLys: 0.937 ± 0.415
IleLeu: 2.108 ± 0.804
IleMet: 1.405 ± 0.434
IleAsn: 0.468 ± 0.266
IlePro: 2.108 ± 0.573
IleGln: 1.171 ± 0.611
IleArg: 3.513 ± 0.936
IleSer: 1.639 ± 0.711
IleThr: 1.171 ± 0.477
IleVal: 2.81 ± 0.826
IleTrp: 0.468 ± 0.292
IleTyr: 1.874 ± 0.518
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.108 ± 0.645
LysCys: 0.468 ± 0.285
LysAsp: 0.703 ± 0.424
LysGlu: 1.874 ± 0.679
LysPhe: 0.703 ± 0.307
LysGly: 2.576 ± 0.711
LysHis: 0.703 ± 0.368
LysIle: 1.405 ± 0.522
LysLys: 1.171 ± 0.575
LysLeu: 0.937 ± 0.44
LysMet: 0.468 ± 0.275
LysAsn: 0.468 ± 0.337
LysPro: 1.171 ± 0.587
LysGln: 0.937 ± 0.547
LysArg: 2.108 ± 0.688
LysSer: 1.171 ± 0.534
LysThr: 1.639 ± 0.652
LysVal: 0.937 ± 0.512
LysTrp: 1.639 ± 0.629
LysTyr: 1.405 ± 0.626
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.323 ± 1.446
LeuCys: 0.937 ± 0.524
LeuAsp: 4.684 ± 0.995
LeuGlu: 11.475 ± 2.23
LeuPhe: 2.108 ± 0.72
LeuGly: 5.386 ± 0.888
LeuHis: 0.468 ± 0.345
LeuIle: 3.044 ± 0.63
LeuLys: 1.874 ± 0.523
LeuLeu: 8.665 ± 2.064
LeuMet: 1.171 ± 0.611
LeuAsn: 1.405 ± 0.617
LeuPro: 4.684 ± 1.022
LeuGln: 3.513 ± 1.127
LeuArg: 7.494 ± 1.881
LeuSer: 6.089 ± 1.233
LeuThr: 3.513 ± 0.786
LeuVal: 5.855 ± 1.092
LeuTrp: 0.234 ± 0.218
LeuTyr: 1.405 ± 0.712
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.81 ± 1.056
MetCys: 0.234 ± 0.208
MetAsp: 0.937 ± 0.517
MetGlu: 0.703 ± 0.392
MetPhe: 0.234 ± 0.261
MetGly: 2.342 ± 0.573
MetHis: 0.234 ± 0.215
MetIle: 1.874 ± 0.658
MetLys: 0.234 ± 0.23
MetLeu: 0.703 ± 0.425
MetMet: 0.703 ± 0.322
MetAsn: 0.468 ± 0.324
MetPro: 0.468 ± 0.258
MetGln: 1.171 ± 0.502
MetArg: 0.937 ± 0.423
MetSer: 2.108 ± 0.567
MetThr: 1.874 ± 0.533
MetVal: 2.108 ± 0.605
MetTrp: 0.234 ± 0.25
MetTyr: 0.468 ± 0.325
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.576 ± 0.768
AsnCys: 0.0 ± 0.0
AsnAsp: 3.747 ± 0.942
AsnGlu: 1.171 ± 0.504
AsnPhe: 0.937 ± 0.708
AsnGly: 1.874 ± 0.59
AsnHis: 0.703 ± 0.419
AsnIle: 1.171 ± 0.502
AsnLys: 0.703 ± 0.369
AsnLeu: 2.576 ± 0.784
AsnMet: 0.0 ± 0.0
AsnAsn: 2.576 ± 0.956
AsnPro: 1.874 ± 0.536
AsnGln: 0.937 ± 0.519
AsnArg: 1.171 ± 0.469
AsnSer: 2.81 ± 0.745
AsnThr: 1.874 ± 0.53
AsnVal: 0.937 ± 0.587
AsnTrp: 0.703 ± 0.338
AsnTyr: 0.468 ± 0.289
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.918 ± 0.652
ProCys: 0.937 ± 0.461
ProAsp: 3.513 ± 0.909
ProGlu: 3.513 ± 0.817
ProPhe: 1.405 ± 0.776
ProGly: 5.386 ± 1.166
ProHis: 1.171 ± 0.514
ProIle: 1.874 ± 0.685
ProLys: 1.639 ± 0.714
ProLeu: 2.81 ± 0.584
ProMet: 0.703 ± 0.617
ProAsn: 0.937 ± 0.393
ProPro: 3.747 ± 1.155
ProGln: 0.468 ± 0.31
ProArg: 3.044 ± 0.727
ProSer: 3.044 ± 0.979
ProThr: 4.45 ± 1.022
ProVal: 3.279 ± 1.032
ProTrp: 0.703 ± 0.465
ProTyr: 1.405 ± 0.598
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.386 ± 1.132
GlnCys: 0.234 ± 0.239
GlnAsp: 2.342 ± 0.699
GlnGlu: 3.513 ± 1.004
GlnPhe: 0.937 ± 0.492
GlnGly: 1.639 ± 0.568
GlnHis: 0.234 ± 0.208
GlnIle: 0.937 ± 0.448
GlnLys: 0.468 ± 0.306
GlnLeu: 1.874 ± 0.831
GlnMet: 0.937 ± 0.456
GlnAsn: 0.937 ± 0.354
GlnPro: 1.639 ± 0.832
GlnGln: 1.171 ± 0.465
GlnArg: 2.342 ± 0.833
GlnSer: 1.405 ± 0.464
GlnThr: 2.576 ± 0.714
GlnVal: 1.171 ± 0.608
GlnTrp: 0.937 ± 0.456
GlnTyr: 0.468 ± 0.335
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.855 ± 1.357
ArgCys: 0.937 ± 0.452
ArgAsp: 5.386 ± 1.104
ArgGlu: 7.728 ± 1.695
ArgPhe: 2.81 ± 0.802
ArgGly: 3.044 ± 0.919
ArgHis: 0.937 ± 0.477
ArgIle: 1.639 ± 0.599
ArgLys: 3.044 ± 0.898
ArgLeu: 8.431 ± 1.489
ArgMet: 2.108 ± 0.612
ArgAsn: 1.639 ± 0.543
ArgPro: 3.279 ± 0.938
ArgGln: 3.279 ± 1.0
ArgArg: 9.133 ± 4.823
ArgSer: 6.323 ± 1.754
ArgThr: 4.918 ± 1.181
ArgVal: 4.684 ± 0.974
ArgTrp: 0.468 ± 0.308
ArgTyr: 1.171 ± 0.49
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.342 ± 0.658
SerCys: 0.468 ± 0.3
SerAsp: 8.197 ± 1.797
SerGlu: 4.215 ± 0.783
SerPhe: 0.703 ± 0.461
SerGly: 3.747 ± 0.927
SerHis: 0.937 ± 0.444
SerIle: 2.81 ± 0.826
SerLys: 1.405 ± 0.61
SerLeu: 3.513 ± 0.727
SerMet: 2.342 ± 0.684
SerAsn: 2.576 ± 1.142
SerPro: 3.747 ± 0.967
SerGln: 1.874 ± 0.499
SerArg: 4.684 ± 1.321
SerSer: 7.026 ± 1.834
SerThr: 5.152 ± 1.225
SerVal: 4.684 ± 1.299
SerTrp: 1.639 ± 0.586
SerTyr: 1.874 ± 0.743
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.621 ± 1.292
ThrCys: 1.639 ± 0.688
ThrAsp: 6.323 ± 1.672
ThrGlu: 4.918 ± 0.983
ThrPhe: 2.342 ± 0.609
ThrGly: 4.684 ± 1.003
ThrHis: 1.171 ± 0.506
ThrIle: 4.215 ± 1.149
ThrLys: 1.874 ± 0.696
ThrLeu: 5.386 ± 1.455
ThrMet: 0.468 ± 0.359
ThrAsn: 2.108 ± 0.906
ThrPro: 2.81 ± 1.065
ThrGln: 2.342 ± 0.637
ThrArg: 3.279 ± 0.688
ThrSer: 4.918 ± 1.336
ThrThr: 4.215 ± 1.148
ThrVal: 3.981 ± 1.154
ThrTrp: 0.937 ± 0.368
ThrTyr: 2.108 ± 1.109
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.792 ± 1.381
ValCys: 0.468 ± 0.283
ValAsp: 9.368 ± 1.688
ValGlu: 5.152 ± 1.485
ValPhe: 1.639 ± 0.53
ValGly: 5.152 ± 0.966
ValHis: 0.234 ± 0.239
ValIle: 2.81 ± 0.692
ValLys: 1.405 ± 0.426
ValLeu: 7.26 ± 1.247
ValMet: 0.937 ± 0.438
ValAsn: 0.937 ± 0.53
ValPro: 2.81 ± 0.887
ValGln: 1.171 ± 0.532
ValArg: 4.684 ± 0.947
ValSer: 2.576 ± 0.809
ValThr: 4.45 ± 0.844
ValVal: 5.152 ± 1.501
ValTrp: 1.405 ± 0.71
ValTyr: 1.171 ± 0.545
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.937 ± 0.457
TrpCys: 0.0 ± 0.0
TrpAsp: 0.937 ± 0.45
TrpGlu: 1.874 ± 0.716
TrpPhe: 0.937 ± 0.575
TrpGly: 2.108 ± 0.939
TrpHis: 0.703 ± 0.373
TrpIle: 0.468 ± 0.295
TrpLys: 0.234 ± 0.208
TrpLeu: 0.937 ± 0.431
TrpMet: 0.234 ± 0.266
TrpAsn: 0.234 ± 0.186
TrpPro: 0.234 ± 0.235
TrpGln: 0.234 ± 0.215
TrpArg: 2.108 ± 0.73
TrpSer: 0.468 ± 0.256
TrpThr: 0.937 ± 0.392
TrpVal: 1.405 ± 0.518
TrpTrp: 0.234 ± 0.239
TrpTyr: 0.937 ± 0.332
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.405 ± 0.47
TyrCys: 0.0 ± 0.0
TyrAsp: 1.639 ± 0.659
TyrGlu: 1.171 ± 0.415
TyrPhe: 0.703 ± 0.402
TyrGly: 3.044 ± 0.799
TyrHis: 0.937 ± 0.385
TyrIle: 1.639 ± 0.623
TyrLys: 0.937 ± 0.373
TyrLeu: 3.044 ± 0.92
TyrMet: 0.0 ± 0.0
TyrAsn: 0.468 ± 0.272
TyrPro: 2.108 ± 0.659
TyrGln: 0.703 ± 0.395
TyrArg: 2.108 ± 0.615
TyrSer: 0.937 ± 0.454
TyrThr: 2.108 ± 1.151
TyrVal: 1.171 ± 0.383
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.703 ± 0.497
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (4271 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
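A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. This is a minimal sketch under stated assumptions, not the database's implementation. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions; the note only says that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    pairs = [seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)]
    return 1000.0 * pairs.count(dipeptide) / len(pairs) if pairs else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAG", "AAC", "GGA"]
err_aa = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "WW")
print(err_aa > 0, err_absent)
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual residues.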

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski