Amino acid dipeptide frequency for Orgi virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
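As an illustration of the idea, a dipeptide frequency can be obtained by sliding a window of two residues along each protein and counting the pairs. The sketch below is only a minimal example: the function name dipeptide_permille, the use of one-letter codes, and the normalization by the total number of counted pairs are assumptions, since the page does not state the exact counting convention used by the database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: overlapping pairs are counted within each protein
    (pairs never span two proteins) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input, not data from this proteome.
freqs = dipeptide_permille(["MKLLA", "MKU"])
```

Here "MKLLA" contributes the pairs MK, KL, LL and LA, while "MKU" contributes MK and KX, because U is not one of the 20 standard residues.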

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the 0.25% expectation corresponds to 2.5‰.
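The arithmetic behind the expected value and the over/under-representation marking can be sketched as follows. This is a minimal example under stated assumptions: the helper names expected_permille and classify are hypothetical, and a plain comparison against the uniform expectation is assumed, because the page does not say which threshold is used for the colouring. The three values are taken from the table.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


expected = expected_permille()        # 20 * 20 = 400 dipeptides
expected_with_xaa = expected_permille(21)  # 21 * 21 = 441 dipeptides

# Values in per mille, copied from the table.
table = {"LeuLeu": 10.039, "AlaAla": 1.952, "CysCys": 0.0}
labels = {dp: classify(v, expected) for dp, v in table.items()}

# Converting a per mille value to a plain fraction.
leuleu_fraction = table["LeuLeu"] * 1e-3
```

With 400 dipeptides the expectation is 2.5‰; with Xaa included (441 dipeptides) it drops to about 2.27‰.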

For more information, see the sequence space article on Wikipedia.

Entries are grouped by the first residue; within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa. Each entry reads dipeptide: frequency ± error (‰).
Ala
AlaAla: 1.952 ± 0.324
AlaCys: 1.394 ± 0.623
AlaAsp: 1.952 ± 1.088
AlaGlu: 3.346 ± 1.021
AlaPhe: 2.789 ± 1.088
AlaGly: 1.952 ± 0.78
AlaHis: 1.115 ± 0.43
AlaIle: 6.135 ± 2.024
AlaLys: 2.51 ± 0.414
AlaLeu: 5.298 ± 1.739
AlaMet: 0.837 ± 0.548
AlaAsn: 2.231 ± 0.86
AlaPro: 1.673 ± 1.328
AlaGln: 1.952 ± 0.612
AlaArg: 1.952 ± 0.576
AlaSer: 2.51 ± 0.582
AlaThr: 2.231 ± 0.963
AlaVal: 1.394 ± 0.527
AlaTrp: 0.558 ± 0.552
AlaTyr: 1.115 ± 0.476
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.558 ± 0.38
CysCys: 0.0 ± 0.0
CysAsp: 1.115 ± 0.777
CysGlu: 1.115 ± 0.631
CysPhe: 0.0 ± 0.0
CysGly: 0.837 ± 1.126
CysHis: 0.558 ± 0.409
CysIle: 0.837 ± 0.466
CysLys: 1.394 ± 0.986
CysLeu: 1.952 ± 0.932
CysMet: 0.558 ± 0.38
CysAsn: 1.115 ± 0.403
CysPro: 0.279 ± 0.458
CysGln: 0.0 ± 0.0
CysArg: 0.279 ± 0.155
CysSer: 0.837 ± 0.466
CysThr: 1.115 ± 0.631
CysVal: 0.558 ± 0.316
CysTrp: 0.558 ± 0.316
CysTyr: 0.279 ± 0.375
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.115 ± 0.622
AspCys: 0.558 ± 0.38
AspAsp: 5.02 ± 2.268
AspGlu: 3.904 ± 1.986
AspPhe: 2.231 ± 0.511
AspGly: 2.231 ± 0.684
AspHis: 1.115 ± 0.622
AspIle: 3.346 ± 0.633
AspLys: 2.789 ± 1.006
AspLeu: 7.808 ± 1.572
AspMet: 1.673 ± 0.836
AspAsn: 1.394 ± 0.516
AspPro: 4.741 ± 1.142
AspGln: 2.51 ± 0.649
AspArg: 3.067 ± 0.793
AspSer: 4.183 ± 0.503
AspThr: 1.673 ± 0.756
AspVal: 3.346 ± 0.971
AspTrp: 2.231 ± 0.86
AspTyr: 1.952 ± 0.78
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.51 ± 2.1
GluCys: 0.558 ± 0.316
GluAsp: 3.904 ± 1.192
GluGlu: 4.741 ± 0.8
GluPhe: 0.837 ± 0.466
GluGly: 4.462 ± 1.33
GluHis: 0.837 ± 0.494
GluIle: 4.462 ± 0.908
GluLys: 3.625 ± 0.818
GluLeu: 5.856 ± 0.853
GluMet: 0.837 ± 0.466
GluAsn: 2.789 ± 1.145
GluPro: 1.673 ± 0.634
GluGln: 1.673 ± 0.647
GluArg: 2.789 ± 0.846
GluSer: 4.741 ± 1.959
GluThr: 3.904 ± 2.23
GluVal: 4.462 ± 0.882
GluTrp: 0.558 ± 0.38
GluTyr: 1.952 ± 0.446
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.558 ± 0.38
PheCys: 0.279 ± 0.155
PheAsp: 2.231 ± 1.221
PheGlu: 2.231 ± 0.778
PhePhe: 2.789 ± 1.084
PheGly: 1.952 ± 0.324
PheHis: 0.837 ± 0.621
PheIle: 1.952 ± 0.446
PheLys: 3.067 ± 0.508
PheLeu: 5.577 ± 1.718
PheMet: 1.673 ± 0.409
PheAsn: 2.231 ± 0.337
PhePro: 1.952 ± 0.792
PheGln: 2.789 ± 1.555
PheArg: 1.673 ± 0.647
PheSer: 2.51 ± 1.157
PheThr: 1.394 ± 0.417
PheVal: 5.02 ± 1.181
PheTrp: 0.279 ± 0.155
PheTyr: 1.115 ± 0.342
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.067 ± 0.993
GlyCys: 0.558 ± 0.409
GlyAsp: 3.625 ± 0.788
GlyGlu: 3.067 ± 1.643
GlyPhe: 3.346 ± 0.721
GlyGly: 3.346 ± 0.866
GlyHis: 1.673 ± 0.956
GlyIle: 4.462 ± 0.969
GlyLys: 3.625 ± 1.018
GlyLeu: 6.693 ± 1.332
GlyMet: 0.558 ± 0.409
GlyAsn: 1.952 ± 0.871
GlyPro: 2.789 ± 0.629
GlyGln: 2.51 ± 0.932
GlyArg: 2.51 ± 0.826
GlySer: 5.577 ± 1.665
GlyThr: 3.067 ± 1.899
GlyVal: 3.067 ± 0.877
GlyTrp: 1.394 ± 1.116
GlyTyr: 1.952 ± 0.446
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.394 ± 0.739
HisCys: 0.837 ± 0.327
HisAsp: 1.394 ± 0.539
HisGlu: 1.952 ± 0.446
HisPhe: 2.231 ± 0.564
HisGly: 1.115 ± 0.585
HisHis: 0.558 ± 0.311
HisIle: 1.952 ± 0.544
HisLys: 0.837 ± 0.569
HisLeu: 3.346 ± 0.642
HisMet: 0.0 ± 0.0
HisAsn: 0.837 ± 0.327
HisPro: 1.394 ± 0.658
HisGln: 1.115 ± 0.607
HisArg: 1.394 ± 0.777
HisSer: 1.952 ± 1.07
HisThr: 0.558 ± 0.554
HisVal: 1.394 ± 0.658
HisTrp: 0.558 ± 0.316
HisTyr: 1.952 ± 1.016
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.625 ± 0.824
IleCys: 2.231 ± 1.553
IleAsp: 3.625 ± 0.808
IleGlu: 4.462 ± 1.263
IlePhe: 1.952 ± 0.569
IleGly: 3.067 ± 1.389
IleHis: 1.673 ± 0.864
IleIle: 4.462 ± 0.924
IleLys: 5.856 ± 1.898
IleLeu: 5.298 ± 1.129
IleMet: 1.115 ± 0.463
IleAsn: 5.577 ± 1.862
IlePro: 5.577 ± 1.385
IleGln: 2.231 ± 0.626
IleArg: 4.741 ± 1.423
IleSer: 5.856 ± 0.568
IleThr: 3.346 ± 1.006
IleVal: 3.346 ± 1.394
IleTrp: 1.115 ± 0.403
IleTyr: 2.789 ± 0.568
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.067 ± 0.52
LysCys: 1.115 ± 0.631
LysAsp: 2.51 ± 0.57
LysGlu: 4.741 ± 0.563
LysPhe: 2.231 ± 0.953
LysGly: 3.067 ± 1.28
LysHis: 0.837 ± 0.466
LysIle: 5.02 ± 0.958
LysLys: 5.298 ± 1.782
LysLeu: 5.577 ± 1.433
LysMet: 0.558 ± 0.311
LysAsn: 1.673 ± 0.793
LysPro: 2.231 ± 0.429
LysGln: 1.394 ± 1.271
LysArg: 3.625 ± 0.958
LysSer: 6.135 ± 1.149
LysThr: 4.183 ± 2.383
LysVal: 3.625 ± 1.339
LysTrp: 1.952 ± 0.611
LysTyr: 2.51 ± 1.408
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.298 ± 1.886
LeuCys: 0.837 ± 0.327
LeuAsp: 7.529 ± 1.339
LeuGlu: 6.693 ± 1.612
LeuPhe: 4.183 ± 1.297
LeuGly: 8.366 ± 3.542
LeuHis: 3.625 ± 1.332
LeuIle: 8.645 ± 2.597
LeuLys: 4.741 ± 0.761
LeuLeu: 10.039 ± 2.397
LeuMet: 2.51 ± 1.512
LeuAsn: 3.904 ± 0.844
LeuPro: 3.625 ± 0.983
LeuGln: 3.346 ± 1.294
LeuArg: 7.25 ± 2.192
LeuSer: 7.529 ± 1.542
LeuThr: 7.808 ± 0.815
LeuVal: 4.462 ± 1.577
LeuTrp: 1.115 ± 0.561
LeuTyr: 5.298 ± 1.135
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.115 ± 0.403
MetCys: 0.0 ± 0.0
MetAsp: 2.51 ± 0.565
MetGlu: 1.394 ± 0.656
MetPhe: 0.558 ± 0.38
MetGly: 1.673 ± 1.089
MetHis: 0.0 ± 0.0
MetIle: 1.673 ± 0.38
MetLys: 1.952 ± 1.238
MetLeu: 1.673 ± 1.242
MetMet: 0.558 ± 0.38
MetAsn: 1.673 ± 0.647
MetPro: 0.558 ± 0.316
MetGln: 0.558 ± 0.316
MetArg: 1.115 ± 0.503
MetSer: 1.394 ± 0.417
MetThr: 1.673 ± 0.75
MetVal: 1.115 ± 0.403
MetTrp: 0.0 ± 0.0
MetTyr: 1.394 ± 0.613
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.067 ± 2.014
AsnCys: 0.279 ± 0.375
AsnAsp: 1.115 ± 0.622
AsnGlu: 1.394 ± 0.417
AsnPhe: 3.904 ± 1.813
AsnGly: 1.115 ± 0.403
AsnHis: 1.952 ± 1.043
AsnIle: 2.789 ± 0.662
AsnLys: 2.789 ± 0.846
AsnLeu: 7.808 ± 2.707
AsnMet: 1.115 ± 0.334
AsnAsn: 3.625 ± 1.076
AsnPro: 3.346 ± 0.466
AsnGln: 1.673 ± 0.38
AsnArg: 2.789 ± 1.039
AsnSer: 3.067 ± 0.52
AsnThr: 2.789 ± 0.862
AsnVal: 2.231 ± 0.484
AsnTrp: 1.115 ± 1.108
AsnTyr: 2.231 ± 0.806
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.394 ± 0.31
ProCys: 0.0 ± 0.0
ProAsp: 3.625 ± 0.66
ProGlu: 1.673 ± 1.545
ProPhe: 0.837 ± 1.145
ProGly: 2.789 ± 1.197
ProHis: 3.346 ± 1.354
ProIle: 1.394 ± 0.623
ProLys: 2.231 ± 0.92
ProLeu: 4.462 ± 1.863
ProMet: 1.673 ± 0.777
ProAsn: 3.346 ± 1.995
ProPro: 3.067 ± 0.929
ProGln: 1.394 ± 0.481
ProArg: 2.51 ± 1.162
ProSer: 3.625 ± 1.526
ProThr: 3.904 ± 0.224
ProVal: 4.462 ± 0.754
ProTrp: 0.558 ± 0.311
ProTyr: 1.673 ± 0.75
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.673 ± 0.459
GlnCys: 0.279 ± 0.155
GlnAsp: 2.231 ± 0.699
GlnGlu: 1.115 ± 0.76
GlnPhe: 1.394 ± 0.527
GlnGly: 3.625 ± 0.763
GlnHis: 1.394 ± 0.527
GlnIle: 1.952 ± 0.874
GlnLys: 2.231 ± 0.674
GlnLeu: 3.346 ± 0.887
GlnMet: 2.231 ± 1.403
GlnAsn: 1.673 ± 0.933
GlnPro: 1.115 ± 0.607
GlnGln: 0.558 ± 0.75
GlnArg: 0.837 ± 0.432
GlnSer: 1.952 ± 0.932
GlnThr: 1.673 ± 0.647
GlnVal: 1.952 ± 0.612
GlnTrp: 1.115 ± 1.037
GlnTyr: 2.231 ± 0.429
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.789 ± 0.936
ArgCys: 1.394 ± 0.623
ArgAsp: 2.51 ± 0.704
ArgGlu: 1.673 ± 0.834
ArgPhe: 3.346 ± 1.29
ArgGly: 4.462 ± 1.345
ArgHis: 1.673 ± 0.647
ArgIle: 2.789 ± 1.63
ArgLys: 2.51 ± 0.8
ArgLeu: 6.693 ± 1.093
ArgMet: 0.837 ± 0.375
ArgAsn: 2.51 ± 0.783
ArgPro: 1.952 ± 0.66
ArgGln: 1.952 ± 0.693
ArgArg: 1.673 ± 1.226
ArgSer: 3.625 ± 1.546
ArgThr: 5.298 ± 0.868
ArgVal: 2.789 ± 1.383
ArgTrp: 1.115 ± 0.403
ArgTyr: 0.837 ± 0.327
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.625 ± 1.408
SerCys: 0.837 ± 0.327
SerAsp: 4.462 ± 0.837
SerGlu: 5.298 ± 1.52
SerPhe: 1.952 ± 0.78
SerGly: 3.346 ± 0.892
SerHis: 1.952 ± 0.446
SerIle: 6.972 ± 3.115
SerLys: 6.135 ± 1.778
SerLeu: 7.25 ± 1.373
SerMet: 1.115 ± 0.403
SerAsn: 3.904 ± 1.265
SerPro: 3.067 ± 0.909
SerGln: 1.952 ± 0.624
SerArg: 3.904 ± 0.844
SerSer: 5.856 ± 2.077
SerThr: 4.741 ± 2.465
SerVal: 4.183 ± 1.649
SerTrp: 1.952 ± 0.576
SerTyr: 3.904 ± 0.997
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.067 ± 0.508
ThrCys: 0.558 ± 0.311
ThrAsp: 3.625 ± 0.908
ThrGlu: 2.789 ± 0.991
ThrPhe: 1.952 ± 0.576
ThrGly: 3.625 ± 1.66
ThrHis: 1.394 ± 0.623
ThrIle: 4.741 ± 1.221
ThrLys: 3.625 ± 1.2
ThrLeu: 6.972 ± 1.136
ThrMet: 1.394 ± 0.31
ThrAsn: 3.067 ± 0.458
ThrPro: 3.904 ± 1.741
ThrGln: 0.558 ± 0.554
ThrArg: 3.346 ± 2.078
ThrSer: 4.462 ± 2.169
ThrThr: 4.741 ± 0.65
ThrVal: 4.462 ± 1.806
ThrTrp: 1.394 ± 0.777
ThrTyr: 1.115 ± 1.181
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.51 ± 1.102
ValCys: 0.837 ± 0.327
ValAsp: 2.789 ± 1.524
ValGlu: 2.51 ± 0.789
ValPhe: 1.952 ± 0.761
ValGly: 3.625 ± 0.809
ValHis: 1.115 ± 0.76
ValIle: 5.298 ± 1.763
ValLys: 3.067 ± 1.368
ValLeu: 5.298 ± 1.345
ValMet: 1.394 ± 0.562
ValAsn: 2.231 ± 0.568
ValPro: 2.231 ± 0.778
ValGln: 3.346 ± 1.065
ValArg: 3.904 ± 0.549
ValSer: 4.741 ± 1.549
ValThr: 3.625 ± 1.019
ValVal: 3.067 ± 0.763
ValTrp: 0.0 ± 0.0
ValTyr: 3.625 ± 2.087
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.837 ± 0.494
TrpCys: 0.558 ± 0.552
TrpAsp: 0.279 ± 0.155
TrpGlu: 1.394 ± 0.516
TrpPhe: 0.837 ± 0.375
TrpGly: 1.394 ± 0.527
TrpHis: 0.279 ± 0.155
TrpIle: 2.231 ± 0.944
TrpLys: 1.394 ± 0.658
TrpLeu: 1.673 ± 0.647
TrpMet: 0.279 ± 0.375
TrpAsn: 1.673 ± 1.702
TrpPro: 0.279 ± 0.155
TrpGln: 0.837 ± 0.375
TrpArg: 0.558 ± 0.78
TrpSer: 1.952 ± 0.624
TrpThr: 0.837 ± 0.432
TrpVal: 0.837 ± 0.432
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.673 ± 0.647
TyrCys: 0.837 ± 0.676
TyrAsp: 0.837 ± 0.466
TyrGlu: 1.673 ± 1.026
TyrPhe: 2.789 ± 1.272
TyrGly: 2.789 ± 0.62
TyrHis: 0.837 ± 0.466
TyrIle: 1.115 ± 0.631
TyrLys: 1.673 ± 0.459
TyrLeu: 4.183 ± 1.231
TyrMet: 1.394 ± 0.539
TyrAsn: 2.789 ± 1.194
TyrPro: 2.51 ± 0.699
TyrGln: 2.231 ± 0.684
TyrArg: 2.51 ± 0.81
TyrSer: 3.904 ± 1.224
TyrThr: 2.231 ± 2.569
TyrVal: 1.394 ± 0.777
TyrTrp: 0.558 ± 0.316
TyrTyr: 1.115 ± 0.598
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3587 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
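Bootstrapping at the protein level can be sketched as resampling whole proteins with replacement and recomputing the frequency for each resample. The example below is a minimal sketch, not the database's actual procedure: the function names, the fixed seed, the use of the sample standard deviation as the error, and the reporting of the bootstrap mean are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Hypothetical helper: each replicate draws len(proteins) whole proteins
    with replacement, so the protein is the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences in one-letter code, not data from this proteome.
toy = ["LLLL", "AAAA", "LLAA"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
mean_ww, err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs in any protein yields 0.0 in every replicate, which is why such entries appear as 0.0 ± 0.0. With only a handful of proteins, as here, the resamples differ strongly from one another, so relatively large errors are to be expected.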

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski