Amino acid dipeptide frequency for Oyster mushroom spherical virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
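As an illustration of what such a calculation involves, here is a minimal Python sketch. It assumes one-letter sequence codes, overlapping residue pairs that never span two proteins, and normalisation by the total number of pairs; the page does not state the exact counting convention, so these choices and the function name `dipeptide_permille` are assumptions, not the database's own code.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so that no pair is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Convert raw counts to per mille of all observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real viral sequences), just to show the call.
freqs = dipeptide_permille(["MALLA", "ALG"])
```

In this toy input there are six pairs in total, two of which are "AL", so `freqs["AL"]` is one third of 1000.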

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
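The unit conversion and the comparison with the uniform expectation can be sketched as follows. The sketch assumes that "more than expected" simply means above 1000/400 = 2.5‰; the page does not give the exact colouring rule, and the names `EXPECTED_PERMILLE`, `permille_to_fraction` and `classify` are hypothetical. The two example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"      # would be shown in red
    if value_permille < expected:
        return "under"     # would be shown in blue
    return "expected"

# Two values copied from the table.
examples = {"LeuLeu": 12.025, "AlaCys": 0.776}
labels = {pair: classify(v) for pair, v in examples.items()}
```

Under this assumed rule, LeuLeu is labelled as overrepresented and AlaCys as underrepresented.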

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.267 ± 1.33
AlaCys: 0.776 ± 0.395
AlaAsp: 4.267 ± 1.614
AlaGlu: 1.939 ± 0.744
AlaPhe: 5.043 ± 1.559
AlaGly: 3.879 ± 0.759
AlaHis: 1.552 ± 0.584
AlaIle: 2.327 ± 0.804
AlaLys: 1.939 ± 0.987
AlaLeu: 10.085 ± 2.47
AlaMet: 1.939 ± 1.652
AlaAsn: 2.327 ± 0.632
AlaPro: 7.37 ± 1.514
AlaGln: 5.431 ± 1.064
AlaArg: 6.594 ± 0.867
AlaSer: 8.146 ± 3.178
AlaThr: 3.879 ± 1.331
AlaVal: 3.491 ± 1.038
AlaTrp: 0.776 ± 0.395
AlaTyr: 1.552 ± 0.789
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.715 ± 1.69
CysCys: 0.0 ± 0.0
CysAsp: 0.388 ± 0.197
CysGlu: 0.388 ± 0.564
CysPhe: 1.164 ± 1.477
CysGly: 1.939 ± 0.702
CysHis: 0.776 ± 0.523
CysIle: 0.388 ± 0.197
CysLys: 0.388 ± 0.197
CysLeu: 1.939 ± 1.956
CysMet: 0.0 ± 0.0
CysAsn: 0.388 ± 0.197
CysPro: 2.715 ± 2.023
CysGln: 0.0 ± 0.0
CysArg: 0.776 ± 0.523
CysSer: 1.552 ± 1.667
CysThr: 0.776 ± 0.523
CysVal: 1.939 ± 0.733
CysTrp: 0.388 ± 0.649
CysTyr: 0.388 ± 0.197
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.267 ± 1.741
AspCys: 0.776 ± 0.395
AspAsp: 1.552 ± 0.789
AspGlu: 1.939 ± 0.987
AspPhe: 2.327 ± 1.184
AspGly: 3.879 ± 2.053
AspHis: 1.164 ± 0.592
AspIle: 1.939 ± 0.987
AspLys: 0.776 ± 0.395
AspLeu: 5.431 ± 1.601
AspMet: 1.552 ± 0.608
AspAsn: 1.552 ± 0.634
AspPro: 2.327 ± 0.924
AspGln: 1.552 ± 0.789
AspArg: 2.715 ± 1.067
AspSer: 2.327 ± 0.796
AspThr: 1.164 ± 0.85
AspVal: 2.327 ± 0.885
AspTrp: 0.776 ± 0.395
AspTyr: 0.388 ± 0.197
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.939 ± 0.987
GluCys: 0.388 ± 0.564
GluAsp: 0.776 ± 0.395
GluGlu: 1.552 ± 0.789
GluPhe: 1.939 ± 0.987
GluGly: 1.164 ± 0.529
GluHis: 1.552 ± 0.679
GluIle: 2.715 ± 1.052
GluLys: 1.164 ± 0.592
GluLeu: 4.267 ± 1.324
GluMet: 0.776 ± 0.395
GluAsn: 1.552 ± 0.789
GluPro: 1.552 ± 1.172
GluGln: 1.552 ± 0.721
GluArg: 2.327 ± 0.81
GluSer: 2.715 ± 0.741
GluThr: 0.388 ± 0.197
GluVal: 1.939 ± 0.987
GluTrp: 1.552 ± 0.789
GluTyr: 0.388 ± 0.197
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.327 ± 0.804
PheCys: 0.776 ± 0.582
PheAsp: 2.327 ± 0.885
PheGlu: 1.164 ± 0.529
PhePhe: 1.939 ± 0.702
PheGly: 3.491 ± 1.039
PheHis: 1.164 ± 0.592
PheIle: 1.552 ± 0.814
PheLys: 0.388 ± 0.197
PheLeu: 6.982 ± 2.853
PheMet: 0.776 ± 0.494
PheAsn: 2.715 ± 0.857
PhePro: 5.818 ± 2.542
PheGln: 3.103 ± 1.609
PheArg: 3.879 ± 2.216
PheSer: 5.043 ± 2.658
PheThr: 3.103 ± 1.216
PheVal: 3.103 ± 1.781
PheTrp: 0.388 ± 0.564
PheTyr: 1.164 ± 0.87
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.655 ± 1.423
GlyCys: 2.327 ± 1.161
GlyAsp: 3.491 ± 1.41
GlyGlu: 1.939 ± 0.744
GlyPhe: 3.103 ± 1.298
GlyGly: 6.982 ± 3.601
GlyHis: 1.939 ± 0.701
GlyIle: 3.103 ± 0.905
GlyLys: 1.939 ± 0.987
GlyLeu: 5.043 ± 1.686
GlyMet: 0.776 ± 0.523
GlyAsn: 3.879 ± 1.04
GlyPro: 3.491 ± 0.987
GlyGln: 2.715 ± 1.052
GlyArg: 4.267 ± 1.022
GlySer: 6.206 ± 1.901
GlyThr: 6.594 ± 2.398
GlyVal: 3.879 ± 2.053
GlyTrp: 1.164 ± 0.592
GlyTyr: 1.164 ± 0.529
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.327 ± 0.899
HisCys: 0.0 ± 0.0
HisAsp: 0.388 ± 0.197
HisGlu: 1.164 ± 0.592
HisPhe: 0.388 ± 0.197
HisGly: 1.164 ± 0.592
HisHis: 0.388 ± 0.597
HisIle: 0.0 ± 0.0
HisLys: 1.164 ± 0.592
HisLeu: 2.327 ± 2.21
HisMet: 0.0 ± 0.0
HisAsn: 0.388 ± 0.197
HisPro: 3.103 ± 1.126
HisGln: 0.776 ± 0.517
HisArg: 0.776 ± 0.395
HisSer: 1.939 ± 0.987
HisThr: 0.776 ± 0.395
HisVal: 3.103 ± 0.93
HisTrp: 1.164 ± 0.592
HisTyr: 0.776 ± 0.395
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.327 ± 1.058
IleCys: 0.776 ± 0.523
IleAsp: 1.164 ± 0.592
IleGlu: 1.164 ± 0.592
IlePhe: 1.164 ± 1.056
IleGly: 1.552 ± 0.721
IleHis: 0.388 ± 0.197
IleIle: 2.715 ± 1.122
IleLys: 1.552 ± 0.814
IleLeu: 3.879 ± 0.781
IleMet: 1.552 ± 0.601
IleAsn: 0.776 ± 0.395
IlePro: 4.655 ± 1.197
IleGln: 0.0 ± 0.0
IleArg: 3.879 ± 1.21
IleSer: 5.431 ± 2.555
IleThr: 3.491 ± 1.279
IleVal: 2.715 ± 0.954
IleTrp: 0.776 ± 1.043
IleTyr: 0.776 ± 0.395
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.879 ± 1.021
LysCys: 0.388 ± 0.597
LysAsp: 0.388 ± 0.597
LysGlu: 0.776 ± 0.395
LysPhe: 0.0 ± 0.0
LysGly: 1.552 ± 0.814
LysHis: 0.388 ± 0.197
LysIle: 1.552 ± 0.789
LysLys: 0.388 ± 0.197
LysLeu: 3.491 ± 1.776
LysMet: 0.388 ± 0.514
LysAsn: 1.552 ± 0.615
LysPro: 2.715 ± 1.143
LysGln: 0.388 ± 0.197
LysArg: 1.552 ± 0.789
LysSer: 3.879 ± 1.219
LysThr: 0.776 ± 0.51
LysVal: 0.776 ± 0.395
LysTrp: 0.776 ± 0.51
LysTyr: 1.164 ± 0.592
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.534 ± 1.633
LeuCys: 3.491 ± 2.345
LeuAsp: 2.715 ± 1.013
LeuGlu: 2.327 ± 0.796
LeuPhe: 6.594 ± 0.523
LeuGly: 8.534 ± 1.637
LeuHis: 2.715 ± 0.825
LeuIle: 2.327 ± 0.701
LeuLys: 3.103 ± 1.229
LeuLeu: 12.025 ± 2.074
LeuMet: 2.715 ± 2.215
LeuAsn: 7.37 ± 0.977
LeuPro: 11.249 ± 2.573
LeuGln: 1.552 ± 1.046
LeuArg: 8.534 ± 3.559
LeuSer: 11.637 ± 4.364
LeuThr: 8.534 ± 2.573
LeuVal: 3.879 ± 1.427
LeuTrp: 1.552 ± 0.595
LeuTyr: 1.939 ± 0.987
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.552 ± 0.771
MetCys: 0.388 ± 0.577
MetAsp: 1.164 ± 0.518
MetGlu: 0.776 ± 0.523
MetPhe: 1.552 ± 0.584
MetGly: 0.776 ± 0.395
MetHis: 0.776 ± 0.517
MetIle: 1.552 ± 1.347
MetLys: 1.164 ± 0.577
MetLeu: 2.715 ± 0.844
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.939 ± 1.229
MetGln: 0.0 ± 0.0
MetArg: 2.715 ± 1.736
MetSer: 0.776 ± 0.517
MetThr: 2.327 ± 2.149
MetVal: 1.164 ± 0.85
MetTrp: 0.388 ± 0.649
MetTyr: 0.388 ± 0.197
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.879 ± 1.074
AsnCys: 0.388 ± 0.564
AsnAsp: 3.879 ± 1.973
AsnGlu: 1.552 ± 0.789
AsnPhe: 1.164 ± 0.577
AsnGly: 3.879 ± 0.894
AsnHis: 0.776 ± 0.395
AsnIle: 1.939 ± 0.987
AsnLys: 1.552 ± 0.615
AsnLeu: 4.655 ± 0.984
AsnMet: 0.388 ± 0.197
AsnAsn: 1.164 ± 0.592
AsnPro: 1.164 ± 0.592
AsnGln: 0.388 ± 0.197
AsnArg: 1.939 ± 0.744
AsnSer: 1.552 ± 0.709
AsnThr: 2.327 ± 0.631
AsnVal: 3.491 ± 0.892
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.388 ± 0.197
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.534 ± 1.468
ProCys: 0.776 ± 1.194
ProAsp: 4.267 ± 1.784
ProGlu: 3.879 ± 1.185
ProPhe: 2.715 ± 1.061
ProGly: 6.206 ± 2.008
ProHis: 1.164 ± 0.685
ProIle: 3.103 ± 1.933
ProLys: 1.164 ± 0.908
ProLeu: 9.697 ± 5.315
ProMet: 2.327 ± 1.764
ProAsn: 3.879 ± 1.154
ProPro: 9.697 ± 1.221
ProGln: 1.552 ± 0.709
ProArg: 3.103 ± 0.678
ProSer: 10.085 ± 4.874
ProThr: 6.206 ± 2.13
ProVal: 5.431 ± 1.808
ProTrp: 1.164 ± 0.592
ProTyr: 3.103 ± 1.579
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.715 ± 0.984
GlnCys: 0.0 ± 0.0
GlnAsp: 0.776 ± 0.395
GlnGlu: 0.776 ± 0.395
GlnPhe: 1.552 ± 0.814
GlnGly: 1.939 ± 0.987
GlnHis: 0.776 ± 0.395
GlnIle: 0.776 ± 0.395
GlnLys: 1.164 ± 0.529
GlnLeu: 2.327 ± 0.849
GlnMet: 1.552 ± 0.634
GlnAsn: 0.776 ± 0.51
GlnPro: 1.164 ± 0.577
GlnGln: 0.0 ± 0.0
GlnArg: 3.491 ± 0.987
GlnSer: 2.715 ± 0.702
GlnThr: 1.552 ± 1.357
GlnVal: 3.103 ± 1.395
GlnTrp: 0.776 ± 0.395
GlnTyr: 0.388 ± 0.197
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.431 ± 1.458
ArgCys: 1.552 ± 0.864
ArgAsp: 2.715 ± 0.702
ArgGlu: 3.103 ± 1.225
ArgPhe: 5.043 ± 1.133
ArgGly: 4.655 ± 1.638
ArgHis: 2.327 ± 1.184
ArgIle: 2.327 ± 0.632
ArgLys: 3.103 ± 1.229
ArgLeu: 6.982 ± 1.51
ArgMet: 2.715 ± 1.72
ArgAsn: 1.552 ± 1.019
ArgPro: 3.879 ± 0.857
ArgGln: 1.164 ± 0.685
ArgArg: 5.818 ± 1.705
ArgSer: 5.818 ± 2.53
ArgThr: 5.043 ± 1.749
ArgVal: 5.431 ± 0.691
ArgTrp: 1.552 ± 1.332
ArgTyr: 3.491 ± 1.02
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.37 ± 3.224
SerCys: 2.715 ± 0.854
SerAsp: 3.103 ± 1.132
SerGlu: 1.552 ± 0.608
SerPhe: 5.431 ± 2.04
SerGly: 5.818 ± 1.557
SerHis: 1.939 ± 0.661
SerIle: 3.879 ± 1.715
SerLys: 2.715 ± 1.086
SerLeu: 9.31 ± 2.87
SerMet: 1.939 ± 1.202
SerAsn: 1.939 ± 0.701
SerPro: 9.31 ± 3.192
SerGln: 2.715 ± 1.896
SerArg: 8.534 ± 2.971
SerSer: 8.922 ± 2.015
SerThr: 7.758 ± 4.6
SerVal: 5.043 ± 1.426
SerTrp: 0.776 ± 0.395
SerTyr: 1.939 ± 1.022
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.043 ± 1.784
ThrCys: 1.552 ± 1.658
ThrAsp: 1.552 ± 0.637
ThrGlu: 1.164 ± 0.592
ThrPhe: 3.491 ± 1.612
ThrGly: 4.655 ± 1.978
ThrHis: 0.388 ± 0.197
ThrIle: 3.103 ± 1.015
ThrLys: 1.939 ± 0.661
ThrLeu: 7.37 ± 3.033
ThrMet: 0.388 ± 0.649
ThrAsn: 1.939 ± 1.135
ThrPro: 5.818 ± 1.731
ThrGln: 1.552 ± 0.789
ThrArg: 3.879 ± 1.049
ThrSer: 6.982 ± 3.765
ThrThr: 4.267 ± 2.308
ThrVal: 6.206 ± 1.649
ThrTrp: 0.776 ± 0.395
ThrTyr: 2.715 ± 1.242
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.103 ± 1.205
ValCys: 1.164 ± 1.095
ValAsp: 2.715 ± 1.067
ValGlu: 3.491 ± 0.908
ValPhe: 3.879 ± 1.691
ValGly: 3.491 ± 1.333
ValHis: 1.164 ± 0.518
ValIle: 3.879 ± 1.275
ValLys: 0.388 ± 0.564
ValLeu: 6.982 ± 1.842
ValMet: 1.164 ± 0.685
ValAsn: 1.939 ± 0.733
ValPro: 8.534 ± 2.325
ValGln: 2.327 ± 1.184
ValArg: 5.818 ± 1.432
ValSer: 5.043 ± 1.786
ValThr: 2.327 ± 0.885
ValVal: 5.043 ± 0.865
ValTrp: 0.388 ± 0.564
ValTyr: 0.776 ± 0.517
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.939 ± 0.757
TrpCys: 0.388 ± 0.197
TrpAsp: 1.164 ± 0.953
TrpGlu: 0.776 ± 0.395
TrpPhe: 1.164 ± 1.217
TrpGly: 0.776 ± 0.395
TrpHis: 0.0 ± 0.0
TrpIle: 0.776 ± 0.395
TrpLys: 0.776 ± 0.523
TrpLeu: 2.327 ± 0.849
TrpMet: 0.388 ± 0.197
TrpAsn: 0.0 ± 0.0
TrpPro: 0.776 ± 0.959
TrpGln: 0.388 ± 0.197
TrpArg: 1.164 ± 0.529
TrpSer: 1.164 ± 0.592
TrpThr: 0.776 ± 0.582
TrpVal: 0.776 ± 0.395
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.388 ± 0.197
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.776 ± 0.395
TyrCys: 0.388 ± 0.649
TyrAsp: 1.939 ± 0.987
TyrGlu: 0.776 ± 0.523
TyrPhe: 1.552 ± 0.608
TyrGly: 2.327 ± 0.607
TyrHis: 0.776 ± 0.395
TyrIle: 0.776 ± 0.517
TyrLys: 0.0 ± 0.0
TyrLeu: 3.879 ± 1.973
TyrMet: 0.388 ± 0.197
TyrAsn: 0.776 ± 0.395
TyrPro: 0.776 ± 0.674
TyrGln: 1.164 ± 0.592
TyrArg: 1.939 ± 0.702
TyrSer: 0.776 ± 0.523
TyrThr: 2.715 ± 1.067
TyrVal: 0.776 ± 0.395
TyrTrp: 0.776 ± 0.395
TyrTyr: 0.776 ± 0.395
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2579 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
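One plausible reading of this note is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported as the error. The use of the population standard deviation, the fixed seed, and the names `pair_permille` and `bootstrap_error` are assumptions made for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Toy sequences (not the viral proteome), using one-letter codes.
toy = ["MALLA", "ALG", "GGLLK"]
err_present = bootstrap_error(toy, "LL")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs in any protein has a frequency of zero in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.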

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski