Amino acid dipepetide frequency for Atractylodes mottle virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.944AlaAla: 5.944 ± 2.613
0.699AlaCys: 0.699 ± 0.368
2.448AlaAsp: 2.448 ± 1.133
5.944AlaGlu: 5.944 ± 3.916
3.147AlaPhe: 3.147 ± 1.79
5.245AlaGly: 5.245 ± 1.197
1.399AlaHis: 1.399 ± 0.503
6.294AlaIle: 6.294 ± 1.428
4.895AlaLys: 4.895 ± 2.576
6.643AlaLeu: 6.643 ± 3.447
1.399AlaMet: 1.399 ± 0.736
5.594AlaAsn: 5.594 ± 2.408
1.049AlaPro: 1.049 ± 0.552
2.448AlaGln: 2.448 ± 1.425
3.497AlaArg: 3.497 ± 1.267
3.497AlaSer: 3.497 ± 1.35
3.497AlaThr: 3.497 ± 0.947
4.895AlaVal: 4.895 ± 2.134
0.35AlaTrp: 0.35 ± 0.184
3.497AlaTyr: 3.497 ± 1.015
0.0AlaXaa: 0.0 ± 0.0
Cys
1.399CysAla: 1.399 ± 0.54
0.35CysCys: 0.35 ± 0.532
0.699CysAsp: 0.699 ± 1.019
2.797CysGlu: 2.797 ± 1.357
2.098CysPhe: 2.098 ± 1.104
1.399CysGly: 1.399 ± 0.904
0.699CysHis: 0.699 ± 1.019
2.797CysIle: 2.797 ± 1.501
1.049CysLys: 1.049 ± 0.552
1.748CysLeu: 1.748 ± 0.615
0.699CysMet: 0.699 ± 1.135
0.699CysAsn: 0.699 ± 0.368
1.399CysPro: 1.399 ± 0.736
0.35CysGln: 0.35 ± 0.184
2.098CysArg: 2.098 ± 0.758
1.399CysSer: 1.399 ± 0.736
1.748CysThr: 1.748 ± 0.615
3.147CysVal: 3.147 ± 2.527
0.0CysTrp: 0.0 ± 0.0
1.049CysTyr: 1.049 ± 1.76
0.0CysXaa: 0.0 ± 0.0
Asp
3.846AspAla: 3.846 ± 1.584
0.699AspCys: 0.699 ± 0.368
2.098AspAsp: 2.098 ± 0.625
4.196AspGlu: 4.196 ± 1.776
2.098AspPhe: 2.098 ± 1.297
2.797AspGly: 2.797 ± 1.103
1.049AspHis: 1.049 ± 0.942
2.448AspIle: 2.448 ± 1.288
0.699AspLys: 0.699 ± 0.368
5.944AspLeu: 5.944 ± 1.539
1.049AspMet: 1.049 ± 0.523
2.448AspAsn: 2.448 ± 1.449
3.147AspPro: 3.147 ± 1.673
1.049AspGln: 1.049 ± 0.552
2.098AspArg: 2.098 ± 0.756
1.049AspSer: 1.049 ± 0.442
1.049AspThr: 1.049 ± 0.523
3.497AspVal: 3.497 ± 0.982
1.748AspTrp: 1.748 ± 0.615
2.448AspTyr: 2.448 ± 0.867
0.0AspXaa: 0.0 ± 0.0
Glu
7.692GluAla: 7.692 ± 3.492
1.049GluCys: 1.049 ± 0.442
1.748GluAsp: 1.748 ± 0.615
5.245GluGlu: 5.245 ± 1.665
2.448GluPhe: 2.448 ± 2.336
3.147GluGly: 3.147 ± 1.348
2.098GluHis: 2.098 ± 1.104
2.098GluIle: 2.098 ± 0.625
3.497GluLys: 3.497 ± 1.84
6.643GluLeu: 6.643 ± 1.961
1.399GluMet: 1.399 ± 1.138
3.497GluAsn: 3.497 ± 2.844
3.497GluPro: 3.497 ± 1.231
3.147GluGln: 3.147 ± 1.248
5.594GluArg: 5.594 ± 1.733
4.196GluSer: 4.196 ± 1.326
1.049GluThr: 1.049 ± 0.523
5.944GluVal: 5.944 ± 1.132
1.049GluTrp: 1.049 ± 0.523
1.748GluTyr: 1.748 ± 0.645
0.0GluXaa: 0.0 ± 0.0
Phe
4.196PheAla: 4.196 ± 0.867
1.399PheCys: 1.399 ± 0.736
3.147PheAsp: 3.147 ± 1.57
5.594PheGlu: 5.594 ± 2.013
1.049PhePhe: 1.049 ± 0.819
2.448PheGly: 2.448 ± 0.929
1.049PheHis: 1.049 ± 0.819
1.748PheIle: 1.748 ± 0.877
1.748PheLys: 1.748 ± 0.92
4.545PheLeu: 4.545 ± 1.8
0.699PheMet: 0.699 ± 0.837
3.497PheAsn: 3.497 ± 0.982
0.699PhePro: 0.699 ± 0.368
1.748PheGln: 1.748 ± 1.154
2.098PheArg: 2.098 ± 1.104
3.147PheSer: 3.147 ± 0.963
4.895PheThr: 4.895 ± 1.17
3.147PheVal: 3.147 ± 2.632
0.699PheTrp: 0.699 ± 0.368
2.448PheTyr: 2.448 ± 1.008
0.0PheXaa: 0.0 ± 0.0
Gly
3.147GlyAla: 3.147 ± 2.418
2.098GlyCys: 2.098 ± 0.927
4.545GlyAsp: 4.545 ± 0.846
3.846GlyGlu: 3.846 ± 1.632
2.448GlyPhe: 2.448 ± 1.186
5.594GlyGly: 5.594 ± 1.671
1.399GlyHis: 1.399 ± 0.54
4.895GlyIle: 4.895 ± 1.136
5.944GlyLys: 5.944 ± 2.294
3.147GlyLeu: 3.147 ± 1.798
0.699GlyMet: 0.699 ± 0.368
1.049GlyAsn: 1.049 ± 0.552
1.748GlyPro: 1.748 ± 0.645
2.448GlyGln: 2.448 ± 1.853
6.643GlyArg: 6.643 ± 1.738
3.846GlySer: 3.846 ± 1.332
3.497GlyThr: 3.497 ± 1.425
2.448GlyVal: 2.448 ± 0.808
1.049GlyTrp: 1.049 ± 0.552
1.399GlyTyr: 1.399 ± 0.938
0.0GlyXaa: 0.0 ± 0.0
His
1.748HisAla: 1.748 ± 0.92
1.049HisCys: 1.049 ± 0.442
1.049HisAsp: 1.049 ± 0.552
1.748HisGlu: 1.748 ± 0.615
1.049HisPhe: 1.049 ± 0.552
2.098HisGly: 2.098 ± 0.758
1.399HisHis: 1.399 ± 1.527
1.399HisIle: 1.399 ± 0.54
2.098HisLys: 2.098 ± 0.927
3.846HisLeu: 3.846 ± 0.909
0.699HisMet: 0.699 ± 0.569
1.399HisAsn: 1.399 ± 0.503
0.699HisPro: 0.699 ± 1.019
0.35HisGln: 0.35 ± 0.184
1.049HisArg: 1.049 ± 0.962
3.147HisSer: 3.147 ± 1.743
1.049HisThr: 1.049 ± 0.552
2.448HisVal: 2.448 ± 0.902
0.0HisTrp: 0.0 ± 0.0
1.399HisTyr: 1.399 ± 0.736
0.0HisXaa: 0.0 ± 0.0
Ile
3.497IleAla: 3.497 ± 1.229
2.098IleCys: 2.098 ± 1.006
2.098IleAsp: 2.098 ± 1.104
3.497IleGlu: 3.497 ± 0.982
2.797IlePhe: 2.797 ± 1.808
3.497IleGly: 3.497 ± 2.318
1.049IleHis: 1.049 ± 0.523
3.846IleIle: 3.846 ± 2.072
3.846IleLys: 3.846 ± 0.934
4.895IleLeu: 4.895 ± 1.532
1.748IleMet: 1.748 ± 0.92
2.797IleAsn: 2.797 ± 1.618
2.448IlePro: 2.448 ± 0.879
2.098IleGln: 2.098 ± 1.052
2.448IleArg: 2.448 ± 0.657
3.846IleSer: 3.846 ± 1.447
3.497IleThr: 3.497 ± 0.992
3.497IleVal: 3.497 ± 1.894
0.699IleTrp: 0.699 ± 1.012
2.448IleTyr: 2.448 ± 1.407
0.0IleXaa: 0.0 ± 0.0
Lys
2.797LysAla: 2.797 ± 1.02
1.748LysCys: 1.748 ± 0.95
1.399LysAsp: 1.399 ± 0.736
2.797LysGlu: 2.797 ± 1.078
3.147LysPhe: 3.147 ± 1.656
3.497LysGly: 3.497 ± 1.84
2.448LysHis: 2.448 ± 0.867
1.049LysIle: 1.049 ± 0.552
4.545LysLys: 4.545 ± 1.877
6.294LysLeu: 6.294 ± 2.187
0.699LysMet: 0.699 ± 1.019
3.497LysAsn: 3.497 ± 1.84
4.196LysPro: 4.196 ± 1.557
0.699LysGln: 0.699 ± 0.453
2.448LysArg: 2.448 ± 0.808
3.846LysSer: 3.846 ± 1.144
3.147LysThr: 3.147 ± 0.962
4.545LysVal: 4.545 ± 1.528
0.699LysTrp: 0.699 ± 0.368
1.399LysTyr: 1.399 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
7.343LeuAla: 7.343 ± 2.876
3.147LeuCys: 3.147 ± 1.248
4.196LeuAsp: 4.196 ± 1.512
7.692LeuGlu: 7.692 ± 2.186
4.196LeuPhe: 4.196 ± 1.776
8.042LeuGly: 8.042 ± 2.301
2.098LeuHis: 2.098 ± 0.927
4.895LeuIle: 4.895 ± 0.994
5.594LeuLys: 5.594 ± 2.944
9.091LeuLeu: 9.091 ± 1.98
1.049LeuMet: 1.049 ± 1.189
4.895LeuAsn: 4.895 ± 1.462
5.944LeuPro: 5.944 ± 2.073
4.196LeuGln: 4.196 ± 2.158
7.343LeuArg: 7.343 ± 2.727
5.944LeuSer: 5.944 ± 2.069
3.846LeuThr: 3.846 ± 1.43
6.294LeuVal: 6.294 ± 2.745
0.699LeuTrp: 0.699 ± 1.278
2.098LeuTyr: 2.098 ± 0.625
0.0LeuXaa: 0.0 ± 0.0
Met
2.797MetAla: 2.797 ± 1.02
0.699MetCys: 0.699 ± 0.368
1.399MetAsp: 1.399 ± 0.503
0.699MetGlu: 0.699 ± 0.569
0.699MetPhe: 0.699 ± 0.368
1.399MetGly: 1.399 ± 0.736
0.35MetHis: 0.35 ± 0.664
1.748MetIle: 1.748 ± 1.078
0.699MetLys: 0.699 ± 0.368
2.797MetLeu: 2.797 ± 1.423
0.699MetMet: 0.699 ± 0.346
0.699MetAsn: 0.699 ± 1.019
1.748MetPro: 1.748 ± 0.615
1.748MetGln: 1.748 ± 0.903
1.748MetArg: 1.748 ± 1.154
0.35MetSer: 0.35 ± 0.184
0.35MetThr: 0.35 ± 0.184
1.049MetVal: 1.049 ± 0.552
0.0MetTrp: 0.0 ± 0.0
0.35MetTyr: 0.35 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
2.098AsnAla: 2.098 ± 3.21
1.748AsnCys: 1.748 ± 0.92
2.098AsnAsp: 2.098 ± 0.729
3.147AsnGlu: 3.147 ± 2.209
3.497AsnPhe: 3.497 ± 0.986
1.748AsnGly: 1.748 ± 0.92
2.448AsnHis: 2.448 ± 0.867
2.098AsnIle: 2.098 ± 2.841
2.797AsnLys: 2.797 ± 1.02
5.944AsnLeu: 5.944 ± 2.12
0.699AsnMet: 0.699 ± 0.569
2.098AsnAsn: 2.098 ± 2.446
1.399AsnPro: 1.399 ± 1.303
0.35AsnGln: 0.35 ± 0.184
2.797AsnArg: 2.797 ± 0.812
1.748AsnSer: 1.748 ± 1.749
2.448AsnThr: 2.448 ± 0.867
4.196AsnVal: 4.196 ± 1.012
0.699AsnTrp: 0.699 ± 0.368
1.399AsnTyr: 1.399 ± 0.938
0.0AsnXaa: 0.0 ± 0.0
Pro
4.196ProAla: 4.196 ± 2.837
1.399ProCys: 1.399 ± 1.154
4.895ProAsp: 4.895 ± 0.975
5.245ProGlu: 5.245 ± 1.599
2.098ProPhe: 2.098 ± 0.887
2.797ProGly: 2.797 ± 1.842
2.098ProHis: 2.098 ± 3.709
2.448ProIle: 2.448 ± 1.601
2.448ProLys: 2.448 ± 1.048
4.196ProLeu: 4.196 ± 1.72
0.35ProMet: 0.35 ± 0.184
1.748ProAsn: 1.748 ± 0.877
4.895ProPro: 4.895 ± 4.708
2.098ProGln: 2.098 ± 0.729
2.448ProArg: 2.448 ± 1.048
0.699ProSer: 0.699 ± 0.453
3.497ProThr: 3.497 ± 2.146
2.448ProVal: 2.448 ± 1.048
0.699ProTrp: 0.699 ± 1.012
1.399ProTyr: 1.399 ± 1.154
0.0ProXaa: 0.0 ± 0.0
Gln
2.797GlnAla: 2.797 ± 1.078
0.699GlnCys: 0.699 ± 1.065
2.098GlnAsp: 2.098 ± 0.758
2.098GlnGlu: 2.098 ± 1.104
1.399GlnPhe: 1.399 ± 0.503
2.448GlnGly: 2.448 ± 1.874
1.399GlnHis: 1.399 ± 1.015
2.448GlnIle: 2.448 ± 1.048
1.049GlnLys: 1.049 ± 0.552
4.196GlnLeu: 4.196 ± 1.747
1.399GlnMet: 1.399 ± 1.752
0.0GlnAsn: 0.0 ± 0.0
3.147GlnPro: 3.147 ± 1.794
2.448GlnGln: 2.448 ± 1.008
1.748GlnArg: 1.748 ± 1.078
1.748GlnSer: 1.748 ± 0.92
2.098GlnThr: 2.098 ± 0.756
1.748GlnVal: 1.748 ± 1.655
0.0GlnTrp: 0.0 ± 0.0
0.699GlnTyr: 0.699 ± 0.368
0.0GlnXaa: 0.0 ± 0.0
Arg
4.895ArgAla: 4.895 ± 1.862
2.098ArgCys: 2.098 ± 3.724
1.748ArgAsp: 1.748 ± 2.432
3.497ArgGlu: 3.497 ± 0.982
5.944ArgPhe: 5.944 ± 1.709
2.797ArgGly: 2.797 ± 1.02
2.098ArgHis: 2.098 ± 0.625
2.448ArgIle: 2.448 ± 1.889
3.846ArgLys: 3.846 ± 1.043
5.245ArgLeu: 5.245 ± 2.064
2.797ArgMet: 2.797 ± 1.06
1.049ArgAsn: 1.049 ± 0.523
3.846ArgPro: 3.846 ± 3.587
1.748ArgGln: 1.748 ± 1.306
2.797ArgArg: 2.797 ± 1.842
3.846ArgSer: 3.846 ± 1.203
1.748ArgThr: 1.748 ± 1.749
3.497ArgVal: 3.497 ± 1.03
1.049ArgTrp: 1.049 ± 0.552
2.098ArgTyr: 2.098 ± 1.104
0.0ArgXaa: 0.0 ± 0.0
Ser
4.196SerAla: 4.196 ± 1.139
1.399SerCys: 1.399 ± 2.5
3.846SerAsp: 3.846 ± 1.166
1.399SerGlu: 1.399 ± 0.736
1.399SerPhe: 1.399 ± 0.54
3.846SerGly: 3.846 ± 2.024
1.399SerHis: 1.399 ± 0.736
3.497SerIle: 3.497 ± 1.134
4.895SerLys: 4.895 ± 1.394
5.944SerLeu: 5.944 ± 1.212
0.699SerMet: 0.699 ± 0.368
2.098SerAsn: 2.098 ± 1.047
3.147SerPro: 3.147 ± 1.04
3.147SerGln: 3.147 ± 1.293
2.797SerArg: 2.797 ± 2.676
4.895SerSer: 4.895 ± 2.538
5.944SerThr: 5.944 ± 1.759
3.846SerVal: 3.846 ± 1.394
0.0SerTrp: 0.0 ± 0.0
1.748SerTyr: 1.748 ± 0.95
0.0SerXaa: 0.0 ± 0.0
Thr
3.147ThrAla: 3.147 ± 0.848
2.098ThrCys: 2.098 ± 1.632
1.399ThrAsp: 1.399 ± 0.736
2.098ThrGlu: 2.098 ± 1.977
5.245ThrPhe: 5.245 ± 1.519
2.448ThrGly: 2.448 ± 1.655
1.748ThrHis: 1.748 ± 0.92
3.497ThrIle: 3.497 ± 1.566
1.399ThrLys: 1.399 ± 0.904
7.343ThrLeu: 7.343 ± 1.242
1.748ThrMet: 1.748 ± 0.763
2.098ThrAsn: 2.098 ± 1.707
1.399ThrPro: 1.399 ± 2.118
2.098ThrGln: 2.098 ± 0.939
2.797ThrArg: 2.797 ± 1.716
2.797ThrSer: 2.797 ± 0.736
0.0ThrThr: 0.0 ± 0.0
2.448ThrVal: 2.448 ± 0.912
0.0ThrTrp: 0.0 ± 0.0
4.196ThrTyr: 4.196 ± 2.105
0.0ThrXaa: 0.0 ± 0.0
Val
3.147ValAla: 3.147 ± 1.335
1.748ValCys: 1.748 ± 0.615
2.448ValAsp: 2.448 ± 0.867
3.497ValGlu: 3.497 ± 1.35
3.147ValPhe: 3.147 ± 1.109
4.545ValGly: 4.545 ± 1.667
2.098ValHis: 2.098 ± 1.104
4.895ValIle: 4.895 ± 2.737
1.748ValLys: 1.748 ± 1.973
5.594ValLeu: 5.594 ± 3.622
2.098ValMet: 2.098 ± 1.104
3.846ValAsn: 3.846 ± 0.989
5.594ValPro: 5.594 ± 1.446
1.748ValGln: 1.748 ± 0.92
3.497ValArg: 3.497 ± 1.61
6.993ValSer: 6.993 ± 1.401
3.497ValThr: 3.497 ± 1.03
4.196ValVal: 4.196 ± 3.921
0.35ValTrp: 0.35 ± 0.664
1.748ValTyr: 1.748 ± 1.835
0.0ValXaa: 0.0 ± 0.0
Trp
1.049TrpAla: 1.049 ± 1.112
0.699TrpCys: 0.699 ± 0.368
0.0TrpAsp: 0.0 ± 0.0
0.699TrpGlu: 0.699 ± 1.262
0.699TrpPhe: 0.699 ± 0.368
0.35TrpGly: 0.35 ± 1.103
1.049TrpHis: 1.049 ± 0.552
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.748TrpLeu: 1.748 ± 0.92
0.35TrpMet: 0.35 ± 0.184
0.699TrpAsn: 0.699 ± 0.569
0.35TrpPro: 0.35 ± 0.184
0.0TrpGln: 0.0 ± 0.0
0.35TrpArg: 0.35 ± 0.664
0.35TrpSer: 0.35 ± 0.184
0.0TrpThr: 0.0 ± 0.0
1.049TrpVal: 1.049 ± 0.552
0.0TrpTrp: 0.0 ± 0.0
0.699TrpTyr: 0.699 ± 0.368
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.448TyrAla: 2.448 ± 0.808
0.699TyrCys: 0.699 ± 0.368
2.098TyrAsp: 2.098 ± 0.729
1.049TyrGlu: 1.049 ± 0.442
1.399TyrPhe: 1.399 ± 0.54
1.748TyrGly: 1.748 ± 0.986
0.35TyrHis: 0.35 ± 0.184
2.098TyrIle: 2.098 ± 0.997
2.098TyrLys: 2.098 ± 0.729
2.797TyrLeu: 2.797 ± 1.103
0.699TyrMet: 0.699 ± 0.368
1.748TyrAsn: 1.748 ± 1.973
1.748TyrPro: 1.748 ± 0.95
1.748TyrGln: 1.748 ± 0.986
2.797TyrArg: 2.797 ± 1.842
2.797TyrSer: 2.797 ± 1.772
3.497TyrThr: 3.497 ± 1.096
2.098TyrVal: 2.098 ± 1.76
0.35TyrTrp: 0.35 ± 0.184
0.699TyrTyr: 0.699 ± 0.453
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2861 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski