Amino acid dipepetide frequency for Cassava mosaic Madagascar virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.127AlaAla: 4.127 ± 1.711
1.179AlaCys: 1.179 ± 0.604
0.59AlaAsp: 0.59 ± 0.571
1.769AlaGlu: 1.769 ± 0.867
1.179AlaPhe: 1.179 ± 0.993
0.0AlaGly: 0.0 ± 0.0
0.0AlaHis: 0.0 ± 0.0
1.179AlaIle: 1.179 ± 0.621
2.948AlaLys: 2.948 ± 1.054
3.538AlaLeu: 3.538 ± 1.865
0.0AlaMet: 0.0 ± 0.0
1.769AlaAsn: 1.769 ± 1.359
2.948AlaPro: 2.948 ± 1.014
3.538AlaGln: 3.538 ± 1.067
4.717AlaArg: 4.717 ± 1.718
5.307AlaSer: 5.307 ± 1.146
4.717AlaThr: 4.717 ± 1.925
1.769AlaVal: 1.769 ± 0.817
1.179AlaTrp: 1.179 ± 1.011
0.59AlaTyr: 0.59 ± 0.547
0.0AlaXaa: 0.0 ± 0.0
Cys
0.59CysAla: 0.59 ± 0.505
1.769CysCys: 1.769 ± 1.01
0.59CysAsp: 0.59 ± 0.547
0.59CysGlu: 0.59 ± 0.571
0.59CysPhe: 0.59 ± 0.617
1.179CysGly: 1.179 ± 0.699
1.179CysHis: 1.179 ± 0.671
2.948CysIle: 2.948 ± 1.715
0.59CysLys: 0.59 ± 0.571
1.769CysLeu: 1.769 ± 0.884
1.769CysMet: 1.769 ± 0.726
1.179CysAsn: 1.179 ± 0.549
1.179CysPro: 1.179 ± 0.907
0.59CysGln: 0.59 ± 0.505
1.769CysArg: 1.769 ± 1.035
2.948CysSer: 2.948 ± 1.711
0.59CysThr: 0.59 ± 0.505
0.59CysVal: 0.59 ± 0.571
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.538AspAla: 3.538 ± 1.861
0.59AspCys: 0.59 ± 0.589
2.948AspAsp: 2.948 ± 0.91
2.358AspGlu: 2.358 ± 0.94
1.769AspPhe: 1.769 ± 1.085
2.358AspGly: 2.358 ± 1.208
2.948AspHis: 2.948 ± 1.759
1.769AspIle: 1.769 ± 0.71
1.769AspLys: 1.769 ± 0.644
7.665AspLeu: 7.665 ± 1.985
0.0AspMet: 0.0 ± 0.0
4.127AspAsn: 4.127 ± 1.584
2.358AspPro: 2.358 ± 1.016
0.59AspGln: 0.59 ± 0.505
1.769AspArg: 1.769 ± 1.137
5.307AspSer: 5.307 ± 1.383
1.769AspThr: 1.769 ± 0.695
5.896AspVal: 5.896 ± 1.823
1.179AspTrp: 1.179 ± 0.699
1.179AspTyr: 1.179 ± 0.792
0.0AspXaa: 0.0 ± 0.0
Glu
4.127GluAla: 4.127 ± 1.657
0.59GluCys: 0.59 ± 0.497
2.948GluAsp: 2.948 ± 1.208
0.59GluGlu: 0.59 ± 0.505
2.358GluPhe: 2.358 ± 1.573
4.127GluGly: 4.127 ± 0.676
0.59GluHis: 0.59 ± 0.497
2.358GluIle: 2.358 ± 1.376
1.769GluLys: 1.769 ± 0.971
3.538GluLeu: 3.538 ± 0.792
0.0GluMet: 0.0 ± 0.0
1.769GluAsn: 1.769 ± 0.662
3.538GluPro: 3.538 ± 0.985
1.769GluGln: 1.769 ± 1.085
0.59GluArg: 0.59 ± 0.497
3.538GluSer: 3.538 ± 1.416
2.948GluThr: 2.948 ± 1.398
1.769GluVal: 1.769 ± 1.028
1.179GluTrp: 1.179 ± 0.699
2.948GluTyr: 2.948 ± 1.354
0.0GluXaa: 0.0 ± 0.0
Phe
0.59PheAla: 0.59 ± 0.497
0.59PheCys: 0.59 ± 0.571
2.358PheAsp: 2.358 ± 1.34
1.179PheGlu: 1.179 ± 0.549
1.179PhePhe: 1.179 ± 0.71
1.769PheGly: 1.769 ± 1.131
1.769PheHis: 1.769 ± 0.83
0.59PheIle: 0.59 ± 0.505
6.486PheLys: 6.486 ± 1.271
4.717PheLeu: 4.717 ± 1.875
2.358PheMet: 2.358 ± 0.499
2.948PheAsn: 2.948 ± 1.336
1.769PhePro: 1.769 ± 1.09
2.358PheGln: 2.358 ± 1.572
1.769PheArg: 1.769 ± 0.726
2.358PheSer: 2.358 ± 0.601
4.127PheThr: 4.127 ± 1.368
1.769PheVal: 1.769 ± 1.035
0.59PheTrp: 0.59 ± 0.497
1.179PheTyr: 1.179 ± 0.737
0.0PheXaa: 0.0 ± 0.0
Gly
3.538GlyAla: 3.538 ± 1.231
1.769GlyCys: 1.769 ± 0.83
4.717GlyAsp: 4.717 ± 0.978
2.948GlyGlu: 2.948 ± 1.208
1.769GlyPhe: 1.769 ± 0.903
4.127GlyGly: 4.127 ± 1.967
1.179GlyHis: 1.179 ± 0.753
4.127GlyIle: 4.127 ± 1.223
3.538GlyLys: 3.538 ± 1.516
1.769GlyLeu: 1.769 ± 0.831
1.769GlyMet: 1.769 ± 1.078
1.769GlyAsn: 1.769 ± 1.252
3.538GlyPro: 3.538 ± 2.01
1.179GlyGln: 1.179 ± 0.67
1.769GlyArg: 1.769 ± 0.763
1.179GlySer: 1.179 ± 0.549
3.538GlyThr: 3.538 ± 1.846
2.358GlyVal: 2.358 ± 1.329
0.0GlyTrp: 0.0 ± 0.0
0.59GlyTyr: 0.59 ± 0.547
0.0GlyXaa: 0.0 ± 0.0
His
1.179HisAla: 1.179 ± 0.856
1.769HisCys: 1.769 ± 1.315
2.948HisAsp: 2.948 ± 1.368
1.179HisGlu: 1.179 ± 0.753
2.948HisPhe: 2.948 ± 1.033
1.179HisGly: 1.179 ± 0.788
2.358HisHis: 2.358 ± 2.355
2.358HisIle: 2.358 ± 1.129
1.769HisLys: 1.769 ± 0.779
1.769HisLeu: 1.769 ± 1.124
0.59HisMet: 0.59 ± 0.505
1.769HisAsn: 1.769 ± 1.124
2.948HisPro: 2.948 ± 1.488
1.179HisGln: 1.179 ± 1.011
4.127HisArg: 4.127 ± 1.66
1.769HisSer: 1.769 ± 0.703
2.358HisThr: 2.358 ± 1.42
2.948HisVal: 2.948 ± 0.953
0.0HisTrp: 0.0 ± 0.0
1.179HisTyr: 1.179 ± 0.549
0.0HisXaa: 0.0 ± 0.0
Ile
1.179IleAla: 1.179 ± 0.825
0.59IleCys: 0.59 ± 0.497
5.307IleAsp: 5.307 ± 1.526
1.179IleGlu: 1.179 ± 0.549
2.948IlePhe: 2.948 ± 1.356
2.358IleGly: 2.358 ± 1.129
0.59IleHis: 0.59 ± 0.617
7.665IleIle: 7.665 ± 2.507
7.075IleLys: 7.075 ± 0.766
1.769IleLeu: 1.769 ± 0.601
2.948IleMet: 2.948 ± 0.818
2.948IleAsn: 2.948 ± 1.469
1.769IlePro: 1.769 ± 1.124
6.486IleGln: 6.486 ± 2.121
5.896IleArg: 5.896 ± 2.195
3.538IleSer: 3.538 ± 2.068
4.717IleThr: 4.717 ± 1.592
1.769IleVal: 1.769 ± 0.556
1.769IleTrp: 1.769 ± 1.166
1.769IleTyr: 1.769 ± 0.97
0.0IleXaa: 0.0 ± 0.0
Lys
2.358LysAla: 2.358 ± 1.022
1.769LysCys: 1.769 ± 0.677
1.769LysAsp: 1.769 ± 0.601
5.896LysGlu: 5.896 ± 1.545
3.538LysPhe: 3.538 ± 1.452
1.769LysGly: 1.769 ± 0.916
2.948LysHis: 2.948 ± 0.703
2.358LysIle: 2.358 ± 0.814
1.769LysLys: 1.769 ± 0.662
3.538LysLeu: 3.538 ± 1.998
1.769LysMet: 1.769 ± 1.383
3.538LysAsn: 3.538 ± 1.431
4.717LysPro: 4.717 ± 0.983
1.769LysGln: 1.769 ± 0.943
4.127LysArg: 4.127 ± 1.11
4.717LysSer: 4.717 ± 1.3
2.948LysThr: 2.948 ± 1.436
4.127LysVal: 4.127 ± 0.895
0.0LysTrp: 0.0 ± 0.0
3.538LysTyr: 3.538 ± 0.987
0.0LysXaa: 0.0 ± 0.0
Leu
1.179LeuAla: 1.179 ± 0.621
1.769LeuCys: 1.769 ± 0.971
4.127LeuAsp: 4.127 ± 1.363
4.717LeuGlu: 4.717 ± 1.889
2.948LeuPhe: 2.948 ± 1.068
4.127LeuGly: 4.127 ± 1.743
5.307LeuHis: 5.307 ± 1.646
5.896LeuIle: 5.896 ± 1.172
5.307LeuLys: 5.307 ± 0.988
4.127LeuLeu: 4.127 ± 1.37
0.59LeuMet: 0.59 ± 0.732
5.307LeuAsn: 5.307 ± 0.959
3.538LeuPro: 3.538 ± 1.382
2.358LeuGln: 2.358 ± 0.99
2.948LeuArg: 2.948 ± 1.07
7.665LeuSer: 7.665 ± 2.249
2.948LeuThr: 2.948 ± 1.398
3.538LeuVal: 3.538 ± 1.051
0.59LeuTrp: 0.59 ± 0.571
2.948LeuTyr: 2.948 ± 2.187
0.0LeuXaa: 0.0 ± 0.0
Met
0.59MetAla: 0.59 ± 0.547
0.0MetCys: 0.0 ± 0.0
2.358MetAsp: 2.358 ± 1.089
2.358MetGlu: 2.358 ± 0.874
2.948MetPhe: 2.948 ± 1.715
2.358MetGly: 2.358 ± 1.377
0.59MetHis: 0.59 ± 0.505
0.59MetIle: 0.59 ± 0.547
1.179MetLys: 1.179 ± 0.622
3.538MetLeu: 3.538 ± 1.086
0.59MetMet: 0.59 ± 0.571
0.0MetAsn: 0.0 ± 0.0
2.358MetPro: 2.358 ± 0.499
1.179MetGln: 1.179 ± 0.825
3.538MetArg: 3.538 ± 1.626
0.59MetSer: 0.59 ± 0.571
0.59MetThr: 0.59 ± 0.657
0.0MetVal: 0.0 ± 0.0
1.769MetTrp: 1.769 ± 0.762
1.769MetTyr: 1.769 ± 1.166
0.0MetXaa: 0.0 ± 0.0
Asn
2.948AsnAla: 2.948 ± 1.275
0.59AsnCys: 0.59 ± 0.453
2.358AsnAsp: 2.358 ± 0.862
1.769AsnGlu: 1.769 ± 0.726
1.179AsnPhe: 1.179 ± 0.773
1.179AsnGly: 1.179 ± 0.84
2.358AsnHis: 2.358 ± 1.415
4.127AsnIle: 4.127 ± 1.727
2.358AsnLys: 2.358 ± 0.859
2.948AsnLeu: 2.948 ± 1.089
1.179AsnMet: 1.179 ± 0.71
2.358AsnAsn: 2.358 ± 1.091
3.538AsnPro: 3.538 ± 1.337
1.179AsnGln: 1.179 ± 0.622
1.769AsnArg: 1.769 ± 1.089
3.538AsnSer: 3.538 ± 2.08
3.538AsnThr: 3.538 ± 1.303
4.717AsnVal: 4.717 ± 2.286
0.0AsnTrp: 0.0 ± 0.0
2.358AsnTyr: 2.358 ± 1.208
0.0AsnXaa: 0.0 ± 0.0
Pro
3.538ProAla: 3.538 ± 1.182
1.769ProCys: 1.769 ± 0.726
1.769ProAsp: 1.769 ± 0.762
2.358ProGlu: 2.358 ± 1.19
2.358ProPhe: 2.358 ± 1.01
3.538ProGly: 3.538 ± 1.133
3.538ProHis: 3.538 ± 1.921
5.307ProIle: 5.307 ± 1.131
3.538ProLys: 3.538 ± 1.938
2.948ProLeu: 2.948 ± 1.014
2.358ProMet: 2.358 ± 1.282
2.948ProAsn: 2.948 ± 1.054
2.948ProPro: 2.948 ± 1.377
2.948ProGln: 2.948 ± 1.603
4.717ProArg: 4.717 ± 1.284
7.075ProSer: 7.075 ± 3.079
6.486ProThr: 6.486 ± 2.685
0.0ProVal: 0.0 ± 0.0
1.179ProTrp: 1.179 ± 0.549
4.127ProTyr: 4.127 ± 1.49
0.0ProXaa: 0.0 ± 0.0
Gln
3.538GlnAla: 3.538 ± 1.357
0.59GlnCys: 0.59 ± 0.571
2.358GlnAsp: 2.358 ± 1.183
0.59GlnGlu: 0.59 ± 0.505
2.358GlnPhe: 2.358 ± 1.01
1.769GlnGly: 1.769 ± 0.971
0.59GlnHis: 0.59 ± 0.589
4.127GlnIle: 4.127 ± 1.488
1.179GlnLys: 1.179 ± 0.621
2.358GlnLeu: 2.358 ± 1.201
1.179GlnMet: 1.179 ± 0.759
1.769GlnAsn: 1.769 ± 0.807
3.538GlnPro: 3.538 ± 1.682
2.358GlnGln: 2.358 ± 1.125
2.948GlnArg: 2.948 ± 1.079
2.358GlnSer: 2.358 ± 0.707
3.538GlnThr: 3.538 ± 1.063
6.486GlnVal: 6.486 ± 2.1
0.0GlnTrp: 0.0 ± 0.0
0.59GlnTyr: 0.59 ± 0.505
0.0GlnXaa: 0.0 ± 0.0
Arg
1.769ArgAla: 1.769 ± 0.817
1.769ArgCys: 1.769 ± 0.726
4.717ArgAsp: 4.717 ± 1.876
0.59ArgGlu: 0.59 ± 0.505
2.948ArgPhe: 2.948 ± 0.746
4.127ArgGly: 4.127 ± 1.019
2.358ArgHis: 2.358 ± 0.982
3.538ArgIle: 3.538 ± 2.36
4.127ArgLys: 4.127 ± 1.717
7.665ArgLeu: 7.665 ± 3.032
3.538ArgMet: 3.538 ± 2.171
0.0ArgAsn: 0.0 ± 0.0
4.127ArgPro: 4.127 ± 1.486
2.948ArgGln: 2.948 ± 0.953
8.255ArgArg: 8.255 ± 2.806
5.896ArgSer: 5.896 ± 2.016
2.358ArgThr: 2.358 ± 0.918
3.538ArgVal: 3.538 ± 2.118
0.0ArgTrp: 0.0 ± 0.0
2.948ArgTyr: 2.948 ± 1.07
0.0ArgXaa: 0.0 ± 0.0
Ser
2.358SerAla: 2.358 ± 1.166
1.179SerCys: 1.179 ± 0.699
2.948SerAsp: 2.948 ± 0.848
4.127SerGlu: 4.127 ± 1.461
2.948SerPhe: 2.948 ± 0.914
2.358SerGly: 2.358 ± 1.275
2.948SerHis: 2.948 ± 1.538
4.127SerIle: 4.127 ± 0.987
5.307SerLys: 5.307 ± 1.452
4.127SerLeu: 4.127 ± 1.668
1.769SerMet: 1.769 ± 0.718
2.948SerAsn: 2.948 ± 1.177
7.665SerPro: 7.665 ± 1.234
5.307SerGln: 5.307 ± 1.441
5.896SerArg: 5.896 ± 1.267
10.024SerSer: 10.024 ± 2.063
9.434SerThr: 9.434 ± 3.667
6.486SerVal: 6.486 ± 2.786
0.59SerTrp: 0.59 ± 0.571
4.127SerTyr: 4.127 ± 0.744
0.0SerXaa: 0.0 ± 0.0
Thr
2.358ThrAla: 2.358 ± 0.951
1.769ThrCys: 1.769 ± 0.884
1.769ThrAsp: 1.769 ± 1.089
1.769ThrGlu: 1.769 ± 0.916
2.358ThrPhe: 2.358 ± 0.76
4.127ThrGly: 4.127 ± 1.125
4.717ThrHis: 4.717 ± 1.408
4.127ThrIle: 4.127 ± 1.884
2.948ThrLys: 2.948 ± 1.373
4.717ThrLeu: 4.717 ± 1.386
2.358ThrMet: 2.358 ± 0.85
3.538ThrAsn: 3.538 ± 1.28
4.717ThrPro: 4.717 ± 0.746
1.179ThrGln: 1.179 ± 0.882
3.538ThrArg: 3.538 ± 1.662
5.896ThrSer: 5.896 ± 2.283
3.538ThrThr: 3.538 ± 1.161
4.127ThrVal: 4.127 ± 1.606
1.769ThrTrp: 1.769 ± 0.919
2.948ThrTyr: 2.948 ± 1.436
0.0ThrXaa: 0.0 ± 0.0
Val
0.59ValAla: 0.59 ± 0.547
1.179ValCys: 1.179 ± 0.549
4.127ValAsp: 4.127 ± 1.658
3.538ValGlu: 3.538 ± 2.181
2.358ValPhe: 2.358 ± 1.271
2.358ValGly: 2.358 ± 1.245
1.179ValHis: 1.179 ± 0.621
4.127ValIle: 4.127 ± 1.267
2.358ValLys: 2.358 ± 0.984
4.717ValLeu: 4.717 ± 2.078
1.769ValMet: 1.769 ± 1.21
2.358ValAsn: 2.358 ± 1.775
5.896ValPro: 5.896 ± 1.661
3.538ValGln: 3.538 ± 2.268
2.948ValArg: 2.948 ± 2.242
8.255ValSer: 8.255 ± 2.362
2.948ValThr: 2.948 ± 0.96
2.358ValVal: 2.358 ± 1.12
1.769ValTrp: 1.769 ± 0.839
2.358ValTyr: 2.358 ± 1.669
0.0ValXaa: 0.0 ± 0.0
Trp
1.769TrpAla: 1.769 ± 0.931
0.59TrpCys: 0.59 ± 0.571
0.59TrpAsp: 0.59 ± 0.453
1.179TrpGlu: 1.179 ± 0.688
0.0TrpPhe: 0.0 ± 0.0
0.59TrpGly: 0.59 ± 0.505
0.59TrpHis: 0.59 ± 0.571
0.0TrpIle: 0.0 ± 0.0
0.59TrpLys: 0.59 ± 0.571
0.59TrpLeu: 0.59 ± 0.571
0.59TrpMet: 0.59 ± 0.571
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.59TrpGln: 0.59 ± 0.505
1.179TrpArg: 1.179 ± 0.699
1.769TrpSer: 1.769 ± 0.781
0.59TrpThr: 0.59 ± 0.617
1.179TrpVal: 1.179 ± 0.604
0.0TrpTrp: 0.0 ± 0.0
1.179TrpTyr: 1.179 ± 0.787
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.179TyrAla: 1.179 ± 0.622
0.59TyrCys: 0.59 ± 0.571
0.59TyrAsp: 0.59 ± 0.497
2.358TyrGlu: 2.358 ± 1.089
1.769TyrPhe: 1.769 ± 0.677
1.769TyrGly: 1.769 ± 0.662
0.59TyrHis: 0.59 ± 0.505
2.948TyrIle: 2.948 ± 1.013
2.358TyrLys: 2.358 ± 1.01
4.717TyrLeu: 4.717 ± 1.679
1.179TyrMet: 1.179 ± 0.824
2.948TyrAsn: 2.948 ± 0.982
2.358TyrPro: 2.358 ± 0.76
1.179TyrGln: 1.179 ± 0.622
2.948TyrArg: 2.948 ± 1.808
2.948TyrSer: 2.948 ± 1.213
0.59TyrThr: 0.59 ± 0.497
5.307TyrVal: 5.307 ± 2.35
0.0TyrTrp: 0.0 ± 0.0
1.179TyrTyr: 1.179 ± 0.737
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1697 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski