Amino acid dipeptide frequency for Mirabilis leaf curl virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
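As an illustration of the calculation described above, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that counts are normalised by the total number of pairs. The function name `dipeptide_permille` and the toy sequences are hypothetical, and the database's exact counting and normalisation may differ.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectations in per mille: 1/400 and 1/441.
EXPECTED_20 = 1000.0 / 20 ** 2
EXPECTED_21 = 1000.0 / 21 ** 2

# Toy input (not real viral proteins): pairs are AA, AC, AC, CA.
freqs = dipeptide_permille(["AAC", "ACA"])
for pair, value in sorted(freqs.items()):
    label = "above" if value > EXPECTED_20 else "at or below"
    print(pair, round(value, 3), label, "uniform expectation")
```

Comparing each value with `EXPECTED_20` mirrors the red/blue marking only loosely; the rule the page actually uses for colouring is not stated here.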

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
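The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical; the AlaLeu value is taken from the table, and the 2.5‰ reference is simply 1/400 expressed in per mille.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Multiply by 10^-3 to turn a per mille value into a plain fraction."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """One per cent equals ten per mille."""
    return value_permille / 10.0


def ratio_to_uniform(value_permille):
    """How many times more (or less) frequent than the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE


ala_leu = 8.264  # AlaLeu value from the table, in per mille
print(permille_to_fraction(ala_leu))
print(permille_to_percent(ala_leu))
print(ratio_to_uniform(ala_leu))
```

Keeping the table in per mille and converting only when needed avoids long strings of leading zeros for rare dipeptides.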

For more information see sequence space article on Wikipedia.

Entries are grouped by first residue; within each group the second residue follows the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.428 ± 2.264
AlaCys: 1.837 ± 1.074
AlaAsp: 0.918 ± 0.696
AlaGlu: 1.837 ± 1.052
AlaPhe: 0.0 ± 0.0
AlaGly: 1.837 ± 0.744
AlaHis: 2.755 ± 1.929
AlaIle: 1.837 ± 0.941
AlaLys: 4.591 ± 1.881
AlaLeu: 8.264 ± 1.306
AlaMet: 0.0 ± 0.0
AlaAsn: 1.837 ± 0.744
AlaPro: 2.755 ± 1.139
AlaGln: 5.51 ± 2.063
AlaArg: 3.673 ± 1.849
AlaSer: 5.51 ± 1.813
AlaThr: 4.591 ± 1.875
AlaVal: 3.673 ± 2.993
AlaTrp: 1.837 ± 0.744
AlaTyr: 1.837 ± 1.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.837 ± 2.026
CysAsp: 0.0 ± 0.0
CysGlu: 0.918 ± 0.696
CysPhe: 0.918 ± 0.985
CysGly: 1.837 ± 0.941
CysHis: 0.918 ± 0.928
CysIle: 0.0 ± 0.0
CysLys: 0.918 ± 0.696
CysLeu: 0.0 ± 0.0
CysMet: 2.755 ± 1.493
CysAsn: 0.918 ± 0.674
CysPro: 1.837 ± 2.026
CysGln: 1.837 ± 2.026
CysArg: 0.918 ± 0.674
CysSer: 3.673 ± 1.882
CysThr: 0.918 ± 0.696
CysVal: 1.837 ± 1.392
CysTrp: 0.0 ± 0.0
CysTyr: 0.918 ± 1.288
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.837 ± 1.349
AspCys: 0.0 ± 0.0
AspAsp: 2.755 ± 1.18
AspGlu: 2.755 ± 0.89
AspPhe: 1.837 ± 0.744
AspGly: 1.837 ± 1.349
AspHis: 1.837 ± 1.297
AspIle: 3.673 ± 1.254
AspLys: 1.837 ± 0.744
AspLeu: 5.51 ± 2.815
AspMet: 0.0 ± 0.0
AspAsn: 1.837 ± 1.074
AspPro: 1.837 ± 1.007
AspGln: 1.837 ± 1.349
AspArg: 2.755 ± 1.273
AspSer: 3.673 ± 0.972
AspThr: 2.755 ± 1.303
AspVal: 4.591 ± 1.357
AspTrp: 1.837 ± 0.941
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.755 ± 0.938
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 4.591 ± 1.602
GluPhe: 3.673 ± 1.928
GluGly: 6.428 ± 1.938
GluHis: 0.918 ± 0.985
GluIle: 0.918 ± 0.985
GluLys: 0.918 ± 0.674
GluLeu: 3.673 ± 2.027
GluMet: 0.0 ± 0.0
GluAsn: 4.591 ± 1.875
GluPro: 2.755 ± 0.89
GluGln: 1.837 ± 1.13
GluArg: 0.0 ± 0.0
GluSer: 4.591 ± 1.955
GluThr: 2.755 ± 1.464
GluVal: 1.837 ± 0.744
GluTrp: 2.755 ± 1.348
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.918 ± 0.696
PheAsp: 2.755 ± 1.238
PheGlu: 1.837 ± 0.744
PhePhe: 0.918 ± 0.696
PheGly: 1.837 ± 1.392
PheHis: 2.755 ± 1.382
PheIle: 3.673 ± 0.972
PheLys: 2.755 ± 1.923
PheLeu: 7.346 ± 2.677
PheMet: 0.918 ± 0.674
PheAsn: 2.755 ± 1.914
PhePro: 2.755 ± 1.303
PheGln: 2.755 ± 1.348
PheArg: 2.755 ± 1.369
PheSer: 0.918 ± 0.674
PheThr: 1.837 ± 0.941
PheVal: 0.918 ± 0.674
PheTrp: 0.0 ± 0.0
PheTyr: 0.918 ± 0.696
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.837 ± 1.349
GlyCys: 1.837 ± 1.074
GlyAsp: 4.591 ± 2.225
GlyGlu: 4.591 ± 0.978
GlyPhe: 0.918 ± 0.928
GlyGly: 2.755 ± 1.238
GlyHis: 0.918 ± 0.674
GlyIle: 3.673 ± 0.972
GlyLys: 7.346 ± 2.431
GlyLeu: 1.837 ± 1.13
GlyMet: 0.918 ± 1.013
GlyAsn: 1.837 ± 2.576
GlyPro: 3.673 ± 1.849
GlyGln: 1.837 ± 0.744
GlyArg: 1.837 ± 1.052
GlySer: 2.755 ± 1.238
GlyThr: 2.755 ± 2.231
GlyVal: 2.755 ± 2.396
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.918 ± 1.013
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.755 ± 1.273
HisCys: 1.837 ± 1.463
HisAsp: 1.837 ± 1.074
HisGlu: 0.918 ± 1.013
HisPhe: 2.755 ± 1.382
HisGly: 0.918 ± 0.928
HisHis: 3.673 ± 2.877
HisIle: 0.918 ± 0.696
HisLys: 1.837 ± 1.256
HisLeu: 2.755 ± 1.369
HisMet: 0.0 ± 0.0
HisAsn: 3.673 ± 2.104
HisPro: 1.837 ± 1.349
HisGln: 0.918 ± 0.674
HisArg: 4.591 ± 2.921
HisSer: 2.755 ± 1.929
HisThr: 1.837 ± 1.392
HisVal: 1.837 ± 1.97
HisTrp: 0.0 ± 0.0
HisTyr: 0.918 ± 0.674
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.918 ± 0.928
IleCys: 2.755 ± 1.464
IleAsp: 2.755 ± 2.023
IleGlu: 0.918 ± 0.674
IlePhe: 3.673 ± 1.913
IleGly: 1.837 ± 1.392
IleHis: 0.918 ± 0.985
IleIle: 2.755 ± 1.485
IleLys: 6.428 ± 2.089
IleLeu: 1.837 ± 1.719
IleMet: 0.0 ± 0.78
IleAsn: 0.918 ± 0.985
IlePro: 1.837 ± 0.941
IleGln: 3.673 ± 1.231
IleArg: 4.591 ± 1.965
IleSer: 1.837 ± 0.744
IleThr: 4.591 ± 2.939
IleVal: 1.837 ± 1.349
IleTrp: 2.755 ± 1.914
IleTyr: 3.673 ± 1.254
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.591 ± 1.2
LysCys: 1.837 ± 1.052
LysAsp: 1.837 ± 1.349
LysGlu: 4.591 ± 1.6
LysPhe: 1.837 ± 1.051
LysGly: 4.591 ± 2.687
LysHis: 0.918 ± 0.674
LysIle: 4.591 ± 1.359
LysLys: 1.837 ± 1.074
LysLeu: 0.918 ± 0.674
LysMet: 0.0 ± 0.0
LysAsn: 6.428 ± 2.196
LysPro: 3.673 ± 1.912
LysGln: 1.837 ± 1.052
LysArg: 1.837 ± 1.392
LysSer: 2.755 ± 1.348
LysThr: 2.755 ± 0.925
LysVal: 3.673 ± 2.154
LysTrp: 0.918 ± 0.696
LysTyr: 4.591 ± 0.978
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.591 ± 2.01
LeuCys: 2.755 ± 1.369
LeuAsp: 4.591 ± 1.878
LeuGlu: 2.755 ± 1.382
LeuPhe: 0.0 ± 0.0
LeuGly: 5.51 ± 1.668
LeuHis: 1.837 ± 1.349
LeuIle: 3.673 ± 2.558
LeuLys: 4.591 ± 1.084
LeuLeu: 3.673 ± 1.496
LeuMet: 2.755 ± 1.658
LeuAsn: 6.428 ± 1.255
LeuPro: 2.755 ± 1.743
LeuGln: 3.673 ± 1.829
LeuArg: 4.591 ± 2.373
LeuSer: 4.591 ± 2.433
LeuThr: 5.51 ± 1.96
LeuVal: 3.673 ± 2.26
LeuTrp: 0.918 ± 0.985
LeuTyr: 3.673 ± 1.304
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.918 ± 0.696
MetCys: 0.918 ± 0.696
MetAsp: 2.755 ± 1.658
MetGlu: 0.918 ± 0.674
MetPhe: 2.755 ± 1.553
MetGly: 2.755 ± 1.464
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 3.673 ± 2.868
MetMet: 0.0 ± 0.0
MetAsn: 1.837 ± 1.051
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.837 ± 0.941
MetSer: 1.837 ± 1.424
MetThr: 1.837 ± 1.652
MetVal: 0.0 ± 0.0
MetTrp: 1.837 ± 1.007
MetTyr: 0.918 ± 0.696
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.755 ± 1.238
AsnCys: 0.0 ± 0.0
AsnAsp: 1.837 ± 1.349
AsnGlu: 1.837 ± 1.13
AsnPhe: 0.918 ± 0.696
AsnGly: 0.918 ± 0.674
AsnHis: 4.591 ± 2.034
AsnIle: 4.591 ± 1.757
AsnLys: 0.0 ± 0.0
AsnLeu: 6.428 ± 2.108
AsnMet: 1.837 ± 1.298
AsnAsn: 4.591 ± 2.011
AsnPro: 3.673 ± 1.231
AsnGln: 2.755 ± 0.925
AsnArg: 6.428 ± 2.086
AsnSer: 5.51 ± 2.24
AsnThr: 2.755 ± 1.464
AsnVal: 6.428 ± 2.87
AsnTrp: 0.918 ± 0.674
AsnTyr: 2.755 ± 0.938
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.673 ± 2.049
ProCys: 2.755 ± 1.483
ProAsp: 2.755 ± 1.483
ProGlu: 2.755 ± 1.303
ProPhe: 2.755 ± 1.467
ProGly: 0.918 ± 0.674
ProHis: 3.673 ± 1.928
ProIle: 3.673 ± 2.026
ProLys: 2.755 ± 2.023
ProLeu: 3.673 ± 1.574
ProMet: 2.755 ± 1.139
ProAsn: 2.755 ± 1.348
ProPro: 1.837 ± 1.052
ProGln: 1.837 ± 1.074
ProArg: 5.51 ± 1.601
ProSer: 6.428 ± 3.331
ProThr: 4.591 ± 2.421
ProVal: 4.591 ± 2.043
ProTrp: 0.0 ± 0.0
ProTyr: 0.918 ± 0.696
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 9.183 ± 2.238
GlnCys: 0.0 ± 0.0
GlnAsp: 3.673 ± 1.501
GlnGlu: 2.755 ± 1.139
GlnPhe: 2.755 ± 1.467
GlnGly: 0.918 ± 0.674
GlnHis: 2.755 ± 0.89
GlnIle: 1.837 ± 1.349
GlnLys: 0.0 ± 0.0
GlnLeu: 1.837 ± 2.026
GlnMet: 1.837 ± 1.473
GlnAsn: 2.755 ± 1.467
GlnPro: 4.591 ± 2.722
GlnGln: 2.755 ± 0.89
GlnArg: 1.837 ± 1.145
GlnSer: 6.428 ± 1.098
GlnThr: 0.0 ± 0.0
GlnVal: 4.591 ± 1.743
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.837 ± 0.744
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.673 ± 1.532
ArgCys: 0.918 ± 1.013
ArgAsp: 3.673 ± 1.309
ArgGlu: 2.755 ± 1.348
ArgPhe: 2.755 ± 0.925
ArgGly: 3.673 ± 1.291
ArgHis: 3.673 ± 1.914
ArgIle: 4.591 ± 1.359
ArgLys: 2.755 ± 1.485
ArgLeu: 3.673 ± 1.914
ArgMet: 0.918 ± 0.696
ArgAsn: 0.918 ± 0.674
ArgPro: 7.346 ± 1.848
ArgGln: 0.918 ± 1.013
ArgArg: 5.51 ± 3.221
ArgSer: 3.673 ± 1.214
ArgThr: 4.591 ± 1.346
ArgVal: 5.51 ± 2.504
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.837 ± 1.13
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.591 ± 3.372
SerCys: 0.918 ± 0.674
SerAsp: 2.755 ± 0.89
SerGlu: 1.837 ± 1.007
SerPhe: 4.591 ± 0.965
SerGly: 2.755 ± 1.437
SerHis: 0.0 ± 0.0
SerIle: 3.673 ± 2.182
SerLys: 7.346 ± 1.856
SerLeu: 3.673 ± 1.913
SerMet: 1.837 ± 2.468
SerAsn: 6.428 ± 3.074
SerPro: 9.183 ± 2.853
SerGln: 5.51 ± 3.629
SerArg: 5.51 ± 1.822
SerSer: 9.183 ± 3.488
SerThr: 7.346 ± 2.177
SerVal: 2.755 ± 1.553
SerTrp: 0.0 ± 0.0
SerTyr: 2.755 ± 1.179
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.673 ± 1.709
ThrCys: 0.918 ± 0.674
ThrAsp: 0.0 ± 0.0
ThrGlu: 3.673 ± 1.304
ThrPhe: 2.755 ± 2.023
ThrGly: 3.673 ± 1.76
ThrHis: 3.673 ± 2.147
ThrIle: 1.837 ± 1.349
ThrLys: 2.755 ± 1.273
ThrLeu: 4.591 ± 1.267
ThrMet: 0.918 ± 0.674
ThrAsn: 1.837 ± 0.744
ThrPro: 5.51 ± 3.453
ThrGln: 4.591 ± 3.091
ThrArg: 2.755 ± 1.532
ThrSer: 4.591 ± 2.223
ThrThr: 1.837 ± 1.145
ThrVal: 5.51 ± 2.138
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.837 ± 1.719
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.918 ± 1.288
ValCys: 0.0 ± 0.0
ValAsp: 2.755 ± 0.89
ValGlu: 0.918 ± 1.013
ValPhe: 2.755 ± 1.914
ValGly: 1.837 ± 1.074
ValHis: 1.837 ± 0.744
ValIle: 3.673 ± 1.928
ValLys: 5.51 ± 1.643
ValLeu: 5.51 ± 2.701
ValMet: 2.755 ± 1.537
ValAsn: 4.591 ± 2.926
ValPro: 2.755 ± 0.89
ValGln: 6.428 ± 2.168
ValArg: 2.755 ± 2.088
ValSer: 7.346 ± 3.404
ValThr: 2.755 ± 2.088
ValVal: 3.673 ± 1.76
ValTrp: 0.0 ± 0.0
ValTyr: 5.51 ± 1.903
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.673 ± 1.849
TrpCys: 0.0 ± 0.0
TrpAsp: 0.918 ± 1.013
TrpGlu: 0.918 ± 0.985
TrpPhe: 0.0 ± 0.0
TrpGly: 0.918 ± 0.674
TrpHis: 0.918 ± 0.696
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.837 ± 1.051
TrpAsn: 0.918 ± 0.985
TrpPro: 0.0 ± 0.0
TrpGln: 0.918 ± 0.674
TrpArg: 0.918 ± 0.928
TrpSer: 0.918 ± 0.928
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.918 ± 0.674
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.755 ± 1.273
TyrCys: 0.918 ± 1.013
TyrAsp: 1.837 ± 1.13
TyrGlu: 0.918 ± 0.696
TyrPhe: 2.755 ± 1.274
TyrGly: 1.837 ± 1.007
TyrHis: 0.0 ± 0.0
TyrIle: 1.837 ± 1.349
TyrLys: 1.837 ± 1.145
TyrLeu: 3.673 ± 1.416
TyrMet: 1.837 ± 0.994
TyrAsn: 3.673 ± 1.377
TyrPro: 0.0 ± 0.0
TyrGln: 0.918 ± 0.696
TyrArg: 2.755 ± 1.485
TyrSer: 3.673 ± 1.352
TyrThr: 0.918 ± 1.288
TyrVal: 4.591 ± 1.875
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1090 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
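One way such an error estimate could be obtained is sketched below in Python. It assumes that "at the protein level" means resampling whole proteins with replacement, that each of the 100 rounds recomputes the dipeptide frequencies from the resampled set, and that the reported error is the standard deviation across rounds. The function names, the fixed seed, and the use of the population standard deviation are all assumptions made for illustration, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping adjacent pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard residues are mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Standard deviation of each dipeptide frequency over bootstrap rounds."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    rounds = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = [rng.choice(proteins) for _ in proteins]
        rounds.append(dipeptide_permille(resampled))
    # Every dipeptide seen in at least one round; absent ones count as 0.
    pairs = set().union(*rounds)
    return {
        pair: statistics.pstdev([r.get(pair, 0.0) for r in rounds])
        for pair in pairs
    }


# Toy input (not real viral proteins).
errors = bootstrap_error(["AAAA", "CCCC", "ACAC"])
for pair, err in sorted(errors.items()):
    print(pair, round(err, 3))
```

With only a handful of proteins, as in a small viral proteome, each round draws from very few distinct sequences, so the errors can be large relative to the values themselves.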

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski