Amino acid dipepetide frequency for Sugarcane bacilliform IM virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.704AlaAla: 2.704 ± 1.618
0.451AlaCys: 0.451 ± 0.221
2.704AlaAsp: 2.704 ± 1.618
7.21AlaGlu: 7.21 ± 4.331
4.056AlaPhe: 4.056 ± 1.987
2.253AlaGly: 2.253 ± 1.104
0.901AlaHis: 0.901 ± 0.442
4.957AlaIle: 4.957 ± 1.981
6.309AlaLys: 6.309 ± 5.272
6.309AlaLeu: 6.309 ± 2.961
2.704AlaMet: 2.704 ± 1.325
2.704AlaAsn: 2.704 ± 2.054
2.253AlaPro: 2.253 ± 1.104
3.155AlaGln: 3.155 ± 1.84
4.507AlaArg: 4.507 ± 1.312
2.704AlaSer: 2.704 ± 1.325
4.507AlaThr: 4.507 ± 3.438
4.507AlaVal: 4.507 ± 1.312
0.901AlaTrp: 0.901 ± 0.442
4.056AlaTyr: 4.056 ± 1.479
0.0AlaXaa: 0.0 ± 0.0
Cys
1.352CysAla: 1.352 ± 0.662
0.0CysCys: 0.0 ± 0.0
0.451CysAsp: 0.451 ± 0.221
0.451CysGlu: 0.451 ± 0.221
1.352CysPhe: 1.352 ± 1.192
0.901CysGly: 0.901 ± 0.442
0.901CysHis: 0.901 ± 0.442
0.451CysIle: 0.451 ± 0.221
3.605CysLys: 3.605 ± 1.766
0.451CysLeu: 0.451 ± 0.221
0.0CysMet: 0.0 ± 0.0
0.901CysAsn: 0.901 ± 0.442
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.352CysArg: 1.352 ± 0.662
0.451CysSer: 0.451 ± 0.221
1.803CysThr: 1.803 ± 0.883
0.0CysVal: 0.0 ± 0.0
0.451CysTrp: 0.451 ± 0.221
0.451CysTyr: 0.451 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
3.605AspAla: 3.605 ± 1.072
1.352AspCys: 1.352 ± 0.662
5.408AspAsp: 5.408 ± 1.624
4.957AspGlu: 4.957 ± 2.428
1.803AspPhe: 1.803 ± 0.883
3.155AspGly: 3.155 ± 1.545
1.352AspHis: 1.352 ± 1.192
3.605AspIle: 3.605 ± 1.494
5.408AspLys: 5.408 ± 5.523
4.957AspLeu: 4.957 ± 6.661
1.803AspMet: 1.803 ± 0.883
3.155AspAsn: 3.155 ± 1.545
2.253AspPro: 2.253 ± 1.104
2.253AspGln: 2.253 ± 1.104
1.803AspArg: 1.803 ± 0.883
0.901AspSer: 0.901 ± 0.442
2.253AspThr: 2.253 ± 1.104
2.253AspVal: 2.253 ± 1.01
0.901AspTrp: 0.901 ± 0.442
1.803AspTyr: 1.803 ± 0.883
0.0AspXaa: 0.0 ± 0.0
Glu
6.76GluAla: 6.76 ± 2.065
0.451GluCys: 0.451 ± 0.221
7.661GluAsp: 7.661 ± 3.048
15.773GluGlu: 15.773 ± 6.464
2.253GluPhe: 2.253 ± 1.104
4.056GluGly: 4.056 ± 3.555
3.605GluHis: 3.605 ± 1.072
5.858GluIle: 5.858 ± 1.82
7.21GluLys: 7.21 ± 2.143
7.661GluLeu: 7.661 ± 4.337
2.253GluMet: 2.253 ± 1.104
3.155GluAsn: 3.155 ± 1.004
2.253GluPro: 2.253 ± 4.111
4.056GluGln: 4.056 ± 5.174
4.957GluArg: 4.957 ± 1.546
3.155GluSer: 3.155 ± 1.541
3.155GluThr: 3.155 ± 2.267
7.21GluVal: 7.21 ± 2.404
2.253GluTrp: 2.253 ± 1.01
3.155GluTyr: 3.155 ± 1.004
0.0GluXaa: 0.0 ± 0.0
Phe
0.451PheAla: 0.451 ± 0.221
0.451PheCys: 0.451 ± 0.221
4.507PheAsp: 4.507 ± 5.797
2.704PheGlu: 2.704 ± 0.983
0.901PhePhe: 0.901 ± 0.442
0.901PheGly: 0.901 ± 2.134
0.451PheHis: 0.451 ± 0.221
2.704PheIle: 2.704 ± 0.983
1.803PheLys: 1.803 ± 0.883
2.253PheLeu: 2.253 ± 1.104
1.803PheMet: 1.803 ± 0.883
1.352PheAsn: 1.352 ± 1.192
2.704PhePro: 2.704 ± 1.325
1.803PheGln: 1.803 ± 0.883
3.605PheArg: 3.605 ± 1.072
1.803PheSer: 1.803 ± 2.66
2.253PheThr: 2.253 ± 1.104
2.253PheVal: 2.253 ± 1.719
0.0PheTrp: 0.0 ± 0.0
2.253PheTyr: 2.253 ± 1.104
0.0PheXaa: 0.0 ± 0.0
Gly
5.858GlyAla: 5.858 ± 1.727
0.901GlyCys: 0.901 ± 0.442
1.803GlyAsp: 1.803 ± 0.883
4.957GlyGlu: 4.957 ± 3.331
3.605GlyPhe: 3.605 ± 3.682
1.803GlyGly: 1.803 ± 0.883
0.0GlyHis: 0.0 ± 0.0
1.352GlyIle: 1.352 ± 0.662
5.858GlyLys: 5.858 ± 3.152
3.605GlyLeu: 3.605 ± 1.494
0.451GlyMet: 0.451 ± 0.221
0.901GlyAsn: 0.901 ± 0.442
1.803GlyPro: 1.803 ± 0.883
0.451GlyGln: 0.451 ± 0.221
1.352GlyArg: 1.352 ± 0.662
3.155GlySer: 3.155 ± 1.541
5.858GlyThr: 5.858 ± 1.82
4.056GlyVal: 4.056 ± 1.987
1.352GlyTrp: 1.352 ± 0.662
2.704GlyTyr: 2.704 ± 0.983
0.0GlyXaa: 0.0 ± 0.0
His
0.901HisAla: 0.901 ± 0.442
0.451HisCys: 0.451 ± 0.221
0.901HisAsp: 0.901 ± 0.442
1.352HisGlu: 1.352 ± 1.192
0.451HisPhe: 0.451 ± 0.221
0.901HisGly: 0.901 ± 2.134
0.901HisHis: 0.901 ± 0.442
1.803HisIle: 1.803 ± 0.883
0.451HisLys: 0.451 ± 0.221
3.155HisLeu: 3.155 ± 1.545
0.901HisMet: 0.901 ± 0.442
1.352HisAsn: 1.352 ± 2.814
0.451HisPro: 0.451 ± 0.221
0.901HisGln: 0.901 ± 0.442
2.253HisArg: 2.253 ± 2.517
0.901HisSer: 0.901 ± 1.33
1.352HisThr: 1.352 ± 2.814
1.352HisVal: 1.352 ± 0.662
0.451HisTrp: 0.451 ± 0.221
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.803IleAla: 1.803 ± 0.883
2.253IleCys: 2.253 ± 1.104
4.056IleAsp: 4.056 ± 1.987
8.562IleGlu: 8.562 ± 5.064
0.901IlePhe: 0.901 ± 2.134
1.352IleGly: 1.352 ± 0.662
1.352IleHis: 1.352 ± 1.192
2.253IleIle: 2.253 ± 1.01
6.309IleLys: 6.309 ± 3.091
3.605IleLeu: 3.605 ± 1.629
0.901IleMet: 0.901 ± 0.442
3.155IleAsn: 3.155 ± 1.541
2.253IlePro: 2.253 ± 1.104
3.155IleGln: 3.155 ± 1.84
4.507IleArg: 4.507 ± 2.02
3.155IleSer: 3.155 ± 4.684
3.155IleThr: 3.155 ± 1.545
3.605IleVal: 3.605 ± 1.766
0.451IleTrp: 0.451 ± 0.221
1.803IleTyr: 1.803 ± 0.883
0.0IleXaa: 0.0 ± 0.0
Lys
7.21LysAla: 7.21 ± 5.651
1.803LysCys: 1.803 ± 0.883
4.507LysAsp: 4.507 ± 2.208
8.562LysGlu: 8.562 ± 0.67
2.253LysPhe: 2.253 ± 1.01
4.507LysGly: 4.507 ± 2.208
2.704LysHis: 2.704 ± 0.983
6.309LysIle: 6.309 ± 2.961
8.562LysLys: 8.562 ± 7.068
8.112LysLeu: 8.112 ± 5.258
2.253LysMet: 2.253 ± 1.104
4.056LysAsn: 4.056 ± 1.177
5.408LysPro: 5.408 ± 3.235
4.507LysGln: 4.507 ± 4.537
5.408LysArg: 5.408 ± 3.248
4.507LysSer: 4.507 ± 1.312
5.408LysThr: 5.408 ± 5.523
3.605LysVal: 3.605 ± 1.629
0.451LysTrp: 0.451 ± 0.221
2.253LysTyr: 2.253 ± 1.104
0.0LysXaa: 0.0 ± 0.0
Leu
6.76LeuAla: 6.76 ± 2.801
0.901LeuCys: 0.901 ± 0.442
4.056LeuAsp: 4.056 ± 1.177
6.309LeuGlu: 6.309 ± 0.546
0.901LeuPhe: 0.901 ± 1.33
4.957LeuGly: 4.957 ± 1.981
1.352LeuHis: 1.352 ± 1.98
6.309LeuIle: 6.309 ± 1.638
9.914LeuLys: 9.914 ± 10.087
7.21LeuLeu: 7.21 ± 2.866
2.704LeuMet: 2.704 ± 1.325
4.507LeuAsn: 4.507 ± 1.216
3.155LeuPro: 3.155 ± 1.545
2.704LeuGln: 2.704 ± 3.961
2.253LeuArg: 2.253 ± 1.01
4.957LeuSer: 4.957 ± 1.981
5.408LeuThr: 5.408 ± 1.638
4.056LeuVal: 4.056 ± 3.577
1.352LeuTrp: 1.352 ± 0.662
2.253LeuTyr: 2.253 ± 1.104
0.0LeuXaa: 0.0 ± 0.0
Met
4.056MetAla: 4.056 ± 1.987
0.451MetCys: 0.451 ± 0.221
1.803MetAsp: 1.803 ± 0.883
1.803MetGlu: 1.803 ± 0.883
1.352MetPhe: 1.352 ± 0.662
1.352MetGly: 1.352 ± 0.662
0.451MetHis: 0.451 ± 0.221
1.352MetIle: 1.352 ± 0.662
1.803MetLys: 1.803 ± 1.083
2.253MetLeu: 2.253 ± 1.104
0.901MetMet: 0.901 ± 0.442
1.352MetAsn: 1.352 ± 0.662
0.451MetPro: 0.451 ± 0.221
0.451MetGln: 0.451 ± 0.221
1.803MetArg: 1.803 ± 0.883
1.352MetSer: 1.352 ± 1.98
0.901MetThr: 0.901 ± 0.442
2.253MetVal: 2.253 ± 1.01
0.451MetTrp: 0.451 ± 0.221
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.253AsnAla: 2.253 ± 1.01
0.0AsnCys: 0.0 ± 0.0
2.253AsnAsp: 2.253 ± 1.104
3.155AsnGlu: 3.155 ± 1.541
3.605AsnPhe: 3.605 ± 1.766
2.253AsnGly: 2.253 ± 1.104
0.901AsnHis: 0.901 ± 1.33
1.803AsnIle: 1.803 ± 1.083
3.605AsnLys: 3.605 ± 1.766
3.605AsnLeu: 3.605 ± 2.166
0.451AsnMet: 0.451 ± 0.221
1.352AsnAsn: 1.352 ± 0.662
2.253AsnPro: 2.253 ± 1.719
2.253AsnGln: 2.253 ± 1.01
0.0AsnArg: 0.0 ± 0.0
3.155AsnSer: 3.155 ± 1.004
2.253AsnThr: 2.253 ± 2.268
2.253AsnVal: 2.253 ± 1.104
0.0AsnTrp: 0.0 ± 0.0
2.704AsnTyr: 2.704 ± 1.325
0.0AsnXaa: 0.0 ± 0.0
Pro
4.957ProAla: 4.957 ± 1.546
0.451ProCys: 0.451 ± 0.221
2.253ProAsp: 2.253 ± 1.104
2.704ProGlu: 2.704 ± 1.325
2.704ProPhe: 2.704 ± 1.618
2.253ProGly: 2.253 ± 1.719
0.901ProHis: 0.901 ± 0.442
0.901ProIle: 0.901 ± 2.134
3.155ProLys: 3.155 ± 1.84
2.253ProLeu: 2.253 ± 1.01
0.451ProMet: 0.451 ± 0.221
0.901ProAsn: 0.901 ± 0.442
0.901ProPro: 0.901 ± 0.442
1.352ProGln: 1.352 ± 0.662
3.155ProArg: 3.155 ± 1.545
4.957ProSer: 4.957 ± 1.546
3.155ProThr: 3.155 ± 1.545
1.803ProVal: 1.803 ± 0.883
0.451ProTrp: 0.451 ± 0.221
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.605GlnAla: 3.605 ± 1.494
0.0GlnCys: 0.0 ± 0.0
2.704GlnAsp: 2.704 ± 2.054
3.155GlnGlu: 3.155 ± 1.004
0.901GlnPhe: 0.901 ± 2.918
1.803GlnGly: 1.803 ± 0.883
0.451GlnHis: 0.451 ± 0.221
3.155GlnIle: 3.155 ± 1.545
2.704GlnLys: 2.704 ± 2.385
5.408GlnLeu: 5.408 ± 3.303
1.803GlnMet: 1.803 ± 1.18
0.451GlnAsn: 0.451 ± 0.221
2.253GlnPro: 2.253 ± 1.01
1.352GlnGln: 1.352 ± 1.192
1.352GlnArg: 1.352 ± 1.192
2.253GlnSer: 2.253 ± 1.01
2.253GlnThr: 2.253 ± 1.01
2.253GlnVal: 2.253 ± 1.01
0.901GlnTrp: 0.901 ± 0.442
0.901GlnTyr: 0.901 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
2.253ArgAla: 2.253 ± 1.01
0.901ArgCys: 0.901 ± 0.442
1.803ArgAsp: 1.803 ± 0.883
1.803ArgGlu: 1.803 ± 1.083
3.155ArgPhe: 3.155 ± 1.545
4.056ArgGly: 4.056 ± 1.479
0.901ArgHis: 0.901 ± 0.442
3.155ArgIle: 3.155 ± 2.267
4.056ArgLys: 4.056 ± 1.177
5.408ArgLeu: 5.408 ± 1.638
0.901ArgMet: 0.901 ± 1.33
1.352ArgAsn: 1.352 ± 0.662
2.704ArgPro: 2.704 ± 0.983
1.803ArgGln: 1.803 ± 0.883
2.704ArgArg: 2.704 ± 0.983
4.507ArgSer: 4.507 ± 2.208
4.056ArgThr: 4.056 ± 2.082
3.155ArgVal: 3.155 ± 2.267
0.901ArgTrp: 0.901 ± 0.442
2.704ArgTyr: 2.704 ± 1.325
0.0ArgXaa: 0.0 ± 0.0
Ser
4.507SerAla: 4.507 ± 1.216
0.451SerCys: 0.451 ± 0.221
0.451SerAsp: 0.451 ± 0.221
7.661SerGlu: 7.661 ± 1.025
1.803SerPhe: 1.803 ± 0.883
4.507SerGly: 4.507 ± 3.438
0.901SerHis: 0.901 ± 1.33
2.704SerIle: 2.704 ± 1.618
5.858SerLys: 5.858 ± 1.85
3.605SerLeu: 3.605 ± 1.072
0.901SerMet: 0.901 ± 0.895
2.704SerAsn: 2.704 ± 0.983
2.704SerPro: 2.704 ± 1.325
2.704SerGln: 2.704 ± 0.983
2.704SerArg: 2.704 ± 0.983
9.013SerSer: 9.013 ± 4.415
3.155SerThr: 3.155 ± 1.84
3.155SerVal: 3.155 ± 1.545
1.352SerTrp: 1.352 ± 0.662
0.901SerTyr: 0.901 ± 0.442
0.0SerXaa: 0.0 ± 0.0
Thr
4.056ThrAla: 4.056 ± 1.479
1.352ThrCys: 1.352 ± 1.192
2.704ThrAsp: 2.704 ± 1.325
4.507ThrGlu: 4.507 ± 2.02
1.803ThrPhe: 1.803 ± 2.66
3.605ThrGly: 3.605 ± 1.494
0.901ThrHis: 0.901 ± 0.442
3.155ThrIle: 3.155 ± 1.541
4.507ThrLys: 4.507 ± 1.216
3.605ThrLeu: 3.605 ± 1.766
1.803ThrMet: 1.803 ± 0.883
2.704ThrAsn: 2.704 ± 1.325
0.451ThrPro: 0.451 ± 2.298
2.253ThrGln: 2.253 ± 1.01
3.605ThrArg: 3.605 ± 1.766
4.957ThrSer: 4.957 ± 1.018
3.155ThrThr: 3.155 ± 1.84
3.605ThrVal: 3.605 ± 2.93
1.352ThrTrp: 1.352 ± 1.192
2.704ThrTyr: 2.704 ± 1.325
0.0ThrXaa: 0.0 ± 0.0
Val
2.704ValAla: 2.704 ± 2.385
1.803ValCys: 1.803 ± 0.883
3.605ValAsp: 3.605 ± 1.072
6.76ValGlu: 6.76 ± 4.428
2.704ValPhe: 2.704 ± 1.325
4.056ValGly: 4.056 ± 1.987
1.352ValHis: 1.352 ± 2.814
3.155ValIle: 3.155 ± 1.541
4.957ValLys: 4.957 ± 2.279
4.056ValLeu: 4.056 ± 2.082
2.253ValMet: 2.253 ± 1.104
2.253ValAsn: 2.253 ± 1.104
2.253ValPro: 2.253 ± 1.104
1.803ValGln: 1.803 ± 0.883
2.704ValArg: 2.704 ± 1.325
3.605ValSer: 3.605 ± 1.494
2.704ValThr: 2.704 ± 1.325
2.704ValVal: 2.704 ± 1.325
0.0ValTrp: 0.0 ± 0.0
0.451ValTyr: 0.451 ± 0.221
0.0ValXaa: 0.0 ± 0.0
Trp
1.352TrpAla: 1.352 ± 0.662
0.451TrpCys: 0.451 ± 0.221
0.451TrpAsp: 0.451 ± 0.221
1.803TrpGlu: 1.803 ± 1.083
0.451TrpPhe: 0.451 ± 0.221
1.352TrpGly: 1.352 ± 0.662
0.0TrpHis: 0.0 ± 0.0
0.901TrpIle: 0.901 ± 0.442
2.253TrpLys: 2.253 ± 1.01
1.352TrpLeu: 1.352 ± 0.662
0.0TrpMet: 0.0 ± 0.0
0.451TrpAsn: 0.451 ± 0.221
0.451TrpPro: 0.451 ± 0.221
1.352TrpGln: 1.352 ± 0.662
0.901TrpArg: 0.901 ± 0.442
0.451TrpSer: 0.451 ± 0.221
0.0TrpThr: 0.0 ± 0.0
0.451TrpVal: 0.451 ± 0.221
0.0TrpTrp: 0.0 ± 0.0
0.451TrpTyr: 0.451 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.704TyrAla: 2.704 ± 1.618
0.451TyrCys: 0.451 ± 0.221
0.901TyrAsp: 0.901 ± 0.442
2.704TyrGlu: 2.704 ± 1.325
0.0TyrPhe: 0.0 ± 0.0
1.803TyrGly: 1.803 ± 0.883
0.901TyrHis: 0.901 ± 0.442
2.704TyrIle: 2.704 ± 1.325
4.056TyrLys: 4.056 ± 1.177
3.155TyrLeu: 3.155 ± 1.004
0.901TyrMet: 0.901 ± 0.442
1.352TyrAsn: 1.352 ± 0.662
2.704TyrPro: 2.704 ± 1.325
1.352TyrGln: 1.352 ± 0.662
1.352TyrArg: 1.352 ± 0.662
2.253TyrSer: 2.253 ± 1.104
0.0TyrThr: 0.0 ± 0.0
1.352TyrVal: 1.352 ± 0.662
0.901TyrTrp: 0.901 ± 0.442
0.901TyrTyr: 0.901 ± 0.442
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2220 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski