Amino acid dipepetide frequency for Sugarcane bacilliform MO virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.296AlaAla: 2.296 ± 1.437
0.918AlaCys: 0.918 ± 0.46
3.214AlaAsp: 3.214 ± 3.286
7.346AlaGlu: 7.346 ± 2.84
4.132AlaPhe: 4.132 ± 1.008
2.755AlaGly: 2.755 ± 1.364
1.377AlaHis: 1.377 ± 0.691
2.755AlaIle: 2.755 ± 1.381
4.132AlaLys: 4.132 ± 2.971
4.132AlaLeu: 4.132 ± 1.372
4.132AlaMet: 4.132 ± 2.072
3.214AlaAsn: 3.214 ± 1.736
1.377AlaPro: 1.377 ± 1.671
1.837AlaGln: 1.837 ± 0.921
3.673AlaArg: 3.673 ± 0.918
4.132AlaSer: 4.132 ± 1.372
4.591AlaThr: 4.591 ± 2.302
2.755AlaVal: 2.755 ± 1.381
0.918AlaTrp: 0.918 ± 1.423
2.755AlaTyr: 2.755 ± 1.364
0.0AlaXaa: 0.0 ± 0.0
Cys
1.377CysAla: 1.377 ± 0.691
0.0CysCys: 0.0 ± 0.0
0.459CysAsp: 0.459 ± 0.23
0.459CysGlu: 0.459 ± 0.23
0.459CysPhe: 0.459 ± 0.23
0.459CysGly: 0.459 ± 0.23
0.918CysHis: 0.918 ± 0.46
0.459CysIle: 0.459 ± 0.23
3.214CysLys: 3.214 ± 1.612
0.459CysLeu: 0.459 ± 0.23
0.0CysMet: 0.0 ± 0.0
0.459CysAsn: 0.459 ± 0.23
0.0CysPro: 0.0 ± 0.0
0.918CysGln: 0.918 ± 0.46
1.837CysArg: 1.837 ± 0.921
0.459CysSer: 0.459 ± 0.23
0.459CysThr: 0.459 ± 0.23
0.0CysVal: 0.0 ± 0.0
0.459CysTrp: 0.459 ± 0.23
1.377CysTyr: 1.377 ± 1.25
0.0CysXaa: 0.0 ± 0.0
Asp
2.296AspAla: 2.296 ± 1.151
0.459AspCys: 0.459 ± 0.23
4.132AspAsp: 4.132 ± 1.372
5.969AspGlu: 5.969 ± 2.993
1.837AspPhe: 1.837 ± 0.921
2.755AspGly: 2.755 ± 0.901
1.837AspHis: 1.837 ± 0.921
4.132AspIle: 4.132 ± 2.072
3.673AspLys: 3.673 ± 5.767
5.51AspLeu: 5.51 ± 7.23
1.837AspMet: 1.837 ± 1.092
1.837AspAsn: 1.837 ± 0.921
1.837AspPro: 1.837 ± 0.921
1.837AspGln: 1.837 ± 1.098
1.837AspArg: 1.837 ± 0.921
0.918AspSer: 0.918 ± 0.46
0.918AspThr: 0.918 ± 0.46
1.837AspVal: 1.837 ± 1.098
1.377AspTrp: 1.377 ± 0.691
1.377AspTyr: 1.377 ± 0.691
0.0AspXaa: 0.0 ± 0.0
Glu
7.805GluAla: 7.805 ± 2.495
0.918GluCys: 0.918 ± 0.46
6.428GluAsp: 6.428 ± 3.472
17.447GluGlu: 17.447 ± 8.749
4.132GluPhe: 4.132 ± 2.072
5.051GluGly: 5.051 ± 3.036
3.673GluHis: 3.673 ± 0.918
6.887GluIle: 6.887 ± 2.07
6.428GluLys: 6.428 ± 0.449
7.805GluLeu: 7.805 ± 5.889
2.296GluMet: 2.296 ± 1.151
4.591GluAsn: 4.591 ± 1.138
1.837GluPro: 1.837 ± 1.541
7.346GluGln: 7.346 ± 4.393
3.673GluArg: 3.673 ± 1.33
3.214GluSer: 3.214 ± 1.327
4.132GluThr: 4.132 ± 3.75
6.887GluVal: 6.887 ± 2.07
2.296GluTrp: 2.296 ± 0.978
2.296GluTyr: 2.296 ± 0.978
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.918PheCys: 0.918 ± 0.46
1.837PheAsp: 1.837 ± 0.921
5.051PheGlu: 5.051 ± 2.377
1.837PhePhe: 1.837 ± 0.921
0.459PheGly: 0.459 ± 0.23
0.918PheHis: 0.918 ± 0.46
3.673PheIle: 3.673 ± 1.842
2.296PheLys: 2.296 ± 1.151
3.214PheLeu: 3.214 ± 1.612
2.296PheMet: 2.296 ± 1.151
0.918PheAsn: 0.918 ± 0.46
2.296PhePro: 2.296 ± 1.151
1.837PheGln: 1.837 ± 0.921
2.296PheArg: 2.296 ± 1.151
2.755PheSer: 2.755 ± 0.901
2.296PheThr: 2.296 ± 0.978
0.459PheVal: 0.459 ± 1.986
0.0PheTrp: 0.0 ± 0.0
1.377PheTyr: 1.377 ± 0.691
0.0PheXaa: 0.0 ± 0.0
Gly
4.591GlyAla: 4.591 ± 1.45
0.459GlyCys: 0.459 ± 0.23
0.918GlyAsp: 0.918 ± 0.46
5.51GlyGlu: 5.51 ± 2.728
2.296GlyPhe: 2.296 ± 1.437
2.296GlyGly: 2.296 ± 1.151
0.459GlyHis: 0.459 ± 0.23
2.296GlyIle: 2.296 ± 1.151
6.428GlyLys: 6.428 ± 2.655
4.591GlyLeu: 4.591 ± 3.226
0.459GlyMet: 0.459 ± 0.23
1.837GlyAsn: 1.837 ± 1.541
3.214GlyPro: 3.214 ± 1.612
0.918GlyGln: 0.918 ± 0.46
1.837GlyArg: 1.837 ± 1.098
2.296GlySer: 2.296 ± 1.151
5.051GlyThr: 5.051 ± 1.297
3.214GlyVal: 3.214 ± 1.612
1.377GlyTrp: 1.377 ± 0.691
3.214GlyTyr: 3.214 ± 0.88
0.0GlyXaa: 0.0 ± 0.0
His
1.377HisAla: 1.377 ± 1.671
0.459HisCys: 0.459 ± 0.23
0.918HisAsp: 0.918 ± 1.423
0.918HisGlu: 0.918 ± 0.46
0.0HisPhe: 0.0 ± 0.0
1.377HisGly: 1.377 ± 1.671
0.918HisHis: 0.918 ± 1.423
3.214HisIle: 3.214 ± 0.88
1.377HisLys: 1.377 ± 0.691
2.296HisLeu: 2.296 ± 0.978
0.459HisMet: 0.459 ± 1.61
0.918HisAsn: 0.918 ± 1.423
1.377HisPro: 1.377 ± 0.691
1.837HisGln: 1.837 ± 0.921
0.918HisArg: 0.918 ± 1.423
0.459HisSer: 0.459 ± 0.23
0.0HisThr: 0.0 ± 0.0
0.918HisVal: 0.918 ± 0.46
0.459HisTrp: 0.459 ± 0.23
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.837IleAla: 1.837 ± 0.921
1.377IleCys: 1.377 ± 0.691
3.673IleAsp: 3.673 ± 1.842
10.56IleGlu: 10.56 ± 1.792
1.377IlePhe: 1.377 ± 1.25
1.377IleGly: 1.377 ± 0.691
1.837IleHis: 1.837 ± 1.098
3.214IleIle: 3.214 ± 0.88
6.887IleLys: 6.887 ± 2.189
6.428IleLeu: 6.428 ± 0.449
0.918IleMet: 0.918 ± 0.46
3.214IleAsn: 3.214 ± 1.327
2.296IlePro: 2.296 ± 1.151
3.673IleGln: 3.673 ± 3.62
3.673IleArg: 3.673 ± 1.842
1.837IleSer: 1.837 ± 4.64
4.132IleThr: 4.132 ± 1.372
1.377IleVal: 1.377 ± 0.691
0.0IleTrp: 0.0 ± 0.0
1.377IleTyr: 1.377 ± 0.691
0.0IleXaa: 0.0 ± 0.0
Lys
5.051LysAla: 5.051 ± 4.143
2.296LysCys: 2.296 ± 1.151
4.132LysAsp: 4.132 ± 2.072
7.346LysGlu: 7.346 ± 3.061
2.296LysPhe: 2.296 ± 1.151
5.51LysGly: 5.51 ± 2.763
1.377LysHis: 1.377 ± 1.25
6.428LysIle: 6.428 ± 4.66
8.724LysLys: 8.724 ± 0.658
7.805LysLeu: 7.805 ± 2.855
2.296LysMet: 2.296 ± 1.151
4.132LysAsn: 4.132 ± 1.296
4.591LysPro: 4.591 ± 1.45
2.755LysGln: 2.755 ± 4.028
5.51LysArg: 5.51 ± 2.151
5.969LysSer: 5.969 ± 0.534
5.969LysThr: 5.969 ± 1.843
7.346LysVal: 7.346 ± 3.029
0.918LysTrp: 0.918 ± 0.46
3.673LysTyr: 3.673 ± 1.842
0.0LysXaa: 0.0 ± 0.0
Leu
6.428LeuAla: 6.428 ± 0.449
0.459LeuCys: 0.459 ± 0.23
4.591LeuAsp: 4.591 ± 2.874
6.428LeuGlu: 6.428 ± 2.507
1.837LeuPhe: 1.837 ± 1.098
5.051LeuGly: 5.051 ± 2.377
1.377LeuHis: 1.377 ± 1.671
5.51LeuIle: 5.51 ± 3.295
6.887LeuLys: 6.887 ± 3.25
9.642LeuLeu: 9.642 ± 5.208
2.296LeuMet: 2.296 ± 1.151
4.132LeuAsn: 4.132 ± 1.008
3.214LeuPro: 3.214 ± 1.612
2.755LeuGln: 2.755 ± 0.901
5.51LeuArg: 5.51 ± 3.919
3.673LeuSer: 3.673 ± 3.917
6.887LeuThr: 6.887 ± 7.441
6.428LeuVal: 6.428 ± 3.041
0.918LeuTrp: 0.918 ± 0.46
2.296LeuTyr: 2.296 ± 1.151
0.0LeuXaa: 0.0 ± 0.0
Met
2.296MetAla: 2.296 ± 1.151
0.0MetCys: 0.0 ± 0.0
2.296MetAsp: 2.296 ± 1.151
1.837MetGlu: 1.837 ± 0.921
0.918MetPhe: 0.918 ± 0.46
1.837MetGly: 1.837 ± 0.921
0.0MetHis: 0.0 ± 0.0
0.918MetIle: 0.918 ± 0.46
4.132MetLys: 4.132 ± 1.008
2.296MetLeu: 2.296 ± 1.151
0.918MetMet: 0.918 ± 0.46
0.918MetAsn: 0.918 ± 0.46
0.918MetPro: 0.918 ± 0.46
1.377MetGln: 1.377 ± 0.691
1.377MetArg: 1.377 ± 0.691
1.837MetSer: 1.837 ± 1.541
1.837MetThr: 1.837 ± 0.921
1.377MetVal: 1.377 ± 1.25
0.0MetTrp: 0.0 ± 0.0
1.377MetTyr: 1.377 ± 0.691
0.0MetXaa: 0.0 ± 0.0
Asn
2.755AsnAla: 2.755 ± 1.959
0.0AsnCys: 0.0 ± 0.0
0.918AsnAsp: 0.918 ± 0.46
4.591AsnGlu: 4.591 ± 1.45
1.377AsnPhe: 1.377 ± 0.691
1.837AsnGly: 1.837 ± 0.921
0.918AsnHis: 0.918 ± 1.423
1.837AsnIle: 1.837 ± 1.541
4.132AsnLys: 4.132 ± 2.072
4.591AsnLeu: 4.591 ± 1.083
0.918AsnMet: 0.918 ± 0.46
1.377AsnAsn: 1.377 ± 1.671
2.296AsnPro: 2.296 ± 1.437
2.296AsnGln: 2.296 ± 0.978
1.377AsnArg: 1.377 ± 0.691
3.673AsnSer: 3.673 ± 2.197
2.296AsnThr: 2.296 ± 1.151
1.377AsnVal: 1.377 ± 0.691
0.459AsnTrp: 0.459 ± 0.23
3.214AsnTyr: 3.214 ± 1.612
0.0AsnXaa: 0.0 ± 0.0
Pro
4.591ProAla: 4.591 ± 1.45
0.459ProCys: 0.459 ± 0.23
3.214ProAsp: 3.214 ± 1.612
2.755ProGlu: 2.755 ± 1.381
1.837ProPhe: 1.837 ± 1.541
2.755ProGly: 2.755 ± 1.381
0.918ProHis: 0.918 ± 0.46
0.459ProIle: 0.459 ± 0.23
2.755ProLys: 2.755 ± 0.901
1.837ProLeu: 1.837 ± 1.098
0.918ProMet: 0.918 ± 0.46
0.459ProAsn: 0.459 ± 0.23
1.377ProPro: 1.377 ± 0.691
1.377ProGln: 1.377 ± 0.691
2.755ProArg: 2.755 ± 1.381
3.214ProSer: 3.214 ± 1.327
2.296ProThr: 2.296 ± 1.151
3.673ProVal: 3.673 ± 1.33
0.0ProTrp: 0.0 ± 0.0
0.918ProTyr: 0.918 ± 0.46
0.0ProXaa: 0.0 ± 0.0
Gln
3.673GlnAla: 3.673 ± 1.842
0.459GlnCys: 0.459 ± 0.23
2.296GlnAsp: 2.296 ± 1.437
4.132GlnGlu: 4.132 ± 1.008
0.918GlnPhe: 0.918 ± 1.423
1.837GlnGly: 1.837 ± 0.921
0.918GlnHis: 0.918 ± 0.46
2.755GlnIle: 2.755 ± 1.381
2.296GlnLys: 2.296 ± 2.669
5.969GlnLeu: 5.969 ± 0.534
2.296GlnMet: 2.296 ± 1.024
1.377GlnAsn: 1.377 ± 1.671
2.755GlnPro: 2.755 ± 0.901
1.377GlnGln: 1.377 ± 1.25
2.296GlnArg: 2.296 ± 0.978
1.837GlnSer: 1.837 ± 2.846
3.214GlnThr: 3.214 ± 0.88
2.296GlnVal: 2.296 ± 0.978
0.0GlnTrp: 0.0 ± 0.0
1.377GlnTyr: 1.377 ± 0.691
0.0GlnXaa: 0.0 ± 0.0
Arg
3.673ArgAla: 3.673 ± 0.918
0.918ArgCys: 0.918 ± 0.46
1.377ArgAsp: 1.377 ± 0.691
3.673ArgGlu: 3.673 ± 2.197
2.296ArgPhe: 2.296 ± 1.151
3.214ArgGly: 3.214 ± 1.327
0.918ArgHis: 0.918 ± 0.46
5.051ArgIle: 5.051 ± 0.878
5.969ArgLys: 5.969 ± 3.697
5.51ArgLeu: 5.51 ± 1.474
0.459ArgMet: 0.459 ± 0.23
2.755ArgAsn: 2.755 ± 1.381
0.918ArgPro: 0.918 ± 1.423
0.918ArgGln: 0.918 ± 0.46
2.755ArgArg: 2.755 ± 2.5
3.214ArgSer: 3.214 ± 1.612
4.591ArgThr: 4.591 ± 1.083
3.673ArgVal: 3.673 ± 2.197
0.918ArgTrp: 0.918 ± 0.46
1.377ArgTyr: 1.377 ± 0.691
0.0ArgXaa: 0.0 ± 0.0
Ser
3.214SerAla: 3.214 ± 1.612
0.459SerCys: 0.459 ± 0.23
3.214SerAsp: 3.214 ± 0.88
5.51SerGlu: 5.51 ± 3.295
1.837SerPhe: 1.837 ± 0.921
4.132SerGly: 4.132 ± 2.971
0.918SerHis: 0.918 ± 1.423
2.755SerIle: 2.755 ± 3.342
4.591SerLys: 4.591 ± 1.138
2.755SerLeu: 2.755 ± 1.959
0.918SerMet: 0.918 ± 1.021
2.296SerAsn: 2.296 ± 0.978
2.755SerPro: 2.755 ± 1.381
2.755SerGln: 2.755 ± 0.901
3.214SerArg: 3.214 ± 2.342
4.591SerSer: 4.591 ± 2.874
3.673SerThr: 3.673 ± 0.918
2.296SerVal: 2.296 ± 1.151
0.459SerTrp: 0.459 ± 0.23
0.918SerTyr: 0.918 ± 0.46
0.0SerXaa: 0.0 ± 0.0
Thr
2.296ThrAla: 2.296 ± 0.978
1.377ThrCys: 1.377 ± 1.25
1.377ThrAsp: 1.377 ± 0.691
6.428ThrGlu: 6.428 ± 1.864
3.214ThrPhe: 3.214 ± 1.612
4.132ThrGly: 4.132 ± 1.372
0.0ThrHis: 0.0 ± 0.0
2.755ThrIle: 2.755 ± 1.959
6.887ThrLys: 6.887 ± 4.491
4.132ThrLeu: 4.132 ± 3.42
1.377ThrMet: 1.377 ± 0.691
2.755ThrAsn: 2.755 ± 1.381
0.459ThrPro: 0.459 ± 0.23
3.673ThrGln: 3.673 ± 0.918
4.132ThrArg: 4.132 ± 1.008
5.051ThrSer: 5.051 ± 1.297
3.673ThrThr: 3.673 ± 3.058
2.755ThrVal: 2.755 ± 0.901
1.377ThrTrp: 1.377 ± 0.691
1.377ThrTyr: 1.377 ± 0.691
0.0ThrXaa: 0.0 ± 0.0
Val
3.673ValAla: 3.673 ± 3.058
1.837ValCys: 1.837 ± 0.921
0.918ValAsp: 0.918 ± 1.423
6.428ValGlu: 6.428 ± 5.281
3.214ValPhe: 3.214 ± 1.612
3.214ValGly: 3.214 ± 1.612
0.918ValHis: 0.918 ± 2.864
2.296ValIle: 2.296 ± 1.151
6.428ValLys: 6.428 ± 1.702
3.673ValLeu: 3.673 ± 2.197
1.837ValMet: 1.837 ± 0.921
2.296ValAsn: 2.296 ± 1.151
1.837ValPro: 1.837 ± 0.921
2.755ValGln: 2.755 ± 1.381
2.755ValArg: 2.755 ± 1.381
2.296ValSer: 2.296 ± 1.151
2.755ValThr: 2.755 ± 1.381
2.755ValVal: 2.755 ± 1.364
0.0ValTrp: 0.0 ± 0.0
1.377ValTyr: 1.377 ± 0.691
0.0ValXaa: 0.0 ± 0.0
Trp
0.918TrpAla: 0.918 ± 0.46
0.0TrpCys: 0.0 ± 0.0
0.918TrpAsp: 0.918 ± 0.46
0.918TrpGlu: 0.918 ± 1.423
0.459TrpPhe: 0.459 ± 0.23
0.918TrpGly: 0.918 ± 0.46
0.0TrpHis: 0.0 ± 0.0
0.459TrpIle: 0.459 ± 0.23
2.755TrpLys: 2.755 ± 0.901
0.918TrpLeu: 0.918 ± 0.46
0.0TrpMet: 0.0 ± 0.0
0.918TrpAsn: 0.918 ± 0.46
0.459TrpPro: 0.459 ± 0.23
0.918TrpGln: 0.918 ± 0.46
0.459TrpArg: 0.459 ± 0.23
0.459TrpSer: 0.459 ± 0.23
0.0TrpThr: 0.0 ± 0.0
0.459TrpVal: 0.459 ± 0.23
0.0TrpTrp: 0.0 ± 0.0
0.459TrpTyr: 0.459 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.296TyrAla: 2.296 ± 1.437
0.459TyrCys: 0.459 ± 0.23
1.837TyrAsp: 1.837 ± 0.921
1.837TyrGlu: 1.837 ± 0.921
0.459TyrPhe: 0.459 ± 0.23
2.296TyrGly: 2.296 ± 1.151
0.0TyrHis: 0.0 ± 0.0
3.214TyrIle: 3.214 ± 1.612
4.132TyrLys: 4.132 ± 1.008
2.755TyrLeu: 2.755 ± 0.901
1.377TyrMet: 1.377 ± 0.691
1.837TyrAsn: 1.837 ± 0.921
2.755TyrPro: 2.755 ± 1.381
0.918TyrGln: 0.918 ± 0.46
2.296TyrArg: 2.296 ± 0.978
1.377TyrSer: 1.377 ± 0.691
0.459TyrThr: 0.459 ± 0.23
1.377TyrVal: 1.377 ± 0.691
0.459TyrTrp: 0.459 ± 0.23
0.459TyrTyr: 0.459 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2179 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski