Amino acid dipeptide frequency for Oat chlorotic stunt virus (isolate United Kingdom) (OCSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
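
The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the toy sequences, and the choices to count overlapping pairs only within a protein and to map every non-standard letter to `X` (Xaa) are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Overlapping pairs are counted inside each sequence only, and any
    non-standard letter is mapped to 'X' (Xaa). Both choices are
    assumptions of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1/400 for 20 residues, 1/441 when Xaa is included.
expected_20 = 1000.0 / 20 ** 2   # 2.5 per mille (0.25%)
expected_21 = 1000.0 / 21 ** 2   # roughly 2.27 per mille

toy = ["MAAL", "AAK"]            # hypothetical sequences, not OCSV data
freqs = dipeptide_frequencies(toy)
```

For the toy input, `AA` occurs in 2 of the 5 pairs, so `freqs["AA"]` is 400 per mille; real proteomes give far smaller values because the counts are spread over hundreds of possible dipeptides.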

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
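
As a small worked example of the unit handling, the sketch below converts a per-mille value to a fraction and labels it relative to the uniform expectation of 2.5‰. The helper names are hypothetical, and using 1/400 as the threshold is an assumption; the page does not state the exact reference value behind its red and blue marking.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille frequency relative to the assumed expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table: LeuGly is 14.073, AlaCys is 0.0
print(per_mille_to_fraction(14.073))
print(representation(14.073), representation(0.0))
```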

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.255 ± 4.304
AlaCys: 0.0 ± 0.0
AlaAsp: 3.127 ± 0.8
AlaGlu: 3.909 ± 2.038
AlaPhe: 2.346 ± 0.518
AlaGly: 4.691 ± 2.009
AlaHis: 0.782 ± 0.442
AlaIle: 5.473 ± 1.37
AlaLys: 1.564 ± 0.4
AlaLeu: 10.164 ± 0.909
AlaMet: 0.782 ± 0.442
AlaAsn: 3.909 ± 1.288
AlaPro: 5.473 ± 1.686
AlaGln: 2.346 ± 1.327
AlaArg: 2.346 ± 0.518
AlaSer: 3.127 ± 2.391
AlaThr: 3.909 ± 2.121
AlaVal: 6.255 ± 2.376
AlaTrp: 2.346 ± 0.518
AlaTyr: 3.127 ± 0.8
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.782 ± 0.442
CysCys: 0.0 ± 0.0
CysAsp: 2.346 ± 1.004
CysGlu: 1.564 ± 0.4
CysPhe: 0.782 ± 0.665
CysGly: 0.0 ± 0.0
CysHis: 1.564 ± 0.4
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.782 ± 2.664
CysMet: 1.564 ± 0.884
CysAsn: 0.782 ± 0.442
CysPro: 0.782 ± 0.442
CysGln: 1.564 ± 2.499
CysArg: 1.564 ± 0.884
CysSer: 0.0 ± 0.0
CysThr: 1.564 ± 1.33
CysVal: 5.473 ± 1.37
CysTrp: 0.782 ± 2.664
CysTyr: 1.564 ± 0.884
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.346 ± 0.518
AspCys: 1.564 ± 0.884
AspAsp: 1.564 ± 0.884
AspGlu: 1.564 ± 0.884
AspPhe: 0.0 ± 0.0
AspGly: 2.346 ± 0.518
AspHis: 1.564 ± 0.4
AspIle: 1.564 ± 0.884
AspLys: 1.564 ± 0.884
AspLeu: 4.691 ± 2.009
AspMet: 1.564 ± 0.799
AspAsn: 2.346 ± 1.004
AspPro: 2.346 ± 1.327
AspGln: 2.346 ± 1.327
AspArg: 1.564 ± 0.884
AspSer: 4.691 ± 1.036
AspThr: 3.127 ± 0.876
AspVal: 0.782 ± 0.442
AspTrp: 0.0 ± 0.0
AspTyr: 0.782 ± 0.665
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.346 ± 1.327
GluCys: 0.782 ± 0.665
GluAsp: 2.346 ± 1.327
GluGlu: 3.909 ± 2.457
GluPhe: 2.346 ± 0.518
GluGly: 1.564 ± 0.884
GluHis: 2.346 ± 1.327
GluIle: 0.782 ± 0.665
GluLys: 3.909 ± 0.813
GluLeu: 6.255 ± 1.612
GluMet: 2.346 ± 1.319
GluAsn: 1.564 ± 0.4
GluPro: 2.346 ± 0.518
GluGln: 2.346 ± 0.518
GluArg: 3.127 ± 1.769
GluSer: 0.782 ± 0.665
GluThr: 1.564 ± 0.884
GluVal: 0.782 ± 0.665
GluTrp: 0.782 ± 0.442
GluTyr: 0.782 ± 0.442
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.346 ± 0.518
PheCys: 0.782 ± 0.442
PheAsp: 4.691 ± 1.036
PheGlu: 2.346 ± 0.518
PhePhe: 0.782 ± 0.442
PheGly: 1.564 ± 0.884
PheHis: 1.564 ± 0.4
PheIle: 3.909 ± 0.813
PheLys: 2.346 ± 0.518
PheLeu: 3.909 ± 2.457
PheMet: 0.782 ± 0.442
PheAsn: 2.346 ± 1.327
PhePro: 1.564 ± 0.4
PheGln: 0.0 ± 0.0
PheArg: 1.564 ± 1.33
PheSer: 3.127 ± 0.876
PheThr: 2.346 ± 1.995
PheVal: 4.691 ± 2.598
PheTrp: 0.0 ± 0.0
PheTyr: 2.346 ± 0.518
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.255 ± 1.599
GlyCys: 3.127 ± 4.998
GlyAsp: 3.127 ± 0.876
GlyGlu: 1.564 ± 0.4
GlyPhe: 2.346 ± 1.004
GlyGly: 3.909 ± 1.377
GlyHis: 1.564 ± 0.884
GlyIle: 0.782 ± 0.665
GlyLys: 4.691 ± 1.036
GlyLeu: 7.037 ± 3.022
GlyMet: 3.127 ± 1.474
GlyAsn: 3.127 ± 0.876
GlyPro: 0.782 ± 0.665
GlyGln: 1.564 ± 0.4
GlyArg: 4.691 ± 1.036
GlySer: 5.473 ± 1.919
GlyThr: 7.819 ± 4.631
GlyVal: 8.6 ± 1.809
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.564 ± 1.33
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.127 ± 0.8
HisCys: 0.782 ± 0.665
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.564 ± 0.884
HisHis: 0.782 ± 0.442
HisIle: 0.782 ± 0.442
HisLys: 0.782 ± 0.442
HisLeu: 1.564 ± 0.4
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.782 ± 0.442
HisGln: 1.564 ± 0.884
HisArg: 3.127 ± 2.168
HisSer: 1.564 ± 0.4
HisThr: 0.782 ± 0.442
HisVal: 3.127 ± 0.8
HisTrp: 0.782 ± 0.442
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.691 ± 1.864
IleCys: 0.0 ± 0.0
IleAsp: 1.564 ± 0.4
IleGlu: 3.127 ± 0.876
IlePhe: 3.127 ± 0.876
IleGly: 2.346 ± 0.518
IleHis: 0.782 ± 0.442
IleIle: 2.346 ± 2.374
IleLys: 4.691 ± 1.199
IleLeu: 3.127 ± 4.998
IleMet: 0.0 ± 0.0
IleAsn: 3.127 ± 2.66
IlePro: 3.127 ± 0.8
IleGln: 2.346 ± 1.327
IleArg: 0.0 ± 0.0
IleSer: 4.691 ± 1.199
IleThr: 2.346 ± 2.374
IleVal: 1.564 ± 0.4
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.127 ± 0.8
LysCys: 3.127 ± 2.168
LysAsp: 1.564 ± 0.884
LysGlu: 1.564 ± 0.884
LysPhe: 3.127 ± 0.876
LysGly: 3.909 ± 2.211
LysHis: 0.782 ± 0.442
LysIle: 0.782 ± 0.665
LysLys: 3.909 ± 1.377
LysLeu: 6.255 ± 1.599
LysMet: 0.782 ± 0.665
LysAsn: 2.346 ± 1.327
LysPro: 1.564 ± 0.4
LysGln: 1.564 ± 1.33
LysArg: 3.909 ± 2.211
LysSer: 1.564 ± 0.884
LysThr: 3.909 ± 1.288
LysVal: 3.909 ± 0.813
LysTrp: 1.564 ± 0.884
LysTyr: 3.909 ± 1.288
LysXaa: 0.782 ± 0.442
Leu
LeuAla: 7.037 ± 4.393
LeuCys: 0.0 ± 0.0
LeuAsp: 2.346 ± 1.327
LeuGlu: 9.382 ± 5.306
LeuPhe: 4.691 ± 1.036
LeuGly: 14.073 ± 4.331
LeuHis: 1.564 ± 2.499
LeuIle: 6.255 ± 7.696
LeuLys: 3.127 ± 1.769
LeuLeu: 8.6 ± 1.086
LeuMet: 1.564 ± 0.884
LeuAsn: 3.909 ± 0.813
LeuPro: 8.6 ± 3.97
LeuGln: 0.782 ± 2.664
LeuArg: 4.691 ± 1.715
LeuSer: 10.164 ± 9.267
LeuThr: 3.909 ± 1.377
LeuVal: 6.255 ± 1.599
LeuTrp: 0.782 ± 0.442
LeuTyr: 0.782 ± 0.665
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.346 ± 0.518
MetCys: 2.346 ± 0.518
MetAsp: 0.782 ± 0.442
MetGlu: 1.564 ± 0.884
MetPhe: 0.782 ± 0.442
MetGly: 1.564 ± 0.4
MetHis: 0.0 ± 0.0
MetIle: 1.564 ± 0.884
MetLys: 0.782 ± 0.442
MetLeu: 0.782 ± 0.665
MetMet: 1.564 ± 0.4
MetAsn: 0.782 ± 0.442
MetPro: 1.564 ± 0.4
MetGln: 0.0 ± 0.0
MetArg: 2.346 ± 2.405
MetSer: 5.473 ± 1.919
MetThr: 1.564 ± 0.4
MetVal: 0.782 ± 0.442
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.127 ± 0.8
AsnCys: 1.564 ± 0.4
AsnAsp: 0.782 ± 0.442
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.782 ± 2.664
AsnGly: 3.909 ± 1.288
AsnHis: 0.782 ± 0.665
AsnIle: 3.909 ± 0.813
AsnLys: 2.346 ± 0.518
AsnLeu: 4.691 ± 1.036
AsnMet: 3.127 ± 0.8
AsnAsn: 2.346 ± 1.327
AsnPro: 2.346 ± 0.518
AsnGln: 0.0 ± 0.0
AsnArg: 2.346 ± 1.004
AsnSer: 5.473 ± 1.761
AsnThr: 3.127 ± 0.876
AsnVal: 2.346 ± 0.518
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.909 ± 2.315
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.909 ± 1.377
ProCys: 0.782 ± 0.442
ProAsp: 2.346 ± 1.327
ProGlu: 0.782 ± 0.442
ProPhe: 1.564 ± 0.884
ProGly: 4.691 ± 2.009
ProHis: 0.0 ± 0.0
ProIle: 1.564 ± 2.499
ProLys: 1.564 ± 0.884
ProLeu: 7.037 ± 10.01
ProMet: 3.127 ± 0.876
ProAsn: 1.564 ± 0.884
ProPro: 0.782 ± 0.442
ProGln: 0.782 ± 0.665
ProArg: 4.691 ± 2.653
ProSer: 6.255 ± 1.599
ProThr: 5.473 ± 1.761
ProVal: 4.691 ± 1.036
ProTrp: 2.346 ± 1.004
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.346 ± 0.518
GlnCys: 0.782 ± 0.442
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.346 ± 1.004
GlnPhe: 1.564 ± 0.884
GlnGly: 1.564 ± 0.4
GlnHis: 0.782 ± 0.442
GlnIle: 1.564 ± 0.4
GlnLys: 0.0 ± 0.0
GlnLeu: 5.473 ± 4.531
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.782 ± 0.442
GlnGln: 2.346 ± 0.518
GlnArg: 0.782 ± 0.442
GlnSer: 0.782 ± 0.442
GlnThr: 0.0 ± 0.0
GlnVal: 1.564 ± 1.33
GlnTrp: 0.782 ± 0.442
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.909 ± 1.288
ArgCys: 1.564 ± 1.33
ArgAsp: 0.782 ± 0.442
ArgGlu: 1.564 ± 0.884
ArgPhe: 1.564 ± 0.4
ArgGly: 1.564 ± 0.4
ArgHis: 0.782 ± 0.442
ArgIle: 3.127 ± 0.876
ArgLys: 5.473 ± 2.148
ArgLeu: 3.909 ± 0.813
ArgMet: 2.346 ± 2.405
ArgAsn: 2.346 ± 0.518
ArgPro: 3.127 ± 1.769
ArgGln: 0.782 ± 0.665
ArgArg: 3.909 ± 1.377
ArgSer: 4.691 ± 1.864
ArgThr: 4.691 ± 2.598
ArgVal: 5.473 ± 3.095
ArgTrp: 1.564 ± 5.327
ArgTyr: 0.782 ± 0.442
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.127 ± 2.432
SerCys: 0.782 ± 0.442
SerAsp: 2.346 ± 1.004
SerGlu: 1.564 ± 0.4
SerPhe: 3.127 ± 0.876
SerGly: 7.037 ± 1.551
SerHis: 3.127 ± 0.8
SerIle: 1.564 ± 1.33
SerLys: 3.127 ± 0.8
SerLeu: 7.037 ± 4.393
SerMet: 0.0 ± 0.0
SerAsn: 6.255 ± 2.376
SerPro: 4.691 ± 1.999
SerGln: 0.782 ± 0.665
SerArg: 7.037 ± 4.185
SerSer: 8.6 ± 1.734
SerThr: 11.728 ± 3.992
SerVal: 6.255 ± 2.204
SerTrp: 1.564 ± 0.884
SerTyr: 3.127 ± 0.8
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.6 ± 1.938
ThrCys: 3.909 ± 1.288
ThrAsp: 3.127 ± 1.769
ThrGlu: 2.346 ± 2.374
ThrPhe: 3.909 ± 0.813
ThrGly: 7.037 ± 3.97
ThrHis: 1.564 ± 0.4
ThrIle: 2.346 ± 1.004
ThrLys: 2.346 ± 0.518
ThrLeu: 6.255 ± 1.599
ThrMet: 1.564 ± 0.884
ThrAsn: 3.909 ± 3.448
ThrPro: 4.691 ± 1.199
ThrGln: 0.0 ± 0.0
ThrArg: 2.346 ± 2.405
ThrSer: 5.473 ± 1.172
ThrThr: 7.037 ± 3.97
ThrVal: 8.6 ± 3.655
ThrTrp: 0.782 ± 0.442
ThrTyr: 1.564 ± 0.4
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.691 ± 1.999
ValCys: 1.564 ± 0.4
ValAsp: 4.691 ± 1.036
ValGlu: 2.346 ± 1.995
ValPhe: 6.255 ± 1.753
ValGly: 3.909 ± 2.121
ValHis: 0.782 ± 0.665
ValIle: 3.909 ± 1.288
ValLys: 7.819 ± 1.813
ValLeu: 3.127 ± 0.876
ValMet: 1.564 ± 0.4
ValAsn: 3.127 ± 1.656
ValPro: 7.819 ± 3.1
ValGln: 0.0 ± 0.0
ValArg: 2.346 ± 1.327
ValSer: 7.819 ± 1.626
ValThr: 7.819 ± 1.877
ValVal: 11.728 ± 4.13
ValTrp: 0.0 ± 0.0
ValTyr: 3.127 ± 0.876
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.782 ± 0.442
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 3.127 ± 2.391
TrpGly: 0.782 ± 0.442
TrpHis: 0.0 ± 0.0
TrpIle: 0.782 ± 0.442
TrpLys: 0.0 ± 0.0
TrpLeu: 3.127 ± 2.168
TrpMet: 0.0 ± 0.0
TrpAsn: 1.564 ± 0.884
TrpPro: 0.782 ± 0.442
TrpGln: 1.564 ± 0.4
TrpArg: 0.0 ± 0.0
TrpSer: 0.782 ± 0.665
TrpThr: 0.0 ± 0.0
TrpVal: 0.782 ± 2.664
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.782 ± 0.665
TyrCys: 0.0 ± 0.0
TyrAsp: 0.782 ± 0.665
TyrGlu: 2.346 ± 0.518
TyrPhe: 0.782 ± 0.665
TyrGly: 2.346 ± 0.518
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 3.909 ± 1.288
TyrLeu: 4.691 ± 1.036
TyrMet: 0.0 ± 0.0
TyrAsn: 1.564 ± 0.4
TyrPro: 0.0 ± 0.0
TyrGln: 0.782 ± 0.442
TyrArg: 1.564 ± 1.33
TyrSer: 2.346 ± 1.004
TyrThr: 4.691 ± 1.199
TyrVal: 0.782 ± 0.442
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.782 ± 0.442
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.782 ± 0.442
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1280 amino acids)
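
To work with the table entries programmatically, one option is a small parser such as the sketch below. The regular expression and the name `parse_entries` are illustrative assumptions; the pattern uses `search`, so it also tolerates a value fused in front of the dipeptide name.

```python
import re

# Matches e.g. "AlaAla: 6.255 ± 4.304". Because search() is used, any
# text before the dipeptide name (such as a repeated value) is skipped.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entries(lines):
    """Return {(first, second): (frequency, error)}, both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, freq, err = match.groups()
            table[(first, second)] = (float(freq), float(err))
    return table


# Three lines copied from the table; the bare row label "Ala" is skipped.
rows = ["Ala", "AlaAla: 6.255 ± 4.304", "LeuGly: 14.073 ± 4.331"]
table = parse_entries(rows)
top = max(table, key=lambda key: table[key][0])  # most frequent dipeptide
```

Keying the dictionary by a `(first, second)` tuple keeps both row and column lookups simple, for example when collecting every dipeptide that starts with Leu.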

Note: The error was estimated by bootstrapping (×100) at the protein level.
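
One way such a protein-level bootstrap could work is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed each time, and the spread of the estimates is reported. The exact Proteome-pI procedure is not described here, so the use of the standard deviation, the fixed seed, and the function names are assumptions, and the sequences are toy data.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MAALGV", "AAKLL", "MGLAA"]   # hypothetical sequences, not OCSV data
error = bootstrap_error(toy, "AA")
```

With only 3 proteins, as in this proteome, each resample can differ strongly from the original set, which may explain why some errors in the table exceed the frequency itself.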

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski