Amino acid dipeptide frequency for Pelargonium ringspot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
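As a rough illustration of how a table like this could be computed, the sketch below counts overlapping residue pairs and converts the counts to per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted inside each protein only, so no pair
    spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        # Sliding window of length 2 over the one-letter sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code (not the real proteome).
toy = ["MALLV", "MAL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
expected_uniform = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > expected_uniform else "under"
    print(pair, round(value, 3), label)
```

With only a handful of residues, every observed pair lands far above 2.5‰, so the comparison with the uniform expectation only becomes informative for whole proteomes.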

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.453 ± 4.801
AlaCys: 0.678 ± 0.424
AlaAsp: 2.033 ± 0.703
AlaGlu: 3.388 ± 1.652
AlaPhe: 2.71 ± 0.494
AlaGly: 4.065 ± 1.476
AlaHis: 2.033 ± 1.331
AlaIle: 2.71 ± 1.025
AlaLys: 3.388 ± 1.229
AlaLeu: 6.098 ± 1.042
AlaMet: 0.0 ± 0.0
AlaAsn: 4.065 ± 2.256
AlaPro: 3.388 ± 1.671
AlaGln: 2.033 ± 2.031
AlaArg: 3.388 ± 1.183
AlaSer: 6.098 ± 2.065
AlaThr: 2.71 ± 1.144
AlaVal: 7.453 ± 1.279
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.388 ± 0.624
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.678 ± 0.677
CysCys: 0.0 ± 0.0
CysAsp: 1.355 ± 0.724
CysGlu: 0.678 ± 0.424
CysPhe: 0.0 ± 0.0
CysGly: 1.355 ± 0.544
CysHis: 0.678 ± 0.424
CysIle: 3.388 ± 0.974
CysLys: 2.71 ± 1.449
CysLeu: 0.678 ± 0.424
CysMet: 0.678 ± 0.424
CysAsn: 0.0 ± 0.0
CysPro: 0.678 ± 0.424
CysGln: 0.678 ± 0.424
CysArg: 2.033 ± 0.678
CysSer: 0.678 ± 0.424
CysThr: 0.0 ± 0.0
CysVal: 0.678 ± 0.424
CysTrp: 1.355 ± 0.724
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.71 ± 1.088
AspCys: 1.355 ± 0.544
AspAsp: 1.355 ± 0.848
AspGlu: 0.0 ± 0.0
AspPhe: 0.0 ± 0.0
AspGly: 4.743 ± 2.126
AspHis: 0.0 ± 0.0
AspIle: 3.388 ± 1.408
AspLys: 4.743 ± 0.827
AspLeu: 3.388 ± 1.754
AspMet: 1.355 ± 0.848
AspAsn: 0.678 ± 1.196
AspPro: 3.388 ± 0.868
AspGln: 3.388 ± 1.403
AspArg: 0.0 ± 0.0
AspSer: 4.743 ± 1.108
AspThr: 2.033 ± 1.348
AspVal: 4.065 ± 1.013
AspTrp: 0.0 ± 0.0
AspTyr: 2.033 ± 0.807
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.065 ± 0.746
GluCys: 1.355 ± 0.724
GluAsp: 1.355 ± 0.848
GluGlu: 2.033 ± 0.807
GluPhe: 6.775 ± 1.411
GluGly: 4.065 ± 0.945
GluHis: 2.033 ± 1.272
GluIle: 0.0 ± 0.0
GluLys: 2.033 ± 1.348
GluLeu: 10.163 ± 3.378
GluMet: 0.678 ± 0.424
GluAsn: 0.678 ± 0.677
GluPro: 1.355 ± 0.848
GluGln: 1.355 ± 0.544
GluArg: 3.388 ± 0.624
GluSer: 0.678 ± 0.677
GluThr: 2.033 ± 0.807
GluVal: 6.098 ± 1.637
GluTrp: 1.355 ± 0.848
GluTyr: 0.678 ± 0.424
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.678 ± 1.196
PheCys: 0.678 ± 0.424
PheAsp: 3.388 ± 1.028
PheGlu: 2.033 ± 0.807
PhePhe: 0.678 ± 1.196
PheGly: 5.42 ± 1.813
PheHis: 0.678 ± 0.424
PheIle: 2.033 ± 1.32
PheLys: 0.678 ± 0.677
PheLeu: 4.065 ± 1.476
PheMet: 0.678 ± 1.181
PheAsn: 0.678 ± 0.677
PhePro: 0.678 ± 0.677
PheGln: 1.355 ± 0.544
PheArg: 1.355 ± 0.544
PheSer: 6.098 ± 1.167
PheThr: 3.388 ± 0.624
PheVal: 6.098 ± 2.223
PheTrp: 0.678 ± 0.424
PheTyr: 2.033 ± 0.807
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.71 ± 2.438
GlyCys: 1.355 ± 0.544
GlyAsp: 2.033 ± 0.703
GlyGlu: 7.453 ± 1.776
GlyPhe: 4.065 ± 0.945
GlyGly: 3.388 ± 0.624
GlyHis: 0.678 ± 0.424
GlyIle: 2.033 ± 1.331
GlyLys: 4.065 ± 1.013
GlyLeu: 8.13 ± 2.632
GlyMet: 2.033 ± 0.807
GlyAsn: 5.42 ± 2.305
GlyPro: 2.033 ± 0.678
GlyGln: 1.355 ± 1.354
GlyArg: 2.71 ± 1.088
GlySer: 3.388 ± 2.167
GlyThr: 4.065 ± 1.013
GlyVal: 9.485 ± 2.986
GlyTrp: 0.678 ± 0.677
GlyTyr: 1.355 ± 0.848
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.678 ± 0.424
HisAsp: 2.033 ± 0.678
HisGlu: 0.0 ± 0.0
HisPhe: 2.033 ± 0.807
HisGly: 0.678 ± 0.424
HisHis: 0.0 ± 0.0
HisIle: 0.678 ± 0.677
HisLys: 2.033 ± 1.272
HisLeu: 2.71 ± 1.067
HisMet: 0.0 ± 0.0
HisAsn: 0.678 ± 0.424
HisPro: 0.0 ± 0.0
HisGln: 0.678 ± 0.424
HisArg: 2.033 ± 1.153
HisSer: 4.743 ± 1.76
HisThr: 0.678 ± 0.677
HisVal: 0.678 ± 0.424
HisTrp: 0.0 ± 0.0
HisTyr: 0.678 ± 0.424
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.678 ± 0.677
IleCys: 0.0 ± 0.0
IleAsp: 2.71 ± 0.494
IleGlu: 1.355 ± 0.848
IlePhe: 1.355 ± 0.848
IleGly: 4.065 ± 0.945
IleHis: 0.678 ± 0.677
IleIle: 0.678 ± 0.677
IleLys: 0.678 ± 0.424
IleLeu: 3.388 ± 1.408
IleMet: 0.0 ± 0.0
IleAsn: 1.355 ± 0.848
IlePro: 2.033 ± 0.807
IleGln: 2.033 ± 1.102
IleArg: 1.355 ± 0.848
IleSer: 5.42 ± 3.753
IleThr: 6.098 ± 2.892
IleVal: 4.743 ± 2.369
IleTrp: 0.0 ± 0.0
IleTyr: 4.065 ± 1.786
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.775 ± 1.769
LysCys: 1.355 ± 0.848
LysAsp: 2.033 ± 0.703
LysGlu: 4.065 ± 1.677
LysPhe: 4.743 ± 1.651
LysGly: 2.71 ± 1.25
LysHis: 1.355 ± 0.724
LysIle: 4.065 ± 2.544
LysLys: 3.388 ± 1.343
LysLeu: 3.388 ± 1.134
LysMet: 2.71 ± 1.06
LysAsn: 1.355 ± 0.724
LysPro: 4.065 ± 3.069
LysGln: 0.678 ± 0.677
LysArg: 3.388 ± 1.201
LysSer: 6.098 ± 1.986
LysThr: 6.775 ± 1.769
LysVal: 6.098 ± 1.74
LysTrp: 0.678 ± 0.424
LysTyr: 2.033 ± 0.703
LysXaa: 0.678 ± 0.424
Leu
LeuAla: 13.55 ± 1.528
LeuCys: 1.355 ± 1.074
LeuAsp: 1.355 ± 1.529
LeuGlu: 8.808 ± 1.922
LeuPhe: 3.388 ± 2.432
LeuGly: 7.453 ± 0.973
LeuHis: 0.678 ± 0.424
LeuIle: 2.033 ± 1.272
LeuLys: 5.42 ± 0.674
LeuLeu: 11.518 ± 3.136
LeuMet: 2.033 ± 0.807
LeuAsn: 4.743 ± 1.685
LeuPro: 2.71 ± 1.449
LeuGln: 2.71 ± 1.449
LeuArg: 4.065 ± 0.945
LeuSer: 8.808 ± 1.52
LeuThr: 7.453 ± 1.757
LeuVal: 10.163 ± 2.83
LeuTrp: 0.678 ± 1.196
LeuTyr: 2.033 ± 1.153
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.355 ± 1.354
MetCys: 0.0 ± 0.0
MetAsp: 0.678 ± 1.381
MetGlu: 1.355 ± 0.848
MetPhe: 0.678 ± 0.424
MetGly: 3.388 ± 1.474
MetHis: 0.678 ± 0.424
MetIle: 0.678 ± 0.424
MetLys: 2.033 ± 0.807
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.602
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.355 ± 0.848
MetSer: 2.71 ± 1.025
MetThr: 2.033 ± 0.678
MetVal: 0.678 ± 0.424
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.033 ± 0.703
AsnCys: 0.0 ± 0.0
AsnAsp: 2.033 ± 0.807
AsnGlu: 1.355 ± 0.724
AsnPhe: 1.355 ± 1.297
AsnGly: 2.71 ± 1.188
AsnHis: 2.033 ± 0.807
AsnIle: 0.0 ± 0.0
AsnLys: 2.033 ± 1.331
AsnLeu: 3.388 ± 0.624
AsnMet: 0.678 ± 0.424
AsnAsn: 1.355 ± 0.848
AsnPro: 4.065 ± 0.798
AsnGln: 1.355 ± 0.544
AsnArg: 2.033 ± 0.678
AsnSer: 4.743 ± 1.916
AsnThr: 2.71 ± 0.494
AsnVal: 3.388 ± 1.183
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.71 ± 1.273
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.355 ± 0.848
ProCys: 0.0 ± 0.0
ProAsp: 4.065 ± 0.746
ProGlu: 1.355 ± 0.724
ProPhe: 2.033 ± 0.807
ProGly: 3.388 ± 1.261
ProHis: 0.678 ± 0.677
ProIle: 4.743 ± 0.865
ProLys: 6.098 ± 2.475
ProLeu: 2.033 ± 0.678
ProMet: 0.0 ± 0.0
ProAsn: 2.71 ± 1.449
ProPro: 4.065 ± 1.786
ProGln: 0.678 ± 0.424
ProArg: 6.775 ± 2.55
ProSer: 2.71 ± 3.001
ProThr: 5.42 ± 1.459
ProVal: 3.388 ± 2.12
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.355 ± 0.544
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.71 ± 0.494
GlnPhe: 0.0 ± 0.0
GlnGly: 3.388 ± 1.183
GlnHis: 1.355 ± 0.544
GlnIle: 0.678 ± 0.424
GlnLys: 2.033 ± 1.919
GlnLeu: 1.355 ± 0.544
GlnMet: 2.033 ± 1.01
GlnAsn: 1.355 ± 0.724
GlnPro: 2.033 ± 0.703
GlnGln: 2.033 ± 1.112
GlnArg: 2.71 ± 1.144
GlnSer: 1.355 ± 1.354
GlnThr: 2.033 ± 0.678
GlnVal: 1.355 ± 1.354
GlnTrp: 0.678 ± 0.424
GlnTyr: 0.678 ± 0.424
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.098 ± 1.986
ArgCys: 2.71 ± 1.449
ArgAsp: 4.743 ± 1.703
ArgGlu: 3.388 ± 1.408
ArgPhe: 1.355 ± 0.544
ArgGly: 3.388 ± 1.492
ArgHis: 2.033 ± 1.272
ArgIle: 0.678 ± 0.677
ArgLys: 2.033 ± 1.272
ArgLeu: 6.098 ± 1.611
ArgMet: 1.355 ± 0.544
ArgAsn: 2.71 ± 1.088
ArgPro: 1.355 ± 0.724
ArgGln: 0.678 ± 0.424
ArgArg: 4.743 ± 1.155
ArgSer: 0.678 ± 0.677
ArgThr: 2.71 ± 1.025
ArgVal: 4.743 ± 0.848
ArgTrp: 2.71 ± 1.188
ArgTyr: 2.033 ± 0.703
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.065 ± 2.031
SerCys: 2.71 ± 1.449
SerAsp: 4.743 ± 0.865
SerGlu: 2.033 ± 0.703
SerPhe: 2.71 ± 0.978
SerGly: 4.743 ± 2.547
SerHis: 2.033 ± 0.807
SerIle: 7.453 ± 2.414
SerLys: 5.42 ± 1.616
SerLeu: 12.195 ± 4.178
SerMet: 0.678 ± 0.677
SerAsn: 2.71 ± 1.088
SerPro: 5.42 ± 1.045
SerGln: 4.065 ± 2.4
SerArg: 3.388 ± 1.671
SerSer: 7.453 ± 1.77
SerThr: 4.065 ± 1.972
SerVal: 4.065 ± 2.146
SerTrp: 0.0 ± 0.0
SerTyr: 1.355 ± 1.074
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.71 ± 1.449
ThrCys: 0.678 ± 0.424
ThrAsp: 2.71 ± 1.449
ThrGlu: 0.678 ± 0.677
ThrPhe: 6.098 ± 1.986
ThrGly: 2.033 ± 1.414
ThrHis: 3.388 ± 1.183
ThrIle: 3.388 ± 0.868
ThrLys: 4.065 ± 1.614
ThrLeu: 5.42 ± 0.961
ThrMet: 0.0 ± 0.0
ThrAsn: 2.033 ± 1.348
ThrPro: 8.13 ± 1.346
ThrGln: 1.355 ± 1.529
ThrArg: 4.743 ± 1.9
ThrSer: 4.065 ± 1.353
ThrThr: 5.42 ± 0.961
ThrVal: 6.098 ± 1.279
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.033 ± 2.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.098 ± 2.893
ValCys: 2.71 ± 1.144
ValAsp: 4.743 ± 1.28
ValGlu: 6.098 ± 2.433
ValPhe: 2.033 ± 1.414
ValGly: 5.42 ± 1.395
ValHis: 0.0 ± 0.0
ValIle: 2.71 ± 1.067
ValLys: 9.485 ± 1.594
ValLeu: 8.13 ± 4.562
ValMet: 2.71 ± 1.024
ValAsn: 3.388 ± 1.474
ValPro: 5.42 ± 0.674
ValGln: 2.71 ± 0.494
ValArg: 4.743 ± 1.485
ValSer: 6.775 ± 1.142
ValThr: 2.71 ± 2.657
ValVal: 6.775 ± 2.281
ValTrp: 2.033 ± 0.678
ValTyr: 2.71 ± 1.025
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.033 ± 0.678
TrpCys: 0.678 ± 0.424
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.355 ± 0.848
TrpHis: 0.0 ± 0.0
TrpIle: 0.678 ± 0.424
TrpLys: 2.033 ± 0.678
TrpLeu: 3.388 ± 1.201
TrpMet: 0.0 ± 0.0
TrpAsn: 0.678 ± 0.424
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.678 ± 1.196
TrpThr: 0.0 ± 0.0
TrpVal: 0.678 ± 0.424
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.678 ± 0.424
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 3.388 ± 1.474
TyrPhe: 1.355 ± 0.544
TyrGly: 0.0 ± 0.0
TyrHis: 0.0 ± 0.0
TyrIle: 0.678 ± 0.424
TyrLys: 4.065 ± 1.801
TyrLeu: 5.42 ± 1.336
TyrMet: 0.0 ± 0.0
TyrAsn: 3.388 ± 0.624
TyrPro: 0.678 ± 0.424
TyrGln: 0.0 ± 0.0
TyrArg: 2.033 ± 1.272
TyrSer: 3.388 ± 3.084
TyrThr: 2.71 ± 1.088
TyrVal: 0.678 ± 0.677
TyrTrp: 1.355 ± 0.848
TyrTyr: 3.388 ± 3.411
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.678 ± 0.424
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1477 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
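The exact bootstrap procedure is not described here, so the following is only a minimal sketch of what a protein-level bootstrap could look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all assumptions of this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code (not the real proteome).
toy = ["MALLV", "MAL", "GAVLL"]
print(round(bootstrap_error(toy, "LL"), 3))
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the set; with only 5 proteins, as here, such errors can be large relative to the values.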

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski