Amino acid dipeptide frequency for Honeysuckle yellow vein virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
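As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues and converts the counts to per mille. It is only a minimal example under stated assumptions: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes shown in the table, and pairs are counted within each protein only (the exact counting convention used by the database is not stated here).

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein and never
    across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the virus proteome.
toy = ["MSSAL", "ALSS"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction, the frequencies of all observed dipeptides sum to 1000‰.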

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
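The uniform expectation described above is simple arithmetic, and the following sketch shows it together with a comparison against two values from the table (SerSer and AlaCys). The function names are hypothetical, and the strict above/below comparison is an assumption; the exact rule the page uses for colouring cells is not stated here.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_permille(20))  # 400 dipeptides
print(uniform_expectation_permille(21))  # 441 dipeptides, counting Xaa
print("SerSer", classify(11.851))
print("AlaCys", classify(0.912))
```

With 21 residue types the expectation drops slightly, to roughly 2.27‰ per dipeptide.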

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue; within each group the second residue follows the order below. Each entry reads dipeptide: frequency ± error (‰).

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.735 ± 1.3
AlaCys: 0.912 ± 0.709
AlaAsp: 0.912 ± 0.709
AlaGlu: 1.823 ± 1.312
AlaPhe: 1.823 ± 1.139
AlaGly: 0.0 ± 0.0
AlaHis: 2.735 ± 1.2
AlaIle: 2.735 ± 1.128
AlaLys: 3.646 ± 1.084
AlaLeu: 8.204 ± 2.038
AlaMet: 0.0 ± 0.0
AlaAsn: 2.735 ± 0.903
AlaPro: 3.646 ± 1.822
AlaGln: 5.469 ± 1.812
AlaArg: 2.735 ± 1.968
AlaSer: 4.558 ± 2.241
AlaThr: 7.293 ± 2.046
AlaVal: 1.823 ± 1.398
AlaTrp: 2.735 ± 1.3
AlaTyr: 1.823 ± 0.894
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.912 ± 0.923
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.823 ± 1.219
CysPhe: 1.823 ± 1.756
CysGly: 1.823 ± 0.92
CysHis: 0.0 ± 0.0
CysIle: 1.823 ± 0.676
CysLys: 0.912 ± 0.709
CysLeu: 0.0 ± 0.0
CysMet: 0.912 ± 1.102
CysAsn: 0.912 ± 0.656
CysPro: 1.823 ± 2.204
CysGln: 1.823 ± 1.028
CysArg: 1.823 ± 1.028
CysSer: 2.735 ± 1.723
CysThr: 1.823 ± 0.676
CysVal: 0.912 ± 0.709
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.912 ± 0.656
AspCys: 0.0 ± 0.0
AspAsp: 0.912 ± 0.656
AspGlu: 1.823 ± 0.676
AspPhe: 4.558 ± 2.034
AspGly: 2.735 ± 1.968
AspHis: 0.912 ± 0.878
AspIle: 5.469 ± 1.733
AspLys: 0.912 ± 0.656
AspLeu: 5.469 ± 2.089
AspMet: 0.912 ± 0.603
AspAsn: 0.912 ± 0.878
AspPro: 1.823 ± 1.028
AspGln: 0.912 ± 0.656
AspArg: 3.646 ± 1.244
AspSer: 7.293 ± 1.374
AspThr: 1.823 ± 1.426
AspVal: 4.558 ± 1.183
AspTrp: 0.912 ± 0.656
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.381 ± 2.808
GluCys: 0.912 ± 1.102
GluAsp: 3.646 ± 2.057
GluGlu: 6.381 ± 3.941
GluPhe: 2.735 ± 1.478
GluGly: 3.646 ± 1.092
GluHis: 0.0 ± 0.0
GluIle: 1.823 ± 1.398
GluLys: 4.558 ± 0.912
GluLeu: 3.646 ± 1.111
GluMet: 0.0 ± 0.0
GluAsn: 4.558 ± 2.562
GluPro: 1.823 ± 1.139
GluGln: 3.646 ± 1.204
GluArg: 0.0 ± 0.0
GluSer: 0.912 ± 0.923
GluThr: 2.735 ± 1.326
GluVal: 0.912 ± 0.819
GluTrp: 1.823 ± 0.92
GluTyr: 1.823 ± 0.894
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.823 ± 0.92
PheCys: 0.912 ± 0.709
PheAsp: 2.735 ± 1.128
PheGlu: 1.823 ± 0.676
PhePhe: 0.912 ± 0.656
PheGly: 0.912 ± 0.656
PheHis: 2.735 ± 1.326
PheIle: 1.823 ± 1.312
PheLys: 4.558 ± 3.34
PheLeu: 4.558 ± 1.711
PheMet: 0.912 ± 0.656
PheAsn: 2.735 ± 1.782
PhePro: 0.912 ± 1.102
PheGln: 3.646 ± 1.697
PheArg: 2.735 ± 1.302
PheSer: 1.823 ± 0.959
PheThr: 1.823 ± 1.139
PheVal: 2.735 ± 1.968
PheTrp: 1.823 ± 1.418
PheTyr: 0.912 ± 0.709
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.469 ± 2.268
GlyCys: 3.646 ± 1.697
GlyAsp: 2.735 ± 1.304
GlyGlu: 0.0 ± 0.0
GlyPhe: 2.735 ± 1.499
GlyGly: 2.735 ± 1.128
GlyHis: 2.735 ± 0.903
GlyIle: 2.735 ± 0.903
GlyLys: 4.558 ± 1.74
GlyLeu: 0.912 ± 0.709
GlyMet: 0.912 ± 0.923
GlyAsn: 1.823 ± 0.959
GlyPro: 2.735 ± 1.128
GlyGln: 1.823 ± 0.959
GlyArg: 0.912 ± 0.656
GlySer: 3.646 ± 1.038
GlyThr: 4.558 ± 1.087
GlyVal: 2.735 ± 1.181
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.912 ± 1.102
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.823 ± 1.034
HisCys: 2.735 ± 1.2
HisAsp: 0.912 ± 0.878
HisGlu: 1.823 ± 0.92
HisPhe: 2.735 ± 1.3
HisGly: 2.735 ± 2.135
HisHis: 2.735 ± 1.949
HisIle: 1.823 ± 0.998
HisLys: 1.823 ± 1.398
HisLeu: 2.735 ± 1.3
HisMet: 0.912 ± 0.819
HisAsn: 3.646 ± 1.243
HisPro: 1.823 ± 0.894
HisGln: 1.823 ± 0.676
HisArg: 4.558 ± 2.144
HisSer: 3.646 ± 1.535
HisThr: 1.823 ± 1.418
HisVal: 2.735 ± 1.422
HisTrp: 0.0 ± 0.0
HisTyr: 0.912 ± 0.656
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.912 ± 0.709
IleCys: 0.912 ± 0.656
IleAsp: 3.646 ± 1.848
IleGlu: 0.912 ± 0.656
IlePhe: 3.646 ± 1.092
IleGly: 2.735 ± 1.662
IleHis: 0.912 ± 0.878
IleIle: 3.646 ± 1.204
IleLys: 7.293 ± 1.519
IleLeu: 2.735 ± 1.478
IleMet: 1.823 ± 1.37
IleAsn: 4.558 ± 1.322
IlePro: 0.912 ± 0.656
IleGln: 7.293 ± 2.163
IleArg: 4.558 ± 0.901
IleSer: 3.646 ± 1.874
IleThr: 3.646 ± 2.613
IleVal: 0.912 ± 0.656
IleTrp: 2.735 ± 1.782
IleTyr: 1.823 ± 0.92
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.558 ± 1.937
LysCys: 0.912 ± 0.878
LysAsp: 1.823 ± 1.312
LysGlu: 3.646 ± 1.038
LysPhe: 0.912 ± 0.878
LysGly: 1.823 ± 0.676
LysHis: 0.912 ± 0.656
LysIle: 4.558 ± 1.718
LysLys: 1.823 ± 0.959
LysLeu: 2.735 ± 0.789
LysMet: 0.0 ± 0.0
LysAsn: 4.558 ± 1.74
LysPro: 1.823 ± 0.676
LysGln: 1.823 ± 1.219
LysArg: 5.469 ± 2.598
LysSer: 5.469 ± 0.931
LysThr: 5.469 ± 1.67
LysVal: 2.735 ± 1.657
LysTrp: 0.0 ± 0.0
LysTyr: 4.558 ± 0.901
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.823 ± 1.219
LeuCys: 4.558 ± 1.52
LeuAsp: 5.469 ± 1.995
LeuGlu: 7.293 ± 2.37
LeuPhe: 2.735 ± 0.903
LeuGly: 1.823 ± 1.224
LeuHis: 3.646 ± 1.018
LeuIle: 3.646 ± 1.917
LeuLys: 4.558 ± 1.168
LeuLeu: 6.381 ± 4.602
LeuMet: 0.0 ± 0.0
LeuAsn: 3.646 ± 1.285
LeuPro: 1.823 ± 1.07
LeuGln: 3.646 ± 1.111
LeuArg: 5.469 ± 2.329
LeuSer: 3.646 ± 2.057
LeuThr: 3.646 ± 1.092
LeuVal: 2.735 ± 0.923
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.469 ± 2.278
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.823 ± 0.676
MetCys: 0.0 ± 0.0
MetAsp: 1.823 ± 1.034
MetGlu: 3.646 ± 1.183
MetPhe: 0.912 ± 0.709
MetGly: 2.735 ± 1.302
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.912 ± 1.102
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.912 ± 0.819
MetGln: 0.912 ± 0.923
MetArg: 0.0 ± 0.0
MetSer: 2.735 ± 0.877
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.823 ± 1.028
MetTyr: 2.735 ± 2.127
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.646 ± 1.217
AsnCys: 0.912 ± 0.923
AsnAsp: 3.646 ± 1.111
AsnGlu: 1.823 ± 0.676
AsnPhe: 0.912 ± 0.709
AsnGly: 1.823 ± 1.034
AsnHis: 6.381 ± 3.27
AsnIle: 1.823 ± 0.676
AsnLys: 1.823 ± 0.676
AsnLeu: 3.646 ± 1.789
AsnMet: 2.735 ± 1.487
AsnAsn: 2.735 ± 1.319
AsnPro: 2.735 ± 0.789
AsnGln: 1.823 ± 0.894
AsnArg: 0.912 ± 0.709
AsnSer: 3.646 ± 2.057
AsnThr: 2.735 ± 1.017
AsnVal: 5.469 ± 1.444
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.735 ± 1.304
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.912 ± 0.709
ProCys: 1.823 ± 1.219
ProAsp: 1.823 ± 1.219
ProGlu: 1.823 ± 1.028
ProPhe: 1.823 ± 0.894
ProGly: 1.823 ± 0.998
ProHis: 3.646 ± 1.822
ProIle: 3.646 ± 1.697
ProLys: 3.646 ± 1.717
ProLeu: 4.558 ± 1.232
ProMet: 1.823 ± 0.676
ProAsn: 1.823 ± 0.894
ProPro: 1.823 ± 1.312
ProGln: 3.646 ± 1.991
ProArg: 5.469 ± 1.683
ProSer: 3.646 ± 2.171
ProThr: 6.381 ± 2.661
ProVal: 2.735 ± 1.662
ProTrp: 0.912 ± 0.656
ProTyr: 1.823 ± 1.418
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.735 ± 1.54
GlnCys: 0.912 ± 0.656
GlnAsp: 3.646 ± 1.77
GlnGlu: 4.558 ± 1.941
GlnPhe: 2.735 ± 1.3
GlnGly: 2.735 ± 1.326
GlnHis: 1.823 ± 1.07
GlnIle: 5.469 ± 2.534
GlnLys: 0.912 ± 1.102
GlnLeu: 1.823 ± 0.894
GlnMet: 0.912 ± 0.819
GlnAsn: 2.735 ± 1.499
GlnPro: 6.381 ± 4.188
GlnGln: 0.0 ± 0.0
GlnArg: 3.646 ± 1.018
GlnSer: 4.558 ± 1.168
GlnThr: 1.823 ± 1.249
GlnVal: 4.558 ± 1.672
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.912 ± 0.709
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.735 ± 1.422
ArgCys: 0.912 ± 1.102
ArgAsp: 5.469 ± 1.845
ArgGlu: 3.646 ± 2.057
ArgPhe: 4.558 ± 1.84
ArgGly: 3.646 ± 0.866
ArgHis: 4.558 ± 1.846
ArgIle: 3.646 ± 1.848
ArgLys: 0.912 ± 0.709
ArgLeu: 3.646 ± 1.731
ArgMet: 1.823 ± 1.418
ArgAsn: 0.912 ± 0.878
ArgPro: 6.381 ± 0.906
ArgGln: 2.735 ± 1.549
ArgArg: 7.293 ± 4.556
ArgSer: 6.381 ± 1.865
ArgThr: 1.823 ± 0.894
ArgVal: 3.646 ± 1.728
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.912 ± 1.102
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.381 ± 2.737
SerCys: 0.912 ± 1.102
SerAsp: 2.735 ± 0.903
SerGlu: 2.735 ± 2.376
SerPhe: 0.912 ± 0.656
SerGly: 4.558 ± 1.3
SerHis: 3.646 ± 2.822
SerIle: 3.646 ± 1.183
SerLys: 6.381 ± 1.582
SerLeu: 4.558 ± 1.792
SerMet: 0.912 ± 0.819
SerAsn: 4.558 ± 1.747
SerPro: 9.116 ± 2.135
SerGln: 0.0 ± 0.0
SerArg: 4.558 ± 2.062
SerSer: 11.851 ± 3.796
SerThr: 7.293 ± 3.232
SerVal: 3.646 ± 2.105
SerTrp: 0.912 ± 0.709
SerTyr: 3.646 ± 1.111
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.646 ± 1.631
ThrCys: 0.0 ± 0.0
ThrAsp: 0.0 ± 0.0
ThrGlu: 2.735 ± 0.789
ThrPhe: 1.823 ± 0.998
ThrGly: 4.558 ± 0.953
ThrHis: 5.469 ± 1.194
ThrIle: 4.558 ± 2.179
ThrLys: 2.735 ± 1.128
ThrLeu: 2.735 ± 1.22
ThrMet: 0.912 ± 0.656
ThrAsn: 5.469 ± 1.125
ThrPro: 2.735 ± 1.22
ThrGln: 6.381 ± 1.924
ThrArg: 2.735 ± 0.903
ThrSer: 6.381 ± 2.86
ThrThr: 3.646 ± 1.336
ThrVal: 4.558 ± 2.499
ThrTrp: 0.912 ± 0.819
ThrTyr: 3.646 ± 1.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.823 ± 1.07
ValCys: 0.912 ± 1.102
ValAsp: 1.823 ± 0.894
ValGlu: 1.823 ± 2.204
ValPhe: 1.823 ± 0.894
ValGly: 2.735 ± 1.662
ValHis: 0.912 ± 1.102
ValIle: 4.558 ± 1.782
ValLys: 2.735 ± 0.923
ValLeu: 6.381 ± 2.433
ValMet: 0.0 ± 0.95
ValAsn: 0.0 ± 0.0
ValPro: 4.558 ± 1.275
ValGln: 3.646 ± 1.759
ValArg: 3.646 ± 1.337
ValSer: 3.646 ± 1.285
ValThr: 3.646 ± 1.352
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 4.558 ± 2.175
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.646 ± 1.717
TrpCys: 0.0 ± 0.0
TrpAsp: 0.912 ± 1.102
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.823 ± 0.959
TrpLeu: 0.912 ± 0.709
TrpMet: 0.912 ± 0.709
TrpAsn: 0.912 ± 0.878
TrpPro: 0.0 ± 0.0
TrpGln: 0.912 ± 0.656
TrpArg: 1.823 ± 0.92
TrpSer: 0.0 ± 0.0
TrpThr: 1.823 ± 1.756
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.912 ± 0.656
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.646 ± 1.877
TyrCys: 0.0 ± 0.0
TyrAsp: 1.823 ± 1.219
TyrGlu: 1.823 ± 1.418
TyrPhe: 2.735 ± 0.789
TyrGly: 2.735 ± 1.128
TyrHis: 0.0 ± 0.0
TyrIle: 1.823 ± 1.034
TyrLys: 0.0 ± 0.0
TyrLeu: 5.469 ± 1.498
TyrMet: 3.646 ± 0.94
TyrAsn: 2.735 ± 0.789
TyrPro: 1.823 ± 0.998
TyrGln: 0.912 ± 0.656
TyrArg: 3.646 ± 2.245
TyrSer: 2.735 ± 1.326
TyrThr: 1.823 ± 0.92
TyrVal: 2.735 ± 1.628
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1098 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
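One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. This is only an illustrative sketch. The function names, the toy sequences, the use of the sample standard deviation as the error measure, and the fixed random seed are all assumptions; the note above states only that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the virus proteome.
toy = ["MSSAL", "ALSS", "MKKAL"]
print(pair_frequency(toy, "SS"), bootstrap_error(toy, "SS"))
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is consistent with the relatively large errors seen for many dipeptides in the table.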

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski