Amino acid dipeptide frequency for Wenzhou tombus-like virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
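The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter encoding with `X` standing in for Xaa, and the toy sequences are all assumptions made for the example. It counts overlapping residue pairs within each protein and compares every observed frequency with the uniform expectation of 1/400.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues is mapped to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

# Made-up toy sequences for illustration only, not the viral proteome.
toy = ["MKVLAAGIVKAA", "MAAKLLU"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(f"{pair}: {value:.3f} ({label})")
```

With sequences this short, every dipeptide that occurs at all lies above the uniform expectation; the comparison only becomes informative for larger proteomes.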

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
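As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into an approximate dipeptide count. The helper names are hypothetical, and the total of 1278 dipeptides is an assumption derived from the page's own statistics line (2 proteins, 1280 amino acids), on the premise that no dipeptide spans two proteins.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


def expected_count(value_per_mille, n_dipeptides):
    """Approximate number of occurrences behind a per mille frequency."""
    return per_mille_to_fraction(value_per_mille) * n_dipeptides


# Assumption: 1280 residues in 2 proteins give 1280 - 2 overlapping pairs.
N_DIPEPTIDES = 1280 - 2

print(per_mille_to_fraction(3.909))                 # fraction for a table value of 3.909
print(round(expected_count(0.782, N_DIPEPTIDES)))   # smallest non-zero table value
print(round(expected_count(3.909, N_DIPEPTIDES)))
```

Under that assumption, the smallest non-zero value in the table (0.782) corresponds to a single occurrence, and the other values are close to integer multiples of it.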

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.909 ± 0.582
AlaCys: 1.564 ± 0.496
AlaAsp: 4.691 ± 1.488
AlaGlu: 1.564 ± 0.496
AlaPhe: 4.691 ± 0.172
AlaGly: 4.691 ± 2.804
AlaHis: 1.564 ± 0.82
AlaIle: 2.346 ± 1.23
AlaLys: 3.909 ± 0.582
AlaLeu: 7.037 ± 1.058
AlaMet: 3.127 ± 1.64
AlaAsn: 3.909 ± 3.214
AlaPro: 1.564 ± 0.82
AlaGln: 4.691 ± 1.488
AlaArg: 7.037 ± 1.058
AlaSer: 4.691 ± 0.172
AlaThr: 3.127 ± 1.64
AlaVal: 7.819 ± 0.152
AlaTrp: 0.782 ± 0.906
AlaTyr: 0.782 ± 0.41
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.564 ± 0.82
CysCys: 0.782 ± 0.41
CysAsp: 0.0 ± 0.0
CysGlu: 1.564 ± 0.82
CysPhe: 1.564 ± 0.82
CysGly: 0.782 ± 0.906
CysHis: 0.782 ± 0.906
CysIle: 0.0 ± 0.0
CysLys: 0.782 ± 0.41
CysLeu: 0.0 ± 0.0
CysMet: 2.346 ± 0.398
CysAsn: 0.0 ± 0.0
CysPro: 0.782 ± 0.41
CysGln: 0.0 ± 0.0
CysArg: 1.564 ± 0.82
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.564 ± 0.82
CysTrp: 0.0 ± 0.0
CysTyr: 0.782 ± 0.41
CysXaa: 0.782 ± 0.41
Asp
AspAla: 4.691 ± 2.461
AspCys: 0.782 ± 0.41
AspAsp: 1.564 ± 0.82
AspGlu: 2.346 ± 0.086
AspPhe: 2.346 ± 0.086
AspGly: 4.691 ± 1.144
AspHis: 0.782 ± 0.41
AspIle: 5.473 ± 0.238
AspLys: 6.255 ± 0.648
AspLeu: 6.255 ± 0.648
AspMet: 1.564 ± 0.496
AspAsn: 0.0 ± 0.0
AspPro: 3.909 ± 1.898
AspGln: 0.0 ± 0.0
AspArg: 3.909 ± 0.734
AspSer: 3.909 ± 1.898
AspThr: 1.564 ± 0.496
AspVal: 3.127 ± 0.992
AspTrp: 0.0 ± 0.0
AspTyr: 2.346 ± 1.402
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.255 ± 0.648
GluCys: 0.0 ± 0.0
GluAsp: 3.127 ± 0.992
GluGlu: 4.691 ± 2.461
GluPhe: 2.346 ± 1.402
GluGly: 7.037 ± 0.258
GluHis: 2.346 ± 0.086
GluIle: 2.346 ± 0.086
GluLys: 1.564 ± 0.82
GluLeu: 7.037 ± 2.375
GluMet: 3.127 ± 1.64
GluAsn: 0.0 ± 0.0
GluPro: 1.564 ± 1.812
GluGln: 0.782 ± 0.41
GluArg: 6.255 ± 1.965
GluSer: 4.691 ± 1.144
GluThr: 3.127 ± 1.64
GluVal: 0.782 ± 0.41
GluTrp: 0.0 ± 0.0
GluTyr: 2.346 ± 1.23
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.564 ± 0.496
PheCys: 0.782 ± 0.41
PheAsp: 3.909 ± 0.734
PheGlu: 2.346 ± 1.23
PhePhe: 0.0 ± 0.0
PheGly: 5.473 ± 2.394
PheHis: 0.782 ± 0.41
PheIle: 2.346 ± 1.23
PheLys: 0.782 ± 0.41
PheLeu: 5.473 ± 0.238
PheMet: 1.564 ± 0.82
PheAsn: 0.782 ± 0.906
PhePro: 0.782 ± 0.41
PheGln: 0.782 ± 0.906
PheArg: 3.909 ± 1.898
PheSer: 2.346 ± 1.402
PheThr: 1.564 ± 0.82
PheVal: 2.346 ± 0.086
PheTrp: 0.782 ± 0.41
PheTyr: 2.346 ± 1.23
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.819 ± 1.164
GlyCys: 1.564 ± 0.82
GlyAsp: 2.346 ± 1.23
GlyGlu: 2.346 ± 1.402
GlyPhe: 5.473 ± 2.394
GlyGly: 7.819 ± 3.796
GlyHis: 0.0 ± 0.0
GlyIle: 3.909 ± 0.734
GlyLys: 6.255 ± 1.965
GlyLeu: 7.037 ± 0.258
GlyMet: 2.346 ± 0.086
GlyAsn: 0.0 ± 0.0
GlyPro: 3.909 ± 0.582
GlyGln: 2.346 ± 1.402
GlyArg: 2.346 ± 0.086
GlySer: 5.473 ± 1.078
GlyThr: 4.691 ± 2.804
GlyVal: 5.473 ± 2.394
GlyTrp: 0.782 ± 0.906
GlyTyr: 3.127 ± 2.308
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.346 ± 1.23
HisCys: 0.0 ± 0.0
HisAsp: 2.346 ± 0.086
HisGlu: 0.782 ± 0.41
HisPhe: 0.0 ± 0.0
HisGly: 3.127 ± 0.324
HisHis: 0.782 ± 0.41
HisIle: 0.0 ± 0.0
HisLys: 2.346 ± 0.086
HisLeu: 0.0 ± 0.0
HisMet: 0.782 ± 0.906
HisAsn: 2.346 ± 1.23
HisPro: 1.564 ± 0.496
HisGln: 0.782 ± 0.41
HisArg: 0.782 ± 0.906
HisSer: 0.782 ± 0.41
HisThr: 0.782 ± 0.41
HisVal: 2.346 ± 1.23
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.346 ± 0.086
IleCys: 0.0 ± 0.0
IleAsp: 1.564 ± 0.82
IleGlu: 1.564 ± 1.812
IlePhe: 1.564 ± 0.82
IleGly: 2.346 ± 1.402
IleHis: 2.346 ± 0.086
IleIle: 2.346 ± 0.086
IleLys: 1.564 ± 0.82
IleLeu: 3.127 ± 1.64
IleMet: 1.564 ± 0.82
IleAsn: 3.127 ± 0.992
IlePro: 0.782 ± 0.906
IleGln: 3.127 ± 0.324
IleArg: 4.691 ± 1.144
IleSer: 0.782 ± 0.41
IleThr: 2.346 ± 0.086
IleVal: 4.691 ± 1.144
IleTrp: 0.782 ± 0.906
IleTyr: 1.564 ± 1.812
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.382 ± 3.605
LysCys: 1.564 ± 0.496
LysAsp: 4.691 ± 1.144
LysGlu: 4.691 ± 1.144
LysPhe: 2.346 ± 1.23
LysGly: 1.564 ± 0.82
LysHis: 0.782 ± 0.41
LysIle: 1.564 ± 0.496
LysLys: 3.909 ± 2.05
LysLeu: 3.909 ± 2.05
LysMet: 0.782 ± 0.41
LysAsn: 1.564 ± 1.812
LysPro: 3.127 ± 0.992
LysGln: 0.782 ± 0.41
LysArg: 3.127 ± 0.992
LysSer: 1.564 ± 0.496
LysThr: 6.255 ± 0.668
LysVal: 3.909 ± 0.582
LysTrp: 1.564 ± 0.82
LysTyr: 2.346 ± 1.23
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.691 ± 0.172
LeuCys: 1.564 ± 0.82
LeuAsp: 1.564 ± 0.82
LeuGlu: 3.909 ± 2.05
LeuPhe: 2.346 ± 0.086
LeuGly: 5.473 ± 1.078
LeuHis: 0.0 ± 0.0
LeuIle: 3.909 ± 0.734
LeuLys: 5.473 ± 1.554
LeuLeu: 5.473 ± 0.238
LeuMet: 3.909 ± 0.734
LeuAsn: 3.909 ± 2.05
LeuPro: 5.473 ± 1.078
LeuGln: 2.346 ± 0.086
LeuArg: 3.909 ± 0.582
LeuSer: 7.037 ± 2.375
LeuThr: 10.164 ± 2.699
LeuVal: 3.909 ± 0.734
LeuTrp: 1.564 ± 0.496
LeuTyr: 3.127 ± 0.324
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.346 ± 1.23
MetCys: 0.782 ± 0.41
MetAsp: 0.782 ± 0.41
MetGlu: 3.909 ± 0.734
MetPhe: 1.564 ± 0.82
MetGly: 0.782 ± 0.41
MetHis: 0.0 ± 0.0
MetIle: 1.564 ± 1.812
MetLys: 0.782 ± 0.41
MetLeu: 0.782 ± 0.41
MetMet: 3.127 ± 1.64
MetAsn: 2.346 ± 0.086
MetPro: 2.346 ± 1.23
MetGln: 3.127 ± 1.64
MetArg: 3.909 ± 0.582
MetSer: 1.564 ± 0.82
MetThr: 0.782 ± 0.906
MetVal: 3.909 ± 0.582
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.127 ± 2.308
AsnCys: 0.0 ± 0.0
AsnAsp: 0.782 ± 0.906
AsnGlu: 1.564 ± 0.82
AsnPhe: 2.346 ± 1.23
AsnGly: 2.346 ± 1.402
AsnHis: 1.564 ± 0.496
AsnIle: 2.346 ± 1.23
AsnLys: 3.909 ± 0.582
AsnLeu: 3.909 ± 0.582
AsnMet: 0.0 ± 0.0
AsnAsn: 2.346 ± 1.402
AsnPro: 0.782 ± 0.41
AsnGln: 0.0 ± 0.0
AsnArg: 3.909 ± 0.582
AsnSer: 3.909 ± 0.582
AsnThr: 3.909 ± 1.898
AsnVal: 1.564 ± 0.496
AsnTrp: 0.782 ± 0.41
AsnTyr: 1.564 ± 0.82
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.346 ± 2.718
ProCys: 1.564 ± 0.496
ProAsp: 3.127 ± 2.308
ProGlu: 2.346 ± 0.086
ProPhe: 0.0 ± 0.0
ProGly: 5.473 ± 3.71
ProHis: 0.782 ± 0.41
ProIle: 4.691 ± 2.804
ProLys: 3.127 ± 1.64
ProLeu: 2.346 ± 1.23
ProMet: 0.782 ± 0.41
ProAsn: 1.564 ± 0.82
ProPro: 4.691 ± 1.144
ProGln: 0.0 ± 0.0
ProArg: 5.473 ± 0.238
ProSer: 2.346 ± 1.402
ProThr: 2.346 ± 1.402
ProVal: 7.819 ± 0.152
ProTrp: 0.0 ± 0.0
ProTyr: 1.564 ± 0.496
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.346 ± 0.086
GlnCys: 0.0 ± 0.0
GlnAsp: 3.127 ± 2.308
GlnGlu: 2.346 ± 0.086
GlnPhe: 1.564 ± 1.812
GlnGly: 2.346 ± 0.086
GlnHis: 1.564 ± 0.82
GlnIle: 1.564 ± 0.496
GlnLys: 0.782 ± 0.41
GlnLeu: 2.346 ± 0.086
GlnMet: 0.0 ± 0.0
GlnAsn: 0.782 ± 0.41
GlnPro: 3.909 ± 1.898
GlnGln: 3.909 ± 1.898
GlnArg: 1.564 ± 0.82
GlnSer: 2.346 ± 0.086
GlnThr: 0.782 ± 0.906
GlnVal: 1.564 ± 0.82
GlnTrp: 1.564 ± 0.82
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.346 ± 1.402
ArgCys: 1.564 ± 0.82
ArgAsp: 5.473 ± 0.238
ArgGlu: 3.127 ± 1.64
ArgPhe: 3.127 ± 1.64
ArgGly: 2.346 ± 0.086
ArgHis: 0.0 ± 0.0
ArgIle: 2.346 ± 0.086
ArgLys: 8.6 ± 1.879
ArgLeu: 6.255 ± 1.965
ArgMet: 3.909 ± 0.582
ArgAsn: 7.819 ± 0.152
ArgPro: 4.691 ± 0.172
ArgGln: 0.782 ± 0.41
ArgArg: 8.6 ± 0.562
ArgSer: 4.691 ± 0.172
ArgThr: 7.819 ± 0.152
ArgVal: 4.691 ± 1.488
ArgTrp: 0.782 ± 0.41
ArgTyr: 1.564 ± 0.82
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.909 ± 1.898
SerCys: 0.782 ± 0.41
SerAsp: 6.255 ± 0.668
SerGlu: 3.127 ± 0.992
SerPhe: 3.909 ± 0.734
SerGly: 7.819 ± 1.164
SerHis: 0.0 ± 0.0
SerIle: 1.564 ± 0.496
SerLys: 0.782 ± 0.906
SerLeu: 7.819 ± 0.152
SerMet: 0.782 ± 0.41
SerAsn: 2.346 ± 1.402
SerPro: 3.127 ± 2.308
SerGln: 6.255 ± 0.668
SerArg: 0.782 ± 0.41
SerSer: 7.037 ± 2.89
SerThr: 0.782 ± 0.41
SerVal: 3.127 ± 0.324
SerTrp: 0.0 ± 0.0
SerTyr: 3.127 ± 0.324
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.909 ± 0.734
ThrCys: 0.0 ± 0.0
ThrAsp: 3.127 ± 1.64
ThrGlu: 6.255 ± 1.965
ThrPhe: 1.564 ± 0.496
ThrGly: 2.346 ± 1.402
ThrHis: 2.346 ± 1.23
ThrIle: 2.346 ± 1.23
ThrLys: 5.473 ± 3.71
ThrLeu: 6.255 ± 0.648
ThrMet: 1.564 ± 0.496
ThrAsn: 2.346 ± 0.086
ThrPro: 2.346 ± 1.402
ThrGln: 1.564 ± 1.812
ThrArg: 5.473 ± 2.871
ThrSer: 4.691 ± 1.488
ThrThr: 6.255 ± 0.648
ThrVal: 3.127 ± 0.324
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.564 ± 1.812
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.473 ± 2.394
ValCys: 0.782 ± 0.41
ValAsp: 3.127 ± 0.324
ValGlu: 4.691 ± 0.172
ValPhe: 2.346 ± 1.23
ValGly: 4.691 ± 1.144
ValHis: 2.346 ± 0.086
ValIle: 0.782 ± 0.41
ValLys: 2.346 ± 1.23
ValLeu: 3.127 ± 0.324
ValMet: 2.346 ± 0.819
ValAsn: 3.909 ± 0.582
ValPro: 6.255 ± 0.648
ValGln: 1.564 ± 0.496
ValArg: 7.819 ± 0.152
ValSer: 4.691 ± 2.804
ValThr: 3.127 ± 0.324
ValVal: 2.346 ± 1.23
ValTrp: 0.782 ± 0.41
ValTyr: 3.127 ± 0.324
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.782 ± 0.41
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.346 ± 1.23
TrpPhe: 0.782 ± 0.906
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.782 ± 0.41
TrpLeu: 0.782 ± 0.41
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.127 ± 0.324
TrpSer: 0.782 ± 0.906
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.782 ± 0.906
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.346 ± 1.402
TyrCys: 1.564 ± 0.82
TyrAsp: 3.909 ± 0.734
TyrGlu: 3.909 ± 0.734
TyrPhe: 0.782 ± 0.906
TyrGly: 3.127 ± 0.324
TyrHis: 3.127 ± 0.324
TyrIle: 0.782 ± 0.906
TyrLys: 0.0 ± 0.0
TyrLeu: 0.782 ± 0.906
TyrMet: 0.0 ± 0.0
TyrAsn: 1.564 ± 0.496
TyrPro: 0.782 ± 0.906
TyrGln: 1.564 ± 0.496
TyrArg: 2.346 ± 1.23
TyrSer: 0.0 ± 0.0
TyrThr: 3.127 ± 0.992
TyrVal: 2.346 ± 1.23
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.782 ± 0.41
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1280 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
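One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure used by Proteome-pI is not described on this page, so the function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per mille frequency of each overlapping residue pair in a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of a dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    # Assumption: the error is the population standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Made-up toy sequences for illustration only, not the viral proteome.
toy = ["MKVLAAGIVKAA", "MAAKLLGG"]
print(bootstrap_error(toy, "AA"))
```

With only two proteins, a resample can contain just one of three compositions (the first protein twice, the second twice, or one of each), so the estimated errors are necessarily coarse.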

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski