Amino acid dipeptide frequency for Wenzhou tombus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
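A minimal sketch of how such frequencies could be computed, assuming that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and normalised by the total number of pairs. The function name `dipeptide_permille`, the one-letter input alphabet, and the normalisation are illustrative assumptions, not the database's documented procedure.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to 'X' (assumption for this sketch).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral sequences): pairs are AA, AA, AG, GV.
freqs = dipeptide_permille(["AAAG", "GV"])
print(freqs["AA"])  # 500.0
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21×21 table would fill them in with 0.0.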

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
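The expected value under the uniform model follows directly from the number of possible dipeptides. The sketch below works it out for both alphabet sizes and labels a value as over- or underrepresented. The function names are hypothetical, and using the 20-letter expectation as the colouring threshold is an assumption; the page does not state which baseline the colouring uses.

```python
def expected_uniform_permille(alphabet_size: int = 20) -> float:
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille: float, alphabet_size: int = 20) -> str:
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_uniform_permille(20))            # 2.5
print(round(expected_uniform_permille(21), 3))  # 2.268
print(classify(10.158))                         # over  (e.g. AlaAla)
print(classify(1.129))                          # under (e.g. AlaCys)
```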

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
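As a small illustration of the unit conversion, the helpers below turn a per-mille value into a plain fraction or a percentage. The function names are hypothetical and chosen only for this sketch.

```python
def permille_to_fraction(value_permille: float) -> float:
    """Convert a per-mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille: float) -> float:
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


# The uniform expectation of 2.5 per mille corresponds to 0.25 percent.
print(permille_to_percent(2.5))  # 0.25
```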

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.158 ± 2.569
AlaCys: 1.129 ± 0.536
AlaAsp: 3.386 ± 0.648
AlaGlu: 5.643 ± 2.681
AlaPhe: 3.386 ± 1.609
AlaGly: 4.515 ± 0.112
AlaHis: 1.129 ± 0.536
AlaIle: 5.643 ± 0.424
AlaLys: 2.257 ± 1.073
AlaLeu: 3.386 ± 0.648
AlaMet: 2.257 ± 1.073
AlaAsn: 4.515 ± 0.112
AlaPro: 5.643 ± 1.833
AlaGln: 3.386 ± 0.648
AlaArg: 7.901 ± 1.497
AlaSer: 2.257 ± 1.073
AlaThr: 5.643 ± 1.833
AlaVal: 10.158 ± 0.312
AlaTrp: 1.129 ± 0.536
AlaTyr: 3.386 ± 1.609
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.257 ± 1.073
CysCys: 0.0 ± 0.0
CysAsp: 1.129 ± 0.536
CysGlu: 1.129 ± 0.536
CysPhe: 0.0 ± 0.0
CysGly: 1.129 ± 0.536
CysHis: 2.257 ± 1.073
CysIle: 0.0 ± 0.0
CysLys: 1.129 ± 1.721
CysLeu: 3.386 ± 1.609
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.257 ± 1.073
CysGln: 1.129 ± 0.536
CysArg: 2.257 ± 1.073
CysSer: 1.129 ± 1.721
CysThr: 0.0 ± 0.0
CysVal: 3.386 ± 1.609
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.386 ± 1.609
AspCys: 2.257 ± 1.073
AspAsp: 1.129 ± 0.536
AspGlu: 2.257 ± 1.073
AspPhe: 1.129 ± 0.536
AspGly: 2.257 ± 1.185
AspHis: 0.0 ± 0.0
AspIle: 2.257 ± 1.073
AspLys: 1.129 ± 1.721
AspLeu: 7.901 ± 1.497
AspMet: 1.129 ± 0.536
AspAsn: 2.257 ± 1.185
AspPro: 2.257 ± 1.073
AspGln: 1.129 ± 0.536
AspArg: 4.515 ± 0.112
AspSer: 2.257 ± 1.073
AspThr: 2.257 ± 1.185
AspVal: 5.643 ± 1.833
AspTrp: 3.386 ± 0.648
AspTyr: 2.257 ± 1.073
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.515 ± 0.112
GluCys: 1.129 ± 0.536
GluAsp: 2.257 ± 1.073
GluGlu: 1.129 ± 0.536
GluPhe: 2.257 ± 1.073
GluGly: 2.257 ± 1.185
GluHis: 2.257 ± 1.073
GluIle: 1.129 ± 1.721
GluLys: 1.129 ± 0.536
GluLeu: 5.643 ± 2.681
GluMet: 0.0 ± 0.0
GluAsn: 1.129 ± 0.536
GluPro: 2.257 ± 1.185
GluGln: 1.129 ± 0.536
GluArg: 4.515 ± 2.145
GluSer: 3.386 ± 1.609
GluThr: 2.257 ± 3.442
GluVal: 4.515 ± 2.145
GluTrp: 1.129 ± 0.536
GluTyr: 3.386 ± 0.648
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.129 ± 0.536
PheAsp: 2.257 ± 1.185
PheGlu: 2.257 ± 1.073
PhePhe: 0.0 ± 0.0
PheGly: 6.772 ± 3.218
PheHis: 2.257 ± 1.073
PheIle: 2.257 ± 3.442
PheLys: 1.129 ± 0.536
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 3.386 ± 2.906
PheGln: 0.0 ± 0.0
PheArg: 1.129 ± 0.536
PheSer: 1.129 ± 0.536
PheThr: 5.643 ± 1.833
PheVal: 4.515 ± 2.369
PheTrp: 2.257 ± 1.185
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.772 ± 1.297
GlyCys: 3.386 ± 1.609
GlyAsp: 5.643 ± 0.424
GlyGlu: 4.515 ± 0.112
GlyPhe: 4.515 ± 0.112
GlyGly: 6.772 ± 5.811
GlyHis: 0.0 ± 0.0
GlyIle: 6.772 ± 0.96
GlyLys: 0.0 ± 0.0
GlyLeu: 5.643 ± 1.833
GlyMet: 3.386 ± 0.648
GlyAsn: 1.129 ± 0.536
GlyPro: 2.257 ± 1.185
GlyGln: 0.0 ± 0.0
GlyArg: 4.515 ± 0.112
GlySer: 2.257 ± 3.442
GlyThr: 6.772 ± 3.554
GlyVal: 16.93 ± 5.787
GlyTrp: 3.386 ± 1.609
GlyTyr: 3.386 ± 1.609
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.257 ± 1.073
HisCys: 1.129 ± 0.536
HisAsp: 1.129 ± 1.721
HisGlu: 1.129 ± 0.536
HisPhe: 0.0 ± 0.0
HisGly: 1.129 ± 0.536
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.129 ± 1.721
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 1.129 ± 0.536
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 2.257 ± 1.073
HisSer: 3.386 ± 1.609
HisThr: 0.0 ± 0.0
HisVal: 3.386 ± 1.609
HisTrp: 1.129 ± 0.536
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.257 ± 1.073
IleCys: 1.129 ± 1.721
IleAsp: 3.386 ± 1.609
IleGlu: 1.129 ± 0.536
IlePhe: 0.0 ± 0.0
IleGly: 4.515 ± 4.627
IleHis: 0.0 ± 0.0
IleIle: 1.129 ± 0.536
IleLys: 0.0 ± 0.0
IleLeu: 2.257 ± 1.073
IleMet: 0.0 ± 0.0
IleAsn: 3.386 ± 0.648
IlePro: 1.129 ± 1.721
IleGln: 3.386 ± 2.906
IleArg: 2.257 ± 1.185
IleSer: 1.129 ± 1.721
IleThr: 1.129 ± 0.536
IleVal: 2.257 ± 1.185
IleTrp: 0.0 ± 0.0
IleTyr: 1.129 ± 0.536
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.515 ± 2.145
LysCys: 0.0 ± 0.0
LysAsp: 1.129 ± 0.536
LysGlu: 0.0 ± 0.0
LysPhe: 1.129 ± 1.721
LysGly: 4.515 ± 2.369
LysHis: 0.0 ± 0.0
LysIle: 1.129 ± 1.721
LysLys: 0.0 ± 0.0
LysLeu: 3.386 ± 0.648
LysMet: 2.257 ± 1.185
LysAsn: 2.257 ± 1.185
LysPro: 1.129 ± 0.536
LysGln: 0.0 ± 0.0
LysArg: 2.257 ± 1.185
LysSer: 0.0 ± 0.0
LysThr: 0.0 ± 0.0
LysVal: 2.257 ± 1.185
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 1.129 ± 0.536
Leu
LeuAla: 12.415 ± 3.642
LeuCys: 2.257 ± 1.073
LeuAsp: 5.643 ± 0.424
LeuGlu: 4.515 ± 2.145
LeuPhe: 1.129 ± 0.536
LeuGly: 5.643 ± 0.424
LeuHis: 1.129 ± 0.536
LeuIle: 0.0 ± 0.0
LeuLys: 3.386 ± 1.609
LeuLeu: 9.029 ± 4.29
LeuMet: 1.129 ± 0.916
LeuAsn: 1.129 ± 0.536
LeuPro: 4.515 ± 2.145
LeuGln: 3.386 ± 1.609
LeuArg: 6.772 ± 0.96
LeuSer: 1.129 ± 0.536
LeuThr: 6.772 ± 1.297
LeuVal: 6.772 ± 1.297
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.129 ± 0.536
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.515 ± 0.112
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 3.386 ± 2.906
MetGly: 2.257 ± 1.073
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.129 ± 1.721
MetLeu: 1.129 ± 0.536
MetMet: 0.0 ± 0.0
MetAsn: 1.129 ± 0.536
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 4.515 ± 2.145
MetThr: 2.257 ± 1.185
MetVal: 4.515 ± 2.145
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.257 ± 1.185
AsnCys: 1.129 ± 0.536
AsnAsp: 1.129 ± 0.536
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.129 ± 1.721
AsnGly: 1.129 ± 0.536
AsnHis: 1.129 ± 1.721
AsnIle: 1.129 ± 1.721
AsnLys: 1.129 ± 0.536
AsnLeu: 5.643 ± 0.424
AsnMet: 1.129 ± 1.721
AsnAsn: 2.257 ± 1.185
AsnPro: 4.515 ± 4.627
AsnGln: 0.0 ± 0.0
AsnArg: 2.257 ± 1.073
AsnSer: 4.515 ± 0.112
AsnThr: 3.386 ± 0.648
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.386 ± 1.609
ProCys: 1.129 ± 0.536
ProAsp: 3.386 ± 1.609
ProGlu: 2.257 ± 1.185
ProPhe: 0.0 ± 0.0
ProGly: 4.515 ± 2.369
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 1.129 ± 0.536
ProLeu: 2.257 ± 1.073
ProMet: 1.129 ± 0.536
ProAsn: 2.257 ± 3.442
ProPro: 0.0 ± 0.0
ProGln: 1.129 ± 0.536
ProArg: 5.643 ± 2.681
ProSer: 3.386 ± 2.906
ProThr: 5.643 ± 1.833
ProVal: 9.029 ± 2.481
ProTrp: 0.0 ± 0.0
ProTyr: 2.257 ± 1.185
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.386 ± 0.648
GlnCys: 0.0 ± 0.0
GlnAsp: 1.129 ± 0.536
GlnGlu: 1.129 ± 1.721
GlnPhe: 1.129 ± 0.536
GlnGly: 0.0 ± 0.0
GlnHis: 1.129 ± 0.536
GlnIle: 1.129 ± 1.721
GlnLys: 0.0 ± 0.0
GlnLeu: 2.257 ± 1.073
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 3.386 ± 0.648
GlnGln: 3.386 ± 1.609
GlnArg: 3.386 ± 1.609
GlnSer: 2.257 ± 1.185
GlnThr: 0.0 ± 0.0
GlnVal: 1.129 ± 0.536
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.129 ± 0.536
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.643 ± 0.424
ArgCys: 0.0 ± 0.0
ArgAsp: 3.386 ± 1.609
ArgGlu: 4.515 ± 2.145
ArgPhe: 2.257 ± 1.073
ArgGly: 5.643 ± 2.681
ArgHis: 2.257 ± 1.073
ArgIle: 2.257 ± 1.073
ArgLys: 4.515 ± 2.369
ArgLeu: 7.901 ± 3.754
ArgMet: 3.386 ± 1.609
ArgAsn: 0.0 ± 0.0
ArgPro: 3.386 ± 1.609
ArgGln: 2.257 ± 1.073
ArgArg: 6.772 ± 5.811
ArgSer: 6.772 ± 1.297
ArgThr: 3.386 ± 0.648
ArgVal: 12.415 ± 1.385
ArgTrp: 4.515 ± 0.112
ArgTyr: 5.643 ± 2.681
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.643 ± 6.348
SerCys: 0.0 ± 0.0
SerAsp: 1.129 ± 0.536
SerGlu: 2.257 ± 1.073
SerPhe: 5.643 ± 6.348
SerGly: 5.643 ± 2.681
SerHis: 0.0 ± 0.0
SerIle: 2.257 ± 1.185
SerLys: 2.257 ± 1.073
SerLeu: 4.515 ± 2.145
SerMet: 1.129 ± 0.536
SerAsn: 0.0 ± 0.0
SerPro: 3.386 ± 0.648
SerGln: 0.0 ± 0.0
SerArg: 5.643 ± 0.424
SerSer: 2.257 ± 1.073
SerThr: 4.515 ± 0.112
SerVal: 7.901 ± 0.76
SerTrp: 1.129 ± 0.536
SerTyr: 2.257 ± 1.185
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.129 ± 0.536
ThrCys: 2.257 ± 1.185
ThrAsp: 2.257 ± 1.185
ThrGlu: 2.257 ± 1.185
ThrPhe: 4.515 ± 2.145
ThrGly: 9.029 ± 2.481
ThrHis: 1.129 ± 0.536
ThrIle: 1.129 ± 1.721
ThrLys: 2.257 ± 3.442
ThrLeu: 4.515 ± 0.112
ThrMet: 2.257 ± 2.623
ThrAsn: 3.386 ± 5.163
ThrPro: 5.643 ± 0.424
ThrGln: 1.129 ± 1.721
ThrArg: 5.643 ± 0.424
ThrSer: 5.643 ± 0.424
ThrThr: 6.772 ± 3.554
ThrVal: 3.386 ± 2.906
ThrTrp: 1.129 ± 0.536
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.415 ± 3.642
ValCys: 1.129 ± 0.536
ValAsp: 6.772 ± 1.297
ValGlu: 6.772 ± 1.297
ValPhe: 4.515 ± 0.112
ValGly: 15.801 ± 0.736
ValHis: 1.129 ± 0.536
ValIle: 3.386 ± 0.648
ValLys: 3.386 ± 0.648
ValLeu: 6.772 ± 0.96
ValMet: 0.0 ± 0.0
ValAsn: 3.386 ± 0.648
ValPro: 1.129 ± 0.536
ValGln: 0.0 ± 0.0
ValArg: 14.673 ± 4.714
ValSer: 6.772 ± 1.297
ValThr: 6.772 ± 3.554
ValVal: 10.158 ± 4.827
ValTrp: 3.386 ± 0.648
ValTyr: 3.386 ± 0.648
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 2.257 ± 1.073
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.386 ± 0.648
TrpPhe: 0.0 ± 0.0
TrpGly: 2.257 ± 1.073
TrpHis: 1.129 ± 0.536
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.257 ± 1.073
TrpMet: 2.257 ± 1.073
TrpAsn: 1.129 ± 1.721
TrpPro: 0.0 ± 0.0
TrpGln: 2.257 ± 1.185
TrpArg: 2.257 ± 1.073
TrpSer: 0.0 ± 0.0
TrpThr: 1.129 ± 0.536
TrpVal: 1.129 ± 0.536
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.257 ± 1.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.129 ± 0.536
TyrAsp: 4.515 ± 0.112
TyrGlu: 1.129 ± 0.536
TyrPhe: 0.0 ± 0.0
TyrGly: 1.129 ± 0.536
TyrHis: 2.257 ± 1.185
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 1.129 ± 0.536
TyrMet: 2.257 ± 1.073
TyrAsn: 3.386 ± 1.609
TyrPro: 2.257 ± 1.073
TyrGln: 2.257 ± 1.073
TyrArg: 2.257 ± 1.073
TyrSer: 3.386 ± 2.906
TyrThr: 1.129 ± 0.536
TyrVal: 2.257 ± 1.185
TyrTrp: 1.129 ± 0.536
TyrTyr: 1.129 ± 1.721
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 1.129 ± 0.536
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (887 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
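A rough sketch of what protein-level bootstrapping can look like: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported. The function names, the fixed seed, and the use of the population standard deviation as the error measure are assumptions made for illustration; the page does not specify the exact estimator.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over all adjacent pairs in the proteins."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: report the population standard deviation of the resamples.
    return statistics.pstdev(estimates)


# Toy sequences (not the real proteome): identical proteins give zero spread.
print(bootstrap_error(["AAAA", "AAAA"], "AA"))  # 0.0
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so error estimates obtained this way are necessarily coarse.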

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski