Amino acid dipepetide frequency for Wenzhou tombus-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.485AlaAla: 9.485 ± 1.364
2.71AlaCys: 2.71 ± 1.681
4.065AlaAsp: 4.065 ± 0.262
2.71AlaGlu: 2.71 ± 1.681
4.065AlaPhe: 4.065 ± 0.262
5.42AlaGly: 5.42 ± 1.102
0.0AlaHis: 0.0 ± 0.0
5.42AlaIle: 5.42 ± 3.416
2.71AlaLys: 2.71 ± 0.579
5.42AlaLeu: 5.42 ± 1.157
1.355AlaMet: 1.355 ± 0.84
2.71AlaAsn: 2.71 ± 1.681
5.42AlaPro: 5.42 ± 1.157
4.065AlaGln: 4.065 ± 0.262
5.42AlaArg: 5.42 ± 1.157
4.065AlaSer: 4.065 ± 1.997
2.71AlaThr: 2.71 ± 0.579
12.195AlaVal: 12.195 ± 3.733
1.355AlaTrp: 1.355 ± 0.84
2.71AlaTyr: 2.71 ± 0.579
0.0AlaXaa: 0.0 ± 0.0
Cys
1.355CysAla: 1.355 ± 0.84
1.355CysCys: 1.355 ± 1.419
0.0CysAsp: 0.0 ± 0.0
1.355CysGlu: 1.355 ± 0.84
0.0CysPhe: 0.0 ± 0.0
2.71CysGly: 2.71 ± 1.681
0.0CysHis: 0.0 ± 0.0
2.71CysIle: 2.71 ± 0.579
1.355CysLys: 1.355 ± 1.419
1.355CysLeu: 1.355 ± 0.84
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.355CysPro: 1.355 ± 0.84
2.71CysGln: 2.71 ± 0.579
1.355CysArg: 1.355 ± 0.84
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.71CysVal: 2.71 ± 1.681
0.0CysTrp: 0.0 ± 0.0
1.355CysTyr: 1.355 ± 0.84
0.0CysXaa: 0.0 ± 0.0
Asp
6.775AspAla: 6.775 ± 1.942
2.71AspCys: 2.71 ± 1.681
1.355AspAsp: 1.355 ± 0.84
0.0AspGlu: 0.0 ± 0.0
0.0AspPhe: 0.0 ± 0.0
4.065AspGly: 4.065 ± 2.521
1.355AspHis: 1.355 ± 0.84
2.71AspIle: 2.71 ± 2.838
1.355AspLys: 1.355 ± 1.419
4.065AspLeu: 4.065 ± 2.521
0.0AspMet: 0.0 ± 0.0
1.355AspAsn: 1.355 ± 1.419
6.775AspPro: 6.775 ± 1.942
4.065AspGln: 4.065 ± 2.521
1.355AspArg: 1.355 ± 1.419
4.065AspSer: 4.065 ± 0.262
5.42AspThr: 5.42 ± 3.416
1.355AspVal: 1.355 ± 1.419
0.0AspTrp: 0.0 ± 0.0
2.71AspTyr: 2.71 ± 1.681
0.0AspXaa: 0.0 ± 0.0
Glu
4.065GluAla: 4.065 ± 0.262
2.71GluCys: 2.71 ± 1.681
1.355GluAsp: 1.355 ± 0.84
1.355GluGlu: 1.355 ± 0.84
2.71GluPhe: 2.71 ± 1.681
2.71GluGly: 2.71 ± 0.579
1.355GluHis: 1.355 ± 0.84
0.0GluIle: 0.0 ± 0.0
2.71GluLys: 2.71 ± 1.681
4.065GluLeu: 4.065 ± 2.521
0.0GluMet: 0.0 ± 0.0
1.355GluAsn: 1.355 ± 0.84
4.065GluPro: 4.065 ± 0.262
1.355GluGln: 1.355 ± 0.84
10.84GluArg: 10.84 ± 6.722
4.065GluSer: 4.065 ± 0.262
1.355GluThr: 1.355 ± 1.419
4.065GluVal: 4.065 ± 0.262
1.355GluTrp: 1.355 ± 0.84
2.71GluTyr: 2.71 ± 1.681
0.0GluXaa: 0.0 ± 0.0
Phe
2.71PheAla: 2.71 ± 2.838
2.71PheCys: 2.71 ± 1.681
5.42PheAsp: 5.42 ± 1.102
2.71PheGlu: 2.71 ± 1.681
0.0PhePhe: 0.0 ± 0.0
2.71PheGly: 2.71 ± 1.681
2.71PheHis: 2.71 ± 0.579
1.355PheIle: 1.355 ± 1.419
0.0PheLys: 0.0 ± 0.0
4.065PheLeu: 4.065 ± 0.262
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
4.065PhePro: 4.065 ± 4.257
1.355PheGln: 1.355 ± 1.419
5.42PheArg: 5.42 ± 1.102
1.355PheSer: 1.355 ± 1.419
2.71PheThr: 2.71 ± 0.579
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.71PheTyr: 2.71 ± 1.681
0.0PheXaa: 0.0 ± 0.0
Gly
1.355GlyAla: 1.355 ± 1.419
0.0GlyCys: 0.0 ± 0.0
6.775GlyAsp: 6.775 ± 0.317
4.065GlyGlu: 4.065 ± 2.521
5.42GlyPhe: 5.42 ± 1.157
4.065GlyGly: 4.065 ± 4.257
1.355GlyHis: 1.355 ± 1.419
4.065GlyIle: 4.065 ± 0.262
1.355GlyLys: 1.355 ± 1.419
9.485GlyLeu: 9.485 ± 1.364
1.355GlyMet: 1.355 ± 0.84
2.71GlyAsn: 2.71 ± 0.579
2.71GlyPro: 2.71 ± 0.579
4.065GlyGln: 4.065 ± 0.262
9.485GlyArg: 9.485 ± 0.895
4.065GlySer: 4.065 ± 1.997
4.065GlyThr: 4.065 ± 4.257
9.485GlyVal: 9.485 ± 3.623
2.71GlyTrp: 2.71 ± 1.681
5.42GlyTyr: 5.42 ± 3.416
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.355HisAsp: 1.355 ± 1.419
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.355HisGly: 1.355 ± 0.84
0.0HisHis: 0.0 ± 0.0
1.355HisIle: 1.355 ± 0.84
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.355HisAsn: 1.355 ± 0.84
2.71HisPro: 2.71 ± 0.579
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.71HisSer: 2.71 ± 0.579
0.0HisThr: 0.0 ± 0.0
2.71HisVal: 2.71 ± 0.579
2.71HisTrp: 2.71 ± 1.681
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.065IleAla: 4.065 ± 1.997
0.0IleCys: 0.0 ± 0.0
5.42IleAsp: 5.42 ± 1.157
0.0IleGlu: 0.0 ± 0.0
0.0IlePhe: 0.0 ± 0.0
4.065IleGly: 4.065 ± 4.257
0.0IleHis: 0.0 ± 0.0
1.355IleIle: 1.355 ± 1.419
2.71IleLys: 2.71 ± 0.579
1.355IleLeu: 1.355 ± 0.84
1.355IleMet: 1.355 ± 0.84
1.355IleAsn: 1.355 ± 1.419
1.355IlePro: 1.355 ± 1.419
0.0IleGln: 0.0 ± 0.0
1.355IleArg: 1.355 ± 0.84
4.065IleSer: 4.065 ± 0.262
4.065IleThr: 4.065 ± 1.997
2.71IleVal: 2.71 ± 2.838
0.0IleTrp: 0.0 ± 0.0
1.355IleTyr: 1.355 ± 0.84
0.0IleXaa: 0.0 ± 0.0
Lys
4.065LysAla: 4.065 ± 2.521
0.0LysCys: 0.0 ± 0.0
2.71LysAsp: 2.71 ± 1.681
2.71LysGlu: 2.71 ± 1.681
1.355LysPhe: 1.355 ± 1.419
5.42LysGly: 5.42 ± 1.157
0.0LysHis: 0.0 ± 0.0
1.355LysIle: 1.355 ± 0.84
0.0LysLys: 0.0 ± 0.0
4.065LysLeu: 4.065 ± 0.262
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
0.0LysPro: 0.0 ± 0.0
1.355LysGln: 1.355 ± 1.419
5.42LysArg: 5.42 ± 1.157
0.0LysSer: 0.0 ± 0.0
1.355LysThr: 1.355 ± 0.84
5.42LysVal: 5.42 ± 1.102
1.355LysTrp: 1.355 ± 1.419
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.775LeuAla: 6.775 ± 1.942
0.0LeuCys: 0.0 ± 0.0
5.42LeuAsp: 5.42 ± 1.102
5.42LeuGlu: 5.42 ± 3.361
2.71LeuPhe: 2.71 ± 1.681
5.42LeuGly: 5.42 ± 3.416
1.355LeuHis: 1.355 ± 0.84
4.065LeuIle: 4.065 ± 0.262
2.71LeuLys: 2.71 ± 1.681
2.71LeuLeu: 2.71 ± 1.681
1.355LeuMet: 1.355 ± 0.827
6.775LeuAsn: 6.775 ± 1.942
2.71LeuPro: 2.71 ± 0.579
2.71LeuGln: 2.71 ± 0.579
9.485LeuArg: 9.485 ± 1.364
4.065LeuSer: 4.065 ± 2.521
9.485LeuThr: 9.485 ± 1.364
5.42LeuVal: 5.42 ± 1.102
0.0LeuTrp: 0.0 ± 0.0
2.71LeuTyr: 2.71 ± 1.681
0.0LeuXaa: 0.0 ± 0.0
Met
2.71MetAla: 2.71 ± 0.579
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.355MetPhe: 1.355 ± 1.419
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.355MetIle: 1.355 ± 1.419
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
1.355MetMet: 1.355 ± 0.84
0.0MetAsn: 0.0 ± 0.0
1.355MetPro: 1.355 ± 1.419
1.355MetGln: 1.355 ± 0.84
1.355MetArg: 1.355 ± 0.84
5.42MetSer: 5.42 ± 3.361
0.0MetThr: 0.0 ± 0.0
1.355MetVal: 1.355 ± 0.84
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.065AsnAla: 4.065 ± 2.521
1.355AsnCys: 1.355 ± 0.84
0.0AsnAsp: 0.0 ± 0.0
1.355AsnGlu: 1.355 ± 0.84
4.065AsnPhe: 4.065 ± 1.997
5.42AsnGly: 5.42 ± 1.102
0.0AsnHis: 0.0 ± 0.0
1.355AsnIle: 1.355 ± 1.419
1.355AsnLys: 1.355 ± 1.419
2.71AsnLeu: 2.71 ± 2.838
0.0AsnMet: 0.0 ± 0.0
2.71AsnAsn: 2.71 ± 0.579
2.71AsnPro: 2.71 ± 0.579
1.355AsnGln: 1.355 ± 1.419
2.71AsnArg: 2.71 ± 0.579
1.355AsnSer: 1.355 ± 0.84
5.42AsnThr: 5.42 ± 3.416
1.355AsnVal: 1.355 ± 0.84
0.0AsnTrp: 0.0 ± 0.0
1.355AsnTyr: 1.355 ± 1.419
0.0AsnXaa: 0.0 ± 0.0
Pro
2.71ProAla: 2.71 ± 0.579
1.355ProCys: 1.355 ± 0.84
1.355ProAsp: 1.355 ± 0.84
2.71ProGlu: 2.71 ± 0.579
2.71ProPhe: 2.71 ± 2.838
5.42ProGly: 5.42 ± 3.416
1.355ProHis: 1.355 ± 1.419
2.71ProIle: 2.71 ± 1.681
0.0ProLys: 0.0 ± 0.0
5.42ProLeu: 5.42 ± 1.157
1.355ProMet: 1.355 ± 0.84
1.355ProAsn: 1.355 ± 1.419
1.355ProPro: 1.355 ± 0.84
0.0ProGln: 0.0 ± 0.0
6.775ProArg: 6.775 ± 4.202
4.065ProSer: 4.065 ± 1.997
6.775ProThr: 6.775 ± 4.835
8.13ProVal: 8.13 ± 0.523
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.71GlnAla: 2.71 ± 0.579
1.355GlnCys: 1.355 ± 0.84
0.0GlnAsp: 0.0 ± 0.0
5.42GlnGlu: 5.42 ± 1.157
1.355GlnPhe: 1.355 ± 0.84
0.0GlnGly: 0.0 ± 0.0
2.71GlnHis: 2.71 ± 1.681
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
5.42GlnLeu: 5.42 ± 1.102
1.355GlnMet: 1.355 ± 1.419
1.355GlnAsn: 1.355 ± 1.419
1.355GlnPro: 1.355 ± 1.419
1.355GlnGln: 1.355 ± 0.84
1.355GlnArg: 1.355 ± 0.84
0.0GlnSer: 0.0 ± 0.0
1.355GlnThr: 1.355 ± 1.419
4.065GlnVal: 4.065 ± 0.262
0.0GlnTrp: 0.0 ± 0.0
1.355GlnTyr: 1.355 ± 1.419
0.0GlnXaa: 0.0 ± 0.0
Arg
8.13ArgAla: 8.13 ± 0.523
2.71ArgCys: 2.71 ± 0.579
5.42ArgAsp: 5.42 ± 3.361
4.065ArgGlu: 4.065 ± 2.521
4.065ArgPhe: 4.065 ± 0.262
5.42ArgGly: 5.42 ± 1.102
0.0ArgHis: 0.0 ± 0.0
0.0ArgIle: 0.0 ± 0.0
6.775ArgLys: 6.775 ± 0.317
8.13ArgLeu: 8.13 ± 5.042
4.065ArgMet: 4.065 ± 2.521
2.71ArgAsn: 2.71 ± 0.579
5.42ArgPro: 5.42 ± 1.157
2.71ArgGln: 2.71 ± 0.579
6.775ArgArg: 6.775 ± 0.317
1.355ArgSer: 1.355 ± 0.84
0.0ArgThr: 0.0 ± 0.0
9.485ArgVal: 9.485 ± 3.623
2.71ArgTrp: 2.71 ± 0.579
6.775ArgTyr: 6.775 ± 1.942
0.0ArgXaa: 0.0 ± 0.0
Ser
5.42SerAla: 5.42 ± 3.416
2.71SerCys: 2.71 ± 2.838
1.355SerAsp: 1.355 ± 0.84
2.71SerGlu: 2.71 ± 1.681
4.065SerPhe: 4.065 ± 1.997
9.485SerGly: 9.485 ± 3.155
1.355SerHis: 1.355 ± 0.84
0.0SerIle: 0.0 ± 0.0
4.065SerLys: 4.065 ± 2.521
8.13SerLeu: 8.13 ± 2.783
0.0SerMet: 0.0 ± 0.602
0.0SerAsn: 0.0 ± 0.0
1.355SerPro: 1.355 ± 0.84
0.0SerGln: 0.0 ± 0.0
2.71SerArg: 2.71 ± 1.681
5.42SerSer: 5.42 ± 1.157
2.71SerThr: 2.71 ± 2.838
5.42SerVal: 5.42 ± 1.157
1.355SerTrp: 1.355 ± 0.84
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
6.775ThrAla: 6.775 ± 2.576
0.0ThrCys: 0.0 ± 0.0
5.42ThrAsp: 5.42 ± 3.416
1.355ThrGlu: 1.355 ± 0.84
1.355ThrPhe: 1.355 ± 0.84
2.71ThrGly: 2.71 ± 2.838
0.0ThrHis: 0.0 ± 0.0
5.42ThrIle: 5.42 ± 3.416
1.355ThrLys: 1.355 ± 1.419
6.775ThrLeu: 6.775 ± 0.317
1.355ThrMet: 1.355 ± 1.419
2.71ThrAsn: 2.71 ± 2.838
6.775ThrPro: 6.775 ± 2.576
0.0ThrGln: 0.0 ± 0.0
4.065ThrArg: 4.065 ± 0.262
5.42ThrSer: 5.42 ± 3.416
4.065ThrThr: 4.065 ± 1.997
1.355ThrVal: 1.355 ± 1.419
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.775ValAla: 6.775 ± 2.576
0.0ValCys: 0.0 ± 0.0
2.71ValAsp: 2.71 ± 0.579
8.13ValGlu: 8.13 ± 2.783
5.42ValPhe: 5.42 ± 1.102
14.905ValGly: 14.905 ± 0.207
1.355ValHis: 1.355 ± 1.419
0.0ValIle: 0.0 ± 0.0
5.42ValLys: 5.42 ± 3.361
4.065ValLeu: 4.065 ± 0.262
0.0ValMet: 0.0 ± 0.0
5.42ValAsn: 5.42 ± 1.157
2.71ValPro: 2.71 ± 1.681
2.71ValGln: 2.71 ± 0.579
8.13ValArg: 8.13 ± 2.783
5.42ValSer: 5.42 ± 1.157
1.355ValThr: 1.355 ± 1.419
9.485ValVal: 9.485 ± 1.364
4.065ValTrp: 4.065 ± 0.262
2.71ValTyr: 2.71 ± 0.579
0.0ValXaa: 0.0 ± 0.0
Trp
1.355TrpAla: 1.355 ± 0.84
0.0TrpCys: 0.0 ± 0.0
1.355TrpAsp: 1.355 ± 0.84
4.065TrpGlu: 4.065 ± 0.262
1.355TrpPhe: 1.355 ± 0.84
1.355TrpGly: 1.355 ± 0.84
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.355TrpLeu: 1.355 ± 0.84
0.0TrpMet: 0.0 ± 0.0
1.355TrpAsn: 1.355 ± 1.419
0.0TrpPro: 0.0 ± 0.0
1.355TrpGln: 1.355 ± 1.419
1.355TrpArg: 1.355 ± 0.84
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.355TrpVal: 1.355 ± 0.84
0.0TrpTrp: 0.0 ± 0.0
1.355TrpTyr: 1.355 ± 0.84
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.71TyrAla: 2.71 ± 0.579
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
4.065TyrGlu: 4.065 ± 0.262
0.0TyrPhe: 0.0 ± 0.0
1.355TyrGly: 1.355 ± 0.84
1.355TyrHis: 1.355 ± 0.84
1.355TyrIle: 1.355 ± 1.419
2.71TyrLys: 2.71 ± 1.681
2.71TyrLeu: 2.71 ± 1.681
1.355TyrMet: 1.355 ± 1.419
5.42TyrAsn: 5.42 ± 1.157
1.355TyrPro: 1.355 ± 0.84
0.0TyrGln: 0.0 ± 0.0
1.355TyrArg: 1.355 ± 0.84
2.71TyrSer: 2.71 ± 1.681
4.065TyrThr: 4.065 ± 0.262
2.71TyrVal: 2.71 ± 0.579
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (739 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski