Amino acid dipeptide frequency for Wenzhou picorna-like virus 21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones are marked in blue in the table.
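The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify`, the use of one-letter codes, the choice to count overlapping pairs within each protein, and the strict comparison against the uniform expectation are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Map every character outside the standard alphabet to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input, not taken from the proteome described here.
freqs = dipeptide_permille(["ACACA", "AAX"])
print(len(freqs), classify(5.952), classify(0.425))
```

With one-letter codes, the AlaAla entry corresponds to the key `"AA"`. The colouring rule actually used for the table may differ from the simple threshold shown here.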

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
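As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical, and the input is the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.952  # AlaAla frequency in per mille, as listed in the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰.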

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
First residue (rows), with each entry given as frequency ± error (‰):
Ala
AlaAla: 5.952 ± 3.252
AlaCys: 1.276 ± 0.049
AlaAsp: 4.252 ± 0.273
AlaGlu: 3.401 ± 0.565
AlaPhe: 1.701 ± 0.37
AlaGly: 3.401 ± 1.392
AlaHis: 2.551 ± 0.097
AlaIle: 1.701 ± 0.282
AlaLys: 2.126 ± 1.168
AlaLeu: 5.527 ± 1.081
AlaMet: 1.701 ± 0.37
AlaAsn: 2.976 ± 0.974
AlaPro: 2.976 ± 0.321
AlaGln: 4.677 ± 0.691
AlaArg: 4.252 ± 0.273
AlaSer: 4.677 ± 0.691
AlaThr: 3.827 ± 0.506
AlaVal: 6.378 ± 0.243
AlaTrp: 0.85 ± 0.185
AlaTyr: 1.276 ± 0.604
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.425 ± 0.419
CysCys: 1.276 ± 0.049
CysAsp: 1.701 ± 0.37
CysGlu: 1.276 ± 0.604
CysPhe: 1.276 ± 0.049
CysGly: 2.126 ± 0.789
CysHis: 0.425 ± 0.234
CysIle: 0.85 ± 0.467
CysLys: 2.126 ± 1.168
CysLeu: 0.85 ± 0.185
CysMet: 0.425 ± 0.419
CysAsn: 0.85 ± 0.185
CysPro: 0.425 ± 0.419
CysGln: 0.0 ± 0.0
CysArg: 0.425 ± 0.419
CysSer: 1.701 ± 0.37
CysThr: 0.425 ± 0.419
CysVal: 1.276 ± 0.049
CysTrp: 1.701 ± 0.935
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.551 ± 0.097
AspCys: 0.0 ± 0.0
AspAsp: 4.677 ± 0.613
AspGlu: 4.252 ± 1.032
AspPhe: 2.976 ± 0.983
AspGly: 2.976 ± 0.321
AspHis: 1.276 ± 0.701
AspIle: 3.401 ± 0.74
AspLys: 4.677 ± 2.57
AspLeu: 6.803 ± 0.477
AspMet: 2.126 ± 0.136
AspAsn: 2.126 ± 0.516
AspPro: 3.827 ± 2.463
AspGln: 1.701 ± 0.282
AspArg: 1.276 ± 0.701
AspSer: 1.701 ± 0.37
AspThr: 4.252 ± 0.925
AspVal: 3.827 ± 0.146
AspTrp: 0.425 ± 0.234
AspTyr: 1.701 ± 0.37
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.677 ± 1.918
GluCys: 2.126 ± 0.136
GluAsp: 2.126 ± 0.516
GluGlu: 5.102 ± 0.847
GluPhe: 2.126 ± 0.136
GluGly: 1.276 ± 0.701
GluHis: 0.425 ± 0.419
GluIle: 5.527 ± 0.428
GluLys: 4.677 ± 1.266
GluLeu: 5.102 ± 1.499
GluMet: 2.976 ± 0.983
GluAsn: 2.126 ± 0.136
GluPro: 1.276 ± 0.701
GluGln: 2.551 ± 0.097
GluArg: 3.827 ± 0.798
GluSer: 4.677 ± 3.3
GluThr: 2.551 ± 0.75
GluVal: 1.701 ± 1.022
GluTrp: 0.425 ± 0.234
GluTyr: 3.401 ± 0.088
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.551 ± 1.86
PheCys: 0.85 ± 0.185
PheAsp: 4.677 ± 0.691
PheGlu: 1.276 ± 0.049
PhePhe: 1.276 ± 0.049
PheGly: 2.126 ± 0.136
PheHis: 0.425 ± 0.234
PheIle: 2.976 ± 0.331
PheLys: 3.401 ± 1.217
PheLeu: 5.527 ± 1.733
PheMet: 2.126 ± 0.136
PheAsn: 3.827 ± 1.451
PhePro: 1.701 ± 0.935
PheGln: 2.551 ± 0.097
PheArg: 2.976 ± 0.321
PheSer: 6.378 ± 1.714
PheThr: 2.551 ± 1.207
PheVal: 3.401 ± 0.088
PheTrp: 0.0 ± 0.0
PheTyr: 0.425 ± 0.234
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.976 ± 0.974
GlyCys: 1.276 ± 0.049
GlyAsp: 2.976 ± 0.321
GlyGlu: 4.677 ± 0.613
GlyPhe: 3.401 ± 0.565
GlyGly: 1.276 ± 1.256
GlyHis: 0.425 ± 0.234
GlyIle: 4.677 ± 0.613
GlyLys: 4.252 ± 0.925
GlyLeu: 2.976 ± 0.983
GlyMet: 2.976 ± 0.46
GlyAsn: 3.401 ± 1.392
GlyPro: 2.126 ± 0.516
GlyGln: 2.126 ± 0.136
GlyArg: 4.677 ± 0.691
GlySer: 5.527 ± 2.181
GlyThr: 2.976 ± 0.321
GlyVal: 5.527 ± 1.081
GlyTrp: 0.425 ± 0.234
GlyTyr: 2.551 ± 0.555
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.276 ± 0.701
HisCys: 0.0 ± 0.0
HisAsp: 1.276 ± 0.701
HisGlu: 0.85 ± 0.467
HisPhe: 1.701 ± 1.022
HisGly: 2.126 ± 0.136
HisHis: 0.85 ± 0.467
HisIle: 0.0 ± 0.0
HisLys: 2.126 ± 1.168
HisLeu: 2.551 ± 0.75
HisMet: 0.425 ± 0.419
HisAsn: 1.276 ± 0.604
HisPro: 1.276 ± 0.604
HisGln: 0.85 ± 0.467
HisArg: 0.425 ± 0.234
HisSer: 2.126 ± 0.516
HisThr: 0.85 ± 0.185
HisVal: 2.126 ± 0.516
HisTrp: 1.276 ± 0.604
HisTyr: 0.425 ± 0.234
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.126 ± 0.136
IleCys: 1.701 ± 0.282
IleAsp: 4.252 ± 0.273
IleGlu: 3.401 ± 1.392
IlePhe: 4.677 ± 1.266
IleGly: 5.527 ± 0.876
IleHis: 0.85 ± 0.467
IleIle: 2.551 ± 1.402
IleLys: 3.827 ± 1.451
IleLeu: 3.401 ± 1.217
IleMet: 2.551 ± 1.402
IleAsn: 1.701 ± 1.675
IlePro: 1.701 ± 1.022
IleGln: 1.276 ± 1.256
IleArg: 3.401 ± 1.217
IleSer: 4.677 ± 1.266
IleThr: 4.252 ± 0.38
IleVal: 5.102 ± 0.195
IleTrp: 0.425 ± 0.419
IleTyr: 5.527 ± 1.081
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.551 ± 0.097
LysCys: 0.425 ± 0.419
LysAsp: 5.102 ± 2.804
LysGlu: 2.976 ± 0.321
LysPhe: 1.701 ± 0.282
LysGly: 4.677 ± 1.266
LysHis: 2.551 ± 1.402
LysIle: 2.551 ± 1.402
LysLys: 3.401 ± 1.217
LysLeu: 4.677 ± 0.039
LysMet: 0.85 ± 0.185
LysAsn: 3.827 ± 0.798
LysPro: 2.976 ± 0.983
LysGln: 2.126 ± 0.136
LysArg: 5.102 ± 0.458
LysSer: 3.827 ± 2.103
LysThr: 2.976 ± 0.983
LysVal: 4.677 ± 1.266
LysTrp: 0.0 ± 0.0
LysTyr: 2.551 ± 1.402
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.803 ± 0.175
LeuCys: 2.126 ± 0.516
LeuAsp: 2.976 ± 0.983
LeuGlu: 5.102 ± 2.804
LeuPhe: 2.976 ± 1.636
LeuGly: 5.952 ± 1.314
LeuHis: 1.701 ± 0.282
LeuIle: 4.677 ± 1.266
LeuLys: 4.252 ± 2.337
LeuLeu: 4.252 ± 1.032
LeuMet: 3.827 ± 1.451
LeuAsn: 6.803 ± 1.782
LeuPro: 1.276 ± 0.701
LeuGln: 1.701 ± 0.37
LeuArg: 1.276 ± 0.604
LeuSer: 5.952 ± 0.662
LeuThr: 5.102 ± 0.195
LeuVal: 4.252 ± 0.925
LeuTrp: 0.425 ± 0.419
LeuTyr: 1.701 ± 0.37
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.976 ± 0.983
MetCys: 0.85 ± 0.467
MetAsp: 1.701 ± 1.022
MetGlu: 0.85 ± 0.185
MetPhe: 2.126 ± 0.789
MetGly: 2.551 ± 0.097
MetHis: 1.276 ± 0.701
MetIle: 4.252 ± 0.38
MetLys: 1.701 ± 0.282
MetLeu: 2.551 ± 1.402
MetMet: 0.85 ± 0.185
MetAsn: 2.126 ± 0.136
MetPro: 2.976 ± 0.983
MetGln: 0.85 ± 0.467
MetArg: 2.551 ± 0.097
MetSer: 1.701 ± 0.282
MetThr: 2.551 ± 1.86
MetVal: 1.701 ± 1.022
MetTrp: 0.0 ± 0.0
MetTyr: 0.425 ± 0.419
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.401 ± 0.74
AsnCys: 1.276 ± 0.049
AsnAsp: 4.252 ± 0.925
AsnGlu: 3.401 ± 0.74
AsnPhe: 2.976 ± 0.331
AsnGly: 3.827 ± 0.146
AsnHis: 0.85 ± 0.837
AsnIle: 5.952 ± 0.643
AsnLys: 2.551 ± 0.097
AsnLeu: 3.827 ± 0.798
AsnMet: 0.85 ± 0.185
AsnAsn: 2.551 ± 0.75
AsnPro: 2.126 ± 0.516
AsnGln: 1.276 ± 0.604
AsnArg: 2.551 ± 0.097
AsnSer: 2.976 ± 0.331
AsnThr: 3.827 ± 0.146
AsnVal: 3.827 ± 1.159
AsnTrp: 0.85 ± 0.185
AsnTyr: 2.126 ± 1.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.401 ± 0.088
ProCys: 0.85 ± 0.185
ProAsp: 0.85 ± 0.467
ProGlu: 2.551 ± 0.555
ProPhe: 1.701 ± 1.022
ProGly: 1.701 ± 0.37
ProHis: 1.701 ± 0.37
ProIle: 1.276 ± 0.049
ProLys: 1.276 ± 0.604
ProLeu: 5.102 ± 1.499
ProMet: 1.276 ± 0.214
ProAsn: 3.401 ± 0.088
ProPro: 0.85 ± 0.467
ProGln: 0.85 ± 0.185
ProArg: 2.126 ± 0.516
ProSer: 3.827 ± 0.506
ProThr: 4.252 ± 2.23
ProVal: 1.276 ± 0.049
ProTrp: 2.126 ± 0.789
ProTyr: 2.126 ± 0.136
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.126 ± 0.136
GlnCys: 0.425 ± 0.234
GlnAsp: 1.276 ± 0.701
GlnGlu: 2.551 ± 0.097
GlnPhe: 1.701 ± 1.022
GlnGly: 1.276 ± 0.049
GlnHis: 1.701 ± 0.282
GlnIle: 2.551 ± 0.555
GlnLys: 1.276 ± 0.049
GlnLeu: 1.276 ± 0.604
GlnMet: 2.551 ± 1.207
GlnAsn: 0.85 ± 0.185
GlnPro: 1.701 ± 0.37
GlnGln: 1.701 ± 0.282
GlnArg: 0.85 ± 0.467
GlnSer: 1.276 ± 0.049
GlnThr: 2.126 ± 1.441
GlnVal: 2.551 ± 0.097
GlnTrp: 0.425 ± 0.419
GlnTyr: 2.126 ± 0.516
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.252 ± 1.684
ArgCys: 0.85 ± 0.467
ArgAsp: 2.551 ± 0.555
ArgGlu: 3.401 ± 0.74
ArgPhe: 3.401 ± 0.088
ArgGly: 1.701 ± 0.37
ArgHis: 0.425 ± 0.419
ArgIle: 2.976 ± 0.321
ArgLys: 3.401 ± 1.217
ArgLeu: 5.102 ± 0.847
ArgMet: 0.425 ± 0.419
ArgAsn: 2.551 ± 0.097
ArgPro: 1.276 ± 0.604
ArgGln: 2.126 ± 0.516
ArgArg: 2.976 ± 0.321
ArgSer: 2.976 ± 0.983
ArgThr: 1.701 ± 0.282
ArgVal: 2.976 ± 0.331
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.976 ± 0.321
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.803 ± 0.175
SerCys: 0.85 ± 0.837
SerAsp: 2.976 ± 0.331
SerGlu: 4.677 ± 0.613
SerPhe: 2.551 ± 1.402
SerGly: 8.503 ± 0.759
SerHis: 1.276 ± 0.049
SerIle: 5.527 ± 0.224
SerLys: 5.952 ± 0.662
SerLeu: 2.976 ± 0.321
SerMet: 2.551 ± 0.75
SerAsn: 3.401 ± 2.697
SerPro: 4.252 ± 2.23
SerGln: 1.276 ± 0.049
SerArg: 4.252 ± 1.032
SerSer: 5.102 ± 0.458
SerThr: 4.252 ± 1.577
SerVal: 3.827 ± 1.811
SerTrp: 0.85 ± 0.185
SerTyr: 3.401 ± 0.74
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.827 ± 1.811
ThrCys: 0.85 ± 0.837
ThrAsp: 2.551 ± 0.555
ThrGlu: 2.551 ± 0.097
ThrPhe: 3.401 ± 0.565
ThrGly: 2.976 ± 1.626
ThrHis: 0.85 ± 0.467
ThrIle: 4.677 ± 0.039
ThrLys: 1.276 ± 0.604
ThrLeu: 4.252 ± 0.38
ThrMet: 1.701 ± 0.37
ThrAsn: 3.401 ± 1.392
ThrPro: 2.976 ± 0.974
ThrGln: 0.425 ± 0.419
ThrArg: 2.126 ± 0.136
ThrSer: 7.228 ± 1.246
ThrThr: 3.401 ± 2.045
ThrVal: 5.527 ± 0.428
ThrTrp: 0.85 ± 0.185
ThrTyr: 1.701 ± 1.022
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.827 ± 0.506
ValCys: 1.701 ± 1.022
ValAsp: 2.976 ± 0.321
ValGlu: 3.401 ± 1.217
ValPhe: 4.252 ± 0.273
ValGly: 5.952 ± 0.01
ValHis: 2.126 ± 0.789
ValIle: 2.976 ± 0.331
ValLys: 5.102 ± 0.847
ValLeu: 3.401 ± 0.565
ValMet: 2.976 ± 0.331
ValAsn: 4.677 ± 0.039
ValPro: 6.378 ± 1.061
ValGln: 2.126 ± 0.516
ValArg: 0.85 ± 0.185
ValSer: 5.952 ± 0.01
ValThr: 2.551 ± 1.207
ValVal: 5.952 ± 0.662
ValTrp: 1.276 ± 0.604
ValTyr: 1.701 ± 0.282
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.701 ± 0.282
TrpGlu: 0.425 ± 0.234
TrpPhe: 0.85 ± 0.837
TrpGly: 0.425 ± 0.419
TrpHis: 0.425 ± 0.419
TrpIle: 1.276 ± 0.701
TrpLys: 1.276 ± 0.049
TrpLeu: 1.276 ± 0.049
TrpMet: 0.425 ± 0.419
TrpAsn: 0.85 ± 0.185
TrpPro: 0.425 ± 0.419
TrpGln: 0.85 ± 0.837
TrpArg: 0.0 ± 0.0
TrpSer: 0.425 ± 0.419
TrpThr: 1.276 ± 0.049
TrpVal: 0.425 ± 0.234
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.85 ± 0.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.976 ± 1.626
TyrCys: 0.85 ± 0.185
TyrAsp: 1.701 ± 0.935
TyrGlu: 2.551 ± 1.402
TyrPhe: 4.252 ± 1.577
TyrGly: 0.85 ± 0.837
TyrHis: 1.701 ± 0.282
TyrIle: 2.551 ± 0.555
TyrLys: 0.85 ± 0.185
TyrLeu: 1.701 ± 0.935
TyrMet: 2.551 ± 0.75
TyrAsn: 2.551 ± 0.097
TyrPro: 0.425 ± 0.234
TyrGln: 1.276 ± 1.256
TyrArg: 2.126 ± 0.516
TyrSer: 2.551 ± 0.555
TyrThr: 0.85 ± 0.185
TyrVal: 3.827 ± 0.506
TyrTrp: 0.85 ± 0.185
TyrTyr: 1.276 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2353 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
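A protein-level bootstrap of this kind could look like the sketch below. The source only states that 100 resamples were drawn at the protein level, so everything else here is an assumption: the function names, the use of one-letter codes, resampling whole proteins with replacement, and reporting the population standard deviation of the replicate estimates as the error.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed procedure)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)


# Toy sequences, not the two proteins of this proteome.
toy_proteome = ["AAAACCCC", "ACACACAC"]
print(bootstrap_error(toy_proteome, "AA"))
```

With only two proteins in the proteome, each resample can contain just a few distinct combinations, so the resulting error estimates are necessarily coarse.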

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski