Amino acid dipepetide frequency for Changjiang picorna-like virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.936AlaAla: 6.936 ± 0.227
0.816AlaCys: 0.816 ± 0.43
3.672AlaAsp: 3.672 ± 1.3
3.672AlaGlu: 3.672 ± 0.641
3.264AlaPhe: 3.264 ± 1.073
4.08AlaGly: 4.08 ± 0.438
0.816AlaHis: 0.816 ± 0.43
4.08AlaIle: 4.08 ± 0.856
2.04AlaLys: 2.04 ± 0.428
3.672AlaLeu: 3.672 ± 0.641
2.448AlaMet: 2.448 ± 0.45
1.224AlaAsn: 1.224 ± 0.002
4.488AlaPro: 4.488 ± 0.424
1.632AlaGln: 1.632 ± 0.434
1.632AlaArg: 1.632 ± 0.434
7.344AlaSer: 7.344 ± 3.893
4.08AlaThr: 4.08 ± 2.379
2.856AlaVal: 2.856 ± 0.436
0.816AlaTrp: 0.816 ± 0.217
2.448AlaTyr: 2.448 ± 0.643
0.0AlaXaa: 0.0 ± 0.0
Cys
1.224CysAla: 1.224 ± 0.002
0.408CysCys: 0.408 ± 0.215
0.408CysAsp: 0.408 ± 0.432
2.04CysGlu: 2.04 ± 0.219
0.816CysPhe: 0.816 ± 0.217
2.04CysGly: 2.04 ± 1.075
0.408CysHis: 0.408 ± 0.215
0.0CysIle: 0.0 ± 0.0
2.04CysLys: 2.04 ± 1.075
1.632CysLeu: 1.632 ± 0.86
0.408CysMet: 0.408 ± 0.215
0.816CysAsn: 0.816 ± 0.217
0.408CysPro: 0.408 ± 0.215
0.0CysGln: 0.0 ± 0.0
0.816CysArg: 0.816 ± 0.43
0.816CysSer: 0.816 ± 0.217
0.0CysThr: 0.0 ± 0.0
0.816CysVal: 0.816 ± 0.43
0.0CysTrp: 0.0 ± 0.0
1.632CysTyr: 1.632 ± 0.434
0.0CysXaa: 0.0 ± 0.0
Asp
1.632AspAla: 1.632 ± 0.86
0.816AspCys: 0.816 ± 0.43
4.488AspAsp: 4.488 ± 2.365
3.672AspGlu: 3.672 ± 0.006
2.448AspPhe: 2.448 ± 0.643
2.856AspGly: 2.856 ± 0.211
0.816AspHis: 0.816 ± 0.43
5.304AspIle: 5.304 ± 0.44
2.856AspLys: 2.856 ± 1.505
6.936AspLeu: 6.936 ± 0.227
1.224AspMet: 1.224 ± 0.645
4.488AspAsn: 4.488 ± 1.071
3.672AspPro: 3.672 ± 1.947
1.224AspGln: 1.224 ± 0.002
1.224AspArg: 1.224 ± 0.645
2.856AspSer: 2.856 ± 0.436
4.896AspThr: 4.896 ± 0.655
4.896AspVal: 4.896 ± 0.639
0.408AspTrp: 0.408 ± 0.215
3.264AspTyr: 3.264 ± 0.868
0.0AspXaa: 0.0 ± 0.0
Glu
1.224GluAla: 1.224 ± 0.645
0.408GluCys: 0.408 ± 0.215
3.672GluAsp: 3.672 ± 1.935
2.448GluGlu: 2.448 ± 0.004
2.04GluPhe: 2.04 ± 0.219
0.816GluGly: 0.816 ± 0.217
1.224GluHis: 1.224 ± 0.002
2.856GluIle: 2.856 ± 0.436
3.264GluLys: 3.264 ± 1.073
3.672GluLeu: 3.672 ± 0.006
0.816GluMet: 0.816 ± 0.43
3.672GluAsn: 3.672 ± 0.006
0.816GluPro: 0.816 ± 0.217
2.04GluGln: 2.04 ± 0.428
2.448GluArg: 2.448 ± 0.004
5.712GluSer: 5.712 ± 0.225
2.856GluThr: 2.856 ± 0.211
4.08GluVal: 4.08 ± 0.209
1.224GluTrp: 1.224 ± 0.002
2.04GluTyr: 2.04 ± 0.428
0.0GluXaa: 0.0 ± 0.0
Phe
4.488PheAla: 4.488 ± 1.071
1.224PheCys: 1.224 ± 0.645
2.856PheAsp: 2.856 ± 0.211
2.448PheGlu: 2.448 ± 1.29
2.856PhePhe: 2.856 ± 0.211
2.856PheGly: 2.856 ± 0.211
1.224PheHis: 1.224 ± 0.002
2.04PheIle: 2.04 ± 0.428
3.672PheLys: 3.672 ± 1.288
6.528PheLeu: 6.528 ± 0.852
0.816PheMet: 0.816 ± 0.43
3.672PheAsn: 3.672 ± 0.006
2.448PhePro: 2.448 ± 0.004
1.224PheGln: 1.224 ± 0.002
3.264PheArg: 3.264 ± 0.221
4.488PheSer: 4.488 ± 0.424
3.672PheThr: 3.672 ± 0.653
4.08PheVal: 4.08 ± 0.856
0.816PheTrp: 0.816 ± 0.217
2.04PheTyr: 2.04 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
4.08GlyAla: 4.08 ± 1.732
0.408GlyCys: 0.408 ± 0.432
5.304GlyAsp: 5.304 ± 0.207
1.632GlyGlu: 1.632 ± 1.081
2.856GlyPhe: 2.856 ± 0.211
0.816GlyGly: 0.816 ± 0.217
0.816GlyHis: 0.816 ± 0.43
4.896GlyIle: 4.896 ± 0.008
1.224GlyLys: 1.224 ± 0.645
2.448GlyLeu: 2.448 ± 0.004
1.224GlyMet: 1.224 ± 0.645
3.672GlyAsn: 3.672 ± 0.006
1.224GlyPro: 1.224 ± 0.649
2.04GlyGln: 2.04 ± 0.428
2.856GlyArg: 2.856 ± 0.436
5.712GlySer: 5.712 ± 0.872
4.08GlyThr: 4.08 ± 0.438
3.264GlyVal: 3.264 ± 0.221
0.408GlyTrp: 0.408 ± 0.215
4.488GlyTyr: 4.488 ± 0.223
0.0GlyXaa: 0.0 ± 0.0
His
0.408HisAla: 0.408 ± 0.215
0.816HisCys: 0.816 ± 0.217
0.0HisAsp: 0.0 ± 0.0
0.816HisGlu: 0.816 ± 0.43
0.816HisPhe: 0.816 ± 0.43
1.224HisGly: 1.224 ± 0.645
0.408HisHis: 0.408 ± 0.215
2.448HisIle: 2.448 ± 0.651
1.632HisLys: 1.632 ± 0.213
2.448HisLeu: 2.448 ± 1.29
0.816HisMet: 0.816 ± 0.351
0.408HisAsn: 0.408 ± 0.215
1.632HisPro: 1.632 ± 0.213
0.0HisGln: 0.0 ± 0.0
0.408HisArg: 0.408 ± 0.432
1.224HisSer: 1.224 ± 0.645
1.224HisThr: 1.224 ± 0.645
1.632HisVal: 1.632 ± 0.213
0.408HisTrp: 0.408 ± 0.215
0.408HisTyr: 0.408 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
7.344IleAla: 7.344 ± 1.953
1.224IleCys: 1.224 ± 0.645
4.08IleAsp: 4.08 ± 0.209
4.896IleGlu: 4.896 ± 0.008
1.224IlePhe: 1.224 ± 0.002
4.08IleGly: 4.08 ± 0.856
1.632IleHis: 1.632 ± 0.86
4.896IleIle: 4.896 ± 0.008
3.264IleLys: 3.264 ± 1.073
4.488IleLeu: 4.488 ± 1.718
1.224IleMet: 1.224 ± 0.002
6.12IleAsn: 6.12 ± 1.951
3.264IlePro: 3.264 ± 0.868
1.632IleGln: 1.632 ± 0.434
2.04IleArg: 2.04 ± 0.219
8.568IleSer: 8.568 ± 3.895
3.264IleThr: 3.264 ± 1.073
2.856IleVal: 2.856 ± 0.858
1.632IleTrp: 1.632 ± 0.213
5.712IleTyr: 5.712 ± 0.422
0.0IleXaa: 0.0 ± 0.0
Lys
2.856LysAla: 2.856 ± 0.858
0.816LysCys: 0.816 ± 0.43
2.856LysAsp: 2.856 ± 0.211
1.632LysGlu: 1.632 ± 0.213
4.488LysPhe: 4.488 ± 0.424
2.04LysGly: 2.04 ± 1.075
1.224LysHis: 1.224 ± 0.002
5.304LysIle: 5.304 ± 0.44
2.856LysLys: 2.856 ± 0.436
3.672LysLeu: 3.672 ± 0.641
0.816LysMet: 0.816 ± 0.43
2.448LysAsn: 2.448 ± 0.004
2.04LysPro: 2.04 ± 0.219
0.816LysGln: 0.816 ± 0.43
2.04LysArg: 2.04 ± 1.075
4.08LysSer: 4.08 ± 1.503
3.264LysThr: 3.264 ± 1.72
3.264LysVal: 3.264 ± 0.221
0.408LysTrp: 0.408 ± 0.432
0.408LysTyr: 0.408 ± 0.215
0.0LysXaa: 0.0 ± 0.0
Leu
3.672LeuAla: 3.672 ± 0.653
1.632LeuCys: 1.632 ± 0.86
4.896LeuAsp: 4.896 ± 1.933
3.264LeuGlu: 3.264 ± 1.073
4.488LeuPhe: 4.488 ± 0.424
2.04LeuGly: 2.04 ± 0.219
2.448LeuHis: 2.448 ± 1.29
5.304LeuIle: 5.304 ± 0.207
6.12LeuLys: 6.12 ± 0.01
6.528LeuLeu: 6.528 ± 1.499
0.816LeuMet: 0.816 ± 0.217
5.712LeuAsn: 5.712 ± 0.872
2.448LeuPro: 2.448 ± 0.651
3.672LeuGln: 3.672 ± 0.006
3.672LeuArg: 3.672 ± 0.653
8.976LeuSer: 8.976 ± 0.201
6.12LeuThr: 6.12 ± 1.304
3.672LeuVal: 3.672 ± 0.006
0.408LeuTrp: 0.408 ± 0.432
2.448LeuTyr: 2.448 ± 0.643
0.0LeuXaa: 0.0 ± 0.0
Met
1.224MetAla: 1.224 ± 0.645
0.408MetCys: 0.408 ± 0.215
0.816MetAsp: 0.816 ± 0.43
1.632MetGlu: 1.632 ± 0.213
2.448MetPhe: 2.448 ± 0.004
1.224MetGly: 1.224 ± 0.649
0.408MetHis: 0.408 ± 0.215
1.632MetIle: 1.632 ± 0.213
1.632MetLys: 1.632 ± 0.86
1.632MetLeu: 1.632 ± 0.86
0.408MetMet: 0.408 ± 0.215
0.816MetAsn: 0.816 ± 0.43
0.816MetPro: 0.816 ± 0.43
0.816MetGln: 0.816 ± 0.217
2.448MetArg: 2.448 ± 0.643
1.632MetSer: 1.632 ± 0.213
2.448MetThr: 2.448 ± 1.945
0.816MetVal: 0.816 ± 0.217
0.408MetTrp: 0.408 ± 0.432
0.408MetTyr: 0.408 ± 0.432
0.0MetXaa: 0.0 ± 0.0
Asn
4.896AsnAla: 4.896 ± 0.008
0.408AsnCys: 0.408 ± 0.215
2.856AsnAsp: 2.856 ± 0.858
1.632AsnGlu: 1.632 ± 0.434
2.04AsnPhe: 2.04 ± 0.219
5.304AsnGly: 5.304 ± 0.854
1.632AsnHis: 1.632 ± 0.86
5.304AsnIle: 5.304 ± 1.734
2.448AsnLys: 2.448 ± 0.004
6.528AsnLeu: 6.528 ± 1.089
2.448AsnMet: 2.448 ± 0.004
3.264AsnAsn: 3.264 ± 0.868
3.264AsnPro: 3.264 ± 0.221
1.632AsnGln: 1.632 ± 1.081
0.0AsnArg: 0.0 ± 0.0
4.488AsnSer: 4.488 ± 0.424
2.448AsnThr: 2.448 ± 0.004
3.264AsnVal: 3.264 ± 1.515
0.408AsnTrp: 0.408 ± 0.432
4.08AsnTyr: 4.08 ± 0.856
0.0AsnXaa: 0.0 ± 0.0
Pro
2.448ProAla: 2.448 ± 1.298
0.0ProCys: 0.0 ± 0.0
2.856ProAsp: 2.856 ± 1.73
1.224ProGlu: 1.224 ± 0.002
4.488ProPhe: 4.488 ± 0.223
3.672ProGly: 3.672 ± 0.006
1.224ProHis: 1.224 ± 0.649
3.672ProIle: 3.672 ± 1.3
1.632ProLys: 1.632 ± 0.434
2.856ProLeu: 2.856 ± 0.436
2.04ProMet: 2.04 ± 1.513
2.04ProAsn: 2.04 ± 0.428
2.04ProPro: 2.04 ± 0.428
1.224ProGln: 1.224 ± 0.002
2.04ProArg: 2.04 ± 0.219
2.856ProSer: 2.856 ± 0.211
1.632ProThr: 1.632 ± 1.081
2.856ProVal: 2.856 ± 0.211
1.224ProTrp: 1.224 ± 0.002
1.224ProTyr: 1.224 ± 0.649
0.0ProXaa: 0.0 ± 0.0
Gln
2.448GlnAla: 2.448 ± 0.004
0.408GlnCys: 0.408 ± 0.432
1.632GlnAsp: 1.632 ± 0.434
0.408GlnGlu: 0.408 ± 0.215
0.816GlnPhe: 0.816 ± 0.43
2.856GlnGly: 2.856 ± 0.436
0.408GlnHis: 0.408 ± 0.215
2.04GlnIle: 2.04 ± 0.428
0.816GlnLys: 0.816 ± 0.217
2.448GlnLeu: 2.448 ± 1.29
0.0GlnMet: 0.0 ± 0.0
2.448GlnAsn: 2.448 ± 0.643
0.408GlnPro: 0.408 ± 0.432
2.448GlnGln: 2.448 ± 0.643
1.224GlnArg: 1.224 ± 0.002
3.672GlnSer: 3.672 ± 1.288
2.04GlnThr: 2.04 ± 0.866
2.448GlnVal: 2.448 ± 0.651
0.408GlnTrp: 0.408 ± 0.215
3.672GlnTyr: 3.672 ± 0.653
0.0GlnXaa: 0.0 ± 0.0
Arg
1.632ArgAla: 1.632 ± 0.213
0.816ArgCys: 0.816 ± 0.43
1.632ArgAsp: 1.632 ± 0.86
3.264ArgGlu: 3.264 ± 1.073
4.08ArgPhe: 4.08 ± 0.438
4.488ArgGly: 4.488 ± 1.517
0.408ArgHis: 0.408 ± 0.432
2.448ArgIle: 2.448 ± 1.298
1.632ArgLys: 1.632 ± 0.86
4.08ArgLeu: 4.08 ± 1.085
2.04ArgMet: 2.04 ± 0.428
2.04ArgAsn: 2.04 ± 0.428
2.04ArgPro: 2.04 ± 1.513
1.224ArgGln: 1.224 ± 0.649
2.448ArgArg: 2.448 ± 0.004
3.264ArgSer: 3.264 ± 0.426
1.632ArgThr: 1.632 ± 0.434
4.488ArgVal: 4.488 ± 0.424
0.408ArgTrp: 0.408 ± 0.432
0.816ArgTyr: 0.816 ± 0.217
0.0ArgXaa: 0.0 ± 0.0
Ser
5.304SerAla: 5.304 ± 1.087
0.816SerCys: 0.816 ± 0.864
3.264SerAsp: 3.264 ± 0.426
4.08SerGlu: 4.08 ± 0.209
6.12SerPhe: 6.12 ± 1.931
3.264SerGly: 3.264 ± 1.515
0.0SerHis: 0.0 ± 0.0
5.712SerIle: 5.712 ± 0.422
4.08SerLys: 4.08 ± 0.438
6.936SerLeu: 6.936 ± 1.521
1.632SerMet: 1.632 ± 0.434
4.488SerAsn: 4.488 ± 0.424
3.264SerPro: 3.264 ± 1.515
4.896SerGln: 4.896 ± 0.639
6.12SerArg: 6.12 ± 0.01
6.12SerSer: 6.12 ± 0.657
5.304SerThr: 5.304 ± 1.087
7.344SerVal: 7.344 ± 0.012
1.224SerTrp: 1.224 ± 0.649
5.304SerTyr: 5.304 ± 1.087
0.0SerXaa: 0.0 ± 0.0
Thr
3.672ThrAla: 3.672 ± 1.947
1.632ThrCys: 1.632 ± 0.86
4.896ThrAsp: 4.896 ± 0.008
2.448ThrGlu: 2.448 ± 0.643
2.448ThrPhe: 2.448 ± 0.004
2.04ThrGly: 2.04 ± 1.513
0.816ThrHis: 0.816 ± 0.43
6.12ThrIle: 6.12 ± 1.284
2.856ThrLys: 2.856 ± 0.436
4.08ThrLeu: 4.08 ± 0.209
0.816ThrMet: 0.816 ± 0.217
4.08ThrAsn: 4.08 ± 1.732
4.08ThrPro: 4.08 ± 2.379
1.632ThrGln: 1.632 ± 0.213
3.264ThrArg: 3.264 ± 0.426
5.304ThrSer: 5.304 ± 1.087
5.712ThrThr: 5.712 ± 0.872
4.896ThrVal: 4.896 ± 0.008
1.632ThrTrp: 1.632 ± 0.213
2.448ThrTyr: 2.448 ± 0.004
0.0ThrXaa: 0.0 ± 0.0
Val
3.672ValAla: 3.672 ± 0.641
1.632ValCys: 1.632 ± 0.213
6.528ValAsp: 6.528 ± 0.442
2.856ValGlu: 2.856 ± 0.436
4.08ValPhe: 4.08 ± 0.856
3.672ValGly: 3.672 ± 1.3
1.632ValHis: 1.632 ± 0.86
4.488ValIle: 4.488 ± 1.071
2.04ValLys: 2.04 ± 0.428
4.08ValLeu: 4.08 ± 1.085
2.448ValMet: 2.448 ± 0.004
3.264ValAsn: 3.264 ± 0.868
3.264ValPro: 3.264 ± 0.426
2.04ValGln: 2.04 ± 1.075
4.08ValArg: 4.08 ± 1.085
4.488ValSer: 4.488 ± 0.223
4.896ValThr: 4.896 ± 0.008
6.936ValVal: 6.936 ± 2.361
0.816ValTrp: 0.816 ± 0.217
2.04ValTyr: 2.04 ± 0.219
0.0ValXaa: 0.0 ± 0.0
Trp
0.408TrpAla: 0.408 ± 0.215
0.408TrpCys: 0.408 ± 0.432
1.224TrpAsp: 1.224 ± 0.649
0.408TrpGlu: 0.408 ± 0.215
0.408TrpPhe: 0.408 ± 0.215
0.408TrpGly: 0.408 ± 0.215
0.816TrpHis: 0.816 ± 0.217
0.816TrpIle: 0.816 ± 0.217
0.0TrpLys: 0.0 ± 0.0
1.224TrpLeu: 1.224 ± 1.296
0.408TrpMet: 0.408 ± 0.215
0.816TrpAsn: 0.816 ± 0.43
0.0TrpPro: 0.0 ± 0.0
0.408TrpGln: 0.408 ± 0.432
0.408TrpArg: 0.408 ± 0.432
1.632TrpSer: 1.632 ± 0.213
1.632TrpThr: 1.632 ± 0.213
1.224TrpVal: 1.224 ± 0.002
0.0TrpTrp: 0.0 ± 0.0
1.224TrpTyr: 1.224 ± 0.002
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.856TyrAla: 2.856 ± 0.436
1.632TyrCys: 1.632 ± 0.434
2.04TyrAsp: 2.04 ± 0.428
2.856TyrGlu: 2.856 ± 0.211
4.488TyrPhe: 4.488 ± 1.071
3.672TyrGly: 3.672 ± 0.653
0.816TyrHis: 0.816 ± 0.217
4.08TyrIle: 4.08 ± 0.209
1.224TyrLys: 1.224 ± 0.002
2.448TyrLeu: 2.448 ± 0.004
0.816TyrMet: 0.816 ± 0.217
2.856TyrAsn: 2.856 ± 1.083
1.632TyrPro: 1.632 ± 0.213
2.448TyrGln: 2.448 ± 0.643
2.448TyrArg: 2.448 ± 1.945
2.04TyrSer: 2.04 ± 0.866
3.672TyrThr: 3.672 ± 1.935
3.264TyrVal: 3.264 ± 0.868
0.816TyrTrp: 0.816 ± 0.43
2.856TyrTyr: 2.856 ± 0.211
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2452 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski