Amino acid dipeptide frequency for Wenzhou picorna-like virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
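
The calculation described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the counting convention (overlapping dipeptides, pooled over all proteins), the one-letter input alphabet, and the names `dipeptide_per_mille`, `classify`, and `UNIFORM_EXPECTATION` are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return the frequency (in per mille) of each of the 441 dipeptides.

    Assumption: overlapping dipeptides are counted within each protein and
    pooled over all proteins; any non-standard letter is mapped to X.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTATION:
        return "over"    # would be marked red
    if freq < UNIFORM_EXPECTATION:
        return "under"   # would be marked blue
    return "expected"
```

For example, `dipeptide_per_mille(["ACAC", "AB"])` counts the dipeptides AC, CA, AC and AX (B is treated as Xaa), so AC gets 500‰ and is classified as "over".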

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 4.549‰ = 0.004549).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.549AlaAla: 4.549 ± 1.98
1.654AlaCys: 1.654 ± 0.221
2.068AlaAsp: 2.068 ± 0.428
2.068AlaGlu: 2.068 ± 0.428
2.895AlaPhe: 2.895 ± 1.592
4.963AlaGly: 4.963 ± 1.27
0.827AlaHis: 0.827 ± 0.194
3.722AlaIle: 3.722 ± 0.04
4.963AlaLys: 4.963 ± 0.053
6.203AlaLeu: 6.203 ± 1.283
2.068AlaMet: 2.068 ± 0.79
4.136AlaAsn: 4.136 ± 0.97
2.895AlaPro: 2.895 ± 1.592
0.414AlaGln: 0.414 ± 0.207
4.136AlaArg: 4.136 ± 1.464
6.203AlaSer: 6.203 ± 0.066
6.203AlaThr: 6.203 ± 4.803
3.309AlaVal: 3.309 ± 0.776
1.241AlaTrp: 1.241 ± 0.013
2.068AlaTyr: 2.068 ± 1.036
0.0AlaXaa: 0.0 ± 0.0
Cys
2.068CysAla: 2.068 ± 0.181
0.827CysCys: 0.827 ± 0.415
0.0CysAsp: 0.0 ± 0.0
0.827CysGlu: 0.827 ± 0.415
0.414CysPhe: 0.414 ± 0.207
1.241CysGly: 1.241 ± 0.622
0.0CysHis: 0.0 ± 0.0
0.414CysIle: 0.414 ± 0.207
2.481CysLys: 2.481 ± 1.244
2.068CysLeu: 2.068 ± 0.428
0.414CysMet: 0.414 ± 0.207
1.241CysAsn: 1.241 ± 0.013
0.827CysPro: 0.827 ± 0.194
0.0CysGln: 0.0 ± 0.0
0.827CysArg: 0.827 ± 0.415
0.827CysSer: 0.827 ± 0.415
0.0CysThr: 0.0 ± 0.0
0.827CysVal: 0.827 ± 0.415
0.0CysTrp: 0.0 ± 0.0
1.654CysTyr: 1.654 ± 0.829
0.0CysXaa: 0.0 ± 0.0
Asp
2.481AspAla: 2.481 ± 0.026
0.827AspCys: 0.827 ± 0.415
6.203AspAsp: 6.203 ± 0.543
2.481AspGlu: 2.481 ± 0.026
4.136AspPhe: 4.136 ± 0.362
2.068AspGly: 2.068 ± 0.428
0.414AspHis: 0.414 ± 0.207
4.136AspIle: 4.136 ± 0.247
4.136AspLys: 4.136 ± 0.856
6.203AspLeu: 6.203 ± 1.283
1.241AspMet: 1.241 ± 0.013
1.654AspAsn: 1.654 ± 0.221
5.376AspPro: 5.376 ± 3.392
0.827AspGln: 0.827 ± 0.415
3.309AspArg: 3.309 ± 0.441
4.136AspSer: 4.136 ± 0.856
2.481AspThr: 2.481 ± 0.026
2.895AspVal: 2.895 ± 0.234
0.827AspTrp: 0.827 ± 0.415
3.722AspTyr: 3.722 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
4.136GluAla: 4.136 ± 0.247
0.827GluCys: 0.827 ± 0.415
3.309GluAsp: 3.309 ± 0.168
2.895GluGlu: 2.895 ± 1.451
4.136GluPhe: 4.136 ± 0.856
2.895GluGly: 2.895 ± 0.375
1.654GluHis: 1.654 ± 0.221
3.722GluIle: 3.722 ± 1.257
4.136GluLys: 4.136 ± 0.856
6.203GluLeu: 6.203 ± 0.066
1.654GluMet: 1.654 ± 0.829
1.241GluAsn: 1.241 ± 0.622
3.722GluPro: 3.722 ± 1.257
1.654GluGln: 1.654 ± 0.829
1.654GluArg: 1.654 ± 0.829
2.481GluSer: 2.481 ± 0.582
4.549GluThr: 4.549 ± 0.763
4.136GluVal: 4.136 ± 0.97
1.654GluTrp: 1.654 ± 0.221
2.895GluTyr: 2.895 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
3.722PheAla: 3.722 ± 0.04
0.827PheCys: 0.827 ± 0.194
2.068PheAsp: 2.068 ± 0.428
4.136PheGlu: 4.136 ± 0.362
2.481PhePhe: 2.481 ± 0.582
3.309PheGly: 3.309 ± 0.441
1.654PheHis: 1.654 ± 0.388
1.241PheIle: 1.241 ± 0.013
2.068PheLys: 2.068 ± 0.181
6.203PheLeu: 6.203 ± 1.892
2.068PheMet: 2.068 ± 0.79
2.068PheAsn: 2.068 ± 0.79
1.241PhePro: 1.241 ± 0.013
2.481PheGln: 2.481 ± 1.191
2.481PheArg: 2.481 ± 0.582
3.309PheSer: 3.309 ± 0.168
4.136PheThr: 4.136 ± 0.362
2.895PheVal: 2.895 ± 0.375
0.827PheTrp: 0.827 ± 0.803
2.481PheTyr: 2.481 ± 0.635
0.0PheXaa: 0.0 ± 0.0
Gly
4.963GlyAla: 4.963 ± 1.164
0.827GlyCys: 0.827 ± 0.415
4.549GlyAsp: 4.549 ± 0.763
4.136GlyGlu: 4.136 ± 0.97
2.895GlyPhe: 2.895 ± 0.234
3.309GlyGly: 3.309 ± 0.441
0.827GlyHis: 0.827 ± 0.415
3.722GlyIle: 3.722 ± 0.04
5.79GlyLys: 5.79 ± 2.293
3.722GlyLeu: 3.722 ± 0.04
2.068GlyMet: 2.068 ± 0.181
4.549GlyAsn: 4.549 ± 1.372
2.481GlyPro: 2.481 ± 0.582
2.481GlyGln: 2.481 ± 0.635
2.895GlyArg: 2.895 ± 1.592
5.79GlySer: 5.79 ± 0.75
3.722GlyThr: 3.722 ± 0.648
4.136GlyVal: 4.136 ± 1.464
0.0GlyTrp: 0.0 ± 0.0
1.654GlyTyr: 1.654 ± 0.997
0.0GlyXaa: 0.0 ± 0.0
His
0.827HisAla: 0.827 ± 0.803
0.0HisCys: 0.0 ± 0.0
0.827HisAsp: 0.827 ± 0.415
1.241HisGlu: 1.241 ± 0.013
1.241HisPhe: 1.241 ± 0.622
2.481HisGly: 2.481 ± 1.244
0.0HisHis: 0.0 ± 0.0
2.481HisIle: 2.481 ± 0.635
1.654HisLys: 1.654 ± 0.221
2.481HisLeu: 2.481 ± 0.635
0.0HisMet: 0.0 ± 0.0
0.827HisAsn: 0.827 ± 0.415
1.241HisPro: 1.241 ± 0.013
0.827HisGln: 0.827 ± 0.803
0.0HisArg: 0.0 ± 0.0
1.241HisSer: 1.241 ± 0.013
0.827HisThr: 0.827 ± 0.194
2.481HisVal: 2.481 ± 0.026
1.241HisTrp: 1.241 ± 0.013
1.241HisTyr: 1.241 ± 0.595
0.0HisXaa: 0.0 ± 0.0
Ile
5.376IleAla: 5.376 ± 0.869
2.068IleCys: 2.068 ± 0.428
5.376IleAsp: 5.376 ± 0.869
4.549IleGlu: 4.549 ± 1.672
2.481IlePhe: 2.481 ± 0.635
4.963IleGly: 4.963 ± 0.053
0.414IleHis: 0.414 ± 0.401
3.309IleIle: 3.309 ± 0.441
3.309IleLys: 3.309 ± 0.168
3.309IleLeu: 3.309 ± 1.05
1.241IleMet: 1.241 ± 0.013
1.654IleAsn: 1.654 ± 0.388
0.414IlePro: 0.414 ± 0.401
0.827IleGln: 0.827 ± 0.194
0.827IleArg: 0.827 ± 0.194
6.203IleSer: 6.203 ± 0.675
2.895IleThr: 2.895 ± 0.234
3.309IleVal: 3.309 ± 0.776
0.414IleTrp: 0.414 ± 0.207
2.068IleTyr: 2.068 ± 0.181
0.0IleXaa: 0.0 ± 0.0
Lys
2.481LysAla: 2.481 ± 0.635
0.827LysCys: 0.827 ± 0.415
2.068LysAsp: 2.068 ± 0.428
2.895LysGlu: 2.895 ± 1.451
3.722LysPhe: 3.722 ± 0.04
3.309LysGly: 3.309 ± 0.776
1.654LysHis: 1.654 ± 0.829
3.722LysIle: 3.722 ± 1.178
2.481LysLys: 2.481 ± 1.244
5.79LysLeu: 5.79 ± 1.685
1.654LysMet: 1.654 ± 0.221
3.309LysAsn: 3.309 ± 1.05
4.136LysPro: 4.136 ± 0.856
3.309LysGln: 3.309 ± 0.168
3.722LysArg: 3.722 ± 0.04
5.376LysSer: 5.376 ± 2.086
0.827LysThr: 0.827 ± 0.415
3.309LysVal: 3.309 ± 0.168
0.0LysTrp: 0.0 ± 0.0
1.654LysTyr: 1.654 ± 0.829
0.0LysXaa: 0.0 ± 0.0
Leu
4.136LeuAla: 4.136 ± 0.247
2.481LeuCys: 2.481 ± 0.635
3.722LeuAsp: 3.722 ± 0.04
2.895LeuGlu: 2.895 ± 0.234
2.895LeuPhe: 2.895 ± 0.234
4.549LeuGly: 4.549 ± 0.154
3.722LeuHis: 3.722 ± 0.648
5.376LeuIle: 5.376 ± 2.086
4.136LeuLys: 4.136 ± 0.856
6.203LeuLeu: 6.203 ± 0.675
0.414LeuMet: 0.414 ± 0.344
5.376LeuAsn: 5.376 ± 0.957
6.203LeuPro: 6.203 ± 0.066
2.068LeuGln: 2.068 ± 0.428
4.549LeuArg: 4.549 ± 1.063
6.617LeuSer: 6.617 ± 0.273
9.512LeuThr: 9.512 ± 0.71
6.617LeuVal: 6.617 ± 2.099
1.241LeuTrp: 1.241 ± 0.595
2.895LeuTyr: 2.895 ± 0.375
0.0LeuXaa: 0.0 ± 0.0
Met
3.309MetAla: 3.309 ± 1.994
0.827MetCys: 0.827 ± 0.415
0.414MetAsp: 0.414 ± 0.207
2.895MetGlu: 2.895 ± 0.234
0.827MetPhe: 0.827 ± 0.415
1.241MetGly: 1.241 ± 0.013
0.827MetHis: 0.827 ± 0.194
1.654MetIle: 1.654 ± 0.221
2.481MetLys: 2.481 ± 0.026
2.895MetLeu: 2.895 ± 0.375
0.827MetMet: 0.827 ± 0.415
0.827MetAsn: 0.827 ± 0.803
2.068MetPro: 2.068 ± 0.181
0.0MetGln: 0.0 ± 0.0
1.654MetArg: 1.654 ± 0.388
2.895MetSer: 2.895 ± 0.234
2.481MetThr: 2.481 ± 0.026
0.827MetVal: 0.827 ± 0.415
0.414MetTrp: 0.414 ± 0.207
0.414MetTyr: 0.414 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
2.481AsnAla: 2.481 ± 1.191
0.827AsnCys: 0.827 ± 0.415
2.895AsnAsp: 2.895 ± 0.984
4.136AsnGlu: 4.136 ± 0.362
1.654AsnPhe: 1.654 ± 0.388
5.79AsnGly: 5.79 ± 0.75
0.827AsnHis: 0.827 ± 0.194
2.481AsnIle: 2.481 ± 0.635
1.654AsnLys: 1.654 ± 0.221
2.895AsnLeu: 2.895 ± 0.234
1.241AsnMet: 1.241 ± 0.013
1.241AsnAsn: 1.241 ± 0.595
2.068AsnPro: 2.068 ± 0.79
1.241AsnGln: 1.241 ± 0.013
1.241AsnArg: 1.241 ± 0.013
3.309AsnSer: 3.309 ± 0.441
2.895AsnThr: 2.895 ± 1.592
4.963AsnVal: 4.963 ± 1.164
1.241AsnTrp: 1.241 ± 0.013
1.654AsnTyr: 1.654 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
1.654ProAla: 1.654 ± 0.997
0.0ProCys: 0.0 ± 0.0
4.963ProAsp: 4.963 ± 0.556
2.895ProGlu: 2.895 ± 0.842
3.722ProPhe: 3.722 ± 1.178
1.654ProGly: 1.654 ± 0.388
2.481ProHis: 2.481 ± 0.635
2.895ProIle: 2.895 ± 0.234
0.827ProLys: 0.827 ± 0.415
5.376ProLeu: 5.376 ± 1.477
1.241ProMet: 1.241 ± 1.204
2.481ProAsn: 2.481 ± 0.026
2.895ProPro: 2.895 ± 0.984
3.309ProGln: 3.309 ± 1.385
1.241ProArg: 1.241 ± 0.595
1.654ProSer: 1.654 ± 0.997
3.722ProThr: 3.722 ± 0.569
4.549ProVal: 4.549 ± 0.763
1.654ProTrp: 1.654 ± 0.221
2.068ProTyr: 2.068 ± 1.398
0.0ProXaa: 0.0 ± 0.0
Gln
0.827GlnAla: 0.827 ± 0.415
0.414GlnCys: 0.414 ± 0.207
2.481GlnAsp: 2.481 ± 0.026
2.068GlnGlu: 2.068 ± 0.181
1.241GlnPhe: 1.241 ± 0.622
2.895GlnGly: 2.895 ± 0.375
1.241GlnHis: 1.241 ± 0.013
1.654GlnIle: 1.654 ± 0.221
2.068GlnLys: 2.068 ± 1.036
1.241GlnLeu: 1.241 ± 0.622
1.654GlnMet: 1.654 ± 0.388
2.481GlnAsn: 2.481 ± 0.026
0.827GlnPro: 0.827 ± 0.194
0.414GlnGln: 0.414 ± 0.401
2.068GlnArg: 2.068 ± 0.181
2.895GlnSer: 2.895 ± 1.592
1.241GlnThr: 1.241 ± 0.622
2.895GlnVal: 2.895 ± 0.984
0.414GlnTrp: 0.414 ± 0.207
0.414GlnTyr: 0.414 ± 0.207
0.0GlnXaa: 0.0 ± 0.0
Arg
4.963ArgAla: 4.963 ± 1.27
1.241ArgCys: 1.241 ± 0.622
3.722ArgAsp: 3.722 ± 0.04
2.481ArgGlu: 2.481 ± 0.026
2.895ArgPhe: 2.895 ± 0.375
2.068ArgGly: 2.068 ± 0.79
1.654ArgHis: 1.654 ± 0.388
2.481ArgIle: 2.481 ± 0.635
2.895ArgLys: 2.895 ± 1.451
2.895ArgLeu: 2.895 ± 0.234
1.654ArgMet: 1.654 ± 0.388
1.241ArgAsn: 1.241 ± 0.622
2.895ArgPro: 2.895 ± 0.375
0.827ArgGln: 0.827 ± 0.415
2.895ArgArg: 2.895 ± 0.234
0.414ArgSer: 0.414 ± 0.207
1.241ArgThr: 1.241 ± 1.204
3.722ArgVal: 3.722 ± 0.569
0.414ArgTrp: 0.414 ± 0.401
2.481ArgTyr: 2.481 ± 0.635
0.0ArgXaa: 0.0 ± 0.0
Ser
4.963SerAla: 4.963 ± 0.556
0.414SerCys: 0.414 ± 0.207
3.722SerAsp: 3.722 ± 0.648
5.79SerGlu: 5.79 ± 1.076
2.895SerPhe: 2.895 ± 1.592
4.963SerGly: 4.963 ± 1.164
1.241SerHis: 1.241 ± 0.622
3.722SerIle: 3.722 ± 0.04
4.963SerLys: 4.963 ± 0.053
8.685SerLeu: 8.685 ± 1.733
2.068SerMet: 2.068 ± 0.181
2.068SerAsn: 2.068 ± 0.428
3.722SerPro: 3.722 ± 0.648
2.068SerGln: 2.068 ± 0.181
2.481SerArg: 2.481 ± 0.582
4.136SerSer: 4.136 ± 0.247
4.136SerThr: 4.136 ± 0.97
4.549SerVal: 4.549 ± 1.063
1.241SerTrp: 1.241 ± 0.622
2.895SerTyr: 2.895 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
5.376ThrAla: 5.376 ± 2.174
0.827ThrCys: 0.827 ± 0.415
4.549ThrAsp: 4.549 ± 0.454
2.895ThrGlu: 2.895 ± 0.375
3.722ThrPhe: 3.722 ± 0.04
5.79ThrGly: 5.79 ± 0.141
1.241ThrHis: 1.241 ± 0.013
3.309ThrIle: 3.309 ± 1.385
1.654ThrLys: 1.654 ± 0.221
4.549ThrLeu: 4.549 ± 2.589
2.481ThrMet: 2.481 ± 0.026
3.309ThrAsn: 3.309 ± 1.994
2.068ThrPro: 2.068 ± 0.79
2.895ThrGln: 2.895 ± 0.842
3.722ThrArg: 3.722 ± 0.569
5.376ThrSer: 5.376 ± 0.957
7.031ThrThr: 7.031 ± 1.954
4.549ThrVal: 4.549 ± 0.763
0.0ThrTrp: 0.0 ± 0.0
1.654ThrTyr: 1.654 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
4.963ValAla: 4.963 ± 0.053
0.0ValCys: 0.0 ± 0.0
3.722ValAsp: 3.722 ± 0.648
4.963ValGlu: 4.963 ± 1.27
4.136ValPhe: 4.136 ± 2.188
3.309ValGly: 3.309 ± 1.385
1.654ValHis: 1.654 ± 0.221
2.895ValIle: 2.895 ± 0.842
2.481ValLys: 2.481 ± 0.026
5.376ValLeu: 5.376 ± 0.869
3.722ValMet: 3.722 ± 0.867
2.895ValAsn: 2.895 ± 0.984
4.963ValPro: 4.963 ± 1.164
3.309ValGln: 3.309 ± 0.441
2.481ValArg: 2.481 ± 1.244
3.722ValSer: 3.722 ± 1.178
4.136ValThr: 4.136 ± 0.97
4.963ValVal: 4.963 ± 0.556
0.0ValTrp: 0.0 ± 0.0
3.309ValTyr: 3.309 ± 0.168
0.0ValXaa: 0.0 ± 0.0
Trp
0.827TrpAla: 0.827 ± 0.194
0.0TrpCys: 0.0 ± 0.0
1.241TrpAsp: 1.241 ± 0.013
0.414TrpGlu: 0.414 ± 0.207
0.827TrpPhe: 0.827 ± 0.415
0.414TrpGly: 0.414 ± 0.207
0.414TrpHis: 0.414 ± 0.207
0.414TrpIle: 0.414 ± 0.207
0.414TrpLys: 0.414 ± 0.401
0.827TrpLeu: 0.827 ± 0.194
0.414TrpMet: 0.414 ± 0.207
2.068TrpAsn: 2.068 ± 0.181
0.0TrpPro: 0.0 ± 0.0
0.414TrpGln: 0.414 ± 0.207
0.827TrpArg: 0.827 ± 0.415
0.827TrpSer: 0.827 ± 0.194
2.068TrpThr: 2.068 ± 0.181
0.827TrpVal: 0.827 ± 0.194
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.481TyrAla: 2.481 ± 1.244
1.241TyrCys: 1.241 ± 0.013
1.654TyrAsp: 1.654 ± 0.221
2.895TyrGlu: 2.895 ± 0.234
2.481TyrPhe: 2.481 ± 0.635
3.309TyrGly: 3.309 ± 0.168
0.414TyrHis: 0.414 ± 0.401
1.654TyrIle: 1.654 ± 0.997
1.654TyrLys: 1.654 ± 0.388
2.895TyrLeu: 2.895 ± 0.375
1.241TyrMet: 1.241 ± 0.013
2.068TyrAsn: 2.068 ± 0.79
1.241TyrPro: 1.241 ± 0.622
2.068TyrGln: 2.068 ± 0.181
2.068TyrArg: 2.068 ± 0.428
3.309TyrSer: 3.309 ± 0.168
2.895TyrThr: 2.895 ± 0.234
1.241TyrVal: 1.241 ± 0.013
0.414TyrTrp: 0.414 ± 0.207
1.654TyrTyr: 1.654 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2419 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
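
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the source says only that 100 bootstrap replicates were drawn at the protein level, so taking the error as the standard deviation across replicates, the pooled overlapping counting, and the names `dipeptide_per_mille` and `bootstrap_error` are hypothetical choices for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Pooled overlapping dipeptide frequencies, in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, replicates=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Each replicate draws len(sequences) proteins with replacement and
    recomputes the frequencies; the error is taken here as the population
    standard deviation across replicates (an assumption).
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(replicates):
        resampled = rng.choices(sequences, k=len(sequences))
        samples.append(dipeptide_per_mille(resampled))
    pairs = set().union(*samples)  # every dipeptide seen in any replicate
    return {pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
            for pair in pairs}
```

With only 2 proteins, as in this proteome, each replicate can contain just one of three distinct protein combinations, so errors estimated this way are coarse.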

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski