Amino acid dipepetide frequency for Beihai picorna-like virus 35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.858AlaAla: 7.858 ± 1.182
0.414AlaCys: 0.414 ± 0.378
2.481AlaAsp: 2.481 ± 0.468
4.963AlaGlu: 4.963 ± 0.263
3.309AlaPhe: 3.309 ± 0.624
6.617AlaGly: 6.617 ± 1.248
2.068AlaHis: 2.068 ± 0.509
4.963AlaIle: 4.963 ± 1.462
4.136AlaLys: 4.136 ± 1.019
6.203AlaLeu: 6.203 ± 2.128
0.827AlaMet: 0.827 ± 0.444
4.963AlaAsn: 4.963 ± 0.263
2.895AlaPro: 2.895 ± 0.846
2.895AlaGln: 2.895 ± 0.246
3.722AlaArg: 3.722 ± 0.197
4.549AlaSer: 4.549 ± 1.757
5.79AlaThr: 5.79 ± 1.092
5.79AlaVal: 5.79 ± 0.107
1.241AlaTrp: 1.241 ± 0.665
0.827AlaTyr: 0.827 ± 0.156
0.0AlaXaa: 0.0 ± 0.0
Cys
0.414CysAla: 0.414 ± 0.222
0.0CysCys: 0.0 ± 0.0
0.827CysAsp: 0.827 ± 0.156
0.414CysGlu: 0.414 ± 0.378
0.827CysPhe: 0.827 ± 0.755
1.241CysGly: 1.241 ± 0.066
0.0CysHis: 0.0 ± 0.0
0.827CysIle: 0.827 ± 0.444
2.068CysLys: 2.068 ± 0.09
0.414CysLeu: 0.414 ± 0.378
1.241CysMet: 1.241 ± 0.066
1.241CysAsn: 1.241 ± 0.066
1.241CysPro: 1.241 ± 0.665
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.827CysVal: 0.827 ± 0.444
0.414CysTrp: 0.414 ± 0.222
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.203AspAla: 6.203 ± 0.87
0.0AspCys: 0.0 ± 0.0
5.376AspAsp: 5.376 ± 0.714
4.549AspGlu: 4.549 ± 0.042
3.309AspPhe: 3.309 ± 0.624
1.241AspGly: 1.241 ± 0.665
0.0AspHis: 0.0 ± 0.0
3.309AspIle: 3.309 ± 1.175
3.722AspLys: 3.722 ± 0.197
4.136AspLeu: 4.136 ± 0.419
1.654AspMet: 1.654 ± 0.288
1.654AspAsn: 1.654 ± 0.911
3.309AspPro: 3.309 ± 2.422
1.654AspGln: 1.654 ± 0.911
3.309AspArg: 3.309 ± 1.175
5.79AspSer: 5.79 ± 1.092
2.895AspThr: 2.895 ± 1.445
1.241AspVal: 1.241 ± 0.534
0.827AspTrp: 0.827 ± 0.156
2.068AspTyr: 2.068 ± 1.109
0.0AspXaa: 0.0 ± 0.0
Glu
2.068GluAla: 2.068 ± 0.509
1.654GluCys: 1.654 ± 0.288
1.654GluAsp: 1.654 ± 0.288
3.309GluGlu: 3.309 ± 1.774
2.481GluPhe: 2.481 ± 1.331
2.895GluGly: 2.895 ± 0.353
0.414GluHis: 0.414 ± 0.222
4.549GluIle: 4.549 ± 0.558
3.309GluLys: 3.309 ± 1.774
7.031GluLeu: 7.031 ± 0.426
1.241GluMet: 1.241 ± 0.066
1.241GluAsn: 1.241 ± 0.066
2.895GluPro: 2.895 ± 0.353
0.414GluGln: 0.414 ± 0.222
2.895GluArg: 2.895 ± 0.353
4.549GluSer: 4.549 ± 1.241
3.309GluThr: 3.309 ± 0.624
4.136GluVal: 4.136 ± 1.019
0.414GluTrp: 0.414 ± 0.222
2.068GluTyr: 2.068 ± 0.69
0.0GluXaa: 0.0 ± 0.0
Phe
4.549PheAla: 4.549 ± 0.641
0.414PheCys: 0.414 ± 0.378
2.481PheAsp: 2.481 ± 1.331
2.068PheGlu: 2.068 ± 0.509
2.895PhePhe: 2.895 ± 2.045
5.79PheGly: 5.79 ± 1.306
1.654PheHis: 1.654 ± 1.511
2.068PheIle: 2.068 ± 0.509
2.068PheLys: 2.068 ± 0.509
4.963PheLeu: 4.963 ± 0.263
0.827PheMet: 0.827 ± 0.444
1.654PheAsn: 1.654 ± 0.288
2.068PhePro: 2.068 ± 0.09
1.241PheGln: 1.241 ± 0.066
2.068PheArg: 2.068 ± 0.509
2.895PheSer: 2.895 ± 0.246
4.549PheThr: 4.549 ± 0.558
4.549PheVal: 4.549 ± 1.84
0.827PheTrp: 0.827 ± 0.156
2.481PheTyr: 2.481 ± 1.067
0.0PheXaa: 0.0 ± 0.0
Gly
2.895GlyAla: 2.895 ± 0.353
0.0GlyCys: 0.0 ± 0.0
3.722GlyAsp: 3.722 ± 1.002
2.481GlyGlu: 2.481 ± 1.667
1.241GlyPhe: 1.241 ± 0.665
3.309GlyGly: 3.309 ± 1.223
1.654GlyHis: 1.654 ± 0.887
5.79GlyIle: 5.79 ± 0.707
4.963GlyLys: 4.963 ± 2.661
3.309GlyLeu: 3.309 ± 1.223
2.068GlyMet: 2.068 ± 0.509
4.963GlyAsn: 4.963 ± 2.135
4.136GlyPro: 4.136 ± 0.78
2.895GlyGln: 2.895 ± 2.045
3.309GlyArg: 3.309 ± 0.624
4.549GlySer: 4.549 ± 0.558
3.722GlyThr: 3.722 ± 1.002
4.549GlyVal: 4.549 ± 0.042
2.068GlyTrp: 2.068 ± 0.69
2.895GlyTyr: 2.895 ± 0.353
0.0GlyXaa: 0.0 ± 0.0
His
1.241HisAla: 1.241 ± 0.066
0.414HisCys: 0.414 ± 0.378
0.827HisAsp: 0.827 ± 0.156
0.827HisGlu: 0.827 ± 0.444
1.654HisPhe: 1.654 ± 0.288
1.654HisGly: 1.654 ± 0.288
0.827HisHis: 0.827 ± 0.444
0.414HisIle: 0.414 ± 0.222
1.241HisLys: 1.241 ± 0.066
1.654HisLeu: 1.654 ± 0.288
1.654HisMet: 1.654 ± 0.312
0.827HisAsn: 0.827 ± 0.444
0.414HisPro: 0.414 ± 0.222
0.414HisGln: 0.414 ± 0.222
0.827HisArg: 0.827 ± 0.156
1.241HisSer: 1.241 ± 0.066
1.654HisThr: 1.654 ± 0.312
0.414HisVal: 0.414 ± 0.222
0.414HisTrp: 0.414 ± 0.222
0.827HisTyr: 0.827 ± 0.755
0.0HisXaa: 0.0 ± 0.0
Ile
7.858IleAla: 7.858 ± 0.617
0.827IleCys: 0.827 ± 0.444
3.722IleAsp: 3.722 ± 0.197
2.895IleGlu: 2.895 ± 0.353
2.481IlePhe: 2.481 ± 1.331
2.481IleGly: 2.481 ± 0.132
0.827IleHis: 0.827 ± 0.156
1.241IleIle: 1.241 ± 0.534
3.309IleLys: 3.309 ± 0.575
2.481IleLeu: 2.481 ± 0.132
2.068IleMet: 2.068 ± 0.509
2.895IleAsn: 2.895 ± 0.846
3.722IlePro: 3.722 ± 0.797
2.068IleGln: 2.068 ± 0.509
4.136IleArg: 4.136 ± 0.18
5.79IleSer: 5.79 ± 0.492
4.963IleThr: 4.963 ± 0.863
2.895IleVal: 2.895 ± 0.246
0.414IleTrp: 0.414 ± 0.222
2.068IleTyr: 2.068 ± 0.09
0.0IleXaa: 0.0 ± 0.0
Lys
2.895LysAla: 2.895 ± 0.953
1.241LysCys: 1.241 ± 0.665
3.309LysAsp: 3.309 ± 0.575
4.549LysGlu: 4.549 ± 1.84
4.136LysPhe: 4.136 ± 0.18
2.895LysGly: 2.895 ± 0.953
2.068LysHis: 2.068 ± 1.109
3.722LysIle: 3.722 ± 1.397
3.309LysLys: 3.309 ± 1.175
5.376LysLeu: 5.376 ± 0.485
0.827LysMet: 0.827 ± 0.156
3.309LysAsn: 3.309 ± 0.624
3.309LysPro: 3.309 ± 0.624
0.827LysGln: 0.827 ± 0.156
1.241LysArg: 1.241 ± 0.534
5.376LysSer: 5.376 ± 1.684
2.068LysThr: 2.068 ± 0.509
4.963LysVal: 4.963 ± 2.661
0.414LysTrp: 0.414 ± 0.222
2.895LysTyr: 2.895 ± 0.953
0.0LysXaa: 0.0 ± 0.0
Leu
8.271LeuAla: 8.271 ± 0.239
2.068LeuCys: 2.068 ± 0.09
3.722LeuAsp: 3.722 ± 0.797
2.895LeuGlu: 2.895 ± 0.353
6.617LeuPhe: 6.617 ± 1.75
3.309LeuGly: 3.309 ± 0.624
2.068LeuHis: 2.068 ± 0.69
4.963LeuIle: 4.963 ± 2.062
6.203LeuLys: 6.203 ± 1.528
6.617LeuLeu: 6.617 ± 1.75
1.654LeuMet: 1.654 ± 1.011
4.963LeuAsn: 4.963 ± 0.936
4.549LeuPro: 4.549 ± 1.757
3.309LeuGln: 3.309 ± 0.575
4.136LeuArg: 4.136 ± 1.618
5.79LeuSer: 5.79 ± 1.906
4.136LeuThr: 4.136 ± 1.979
5.79LeuVal: 5.79 ± 0.707
0.827LeuTrp: 0.827 ± 0.755
2.481LeuTyr: 2.481 ± 0.132
0.0LeuXaa: 0.0 ± 0.0
Met
2.068MetAla: 2.068 ± 0.509
0.414MetCys: 0.414 ± 0.222
0.414MetAsp: 0.414 ± 0.378
1.654MetGlu: 1.654 ± 0.887
0.414MetPhe: 0.414 ± 0.222
1.654MetGly: 1.654 ± 0.312
0.0MetHis: 0.0 ± 0.0
1.241MetIle: 1.241 ± 0.066
1.654MetLys: 1.654 ± 0.312
2.481MetLeu: 2.481 ± 1.331
0.827MetMet: 0.827 ± 0.444
1.241MetAsn: 1.241 ± 0.665
2.068MetPro: 2.068 ± 0.509
0.827MetGln: 0.827 ± 0.156
2.068MetArg: 2.068 ± 0.09
2.895MetSer: 2.895 ± 0.353
2.068MetThr: 2.068 ± 0.09
2.481MetVal: 2.481 ± 0.132
0.414MetTrp: 0.414 ± 0.378
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.481AsnAla: 2.481 ± 0.731
0.0AsnCys: 0.0 ± 0.0
2.481AsnAsp: 2.481 ± 1.067
2.068AsnGlu: 2.068 ± 0.09
2.895AsnPhe: 2.895 ± 0.353
3.722AsnGly: 3.722 ± 1.601
1.241AsnHis: 1.241 ± 0.665
2.481AsnIle: 2.481 ± 0.468
1.654AsnLys: 1.654 ± 0.288
2.481AsnLeu: 2.481 ± 0.468
2.481AsnMet: 2.481 ± 0.731
0.827AsnAsn: 0.827 ± 0.444
2.895AsnPro: 2.895 ± 2.045
0.827AsnGln: 0.827 ± 0.156
2.068AsnArg: 2.068 ± 0.09
5.376AsnSer: 5.376 ± 1.313
3.722AsnThr: 3.722 ± 0.402
4.963AsnVal: 4.963 ± 0.263
0.827AsnTrp: 0.827 ± 0.156
2.481AsnTyr: 2.481 ± 1.667
0.0AsnXaa: 0.0 ± 0.0
Pro
2.481ProAla: 2.481 ± 1.667
0.0ProCys: 0.0 ± 0.0
3.309ProAsp: 3.309 ± 0.024
2.481ProGlu: 2.481 ± 0.132
3.309ProPhe: 3.309 ± 1.223
2.068ProGly: 2.068 ± 0.509
1.241ProHis: 1.241 ± 0.534
2.481ProIle: 2.481 ± 0.468
2.068ProLys: 2.068 ± 1.109
6.203ProLeu: 6.203 ± 1.528
1.654ProMet: 1.654 ± 0.312
1.654ProAsn: 1.654 ± 0.911
2.068ProPro: 2.068 ± 0.69
1.241ProGln: 1.241 ± 0.534
2.068ProArg: 2.068 ± 1.109
2.068ProSer: 2.068 ± 0.69
4.963ProThr: 4.963 ± 2.135
2.481ProVal: 2.481 ± 0.468
0.827ProTrp: 0.827 ± 0.755
2.068ProTyr: 2.068 ± 0.69
0.0ProXaa: 0.0 ± 0.0
Gln
3.309GlnAla: 3.309 ± 0.624
0.414GlnCys: 0.414 ± 0.378
2.068GlnAsp: 2.068 ± 1.889
0.827GlnGlu: 0.827 ± 0.156
2.481GlnPhe: 2.481 ± 1.331
2.895GlnGly: 2.895 ± 0.846
0.414GlnHis: 0.414 ± 0.378
2.068GlnIle: 2.068 ± 1.289
2.481GlnLys: 2.481 ± 0.132
3.309GlnLeu: 3.309 ± 0.024
0.0GlnMet: 0.0 ± 0.0
1.654GlnAsn: 1.654 ± 0.312
0.0GlnPro: 0.0 ± 0.0
0.827GlnGln: 0.827 ± 0.156
1.241GlnArg: 1.241 ± 0.665
6.617GlnSer: 6.617 ± 0.648
2.481GlnThr: 2.481 ± 0.468
1.654GlnVal: 1.654 ± 0.312
0.0GlnTrp: 0.0 ± 0.0
0.827GlnTyr: 0.827 ± 0.444
0.0GlnXaa: 0.0 ± 0.0
Arg
5.376ArgAla: 5.376 ± 0.714
0.0ArgCys: 0.0 ± 0.0
2.481ArgAsp: 2.481 ± 0.132
4.549ArgGlu: 4.549 ± 1.241
2.481ArgPhe: 2.481 ± 0.132
2.068ArgGly: 2.068 ± 0.509
1.241ArgHis: 1.241 ± 0.665
2.481ArgIle: 2.481 ± 0.132
2.895ArgLys: 2.895 ± 0.953
1.654ArgLeu: 1.654 ± 0.288
1.654ArgMet: 1.654 ± 0.312
2.068ArgAsn: 2.068 ± 0.509
1.241ArgPro: 1.241 ± 0.534
3.309ArgGln: 3.309 ± 0.624
2.895ArgArg: 2.895 ± 0.246
3.309ArgSer: 3.309 ± 0.575
2.068ArgThr: 2.068 ± 0.09
4.136ArgVal: 4.136 ± 0.419
0.827ArgTrp: 0.827 ± 0.755
1.241ArgTyr: 1.241 ± 0.066
0.0ArgXaa: 0.0 ± 0.0
Ser
5.79SerAla: 5.79 ± 0.107
0.827SerCys: 0.827 ± 0.444
5.376SerAsp: 5.376 ± 1.913
3.722SerGlu: 3.722 ± 1.397
4.136SerPhe: 4.136 ± 0.419
6.203SerGly: 6.203 ± 0.329
0.827SerHis: 0.827 ± 0.156
5.79SerIle: 5.79 ± 0.492
4.963SerLys: 4.963 ± 0.263
7.031SerLeu: 7.031 ± 0.426
1.241SerMet: 1.241 ± 0.297
4.136SerAsn: 4.136 ± 0.419
4.963SerPro: 4.963 ± 0.336
4.549SerGln: 4.549 ± 0.558
5.79SerArg: 5.79 ± 2.291
5.79SerSer: 5.79 ± 1.691
4.136SerThr: 4.136 ± 1.379
2.895SerVal: 2.895 ± 0.353
1.654SerTrp: 1.654 ± 0.312
2.481SerTyr: 2.481 ± 0.132
0.0SerXaa: 0.0 ± 0.0
Thr
4.549ThrAla: 4.549 ± 1.158
0.827ThrCys: 0.827 ± 0.156
2.895ThrAsp: 2.895 ± 0.846
4.136ThrGlu: 4.136 ± 0.18
1.654ThrPhe: 1.654 ± 0.288
4.963ThrGly: 4.963 ± 3.933
0.827ThrHis: 0.827 ± 0.444
3.309ThrIle: 3.309 ± 0.624
2.481ThrLys: 2.481 ± 0.132
7.858ThrLeu: 7.858 ± 1.816
2.068ThrMet: 2.068 ± 0.09
2.895ThrAsn: 2.895 ± 1.445
0.827ThrPro: 0.827 ± 0.156
2.895ThrGln: 2.895 ± 0.246
2.068ThrArg: 2.068 ± 0.09
6.203ThrSer: 6.203 ± 2.069
1.241ThrThr: 1.241 ± 0.066
5.376ThrVal: 5.376 ± 0.714
0.827ThrTrp: 0.827 ± 0.755
2.481ThrTyr: 2.481 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
4.136ValAla: 4.136 ± 0.419
0.0ValCys: 0.0 ± 0.0
2.895ValAsp: 2.895 ± 0.353
3.309ValGlu: 3.309 ± 0.575
2.068ValPhe: 2.068 ± 0.509
5.376ValGly: 5.376 ± 0.114
0.827ValHis: 0.827 ± 0.444
6.617ValIle: 6.617 ± 0.049
3.722ValLys: 3.722 ± 1.397
5.79ValLeu: 5.79 ± 0.107
1.241ValMet: 1.241 ± 0.665
2.895ValAsn: 2.895 ± 0.353
3.309ValPro: 3.309 ± 0.575
2.895ValGln: 2.895 ± 0.353
2.895ValArg: 2.895 ± 0.953
6.203ValSer: 6.203 ± 0.27
4.549ValThr: 4.549 ± 0.558
5.79ValVal: 5.79 ± 1.306
1.241ValTrp: 1.241 ± 0.665
0.827ValTyr: 0.827 ± 0.156
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.378
0.0TrpCys: 0.0 ± 0.0
2.481TrpAsp: 2.481 ± 0.468
0.414TrpGlu: 0.414 ± 0.222
0.827TrpPhe: 0.827 ± 0.755
1.241TrpGly: 1.241 ± 1.133
0.0TrpHis: 0.0 ± 0.0
0.414TrpIle: 0.414 ± 0.222
1.654TrpLys: 1.654 ± 0.312
2.068TrpLeu: 2.068 ± 0.09
0.414TrpMet: 0.414 ± 0.222
1.654TrpAsn: 1.654 ± 0.911
0.0TrpPro: 0.0 ± 0.0
1.241TrpGln: 1.241 ± 0.534
0.0TrpArg: 0.0 ± 0.0
0.414TrpSer: 0.414 ± 0.222
0.827TrpThr: 0.827 ± 0.444
0.0TrpVal: 0.0 ± 0.0
0.414TrpTrp: 0.414 ± 0.222
1.241TrpTyr: 1.241 ± 0.665
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.654TyrAla: 1.654 ± 0.288
2.481TyrCys: 2.481 ± 0.132
3.722TyrAsp: 3.722 ± 0.797
0.414TyrGlu: 0.414 ± 0.222
2.895TyrPhe: 2.895 ± 0.246
2.895TyrGly: 2.895 ± 0.846
1.241TyrHis: 1.241 ± 0.534
0.414TyrIle: 0.414 ± 0.222
0.827TyrLys: 0.827 ± 0.156
3.722TyrLeu: 3.722 ± 0.197
0.827TyrMet: 0.827 ± 0.755
1.241TyrAsn: 1.241 ± 0.534
0.827TyrPro: 0.827 ± 0.444
1.241TyrGln: 1.241 ± 0.534
1.241TyrArg: 1.241 ± 0.066
2.895TyrSer: 2.895 ± 0.846
1.241TyrThr: 1.241 ± 0.066
1.654TyrVal: 1.654 ± 0.288
0.827TyrTrp: 0.827 ± 0.156
2.068TyrTyr: 2.068 ± 0.09
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski