Amino acid dipepetide frequency for Beihai picorna-like virus 121

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.637AlaAla: 8.637 ± 4.739
0.376AlaCys: 0.376 ± 1.53
4.131AlaAsp: 4.131 ± 0.087
6.008AlaGlu: 6.008 ± 2.586
3.755AlaPhe: 3.755 ± 1.766
5.257AlaGly: 5.257 ± 2.263
2.253AlaHis: 2.253 ± 0.97
5.257AlaIle: 5.257 ± 0.572
5.633AlaLys: 5.633 ± 2.425
7.886AlaLeu: 7.886 ± 5.063
1.502AlaMet: 1.502 ± 0.647
3.38AlaAsn: 3.38 ± 1.455
6.008AlaPro: 6.008 ± 2.586
3.004AlaGln: 3.004 ± 1.293
2.253AlaArg: 2.253 ± 0.722
7.135AlaSer: 7.135 ± 1.38
6.008AlaThr: 6.008 ± 0.797
4.506AlaVal: 4.506 ± 1.94
1.127AlaTrp: 1.127 ± 1.207
3.004AlaTyr: 3.004 ± 0.398
0.0AlaXaa: 0.0 ± 0.0
Cys
0.751CysAla: 0.751 ± 0.323
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.127CysGlu: 1.127 ± 0.485
0.0CysPhe: 0.0 ± 0.0
1.502CysGly: 1.502 ± 1.045
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.878CysLys: 1.878 ± 0.883
0.376CysLeu: 0.376 ± 1.53
0.0CysMet: 0.0 ± 0.0
0.751CysAsn: 0.751 ± 1.368
1.127CysPro: 1.127 ± 0.485
0.751CysGln: 0.751 ± 1.368
0.751CysArg: 0.751 ± 1.368
1.878CysSer: 1.878 ± 0.883
0.376CysThr: 0.376 ± 0.162
2.253CysVal: 2.253 ± 0.722
0.0CysTrp: 0.0 ± 0.0
0.751CysTyr: 0.751 ± 3.06
0.0CysXaa: 0.0 ± 0.0
Asp
2.629AspAla: 2.629 ± 1.132
0.751AspCys: 0.751 ± 1.368
4.131AspAsp: 4.131 ± 1.778
3.004AspGlu: 3.004 ± 0.398
4.506AspPhe: 4.506 ± 1.443
3.004AspGly: 3.004 ± 1.293
1.878AspHis: 1.878 ± 0.883
4.882AspIle: 4.882 ± 2.102
2.629AspLys: 2.629 ± 1.132
4.882AspLeu: 4.882 ± 0.41
1.127AspMet: 1.127 ± 0.315
1.502AspAsn: 1.502 ± 0.647
3.755AspPro: 3.755 ± 1.617
3.004AspGln: 3.004 ± 3.781
2.629AspArg: 2.629 ± 0.56
4.882AspSer: 4.882 ± 1.281
3.004AspThr: 3.004 ± 1.293
4.506AspVal: 4.506 ± 0.248
0.751AspTrp: 0.751 ± 0.323
1.878AspTyr: 1.878 ± 0.808
0.0AspXaa: 0.0 ± 0.0
Glu
4.506GluAla: 4.506 ± 0.248
1.878GluCys: 1.878 ± 2.575
3.755GluAsp: 3.755 ± 0.075
1.502GluGlu: 1.502 ± 1.045
2.629GluPhe: 2.629 ± 2.251
1.502GluGly: 1.502 ± 0.647
1.127GluHis: 1.127 ± 1.207
3.38GluIle: 3.38 ± 0.237
1.878GluLys: 1.878 ± 0.808
6.008GluLeu: 6.008 ± 0.797
0.751GluMet: 0.751 ± 0.323
1.127GluAsn: 1.127 ± 0.485
1.878GluPro: 1.878 ± 4.266
2.629GluGln: 2.629 ± 1.132
1.127GluArg: 1.127 ± 0.485
3.004GluSer: 3.004 ± 0.398
4.131GluThr: 4.131 ± 1.778
3.38GluVal: 3.38 ± 1.455
0.0GluTrp: 0.0 ± 0.0
0.751GluTyr: 0.751 ± 0.323
0.0GluXaa: 0.0 ± 0.0
Phe
4.506PheAla: 4.506 ± 1.443
0.751PheCys: 0.751 ± 0.323
2.629PheAsp: 2.629 ± 0.56
3.38PheGlu: 3.38 ± 1.928
2.629PhePhe: 2.629 ± 3.943
3.38PheGly: 3.38 ± 1.455
1.127PheHis: 1.127 ± 0.485
4.131PheIle: 4.131 ± 1.778
3.755PheLys: 3.755 ± 1.766
1.127PheLeu: 1.127 ± 0.485
1.878PheMet: 1.878 ± 0.969
1.502PheAsn: 1.502 ± 0.647
0.0PhePro: 0.0 ± 0.0
1.502PheGln: 1.502 ± 2.736
1.127PheArg: 1.127 ± 1.207
4.131PheSer: 4.131 ± 1.778
3.004PheThr: 3.004 ± 1.293
1.502PheVal: 1.502 ± 0.647
0.751PheTrp: 0.751 ± 1.368
1.878PheTyr: 1.878 ± 0.808
0.0PheXaa: 0.0 ± 0.0
Gly
4.882GlyAla: 4.882 ± 2.102
0.376GlyCys: 0.376 ± 0.162
3.38GlyAsp: 3.38 ± 1.455
1.502GlyGlu: 1.502 ± 0.647
2.629GlyPhe: 2.629 ± 0.56
3.755GlyGly: 3.755 ± 1.617
1.502GlyHis: 1.502 ± 0.647
4.506GlyIle: 4.506 ± 1.94
3.004GlyLys: 3.004 ± 0.398
3.004GlyLeu: 3.004 ± 0.398
1.127GlyMet: 1.127 ± 0.485
1.127GlyAsn: 1.127 ± 0.485
2.253GlyPro: 2.253 ± 2.413
1.878GlyGln: 1.878 ± 0.808
1.502GlyArg: 1.502 ± 1.045
3.004GlySer: 3.004 ± 1.293
4.882GlyThr: 4.882 ± 2.102
3.38GlyVal: 3.38 ± 1.455
1.878GlyTrp: 1.878 ± 0.808
2.253GlyTyr: 2.253 ± 0.722
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.485
0.751HisCys: 0.751 ± 0.323
2.253HisAsp: 2.253 ± 0.97
0.376HisGlu: 0.376 ± 0.162
0.751HisPhe: 0.751 ± 0.323
1.878HisGly: 1.878 ± 0.808
0.0HisHis: 0.0 ± 0.0
0.376HisIle: 0.376 ± 0.162
1.127HisLys: 1.127 ± 1.207
3.755HisLeu: 3.755 ± 1.766
0.376HisMet: 0.376 ± 0.162
1.127HisAsn: 1.127 ± 0.485
2.629HisPro: 2.629 ± 0.56
1.502HisGln: 1.502 ± 0.647
0.751HisArg: 0.751 ± 1.368
2.253HisSer: 2.253 ± 0.97
1.127HisThr: 1.127 ± 2.898
0.751HisVal: 0.751 ± 1.368
0.376HisTrp: 0.376 ± 0.162
1.502HisTyr: 1.502 ± 1.045
0.0HisXaa: 0.0 ± 0.0
Ile
5.257IleAla: 5.257 ± 2.263
0.376IleCys: 0.376 ± 0.162
4.131IleAsp: 4.131 ± 1.778
3.755IleGlu: 3.755 ± 1.617
1.502IlePhe: 1.502 ± 0.647
4.506IleGly: 4.506 ± 0.248
3.38IleHis: 3.38 ± 0.237
3.004IleIle: 3.004 ± 1.293
3.755IleLys: 3.755 ± 1.617
6.008IleLeu: 6.008 ± 2.488
2.253IleMet: 2.253 ± 0.97
3.755IleAsn: 3.755 ± 1.617
4.882IlePro: 4.882 ± 0.41
1.878IleGln: 1.878 ± 0.883
3.004IleArg: 3.004 ± 1.293
7.51IleSer: 7.51 ± 1.542
3.755IleThr: 3.755 ± 1.617
2.629IleVal: 2.629 ± 1.132
0.0IleTrp: 0.0 ± 0.0
3.004IleTyr: 3.004 ± 1.293
0.0IleXaa: 0.0 ± 0.0
Lys
4.131LysAla: 4.131 ± 1.605
0.376LysCys: 0.376 ± 0.162
2.629LysAsp: 2.629 ± 0.56
3.38LysGlu: 3.38 ± 0.237
1.127LysPhe: 1.127 ± 0.485
1.502LysGly: 1.502 ± 0.647
1.127LysHis: 1.127 ± 1.207
5.257LysIle: 5.257 ± 0.572
3.004LysLys: 3.004 ± 1.293
6.008LysLeu: 6.008 ± 2.586
1.502LysMet: 1.502 ± 0.647
2.629LysAsn: 2.629 ± 1.132
4.506LysPro: 4.506 ± 1.443
1.127LysGln: 1.127 ± 0.485
3.38LysArg: 3.38 ± 1.455
3.755LysSer: 3.755 ± 0.075
3.755LysThr: 3.755 ± 0.075
1.878LysVal: 1.878 ± 0.883
0.376LysTrp: 0.376 ± 0.162
1.502LysTyr: 1.502 ± 0.647
0.0LysXaa: 0.0 ± 0.0
Leu
9.012LeuAla: 9.012 ± 2.188
0.751LeuCys: 0.751 ± 1.368
6.759LeuAsp: 6.759 ± 3.856
4.506LeuGlu: 4.506 ± 3.135
4.131LeuPhe: 4.131 ± 1.605
3.38LeuGly: 3.38 ± 1.455
3.755LeuHis: 3.755 ± 1.766
3.38LeuIle: 3.38 ± 0.237
4.882LeuLys: 4.882 ± 0.41
6.384LeuLeu: 6.384 ± 2.748
0.376LeuMet: 0.376 ± 0.162
2.629LeuAsn: 2.629 ± 0.56
5.257LeuPro: 5.257 ± 0.572
4.882LeuGln: 4.882 ± 2.102
2.629LeuArg: 2.629 ± 1.132
4.506LeuSer: 4.506 ± 1.443
7.135LeuThr: 7.135 ± 3.695
6.384LeuVal: 6.384 ± 2.326
1.127LeuTrp: 1.127 ± 0.485
4.131LeuTyr: 4.131 ± 1.605
0.0LeuXaa: 0.0 ± 0.0
Met
3.004MetAla: 3.004 ± 1.293
0.0MetCys: 0.0 ± 0.0
1.127MetAsp: 1.127 ± 0.485
1.127MetGlu: 1.127 ± 0.485
1.502MetPhe: 1.502 ± 0.647
1.127MetGly: 1.127 ± 0.485
0.751MetHis: 0.751 ± 0.323
0.751MetIle: 0.751 ± 0.323
1.878MetLys: 1.878 ± 0.808
1.878MetLeu: 1.878 ± 0.808
0.0MetMet: 0.0 ± 0.0
1.127MetAsn: 1.127 ± 0.485
0.376MetPro: 0.376 ± 0.162
1.127MetGln: 1.127 ± 1.207
1.127MetArg: 1.127 ± 0.485
2.629MetSer: 2.629 ± 1.132
0.751MetThr: 0.751 ± 1.368
0.376MetVal: 0.376 ± 0.162
0.376MetTrp: 0.376 ± 0.162
1.127MetTyr: 1.127 ± 0.485
0.0MetXaa: 0.0 ± 0.0
Asn
3.004AsnAla: 3.004 ± 0.398
0.751AsnCys: 0.751 ± 0.323
2.629AsnAsp: 2.629 ± 1.132
1.127AsnGlu: 1.127 ± 1.207
0.751AsnPhe: 0.751 ± 0.323
1.502AsnGly: 1.502 ± 0.647
0.751AsnHis: 0.751 ± 0.323
4.882AsnIle: 4.882 ± 2.102
1.502AsnLys: 1.502 ± 1.045
3.004AsnLeu: 3.004 ± 0.398
0.376AsnMet: 0.376 ± 0.162
0.751AsnAsn: 0.751 ± 0.323
3.004AsnPro: 3.004 ± 1.293
1.878AsnGln: 1.878 ± 0.808
2.253AsnArg: 2.253 ± 0.97
1.878AsnSer: 1.878 ± 0.808
2.629AsnThr: 2.629 ± 1.132
2.629AsnVal: 2.629 ± 1.132
1.127AsnTrp: 1.127 ± 1.207
0.376AsnTyr: 0.376 ± 0.162
0.0AsnXaa: 0.0 ± 0.0
Pro
4.882ProAla: 4.882 ± 0.41
1.127ProCys: 1.127 ± 1.207
3.755ProAsp: 3.755 ± 0.075
1.878ProGlu: 1.878 ± 0.883
1.878ProPhe: 1.878 ± 0.808
3.004ProGly: 3.004 ± 0.398
0.0ProHis: 0.0 ± 0.0
5.633ProIle: 5.633 ± 2.425
1.878ProLys: 1.878 ± 0.808
4.131ProLeu: 4.131 ± 0.087
1.878ProMet: 1.878 ± 0.808
3.004ProAsn: 3.004 ± 0.398
2.629ProPro: 2.629 ± 1.132
1.878ProGln: 1.878 ± 0.808
2.253ProArg: 2.253 ± 2.413
5.633ProSer: 5.633 ± 2.65
5.257ProThr: 5.257 ± 2.263
3.38ProVal: 3.38 ± 1.455
0.751ProTrp: 0.751 ± 1.368
2.253ProTyr: 2.253 ± 0.97
0.0ProXaa: 0.0 ± 0.0
Gln
4.506GlnAla: 4.506 ± 1.443
1.502GlnCys: 1.502 ± 1.045
1.878GlnAsp: 1.878 ± 2.575
2.629GlnGlu: 2.629 ± 2.251
2.629GlnPhe: 2.629 ± 1.132
1.127GlnGly: 1.127 ± 0.485
0.0GlnHis: 0.0 ± 0.0
3.38GlnIle: 3.38 ± 1.455
0.751GlnLys: 0.751 ± 0.323
5.633GlnLeu: 5.633 ± 0.958
0.376GlnMet: 0.376 ± 0.162
0.0GlnAsn: 0.0 ± 0.0
3.755GlnPro: 3.755 ± 0.075
1.878GlnGln: 1.878 ± 0.808
2.629GlnArg: 2.629 ± 1.132
4.131GlnSer: 4.131 ± 0.087
2.629GlnThr: 2.629 ± 0.56
3.38GlnVal: 3.38 ± 1.455
0.376GlnTrp: 0.376 ± 0.162
0.376GlnTyr: 0.376 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
1.878ArgAla: 1.878 ± 0.808
1.127ArgCys: 1.127 ± 1.207
3.755ArgAsp: 3.755 ± 0.075
1.502ArgGlu: 1.502 ± 1.045
3.004ArgPhe: 3.004 ± 1.293
1.878ArgGly: 1.878 ± 0.808
0.376ArgHis: 0.376 ± 0.162
1.878ArgIle: 1.878 ± 0.808
3.38ArgLys: 3.38 ± 1.455
4.131ArgLeu: 4.131 ± 3.296
1.878ArgMet: 1.878 ± 0.808
1.502ArgAsn: 1.502 ± 1.045
3.004ArgPro: 3.004 ± 1.293
1.878ArgGln: 1.878 ± 0.883
4.506ArgArg: 4.506 ± 1.94
3.004ArgSer: 3.004 ± 2.09
3.004ArgThr: 3.004 ± 2.09
2.253ArgVal: 2.253 ± 0.97
0.751ArgTrp: 0.751 ± 1.368
0.751ArgTyr: 0.751 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
5.257SerAla: 5.257 ± 1.12
1.127SerCys: 1.127 ± 1.207
3.004SerAsp: 3.004 ± 1.293
2.629SerGlu: 2.629 ± 0.56
4.506SerPhe: 4.506 ± 0.248
4.131SerGly: 4.131 ± 0.087
1.878SerHis: 1.878 ± 0.883
6.008SerIle: 6.008 ± 0.797
2.253SerLys: 2.253 ± 0.722
8.261SerLeu: 8.261 ± 1.518
2.253SerMet: 2.253 ± 0.97
2.629SerAsn: 2.629 ± 1.132
2.253SerPro: 2.253 ± 0.97
4.882SerGln: 4.882 ± 0.41
4.131SerArg: 4.131 ± 1.778
6.384SerSer: 6.384 ± 4.018
9.012SerThr: 9.012 ± 0.497
3.38SerVal: 3.38 ± 5.311
0.751SerTrp: 0.751 ± 1.368
3.004SerTyr: 3.004 ± 1.293
0.0SerXaa: 0.0 ± 0.0
Thr
10.514ThrAla: 10.514 ± 2.24
1.127ThrCys: 1.127 ± 1.207
1.502ThrAsp: 1.502 ± 0.647
4.506ThrGlu: 4.506 ± 0.248
3.755ThrPhe: 3.755 ± 1.617
3.38ThrGly: 3.38 ± 0.237
1.127ThrHis: 1.127 ± 0.485
6.008ThrIle: 6.008 ± 0.895
4.506ThrLys: 4.506 ± 1.443
5.257ThrLeu: 5.257 ± 2.263
1.502ThrMet: 1.502 ± 0.647
2.253ThrAsn: 2.253 ± 0.97
4.882ThrPro: 4.882 ± 0.41
4.131ThrGln: 4.131 ± 0.087
3.755ThrArg: 3.755 ± 1.766
4.506ThrSer: 4.506 ± 0.248
7.51ThrThr: 7.51 ± 1.841
3.38ThrVal: 3.38 ± 0.237
1.502ThrTrp: 1.502 ± 0.647
2.253ThrTyr: 2.253 ± 0.722
0.0ThrXaa: 0.0 ± 0.0
Val
4.882ValAla: 4.882 ± 2.102
0.376ValCys: 0.376 ± 0.162
4.506ValAsp: 4.506 ± 1.443
1.502ValGlu: 1.502 ± 0.647
0.376ValPhe: 0.376 ± 1.53
3.004ValGly: 3.004 ± 0.398
1.502ValHis: 1.502 ± 1.045
4.131ValIle: 4.131 ± 1.778
3.38ValLys: 3.38 ± 1.455
4.506ValLeu: 4.506 ± 0.248
1.127ValMet: 1.127 ± 0.485
2.253ValAsn: 2.253 ± 0.97
4.131ValPro: 4.131 ± 0.087
2.629ValGln: 2.629 ± 1.132
2.629ValArg: 2.629 ± 2.251
4.882ValSer: 4.882 ± 2.973
4.882ValThr: 4.882 ± 0.41
3.38ValVal: 3.38 ± 1.455
1.502ValTrp: 1.502 ± 0.647
1.127ValTyr: 1.127 ± 0.485
0.0ValXaa: 0.0 ± 0.0
Trp
1.502TrpAla: 1.502 ± 0.647
0.376TrpCys: 0.376 ± 0.162
1.502TrpAsp: 1.502 ± 0.647
0.376TrpGlu: 0.376 ± 0.162
0.0TrpPhe: 0.0 ± 0.0
0.751TrpGly: 0.751 ± 0.323
0.751TrpHis: 0.751 ± 0.323
1.127TrpIle: 1.127 ± 1.207
0.751TrpLys: 0.751 ± 1.368
0.751TrpLeu: 0.751 ± 0.323
0.376TrpMet: 0.376 ± 0.162
0.376TrpAsn: 0.376 ± 0.162
0.376TrpPro: 0.376 ± 0.162
0.376TrpGln: 0.376 ± 1.53
1.502TrpArg: 1.502 ± 1.045
1.127TrpSer: 1.127 ± 1.207
1.127TrpThr: 1.127 ± 1.207
0.751TrpVal: 0.751 ± 0.323
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.253TyrAla: 2.253 ± 0.97
0.376TyrCys: 0.376 ± 0.162
2.253TyrAsp: 2.253 ± 0.97
1.127TyrGlu: 1.127 ± 0.485
2.629TyrPhe: 2.629 ± 0.56
2.253TyrGly: 2.253 ± 0.722
1.502TyrHis: 1.502 ± 1.045
0.376TyrIle: 0.376 ± 0.162
1.127TyrLys: 1.127 ± 0.485
3.004TyrLeu: 3.004 ± 2.09
1.127TyrMet: 1.127 ± 0.485
3.38TyrAsn: 3.38 ± 0.237
0.376TyrPro: 0.376 ± 0.162
0.751TyrGln: 0.751 ± 0.323
1.502TyrArg: 1.502 ± 0.647
1.502TyrSer: 1.502 ± 1.045
3.38TyrThr: 3.38 ± 1.455
2.629TyrVal: 2.629 ± 0.56
0.376TyrTrp: 0.376 ± 0.162
1.127TyrTyr: 1.127 ± 0.485
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2664 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski