Amino acid dipeptide frequency for Beihai picorna-like virus 118

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
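As an illustration of what such a calculation might look like, the following Python sketch counts overlapping dipeptides within each protein and reports them in per mille. The function name dipeptide_per_mille, the mapping of non-standard residues to "X", and the normalisation by the total number of dipeptides are assumptions made for this example; the page does not describe how the database computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source page): dipeptides are counted
    as overlapping windows inside each protein, never across protein
    boundaries, and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Two toy sequences, purely illustrative.
    for pair, value in sorted(dipeptide_per_mille(["AAC", "ACB"]).items()):
        print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, so a caller that needs the full 20 x 20 (or 21 x 21) grid would fill the missing pairs with 0.0.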

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
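The comparison against the expected frequency and the unit conversion can be sketched as follows. This is a minimal example under stated assumptions: the helper names (expected_per_mille, classify, per_mille_to_fraction) are hypothetical, and the simple greater-than or less-than rule is only a guess at how over- and underrepresentation is decided, since the page does not state the exact colouring threshold.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille.

    20 amino acids give 400 dipeptides; 21 (with Xaa) give 441.
    """
    return 1000.0 / (alphabet_size ** 2)


EXPECTED_PER_MILLE = expected_per_mille(20)  # 2.5 per mille, i.e. 0.25%


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


if __name__ == "__main__":
    # Values taken from the table: AlaAla and CysCys.
    for name, value in [("AlaAla", 7.757), ("CysCys", 0.0)]:
        print(name, classify(value), per_mille_to_fraction(value))
```

With the 21-letter alphabet the expectation drops to roughly 2.27‰, so the choice of alphabet slightly shifts which borderline values count as overrepresented.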

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.757 ± 7.75 | 2.468 ± 0.631 | 3.173 ± 0.674 | 1.763 ± 1.278 | 4.584 ± 1.153 | 4.584 ± 3.642 | 1.763 ± 0.474 | 4.584 ± 0.836 | 3.879 ± 1.81 | 6.7 ± 2.49 | 1.763 ± 0.67 | 2.116 ± 0.636 | 2.468 ± 1.973 | 1.763 ± 0.767 | 2.821 ± 1.568 | 4.231 ± 1.271 | 5.289 ± 2.26 | 1.763 ± 0.98 | 0.705 ± 0.392 | 3.526 ± 3.89 | 0.0 ± 0.0 |
| Cys | 2.116 ± 0.62 | 0.0 ± 0.0 | 0.705 ± 0.392 | 0.705 ± 0.392 | 0.353 ± 0.196 | 1.763 ± 0.67 | 0.705 ± 0.392 | 0.353 ± 0.196 | 1.41 ± 0.349 | 1.763 ± 0.67 | 0.353 ± 0.525 | 0.353 ± 0.196 | 1.41 ± 0.349 | 0.353 ± 0.525 | 0.353 ± 0.196 | 2.468 ± 1.372 | 0.353 ± 0.196 | 1.058 ± 0.588 | 0.0 ± 0.0 | 0.353 ± 0.196 | 0.0 ± 0.0 |
| Asp | 5.642 ± 1.397 | 1.763 ± 0.474 | 4.937 ± 2.744 | 5.994 ± 1.738 | 3.173 ± 0.674 | 2.821 ± 1.13 | 0.705 ± 0.384 | 3.879 ± 1.566 | 4.937 ± 1.625 | 5.289 ± 1.317 | 0.705 ± 0.302 | 2.821 ± 0.698 | 3.879 ± 0.971 | 2.116 ± 1.152 | 1.41 ± 0.768 | 3.879 ± 1.104 | 3.879 ± 0.971 | 5.289 ± 2.26 | 1.41 ± 0.349 | 1.41 ± 0.91 | 0.0 ± 0.0 |
| Glu | 4.231 ± 0.346 | 0.353 ± 0.196 | 3.173 ± 0.931 | 3.879 ± 1.104 | 2.116 ± 1.176 | 3.526 ± 0.948 | 1.058 ± 0.588 | 2.468 ± 0.631 | 2.821 ± 1.568 | 7.405 ± 1.907 | 1.763 ± 0.67 | 2.116 ± 0.663 | 3.526 ± 0.783 | 2.821 ± 0.587 | 2.116 ± 0.663 | 2.468 ± 0.606 | 3.173 ± 0.428 | 3.173 ± 0.809 | 1.41 ± 0.784 | 3.173 ± 1.428 | 0.0 ± 0.0 |
| Phe | 2.821 ± 0.997 | 1.41 ± 0.768 | 4.584 ± 0.554 | 4.231 ± 1.325 | 3.879 ± 1.515 | 2.821 ± 0.997 | 1.763 ± 1.944 | 1.41 ± 0.784 | 3.879 ± 0.971 | 3.173 ± 1.662 | 1.41 ± 0.342 | 2.468 ± 0.813 | 2.468 ± 0.813 | 2.116 ± 0.97 | 0.705 ± 0.384 | 3.879 ± 1.892 | 2.468 ± 0.813 | 2.821 ± 0.611 | 0.353 ± 0.196 | 0.705 ± 0.392 | 0.0 ± 0.0 |
| Gly | 2.821 ± 1.803 | 0.353 ± 0.196 | 5.289 ± 0.499 | 2.821 ± 0.698 | 2.821 ± 0.698 | 3.526 ± 1.396 | 1.763 ± 0.474 | 2.468 ± 1.047 | 4.584 ± 1.153 | 7.052 ± 0.972 | 2.116 ± 0.62 | 3.526 ± 4.001 | 1.763 ± 1.135 | 1.763 ± 0.92 | 2.116 ± 0.62 | 2.468 ± 1.047 | 3.879 ± 1.276 | 4.584 ± 1.528 | 0.705 ± 0.384 | 1.058 ± 0.588 | 0.0 ± 0.0 |
| His | 1.41 ± 0.349 | 0.353 ± 0.196 | 1.41 ± 1.42 | 2.116 ± 1.176 | 1.058 ± 0.588 | 1.058 ± 0.898 | 1.058 ± 0.588 | 1.763 ± 0.98 | 1.058 ± 0.588 | 2.116 ± 1.176 | 0.705 ± 0.392 | 0.705 ± 0.384 | 3.173 ± 0.809 | 1.41 ± 0.91 | 1.41 ± 0.349 | 2.468 ± 1.372 | 1.058 ± 0.898 | 0.705 ± 0.392 | 0.705 ± 0.384 | 2.468 ± 1.372 | 0.0 ± 0.0 |
| Ile | 3.173 ± 0.809 | 1.41 ± 0.784 | 4.584 ± 0.15 | 3.173 ± 0.428 | 2.821 ± 0.698 | 3.526 ± 0.931 | 1.41 ± 0.784 | 2.116 ± 1.797 | 4.937 ± 1.42 | 7.405 ± 0.862 | 0.705 ± 0.392 | 3.173 ± 1.185 | 2.821 ± 0.997 | 1.763 ± 0.67 | 2.468 ± 0.813 | 3.526 ± 1.375 | 3.879 ± 2.425 | 2.116 ± 0.636 | 0.705 ± 0.392 | 2.468 ± 0.606 | 0.0 ± 0.0 |
| Lys | 4.231 ± 0.401 | 2.116 ± 1.176 | 4.231 ± 1.271 | 2.821 ± 0.997 | 2.821 ± 0.587 | 3.879 ± 0.302 | 1.058 ± 0.588 | 5.642 ± 1.993 | 4.231 ± 0.401 | 6.7 ± 3.139 | 1.763 ± 0.359 | 1.41 ± 0.784 | 2.821 ± 0.698 | 2.468 ± 0.763 | 5.289 ± 1.6 | 4.231 ± 0.401 | 3.526 ± 0.948 | 3.879 ± 1.104 | 1.763 ± 0.67 | 2.468 ± 0.631 | 0.0 ± 0.0 |
| Leu | 6.7 ± 3.79 | 0.353 ± 0.196 | 4.231 ± 1.241 | 4.937 ± 2.427 | 3.526 ± 0.314 | 7.405 ± 2.921 | 5.642 ± 2.535 | 4.231 ± 1.271 | 7.757 ± 0.538 | 4.584 ± 1.236 | 1.763 ± 0.98 | 4.937 ± 2.427 | 4.937 ± 1.192 | 3.879 ± 0.922 | 4.584 ± 1.153 | 7.052 ± 1.223 | 7.052 ± 1.895 | 4.231 ± 1.048 | 0.705 ± 0.384 | 2.116 ± 1.878 | 0.0 ± 0.0 |
| Met | 1.763 ± 1.135 | 0.353 ± 0.196 | 0.0 ± 0.0 | 1.763 ± 0.67 | 0.353 ± 0.196 | 0.353 ± 0.525 | 0.353 ± 0.196 | 1.058 ± 0.31 | 1.41 ± 0.784 | 2.116 ± 0.97 | 0.353 ± 1.112 | 1.058 ± 0.31 | 1.41 ± 0.768 | 1.058 ± 0.942 | 0.0 ± 0.0 | 2.821 ± 0.698 | 1.058 ± 0.898 | 1.41 ± 0.349 | 0.0 ± 0.0 | 0.705 ± 0.392 | 0.0 ± 0.0 |
| Asn | 1.763 ± 0.92 | 1.058 ± 0.31 | 1.763 ± 0.67 | 1.41 ± 0.349 | 1.763 ± 1.945 | 4.231 ± 1.708 | 1.058 ± 0.588 | 2.821 ± 1.674 | 2.821 ± 0.997 | 3.526 ± 0.314 | 0.353 ± 0.196 | 2.821 ± 3.219 | 2.821 ± 0.698 | 2.821 ± 4.101 | 3.173 ± 3.99 | 2.468 ± 0.813 | 2.821 ± 0.587 | 2.116 ± 0.663 | 0.0 ± 0.0 | 2.468 ± 1.66 | 0.0 ± 0.0 |
| Pro | 2.821 ± 1.167 | 1.41 ± 0.784 | 4.937 ± 1.625 | 4.584 ± 0.836 | 2.116 ± 0.62 | 3.173 ± 1.428 | 0.353 ± 0.196 | 6.347 ± 2.308 | 3.526 ± 0.783 | 4.937 ± 1.262 | 1.058 ± 0.31 | 1.41 ± 0.768 | 3.526 ± 1.919 | 2.116 ± 0.636 | 2.116 ± 0.663 | 5.642 ± 1.174 | 4.231 ± 1.241 | 2.116 ± 1.152 | 0.353 ± 0.196 | 0.705 ± 0.392 | 0.0 ± 0.0 |
| Gln | 4.231 ± 1.759 | 0.705 ± 0.384 | 0.705 ± 0.384 | 1.763 ± 0.67 | 2.468 ± 0.763 | 1.763 ± 1.135 | 0.353 ± 0.196 | 1.41 ± 0.91 | 2.821 ± 0.997 | 3.526 ± 3.885 | 0.705 ± 1.012 | 1.763 ± 4.47 | 1.763 ± 0.98 | 2.116 ± 1.878 | 0.705 ± 1.012 | 4.231 ± 1.325 | 1.763 ± 0.767 | 4.584 ± 2.913 | 0.705 ± 0.384 | 1.058 ± 0.942 | 0.0 ± 0.0 |
| Arg | 1.058 ± 0.588 | 0.353 ± 0.196 | 4.231 ± 0.401 | 2.116 ± 3.035 | 2.116 ± 0.62 | 2.821 ± 1.568 | 2.116 ± 0.62 | 3.879 ± 0.302 | 3.879 ± 2.749 | 3.526 ± 0.931 | 0.0 ± 0.0 | 3.879 ± 1.104 | 3.526 ± 0.948 | 1.058 ± 0.31 | 2.468 ± 1.055 | 3.173 ± 0.809 | 4.231 ± 1.325 | 1.41 ± 0.349 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ser | 6.7 ± 1.913 | 1.058 ± 0.898 | 5.289 ± 1.422 | 2.468 ± 0.813 | 3.173 ± 0.931 | 3.173 ± 1.428 | 3.173 ± 1.185 | 5.994 ± 2.729 | 3.879 ± 1.104 | 5.642 ± 0.373 | 1.41 ± 0.91 | 1.763 ± 0.767 | 3.173 ± 1.764 | 3.526 ± 2.271 | 4.937 ± 1.213 | 4.937 ± 1.192 | 5.289 ± 1.855 | 3.526 ± 0.931 | 0.705 ± 0.392 | 2.116 ± 0.636 | 0.0 ± 0.0 |
| Thr | 3.879 ± 1.642 | 0.353 ± 0.196 | 4.937 ± 1.262 | 4.231 ± 1.241 | 4.937 ± 1.192 | 2.116 ± 1.152 | 1.058 ± 0.588 | 3.173 ± 1.185 | 3.879 ± 1.515 | 5.994 ± 1.499 | 0.705 ± 0.384 | 2.468 ± 1.954 | 6.7 ± 1.919 | 1.763 ± 1.135 | 4.584 ± 1.952 | 5.289 ± 2.01 | 7.757 ± 1.084 | 2.821 ± 0.698 | 0.353 ± 0.196 | 1.41 ± 0.349 | 0.0 ± 0.0 |
| Val | 3.879 ± 0.922 | 0.705 ± 0.384 | 2.468 ± 1.047 | 2.116 ± 1.176 | 2.116 ± 1.176 | 2.468 ± 0.813 | 1.41 ± 0.768 | 3.173 ± 0.931 | 3.173 ± 0.674 | 5.994 ± 1.547 | 1.058 ± 0.31 | 2.468 ± 0.631 | 3.526 ± 2.556 | 2.468 ± 0.606 | 2.116 ± 1.152 | 3.526 ± 2.767 | 3.173 ± 0.809 | 3.173 ± 0.934 | 0.0 ± 0.0 | 2.468 ± 0.813 | 0.0 ± 0.0 |
| Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.705 ± 0.392 | 1.41 ± 0.784 | 1.763 ± 1.278 | 0.353 ± 0.196 | 0.705 ± 0.392 | 0.353 ± 0.196 | 1.41 ± 0.768 | 0.353 ± 0.196 | 0.0 ± 0.0 | 0.705 ± 0.384 | 0.353 ± 0.196 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.058 ± 0.31 | 1.058 ± 0.588 | 0.353 ± 0.525 | 0.0 ± 0.0 | 0.705 ± 0.392 | 0.0 ± 0.0 |
| Tyr | 0.705 ± 1.222 | 0.353 ± 0.196 | 4.584 ± 1.938 | 2.468 ± 0.813 | 1.41 ± 0.349 | 2.116 ± 0.636 | 0.705 ± 1.012 | 1.058 ± 0.31 | 1.41 ± 0.91 | 2.821 ± 0.997 | 0.353 ± 0.525 | 2.116 ± 1.878 | 1.41 ± 0.349 | 1.763 ± 2.001 | 2.468 ± 0.813 | 2.116 ± 0.663 | 2.468 ± 0.813 | 0.353 ± 0.525 | 0.705 ± 0.384 | 1.058 ± 0.588 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (2837 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
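A protein-level bootstrap of this kind could be sketched as below. This is an illustration under assumptions, not the database's actual procedure: the function names are hypothetical, proteins are resampled with replacement 100 times, and the error is taken to be the sample standard deviation of the resampled frequencies, which the page does not confirm.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'AC') in per mille across proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    Assumption: the reported error is the standard deviation of replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, purely illustrative.
    toy = ["ACACGT", "AAAC", "GTGTAC"]
    print(pair_per_mille(toy, "AC"), bootstrap_error(toy, "AC"))
```

With only a handful of proteins, as here, each replicate can differ substantially from the original set, which would be consistent with errors that are sometimes as large as the values themselves.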

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski