Amino acid dipepetide frequency for Beihai picorna-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.619AlaAla: 3.619 ± 1.221
0.362AlaCys: 0.362 ± 0.224
6.515AlaAsp: 6.515 ± 0.758
3.257AlaGlu: 3.257 ± 1.194
3.257AlaPhe: 3.257 ± 1.255
5.067AlaGly: 5.067 ± 1.19
1.448AlaHis: 1.448 ± 0.497
1.81AlaIle: 1.81 ± 0.701
3.981AlaLys: 3.981 ± 1.614
5.067AlaLeu: 5.067 ± 1.996
1.086AlaMet: 1.086 ± 0.55
2.895AlaAsn: 2.895 ± 1.386
5.067AlaPro: 5.067 ± 1.19
1.086AlaGln: 1.086 ± 0.55
2.172AlaArg: 2.172 ± 0.656
4.705AlaSer: 4.705 ± 1.738
3.981AlaThr: 3.981 ± 1.251
4.705AlaVal: 4.705 ± 1.689
1.81AlaTrp: 1.81 ± 0.78
1.81AlaTyr: 1.81 ± 1.027
0.0AlaXaa: 0.0 ± 0.0
Cys
1.448CysAla: 1.448 ± 0.489
0.0CysCys: 0.0 ± 0.0
1.81CysAsp: 1.81 ± 2.013
0.362CysGlu: 0.362 ± 1.165
0.724CysPhe: 0.724 ± 0.244
0.724CysGly: 0.724 ± 0.448
0.0CysHis: 0.0 ± 0.0
0.724CysIle: 0.724 ± 0.244
1.086CysLys: 1.086 ± 1.002
0.362CysLeu: 0.362 ± 1.165
0.724CysMet: 0.724 ± 0.448
1.086CysAsn: 1.086 ± 0.672
0.724CysPro: 0.724 ± 0.244
0.362CysGln: 0.362 ± 0.224
0.362CysArg: 0.362 ± 0.224
0.724CysSer: 0.724 ± 0.448
0.724CysThr: 0.724 ± 2.329
0.362CysVal: 0.362 ± 0.224
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.343AspAla: 4.343 ± 1.047
0.724AspCys: 0.724 ± 0.448
7.962AspAsp: 7.962 ± 2.301
3.257AspGlu: 3.257 ± 0.559
3.619AspPhe: 3.619 ± 1.497
2.895AspGly: 2.895 ± 0.977
1.448AspHis: 1.448 ± 0.497
3.619AspIle: 3.619 ± 1.196
3.981AspLys: 3.981 ± 1.614
7.239AspLeu: 7.239 ± 2.243
1.448AspMet: 1.448 ± 0.489
2.172AspAsn: 2.172 ± 1.1
2.895AspPro: 2.895 ± 0.977
1.81AspGln: 1.81 ± 1.221
2.895AspArg: 2.895 ± 1.352
5.067AspSer: 5.067 ± 1.19
2.172AspThr: 2.172 ± 0.733
6.153AspVal: 6.153 ± 0.781
0.724AspTrp: 0.724 ± 1.063
3.257AspTyr: 3.257 ± 1.194
0.0AspXaa: 0.0 ± 0.0
Glu
3.257GluAla: 3.257 ± 0.989
1.086GluCys: 1.086 ± 1.002
2.172GluAsp: 2.172 ± 0.642
0.724GluGlu: 0.724 ± 1.063
2.533GluPhe: 2.533 ± 1.132
2.533GluGly: 2.533 ± 0.79
0.724GluHis: 0.724 ± 0.448
4.343GluIle: 4.343 ± 1.284
2.895GluLys: 2.895 ± 0.994
5.429GluLeu: 5.429 ± 0.667
1.81GluMet: 1.81 ± 0.723
2.172GluAsn: 2.172 ± 0.642
1.81GluPro: 1.81 ± 0.701
1.81GluGln: 1.81 ± 0.701
1.448GluArg: 1.448 ± 0.897
3.257GluSer: 3.257 ± 1.194
2.172GluThr: 2.172 ± 1.1
1.448GluVal: 1.448 ± 1.005
1.81GluTrp: 1.81 ± 0.701
1.448GluTyr: 1.448 ± 0.497
0.0GluXaa: 0.0 ± 0.0
Phe
3.257PheAla: 3.257 ± 0.963
0.362PheCys: 0.362 ± 0.224
3.257PheAsp: 3.257 ± 0.963
3.981PheGlu: 3.981 ± 0.858
4.705PhePhe: 4.705 ± 0.337
4.705PheGly: 4.705 ± 1.357
1.81PheHis: 1.81 ± 0.78
3.981PheIle: 3.981 ± 1.3
3.619PheLys: 3.619 ± 1.714
5.791PheLeu: 5.791 ± 0.476
0.724PheMet: 0.724 ± 0.448
1.448PheAsn: 1.448 ± 0.897
1.81PhePro: 1.81 ± 0.78
2.533PheGln: 2.533 ± 3.238
3.257PheArg: 3.257 ± 1.632
5.791PheSer: 5.791 ± 0.863
2.895PheThr: 2.895 ± 0.835
3.619PheVal: 3.619 ± 4.085
0.724PheTrp: 0.724 ± 0.244
1.81PheTyr: 1.81 ± 0.701
0.0PheXaa: 0.0 ± 0.0
Gly
4.705GlyAla: 4.705 ± 0.218
0.724GlyCys: 0.724 ± 0.244
5.067GlyAsp: 5.067 ± 2.032
2.895GlyGlu: 2.895 ± 0.977
4.343GlyPhe: 4.343 ± 1.284
5.067GlyGly: 5.067 ± 1.709
0.724GlyHis: 0.724 ± 0.448
3.257GlyIle: 3.257 ± 0.559
3.257GlyLys: 3.257 ± 1.632
5.429GlyLeu: 5.429 ± 2.3
1.086GlyMet: 1.086 ± 0.55
6.153GlyAsn: 6.153 ± 1.214
2.172GlyPro: 2.172 ± 0.733
3.257GlyGln: 3.257 ± 2.5
3.981GlyArg: 3.981 ± 1.466
3.981GlySer: 3.981 ± 1.614
3.981GlyThr: 3.981 ± 2.064
4.343GlyVal: 4.343 ± 1.047
1.448GlyTrp: 1.448 ± 1.36
1.81GlyTyr: 1.81 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
1.81HisAla: 1.81 ± 0.701
0.362HisCys: 0.362 ± 0.224
0.0HisAsp: 0.0 ± 0.0
0.362HisGlu: 0.362 ± 0.224
1.086HisPhe: 1.086 ± 0.672
0.362HisGly: 0.362 ± 0.224
0.0HisHis: 0.0 ± 0.0
0.362HisIle: 0.362 ± 0.224
1.448HisLys: 1.448 ± 1.005
1.81HisLeu: 1.81 ± 1.027
2.533HisMet: 2.533 ± 0.806
0.362HisAsn: 0.362 ± 0.342
0.362HisPro: 0.362 ± 0.342
0.362HisGln: 0.362 ± 0.224
1.448HisArg: 1.448 ± 0.897
2.172HisSer: 2.172 ± 0.656
0.362HisThr: 0.362 ± 0.224
0.724HisVal: 0.724 ± 0.448
0.0HisTrp: 0.0 ± 0.0
1.086HisTyr: 1.086 ± 1.025
0.0HisXaa: 0.0 ± 0.0
Ile
3.257IleAla: 3.257 ± 0.963
0.362IleCys: 0.362 ± 0.224
3.257IleAsp: 3.257 ± 0.989
3.257IleGlu: 3.257 ± 0.989
3.257IlePhe: 3.257 ± 0.963
1.086IleGly: 1.086 ± 0.321
0.362IleHis: 0.362 ± 0.224
3.257IleIle: 3.257 ± 1.03
3.257IleLys: 3.257 ± 0.335
4.705IleLeu: 4.705 ± 0.729
2.895IleMet: 2.895 ± 1.352
2.172IleAsn: 2.172 ± 2.13
2.172IlePro: 2.172 ± 0.642
1.81IleGln: 1.81 ± 0.819
2.533IleArg: 2.533 ± 0.539
5.429IleSer: 5.429 ± 2.103
5.067IleThr: 5.067 ± 1.612
2.533IleVal: 2.533 ± 0.806
0.362IleTrp: 0.362 ± 0.224
0.724IleTyr: 0.724 ± 0.448
0.0IleXaa: 0.0 ± 0.0
Lys
2.172LysAla: 2.172 ± 0.642
0.362LysCys: 0.362 ± 1.165
2.533LysAsp: 2.533 ± 1.132
1.448LysGlu: 1.448 ± 0.897
4.343LysPhe: 4.343 ± 0.525
3.257LysGly: 3.257 ± 0.335
1.086LysHis: 1.086 ± 0.672
3.981LysIle: 3.981 ± 1.614
1.448LysLys: 1.448 ± 0.497
4.705LysLeu: 4.705 ± 1.179
0.362LysMet: 0.362 ± 1.165
1.448LysAsn: 1.448 ± 0.897
1.81LysPro: 1.81 ± 0.78
2.172LysGln: 2.172 ± 0.642
4.343LysArg: 4.343 ± 2.218
9.41LysSer: 9.41 ± 2.378
2.533LysThr: 2.533 ± 4.34
5.067LysVal: 5.067 ± 1.458
1.086LysTrp: 1.086 ± 0.55
2.895LysTyr: 2.895 ± 0.89
0.0LysXaa: 0.0 ± 0.0
Leu
5.429LeuAla: 5.429 ± 1.278
1.81LeuCys: 1.81 ± 1.027
3.981LeuAsp: 3.981 ± 0.731
2.895LeuGlu: 2.895 ± 1.352
4.705LeuPhe: 4.705 ± 3.765
5.791LeuGly: 5.791 ± 0.485
2.895LeuHis: 2.895 ± 0.835
3.981LeuIle: 3.981 ± 1.3
4.705LeuLys: 4.705 ± 2.914
3.981LeuLeu: 3.981 ± 1.3
2.533LeuMet: 2.533 ± 1.569
3.981LeuAsn: 3.981 ± 1.228
1.81LeuPro: 1.81 ± 0.767
4.343LeuGln: 4.343 ± 1.847
5.429LeuArg: 5.429 ± 2.233
6.153LeuSer: 6.153 ± 2.068
6.515LeuThr: 6.515 ± 0.669
6.153LeuVal: 6.153 ± 1.115
1.086LeuTrp: 1.086 ± 0.321
2.533LeuTyr: 2.533 ± 1.826
0.0LeuXaa: 0.0 ± 0.0
Met
3.257MetAla: 3.257 ± 0.335
0.724MetCys: 0.724 ± 0.244
2.533MetAsp: 2.533 ± 0.806
1.086MetGlu: 1.086 ± 0.672
1.448MetPhe: 1.448 ± 0.989
1.448MetGly: 1.448 ± 1.005
0.0MetHis: 0.0 ± 0.0
0.362MetIle: 0.362 ± 0.224
2.172MetLys: 2.172 ± 1.1
2.172MetLeu: 2.172 ± 0.642
1.086MetMet: 1.086 ± 0.321
1.448MetAsn: 1.448 ± 0.897
3.257MetPro: 3.257 ± 0.679
0.724MetGln: 0.724 ± 0.448
1.086MetArg: 1.086 ± 0.672
2.172MetSer: 2.172 ± 0.642
1.81MetThr: 1.81 ± 0.524
2.172MetVal: 2.172 ± 0.745
0.362MetTrp: 0.362 ± 0.224
1.448MetTyr: 1.448 ± 0.497
0.0MetXaa: 0.0 ± 0.0
Asn
1.448AsnAla: 1.448 ± 0.497
0.362AsnCys: 0.362 ± 0.224
1.81AsnAsp: 1.81 ± 0.524
1.81AsnGlu: 1.81 ± 0.524
2.895AsnPhe: 2.895 ± 0.835
4.343AsnGly: 4.343 ± 2.095
1.086AsnHis: 1.086 ± 0.321
1.086AsnIle: 1.086 ± 0.321
2.895AsnLys: 2.895 ± 2.98
3.619AsnLeu: 3.619 ± 1.221
1.448AsnMet: 1.448 ± 0.497
2.172AsnAsn: 2.172 ± 0.733
5.429AsnPro: 5.429 ± 1.605
1.448AsnGln: 1.448 ± 0.489
2.172AsnArg: 2.172 ± 0.656
3.981AsnSer: 3.981 ± 0.231
2.895AsnThr: 2.895 ± 0.994
5.067AsnVal: 5.067 ± 1.19
0.362AsnTrp: 0.362 ± 0.224
3.257AsnTyr: 3.257 ± 1.651
0.0AsnXaa: 0.0 ± 0.0
Pro
3.619ProAla: 3.619 ± 1.559
1.086ProCys: 1.086 ± 1.205
3.619ProAsp: 3.619 ± 0.455
3.981ProGlu: 3.981 ± 1.151
2.172ProPhe: 2.172 ± 1.56
0.724ProGly: 0.724 ± 0.448
0.0ProHis: 0.0 ± 0.0
2.533ProIle: 2.533 ± 0.539
1.448ProLys: 1.448 ± 0.989
4.705ProLeu: 4.705 ± 0.337
0.362ProMet: 0.362 ± 0.342
3.257ProAsn: 3.257 ± 0.335
1.448ProPro: 1.448 ± 0.883
3.257ProGln: 3.257 ± 0.989
0.724ProArg: 0.724 ± 0.244
4.343ProSer: 4.343 ± 1.284
3.981ProThr: 3.981 ± 1.251
4.343ProVal: 4.343 ± 1.274
1.086ProTrp: 1.086 ± 0.321
1.81ProTyr: 1.81 ± 1.141
0.0ProXaa: 0.0 ± 0.0
Gln
2.172GlnAla: 2.172 ± 1.1
0.724GlnCys: 0.724 ± 1.135
1.448GlnAsp: 1.448 ± 0.849
2.533GlnGlu: 2.533 ± 0.806
1.448GlnPhe: 1.448 ± 0.497
2.172GlnGly: 2.172 ± 1.953
0.0GlnHis: 0.0 ± 0.0
1.81GlnIle: 1.81 ± 1.141
1.81GlnLys: 1.81 ± 0.701
2.533GlnLeu: 2.533 ± 1.227
2.172GlnMet: 2.172 ± 0.733
2.533GlnAsn: 2.533 ± 0.753
1.086GlnPro: 1.086 ± 1.025
1.086GlnGln: 1.086 ± 0.977
1.81GlnArg: 1.81 ± 0.767
5.067GlnSer: 5.067 ± 2.916
2.172GlnThr: 2.172 ± 0.733
1.81GlnVal: 1.81 ± 0.78
0.724GlnTrp: 0.724 ± 0.448
1.448GlnTyr: 1.448 ± 0.849
0.0GlnXaa: 0.0 ± 0.0
Arg
4.343ArgAla: 4.343 ± 1.572
0.362ArgCys: 0.362 ± 0.342
3.619ArgAsp: 3.619 ± 1.402
2.533ArgGlu: 2.533 ± 0.539
3.981ArgPhe: 3.981 ± 0.858
3.257ArgGly: 3.257 ± 2.93
0.362ArgHis: 0.362 ± 0.342
2.895ArgIle: 2.895 ± 0.903
2.533ArgLys: 2.533 ± 1.227
3.257ArgLeu: 3.257 ± 0.335
1.086ArgMet: 1.086 ± 0.65
1.81ArgAsn: 1.81 ± 0.524
3.981ArgPro: 3.981 ± 1.712
1.086ArgGln: 1.086 ± 0.672
1.81ArgArg: 1.81 ± 0.701
1.086ArgSer: 1.086 ± 0.321
1.81ArgThr: 1.81 ± 0.78
4.343ArgVal: 4.343 ± 1.49
0.724ArgTrp: 0.724 ± 0.448
1.81ArgTyr: 1.81 ± 0.78
0.0ArgXaa: 0.0 ± 0.0
Ser
6.153SerAla: 6.153 ± 1.582
1.086SerCys: 1.086 ± 0.672
6.153SerAsp: 6.153 ± 0.51
1.81SerGlu: 1.81 ± 1.121
5.067SerPhe: 5.067 ± 2.364
8.324SerGly: 8.324 ± 2.306
1.448SerHis: 1.448 ± 0.849
6.515SerIle: 6.515 ± 2.388
5.429SerLys: 5.429 ± 2.288
8.686SerLeu: 8.686 ± 1.819
3.257SerMet: 3.257 ± 1.688
3.981SerAsn: 3.981 ± 0.731
4.343SerPro: 4.343 ± 0.014
1.448SerGln: 1.448 ± 0.497
3.257SerArg: 3.257 ± 1.651
6.515SerSer: 6.515 ± 0.905
3.981SerThr: 3.981 ± 1.151
2.895SerVal: 2.895 ± 0.977
0.362SerTrp: 0.362 ± 0.224
3.981SerTyr: 3.981 ± 2.948
0.0SerXaa: 0.0 ± 0.0
Thr
1.81ThrAla: 1.81 ± 0.78
0.362ThrCys: 0.362 ± 1.165
5.067ThrAsp: 5.067 ± 1.469
3.257ThrGlu: 3.257 ± 1.255
1.81ThrPhe: 1.81 ± 0.701
4.705ThrGly: 4.705 ± 1.357
1.81ThrHis: 1.81 ± 0.767
2.895ThrIle: 2.895 ± 1.327
2.533ThrLys: 2.533 ± 1.979
5.791ThrLeu: 5.791 ± 3.395
1.448ThrMet: 1.448 ± 0.883
3.619ThrAsn: 3.619 ± 1.122
2.895ThrPro: 2.895 ± 0.835
3.619ThrGln: 3.619 ± 1.497
2.533ThrArg: 2.533 ± 1.431
4.705ThrSer: 4.705 ± 2.422
2.895ThrThr: 2.895 ± 1.386
4.343ThrVal: 4.343 ± 1.274
0.724ThrTrp: 0.724 ± 1.063
1.81ThrTyr: 1.81 ± 2.105
0.0ThrXaa: 0.0 ± 0.0
Val
4.705ValAla: 4.705 ± 0.337
1.086ValCys: 1.086 ± 1.002
3.981ValAsp: 3.981 ± 1.151
2.533ValGlu: 2.533 ± 0.539
3.981ValPhe: 3.981 ± 1.228
5.067ValGly: 5.067 ± 0.679
1.086ValHis: 1.086 ± 0.672
2.533ValIle: 2.533 ± 0.753
3.619ValLys: 3.619 ± 1.049
3.257ValLeu: 3.257 ± 1.632
2.533ValMet: 2.533 ± 1.227
4.343ValAsn: 4.343 ± 0.523
4.705ValPro: 4.705 ± 1.633
2.895ValGln: 2.895 ± 0.89
2.533ValArg: 2.533 ± 1.132
6.153ValSer: 6.153 ± 0.691
5.791ValThr: 5.791 ± 0.476
3.619ValVal: 3.619 ± 1.915
0.362ValTrp: 0.362 ± 0.224
3.619ValTyr: 3.619 ± 1.533
0.0ValXaa: 0.0 ± 0.0
Trp
0.362TrpAla: 0.362 ± 1.165
0.362TrpCys: 0.362 ± 0.224
0.0TrpAsp: 0.0 ± 0.0
1.448TrpGlu: 1.448 ± 0.497
1.448TrpPhe: 1.448 ± 0.989
1.448TrpGly: 1.448 ± 0.489
0.0TrpHis: 0.0 ± 0.0
0.362TrpIle: 0.362 ± 0.224
1.81TrpLys: 1.81 ± 1.027
0.724TrpLeu: 0.724 ± 0.448
1.086TrpMet: 1.086 ± 0.55
0.362TrpAsn: 0.362 ± 0.342
0.0TrpPro: 0.0 ± 0.0
0.724TrpGln: 0.724 ± 0.244
1.086TrpArg: 1.086 ± 0.321
0.0TrpSer: 0.0 ± 0.0
1.448TrpThr: 1.448 ± 0.489
0.724TrpVal: 0.724 ± 0.684
0.362TrpTrp: 0.362 ± 0.224
0.724TrpTyr: 0.724 ± 0.448
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.533TyrAla: 2.533 ± 0.806
0.0TyrCys: 0.0 ± 0.0
2.895TyrAsp: 2.895 ± 0.501
1.81TyrGlu: 1.81 ± 0.701
3.257TyrPhe: 3.257 ± 0.335
4.705TyrGly: 4.705 ± 2.6
0.724TyrHis: 0.724 ± 1.063
1.448TyrIle: 1.448 ± 0.849
2.533TyrLys: 2.533 ± 0.539
1.448TyrLeu: 1.448 ± 0.849
0.724TyrMet: 0.724 ± 1.135
2.172TyrAsn: 2.172 ± 1.1
0.724TyrPro: 0.724 ± 0.448
0.724TyrGln: 0.724 ± 0.244
1.81TyrArg: 1.81 ± 1.141
3.981TyrSer: 3.981 ± 1.151
1.448TyrThr: 1.448 ± 1.005
3.981TyrVal: 3.981 ± 0.858
0.362TyrTrp: 0.362 ± 0.224
2.172TyrTyr: 2.172 ± 0.656
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski