Amino acid dipepetide frequency for Hubei orthoptera virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.215AlaAla: 3.215 ± 1.203
0.715AlaCys: 0.715 ± 0.201
4.645AlaAsp: 4.645 ± 0.409
4.287AlaGlu: 4.287 ± 1.205
5.002AlaPhe: 5.002 ± 0.388
5.002AlaGly: 5.002 ± 1.406
1.072AlaHis: 1.072 ± 0.002
2.858AlaIle: 2.858 ± 0.991
5.002AlaLys: 5.002 ± 0.388
0.357AlaLeu: 0.357 ± 0.199
2.501AlaMet: 2.501 ± 0.404
2.501AlaAsn: 2.501 ± 0.194
2.501AlaPro: 2.501 ± 0.194
1.429AlaGln: 1.429 ± 1.0
1.786AlaArg: 1.786 ± 0.203
2.501AlaSer: 2.501 ± 1.002
4.287AlaThr: 4.287 ± 0.607
2.858AlaVal: 2.858 ± 0.804
0.715AlaTrp: 0.715 ± 0.201
2.144AlaTyr: 2.144 ± 0.603
0.0AlaXaa: 0.0 ± 0.0
Cys
0.715CysAla: 0.715 ± 0.201
0.0CysCys: 0.0 ± 0.0
0.357CysAsp: 0.357 ± 0.199
1.786CysGlu: 1.786 ± 0.203
1.786CysPhe: 1.786 ± 0.395
1.429CysGly: 1.429 ± 0.196
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.357CysLys: 0.357 ± 0.4
0.357CysLeu: 0.357 ± 0.199
0.357CysMet: 0.357 ± 0.322
0.0CysAsn: 0.0 ± 0.0
0.715CysPro: 0.715 ± 0.799
0.715CysGln: 0.715 ± 0.397
1.072CysArg: 1.072 ± 0.596
0.715CysSer: 0.715 ± 0.397
0.357CysThr: 0.357 ± 0.199
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.287AspAla: 4.287 ± 1.188
0.357AspCys: 0.357 ± 0.199
2.858AspAsp: 2.858 ± 0.991
3.573AspGlu: 3.573 ± 1.388
4.645AspPhe: 4.645 ± 1.386
2.501AspGly: 2.501 ± 1.002
0.357AspHis: 0.357 ± 0.4
6.431AspIle: 6.431 ± 1.21
3.93AspLys: 3.93 ± 0.806
6.788AspLeu: 6.788 ± 1.98
1.072AspMet: 1.072 ± 0.002
3.573AspAsn: 3.573 ± 2.201
4.287AspPro: 4.287 ± 0.009
2.501AspGln: 2.501 ± 0.194
1.072AspArg: 1.072 ± 0.596
3.573AspSer: 3.573 ± 0.406
5.002AspThr: 5.002 ± 1.406
2.144AspVal: 2.144 ± 0.004
0.715AspTrp: 0.715 ± 0.397
1.786AspTyr: 1.786 ± 0.203
0.0AspXaa: 0.0 ± 0.0
Glu
5.716GluAla: 5.716 ± 0.786
0.0GluCys: 0.0 ± 0.0
2.501GluAsp: 2.501 ± 1.002
5.716GluGlu: 5.716 ± 2.58
3.573GluPhe: 3.573 ± 0.406
3.215GluGly: 3.215 ± 1.19
0.715GluHis: 0.715 ± 0.397
5.716GluIle: 5.716 ± 1.009
3.573GluLys: 3.573 ± 0.406
5.359GluLeu: 5.359 ± 1.185
1.429GluMet: 1.429 ± 0.196
5.359GluAsn: 5.359 ± 1.208
1.072GluPro: 1.072 ± 0.6
2.501GluGln: 2.501 ± 0.792
2.144GluArg: 2.144 ± 0.603
3.573GluSer: 3.573 ± 0.79
2.144GluThr: 2.144 ± 1.192
5.716GluVal: 5.716 ± 0.411
1.429GluTrp: 1.429 ± 0.795
3.215GluTyr: 3.215 ± 1.19
0.0GluXaa: 0.0 ± 0.0
Phe
1.786PheAla: 1.786 ± 0.395
0.357PheCys: 0.357 ± 0.199
2.858PheAsp: 2.858 ± 0.804
3.215PheGlu: 3.215 ± 1.19
2.858PhePhe: 2.858 ± 0.804
2.858PheGly: 2.858 ± 0.393
0.357PheHis: 0.357 ± 0.4
1.786PheIle: 1.786 ± 0.801
3.573PheLys: 3.573 ± 0.79
5.002PheLeu: 5.002 ± 0.388
0.715PheMet: 0.715 ± 0.397
3.573PheAsn: 3.573 ± 0.406
1.786PhePro: 1.786 ± 0.801
1.072PheGln: 1.072 ± 0.596
2.858PheArg: 2.858 ± 0.804
3.93PheSer: 3.93 ± 1.587
2.144PheThr: 2.144 ± 0.603
3.93PheVal: 3.93 ± 0.391
0.715PheTrp: 0.715 ± 0.397
1.786PheTyr: 1.786 ± 0.203
0.0PheXaa: 0.0 ± 0.0
Gly
2.858GlyAla: 2.858 ± 0.804
0.0GlyCys: 0.0 ± 0.0
4.645GlyAsp: 4.645 ± 0.19
2.144GlyGlu: 2.144 ± 0.004
1.786GlyPhe: 1.786 ± 0.203
2.858GlyGly: 2.858 ± 2.0
0.357GlyHis: 0.357 ± 0.4
4.287GlyIle: 4.287 ± 0.009
6.431GlyLys: 6.431 ± 0.585
6.431GlyLeu: 6.431 ± 2.38
1.786GlyMet: 1.786 ± 0.395
4.287GlyAsn: 4.287 ± 0.607
1.072GlyPro: 1.072 ± 0.002
2.144GlyGln: 2.144 ± 0.004
1.786GlyArg: 1.786 ± 0.801
3.215GlySer: 3.215 ± 1.801
3.93GlyThr: 3.93 ± 0.208
2.144GlyVal: 2.144 ± 1.799
1.072GlyTrp: 1.072 ± 0.6
1.072GlyTyr: 1.072 ± 0.596
0.0GlyXaa: 0.0 ± 0.0
His
0.357HisAla: 0.357 ± 0.4
0.0HisCys: 0.0 ± 0.0
0.715HisAsp: 0.715 ± 0.201
0.715HisGlu: 0.715 ± 0.201
1.072HisPhe: 1.072 ± 0.6
1.072HisGly: 1.072 ± 0.596
0.0HisHis: 0.0 ± 0.0
1.072HisIle: 1.072 ± 0.6
1.429HisLys: 1.429 ± 0.196
1.429HisLeu: 1.429 ± 0.795
1.072HisMet: 1.072 ± 0.002
0.357HisAsn: 0.357 ± 0.199
1.072HisPro: 1.072 ± 0.596
0.715HisGln: 0.715 ± 0.201
0.357HisArg: 0.357 ± 0.199
1.429HisSer: 1.429 ± 0.196
1.072HisThr: 1.072 ± 0.002
1.429HisVal: 1.429 ± 0.795
0.357HisTrp: 0.357 ± 0.4
0.357HisTyr: 0.357 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
4.287IleAla: 4.287 ± 0.607
1.072IleCys: 1.072 ± 0.002
6.788IleAsp: 6.788 ± 1.382
5.716IleGlu: 5.716 ± 1.009
2.501IlePhe: 2.501 ± 0.404
2.858IleGly: 2.858 ± 1.402
1.429IleHis: 1.429 ± 0.795
2.501IleIle: 2.501 ± 0.194
4.645IleLys: 4.645 ± 0.409
7.145IleLeu: 7.145 ± 1.58
2.144IleMet: 2.144 ± 0.594
4.287IleAsn: 4.287 ± 0.589
4.645IlePro: 4.645 ± 2.203
1.786IleGln: 1.786 ± 0.203
2.858IleArg: 2.858 ± 0.393
5.716IleSer: 5.716 ± 0.411
4.287IleThr: 4.287 ± 0.009
3.215IleVal: 3.215 ± 2.4
1.072IleTrp: 1.072 ± 0.002
2.501IleTyr: 2.501 ± 0.792
0.0IleXaa: 0.0 ± 0.0
Lys
3.573LysAla: 3.573 ± 0.192
1.072LysCys: 1.072 ± 0.596
4.287LysAsp: 4.287 ± 0.009
4.287LysGlu: 4.287 ± 0.009
2.858LysPhe: 2.858 ± 0.991
3.215LysGly: 3.215 ± 0.007
2.501LysHis: 2.501 ± 0.404
5.359LysIle: 5.359 ± 0.587
7.503LysLys: 7.503 ± 3.574
6.074LysLeu: 6.074 ± 0.81
1.429LysMet: 1.429 ± 0.417
5.002LysAsn: 5.002 ± 0.21
2.858LysPro: 2.858 ± 0.393
1.429LysGln: 1.429 ± 0.196
3.573LysArg: 3.573 ± 0.192
3.573LysSer: 3.573 ± 0.79
6.431LysThr: 6.431 ± 1.781
5.359LysVal: 5.359 ± 1.185
1.429LysTrp: 1.429 ± 0.402
2.858LysTyr: 2.858 ± 0.205
0.0LysXaa: 0.0 ± 0.0
Leu
4.645LeuAla: 4.645 ± 0.788
1.429LeuCys: 1.429 ± 0.402
5.716LeuAsp: 5.716 ± 0.411
6.431LeuGlu: 6.431 ± 2.38
3.215LeuPhe: 3.215 ± 0.007
5.716LeuGly: 5.716 ± 2.58
2.144LeuHis: 2.144 ± 1.192
3.215LeuIle: 3.215 ± 1.788
3.215LeuLys: 3.215 ± 0.592
6.074LeuLeu: 6.074 ± 0.984
2.501LeuMet: 2.501 ± 0.194
4.287LeuAsn: 4.287 ± 0.009
1.786LeuPro: 1.786 ± 0.801
3.93LeuGln: 3.93 ± 0.989
5.002LeuArg: 5.002 ± 0.808
5.359LeuSer: 5.359 ± 1.208
7.86LeuThr: 7.86 ± 2.21
8.217LeuVal: 8.217 ± 1.578
0.357LeuTrp: 0.357 ± 0.4
2.858LeuTyr: 2.858 ± 1.589
0.0LeuXaa: 0.0 ± 0.0
Met
1.786MetAla: 1.786 ± 0.801
0.715MetCys: 0.715 ± 0.397
2.144MetAsp: 2.144 ± 1.192
1.072MetGlu: 1.072 ± 0.596
0.715MetPhe: 0.715 ± 0.397
0.715MetGly: 0.715 ± 0.201
1.429MetHis: 1.429 ± 0.196
1.072MetIle: 1.072 ± 0.596
2.144MetLys: 2.144 ± 0.004
2.144MetLeu: 2.144 ± 1.192
0.715MetMet: 0.715 ± 0.397
2.144MetAsn: 2.144 ± 0.004
1.429MetPro: 1.429 ± 0.196
3.573MetGln: 3.573 ± 0.406
1.429MetArg: 1.429 ± 0.196
1.072MetSer: 1.072 ± 0.002
1.072MetThr: 1.072 ± 0.596
0.715MetVal: 0.715 ± 0.397
0.715MetTrp: 0.715 ± 0.397
1.786MetTyr: 1.786 ± 1.4
0.0MetXaa: 0.0 ± 0.0
Asn
2.858AsnAla: 2.858 ± 0.804
1.429AsnCys: 1.429 ± 0.196
3.573AsnAsp: 3.573 ± 0.192
3.93AsnGlu: 3.93 ± 0.208
0.715AsnPhe: 0.715 ± 0.201
2.144AsnGly: 2.144 ± 0.004
0.357AsnHis: 0.357 ± 0.4
6.074AsnIle: 6.074 ± 0.81
3.93AsnLys: 3.93 ± 0.989
6.431AsnLeu: 6.431 ± 1.21
4.287AsnMet: 4.287 ± 1.188
2.144AsnAsn: 2.144 ± 0.603
1.786AsnPro: 1.786 ± 0.801
0.357AsnGln: 0.357 ± 0.199
2.501AsnArg: 2.501 ± 0.792
3.215AsnSer: 3.215 ± 0.605
5.359AsnThr: 5.359 ± 0.609
5.002AsnVal: 5.002 ± 0.21
0.357AsnTrp: 0.357 ± 0.199
2.501AsnTyr: 2.501 ± 0.792
0.0AsnXaa: 0.0 ± 0.0
Pro
2.501ProAla: 2.501 ± 0.194
0.357ProCys: 0.357 ± 0.4
2.144ProAsp: 2.144 ± 1.201
2.144ProGlu: 2.144 ± 0.594
2.501ProPhe: 2.501 ± 1.002
3.215ProGly: 3.215 ± 1.203
0.357ProHis: 0.357 ± 0.4
3.93ProIle: 3.93 ± 2.002
1.786ProLys: 1.786 ± 0.395
6.431ProLeu: 6.431 ± 0.013
1.072ProMet: 1.072 ± 0.002
1.786ProAsn: 1.786 ± 0.395
1.072ProPro: 1.072 ± 0.002
2.144ProGln: 2.144 ± 1.201
0.715ProArg: 0.715 ± 0.201
2.858ProSer: 2.858 ± 0.205
3.215ProThr: 3.215 ± 0.592
2.858ProVal: 2.858 ± 0.804
1.072ProTrp: 1.072 ± 0.002
2.501ProTyr: 2.501 ± 0.404
0.0ProXaa: 0.0 ± 0.0
Gln
1.429GlnAla: 1.429 ± 0.196
0.357GlnCys: 0.357 ± 0.4
0.715GlnAsp: 0.715 ± 0.397
2.501GlnGlu: 2.501 ± 0.194
0.715GlnPhe: 0.715 ± 0.397
0.715GlnGly: 0.715 ± 0.397
0.357GlnHis: 0.357 ± 0.199
4.645GlnIle: 4.645 ± 1.007
2.858GlnLys: 2.858 ± 0.991
1.429GlnLeu: 1.429 ± 1.0
0.0GlnMet: 0.0 ± 0.0
3.215GlnAsn: 3.215 ± 0.592
3.215GlnPro: 3.215 ± 0.592
0.357GlnGln: 0.357 ± 0.199
2.501GlnArg: 2.501 ± 1.391
1.429GlnSer: 1.429 ± 1.0
0.715GlnThr: 0.715 ± 0.397
3.93GlnVal: 3.93 ± 0.806
0.357GlnTrp: 0.357 ± 0.199
1.786GlnTyr: 1.786 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
3.215ArgAla: 3.215 ± 1.801
0.0ArgCys: 0.0 ± 0.0
1.072ArgAsp: 1.072 ± 0.002
2.501ArgGlu: 2.501 ± 0.194
1.786ArgPhe: 1.786 ± 0.801
4.287ArgGly: 4.287 ± 0.009
0.357ArgHis: 0.357 ± 0.199
2.144ArgIle: 2.144 ± 1.192
2.858ArgLys: 2.858 ± 0.393
3.93ArgLeu: 3.93 ± 0.806
0.715ArgMet: 0.715 ± 0.397
1.429ArgAsn: 1.429 ± 0.196
3.215ArgPro: 3.215 ± 1.203
1.429ArgGln: 1.429 ± 0.196
3.573ArgArg: 3.573 ± 0.79
2.501ArgSer: 2.501 ± 0.404
2.144ArgThr: 2.144 ± 0.594
3.93ArgVal: 3.93 ± 0.989
0.715ArgTrp: 0.715 ± 0.397
1.786ArgTyr: 1.786 ± 0.203
0.0ArgXaa: 0.0 ± 0.0
Ser
3.573SerAla: 3.573 ± 0.406
0.0SerCys: 0.0 ± 0.0
4.287SerAsp: 4.287 ± 1.205
3.573SerGlu: 3.573 ± 0.192
1.429SerPhe: 1.429 ± 0.196
5.359SerGly: 5.359 ± 0.609
1.429SerHis: 1.429 ± 0.196
6.074SerIle: 6.074 ± 1.409
2.858SerLys: 2.858 ± 0.393
3.573SerLeu: 3.573 ± 0.192
2.501SerMet: 2.501 ± 0.792
3.573SerAsn: 3.573 ± 0.79
1.786SerPro: 1.786 ± 0.801
2.858SerGln: 2.858 ± 0.393
3.215SerArg: 3.215 ± 0.605
4.645SerSer: 4.645 ± 0.19
2.501SerThr: 2.501 ± 0.194
4.287SerVal: 4.287 ± 1.205
1.429SerTrp: 1.429 ± 0.402
3.573SerTyr: 3.573 ± 1.603
0.0SerXaa: 0.0 ± 0.0
Thr
3.215ThrAla: 3.215 ± 1.801
1.786ThrCys: 1.786 ± 0.395
2.858ThrAsp: 2.858 ± 0.991
3.573ThrGlu: 3.573 ± 0.192
3.93ThrPhe: 3.93 ± 0.208
3.573ThrGly: 3.573 ± 1.603
1.786ThrHis: 1.786 ± 0.395
5.716ThrIle: 5.716 ± 0.187
5.716ThrLys: 5.716 ± 1.384
5.359ThrLeu: 5.359 ± 1.784
1.786ThrMet: 1.786 ± 0.203
3.93ThrAsn: 3.93 ± 0.208
4.287ThrPro: 4.287 ± 0.607
2.501ThrGln: 2.501 ± 0.194
2.144ThrArg: 2.144 ± 0.594
3.93ThrSer: 3.93 ± 0.391
5.002ThrThr: 5.002 ± 0.808
3.573ThrVal: 3.573 ± 1.603
0.715ThrTrp: 0.715 ± 0.397
1.072ThrTyr: 1.072 ± 0.002
0.0ThrXaa: 0.0 ± 0.0
Val
2.858ValAla: 2.858 ± 0.804
1.072ValCys: 1.072 ± 0.596
5.359ValAsp: 5.359 ± 0.609
5.716ValGlu: 5.716 ± 1.009
2.501ValPhe: 2.501 ± 0.792
2.858ValGly: 2.858 ± 1.402
1.072ValHis: 1.072 ± 0.002
5.716ValIle: 5.716 ± 0.411
6.074ValLys: 6.074 ± 0.212
4.287ValLeu: 4.287 ± 0.009
0.357ValMet: 0.357 ± 0.4
3.93ValAsn: 3.93 ± 0.806
3.93ValPro: 3.93 ± 0.208
1.429ValGln: 1.429 ± 0.196
2.501ValArg: 2.501 ± 0.404
5.359ValSer: 5.359 ± 1.806
4.645ValThr: 4.645 ± 0.409
5.002ValVal: 5.002 ± 0.21
0.715ValTrp: 0.715 ± 0.201
3.93ValTyr: 3.93 ± 0.989
0.0ValXaa: 0.0 ± 0.0
Trp
0.715TrpAla: 0.715 ± 0.799
0.357TrpCys: 0.357 ± 0.199
1.072TrpAsp: 1.072 ± 0.002
0.0TrpGlu: 0.0 ± 0.0
2.144TrpPhe: 2.144 ± 0.594
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.357TrpIle: 0.357 ± 0.199
1.786TrpLys: 1.786 ± 0.395
2.501TrpLeu: 2.501 ± 0.194
0.0TrpMet: 0.0 ± 0.0
1.072TrpAsn: 1.072 ± 0.596
0.357TrpPro: 0.357 ± 0.199
0.0TrpGln: 0.0 ± 0.0
0.357TrpArg: 0.357 ± 0.4
0.715TrpSer: 0.715 ± 0.799
1.072TrpThr: 1.072 ± 0.596
1.429TrpVal: 1.429 ± 0.402
0.357TrpTrp: 0.357 ± 0.199
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.786TyrAla: 1.786 ± 0.203
0.0TyrCys: 0.0 ± 0.0
2.858TyrAsp: 2.858 ± 0.991
1.786TyrGlu: 1.786 ± 0.203
1.429TyrPhe: 1.429 ± 0.196
0.715TyrGly: 0.715 ± 0.397
0.0TyrHis: 0.0 ± 0.0
2.501TyrIle: 2.501 ± 0.792
5.002TyrLys: 5.002 ± 0.21
2.858TyrLeu: 2.858 ± 0.804
1.786TyrMet: 1.786 ± 0.203
2.501TyrAsn: 2.501 ± 0.792
1.786TyrPro: 1.786 ± 0.203
0.357TyrGln: 0.357 ± 0.199
2.144TyrArg: 2.144 ± 0.603
3.215TyrSer: 3.215 ± 0.592
3.215TyrThr: 3.215 ± 0.592
3.573TyrVal: 3.573 ± 1.005
0.0TyrTrp: 0.0 ± 0.0
1.072TyrTyr: 1.072 ± 0.002
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2800 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski