Amino acid dipepetide frequency for Shahe picorna-like virus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.409AlaAla: 4.409 ± 1.133
0.401AlaCys: 0.401 ± 0.203
2.405AlaAsp: 2.405 ± 0.128
0.401AlaGlu: 0.401 ± 0.203
2.004AlaPhe: 2.004 ± 0.342
2.405AlaGly: 2.405 ± 0.545
2.405AlaHis: 2.405 ± 1.219
5.21AlaIle: 5.21 ± 1.294
3.607AlaLys: 3.607 ± 1.155
3.607AlaLeu: 3.607 ± 1.539
1.603AlaMet: 1.603 ± 0.139
4.008AlaAsn: 4.008 ± 0.011
2.806AlaPro: 2.806 ± 0.599
5.611AlaGln: 5.611 ± 0.524
2.004AlaArg: 2.004 ± 0.331
7.214AlaSer: 7.214 ± 1.058
6.413AlaThr: 6.413 ± 2.812
5.21AlaVal: 5.21 ± 1.4
1.202AlaTrp: 1.202 ± 0.738
3.607AlaTyr: 3.607 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.802CysAla: 0.802 ± 0.406
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.202CysGlu: 1.202 ± 0.064
1.603CysPhe: 1.603 ± 0.139
0.401CysGly: 0.401 ± 0.203
0.0CysHis: 0.0 ± 0.0
0.802CysIle: 0.802 ± 0.406
0.802CysLys: 0.802 ± 0.267
0.401CysLeu: 0.401 ± 0.203
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.202CysPro: 1.202 ± 0.064
0.802CysGln: 0.802 ± 0.941
1.202CysArg: 1.202 ± 0.609
0.401CysSer: 0.401 ± 0.203
0.802CysThr: 0.802 ± 0.941
1.202CysVal: 1.202 ± 0.064
0.0CysTrp: 0.0 ± 0.0
0.401CysTyr: 0.401 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
3.206AspAla: 3.206 ± 0.278
0.401AspCys: 0.401 ± 0.203
6.413AspAsp: 6.413 ± 0.791
1.603AspGlu: 1.603 ± 0.139
5.611AspPhe: 5.611 ± 1.197
4.81AspGly: 4.81 ± 1.604
0.0AspHis: 0.0 ± 0.0
5.611AspIle: 5.611 ± 0.15
4.409AspLys: 4.409 ± 1.561
7.214AspLeu: 7.214 ± 0.385
0.802AspMet: 0.802 ± 0.406
1.603AspAsn: 1.603 ± 0.813
2.806AspPro: 2.806 ± 0.599
2.405AspGln: 2.405 ± 0.545
1.603AspArg: 1.603 ± 0.139
2.004AspSer: 2.004 ± 1.005
3.607AspThr: 3.607 ± 1.155
2.405AspVal: 2.405 ± 0.128
1.202AspTrp: 1.202 ± 0.064
0.802AspTyr: 0.802 ± 0.406
0.0AspXaa: 0.0 ± 0.0
Glu
2.405GluAla: 2.405 ± 0.545
0.802GluCys: 0.802 ± 0.406
4.008GluAsp: 4.008 ± 0.684
4.008GluGlu: 4.008 ± 0.684
2.806GluPhe: 2.806 ± 0.599
6.012GluGly: 6.012 ± 0.321
0.401GluHis: 0.401 ± 0.203
7.214GluIle: 7.214 ± 0.385
3.607GluLys: 3.607 ± 1.155
3.607GluLeu: 3.607 ± 0.866
1.202GluMet: 1.202 ± 0.064
2.004GluAsn: 2.004 ± 0.342
0.802GluPro: 0.802 ± 0.406
2.806GluGln: 2.806 ± 1.422
1.603GluArg: 1.603 ± 0.139
3.607GluSer: 3.607 ± 1.828
3.206GluThr: 3.206 ± 0.278
4.008GluVal: 4.008 ± 1.358
2.405GluTrp: 2.405 ± 0.128
1.603GluTyr: 1.603 ± 0.535
0.0GluXaa: 0.0 ± 0.0
Phe
2.405PheAla: 2.405 ± 0.802
0.0PheCys: 0.0 ± 0.0
2.405PheAsp: 2.405 ± 0.128
2.806PheGlu: 2.806 ± 0.075
0.802PhePhe: 0.802 ± 0.267
2.004PheGly: 2.004 ± 1.005
2.004PheHis: 2.004 ± 1.678
3.206PheIle: 3.206 ± 0.395
1.202PheLys: 1.202 ± 0.738
4.409PheLeu: 4.409 ± 1.561
1.202PheMet: 1.202 ± 0.609
2.806PheAsn: 2.806 ± 0.075
0.802PhePro: 0.802 ± 0.406
1.202PheGln: 1.202 ± 0.738
2.004PheArg: 2.004 ± 1.016
4.81PheSer: 4.81 ± 0.256
3.206PheThr: 3.206 ± 0.278
4.008PheVal: 4.008 ± 0.011
0.802PheTrp: 0.802 ± 0.406
2.405PheTyr: 2.405 ± 0.128
0.0PheXaa: 0.0 ± 0.0
Gly
4.409GlyAla: 4.409 ± 1.133
1.202GlyCys: 1.202 ± 0.064
4.409GlyAsp: 4.409 ± 1.133
4.008GlyGlu: 4.008 ± 0.011
1.202GlyPhe: 1.202 ± 0.064
2.004GlyGly: 2.004 ± 0.331
0.0GlyHis: 0.0 ± 0.0
6.413GlyIle: 6.413 ± 1.23
7.615GlyLys: 7.615 ± 1.839
4.008GlyLeu: 4.008 ± 0.011
2.004GlyMet: 2.004 ± 0.331
2.806GlyAsn: 2.806 ± 1.272
2.806GlyPro: 2.806 ± 0.748
1.202GlyGln: 1.202 ± 0.738
1.603GlyArg: 1.603 ± 0.139
3.206GlySer: 3.206 ± 0.395
2.806GlyThr: 2.806 ± 0.599
0.802GlyVal: 0.802 ± 0.406
0.401GlyTrp: 0.401 ± 0.203
2.004GlyTyr: 2.004 ± 1.016
0.0GlyXaa: 0.0 ± 0.0
His
2.004HisAla: 2.004 ± 0.342
0.802HisCys: 0.802 ± 0.406
0.401HisAsp: 0.401 ± 0.203
0.802HisGlu: 0.802 ± 0.406
2.004HisPhe: 2.004 ± 1.016
0.802HisGly: 0.802 ± 0.406
0.0HisHis: 0.0 ± 0.0
1.202HisIle: 1.202 ± 0.064
2.405HisLys: 2.405 ± 0.128
1.202HisLeu: 1.202 ± 0.064
0.802HisMet: 0.802 ± 0.267
0.401HisAsn: 0.401 ± 0.203
1.202HisPro: 1.202 ± 0.064
0.802HisGln: 0.802 ± 0.267
0.401HisArg: 0.401 ± 0.203
2.004HisSer: 2.004 ± 0.342
1.202HisThr: 1.202 ± 0.738
2.004HisVal: 2.004 ± 1.016
0.401HisTrp: 0.401 ± 0.203
0.802HisTyr: 0.802 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
3.607IleAla: 3.607 ± 1.155
0.401IleCys: 0.401 ± 0.203
4.008IleAsp: 4.008 ± 0.663
6.012IleGlu: 6.012 ± 0.353
3.206IlePhe: 3.206 ± 0.395
2.405IleGly: 2.405 ± 1.475
0.401IleHis: 0.401 ± 0.47
4.409IleIle: 4.409 ± 0.214
2.806IleLys: 2.806 ± 0.075
9.218IleLeu: 9.218 ± 1.978
2.004IleMet: 2.004 ± 0.331
6.814IleAsn: 6.814 ± 0.759
4.409IlePro: 4.409 ± 1.133
1.202IleGln: 1.202 ± 0.609
2.806IleArg: 2.806 ± 0.075
5.21IleSer: 5.21 ± 0.053
4.81IleThr: 4.81 ± 0.417
4.81IleVal: 4.81 ± 1.764
0.401IleTrp: 0.401 ± 0.203
2.806IleTyr: 2.806 ± 0.599
0.0IleXaa: 0.0 ± 0.0
Lys
3.607LysAla: 3.607 ± 1.155
0.802LysCys: 0.802 ± 0.267
3.607LysAsp: 3.607 ± 1.155
3.206LysGlu: 3.206 ± 0.278
2.405LysPhe: 2.405 ± 0.128
4.008LysGly: 4.008 ± 1.358
1.603LysHis: 1.603 ± 0.813
4.008LysIle: 4.008 ± 0.684
2.004LysLys: 2.004 ± 0.342
7.214LysLeu: 7.214 ± 0.962
0.802LysMet: 0.802 ± 0.267
1.202LysAsn: 1.202 ± 0.064
7.615LysPro: 7.615 ± 0.182
2.806LysGln: 2.806 ± 0.748
2.004LysArg: 2.004 ± 0.342
3.206LysSer: 3.206 ± 0.395
4.409LysThr: 4.409 ± 1.807
4.409LysVal: 4.409 ± 0.887
0.0LysTrp: 0.0 ± 0.0
3.206LysTyr: 3.206 ± 0.952
0.0LysXaa: 0.0 ± 0.0
Leu
5.611LeuAla: 5.611 ± 0.823
0.401LeuCys: 0.401 ± 0.47
4.008LeuAsp: 4.008 ± 0.684
5.21LeuGlu: 5.21 ± 0.727
1.202LeuPhe: 1.202 ± 0.738
5.21LeuGly: 5.21 ± 0.62
2.405LeuHis: 2.405 ± 1.219
3.607LeuIle: 3.607 ± 1.155
5.21LeuLys: 5.21 ± 0.62
5.21LeuLeu: 5.21 ± 0.727
2.806LeuMet: 2.806 ± 0.599
5.21LeuAsn: 5.21 ± 0.62
4.409LeuPro: 4.409 ± 0.214
2.004LeuGln: 2.004 ± 0.331
4.409LeuArg: 4.409 ± 1.133
6.012LeuSer: 6.012 ± 0.994
7.615LeuThr: 7.615 ± 0.492
6.012LeuVal: 6.012 ± 0.321
1.202LeuTrp: 1.202 ± 0.609
2.806LeuTyr: 2.806 ± 0.748
0.0LeuXaa: 0.0 ± 0.0
Met
2.806MetAla: 2.806 ± 0.599
0.401MetCys: 0.401 ± 0.47
2.004MetAsp: 2.004 ± 0.331
2.806MetGlu: 2.806 ± 0.748
1.603MetPhe: 1.603 ± 0.139
1.603MetGly: 1.603 ± 0.535
0.401MetHis: 0.401 ± 0.203
2.405MetIle: 2.405 ± 1.475
2.405MetLys: 2.405 ± 0.545
1.202MetLeu: 1.202 ± 0.609
0.0MetMet: 0.0 ± 0.184
1.603MetAsn: 1.603 ± 0.813
0.802MetPro: 0.802 ± 0.267
1.202MetGln: 1.202 ± 0.609
2.004MetArg: 2.004 ± 0.331
2.004MetSer: 2.004 ± 0.331
0.401MetThr: 0.401 ± 0.203
0.802MetVal: 0.802 ± 0.267
0.401MetTrp: 0.401 ± 0.203
1.202MetTyr: 1.202 ± 0.064
0.0MetXaa: 0.0 ± 0.0
Asn
5.21AsnAla: 5.21 ± 0.62
0.802AsnCys: 0.802 ± 0.267
3.206AsnAsp: 3.206 ± 0.395
2.405AsnGlu: 2.405 ± 0.545
1.202AsnPhe: 1.202 ± 0.064
3.607AsnGly: 3.607 ± 1.155
0.401AsnHis: 0.401 ± 0.203
4.409AsnIle: 4.409 ± 0.214
4.409AsnLys: 4.409 ± 0.887
5.21AsnLeu: 5.21 ± 1.4
0.802AsnMet: 0.802 ± 0.406
4.008AsnAsn: 4.008 ± 2.683
1.603AsnPro: 1.603 ± 0.535
1.202AsnGln: 1.202 ± 0.609
3.206AsnArg: 3.206 ± 0.952
4.409AsnSer: 4.409 ± 0.887
2.806AsnThr: 2.806 ± 0.075
2.004AsnVal: 2.004 ± 0.331
0.802AsnTrp: 0.802 ± 0.267
1.603AsnTyr: 1.603 ± 1.208
0.0AsnXaa: 0.0 ± 0.0
Pro
3.607ProAla: 3.607 ± 0.192
0.0ProCys: 0.0 ± 0.0
4.81ProAsp: 4.81 ± 0.93
3.206ProGlu: 3.206 ± 0.395
2.806ProPhe: 2.806 ± 0.748
2.004ProGly: 2.004 ± 1.016
1.202ProHis: 1.202 ± 0.064
4.81ProIle: 4.81 ± 0.256
2.405ProLys: 2.405 ± 0.545
5.611ProLeu: 5.611 ± 0.15
1.603ProMet: 1.603 ± 0.4
2.806ProAsn: 2.806 ± 0.599
1.202ProPro: 1.202 ± 0.064
0.802ProGln: 0.802 ± 0.267
2.004ProArg: 2.004 ± 0.331
2.806ProSer: 2.806 ± 0.599
3.206ProThr: 3.206 ± 2.416
2.004ProVal: 2.004 ± 1.678
1.603ProTrp: 1.603 ± 0.139
1.202ProTyr: 1.202 ± 0.738
0.0ProXaa: 0.0 ± 0.0
Gln
1.603GlnAla: 1.603 ± 0.139
0.0GlnCys: 0.0 ± 0.0
1.603GlnAsp: 1.603 ± 0.535
2.004GlnGlu: 2.004 ± 1.016
1.202GlnPhe: 1.202 ± 1.411
1.603GlnGly: 1.603 ± 0.139
1.603GlnHis: 1.603 ± 0.535
4.008GlnIle: 4.008 ± 0.663
1.603GlnLys: 1.603 ± 0.535
1.202GlnLeu: 1.202 ± 0.609
3.206GlnMet: 3.206 ± 0.278
1.202GlnAsn: 1.202 ± 0.609
4.81GlnPro: 4.81 ± 1.091
2.004GlnGln: 2.004 ± 0.331
1.202GlnArg: 1.202 ± 0.609
0.802GlnSer: 0.802 ± 0.267
1.603GlnThr: 1.603 ± 0.139
2.405GlnVal: 2.405 ± 0.545
0.401GlnTrp: 0.401 ± 0.203
2.405GlnTyr: 2.405 ± 0.128
0.0GlnXaa: 0.0 ± 0.0
Arg
2.405ArgAla: 2.405 ± 1.475
0.802ArgCys: 0.802 ± 0.267
1.202ArgAsp: 1.202 ± 0.609
3.206ArgGlu: 3.206 ± 0.952
2.004ArgPhe: 2.004 ± 0.331
2.004ArgGly: 2.004 ± 0.331
1.202ArgHis: 1.202 ± 0.064
2.004ArgIle: 2.004 ± 0.342
2.806ArgLys: 2.806 ± 1.422
3.206ArgLeu: 3.206 ± 0.278
1.202ArgMet: 1.202 ± 0.609
3.607ArgAsn: 3.607 ± 1.828
2.004ArgPro: 2.004 ± 1.005
2.004ArgGln: 2.004 ± 1.016
3.206ArgArg: 3.206 ± 0.952
2.004ArgSer: 2.004 ± 0.342
6.012ArgThr: 6.012 ± 0.321
2.806ArgVal: 2.806 ± 0.748
0.401ArgTrp: 0.401 ± 0.203
1.603ArgTyr: 1.603 ± 0.535
0.0ArgXaa: 0.0 ± 0.0
Ser
1.603SerAla: 1.603 ± 1.882
0.802SerCys: 0.802 ± 0.406
1.603SerAsp: 1.603 ± 0.139
3.206SerGlu: 3.206 ± 0.952
3.607SerPhe: 3.607 ± 0.481
3.607SerGly: 3.607 ± 0.192
2.806SerHis: 2.806 ± 0.075
4.008SerIle: 4.008 ± 0.011
4.008SerLys: 4.008 ± 0.011
6.413SerLeu: 6.413 ± 1.465
2.806SerMet: 2.806 ± 0.599
3.206SerAsn: 3.206 ± 0.395
3.607SerPro: 3.607 ± 0.866
1.202SerGln: 1.202 ± 1.411
2.806SerArg: 2.806 ± 0.075
3.607SerSer: 3.607 ± 0.192
6.413SerThr: 6.413 ± 0.791
5.21SerVal: 5.21 ± 2.074
0.401SerTrp: 0.401 ± 0.203
3.206SerTyr: 3.206 ± 0.395
0.0SerXaa: 0.0 ± 0.0
Thr
5.21ThrAla: 5.21 ± 0.053
1.202ThrCys: 1.202 ± 0.064
3.206ThrAsp: 3.206 ± 0.952
3.206ThrGlu: 3.206 ± 1.069
2.806ThrPhe: 2.806 ± 0.075
5.611ThrGly: 5.611 ± 1.197
1.603ThrHis: 1.603 ± 0.813
5.21ThrIle: 5.21 ± 0.62
4.81ThrLys: 4.81 ± 1.604
4.008ThrLeu: 4.008 ± 0.684
1.603ThrMet: 1.603 ± 0.535
2.806ThrAsn: 2.806 ± 0.599
2.806ThrPro: 2.806 ± 1.272
3.607ThrGln: 3.607 ± 0.192
5.611ThrArg: 5.611 ± 0.823
3.607ThrSer: 3.607 ± 0.866
4.81ThrThr: 4.81 ± 1.091
5.21ThrVal: 5.21 ± 1.4
0.802ThrTrp: 0.802 ± 0.941
3.607ThrTyr: 3.607 ± 0.481
0.0ThrXaa: 0.0 ± 0.0
Val
8.016ValAla: 8.016 ± 1.325
2.004ValCys: 2.004 ± 0.342
6.413ValAsp: 6.413 ± 0.117
4.409ValGlu: 4.409 ± 0.887
4.008ValPhe: 4.008 ± 0.663
4.008ValGly: 4.008 ± 0.684
2.004ValHis: 2.004 ± 1.016
0.401ValIle: 0.401 ± 0.203
2.806ValLys: 2.806 ± 0.075
3.607ValLeu: 3.607 ± 1.828
0.401ValMet: 0.401 ± 0.203
4.008ValAsn: 4.008 ± 0.663
2.405ValPro: 2.405 ± 2.149
2.405ValGln: 2.405 ± 0.545
1.603ValArg: 1.603 ± 0.813
3.206ValSer: 3.206 ± 3.09
3.607ValThr: 3.607 ± 1.155
4.81ValVal: 4.81 ± 0.256
0.802ValTrp: 0.802 ± 0.406
2.806ValTyr: 2.806 ± 0.599
0.0ValXaa: 0.0 ± 0.0
Trp
0.802TrpAla: 0.802 ± 0.406
0.401TrpCys: 0.401 ± 0.47
0.802TrpAsp: 0.802 ± 0.406
1.202TrpGlu: 1.202 ± 0.064
0.802TrpPhe: 0.802 ± 0.406
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.202TrpIle: 1.202 ± 0.064
0.401TrpLys: 0.401 ± 0.203
1.202TrpLeu: 1.202 ± 0.738
1.202TrpMet: 1.202 ± 0.064
1.603TrpAsn: 1.603 ± 0.139
0.401TrpPro: 0.401 ± 0.203
0.0TrpGln: 0.0 ± 0.0
1.202TrpArg: 1.202 ± 0.064
0.802TrpSer: 0.802 ± 0.267
1.603TrpThr: 1.603 ± 0.813
0.401TrpVal: 0.401 ± 0.203
0.401TrpTrp: 0.401 ± 0.203
0.802TrpTyr: 0.802 ± 0.406
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.806TyrAla: 2.806 ± 1.946
0.401TyrCys: 0.401 ± 0.203
2.004TyrAsp: 2.004 ± 0.342
3.206TyrGlu: 3.206 ± 1.625
1.603TyrPhe: 1.603 ± 0.813
0.802TyrGly: 0.802 ± 0.406
1.202TyrHis: 1.202 ± 0.064
1.603TyrIle: 1.603 ± 0.535
2.806TyrLys: 2.806 ± 0.599
2.806TyrLeu: 2.806 ± 0.075
2.004TyrMet: 2.004 ± 0.331
1.603TyrAsn: 1.603 ± 0.535
1.202TyrPro: 1.202 ± 0.064
1.202TyrGln: 1.202 ± 0.609
3.206TyrArg: 3.206 ± 0.278
3.206TyrSer: 3.206 ± 1.743
2.806TyrThr: 2.806 ± 0.075
3.206TyrVal: 3.206 ± 0.278
1.202TyrTrp: 1.202 ± 0.609
2.806TyrTyr: 2.806 ± 1.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2496 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski