Amino acid dipeptide frequency for Hubei picorna-like virus 42

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
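The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify`, the use of one-letter codes with X for Xaa, and the choice to count overlapping pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
EXPECTED = 1000.0 / 400            # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted within each sequence, and any
    residue outside the 20 standard ones is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not the virus proteome).
freqs = dipeptide_permille(["MALLAV", "AAXA"])
```

With the table values, `classify(9.676)` (LeuLeu) gives "over" and `classify(0.667)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.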

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
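As a small worked example of this unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


# The uniform expectation of 2.5 per mille is the 0.25% quoted in the text.
uniform_percent = permille_to_percent(2.5)

# LeuLeu is listed at 9.676 per mille, i.e. a fraction of roughly 0.009676.
leu_leu_fraction = permille_to_fraction(9.676)
```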

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.669 ± 1.202
AlaCys: 0.667 ± 0.3
AlaAsp: 3.67 ± 1.213
AlaGlu: 3.67 ± 1.213
AlaPhe: 1.335 ± 0.601
AlaGly: 4.338 ± 0.361
AlaHis: 1.335 ± 0.036
AlaIle: 5.005 ± 1.249
AlaLys: 2.336 ± 0.54
AlaLeu: 7.007 ± 2.2
AlaMet: 2.336 ± 0.096
AlaAsn: 4.004 ± 1.381
AlaPro: 3.67 ± 1.213
AlaGln: 2.002 ± 0.372
AlaArg: 3.337 ± 1.045
AlaSer: 3.337 ± 0.408
AlaThr: 4.004 ± 1.166
AlaVal: 5.672 ± 0.962
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.002 ± 1.538
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.001 ± 0.505
CysCys: 0.667 ± 0.336
CysAsp: 0.334 ± 0.168
CysGlu: 0.667 ± 0.336
CysPhe: 1.001 ± 0.505
CysGly: 1.335 ± 1.238
CysHis: 0.0 ± 0.0
CysIle: 1.335 ± 0.601
CysLys: 0.0 ± 0.0
CysLeu: 0.667 ± 0.3
CysMet: 1.001 ± 0.505
CysAsn: 2.002 ± 0.372
CysPro: 0.0 ± 0.0
CysGln: 0.334 ± 0.168
CysArg: 0.667 ± 0.336
CysSer: 1.335 ± 0.673
CysThr: 0.667 ± 0.937
CysVal: 1.001 ± 0.132
CysTrp: 0.0 ± 0.0
CysTyr: 0.667 ± 0.336
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.002 ± 0.265
AspCys: 1.668 ± 0.433
AspAsp: 4.004 ± 2.44
AspGlu: 3.003 ± 0.397
AspPhe: 2.336 ± 1.177
AspGly: 1.668 ± 0.433
AspHis: 1.001 ± 0.132
AspIle: 6.34 ± 0.011
AspLys: 3.003 ± 0.24
AspLeu: 5.005 ± 2.572
AspMet: 0.667 ± 0.3
AspAsn: 3.337 ± 1.045
AspPro: 2.336 ± 0.54
AspGln: 3.67 ± 0.06
AspArg: 2.002 ± 1.009
AspSer: 2.002 ± 0.372
AspThr: 5.005 ± 0.612
AspVal: 1.668 ± 0.204
AspTrp: 1.001 ± 0.505
AspTyr: 4.004 ± 0.108
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.337 ± 1.045
GluCys: 0.667 ± 0.336
GluAsp: 2.336 ± 0.096
GluGlu: 2.669 ± 0.709
GluPhe: 1.668 ± 0.204
GluGly: 1.335 ± 0.036
GluHis: 1.335 ± 0.036
GluIle: 2.336 ± 0.096
GluLys: 2.669 ± 1.202
GluLeu: 3.337 ± 0.408
GluMet: 1.668 ± 0.433
GluAsn: 4.004 ± 1.381
GluPro: 3.337 ± 0.865
GluGln: 2.336 ± 1.177
GluArg: 1.335 ± 0.673
GluSer: 3.337 ± 0.229
GluThr: 6.006 ± 2.39
GluVal: 3.67 ± 0.576
GluTrp: 2.002 ± 0.265
GluTyr: 1.668 ± 0.433
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.003 ± 0.877
PheCys: 0.667 ± 0.3
PheAsp: 2.669 ± 1.345
PheGlu: 1.001 ± 0.132
PhePhe: 1.001 ± 0.132
PheGly: 2.669 ± 0.072
PheHis: 2.336 ± 0.54
PheIle: 2.669 ± 0.565
PheLys: 2.669 ± 0.072
PheLeu: 3.003 ± 2.307
PheMet: 1.335 ± 0.036
PheAsn: 2.669 ± 0.072
PhePro: 1.335 ± 0.673
PheGln: 0.0 ± 0.0
PheArg: 2.002 ± 0.372
PheSer: 0.667 ± 0.937
PheThr: 2.336 ± 0.096
PheVal: 1.668 ± 0.204
PheTrp: 0.667 ± 0.336
PheTyr: 2.669 ± 0.072
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.669 ± 0.565
GlyCys: 0.334 ± 0.469
GlyAsp: 2.669 ± 0.709
GlyGlu: 2.336 ± 0.096
GlyPhe: 1.001 ± 0.132
GlyGly: 2.336 ± 0.54
GlyHis: 0.334 ± 0.168
GlyIle: 2.336 ± 0.54
GlyLys: 2.336 ± 0.54
GlyLeu: 5.672 ± 0.325
GlyMet: 1.335 ± 0.673
GlyAsn: 3.67 ± 1.213
GlyPro: 2.336 ± 0.096
GlyGln: 0.667 ± 0.336
GlyArg: 1.668 ± 0.204
GlySer: 5.005 ± 0.025
GlyThr: 4.004 ± 1.166
GlyVal: 5.005 ± 1.298
GlyTrp: 2.336 ± 0.733
GlyTyr: 1.668 ± 0.204
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.335 ± 0.036
HisCys: 0.334 ± 0.168
HisAsp: 0.667 ± 0.3
HisGlu: 0.0 ± 0.0
HisPhe: 0.334 ± 0.469
HisGly: 1.668 ± 0.433
HisHis: 0.667 ± 0.336
HisIle: 2.336 ± 0.733
HisLys: 0.334 ± 0.469
HisLeu: 4.004 ± 1.381
HisMet: 1.668 ± 0.433
HisAsn: 1.335 ± 1.238
HisPro: 1.335 ± 1.238
HisGln: 0.667 ± 0.3
HisArg: 1.335 ± 0.036
HisSer: 0.667 ± 0.3
HisThr: 2.002 ± 0.372
HisVal: 3.003 ± 0.24
HisTrp: 0.334 ± 0.168
HisTyr: 1.335 ± 1.875
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.672 ± 0.962
IleCys: 1.001 ± 0.505
IleAsp: 4.004 ± 0.108
IleGlu: 4.671 ± 2.103
IlePhe: 2.336 ± 0.733
IleGly: 2.336 ± 1.177
IleHis: 1.668 ± 0.841
IleIle: 6.673 ± 1.453
IleLys: 2.336 ± 0.733
IleLeu: 5.339 ± 1.13
IleMet: 0.667 ± 0.336
IleAsn: 4.671 ± 0.83
IlePro: 4.338 ± 0.913
IleGln: 3.003 ± 1.514
IleArg: 2.336 ± 1.177
IleSer: 5.672 ± 1.585
IleThr: 4.338 ± 0.913
IleVal: 7.674 ± 0.59
IleTrp: 1.001 ± 0.505
IleTyr: 3.337 ± 0.229
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.335 ± 0.036
LysCys: 0.667 ± 0.3
LysAsp: 3.003 ± 0.397
LysGlu: 2.669 ± 0.072
LysPhe: 1.668 ± 0.433
LysGly: 1.668 ± 0.841
LysHis: 2.002 ± 2.175
LysIle: 3.337 ± 0.865
LysLys: 4.004 ± 3.076
LysLeu: 3.67 ± 0.697
LysMet: 1.668 ± 0.433
LysAsn: 3.003 ± 0.24
LysPro: 2.002 ± 1.538
LysGln: 1.668 ± 0.433
LysArg: 2.336 ± 1.37
LysSer: 4.004 ± 0.529
LysThr: 2.002 ± 0.265
LysVal: 3.337 ± 0.408
LysTrp: 0.0 ± 0.0
LysTyr: 2.336 ± 0.096
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.339 ± 0.144
LeuCys: 1.335 ± 0.036
LeuAsp: 6.673 ± 0.457
LeuGlu: 4.671 ± 1.081
LeuPhe: 3.337 ± 0.865
LeuGly: 2.669 ± 0.072
LeuHis: 2.336 ± 2.007
LeuIle: 5.672 ± 0.325
LeuLys: 5.005 ± 1.935
LeuLeu: 9.676 ± 2.128
LeuMet: 1.001 ± 0.505
LeuAsn: 5.339 ± 0.493
LeuPro: 2.336 ± 0.733
LeuGln: 5.339 ± 0.144
LeuArg: 4.671 ± 1.081
LeuSer: 7.674 ± 1.863
LeuThr: 3.67 ± 0.697
LeuVal: 4.338 ± 0.998
LeuTrp: 1.001 ± 0.132
LeuTyr: 3.003 ± 1.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.002 ± 0.265
MetCys: 0.667 ± 0.336
MetAsp: 2.669 ± 0.709
MetGlu: 3.003 ± 0.877
MetPhe: 1.001 ± 0.769
MetGly: 0.667 ± 0.336
MetHis: 1.001 ± 1.406
MetIle: 1.001 ± 0.505
MetLys: 1.335 ± 1.238
MetLeu: 1.001 ± 0.132
MetMet: 0.0 ± 0.0
MetAsn: 0.667 ± 0.336
MetPro: 1.335 ± 0.036
MetGln: 1.335 ± 0.601
MetArg: 2.669 ± 0.709
MetSer: 2.669 ± 0.709
MetThr: 1.001 ± 0.505
MetVal: 0.667 ± 0.3
MetTrp: 1.001 ± 0.505
MetTyr: 1.001 ± 0.132
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.339 ± 1.13
AsnCys: 0.667 ± 0.336
AsnAsp: 2.336 ± 0.733
AsnGlu: 3.003 ± 1.514
AsnPhe: 3.003 ± 0.877
AsnGly: 2.669 ± 0.565
AsnHis: 1.668 ± 0.204
AsnIle: 4.338 ± 0.276
AsnLys: 2.002 ± 0.901
AsnLeu: 3.337 ± 0.408
AsnMet: 2.336 ± 0.415
AsnAsn: 3.003 ± 0.397
AsnPro: 4.004 ± 0.108
AsnGln: 2.002 ± 1.009
AsnArg: 2.336 ± 0.54
AsnSer: 4.671 ± 1.081
AsnThr: 7.674 ± 0.047
AsnVal: 3.003 ± 0.24
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.003 ± 0.877
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.668 ± 1.07
ProCys: 1.001 ± 0.505
ProAsp: 1.668 ± 0.841
ProGlu: 3.003 ± 1.514
ProPhe: 3.003 ± 1.514
ProGly: 2.669 ± 0.565
ProHis: 1.668 ± 0.433
ProIle: 2.002 ± 0.265
ProLys: 1.001 ± 0.132
ProLeu: 8.342 ± 0.89
ProMet: 0.334 ± 0.168
ProAsn: 4.004 ± 0.745
ProPro: 2.669 ± 0.709
ProGln: 2.002 ± 0.372
ProArg: 1.335 ± 0.036
ProSer: 3.67 ± 1.971
ProThr: 5.339 ± 0.493
ProVal: 3.337 ± 0.408
ProTrp: 0.0 ± 0.0
ProTyr: 3.67 ± 0.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.004 ± 2.018
GlnCys: 1.001 ± 0.132
GlnAsp: 1.001 ± 0.132
GlnGlu: 1.668 ± 0.433
GlnPhe: 1.335 ± 0.036
GlnGly: 0.667 ± 0.3
GlnHis: 0.667 ± 0.3
GlnIle: 3.67 ± 1.85
GlnLys: 1.668 ± 0.433
GlnLeu: 3.003 ± 0.397
GlnMet: 1.001 ± 0.505
GlnAsn: 2.669 ± 0.565
GlnPro: 3.67 ± 0.576
GlnGln: 1.668 ± 0.433
GlnArg: 3.337 ± 1.045
GlnSer: 3.337 ± 1.045
GlnThr: 4.004 ± 0.108
GlnVal: 0.334 ± 0.168
GlnTrp: 0.667 ± 0.937
GlnTyr: 0.667 ± 0.336
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.003 ± 1.514
ArgCys: 1.001 ± 0.505
ArgAsp: 3.003 ± 0.877
ArgGlu: 1.668 ± 0.841
ArgPhe: 2.336 ± 0.54
ArgGly: 3.003 ± 0.877
ArgHis: 1.335 ± 0.601
ArgIle: 2.669 ± 0.072
ArgLys: 1.668 ± 0.433
ArgLeu: 2.669 ± 1.345
ArgMet: 2.669 ± 1.345
ArgAsn: 3.67 ± 0.576
ArgPro: 3.67 ± 0.576
ArgGln: 1.335 ± 0.673
ArgArg: 3.003 ± 0.877
ArgSer: 2.669 ± 0.565
ArgThr: 3.67 ± 1.213
ArgVal: 3.67 ± 0.06
ArgTrp: 1.335 ± 0.036
ArgTyr: 2.336 ± 0.54
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.67 ± 0.576
SerCys: 0.667 ± 0.3
SerAsp: 3.337 ± 0.229
SerGlu: 4.338 ± 0.361
SerPhe: 2.669 ± 0.709
SerGly: 5.339 ± 0.78
SerHis: 1.335 ± 0.601
SerIle: 5.672 ± 0.325
SerLys: 3.67 ± 0.06
SerLeu: 5.672 ± 0.962
SerMet: 1.335 ± 0.036
SerAsn: 1.668 ± 0.204
SerPro: 3.337 ± 1.502
SerGln: 2.669 ± 0.709
SerArg: 3.67 ± 0.576
SerSer: 4.004 ± 2.018
SerThr: 5.672 ± 0.949
SerVal: 6.34 ± 0.625
SerTrp: 1.668 ± 0.204
SerTyr: 4.338 ± 0.913
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.005 ± 0.025
ThrCys: 0.334 ± 0.168
ThrAsp: 4.004 ± 0.529
ThrGlu: 2.336 ± 0.54
ThrPhe: 2.669 ± 0.565
ThrGly: 5.339 ± 1.417
ThrHis: 2.336 ± 0.096
ThrIle: 6.673 ± 0.816
ThrLys: 3.337 ± 0.408
ThrLeu: 3.337 ± 0.229
ThrMet: 1.668 ± 0.433
ThrAsn: 4.671 ± 1.718
ThrPro: 4.671 ± 1.081
ThrGln: 4.004 ± 1.803
ThrArg: 4.671 ± 0.444
ThrSer: 6.006 ± 0.157
ThrThr: 5.672 ± 0.962
ThrVal: 4.671 ± 2.103
ThrTrp: 0.667 ± 0.336
ThrTyr: 2.336 ± 1.177
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.671 ± 0.83
ValCys: 0.667 ± 0.3
ValAsp: 3.67 ± 1.971
ValGlu: 4.338 ± 0.276
ValPhe: 2.002 ± 0.265
ValGly: 3.003 ± 1.034
ValHis: 0.334 ± 0.168
ValIle: 6.006 ± 2.39
ValLys: 3.003 ± 1.67
ValLeu: 5.672 ± 0.312
ValMet: 1.335 ± 0.036
ValAsn: 4.338 ± 2.271
ValPro: 4.004 ± 0.745
ValGln: 3.003 ± 0.24
ValArg: 4.338 ± 0.913
ValSer: 5.005 ± 0.612
ValThr: 3.003 ± 1.034
ValVal: 2.336 ± 0.096
ValTrp: 1.001 ± 0.132
ValTyr: 2.002 ± 0.265
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.667 ± 0.3
TrpCys: 0.0 ± 0.0
TrpAsp: 1.335 ± 0.601
TrpGlu: 0.334 ± 0.168
TrpPhe: 0.334 ± 0.168
TrpGly: 1.001 ± 0.769
TrpHis: 0.334 ± 0.168
TrpIle: 0.0 ± 0.0
TrpLys: 1.335 ± 0.036
TrpLeu: 2.002 ± 0.372
TrpMet: 0.667 ± 0.336
TrpAsn: 0.667 ± 0.336
TrpPro: 0.334 ± 0.168
TrpGln: 0.334 ± 0.168
TrpArg: 0.667 ± 0.336
TrpSer: 2.669 ± 0.709
TrpThr: 1.335 ± 0.601
TrpVal: 0.334 ± 0.469
TrpTrp: 0.667 ± 0.336
TrpTyr: 0.667 ± 0.3
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.004 ± 0.108
TyrCys: 0.667 ± 0.336
TyrAsp: 2.669 ± 0.709
TyrGlu: 2.002 ± 0.265
TyrPhe: 2.669 ± 0.565
TyrGly: 3.337 ± 0.229
TyrHis: 1.668 ± 0.204
TyrIle: 3.003 ± 1.67
TyrLys: 2.669 ± 0.565
TyrLeu: 2.669 ± 0.565
TyrMet: 1.668 ± 1.07
TyrAsn: 1.335 ± 0.601
TyrPro: 1.668 ± 0.433
TyrGln: 2.002 ± 0.372
TyrArg: 2.669 ± 0.709
TyrSer: 2.669 ± 0.072
TyrThr: 3.003 ± 1.514
TyrVal: 2.002 ± 1.009
TyrTrp: 0.334 ± 0.469
TyrTyr: 2.002 ± 0.901
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2998 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
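One way to read "bootstrapping (x100) at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not a description of the database's actual code; the function names, the fixed seed, and the use of the population standard deviation are all choices made for this illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome of two hypothetical proteins with very different composition.
toy_error = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins, as in this proteome, each resample can take just a handful of distinct compositions, so error estimates obtained this way are coarse.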

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski