Amino acid dipeptide frequency for Hubei picorna-like virus 15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
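The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_permille`, the helper `label`, and the toy sequences are hypothetical, pairs are assumed to be counted as overlapping windows within each protein, and the over/under threshold is simply the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein and
    pooled over the whole proteome.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Any letter outside the 20 standard ones is pooled as X.
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def label(freq_permille):
    """Classify a frequency against the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


# Hypothetical toy proteome, not the viral proteins.
toy = ["ACAC", "CA"]
freqs = dipeptide_permille(toy)
print(freqs["AC"], label(freqs["AC"]))
print(freqs["XX"], label(freqs["XX"]))
```

The dictionary always holds all 441 keys, so absent dipeptides appear with a frequency of 0.0, as they do in the table.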

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
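As a small worked example of this unit conversion, the snippet below turns two per mille values taken from the table into fractions and percentages, so that they can be compared with the 0.25% uniform expectation. The helper names are hypothetical and exist only for this illustration.

```python
# Uniform expectation for each of the 400 standard dipeptides, in percent.
EXPECTED_PERCENT = 100.0 / 400  # 0.25 %


def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


# Values copied from the table on this page.
table_values = {"LeuSer": 9.148, "CysCys": 0.0}

for name, permille in table_values.items():
    fraction = permille_to_fraction(permille)
    percent = permille_to_percent(permille)
    relation = "above" if percent > EXPECTED_PERCENT else "below"
    print(f"{name}: {permille} per mille = {fraction:.6f} = {percent:.4f} % "
          f"({relation} the {EXPECTED_PERCENT} % expectation)")
```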

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.278 ± 1.798 | 1.759 ± 0.836 | 3.871 ± 0.621 | 2.111 ± 1.003 | 4.574 ± 0.328 | 4.926 ± 1.35 | 1.056 ± 0.502 | 5.982 ± 0.997 | 4.574 ± 2.173 | 6.334 ± 0.066 | 1.407 ± 0.669 | 3.167 ± 0.341 | 4.222 ± 1.684 | 3.167 ± 0.956 | 4.574 ± 0.287 | 4.926 ± 1.35 | 4.926 ± 1.965 | 5.278 ± 1.798 | 1.407 ± 0.561 | 2.463 ± 0.06 | 0.0 ± 0.0 |
| Cys | 0.352 ± 0.167 | 0.0 ± 0.0 | 0.704 ± 0.334 | 0.352 ± 0.448 | 1.056 ± 0.502 | 1.759 ± 0.836 | 0.352 ± 0.167 | 0.352 ± 0.167 | 0.704 ± 0.334 | 2.111 ± 1.003 | 0.0 ± 0.0 | 1.407 ± 0.669 | 1.056 ± 0.502 | 0.352 ± 0.167 | 0.352 ± 0.167 | 1.056 ± 0.502 | 1.056 ± 0.114 | 2.815 ± 0.107 | 0.0 ± 0.0 | 0.704 ± 0.334 | 0.0 ± 0.0 |
| Asp | 5.278 ± 0.663 | 2.111 ± 0.227 | 2.815 ± 0.722 | 2.815 ± 0.722 | 3.519 ± 0.788 | 2.815 ± 0.107 | 0.352 ± 0.167 | 4.222 ± 0.776 | 0.704 ± 0.334 | 3.871 ± 1.236 | 2.815 ± 0.508 | 2.463 ± 0.06 | 0.704 ± 0.334 | 0.352 ± 0.167 | 1.759 ± 0.394 | 2.463 ± 1.17 | 2.463 ± 0.675 | 6.334 ± 0.066 | 0.704 ± 0.334 | 2.815 ± 0.508 | 0.0 ± 0.0 |
| Glu | 4.926 ± 0.735 | 0.704 ± 0.334 | 1.407 ± 0.669 | 2.463 ± 0.555 | 3.871 ± 0.006 | 1.056 ± 0.114 | 1.056 ± 0.502 | 2.815 ± 0.107 | 2.111 ± 0.388 | 3.519 ± 0.788 | 2.463 ± 0.555 | 3.167 ± 1.505 | 1.056 ± 0.502 | 3.519 ± 1.057 | 2.111 ± 0.227 | 2.815 ± 0.722 | 2.111 ± 0.388 | 2.111 ± 0.227 | 0.704 ± 0.334 | 2.463 ± 0.555 | 0.0 ± 0.0 |
| Phe | 2.111 ± 0.227 | 1.056 ± 0.502 | 1.407 ± 0.054 | 3.519 ± 0.173 | 2.463 ± 0.06 | 1.759 ± 0.221 | 1.056 ± 0.729 | 4.926 ± 2.34 | 3.519 ± 0.788 | 4.574 ± 1.558 | 1.407 ± 0.657 | 2.463 ± 0.06 | 1.407 ± 0.054 | 2.463 ± 1.905 | 4.222 ± 1.069 | 4.574 ± 1.517 | 2.111 ± 0.227 | 4.222 ± 0.161 | 0.0 ± 0.0 | 3.519 ± 0.173 | 0.0 ± 0.0 |
| Gly | 4.574 ± 1.517 | 0.704 ± 0.334 | 2.463 ± 0.06 | 1.056 ± 0.729 | 1.056 ± 0.114 | 1.759 ± 1.624 | 1.056 ± 0.502 | 5.63 ± 0.83 | 2.111 ± 1.003 | 2.463 ± 0.555 | 1.407 ± 0.669 | 2.815 ± 0.508 | 1.407 ± 0.054 | 1.056 ± 0.502 | 3.167 ± 0.956 | 5.278 ± 2.413 | 3.871 ± 1.236 | 3.519 ± 0.788 | 1.056 ± 0.114 | 1.407 ± 0.054 | 0.0 ± 0.0 |
| His | 1.407 ± 0.669 | 0.704 ± 0.334 | 0.704 ± 0.334 | 1.056 ± 0.502 | 1.407 ± 0.669 | 1.056 ± 0.502 | 0.352 ± 0.167 | 1.759 ± 0.836 | 1.407 ± 0.054 | 1.056 ± 0.502 | 0.352 ± 0.167 | 1.056 ± 0.114 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.352 ± 0.167 | 2.111 ± 0.388 | 1.407 ± 0.054 | 2.111 ± 0.842 | 0.0 ± 0.0 | 1.759 ± 0.221 | 0.0 ± 0.0 |
| Ile | 5.982 ± 1.612 | 0.704 ± 0.334 | 4.926 ± 0.735 | 2.463 ± 0.06 | 2.463 ± 0.555 | 2.815 ± 0.508 | 1.056 ± 0.502 | 5.63 ± 0.4 | 4.574 ± 1.558 | 4.574 ± 1.558 | 2.463 ± 0.555 | 4.574 ± 1.558 | 3.519 ± 0.173 | 2.111 ± 0.842 | 4.574 ± 0.943 | 5.278 ± 0.568 | 7.037 ± 1.577 | 4.222 ± 2.006 | 0.704 ± 0.334 | 3.167 ± 1.571 | 0.0 ± 0.0 |
| Lys | 2.463 ± 1.17 | 0.0 ± 0.0 | 2.815 ± 0.722 | 2.111 ± 0.388 | 4.222 ± 0.776 | 1.056 ± 0.502 | 2.463 ± 1.17 | 3.519 ± 1.057 | 0.704 ± 0.334 | 5.63 ± 2.06 | 1.759 ± 0.836 | 2.815 ± 0.107 | 1.759 ± 0.221 | 2.111 ± 0.388 | 1.056 ± 0.114 | 2.815 ± 1.337 | 5.278 ± 1.278 | 1.759 ± 1.624 | 0.352 ± 0.167 | 2.463 ± 0.555 | 0.0 ± 0.0 |
| Leu | 6.334 ± 1.779 | 0.704 ± 0.334 | 5.982 ± 1.612 | 3.519 ± 0.442 | 4.222 ± 1.069 | 3.519 ± 0.442 | 1.759 ± 0.836 | 4.574 ± 0.328 | 4.574 ± 2.173 | 5.982 ± 1.463 | 1.056 ± 0.502 | 5.982 ± 0.997 | 7.741 ± 2.472 | 4.574 ± 1.558 | 3.871 ± 0.006 | 9.148 ± 1.189 | 7.037 ± 1.498 | 3.519 ± 0.173 | 0.704 ± 0.281 | 3.871 ± 1.224 | 0.0 ± 0.0 |
| Met | 1.407 ± 1.176 | 0.704 ± 0.281 | 1.759 ± 0.221 | 1.407 ± 0.669 | 1.056 ± 0.502 | 0.704 ± 0.281 | 0.704 ± 0.334 | 1.407 ± 0.669 | 0.704 ± 0.334 | 2.463 ± 0.555 | 1.056 ± 0.114 | 1.056 ± 0.114 | 0.704 ± 0.281 | 0.704 ± 0.334 | 1.056 ± 0.502 | 2.815 ± 1.337 | 2.111 ± 1.003 | 1.407 ± 0.561 | 0.352 ± 0.167 | 1.407 ± 0.561 | 0.0 ± 0.0 |
| Asn | 5.278 ± 0.663 | 2.463 ± 1.17 | 1.759 ± 0.221 | 2.463 ± 0.555 | 3.167 ± 2.186 | 1.759 ± 0.221 | 1.759 ± 0.836 | 4.574 ± 0.902 | 3.519 ± 1.057 | 5.63 ± 2.06 | 1.056 ± 0.502 | 2.815 ± 1.337 | 2.463 ± 1.17 | 2.815 ± 0.722 | 2.815 ± 0.722 | 4.222 ± 1.069 | 1.759 ± 0.836 | 3.871 ± 0.006 | 0.704 ± 0.334 | 1.407 ± 0.054 | 0.0 ± 0.0 |
| Pro | 3.167 ± 0.956 | 0.0 ± 0.0 | 0.704 ± 0.334 | 4.222 ± 0.161 | 4.222 ± 0.776 | 3.871 ± 0.621 | 1.056 ± 0.729 | 3.871 ± 1.236 | 1.759 ± 0.836 | 3.167 ± 0.956 | 0.704 ± 0.334 | 2.111 ± 0.388 | 2.463 ± 1.905 | 1.056 ± 0.502 | 3.167 ± 0.956 | 2.111 ± 0.842 | 1.759 ± 1.624 | 1.759 ± 0.221 | 0.352 ± 0.448 | 1.759 ± 1.624 | 0.0 ± 0.0 |
| Gln | 3.519 ± 0.173 | 0.704 ± 0.334 | 1.056 ± 0.502 | 1.056 ± 0.502 | 1.759 ± 0.394 | 1.759 ± 0.836 | 1.056 ± 0.502 | 2.463 ± 0.675 | 0.704 ± 0.334 | 6.334 ± 0.549 | 0.704 ± 0.334 | 1.759 ± 0.836 | 1.407 ± 1.176 | 2.111 ± 0.227 | 2.815 ± 0.722 | 0.352 ± 0.167 | 3.519 ± 0.442 | 2.463 ± 1.29 | 1.056 ± 0.729 | 3.167 ± 0.341 | 0.0 ± 0.0 |
| Arg | 2.111 ± 1.457 | 0.352 ± 0.167 | 2.815 ± 1.337 | 1.759 ± 0.836 | 4.574 ± 1.517 | 2.111 ± 0.842 | 1.407 ± 0.054 | 4.222 ± 0.161 | 2.111 ± 1.003 | 4.926 ± 0.12 | 1.056 ± 0.502 | 5.278 ± 1.278 | 2.111 ± 0.388 | 2.815 ± 0.508 | 1.056 ± 0.502 | 4.574 ± 0.328 | 2.815 ± 1.123 | 4.222 ± 0.161 | 0.0 ± 0.0 | 1.407 ± 1.176 | 0.0 ± 0.0 |
| Ser | 5.63 ± 0.215 | 0.0 ± 0.0 | 4.574 ± 2.747 | 2.815 ± 0.508 | 3.167 ± 1.571 | 3.871 ± 1.236 | 2.111 ± 0.842 | 6.685 ± 0.514 | 3.871 ± 0.621 | 8.093 ± 0.77 | 1.407 ± 0.054 | 2.815 ± 1.337 | 2.815 ± 1.738 | 3.167 ± 0.341 | 3.519 ± 1.057 | 5.278 ± 1.183 | 7.389 ± 4.485 | 6.685 ± 1.946 | 0.704 ± 0.281 | 1.759 ± 0.394 | 0.0 ± 0.0 |
| Thr | 8.445 ± 3.983 | 1.056 ± 0.114 | 2.815 ± 0.107 | 4.222 ± 0.454 | 1.407 ± 0.054 | 4.222 ± 2.299 | 0.704 ± 0.334 | 2.815 ± 0.508 | 3.167 ± 0.341 | 6.685 ± 1.946 | 2.111 ± 0.207 | 4.926 ± 0.735 | 2.815 ± 0.722 | 2.815 ± 0.107 | 3.167 ± 0.341 | 4.926 ± 2.58 | 4.926 ± 1.965 | 4.574 ± 0.902 | 0.0 ± 0.0 | 2.111 ± 0.388 | 0.0 ± 0.0 |
| Val | 4.926 ± 0.735 | 1.056 ± 0.502 | 4.926 ± 1.965 | 4.926 ± 1.11 | 2.463 ± 0.555 | 4.926 ± 0.735 | 0.0 ± 0.0 | 4.222 ± 0.776 | 4.222 ± 0.776 | 5.63 ± 1.63 | 0.352 ± 0.448 | 4.574 ± 0.287 | 2.463 ± 1.29 | 1.759 ± 0.394 | 4.926 ± 0.495 | 5.982 ± 0.848 | 3.871 ± 0.006 | 3.871 ± 0.006 | 0.0 ± 0.0 | 2.815 ± 0.508 | 0.0 ± 0.0 |
| Trp | 0.352 ± 0.167 | 0.0 ± 0.0 | 0.704 ± 0.281 | 0.704 ± 0.334 | 0.704 ± 0.334 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.704 ± 0.281 | 0.352 ± 0.167 | 1.407 ± 0.054 | 0.0 ± 0.0 | 0.352 ± 0.167 | 0.352 ± 0.448 | 0.352 ± 0.167 | 0.704 ± 0.281 | 1.056 ± 0.729 | 0.352 ± 0.167 | 1.056 ± 0.114 | 0.0 ± 0.0 | 0.352 ± 0.167 | 0.0 ± 0.0 |
| Tyr | 2.463 ± 0.675 | 1.759 ± 0.836 | 3.167 ± 0.275 | 1.407 ± 0.669 | 1.407 ± 0.054 | 2.111 ± 0.388 | 0.704 ± 0.334 | 2.463 ± 0.555 | 1.759 ± 0.221 | 4.222 ± 0.161 | 1.056 ± 0.729 | 0.704 ± 0.334 | 2.815 ± 1.123 | 2.463 ± 0.555 | 2.463 ± 0.675 | 4.574 ± 2.747 | 2.463 ± 1.29 | 2.111 ± 0.842 | 0.704 ± 0.334 | 1.759 ± 0.394 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (2843 amino acids)

Note: The error was estimated by bootstrapping (100 rounds) at the protein level.
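A protein-level bootstrap of this kind can be sketched as follows. The page does not state which estimator is reported, so this sketch assumes the error is the sample standard deviation of the dipeptide frequency across 100 resampled proteomes; the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping residue pairs, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency under protein-level resampling.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein rather than the individual residue pair.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(replicates)


# Hypothetical toy proteome with two short sequences.
toy = ["MKTAYIAKQR", "MAAAKAAAGL"]
err = bootstrap_error(toy, "AA")
print(f"AA: {dipeptide_permille(toy)['AA']:.3f} ± {err:.3f} per mille")
```

With only two proteins, each resampled proteome is one of just a few possible combinations, so the estimated errors are necessarily coarse.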

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski