Amino acid dipepetide frequency for Hubei picorna-like virus 22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.945AlaAla: 5.945 ± 1.257
0.793AlaCys: 0.793 ± 0.21
3.567AlaAsp: 3.567 ± 2.554
3.964AlaGlu: 3.964 ± 0.876
2.774AlaPhe: 2.774 ± 0.228
5.153AlaGly: 5.153 ± 1.047
0.793AlaHis: 0.793 ± 0.21
5.153AlaIle: 5.153 ± 0.239
5.153AlaLys: 5.153 ± 0.404
9.909AlaLeu: 9.909 ± 0.905
0.793AlaMet: 0.793 ± 0.432
3.171AlaAsn: 3.171 ± 0.842
3.567AlaPro: 3.567 ± 0.626
4.756AlaGln: 4.756 ± 1.263
3.171AlaArg: 3.171 ± 0.199
2.774AlaSer: 2.774 ± 1.701
6.342AlaThr: 6.342 ± 1.684
6.342AlaVal: 6.342 ± 0.245
0.396AlaTrp: 0.396 ± 0.216
3.567AlaTyr: 3.567 ± 0.017
0.0AlaXaa: 0.0 ± 0.0
Cys
1.189CysAla: 1.189 ± 0.006
0.396CysCys: 0.396 ± 0.216
1.585CysAsp: 1.585 ± 0.421
1.189CysGlu: 1.189 ± 0.637
1.189CysPhe: 1.189 ± 0.637
1.982CysGly: 1.982 ± 0.438
0.0CysHis: 0.0 ± 0.0
0.793CysIle: 0.793 ± 0.853
0.793CysLys: 0.793 ± 0.432
0.396CysLeu: 0.396 ± 0.216
1.189CysMet: 1.189 ± 0.649
0.0CysAsn: 0.0 ± 0.0
1.189CysPro: 1.189 ± 0.637
0.0CysGln: 0.0 ± 0.0
1.189CysArg: 1.189 ± 0.649
0.793CysSer: 0.793 ± 0.432
0.0CysThr: 0.0 ± 0.0
0.396CysVal: 0.396 ± 0.216
0.0CysTrp: 0.0 ± 0.0
0.793CysTyr: 0.793 ± 0.432
0.0CysXaa: 0.0 ± 0.0
Asp
3.567AspAla: 3.567 ± 1.303
0.793AspCys: 0.793 ± 0.432
5.549AspAsp: 5.549 ± 2.384
4.756AspGlu: 4.756 ± 0.62
2.774AspPhe: 2.774 ± 0.415
2.378AspGly: 2.378 ± 0.654
0.793AspHis: 0.793 ± 0.432
3.171AspIle: 3.171 ± 0.842
2.378AspLys: 2.378 ± 0.631
4.756AspLeu: 4.756 ± 1.263
0.793AspMet: 0.793 ± 0.432
0.793AspAsn: 0.793 ± 0.432
2.378AspPro: 2.378 ± 0.631
3.567AspGln: 3.567 ± 1.269
1.982AspArg: 1.982 ± 0.205
5.549AspSer: 5.549 ± 0.455
3.171AspThr: 3.171 ± 0.199
3.567AspVal: 3.567 ± 0.017
0.793AspTrp: 0.793 ± 0.21
1.982AspTyr: 1.982 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
2.378GluAla: 2.378 ± 0.011
0.793GluCys: 0.793 ± 0.432
4.756GluAsp: 4.756 ± 0.62
3.171GluGlu: 3.171 ± 1.087
2.774GluPhe: 2.774 ± 0.415
3.171GluGly: 3.171 ± 0.444
1.982GluHis: 1.982 ± 0.848
5.153GluIle: 5.153 ± 2.168
3.964GluLys: 3.964 ± 1.519
5.549GluLeu: 5.549 ± 0.188
1.982GluMet: 1.982 ± 0.438
3.171GluAsn: 3.171 ± 1.087
1.982GluPro: 1.982 ± 0.438
1.585GluGln: 1.585 ± 0.222
1.982GluArg: 1.982 ± 1.081
1.982GluSer: 1.982 ± 1.081
4.36GluThr: 4.36 ± 1.093
7.134GluVal: 7.134 ± 0.677
1.189GluTrp: 1.189 ± 0.649
2.774GluTyr: 2.774 ± 1.513
0.0GluXaa: 0.0 ± 0.0
Phe
2.378PheAla: 2.378 ± 0.654
1.189PheCys: 1.189 ± 0.637
1.982PheAsp: 1.982 ± 0.438
1.982PheGlu: 1.982 ± 1.081
2.774PhePhe: 2.774 ± 0.871
4.36PheGly: 4.36 ± 1.093
1.585PheHis: 1.585 ± 0.421
1.982PheIle: 1.982 ± 0.438
1.189PheLys: 1.189 ± 0.637
3.964PheLeu: 3.964 ± 2.162
1.982PheMet: 1.982 ± 0.205
2.378PheAsn: 2.378 ± 1.274
1.982PhePro: 1.982 ± 1.081
1.189PheGln: 1.189 ± 0.649
3.964PheArg: 3.964 ± 1.695
3.964PheSer: 3.964 ± 0.233
3.171PheThr: 3.171 ± 3.413
5.153PheVal: 5.153 ± 1.69
0.396PheTrp: 0.396 ± 0.427
1.585PheTyr: 1.585 ± 0.865
0.0PheXaa: 0.0 ± 0.0
Gly
3.964GlyAla: 3.964 ± 0.233
1.189GlyCys: 1.189 ± 0.006
4.36GlyAsp: 4.36 ± 0.193
4.36GlyGlu: 4.36 ± 1.093
3.567GlyPhe: 3.567 ± 1.269
3.171GlyGly: 3.171 ± 1.087
0.793GlyHis: 0.793 ± 0.432
3.171GlyIle: 3.171 ± 0.199
3.567GlyLys: 3.567 ± 0.66
1.189GlyLeu: 1.189 ± 0.649
0.793GlyMet: 0.793 ± 0.432
1.982GlyAsn: 1.982 ± 0.205
2.378GlyPro: 2.378 ± 0.631
2.378GlyGln: 2.378 ± 0.631
3.171GlyArg: 3.171 ± 0.199
3.964GlySer: 3.964 ± 1.052
5.549GlyThr: 5.549 ± 0.83
3.964GlyVal: 3.964 ± 1.052
1.189GlyTrp: 1.189 ± 0.649
2.378GlyTyr: 2.378 ± 0.654
0.0GlyXaa: 0.0 ± 0.0
His
1.189HisAla: 1.189 ± 0.649
1.189HisCys: 1.189 ± 0.006
1.189HisAsp: 1.189 ± 1.28
1.585HisGlu: 1.585 ± 0.222
1.982HisPhe: 1.982 ± 1.081
1.189HisGly: 1.189 ± 0.006
0.396HisHis: 0.396 ± 0.216
0.396HisIle: 0.396 ± 0.216
0.793HisLys: 0.793 ± 0.21
0.396HisLeu: 0.396 ± 0.216
0.396HisMet: 0.396 ± 0.216
0.0HisAsn: 0.0 ± 0.0
0.396HisPro: 0.396 ± 0.216
0.793HisGln: 0.793 ± 0.21
2.378HisArg: 2.378 ± 0.654
2.378HisSer: 2.378 ± 0.654
1.189HisThr: 1.189 ± 1.28
0.396HisVal: 0.396 ± 0.427
0.0HisTrp: 0.0 ± 0.0
0.396HisTyr: 0.396 ± 0.216
0.0HisXaa: 0.0 ± 0.0
Ile
3.567IleAla: 3.567 ± 1.269
1.189IleCys: 1.189 ± 0.649
3.567IleAsp: 3.567 ± 0.66
2.774IleGlu: 2.774 ± 0.871
2.378IlePhe: 2.378 ± 0.654
3.171IleGly: 3.171 ± 0.842
0.0IleHis: 0.0 ± 0.0
2.378IleIle: 2.378 ± 0.654
1.189IleLys: 1.189 ± 0.649
3.964IleLeu: 3.964 ± 0.876
0.793IleMet: 0.793 ± 0.432
2.378IleAsn: 2.378 ± 0.011
4.36IlePro: 4.36 ± 0.193
1.982IleGln: 1.982 ± 0.438
5.153IleArg: 5.153 ± 1.525
4.756IleSer: 4.756 ± 0.023
4.756IleThr: 4.756 ± 2.549
2.378IleVal: 2.378 ± 0.631
0.793IleTrp: 0.793 ± 0.432
1.585IleTyr: 1.585 ± 0.421
0.0IleXaa: 0.0 ± 0.0
Lys
6.342LysAla: 6.342 ± 0.888
0.793LysCys: 0.793 ± 0.432
1.982LysAsp: 1.982 ± 0.205
3.964LysGlu: 3.964 ± 0.876
1.585LysPhe: 1.585 ± 0.421
3.964LysGly: 3.964 ± 0.876
0.396LysHis: 0.396 ± 0.216
7.531LysIle: 7.531 ± 0.251
3.171LysLys: 3.171 ± 1.087
3.567LysLeu: 3.567 ± 0.66
0.793LysMet: 0.793 ± 0.21
2.378LysAsn: 2.378 ± 0.631
1.982LysPro: 1.982 ± 0.205
1.189LysGln: 1.189 ± 0.649
2.774LysArg: 2.774 ± 0.871
3.567LysSer: 3.567 ± 0.66
2.378LysThr: 2.378 ± 0.654
1.982LysVal: 1.982 ± 0.438
1.189LysTrp: 1.189 ± 0.006
1.189LysTyr: 1.189 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
5.945LeuAla: 5.945 ± 0.614
1.585LeuCys: 1.585 ± 0.222
3.964LeuAsp: 3.964 ± 1.519
5.153LeuGlu: 5.153 ± 0.882
2.378LeuPhe: 2.378 ± 0.631
2.378LeuGly: 2.378 ± 0.011
1.982LeuHis: 1.982 ± 1.081
3.567LeuIle: 3.567 ± 0.017
3.964LeuLys: 3.964 ± 1.519
3.567LeuLeu: 3.567 ± 1.946
3.964LeuMet: 3.964 ± 0.876
3.171LeuAsn: 3.171 ± 0.199
5.153LeuPro: 5.153 ± 0.404
2.774LeuGln: 2.774 ± 0.415
4.756LeuArg: 4.756 ± 0.023
5.153LeuSer: 5.153 ± 0.404
6.738LeuThr: 6.738 ± 1.468
5.153LeuVal: 5.153 ± 0.239
0.793LeuTrp: 0.793 ± 0.432
1.982LeuTyr: 1.982 ± 0.438
0.0LeuXaa: 0.0 ± 0.0
Met
3.171MetAla: 3.171 ± 0.444
0.396MetCys: 0.396 ± 0.216
1.982MetAsp: 1.982 ± 1.081
1.189MetGlu: 1.189 ± 0.649
0.0MetPhe: 0.0 ± 0.0
1.189MetGly: 1.189 ± 0.649
0.793MetHis: 0.793 ± 0.21
0.793MetIle: 0.793 ± 0.21
1.585MetLys: 1.585 ± 0.865
1.585MetLeu: 1.585 ± 0.421
0.396MetMet: 0.396 ± 0.216
1.982MetAsn: 1.982 ± 0.205
0.793MetPro: 0.793 ± 0.432
0.793MetGln: 0.793 ± 0.21
0.793MetArg: 0.793 ± 0.432
2.378MetSer: 2.378 ± 0.631
3.567MetThr: 3.567 ± 1.946
1.982MetVal: 1.982 ± 0.205
0.793MetTrp: 0.793 ± 0.21
2.378MetTyr: 2.378 ± 0.654
0.0MetXaa: 0.0 ± 0.0
Asn
3.964AsnAla: 3.964 ± 2.981
0.396AsnCys: 0.396 ± 0.427
1.585AsnAsp: 1.585 ± 0.421
3.567AsnGlu: 3.567 ± 1.946
1.189AsnPhe: 1.189 ± 0.006
2.378AsnGly: 2.378 ± 0.631
1.189AsnHis: 1.189 ± 0.649
2.774AsnIle: 2.774 ± 1.058
1.585AsnLys: 1.585 ± 0.222
2.378AsnLeu: 2.378 ± 0.011
1.189AsnMet: 1.189 ± 0.006
2.378AsnAsn: 2.378 ± 0.011
3.567AsnPro: 3.567 ± 0.626
1.585AsnGln: 1.585 ± 0.222
3.567AsnArg: 3.567 ± 0.017
2.774AsnSer: 2.774 ± 0.415
1.585AsnThr: 1.585 ± 1.064
1.982AsnVal: 1.982 ± 0.438
0.793AsnTrp: 0.793 ± 0.432
1.982AsnTyr: 1.982 ± 1.491
0.0AsnXaa: 0.0 ± 0.0
Pro
2.378ProAla: 2.378 ± 0.631
0.0ProCys: 0.0 ± 0.0
2.378ProAsp: 2.378 ± 1.274
4.36ProGlu: 4.36 ± 0.193
0.793ProPhe: 0.793 ± 0.432
2.378ProGly: 2.378 ± 0.654
0.793ProHis: 0.793 ± 0.21
2.378ProIle: 2.378 ± 0.631
2.774ProLys: 2.774 ± 0.871
3.964ProLeu: 3.964 ± 0.409
1.585ProMet: 1.585 ± 0.865
2.378ProAsn: 2.378 ± 0.631
2.774ProPro: 2.774 ± 0.228
2.378ProGln: 2.378 ± 0.631
3.567ProArg: 3.567 ± 1.303
5.945ProSer: 5.945 ± 0.614
4.36ProThr: 4.36 ± 0.836
3.964ProVal: 3.964 ± 1.052
0.793ProTrp: 0.793 ± 0.21
3.567ProTyr: 3.567 ± 1.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.567GlnAla: 3.567 ± 0.017
0.396GlnCys: 0.396 ± 0.427
1.585GlnAsp: 1.585 ± 0.222
0.793GlnGlu: 0.793 ± 0.21
0.793GlnPhe: 0.793 ± 0.432
2.774GlnGly: 2.774 ± 1.058
1.189GlnHis: 1.189 ± 0.637
1.982GlnIle: 1.982 ± 0.205
1.982GlnLys: 1.982 ± 1.081
1.585GlnLeu: 1.585 ± 0.421
1.189GlnMet: 1.189 ± 0.006
1.585GlnAsn: 1.585 ± 0.421
2.774GlnPro: 2.774 ± 0.415
2.378GlnGln: 2.378 ± 0.011
2.774GlnArg: 2.774 ± 0.871
2.378GlnSer: 2.378 ± 0.654
1.585GlnThr: 1.585 ± 0.222
1.585GlnVal: 1.585 ± 0.222
0.396GlnTrp: 0.396 ± 0.427
2.378GlnTyr: 2.378 ± 0.631
0.0GlnXaa: 0.0 ± 0.0
Arg
7.531ArgAla: 7.531 ± 1.035
0.0ArgCys: 0.0 ± 0.0
1.585ArgAsp: 1.585 ± 0.865
3.964ArgGlu: 3.964 ± 1.519
4.36ArgPhe: 4.36 ± 0.193
1.982ArgGly: 1.982 ± 0.438
1.189ArgHis: 1.189 ± 0.649
3.171ArgIle: 3.171 ± 1.087
3.964ArgLys: 3.964 ± 1.519
3.567ArgLeu: 3.567 ± 0.017
1.982ArgMet: 1.982 ± 0.205
1.585ArgAsn: 1.585 ± 0.421
2.774ArgPro: 2.774 ± 1.058
0.793ArgGln: 0.793 ± 0.432
2.774ArgArg: 2.774 ± 1.513
4.756ArgSer: 4.756 ± 1.309
4.756ArgThr: 4.756 ± 1.309
3.964ArgVal: 3.964 ± 0.409
0.793ArgTrp: 0.793 ± 0.21
1.585ArgTyr: 1.585 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
5.945SerAla: 5.945 ± 2.543
0.793SerCys: 0.793 ± 0.432
2.378SerAsp: 2.378 ± 0.011
3.964SerGlu: 3.964 ± 1.519
3.964SerPhe: 3.964 ± 0.409
5.945SerGly: 5.945 ± 1.9
1.189SerHis: 1.189 ± 0.006
1.189SerIle: 1.189 ± 0.006
3.171SerLys: 3.171 ± 0.199
5.549SerLeu: 5.549 ± 0.455
1.982SerMet: 1.982 ± 0.137
4.36SerAsn: 4.36 ± 1.479
3.964SerPro: 3.964 ± 1.519
3.964SerGln: 3.964 ± 0.233
4.36SerArg: 4.36 ± 0.193
2.774SerSer: 2.774 ± 0.228
4.756SerThr: 4.756 ± 2.549
5.549SerVal: 5.549 ± 0.188
1.585SerTrp: 1.585 ± 0.222
3.964SerTyr: 3.964 ± 1.052
0.0SerXaa: 0.0 ± 0.0
Thr
3.171ThrAla: 3.171 ± 0.842
1.189ThrCys: 1.189 ± 0.637
3.964ThrAsp: 3.964 ± 0.409
3.171ThrGlu: 3.171 ± 0.444
5.945ThrPhe: 5.945 ± 0.614
3.567ThrGly: 3.567 ± 0.017
0.793ThrHis: 0.793 ± 0.21
3.964ThrIle: 3.964 ± 0.409
5.153ThrLys: 5.153 ± 1.047
9.116ThrLeu: 9.116 ± 1.456
1.189ThrMet: 1.189 ± 0.176
2.774ThrAsn: 2.774 ± 1.058
4.36ThrPro: 4.36 ± 1.479
1.982ThrGln: 1.982 ± 0.205
3.171ThrArg: 3.171 ± 1.73
4.36ThrSer: 4.36 ± 2.122
5.153ThrThr: 5.153 ± 1.69
3.964ThrVal: 3.964 ± 1.052
0.396ThrTrp: 0.396 ± 0.427
3.171ThrTyr: 3.171 ± 0.842
0.0ThrXaa: 0.0 ± 0.0
Val
6.738ValAla: 6.738 ± 2.753
1.189ValCys: 1.189 ± 0.006
3.964ValAsp: 3.964 ± 0.409
3.964ValGlu: 3.964 ± 0.233
4.36ValPhe: 4.36 ± 0.193
2.774ValGly: 2.774 ± 1.058
1.189ValHis: 1.189 ± 0.006
0.396ValIle: 0.396 ± 0.216
3.964ValLys: 3.964 ± 0.876
5.153ValLeu: 5.153 ± 1.525
3.567ValMet: 3.567 ± 1.303
3.171ValAsn: 3.171 ± 0.444
5.945ValPro: 5.945 ± 1.9
0.396ValGln: 0.396 ± 0.216
2.774ValArg: 2.774 ± 0.228
7.927ValSer: 7.927 ± 4.033
3.964ValThr: 3.964 ± 0.409
5.945ValVal: 5.945 ± 0.672
1.585ValTrp: 1.585 ± 0.222
0.793ValTyr: 0.793 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
1.189TrpAla: 1.189 ± 0.006
0.0TrpCys: 0.0 ± 0.0
1.585TrpAsp: 1.585 ± 0.865
0.396TrpGlu: 0.396 ± 0.216
0.793TrpPhe: 0.793 ± 0.21
0.396TrpGly: 0.396 ± 0.216
0.396TrpHis: 0.396 ± 0.216
0.396TrpIle: 0.396 ± 0.216
1.585TrpLys: 1.585 ± 0.421
0.793TrpLeu: 0.793 ± 0.21
0.793TrpMet: 0.793 ± 0.432
1.189TrpAsn: 1.189 ± 0.006
0.396TrpPro: 0.396 ± 0.216
0.793TrpGln: 0.793 ± 0.432
0.793TrpArg: 0.793 ± 0.21
0.396TrpSer: 0.396 ± 0.216
1.189TrpThr: 1.189 ± 0.637
1.189TrpVal: 1.189 ± 0.637
0.0TrpTrp: 0.0 ± 0.0
0.793TrpTyr: 0.793 ± 0.432
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.964TyrAla: 3.964 ± 0.876
0.793TyrCys: 0.793 ± 0.21
1.982TyrAsp: 1.982 ± 0.438
3.567TyrGlu: 3.567 ± 0.017
2.774TyrPhe: 2.774 ± 0.871
2.774TyrGly: 2.774 ± 0.415
1.189TyrHis: 1.189 ± 0.006
2.774TyrIle: 2.774 ± 0.871
0.793TyrLys: 0.793 ± 0.21
3.171TyrLeu: 3.171 ± 0.199
0.793TyrMet: 0.793 ± 0.853
1.585TyrAsn: 1.585 ± 0.222
0.793TyrPro: 0.793 ± 0.432
0.396TyrGln: 0.396 ± 0.216
1.982TyrArg: 1.982 ± 1.491
2.774TyrSer: 2.774 ± 1.058
2.378TyrThr: 2.378 ± 1.297
3.171TyrVal: 3.171 ± 0.199
1.189TyrTrp: 1.189 ± 0.006
1.982TyrTyr: 1.982 ± 0.438
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2524 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski