Amino acid dipepetide frequency for Hubei picorna-like virus 51

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.142AlaAla: 3.142 ± 1.478
0.786AlaCys: 0.786 ± 0.392
2.749AlaAsp: 2.749 ± 1.065
1.571AlaGlu: 1.571 ± 0.785
3.142AlaPhe: 3.142 ± 0.868
2.749AlaGly: 2.749 ± 0.455
0.786AlaHis: 0.786 ± 0.217
5.892AlaIle: 5.892 ± 0.505
5.499AlaLys: 5.499 ± 0.301
5.499AlaLeu: 5.499 ± 0.301
0.393AlaMet: 0.393 ± 0.196
3.142AlaAsn: 3.142 ± 0.35
1.571AlaPro: 1.571 ± 0.175
2.749AlaGln: 2.749 ± 0.455
1.571AlaArg: 1.571 ± 0.434
3.142AlaSer: 3.142 ± 0.35
5.499AlaThr: 5.499 ± 1.52
6.284AlaVal: 6.284 ± 0.518
0.786AlaTrp: 0.786 ± 0.217
2.749AlaTyr: 2.749 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.178CysAla: 1.178 ± 0.588
0.0CysCys: 0.0 ± 0.0
1.571CysAsp: 1.571 ± 0.785
0.786CysGlu: 0.786 ± 0.392
1.178CysPhe: 1.178 ± 0.63
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.393CysIle: 0.393 ± 0.413
1.571CysLys: 1.571 ± 0.175
1.571CysLeu: 1.571 ± 0.785
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.393CysPro: 0.393 ± 0.196
1.571CysGln: 1.571 ± 0.175
1.178CysArg: 1.178 ± 0.588
1.178CysSer: 1.178 ± 0.021
0.393CysThr: 0.393 ± 0.196
1.571CysVal: 1.571 ± 0.175
0.0CysTrp: 0.0 ± 0.0
0.786CysTyr: 0.786 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
3.142AspAla: 3.142 ± 0.868
1.178AspCys: 1.178 ± 0.588
3.928AspAsp: 3.928 ± 0.133
3.535AspGlu: 3.535 ± 1.156
3.535AspPhe: 3.535 ± 0.547
1.964AspGly: 1.964 ± 0.371
1.178AspHis: 1.178 ± 0.021
4.321AspIle: 4.321 ± 0.889
5.106AspLys: 5.106 ± 1.941
5.499AspLeu: 5.499 ± 0.918
1.178AspMet: 1.178 ± 0.588
4.713AspAsn: 4.713 ± 1.135
2.357AspPro: 2.357 ± 0.042
1.571AspGln: 1.571 ± 0.175
1.964AspArg: 1.964 ± 0.371
3.928AspSer: 3.928 ± 2.304
3.142AspThr: 3.142 ± 1.478
1.964AspVal: 1.964 ± 0.238
0.393AspTrp: 0.393 ± 0.196
2.749AspTyr: 2.749 ± 0.455
0.0AspXaa: 0.0 ± 0.0
Glu
3.142GluAla: 3.142 ± 0.96
1.178GluCys: 1.178 ± 0.588
1.964GluAsp: 1.964 ± 0.238
1.571GluGlu: 1.571 ± 0.785
0.786GluPhe: 0.786 ± 0.217
1.964GluGly: 1.964 ± 0.981
0.786GluHis: 0.786 ± 0.392
3.535GluIle: 3.535 ± 0.547
0.786GluLys: 0.786 ± 0.392
4.713GluLeu: 4.713 ± 1.744
1.964GluMet: 1.964 ± 0.313
2.357GluAsn: 2.357 ± 0.042
2.357GluPro: 2.357 ± 0.042
1.964GluGln: 1.964 ± 0.238
3.928GluArg: 3.928 ± 1.352
4.321GluSer: 4.321 ± 0.889
5.499GluThr: 5.499 ± 2.137
3.535GluVal: 3.535 ± 0.063
1.178GluTrp: 1.178 ± 0.588
1.571GluTyr: 1.571 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
1.178PheAla: 1.178 ± 0.021
0.393PheCys: 0.393 ± 0.196
6.284PheAsp: 6.284 ± 0.091
1.571PheGlu: 1.571 ± 0.434
1.964PhePhe: 1.964 ± 0.238
2.749PheGly: 2.749 ± 0.154
1.178PheHis: 1.178 ± 0.63
2.357PheIle: 2.357 ± 0.042
1.964PheLys: 1.964 ± 0.371
3.928PheLeu: 3.928 ± 1.352
0.0PheMet: 0.0 ± 0.0
5.106PheAsn: 5.106 ± 0.722
0.786PhePro: 0.786 ± 0.217
0.786PheGln: 0.786 ± 0.217
2.749PheArg: 2.749 ± 1.065
5.892PheSer: 5.892 ± 3.152
3.928PheThr: 3.928 ± 0.743
2.357PheVal: 2.357 ± 0.042
0.393PheTrp: 0.393 ± 0.196
1.571PheTyr: 1.571 ± 1.044
0.0PheXaa: 0.0 ± 0.0
Gly
3.535GlyAla: 3.535 ± 0.063
0.393GlyCys: 0.393 ± 0.196
2.357GlyAsp: 2.357 ± 0.042
3.535GlyGlu: 3.535 ± 0.672
1.571GlyPhe: 1.571 ± 0.434
1.178GlyGly: 1.178 ± 0.021
0.393GlyHis: 0.393 ± 0.196
3.928GlyIle: 3.928 ± 0.133
3.535GlyLys: 3.535 ± 1.156
5.106GlyLeu: 5.106 ± 0.722
0.786GlyMet: 0.786 ± 0.392
2.357GlyAsn: 2.357 ± 0.042
0.0GlyPro: 0.0 ± 0.0
3.142GlyGln: 3.142 ± 1.478
0.393GlyArg: 0.393 ± 0.196
3.928GlySer: 3.928 ± 0.743
2.749GlyThr: 2.749 ± 1.065
1.964GlyVal: 1.964 ± 0.981
0.393GlyTrp: 0.393 ± 0.196
2.357GlyTyr: 2.357 ± 1.177
0.0GlyXaa: 0.0 ± 0.0
His
0.786HisAla: 0.786 ± 0.392
0.0HisCys: 0.0 ± 0.0
0.786HisAsp: 0.786 ± 0.392
1.178HisGlu: 1.178 ± 0.588
2.357HisPhe: 2.357 ± 0.042
0.786HisGly: 0.786 ± 0.392
1.178HisHis: 1.178 ± 0.021
0.393HisIle: 0.393 ± 0.413
2.357HisLys: 2.357 ± 1.177
1.571HisLeu: 1.571 ± 0.175
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.571HisPro: 1.571 ± 0.175
0.786HisGln: 0.786 ± 0.217
0.786HisArg: 0.786 ± 0.392
0.393HisSer: 0.393 ± 0.413
2.357HisThr: 2.357 ± 0.042
0.786HisVal: 0.786 ± 0.217
0.0HisTrp: 0.0 ± 0.0
0.786HisTyr: 0.786 ± 0.392
0.0HisXaa: 0.0 ± 0.0
Ile
6.284IleAla: 6.284 ± 1.127
1.964IleCys: 1.964 ± 0.371
6.284IleAsp: 6.284 ± 0.091
5.106IleGlu: 5.106 ± 1.331
3.142IlePhe: 3.142 ± 0.96
3.928IleGly: 3.928 ± 0.133
1.178IleHis: 1.178 ± 0.021
7.463IleIle: 7.463 ± 0.68
2.749IleLys: 2.749 ± 0.764
2.357IleLeu: 2.357 ± 0.568
1.964IleMet: 1.964 ± 0.371
3.142IleAsn: 3.142 ± 0.259
4.713IlePro: 4.713 ± 1.303
3.142IleGln: 3.142 ± 0.259
1.964IleArg: 1.964 ± 0.371
5.106IleSer: 5.106 ± 2.935
5.106IleThr: 5.106 ± 1.941
4.713IleVal: 4.713 ± 0.084
0.393IleTrp: 0.393 ± 0.413
3.535IleTyr: 3.535 ± 0.672
0.0IleXaa: 0.0 ± 0.0
Lys
4.321LysAla: 4.321 ± 2.158
0.393LysCys: 0.393 ± 0.196
2.749LysAsp: 2.749 ± 0.764
2.749LysGlu: 2.749 ± 1.373
2.357LysPhe: 2.357 ± 0.042
3.142LysGly: 3.142 ± 0.259
1.178LysHis: 1.178 ± 0.588
5.499LysIle: 5.499 ± 0.309
1.571LysLys: 1.571 ± 0.785
2.357LysLeu: 2.357 ± 0.042
1.571LysMet: 1.571 ± 0.175
2.749LysAsn: 2.749 ± 0.455
1.571LysPro: 1.571 ± 0.434
3.535LysGln: 3.535 ± 0.547
1.571LysArg: 1.571 ± 0.785
4.321LysSer: 4.321 ± 2.158
3.928LysThr: 3.928 ± 0.743
3.535LysVal: 3.535 ± 1.765
0.393LysTrp: 0.393 ± 0.196
2.749LysTyr: 2.749 ± 1.373
0.0LysXaa: 0.0 ± 0.0
Leu
6.677LeuAla: 6.677 ± 0.931
0.786LeuCys: 0.786 ± 0.217
5.106LeuAsp: 5.106 ± 0.112
4.321LeuGlu: 4.321 ± 0.939
3.928LeuPhe: 3.928 ± 1.086
3.142LeuGly: 3.142 ± 0.259
2.357LeuHis: 2.357 ± 0.568
7.463LeuIle: 7.463 ± 0.68
5.499LeuLys: 5.499 ± 2.137
6.284LeuLeu: 6.284 ± 0.518
1.571LeuMet: 1.571 ± 0.66
5.499LeuAsn: 5.499 ± 0.918
5.106LeuPro: 5.106 ± 1.716
2.749LeuGln: 2.749 ± 1.373
5.106LeuArg: 5.106 ± 0.497
7.463LeuSer: 7.463 ± 1.758
5.892LeuThr: 5.892 ± 1.324
5.892LeuVal: 5.892 ± 0.105
0.786LeuTrp: 0.786 ± 0.392
3.142LeuTyr: 3.142 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
1.964MetAla: 1.964 ± 0.847
0.393MetCys: 0.393 ± 0.196
0.0MetAsp: 0.0 ± 0.0
1.571MetGlu: 1.571 ± 0.785
0.393MetPhe: 0.393 ± 0.196
0.786MetGly: 0.786 ± 0.392
0.786MetHis: 0.786 ± 0.392
1.571MetIle: 1.571 ± 0.175
1.964MetLys: 1.964 ± 0.371
2.749MetLeu: 2.749 ± 0.154
1.178MetMet: 1.178 ± 0.588
1.178MetAsn: 1.178 ± 0.021
1.178MetPro: 1.178 ± 0.021
0.786MetGln: 0.786 ± 0.217
0.786MetArg: 0.786 ± 0.217
1.571MetSer: 1.571 ± 0.785
0.786MetThr: 0.786 ± 0.217
2.749MetVal: 2.749 ± 0.764
0.0MetTrp: 0.0 ± 0.0
2.357MetTyr: 2.357 ± 0.651
0.0MetXaa: 0.0 ± 0.0
Asn
1.964AsnAla: 1.964 ± 0.238
2.749AsnCys: 2.749 ± 0.764
4.713AsnAsp: 4.713 ± 0.526
2.357AsnGlu: 2.357 ± 0.568
4.713AsnPhe: 4.713 ± 0.084
2.749AsnGly: 2.749 ± 0.154
0.786AsnHis: 0.786 ± 0.392
3.928AsnIle: 3.928 ± 0.133
3.142AsnLys: 3.142 ± 0.35
4.321AsnLeu: 4.321 ± 0.889
1.571AsnMet: 1.571 ± 0.434
3.928AsnAsn: 3.928 ± 0.133
1.964AsnPro: 1.964 ± 0.238
1.571AsnGln: 1.571 ± 0.434
2.749AsnArg: 2.749 ± 0.764
5.892AsnSer: 5.892 ± 1.114
2.749AsnThr: 2.749 ± 0.455
3.928AsnVal: 3.928 ± 0.133
1.178AsnTrp: 1.178 ± 0.588
1.964AsnTyr: 1.964 ± 0.238
0.0AsnXaa: 0.0 ± 0.0
Pro
2.749ProAla: 2.749 ± 0.154
0.786ProCys: 0.786 ± 0.827
2.749ProAsp: 2.749 ± 1.674
2.749ProGlu: 2.749 ± 0.764
3.928ProPhe: 3.928 ± 1.086
3.142ProGly: 3.142 ± 0.96
0.0ProHis: 0.0 ± 0.0
2.357ProIle: 2.357 ± 1.261
1.178ProLys: 1.178 ± 0.021
4.713ProLeu: 4.713 ± 0.693
1.964ProMet: 1.964 ± 0.238
3.142ProAsn: 3.142 ± 0.868
2.749ProPro: 2.749 ± 1.065
2.749ProGln: 2.749 ± 0.154
1.178ProArg: 1.178 ± 0.63
3.535ProSer: 3.535 ± 1.282
3.535ProThr: 3.535 ± 0.063
2.357ProVal: 2.357 ± 0.651
0.0ProTrp: 0.0 ± 0.0
2.357ProTyr: 2.357 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
1.964GlnAla: 1.964 ± 1.457
0.786GlnCys: 0.786 ± 0.392
1.571GlnAsp: 1.571 ± 0.175
1.964GlnGlu: 1.964 ± 0.238
1.571GlnPhe: 1.571 ± 0.434
2.357GlnGly: 2.357 ± 0.651
1.571GlnHis: 1.571 ± 0.785
3.535GlnIle: 3.535 ± 1.156
2.357GlnLys: 2.357 ± 1.177
5.106GlnLeu: 5.106 ± 1.716
0.786GlnMet: 0.786 ± 0.217
2.357GlnAsn: 2.357 ± 0.042
3.928GlnPro: 3.928 ± 0.133
2.749GlnGln: 2.749 ± 1.674
2.357GlnArg: 2.357 ± 1.261
2.357GlnSer: 2.357 ± 0.651
4.321GlnThr: 4.321 ± 0.329
3.142GlnVal: 3.142 ± 0.868
0.786GlnTrp: 0.786 ± 0.392
1.571GlnTyr: 1.571 ± 0.785
0.0GlnXaa: 0.0 ± 0.0
Arg
1.178ArgAla: 1.178 ± 0.588
0.0ArgCys: 0.0 ± 0.0
2.749ArgAsp: 2.749 ± 0.764
0.786ArgGlu: 0.786 ± 0.392
2.357ArgPhe: 2.357 ± 0.651
1.571ArgGly: 1.571 ± 0.785
0.0ArgHis: 0.0 ± 0.0
3.142ArgIle: 3.142 ± 0.35
3.142ArgLys: 3.142 ± 1.569
3.535ArgLeu: 3.535 ± 0.547
3.142ArgMet: 3.142 ± 0.35
1.964ArgAsn: 1.964 ± 0.981
1.571ArgPro: 1.571 ± 0.175
1.178ArgGln: 1.178 ± 0.021
2.357ArgArg: 2.357 ± 0.042
1.964ArgSer: 1.964 ± 0.238
3.535ArgThr: 3.535 ± 1.282
2.749ArgVal: 2.749 ± 1.065
0.393ArgTrp: 0.393 ± 0.196
1.178ArgTyr: 1.178 ± 0.021
0.0ArgXaa: 0.0 ± 0.0
Ser
4.713SerAla: 4.713 ± 0.084
1.571SerCys: 1.571 ± 0.434
2.749SerAsp: 2.749 ± 0.764
3.142SerGlu: 3.142 ± 0.259
1.964SerPhe: 1.964 ± 0.847
5.499SerGly: 5.499 ± 0.91
1.571SerHis: 1.571 ± 0.434
6.677SerIle: 6.677 ± 2.759
3.928SerLys: 3.928 ± 0.743
11.39SerLeu: 11.39 ± 2.234
1.178SerMet: 1.178 ± 0.021
4.713SerAsn: 4.713 ± 0.084
3.928SerPro: 3.928 ± 1.086
5.499SerGln: 5.499 ± 0.91
2.749SerArg: 2.749 ± 0.764
14.925SerSer: 14.925 ± 0.141
6.284SerThr: 6.284 ± 2.346
4.713SerVal: 4.713 ± 1.303
0.0SerTrp: 0.0 ± 0.0
3.535SerTyr: 3.535 ± 0.063
0.0SerXaa: 0.0 ± 0.0
Thr
1.964ThrAla: 1.964 ± 0.371
0.786ThrCys: 0.786 ± 0.392
1.571ThrAsp: 1.571 ± 0.434
1.571ThrGlu: 1.571 ± 0.785
3.142ThrPhe: 3.142 ± 0.35
2.749ThrGly: 2.749 ± 0.154
0.786ThrHis: 0.786 ± 0.217
4.713ThrIle: 4.713 ± 0.084
2.357ThrLys: 2.357 ± 0.042
8.248ThrLeu: 8.248 ± 3.194
2.357ThrMet: 2.357 ± 0.568
6.284ThrAsn: 6.284 ± 0.518
5.499ThrPro: 5.499 ± 1.52
6.677ThrGln: 6.677 ± 0.897
2.357ThrArg: 2.357 ± 0.568
9.427ThrSer: 9.427 ± 0.777
5.106ThrThr: 5.106 ± 2.935
4.321ThrVal: 4.321 ± 0.329
0.393ThrTrp: 0.393 ± 0.413
2.357ThrTyr: 2.357 ± 0.651
0.0ThrXaa: 0.0 ± 0.0
Val
5.892ValAla: 5.892 ± 1.324
0.393ValCys: 0.393 ± 0.413
3.535ValAsp: 3.535 ± 0.672
5.892ValGlu: 5.892 ± 0.105
1.964ValPhe: 1.964 ± 0.371
1.571ValGly: 1.571 ± 0.175
1.178ValHis: 1.178 ± 0.021
3.928ValIle: 3.928 ± 1.352
1.571ValLys: 1.571 ± 0.175
5.499ValLeu: 5.499 ± 0.309
1.964ValMet: 1.964 ± 0.371
3.142ValAsn: 3.142 ± 0.259
4.321ValPro: 4.321 ± 0.889
2.357ValGln: 2.357 ± 1.261
2.357ValArg: 2.357 ± 1.177
5.892ValSer: 5.892 ± 0.714
5.106ValThr: 5.106 ± 0.497
3.142ValVal: 3.142 ± 0.868
0.786ValTrp: 0.786 ± 0.392
2.357ValTyr: 2.357 ± 0.042
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.393TrpAsp: 0.393 ± 0.196
0.393TrpGlu: 0.393 ± 0.196
1.178TrpPhe: 1.178 ± 0.588
0.0TrpGly: 0.0 ± 0.0
0.393TrpHis: 0.393 ± 0.196
0.786TrpIle: 0.786 ± 0.392
0.786TrpLys: 0.786 ± 0.392
1.178TrpLeu: 1.178 ± 0.588
0.0TrpMet: 0.0 ± 0.0
0.393TrpAsn: 0.393 ± 0.196
0.0TrpPro: 0.0 ± 0.0
0.393TrpGln: 0.393 ± 0.196
0.0TrpArg: 0.0 ± 0.0
1.178TrpSer: 1.178 ± 0.63
0.393TrpThr: 0.393 ± 0.196
0.393TrpVal: 0.393 ± 0.413
0.0TrpTrp: 0.0 ± 0.0
0.786TrpTyr: 0.786 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.928TyrAla: 3.928 ± 2.304
0.786TyrCys: 0.786 ± 0.392
3.535TyrAsp: 3.535 ± 1.156
2.749TyrGlu: 2.749 ± 0.764
1.964TyrPhe: 1.964 ± 0.371
1.571TyrGly: 1.571 ± 0.175
1.571TyrHis: 1.571 ± 0.785
2.357TyrIle: 2.357 ± 0.042
1.178TyrLys: 1.178 ± 0.63
3.535TyrLeu: 3.535 ± 0.063
0.786TyrMet: 0.786 ± 0.392
2.749TyrAsn: 2.749 ± 0.764
2.357TyrPro: 2.357 ± 0.651
1.178TyrGln: 1.178 ± 0.021
0.393TyrArg: 0.393 ± 0.196
4.321TyrSer: 4.321 ± 0.889
2.357TyrThr: 2.357 ± 0.568
2.749TyrVal: 2.749 ± 0.455
0.393TyrTrp: 0.393 ± 0.413
1.178TyrTyr: 1.178 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2547 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski