Amino acid dipeptide frequency for Hubei picorna-like virus 72

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
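The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the procedure used by Proteome-pI: the function names are hypothetical, sequences are given in one-letter code, adjacent overlapping pairs are counted within each protein, and counts are pooled over all proteins before normalising.

```python
from collections import Counter

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted, and a pair never
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Made-up toy sequences, not taken from the proteome.
toy = ["MKLLA", "AKLL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With only seven pairs in the toy input every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for realistically long sequences.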

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
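As a small worked example of the unit conversion, the following Python sketch converts per mille values to fractions and percentages and compares a value with the uniform expectation of 2.5‰. The helper names are hypothetical; the sample value 8.141‰ is the SerLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value, expected=2.5):
    """Ratio of an observed per mille value to the uniform 2.5 per mille."""
    return value / expected


ser_lys = 8.141  # SerLys frequency from the table, in per mille
print(per_mille_to_fraction(ser_lys))
print(per_mille_to_percent(ser_lys))
print(fold_vs_uniform(ser_lys))
```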

For more information, see the sequence space article on Wikipedia.

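Rows give the first amino acid of the dipeptide; within each row the entries follow the column order (second amino acid):
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa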
Ala
AlaAla: 4.492 ± 1.699
AlaCys: 1.123 ± 0.643
AlaAsp: 3.93 ± 1.853
AlaGlu: 2.527 ± 0.738
AlaPhe: 3.088 ± 0.584
AlaGly: 5.053 ± 3.553
AlaHis: 0.842 ± 0.476
AlaIle: 5.615 ± 2.014
AlaLys: 2.527 ± 0.773
AlaLeu: 5.334 ± 1.996
AlaMet: 3.088 ± 0.625
AlaAsn: 3.65 ± 1.746
AlaPro: 3.65 ± 2.713
AlaGln: 3.93 ± 1.794
AlaArg: 2.807 ± 0.905
AlaSer: 3.93 ± 1.178
AlaThr: 4.211 ± 1.522
AlaVal: 4.211 ± 2.528
AlaTrp: 0.842 ± 0.592
AlaTyr: 1.965 ± 0.355
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.684 ± 0.272
CysCys: 0.561 ± 0.321
CysAsp: 2.246 ± 1.493
CysGlu: 1.123 ± 0.327
CysPhe: 0.842 ± 0.497
CysGly: 2.246 ± 0.96
CysHis: 1.123 ± 1.033
CysIle: 1.404 ± 0.602
CysLys: 1.123 ± 0.528
CysLeu: 0.561 ± 1.164
CysMet: 0.281 ± 0.161
CysAsn: 1.404 ± 0.271
CysPro: 1.123 ± 0.643
CysGln: 0.281 ± 0.582
CysArg: 0.842 ± 0.246
CysSer: 1.684 ± 0.272
CysThr: 0.561 ± 0.321
CysVal: 0.842 ± 0.482
CysTrp: 0.281 ± 0.161
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.088 ± 1.046
AspCys: 1.123 ± 0.643
AspAsp: 3.088 ± 1.253
AspGlu: 4.773 ± 1.098
AspPhe: 3.088 ± 0.221
AspGly: 3.93 ± 1.015
AspHis: 0.842 ± 0.246
AspIle: 3.93 ± 0.574
AspLys: 2.527 ± 0.347
AspLeu: 4.773 ± 1.133
AspMet: 1.404 ± 0.843
AspAsn: 2.527 ± 0.148
AspPro: 5.053 ± 0.567
AspGln: 1.684 ± 0.492
AspArg: 0.842 ± 0.497
AspSer: 3.088 ± 1.404
AspThr: 2.527 ± 0.148
AspVal: 4.492 ± 1.798
AspTrp: 0.281 ± 0.161
AspTyr: 3.369 ± 1.258
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.65 ± 1.485
GluCys: 0.561 ± 0.517
GluAsp: 2.246 ± 0.758
GluGlu: 4.211 ± 1.389
GluPhe: 4.773 ± 0.748
GluGly: 2.527 ± 0.148
GluHis: 0.561 ± 0.256
GluIle: 3.65 ± 1.759
GluLys: 1.684 ± 0.964
GluLeu: 5.053 ± 1.242
GluMet: 2.807 ± 1.212
GluAsn: 3.088 ± 1.045
GluPro: 1.965 ± 0.745
GluGln: 1.965 ± 0.745
GluArg: 2.246 ± 0.48
GluSer: 3.65 ± 0.455
GluThr: 3.65 ± 0.956
GluVal: 3.369 ± 1.191
GluTrp: 1.123 ± 0.513
GluTyr: 2.807 ± 0.308
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.93 ± 1.176
PheCys: 1.684 ± 0.994
PheAsp: 3.93 ± 0.873
PheGlu: 3.088 ± 1.045
PhePhe: 1.404 ± 1.001
PheGly: 3.93 ± 1.198
PheHis: 0.842 ± 0.246
PheIle: 3.369 ± 1.546
PheLys: 2.246 ± 0.758
PheLeu: 2.246 ± 1.056
PheMet: 0.842 ± 0.246
PheAsn: 2.527 ± 1.935
PhePro: 1.684 ± 0.953
PheGln: 1.684 ± 0.492
PheArg: 2.246 ± 0.713
PheSer: 2.807 ± 0.64
PheThr: 1.684 ± 0.492
PheVal: 1.965 ± 0.556
PheTrp: 0.281 ± 0.161
PheTyr: 1.404 ± 0.939
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.65 ± 1.746
GlyCys: 0.281 ± 0.582
GlyAsp: 3.088 ± 0.221
GlyGlu: 2.246 ± 0.963
GlyPhe: 1.404 ± 0.453
GlyGly: 3.65 ± 2.713
GlyHis: 1.684 ± 0.994
GlyIle: 4.492 ± 0.628
GlyLys: 3.369 ± 0.673
GlyLeu: 5.334 ± 0.931
GlyMet: 3.369 ± 1.106
GlyAsn: 3.93 ± 1.447
GlyPro: 1.684 ± 0.784
GlyGln: 2.807 ± 1.685
GlyArg: 2.246 ± 0.963
GlySer: 3.369 ± 0.629
GlyThr: 3.65 ± 1.097
GlyVal: 4.211 ± 0.194
GlyTrp: 0.561 ± 0.321
GlyTyr: 2.246 ± 0.483
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.684 ± 0.336
HisCys: 0.281 ± 0.582
HisAsp: 0.561 ± 0.256
HisGlu: 1.404 ± 0.803
HisPhe: 0.842 ± 0.476
HisGly: 1.123 ± 0.353
HisHis: 0.281 ± 0.161
HisIle: 1.404 ± 1.001
HisLys: 1.404 ± 0.476
HisLeu: 2.246 ± 0.758
HisMet: 0.842 ± 1.088
HisAsn: 0.281 ± 0.35
HisPro: 1.404 ± 0.497
HisGln: 0.842 ± 0.246
HisArg: 0.842 ± 0.482
HisSer: 1.123 ± 0.643
HisThr: 1.404 ± 1.001
HisVal: 1.965 ± 0.787
HisTrp: 1.684 ± 0.994
HisTyr: 1.404 ± 0.271
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.492 ± 0.88
IleCys: 2.807 ± 1.203
IleAsp: 3.088 ± 0.221
IleGlu: 4.773 ± 0.617
IlePhe: 1.404 ± 1.001
IleGly: 4.211 ± 0.705
IleHis: 1.123 ± 1.032
IleIle: 3.369 ± 0.544
IleLys: 4.492 ± 2.111
IleLeu: 3.65 ± 1.283
IleMet: 2.246 ± 0.616
IleAsn: 3.088 ± 0.584
IlePro: 4.773 ± 1.329
IleGln: 3.088 ± 0.923
IleArg: 1.404 ± 0.803
IleSer: 4.492 ± 0.628
IleThr: 6.457 ± 0.687
IleVal: 3.369 ± 0.301
IleTrp: 0.561 ± 0.321
IleTyr: 1.965 ± 0.556
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.93 ± 1.198
LysCys: 1.123 ± 0.643
LysAsp: 3.65 ± 1.06
LysGlu: 4.211 ± 1.644
LysPhe: 3.93 ± 1.719
LysGly: 3.65 ± 1.645
LysHis: 1.684 ± 0.964
LysIle: 2.527 ± 0.762
LysLys: 3.93 ± 2.249
LysLeu: 5.896 ± 1.225
LysMet: 1.123 ± 0.643
LysAsn: 3.93 ± 1.178
LysPro: 2.527 ± 0.148
LysGln: 1.404 ± 0.803
LysArg: 2.246 ± 0.758
LysSer: 2.527 ± 1.055
LysThr: 4.211 ± 1.644
LysVal: 3.93 ± 0.71
LysTrp: 1.404 ± 0.271
LysTyr: 3.369 ± 0.378
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.773 ± 1.036
LeuCys: 1.404 ± 0.453
LeuAsp: 2.807 ± 0.64
LeuGlu: 5.896 ± 1.066
LeuPhe: 3.088 ± 1.253
LeuGly: 1.684 ± 0.492
LeuHis: 2.527 ± 1.055
LeuIle: 4.492 ± 0.731
LeuLys: 6.176 ± 2.74
LeuLeu: 5.053 ± 0.567
LeuMet: 2.246 ± 0.48
LeuAsn: 6.738 ± 2.273
LeuPro: 3.93 ± 1.737
LeuGln: 5.896 ± 0.959
LeuArg: 3.93 ± 1.178
LeuSer: 5.053 ± 1.213
LeuThr: 4.773 ± 1.431
LeuVal: 4.773 ± 1.329
LeuTrp: 0.842 ± 0.497
LeuTyr: 3.65 ± 0.949
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.369 ± 0.498
MetCys: 0.281 ± 0.161
MetAsp: 2.246 ± 0.026
MetGlu: 1.684 ± 0.595
MetPhe: 1.404 ± 0.843
MetGly: 0.842 ± 0.246
MetHis: 0.561 ± 0.517
MetIle: 2.246 ± 0.758
MetLys: 2.246 ± 0.713
MetLeu: 3.65 ± 1.632
MetMet: 1.123 ± 0.938
MetAsn: 1.123 ± 0.938
MetPro: 0.842 ± 0.592
MetGln: 0.842 ± 0.592
MetArg: 1.684 ± 0.336
MetSer: 3.088 ± 0.833
MetThr: 1.123 ± 1.032
MetVal: 1.965 ± 0.724
MetTrp: 0.561 ± 0.256
MetTyr: 0.281 ± 0.161
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.334 ± 0.197
AsnCys: 0.842 ± 1.088
AsnAsp: 3.93 ± 0.873
AsnGlu: 3.369 ± 0.983
AsnPhe: 2.527 ± 0.738
AsnGly: 4.492 ± 0.966
AsnHis: 1.965 ± 1.427
AsnIle: 2.807 ± 1.172
AsnLys: 2.527 ± 1.1
AsnLeu: 5.334 ± 0.54
AsnMet: 1.123 ± 0.938
AsnAsn: 2.527 ± 0.347
AsnPro: 5.053 ± 1.216
AsnGln: 2.527 ± 1.305
AsnArg: 2.246 ± 1.056
AsnSer: 4.492 ± 0.88
AsnThr: 2.807 ± 0.952
AsnVal: 2.807 ± 0.994
AsnTrp: 0.281 ± 0.161
AsnTyr: 1.965 ± 0.631
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.527 ± 0.738
ProCys: 0.561 ± 0.517
ProAsp: 4.211 ± 0.194
ProGlu: 1.965 ± 1.124
ProPhe: 1.965 ± 0.724
ProGly: 3.369 ± 1.396
ProHis: 1.404 ± 0.271
ProIle: 4.492 ± 1.41
ProLys: 2.246 ± 1.493
ProLeu: 4.773 ± 1.036
ProMet: 1.123 ± 0.327
ProAsn: 2.246 ± 0.939
ProPro: 2.527 ± 1.305
ProGln: 1.123 ± 0.938
ProArg: 1.965 ± 2.177
ProSer: 3.65 ± 1.114
ProThr: 5.053 ± 0.694
ProVal: 4.492 ± 1.011
ProTrp: 1.123 ± 0.513
ProTyr: 1.404 ± 0.476
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.211 ± 2.528
GlnCys: 0.842 ± 0.482
GlnAsp: 2.807 ± 0.77
GlnGlu: 2.807 ± 0.795
GlnPhe: 2.527 ± 0.738
GlnGly: 1.684 ± 0.492
GlnHis: 1.404 ± 0.271
GlnIle: 1.684 ± 0.769
GlnLys: 2.246 ± 0.899
GlnLeu: 2.807 ± 0.308
GlnMet: 0.0 ± 0.0
GlnAsn: 2.527 ± 0.787
GlnPro: 2.807 ± 0.795
GlnGln: 2.246 ± 0.654
GlnArg: 1.965 ± 0.724
GlnSer: 3.93 ± 2.192
GlnThr: 2.527 ± 1.776
GlnVal: 2.527 ± 0.787
GlnTrp: 0.842 ± 0.482
GlnTyr: 0.842 ± 0.482
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.965 ± 1.096
ArgCys: 1.123 ± 0.643
ArgAsp: 1.684 ± 0.492
ArgGlu: 1.684 ± 1.55
ArgPhe: 1.404 ± 0.476
ArgGly: 1.123 ± 0.513
ArgHis: 0.281 ± 0.161
ArgIle: 3.369 ± 0.544
ArgLys: 1.965 ± 1.124
ArgLeu: 3.93 ± 0.754
ArgMet: 1.123 ± 2.327
ArgAsn: 2.807 ± 1.203
ArgPro: 1.965 ± 1.012
ArgGln: 1.123 ± 0.513
ArgArg: 3.088 ± 0.878
ArgSer: 1.684 ± 0.705
ArgThr: 2.527 ± 0.608
ArgVal: 4.211 ± 0.812
ArgTrp: 0.842 ± 0.482
ArgTyr: 1.965 ± 0.631
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.246 ± 0.026
SerCys: 1.404 ± 1.602
SerAsp: 2.807 ± 0.542
SerGlu: 3.93 ± 0.95
SerPhe: 1.965 ± 0.556
SerGly: 5.053 ± 2.821
SerHis: 1.684 ± 0.272
SerIle: 4.211 ± 0.705
SerLys: 8.141 ± 2.635
SerLeu: 5.896 ± 1.444
SerMet: 1.965 ± 1.344
SerAsn: 2.527 ± 1.055
SerPro: 3.369 ± 0.673
SerGln: 3.93 ± 1.198
SerArg: 3.369 ± 1.058
SerSer: 2.527 ± 1.305
SerThr: 4.773 ± 1.329
SerVal: 3.369 ± 1.783
SerTrp: 1.123 ± 0.643
SerTyr: 2.246 ± 0.654
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.615 ± 1.523
ThrCys: 1.404 ± 0.476
ThrAsp: 2.246 ± 0.713
ThrGlu: 3.088 ± 1.265
ThrPhe: 3.369 ± 1.693
ThrGly: 3.93 ± 0.574
ThrHis: 0.842 ± 1.147
ThrIle: 4.492 ± 0.628
ThrLys: 4.211 ± 0.362
ThrLeu: 5.334 ± 1.023
ThrMet: 1.965 ± 1.108
ThrAsn: 3.65 ± 0.789
ThrPro: 3.369 ± 0.983
ThrGln: 2.527 ± 0.976
ThrArg: 1.404 ± 0.93
ThrSer: 7.299 ± 0.692
ThrThr: 3.088 ± 1.23
ThrVal: 3.369 ± 0.544
ThrTrp: 0.281 ± 0.161
ThrTyr: 2.246 ± 0.48
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.369 ± 0.498
ValCys: 2.246 ± 0.758
ValAsp: 3.93 ± 1.198
ValGlu: 1.965 ± 1.513
ValPhe: 1.123 ± 0.513
ValGly: 3.65 ± 1.281
ValHis: 1.684 ± 0.336
ValIle: 5.053 ± 1.592
ValLys: 3.93 ± 1.392
ValLeu: 4.773 ± 1.133
ValMet: 2.246 ± 1.025
ValAsn: 4.492 ± 1.59
ValPro: 2.807 ± 0.308
ValGln: 1.965 ± 0.745
ValArg: 2.527 ± 0.608
ValSer: 4.492 ± 1.33
ValThr: 4.492 ± 0.731
ValVal: 4.492 ± 0.052
ValTrp: 1.965 ± 0.724
ValTyr: 1.965 ± 0.556
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.561 ± 0.7
TrpCys: 0.281 ± 0.161
TrpAsp: 1.404 ± 0.803
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.404 ± 0.939
TrpGly: 0.0 ± 0.0
TrpHis: 0.281 ± 0.161
TrpIle: 0.561 ± 0.321
TrpLys: 1.404 ± 0.453
TrpLeu: 1.404 ± 0.803
TrpMet: 0.842 ± 0.476
TrpAsn: 1.965 ± 0.556
TrpPro: 0.561 ± 0.321
TrpGln: 0.561 ± 0.321
TrpArg: 0.561 ± 0.321
TrpSer: 0.561 ± 0.517
TrpThr: 1.123 ± 0.657
TrpVal: 0.561 ± 0.321
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.123 ± 0.643
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.246 ± 0.654
TyrCys: 0.281 ± 0.35
TyrAsp: 2.527 ± 0.976
TyrGlu: 0.842 ± 0.246
TyrPhe: 2.246 ± 1.056
TyrGly: 1.123 ± 0.643
TyrHis: 1.404 ± 0.271
TyrIle: 1.684 ± 0.272
TyrLys: 2.807 ± 1.203
TyrLeu: 1.684 ± 0.595
TyrMet: 1.123 ± 0.513
TyrAsn: 3.93 ± 1.176
TyrPro: 1.123 ± 1.032
TyrGln: 2.807 ± 0.308
TyrArg: 1.404 ± 0.453
TyrSer: 3.369 ± 1.693
TyrThr: 3.088 ± 1.23
TyrVal: 2.246 ± 0.963
TyrTrp: 0.281 ± 0.161
TyrTyr: 0.842 ± 0.482
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3563 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
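A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative Python example under assumptions the page does not confirm: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) whole proteins with
    replacement and pools their pair counts before normalising.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up toy sequences in one-letter code, not taken from the proteome.
toy = ["MKLLA", "AKLL", "LLKLA"]
mean_kl, sd_kl = bootstrap_error(toy, "KL")
print(round(mean_kl, 3), round(sd_kl, 3))
```

With only a handful of proteins, as here, the resampled sets differ strongly from one another, so relatively large errors are to be expected.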

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski