Amino acid dipepetide frequency for Hubei picorna-like virus 69

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.816AlaAla: 3.816 ± 0.944
0.881AlaCys: 0.881 ± 0.994
2.348AlaAsp: 2.348 ± 1.175
2.642AlaGlu: 2.642 ± 0.26
2.055AlaPhe: 2.055 ± 0.858
2.348AlaGly: 2.348 ± 0.574
2.055AlaHis: 2.055 ± 0.744
3.229AlaIle: 3.229 ± 1.474
3.522AlaLys: 3.522 ± 1.201
4.696AlaLeu: 4.696 ± 1.775
1.468AlaMet: 1.468 ± 0.374
2.055AlaAsn: 2.055 ± 0.528
2.642AlaPro: 2.642 ± 0.78
3.522AlaGln: 3.522 ± 0.517
0.881AlaArg: 0.881 ± 0.356
4.403AlaSer: 4.403 ± 0.693
2.642AlaThr: 2.642 ± 0.896
4.109AlaVal: 4.109 ± 0.808
0.881AlaTrp: 0.881 ± 0.441
3.229AlaTyr: 3.229 ± 1.11
0.0AlaXaa: 0.0 ± 0.0
Cys
1.761CysAla: 1.761 ± 0.491
0.294CysCys: 0.294 ± 0.147
0.294CysAsp: 0.294 ± 0.556
1.468CysGlu: 1.468 ± 0.79
1.468CysPhe: 1.468 ± 0.342
1.468CysGly: 1.468 ± 0.735
0.881CysHis: 0.881 ± 0.441
1.761CysIle: 1.761 ± 0.42
1.174CysLys: 1.174 ± 1.147
1.468CysLeu: 1.468 ± 0.735
0.0CysMet: 0.0 ± 0.0
0.587CysAsn: 0.587 ± 0.294
0.881CysPro: 0.881 ± 0.356
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.294CysSer: 0.294 ± 0.147
0.881CysThr: 0.881 ± 0.441
1.174CysVal: 1.174 ± 0.588
0.294CysTrp: 0.294 ± 0.147
0.294CysTyr: 0.294 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
3.522AspAla: 3.522 ± 0.307
1.761AspCys: 1.761 ± 0.882
4.99AspAsp: 4.99 ± 1.266
4.696AspGlu: 4.696 ± 1.243
4.403AspPhe: 4.403 ± 0.688
2.348AspGly: 2.348 ± 1.473
1.174AspHis: 1.174 ± 0.588
3.816AspIle: 3.816 ± 0.781
4.696AspLys: 4.696 ± 0.843
4.99AspLeu: 4.99 ± 0.723
1.761AspMet: 1.761 ± 0.42
1.468AspAsn: 1.468 ± 0.553
3.816AspPro: 3.816 ± 0.706
1.761AspGln: 1.761 ± 0.42
0.881AspArg: 0.881 ± 0.356
2.642AspSer: 2.642 ± 0.822
2.348AspThr: 2.348 ± 0.653
4.403AspVal: 4.403 ± 0.688
1.174AspTrp: 1.174 ± 0.287
2.055AspTyr: 2.055 ± 0.621
0.0AspXaa: 0.0 ± 0.0
Glu
4.109GluAla: 4.109 ± 0.562
1.174GluCys: 1.174 ± 0.588
3.229GluAsp: 3.229 ± 0.752
4.99GluGlu: 4.99 ± 1.399
4.403GluPhe: 4.403 ± 0.132
3.229GluGly: 3.229 ± 0.995
2.055GluHis: 2.055 ± 0.529
5.283GluIle: 5.283 ± 1.032
4.696GluLys: 4.696 ± 1.305
4.99GluLeu: 4.99 ± 0.178
1.468GluMet: 1.468 ± 0.374
2.348GluAsn: 2.348 ± 0.633
4.403GluPro: 4.403 ± 1.84
2.935GluGln: 2.935 ± 0.677
2.935GluArg: 2.935 ± 0.684
2.055GluSer: 2.055 ± 1.028
2.348GluThr: 2.348 ± 0.121
3.816GluVal: 3.816 ± 1.037
1.174GluTrp: 1.174 ± 0.484
3.229GluTyr: 3.229 ± 0.752
0.0GluXaa: 0.0 ± 0.0
Phe
2.348PheAla: 2.348 ± 0.807
0.881PheCys: 0.881 ± 0.356
1.468PheAsp: 1.468 ± 0.339
1.174PheGlu: 1.174 ± 0.588
3.229PhePhe: 3.229 ± 1.883
2.642PheGly: 2.642 ± 1.322
0.587PheHis: 0.587 ± 0.819
1.761PheIle: 1.761 ± 0.712
2.642PheLys: 2.642 ± 1.322
4.403PheLeu: 4.403 ± 0.423
1.761PheMet: 1.761 ± 0.491
3.229PheAsn: 3.229 ± 0.861
1.468PhePro: 1.468 ± 1.009
1.468PheGln: 1.468 ± 0.339
2.348PheArg: 2.348 ± 0.41
5.283PheSer: 5.283 ± 1.263
3.522PheThr: 3.522 ± 1.452
3.229PheVal: 3.229 ± 1.11
0.294PheTrp: 0.294 ± 0.147
1.468PheTyr: 1.468 ± 0.79
0.0PheXaa: 0.0 ± 0.0
Gly
1.174GlyAla: 1.174 ± 0.287
0.881GlyCys: 0.881 ± 0.441
3.522GlyAsp: 3.522 ± 0.982
2.055GlyGlu: 2.055 ± 0.621
1.468GlyPhe: 1.468 ± 0.339
2.348GlyGly: 2.348 ± 0.757
1.761GlyHis: 1.761 ± 0.682
3.816GlyIle: 3.816 ± 0.469
4.696GlyLys: 4.696 ± 1.904
2.935GlyLeu: 2.935 ± 1.062
0.881GlyMet: 0.881 ± 0.26
2.055GlyAsn: 2.055 ± 0.528
2.642GlyPro: 2.642 ± 0.817
1.468GlyGln: 1.468 ± 1.527
1.174GlyArg: 1.174 ± 0.287
3.816GlySer: 3.816 ± 0.934
2.935GlyThr: 2.935 ± 0.781
3.522GlyVal: 3.522 ± 0.697
0.881GlyTrp: 0.881 ± 0.711
1.468GlyTyr: 1.468 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.468HisAla: 1.468 ± 0.339
0.294HisCys: 0.294 ± 0.147
0.881HisAsp: 0.881 ± 0.441
0.587HisGlu: 0.587 ± 0.294
1.761HisPhe: 1.761 ± 0.52
0.881HisGly: 0.881 ± 0.441
0.881HisHis: 0.881 ± 0.26
2.348HisIle: 2.348 ± 0.622
1.468HisLys: 1.468 ± 0.374
2.935HisLeu: 2.935 ± 1.008
0.587HisMet: 0.587 ± 0.294
1.761HisAsn: 1.761 ± 0.712
0.881HisPro: 0.881 ± 0.441
0.881HisGln: 0.881 ± 0.26
0.587HisArg: 0.587 ± 0.294
2.935HisSer: 2.935 ± 1.469
0.881HisThr: 0.881 ± 0.711
1.468HisVal: 1.468 ± 1.436
0.881HisTrp: 0.881 ± 0.441
0.881HisTyr: 0.881 ± 0.26
0.0HisXaa: 0.0 ± 0.0
Ile
4.99IleAla: 4.99 ± 1.399
1.468IleCys: 1.468 ± 0.735
4.696IleAsp: 4.696 ± 0.581
4.403IleGlu: 4.403 ± 2.043
2.348IlePhe: 2.348 ± 0.574
4.403IleGly: 4.403 ± 1.714
1.468IleHis: 1.468 ± 0.342
2.642IleIle: 2.642 ± 0.784
5.87IleLys: 5.87 ± 0.367
6.164IleLeu: 6.164 ± 2.018
1.761IleMet: 1.761 ± 1.329
2.348IleAsn: 2.348 ± 0.121
5.577IlePro: 5.577 ± 0.488
1.174IleGln: 1.174 ± 1.549
2.642IleArg: 2.642 ± 0.78
6.751IleSer: 6.751 ± 0.244
5.87IleThr: 5.87 ± 1.049
4.99IleVal: 4.99 ± 0.905
0.881IleTrp: 0.881 ± 0.711
2.055IleTyr: 2.055 ± 0.744
0.0IleXaa: 0.0 ± 0.0
Lys
2.348LysAla: 2.348 ± 0.653
0.881LysCys: 0.881 ± 0.441
2.935LysAsp: 2.935 ± 1.469
4.109LysGlu: 4.109 ± 1.487
2.055LysPhe: 2.055 ± 0.529
2.055LysGly: 2.055 ± 0.621
1.174LysHis: 1.174 ± 0.287
8.218LysIle: 8.218 ± 1.446
5.283LysLys: 5.283 ± 1.041
5.283LysLeu: 5.283 ± 0.152
1.468LysMet: 1.468 ± 0.735
2.935LysAsn: 2.935 ± 1.038
3.816LysPro: 3.816 ± 1.469
2.935LysGln: 2.935 ± 1.343
2.935LysArg: 2.935 ± 1.038
3.229LysSer: 3.229 ± 0.551
6.457LysThr: 6.457 ± 1.319
4.696LysVal: 4.696 ± 1.234
1.174LysTrp: 1.174 ± 0.62
2.055LysTyr: 2.055 ± 0.529
0.0LysXaa: 0.0 ± 0.0
Leu
5.87LeuAla: 5.87 ± 0.473
0.881LeuCys: 0.881 ± 0.441
4.403LeuAsp: 4.403 ± 0.423
5.283LeuGlu: 5.283 ± 0.781
3.522LeuPhe: 3.522 ± 0.384
2.642LeuGly: 2.642 ± 1.322
1.468LeuHis: 1.468 ± 0.735
4.696LeuIle: 4.696 ± 0.774
7.631LeuLys: 7.631 ± 1.375
8.805LeuLeu: 8.805 ± 2.032
2.055LeuMet: 2.055 ± 0.529
4.403LeuAsn: 4.403 ± 0.688
4.696LeuPro: 4.696 ± 1.538
4.403LeuGln: 4.403 ± 1.212
4.109LeuArg: 4.109 ± 1.615
5.577LeuSer: 5.577 ± 0.988
6.457LeuThr: 6.457 ± 1.281
3.816LeuVal: 3.816 ± 0.843
1.468LeuTrp: 1.468 ± 0.374
3.522LeuTyr: 3.522 ± 0.697
0.0LeuXaa: 0.0 ± 0.0
Met
0.587MetAla: 0.587 ± 0.294
0.0MetCys: 0.0 ± 0.0
1.468MetAsp: 1.468 ± 0.735
1.468MetGlu: 1.468 ± 0.339
1.174MetPhe: 1.174 ± 1.119
1.174MetGly: 1.174 ± 0.287
0.294MetHis: 0.294 ± 0.147
2.935MetIle: 2.935 ± 0.677
0.587MetLys: 0.587 ± 0.294
2.348MetLeu: 2.348 ± 1.175
0.881MetMet: 0.881 ± 0.441
0.587MetAsn: 0.587 ± 0.443
1.468MetPro: 1.468 ± 1.436
0.881MetGln: 0.881 ± 0.356
0.587MetArg: 0.587 ± 0.31
4.403MetSer: 4.403 ± 1.137
1.174MetThr: 1.174 ± 0.886
1.174MetVal: 1.174 ± 0.588
0.587MetTrp: 0.587 ± 0.443
1.174MetTyr: 1.174 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
3.816AsnAla: 3.816 ± 0.428
1.174AsnCys: 1.174 ± 0.316
2.055AsnAsp: 2.055 ± 1.114
2.935AsnGlu: 2.935 ± 0.405
2.348AsnPhe: 2.348 ± 0.574
1.761AsnGly: 1.761 ± 0.195
1.174AsnHis: 1.174 ± 0.316
3.816AsnIle: 3.816 ± 0.934
1.174AsnLys: 1.174 ± 0.588
4.99AsnLeu: 4.99 ± 0.494
0.587AsnMet: 0.587 ± 0.443
1.174AsnAsn: 1.174 ± 0.588
3.229AsnPro: 3.229 ± 0.219
0.587AsnGln: 0.587 ± 0.819
2.348AsnArg: 2.348 ± 1.175
4.109AsnSer: 4.109 ± 0.562
3.229AsnThr: 3.229 ± 1.11
2.642AsnVal: 2.642 ± 0.26
0.587AsnTrp: 0.587 ± 0.31
2.348AsnTyr: 2.348 ± 0.121
0.0AsnXaa: 0.0 ± 0.0
Pro
3.229ProAla: 3.229 ± 1.204
1.468ProCys: 1.468 ± 1.436
3.522ProAsp: 3.522 ± 0.697
3.229ProGlu: 3.229 ± 1.496
1.468ProPhe: 1.468 ± 0.339
3.816ProGly: 3.816 ± 0.934
0.294ProHis: 0.294 ± 0.556
3.522ProIle: 3.522 ± 1.003
3.816ProLys: 3.816 ± 0.469
5.283ProLeu: 5.283 ± 1.924
0.0ProMet: 0.0 ± 0.0
2.935ProAsn: 2.935 ± 1.008
4.403ProPro: 4.403 ± 1.468
2.348ProGln: 2.348 ± 0.41
0.881ProArg: 0.881 ± 0.63
5.283ProSer: 5.283 ± 1.924
4.109ProThr: 4.109 ± 0.465
4.696ProVal: 4.696 ± 1.284
1.468ProTrp: 1.468 ± 0.79
0.881ProTyr: 0.881 ± 0.63
0.0ProXaa: 0.0 ± 0.0
Gln
1.174GlnAla: 1.174 ± 0.588
0.294GlnCys: 0.294 ± 0.41
1.468GlnAsp: 1.468 ± 1.009
4.403GlnGlu: 4.403 ± 1.179
2.642GlnPhe: 2.642 ± 1.672
2.348GlnGly: 2.348 ± 1.24
0.587GlnHis: 0.587 ± 0.294
3.522GlnIle: 3.522 ± 0.85
2.055GlnLys: 2.055 ± 0.543
0.881GlnLeu: 0.881 ± 0.356
1.174GlnMet: 1.174 ± 0.351
1.761GlnAsn: 1.761 ± 1.207
1.468GlnPro: 1.468 ± 1.344
1.468GlnGln: 1.468 ± 0.823
1.468GlnArg: 1.468 ± 0.735
3.522GlnSer: 3.522 ± 0.307
1.761GlnThr: 1.761 ± 0.875
2.935GlnVal: 2.935 ± 0.749
1.174GlnTrp: 1.174 ± 1.553
0.881GlnTyr: 0.881 ± 0.356
0.0GlnXaa: 0.0 ± 0.0
Arg
1.761ArgAla: 1.761 ± 0.491
1.174ArgCys: 1.174 ± 0.588
2.642ArgAsp: 2.642 ± 1.287
3.522ArgGlu: 3.522 ± 1.763
2.348ArgPhe: 2.348 ± 0.653
1.468ArgGly: 1.468 ± 0.374
1.468ArgHis: 1.468 ± 0.342
1.761ArgIle: 1.761 ± 0.52
1.174ArgLys: 1.174 ± 0.287
4.109ArgLeu: 4.109 ± 1.161
1.468ArgMet: 1.468 ± 0.342
2.055ArgAsn: 2.055 ± 0.621
2.935ArgPro: 2.935 ± 1.226
1.468ArgGln: 1.468 ± 0.374
2.935ArgArg: 2.935 ± 0.749
3.522ArgSer: 3.522 ± 0.384
3.816ArgThr: 3.816 ± 1.037
1.468ArgVal: 1.468 ± 0.342
0.0ArgTrp: 0.0 ± 0.0
0.881ArgTyr: 0.881 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
4.696SerAla: 4.696 ± 1.148
0.587SerCys: 0.587 ± 0.294
4.403SerAsp: 4.403 ± 0.649
5.283SerGlu: 5.283 ± 0.583
2.935SerPhe: 2.935 ± 0.701
2.935SerGly: 2.935 ± 1.226
1.761SerHis: 1.761 ± 0.882
7.925SerIle: 7.925 ± 0.512
3.229SerLys: 3.229 ± 0.551
7.044SerLeu: 7.044 ± 0.895
1.761SerMet: 1.761 ± 0.42
3.816SerAsn: 3.816 ± 1.11
4.109SerPro: 4.109 ± 1.322
2.348SerGln: 2.348 ± 0.757
4.99SerArg: 4.99 ± 0.681
4.696SerSer: 4.696 ± 0.729
5.283SerThr: 5.283 ± 1.569
5.283SerVal: 5.283 ± 0.586
0.881SerTrp: 0.881 ± 0.441
3.229SerTyr: 3.229 ± 2.667
0.0SerXaa: 0.0 ± 0.0
Thr
2.055ThrAla: 2.055 ± 0.621
1.761ThrCys: 1.761 ± 0.882
4.403ThrAsp: 4.403 ± 0.701
3.816ThrGlu: 3.816 ± 0.254
2.055ThrPhe: 2.055 ± 0.07
3.522ThrGly: 3.522 ± 0.85
1.761ThrHis: 1.761 ± 0.491
4.109ThrIle: 4.109 ± 0.139
3.522ThrLys: 3.522 ± 0.861
4.403ThrLeu: 4.403 ± 2.37
2.642ThrMet: 2.642 ± 0.44
2.935ThrAsn: 2.935 ± 0.749
2.935ThrPro: 2.935 ± 0.425
2.055ThrGln: 2.055 ± 0.744
4.99ThrArg: 4.99 ± 0.681
5.283ThrSer: 5.283 ± 1.302
4.109ThrThr: 4.109 ± 1.615
5.283ThrVal: 5.283 ± 0.583
0.881ThrTrp: 0.881 ± 0.441
0.587ThrTyr: 0.587 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
3.816ValAla: 3.816 ± 0.469
0.0ValCys: 0.0 ± 0.0
4.99ValAsp: 4.99 ± 2.24
5.283ValGlu: 5.283 ± 0.152
1.761ValPhe: 1.761 ± 0.682
2.642ValGly: 2.642 ± 0.896
2.348ValHis: 2.348 ± 0.633
3.522ValIle: 3.522 ± 0.697
4.99ValLys: 4.99 ± 2.05
6.457ValLeu: 6.457 ± 1.178
1.174ValMet: 1.174 ± 0.484
3.816ValAsn: 3.816 ± 0.907
2.642ValPro: 2.642 ± 0.26
3.522ValGln: 3.522 ± 0.391
2.935ValArg: 2.935 ± 1.038
6.457ValSer: 6.457 ± 0.173
2.348ValThr: 2.348 ± 1.175
3.522ValVal: 3.522 ± 0.841
1.174ValTrp: 1.174 ± 0.484
2.935ValTyr: 2.935 ± 0.781
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.294TrpCys: 0.294 ± 0.41
1.761TrpAsp: 1.761 ± 0.195
0.294TrpGlu: 0.294 ± 0.147
0.294TrpPhe: 0.294 ± 0.147
0.294TrpGly: 0.294 ± 0.147
0.587TrpHis: 0.587 ± 0.31
1.174TrpIle: 1.174 ± 0.62
1.761TrpLys: 1.761 ± 0.42
1.468TrpLeu: 1.468 ± 0.735
0.294TrpMet: 0.294 ± 0.556
1.761TrpAsn: 1.761 ± 0.491
0.881TrpPro: 0.881 ± 0.356
0.587TrpGln: 0.587 ± 0.294
0.881TrpArg: 0.881 ± 0.63
0.587TrpSer: 0.587 ± 0.31
0.587TrpThr: 0.587 ± 0.777
2.642TrpVal: 2.642 ± 1.166
0.587TrpTrp: 0.587 ± 0.31
0.587TrpTyr: 0.587 ± 0.443
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.881TyrAla: 0.881 ± 0.26
0.294TyrCys: 0.294 ± 0.556
3.229TyrAsp: 3.229 ± 0.955
3.229TyrGlu: 3.229 ± 0.598
1.468TyrPhe: 1.468 ± 0.339
1.174TyrGly: 1.174 ± 0.588
1.468TyrHis: 1.468 ± 0.339
2.642TyrIle: 2.642 ± 0.26
2.055TyrLys: 2.055 ± 0.07
2.055TyrLeu: 2.055 ± 1.072
1.468TyrMet: 1.468 ± 1.306
2.055TyrAsn: 2.055 ± 0.744
1.761TyrPro: 1.761 ± 0.491
1.174TyrGln: 1.174 ± 0.287
1.468TyrArg: 1.468 ± 0.342
2.348TyrSer: 2.348 ± 0.622
2.348TyrThr: 2.348 ± 0.653
1.468TyrVal: 1.468 ± 0.342
0.881TyrTrp: 0.881 ± 0.441
2.348TyrTyr: 2.348 ± 1.98
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski