Amino acid dipeptide frequency for Hubei virga-like virus 17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
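The calculation described above can be sketched as follows. This is a minimal illustration, not the code used to build the database: it assumes that overlapping residue pairs are counted within each protein, that any symbol outside the 20 standard one-letter codes is mapped to X (Xaa), and that frequencies are normalised by the total number of pairs. The function names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: every non-standard or unknown symbol becomes X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = (a + b for a, b in product(ALPHABET, repeat=2))
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille, n_residues=20):
    """Compare a per-mille frequency with the uniform expectation."""
    expected = 1000 / n_residues ** 2  # 2.5 per mille for 20 residues
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(4.654)` labels the AlaAla value from the table as overrepresented relative to the 2.5‰ uniform expectation, while `classify(0.997)` labels AlaCys as underrepresented.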

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
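Because the table uses per mille while the random expectation is usually quoted in percent, a small conversion sketch may help when comparing the two. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value / 10


def uniform_expectation_per_mille(n_residues=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000 / n_residues ** 2


# AlaAla from the table, expressed in percent.
print(f"{per_mille_to_percent(4.654):.4f} %")
# Uniform expectation for 20 residues and for 21 residues (with Xaa).
print(f"{uniform_expectation_per_mille(20):.3f} per mille")
print(f"{uniform_expectation_per_mille(21):.3f} per mille")
```

The uniform expectation of 0.25% therefore corresponds to 2.5‰ in the units of the table, or roughly 2.27‰ if Xaa is included as a 21st residue.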

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.654 ± 1.283
AlaCys: 0.997 ± 0.379
AlaAsp: 4.987 ± 0.159
AlaGlu: 2.66 ± 0.614
AlaPhe: 1.33 ± 1.572
AlaGly: 2.327 ± 1.226
AlaHis: 1.33 ± 1.005
AlaIle: 2.66 ± 0.614
AlaLys: 3.989 ± 1.15
AlaLeu: 4.987 ± 1.278
AlaMet: 2.327 ± 1.03
AlaAsn: 0.997 ± 0.525
AlaPro: 0.665 ± 2.43
AlaGln: 1.662 ± 0.871
AlaArg: 1.662 ± 0.876
AlaSer: 3.657 ± 1.374
AlaThr: 1.995 ± 1.051
AlaVal: 3.657 ± 1.033
AlaTrp: 0.332 ± 0.175
AlaTyr: 3.324 ± 0.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.997 ± 0.525
CysCys: 0.332 ± 0.175
CysAsp: 3.989 ± 2.101
CysGlu: 1.662 ± 0.812
CysPhe: 0.665 ± 0.35
CysGly: 0.665 ± 0.35
CysHis: 0.0 ± 0.0
CysIle: 0.997 ± 0.379
CysLys: 0.997 ± 1.149
CysLeu: 1.662 ± 1.454
CysMet: 0.332 ± 0.175
CysAsn: 0.997 ± 0.379
CysPro: 1.33 ± 0.383
CysGln: 0.0 ± 0.0
CysArg: 0.997 ± 0.525
CysSer: 0.997 ± 0.525
CysThr: 1.33 ± 0.898
CysVal: 2.66 ± 0.879
CysTrp: 0.0 ± 0.0
CysTyr: 0.665 ± 0.35
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.987 ± 1.379
AspCys: 1.662 ± 0.46
AspAsp: 7.314 ± 2.033
AspGlu: 4.654 ± 1.448
AspPhe: 4.654 ± 1.283
AspGly: 3.657 ± 1.374
AspHis: 0.997 ± 0.379
AspIle: 4.322 ± 1.153
AspLys: 4.654 ± 1.885
AspLeu: 9.309 ± 0.582
AspMet: 2.992 ± 0.828
AspAsn: 1.995 ± 0.581
AspPro: 1.662 ± 0.876
AspGln: 0.665 ± 0.35
AspArg: 3.657 ± 1.926
AspSer: 3.657 ± 1.033
AspThr: 1.662 ± 0.46
AspVal: 6.649 ± 1.417
AspTrp: 0.332 ± 0.567
AspTyr: 5.652 ± 3.497
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.995 ± 0.989
GluCys: 1.33 ± 0.7
GluAsp: 2.66 ± 0.879
GluGlu: 1.662 ± 0.46
GluPhe: 4.322 ± 0.453
GluGly: 1.33 ± 0.383
GluHis: 1.995 ± 1.051
GluIle: 5.319 ± 0.719
GluLys: 6.649 ± 1.311
GluLeu: 5.652 ± 1.854
GluMet: 0.997 ± 0.379
GluAsn: 3.989 ± 2.161
GluPro: 2.66 ± 0.766
GluGln: 0.997 ± 0.525
GluArg: 1.662 ± 0.46
GluSer: 2.992 ± 1.041
GluThr: 2.992 ± 1.041
GluVal: 4.987 ± 1.498
GluTrp: 0.332 ± 0.175
GluTyr: 2.66 ± 1.872
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.657 ± 1.118
PheCys: 1.662 ± 0.876
PheAsp: 4.322 ± 1.301
PheGlu: 4.654 ± 0.999
PhePhe: 1.995 ± 0.755
PheGly: 2.66 ± 1.978
PheHis: 0.332 ± 0.175
PheIle: 1.662 ± 1.252
PheLys: 5.319 ± 1.866
PheLeu: 5.652 ± 1.051
PheMet: 0.997 ± 0.379
PheAsn: 3.324 ± 1.206
PhePro: 1.33 ± 1.005
PheGln: 1.33 ± 0.383
PheArg: 1.33 ± 0.383
PheSer: 3.657 ± 1.374
PheThr: 2.992 ± 2.349
PheVal: 4.322 ± 1.413
PheTrp: 0.665 ± 0.35
PheTyr: 2.66 ± 0.614
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.997 ± 2.33
GlyCys: 0.997 ± 0.525
GlyAsp: 5.984 ± 1.758
GlyGlu: 2.66 ± 1.099
GlyPhe: 3.989 ± 1.611
GlyGly: 2.66 ± 0.614
GlyHis: 0.332 ± 0.175
GlyIle: 3.324 ± 0.968
GlyLys: 5.652 ± 2.977
GlyLeu: 4.654 ± 0.291
GlyMet: 1.995 ± 1.051
GlyAsn: 3.657 ± 1.622
GlyPro: 0.665 ± 1.301
GlyGln: 0.665 ± 0.449
GlyArg: 1.995 ± 0.755
GlySer: 3.324 ± 1.206
GlyThr: 0.332 ± 0.567
GlyVal: 5.319 ± 1.533
GlyTrp: 0.997 ± 1.045
GlyTyr: 1.995 ± 0.581
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.665 ± 0.35
HisCys: 0.665 ± 0.449
HisAsp: 0.997 ± 0.525
HisGlu: 0.997 ± 0.525
HisPhe: 0.0 ± 0.0
HisGly: 1.33 ± 0.7
HisHis: 0.0 ± 0.0
HisIle: 2.66 ± 0.879
HisLys: 2.327 ± 0.666
HisLeu: 1.995 ± 0.581
HisMet: 0.332 ± 0.567
HisAsn: 0.997 ± 1.008
HisPro: 0.665 ± 0.35
HisGln: 0.0 ± 0.0
HisArg: 0.997 ± 0.525
HisSer: 1.33 ± 0.7
HisThr: 0.665 ± 0.449
HisVal: 1.662 ± 0.876
HisTrp: 0.332 ± 0.175
HisTyr: 0.997 ± 0.525
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.327 ± 2.526
IleCys: 1.33 ± 1.005
IleAsp: 1.662 ± 0.812
IleGlu: 2.66 ± 0.879
IlePhe: 3.989 ± 2.695
IleGly: 1.662 ± 0.977
IleHis: 0.997 ± 0.525
IleIle: 4.322 ± 2.876
IleLys: 4.322 ± 1.153
IleLeu: 6.316 ± 1.057
IleMet: 1.662 ± 0.981
IleAsn: 2.992 ± 1.041
IlePro: 3.657 ± 2.904
IleGln: 2.327 ± 0.666
IleArg: 2.66 ± 0.743
IleSer: 5.319 ± 2.198
IleThr: 1.995 ± 1.051
IleVal: 2.992 ± 3.193
IleTrp: 0.665 ± 1.301
IleTyr: 4.654 ± 1.429
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.995 ± 0.581
LysCys: 2.327 ± 1.226
LysAsp: 5.319 ± 2.961
LysGlu: 7.314 ± 2.065
LysPhe: 6.649 ± 0.54
LysGly: 3.657 ± 1.926
LysHis: 1.662 ± 0.876
LysIle: 5.984 ± 2.966
LysLys: 3.989 ± 1.15
LysLeu: 7.646 ± 2.191
LysMet: 1.995 ± 1.051
LysAsn: 5.652 ± 1.285
LysPro: 3.989 ± 0.235
LysGln: 0.997 ± 0.525
LysArg: 3.657 ± 1.418
LysSer: 2.992 ± 0.611
LysThr: 4.322 ± 2.012
LysVal: 8.976 ± 2.859
LysTrp: 0.332 ± 0.175
LysTyr: 1.662 ± 1.839
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.314 ± 1.151
LeuCys: 0.665 ± 0.35
LeuAsp: 6.649 ± 2.985
LeuGlu: 5.984 ± 0.957
LeuPhe: 3.657 ± 0.296
LeuGly: 6.316 ± 0.633
LeuHis: 1.995 ± 0.758
LeuIle: 5.652 ± 3.701
LeuLys: 8.976 ± 1.666
LeuLeu: 8.644 ± 2.971
LeuMet: 2.992 ± 1.784
LeuAsn: 6.981 ± 0.711
LeuPro: 0.997 ± 2.33
LeuGln: 1.662 ± 0.876
LeuArg: 2.992 ± 0.828
LeuSer: 5.652 ± 0.3
LeuThr: 4.654 ± 1.584
LeuVal: 11.968 ± 5.86
LeuTrp: 1.662 ± 0.977
LeuTyr: 2.992 ± 1.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.662 ± 0.876
MetCys: 0.997 ± 0.525
MetAsp: 0.997 ± 0.525
MetGlu: 1.33 ± 0.997
MetPhe: 3.324 ± 0.655
MetGly: 2.327 ± 0.724
MetHis: 0.665 ± 0.35
MetIle: 0.997 ± 1.045
MetLys: 1.662 ± 0.812
MetLeu: 2.66 ± 2.009
MetMet: 0.997 ± 1.045
MetAsn: 0.332 ± 0.567
MetPro: 0.665 ± 0.449
MetGln: 0.332 ± 1.215
MetArg: 1.662 ± 0.871
MetSer: 2.327 ± 1.226
MetThr: 0.665 ± 0.35
MetVal: 2.992 ± 1.041
MetTrp: 0.332 ± 0.175
MetTyr: 1.995 ± 0.755
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.327 ± 0.724
AsnCys: 0.997 ± 1.7
AsnAsp: 1.995 ± 0.581
AsnGlu: 1.995 ± 0.989
AsnPhe: 3.657 ± 0.795
AsnGly: 2.66 ± 3.186
AsnHis: 1.33 ± 0.7
AsnIle: 2.327 ± 1.03
AsnLys: 4.654 ± 2.743
AsnLeu: 5.652 ± 2.689
AsnMet: 1.995 ± 0.65
AsnAsn: 2.992 ± 1.873
AsnPro: 0.997 ± 0.379
AsnGln: 1.662 ± 0.46
AsnArg: 1.662 ± 0.46
AsnSer: 1.662 ± 0.46
AsnThr: 2.66 ± 1.401
AsnVal: 5.319 ± 2.831
AsnTrp: 0.665 ± 1.12
AsnTyr: 2.992 ± 0.828
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.327 ± 1.03
ProCys: 0.332 ± 0.175
ProAsp: 1.662 ± 0.46
ProGlu: 0.665 ± 0.35
ProPhe: 1.33 ± 1.005
ProGly: 3.324 ± 0.655
ProHis: 0.997 ± 1.008
ProIle: 1.33 ± 0.383
ProLys: 1.662 ± 0.876
ProLeu: 3.657 ± 1.476
ProMet: 1.33 ± 2.24
ProAsn: 0.997 ± 0.379
ProPro: 1.662 ± 4.759
ProGln: 0.997 ± 1.045
ProArg: 1.33 ± 0.7
ProSer: 1.662 ± 3.449
ProThr: 0.665 ± 1.12
ProVal: 3.989 ± 2.825
ProTrp: 0.0 ± 0.0
ProTyr: 0.997 ± 0.525
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.665 ± 1.12
GlnCys: 0.997 ± 1.008
GlnAsp: 1.995 ± 0.581
GlnGlu: 1.662 ± 0.46
GlnPhe: 0.665 ± 0.35
GlnGly: 1.662 ± 0.977
GlnHis: 0.332 ± 0.175
GlnIle: 1.995 ± 0.581
GlnLys: 0.997 ± 0.525
GlnLeu: 1.662 ± 0.871
GlnMet: 0.665 ± 1.12
GlnAsn: 0.665 ± 0.35
GlnPro: 1.33 ± 0.997
GlnGln: 1.33 ± 0.997
GlnArg: 0.997 ± 0.525
GlnSer: 1.33 ± 0.383
GlnThr: 0.332 ± 0.175
GlnVal: 1.995 ± 0.581
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.665 ± 0.35
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.995 ± 1.051
ArgCys: 0.665 ± 0.35
ArgAsp: 3.324 ± 1.751
ArgGlu: 2.327 ± 0.724
ArgPhe: 2.66 ± 1.401
ArgGly: 0.997 ± 0.525
ArgHis: 0.665 ± 0.35
ArgIle: 2.66 ± 1.186
ArgLys: 3.324 ± 0.655
ArgLeu: 3.989 ± 0.853
ArgMet: 0.997 ± 0.525
ArgAsn: 1.662 ± 0.871
ArgPro: 1.662 ± 0.977
ArgGln: 0.997 ± 0.379
ArgArg: 1.995 ± 0.758
ArgSer: 1.662 ± 0.876
ArgThr: 2.66 ± 0.743
ArgVal: 3.324 ± 0.968
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.66 ± 1.797
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.662 ± 0.812
SerCys: 1.33 ± 0.383
SerAsp: 4.987 ± 0.572
SerGlu: 5.319 ± 1.533
SerPhe: 2.327 ± 1.226
SerGly: 3.989 ± 0.853
SerHis: 1.33 ± 0.7
SerIle: 2.66 ± 1.099
SerLys: 5.984 ± 1.758
SerLeu: 4.322 ± 0.984
SerMet: 1.662 ± 0.876
SerAsn: 1.662 ± 0.871
SerPro: 1.33 ± 0.997
SerGln: 1.995 ± 1.051
SerArg: 2.66 ± 1.099
SerSer: 3.989 ± 1.161
SerThr: 1.33 ± 0.383
SerVal: 9.973 ± 3.618
SerTrp: 0.665 ± 0.449
SerTyr: 2.66 ± 1.401
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.324 ± 1.297
ThrCys: 1.33 ± 0.7
ThrAsp: 3.657 ± 1.118
ThrGlu: 2.992 ± 1.137
ThrPhe: 1.995 ± 0.581
ThrGly: 1.995 ± 0.755
ThrHis: 0.332 ± 0.175
ThrIle: 1.995 ± 0.758
ThrLys: 1.995 ± 1.051
ThrLeu: 1.662 ± 0.871
ThrMet: 0.997 ± 0.645
ThrAsn: 1.995 ± 0.755
ThrPro: 0.665 ± 0.35
ThrGln: 0.997 ± 1.045
ThrArg: 1.33 ± 0.383
ThrSer: 3.657 ± 1.926
ThrThr: 2.327 ± 1.226
ThrVal: 2.992 ± 1.576
ThrTrp: 0.665 ± 1.301
ThrTyr: 2.992 ± 0.828
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.987 ± 0.572
ValCys: 1.995 ± 0.758
ValAsp: 6.649 ± 2.412
ValGlu: 3.657 ± 1.476
ValPhe: 3.989 ± 2.016
ValGly: 3.989 ± 1.725
ValHis: 3.657 ± 1.033
ValIle: 5.652 ± 1.84
ValLys: 8.976 ± 2.226
ValLeu: 11.968 ± 4.868
ValMet: 1.995 ± 0.581
ValAsn: 4.322 ± 1.49
ValPro: 3.657 ± 1.033
ValGln: 1.995 ± 0.581
ValArg: 4.322 ± 1.49
ValSer: 8.311 ± 1.179
ValThr: 4.322 ± 1.713
ValVal: 8.976 ± 4.335
ValTrp: 0.665 ± 0.35
ValTyr: 4.322 ± 1.153
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.665 ± 0.449
TrpGlu: 0.997 ± 1.045
TrpPhe: 0.997 ± 0.379
TrpGly: 0.997 ± 1.045
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.662 ± 0.876
TrpLeu: 0.997 ± 0.525
TrpMet: 0.0 ± 0.0
TrpAsn: 0.332 ± 1.215
TrpPro: 0.332 ± 0.175
TrpGln: 0.0 ± 0.0
TrpArg: 0.665 ± 1.12
TrpSer: 0.665 ± 0.449
TrpThr: 0.0 ± 0.0
TrpVal: 0.332 ± 1.215
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.665 ± 0.449
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.662 ± 0.812
TyrCys: 0.665 ± 0.35
TyrAsp: 5.319 ± 1.533
TyrGlu: 1.33 ± 0.383
TyrPhe: 2.327 ± 1.257
TyrGly: 3.989 ± 1.15
TyrHis: 0.665 ± 0.35
TyrIle: 1.995 ± 3.594
TyrLys: 3.324 ± 0.655
TyrLeu: 5.319 ± 0.735
TyrMet: 1.33 ± 0.898
TyrAsn: 3.657 ± 0.795
TyrPro: 0.997 ± 1.045
TyrGln: 1.33 ± 0.383
TyrArg: 1.995 ± 1.347
TyrSer: 2.992 ± 1.041
TyrThr: 2.327 ± 0.666
TyrVal: 5.319 ± 1.418
TyrTrp: 0.665 ± 0.35
TyrTyr: 2.992 ± 1.704
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3009 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
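One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. This is an illustration under stated assumptions, not the database's actual procedure: the use of the sample standard deviation, the fixed seed, and the function names `dipeptide_freq` and `bootstrap_error` are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(sequences, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)
```

With only 3 proteins, as in this proteome, each resample can take few distinct compositions, which may explain why some errors in the table are larger than the corresponding frequencies.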

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski