Amino acid dipeptide frequency for Hazara virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all 20 standard amino acids equally likely, each dipeptide would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
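
The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by Proteome-pI: the function names, the toy sequences, and the choice to normalise by the total number of overlapping dipeptides are all assumptions. It counts overlapping residue pairs within each protein, maps non-standard letters to X (Xaa), reports frequencies in per mille, and compares each value with the uniform expectation of 1/400 = 2.5‰.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
UNIFORM_PER_MILLE = 1000.0 / 400   # 1/400 = 0.25 % = 2.5 per mille


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Assumption: frequencies are normalised by the total number of dipeptides
    in all sequences; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Anything outside the 20 standard letters is treated as X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Made-up toy sequences, not Hazara virus proteins.
toy_proteins = ["MKLLLA", "AAGLL"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    # value * 1e-3 converts per mille to a plain fraction.
    print(pair, round(value, 3), representation(value), value * 1e-3)
```

With real data, a value such as 15.614‰ corresponds to a fraction of 0.015614, which is well above the uniform expectation of 0.0025.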

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.745AlaAla: 2.745 ± 0.96
1.201AlaCys: 1.201 ± 0.499
1.887AlaAsp: 1.887 ± 0.116
3.775AlaGlu: 3.775 ± 1.847
1.373AlaPhe: 1.373 ± 0.785
2.745AlaGly: 2.745 ± 0.984
0.686AlaHis: 0.686 ± 0.193
2.574AlaIle: 2.574 ± 0.267
2.917AlaLys: 2.917 ± 1.016
5.491AlaLeu: 5.491 ± 2.059
1.201AlaMet: 1.201 ± 0.334
1.887AlaAsn: 1.887 ± 0.116
1.544AlaPro: 1.544 ± 0.723
2.231AlaGln: 2.231 ± 1.7
2.231AlaArg: 2.231 ± 0.631
4.804AlaSer: 4.804 ± 2.747
3.432AlaThr: 3.432 ± 1.04
3.26AlaVal: 3.26 ± 0.999
1.03AlaTrp: 1.03 ± 0.595
1.544AlaTyr: 1.544 ± 0.747
0.0AlaXaa: 0.0 ± 0.0
Cys
1.544CysAla: 1.544 ± 1.884
1.373CysCys: 1.373 ± 0.387
1.544CysAsp: 1.544 ± 0.455
1.887CysGlu: 1.887 ± 0.598
1.201CysPhe: 1.201 ± 0.334
0.858CysGly: 0.858 ± 0.82
0.343CysHis: 0.343 ± 0.192
2.059CysIle: 2.059 ± 0.023
1.373CysLys: 1.373 ± 0.448
3.432CysLeu: 3.432 ± 1.206
0.172CysMet: 0.172 ± 0.222
1.03CysAsn: 1.03 ± 0.758
1.887CysPro: 1.887 ± 1.297
0.686CysGln: 0.686 ± 0.193
1.887CysArg: 1.887 ± 0.116
3.26CysSer: 3.26 ± 1.442
2.231CysThr: 2.231 ± 1.141
2.231CysVal: 2.231 ± 0.244
0.515CysTrp: 0.515 ± 0.379
1.03CysTyr: 1.03 ± 1.042
0.0CysXaa: 0.0 ± 0.0
Asp
2.574AspAla: 2.574 ± 1.646
2.745AspCys: 2.745 ± 0.782
2.231AspAsp: 2.231 ± 0.755
3.775AspGlu: 3.775 ± 1.084
1.716AspPhe: 1.716 ± 0.147
2.917AspGly: 2.917 ± 0.838
0.858AspHis: 0.858 ± 0.265
4.118AspIle: 4.118 ± 0.255
4.461AspLys: 4.461 ± 0.48
5.662AspLeu: 5.662 ± 1.738
1.03AspMet: 1.03 ± 0.312
3.26AspAsn: 3.26 ± 0.778
1.544AspPro: 1.544 ± 0.454
1.03AspGln: 1.03 ± 0.925
2.574AspArg: 2.574 ± 0.964
4.29AspSer: 4.29 ± 1.491
2.402AspThr: 2.402 ± 1.124
2.917AspVal: 2.917 ± 0.337
0.343AspTrp: 0.343 ± 0.48
2.402AspTyr: 2.402 ± 0.717
0.0AspXaa: 0.0 ± 0.0
Glu
2.745GluAla: 2.745 ± 0.623
2.231GluCys: 2.231 ± 0.244
4.461GluAsp: 4.461 ± 1.32
6.52GluGlu: 6.52 ± 1.283
2.231GluPhe: 2.231 ± 0.755
5.491GluGly: 5.491 ± 1.24
1.201GluHis: 1.201 ± 0.853
3.089GluIle: 3.089 ± 1.087
4.461GluLys: 4.461 ± 0.708
10.124GluLeu: 10.124 ± 1.305
1.716GluMet: 1.716 ± 0.422
2.402GluAsn: 2.402 ± 1.167
1.544GluPro: 1.544 ± 0.62
1.373GluGln: 1.373 ± 0.387
3.26GluArg: 3.26 ± 0.65
4.976GluSer: 4.976 ± 0.486
4.804GluThr: 4.804 ± 1.512
5.319GluVal: 5.319 ± 1.117
0.858GluTrp: 0.858 ± 0.335
1.716GluTyr: 1.716 ± 0.731
0.0GluXaa: 0.0 ± 0.0
Phe
1.887PheAla: 1.887 ± 1.196
1.373PheCys: 1.373 ± 1.088
1.544PheAsp: 1.544 ± 0.308
2.574PheGlu: 2.574 ± 0.828
2.231PhePhe: 2.231 ± 0.749
1.544PheGly: 1.544 ± 0.147
0.343PheHis: 0.343 ± 0.444
1.716PheIle: 1.716 ± 0.53
2.059PheLys: 2.059 ± 0.607
4.633PheLeu: 4.633 ± 0.816
0.515PheMet: 0.515 ± 0.287
1.373PheAsn: 1.373 ± 0.775
1.201PhePro: 1.201 ± 0.52
1.373PheGln: 1.373 ± 0.2
1.373PheArg: 1.373 ± 0.403
4.461PheSer: 4.461 ± 1.025
2.574PheThr: 2.574 ± 0.839
2.059PheVal: 2.059 ± 0.607
0.172PheTrp: 0.172 ± 0.096
1.544PheTyr: 1.544 ± 0.862
0.0PheXaa: 0.0 ± 0.0
Gly
2.231GlyAla: 2.231 ± 1.365
1.716GlyCys: 1.716 ± 1.356
2.231GlyAsp: 2.231 ± 1.459
3.775GlyGlu: 3.775 ± 0.233
1.544GlyPhe: 1.544 ± 0.147
3.432GlyGly: 3.432 ± 0.675
0.858GlyHis: 0.858 ± 0.335
3.26GlyIle: 3.26 ± 0.65
5.319GlyLys: 5.319 ± 1.963
6.863GlyLeu: 6.863 ± 0.791
1.544GlyMet: 1.544 ± 0.454
1.544GlyAsn: 1.544 ± 0.723
2.745GlyPro: 2.745 ± 1.502
1.373GlyGln: 1.373 ± 0.403
3.089GlyArg: 3.089 ± 0.717
6.177GlySer: 6.177 ± 1.839
2.745GlyThr: 2.745 ± 0.561
2.574GlyVal: 2.574 ± 0.655
0.343GlyTrp: 0.343 ± 0.48
1.716GlyTyr: 1.716 ± 0.67
0.0GlyXaa: 0.0 ± 0.0
His
1.373HisAla: 1.373 ± 0.387
1.03HisCys: 1.03 ± 0.304
0.686HisAsp: 0.686 ± 0.329
0.343HisGlu: 0.343 ± 0.192
0.858HisPhe: 0.858 ± 0.54
1.544HisGly: 1.544 ± 0.455
0.172HisHis: 0.172 ± 0.096
1.03HisIle: 1.03 ± 0.348
1.201HisLys: 1.201 ± 0.276
2.917HisLeu: 2.917 ± 0.433
0.515HisMet: 0.515 ± 0.287
0.686HisAsn: 0.686 ± 0.329
0.686HisPro: 0.686 ± 0.438
0.515HisGln: 0.515 ± 0.449
1.201HisArg: 1.201 ± 0.334
2.917HisSer: 2.917 ± 1.522
0.858HisThr: 0.858 ± 0.335
1.03HisVal: 1.03 ± 0.304
0.515HisTrp: 0.515 ± 0.152
0.515HisTyr: 0.515 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
2.574IleAla: 2.574 ± 0.688
1.03IleCys: 1.03 ± 0.304
2.574IleAsp: 2.574 ± 0.16
2.917IleGlu: 2.917 ± 0.922
1.887IlePhe: 1.887 ± 0.611
1.887IleGly: 1.887 ± 0.497
1.201IleHis: 1.201 ± 0.461
3.432IleIle: 3.432 ± 0.866
4.29IleLys: 4.29 ± 0.341
5.834IleLeu: 5.834 ± 1.089
1.03IleMet: 1.03 ± 0.647
2.745IleAsn: 2.745 ± 2.148
1.716IlePro: 1.716 ± 0.422
3.26IleGln: 3.26 ± 0.477
2.745IleArg: 2.745 ± 0.929
4.633IleSer: 4.633 ± 1.113
3.089IleThr: 3.089 ± 0.858
3.432IleVal: 3.432 ± 0.295
0.515IleTrp: 0.515 ± 0.152
1.373IleTyr: 1.373 ± 0.876
0.0IleXaa: 0.0 ± 0.0
Lys
3.775LysAla: 3.775 ± 4.684
1.544LysCys: 1.544 ± 0.624
5.491LysAsp: 5.491 ± 1.245
5.319LysGlu: 5.319 ± 0.756
2.574LysPhe: 2.574 ± 0.839
4.29LysGly: 4.29 ± 1.115
1.716LysHis: 1.716 ± 0.147
3.775LysIle: 3.775 ± 0.484
5.319LysLys: 5.319 ± 0.899
8.236LysLeu: 8.236 ± 0.63
1.544LysMet: 1.544 ± 0.747
4.118LysAsn: 4.118 ± 0.045
2.059LysPro: 2.059 ± 0.607
3.089LysGln: 3.089 ± 0.458
4.976LysArg: 4.976 ± 1.024
3.775LysSer: 3.775 ± 0.484
3.603LysThr: 3.603 ± 0.829
5.662LysVal: 5.662 ± 0.822
0.858LysTrp: 0.858 ± 0.452
1.544LysTyr: 1.544 ± 0.723
0.0LysXaa: 0.0 ± 0.0
Leu
5.834LeuAla: 5.834 ± 2.026
2.917LeuCys: 2.917 ± 0.959
6.692LeuAsp: 6.692 ± 1.148
7.207LeuGlu: 7.207 ± 1.068
4.461LeuPhe: 4.461 ± 0.566
5.148LeuGly: 5.148 ± 0.32
3.432LeuHis: 3.432 ± 0.383
6.863LeuIle: 6.863 ± 1.69
8.751LeuLys: 8.751 ± 0.917
15.614LeuLeu: 15.614 ± 3.063
1.373LeuMet: 1.373 ± 0.387
5.834LeuAsn: 5.834 ± 0.716
3.432LeuPro: 3.432 ± 0.964
3.775LeuGln: 3.775 ± 1.049
6.692LeuArg: 6.692 ± 1.353
10.467LeuSer: 10.467 ± 2.34
7.035LeuThr: 7.035 ± 1.641
7.035LeuVal: 7.035 ± 0.819
0.515LeuTrp: 0.515 ± 0.449
2.917LeuTyr: 2.917 ± 0.838
0.0LeuXaa: 0.0 ± 0.0
Met
0.686MetAla: 0.686 ± 0.193
0.172MetCys: 0.172 ± 0.096
1.03MetAsp: 1.03 ± 0.925
1.887MetGlu: 1.887 ± 1.223
1.03MetPhe: 1.03 ± 0.575
1.373MetGly: 1.373 ± 0.387
0.686MetHis: 0.686 ± 0.392
0.858MetIle: 0.858 ± 0.447
1.03MetLys: 1.03 ± 0.575
2.402MetLeu: 2.402 ± 0.172
0.858MetMet: 0.858 ± 0.479
1.201MetAsn: 1.201 ± 0.436
0.172MetPro: 0.172 ± 0.096
1.544MetGln: 1.544 ± 0.308
0.515MetArg: 0.515 ± 0.287
1.716MetSer: 1.716 ± 0.212
0.858MetThr: 0.858 ± 0.54
0.686MetVal: 0.686 ± 0.329
0.172MetTrp: 0.172 ± 0.096
0.343MetTyr: 0.343 ± 0.164
0.0MetXaa: 0.0 ± 0.0
Asn
1.716AsnAla: 1.716 ± 1.897
1.544AsnCys: 1.544 ± 0.454
1.201AsnAsp: 1.201 ± 0.289
2.231AsnGlu: 2.231 ± 0.244
1.201AsnPhe: 1.201 ± 0.436
1.373AsnGly: 1.373 ± 0.974
0.686AsnHis: 0.686 ± 0.383
2.231AsnIle: 2.231 ± 1.107
3.775AsnLys: 3.775 ± 1.847
4.633AsnLeu: 4.633 ± 1.361
0.343AsnMet: 0.343 ± 0.192
1.716AsnAsn: 1.716 ± 0.894
3.089AsnPro: 3.089 ± 0.935
1.716AsnGln: 1.716 ± 0.822
2.574AsnArg: 2.574 ± 0.593
4.633AsnSer: 4.633 ± 1.297
1.887AsnThr: 1.887 ± 0.814
3.775AsnVal: 3.775 ± 1.222
1.201AsnTrp: 1.201 ± 0.289
1.716AsnTyr: 1.716 ± 0.67
0.0AsnXaa: 0.0 ± 0.0
Pro
2.231ProAla: 2.231 ± 0.244
1.201ProCys: 1.201 ± 1.181
1.887ProAsp: 1.887 ± 0.409
2.745ProGlu: 2.745 ± 0.774
1.201ProPhe: 1.201 ± 0.289
1.544ProGly: 1.544 ± 0.59
0.343ProHis: 0.343 ± 0.164
1.373ProIle: 1.373 ± 0.578
2.745ProLys: 2.745 ± 0.561
2.574ProLeu: 2.574 ± 0.445
0.343ProMet: 0.343 ± 0.164
1.201ProAsn: 1.201 ± 0.334
1.201ProPro: 1.201 ± 1.181
0.858ProGln: 0.858 ± 0.335
2.402ProArg: 2.402 ± 0.748
4.29ProSer: 4.29 ± 0.761
1.887ProThr: 1.887 ± 0.611
2.745ProVal: 2.745 ± 1.976
0.515ProTrp: 0.515 ± 0.463
0.686ProTyr: 0.686 ± 0.193
0.0ProXaa: 0.0 ± 0.0
Gln
1.544GlnAla: 1.544 ± 0.62
1.03GlnCys: 1.03 ± 0.493
1.03GlnAsp: 1.03 ± 0.476
3.089GlnGlu: 3.089 ± 0.858
1.716GlnPhe: 1.716 ± 0.67
1.716GlnGly: 1.716 ± 0.737
1.03GlnHis: 1.03 ± 0.493
2.402GlnIle: 2.402 ± 0.183
2.745GlnLys: 2.745 ± 2.11
3.775GlnLeu: 3.775 ± 0.993
1.03GlnMet: 1.03 ± 0.348
1.03GlnAsn: 1.03 ± 0.362
0.343GlnPro: 0.343 ± 0.164
2.231GlnGln: 2.231 ± 0.996
0.686GlnArg: 0.686 ± 0.329
3.946GlnSer: 3.946 ± 1.03
2.745GlnThr: 2.745 ± 1.055
2.745GlnVal: 2.745 ± 0.62
0.343GlnTrp: 0.343 ± 0.164
0.515GlnTyr: 0.515 ± 0.152
0.0GlnXaa: 0.0 ± 0.0
Arg
0.858ArgAla: 0.858 ± 0.335
2.402ArgCys: 2.402 ± 0.669
4.118ArgAsp: 4.118 ± 1.959
2.402ArgGlu: 2.402 ± 1.167
2.574ArgPhe: 2.574 ± 0.839
1.716ArgGly: 1.716 ± 1.128
1.544ArgHis: 1.544 ± 0.454
1.887ArgIle: 1.887 ± 0.814
2.574ArgLys: 2.574 ± 0.984
8.751ArgLeu: 8.751 ± 2.03
1.716ArgMet: 1.716 ± 0.212
2.574ArgAsn: 2.574 ± 0.4
1.716ArgPro: 1.716 ± 0.822
2.917ArgGln: 2.917 ± 0.959
4.118ArgArg: 4.118 ± 0.645
3.946ArgSer: 3.946 ± 1.301
2.402ArgThr: 2.402 ± 0.466
3.089ArgVal: 3.089 ± 1.24
0.172ArgTrp: 0.172 ± 0.096
1.201ArgTyr: 1.201 ± 0.67
0.0ArgXaa: 0.0 ± 0.0
Ser
6.349SerAla: 6.349 ± 1.499
3.089SerCys: 3.089 ± 1.649
4.804SerAsp: 4.804 ± 1.421
8.236SerGlu: 8.236 ± 2.005
3.432SerPhe: 3.432 ± 0.295
6.349SerGly: 6.349 ± 1.135
2.231SerHis: 2.231 ± 0.95
4.976SerIle: 4.976 ± 0.732
4.633SerLys: 4.633 ± 1.361
7.721SerLeu: 7.721 ± 1.018
1.03SerMet: 1.03 ± 0.362
4.804SerAsn: 4.804 ± 0.786
3.089SerPro: 3.089 ± 0.615
2.745SerGln: 2.745 ± 0.19
4.976SerArg: 4.976 ± 1.654
10.124SerSer: 10.124 ± 1.268
7.378SerThr: 7.378 ± 0.963
4.976SerVal: 4.976 ± 1.13
1.544SerTrp: 1.544 ± 0.624
1.887SerTyr: 1.887 ± 0.814
0.0SerXaa: 0.0 ± 0.0
Thr
3.432ThrAla: 3.432 ± 0.746
1.373ThrCys: 1.373 ± 1.485
3.26ThrAsp: 3.26 ± 0.65
5.319ThrGlu: 5.319 ± 0.868
1.887ThrPhe: 1.887 ± 0.116
4.633ThrGly: 4.633 ± 0.71
1.201ThrHis: 1.201 ± 0.436
2.231ThrIle: 2.231 ± 0.645
5.491ThrLys: 5.491 ± 0.536
6.692ThrLeu: 6.692 ± 1.936
0.858ThrMet: 0.858 ± 0.265
1.887ThrAsn: 1.887 ± 0.2
2.574ThrPro: 2.574 ± 1.005
1.544ThrGln: 1.544 ± 0.454
1.373ThrArg: 1.373 ± 0.2
5.491ThrSer: 5.491 ± 0.727
3.26ThrThr: 3.26 ± 1.046
4.461ThrVal: 4.461 ± 1.06
0.858ThrTrp: 0.858 ± 0.452
2.231ThrTyr: 2.231 ± 0.076
0.0ThrXaa: 0.0 ± 0.0
Val
2.917ValAla: 2.917 ± 1.016
1.201ValCys: 1.201 ± 0.334
4.461ValAsp: 4.461 ± 0.669
4.976ValGlu: 4.976 ± 0.831
2.059ValPhe: 2.059 ± 0.277
3.26ValGly: 3.26 ± 1.391
1.544ValHis: 1.544 ± 0.866
2.745ValIle: 2.745 ± 0.363
5.834ValLys: 5.834 ± 0.84
5.834ValLeu: 5.834 ± 0.509
1.201ValMet: 1.201 ± 0.363
2.059ValAsn: 2.059 ± 0.762
2.402ValPro: 2.402 ± 0.749
2.402ValGln: 2.402 ± 0.669
3.946ValArg: 3.946 ± 0.523
6.349ValSer: 6.349 ± 1.313
4.633ValThr: 4.633 ± 0.483
5.319ValVal: 5.319 ± 1.985
0.515ValTrp: 0.515 ± 0.463
1.373ValTyr: 1.373 ± 0.387
0.0ValXaa: 0.0 ± 0.0
Trp
0.343TrpAla: 0.343 ± 0.48
0.686TrpCys: 0.686 ± 0.392
0.515TrpAsp: 0.515 ± 0.449
0.515TrpGlu: 0.515 ± 0.379
0.343TrpPhe: 0.343 ± 0.48
1.716TrpGly: 1.716 ± 0.737
0.0TrpHis: 0.0 ± 0.0
0.172TrpIle: 0.172 ± 0.096
1.03TrpLys: 1.03 ± 0.476
1.716TrpLeu: 1.716 ± 0.147
0.343TrpMet: 0.343 ± 0.164
0.172TrpAsn: 0.172 ± 0.222
0.515TrpPro: 0.515 ± 0.666
0.0TrpGln: 0.0 ± 0.0
0.858TrpArg: 0.858 ± 0.335
0.686TrpSer: 0.686 ± 0.329
0.515TrpThr: 0.515 ± 0.379
1.03TrpVal: 1.03 ± 0.595
0.343TrpTrp: 0.343 ± 0.164
0.343TrpTyr: 0.343 ± 0.48
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.201TyrAla: 1.201 ± 0.334
0.686TyrCys: 0.686 ± 0.599
1.201TyrAsp: 1.201 ± 0.289
1.03TyrGlu: 1.03 ± 0.575
1.03TyrPhe: 1.03 ± 0.304
1.544TyrGly: 1.544 ± 0.454
0.515TyrHis: 0.515 ± 0.287
1.544TyrIle: 1.544 ± 0.62
3.26TyrLys: 3.26 ± 2.556
3.089TyrLeu: 3.089 ± 0.335
0.686TyrMet: 0.686 ± 0.544
1.716TyrAsn: 1.716 ± 0.482
0.515TyrPro: 0.515 ± 0.449
1.03TyrGln: 1.03 ± 0.861
1.201TyrArg: 1.201 ± 0.436
3.432TyrSer: 3.432 ± 1.299
1.716TyrThr: 1.716 ± 0.905
0.686TyrVal: 0.686 ± 0.193
0.515TyrTrp: 0.515 ± 0.463
1.201TyrTyr: 1.201 ± 0.499
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5829 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
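
As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and takes the standard deviation of the resulting dipeptide frequencies. The exact procedure used by Proteome-pI is not described here, so the function names, the use of the standard deviation as the error, and the toy sequences are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(resample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not Hazara virus proteins.
toy_proteins = ["MKLLLA", "AAGLL", "LLLLKK"]
mean = pair_frequency(toy_proteins, "LL")
error = bootstrap_error(toy_proteins, "LL")
print(f"LL: {mean:.3f} ± {error:.3f} (per mille)")
```

With only three proteins, as in this proteome, each resample contains few distinct proteins, so errors estimated this way can be large relative to the mean.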

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski