Amino acid dipeptide frequency for Wigeon coronavirus HKU20

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
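As a rough illustration of how such frequencies can be obtained, the Python sketch below counts overlapping dipeptides within each protein and reports them in per mille. The function name, the mapping of non-standard residues to "X" (for Xaa) and the normalization by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not described here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of each overlapping dipeptide.

    Dipeptides are counted inside each protein only, never across
    the boundary between two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard alphabet to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}
```

For instance, `dipeptide_frequencies(["AAA", "AC"])` sees three dipeptides in total (AA twice, AC once), so AA gets about 666.7‰ and AC about 333.3‰.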

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.
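The expected value and the colour marking can be sketched in a few lines of Python. The function names are hypothetical, and leaving values exactly equal to the expectation unmarked is an assumption, since the page does not say how ties are handled.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2

def mark(freq_permille, expected_permille):
    """Return the colour used for a table cell (tie handling is assumed)."""
    if freq_permille > expected_permille:
        return "red"    # more frequent than expected
    if freq_permille < expected_permille:
        return "blue"   # underrepresented
    return "none"

expected = uniform_expectation_permille()   # 20 amino acids -> 2.5 per mille
print(mark(5.194, expected))                # AlaAla is above 2.5 per mille
print(mark(1.771, expected))                # AlaCys is below 2.5 per mille
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰.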

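All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. For example, the AlaAla value of 5.194‰ corresponds to a fraction of 0.005194 (about 0.52%).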

For more information, see the "sequence space" article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.194 ± 0.982
AlaCys: 1.771 ± 0.436
AlaAsp: 4.013 ± 1.417
AlaGlu: 2.361 ± 0.618
AlaPhe: 3.305 ± 0.925
AlaGly: 2.715 ± 0.725
AlaHis: 1.771 ± 0.689
AlaIle: 5.548 ± 0.714
AlaLys: 3.541 ± 0.985
AlaLeu: 6.964 ± 1.34
AlaMet: 1.18 ± 0.304
AlaAsn: 4.721 ± 0.81
AlaPro: 2.361 ± 0.705
AlaGln: 2.597 ± 0.571
AlaArg: 2.833 ± 1.017
AlaSer: 5.548 ± 1.355
AlaThr: 4.839 ± 0.63
AlaVal: 5.548 ± 1.036
AlaTrp: 0.236 ± 0.289
AlaTyr: 3.659 ± 0.777
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.534 ± 0.395
CysCys: 1.18 ± 0.799
CysAsp: 2.007 ± 0.418
CysGlu: 0.472 ± 0.449
CysPhe: 1.18 ± 0.493
CysGly: 1.653 ± 0.435
CysHis: 0.236 ± 0.261
CysIle: 2.125 ± 0.679
CysLys: 1.062 ± 0.365
CysLeu: 1.18 ± 0.574
CysMet: 0.472 ± 0.55
CysAsn: 1.534 ± 0.427
CysPro: 1.062 ± 0.284
CysGln: 0.944 ± 0.288
CysArg: 0.708 ± 0.361
CysSer: 1.062 ± 0.423
CysThr: 2.361 ± 0.553
CysVal: 3.541 ± 0.729
CysTrp: 0.236 ± 0.12
CysTyr: 2.007 ± 0.656
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.013 ± 1.204
AspCys: 1.771 ± 0.549
AspAsp: 3.305 ± 1.11
AspGlu: 2.243 ± 0.998
AspPhe: 3.659 ± 0.714
AspGly: 4.603 ± 0.586
AspHis: 0.59 ± 0.604
AspIle: 3.069 ± 0.614
AspLys: 1.653 ± 0.72
AspLeu: 3.895 ± 0.638
AspMet: 1.18 ± 0.44
AspAsn: 3.187 ± 1.019
AspPro: 2.479 ± 1.276
AspGln: 1.653 ± 0.36
AspArg: 2.007 ± 0.572
AspSer: 4.367 ± 1.588
AspThr: 3.423 ± 0.86
AspVal: 5.43 ± 1.157
AspTrp: 0.472 ± 0.222
AspTyr: 3.187 ± 0.563
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.534 ± 0.301
GluCys: 1.298 ± 0.315
GluAsp: 0.944 ± 0.271
GluGlu: 1.062 ± 0.316
GluPhe: 1.889 ± 0.592
GluGly: 2.597 ± 0.643
GluHis: 0.59 ± 0.301
GluIle: 1.771 ± 0.498
GluLys: 1.653 ± 0.459
GluLeu: 3.777 ± 1.09
GluMet: 0.59 ± 0.244
GluAsn: 1.534 ± 0.783
GluPro: 1.889 ± 0.411
GluGln: 2.243 ± 0.837
GluArg: 1.534 ± 0.715
GluSer: 2.361 ± 0.528
GluThr: 2.007 ± 0.452
GluVal: 2.951 ± 0.718
GluTrp: 0.59 ± 0.266
GluTyr: 1.653 ± 0.376
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.125 ± 1.26
PheCys: 1.534 ± 0.389
PheAsp: 2.479 ± 0.71
PheGlu: 2.007 ± 0.474
PhePhe: 2.125 ± 1.175
PheGly: 2.007 ± 0.704
PheHis: 0.59 ± 0.449
PheIle: 2.833 ± 0.591
PheLys: 2.479 ± 0.754
PheLeu: 3.305 ± 1.002
PheMet: 1.062 ± 0.459
PheAsn: 2.243 ± 0.596
PhePro: 0.708 ± 0.373
PheGln: 1.18 ± 0.782
PheArg: 1.653 ± 0.566
PheSer: 4.249 ± 0.474
PheThr: 2.361 ± 1.141
PheVal: 5.312 ± 0.473
PheTrp: 0.826 ± 0.403
PheTyr: 3.659 ± 0.566
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.013 ± 0.662
GlyCys: 1.653 ± 0.543
GlyAsp: 4.603 ± 1.057
GlyGlu: 2.243 ± 0.489
GlyPhe: 2.715 ± 0.503
GlyGly: 4.013 ± 0.8
GlyHis: 1.298 ± 0.571
GlyIle: 3.187 ± 0.596
GlyLys: 2.951 ± 0.909
GlyLeu: 4.839 ± 0.961
GlyMet: 0.59 ± 0.435
GlyAsn: 2.479 ± 0.758
GlyPro: 1.771 ± 0.407
GlyGln: 1.889 ± 0.851
GlyArg: 2.125 ± 0.451
GlySer: 3.305 ± 0.464
GlyThr: 4.249 ± 1.05
GlyVal: 4.367 ± 0.743
GlyTrp: 0.354 ± 0.142
GlyTyr: 2.479 ± 0.661
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.125 ± 0.765
HisCys: 0.826 ± 0.271
HisAsp: 0.944 ± 0.288
HisGlu: 0.708 ± 0.285
HisPhe: 1.18 ± 0.602
HisGly: 0.826 ± 0.655
HisHis: 0.472 ± 0.135
HisIle: 2.007 ± 0.63
HisLys: 0.944 ± 0.208
HisLeu: 1.889 ± 0.421
HisMet: 0.59 ± 0.405
HisAsn: 1.534 ± 0.405
HisPro: 0.472 ± 0.247
HisGln: 0.472 ± 0.241
HisArg: 0.708 ± 0.361
HisSer: 1.062 ± 0.496
HisThr: 1.298 ± 0.678
HisVal: 2.479 ± 0.873
HisTrp: 0.236 ± 0.473
HisTyr: 0.826 ± 0.232
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.367 ± 0.589
IleCys: 1.416 ± 0.406
IleAsp: 2.833 ± 1.066
IleGlu: 2.007 ± 0.376
IlePhe: 2.951 ± 0.637
IleGly: 2.597 ± 1.106
IleHis: 0.826 ± 0.436
IleIle: 3.305 ± 0.876
IleLys: 3.187 ± 1.24
IleLeu: 5.312 ± 1.126
IleMet: 1.062 ± 0.284
IleAsn: 3.305 ± 1.561
IlePro: 3.777 ± 1.391
IleGln: 1.889 ± 0.537
IleArg: 2.479 ± 0.316
IleSer: 5.312 ± 1.608
IleThr: 3.187 ± 1.549
IleVal: 6.256 ± 1.184
IleTrp: 0.59 ± 0.564
IleTyr: 3.305 ± 0.539
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.367 ± 0.859
LysCys: 1.298 ± 0.742
LysAsp: 2.833 ± 1.002
LysGlu: 1.062 ± 0.229
LysPhe: 2.243 ± 0.405
LysGly: 2.125 ± 0.522
LysHis: 1.534 ± 0.451
LysIle: 1.771 ± 0.331
LysLys: 2.479 ± 1.159
LysLeu: 5.902 ± 2.152
LysMet: 0.59 ± 0.594
LysAsn: 1.653 ± 0.63
LysPro: 2.951 ± 1.274
LysGln: 2.243 ± 0.803
LysArg: 2.125 ± 1.226
LysSer: 3.423 ± 0.908
LysThr: 2.715 ± 0.896
LysVal: 4.013 ± 0.914
LysTrp: 0.472 ± 0.135
LysTyr: 2.715 ± 0.584
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.908 ± 0.625
LeuCys: 2.479 ± 0.815
LeuAsp: 4.839 ± 0.899
LeuGlu: 3.305 ± 0.802
LeuPhe: 5.194 ± 1.095
LeuGly: 4.958 ± 0.487
LeuHis: 2.243 ± 0.711
LeuIle: 4.131 ± 1.632
LeuLys: 5.076 ± 1.312
LeuLeu: 9.561 ± 1.982
LeuMet: 1.771 ± 0.407
LeuAsn: 5.548 ± 1.518
LeuPro: 3.895 ± 0.898
LeuGln: 4.958 ± 1.253
LeuArg: 2.951 ± 1.264
LeuSer: 6.374 ± 2.04
LeuThr: 7.436 ± 1.11
LeuVal: 7.318 ± 1.088
LeuTrp: 0.708 ± 0.6
LeuTyr: 4.013 ± 1.424
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.771 ± 0.45
MetCys: 0.59 ± 0.558
MetAsp: 0.472 ± 0.357
MetGlu: 0.708 ± 0.19
MetPhe: 0.59 ± 0.31
MetGly: 0.708 ± 0.285
MetHis: 0.472 ± 0.279
MetIle: 0.472 ± 0.135
MetLys: 0.826 ± 0.421
MetLeu: 2.715 ± 0.751
MetMet: 0.236 ± 0.12
MetAsn: 0.708 ± 0.319
MetPro: 0.59 ± 0.301
MetGln: 0.59 ± 0.244
MetArg: 0.59 ± 0.262
MetSer: 1.534 ± 0.82
MetThr: 1.18 ± 0.392
MetVal: 2.243 ± 0.821
MetTrp: 0.236 ± 0.12
MetTyr: 1.416 ± 0.406
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.721 ± 1.091
AsnCys: 1.416 ± 0.386
AsnAsp: 1.889 ± 0.493
AsnGlu: 1.416 ± 0.38
AsnPhe: 2.007 ± 1.1
AsnGly: 4.367 ± 0.801
AsnHis: 0.826 ± 0.421
AsnIle: 2.951 ± 0.635
AsnLys: 3.069 ± 0.569
AsnLeu: 6.138 ± 0.807
AsnMet: 1.18 ± 0.593
AsnAsn: 2.951 ± 0.872
AsnPro: 2.715 ± 1.37
AsnGln: 2.125 ± 0.535
AsnArg: 1.771 ± 0.336
AsnSer: 2.479 ± 0.673
AsnThr: 4.367 ± 1.478
AsnVal: 4.958 ± 1.492
AsnTrp: 0.472 ± 0.247
AsnTyr: 2.361 ± 0.581
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.479 ± 1.079
ProCys: 1.18 ± 0.405
ProAsp: 2.479 ± 1.089
ProGlu: 1.771 ± 0.841
ProPhe: 2.007 ± 0.421
ProGly: 3.187 ± 0.671
ProHis: 0.826 ± 0.421
ProIle: 3.305 ± 1.342
ProLys: 2.007 ± 1.77
ProLeu: 3.777 ± 1.154
ProMet: 0.944 ± 0.271
ProAsn: 2.361 ± 0.622
ProPro: 2.597 ± 0.911
ProGln: 2.125 ± 0.613
ProArg: 1.653 ± 1.507
ProSer: 3.659 ± 1.916
ProThr: 3.541 ± 0.859
ProVal: 3.895 ± 0.383
ProTrp: 0.236 ± 0.172
ProTyr: 2.007 ± 0.628
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.423 ± 0.615
GlnCys: 1.18 ± 0.392
GlnAsp: 2.243 ± 0.806
GlnGlu: 1.534 ± 0.625
GlnPhe: 1.062 ± 0.525
GlnGly: 1.298 ± 0.275
GlnHis: 2.007 ± 0.434
GlnIle: 1.416 ± 0.757
GlnLys: 1.062 ± 0.484
GlnLeu: 3.659 ± 1.266
GlnMet: 1.534 ± 0.433
GlnAsn: 1.416 ± 0.569
GlnPro: 3.069 ± 1.104
GlnGln: 1.416 ± 0.774
GlnArg: 1.416 ± 0.721
GlnSer: 3.069 ± 1.336
GlnThr: 2.361 ± 0.55
GlnVal: 3.069 ± 0.685
GlnTrp: 0.236 ± 0.12
GlnTyr: 1.889 ± 0.485
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.833 ± 0.953
ArgCys: 1.298 ± 0.257
ArgAsp: 1.653 ± 0.68
ArgGlu: 1.18 ± 0.25
ArgPhe: 1.653 ± 1.07
ArgGly: 1.653 ± 0.466
ArgHis: 0.944 ± 0.254
ArgIle: 2.243 ± 0.429
ArgLys: 1.653 ± 0.633
ArgLeu: 4.013 ± 1.112
ArgMet: 0.59 ± 0.301
ArgAsn: 2.479 ± 0.475
ArgPro: 1.534 ± 0.402
ArgGln: 1.771 ± 0.423
ArgArg: 0.826 ± 0.324
ArgSer: 1.771 ± 1.074
ArgThr: 2.479 ± 0.696
ArgVal: 2.951 ± 0.533
ArgTrp: 0.118 ± 0.06
ArgTyr: 1.298 ± 0.515
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.839 ± 1.324
SerCys: 0.59 ± 0.562
SerAsp: 5.194 ± 1.265
SerGlu: 2.479 ± 0.527
SerPhe: 2.243 ± 1.004
SerGly: 3.659 ± 0.64
SerHis: 1.771 ± 0.356
SerIle: 4.958 ± 1.542
SerLys: 3.423 ± 0.998
SerLeu: 5.43 ± 1.704
SerMet: 0.944 ± 0.211
SerAsn: 3.187 ± 0.443
SerPro: 3.187 ± 1.363
SerGln: 2.951 ± 1.51
SerArg: 2.243 ± 1.423
SerSer: 4.013 ± 1.202
SerThr: 5.666 ± 1.344
SerVal: 6.256 ± 1.049
SerTrp: 0.944 ± 0.568
SerTyr: 3.659 ± 0.633
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.777 ± 0.966
ThrCys: 1.653 ± 0.715
ThrAsp: 3.305 ± 0.489
ThrGlu: 2.007 ± 0.434
ThrPhe: 3.069 ± 0.609
ThrGly: 4.603 ± 1.449
ThrHis: 1.534 ± 0.783
ThrIle: 4.721 ± 1.333
ThrLys: 2.951 ± 0.597
ThrLeu: 7.554 ± 0.6
ThrMet: 1.298 ± 0.407
ThrAsn: 3.777 ± 0.441
ThrPro: 4.249 ± 0.534
ThrGln: 2.125 ± 0.58
ThrArg: 2.361 ± 0.807
ThrSer: 4.721 ± 1.106
ThrThr: 6.374 ± 1.339
ThrVal: 7.082 ± 1.707
ThrTrp: 0.472 ± 0.241
ThrTyr: 4.485 ± 1.267
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.256 ± 1.069
ValCys: 1.771 ± 0.5
ValAsp: 6.728 ± 0.75
ValGlu: 4.013 ± 0.909
ValPhe: 4.249 ± 1.289
ValGly: 4.839 ± 1.242
ValHis: 1.298 ± 0.34
ValIle: 5.902 ± 1.168
ValLys: 4.958 ± 0.724
ValLeu: 8.617 ± 1.183
ValMet: 1.298 ± 0.496
ValAsn: 5.194 ± 1.29
ValPro: 4.485 ± 1.147
ValGln: 2.951 ± 0.711
ValArg: 2.715 ± 0.41
ValSer: 5.312 ± 1.075
ValThr: 7.082 ± 1.274
ValVal: 10.623 ± 1.852
ValTrp: 1.534 ± 0.332
ValTyr: 4.367 ± 0.601
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.18 ± 0.801
TrpCys: 0.118 ± 0.06
TrpAsp: 0.59 ± 0.378
TrpGlu: 0.236 ± 0.355
TrpPhe: 0.236 ± 0.261
TrpGly: 0.354 ± 0.247
TrpHis: 0.118 ± 0.06
TrpIle: 0.354 ± 0.181
TrpLys: 0.236 ± 0.12
TrpLeu: 1.298 ± 0.336
TrpMet: 0.118 ± 0.06
TrpAsn: 0.708 ± 0.361
TrpPro: 0.236 ± 0.172
TrpGln: 0.708 ± 0.342
TrpArg: 0.236 ± 0.12
TrpSer: 0.59 ± 0.355
TrpThr: 0.59 ± 0.266
TrpVal: 1.298 ± 0.585
TrpTrp: 0.118 ± 0.214
TrpTyr: 0.236 ± 0.289
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.243 ± 0.451
TyrCys: 1.534 ± 0.572
TyrAsp: 2.833 ± 1.308
TyrGlu: 1.653 ± 0.455
TyrPhe: 1.18 ± 0.307
TyrGly: 2.361 ± 0.507
TyrHis: 1.653 ± 0.472
TyrIle: 3.895 ± 0.78
TyrLys: 3.305 ± 0.692
TyrLeu: 4.839 ± 0.939
TyrMet: 1.298 ± 0.452
TyrAsn: 3.659 ± 0.983
TyrPro: 2.007 ± 0.351
TyrGln: 1.534 ± 0.425
TyrArg: 2.007 ± 0.529
TyrSer: 3.423 ± 0.491
TyrThr: 4.721 ± 1.105
TyrVal: 4.603 ± 0.99
TyrTrp: 0.472 ± 0.406
TyrTyr: 2.597 ± 0.495
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (8473 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
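A protein-level bootstrap of this kind can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed random seed and the use of the sample standard deviation as the reported error are assumptions of this sketch; the page does not specify which spread measure is used.

```python
import random
import statistics

def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide among all overlapping dipeptides."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide   # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)                   # seeded for reproducibility
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumed error measure: sample standard deviation of the replicates.
    return statistics.stdev(estimates)
```

A dipeptide that never occurs gives 0.0 in every replicate, so its estimated error is also 0.0, which is consistent with the "0.0 ± 0.0" entries for Xaa in the table.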

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski