Amino acid dipepetide frequency for Wuhan coneheads virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.976AlaAla: 3.976 ± 0.299
0.852AlaCys: 0.852 ± 0.464
2.84AlaAsp: 2.84 ± 0.331
4.828AlaGlu: 4.828 ± 1.047
2.272AlaPhe: 2.272 ± 0.689
4.544AlaGly: 4.544 ± 1.367
1.704AlaHis: 1.704 ± 0.726
2.84AlaIle: 2.84 ± 0.936
3.408AlaLys: 3.408 ± 1.34
4.828AlaLeu: 4.828 ± 0.556
1.42AlaMet: 1.42 ± 1.416
1.988AlaAsn: 1.988 ± 0.721
1.988AlaPro: 1.988 ± 0.721
2.556AlaGln: 2.556 ± 0.69
3.124AlaArg: 3.124 ± 1.543
5.112AlaSer: 5.112 ± 1.38
4.26AlaThr: 4.26 ± 1.877
1.988AlaVal: 1.988 ± 0.857
0.284AlaTrp: 0.284 ± 0.155
2.272AlaTyr: 2.272 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
1.136CysAla: 1.136 ± 0.933
0.0CysCys: 0.0 ± 0.0
1.136CysAsp: 1.136 ± 1.141
0.568CysGlu: 0.568 ± 0.309
1.42CysPhe: 1.42 ± 0.866
1.136CysGly: 1.136 ± 0.619
0.284CysHis: 0.284 ± 0.155
1.704CysIle: 1.704 ± 0.2
0.284CysLys: 0.284 ± 0.155
0.284CysLeu: 0.284 ± 0.324
0.852CysMet: 0.852 ± 0.257
0.852CysAsn: 0.852 ± 0.979
0.0CysPro: 0.0 ± 0.0
1.704CysGln: 1.704 ± 0.928
0.0CysArg: 0.0 ± 0.0
1.136CysSer: 1.136 ± 0.447
2.272CysThr: 2.272 ± 0.059
1.988CysVal: 1.988 ± 0.751
0.0CysTrp: 0.0 ± 0.0
0.852CysTyr: 0.852 ± 0.759
0.0CysXaa: 0.0 ± 0.0
Asp
2.84AspAla: 2.84 ± 1.746
1.136AspCys: 1.136 ± 0.295
2.84AspAsp: 2.84 ± 0.68
4.544AspGlu: 4.544 ± 0.542
4.544AspPhe: 4.544 ± 0.555
1.704AspGly: 1.704 ± 0.745
0.852AspHis: 0.852 ± 0.464
3.124AspIle: 3.124 ± 1.351
2.556AspLys: 2.556 ± 1.559
3.692AspLeu: 3.692 ± 0.792
0.852AspMet: 0.852 ± 0.424
1.988AspAsn: 1.988 ± 1.033
1.704AspPro: 1.704 ± 0.2
1.136AspGln: 1.136 ± 0.933
2.556AspArg: 2.556 ± 1.048
2.272AspSer: 2.272 ± 0.589
2.84AspThr: 2.84 ± 0.963
3.124AspVal: 3.124 ± 1.004
0.852AspTrp: 0.852 ± 0.257
1.42AspTyr: 1.42 ± 0.451
0.0AspXaa: 0.0 ± 0.0
Glu
5.396GluAla: 5.396 ± 2.78
0.568GluCys: 0.568 ± 0.456
2.84GluAsp: 2.84 ± 1.629
4.544GluGlu: 4.544 ± 1.23
3.976GluPhe: 3.976 ± 0.815
2.556GluGly: 2.556 ± 0.358
1.42GluHis: 1.42 ± 0.468
2.84GluIle: 2.84 ± 0.301
5.964GluLys: 5.964 ± 2.015
5.396GluLeu: 5.396 ± 1.228
2.556GluMet: 2.556 ± 0.573
3.692GluAsn: 3.692 ± 1.356
2.84GluPro: 2.84 ± 0.84
3.692GluGln: 3.692 ± 1.176
1.988GluArg: 1.988 ± 0.857
3.408GluSer: 3.408 ± 0.805
5.964GluThr: 5.964 ± 1.185
5.68GluVal: 5.68 ± 1.53
1.42GluTrp: 1.42 ± 0.451
1.988GluTyr: 1.988 ± 0.297
0.0GluXaa: 0.0 ± 0.0
Phe
2.272PheAla: 2.272 ± 0.429
2.556PheCys: 2.556 ± 0.18
2.84PheAsp: 2.84 ± 1.643
1.704PheGlu: 1.704 ± 0.515
1.704PhePhe: 1.704 ± 0.298
1.988PheGly: 1.988 ± 0.59
0.284PheHis: 0.284 ± 0.533
3.976PheIle: 3.976 ± 0.645
3.408PheLys: 3.408 ± 0.597
3.408PheLeu: 3.408 ± 0.397
1.704PheMet: 1.704 ± 0.169
3.692PheAsn: 3.692 ± 1.568
2.556PhePro: 2.556 ± 0.69
1.988PheGln: 1.988 ± 0.59
2.272PheArg: 2.272 ± 1.286
5.112PheSer: 5.112 ± 1.564
2.272PheThr: 2.272 ± 0.463
4.26PheVal: 4.26 ± 1.164
1.42PheTrp: 1.42 ± 0.481
1.988PheTyr: 1.988 ± 1.565
0.0PheXaa: 0.0 ± 0.0
Gly
4.544GlyAla: 4.544 ± 0.49
0.568GlyCys: 0.568 ± 0.57
3.408GlyAsp: 3.408 ± 0.397
2.556GlyGlu: 2.556 ± 1.28
3.124GlyPhe: 3.124 ± 1.004
1.42GlyGly: 1.42 ± 0.798
0.852GlyHis: 0.852 ± 0.257
3.408GlyIle: 3.408 ± 1.201
5.68GlyLys: 5.68 ± 1.443
5.112GlyLeu: 5.112 ± 0.897
1.136GlyMet: 1.136 ± 0.619
3.408GlyAsn: 3.408 ± 1.212
3.692GlyPro: 3.692 ± 0.843
1.136GlyGln: 1.136 ± 0.295
2.84GlyArg: 2.84 ± 0.902
4.828GlySer: 4.828 ± 1.248
3.976GlyThr: 3.976 ± 0.299
2.84GlyVal: 2.84 ± 0.539
0.568GlyTrp: 0.568 ± 0.248
2.556GlyTyr: 2.556 ± 1.048
0.0GlyXaa: 0.0 ± 0.0
His
0.284HisAla: 0.284 ± 0.155
0.568HisCys: 0.568 ± 0.309
0.284HisAsp: 0.284 ± 0.324
2.272HisGlu: 2.272 ± 0.689
0.568HisPhe: 0.568 ± 0.456
0.568HisGly: 0.568 ± 0.309
0.852HisHis: 0.852 ± 0.464
1.136HisIle: 1.136 ± 0.447
0.852HisLys: 0.852 ± 0.257
2.84HisLeu: 2.84 ± 0.721
0.284HisMet: 0.284 ± 0.324
1.704HisAsn: 1.704 ± 0.2
1.42HisPro: 1.42 ± 0.773
0.284HisGln: 0.284 ± 0.324
0.568HisArg: 0.568 ± 0.456
1.988HisSer: 1.988 ± 0.739
0.568HisThr: 0.568 ± 0.248
2.272HisVal: 2.272 ± 0.689
0.284HisTrp: 0.284 ± 0.533
0.284HisTyr: 0.284 ± 0.324
0.0HisXaa: 0.0 ± 0.0
Ile
2.556IleAla: 2.556 ± 1.11
1.42IleCys: 1.42 ± 0.517
3.692IleAsp: 3.692 ± 1.591
3.976IleGlu: 3.976 ± 0.574
2.84IlePhe: 2.84 ± 0.331
4.828IleGly: 4.828 ± 0.814
1.42IleHis: 1.42 ± 0.773
3.976IleIle: 3.976 ± 0.645
3.124IleLys: 3.124 ± 0.929
5.964IleLeu: 5.964 ± 0.999
1.136IleMet: 1.136 ± 0.447
2.556IleAsn: 2.556 ± 0.772
3.408IlePro: 3.408 ± 0.638
3.124IleGln: 3.124 ± 0.484
3.408IleArg: 3.408 ± 1.029
2.84IleSer: 2.84 ± 2.123
3.976IleThr: 3.976 ± 1.274
3.124IleVal: 3.124 ± 1.351
1.136IleTrp: 1.136 ± 0.295
2.84IleTyr: 2.84 ± 0.721
0.0IleXaa: 0.0 ± 0.0
Lys
2.272LysAla: 2.272 ± 0.429
1.42LysCys: 1.42 ± 0.468
1.704LysAsp: 1.704 ± 0.515
6.816LysGlu: 6.816 ± 2.04
2.84LysPhe: 2.84 ± 1.771
2.84LysGly: 2.84 ± 0.936
1.988LysHis: 1.988 ± 0.588
4.828LysIle: 4.828 ± 0.911
2.84LysLys: 2.84 ± 1.062
7.952LysLeu: 7.952 ± 1.244
1.988LysMet: 1.988 ± 1.318
2.272LysAsn: 2.272 ± 0.429
2.84LysPro: 2.84 ± 0.84
2.556LysGln: 2.556 ± 0.611
3.408LysArg: 3.408 ± 0.249
1.988LysSer: 1.988 ± 0.149
3.976LysThr: 3.976 ± 1.467
4.544LysVal: 4.544 ± 0.819
1.42LysTrp: 1.42 ± 0.773
1.704LysTyr: 1.704 ± 0.606
0.0LysXaa: 0.0 ± 0.0
Leu
5.964LeuAla: 5.964 ± 1.185
0.852LeuCys: 0.852 ± 0.464
5.68LeuAsp: 5.68 ± 2.651
5.68LeuGlu: 5.68 ± 1.53
2.272LeuPhe: 2.272 ± 0.059
5.964LeuGly: 5.964 ± 0.717
0.852LeuHis: 0.852 ± 0.257
3.692LeuIle: 3.692 ± 0.482
4.828LeuLys: 4.828 ± 1.713
5.112LeuLeu: 5.112 ± 1.228
1.988LeuMet: 1.988 ± 0.654
3.692LeuAsn: 3.692 ± 0.14
3.692LeuPro: 3.692 ± 1.657
3.408LeuGln: 3.408 ± 1.456
4.544LeuArg: 4.544 ± 0.859
5.112LeuSer: 5.112 ± 1.726
4.828LeuThr: 4.828 ± 0.399
5.964LeuVal: 5.964 ± 0.452
0.852LeuTrp: 0.852 ± 0.427
3.692LeuTyr: 3.692 ± 0.807
0.0LeuXaa: 0.0 ± 0.0
Met
3.124MetAla: 3.124 ± 1.257
0.568MetCys: 0.568 ± 0.456
1.136MetAsp: 1.136 ± 0.447
2.272MetGlu: 2.272 ± 0.893
0.852MetPhe: 0.852 ± 0.257
0.568MetGly: 0.568 ± 0.309
0.568MetHis: 0.568 ± 0.248
1.704MetIle: 1.704 ± 0.298
1.988MetLys: 1.988 ± 0.739
1.136MetLeu: 1.136 ± 0.295
0.0MetMet: 0.0 ± 0.0
1.704MetAsn: 1.704 ± 0.726
2.272MetPro: 2.272 ± 0.689
1.136MetGln: 1.136 ± 0.345
0.0MetArg: 0.0 ± 0.0
0.852MetSer: 0.852 ± 0.427
2.272MetThr: 2.272 ± 0.589
1.42MetVal: 1.42 ± 0.481
0.568MetTrp: 0.568 ± 0.456
0.568MetTyr: 0.568 ± 0.456
0.0MetXaa: 0.0 ± 0.0
Asn
2.272AsnAla: 2.272 ± 0.87
0.284AsnCys: 0.284 ± 0.155
1.704AsnAsp: 1.704 ± 0.726
3.692AsnGlu: 3.692 ± 0.482
2.84AsnPhe: 2.84 ± 0.84
2.84AsnGly: 2.84 ± 0.68
0.852AsnHis: 0.852 ± 0.464
1.988AsnIle: 1.988 ± 0.717
3.124AsnLys: 3.124 ± 0.984
7.1AsnLeu: 7.1 ± 0.58
1.136AsnMet: 1.136 ± 0.295
2.84AsnAsn: 2.84 ± 0.963
3.124AsnPro: 3.124 ± 0.749
1.704AsnGln: 1.704 ± 0.606
2.272AsnArg: 2.272 ± 0.589
3.124AsnSer: 3.124 ± 2.174
1.988AsnThr: 1.988 ± 0.588
5.396AsnVal: 5.396 ± 1.741
0.568AsnTrp: 0.568 ± 0.57
3.124AsnTyr: 3.124 ± 0.316
0.0AsnXaa: 0.0 ± 0.0
Pro
0.852ProAla: 0.852 ± 0.257
0.568ProCys: 0.568 ± 0.248
2.272ProAsp: 2.272 ± 0.899
4.544ProGlu: 4.544 ± 1.539
3.692ProPhe: 3.692 ± 0.915
1.704ProGly: 1.704 ± 0.515
1.704ProHis: 1.704 ± 0.618
3.408ProIle: 3.408 ± 1.034
2.272ProLys: 2.272 ± 0.87
1.704ProLeu: 1.704 ± 0.515
1.42ProMet: 1.42 ± 0.451
1.704ProAsn: 1.704 ± 0.298
2.272ProPro: 2.272 ± 0.689
2.556ProGln: 2.556 ± 0.611
1.704ProArg: 1.704 ± 1.112
3.976ProSer: 3.976 ± 1.308
4.828ProThr: 4.828 ± 0.831
5.112ProVal: 5.112 ± 1.818
0.852ProTrp: 0.852 ± 0.424
2.556ProTyr: 2.556 ± 1.223
0.0ProXaa: 0.0 ± 0.0
Gln
3.124GlnAla: 3.124 ± 1.004
1.136GlnCys: 1.136 ± 1.509
0.852GlnAsp: 0.852 ± 0.257
3.408GlnGlu: 3.408 ± 0.548
3.692GlnPhe: 3.692 ± 0.96
2.556GlnGly: 2.556 ± 0.807
0.568GlnHis: 0.568 ± 1.065
3.976GlnIle: 3.976 ± 0.645
2.556GlnLys: 2.556 ± 0.18
3.408GlnLeu: 3.408 ± 1.024
0.0GlnMet: 0.0 ± 0.0
2.556GlnAsn: 2.556 ± 1.392
1.42GlnPro: 1.42 ± 0.866
3.976GlnGln: 3.976 ± 1.737
2.84GlnArg: 2.84 ± 0.936
3.124GlnSer: 3.124 ± 1.431
0.852GlnThr: 0.852 ± 0.464
2.84GlnVal: 2.84 ± 0.68
0.284GlnTrp: 0.284 ± 0.533
1.988GlnTyr: 1.988 ± 0.59
0.0GlnXaa: 0.0 ± 0.0
Arg
2.84ArgAla: 2.84 ± 1.062
1.136ArgCys: 1.136 ± 0.496
2.272ArgAsp: 2.272 ± 0.463
2.556ArgGlu: 2.556 ± 1.271
1.704ArgPhe: 1.704 ± 0.848
2.556ArgGly: 2.556 ± 0.573
1.136ArgHis: 1.136 ± 0.345
3.692ArgIle: 3.692 ± 0.807
1.704ArgLys: 1.704 ± 1.149
2.272ArgLeu: 2.272 ± 0.842
0.852ArgMet: 0.852 ± 0.464
3.976ArgAsn: 3.976 ± 1.501
3.976ArgPro: 3.976 ± 0.574
2.272ArgGln: 2.272 ± 0.429
1.988ArgArg: 1.988 ± 0.654
2.272ArgSer: 2.272 ± 0.059
1.988ArgThr: 1.988 ± 0.59
1.704ArgVal: 1.704 ± 1.367
0.284ArgTrp: 0.284 ± 0.533
2.556ArgTyr: 2.556 ± 0.358
0.0ArgXaa: 0.0 ± 0.0
Ser
4.26SerAla: 4.26 ± 1.877
0.284SerCys: 0.284 ± 0.155
4.544SerAsp: 4.544 ± 0.79
3.408SerGlu: 3.408 ± 0.853
3.408SerPhe: 3.408 ± 1.201
7.384SerGly: 7.384 ± 0.756
1.136SerHis: 1.136 ± 0.345
3.124SerIle: 3.124 ± 0.367
4.828SerLys: 4.828 ± 1.051
4.828SerLeu: 4.828 ± 2.487
2.84SerMet: 2.84 ± 1.01
2.556SerAsn: 2.556 ± 1.28
2.84SerPro: 2.84 ± 0.646
3.976SerGln: 3.976 ± 0.299
2.84SerArg: 2.84 ± 0.646
8.804SerSer: 8.804 ± 1.532
5.112SerThr: 5.112 ± 1.544
3.976SerVal: 3.976 ± 1.128
0.852SerTrp: 0.852 ± 0.464
2.272SerTyr: 2.272 ± 0.429
0.0SerXaa: 0.0 ± 0.0
Thr
3.408ThrAla: 3.408 ± 0.397
1.136ThrCys: 1.136 ± 0.447
2.556ThrAsp: 2.556 ± 1.223
3.408ThrGlu: 3.408 ± 1.029
2.84ThrPhe: 2.84 ± 1.108
3.692ThrGly: 3.692 ± 0.482
1.42ThrHis: 1.42 ± 0.468
2.84ThrIle: 2.84 ± 0.84
2.84ThrLys: 2.84 ± 1.2
5.68ThrLeu: 5.68 ± 1.077
1.136ThrMet: 1.136 ± 0.447
5.112ThrAsn: 5.112 ± 1.137
3.976ThrPro: 3.976 ± 1.153
2.556ThrGln: 2.556 ± 0.772
2.272ThrArg: 2.272 ± 0.429
5.396ThrSer: 5.396 ± 1.461
5.396ThrThr: 5.396 ± 2.343
3.976ThrVal: 3.976 ± 0.594
1.988ThrTrp: 1.988 ± 0.59
3.408ThrTyr: 3.408 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
4.26ValAla: 4.26 ± 1.272
0.852ValCys: 0.852 ± 0.427
2.272ValAsp: 2.272 ± 0.899
3.976ValGlu: 3.976 ± 1.274
3.692ValPhe: 3.692 ± 1.479
3.408ValGly: 3.408 ± 1.108
0.852ValHis: 0.852 ± 0.257
4.544ValIle: 4.544 ± 1.255
5.964ValLys: 5.964 ± 1.352
5.68ValLeu: 5.68 ± 0.451
1.704ValMet: 1.704 ± 0.745
3.124ValAsn: 3.124 ± 0.713
2.84ValPro: 2.84 ± 1.547
2.556ValGln: 2.556 ± 0.478
2.556ValArg: 2.556 ± 0.358
8.52ValSer: 8.52 ± 2.544
4.544ValThr: 4.544 ± 0.926
4.828ValVal: 4.828 ± 1.427
0.568ValTrp: 0.568 ± 0.309
1.988ValTyr: 1.988 ± 0.857
0.0ValXaa: 0.0 ± 0.0
Trp
1.136TrpAla: 1.136 ± 0.447
0.284TrpCys: 0.284 ± 0.155
0.568TrpAsp: 0.568 ± 1.065
1.136TrpGlu: 1.136 ± 0.933
1.136TrpPhe: 1.136 ± 0.447
0.852TrpGly: 0.852 ± 0.971
0.0TrpHis: 0.0 ± 0.0
1.136TrpIle: 1.136 ± 0.295
1.136TrpLys: 1.136 ± 0.447
0.852TrpLeu: 0.852 ± 0.464
0.284TrpMet: 0.284 ± 0.324
0.568TrpAsn: 0.568 ± 0.248
0.852TrpPro: 0.852 ± 0.464
0.852TrpGln: 0.852 ± 0.971
0.568TrpArg: 0.568 ± 0.57
1.136TrpSer: 1.136 ± 0.619
1.704TrpThr: 1.704 ± 1.149
0.284TrpVal: 0.284 ± 0.155
0.0TrpTrp: 0.0 ± 0.0
0.852TrpTyr: 0.852 ± 0.464
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.852TyrAla: 0.852 ± 0.427
1.42TyrCys: 1.42 ± 0.451
1.136TyrAsp: 1.136 ± 0.875
2.84TyrGlu: 2.84 ± 1.01
1.988TyrPhe: 1.988 ± 0.297
4.544TyrGly: 4.544 ± 0.931
1.136TyrHis: 1.136 ± 0.295
3.408TyrIle: 3.408 ± 0.397
2.84TyrLys: 2.84 ± 0.331
1.42TyrLeu: 1.42 ± 0.798
1.136TyrMet: 1.136 ± 0.619
1.988TyrAsn: 1.988 ± 0.654
1.988TyrPro: 1.988 ± 0.857
1.988TyrGln: 1.988 ± 0.588
1.988TyrArg: 1.988 ± 1.258
2.272TyrSer: 2.272 ± 0.689
1.42TyrThr: 1.42 ± 0.995
3.408TyrVal: 3.408 ± 0.638
1.136TyrTrp: 1.136 ± 0.605
2.272TyrTyr: 2.272 ± 0.589
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3522 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski