Amino acid dipepetide frequency for Wuhan Tick Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.24AlaAla: 5.24 ± 0.729
2.158AlaCys: 2.158 ± 0.8
2.774AlaAsp: 2.774 ± 0.859
4.932AlaGlu: 4.932 ± 1.525
0.925AlaPhe: 0.925 ± 0.724
2.158AlaGly: 2.158 ± 0.327
2.158AlaHis: 2.158 ± 0.903
3.083AlaIle: 3.083 ± 0.439
2.158AlaLys: 2.158 ± 1.082
6.165AlaLeu: 6.165 ± 2.029
0.925AlaMet: 0.925 ± 0.296
1.85AlaAsn: 1.85 ± 0.615
3.391AlaPro: 3.391 ± 1.264
3.083AlaGln: 3.083 ± 0.946
3.699AlaArg: 3.699 ± 2.398
3.699AlaSer: 3.699 ± 1.637
4.932AlaThr: 4.932 ± 1.852
3.391AlaVal: 3.391 ± 0.664
0.925AlaTrp: 0.925 ± 0.409
2.466AlaTyr: 2.466 ± 0.926
0.0AlaXaa: 0.0 ± 0.0
Cys
1.233CysAla: 1.233 ± 0.65
0.0CysCys: 0.0 ± 0.0
0.617CysAsp: 0.617 ± 0.339
0.308CysGlu: 0.308 ± 0.426
0.617CysPhe: 0.617 ± 0.339
0.308CysGly: 0.308 ± 0.169
0.0CysHis: 0.0 ± 0.0
1.233CysIle: 1.233 ± 0.356
0.925CysLys: 0.925 ± 0.296
4.007CysLeu: 4.007 ± 1.176
0.0CysMet: 0.0 ± 0.0
0.308CysAsn: 0.308 ± 0.169
0.617CysPro: 0.617 ± 0.339
0.0CysGln: 0.0 ± 0.0
0.308CysArg: 0.308 ± 0.58
1.85CysSer: 1.85 ± 0.615
0.925CysThr: 0.925 ± 0.296
1.233CysVal: 1.233 ± 0.356
0.617CysTrp: 0.617 ± 0.339
1.233CysTyr: 1.233 ± 0.677
0.0CysXaa: 0.0 ± 0.0
Asp
1.541AspAla: 1.541 ± 1.421
0.925AspCys: 0.925 ± 0.296
1.233AspAsp: 1.233 ± 0.625
3.083AspGlu: 3.083 ± 1.758
1.541AspPhe: 1.541 ± 0.599
3.083AspGly: 3.083 ± 0.749
0.617AspHis: 0.617 ± 0.325
1.85AspIle: 1.85 ± 1.736
2.158AspLys: 2.158 ± 0.327
4.932AspLeu: 4.932 ± 0.709
1.541AspMet: 1.541 ± 0.661
1.85AspAsn: 1.85 ± 1.016
4.007AspPro: 4.007 ± 0.923
2.774AspGln: 2.774 ± 0.661
2.774AspArg: 2.774 ± 0.784
3.391AspSer: 3.391 ± 1.057
3.391AspThr: 3.391 ± 1.166
3.391AspVal: 3.391 ± 0.548
1.85AspTrp: 1.85 ± 0.592
2.158AspTyr: 2.158 ± 1.185
0.0AspXaa: 0.0 ± 0.0
Glu
3.083GluAla: 3.083 ± 0.949
0.925GluCys: 0.925 ± 0.296
4.007GluAsp: 4.007 ± 0.477
4.932GluGlu: 4.932 ± 0.987
3.699GluPhe: 3.699 ± 1.583
5.857GluGly: 5.857 ± 2.374
1.233GluHis: 1.233 ± 0.951
4.007GluIle: 4.007 ± 0.795
4.007GluLys: 4.007 ± 1.349
5.857GluLeu: 5.857 ± 1.204
1.233GluMet: 1.233 ± 0.696
1.85GluAsn: 1.85 ± 0.615
3.083GluPro: 3.083 ± 0.649
1.85GluGln: 1.85 ± 0.778
4.007GluArg: 4.007 ± 0.475
4.316GluSer: 4.316 ± 1.742
4.316GluThr: 4.316 ± 1.031
4.316GluVal: 4.316 ± 0.904
1.85GluTrp: 1.85 ± 0.688
1.85GluTyr: 1.85 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
0.925PheAla: 0.925 ± 0.738
0.308PheCys: 0.308 ± 0.169
0.925PheAsp: 0.925 ± 0.81
3.391PheGlu: 3.391 ± 1.084
1.541PhePhe: 1.541 ± 0.68
2.774PheGly: 2.774 ± 2.111
1.85PheHis: 1.85 ± 0.778
1.85PheIle: 1.85 ± 0.778
1.541PheLys: 1.541 ± 0.473
4.624PheLeu: 4.624 ± 1.092
1.233PheMet: 1.233 ± 0.356
1.85PheAsn: 1.85 ± 0.308
2.774PhePro: 2.774 ± 0.444
1.541PheGln: 1.541 ± 0.609
2.774PheArg: 2.774 ± 1.057
3.083PheSer: 3.083 ± 1.252
1.85PheThr: 1.85 ± 0.778
2.158PheVal: 2.158 ± 0.579
1.541PheTrp: 1.541 ± 0.847
2.466PheTyr: 2.466 ± 1.031
0.0PheXaa: 0.0 ± 0.0
Gly
3.083GlyAla: 3.083 ± 0.649
1.85GlyCys: 1.85 ± 1.016
3.699GlyAsp: 3.699 ± 1.326
4.007GlyGlu: 4.007 ± 2.661
2.774GlyPhe: 2.774 ± 0.874
4.316GlyGly: 4.316 ± 1.599
2.466GlyHis: 2.466 ± 1.031
2.158GlyIle: 2.158 ± 0.802
3.083GlyLys: 3.083 ± 1.758
8.94GlyLeu: 8.94 ± 1.9
1.541GlyMet: 1.541 ± 0.609
1.233GlyAsn: 1.233 ± 0.493
3.083GlyPro: 3.083 ± 1.867
3.083GlyGln: 3.083 ± 0.337
2.466GlyArg: 2.466 ± 0.384
4.932GlySer: 4.932 ± 1.04
4.624GlyThr: 4.624 ± 1.042
6.782GlyVal: 6.782 ± 1.634
0.617GlyTrp: 0.617 ± 0.339
0.925GlyTyr: 0.925 ± 0.81
0.0GlyXaa: 0.0 ± 0.0
His
1.233HisAla: 1.233 ± 0.614
0.308HisCys: 0.308 ± 0.169
0.308HisAsp: 0.308 ± 0.169
2.774HisGlu: 2.774 ± 0.319
0.617HisPhe: 0.617 ± 0.339
1.233HisGly: 1.233 ± 1.229
0.308HisHis: 0.308 ± 0.169
2.158HisIle: 2.158 ± 0.633
1.541HisLys: 1.541 ± 0.868
3.699HisLeu: 3.699 ± 1.179
0.0HisMet: 0.0 ± 0.0
0.925HisAsn: 0.925 ± 0.296
1.541HisPro: 1.541 ± 0.847
0.617HisGln: 0.617 ± 0.473
2.466HisArg: 2.466 ± 0.657
1.85HisSer: 1.85 ± 0.641
0.925HisThr: 0.925 ± 0.508
1.233HisVal: 1.233 ± 0.411
0.925HisTrp: 0.925 ± 0.724
0.925HisTyr: 0.925 ± 0.296
0.0HisXaa: 0.0 ± 0.0
Ile
3.699IleAla: 3.699 ± 0.661
0.617IleCys: 0.617 ± 0.339
2.774IleAsp: 2.774 ± 1.905
1.541IleGlu: 1.541 ± 0.561
3.083IlePhe: 3.083 ± 1.381
2.466IleGly: 2.466 ± 0.42
2.774IleHis: 2.774 ± 1.088
2.466IleIle: 2.466 ± 0.937
3.083IleLys: 3.083 ± 1.826
4.316IleLeu: 4.316 ± 1.266
1.541IleMet: 1.541 ± 0.374
1.85IleAsn: 1.85 ± 1.635
3.083IlePro: 3.083 ± 0.337
2.466IleGln: 2.466 ± 0.657
3.391IleArg: 3.391 ± 1.032
2.158IleSer: 2.158 ± 0.501
4.007IleThr: 4.007 ± 0.701
4.932IleVal: 4.932 ± 0.199
0.0IleTrp: 0.0 ± 0.0
1.85IleTyr: 1.85 ± 0.615
0.0IleXaa: 0.0 ± 0.0
Lys
2.466LysAla: 2.466 ± 0.42
0.617LysCys: 0.617 ± 0.339
3.699LysAsp: 3.699 ± 0.907
4.932LysGlu: 4.932 ± 1.529
1.233LysPhe: 1.233 ± 0.945
4.624LysGly: 4.624 ± 1.458
0.925LysHis: 0.925 ± 0.409
2.158LysIle: 2.158 ± 0.802
4.007LysLys: 4.007 ± 0.713
6.473LysLeu: 6.473 ± 2.51
2.158LysMet: 2.158 ± 0.515
1.541LysAsn: 1.541 ± 0.477
2.774LysPro: 2.774 ± 1.905
1.541LysGln: 1.541 ± 0.473
1.541LysArg: 1.541 ± 0.599
4.007LysSer: 4.007 ± 1.38
3.391LysThr: 3.391 ± 3.068
2.466LysVal: 2.466 ± 0.926
1.233LysTrp: 1.233 ± 0.677
0.925LysTyr: 0.925 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
7.398LeuAla: 7.398 ± 0.532
1.541LeuCys: 1.541 ± 0.473
5.549LeuAsp: 5.549 ± 2.006
7.398LeuGlu: 7.398 ± 0.657
4.007LeuPhe: 4.007 ± 1.173
6.782LeuGly: 6.782 ± 1.706
1.233LeuHis: 1.233 ± 0.65
7.398LeuIle: 7.398 ± 1.7
6.473LeuLys: 6.473 ± 2.302
11.714LeuLeu: 11.714 ± 3.489
5.24LeuMet: 5.24 ± 1.759
4.007LeuAsn: 4.007 ± 1.38
5.857LeuPro: 5.857 ± 0.942
3.699LeuGln: 3.699 ± 0.907
9.556LeuArg: 9.556 ± 3.211
9.864LeuSer: 9.864 ± 2.355
7.09LeuThr: 7.09 ± 0.715
3.391LeuVal: 3.391 ± 0.707
1.233LeuTrp: 1.233 ± 1.016
3.083LeuTyr: 3.083 ± 0.892
0.0LeuXaa: 0.0 ± 0.0
Met
1.541MetAla: 1.541 ± 0.473
0.617MetCys: 0.617 ± 0.325
0.308MetAsp: 0.308 ± 0.58
2.158MetGlu: 2.158 ± 0.719
0.925MetPhe: 0.925 ± 0.738
3.083MetGly: 3.083 ± 1.123
0.617MetHis: 0.617 ± 0.339
1.85MetIle: 1.85 ± 1.27
1.233MetLys: 1.233 ± 0.493
3.083MetLeu: 3.083 ± 1.638
0.0MetMet: 0.0 ± 0.0
0.925MetAsn: 0.925 ± 0.296
1.541MetPro: 1.541 ± 0.68
0.617MetGln: 0.617 ± 0.94
0.925MetArg: 0.925 ± 0.59
1.85MetSer: 1.85 ± 0.641
2.774MetThr: 2.774 ± 0.648
1.541MetVal: 1.541 ± 0.374
0.925MetTrp: 0.925 ± 0.508
1.541MetTyr: 1.541 ± 0.892
0.0MetXaa: 0.0 ± 0.0
Asn
2.158AsnAla: 2.158 ± 0.579
0.617AsnCys: 0.617 ± 0.325
2.158AsnAsp: 2.158 ± 0.719
0.308AsnGlu: 0.308 ± 0.426
1.85AsnPhe: 1.85 ± 0.545
2.158AsnGly: 2.158 ± 0.767
1.233AsnHis: 1.233 ± 0.677
0.925AsnIle: 0.925 ± 0.508
2.774AsnLys: 2.774 ± 0.55
5.549AsnLeu: 5.549 ± 1.029
0.617AsnMet: 0.617 ± 0.339
0.617AsnAsn: 0.617 ± 0.339
1.85AsnPro: 1.85 ± 0.818
1.233AsnGln: 1.233 ± 0.411
2.466AsnArg: 2.466 ± 0.957
2.774AsnSer: 2.774 ± 1.174
1.85AsnThr: 1.85 ± 0.615
2.158AsnVal: 2.158 ± 0.57
0.308AsnTrp: 0.308 ± 0.169
0.617AsnTyr: 0.617 ± 0.339
0.0AsnXaa: 0.0 ± 0.0
Pro
1.541ProAla: 1.541 ± 1.035
0.925ProCys: 0.925 ± 0.508
4.007ProAsp: 4.007 ± 0.612
2.466ProGlu: 2.466 ± 0.986
1.85ProPhe: 1.85 ± 0.778
3.391ProGly: 3.391 ± 2.182
1.233ProHis: 1.233 ± 0.614
2.774ProIle: 2.774 ± 1.514
4.007ProLys: 4.007 ± 0.848
5.857ProLeu: 5.857 ± 1.285
2.466ProMet: 2.466 ± 0.65
1.541ProAsn: 1.541 ± 0.477
3.083ProPro: 3.083 ± 1.68
1.233ProGln: 1.233 ± 0.951
3.391ProArg: 3.391 ± 1.477
4.932ProSer: 4.932 ± 0.929
4.007ProThr: 4.007 ± 1.555
4.316ProVal: 4.316 ± 1.027
2.158ProTrp: 2.158 ± 0.917
1.541ProTyr: 1.541 ± 0.868
0.0ProXaa: 0.0 ± 0.0
Gln
3.083GlnAla: 3.083 ± 0.749
0.617GlnCys: 0.617 ± 0.339
0.617GlnAsp: 0.617 ± 0.615
1.541GlnGlu: 1.541 ± 0.374
1.85GlnPhe: 1.85 ± 1.181
3.083GlnGly: 3.083 ± 1.217
0.308GlnHis: 0.308 ± 0.169
2.466GlnIle: 2.466 ± 0.713
2.466GlnLys: 2.466 ± 0.821
3.699GlnLeu: 3.699 ± 1.843
1.233GlnMet: 1.233 ± 0.493
1.233GlnAsn: 1.233 ± 0.677
1.233GlnPro: 1.233 ± 1.016
0.925GlnGln: 0.925 ± 0.409
1.541GlnArg: 1.541 ± 0.374
1.85GlnSer: 1.85 ± 0.818
2.774GlnThr: 2.774 ± 0.874
2.158GlnVal: 2.158 ± 0.898
0.617GlnTrp: 0.617 ± 0.325
0.308GlnTyr: 0.308 ± 0.169
0.0GlnXaa: 0.0 ± 0.0
Arg
4.007ArgAla: 4.007 ± 0.701
1.233ArgCys: 1.233 ± 0.356
3.083ArgAsp: 3.083 ± 1.381
7.09ArgGlu: 7.09 ± 1.268
3.083ArgPhe: 3.083 ± 1.693
4.624ArgGly: 4.624 ± 1.925
2.158ArgHis: 2.158 ± 0.898
1.233ArgIle: 1.233 ± 0.356
3.083ArgLys: 3.083 ± 0.469
4.624ArgLeu: 4.624 ± 1.663
2.158ArgMet: 2.158 ± 1.588
2.158ArgAsn: 2.158 ± 0.898
3.083ArgPro: 3.083 ± 0.946
1.85ArgGln: 1.85 ± 1.016
3.083ArgArg: 3.083 ± 1.43
4.007ArgSer: 4.007 ± 1.887
3.083ArgThr: 3.083 ± 1.152
4.007ArgVal: 4.007 ± 1.332
0.617ArgTrp: 0.617 ± 0.339
1.85ArgTyr: 1.85 ± 0.545
0.0ArgXaa: 0.0 ± 0.0
Ser
4.316SerAla: 4.316 ± 1.34
0.308SerCys: 0.308 ± 0.169
5.24SerAsp: 5.24 ± 1.801
4.316SerGlu: 4.316 ± 1.501
2.774SerPhe: 2.774 ± 1.088
4.316SerGly: 4.316 ± 1.036
2.158SerHis: 2.158 ± 0.898
4.316SerIle: 4.316 ± 1.031
3.699SerLys: 3.699 ± 0.75
9.864SerLeu: 9.864 ± 1.075
1.233SerMet: 1.233 ± 1.172
3.699SerAsn: 3.699 ± 0.661
5.857SerPro: 5.857 ± 1.021
2.158SerGln: 2.158 ± 2.221
3.391SerArg: 3.391 ± 0.759
5.549SerSer: 5.549 ± 2.075
4.316SerThr: 4.316 ± 0.904
2.466SerVal: 2.466 ± 1.105
0.925SerTrp: 0.925 ± 0.508
2.466SerTyr: 2.466 ± 0.657
0.0SerXaa: 0.0 ± 0.0
Thr
4.932ThrAla: 4.932 ± 1.04
0.617ThrCys: 0.617 ± 0.325
2.158ThrAsp: 2.158 ± 0.719
5.549ThrGlu: 5.549 ± 1.201
2.774ThrPhe: 2.774 ± 1.305
3.391ThrGly: 3.391 ± 0.427
0.617ThrHis: 0.617 ± 0.339
4.932ThrIle: 4.932 ± 1.186
1.233ThrLys: 1.233 ± 0.677
5.549ThrLeu: 5.549 ± 1.003
2.158ThrMet: 2.158 ± 0.327
1.541ThrAsn: 1.541 ± 0.374
4.007ThrPro: 4.007 ± 1.771
1.233ThrGln: 1.233 ± 0.411
4.624ThrArg: 4.624 ± 1.481
5.857ThrSer: 5.857 ± 1.468
4.316ThrThr: 4.316 ± 0.992
5.857ThrVal: 5.857 ± 2.753
1.233ThrTrp: 1.233 ± 0.677
1.541ThrTyr: 1.541 ± 1.035
0.0ThrXaa: 0.0 ± 0.0
Val
5.549ValAla: 5.549 ± 1.201
0.617ValCys: 0.617 ± 0.339
3.083ValAsp: 3.083 ± 0.337
1.541ValGlu: 1.541 ± 1.035
2.774ValPhe: 2.774 ± 0.319
4.007ValGly: 4.007 ± 0.475
1.541ValHis: 1.541 ± 1.737
2.774ValIle: 2.774 ± 0.55
2.774ValLys: 2.774 ± 0.661
7.707ValLeu: 7.707 ± 3.333
1.233ValMet: 1.233 ± 0.356
3.699ValAsn: 3.699 ± 1.221
3.699ValPro: 3.699 ± 2.088
1.85ValGln: 1.85 ± 0.545
4.007ValArg: 4.007 ± 0.557
3.699ValSer: 3.699 ± 1.069
4.316ValThr: 4.316 ± 1.139
1.541ValVal: 1.541 ± 0.599
0.617ValTrp: 0.617 ± 0.615
2.158ValTyr: 2.158 ± 0.633
0.0ValXaa: 0.0 ± 0.0
Trp
1.85TrpAla: 1.85 ± 1.217
0.308TrpCys: 0.308 ± 0.169
0.617TrpAsp: 0.617 ± 0.339
1.541TrpGlu: 1.541 ± 0.847
1.541TrpPhe: 1.541 ± 0.473
1.541TrpGly: 1.541 ± 0.609
0.617TrpHis: 0.617 ± 0.339
0.925TrpIle: 0.925 ± 0.296
0.617TrpLys: 0.617 ± 0.339
1.233TrpLeu: 1.233 ± 0.356
1.233TrpMet: 1.233 ± 0.614
0.617TrpAsn: 0.617 ± 0.339
1.233TrpPro: 1.233 ± 0.677
0.617TrpGln: 0.617 ± 0.325
0.925TrpArg: 0.925 ± 0.59
1.85TrpSer: 1.85 ± 0.545
0.308TrpThr: 0.308 ± 0.426
0.925TrpVal: 0.925 ± 0.738
0.0TrpTrp: 0.0 ± 0.0
0.308TrpTyr: 0.308 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.233TyrAla: 1.233 ± 0.356
1.233TyrCys: 1.233 ± 0.356
1.541TyrAsp: 1.541 ± 0.374
2.466TyrGlu: 2.466 ± 1.031
1.541TyrPhe: 1.541 ± 0.868
2.466TyrGly: 2.466 ± 0.713
1.541TyrHis: 1.541 ± 0.561
1.541TyrIle: 1.541 ± 0.892
1.85TyrLys: 1.85 ± 1.016
4.624TyrLeu: 4.624 ± 1.419
0.0TyrMet: 0.0 ± 0.0
0.925TyrAsn: 0.925 ± 0.508
0.925TyrPro: 0.925 ± 0.508
1.233TyrGln: 1.233 ± 0.696
2.774TyrArg: 2.774 ± 0.765
1.85TyrSer: 1.85 ± 1.016
0.925TyrThr: 0.925 ± 1.125
0.925TyrVal: 0.925 ± 0.59
0.617TyrTrp: 0.617 ± 0.615
1.233TyrTyr: 1.233 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3245 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski