Amino acid dipeptide frequency for Hubei tombus-like virus 40

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
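The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function names (`dipeptide_permille`, `colour`), the use of one-letter residue codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Frequency expected if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping adjacent pairs are counted inside each protein
    and pooled over the whole proteome.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def colour(freq_permille):
    """Mimic the table colouring: red above the expectation, blue below."""
    if freq_permille > EXPECTED_PERMILLE:
        return "red"
    if freq_permille < EXPECTED_PERMILLE:
        return "blue"
    return "none"


if __name__ == "__main__":
    toy_proteome = ["MKALAL", "ALG"]   # hypothetical sequences
    freqs = dipeptide_permille(toy_proteome)
    print(freqs["AL"], colour(freqs["AL"]))
```

With real data, the input would be the protein sequences of the proteome; the toy sequences here only show the call pattern.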

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
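As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration; the value 13.47 is the AlaLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n."""
    return value_permille / (1000.0 / n_dipeptides)


ala_leu = 13.47  # AlaLeu frequency in per mille, taken from the table
print(round(permille_to_fraction(ala_leu), 6))  # 0.01347
print(round(permille_to_percent(ala_leu), 4))   # 1.347
print(round(fold_vs_uniform(ala_leu), 3))       # 5.388
```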

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.772 ± 0.258
AlaCys: 1.616 ± 0.916
AlaAsp: 3.772 ± 2.051
AlaGlu: 4.849 ± 1.549
AlaPhe: 2.155 ± 0.957
AlaGly: 5.388 ± 1.29
AlaHis: 1.616 ± 0.746
AlaIle: 2.155 ± 0.678
AlaLys: 4.849 ± 1.152
AlaLeu: 13.47 ± 1.91
AlaMet: 1.078 ± 0.725
AlaAsn: 1.078 ± 0.752
AlaPro: 3.772 ± 1.239
AlaGln: 2.155 ± 0.95
AlaArg: 9.698 ± 1.363
AlaSer: 2.155 ± 1.38
AlaThr: 2.155 ± 0.957
AlaVal: 5.388 ± 1.29
AlaTrp: 2.155 ± 0.281
AlaTyr: 1.616 ± 0.838
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.616 ± 1.467
CysCys: 0.539 ± 0.381
CysAsp: 1.078 ± 0.761
CysGlu: 1.078 ± 0.752
CysPhe: 0.0 ± 0.0
CysGly: 1.616 ± 0.157
CysHis: 0.0 ± 0.0
CysIle: 1.616 ± 0.157
CysLys: 0.0 ± 0.0
CysLeu: 3.233 ± 0.54
CysMet: 0.539 ± 0.489
CysAsn: 0.539 ± 0.376
CysPro: 2.694 ± 1.782
CysGln: 0.539 ± 0.376
CysArg: 2.694 ± 0.908
CysSer: 3.772 ± 1.444
CysThr: 0.539 ± 0.376
CysVal: 0.539 ± 0.381
CysTrp: 0.0 ± 0.0
CysTyr: 1.616 ± 0.746
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.849 ± 0.922
AspCys: 1.616 ± 0.607
AspAsp: 1.616 ± 0.607
AspGlu: 3.233 ± 1.213
AspPhe: 1.616 ± 0.615
AspGly: 4.31 ± 1.177
AspHis: 0.0 ± 0.0
AspIle: 1.616 ± 0.157
AspLys: 1.078 ± 0.339
AspLeu: 2.694 ± 0.761
AspMet: 1.078 ± 0.339
AspAsn: 3.233 ± 1.23
AspPro: 3.233 ± 0.315
AspGln: 2.155 ± 0.281
AspArg: 5.927 ± 2.11
AspSer: 1.616 ± 0.638
AspThr: 1.616 ± 0.916
AspVal: 3.233 ± 1.23
AspTrp: 1.616 ± 1.142
AspTyr: 0.539 ± 0.376
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.772 ± 1.23
GluCys: 1.078 ± 0.339
GluAsp: 3.772 ± 1.23
GluGlu: 4.31 ± 0.751
GluPhe: 3.233 ± 1.018
GluGly: 5.927 ± 1.049
GluHis: 2.155 ± 0.965
GluIle: 1.616 ± 0.607
GluLys: 1.616 ± 0.838
GluLeu: 5.388 ± 0.402
GluMet: 1.616 ± 0.916
GluAsn: 2.155 ± 0.281
GluPro: 1.078 ± 0.339
GluGln: 3.772 ± 1.133
GluArg: 4.31 ± 1.238
GluSer: 3.233 ± 1.832
GluThr: 4.31 ± 1.326
GluVal: 2.694 ± 0.201
GluTrp: 1.078 ± 0.339
GluTyr: 2.155 ± 0.854
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.233 ± 1.136
PheCys: 0.539 ± 0.381
PheAsp: 2.155 ± 0.678
PheGlu: 0.539 ± 0.381
PhePhe: 0.539 ± 0.381
PheGly: 4.31 ± 0.563
PheHis: 1.616 ± 1.142
PheIle: 1.616 ± 0.157
PheLys: 0.0 ± 0.0
PheLeu: 1.616 ± 1.142
PheMet: 1.078 ± 0.761
PheAsn: 1.616 ± 0.615
PhePro: 0.0 ± 0.0
PheGln: 0.539 ± 0.376
PheArg: 2.155 ± 1.504
PheSer: 2.155 ± 0.396
PheThr: 1.616 ± 0.638
PheVal: 1.616 ± 0.157
PheTrp: 1.078 ± 0.503
PheTyr: 0.539 ± 0.376
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.31 ± 1.767
GlyCys: 2.155 ± 1.38
GlyAsp: 7.004 ± 2.004
GlyGlu: 3.233 ± 0.315
GlyPhe: 0.0 ± 0.0
GlyGly: 4.31 ± 1.914
GlyHis: 1.616 ± 0.607
GlyIle: 1.616 ± 0.157
GlyLys: 2.694 ± 1.131
GlyLeu: 5.388 ± 1.821
GlyMet: 2.694 ± 0.313
GlyAsn: 1.078 ± 0.339
GlyPro: 4.849 ± 2.796
GlyGln: 3.233 ± 1.28
GlyArg: 9.698 ± 3.678
GlySer: 3.233 ± 0.639
GlyThr: 3.772 ± 0.535
GlyVal: 9.698 ± 2.409
GlyTrp: 1.078 ± 0.503
GlyTyr: 4.31 ± 0.563
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.155 ± 0.95
HisCys: 0.0 ± 0.0
HisAsp: 2.155 ± 0.678
HisGlu: 3.233 ± 0.54
HisPhe: 0.539 ± 0.381
HisGly: 2.155 ± 0.643
HisHis: 0.539 ± 0.376
HisIle: 1.078 ± 0.339
HisLys: 0.539 ± 0.381
HisLeu: 1.616 ± 0.607
HisMet: 0.539 ± 0.376
HisAsn: 1.078 ± 0.752
HisPro: 1.078 ± 0.761
HisGln: 0.539 ± 0.376
HisArg: 2.694 ± 1.879
HisSer: 0.539 ± 0.489
HisThr: 2.694 ± 0.643
HisVal: 0.539 ± 0.376
HisTrp: 0.0 ± 0.0
HisTyr: 0.539 ± 0.376
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.694 ± 0.519
IleCys: 1.616 ± 1.467
IleAsp: 2.155 ± 1.072
IleGlu: 1.616 ± 0.615
IlePhe: 1.616 ± 0.157
IleGly: 2.694 ± 0.908
IleHis: 0.539 ± 0.376
IleIle: 2.694 ± 0.519
IleLys: 0.0 ± 0.0
IleLeu: 3.233 ± 1.136
IleMet: 0.539 ± 0.376
IleAsn: 1.078 ± 0.339
IlePro: 4.31 ± 0.751
IleGln: 0.0 ± 0.0
IleArg: 4.31 ± 1.507
IleSer: 4.31 ± 0.637
IleThr: 3.233 ± 0.315
IleVal: 5.927 ± 1.44
IleTrp: 1.616 ± 0.746
IleTyr: 2.155 ± 0.854
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.155 ± 1.522
LysCys: 0.0 ± 0.0
LysAsp: 1.078 ± 0.339
LysGlu: 1.616 ± 0.916
LysPhe: 1.078 ± 0.339
LysGly: 3.772 ± 1.144
LysHis: 0.539 ± 0.376
LysIle: 2.155 ± 1.072
LysLys: 0.0 ± 0.0
LysLeu: 2.155 ± 0.854
LysMet: 1.078 ± 0.503
LysAsn: 1.616 ± 0.157
LysPro: 0.0 ± 0.0
LysGln: 0.0 ± 0.0
LysArg: 4.31 ± 1.326
LysSer: 2.694 ± 0.519
LysThr: 2.694 ± 1.395
LysVal: 1.616 ± 0.916
LysTrp: 1.078 ± 0.761
LysTyr: 0.539 ± 0.376
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.543 ± 0.336
LeuCys: 2.694 ± 1.131
LeuAsp: 3.233 ± 0.639
LeuGlu: 6.466 ± 2.144
LeuPhe: 2.155 ± 1.38
LeuGly: 7.004 ± 2.413
LeuHis: 3.233 ± 1.018
LeuIle: 3.772 ± 0.258
LeuLys: 3.233 ± 1.508
LeuLeu: 10.237 ± 3.245
LeuMet: 1.078 ± 0.503
LeuAsn: 2.694 ± 0.761
LeuPro: 5.388 ± 0.675
LeuGln: 4.849 ± 1.146
LeuArg: 7.543 ± 3.245
LeuSer: 10.237 ± 1.524
LeuThr: 3.233 ± 2.339
LeuVal: 4.31 ± 0.792
LeuTrp: 2.694 ± 1.131
LeuTyr: 3.772 ± 1.444
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.155 ± 1.072
MetCys: 0.0 ± 0.0
MetAsp: 1.616 ± 0.746
MetGlu: 0.539 ± 0.489
MetPhe: 2.155 ± 0.643
MetGly: 1.078 ± 0.339
MetHis: 0.0 ± 0.0
MetIle: 0.539 ± 0.381
MetLys: 2.155 ± 1.38
MetLeu: 4.849 ± 1.774
MetMet: 0.539 ± 0.381
MetAsn: 1.078 ± 0.978
MetPro: 1.078 ± 0.761
MetGln: 1.078 ± 0.339
MetArg: 2.155 ± 0.643
MetSer: 0.539 ± 0.381
MetThr: 0.539 ± 0.381
MetVal: 1.078 ± 0.339
MetTrp: 0.539 ± 0.376
MetTyr: 1.078 ± 0.427
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.616 ± 0.615
AsnCys: 2.155 ± 0.957
AsnAsp: 1.078 ± 0.752
AsnGlu: 1.616 ± 0.607
AsnPhe: 0.539 ± 0.381
AsnGly: 2.694 ± 0.918
AsnHis: 1.078 ± 0.339
AsnIle: 3.772 ± 0.258
AsnLys: 0.539 ± 0.376
AsnLeu: 2.155 ± 0.396
AsnMet: 2.155 ± 0.396
AsnAsn: 0.0 ± 0.0
AsnPro: 1.078 ± 0.339
AsnGln: 1.078 ± 0.339
AsnArg: 2.694 ± 1.312
AsnSer: 1.616 ± 0.916
AsnThr: 0.539 ± 0.381
AsnVal: 2.155 ± 0.281
AsnTrp: 1.078 ± 0.503
AsnTyr: 1.078 ± 0.427
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.466 ± 2.036
ProCys: 2.694 ± 0.761
ProAsp: 0.0 ± 0.0
ProGlu: 3.772 ± 1.55
ProPhe: 4.849 ± 1.569
ProGly: 1.616 ± 0.615
ProHis: 2.155 ± 0.957
ProIle: 2.155 ± 0.854
ProLys: 1.616 ± 0.746
ProLeu: 4.849 ± 2.295
ProMet: 0.539 ± 0.381
ProAsn: 1.616 ± 0.838
ProPro: 3.233 ± 0.839
ProGln: 0.539 ± 0.489
ProArg: 4.849 ± 1.82
ProSer: 2.155 ± 1.38
ProThr: 3.772 ± 1.649
ProVal: 4.31 ± 1.525
ProTrp: 0.539 ± 0.489
ProTyr: 2.694 ± 0.643
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.616 ± 0.157
GlnCys: 0.0 ± 0.0
GlnAsp: 1.616 ± 0.916
GlnGlu: 2.155 ± 0.957
GlnPhe: 1.078 ± 0.427
GlnGly: 1.078 ± 0.752
GlnHis: 2.155 ± 0.281
GlnIle: 1.616 ± 0.157
GlnLys: 1.078 ± 0.503
GlnLeu: 3.233 ± 0.315
GlnMet: 0.539 ± 0.381
GlnAsn: 2.155 ± 0.281
GlnPro: 3.772 ± 1.55
GlnGln: 1.078 ± 0.503
GlnArg: 3.772 ± 1.583
GlnSer: 1.078 ± 0.503
GlnThr: 1.078 ± 0.503
GlnVal: 1.616 ± 0.607
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.155 ± 0.396
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.082 ± 2.709
ArgCys: 1.616 ± 0.607
ArgAsp: 3.233 ± 1.68
ArgGlu: 5.927 ± 0.852
ArgPhe: 3.233 ± 1.705
ArgGly: 9.698 ± 3.693
ArgHis: 1.616 ± 0.607
ArgIle: 2.694 ± 1.312
ArgLys: 2.694 ± 0.908
ArgLeu: 6.466 ± 0.844
ArgMet: 1.078 ± 0.978
ArgAsn: 4.31 ± 1.289
ArgPro: 7.543 ± 1.194
ArgGln: 4.849 ± 1.587
ArgArg: 5.927 ± 0.594
ArgSer: 4.31 ± 0.79
ArgThr: 6.466 ± 0.629
ArgVal: 6.466 ± 2.851
ArgTrp: 2.155 ± 1.504
ArgTyr: 3.233 ± 1.23
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.155 ± 1.304
SerCys: 0.539 ± 0.376
SerAsp: 4.849 ± 1.152
SerGlu: 4.31 ± 0.79
SerPhe: 1.078 ± 0.761
SerGly: 5.388 ± 1.287
SerHis: 2.155 ± 0.281
SerIle: 3.233 ± 0.54
SerLys: 1.616 ± 0.746
SerLeu: 7.543 ± 3.827
SerMet: 2.155 ± 1.957
SerAsn: 1.078 ± 0.339
SerPro: 2.155 ± 0.396
SerGln: 2.694 ± 1.857
SerArg: 2.694 ± 0.519
SerSer: 4.849 ± 4.402
SerThr: 6.466 ± 1.248
SerVal: 3.233 ± 0.968
SerTrp: 2.155 ± 1.38
SerTyr: 3.233 ± 1.23
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.849 ± 1.228
ThrCys: 1.078 ± 0.978
ThrAsp: 1.616 ± 0.157
ThrGlu: 2.694 ± 0.201
ThrPhe: 1.078 ± 0.752
ThrGly: 5.388 ± 1.037
ThrHis: 0.539 ± 0.376
ThrIle: 5.927 ± 1.44
ThrLys: 2.155 ± 1.006
ThrLeu: 2.694 ± 0.519
ThrMet: 2.694 ± 1.21
ThrAsn: 1.078 ± 0.427
ThrPro: 4.31 ± 2.76
ThrGln: 1.078 ± 0.427
ThrArg: 4.849 ± 0.503
ThrSer: 6.466 ± 2.32
ThrThr: 6.466 ± 0.629
ThrVal: 4.31 ± 0.792
ThrTrp: 1.078 ± 0.427
ThrTyr: 1.078 ± 0.752
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.159 ± 3.702
ValCys: 1.078 ± 0.752
ValAsp: 3.233 ± 1.23
ValGlu: 4.31 ± 1.93
ValPhe: 1.078 ± 0.339
ValGly: 4.849 ± 0.307
ValHis: 1.078 ± 0.752
ValIle: 3.772 ± 0.467
ValLys: 1.616 ± 1.142
ValLeu: 8.082 ± 2.073
ValMet: 1.078 ± 0.339
ValAsn: 1.078 ± 0.752
ValPro: 2.155 ± 0.965
ValGln: 1.616 ± 1.128
ValArg: 5.927 ± 1.662
ValSer: 5.927 ± 1.446
ValThr: 5.927 ± 1.895
ValVal: 7.543 ± 1.172
ValTrp: 0.539 ± 0.376
ValTyr: 3.233 ± 0.454
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.078 ± 0.427
TrpCys: 1.616 ± 0.746
TrpAsp: 0.539 ± 0.376
TrpGlu: 1.616 ± 0.838
TrpPhe: 0.539 ± 0.381
TrpGly: 1.078 ± 0.427
TrpHis: 0.539 ± 0.489
TrpIle: 1.078 ± 0.427
TrpLys: 1.078 ± 0.339
TrpLeu: 2.155 ± 1.38
TrpMet: 0.539 ± 0.381
TrpAsn: 0.539 ± 0.376
TrpPro: 0.0 ± 0.0
TrpGln: 1.078 ± 0.503
TrpArg: 1.078 ± 0.761
TrpSer: 1.078 ± 0.427
TrpThr: 1.078 ± 0.503
TrpVal: 4.31 ± 1.525
TrpTrp: 0.539 ± 0.381
TrpTyr: 0.539 ± 0.489
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.155 ± 0.643
TyrCys: 1.616 ± 0.607
TyrAsp: 1.078 ± 0.503
TyrGlu: 2.155 ± 0.678
TyrPhe: 0.0 ± 0.0
TyrGly: 1.616 ± 0.638
TyrHis: 1.078 ± 0.339
TyrIle: 1.616 ± 0.746
TyrLys: 1.616 ± 0.838
TyrLeu: 4.31 ± 1.707
TyrMet: 1.616 ± 1.467
TyrAsn: 1.616 ± 0.607
TyrPro: 2.694 ± 0.643
TyrGln: 0.0 ± 0.0
TyrArg: 4.31 ± 0.751
TyrSer: 1.616 ± 0.615
TyrThr: 3.233 ± 0.639
TyrVal: 2.694 ± 1.312
TyrTrp: 1.078 ± 0.503
TyrTyr: 3.233 ± 1.213
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1857 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
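The note on the error estimate can be illustrated with a short sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. This is an assumption-laden illustration, not the database's code. In particular, the function names are hypothetical, one-letter residue codes are used, and taking the sample standard deviation as the reported error is an assumption; the source only states that bootstrapping (×100) was done at the protein level.

```python
import random
from statistics import stdev


def single_dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over all adjacent residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Protein-level bootstrap: resample whole proteins with replacement.

    Returns the sample standard deviation of the resampled frequencies
    (assumed here to be the quantity shown after the ± sign).
    Requires a non-empty list of proteins and at least two resamples.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(single_dipeptide_permille(resample, dipeptide))
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKALAL", "ALG", "GGALK"]   # hypothetical sequences
    point = single_dipeptide_permille(toy_proteome, "AL")
    error = bootstrap_error(toy_proteome, "AL")
    print(f"AL: {point:.3f} ± {error:.3f}")
```

With only 3 proteins, as in this proteome, each resample draws from very few distinct units, which is one reason the reported errors can be large relative to the frequencies themselves.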

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski