Amino acid dipepetide frequency for Hubei tombus-like virus 39

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.855AlaAla: 7.855 ± 0.323
0.491AlaCys: 0.491 ± 0.272
3.436AlaAsp: 3.436 ± 0.875
4.909AlaGlu: 4.909 ± 1.348
4.418AlaPhe: 4.418 ± 0.338
3.927AlaGly: 3.927 ± 0.594
0.491AlaHis: 0.491 ± 0.572
1.964AlaIle: 1.964 ± 1.089
2.946AlaLys: 2.946 ± 0.539
5.4AlaLeu: 5.4 ± 1.144
3.436AlaMet: 3.436 ± 0.535
3.927AlaAsn: 3.927 ± 0.536
4.418AlaPro: 4.418 ± 0.338
0.491AlaGln: 0.491 ± 0.272
3.927AlaArg: 3.927 ± 1.05
4.909AlaSer: 4.909 ± 1.49
6.382AlaThr: 6.382 ± 1.273
7.364AlaVal: 7.364 ± 2.108
1.964AlaTrp: 1.964 ± 0.525
1.964AlaTyr: 1.964 ± 1.089
0.0AlaXaa: 0.0 ± 0.0
Cys
1.473CysAla: 1.473 ± 0.468
0.0CysCys: 0.0 ± 0.0
1.473CysAsp: 1.473 ± 0.817
1.473CysGlu: 1.473 ± 0.817
0.0CysPhe: 0.0 ± 0.0
1.473CysGly: 1.473 ± 0.468
0.491CysHis: 0.491 ± 0.572
0.491CysIle: 0.491 ± 0.591
0.982CysLys: 0.982 ± 0.458
0.982CysLeu: 0.982 ± 0.402
0.0CysMet: 0.0 ± 0.0
0.491CysAsn: 0.491 ± 0.272
0.491CysPro: 0.491 ± 0.572
1.473CysGln: 1.473 ± 1.328
0.982CysArg: 0.982 ± 0.458
2.455CysSer: 2.455 ± 0.782
0.0CysThr: 0.0 ± 0.0
1.473CysVal: 1.473 ± 0.817
0.491CysTrp: 0.491 ± 0.272
0.982CysTyr: 0.982 ± 0.544
0.0CysXaa: 0.0 ± 0.0
Asp
4.909AspAla: 4.909 ± 1.49
0.982AspCys: 0.982 ± 0.544
4.909AspAsp: 4.909 ± 0.156
3.927AspGlu: 3.927 ± 1.05
3.436AspPhe: 3.436 ± 0.875
5.891AspGly: 5.891 ± 1.579
0.982AspHis: 0.982 ± 0.835
2.455AspIle: 2.455 ± 0.782
0.982AspLys: 0.982 ± 0.544
5.4AspLeu: 5.4 ± 2.607
0.982AspMet: 0.982 ± 0.544
1.473AspAsn: 1.473 ± 0.468
2.946AspPro: 2.946 ± 0.539
1.473AspGln: 1.473 ± 0.565
2.455AspArg: 2.455 ± 1.343
3.436AspSer: 3.436 ± 1.92
3.927AspThr: 3.927 ± 0.594
2.455AspVal: 2.455 ± 0.078
0.491AspTrp: 0.491 ± 0.572
1.964AspTyr: 1.964 ± 1.089
0.0AspXaa: 0.0 ± 0.0
Glu
5.4GluAla: 5.4 ± 1.316
2.455GluCys: 2.455 ± 1.361
1.964GluAsp: 1.964 ± 0.297
1.473GluGlu: 1.473 ± 0.817
2.946GluPhe: 2.946 ± 1.207
2.946GluGly: 2.946 ± 1.066
1.473GluHis: 1.473 ± 0.468
3.927GluIle: 3.927 ± 0.805
2.455GluLys: 2.455 ± 0.745
4.418GluLeu: 4.418 ± 1.134
1.964GluMet: 1.964 ± 0.525
2.455GluAsn: 2.455 ± 0.078
1.964GluPro: 1.964 ± 0.613
1.473GluGln: 1.473 ± 0.38
4.909GluArg: 4.909 ± 0.857
3.927GluSer: 3.927 ± 0.805
1.964GluThr: 1.964 ± 0.525
4.418GluVal: 4.418 ± 0.338
0.0GluTrp: 0.0 ± 0.0
3.927GluTyr: 3.927 ± 0.904
0.0GluXaa: 0.0 ± 0.0
Phe
1.964PheAla: 1.964 ± 0.613
0.491PheCys: 0.491 ± 0.572
4.909PheAsp: 4.909 ± 2.824
1.473PheGlu: 1.473 ± 0.817
2.455PhePhe: 2.455 ± 0.734
3.927PheGly: 3.927 ± 2.178
0.982PheHis: 0.982 ± 0.835
2.455PheIle: 2.455 ± 0.078
1.964PheLys: 1.964 ± 0.805
1.964PheLeu: 1.964 ± 0.916
0.0PheMet: 0.0 ± 0.0
0.982PheAsn: 0.982 ± 1.181
2.455PhePro: 2.455 ± 0.745
1.964PheGln: 1.964 ± 0.613
0.982PheArg: 0.982 ± 1.181
2.455PheSer: 2.455 ± 0.078
0.491PheThr: 0.491 ± 0.272
2.946PheVal: 2.946 ± 0.539
1.473PheTrp: 1.473 ± 0.38
0.982PheTyr: 0.982 ± 0.458
0.0PheXaa: 0.0 ± 0.0
Gly
3.436GlyAla: 3.436 ± 1.248
0.491GlyCys: 0.491 ± 0.272
2.455GlyAsp: 2.455 ± 0.782
2.455GlyGlu: 2.455 ± 0.078
3.927GlyPhe: 3.927 ± 1.373
4.418GlyGly: 4.418 ± 0.463
1.964GlyHis: 1.964 ± 0.525
3.927GlyIle: 3.927 ± 0.594
2.946GlyLys: 2.946 ± 0.268
8.346GlyLeu: 8.346 ± 1.882
1.964GlyMet: 1.964 ± 0.613
0.982GlyAsn: 0.982 ± 0.402
5.4GlyPro: 5.4 ± 1.732
2.455GlyGln: 2.455 ± 0.826
4.418GlyArg: 4.418 ± 1.141
5.891GlySer: 5.891 ± 1.113
3.927GlyThr: 3.927 ± 0.805
4.909GlyVal: 4.909 ± 1.544
0.982GlyTrp: 0.982 ± 0.544
2.455GlyTyr: 2.455 ± 0.782
0.0GlyXaa: 0.0 ± 0.0
His
1.473HisAla: 1.473 ± 1.328
0.0HisCys: 0.0 ± 0.0
1.964HisAsp: 1.964 ± 1.04
2.455HisGlu: 2.455 ± 0.078
0.982HisPhe: 0.982 ± 0.544
0.491HisGly: 0.491 ± 0.272
0.982HisHis: 0.982 ± 0.402
0.982HisIle: 0.982 ± 1.144
0.982HisLys: 0.982 ± 0.544
0.491HisLeu: 0.491 ± 0.572
0.491HisMet: 0.491 ± 0.572
0.982HisAsn: 0.982 ± 0.835
1.964HisPro: 1.964 ± 1.517
0.982HisGln: 0.982 ± 1.181
0.0HisArg: 0.0 ± 0.0
3.436HisSer: 3.436 ± 1.329
2.455HisThr: 2.455 ± 1.581
1.964HisVal: 1.964 ± 0.297
1.473HisTrp: 1.473 ± 0.565
0.491HisTyr: 0.491 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
2.946IleAla: 2.946 ± 0.539
0.982IleCys: 0.982 ± 0.544
2.455IleAsp: 2.455 ± 2.114
2.946IleGlu: 2.946 ± 1.902
0.0IlePhe: 0.0 ± 0.0
4.909IleGly: 4.909 ± 0.772
2.455IleHis: 2.455 ± 0.782
0.982IleIle: 0.982 ± 1.144
2.455IleLys: 2.455 ± 1.581
4.909IleLeu: 4.909 ± 0.961
1.473IleMet: 1.473 ± 0.38
0.982IleAsn: 0.982 ± 0.458
3.927IlePro: 3.927 ± 0.303
2.455IleGln: 2.455 ± 0.885
3.436IleArg: 3.436 ± 0.342
3.927IleSer: 3.927 ± 1.05
4.909IleThr: 4.909 ± 0.156
3.436IleVal: 3.436 ± 0.86
0.0IleTrp: 0.0 ± 0.0
0.491IleTyr: 0.491 ± 0.572
0.0IleXaa: 0.0 ± 0.0
Lys
3.927LysAla: 3.927 ± 1.511
0.982LysCys: 0.982 ± 0.835
0.491LysAsp: 0.491 ± 0.572
1.473LysGlu: 1.473 ± 0.38
1.473LysPhe: 1.473 ± 0.817
3.436LysGly: 3.436 ± 0.535
1.964LysHis: 1.964 ± 0.297
2.455LysIle: 2.455 ± 1.343
0.982LysLys: 0.982 ± 0.402
6.873LysLeu: 6.873 ± 1.07
1.964LysMet: 1.964 ± 0.984
0.491LysAsn: 0.491 ± 0.272
2.946LysPro: 2.946 ± 0.991
0.982LysGln: 0.982 ± 1.144
2.455LysArg: 2.455 ± 0.782
4.418LysSer: 4.418 ± 1.516
1.473LysThr: 1.473 ± 1.021
2.455LysVal: 2.455 ± 0.854
0.0LysTrp: 0.0 ± 0.0
1.473LysTyr: 1.473 ± 0.468
0.0LysXaa: 0.0 ± 0.0
Leu
7.364LeuAla: 7.364 ± 0.234
1.473LeuCys: 1.473 ± 0.817
2.946LeuAsp: 2.946 ± 0.539
4.909LeuGlu: 4.909 ± 0.857
2.455LeuPhe: 2.455 ± 0.885
6.873LeuGly: 6.873 ± 0.684
2.455LeuHis: 2.455 ± 0.885
1.964LeuIle: 1.964 ± 0.805
3.436LeuLys: 3.436 ± 1.056
7.364LeuLeu: 7.364 ± 0.586
1.964LeuMet: 1.964 ± 0.356
3.436LeuAsn: 3.436 ± 2.963
3.927LeuPro: 3.927 ± 1.227
1.473LeuGln: 1.473 ± 0.38
6.873LeuArg: 6.873 ± 1.751
6.873LeuSer: 6.873 ± 1.038
3.436LeuThr: 3.436 ± 2.702
6.873LeuVal: 6.873 ± 1.07
1.473LeuTrp: 1.473 ± 0.565
3.436LeuTyr: 3.436 ± 1.109
0.0LeuXaa: 0.0 ± 0.0
Met
4.418MetAla: 4.418 ± 1.077
0.491MetCys: 0.491 ± 0.572
1.473MetAsp: 1.473 ± 0.565
2.455MetGlu: 2.455 ± 0.782
1.473MetPhe: 1.473 ± 0.817
3.436MetGly: 3.436 ± 1.318
0.491MetHis: 0.491 ± 0.572
1.473MetIle: 1.473 ± 0.817
0.982MetLys: 0.982 ± 0.458
1.964MetLeu: 1.964 ± 1.04
0.982MetMet: 0.982 ± 0.402
0.0MetAsn: 0.0 ± 0.0
0.982MetPro: 0.982 ± 0.544
0.0MetGln: 0.0 ± 0.0
2.455MetArg: 2.455 ± 0.734
0.982MetSer: 0.982 ± 0.458
2.455MetThr: 2.455 ± 0.078
1.964MetVal: 1.964 ± 0.613
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.473AsnAla: 1.473 ± 0.468
0.982AsnCys: 0.982 ± 0.835
1.473AsnAsp: 1.473 ± 0.565
1.473AsnGlu: 1.473 ± 0.468
2.946AsnPhe: 2.946 ± 0.935
0.0AsnGly: 0.0 ± 0.0
0.982AsnHis: 0.982 ± 0.835
2.455AsnIle: 2.455 ± 1.581
1.964AsnLys: 1.964 ± 1.04
0.982AsnLeu: 0.982 ± 0.835
0.491AsnMet: 0.491 ± 0.272
2.946AsnAsn: 2.946 ± 1.129
0.982AsnPro: 0.982 ± 0.458
1.473AsnGln: 1.473 ± 0.817
1.964AsnArg: 1.964 ± 0.613
4.418AsnSer: 4.418 ± 0.338
5.4AsnThr: 5.4 ± 1.919
2.455AsnVal: 2.455 ± 0.854
0.0AsnTrp: 0.0 ± 0.0
2.455AsnTyr: 2.455 ± 0.826
0.0AsnXaa: 0.0 ± 0.0
Pro
5.4ProAla: 5.4 ± 2.372
0.0ProCys: 0.0 ± 0.0
5.4ProAsp: 5.4 ± 1.732
1.473ProGlu: 1.473 ± 0.817
0.491ProPhe: 0.491 ± 0.591
1.473ProGly: 1.473 ± 0.817
0.982ProHis: 0.982 ± 0.458
4.909ProIle: 4.909 ± 1.291
2.455ProLys: 2.455 ± 1.343
3.436ProLeu: 3.436 ± 0.342
1.473ProMet: 1.473 ± 0.468
1.473ProAsn: 1.473 ± 0.468
1.964ProPro: 1.964 ± 0.297
1.964ProGln: 1.964 ± 0.613
3.927ProArg: 3.927 ± 1.609
4.418ProSer: 4.418 ± 1.429
3.927ProThr: 3.927 ± 1.304
5.891ProVal: 5.891 ± 1.892
0.982ProTrp: 0.982 ± 0.402
1.473ProTyr: 1.473 ± 0.817
0.0ProXaa: 0.0 ± 0.0
Gln
2.455GlnAla: 2.455 ± 0.885
0.982GlnCys: 0.982 ± 0.544
1.473GlnAsp: 1.473 ± 0.817
2.455GlnGlu: 2.455 ± 1.581
0.982GlnPhe: 0.982 ± 0.402
1.473GlnGly: 1.473 ± 0.38
0.491GlnHis: 0.491 ± 0.591
0.982GlnIle: 0.982 ± 0.835
0.491GlnLys: 0.491 ± 0.272
2.946GlnLeu: 2.946 ± 1.129
0.982GlnMet: 0.982 ± 0.746
0.0GlnAsn: 0.0 ± 0.0
1.473GlnPro: 1.473 ± 0.817
1.473GlnGln: 1.473 ± 0.817
1.473GlnArg: 1.473 ± 0.951
2.946GlnSer: 2.946 ± 0.268
1.964GlnThr: 1.964 ± 0.916
2.455GlnVal: 2.455 ± 1.361
0.0GlnTrp: 0.0 ± 0.0
0.491GlnTyr: 0.491 ± 0.591
0.0GlnXaa: 0.0 ± 0.0
Arg
3.436ArgAla: 3.436 ± 0.875
0.982ArgCys: 0.982 ± 0.402
5.4ArgAsp: 5.4 ± 1.732
3.436ArgGlu: 3.436 ± 0.875
0.982ArgPhe: 0.982 ± 1.144
2.946ArgGly: 2.946 ± 0.76
1.473ArgHis: 1.473 ± 0.565
1.964ArgIle: 1.964 ± 0.613
3.436ArgLys: 3.436 ± 0.342
3.927ArgLeu: 3.927 ± 0.805
3.927ArgMet: 3.927 ± 0.594
3.436ArgAsn: 3.436 ± 1.904
3.436ArgPro: 3.436 ± 0.535
1.473ArgGln: 1.473 ± 0.38
4.909ArgArg: 4.909 ± 0.697
3.927ArgSer: 3.927 ± 1.577
2.455ArgThr: 2.455 ± 0.078
5.891ArgVal: 5.891 ± 1.982
0.491ArgTrp: 0.491 ± 0.272
2.946ArgTyr: 2.946 ± 1.338
0.0ArgXaa: 0.0 ± 0.0
Ser
2.946SerAla: 2.946 ± 1.633
0.491SerCys: 0.491 ± 0.272
3.436SerAsp: 3.436 ± 0.535
5.4SerGlu: 5.4 ± 1.887
2.455SerPhe: 2.455 ± 0.078
7.364SerGly: 7.364 ± 0.894
2.455SerHis: 2.455 ± 0.854
3.927SerIle: 3.927 ± 1.839
4.418SerLys: 4.418 ± 0.661
6.382SerLeu: 6.382 ± 0.987
1.964SerMet: 1.964 ± 0.297
6.382SerAsn: 6.382 ± 0.632
5.4SerPro: 5.4 ± 1.086
2.946SerGln: 2.946 ± 0.991
3.436SerArg: 3.436 ± 0.342
10.309SerSer: 10.309 ± 2.736
6.382SerThr: 6.382 ± 0.226
5.4SerVal: 5.4 ± 0.729
0.491SerTrp: 0.491 ± 0.591
1.473SerTyr: 1.473 ± 0.951
0.0SerXaa: 0.0 ± 0.0
Thr
2.946ThrAla: 2.946 ± 0.76
1.473ThrCys: 1.473 ± 1.772
4.418ThrAsp: 4.418 ± 1.931
3.927ThrGlu: 3.927 ± 0.594
1.473ThrPhe: 1.473 ± 0.565
3.927ThrGly: 3.927 ± 0.303
0.982ThrHis: 0.982 ± 1.144
6.382ThrIle: 6.382 ± 2.443
0.491ThrLys: 0.491 ± 0.591
4.418ThrLeu: 4.418 ± 0.661
0.982ThrMet: 0.982 ± 0.544
1.473ThrAsn: 1.473 ± 1.021
4.418ThrPro: 4.418 ± 1.141
2.946ThrGln: 2.946 ± 1.207
3.927ThrArg: 3.927 ± 1.511
6.382ThrSer: 6.382 ± 1.909
10.309ThrThr: 10.309 ± 3.185
5.891ThrVal: 5.891 ± 0.891
0.982ThrTrp: 0.982 ± 0.402
2.946ThrTyr: 2.946 ± 2.042
0.0ThrXaa: 0.0 ± 0.0
Val
8.837ValAla: 8.837 ± 1.452
1.964ValCys: 1.964 ± 0.916
3.436ValAsp: 3.436 ± 1.056
5.891ValGlu: 5.891 ± 0.891
0.491ValPhe: 0.491 ± 0.272
5.891ValGly: 5.891 ± 1.113
2.455ValHis: 2.455 ± 0.734
4.418ValIle: 4.418 ± 0.338
5.4ValLys: 5.4 ± 1.914
5.4ValLeu: 5.4 ± 1.004
1.473ValMet: 1.473 ± 0.817
3.927ValAsn: 3.927 ± 0.805
1.964ValPro: 1.964 ± 1.089
0.982ValGln: 0.982 ± 0.458
4.909ValArg: 4.909 ± 0.857
5.891ValSer: 5.891 ± 0.804
4.418ValThr: 4.418 ± 2.429
8.346ValVal: 8.346 ± 3.989
1.473ValTrp: 1.473 ± 0.38
3.436ValTyr: 3.436 ± 1.318
0.0ValXaa: 0.0 ± 0.0
Trp
0.982TrpAla: 0.982 ± 1.144
0.982TrpCys: 0.982 ± 0.402
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.964TrpPhe: 1.964 ± 0.613
0.982TrpGly: 0.982 ± 0.544
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.491TrpLys: 0.491 ± 0.572
1.964TrpLeu: 1.964 ± 1.04
0.0TrpMet: 0.0 ± 0.0
0.491TrpAsn: 0.491 ± 0.272
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.473TrpArg: 1.473 ± 0.817
0.491TrpSer: 0.491 ± 0.272
0.982TrpThr: 0.982 ± 0.402
0.982TrpVal: 0.982 ± 0.402
0.0TrpTrp: 0.0 ± 0.0
0.982TrpTyr: 0.982 ± 0.458
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.982TyrAla: 0.982 ± 0.835
0.982TyrCys: 0.982 ± 0.458
1.964TyrAsp: 1.964 ± 0.613
2.455TyrGlu: 2.455 ± 0.885
1.964TyrPhe: 1.964 ± 1.04
1.964TyrGly: 1.964 ± 0.297
0.491TyrHis: 0.491 ± 0.572
1.964TyrIle: 1.964 ± 0.613
2.455TyrLys: 2.455 ± 1.361
3.436TyrLeu: 3.436 ± 1.121
1.473TyrMet: 1.473 ± 0.38
0.982TyrAsn: 0.982 ± 0.458
2.455TyrPro: 2.455 ± 0.854
0.0TyrGln: 0.0 ± 0.0
1.964TyrArg: 1.964 ± 1.089
1.964TyrSer: 1.964 ± 0.297
3.436TyrThr: 3.436 ± 0.342
3.436TyrVal: 3.436 ± 0.535
0.0TyrTrp: 0.0 ± 0.0
1.473TyrTyr: 1.473 ± 0.38
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2038 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski