Amino acid dipepetide frequency for Dolichos yellow mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.674AlaAla: 3.674 ± 0.802
0.612AlaCys: 0.612 ± 0.507
3.062AlaAsp: 3.062 ± 0.864
1.837AlaGlu: 1.837 ± 0.575
3.062AlaPhe: 3.062 ± 1.162
0.0AlaGly: 0.0 ± 0.0
2.449AlaHis: 2.449 ± 1.358
1.225AlaIle: 1.225 ± 0.746
5.511AlaLys: 5.511 ± 1.923
4.899AlaLeu: 4.899 ± 1.511
0.612AlaMet: 0.612 ± 0.443
0.612AlaAsn: 0.612 ± 0.507
1.837AlaPro: 1.837 ± 1.016
5.511AlaGln: 5.511 ± 1.919
3.674AlaArg: 3.674 ± 2.473
4.287AlaSer: 4.287 ± 1.127
4.287AlaThr: 4.287 ± 1.333
4.899AlaVal: 4.899 ± 1.509
1.225AlaTrp: 1.225 ± 0.639
1.225AlaTyr: 1.225 ± 0.64
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.449CysGlu: 2.449 ± 0.901
0.0CysPhe: 0.0 ± 0.0
1.837CysGly: 1.837 ± 0.783
0.0CysHis: 0.0 ± 0.0
1.225CysIle: 1.225 ± 0.853
1.225CysLys: 1.225 ± 0.812
0.612CysLeu: 0.612 ± 0.531
1.225CysMet: 1.225 ± 0.84
2.449CysAsn: 2.449 ± 0.987
0.612CysPro: 0.612 ± 0.517
0.0CysGln: 0.0 ± 0.0
1.837CysArg: 1.837 ± 0.783
2.449CysSer: 2.449 ± 1.767
0.612CysThr: 0.612 ± 0.507
1.837CysVal: 1.837 ± 1.061
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.449AspAla: 2.449 ± 0.827
0.612AspCys: 0.612 ± 0.531
2.449AspAsp: 2.449 ± 1.094
3.062AspGlu: 3.062 ± 0.785
2.449AspPhe: 2.449 ± 0.869
3.674AspGly: 3.674 ± 1.266
1.225AspHis: 1.225 ± 0.717
3.062AspIle: 3.062 ± 1.226
1.837AspLys: 1.837 ± 0.575
6.124AspLeu: 6.124 ± 1.799
1.225AspMet: 1.225 ± 0.65
1.225AspAsn: 1.225 ± 0.669
4.287AspPro: 4.287 ± 1.608
0.0AspGln: 0.0 ± 0.0
1.837AspArg: 1.837 ± 1.018
5.511AspSer: 5.511 ± 0.945
2.449AspThr: 2.449 ± 1.005
5.511AspVal: 5.511 ± 0.846
1.225AspTrp: 1.225 ± 1.035
1.225AspTyr: 1.225 ± 0.64
0.0AspXaa: 0.0 ± 0.0
Glu
4.899GluAla: 4.899 ± 1.279
1.225GluCys: 1.225 ± 0.751
0.612GluAsp: 0.612 ± 0.531
3.674GluGlu: 3.674 ± 1.687
2.449GluPhe: 2.449 ± 0.957
2.449GluGly: 2.449 ± 0.821
2.449GluHis: 2.449 ± 1.005
0.0GluIle: 0.0 ± 0.0
0.0GluLys: 0.0 ± 0.0
5.511GluLeu: 5.511 ± 1.454
1.225GluMet: 1.225 ± 1.204
3.062GluAsn: 3.062 ± 1.403
1.837GluPro: 1.837 ± 0.782
3.674GluGln: 3.674 ± 1.381
3.062GluArg: 3.062 ± 2.169
4.287GluSer: 4.287 ± 1.334
2.449GluThr: 2.449 ± 2.123
1.837GluVal: 1.837 ± 1.211
1.837GluTrp: 1.837 ± 1.156
1.837GluTyr: 1.837 ± 1.174
0.0GluXaa: 0.0 ± 0.0
Phe
1.837PheAla: 1.837 ± 0.575
0.612PheCys: 0.612 ± 0.507
3.062PheAsp: 3.062 ± 1.574
1.837PheGlu: 1.837 ± 0.885
1.225PhePhe: 1.225 ± 0.717
1.837PheGly: 1.837 ± 1.11
0.612PheHis: 0.612 ± 0.517
2.449PheIle: 2.449 ± 1.104
4.287PheLys: 4.287 ± 1.525
3.674PheLeu: 3.674 ± 2.391
1.837PheMet: 1.837 ± 0.91
4.899PheAsn: 4.899 ± 2.059
2.449PhePro: 2.449 ± 1.742
2.449PheGln: 2.449 ± 1.218
4.899PheArg: 4.899 ± 1.611
5.511PheSer: 5.511 ± 1.1
1.225PheThr: 1.225 ± 0.707
3.674PheVal: 3.674 ± 2.83
1.837PheTrp: 1.837 ± 1.061
1.225PheTyr: 1.225 ± 0.812
0.0PheXaa: 0.0 ± 0.0
Gly
4.899GlyAla: 4.899 ± 1.91
1.837GlyCys: 1.837 ± 0.789
2.449GlyAsp: 2.449 ± 0.652
1.837GlyGlu: 1.837 ± 0.999
3.062GlyPhe: 3.062 ± 1.273
2.449GlyGly: 2.449 ± 0.949
0.612GlyHis: 0.612 ± 0.517
4.287GlyIle: 4.287 ± 2.031
4.287GlyLys: 4.287 ± 1.694
2.449GlyLeu: 2.449 ± 1.153
1.225GlyMet: 1.225 ± 1.077
2.449GlyAsn: 2.449 ± 1.012
3.062GlyPro: 3.062 ± 0.785
1.225GlyGln: 1.225 ± 1.014
3.674GlyArg: 3.674 ± 0.871
1.225GlySer: 1.225 ± 0.717
4.287GlyThr: 4.287 ± 1.593
2.449GlyVal: 2.449 ± 0.917
0.0GlyTrp: 0.0 ± 0.0
0.612GlyTyr: 0.612 ± 0.568
0.0GlyXaa: 0.0 ± 0.0
His
1.837HisAla: 1.837 ± 0.628
1.225HisCys: 1.225 ± 1.145
0.612HisAsp: 0.612 ± 0.507
0.612HisGlu: 0.612 ± 0.71
1.837HisPhe: 1.837 ± 1.168
2.449HisGly: 2.449 ± 0.917
1.225HisHis: 1.225 ± 0.864
0.612HisIle: 0.612 ± 0.531
1.225HisLys: 1.225 ± 0.812
2.449HisLeu: 2.449 ± 1.094
0.612HisMet: 0.612 ± 0.568
3.062HisAsn: 3.062 ± 1.527
1.225HisPro: 1.225 ± 0.724
3.062HisGln: 3.062 ± 0.961
4.287HisArg: 4.287 ± 1.712
1.225HisSer: 1.225 ± 0.528
1.837HisThr: 1.837 ± 1.521
1.225HisVal: 1.225 ± 0.746
0.0HisTrp: 0.0 ± 0.0
1.225HisTyr: 1.225 ± 0.528
0.0HisXaa: 0.0 ± 0.0
Ile
1.225IleAla: 1.225 ± 0.772
0.612IleCys: 0.612 ± 0.621
2.449IleAsp: 2.449 ± 1.502
2.449IleGlu: 2.449 ± 1.411
2.449IlePhe: 2.449 ± 2.07
2.449IleGly: 2.449 ± 1.089
1.837IleHis: 1.837 ± 1.282
1.837IleIle: 1.837 ± 0.857
7.348IleLys: 7.348 ± 1.125
4.287IleLeu: 4.287 ± 1.561
0.612IleMet: 0.612 ± 0.71
1.225IleAsn: 1.225 ± 0.707
0.612IlePro: 0.612 ± 0.517
0.612IleGln: 0.612 ± 0.572
4.287IleArg: 4.287 ± 0.673
4.899IleSer: 4.899 ± 1.157
4.899IleThr: 4.899 ± 1.177
3.062IleVal: 3.062 ± 1.365
1.837IleTrp: 1.837 ± 0.958
1.837IleTyr: 1.837 ± 1.204
0.0IleXaa: 0.0 ± 0.0
Lys
6.124LysAla: 6.124 ± 2.316
2.449LysCys: 2.449 ± 0.903
2.449LysAsp: 2.449 ± 0.903
3.674LysGlu: 3.674 ± 2.62
1.837LysPhe: 1.837 ± 0.858
1.837LysGly: 1.837 ± 0.901
1.837LysHis: 1.837 ± 0.575
3.062LysIle: 3.062 ± 1.625
2.449LysLys: 2.449 ± 1.325
4.287LysLeu: 4.287 ± 1.814
0.0LysMet: 0.0 ± 0.0
3.674LysAsn: 3.674 ± 0.861
3.674LysPro: 3.674 ± 0.763
0.612LysGln: 0.612 ± 0.531
5.511LysArg: 5.511 ± 1.77
2.449LysSer: 2.449 ± 0.863
2.449LysThr: 2.449 ± 0.938
2.449LysVal: 2.449 ± 1.337
0.0LysTrp: 0.0 ± 0.0
1.225LysTyr: 1.225 ± 0.639
0.0LysXaa: 0.0 ± 0.0
Leu
0.612LeuAla: 0.612 ± 0.621
2.449LeuCys: 2.449 ± 1.118
3.674LeuAsp: 3.674 ± 1.101
4.899LeuGlu: 4.899 ± 1.818
2.449LeuPhe: 2.449 ± 1.104
4.899LeuGly: 4.899 ± 1.397
2.449LeuHis: 2.449 ± 1.088
1.837LeuIle: 1.837 ± 1.062
4.287LeuLys: 4.287 ± 1.356
6.736LeuLeu: 6.736 ± 1.74
1.837LeuMet: 1.837 ± 1.647
4.899LeuAsn: 4.899 ± 1.162
4.899LeuPro: 4.899 ± 1.827
1.837LeuGln: 1.837 ± 1.174
5.511LeuArg: 5.511 ± 1.896
7.961LeuSer: 7.961 ± 3.186
4.899LeuThr: 4.899 ± 0.697
4.287LeuVal: 4.287 ± 1.56
0.612LeuTrp: 0.612 ± 0.517
3.062LeuTyr: 3.062 ± 1.65
0.0LeuXaa: 0.0 ± 0.0
Met
1.837MetAla: 1.837 ± 1.018
0.612MetCys: 0.612 ± 0.517
1.837MetAsp: 1.837 ± 1.031
1.225MetGlu: 1.225 ± 0.858
1.837MetPhe: 1.837 ± 1.479
2.449MetGly: 2.449 ± 0.811
1.225MetHis: 1.225 ± 0.64
0.612MetIle: 0.612 ± 0.568
0.612MetLys: 0.612 ± 0.531
1.225MetLeu: 1.225 ± 0.846
0.612MetMet: 0.612 ± 0.531
1.837MetAsn: 1.837 ± 0.885
1.837MetPro: 1.837 ± 0.874
0.612MetGln: 0.612 ± 0.602
1.837MetArg: 1.837 ± 0.683
3.674MetSer: 3.674 ± 1.326
1.225MetThr: 1.225 ± 0.797
0.0MetVal: 0.0 ± 0.0
1.225MetTrp: 1.225 ± 0.788
1.225MetTyr: 1.225 ± 0.65
0.0MetXaa: 0.0 ± 0.0
Asn
3.062AsnAla: 3.062 ± 0.678
0.612AsnCys: 0.612 ± 0.572
3.062AsnAsp: 3.062 ± 1.185
2.449AsnGlu: 2.449 ± 0.473
1.837AsnPhe: 1.837 ± 1.204
3.062AsnGly: 3.062 ± 1.531
2.449AsnHis: 2.449 ± 1.641
5.511AsnIle: 5.511 ± 0.978
0.612AsnLys: 0.612 ± 0.531
3.674AsnLeu: 3.674 ± 0.69
1.225AsnMet: 1.225 ± 0.995
4.899AsnAsn: 4.899 ± 1.382
3.062AsnPro: 3.062 ± 0.809
2.449AsnGln: 2.449 ± 0.941
2.449AsnArg: 2.449 ± 1.337
3.062AsnSer: 3.062 ± 0.761
2.449AsnThr: 2.449 ± 0.935
7.348AsnVal: 7.348 ± 1.688
0.0AsnTrp: 0.0 ± 0.0
3.062AsnTyr: 3.062 ± 0.678
0.0AsnXaa: 0.0 ± 0.0
Pro
1.837ProAla: 1.837 ± 0.901
0.612ProCys: 0.612 ± 0.507
1.225ProAsp: 1.225 ± 0.788
0.612ProGlu: 0.612 ± 0.531
3.674ProPhe: 3.674 ± 1.582
3.062ProGly: 3.062 ± 1.078
2.449ProHis: 2.449 ± 1.626
3.674ProIle: 3.674 ± 1.202
3.062ProLys: 3.062 ± 0.978
3.674ProLeu: 3.674 ± 1.112
3.674ProMet: 3.674 ± 1.49
0.612ProAsn: 0.612 ± 0.71
1.225ProPro: 1.225 ± 0.528
0.612ProGln: 0.612 ± 0.71
6.124ProArg: 6.124 ± 0.853
8.573ProSer: 8.573 ± 2.277
3.062ProThr: 3.062 ± 1.179
1.225ProVal: 1.225 ± 0.735
0.612ProTrp: 0.612 ± 0.531
1.837ProTyr: 1.837 ± 1.11
0.0ProXaa: 0.0 ± 0.0
Gln
1.225GlnAla: 1.225 ± 0.707
0.0GlnCys: 0.0 ± 0.0
3.062GlnAsp: 3.062 ± 1.334
2.449GlnGlu: 2.449 ± 0.938
3.062GlnPhe: 3.062 ± 0.871
2.449GlnGly: 2.449 ± 1.626
1.837GlnHis: 1.837 ± 0.963
2.449GlnIle: 2.449 ± 1.056
0.0GlnLys: 0.0 ± 0.0
2.449GlnLeu: 2.449 ± 1.005
0.612GlnMet: 0.612 ± 0.517
1.837GlnAsn: 1.837 ± 1.156
1.837GlnPro: 1.837 ± 0.683
0.0GlnGln: 0.0 ± 0.0
3.674GlnArg: 3.674 ± 1.086
3.062GlnSer: 3.062 ± 1.491
1.837GlnThr: 1.837 ± 0.924
3.062GlnVal: 3.062 ± 1.091
0.612GlnTrp: 0.612 ± 0.71
1.837GlnTyr: 1.837 ± 0.628
0.0GlnXaa: 0.0 ± 0.0
Arg
2.449ArgAla: 2.449 ± 0.781
3.062ArgCys: 3.062 ± 1.231
3.674ArgAsp: 3.674 ± 1.57
2.449ArgGlu: 2.449 ± 1.128
3.674ArgPhe: 3.674 ± 1.343
3.062ArgGly: 3.062 ± 1.169
2.449ArgHis: 2.449 ± 0.798
5.511ArgIle: 5.511 ± 1.899
3.062ArgLys: 3.062 ± 1.408
5.511ArgLeu: 5.511 ± 2.017
0.612ArgMet: 0.612 ± 0.531
4.287ArgAsn: 4.287 ± 0.641
4.899ArgPro: 4.899 ± 1.417
3.062ArgGln: 3.062 ± 1.218
7.348ArgArg: 7.348 ± 2.682
9.186ArgSer: 9.186 ± 1.654
6.124ArgThr: 6.124 ± 1.656
6.736ArgVal: 6.736 ± 1.122
0.0ArgTrp: 0.0 ± 0.0
3.062ArgTyr: 3.062 ± 1.711
0.0ArgXaa: 0.0 ± 0.0
Ser
6.124SerAla: 6.124 ± 1.335
1.225SerCys: 1.225 ± 0.883
4.287SerAsp: 4.287 ± 1.269
3.062SerGlu: 3.062 ± 1.064
4.899SerPhe: 4.899 ± 2.244
3.674SerGly: 3.674 ± 1.527
1.225SerHis: 1.225 ± 0.735
3.674SerIle: 3.674 ± 1.14
4.287SerLys: 4.287 ± 1.948
4.287SerLeu: 4.287 ± 1.199
4.287SerMet: 4.287 ± 1.76
6.124SerAsn: 6.124 ± 1.262
2.449SerPro: 2.449 ± 1.385
2.449SerGln: 2.449 ± 0.903
5.511SerArg: 5.511 ± 1.722
11.023SerSer: 11.023 ± 3.567
11.023SerThr: 11.023 ± 2.004
4.899SerVal: 4.899 ± 1.27
0.612SerTrp: 0.612 ± 0.517
4.899SerTyr: 4.899 ± 1.822
0.0SerXaa: 0.0 ± 0.0
Thr
3.062ThrAla: 3.062 ± 1.17
0.612ThrCys: 0.612 ± 0.572
3.674ThrAsp: 3.674 ± 1.447
2.449ThrGlu: 2.449 ± 1.147
3.674ThrPhe: 3.674 ± 1.543
3.062ThrGly: 3.062 ± 1.133
3.062ThrHis: 3.062 ± 1.003
3.674ThrIle: 3.674 ± 1.335
3.062ThrLys: 3.062 ± 0.94
5.511ThrLeu: 5.511 ± 1.294
1.837ThrMet: 1.837 ± 0.803
2.449ThrAsn: 2.449 ± 1.337
6.124ThrPro: 6.124 ± 2.439
3.062ThrGln: 3.062 ± 1.509
2.449ThrArg: 2.449 ± 0.473
5.511ThrSer: 5.511 ± 1.373
2.449ThrThr: 2.449 ± 1.134
3.674ThrVal: 3.674 ± 0.825
0.612ThrTrp: 0.612 ± 0.531
1.225ThrTyr: 1.225 ± 0.528
0.0ThrXaa: 0.0 ± 0.0
Val
1.225ValAla: 1.225 ± 1.035
0.0ValCys: 0.0 ± 0.0
7.348ValAsp: 7.348 ± 2.46
4.287ValGlu: 4.287 ± 1.119
5.511ValPhe: 5.511 ± 2.285
1.225ValGly: 1.225 ± 0.735
1.225ValHis: 1.225 ± 0.64
3.674ValIle: 3.674 ± 1.069
2.449ValLys: 2.449 ± 0.76
4.287ValLeu: 4.287 ± 1.865
1.225ValMet: 1.225 ± 0.639
5.511ValAsn: 5.511 ± 2.477
3.674ValPro: 3.674 ± 0.947
4.899ValGln: 4.899 ± 1.332
6.124ValArg: 6.124 ± 1.835
4.287ValSer: 4.287 ± 2.17
1.837ValThr: 1.837 ± 1.061
4.899ValVal: 4.899 ± 1.003
0.0ValTrp: 0.0 ± 0.0
3.062ValTyr: 3.062 ± 1.548
0.0ValXaa: 0.0 ± 0.0
Trp
3.674TrpAla: 3.674 ± 1.266
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.225TrpGlu: 1.225 ± 0.812
0.0TrpPhe: 0.0 ± 0.0
0.612TrpGly: 0.612 ± 0.517
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.612TrpLys: 0.612 ± 0.71
0.612TrpLeu: 0.612 ± 0.507
0.612TrpMet: 0.612 ± 0.507
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.612TrpGln: 0.612 ± 0.517
1.837TrpArg: 1.837 ± 0.813
0.612TrpSer: 0.612 ± 0.531
0.612TrpThr: 0.612 ± 0.507
0.612TrpVal: 0.612 ± 0.621
0.0TrpTrp: 0.0 ± 0.0
0.612TrpTyr: 0.612 ± 0.517
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.449TyrAla: 2.449 ± 1.169
0.0TyrCys: 0.0 ± 0.0
2.449TyrAsp: 2.449 ± 1.192
1.837TyrGlu: 1.837 ± 1.11
2.449TyrPhe: 2.449 ± 1.115
1.837TyrGly: 1.837 ± 0.825
1.225TyrHis: 1.225 ± 0.797
2.449TyrIle: 2.449 ± 0.473
1.837TyrLys: 1.837 ± 1.552
2.449TyrLeu: 2.449 ± 1.212
1.837TyrMet: 1.837 ± 0.801
1.225TyrAsn: 1.225 ± 0.639
1.837TyrPro: 1.837 ± 0.76
0.612TyrGln: 0.612 ± 0.507
4.287TyrArg: 4.287 ± 1.771
1.225TyrSer: 1.225 ± 0.84
1.225TyrThr: 1.225 ± 0.528
3.062TyrVal: 3.062 ± 1.022
0.0TyrTrp: 0.0 ± 0.0
1.225TyrTyr: 1.225 ± 0.751
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1634 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski