Amino acid dipeptide frequency for Hedyotis uncinella yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
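The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own procedure: it assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries), that frequencies are normalized by the total number of pairs, and that any non-standard letter is pooled into X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard residues (one-letter codes); anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED_PER_MILLE:
        return "over"
    if freq < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


toy = ["MSSL", "SSPR"]  # hypothetical sequences, not taken from this proteome
freqs = dipeptide_per_mille(toy)
print(freqs["SS"], classify(freqs["SS"]))
```

With this labelling, a table value such as 4.04‰ would fall in the "over" group and 0.673‰ in the "under" group.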

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
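As a small worked example of the unit conversion, the snippet below turns a per-mille value into a fraction and compares it with the uniform expectation of 1/400. The helper name is hypothetical; the value 4.04 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(4.04)  # AlaAla entry from the table
uniform = 1 / 400                      # 0.25 % = 2.5 per mille
ratio = ala_ala / uniform              # how many times the uniform expectation
print(f"{ala_ala:.5f} {ratio:.2f}")
```

Here AlaAla corresponds to a fraction of about 0.004, roughly 1.6 times the uniform expectation.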

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of each dipeptide, columns the second; each cell is frequency ± error, in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.04 ± 1.148 | 0.673 ± 0.54 | 0.673 ± 0.54 | 2.02 ± 1.053 | 1.347 ± 0.881 | 0.673 ± 0.604 | 2.02 ± 0.65 | 1.347 ± 0.684 | 3.367 ± 1.444 | 5.387 ± 1.597 | 0.673 ± 0.54 | 1.347 ± 0.72 | 2.694 ± 0.864 | 3.367 ± 1.394 | 6.734 ± 1.947 | 1.347 ± 1.079 | 4.04 ± 2.194 | 0.0 ± 0.0 | 1.347 ± 0.962 | 0.673 ± 0.481 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 1.347 ± 1.311 | 0.0 ± 0.0 | 0.673 ± 0.54 | 1.347 ± 0.926 | 2.02 ± 0.963 | 0.673 ± 0.655 | 0.673 ± 0.655 | 0.673 ± 0.54 | 0.673 ± 0.726 | 2.694 ± 1.609 | 1.347 ± 0.848 | 1.347 ± 1.311 | 0.673 ± 0.656 | 1.347 ± 0.762 | 2.694 ± 1.546 | 2.694 ± 1.462 | 1.347 ± 1.079 | 0.673 ± 0.481 | 0.673 ± 0.54 | 0.0 ± 0.0
Asp | 1.347 ± 0.962 | 0.673 ± 0.726 | 0.673 ± 0.481 | 2.02 ± 0.764 | 1.347 ± 0.572 | 1.347 ± 0.962 | 0.673 ± 0.604 | 5.387 ± 1.121 | 0.673 ± 0.481 | 7.407 ± 2.242 | 0.673 ± 0.54 | 2.02 ± 1.087 | 2.02 ± 1.251 | 1.347 ± 0.684 | 3.367 ± 0.965 | 4.714 ± 0.986 | 4.04 ± 1.262 | 5.387 ± 1.427 | 0.673 ± 0.481 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 3.367 ± 1.222 | 0.673 ± 0.481 | 2.694 ± 1.397 | 2.694 ± 1.578 | 2.694 ± 1.39 | 4.04 ± 1.252 | 0.673 ± 0.604 | 1.347 ± 0.876 | 1.347 ± 0.962 | 4.714 ± 1.272 | 0.673 ± 0.675 | 4.714 ± 1.617 | 3.367 ± 1.344 | 2.02 ± 1.214 | 0.673 ± 0.718 | 1.347 ± 1.01 | 2.694 ± 1.166 | 1.347 ± 0.749 | 1.347 ± 0.848 | 0.673 ± 0.481 | 0.0 ± 0.0
Phe | 0.0 ± 0.0 | 0.673 ± 0.54 | 3.367 ± 1.27 | 2.02 ± 0.721 | 2.02 ± 0.795 | 1.347 ± 1.079 | 1.347 ± 0.962 | 1.347 ± 0.657 | 3.367 ± 1.569 | 6.061 ± 2.134 | 1.347 ± 0.572 | 2.694 ± 1.561 | 0.673 ± 0.656 | 3.367 ± 1.499 | 1.347 ± 0.931 | 2.02 ± 0.84 | 4.04 ± 1.364 | 2.694 ± 0.894 | 0.673 ± 0.675 | 0.673 ± 0.54 | 0.0 ± 0.0
Gly | 4.04 ± 1.686 | 1.347 ± 0.842 | 2.02 ± 0.981 | 3.367 ± 1.507 | 0.673 ± 0.726 | 2.02 ± 0.908 | 0.673 ± 0.481 | 2.02 ± 0.945 | 4.714 ± 1.671 | 4.04 ± 1.322 | 0.673 ± 0.492 | 0.673 ± 0.54 | 2.694 ± 1.079 | 2.694 ± 1.125 | 1.347 ± 0.805 | 2.694 ± 1.584 | 4.04 ± 1.246 | 4.714 ± 1.964 | 0.673 ± 0.675 | 1.347 ± 0.921 | 0.0 ± 0.0
His | 0.673 ± 0.54 | 2.02 ± 1.632 | 2.02 ± 1.052 | 1.347 ± 0.848 | 2.02 ± 0.981 | 2.02 ± 1.092 | 1.347 ± 0.893 | 2.694 ± 1.279 | 1.347 ± 0.921 | 2.694 ± 1.064 | 0.0 ± 0.0 | 3.367 ± 1.106 | 2.02 ± 0.948 | 2.694 ± 1.165 | 2.694 ± 1.407 | 0.673 ± 0.655 | 4.714 ± 1.978 | 3.367 ± 0.918 | 0.0 ± 0.0 | 0.673 ± 0.481 | 0.0 ± 0.0
Ile | 1.347 ± 0.842 | 2.02 ± 0.795 | 4.04 ± 1.491 | 0.673 ± 0.481 | 3.367 ± 1.177 | 2.694 ± 1.182 | 3.367 ± 2.671 | 4.04 ± 3.029 | 6.734 ± 1.901 | 6.734 ± 4.679 | 2.02 ± 0.923 | 2.694 ± 0.957 | 4.714 ± 1.704 | 3.367 ± 1.288 | 7.407 ± 1.768 | 7.407 ± 3.09 | 4.04 ± 1.189 | 2.694 ± 0.985 | 1.347 ± 1.435 | 1.347 ± 1.079 | 0.0 ± 0.0
Lys | 2.694 ± 1.07 | 0.673 ± 0.481 | 1.347 ± 0.962 | 3.367 ± 1.222 | 3.367 ± 1.429 | 1.347 ± 0.72 | 1.347 ± 0.72 | 2.694 ± 0.875 | 3.367 ± 1.59 | 2.02 ± 0.957 | 0.673 ± 0.609 | 5.387 ± 1.514 | 2.694 ± 0.93 | 0.0 ± 0.0 | 1.347 ± 1.079 | 6.061 ± 1.501 | 2.02 ± 0.908 | 5.387 ± 1.757 | 0.0 ± 0.0 | 4.04 ± 1.189 | 0.0 ± 0.0
Leu | 4.714 ± 1.453 | 2.02 ± 1.123 | 6.734 ± 1.984 | 2.02 ± 1.053 | 1.347 ± 1.311 | 4.714 ± 2.199 | 4.04 ± 1.588 | 6.061 ± 3.019 | 3.367 ± 0.996 | 9.428 ± 4.868 | 2.02 ± 1.068 | 5.387 ± 1.091 | 2.694 ± 1.684 | 1.347 ± 0.749 | 6.734 ± 2.665 | 5.387 ± 3.122 | 6.734 ± 1.645 | 5.387 ± 2.048 | 1.347 ± 1.35 | 4.04 ± 2.048 | 0.0 ± 0.0
Met | 1.347 ± 1.079 | 0.673 ± 0.724 | 2.02 ± 1.222 | 0.0 ± 0.0 | 1.347 ± 0.81 | 4.714 ± 1.483 | 0.673 ± 0.724 | 2.694 ± 1.781 | 1.347 ± 0.845 | 1.347 ± 0.991 | 1.347 ± 1.396 | 0.0 ± 0.0 | 0.673 ± 0.481 | 0.673 ± 0.726 | 1.347 ± 1.35 | 2.694 ± 1.429 | 0.673 ± 0.655 | 0.673 ± 0.54 | 2.02 ± 0.65 | 2.02 ± 1.619 | 0.0 ± 0.0
Asn | 2.694 ± 0.853 | 0.673 ± 0.655 | 2.694 ± 0.804 | 2.02 ± 0.65 | 1.347 ± 0.805 | 2.02 ± 1.052 | 4.04 ± 2.151 | 2.694 ± 0.751 | 1.347 ± 0.72 | 5.387 ± 1.877 | 2.02 ± 1.116 | 3.367 ± 0.945 | 4.04 ± 1.405 | 2.02 ± 1.503 | 4.04 ± 1.191 | 6.061 ± 1.994 | 4.04 ± 0.92 | 3.367 ± 1.394 | 0.0 ± 0.0 | 1.347 ± 0.962 | 0.0 ± 0.0
Pro | 2.694 ± 0.962 | 2.694 ± 1.126 | 2.02 ± 1.03 | 2.694 ± 1.217 | 2.02 ± 0.764 | 2.694 ± 0.891 | 5.387 ± 1.564 | 7.407 ± 2.649 | 4.04 ± 2.069 | 3.367 ± 1.155 | 2.02 ± 1.34 | 2.02 ± 1.034 | 4.04 ± 2.247 | 2.02 ± 1.018 | 2.02 ± 1.048 | 4.714 ± 1.491 | 3.367 ± 1.117 | 2.02 ± 1.214 | 0.673 ± 0.481 | 1.347 ± 0.805 | 0.0 ± 0.0
Gln | 2.02 ± 0.997 | 1.347 ± 1.435 | 0.673 ± 0.726 | 2.02 ± 0.789 | 2.694 ± 1.449 | 2.02 ± 1.015 | 2.02 ± 1.089 | 2.694 ± 1.398 | 1.347 ± 1.01 | 2.694 ± 1.23 | 0.0 ± 0.0 | 5.387 ± 1.858 | 1.347 ± 1.451 | 3.367 ± 1.41 | 2.694 ± 0.991 | 4.04 ± 1.543 | 1.347 ± 0.932 | 3.367 ± 1.084 | 0.673 ± 0.604 | 0.673 ± 0.54 | 0.0 ± 0.0
Arg | 2.02 ± 0.982 | 2.02 ± 0.945 | 2.694 ± 0.819 | 4.04 ± 1.674 | 5.387 ± 1.804 | 3.367 ± 1.141 | 4.04 ± 1.66 | 7.407 ± 1.544 | 2.02 ± 1.222 | 5.387 ± 2.398 | 2.02 ± 1.199 | 0.673 ± 0.675 | 4.04 ± 1.51 | 0.0 ± 0.0 | 9.428 ± 4.244 | 6.734 ± 1.715 | 5.387 ± 2.196 | 4.714 ± 2.078 | 0.673 ± 0.675 | 2.02 ± 0.946 | 0.0 ± 0.0
Ser | 3.367 ± 1.847 | 0.673 ± 0.724 | 4.04 ± 0.902 | 3.367 ± 1.037 | 2.694 ± 0.897 | 0.673 ± 0.481 | 2.02 ± 0.995 | 6.061 ± 2.354 | 5.387 ± 1.438 | 5.387 ± 2.208 | 2.694 ± 1.685 | 5.387 ± 1.364 | 9.428 ± 2.564 | 2.694 ± 1.01 | 6.061 ± 0.776 | 12.121 ± 3.818 | 6.061 ± 3.281 | 4.04 ± 1.387 | 0.0 ± 0.0 | 2.02 ± 0.984 | 0.0 ± 0.0
Thr | 2.02 ± 1.222 | 1.347 ± 0.987 | 0.673 ± 0.724 | 3.367 ± 0.929 | 1.347 ± 0.931 | 4.714 ± 1.126 | 3.367 ± 1.623 | 4.714 ± 1.739 | 2.694 ± 0.914 | 4.04 ± 0.92 | 1.347 ± 0.72 | 6.061 ± 1.445 | 7.407 ± 2.006 | 2.694 ± 1.169 | 6.061 ± 1.93 | 4.714 ± 1.952 | 1.347 ± 0.893 | 4.04 ± 1.988 | 0.673 ± 0.718 | 2.02 ± 1.01 | 0.0 ± 0.0
Val | 1.347 ± 0.805 | 0.673 ± 0.54 | 4.04 ± 2.114 | 4.04 ± 2.234 | 2.694 ± 1.24 | 1.347 ± 0.805 | 0.673 ± 0.656 | 6.061 ± 1.748 | 2.02 ± 1.002 | 4.714 ± 1.504 | 1.347 ± 0.572 | 1.347 ± 0.931 | 2.694 ± 0.907 | 6.061 ± 1.618 | 4.714 ± 2.456 | 7.407 ± 1.602 | 2.02 ± 1.23 | 0.673 ± 0.54 | 0.673 ± 0.54 | 4.04 ± 1.128 | 0.0 ± 0.0
Trp | 1.347 ± 0.962 | 0.0 ± 0.0 | 0.673 ± 0.656 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.673 ± 0.481 | 0.0 ± 0.0 | 2.02 ± 0.993 | 0.0 ± 0.0 | 0.673 ± 0.675 | 2.02 ± 1.348 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.673 ± 0.481 | 2.02 ± 0.897 | 0.0 ± 0.0 | 1.347 ± 1.435 | 1.347 ± 0.792 | 0.0 ± 0.0 | 0.673 ± 0.481 | 0.0 ± 0.0
Tyr | 2.694 ± 0.979 | 1.347 ± 1.01 | 2.694 ± 1.278 | 1.347 ± 0.805 | 2.02 ± 0.764 | 2.02 ± 0.65 | 0.0 ± 0.0 | 2.694 ± 1.134 | 0.673 ± 0.481 | 3.367 ± 1.499 | 1.347 ± 0.762 | 1.347 ± 0.572 | 1.347 ± 0.749 | 1.347 ± 0.805 | 2.694 ± 1.505 | 1.347 ± 0.962 | 0.0 ± 0.0 | 2.02 ± 0.65 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 9 proteins (1486 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
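A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates (the page does not say which spread measure is used). The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide frequency under resampling of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(replicates)


toy = ["MSSLSS", "SSPRAA", "MKKLLV"]  # hypothetical sequences
err = bootstrap_error(toy, "SS")
print(round(err, 3))
```

Under this scheme a dipeptide that occurs in none of the proteins has frequency 0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.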

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski