Amino acid dipepetide frequency for Sugarcane yellow leaf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.91AlaAla: 3.91 ± 1.769
1.303AlaCys: 1.303 ± 0.786
4.778AlaAsp: 4.778 ± 0.912
3.91AlaGlu: 3.91 ± 1.806
1.303AlaPhe: 1.303 ± 0.56
2.606AlaGly: 2.606 ± 0.9
1.738AlaHis: 1.738 ± 0.635
2.172AlaIle: 2.172 ± 0.857
4.344AlaLys: 4.344 ± 1.003
4.778AlaLeu: 4.778 ± 0.613
1.738AlaMet: 1.738 ± 0.533
3.475AlaAsn: 3.475 ± 1.624
9.123AlaPro: 9.123 ± 2.898
3.041AlaGln: 3.041 ± 1.854
4.344AlaArg: 4.344 ± 1.689
5.647AlaSer: 5.647 ± 1.124
5.213AlaThr: 5.213 ± 1.12
3.91AlaVal: 3.91 ± 0.611
0.434AlaTrp: 0.434 ± 0.367
3.475AlaTyr: 3.475 ± 1.282
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.869CysAsp: 0.869 ± 0.42
0.869CysGlu: 0.869 ± 0.574
0.434CysPhe: 0.434 ± 0.478
1.303CysGly: 1.303 ± 0.991
0.0CysHis: 0.0 ± 0.0
0.869CysIle: 0.869 ± 0.373
0.869CysLys: 0.869 ± 0.475
1.738CysLeu: 1.738 ± 0.766
0.0CysMet: 0.0 ± 0.0
0.869CysAsn: 0.869 ± 0.661
0.0CysPro: 0.0 ± 0.0
0.434CysGln: 0.434 ± 0.367
1.303CysArg: 1.303 ± 0.696
3.041CysSer: 3.041 ± 1.076
0.869CysThr: 0.869 ± 0.42
0.434CysVal: 0.434 ± 0.367
0.434CysTrp: 0.434 ± 0.33
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.041AspAla: 3.041 ± 0.562
0.869AspCys: 0.869 ± 0.373
4.344AspAsp: 4.344 ± 1.277
2.606AspGlu: 2.606 ± 0.606
3.475AspPhe: 3.475 ± 0.964
1.738AspGly: 1.738 ± 0.843
1.738AspHis: 1.738 ± 0.879
1.738AspIle: 1.738 ± 0.607
3.475AspLys: 3.475 ± 1.657
2.172AspLeu: 2.172 ± 0.796
0.869AspMet: 0.869 ± 0.42
1.738AspAsn: 1.738 ± 1.244
2.172AspPro: 2.172 ± 1.005
2.606AspGln: 2.606 ± 0.408
3.041AspArg: 3.041 ± 1.5
4.344AspSer: 4.344 ± 0.679
2.606AspThr: 2.606 ± 1.13
3.041AspVal: 3.041 ± 0.938
1.303AspTrp: 1.303 ± 0.991
0.434AspTyr: 0.434 ± 0.367
0.0AspXaa: 0.0 ± 0.0
Glu
3.91GluAla: 3.91 ± 0.822
0.434GluCys: 0.434 ± 0.522
2.606GluAsp: 2.606 ± 1.166
5.213GluGlu: 5.213 ± 1.493
2.172GluPhe: 2.172 ± 0.681
2.606GluGly: 2.606 ± 1.091
2.606GluHis: 2.606 ± 1.209
0.869GluIle: 0.869 ± 0.373
3.91GluLys: 3.91 ± 1.286
3.91GluLeu: 3.91 ± 1.267
1.303GluMet: 1.303 ± 0.602
0.869GluAsn: 0.869 ± 0.422
2.606GluPro: 2.606 ± 0.447
3.91GluGln: 3.91 ± 1.199
3.041GluArg: 3.041 ± 1.41
5.213GluSer: 5.213 ± 0.87
2.172GluThr: 2.172 ± 0.889
5.647GluVal: 5.647 ± 1.695
1.738GluTrp: 1.738 ± 0.533
2.172GluTyr: 2.172 ± 0.825
0.0GluXaa: 0.0 ± 0.0
Phe
1.738PheAla: 1.738 ± 1.014
0.869PheCys: 0.869 ± 0.373
1.303PheAsp: 1.303 ± 0.455
0.869PheGlu: 0.869 ± 0.584
1.303PhePhe: 1.303 ± 0.662
3.475PheGly: 3.475 ± 1.399
1.303PheHis: 1.303 ± 0.599
2.606PheIle: 2.606 ± 0.408
2.606PheLys: 2.606 ± 0.817
2.606PheLeu: 2.606 ± 0.447
0.434PheMet: 0.434 ± 0.522
1.303PheAsn: 1.303 ± 0.701
1.303PhePro: 1.303 ± 1.04
2.606PheGln: 2.606 ± 0.925
2.606PheArg: 2.606 ± 1.775
3.91PheSer: 3.91 ± 1.521
2.606PheThr: 2.606 ± 1.049
3.475PheVal: 3.475 ± 0.706
0.434PheTrp: 0.434 ± 0.522
0.869PheTyr: 0.869 ± 0.734
0.0PheXaa: 0.0 ± 0.0
Gly
4.778GlyAla: 4.778 ± 1.073
0.869GlyCys: 0.869 ± 0.42
2.606GlyAsp: 2.606 ± 0.848
2.606GlyGlu: 2.606 ± 0.881
3.91GlyPhe: 3.91 ± 1.235
6.082GlyGly: 6.082 ± 1.76
1.303GlyHis: 1.303 ± 0.686
2.172GlyIle: 2.172 ± 0.692
4.778GlyLys: 4.778 ± 0.864
5.647GlyLeu: 5.647 ± 1.035
0.869GlyMet: 0.869 ± 0.543
3.475GlyAsn: 3.475 ± 1.092
4.344GlyPro: 4.344 ± 1.341
0.869GlyGln: 0.869 ± 0.729
2.172GlyArg: 2.172 ± 1.464
5.213GlySer: 5.213 ± 2.358
2.172GlyThr: 2.172 ± 0.806
2.172GlyVal: 2.172 ± 0.532
2.172GlyTrp: 2.172 ± 0.561
3.475GlyTyr: 3.475 ± 1.229
0.0GlyXaa: 0.0 ± 0.0
His
1.303HisAla: 1.303 ± 0.874
1.303HisCys: 1.303 ± 0.602
1.303HisAsp: 1.303 ± 0.988
1.738HisGlu: 1.738 ± 1.042
1.303HisPhe: 1.303 ± 0.309
0.869HisGly: 0.869 ± 0.42
0.0HisHis: 0.0 ± 0.0
1.738HisIle: 1.738 ± 1.912
1.303HisLys: 1.303 ± 0.602
2.606HisLeu: 2.606 ± 0.933
0.434HisMet: 0.434 ± 0.367
1.303HisAsn: 1.303 ± 0.702
3.041HisPro: 3.041 ± 1.093
0.0HisGln: 0.0 ± 0.0
1.303HisArg: 1.303 ± 0.602
0.869HisSer: 0.869 ± 0.475
0.869HisThr: 0.869 ± 0.373
1.303HisVal: 1.303 ± 0.48
0.0HisTrp: 0.0 ± 0.0
0.434HisTyr: 0.434 ± 0.478
0.0HisXaa: 0.0 ± 0.0
Ile
2.172IleAla: 2.172 ± 0.796
0.869IleCys: 0.869 ± 0.373
1.303IleAsp: 1.303 ± 0.309
1.738IleGlu: 1.738 ± 0.599
1.303IlePhe: 1.303 ± 0.455
1.303IleGly: 1.303 ± 0.988
2.606IleHis: 2.606 ± 1.049
1.738IleIle: 1.738 ± 0.607
2.172IleLys: 2.172 ± 1.005
3.91IleLeu: 3.91 ± 1.497
1.303IleMet: 1.303 ± 0.403
1.303IleAsn: 1.303 ± 0.786
2.606IlePro: 2.606 ± 0.817
1.738IleGln: 1.738 ± 0.739
1.738IleArg: 1.738 ± 0.865
2.606IleSer: 2.606 ± 0.908
2.606IleThr: 2.606 ± 1.146
3.475IleVal: 3.475 ± 1.025
0.869IleTrp: 0.869 ± 0.373
1.303IleTyr: 1.303 ± 0.309
0.0IleXaa: 0.0 ± 0.0
Lys
3.475LysAla: 3.475 ± 1.092
0.869LysCys: 0.869 ± 0.661
2.172LysAsp: 2.172 ± 0.973
2.606LysGlu: 2.606 ± 0.885
1.303LysPhe: 1.303 ± 0.696
3.91LysGly: 3.91 ± 1.437
0.0LysHis: 0.0 ± 0.0
4.344LysIle: 4.344 ± 0.861
0.869LysLys: 0.869 ± 0.591
3.91LysLeu: 3.91 ± 1.439
2.606LysMet: 2.606 ± 0.652
1.738LysAsn: 1.738 ± 0.533
5.213LysPro: 5.213 ± 1.172
2.172LysGln: 2.172 ± 0.731
4.344LysArg: 4.344 ± 1.486
6.082LysSer: 6.082 ± 1.823
3.91LysThr: 3.91 ± 1.134
3.91LysVal: 3.91 ± 0.865
2.606LysTrp: 2.606 ± 0.603
1.303LysTyr: 1.303 ± 0.727
0.0LysXaa: 0.0 ± 0.0
Leu
3.475LeuAla: 3.475 ± 0.441
2.172LeuCys: 2.172 ± 0.889
3.041LeuAsp: 3.041 ± 0.948
8.254LeuGlu: 8.254 ± 2.625
4.778LeuPhe: 4.778 ± 0.5
3.91LeuGly: 3.91 ± 1.048
1.303LeuHis: 1.303 ± 0.991
3.475LeuIle: 3.475 ± 1.221
5.213LeuLys: 5.213 ± 1.542
8.254LeuLeu: 8.254 ± 2.769
2.172LeuMet: 2.172 ± 1.365
3.041LeuAsn: 3.041 ± 0.789
5.213LeuPro: 5.213 ± 0.753
4.344LeuGln: 4.344 ± 1.47
5.213LeuArg: 5.213 ± 1.486
6.516LeuSer: 6.516 ± 1.146
6.082LeuThr: 6.082 ± 1.723
5.213LeuVal: 5.213 ± 1.528
2.172LeuTrp: 2.172 ± 0.438
3.041LeuTyr: 3.041 ± 0.867
0.0LeuXaa: 0.0 ± 0.0
Met
2.606MetAla: 2.606 ± 0.515
0.0MetCys: 0.0 ± 0.0
0.434MetAsp: 0.434 ± 0.367
0.869MetGlu: 0.869 ± 0.56
0.869MetPhe: 0.869 ± 0.484
0.869MetGly: 0.869 ± 0.373
0.434MetHis: 0.434 ± 0.367
0.434MetIle: 0.434 ± 0.367
0.434MetLys: 0.434 ± 0.392
1.738MetLeu: 1.738 ± 0.637
0.434MetMet: 0.434 ± 0.367
0.434MetAsn: 0.434 ± 0.522
0.434MetPro: 0.434 ± 0.392
0.0MetGln: 0.0 ± 0.0
0.434MetArg: 0.434 ± 0.367
3.475MetSer: 3.475 ± 0.795
2.172MetThr: 2.172 ± 1.032
2.172MetVal: 2.172 ± 0.894
0.434MetTrp: 0.434 ± 0.367
0.434MetTyr: 0.434 ± 0.392
0.0MetXaa: 0.0 ± 0.0
Asn
5.213AsnAla: 5.213 ± 0.77
0.0AsnCys: 0.0 ± 0.0
0.869AsnAsp: 0.869 ± 0.785
3.041AsnGlu: 3.041 ± 1.019
1.738AsnPhe: 1.738 ± 0.843
4.344AsnGly: 4.344 ± 1.956
0.0AsnHis: 0.0 ± 0.0
1.303AsnIle: 1.303 ± 0.309
2.172AsnLys: 2.172 ± 0.681
3.91AsnLeu: 3.91 ± 1.137
0.0AsnMet: 0.0 ± 0.0
3.475AsnAsn: 3.475 ± 0.768
2.172AsnPro: 2.172 ± 1.106
0.869AsnGln: 0.869 ± 0.422
2.606AsnArg: 2.606 ± 1.418
4.778AsnSer: 4.778 ± 1.074
3.475AsnThr: 3.475 ± 0.54
1.738AsnVal: 1.738 ± 0.789
0.869AsnTrp: 0.869 ± 0.6
1.738AsnTyr: 1.738 ± 0.375
0.434AsnXaa: 0.434 ± 0.33
Pro
7.385ProAla: 7.385 ± 1.777
0.434ProCys: 0.434 ± 0.33
3.475ProAsp: 3.475 ± 1.248
4.344ProGlu: 4.344 ± 0.958
1.303ProPhe: 1.303 ± 0.558
4.778ProGly: 4.778 ± 1.891
1.303ProHis: 1.303 ± 0.602
3.041ProIle: 3.041 ± 0.73
5.647ProLys: 5.647 ± 0.91
3.475ProLeu: 3.475 ± 0.733
0.434ProMet: 0.434 ± 0.367
3.475ProAsn: 3.475 ± 0.706
10.426ProPro: 10.426 ± 3.802
1.738ProGln: 1.738 ± 0.398
3.041ProArg: 3.041 ± 1.05
4.778ProSer: 4.778 ± 1.032
7.385ProThr: 7.385 ± 3.508
5.647ProVal: 5.647 ± 1.596
0.0ProTrp: 0.0 ± 0.0
1.303ProTyr: 1.303 ± 0.455
0.0ProXaa: 0.0 ± 0.0
Gln
5.213GlnAla: 5.213 ± 1.962
0.434GlnCys: 0.434 ± 0.392
3.041GlnAsp: 3.041 ± 1.862
0.869GlnGlu: 0.869 ± 0.661
0.869GlnPhe: 0.869 ± 1.044
3.475GlnGly: 3.475 ± 1.686
0.434GlnHis: 0.434 ± 0.33
0.434GlnIle: 0.434 ± 0.367
2.606GlnLys: 2.606 ± 1.204
4.344GlnLeu: 4.344 ± 0.837
0.434GlnMet: 0.434 ± 0.33
2.606GlnAsn: 2.606 ± 0.893
2.172GlnPro: 2.172 ± 0.653
2.606GlnGln: 2.606 ± 0.781
1.738GlnArg: 1.738 ± 0.831
1.303GlnSer: 1.303 ± 0.618
2.606GlnThr: 2.606 ± 1.12
2.606GlnVal: 2.606 ± 0.781
0.869GlnTrp: 0.869 ± 0.552
1.738GlnTyr: 1.738 ± 1.326
0.0GlnXaa: 0.0 ± 0.0
Arg
5.213ArgAla: 5.213 ± 1.513
0.869ArgCys: 0.869 ± 0.956
3.041ArgAsp: 3.041 ± 0.825
4.778ArgGlu: 4.778 ± 1.601
1.738ArgPhe: 1.738 ± 0.93
4.778ArgGly: 4.778 ± 1.541
1.303ArgHis: 1.303 ± 0.988
1.738ArgIle: 1.738 ± 1.068
2.606ArgLys: 2.606 ± 0.946
8.254ArgLeu: 8.254 ± 2.322
0.0ArgMet: 0.0 ± 0.0
2.172ArgAsn: 2.172 ± 0.989
3.91ArgPro: 3.91 ± 1.092
2.606ArgGln: 2.606 ± 0.408
7.385ArgArg: 7.385 ± 4.365
4.778ArgSer: 4.778 ± 1.939
2.172ArgThr: 2.172 ± 0.51
4.778ArgVal: 4.778 ± 1.268
0.434ArgTrp: 0.434 ± 0.33
0.869ArgTyr: 0.869 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
4.344SerAla: 4.344 ± 1.074
2.172SerCys: 2.172 ± 1.08
4.344SerAsp: 4.344 ± 0.838
2.606SerGlu: 2.606 ± 0.968
4.344SerPhe: 4.344 ± 1.778
7.819SerGly: 7.819 ± 1.031
1.738SerHis: 1.738 ± 0.595
3.475SerIle: 3.475 ± 1.324
5.213SerLys: 5.213 ± 0.979
9.123SerLeu: 9.123 ± 2.126
1.303SerMet: 1.303 ± 0.662
4.344SerAsn: 4.344 ± 0.27
6.95SerPro: 6.95 ± 1.972
3.91SerGln: 3.91 ± 1.022
5.647SerArg: 5.647 ± 1.418
7.819SerSer: 7.819 ± 1.427
6.95SerThr: 6.95 ± 1.559
5.213SerVal: 5.213 ± 1.386
0.869SerTrp: 0.869 ± 0.574
1.303SerTyr: 1.303 ± 0.309
0.0SerXaa: 0.0 ± 0.0
Thr
2.606ThrAla: 2.606 ± 0.617
0.0ThrCys: 0.0 ± 0.0
3.041ThrAsp: 3.041 ± 0.933
2.172ThrGlu: 2.172 ± 0.661
2.172ThrPhe: 2.172 ± 1.032
3.041ThrGly: 3.041 ± 1.604
1.738ThrHis: 1.738 ± 0.758
3.91ThrIle: 3.91 ± 0.906
1.738ThrLys: 1.738 ± 0.533
3.041ThrLeu: 3.041 ± 0.781
2.172ThrMet: 2.172 ± 0.954
3.041ThrAsn: 3.041 ± 0.869
6.95ThrPro: 6.95 ± 2.068
3.91ThrGln: 3.91 ± 1.201
3.475ThrArg: 3.475 ± 0.312
8.688ThrSer: 8.688 ± 2.216
3.041ThrThr: 3.041 ± 1.442
3.041ThrVal: 3.041 ± 0.793
0.434ThrTrp: 0.434 ± 0.33
2.172ThrTyr: 2.172 ± 1.431
0.0ThrXaa: 0.0 ± 0.0
Val
6.082ValAla: 6.082 ± 2.045
0.434ValCys: 0.434 ± 0.522
3.475ValAsp: 3.475 ± 1.354
5.213ValGlu: 5.213 ± 0.998
3.041ValPhe: 3.041 ± 1.23
2.606ValGly: 2.606 ± 0.578
1.738ValHis: 1.738 ± 1.014
1.303ValIle: 1.303 ± 0.991
3.041ValLys: 3.041 ± 0.565
7.819ValLeu: 7.819 ± 0.63
1.303ValMet: 1.303 ± 0.518
2.606ValAsn: 2.606 ± 0.945
4.344ValPro: 4.344 ± 1.491
1.738ValGln: 1.738 ± 0.595
4.344ValArg: 4.344 ± 1.79
4.778ValSer: 4.778 ± 1.03
2.172ValThr: 2.172 ± 0.91
4.778ValVal: 4.778 ± 1.331
0.434ValTrp: 0.434 ± 0.367
3.041ValTyr: 3.041 ± 1.576
0.0ValXaa: 0.0 ± 0.0
Trp
0.434TrpAla: 0.434 ± 0.33
0.434TrpCys: 0.434 ± 0.367
0.434TrpAsp: 0.434 ± 0.392
0.869TrpGlu: 0.869 ± 0.373
0.0TrpPhe: 0.0 ± 0.0
0.869TrpGly: 0.869 ± 0.42
0.0TrpHis: 0.0 ± 0.0
0.434TrpIle: 0.434 ± 0.452
1.738TrpLys: 1.738 ± 1.016
3.041TrpLeu: 3.041 ± 1.246
1.303TrpMet: 1.303 ± 0.662
1.303TrpAsn: 1.303 ± 0.662
0.434TrpPro: 0.434 ± 0.33
0.0TrpGln: 0.0 ± 0.0
2.606TrpArg: 2.606 ± 0.752
2.606TrpSer: 2.606 ± 0.865
0.0TrpThr: 0.0 ± 0.0
0.869TrpVal: 0.869 ± 0.475
0.869TrpTrp: 0.869 ± 0.734
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.344TyrAla: 4.344 ± 0.962
0.0TyrCys: 0.0 ± 0.0
0.869TyrAsp: 0.869 ± 0.699
1.738TyrGlu: 1.738 ± 0.533
0.869TyrPhe: 0.869 ± 0.785
1.303TyrGly: 1.303 ± 0.702
1.738TyrHis: 1.738 ± 0.565
0.869TyrIle: 0.869 ± 0.475
2.606TyrLys: 2.606 ± 1.25
2.172TyrLeu: 2.172 ± 0.952
0.0TyrMet: 0.0 ± 0.0
2.172TyrAsn: 2.172 ± 0.692
0.0TyrPro: 0.0 ± 0.0
1.303TyrGln: 1.303 ± 0.599
3.041TyrArg: 3.041 ± 0.93
3.475TyrSer: 3.475 ± 1.101
1.303TyrThr: 1.303 ± 0.686
0.434TyrVal: 0.434 ± 0.367
0.869TyrTrp: 0.869 ± 0.734
1.303TyrTyr: 1.303 ± 1.434
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.434XaaLeu: 0.434 ± 0.33
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2303 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski