Amino acid dipepetide frequency for Beet western yellows virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.11AlaAla: 5.11 ± 1.739
1.179AlaCys: 1.179 ± 0.537
1.572AlaAsp: 1.572 ± 0.832
4.717AlaGlu: 4.717 ± 1.923
2.358AlaPhe: 2.358 ± 0.863
6.289AlaGly: 6.289 ± 0.944
0.0AlaHis: 0.0 ± 0.0
2.752AlaIle: 2.752 ± 1.098
4.324AlaLys: 4.324 ± 0.363
4.717AlaLeu: 4.717 ± 1.482
1.965AlaMet: 1.965 ± 0.514
3.145AlaAsn: 3.145 ± 2.121
2.358AlaPro: 2.358 ± 1.498
5.503AlaGln: 5.503 ± 1.052
5.11AlaArg: 5.11 ± 2.094
7.862AlaSer: 7.862 ± 2.08
2.752AlaThr: 2.752 ± 0.669
5.11AlaVal: 5.11 ± 0.774
2.358AlaTrp: 2.358 ± 0.746
3.538AlaTyr: 3.538 ± 1.165
0.0AlaXaa: 0.0 ± 0.0
Cys
1.179CysAla: 1.179 ± 1.045
0.786CysCys: 0.786 ± 0.689
0.393CysAsp: 0.393 ± 0.345
1.572CysGlu: 1.572 ± 0.954
0.786CysPhe: 0.786 ± 0.689
1.179CysGly: 1.179 ± 0.76
1.179CysHis: 1.179 ± 0.603
0.393CysIle: 0.393 ± 0.345
1.179CysLys: 1.179 ± 0.961
0.786CysLeu: 0.786 ± 0.359
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.179CysPro: 1.179 ± 0.496
1.179CysGln: 1.179 ± 0.53
2.358CysArg: 2.358 ± 0.488
1.572CysSer: 1.572 ± 0.954
0.0CysThr: 0.0 ± 0.0
1.179CysVal: 1.179 ± 1.045
0.786CysTrp: 0.786 ± 0.549
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.145AspAla: 3.145 ± 0.473
1.965AspCys: 1.965 ± 0.575
3.538AspAsp: 3.538 ± 1.429
3.538AspGlu: 3.538 ± 1.133
2.752AspPhe: 2.752 ± 0.949
2.358AspGly: 2.358 ± 0.768
0.393AspHis: 0.393 ± 0.42
0.393AspIle: 0.393 ± 0.32
1.179AspLys: 1.179 ± 1.004
3.538AspLeu: 3.538 ± 0.696
1.179AspMet: 1.179 ± 0.603
2.752AspAsn: 2.752 ± 1.064
4.717AspPro: 4.717 ± 1.4
1.965AspGln: 1.965 ± 1.008
1.965AspArg: 1.965 ± 0.901
0.393AspSer: 0.393 ± 0.511
0.0AspThr: 0.0 ± 0.0
2.358AspVal: 2.358 ± 0.906
1.965AspTrp: 1.965 ± 1.008
0.786AspTyr: 0.786 ± 0.64
0.0AspXaa: 0.0 ± 0.0
Glu
4.324GluAla: 4.324 ± 1.217
0.393GluCys: 0.393 ± 0.345
3.931GluAsp: 3.931 ± 1.246
5.503GluGlu: 5.503 ± 1.467
3.538GluPhe: 3.538 ± 0.826
3.931GluGly: 3.931 ± 1.234
0.0GluHis: 0.0 ± 0.0
3.931GluIle: 3.931 ± 1.464
2.358GluLys: 2.358 ± 0.486
4.717GluLeu: 4.717 ± 1.552
0.393GluMet: 0.393 ± 0.32
1.965GluAsn: 1.965 ± 0.776
1.179GluPro: 1.179 ± 0.62
3.931GluGln: 3.931 ± 1.115
5.11GluArg: 5.11 ± 1.6
4.324GluSer: 4.324 ± 1.735
4.717GluThr: 4.717 ± 0.928
4.324GluVal: 4.324 ± 1.334
0.786GluTrp: 0.786 ± 0.359
1.572GluTyr: 1.572 ± 0.552
0.0GluXaa: 0.0 ± 0.0
Phe
0.786PheAla: 0.786 ± 0.549
1.179PheCys: 1.179 ± 0.635
1.965PheAsp: 1.965 ± 0.901
2.752PheGlu: 2.752 ± 0.64
1.572PhePhe: 1.572 ± 0.457
3.145PheGly: 3.145 ± 1.044
1.179PheHis: 1.179 ± 0.53
1.179PheIle: 1.179 ± 0.603
1.965PheLys: 1.965 ± 0.537
4.717PheLeu: 4.717 ± 2.572
0.393PheMet: 0.393 ± 0.32
1.572PheAsn: 1.572 ± 0.895
0.786PhePro: 0.786 ± 0.723
1.179PheGln: 1.179 ± 0.635
3.538PheArg: 3.538 ± 1.051
3.145PheSer: 3.145 ± 0.851
3.145PheThr: 3.145 ± 0.992
3.931PheVal: 3.931 ± 0.681
0.786PheTrp: 0.786 ± 0.723
0.393PheTyr: 0.393 ± 0.345
0.0PheXaa: 0.0 ± 0.0
Gly
6.682GlyAla: 6.682 ± 1.016
0.786GlyCys: 0.786 ± 0.366
2.752GlyAsp: 2.752 ± 0.959
3.538GlyGlu: 3.538 ± 1.382
2.752GlyPhe: 2.752 ± 1.573
2.752GlyGly: 2.752 ± 1.092
1.179GlyHis: 1.179 ± 0.628
2.752GlyIle: 2.752 ± 1.027
3.931GlyLys: 3.931 ± 1.0
3.538GlyLeu: 3.538 ± 0.809
0.786GlyMet: 0.786 ± 0.669
3.931GlyAsn: 3.931 ± 0.937
3.931GlyPro: 3.931 ± 0.901
1.965GlyGln: 1.965 ± 0.731
7.469GlyArg: 7.469 ± 1.158
9.041GlySer: 9.041 ± 1.902
4.717GlyThr: 4.717 ± 0.984
2.358GlyVal: 2.358 ± 0.741
1.179GlyTrp: 1.179 ± 0.62
1.965GlyTyr: 1.965 ± 0.536
0.0GlyXaa: 0.0 ± 0.0
His
1.179HisAla: 1.179 ± 0.493
1.572HisCys: 1.572 ± 0.558
1.965HisAsp: 1.965 ± 0.684
1.965HisGlu: 1.965 ± 0.756
0.786HisPhe: 0.786 ± 0.533
0.393HisGly: 0.393 ± 0.345
0.0HisHis: 0.0 ± 0.0
1.965HisIle: 1.965 ± 0.849
1.965HisLys: 1.965 ± 1.063
0.786HisLeu: 0.786 ± 0.517
0.0HisMet: 0.0 ± 0.0
1.179HisAsn: 1.179 ± 0.607
2.358HisPro: 2.358 ± 1.132
0.786HisGln: 0.786 ± 0.58
1.179HisArg: 1.179 ± 0.731
1.965HisSer: 1.965 ± 0.756
0.786HisThr: 0.786 ± 0.517
1.179HisVal: 1.179 ± 0.258
0.0HisTrp: 0.0 ± 0.0
0.393HisTyr: 0.393 ± 0.32
0.0HisXaa: 0.0 ± 0.0
Ile
4.324IleAla: 4.324 ± 1.266
0.393IleCys: 0.393 ± 0.345
1.179IleAsp: 1.179 ± 0.605
1.572IleGlu: 1.572 ± 0.695
2.358IlePhe: 2.358 ± 1.225
1.965IleGly: 1.965 ± 0.514
0.786IleHis: 0.786 ± 0.549
0.393IleIle: 0.393 ± 0.335
1.179IleLys: 1.179 ± 0.472
5.896IleLeu: 5.896 ± 1.707
0.786IleMet: 0.786 ± 0.359
3.145IleAsn: 3.145 ± 1.682
2.358IlePro: 2.358 ± 0.746
0.0IleGln: 0.0 ± 0.0
3.145IleArg: 3.145 ± 1.117
5.11IleSer: 5.11 ± 1.069
3.931IleThr: 3.931 ± 1.839
1.179IleVal: 1.179 ± 1.004
0.393IleTrp: 0.393 ± 0.32
1.179IleTyr: 1.179 ± 0.472
0.0IleXaa: 0.0 ± 0.0
Lys
5.11LysAla: 5.11 ± 1.148
1.965LysCys: 1.965 ± 0.575
2.358LysAsp: 2.358 ± 1.2
2.752LysGlu: 2.752 ± 1.015
1.179LysPhe: 1.179 ± 0.496
3.145LysGly: 3.145 ± 0.293
1.965LysHis: 1.965 ± 1.624
3.538LysIle: 3.538 ± 0.897
1.179LysLys: 1.179 ± 0.605
3.145LysLeu: 3.145 ± 1.383
1.572LysMet: 1.572 ± 0.664
0.786LysAsn: 0.786 ± 0.366
6.289LysPro: 6.289 ± 0.993
2.752LysGln: 2.752 ± 1.064
3.145LysArg: 3.145 ± 0.944
4.717LysSer: 4.717 ± 1.344
3.538LysThr: 3.538 ± 1.529
2.358LysVal: 2.358 ± 0.961
0.0LysTrp: 0.0 ± 0.0
1.965LysTyr: 1.965 ± 0.776
0.393LysXaa: 0.393 ± 0.335
Leu
7.862LeuAla: 7.862 ± 2.151
2.358LeuCys: 2.358 ± 0.945
3.931LeuAsp: 3.931 ± 1.405
5.11LeuGlu: 5.11 ± 2.479
2.752LeuPhe: 2.752 ± 1.802
3.145LeuGly: 3.145 ± 1.254
2.358LeuHis: 2.358 ± 0.974
4.324LeuIle: 4.324 ± 1.823
3.931LeuLys: 3.931 ± 1.634
4.717LeuLeu: 4.717 ± 1.643
1.572LeuMet: 1.572 ± 0.99
2.752LeuAsn: 2.752 ± 0.613
3.538LeuPro: 3.538 ± 1.586
3.538LeuGln: 3.538 ± 1.643
3.538LeuArg: 3.538 ± 1.418
7.469LeuSer: 7.469 ± 1.468
5.503LeuThr: 5.503 ± 0.934
3.145LeuVal: 3.145 ± 1.864
1.572LeuTrp: 1.572 ± 0.65
2.752LeuTyr: 2.752 ± 0.528
0.0LeuXaa: 0.0 ± 0.0
Met
1.572MetAla: 1.572 ± 0.719
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.358MetGlu: 2.358 ± 1.08
0.0MetPhe: 0.0 ± 0.0
0.786MetGly: 0.786 ± 0.549
0.0MetHis: 0.0 ± 0.0
1.572MetIle: 1.572 ± 0.906
1.179MetLys: 1.179 ± 0.496
3.145MetLeu: 3.145 ± 1.459
0.786MetMet: 0.786 ± 0.514
1.179MetAsn: 1.179 ± 0.756
0.786MetPro: 0.786 ± 0.669
0.0MetGln: 0.0 ± 0.0
0.786MetArg: 0.786 ± 0.64
1.965MetSer: 1.965 ± 0.467
0.786MetThr: 0.786 ± 0.64
1.965MetVal: 1.965 ± 0.448
0.0MetTrp: 0.0 ± 0.0
0.393MetTyr: 0.393 ± 0.335
0.0MetXaa: 0.0 ± 0.0
Asn
3.145AsnAla: 3.145 ± 0.944
0.393AsnCys: 0.393 ± 0.335
0.393AsnAsp: 0.393 ± 0.335
1.572AsnGlu: 1.572 ± 0.936
1.572AsnPhe: 1.572 ± 0.692
5.503AsnGly: 5.503 ± 1.896
0.786AsnHis: 0.786 ± 0.549
0.786AsnIle: 0.786 ± 0.533
3.931AsnLys: 3.931 ± 0.721
2.752AsnLeu: 2.752 ± 1.117
1.179AsnMet: 1.179 ± 0.308
2.752AsnAsn: 2.752 ± 1.556
2.752AsnPro: 2.752 ± 1.11
1.572AsnGln: 1.572 ± 0.457
2.358AsnArg: 2.358 ± 0.504
3.145AsnSer: 3.145 ± 0.717
3.145AsnThr: 3.145 ± 0.621
2.358AsnVal: 2.358 ± 1.061
0.786AsnTrp: 0.786 ± 0.37
1.965AsnTyr: 1.965 ± 0.536
0.0AsnXaa: 0.0 ± 0.0
Pro
2.752ProAla: 2.752 ± 0.758
0.786ProCys: 0.786 ± 0.366
2.752ProAsp: 2.752 ± 1.426
2.752ProGlu: 2.752 ± 1.6
0.393ProPhe: 0.393 ± 0.42
5.896ProGly: 5.896 ± 0.571
1.572ProHis: 1.572 ± 0.411
1.572ProIle: 1.572 ± 0.732
4.717ProLys: 4.717 ± 1.128
3.145ProLeu: 3.145 ± 1.413
1.179ProMet: 1.179 ± 0.605
1.965ProAsn: 1.965 ± 0.537
4.324ProPro: 4.324 ± 1.86
6.289ProGln: 6.289 ± 1.61
3.931ProArg: 3.931 ± 0.792
5.503ProSer: 5.503 ± 0.951
2.358ProThr: 2.358 ± 0.54
4.717ProVal: 4.717 ± 0.842
0.393ProTrp: 0.393 ± 0.335
1.179ProTyr: 1.179 ± 0.537
0.0ProXaa: 0.0 ± 0.0
Gln
4.324GlnAla: 4.324 ± 1.15
0.786GlnCys: 0.786 ± 0.533
2.358GlnAsp: 2.358 ± 1.061
1.965GlnGlu: 1.965 ± 0.958
2.752GlnPhe: 2.752 ± 1.188
3.538GlnGly: 3.538 ± 0.986
1.965GlnHis: 1.965 ± 0.919
0.393GlnIle: 0.393 ± 0.345
3.931GlnLys: 3.931 ± 0.791
3.145GlnLeu: 3.145 ± 0.934
1.179GlnMet: 1.179 ± 0.76
2.358GlnAsn: 2.358 ± 1.006
3.145GlnPro: 3.145 ± 1.177
0.786GlnGln: 0.786 ± 0.359
5.11GlnArg: 5.11 ± 1.503
2.752GlnSer: 2.752 ± 0.758
3.145GlnThr: 3.145 ± 0.531
2.358GlnVal: 2.358 ± 1.098
0.786GlnTrp: 0.786 ± 0.517
0.393GlnTyr: 0.393 ± 0.345
0.0GlnXaa: 0.0 ± 0.0
Arg
7.075ArgAla: 7.075 ± 0.645
0.393ArgCys: 0.393 ± 0.345
2.358ArgAsp: 2.358 ± 0.7
3.538ArgGlu: 3.538 ± 1.889
2.358ArgPhe: 2.358 ± 0.943
4.717ArgGly: 4.717 ± 1.41
0.786ArgHis: 0.786 ± 0.723
4.717ArgIle: 4.717 ± 1.513
3.931ArgLys: 3.931 ± 0.842
5.503ArgLeu: 5.503 ± 2.761
1.179ArgMet: 1.179 ± 0.493
2.752ArgAsn: 2.752 ± 1.225
3.145ArgPro: 3.145 ± 1.37
3.145ArgGln: 3.145 ± 0.583
12.186ArgArg: 12.186 ± 4.76
6.682ArgSer: 6.682 ± 1.684
3.538ArgThr: 3.538 ± 1.964
4.324ArgVal: 4.324 ± 1.661
2.358ArgTrp: 2.358 ± 0.948
1.572ArgTyr: 1.572 ± 0.631
0.0ArgXaa: 0.0 ± 0.0
Ser
3.538SerAla: 3.538 ± 0.956
0.393SerCys: 0.393 ± 0.345
3.145SerAsp: 3.145 ± 1.087
5.503SerGlu: 5.503 ± 2.059
3.145SerPhe: 3.145 ± 0.654
9.041SerGly: 9.041 ± 1.431
2.752SerHis: 2.752 ± 0.883
3.538SerIle: 3.538 ± 1.182
4.717SerLys: 4.717 ± 0.592
8.648SerLeu: 8.648 ± 1.258
2.358SerMet: 2.358 ± 1.544
3.145SerAsn: 3.145 ± 0.878
4.324SerPro: 4.324 ± 1.314
5.11SerGln: 5.11 ± 1.792
5.11SerArg: 5.11 ± 1.994
18.082SerSer: 18.082 ± 4.002
7.862SerThr: 7.862 ± 1.49
4.717SerVal: 4.717 ± 0.995
1.572SerTrp: 1.572 ± 0.696
3.145SerTyr: 3.145 ± 0.642
0.0SerXaa: 0.0 ± 0.0
Thr
4.717ThrAla: 4.717 ± 1.715
0.786ThrCys: 0.786 ± 0.37
4.324ThrAsp: 4.324 ± 1.042
1.572ThrGlu: 1.572 ± 0.484
3.145ThrPhe: 3.145 ± 0.939
3.538ThrGly: 3.538 ± 1.214
1.572ThrHis: 1.572 ± 0.65
3.145ThrIle: 3.145 ± 1.363
1.965ThrLys: 1.965 ± 0.667
4.717ThrLeu: 4.717 ± 1.358
1.572ThrMet: 1.572 ± 0.552
1.965ThrAsn: 1.965 ± 0.928
3.538ThrPro: 3.538 ± 0.775
2.358ThrGln: 2.358 ± 0.733
5.11ThrArg: 5.11 ± 1.522
5.503ThrSer: 5.503 ± 0.745
5.896ThrThr: 5.896 ± 1.864
3.145ThrVal: 3.145 ± 1.024
0.393ThrTrp: 0.393 ± 0.335
1.572ThrTyr: 1.572 ± 1.049
0.0ThrXaa: 0.0 ± 0.0
Val
3.538ValAla: 3.538 ± 0.645
0.786ValCys: 0.786 ± 1.023
0.786ValAsp: 0.786 ± 0.359
5.503ValGlu: 5.503 ± 1.121
2.752ValPhe: 2.752 ± 0.74
3.538ValGly: 3.538 ± 1.472
1.572ValHis: 1.572 ± 0.363
1.572ValIle: 1.572 ± 0.593
3.145ValLys: 3.145 ± 0.981
5.11ValLeu: 5.11 ± 1.026
1.179ValMet: 1.179 ± 0.258
2.752ValAsn: 2.752 ± 0.854
5.503ValPro: 5.503 ± 0.967
3.931ValGln: 3.931 ± 1.246
1.572ValArg: 1.572 ± 0.484
5.896ValSer: 5.896 ± 1.326
1.572ValThr: 1.572 ± 0.895
5.11ValVal: 5.11 ± 1.625
1.572ValTrp: 1.572 ± 1.049
1.179ValTyr: 1.179 ± 0.704
0.0ValXaa: 0.0 ± 0.0
Trp
0.786TrpAla: 0.786 ± 0.366
0.786TrpCys: 0.786 ± 0.549
0.393TrpAsp: 0.393 ± 0.345
1.572TrpGlu: 1.572 ± 0.457
1.179TrpPhe: 1.179 ± 0.605
1.179TrpGly: 1.179 ± 0.258
0.786TrpHis: 0.786 ± 0.52
0.393TrpIle: 0.393 ± 0.32
0.393TrpLys: 0.393 ± 0.32
1.965TrpLeu: 1.965 ± 1.402
0.0TrpMet: 0.0 ± 0.0
0.393TrpAsn: 0.393 ± 0.335
1.572TrpPro: 1.572 ± 0.694
0.0TrpGln: 0.0 ± 0.0
1.965TrpArg: 1.965 ± 0.901
1.965TrpSer: 1.965 ± 1.231
1.572TrpThr: 1.572 ± 1.016
0.393TrpVal: 0.393 ± 0.345
0.0TrpTrp: 0.0 ± 0.0
0.393TrpTyr: 0.393 ± 0.335
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.572TyrAla: 1.572 ± 0.363
0.0TyrCys: 0.0 ± 0.0
0.786TyrAsp: 0.786 ± 0.37
1.572TyrGlu: 1.572 ± 0.411
1.179TyrPhe: 1.179 ± 0.635
1.965TyrGly: 1.965 ± 0.858
1.572TyrHis: 1.572 ± 0.558
1.965TyrIle: 1.965 ± 0.539
2.752TyrLys: 2.752 ± 0.843
1.179TyrLeu: 1.179 ± 0.537
0.0TyrMet: 0.0 ± 0.0
2.358TyrAsn: 2.358 ± 0.7
0.393TyrPro: 0.393 ± 0.335
1.179TyrGln: 1.179 ± 0.476
1.572TyrArg: 1.572 ± 0.631
2.752TyrSer: 2.752 ± 0.453
1.572TyrThr: 1.572 ± 0.895
1.965TyrVal: 1.965 ± 1.338
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.393XaaVal: 0.393 ± 0.335
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2545 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski