Amino acid dipepetide frequency for Beet pseudoyellows virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.442AlaAla: 2.442 ± 1.289
0.61AlaCys: 0.61 ± 0.288
2.035AlaAsp: 2.035 ± 0.714
2.848AlaGlu: 2.848 ± 0.687
1.628AlaPhe: 1.628 ± 0.584
3.052AlaGly: 3.052 ± 0.524
1.221AlaHis: 1.221 ± 0.405
2.645AlaIle: 2.645 ± 0.834
3.459AlaLys: 3.459 ± 0.704
3.662AlaLeu: 3.662 ± 0.606
0.814AlaMet: 0.814 ± 0.432
3.866AlaAsn: 3.866 ± 1.788
0.61AlaPro: 0.61 ± 0.331
1.628AlaGln: 1.628 ± 0.722
2.035AlaArg: 2.035 ± 0.661
2.645AlaSer: 2.645 ± 0.717
2.035AlaThr: 2.035 ± 0.579
1.221AlaVal: 1.221 ± 0.232
0.203AlaTrp: 0.203 ± 0.124
1.424AlaTyr: 1.424 ± 0.646
0.0AlaXaa: 0.0 ± 0.0
Cys
0.407CysAla: 0.407 ± 0.22
0.407CysCys: 0.407 ± 0.22
1.628CysAsp: 1.628 ± 0.501
1.017CysGlu: 1.017 ± 0.432
0.61CysPhe: 0.61 ± 0.249
0.61CysGly: 0.61 ± 0.372
0.203CysHis: 0.203 ± 0.124
1.221CysIle: 1.221 ± 1.077
1.628CysLys: 1.628 ± 1.048
2.442CysLeu: 2.442 ± 0.81
0.203CysMet: 0.203 ± 0.124
1.628CysAsn: 1.628 ± 0.604
0.203CysPro: 0.203 ± 0.124
0.61CysGln: 0.61 ± 0.331
0.203CysArg: 0.203 ± 0.255
2.035CysSer: 2.035 ± 0.731
1.017CysThr: 1.017 ± 0.256
1.017CysVal: 1.017 ± 0.365
0.203CysTrp: 0.203 ± 0.124
1.221CysTyr: 1.221 ± 0.37
0.0CysXaa: 0.0 ± 0.0
Asp
2.848AspAla: 2.848 ± 0.725
1.424AspCys: 1.424 ± 0.403
4.069AspAsp: 4.069 ± 1.599
3.052AspGlu: 3.052 ± 0.398
3.866AspPhe: 3.866 ± 1.38
3.459AspGly: 3.459 ± 0.642
1.424AspHis: 1.424 ± 0.6
4.476AspIle: 4.476 ± 0.484
4.883AspLys: 4.883 ± 2.006
5.493AspLeu: 5.493 ± 1.2
2.442AspMet: 2.442 ± 0.837
4.273AspAsn: 4.273 ± 1.252
1.424AspPro: 1.424 ± 0.614
1.424AspGln: 1.424 ± 0.595
2.238AspArg: 2.238 ± 0.957
6.104AspSer: 6.104 ± 1.14
2.238AspThr: 2.238 ± 0.78
7.528AspVal: 7.528 ± 1.61
0.407AspTrp: 0.407 ± 0.352
1.424AspTyr: 1.424 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
1.424GluAla: 1.424 ± 0.429
1.017GluCys: 1.017 ± 0.307
3.866GluAsp: 3.866 ± 1.617
1.831GluGlu: 1.831 ± 0.483
3.052GluPhe: 3.052 ± 1.02
2.645GluGly: 2.645 ± 0.662
0.203GluHis: 0.203 ± 0.124
4.273GluIle: 4.273 ± 1.301
5.086GluLys: 5.086 ± 0.891
5.9GluLeu: 5.9 ± 1.606
1.221GluMet: 1.221 ± 0.558
3.459GluAsn: 3.459 ± 1.133
1.424GluPro: 1.424 ± 0.476
0.61GluGln: 0.61 ± 0.331
2.848GluArg: 2.848 ± 0.807
3.459GluSer: 3.459 ± 0.644
3.662GluThr: 3.662 ± 0.996
3.662GluVal: 3.662 ± 0.868
0.61GluTrp: 0.61 ± 0.261
2.645GluTyr: 2.645 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
1.221PheAla: 1.221 ± 0.653
1.628PheCys: 1.628 ± 0.531
5.086PheAsp: 5.086 ± 0.829
2.645PheGlu: 2.645 ± 0.579
2.645PhePhe: 2.645 ± 1.243
3.459PheGly: 3.459 ± 1.061
0.814PheHis: 0.814 ± 0.942
5.086PheIle: 5.086 ± 1.06
4.069PheLys: 4.069 ± 0.752
6.104PheLeu: 6.104 ± 2.592
0.814PheMet: 0.814 ± 0.346
2.848PheAsn: 2.848 ± 0.66
2.035PhePro: 2.035 ± 0.691
0.61PheGln: 0.61 ± 0.46
3.662PheArg: 3.662 ± 1.007
8.138PheSer: 8.138 ± 1.871
3.866PheThr: 3.866 ± 0.656
1.831PheVal: 1.831 ± 0.518
0.407PheTrp: 0.407 ± 0.399
2.035PheTyr: 2.035 ± 0.745
0.0PheXaa: 0.0 ± 0.0
Gly
2.035GlyAla: 2.035 ± 0.697
1.017GlyCys: 1.017 ± 0.371
4.069GlyAsp: 4.069 ± 1.008
3.459GlyGlu: 3.459 ± 0.975
2.645GlyPhe: 2.645 ± 0.617
3.866GlyGly: 3.866 ± 0.788
0.61GlyHis: 0.61 ± 0.613
2.848GlyIle: 2.848 ± 0.918
4.069GlyLys: 4.069 ± 0.715
4.273GlyLeu: 4.273 ± 0.75
1.221GlyMet: 1.221 ± 0.743
2.848GlyAsn: 2.848 ± 1.017
0.407GlyPro: 0.407 ± 0.333
0.407GlyGln: 0.407 ± 0.223
2.848GlyArg: 2.848 ± 0.862
2.848GlySer: 2.848 ± 0.821
1.628GlyThr: 1.628 ± 0.518
4.883GlyVal: 4.883 ± 1.134
0.203GlyTrp: 0.203 ± 0.255
1.424GlyTyr: 1.424 ± 0.545
0.0GlyXaa: 0.0 ± 0.0
His
0.814HisAla: 0.814 ± 0.265
0.407HisCys: 0.407 ± 0.248
1.424HisAsp: 1.424 ± 0.479
0.203HisGlu: 0.203 ± 0.32
1.221HisPhe: 1.221 ± 0.703
0.814HisGly: 0.814 ± 0.398
0.407HisHis: 0.407 ± 0.248
0.814HisIle: 0.814 ± 0.439
1.221HisLys: 1.221 ± 0.533
1.424HisLeu: 1.424 ± 0.828
0.203HisMet: 0.203 ± 0.124
0.61HisAsn: 0.61 ± 0.288
0.407HisPro: 0.407 ± 0.223
0.0HisGln: 0.0 ± 0.0
1.424HisArg: 1.424 ± 0.532
1.628HisSer: 1.628 ± 0.392
0.61HisThr: 0.61 ± 0.247
0.61HisVal: 0.61 ± 0.263
0.0HisTrp: 0.0 ± 0.0
1.424HisTyr: 1.424 ± 0.349
0.0HisXaa: 0.0 ± 0.0
Ile
2.238IleAla: 2.238 ± 0.814
0.61IleCys: 0.61 ± 0.458
6.104IleAsp: 6.104 ± 1.515
4.069IleGlu: 4.069 ± 0.733
3.866IlePhe: 3.866 ± 1.169
1.424IleGly: 1.424 ± 0.456
0.814IleHis: 0.814 ± 0.329
4.069IleIle: 4.069 ± 0.739
6.104IleLys: 6.104 ± 2.309
6.918IleLeu: 6.918 ± 2.098
1.424IleMet: 1.424 ± 0.783
5.086IleAsn: 5.086 ± 0.954
2.442IlePro: 2.442 ± 0.489
1.831IleGln: 1.831 ± 0.592
3.255IleArg: 3.255 ± 0.737
9.969IleSer: 9.969 ± 0.677
2.645IleThr: 2.645 ± 0.695
5.493IleVal: 5.493 ± 1.702
0.0IleTrp: 0.0 ± 0.0
2.442IleTyr: 2.442 ± 0.681
0.0IleXaa: 0.0 ± 0.0
Lys
2.645LysAla: 2.645 ± 0.495
1.424LysCys: 1.424 ± 0.502
3.459LysAsp: 3.459 ± 0.904
4.069LysGlu: 4.069 ± 1.195
5.493LysPhe: 5.493 ± 2.311
3.255LysGly: 3.255 ± 0.758
1.017LysHis: 1.017 ± 0.47
6.104LysIle: 6.104 ± 0.913
4.476LysLys: 4.476 ± 1.103
7.121LysLeu: 7.121 ± 1.081
1.424LysMet: 1.424 ± 0.569
3.866LysAsn: 3.866 ± 1.186
3.052LysPro: 3.052 ± 0.709
2.238LysGln: 2.238 ± 0.88
5.697LysArg: 5.697 ± 0.967
6.104LysSer: 6.104 ± 1.291
5.29LysThr: 5.29 ± 0.68
5.086LysVal: 5.086 ± 0.829
0.203LysTrp: 0.203 ± 0.124
2.442LysTyr: 2.442 ± 0.793
0.0LysXaa: 0.0 ± 0.0
Leu
3.255LeuAla: 3.255 ± 0.533
1.831LeuCys: 1.831 ± 0.564
4.273LeuAsp: 4.273 ± 1.047
5.29LeuGlu: 5.29 ± 1.308
5.493LeuPhe: 5.493 ± 1.38
4.069LeuGly: 4.069 ± 0.928
1.221LeuHis: 1.221 ± 0.414
6.511LeuIle: 6.511 ± 1.16
10.376LeuLys: 10.376 ± 1.363
8.545LeuLeu: 8.545 ± 1.822
2.848LeuMet: 2.848 ± 0.658
5.697LeuAsn: 5.697 ± 1.231
3.662LeuPro: 3.662 ± 1.876
2.645LeuGln: 2.645 ± 1.038
6.714LeuArg: 6.714 ± 1.25
9.969LeuSer: 9.969 ± 1.105
4.883LeuThr: 4.883 ± 1.316
6.307LeuVal: 6.307 ± 0.94
0.203LeuTrp: 0.203 ± 0.255
4.476LeuTyr: 4.476 ± 1.222
0.0LeuXaa: 0.0 ± 0.0
Met
0.61MetAla: 0.61 ± 0.249
0.814MetCys: 0.814 ± 0.496
1.221MetAsp: 1.221 ± 0.802
1.831MetGlu: 1.831 ± 0.673
1.831MetPhe: 1.831 ± 0.781
1.221MetGly: 1.221 ± 0.405
0.0MetHis: 0.0 ± 0.0
2.035MetIle: 2.035 ± 0.639
1.831MetLys: 1.831 ± 0.592
2.238MetLeu: 2.238 ± 0.992
0.0MetMet: 0.0 ± 0.0
1.017MetAsn: 1.017 ± 0.432
0.61MetPro: 0.61 ± 0.372
0.814MetGln: 0.814 ± 0.446
0.814MetArg: 0.814 ± 0.361
2.645MetSer: 2.645 ± 0.406
1.831MetThr: 1.831 ± 1.115
2.035MetVal: 2.035 ± 0.57
0.0MetTrp: 0.0 ± 0.0
0.814MetTyr: 0.814 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
2.035AsnAla: 2.035 ± 0.475
0.407AsnCys: 0.407 ± 0.22
3.052AsnAsp: 3.052 ± 0.472
3.255AsnGlu: 3.255 ± 0.883
4.273AsnPhe: 4.273 ± 1.271
2.442AsnGly: 2.442 ± 0.893
0.0AsnHis: 0.0 ± 0.0
4.883AsnIle: 4.883 ± 1.207
3.255AsnLys: 3.255 ± 1.341
6.714AsnLeu: 6.714 ± 0.771
1.017AsnMet: 1.017 ± 0.443
2.442AsnAsn: 2.442 ± 0.825
1.831AsnPro: 1.831 ± 0.344
3.255AsnGln: 3.255 ± 1.002
2.442AsnArg: 2.442 ± 0.909
4.883AsnSer: 4.883 ± 1.33
4.273AsnThr: 4.273 ± 1.254
5.086AsnVal: 5.086 ± 1.906
0.407AsnTrp: 0.407 ± 0.303
2.645AsnTyr: 2.645 ± 0.982
0.0AsnXaa: 0.0 ± 0.0
Pro
0.814ProAla: 0.814 ± 0.371
0.407ProCys: 0.407 ± 0.248
1.628ProAsp: 1.628 ± 0.463
2.035ProGlu: 2.035 ± 0.609
1.017ProPhe: 1.017 ± 0.751
1.628ProGly: 1.628 ± 0.659
0.814ProHis: 0.814 ± 0.386
2.848ProIle: 2.848 ± 1.233
2.035ProLys: 2.035 ± 1.214
3.662ProLeu: 3.662 ± 0.813
1.017ProMet: 1.017 ± 0.373
1.628ProAsn: 1.628 ± 0.494
2.035ProPro: 2.035 ± 1.009
0.61ProGln: 0.61 ± 0.263
0.814ProArg: 0.814 ± 0.329
2.442ProSer: 2.442 ± 0.88
2.645ProThr: 2.645 ± 0.996
2.238ProVal: 2.238 ± 0.752
0.0ProTrp: 0.0 ± 0.0
1.831ProTyr: 1.831 ± 0.598
0.0ProXaa: 0.0 ± 0.0
Gln
0.61GlnAla: 0.61 ± 0.549
0.814GlnCys: 0.814 ± 0.439
1.424GlnAsp: 1.424 ± 0.914
1.424GlnGlu: 1.424 ± 0.682
2.645GlnPhe: 2.645 ± 0.694
0.61GlnGly: 0.61 ± 0.342
0.814GlnHis: 0.814 ± 0.529
1.424GlnIle: 1.424 ± 0.567
2.035GlnLys: 2.035 ± 0.638
2.238GlnLeu: 2.238 ± 0.776
1.017GlnMet: 1.017 ± 0.417
1.831GlnAsn: 1.831 ± 0.921
0.814GlnPro: 0.814 ± 0.329
1.424GlnGln: 1.424 ± 0.494
1.221GlnArg: 1.221 ± 0.408
2.035GlnSer: 2.035 ± 0.681
1.831GlnThr: 1.831 ± 0.558
1.831GlnVal: 1.831 ± 0.428
0.61GlnTrp: 0.61 ± 0.622
0.814GlnTyr: 0.814 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
3.052ArgAla: 3.052 ± 0.849
1.221ArgCys: 1.221 ± 0.558
3.255ArgAsp: 3.255 ± 1.117
1.831ArgGlu: 1.831 ± 0.375
3.255ArgPhe: 3.255 ± 0.972
2.035ArgGly: 2.035 ± 0.707
1.017ArgHis: 1.017 ± 0.432
2.238ArgIle: 2.238 ± 0.759
3.255ArgLys: 3.255 ± 0.542
4.883ArgLeu: 4.883 ± 0.921
2.645ArgMet: 2.645 ± 0.863
2.238ArgAsn: 2.238 ± 0.491
1.628ArgPro: 1.628 ± 0.326
2.238ArgGln: 2.238 ± 0.631
3.662ArgArg: 3.662 ± 0.514
4.883ArgSer: 4.883 ± 1.093
3.662ArgThr: 3.662 ± 1.062
3.052ArgVal: 3.052 ± 0.957
0.61ArgTrp: 0.61 ± 0.368
1.424ArgTyr: 1.424 ± 0.729
0.0ArgXaa: 0.0 ± 0.0
Ser
5.29SerAla: 5.29 ± 1.116
0.814SerCys: 0.814 ± 0.622
6.714SerAsp: 6.714 ± 1.398
5.29SerGlu: 5.29 ± 1.063
5.9SerPhe: 5.9 ± 1.514
3.866SerGly: 3.866 ± 0.97
2.848SerHis: 2.848 ± 0.608
6.104SerIle: 6.104 ± 0.892
6.104SerLys: 6.104 ± 1.177
9.563SerLeu: 9.563 ± 1.37
1.221SerMet: 1.221 ± 0.758
4.883SerAsn: 4.883 ± 0.638
2.645SerPro: 2.645 ± 0.559
3.255SerGln: 3.255 ± 0.599
4.273SerArg: 4.273 ± 1.049
5.697SerSer: 5.697 ± 0.624
5.493SerThr: 5.493 ± 1.666
7.325SerVal: 7.325 ± 1.003
0.407SerTrp: 0.407 ± 0.303
3.866SerTyr: 3.866 ± 0.908
0.0SerXaa: 0.0 ± 0.0
Thr
2.848ThrAla: 2.848 ± 1.105
0.61ThrCys: 0.61 ± 0.263
2.035ThrAsp: 2.035 ± 0.727
2.848ThrGlu: 2.848 ± 1.275
3.866ThrPhe: 3.866 ± 0.828
4.273ThrGly: 4.273 ± 0.735
0.814ThrHis: 0.814 ± 0.398
4.68ThrIle: 4.68 ± 0.742
2.035ThrLys: 2.035 ± 0.642
7.325ThrLeu: 7.325 ± 1.404
0.814ThrMet: 0.814 ± 0.271
4.069ThrAsn: 4.069 ± 0.932
1.424ThrPro: 1.424 ± 0.463
1.831ThrGln: 1.831 ± 0.627
1.017ThrArg: 1.017 ± 0.365
5.29ThrSer: 5.29 ± 1.307
2.848ThrThr: 2.848 ± 0.773
4.273ThrVal: 4.273 ± 1.212
0.61ThrTrp: 0.61 ± 0.372
3.459ThrTyr: 3.459 ± 0.531
0.0ThrXaa: 0.0 ± 0.0
Val
3.255ValAla: 3.255 ± 1.365
2.238ValCys: 2.238 ± 0.771
4.68ValAsp: 4.68 ± 0.514
3.662ValGlu: 3.662 ± 1.11
3.255ValPhe: 3.255 ± 0.83
3.459ValGly: 3.459 ± 0.846
0.814ValHis: 0.814 ± 0.496
4.883ValIle: 4.883 ± 1.308
4.883ValLys: 4.883 ± 1.366
4.68ValLeu: 4.68 ± 1.362
2.442ValMet: 2.442 ± 0.895
4.883ValAsn: 4.883 ± 0.885
3.662ValPro: 3.662 ± 0.899
1.628ValGln: 1.628 ± 0.266
4.069ValArg: 4.069 ± 0.669
6.918ValSer: 6.918 ± 0.844
4.883ValThr: 4.883 ± 0.686
3.459ValVal: 3.459 ± 0.623
0.203ValTrp: 0.203 ± 0.124
2.238ValTyr: 2.238 ± 0.409
0.0ValXaa: 0.0 ± 0.0
Trp
0.407TrpAla: 0.407 ± 0.211
0.0TrpCys: 0.0 ± 0.0
0.814TrpAsp: 0.814 ± 0.449
0.0TrpGlu: 0.0 ± 0.0
0.407TrpPhe: 0.407 ± 0.22
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.407TrpIle: 0.407 ± 0.484
0.814TrpLys: 0.814 ± 0.316
1.017TrpLeu: 1.017 ± 0.358
0.407TrpMet: 0.407 ± 0.333
0.203TrpAsn: 0.203 ± 0.255
0.203TrpPro: 0.203 ± 0.255
0.0TrpGln: 0.0 ± 0.0
0.203TrpArg: 0.203 ± 0.242
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.203TrpVal: 0.203 ± 0.247
0.0TrpTrp: 0.0 ± 0.0
0.203TrpTyr: 0.203 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.831TyrAla: 1.831 ± 0.644
0.814TyrCys: 0.814 ± 0.398
3.662TyrAsp: 3.662 ± 0.922
2.238TyrGlu: 2.238 ± 0.671
2.035TyrPhe: 2.035 ± 0.678
1.424TyrGly: 1.424 ± 1.357
0.61TyrHis: 0.61 ± 0.372
3.459TyrIle: 3.459 ± 1.089
2.848TyrLys: 2.848 ± 0.699
4.069TyrLeu: 4.069 ± 0.81
0.814TyrMet: 0.814 ± 0.453
1.221TyrAsn: 1.221 ± 0.441
1.628TyrPro: 1.628 ± 0.494
0.407TyrGln: 0.407 ± 0.379
2.645TyrArg: 2.645 ± 1.161
3.866TyrSer: 3.866 ± 0.842
1.628TyrThr: 1.628 ± 0.811
3.052TyrVal: 3.052 ± 0.77
0.0TyrTrp: 0.0 ± 0.0
1.424TyrTyr: 1.424 ± 0.701
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4916 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski