Amino acid dipepetide frequency for Carrot yellow leaf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.168AlaAla: 7.168 ± 1.226
0.581AlaCys: 0.581 ± 0.37
2.518AlaAsp: 2.518 ± 0.757
3.874AlaGlu: 3.874 ± 1.199
4.068AlaPhe: 4.068 ± 1.133
2.906AlaGly: 2.906 ± 0.696
1.162AlaHis: 1.162 ± 0.356
4.456AlaIle: 4.456 ± 0.603
3.487AlaLys: 3.487 ± 0.757
5.037AlaLeu: 5.037 ± 0.695
0.969AlaMet: 0.969 ± 0.458
2.325AlaAsn: 2.325 ± 0.675
2.906AlaPro: 2.906 ± 0.97
1.162AlaGln: 1.162 ± 0.651
3.293AlaArg: 3.293 ± 0.39
5.812AlaSer: 5.812 ± 0.886
3.874AlaThr: 3.874 ± 1.38
3.487AlaVal: 3.487 ± 0.572
0.0AlaTrp: 0.0 ± 0.0
1.356AlaTyr: 1.356 ± 0.398
0.0AlaXaa: 0.0 ± 0.0
Cys
1.937CysAla: 1.937 ± 0.634
0.581CysCys: 0.581 ± 0.365
1.356CysAsp: 1.356 ± 0.544
1.55CysGlu: 1.55 ± 0.284
1.356CysPhe: 1.356 ± 0.275
1.162CysGly: 1.162 ± 0.618
0.194CysHis: 0.194 ± 0.228
0.969CysIle: 0.969 ± 0.322
0.969CysLys: 0.969 ± 0.467
2.906CysLeu: 2.906 ± 1.449
0.387CysMet: 0.387 ± 0.197
0.194CysAsn: 0.194 ± 0.228
0.581CysPro: 0.581 ± 0.223
0.775CysGln: 0.775 ± 0.253
1.55CysArg: 1.55 ± 0.37
2.325CysSer: 2.325 ± 0.865
0.581CysThr: 0.581 ± 0.33
1.55CysVal: 1.55 ± 0.593
0.387CysTrp: 0.387 ± 0.391
1.162CysTyr: 1.162 ± 0.356
0.0CysXaa: 0.0 ± 0.0
Asp
4.649AspAla: 4.649 ± 0.607
0.581AspCys: 0.581 ± 0.299
2.712AspAsp: 2.712 ± 0.443
3.1AspGlu: 3.1 ± 0.562
3.874AspPhe: 3.874 ± 0.768
1.744AspGly: 1.744 ± 0.371
1.937AspHis: 1.937 ± 0.467
2.906AspIle: 2.906 ± 0.695
2.712AspLys: 2.712 ± 0.806
5.812AspLeu: 5.812 ± 1.147
1.356AspMet: 1.356 ± 0.516
1.356AspAsn: 1.356 ± 0.521
1.356AspPro: 1.356 ± 0.412
1.356AspGln: 1.356 ± 0.419
2.712AspArg: 2.712 ± 0.638
4.649AspSer: 4.649 ± 0.7
2.518AspThr: 2.518 ± 0.405
5.424AspVal: 5.424 ± 0.842
0.194AspTrp: 0.194 ± 0.252
3.1AspTyr: 3.1 ± 1.34
0.0AspXaa: 0.0 ± 0.0
Glu
3.1GluAla: 3.1 ± 0.838
1.162GluCys: 1.162 ± 0.346
2.518GluAsp: 2.518 ± 0.794
2.131GluGlu: 2.131 ± 0.587
2.325GluPhe: 2.325 ± 0.643
3.1GluGly: 3.1 ± 0.828
0.581GluHis: 0.581 ± 0.278
4.456GluIle: 4.456 ± 1.034
5.424GluLys: 5.424 ± 0.691
3.874GluLeu: 3.874 ± 0.852
1.937GluMet: 1.937 ± 0.867
2.518GluAsn: 2.518 ± 0.393
1.937GluPro: 1.937 ± 0.506
0.775GluGln: 0.775 ± 0.369
3.874GluArg: 3.874 ± 0.6
3.681GluSer: 3.681 ± 0.419
4.843GluThr: 4.843 ± 0.953
5.037GluVal: 5.037 ± 0.776
0.194GluTrp: 0.194 ± 0.118
2.131GluTyr: 2.131 ± 0.567
0.0GluXaa: 0.0 ± 0.0
Phe
2.325PheAla: 2.325 ± 0.592
1.744PheCys: 1.744 ± 0.696
3.681PheAsp: 3.681 ± 0.683
3.487PheGlu: 3.487 ± 0.977
2.906PhePhe: 2.906 ± 0.745
4.068PheGly: 4.068 ± 0.892
1.744PheHis: 1.744 ± 0.453
3.1PheIle: 3.1 ± 0.893
3.874PheLys: 3.874 ± 1.242
6.587PheLeu: 6.587 ± 1.091
1.356PheMet: 1.356 ± 0.473
1.937PheAsn: 1.937 ± 0.459
1.356PhePro: 1.356 ± 0.371
0.581PheGln: 0.581 ± 0.223
1.744PheArg: 1.744 ± 0.6
9.299PheSer: 9.299 ± 1.145
2.712PheThr: 2.712 ± 0.696
4.456PheVal: 4.456 ± 0.516
0.387PheTrp: 0.387 ± 0.273
1.744PheTyr: 1.744 ± 0.583
0.0PheXaa: 0.0 ± 0.0
Gly
2.518GlyAla: 2.518 ± 0.397
1.937GlyCys: 1.937 ± 0.825
3.293GlyAsp: 3.293 ± 0.537
4.649GlyGlu: 4.649 ± 0.661
1.937GlyPhe: 1.937 ± 0.791
3.487GlyGly: 3.487 ± 1.402
0.775GlyHis: 0.775 ± 0.412
2.518GlyIle: 2.518 ± 1.238
4.649GlyLys: 4.649 ± 0.603
4.068GlyLeu: 4.068 ± 0.82
0.194GlyMet: 0.194 ± 0.242
2.131GlyAsn: 2.131 ± 0.84
0.969GlyPro: 0.969 ± 0.252
1.162GlyGln: 1.162 ± 0.502
2.518GlyArg: 2.518 ± 0.421
4.262GlySer: 4.262 ± 0.658
1.937GlyThr: 1.937 ± 1.055
5.424GlyVal: 5.424 ± 0.678
0.387GlyTrp: 0.387 ± 0.387
1.937GlyTyr: 1.937 ± 0.637
0.0GlyXaa: 0.0 ± 0.0
His
1.744HisAla: 1.744 ± 0.524
1.162HisCys: 1.162 ± 0.955
1.744HisAsp: 1.744 ± 0.378
1.356HisGlu: 1.356 ± 0.35
1.55HisPhe: 1.55 ± 0.619
0.775HisGly: 0.775 ± 0.371
1.162HisHis: 1.162 ± 0.283
1.744HisIle: 1.744 ± 0.378
0.775HisLys: 0.775 ± 0.398
0.775HisLeu: 0.775 ± 0.611
0.194HisMet: 0.194 ± 0.234
0.387HisAsn: 0.387 ± 0.237
0.969HisPro: 0.969 ± 0.294
0.387HisGln: 0.387 ± 0.231
1.55HisArg: 1.55 ± 0.628
3.487HisSer: 3.487 ± 0.72
0.775HisThr: 0.775 ± 0.328
1.162HisVal: 1.162 ± 0.548
0.0HisTrp: 0.0 ± 0.0
1.356HisTyr: 1.356 ± 0.448
0.0HisXaa: 0.0 ± 0.0
Ile
3.487IleAla: 3.487 ± 0.99
0.969IleCys: 0.969 ± 0.432
3.874IleAsp: 3.874 ± 1.075
1.744IleGlu: 1.744 ± 0.417
2.906IlePhe: 2.906 ± 0.919
2.906IleGly: 2.906 ± 1.154
1.356IleHis: 1.356 ± 0.446
3.681IleIle: 3.681 ± 0.905
2.712IleLys: 2.712 ± 0.453
5.424IleLeu: 5.424 ± 0.954
1.162IleMet: 1.162 ± 0.537
1.162IleAsn: 1.162 ± 0.302
3.681IlePro: 3.681 ± 0.862
1.162IleGln: 1.162 ± 0.312
2.518IleArg: 2.518 ± 0.61
7.168IleSer: 7.168 ± 0.92
4.068IleThr: 4.068 ± 0.734
3.681IleVal: 3.681 ± 0.833
0.194IleTrp: 0.194 ± 0.252
1.356IleTyr: 1.356 ± 0.923
0.0IleXaa: 0.0 ± 0.0
Lys
2.518LysAla: 2.518 ± 0.786
1.744LysCys: 1.744 ± 0.536
4.068LysAsp: 4.068 ± 1.183
4.456LysGlu: 4.456 ± 0.773
3.293LysPhe: 3.293 ± 1.019
2.712LysGly: 2.712 ± 0.805
0.969LysHis: 0.969 ± 0.475
4.068LysIle: 4.068 ± 1.081
4.456LysLys: 4.456 ± 0.882
5.812LysLeu: 5.812 ± 1.344
1.55LysMet: 1.55 ± 0.906
3.874LysAsn: 3.874 ± 0.85
2.712LysPro: 2.712 ± 0.481
1.55LysGln: 1.55 ± 0.494
4.068LysArg: 4.068 ± 0.627
6.005LysSer: 6.005 ± 1.685
3.681LysThr: 3.681 ± 0.779
3.1LysVal: 3.1 ± 0.968
1.162LysTrp: 1.162 ± 0.652
3.1LysTyr: 3.1 ± 0.723
0.0LysXaa: 0.0 ± 0.0
Leu
4.456LeuAla: 4.456 ± 0.685
1.356LeuCys: 1.356 ± 0.447
5.424LeuAsp: 5.424 ± 0.793
4.456LeuGlu: 4.456 ± 0.937
6.199LeuPhe: 6.199 ± 0.783
4.262LeuGly: 4.262 ± 1.444
2.518LeuHis: 2.518 ± 0.565
3.487LeuIle: 3.487 ± 0.762
8.136LeuLys: 8.136 ± 1.183
7.943LeuLeu: 7.943 ± 0.984
1.937LeuMet: 1.937 ± 0.746
5.424LeuAsn: 5.424 ± 1.314
5.231LeuPro: 5.231 ± 1.077
2.906LeuGln: 2.906 ± 0.962
5.231LeuArg: 5.231 ± 1.042
10.849LeuSer: 10.849 ± 1.023
6.393LeuThr: 6.393 ± 1.057
5.424LeuVal: 5.424 ± 1.24
0.581LeuTrp: 0.581 ± 0.441
3.874LeuTyr: 3.874 ± 0.897
0.0LeuXaa: 0.0 ± 0.0
Met
0.775MetAla: 0.775 ± 0.462
0.0MetCys: 0.0 ± 0.0
0.969MetAsp: 0.969 ± 0.423
1.356MetGlu: 1.356 ± 0.654
1.55MetPhe: 1.55 ± 0.337
0.387MetGly: 0.387 ± 0.361
0.581MetHis: 0.581 ± 0.428
1.356MetIle: 1.356 ± 0.373
0.969MetLys: 0.969 ± 0.332
1.937MetLeu: 1.937 ± 0.414
0.387MetMet: 0.387 ± 0.237
1.356MetAsn: 1.356 ± 0.778
0.387MetPro: 0.387 ± 0.349
0.969MetGln: 0.969 ± 0.553
0.581MetArg: 0.581 ± 0.286
1.744MetSer: 1.744 ± 0.629
1.162MetThr: 1.162 ± 0.283
1.744MetVal: 1.744 ± 0.456
0.0MetTrp: 0.0 ± 0.0
0.775MetTyr: 0.775 ± 0.497
0.0MetXaa: 0.0 ± 0.0
Asn
2.712AsnAla: 2.712 ± 0.471
0.581AsnCys: 0.581 ± 0.355
2.131AsnAsp: 2.131 ± 0.539
1.937AsnGlu: 1.937 ± 1.208
2.712AsnPhe: 2.712 ± 0.427
1.744AsnGly: 1.744 ± 0.444
0.581AsnHis: 0.581 ± 0.278
1.937AsnIle: 1.937 ± 0.924
1.937AsnLys: 1.937 ± 0.792
3.293AsnLeu: 3.293 ± 0.498
0.194AsnMet: 0.194 ± 0.252
0.969AsnAsn: 0.969 ± 0.505
2.131AsnPro: 2.131 ± 0.734
0.387AsnGln: 0.387 ± 0.24
1.55AsnArg: 1.55 ± 0.395
4.262AsnSer: 4.262 ± 0.64
3.1AsnThr: 3.1 ± 0.62
2.325AsnVal: 2.325 ± 0.93
0.0AsnTrp: 0.0 ± 0.0
2.325AsnTyr: 2.325 ± 0.385
0.0AsnXaa: 0.0 ± 0.0
Pro
1.744ProAla: 1.744 ± 0.463
0.775ProCys: 0.775 ± 0.398
2.325ProAsp: 2.325 ± 1.09
1.937ProGlu: 1.937 ± 0.798
1.744ProPhe: 1.744 ± 0.447
2.518ProGly: 2.518 ± 0.701
0.969ProHis: 0.969 ± 0.373
1.55ProIle: 1.55 ± 0.409
3.1ProLys: 3.1 ± 0.653
4.456ProLeu: 4.456 ± 0.834
1.162ProMet: 1.162 ± 0.506
1.55ProAsn: 1.55 ± 0.882
1.937ProPro: 1.937 ± 0.569
0.775ProGln: 0.775 ± 0.314
2.906ProArg: 2.906 ± 0.933
5.037ProSer: 5.037 ± 0.49
1.356ProThr: 1.356 ± 1.078
2.325ProVal: 2.325 ± 0.573
0.194ProTrp: 0.194 ± 0.118
1.744ProTyr: 1.744 ± 0.665
0.0ProXaa: 0.0 ± 0.0
Gln
1.744GlnAla: 1.744 ± 0.457
1.162GlnCys: 1.162 ± 0.349
0.387GlnAsp: 0.387 ± 0.231
1.162GlnGlu: 1.162 ± 0.692
0.581GlnPhe: 0.581 ± 0.319
1.744GlnGly: 1.744 ± 0.628
0.581GlnHis: 0.581 ± 0.445
0.969GlnIle: 0.969 ± 0.399
1.744GlnLys: 1.744 ± 0.829
3.487GlnLeu: 3.487 ± 0.787
0.387GlnMet: 0.387 ± 0.237
0.775GlnAsn: 0.775 ± 0.329
0.0GlnPro: 0.0 ± 0.0
0.969GlnGln: 0.969 ± 0.274
1.55GlnArg: 1.55 ± 0.506
2.325GlnSer: 2.325 ± 0.992
1.744GlnThr: 1.744 ± 0.506
1.937GlnVal: 1.937 ± 0.614
0.194GlnTrp: 0.194 ± 0.118
0.387GlnTyr: 0.387 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
3.293ArgAla: 3.293 ± 1.076
2.906ArgCys: 2.906 ± 1.029
2.712ArgAsp: 2.712 ± 0.896
2.325ArgGlu: 2.325 ± 1.082
2.712ArgPhe: 2.712 ± 1.204
2.906ArgGly: 2.906 ± 0.609
1.356ArgHis: 1.356 ± 0.513
3.1ArgIle: 3.1 ± 0.642
4.068ArgLys: 4.068 ± 1.437
5.231ArgLeu: 5.231 ± 0.853
0.581ArgMet: 0.581 ± 0.299
2.131ArgAsn: 2.131 ± 0.4
1.744ArgPro: 1.744 ± 0.429
1.356ArgGln: 1.356 ± 0.597
3.681ArgArg: 3.681 ± 0.843
5.812ArgSer: 5.812 ± 1.908
2.518ArgThr: 2.518 ± 0.614
4.649ArgVal: 4.649 ± 0.533
0.387ArgTrp: 0.387 ± 0.358
1.937ArgTyr: 1.937 ± 0.857
0.194ArgXaa: 0.194 ± 0.118
Ser
8.33SerAla: 8.33 ± 1.014
1.744SerCys: 1.744 ± 0.268
5.424SerAsp: 5.424 ± 0.765
6.974SerGlu: 6.974 ± 1.533
7.555SerPhe: 7.555 ± 0.975
5.037SerGly: 5.037 ± 0.98
2.712SerHis: 2.712 ± 0.825
5.424SerIle: 5.424 ± 0.932
6.393SerLys: 6.393 ± 1.122
10.461SerLeu: 10.461 ± 1.137
1.55SerMet: 1.55 ± 0.447
3.487SerAsn: 3.487 ± 1.18
5.424SerPro: 5.424 ± 1.025
2.518SerGln: 2.518 ± 1.305
7.943SerArg: 7.943 ± 2.321
13.173SerSer: 13.173 ± 1.387
4.649SerThr: 4.649 ± 1.073
7.168SerVal: 7.168 ± 1.28
0.194SerTrp: 0.194 ± 0.252
4.456SerTyr: 4.456 ± 0.728
0.0SerXaa: 0.0 ± 0.0
Thr
2.712ThrAla: 2.712 ± 0.781
1.162ThrCys: 1.162 ± 0.322
1.744ThrAsp: 1.744 ± 0.726
2.712ThrGlu: 2.712 ± 0.886
4.649ThrPhe: 4.649 ± 0.91
3.1ThrGly: 3.1 ± 0.671
0.969ThrHis: 0.969 ± 0.399
4.649ThrIle: 4.649 ± 1.115
2.906ThrLys: 2.906 ± 0.742
6.005ThrLeu: 6.005 ± 1.169
0.775ThrMet: 0.775 ± 0.369
1.55ThrAsn: 1.55 ± 0.662
2.325ThrPro: 2.325 ± 0.598
1.744ThrGln: 1.744 ± 0.91
2.712ThrArg: 2.712 ± 0.882
6.393ThrSer: 6.393 ± 0.873
3.681ThrThr: 3.681 ± 0.782
3.487ThrVal: 3.487 ± 1.104
0.387ThrTrp: 0.387 ± 0.231
2.518ThrTyr: 2.518 ± 0.581
0.0ThrXaa: 0.0 ± 0.0
Val
3.681ValAla: 3.681 ± 0.675
1.744ValCys: 1.744 ± 0.701
4.262ValAsp: 4.262 ± 0.629
3.874ValGlu: 3.874 ± 0.769
4.456ValPhe: 4.456 ± 0.914
3.874ValGly: 3.874 ± 0.77
1.744ValHis: 1.744 ± 0.378
3.293ValIle: 3.293 ± 0.928
4.649ValLys: 4.649 ± 0.855
6.587ValLeu: 6.587 ± 1.428
1.356ValMet: 1.356 ± 0.489
2.325ValAsn: 2.325 ± 0.456
3.487ValPro: 3.487 ± 0.448
2.131ValGln: 2.131 ± 0.609
3.681ValArg: 3.681 ± 0.592
8.718ValSer: 8.718 ± 1.691
2.906ValThr: 2.906 ± 0.565
4.068ValVal: 4.068 ± 0.662
0.387ValTrp: 0.387 ± 0.399
2.906ValTyr: 2.906 ± 0.56
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.387TrpCys: 0.387 ± 0.399
0.194TrpAsp: 0.194 ± 0.118
0.194TrpGlu: 0.194 ± 0.118
0.194TrpPhe: 0.194 ± 0.296
0.387TrpGly: 0.387 ± 0.206
0.0TrpHis: 0.0 ± 0.0
0.581TrpIle: 0.581 ± 0.441
0.194TrpLys: 0.194 ± 0.252
0.969TrpLeu: 0.969 ± 0.552
0.387TrpMet: 0.387 ± 0.48
0.194TrpAsn: 0.194 ± 0.118
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.387TrpSer: 0.387 ± 0.273
0.387TrpThr: 0.387 ± 0.206
0.969TrpVal: 0.969 ± 0.567
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.356TyrAla: 1.356 ± 0.479
0.194TyrCys: 0.194 ± 0.118
2.712TyrAsp: 2.712 ± 0.605
2.325TyrGlu: 2.325 ± 0.718
2.518TyrPhe: 2.518 ± 0.802
1.937TyrGly: 1.937 ± 0.766
1.162TyrHis: 1.162 ± 0.575
1.162TyrIle: 1.162 ± 0.647
1.744TyrLys: 1.744 ± 0.373
5.424TyrLeu: 5.424 ± 1.096
0.969TyrMet: 0.969 ± 0.328
1.162TyrAsn: 1.162 ± 0.59
0.969TyrPro: 0.969 ± 0.318
1.162TyrGln: 1.162 ± 0.59
1.937TyrArg: 1.937 ± 0.551
5.231TyrSer: 5.231 ± 1.65
3.293TyrThr: 3.293 ± 0.807
2.712TyrVal: 2.712 ± 0.892
0.194TyrTrp: 0.194 ± 0.118
2.325TyrTyr: 2.325 ± 0.567
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.194XaaAla: 0.194 ± 0.118
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (5163 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski