Amino acid dipeptide frequency for Barley yellow dwarf virus (isolate MAV-PS1) (BYDV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
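
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), and the choice not to count pairs that span two proteins are assumptions made for this example; the page does not describe the exact counting procedure used by the database.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Hypothetical helper for illustration only; sequences use one-letter codes.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 over each protein; windows overlap,
        # and no pair is formed across the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the BYDV proteome.
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The frequencies of all observed pairs sum to 1000 per mille, which is a quick sanity check on any such table.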

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
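
The uniform expectation described above can be checked with a few lines of arithmetic. This is a minimal sketch: the helper names `expected_per_mille` and `representation` are invented for the example, and the simple greater-than or less-than comparison is an assumption, since the page does not state the exact rule used to colour the cells. The three observed values are copied from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation: every ordered residue pair is equally likely."""
    return 1000.0 / n_residue_types ** 2


def representation(observed_per_mille, expected):
    """Label a value relative to the expectation (assumed rule, see lead-in)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)       # 400 pairs -> 2.5 per mille (0.25%)
expected_xaa = expected_per_mille(21)   # 441 pairs when Xaa is included

# Values taken from the table, in per mille.
observed = {"AlaSer": 8.88, "AlaCys": 0.772, "GluGlu": 7.722}
for pair, value in observed.items():
    print(pair, value, representation(value, expected))
```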

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.405AlaAla: 5.405 ± 2.177
0.772AlaCys: 0.772 ± 0.36
2.317AlaAsp: 2.317 ± 0.891
5.792AlaGlu: 5.792 ± 1.9
2.317AlaPhe: 2.317 ± 0.51
4.633AlaGly: 4.633 ± 1.875
1.158AlaHis: 1.158 ± 0.592
3.475AlaIle: 3.475 ± 1.275
3.475AlaLys: 3.475 ± 1.176
3.861AlaLeu: 3.861 ± 1.29
1.544AlaMet: 1.544 ± 0.509
3.861AlaAsn: 3.861 ± 2.061
4.247AlaPro: 4.247 ± 0.791
5.792AlaGln: 5.792 ± 2.21
6.564AlaArg: 6.564 ± 2.429
8.88AlaSer: 8.88 ± 1.018
1.931AlaThr: 1.931 ± 0.726
4.633AlaVal: 4.633 ± 0.642
1.158AlaTrp: 1.158 ± 0.445
1.158AlaTyr: 1.158 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.386CysAsp: 0.386 ± 0.384
1.158CysGlu: 1.158 ± 0.445
1.544CysPhe: 1.544 ± 0.917
2.317CysGly: 2.317 ± 0.977
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.158CysLys: 1.158 ± 0.358
1.158CysLeu: 1.158 ± 0.539
0.0CysMet: 0.0 ± 0.0
1.544CysAsn: 1.544 ± 0.72
1.931CysPro: 1.931 ± 0.716
0.772CysGln: 0.772 ± 0.36
0.0CysArg: 0.0 ± 0.0
0.386CysSer: 0.386 ± 0.558
0.772CysThr: 0.772 ± 0.36
1.544CysVal: 1.544 ± 1.142
0.772CysTrp: 0.772 ± 0.458
0.772CysTyr: 0.772 ± 0.458
0.0CysXaa: 0.0 ± 0.0
Asp
4.633AspAla: 4.633 ± 0.705
0.772AspCys: 0.772 ± 0.36
1.931AspAsp: 1.931 ± 0.852
4.247AspGlu: 4.247 ± 1.023
2.317AspPhe: 2.317 ± 1.375
3.089AspGly: 3.089 ± 0.938
1.158AspHis: 1.158 ± 0.358
5.405AspIle: 5.405 ± 1.15
3.475AspLys: 3.475 ± 0.762
4.247AspLeu: 4.247 ± 1.647
0.0AspMet: 0.0 ± 0.0
2.317AspAsn: 2.317 ± 0.788
1.544AspPro: 1.544 ± 0.65
2.317AspGln: 2.317 ± 1.118
1.544AspArg: 1.544 ± 0.65
4.633AspSer: 4.633 ± 1.074
4.247AspThr: 4.247 ± 0.907
3.861AspVal: 3.861 ± 1.095
0.0AspTrp: 0.0 ± 0.0
1.544AspTyr: 1.544 ± 0.65
0.0AspXaa: 0.0 ± 0.0
Glu
6.564GluAla: 6.564 ± 2.322
0.0GluCys: 0.0 ± 0.0
3.089GluAsp: 3.089 ± 0.721
7.722GluGlu: 7.722 ± 2.215
3.475GluPhe: 3.475 ± 0.803
1.544GluGly: 1.544 ± 0.541
2.317GluHis: 2.317 ± 1.079
2.317GluIle: 2.317 ± 0.929
6.95GluLys: 6.95 ± 1.646
5.405GluLeu: 5.405 ± 0.317
1.544GluMet: 1.544 ± 0.693
2.317GluAsn: 2.317 ± 0.51
3.089GluPro: 3.089 ± 1.12
4.247GluGln: 4.247 ± 1.066
5.405GluArg: 5.405 ± 1.15
3.475GluSer: 3.475 ± 0.81
1.931GluThr: 1.931 ± 0.818
6.564GluVal: 6.564 ± 2.273
0.386GluTrp: 0.386 ± 0.497
2.703GluTyr: 2.703 ± 1.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.703PheAla: 2.703 ± 1.005
0.772PheCys: 0.772 ± 0.36
1.931PheAsp: 1.931 ± 0.607
3.475PheGlu: 3.475 ± 1.176
1.931PhePhe: 1.931 ± 0.818
4.247PheGly: 4.247 ± 0.594
0.386PheHis: 0.386 ± 0.384
3.475PheIle: 3.475 ± 0.98
5.019PheLys: 5.019 ± 1.436
2.317PheLeu: 2.317 ± 1.1
0.0PheMet: 0.0 ± 0.404
1.544PheAsn: 1.544 ± 0.72
1.158PhePro: 1.158 ± 0.358
2.317PheGln: 2.317 ± 0.669
0.772PheArg: 0.772 ± 0.756
0.386PheSer: 0.386 ± 0.558
3.861PheThr: 3.861 ± 0.968
2.703PheVal: 2.703 ± 0.758
0.0PheTrp: 0.0 ± 0.0
1.544PheTyr: 1.544 ± 0.509
0.0PheXaa: 0.0 ± 0.0
Gly
4.633GlyAla: 4.633 ± 1.43
0.772GlyCys: 0.772 ± 0.751
4.247GlyAsp: 4.247 ± 1.21
1.931GlyGlu: 1.931 ± 0.989
3.861GlyPhe: 3.861 ± 1.146
2.703GlyGly: 2.703 ± 1.673
1.931GlyHis: 1.931 ± 0.355
1.158GlyIle: 1.158 ± 0.531
3.475GlyLys: 3.475 ± 1.322
5.405GlyLeu: 5.405 ± 0.972
1.931GlyMet: 1.931 ± 0.607
1.544GlyAsn: 1.544 ± 0.65
1.544GlyPro: 1.544 ± 2.233
2.703GlyGln: 2.703 ± 1.202
4.247GlyArg: 4.247 ± 1.065
2.317GlySer: 2.317 ± 1.123
3.475GlyThr: 3.475 ± 1.262
5.792GlyVal: 5.792 ± 1.622
0.386GlyTrp: 0.386 ± 0.384
3.089GlyTyr: 3.089 ± 0.93
0.0GlyXaa: 0.0 ± 0.0
His
1.931HisAla: 1.931 ± 0.716
0.772HisCys: 0.772 ± 0.458
1.544HisAsp: 1.544 ± 0.535
0.386HisGlu: 0.386 ± 0.384
0.772HisPhe: 0.772 ± 0.36
0.772HisGly: 0.772 ± 0.36
0.0HisHis: 0.0 ± 0.0
0.386HisIle: 0.386 ± 0.384
0.772HisLys: 0.772 ± 0.36
2.703HisLeu: 2.703 ± 0.706
0.772HisMet: 0.772 ± 0.458
1.931HisAsn: 1.931 ± 0.77
0.0HisPro: 0.0 ± 0.0
2.317HisGln: 2.317 ± 0.788
0.772HisArg: 0.772 ± 0.36
1.931HisSer: 1.931 ± 0.77
0.772HisThr: 0.772 ± 0.63
1.544HisVal: 1.544 ± 0.623
0.772HisTrp: 0.772 ± 0.36
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.178IleAla: 6.178 ± 0.94
0.772IleCys: 0.772 ± 0.36
1.931IleAsp: 1.931 ± 0.728
2.703IleGlu: 2.703 ± 0.458
1.158IlePhe: 1.158 ± 0.358
4.247IleGly: 4.247 ± 0.763
0.0IleHis: 0.0 ± 0.0
3.475IleIle: 3.475 ± 1.322
5.019IleLys: 5.019 ± 1.726
5.405IleLeu: 5.405 ± 0.696
2.317IleMet: 2.317 ± 0.788
2.703IleAsn: 2.703 ± 0.706
3.089IlePro: 3.089 ± 0.547
1.158IleGln: 1.158 ± 0.358
1.544IleArg: 1.544 ± 0.509
4.247IleSer: 4.247 ± 1.056
4.633IleThr: 4.633 ± 0.753
1.158IleVal: 1.158 ± 0.358
0.0IleTrp: 0.0 ± 0.0
2.317IleTyr: 2.317 ± 0.788
0.0IleXaa: 0.0 ± 0.0
Lys
6.95LysAla: 6.95 ± 1.402
0.772LysCys: 0.772 ± 0.36
6.95LysAsp: 6.95 ± 2.788
4.633LysGlu: 4.633 ± 1.022
3.475LysPhe: 3.475 ± 0.362
1.544LysGly: 1.544 ± 0.535
1.544LysHis: 1.544 ± 0.535
2.703LysIle: 2.703 ± 0.706
5.405LysLys: 5.405 ± 1.882
7.722LysLeu: 7.722 ± 2.523
2.317LysMet: 2.317 ± 0.747
1.931LysAsn: 1.931 ± 0.355
2.703LysPro: 2.703 ± 0.772
2.317LysGln: 2.317 ± 0.51
3.475LysArg: 3.475 ± 0.362
6.95LysSer: 6.95 ± 1.492
3.475LysThr: 3.475 ± 0.627
4.633LysVal: 4.633 ± 1.222
1.544LysTrp: 1.544 ± 0.72
1.931LysTyr: 1.931 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
4.247LeuAla: 4.247 ± 1.209
2.317LeuCys: 2.317 ± 0.977
2.703LeuAsp: 2.703 ± 0.88
7.722LeuGlu: 7.722 ± 1.835
0.772LeuPhe: 0.772 ± 0.36
5.019LeuGly: 5.019 ± 1.074
1.158LeuHis: 1.158 ± 0.587
3.089LeuIle: 3.089 ± 0.409
7.336LeuLys: 7.336 ± 1.899
5.792LeuLeu: 5.792 ± 0.998
1.931LeuMet: 1.931 ± 0.818
3.089LeuAsn: 3.089 ± 0.533
1.931LeuPro: 1.931 ± 1.005
3.861LeuGln: 3.861 ± 0.803
5.792LeuArg: 5.792 ± 1.265
6.95LeuSer: 6.95 ± 1.132
3.475LeuThr: 3.475 ± 1.789
4.247LeuVal: 4.247 ± 1.209
0.386LeuTrp: 0.386 ± 0.497
3.089LeuTyr: 3.089 ± 0.927
0.0LeuXaa: 0.0 ± 0.0
Met
1.544MetAla: 1.544 ± 0.585
1.544MetCys: 1.544 ± 0.535
2.317MetAsp: 2.317 ± 0.592
1.931MetGlu: 1.931 ± 0.818
2.703MetPhe: 2.703 ± 0.884
0.772MetGly: 0.772 ± 0.36
1.544MetHis: 1.544 ± 0.72
0.0MetIle: 0.0 ± 0.0
1.158MetLys: 1.158 ± 0.445
1.931MetLeu: 1.931 ± 0.716
0.772MetMet: 0.772 ± 0.36
1.158MetAsn: 1.158 ± 0.925
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
3.475MetSer: 3.475 ± 0.871
1.544MetThr: 1.544 ± 0.509
2.703MetVal: 2.703 ± 0.706
0.0MetTrp: 0.0 ± 0.0
1.544MetTyr: 1.544 ± 0.72
0.0MetXaa: 0.0 ± 0.0
Asn
4.247AsnAla: 4.247 ± 1.209
0.772AsnCys: 0.772 ± 0.458
1.931AsnAsp: 1.931 ± 0.726
2.703AsnGlu: 2.703 ± 0.932
0.772AsnPhe: 0.772 ± 0.36
3.861AsnGly: 3.861 ± 1.129
1.544AsnHis: 1.544 ± 0.72
3.089AsnIle: 3.089 ± 0.93
1.931AsnLys: 1.931 ± 0.355
1.544AsnLeu: 1.544 ± 0.509
1.931AsnMet: 1.931 ± 0.568
2.317AsnAsn: 2.317 ± 0.583
1.158AsnPro: 1.158 ± 0.918
1.931AsnGln: 1.931 ± 1.06
2.317AsnArg: 2.317 ± 1.649
4.633AsnSer: 4.633 ± 1.779
1.544AsnThr: 1.544 ± 0.623
3.089AsnVal: 3.089 ± 0.409
0.0AsnTrp: 0.0 ± 0.0
0.772AsnTyr: 0.772 ± 0.767
0.0AsnXaa: 0.0 ± 0.0
Pro
1.931ProAla: 1.931 ± 1.006
0.0ProCys: 0.0 ± 0.0
3.089ProAsp: 3.089 ± 1.464
4.633ProGlu: 4.633 ± 1.075
0.772ProPhe: 0.772 ± 0.36
0.772ProGly: 0.772 ± 0.767
0.772ProHis: 0.772 ± 0.36
5.405ProIle: 5.405 ± 0.713
4.247ProLys: 4.247 ± 2.325
1.544ProLeu: 1.544 ± 0.585
0.0ProMet: 0.0 ± 0.0
1.544ProAsn: 1.544 ± 0.676
2.317ProPro: 2.317 ± 1.118
1.544ProGln: 1.544 ± 1.258
3.089ProArg: 3.089 ± 0.825
1.931ProSer: 1.931 ± 1.527
3.861ProThr: 3.861 ± 1.37
3.861ProVal: 3.861 ± 0.773
0.386ProTrp: 0.386 ± 0.497
1.544ProTyr: 1.544 ± 0.65
0.0ProXaa: 0.0 ± 0.0
Gln
3.089GlnAla: 3.089 ± 1.019
1.544GlnCys: 1.544 ± 0.676
2.317GlnAsp: 2.317 ± 0.929
0.772GlnGlu: 0.772 ± 0.63
2.703GlnPhe: 2.703 ± 1.296
3.089GlnGly: 3.089 ± 1.444
1.544GlnHis: 1.544 ± 0.535
2.703GlnIle: 2.703 ± 0.657
3.475GlnLys: 3.475 ± 0.871
2.317GlnLeu: 2.317 ± 1.079
0.0GlnMet: 0.0 ± 0.0
1.158GlnAsn: 1.158 ± 1.068
3.475GlnPro: 3.475 ± 2.555
1.931GlnGln: 1.931 ± 1.06
0.772GlnArg: 0.772 ± 0.63
7.336GlnSer: 7.336 ± 1.548
1.931GlnThr: 1.931 ± 0.768
1.544GlnVal: 1.544 ± 0.585
0.0GlnTrp: 0.0 ± 0.0
0.772GlnTyr: 0.772 ± 0.767
0.0GlnXaa: 0.0 ± 0.0
Arg
5.792ArgAla: 5.792 ± 1.34
0.0ArgCys: 0.0 ± 0.0
2.317ArgAsp: 2.317 ± 1.389
3.089ArgGlu: 3.089 ± 1.105
2.703ArgPhe: 2.703 ± 0.772
3.089ArgGly: 3.089 ± 0.664
0.386ArgHis: 0.386 ± 0.384
1.158ArgIle: 1.158 ± 0.358
1.158ArgLys: 1.158 ± 0.358
6.178ArgLeu: 6.178 ± 1.755
4.247ArgMet: 4.247 ± 0.629
2.317ArgAsn: 2.317 ± 1.431
2.703ArgPro: 2.703 ± 1.009
1.544ArgGln: 1.544 ± 0.693
6.95ArgArg: 6.95 ± 4.523
4.633ArgSer: 4.633 ± 1.105
3.089ArgThr: 3.089 ± 1.017
1.931ArgVal: 1.931 ± 1.143
0.772ArgTrp: 0.772 ± 0.458
4.633ArgTyr: 4.633 ± 1.124
0.0ArgXaa: 0.0 ± 0.0
Ser
3.861SerAla: 3.861 ± 1.603
2.317SerCys: 2.317 ± 0.51
2.703SerAsp: 2.703 ± 1.398
4.247SerGlu: 4.247 ± 0.854
3.475SerPhe: 3.475 ± 0.362
5.792SerGly: 5.792 ± 1.335
2.317SerHis: 2.317 ± 0.624
6.178SerIle: 6.178 ± 1.419
4.247SerLys: 4.247 ± 0.469
5.405SerLeu: 5.405 ± 1.058
2.317SerMet: 2.317 ± 0.788
1.158SerAsn: 1.158 ± 0.918
4.633SerPro: 4.633 ± 1.413
3.861SerGln: 3.861 ± 1.969
4.633SerArg: 4.633 ± 1.105
3.861SerSer: 3.861 ± 2.0
8.108SerThr: 8.108 ± 2.877
5.792SerVal: 5.792 ± 0.952
0.0SerTrp: 0.0 ± 0.0
4.633SerTyr: 4.633 ± 1.221
0.0SerXaa: 0.0 ± 0.0
Thr
3.861ThrAla: 3.861 ± 1.388
0.0ThrCys: 0.0 ± 0.0
4.633ThrAsp: 4.633 ± 1.074
4.247ThrGlu: 4.247 ± 0.978
1.931ThrPhe: 1.931 ± 1.06
1.931ThrGly: 1.931 ± 0.935
1.158ThrHis: 1.158 ± 0.539
4.247ThrIle: 4.247 ± 1.69
3.475ThrLys: 3.475 ± 0.905
3.475ThrLeu: 3.475 ± 1.344
1.544ThrMet: 1.544 ± 0.532
3.475ThrAsn: 3.475 ± 0.362
3.475ThrPro: 3.475 ± 0.989
0.0ThrGln: 0.0 ± 0.0
4.633ThrArg: 4.633 ± 1.736
3.475ThrSer: 3.475 ± 1.817
2.703ThrThr: 2.703 ± 1.136
5.405ThrVal: 5.405 ± 1.892
1.544ThrTrp: 1.544 ± 0.609
1.931ThrTyr: 1.931 ± 0.355
0.0ThrXaa: 0.0 ± 0.0
Val
4.633ValAla: 4.633 ± 1.037
0.386ValCys: 0.386 ± 0.61
4.247ValAsp: 4.247 ± 1.066
6.95ValGlu: 6.95 ± 0.809
2.703ValPhe: 2.703 ± 1.25
5.019ValGly: 5.019 ± 0.797
0.0ValHis: 0.0 ± 0.0
3.089ValIle: 3.089 ± 1.07
7.722ValLys: 7.722 ± 2.059
5.019ValLeu: 5.019 ± 1.305
0.386ValMet: 0.386 ± 0.532
2.317ValAsn: 2.317 ± 1.184
2.703ValPro: 2.703 ± 0.567
2.317ValGln: 2.317 ± 0.374
4.247ValArg: 4.247 ± 0.541
5.019ValSer: 5.019 ± 1.348
3.475ValThr: 3.475 ± 0.871
2.703ValVal: 2.703 ± 1.454
0.0ValTrp: 0.0 ± 0.0
3.861ValTyr: 3.861 ± 1.3
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.386TrpCys: 0.386 ± 0.558
0.0TrpAsp: 0.0 ± 0.0
1.544TrpGlu: 1.544 ± 0.72
0.0TrpPhe: 0.0 ± 0.0
0.386TrpGly: 0.386 ± 0.384
0.0TrpHis: 0.0 ± 0.0
1.158TrpIle: 1.158 ± 0.445
0.0TrpLys: 0.0 ± 0.0
1.158TrpLeu: 1.158 ± 0.592
1.158TrpMet: 1.158 ± 0.358
1.158TrpAsn: 1.158 ± 0.445
0.0TrpPro: 0.0 ± 0.0
0.772TrpGln: 0.772 ± 0.36
0.386TrpArg: 0.386 ± 0.384
0.386TrpSer: 0.386 ± 0.497
0.772TrpThr: 0.772 ± 0.458
0.386TrpVal: 0.386 ± 0.497
0.386TrpTrp: 0.386 ± 0.384
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.772TyrAla: 0.772 ± 0.458
1.544TyrCys: 1.544 ± 0.535
2.703TyrAsp: 2.703 ± 0.932
1.158TyrGlu: 1.158 ± 0.358
1.544TyrPhe: 1.544 ± 0.72
2.317TyrGly: 2.317 ± 0.717
1.931TyrHis: 1.931 ± 0.716
1.544TyrIle: 1.544 ± 0.609
3.475TyrLys: 3.475 ± 1.057
2.317TyrLeu: 2.317 ± 0.458
1.544TyrMet: 1.544 ± 0.689
2.703TyrAsn: 2.703 ± 0.813
1.544TyrPro: 1.544 ± 0.72
0.772TyrGln: 0.772 ± 0.767
1.544TyrArg: 1.544 ± 0.72
4.633TyrSer: 4.633 ± 1.485
1.544TyrThr: 1.544 ± 0.65
2.703TyrVal: 2.703 ± 1.452
1.544TyrTrp: 1.544 ± 0.65
1.544TyrTyr: 1.544 ± 0.535
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2591 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
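
One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency of a dipeptide is recomputed for each resampled set, and the spread of those estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation are assumptions for this example; the exact procedure used by the database is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not the BYDV sequences.
proteins = ["MKAALSS", "AAKLLE", "MSSTVKAA", "GLEEK"]
print(pair_per_mille(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

With only a handful of proteins, as here and in the 7-protein proteome above, resampled sets differ strongly from one another, so errors that are large relative to the mean are to be expected.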

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski