Amino acid dipeptide frequency for Cereal yellow dwarf virus RPS

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
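As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping adjacent residue pairs within each protein and normalises by the total number of pairs. The function name, the one-letter toy sequences, and the choice of normaliser are assumptions made for illustration; the exact procedure used by the database is not stated here.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKVAAL", "AAGS"]
freqs = dipeptide_per_mille(toy)
```

Here the two toy proteins contain eight adjacent pairs in total, two of which are "AA", so `freqs["AA"]` comes out at 250 per mille.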

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
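The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the simple greater-than or less-than comparison is an assumption: the exact rule the page uses to colour cells (for example, whether 400 or 441 is the basis, or whether the error is taken into account) is not stated.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, Xaa included
label_val_ala = classify(11.232)  # ValAla value from the table
label_cys_cys = classify(0.34)    # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, and with Xaa included it drops to roughly 2.27‰, so ValAla (11.232‰) is labelled "over" and CysCys (0.34‰) "under" on either basis.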

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
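To illustrate the unit conversion, the sketch below parses one table row of the form "AlaAla: 5.106 ± 0.639" and converts the value and its error from per mille to fractions. The regular expression and the function name are assumptions based only on the row layout visible in the table.

```python
import re

# Two three-letter residue codes, then "value ± error" (layout assumed from the table).
ROW = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")

def parse_row(line):
    """Return (first, second, fraction, fraction_error), or None if the line is not a data row."""
    match = ROW.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3

row = parse_row("5.106AlaAla: 5.106 ± 0.639")
header = parse_row("Ala")  # row labels carry no value, so this is None
```

For the AlaAla row this yields a fraction of about 0.005106, i.e. roughly 0.51% of all dipeptides.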

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.106AlaAla: 5.106 ± 0.639
0.0AlaCys: 0.0 ± 0.0
4.765AlaAsp: 4.765 ± 1.26
5.106AlaGlu: 5.106 ± 0.879
2.723AlaPhe: 2.723 ± 0.629
7.828AlaGly: 7.828 ± 0.746
1.361AlaHis: 1.361 ± 0.437
4.425AlaIle: 4.425 ± 1.73
3.744AlaLys: 3.744 ± 0.708
4.425AlaLeu: 4.425 ± 1.297
0.681AlaMet: 0.681 ± 0.409
0.681AlaAsn: 0.681 ± 0.225
5.106AlaPro: 5.106 ± 1.557
3.404AlaGln: 3.404 ± 0.672
8.85AlaArg: 8.85 ± 1.653
8.169AlaSer: 8.169 ± 1.306
3.744AlaThr: 3.744 ± 0.68
4.765AlaVal: 4.765 ± 0.756
2.383AlaTrp: 2.383 ± 0.636
3.744AlaTyr: 3.744 ± 1.008
0.0AlaXaa: 0.0 ± 0.0
Cys
1.702CysAla: 1.702 ± 0.594
0.34CysCys: 0.34 ± 0.231
0.681CysAsp: 0.681 ± 0.462
0.0CysGlu: 0.0 ± 0.0
1.361CysPhe: 1.361 ± 0.535
1.361CysGly: 1.361 ± 0.618
0.34CysHis: 0.34 ± 0.283
2.723CysIle: 2.723 ± 0.336
1.021CysLys: 1.021 ± 0.426
1.021CysLeu: 1.021 ± 0.713
0.0CysMet: 0.0 ± 0.0
0.681CysAsn: 0.681 ± 0.462
1.702CysPro: 1.702 ± 0.66
0.34CysGln: 0.34 ± 0.364
1.361CysArg: 1.361 ± 0.535
1.702CysSer: 1.702 ± 0.66
2.042CysThr: 2.042 ± 0.794
0.34CysVal: 0.34 ± 0.364
0.0CysTrp: 0.0 ± 0.0
0.34CysTyr: 0.34 ± 0.231
0.0CysXaa: 0.0 ± 0.0
Asp
3.744AspAla: 3.744 ± 1.088
0.681AspCys: 0.681 ± 0.462
2.042AspAsp: 2.042 ± 0.713
2.383AspGlu: 2.383 ± 0.623
2.383AspPhe: 2.383 ± 0.761
1.702AspGly: 1.702 ± 0.355
0.0AspHis: 0.0 ± 0.0
1.021AspIle: 1.021 ± 0.672
2.383AspLys: 2.383 ± 0.74
4.425AspLeu: 4.425 ± 0.999
1.021AspMet: 1.021 ± 0.225
3.404AspAsn: 3.404 ± 0.65
3.404AspPro: 3.404 ± 1.032
3.404AspGln: 3.404 ± 0.392
2.723AspArg: 2.723 ± 0.853
3.744AspSer: 3.744 ± 0.604
1.361AspThr: 1.361 ± 0.263
2.042AspVal: 2.042 ± 0.481
1.361AspTrp: 1.361 ± 0.618
1.702AspTyr: 1.702 ± 0.47
0.0AspXaa: 0.0 ± 0.0
Glu
2.723GluAla: 2.723 ± 0.54
0.34GluCys: 0.34 ± 0.231
4.425GluAsp: 4.425 ± 0.844
3.063GluGlu: 3.063 ± 1.046
1.702GluPhe: 1.702 ± 0.441
2.042GluGly: 2.042 ± 0.821
1.021GluHis: 1.021 ± 0.416
3.744GluIle: 3.744 ± 0.802
3.744GluLys: 3.744 ± 0.766
5.106GluLeu: 5.106 ± 1.212
0.0GluMet: 0.0 ± 0.0
1.702GluAsn: 1.702 ± 0.701
2.383GluPro: 2.383 ± 0.634
0.681GluGln: 0.681 ± 0.462
2.042GluArg: 2.042 ± 0.576
4.425GluSer: 4.425 ± 0.909
2.723GluThr: 2.723 ± 0.747
3.404GluVal: 3.404 ± 0.992
1.702GluTrp: 1.702 ± 0.53
1.021GluTyr: 1.021 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
3.063PheAla: 3.063 ± 1.249
0.681PheCys: 0.681 ± 0.385
2.042PheAsp: 2.042 ± 0.832
0.0PheGlu: 0.0 ± 0.0
1.361PhePhe: 1.361 ± 0.263
4.765PheGly: 4.765 ± 0.423
1.361PheHis: 1.361 ± 0.31
0.681PheIle: 0.681 ± 0.225
3.063PheLys: 3.063 ± 0.382
4.084PheLeu: 4.084 ± 0.698
0.0PheMet: 0.0 ± 0.0
2.383PheAsn: 2.383 ± 0.649
3.063PhePro: 3.063 ± 0.677
1.702PheGln: 1.702 ± 0.537
1.361PheArg: 1.361 ± 0.592
3.063PheSer: 3.063 ± 0.699
2.723PheThr: 2.723 ± 0.718
2.723PheVal: 2.723 ± 1.348
0.681PheTrp: 0.681 ± 0.385
1.021PheTyr: 1.021 ± 0.225
0.0PheXaa: 0.0 ± 0.0
Gly
5.106GlyAla: 5.106 ± 1.287
1.361GlyCys: 1.361 ± 0.359
0.34GlyAsp: 0.34 ± 0.231
4.765GlyGlu: 4.765 ± 0.976
4.425GlyPhe: 4.425 ± 1.323
5.786GlyGly: 5.786 ± 1.838
3.063GlyHis: 3.063 ± 0.542
2.723GlyIle: 2.723 ± 0.746
4.084GlyLys: 4.084 ± 1.605
5.446GlyLeu: 5.446 ± 0.953
1.021GlyMet: 1.021 ± 0.493
4.084GlyAsn: 4.084 ± 1.239
5.106GlyPro: 5.106 ± 1.773
0.681GlyGln: 0.681 ± 0.267
5.786GlyArg: 5.786 ± 1.062
7.148GlySer: 7.148 ± 2.314
2.383GlyThr: 2.383 ± 0.549
4.084GlyVal: 4.084 ± 0.366
1.021GlyTrp: 1.021 ± 0.357
2.042GlyTyr: 2.042 ± 0.364
0.0GlyXaa: 0.0 ± 0.0
His
1.702HisAla: 1.702 ± 0.497
1.021HisCys: 1.021 ± 0.426
1.021HisAsp: 1.021 ± 0.416
1.021HisGlu: 1.021 ± 0.426
1.702HisPhe: 1.702 ± 0.405
1.702HisGly: 1.702 ± 0.676
0.0HisHis: 0.0 ± 0.0
1.361HisIle: 1.361 ± 0.709
1.361HisLys: 1.361 ± 0.618
0.681HisLeu: 0.681 ± 0.409
0.34HisMet: 0.34 ± 0.231
1.021HisAsn: 1.021 ± 0.416
1.361HisPro: 1.361 ± 0.592
0.681HisGln: 0.681 ± 0.515
0.681HisArg: 0.681 ± 0.385
3.063HisSer: 3.063 ± 0.865
1.702HisThr: 1.702 ± 0.355
1.361HisVal: 1.361 ± 0.263
0.0HisTrp: 0.0 ± 0.0
0.681HisTyr: 0.681 ± 0.728
0.0HisXaa: 0.0 ± 0.0
Ile
3.404IleAla: 3.404 ± 0.91
1.021IleCys: 1.021 ± 0.416
4.425IleAsp: 4.425 ± 1.143
1.361IleGlu: 1.361 ± 0.491
1.702IlePhe: 1.702 ± 0.66
0.681IleGly: 0.681 ± 0.567
1.021IleHis: 1.021 ± 0.416
0.681IleIle: 0.681 ± 0.373
1.361IleLys: 1.361 ± 0.263
2.723IleLeu: 2.723 ± 1.122
0.34IleMet: 0.34 ± 0.231
1.361IleAsn: 1.361 ± 0.592
4.425IlePro: 4.425 ± 1.259
1.361IleGln: 1.361 ± 0.437
3.404IleArg: 3.404 ± 0.71
4.425IleSer: 4.425 ± 0.862
7.488IleThr: 7.488 ± 1.587
2.723IleVal: 2.723 ± 0.328
0.0IleTrp: 0.0 ± 0.0
2.383IleTyr: 2.383 ± 0.72
0.0IleXaa: 0.0 ± 0.0
Lys
4.084LysAla: 4.084 ± 1.304
2.042LysCys: 2.042 ± 0.6
3.744LysAsp: 3.744 ± 0.629
3.404LysGlu: 3.404 ± 0.727
1.361LysPhe: 1.361 ± 0.359
3.744LysGly: 3.744 ± 0.649
0.0LysHis: 0.0 ± 0.0
4.084LysIle: 4.084 ± 1.374
2.383LysLys: 2.383 ± 1.243
3.063LysLeu: 3.063 ± 1.031
1.361LysMet: 1.361 ± 0.747
0.681LysAsn: 0.681 ± 0.225
2.042LysPro: 2.042 ± 0.481
1.021LysGln: 1.021 ± 0.416
2.042LysArg: 2.042 ± 0.642
6.127LysSer: 6.127 ± 1.503
4.084LysThr: 4.084 ± 0.483
3.063LysVal: 3.063 ± 0.966
0.34LysTrp: 0.34 ± 0.231
2.042LysTyr: 2.042 ± 0.364
0.34LysXaa: 0.34 ± 0.283
Leu
4.765LeuAla: 4.765 ± 0.622
3.063LeuCys: 3.063 ± 1.156
4.765LeuAsp: 4.765 ± 0.38
4.765LeuGlu: 4.765 ± 0.444
3.063LeuPhe: 3.063 ± 0.819
5.106LeuGly: 5.106 ± 0.971
1.361LeuHis: 1.361 ± 0.77
4.084LeuIle: 4.084 ± 0.789
1.702LeuLys: 1.702 ± 0.48
7.488LeuLeu: 7.488 ± 2.084
2.042LeuMet: 2.042 ± 0.753
1.361LeuAsn: 1.361 ± 0.557
3.404LeuPro: 3.404 ± 0.918
6.807LeuGln: 6.807 ± 1.53
6.467LeuArg: 6.467 ± 0.726
7.488LeuSer: 7.488 ± 1.166
5.106LeuThr: 5.106 ± 1.744
5.106LeuVal: 5.106 ± 0.76
3.404LeuTrp: 3.404 ± 1.134
1.702LeuTyr: 1.702 ± 0.673
0.0LeuXaa: 0.0 ± 0.0
Met
2.042MetAla: 2.042 ± 0.664
0.681MetCys: 0.681 ± 0.267
0.0MetAsp: 0.0 ± 0.0
1.021MetGlu: 1.021 ± 0.41
1.021MetPhe: 1.021 ± 0.38
1.021MetGly: 1.021 ± 0.416
0.0MetHis: 0.0 ± 0.0
1.021MetIle: 1.021 ± 0.426
0.0MetLys: 0.0 ± 0.0
1.021MetLeu: 1.021 ± 0.693
0.0MetMet: 0.0 ± 0.0
0.34MetAsn: 0.34 ± 0.384
0.0MetPro: 0.0 ± 0.0
0.34MetGln: 0.34 ± 0.277
0.0MetArg: 0.0 ± 0.0
3.404MetSer: 3.404 ± 0.775
0.34MetThr: 0.34 ± 0.283
0.681MetVal: 0.681 ± 0.481
0.34MetTrp: 0.34 ± 0.231
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.383AsnAla: 2.383 ± 0.636
0.681AsnCys: 0.681 ± 0.385
0.681AsnAsp: 0.681 ± 0.267
2.383AsnGlu: 2.383 ± 0.37
2.042AsnPhe: 2.042 ± 0.451
4.425AsnGly: 4.425 ± 3.165
0.681AsnHis: 0.681 ± 0.267
0.681AsnIle: 0.681 ± 0.225
3.404AsnLys: 3.404 ± 0.727
2.383AsnLeu: 2.383 ± 0.918
0.681AsnMet: 0.681 ± 0.255
1.702AsnAsn: 1.702 ± 0.53
1.702AsnPro: 1.702 ± 0.537
0.681AsnGln: 0.681 ± 0.567
1.021AsnArg: 1.021 ± 0.693
4.765AsnSer: 4.765 ± 1.245
2.042AsnThr: 2.042 ± 0.451
2.042AsnVal: 2.042 ± 0.653
1.361AsnTrp: 1.361 ± 0.482
0.681AsnTyr: 0.681 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
7.488ProAla: 7.488 ± 2.839
1.021ProCys: 1.021 ± 0.493
3.744ProAsp: 3.744 ± 0.766
2.383ProGlu: 2.383 ± 0.686
0.34ProPhe: 0.34 ± 0.231
5.446ProGly: 5.446 ± 0.729
1.702ProHis: 1.702 ± 0.302
1.702ProIle: 1.702 ± 0.639
3.404ProLys: 3.404 ± 0.752
3.404ProLeu: 3.404 ± 0.73
1.702ProMet: 1.702 ± 0.355
2.042ProAsn: 2.042 ± 1.014
4.765ProPro: 4.765 ± 1.727
2.723ProGln: 2.723 ± 0.698
3.744ProArg: 3.744 ± 0.609
7.488ProSer: 7.488 ± 1.566
4.425ProThr: 4.425 ± 1.249
5.446ProVal: 5.446 ± 0.85
0.34ProTrp: 0.34 ± 0.283
0.34ProTyr: 0.34 ± 0.283
0.0ProXaa: 0.0 ± 0.0
Gln
2.383GlnAla: 2.383 ± 0.599
0.0GlnCys: 0.0 ± 0.0
1.021GlnAsp: 1.021 ± 0.521
1.361GlnGlu: 1.361 ± 0.924
1.702GlnPhe: 1.702 ± 1.136
3.744GlnGly: 3.744 ± 0.857
0.0GlnHis: 0.0 ± 0.0
1.021GlnIle: 1.021 ± 0.225
2.383GlnLys: 2.383 ± 0.777
4.084GlnLeu: 4.084 ± 1.09
0.0GlnMet: 0.0 ± 0.32
2.723GlnAsn: 2.723 ± 1.028
2.383GlnPro: 2.383 ± 1.313
2.723GlnGln: 2.723 ± 0.982
2.723GlnArg: 2.723 ± 0.946
2.383GlnSer: 2.383 ± 0.591
1.702GlnThr: 1.702 ± 0.539
4.425GlnVal: 4.425 ± 0.612
0.681GlnTrp: 0.681 ± 0.481
1.021GlnTyr: 1.021 ± 0.52
0.0GlnXaa: 0.0 ± 0.0
Arg
6.807ArgAla: 6.807 ± 1.116
1.702ArgCys: 1.702 ± 0.914
1.702ArgAsp: 1.702 ± 0.549
1.702ArgGlu: 1.702 ± 0.364
2.042ArgPhe: 2.042 ± 0.731
4.084ArgGly: 4.084 ± 1.761
1.702ArgHis: 1.702 ± 0.933
3.063ArgIle: 3.063 ± 0.281
2.383ArgLys: 2.383 ± 0.391
6.807ArgLeu: 6.807 ± 2.085
0.681ArgMet: 0.681 ± 0.742
1.021ArgAsn: 1.021 ± 0.52
2.723ArgPro: 2.723 ± 1.359
2.383ArgGln: 2.383 ± 0.461
9.871ArgArg: 9.871 ± 5.465
7.148ArgSer: 7.148 ± 1.186
3.744ArgThr: 3.744 ± 0.761
3.744ArgVal: 3.744 ± 0.8
1.361ArgTrp: 1.361 ± 0.427
1.702ArgTyr: 1.702 ± 0.278
0.0ArgXaa: 0.0 ± 0.0
Ser
5.446SerAla: 5.446 ± 1.054
1.021SerCys: 1.021 ± 0.38
4.765SerAsp: 4.765 ± 0.75
5.446SerGlu: 5.446 ± 0.923
4.084SerPhe: 4.084 ± 0.915
7.148SerGly: 7.148 ± 0.743
2.042SerHis: 2.042 ± 0.698
5.446SerIle: 5.446 ± 0.998
3.744SerLys: 3.744 ± 0.88
9.53SerLeu: 9.53 ± 1.949
0.681SerMet: 0.681 ± 0.514
4.425SerAsn: 4.425 ± 0.548
9.19SerPro: 9.19 ± 1.431
4.765SerGln: 4.765 ± 1.965
4.765SerArg: 4.765 ± 0.971
9.871SerSer: 9.871 ± 1.826
5.786SerThr: 5.786 ± 1.114
8.169SerVal: 8.169 ± 1.266
1.702SerTrp: 1.702 ± 0.405
3.744SerTyr: 3.744 ± 0.631
0.0SerXaa: 0.0 ± 0.0
Thr
5.786ThrAla: 5.786 ± 0.812
1.361ThrCys: 1.361 ± 0.535
2.383ThrAsp: 2.383 ± 1.179
2.042ThrGlu: 2.042 ± 0.409
2.723ThrPhe: 2.723 ± 0.688
4.425ThrGly: 4.425 ± 0.877
2.723ThrHis: 2.723 ± 0.997
3.404ThrIle: 3.404 ± 0.776
4.425ThrLys: 4.425 ± 0.597
5.446ThrLeu: 5.446 ± 0.6
1.361ThrMet: 1.361 ± 0.581
3.063ThrAsn: 3.063 ± 0.548
4.425ThrPro: 4.425 ± 1.571
1.361ThrGln: 1.361 ± 0.834
3.063ThrArg: 3.063 ± 0.546
6.807ThrSer: 6.807 ± 1.228
5.106ThrThr: 5.106 ± 1.088
2.042ThrVal: 2.042 ± 0.905
0.34ThrTrp: 0.34 ± 0.231
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
11.232ValAla: 11.232 ± 1.654
1.361ValCys: 1.361 ± 0.535
1.021ValAsp: 1.021 ± 0.78
2.383ValGlu: 2.383 ± 0.699
2.042ValPhe: 2.042 ± 0.609
3.404ValGly: 3.404 ± 0.881
2.042ValHis: 2.042 ± 0.688
3.744ValIle: 3.744 ± 0.857
2.042ValLys: 2.042 ± 0.653
6.467ValLeu: 6.467 ± 0.938
0.681ValMet: 0.681 ± 0.267
0.681ValAsn: 0.681 ± 0.462
3.744ValPro: 3.744 ± 0.845
2.723ValGln: 2.723 ± 0.728
4.425ValArg: 4.425 ± 0.723
4.765ValSer: 4.765 ± 0.602
3.063ValThr: 3.063 ± 0.876
4.765ValVal: 4.765 ± 1.782
1.361ValTrp: 1.361 ± 0.427
2.042ValTyr: 2.042 ± 0.481
0.0ValXaa: 0.0 ± 0.0
Trp
1.361TrpAla: 1.361 ± 0.263
0.0TrpCys: 0.0 ± 0.0
0.681TrpAsp: 0.681 ± 0.462
1.021TrpGlu: 1.021 ± 0.416
1.021TrpPhe: 1.021 ± 0.225
0.681TrpGly: 0.681 ± 0.385
0.681TrpHis: 0.681 ± 0.517
0.681TrpIle: 0.681 ± 0.267
1.021TrpLys: 1.021 ± 0.225
2.723TrpLeu: 2.723 ± 0.997
0.34TrpMet: 0.34 ± 0.231
0.34TrpAsn: 0.34 ± 0.283
1.021TrpPro: 1.021 ± 0.416
0.34TrpGln: 0.34 ± 0.231
1.021TrpArg: 1.021 ± 0.416
1.361TrpSer: 1.361 ± 0.461
2.383TrpThr: 2.383 ± 0.388
1.361TrpVal: 1.361 ± 0.535
0.0TrpTrp: 0.0 ± 0.0
0.681TrpTyr: 0.681 ± 0.46
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.681TyrAla: 0.681 ± 0.46
0.681TyrCys: 0.681 ± 0.385
0.681TyrAsp: 0.681 ± 0.567
2.383TyrGlu: 2.383 ± 0.429
1.021TyrPhe: 1.021 ± 0.416
1.361TyrGly: 1.361 ± 0.92
1.361TyrHis: 1.361 ± 0.567
0.0TyrIle: 0.0 ± 0.0
3.404TyrLys: 3.404 ± 0.605
2.723TyrLeu: 2.723 ± 0.812
0.0TyrMet: 0.0 ± 0.0
2.723TyrAsn: 2.723 ± 0.538
1.702TyrPro: 1.702 ± 0.852
0.681TyrGln: 0.681 ± 0.481
1.021TyrArg: 1.021 ± 0.52
4.425TyrSer: 4.425 ± 0.691
0.34TyrThr: 0.34 ± 0.231
1.361TyrVal: 1.361 ± 0.31
0.34TyrTrp: 0.34 ± 0.283
1.702TyrTyr: 1.702 ± 0.676
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.34XaaVal: 0.34 ± 0.283
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2939 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
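A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the sample standard deviation as the error, the fixed seed, and all names are assumptions; the note above only states that 100 bootstrap rounds were run at the protein level.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in sequences]
    values = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        values.append(1000.0 * hits / total if total else 0.0)
    return stdev(values)

# Hypothetical toy proteome (one-letter codes).
toy = ["MKVAAL", "AAGS", "GSGSAA"]
err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual residue positions.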

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski