Amino acid dipepetide frequency for Melandrium yellow fleck virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.979AlaAla: 3.979 ± 2.584
2.21AlaCys: 2.21 ± 0.592
6.189AlaAsp: 6.189 ± 2.027
2.653AlaGlu: 2.653 ± 1.407
3.537AlaPhe: 3.537 ± 0.511
2.21AlaGly: 2.21 ± 1.167
0.442AlaHis: 0.442 ± 0.346
7.958AlaIle: 7.958 ± 1.576
3.979AlaLys: 3.979 ± 1.437
4.863AlaLeu: 4.863 ± 1.923
3.095AlaMet: 3.095 ± 1.197
0.884AlaAsn: 0.884 ± 0.638
1.768AlaPro: 1.768 ± 0.931
2.21AlaGln: 2.21 ± 0.592
3.095AlaArg: 3.095 ± 1.054
2.653AlaSer: 2.653 ± 1.945
3.095AlaThr: 3.095 ± 0.604
4.421AlaVal: 4.421 ± 1.166
0.0AlaTrp: 0.0 ± 0.0
3.537AlaTyr: 3.537 ± 1.113
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.326CysCys: 1.326 ± 0.503
3.095CysAsp: 3.095 ± 1.722
0.884CysGlu: 0.884 ± 0.692
2.653CysPhe: 2.653 ± 1.045
1.326CysGly: 1.326 ± 0.957
0.0CysHis: 0.0 ± 0.0
0.884CysIle: 0.884 ± 0.638
2.21CysLys: 2.21 ± 1.167
1.326CysLeu: 1.326 ± 0.957
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.653CysPro: 2.653 ± 1.178
0.0CysGln: 0.0 ± 0.0
1.768CysArg: 1.768 ± 0.79
0.442CysSer: 0.442 ± 0.346
0.442CysThr: 0.442 ± 0.774
1.768CysVal: 1.768 ± 0.988
0.0CysTrp: 0.0 ± 0.0
0.442CysTyr: 0.442 ± 0.319
0.0CysXaa: 0.0 ± 0.0
Asp
4.863AspAla: 4.863 ± 1.75
1.768AspCys: 1.768 ± 0.479
4.863AspAsp: 4.863 ± 1.299
3.979AspGlu: 3.979 ± 0.569
3.537AspPhe: 3.537 ± 1.192
3.979AspGly: 3.979 ± 1.371
0.884AspHis: 0.884 ± 0.728
2.21AspIle: 2.21 ± 1.05
5.305AspLys: 5.305 ± 1.057
4.421AspLeu: 4.421 ± 0.514
0.884AspMet: 0.884 ± 0.29
2.21AspAsn: 2.21 ± 1.186
4.421AspPro: 4.421 ± 2.424
1.326AspGln: 1.326 ± 0.503
4.421AspArg: 4.421 ± 1.786
3.979AspSer: 3.979 ± 1.718
3.979AspThr: 3.979 ± 1.661
6.631AspVal: 6.631 ± 2.012
3.537AspTrp: 3.537 ± 1.251
3.979AspTyr: 3.979 ± 1.262
0.0AspXaa: 0.0 ± 0.0
Glu
3.537GluAla: 3.537 ± 1.251
1.326GluCys: 1.326 ± 1.438
3.095GluAsp: 3.095 ± 1.346
4.421GluGlu: 4.421 ± 0.689
3.095GluPhe: 3.095 ± 1.03
0.442GluGly: 0.442 ± 0.319
1.326GluHis: 1.326 ± 0.74
5.305GluIle: 5.305 ± 2.049
0.884GluLys: 0.884 ± 0.728
5.305GluLeu: 5.305 ± 0.631
1.326GluMet: 1.326 ± 0.527
4.421GluAsn: 4.421 ± 1.452
3.095GluPro: 3.095 ± 1.521
2.653GluGln: 2.653 ± 1.054
3.979GluArg: 3.979 ± 1.954
3.537GluSer: 3.537 ± 1.116
3.095GluThr: 3.095 ± 1.054
6.631GluVal: 6.631 ± 0.873
0.442GluTrp: 0.442 ± 0.319
3.537GluTyr: 3.537 ± 1.439
0.0GluXaa: 0.0 ± 0.0
Phe
0.884PheAla: 0.884 ± 0.29
0.884PheCys: 0.884 ± 0.638
3.537PheAsp: 3.537 ± 0.474
3.095PheGlu: 3.095 ± 1.03
1.326PhePhe: 1.326 ± 0.554
2.653PheGly: 2.653 ± 0.509
1.326PheHis: 1.326 ± 1.038
2.21PheIle: 2.21 ± 0.353
3.537PheLys: 3.537 ± 1.089
2.653PheLeu: 2.653 ± 1.107
0.442PheMet: 0.442 ± 0.346
1.768PheAsn: 1.768 ± 0.557
2.653PhePro: 2.653 ± 1.005
3.095PheGln: 3.095 ± 1.521
2.21PheArg: 2.21 ± 0.756
4.421PheSer: 4.421 ± 1.026
3.095PheThr: 3.095 ± 1.051
2.21PheVal: 2.21 ± 0.592
0.0PheTrp: 0.0 ± 0.0
0.442PheTyr: 0.442 ± 0.319
0.0PheXaa: 0.0 ± 0.0
Gly
2.653GlyAla: 2.653 ± 1.056
1.326GlyCys: 1.326 ± 0.957
3.537GlyAsp: 3.537 ± 1.361
1.768GlyGlu: 1.768 ± 0.876
1.326GlyPhe: 1.326 ± 0.503
4.421GlyGly: 4.421 ± 0.683
1.326GlyHis: 1.326 ± 0.809
1.326GlyIle: 1.326 ± 0.74
4.421GlyLys: 4.421 ± 1.026
5.747GlyLeu: 5.747 ± 1.483
0.442GlyMet: 0.442 ± 0.774
1.768GlyAsn: 1.768 ± 0.479
0.442GlyPro: 0.442 ± 0.319
1.768GlyGln: 1.768 ± 0.988
2.21GlyArg: 2.21 ± 1.817
6.189GlySer: 6.189 ± 1.279
3.095GlyThr: 3.095 ± 0.504
6.631GlyVal: 6.631 ± 2.644
0.442GlyTrp: 0.442 ± 0.319
1.768GlyTyr: 1.768 ± 0.79
0.0GlyXaa: 0.0 ± 0.0
His
1.768HisAla: 1.768 ± 1.052
0.442HisCys: 0.442 ± 0.319
0.884HisAsp: 0.884 ± 0.29
1.768HisGlu: 1.768 ± 1.276
1.326HisPhe: 1.326 ± 0.503
2.653HisGly: 2.653 ± 0.862
1.326HisHis: 1.326 ± 0.554
0.0HisIle: 0.0 ± 0.0
0.442HisLys: 0.442 ± 0.319
3.537HisLeu: 3.537 ± 0.995
0.884HisMet: 0.884 ± 0.748
2.21HisAsn: 2.21 ± 0.756
0.0HisPro: 0.0 ± 0.0
0.442HisGln: 0.442 ± 0.319
1.768HisArg: 1.768 ± 0.876
0.884HisSer: 0.884 ± 0.638
0.442HisThr: 0.442 ± 0.551
0.884HisVal: 0.884 ± 0.974
0.0HisTrp: 0.0 ± 0.0
1.326HisTyr: 1.326 ± 0.957
0.0HisXaa: 0.0 ± 0.0
Ile
5.747IleAla: 5.747 ± 2.107
0.442IleCys: 0.442 ± 0.319
3.979IleAsp: 3.979 ± 0.767
2.653IleGlu: 2.653 ± 0.782
2.21IlePhe: 2.21 ± 0.671
2.653IleGly: 2.653 ± 1.552
0.442IleHis: 0.442 ± 0.319
1.768IleIle: 1.768 ± 0.574
4.421IleLys: 4.421 ± 1.116
4.421IleLeu: 4.421 ± 1.975
2.21IleMet: 2.21 ± 1.095
3.537IleAsn: 3.537 ± 1.05
2.21IlePro: 2.21 ± 0.671
1.326IleGln: 1.326 ± 1.098
0.884IleArg: 0.884 ± 0.638
6.631IleSer: 6.631 ± 1.402
3.537IleThr: 3.537 ± 1.932
3.095IleVal: 3.095 ± 1.158
0.0IleTrp: 0.0 ± 0.0
0.442IleTyr: 0.442 ± 0.551
0.0IleXaa: 0.0 ± 0.0
Lys
5.305LysAla: 5.305 ± 1.562
2.21LysCys: 2.21 ± 1.095
4.863LysAsp: 4.863 ± 1.497
3.095LysGlu: 3.095 ± 1.197
3.537LysPhe: 3.537 ± 1.162
3.979LysGly: 3.979 ± 0.239
1.326LysHis: 1.326 ± 0.503
3.095LysIle: 3.095 ± 0.795
2.653LysLys: 2.653 ± 1.045
5.747LysLeu: 5.747 ± 1.448
1.768LysMet: 1.768 ± 0.951
2.653LysAsn: 2.653 ± 0.378
4.421LysPro: 4.421 ± 2.993
1.768LysGln: 1.768 ± 0.479
4.863LysArg: 4.863 ± 1.359
7.958LysSer: 7.958 ± 1.614
3.095LysThr: 3.095 ± 0.604
3.095LysVal: 3.095 ± 1.533
1.326LysTrp: 1.326 ± 0.809
2.21LysTyr: 2.21 ± 0.813
0.0LysXaa: 0.0 ± 0.0
Leu
3.979LeuAla: 3.979 ± 1.114
1.768LeuCys: 1.768 ± 0.479
7.515LeuAsp: 7.515 ± 2.692
7.073LeuGlu: 7.073 ± 2.28
2.653LeuPhe: 2.653 ± 0.509
5.747LeuGly: 5.747 ± 2.001
2.653LeuHis: 2.653 ± 1.198
3.095LeuIle: 3.095 ± 1.872
7.073LeuLys: 7.073 ± 1.802
7.515LeuLeu: 7.515 ± 0.931
1.326LeuMet: 1.326 ± 0.957
6.631LeuAsn: 6.631 ± 1.969
4.421LeuPro: 4.421 ± 1.792
1.326LeuGln: 1.326 ± 0.527
3.537LeuArg: 3.537 ± 1.58
7.515LeuSer: 7.515 ± 1.586
3.095LeuThr: 3.095 ± 1.03
4.421LeuVal: 4.421 ± 1.973
1.326LeuTrp: 1.326 ± 0.74
2.21LeuTyr: 2.21 ± 0.756
0.0LeuXaa: 0.0 ± 0.0
Met
3.095MetAla: 3.095 ± 1.358
0.442MetCys: 0.442 ± 0.319
0.884MetAsp: 0.884 ± 0.638
1.326MetGlu: 1.326 ± 0.836
1.326MetPhe: 1.326 ± 0.758
0.884MetGly: 0.884 ± 0.638
0.884MetHis: 0.884 ± 0.638
0.884MetIle: 0.884 ± 0.692
1.768MetLys: 1.768 ± 0.581
1.326MetLeu: 1.326 ± 0.554
0.442MetMet: 0.442 ± 0.346
1.326MetAsn: 1.326 ± 0.503
0.442MetPro: 0.442 ± 0.346
0.0MetGln: 0.0 ± 0.0
1.326MetArg: 1.326 ± 0.503
3.095MetSer: 3.095 ± 2.036
1.768MetThr: 1.768 ± 0.872
0.442MetVal: 0.442 ± 0.319
0.0MetTrp: 0.0 ± 0.0
1.326MetTyr: 1.326 ± 0.868
0.0MetXaa: 0.0 ± 0.0
Asn
2.653AsnAla: 2.653 ± 1.045
0.442AsnCys: 0.442 ± 0.319
3.537AsnAsp: 3.537 ± 0.968
1.326AsnGlu: 1.326 ± 0.957
0.884AsnPhe: 0.884 ± 0.728
2.653AsnGly: 2.653 ± 0.428
0.442AsnHis: 0.442 ± 0.319
2.21AsnIle: 2.21 ± 0.62
4.863AsnLys: 4.863 ± 0.789
5.305AsnLeu: 5.305 ± 0.755
0.884AsnMet: 0.884 ± 0.29
3.095AsnAsn: 3.095 ± 1.816
2.21AsnPro: 2.21 ± 0.894
1.768AsnGln: 1.768 ± 1.293
1.768AsnArg: 1.768 ± 0.479
0.884AsnSer: 0.884 ± 0.696
2.21AsnThr: 2.21 ± 0.62
4.863AsnVal: 4.863 ± 0.804
1.768AsnTrp: 1.768 ± 0.581
2.21AsnTyr: 2.21 ± 1.23
0.0AsnXaa: 0.0 ± 0.0
Pro
1.326ProAla: 1.326 ± 1.463
0.0ProCys: 0.0 ± 0.0
2.21ProAsp: 2.21 ± 0.813
3.537ProGlu: 3.537 ± 1.36
1.326ProPhe: 1.326 ± 0.836
1.326ProGly: 1.326 ± 1.098
0.884ProHis: 0.884 ± 0.638
2.653ProIle: 2.653 ± 1.552
2.653ProLys: 2.653 ± 2.089
3.979ProLeu: 3.979 ± 1.51
0.442ProMet: 0.442 ± 0.774
2.21ProAsn: 2.21 ± 0.763
1.326ProPro: 1.326 ± 0.493
1.326ProGln: 1.326 ± 0.74
2.21ProArg: 2.21 ± 2.065
4.421ProSer: 4.421 ± 1.753
2.653ProThr: 2.653 ± 0.71
6.189ProVal: 6.189 ± 1.409
0.0ProTrp: 0.0 ± 0.0
0.442ProTyr: 0.442 ± 0.346
0.0ProXaa: 0.0 ± 0.0
Gln
2.653GlnAla: 2.653 ± 0.862
0.442GlnCys: 0.442 ± 0.346
2.21GlnAsp: 2.21 ± 0.813
1.768GlnGlu: 1.768 ± 0.931
0.884GlnPhe: 0.884 ± 0.638
1.768GlnGly: 1.768 ± 0.479
0.884GlnHis: 0.884 ± 0.29
1.768GlnIle: 1.768 ± 1.138
2.21GlnLys: 2.21 ± 0.813
1.768GlnLeu: 1.768 ± 1.393
0.442GlnMet: 0.442 ± 0.774
0.884GlnAsn: 0.884 ± 0.728
0.442GlnPro: 0.442 ± 0.551
0.884GlnGln: 0.884 ± 0.29
3.095GlnArg: 3.095 ± 2.057
1.768GlnSer: 1.768 ± 0.633
0.884GlnThr: 0.884 ± 0.569
2.653GlnVal: 2.653 ± 0.653
0.442GlnTrp: 0.442 ± 0.346
0.884GlnTyr: 0.884 ± 0.696
0.0GlnXaa: 0.0 ± 0.0
Arg
4.421ArgAla: 4.421 ± 2.014
1.768ArgCys: 1.768 ± 0.931
2.653ArgAsp: 2.653 ± 1.005
6.189ArgGlu: 6.189 ± 2.464
1.768ArgPhe: 1.768 ± 1.384
2.653ArgGly: 2.653 ± 1.407
1.768ArgHis: 1.768 ± 0.931
3.537ArgIle: 3.537 ± 0.92
3.537ArgLys: 3.537 ± 1.58
4.863ArgLeu: 4.863 ± 0.967
2.653ArgMet: 2.653 ± 0.474
2.21ArgAsn: 2.21 ± 0.594
0.884ArgPro: 0.884 ± 0.6
1.326ArgGln: 1.326 ± 0.503
1.768ArgArg: 1.768 ± 1.307
2.653ArgSer: 2.653 ± 1.436
1.768ArgThr: 1.768 ± 0.479
4.863ArgVal: 4.863 ± 1.136
1.326ArgTrp: 1.326 ± 0.503
1.768ArgTyr: 1.768 ± 0.581
0.0ArgXaa: 0.0 ± 0.0
Ser
3.979SerAla: 3.979 ± 2.603
1.326SerCys: 1.326 ± 0.493
3.979SerAsp: 3.979 ± 1.449
5.305SerGlu: 5.305 ± 2.488
5.305SerPhe: 5.305 ± 1.057
5.747SerGly: 5.747 ± 1.352
1.326SerHis: 1.326 ± 0.503
4.421SerIle: 4.421 ± 2.873
5.747SerLys: 5.747 ± 0.337
5.747SerLeu: 5.747 ± 2.528
1.326SerMet: 1.326 ± 0.622
4.421SerAsn: 4.421 ± 4.129
3.095SerPro: 3.095 ± 1.816
1.326SerGln: 1.326 ± 0.503
4.863SerArg: 4.863 ± 1.594
7.958SerSer: 7.958 ± 3.217
4.421SerThr: 4.421 ± 1.492
6.189SerVal: 6.189 ± 1.671
0.884SerTrp: 0.884 ± 0.696
4.421SerTyr: 4.421 ± 1.525
0.0SerXaa: 0.0 ± 0.0
Thr
3.095ThrAla: 3.095 ± 0.374
0.884ThrCys: 0.884 ± 0.638
2.653ThrAsp: 2.653 ± 1.552
2.21ThrGlu: 2.21 ± 0.62
1.768ThrPhe: 1.768 ± 0.581
2.653ThrGly: 2.653 ± 2.183
0.884ThrHis: 0.884 ± 0.728
4.421ThrIle: 4.421 ± 1.556
2.21ThrLys: 2.21 ± 0.986
4.863ThrLeu: 4.863 ± 1.114
1.326ThrMet: 1.326 ± 0.503
1.326ThrAsn: 1.326 ± 0.868
0.884ThrPro: 0.884 ± 1.102
3.095ThrGln: 3.095 ± 1.197
2.21ThrArg: 2.21 ± 1.268
5.305ThrSer: 5.305 ± 1.622
2.21ThrThr: 2.21 ± 1.595
5.305ThrVal: 5.305 ± 3.267
0.0ThrTrp: 0.0 ± 0.0
3.095ThrTyr: 3.095 ± 1.03
0.0ThrXaa: 0.0 ± 0.0
Val
6.631ValAla: 6.631 ± 1.148
0.884ValCys: 0.884 ± 0.29
7.958ValAsp: 7.958 ± 1.162
3.979ValGlu: 3.979 ± 0.874
0.884ValPhe: 0.884 ± 0.692
3.095ValGly: 3.095 ± 1.197
3.095ValHis: 3.095 ± 1.346
3.095ValIle: 3.095 ± 1.197
7.073ValLys: 7.073 ± 0.52
7.073ValLeu: 7.073 ± 1.542
0.442ValMet: 0.442 ± 0.346
2.653ValAsn: 2.653 ± 1.178
5.305ValPro: 5.305 ± 1.527
0.884ValGln: 0.884 ± 0.696
6.189ValArg: 6.189 ± 1.88
7.515ValSer: 7.515 ± 1.356
5.747ValThr: 5.747 ± 2.514
8.4ValVal: 8.4 ± 2.837
0.442ValTrp: 0.442 ± 0.319
1.326ValTyr: 1.326 ± 0.493
0.0ValXaa: 0.0 ± 0.0
Trp
1.768TrpAla: 1.768 ± 0.581
0.0TrpCys: 0.0 ± 0.0
0.442TrpAsp: 0.442 ± 0.346
0.442TrpGlu: 0.442 ± 0.551
1.326TrpPhe: 1.326 ± 0.503
0.0TrpGly: 0.0 ± 0.0
0.884TrpHis: 0.884 ± 0.29
0.442TrpIle: 0.442 ± 0.319
1.768TrpLys: 1.768 ± 0.79
0.442TrpLeu: 0.442 ± 0.551
1.326TrpMet: 1.326 ± 0.503
0.442TrpAsn: 0.442 ± 0.774
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.884TrpArg: 0.884 ± 0.29
0.442TrpSer: 0.442 ± 0.319
0.884TrpThr: 0.884 ± 0.696
0.442TrpVal: 0.442 ± 0.319
0.884TrpTrp: 0.884 ± 0.29
0.442TrpTyr: 0.442 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.442TyrAla: 0.442 ± 0.346
1.326TyrCys: 1.326 ± 0.957
3.095TyrAsp: 3.095 ± 0.374
3.979TyrGlu: 3.979 ± 0.716
1.768TyrPhe: 1.768 ± 1.091
1.768TyrGly: 1.768 ± 0.479
1.326TyrHis: 1.326 ± 0.957
1.326TyrIle: 1.326 ± 1.438
2.21TyrLys: 2.21 ± 0.353
3.979TyrLeu: 3.979 ± 1.661
0.884TyrMet: 0.884 ± 0.638
1.326TyrAsn: 1.326 ± 0.957
0.442TyrPro: 0.442 ± 0.319
2.21TyrGln: 2.21 ± 1.212
1.326TyrArg: 1.326 ± 0.503
3.537TyrSer: 3.537 ± 1.339
0.884TyrThr: 0.884 ± 0.728
3.537TyrVal: 3.537 ± 1.148
0.442TyrTrp: 0.442 ± 0.774
1.768TyrTyr: 1.768 ± 1.384
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski