Amino acid dipepetide frequency for Mesta yellow vein mosaic Bahraich virus-[India:Bahraich:2007]

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.309AlaAla: 6.309 ± 1.727
1.577AlaCys: 1.577 ± 1.032
1.577AlaAsp: 1.577 ± 0.992
0.789AlaGlu: 0.789 ± 0.767
0.0AlaPhe: 0.0 ± 0.0
3.155AlaGly: 3.155 ± 1.076
1.577AlaHis: 1.577 ± 0.961
1.577AlaIle: 1.577 ± 0.774
1.577AlaLys: 1.577 ± 0.902
5.521AlaLeu: 5.521 ± 1.911
0.0AlaMet: 0.0 ± 0.0
2.366AlaAsn: 2.366 ± 1.093
3.943AlaPro: 3.943 ± 1.543
3.155AlaGln: 3.155 ± 1.36
3.943AlaArg: 3.943 ± 1.637
5.521AlaSer: 5.521 ± 2.109
3.155AlaThr: 3.155 ± 1.29
0.789AlaVal: 0.789 ± 0.898
1.577AlaTrp: 1.577 ± 0.716
0.789AlaTyr: 0.789 ± 0.541
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.577CysCys: 1.577 ± 1.737
1.577CysAsp: 1.577 ± 1.049
1.577CysGlu: 1.577 ± 1.032
0.789CysPhe: 0.789 ± 0.898
1.577CysGly: 1.577 ± 0.902
0.789CysHis: 0.789 ± 0.917
1.577CysIle: 1.577 ± 0.955
0.789CysLys: 0.789 ± 0.696
0.789CysLeu: 0.789 ± 0.767
0.789CysMet: 0.789 ± 0.868
2.366CysAsn: 2.366 ± 1.008
2.366CysPro: 2.366 ± 1.797
0.789CysGln: 0.789 ± 0.541
2.366CysArg: 2.366 ± 1.014
3.943CysSer: 3.943 ± 1.878
0.789CysThr: 0.789 ± 0.696
0.789CysVal: 0.789 ± 0.696
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.155AspAla: 3.155 ± 1.541
0.0AspCys: 0.0 ± 0.0
2.366AspAsp: 2.366 ± 1.003
2.366AspGlu: 2.366 ± 0.854
1.577AspPhe: 1.577 ± 0.716
3.155AspGly: 3.155 ± 1.706
0.0AspHis: 0.0 ± 0.0
3.943AspIle: 3.943 ± 1.655
2.366AspLys: 2.366 ± 0.896
3.943AspLeu: 3.943 ± 1.933
0.0AspMet: 0.0 ± 0.0
3.943AspAsn: 3.943 ± 1.884
2.366AspPro: 2.366 ± 1.196
2.366AspGln: 2.366 ± 1.066
1.577AspArg: 1.577 ± 1.391
3.943AspSer: 3.943 ± 1.108
1.577AspThr: 1.577 ± 1.737
4.732AspVal: 4.732 ± 1.855
2.366AspTrp: 2.366 ± 1.17
1.577AspTyr: 1.577 ± 0.953
0.0AspXaa: 0.0 ± 0.0
Glu
5.521GluAla: 5.521 ± 1.855
0.789GluCys: 0.789 ± 0.917
2.366GluAsp: 2.366 ± 1.17
4.732GluGlu: 4.732 ± 2.645
3.155GluPhe: 3.155 ± 1.233
3.943GluGly: 3.943 ± 1.626
2.366GluHis: 2.366 ± 1.069
0.0GluIle: 0.0 ± 0.0
0.789GluLys: 0.789 ± 0.541
4.732GluLeu: 4.732 ± 2.055
0.0GluMet: 0.0 ± 0.0
3.155GluAsn: 3.155 ± 1.964
1.577GluPro: 1.577 ± 0.716
0.789GluGln: 0.789 ± 0.868
0.789GluArg: 0.789 ± 0.898
2.366GluSer: 2.366 ± 1.167
0.789GluThr: 0.789 ± 0.541
3.155GluVal: 3.155 ± 1.349
0.789GluTrp: 0.789 ± 0.541
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.789PheCys: 0.789 ± 0.696
2.366PheAsp: 2.366 ± 1.061
0.789PheGlu: 0.789 ± 0.541
1.577PhePhe: 1.577 ± 0.716
1.577PheGly: 1.577 ± 1.391
0.789PheHis: 0.789 ± 0.541
1.577PheIle: 1.577 ± 1.088
3.943PheLys: 3.943 ± 2.715
5.521PheLeu: 5.521 ± 1.643
0.789PheMet: 0.789 ± 0.541
1.577PheAsn: 1.577 ± 1.011
0.789PhePro: 0.789 ± 0.868
2.366PheGln: 2.366 ± 1.17
3.943PheArg: 3.943 ± 1.303
4.732PheSer: 4.732 ± 2.788
0.789PheThr: 0.789 ± 0.898
3.943PheVal: 3.943 ± 1.728
0.0PheTrp: 0.0 ± 0.0
1.577PheTyr: 1.577 ± 0.955
0.0PheXaa: 0.0 ± 0.0
Gly
1.577GlyAla: 1.577 ± 1.081
2.366GlyCys: 2.366 ± 1.204
2.366GlyAsp: 2.366 ± 1.161
1.577GlyGlu: 1.577 ± 0.902
1.577GlyPhe: 1.577 ± 1.264
3.943GlyGly: 3.943 ± 0.919
1.577GlyHis: 1.577 ± 0.902
2.366GlyIle: 2.366 ± 1.17
5.521GlyLys: 5.521 ± 2.685
3.943GlyLeu: 3.943 ± 1.353
0.789GlyMet: 0.789 ± 0.868
1.577GlyAsn: 1.577 ± 1.419
3.155GlyPro: 3.155 ± 1.524
4.732GlyGln: 4.732 ± 1.612
1.577GlyArg: 1.577 ± 0.774
4.732GlySer: 4.732 ± 1.52
3.943GlyThr: 3.943 ± 1.153
3.943GlyVal: 3.943 ± 2.635
0.789GlyTrp: 0.789 ± 0.917
0.789GlyTyr: 0.789 ± 0.868
0.0GlyXaa: 0.0 ± 0.0
His
2.366HisAla: 2.366 ± 2.087
1.577HisCys: 1.577 ± 1.264
1.577HisAsp: 1.577 ± 1.2
0.789HisGlu: 0.789 ± 0.541
2.366HisPhe: 2.366 ± 1.284
2.366HisGly: 2.366 ± 2.031
2.366HisHis: 2.366 ± 1.93
2.366HisIle: 2.366 ± 1.125
2.366HisLys: 2.366 ± 1.443
2.366HisLeu: 2.366 ± 0.974
0.0HisMet: 0.0 ± 0.0
3.943HisAsn: 3.943 ± 1.746
1.577HisPro: 1.577 ± 1.2
0.789HisGln: 0.789 ± 0.868
3.943HisArg: 3.943 ± 2.066
2.366HisSer: 2.366 ± 1.399
1.577HisThr: 1.577 ± 1.391
0.789HisVal: 0.789 ± 0.898
0.0HisTrp: 0.0 ± 0.0
0.789HisTyr: 0.789 ± 0.541
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
3.155IleCys: 3.155 ± 1.035
1.577IleAsp: 1.577 ± 1.081
1.577IleGlu: 1.577 ± 1.081
2.366IlePhe: 2.366 ± 1.17
1.577IleGly: 1.577 ± 1.391
3.155IleHis: 3.155 ± 1.613
1.577IleIle: 1.577 ± 1.088
5.521IleLys: 5.521 ± 1.443
3.155IleLeu: 3.155 ± 1.235
0.789IleMet: 0.789 ± 0.92
2.366IleAsn: 2.366 ± 1.911
0.789IlePro: 0.789 ± 0.541
2.366IleGln: 2.366 ± 1.17
5.521IleArg: 5.521 ± 2.014
3.155IleSer: 3.155 ± 1.522
1.577IleThr: 1.577 ± 1.796
2.366IleVal: 2.366 ± 1.304
2.366IleTrp: 2.366 ± 1.66
1.577IleTyr: 1.577 ± 0.716
0.0IleXaa: 0.0 ± 0.0
Lys
2.366LysAla: 2.366 ± 2.034
2.366LysCys: 2.366 ± 1.274
2.366LysAsp: 2.366 ± 1.622
3.155LysGlu: 3.155 ± 2.162
2.366LysPhe: 2.366 ± 0.816
3.155LysGly: 3.155 ± 1.152
0.789LysHis: 0.789 ± 0.541
3.155LysIle: 3.155 ± 1.522
1.577LysLys: 1.577 ± 0.716
0.789LysLeu: 0.789 ± 0.898
0.0LysMet: 0.0 ± 0.0
3.155LysAsn: 3.155 ± 1.432
2.366LysPro: 2.366 ± 1.304
1.577LysGln: 1.577 ± 0.955
3.943LysArg: 3.943 ± 2.066
6.309LysSer: 6.309 ± 2.044
2.366LysThr: 2.366 ± 0.816
4.732LysVal: 4.732 ± 1.842
0.789LysTrp: 0.789 ± 0.696
4.732LysTyr: 4.732 ± 1.061
0.0LysXaa: 0.0 ± 0.0
Leu
1.577LeuAla: 1.577 ± 0.796
2.366LeuCys: 2.366 ± 1.093
3.155LeuAsp: 3.155 ± 1.584
2.366LeuGlu: 2.366 ± 1.008
0.789LeuPhe: 0.789 ± 0.898
4.732LeuGly: 4.732 ± 1.637
1.577LeuHis: 1.577 ± 0.961
2.366LeuIle: 2.366 ± 1.742
5.521LeuLys: 5.521 ± 1.149
6.309LeuLeu: 6.309 ± 2.516
1.577LeuMet: 1.577 ± 0.882
7.886LeuAsn: 7.886 ± 1.912
1.577LeuPro: 1.577 ± 1.147
3.155LeuGln: 3.155 ± 1.36
6.309LeuArg: 6.309 ± 2.079
4.732LeuSer: 4.732 ± 2.059
9.464LeuThr: 9.464 ± 3.002
3.155LeuVal: 3.155 ± 1.173
0.0LeuTrp: 0.0 ± 0.0
6.309LeuTyr: 6.309 ± 2.355
0.0LeuXaa: 0.0 ± 0.0
Met
0.789MetAla: 0.789 ± 0.696
0.789MetCys: 0.789 ± 0.696
3.943MetAsp: 3.943 ± 1.933
0.0MetGlu: 0.0 ± 0.0
1.577MetPhe: 1.577 ± 1.391
2.366MetGly: 2.366 ± 0.974
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.577MetLeu: 1.577 ± 0.955
0.0MetMet: 0.0 ± 0.0
0.789MetAsn: 0.789 ± 0.696
0.789MetPro: 0.789 ± 0.541
0.789MetGln: 0.789 ± 0.917
1.577MetArg: 1.577 ± 0.961
0.789MetSer: 0.789 ± 0.696
0.789MetThr: 0.789 ± 0.71
0.0MetVal: 0.0 ± 0.0
1.577MetTrp: 1.577 ± 0.953
2.366MetTyr: 2.366 ± 1.337
0.0MetXaa: 0.0 ± 0.0
Asn
3.943AsnAla: 3.943 ± 2.026
0.789AsnCys: 0.789 ± 0.917
2.366AsnAsp: 2.366 ± 1.093
3.155AsnGlu: 3.155 ± 1.507
2.366AsnPhe: 2.366 ± 1.629
2.366AsnGly: 2.366 ± 1.003
2.366AsnHis: 2.366 ± 1.473
4.732AsnIle: 4.732 ± 2.906
0.0AsnLys: 0.0 ± 0.0
7.098AsnLeu: 7.098 ± 3.334
3.943AsnMet: 3.943 ± 1.245
3.155AsnAsn: 3.155 ± 1.253
6.309AsnPro: 6.309 ± 1.909
3.155AsnGln: 3.155 ± 0.964
5.521AsnArg: 5.521 ± 1.601
3.155AsnSer: 3.155 ± 1.481
2.366AsnThr: 2.366 ± 1.443
3.155AsnVal: 3.155 ± 1.477
0.789AsnTrp: 0.789 ± 0.71
3.943AsnTyr: 3.943 ± 1.053
0.0AsnXaa: 0.0 ± 0.0
Pro
3.155ProAla: 3.155 ± 1.076
2.366ProCys: 2.366 ± 1.106
3.155ProAsp: 3.155 ± 1.909
0.789ProGlu: 0.789 ± 0.541
1.577ProPhe: 1.577 ± 0.961
0.789ProGly: 0.789 ± 0.541
2.366ProHis: 2.366 ± 1.284
3.943ProIle: 3.943 ± 1.938
2.366ProLys: 2.366 ± 1.622
3.943ProLeu: 3.943 ± 1.155
2.366ProMet: 2.366 ± 0.965
3.943ProAsn: 3.943 ± 1.814
2.366ProPro: 2.366 ± 1.274
3.943ProGln: 3.943 ± 1.516
4.732ProArg: 4.732 ± 1.212
4.732ProSer: 4.732 ± 3.389
4.732ProThr: 4.732 ± 1.68
5.521ProVal: 5.521 ± 2.278
0.789ProTrp: 0.789 ± 0.767
2.366ProTyr: 2.366 ± 0.854
0.0ProXaa: 0.0 ± 0.0
Gln
1.577GlnAla: 1.577 ± 1.032
0.0GlnCys: 0.0 ± 0.0
1.577GlnAsp: 1.577 ± 0.955
3.943GlnGlu: 3.943 ± 1.107
2.366GlnPhe: 2.366 ± 1.622
1.577GlnGly: 1.577 ± 1.081
3.155GlnHis: 3.155 ± 2.198
1.577GlnIle: 1.577 ± 1.081
0.789GlnLys: 0.789 ± 0.868
2.366GlnLeu: 2.366 ± 1.196
0.0GlnMet: 0.0 ± 0.0
3.155GlnAsn: 3.155 ± 1.907
5.521GlnPro: 5.521 ± 2.995
4.732GlnGln: 4.732 ± 1.355
0.789GlnArg: 0.789 ± 0.541
3.943GlnSer: 3.943 ± 1.378
5.521GlnThr: 5.521 ± 1.698
4.732GlnVal: 4.732 ± 1.549
0.0GlnTrp: 0.0 ± 0.0
1.577GlnTyr: 1.577 ± 0.902
0.0GlnXaa: 0.0 ± 0.0
Arg
0.789ArgAla: 0.789 ± 0.696
2.366ArgCys: 2.366 ± 1.372
3.155ArgAsp: 3.155 ± 1.29
3.155ArgGlu: 3.155 ± 1.493
1.577ArgPhe: 1.577 ± 0.716
3.943ArgGly: 3.943 ± 1.276
3.155ArgHis: 3.155 ± 2.149
6.309ArgIle: 6.309 ± 2.311
3.155ArgLys: 3.155 ± 1.346
3.943ArgLeu: 3.943 ± 1.852
1.577ArgMet: 1.577 ± 1.391
6.309ArgAsn: 6.309 ± 2.906
4.732ArgPro: 4.732 ± 1.52
2.366ArgGln: 2.366 ± 1.287
7.886ArgArg: 7.886 ± 2.682
5.521ArgSer: 5.521 ± 0.794
3.155ArgThr: 3.155 ± 1.224
7.098ArgVal: 7.098 ± 1.288
0.0ArgTrp: 0.0 ± 0.0
1.577ArgTyr: 1.577 ± 0.955
0.0ArgXaa: 0.0 ± 0.0
Ser
4.732SerAla: 4.732 ± 1.557
0.789SerCys: 0.789 ± 0.868
4.732SerAsp: 4.732 ± 0.943
3.155SerGlu: 3.155 ± 1.724
3.943SerPhe: 3.943 ± 2.493
3.943SerGly: 3.943 ± 1.254
2.366SerHis: 2.366 ± 1.069
2.366SerIle: 2.366 ± 1.737
7.098SerLys: 7.098 ± 1.578
3.943SerLeu: 3.943 ± 1.324
0.0SerMet: 0.0 ± 0.0
4.732SerAsn: 4.732 ± 1.173
7.886SerPro: 7.886 ± 1.756
2.366SerGln: 2.366 ± 1.196
5.521SerArg: 5.521 ± 1.321
12.618SerSer: 12.618 ± 4.554
9.464SerThr: 9.464 ± 2.358
5.521SerVal: 5.521 ± 2.839
1.577SerTrp: 1.577 ± 1.081
2.366SerTyr: 2.366 ± 1.17
0.0SerXaa: 0.0 ± 0.0
Thr
3.155ThrAla: 3.155 ± 0.964
0.0ThrCys: 0.0 ± 0.0
1.577ThrAsp: 1.577 ± 1.2
1.577ThrGlu: 1.577 ± 1.032
2.366ThrPhe: 2.366 ± 1.066
6.309ThrGly: 6.309 ± 1.933
3.155ThrHis: 3.155 ± 1.57
0.789ThrIle: 0.789 ± 0.541
3.155ThrLys: 3.155 ± 1.432
3.155ThrLeu: 3.155 ± 1.47
3.155ThrMet: 3.155 ± 1.745
4.732ThrAsn: 4.732 ± 1.772
7.098ThrPro: 7.098 ± 1.846
3.155ThrGln: 3.155 ± 1.271
3.155ThrArg: 3.155 ± 1.805
5.521ThrSer: 5.521 ± 1.396
2.366ThrThr: 2.366 ± 1.975
2.366ThrVal: 2.366 ± 1.427
0.789ThrTrp: 0.789 ± 0.71
3.155ThrTyr: 3.155 ± 1.035
0.0ThrXaa: 0.0 ± 0.0
Val
1.577ValAla: 1.577 ± 0.992
0.789ValCys: 0.789 ± 0.767
3.943ValAsp: 3.943 ± 0.89
2.366ValGlu: 2.366 ± 1.995
3.943ValPhe: 3.943 ± 1.494
0.789ValGly: 0.789 ± 0.696
3.943ValHis: 3.943 ± 1.791
5.521ValIle: 5.521 ± 2.563
4.732ValLys: 4.732 ± 1.829
5.521ValLeu: 5.521 ± 1.613
1.577ValMet: 1.577 ± 1.391
1.577ValAsn: 1.577 ± 0.992
3.155ValPro: 3.155 ± 0.821
3.943ValGln: 3.943 ± 0.968
3.943ValArg: 3.943 ± 2.642
4.732ValSer: 4.732 ± 0.993
3.943ValThr: 3.943 ± 2.642
2.366ValVal: 2.366 ± 1.304
0.789ValTrp: 0.789 ± 0.767
3.943ValTyr: 3.943 ± 1.974
0.0ValXaa: 0.0 ± 0.0
Trp
3.155TrpAla: 3.155 ± 1.541
0.0TrpCys: 0.0 ± 0.0
0.789TrpAsp: 0.789 ± 0.868
0.789TrpGlu: 0.789 ± 0.898
0.0TrpPhe: 0.0 ± 0.0
0.789TrpGly: 0.789 ± 0.541
0.789TrpHis: 0.789 ± 0.696
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.789TrpLeu: 0.789 ± 0.71
0.789TrpMet: 0.789 ± 0.696
0.789TrpAsn: 0.789 ± 0.767
0.0TrpPro: 0.0 ± 0.0
0.789TrpGln: 0.789 ± 0.541
0.789TrpArg: 0.789 ± 0.917
2.366TrpSer: 2.366 ± 1.008
1.577TrpThr: 1.577 ± 0.902
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.789TrpTyr: 0.789 ± 0.541
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.155TyrAla: 3.155 ± 1.29
0.789TyrCys: 0.789 ± 0.71
0.789TyrAsp: 0.789 ± 0.696
3.155TyrGlu: 3.155 ± 1.724
3.155TyrPhe: 3.155 ± 0.964
0.789TyrGly: 0.789 ± 0.541
0.0TyrHis: 0.0 ± 0.0
1.577TyrIle: 1.577 ± 1.081
0.789TyrLys: 0.789 ± 0.541
5.521TyrLeu: 5.521 ± 1.871
1.577TyrMet: 1.577 ± 0.856
3.155TyrAsn: 3.155 ± 1.56
1.577TyrPro: 1.577 ± 0.774
1.577TyrGln: 1.577 ± 1.011
3.943TyrArg: 3.943 ± 1.618
3.943TyrSer: 3.943 ± 1.824
0.789TyrThr: 0.789 ± 0.696
3.943TyrVal: 3.943 ± 1.19
0.0TyrTrp: 0.0 ± 0.0
0.789TyrTyr: 0.789 ± 0.917
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1269 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski