Amino acid dipeptide frequency for Hydrangea ringspot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
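As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the code used to build this page. It assumes one-letter amino acid codes, overlapping pairs counted within each protein (never across protein boundaries), and normalization by the total number of pairs; the function name dipeptide_frequencies and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and the counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical): pairs are MA, AA, AL and AL, LP.
toy = ["MAAL", "ALP"]
freqs = dipeptide_frequencies(toy)
print(freqs["AL"])  # AL is 2 of 5 pairs -> 400.0
```

The table on this page uses three-letter codes (e.g. AlaLeu); mapping one-letter pairs to that notation is a simple lookup and is left out here.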

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
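The arithmetic above can be sketched in a few lines of Python. This is only an illustration: the helper names are hypothetical, and whether the page's red/blue marking uses the 400- or the 441-dipeptide baseline is an assumption (the 400 baseline is used by default here).

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed baseline)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille())      # 2.5 per mille, i.e. 0.25%
print(classify(8.234))           # AlaAla value from the table -> over
print(classify(0.457))           # AlaCys value from the table -> under
```

With Xaa included (alphabet_size=21) the uniform expectation drops to 1000/441, roughly 2.27‰.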

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Each cell: frequency ± error (‰).

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.234 ± 3.489 | 0.457 ± 0.265 | 0.915 ± 0.63 | 4.575 ± 1.335 | 3.66 ± 1.519 | 5.947 ± 1.866 | 2.287 ± 0.876 | 2.745 ± 0.994 | 6.404 ± 3.046 | 13.266 ± 4.679 | 1.372 ± 0.796 | 3.66 ± 1.098 | 5.947 ± 1.488 | 1.83 ± 0.686 | 4.575 ± 1.475 | 6.404 ± 3.245 | 6.404 ± 2.308 | 4.575 ± 1.886 | 1.372 ± 1.366 | 4.117 ± 2.387 | 0.0 ± 0.0
Cys | 0.915 ± 0.63 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.457 ± 0.265 | 0.915 ± 1.359 | 0.915 ± 0.53 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.0 ± 0.0 | 0.915 ± 0.833 | 0.457 ± 0.93 | 0.915 ± 0.53 | 1.83 ± 1.388 | 0.457 ± 0.265 | 0.915 ± 0.63 | 2.745 ± 2.154 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.457 ± 0.265 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.287 ± 0.848 | 0.915 ± 1.302 | 1.372 ± 0.796 | 1.372 ± 0.653 | 1.83 ± 0.686 | 0.915 ± 1.381 | 0.457 ± 0.265 | 4.117 ± 1.337 | 0.0 ± 0.0 | 4.117 ± 2.387 | 0.457 ± 0.907 | 1.372 ± 0.799 | 5.032 ± 1.757 | 0.457 ± 0.265 | 0.915 ± 0.53 | 1.83 ± 0.715 | 3.66 ± 2.535 | 1.83 ± 1.061 | 0.457 ± 0.265 | 0.915 ± 0.833 | 0.0 ± 0.0
Glu | 8.692 ± 1.986 | 0.457 ± 0.265 | 1.372 ± 0.796 | 3.202 ± 1.856 | 2.745 ± 1.166 | 2.287 ± 0.953 | 0.0 ± 0.0 | 2.287 ± 1.326 | 5.947 ± 2.787 | 7.319 ± 2.004 | 0.915 ± 0.546 | 2.287 ± 0.848 | 2.745 ± 1.712 | 2.745 ± 1.166 | 4.117 ± 1.766 | 1.372 ± 1.916 | 4.575 ± 1.696 | 3.202 ± 1.279 | 0.457 ± 0.265 | 0.457 ± 0.712 | 0.0 ± 0.0
Phe | 3.66 ± 1.798 | 1.372 ± 1.366 | 3.66 ± 1.798 | 3.202 ± 0.843 | 0.457 ± 0.756 | 0.915 ± 0.63 | 0.0 ± 0.0 | 1.83 ± 1.061 | 1.83 ± 0.85 | 6.862 ± 1.453 | 0.915 ± 0.53 | 1.372 ± 0.653 | 2.287 ± 1.319 | 2.287 ± 1.326 | 1.372 ± 0.796 | 0.915 ± 0.53 | 3.66 ± 1.295 | 1.83 ± 0.927 | 0.915 ± 0.906 | 0.915 ± 0.63 | 0.0 ± 0.0
Gly | 4.117 ± 1.164 | 1.372 ± 0.799 | 3.66 ± 1.35 | 2.745 ± 1.591 | 1.83 ± 0.686 | 2.287 ± 0.974 | 2.287 ± 1.326 | 2.745 ± 1.307 | 1.83 ± 0.932 | 4.117 ± 1.244 | 0.457 ± 1.024 | 1.83 ± 1.061 | 3.66 ± 1.095 | 1.372 ± 0.653 | 1.83 ± 1.08 | 5.032 ± 3.416 | 4.117 ± 1.327 | 3.202 ± 1.575 | 0.0 ± 0.0 | 1.83 ± 0.772 | 0.0 ± 0.0
His | 3.202 ± 1.857 | 0.0 ± 0.0 | 0.457 ± 0.265 | 0.915 ± 0.53 | 3.202 ± 1.405 | 2.745 ± 1.619 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.915 ± 0.53 | 4.575 ± 1.009 | 0.457 ± 0.653 | 0.0 ± 0.0 | 2.745 ± 0.999 | 1.83 ± 1.061 | 3.66 ± 2.289 | 2.287 ± 2.184 | 2.745 ± 1.591 | 0.457 ± 0.712 | 0.0 ± 0.0 | 1.372 ± 0.796 | 0.0 ± 0.0
Ile | 3.202 ± 0.843 | 0.915 ± 0.53 | 0.457 ± 0.712 | 0.915 ± 0.53 | 2.745 ± 0.901 | 0.457 ± 0.265 | 2.287 ± 1.755 | 0.915 ± 0.53 | 4.117 ± 2.387 | 4.575 ± 1.905 | 0.915 ± 0.53 | 1.372 ± 0.796 | 3.66 ± 2.307 | 3.66 ± 1.519 | 1.83 ± 0.715 | 3.202 ± 1.723 | 3.66 ± 2.122 | 1.372 ± 0.653 | 0.457 ± 0.756 | 0.457 ± 0.265 | 0.0 ± 0.0
Lys | 6.404 ± 2.7 | 0.915 ± 0.932 | 4.117 ± 1.886 | 3.66 ± 1.638 | 1.83 ± 1.26 | 0.915 ± 0.53 | 2.287 ± 0.848 | 2.745 ± 1.591 | 3.202 ± 1.856 | 5.947 ± 2.755 | 1.372 ± 0.796 | 0.915 ± 0.53 | 5.032 ± 1.473 | 1.372 ± 1.066 | 1.83 ± 0.887 | 3.66 ± 1.029 | 2.745 ± 1.166 | 1.83 ± 1.061 | 0.457 ± 0.265 | 0.0 ± 0.0 | 0.0 ± 0.0
Leu | 8.692 ± 4.5 | 1.372 ± 0.653 | 4.575 ± 1.905 | 7.777 ± 1.375 | 4.575 ± 2.017 | 6.404 ± 1.478 | 4.575 ± 1.575 | 2.745 ± 1.597 | 7.777 ± 2.509 | 10.064 ± 2.908 | 0.457 ± 0.265 | 2.287 ± 0.848 | 10.064 ± 1.696 | 5.032 ± 2.112 | 5.947 ± 1.224 | 9.607 ± 3.956 | 7.777 ± 3.098 | 6.862 ± 2.601 | 1.83 ± 0.887 | 4.117 ± 1.886 | 0.0 ± 0.0
Met | 1.83 ± 1.061 | 0.457 ± 0.265 | 0.457 ± 0.712 | 1.372 ± 0.796 | 0.457 ± 0.265 | 2.287 ± 0.99 | 0.0 ± 0.0 | 0.915 ± 0.53 | 0.915 ± 0.53 | 1.83 ± 0.887 | 0.0 ± 0.0 | 0.457 ± 0.265 | 1.83 ± 1.408 | 0.0 ± 0.0 | 2.287 ± 1.326 | 0.915 ± 0.833 | 0.915 ± 0.63 | 0.457 ± 0.265 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.202 ± 1.826 | 1.372 ± 0.796 | 0.915 ± 0.53 | 1.83 ± 0.686 | 0.915 ± 1.512 | 0.915 ± 0.906 | 1.83 ± 0.772 | 2.287 ± 0.876 | 1.372 ± 0.796 | 1.83 ± 2.08 | 0.457 ± 0.265 | 0.915 ± 0.53 | 2.287 ± 0.848 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.915 ± 0.53 | 4.117 ± 1.244 | 0.915 ± 1.425 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.0 ± 0.0
Pro | 6.862 ± 2.516 | 1.372 ± 1.237 | 3.66 ± 2.451 | 7.777 ± 1.921 | 2.287 ± 1.827 | 3.202 ± 0.951 | 5.489 ± 2.66 | 3.66 ± 1.098 | 5.032 ± 2.295 | 6.862 ± 2.277 | 1.83 ± 0.792 | 0.915 ± 0.63 | 18.298 ± 18.066 | 2.745 ± 2.118 | 3.202 ± 2.957 | 11.436 ± 11.268 | 7.777 ± 4.474 | 3.66 ± 0.601 | 1.372 ± 0.856 | 2.287 ± 0.953 | 0.0 ± 0.0
Gln | 3.202 ± 1.279 | 0.0 ± 0.0 | 0.915 ± 0.63 | 3.66 ± 1.029 | 1.372 ± 1.366 | 3.202 ± 0.843 | 1.372 ± 0.653 | 1.83 ± 0.772 | 1.372 ± 1.066 | 3.202 ± 1.35 | 0.915 ± 0.53 | 1.83 ± 0.887 | 3.66 ± 2.595 | 1.83 ± 1.061 | 2.745 ± 1.487 | 1.83 ± 0.772 | 3.202 ± 1.397 | 1.83 ± 1.061 | 0.915 ± 0.53 | 1.372 ± 0.603 | 0.0 ± 0.0
Arg | 3.202 ± 1.337 | 0.915 ± 0.932 | 1.83 ± 1.665 | 2.287 ± 1.326 | 3.202 ± 0.843 | 3.202 ± 0.876 | 1.372 ± 1.823 | 0.457 ± 0.265 | 2.287 ± 1.774 | 6.404 ± 0.872 | 0.0 ± 0.0 | 0.915 ± 0.63 | 5.489 ± 3.423 | 3.66 ± 1.773 | 3.202 ± 0.862 | 4.575 ± 0.561 | 3.202 ± 1.241 | 2.745 ± 0.982 | 0.0 ± 0.0 | 3.202 ± 0.924 | 0.0 ± 0.0
Ser | 6.404 ± 1.337 | 0.915 ± 0.833 | 1.372 ± 0.796 | 3.202 ± 1.985 | 3.202 ± 1.279 | 5.032 ± 2.909 | 2.745 ± 2.209 | 2.287 ± 0.99 | 1.83 ± 0.772 | 8.234 ± 5.144 | 0.457 ± 1.024 | 1.83 ± 2.08 | 10.979 ± 6.327 | 4.575 ± 0.916 | 4.575 ± 2.181 | 8.234 ± 7.14 | 6.862 ± 5.922 | 2.745 ± 1.817 | 1.372 ± 1.933 | 1.372 ± 0.796 | 0.0 ± 0.0
Thr | 5.947 ± 2.227 | 0.457 ± 0.712 | 3.202 ± 0.843 | 3.66 ± 1.372 | 2.287 ± 1.033 | 3.66 ± 1.519 | 4.117 ± 1.714 | 3.202 ± 1.264 | 3.66 ± 1.7 | 10.064 ± 3.483 | 1.372 ± 0.796 | 2.287 ± 0.661 | 7.777 ± 1.529 | 4.117 ± 1.476 | 5.489 ± 3.631 | 5.947 ± 5.605 | 5.032 ± 5.597 | 2.287 ± 0.953 | 0.915 ± 0.53 | 1.83 ± 1.061 | 0.0 ± 0.0
Val | 3.66 ± 0.891 | 0.0 ± 0.0 | 0.915 ± 0.833 | 1.372 ± 0.796 | 1.83 ± 0.927 | 2.745 ± 1.817 | 0.457 ± 0.265 | 2.287 ± 0.953 | 0.915 ± 0.53 | 7.319 ± 1.738 | 1.83 ± 1.061 | 0.915 ± 0.63 | 4.117 ± 1.982 | 2.287 ± 0.953 | 2.745 ± 0.924 | 5.489 ± 1.829 | 1.83 ± 1.439 | 2.287 ± 1.45 | 0.0 ± 0.0 | 1.372 ± 0.796 | 0.0 ± 0.0
Trp | 0.457 ± 0.265 | 0.0 ± 0.0 | 0.457 ± 0.265 | 2.287 ± 0.848 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.915 ± 0.53 | 1.372 ± 1.366 | 0.915 ± 1.694 | 0.0 ± 0.0 | 0.457 ± 0.265 | 0.915 ± 2.048 | 1.83 ± 0.686 | 0.457 ± 0.265 | 0.0 ± 0.0 | 0.457 ± 1.024 | 0.0 ± 0.0
Tyr | 4.575 ± 2.652 | 1.372 ± 0.799 | 0.0 ± 0.0 | 1.372 ± 0.796 | 0.915 ± 0.63 | 2.287 ± 1.326 | 0.457 ± 0.265 | 2.287 ± 1.294 | 1.372 ± 0.796 | 4.117 ± 1.886 | 1.372 ± 0.796 | 0.0 ± 0.0 | 1.372 ± 1.066 | 0.0 ± 0.0 | 0.457 ± 0.943 | 0.915 ± 0.53 | 2.745 ± 1.166 | 0.915 ± 0.63 | 0.457 ± 0.265 | 0.915 ± 0.53 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6 proteins (2187 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
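For readers unfamiliar with the technique, protein-level bootstrapping can be sketched as follows. This is a rough illustration, not the procedure used for this page: it assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each of 100 replicates, and that the reported error is the sample standard deviation across replicates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency when proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation over replicates.
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAALPPS", "ALPSTPP", "MKLLAA"]
print(bootstrap_error(toy, "PP"))
```

Resampling whole proteins rather than individual residues means that a dipeptide concentrated in a single protein gets a large error, which would be consistent with entries such as ProPro: 18.298 ± 18.066 in the table above.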

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski