Amino acid dipepetide frequency for Drosophila albomicans (Fruit fly)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.213AlaAla: 11.213 ± 0.101
1.046AlaCys: 1.046 ± 0.019
3.509AlaAsp: 3.509 ± 0.021
4.837AlaGlu: 4.837 ± 0.04
2.048AlaPhe: 2.048 ± 0.016
4.73AlaGly: 4.73 ± 0.03
1.689AlaHis: 1.689 ± 0.016
3.425AlaIle: 3.425 ± 0.02
4.005AlaLys: 4.005 ± 0.026
6.29AlaLeu: 6.29 ± 0.032
1.701AlaMet: 1.701 ± 0.013
3.476AlaAsn: 3.476 ± 0.019
4.062AlaPro: 4.062 ± 0.043
3.595AlaGln: 3.595 ± 0.023
3.213AlaArg: 3.213 ± 0.017
6.366AlaSer: 6.366 ± 0.036
6.05AlaThr: 6.05 ± 0.046
4.502AlaVal: 4.502 ± 0.027
0.545AlaTrp: 0.545 ± 0.007
1.671AlaTyr: 1.671 ± 0.013
0.002AlaXaa: 0.002 ± 0.0
Cys
1.153CysAla: 1.153 ± 0.016
0.48CysCys: 0.48 ± 0.008
1.046CysAsp: 1.046 ± 0.013
1.073CysGlu: 1.073 ± 0.014
0.639CysPhe: 0.639 ± 0.009
1.19CysGly: 1.19 ± 0.021
0.465CysHis: 0.465 ± 0.007
0.944CysIle: 0.944 ± 0.013
0.878CysLys: 0.878 ± 0.01
1.57CysLeu: 1.57 ± 0.018
0.366CysMet: 0.366 ± 0.006
0.873CysAsn: 0.873 ± 0.013
0.836CysPro: 0.836 ± 0.024
0.783CysGln: 0.783 ± 0.017
0.955CysArg: 0.955 ± 0.025
1.442CysSer: 1.442 ± 0.02
0.882CysThr: 0.882 ± 0.012
1.07CysVal: 1.07 ± 0.02
0.178CysTrp: 0.178 ± 0.004
0.521CysTyr: 0.521 ± 0.008
0.001CysXaa: 0.001 ± 0.0
Asp
3.998AspAla: 3.998 ± 0.023
0.95AspCys: 0.95 ± 0.014
4.219AspAsp: 4.219 ± 0.04
4.466AspGlu: 4.466 ± 0.03
2.014AspPhe: 2.014 ± 0.013
3.071AspGly: 3.071 ± 0.024
1.042AspHis: 1.042 ± 0.01
2.894AspIle: 2.894 ± 0.018
2.849AspLys: 2.849 ± 0.024
4.535AspLeu: 4.535 ± 0.027
1.223AspMet: 1.223 ± 0.01
2.592AspAsn: 2.592 ± 0.016
2.134AspPro: 2.134 ± 0.025
1.942AspGln: 1.942 ± 0.02
2.401AspArg: 2.401 ± 0.022
3.933AspSer: 3.933 ± 0.028
2.53AspThr: 2.53 ± 0.015
3.506AspVal: 3.506 ± 0.028
0.579AspTrp: 0.579 ± 0.008
1.77AspTyr: 1.77 ± 0.013
0.001AspXaa: 0.001 ± 0.0
Glu
4.545GluAla: 4.545 ± 0.03
1.088GluCys: 1.088 ± 0.024
3.789GluAsp: 3.789 ± 0.027
5.913GluGlu: 5.913 ± 0.055
2.019GluPhe: 2.019 ± 0.015
2.61GluGly: 2.61 ± 0.021
1.7GluHis: 1.7 ± 0.014
3.315GluIle: 3.315 ± 0.027
4.238GluLys: 4.238 ± 0.058
6.548GluLeu: 6.548 ± 0.041
1.505GluMet: 1.505 ± 0.012
2.903GluAsn: 2.903 ± 0.02
2.843GluPro: 2.843 ± 0.028
4.101GluGln: 4.101 ± 0.031
4.077GluArg: 4.077 ± 0.032
4.485GluSer: 4.485 ± 0.056
3.573GluThr: 3.573 ± 0.031
3.536GluVal: 3.536 ± 0.035
0.555GluTrp: 0.555 ± 0.007
1.705GluTyr: 1.705 ± 0.012
0.002GluXaa: 0.002 ± 0.0
Phe
2.156PheAla: 2.156 ± 0.015
0.638PheCys: 0.638 ± 0.007
1.988PheAsp: 1.988 ± 0.015
2.059PheGlu: 2.059 ± 0.014
1.321PhePhe: 1.321 ± 0.014
2.188PheGly: 2.188 ± 0.018
0.819PheHis: 0.819 ± 0.01
1.834PheIle: 1.834 ± 0.017
1.814PheLys: 1.814 ± 0.014
2.941PheLeu: 2.941 ± 0.022
0.816PheMet: 0.816 ± 0.009
1.676PheAsn: 1.676 ± 0.014
1.272PhePro: 1.272 ± 0.012
1.359PheGln: 1.359 ± 0.012
1.663PheArg: 1.663 ± 0.014
2.342PheSer: 2.342 ± 0.015
1.725PheThr: 1.725 ± 0.015
2.26PheVal: 2.26 ± 0.015
0.383PheTrp: 0.383 ± 0.006
1.164PheTyr: 1.164 ± 0.011
0.001PheXaa: 0.001 ± 0.0
Gly
4.458GlyAla: 4.458 ± 0.033
0.91GlyCys: 0.91 ± 0.015
3.052GlyAsp: 3.052 ± 0.023
3.036GlyGlu: 3.036 ± 0.019
1.929GlyPhe: 1.929 ± 0.015
6.841GlyGly: 6.841 ± 0.105
1.515GlyHis: 1.515 ± 0.014
2.795GlyIle: 2.795 ± 0.018
2.866GlyLys: 2.866 ± 0.016
4.243GlyLeu: 4.243 ± 0.024
1.249GlyMet: 1.249 ± 0.012
3.241GlyAsn: 3.241 ± 0.023
2.51GlyPro: 2.51 ± 0.055
2.654GlyGln: 2.654 ± 0.056
2.905GlyArg: 2.905 ± 0.032
5.533GlySer: 5.533 ± 0.039
2.966GlyThr: 2.966 ± 0.024
3.431GlyVal: 3.431 ± 0.024
0.576GlyTrp: 0.576 ± 0.009
1.962GlyTyr: 1.962 ± 0.021
0.002GlyXaa: 0.002 ± 0.0
His
1.651HisAla: 1.651 ± 0.013
0.528HisCys: 0.528 ± 0.007
1.139HisAsp: 1.139 ± 0.01
1.442HisGlu: 1.442 ± 0.011
0.962HisPhe: 0.962 ± 0.009
1.463HisGly: 1.463 ± 0.013
1.474HisHis: 1.474 ± 0.022
1.295HisIle: 1.295 ± 0.011
1.348HisLys: 1.348 ± 0.011
2.417HisLeu: 2.417 ± 0.016
0.678HisMet: 0.678 ± 0.009
1.362HisAsn: 1.362 ± 0.014
1.277HisPro: 1.277 ± 0.013
1.74HisGln: 1.74 ± 0.018
1.358HisArg: 1.358 ± 0.012
2.237HisSer: 2.237 ± 0.02
1.332HisThr: 1.332 ± 0.013
1.401HisVal: 1.401 ± 0.011
0.259HisTrp: 0.259 ± 0.004
0.859HisTyr: 0.859 ± 0.01
0.001HisXaa: 0.001 ± 0.0
Ile
3.493IleAla: 3.493 ± 0.016
1.108IleCys: 1.108 ± 0.013
2.917IleAsp: 2.917 ± 0.02
3.459IleGlu: 3.459 ± 0.027
1.995IlePhe: 1.995 ± 0.019
2.625IleGly: 2.625 ± 0.019
1.043IleHis: 1.043 ± 0.01
2.832IleIle: 2.832 ± 0.018
2.949IleLys: 2.949 ± 0.023
3.994IleLeu: 3.994 ± 0.026
1.118IleMet: 1.118 ± 0.009
2.556IleAsn: 2.556 ± 0.015
2.224IlePro: 2.224 ± 0.016
1.933IleGln: 1.933 ± 0.015
2.339IleArg: 2.339 ± 0.016
4.006IleSer: 4.006 ± 0.023
2.843IleThr: 2.843 ± 0.016
3.291IleVal: 3.291 ± 0.022
0.511IleTrp: 0.511 ± 0.007
1.607IleTyr: 1.607 ± 0.015
0.002IleXaa: 0.002 ± 0.0
Lys
3.536LysAla: 3.536 ± 0.03
1.036LysCys: 1.036 ± 0.015
2.908LysAsp: 2.908 ± 0.025
3.892LysGlu: 3.892 ± 0.04
1.775LysPhe: 1.775 ± 0.012
2.256LysGly: 2.256 ± 0.018
1.419LysHis: 1.419 ± 0.012
2.755LysIle: 2.755 ± 0.024
3.997LysLys: 3.997 ± 0.052
5.412LysLeu: 5.412 ± 0.036
1.317LysMet: 1.317 ± 0.011
2.384LysAsn: 2.384 ± 0.018
3.074LysPro: 3.074 ± 0.03
3.052LysGln: 3.052 ± 0.025
3.545LysArg: 3.545 ± 0.021
4.362LysSer: 4.362 ± 0.03
3.106LysThr: 3.106 ± 0.021
2.92LysVal: 2.92 ± 0.024
0.548LysTrp: 0.548 ± 0.007
1.695LysTyr: 1.695 ± 0.013
0.002LysXaa: 0.002 ± 0.001
Leu
6.384LeuAla: 6.384 ± 0.028
1.569LeuCys: 1.569 ± 0.011
4.817LeuAsp: 4.817 ± 0.03
6.14LeuGlu: 6.14 ± 0.042
2.772LeuPhe: 2.772 ± 0.02
4.511LeuGly: 4.511 ± 0.027
2.492LeuHis: 2.492 ± 0.019
4.174LeuIle: 4.174 ± 0.029
5.305LeuLys: 5.305 ± 0.033
8.97LeuLeu: 8.97 ± 0.055
1.983LeuMet: 1.983 ± 0.014
4.468LeuAsn: 4.468 ± 0.023
5.008LeuPro: 5.008 ± 0.026
5.71LeuGln: 5.71 ± 0.041
5.383LeuArg: 5.383 ± 0.032
7.096LeuSer: 7.096 ± 0.037
4.732LeuThr: 4.732 ± 0.021
4.686LeuVal: 4.686 ± 0.029
0.81LeuTrp: 0.81 ± 0.009
2.33LeuTyr: 2.33 ± 0.015
0.004LeuXaa: 0.004 ± 0.001
Met
1.696MetAla: 1.696 ± 0.013
0.387MetCys: 0.387 ± 0.006
1.284MetAsp: 1.284 ± 0.011
1.526MetGlu: 1.526 ± 0.011
0.763MetPhe: 0.763 ± 0.01
1.246MetGly: 1.246 ± 0.014
0.618MetHis: 0.618 ± 0.007
0.905MetIle: 0.905 ± 0.009
1.165MetLys: 1.165 ± 0.011
2.205MetLeu: 2.205 ± 0.014
0.593MetMet: 0.593 ± 0.008
0.994MetAsn: 0.994 ± 0.009
1.362MetPro: 1.362 ± 0.014
1.339MetGln: 1.339 ± 0.014
1.414MetArg: 1.414 ± 0.01
1.81MetSer: 1.81 ± 0.014
1.126MetThr: 1.126 ± 0.011
1.135MetVal: 1.135 ± 0.011
0.208MetTrp: 0.208 ± 0.004
0.61MetTyr: 0.61 ± 0.007
0.001MetXaa: 0.001 ± 0.0
Asn
3.818AsnAla: 3.818 ± 0.022
0.958AsnCys: 0.958 ± 0.012
2.606AsnAsp: 2.606 ± 0.019
3.102AsnGlu: 3.102 ± 0.022
1.679AsnPhe: 1.679 ± 0.013
3.52AsnGly: 3.52 ± 0.028
1.102AsnHis: 1.102 ± 0.013
2.735AsnIle: 2.735 ± 0.019
2.53AsnLys: 2.53 ± 0.017
4.137AsnLeu: 4.137 ± 0.027
1.17AsnMet: 1.17 ± 0.011
4.447AsnAsn: 4.447 ± 0.05
1.999AsnPro: 1.999 ± 0.018
1.963AsnGln: 1.963 ± 0.017
2.273AsnArg: 2.273 ± 0.014
4.685AsnSer: 4.685 ± 0.034
2.643AsnThr: 2.643 ± 0.016
3.014AsnVal: 3.014 ± 0.017
0.49AsnTrp: 0.49 ± 0.007
1.563AsnTyr: 1.563 ± 0.011
0.002AsnXaa: 0.002 ± 0.0
Pro
4.289ProAla: 4.289 ± 0.041
0.67ProCys: 0.67 ± 0.029
2.293ProAsp: 2.293 ± 0.014
3.2ProGlu: 3.2 ± 0.034
1.459ProPhe: 1.459 ± 0.012
3.095ProGly: 3.095 ± 0.06
1.426ProHis: 1.426 ± 0.014
2.384ProIle: 2.384 ± 0.017
2.872ProLys: 2.872 ± 0.022
4.266ProLeu: 4.266 ± 0.024
1.064ProMet: 1.064 ± 0.01
2.328ProAsn: 2.328 ± 0.023
4.807ProPro: 4.807 ± 0.056
3.029ProGln: 3.029 ± 0.03
2.397ProArg: 2.397 ± 0.019
4.327ProSer: 4.327 ± 0.03
3.793ProThr: 3.793 ± 0.029
3.054ProVal: 3.054 ± 0.03
0.391ProTrp: 0.391 ± 0.005
1.353ProTyr: 1.353 ± 0.014
0.002ProXaa: 0.002 ± 0.0
Gln
3.478GlnAla: 3.478 ± 0.022
0.793GlnCys: 0.793 ± 0.017
1.985GlnAsp: 1.985 ± 0.022
2.964GlnGlu: 2.964 ± 0.022
1.484GlnPhe: 1.484 ± 0.012
2.251GlnGly: 2.251 ± 0.054
2.178GlnHis: 2.178 ± 0.021
2.205GlnIle: 2.205 ± 0.014
2.613GlnLys: 2.613 ± 0.026
6.242GlnLeu: 6.242 ± 0.048
1.222GlnMet: 1.222 ± 0.013
2.001GlnAsn: 2.001 ± 0.014
3.249GlnPro: 3.249 ± 0.035
11.396GlnGln: 11.396 ± 0.164
3.565GlnArg: 3.565 ± 0.025
3.818GlnSer: 3.818 ± 0.024
2.779GlnThr: 2.779 ± 0.02
2.469GlnVal: 2.469 ± 0.016
0.428GlnTrp: 0.428 ± 0.007
1.269GlnTyr: 1.269 ± 0.01
0.002GlnXaa: 0.002 ± 0.0
Arg
3.257ArgAla: 3.257 ± 0.015
0.957ArgCys: 0.957 ± 0.015
2.879ArgAsp: 2.879 ± 0.021
3.562ArgGlu: 3.562 ± 0.03
1.8ArgPhe: 1.8 ± 0.011
2.769ArgGly: 2.769 ± 0.033
1.545ArgHis: 1.545 ± 0.013
2.661ArgIle: 2.661 ± 0.016
3.386ArgLys: 3.386 ± 0.02
4.939ArgLeu: 4.939 ± 0.028
1.11ArgMet: 1.11 ± 0.01
2.709ArgAsn: 2.709 ± 0.017
2.576ArgPro: 2.576 ± 0.028
3.001ArgGln: 3.001 ± 0.022
4.287ArgArg: 4.287 ± 0.03
4.331ArgSer: 4.331 ± 0.029
2.65ArgThr: 2.65 ± 0.017
2.767ArgVal: 2.767 ± 0.018
0.534ArgTrp: 0.534 ± 0.008
1.622ArgTyr: 1.622 ± 0.012
0.003ArgXaa: 0.003 ± 0.001
Ser
6.129SerAla: 6.129 ± 0.038
1.395SerCys: 1.395 ± 0.02
4.073SerAsp: 4.073 ± 0.023
4.575SerGlu: 4.575 ± 0.047
2.487SerPhe: 2.487 ± 0.019
5.463SerGly: 5.463 ± 0.033
1.927SerHis: 1.927 ± 0.015
3.9SerIle: 3.9 ± 0.02
4.207SerLys: 4.207 ± 0.026
6.764SerLeu: 6.764 ± 0.033
1.768SerMet: 1.768 ± 0.014
4.957SerAsn: 4.957 ± 0.036
4.599SerPro: 4.599 ± 0.04
3.635SerGln: 3.635 ± 0.025
3.93SerArg: 3.93 ± 0.025
10.967SerSer: 10.967 ± 0.1
5.835SerThr: 5.835 ± 0.043
4.485SerVal: 4.485 ± 0.034
0.744SerTrp: 0.744 ± 0.01
2.177SerTyr: 2.177 ± 0.016
0.002SerXaa: 0.002 ± 0.0
Thr
5.4ThrAla: 5.4 ± 0.038
0.958ThrCys: 0.958 ± 0.015
2.685ThrAsp: 2.685 ± 0.019
3.536ThrGlu: 3.536 ± 0.031
1.778ThrPhe: 1.778 ± 0.013
3.251ThrGly: 3.251 ± 0.023
1.355ThrHis: 1.355 ± 0.011
2.978ThrIle: 2.978 ± 0.018
2.867ThrLys: 2.867 ± 0.022
5.182ThrLeu: 5.182 ± 0.026
1.23ThrMet: 1.23 ± 0.012
2.773ThrAsn: 2.773 ± 0.019
4.15ThrPro: 4.15 ± 0.028
2.652ThrGln: 2.652 ± 0.019
2.58ThrArg: 2.58 ± 0.016
5.208ThrSer: 5.208 ± 0.032
6.674ThrThr: 6.674 ± 0.103
3.507ThrVal: 3.507 ± 0.027
0.486ThrTrp: 0.486 ± 0.006
1.431ThrTyr: 1.431 ± 0.013
0.002ThrXaa: 0.002 ± 0.0
Val
4.87ValAla: 4.87 ± 0.035
1.077ValCys: 1.077 ± 0.013
3.279ValAsp: 3.279 ± 0.029
4.002ValGlu: 4.002 ± 0.042
1.924ValPhe: 1.924 ± 0.014
3.281ValGly: 3.281 ± 0.025
1.387ValHis: 1.387 ± 0.012
2.92ValIle: 2.92 ± 0.021
3.038ValLys: 3.038 ± 0.028
5.186ValLeu: 5.186 ± 0.029
1.281ValMet: 1.281 ± 0.011
2.674ValAsn: 2.674 ± 0.016
2.944ValPro: 2.944 ± 0.019
2.651ValGln: 2.651 ± 0.017
2.864ValArg: 2.864 ± 0.017
4.28ValSer: 4.28 ± 0.025
3.342ValThr: 3.342 ± 0.025
3.927ValVal: 3.927 ± 0.028
0.53ValTrp: 0.53 ± 0.007
1.623ValTyr: 1.623 ± 0.014
0.003ValXaa: 0.003 ± 0.0
Trp
0.498TrpAla: 0.498 ± 0.006
0.179TrpCys: 0.179 ± 0.004
0.472TrpAsp: 0.472 ± 0.007
0.495TrpGlu: 0.495 ± 0.007
0.358TrpPhe: 0.358 ± 0.006
0.443TrpGly: 0.443 ± 0.007
0.253TrpHis: 0.253 ± 0.004
0.505TrpIle: 0.505 ± 0.007
0.525TrpLys: 0.525 ± 0.007
1.066TrpLeu: 1.066 ± 0.014
0.263TrpMet: 0.263 ± 0.005
0.495TrpAsn: 0.495 ± 0.009
0.348TrpPro: 0.348 ± 0.005
0.525TrpGln: 0.525 ± 0.007
0.636TrpArg: 0.636 ± 0.007
0.714TrpSer: 0.714 ± 0.009
0.525TrpThr: 0.525 ± 0.007
0.48TrpVal: 0.48 ± 0.007
0.157TrpTrp: 0.157 ± 0.004
0.309TrpTyr: 0.309 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.953TyrAla: 1.953 ± 0.014
0.616TyrCys: 0.616 ± 0.008
1.708TyrAsp: 1.708 ± 0.012
1.768TyrGlu: 1.768 ± 0.012
1.211TyrPhe: 1.211 ± 0.011
1.867TyrGly: 1.867 ± 0.022
0.774TyrHis: 0.774 ± 0.009
1.388TyrIle: 1.388 ± 0.011
1.536TyrLys: 1.536 ± 0.013
2.433TyrLeu: 2.433 ± 0.015
0.73TyrMet: 0.73 ± 0.007
1.525TyrAsn: 1.525 ± 0.014
1.221TyrPro: 1.221 ± 0.011
1.328TyrGln: 1.328 ± 0.013
1.56TyrArg: 1.56 ± 0.012
2.073TyrSer: 2.073 ± 0.014
1.575TyrThr: 1.575 ± 0.013
1.642TyrVal: 1.642 ± 0.013
0.334TyrTrp: 0.334 ± 0.006
1.122TyrTyr: 1.122 ± 0.01
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.002XaaHis: 0.002 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.004XaaLeu: 0.004 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.002XaaGln: 0.002 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19749 proteins (14002934 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski