Amino acid dipepetide frequency for Drosophila sechellia (Fruit fly)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.556AlaAla: 8.556 ± 0.082
1.194AlaCys: 1.194 ± 0.028
3.403AlaAsp: 3.403 ± 0.021
4.515AlaGlu: 4.515 ± 0.03
2.272AlaPhe: 2.272 ± 0.02
4.84AlaGly: 4.84 ± 0.039
1.713AlaHis: 1.713 ± 0.017
3.493AlaIle: 3.493 ± 0.027
4.166AlaLys: 4.166 ± 0.034
6.319AlaLeu: 6.319 ± 0.037
1.753AlaMet: 1.753 ± 0.018
3.227AlaAsn: 3.227 ± 0.024
4.165AlaPro: 4.165 ± 0.044
3.299AlaGln: 3.299 ± 0.028
3.38AlaArg: 3.38 ± 0.024
6.105AlaSer: 6.105 ± 0.035
4.864AlaThr: 4.864 ± 0.033
4.561AlaVal: 4.561 ± 0.026
0.602AlaTrp: 0.602 ± 0.009
1.803AlaTyr: 1.803 ± 0.018
0.002AlaXaa: 0.002 ± 0.0
Cys
1.218CysAla: 1.218 ± 0.021
0.528CysCys: 0.528 ± 0.011
1.087CysAsp: 1.087 ± 0.02
1.119CysGlu: 1.119 ± 0.019
0.726CysPhe: 0.726 ± 0.011
1.404CysGly: 1.404 ± 0.033
0.524CysHis: 0.524 ± 0.011
0.975CysIle: 0.975 ± 0.019
0.938CysLys: 0.938 ± 0.015
1.84CysLeu: 1.84 ± 0.025
0.411CysMet: 0.411 ± 0.008
0.851CysAsn: 0.851 ± 0.018
1.059CysPro: 1.059 ± 0.032
0.872CysGln: 0.872 ± 0.025
1.18CysArg: 1.18 ± 0.036
1.662CysSer: 1.662 ± 0.03
0.974CysThr: 0.974 ± 0.018
1.194CysVal: 1.194 ± 0.025
0.215CysTrp: 0.215 ± 0.005
0.563CysTyr: 0.563 ± 0.011
0.001CysXaa: 0.001 ± 0.0
Asp
3.554AspAla: 3.554 ± 0.026
1.017AspCys: 1.017 ± 0.02
3.634AspAsp: 3.634 ± 0.036
4.193AspGlu: 4.193 ± 0.032
2.205AspPhe: 2.205 ± 0.018
3.107AspGly: 3.107 ± 0.025
1.134AspHis: 1.134 ± 0.014
2.807AspIle: 2.807 ± 0.018
2.628AspLys: 2.628 ± 0.023
4.799AspLeu: 4.799 ± 0.032
1.278AspMet: 1.278 ± 0.015
2.211AspAsn: 2.211 ± 0.024
2.436AspPro: 2.436 ± 0.032
2.027AspGln: 2.027 ± 0.02
2.525AspArg: 2.525 ± 0.024
3.927AspSer: 3.927 ± 0.028
2.425AspThr: 2.425 ± 0.023
3.305AspVal: 3.305 ± 0.024
0.615AspTrp: 0.615 ± 0.011
1.775AspTyr: 1.775 ± 0.016
0.0AspXaa: 0.0 ± 0.0
Glu
4.438GluAla: 4.438 ± 0.035
1.153GluCys: 1.153 ± 0.035
3.991GluAsp: 3.991 ± 0.034
5.783GluGlu: 5.783 ± 0.062
2.187GluPhe: 2.187 ± 0.022
2.803GluGly: 2.803 ± 0.027
1.629GluHis: 1.629 ± 0.015
3.205GluIle: 3.205 ± 0.024
4.013GluLys: 4.013 ± 0.037
6.151GluLeu: 6.151 ± 0.044
1.583GluMet: 1.583 ± 0.014
2.861GluAsn: 2.861 ± 0.022
2.827GluPro: 2.827 ± 0.029
3.463GluGln: 3.463 ± 0.03
3.949GluArg: 3.949 ± 0.035
4.34GluSer: 4.34 ± 0.029
3.202GluThr: 3.202 ± 0.027
3.586GluVal: 3.586 ± 0.028
0.581GluTrp: 0.581 ± 0.01
1.737GluTyr: 1.737 ± 0.016
0.001GluXaa: 0.001 ± 0.0
Phe
2.386PheAla: 2.386 ± 0.021
0.774PheCys: 0.774 ± 0.012
2.067PheAsp: 2.067 ± 0.019
2.192PheGlu: 2.192 ± 0.021
1.499PhePhe: 1.499 ± 0.02
2.472PheGly: 2.472 ± 0.026
0.909PheHis: 0.909 ± 0.01
1.847PheIle: 1.847 ± 0.022
1.873PheLys: 1.873 ± 0.018
3.443PheLeu: 3.443 ± 0.027
0.903PheMet: 0.903 ± 0.013
1.692PheAsn: 1.692 ± 0.016
1.494PhePro: 1.494 ± 0.017
1.504PheGln: 1.504 ± 0.016
1.964PheArg: 1.964 ± 0.019
2.627PheSer: 2.627 ± 0.022
1.839PheThr: 1.839 ± 0.016
2.472PheVal: 2.472 ± 0.019
0.457PheTrp: 0.457 ± 0.01
1.313PheTyr: 1.313 ± 0.016
0.001PheXaa: 0.001 ± 0.0
Gly
4.499GlyAla: 4.499 ± 0.034
1.102GlyCys: 1.102 ± 0.02
3.03GlyAsp: 3.03 ± 0.027
3.164GlyGlu: 3.164 ± 0.028
2.24GlyPhe: 2.24 ± 0.022
6.459GlyGly: 6.459 ± 0.09
1.697GlyHis: 1.697 ± 0.02
2.975GlyIle: 2.975 ± 0.026
3.141GlyLys: 3.141 ± 0.022
4.813GlyLeu: 4.813 ± 0.034
1.344GlyMet: 1.344 ± 0.016
2.99GlyAsn: 2.99 ± 0.03
2.492GlyPro: 2.492 ± 0.037
2.59GlyGln: 2.59 ± 0.023
3.092GlyArg: 3.092 ± 0.031
5.884GlySer: 5.884 ± 0.043
3.147GlyThr: 3.147 ± 0.025
3.634GlyVal: 3.634 ± 0.028
0.64GlyTrp: 0.64 ± 0.011
2.066GlyTyr: 2.066 ± 0.024
0.002GlyXaa: 0.002 ± 0.0
His
1.55HisAla: 1.55 ± 0.017
0.601HisCys: 0.601 ± 0.013
1.096HisAsp: 1.096 ± 0.013
1.356HisGlu: 1.356 ± 0.017
1.055HisPhe: 1.055 ± 0.014
1.566HisGly: 1.566 ± 0.021
1.474HisHis: 1.474 ± 0.031
1.338HisIle: 1.338 ± 0.015
1.315HisLys: 1.315 ± 0.013
2.663HisLeu: 2.663 ± 0.021
0.73HisMet: 0.73 ± 0.01
1.172HisAsn: 1.172 ± 0.015
1.541HisPro: 1.541 ± 0.02
1.647HisGln: 1.647 ± 0.024
1.536HisArg: 1.536 ± 0.015
2.141HisSer: 2.141 ± 0.021
1.317HisThr: 1.317 ± 0.016
1.449HisVal: 1.449 ± 0.017
0.315HisTrp: 0.315 ± 0.007
0.884HisTyr: 0.884 ± 0.012
0.001HisXaa: 0.001 ± 0.0
Ile
3.354IleAla: 3.354 ± 0.021
1.203IleCys: 1.203 ± 0.019
2.598IleAsp: 2.598 ± 0.023
2.988IleGlu: 2.988 ± 0.028
2.126IlePhe: 2.126 ± 0.019
2.69IleGly: 2.69 ± 0.025
1.138IleHis: 1.138 ± 0.013
2.575IleIle: 2.575 ± 0.025
2.781IleLys: 2.781 ± 0.025
4.441IleLeu: 4.441 ± 0.029
1.149IleMet: 1.149 ± 0.015
2.346IleAsn: 2.346 ± 0.02
2.408IlePro: 2.408 ± 0.02
1.99IleGln: 1.99 ± 0.017
2.65IleArg: 2.65 ± 0.021
4.048IleSer: 4.048 ± 0.029
2.731IleThr: 2.731 ± 0.023
3.147IleVal: 3.147 ± 0.024
0.574IleTrp: 0.574 ± 0.011
1.739IleTyr: 1.739 ± 0.015
0.001IleXaa: 0.001 ± 0.0
Lys
3.591LysAla: 3.591 ± 0.031
1.129LysCys: 1.129 ± 0.022
2.911LysAsp: 2.911 ± 0.025
3.855LysGlu: 3.855 ± 0.035
1.879LysPhe: 1.879 ± 0.019
2.417LysGly: 2.417 ± 0.025
1.364LysHis: 1.364 ± 0.013
2.777LysIle: 2.777 ± 0.022
4.161LysLys: 4.161 ± 0.055
5.258LysLeu: 5.258 ± 0.033
1.46LysMet: 1.46 ± 0.016
2.382LysAsn: 2.382 ± 0.02
3.133LysPro: 3.133 ± 0.035
2.67LysGln: 2.67 ± 0.024
3.696LysArg: 3.696 ± 0.029
4.222LysSer: 4.222 ± 0.028
3.033LysThr: 3.033 ± 0.023
3.14LysVal: 3.14 ± 0.026
0.608LysTrp: 0.608 ± 0.009
1.748LysTyr: 1.748 ± 0.018
0.001LysXaa: 0.001 ± 0.0
Leu
6.843LeuAla: 6.843 ± 0.04
1.729LeuCys: 1.729 ± 0.021
4.837LeuAsp: 4.837 ± 0.033
5.972LeuGlu: 5.972 ± 0.04
3.075LeuPhe: 3.075 ± 0.025
5.076LeuGly: 5.076 ± 0.036
2.505LeuHis: 2.505 ± 0.024
4.324LeuIle: 4.324 ± 0.031
5.389LeuLys: 5.389 ± 0.035
9.292LeuLeu: 9.292 ± 0.061
2.166LeuMet: 2.166 ± 0.021
4.197LeuAsn: 4.197 ± 0.028
5.178LeuPro: 5.178 ± 0.029
5.097LeuGln: 5.097 ± 0.041
5.614LeuArg: 5.614 ± 0.038
6.885LeuSer: 6.885 ± 0.038
4.74LeuThr: 4.74 ± 0.027
5.319LeuVal: 5.319 ± 0.034
0.915LeuTrp: 0.915 ± 0.012
2.509LeuTyr: 2.509 ± 0.023
0.001LeuXaa: 0.001 ± 0.0
Met
1.94MetAla: 1.94 ± 0.019
0.444MetCys: 0.444 ± 0.009
1.414MetAsp: 1.414 ± 0.014
1.669MetGlu: 1.669 ± 0.017
0.828MetPhe: 0.828 ± 0.013
1.444MetGly: 1.444 ± 0.019
0.638MetHis: 0.638 ± 0.009
1.002MetIle: 1.002 ± 0.013
1.231MetLys: 1.231 ± 0.015
2.205MetLeu: 2.205 ± 0.018
0.672MetMet: 0.672 ± 0.012
0.973MetAsn: 0.973 ± 0.012
1.367MetPro: 1.367 ± 0.015
1.252MetGln: 1.252 ± 0.015
1.502MetArg: 1.502 ± 0.014
1.893MetSer: 1.893 ± 0.017
1.194MetThr: 1.194 ± 0.014
1.336MetVal: 1.336 ± 0.016
0.256MetTrp: 0.256 ± 0.006
0.64MetTyr: 0.64 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
3.265AsnAla: 3.265 ± 0.029
0.991AsnCys: 0.991 ± 0.017
2.161AsnAsp: 2.161 ± 0.02
2.736AsnGlu: 2.736 ± 0.024
1.801AsnPhe: 1.801 ± 0.016
3.358AsnGly: 3.358 ± 0.032
1.132AsnHis: 1.132 ± 0.016
2.42AsnIle: 2.42 ± 0.02
2.279AsnLys: 2.279 ± 0.018
4.071AsnLeu: 4.071 ± 0.028
1.17AsnMet: 1.17 ± 0.015
2.841AsnAsn: 2.841 ± 0.037
2.381AsnPro: 2.381 ± 0.029
1.96AsnGln: 1.96 ± 0.019
2.361AsnArg: 2.361 ± 0.02
3.994AsnSer: 3.994 ± 0.034
2.333AsnThr: 2.333 ± 0.018
2.784AsnVal: 2.784 ± 0.021
0.533AsnTrp: 0.533 ± 0.009
1.565AsnTyr: 1.565 ± 0.016
0.001AsnXaa: 0.001 ± 0.0
Pro
4.424ProAla: 4.424 ± 0.041
0.791ProCys: 0.791 ± 0.04
2.435ProAsp: 2.435 ± 0.021
3.465ProGlu: 3.465 ± 0.036
1.662ProPhe: 1.662 ± 0.016
3.14ProGly: 3.14 ± 0.046
1.47ProHis: 1.47 ± 0.02
2.441ProIle: 2.441 ± 0.022
3.082ProLys: 3.082 ± 0.03
4.517ProLeu: 4.517 ± 0.03
1.154ProMet: 1.154 ± 0.015
2.402ProAsn: 2.402 ± 0.026
5.038ProPro: 5.038 ± 0.071
2.777ProGln: 2.777 ± 0.033
2.639ProArg: 2.639 ± 0.024
4.549ProSer: 4.549 ± 0.037
3.513ProThr: 3.513 ± 0.035
3.362ProVal: 3.362 ± 0.027
0.483ProTrp: 0.483 ± 0.008
1.429ProTyr: 1.429 ± 0.017
0.001ProXaa: 0.001 ± 0.0
Gln
3.299GlnAla: 3.299 ± 0.027
0.849GlnCys: 0.849 ± 0.024
2.037GlnAsp: 2.037 ± 0.019
3.021GlnGlu: 3.021 ± 0.029
1.563GlnPhe: 1.563 ± 0.016
2.064GlnGly: 2.064 ± 0.021
1.828GlnHis: 1.828 ± 0.027
2.159GlnIle: 2.159 ± 0.019
2.654GlnLys: 2.654 ± 0.023
5.304GlnLeu: 5.304 ± 0.044
1.308GlnMet: 1.308 ± 0.017
2.082GlnAsn: 2.082 ± 0.021
2.849GlnPro: 2.849 ± 0.031
6.717GlnGln: 6.717 ± 0.139
3.347GlnArg: 3.347 ± 0.025
3.43GlnSer: 3.43 ± 0.025
2.419GlnThr: 2.419 ± 0.02
2.571GlnVal: 2.571 ± 0.022
0.454GlnTrp: 0.454 ± 0.009
1.263GlnTyr: 1.263 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
3.258ArgAla: 3.258 ± 0.023
1.142ArgCys: 1.142 ± 0.021
2.834ArgAsp: 2.834 ± 0.025
3.542ArgGlu: 3.542 ± 0.031
2.08ArgPhe: 2.08 ± 0.021
2.846ArgGly: 2.846 ± 0.029
1.661ArgHis: 1.661 ± 0.017
2.904ArgIle: 2.904 ± 0.025
3.563ArgLys: 3.563 ± 0.029
5.308ArgLeu: 5.308 ± 0.034
1.337ArgMet: 1.337 ± 0.014
2.839ArgAsn: 2.839 ± 0.026
2.761ArgPro: 2.761 ± 0.027
2.987ArgGln: 2.987 ± 0.027
4.698ArgArg: 4.698 ± 0.044
4.633ArgSer: 4.633 ± 0.038
2.85ArgThr: 2.85 ± 0.02
2.894ArgVal: 2.894 ± 0.026
0.595ArgTrp: 0.595 ± 0.01
1.706ArgTyr: 1.706 ± 0.018
0.002ArgXaa: 0.002 ± 0.0
Ser
6.17SerAla: 6.17 ± 0.042
1.529SerCys: 1.529 ± 0.03
3.961SerAsp: 3.961 ± 0.028
4.478SerGlu: 4.478 ± 0.031
2.72SerPhe: 2.72 ± 0.021
5.972SerGly: 5.972 ± 0.042
1.992SerHis: 1.992 ± 0.021
3.659SerIle: 3.659 ± 0.025
4.136SerLys: 4.136 ± 0.028
6.954SerLeu: 6.954 ± 0.043
1.829SerMet: 1.829 ± 0.02
4.073SerAsn: 4.073 ± 0.036
4.656SerPro: 4.656 ± 0.046
3.504SerGln: 3.504 ± 0.029
4.098SerArg: 4.098 ± 0.028
10.089SerSer: 10.089 ± 0.088
5.229SerThr: 5.229 ± 0.041
4.578SerVal: 4.578 ± 0.028
0.795SerTrp: 0.795 ± 0.012
2.272SerTyr: 2.272 ± 0.021
0.002SerXaa: 0.002 ± 0.0
Thr
4.525ThrAla: 4.525 ± 0.032
1.036ThrCys: 1.036 ± 0.021
2.639ThrAsp: 2.639 ± 0.022
3.238ThrGlu: 3.238 ± 0.031
1.881ThrPhe: 1.881 ± 0.018
3.356ThrGly: 3.356 ± 0.028
1.326ThrHis: 1.326 ± 0.014
2.809ThrIle: 2.809 ± 0.022
2.835ThrLys: 2.835 ± 0.023
4.893ThrLeu: 4.893 ± 0.03
1.209ThrMet: 1.209 ± 0.014
2.467ThrAsn: 2.467 ± 0.022
3.843ThrPro: 3.843 ± 0.037
2.277ThrGln: 2.277 ± 0.02
2.569ThrArg: 2.569 ± 0.023
4.948ThrSer: 4.948 ± 0.038
4.746ThrThr: 4.746 ± 0.093
3.317ThrVal: 3.317 ± 0.025
0.525ThrTrp: 0.525 ± 0.01
1.539ThrTyr: 1.539 ± 0.015
0.001ThrXaa: 0.001 ± 0.0
Val
4.777ValAla: 4.777 ± 0.031
1.257ValCys: 1.257 ± 0.022
3.239ValAsp: 3.239 ± 0.022
3.827ValGlu: 3.827 ± 0.035
2.21ValPhe: 2.21 ± 0.023
3.51ValGly: 3.51 ± 0.028
1.499ValHis: 1.499 ± 0.018
2.946ValIle: 2.946 ± 0.021
3.115ValLys: 3.115 ± 0.028
5.527ValLeu: 5.527 ± 0.036
1.36ValMet: 1.36 ± 0.015
2.563ValAsn: 2.563 ± 0.023
3.313ValPro: 3.313 ± 0.025
2.714ValGln: 2.714 ± 0.023
3.178ValArg: 3.178 ± 0.025
4.337ValSer: 4.337 ± 0.03
3.298ValThr: 3.298 ± 0.025
4.136ValVal: 4.136 ± 0.031
0.617ValTrp: 0.617 ± 0.01
1.737ValTyr: 1.737 ± 0.017
0.001ValXaa: 0.001 ± 0.0
Trp
0.586TrpAla: 0.586 ± 0.009
0.203TrpCys: 0.203 ± 0.006
0.524TrpAsp: 0.524 ± 0.01
0.545TrpGlu: 0.545 ± 0.009
0.403TrpPhe: 0.403 ± 0.009
0.534TrpGly: 0.534 ± 0.011
0.302TrpHis: 0.302 ± 0.007
0.567TrpIle: 0.567 ± 0.009
0.586TrpLys: 0.586 ± 0.01
1.13TrpLeu: 1.13 ± 0.014
0.295TrpMet: 0.295 ± 0.006
0.503TrpAsn: 0.503 ± 0.009
0.451TrpPro: 0.451 ± 0.008
0.534TrpGln: 0.534 ± 0.01
0.727TrpArg: 0.727 ± 0.011
0.827TrpSer: 0.827 ± 0.011
0.568TrpThr: 0.568 ± 0.01
0.54TrpVal: 0.54 ± 0.01
0.177TrpTrp: 0.177 ± 0.006
0.322TrpTyr: 0.322 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.997TyrAla: 1.997 ± 0.019
0.666TyrCys: 0.666 ± 0.011
1.701TyrAsp: 1.701 ± 0.016
1.825TyrGlu: 1.825 ± 0.017
1.315TyrPhe: 1.315 ± 0.016
1.941TyrGly: 1.941 ± 0.021
0.814TyrHis: 0.814 ± 0.01
1.454TyrIle: 1.454 ± 0.015
1.549TyrLys: 1.549 ± 0.016
2.735TyrLeu: 2.735 ± 0.025
0.787TyrMet: 0.787 ± 0.011
1.462TyrAsn: 1.462 ± 0.017
1.381TyrPro: 1.381 ± 0.017
1.339TyrGln: 1.339 ± 0.016
1.733TyrArg: 1.733 ± 0.018
2.157TyrSer: 2.157 ± 0.023
1.556TyrThr: 1.556 ± 0.02
1.833TyrVal: 1.833 ± 0.017
0.364TyrTrp: 0.364 ± 0.008
1.17TyrTyr: 1.17 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.001
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
1.67XaaXaa: 1.67 ± 0.232
Statistics based on 16138 proteins (7110928 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski