Amino acid dipeptide frequency for Drosophila melanogaster (Fruit fly)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
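As an illustration only, the counting step could be sketched as below. The function name `dipeptide_per_mille` is hypothetical, and the sketch assumes that overlapping pairs are counted within each protein, that pairs never span two proteins, and that any non-standard residue is mapped to Xaa (written as X). The database's actual procedure may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: every non-standard or unknown residue becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted within one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(alphabet, repeat=2)
    }
```

For example, `dipeptide_per_mille(["AAC", "AC"])` sees three pairs in total (AA, AC, AC), so AA accounts for one third of them and AC for two thirds.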

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
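The uniform expectation and the per mille scaling can be checked with a short sketch. The helper names are hypothetical, and the sketch assumes that the red/blue marking compares each value with the uniform expectation of 1/400 (0.25%, or 2.5‰); the exact threshold used for the table colouring is not stated here.

```python
def uniform_expectation_per_mille(n_amino_acids=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / n_amino_acids ** 2


def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected_per_mille=None):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if expected_per_mille is None:
        expected_per_mille = uniform_expectation_per_mille()
    if value_per_mille > expected_per_mille:
        return "over"    # would be marked red under this assumption
    if value_per_mille < expected_per_mille:
        return "under"   # would be marked blue under this assumption
    return "expected"


# Values taken from the table: AlaAla = 9.045, AlaTrp = 0.536 (both in per mille).
print(uniform_expectation_per_mille())   # 2.5
print(classify(9.045), classify(0.536))  # over under
```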

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.045AlaAla: 9.045 ± 0.087
1.229AlaCys: 1.229 ± 0.048
3.352AlaAsp: 3.352 ± 0.02
4.628AlaGlu: 4.628 ± 0.031
2.071AlaPhe: 2.071 ± 0.014
4.963AlaGly: 4.963 ± 0.032
1.676AlaHis: 1.676 ± 0.012
3.351AlaIle: 3.351 ± 0.017
3.971AlaLys: 3.971 ± 0.028
6.026AlaLeu: 6.026 ± 0.039
1.673AlaMet: 1.673 ± 0.015
3.299AlaAsn: 3.299 ± 0.019
4.234AlaPro: 4.234 ± 0.033
3.482AlaGln: 3.482 ± 0.024
3.218AlaArg: 3.218 ± 0.018
6.351AlaSer: 6.351 ± 0.038
4.987AlaThr: 4.987 ± 0.031
4.459AlaVal: 4.459 ± 0.022
0.536AlaTrp: 0.536 ± 0.007
1.657AlaTyr: 1.657 ± 0.012
0.002AlaXaa: 0.002 ± 0.0
Cys
1.177CysAla: 1.177 ± 0.029
0.445CysCys: 0.445 ± 0.008
1.062CysAsp: 1.062 ± 0.016
1.098CysGlu: 1.098 ± 0.018
0.656CysPhe: 0.656 ± 0.009
1.416CysGly: 1.416 ± 0.059
0.505CysHis: 0.505 ± 0.012
1.013CysIle: 1.013 ± 0.03
0.935CysLys: 0.935 ± 0.017
1.728CysLeu: 1.728 ± 0.038
0.361CysMet: 0.361 ± 0.005
0.854CysAsn: 0.854 ± 0.017
1.11CysPro: 1.11 ± 0.055
0.931CysGln: 0.931 ± 0.041
1.292CysArg: 1.292 ± 0.074
1.665CysSer: 1.665 ± 0.049
0.972CysThr: 0.972 ± 0.029
1.252CysVal: 1.252 ± 0.046
0.179CysTrp: 0.179 ± 0.003
0.525CysTyr: 0.525 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.525AspAla: 3.525 ± 0.02
1.025AspCys: 1.025 ± 0.026
3.753AspAsp: 3.753 ± 0.036
4.378AspGlu: 4.378 ± 0.037
2.09AspPhe: 2.09 ± 0.013
3.194AspGly: 3.194 ± 0.023
1.123AspHis: 1.123 ± 0.01
2.793AspIle: 2.793 ± 0.016
2.706AspLys: 2.706 ± 0.033
4.636AspLeu: 4.636 ± 0.032
1.206AspMet: 1.206 ± 0.011
2.25AspAsn: 2.25 ± 0.016
2.586AspPro: 2.586 ± 0.052
2.077AspGln: 2.077 ± 0.014
2.564AspArg: 2.564 ± 0.022
4.029AspSer: 4.029 ± 0.026
2.467AspThr: 2.467 ± 0.017
3.295AspVal: 3.295 ± 0.018
0.548AspTrp: 0.548 ± 0.007
1.72AspTyr: 1.72 ± 0.014
0.001AspXaa: 0.001 ± 0.0
Glu
4.537GluAla: 4.537 ± 0.028
1.326GluCys: 1.326 ± 0.065
4.139GluAsp: 4.139 ± 0.039
6.384GluGlu: 6.384 ± 0.111
2.085GluPhe: 2.085 ± 0.015
2.952GluGly: 2.952 ± 0.021
1.578GluHis: 1.578 ± 0.013
3.329GluIle: 3.329 ± 0.034
4.289GluLys: 4.289 ± 0.05
6.059GluLeu: 6.059 ± 0.042
1.513GluMet: 1.513 ± 0.015
2.9GluAsn: 2.9 ± 0.017
3.017GluPro: 3.017 ± 0.04
3.566GluGln: 3.566 ± 0.031
3.97GluArg: 3.97 ± 0.026
4.62GluSer: 4.62 ± 0.035
3.534GluThr: 3.534 ± 0.047
3.719GluVal: 3.719 ± 0.05
0.578GluTrp: 0.578 ± 0.009
1.726GluTyr: 1.726 ± 0.015
0.001GluXaa: 0.001 ± 0.0
Phe
2.102PheAla: 2.102 ± 0.017
0.689PheCys: 0.689 ± 0.007
1.949PheAsp: 1.949 ± 0.013
2.082PheGlu: 2.082 ± 0.015
1.321PhePhe: 1.321 ± 0.014
2.299PheGly: 2.299 ± 0.016
0.826PheHis: 0.826 ± 0.008
1.778PheIle: 1.778 ± 0.014
1.755PheLys: 1.755 ± 0.014
3.04PheLeu: 3.04 ± 0.025
0.799PheMet: 0.799 ± 0.01
1.615PheAsn: 1.615 ± 0.012
1.38PhePro: 1.38 ± 0.011
1.407PheGln: 1.407 ± 0.013
1.81PheArg: 1.81 ± 0.015
2.452PheSer: 2.452 ± 0.021
1.814PheThr: 1.814 ± 0.015
2.286PheVal: 2.286 ± 0.014
0.386PheTrp: 0.386 ± 0.005
1.137PheTyr: 1.137 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
4.678GlyAla: 4.678 ± 0.03
1.041GlyCys: 1.041 ± 0.028
3.21GlyAsp: 3.21 ± 0.025
3.247GlyGlu: 3.247 ± 0.025
2.066GlyPhe: 2.066 ± 0.016
7.056GlyGly: 7.056 ± 0.072
1.686GlyHis: 1.686 ± 0.013
2.787GlyIle: 2.787 ± 0.018
3.041GlyLys: 3.041 ± 0.02
4.56GlyLeu: 4.56 ± 0.028
1.331GlyMet: 1.331 ± 0.015
3.229GlyAsn: 3.229 ± 0.025
2.617GlyPro: 2.617 ± 0.025
2.703GlyGln: 2.703 ± 0.022
2.986GlyArg: 2.986 ± 0.02
6.429GlySer: 6.429 ± 0.039
3.259GlyThr: 3.259 ± 0.024
3.597GlyVal: 3.597 ± 0.022
0.573GlyTrp: 0.573 ± 0.008
2.0GlyTyr: 2.0 ± 0.025
0.001GlyXaa: 0.001 ± 0.0
His
1.564HisAla: 1.564 ± 0.012
0.554HisCys: 0.554 ± 0.011
1.106HisAsp: 1.106 ± 0.009
1.349HisGlu: 1.349 ± 0.012
0.968HisPhe: 0.968 ± 0.01
1.537HisGly: 1.537 ± 0.013
1.569HisHis: 1.569 ± 0.021
1.273HisIle: 1.273 ± 0.01
1.296HisLys: 1.296 ± 0.011
2.482HisLeu: 2.482 ± 0.018
0.674HisMet: 0.674 ± 0.009
1.205HisAsn: 1.205 ± 0.011
1.555HisPro: 1.555 ± 0.02
1.797HisGln: 1.797 ± 0.019
1.425HisArg: 1.425 ± 0.01
2.257HisSer: 2.257 ± 0.016
1.291HisThr: 1.291 ± 0.011
1.424HisVal: 1.424 ± 0.011
0.263HisTrp: 0.263 ± 0.004
0.845HisTyr: 0.845 ± 0.008
0.001HisXaa: 0.001 ± 0.0
Ile
3.249IleAla: 3.249 ± 0.019
1.145IleCys: 1.145 ± 0.029
2.617IleAsp: 2.617 ± 0.017
3.222IleGlu: 3.222 ± 0.039
1.946IlePhe: 1.946 ± 0.02
2.652IleGly: 2.652 ± 0.019
1.076IleHis: 1.076 ± 0.009
2.566IleIle: 2.566 ± 0.017
2.843IleLys: 2.843 ± 0.024
4.165IleLeu: 4.165 ± 0.03
1.057IleMet: 1.057 ± 0.011
2.397IleAsn: 2.397 ± 0.021
2.453IlePro: 2.453 ± 0.017
2.055IleGln: 2.055 ± 0.018
2.538IleArg: 2.538 ± 0.015
4.065IleSer: 4.065 ± 0.024
2.799IleThr: 2.799 ± 0.02
3.094IleVal: 3.094 ± 0.02
0.476IleTrp: 0.476 ± 0.005
1.555IleTyr: 1.555 ± 0.013
0.002IleXaa: 0.002 ± 0.0
Lys
3.528LysAla: 3.528 ± 0.023
1.13LysCys: 1.13 ± 0.031
2.884LysAsp: 2.884 ± 0.023
3.921LysGlu: 3.921 ± 0.035
1.744LysPhe: 1.744 ± 0.014
2.371LysGly: 2.371 ± 0.019
1.322LysHis: 1.322 ± 0.011
2.789LysIle: 2.789 ± 0.03
4.268LysLys: 4.268 ± 0.081
5.104LysLeu: 5.104 ± 0.036
1.327LysMet: 1.327 ± 0.013
2.359LysAsn: 2.359 ± 0.017
3.359LysPro: 3.359 ± 0.072
2.704LysGln: 2.704 ± 0.023
3.527LysArg: 3.527 ± 0.025
4.316LysSer: 4.316 ± 0.033
3.074LysThr: 3.074 ± 0.025
3.132LysVal: 3.132 ± 0.032
0.543LysTrp: 0.543 ± 0.007
1.641LysTyr: 1.641 ± 0.015
0.002LysXaa: 0.002 ± 0.0
Leu
6.397LeuAla: 6.397 ± 0.038
1.588LeuCys: 1.588 ± 0.019
4.772LeuAsp: 4.772 ± 0.029
6.105LeuGlu: 6.105 ± 0.042
2.746LeuPhe: 2.746 ± 0.024
4.72LeuGly: 4.72 ± 0.028
2.407LeuHis: 2.407 ± 0.019
4.02LeuIle: 4.02 ± 0.028
5.199LeuLys: 5.199 ± 0.036
8.336LeuLeu: 8.336 ± 0.059
1.924LeuMet: 1.924 ± 0.019
4.204LeuAsn: 4.204 ± 0.022
5.128LeuPro: 5.128 ± 0.027
5.1LeuGln: 5.1 ± 0.038
5.276LeuArg: 5.276 ± 0.036
6.79LeuSer: 6.79 ± 0.046
4.581LeuThr: 4.581 ± 0.031
4.849LeuVal: 4.849 ± 0.028
0.775LeuTrp: 0.775 ± 0.01
2.249LeuTyr: 2.249 ± 0.018
0.004LeuXaa: 0.004 ± 0.001
Met
1.749MetAla: 1.749 ± 0.016
0.394MetCys: 0.394 ± 0.006
1.319MetAsp: 1.319 ± 0.012
1.58MetGlu: 1.58 ± 0.014
0.732MetPhe: 0.732 ± 0.008
1.385MetGly: 1.385 ± 0.017
0.611MetHis: 0.611 ± 0.007
0.917MetIle: 0.917 ± 0.009
1.156MetLys: 1.156 ± 0.011
2.007MetLeu: 2.007 ± 0.019
0.613MetMet: 0.613 ± 0.008
0.967MetAsn: 0.967 ± 0.01
1.318MetPro: 1.318 ± 0.014
1.236MetGln: 1.236 ± 0.013
1.371MetArg: 1.371 ± 0.011
1.82MetSer: 1.82 ± 0.018
1.143MetThr: 1.143 ± 0.012
1.217MetVal: 1.217 ± 0.012
0.216MetTrp: 0.216 ± 0.004
0.615MetTyr: 0.615 ± 0.008
0.001MetXaa: 0.001 ± 0.0
Asn
3.45AsnAla: 3.45 ± 0.03
0.96AsnCys: 0.96 ± 0.02
2.271AsnAsp: 2.271 ± 0.016
2.802AsnGlu: 2.802 ± 0.016
1.658AsnPhe: 1.658 ± 0.013
3.633AsnGly: 3.633 ± 0.026
1.177AsnHis: 1.177 ± 0.018
2.473AsnIle: 2.473 ± 0.018
2.281AsnLys: 2.281 ± 0.016
3.981AsnLeu: 3.981 ± 0.025
1.077AsnMet: 1.077 ± 0.011
3.152AsnAsn: 3.152 ± 0.031
2.469AsnPro: 2.469 ± 0.037
2.157AsnGln: 2.157 ± 0.016
2.31AsnArg: 2.31 ± 0.017
4.363AsnSer: 4.363 ± 0.026
2.406AsnThr: 2.406 ± 0.016
2.838AsnVal: 2.838 ± 0.017
0.46AsnTrp: 0.46 ± 0.006
1.438AsnTyr: 1.438 ± 0.012
0.001AsnXaa: 0.001 ± 0.0
Pro
4.484ProAla: 4.484 ± 0.031
1.003ProCys: 1.003 ± 0.083
2.513ProAsp: 2.513 ± 0.017
3.738ProGlu: 3.738 ± 0.066
1.617ProPhe: 1.617 ± 0.02
3.425ProGly: 3.425 ± 0.038
1.51ProHis: 1.51 ± 0.014
2.519ProIle: 2.519 ± 0.025
3.078ProLys: 3.078 ± 0.044
4.459ProLeu: 4.459 ± 0.025
1.147ProMet: 1.147 ± 0.012
2.527ProAsn: 2.527 ± 0.024
5.736ProPro: 5.736 ± 0.055
3.014ProGln: 3.014 ± 0.028
2.609ProArg: 2.609 ± 0.015
5.051ProSer: 5.051 ± 0.074
3.897ProThr: 3.897 ± 0.041
3.42ProVal: 3.42 ± 0.026
0.434ProTrp: 0.434 ± 0.01
1.457ProTyr: 1.457 ± 0.015
0.001ProXaa: 0.001 ± 0.0
Gln
3.534GlnAla: 3.534 ± 0.024
0.931GlnCys: 0.931 ± 0.039
2.055GlnAsp: 2.055 ± 0.015
3.136GlnGlu: 3.136 ± 0.036
1.464GlnPhe: 1.464 ± 0.011
2.15GlnGly: 2.15 ± 0.015
2.005GlnHis: 2.005 ± 0.019
2.2GlnIle: 2.2 ± 0.014
2.69GlnLys: 2.69 ± 0.02
5.253GlnLeu: 5.253 ± 0.035
1.294GlnMet: 1.294 ± 0.016
2.185GlnAsn: 2.185 ± 0.014
3.135GlnPro: 3.135 ± 0.031
8.527GlnGln: 8.527 ± 0.122
3.346GlnArg: 3.346 ± 0.023
3.7GlnSer: 3.7 ± 0.031
2.639GlnThr: 2.639 ± 0.023
2.643GlnVal: 2.643 ± 0.016
0.426GlnTrp: 0.426 ± 0.005
1.239GlnTyr: 1.239 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
3.2ArgAla: 3.2 ± 0.02
1.083ArgCys: 1.083 ± 0.028
2.901ArgAsp: 2.901 ± 0.023
3.589ArgGlu: 3.589 ± 0.024
1.861ArgPhe: 1.861 ± 0.013
2.772ArgGly: 2.772 ± 0.022
1.554ArgHis: 1.554 ± 0.013
2.702ArgIle: 2.702 ± 0.017
3.448ArgLys: 3.448 ± 0.022
4.907ArgLeu: 4.907 ± 0.031
1.167ArgMet: 1.167 ± 0.011
2.737ArgAsn: 2.737 ± 0.015
2.899ArgPro: 2.899 ± 0.04
2.952ArgGln: 2.952 ± 0.021
4.378ArgArg: 4.378 ± 0.034
4.578ArgSer: 4.578 ± 0.03
2.852ArgThr: 2.852 ± 0.019
2.907ArgVal: 2.907 ± 0.021
0.534ArgTrp: 0.534 ± 0.007
1.59ArgTyr: 1.59 ± 0.013
0.002ArgXaa: 0.002 ± 0.0
Ser
6.416SerAla: 6.416 ± 0.038
1.552SerCys: 1.552 ± 0.049
4.126SerAsp: 4.126 ± 0.032
4.701SerGlu: 4.701 ± 0.032
2.548SerPhe: 2.548 ± 0.021
6.261SerGly: 6.261 ± 0.042
2.041SerHis: 2.041 ± 0.017
3.761SerIle: 3.761 ± 0.027
4.1SerLys: 4.1 ± 0.026
6.97SerLeu: 6.97 ± 0.047
1.77SerMet: 1.77 ± 0.015
4.331SerAsn: 4.331 ± 0.024
5.262SerPro: 5.262 ± 0.052
3.847SerGln: 3.847 ± 0.032
4.127SerArg: 4.127 ± 0.026
11.01SerSer: 11.01 ± 0.103
5.703SerThr: 5.703 ± 0.046
4.709SerVal: 4.709 ± 0.024
0.737SerTrp: 0.737 ± 0.01
2.122SerTyr: 2.122 ± 0.017
0.004SerXaa: 0.004 ± 0.001
Thr
4.649ThrAla: 4.649 ± 0.027
1.081ThrCys: 1.081 ± 0.023
2.677ThrAsp: 2.677 ± 0.022
3.514ThrGlu: 3.514 ± 0.041
1.806ThrPhe: 1.806 ± 0.014
3.491ThrGly: 3.491 ± 0.026
1.301ThrHis: 1.301 ± 0.011
2.852ThrIle: 2.852 ± 0.018
2.881ThrLys: 2.881 ± 0.024
4.878ThrLeu: 4.878 ± 0.03
1.207ThrMet: 1.207 ± 0.011
2.619ThrAsn: 2.619 ± 0.02
4.265ThrPro: 4.265 ± 0.039
2.389ThrGln: 2.389 ± 0.016
2.624ThrArg: 2.624 ± 0.024
5.344ThrSer: 5.344 ± 0.04
5.622ThrThr: 5.622 ± 0.116
3.547ThrVal: 3.547 ± 0.041
0.497ThrTrp: 0.497 ± 0.006
1.474ThrTyr: 1.474 ± 0.012
0.002ThrXaa: 0.002 ± 0.0
Val
4.578ValAla: 4.578 ± 0.023
1.204ValCys: 1.204 ± 0.036
3.144ValAsp: 3.144 ± 0.017
4.054ValGlu: 4.054 ± 0.057
1.998ValPhe: 1.998 ± 0.014
3.52ValGly: 3.52 ± 0.02
1.428ValHis: 1.428 ± 0.011
3.0ValIle: 3.0 ± 0.021
3.122ValLys: 3.122 ± 0.028
5.147ValLeu: 5.147 ± 0.026
1.272ValMet: 1.272 ± 0.012
2.655ValAsn: 2.655 ± 0.021
3.43ValPro: 3.43 ± 0.038
2.796ValGln: 2.796 ± 0.018
3.035ValArg: 3.035 ± 0.018
4.423ValSer: 4.423 ± 0.02
3.577ValThr: 3.577 ± 0.046
4.081ValVal: 4.081 ± 0.03
0.541ValTrp: 0.541 ± 0.007
1.632ValTyr: 1.632 ± 0.012
0.003ValXaa: 0.003 ± 0.0
Trp
0.512TrpAla: 0.512 ± 0.007
0.17TrpCys: 0.17 ± 0.004
0.471TrpAsp: 0.471 ± 0.007
0.508TrpGlu: 0.508 ± 0.007
0.38TrpPhe: 0.38 ± 0.006
0.455TrpGly: 0.455 ± 0.007
0.254TrpHis: 0.254 ± 0.004
0.495TrpIle: 0.495 ± 0.007
0.525TrpLys: 0.525 ± 0.006
1.009TrpLeu: 1.009 ± 0.01
0.26TrpMet: 0.26 ± 0.004
0.473TrpAsn: 0.473 ± 0.007
0.371TrpPro: 0.371 ± 0.006
0.472TrpGln: 0.472 ± 0.006
0.612TrpArg: 0.612 ± 0.006
0.722TrpSer: 0.722 ± 0.009
0.519TrpThr: 0.519 ± 0.011
0.5TrpVal: 0.5 ± 0.006
0.15TrpTrp: 0.15 ± 0.004
0.315TrpTyr: 0.315 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.837TyrAla: 1.837 ± 0.015
0.624TyrCys: 0.624 ± 0.009
1.643TyrAsp: 1.643 ± 0.013
1.783TyrGlu: 1.783 ± 0.017
1.174TyrPhe: 1.174 ± 0.011
1.843TyrGly: 1.843 ± 0.015
0.785TyrHis: 0.785 ± 0.008
1.353TyrIle: 1.353 ± 0.012
1.458TyrLys: 1.458 ± 0.012
2.417TyrLeu: 2.417 ± 0.019
0.698TyrMet: 0.698 ± 0.008
1.397TyrAsn: 1.397 ± 0.011
1.311TyrPro: 1.311 ± 0.012
1.337TyrGln: 1.337 ± 0.013
1.591TyrArg: 1.591 ± 0.011
2.111TyrSer: 2.111 ± 0.016
1.586TyrThr: 1.586 ± 0.014
1.67TyrVal: 1.67 ± 0.012
0.321TyrTrp: 0.321 ± 0.006
1.053TyrTyr: 1.053 ± 0.011
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.004XaaHis: 0.004 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.004XaaLeu: 0.004 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.004XaaGln: 0.004 ± 0.001
0.003XaaArg: 0.003 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.002XaaTyr: 0.002 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23524 proteins (16289205 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
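A minimal sketch of such a protein-level bootstrap is shown below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported ± value is the standard deviation across replicates; the function name `bootstrap_error` and the fixed seed are illustrative and are not taken from the database.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of a dipeptide's per mille frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return stdev(estimates)
```

Resampling proteins rather than individual dipeptides keeps the pairs within one protein together, so the estimate reflects variation between proteins.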

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski