Amino acid dipepetide frequency for Drosophila ananassae (Fruit fly)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.892AlaAla: 8.892 ± 0.097
1.296AlaCys: 1.296 ± 0.059
3.323AlaAsp: 3.323 ± 0.019
4.723AlaGlu: 4.723 ± 0.029
2.062AlaPhe: 2.062 ± 0.016
5.006AlaGly: 5.006 ± 0.031
1.584AlaHis: 1.584 ± 0.012
3.335AlaIle: 3.335 ± 0.02
4.116AlaLys: 4.116 ± 0.028
6.069AlaLeu: 6.069 ± 0.038
1.621AlaMet: 1.621 ± 0.013
3.149AlaAsn: 3.149 ± 0.021
4.401AlaPro: 4.401 ± 0.039
3.346AlaGln: 3.346 ± 0.027
3.322AlaArg: 3.322 ± 0.021
6.138AlaSer: 6.138 ± 0.043
4.862AlaThr: 4.862 ± 0.032
4.442AlaVal: 4.442 ± 0.022
0.568AlaTrp: 0.568 ± 0.008
1.687AlaTyr: 1.687 ± 0.013
0.002AlaXaa: 0.002 ± 0.0
Cys
1.196CysAla: 1.196 ± 0.035
0.444CysCys: 0.444 ± 0.008
1.047CysAsp: 1.047 ± 0.019
1.128CysGlu: 1.128 ± 0.023
0.674CysPhe: 0.674 ± 0.01
1.513CysGly: 1.513 ± 0.074
0.511CysHis: 0.511 ± 0.013
1.047CysIle: 1.047 ± 0.037
0.96CysLys: 0.96 ± 0.015
1.779CysLeu: 1.779 ± 0.05
0.354CysMet: 0.354 ± 0.006
0.893CysAsn: 0.893 ± 0.02
1.203CysPro: 1.203 ± 0.068
1.021CysGln: 1.021 ± 0.051
1.416CysArg: 1.416 ± 0.093
1.673CysSer: 1.673 ± 0.054
1.023CysThr: 1.023 ± 0.032
1.293CysVal: 1.293 ± 0.056
0.179CysTrp: 0.179 ± 0.004
0.552CysTyr: 0.552 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.61AspAla: 3.61 ± 0.024
1.058AspCys: 1.058 ± 0.032
3.699AspAsp: 3.699 ± 0.034
4.336AspGlu: 4.336 ± 0.029
2.086AspPhe: 2.086 ± 0.015
3.237AspGly: 3.237 ± 0.03
1.097AspHis: 1.097 ± 0.01
2.799AspIle: 2.799 ± 0.023
2.73AspLys: 2.73 ± 0.022
4.724AspLeu: 4.724 ± 0.036
1.203AspMet: 1.203 ± 0.013
2.26AspAsn: 2.26 ± 0.019
2.89AspPro: 2.89 ± 0.058
2.012AspGln: 2.012 ± 0.015
2.606AspArg: 2.606 ± 0.023
4.073AspSer: 4.073 ± 0.026
2.534AspThr: 2.534 ± 0.017
3.314AspVal: 3.314 ± 0.021
0.585AspTrp: 0.585 ± 0.009
1.718AspTyr: 1.718 ± 0.015
0.001AspXaa: 0.001 ± 0.0
Glu
4.786GluAla: 4.786 ± 0.039
1.396GluCys: 1.396 ± 0.079
4.147GluAsp: 4.147 ± 0.029
6.394GluGlu: 6.394 ± 0.072
2.105GluPhe: 2.105 ± 0.018
3.041GluGly: 3.041 ± 0.026
1.539GluHis: 1.539 ± 0.013
3.332GluIle: 3.332 ± 0.029
4.509GluLys: 4.509 ± 0.039
6.074GluLeu: 6.074 ± 0.046
1.492GluMet: 1.492 ± 0.016
3.009GluAsn: 3.009 ± 0.021
3.174GluPro: 3.174 ± 0.036
3.541GluGln: 3.541 ± 0.032
4.08GluArg: 4.08 ± 0.031
4.51GluSer: 4.51 ± 0.03
3.475GluThr: 3.475 ± 0.035
3.77GluVal: 3.77 ± 0.041
0.588GluTrp: 0.588 ± 0.01
1.78GluTyr: 1.78 ± 0.018
0.001GluXaa: 0.001 ± 0.0
Phe
2.096PheAla: 2.096 ± 0.015
0.69PheCys: 0.69 ± 0.009
2.029PheAsp: 2.029 ± 0.016
2.088PheGlu: 2.088 ± 0.016
1.363PhePhe: 1.363 ± 0.014
2.334PheGly: 2.334 ± 0.02
0.855PheHis: 0.855 ± 0.009
1.774PheIle: 1.774 ± 0.017
1.789PheLys: 1.789 ± 0.015
3.143PheLeu: 3.143 ± 0.025
0.801PheMet: 0.801 ± 0.009
1.631PheAsn: 1.631 ± 0.011
1.37PhePro: 1.37 ± 0.012
1.429PheGln: 1.429 ± 0.012
1.899PheArg: 1.899 ± 0.02
2.527PheSer: 2.527 ± 0.02
1.764PheThr: 1.764 ± 0.014
2.302PheVal: 2.302 ± 0.016
0.4PheTrp: 0.4 ± 0.008
1.17PheTyr: 1.17 ± 0.011
0.001PheXaa: 0.001 ± 0.0
Gly
4.661GlyAla: 4.661 ± 0.037
1.099GlyCys: 1.099 ± 0.031
3.228GlyAsp: 3.228 ± 0.03
3.435GlyGlu: 3.435 ± 0.034
2.164GlyPhe: 2.164 ± 0.022
7.04GlyGly: 7.04 ± 0.091
1.638GlyHis: 1.638 ± 0.016
2.794GlyIle: 2.794 ± 0.018
3.152GlyLys: 3.152 ± 0.026
4.677GlyLeu: 4.677 ± 0.026
1.32GlyMet: 1.32 ± 0.014
3.113GlyAsn: 3.113 ± 0.031
2.702GlyPro: 2.702 ± 0.029
2.693GlyGln: 2.693 ± 0.028
3.051GlyArg: 3.051 ± 0.019
6.17GlySer: 6.17 ± 0.043
3.252GlyThr: 3.252 ± 0.024
3.611GlyVal: 3.611 ± 0.024
0.581GlyTrp: 0.581 ± 0.008
2.045GlyTyr: 2.045 ± 0.028
0.001GlyXaa: 0.001 ± 0.0
His
1.47HisAla: 1.47 ± 0.013
0.558HisCys: 0.558 ± 0.012
1.084HisAsp: 1.084 ± 0.01
1.349HisGlu: 1.349 ± 0.011
0.994HisPhe: 0.994 ± 0.01
1.45HisGly: 1.45 ± 0.013
1.46HisHis: 1.46 ± 0.024
1.267HisIle: 1.267 ± 0.012
1.264HisLys: 1.264 ± 0.011
2.469HisLeu: 2.469 ± 0.016
0.652HisMet: 0.652 ± 0.008
1.157HisAsn: 1.157 ± 0.012
1.538HisPro: 1.538 ± 0.016
1.678HisGln: 1.678 ± 0.021
1.432HisArg: 1.432 ± 0.014
2.117HisSer: 2.117 ± 0.017
1.281HisThr: 1.281 ± 0.011
1.394HisVal: 1.394 ± 0.01
0.272HisTrp: 0.272 ± 0.005
0.814HisTyr: 0.814 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
3.206IleAla: 3.206 ± 0.018
1.192IleCys: 1.192 ± 0.035
2.695IleAsp: 2.695 ± 0.025
3.2IleGlu: 3.2 ± 0.033
1.972IlePhe: 1.972 ± 0.017
2.631IleGly: 2.631 ± 0.019
1.069IleHis: 1.069 ± 0.009
2.598IleIle: 2.598 ± 0.026
2.897IleLys: 2.897 ± 0.03
4.225IleLeu: 4.225 ± 0.032
1.04IleMet: 1.04 ± 0.011
2.449IleAsn: 2.449 ± 0.024
2.472IlePro: 2.472 ± 0.018
2.045IleGln: 2.045 ± 0.016
2.491IleArg: 2.491 ± 0.016
4.001IleSer: 4.001 ± 0.024
2.823IleThr: 2.823 ± 0.028
3.115IleVal: 3.115 ± 0.024
0.503IleTrp: 0.503 ± 0.008
1.566IleTyr: 1.566 ± 0.013
0.002IleXaa: 0.002 ± 0.0
Lys
3.691LysAla: 3.691 ± 0.025
1.201LysCys: 1.201 ± 0.033
3.124LysAsp: 3.124 ± 0.033
4.048LysGlu: 4.048 ± 0.031
1.854LysPhe: 1.854 ± 0.016
2.426LysGly: 2.426 ± 0.022
1.336LysHis: 1.336 ± 0.014
2.898LysIle: 2.898 ± 0.026
4.3LysLys: 4.3 ± 0.056
5.224LysLeu: 5.224 ± 0.043
1.348LysMet: 1.348 ± 0.013
2.504LysAsn: 2.504 ± 0.024
3.45LysPro: 3.45 ± 0.061
2.683LysGln: 2.683 ± 0.025
3.586LysArg: 3.586 ± 0.023
4.309LysSer: 4.309 ± 0.034
3.089LysThr: 3.089 ± 0.024
3.13LysVal: 3.13 ± 0.029
0.62LysTrp: 0.62 ± 0.013
1.686LysTyr: 1.686 ± 0.018
0.002LysXaa: 0.002 ± 0.0
Leu
6.453LeuAla: 6.453 ± 0.034
1.605LeuCys: 1.605 ± 0.021
4.834LeuAsp: 4.834 ± 0.039
6.293LeuGlu: 6.293 ± 0.05
2.775LeuPhe: 2.775 ± 0.024
4.721LeuGly: 4.721 ± 0.029
2.358LeuHis: 2.358 ± 0.017
4.12LeuIle: 4.12 ± 0.03
5.387LeuLys: 5.387 ± 0.04
8.415LeuLeu: 8.415 ± 0.061
1.94LeuMet: 1.94 ± 0.016
4.238LeuAsn: 4.238 ± 0.025
5.066LeuPro: 5.066 ± 0.027
5.083LeuGln: 5.083 ± 0.042
5.364LeuArg: 5.364 ± 0.038
6.647LeuSer: 6.647 ± 0.042
4.581LeuThr: 4.581 ± 0.023
4.948LeuVal: 4.948 ± 0.032
0.803LeuTrp: 0.803 ± 0.009
2.317LeuTyr: 2.317 ± 0.019
0.002LeuXaa: 0.002 ± 0.0
Met
1.751MetAla: 1.751 ± 0.016
0.392MetCys: 0.392 ± 0.007
1.302MetAsp: 1.302 ± 0.012
1.63MetGlu: 1.63 ± 0.013
0.733MetPhe: 0.733 ± 0.01
1.363MetGly: 1.363 ± 0.015
0.57MetHis: 0.57 ± 0.007
0.901MetIle: 0.901 ± 0.011
1.156MetLys: 1.156 ± 0.011
1.932MetLeu: 1.932 ± 0.016
0.58MetMet: 0.58 ± 0.008
0.946MetAsn: 0.946 ± 0.011
1.277MetPro: 1.277 ± 0.014
1.186MetGln: 1.186 ± 0.013
1.356MetArg: 1.356 ± 0.011
1.714MetSer: 1.714 ± 0.016
1.097MetThr: 1.097 ± 0.01
1.225MetVal: 1.225 ± 0.011
0.211MetTrp: 0.211 ± 0.005
0.595MetTyr: 0.595 ± 0.008
0.001MetXaa: 0.001 ± 0.0
Asn
3.369AsnAla: 3.369 ± 0.045
0.971AsnCys: 0.971 ± 0.023
2.246AsnAsp: 2.246 ± 0.02
2.857AsnGlu: 2.857 ± 0.021
1.695AsnPhe: 1.695 ± 0.014
3.564AsnGly: 3.564 ± 0.028
1.18AsnHis: 1.18 ± 0.024
2.49AsnIle: 2.49 ± 0.019
2.324AsnLys: 2.324 ± 0.02
4.136AsnLeu: 4.136 ± 0.028
1.078AsnMet: 1.078 ± 0.012
3.028AsnAsn: 3.028 ± 0.036
2.571AsnPro: 2.571 ± 0.049
2.058AsnGln: 2.058 ± 0.018
2.325AsnArg: 2.325 ± 0.014
4.164AsnSer: 4.164 ± 0.03
2.371AsnThr: 2.371 ± 0.016
2.733AsnVal: 2.733 ± 0.019
0.477AsnTrp: 0.477 ± 0.007
1.446AsnTyr: 1.446 ± 0.011
0.001AsnXaa: 0.001 ± 0.0
Pro
4.594ProAla: 4.594 ± 0.037
1.133ProCys: 1.133 ± 0.102
2.579ProAsp: 2.579 ± 0.015
3.896ProGlu: 3.896 ± 0.055
1.661ProPhe: 1.661 ± 0.021
3.581ProGly: 3.581 ± 0.046
1.47ProHis: 1.47 ± 0.015
2.564ProIle: 2.564 ± 0.029
3.287ProLys: 3.287 ± 0.037
4.484ProLeu: 4.484 ± 0.027
1.097ProMet: 1.097 ± 0.013
2.543ProAsn: 2.543 ± 0.037
5.897ProPro: 5.897 ± 0.074
2.968ProGln: 2.968 ± 0.027
2.677ProArg: 2.677 ± 0.02
4.964ProSer: 4.964 ± 0.039
3.71ProThr: 3.71 ± 0.027
3.509ProVal: 3.509 ± 0.028
0.425ProTrp: 0.425 ± 0.007
1.452ProTyr: 1.452 ± 0.018
0.001ProXaa: 0.001 ± 0.0
Gln
3.519GlnAla: 3.519 ± 0.024
0.946GlnCys: 0.946 ± 0.05
2.096GlnAsp: 2.096 ± 0.016
3.14GlnGlu: 3.14 ± 0.026
1.488GlnPhe: 1.488 ± 0.013
2.175GlnGly: 2.175 ± 0.018
1.821GlnHis: 1.821 ± 0.023
2.18GlnIle: 2.18 ± 0.02
2.727GlnLys: 2.727 ± 0.022
5.131GlnLeu: 5.131 ± 0.04
1.222GlnMet: 1.222 ± 0.013
2.159GlnAsn: 2.159 ± 0.017
3.157GlnPro: 3.157 ± 0.036
7.397GlnGln: 7.397 ± 0.132
3.3GlnArg: 3.3 ± 0.028
3.51GlnSer: 3.51 ± 0.026
2.493GlnThr: 2.493 ± 0.021
2.568GlnVal: 2.568 ± 0.02
0.435GlnTrp: 0.435 ± 0.006
1.232GlnTyr: 1.232 ± 0.013
0.001GlnXaa: 0.001 ± 0.0
Arg
3.307ArgAla: 3.307 ± 0.017
1.13ArgCys: 1.13 ± 0.033
2.975ArgAsp: 2.975 ± 0.023
3.754ArgGlu: 3.754 ± 0.033
1.872ArgPhe: 1.872 ± 0.014
2.792ArgGly: 2.792 ± 0.021
1.546ArgHis: 1.546 ± 0.012
2.697ArgIle: 2.697 ± 0.016
3.543ArgLys: 3.543 ± 0.025
5.057ArgLeu: 5.057 ± 0.034
1.175ArgMet: 1.175 ± 0.012
2.727ArgAsn: 2.727 ± 0.019
2.957ArgPro: 2.957 ± 0.039
2.978ArgGln: 2.978 ± 0.026
4.463ArgArg: 4.463 ± 0.036
4.527ArgSer: 4.527 ± 0.036
2.747ArgThr: 2.747 ± 0.019
2.924ArgVal: 2.924 ± 0.023
0.561ArgTrp: 0.561 ± 0.008
1.618ArgTyr: 1.618 ± 0.013
0.002ArgXaa: 0.002 ± 0.0
Ser
5.938SerAla: 5.938 ± 0.043
1.605SerCys: 1.605 ± 0.057
4.173SerAsp: 4.173 ± 0.026
4.69SerGlu: 4.69 ± 0.029
2.577SerPhe: 2.577 ± 0.02
6.166SerGly: 6.166 ± 0.043
1.971SerHis: 1.971 ± 0.017
3.684SerIle: 3.684 ± 0.025
4.15SerLys: 4.15 ± 0.031
6.817SerLeu: 6.817 ± 0.041
1.722SerMet: 1.722 ± 0.017
4.181SerAsn: 4.181 ± 0.029
5.075SerPro: 5.075 ± 0.043
3.724SerGln: 3.724 ± 0.026
4.115SerArg: 4.115 ± 0.03
10.278SerSer: 10.278 ± 0.096
5.359SerThr: 5.359 ± 0.047
4.556SerVal: 4.556 ± 0.025
0.74SerTrp: 0.74 ± 0.009
2.098SerTyr: 2.098 ± 0.017
0.003SerXaa: 0.003 ± 0.0
Thr
4.487ThrAla: 4.487 ± 0.029
1.097ThrCys: 1.097 ± 0.03
2.651ThrAsp: 2.651 ± 0.018
3.389ThrGlu: 3.389 ± 0.03
1.779ThrPhe: 1.779 ± 0.014
3.594ThrGly: 3.594 ± 0.031
1.289ThrHis: 1.289 ± 0.011
2.842ThrIle: 2.842 ± 0.022
2.935ThrLys: 2.935 ± 0.026
4.833ThrLeu: 4.833 ± 0.028
1.083ThrMet: 1.083 ± 0.009
2.552ThrAsn: 2.552 ± 0.016
4.149ThrPro: 4.149 ± 0.032
2.298ThrGln: 2.298 ± 0.017
2.553ThrArg: 2.553 ± 0.022
5.06ThrSer: 5.06 ± 0.037
5.145ThrThr: 5.145 ± 0.075
3.475ThrVal: 3.475 ± 0.031
0.513ThrTrp: 0.513 ± 0.01
1.438ThrTyr: 1.438 ± 0.011
0.002ThrXaa: 0.002 ± 0.0
Val
4.579ValAla: 4.579 ± 0.024
1.259ValCys: 1.259 ± 0.041
3.211ValAsp: 3.211 ± 0.024
4.082ValGlu: 4.082 ± 0.052
2.024ValPhe: 2.024 ± 0.015
3.481ValGly: 3.481 ± 0.024
1.391ValHis: 1.391 ± 0.012
2.938ValIle: 2.938 ± 0.022
3.206ValLys: 3.206 ± 0.034
5.183ValLeu: 5.183 ± 0.03
1.238ValMet: 1.238 ± 0.011
2.663ValAsn: 2.663 ± 0.021
3.437ValPro: 3.437 ± 0.023
2.731ValGln: 2.731 ± 0.02
3.052ValArg: 3.052 ± 0.016
4.403ValSer: 4.403 ± 0.02
3.446ValThr: 3.446 ± 0.031
4.081ValVal: 4.081 ± 0.029
0.561ValTrp: 0.561 ± 0.008
1.601ValTyr: 1.601 ± 0.014
0.002ValXaa: 0.002 ± 0.0
Trp
0.523TrpAla: 0.523 ± 0.007
0.185TrpCys: 0.185 ± 0.004
0.496TrpAsp: 0.496 ± 0.009
0.54TrpGlu: 0.54 ± 0.008
0.384TrpPhe: 0.384 ± 0.007
0.473TrpGly: 0.473 ± 0.008
0.253TrpHis: 0.253 ± 0.005
0.516TrpIle: 0.516 ± 0.008
0.555TrpLys: 0.555 ± 0.009
1.066TrpLeu: 1.066 ± 0.013
0.266TrpMet: 0.266 ± 0.005
0.483TrpAsn: 0.483 ± 0.008
0.37TrpPro: 0.37 ± 0.006
0.489TrpGln: 0.489 ± 0.007
0.632TrpArg: 0.632 ± 0.009
0.731TrpSer: 0.731 ± 0.011
0.551TrpThr: 0.551 ± 0.008
0.522TrpVal: 0.522 ± 0.008
0.155TrpTrp: 0.155 ± 0.005
0.313TrpTyr: 0.313 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.81TyrAla: 1.81 ± 0.013
0.648TyrCys: 0.648 ± 0.011
1.633TyrAsp: 1.633 ± 0.014
1.772TyrGlu: 1.772 ± 0.014
1.193TyrPhe: 1.193 ± 0.01
1.838TyrGly: 1.838 ± 0.017
0.76TyrHis: 0.76 ± 0.008
1.413TyrIle: 1.413 ± 0.014
1.524TyrLys: 1.524 ± 0.017
2.514TyrLeu: 2.514 ± 0.019
0.684TyrMet: 0.684 ± 0.008
1.399TyrAsn: 1.399 ± 0.014
1.335TyrPro: 1.335 ± 0.014
1.333TyrGln: 1.333 ± 0.012
1.627TyrArg: 1.627 ± 0.012
2.105TyrSer: 2.105 ± 0.016
1.563TyrThr: 1.563 ± 0.014
1.653TyrVal: 1.653 ± 0.014
0.326TyrTrp: 0.326 ± 0.007
1.035TyrTyr: 1.035 ± 0.011
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.003XaaHis: 0.003 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.003XaaGln: 0.003 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.004XaaXaa: 0.004 ± 0.002
Statistics based on 19363 proteins (12990802 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski