Amino acid dipepetide frequency for Drosophila busckii (Fruit fly)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.688AlaAla: 11.688 ± 0.141
1.287AlaCys: 1.287 ± 0.017
3.662AlaAsp: 3.662 ± 0.025
4.974AlaGlu: 4.974 ± 0.04
2.276AlaPhe: 2.276 ± 0.024
4.794AlaGly: 4.794 ± 0.042
1.977AlaHis: 1.977 ± 0.021
3.499AlaIle: 3.499 ± 0.026
4.244AlaLys: 4.244 ± 0.034
7.215AlaLeu: 7.215 ± 0.05
1.827AlaMet: 1.827 ± 0.017
3.616AlaAsn: 3.616 ± 0.029
4.146AlaPro: 4.146 ± 0.044
4.009AlaGln: 4.009 ± 0.035
3.48AlaArg: 3.48 ± 0.023
6.046AlaSer: 6.046 ± 0.051
5.44AlaThr: 5.44 ± 0.053
4.65AlaVal: 4.65 ± 0.032
0.659AlaTrp: 0.659 ± 0.012
1.997AlaTyr: 1.997 ± 0.02
0.0AlaXaa: 0.0 ± 0.0
Cys
1.412CysAla: 1.412 ± 0.019
0.57CysCys: 0.57 ± 0.019
1.09CysAsp: 1.09 ± 0.022
1.186CysGlu: 1.186 ± 0.02
0.76CysPhe: 0.76 ± 0.013
1.329CysGly: 1.329 ± 0.023
0.503CysHis: 0.503 ± 0.012
1.022CysIle: 1.022 ± 0.016
1.025CysLys: 1.025 ± 0.017
1.875CysLeu: 1.875 ± 0.022
0.44CysMet: 0.44 ± 0.009
0.997CysAsn: 0.997 ± 0.019
0.929CysPro: 0.929 ± 0.022
0.873CysGln: 0.873 ± 0.015
1.05CysArg: 1.05 ± 0.018
1.63CysSer: 1.63 ± 0.021
1.003CysThr: 1.003 ± 0.014
1.21CysVal: 1.21 ± 0.016
0.22CysTrp: 0.22 ± 0.006
0.635CysTyr: 0.635 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
4.007AspAla: 4.007 ± 0.03
1.078AspCys: 1.078 ± 0.019
3.743AspAsp: 3.743 ± 0.05
4.273AspGlu: 4.273 ± 0.041
2.147AspPhe: 2.147 ± 0.021
2.805AspGly: 2.805 ± 0.027
0.963AspHis: 0.963 ± 0.011
2.877AspIle: 2.877 ± 0.026
2.811AspLys: 2.811 ± 0.029
4.423AspLeu: 4.423 ± 0.029
1.375AspMet: 1.375 ± 0.016
2.469AspAsn: 2.469 ± 0.027
1.929AspPro: 1.929 ± 0.023
1.64AspGln: 1.64 ± 0.017
2.125AspArg: 2.125 ± 0.025
3.667AspSer: 3.667 ± 0.031
2.42AspThr: 2.42 ± 0.02
3.253AspVal: 3.253 ± 0.023
0.627AspTrp: 0.627 ± 0.011
1.901AspTyr: 1.901 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
4.555GluAla: 4.555 ± 0.036
1.135GluCys: 1.135 ± 0.02
3.437GluAsp: 3.437 ± 0.035
5.146GluGlu: 5.146 ± 0.07
2.203GluPhe: 2.203 ± 0.021
2.415GluGly: 2.415 ± 0.024
1.867GluHis: 1.867 ± 0.023
3.053GluIle: 3.053 ± 0.031
3.561GluLys: 3.561 ± 0.039
7.185GluLeu: 7.185 ± 0.047
1.422GluMet: 1.422 ± 0.018
2.631GluAsn: 2.631 ± 0.023
2.849GluPro: 2.849 ± 0.038
4.617GluGln: 4.617 ± 0.056
4.123GluArg: 4.123 ± 0.043
4.092GluSer: 4.092 ± 0.043
3.309GluThr: 3.309 ± 0.035
3.177GluVal: 3.177 ± 0.034
0.546GluTrp: 0.546 ± 0.01
1.799GluTyr: 1.799 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
2.434PheAla: 2.434 ± 0.021
0.743PheCys: 0.743 ± 0.012
2.091PheAsp: 2.091 ± 0.02
2.254PheGlu: 2.254 ± 0.023
1.396PhePhe: 1.396 ± 0.019
2.377PheGly: 2.377 ± 0.028
0.863PheHis: 0.863 ± 0.013
1.922PheIle: 1.922 ± 0.022
1.991PheLys: 1.991 ± 0.022
3.203PheLeu: 3.203 ± 0.032
0.956PheMet: 0.956 ± 0.014
1.822PheAsn: 1.822 ± 0.017
1.356PhePro: 1.356 ± 0.018
1.406PheGln: 1.406 ± 0.016
1.758PheArg: 1.758 ± 0.019
2.493PheSer: 2.493 ± 0.024
1.794PheThr: 1.794 ± 0.02
2.42PheVal: 2.42 ± 0.027
0.444PheTrp: 0.444 ± 0.009
1.355PheTyr: 1.355 ± 0.02
0.0PheXaa: 0.0 ± 0.0
Gly
4.355GlyAla: 4.355 ± 0.041
1.022GlyCys: 1.022 ± 0.018
2.708GlyAsp: 2.708 ± 0.026
2.845GlyGlu: 2.845 ± 0.029
2.089GlyPhe: 2.089 ± 0.021
4.989GlyGly: 4.989 ± 0.072
1.461GlyHis: 1.461 ± 0.019
2.834GlyIle: 2.834 ± 0.026
2.933GlyLys: 2.933 ± 0.029
4.53GlyLeu: 4.53 ± 0.035
1.315GlyMet: 1.315 ± 0.018
2.924GlyAsn: 2.924 ± 0.028
1.969GlyPro: 1.969 ± 0.031
2.22GlyGln: 2.22 ± 0.023
2.7GlyArg: 2.7 ± 0.028
4.847GlySer: 4.847 ± 0.041
2.814GlyThr: 2.814 ± 0.028
3.348GlyVal: 3.348 ± 0.027
0.613GlyTrp: 0.613 ± 0.012
2.028GlyTyr: 2.028 ± 0.025
0.0GlyXaa: 0.0 ± 0.0
His
1.939HisAla: 1.939 ± 0.019
0.623HisCys: 0.623 ± 0.012
1.122HisAsp: 1.122 ± 0.012
1.532HisGlu: 1.532 ± 0.018
1.057HisPhe: 1.057 ± 0.014
1.464HisGly: 1.464 ± 0.017
1.238HisHis: 1.238 ± 0.029
1.446HisIle: 1.446 ± 0.013
1.422HisLys: 1.422 ± 0.017
2.482HisLeu: 2.482 ± 0.025
0.782HisMet: 0.782 ± 0.012
1.329HisAsn: 1.329 ± 0.016
1.2HisPro: 1.2 ± 0.018
1.41HisGln: 1.41 ± 0.02
1.306HisArg: 1.306 ± 0.017
2.074HisSer: 2.074 ± 0.025
1.417HisThr: 1.417 ± 0.017
1.546HisVal: 1.546 ± 0.017
0.328HisTrp: 0.328 ± 0.007
0.948HisTyr: 0.948 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
3.648IleAla: 3.648 ± 0.027
1.25IleCys: 1.25 ± 0.017
2.736IleAsp: 2.736 ± 0.026
3.208IleGlu: 3.208 ± 0.036
2.071IlePhe: 2.071 ± 0.02
2.561IleGly: 2.561 ± 0.027
1.044IleHis: 1.044 ± 0.014
2.797IleIle: 2.797 ± 0.025
2.987IleLys: 2.987 ± 0.03
4.092IleLeu: 4.092 ± 0.035
1.187IleMet: 1.187 ± 0.017
2.553IleAsn: 2.553 ± 0.025
2.034IlePro: 2.034 ± 0.02
1.832IleGln: 1.832 ± 0.021
2.306IleArg: 2.306 ± 0.02
3.902IleSer: 3.902 ± 0.027
2.813IleThr: 2.813 ± 0.027
3.258IleVal: 3.258 ± 0.03
0.573IleTrp: 0.573 ± 0.011
1.852IleTyr: 1.852 ± 0.023
0.0IleXaa: 0.0 ± 0.0
Lys
3.564LysAla: 3.564 ± 0.031
1.164LysCys: 1.164 ± 0.02
2.732LysAsp: 2.732 ± 0.035
3.483LysGlu: 3.483 ± 0.036
1.939LysPhe: 1.939 ± 0.018
2.132LysGly: 2.132 ± 0.03
1.533LysHis: 1.533 ± 0.019
2.722LysIle: 2.722 ± 0.029
3.742LysLys: 3.742 ± 0.058
5.837LysLeu: 5.837 ± 0.035
1.328LysMet: 1.328 ± 0.015
2.251LysAsn: 2.251 ± 0.023
3.08LysPro: 3.08 ± 0.055
3.323LysGln: 3.323 ± 0.032
3.775LysArg: 3.775 ± 0.032
4.123LysSer: 4.123 ± 0.037
2.975LysThr: 2.975 ± 0.034
2.81LysVal: 2.81 ± 0.029
0.578LysTrp: 0.578 ± 0.01
1.842LysTyr: 1.842 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
7.173LeuAla: 7.173 ± 0.045
1.894LeuCys: 1.894 ± 0.021
4.928LeuAsp: 4.928 ± 0.034
6.31LeuGlu: 6.31 ± 0.045
3.084LeuPhe: 3.084 ± 0.033
4.855LeuGly: 4.855 ± 0.038
2.724LeuHis: 2.724 ± 0.026
4.392LeuIle: 4.392 ± 0.04
5.554LeuLys: 5.554 ± 0.033
10.577LeuLeu: 10.577 ± 0.075
2.223LeuMet: 2.223 ± 0.022
4.619LeuAsn: 4.619 ± 0.035
5.452LeuPro: 5.452 ± 0.038
6.123LeuGln: 6.123 ± 0.066
5.751LeuArg: 5.751 ± 0.038
7.267LeuSer: 7.267 ± 0.043
5.014LeuThr: 5.014 ± 0.038
5.035LeuVal: 5.035 ± 0.038
0.92LeuTrp: 0.92 ± 0.013
2.757LeuTyr: 2.757 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
1.789MetAla: 1.789 ± 0.018
0.451MetCys: 0.451 ± 0.008
1.353MetAsp: 1.353 ± 0.017
1.573MetGlu: 1.573 ± 0.017
0.812MetPhe: 0.812 ± 0.014
1.312MetGly: 1.312 ± 0.019
0.681MetHis: 0.681 ± 0.011
0.953MetIle: 0.953 ± 0.013
1.167MetLys: 1.167 ± 0.015
2.53MetLeu: 2.53 ± 0.023
0.581MetMet: 0.581 ± 0.011
0.977MetAsn: 0.977 ± 0.014
1.468MetPro: 1.468 ± 0.019
1.444MetGln: 1.444 ± 0.017
1.554MetArg: 1.554 ± 0.018
1.859MetSer: 1.859 ± 0.018
1.121MetThr: 1.121 ± 0.015
1.167MetVal: 1.167 ± 0.018
0.238MetTrp: 0.238 ± 0.006
0.667MetTyr: 0.667 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.895AsnAla: 3.895 ± 0.027
1.092AsnCys: 1.092 ± 0.018
2.347AsnAsp: 2.347 ± 0.024
3.024AsnGlu: 3.024 ± 0.026
1.866AsnPhe: 1.866 ± 0.019
3.175AsnGly: 3.175 ± 0.036
1.015AsnHis: 1.015 ± 0.014
2.632AsnIle: 2.632 ± 0.024
2.555AsnLys: 2.555 ± 0.021
4.099AsnLeu: 4.099 ± 0.033
1.274AsnMet: 1.274 ± 0.017
3.701AsnAsn: 3.701 ± 0.052
1.976AsnPro: 1.976 ± 0.019
1.793AsnGln: 1.793 ± 0.022
2.15AsnArg: 2.15 ± 0.022
4.2AsnSer: 4.2 ± 0.041
2.491AsnThr: 2.491 ± 0.021
3.011AsnVal: 3.011 ± 0.023
0.571AsnTrp: 0.571 ± 0.01
1.739AsnTyr: 1.739 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
4.628ProAla: 4.628 ± 0.044
0.705ProCys: 0.705 ± 0.016
2.21ProAsp: 2.21 ± 0.021
3.161ProGlu: 3.161 ± 0.039
1.512ProPhe: 1.512 ± 0.018
2.512ProGly: 2.512 ± 0.043
1.368ProHis: 1.368 ± 0.018
2.267ProIle: 2.267 ± 0.022
2.864ProLys: 2.864 ± 0.042
4.522ProLeu: 4.522 ± 0.029
1.159ProMet: 1.159 ± 0.017
2.307ProAsn: 2.307 ± 0.022
4.358ProPro: 4.358 ± 0.06
2.846ProGln: 2.846 ± 0.03
2.374ProArg: 2.374 ± 0.025
3.83ProSer: 3.83 ± 0.033
3.541ProThr: 3.541 ± 0.044
2.902ProVal: 2.902 ± 0.031
0.444ProTrp: 0.444 ± 0.009
1.426ProTyr: 1.426 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.784GlnAla: 3.784 ± 0.035
0.895GlnCys: 0.895 ± 0.017
1.856GlnAsp: 1.856 ± 0.02
2.887GlnGlu: 2.887 ± 0.033
1.64GlnPhe: 1.64 ± 0.016
1.84GlnGly: 1.84 ± 0.021
2.023GlnHis: 2.023 ± 0.026
2.118GlnIle: 2.118 ± 0.025
2.423GlnLys: 2.423 ± 0.027
7.071GlnLeu: 7.071 ± 0.069
1.222GlnMet: 1.222 ± 0.018
1.902GlnAsn: 1.902 ± 0.019
3.091GlnPro: 3.091 ± 0.032
9.196GlnGln: 9.196 ± 0.195
3.85GlnArg: 3.85 ± 0.034
3.686GlnSer: 3.686 ± 0.035
2.719GlnThr: 2.719 ± 0.03
2.439GlnVal: 2.439 ± 0.022
0.449GlnTrp: 0.449 ± 0.009
1.346GlnTyr: 1.346 ± 0.014
0.0GlnXaa: 0.0 ± 0.0
Arg
3.603ArgAla: 3.603 ± 0.027
1.139ArgCys: 1.139 ± 0.024
2.642ArgAsp: 2.642 ± 0.026
3.45ArgGlu: 3.45 ± 0.036
1.974ArgPhe: 1.974 ± 0.022
2.624ArgGly: 2.624 ± 0.027
1.589ArgHis: 1.589 ± 0.018
2.796ArgIle: 2.796 ± 0.023
3.423ArgLys: 3.423 ± 0.029
5.319ArgLeu: 5.319 ± 0.04
1.22ArgMet: 1.22 ± 0.015
2.723ArgAsn: 2.723 ± 0.024
2.436ArgPro: 2.436 ± 0.032
3.082ArgGln: 3.082 ± 0.029
4.255ArgArg: 4.255 ± 0.044
4.173ArgSer: 4.173 ± 0.042
2.664ArgThr: 2.664 ± 0.024
2.799ArgVal: 2.799 ± 0.026
0.558ArgTrp: 0.558 ± 0.009
1.793ArgTyr: 1.793 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.161SerAla: 6.161 ± 0.049
1.586SerCys: 1.586 ± 0.025
3.81SerAsp: 3.81 ± 0.034
4.257SerGlu: 4.257 ± 0.034
2.673SerPhe: 2.673 ± 0.026
4.768SerGly: 4.768 ± 0.038
1.793SerHis: 1.793 ± 0.021
3.756SerIle: 3.756 ± 0.025
4.112SerLys: 4.112 ± 0.033
6.744SerLeu: 6.744 ± 0.047
1.809SerMet: 1.809 ± 0.014
4.469SerAsn: 4.469 ± 0.043
3.888SerPro: 3.888 ± 0.045
3.278SerGln: 3.278 ± 0.032
3.598SerArg: 3.598 ± 0.037
9.897SerSer: 9.897 ± 0.127
5.048SerThr: 5.048 ± 0.047
4.286SerVal: 4.286 ± 0.035
0.775SerTrp: 0.775 ± 0.013
2.397SerTyr: 2.397 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
5.145ThrAla: 5.145 ± 0.042
0.955ThrCys: 0.955 ± 0.02
2.654ThrAsp: 2.654 ± 0.03
3.3ThrGlu: 3.3 ± 0.041
1.763ThrPhe: 1.763 ± 0.02
3.031ThrGly: 3.031 ± 0.025
1.432ThrHis: 1.432 ± 0.019
2.756ThrIle: 2.756 ± 0.027
2.806ThrLys: 2.806 ± 0.028
5.421ThrLeu: 5.421 ± 0.034
1.162ThrMet: 1.162 ± 0.014
2.523ThrAsn: 2.523 ± 0.023
4.066ThrPro: 4.066 ± 0.046
2.619ThrGln: 2.619 ± 0.026
2.613ThrArg: 2.613 ± 0.021
4.374ThrSer: 4.374 ± 0.04
5.129ThrThr: 5.129 ± 0.096
3.35ThrVal: 3.35 ± 0.035
0.482ThrTrp: 0.482 ± 0.01
1.496ThrTyr: 1.496 ± 0.017
0.0ThrXaa: 0.0 ± 0.0
Val
4.776ValAla: 4.776 ± 0.038
1.243ValCys: 1.243 ± 0.017
3.107ValAsp: 3.107 ± 0.023
3.699ValGlu: 3.699 ± 0.041
2.049ValPhe: 2.049 ± 0.024
3.181ValGly: 3.181 ± 0.029
1.484ValHis: 1.484 ± 0.017
2.769ValIle: 2.769 ± 0.025
3.016ValLys: 3.016 ± 0.031
5.511ValLeu: 5.511 ± 0.04
1.284ValMet: 1.284 ± 0.016
2.648ValAsn: 2.648 ± 0.021
3.001ValPro: 3.001 ± 0.027
2.72ValGln: 2.72 ± 0.024
3.064ValArg: 3.064 ± 0.026
4.005ValSer: 4.005 ± 0.029
3.109ValThr: 3.109 ± 0.03
3.745ValVal: 3.745 ± 0.03
0.599ValTrp: 0.599 ± 0.01
1.804ValTyr: 1.804 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
0.567TrpAla: 0.567 ± 0.01
0.203TrpCys: 0.203 ± 0.006
0.495TrpAsp: 0.495 ± 0.01
0.522TrpGlu: 0.522 ± 0.009
0.401TrpPhe: 0.401 ± 0.009
0.509TrpGly: 0.509 ± 0.011
0.314TrpHis: 0.314 ± 0.007
0.538TrpIle: 0.538 ± 0.01
0.547TrpLys: 0.547 ± 0.013
1.204TrpLeu: 1.204 ± 0.018
0.265TrpMet: 0.265 ± 0.007
0.481TrpAsn: 0.481 ± 0.011
0.422TrpPro: 0.422 ± 0.008
0.631TrpGln: 0.631 ± 0.011
0.716TrpArg: 0.716 ± 0.012
0.786TrpSer: 0.786 ± 0.012
0.579TrpThr: 0.579 ± 0.009
0.487TrpVal: 0.487 ± 0.01
0.171TrpTrp: 0.171 ± 0.006
0.343TrpTyr: 0.343 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.361TyrAla: 2.361 ± 0.024
0.726TyrCys: 0.726 ± 0.013
1.809TyrAsp: 1.809 ± 0.02
2.038TyrGlu: 2.038 ± 0.021
1.365TyrPhe: 1.365 ± 0.017
1.871TyrGly: 1.871 ± 0.023
0.793TyrHis: 0.793 ± 0.013
1.553TyrIle: 1.553 ± 0.018
1.749TyrLys: 1.749 ± 0.021
2.812TyrLeu: 2.812 ± 0.03
0.868TyrMet: 0.868 ± 0.015
1.665TyrAsn: 1.665 ± 0.018
1.288TyrPro: 1.288 ± 0.019
1.376TyrGln: 1.376 ± 0.016
1.716TyrArg: 1.716 ± 0.018
2.153TyrSer: 2.153 ± 0.023
1.678TyrThr: 1.678 ± 0.02
1.919TyrVal: 1.919 ± 0.022
0.387TyrTrp: 0.387 ± 0.009
1.282TyrTyr: 1.282 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 11914 proteins (6422999 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski