Amino acid dipepetide frequency for Drosophila navojoa (Fruit fly)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.819AlaAla: 11.819 ± 0.146
1.217AlaCys: 1.217 ± 0.034
3.617AlaAsp: 3.617 ± 0.024
4.778AlaGlu: 4.778 ± 0.037
2.162AlaPhe: 2.162 ± 0.021
4.955AlaGly: 4.955 ± 0.045
1.837AlaHis: 1.837 ± 0.02
3.469AlaIle: 3.469 ± 0.024
4.123AlaLys: 4.123 ± 0.033
6.737AlaLeu: 6.737 ± 0.041
1.807AlaMet: 1.807 ± 0.016
3.658AlaAsn: 3.658 ± 0.026
4.245AlaPro: 4.245 ± 0.039
3.739AlaGln: 3.739 ± 0.028
3.491AlaArg: 3.491 ± 0.025
6.318AlaSer: 6.318 ± 0.046
5.769AlaThr: 5.769 ± 0.056
4.68AlaVal: 4.68 ± 0.03
0.602AlaTrp: 0.602 ± 0.011
1.862AlaTyr: 1.862 ± 0.018
0.002AlaXaa: 0.002 ± 0.0
Cys
1.315CysAla: 1.315 ± 0.027
0.531CysCys: 0.531 ± 0.01
1.094CysAsp: 1.094 ± 0.017
1.112CysGlu: 1.112 ± 0.021
0.692CysPhe: 0.692 ± 0.012
1.337CysGly: 1.337 ± 0.038
0.499CysHis: 0.499 ± 0.011
0.98CysIle: 0.98 ± 0.022
0.955CysLys: 0.955 ± 0.015
1.828CysLeu: 1.828 ± 0.028
0.425CysMet: 0.425 ± 0.009
0.952CysAsn: 0.952 ± 0.018
0.995CysPro: 0.995 ± 0.036
0.864CysGln: 0.864 ± 0.029
1.118CysArg: 1.118 ± 0.048
1.625CysSer: 1.625 ± 0.027
0.987CysThr: 0.987 ± 0.024
1.181CysVal: 1.181 ± 0.036
0.209CysTrp: 0.209 ± 0.005
0.598CysTyr: 0.598 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.886AspAla: 3.886 ± 0.024
1.028AspCys: 1.028 ± 0.021
4.048AspAsp: 4.048 ± 0.054
4.265AspGlu: 4.265 ± 0.037
2.015AspPhe: 2.015 ± 0.021
2.966AspGly: 2.966 ± 0.033
0.994AspHis: 0.994 ± 0.014
2.795AspIle: 2.795 ± 0.023
2.627AspLys: 2.627 ± 0.025
4.401AspLeu: 4.401 ± 0.029
1.313AspMet: 1.313 ± 0.015
2.44AspAsn: 2.44 ± 0.026
2.107AspPro: 2.107 ± 0.037
1.77AspGln: 1.77 ± 0.015
2.286AspArg: 2.286 ± 0.026
3.807AspSer: 3.807 ± 0.027
2.401AspThr: 2.401 ± 0.022
3.301AspVal: 3.301 ± 0.021
0.597AspTrp: 0.597 ± 0.012
1.844AspTyr: 1.844 ± 0.017
0.001AspXaa: 0.001 ± 0.0
Glu
4.518GluAla: 4.518 ± 0.035
1.129GluCys: 1.129 ± 0.041
3.536GluAsp: 3.536 ± 0.033
5.313GluGlu: 5.313 ± 0.059
2.018GluPhe: 2.018 ± 0.018
2.438GluGly: 2.438 ± 0.027
1.683GluHis: 1.683 ± 0.016
2.994GluIle: 2.994 ± 0.028
3.683GluLys: 3.683 ± 0.037
6.659GluLeu: 6.659 ± 0.044
1.504GluMet: 1.504 ± 0.015
2.664GluAsn: 2.664 ± 0.025
2.946GluPro: 2.946 ± 0.028
4.247GluGln: 4.247 ± 0.041
4.055GluArg: 4.055 ± 0.038
4.145GluSer: 4.145 ± 0.033
3.252GluThr: 3.252 ± 0.029
3.193GluVal: 3.193 ± 0.03
0.55GluTrp: 0.55 ± 0.009
1.747GluTyr: 1.747 ± 0.018
0.001GluXaa: 0.001 ± 0.0
Phe
2.281PheAla: 2.281 ± 0.018
0.721PheCys: 0.721 ± 0.012
1.979PheAsp: 1.979 ± 0.02
2.073PheGlu: 2.073 ± 0.021
1.322PhePhe: 1.322 ± 0.018
2.306PheGly: 2.306 ± 0.025
0.841PheHis: 0.841 ± 0.013
1.79PheIle: 1.79 ± 0.019
1.803PheLys: 1.803 ± 0.019
3.055PheLeu: 3.055 ± 0.025
0.889PheMet: 0.889 ± 0.013
1.664PheAsn: 1.664 ± 0.017
1.328PhePro: 1.328 ± 0.016
1.365PheGln: 1.365 ± 0.016
1.754PheArg: 1.754 ± 0.017
2.374PheSer: 2.374 ± 0.023
1.704PheThr: 1.704 ± 0.019
2.304PheVal: 2.304 ± 0.022
0.407PheTrp: 0.407 ± 0.008
1.279PheTyr: 1.279 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
4.51GlyAla: 4.51 ± 0.043
1.052GlyCys: 1.052 ± 0.022
2.805GlyAsp: 2.805 ± 0.029
2.83GlyGlu: 2.83 ± 0.026
1.988GlyPhe: 1.988 ± 0.022
5.686GlyGly: 5.686 ± 0.09
1.55GlyHis: 1.55 ± 0.017
2.845GlyIle: 2.845 ± 0.026
2.839GlyLys: 2.839 ± 0.028
4.456GlyLeu: 4.456 ± 0.038
1.257GlyMet: 1.257 ± 0.017
3.055GlyAsn: 3.055 ± 0.032
2.243GlyPro: 2.243 ± 0.037
2.379GlyGln: 2.379 ± 0.024
2.872GlyArg: 2.872 ± 0.028
5.37GlySer: 5.37 ± 0.043
2.975GlyThr: 2.975 ± 0.023
3.377GlyVal: 3.377 ± 0.026
0.608GlyTrp: 0.608 ± 0.012
2.046GlyTyr: 2.046 ± 0.025
0.001GlyXaa: 0.001 ± 0.0
His
1.721HisAla: 1.721 ± 0.017
0.598HisCys: 0.598 ± 0.012
1.093HisAsp: 1.093 ± 0.014
1.401HisGlu: 1.401 ± 0.015
0.983HisPhe: 0.983 ± 0.014
1.501HisGly: 1.501 ± 0.019
1.426HisHis: 1.426 ± 0.031
1.376HisIle: 1.376 ± 0.014
1.349HisLys: 1.349 ± 0.016
2.559HisLeu: 2.559 ± 0.024
0.742HisMet: 0.742 ± 0.012
1.283HisAsn: 1.283 ± 0.016
1.326HisPro: 1.326 ± 0.016
1.721HisGln: 1.721 ± 0.026
1.429HisArg: 1.429 ± 0.016
2.222HisSer: 2.222 ± 0.023
1.357HisThr: 1.357 ± 0.016
1.472HisVal: 1.472 ± 0.014
0.304HisTrp: 0.304 ± 0.006
0.898HisTyr: 0.898 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
3.524IleAla: 3.524 ± 0.022
1.202IleCys: 1.202 ± 0.023
2.737IleAsp: 2.737 ± 0.025
3.117IleGlu: 3.117 ± 0.031
1.993IlePhe: 1.993 ± 0.023
2.61IleGly: 2.61 ± 0.024
1.047IleHis: 1.047 ± 0.014
2.707IleIle: 2.707 ± 0.026
2.847IleLys: 2.847 ± 0.025
3.973IleLeu: 3.973 ± 0.03
1.155IleMet: 1.155 ± 0.014
2.51IleAsn: 2.51 ± 0.02
2.074IlePro: 2.074 ± 0.022
1.834IleGln: 1.834 ± 0.019
2.369IleArg: 2.369 ± 0.02
3.892IleSer: 3.892 ± 0.025
2.682IleThr: 2.682 ± 0.026
3.225IleVal: 3.225 ± 0.027
0.539IleTrp: 0.539 ± 0.01
1.768IleTyr: 1.768 ± 0.019
0.001IleXaa: 0.001 ± 0.0
Lys
3.548LysAla: 3.548 ± 0.028
1.125LysCys: 1.125 ± 0.025
2.648LysAsp: 2.648 ± 0.028
3.584LysGlu: 3.584 ± 0.035
1.779LysPhe: 1.779 ± 0.016
2.153LysGly: 2.153 ± 0.027
1.42LysHis: 1.42 ± 0.016
2.6LysIle: 2.6 ± 0.024
3.748LysLys: 3.748 ± 0.049
5.361LysLeu: 5.361 ± 0.038
1.353LysMet: 1.353 ± 0.014
2.197LysAsn: 2.197 ± 0.021
3.021LysPro: 3.021 ± 0.036
3.085LysGln: 3.085 ± 0.026
3.722LysArg: 3.722 ± 0.033
4.056LysSer: 4.056 ± 0.031
2.88LysThr: 2.88 ± 0.024
2.781LysVal: 2.781 ± 0.027
0.573LysTrp: 0.573 ± 0.01
1.764LysTyr: 1.764 ± 0.021
0.001LysXaa: 0.001 ± 0.0
Leu
6.97LeuAla: 6.97 ± 0.046
1.816LeuCys: 1.816 ± 0.021
4.74LeuAsp: 4.74 ± 0.029
5.943LeuGlu: 5.943 ± 0.042
2.911LeuPhe: 2.911 ± 0.026
4.709LeuGly: 4.709 ± 0.036
2.611LeuHis: 2.611 ± 0.021
4.286LeuIle: 4.286 ± 0.032
5.216LeuLys: 5.216 ± 0.035
9.801LeuLeu: 9.801 ± 0.07
2.162LeuMet: 2.162 ± 0.022
4.327LeuAsn: 4.327 ± 0.03
5.388LeuPro: 5.388 ± 0.034
5.673LeuGln: 5.673 ± 0.043
5.632LeuArg: 5.632 ± 0.04
6.974LeuSer: 6.974 ± 0.036
4.824LeuThr: 4.824 ± 0.03
4.85LeuVal: 4.85 ± 0.035
0.89LeuTrp: 0.89 ± 0.012
2.544LeuTyr: 2.544 ± 0.024
0.002LeuXaa: 0.002 ± 0.001
Met
1.865MetAla: 1.865 ± 0.019
0.435MetCys: 0.435 ± 0.009
1.362MetAsp: 1.362 ± 0.016
1.543MetGlu: 1.543 ± 0.017
0.768MetPhe: 0.768 ± 0.012
1.329MetGly: 1.329 ± 0.021
0.673MetHis: 0.673 ± 0.009
0.937MetIle: 0.937 ± 0.013
1.187MetLys: 1.187 ± 0.014
2.405MetLeu: 2.405 ± 0.022
0.61MetMet: 0.61 ± 0.011
0.981MetAsn: 0.981 ± 0.013
1.484MetPro: 1.484 ± 0.019
1.373MetGln: 1.373 ± 0.017
1.547MetArg: 1.547 ± 0.018
1.872MetSer: 1.872 ± 0.019
1.134MetThr: 1.134 ± 0.013
1.164MetVal: 1.164 ± 0.015
0.232MetTrp: 0.232 ± 0.006
0.622MetTyr: 0.622 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.793AsnAla: 3.793 ± 0.035
1.043AsnCys: 1.043 ± 0.014
2.419AsnAsp: 2.419 ± 0.027
2.963AsnGlu: 2.963 ± 0.024
1.737AsnPhe: 1.737 ± 0.019
3.456AsnGly: 3.456 ± 0.031
1.055AsnHis: 1.055 ± 0.018
2.554AsnIle: 2.554 ± 0.022
2.393AsnLys: 2.393 ± 0.021
3.927AsnLeu: 3.927 ± 0.032
1.267AsnMet: 1.267 ± 0.015
4.333AsnAsn: 4.333 ± 0.077
2.06AsnPro: 2.06 ± 0.029
1.819AsnGln: 1.819 ± 0.02
2.237AsnArg: 2.237 ± 0.018
4.737AsnSer: 4.737 ± 0.05
2.507AsnThr: 2.507 ± 0.02
2.919AsnVal: 2.919 ± 0.025
0.539AsnTrp: 0.539 ± 0.009
1.643AsnTyr: 1.643 ± 0.016
0.001AsnXaa: 0.001 ± 0.0
Pro
4.717ProAla: 4.717 ± 0.04
0.755ProCys: 0.755 ± 0.052
2.345ProAsp: 2.345 ± 0.02
3.217ProGlu: 3.217 ± 0.033
1.532ProPhe: 1.532 ± 0.019
2.739ProGly: 2.739 ± 0.047
1.456ProHis: 1.456 ± 0.017
2.39ProIle: 2.39 ± 0.021
2.88ProLys: 2.88 ± 0.028
4.509ProLeu: 4.509 ± 0.033
1.166ProMet: 1.166 ± 0.014
2.415ProAsn: 2.415 ± 0.025
4.685ProPro: 4.685 ± 0.066
2.972ProGln: 2.972 ± 0.03
2.453ProArg: 2.453 ± 0.024
4.191ProSer: 4.191 ± 0.044
3.645ProThr: 3.645 ± 0.04
3.104ProVal: 3.104 ± 0.028
0.439ProTrp: 0.439 ± 0.009
1.432ProTyr: 1.432 ± 0.018
0.001ProXaa: 0.001 ± 0.0
Gln
3.468GlnAla: 3.468 ± 0.032
0.871GlnCys: 0.871 ± 0.028
1.758GlnAsp: 1.758 ± 0.018
3.059GlnGlu: 3.059 ± 0.032
1.525GlnPhe: 1.525 ± 0.016
1.798GlnGly: 1.798 ± 0.02
2.185GlnHis: 2.185 ± 0.029
2.101GlnIle: 2.101 ± 0.02
2.501GlnLys: 2.501 ± 0.025
6.518GlnLeu: 6.518 ± 0.053
1.302GlnMet: 1.302 ± 0.016
1.908GlnAsn: 1.908 ± 0.018
3.378GlnPro: 3.378 ± 0.036
12.596GlnGln: 12.596 ± 0.244
3.71GlnArg: 3.71 ± 0.03
3.644GlnSer: 3.644 ± 0.029
2.562GlnThr: 2.562 ± 0.023
2.412GlnVal: 2.412 ± 0.021
0.462GlnTrp: 0.462 ± 0.009
1.299GlnTyr: 1.299 ± 0.015
0.003GlnXaa: 0.003 ± 0.001
Arg
3.574ArgAla: 3.574 ± 0.025
1.141ArgCys: 1.141 ± 0.023
2.709ArgAsp: 2.709 ± 0.026
3.465ArgGlu: 3.465 ± 0.035
1.938ArgPhe: 1.938 ± 0.018
2.667ArgGly: 2.667 ± 0.028
1.637ArgHis: 1.637 ± 0.021
2.753ArgIle: 2.753 ± 0.02
3.396ArgLys: 3.396 ± 0.027
5.266ArgLeu: 5.266 ± 0.038
1.259ArgMet: 1.259 ± 0.014
2.763ArgAsn: 2.763 ± 0.022
2.664ArgPro: 2.664 ± 0.026
3.129ArgGln: 3.129 ± 0.029
4.5ArgArg: 4.5 ± 0.044
4.365ArgSer: 4.365 ± 0.036
2.74ArgThr: 2.74 ± 0.021
2.791ArgVal: 2.791 ± 0.022
0.562ArgTrp: 0.562 ± 0.008
1.717ArgTyr: 1.717 ± 0.018
0.001ArgXaa: 0.001 ± 0.0
Ser
6.332SerAla: 6.332 ± 0.044
1.581SerCys: 1.581 ± 0.033
3.941SerAsp: 3.941 ± 0.03
4.295SerGlu: 4.295 ± 0.032
2.536SerPhe: 2.536 ± 0.026
5.267SerGly: 5.267 ± 0.042
1.926SerHis: 1.926 ± 0.02
3.71SerIle: 3.71 ± 0.023
3.996SerLys: 3.996 ± 0.029
6.597SerLeu: 6.597 ± 0.041
1.789SerMet: 1.789 ± 0.019
5.016SerAsn: 5.016 ± 0.05
4.226SerPro: 4.226 ± 0.044
3.427SerGln: 3.427 ± 0.028
3.851SerArg: 3.851 ± 0.037
11.016SerSer: 11.016 ± 0.139
5.337SerThr: 5.337 ± 0.044
4.274SerVal: 4.274 ± 0.027
0.741SerTrp: 0.741 ± 0.011
2.336SerTyr: 2.336 ± 0.024
0.002SerXaa: 0.002 ± 0.0
Thr
5.528ThrAla: 5.528 ± 0.048
0.969ThrCys: 0.969 ± 0.02
2.622ThrAsp: 2.622 ± 0.021
3.235ThrGlu: 3.235 ± 0.032
1.745ThrPhe: 1.745 ± 0.018
3.147ThrGly: 3.147 ± 0.027
1.356ThrHis: 1.356 ± 0.014
2.762ThrIle: 2.762 ± 0.023
2.755ThrLys: 2.755 ± 0.027
5.082ThrLeu: 5.082 ± 0.029
1.206ThrMet: 1.206 ± 0.012
2.568ThrAsn: 2.568 ± 0.025
3.991ThrPro: 3.991 ± 0.041
2.458ThrGln: 2.458 ± 0.021
2.579ThrArg: 2.579 ± 0.021
4.681ThrSer: 4.681 ± 0.039
5.522ThrThr: 5.522 ± 0.098
3.35ThrVal: 3.35 ± 0.027
0.472ThrTrp: 0.472 ± 0.01
1.488ThrTyr: 1.488 ± 0.017
0.002ThrXaa: 0.002 ± 0.001
Val
4.82ValAla: 4.82 ± 0.033
1.207ValCys: 1.207 ± 0.028
3.179ValAsp: 3.179 ± 0.022
3.693ValGlu: 3.693 ± 0.035
1.948ValPhe: 1.948 ± 0.02
3.29ValGly: 3.29 ± 0.024
1.463ValHis: 1.463 ± 0.015
2.779ValIle: 2.779 ± 0.024
2.968ValLys: 2.968 ± 0.026
5.298ValLeu: 5.298 ± 0.035
1.25ValMet: 1.25 ± 0.017
2.596ValAsn: 2.596 ± 0.023
3.139ValPro: 3.139 ± 0.025
2.651ValGln: 2.651 ± 0.025
3.026ValArg: 3.026 ± 0.023
4.021ValSer: 4.021 ± 0.025
3.091ValThr: 3.091 ± 0.028
3.746ValVal: 3.746 ± 0.032
0.577ValTrp: 0.577 ± 0.011
1.713ValTyr: 1.713 ± 0.019
0.002ValXaa: 0.002 ± 0.0
Trp
0.562TrpAla: 0.562 ± 0.01
0.194TrpCys: 0.194 ± 0.005
0.482TrpAsp: 0.482 ± 0.01
0.502TrpGlu: 0.502 ± 0.009
0.369TrpPhe: 0.369 ± 0.007
0.491TrpGly: 0.491 ± 0.01
0.305TrpHis: 0.305 ± 0.008
0.51TrpIle: 0.51 ± 0.009
0.526TrpLys: 0.526 ± 0.009
1.164TrpLeu: 1.164 ± 0.014
0.258TrpMet: 0.258 ± 0.006
0.474TrpAsn: 0.474 ± 0.009
0.436TrpPro: 0.436 ± 0.009
0.589TrpGln: 0.589 ± 0.01
0.702TrpArg: 0.702 ± 0.011
0.767TrpSer: 0.767 ± 0.012
0.543TrpThr: 0.543 ± 0.01
0.48TrpVal: 0.48 ± 0.01
0.155TrpTrp: 0.155 ± 0.005
0.306TrpTyr: 0.306 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.138TyrAla: 2.138 ± 0.021
0.681TyrCys: 0.681 ± 0.009
1.779TyrAsp: 1.779 ± 0.02
1.885TyrGlu: 1.885 ± 0.018
1.277TyrPhe: 1.277 ± 0.016
1.896TyrGly: 1.896 ± 0.025
0.797TyrHis: 0.797 ± 0.011
1.466TyrIle: 1.466 ± 0.016
1.607TyrLys: 1.607 ± 0.018
2.674TyrLeu: 2.674 ± 0.023
0.808TyrMet: 0.808 ± 0.012
1.596TyrAsn: 1.596 ± 0.018
1.302TyrPro: 1.302 ± 0.017
1.367TyrGln: 1.367 ± 0.018
1.705TyrArg: 1.705 ± 0.02
2.117TyrSer: 2.117 ± 0.019
1.605TyrThr: 1.605 ± 0.016
1.849TyrVal: 1.849 ± 0.019
0.357TyrTrp: 0.357 ± 0.009
1.299TyrTyr: 1.299 ± 0.028
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.003XaaGln: 0.003 ± 0.001
0.001XaaArg: 0.001 ± 0.0
0.003XaaSer: 0.003 ± 0.001
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.155XaaXaa: 0.155 ± 0.028
Statistics based on 15505 proteins (7239635 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski