Amino acid dipepetide frequency for Daphnia pulex (Water flea)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.176AlaAla: 7.176 ± 0.054
1.111AlaCys: 1.111 ± 0.013
3.563AlaAsp: 3.563 ± 0.022
4.303AlaGlu: 4.303 ± 0.028
2.51AlaPhe: 2.51 ± 0.019
4.062AlaGly: 4.062 ± 0.027
1.397AlaHis: 1.397 ± 0.012
3.736AlaIle: 3.736 ± 0.022
3.729AlaLys: 3.729 ± 0.024
6.058AlaLeu: 6.058 ± 0.028
1.556AlaMet: 1.556 ± 0.013
3.002AlaAsn: 3.002 ± 0.018
3.736AlaPro: 3.736 ± 0.04
2.678AlaGln: 2.678 ± 0.022
3.28AlaArg: 3.28 ± 0.022
5.598AlaSer: 5.598 ± 0.027
4.367AlaThr: 4.367 ± 0.024
5.054AlaVal: 5.054 ± 0.027
0.67AlaTrp: 0.67 ± 0.009
1.775AlaTyr: 1.775 ± 0.019
0.001AlaXaa: 0.001 ± 0.0
Cys
1.076CysAla: 1.076 ± 0.013
0.54CysCys: 0.54 ± 0.009
1.083CysAsp: 1.083 ± 0.014
0.975CysGlu: 0.975 ± 0.012
0.796CysPhe: 0.796 ± 0.009
1.269CysGly: 1.269 ± 0.016
0.534CysHis: 0.534 ± 0.009
0.958CysIle: 0.958 ± 0.013
0.949CysLys: 0.949 ± 0.012
1.877CysLeu: 1.877 ± 0.014
0.381CysMet: 0.381 ± 0.007
0.846CysAsn: 0.846 ± 0.01
1.127CysPro: 1.127 ± 0.018
0.9CysGln: 0.9 ± 0.014
1.22CysArg: 1.22 ± 0.015
1.638CysSer: 1.638 ± 0.019
1.01CysThr: 1.01 ± 0.013
1.203CysVal: 1.203 ± 0.015
0.284CysTrp: 0.284 ± 0.007
0.62CysTyr: 0.62 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
3.325AspAla: 3.325 ± 0.021
1.062AspCys: 1.062 ± 0.013
3.666AspAsp: 3.666 ± 0.025
4.152AspGlu: 4.152 ± 0.026
2.316AspPhe: 2.316 ± 0.016
3.535AspGly: 3.535 ± 0.021
1.216AspHis: 1.216 ± 0.011
2.844AspIle: 2.844 ± 0.018
2.582AspLys: 2.582 ± 0.017
4.792AspLeu: 4.792 ± 0.023
1.111AspMet: 1.111 ± 0.009
2.195AspAsn: 2.195 ± 0.018
2.673AspPro: 2.673 ± 0.019
2.07AspGln: 2.07 ± 0.017
2.629AspArg: 2.629 ± 0.02
4.186AspSer: 4.186 ± 0.027
2.328AspThr: 2.328 ± 0.016
3.772AspVal: 3.772 ± 0.031
0.755AspTrp: 0.755 ± 0.009
1.755AspTyr: 1.755 ± 0.015
0.0AspXaa: 0.0 ± 0.0
Glu
4.404GluAla: 4.404 ± 0.028
1.092GluCys: 1.092 ± 0.017
3.902GluAsp: 3.902 ± 0.024
6.339GluGlu: 6.339 ± 0.05
2.244GluPhe: 2.244 ± 0.016
2.889GluGly: 2.889 ± 0.02
1.235GluHis: 1.235 ± 0.011
3.727GluIle: 3.727 ± 0.024
4.513GluLys: 4.513 ± 0.033
5.613GluLeu: 5.613 ± 0.03
1.703GluMet: 1.703 ± 0.014
3.317GluAsn: 3.317 ± 0.021
2.6GluPro: 2.6 ± 0.025
2.531GluGln: 2.531 ± 0.02
3.51GluArg: 3.51 ± 0.025
4.619GluSer: 4.619 ± 0.028
3.728GluThr: 3.728 ± 0.023
3.639GluVal: 3.639 ± 0.034
0.758GluTrp: 0.758 ± 0.009
1.692GluTyr: 1.692 ± 0.016
0.001GluXaa: 0.001 ± 0.0
Phe
2.397PheAla: 2.397 ± 0.019
0.867PheCys: 0.867 ± 0.01
2.22PheAsp: 2.22 ± 0.016
2.131PheGlu: 2.131 ± 0.017
1.843PhePhe: 1.843 ± 0.017
2.424PheGly: 2.424 ± 0.019
1.126PheHis: 1.126 ± 0.012
2.153PheIle: 2.153 ± 0.018
2.007PheLys: 2.007 ± 0.016
3.847PheLeu: 3.847 ± 0.021
0.868PheMet: 0.868 ± 0.011
1.885PheAsn: 1.885 ± 0.017
1.898PhePro: 1.898 ± 0.014
1.812PheGln: 1.812 ± 0.015
2.133PheArg: 2.133 ± 0.014
3.249PheSer: 3.249 ± 0.02
2.199PheThr: 2.199 ± 0.018
2.705PheVal: 2.705 ± 0.018
0.496PheTrp: 0.496 ± 0.008
1.358PheTyr: 1.358 ± 0.012
0.0PheXaa: 0.0 ± 0.0
Gly
3.464GlyAla: 3.464 ± 0.026
1.071GlyCys: 1.071 ± 0.012
2.907GlyAsp: 2.907 ± 0.025
3.093GlyGlu: 3.093 ± 0.021
2.349GlyPhe: 2.349 ± 0.02
4.736GlyGly: 4.736 ± 0.051
1.51GlyHis: 1.51 ± 0.014
3.021GlyIle: 3.021 ± 0.022
3.296GlyLys: 3.296 ± 0.022
4.773GlyLeu: 4.773 ± 0.029
1.266GlyMet: 1.266 ± 0.014
2.712GlyAsn: 2.712 ± 0.021
2.594GlyPro: 2.594 ± 0.034
2.416GlyGln: 2.416 ± 0.021
3.436GlyArg: 3.436 ± 0.024
4.982GlySer: 4.982 ± 0.036
3.098GlyThr: 3.098 ± 0.02
3.499GlyVal: 3.499 ± 0.023
0.782GlyTrp: 0.782 ± 0.01
1.919GlyTyr: 1.919 ± 0.021
0.001GlyXaa: 0.001 ± 0.0
His
1.285HisAla: 1.285 ± 0.012
0.549HisCys: 0.549 ± 0.009
1.045HisAsp: 1.045 ± 0.011
1.223HisGlu: 1.223 ± 0.012
1.182HisPhe: 1.182 ± 0.011
1.429HisGly: 1.429 ± 0.014
1.19HisHis: 1.19 ± 0.022
1.183HisIle: 1.183 ± 0.013
1.035HisLys: 1.035 ± 0.01
2.604HisLeu: 2.604 ± 0.018
0.499HisMet: 0.499 ± 0.008
0.984HisAsn: 0.984 ± 0.01
1.565HisPro: 1.565 ± 0.015
1.401HisGln: 1.401 ± 0.014
1.498HisArg: 1.498 ± 0.014
1.977HisSer: 1.977 ± 0.015
1.112HisThr: 1.112 ± 0.01
1.482HisVal: 1.482 ± 0.012
0.343HisTrp: 0.343 ± 0.007
0.863HisTyr: 0.863 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
3.471IleAla: 3.471 ± 0.02
1.107IleCys: 1.107 ± 0.011
3.011IleAsp: 3.011 ± 0.018
3.28IleGlu: 3.28 ± 0.02
2.221IlePhe: 2.221 ± 0.016
2.889IleGly: 2.889 ± 0.02
1.392IleHis: 1.392 ± 0.013
2.884IleIle: 2.884 ± 0.023
2.856IleLys: 2.856 ± 0.021
4.888IleLeu: 4.888 ± 0.028
1.078IleMet: 1.078 ± 0.01
2.463IleAsn: 2.463 ± 0.016
3.0IlePro: 3.0 ± 0.02
2.443IleGln: 2.443 ± 0.018
2.963IleArg: 2.963 ± 0.017
4.186IleSer: 4.186 ± 0.02
3.014IleThr: 3.014 ± 0.019
3.483IleVal: 3.483 ± 0.021
0.601IleTrp: 0.601 ± 0.007
1.547IleTyr: 1.547 ± 0.014
0.001IleXaa: 0.001 ± 0.0
Lys
3.659LysAla: 3.659 ± 0.025
1.057LysCys: 1.057 ± 0.013
2.842LysAsp: 2.842 ± 0.02
4.29LysGlu: 4.29 ± 0.034
2.117LysPhe: 2.117 ± 0.016
2.697LysGly: 2.697 ± 0.025
1.197LysHis: 1.197 ± 0.012
3.267LysIle: 3.267 ± 0.021
4.676LysLys: 4.676 ± 0.045
5.221LysLeu: 5.221 ± 0.027
1.563LysMet: 1.563 ± 0.016
2.795LysAsn: 2.795 ± 0.018
2.785LysPro: 2.785 ± 0.022
2.278LysGln: 2.278 ± 0.018
3.435LysArg: 3.435 ± 0.02
4.286LysSer: 4.286 ± 0.027
3.357LysThr: 3.357 ± 0.023
3.249LysVal: 3.249 ± 0.02
0.698LysTrp: 0.698 ± 0.009
1.726LysTyr: 1.726 ± 0.016
0.001LysXaa: 0.001 ± 0.0
Leu
6.603LeuAla: 6.603 ± 0.032
1.77LeuCys: 1.77 ± 0.016
4.792LeuAsp: 4.792 ± 0.027
5.828LeuGlu: 5.828 ± 0.032
3.492LeuPhe: 3.492 ± 0.023
4.527LeuGly: 4.527 ± 0.024
2.348LeuHis: 2.348 ± 0.018
4.711LeuIle: 4.711 ± 0.028
5.673LeuLys: 5.673 ± 0.031
9.398LeuLeu: 9.398 ± 0.049
2.186LeuMet: 2.186 ± 0.015
4.172LeuAsn: 4.172 ± 0.021
5.127LeuPro: 5.127 ± 0.028
4.249LeuGln: 4.249 ± 0.027
5.155LeuArg: 5.155 ± 0.025
7.274LeuSer: 7.274 ± 0.033
5.209LeuThr: 5.209 ± 0.025
5.725LeuVal: 5.725 ± 0.025
1.004LeuTrp: 1.004 ± 0.011
2.362LeuTyr: 2.362 ± 0.02
0.002LeuXaa: 0.002 ± 0.0
Met
2.089MetAla: 2.089 ± 0.016
0.363MetCys: 0.363 ± 0.006
1.403MetAsp: 1.403 ± 0.012
1.792MetGlu: 1.792 ± 0.015
0.78MetPhe: 0.78 ± 0.01
1.235MetGly: 1.235 ± 0.016
0.435MetHis: 0.435 ± 0.007
1.076MetIle: 1.076 ± 0.011
1.509MetLys: 1.509 ± 0.012
1.904MetLeu: 1.904 ± 0.016
0.657MetMet: 0.657 ± 0.01
1.041MetAsn: 1.041 ± 0.01
1.087MetPro: 1.087 ± 0.012
0.857MetGln: 0.857 ± 0.011
1.087MetArg: 1.087 ± 0.01
1.764MetSer: 1.764 ± 0.014
1.423MetThr: 1.423 ± 0.014
1.321MetVal: 1.321 ± 0.013
0.252MetTrp: 0.252 ± 0.005
0.534MetTyr: 0.534 ± 0.008
0.001MetXaa: 0.001 ± 0.0
Asn
2.861AsnAla: 2.861 ± 0.019
0.978AsnCys: 0.978 ± 0.013
2.327AsnAsp: 2.327 ± 0.016
2.856AsnGlu: 2.856 ± 0.017
1.982AsnPhe: 1.982 ± 0.014
3.002AsnGly: 3.002 ± 0.021
1.156AsnHis: 1.156 ± 0.011
2.481AsnIle: 2.481 ± 0.018
2.516AsnLys: 2.516 ± 0.016
4.276AsnLeu: 4.276 ± 0.024
0.992AsnMet: 0.992 ± 0.011
2.678AsnAsn: 2.678 ± 0.023
2.573AsnPro: 2.573 ± 0.018
2.178AsnGln: 2.178 ± 0.017
2.509AsnArg: 2.509 ± 0.017
3.806AsnSer: 3.806 ± 0.022
2.247AsnThr: 2.247 ± 0.016
3.037AsnVal: 3.037 ± 0.018
0.602AsnTrp: 0.602 ± 0.007
1.496AsnTyr: 1.496 ± 0.013
0.0AsnXaa: 0.0 ± 0.0
Pro
4.303ProAla: 4.303 ± 0.036
0.84ProCys: 0.84 ± 0.015
2.639ProAsp: 2.639 ± 0.018
3.34ProGlu: 3.34 ± 0.021
2.036ProPhe: 2.036 ± 0.017
2.991ProGly: 2.991 ± 0.036
1.14ProHis: 1.14 ± 0.012
2.751ProIle: 2.751 ± 0.018
2.811ProLys: 2.811 ± 0.021
4.378ProLeu: 4.378 ± 0.025
1.054ProMet: 1.054 ± 0.016
2.328ProAsn: 2.328 ± 0.017
4.29ProPro: 4.29 ± 0.049
2.052ProGln: 2.052 ± 0.018
2.499ProArg: 2.499 ± 0.019
5.297ProSer: 5.297 ± 0.037
3.657ProThr: 3.657 ± 0.029
3.956ProVal: 3.956 ± 0.027
0.544ProTrp: 0.544 ± 0.008
1.398ProTyr: 1.398 ± 0.014
0.001ProXaa: 0.001 ± 0.0
Gln
2.667GlnAla: 2.667 ± 0.022
0.821GlnCys: 0.821 ± 0.012
1.856GlnAsp: 1.856 ± 0.013
2.693GlnGlu: 2.693 ± 0.022
1.597GlnPhe: 1.597 ± 0.015
1.933GlnGly: 1.933 ± 0.017
1.334GlnHis: 1.334 ± 0.014
2.221GlnIle: 2.221 ± 0.013
2.338GlnLys: 2.338 ± 0.018
4.669GlnLeu: 4.669 ± 0.028
1.136GlnMet: 1.136 ± 0.012
1.833GlnAsn: 1.833 ± 0.017
2.679GlnPro: 2.679 ± 0.026
4.016GlnGln: 4.016 ± 0.059
2.652GlnArg: 2.652 ± 0.017
3.364GlnSer: 3.364 ± 0.022
2.382GlnThr: 2.382 ± 0.017
2.524GlnVal: 2.524 ± 0.017
0.545GlnTrp: 0.545 ± 0.007
1.14GlnTyr: 1.14 ± 0.013
0.001GlnXaa: 0.001 ± 0.0
Arg
3.162ArgAla: 3.162 ± 0.021
1.082ArgCys: 1.082 ± 0.013
2.714ArgAsp: 2.714 ± 0.018
3.256ArgGlu: 3.256 ± 0.023
2.173ArgPhe: 2.173 ± 0.016
2.821ArgGly: 2.821 ± 0.023
1.549ArgHis: 1.549 ± 0.013
2.957ArgIle: 2.957 ± 0.018
3.558ArgLys: 3.558 ± 0.024
5.304ArgLeu: 5.304 ± 0.027
1.287ArgMet: 1.287 ± 0.01
2.71ArgAsn: 2.71 ± 0.015
2.991ArgPro: 2.991 ± 0.025
2.739ArgGln: 2.739 ± 0.021
4.282ArgArg: 4.282 ± 0.03
4.374ArgSer: 4.374 ± 0.029
2.803ArgThr: 2.803 ± 0.019
3.081ArgVal: 3.081 ± 0.019
0.7ArgTrp: 0.7 ± 0.009
1.593ArgTyr: 1.593 ± 0.014
0.0ArgXaa: 0.0 ± 0.0
Ser
5.704SerAla: 5.704 ± 0.031
1.583SerCys: 1.583 ± 0.017
4.24SerAsp: 4.24 ± 0.023
4.353SerGlu: 4.353 ± 0.029
3.326SerPhe: 3.326 ± 0.019
5.044SerGly: 5.044 ± 0.029
1.936SerHis: 1.936 ± 0.016
4.078SerIle: 4.078 ± 0.022
4.219SerLys: 4.219 ± 0.023
7.38SerLeu: 7.38 ± 0.028
1.669SerMet: 1.669 ± 0.013
3.866SerAsn: 3.866 ± 0.025
4.645SerPro: 4.645 ± 0.032
3.346SerGln: 3.346 ± 0.021
4.469SerArg: 4.469 ± 0.027
9.868SerSer: 9.868 ± 0.069
5.501SerThr: 5.501 ± 0.035
5.188SerVal: 5.188 ± 0.025
0.939SerTrp: 0.939 ± 0.011
2.451SerTyr: 2.451 ± 0.023
0.001SerXaa: 0.001 ± 0.0
Thr
4.356ThrAla: 4.356 ± 0.022
1.18ThrCys: 1.18 ± 0.016
2.879ThrAsp: 2.879 ± 0.017
3.59ThrGlu: 3.59 ± 0.024
2.27ThrPhe: 2.27 ± 0.016
3.43ThrGly: 3.43 ± 0.024
1.234ThrHis: 1.234 ± 0.013
3.094ThrIle: 3.094 ± 0.021
3.201ThrLys: 3.201 ± 0.025
5.101ThrLeu: 5.101 ± 0.023
1.142ThrMet: 1.142 ± 0.01
2.645ThrAsn: 2.645 ± 0.018
3.59ThrPro: 3.59 ± 0.024
2.1ThrGln: 2.1 ± 0.015
2.747ThrArg: 2.747 ± 0.018
5.267ThrSer: 5.267 ± 0.029
5.276ThrThr: 5.276 ± 0.071
3.938ThrVal: 3.938 ± 0.021
0.656ThrTrp: 0.656 ± 0.008
1.646ThrTyr: 1.646 ± 0.014
0.001ThrXaa: 0.001 ± 0.0
Val
4.851ValAla: 4.851 ± 0.026
1.305ValCys: 1.305 ± 0.015
3.703ValAsp: 3.703 ± 0.033
4.296ValGlu: 4.296 ± 0.03
2.51ValPhe: 2.51 ± 0.02
3.428ValGly: 3.428 ± 0.031
1.496ValHis: 1.496 ± 0.012
3.486ValIle: 3.486 ± 0.019
3.503ValLys: 3.503 ± 0.021
5.668ValLeu: 5.668 ± 0.031
1.471ValMet: 1.471 ± 0.014
2.988ValAsn: 2.988 ± 0.019
3.292ValPro: 3.292 ± 0.024
2.617ValGln: 2.617 ± 0.017
3.23ValArg: 3.23 ± 0.022
4.897ValSer: 4.897 ± 0.025
3.989ValThr: 3.989 ± 0.023
4.797ValVal: 4.797 ± 0.04
0.737ValTrp: 0.737 ± 0.009
1.839ValTyr: 1.839 ± 0.017
0.001ValXaa: 0.001 ± 0.0
Trp
0.695TrpAla: 0.695 ± 0.009
0.241TrpCys: 0.241 ± 0.006
0.625TrpAsp: 0.625 ± 0.01
0.679TrpGlu: 0.679 ± 0.01
0.478TrpPhe: 0.478 ± 0.007
0.58TrpGly: 0.58 ± 0.009
0.273TrpHis: 0.273 ± 0.006
0.786TrpIle: 0.786 ± 0.01
0.825TrpLys: 0.825 ± 0.01
1.185TrpLeu: 1.185 ± 0.012
0.358TrpMet: 0.358 ± 0.006
0.725TrpAsn: 0.725 ± 0.009
0.532TrpPro: 0.532 ± 0.008
0.468TrpGln: 0.468 ± 0.008
0.742TrpArg: 0.742 ± 0.01
0.864TrpSer: 0.864 ± 0.011
0.767TrpThr: 0.767 ± 0.009
0.615TrpVal: 0.615 ± 0.008
0.192TrpTrp: 0.192 ± 0.005
0.338TrpTyr: 0.338 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.814TyrAla: 1.814 ± 0.02
0.665TyrCys: 0.665 ± 0.009
1.546TyrAsp: 1.546 ± 0.015
1.586TyrGlu: 1.586 ± 0.014
1.397TyrPhe: 1.397 ± 0.013
2.0TyrGly: 2.0 ± 0.024
0.749TyrHis: 0.749 ± 0.01
1.363TyrIle: 1.363 ± 0.013
1.398TyrLys: 1.398 ± 0.015
2.669TyrLeu: 2.669 ± 0.018
0.634TyrMet: 0.634 ± 0.008
1.369TyrAsn: 1.369 ± 0.013
1.519TyrPro: 1.519 ± 0.017
1.249TyrGln: 1.249 ± 0.013
1.661TyrArg: 1.661 ± 0.016
2.311TyrSer: 2.311 ± 0.018
1.877TyrThr: 1.877 ± 0.028
1.835TyrVal: 1.835 ± 0.015
0.41TyrTrp: 0.41 ± 0.007
1.471TyrTyr: 1.471 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30137 proteins (9947364 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski