Amino acid dipepetide frequency for Oreochromis niloticus (Nile tilapia) (Tilapia nilotica)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.77AlaAla: 5.77 ± 0.017
1.265AlaCys: 1.265 ± 0.006
3.117AlaAsp: 3.117 ± 0.009
4.449AlaGlu: 4.449 ± 0.014
2.464AlaPhe: 2.464 ± 0.008
4.067AlaGly: 4.067 ± 0.012
1.445AlaHis: 1.445 ± 0.006
2.842AlaIle: 2.842 ± 0.009
3.39AlaLys: 3.39 ± 0.012
6.408AlaLeu: 6.408 ± 0.015
1.532AlaMet: 1.532 ± 0.006
2.222AlaAsn: 2.222 ± 0.008
3.112AlaPro: 3.112 ± 0.011
2.796AlaGln: 2.796 ± 0.01
2.995AlaArg: 2.995 ± 0.009
5.143AlaSer: 5.143 ± 0.014
3.391AlaThr: 3.391 ± 0.013
4.89AlaVal: 4.89 ± 0.012
0.65AlaTrp: 0.65 ± 0.003
1.6AlaTyr: 1.6 ± 0.007
0.0AlaXaa: 0.0 ± 0.0
Cys
1.205CysAla: 1.205 ± 0.007
0.652CysCys: 0.652 ± 0.005
1.226CysAsp: 1.226 ± 0.009
1.31CysGlu: 1.31 ± 0.007
0.991CysPhe: 0.991 ± 0.005
1.618CysGly: 1.618 ± 0.01
0.67CysHis: 0.67 ± 0.005
1.09CysIle: 1.09 ± 0.007
1.234CysLys: 1.234 ± 0.006
2.254CysLeu: 2.254 ± 0.008
0.474CysMet: 0.474 ± 0.004
0.898CysAsn: 0.898 ± 0.005
1.252CysPro: 1.252 ± 0.008
1.043CysGln: 1.043 ± 0.007
1.296CysArg: 1.296 ± 0.006
2.142CysSer: 2.142 ± 0.009
1.258CysThr: 1.258 ± 0.007
1.693CysVal: 1.693 ± 0.012
0.312CysTrp: 0.312 ± 0.003
0.67CysTyr: 0.67 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
2.891AspAla: 2.891 ± 0.008
1.183AspCys: 1.183 ± 0.007
3.059AspAsp: 3.059 ± 0.012
3.685AspGlu: 3.685 ± 0.011
2.259AspPhe: 2.259 ± 0.007
3.606AspGly: 3.606 ± 0.012
1.239AspHis: 1.239 ± 0.005
2.876AspIle: 2.876 ± 0.011
2.79AspLys: 2.79 ± 0.009
5.098AspLeu: 5.098 ± 0.014
1.275AspMet: 1.275 ± 0.005
2.008AspAsn: 2.008 ± 0.008
2.8AspPro: 2.8 ± 0.01
1.971AspGln: 1.971 ± 0.007
2.662AspArg: 2.662 ± 0.009
4.244AspSer: 4.244 ± 0.013
2.672AspThr: 2.672 ± 0.008
3.318AspVal: 3.318 ± 0.011
0.709AspTrp: 0.709 ± 0.005
1.656AspTyr: 1.656 ± 0.006
0.0AspXaa: 0.0 ± 0.0
Glu
4.558GluAla: 4.558 ± 0.015
1.291GluCys: 1.291 ± 0.008
4.207GluAsp: 4.207 ± 0.011
7.14GluGlu: 7.14 ± 0.027
2.033GluPhe: 2.033 ± 0.008
4.038GluGly: 4.038 ± 0.012
1.462GluHis: 1.462 ± 0.006
3.037GluIle: 3.037 ± 0.01
4.74GluLys: 4.74 ± 0.016
6.192GluLeu: 6.192 ± 0.016
1.745GluMet: 1.745 ± 0.007
2.831GluAsn: 2.831 ± 0.008
2.665GluPro: 2.665 ± 0.019
3.0GluGln: 3.0 ± 0.011
4.093GluArg: 4.093 ± 0.014
4.266GluSer: 4.266 ± 0.016
3.464GluThr: 3.464 ± 0.012
4.341GluVal: 4.341 ± 0.011
0.76GluTrp: 0.76 ± 0.005
1.683GluTyr: 1.683 ± 0.007
0.0GluXaa: 0.0 ± 0.0
Phe
2.008PheAla: 2.008 ± 0.007
1.026PheCys: 1.026 ± 0.005
1.881PheAsp: 1.881 ± 0.007
1.947PheGlu: 1.947 ± 0.007
1.848PhePhe: 1.848 ± 0.008
2.268PheGly: 2.268 ± 0.009
1.082PheHis: 1.082 ± 0.005
2.177PheIle: 2.177 ± 0.008
2.023PheLys: 2.023 ± 0.007
4.166PheLeu: 4.166 ± 0.013
0.884PheMet: 0.884 ± 0.004
1.639PheAsn: 1.639 ± 0.006
1.83PhePro: 1.83 ± 0.01
1.685PheGln: 1.685 ± 0.007
1.909PheArg: 1.909 ± 0.007
3.486PheSer: 3.486 ± 0.01
2.516PheThr: 2.516 ± 0.008
2.389PheVal: 2.389 ± 0.008
0.502PheTrp: 0.502 ± 0.003
1.335PheTyr: 1.335 ± 0.005
0.0PheXaa: 0.0 ± 0.0
Gly
3.705GlyAla: 3.705 ± 0.014
1.248GlyCys: 1.248 ± 0.006
3.159GlyAsp: 3.159 ± 0.01
3.953GlyGlu: 3.953 ± 0.012
2.457GlyPhe: 2.457 ± 0.009
4.806GlyGly: 4.806 ± 0.019
1.606GlyHis: 1.606 ± 0.007
2.789GlyIle: 2.789 ± 0.009
3.651GlyLys: 3.651 ± 0.011
5.393GlyLeu: 5.393 ± 0.013
1.453GlyMet: 1.453 ± 0.006
2.522GlyAsn: 2.522 ± 0.009
2.975GlyPro: 2.975 ± 0.019
2.629GlyGln: 2.629 ± 0.01
3.449GlyArg: 3.449 ± 0.011
5.279GlySer: 5.279 ± 0.015
3.392GlyThr: 3.392 ± 0.01
4.029GlyVal: 4.029 ± 0.01
0.781GlyTrp: 0.781 ± 0.004
1.884GlyTyr: 1.884 ± 0.009
0.0GlyXaa: 0.0 ± 0.0
His
1.318HisAla: 1.318 ± 0.006
0.807HisCys: 0.807 ± 0.005
0.987HisAsp: 0.987 ± 0.004
1.271HisGlu: 1.271 ± 0.006
1.119HisPhe: 1.119 ± 0.006
1.475HisGly: 1.475 ± 0.007
0.99HisHis: 0.99 ± 0.007
1.433HisIle: 1.433 ± 0.007
1.315HisLys: 1.315 ± 0.006
2.775HisLeu: 2.775 ± 0.009
0.648HisMet: 0.648 ± 0.005
1.047HisAsn: 1.047 ± 0.005
1.543HisPro: 1.543 ± 0.008
1.237HisGln: 1.237 ± 0.006
1.569HisArg: 1.569 ± 0.006
2.385HisSer: 2.385 ± 0.011
1.671HisThr: 1.671 ± 0.012
1.489HisVal: 1.489 ± 0.006
0.353HisTrp: 0.353 ± 0.002
0.86HisTyr: 0.86 ± 0.004
0.0HisXaa: 0.0 ± 0.0
Ile
2.651IleAla: 2.651 ± 0.009
1.172IleCys: 1.172 ± 0.006
2.275IleAsp: 2.275 ± 0.008
2.538IleGlu: 2.538 ± 0.008
2.113IlePhe: 2.113 ± 0.009
2.423IleGly: 2.423 ± 0.009
1.431IleHis: 1.431 ± 0.008
2.749IleIle: 2.749 ± 0.011
2.824IleLys: 2.824 ± 0.008
4.643IleLeu: 4.643 ± 0.012
1.145IleMet: 1.145 ± 0.005
2.202IleAsn: 2.202 ± 0.008
2.51IlePro: 2.51 ± 0.011
2.33IleGln: 2.33 ± 0.009
2.608IleArg: 2.608 ± 0.008
4.012IleSer: 4.012 ± 0.01
3.102IleThr: 3.102 ± 0.01
2.739IleVal: 2.739 ± 0.01
0.519IleTrp: 0.519 ± 0.003
1.656IleTyr: 1.656 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
3.803LysAla: 3.803 ± 0.012
1.104LysCys: 1.104 ± 0.006
3.337LysAsp: 3.337 ± 0.009
4.752LysGlu: 4.752 ± 0.015
1.739LysPhe: 1.739 ± 0.007
3.214LysGly: 3.214 ± 0.013
1.473LysHis: 1.473 ± 0.006
2.771LysIle: 2.771 ± 0.01
4.54LysLys: 4.54 ± 0.016
5.294LysLeu: 5.294 ± 0.012
1.507LysMet: 1.507 ± 0.006
2.488LysAsn: 2.488 ± 0.008
2.81LysPro: 2.81 ± 0.01
2.636LysGln: 2.636 ± 0.008
3.315LysArg: 3.315 ± 0.011
3.92LysSer: 3.92 ± 0.013
3.404LysThr: 3.404 ± 0.009
3.682LysVal: 3.682 ± 0.011
0.62LysTrp: 0.62 ± 0.004
1.647LysTyr: 1.647 ± 0.007
0.0LysXaa: 0.0 ± 0.0
Leu
5.982LeuAla: 5.982 ± 0.015
2.338LeuCys: 2.338 ± 0.009
4.978LeuAsp: 4.978 ± 0.011
6.452LeuGlu: 6.452 ± 0.019
3.702LeuPhe: 3.702 ± 0.011
5.039LeuGly: 5.039 ± 0.013
2.74LeuHis: 2.74 ± 0.008
4.201LeuIle: 4.201 ± 0.011
5.952LeuLys: 5.952 ± 0.014
10.268LeuLeu: 10.268 ± 0.027
2.168LeuMet: 2.168 ± 0.007
3.859LeuAsn: 3.859 ± 0.009
5.079LeuPro: 5.079 ± 0.014
5.429LeuGln: 5.429 ± 0.016
5.612LeuArg: 5.612 ± 0.012
8.341LeuSer: 8.341 ± 0.019
5.549LeuThr: 5.549 ± 0.013
5.5LeuVal: 5.5 ± 0.013
1.103LeuTrp: 1.103 ± 0.005
2.806LeuTyr: 2.806 ± 0.009
0.0LeuXaa: 0.0 ± 0.0
Met
1.818MetAla: 1.818 ± 0.006
0.526MetCys: 0.526 ± 0.003
1.403MetAsp: 1.403 ± 0.006
1.933MetGlu: 1.933 ± 0.006
0.922MetPhe: 0.922 ± 0.005
1.359MetGly: 1.359 ± 0.006
0.483MetHis: 0.483 ± 0.004
0.968MetIle: 0.968 ± 0.005
1.543MetLys: 1.543 ± 0.006
2.119MetLeu: 2.119 ± 0.007
0.695MetMet: 0.695 ± 0.004
0.958MetAsn: 0.958 ± 0.004
1.044MetPro: 1.044 ± 0.006
0.961MetGln: 0.961 ± 0.005
1.163MetArg: 1.163 ± 0.005
1.81MetSer: 1.81 ± 0.006
1.283MetThr: 1.283 ± 0.006
1.588MetVal: 1.588 ± 0.006
0.28MetTrp: 0.28 ± 0.002
0.686MetTyr: 0.686 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.206AsnAla: 2.206 ± 0.008
0.935AsnCys: 0.935 ± 0.005
1.838AsnAsp: 1.838 ± 0.007
2.178AsnGlu: 2.178 ± 0.008
1.554AsnPhe: 1.554 ± 0.006
2.76AsnGly: 2.76 ± 0.013
1.021AsnHis: 1.021 ± 0.005
2.411AsnIle: 2.411 ± 0.01
2.374AsnLys: 2.374 ± 0.009
3.885AsnLeu: 3.885 ± 0.011
1.075AsnMet: 1.075 ± 0.006
1.96AsnAsn: 1.96 ± 0.008
2.292AsnPro: 2.292 ± 0.008
1.805AsnGln: 1.805 ± 0.007
2.038AsnArg: 2.038 ± 0.007
3.267AsnSer: 3.267 ± 0.009
2.402AsnThr: 2.402 ± 0.01
2.348AsnVal: 2.348 ± 0.008
0.479AsnTrp: 0.479 ± 0.003
1.249AsnTyr: 1.249 ± 0.005
0.0AsnXaa: 0.0 ± 0.0
Pro
3.706ProAla: 3.706 ± 0.012
1.026ProCys: 1.026 ± 0.008
2.817ProAsp: 2.817 ± 0.008
3.605ProGlu: 3.605 ± 0.01
1.773ProPhe: 1.773 ± 0.007
3.724ProGly: 3.724 ± 0.024
1.415ProHis: 1.415 ± 0.007
1.97ProIle: 1.97 ± 0.007
2.46ProLys: 2.46 ± 0.012
4.63ProLeu: 4.63 ± 0.012
0.959ProMet: 0.959 ± 0.005
1.889ProAsn: 1.889 ± 0.008
4.627ProPro: 4.627 ± 0.024
2.448ProGln: 2.448 ± 0.011
2.494ProArg: 2.494 ± 0.009
5.081ProSer: 5.081 ± 0.023
2.982ProThr: 2.982 ± 0.018
3.641ProVal: 3.641 ± 0.012
0.546ProTrp: 0.546 ± 0.004
1.422ProTyr: 1.422 ± 0.006
0.0ProXaa: 0.0 ± 0.0
Gln
3.045GlnAla: 3.045 ± 0.011
1.061GlnCys: 1.061 ± 0.006
2.264GlnAsp: 2.264 ± 0.007
3.354GlnGlu: 3.354 ± 0.012
1.448GlnPhe: 1.448 ± 0.006
2.616GlnGly: 2.616 ± 0.011
1.328GlnHis: 1.328 ± 0.008
2.069GlnIle: 2.069 ± 0.006
2.686GlnLys: 2.686 ± 0.008
4.537GlnLeu: 4.537 ± 0.013
1.128GlnMet: 1.128 ± 0.005
1.768GlnAsn: 1.768 ± 0.006
2.328GlnPro: 2.328 ± 0.011
3.054GlnGln: 3.054 ± 0.018
2.923GlnArg: 2.923 ± 0.009
3.401GlnSer: 3.401 ± 0.011
2.552GlnThr: 2.552 ± 0.008
2.784GlnVal: 2.784 ± 0.007
0.586GlnTrp: 0.586 ± 0.004
1.301GlnTyr: 1.301 ± 0.006
0.0GlnXaa: 0.0 ± 0.0
Arg
3.293ArgAla: 3.293 ± 0.008
1.279ArgCys: 1.279 ± 0.008
2.859ArgAsp: 2.859 ± 0.008
3.783ArgGlu: 3.783 ± 0.013
2.024ArgPhe: 2.024 ± 0.008
3.261ArgGly: 3.261 ± 0.012
1.478ArgHis: 1.478 ± 0.006
2.493ArgIle: 2.493 ± 0.009
3.53ArgLys: 3.53 ± 0.01
5.302ArgLeu: 5.302 ± 0.014
1.277ArgMet: 1.277 ± 0.005
2.197ArgAsn: 2.197 ± 0.007
2.701ArgPro: 2.701 ± 0.01
2.574ArgGln: 2.574 ± 0.01
3.923ArgArg: 3.923 ± 0.012
4.216ArgSer: 4.216 ± 0.014
2.823ArgThr: 2.823 ± 0.009
3.347ArgVal: 3.347 ± 0.009
0.697ArgTrp: 0.697 ± 0.005
1.582ArgTyr: 1.582 ± 0.006
0.0ArgXaa: 0.0 ± 0.0
Ser
5.305SerAla: 5.305 ± 0.014
2.068SerCys: 2.068 ± 0.01
4.253SerAsp: 4.253 ± 0.012
4.843SerGlu: 4.843 ± 0.023
3.22SerPhe: 3.22 ± 0.009
5.373SerGly: 5.373 ± 0.015
2.206SerHis: 2.206 ± 0.009
3.497SerIle: 3.497 ± 0.01
4.015SerLys: 4.015 ± 0.013
8.18SerLeu: 8.18 ± 0.018
1.753SerMet: 1.753 ± 0.007
3.072SerAsn: 3.072 ± 0.01
5.319SerPro: 5.319 ± 0.019
3.736SerGln: 3.736 ± 0.013
4.227SerArg: 4.227 ± 0.012
9.382SerSer: 9.382 ± 0.03
4.718SerThr: 4.718 ± 0.017
5.468SerVal: 5.468 ± 0.016
1.007SerTrp: 1.007 ± 0.006
2.223SerTyr: 2.223 ± 0.008
0.0SerXaa: 0.0 ± 0.0
Thr
4.059ThrAla: 4.059 ± 0.014
1.434ThrCys: 1.434 ± 0.01
2.989ThrAsp: 2.989 ± 0.009
3.891ThrGlu: 3.891 ± 0.015
2.275ThrPhe: 2.275 ± 0.011
3.721ThrGly: 3.721 ± 0.011
1.499ThrHis: 1.499 ± 0.014
2.628ThrIle: 2.628 ± 0.008
2.799ThrLys: 2.799 ± 0.008
5.599ThrLeu: 5.599 ± 0.013
1.215ThrMet: 1.215 ± 0.005
2.084ThrAsn: 2.084 ± 0.01
3.424ThrPro: 3.424 ± 0.019
2.385ThrGln: 2.385 ± 0.008
2.492ThrArg: 2.492 ± 0.008
4.767ThrSer: 4.767 ± 0.017
3.635ThrThr: 3.635 ± 0.062
4.36ThrVal: 4.36 ± 0.013
0.699ThrTrp: 0.699 ± 0.004
1.55ThrTyr: 1.55 ± 0.007
0.0ThrXaa: 0.0 ± 0.0
Val
4.049ValAla: 4.049 ± 0.01
1.806ValCys: 1.806 ± 0.013
3.252ValAsp: 3.252 ± 0.009
4.017ValGlu: 4.017 ± 0.013
2.859ValPhe: 2.859 ± 0.01
3.438ValGly: 3.438 ± 0.01
1.616ValHis: 1.616 ± 0.007
3.243ValIle: 3.243 ± 0.01
3.805ValLys: 3.805 ± 0.012
6.343ValLeu: 6.343 ± 0.015
1.53ValMet: 1.53 ± 0.006
2.572ValAsn: 2.572 ± 0.008
3.152ValPro: 3.152 ± 0.011
2.747ValGln: 2.747 ± 0.009
3.347ValArg: 3.347 ± 0.009
5.289ValSer: 5.289 ± 0.015
4.163ValThr: 4.163 ± 0.021
4.451ValVal: 4.451 ± 0.013
0.828ValTrp: 0.828 ± 0.005
1.935ValTyr: 1.935 ± 0.007
0.0ValXaa: 0.0 ± 0.0
Trp
0.7TrpAla: 0.7 ± 0.004
0.281TrpCys: 0.281 ± 0.003
0.658TrpAsp: 0.658 ± 0.004
0.776TrpGlu: 0.776 ± 0.005
0.494TrpPhe: 0.494 ± 0.004
0.628TrpGly: 0.628 ± 0.004
0.279TrpHis: 0.279 ± 0.003
0.649TrpIle: 0.649 ± 0.004
0.765TrpLys: 0.765 ± 0.004
1.198TrpLeu: 1.198 ± 0.006
0.366TrpMet: 0.366 ± 0.003
0.562TrpAsn: 0.562 ± 0.004
0.441TrpPro: 0.441 ± 0.004
0.498TrpGln: 0.498 ± 0.003
0.744TrpArg: 0.744 ± 0.004
1.008TrpSer: 1.008 ± 0.006
0.741TrpThr: 0.741 ± 0.005
0.704TrpVal: 0.704 ± 0.004
0.206TrpTrp: 0.206 ± 0.002
0.379TrpTyr: 0.379 ± 0.003
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.477TyrAla: 1.477 ± 0.006
0.784TyrCys: 0.784 ± 0.004
1.44TyrAsp: 1.44 ± 0.006
1.668TyrGlu: 1.668 ± 0.006
1.31TyrPhe: 1.31 ± 0.005
1.724TyrGly: 1.724 ± 0.007
0.828TyrHis: 0.828 ± 0.004
1.747TyrIle: 1.747 ± 0.021
1.621TyrLys: 1.621 ± 0.007
2.816TyrLeu: 2.816 ± 0.008
0.715TyrMet: 0.715 ± 0.005
1.298TyrAsn: 1.298 ± 0.006
1.311TyrPro: 1.311 ± 0.006
1.27TyrGln: 1.27 ± 0.005
1.78TyrArg: 1.78 ± 0.007
2.43TyrSer: 2.43 ± 0.008
1.785TyrThr: 1.785 ± 0.008
1.68TyrVal: 1.68 ± 0.008
0.439TyrTrp: 0.439 ± 0.004
1.05TyrTyr: 1.05 ± 0.005
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 74622 proteins (50247682 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski