Amino acid dipepetide frequency for Betta splendens (Siamese fighting fish)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.258AlaAla: 7.258 ± 0.032
1.27AlaCys: 1.27 ± 0.008
3.39AlaAsp: 3.39 ± 0.013
5.044AlaGlu: 5.044 ± 0.024
2.22AlaPhe: 2.22 ± 0.01
4.565AlaGly: 4.565 ± 0.02
1.589AlaHis: 1.589 ± 0.008
2.454AlaIle: 2.454 ± 0.011
3.496AlaLys: 3.496 ± 0.021
6.669AlaLeu: 6.669 ± 0.028
1.555AlaMet: 1.555 ± 0.009
2.202AlaAsn: 2.202 ± 0.009
3.995AlaPro: 3.995 ± 0.022
3.27AlaGln: 3.27 ± 0.014
3.459AlaArg: 3.459 ± 0.014
6.173AlaSer: 6.173 ± 0.019
3.681AlaThr: 3.681 ± 0.017
4.995AlaVal: 4.995 ± 0.016
0.649AlaTrp: 0.649 ± 0.005
1.411AlaTyr: 1.411 ± 0.008
0.0AlaXaa: 0.0 ± 0.0
Cys
1.174CysAla: 1.174 ± 0.008
0.556CysCys: 0.556 ± 0.006
1.114CysAsp: 1.114 ± 0.011
1.269CysGlu: 1.269 ± 0.011
0.779CysPhe: 0.779 ± 0.006
1.479CysGly: 1.479 ± 0.011
0.605CysHis: 0.605 ± 0.005
0.853CysIle: 0.853 ± 0.007
1.016CysLys: 1.016 ± 0.008
1.983CysLeu: 1.983 ± 0.011
0.402CysMet: 0.402 ± 0.003
0.758CysAsn: 0.758 ± 0.006
1.192CysPro: 1.192 ± 0.012
0.989CysGln: 0.989 ± 0.007
1.246CysArg: 1.246 ± 0.007
2.131CysSer: 2.131 ± 0.013
1.077CysThr: 1.077 ± 0.008
1.458CysVal: 1.458 ± 0.009
0.254CysTrp: 0.254 ± 0.003
0.553CysTyr: 0.553 ± 0.004
0.0CysXaa: 0.0 ± 0.0
Asp
3.364AspAla: 3.364 ± 0.013
1.115AspCys: 1.115 ± 0.01
3.16AspAsp: 3.16 ± 0.016
3.899AspGlu: 3.899 ± 0.017
1.989AspPhe: 1.989 ± 0.008
3.862AspGly: 3.862 ± 0.018
1.183AspHis: 1.183 ± 0.006
2.514AspIle: 2.514 ± 0.014
2.764AspLys: 2.764 ± 0.011
4.975AspLeu: 4.975 ± 0.018
1.276AspMet: 1.276 ± 0.008
1.922AspAsn: 1.922 ± 0.011
2.924AspPro: 2.924 ± 0.013
2.038AspGln: 2.038 ± 0.009
2.872AspArg: 2.872 ± 0.013
4.691AspSer: 4.691 ± 0.016
2.711AspThr: 2.711 ± 0.01
3.478AspVal: 3.478 ± 0.018
0.647AspTrp: 0.647 ± 0.005
1.481AspTyr: 1.481 ± 0.008
0.001AspXaa: 0.001 ± 0.0
Glu
5.192GluAla: 5.192 ± 0.024
1.135GluCys: 1.135 ± 0.009
4.517GluAsp: 4.517 ± 0.016
8.007GluGlu: 8.007 ± 0.042
1.802GluPhe: 1.802 ± 0.008
4.225GluGly: 4.225 ± 0.02
1.521GluHis: 1.521 ± 0.008
2.694GluIle: 2.694 ± 0.012
4.692GluLys: 4.692 ± 0.024
6.358GluLeu: 6.358 ± 0.031
1.722GluMet: 1.722 ± 0.009
2.635GluAsn: 2.635 ± 0.011
3.088GluPro: 3.088 ± 0.017
3.414GluGln: 3.414 ± 0.021
4.591GluArg: 4.591 ± 0.023
4.583GluSer: 4.583 ± 0.017
3.544GluThr: 3.544 ± 0.014
4.372GluVal: 4.372 ± 0.016
0.659GluTrp: 0.659 ± 0.005
1.481GluTyr: 1.481 ± 0.011
0.001GluXaa: 0.001 ± 0.0
Phe
1.8PheAla: 1.8 ± 0.01
0.811PheCys: 0.811 ± 0.006
1.637PheAsp: 1.637 ± 0.008
1.763PheGlu: 1.763 ± 0.01
1.331PhePhe: 1.331 ± 0.009
1.959PheGly: 1.959 ± 0.012
0.944PheHis: 0.944 ± 0.006
1.668PheIle: 1.668 ± 0.01
1.675PheLys: 1.675 ± 0.01
3.365PheLeu: 3.365 ± 0.017
0.738PheMet: 0.738 ± 0.005
1.346PheAsn: 1.346 ± 0.009
1.69PhePro: 1.69 ± 0.009
1.504PheGln: 1.504 ± 0.007
1.759PheArg: 1.759 ± 0.012
3.112PheSer: 3.112 ± 0.015
2.155PheThr: 2.155 ± 0.01
2.018PheVal: 2.018 ± 0.012
0.4PheTrp: 0.4 ± 0.004
1.054PheTyr: 1.054 ± 0.008
0.0PheXaa: 0.0 ± 0.0
Gly
4.445GlyAla: 4.445 ± 0.02
1.183GlyCys: 1.183 ± 0.009
3.364GlyAsp: 3.364 ± 0.014
4.106GlyGlu: 4.106 ± 0.017
2.303GlyPhe: 2.303 ± 0.012
5.477GlyGly: 5.477 ± 0.031
1.709GlyHis: 1.709 ± 0.009
2.332GlyIle: 2.332 ± 0.011
3.387GlyLys: 3.387 ± 0.019
5.466GlyLeu: 5.466 ± 0.021
1.343GlyMet: 1.343 ± 0.01
2.347GlyAsn: 2.347 ± 0.011
3.766GlyPro: 3.766 ± 0.031
2.829GlyGln: 2.829 ± 0.014
3.866GlyArg: 3.866 ± 0.017
6.208GlySer: 6.208 ± 0.021
3.382GlyThr: 3.382 ± 0.015
3.803GlyVal: 3.803 ± 0.015
0.738GlyTrp: 0.738 ± 0.007
1.736GlyTyr: 1.736 ± 0.011
0.001GlyXaa: 0.001 ± 0.0
His
1.43HisAla: 1.43 ± 0.008
0.719HisCys: 0.719 ± 0.006
0.991HisAsp: 0.991 ± 0.006
1.261HisGlu: 1.261 ± 0.007
0.943HisPhe: 0.943 ± 0.006
1.595HisGly: 1.595 ± 0.01
1.145HisHis: 1.145 ± 0.01
1.267HisIle: 1.267 ± 0.007
1.333HisLys: 1.333 ± 0.006
2.695HisLeu: 2.695 ± 0.012
0.628HisMet: 0.628 ± 0.005
1.006HisAsn: 1.006 ± 0.006
1.641HisPro: 1.641 ± 0.01
1.367HisGln: 1.367 ± 0.008
1.746HisArg: 1.746 ± 0.008
2.608HisSer: 2.608 ± 0.011
1.581HisThr: 1.581 ± 0.008
1.529HisVal: 1.529 ± 0.008
0.312HisTrp: 0.312 ± 0.004
0.831HisTyr: 0.831 ± 0.005
0.0HisXaa: 0.0 ± 0.0
Ile
2.333IleAla: 2.333 ± 0.01
0.918IleCys: 0.918 ± 0.007
1.966IleAsp: 1.966 ± 0.011
2.296IleGlu: 2.296 ± 0.013
1.498IlePhe: 1.498 ± 0.01
2.093IleGly: 2.093 ± 0.01
1.166IleHis: 1.166 ± 0.007
2.116IleIle: 2.116 ± 0.012
2.424IleLys: 2.424 ± 0.015
3.802IleLeu: 3.802 ± 0.016
0.95IleMet: 0.95 ± 0.007
1.831IleAsn: 1.831 ± 0.011
2.266IlePro: 2.266 ± 0.01
2.153IleGln: 2.153 ± 0.01
2.322IleArg: 2.322 ± 0.009
3.564IleSer: 3.564 ± 0.013
2.671IleThr: 2.671 ± 0.015
2.363IleVal: 2.363 ± 0.013
0.394IleTrp: 0.394 ± 0.004
1.205IleTyr: 1.205 ± 0.007
0.0IleXaa: 0.0 ± 0.0
Lys
3.96LysAla: 3.96 ± 0.019
0.936LysCys: 0.936 ± 0.008
3.296LysAsp: 3.296 ± 0.017
4.759LysGlu: 4.759 ± 0.024
1.44LysPhe: 1.44 ± 0.01
3.284LysGly: 3.284 ± 0.025
1.406LysHis: 1.406 ± 0.008
2.246LysIle: 2.246 ± 0.015
4.299LysLys: 4.299 ± 0.025
4.845LysLeu: 4.845 ± 0.02
1.441LysMet: 1.441 ± 0.013
2.127LysAsn: 2.127 ± 0.01
2.916LysPro: 2.916 ± 0.018
2.641LysGln: 2.641 ± 0.015
3.371LysArg: 3.371 ± 0.014
3.898LysSer: 3.898 ± 0.015
3.161LysThr: 3.161 ± 0.015
3.462LysVal: 3.462 ± 0.015
0.536LysTrp: 0.536 ± 0.004
1.385LysTyr: 1.385 ± 0.013
0.001LysXaa: 0.001 ± 0.0
Leu
5.966LeuAla: 5.966 ± 0.024
2.11LeuCys: 2.11 ± 0.01
4.928LeuAsp: 4.928 ± 0.016
6.639LeuGlu: 6.639 ± 0.035
2.982LeuPhe: 2.982 ± 0.016
4.938LeuGly: 4.938 ± 0.019
2.803LeuHis: 2.803 ± 0.011
3.497LeuIle: 3.497 ± 0.013
5.449LeuLys: 5.449 ± 0.023
9.855LeuLeu: 9.855 ± 0.041
1.992LeuMet: 1.992 ± 0.01
3.528LeuAsn: 3.528 ± 0.014
5.419LeuPro: 5.419 ± 0.023
5.756LeuGln: 5.756 ± 0.032
5.698LeuArg: 5.698 ± 0.022
8.518LeuSer: 8.518 ± 0.027
5.145LeuThr: 5.145 ± 0.014
5.201LeuVal: 5.201 ± 0.018
0.965LeuTrp: 0.965 ± 0.008
2.392LeuTyr: 2.392 ± 0.01
0.001LeuXaa: 0.001 ± 0.0
Met
1.798MetAla: 1.798 ± 0.008
0.436MetCys: 0.436 ± 0.005
1.415MetAsp: 1.415 ± 0.007
1.984MetGlu: 1.984 ± 0.009
0.752MetPhe: 0.752 ± 0.006
1.348MetGly: 1.348 ± 0.008
0.469MetHis: 0.469 ± 0.003
0.782MetIle: 0.782 ± 0.006
1.422MetLys: 1.422 ± 0.008
1.988MetLeu: 1.988 ± 0.01
0.664MetMet: 0.664 ± 0.006
0.858MetAsn: 0.858 ± 0.007
1.12MetPro: 1.12 ± 0.016
1.007MetGln: 1.007 ± 0.007
1.169MetArg: 1.169 ± 0.006
1.881MetSer: 1.881 ± 0.007
1.226MetThr: 1.226 ± 0.007
1.432MetVal: 1.432 ± 0.007
0.239MetTrp: 0.239 ± 0.003
0.567MetTyr: 0.567 ± 0.005
0.0MetXaa: 0.0 ± 0.0
Asn
2.207AsnAla: 2.207 ± 0.009
0.769AsnCys: 0.769 ± 0.006
1.661AsnAsp: 1.661 ± 0.009
1.988AsnGlu: 1.988 ± 0.01
1.263AsnPhe: 1.263 ± 0.008
2.76AsnGly: 2.76 ± 0.015
0.99AsnHis: 0.99 ± 0.006
1.915AsnIle: 1.915 ± 0.01
2.151AsnLys: 2.151 ± 0.011
3.511AsnLeu: 3.511 ± 0.014
0.996AsnMet: 0.996 ± 0.006
1.7AsnAsn: 1.7 ± 0.011
2.136AsnPro: 2.136 ± 0.01
1.77AsnGln: 1.77 ± 0.009
1.975AsnArg: 1.975 ± 0.009
3.175AsnSer: 3.175 ± 0.012
2.205AsnThr: 2.205 ± 0.011
2.262AsnVal: 2.262 ± 0.011
0.396AsnTrp: 0.396 ± 0.004
1.054AsnTyr: 1.054 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
4.641ProAla: 4.641 ± 0.02
1.072ProCys: 1.072 ± 0.009
3.031ProAsp: 3.031 ± 0.015
3.847ProGlu: 3.847 ± 0.016
1.624ProPhe: 1.624 ± 0.01
4.531ProGly: 4.531 ± 0.04
1.641ProHis: 1.641 ± 0.011
1.806ProIle: 1.806 ± 0.011
2.576ProLys: 2.576 ± 0.022
4.922ProLeu: 4.922 ± 0.018
1.023ProMet: 1.023 ± 0.008
1.871ProAsn: 1.871 ± 0.01
6.139ProPro: 6.139 ± 0.041
2.858ProGln: 2.858 ± 0.017
2.947ProArg: 2.947 ± 0.014
6.053ProSer: 6.053 ± 0.023
3.291ProThr: 3.291 ± 0.019
3.99ProVal: 3.99 ± 0.016
0.498ProTrp: 0.498 ± 0.004
1.341ProTyr: 1.341 ± 0.008
0.001ProXaa: 0.001 ± 0.0
Gln
3.536GlnAla: 3.536 ± 0.018
0.962GlnCys: 0.962 ± 0.008
2.438GlnAsp: 2.438 ± 0.01
3.702GlnGlu: 3.702 ± 0.02
1.296GlnPhe: 1.296 ± 0.007
2.865GlnGly: 2.865 ± 0.014
1.502GlnHis: 1.502 ± 0.008
1.885GlnIle: 1.885 ± 0.009
2.607GlnLys: 2.607 ± 0.015
4.737GlnLeu: 4.737 ± 0.024
1.159GlnMet: 1.159 ± 0.007
1.803GlnAsn: 1.803 ± 0.009
2.794GlnPro: 2.794 ± 0.016
3.827GlnGln: 3.827 ± 0.032
3.368GlnArg: 3.368 ± 0.017
3.806GlnSer: 3.806 ± 0.016
2.766GlnThr: 2.766 ± 0.013
2.826GlnVal: 2.826 ± 0.012
0.562GlnTrp: 0.562 ± 0.005
1.219GlnTyr: 1.219 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.738ArgAla: 3.738 ± 0.013
1.264ArgCys: 1.264 ± 0.011
3.056ArgAsp: 3.056 ± 0.012
4.147ArgGlu: 4.147 ± 0.021
1.848ArgPhe: 1.848 ± 0.009
3.643ArgGly: 3.643 ± 0.019
1.6ArgHis: 1.6 ± 0.008
2.259ArgIle: 2.259 ± 0.009
3.505ArgLys: 3.505 ± 0.014
5.478ArgLeu: 5.478 ± 0.022
1.306ArgMet: 1.306 ± 0.007
2.046ArgAsn: 2.046 ± 0.008
3.162ArgPro: 3.162 ± 0.014
2.87ArgGln: 2.87 ± 0.015
4.745ArgArg: 4.745 ± 0.021
5.091ArgSer: 5.091 ± 0.023
3.108ArgThr: 3.108 ± 0.012
3.391ArgVal: 3.391 ± 0.014
0.68ArgTrp: 0.68 ± 0.006
1.463ArgTyr: 1.463 ± 0.007
0.0ArgXaa: 0.0 ± 0.0
Ser
6.15SerAla: 6.15 ± 0.019
1.93SerCys: 1.93 ± 0.013
4.708SerAsp: 4.708 ± 0.015
5.164SerGlu: 5.164 ± 0.018
2.889SerPhe: 2.889 ± 0.012
5.852SerGly: 5.852 ± 0.019
2.345SerHis: 2.345 ± 0.01
3.27SerIle: 3.27 ± 0.01
4.116SerLys: 4.116 ± 0.017
8.455SerLeu: 8.455 ± 0.028
1.808SerMet: 1.808 ± 0.008
3.009SerAsn: 3.009 ± 0.012
6.613SerPro: 6.613 ± 0.033
4.159SerGln: 4.159 ± 0.018
4.888SerArg: 4.888 ± 0.022
11.325SerSer: 11.325 ± 0.047
5.175SerThr: 5.175 ± 0.02
5.73SerVal: 5.73 ± 0.018
0.977SerTrp: 0.977 ± 0.007
2.077SerTyr: 2.077 ± 0.009
0.001SerXaa: 0.001 ± 0.0
Thr
4.171ThrAla: 4.171 ± 0.015
1.293ThrCys: 1.293 ± 0.012
2.973ThrAsp: 2.973 ± 0.011
3.874ThrGlu: 3.874 ± 0.017
1.959ThrPhe: 1.959 ± 0.01
3.787ThrGly: 3.787 ± 0.019
1.399ThrHis: 1.399 ± 0.008
2.254ThrIle: 2.254 ± 0.013
2.752ThrLys: 2.752 ± 0.015
5.288ThrLeu: 5.288 ± 0.016
1.198ThrMet: 1.198 ± 0.006
1.884ThrAsn: 1.884 ± 0.01
3.834ThrPro: 3.834 ± 0.021
2.537ThrGln: 2.537 ± 0.012
2.63ThrArg: 2.63 ± 0.011
5.127ThrSer: 5.127 ± 0.023
3.788ThrThr: 3.788 ± 0.065
4.191ThrVal: 4.191 ± 0.018
0.642ThrTrp: 0.642 ± 0.005
1.367ThrTyr: 1.367 ± 0.009
0.0ThrXaa: 0.0 ± 0.0
Val
4.215ValAla: 4.215 ± 0.014
1.551ValCys: 1.551 ± 0.009
3.293ValAsp: 3.293 ± 0.015
4.199ValGlu: 4.199 ± 0.019
2.388ValPhe: 2.388 ± 0.014
3.433ValGly: 3.433 ± 0.013
1.595ValHis: 1.595 ± 0.009
2.755ValIle: 2.755 ± 0.014
3.651ValLys: 3.651 ± 0.017
5.986ValLeu: 5.986 ± 0.019
1.434ValMet: 1.434 ± 0.008
2.434ValAsn: 2.434 ± 0.011
3.404ValPro: 3.404 ± 0.013
2.986ValGln: 2.986 ± 0.013
3.352ValArg: 3.352 ± 0.012
5.493ValSer: 5.493 ± 0.017
4.062ValThr: 4.062 ± 0.023
4.305ValVal: 4.305 ± 0.017
0.724ValTrp: 0.724 ± 0.005
1.674ValTyr: 1.674 ± 0.008
0.001ValXaa: 0.001 ± 0.0
Trp
0.636TrpAla: 0.636 ± 0.005
0.222TrpCys: 0.222 ± 0.003
0.6TrpAsp: 0.6 ± 0.005
0.732TrpGlu: 0.732 ± 0.005
0.414TrpPhe: 0.414 ± 0.004
0.56TrpGly: 0.56 ± 0.007
0.262TrpHis: 0.262 ± 0.003
0.49TrpIle: 0.49 ± 0.005
0.64TrpLys: 0.64 ± 0.005
1.096TrpLeu: 1.096 ± 0.008
0.322TrpMet: 0.322 ± 0.003
0.47TrpAsn: 0.47 ± 0.004
0.424TrpPro: 0.424 ± 0.005
0.468TrpGln: 0.468 ± 0.005
0.744TrpArg: 0.744 ± 0.005
0.92TrpSer: 0.92 ± 0.007
0.7TrpThr: 0.7 ± 0.006
0.627TrpVal: 0.627 ± 0.005
0.168TrpTrp: 0.168 ± 0.002
0.299TrpTyr: 0.299 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.332TyrAla: 1.332 ± 0.007
0.636TyrCys: 0.636 ± 0.005
1.325TyrAsp: 1.325 ± 0.009
1.553TyrGlu: 1.553 ± 0.008
1.009TyrPhe: 1.009 ± 0.007
1.536TyrGly: 1.536 ± 0.009
0.754TyrHis: 0.754 ± 0.006
1.275TyrIle: 1.275 ± 0.009
1.455TyrLys: 1.455 ± 0.021
2.306TyrLeu: 2.306 ± 0.009
0.599TyrMet: 0.599 ± 0.004
1.106TyrAsn: 1.106 ± 0.007
1.178TyrPro: 1.178 ± 0.008
1.214TyrGln: 1.214 ± 0.007
1.654TyrArg: 1.654 ± 0.011
2.283TyrSer: 2.283 ± 0.011
1.52TyrThr: 1.52 ± 0.009
1.501TyrVal: 1.501 ± 0.01
0.355TyrTrp: 0.355 ± 0.005
0.891TyrTyr: 0.891 ± 0.007
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41292 proteins (32944339 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski