Amino acid dipepetide frequency for Lingula unguis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.287AlaAla: 5.287 ± 0.025
1.317AlaCys: 1.317 ± 0.014
3.392AlaAsp: 3.392 ± 0.015
4.288AlaGlu: 4.288 ± 0.025
2.339AlaPhe: 2.339 ± 0.014
4.049AlaGly: 4.049 ± 0.028
1.301AlaHis: 1.301 ± 0.01
3.129AlaIle: 3.129 ± 0.015
3.849AlaLys: 3.849 ± 0.019
5.459AlaLeu: 5.459 ± 0.027
1.642AlaMet: 1.642 ± 0.01
2.513AlaAsn: 2.513 ± 0.014
2.986AlaPro: 2.986 ± 0.018
2.644AlaGln: 2.644 ± 0.016
2.944AlaArg: 2.944 ± 0.015
5.061AlaSer: 5.061 ± 0.02
3.965AlaThr: 3.965 ± 0.024
4.686AlaVal: 4.686 ± 0.019
0.672AlaTrp: 0.672 ± 0.007
1.746AlaTyr: 1.746 ± 0.012
0.001AlaXaa: 0.001 ± 0.0
Cys
1.263CysAla: 1.263 ± 0.015
0.561CysCys: 0.561 ± 0.014
1.362CysAsp: 1.362 ± 0.014
1.275CysGlu: 1.275 ± 0.016
0.751CysPhe: 0.751 ± 0.006
1.552CysGly: 1.552 ± 0.018
0.551CysHis: 0.551 ± 0.006
1.081CysIle: 1.081 ± 0.011
1.264CysLys: 1.264 ± 0.015
1.814CysLeu: 1.814 ± 0.015
0.491CysMet: 0.491 ± 0.006
1.096CysAsn: 1.096 ± 0.016
1.334CysPro: 1.334 ± 0.031
1.121CysGln: 1.121 ± 0.018
1.15CysArg: 1.15 ± 0.012
1.818CysSer: 1.818 ± 0.019
1.434CysThr: 1.434 ± 0.025
1.434CysVal: 1.434 ± 0.014
0.221CysTrp: 0.221 ± 0.003
0.63CysTyr: 0.63 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
3.229AspAla: 3.229 ± 0.017
1.18AspCys: 1.18 ± 0.013
3.791AspAsp: 3.791 ± 0.027
4.029AspGlu: 4.029 ± 0.022
2.24AspPhe: 2.24 ± 0.013
3.938AspGly: 3.938 ± 0.027
1.334AspHis: 1.334 ± 0.009
3.449AspIle: 3.449 ± 0.013
3.336AspLys: 3.336 ± 0.017
4.865AspLeu: 4.865 ± 0.021
1.453AspMet: 1.453 ± 0.009
2.477AspAsn: 2.477 ± 0.014
2.701AspPro: 2.701 ± 0.014
2.183AspGln: 2.183 ± 0.013
2.679AspArg: 2.679 ± 0.016
4.496AspSer: 4.496 ± 0.023
3.239AspThr: 3.239 ± 0.016
3.763AspVal: 3.763 ± 0.017
0.693AspTrp: 0.693 ± 0.008
1.781AspTyr: 1.781 ± 0.012
0.001AspXaa: 0.001 ± 0.0
Glu
4.301GluAla: 4.301 ± 0.03
1.298GluCys: 1.298 ± 0.018
4.518GluAsp: 4.518 ± 0.024
7.162GluGlu: 7.162 ± 0.064
2.124GluPhe: 2.124 ± 0.013
3.657GluGly: 3.657 ± 0.019
1.513GluHis: 1.513 ± 0.011
3.373GluIle: 3.373 ± 0.021
5.396GluLys: 5.396 ± 0.035
5.402GluLeu: 5.402 ± 0.029
1.812GluMet: 1.812 ± 0.011
3.404GluAsn: 3.404 ± 0.018
2.46GluPro: 2.46 ± 0.025
2.946GluGln: 2.946 ± 0.023
3.623GluArg: 3.623 ± 0.022
4.336GluSer: 4.336 ± 0.02
3.842GluThr: 3.842 ± 0.029
4.098GluVal: 4.098 ± 0.025
0.709GluTrp: 0.709 ± 0.008
1.85GluTyr: 1.85 ± 0.012
0.001GluXaa: 0.001 ± 0.0
Phe
2.063PheAla: 2.063 ± 0.012
0.843PheCys: 0.843 ± 0.009
2.103PheAsp: 2.103 ± 0.012
2.087PheGlu: 2.087 ± 0.012
1.486PhePhe: 1.486 ± 0.011
2.386PheGly: 2.386 ± 0.016
0.988PheHis: 0.988 ± 0.008
1.959PheIle: 1.959 ± 0.012
2.085PheLys: 2.085 ± 0.011
3.337PheLeu: 3.337 ± 0.017
0.859PheMet: 0.859 ± 0.006
1.626PheAsn: 1.626 ± 0.011
1.694PhePro: 1.694 ± 0.012
1.58PheGln: 1.58 ± 0.009
1.796PheArg: 1.796 ± 0.015
2.889PheSer: 2.889 ± 0.013
2.305PheThr: 2.305 ± 0.013
2.328PheVal: 2.328 ± 0.014
0.474PheTrp: 0.474 ± 0.007
1.233PheTyr: 1.233 ± 0.01
0.001PheXaa: 0.001 ± 0.0
Gly
3.948GlyAla: 3.948 ± 0.029
1.289GlyCys: 1.289 ± 0.015
3.629GlyAsp: 3.629 ± 0.018
3.762GlyGlu: 3.762 ± 0.018
2.441GlyPhe: 2.441 ± 0.015
5.173GlyGly: 5.173 ± 0.051
1.71GlyHis: 1.71 ± 0.012
3.092GlyIle: 3.092 ± 0.017
3.931GlyLys: 3.931 ± 0.018
4.765GlyLeu: 4.765 ± 0.021
1.548GlyMet: 1.548 ± 0.012
3.019GlyAsn: 3.019 ± 0.021
2.826GlyPro: 2.826 ± 0.039
3.05GlyGln: 3.05 ± 0.025
3.24GlyArg: 3.24 ± 0.019
5.511GlySer: 5.511 ± 0.032
4.024GlyThr: 4.024 ± 0.028
3.902GlyVal: 3.902 ± 0.02
0.758GlyTrp: 0.758 ± 0.007
2.415GlyTyr: 2.415 ± 0.023
0.002GlyXaa: 0.002 ± 0.0
His
1.356HisAla: 1.356 ± 0.011
0.627HisCys: 0.627 ± 0.008
1.161HisAsp: 1.161 ± 0.007
1.378HisGlu: 1.378 ± 0.013
1.025HisPhe: 1.025 ± 0.008
1.584HisGly: 1.584 ± 0.012
0.936HisHis: 0.936 ± 0.009
1.336HisIle: 1.336 ± 0.009
1.394HisLys: 1.394 ± 0.01
2.302HisLeu: 2.302 ± 0.014
0.638HisMet: 0.638 ± 0.007
1.049HisAsn: 1.049 ± 0.009
1.33HisPro: 1.33 ± 0.01
1.149HisGln: 1.149 ± 0.01
1.352HisArg: 1.352 ± 0.009
2.025HisSer: 2.025 ± 0.012
1.414HisThr: 1.414 ± 0.011
1.63HisVal: 1.63 ± 0.011
0.316HisTrp: 0.316 ± 0.005
0.819HisTyr: 0.819 ± 0.007
0.001HisXaa: 0.001 ± 0.0
Ile
3.113IleAla: 3.113 ± 0.014
1.112IleCys: 1.112 ± 0.011
2.758IleAsp: 2.758 ± 0.015
2.959IleGlu: 2.959 ± 0.017
1.931IlePhe: 1.931 ± 0.013
2.84IleGly: 2.84 ± 0.017
1.272IleHis: 1.272 ± 0.009
2.512IleIle: 2.512 ± 0.015
2.97IleLys: 2.97 ± 0.015
4.382IleLeu: 4.382 ± 0.02
1.1IleMet: 1.1 ± 0.008
2.192IleAsn: 2.192 ± 0.012
2.768IlePro: 2.768 ± 0.016
2.308IleGln: 2.308 ± 0.012
2.404IleArg: 2.404 ± 0.012
3.832IleSer: 3.832 ± 0.016
2.989IleThr: 2.989 ± 0.016
3.047IleVal: 3.047 ± 0.016
0.542IleTrp: 0.542 ± 0.006
1.508IleTyr: 1.508 ± 0.011
0.001IleXaa: 0.001 ± 0.0
Lys
3.887LysAla: 3.887 ± 0.018
1.332LysCys: 1.332 ± 0.015
3.727LysAsp: 3.727 ± 0.02
5.22LysGlu: 5.22 ± 0.034
1.972LysPhe: 1.972 ± 0.011
3.389LysGly: 3.389 ± 0.021
1.567LysHis: 1.567 ± 0.011
3.056LysIle: 3.056 ± 0.016
5.434LysLys: 5.434 ± 0.036
5.318LysLeu: 5.318 ± 0.028
1.663LysMet: 1.663 ± 0.01
2.847LysAsn: 2.847 ± 0.016
3.09LysPro: 3.09 ± 0.022
2.896LysGln: 2.896 ± 0.017
3.615LysArg: 3.615 ± 0.019
4.501LysSer: 4.501 ± 0.022
3.832LysThr: 3.832 ± 0.019
3.783LysVal: 3.783 ± 0.018
0.701LysTrp: 0.701 ± 0.007
1.944LysTyr: 1.944 ± 0.012
0.002LysXaa: 0.002 ± 0.0
Leu
5.405LeuAla: 5.405 ± 0.023
1.763LeuCys: 1.763 ± 0.012
4.725LeuAsp: 4.725 ± 0.021
5.856LeuGlu: 5.856 ± 0.035
2.977LeuPhe: 2.977 ± 0.016
4.705LeuGly: 4.705 ± 0.021
2.269LeuHis: 2.269 ± 0.014
3.656LeuIle: 3.656 ± 0.016
5.782LeuLys: 5.782 ± 0.03
7.763LeuLeu: 7.763 ± 0.035
1.973LeuMet: 1.973 ± 0.012
3.617LeuAsn: 3.617 ± 0.017
4.537LeuPro: 4.537 ± 0.021
4.641LeuGln: 4.641 ± 0.026
4.452LeuArg: 4.452 ± 0.02
6.656LeuSer: 6.656 ± 0.024
4.981LeuThr: 4.981 ± 0.019
5.093LeuVal: 5.093 ± 0.021
0.901LeuTrp: 0.901 ± 0.009
2.455LeuTyr: 2.455 ± 0.016
0.002LeuXaa: 0.002 ± 0.0
Met
1.982MetAla: 1.982 ± 0.01
0.498MetCys: 0.498 ± 0.006
1.4MetAsp: 1.4 ± 0.009
1.834MetGlu: 1.834 ± 0.012
0.895MetPhe: 0.895 ± 0.007
1.416MetGly: 1.416 ± 0.013
0.524MetHis: 0.524 ± 0.005
1.018MetIle: 1.018 ± 0.008
1.688MetLys: 1.688 ± 0.011
2.015MetLeu: 2.015 ± 0.012
0.715MetMet: 0.715 ± 0.008
1.064MetAsn: 1.064 ± 0.008
1.17MetPro: 1.17 ± 0.01
1.061MetGln: 1.061 ± 0.008
1.139MetArg: 1.139 ± 0.009
1.946MetSer: 1.946 ± 0.011
1.474MetThr: 1.474 ± 0.01
1.514MetVal: 1.514 ± 0.01
0.264MetTrp: 0.264 ± 0.003
0.739MetTyr: 0.739 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.691AsnAla: 2.691 ± 0.014
1.069AsnCys: 1.069 ± 0.013
2.246AsnAsp: 2.246 ± 0.015
2.604AsnGlu: 2.604 ± 0.015
1.679AsnPhe: 1.679 ± 0.011
3.334AsnGly: 3.334 ± 0.025
1.054AsnHis: 1.054 ± 0.006
2.681AsnIle: 2.681 ± 0.014
2.694AsnLys: 2.694 ± 0.014
3.817AsnLeu: 3.817 ± 0.018
1.193AsnMet: 1.193 ± 0.008
2.347AsnAsn: 2.347 ± 0.016
2.331AsnPro: 2.331 ± 0.015
1.885AsnGln: 1.885 ± 0.013
2.1AsnArg: 2.1 ± 0.012
3.5AsnSer: 3.5 ± 0.02
2.814AsnThr: 2.814 ± 0.017
2.877AsnVal: 2.877 ± 0.012
0.559AsnTrp: 0.559 ± 0.007
1.376AsnTyr: 1.376 ± 0.01
0.001AsnXaa: 0.001 ± 0.0
Pro
3.375ProAla: 3.375 ± 0.02
1.117ProCys: 1.117 ± 0.023
2.82ProAsp: 2.82 ± 0.018
3.366ProGlu: 3.366 ± 0.025
1.624ProPhe: 1.624 ± 0.011
3.786ProGly: 3.786 ± 0.037
1.232ProHis: 1.232 ± 0.009
1.978ProIle: 1.978 ± 0.012
2.963ProLys: 2.963 ± 0.018
3.792ProLeu: 3.792 ± 0.016
1.065ProMet: 1.065 ± 0.008
2.178ProAsn: 2.178 ± 0.013
4.457ProPro: 4.457 ± 0.036
2.358ProGln: 2.358 ± 0.02
2.507ProArg: 2.507 ± 0.018
4.605ProSer: 4.605 ± 0.026
3.286ProThr: 3.286 ± 0.03
3.672ProVal: 3.672 ± 0.019
0.553ProTrp: 0.553 ± 0.007
1.499ProTyr: 1.499 ± 0.011
0.002ProXaa: 0.002 ± 0.0
Gln
2.887GlnAla: 2.887 ± 0.015
1.105GlnCys: 1.105 ± 0.019
2.381GlnAsp: 2.381 ± 0.015
3.358GlnGlu: 3.358 ± 0.027
1.537GlnPhe: 1.537 ± 0.013
2.924GlnGly: 2.924 ± 0.025
1.267GlnHis: 1.267 ± 0.01
2.019GlnIle: 2.019 ± 0.012
2.896GlnLys: 2.896 ± 0.016
4.074GlnLeu: 4.074 ± 0.023
1.18GlnMet: 1.18 ± 0.011
2.17GlnAsn: 2.17 ± 0.015
2.29GlnPro: 2.29 ± 0.017
3.662GlnGln: 3.662 ± 0.052
2.51GlnArg: 2.51 ± 0.016
3.296GlnSer: 3.296 ± 0.016
2.623GlnThr: 2.623 ± 0.014
2.691GlnVal: 2.691 ± 0.013
0.615GlnTrp: 0.615 ± 0.014
1.475GlnTyr: 1.475 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
2.836ArgAla: 2.836 ± 0.014
1.086ArgCys: 1.086 ± 0.012
2.866ArgAsp: 2.866 ± 0.016
3.511ArgGlu: 3.511 ± 0.019
1.744ArgPhe: 1.744 ± 0.011
3.075ArgGly: 3.075 ± 0.024
1.426ArgHis: 1.426 ± 0.009
2.337ArgIle: 2.337 ± 0.013
3.681ArgLys: 3.681 ± 0.017
4.343ArgLeu: 4.343 ± 0.021
1.219ArgMet: 1.219 ± 0.009
2.3ArgAsn: 2.3 ± 0.013
2.558ArgPro: 2.558 ± 0.014
2.538ArgGln: 2.538 ± 0.015
3.604ArgArg: 3.604 ± 0.022
3.892ArgSer: 3.892 ± 0.024
2.868ArgThr: 2.868 ± 0.019
2.879ArgVal: 2.879 ± 0.014
0.618ArgTrp: 0.618 ± 0.007
1.62ArgTyr: 1.62 ± 0.012
0.001ArgXaa: 0.001 ± 0.0
Ser
4.994SerAla: 4.994 ± 0.02
1.659SerCys: 1.659 ± 0.015
4.546SerAsp: 4.546 ± 0.021
4.751SerGlu: 4.751 ± 0.028
2.811SerPhe: 2.811 ± 0.013
5.459SerGly: 5.459 ± 0.03
1.901SerHis: 1.901 ± 0.012
3.444SerIle: 3.444 ± 0.016
4.553SerLys: 4.553 ± 0.021
6.552SerLeu: 6.552 ± 0.025
1.81SerMet: 1.81 ± 0.012
3.436SerAsn: 3.436 ± 0.018
4.87SerPro: 4.87 ± 0.035
3.707SerGln: 3.707 ± 0.022
4.024SerArg: 4.024 ± 0.022
8.603SerSer: 8.603 ± 0.05
5.226SerThr: 5.226 ± 0.038
5.023SerVal: 5.023 ± 0.02
0.883SerTrp: 0.883 ± 0.009
2.235SerTyr: 2.235 ± 0.013
0.002SerXaa: 0.002 ± 0.0
Thr
4.209ThrAla: 4.209 ± 0.022
1.757ThrCys: 1.757 ± 0.027
3.355ThrAsp: 3.355 ± 0.024
3.912ThrGlu: 3.912 ± 0.025
2.257ThrPhe: 2.257 ± 0.012
4.371ThrGly: 4.371 ± 0.038
1.287ThrHis: 1.287 ± 0.009
2.815ThrIle: 2.815 ± 0.016
3.338ThrLys: 3.338 ± 0.016
4.98ThrLeu: 4.98 ± 0.02
1.338ThrMet: 1.338 ± 0.009
2.551ThrAsn: 2.551 ± 0.017
3.798ThrPro: 3.798 ± 0.031
2.456ThrGln: 2.456 ± 0.019
2.662ThrArg: 2.662 ± 0.016
5.433ThrSer: 5.433 ± 0.035
4.985ThrThr: 4.985 ± 0.09
4.406ThrVal: 4.406 ± 0.021
0.716ThrTrp: 0.716 ± 0.007
1.832ThrTyr: 1.832 ± 0.013
0.001ThrXaa: 0.001 ± 0.0
Val
4.162ValAla: 4.162 ± 0.016
1.53ValCys: 1.53 ± 0.016
3.675ValAsp: 3.675 ± 0.016
4.034ValGlu: 4.034 ± 0.02
2.514ValPhe: 2.514 ± 0.014
3.442ValGly: 3.442 ± 0.018
1.552ValHis: 1.552 ± 0.01
3.376ValIle: 3.376 ± 0.018
3.975ValLys: 3.975 ± 0.016
5.534ValLeu: 5.534 ± 0.025
1.555ValMet: 1.555 ± 0.009
2.892ValAsn: 2.892 ± 0.015
3.259ValPro: 3.259 ± 0.018
2.898ValGln: 2.898 ± 0.015
2.912ValArg: 2.912 ± 0.014
4.91ValSer: 4.91 ± 0.017
4.47ValThr: 4.47 ± 0.024
4.359ValVal: 4.359 ± 0.021
0.71ValTrp: 0.71 ± 0.006
1.982ValTyr: 1.982 ± 0.011
0.001ValXaa: 0.001 ± 0.0
Trp
0.621TrpAla: 0.621 ± 0.007
0.252TrpCys: 0.252 ± 0.004
0.618TrpAsp: 0.618 ± 0.006
0.68TrpGlu: 0.68 ± 0.006
0.456TrpPhe: 0.456 ± 0.005
0.799TrpGly: 0.799 ± 0.016
0.279TrpHis: 0.279 ± 0.005
0.601TrpIle: 0.601 ± 0.008
0.768TrpLys: 0.768 ± 0.007
1.038TrpLeu: 1.038 ± 0.01
0.329TrpMet: 0.329 ± 0.005
0.565TrpAsn: 0.565 ± 0.007
0.428TrpPro: 0.428 ± 0.005
0.497TrpGln: 0.497 ± 0.006
0.622TrpArg: 0.622 ± 0.006
0.899TrpSer: 0.899 ± 0.011
0.8TrpThr: 0.8 ± 0.012
0.663TrpVal: 0.663 ± 0.007
0.193TrpTrp: 0.193 ± 0.003
0.388TrpTyr: 0.388 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.658TyrAla: 1.658 ± 0.01
0.811TyrCys: 0.811 ± 0.016
1.783TyrAsp: 1.783 ± 0.012
1.758TyrGlu: 1.758 ± 0.011
1.314TyrPhe: 1.314 ± 0.009
2.156TyrGly: 2.156 ± 0.018
0.858TyrHis: 0.858 ± 0.006
1.627TyrIle: 1.627 ± 0.011
1.777TyrLys: 1.777 ± 0.012
2.654TyrLeu: 2.654 ± 0.015
0.771TyrMet: 0.771 ± 0.007
1.508TyrAsn: 1.508 ± 0.01
1.348TyrPro: 1.348 ± 0.01
1.383TyrGln: 1.383 ± 0.012
1.667TyrArg: 1.667 ± 0.009
2.279TyrSer: 2.279 ± 0.014
1.884TyrThr: 1.884 ± 0.014
1.892TyrVal: 1.892 ± 0.014
0.398TyrTrp: 0.398 ± 0.006
1.206TyrTyr: 1.206 ± 0.013
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 34415 proteins (20595337 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski