Amino acid dipepetide frequency for Mytilus coruscus (Sea mussel)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.892AlaAla: 2.892 ± 0.014
1.165AlaCys: 1.165 ± 0.01
2.809AlaAsp: 2.809 ± 0.013
3.026AlaGlu: 3.026 ± 0.013
2.007AlaPhe: 2.007 ± 0.008
2.457AlaGly: 2.457 ± 0.013
0.985AlaHis: 0.985 ± 0.007
3.123AlaIle: 3.123 ± 0.012
3.324AlaLys: 3.324 ± 0.013
4.077AlaLeu: 4.077 ± 0.015
1.159AlaMet: 1.159 ± 0.007
2.384AlaAsn: 2.384 ± 0.01
1.724AlaPro: 1.724 ± 0.013
1.651AlaGln: 1.651 ± 0.01
2.052AlaArg: 2.052 ± 0.011
3.76AlaSer: 3.76 ± 0.015
2.92AlaThr: 2.92 ± 0.015
3.406AlaVal: 3.406 ± 0.012
0.463AlaTrp: 0.463 ± 0.005
1.521AlaTyr: 1.521 ± 0.009
0.0AlaXaa: 0.0 ± 0.0
Cys
0.943CysAla: 0.943 ± 0.007
0.651CysCys: 0.651 ± 0.007
1.491CysAsp: 1.491 ± 0.01
1.349CysGlu: 1.349 ± 0.011
0.945CysPhe: 0.945 ± 0.006
1.343CysGly: 1.343 ± 0.01
0.622CysHis: 0.622 ± 0.007
1.496CysIle: 1.496 ± 0.009
1.765CysLys: 1.765 ± 0.012
1.975CysLeu: 1.975 ± 0.011
0.494CysMet: 0.494 ± 0.004
1.43CysAsn: 1.43 ± 0.01
1.089CysPro: 1.089 ± 0.013
1.029CysGln: 1.029 ± 0.008
1.19CysArg: 1.19 ± 0.008
2.056CysSer: 2.056 ± 0.013
1.424CysThr: 1.424 ± 0.01
1.332CysVal: 1.332 ± 0.01
0.203CysTrp: 0.203 ± 0.003
0.828CysTyr: 0.828 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
2.495AspAla: 2.495 ± 0.011
1.277AspCys: 1.277 ± 0.009
4.239AspAsp: 4.239 ± 0.024
4.268AspGlu: 4.268 ± 0.017
2.529AspPhe: 2.529 ± 0.011
3.29AspGly: 3.29 ± 0.017
1.415AspHis: 1.415 ± 0.009
4.931AspIle: 4.931 ± 0.02
4.315AspLys: 4.315 ± 0.015
5.024AspLeu: 5.024 ± 0.018
1.427AspMet: 1.427 ± 0.009
3.599AspAsn: 3.599 ± 0.014
2.027AspPro: 2.027 ± 0.011
2.151AspGln: 2.151 ± 0.011
2.606AspArg: 2.606 ± 0.014
4.53AspSer: 4.53 ± 0.018
3.454AspThr: 3.454 ± 0.014
3.784AspVal: 3.784 ± 0.017
0.616AspTrp: 0.616 ± 0.005
1.967AspTyr: 1.967 ± 0.011
0.0AspXaa: 0.0 ± 0.0
Glu
2.972GluAla: 2.972 ± 0.015
1.365GluCys: 1.365 ± 0.012
4.119GluAsp: 4.119 ± 0.017
5.5GluGlu: 5.5 ± 0.025
2.234GluPhe: 2.234 ± 0.01
2.787GluGly: 2.787 ± 0.015
1.576GluHis: 1.576 ± 0.011
4.711GluIle: 4.711 ± 0.017
5.674GluLys: 5.674 ± 0.022
5.213GluLeu: 5.213 ± 0.02
1.66GluMet: 1.66 ± 0.009
4.591GluAsn: 4.591 ± 0.016
1.956GluPro: 1.956 ± 0.012
2.687GluGln: 2.687 ± 0.015
3.105GluArg: 3.105 ± 0.016
4.578GluSer: 4.578 ± 0.017
4.045GluThr: 4.045 ± 0.019
3.613GluVal: 3.613 ± 0.014
0.653GluTrp: 0.653 ± 0.006
2.04GluTyr: 2.04 ± 0.009
0.0GluXaa: 0.0 ± 0.0
Phe
1.742PheAla: 1.742 ± 0.009
0.988PheCys: 0.988 ± 0.007
2.476PheAsp: 2.476 ± 0.011
2.348PheGlu: 2.348 ± 0.011
1.448PhePhe: 1.448 ± 0.01
2.239PheGly: 2.239 ± 0.011
1.046PheHis: 1.046 ± 0.007
2.647PheIle: 2.647 ± 0.012
2.666PheLys: 2.666 ± 0.01
3.498PheLeu: 3.498 ± 0.012
0.934PheMet: 0.934 ± 0.006
2.172PheAsn: 2.172 ± 0.01
1.545PhePro: 1.545 ± 0.009
1.661PheGln: 1.661 ± 0.008
1.784PheArg: 1.784 ± 0.008
3.216PheSer: 3.216 ± 0.012
2.409PheThr: 2.409 ± 0.01
2.492PheVal: 2.492 ± 0.011
0.46PheTrp: 0.46 ± 0.004
1.437PheTyr: 1.437 ± 0.008
0.0PheXaa: 0.0 ± 0.0
Gly
2.274GlyAla: 2.274 ± 0.012
1.126GlyCys: 1.126 ± 0.009
2.864GlyAsp: 2.864 ± 0.014
2.773GlyGlu: 2.773 ± 0.012
2.118GlyPhe: 2.118 ± 0.011
3.166GlyGly: 3.166 ± 0.027
1.519GlyHis: 1.519 ± 0.01
3.49GlyIle: 3.49 ± 0.014
3.998GlyLys: 3.998 ± 0.018
3.913GlyLeu: 3.913 ± 0.017
1.241GlyMet: 1.241 ± 0.01
3.14GlyAsn: 3.14 ± 0.015
1.84GlyPro: 1.84 ± 0.016
2.197GlyGln: 2.197 ± 0.013
2.535GlyArg: 2.535 ± 0.014
4.107GlySer: 4.107 ± 0.017
3.144GlyThr: 3.144 ± 0.015
2.835GlyVal: 2.835 ± 0.013
0.599GlyTrp: 0.599 ± 0.006
2.11GlyTyr: 2.11 ± 0.017
0.0GlyXaa: 0.0 ± 0.0
His
1.101HisAla: 1.101 ± 0.007
0.627HisCys: 0.627 ± 0.008
1.381HisAsp: 1.381 ± 0.009
1.36HisGlu: 1.36 ± 0.008
1.119HisPhe: 1.119 ± 0.007
1.4HisGly: 1.4 ± 0.01
0.835HisHis: 0.835 ± 0.008
1.689HisIle: 1.689 ± 0.011
1.668HisLys: 1.668 ± 0.009
2.359HisLeu: 2.359 ± 0.012
0.605HisMet: 0.605 ± 0.005
1.35HisAsn: 1.35 ± 0.009
1.038HisPro: 1.038 ± 0.008
1.158HisGln: 1.158 ± 0.009
1.269HisArg: 1.269 ± 0.009
2.007HisSer: 2.007 ± 0.011
1.475HisThr: 1.475 ± 0.01
1.594HisVal: 1.594 ± 0.011
0.29HisTrp: 0.29 ± 0.004
0.915HisTyr: 0.915 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.208IleAla: 3.208 ± 0.013
1.656IleCys: 1.656 ± 0.01
4.079IleAsp: 4.079 ± 0.015
4.221IleGlu: 4.221 ± 0.014
2.472IlePhe: 2.472 ± 0.012
3.178IleGly: 3.178 ± 0.014
1.692IleHis: 1.692 ± 0.009
4.086IleIle: 4.086 ± 0.018
4.651IleLys: 4.651 ± 0.016
5.583IleLeu: 5.583 ± 0.017
1.37IleMet: 1.37 ± 0.007
3.772IleAsn: 3.772 ± 0.016
3.143IlePro: 3.143 ± 0.015
3.0IleGln: 3.0 ± 0.016
2.928IleArg: 2.928 ± 0.011
5.391IleSer: 5.391 ± 0.018
4.053IleThr: 4.053 ± 0.019
4.055IleVal: 4.055 ± 0.016
0.645IleTrp: 0.645 ± 0.005
2.134IleTyr: 2.134 ± 0.014
0.0IleXaa: 0.0 ± 0.0
Lys
3.474LysAla: 3.474 ± 0.014
1.836LysCys: 1.836 ± 0.012
4.53LysAsp: 4.53 ± 0.019
5.54LysGlu: 5.54 ± 0.024
2.606LysPhe: 2.606 ± 0.011
3.481LysGly: 3.481 ± 0.025
1.969LysHis: 1.969 ± 0.012
4.751LysIle: 4.751 ± 0.015
6.163LysLys: 6.163 ± 0.028
6.354LysLeu: 6.354 ± 0.019
1.828LysMet: 1.828 ± 0.009
4.016LysAsn: 4.016 ± 0.014
3.042LysPro: 3.042 ± 0.019
3.496LysGln: 3.496 ± 0.012
3.984LysArg: 3.984 ± 0.014
5.911LysSer: 5.911 ± 0.019
5.008LysThr: 5.008 ± 0.016
4.007LysVal: 4.007 ± 0.014
0.828LysTrp: 0.828 ± 0.006
2.613LysTyr: 2.613 ± 0.013
0.0LysXaa: 0.0 ± 0.0
Leu
4.001LeuAla: 4.001 ± 0.015
1.957LeuCys: 1.957 ± 0.011
4.8LeuAsp: 4.8 ± 0.017
5.465LeuGlu: 5.465 ± 0.022
3.329LeuPhe: 3.329 ± 0.014
3.696LeuGly: 3.696 ± 0.014
2.331LeuHis: 2.331 ± 0.012
4.656LeuIle: 4.656 ± 0.016
6.771LeuLys: 6.771 ± 0.02
7.623LeuLeu: 7.623 ± 0.028
1.925LeuMet: 1.925 ± 0.01
4.684LeuAsn: 4.684 ± 0.015
3.94LeuPro: 3.94 ± 0.016
4.462LeuGln: 4.462 ± 0.017
4.042LeuArg: 4.042 ± 0.015
6.968LeuSer: 6.968 ± 0.02
5.083LeuThr: 5.083 ± 0.018
4.531LeuVal: 4.531 ± 0.015
0.797LeuTrp: 0.797 ± 0.006
2.867LeuTyr: 2.867 ± 0.014
0.0LeuXaa: 0.0 ± 0.0
Met
1.406MetAla: 1.406 ± 0.008
0.504MetCys: 0.504 ± 0.005
1.359MetAsp: 1.359 ± 0.007
1.654MetGlu: 1.654 ± 0.008
1.01MetPhe: 1.01 ± 0.007
1.001MetGly: 1.001 ± 0.007
0.476MetHis: 0.476 ± 0.005
1.379MetIle: 1.379 ± 0.008
1.976MetLys: 1.976 ± 0.01
1.918MetLeu: 1.918 ± 0.01
0.686MetMet: 0.686 ± 0.006
1.329MetAsn: 1.329 ± 0.007
0.935MetPro: 0.935 ± 0.008
0.907MetGln: 0.907 ± 0.007
0.998MetArg: 0.998 ± 0.007
2.002MetSer: 2.002 ± 0.012
1.512MetThr: 1.512 ± 0.01
1.299MetVal: 1.299 ± 0.007
0.236MetTrp: 0.236 ± 0.003
0.822MetTyr: 0.822 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.58AsnAla: 2.58 ± 0.011
1.388AsnCys: 1.388 ± 0.01
3.335AsnAsp: 3.335 ± 0.014
3.685AsnGlu: 3.685 ± 0.015
2.244AsnPhe: 2.244 ± 0.009
3.417AsnGly: 3.417 ± 0.017
1.359AsnHis: 1.359 ± 0.008
4.595AsnIle: 4.595 ± 0.016
4.29AsnLys: 4.29 ± 0.016
4.763AsnLeu: 4.763 ± 0.015
1.417AsnMet: 1.417 ± 0.009
3.739AsnAsn: 3.739 ± 0.017
2.24AsnPro: 2.24 ± 0.011
2.471AsnGln: 2.471 ± 0.013
2.728AsnArg: 2.728 ± 0.013
4.537AsnSer: 4.537 ± 0.016
3.698AsnThr: 3.698 ± 0.016
3.671AsnVal: 3.671 ± 0.016
0.579AsnTrp: 0.579 ± 0.005
1.869AsnTyr: 1.869 ± 0.007
0.0AsnXaa: 0.0 ± 0.0
Pro
2.033ProAla: 2.033 ± 0.012
0.872ProCys: 0.872 ± 0.01
2.458ProAsp: 2.458 ± 0.013
2.708ProGlu: 2.708 ± 0.014
1.546ProPhe: 1.546 ± 0.007
2.305ProGly: 2.305 ± 0.019
0.921ProHis: 0.921 ± 0.007
2.339ProIle: 2.339 ± 0.015
2.868ProLys: 2.868 ± 0.015
3.347ProLeu: 3.347 ± 0.016
0.84ProMet: 0.84 ± 0.007
2.228ProAsn: 2.228 ± 0.011
2.653ProPro: 2.653 ± 0.023
1.64ProGln: 1.64 ± 0.011
1.834ProArg: 1.834 ± 0.012
3.723ProSer: 3.723 ± 0.023
2.787ProThr: 2.787 ± 0.021
2.965ProVal: 2.965 ± 0.014
0.41ProTrp: 0.41 ± 0.005
1.361ProTyr: 1.361 ± 0.008
0.0ProXaa: 0.0 ± 0.0
Gln
1.947GlnAla: 1.947 ± 0.011
0.968GlnCys: 0.968 ± 0.008
2.154GlnAsp: 2.154 ± 0.011
2.789GlnGlu: 2.789 ± 0.014
1.561GlnPhe: 1.561 ± 0.008
1.936GlnGly: 1.936 ± 0.014
1.233GlnHis: 1.233 ± 0.01
2.759GlnIle: 2.759 ± 0.012
3.284GlnLys: 3.284 ± 0.015
3.611GlnLeu: 3.611 ± 0.015
1.112GlnMet: 1.112 ± 0.008
2.868GlnAsn: 2.868 ± 0.012
1.803GlnPro: 1.803 ± 0.015
2.563GlnGln: 2.563 ± 0.02
2.243GlnArg: 2.243 ± 0.013
3.426GlnSer: 3.426 ± 0.014
2.899GlnThr: 2.899 ± 0.017
2.12GlnVal: 2.12 ± 0.01
0.513GlnTrp: 0.513 ± 0.005
1.535GlnTyr: 1.535 ± 0.01
0.0GlnXaa: 0.0 ± 0.0
Arg
1.961ArgAla: 1.961 ± 0.01
1.05ArgCys: 1.05 ± 0.008
2.46ArgAsp: 2.46 ± 0.014
2.908ArgGlu: 2.908 ± 0.015
1.848ArgPhe: 1.848 ± 0.008
2.278ArgGly: 2.278 ± 0.014
1.322ArgHis: 1.322 ± 0.009
2.861ArgIle: 2.861 ± 0.013
4.27ArgLys: 4.27 ± 0.016
4.09ArgLeu: 4.09 ± 0.015
1.137ArgMet: 1.137 ± 0.007
2.955ArgAsn: 2.955 ± 0.012
1.956ArgPro: 1.956 ± 0.011
2.379ArgGln: 2.379 ± 0.015
3.113ArgArg: 3.113 ± 0.018
3.66ArgSer: 3.66 ± 0.016
2.784ArgThr: 2.784 ± 0.013
2.311ArgVal: 2.311 ± 0.012
0.533ArgTrp: 0.533 ± 0.004
1.69ArgTyr: 1.69 ± 0.01
0.0ArgXaa: 0.0 ± 0.0
Ser
3.82SerAla: 3.82 ± 0.016
1.849SerCys: 1.849 ± 0.012
5.23SerAsp: 5.23 ± 0.018
5.028SerGlu: 5.028 ± 0.018
3.199SerPhe: 3.199 ± 0.012
4.464SerGly: 4.464 ± 0.019
1.907SerHis: 1.907 ± 0.011
4.774SerIle: 4.774 ± 0.017
5.7SerLys: 5.7 ± 0.02
6.557SerLeu: 6.557 ± 0.019
1.764SerMet: 1.764 ± 0.01
4.627SerAsn: 4.627 ± 0.018
3.734SerPro: 3.734 ± 0.025
3.238SerGln: 3.238 ± 0.013
3.723SerArg: 3.723 ± 0.016
8.134SerSer: 8.134 ± 0.037
5.401SerThr: 5.401 ± 0.028
5.066SerVal: 5.066 ± 0.017
0.786SerTrp: 0.786 ± 0.007
2.571SerTyr: 2.571 ± 0.013
0.0SerXaa: 0.0 ± 0.0
Thr
3.25ThrAla: 3.25 ± 0.015
1.676ThrCys: 1.676 ± 0.013
4.004ThrAsp: 4.004 ± 0.02
4.373ThrGlu: 4.373 ± 0.02
2.572ThrPhe: 2.572 ± 0.01
3.413ThrGly: 3.413 ± 0.017
1.327ThrHis: 1.327 ± 0.008
3.97ThrIle: 3.97 ± 0.016
4.291ThrLys: 4.291 ± 0.014
4.974ThrLeu: 4.974 ± 0.018
1.337ThrMet: 1.337 ± 0.01
3.481ThrAsn: 3.481 ± 0.015
2.998ThrPro: 2.998 ± 0.019
2.271ThrGln: 2.271 ± 0.013
2.528ThrArg: 2.528 ± 0.012
5.527ThrSer: 5.527 ± 0.025
4.86ThrThr: 4.86 ± 0.049
4.335ThrVal: 4.335 ± 0.019
0.633ThrTrp: 0.633 ± 0.005
1.979ThrTyr: 1.979 ± 0.011
0.0ThrXaa: 0.0 ± 0.0
Val
2.892ValAla: 2.892 ± 0.013
1.509ValCys: 1.509 ± 0.01
3.583ValAsp: 3.583 ± 0.013
3.624ValGlu: 3.624 ± 0.012
2.477ValPhe: 2.477 ± 0.011
2.714ValGly: 2.714 ± 0.012
1.512ValHis: 1.512 ± 0.01
4.017ValIle: 4.017 ± 0.016
4.442ValLys: 4.442 ± 0.019
5.166ValLeu: 5.166 ± 0.016
1.374ValMet: 1.374 ± 0.008
3.516ValAsn: 3.516 ± 0.016
2.54ValPro: 2.54 ± 0.014
2.538ValGln: 2.538 ± 0.012
2.6ValArg: 2.6 ± 0.012
4.643ValSer: 4.643 ± 0.017
4.026ValThr: 4.026 ± 0.019
3.897ValVal: 3.897 ± 0.016
0.62ValTrp: 0.62 ± 0.006
2.071ValTyr: 2.071 ± 0.011
0.0ValXaa: 0.0 ± 0.0
Trp
0.427TrpAla: 0.427 ± 0.004
0.263TrpCys: 0.263 ± 0.003
0.563TrpAsp: 0.563 ± 0.005
0.585TrpGlu: 0.585 ± 0.005
0.449TrpPhe: 0.449 ± 0.004
0.485TrpGly: 0.485 ± 0.005
0.251TrpHis: 0.251 ± 0.004
0.723TrpIle: 0.723 ± 0.006
0.945TrpLys: 0.945 ± 0.007
0.915TrpLeu: 0.915 ± 0.007
0.276TrpMet: 0.276 ± 0.004
0.669TrpAsn: 0.669 ± 0.005
0.362TrpPro: 0.362 ± 0.004
0.397TrpGln: 0.397 ± 0.004
0.546TrpArg: 0.546 ± 0.005
0.803TrpSer: 0.803 ± 0.008
0.737TrpThr: 0.737 ± 0.007
0.506TrpVal: 0.506 ± 0.005
0.15TrpTrp: 0.15 ± 0.003
0.393TrpTyr: 0.393 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.486TyrAla: 1.486 ± 0.008
0.926TyrCys: 0.926 ± 0.009
2.01TyrAsp: 2.01 ± 0.012
1.874TyrGlu: 1.874 ± 0.009
1.492TyrPhe: 1.492 ± 0.009
1.939TyrGly: 1.939 ± 0.015
0.943TyrHis: 0.943 ± 0.008
2.32TyrIle: 2.32 ± 0.011
2.374TyrLys: 2.374 ± 0.014
2.967TyrLeu: 2.967 ± 0.015
0.82TyrMet: 0.82 ± 0.007
2.058TyrAsn: 2.058 ± 0.012
1.273TyrPro: 1.273 ± 0.009
1.429TyrGln: 1.429 ± 0.009
1.76TyrArg: 1.76 ± 0.009
2.595TyrSer: 2.595 ± 0.011
2.081TyrThr: 2.081 ± 0.013
1.943TyrVal: 1.943 ± 0.01
0.431TyrTrp: 0.431 ± 0.005
1.258TyrTyr: 1.258 ± 0.01
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64248 proteins (29520201 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski