Amino acid dipepetide frequency for Latimeria chalumnae (Coelacanth)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.269AlaAla: 5.269 ± 0.036
1.258AlaCys: 1.258 ± 0.013
2.829AlaAsp: 2.829 ± 0.015
4.354AlaGlu: 4.354 ± 0.025
2.54AlaPhe: 2.54 ± 0.019
3.647AlaGly: 3.647 ± 0.021
1.357AlaHis: 1.357 ± 0.012
3.106AlaIle: 3.106 ± 0.018
3.629AlaLys: 3.629 ± 0.021
6.045AlaLeu: 6.045 ± 0.029
1.455AlaMet: 1.455 ± 0.011
2.295AlaAsn: 2.295 ± 0.015
2.707AlaPro: 2.707 ± 0.021
2.594AlaGln: 2.594 ± 0.017
2.806AlaArg: 2.806 ± 0.017
4.961AlaSer: 4.961 ± 0.024
3.395AlaThr: 3.395 ± 0.021
4.596AlaVal: 4.596 ± 0.023
0.664AlaTrp: 0.664 ± 0.007
1.588AlaTyr: 1.588 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
1.147CysAla: 1.147 ± 0.011
0.676CysCys: 0.676 ± 0.011
1.081CysAsp: 1.081 ± 0.015
1.326CysGlu: 1.326 ± 0.017
1.019CysPhe: 1.019 ± 0.011
1.449CysGly: 1.449 ± 0.02
0.617CysHis: 0.617 ± 0.008
1.25CysIle: 1.25 ± 0.014
1.418CysLys: 1.418 ± 0.012
2.168CysLeu: 2.168 ± 0.019
0.483CysMet: 0.483 ± 0.007
1.033CysAsn: 1.033 ± 0.013
1.172CysPro: 1.172 ± 0.018
1.035CysGln: 1.035 ± 0.014
1.221CysArg: 1.221 ± 0.012
2.035CysSer: 2.035 ± 0.019
1.32CysThr: 1.32 ± 0.014
1.398CysVal: 1.398 ± 0.014
0.31CysTrp: 0.31 ± 0.006
0.689CysTyr: 0.689 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
2.68AspAla: 2.68 ± 0.016
1.144AspCys: 1.144 ± 0.013
2.83AspAsp: 2.83 ± 0.026
3.521AspGlu: 3.521 ± 0.025
2.211AspPhe: 2.211 ± 0.012
3.136AspGly: 3.136 ± 0.023
1.175AspHis: 1.175 ± 0.011
2.958AspIle: 2.958 ± 0.018
2.775AspLys: 2.775 ± 0.018
5.016AspLeu: 5.016 ± 0.022
1.117AspMet: 1.117 ± 0.01
1.989AspAsn: 1.989 ± 0.014
2.569AspPro: 2.569 ± 0.016
1.914AspGln: 1.914 ± 0.014
2.391AspArg: 2.391 ± 0.019
4.142AspSer: 4.142 ± 0.028
2.519AspThr: 2.519 ± 0.018
3.068AspVal: 3.068 ± 0.02
0.647AspTrp: 0.647 ± 0.007
1.657AspTyr: 1.657 ± 0.015
0.0AspXaa: 0.0 ± 0.0
Glu
4.445GluAla: 4.445 ± 0.024
1.346GluCys: 1.346 ± 0.021
4.238GluAsp: 4.238 ± 0.024
7.658GluGlu: 7.658 ± 0.056
2.226GluPhe: 2.226 ± 0.014
3.837GluGly: 3.837 ± 0.02
1.493GluHis: 1.493 ± 0.011
3.567GluIle: 3.567 ± 0.02
5.881GluLys: 5.881 ± 0.041
6.321GluLeu: 6.321 ± 0.037
1.788GluMet: 1.788 ± 0.014
3.552GluAsn: 3.552 ± 0.02
2.476GluPro: 2.476 ± 0.02
3.148GluGln: 3.148 ± 0.023
3.862GluArg: 3.862 ± 0.028
4.56GluSer: 4.56 ± 0.026
3.672GluThr: 3.672 ± 0.022
4.309GluVal: 4.309 ± 0.021
0.729GluTrp: 0.729 ± 0.007
1.807GluTyr: 1.807 ± 0.016
0.002GluXaa: 0.002 ± 0.0
Phe
2.101PheAla: 2.101 ± 0.015
1.071PheCys: 1.071 ± 0.012
1.862PheAsp: 1.862 ± 0.013
2.115PheGlu: 2.115 ± 0.016
2.069PhePhe: 2.069 ± 0.019
2.247PheGly: 2.247 ± 0.016
1.093PheHis: 1.093 ± 0.009
2.178PheIle: 2.178 ± 0.017
2.227PheLys: 2.227 ± 0.015
4.341PheLeu: 4.341 ± 0.025
0.833PheMet: 0.833 ± 0.009
1.673PheAsn: 1.673 ± 0.015
1.983PhePro: 1.983 ± 0.015
1.876PheGln: 1.876 ± 0.013
1.895PheArg: 1.895 ± 0.017
3.565PheSer: 3.565 ± 0.021
2.338PheThr: 2.338 ± 0.015
2.383PheVal: 2.383 ± 0.018
0.522PheTrp: 0.522 ± 0.008
1.372PheTyr: 1.372 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
3.325GlyAla: 3.325 ± 0.021
1.177GlyCys: 1.177 ± 0.012
2.862GlyAsp: 2.862 ± 0.021
3.722GlyGlu: 3.722 ± 0.028
2.491GlyPhe: 2.491 ± 0.018
4.018GlyGly: 4.018 ± 0.028
1.447GlyHis: 1.447 ± 0.013
3.105GlyIle: 3.105 ± 0.018
3.982GlyLys: 3.982 ± 0.026
5.017GlyLeu: 5.017 ± 0.027
1.361GlyMet: 1.361 ± 0.014
2.649GlyAsn: 2.649 ± 0.019
2.638GlyPro: 2.638 ± 0.042
2.455GlyGln: 2.455 ± 0.016
3.059GlyArg: 3.059 ± 0.021
5.032GlySer: 5.032 ± 0.03
3.394GlyThr: 3.394 ± 0.019
3.496GlyVal: 3.496 ± 0.02
0.728GlyTrp: 0.728 ± 0.009
1.875GlyTyr: 1.875 ± 0.017
0.0GlyXaa: 0.0 ± 0.0
His
1.257HisAla: 1.257 ± 0.011
0.739HisCys: 0.739 ± 0.009
0.947HisAsp: 0.947 ± 0.01
1.32HisGlu: 1.32 ± 0.012
1.155HisPhe: 1.155 ± 0.01
1.405HisGly: 1.405 ± 0.012
0.881HisHis: 0.881 ± 0.011
1.379HisIle: 1.379 ± 0.01
1.436HisLys: 1.436 ± 0.012
2.713HisLeu: 2.713 ± 0.019
0.577HisMet: 0.577 ± 0.009
1.027HisAsn: 1.027 ± 0.011
1.418HisPro: 1.418 ± 0.013
1.207HisGln: 1.207 ± 0.013
1.423HisArg: 1.423 ± 0.012
2.243HisSer: 2.243 ± 0.016
1.437HisThr: 1.437 ± 0.016
1.413HisVal: 1.413 ± 0.013
0.351HisTrp: 0.351 ± 0.006
0.886HisTyr: 0.886 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
3.033IleAla: 3.033 ± 0.017
1.297IleCys: 1.297 ± 0.015
2.402IleAsp: 2.402 ± 0.017
3.037IleGlu: 3.037 ± 0.02
2.174IlePhe: 2.174 ± 0.02
2.608IleGly: 2.608 ± 0.015
1.404IleHis: 1.404 ± 0.015
3.012IleIle: 3.012 ± 0.02
3.267IleLys: 3.267 ± 0.02
5.119IleLeu: 5.119 ± 0.028
1.153IleMet: 1.153 ± 0.011
2.298IleAsn: 2.298 ± 0.016
2.834IlePro: 2.834 ± 0.015
2.575IleGln: 2.575 ± 0.017
2.572IleArg: 2.572 ± 0.015
4.277IleSer: 4.277 ± 0.022
3.08IleThr: 3.08 ± 0.02
3.084IleVal: 3.084 ± 0.021
0.599IleTrp: 0.599 ± 0.008
1.669IleTyr: 1.669 ± 0.014
0.0IleXaa: 0.0 ± 0.0
Lys
4.066LysAla: 4.066 ± 0.023
1.34LysCys: 1.34 ± 0.015
3.46LysAsp: 3.46 ± 0.022
5.731LysGlu: 5.731 ± 0.036
2.027LysPhe: 2.027 ± 0.014
3.521LysGly: 3.521 ± 0.025
1.642LysHis: 1.642 ± 0.014
3.333LysIle: 3.333 ± 0.021
5.908LysLys: 5.908 ± 0.039
6.055LysLeu: 6.055 ± 0.029
1.628LysMet: 1.628 ± 0.013
3.0LysAsn: 3.0 ± 0.018
3.084LysPro: 3.084 ± 0.026
3.101LysGln: 3.101 ± 0.019
3.667LysArg: 3.667 ± 0.023
4.458LysSer: 4.458 ± 0.022
3.615LysThr: 3.615 ± 0.018
4.02LysVal: 4.02 ± 0.021
0.705LysTrp: 0.705 ± 0.008
1.888LysTyr: 1.888 ± 0.015
0.003LysXaa: 0.003 ± 0.0
Leu
5.921LeuAla: 5.921 ± 0.027
2.202LeuCys: 2.202 ± 0.021
4.685LeuAsp: 4.685 ± 0.022
6.871LeuGlu: 6.871 ± 0.041
3.723LeuPhe: 3.723 ± 0.021
5.007LeuGly: 5.007 ± 0.027
2.682LeuHis: 2.682 ± 0.017
4.418LeuIle: 4.418 ± 0.023
6.773LeuLys: 6.773 ± 0.03
9.789LeuLeu: 9.789 ± 0.049
2.046LeuMet: 2.046 ± 0.016
4.158LeuAsn: 4.158 ± 0.019
5.074LeuPro: 5.074 ± 0.027
5.716LeuGln: 5.716 ± 0.036
5.042LeuArg: 5.042 ± 0.024
7.813LeuSer: 7.813 ± 0.031
5.138LeuThr: 5.138 ± 0.022
5.469LeuVal: 5.469 ± 0.027
1.069LeuTrp: 1.069 ± 0.011
2.848LeuTyr: 2.848 ± 0.018
0.002LeuXaa: 0.002 ± 0.0
Met
1.732MetAla: 1.732 ± 0.012
0.446MetCys: 0.446 ± 0.006
1.307MetAsp: 1.307 ± 0.01
1.931MetGlu: 1.931 ± 0.015
0.838MetPhe: 0.838 ± 0.009
1.265MetGly: 1.265 ± 0.012
0.512MetHis: 0.512 ± 0.007
1.023MetIle: 1.023 ± 0.008
1.656MetLys: 1.656 ± 0.011
2.085MetLeu: 2.085 ± 0.013
0.619MetMet: 0.619 ± 0.009
0.998MetAsn: 0.998 ± 0.01
1.005MetPro: 1.005 ± 0.011
1.036MetGln: 1.036 ± 0.011
1.035MetArg: 1.035 ± 0.009
1.66MetSer: 1.66 ± 0.013
1.198MetThr: 1.198 ± 0.013
1.442MetVal: 1.442 ± 0.011
0.258MetTrp: 0.258 ± 0.006
0.678MetTyr: 0.678 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.36AsnAla: 2.36 ± 0.015
1.069AsnCys: 1.069 ± 0.014
1.816AsnAsp: 1.816 ± 0.014
2.673AsnGlu: 2.673 ± 0.018
1.769AsnPhe: 1.769 ± 0.015
2.836AsnGly: 2.836 ± 0.025
1.043AsnHis: 1.043 ± 0.009
2.687AsnIle: 2.687 ± 0.02
2.768AsnLys: 2.768 ± 0.016
4.282AsnLeu: 4.282 ± 0.022
1.063AsnMet: 1.063 ± 0.011
2.015AsnAsn: 2.015 ± 0.016
2.352AsnPro: 2.352 ± 0.017
1.901AsnGln: 1.901 ± 0.014
2.134AsnArg: 2.134 ± 0.016
3.707AsnSer: 3.707 ± 0.021
2.423AsnThr: 2.423 ± 0.019
2.64AsnVal: 2.64 ± 0.016
0.545AsnTrp: 0.545 ± 0.008
1.371AsnTyr: 1.371 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
3.309ProAla: 3.309 ± 0.024
1.001ProCys: 1.001 ± 0.013
2.479ProAsp: 2.479 ± 0.016
3.573ProGlu: 3.573 ± 0.025
1.876ProPhe: 1.876 ± 0.015
3.432ProGly: 3.432 ± 0.057
1.224ProHis: 1.224 ± 0.011
2.092ProIle: 2.092 ± 0.017
2.813ProLys: 2.813 ± 0.024
4.454ProLeu: 4.454 ± 0.023
0.958ProMet: 0.958 ± 0.011
2.028ProAsn: 2.028 ± 0.015
4.258ProPro: 4.258 ± 0.046
2.321ProGln: 2.321 ± 0.02
2.359ProArg: 2.359 ± 0.019
4.735ProSer: 4.735 ± 0.032
2.742ProThr: 2.742 ± 0.022
3.604ProVal: 3.604 ± 0.022
0.543ProTrp: 0.543 ± 0.007
1.406ProTyr: 1.406 ± 0.013
0.001ProXaa: 0.001 ± 0.0
Gln
2.895GlnAla: 2.895 ± 0.017
1.0GlnCys: 1.0 ± 0.014
2.212GlnAsp: 2.212 ± 0.014
3.683GlnGlu: 3.683 ± 0.027
1.522GlnPhe: 1.522 ± 0.012
2.48GlnGly: 2.48 ± 0.019
1.308GlnHis: 1.308 ± 0.013
2.256GlnIle: 2.256 ± 0.016
3.294GlnLys: 3.294 ± 0.021
4.603GlnLeu: 4.603 ± 0.026
1.105GlnMet: 1.105 ± 0.01
2.19GlnAsn: 2.19 ± 0.015
2.222GlnPro: 2.222 ± 0.021
3.218GlnGln: 3.218 ± 0.041
2.616GlnArg: 2.616 ± 0.019
3.351GlnSer: 3.351 ± 0.022
2.452GlnThr: 2.452 ± 0.015
2.76GlnVal: 2.76 ± 0.018
0.544GlnTrp: 0.544 ± 0.007
1.353GlnTyr: 1.353 ± 0.011
0.001GlnXaa: 0.001 ± 0.0
Arg
2.897ArgAla: 2.897 ± 0.016
1.104ArgCys: 1.104 ± 0.013
2.513ArgAsp: 2.513 ± 0.021
3.64ArgGlu: 3.64 ± 0.026
1.927ArgPhe: 1.927 ± 0.015
2.86ArgGly: 2.86 ± 0.025
1.374ArgHis: 1.374 ± 0.012
2.654ArgIle: 2.654 ± 0.016
3.878ArgLys: 3.878 ± 0.021
4.754ArgLeu: 4.754 ± 0.027
1.189ArgMet: 1.189 ± 0.01
2.311ArgAsn: 2.311 ± 0.014
2.369ArgPro: 2.369 ± 0.019
2.376ArgGln: 2.376 ± 0.018
3.533ArgArg: 3.533 ± 0.024
4.023ArgSer: 4.023 ± 0.028
2.698ArgThr: 2.698 ± 0.016
2.908ArgVal: 2.908 ± 0.019
0.687ArgTrp: 0.687 ± 0.008
1.567ArgTyr: 1.567 ± 0.013
0.001ArgXaa: 0.001 ± 0.0
Ser
4.841SerAla: 4.841 ± 0.024
1.893SerCys: 1.893 ± 0.017
4.025SerAsp: 4.025 ± 0.024
5.193SerGlu: 5.193 ± 0.03
3.282SerPhe: 3.282 ± 0.017
4.979SerGly: 4.979 ± 0.027
2.013SerHis: 2.013 ± 0.015
3.862SerIle: 3.862 ± 0.019
4.717SerLys: 4.717 ± 0.023
7.942SerLeu: 7.942 ± 0.031
1.724SerMet: 1.724 ± 0.012
3.378SerAsn: 3.378 ± 0.02
4.938SerPro: 4.938 ± 0.04
3.661SerGln: 3.661 ± 0.025
4.11SerArg: 4.11 ± 0.025
9.107SerSer: 9.107 ± 0.064
4.811SerThr: 4.811 ± 0.028
5.152SerVal: 5.152 ± 0.026
0.973SerTrp: 0.973 ± 0.011
2.276SerTyr: 2.276 ± 0.015
0.002SerXaa: 0.002 ± 0.0
Thr
3.789ThrAla: 3.789 ± 0.019
1.377ThrCys: 1.377 ± 0.017
2.79ThrAsp: 2.79 ± 0.017
3.935ThrGlu: 3.935 ± 0.021
2.282ThrPhe: 2.282 ± 0.014
3.454ThrGly: 3.454 ± 0.02
1.258ThrHis: 1.258 ± 0.014
2.871ThrIle: 2.871 ± 0.021
3.084ThrLys: 3.084 ± 0.02
5.356ThrLeu: 5.356 ± 0.023
1.196ThrMet: 1.196 ± 0.01
2.153ThrAsn: 2.153 ± 0.016
3.133ThrPro: 3.133 ± 0.025
2.222ThrGln: 2.222 ± 0.014
2.343ThrArg: 2.343 ± 0.012
4.806ThrSer: 4.806 ± 0.03
3.28ThrThr: 3.28 ± 0.029
4.182ThrVal: 4.182 ± 0.021
0.692ThrTrp: 0.692 ± 0.01
1.576ThrTyr: 1.576 ± 0.013
0.001ThrXaa: 0.001 ± 0.0
Val
3.91ValAla: 3.91 ± 0.022
1.636ValCys: 1.636 ± 0.016
3.054ValAsp: 3.054 ± 0.019
4.093ValGlu: 4.093 ± 0.024
2.676ValPhe: 2.676 ± 0.019
3.254ValGly: 3.254 ± 0.019
1.59ValHis: 1.59 ± 0.013
3.343ValIle: 3.343 ± 0.021
4.002ValLys: 4.002 ± 0.024
6.228ValLeu: 6.228 ± 0.028
1.404ValMet: 1.404 ± 0.011
2.659ValAsn: 2.659 ± 0.017
3.197ValPro: 3.197 ± 0.018
2.851ValGln: 2.851 ± 0.017
2.834ValArg: 2.834 ± 0.016
5.035ValSer: 5.035 ± 0.026
3.906ValThr: 3.906 ± 0.022
4.099ValVal: 4.099 ± 0.025
0.744ValTrp: 0.744 ± 0.009
1.897ValTyr: 1.897 ± 0.014
0.001ValXaa: 0.001 ± 0.0
Trp
0.634TrpAla: 0.634 ± 0.007
0.25TrpCys: 0.25 ± 0.005
0.648TrpAsp: 0.648 ± 0.008
0.784TrpGlu: 0.784 ± 0.01
0.495TrpPhe: 0.495 ± 0.007
0.653TrpGly: 0.653 ± 0.011
0.294TrpHis: 0.294 ± 0.006
0.664TrpIle: 0.664 ± 0.009
0.903TrpLys: 0.903 ± 0.011
1.22TrpLeu: 1.22 ± 0.012
0.33TrpMet: 0.33 ± 0.006
0.641TrpAsn: 0.641 ± 0.008
0.408TrpPro: 0.408 ± 0.006
0.533TrpGln: 0.533 ± 0.007
0.683TrpArg: 0.683 ± 0.007
0.862TrpSer: 0.862 ± 0.011
0.657TrpThr: 0.657 ± 0.009
0.686TrpVal: 0.686 ± 0.008
0.199TrpTrp: 0.199 ± 0.004
0.386TrpTyr: 0.386 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.484TyrAla: 1.484 ± 0.012
0.821TyrCys: 0.821 ± 0.01
1.42TyrAsp: 1.42 ± 0.012
1.756TyrGlu: 1.756 ± 0.012
1.443TyrPhe: 1.443 ± 0.012
1.742TyrGly: 1.742 ± 0.014
0.807TyrHis: 0.807 ± 0.009
1.735TyrIle: 1.735 ± 0.016
1.839TyrLys: 1.839 ± 0.016
2.937TyrLeu: 2.937 ± 0.02
0.694TyrMet: 0.694 ± 0.009
1.408TyrAsn: 1.408 ± 0.012
1.369TyrPro: 1.369 ± 0.011
1.325TyrGln: 1.325 ± 0.011
1.678TyrArg: 1.678 ± 0.014
2.508TyrSer: 2.508 ± 0.016
1.703TyrThr: 1.703 ± 0.014
1.703TyrVal: 1.703 ± 0.015
0.418TyrTrp: 0.418 ± 0.008
1.146TyrTyr: 1.146 ± 0.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.003XaaLys: 0.003 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.019XaaXaa: 0.019 ± 0.006
Statistics based on 23431 proteins (12138361 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski