Amino acid dipeptide frequency for Callorhinchus milii (Ghost shark)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 400 equally likely, each would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
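As an illustration of the comparison described above, here is a minimal Python sketch that counts dipeptides and checks them against the uniform expectation. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any non-standard letter is mapped to X (Xaa), and that the over/under cut-off is exactly the uniform value of 1/400; the function names are hypothetical, and the database's own counting procedure may differ.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400       # 0.25% expressed in per mille (2.5)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard residues becomes X.
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKLLSA", "MLLKXA"])
    for dp, f in sorted(freqs.items()):
        print(dp, round(f, 1), classify(f))
```

Under this cut-off, a value such as 5.154‰ (AlaAla) would count as overrepresented and 0.664‰ (AlaTrp) as underrepresented.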

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions (the uniform expectation of 0.25% corresponds to 2.5‰).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.154 ± 0.022
AlaCys: 1.257 ± 0.007
AlaAsp: 2.986 ± 0.011
AlaGlu: 4.359 ± 0.016
AlaPhe: 2.472 ± 0.011
AlaGly: 3.747 ± 0.015
AlaHis: 1.391 ± 0.008
AlaIle: 3.191 ± 0.011
AlaLys: 3.543 ± 0.016
AlaLeu: 6.267 ± 0.019
AlaMet: 1.583 ± 0.011
AlaAsn: 2.357 ± 0.009
AlaPro: 2.737 ± 0.014
AlaGln: 2.752 ± 0.012
AlaArg: 3.008 ± 0.011
AlaSer: 4.833 ± 0.016
AlaThr: 3.333 ± 0.011
AlaVal: 4.621 ± 0.015
AlaTrp: 0.664 ± 0.006
AlaTyr: 1.556 ± 0.008
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.235 ± 0.009
CysCys: 0.646 ± 0.005
CysAsp: 1.147 ± 0.012
CysGlu: 1.336 ± 0.009
CysPhe: 0.982 ± 0.006
CysGly: 1.464 ± 0.011
CysHis: 0.641 ± 0.005
CysIle: 1.226 ± 0.008
CysLys: 1.286 ± 0.009
CysLeu: 2.277 ± 0.011
CysMet: 0.466 ± 0.004
CysAsn: 0.96 ± 0.008
CysPro: 1.183 ± 0.009
CysGln: 1.051 ± 0.008
CysArg: 1.248 ± 0.008
CysSer: 2.008 ± 0.011
CysThr: 1.25 ± 0.008
CysVal: 1.56 ± 0.012
CysTrp: 0.29 ± 0.003
CysTyr: 0.64 ± 0.005
CysXaa: 0.001 ± 0.0
Asp
AspAla: 2.753 ± 0.01
AspCys: 1.18 ± 0.01
AspAsp: 2.919 ± 0.016
AspGlu: 3.784 ± 0.014
AspPhe: 2.18 ± 0.008
AspGly: 3.447 ± 0.018
AspHis: 1.193 ± 0.006
AspIle: 3.027 ± 0.011
AspLys: 2.868 ± 0.011
AspLeu: 5.165 ± 0.014
AspMet: 1.186 ± 0.007
AspAsn: 2.077 ± 0.01
AspPro: 2.627 ± 0.012
AspGln: 2.009 ± 0.01
AspArg: 2.556 ± 0.011
AspSer: 4.077 ± 0.012
AspThr: 2.582 ± 0.009
AspVal: 3.382 ± 0.012
AspTrp: 0.694 ± 0.005
AspTyr: 1.718 ± 0.008
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.482 ± 0.017
GluCys: 1.365 ± 0.011
GluAsp: 4.179 ± 0.013
GluGlu: 7.009 ± 0.03
GluPhe: 2.3 ± 0.01
GluGly: 3.803 ± 0.012
GluHis: 1.569 ± 0.008
GluIle: 3.646 ± 0.013
GluLys: 5.148 ± 0.021
GluLeu: 6.449 ± 0.021
GluMet: 1.883 ± 0.008
GluAsn: 3.272 ± 0.012
GluPro: 2.467 ± 0.012
GluGln: 3.192 ± 0.013
GluArg: 3.979 ± 0.016
GluSer: 4.395 ± 0.016
GluThr: 3.556 ± 0.013
GluVal: 4.207 ± 0.013
GluTrp: 0.755 ± 0.005
GluTyr: 1.851 ± 0.008
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.24 ± 0.009
PheCys: 1.011 ± 0.007
PheAsp: 1.997 ± 0.008
PheGlu: 2.21 ± 0.009
PhePhe: 1.701 ± 0.01
PheGly: 2.317 ± 0.011
PheHis: 1.108 ± 0.006
PheIle: 2.17 ± 0.011
PheLys: 2.048 ± 0.008
PheLeu: 4.107 ± 0.015
PheMet: 0.835 ± 0.006
PheAsn: 1.69 ± 0.008
PhePro: 1.894 ± 0.009
PheGln: 1.806 ± 0.008
PheArg: 1.951 ± 0.009
PheSer: 3.297 ± 0.012
PheThr: 2.358 ± 0.01
PheVal: 2.439 ± 0.009
PheTrp: 0.511 ± 0.005
PheTyr: 1.32 ± 0.008
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 3.433 ± 0.015
GlyCys: 1.207 ± 0.009
GlyAsp: 3.019 ± 0.012
GlyGlu: 3.714 ± 0.014
GlyPhe: 2.441 ± 0.011
GlyGly: 4.105 ± 0.017
GlyHis: 1.528 ± 0.008
GlyIle: 3.166 ± 0.013
GlyLys: 3.908 ± 0.012
GlyLeu: 5.206 ± 0.014
GlyMet: 1.417 ± 0.008
GlyAsn: 2.668 ± 0.011
GlyPro: 2.522 ± 0.019
GlyGln: 2.522 ± 0.01
GlyArg: 3.243 ± 0.015
GlySer: 5.043 ± 0.016
GlyThr: 3.458 ± 0.013
GlyVal: 3.579 ± 0.013
GlyTrp: 0.776 ± 0.006
GlyTyr: 1.904 ± 0.011
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.219 ± 0.006
HisCys: 0.761 ± 0.006
HisAsp: 1.011 ± 0.005
HisGlu: 1.351 ± 0.007
HisPhe: 1.131 ± 0.006
HisGly: 1.462 ± 0.008
HisHis: 0.892 ± 0.006
HisIle: 1.422 ± 0.007
HisLys: 1.365 ± 0.007
HisLeu: 2.782 ± 0.01
HisMet: 0.586 ± 0.004
HisAsn: 1.07 ± 0.006
HisPro: 1.48 ± 0.008
HisGln: 1.236 ± 0.008
HisArg: 1.516 ± 0.008
HisSer: 2.368 ± 0.014
HisThr: 1.51 ± 0.011
HisVal: 1.545 ± 0.008
HisTrp: 0.36 ± 0.003
HisTyr: 0.899 ± 0.006
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.134 ± 0.01
IleCys: 1.271 ± 0.007
IleAsp: 2.647 ± 0.01
IleGlu: 3.138 ± 0.012
IlePhe: 2.22 ± 0.012
IleGly: 2.721 ± 0.011
IleHis: 1.445 ± 0.008
IleIle: 3.086 ± 0.013
IleLys: 3.093 ± 0.011
IleLeu: 5.164 ± 0.015
IleMet: 1.176 ± 0.006
IleAsn: 2.458 ± 0.009
IlePro: 2.798 ± 0.01
IleGln: 2.582 ± 0.011
IleArg: 2.713 ± 0.01
IleSer: 4.194 ± 0.013
IleThr: 3.07 ± 0.015
IleVal: 3.228 ± 0.013
IleTrp: 0.587 ± 0.004
IleTyr: 1.649 ± 0.009
IleXaa: 0.001 ± 0.0
Lys
LysAla: 3.887 ± 0.017
LysCys: 1.246 ± 0.009
LysAsp: 3.421 ± 0.012
LysGlu: 5.109 ± 0.019
LysPhe: 1.931 ± 0.009
LysGly: 3.246 ± 0.015
LysHis: 1.556 ± 0.008
LysIle: 3.231 ± 0.012
LysLys: 4.602 ± 0.021
LysLeu: 5.894 ± 0.018
LysMet: 1.579 ± 0.008
LysAsn: 2.651 ± 0.012
LysPro: 2.824 ± 0.012
LysGln: 2.902 ± 0.012
LysArg: 3.455 ± 0.013
LysSer: 4.096 ± 0.017
LysThr: 3.228 ± 0.012
LysVal: 3.808 ± 0.013
LysTrp: 0.689 ± 0.005
LysTyr: 1.841 ± 0.01
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.011 ± 0.018
LeuCys: 2.29 ± 0.012
LeuAsp: 4.948 ± 0.014
LeuGlu: 6.805 ± 0.022
LeuPhe: 3.787 ± 0.015
LeuGly: 5.07 ± 0.016
LeuHis: 2.785 ± 0.011
LeuIle: 4.568 ± 0.015
LeuLys: 6.289 ± 0.017
LeuLeu: 10.091 ± 0.032
LeuMet: 2.2 ± 0.01
LeuAsn: 4.185 ± 0.012
LeuPro: 5.031 ± 0.015
LeuGln: 5.634 ± 0.02
LeuArg: 5.321 ± 0.017
LeuSer: 8.018 ± 0.025
LeuThr: 5.371 ± 0.015
LeuVal: 5.522 ± 0.016
LeuTrp: 1.112 ± 0.008
LeuTyr: 2.873 ± 0.011
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.713 ± 0.008
MetCys: 0.507 ± 0.004
MetAsp: 1.38 ± 0.006
MetGlu: 1.907 ± 0.008
MetPhe: 0.926 ± 0.006
MetGly: 1.304 ± 0.008
MetHis: 0.526 ± 0.004
MetIle: 1.078 ± 0.007
MetLys: 1.612 ± 0.008
MetLeu: 2.159 ± 0.01
MetMet: 0.639 ± 0.005
MetAsn: 1.08 ± 0.007
MetPro: 1.032 ± 0.01
MetGln: 1.047 ± 0.007
MetArg: 1.109 ± 0.006
MetSer: 1.676 ± 0.007
MetThr: 1.222 ± 0.006
MetVal: 1.533 ± 0.008
MetTrp: 0.269 ± 0.003
MetTyr: 0.713 ± 0.005
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.396 ± 0.009
AsnCys: 1.047 ± 0.009
AsnAsp: 1.934 ± 0.009
AsnGlu: 2.646 ± 0.011
AsnPhe: 1.687 ± 0.008
AsnGly: 2.84 ± 0.014
AsnHis: 1.065 ± 0.006
AsnIle: 2.698 ± 0.01
AsnLys: 2.65 ± 0.011
AsnLeu: 4.276 ± 0.013
AsnMet: 1.064 ± 0.006
AsnAsn: 2.042 ± 0.01
AsnPro: 2.373 ± 0.012
AsnGln: 1.883 ± 0.009
AsnArg: 2.161 ± 0.009
AsnSer: 3.5 ± 0.013
AsnThr: 2.372 ± 0.009
AsnVal: 2.741 ± 0.01
AsnTrp: 0.523 ± 0.005
AsnTyr: 1.358 ± 0.007
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.235 ± 0.013
ProCys: 0.979 ± 0.008
ProAsp: 2.621 ± 0.012
ProGlu: 3.556 ± 0.013
ProPhe: 1.845 ± 0.009
ProGly: 3.529 ± 0.027
ProHis: 1.334 ± 0.009
ProIle: 2.168 ± 0.009
ProLys: 2.545 ± 0.01
ProLeu: 4.408 ± 0.015
ProMet: 0.947 ± 0.006
ProAsn: 2.011 ± 0.009
ProPro: 4.003 ± 0.026
ProGln: 2.23 ± 0.011
ProArg: 2.404 ± 0.011
ProSer: 4.611 ± 0.016
ProThr: 2.861 ± 0.011
ProVal: 3.396 ± 0.014
ProTrp: 0.536 ± 0.004
ProTyr: 1.366 ± 0.008
ProXaa: 0.002 ± 0.0
Gln
GlnAla: 3.017 ± 0.013
GlnCys: 1.045 ± 0.008
GlnAsp: 2.237 ± 0.009
GlnGlu: 3.49 ± 0.014
GlnPhe: 1.57 ± 0.007
GlnGly: 2.457 ± 0.011
GlnHis: 1.322 ± 0.007
GlnIle: 2.4 ± 0.008
GlnLys: 2.867 ± 0.013
GlnLeu: 4.729 ± 0.017
GlnMet: 1.21 ± 0.007
GlnAsn: 2.045 ± 0.009
GlnPro: 2.297 ± 0.013
GlnGln: 3.135 ± 0.021
GlnArg: 2.729 ± 0.011
GlnSer: 3.299 ± 0.013
GlnThr: 2.531 ± 0.01
GlnVal: 2.809 ± 0.01
GlnTrp: 0.582 ± 0.005
GlnTyr: 1.364 ± 0.007
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.108 ± 0.012
ArgCys: 1.205 ± 0.009
ArgAsp: 2.725 ± 0.011
ArgGlu: 3.739 ± 0.016
ArgPhe: 1.987 ± 0.009
ArgGly: 2.963 ± 0.012
ArgHis: 1.427 ± 0.008
ArgIle: 2.744 ± 0.01
ArgLys: 3.695 ± 0.013
ArgLeu: 5.113 ± 0.014
ArgMet: 1.272 ± 0.006
ArgAsn: 2.388 ± 0.009
ArgPro: 2.421 ± 0.011
ArgGln: 2.514 ± 0.011
ArgArg: 3.594 ± 0.015
ArgSer: 3.998 ± 0.014
ArgThr: 2.799 ± 0.009
ArgVal: 3.087 ± 0.012
ArgTrp: 0.661 ± 0.006
ArgTyr: 1.596 ± 0.007
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 4.918 ± 0.016
SerCys: 1.77 ± 0.01
SerAsp: 4.063 ± 0.016
SerGlu: 4.965 ± 0.018
SerPhe: 3.093 ± 0.012
SerGly: 5.028 ± 0.016
SerHis: 2.13 ± 0.016
SerIle: 3.862 ± 0.014
SerLys: 4.297 ± 0.015
SerLeu: 8.046 ± 0.025
SerMet: 1.682 ± 0.007
SerAsn: 3.29 ± 0.011
SerPro: 4.931 ± 0.02
SerGln: 3.554 ± 0.013
SerArg: 4.034 ± 0.015
SerSer: 8.136 ± 0.032
SerThr: 4.589 ± 0.016
SerVal: 5.16 ± 0.013
SerTrp: 0.92 ± 0.006
SerTyr: 2.132 ± 0.01
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.784 ± 0.011
ThrCys: 1.361 ± 0.012
ThrAsp: 2.884 ± 0.011
ThrGlu: 3.79 ± 0.014
ThrPhe: 2.285 ± 0.009
ThrGly: 3.574 ± 0.013
ThrHis: 1.38 ± 0.011
ThrIle: 2.919 ± 0.011
ThrLys: 2.928 ± 0.01
ThrLeu: 5.435 ± 0.017
ThrMet: 1.195 ± 0.007
ThrAsn: 2.152 ± 0.009
ThrPro: 3.121 ± 0.013
ThrGln: 2.267 ± 0.01
ThrArg: 2.445 ± 0.01
ThrSer: 4.638 ± 0.016
ThrThr: 3.088 ± 0.016
ThrVal: 4.152 ± 0.016
ThrTrp: 0.676 ± 0.006
ThrTyr: 1.546 ± 0.008
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.99 ± 0.014
ValCys: 1.693 ± 0.011
ValAsp: 3.173 ± 0.011
ValGlu: 4.051 ± 0.011
ValPhe: 2.578 ± 0.01
ValGly: 3.415 ± 0.011
ValHis: 1.568 ± 0.007
ValIle: 3.354 ± 0.012
ValLys: 3.882 ± 0.012
ValLeu: 6.161 ± 0.018
ValMet: 1.473 ± 0.008
ValAsn: 2.821 ± 0.011
ValPro: 3.095 ± 0.012
ValGln: 2.937 ± 0.01
ValArg: 3.208 ± 0.011
ValSer: 5.099 ± 0.015
ValThr: 3.987 ± 0.015
ValVal: 4.079 ± 0.015
ValTrp: 0.775 ± 0.005
ValTyr: 1.896 ± 0.009
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.66 ± 0.005
TrpCys: 0.273 ± 0.003
TrpAsp: 0.663 ± 0.005
TrpGlu: 0.78 ± 0.005
TrpPhe: 0.476 ± 0.004
TrpGly: 0.656 ± 0.006
TrpHis: 0.285 ± 0.003
TrpIle: 0.667 ± 0.005
TrpLys: 0.846 ± 0.005
TrpLeu: 1.225 ± 0.008
TrpMet: 0.333 ± 0.004
TrpAsn: 0.625 ± 0.005
TrpPro: 0.437 ± 0.004
TrpGln: 0.558 ± 0.005
TrpArg: 0.667 ± 0.006
TrpSer: 0.899 ± 0.006
TrpThr: 0.708 ± 0.006
TrpVal: 0.683 ± 0.006
TrpTrp: 0.211 ± 0.003
TrpTyr: 0.372 ± 0.004
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.439 ± 0.007
TyrCys: 0.782 ± 0.006
TyrAsp: 1.468 ± 0.008
TyrGlu: 1.768 ± 0.009
TyrPhe: 1.418 ± 0.007
TyrGly: 1.711 ± 0.007
TyrHis: 0.82 ± 0.005
TyrIle: 1.753 ± 0.008
TyrLys: 1.725 ± 0.011
TyrLeu: 2.959 ± 0.012
TyrMet: 0.707 ± 0.005
TyrAsn: 1.412 ± 0.007
TyrPro: 1.312 ± 0.009
TyrGln: 1.324 ± 0.006
TyrArg: 1.694 ± 0.009
TyrSer: 2.416 ± 0.01
TyrThr: 1.747 ± 0.009
TyrVal: 1.703 ± 0.007
TyrTrp: 0.436 ± 0.006
TyrTyr: 1.104 ± 0.008
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.002 ± 0.0
XaaMet: 0.001 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.002 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.402 ± 0.044
Statistics based on 49245 proteins (31969766 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
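For readers unfamiliar with the procedure, a protein-level bootstrap can be sketched in Python roughly as follows. This is an assumption-laden illustration, not the database's actual code: it assumes whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported ± value is the standard deviation of those estimates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def pair_stats(seq, dipeptide):
    """Return (occurrences of dipeptide, number of adjacent pairs) for one protein."""
    pairs = [a + b for a, b in zip(seq, seq[1:])]
    return pairs.count(dipeptide), len(pairs)


def bootstrap_per_mille(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate a dipeptide frequency (per mille) and its bootstrap error.

    Assumption: the resampling unit is the whole protein, so residues from
    the same protein always stay together in a resample.
    """
    rng = random.Random(seed)
    stats = [pair_stats(seq, dipeptide) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLSALL", "MLLKAA", "MAAALLK", "MKKSLL"]
    mean, err = bootstrap_per_mille(toy_proteome, "LL")
    print(f"LeuLeu: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues in the same sequence, which is presumably why the error is described as being estimated at the protein level.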

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski