Amino acid dipeptide frequency for Monodon monoceros (Narwhal) (Ceratodon monodon)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
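As an illustration, dipeptide frequencies can be computed by sliding a two-residue window along each protein. The sketch below is a minimal example, not the database's own code: it assumes overlapping pairs are counted within each protein and normalised by the total number of pairs, and the function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs.

    Assumption: pairs overlap (residues i and i+1) and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent pair inside one protein.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter codes (not real narwhal data).
toy_proteome = ["MAAL", "ALA"]
freqs = dipeptide_frequencies(toy_proteome)
```

Here the five pairs are MA, AA, AL, AL and LA, so `freqs["AL"]` is 400‰ and the frequencies sum to 1000‰.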

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally common, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
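The uniform baseline can be checked with a few lines of arithmetic. The sketch below assumes that "expected" means the flat 1/400 (or 1/441) frequency described above; the exact threshold the page uses for its red and blue colouring is not stated, so the `classify` helper is a hypothetical illustration. The observed values are taken from the table.

```python
STANDARD_PAIRS = 20 * 20   # 400 dipeptides from the 20 standard amino acids
PAIRS_WITH_XAA = 21 * 21   # 441 when Xaa is counted as a 21st amino acid


def expected_per_mille(n_pairs=STANDARD_PAIRS):
    """Uniform expectation in per mille: every dipeptide equally likely."""
    return 1000.0 / n_pairs


def classify(observed_per_mille, n_pairs=STANDARD_PAIRS):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(n_pairs)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed frequencies (per mille) copied from the table.
observed = {"LeuLeu": 10.866, "AlaAla": 7.973, "TrpTrp": 0.192}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With the 400-pair baseline the expectation is 2.5‰, so LeuLeu and AlaAla come out as overrepresented and TrpTrp as underrepresented. With 441 pairs the baseline drops to about 2.27‰.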

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
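For readers who prefer fractions or percentages, the conversion is a simple rescaling. The helper names below are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 7.973  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```

In other words, 7.973‰ corresponds to a fraction of about 0.007973, or roughly 0.8% of all dipeptides.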

For more information, see the sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.973 ± 0.065 | 1.505 ± 0.013 | 2.94 ± 0.019 | 4.943 ± 0.035 | 2.72 ± 0.022 | 5.611 ± 0.037 | 1.698 ± 0.018 | 2.587 ± 0.018 | 3.366 ± 0.023 | 7.69 ± 0.043 | 1.486 ± 0.014 | 1.921 ± 0.014 | 4.954 ± 0.043 | 3.375 ± 0.024 | 4.348 ± 0.031 | 6.006 ± 0.031 | 3.63 ± 0.028 | 4.943 ± 0.03 | 0.908 ± 0.012 | 1.51 ± 0.013 | 0.003 ± 0.001 |
| Cys | 1.357 ± 0.012 | 0.648 ± 0.011 | 1.008 ± 0.013 | 1.265 ± 0.014 | 0.863 ± 0.01 | 1.931 ± 0.023 | 0.699 ± 0.009 | 0.878 ± 0.011 | 1.126 ± 0.016 | 2.22 ± 0.019 | 0.381 ± 0.007 | 0.793 ± 0.011 | 1.541 ± 0.017 | 1.062 ± 0.012 | 1.407 ± 0.014 | 2.047 ± 0.017 | 1.103 ± 0.012 | 1.337 ± 0.013 | 0.319 ± 0.006 | 0.566 ± 0.008 | 0.001 ± 0.0 |
| Asp | 2.931 ± 0.02 | 1.054 ± 0.012 | 2.451 ± 0.02 | 3.289 ± 0.026 | 2.034 ± 0.017 | 3.311 ± 0.022 | 1.115 ± 0.01 | 2.377 ± 0.022 | 2.342 ± 0.019 | 4.863 ± 0.03 | 1.019 ± 0.011 | 1.51 ± 0.016 | 3.008 ± 0.02 | 1.756 ± 0.015 | 2.44 ± 0.019 | 3.984 ± 0.025 | 2.371 ± 0.015 | 3.003 ± 0.033 | 0.633 ± 0.01 | 1.341 ± 0.013 | 0.002 ± 0.0 |
| Glu | 5.535 ± 0.035 | 1.514 ± 0.027 | 4.235 ± 0.026 | 7.595 ± 0.054 | 1.952 ± 0.015 | 4.418 ± 0.029 | 1.48 ± 0.012 | 2.885 ± 0.023 | 5.21 ± 0.04 | 6.323 ± 0.034 | 1.578 ± 0.016 | 2.846 ± 0.024 | 3.432 ± 0.025 | 3.017 ± 0.016 | 4.129 ± 0.029 | 4.221 ± 0.027 | 3.307 ± 0.022 | 4.114 ± 0.024 | 0.683 ± 0.009 | 1.449 ± 0.014 | 0.003 ± 0.001 |
| Phe | 2.0 ± 0.016 | 0.933 ± 0.011 | 1.576 ± 0.014 | 1.924 ± 0.016 | 1.517 ± 0.016 | 2.161 ± 0.019 | 1.054 ± 0.013 | 1.717 ± 0.017 | 1.699 ± 0.017 | 4.066 ± 0.025 | 0.705 ± 0.009 | 1.255 ± 0.013 | 2.094 ± 0.016 | 1.785 ± 0.016 | 2.017 ± 0.018 | 3.459 ± 0.022 | 1.991 ± 0.017 | 2.099 ± 0.018 | 0.493 ± 0.008 | 1.126 ± 0.012 | 0.002 ± 0.0 |
| Gly | 5.362 ± 0.039 | 1.424 ± 0.013 | 3.192 ± 0.021 | 4.434 ± 0.03 | 2.418 ± 0.019 | 5.939 ± 0.057 | 1.829 ± 0.016 | 2.631 ± 0.02 | 3.944 ± 0.028 | 6.304 ± 0.033 | 1.246 ± 0.012 | 2.264 ± 0.016 | 4.941 ± 0.053 | 2.884 ± 0.024 | 4.49 ± 0.031 | 6.017 ± 0.04 | 3.683 ± 0.03 | 3.719 ± 0.024 | 0.899 ± 0.012 | 1.655 ± 0.019 | 0.003 ± 0.001 |
| His | 1.405 ± 0.014 | 0.736 ± 0.01 | 0.866 ± 0.01 | 1.277 ± 0.015 | 1.096 ± 0.012 | 1.643 ± 0.013 | 0.919 ± 0.016 | 1.165 ± 0.012 | 1.228 ± 0.01 | 3.042 ± 0.018 | 0.568 ± 0.009 | 0.811 ± 0.01 | 1.766 ± 0.016 | 1.445 ± 0.02 | 1.71 ± 0.014 | 2.269 ± 0.018 | 1.611 ± 0.018 | 1.567 ± 0.013 | 0.371 ± 0.006 | 0.77 ± 0.01 | 0.002 ± 0.0 |
| Ile | 2.371 ± 0.019 | 1.019 ± 0.011 | 1.828 ± 0.019 | 2.318 ± 0.02 | 1.762 ± 0.017 | 2.054 ± 0.019 | 1.385 ± 0.018 | 2.131 ± 0.023 | 2.395 ± 0.021 | 4.279 ± 0.029 | 0.901 ± 0.012 | 1.635 ± 0.016 | 2.474 ± 0.019 | 2.13 ± 0.016 | 2.28 ± 0.017 | 3.478 ± 0.023 | 2.407 ± 0.02 | 2.298 ± 0.02 | 0.506 ± 0.007 | 1.288 ± 0.014 | 0.001 ± 0.0 |
| Lys | 4.062 ± 0.026 | 1.145 ± 0.011 | 2.924 ± 0.022 | 4.84 ± 0.037 | 1.653 ± 0.016 | 3.256 ± 0.03 | 1.383 ± 0.013 | 2.556 ± 0.022 | 4.422 ± 0.037 | 4.916 ± 0.028 | 1.388 ± 0.015 | 2.191 ± 0.019 | 3.134 ± 0.022 | 2.469 ± 0.019 | 3.265 ± 0.022 | 3.721 ± 0.029 | 2.982 ± 0.023 | 3.335 ± 0.025 | 0.579 ± 0.009 | 1.489 ± 0.015 | 0.004 ± 0.001 |
| Leu | 7.175 ± 0.034 | 2.302 ± 0.016 | 4.627 ± 0.025 | 7.013 ± 0.04 | 3.355 ± 0.026 | 6.414 ± 0.034 | 2.745 ± 0.017 | 3.71 ± 0.029 | 5.534 ± 0.035 | 10.866 ± 0.057 | 1.877 ± 0.016 | 3.29 ± 0.024 | 6.41 ± 0.037 | 5.685 ± 0.037 | 6.328 ± 0.032 | 8.002 ± 0.036 | 5.051 ± 0.028 | 5.578 ± 0.028 | 1.222 ± 0.013 | 2.459 ± 0.021 | 0.004 ± 0.001 |
| Met | 1.836 ± 0.013 | 0.386 ± 0.006 | 1.15 ± 0.011 | 1.724 ± 0.015 | 0.679 ± 0.009 | 1.276 ± 0.015 | 0.435 ± 0.008 | 0.765 ± 0.01 | 1.329 ± 0.015 | 1.845 ± 0.016 | 0.506 ± 0.008 | 0.806 ± 0.01 | 1.047 ± 0.016 | 0.864 ± 0.01 | 1.031 ± 0.011 | 1.426 ± 0.013 | 1.043 ± 0.01 | 1.316 ± 0.013 | 0.234 ± 0.006 | 0.506 ± 0.007 | 0.001 ± 0.0 |
| Asn | 1.926 ± 0.016 | 0.801 ± 0.01 | 1.394 ± 0.015 | 2.015 ± 0.023 | 1.393 ± 0.013 | 2.242 ± 0.021 | 0.882 ± 0.01 | 1.908 ± 0.017 | 2.002 ± 0.019 | 3.502 ± 0.025 | 0.824 ± 0.009 | 1.366 ± 0.014 | 2.09 ± 0.015 | 1.528 ± 0.013 | 1.764 ± 0.014 | 2.841 ± 0.022 | 1.787 ± 0.017 | 2.095 ± 0.017 | 0.427 ± 0.006 | 1.02 ± 0.011 | 0.002 ± 0.0 |
| Pro | 5.67 ± 0.042 | 1.264 ± 0.016 | 2.861 ± 0.027 | 4.678 ± 0.033 | 1.971 ± 0.015 | 5.952 ± 0.057 | 1.539 ± 0.016 | 1.778 ± 0.016 | 2.742 ± 0.033 | 5.625 ± 0.034 | 0.978 ± 0.011 | 1.728 ± 0.013 | 6.622 ± 0.074 | 2.923 ± 0.023 | 4.066 ± 0.028 | 5.902 ± 0.04 | 3.139 ± 0.028 | 3.864 ± 0.027 | 0.799 ± 0.01 | 1.569 ± 0.018 | 0.003 ± 0.001 |
| Gln | 3.656 ± 0.023 | 0.943 ± 0.013 | 2.281 ± 0.016 | 3.78 ± 0.028 | 1.294 ± 0.012 | 2.984 ± 0.023 | 1.305 ± 0.014 | 1.846 ± 0.014 | 2.847 ± 0.021 | 4.705 ± 0.028 | 1.013 ± 0.011 | 1.712 ± 0.015 | 2.862 ± 0.023 | 2.805 ± 0.033 | 3.148 ± 0.025 | 3.021 ± 0.02 | 2.256 ± 0.017 | 2.813 ± 0.019 | 0.537 ± 0.008 | 1.064 ± 0.012 | 0.002 ± 0.0 |
| Arg | 4.708 ± 0.031 | 1.342 ± 0.015 | 2.812 ± 0.02 | 4.173 ± 0.029 | 1.864 ± 0.017 | 4.383 ± 0.033 | 1.644 ± 0.016 | 2.368 ± 0.017 | 3.697 ± 0.023 | 5.764 ± 0.03 | 1.14 ± 0.012 | 1.988 ± 0.016 | 3.833 ± 0.028 | 2.699 ± 0.021 | 4.988 ± 0.037 | 4.421 ± 0.028 | 2.921 ± 0.02 | 3.394 ± 0.02 | 0.765 ± 0.009 | 1.415 ± 0.012 | 0.003 ± 0.0 |
| Ser | 5.532 ± 0.03 | 1.869 ± 0.017 | 3.671 ± 0.023 | 5.05 ± 0.032 | 3.036 ± 0.023 | 5.929 ± 0.037 | 2.133 ± 0.014 | 2.994 ± 0.02 | 3.859 ± 0.026 | 8.214 ± 0.039 | 1.431 ± 0.013 | 2.448 ± 0.018 | 6.195 ± 0.048 | 3.764 ± 0.025 | 4.739 ± 0.029 | 8.851 ± 0.064 | 4.234 ± 0.033 | 4.825 ± 0.027 | 1.099 ± 0.011 | 1.969 ± 0.018 | 0.002 ± 0.0 |
| Thr | 3.895 ± 0.027 | 1.287 ± 0.016 | 2.335 ± 0.019 | 3.378 ± 0.021 | 2.077 ± 0.016 | 3.722 ± 0.029 | 1.352 ± 0.014 | 2.188 ± 0.021 | 2.52 ± 0.02 | 5.261 ± 0.027 | 1.079 ± 0.013 | 1.594 ± 0.015 | 3.569 ± 0.028 | 2.244 ± 0.018 | 2.59 ± 0.018 | 4.383 ± 0.031 | 2.792 ± 0.028 | 3.746 ± 0.028 | 0.72 ± 0.008 | 1.367 ± 0.012 | 0.002 ± 0.001 |
| Val | 4.457 ± 0.025 | 1.486 ± 0.015 | 2.839 ± 0.026 | 3.764 ± 0.023 | 2.365 ± 0.018 | 3.579 ± 0.023 | 1.618 ± 0.015 | 2.699 ± 0.022 | 3.226 ± 0.024 | 6.331 ± 0.034 | 1.247 ± 0.013 | 2.125 ± 0.016 | 3.792 ± 0.025 | 2.719 ± 0.018 | 3.186 ± 0.018 | 4.84 ± 0.026 | 3.735 ± 0.029 | 3.952 ± 0.024 | 0.741 ± 0.012 | 1.561 ± 0.015 | 0.003 ± 0.001 |
| Trp | 0.906 ± 0.01 | 0.248 ± 0.006 | 0.661 ± 0.009 | 0.83 ± 0.011 | 0.443 ± 0.008 | 0.882 ± 0.013 | 0.32 ± 0.006 | 0.508 ± 0.009 | 0.807 ± 0.011 | 1.254 ± 0.014 | 0.299 ± 0.006 | 0.518 ± 0.008 | 0.612 ± 0.009 | 0.532 ± 0.007 | 0.834 ± 0.009 | 0.881 ± 0.011 | 0.676 ± 0.009 | 0.746 ± 0.009 | 0.192 ± 0.005 | 0.32 ± 0.006 | 0.001 ± 0.0 |
| Tyr | 1.36 ± 0.012 | 0.646 ± 0.008 | 1.181 ± 0.014 | 1.631 ± 0.014 | 1.178 ± 0.013 | 1.591 ± 0.016 | 0.732 ± 0.009 | 1.245 ± 0.012 | 1.418 ± 0.015 | 2.576 ± 0.019 | 0.538 ± 0.008 | 1.006 ± 0.013 | 1.294 ± 0.014 | 1.186 ± 0.013 | 1.555 ± 0.014 | 2.071 ± 0.017 | 1.379 ± 0.014 | 1.518 ± 0.012 | 0.342 ± 0.007 | 0.892 ± 0.013 | 0.001 ± 0.0 |
| Xaa | 0.003 ± 0.001 | 0.002 ± 0.0 | 0.002 ± 0.0 | 0.002 ± 0.0 | 0.002 ± 0.0 | 0.004 ± 0.001 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.003 ± 0.001 | 0.003 ± 0.001 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.003 ± 0.001 | 0.002 ± 0.0 | 0.004 ± 0.001 | 0.003 ± 0.001 | 0.002 ± 0.0 | 0.003 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.102 ± 0.016 |

Statistics based on 20731 proteins (9821986 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
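A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is a minimal illustration under stated assumptions, not the database's own code. In particular, reporting the standard deviation of the resampled estimates as the ± value is an assumption, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count adjacent residue pairs within a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the variation between
    proteins is what drives the error estimate.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter codes (not real narwhal data).
toy_proteome = ["MAAL", "ALA", "LLLL", "MKV"]
mean_al, err_al = bootstrap_error(toy_proteome, "AL")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, which matters because residues within a protein are not independent.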

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski