Amino acid dipepetide frequency for Caenorhabditis briggsae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.229AlaAla: 5.229 ± 0.058
1.086AlaCys: 1.086 ± 0.013
3.091AlaAsp: 3.091 ± 0.024
4.181AlaGlu: 4.181 ± 0.038
2.667AlaPhe: 2.667 ± 0.021
3.317AlaGly: 3.317 ± 0.039
1.384AlaHis: 1.384 ± 0.015
3.722AlaIle: 3.722 ± 0.029
3.784AlaLys: 3.784 ± 0.033
5.169AlaLeu: 5.169 ± 0.03
1.642AlaMet: 1.642 ± 0.015
2.684AlaAsn: 2.684 ± 0.022
3.723AlaPro: 3.723 ± 0.048
2.539AlaGln: 2.539 ± 0.028
3.08AlaArg: 3.08 ± 0.021
4.693AlaSer: 4.693 ± 0.036
3.77AlaThr: 3.77 ± 0.034
4.135AlaVal: 4.135 ± 0.033
0.577AlaTrp: 0.577 ± 0.008
1.741AlaTyr: 1.741 ± 0.019
0.001AlaXaa: 0.001 ± 0.0
Cys
1.185CysAla: 1.185 ± 0.016
0.548CysCys: 0.548 ± 0.015
1.066CysAsp: 1.066 ± 0.019
1.189CysGlu: 1.189 ± 0.019
0.969CysPhe: 0.969 ± 0.012
1.268CysGly: 1.268 ± 0.018
0.485CysHis: 0.485 ± 0.009
1.106CysIle: 1.106 ± 0.014
1.012CysLys: 1.012 ± 0.013
1.713CysLeu: 1.713 ± 0.018
0.462CysMet: 0.462 ± 0.009
0.85CysAsn: 0.85 ± 0.014
1.071CysPro: 1.071 ± 0.02
0.889CysGln: 0.889 ± 0.017
1.073CysArg: 1.073 ± 0.015
1.633CysSer: 1.633 ± 0.019
1.032CysThr: 1.032 ± 0.013
1.216CysVal: 1.216 ± 0.014
0.221CysTrp: 0.221 ± 0.005
0.641CysTyr: 0.641 ± 0.01
0.001CysXaa: 0.001 ± 0.0
Asp
3.195AspAla: 3.195 ± 0.025
0.968AspCys: 0.968 ± 0.016
3.927AspAsp: 3.927 ± 0.034
4.529AspGlu: 4.529 ± 0.039
2.573AspPhe: 2.573 ± 0.019
3.383AspGly: 3.383 ± 0.064
1.157AspHis: 1.157 ± 0.013
2.883AspIle: 2.883 ± 0.018
2.806AspLys: 2.806 ± 0.021
4.354AspLeu: 4.354 ± 0.027
1.29AspMet: 1.29 ± 0.011
2.137AspAsn: 2.137 ± 0.018
2.468AspPro: 2.468 ± 0.022
1.978AspGln: 1.978 ± 0.016
2.569AspArg: 2.569 ± 0.023
4.041AspSer: 4.041 ± 0.033
2.448AspThr: 2.448 ± 0.019
3.712AspVal: 3.712 ± 0.026
0.647AspTrp: 0.647 ± 0.01
1.818AspTyr: 1.818 ± 0.015
0.001AspXaa: 0.001 ± 0.0
Glu
4.151GluAla: 4.151 ± 0.051
1.118GluCys: 1.118 ± 0.015
4.193GluAsp: 4.193 ± 0.033
6.765GluGlu: 6.765 ± 0.065
2.678GluPhe: 2.678 ± 0.021
2.737GluGly: 2.737 ± 0.024
1.448GluHis: 1.448 ± 0.016
4.126GluIle: 4.126 ± 0.024
6.101GluLys: 6.101 ± 0.039
5.267GluLeu: 5.267 ± 0.036
2.082GluMet: 2.082 ± 0.018
3.896GluAsn: 3.896 ± 0.035
2.627GluPro: 2.627 ± 0.035
2.765GluGln: 2.765 ± 0.025
3.488GluArg: 3.488 ± 0.028
4.814GluSer: 4.814 ± 0.05
3.807GluThr: 3.807 ± 0.039
3.712GluVal: 3.712 ± 0.033
0.728GluTrp: 0.728 ± 0.011
2.01GluTyr: 2.01 ± 0.019
0.001GluXaa: 0.001 ± 0.0
Phe
2.726PheAla: 2.726 ± 0.022
1.074PheCys: 1.074 ± 0.013
2.622PheAsp: 2.622 ± 0.019
2.919PheGlu: 2.919 ± 0.022
2.669PhePhe: 2.669 ± 0.025
2.946PheGly: 2.946 ± 0.023
1.122PheHis: 1.122 ± 0.012
2.659PheIle: 2.659 ± 0.023
2.375PheLys: 2.375 ± 0.017
4.564PheLeu: 4.564 ± 0.037
1.187PheMet: 1.187 ± 0.013
2.07PheAsn: 2.07 ± 0.017
2.019PhePro: 2.019 ± 0.017
2.002PheGln: 2.002 ± 0.021
2.346PheArg: 2.346 ± 0.021
3.775PheSer: 3.775 ± 0.025
2.288PheThr: 2.288 ± 0.02
3.129PheVal: 3.129 ± 0.021
0.633PheTrp: 0.633 ± 0.01
1.689PheTyr: 1.689 ± 0.016
0.001PheXaa: 0.001 ± 0.0
Gly
3.427GlyAla: 3.427 ± 0.036
1.079GlyCys: 1.079 ± 0.015
2.695GlyAsp: 2.695 ± 0.021
3.113GlyGlu: 3.113 ± 0.031
2.683GlyPhe: 2.683 ± 0.02
4.161GlyGly: 4.161 ± 0.067
1.239GlyHis: 1.239 ± 0.017
3.238GlyIle: 3.238 ± 0.03
3.327GlyLys: 3.327 ± 0.03
3.902GlyLeu: 3.902 ± 0.031
1.334GlyMet: 1.334 ± 0.015
2.673GlyAsn: 2.673 ± 0.029
2.495GlyPro: 2.495 ± 0.046
2.04GlyGln: 2.04 ± 0.027
2.745GlyArg: 2.745 ± 0.026
4.316GlySer: 4.316 ± 0.034
3.094GlyThr: 3.094 ± 0.029
3.206GlyVal: 3.206 ± 0.022
0.654GlyTrp: 0.654 ± 0.009
1.996GlyTyr: 1.996 ± 0.023
0.002GlyXaa: 0.002 ± 0.001
His
1.175HisAla: 1.175 ± 0.011
0.482HisCys: 0.482 ± 0.008
1.058HisAsp: 1.058 ± 0.013
1.284HisGlu: 1.284 ± 0.013
1.274HisPhe: 1.274 ± 0.015
1.304HisGly: 1.304 ± 0.018
0.918HisHis: 0.918 ± 0.023
1.285HisIle: 1.285 ± 0.014
1.122HisLys: 1.122 ± 0.012
2.192HisLeu: 2.192 ± 0.021
0.589HisMet: 0.589 ± 0.009
0.98HisAsn: 0.98 ± 0.013
1.233HisPro: 1.233 ± 0.014
1.121HisGln: 1.121 ± 0.014
1.42HisArg: 1.42 ± 0.017
1.711HisSer: 1.711 ± 0.014
1.076HisThr: 1.076 ± 0.013
1.445HisVal: 1.445 ± 0.017
0.289HisTrp: 0.289 ± 0.005
0.802HisTyr: 0.802 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
3.756IleAla: 3.756 ± 0.026
1.29IleCys: 1.29 ± 0.013
3.319IleAsp: 3.319 ± 0.021
3.808IleGlu: 3.808 ± 0.023
3.167IlePhe: 3.167 ± 0.028
3.325IleGly: 3.325 ± 0.027
1.422IleHis: 1.422 ± 0.015
3.454IleIle: 3.454 ± 0.027
3.014IleLys: 3.014 ± 0.023
5.563IleLeu: 5.563 ± 0.033
1.351IleMet: 1.351 ± 0.014
2.478IleAsn: 2.478 ± 0.021
3.231IlePro: 3.231 ± 0.027
2.538IleGln: 2.538 ± 0.022
3.271IleArg: 3.271 ± 0.024
5.113IleSer: 5.113 ± 0.047
3.117IleThr: 3.117 ± 0.024
3.89IleVal: 3.89 ± 0.023
0.692IleTrp: 0.692 ± 0.01
1.903IleTyr: 1.903 ± 0.019
0.002IleXaa: 0.002 ± 0.0
Lys
3.497LysAla: 3.497 ± 0.028
1.33LysCys: 1.33 ± 0.016
3.116LysAsp: 3.116 ± 0.026
4.709LysGlu: 4.709 ± 0.039
2.676LysPhe: 2.676 ± 0.02
2.448LysGly: 2.448 ± 0.021
1.302LysHis: 1.302 ± 0.013
4.071LysIle: 4.071 ± 0.028
6.265LysLys: 6.265 ± 0.056
5.4LysLeu: 5.4 ± 0.029
2.029LysMet: 2.029 ± 0.016
3.613LysAsn: 3.613 ± 0.025
2.799LysPro: 2.799 ± 0.033
2.376LysGln: 2.376 ± 0.02
3.755LysArg: 3.755 ± 0.03
4.73LysSer: 4.73 ± 0.024
3.918LysThr: 3.918 ± 0.028
3.473LysVal: 3.473 ± 0.023
0.792LysTrp: 0.792 ± 0.009
2.017LysTyr: 2.017 ± 0.018
0.002LysXaa: 0.002 ± 0.0
Leu
5.593LeuAla: 5.593 ± 0.033
1.611LeuCys: 1.611 ± 0.016
4.393LeuAsp: 4.393 ± 0.035
5.786LeuGlu: 5.786 ± 0.036
4.141LeuPhe: 4.141 ± 0.032
3.969LeuGly: 3.969 ± 0.031
1.973LeuHis: 1.973 ± 0.019
5.157LeuIle: 5.157 ± 0.036
5.728LeuLys: 5.728 ± 0.038
8.087LeuLeu: 8.087 ± 0.053
2.178LeuMet: 2.178 ± 0.017
4.112LeuAsn: 4.112 ± 0.026
4.324LeuPro: 4.324 ± 0.043
3.475LeuGln: 3.475 ± 0.028
4.86LeuArg: 4.86 ± 0.046
6.43LeuSer: 6.43 ± 0.036
4.595LeuThr: 4.595 ± 0.029
5.07LeuVal: 5.07 ± 0.028
0.861LeuTrp: 0.861 ± 0.01
2.413LeuTyr: 2.413 ± 0.019
0.002LeuXaa: 0.002 ± 0.001
Met
1.781MetAla: 1.781 ± 0.016
0.505MetCys: 0.505 ± 0.008
1.497MetAsp: 1.497 ± 0.012
1.906MetGlu: 1.906 ± 0.019
1.203MetPhe: 1.203 ± 0.013
1.231MetGly: 1.231 ± 0.014
0.531MetHis: 0.531 ± 0.009
1.633MetIle: 1.633 ± 0.016
1.866MetLys: 1.866 ± 0.014
2.083MetLeu: 2.083 ± 0.018
0.878MetMet: 0.878 ± 0.011
1.365MetAsn: 1.365 ± 0.013
1.21MetPro: 1.21 ± 0.014
0.967MetGln: 0.967 ± 0.012
1.426MetArg: 1.426 ± 0.013
2.203MetSer: 2.203 ± 0.019
1.614MetThr: 1.614 ± 0.016
1.47MetVal: 1.47 ± 0.014
0.271MetTrp: 0.271 ± 0.006
0.749MetTyr: 0.749 ± 0.011
0.001MetXaa: 0.001 ± 0.0
Asn
2.802AsnAla: 2.802 ± 0.021
1.003AsnCys: 1.003 ± 0.013
2.434AsnAsp: 2.434 ± 0.018
3.125AsnGlu: 3.125 ± 0.023
2.41AsnPhe: 2.41 ± 0.018
3.306AsnGly: 3.306 ± 0.025
1.111AsnHis: 1.111 ± 0.011
2.66AsnIle: 2.66 ± 0.019
2.463AsnLys: 2.463 ± 0.019
4.524AsnLeu: 4.524 ± 0.045
1.245AsnMet: 1.245 ± 0.012
2.412AsnAsn: 2.412 ± 0.022
2.435AsnPro: 2.435 ± 0.025
2.183AsnGln: 2.183 ± 0.025
2.598AsnArg: 2.598 ± 0.021
3.884AsnSer: 3.884 ± 0.027
2.387AsnThr: 2.387 ± 0.018
3.054AsnVal: 3.054 ± 0.022
0.581AsnTrp: 0.581 ± 0.01
1.652AsnTyr: 1.652 ± 0.019
0.001AsnXaa: 0.001 ± 0.0
Pro
3.366ProAla: 3.366 ± 0.036
0.739ProCys: 0.739 ± 0.02
2.416ProAsp: 2.416 ± 0.037
3.478ProGlu: 3.478 ± 0.034
2.0ProPhe: 2.0 ± 0.021
2.88ProGly: 2.88 ± 0.066
1.007ProHis: 1.007 ± 0.014
3.051ProIle: 3.051 ± 0.027
3.082ProLys: 3.082 ± 0.027
3.67ProLeu: 3.67 ± 0.034
1.222ProMet: 1.222 ± 0.014
2.289ProAsn: 2.289 ± 0.018
4.199ProPro: 4.199 ± 0.057
2.062ProGln: 2.062 ± 0.025
2.478ProArg: 2.478 ± 0.033
4.42ProSer: 4.42 ± 0.04
3.423ProThr: 3.423 ± 0.054
3.19ProVal: 3.19 ± 0.044
0.447ProTrp: 0.447 ± 0.008
1.383ProTyr: 1.383 ± 0.016
0.002ProXaa: 0.002 ± 0.0
Gln
2.322GlnAla: 2.322 ± 0.026
0.86GlnCys: 0.86 ± 0.017
1.676GlnAsp: 1.676 ± 0.016
2.565GlnGlu: 2.565 ± 0.025
1.859GlnPhe: 1.859 ± 0.017
1.727GlnGly: 1.727 ± 0.023
0.991GlnHis: 0.991 ± 0.014
2.443GlnIle: 2.443 ± 0.017
3.254GlnLys: 3.254 ± 0.022
3.783GlnLeu: 3.783 ± 0.026
1.377GlnMet: 1.377 ± 0.016
2.734GlnAsn: 2.734 ± 0.047
2.091GlnPro: 2.091 ± 0.029
2.76GlnGln: 2.76 ± 0.063
2.23GlnArg: 2.23 ± 0.02
2.76GlnSer: 2.76 ± 0.021
2.181GlnThr: 2.181 ± 0.018
2.165GlnVal: 2.165 ± 0.019
0.494GlnTrp: 0.494 ± 0.008
1.325GlnTyr: 1.325 ± 0.015
0.001GlnXaa: 0.001 ± 0.0
Arg
2.985ArgAla: 2.985 ± 0.021
0.991ArgCys: 0.991 ± 0.016
2.629ArgAsp: 2.629 ± 0.022
3.409ArgGlu: 3.409 ± 0.026
2.378ArgPhe: 2.378 ± 0.016
2.556ArgGly: 2.556 ± 0.03
1.341ArgHis: 1.341 ± 0.015
3.662ArgIle: 3.662 ± 0.046
4.203ArgLys: 4.203 ± 0.031
4.4ArgLeu: 4.4 ± 0.031
1.444ArgMet: 1.444 ± 0.014
2.897ArgAsn: 2.897 ± 0.022
2.387ArgPro: 2.387 ± 0.021
2.295ArgGln: 2.295 ± 0.018
4.211ArgArg: 4.211 ± 0.038
3.974ArgSer: 3.974 ± 0.027
2.778ArgThr: 2.778 ± 0.021
2.956ArgVal: 2.956 ± 0.021
0.575ArgTrp: 0.575 ± 0.01
1.609ArgTyr: 1.609 ± 0.015
0.001ArgXaa: 0.001 ± 0.0
Ser
4.931SerAla: 4.931 ± 0.034
1.462SerCys: 1.462 ± 0.022
4.192SerAsp: 4.192 ± 0.026
5.494SerGlu: 5.494 ± 0.052
3.491SerPhe: 3.491 ± 0.023
4.486SerGly: 4.486 ± 0.043
1.659SerHis: 1.659 ± 0.017
4.621SerIle: 4.621 ± 0.027
4.608SerLys: 4.608 ± 0.029
6.389SerLeu: 6.389 ± 0.036
1.936SerMet: 1.936 ± 0.014
3.729SerAsn: 3.729 ± 0.027
4.004SerPro: 4.004 ± 0.034
3.374SerGln: 3.374 ± 0.045
4.112SerArg: 4.112 ± 0.033
8.848SerSer: 8.848 ± 0.081
5.549SerThr: 5.549 ± 0.068
4.698SerVal: 4.698 ± 0.028
0.839SerTrp: 0.839 ± 0.013
2.236SerTyr: 2.236 ± 0.019
0.001SerXaa: 0.001 ± 0.0
Thr
3.66ThrAla: 3.66 ± 0.032
1.211ThrCys: 1.211 ± 0.018
2.809ThrAsp: 2.809 ± 0.055
3.454ThrGlu: 3.454 ± 0.039
2.543ThrPhe: 2.543 ± 0.019
2.957ThrGly: 2.957 ± 0.02
1.149ThrHis: 1.149 ± 0.014
3.66ThrIle: 3.66 ± 0.028
3.149ThrLys: 3.149 ± 0.027
4.57ThrLeu: 4.57 ± 0.025
1.424ThrMet: 1.424 ± 0.014
2.551ThrAsn: 2.551 ± 0.019
3.413ThrPro: 3.413 ± 0.033
2.004ThrGln: 2.004 ± 0.02
2.655ThrArg: 2.655 ± 0.021
5.382ThrSer: 5.382 ± 0.057
4.936ThrThr: 4.936 ± 0.082
4.126ThrVal: 4.126 ± 0.027
0.633ThrTrp: 0.633 ± 0.01
1.599ThrTyr: 1.599 ± 0.015
0.001ThrXaa: 0.001 ± 0.0
Val
4.069ValAla: 4.069 ± 0.025
1.28ValCys: 1.28 ± 0.016
3.426ValAsp: 3.426 ± 0.021
4.275ValGlu: 4.275 ± 0.053
3.189ValPhe: 3.189 ± 0.021
2.984ValGly: 2.984 ± 0.025
1.439ValHis: 1.439 ± 0.014
3.732ValIle: 3.732 ± 0.024
3.585ValLys: 3.585 ± 0.026
5.412ValLeu: 5.412 ± 0.033
1.536ValMet: 1.536 ± 0.015
2.662ValAsn: 2.662 ± 0.019
3.225ValPro: 3.225 ± 0.027
2.416ValGln: 2.416 ± 0.022
2.978ValArg: 2.978 ± 0.024
4.704ValSer: 4.704 ± 0.031
3.527ValThr: 3.527 ± 0.028
4.286ValVal: 4.286 ± 0.037
0.68ValTrp: 0.68 ± 0.011
1.932ValTyr: 1.932 ± 0.018
0.001ValXaa: 0.001 ± 0.0
Trp
0.614TrpAla: 0.614 ± 0.009
0.222TrpCys: 0.222 ± 0.006
0.556TrpAsp: 0.556 ± 0.01
0.601TrpGlu: 0.601 ± 0.01
0.552TrpPhe: 0.552 ± 0.01
0.474TrpGly: 0.474 ± 0.009
0.247TrpHis: 0.247 ± 0.006
0.818TrpIle: 0.818 ± 0.012
0.91TrpLys: 0.91 ± 0.013
0.978TrpLeu: 0.978 ± 0.011
0.369TrpMet: 0.369 ± 0.007
0.693TrpAsn: 0.693 ± 0.012
0.412TrpPro: 0.412 ± 0.007
0.439TrpGln: 0.439 ± 0.008
0.665TrpArg: 0.665 ± 0.009
0.838TrpSer: 0.838 ± 0.013
0.714TrpThr: 0.714 ± 0.011
0.571TrpVal: 0.571 ± 0.01
0.168TrpTrp: 0.168 ± 0.005
0.364TrpTyr: 0.364 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.756TyrAla: 1.756 ± 0.016
0.772TyrCys: 0.772 ± 0.01
1.766TyrAsp: 1.766 ± 0.014
1.922TyrGlu: 1.922 ± 0.019
1.69TyrPhe: 1.69 ± 0.019
1.956TyrGly: 1.956 ± 0.023
0.836TyrHis: 0.836 ± 0.015
1.721TyrIle: 1.721 ± 0.017
1.664TyrLys: 1.664 ± 0.015
2.737TyrLeu: 2.737 ± 0.02
0.815TyrMet: 0.815 ± 0.01
1.442TyrAsn: 1.442 ± 0.017
1.437TyrPro: 1.437 ± 0.025
1.373TyrGln: 1.373 ± 0.014
1.721TyrArg: 1.721 ± 0.016
2.387TyrSer: 2.387 ± 0.021
1.633TyrThr: 1.633 ± 0.016
1.831TyrVal: 1.831 ± 0.016
0.424TyrTrp: 0.424 ± 0.009
1.296TyrTyr: 1.296 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.002XaaIle: 0.002 ± 0.001
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
1.959XaaXaa: 1.959 ± 0.292
Statistics based on 21755 proteins (8683488 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski