Amino acid dipeptide frequency for Stenomitos frigidus ULC18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
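As an illustration of what such a calculation involves, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa); the exact counting convention used by the database is not stated on this page, and the function names are hypothetical.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each sequence."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, the toy sequence "AAC" contains two pairs, AA and AC, so each of them gets a frequency of 500‰.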

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
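The uniform expectation and the resulting red/blue marking can be sketched as follows. This is only an illustration under the assumption that the expected value is the uniform 1/400 baseline described above; the names are hypothetical and the database may apply its own rule.

```python
N_STANDARD = 20

# Uniform expectation: 1 of 400 possible dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"    # marked in red in the table
    if observed_permille < expected:
        return "under"   # marked in blue in the table
    return "expected"
```

With values from the table, AlaAla (10.095‰) is classified as "over" and CysCys (0.157‰) as "under".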

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
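The unit conversion is simple arithmetic; a short sketch with hypothetical helper names:

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```

For instance, the AlaAla value of 10.095‰ corresponds to a fraction of about 0.010095, or roughly 1.01%.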

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.095AlaAla: 10.095 ± 0.086
0.945AlaCys: 0.945 ± 0.022
5.102AlaAsp: 5.102 ± 0.055
5.796AlaGlu: 5.796 ± 0.073
3.471AlaPhe: 3.471 ± 0.044
6.188AlaGly: 6.188 ± 0.087
1.763AlaHis: 1.763 ± 0.034
7.583AlaIle: 7.583 ± 0.069
3.981AlaLys: 3.981 ± 0.054
11.387AlaLeu: 11.387 ± 0.091
2.033AlaMet: 2.033 ± 0.031
3.296AlaAsn: 3.296 ± 0.062
4.039AlaPro: 4.039 ± 0.059
5.3AlaGln: 5.3 ± 0.068
4.418AlaArg: 4.418 ± 0.054
5.864AlaSer: 5.864 ± 0.06
5.972AlaThr: 5.972 ± 0.067
6.72AlaVal: 6.72 ± 0.062
1.247AlaTrp: 1.247 ± 0.027
2.561AlaTyr: 2.561 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.686CysAla: 0.686 ± 0.02
0.157CysCys: 0.157 ± 0.009
0.576CysAsp: 0.576 ± 0.019
0.468CysGlu: 0.468 ± 0.016
0.432CysPhe: 0.432 ± 0.015
0.75CysGly: 0.75 ± 0.022
0.302CysHis: 0.302 ± 0.013
0.424CysIle: 0.424 ± 0.015
0.316CysLys: 0.316 ± 0.014
1.201CysLeu: 1.201 ± 0.028
0.17CysMet: 0.17 ± 0.01
0.317CysAsn: 0.317 ± 0.014
0.539CysPro: 0.539 ± 0.02
0.543CysGln: 0.543 ± 0.019
0.583CysArg: 0.583 ± 0.021
0.595CysSer: 0.595 ± 0.02
0.451CysThr: 0.451 ± 0.018
0.59CysVal: 0.59 ± 0.017
0.152CysTrp: 0.152 ± 0.01
0.314CysTyr: 0.314 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
4.785AspAla: 4.785 ± 0.055
0.539AspCys: 0.539 ± 0.016
2.678AspAsp: 2.678 ± 0.041
2.878AspGlu: 2.878 ± 0.047
2.036AspPhe: 2.036 ± 0.042
3.776AspGly: 3.776 ± 0.059
1.078AspHis: 1.078 ± 0.026
2.513AspIle: 2.513 ± 0.041
1.575AspLys: 1.575 ± 0.036
6.091AspLeu: 6.091 ± 0.064
0.745AspMet: 0.745 ± 0.021
1.36AspAsn: 1.36 ± 0.029
2.85AspPro: 2.85 ± 0.036
2.469AspGln: 2.469 ± 0.04
4.746AspArg: 4.746 ± 0.054
2.701AspSer: 2.701 ± 0.041
2.453AspThr: 2.453 ± 0.043
3.427AspVal: 3.427 ± 0.043
0.989AspTrp: 0.989 ± 0.025
1.775AspTyr: 1.775 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
6.608GluAla: 6.608 ± 0.068
0.461GluCys: 0.461 ± 0.015
2.381GluAsp: 2.381 ± 0.043
2.947GluGlu: 2.947 ± 0.049
2.018GluPhe: 2.018 ± 0.035
3.335GluGly: 3.335 ± 0.049
1.149GluHis: 1.149 ± 0.025
3.17GluIle: 3.17 ± 0.048
2.373GluLys: 2.373 ± 0.045
6.162GluLeu: 6.162 ± 0.066
1.24GluMet: 1.24 ± 0.027
1.663GluAsn: 1.663 ± 0.031
2.755GluPro: 2.755 ± 0.046
3.735GluGln: 3.735 ± 0.05
3.769GluArg: 3.769 ± 0.056
3.067GluSer: 3.067 ± 0.044
3.792GluThr: 3.792 ± 0.05
3.686GluVal: 3.686 ± 0.053
0.791GluTrp: 0.791 ± 0.023
1.386GluTyr: 1.386 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.3PheAla: 3.3 ± 0.047
0.479PheCys: 0.479 ± 0.016
2.208PheAsp: 2.208 ± 0.037
2.081PheGlu: 2.081 ± 0.037
1.559PhePhe: 1.559 ± 0.038
2.81PheGly: 2.81 ± 0.042
0.75PheHis: 0.75 ± 0.022
1.796PheIle: 1.796 ± 0.03
1.365PheLys: 1.365 ± 0.032
3.931PheLeu: 3.931 ± 0.051
0.641PheMet: 0.641 ± 0.02
1.434PheAsn: 1.434 ± 0.032
1.686PhePro: 1.686 ± 0.031
1.815PheGln: 1.815 ± 0.028
1.91PheArg: 1.91 ± 0.035
2.684PheSer: 2.684 ± 0.045
2.215PheThr: 2.215 ± 0.04
2.448PheVal: 2.448 ± 0.043
0.666PheTrp: 0.666 ± 0.021
1.213PheTyr: 1.213 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
5.764GlyAla: 5.764 ± 0.076
0.78GlyCys: 0.78 ± 0.02
3.578GlyAsp: 3.578 ± 0.045
3.654GlyGlu: 3.654 ± 0.052
2.979GlyPhe: 2.979 ± 0.042
4.739GlyGly: 4.739 ± 0.091
1.389GlyHis: 1.389 ± 0.034
4.316GlyIle: 4.316 ± 0.056
3.466GlyLys: 3.466 ± 0.049
7.463GlyLeu: 7.463 ± 0.078
1.67GlyMet: 1.67 ± 0.036
2.651GlyAsn: 2.651 ± 0.06
1.484GlyPro: 1.484 ± 0.029
3.508GlyGln: 3.508 ± 0.045
3.629GlyArg: 3.629 ± 0.057
4.453GlySer: 4.453 ± 0.064
4.296GlyThr: 4.296 ± 0.087
4.861GlyVal: 4.861 ± 0.057
1.182GlyTrp: 1.182 ± 0.027
2.236GlyTyr: 2.236 ± 0.035
0.0GlyXaa: 0.0 ± 0.0
His
1.608HisAla: 1.608 ± 0.037
0.315HisCys: 0.315 ± 0.013
1.017HisAsp: 1.017 ± 0.026
1.016HisGlu: 1.016 ± 0.023
0.852HisPhe: 0.852 ± 0.021
1.283HisGly: 1.283 ± 0.03
0.667HisHis: 0.667 ± 0.023
0.955HisIle: 0.955 ± 0.026
0.617HisLys: 0.617 ± 0.019
2.495HisLeu: 2.495 ± 0.044
0.304HisMet: 0.304 ± 0.014
0.635HisAsn: 0.635 ± 0.02
1.55HisPro: 1.55 ± 0.032
1.281HisGln: 1.281 ± 0.031
1.248HisArg: 1.248 ± 0.027
1.229HisSer: 1.229 ± 0.028
1.147HisThr: 1.147 ± 0.027
1.116HisVal: 1.116 ± 0.025
0.474HisTrp: 0.474 ± 0.017
0.785HisTyr: 0.785 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
7.417IleAla: 7.417 ± 0.07
0.536IleCys: 0.536 ± 0.019
3.295IleAsp: 3.295 ± 0.044
3.39IleGlu: 3.39 ± 0.051
1.859IlePhe: 1.859 ± 0.033
4.117IleGly: 4.117 ± 0.056
1.138IleHis: 1.138 ± 0.028
2.269IleIle: 2.269 ± 0.039
1.916IleLys: 1.916 ± 0.036
5.168IleLeu: 5.168 ± 0.057
0.707IleMet: 0.707 ± 0.019
1.88IleAsn: 1.88 ± 0.041
2.926IlePro: 2.926 ± 0.044
2.7IleGln: 2.7 ± 0.04
2.949IleArg: 2.949 ± 0.041
3.017IleSer: 3.017 ± 0.04
3.168IleThr: 3.168 ± 0.049
4.087IleVal: 4.087 ± 0.053
0.654IleTrp: 0.654 ± 0.019
1.374IleTyr: 1.374 ± 0.027
0.0IleXaa: 0.0 ± 0.0
Lys
4.324LysAla: 4.324 ± 0.05
0.21LysCys: 0.21 ± 0.011
1.902LysAsp: 1.902 ± 0.041
1.995LysGlu: 1.995 ± 0.04
1.145LysPhe: 1.145 ± 0.029
2.623LysGly: 2.623 ± 0.047
0.796LysHis: 0.796 ± 0.023
1.923LysIle: 1.923 ± 0.034
1.472LysLys: 1.472 ± 0.036
4.53LysLeu: 4.53 ± 0.054
0.716LysMet: 0.716 ± 0.021
1.122LysAsn: 1.122 ± 0.025
2.537LysPro: 2.537 ± 0.043
2.573LysGln: 2.573 ± 0.036
2.424LysArg: 2.424 ± 0.039
1.914LysSer: 1.914 ± 0.033
2.668LysThr: 2.668 ± 0.045
2.591LysVal: 2.591 ± 0.041
0.344LysTrp: 0.344 ± 0.016
0.857LysTyr: 0.857 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
11.041LeuAla: 11.041 ± 0.086
1.094LeuCys: 1.094 ± 0.022
5.727LeuAsp: 5.727 ± 0.058
7.065LeuGlu: 7.065 ± 0.075
3.731LeuPhe: 3.731 ± 0.051
7.538LeuGly: 7.538 ± 0.071
2.223LeuHis: 2.223 ± 0.038
5.625LeuIle: 5.625 ± 0.06
5.214LeuLys: 5.214 ± 0.058
12.673LeuLeu: 12.673 ± 0.125
2.293LeuMet: 2.293 ± 0.042
4.022LeuAsn: 4.022 ± 0.054
6.415LeuPro: 6.415 ± 0.072
6.315LeuGln: 6.315 ± 0.076
6.329LeuArg: 6.329 ± 0.064
7.821LeuSer: 7.821 ± 0.082
7.186LeuThr: 7.186 ± 0.076
7.677LeuVal: 7.677 ± 0.071
1.693LeuTrp: 1.693 ± 0.043
2.801LeuTyr: 2.801 ± 0.043
0.0LeuXaa: 0.0 ± 0.0
Met
1.957MetAla: 1.957 ± 0.033
0.11MetCys: 0.11 ± 0.008
0.789MetAsp: 0.789 ± 0.022
0.915MetGlu: 0.915 ± 0.022
0.487MetPhe: 0.487 ± 0.016
1.402MetGly: 1.402 ± 0.031
0.367MetHis: 0.367 ± 0.016
0.891MetIle: 0.891 ± 0.023
0.824MetLys: 0.824 ± 0.023
2.011MetLeu: 2.011 ± 0.039
0.449MetMet: 0.449 ± 0.017
0.709MetAsn: 0.709 ± 0.02
1.199MetPro: 1.199 ± 0.023
1.143MetGln: 1.143 ± 0.027
1.046MetArg: 1.046 ± 0.024
1.305MetSer: 1.305 ± 0.028
1.465MetThr: 1.465 ± 0.031
1.362MetVal: 1.362 ± 0.03
0.147MetTrp: 0.147 ± 0.009
0.298MetTyr: 0.298 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
3.26AsnAla: 3.26 ± 0.048
0.326AsnCys: 0.326 ± 0.014
1.612AsnAsp: 1.612 ± 0.036
1.436AsnGlu: 1.436 ± 0.024
1.322AsnPhe: 1.322 ± 0.03
2.582AsnGly: 2.582 ± 0.065
0.735AsnHis: 0.735 ± 0.02
1.531AsnIle: 1.531 ± 0.035
0.908AsnLys: 0.908 ± 0.026
4.316AsnLeu: 4.316 ± 0.065
0.453AsnMet: 0.453 ± 0.016
1.126AsnAsn: 1.126 ± 0.033
2.452AsnPro: 2.452 ± 0.039
2.084AsnGln: 2.084 ± 0.038
2.109AsnArg: 2.109 ± 0.036
1.88AsnSer: 1.88 ± 0.042
1.759AsnThr: 1.759 ± 0.046
2.181AsnVal: 2.181 ± 0.04
0.575AsnTrp: 0.575 ± 0.018
0.957AsnTyr: 0.957 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
4.528ProAla: 4.528 ± 0.063
0.39ProCys: 0.39 ± 0.015
3.494ProAsp: 3.494 ± 0.051
3.547ProGlu: 3.547 ± 0.049
1.862ProPhe: 1.862 ± 0.037
3.368ProGly: 3.368 ± 0.054
1.127ProHis: 1.127 ± 0.03
2.91ProIle: 2.91 ± 0.041
2.0ProLys: 2.0 ± 0.037
5.596ProLeu: 5.596 ± 0.06
0.909ProMet: 0.909 ± 0.023
1.921ProAsn: 1.921 ± 0.036
2.682ProPro: 2.682 ± 0.044
2.436ProGln: 2.436 ± 0.037
1.99ProArg: 1.99 ± 0.033
3.453ProSer: 3.453 ± 0.046
3.564ProThr: 3.564 ± 0.048
3.537ProVal: 3.537 ± 0.049
0.64ProTrp: 0.64 ± 0.02
1.325ProTyr: 1.325 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
6.389GlnAla: 6.389 ± 0.072
0.418GlnCys: 0.418 ± 0.016
2.152GlnAsp: 2.152 ± 0.033
3.025GlnGlu: 3.025 ± 0.049
1.912GlnPhe: 1.912 ± 0.037
3.389GlnGly: 3.389 ± 0.048
1.113GlnHis: 1.113 ± 0.029
3.097GlnIle: 3.097 ± 0.039
2.139GlnLys: 2.139 ± 0.038
6.331GlnLeu: 6.331 ± 0.072
1.127GlnMet: 1.127 ± 0.028
1.641GlnAsn: 1.641 ± 0.035
3.273GlnPro: 3.273 ± 0.044
4.279GlnGln: 4.279 ± 0.075
3.712GlnArg: 3.712 ± 0.052
3.304GlnSer: 3.304 ± 0.046
3.746GlnThr: 3.746 ± 0.052
3.894GlnVal: 3.894 ± 0.052
0.756GlnTrp: 0.756 ± 0.023
1.122GlnTyr: 1.122 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
4.429ArgAla: 4.429 ± 0.057
0.574ArgCys: 0.574 ± 0.019
2.991ArgAsp: 2.991 ± 0.042
3.245ArgGlu: 3.245 ± 0.047
2.401ArgPhe: 2.401 ± 0.039
3.236ArgGly: 3.236 ± 0.051
1.227ArgHis: 1.227 ± 0.028
2.889ArgIle: 2.889 ± 0.043
2.052ArgLys: 2.052 ± 0.04
7.289ArgLeu: 7.289 ± 0.069
1.165ArgMet: 1.165 ± 0.026
1.792ArgAsn: 1.792 ± 0.032
2.443ArgPro: 2.443 ± 0.039
3.841ArgGln: 3.841 ± 0.057
3.575ArgArg: 3.575 ± 0.053
4.367ArgSer: 4.367 ± 0.06
2.967ArgThr: 2.967 ± 0.041
3.85ArgVal: 3.85 ± 0.049
0.996ArgTrp: 0.996 ± 0.027
2.027ArgTyr: 2.027 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
5.314SerAla: 5.314 ± 0.062
0.532SerCys: 0.532 ± 0.019
3.343SerAsp: 3.343 ± 0.05
3.24SerGlu: 3.24 ± 0.049
2.413SerPhe: 2.413 ± 0.041
4.851SerGly: 4.851 ± 0.066
1.405SerHis: 1.405 ± 0.028
3.207SerIle: 3.207 ± 0.05
2.221SerLys: 2.221 ± 0.035
7.518SerLeu: 7.518 ± 0.081
1.149SerMet: 1.149 ± 0.031
2.151SerAsn: 2.151 ± 0.041
3.658SerPro: 3.658 ± 0.049
3.36SerGln: 3.36 ± 0.046
3.418SerArg: 3.418 ± 0.046
4.166SerSer: 4.166 ± 0.067
3.657SerThr: 3.657 ± 0.048
4.179SerVal: 4.179 ± 0.05
0.804SerTrp: 0.804 ± 0.024
1.653SerTyr: 1.653 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
6.248ThrAla: 6.248 ± 0.076
0.498ThrCys: 0.498 ± 0.015
2.953ThrAsp: 2.953 ± 0.044
3.182ThrGlu: 3.182 ± 0.041
2.207ThrPhe: 2.207 ± 0.036
4.434ThrGly: 4.434 ± 0.068
1.188ThrHis: 1.188 ± 0.023
3.753ThrIle: 3.753 ± 0.056
1.968ThrLys: 1.968 ± 0.036
7.41ThrLeu: 7.41 ± 0.084
0.905ThrMet: 0.905 ± 0.023
1.971ThrAsn: 1.971 ± 0.039
3.744ThrPro: 3.744 ± 0.056
3.062ThrGln: 3.062 ± 0.045
2.759ThrArg: 2.759 ± 0.04
3.388ThrSer: 3.388 ± 0.054
3.798ThrThr: 3.798 ± 0.069
4.748ThrVal: 4.748 ± 0.055
0.771ThrTrp: 0.771 ± 0.024
1.559ThrTyr: 1.559 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
6.624ValAla: 6.624 ± 0.05
0.691ValCys: 0.691 ± 0.02
3.545ValAsp: 3.545 ± 0.049
4.234ValGlu: 4.234 ± 0.05
2.557ValPhe: 2.557 ± 0.035
4.742ValGly: 4.742 ± 0.053
1.143ValHis: 1.143 ± 0.028
3.75ValIle: 3.75 ± 0.055
2.875ValLys: 2.875 ± 0.041
7.574ValLeu: 7.574 ± 0.071
1.485ValMet: 1.485 ± 0.031
2.515ValAsn: 2.515 ± 0.047
3.344ValPro: 3.344 ± 0.049
3.4ValGln: 3.4 ± 0.047
3.72ValArg: 3.72 ± 0.049
4.523ValSer: 4.523 ± 0.053
4.11ValThr: 4.11 ± 0.057
5.125ValVal: 5.125 ± 0.066
1.014ValTrp: 1.014 ± 0.027
1.754ValTyr: 1.754 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.056TrpAla: 1.056 ± 0.023
0.16TrpCys: 0.16 ± 0.009
0.716TrpAsp: 0.716 ± 0.023
0.783TrpGlu: 0.783 ± 0.024
0.631TrpPhe: 0.631 ± 0.02
0.932TrpGly: 0.932 ± 0.023
0.414TrpHis: 0.414 ± 0.016
0.723TrpIle: 0.723 ± 0.022
0.518TrpLys: 0.518 ± 0.017
2.159TrpLeu: 2.159 ± 0.043
0.337TrpMet: 0.337 ± 0.014
0.532TrpAsn: 0.532 ± 0.016
0.193TrpPro: 0.193 ± 0.011
1.3TrpGln: 1.3 ± 0.034
0.939TrpArg: 0.939 ± 0.027
0.938TrpSer: 0.938 ± 0.024
0.664TrpThr: 0.664 ± 0.02
1.019TrpVal: 1.019 ± 0.027
0.239TrpTrp: 0.239 ± 0.012
0.375TrpTyr: 0.375 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.329TyrAla: 2.329 ± 0.037
0.352TyrCys: 0.352 ± 0.013
1.403TyrAsp: 1.403 ± 0.028
1.591TyrGlu: 1.591 ± 0.03
1.116TyrPhe: 1.116 ± 0.023
2.002TyrGly: 2.002 ± 0.035
0.619TyrHis: 0.619 ± 0.019
1.243TyrIle: 1.243 ± 0.027
0.866TyrLys: 0.866 ± 0.023
3.178TyrLeu: 3.178 ± 0.04
0.379TyrMet: 0.379 ± 0.013
0.869TyrAsn: 0.869 ± 0.024
1.509TyrPro: 1.509 ± 0.03
1.676TyrGln: 1.676 ± 0.031
2.082TyrArg: 2.082 ± 0.034
1.568TyrSer: 1.568 ± 0.031
1.468TyrThr: 1.468 ± 0.028
1.627TyrVal: 1.627 ± 0.031
0.495TyrTrp: 0.495 ± 0.017
0.872TyrTyr: 0.872 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5916 proteins (1851076 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
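A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the per-replicate estimates; the page does not spell out these details, and the function names and the seed parameter are hypothetical.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille of all adjacent pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.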

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski