Amino acid dipeptide frequency for Comamonas aquatica DA1877

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
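As a minimal sketch of what such a calculation could look like (the page does not describe its exact procedure), the snippet below counts overlapping residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_per_mille`, the one-letter codes, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted inside each protein only, so a pair
    never spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter code (not real data).
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_per_mille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example AlaLeu rather than AL); mapping between the two is a simple lookup and is left out here.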

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
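The arithmetic behind the expected value can be sketched as follows. The snippet assumes that the colouring threshold is simply the uniform expectation of 1/400; the helper `classify` is hypothetical, and the two observed values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 = 0.25 %, which is 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be shown in red
    if observed_per_mille < expected:
        return "under"  # would be shown in blue
    return "as expected"


# Observed frequencies (per mille) copied from the table.
examples = {"LeuLeu": 12.052, "CysCys": 0.112}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille, labels)
```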

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
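For illustration, the unit conversion can be written as two small helpers. The function names are made up for this sketch; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 17.559  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```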

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.559 ± 0.227
AlaCys: 1.399 ± 0.04
AlaAsp: 5.945 ± 0.073
AlaGlu: 6.373 ± 0.091
AlaPhe: 3.772 ± 0.057
AlaGly: 9.834 ± 0.101
AlaHis: 3.111 ± 0.061
AlaIle: 5.079 ± 0.069
AlaLys: 4.126 ± 0.087
AlaLeu: 15.375 ± 0.164
AlaMet: 3.687 ± 0.059
AlaAsn: 2.693 ± 0.048
AlaPro: 6.278 ± 0.102
AlaGln: 8.279 ± 0.124
AlaArg: 7.852 ± 0.093
AlaSer: 6.423 ± 0.086
AlaThr: 6.214 ± 0.086
AlaVal: 9.477 ± 0.101
AlaTrp: 2.201 ± 0.048
AlaTyr: 2.525 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.236 ± 0.034
CysCys: 0.112 ± 0.011
CysAsp: 0.504 ± 0.022
CysGlu: 0.501 ± 0.021
CysPhe: 0.285 ± 0.016
CysGly: 0.949 ± 0.036
CysHis: 0.302 ± 0.016
CysIle: 0.463 ± 0.021
CysLys: 0.261 ± 0.014
CysLeu: 0.929 ± 0.029
CysMet: 0.275 ± 0.017
CysAsn: 0.26 ± 0.013
CysPro: 0.521 ± 0.025
CysGln: 0.363 ± 0.02
CysArg: 0.511 ± 0.022
CysSer: 0.537 ± 0.021
CysThr: 0.58 ± 0.023
CysVal: 0.76 ± 0.028
CysTrp: 0.135 ± 0.012
CysTyr: 0.204 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.742 ± 0.091
AspCys: 0.481 ± 0.021
AspAsp: 2.27 ± 0.046
AspGlu: 2.665 ± 0.059
AspPhe: 1.952 ± 0.045
AspGly: 4.195 ± 0.066
AspHis: 1.164 ± 0.036
AspIle: 2.347 ± 0.051
AspLys: 1.853 ± 0.053
AspLeu: 5.079 ± 0.062
AspMet: 1.425 ± 0.032
AspAsn: 1.195 ± 0.033
AspPro: 2.5 ± 0.045
AspGln: 1.815 ± 0.041
AspArg: 2.868 ± 0.06
AspSer: 2.257 ± 0.044
AspThr: 2.509 ± 0.054
AspVal: 3.967 ± 0.062
AspTrp: 0.989 ± 0.033
AspTyr: 1.339 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.599 ± 0.087
GluCys: 0.37 ± 0.019
GluAsp: 2.294 ± 0.047
GluGlu: 2.471 ± 0.054
GluPhe: 1.617 ± 0.038
GluGly: 3.643 ± 0.059
GluHis: 1.431 ± 0.036
GluIle: 2.485 ± 0.054
GluLys: 1.826 ± 0.048
GluLeu: 5.63 ± 0.083
GluMet: 1.287 ± 0.038
GluAsn: 1.168 ± 0.037
GluPro: 2.098 ± 0.054
GluGln: 3.033 ± 0.06
GluArg: 4.103 ± 0.071
GluSer: 2.199 ± 0.051
GluThr: 2.237 ± 0.042
GluVal: 4.147 ± 0.074
GluTrp: 0.662 ± 0.023
GluTyr: 0.955 ± 0.028
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.06 ± 0.067
PheCys: 0.37 ± 0.021
PheAsp: 2.173 ± 0.051
PheGlu: 1.778 ± 0.045
PhePhe: 1.276 ± 0.042
PheGly: 3.169 ± 0.06
PheHis: 0.72 ± 0.022
PheIle: 1.434 ± 0.037
PheLys: 1.207 ± 0.038
PheLeu: 2.861 ± 0.059
PheMet: 0.824 ± 0.027
PheAsn: 1.054 ± 0.037
PhePro: 1.381 ± 0.037
PheGln: 1.275 ± 0.034
PheArg: 1.54 ± 0.037
PheSer: 1.93 ± 0.046
PheThr: 1.876 ± 0.046
PheVal: 2.736 ± 0.055
PheTrp: 0.526 ± 0.023
PheTyr: 0.819 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.689 ± 0.113
GlyCys: 0.958 ± 0.029
GlyAsp: 3.675 ± 0.064
GlyGlu: 3.89 ± 0.066
GlyPhe: 3.039 ± 0.061
GlyGly: 6.338 ± 0.104
GlyHis: 2.137 ± 0.053
GlyIle: 3.817 ± 0.067
GlyLys: 3.303 ± 0.06
GlyLeu: 9.043 ± 0.1
GlyMet: 2.554 ± 0.053
GlyAsn: 2.128 ± 0.046
GlyPro: 2.842 ± 0.052
GlyGln: 4.128 ± 0.064
GlyArg: 4.689 ± 0.074
GlySer: 4.367 ± 0.062
GlyThr: 4.371 ± 0.073
GlyVal: 6.67 ± 0.093
GlyTrp: 1.531 ± 0.046
GlyTyr: 2.193 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.193 ± 0.06
HisCys: 0.325 ± 0.019
HisAsp: 1.273 ± 0.036
HisGlu: 1.109 ± 0.029
HisPhe: 0.955 ± 0.031
HisGly: 2.249 ± 0.046
HisHis: 0.76 ± 0.027
HisIle: 1.157 ± 0.03
HisLys: 0.69 ± 0.023
HisLeu: 2.582 ± 0.056
HisMet: 0.633 ± 0.024
HisAsn: 0.595 ± 0.025
HisPro: 1.697 ± 0.039
HisGln: 1.073 ± 0.03
HisArg: 1.449 ± 0.034
HisSer: 1.342 ± 0.037
HisThr: 1.335 ± 0.037
HisVal: 1.683 ± 0.034
HisTrp: 0.544 ± 0.026
HisTyr: 0.698 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.934 ± 0.076
IleCys: 0.384 ± 0.018
IleAsp: 2.712 ± 0.053
IleGlu: 2.68 ± 0.047
IlePhe: 1.315 ± 0.035
IleGly: 3.795 ± 0.057
IleHis: 0.899 ± 0.026
IleIle: 1.596 ± 0.054
IleLys: 1.506 ± 0.044
IleLeu: 3.369 ± 0.055
IleMet: 0.81 ± 0.03
IleAsn: 1.436 ± 0.041
IlePro: 2.069 ± 0.049
IleGln: 1.613 ± 0.042
IleArg: 2.383 ± 0.045
IleSer: 2.324 ± 0.05
IleThr: 2.634 ± 0.05
IleVal: 3.222 ± 0.064
IleTrp: 0.508 ± 0.022
IleTyr: 0.961 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.177 ± 0.087
LysCys: 0.174 ± 0.013
LysAsp: 1.833 ± 0.054
LysGlu: 1.65 ± 0.046
LysPhe: 0.919 ± 0.031
LysGly: 2.419 ± 0.058
LysHis: 0.76 ± 0.028
LysIle: 1.497 ± 0.043
LysLys: 1.504 ± 0.06
LysLeu: 3.718 ± 0.071
LysMet: 0.787 ± 0.029
LysAsn: 0.927 ± 0.031
LysPro: 2.026 ± 0.043
LysGln: 1.507 ± 0.041
LysArg: 1.956 ± 0.05
LysSer: 1.753 ± 0.046
LysThr: 1.858 ± 0.04
LysVal: 2.561 ± 0.058
LysTrp: 0.325 ± 0.018
LysTyr: 0.664 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.378 ± 0.139
LeuCys: 1.201 ± 0.036
LeuAsp: 5.585 ± 0.078
LeuGlu: 5.423 ± 0.078
LeuPhe: 3.218 ± 0.059
LeuGly: 9.034 ± 0.12
LeuHis: 2.859 ± 0.061
LeuIle: 4.202 ± 0.074
LeuLys: 3.373 ± 0.067
LeuLeu: 12.052 ± 0.158
LeuMet: 2.665 ± 0.053
LeuAsn: 2.785 ± 0.045
LeuPro: 6.414 ± 0.098
LeuGln: 6.482 ± 0.107
LeuArg: 7.625 ± 0.103
LeuSer: 6.102 ± 0.078
LeuThr: 5.239 ± 0.072
LeuVal: 8.036 ± 0.1
LeuTrp: 1.587 ± 0.049
LeuTyr: 2.034 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.444 ± 0.06
MetCys: 0.187 ± 0.013
MetAsp: 1.277 ± 0.035
MetGlu: 1.248 ± 0.038
MetPhe: 0.77 ± 0.029
MetGly: 2.064 ± 0.046
MetHis: 0.696 ± 0.026
MetIle: 0.903 ± 0.033
MetLys: 0.877 ± 0.026
MetLeu: 2.937 ± 0.061
MetMet: 0.552 ± 0.026
MetAsn: 0.827 ± 0.027
MetPro: 1.586 ± 0.037
MetGln: 1.467 ± 0.037
MetArg: 1.675 ± 0.048
MetSer: 1.587 ± 0.037
MetThr: 1.488 ± 0.042
MetVal: 2.014 ± 0.037
MetTrp: 0.245 ± 0.019
MetTyr: 0.415 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.336 ± 0.065
AsnCys: 0.243 ± 0.014
AsnAsp: 1.333 ± 0.034
AsnGlu: 1.065 ± 0.032
AsnPhe: 0.97 ± 0.033
AsnGly: 2.005 ± 0.049
AsnHis: 0.521 ± 0.024
AsnIle: 1.222 ± 0.035
AsnLys: 0.839 ± 0.03
AsnLeu: 2.646 ± 0.052
AsnMet: 0.595 ± 0.025
AsnAsn: 0.699 ± 0.026
AsnPro: 1.838 ± 0.048
AsnGln: 1.065 ± 0.032
AsnArg: 1.432 ± 0.042
AsnSer: 1.168 ± 0.033
AsnThr: 1.444 ± 0.04
AsnVal: 1.87 ± 0.043
AsnTrp: 0.432 ± 0.019
AsnTyr: 0.668 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.26 ± 0.111
ProCys: 0.393 ± 0.019
ProAsp: 2.822 ± 0.046
ProGlu: 3.228 ± 0.061
ProPhe: 1.614 ± 0.038
ProGly: 4.419 ± 0.071
ProHis: 1.376 ± 0.044
ProIle: 1.784 ± 0.045
ProLys: 1.486 ± 0.043
ProLeu: 5.339 ± 0.077
ProMet: 1.385 ± 0.036
ProAsn: 1.192 ± 0.03
ProPro: 2.474 ± 0.063
ProGln: 2.926 ± 0.059
ProArg: 2.647 ± 0.062
ProSer: 2.727 ± 0.049
ProThr: 2.537 ± 0.052
ProVal: 4.288 ± 0.072
ProTrp: 0.864 ± 0.028
ProTyr: 1.136 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 8.235 ± 0.108
GlnCys: 0.384 ± 0.018
GlnAsp: 2.288 ± 0.048
GlnGlu: 2.187 ± 0.051
GlnPhe: 1.48 ± 0.036
GlnGly: 4.228 ± 0.068
GlnHis: 1.526 ± 0.041
GlnIle: 2.032 ± 0.047
GlnLys: 1.294 ± 0.038
GlnLeu: 5.884 ± 0.09
GlnMet: 1.234 ± 0.034
GlnAsn: 1.002 ± 0.034
GlnPro: 2.986 ± 0.067
GlnGln: 3.313 ± 0.076
GlnArg: 4.272 ± 0.067
GlnSer: 2.394 ± 0.052
GlnThr: 2.374 ± 0.047
GlnVal: 3.902 ± 0.068
GlnTrp: 0.968 ± 0.033
GlnTyr: 0.849 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.809 ± 0.09
ArgCys: 0.599 ± 0.024
ArgAsp: 3.038 ± 0.05
ArgGlu: 3.446 ± 0.059
ArgPhe: 2.335 ± 0.046
ArgGly: 4.055 ± 0.066
ArgHis: 1.825 ± 0.046
ArgIle: 3.087 ± 0.05
ArgLys: 2.186 ± 0.044
ArgLeu: 7.035 ± 0.099
ArgMet: 1.978 ± 0.042
ArgAsn: 1.804 ± 0.04
ArgPro: 2.863 ± 0.052
ArgGln: 3.347 ± 0.057
ArgArg: 4.139 ± 0.063
ArgSer: 3.422 ± 0.061
ArgThr: 3.123 ± 0.057
ArgVal: 4.529 ± 0.07
ArgTrp: 1.281 ± 0.041
ArgTyr: 1.641 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.177 ± 0.083
SerCys: 0.43 ± 0.022
SerAsp: 2.435 ± 0.051
SerGlu: 2.302 ± 0.047
SerPhe: 1.952 ± 0.038
SerGly: 4.584 ± 0.065
SerHis: 1.264 ± 0.035
SerIle: 2.373 ± 0.051
SerLys: 1.681 ± 0.042
SerLeu: 5.88 ± 0.076
SerMet: 1.505 ± 0.037
SerAsn: 1.435 ± 0.038
SerPro: 2.698 ± 0.053
SerGln: 2.275 ± 0.044
SerArg: 2.908 ± 0.052
SerSer: 2.951 ± 0.063
SerThr: 3.047 ± 0.049
SerVal: 4.019 ± 0.064
SerTrp: 0.745 ± 0.026
SerTyr: 1.227 ± 0.034
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.442 ± 0.079
ThrCys: 0.437 ± 0.022
ThrAsp: 2.441 ± 0.052
ThrGlu: 2.459 ± 0.048
ThrPhe: 1.569 ± 0.037
ThrGly: 4.488 ± 0.066
ThrHis: 1.23 ± 0.036
ThrIle: 2.043 ± 0.045
ThrLys: 1.382 ± 0.037
ThrLeu: 6.049 ± 0.073
ThrMet: 1.144 ± 0.031
ThrAsn: 1.214 ± 0.033
ThrPro: 3.685 ± 0.054
ThrGln: 2.525 ± 0.047
ThrArg: 2.885 ± 0.049
ThrSer: 2.659 ± 0.046
ThrThr: 2.978 ± 0.048
ThrVal: 4.294 ± 0.069
ThrTrp: 0.742 ± 0.03
ThrTyr: 1.083 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.713 ± 0.101
ValCys: 0.797 ± 0.031
ValAsp: 3.92 ± 0.059
ValGlu: 4.075 ± 0.069
ValPhe: 2.536 ± 0.048
ValGly: 5.777 ± 0.087
ValHis: 1.868 ± 0.043
ValIle: 3.214 ± 0.061
ValLys: 2.375 ± 0.061
ValLeu: 9.149 ± 0.105
ValMet: 1.916 ± 0.044
ValAsn: 1.95 ± 0.041
ValPro: 4.018 ± 0.065
ValGln: 4.244 ± 0.074
ValArg: 4.96 ± 0.064
ValSer: 3.754 ± 0.06
ValThr: 3.975 ± 0.059
ValVal: 6.631 ± 0.091
ValTrp: 1.144 ± 0.034
ValTyr: 1.559 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.531 ± 0.042
TrpCys: 0.2 ± 0.013
TrpAsp: 0.617 ± 0.024
TrpGlu: 0.602 ± 0.027
TrpPhe: 0.577 ± 0.023
TrpGly: 1.189 ± 0.035
TrpHis: 0.424 ± 0.021
TrpIle: 0.637 ± 0.026
TrpLys: 0.523 ± 0.024
TrpLeu: 2.384 ± 0.054
TrpMet: 0.523 ± 0.023
TrpAsn: 0.435 ± 0.022
TrpPro: 0.811 ± 0.028
TrpGln: 1.087 ± 0.037
TrpArg: 1.205 ± 0.038
TrpSer: 0.864 ± 0.031
TrpThr: 0.73 ± 0.025
TrpVal: 1.184 ± 0.032
TrpTrp: 0.337 ± 0.02
TrpTyr: 0.268 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.687 ± 0.057
TyrCys: 0.242 ± 0.015
TyrAsp: 1.176 ± 0.033
TyrGlu: 1.102 ± 0.031
TyrPhe: 0.861 ± 0.029
TyrGly: 1.929 ± 0.042
TyrHis: 0.47 ± 0.021
TyrIle: 0.775 ± 0.029
TyrLys: 0.726 ± 0.031
TyrLeu: 2.352 ± 0.047
TyrMet: 0.483 ± 0.018
TyrAsn: 0.593 ± 0.027
TyrPro: 1.111 ± 0.033
TyrGln: 1.016 ± 0.03
TyrArg: 1.439 ± 0.045
TyrSer: 1.127 ± 0.029
TyrThr: 1.202 ± 0.034
TyrVal: 1.585 ± 0.039
TyrTrp: 0.362 ± 0.019
TyrTyr: 0.557 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3527 proteins (1109371 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
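One way such an estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of the results is reported. This is only an illustration under stated assumptions. The page does not say which spread measure it uses (the standard deviation is assumed here), and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation (per mille) of one dipeptide frequency over resamples.

    Resampling is done at the protein level: each resample draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    # statistics.stdev needs at least two estimates.
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code (not real data).
toy_proteome = ["MAAL", "MAL", "AAAA", "MKL"]
error_aa = bootstrap_error(toy_proteome, "AA")
print(round(error_aa, 3))
```

Resampling proteins rather than individual residue pairs keeps the pairs within a protein together, which is presumably why the estimate is made at the protein level.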

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski