Amino acid dipepetide frequency for Devosia sp. H239

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.932AlaAla: 15.932 ± 0.218
0.753AlaCys: 0.753 ± 0.033
7.022AlaAsp: 7.022 ± 0.097
7.838AlaGlu: 7.838 ± 0.098
4.224AlaPhe: 4.224 ± 0.077
10.026AlaGly: 10.026 ± 0.125
1.992AlaHis: 1.992 ± 0.05
6.809AlaIle: 6.809 ± 0.095
3.763AlaLys: 3.763 ± 0.073
13.471AlaLeu: 13.471 ± 0.139
3.65AlaMet: 3.65 ± 0.064
3.31AlaAsn: 3.31 ± 0.063
5.321AlaPro: 5.321 ± 0.106
4.341AlaGln: 4.341 ± 0.085
7.39AlaArg: 7.39 ± 0.102
6.411AlaSer: 6.411 ± 0.101
6.38AlaThr: 6.38 ± 0.102
8.817AlaVal: 8.817 ± 0.117
1.375AlaTrp: 1.375 ± 0.049
2.492AlaTyr: 2.492 ± 0.058
0.0AlaXaa: 0.0 ± 0.0
Cys
0.76CysAla: 0.76 ± 0.031
0.113CysCys: 0.113 ± 0.013
0.419CysAsp: 0.419 ± 0.022
0.325CysGlu: 0.325 ± 0.018
0.282CysPhe: 0.282 ± 0.018
0.717CysGly: 0.717 ± 0.028
0.205CysHis: 0.205 ± 0.014
0.316CysIle: 0.316 ± 0.021
0.157CysLys: 0.157 ± 0.013
0.617CysLeu: 0.617 ± 0.026
0.144CysMet: 0.144 ± 0.012
0.176CysAsn: 0.176 ± 0.014
0.343CysPro: 0.343 ± 0.019
0.216CysGln: 0.216 ± 0.016
0.567CysArg: 0.567 ± 0.03
0.391CysSer: 0.391 ± 0.023
0.339CysThr: 0.339 ± 0.02
0.45CysVal: 0.45 ± 0.022
0.104CysTrp: 0.104 ± 0.012
0.172CysTyr: 0.172 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
6.996AspAla: 6.996 ± 0.095
0.405AspCys: 0.405 ± 0.023
3.306AspAsp: 3.306 ± 0.069
3.496AspGlu: 3.496 ± 0.079
2.371AspPhe: 2.371 ± 0.059
5.208AspGly: 5.208 ± 0.086
1.233AspHis: 1.233 ± 0.043
3.378AspIle: 3.378 ± 0.067
1.726AspLys: 1.726 ± 0.048
6.144AspLeu: 6.144 ± 0.091
1.345AspMet: 1.345 ± 0.04
1.636AspAsn: 1.636 ± 0.07
3.323AspPro: 3.323 ± 0.073
2.024AspGln: 2.024 ± 0.045
4.088AspArg: 4.088 ± 0.079
2.232AspSer: 2.232 ± 0.059
2.883AspThr: 2.883 ± 0.07
4.278AspVal: 4.278 ± 0.073
1.005AspTrp: 1.005 ± 0.036
1.498AspTyr: 1.498 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
7.675GluAla: 7.675 ± 0.1
0.29GluCys: 0.29 ± 0.02
3.054GluAsp: 3.054 ± 0.066
3.347GluGlu: 3.347 ± 0.088
1.878GluPhe: 1.878 ± 0.051
4.323GluGly: 4.323 ± 0.085
1.275GluHis: 1.275 ± 0.042
3.363GluIle: 3.363 ± 0.068
2.02GluLys: 2.02 ± 0.049
5.545GluLeu: 5.545 ± 0.087
1.468GluMet: 1.468 ± 0.038
1.63GluAsn: 1.63 ± 0.045
2.867GluPro: 2.867 ± 0.082
2.425GluGln: 2.425 ± 0.049
4.442GluArg: 4.442 ± 0.08
2.283GluSer: 2.283 ± 0.049
3.598GluThr: 3.598 ± 0.071
4.08GluVal: 4.08 ± 0.073
0.682GluTrp: 0.682 ± 0.029
1.011GluTyr: 1.011 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
4.758PheAla: 4.758 ± 0.078
0.337PheCys: 0.337 ± 0.021
2.705PheAsp: 2.705 ± 0.062
2.266PheGlu: 2.266 ± 0.054
1.519PhePhe: 1.519 ± 0.056
3.836PheGly: 3.836 ± 0.065
0.728PheHis: 0.728 ± 0.031
1.82PheIle: 1.82 ± 0.047
1.026PheLys: 1.026 ± 0.04
3.359PheLeu: 3.359 ± 0.069
0.851PheMet: 0.851 ± 0.035
1.227PheAsn: 1.227 ± 0.038
1.558PhePro: 1.558 ± 0.047
1.051PheGln: 1.051 ± 0.036
2.036PheArg: 2.036 ± 0.053
2.318PheSer: 2.318 ± 0.051
2.128PheThr: 2.128 ± 0.051
2.932PheVal: 2.932 ± 0.065
0.58PheTrp: 0.58 ± 0.029
0.968PheTyr: 0.968 ± 0.035
0.0PheXaa: 0.0 ± 0.0
Gly
8.938GlyAla: 8.938 ± 0.115
0.622GlyCys: 0.622 ± 0.027
4.503GlyAsp: 4.503 ± 0.09
4.832GlyGlu: 4.832 ± 0.086
3.613GlyPhe: 3.613 ± 0.073
7.022GlyGly: 7.022 ± 0.136
1.751GlyHis: 1.751 ± 0.042
4.612GlyIle: 4.612 ± 0.073
3.39GlyLys: 3.39 ± 0.076
8.665GlyLeu: 8.665 ± 0.107
2.252GlyMet: 2.252 ± 0.057
2.519GlyAsn: 2.519 ± 0.078
3.343GlyPro: 3.343 ± 0.065
3.205GlyGln: 3.205 ± 0.063
5.385GlyArg: 5.385 ± 0.087
4.919GlySer: 4.919 ± 0.107
4.82GlyThr: 4.82 ± 0.081
6.168GlyVal: 6.168 ± 0.08
1.309GlyTrp: 1.309 ± 0.042
2.166GlyTyr: 2.166 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
2.074HisAla: 2.074 ± 0.044
0.185HisCys: 0.185 ± 0.016
1.116HisAsp: 1.116 ± 0.032
1.082HisGlu: 1.082 ± 0.032
0.866HisPhe: 0.866 ± 0.033
1.758HisGly: 1.758 ± 0.049
0.585HisHis: 0.585 ± 0.028
0.983HisIle: 0.983 ± 0.033
0.48HisLys: 0.48 ± 0.022
2.01HisLeu: 2.01 ± 0.043
0.52HisMet: 0.52 ± 0.027
0.499HisAsn: 0.499 ± 0.025
1.233HisPro: 1.233 ± 0.038
0.641HisGln: 0.641 ± 0.028
1.476HisArg: 1.476 ± 0.05
0.888HisSer: 0.888 ± 0.032
0.809HisThr: 0.809 ± 0.034
1.42HisVal: 1.42 ± 0.044
0.309HisTrp: 0.309 ± 0.021
0.5HisTyr: 0.5 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
7.432IleAla: 7.432 ± 0.101
0.462IleCys: 0.462 ± 0.021
3.902IleAsp: 3.902 ± 0.069
3.774IleGlu: 3.774 ± 0.067
1.902IlePhe: 1.902 ± 0.055
5.142IleGly: 5.142 ± 0.088
0.858IleHis: 0.858 ± 0.033
2.811IleIle: 2.811 ± 0.069
1.607IleLys: 1.607 ± 0.044
4.683IleLeu: 4.683 ± 0.078
1.087IleMet: 1.087 ± 0.036
1.628IleAsn: 1.628 ± 0.054
2.259IlePro: 2.259 ± 0.054
1.308IleGln: 1.308 ± 0.041
2.964IleArg: 2.964 ± 0.063
3.165IleSer: 3.165 ± 0.061
3.096IleThr: 3.096 ± 0.075
4.438IleVal: 4.438 ± 0.081
0.679IleTrp: 0.679 ± 0.032
1.271IleTyr: 1.271 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
4.009LysAla: 4.009 ± 0.082
0.121LysCys: 0.121 ± 0.012
1.679LysAsp: 1.679 ± 0.051
1.278LysGlu: 1.278 ± 0.039
0.894LysPhe: 0.894 ± 0.027
2.41LysGly: 2.41 ± 0.052
0.633LysHis: 0.633 ± 0.028
1.619LysIle: 1.619 ± 0.046
1.128LysLys: 1.128 ± 0.045
3.342LysLeu: 3.342 ± 0.065
0.722LysMet: 0.722 ± 0.031
0.906LysAsn: 0.906 ± 0.036
2.028LysPro: 2.028 ± 0.062
1.022LysGln: 1.022 ± 0.031
2.267LysArg: 2.267 ± 0.055
1.934LysSer: 1.934 ± 0.05
1.848LysThr: 1.848 ± 0.054
2.417LysVal: 2.417 ± 0.062
0.386LysTrp: 0.386 ± 0.019
0.555LysTyr: 0.555 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
13.185LeuAla: 13.185 ± 0.134
0.719LeuCys: 0.719 ± 0.029
6.381LeuAsp: 6.381 ± 0.098
5.43LeuGlu: 5.43 ± 0.091
3.637LeuPhe: 3.637 ± 0.076
8.434LeuGly: 8.434 ± 0.127
1.728LeuHis: 1.728 ± 0.051
5.23LeuIle: 5.23 ± 0.091
3.142LeuLys: 3.142 ± 0.066
9.658LeuLeu: 9.658 ± 0.129
2.327LeuMet: 2.327 ± 0.049
2.83LeuAsn: 2.83 ± 0.062
5.547LeuPro: 5.547 ± 0.09
2.852LeuGln: 2.852 ± 0.054
6.712LeuArg: 6.712 ± 0.095
6.741LeuSer: 6.741 ± 0.082
5.833LeuThr: 5.833 ± 0.096
7.913LeuVal: 7.913 ± 0.107
1.147LeuTrp: 1.147 ± 0.039
2.107LeuTyr: 2.107 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
3.287MetAla: 3.287 ± 0.07
0.128MetCys: 0.128 ± 0.013
1.215MetAsp: 1.215 ± 0.039
1.058MetGlu: 1.058 ± 0.035
0.841MetPhe: 0.841 ± 0.033
1.794MetGly: 1.794 ± 0.045
0.453MetHis: 0.453 ± 0.024
1.375MetIle: 1.375 ± 0.041
0.845MetLys: 0.845 ± 0.029
2.616MetLeu: 2.616 ± 0.054
0.682MetMet: 0.682 ± 0.03
0.766MetAsn: 0.766 ± 0.029
1.498MetPro: 1.498 ± 0.04
0.861MetGln: 0.861 ± 0.034
1.765MetArg: 1.765 ± 0.046
1.812MetSer: 1.812 ± 0.043
1.918MetThr: 1.918 ± 0.046
1.873MetVal: 1.873 ± 0.042
0.213MetTrp: 0.213 ± 0.016
0.303MetTyr: 0.303 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.365AsnAla: 3.365 ± 0.078
0.202AsnCys: 0.202 ± 0.018
1.607AsnAsp: 1.607 ± 0.058
1.354AsnGlu: 1.354 ± 0.045
1.104AsnPhe: 1.104 ± 0.037
2.758AsnGly: 2.758 ± 0.075
0.552AsnHis: 0.552 ± 0.027
1.572AsnIle: 1.572 ± 0.043
0.815AsnLys: 0.815 ± 0.033
2.845AsnLeu: 2.845 ± 0.064
0.687AsnMet: 0.687 ± 0.028
0.853AsnAsn: 0.853 ± 0.033
1.892AsnPro: 1.892 ± 0.053
0.871AsnGln: 0.871 ± 0.033
1.951AsnArg: 1.951 ± 0.05
1.637AsnSer: 1.637 ± 0.068
1.504AsnThr: 1.504 ± 0.051
2.048AsnVal: 2.048 ± 0.063
0.525AsnTrp: 0.525 ± 0.024
0.793AsnTyr: 0.793 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
5.985ProAla: 5.985 ± 0.106
0.275ProCys: 0.275 ± 0.018
3.414ProAsp: 3.414 ± 0.076
3.608ProGlu: 3.608 ± 0.083
1.951ProPhe: 1.951 ± 0.042
4.08ProGly: 4.08 ± 0.078
1.026ProHis: 1.026 ± 0.033
2.693ProIle: 2.693 ± 0.058
1.63ProLys: 1.63 ± 0.046
4.568ProLeu: 4.568 ± 0.081
1.27ProMet: 1.27 ± 0.043
1.515ProAsn: 1.515 ± 0.043
2.189ProPro: 2.189 ± 0.059
1.703ProGln: 1.703 ± 0.046
2.735ProArg: 2.735 ± 0.057
2.831ProSer: 2.831 ± 0.057
2.67ProThr: 2.67 ± 0.061
4.064ProVal: 4.064 ± 0.08
0.647ProTrp: 0.647 ± 0.029
1.165ProTyr: 1.165 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
4.161GlnAla: 4.161 ± 0.068
0.234GlnCys: 0.234 ± 0.018
1.551GlnAsp: 1.551 ± 0.043
1.444GlnGlu: 1.444 ± 0.041
1.295GlnPhe: 1.295 ± 0.042
2.423GlnGly: 2.423 ± 0.054
0.73GlnHis: 0.73 ± 0.03
1.941GlnIle: 1.941 ± 0.043
1.019GlnLys: 1.019 ± 0.035
3.255GlnLeu: 3.255 ± 0.067
1.052GlnMet: 1.052 ± 0.037
1.024GlnAsn: 1.024 ± 0.034
1.969GlnPro: 1.969 ± 0.053
1.364GlnGln: 1.364 ± 0.046
2.564GlnArg: 2.564 ± 0.06
2.09GlnSer: 2.09 ± 0.051
1.936GlnThr: 1.936 ± 0.051
2.453GlnVal: 2.453 ± 0.06
0.401GlnTrp: 0.401 ± 0.021
0.728GlnTyr: 0.728 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
6.963ArgAla: 6.963 ± 0.096
0.447ArgCys: 0.447 ± 0.025
3.778ArgAsp: 3.778 ± 0.068
3.71ArgGlu: 3.71 ± 0.066
2.717ArgPhe: 2.717 ± 0.057
4.302ArgGly: 4.302 ± 0.069
1.59ArgHis: 1.59 ± 0.041
3.712ArgIle: 3.712 ± 0.074
2.16ArgLys: 2.16 ± 0.053
7.208ArgLeu: 7.208 ± 0.117
1.807ArgMet: 1.807 ± 0.043
1.922ArgAsn: 1.922 ± 0.046
3.251ArgPro: 3.251 ± 0.075
2.804ArgGln: 2.804 ± 0.054
4.942ArgArg: 4.942 ± 0.101
3.849ArgSer: 3.849 ± 0.068
3.292ArgThr: 3.292 ± 0.067
4.441ArgVal: 4.441 ± 0.073
0.905ArgTrp: 0.905 ± 0.034
1.586ArgTyr: 1.586 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
6.503SerAla: 6.503 ± 0.104
0.365SerCys: 0.365 ± 0.021
3.137SerAsp: 3.137 ± 0.07
2.934SerGlu: 2.934 ± 0.065
2.489SerPhe: 2.489 ± 0.059
5.659SerGly: 5.659 ± 0.103
1.056SerHis: 1.056 ± 0.033
3.069SerIle: 3.069 ± 0.068
1.698SerLys: 1.698 ± 0.047
5.56SerLeu: 5.56 ± 0.075
1.515SerMet: 1.515 ± 0.044
1.651SerAsn: 1.651 ± 0.051
2.678SerPro: 2.678 ± 0.065
1.785SerGln: 1.785 ± 0.051
3.658SerArg: 3.658 ± 0.067
3.302SerSer: 3.302 ± 0.081
3.107SerThr: 3.107 ± 0.082
4.49SerVal: 4.49 ± 0.097
0.815SerTrp: 0.815 ± 0.031
1.302SerTyr: 1.302 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
6.507ThrAla: 6.507 ± 0.105
0.326ThrCys: 0.326 ± 0.023
3.129ThrAsp: 3.129 ± 0.068
2.967ThrGlu: 2.967 ± 0.065
2.121ThrPhe: 2.121 ± 0.05
5.263ThrGly: 5.263 ± 0.097
1.026ThrHis: 1.026 ± 0.032
3.427ThrIle: 3.427 ± 0.065
1.528ThrLys: 1.528 ± 0.038
6.257ThrLeu: 6.257 ± 0.092
1.318ThrMet: 1.318 ± 0.036
1.589ThrAsn: 1.589 ± 0.052
3.195ThrPro: 3.195 ± 0.066
1.628ThrGln: 1.628 ± 0.038
3.169ThrArg: 3.169 ± 0.075
3.092ThrSer: 3.092 ± 0.077
3.241ThrThr: 3.241 ± 0.081
4.526ThrVal: 4.526 ± 0.091
0.646ThrTrp: 0.646 ± 0.029
1.226ThrTyr: 1.226 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
9.175ValAla: 9.175 ± 0.109
0.486ValCys: 0.486 ± 0.026
4.505ValAsp: 4.505 ± 0.071
4.727ValGlu: 4.727 ± 0.083
2.739ValPhe: 2.739 ± 0.068
5.874ValGly: 5.874 ± 0.092
1.293ValHis: 1.293 ± 0.039
4.18ValIle: 4.18 ± 0.077
2.105ValLys: 2.105 ± 0.053
7.864ValLeu: 7.864 ± 0.12
1.86ValMet: 1.86 ± 0.049
2.127ValAsn: 2.127 ± 0.06
3.761ValPro: 3.761 ± 0.067
2.122ValGln: 2.122 ± 0.055
4.555ValArg: 4.555 ± 0.072
4.718ValSer: 4.718 ± 0.073
4.842ValThr: 4.842 ± 0.083
6.304ValVal: 6.304 ± 0.097
0.88ValTrp: 0.88 ± 0.039
1.521ValTyr: 1.521 ± 0.043
0.0ValXaa: 0.0 ± 0.0
Trp
1.138TrpAla: 1.138 ± 0.043
0.128TrpCys: 0.128 ± 0.012
0.722TrpAsp: 0.722 ± 0.029
0.518TrpGlu: 0.518 ± 0.024
0.573TrpPhe: 0.573 ± 0.027
0.882TrpGly: 0.882 ± 0.031
0.309TrpHis: 0.309 ± 0.019
0.637TrpIle: 0.637 ± 0.029
0.381TrpLys: 0.381 ± 0.02
1.61TrpLeu: 1.61 ± 0.058
0.32TrpMet: 0.32 ± 0.017
0.447TrpAsn: 0.447 ± 0.021
0.729TrpPro: 0.729 ± 0.031
0.623TrpGln: 0.623 ± 0.026
1.052TrpArg: 1.052 ± 0.036
0.92TrpSer: 0.92 ± 0.034
0.851TrpThr: 0.851 ± 0.033
0.865TrpVal: 0.865 ± 0.031
0.233TrpTrp: 0.233 ± 0.019
0.28TrpTyr: 0.28 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.473TyrAla: 2.473 ± 0.047
0.215TyrCys: 0.215 ± 0.013
1.432TyrAsp: 1.432 ± 0.042
1.269TyrGlu: 1.269 ± 0.047
0.986TyrPhe: 0.986 ± 0.036
2.123TyrGly: 2.123 ± 0.056
0.413TyrHis: 0.413 ± 0.022
0.923TyrIle: 0.923 ± 0.032
0.577TyrLys: 0.577 ± 0.024
2.276TyrLeu: 2.276 ± 0.051
0.424TyrMet: 0.424 ± 0.022
0.678TyrAsn: 0.678 ± 0.026
1.036TyrPro: 1.036 ± 0.038
0.786TyrGln: 0.786 ± 0.036
1.667TyrArg: 1.667 ± 0.044
1.24TyrSer: 1.24 ± 0.043
1.086TyrThr: 1.086 ± 0.038
1.66TyrVal: 1.66 ± 0.049
0.377TyrTrp: 0.377 ± 0.021
0.61TyrTyr: 0.61 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3157 proteins (868461 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski