Amino acid dipepetide frequency for Thermoactinomyces sp. DSM 45891

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.279AlaAla: 4.279 ± 0.09
0.662AlaCys: 0.662 ± 0.029
3.377AlaAsp: 3.377 ± 0.066
4.403AlaGlu: 4.403 ± 0.081
2.791AlaPhe: 2.791 ± 0.058
4.946AlaGly: 4.946 ± 0.091
1.461AlaHis: 1.461 ± 0.043
5.432AlaIle: 5.432 ± 0.086
4.293AlaLys: 4.293 ± 0.081
6.601AlaLeu: 6.601 ± 0.093
1.946AlaMet: 1.946 ± 0.054
2.552AlaAsn: 2.552 ± 0.055
2.179AlaPro: 2.179 ± 0.054
2.439AlaGln: 2.439 ± 0.055
3.17AlaArg: 3.17 ± 0.067
4.05AlaSer: 4.05 ± 0.07
3.988AlaThr: 3.988 ± 0.085
4.908AlaVal: 4.908 ± 0.084
0.731AlaTrp: 0.731 ± 0.029
2.239AlaTyr: 2.239 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.48CysAla: 0.48 ± 0.031
0.113CysCys: 0.113 ± 0.011
0.446CysAsp: 0.446 ± 0.022
0.493CysGlu: 0.493 ± 0.025
0.408CysPhe: 0.408 ± 0.022
0.753CysGly: 0.753 ± 0.028
0.21CysHis: 0.21 ± 0.017
0.629CysIle: 0.629 ± 0.029
0.458CysLys: 0.458 ± 0.026
0.824CysLeu: 0.824 ± 0.036
0.259CysMet: 0.259 ± 0.017
0.319CysAsn: 0.319 ± 0.018
0.404CysPro: 0.404 ± 0.022
0.368CysGln: 0.368 ± 0.023
0.39CysArg: 0.39 ± 0.023
0.596CysSer: 0.596 ± 0.029
0.509CysThr: 0.509 ± 0.022
0.553CysVal: 0.553 ± 0.029
0.105CysTrp: 0.105 ± 0.013
0.322CysTyr: 0.322 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.128AspAla: 3.128 ± 0.069
0.419AspCys: 0.419 ± 0.024
2.372AspAsp: 2.372 ± 0.055
3.882AspGlu: 3.882 ± 0.083
2.326AspPhe: 2.326 ± 0.063
3.602AspGly: 3.602 ± 0.08
1.386AspHis: 1.386 ± 0.043
3.312AspIle: 3.312 ± 0.067
2.547AspLys: 2.547 ± 0.065
5.721AspLeu: 5.721 ± 0.089
1.117AspMet: 1.117 ± 0.039
1.406AspAsn: 1.406 ± 0.046
2.502AspPro: 2.502 ± 0.061
2.707AspGln: 2.707 ± 0.064
2.867AspArg: 2.867 ± 0.061
2.878AspSer: 2.878 ± 0.064
2.549AspThr: 2.549 ± 0.064
3.651AspVal: 3.651 ± 0.067
0.773AspTrp: 0.773 ± 0.031
1.889AspTyr: 1.889 ± 0.055
0.0AspXaa: 0.0 ± 0.0
Glu
4.858GluAla: 4.858 ± 0.08
0.542GluCys: 0.542 ± 0.022
3.275GluAsp: 3.275 ± 0.064
6.077GluGlu: 6.077 ± 0.109
2.359GluPhe: 2.359 ± 0.055
4.035GluGly: 4.035 ± 0.059
1.453GluHis: 1.453 ± 0.039
5.438GluIle: 5.438 ± 0.088
5.53GluLys: 5.53 ± 0.088
6.763GluLeu: 6.763 ± 0.093
2.298GluMet: 2.298 ± 0.054
2.704GluAsn: 2.704 ± 0.055
2.036GluPro: 2.036 ± 0.054
3.694GluGln: 3.694 ± 0.069
3.861GluArg: 3.861 ± 0.067
3.77GluSer: 3.77 ± 0.076
3.329GluThr: 3.329 ± 0.059
5.159GluVal: 5.159 ± 0.091
1.036GluTrp: 1.036 ± 0.035
2.24GluTyr: 2.24 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
2.794PheAla: 2.794 ± 0.055
0.42PheCys: 0.42 ± 0.021
2.307PheAsp: 2.307 ± 0.056
2.529PheGlu: 2.529 ± 0.057
2.082PhePhe: 2.082 ± 0.071
3.088PheGly: 3.088 ± 0.067
1.13PheHis: 1.13 ± 0.034
2.969PheIle: 2.969 ± 0.072
1.567PheLys: 1.567 ± 0.047
4.318PheLeu: 4.318 ± 0.083
1.039PheMet: 1.039 ± 0.038
1.312PheAsn: 1.312 ± 0.041
1.773PhePro: 1.773 ± 0.049
1.947PheGln: 1.947 ± 0.053
1.876PheArg: 1.876 ± 0.046
2.993PheSer: 2.993 ± 0.078
2.558PheThr: 2.558 ± 0.063
3.085PheVal: 3.085 ± 0.063
0.474PheTrp: 0.474 ± 0.025
1.522PheTyr: 1.522 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
4.991GlyAla: 4.991 ± 0.108
0.687GlyCys: 0.687 ± 0.031
3.528GlyAsp: 3.528 ± 0.074
4.529GlyGlu: 4.529 ± 0.072
3.185GlyPhe: 3.185 ± 0.074
4.846GlyGly: 4.846 ± 0.105
1.432GlyHis: 1.432 ± 0.041
5.617GlyIle: 5.617 ± 0.105
4.849GlyLys: 4.849 ± 0.087
6.473GlyLeu: 6.473 ± 0.086
2.105GlyMet: 2.105 ± 0.052
2.805GlyAsn: 2.805 ± 0.091
2.842GlyPro: 2.842 ± 0.19
2.265GlyGln: 2.265 ± 0.057
3.073GlyArg: 3.073 ± 0.068
4.46GlySer: 4.46 ± 0.077
4.265GlyThr: 4.265 ± 0.103
5.519GlyVal: 5.519 ± 0.095
0.91GlyTrp: 0.91 ± 0.039
2.538GlyTyr: 2.538 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
1.401HisAla: 1.401 ± 0.044
0.213HisCys: 0.213 ± 0.015
1.067HisAsp: 1.067 ± 0.028
1.421HisGlu: 1.421 ± 0.042
1.147HisPhe: 1.147 ± 0.04
1.487HisGly: 1.487 ± 0.041
0.768HisHis: 0.768 ± 0.034
1.64HisIle: 1.64 ± 0.043
1.1HisLys: 1.1 ± 0.038
2.402HisLeu: 2.402 ± 0.055
0.548HisMet: 0.548 ± 0.026
0.722HisAsn: 0.722 ± 0.029
1.333HisPro: 1.333 ± 0.036
1.158HisGln: 1.158 ± 0.039
1.213HisArg: 1.213 ± 0.04
1.528HisSer: 1.528 ± 0.044
1.193HisThr: 1.193 ± 0.037
1.591HisVal: 1.591 ± 0.047
0.271HisTrp: 0.271 ± 0.019
0.935HisTyr: 0.935 ± 0.033
0.001HisXaa: 0.001 ± 0.001
Ile
5.56IleAla: 5.56 ± 0.089
0.763IleCys: 0.763 ± 0.036
3.855IleAsp: 3.855 ± 0.07
5.059IleGlu: 5.059 ± 0.077
2.668IlePhe: 2.668 ± 0.065
5.558IleGly: 5.558 ± 0.097
2.007IleHis: 2.007 ± 0.054
4.353IleIle: 4.353 ± 0.087
3.448IleLys: 3.448 ± 0.067
6.645IleLeu: 6.645 ± 0.098
1.543IleMet: 1.543 ± 0.045
2.438IleAsn: 2.438 ± 0.053
3.587IlePro: 3.587 ± 0.066
3.657IleGln: 3.657 ± 0.073
3.871IleArg: 3.871 ± 0.072
4.887IleSer: 4.887 ± 0.085
4.232IleThr: 4.232 ± 0.076
4.942IleVal: 4.942 ± 0.085
0.788IleTrp: 0.788 ± 0.03
2.362IleTyr: 2.362 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
3.996LysAla: 3.996 ± 0.071
0.406LysCys: 0.406 ± 0.023
3.137LysAsp: 3.137 ± 0.079
5.443LysGlu: 5.443 ± 0.099
1.622LysPhe: 1.622 ± 0.05
4.072LysGly: 4.072 ± 0.07
1.215LysHis: 1.215 ± 0.04
3.97LysIle: 3.97 ± 0.076
5.523LysLys: 5.523 ± 0.119
5.115LysLeu: 5.115 ± 0.08
2.008LysMet: 2.008 ± 0.046
2.421LysAsn: 2.421 ± 0.056
2.347LysPro: 2.347 ± 0.059
3.133LysGln: 3.133 ± 0.06
3.546LysArg: 3.546 ± 0.068
3.429LysSer: 3.429 ± 0.063
3.047LysThr: 3.047 ± 0.065
4.565LysVal: 4.565 ± 0.079
0.879LysTrp: 0.879 ± 0.037
1.875LysTyr: 1.875 ± 0.055
0.0LysXaa: 0.0 ± 0.0
Leu
7.281LeuAla: 7.281 ± 0.098
0.9LeuCys: 0.9 ± 0.038
5.104LeuAsp: 5.104 ± 0.083
6.693LeuGlu: 6.693 ± 0.096
4.448LeuPhe: 4.448 ± 0.089
6.633LeuGly: 6.633 ± 0.092
2.257LeuHis: 2.257 ± 0.059
6.324LeuIle: 6.324 ± 0.094
5.196LeuLys: 5.196 ± 0.085
9.661LeuLeu: 9.661 ± 0.128
2.395LeuMet: 2.395 ± 0.054
3.373LeuAsn: 3.373 ± 0.072
4.132LeuPro: 4.132 ± 0.077
4.371LeuGln: 4.371 ± 0.08
4.382LeuArg: 4.382 ± 0.073
7.222LeuSer: 7.222 ± 0.117
5.421LeuThr: 5.421 ± 0.079
6.753LeuVal: 6.753 ± 0.093
0.93LeuTrp: 0.93 ± 0.033
3.269LeuTyr: 3.269 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
1.806MetAla: 1.806 ± 0.054
0.185MetCys: 0.185 ± 0.015
1.568MetAsp: 1.568 ± 0.043
1.989MetGlu: 1.989 ± 0.049
0.925MetPhe: 0.925 ± 0.034
1.973MetGly: 1.973 ± 0.05
0.434MetHis: 0.434 ± 0.023
2.315MetIle: 2.315 ± 0.06
2.384MetLys: 2.384 ± 0.055
2.328MetLeu: 2.328 ± 0.054
0.975MetMet: 0.975 ± 0.036
1.428MetAsn: 1.428 ± 0.041
0.841MetPro: 0.841 ± 0.033
0.911MetGln: 0.911 ± 0.036
1.249MetArg: 1.249 ± 0.04
1.839MetSer: 1.839 ± 0.048
1.449MetThr: 1.449 ± 0.038
1.936MetVal: 1.936 ± 0.046
0.231MetTrp: 0.231 ± 0.017
0.719MetTyr: 0.719 ± 0.032
0.0MetXaa: 0.0 ± 0.0
Asn
2.219AsnAla: 2.219 ± 0.051
0.304AsnCys: 0.304 ± 0.021
1.657AsnAsp: 1.657 ± 0.05
2.57AsnGlu: 2.57 ± 0.052
1.304AsnPhe: 1.304 ± 0.04
2.76AsnGly: 2.76 ± 0.077
1.057AsnHis: 1.057 ± 0.031
2.537AsnIle: 2.537 ± 0.065
2.181AsnLys: 2.181 ± 0.058
3.445AsnLeu: 3.445 ± 0.063
0.941AsnMet: 0.941 ± 0.033
1.385AsnAsn: 1.385 ± 0.047
2.169AsnPro: 2.169 ± 0.056
2.411AsnGln: 2.411 ± 0.051
2.145AsnArg: 2.145 ± 0.052
2.104AsnSer: 2.104 ± 0.058
2.105AsnThr: 2.105 ± 0.076
2.592AsnVal: 2.592 ± 0.058
0.548AsnTrp: 0.548 ± 0.027
1.191AsnTyr: 1.191 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
2.372ProAla: 2.372 ± 0.059
0.282ProCys: 0.282 ± 0.018
2.383ProAsp: 2.383 ± 0.054
3.128ProGlu: 3.128 ± 0.058
1.958ProPhe: 1.958 ± 0.048
2.433ProGly: 2.433 ± 0.059
0.944ProHis: 0.944 ± 0.036
3.05ProIle: 3.05 ± 0.066
2.264ProLys: 2.264 ± 0.061
3.571ProLeu: 3.571 ± 0.068
0.924ProMet: 0.924 ± 0.033
1.801ProAsn: 1.801 ± 0.05
1.224ProPro: 1.224 ± 0.043
2.004ProGln: 2.004 ± 0.118
1.538ProArg: 1.538 ± 0.043
2.767ProSer: 2.767 ± 0.059
2.699ProThr: 2.699 ± 0.072
2.982ProVal: 2.982 ± 0.064
0.444ProTrp: 0.444 ± 0.024
1.552ProTyr: 1.552 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
3.215GlnAla: 3.215 ± 0.065
0.269GlnCys: 0.269 ± 0.019
2.119GlnAsp: 2.119 ± 0.054
3.307GlnGlu: 3.307 ± 0.07
1.72GlnPhe: 1.72 ± 0.046
3.499GlnGly: 3.499 ± 0.135
0.883GlnHis: 0.883 ± 0.03
3.32GlnIle: 3.32 ± 0.07
3.096GlnLys: 3.096 ± 0.072
4.441GlnLeu: 4.441 ± 0.074
1.404GlnMet: 1.404 ± 0.043
1.695GlnAsn: 1.695 ± 0.043
1.457GlnPro: 1.457 ± 0.038
1.971GlnGln: 1.971 ± 0.06
1.869GlnArg: 1.869 ± 0.052
2.602GlnSer: 2.602 ± 0.059
2.286GlnThr: 2.286 ± 0.056
3.539GlnVal: 3.539 ± 0.074
0.47GlnTrp: 0.47 ± 0.028
1.513GlnTyr: 1.513 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
2.865ArgAla: 2.865 ± 0.059
0.389ArgCys: 0.389 ± 0.024
2.524ArgAsp: 2.524 ± 0.048
3.829ArgGlu: 3.829 ± 0.071
2.327ArgPhe: 2.327 ± 0.051
2.893ArgGly: 2.893 ± 0.058
0.933ArgHis: 0.933 ± 0.037
3.564ArgIle: 3.564 ± 0.062
3.379ArgLys: 3.379 ± 0.065
4.893ArgLeu: 4.893 ± 0.082
1.596ArgMet: 1.596 ± 0.044
2.045ArgAsn: 2.045 ± 0.052
1.703ArgPro: 1.703 ± 0.043
1.897ArgGln: 1.897 ± 0.046
2.394ArgArg: 2.394 ± 0.069
2.871ArgSer: 2.871 ± 0.053
2.468ArgThr: 2.468 ± 0.052
3.536ArgVal: 3.536 ± 0.063
0.626ArgTrp: 0.626 ± 0.027
1.918ArgTyr: 1.918 ± 0.044
0.0ArgXaa: 0.0 ± 0.0
Ser
3.746SerAla: 3.746 ± 0.071
0.507SerCys: 0.507 ± 0.022
3.016SerAsp: 3.016 ± 0.065
3.673SerGlu: 3.673 ± 0.065
3.194SerPhe: 3.194 ± 0.072
4.739SerGly: 4.739 ± 0.076
1.519SerHis: 1.519 ± 0.045
4.937SerIle: 4.937 ± 0.074
3.941SerLys: 3.941 ± 0.072
6.446SerLeu: 6.446 ± 0.102
1.88SerMet: 1.88 ± 0.048
2.469SerAsn: 2.469 ± 0.063
2.433SerPro: 2.433 ± 0.061
2.449SerGln: 2.449 ± 0.049
2.947SerArg: 2.947 ± 0.058
4.444SerSer: 4.444 ± 0.084
3.64SerThr: 3.64 ± 0.07
4.393SerVal: 4.393 ± 0.069
0.786SerTrp: 0.786 ± 0.032
2.273SerTyr: 2.273 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
3.504ThrAla: 3.504 ± 0.063
0.48ThrCys: 0.48 ± 0.025
2.784ThrAsp: 2.784 ± 0.067
3.398ThrGlu: 3.398 ± 0.066
2.408ThrPhe: 2.408 ± 0.058
5.338ThrGly: 5.338 ± 0.195
1.289ThrHis: 1.289 ± 0.037
4.23ThrIle: 4.23 ± 0.075
3.14ThrLys: 3.14 ± 0.068
5.259ThrLeu: 5.259 ± 0.073
1.347ThrMet: 1.347 ± 0.041
2.194ThrAsn: 2.194 ± 0.059
2.687ThrPro: 2.687 ± 0.06
1.891ThrGln: 1.891 ± 0.049
2.255ThrArg: 2.255 ± 0.049
3.518ThrSer: 3.518 ± 0.06
3.304ThrThr: 3.304 ± 0.079
3.981ThrVal: 3.981 ± 0.081
0.691ThrTrp: 0.691 ± 0.033
2.049ThrTyr: 2.049 ± 0.073
0.002ThrXaa: 0.002 ± 0.002
Val
5.248ValAla: 5.248 ± 0.092
0.678ValCys: 0.678 ± 0.028
3.884ValAsp: 3.884 ± 0.064
5.175ValGlu: 5.175 ± 0.076
2.855ValPhe: 2.855 ± 0.061
5.248ValGly: 5.248 ± 0.093
1.613ValHis: 1.613 ± 0.046
5.216ValIle: 5.216 ± 0.097
4.292ValLys: 4.292 ± 0.081
6.71ValLeu: 6.71 ± 0.09
1.936ValMet: 1.936 ± 0.051
2.685ValAsn: 2.685 ± 0.062
2.957ValPro: 2.957 ± 0.062
2.919ValGln: 2.919 ± 0.058
3.519ValArg: 3.519 ± 0.067
4.694ValSer: 4.694 ± 0.082
4.23ValThr: 4.23 ± 0.079
5.572ValVal: 5.572 ± 0.097
0.737ValTrp: 0.737 ± 0.032
2.283ValTyr: 2.283 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.645TrpAla: 0.645 ± 0.027
0.121TrpCys: 0.121 ± 0.011
0.689TrpAsp: 0.689 ± 0.033
0.755TrpGlu: 0.755 ± 0.031
0.535TrpPhe: 0.535 ± 0.025
0.698TrpGly: 0.698 ± 0.031
0.22TrpHis: 0.22 ± 0.018
1.148TrpIle: 1.148 ± 0.044
0.899TrpLys: 0.899 ± 0.034
1.368TrpLeu: 1.368 ± 0.047
0.457TrpMet: 0.457 ± 0.024
0.624TrpAsn: 0.624 ± 0.034
0.24TrpPro: 0.24 ± 0.02
0.435TrpGln: 0.435 ± 0.023
0.507TrpArg: 0.507 ± 0.024
0.729TrpSer: 0.729 ± 0.031
0.613TrpThr: 0.613 ± 0.029
0.781TrpVal: 0.781 ± 0.032
0.183TrpTrp: 0.183 ± 0.014
0.375TrpTyr: 0.375 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.06TyrAla: 2.06 ± 0.051
0.296TyrCys: 0.296 ± 0.02
1.943TyrAsp: 1.943 ± 0.08
2.105TyrGlu: 2.105 ± 0.051
1.519TyrPhe: 1.519 ± 0.045
2.321TyrGly: 2.321 ± 0.052
0.928TyrHis: 0.928 ± 0.034
2.225TyrIle: 2.225 ± 0.05
1.663TyrLys: 1.663 ± 0.052
3.76TyrLeu: 3.76 ± 0.067
0.763TyrMet: 0.763 ± 0.034
1.249TyrAsn: 1.249 ± 0.05
1.475TyrPro: 1.475 ± 0.046
2.046TyrGln: 2.046 ± 0.049
1.976TyrArg: 1.976 ± 0.049
2.051TyrSer: 2.051 ± 0.051
1.863TyrThr: 1.863 ± 0.053
2.412TyrVal: 2.412 ± 0.049
0.408TyrTrp: 0.408 ± 0.021
1.356TyrTyr: 1.356 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.001
0.001XaaArg: 0.001 ± 0.001
0.0XaaSer: 0.0 ± 0.0
0.001XaaThr: 0.001 ± 0.001
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.03XaaXaa: 0.03 ± 0.018
Statistics based on 2915 proteins (868414 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski