Amino acid dipepetide frequency for Thermomonas sp. SY21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.717AlaAla: 19.717 ± 0.246
1.439AlaCys: 1.439 ± 0.047
7.838AlaAsp: 7.838 ± 0.112
7.02AlaGlu: 7.02 ± 0.121
4.395AlaPhe: 4.395 ± 0.068
12.075AlaGly: 12.075 ± 0.132
2.773AlaHis: 2.773 ± 0.063
5.957AlaIle: 5.957 ± 0.087
4.338AlaLys: 4.338 ± 0.09
16.259AlaLeu: 16.259 ± 0.189
4.033AlaMet: 4.033 ± 0.068
3.29AlaAsn: 3.29 ± 0.058
6.277AlaPro: 6.277 ± 0.097
5.756AlaGln: 5.756 ± 0.111
10.277AlaArg: 10.277 ± 0.13
6.732AlaSer: 6.732 ± 0.101
6.06AlaThr: 6.06 ± 0.089
8.492AlaVal: 8.492 ± 0.103
2.2AlaTrp: 2.2 ± 0.057
2.707AlaTyr: 2.707 ± 0.056
0.0AlaXaa: 0.0 ± 0.0
Cys
1.187CysAla: 1.187 ± 0.039
0.103CysCys: 0.103 ± 0.011
0.493CysAsp: 0.493 ± 0.025
0.496CysGlu: 0.496 ± 0.023
0.271CysPhe: 0.271 ± 0.017
0.96CysGly: 0.96 ± 0.033
0.224CysHis: 0.224 ± 0.015
0.392CysIle: 0.392 ± 0.022
0.253CysLys: 0.253 ± 0.016
0.771CysLeu: 0.771 ± 0.03
0.173CysMet: 0.173 ± 0.015
0.237CysAsn: 0.237 ± 0.016
0.399CysPro: 0.399 ± 0.02
0.185CysGln: 0.185 ± 0.016
0.611CysArg: 0.611 ± 0.03
0.458CysSer: 0.458 ± 0.025
0.398CysThr: 0.398 ± 0.02
0.66CysVal: 0.66 ± 0.03
0.141CysTrp: 0.141 ± 0.016
0.185CysTyr: 0.185 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
10.513AspAla: 10.513 ± 0.147
0.481AspCys: 0.481 ± 0.024
3.627AspAsp: 3.627 ± 0.074
3.45AspGlu: 3.45 ± 0.063
2.215AspPhe: 2.215 ± 0.051
6.433AspGly: 6.433 ± 0.096
1.103AspHis: 1.103 ± 0.036
2.596AspIle: 2.596 ± 0.06
1.856AspLys: 1.856 ± 0.055
5.233AspLeu: 5.233 ± 0.084
1.157AspMet: 1.157 ± 0.04
1.421AspAsn: 1.421 ± 0.04
3.44AspPro: 3.44 ± 0.056
1.489AspGln: 1.489 ± 0.045
3.894AspArg: 3.894 ± 0.077
2.401AspSer: 2.401 ± 0.055
2.465AspThr: 2.465 ± 0.056
4.143AspVal: 4.143 ± 0.063
1.112AspTrp: 1.112 ± 0.033
1.738AspTyr: 1.738 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
7.481GluAla: 7.481 ± 0.127
0.36GluCys: 0.36 ± 0.02
2.575GluAsp: 2.575 ± 0.051
2.195GluGlu: 2.195 ± 0.059
1.809GluPhe: 1.809 ± 0.049
3.744GluGly: 3.744 ± 0.069
1.426GluHis: 1.426 ± 0.04
2.494GluIle: 2.494 ± 0.06
1.636GluLys: 1.636 ± 0.048
6.162GluLeu: 6.162 ± 0.093
1.062GluMet: 1.062 ± 0.037
1.111GluAsn: 1.111 ± 0.038
2.401GluPro: 2.401 ± 0.051
2.451GluGln: 2.451 ± 0.051
5.291GluArg: 5.291 ± 0.093
2.397GluSer: 2.397 ± 0.048
2.559GluThr: 2.559 ± 0.05
3.654GluVal: 3.654 ± 0.064
0.693GluTrp: 0.693 ± 0.026
1.176GluTyr: 1.176 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
4.933PheAla: 4.933 ± 0.079
0.325PheCys: 0.325 ± 0.02
2.872PheAsp: 2.872 ± 0.054
1.943PheGlu: 1.943 ± 0.05
1.177PhePhe: 1.177 ± 0.039
3.406PheGly: 3.406 ± 0.078
0.807PheHis: 0.807 ± 0.035
1.261PheIle: 1.261 ± 0.039
0.999PheLys: 0.999 ± 0.031
2.992PheLeu: 2.992 ± 0.064
0.657PheMet: 0.657 ± 0.032
1.025PheAsn: 1.025 ± 0.036
1.433PhePro: 1.433 ± 0.041
0.901PheGln: 0.901 ± 0.031
2.184PheArg: 2.184 ± 0.061
1.794PheSer: 1.794 ± 0.045
1.506PheThr: 1.506 ± 0.049
2.566PheVal: 2.566 ± 0.055
0.524PheTrp: 0.524 ± 0.027
0.819PheTyr: 0.819 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
9.293GlyAla: 9.293 ± 0.131
0.893GlyCys: 0.893 ± 0.032
5.497GlyAsp: 5.497 ± 0.093
5.197GlyGlu: 5.197 ± 0.091
3.532GlyPhe: 3.532 ± 0.071
7.7GlyGly: 7.7 ± 0.126
2.047GlyHis: 2.047 ± 0.056
4.731GlyIle: 4.731 ± 0.083
3.752GlyLys: 3.752 ± 0.065
8.817GlyLeu: 8.817 ± 0.107
2.589GlyMet: 2.589 ± 0.063
2.679GlyAsn: 2.679 ± 0.07
2.959GlyPro: 2.959 ± 0.061
2.831GlyGln: 2.831 ± 0.062
5.997GlyArg: 5.997 ± 0.089
4.734GlySer: 4.734 ± 0.084
4.093GlyThr: 4.093 ± 0.074
6.087GlyVal: 6.087 ± 0.105
1.738GlyTrp: 1.738 ± 0.043
2.448GlyTyr: 2.448 ± 0.06
0.0GlyXaa: 0.0 ± 0.0
His
3.325HisAla: 3.325 ± 0.074
0.271HisCys: 0.271 ± 0.017
1.373HisAsp: 1.373 ± 0.037
1.071HisGlu: 1.071 ± 0.036
0.807HisPhe: 0.807 ± 0.032
2.348HisGly: 2.348 ± 0.055
0.566HisHis: 0.566 ± 0.029
0.731HisIle: 0.731 ± 0.027
0.51HisLys: 0.51 ± 0.024
1.965HisLeu: 1.965 ± 0.048
0.446HisMet: 0.446 ± 0.022
0.473HisAsn: 0.473 ± 0.026
1.458HisPro: 1.458 ± 0.043
0.567HisGln: 0.567 ± 0.022
1.677HisArg: 1.677 ± 0.047
0.89HisSer: 0.89 ± 0.028
0.831HisThr: 0.831 ± 0.032
1.57HisVal: 1.57 ± 0.045
0.417HisTrp: 0.417 ± 0.02
0.588HisTyr: 0.588 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
7.129IleAla: 7.129 ± 0.083
0.33IleCys: 0.33 ± 0.019
3.509IleAsp: 3.509 ± 0.061
3.279IleGlu: 3.279 ± 0.059
1.08IlePhe: 1.08 ± 0.04
4.581IleGly: 4.581 ± 0.08
0.774IleHis: 0.774 ± 0.027
1.218IleIle: 1.218 ± 0.038
1.163IleLys: 1.163 ± 0.041
3.141IleLeu: 3.141 ± 0.067
0.491IleMet: 0.491 ± 0.023
1.123IleAsn: 1.123 ± 0.031
2.145IlePro: 2.145 ± 0.054
1.043IleGln: 1.043 ± 0.031
2.771IleArg: 2.771 ± 0.054
2.013IleSer: 2.013 ± 0.053
1.87IleThr: 1.87 ± 0.053
3.378IleVal: 3.378 ± 0.064
0.343IleTrp: 0.343 ± 0.022
0.771IleTyr: 0.771 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
4.09LysAla: 4.09 ± 0.077
0.146LysCys: 0.146 ± 0.015
1.755LysAsp: 1.755 ± 0.05
1.276LysGlu: 1.276 ± 0.044
0.956LysPhe: 0.956 ± 0.037
2.222LysGly: 2.222 ± 0.057
0.641LysHis: 0.641 ± 0.025
1.322LysIle: 1.322 ± 0.039
1.177LysLys: 1.177 ± 0.049
3.657LysLeu: 3.657 ± 0.072
0.657LysMet: 0.657 ± 0.027
0.706LysAsn: 0.706 ± 0.032
2.452LysPro: 2.452 ± 0.062
1.412LysGln: 1.412 ± 0.042
2.563LysArg: 2.563 ± 0.049
1.541LysSer: 1.541 ± 0.043
1.689LysThr: 1.689 ± 0.043
2.358LysVal: 2.358 ± 0.059
0.346LysTrp: 0.346 ± 0.02
0.758LysTyr: 0.758 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
15.582LeuAla: 15.582 ± 0.173
0.963LeuCys: 0.963 ± 0.033
6.971LeuAsp: 6.971 ± 0.1
5.175LeuGlu: 5.175 ± 0.097
3.373LeuPhe: 3.373 ± 0.072
8.678LeuGly: 8.678 ± 0.111
2.494LeuHis: 2.494 ± 0.053
3.469LeuIle: 3.469 ± 0.069
3.274LeuLys: 3.274 ± 0.062
12.238LeuLeu: 12.238 ± 0.184
1.976LeuMet: 1.976 ± 0.052
2.274LeuAsn: 2.274 ± 0.055
6.43LeuPro: 6.43 ± 0.092
4.17LeuGln: 4.17 ± 0.069
9.396LeuArg: 9.396 ± 0.12
5.522LeuSer: 5.522 ± 0.088
4.12LeuThr: 4.12 ± 0.059
7.707LeuVal: 7.707 ± 0.104
1.458LeuTrp: 1.458 ± 0.043
2.176LeuTyr: 2.176 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
2.885MetAla: 2.885 ± 0.059
0.134MetCys: 0.134 ± 0.014
1.151MetAsp: 1.151 ± 0.035
0.946MetGlu: 0.946 ± 0.033
0.622MetPhe: 0.622 ± 0.03
1.605MetGly: 1.605 ± 0.046
0.486MetHis: 0.486 ± 0.022
0.802MetIle: 0.802 ± 0.034
0.889MetLys: 0.889 ± 0.033
2.592MetLeu: 2.592 ± 0.062
0.437MetMet: 0.437 ± 0.021
0.69MetAsn: 0.69 ± 0.026
1.617MetPro: 1.617 ± 0.045
1.07MetGln: 1.07 ± 0.036
1.991MetArg: 1.991 ± 0.049
1.442MetSer: 1.442 ± 0.036
1.261MetThr: 1.261 ± 0.044
1.4MetVal: 1.4 ± 0.042
0.208MetTrp: 0.208 ± 0.013
0.379MetTyr: 0.379 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.684AsnAla: 3.684 ± 0.069
0.228AsnCys: 0.228 ± 0.016
1.447AsnAsp: 1.447 ± 0.042
1.179AsnGlu: 1.179 ± 0.035
0.896AsnPhe: 0.896 ± 0.032
2.491AsnGly: 2.491 ± 0.08
0.501AsnHis: 0.501 ± 0.025
1.113AsnIle: 1.113 ± 0.038
0.771AsnLys: 0.771 ± 0.03
2.417AsnLeu: 2.417 ± 0.052
0.445AsnMet: 0.445 ± 0.025
0.667AsnAsn: 0.667 ± 0.032
1.845AsnPro: 1.845 ± 0.048
0.727AsnGln: 0.727 ± 0.029
1.79AsnArg: 1.79 ± 0.05
1.127AsnSer: 1.127 ± 0.045
1.259AsnThr: 1.259 ± 0.045
1.7AsnVal: 1.7 ± 0.045
0.375AsnTrp: 0.375 ± 0.021
0.627AsnTyr: 0.627 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
7.542ProAla: 7.542 ± 0.127
0.333ProCys: 0.333 ± 0.022
3.321ProAsp: 3.321 ± 0.057
3.061ProGlu: 3.061 ± 0.061
1.684ProPhe: 1.684 ± 0.044
4.727ProGly: 4.727 ± 0.073
1.051ProHis: 1.051 ± 0.035
1.824ProIle: 1.824 ± 0.054
1.688ProLys: 1.688 ± 0.045
5.404ProLeu: 5.404 ± 0.083
1.321ProMet: 1.321 ± 0.04
1.241ProAsn: 1.241 ± 0.045
2.718ProPro: 2.718 ± 0.085
2.171ProGln: 2.171 ± 0.051
3.536ProArg: 3.536 ± 0.073
2.457ProSer: 2.457 ± 0.056
2.125ProThr: 2.125 ± 0.06
3.912ProVal: 3.912 ± 0.07
0.823ProTrp: 0.823 ± 0.035
1.172ProTyr: 1.172 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
5.685GlnAla: 5.685 ± 0.098
0.239GlnCys: 0.239 ± 0.017
1.621GlnAsp: 1.621 ± 0.043
1.352GlnGlu: 1.352 ± 0.038
1.179GlnPhe: 1.179 ± 0.038
2.773GlnGly: 2.773 ± 0.061
0.799GlnHis: 0.799 ± 0.029
1.328GlnIle: 1.328 ± 0.04
0.999GlnLys: 0.999 ± 0.035
4.089GlnLeu: 4.089 ± 0.074
0.729GlnMet: 0.729 ± 0.027
0.687GlnAsn: 0.687 ± 0.03
2.115GlnPro: 2.115 ± 0.053
1.767GlnGln: 1.767 ± 0.058
3.691GlnArg: 3.691 ± 0.076
1.608GlnSer: 1.608 ± 0.043
1.357GlnThr: 1.357 ± 0.042
2.859GlnVal: 2.859 ± 0.059
0.564GlnTrp: 0.564 ± 0.023
0.811GlnTyr: 0.811 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
9.358ArgAla: 9.358 ± 0.128
0.581ArgCys: 0.581 ± 0.027
5.068ArgAsp: 5.068 ± 0.1
4.918ArgGlu: 4.918 ± 0.088
2.878ArgPhe: 2.878 ± 0.059
5.841ArgGly: 5.841 ± 0.098
1.878ArgHis: 1.878 ± 0.052
4.416ArgIle: 4.416 ± 0.069
2.559ArgLys: 2.559 ± 0.058
8.149ArgLeu: 8.149 ± 0.119
2.135ArgMet: 2.135 ± 0.045
2.135ArgAsn: 2.135 ± 0.047
3.258ArgPro: 3.258 ± 0.076
2.642ArgGln: 2.642 ± 0.062
5.821ArgArg: 5.821 ± 0.118
3.497ArgSer: 3.497 ± 0.061
3.087ArgThr: 3.087 ± 0.065
5.074ArgVal: 5.074 ± 0.082
1.478ArgTrp: 1.478 ± 0.045
2.206ArgTyr: 2.206 ± 0.052
0.0ArgXaa: 0.0 ± 0.0
Ser
5.863SerAla: 5.863 ± 0.095
0.386SerCys: 0.386 ± 0.023
2.666SerAsp: 2.666 ± 0.057
2.425SerGlu: 2.425 ± 0.048
1.774SerPhe: 1.774 ± 0.044
5.14SerGly: 5.14 ± 0.096
1.036SerHis: 1.036 ± 0.036
2.189SerIle: 2.189 ± 0.043
1.655SerLys: 1.655 ± 0.047
5.211SerLeu: 5.211 ± 0.084
1.106SerMet: 1.106 ± 0.038
1.408SerAsn: 1.408 ± 0.042
2.626SerPro: 2.626 ± 0.066
1.658SerGln: 1.658 ± 0.04
3.507SerArg: 3.507 ± 0.07
2.542SerSer: 2.542 ± 0.064
2.405SerThr: 2.405 ± 0.066
3.372SerVal: 3.372 ± 0.072
0.737SerTrp: 0.737 ± 0.031
1.23SerTyr: 1.23 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
5.431ThrAla: 5.431 ± 0.085
0.394ThrCys: 0.394 ± 0.026
2.36ThrAsp: 2.36 ± 0.053
1.924ThrGlu: 1.924 ± 0.045
1.368ThrPhe: 1.368 ± 0.039
4.262ThrGly: 4.262 ± 0.082
0.99ThrHis: 0.99 ± 0.037
1.894ThrIle: 1.894 ± 0.048
1.04ThrLys: 1.04 ± 0.039
5.436ThrLeu: 5.436 ± 0.083
0.9ThrMet: 0.9 ± 0.032
1.041ThrAsn: 1.041 ± 0.04
2.939ThrPro: 2.939 ± 0.058
1.457ThrGln: 1.457 ± 0.036
3.26ThrArg: 3.26 ± 0.065
2.174ThrSer: 2.174 ± 0.049
2.262ThrThr: 2.262 ± 0.063
3.472ThrVal: 3.472 ± 0.069
0.612ThrTrp: 0.612 ± 0.026
1.0ThrTyr: 1.0 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
9.529ValAla: 9.529 ± 0.115
0.635ValCys: 0.635 ± 0.029
4.535ValAsp: 4.535 ± 0.076
4.118ValGlu: 4.118 ± 0.078
2.58ValPhe: 2.58 ± 0.06
5.445ValGly: 5.445 ± 0.093
1.507ValHis: 1.507 ± 0.044
2.904ValIle: 2.904 ± 0.065
1.971ValLys: 1.971 ± 0.054
8.107ValLeu: 8.107 ± 0.102
1.416ValMet: 1.416 ± 0.047
1.894ValAsn: 1.894 ± 0.047
3.651ValPro: 3.651 ± 0.061
2.424ValGln: 2.424 ± 0.057
5.221ValArg: 5.221 ± 0.08
3.537ValSer: 3.537 ± 0.064
3.033ValThr: 3.033 ± 0.065
5.576ValVal: 5.576 ± 0.097
0.842ValTrp: 0.842 ± 0.035
1.488ValTyr: 1.488 ± 0.043
0.0ValXaa: 0.0 ± 0.0
Trp
1.297TrpAla: 1.297 ± 0.039
0.152TrpCys: 0.152 ± 0.012
0.708TrpAsp: 0.708 ± 0.031
0.52TrpGlu: 0.52 ± 0.023
0.614TrpPhe: 0.614 ± 0.025
0.946TrpGly: 0.946 ± 0.034
0.372TrpHis: 0.372 ± 0.021
0.715TrpIle: 0.715 ± 0.031
0.543TrpLys: 0.543 ± 0.028
2.391TrpLeu: 2.391 ± 0.062
0.441TrpMet: 0.441 ± 0.022
0.512TrpAsn: 0.512 ± 0.027
0.824TrpPro: 0.824 ± 0.029
0.753TrpGln: 0.753 ± 0.029
1.466TrpArg: 1.466 ± 0.043
0.838TrpSer: 0.838 ± 0.03
0.691TrpThr: 0.691 ± 0.031
0.863TrpVal: 0.863 ± 0.035
0.342TrpTrp: 0.342 ± 0.022
0.348TrpTyr: 0.348 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.11TyrAla: 3.11 ± 0.059
0.206TyrCys: 0.206 ± 0.016
1.375TyrAsp: 1.375 ± 0.045
1.135TyrGlu: 1.135 ± 0.038
0.915TyrPhe: 0.915 ± 0.033
2.233TyrGly: 2.233 ± 0.069
0.408TyrHis: 0.408 ± 0.018
0.796TyrIle: 0.796 ± 0.03
0.654TyrLys: 0.654 ± 0.027
2.416TyrLeu: 2.416 ± 0.044
0.434TyrMet: 0.434 ± 0.023
0.675TyrAsn: 0.675 ± 0.038
1.123TyrPro: 1.123 ± 0.04
0.779TyrGln: 0.779 ± 0.028
2.034TyrArg: 2.034 ± 0.055
1.232TyrSer: 1.232 ± 0.046
1.147TyrThr: 1.147 ± 0.037
1.571TyrVal: 1.571 ± 0.045
0.382TyrTrp: 0.382 ± 0.022
0.616TyrTyr: 0.616 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2753 proteins (912151 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski