Amino acid dipeptide frequency for Thermincola potens (strain JR)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
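A minimal sketch of how such a frequency could be computed is given below. It assumes one-letter sequences as input, counts overlapping pairs within each protein (never across protein boundaries), and maps any non-standard symbol to X (Xaa). The function name and these conventions are illustrative assumptions, not the database's documented procedure.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAL" gives MA, AA, AL and "AAK" gives AA, AK (5 pairs in total).
freqs = dipeptide_per_mille(["MAAL", "AAK"])
print(freqs["AA"])  # 2 of 5 pairs
```

The table below uses three-letter codes (AlaAla rather than AA); translating between the two notations is left out of the sketch.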

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
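The uniform expectation and the over/under comparison can be sketched as follows. This assumes the marking simply compares each value with 1/400 (or 1/441 when Xaa is included); the page does not state the exact rule, for example whether the error margin is taken into account, so the function names and the threshold logic are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille(20))  # 2.5 (i.e. 0.25%)
print(classify(9.671))  # LeuLeu from the table: over
print(classify(0.098))  # TrpCys from the table: under
```

With 21 symbols the expectation drops to 1000/441, roughly 2.27‰.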

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
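As a small worked example of this unit conversion, the snippet below turns the AlaAla value from the table into a fraction and a percentage. The helper names are illustrative only.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 8.261  # AlaAla, in per mille, taken from the table
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # 0.008261
print(f"{per_mille_to_percent(ala_ala):.4f}%")  # 0.8261%
```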

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.261 ± 0.132
AlaCys: 0.93 ± 0.039
AlaAsp: 4.245 ± 0.068
AlaGlu: 5.965 ± 0.102
AlaPhe: 2.982 ± 0.062
AlaGly: 7.715 ± 0.108
AlaHis: 1.296 ± 0.041
AlaIle: 5.352 ± 0.088
AlaLys: 5.152 ± 0.081
AlaLeu: 8.612 ± 0.124
AlaMet: 2.192 ± 0.052
AlaAsn: 2.888 ± 0.061
AlaPro: 2.554 ± 0.057
AlaGln: 2.698 ± 0.059
AlaArg: 4.735 ± 0.088
AlaSer: 3.516 ± 0.065
AlaThr: 3.55 ± 0.084
AlaVal: 7.346 ± 0.106
AlaTrp: 0.733 ± 0.033
AlaTyr: 2.372 ± 0.054
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.833 ± 0.036
CysCys: 0.189 ± 0.019
CysAsp: 0.5 ± 0.027
CysGlu: 0.543 ± 0.022
CysPhe: 0.48 ± 0.024
CysGly: 1.208 ± 0.038
CysHis: 0.733 ± 0.116
CysIle: 0.67 ± 0.028
CysLys: 0.532 ± 0.027
CysLeu: 1.081 ± 0.038
CysMet: 0.269 ± 0.015
CysAsn: 0.413 ± 0.023
CysPro: 0.72 ± 0.034
CysGln: 0.378 ± 0.022
CysArg: 0.717 ± 0.034
CysSer: 0.744 ± 0.035
CysThr: 0.609 ± 0.032
CysVal: 0.645 ± 0.029
CysTrp: 0.115 ± 0.013
CysTyr: 0.395 ± 0.025
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.48 ± 0.072
AspCys: 0.597 ± 0.031
AspAsp: 2.16 ± 0.06
AspGlu: 3.442 ± 0.06
AspPhe: 2.213 ± 0.052
AspGly: 3.305 ± 0.071
AspHis: 0.75 ± 0.029
AspIle: 4.379 ± 0.074
AspLys: 3.526 ± 0.061
AspLeu: 5.096 ± 0.086
AspMet: 1.329 ± 0.035
AspAsn: 1.95 ± 0.054
AspPro: 2.331 ± 0.053
AspGln: 1.25 ± 0.036
AspArg: 2.825 ± 0.062
AspSer: 2.274 ± 0.055
AspThr: 2.46 ± 0.066
AspVal: 3.574 ± 0.061
AspTrp: 0.593 ± 0.029
AspTyr: 1.949 ± 0.052
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.598 ± 0.091
GluCys: 0.637 ± 0.03
GluAsp: 3.042 ± 0.066
GluGlu: 5.856 ± 0.102
GluPhe: 2.415 ± 0.05
GluGly: 4.309 ± 0.072
GluHis: 1.102 ± 0.035
GluIle: 5.91 ± 0.107
GluLys: 6.262 ± 0.095
GluLeu: 6.757 ± 0.117
GluMet: 2.047 ± 0.05
GluAsn: 3.062 ± 0.061
GluPro: 2.221 ± 0.056
GluGln: 2.677 ± 0.06
GluArg: 3.358 ± 0.071
GluSer: 2.403 ± 0.052
GluThr: 3.371 ± 0.06
GluVal: 5.125 ± 0.077
GluTrp: 0.526 ± 0.027
GluTyr: 2.149 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.15 ± 0.066
PheCys: 0.57 ± 0.027
PheAsp: 2.097 ± 0.053
PheGlu: 2.133 ± 0.05
PhePhe: 1.911 ± 0.056
PheGly: 3.286 ± 0.064
PheHis: 0.673 ± 0.028
PheIle: 2.894 ± 0.054
PheLys: 2.287 ± 0.053
PheLeu: 4.137 ± 0.085
PheMet: 0.987 ± 0.032
PheAsn: 1.73 ± 0.045
PhePro: 1.682 ± 0.046
PheGln: 1.056 ± 0.032
PheArg: 1.897 ± 0.046
PheSer: 2.477 ± 0.054
PheThr: 2.339 ± 0.053
PheVal: 2.896 ± 0.058
PheTrp: 0.452 ± 0.024
PheTyr: 1.449 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.776 ± 0.082
GlyCys: 1.162 ± 0.039
GlyAsp: 3.515 ± 0.064
GlyGlu: 4.782 ± 0.083
GlyPhe: 3.323 ± 0.058
GlyGly: 5.269 ± 0.087
GlyHis: 1.408 ± 0.04
GlyIle: 6.407 ± 0.108
GlyLys: 5.44 ± 0.083
GlyLeu: 7.913 ± 0.104
GlyMet: 2.147 ± 0.057
GlyAsn: 2.905 ± 0.071
GlyPro: 2.48 ± 0.052
GlyGln: 2.773 ± 0.062
GlyArg: 3.904 ± 0.066
GlySer: 4.02 ± 0.081
GlyThr: 4.359 ± 0.104
GlyVal: 5.565 ± 0.089
GlyTrp: 0.796 ± 0.036
GlyTyr: 2.93 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.229 ± 0.043
HisCys: 0.241 ± 0.019
HisAsp: 0.91 ± 0.043
HisGlu: 1.013 ± 0.032
HisPhe: 0.748 ± 0.03
HisGly: 1.515 ± 0.061
HisHis: 0.43 ± 0.024
HisIle: 1.261 ± 0.04
HisLys: 1.013 ± 0.033
HisLeu: 1.732 ± 0.049
HisMet: 0.423 ± 0.022
HisAsn: 0.807 ± 0.039
HisPro: 1.085 ± 0.033
HisGln: 0.531 ± 0.027
HisArg: 0.891 ± 0.031
HisSer: 0.953 ± 0.046
HisThr: 0.949 ± 0.032
HisVal: 1.122 ± 0.039
HisTrp: 0.19 ± 0.016
HisTyr: 0.701 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.049 ± 0.091
IleCys: 0.863 ± 0.036
IleAsp: 3.897 ± 0.062
IleGlu: 4.521 ± 0.081
IlePhe: 2.877 ± 0.064
IleGly: 5.466 ± 0.086
IleHis: 1.191 ± 0.036
IleIle: 5.777 ± 0.093
IleLys: 4.888 ± 0.073
IleLeu: 6.858 ± 0.106
IleMet: 1.8 ± 0.052
IleAsn: 3.309 ± 0.064
IlePro: 3.504 ± 0.066
IleGln: 1.869 ± 0.05
IleArg: 3.647 ± 0.069
IleSer: 4.371 ± 0.078
IleThr: 4.295 ± 0.089
IleVal: 5.07 ± 0.079
IleTrp: 0.544 ± 0.025
IleTyr: 2.245 ± 0.048
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.443 ± 0.083
LysCys: 0.657 ± 0.037
LysAsp: 3.34 ± 0.059
LysGlu: 5.521 ± 0.102
LysPhe: 2.209 ± 0.055
LysGly: 4.599 ± 0.071
LysHis: 1.056 ± 0.036
LysIle: 5.225 ± 0.085
LysLys: 5.194 ± 0.085
LysLeu: 5.873 ± 0.083
LysMet: 1.876 ± 0.044
LysAsn: 3.147 ± 0.064
LysPro: 2.647 ± 0.056
LysGln: 2.199 ± 0.049
LysArg: 2.94 ± 0.064
LysSer: 2.862 ± 0.054
LysThr: 3.484 ± 0.073
LysVal: 5.268 ± 0.091
LysTrp: 0.605 ± 0.028
LysTyr: 2.318 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.615 ± 0.131
LeuCys: 1.069 ± 0.045
LeuAsp: 4.849 ± 0.077
LeuGlu: 6.622 ± 0.111
LeuPhe: 3.935 ± 0.077
LeuGly: 7.503 ± 0.122
LeuHis: 1.667 ± 0.053
LeuIle: 6.576 ± 0.12
LeuLys: 6.758 ± 0.089
LeuLeu: 9.671 ± 0.136
LeuMet: 2.387 ± 0.061
LeuAsn: 4.019 ± 0.072
LeuPro: 4.417 ± 0.073
LeuGln: 3.419 ± 0.079
LeuArg: 4.661 ± 0.072
LeuSer: 5.277 ± 0.1
LeuThr: 5.451 ± 0.088
LeuVal: 7.435 ± 0.101
LeuTrp: 0.822 ± 0.031
LeuTyr: 2.832 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.645 ± 0.061
MetCys: 0.21 ± 0.016
MetAsp: 1.413 ± 0.041
MetGlu: 1.864 ± 0.049
MetPhe: 0.898 ± 0.035
MetGly: 2.011 ± 0.051
MetHis: 0.434 ± 0.023
MetIle: 1.462 ± 0.041
MetLys: 1.693 ± 0.045
MetLeu: 2.476 ± 0.052
MetMet: 0.623 ± 0.03
MetAsn: 1.069 ± 0.035
MetPro: 1.147 ± 0.04
MetGln: 0.9 ± 0.031
MetArg: 1.225 ± 0.036
MetSer: 1.323 ± 0.039
MetThr: 1.242 ± 0.039
MetVal: 2.1 ± 0.045
MetTrp: 0.167 ± 0.015
MetTyr: 0.638 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.689 ± 0.058
AsnCys: 0.605 ± 0.042
AsnAsp: 1.663 ± 0.047
AsnGlu: 2.304 ± 0.05
AsnPhe: 1.692 ± 0.046
AsnGly: 2.928 ± 0.077
AsnHis: 0.74 ± 0.03
AsnIle: 3.374 ± 0.068
AsnLys: 2.738 ± 0.051
AsnLeu: 4.434 ± 0.073
AsnMet: 1.067 ± 0.032
AsnAsn: 1.921 ± 0.061
AsnPro: 2.428 ± 0.066
AsnGln: 1.239 ± 0.039
AsnArg: 2.305 ± 0.051
AsnSer: 2.245 ± 0.063
AsnThr: 2.105 ± 0.061
AsnVal: 2.706 ± 0.057
AsnTrp: 0.438 ± 0.026
AsnTyr: 1.611 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.659 ± 0.073
ProCys: 0.42 ± 0.02
ProAsp: 2.628 ± 0.055
ProGlu: 3.63 ± 0.067
ProPhe: 1.705 ± 0.051
ProGly: 3.836 ± 0.073
ProHis: 0.866 ± 0.043
ProIle: 2.185 ± 0.055
ProLys: 2.144 ± 0.049
ProLeu: 3.767 ± 0.067
ProMet: 0.86 ± 0.03
ProAsn: 1.536 ± 0.044
ProPro: 1.556 ± 0.04
ProGln: 1.345 ± 0.036
ProArg: 1.787 ± 0.044
ProSer: 1.839 ± 0.053
ProThr: 1.894 ± 0.053
ProVal: 4.264 ± 0.073
ProTrp: 0.501 ± 0.026
ProTyr: 1.415 ± 0.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.911 ± 0.066
GlnCys: 0.303 ± 0.02
GlnAsp: 1.434 ± 0.038
GlnGlu: 2.454 ± 0.067
GlnPhe: 1.073 ± 0.033
GlnGly: 2.259 ± 0.047
GlnHis: 0.533 ± 0.022
GlnIle: 2.314 ± 0.051
GlnLys: 2.475 ± 0.054
GlnLeu: 3.006 ± 0.06
GlnMet: 0.939 ± 0.039
GlnAsn: 1.395 ± 0.044
GlnPro: 1.292 ± 0.038
GlnGln: 1.322 ± 0.049
GlnArg: 1.666 ± 0.044
GlnSer: 1.408 ± 0.038
GlnThr: 1.493 ± 0.049
GlnVal: 2.82 ± 0.056
GlnTrp: 0.323 ± 0.018
GlnTyr: 0.961 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.52 ± 0.07
ArgCys: 0.565 ± 0.028
ArgAsp: 2.56 ± 0.057
ArgGlu: 4.39 ± 0.07
ArgPhe: 2.074 ± 0.051
ArgGly: 3.22 ± 0.074
ArgHis: 0.988 ± 0.033
ArgIle: 3.724 ± 0.073
ArgLys: 3.508 ± 0.062
ArgLeu: 5.278 ± 0.091
ArgMet: 1.335 ± 0.038
ArgAsn: 2.074 ± 0.044
ArgPro: 1.961 ± 0.058
ArgGln: 2.099 ± 0.054
ArgArg: 2.721 ± 0.067
ArgSer: 2.165 ± 0.051
ArgThr: 2.22 ± 0.054
ArgVal: 3.933 ± 0.07
ArgTrp: 0.507 ± 0.024
ArgTyr: 1.78 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.965 ± 0.082
SerCys: 0.636 ± 0.034
SerAsp: 2.298 ± 0.055
SerGlu: 2.851 ± 0.063
SerPhe: 2.284 ± 0.052
SerGly: 4.481 ± 0.086
SerHis: 0.917 ± 0.035
SerIle: 3.429 ± 0.065
SerLys: 2.606 ± 0.061
SerLeu: 5.351 ± 0.088
SerMet: 1.268 ± 0.04
SerAsn: 1.774 ± 0.053
SerPro: 2.209 ± 0.057
SerGln: 1.483 ± 0.046
SerArg: 2.646 ± 0.042
SerSer: 2.767 ± 0.077
SerThr: 2.54 ± 0.081
SerVal: 3.945 ± 0.08
SerTrp: 0.569 ± 0.027
SerTyr: 1.676 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.612 ± 0.084
ThrCys: 0.663 ± 0.044
ThrAsp: 2.632 ± 0.067
ThrGlu: 3.168 ± 0.058
ThrPhe: 1.994 ± 0.05
ThrGly: 5.49 ± 0.084
ThrHis: 0.853 ± 0.033
ThrIle: 3.295 ± 0.064
ThrLys: 2.698 ± 0.061
ThrLeu: 4.743 ± 0.073
ThrMet: 1.096 ± 0.033
ThrAsn: 1.855 ± 0.057
ThrPro: 2.411 ± 0.051
ThrGln: 1.258 ± 0.043
ThrArg: 2.526 ± 0.055
ThrSer: 2.471 ± 0.072
ThrThr: 2.765 ± 0.082
ThrVal: 5.058 ± 0.117
ThrTrp: 0.479 ± 0.032
ThrTyr: 1.601 ± 0.05
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.716 ± 0.087
ValCys: 0.892 ± 0.034
ValAsp: 4.047 ± 0.074
ValGlu: 5.222 ± 0.084
ValPhe: 3.324 ± 0.068
ValGly: 5.184 ± 0.088
ValHis: 1.222 ± 0.037
ValIle: 6.056 ± 0.093
ValLys: 5.061 ± 0.072
ValLeu: 7.601 ± 0.116
ValMet: 1.925 ± 0.055
ValAsn: 3.427 ± 0.067
ValPro: 3.226 ± 0.068
ValGln: 2.339 ± 0.051
ValArg: 3.631 ± 0.074
ValSer: 4.368 ± 0.07
ValThr: 4.39 ± 0.099
ValVal: 6.161 ± 0.103
ValTrp: 0.608 ± 0.028
ValTyr: 2.488 ± 0.067
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.717 ± 0.03
TrpCys: 0.098 ± 0.011
TrpAsp: 0.567 ± 0.026
TrpGlu: 0.644 ± 0.026
TrpPhe: 0.421 ± 0.022
TrpGly: 0.694 ± 0.031
TrpHis: 0.208 ± 0.016
TrpIle: 0.486 ± 0.022
TrpLys: 0.536 ± 0.025
TrpLeu: 1.032 ± 0.035
TrpMet: 0.25 ± 0.018
TrpAsn: 0.396 ± 0.024
TrpPro: 0.384 ± 0.027
TrpGln: 0.488 ± 0.026
TrpArg: 0.48 ± 0.025
TrpSer: 0.489 ± 0.032
TrpThr: 0.447 ± 0.023
TrpVal: 0.706 ± 0.03
TrpTrp: 0.128 ± 0.012
TrpTyr: 0.32 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.23 ± 0.044
TyrCys: 0.467 ± 0.024
TyrAsp: 1.688 ± 0.049
TyrGlu: 1.904 ± 0.048
TyrPhe: 1.545 ± 0.048
TyrGly: 2.595 ± 0.063
TyrHis: 0.679 ± 0.033
TyrIle: 2.265 ± 0.047
TyrLys: 1.948 ± 0.055
TyrLeu: 3.567 ± 0.071
TyrMet: 0.719 ± 0.028
TyrAsn: 1.551 ± 0.05
TyrPro: 1.583 ± 0.049
TyrGln: 1.058 ± 0.034
TyrArg: 2.06 ± 0.055
TyrSer: 1.721 ± 0.045
TyrThr: 1.705 ± 0.061
TyrVal: 2.15 ± 0.048
TyrTrp: 0.394 ± 0.023
TyrTyr: 1.427 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2908 proteins (885596 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
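A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. Taking the sample standard deviation as the error is an assumption, as are the function names and the fixed seed; the page does not specify these details.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MAAL", "AAK", "GGAA", "KLM"]  # hypothetical one-letter sequences
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps pairs from the same sequence together, so the estimate reflects variation between proteins.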

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski