Amino acid dipepetide frequency for Thermosipho africanus (strain TCF52B)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.395AlaAla: 3.395 ± 0.1
0.302AlaCys: 0.302 ± 0.026
2.22AlaAsp: 2.22 ± 0.061
3.01AlaGlu: 3.01 ± 0.073
2.767AlaPhe: 2.767 ± 0.067
3.47AlaGly: 3.47 ± 0.09
0.743AlaHis: 0.743 ± 0.041
5.234AlaIle: 5.234 ± 0.112
4.71AlaLys: 4.71 ± 0.094
5.61AlaLeu: 5.61 ± 0.085
1.254AlaMet: 1.254 ± 0.052
2.156AlaAsn: 2.156 ± 0.057
1.421AlaPro: 1.421 ± 0.046
1.334AlaGln: 1.334 ± 0.048
2.151AlaArg: 2.151 ± 0.066
2.9AlaSer: 2.9 ± 0.076
2.504AlaThr: 2.504 ± 0.072
3.62AlaVal: 3.62 ± 0.083
0.389AlaTrp: 0.389 ± 0.023
2.017AlaTyr: 2.017 ± 0.063
0.0AlaXaa: 0.0 ± 0.0
Cys
0.314CysAla: 0.314 ± 0.024
0.06CysCys: 0.06 ± 0.01
0.347CysAsp: 0.347 ± 0.028
0.423CysGlu: 0.423 ± 0.028
0.247CysPhe: 0.247 ± 0.022
0.646CysGly: 0.646 ± 0.037
0.112CysHis: 0.112 ± 0.014
0.384CysIle: 0.384 ± 0.027
0.513CysLys: 0.513 ± 0.035
0.346CysLeu: 0.346 ± 0.023
0.116CysMet: 0.116 ± 0.013
0.309CysAsn: 0.309 ± 0.023
0.376CysPro: 0.376 ± 0.032
0.129CysGln: 0.129 ± 0.015
0.19CysArg: 0.19 ± 0.02
0.341CysSer: 0.341 ± 0.026
0.268CysThr: 0.268 ± 0.023
0.369CysVal: 0.369 ± 0.028
0.045CysTrp: 0.045 ± 0.009
0.198CysTyr: 0.198 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
2.495AspAla: 2.495 ± 0.067
0.285AspCys: 0.285 ± 0.024
2.41AspAsp: 2.41 ± 0.069
4.749AspGlu: 4.749 ± 0.088
3.568AspPhe: 3.568 ± 0.082
3.254AspGly: 3.254 ± 0.084
0.542AspHis: 0.542 ± 0.029
5.554AspIle: 5.554 ± 0.101
4.172AspLys: 4.172 ± 0.088
4.905AspLeu: 4.905 ± 0.097
1.054AspMet: 1.054 ± 0.044
2.427AspAsn: 2.427 ± 0.066
1.871AspPro: 1.871 ± 0.064
0.8AspGln: 0.8 ± 0.035
1.598AspArg: 1.598 ± 0.055
2.599AspSer: 2.599 ± 0.065
2.037AspThr: 2.037 ± 0.054
3.896AspVal: 3.896 ± 0.082
0.522AspTrp: 0.522 ± 0.031
2.467AspTyr: 2.467 ± 0.069
0.0AspXaa: 0.0 ± 0.0
Glu
3.428GluAla: 3.428 ± 0.083
0.265GluCys: 0.265 ± 0.02
3.719GluAsp: 3.719 ± 0.084
7.164GluGlu: 7.164 ± 0.163
3.969GluPhe: 3.969 ± 0.087
3.928GluGly: 3.928 ± 0.087
0.941GluHis: 0.941 ± 0.042
8.491GluIle: 8.491 ± 0.138
10.399GluLys: 10.399 ± 0.158
7.003GluLeu: 7.003 ± 0.13
1.864GluMet: 1.864 ± 0.061
6.031GluAsn: 6.031 ± 0.128
1.581GluPro: 1.581 ± 0.051
1.445GluGln: 1.445 ± 0.052
2.992GluArg: 2.992 ± 0.079
3.509GluSer: 3.509 ± 0.079
2.925GluThr: 2.925 ± 0.073
4.927GluVal: 4.927 ± 0.099
0.56GluTrp: 0.56 ± 0.033
3.365GluTyr: 3.365 ± 0.08
0.0GluXaa: 0.0 ± 0.0
Phe
2.745PheAla: 2.745 ± 0.069
0.321PheCys: 0.321 ± 0.024
3.254PheAsp: 3.254 ± 0.072
4.739PheGlu: 4.739 ± 0.095
3.319PhePhe: 3.319 ± 0.098
3.745PheGly: 3.745 ± 0.083
0.638PheHis: 0.638 ± 0.034
4.529PheIle: 4.529 ± 0.107
4.496PheLys: 4.496 ± 0.086
6.159PheLeu: 6.159 ± 0.116
0.975PheMet: 0.975 ± 0.044
3.016PheAsn: 3.016 ± 0.088
1.742PhePro: 1.742 ± 0.063
1.057PheGln: 1.057 ± 0.044
1.621PheArg: 1.621 ± 0.05
5.059PheSer: 5.059 ± 0.136
2.282PheThr: 2.282 ± 0.065
4.192PheVal: 4.192 ± 0.093
0.557PheTrp: 0.557 ± 0.033
2.473PheTyr: 2.473 ± 0.067
0.0PheXaa: 0.0 ± 0.0
Gly
3.563GlyAla: 3.563 ± 0.089
0.487GlyCys: 0.487 ± 0.037
2.846GlyAsp: 2.846 ± 0.082
4.039GlyGlu: 4.039 ± 0.092
3.474GlyPhe: 3.474 ± 0.083
3.943GlyGly: 3.943 ± 0.103
0.936GlyHis: 0.936 ± 0.041
6.355GlyIle: 6.355 ± 0.103
6.392GlyLys: 6.392 ± 0.114
5.069GlyLeu: 5.069 ± 0.099
1.556GlyMet: 1.556 ± 0.054
2.947GlyAsn: 2.947 ± 0.068
1.473GlyPro: 1.473 ± 0.05
1.233GlyGln: 1.233 ± 0.047
2.16GlyArg: 2.16 ± 0.074
3.115GlySer: 3.115 ± 0.073
3.103GlyThr: 3.103 ± 0.087
4.418GlyVal: 4.418 ± 0.09
0.607GlyTrp: 0.607 ± 0.032
3.083GlyTyr: 3.083 ± 0.071
0.0GlyXaa: 0.0 ± 0.0
His
0.713HisAla: 0.713 ± 0.035
0.138HisCys: 0.138 ± 0.015
0.621HisAsp: 0.621 ± 0.031
0.891HisGlu: 0.891 ± 0.043
0.792HisPhe: 0.792 ± 0.036
1.029HisGly: 1.029 ± 0.05
0.267HisHis: 0.267 ± 0.023
1.166HisIle: 1.166 ± 0.045
0.851HisLys: 0.851 ± 0.034
1.193HisLeu: 1.193 ± 0.047
0.267HisMet: 0.267 ± 0.025
0.572HisAsn: 0.572 ± 0.035
0.789HisPro: 0.789 ± 0.038
0.253HisGln: 0.253 ± 0.022
0.525HisArg: 0.525 ± 0.031
0.8HisSer: 0.8 ± 0.039
0.623HisThr: 0.623 ± 0.035
1.015HisVal: 1.015 ± 0.043
0.106HisTrp: 0.106 ± 0.014
0.52HisTyr: 0.52 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.533IleAla: 5.533 ± 0.092
0.485IleCys: 0.485 ± 0.035
5.66IleAsp: 5.66 ± 0.098
7.947IleGlu: 7.947 ± 0.116
5.747IlePhe: 5.747 ± 0.136
5.531IleGly: 5.531 ± 0.113
1.237IleHis: 1.237 ± 0.044
8.577IleIle: 8.577 ± 0.131
8.169IleLys: 8.169 ± 0.14
10.248IleLeu: 10.248 ± 0.137
1.713IleMet: 1.713 ± 0.061
5.068IleAsn: 5.068 ± 0.104
4.007IlePro: 4.007 ± 0.079
1.77IleGln: 1.77 ± 0.057
3.041IleArg: 3.041 ± 0.086
7.14IleSer: 7.14 ± 0.122
4.298IleThr: 4.298 ± 0.087
6.87IleVal: 6.87 ± 0.116
0.671IleTrp: 0.671 ± 0.035
3.996IleTyr: 3.996 ± 0.095
0.0IleXaa: 0.0 ± 0.0
Lys
4.237LysAla: 4.237 ± 0.103
0.497LysCys: 0.497 ± 0.032
5.669LysAsp: 5.669 ± 0.118
8.79LysGlu: 8.79 ± 0.136
4.403LysPhe: 4.403 ± 0.086
4.69LysGly: 4.69 ± 0.094
1.238LysHis: 1.238 ± 0.042
10.589LysIle: 10.589 ± 0.141
9.874LysLys: 9.874 ± 0.156
8.582LysLeu: 8.582 ± 0.139
2.265LysMet: 2.265 ± 0.067
6.628LysAsn: 6.628 ± 0.134
2.391LysPro: 2.391 ± 0.067
1.851LysGln: 1.851 ± 0.063
3.586LysArg: 3.586 ± 0.08
4.813LysSer: 4.813 ± 0.102
3.791LysThr: 3.791 ± 0.08
6.887LysVal: 6.887 ± 0.092
0.73LysTrp: 0.73 ± 0.031
4.492LysTyr: 4.492 ± 0.099
0.0LysXaa: 0.0 ± 0.0
Leu
4.749LeuAla: 4.749 ± 0.094
0.463LeuCys: 0.463 ± 0.029
4.811LeuAsp: 4.811 ± 0.092
7.769LeuGlu: 7.769 ± 0.127
5.328LeuPhe: 5.328 ± 0.116
5.675LeuGly: 5.675 ± 0.102
1.155LeuHis: 1.155 ± 0.047
8.52LeuIle: 8.52 ± 0.127
10.54LeuLys: 10.54 ± 0.146
8.812LeuLeu: 8.812 ± 0.146
1.935LeuMet: 1.935 ± 0.06
5.885LeuAsn: 5.885 ± 0.118
3.252LeuPro: 3.252 ± 0.076
1.834LeuGln: 1.834 ± 0.053
3.49LeuArg: 3.49 ± 0.087
7.669LeuSer: 7.669 ± 0.131
4.293LeuThr: 4.293 ± 0.094
5.724LeuVal: 5.724 ± 0.097
0.735LeuTrp: 0.735 ± 0.04
3.754LeuTyr: 3.754 ± 0.074
0.0LeuXaa: 0.0 ± 0.0
Met
1.245MetAla: 1.245 ± 0.054
0.116MetCys: 0.116 ± 0.012
1.005MetAsp: 1.005 ± 0.04
1.527MetGlu: 1.527 ± 0.052
1.071MetPhe: 1.071 ± 0.044
1.331MetGly: 1.331 ± 0.051
0.285MetHis: 0.285 ± 0.02
1.968MetIle: 1.968 ± 0.057
2.479MetLys: 2.479 ± 0.063
1.968MetLeu: 1.968 ± 0.046
0.477MetMet: 0.477 ± 0.03
1.128MetAsn: 1.128 ± 0.047
0.78MetPro: 0.78 ± 0.034
0.502MetGln: 0.502 ± 0.028
0.935MetArg: 0.935 ± 0.042
1.059MetSer: 1.059 ± 0.047
0.868MetThr: 0.868 ± 0.038
1.411MetVal: 1.411 ± 0.051
0.188MetTrp: 0.188 ± 0.017
0.868MetTyr: 0.868 ± 0.042
0.0MetXaa: 0.0 ± 0.0
Asn
2.646AsnAla: 2.646 ± 0.061
0.357AsnCys: 0.357 ± 0.028
2.665AsnAsp: 2.665 ± 0.071
3.989AsnGlu: 3.989 ± 0.077
3.457AsnPhe: 3.457 ± 0.09
3.453AsnGly: 3.453 ± 0.087
0.621AsnHis: 0.621 ± 0.034
5.964AsnIle: 5.964 ± 0.121
4.539AsnLys: 4.539 ± 0.101
5.9AsnLeu: 5.9 ± 0.102
1.109AsnMet: 1.109 ± 0.041
3.098AsnAsn: 3.098 ± 0.086
2.26AsnPro: 2.26 ± 0.061
1.096AsnGln: 1.096 ± 0.051
1.532AsnArg: 1.532 ± 0.051
3.406AsnSer: 3.406 ± 0.094
2.401AsnThr: 2.401 ± 0.069
4.291AsnVal: 4.291 ± 0.088
0.48AsnTrp: 0.48 ± 0.033
2.641AsnTyr: 2.641 ± 0.084
0.0AsnXaa: 0.0 ± 0.0
Pro
1.497ProAla: 1.497 ± 0.055
0.168ProCys: 0.168 ± 0.016
1.829ProAsp: 1.829 ± 0.058
2.94ProGlu: 2.94 ± 0.077
1.953ProPhe: 1.953 ± 0.052
2.129ProGly: 2.129 ± 0.065
0.547ProHis: 0.547 ± 0.033
2.965ProIle: 2.965 ± 0.064
2.658ProLys: 2.658 ± 0.077
2.922ProLeu: 2.922 ± 0.066
0.659ProMet: 0.659 ± 0.037
1.628ProAsn: 1.628 ± 0.057
0.998ProPro: 0.998 ± 0.042
0.819ProGln: 0.819 ± 0.036
1.029ProArg: 1.029 ± 0.044
1.901ProSer: 1.901 ± 0.051
1.735ProThr: 1.735 ± 0.06
2.432ProVal: 2.432 ± 0.065
0.29ProTrp: 0.29 ± 0.022
1.611ProTyr: 1.611 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
1.185GlnAla: 1.185 ± 0.048
0.104GlnCys: 0.104 ± 0.015
0.878GlnAsp: 0.878 ± 0.035
1.623GlnGlu: 1.623 ± 0.056
0.97GlnPhe: 0.97 ± 0.046
1.005GlnGly: 1.005 ± 0.042
0.267GlnHis: 0.267 ± 0.023
2.247GlnIle: 2.247 ± 0.075
2.187GlnLys: 2.187 ± 0.061
1.921GlnLeu: 1.921 ± 0.061
0.606GlnMet: 0.606 ± 0.032
1.178GlnAsn: 1.178 ± 0.046
0.567GlnPro: 0.567 ± 0.032
0.535GlnGln: 0.535 ± 0.036
0.873GlnArg: 0.873 ± 0.043
1.014GlnSer: 1.014 ± 0.039
0.995GlnThr: 0.995 ± 0.044
1.356GlnVal: 1.356 ± 0.047
0.153GlnTrp: 0.153 ± 0.016
0.767GlnTyr: 0.767 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
1.817ArgAla: 1.817 ± 0.054
0.225ArgCys: 0.225 ± 0.023
1.509ArgAsp: 1.509 ± 0.055
2.794ArgGlu: 2.794 ± 0.081
1.925ArgPhe: 1.925 ± 0.057
2.059ArgGly: 2.059 ± 0.073
0.47ArgHis: 0.47 ± 0.031
3.37ArgIle: 3.37 ± 0.079
3.826ArgLys: 3.826 ± 0.079
3.321ArgLeu: 3.321 ± 0.076
0.965ArgMet: 0.965 ± 0.043
1.896ArgAsn: 1.896 ± 0.056
1.056ArgPro: 1.056 ± 0.045
0.797ArgGln: 0.797 ± 0.04
1.537ArgArg: 1.537 ± 0.057
1.447ArgSer: 1.447 ± 0.056
1.485ArgThr: 1.485 ± 0.049
2.286ArgVal: 2.286 ± 0.066
0.331ArgTrp: 0.331 ± 0.023
1.742ArgTyr: 1.742 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
3.002SerAla: 3.002 ± 0.07
0.359SerCys: 0.359 ± 0.026
2.859SerAsp: 2.859 ± 0.083
4.118SerGlu: 4.118 ± 0.082
3.922SerPhe: 3.922 ± 0.118
4.12SerGly: 4.12 ± 0.096
0.819SerHis: 0.819 ± 0.038
6.031SerIle: 6.031 ± 0.134
6.199SerLys: 6.199 ± 0.116
6.122SerLeu: 6.122 ± 0.108
1.193SerMet: 1.193 ± 0.037
3.257SerAsn: 3.257 ± 0.082
1.893SerPro: 1.893 ± 0.055
1.554SerGln: 1.554 ± 0.052
1.958SerArg: 1.958 ± 0.055
4.056SerSer: 4.056 ± 0.105
2.89SerThr: 2.89 ± 0.075
3.737SerVal: 3.737 ± 0.074
0.577SerTrp: 0.577 ± 0.029
2.693SerTyr: 2.693 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
2.423ThrAla: 2.423 ± 0.062
0.295ThrCys: 0.295 ± 0.025
2.032ThrAsp: 2.032 ± 0.061
2.356ThrGlu: 2.356 ± 0.063
2.68ThrPhe: 2.68 ± 0.069
3.126ThrGly: 3.126 ± 0.081
0.752ThrHis: 0.752 ± 0.034
4.433ThrIle: 4.433 ± 0.094
3.428ThrLys: 3.428 ± 0.075
4.714ThrLeu: 4.714 ± 0.08
0.834ThrMet: 0.834 ± 0.039
2.208ThrAsn: 2.208 ± 0.054
2.027ThrPro: 2.027 ± 0.057
0.998ThrGln: 0.998 ± 0.041
1.517ThrArg: 1.517 ± 0.055
2.72ThrSer: 2.72 ± 0.075
2.358ThrThr: 2.358 ± 0.061
3.044ThrVal: 3.044 ± 0.077
0.364ThrTrp: 0.364 ± 0.024
1.822ThrTyr: 1.822 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
3.803ValAla: 3.803 ± 0.084
0.451ValCys: 0.451 ± 0.034
4.041ValAsp: 4.041 ± 0.081
5.823ValGlu: 5.823 ± 0.112
3.985ValPhe: 3.985 ± 0.091
4.36ValGly: 4.36 ± 0.106
0.837ValHis: 0.837 ± 0.038
6.405ValIle: 6.405 ± 0.09
5.972ValLys: 5.972 ± 0.094
6.442ValLeu: 6.442 ± 0.107
1.363ValMet: 1.363 ± 0.053
3.447ValAsn: 3.447 ± 0.074
2.526ValPro: 2.526 ± 0.068
1.307ValGln: 1.307 ± 0.047
2.296ValArg: 2.296 ± 0.06
4.482ValSer: 4.482 ± 0.099
2.913ValThr: 2.913 ± 0.066
5.355ValVal: 5.355 ± 0.099
0.527ValTrp: 0.527 ± 0.031
2.945ValTyr: 2.945 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.023
0.055TrpCys: 0.055 ± 0.01
0.487TrpAsp: 0.487 ± 0.031
0.54TrpGlu: 0.54 ± 0.032
0.414TrpPhe: 0.414 ± 0.026
0.555TrpGly: 0.555 ± 0.033
0.139TrpHis: 0.139 ± 0.015
0.805TrpIle: 0.805 ± 0.043
0.831TrpLys: 0.831 ± 0.037
0.68TrpLeu: 0.68 ± 0.038
0.227TrpMet: 0.227 ± 0.02
0.52TrpAsn: 0.52 ± 0.034
0.235TrpPro: 0.235 ± 0.021
0.265TrpGln: 0.265 ± 0.026
0.352TrpArg: 0.352 ± 0.027
0.428TrpSer: 0.428 ± 0.024
0.342TrpThr: 0.342 ± 0.026
0.492TrpVal: 0.492 ± 0.031
0.134TrpTrp: 0.134 ± 0.016
0.445TrpTyr: 0.445 ± 0.031
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.002TyrAla: 2.002 ± 0.053
0.3TyrCys: 0.3 ± 0.023
2.344TyrAsp: 2.344 ± 0.065
3.276TyrGlu: 3.276 ± 0.08
2.928TyrPhe: 2.928 ± 0.08
2.747TyrGly: 2.747 ± 0.074
0.584TyrHis: 0.584 ± 0.03
4.12TyrIle: 4.12 ± 0.096
3.769TyrLys: 3.769 ± 0.085
4.477TyrLeu: 4.477 ± 0.101
0.777TyrMet: 0.777 ± 0.036
2.509TyrAsn: 2.509 ± 0.07
1.465TyrPro: 1.465 ± 0.05
0.847TyrGln: 0.847 ± 0.037
1.473TyrArg: 1.473 ± 0.051
2.901TyrSer: 2.901 ± 0.082
2.024TyrThr: 2.024 ± 0.058
2.958TyrVal: 2.958 ± 0.064
0.396TyrTrp: 0.396 ± 0.026
2.252TyrTyr: 2.252 ± 0.079
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1918 proteins (595921 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski