Amino acid dipepetide frequency for Thermodesulfitimonas autotrophica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.352AlaAla: 17.352 ± 0.289
1.491AlaCys: 1.491 ± 0.055
4.51AlaAsp: 4.51 ± 0.092
7.359AlaGlu: 7.359 ± 0.121
3.852AlaPhe: 3.852 ± 0.083
10.888AlaGly: 10.888 ± 0.173
1.698AlaHis: 1.698 ± 0.05
4.771AlaIle: 4.771 ± 0.09
4.116AlaLys: 4.116 ± 0.086
12.927AlaLeu: 12.927 ± 0.169
1.945AlaMet: 1.945 ± 0.06
2.328AlaAsn: 2.328 ± 0.057
4.353AlaPro: 4.353 ± 0.097
3.29AlaGln: 3.29 ± 0.073
8.584AlaArg: 8.584 ± 0.138
3.787AlaSer: 3.787 ± 0.079
5.082AlaThr: 5.082 ± 0.097
11.412AlaVal: 11.412 ± 0.168
1.169AlaTrp: 1.169 ± 0.043
2.89AlaTyr: 2.89 ± 0.076
0.0AlaXaa: 0.0 ± 0.0
Cys
1.242CysAla: 1.242 ± 0.053
0.249CysCys: 0.249 ± 0.022
0.524CysAsp: 0.524 ± 0.032
0.551CysGlu: 0.551 ± 0.029
0.496CysPhe: 0.496 ± 0.03
1.694CysGly: 1.694 ± 0.064
0.453CysHis: 0.453 ± 0.057
0.466CysIle: 0.466 ± 0.033
0.32CysLys: 0.32 ± 0.026
1.302CysLeu: 1.302 ± 0.05
0.168CysMet: 0.168 ± 0.017
0.326CysAsn: 0.326 ± 0.024
1.018CysPro: 1.018 ± 0.045
0.354CysGln: 0.354 ± 0.022
1.172CysArg: 1.172 ± 0.042
0.595CysSer: 0.595 ± 0.036
0.565CysThr: 0.565 ± 0.033
0.822CysVal: 0.822 ± 0.036
0.17CysTrp: 0.17 ± 0.016
0.437CysTyr: 0.437 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
4.408AspAla: 4.408 ± 0.082
0.572AspCys: 0.572 ± 0.032
1.559AspAsp: 1.559 ± 0.046
2.664AspGlu: 2.664 ± 0.077
2.116AspPhe: 2.116 ± 0.057
3.548AspGly: 3.548 ± 0.085
0.65AspHis: 0.65 ± 0.033
2.521AspIle: 2.521 ± 0.065
1.517AspLys: 1.517 ± 0.048
4.988AspLeu: 4.988 ± 0.083
0.684AspMet: 0.684 ± 0.032
0.965AspAsn: 0.965 ± 0.04
3.109AspPro: 3.109 ± 0.087
1.002AspGln: 1.002 ± 0.039
3.088AspArg: 3.088 ± 0.066
1.288AspSer: 1.288 ± 0.046
1.849AspThr: 1.849 ± 0.06
3.455AspVal: 3.455 ± 0.07
0.639AspTrp: 0.639 ± 0.032
1.586AspTyr: 1.586 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
8.548GluAla: 8.548 ± 0.128
0.587GluCys: 0.587 ± 0.032
2.581GluAsp: 2.581 ± 0.073
6.659GluGlu: 6.659 ± 0.112
2.311GluPhe: 2.311 ± 0.055
5.156GluGly: 5.156 ± 0.097
1.092GluHis: 1.092 ± 0.04
4.799GluIle: 4.799 ± 0.096
4.501GluLys: 4.501 ± 0.092
7.501GluLeu: 7.501 ± 0.127
1.647GluMet: 1.647 ± 0.053
1.968GluAsn: 1.968 ± 0.056
2.581GluPro: 2.581 ± 0.069
1.966GluGln: 1.966 ± 0.062
5.737GluArg: 5.737 ± 0.108
2.096GluSer: 2.096 ± 0.062
3.217GluThr: 3.217 ± 0.064
5.951GluVal: 5.951 ± 0.115
0.667GluTrp: 0.667 ± 0.034
1.597GluTyr: 1.597 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
4.11PheAla: 4.11 ± 0.088
0.686PheCys: 0.686 ± 0.035
1.781PheAsp: 1.781 ± 0.057
1.861PheGlu: 1.861 ± 0.054
1.982PhePhe: 1.982 ± 0.065
3.38PheGly: 3.38 ± 0.065
0.666PheHis: 0.666 ± 0.031
1.963PheIle: 1.963 ± 0.062
1.441PheLys: 1.441 ± 0.046
4.645PheLeu: 4.645 ± 0.103
0.636PheMet: 0.636 ± 0.031
1.05PheAsn: 1.05 ± 0.039
1.994PhePro: 1.994 ± 0.065
0.925PheGln: 0.925 ± 0.039
2.536PheArg: 2.536 ± 0.067
1.937PheSer: 1.937 ± 0.059
2.224PheThr: 2.224 ± 0.055
2.791PheVal: 2.791 ± 0.073
0.596PheTrp: 0.596 ± 0.031
1.341PheTyr: 1.341 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
8.468GlyAla: 8.468 ± 0.134
1.308GlyCys: 1.308 ± 0.053
3.533GlyAsp: 3.533 ± 0.075
5.86GlyGlu: 5.86 ± 0.096
3.437GlyPhe: 3.437 ± 0.08
6.858GlyGly: 6.858 ± 0.128
1.509GlyHis: 1.509 ± 0.055
4.929GlyIle: 4.929 ± 0.093
4.259GlyLys: 4.259 ± 0.094
8.883GlyLeu: 8.883 ± 0.139
2.033GlyMet: 2.033 ± 0.059
2.096GlyAsn: 2.096 ± 0.069
3.344GlyPro: 3.344 ± 0.077
2.53GlyGln: 2.53 ± 0.067
6.651GlyArg: 6.651 ± 0.103
3.786GlySer: 3.786 ± 0.076
4.52GlyThr: 4.52 ± 0.087
7.38GlyVal: 7.38 ± 0.112
1.002GlyTrp: 1.002 ± 0.044
2.887GlyTyr: 2.887 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
1.444HisAla: 1.444 ± 0.046
0.281HisCys: 0.281 ± 0.021
0.683HisAsp: 0.683 ± 0.032
0.802HisGlu: 0.802 ± 0.035
0.816HisPhe: 0.816 ± 0.04
1.447HisGly: 1.447 ± 0.053
0.463HisHis: 0.463 ± 0.026
0.836HisIle: 0.836 ± 0.036
0.519HisLys: 0.519 ± 0.032
2.142HisLeu: 2.142 ± 0.056
0.249HisMet: 0.249 ± 0.021
0.5HisAsn: 0.5 ± 0.029
1.395HisPro: 1.395 ± 0.052
0.562HisGln: 0.562 ± 0.028
1.406HisArg: 1.406 ± 0.043
0.714HisSer: 0.714 ± 0.031
0.843HisThr: 0.843 ± 0.042
1.106HisVal: 1.106 ± 0.042
0.215HisTrp: 0.215 ± 0.019
0.635HisTyr: 0.635 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
5.436IleAla: 5.436 ± 0.088
0.66IleCys: 0.66 ± 0.033
2.399IleAsp: 2.399 ± 0.067
3.428IleGlu: 3.428 ± 0.082
2.034IlePhe: 2.034 ± 0.062
4.172IleGly: 4.172 ± 0.088
0.82IleHis: 0.82 ± 0.037
2.881IleIle: 2.881 ± 0.078
2.201IleLys: 2.201 ± 0.06
5.352IleLeu: 5.352 ± 0.092
0.802IleMet: 0.802 ± 0.036
1.399IleAsn: 1.399 ± 0.045
2.726IlePro: 2.726 ± 0.065
1.155IleGln: 1.155 ± 0.041
3.479IleArg: 3.479 ± 0.08
2.229IleSer: 2.229 ± 0.061
2.77IleThr: 2.77 ± 0.065
3.685IleVal: 3.685 ± 0.075
0.479IleTrp: 0.479 ± 0.025
1.528IleTyr: 1.528 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
4.055LysAla: 4.055 ± 0.099
0.426LysCys: 0.426 ± 0.026
1.775LysAsp: 1.775 ± 0.051
4.272LysGlu: 4.272 ± 0.11
1.2LysPhe: 1.2 ± 0.055
3.363LysGly: 3.363 ± 0.075
0.661LysHis: 0.661 ± 0.031
2.521LysIle: 2.521 ± 0.063
2.58LysLys: 2.58 ± 0.081
3.868LysLeu: 3.868 ± 0.086
0.976LysMet: 0.976 ± 0.035
1.273LysAsn: 1.273 ± 0.042
2.116LysPro: 2.116 ± 0.051
1.208LysGln: 1.208 ± 0.046
3.027LysArg: 3.027 ± 0.083
1.651LysSer: 1.651 ± 0.05
2.142LysThr: 2.142 ± 0.06
3.896LysVal: 3.896 ± 0.086
0.348LysTrp: 0.348 ± 0.023
1.078LysTyr: 1.078 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
13.224LeuAla: 13.224 ± 0.179
1.457LeuCys: 1.457 ± 0.053
4.376LeuAsp: 4.376 ± 0.077
6.623LeuGlu: 6.623 ± 0.114
4.011LeuPhe: 4.011 ± 0.094
8.905LeuGly: 8.905 ± 0.126
1.708LeuHis: 1.708 ± 0.052
4.832LeuIle: 4.832 ± 0.091
5.414LeuLys: 5.414 ± 0.103
12.516LeuLeu: 12.516 ± 0.202
1.815LeuMet: 1.815 ± 0.059
2.746LeuAsn: 2.746 ± 0.07
6.216LeuPro: 6.216 ± 0.107
3.001LeuGln: 3.001 ± 0.074
8.56LeuArg: 8.56 ± 0.121
5.792LeuSer: 5.792 ± 0.104
5.951LeuThr: 5.951 ± 0.124
9.007LeuVal: 9.007 ± 0.122
1.196LeuTrp: 1.196 ± 0.052
2.96LeuTyr: 2.96 ± 0.07
0.0LeuXaa: 0.0 ± 0.0
Met
2.278MetAla: 2.278 ± 0.061
0.15MetCys: 0.15 ± 0.016
0.717MetAsp: 0.717 ± 0.038
1.196MetGlu: 1.196 ± 0.042
0.504MetPhe: 0.504 ± 0.029
1.521MetGly: 1.521 ± 0.05
0.309MetHis: 0.309 ± 0.021
0.833MetIle: 0.833 ± 0.04
0.897MetLys: 0.897 ± 0.038
1.959MetLeu: 1.959 ± 0.055
0.352MetMet: 0.352 ± 0.022
0.483MetAsn: 0.483 ± 0.025
0.989MetPro: 0.989 ± 0.047
0.521MetGln: 0.521 ± 0.028
1.518MetArg: 1.518 ± 0.056
0.828MetSer: 0.828 ± 0.037
0.837MetThr: 0.837 ± 0.033
1.722MetVal: 1.722 ± 0.054
0.162MetTrp: 0.162 ± 0.016
0.306MetTyr: 0.306 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.314AsnAla: 2.314 ± 0.066
0.334AsnCys: 0.334 ± 0.023
0.879AsnAsp: 0.879 ± 0.04
1.327AsnGlu: 1.327 ± 0.052
1.043AsnPhe: 1.043 ± 0.041
2.107AsnGly: 2.107 ± 0.073
0.431AsnHis: 0.431 ± 0.027
1.495AsnIle: 1.495 ± 0.056
0.837AsnLys: 0.837 ± 0.037
3.16AsnLeu: 3.16 ± 0.075
0.445AsnMet: 0.445 ± 0.026
0.658AsnAsn: 0.658 ± 0.034
2.042AsnPro: 2.042 ± 0.058
0.703AsnGln: 0.703 ± 0.038
1.989AsnArg: 1.989 ± 0.053
0.899AsnSer: 0.899 ± 0.039
1.179AsnThr: 1.179 ± 0.048
1.886AsnVal: 1.886 ± 0.052
0.361AsnTrp: 0.361 ± 0.025
0.819AsnTyr: 0.819 ± 0.036
0.0AsnXaa: 0.0 ± 0.0
Pro
6.016ProAla: 6.016 ± 0.108
0.553ProCys: 0.553 ± 0.031
2.884ProAsp: 2.884 ± 0.067
5.227ProGlu: 5.227 ± 0.102
1.943ProPhe: 1.943 ± 0.067
5.275ProGly: 5.275 ± 0.11
0.987ProHis: 0.987 ± 0.041
1.318ProIle: 1.318 ± 0.048
1.531ProLys: 1.531 ± 0.056
4.94ProLeu: 4.94 ± 0.099
0.615ProMet: 0.615 ± 0.032
1.177ProAsn: 1.177 ± 0.045
3.038ProPro: 3.038 ± 0.081
1.63ProGln: 1.63 ± 0.049
3.137ProArg: 3.137 ± 0.067
1.766ProSer: 1.766 ± 0.06
1.969ProThr: 1.969 ± 0.061
5.644ProVal: 5.644 ± 0.089
0.605ProTrp: 0.605 ± 0.036
1.549ProTyr: 1.549 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
3.475GlnAla: 3.475 ± 0.082
0.235GlnCys: 0.235 ± 0.019
1.226GlnAsp: 1.226 ± 0.043
2.734GlnGlu: 2.734 ± 0.067
0.748GlnPhe: 0.748 ± 0.039
2.286GlnGly: 2.286 ± 0.052
0.463GlnHis: 0.463 ± 0.026
1.423GlnIle: 1.423 ± 0.049
1.535GlnLys: 1.535 ± 0.05
2.527GlnLeu: 2.527 ± 0.07
0.627GlnMet: 0.627 ± 0.034
0.819GlnAsn: 0.819 ± 0.035
1.375GlnPro: 1.375 ± 0.047
1.069GlnGln: 1.069 ± 0.046
2.125GlnArg: 2.125 ± 0.044
0.968GlnSer: 0.968 ± 0.042
1.285GlnThr: 1.285 ± 0.046
2.714GlnVal: 2.714 ± 0.065
0.267GlnTrp: 0.267 ± 0.021
0.556GlnTyr: 0.556 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
7.135ArgAla: 7.135 ± 0.117
1.009ArgCys: 1.009 ± 0.052
3.327ArgAsp: 3.327 ± 0.079
7.369ArgGlu: 7.369 ± 0.12
3.129ArgPhe: 3.129 ± 0.069
5.161ArgGly: 5.161 ± 0.091
1.484ArgHis: 1.484 ± 0.048
3.525ArgIle: 3.525 ± 0.083
2.862ArgLys: 2.862 ± 0.076
9.242ArgLeu: 9.242 ± 0.15
1.569ArgMet: 1.569 ± 0.051
1.67ArgAsn: 1.67 ± 0.054
3.415ArgPro: 3.415 ± 0.072
2.788ArgGln: 2.788 ± 0.064
6.569ArgArg: 6.569 ± 0.125
2.601ArgSer: 2.601 ± 0.054
2.953ArgThr: 2.953 ± 0.076
6.789ArgVal: 6.789 ± 0.123
0.962ArgTrp: 0.962 ± 0.047
2.431ArgTyr: 2.431 ± 0.07
0.0ArgXaa: 0.0 ± 0.0
Ser
3.529SerAla: 3.529 ± 0.081
0.602SerCys: 0.602 ± 0.034
1.555SerAsp: 1.555 ± 0.055
2.308SerGlu: 2.308 ± 0.063
2.13SerPhe: 2.13 ± 0.057
4.265SerGly: 4.265 ± 0.08
0.7SerHis: 0.7 ± 0.035
1.73SerIle: 1.73 ± 0.057
1.225SerLys: 1.225 ± 0.042
4.904SerLeu: 4.904 ± 0.095
0.723SerMet: 0.723 ± 0.038
0.822SerAsn: 0.822 ± 0.036
2.513SerPro: 2.513 ± 0.057
1.152SerGln: 1.152 ± 0.042
3.204SerArg: 3.204 ± 0.07
1.881SerSer: 1.881 ± 0.06
1.796SerThr: 1.796 ± 0.061
3.194SerVal: 3.194 ± 0.082
0.587SerTrp: 0.587 ± 0.036
1.267SerTyr: 1.267 ± 0.051
0.0SerXaa: 0.0 ± 0.0
Thr
5.69ThrAla: 5.69 ± 0.113
0.592ThrCys: 0.592 ± 0.031
2.053ThrAsp: 2.053 ± 0.066
2.887ThrGlu: 2.887 ± 0.063
1.82ThrPhe: 1.82 ± 0.05
5.595ThrGly: 5.595 ± 0.099
0.85ThrHis: 0.85 ± 0.037
2.393ThrIle: 2.393 ± 0.067
1.565ThrLys: 1.565 ± 0.055
5.054ThrLeu: 5.054 ± 0.096
0.695ThrMet: 0.695 ± 0.032
1.165ThrAsn: 1.165 ± 0.047
2.905ThrPro: 2.905 ± 0.071
1.084ThrGln: 1.084 ± 0.046
3.052ThrArg: 3.052 ± 0.071
1.806ThrSer: 1.806 ± 0.055
2.609ThrThr: 2.609 ± 0.081
5.102ThrVal: 5.102 ± 0.109
0.548ThrTrp: 0.548 ± 0.028
1.293ThrTyr: 1.293 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
11.243ValAla: 11.243 ± 0.162
1.126ValCys: 1.126 ± 0.043
3.858ValAsp: 3.858 ± 0.086
5.536ValGlu: 5.536 ± 0.098
3.191ValPhe: 3.191 ± 0.078
6.216ValGly: 6.216 ± 0.095
1.267ValHis: 1.267 ± 0.052
4.626ValIle: 4.626 ± 0.09
3.882ValLys: 3.882 ± 0.081
9.436ValLeu: 9.436 ± 0.12
1.546ValMet: 1.546 ± 0.045
2.379ValAsn: 2.379 ± 0.068
4.501ValPro: 4.501 ± 0.099
2.084ValGln: 2.084 ± 0.069
6.33ValArg: 6.33 ± 0.113
3.911ValSer: 3.911 ± 0.076
5.117ValThr: 5.117 ± 0.105
8.789ValVal: 8.789 ± 0.161
0.894ValTrp: 0.894 ± 0.033
2.328ValTyr: 2.328 ± 0.064
0.0ValXaa: 0.0 ± 0.0
Trp
1.063TrpAla: 1.063 ± 0.047
0.164TrpCys: 0.164 ± 0.016
0.525TrpAsp: 0.525 ± 0.032
0.953TrpGlu: 0.953 ± 0.04
0.436TrpPhe: 0.436 ± 0.026
0.935TrpGly: 0.935 ± 0.037
0.226TrpHis: 0.226 ± 0.017
0.431TrpIle: 0.431 ± 0.029
0.368TrpLys: 0.368 ± 0.024
1.398TrpLeu: 1.398 ± 0.056
0.193TrpMet: 0.193 ± 0.016
0.292TrpAsn: 0.292 ± 0.024
0.612TrpPro: 0.612 ± 0.027
0.473TrpGln: 0.473 ± 0.031
1.111TrpArg: 1.111 ± 0.044
0.409TrpSer: 0.409 ± 0.023
0.406TrpThr: 0.406 ± 0.03
0.916TrpVal: 0.916 ± 0.039
0.213TrpTrp: 0.213 ± 0.02
0.34TrpTyr: 0.34 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.774TyrAla: 2.774 ± 0.063
0.445TyrCys: 0.445 ± 0.029
1.466TyrAsp: 1.466 ± 0.081
1.546TyrGlu: 1.546 ± 0.047
1.348TyrPhe: 1.348 ± 0.046
2.502TyrGly: 2.502 ± 0.074
0.621TyrHis: 0.621 ± 0.031
1.342TyrIle: 1.342 ± 0.051
0.867TyrLys: 0.867 ± 0.034
3.567TyrLeu: 3.567 ± 0.074
0.36TyrMet: 0.36 ± 0.025
0.791TyrAsn: 0.791 ± 0.035
1.562TyrPro: 1.562 ± 0.049
0.961TyrGln: 0.961 ± 0.035
2.715TyrArg: 2.715 ± 0.074
1.143TyrSer: 1.143 ± 0.045
1.44TyrThr: 1.44 ± 0.05
1.999TyrVal: 1.999 ± 0.05
0.378TyrTrp: 0.378 ± 0.024
1.007TyrTyr: 1.007 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2146 proteins (647403 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski