Amino acid dipepetide frequency for Treponema pedis str. T A4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.2AlaAla: 7.2 ± 0.119
1.064AlaCys: 1.064 ± 0.039
3.845AlaAsp: 3.845 ± 0.075
6.112AlaGlu: 6.112 ± 0.097
3.514AlaPhe: 3.514 ± 0.071
5.639AlaGly: 5.639 ± 0.097
0.95AlaHis: 0.95 ± 0.032
4.616AlaIle: 4.616 ± 0.077
4.793AlaLys: 4.793 ± 0.081
6.899AlaLeu: 6.899 ± 0.095
1.642AlaMet: 1.642 ± 0.046
2.506AlaAsn: 2.506 ± 0.053
1.864AlaPro: 1.864 ± 0.046
1.81AlaGln: 1.81 ± 0.046
2.372AlaArg: 2.372 ± 0.063
4.522AlaSer: 4.522 ± 0.089
2.064AlaThr: 2.064 ± 0.051
6.784AlaVal: 6.784 ± 0.099
0.539AlaTrp: 0.539 ± 0.027
2.306AlaTyr: 2.306 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.777CysAla: 0.777 ± 0.026
0.179CysCys: 0.179 ± 0.015
0.552CysAsp: 0.552 ± 0.027
0.679CysGlu: 0.679 ± 0.032
0.81CysPhe: 0.81 ± 0.034
1.198CysGly: 1.198 ± 0.045
0.166CysHis: 0.166 ± 0.014
1.155CysIle: 1.155 ± 0.034
0.938CysLys: 0.938 ± 0.035
1.039CysLeu: 1.039 ± 0.036
0.279CysMet: 0.279 ± 0.019
0.543CysAsn: 0.543 ± 0.028
0.486CysPro: 0.486 ± 0.029
0.154CysGln: 0.154 ± 0.013
0.548CysArg: 0.548 ± 0.032
0.857CysSer: 0.857 ± 0.035
0.704CysThr: 0.704 ± 0.034
0.663CysVal: 0.663 ± 0.029
0.068CysTrp: 0.068 ± 0.009
0.413CysTyr: 0.413 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
2.796AspAla: 2.796 ± 0.063
0.586AspCys: 0.586 ± 0.025
2.175AspAsp: 2.175 ± 0.07
3.877AspGlu: 3.877 ± 0.086
3.138AspPhe: 3.138 ± 0.064
3.5AspGly: 3.5 ± 0.077
0.372AspHis: 0.372 ± 0.02
4.875AspIle: 4.875 ± 0.076
4.299AspLys: 4.299 ± 0.073
4.011AspLeu: 4.011 ± 0.065
1.237AspMet: 1.237 ± 0.041
2.275AspAsn: 2.275 ± 0.053
1.342AspPro: 1.342 ± 0.048
0.463AspGln: 0.463 ± 0.026
1.528AspArg: 1.528 ± 0.049
3.194AspSer: 3.194 ± 0.073
2.685AspThr: 2.685 ± 0.068
2.593AspVal: 2.593 ± 0.055
0.573AspTrp: 0.573 ± 0.027
2.232AspTyr: 2.232 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
5.176GluAla: 5.176 ± 0.095
0.597GluCys: 0.597 ± 0.026
3.253GluAsp: 3.253 ± 0.068
5.841GluGlu: 5.841 ± 0.105
3.367GluPhe: 3.367 ± 0.06
3.638GluGly: 3.638 ± 0.07
1.145GluHis: 1.145 ± 0.035
6.218GluIle: 6.218 ± 0.104
8.233GluLys: 8.233 ± 0.114
6.144GluLeu: 6.144 ± 0.087
1.717GluMet: 1.717 ± 0.051
5.12GluAsn: 5.12 ± 0.088
1.896GluPro: 1.896 ± 0.055
2.194GluGln: 2.194 ± 0.058
2.769GluArg: 2.769 ± 0.061
3.226GluSer: 3.226 ± 0.07
3.995GluThr: 3.995 ± 0.073
3.591GluVal: 3.591 ± 0.071
0.594GluTrp: 0.594 ± 0.029
2.834GluTyr: 2.834 ± 0.063
0.0GluXaa: 0.0 ± 0.0
Phe
3.79PheAla: 3.79 ± 0.066
0.794PheCys: 0.794 ± 0.034
2.659PheAsp: 2.659 ± 0.06
3.191PheGlu: 3.191 ± 0.064
3.951PhePhe: 3.951 ± 0.101
3.105PheGly: 3.105 ± 0.069
0.684PheHis: 0.684 ± 0.027
5.086PheIle: 5.086 ± 0.097
4.149PheLys: 4.149 ± 0.067
5.665PheLeu: 5.665 ± 0.111
1.311PheMet: 1.311 ± 0.035
2.751PheAsn: 2.751 ± 0.053
1.744PhePro: 1.744 ± 0.047
1.186PheGln: 1.186 ± 0.036
1.576PheArg: 1.576 ± 0.044
4.947PheSer: 4.947 ± 0.09
3.285PheThr: 3.285 ± 0.058
2.811PheVal: 2.811 ± 0.06
0.516PheTrp: 0.516 ± 0.027
2.37PheTyr: 2.37 ± 0.06
0.0PheXaa: 0.0 ± 0.0
Gly
4.25GlyAla: 4.25 ± 0.079
0.857GlyCys: 0.857 ± 0.039
2.649GlyAsp: 2.649 ± 0.057
4.251GlyGlu: 4.251 ± 0.076
3.78GlyPhe: 3.78 ± 0.082
4.654GlyGly: 4.654 ± 0.105
0.857GlyHis: 0.857 ± 0.033
6.118GlyIle: 6.118 ± 0.094
6.399GlyLys: 6.399 ± 0.097
5.766GlyLeu: 5.766 ± 0.1
1.671GlyMet: 1.671 ± 0.048
3.393GlyAsn: 3.393 ± 0.07
1.042GlyPro: 1.042 ± 0.039
1.514GlyGln: 1.514 ± 0.044
2.383GlyArg: 2.383 ± 0.059
4.311GlySer: 4.311 ± 0.076
3.642GlyThr: 3.642 ± 0.07
3.759GlyVal: 3.759 ± 0.079
0.603GlyTrp: 0.603 ± 0.03
2.438GlyTyr: 2.438 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
0.904HisAla: 0.904 ± 0.036
0.24HisCys: 0.24 ± 0.016
0.604HisAsp: 0.604 ± 0.028
0.754HisGlu: 0.754 ± 0.035
0.812HisPhe: 0.812 ± 0.035
0.967HisGly: 0.967 ± 0.033
0.264HisHis: 0.264 ± 0.02
1.508HisIle: 1.508 ± 0.039
1.05HisLys: 1.05 ± 0.037
1.404HisLeu: 1.404 ± 0.038
0.153HisMet: 0.153 ± 0.013
0.806HisAsn: 0.806 ± 0.032
0.567HisPro: 0.567 ± 0.025
0.345HisGln: 0.345 ± 0.018
0.546HisArg: 0.546 ± 0.03
1.018HisSer: 1.018 ± 0.037
0.81HisThr: 0.81 ± 0.033
0.6HisVal: 0.6 ± 0.029
0.127HisTrp: 0.127 ± 0.013
0.611HisTyr: 0.611 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.046IleAla: 6.046 ± 0.098
1.073IleCys: 1.073 ± 0.036
4.204IleAsp: 4.204 ± 0.075
5.915IleGlu: 5.915 ± 0.089
4.94IlePhe: 4.94 ± 0.098
5.077IleGly: 5.077 ± 0.089
1.195IleHis: 1.195 ± 0.04
7.708IleIle: 7.708 ± 0.124
7.287IleLys: 7.287 ± 0.084
8.684IleLeu: 8.684 ± 0.115
1.764IleMet: 1.764 ± 0.046
4.137IleAsn: 4.137 ± 0.072
3.625IlePro: 3.625 ± 0.065
2.212IleGln: 2.212 ± 0.054
3.119IleArg: 3.119 ± 0.059
6.267IleSer: 6.267 ± 0.094
4.781IleThr: 4.781 ± 0.082
4.256IleVal: 4.256 ± 0.076
0.564IleTrp: 0.564 ± 0.023
3.038IleTyr: 3.038 ± 0.061
0.0IleXaa: 0.0 ± 0.0
Lys
5.475LysAla: 5.475 ± 0.081
0.634LysCys: 0.634 ± 0.032
4.278LysAsp: 4.278 ± 0.074
6.928LysGlu: 6.928 ± 0.089
3.297LysPhe: 3.297 ± 0.06
4.025LysGly: 4.025 ± 0.071
1.224LysHis: 1.224 ± 0.037
8.302LysIle: 8.302 ± 0.121
8.907LysLys: 8.907 ± 0.133
6.986LysLeu: 6.986 ± 0.091
2.214LysMet: 2.214 ± 0.045
6.844LysAsn: 6.844 ± 0.09
2.749LysPro: 2.749 ± 0.065
2.558LysGln: 2.558 ± 0.053
3.38LysArg: 3.38 ± 0.078
4.505LysSer: 4.505 ± 0.081
6.285LysThr: 6.285 ± 0.107
3.517LysVal: 3.517 ± 0.073
0.554LysTrp: 0.554 ± 0.027
3.24LysTyr: 3.24 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
6.27LeuAla: 6.27 ± 0.091
1.316LeuCys: 1.316 ± 0.042
4.258LeuAsp: 4.258 ± 0.085
5.475LeuGlu: 5.475 ± 0.102
5.573LeuPhe: 5.573 ± 0.11
5.419LeuGly: 5.419 ± 0.098
1.496LeuHis: 1.496 ± 0.043
7.601LeuIle: 7.601 ± 0.121
8.819LeuLys: 8.819 ± 0.11
8.858LeuLeu: 8.858 ± 0.147
2.037LeuMet: 2.037 ± 0.056
5.512LeuAsn: 5.512 ± 0.095
3.908LeuPro: 3.908 ± 0.078
2.814LeuGln: 2.814 ± 0.062
3.377LeuArg: 3.377 ± 0.055
7.331LeuSer: 7.331 ± 0.098
5.289LeuThr: 5.289 ± 0.079
4.312LeuVal: 4.312 ± 0.076
0.753LeuTrp: 0.753 ± 0.031
3.703LeuTyr: 3.703 ± 0.074
0.0LeuXaa: 0.0 ± 0.0
Met
1.422MetAla: 1.422 ± 0.042
0.23MetCys: 0.23 ± 0.019
0.979MetAsp: 0.979 ± 0.036
1.678MetGlu: 1.678 ± 0.046
1.042MetPhe: 1.042 ± 0.03
1.476MetGly: 1.476 ± 0.043
0.352MetHis: 0.352 ± 0.021
1.715MetIle: 1.715 ± 0.05
2.081MetLys: 2.081 ± 0.044
2.306MetLeu: 2.306 ± 0.055
0.538MetMet: 0.538 ± 0.028
1.405MetAsn: 1.405 ± 0.04
1.004MetPro: 1.004 ± 0.039
1.092MetGln: 1.092 ± 0.036
1.015MetArg: 1.015 ± 0.032
1.465MetSer: 1.465 ± 0.044
1.169MetThr: 1.169 ± 0.038
0.992MetVal: 0.992 ± 0.038
0.144MetTrp: 0.144 ± 0.015
0.849MetTyr: 0.849 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
3.618AsnAla: 3.618 ± 0.064
0.669AsnCys: 0.669 ± 0.028
2.179AsnAsp: 2.179 ± 0.053
3.64AsnGlu: 3.64 ± 0.068
2.951AsnPhe: 2.951 ± 0.052
3.791AsnGly: 3.791 ± 0.085
0.579AsnHis: 0.579 ± 0.029
5.214AsnIle: 5.214 ± 0.085
4.44AsnLys: 4.44 ± 0.085
5.329AsnLeu: 5.329 ± 0.096
1.328AsnMet: 1.328 ± 0.036
2.682AsnAsn: 2.682 ± 0.064
2.303AsnPro: 2.303 ± 0.054
1.078AsnGln: 1.078 ± 0.036
1.95AsnArg: 1.95 ± 0.047
3.467AsnSer: 3.467 ± 0.063
2.921AsnThr: 2.921 ± 0.055
2.562AsnVal: 2.562 ± 0.064
0.504AsnTrp: 0.504 ± 0.025
2.083AsnTyr: 2.083 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
2.499ProAla: 2.499 ± 0.059
0.371ProCys: 0.371 ± 0.018
2.056ProAsp: 2.056 ± 0.055
3.141ProGlu: 3.141 ± 0.069
1.914ProPhe: 1.914 ± 0.042
1.661ProGly: 1.661 ± 0.045
0.573ProHis: 0.573 ± 0.026
2.419ProIle: 2.419 ± 0.051
2.2ProLys: 2.2 ± 0.051
3.158ProLeu: 3.158 ± 0.068
0.727ProMet: 0.727 ± 0.031
1.37ProAsn: 1.37 ± 0.043
1.203ProPro: 1.203 ± 0.047
1.132ProGln: 1.132 ± 0.038
0.923ProArg: 0.923 ± 0.036
2.17ProSer: 2.17 ± 0.063
1.186ProThr: 1.186 ± 0.042
2.96ProVal: 2.96 ± 0.063
0.297ProTrp: 0.297 ± 0.018
1.448ProTyr: 1.448 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
2.054GlnAla: 2.054 ± 0.045
0.215GlnCys: 0.215 ± 0.015
1.122GlnAsp: 1.122 ± 0.039
1.725GlnGlu: 1.725 ± 0.054
1.152GlnPhe: 1.152 ± 0.038
1.567GlnGly: 1.567 ± 0.044
0.372GlnHis: 0.372 ± 0.021
2.349GlnIle: 2.349 ± 0.054
2.767GlnLys: 2.767 ± 0.066
2.063GlnLeu: 2.063 ± 0.051
0.67GlnMet: 0.67 ± 0.028
1.936GlnAsn: 1.936 ± 0.048
0.716GlnPro: 0.716 ± 0.031
0.689GlnGln: 0.689 ± 0.027
1.065GlnArg: 1.065 ± 0.038
1.628GlnSer: 1.628 ± 0.046
1.665GlnThr: 1.665 ± 0.047
1.322GlnVal: 1.322 ± 0.041
0.22GlnTrp: 0.22 ± 0.016
1.01GlnTyr: 1.01 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
2.462ArgAla: 2.462 ± 0.061
0.443ArgCys: 0.443 ± 0.027
1.516ArgAsp: 1.516 ± 0.044
2.621ArgGlu: 2.621 ± 0.062
2.073ArgPhe: 2.073 ± 0.048
2.115ArgGly: 2.115 ± 0.058
0.559ArgHis: 0.559 ± 0.025
3.359ArgIle: 3.359 ± 0.077
2.895ArgLys: 2.895 ± 0.07
3.759ArgLeu: 3.759 ± 0.08
0.907ArgMet: 0.907 ± 0.029
1.829ArgAsn: 1.829 ± 0.054
1.066ArgPro: 1.066 ± 0.036
1.158ArgGln: 1.158 ± 0.037
1.674ArgArg: 1.674 ± 0.051
2.192ArgSer: 2.192 ± 0.05
1.875ArgThr: 1.875 ± 0.053
1.86ArgVal: 1.86 ± 0.048
0.331ArgTrp: 0.331 ± 0.026
1.618ArgTyr: 1.618 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
5.125SerAla: 5.125 ± 0.08
0.873SerCys: 0.873 ± 0.038
3.631SerAsp: 3.631 ± 0.063
4.753SerGlu: 4.753 ± 0.07
4.046SerPhe: 4.046 ± 0.076
5.873SerGly: 5.873 ± 0.101
0.873SerHis: 0.873 ± 0.032
5.255SerIle: 5.255 ± 0.085
4.438SerLys: 4.438 ± 0.067
6.442SerLeu: 6.442 ± 0.098
1.39SerMet: 1.39 ± 0.04
2.58SerAsn: 2.58 ± 0.06
2.149SerPro: 2.149 ± 0.057
1.689SerGln: 1.689 ± 0.046
2.175SerArg: 2.175 ± 0.058
4.935SerSer: 4.935 ± 0.101
2.519SerThr: 2.519 ± 0.058
5.218SerVal: 5.218 ± 0.084
0.605SerTrp: 0.605 ± 0.028
2.505SerTyr: 2.505 ± 0.055
0.0SerXaa: 0.0 ± 0.0
Thr
5.163ThrAla: 5.163 ± 0.087
0.554ThrCys: 0.554 ± 0.028
3.072ThrAsp: 3.072 ± 0.062
4.57ThrGlu: 4.57 ± 0.094
2.223ThrPhe: 2.223 ± 0.054
4.594ThrGly: 4.594 ± 0.073
0.797ThrHis: 0.797 ± 0.031
3.579ThrIle: 3.579 ± 0.073
3.127ThrLys: 3.127 ± 0.058
4.736ThrLeu: 4.736 ± 0.079
1.064ThrMet: 1.064 ± 0.035
1.982ThrAsn: 1.982 ± 0.046
1.946ThrPro: 1.946 ± 0.049
1.356ThrGln: 1.356 ± 0.04
1.574ThrArg: 1.574 ± 0.043
3.174ThrSer: 3.174 ± 0.066
2.003ThrThr: 2.003 ± 0.049
5.012ThrVal: 5.012 ± 0.076
0.377ThrTrp: 0.377 ± 0.019
1.7ThrTyr: 1.7 ± 0.048
0.0ThrXaa: 0.0 ± 0.0
Val
3.004ValAla: 3.004 ± 0.07
0.905ValCys: 0.905 ± 0.032
2.257ValAsp: 2.257 ± 0.059
3.319ValGlu: 3.319 ± 0.06
4.122ValPhe: 4.122 ± 0.073
2.846ValGly: 2.846 ± 0.071
0.899ValHis: 0.899 ± 0.034
4.973ValIle: 4.973 ± 0.08
4.981ValLys: 4.981 ± 0.077
6.451ValLeu: 6.451 ± 0.095
1.418ValMet: 1.418 ± 0.04
2.962ValAsn: 2.962 ± 0.066
2.203ValPro: 2.203 ± 0.055
1.736ValGln: 1.736 ± 0.042
2.322ValArg: 2.322 ± 0.056
4.472ValSer: 4.472 ± 0.082
3.052ValThr: 3.052 ± 0.064
3.008ValVal: 3.008 ± 0.075
0.538ValTrp: 0.538 ± 0.023
2.601ValTyr: 2.601 ± 0.057
0.0ValXaa: 0.0 ± 0.0
Trp
0.54TrpAla: 0.54 ± 0.026
0.095TrpCys: 0.095 ± 0.011
0.405TrpAsp: 0.405 ± 0.022
0.593TrpGlu: 0.593 ± 0.032
0.516TrpPhe: 0.516 ± 0.026
0.494TrpGly: 0.494 ± 0.025
0.181TrpHis: 0.181 ± 0.017
0.679TrpIle: 0.679 ± 0.029
0.847TrpLys: 0.847 ± 0.031
0.833TrpLeu: 0.833 ± 0.036
0.159TrpMet: 0.159 ± 0.014
0.519TrpAsn: 0.519 ± 0.026
0.166TrpPro: 0.166 ± 0.014
0.298TrpGln: 0.298 ± 0.018
0.334TrpArg: 0.334 ± 0.018
0.453TrpSer: 0.453 ± 0.026
0.435TrpThr: 0.435 ± 0.022
0.424TrpVal: 0.424 ± 0.023
0.094TrpTrp: 0.094 ± 0.013
0.384TrpTyr: 0.384 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.47TyrAla: 2.47 ± 0.059
0.512TyrCys: 0.512 ± 0.024
2.056TyrAsp: 2.056 ± 0.047
2.483TyrGlu: 2.483 ± 0.051
2.349TyrPhe: 2.349 ± 0.055
2.698TyrGly: 2.698 ± 0.06
0.553TyrHis: 0.553 ± 0.025
3.164TyrIle: 3.164 ± 0.059
3.254TyrLys: 3.254 ± 0.056
3.589TyrLeu: 3.589 ± 0.065
0.724TyrMet: 0.724 ± 0.032
2.136TyrAsn: 2.136 ± 0.048
1.428TyrPro: 1.428 ± 0.045
0.83TyrGln: 0.83 ± 0.032
1.682TyrArg: 1.682 ± 0.053
2.937TyrSer: 2.937 ± 0.063
2.237TyrThr: 2.237 ± 0.049
1.828TyrVal: 1.828 ± 0.053
0.451TyrTrp: 0.451 ± 0.021
1.694TyrTyr: 1.694 ± 0.049
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2775 proteins (849640 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski