Amino acid dipeptide frequency for Sulfitobacter mediterraneus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
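As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal sketch: the function name dipeptide_per_mille, the toy sequences, and the use of one-letter codes are assumptions made for the example, and the exact counting rules used by the database are not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are counted as overlapping windows of
    length 2, and a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter codes), not taken from the proteome.
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

By construction the per mille values of all observed dipeptides add up to 1000.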

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
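The comparison against the uniform expectation can be sketched as follows. The threshold of exactly 1000/400 = 2.5‰ and the function name classify are assumptions for illustration; the precise rule behind the red and blue marking is not given on this page. The sample values are copied from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Return (label, ratio) of an observed frequency versus the expectation."""
    ratio = freq_per_mille / expected
    if ratio > 1:
        return "over", ratio
    if ratio < 1:
        return "under", ratio
    return "as expected", ratio


# A few values taken from the table (per mille).
observed = {"AlaAla": 16.391, "AlaAsn": 2.877, "HisSer": 1.019, "CysCys": 0.112}
for pair, value in observed.items():
    label, ratio = classify(value)
    print(f"{pair}: {label} ({ratio:.2f}x expected)")
```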

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.391 ± 0.2
AlaCys: 1.082 ± 0.031
AlaAsp: 7.298 ± 0.091
AlaGlu: 7.638 ± 0.085
AlaPhe: 4.276 ± 0.068
AlaGly: 10.372 ± 0.094
AlaHis: 2.38 ± 0.054
AlaIle: 5.743 ± 0.075
AlaLys: 4.438 ± 0.071
AlaLeu: 13.341 ± 0.152
AlaMet: 3.904 ± 0.064
AlaAsn: 2.877 ± 0.05
AlaPro: 5.712 ± 0.08
AlaGln: 5.18 ± 0.078
AlaArg: 7.386 ± 0.098
AlaSer: 5.452 ± 0.073
AlaThr: 5.517 ± 0.078
AlaVal: 8.172 ± 0.102
AlaTrp: 1.326 ± 0.044
AlaTyr: 2.485 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.17 ± 0.032
CysCys: 0.112 ± 0.011
CysAsp: 0.649 ± 0.019
CysGlu: 0.43 ± 0.019
CysPhe: 0.384 ± 0.018
CysGly: 0.958 ± 0.032
CysHis: 0.268 ± 0.015
CysIle: 0.428 ± 0.021
CysLys: 0.272 ± 0.017
CysLeu: 0.856 ± 0.027
CysMet: 0.167 ± 0.013
CysAsn: 0.246 ± 0.014
CysPro: 0.495 ± 0.021
CysGln: 0.231 ± 0.013
CysArg: 0.491 ± 0.022
CysSer: 0.45 ± 0.02
CysThr: 0.489 ± 0.023
CysVal: 0.606 ± 0.024
CysTrp: 0.129 ± 0.011
CysTyr: 0.234 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.505 ± 0.093
AspCys: 0.528 ± 0.02
AspAsp: 3.843 ± 0.11
AspGlu: 3.476 ± 0.055
AspPhe: 2.327 ± 0.056
AspGly: 5.937 ± 0.119
AspHis: 1.619 ± 0.051
AspIle: 3.238 ± 0.063
AspLys: 1.882 ± 0.042
AspLeu: 6.9 ± 0.081
AspMet: 1.788 ± 0.041
AspAsn: 1.531 ± 0.04
AspPro: 3.699 ± 0.054
AspGln: 2.414 ± 0.048
AspArg: 3.987 ± 0.068
AspSer: 2.243 ± 0.061
AspThr: 3.275 ± 0.101
AspVal: 4.761 ± 0.089
AspTrp: 1.073 ± 0.032
AspTyr: 1.591 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.414 ± 0.096
GluCys: 0.34 ± 0.018
GluAsp: 3.421 ± 0.067
GluGlu: 3.215 ± 0.06
GluPhe: 1.804 ± 0.039
GluGly: 4.612 ± 0.069
GluHis: 1.119 ± 0.032
GluIle: 3.53 ± 0.059
GluLys: 2.19 ± 0.051
GluLeu: 4.768 ± 0.068
GluMet: 1.922 ± 0.043
GluAsn: 1.948 ± 0.049
GluPro: 2.199 ± 0.054
GluGln: 2.048 ± 0.043
GluArg: 3.444 ± 0.065
GluSer: 1.852 ± 0.038
GluThr: 3.802 ± 0.053
GluVal: 4.477 ± 0.056
GluTrp: 0.724 ± 0.027
GluTyr: 1.068 ± 0.036
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.623 ± 0.073
PheCys: 0.475 ± 0.021
PheAsp: 3.228 ± 0.062
PheGlu: 2.397 ± 0.044
PhePhe: 1.506 ± 0.047
PheGly: 3.889 ± 0.062
PheHis: 0.753 ± 0.027
PheIle: 1.651 ± 0.043
PheLys: 1.122 ± 0.03
PheLeu: 3.283 ± 0.06
PheMet: 0.912 ± 0.03
PheAsn: 1.084 ± 0.028
PhePro: 1.478 ± 0.037
PheGln: 1.08 ± 0.03
PheArg: 1.881 ± 0.041
PheSer: 2.143 ± 0.05
PheThr: 2.028 ± 0.044
PheVal: 2.848 ± 0.058
PheTrp: 0.624 ± 0.024
PheTyr: 0.978 ± 0.033
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.801 ± 0.105
GlyCys: 0.854 ± 0.026
GlyAsp: 5.046 ± 0.13
GlyGlu: 4.354 ± 0.068
GlyPhe: 3.843 ± 0.067
GlyGly: 7.606 ± 0.13
GlyHis: 1.992 ± 0.04
GlyIle: 4.509 ± 0.064
GlyLys: 3.528 ± 0.059
GlyLeu: 8.937 ± 0.097
GlyMet: 2.795 ± 0.057
GlyAsn: 2.367 ± 0.068
GlyPro: 3.557 ± 0.051
GlyGln: 3.504 ± 0.067
GlyArg: 5.279 ± 0.081
GlySer: 4.355 ± 0.079
GlyThr: 4.615 ± 0.073
GlyVal: 6.335 ± 0.081
GlyTrp: 1.484 ± 0.042
GlyTyr: 2.283 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.298 ± 0.052
HisCys: 0.246 ± 0.013
HisAsp: 1.33 ± 0.042
HisGlu: 1.027 ± 0.032
HisPhe: 0.855 ± 0.03
HisGly: 1.917 ± 0.047
HisHis: 0.573 ± 0.023
HisIle: 1.089 ± 0.029
HisLys: 0.585 ± 0.026
HisLeu: 2.235 ± 0.048
HisMet: 0.613 ± 0.022
HisAsn: 0.565 ± 0.024
HisPro: 1.407 ± 0.041
HisGln: 0.713 ± 0.024
HisArg: 1.241 ± 0.037
HisSer: 1.019 ± 0.035
HisThr: 0.846 ± 0.032
HisVal: 1.51 ± 0.037
HisTrp: 0.364 ± 0.018
HisTyr: 0.589 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.954 ± 0.085
IleCys: 0.654 ± 0.024
IleAsp: 3.58 ± 0.061
IleGlu: 3.517 ± 0.058
IlePhe: 1.803 ± 0.045
IleGly: 4.981 ± 0.067
IleHis: 0.916 ± 0.03
IleIle: 2.446 ± 0.051
IleLys: 1.851 ± 0.04
IleLeu: 4.748 ± 0.075
IleMet: 1.21 ± 0.031
IleAsn: 1.646 ± 0.042
IlePro: 2.234 ± 0.049
IleGln: 1.245 ± 0.037
IleArg: 2.927 ± 0.049
IleSer: 3.161 ± 0.061
IleThr: 3.33 ± 0.059
IleVal: 3.72 ± 0.063
IleTrp: 0.821 ± 0.027
IleTyr: 1.257 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.297 ± 0.074
LysCys: 0.209 ± 0.014
LysAsp: 2.108 ± 0.048
LysGlu: 1.727 ± 0.038
LysPhe: 1.076 ± 0.031
LysGly: 3.109 ± 0.058
LysHis: 0.715 ± 0.027
LysIle: 1.835 ± 0.046
LysLys: 1.355 ± 0.042
LysLeu: 3.159 ± 0.066
LysMet: 1.043 ± 0.037
LysAsn: 0.938 ± 0.031
LysPro: 1.979 ± 0.05
LysGln: 1.11 ± 0.037
LysArg: 2.323 ± 0.049
LysSer: 2.02 ± 0.038
LysThr: 2.19 ± 0.041
LysVal: 2.544 ± 0.059
LysTrp: 0.447 ± 0.019
LysTyr: 0.698 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.13 ± 0.12
LeuCys: 1.001 ± 0.032
LeuAsp: 5.989 ± 0.072
LeuGlu: 5.178 ± 0.07
LeuPhe: 3.558 ± 0.074
LeuGly: 8.2 ± 0.092
LeuHis: 1.916 ± 0.045
LeuIle: 5.624 ± 0.088
LeuLys: 3.384 ± 0.057
LeuLeu: 8.506 ± 0.115
LeuMet: 2.831 ± 0.054
LeuAsn: 2.931 ± 0.051
LeuPro: 5.451 ± 0.068
LeuGln: 2.954 ± 0.051
LeuArg: 6.72 ± 0.086
LeuSer: 6.62 ± 0.086
LeuThr: 6.048 ± 0.078
LeuVal: 6.365 ± 0.087
LeuTrp: 1.303 ± 0.033
LeuTyr: 1.912 ± 0.043
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.707 ± 0.065
MetCys: 0.195 ± 0.013
MetAsp: 1.618 ± 0.036
MetGlu: 1.321 ± 0.037
MetPhe: 0.932 ± 0.031
MetGly: 2.435 ± 0.051
MetHis: 0.529 ± 0.021
MetIle: 1.742 ± 0.045
MetLys: 1.119 ± 0.031
MetLeu: 2.661 ± 0.054
MetMet: 0.941 ± 0.035
MetAsn: 0.936 ± 0.029
MetPro: 1.6 ± 0.042
MetGln: 1.214 ± 0.034
MetArg: 1.906 ± 0.041
MetSer: 1.839 ± 0.039
MetThr: 2.265 ± 0.042
MetVal: 1.946 ± 0.046
MetTrp: 0.262 ± 0.016
MetTyr: 0.396 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.597 ± 0.053
AsnCys: 0.284 ± 0.017
AsnAsp: 1.772 ± 0.077
AsnGlu: 1.244 ± 0.034
AsnPhe: 1.045 ± 0.032
AsnGly: 2.603 ± 0.056
AsnHis: 0.572 ± 0.024
AsnIle: 1.513 ± 0.034
AsnLys: 0.813 ± 0.027
AsnLeu: 2.72 ± 0.055
AsnMet: 0.785 ± 0.025
AsnAsn: 0.802 ± 0.03
AsnPro: 1.907 ± 0.038
AsnGln: 0.856 ± 0.03
AsnArg: 1.777 ± 0.041
AsnSer: 1.271 ± 0.035
AsnThr: 1.496 ± 0.039
AsnVal: 1.939 ± 0.045
AsnTrp: 0.514 ± 0.024
AsnTyr: 0.692 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.513 ± 0.07
ProCys: 0.359 ± 0.017
ProAsp: 4.066 ± 0.062
ProGlu: 3.697 ± 0.065
ProPhe: 2.013 ± 0.04
ProGly: 3.926 ± 0.059
ProHis: 1.063 ± 0.032
ProIle: 2.308 ± 0.049
ProLys: 1.93 ± 0.048
ProLeu: 4.606 ± 0.066
ProMet: 1.378 ± 0.034
ProAsn: 1.418 ± 0.035
ProPro: 2.081 ± 0.047
ProGln: 1.973 ± 0.043
ProArg: 2.457 ± 0.051
ProSer: 2.472 ± 0.047
ProThr: 2.252 ± 0.046
ProVal: 3.97 ± 0.066
ProTrp: 0.601 ± 0.024
ProTyr: 1.13 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.339 ± 0.07
GlnCys: 0.231 ± 0.015
GlnAsp: 2.109 ± 0.044
GlnGlu: 1.651 ± 0.04
GlnPhe: 1.205 ± 0.033
GlnGly: 2.886 ± 0.059
GlnHis: 0.68 ± 0.027
GlnIle: 2.423 ± 0.051
GlnLys: 1.216 ± 0.035
GlnLeu: 3.099 ± 0.061
GlnMet: 1.333 ± 0.034
GlnAsn: 1.137 ± 0.034
GlnPro: 1.627 ± 0.04
GlnGln: 1.312 ± 0.043
GlnArg: 2.158 ± 0.049
GlnSer: 2.106 ± 0.045
GlnThr: 2.303 ± 0.043
GlnVal: 2.499 ± 0.046
GlnTrp: 0.452 ± 0.021
GlnTyr: 0.629 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.031 ± 0.093
ArgCys: 0.48 ± 0.021
ArgAsp: 4.032 ± 0.067
ArgGlu: 3.247 ± 0.059
ArgPhe: 2.532 ± 0.051
ArgGly: 4.318 ± 0.062
ArgHis: 1.394 ± 0.036
ArgIle: 3.515 ± 0.058
ArgLys: 2.407 ± 0.047
ArgLeu: 6.331 ± 0.084
ArgMet: 1.948 ± 0.043
ArgAsn: 1.758 ± 0.041
ArgPro: 2.863 ± 0.056
ArgGln: 2.263 ± 0.045
ArgArg: 4.331 ± 0.074
ArgSer: 3.193 ± 0.055
ArgThr: 2.661 ± 0.051
ArgVal: 4.231 ± 0.071
ArgTrp: 0.909 ± 0.027
ArgTyr: 1.466 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.794 ± 0.08
SerCys: 0.447 ± 0.018
SerAsp: 3.525 ± 0.064
SerGlu: 2.791 ± 0.049
SerPhe: 2.332 ± 0.05
SerGly: 5.447 ± 0.084
SerHis: 1.133 ± 0.031
SerIle: 2.577 ± 0.053
SerLys: 1.71 ± 0.043
SerLeu: 4.888 ± 0.078
SerMet: 1.448 ± 0.037
SerAsn: 1.508 ± 0.042
SerPro: 2.325 ± 0.046
SerGln: 1.689 ± 0.036
SerArg: 2.937 ± 0.051
SerSer: 2.579 ± 0.048
SerThr: 2.541 ± 0.046
SerVal: 3.82 ± 0.063
SerTrp: 0.703 ± 0.026
SerTyr: 1.343 ± 0.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.38 ± 0.093
ThrCys: 0.506 ± 0.02
ThrAsp: 3.334 ± 0.08
ThrGlu: 2.894 ± 0.053
ThrPhe: 2.044 ± 0.054
ThrGly: 5.309 ± 0.091
ThrHis: 1.133 ± 0.031
ThrIle: 2.695 ± 0.06
ThrLys: 1.678 ± 0.037
ThrLeu: 6.151 ± 0.074
ThrMet: 1.364 ± 0.037
ThrAsn: 1.392 ± 0.043
ThrPro: 3.373 ± 0.061
ThrGln: 1.79 ± 0.042
ThrArg: 3.201 ± 0.056
ThrSer: 2.789 ± 0.049
ThrThr: 2.801 ± 0.062
ThrVal: 4.068 ± 0.077
ThrTrp: 0.673 ± 0.024
ThrTyr: 1.324 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.151 ± 0.102
ValCys: 0.669 ± 0.022
ValAsp: 4.255 ± 0.072
ValGlu: 4.235 ± 0.065
ValPhe: 2.913 ± 0.052
ValGly: 5.314 ± 0.082
ValHis: 1.363 ± 0.038
ValIle: 4.395 ± 0.07
ValLys: 2.314 ± 0.053
ValLeu: 7.485 ± 0.103
ValMet: 2.252 ± 0.046
ValAsn: 2.049 ± 0.046
ValPro: 3.452 ± 0.059
ValGln: 2.428 ± 0.049
ValArg: 3.923 ± 0.066
ValSer: 4.086 ± 0.068
ValThr: 4.588 ± 0.092
ValVal: 5.601 ± 0.093
ValTrp: 0.927 ± 0.028
ValTyr: 1.467 ± 0.037
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.319 ± 0.037
TrpCys: 0.139 ± 0.01
TrpAsp: 0.808 ± 0.029
TrpGlu: 0.594 ± 0.025
TrpPhe: 0.593 ± 0.027
TrpGly: 1.028 ± 0.031
TrpHis: 0.356 ± 0.017
TrpIle: 0.775 ± 0.025
TrpLys: 0.44 ± 0.021
TrpLeu: 1.605 ± 0.045
TrpMet: 0.439 ± 0.019
TrpAsn: 0.444 ± 0.02
TrpPro: 0.723 ± 0.027
TrpGln: 0.658 ± 0.022
TrpArg: 1.044 ± 0.032
TrpSer: 0.783 ± 0.027
TrpThr: 0.747 ± 0.025
TrpVal: 0.935 ± 0.032
TrpTrp: 0.224 ± 0.015
TrpTyr: 0.262 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.554 ± 0.041
TyrCys: 0.255 ± 0.015
TyrAsp: 1.606 ± 0.044
TyrGlu: 1.264 ± 0.039
TyrPhe: 0.944 ± 0.033
TyrGly: 2.196 ± 0.043
TyrHis: 0.548 ± 0.024
TyrIle: 0.964 ± 0.032
TyrLys: 0.651 ± 0.025
TyrLeu: 2.341 ± 0.048
TyrMet: 0.488 ± 0.019
TyrAsn: 0.626 ± 0.022
TyrPro: 1.052 ± 0.032
TyrGln: 0.735 ± 0.022
TyrArg: 1.497 ± 0.036
TyrSer: 1.105 ± 0.031
TyrThr: 1.104 ± 0.04
TyrVal: 1.518 ± 0.039
TyrTrp: 0.358 ± 0.018
TyrTyr: 0.587 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3649 proteins (1135408 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
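A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is only a minimal sketch under stated assumptions: the function names, the toy sequences, the fixed seed, and the choice of the standard deviation as the error measure are illustrative and are not confirmed by this page.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(sequences) whole proteins with
    replacement; individual residues are never resampled.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not real data.
toy = ["MAAL", "AAG", "GGAA", "KLAA", "MKVL"]
print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than single residues keeps the pairs inside each protein intact, so the estimate reflects variation between proteins.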

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski