Amino acid dipeptide frequency for SAR202 cluster bacterium AC-647-N09_OGT_505m

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
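The counting described above can be sketched in a few lines of Python. This is only an illustration and not the code used to build this page: the function names, the toy sequences, the use of overlapping pairs, and the choice to skip pairs containing non-standard letters are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 in 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and pairs containing a non-standard letter are ignored.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair[0] in AMINO_ACIDS and pair[1] in AMINO_ACIDS:
                counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences): pairs are AA, AA, AA, AC, CA.
toy = ["AAAA", "ACA"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With the uniform expectation of 2.5‰, a value such as 7.448‰ would be labelled "over" and 0.125‰ "under". Whether the page's colouring uses exactly this threshold is an assumption.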

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (or divided by 10 to obtain percentages).
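As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical, and 7.448 is used simply as a sample value.

```python
def permille_to_fraction(value_permille):
    """Per mille -> fraction of all dipeptides (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


sample = 7.448  # a value in per mille
print(permille_to_fraction(sample))  # roughly 0.0074
print(permille_to_percent(sample))   # roughly 0.74 percent
print(permille_to_percent(2.5))      # the uniform expectation, in percent
```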

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.448 ± 0.226
AlaCys: 0.86 ± 0.059
AlaAsp: 4.049 ± 0.124
AlaGlu: 5.03 ± 0.131
AlaPhe: 3.417 ± 0.13
AlaGly: 6.72 ± 0.175
AlaHis: 1.881 ± 0.085
AlaIle: 5.791 ± 0.182
AlaLys: 2.984 ± 0.119
AlaLeu: 9.693 ± 0.245
AlaMet: 2.782 ± 0.115
AlaAsn: 2.598 ± 0.116
AlaPro: 3.281 ± 0.101
AlaGln: 3.425 ± 0.106
AlaArg: 5.265 ± 0.158
AlaSer: 5.438 ± 0.163
AlaThr: 5.133 ± 0.162
AlaVal: 6.408 ± 0.17
AlaTrp: 1.018 ± 0.065
AlaTyr: 2.186 ± 0.094
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.676 ± 0.056
CysCys: 0.125 ± 0.023
CysAsp: 0.514 ± 0.051
CysGlu: 0.47 ± 0.039
CysPhe: 0.261 ± 0.029
CysGly: 1.08 ± 0.075
CysHis: 0.272 ± 0.035
CysIle: 0.474 ± 0.041
CysLys: 0.276 ± 0.03
CysLeu: 0.797 ± 0.051
CysMet: 0.191 ± 0.026
CysAsn: 0.298 ± 0.037
CysPro: 0.496 ± 0.045
CysGln: 0.408 ± 0.039
CysArg: 0.533 ± 0.036
CysSer: 0.577 ± 0.05
CysThr: 0.441 ± 0.042
CysVal: 0.581 ± 0.047
CysTrp: 0.121 ± 0.022
CysTyr: 0.287 ± 0.035
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.082 ± 0.122
AspCys: 0.445 ± 0.039
AspAsp: 2.557 ± 0.094
AspGlu: 3.296 ± 0.119
AspPhe: 2.194 ± 0.087
AspGly: 4.523 ± 0.217
AspHis: 1.132 ± 0.056
AspIle: 3.605 ± 0.128
AspLys: 1.797 ± 0.087
AspLeu: 5.684 ± 0.181
AspMet: 1.54 ± 0.081
AspAsn: 1.657 ± 0.08
AspPro: 2.726 ± 0.1
AspGln: 1.587 ± 0.079
AspArg: 2.962 ± 0.119
AspSer: 3.402 ± 0.11
AspThr: 2.921 ± 0.121
AspVal: 4.409 ± 0.134
AspTrp: 0.834 ± 0.06
AspTyr: 2.01 ± 0.093
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.088 ± 0.172
GluCys: 0.507 ± 0.038
GluAsp: 3.685 ± 0.12
GluGlu: 5.82 ± 0.183
GluPhe: 2.168 ± 0.085
GluGly: 4.957 ± 0.133
GluHis: 1.521 ± 0.083
GluIle: 3.913 ± 0.131
GluLys: 2.579 ± 0.105
GluLeu: 5.89 ± 0.16
GluMet: 2.028 ± 0.097
GluAsn: 1.992 ± 0.087
GluPro: 2.587 ± 0.099
GluGln: 2.359 ± 0.096
GluArg: 4.611 ± 0.171
GluSer: 3.469 ± 0.113
GluThr: 3.531 ± 0.111
GluVal: 5.254 ± 0.143
GluTrp: 0.794 ± 0.064
GluTyr: 1.83 ± 0.08
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.101 ± 0.1
PheCys: 0.404 ± 0.039
PheAsp: 2.076 ± 0.089
PheGlu: 1.874 ± 0.095
PhePhe: 1.565 ± 0.079
PheGly: 3.38 ± 0.141
PheHis: 0.922 ± 0.06
PheIle: 2.124 ± 0.092
PheLys: 1.117 ± 0.057
PheLeu: 4.137 ± 0.137
PheMet: 0.86 ± 0.064
PheAsn: 1.135 ± 0.065
PhePro: 1.793 ± 0.086
PheGln: 1.349 ± 0.072
PheArg: 2.01 ± 0.099
PheSer: 2.616 ± 0.09
PheThr: 2.12 ± 0.096
PheVal: 2.609 ± 0.113
PheTrp: 0.522 ± 0.045
PheTyr: 1.157 ± 0.072
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.724 ± 0.224
GlyCys: 0.79 ± 0.062
GlyAsp: 4.486 ± 0.14
GlyGlu: 4.839 ± 0.129
GlyPhe: 3.825 ± 0.131
GlyGly: 6.886 ± 0.199
GlyHis: 2.003 ± 0.095
GlyIle: 6.004 ± 0.171
GlyLys: 3.197 ± 0.114
GlyLeu: 8.51 ± 0.192
GlyMet: 2.583 ± 0.102
GlyAsn: 2.778 ± 0.119
GlyPro: 3.487 ± 0.116
GlyGln: 2.969 ± 0.118
GlyArg: 5.019 ± 0.118
GlySer: 5.1 ± 0.154
GlyThr: 4.99 ± 0.198
GlyVal: 6.235 ± 0.156
GlyTrp: 1.536 ± 0.094
GlyTyr: 2.748 ± 0.095
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.543 ± 0.091
HisCys: 0.217 ± 0.03
HisAsp: 1.025 ± 0.062
HisGlu: 1.202 ± 0.069
HisPhe: 0.746 ± 0.061
HisGly: 1.778 ± 0.083
HisHis: 0.654 ± 0.054
HisIle: 1.389 ± 0.075
HisLys: 0.61 ± 0.046
HisLeu: 2.418 ± 0.087
HisMet: 0.65 ± 0.048
HisAsn: 0.761 ± 0.052
HisPro: 1.488 ± 0.075
HisGln: 0.816 ± 0.053
HisArg: 1.661 ± 0.091
HisSer: 1.352 ± 0.077
HisThr: 1.242 ± 0.072
HisVal: 1.639 ± 0.078
HisTrp: 0.243 ± 0.031
HisTyr: 0.643 ± 0.047
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.817 ± 0.165
IleCys: 0.536 ± 0.051
IleAsp: 3.112 ± 0.107
IleGlu: 3.531 ± 0.126
IlePhe: 2.138 ± 0.097
IleGly: 5.467 ± 0.174
IleHis: 1.33 ± 0.065
IleIle: 3.432 ± 0.142
IleLys: 1.786 ± 0.086
IleLeu: 6.199 ± 0.178
IleMet: 1.334 ± 0.076
IleAsn: 1.944 ± 0.077
IlePro: 3.542 ± 0.117
IleGln: 2.12 ± 0.106
IleArg: 3.601 ± 0.118
IleSer: 4.27 ± 0.138
IleThr: 3.586 ± 0.106
IleVal: 4.578 ± 0.155
IleTrp: 0.768 ± 0.065
IleTyr: 1.565 ± 0.092
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.167 ± 0.121
LysCys: 0.254 ± 0.029
LysAsp: 1.977 ± 0.082
LysGlu: 2.906 ± 0.116
LysPhe: 0.904 ± 0.049
LysGly: 2.837 ± 0.122
LysHis: 0.694 ± 0.048
LysIle: 1.811 ± 0.075
LysLys: 1.235 ± 0.067
LysLeu: 3.289 ± 0.099
LysMet: 0.823 ± 0.056
LysAsn: 1.113 ± 0.065
LysPro: 1.653 ± 0.084
LysGln: 1.095 ± 0.065
LysArg: 2.3 ± 0.103
LysSer: 1.918 ± 0.081
LysThr: 1.867 ± 0.098
LysVal: 2.745 ± 0.113
LysTrp: 0.448 ± 0.04
LysTyr: 0.863 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.833 ± 0.217
LeuCys: 0.889 ± 0.055
LeuAsp: 5.747 ± 0.171
LeuGlu: 7.514 ± 0.203
LeuPhe: 3.436 ± 0.12
LeuGly: 9.535 ± 0.211
LeuHis: 1.889 ± 0.08
LeuIle: 5.534 ± 0.166
LeuLys: 3.557 ± 0.123
LeuLeu: 10.307 ± 0.248
LeuMet: 2.51 ± 0.098
LeuAsn: 3.116 ± 0.127
LeuPro: 4.986 ± 0.149
LeuGln: 3.318 ± 0.118
LeuArg: 6.338 ± 0.164
LeuSer: 6.996 ± 0.192
LeuThr: 5.467 ± 0.171
LeuVal: 8.253 ± 0.22
LeuTrp: 1.227 ± 0.072
LeuTyr: 2.33 ± 0.094
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.083 ± 0.112
MetCys: 0.187 ± 0.027
MetAsp: 1.613 ± 0.074
MetGlu: 2.076 ± 0.096
MetPhe: 0.808 ± 0.053
MetGly: 2.539 ± 0.102
MetHis: 0.371 ± 0.035
MetIle: 1.231 ± 0.069
MetLys: 1.021 ± 0.056
MetLeu: 2.565 ± 0.116
MetMet: 0.713 ± 0.056
MetAsn: 1.003 ± 0.072
MetPro: 1.481 ± 0.083
MetGln: 0.783 ± 0.055
MetArg: 1.404 ± 0.075
MetSer: 1.775 ± 0.082
MetThr: 1.661 ± 0.077
MetVal: 2.304 ± 0.093
MetTrp: 0.301 ± 0.035
MetTyr: 0.588 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.594 ± 0.089
AsnCys: 0.261 ± 0.033
AsnAsp: 1.253 ± 0.067
AsnGlu: 1.595 ± 0.091
AsnPhe: 1.077 ± 0.07
AsnGly: 2.477 ± 0.112
AsnHis: 0.823 ± 0.052
AsnIle: 2.047 ± 0.096
AsnLys: 1.033 ± 0.07
AsnLeu: 3.84 ± 0.15
AsnMet: 0.794 ± 0.057
AsnAsn: 1.036 ± 0.087
AsnPro: 2.293 ± 0.092
AsnGln: 1.194 ± 0.079
AsnArg: 2.124 ± 0.083
AsnSer: 1.874 ± 0.097
AsnThr: 1.756 ± 0.077
AsnVal: 2.532 ± 0.096
AsnTrp: 0.478 ± 0.043
AsnTyr: 0.882 ± 0.056
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.252 ± 0.107
ProCys: 0.356 ± 0.035
ProAsp: 3.046 ± 0.128
ProGlu: 3.726 ± 0.138
ProPhe: 1.723 ± 0.083
ProGly: 4.068 ± 0.141
ProHis: 1.102 ± 0.071
ProIle: 2.954 ± 0.108
ProLys: 1.664 ± 0.071
ProLeu: 4.799 ± 0.14
ProMet: 1.275 ± 0.064
ProAsn: 1.815 ± 0.09
ProPro: 2.296 ± 0.107
ProGln: 1.992 ± 0.091
ProArg: 2.837 ± 0.112
ProSer: 3.469 ± 0.119
ProThr: 3.182 ± 0.189
ProVal: 3.597 ± 0.107
ProTrp: 0.757 ± 0.05
ProTyr: 1.341 ± 0.07
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.542 ± 0.111
GlnCys: 0.276 ± 0.035
GlnAsp: 1.969 ± 0.093
GlnGlu: 3.002 ± 0.109
GlnPhe: 1.033 ± 0.063
GlnGly: 2.928 ± 0.096
GlnHis: 0.772 ± 0.054
GlnIle: 1.874 ± 0.086
GlnLys: 1.275 ± 0.077
GlnLeu: 3.035 ± 0.106
GlnMet: 1.055 ± 0.065
GlnAsn: 0.992 ± 0.065
GlnPro: 1.477 ± 0.079
GlnGln: 1.411 ± 0.072
GlnArg: 2.712 ± 0.104
GlnSer: 2.293 ± 0.097
GlnThr: 1.694 ± 0.087
GlnVal: 3.006 ± 0.106
GlnTrp: 0.536 ± 0.045
GlnTyr: 0.97 ± 0.06
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.722 ± 0.156
ArgCys: 0.577 ± 0.048
ArgAsp: 3.553 ± 0.134
ArgGlu: 4.387 ± 0.135
ArgPhe: 2.41 ± 0.092
ArgGly: 4.718 ± 0.136
ArgHis: 1.518 ± 0.077
ArgIle: 3.796 ± 0.118
ArgLys: 2.179 ± 0.091
ArgLeu: 6.533 ± 0.173
ArgMet: 1.786 ± 0.084
ArgAsn: 2.087 ± 0.099
ArgPro: 2.748 ± 0.106
ArgGln: 2.319 ± 0.112
ArgArg: 4.527 ± 0.148
ArgSer: 3.652 ± 0.126
ArgThr: 3.031 ± 0.102
ArgVal: 4.527 ± 0.147
ArgTrp: 0.772 ± 0.058
ArgTyr: 1.914 ± 0.081
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.744 ± 0.145
SerCys: 0.625 ± 0.052
SerAsp: 3.12 ± 0.13
SerGlu: 3.594 ± 0.126
SerPhe: 2.521 ± 0.096
SerGly: 5.809 ± 0.17
SerHis: 1.47 ± 0.068
SerIle: 4.02 ± 0.121
SerLys: 2.157 ± 0.083
SerLeu: 7.103 ± 0.154
SerMet: 1.962 ± 0.095
SerAsn: 1.94 ± 0.109
SerPro: 3.601 ± 0.146
SerGln: 2.466 ± 0.091
SerArg: 3.917 ± 0.142
SerSer: 4.259 ± 0.153
SerThr: 3.663 ± 0.151
SerVal: 4.593 ± 0.142
SerTrp: 0.941 ± 0.061
SerTyr: 1.767 ± 0.085
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.828 ± 0.202
ThrCys: 0.43 ± 0.046
ThrAsp: 2.793 ± 0.13
ThrGlu: 2.932 ± 0.109
ThrPhe: 2.058 ± 0.081
ThrGly: 5.019 ± 0.169
ThrHis: 1.172 ± 0.061
ThrIle: 3.498 ± 0.122
ThrLys: 1.76 ± 0.08
ThrLeu: 5.894 ± 0.146
ThrMet: 1.466 ± 0.079
ThrAsn: 1.922 ± 0.085
ThrPro: 3.417 ± 0.166
ThrGln: 2.043 ± 0.088
ThrArg: 3.017 ± 0.107
ThrSer: 3.932 ± 0.152
ThrThr: 3.289 ± 0.155
ThrVal: 4.409 ± 0.161
ThrTrp: 0.794 ± 0.062
ThrTyr: 1.606 ± 0.082
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.062 ± 0.191
ValCys: 0.672 ± 0.048
ValAsp: 4.49 ± 0.112
ValGlu: 5.214 ± 0.169
ValPhe: 2.866 ± 0.139
ValGly: 6.36 ± 0.178
ValHis: 1.565 ± 0.073
ValIle: 4.938 ± 0.137
ValLys: 2.425 ± 0.104
ValLeu: 7.617 ± 0.199
ValMet: 2.028 ± 0.085
ValAsn: 2.418 ± 0.093
ValPro: 3.741 ± 0.129
ValGln: 2.418 ± 0.092
ValArg: 4.185 ± 0.125
ValSer: 5.225 ± 0.143
ValThr: 4.567 ± 0.167
ValVal: 6.665 ± 0.191
ValTrp: 0.893 ± 0.061
ValTyr: 1.951 ± 0.079
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.01 ± 0.061
TrpCys: 0.14 ± 0.022
TrpAsp: 0.746 ± 0.055
TrpGlu: 0.992 ± 0.07
TrpPhe: 0.566 ± 0.046
TrpGly: 1.102 ± 0.07
TrpHis: 0.338 ± 0.037
TrpIle: 0.728 ± 0.056
TrpLys: 0.474 ± 0.041
TrpLeu: 1.529 ± 0.07
TrpMet: 0.456 ± 0.04
TrpAsn: 0.544 ± 0.052
TrpPro: 0.573 ± 0.053
TrpGln: 0.614 ± 0.043
TrpArg: 0.915 ± 0.055
TrpSer: 0.79 ± 0.091
TrpThr: 0.584 ± 0.05
TrpVal: 0.977 ± 0.064
TrpTrp: 0.309 ± 0.046
TrpTyr: 0.389 ± 0.039
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.091 ± 0.087
TyrCys: 0.327 ± 0.041
TyrAsp: 1.551 ± 0.078
TyrGlu: 1.584 ± 0.081
TyrPhe: 1.157 ± 0.073
TyrGly: 2.524 ± 0.1
TyrHis: 0.647 ± 0.044
TyrIle: 1.495 ± 0.08
TyrLys: 0.775 ± 0.059
TyrLeu: 2.943 ± 0.109
TyrMet: 0.669 ± 0.05
TyrAsn: 0.812 ± 0.056
TyrPro: 1.584 ± 0.082
TyrGln: 1.168 ± 0.064
TyrArg: 1.819 ± 0.09
TyrSer: 1.863 ± 0.085
TyrThr: 1.554 ± 0.079
TyrVal: 1.98 ± 0.085
TyrTrp: 0.485 ± 0.045
TyrTyr: 0.797 ± 0.056
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 888 proteins (272155 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
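A protein-level bootstrap of this kind could look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the procedure actually used for this page: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter(seq[i:i + 2]
                     for seq in proteins
                     for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency over resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues from the same protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["AAAA", "ACA", "CCAA", "ACAC"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.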

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski