Amino acid dipepetide frequency for Agrobacterium phage Atu_ph07

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.942AlaAla: 3.942 ± 0.231
0.379AlaCys: 0.379 ± 0.054
3.243AlaAsp: 3.243 ± 0.178
3.243AlaGlu: 3.243 ± 0.185
2.335AlaPhe: 2.335 ± 0.123
3.86AlaGly: 3.86 ± 0.299
0.684AlaHis: 0.684 ± 0.073
4.47AlaIle: 4.47 ± 0.218
3.689AlaLys: 3.689 ± 0.214
3.838AlaLeu: 3.838 ± 0.225
1.584AlaMet: 1.584 ± 0.125
3.399AlaAsn: 3.399 ± 0.159
1.629AlaPro: 1.629 ± 0.136
1.242AlaGln: 1.242 ± 0.089
2.209AlaArg: 2.209 ± 0.144
3.622AlaSer: 3.622 ± 0.18
3.801AlaThr: 3.801 ± 0.238
3.652AlaVal: 3.652 ± 0.206
0.61AlaTrp: 0.61 ± 0.076
2.365AlaTyr: 2.365 ± 0.127
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.067
0.082CysCys: 0.082 ± 0.024
0.58CysAsp: 0.58 ± 0.065
0.55CysGlu: 0.55 ± 0.072
0.298CysPhe: 0.298 ± 0.052
0.446CysGly: 0.446 ± 0.067
0.231CysHis: 0.231 ± 0.054
0.521CysIle: 0.521 ± 0.062
0.64CysLys: 0.64 ± 0.066
0.573CysLeu: 0.573 ± 0.06
0.201CysMet: 0.201 ± 0.037
0.483CysAsn: 0.483 ± 0.072
0.327CysPro: 0.327 ± 0.047
0.268CysGln: 0.268 ± 0.043
0.283CysArg: 0.283 ± 0.047
0.536CysSer: 0.536 ± 0.067
0.461CysThr: 0.461 ± 0.053
0.528CysVal: 0.528 ± 0.059
0.141CysTrp: 0.141 ± 0.034
0.283CysTyr: 0.283 ± 0.048
0.0CysXaa: 0.0 ± 0.0
Asp
3.793AspAla: 3.793 ± 0.173
0.558AspCys: 0.558 ± 0.072
5.266AspAsp: 5.266 ± 0.273
5.296AspGlu: 5.296 ± 0.246
4.18AspPhe: 4.18 ± 0.192
5.244AspGly: 5.244 ± 0.229
1.153AspHis: 1.153 ± 0.098
6.545AspIle: 6.545 ± 0.234
3.831AspLys: 3.831 ± 0.188
5.229AspLeu: 5.229 ± 0.179
1.867AspMet: 1.867 ± 0.103
3.823AspAsn: 3.823 ± 0.157
2.343AspPro: 2.343 ± 0.146
1.398AspGln: 1.398 ± 0.116
2.878AspArg: 2.878 ± 0.146
3.905AspSer: 3.905 ± 0.162
4.15AspThr: 4.15 ± 0.165
4.775AspVal: 4.775 ± 0.204
1.019AspTrp: 1.019 ± 0.087
3.116AspTyr: 3.116 ± 0.152
0.0AspXaa: 0.0 ± 0.0
Glu
3.407GluAla: 3.407 ± 0.191
0.543GluCys: 0.543 ± 0.073
4.433GluAsp: 4.433 ± 0.222
4.604GluGlu: 4.604 ± 0.267
3.607GluPhe: 3.607 ± 0.16
3.198GluGly: 3.198 ± 0.163
1.309GluHis: 1.309 ± 0.103
6.032GluIle: 6.032 ± 0.225
4.969GluLys: 4.969 ± 0.229
5.809GluLeu: 5.809 ± 0.258
2.112GluMet: 2.112 ± 0.158
4.507GluAsn: 4.507 ± 0.201
1.986GluPro: 1.986 ± 0.126
1.964GluGln: 1.964 ± 0.12
2.856GluArg: 2.856 ± 0.161
3.288GluSer: 3.288 ± 0.179
4.842GluThr: 4.842 ± 0.181
3.831GluVal: 3.831 ± 0.189
1.056GluTrp: 1.056 ± 0.088
3.868GluTyr: 3.868 ± 0.18
0.0GluXaa: 0.0 ± 0.0
Phe
2.335PheAla: 2.335 ± 0.138
0.513PheCys: 0.513 ± 0.065
4.745PheAsp: 4.745 ± 0.18
4.284PheGlu: 4.284 ± 0.195
2.105PhePhe: 2.105 ± 0.141
3.169PheGly: 3.169 ± 0.167
0.848PheHis: 0.848 ± 0.078
3.674PheIle: 3.674 ± 0.182
3.526PheLys: 3.526 ± 0.192
3.384PheLeu: 3.384 ± 0.174
1.19PheMet: 1.19 ± 0.098
3.139PheAsn: 3.139 ± 0.157
1.555PhePro: 1.555 ± 0.099
0.788PheGln: 0.788 ± 0.079
1.666PheArg: 1.666 ± 0.129
3.198PheSer: 3.198 ± 0.191
3.154PheThr: 3.154 ± 0.183
3.578PheVal: 3.578 ± 0.174
0.588PheTrp: 0.588 ± 0.082
2.179PheTyr: 2.179 ± 0.131
0.0PheXaa: 0.0 ± 0.0
Gly
3.079GlyAla: 3.079 ± 0.229
0.602GlyCys: 0.602 ± 0.07
4.388GlyAsp: 4.388 ± 0.247
3.89GlyGlu: 3.89 ± 0.25
3.221GlyPhe: 3.221 ± 0.165
3.964GlyGly: 3.964 ± 0.229
0.959GlyHis: 0.959 ± 0.073
4.247GlyIle: 4.247 ± 0.284
4.381GlyLys: 4.381 ± 0.199
3.853GlyLeu: 3.853 ± 0.208
1.785GlyMet: 1.785 ± 0.121
3.845GlyAsn: 3.845 ± 0.194
2.023GlyPro: 2.023 ± 0.359
1.703GlyGln: 1.703 ± 0.152
2.492GlyArg: 2.492 ± 0.156
4.492GlySer: 4.492 ± 0.228
4.478GlyThr: 4.478 ± 0.291
4.396GlyVal: 4.396 ± 0.191
0.967GlyTrp: 0.967 ± 0.095
2.931GlyTyr: 2.931 ± 0.187
0.0GlyXaa: 0.0 ± 0.0
His
0.602HisAla: 0.602 ± 0.07
0.238HisCys: 0.238 ± 0.043
0.967HisAsp: 0.967 ± 0.086
0.84HisGlu: 0.84 ± 0.099
1.064HisPhe: 1.064 ± 0.112
0.922HisGly: 0.922 ± 0.081
0.454HisHis: 0.454 ± 0.061
1.629HisIle: 1.629 ± 0.109
1.197HisLys: 1.197 ± 0.093
1.592HisLeu: 1.592 ± 0.123
0.476HisMet: 0.476 ± 0.071
1.004HisAsn: 1.004 ± 0.096
0.952HisPro: 0.952 ± 0.089
0.521HisGln: 0.521 ± 0.063
0.907HisArg: 0.907 ± 0.095
1.22HisSer: 1.22 ± 0.098
0.989HisThr: 0.989 ± 0.074
1.116HisVal: 1.116 ± 0.1
0.238HisTrp: 0.238 ± 0.041
0.922HisTyr: 0.922 ± 0.075
0.0HisXaa: 0.0 ± 0.0
Ile
4.284IleAla: 4.284 ± 0.178
0.64IleCys: 0.64 ± 0.075
6.716IleAsp: 6.716 ± 0.226
5.63IleGlu: 5.63 ± 0.193
3.124IlePhe: 3.124 ± 0.168
4.106IleGly: 4.106 ± 0.19
1.443IleHis: 1.443 ± 0.117
5.779IleIle: 5.779 ± 0.226
5.199IleLys: 5.199 ± 0.225
5.199IleLeu: 5.199 ± 0.185
1.763IleMet: 1.763 ± 0.117
4.686IleAsn: 4.686 ± 0.248
3.496IlePro: 3.496 ± 0.175
2.157IleGln: 2.157 ± 0.154
3.578IleArg: 3.578 ± 0.162
6.292IleSer: 6.292 ± 0.259
5.422IleThr: 5.422 ± 0.259
5.035IleVal: 5.035 ± 0.194
0.684IleTrp: 0.684 ± 0.081
2.893IleTyr: 2.893 ± 0.162
0.0IleXaa: 0.0 ± 0.0
Lys
3.526LysAla: 3.526 ± 0.188
0.461LysCys: 0.461 ± 0.054
4.24LysAsp: 4.24 ± 0.168
4.708LysGlu: 4.708 ± 0.277
3.206LysPhe: 3.206 ± 0.151
3.481LysGly: 3.481 ± 0.202
1.465LysHis: 1.465 ± 0.112
5.578LysIle: 5.578 ± 0.226
5.638LysLys: 5.638 ± 0.289
4.954LysLeu: 4.954 ± 0.255
2.202LysMet: 2.202 ± 0.147
4.582LysAsn: 4.582 ± 0.207
2.224LysPro: 2.224 ± 0.119
1.793LysGln: 1.793 ± 0.125
3.057LysArg: 3.057 ± 0.191
3.942LysSer: 3.942 ± 0.206
4.403LysThr: 4.403 ± 0.201
4.202LysVal: 4.202 ± 0.247
0.922LysTrp: 0.922 ± 0.09
3.451LysTyr: 3.451 ± 0.206
0.0LysXaa: 0.0 ± 0.0
Leu
3.726LeuAla: 3.726 ± 0.159
0.55LeuCys: 0.55 ± 0.07
4.916LeuAsp: 4.916 ± 0.184
5.095LeuGlu: 5.095 ± 0.234
3.258LeuPhe: 3.258 ± 0.168
3.756LeuGly: 3.756 ± 0.2
1.346LeuHis: 1.346 ± 0.103
5.021LeuIle: 5.021 ± 0.211
5.273LeuLys: 5.273 ± 0.233
4.983LeuLeu: 4.983 ± 0.205
1.926LeuMet: 1.926 ± 0.124
4.611LeuAsn: 4.611 ± 0.181
3.139LeuPro: 3.139 ± 0.164
2.194LeuGln: 2.194 ± 0.123
3.28LeuArg: 3.28 ± 0.184
5.333LeuSer: 5.333 ± 0.219
5.244LeuThr: 5.244 ± 0.204
4.701LeuVal: 4.701 ± 0.2
0.811LeuTrp: 0.811 ± 0.097
3.131LeuTyr: 3.131 ± 0.159
0.0LeuXaa: 0.0 ± 0.0
Met
1.621MetAla: 1.621 ± 0.128
0.179MetCys: 0.179 ± 0.035
1.48MetAsp: 1.48 ± 0.105
1.674MetGlu: 1.674 ± 0.131
1.383MetPhe: 1.383 ± 0.108
1.116MetGly: 1.116 ± 0.097
0.35MetHis: 0.35 ± 0.055
2.016MetIle: 2.016 ± 0.137
2.187MetLys: 2.187 ± 0.14
1.763MetLeu: 1.763 ± 0.132
0.565MetMet: 0.565 ± 0.07
1.837MetAsn: 1.837 ± 0.121
0.714MetPro: 0.714 ± 0.079
0.632MetGln: 0.632 ± 0.065
1.168MetArg: 1.168 ± 0.098
1.986MetSer: 1.986 ± 0.121
1.874MetThr: 1.874 ± 0.139
1.569MetVal: 1.569 ± 0.129
0.32MetTrp: 0.32 ± 0.051
0.982MetTyr: 0.982 ± 0.088
0.0MetXaa: 0.0 ± 0.0
Asn
3.354AsnAla: 3.354 ± 0.174
0.417AsnCys: 0.417 ± 0.054
4.277AsnAsp: 4.277 ± 0.183
4.492AsnGlu: 4.492 ± 0.207
2.566AsnPhe: 2.566 ± 0.141
5.378AsnGly: 5.378 ± 0.291
1.16AsnHis: 1.16 ± 0.095
4.894AsnIle: 4.894 ± 0.201
3.786AsnLys: 3.786 ± 0.163
4.664AsnLeu: 4.664 ± 0.194
1.361AsnMet: 1.361 ± 0.101
4.254AsnAsn: 4.254 ± 0.227
2.886AsnPro: 2.886 ± 0.178
1.413AsnGln: 1.413 ± 0.121
2.373AsnArg: 2.373 ± 0.129
3.689AsnSer: 3.689 ± 0.198
3.667AsnThr: 3.667 ± 0.197
4.626AsnVal: 4.626 ± 0.209
0.907AsnTrp: 0.907 ± 0.1
2.343AsnTyr: 2.343 ± 0.125
0.0AsnXaa: 0.0 ± 0.0
Pro
2.216ProAla: 2.216 ± 0.152
0.201ProCys: 0.201 ± 0.046
2.514ProAsp: 2.514 ± 0.15
2.871ProGlu: 2.871 ± 0.179
1.949ProPhe: 1.949 ± 0.132
1.681ProGly: 1.681 ± 0.167
0.558ProHis: 0.558 ± 0.07
2.588ProIle: 2.588 ± 0.143
1.993ProLys: 1.993 ± 0.139
2.462ProLeu: 2.462 ± 0.155
0.84ProMet: 0.84 ± 0.079
2.343ProAsn: 2.343 ± 0.166
1.376ProPro: 1.376 ± 0.141
1.093ProGln: 1.093 ± 0.161
1.145ProArg: 1.145 ± 0.088
2.953ProSer: 2.953 ± 0.152
2.819ProThr: 2.819 ± 0.175
2.893ProVal: 2.893 ± 0.139
0.364ProTrp: 0.364 ± 0.052
1.897ProTyr: 1.897 ± 0.126
0.0ProXaa: 0.0 ± 0.0
Gln
1.465GlnAla: 1.465 ± 0.115
0.104GlnCys: 0.104 ± 0.027
1.361GlnAsp: 1.361 ± 0.095
1.562GlnGlu: 1.562 ± 0.122
1.257GlnPhe: 1.257 ± 0.096
1.688GlnGly: 1.688 ± 0.368
0.402GlnHis: 0.402 ± 0.052
2.053GlnIle: 2.053 ± 0.122
1.577GlnLys: 1.577 ± 0.118
2.202GlnLeu: 2.202 ± 0.118
0.707GlnMet: 0.707 ± 0.075
1.376GlnAsn: 1.376 ± 0.094
0.915GlnPro: 0.915 ± 0.092
0.774GlnGln: 0.774 ± 0.083
1.145GlnArg: 1.145 ± 0.101
1.703GlnSer: 1.703 ± 0.136
1.607GlnThr: 1.607 ± 0.145
1.369GlnVal: 1.369 ± 0.101
0.417GlnTrp: 0.417 ± 0.06
1.421GlnTyr: 1.421 ± 0.101
0.0GlnXaa: 0.0 ± 0.0
Arg
2.053ArgAla: 2.053 ± 0.133
0.35ArgCys: 0.35 ± 0.062
2.499ArgAsp: 2.499 ± 0.142
2.886ArgGlu: 2.886 ± 0.161
2.492ArgPhe: 2.492 ± 0.134
2.447ArgGly: 2.447 ± 0.154
0.803ArgHis: 0.803 ± 0.086
2.901ArgIle: 2.901 ± 0.14
2.983ArgLys: 2.983 ± 0.163
3.317ArgLeu: 3.317 ± 0.171
1.168ArgMet: 1.168 ± 0.099
2.596ArgAsn: 2.596 ± 0.129
1.317ArgPro: 1.317 ± 0.093
1.383ArgGln: 1.383 ± 0.107
1.621ArgArg: 1.621 ± 0.122
2.678ArgSer: 2.678 ± 0.141
2.112ArgThr: 2.112 ± 0.123
2.804ArgVal: 2.804 ± 0.134
0.699ArgTrp: 0.699 ± 0.082
2.187ArgTyr: 2.187 ± 0.127
0.0ArgXaa: 0.0 ± 0.0
Ser
3.54SerAla: 3.54 ± 0.183
0.58SerCys: 0.58 ± 0.065
4.76SerAsp: 4.76 ± 0.232
4.269SerGlu: 4.269 ± 0.197
3.615SerPhe: 3.615 ± 0.156
4.738SerGly: 4.738 ± 0.287
1.22SerHis: 1.22 ± 0.101
5.497SerIle: 5.497 ± 0.259
4.455SerLys: 4.455 ± 0.175
4.686SerLeu: 4.686 ± 0.158
1.465SerMet: 1.465 ± 0.102
3.92SerAsn: 3.92 ± 0.197
2.581SerPro: 2.581 ± 0.157
1.383SerGln: 1.383 ± 0.115
2.774SerArg: 2.774 ± 0.151
4.827SerSer: 4.827 ± 0.239
4.411SerThr: 4.411 ± 0.271
4.924SerVal: 4.924 ± 0.236
0.759SerTrp: 0.759 ± 0.072
3.176SerTyr: 3.176 ± 0.192
0.0SerXaa: 0.0 ± 0.0
Thr
3.741ThrAla: 3.741 ± 0.223
0.409ThrCys: 0.409 ± 0.057
4.44ThrAsp: 4.44 ± 0.194
4.21ThrGlu: 4.21 ± 0.163
3.503ThrPhe: 3.503 ± 0.165
4.738ThrGly: 4.738 ± 0.303
1.064ThrHis: 1.064 ± 0.1
5.355ThrIle: 5.355 ± 0.245
4.076ThrLys: 4.076 ± 0.174
4.961ThrLeu: 4.961 ± 0.193
1.175ThrMet: 1.175 ± 0.098
4.039ThrAsn: 4.039 ± 0.237
2.812ThrPro: 2.812 ± 0.152
1.331ThrGln: 1.331 ± 0.116
2.254ThrArg: 2.254 ± 0.127
4.708ThrSer: 4.708 ± 0.249
4.634ThrThr: 4.634 ± 0.278
4.812ThrVal: 4.812 ± 0.243
0.602ThrTrp: 0.602 ± 0.059
3.131ThrTyr: 3.131 ± 0.141
0.0ThrXaa: 0.0 ± 0.0
Val
3.645ValAla: 3.645 ± 0.173
0.536ValCys: 0.536 ± 0.065
4.983ValAsp: 4.983 ± 0.207
4.559ValGlu: 4.559 ± 0.225
3.28ValPhe: 3.28 ± 0.159
4.113ValGly: 4.113 ± 0.219
1.064ValHis: 1.064 ± 0.093
4.857ValIle: 4.857 ± 0.165
4.634ValLys: 4.634 ± 0.204
5.065ValLeu: 5.065 ± 0.215
1.666ValMet: 1.666 ± 0.114
4.262ValAsn: 4.262 ± 0.195
2.67ValPro: 2.67 ± 0.157
1.488ValGln: 1.488 ± 0.105
2.834ValArg: 2.834 ± 0.165
5.355ValSer: 5.355 ± 0.23
4.135ValThr: 4.135 ± 0.243
5.08ValVal: 5.08 ± 0.188
0.93ValTrp: 0.93 ± 0.083
3.146ValTyr: 3.146 ± 0.185
0.0ValXaa: 0.0 ± 0.0
Trp
0.595TrpAla: 0.595 ± 0.066
0.164TrpCys: 0.164 ± 0.034
0.885TrpAsp: 0.885 ± 0.114
0.759TrpGlu: 0.759 ± 0.072
0.937TrpPhe: 0.937 ± 0.078
0.543TrpGly: 0.543 ± 0.068
0.29TrpHis: 0.29 ± 0.042
1.012TrpIle: 1.012 ± 0.084
0.989TrpLys: 0.989 ± 0.11
0.833TrpLeu: 0.833 ± 0.081
0.298TrpMet: 0.298 ± 0.052
0.811TrpAsn: 0.811 ± 0.084
0.335TrpPro: 0.335 ± 0.049
0.342TrpGln: 0.342 ± 0.048
0.632TrpArg: 0.632 ± 0.079
0.774TrpSer: 0.774 ± 0.077
0.751TrpThr: 0.751 ± 0.093
1.012TrpVal: 1.012 ± 0.078
0.171TrpTrp: 0.171 ± 0.038
0.662TrpTyr: 0.662 ± 0.072
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.358TyrAla: 2.358 ± 0.154
0.439TyrCys: 0.439 ± 0.062
3.682TyrAsp: 3.682 ± 0.168
2.797TyrGlu: 2.797 ± 0.152
2.239TyrPhe: 2.239 ± 0.125
3.109TyrGly: 3.109 ± 0.177
1.049TyrHis: 1.049 ± 0.094
3.459TyrIle: 3.459 ± 0.177
3.012TyrLys: 3.012 ± 0.18
2.931TyrLeu: 2.931 ± 0.16
1.056TyrMet: 1.056 ± 0.081
2.99TyrAsn: 2.99 ± 0.146
1.48TyrPro: 1.48 ± 0.11
1.235TyrGln: 1.235 ± 0.094
2.142TyrArg: 2.142 ± 0.121
3.079TyrSer: 3.079 ± 0.147
3.042TyrThr: 3.042 ± 0.16
3.399TyrVal: 3.399 ± 0.145
0.588TyrTrp: 0.588 ± 0.071
2.209TyrTyr: 2.209 ± 0.14
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 714 proteins (134448 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski