Amino acid dipepetide frequency for Vibrio phage phi-ST2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.089AlaAla: 5.089 ± 0.32
0.704AlaCys: 0.704 ± 0.111
4.212AlaAsp: 4.212 ± 0.256
5.395AlaGlu: 5.395 ± 0.291
2.884AlaPhe: 2.884 ± 0.207
4.159AlaGly: 4.159 ± 0.286
1.395AlaHis: 1.395 ± 0.126
4.797AlaIle: 4.797 ± 0.243
4.505AlaLys: 4.505 ± 0.276
5.236AlaLeu: 5.236 ± 0.268
1.887AlaMet: 1.887 ± 0.171
3.362AlaAsn: 3.362 ± 0.23
1.794AlaPro: 1.794 ± 0.148
2.618AlaGln: 2.618 ± 0.171
3.588AlaArg: 3.588 ± 0.264
4.172AlaSer: 4.172 ± 0.275
4.199AlaThr: 4.199 ± 0.318
4.305AlaVal: 4.305 ± 0.254
0.89AlaTrp: 0.89 ± 0.098
2.963AlaTyr: 2.963 ± 0.2
0.0AlaXaa: 0.0 ± 0.0
Cys
0.757CysAla: 0.757 ± 0.095
0.199CysCys: 0.199 ± 0.053
1.036CysAsp: 1.036 ± 0.116
0.943CysGlu: 0.943 ± 0.104
0.345CysPhe: 0.345 ± 0.068
0.731CysGly: 0.731 ± 0.1
0.385CysHis: 0.385 ± 0.064
0.784CysIle: 0.784 ± 0.09
1.036CysLys: 1.036 ± 0.109
0.718CysLeu: 0.718 ± 0.108
0.226CysMet: 0.226 ± 0.057
0.651CysAsn: 0.651 ± 0.1
0.425CysPro: 0.425 ± 0.069
0.399CysGln: 0.399 ± 0.075
0.545CysArg: 0.545 ± 0.092
0.585CysSer: 0.585 ± 0.101
0.611CysThr: 0.611 ± 0.092
0.704CysVal: 0.704 ± 0.106
0.173CysTrp: 0.173 ± 0.048
0.492CysTyr: 0.492 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
4.717AspAla: 4.717 ± 0.254
0.864AspCys: 0.864 ± 0.106
5.262AspAsp: 5.262 ± 0.288
6.046AspGlu: 6.046 ± 0.33
3.322AspPhe: 3.322 ± 0.193
4.956AspGly: 4.956 ± 0.291
1.316AspHis: 1.316 ± 0.133
4.77AspIle: 4.77 ± 0.239
4.053AspLys: 4.053 ± 0.247
4.903AspLeu: 4.903 ± 0.27
1.913AspMet: 1.913 ± 0.184
3.428AspAsn: 3.428 ± 0.214
2.339AspPro: 2.339 ± 0.202
1.475AspGln: 1.475 ± 0.131
2.671AspArg: 2.671 ± 0.199
4.292AspSer: 4.292 ± 0.266
3.561AspThr: 3.561 ± 0.244
5.129AspVal: 5.129 ± 0.276
1.409AspTrp: 1.409 ± 0.153
3.521AspTyr: 3.521 ± 0.266
0.0AspXaa: 0.0 ± 0.0
Glu
5.355GluAla: 5.355 ± 0.31
0.784GluCys: 0.784 ± 0.097
4.212GluAsp: 4.212 ± 0.28
5.74GluGlu: 5.74 ± 0.383
3.654GluPhe: 3.654 ± 0.218
3.043GluGly: 3.043 ± 0.199
2.033GluHis: 2.033 ± 0.16
5.608GluIle: 5.608 ± 0.269
4.996GluLys: 4.996 ± 0.358
7.574GluLeu: 7.574 ± 0.338
2.246GluMet: 2.246 ± 0.219
3.641GluAsn: 3.641 ± 0.218
1.781GluPro: 1.781 ± 0.143
2.95GluGln: 2.95 ± 0.222
4.664GluArg: 4.664 ± 0.302
4.279GluSer: 4.279 ± 0.252
4.505GluThr: 4.505 ± 0.225
4.611GluVal: 4.611 ± 0.225
1.09GluTrp: 1.09 ± 0.119
3.707GluTyr: 3.707 ± 0.223
0.0GluXaa: 0.0 ± 0.0
Phe
2.737PheAla: 2.737 ± 0.189
0.492PheCys: 0.492 ± 0.088
4.04PheAsp: 4.04 ± 0.23
3.641PheGlu: 3.641 ± 0.255
1.262PhePhe: 1.262 ± 0.113
2.751PheGly: 2.751 ± 0.195
1.023PheHis: 1.023 ± 0.103
2.83PheIle: 2.83 ± 0.197
2.844PheLys: 2.844 ± 0.218
2.139PheLeu: 2.139 ± 0.155
1.302PheMet: 1.302 ± 0.11
2.565PheAsn: 2.565 ± 0.174
1.103PhePro: 1.103 ± 0.134
1.129PheGln: 1.129 ± 0.124
1.674PheArg: 1.674 ± 0.156
2.578PheSer: 2.578 ± 0.155
2.857PheThr: 2.857 ± 0.2
2.697PheVal: 2.697 ± 0.23
0.505PheTrp: 0.505 ± 0.083
1.528PheTyr: 1.528 ± 0.141
0.0PheXaa: 0.0 ± 0.0
Gly
3.628GlyAla: 3.628 ± 0.248
0.983GlyCys: 0.983 ± 0.125
4.664GlyAsp: 4.664 ± 0.254
4.159GlyGlu: 4.159 ± 0.223
2.432GlyPhe: 2.432 ± 0.242
4.079GlyGly: 4.079 ± 0.326
1.223GlyHis: 1.223 ± 0.146
3.893GlyIle: 3.893 ± 0.212
4.319GlyLys: 4.319 ± 0.272
4.079GlyLeu: 4.079 ± 0.236
1.847GlyMet: 1.847 ± 0.138
3.229GlyAsn: 3.229 ± 0.258
0.824GlyPro: 0.824 ± 0.108
1.94GlyGln: 1.94 ± 0.162
2.83GlyArg: 2.83 ± 0.193
4.398GlySer: 4.398 ± 0.369
3.973GlyThr: 3.973 ± 0.302
4.491GlyVal: 4.491 ± 0.251
1.143GlyTrp: 1.143 ± 0.132
2.711GlyTyr: 2.711 ± 0.198
0.0GlyXaa: 0.0 ± 0.0
His
1.674HisAla: 1.674 ± 0.151
0.332HisCys: 0.332 ± 0.062
1.648HisAsp: 1.648 ± 0.147
1.86HisGlu: 1.86 ± 0.184
0.997HisPhe: 0.997 ± 0.109
1.595HisGly: 1.595 ± 0.14
0.638HisHis: 0.638 ± 0.103
1.475HisIle: 1.475 ± 0.144
1.329HisLys: 1.329 ± 0.139
1.541HisLeu: 1.541 ± 0.158
0.585HisMet: 0.585 ± 0.09
0.943HisAsn: 0.943 ± 0.099
1.076HisPro: 1.076 ± 0.124
0.638HisGln: 0.638 ± 0.089
0.93HisArg: 0.93 ± 0.111
1.103HisSer: 1.103 ± 0.123
1.541HisThr: 1.541 ± 0.177
1.634HisVal: 1.634 ± 0.149
0.345HisTrp: 0.345 ± 0.073
0.917HisTyr: 0.917 ± 0.113
0.0HisXaa: 0.0 ± 0.0
Ile
5.036IleAla: 5.036 ± 0.255
0.678IleCys: 0.678 ± 0.083
5.94IleAsp: 5.94 ± 0.253
6.206IleGlu: 6.206 ± 0.32
2.193IlePhe: 2.193 ± 0.159
3.628IleGly: 3.628 ± 0.2
1.395IleHis: 1.395 ± 0.133
3.787IleIle: 3.787 ± 0.245
4.611IleLys: 4.611 ± 0.295
3.787IleLeu: 3.787 ± 0.254
1.887IleMet: 1.887 ± 0.154
3.88IleAsn: 3.88 ± 0.254
2.046IlePro: 2.046 ± 0.158
2.113IleGln: 2.113 ± 0.171
3.229IleArg: 3.229 ± 0.199
3.628IleSer: 3.628 ± 0.239
4.079IleThr: 4.079 ± 0.237
4.638IleVal: 4.638 ± 0.278
0.571IleTrp: 0.571 ± 0.09
2.113IleTyr: 2.113 ± 0.163
0.0IleXaa: 0.0 ± 0.0
Lys
4.77LysAla: 4.77 ± 0.242
0.718LysCys: 0.718 ± 0.093
3.933LysAsp: 3.933 ± 0.275
4.638LysGlu: 4.638 ± 0.336
2.644LysPhe: 2.644 ± 0.193
3.442LysGly: 3.442 ± 0.201
1.794LysHis: 1.794 ± 0.195
4.279LysIle: 4.279 ± 0.295
4.93LysLys: 4.93 ± 0.378
5.382LysLeu: 5.382 ± 0.294
2.259LysMet: 2.259 ± 0.184
3.096LysAsn: 3.096 ± 0.22
2.392LysPro: 2.392 ± 0.174
2.684LysGln: 2.684 ± 0.226
4.226LysArg: 4.226 ± 0.272
3.84LysSer: 3.84 ± 0.235
3.84LysThr: 3.84 ± 0.237
4.04LysVal: 4.04 ± 0.283
0.997LysTrp: 0.997 ± 0.105
2.857LysTyr: 2.857 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
5.329LeuAla: 5.329 ± 0.25
0.85LeuCys: 0.85 ± 0.112
5.674LeuAsp: 5.674 ± 0.324
5.807LeuGlu: 5.807 ± 0.315
2.644LeuPhe: 2.644 ± 0.201
4.478LeuGly: 4.478 ± 0.255
1.541LeuHis: 1.541 ± 0.166
4.026LeuIle: 4.026 ± 0.231
5.023LeuLys: 5.023 ± 0.311
5.541LeuLeu: 5.541 ± 0.248
2.073LeuMet: 2.073 ± 0.18
4.345LeuAsn: 4.345 ± 0.238
2.724LeuPro: 2.724 ± 0.195
2.445LeuGln: 2.445 ± 0.171
4.133LeuArg: 4.133 ± 0.265
5.395LeuSer: 5.395 ± 0.276
4.385LeuThr: 4.385 ± 0.208
4.305LeuVal: 4.305 ± 0.239
0.824LeuTrp: 0.824 ± 0.101
3.096LeuTyr: 3.096 ± 0.206
0.0LeuXaa: 0.0 ± 0.0
Met
1.488MetAla: 1.488 ± 0.133
0.399MetCys: 0.399 ± 0.081
0.904MetAsp: 0.904 ± 0.123
0.997MetGlu: 0.997 ± 0.129
1.276MetPhe: 1.276 ± 0.126
1.169MetGly: 1.169 ± 0.12
0.532MetHis: 0.532 ± 0.087
2.232MetIle: 2.232 ± 0.202
2.684MetLys: 2.684 ± 0.216
2.153MetLeu: 2.153 ± 0.199
0.824MetMet: 0.824 ± 0.116
1.648MetAsn: 1.648 ± 0.146
1.103MetPro: 1.103 ± 0.122
1.262MetGln: 1.262 ± 0.135
1.382MetArg: 1.382 ± 0.16
2.671MetSer: 2.671 ± 0.221
2.046MetThr: 2.046 ± 0.16
1.076MetVal: 1.076 ± 0.117
0.239MetTrp: 0.239 ± 0.055
1.076MetTyr: 1.076 ± 0.111
0.0MetXaa: 0.0 ± 0.0
Asn
3.641AsnAla: 3.641 ± 0.274
0.585AsnCys: 0.585 ± 0.096
3.668AsnAsp: 3.668 ± 0.236
4.133AsnGlu: 4.133 ± 0.219
2.033AsnPhe: 2.033 ± 0.144
4.345AsnGly: 4.345 ± 0.267
0.97AsnHis: 0.97 ± 0.098
3.654AsnIle: 3.654 ± 0.229
3.402AsnLys: 3.402 ± 0.228
3.721AsnLeu: 3.721 ± 0.227
1.276AsnMet: 1.276 ± 0.141
2.963AsnAsn: 2.963 ± 0.214
2.033AsnPro: 2.033 ± 0.184
1.555AsnGln: 1.555 ± 0.146
2.458AsnArg: 2.458 ± 0.165
2.87AsnSer: 2.87 ± 0.212
2.83AsnThr: 2.83 ± 0.187
3.827AsnVal: 3.827 ± 0.254
0.571AsnTrp: 0.571 ± 0.086
2.193AsnTyr: 2.193 ± 0.198
0.0AsnXaa: 0.0 ± 0.0
Pro
1.82ProAla: 1.82 ± 0.148
0.332ProCys: 0.332 ± 0.061
2.259ProAsp: 2.259 ± 0.161
2.83ProGlu: 2.83 ± 0.213
1.409ProPhe: 1.409 ± 0.127
1.488ProGly: 1.488 ± 0.151
0.784ProHis: 0.784 ± 0.114
1.674ProIle: 1.674 ± 0.18
1.488ProLys: 1.488 ± 0.172
2.458ProLeu: 2.458 ± 0.174
0.638ProMet: 0.638 ± 0.088
2.007ProAsn: 2.007 ± 0.152
0.611ProPro: 0.611 ± 0.095
1.076ProGln: 1.076 ± 0.118
1.302ProArg: 1.302 ± 0.136
2.219ProSer: 2.219 ± 0.175
1.94ProThr: 1.94 ± 0.144
2.206ProVal: 2.206 ± 0.167
0.345ProTrp: 0.345 ± 0.063
1.422ProTyr: 1.422 ± 0.136
0.0ProXaa: 0.0 ± 0.0
Gln
2.179GlnAla: 2.179 ± 0.174
0.359GlnCys: 0.359 ± 0.071
1.887GlnAsp: 1.887 ± 0.171
2.817GlnGlu: 2.817 ± 0.183
1.435GlnPhe: 1.435 ± 0.145
1.94GlnGly: 1.94 ± 0.171
0.704GlnHis: 0.704 ± 0.094
2.538GlnIle: 2.538 ± 0.175
2.352GlnLys: 2.352 ± 0.193
2.804GlnLeu: 2.804 ± 0.202
0.93GlnMet: 0.93 ± 0.109
1.794GlnAsn: 1.794 ± 0.159
1.249GlnPro: 1.249 ± 0.122
1.249GlnGln: 1.249 ± 0.136
1.874GlnArg: 1.874 ± 0.16
1.874GlnSer: 1.874 ± 0.16
1.94GlnThr: 1.94 ± 0.178
1.993GlnVal: 1.993 ± 0.183
0.598GlnTrp: 0.598 ± 0.084
1.435GlnTyr: 1.435 ± 0.117
0.0GlnXaa: 0.0 ± 0.0
Arg
3.242ArgAla: 3.242 ± 0.234
0.611ArgCys: 0.611 ± 0.101
3.256ArgAsp: 3.256 ± 0.209
3.787ArgGlu: 3.787 ± 0.252
2.193ArgPhe: 2.193 ± 0.169
3.083ArgGly: 3.083 ± 0.198
1.05ArgHis: 1.05 ± 0.107
3.455ArgIle: 3.455 ± 0.184
3.428ArgLys: 3.428 ± 0.257
3.96ArgLeu: 3.96 ± 0.261
1.316ArgMet: 1.316 ± 0.146
2.179ArgAsn: 2.179 ± 0.181
1.488ArgPro: 1.488 ± 0.126
1.462ArgGln: 1.462 ± 0.13
2.538ArgArg: 2.538 ± 0.178
3.176ArgSer: 3.176 ± 0.236
2.671ArgThr: 2.671 ± 0.208
3.535ArgVal: 3.535 ± 0.216
0.864ArgTrp: 0.864 ± 0.096
2.392ArgTyr: 2.392 ± 0.215
0.0ArgXaa: 0.0 ± 0.0
Ser
4.026SerAla: 4.026 ± 0.24
0.638SerCys: 0.638 ± 0.102
4.305SerAsp: 4.305 ± 0.256
4.133SerGlu: 4.133 ± 0.266
3.07SerPhe: 3.07 ± 0.18
4.664SerGly: 4.664 ± 0.352
1.502SerHis: 1.502 ± 0.124
4.239SerIle: 4.239 ± 0.256
3.907SerLys: 3.907 ± 0.294
4.81SerLeu: 4.81 ± 0.234
1.701SerMet: 1.701 ± 0.158
3.402SerAsn: 3.402 ± 0.253
1.674SerPro: 1.674 ± 0.149
2.126SerGln: 2.126 ± 0.166
3.03SerArg: 3.03 ± 0.216
4.505SerSer: 4.505 ± 0.382
4.199SerThr: 4.199 ± 0.279
4.505SerVal: 4.505 ± 0.295
0.638SerTrp: 0.638 ± 0.084
2.379SerTyr: 2.379 ± 0.167
0.0SerXaa: 0.0 ± 0.0
Thr
4.093ThrAla: 4.093 ± 0.276
0.664ThrCys: 0.664 ± 0.096
3.774ThrAsp: 3.774 ± 0.265
4.252ThrGlu: 4.252 ± 0.235
2.498ThrPhe: 2.498 ± 0.182
4.279ThrGly: 4.279 ± 0.274
1.369ThrHis: 1.369 ± 0.147
4.226ThrIle: 4.226 ± 0.303
3.269ThrLys: 3.269 ± 0.247
4.797ThrLeu: 4.797 ± 0.27
1.329ThrMet: 1.329 ± 0.119
3.043ThrAsn: 3.043 ± 0.232
2.392ThrPro: 2.392 ± 0.214
2.472ThrGln: 2.472 ± 0.189
2.751ThrArg: 2.751 ± 0.204
3.654ThrSer: 3.654 ± 0.248
3.481ThrThr: 3.481 ± 0.249
4.863ThrVal: 4.863 ± 0.265
0.704ThrTrp: 0.704 ± 0.101
2.445ThrTyr: 2.445 ± 0.202
0.0ThrXaa: 0.0 ± 0.0
Val
4.558ValAla: 4.558 ± 0.29
0.824ValCys: 0.824 ± 0.116
5.01ValAsp: 5.01 ± 0.252
5.222ValGlu: 5.222 ± 0.262
3.189ValPhe: 3.189 ± 0.224
3.641ValGly: 3.641 ± 0.244
1.595ValHis: 1.595 ± 0.14
4.265ValIle: 4.265 ± 0.247
4.757ValLys: 4.757 ± 0.274
4.784ValLeu: 4.784 ± 0.282
1.462ValMet: 1.462 ± 0.113
3.415ValAsn: 3.415 ± 0.204
2.007ValPro: 2.007 ± 0.165
2.432ValGln: 2.432 ± 0.192
3.136ValArg: 3.136 ± 0.218
4.558ValSer: 4.558 ± 0.232
4.279ValThr: 4.279 ± 0.265
4.731ValVal: 4.731 ± 0.259
0.771ValTrp: 0.771 ± 0.096
2.937ValTyr: 2.937 ± 0.201
0.0ValXaa: 0.0 ± 0.0
Trp
0.864TrpAla: 0.864 ± 0.095
0.146TrpCys: 0.146 ± 0.043
1.01TrpAsp: 1.01 ± 0.128
1.01TrpGlu: 1.01 ± 0.107
0.678TrpPhe: 0.678 ± 0.095
0.757TrpGly: 0.757 ± 0.101
0.399TrpHis: 0.399 ± 0.068
0.505TrpIle: 0.505 ± 0.067
0.811TrpLys: 0.811 ± 0.093
1.129TrpLeu: 1.129 ± 0.122
0.399TrpMet: 0.399 ± 0.078
0.718TrpAsn: 0.718 ± 0.097
0.146TrpPro: 0.146 ± 0.044
0.478TrpGln: 0.478 ± 0.075
0.784TrpArg: 0.784 ± 0.095
0.943TrpSer: 0.943 ± 0.115
0.571TrpThr: 0.571 ± 0.103
1.129TrpVal: 1.129 ± 0.108
0.173TrpTrp: 0.173 ± 0.051
0.93TrpTyr: 0.93 ± 0.118
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.149TyrAla: 3.149 ± 0.199
0.585TyrCys: 0.585 ± 0.082
3.256TyrAsp: 3.256 ± 0.197
2.857TyrGlu: 2.857 ± 0.208
1.541TyrPhe: 1.541 ± 0.128
2.618TyrGly: 2.618 ± 0.214
1.156TyrHis: 1.156 ± 0.129
2.405TyrIle: 2.405 ± 0.203
3.016TyrLys: 3.016 ± 0.235
3.123TyrLeu: 3.123 ± 0.204
1.05TyrMet: 1.05 ± 0.117
2.418TyrAsn: 2.418 ± 0.161
1.103TyrPro: 1.103 ± 0.121
1.448TyrGln: 1.448 ± 0.138
1.94TyrArg: 1.94 ± 0.149
2.711TyrSer: 2.711 ± 0.178
2.857TyrThr: 2.857 ± 0.201
3.149TyrVal: 3.149 ± 0.225
0.731TyrTrp: 0.731 ± 0.124
1.581TyrTyr: 1.581 ± 0.195
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 383 proteins (75256 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski