Amino acid dipeptide frequency for Erwinia phage vB_EamM_Y3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
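The comparison against the uniform expectation can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the use of one-letter codes, the overlapping two-residue windows, and the normalization by the total number of windows are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of two residues are counted within each
    protein (never across protein boundaries) and normalized by the total
    number of windows. The database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- or underrepresented versus the expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy input in one-letter code (the table uses three-letter codes).
toy_proteins = ["MAAK", "AAL"]
freqs = dipeptide_frequencies(toy_proteins)
```

With the values from the table, `classify(8.289)` (AlaAla) gives "over" and `classify(0.134)` (CysCys) gives "under", which corresponds to the red and blue marking described above.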

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
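As a small worked conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and only serve the illustration; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10


ala_ala = 8.289  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.008289
percent = per_mille_to_percent(ala_ala)    # about 0.8289 %
```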

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.289 ± 0.491
AlaCys: 0.695 ± 0.088
AlaAsp: 5.9 ± 0.311
AlaGlu: 5.095 ± 0.323
AlaPhe: 3.364 ± 0.183
AlaGly: 5.205 ± 0.325
AlaHis: 1.646 ± 0.146
AlaIle: 4.583 ± 0.257
AlaLys: 6.29 ± 0.521
AlaLeu: 7.436 ± 0.297
AlaMet: 2.401 ± 0.176
AlaAsn: 3.633 ± 0.212
AlaPro: 2.974 ± 0.211
AlaGln: 3.267 ± 0.211
AlaArg: 4.035 ± 0.237
AlaSer: 5.461 ± 0.266
AlaThr: 4.693 ± 0.349
AlaVal: 5.486 ± 0.278
AlaTrp: 0.939 ± 0.093
AlaTyr: 2.743 ± 0.172
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.768 ± 0.097
CysCys: 0.134 ± 0.041
CysAsp: 0.841 ± 0.104
CysGlu: 0.5 ± 0.081
CysPhe: 0.488 ± 0.076
CysGly: 0.975 ± 0.119
CysHis: 0.256 ± 0.06
CysIle: 0.585 ± 0.089
CysLys: 0.561 ± 0.085
CysLeu: 0.841 ± 0.101
CysMet: 0.28 ± 0.07
CysAsn: 0.317 ± 0.059
CysPro: 0.756 ± 0.1
CysGln: 0.317 ± 0.06
CysArg: 0.561 ± 0.09
CysSer: 0.914 ± 0.107
CysThr: 0.549 ± 0.085
CysVal: 0.817 ± 0.112
CysTrp: 0.134 ± 0.047
CysTyr: 0.366 ± 0.082
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.595 ± 0.272
AspCys: 0.914 ± 0.106
AspAsp: 5.132 ± 0.573
AspGlu: 5.193 ± 0.554
AspPhe: 2.56 ± 0.163
AspGly: 4.437 ± 0.286
AspHis: 0.963 ± 0.105
AspIle: 4.206 ± 0.244
AspLys: 3.572 ± 0.251
AspLeu: 5.949 ± 0.262
AspMet: 1.889 ± 0.133
AspAsn: 2.865 ± 0.197
AspPro: 2.926 ± 0.219
AspGln: 2.511 ± 0.165
AspArg: 2.913 ± 0.182
AspSer: 4.206 ± 0.252
AspThr: 3.669 ± 0.21
AspVal: 5.047 ± 0.268
AspTrp: 0.914 ± 0.102
AspTyr: 2.523 ± 0.189
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.498 ± 0.323
GluCys: 0.622 ± 0.085
GluAsp: 4.669 ± 0.533
GluGlu: 4.12 ± 0.536
GluPhe: 2.389 ± 0.164
GluGly: 2.999 ± 0.169
GluHis: 1.499 ± 0.138
GluIle: 3.425 ± 0.212
GluLys: 3.182 ± 0.237
GluLeu: 5.705 ± 0.32
GluMet: 1.633 ± 0.152
GluAsn: 2.475 ± 0.172
GluPro: 2.194 ± 0.162
GluGln: 2.45 ± 0.174
GluArg: 3.084 ± 0.21
GluSer: 3.694 ± 0.244
GluThr: 2.962 ± 0.204
GluVal: 3.925 ± 0.211
GluTrp: 0.707 ± 0.102
GluTyr: 2.34 ± 0.158
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.096 ± 0.196
PheCys: 0.585 ± 0.092
PheAsp: 3.95 ± 0.25
PheGlu: 2.523 ± 0.175
PhePhe: 1.304 ± 0.12
PheGly: 2.596 ± 0.173
PheHis: 0.61 ± 0.096
PheIle: 2.328 ± 0.155
PheLys: 2.194 ± 0.139
PheLeu: 2.682 ± 0.173
PheMet: 1.012 ± 0.111
PheAsn: 2.048 ± 0.136
PhePro: 1.365 ± 0.117
PheGln: 1.146 ± 0.125
PheArg: 1.975 ± 0.157
PheSer: 2.682 ± 0.19
PheThr: 2.572 ± 0.156
PheVal: 2.755 ± 0.187
PheTrp: 0.378 ± 0.066
PheTyr: 1.292 ± 0.121
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.913 ± 0.269
GlyCys: 0.634 ± 0.091
GlyAsp: 3.657 ± 0.265
GlyGlu: 3.108 ± 0.195
GlyPhe: 2.633 ± 0.16
GlyGly: 4.583 ± 0.365
GlyHis: 1.085 ± 0.119
GlyIle: 3.389 ± 0.197
GlyLys: 4.132 ± 0.269
GlyLeu: 5.095 ± 0.213
GlyMet: 1.768 ± 0.146
GlyAsn: 2.328 ± 0.188
GlyPro: 2.316 ± 0.271
GlyGln: 2.401 ± 0.189
GlyArg: 3.511 ± 0.244
GlySer: 4.888 ± 0.287
GlyThr: 4.681 ± 0.322
GlyVal: 4.73 ± 0.217
GlyTrp: 0.878 ± 0.12
GlyTyr: 2.158 ± 0.187
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.487 ± 0.134
HisCys: 0.232 ± 0.061
HisAsp: 1.231 ± 0.139
HisGlu: 1.0 ± 0.093
HisPhe: 0.744 ± 0.093
HisGly: 1.317 ± 0.12
HisHis: 0.475 ± 0.098
HisIle: 1.377 ± 0.16
HisLys: 1.121 ± 0.139
HisLeu: 1.548 ± 0.158
HisMet: 0.622 ± 0.098
HisAsn: 1.048 ± 0.145
HisPro: 0.878 ± 0.118
HisGln: 0.536 ± 0.077
HisArg: 1.036 ± 0.122
HisSer: 1.17 ± 0.135
HisThr: 1.073 ± 0.097
HisVal: 1.573 ± 0.138
HisTrp: 0.317 ± 0.056
HisTyr: 0.744 ± 0.089
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.998 ± 0.277
IleCys: 0.61 ± 0.087
IleAsp: 4.267 ± 0.215
IleGlu: 3.901 ± 0.194
IlePhe: 1.816 ± 0.128
IleGly: 3.072 ± 0.193
IleHis: 1.134 ± 0.136
IleIle: 2.926 ± 0.18
IleLys: 3.328 ± 0.225
IleLeu: 3.694 ± 0.203
IleMet: 1.487 ± 0.148
IleAsn: 2.743 ± 0.205
IlePro: 2.657 ± 0.199
IleGln: 2.304 ± 0.155
IleArg: 3.182 ± 0.19
IleSer: 3.767 ± 0.211
IleThr: 3.328 ± 0.238
IleVal: 4.681 ± 0.224
IleTrp: 0.402 ± 0.081
IleTyr: 1.646 ± 0.124
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.9 ± 0.587
LysCys: 0.536 ± 0.082
LysAsp: 3.486 ± 0.21
LysGlu: 3.316 ± 0.258
LysPhe: 2.219 ± 0.177
LysGly: 3.364 ± 0.268
LysHis: 1.402 ± 0.165
LysIle: 2.889 ± 0.211
LysLys: 5.681 ± 0.599
LysLeu: 5.388 ± 0.311
LysMet: 1.487 ± 0.153
LysAsn: 2.304 ± 0.154
LysPro: 3.255 ± 0.264
LysGln: 2.645 ± 0.174
LysArg: 3.523 ± 0.234
LysSer: 3.95 ± 0.27
LysThr: 3.925 ± 0.197
LysVal: 3.815 ± 0.249
LysTrp: 0.597 ± 0.085
LysTyr: 2.28 ± 0.174
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.79 ± 0.289
LeuCys: 0.878 ± 0.11
LeuAsp: 6.229 ± 0.331
LeuGlu: 4.437 ± 0.226
LeuPhe: 3.011 ± 0.17
LeuGly: 5.108 ± 0.25
LeuHis: 1.438 ± 0.132
LeuIle: 4.608 ± 0.259
LeuLys: 5.583 ± 0.381
LeuLeu: 6.485 ± 0.325
LeuMet: 2.511 ± 0.177
LeuAsn: 4.608 ± 0.255
LeuPro: 4.352 ± 0.206
LeuGln: 2.584 ± 0.185
LeuArg: 4.486 ± 0.335
LeuSer: 6.241 ± 0.288
LeuThr: 5.51 ± 0.282
LeuVal: 5.315 ± 0.262
LeuTrp: 0.683 ± 0.095
LeuTyr: 2.572 ± 0.173
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.048 ± 0.164
MetCys: 0.293 ± 0.062
MetAsp: 1.56 ± 0.153
MetGlu: 1.243 ± 0.124
MetPhe: 1.451 ± 0.127
MetGly: 1.402 ± 0.135
MetHis: 0.67 ± 0.104
MetIle: 1.292 ± 0.132
MetLys: 1.829 ± 0.186
MetLeu: 2.511 ± 0.175
MetMet: 0.805 ± 0.115
MetAsn: 1.243 ± 0.112
MetPro: 1.597 ± 0.14
MetGln: 1.317 ± 0.13
MetArg: 1.682 ± 0.135
MetSer: 2.048 ± 0.176
MetThr: 1.938 ± 0.151
MetVal: 1.426 ± 0.142
MetTrp: 0.232 ± 0.054
MetTyr: 1.085 ± 0.109
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.767 ± 0.222
AsnCys: 0.39 ± 0.075
AsnAsp: 2.389 ± 0.187
AsnGlu: 2.255 ± 0.171
AsnPhe: 2.109 ± 0.154
AsnGly: 3.523 ± 0.231
AsnHis: 0.683 ± 0.096
AsnIle: 2.779 ± 0.209
AsnLys: 2.682 ± 0.181
AsnLeu: 3.474 ± 0.195
AsnMet: 1.487 ± 0.139
AsnAsn: 2.145 ± 0.176
AsnPro: 1.975 ± 0.16
AsnGln: 1.658 ± 0.151
AsnArg: 2.06 ± 0.172
AsnSer: 3.243 ± 0.207
AsnThr: 2.804 ± 0.212
AsnVal: 3.486 ± 0.197
AsnTrp: 0.427 ± 0.076
AsnTyr: 1.475 ± 0.137
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.511 ± 0.259
ProCys: 0.463 ± 0.09
ProAsp: 3.133 ± 0.209
ProGlu: 2.706 ± 0.185
ProPhe: 1.731 ± 0.144
ProGly: 2.072 ± 0.192
ProHis: 0.841 ± 0.102
ProIle: 2.231 ± 0.162
ProLys: 2.913 ± 0.209
ProLeu: 3.072 ± 0.215
ProMet: 1.036 ± 0.11
ProAsn: 1.95 ± 0.131
ProPro: 1.182 ± 0.131
ProGln: 1.938 ± 0.171
ProArg: 2.024 ± 0.162
ProSer: 2.938 ± 0.207
ProThr: 3.267 ± 0.226
ProVal: 4.035 ± 0.199
ProTrp: 0.329 ± 0.061
ProTyr: 1.195 ± 0.117
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.084 ± 0.175
GlnCys: 0.305 ± 0.056
GlnAsp: 1.95 ± 0.162
GlnGlu: 2.292 ± 0.178
GlnPhe: 1.646 ± 0.125
GlnGly: 2.048 ± 0.154
GlnHis: 0.658 ± 0.083
GlnIle: 2.292 ± 0.164
GlnLys: 1.902 ± 0.171
GlnLeu: 3.645 ± 0.2
GlnMet: 1.256 ± 0.138
GlnAsn: 1.621 ± 0.137
GlnPro: 1.426 ± 0.143
GlnGln: 1.877 ± 0.151
GlnArg: 2.353 ± 0.156
GlnSer: 2.767 ± 0.207
GlnThr: 2.572 ± 0.242
GlnVal: 2.267 ± 0.148
GlnTrp: 0.524 ± 0.094
GlnTyr: 1.548 ± 0.142
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.913 ± 0.214
ArgCys: 0.61 ± 0.084
ArgAsp: 3.133 ± 0.212
ArgGlu: 2.633 ± 0.199
ArgPhe: 2.206 ± 0.159
ArgGly: 3.169 ± 0.223
ArgHis: 1.061 ± 0.113
ArgIle: 3.462 ± 0.22
ArgLys: 3.304 ± 0.245
ArgLeu: 4.852 ± 0.269
ArgMet: 1.67 ± 0.152
ArgAsn: 2.121 ± 0.159
ArgPro: 1.78 ± 0.148
ArgGln: 1.804 ± 0.154
ArgArg: 2.95 ± 0.198
ArgSer: 3.206 ± 0.212
ArgThr: 3.108 ± 0.175
ArgVal: 3.828 ± 0.228
ArgTrp: 0.634 ± 0.09
ArgTyr: 2.011 ± 0.151
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.888 ± 0.267
SerCys: 0.805 ± 0.107
SerAsp: 5.108 ± 0.255
SerGlu: 3.864 ± 0.25
SerPhe: 2.536 ± 0.158
SerGly: 5.047 ± 0.265
SerHis: 1.207 ± 0.138
SerIle: 3.986 ± 0.239
SerLys: 4.011 ± 0.233
SerLeu: 5.62 ± 0.263
SerMet: 1.78 ± 0.167
SerAsn: 3.035 ± 0.206
SerPro: 2.755 ± 0.196
SerGln: 2.194 ± 0.147
SerArg: 2.974 ± 0.173
SerSer: 4.937 ± 0.247
SerThr: 4.474 ± 0.245
SerVal: 5.522 ± 0.264
SerTrp: 0.841 ± 0.113
SerTyr: 2.548 ± 0.187
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.863 ± 0.417
ThrCys: 0.695 ± 0.093
ThrAsp: 3.925 ± 0.231
ThrGlu: 3.377 ± 0.203
ThrPhe: 2.475 ± 0.211
ThrGly: 4.657 ± 0.318
ThrHis: 1.353 ± 0.154
ThrIle: 3.803 ± 0.255
ThrLys: 3.157 ± 0.249
ThrLeu: 5.425 ± 0.246
ThrMet: 1.377 ± 0.128
ThrAsn: 3.035 ± 0.155
ThrPro: 2.889 ± 0.226
ThrGln: 2.243 ± 0.194
ThrArg: 2.389 ± 0.175
ThrSer: 4.315 ± 0.265
ThrThr: 3.925 ± 0.323
ThrVal: 5.156 ± 0.439
ThrTrp: 0.67 ± 0.083
ThrTyr: 2.353 ± 0.191
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.851 ± 0.258
ValCys: 0.878 ± 0.117
ValAsp: 4.608 ± 0.306
ValGlu: 4.632 ± 0.26
ValPhe: 2.731 ± 0.192
ValGly: 4.242 ± 0.257
ValHis: 1.304 ± 0.12
ValIle: 3.547 ± 0.203
ValLys: 4.108 ± 0.26
ValLeu: 5.949 ± 0.308
ValMet: 1.816 ± 0.136
ValAsn: 3.194 ± 0.211
ValPro: 3.596 ± 0.197
ValGln: 2.852 ± 0.179
ValArg: 4.108 ± 0.262
ValSer: 5.278 ± 0.264
ValThr: 5.132 ± 0.373
ValVal: 5.156 ± 0.26
ValTrp: 0.951 ± 0.103
ValTyr: 2.67 ± 0.204
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.951 ± 0.118
TrpCys: 0.146 ± 0.041
TrpAsp: 0.488 ± 0.077
TrpGlu: 0.549 ± 0.07
TrpPhe: 0.463 ± 0.064
TrpGly: 0.585 ± 0.072
TrpHis: 0.366 ± 0.068
TrpIle: 0.646 ± 0.084
TrpLys: 0.646 ± 0.094
TrpLeu: 0.939 ± 0.108
TrpMet: 0.256 ± 0.063
TrpAsn: 0.463 ± 0.072
TrpPro: 0.475 ± 0.077
TrpGln: 0.439 ± 0.08
TrpArg: 0.658 ± 0.091
TrpSer: 1.048 ± 0.119
TrpThr: 0.744 ± 0.101
TrpVal: 0.792 ± 0.119
TrpTrp: 0.073 ± 0.026
TrpTyr: 0.402 ± 0.069
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.926 ± 0.207
TyrCys: 0.536 ± 0.094
TyrAsp: 2.328 ± 0.164
TyrGlu: 1.841 ± 0.169
TyrPhe: 1.219 ± 0.114
TyrGly: 2.487 ± 0.181
TyrHis: 0.878 ± 0.115
TyrIle: 1.585 ± 0.141
TyrLys: 1.768 ± 0.148
TyrLeu: 3.121 ± 0.2
TyrMet: 1.085 ± 0.125
TyrAsn: 1.633 ± 0.14
TyrPro: 1.219 ± 0.103
TyrGln: 1.548 ± 0.124
TyrArg: 1.95 ± 0.175
TyrSer: 2.377 ± 0.168
TyrThr: 2.158 ± 0.2
TyrVal: 2.792 ± 0.186
TyrTrp: 0.5 ± 0.079
TyrTyr: 1.073 ± 0.109
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 333 proteins (82035 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
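A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates. The function names, the fixed seed, and the choice of standard deviation as the error measure are hypothetical and not confirmed by the database.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues from one protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteins = ["MAAK", "AAL", "MKKL", "GAAG"]
error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps the dipeptide context within each sequence intact, so the estimate reflects variation between proteins.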

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski