Amino acid dipepetide frequency for Erwinia phage vB_EamM_Bosolaphorus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.445AlaAla: 7.445 ± 0.415
0.675AlaCys: 0.675 ± 0.088
5.113AlaAsp: 5.113 ± 0.262
4.687AlaGlu: 4.687 ± 0.272
3.03AlaPhe: 3.03 ± 0.177
5.74AlaGly: 5.74 ± 0.385
1.503AlaHis: 1.503 ± 0.139
4.971AlaIle: 4.971 ± 0.271
4.853AlaLys: 4.853 ± 0.308
7.717AlaLeu: 7.717 ± 0.322
2.722AlaMet: 2.722 ± 0.181
4.19AlaAsn: 4.19 ± 0.24
2.983AlaPro: 2.983 ± 0.198
3.243AlaGln: 3.243 ± 0.281
3.681AlaArg: 3.681 ± 0.216
4.521AlaSer: 4.521 ± 0.269
4.983AlaThr: 4.983 ± 0.276
6.178AlaVal: 6.178 ± 0.32
1.053AlaTrp: 1.053 ± 0.12
2.734AlaTyr: 2.734 ± 0.207
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.092
0.071CysCys: 0.071 ± 0.032
0.556CysAsp: 0.556 ± 0.089
0.402CysGlu: 0.402 ± 0.073
0.379CysPhe: 0.379 ± 0.076
0.533CysGly: 0.533 ± 0.075
0.296CysHis: 0.296 ± 0.057
0.438CysIle: 0.438 ± 0.076
0.438CysLys: 0.438 ± 0.065
0.876CysLeu: 0.876 ± 0.099
0.296CysMet: 0.296 ± 0.066
0.462CysAsn: 0.462 ± 0.076
0.379CysPro: 0.379 ± 0.072
0.308CysGln: 0.308 ± 0.055
0.639CysArg: 0.639 ± 0.087
0.497CysSer: 0.497 ± 0.084
0.521CysThr: 0.521 ± 0.064
0.592CysVal: 0.592 ± 0.099
0.118CysTrp: 0.118 ± 0.038
0.379CysTyr: 0.379 ± 0.064
0.0CysXaa: 0.0 ± 0.0
Asp
5.255AspAla: 5.255 ± 0.276
0.462AspCys: 0.462 ± 0.077
3.965AspAsp: 3.965 ± 0.23
4.048AspGlu: 4.048 ± 0.249
2.415AspPhe: 2.415 ± 0.196
4.107AspGly: 4.107 ± 0.304
1.373AspHis: 1.373 ± 0.123
4.06AspIle: 4.06 ± 0.208
3.515AspLys: 3.515 ± 0.341
5.563AspLeu: 5.563 ± 0.255
1.622AspMet: 1.622 ± 0.123
3.089AspAsn: 3.089 ± 0.171
2.746AspPro: 2.746 ± 0.217
2.415AspGln: 2.415 ± 0.183
3.196AspArg: 3.196 ± 0.188
3.196AspSer: 3.196 ± 0.195
3.444AspThr: 3.444 ± 0.27
4.533AspVal: 4.533 ± 0.278
0.734AspTrp: 0.734 ± 0.119
2.663AspTyr: 2.663 ± 0.196
0.0AspXaa: 0.0 ± 0.0
Glu
4.841GluAla: 4.841 ± 0.304
0.509GluCys: 0.509 ± 0.081
3.29GluAsp: 3.29 ± 0.226
3.634GluGlu: 3.634 ± 0.269
2.272GluPhe: 2.272 ± 0.168
3.314GluGly: 3.314 ± 0.192
1.326GluHis: 1.326 ± 0.124
3.468GluIle: 3.468 ± 0.171
3.018GluLys: 3.018 ± 0.211
6.462GluLeu: 6.462 ± 0.345
1.87GluMet: 1.87 ± 0.136
2.225GluAsn: 2.225 ± 0.139
1.917GluPro: 1.917 ± 0.151
2.746GluGln: 2.746 ± 0.17
3.279GluArg: 3.279 ± 0.224
3.101GluSer: 3.101 ± 0.181
2.935GluThr: 2.935 ± 0.189
4.19GluVal: 4.19 ± 0.215
1.006GluTrp: 1.006 ± 0.093
2.107GluTyr: 2.107 ± 0.144
0.0GluXaa: 0.0 ± 0.0
Phe
2.628PheAla: 2.628 ± 0.155
0.379PheCys: 0.379 ± 0.068
2.781PheAsp: 2.781 ± 0.185
2.166PheGlu: 2.166 ± 0.158
1.219PhePhe: 1.219 ± 0.125
2.261PheGly: 2.261 ± 0.143
0.71PheHis: 0.71 ± 0.092
2.379PheIle: 2.379 ± 0.175
2.272PheLys: 2.272 ± 0.178
2.486PheLeu: 2.486 ± 0.169
1.006PheMet: 1.006 ± 0.102
2.722PheAsn: 2.722 ± 0.176
1.29PhePro: 1.29 ± 0.118
0.959PheGln: 0.959 ± 0.102
1.479PheArg: 1.479 ± 0.139
2.048PheSer: 2.048 ± 0.131
2.71PheThr: 2.71 ± 0.172
2.71PheVal: 2.71 ± 0.188
0.284PheTrp: 0.284 ± 0.052
1.373PheTyr: 1.373 ± 0.129
0.0PheXaa: 0.0 ± 0.0
Gly
5.314GlyAla: 5.314 ± 0.476
0.45GlyCys: 0.45 ± 0.074
4.841GlyAsp: 4.841 ± 0.839
4.119GlyGlu: 4.119 ± 0.258
2.249GlyPhe: 2.249 ± 0.155
5.942GlyGly: 5.942 ± 0.743
1.065GlyHis: 1.065 ± 0.128
4.202GlyIle: 4.202 ± 0.19
3.752GlyLys: 3.752 ± 0.211
5.729GlyLeu: 5.729 ± 0.267
2.355GlyMet: 2.355 ± 0.168
3.586GlyAsn: 3.586 ± 0.252
1.574GlyPro: 1.574 ± 0.234
2.533GlyGln: 2.533 ± 0.201
2.9GlyArg: 2.9 ± 0.196
4.474GlySer: 4.474 ± 0.266
4.19GlyThr: 4.19 ± 0.309
4.521GlyVal: 4.521 ± 0.214
1.042GlyTrp: 1.042 ± 0.106
3.219GlyTyr: 3.219 ± 0.225
0.0GlyXaa: 0.0 ± 0.0
His
1.527HisAla: 1.527 ± 0.127
0.249HisCys: 0.249 ± 0.051
1.408HisAsp: 1.408 ± 0.115
1.255HisGlu: 1.255 ± 0.127
0.817HisPhe: 0.817 ± 0.114
1.456HisGly: 1.456 ± 0.138
0.663HisHis: 0.663 ± 0.093
1.574HisIle: 1.574 ± 0.143
0.805HisLys: 0.805 ± 0.105
1.823HisLeu: 1.823 ± 0.157
0.663HisMet: 0.663 ± 0.105
0.971HisAsn: 0.971 ± 0.116
1.006HisPro: 1.006 ± 0.112
1.006HisGln: 1.006 ± 0.127
1.231HisArg: 1.231 ± 0.145
0.935HisSer: 0.935 ± 0.134
1.231HisThr: 1.231 ± 0.141
1.55HisVal: 1.55 ± 0.155
0.402HisTrp: 0.402 ± 0.068
1.018HisTyr: 1.018 ± 0.125
0.0HisXaa: 0.0 ± 0.0
Ile
5.231IleAla: 5.231 ± 0.244
0.45IleCys: 0.45 ± 0.079
4.32IleAsp: 4.32 ± 0.242
3.918IleGlu: 3.918 ± 0.237
1.669IlePhe: 1.669 ± 0.163
3.752IleGly: 3.752 ± 0.169
1.148IleHis: 1.148 ± 0.124
3.148IleIle: 3.148 ± 0.182
3.326IleLys: 3.326 ± 0.183
4.166IleLeu: 4.166 ± 0.199
1.503IleMet: 1.503 ± 0.136
3.681IleAsn: 3.681 ± 0.206
3.03IlePro: 3.03 ± 0.197
1.977IleGln: 1.977 ± 0.133
3.006IleArg: 3.006 ± 0.19
3.811IleSer: 3.811 ± 0.214
4.983IleThr: 4.983 ± 0.335
3.669IleVal: 3.669 ± 0.229
0.521IleTrp: 0.521 ± 0.078
1.811IleTyr: 1.811 ± 0.157
0.0IleXaa: 0.0 ± 0.0
Lys
5.149LysAla: 5.149 ± 0.259
0.379LysCys: 0.379 ± 0.075
3.172LysAsp: 3.172 ± 0.229
3.468LysGlu: 3.468 ± 0.239
1.799LysPhe: 1.799 ± 0.132
4.214LysGly: 4.214 ± 0.86
1.349LysHis: 1.349 ± 0.138
2.71LysIle: 2.71 ± 0.198
2.628LysLys: 2.628 ± 0.193
4.829LysLeu: 4.829 ± 0.264
1.515LysMet: 1.515 ± 0.137
2.059LysAsn: 2.059 ± 0.171
2.355LysPro: 2.355 ± 0.156
2.296LysGln: 2.296 ± 0.19
2.864LysArg: 2.864 ± 0.192
2.805LysSer: 2.805 ± 0.186
2.971LysThr: 2.971 ± 0.224
3.48LysVal: 3.48 ± 0.231
0.71LysTrp: 0.71 ± 0.092
2.166LysTyr: 2.166 ± 0.159
0.0LysXaa: 0.0 ± 0.0
Leu
6.983LeuAla: 6.983 ± 0.303
1.219LeuCys: 1.219 ± 0.119
5.326LeuAsp: 5.326 ± 0.261
4.9LeuGlu: 4.9 ± 0.243
3.042LeuPhe: 3.042 ± 0.179
5.74LeuGly: 5.74 ± 0.28
1.657LeuHis: 1.657 ± 0.15
4.912LeuIle: 4.912 ± 0.289
4.711LeuLys: 4.711 ± 0.222
7.184LeuLeu: 7.184 ± 0.373
2.497LeuMet: 2.497 ± 0.183
4.829LeuAsn: 4.829 ± 0.221
4.095LeuPro: 4.095 ± 0.198
3.563LeuGln: 3.563 ± 0.215
4.794LeuArg: 4.794 ± 0.243
6.746LeuSer: 6.746 ± 0.283
6.001LeuThr: 6.001 ± 0.25
6.439LeuVal: 6.439 ± 0.28
0.876LeuTrp: 0.876 ± 0.122
3.219LeuTyr: 3.219 ± 0.215
0.0LeuXaa: 0.0 ± 0.0
Met
2.45MetAla: 2.45 ± 0.176
0.166MetCys: 0.166 ± 0.049
1.491MetAsp: 1.491 ± 0.135
1.326MetGlu: 1.326 ± 0.138
1.195MetPhe: 1.195 ± 0.101
1.716MetGly: 1.716 ± 0.165
0.615MetHis: 0.615 ± 0.089
1.61MetIle: 1.61 ± 0.147
1.539MetLys: 1.539 ± 0.121
2.793MetLeu: 2.793 ± 0.192
1.018MetMet: 1.018 ± 0.154
1.444MetAsn: 1.444 ± 0.115
1.326MetPro: 1.326 ± 0.167
1.397MetGln: 1.397 ± 0.131
1.326MetArg: 1.326 ± 0.118
2.521MetSer: 2.521 ± 0.154
2.557MetThr: 2.557 ± 0.2
1.953MetVal: 1.953 ± 0.148
0.225MetTrp: 0.225 ± 0.051
1.124MetTyr: 1.124 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
4.131AsnAla: 4.131 ± 0.21
0.379AsnCys: 0.379 ± 0.056
3.054AsnAsp: 3.054 ± 0.187
2.639AsnGlu: 2.639 ± 0.146
1.61AsnPhe: 1.61 ± 0.146
4.024AsnGly: 4.024 ± 0.235
1.113AsnHis: 1.113 ± 0.099
3.065AsnIle: 3.065 ± 0.164
2.888AsnLys: 2.888 ± 0.201
4.273AsnLeu: 4.273 ± 0.209
1.479AsnMet: 1.479 ± 0.138
2.675AsnAsn: 2.675 ± 0.185
3.089AsnPro: 3.089 ± 0.215
1.988AsnGln: 1.988 ± 0.151
2.592AsnArg: 2.592 ± 0.197
2.722AsnSer: 2.722 ± 0.187
3.669AsnThr: 3.669 ± 0.197
3.421AsnVal: 3.421 ± 0.167
0.568AsnTrp: 0.568 ± 0.075
1.799AsnTyr: 1.799 ± 0.162
0.0AsnXaa: 0.0 ± 0.0
Pro
3.858ProAla: 3.858 ± 0.25
0.213ProCys: 0.213 ± 0.046
2.994ProAsp: 2.994 ± 0.239
2.947ProGlu: 2.947 ± 0.198
1.527ProPhe: 1.527 ± 0.17
2.841ProGly: 2.841 ± 0.185
0.9ProHis: 0.9 ± 0.105
2.497ProIle: 2.497 ± 0.182
2.048ProLys: 2.048 ± 0.156
3.456ProLeu: 3.456 ± 0.193
1.148ProMet: 1.148 ± 0.132
1.917ProAsn: 1.917 ± 0.169
1.657ProPro: 1.657 ± 0.179
2.024ProGln: 2.024 ± 0.287
1.669ProArg: 1.669 ± 0.147
2.545ProSer: 2.545 ± 0.181
2.971ProThr: 2.971 ± 0.204
3.728ProVal: 3.728 ± 0.239
0.521ProTrp: 0.521 ± 0.085
1.598ProTyr: 1.598 ± 0.138
0.0ProXaa: 0.0 ± 0.0
Gln
3.634GlnAla: 3.634 ± 0.241
0.402GlnCys: 0.402 ± 0.075
1.728GlnAsp: 1.728 ± 0.138
1.917GlnGlu: 1.917 ± 0.179
1.681GlnPhe: 1.681 ± 0.157
2.45GlnGly: 2.45 ± 0.301
1.219GlnHis: 1.219 ± 0.131
2.119GlnIle: 2.119 ± 0.146
1.598GlnLys: 1.598 ± 0.137
4.225GlnLeu: 4.225 ± 0.224
1.314GlnMet: 1.314 ± 0.206
1.657GlnAsn: 1.657 ± 0.16
2.166GlnPro: 2.166 ± 0.229
2.9GlnGln: 2.9 ± 0.417
2.225GlnArg: 2.225 ± 0.156
1.988GlnSer: 1.988 ± 0.151
2.367GlnThr: 2.367 ± 0.207
2.77GlnVal: 2.77 ± 0.189
0.58GlnTrp: 0.58 ± 0.091
1.835GlnTyr: 1.835 ± 0.125
0.0GlnXaa: 0.0 ± 0.0
Arg
3.634ArgAla: 3.634 ± 0.233
0.497ArgCys: 0.497 ± 0.09
2.912ArgAsp: 2.912 ± 0.212
3.113ArgGlu: 3.113 ± 0.211
2.379ArgPhe: 2.379 ± 0.15
2.817ArgGly: 2.817 ± 0.183
1.101ArgHis: 1.101 ± 0.118
3.042ArgIle: 3.042 ± 0.195
2.486ArgLys: 2.486 ± 0.178
5.149ArgLeu: 5.149 ± 0.265
1.479ArgMet: 1.479 ± 0.135
2.983ArgAsn: 2.983 ± 0.154
1.846ArgPro: 1.846 ± 0.158
2.036ArgGln: 2.036 ± 0.163
2.651ArgArg: 2.651 ± 0.208
2.912ArgSer: 2.912 ± 0.187
2.734ArgThr: 2.734 ± 0.213
3.255ArgVal: 3.255 ± 0.197
0.604ArgTrp: 0.604 ± 0.089
2.367ArgTyr: 2.367 ± 0.172
0.0ArgXaa: 0.0 ± 0.0
Ser
4.438SerAla: 4.438 ± 0.269
0.615SerCys: 0.615 ± 0.098
3.657SerAsp: 3.657 ± 0.195
3.042SerGlu: 3.042 ± 0.203
2.166SerPhe: 2.166 ± 0.136
4.296SerGly: 4.296 ± 0.282
1.136SerHis: 1.136 ± 0.13
3.716SerIle: 3.716 ± 0.258
3.125SerLys: 3.125 ± 0.192
5.373SerLeu: 5.373 ± 0.273
1.882SerMet: 1.882 ± 0.159
3.148SerAsn: 3.148 ± 0.198
2.557SerPro: 2.557 ± 0.161
2.308SerGln: 2.308 ± 0.171
2.888SerArg: 2.888 ± 0.18
3.503SerSer: 3.503 ± 0.28
4.237SerThr: 4.237 ± 0.242
4.509SerVal: 4.509 ± 0.238
0.698SerTrp: 0.698 ± 0.093
1.977SerTyr: 1.977 ± 0.143
0.0SerXaa: 0.0 ± 0.0
Thr
5.646ThrAla: 5.646 ± 0.299
0.379ThrCys: 0.379 ± 0.067
4.225ThrAsp: 4.225 ± 0.204
3.314ThrGlu: 3.314 ± 0.175
2.261ThrPhe: 2.261 ± 0.177
5.054ThrGly: 5.054 ± 0.439
1.586ThrHis: 1.586 ± 0.145
3.622ThrIle: 3.622 ± 0.226
3.125ThrLys: 3.125 ± 0.175
5.634ThrLeu: 5.634 ± 0.255
1.681ThrMet: 1.681 ± 0.18
3.089ThrAsn: 3.089 ± 0.248
3.93ThrPro: 3.93 ± 0.217
2.639ThrGln: 2.639 ± 0.183
3.113ThrArg: 3.113 ± 0.166
3.622ThrSer: 3.622 ± 0.232
4.521ThrThr: 4.521 ± 0.241
4.888ThrVal: 4.888 ± 0.249
0.675ThrTrp: 0.675 ± 0.08
2.687ThrTyr: 2.687 ± 0.192
0.0ThrXaa: 0.0 ± 0.0
Val
5.492ValAla: 5.492 ± 0.267
0.757ValCys: 0.757 ± 0.097
4.32ValAsp: 4.32 ± 0.233
4.036ValGlu: 4.036 ± 0.243
2.415ValPhe: 2.415 ± 0.176
4.58ValGly: 4.58 ± 0.226
1.373ValHis: 1.373 ± 0.128
4.9ValIle: 4.9 ± 0.26
4.379ValLys: 4.379 ± 0.244
5.705ValLeu: 5.705 ± 0.282
2.379ValMet: 2.379 ± 0.14
3.835ValAsn: 3.835 ± 0.226
2.817ValPro: 2.817 ± 0.202
2.237ValGln: 2.237 ± 0.172
3.456ValArg: 3.456 ± 0.207
4.616ValSer: 4.616 ± 0.214
5.35ValThr: 5.35 ± 0.26
6.32ValVal: 6.32 ± 0.327
0.734ValTrp: 0.734 ± 0.104
2.557ValTyr: 2.557 ± 0.178
0.0ValXaa: 0.0 ± 0.0
Trp
0.864TrpAla: 0.864 ± 0.088
0.059TrpCys: 0.059 ± 0.026
0.793TrpAsp: 0.793 ± 0.094
0.592TrpGlu: 0.592 ± 0.087
0.592TrpPhe: 0.592 ± 0.077
0.604TrpGly: 0.604 ± 0.076
0.355TrpHis: 0.355 ± 0.067
0.639TrpIle: 0.639 ± 0.088
0.521TrpLys: 0.521 ± 0.081
1.515TrpLeu: 1.515 ± 0.151
0.521TrpMet: 0.521 ± 0.071
0.462TrpAsn: 0.462 ± 0.076
0.32TrpPro: 0.32 ± 0.055
0.391TrpGln: 0.391 ± 0.073
0.722TrpArg: 0.722 ± 0.098
0.71TrpSer: 0.71 ± 0.107
0.793TrpThr: 0.793 ± 0.112
0.911TrpVal: 0.911 ± 0.108
0.166TrpTrp: 0.166 ± 0.045
0.544TrpTyr: 0.544 ± 0.071
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.864TyrAla: 2.864 ± 0.22
0.509TyrCys: 0.509 ± 0.086
2.45TyrAsp: 2.45 ± 0.175
1.988TyrGlu: 1.988 ± 0.158
1.148TyrPhe: 1.148 ± 0.11
2.616TyrGly: 2.616 ± 0.165
1.101TyrHis: 1.101 ± 0.107
2.048TyrIle: 2.048 ± 0.157
1.894TyrLys: 1.894 ± 0.139
3.776TyrLeu: 3.776 ± 0.201
0.829TyrMet: 0.829 ± 0.105
2.166TyrAsn: 2.166 ± 0.127
1.977TyrPro: 1.977 ± 0.219
1.811TyrGln: 1.811 ± 0.136
2.284TyrArg: 2.284 ± 0.145
2.0TyrSer: 2.0 ± 0.147
2.521TyrThr: 2.521 ± 0.149
2.639TyrVal: 2.639 ± 0.176
0.556TyrTrp: 0.556 ± 0.066
1.704TyrTyr: 1.704 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 320 proteins (84490 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski