Amino acid dipeptide frequency for Escherichia phage vB_EcoM_Goslar

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
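The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein only, and the mapping of every non-standard letter to `X` are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumptions (not confirmed by the source): dipeptides are overlapping,
    ordered pairs of adjacent residues, never counted across protein
    boundaries, and any non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Two made-up toy sequences, not taken from the phage proteome.
toy = ["MKKLA", "MKXA"]
freqs = dipeptide_per_mille(toy)

# Expectation for 400 equally likely dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

The result always has 441 keys, so dipeptides that never occur (such as all Xaa pairs in the table below) appear with a frequency of 0.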

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
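As a small worked example of the unit conversion and of the comparison with the 0.25% expectation, the sketch below uses two values from the table (LeuLeu and MetCys). The helper names are hypothetical, and the strict greater-than/less-than rule for the colour is an assumption; the page may use a different threshold.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def marking(value, expected=EXPECTED_PER_MILLE):
    """Return the colour for a per mille value (assumed rule, see lead-in)."""
    if value > expected:
        return "red"   # more frequent than expected
    if value < expected:
        return "blue"  # underrepresented
    return "none"


leu_leu = 7.467   # LeuLeu, from the table
met_cys = 0.136   # MetCys, from the table

leu_leu_fraction = per_mille_to_fraction(leu_leu)  # about 0.007467
leu_leu_colour = marking(leu_leu)
met_cys_colour = marking(met_cys)
```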

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.543 ± 0.47
AlaCys: 0.558 ± 0.087
AlaAsp: 4.897 ± 0.303
AlaGlu: 4.842 ± 0.306
AlaPhe: 2.911 ± 0.199
AlaGly: 4.135 ± 0.341
AlaHis: 1.143 ± 0.138
AlaIle: 5.482 ± 0.31
AlaLys: 4.121 ± 0.268
AlaLeu: 7.073 ± 0.373
AlaMet: 2.516 ± 0.189
AlaAsn: 3.632 ± 0.215
AlaPro: 2.816 ± 0.21
AlaGln: 2.598 ± 0.19
AlaArg: 3.999 ± 0.187
AlaSer: 4.461 ± 0.308
AlaThr: 5.495 ± 0.459
AlaVal: 5.618 ± 0.333
AlaTrp: 1.319 ± 0.099
AlaTyr: 3.237 ± 0.19
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.517 ± 0.089
CysCys: 0.286 ± 0.065
CysAsp: 0.53 ± 0.088
CysGlu: 0.517 ± 0.074
CysPhe: 0.422 ± 0.084
CysGly: 0.68 ± 0.092
CysHis: 0.286 ± 0.064
CysIle: 0.49 ± 0.087
CysLys: 0.49 ± 0.07
CysLeu: 0.843 ± 0.101
CysMet: 0.313 ± 0.061
CysAsn: 0.367 ± 0.07
CysPro: 0.381 ± 0.094
CysGln: 0.286 ± 0.068
CysArg: 0.517 ± 0.066
CysSer: 0.354 ± 0.065
CysThr: 0.585 ± 0.1
CysVal: 0.571 ± 0.103
CysTrp: 0.258 ± 0.067
CysTyr: 0.789 ± 0.095
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.21 ± 0.286
AspCys: 0.762 ± 0.108
AspAsp: 4.584 ± 0.28
AspGlu: 4.883 ± 0.299
AspPhe: 2.448 ± 0.188
AspGly: 4.937 ± 0.343
AspHis: 0.979 ± 0.116
AspIle: 4.516 ± 0.216
AspLys: 3.7 ± 0.247
AspLeu: 5.101 ± 0.305
AspMet: 1.523 ± 0.139
AspAsn: 3.21 ± 0.253
AspPro: 3.142 ± 0.179
AspGln: 1.823 ± 0.158
AspArg: 3.468 ± 0.227
AspSer: 3.278 ± 0.183
AspThr: 4.013 ± 0.243
AspVal: 5.536 ± 0.275
AspTrp: 1.115 ± 0.121
AspTyr: 2.598 ± 0.196
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.992 ± 0.333
GluCys: 0.544 ± 0.084
GluAsp: 4.094 ± 0.275
GluGlu: 4.339 ± 0.249
GluPhe: 2.38 ± 0.175
GluGly: 3.605 ± 0.22
GluHis: 1.387 ± 0.146
GluIle: 4.271 ± 0.244
GluLys: 3.02 ± 0.214
GluLeu: 6.393 ± 0.356
GluMet: 1.523 ± 0.161
GluAsn: 3.033 ± 0.249
GluPro: 2.136 ± 0.182
GluGln: 3.006 ± 0.211
GluArg: 4.407 ± 0.219
GluSer: 3.455 ± 0.205
GluThr: 3.863 ± 0.286
GluVal: 3.673 ± 0.232
GluTrp: 0.966 ± 0.108
GluTyr: 2.435 ± 0.191
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.571 ± 0.187
PheCys: 0.367 ± 0.075
PheAsp: 2.788 ± 0.209
PheGlu: 2.19 ± 0.168
PhePhe: 1.292 ± 0.119
PheGly: 2.149 ± 0.158
PheHis: 0.666 ± 0.083
PheIle: 2.285 ± 0.194
PheLys: 1.891 ± 0.134
PheLeu: 2.965 ± 0.205
PheMet: 1.306 ± 0.141
PheAsn: 2.353 ± 0.166
PhePro: 1.428 ± 0.119
PheGln: 1.061 ± 0.13
PheArg: 1.863 ± 0.151
PheSer: 2.435 ± 0.177
PheThr: 2.503 ± 0.188
PheVal: 2.639 ± 0.21
PheTrp: 0.53 ± 0.076
PheTyr: 1.469 ± 0.161
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.455 ± 0.357
GlyCys: 0.571 ± 0.087
GlyAsp: 3.523 ± 0.233
GlyGlu: 3.428 ± 0.239
GlyPhe: 2.421 ± 0.173
GlyGly: 3.7 ± 0.581
GlyHis: 0.952 ± 0.116
GlyIle: 4.013 ± 0.22
GlyLys: 3.55 ± 0.212
GlyLeu: 5.128 ± 0.352
GlyMet: 1.863 ± 0.164
GlyAsn: 3.673 ± 0.408
GlyPro: 1.292 ± 0.126
GlyGln: 2.149 ± 0.201
GlyArg: 3.292 ± 0.231
GlySer: 3.564 ± 0.31
GlyThr: 4.04 ± 0.253
GlyVal: 4.393 ± 0.353
GlyTrp: 0.925 ± 0.118
GlyTyr: 3.047 ± 0.204
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.469 ± 0.151
HisCys: 0.245 ± 0.059
HisAsp: 1.251 ± 0.134
HisGlu: 1.197 ± 0.121
HisPhe: 0.721 ± 0.089
HisGly: 1.115 ± 0.131
HisHis: 0.53 ± 0.088
HisIle: 1.183 ± 0.151
HisLys: 0.694 ± 0.105
HisLeu: 1.809 ± 0.158
HisMet: 0.381 ± 0.068
HisAsn: 0.707 ± 0.1
HisPro: 1.428 ± 0.153
HisGln: 0.979 ± 0.111
HisArg: 1.605 ± 0.164
HisSer: 0.789 ± 0.094
HisThr: 1.02 ± 0.115
HisVal: 1.292 ± 0.136
HisTrp: 0.381 ± 0.075
HisTyr: 1.115 ± 0.12
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.196 ± 0.243
IleCys: 0.585 ± 0.086
IleAsp: 5.101 ± 0.248
IleGlu: 3.904 ± 0.236
IlePhe: 1.564 ± 0.171
IleGly: 3.332 ± 0.254
IleHis: 1.347 ± 0.148
IleIle: 2.911 ± 0.197
IleLys: 3.237 ± 0.195
IleLeu: 4.285 ± 0.223
IleMet: 1.265 ± 0.137
IleAsn: 3.006 ± 0.191
IlePro: 3.292 ± 0.223
IleGln: 2.421 ± 0.147
IleArg: 4.094 ± 0.265
IleSer: 3.904 ± 0.248
IleThr: 4.271 ± 0.285
IleVal: 3.645 ± 0.235
IleTrp: 0.68 ± 0.081
IleTyr: 2.353 ± 0.16
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.244 ± 0.286
LysCys: 0.422 ± 0.082
LysAsp: 3.264 ± 0.201
LysGlu: 3.632 ± 0.266
LysPhe: 1.591 ± 0.133
LysGly: 2.965 ± 0.278
LysHis: 1.523 ± 0.162
LysIle: 2.625 ± 0.155
LysLys: 2.707 ± 0.186
LysLeu: 5.563 ± 0.252
LysMet: 1.415 ± 0.125
LysAsn: 2.408 ± 0.187
LysPro: 2.367 ± 0.191
LysGln: 2.408 ± 0.193
LysArg: 3.428 ± 0.232
LysSer: 2.503 ± 0.208
LysThr: 3.196 ± 0.249
LysVal: 3.319 ± 0.211
LysTrp: 0.544 ± 0.102
LysTyr: 1.931 ± 0.187
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.032 ± 0.342
LeuCys: 0.871 ± 0.111
LeuAsp: 6.243 ± 0.274
LeuGlu: 5.522 ± 0.286
LeuPhe: 3.414 ± 0.26
LeuGly: 4.978 ± 0.284
LeuHis: 1.768 ± 0.172
LeuIle: 4.761 ± 0.296
LeuLys: 4.774 ± 0.302
LeuLeu: 7.467 ± 0.37
LeuMet: 2.204 ± 0.174
LeuAsn: 4.597 ± 0.228
LeuPro: 4.448 ± 0.243
LeuGln: 3.264 ± 0.205
LeuArg: 5.563 ± 0.28
LeuSer: 5.998 ± 0.271
LeuThr: 6.352 ± 0.289
LeuVal: 5.101 ± 0.28
LeuTrp: 0.979 ± 0.123
LeuTyr: 3.537 ± 0.238
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.122 ± 0.173
MetCys: 0.136 ± 0.047
MetAsp: 1.564 ± 0.133
MetGlu: 1.632 ± 0.169
MetPhe: 1.102 ± 0.12
MetGly: 1.578 ± 0.17
MetHis: 0.53 ± 0.088
MetIle: 1.251 ± 0.12
MetLys: 1.292 ± 0.15
MetLeu: 2.68 ± 0.184
MetMet: 0.762 ± 0.097
MetAsn: 1.469 ± 0.13
MetPro: 1.279 ± 0.115
MetGln: 1.211 ± 0.123
MetArg: 1.904 ± 0.175
MetSer: 2.027 ± 0.177
MetThr: 1.687 ± 0.192
MetVal: 1.85 ± 0.149
MetTrp: 0.218 ± 0.046
MetTyr: 0.83 ± 0.122
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.421 ± 0.35
AsnCys: 0.313 ± 0.06
AsnAsp: 3.047 ± 0.156
AsnGlu: 3.006 ± 0.202
AsnPhe: 1.619 ± 0.136
AsnGly: 3.605 ± 0.218
AsnHis: 1.007 ± 0.097
AsnIle: 3.128 ± 0.215
AsnLys: 3.074 ± 0.194
AsnLeu: 3.673 ± 0.215
AsnMet: 1.17 ± 0.135
AsnAsn: 2.734 ± 0.243
AsnPro: 2.666 ± 0.234
AsnGln: 1.863 ± 0.173
AsnArg: 2.911 ± 0.222
AsnSer: 2.516 ± 0.218
AsnThr: 3.428 ± 0.228
AsnVal: 3.414 ± 0.253
AsnTrp: 0.653 ± 0.096
AsnTyr: 1.741 ± 0.181
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.169 ± 0.263
ProCys: 0.381 ± 0.078
ProAsp: 3.006 ± 0.207
ProGlu: 3.4 ± 0.21
ProPhe: 1.659 ± 0.141
ProGly: 2.516 ± 0.271
ProHis: 1.02 ± 0.154
ProIle: 2.448 ± 0.215
ProLys: 2.299 ± 0.155
ProLeu: 3.509 ± 0.223
ProMet: 0.939 ± 0.102
ProAsn: 2.136 ± 0.169
ProPro: 1.333 ± 0.154
ProGln: 1.483 ± 0.167
ProArg: 2.067 ± 0.192
ProSer: 2.408 ± 0.167
ProThr: 3.224 ± 0.194
ProVal: 3.169 ± 0.241
ProTrp: 0.585 ± 0.077
ProTyr: 1.455 ± 0.143
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.047 ± 0.203
GlnCys: 0.313 ± 0.064
GlnAsp: 1.605 ± 0.133
GlnGlu: 2.435 ± 0.186
GlnPhe: 1.415 ± 0.129
GlnGly: 1.904 ± 0.154
GlnHis: 1.075 ± 0.116
GlnIle: 2.326 ± 0.178
GlnLys: 1.564 ± 0.158
GlnLeu: 4.312 ± 0.271
GlnMet: 0.979 ± 0.132
GlnAsn: 1.374 ± 0.147
GlnPro: 1.877 ± 0.214
GlnGln: 2.38 ± 0.236
GlnArg: 2.652 ± 0.228
GlnSer: 1.863 ± 0.153
GlnThr: 2.217 ± 0.183
GlnVal: 2.666 ± 0.216
GlnTrp: 0.598 ± 0.101
GlnTyr: 1.537 ± 0.16
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.013 ± 0.219
ArgCys: 0.626 ± 0.094
ArgAsp: 4.108 ± 0.262
ArgGlu: 3.618 ± 0.225
ArgPhe: 2.734 ± 0.181
ArgGly: 2.938 ± 0.219
ArgHis: 1.347 ± 0.145
ArgIle: 3.999 ± 0.234
ArgLys: 3.251 ± 0.203
ArgLeu: 5.414 ± 0.313
ArgMet: 1.85 ± 0.165
ArgAsn: 3.224 ± 0.209
ArgPro: 1.673 ± 0.139
ArgGln: 2.326 ± 0.183
ArgArg: 4.013 ± 0.281
ArgSer: 3.06 ± 0.211
ArgThr: 3.264 ± 0.179
ArgVal: 3.89 ± 0.225
ArgTrp: 1.047 ± 0.134
ArgTyr: 3.55 ± 0.2
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.747 ± 0.273
SerCys: 0.476 ± 0.078
SerAsp: 4.325 ± 0.271
SerGlu: 3.468 ± 0.232
SerPhe: 2.244 ± 0.176
SerGly: 3.904 ± 0.303
SerHis: 0.925 ± 0.116
SerIle: 3.414 ± 0.207
SerLys: 2.53 ± 0.21
SerLeu: 5.373 ± 0.264
SerMet: 1.714 ± 0.157
SerAsn: 2.571 ± 0.208
SerPro: 2.163 ± 0.173
SerGln: 1.945 ± 0.132
SerArg: 2.761 ± 0.191
SerSer: 3.305 ± 0.238
SerThr: 3.414 ± 0.236
SerVal: 4.584 ± 0.235
SerTrp: 0.762 ± 0.099
SerTyr: 2.503 ± 0.216
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.522 ± 0.502
ThrCys: 0.558 ± 0.107
ThrAsp: 4.094 ± 0.242
ThrGlu: 3.945 ± 0.255
ThrPhe: 2.34 ± 0.225
ThrGly: 3.985 ± 0.337
ThrHis: 1.17 ± 0.133
ThrIle: 3.904 ± 0.256
ThrLys: 3.509 ± 0.232
ThrLeu: 6.774 ± 0.348
ThrMet: 1.823 ± 0.154
ThrAsn: 2.979 ± 0.212
ThrPro: 3.142 ± 0.186
ThrGln: 2.163 ± 0.165
ThrArg: 3.21 ± 0.216
ThrSer: 3.591 ± 0.252
ThrThr: 5.019 ± 0.335
ThrVal: 5.155 ± 0.285
ThrTrp: 0.857 ± 0.091
ThrTyr: 2.462 ± 0.196
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.305 ± 0.286
ValCys: 0.653 ± 0.103
ValAsp: 4.815 ± 0.297
ValGlu: 4.91 ± 0.261
ValPhe: 2.598 ± 0.16
ValGly: 3.713 ± 0.248
ValHis: 1.061 ± 0.121
ValIle: 4.04 ± 0.246
ValLys: 3.904 ± 0.242
ValLeu: 5.142 ± 0.222
ValMet: 2.095 ± 0.162
ValAsn: 3.659 ± 0.262
ValPro: 2.897 ± 0.248
ValGln: 2.285 ± 0.192
ValArg: 4.081 ± 0.225
ValSer: 4.353 ± 0.253
ValThr: 4.869 ± 0.333
ValVal: 5.046 ± 0.309
ValTrp: 0.803 ± 0.085
ValTyr: 2.884 ± 0.189
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.83 ± 0.109
TrpCys: 0.231 ± 0.057
TrpAsp: 0.939 ± 0.12
TrpGlu: 0.816 ± 0.118
TrpPhe: 0.53 ± 0.078
TrpGly: 0.517 ± 0.084
TrpHis: 0.34 ± 0.075
TrpIle: 0.871 ± 0.127
TrpLys: 0.694 ± 0.1
TrpLeu: 1.564 ± 0.129
TrpMet: 0.435 ± 0.066
TrpAsn: 0.748 ± 0.111
TrpPro: 0.34 ± 0.065
TrpGln: 0.476 ± 0.063
TrpArg: 1.075 ± 0.146
TrpSer: 0.925 ± 0.122
TrpThr: 0.911 ± 0.12
TrpVal: 0.898 ± 0.101
TrpTrp: 0.326 ± 0.067
TrpTyr: 0.639 ± 0.088
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.992 ± 0.177
TyrCys: 0.585 ± 0.085
TyrAsp: 3.047 ± 0.27
TyrGlu: 1.836 ± 0.166
TyrPhe: 1.347 ± 0.132
TyrGly: 2.72 ± 0.17
TyrHis: 0.857 ± 0.11
TyrIle: 2.476 ± 0.168
TyrLys: 1.768 ± 0.144
TyrLeu: 3.809 ± 0.225
TyrMet: 1.034 ± 0.128
TyrAsn: 2.108 ± 0.188
TyrPro: 2.108 ± 0.16
TyrGln: 2.013 ± 0.164
TyrArg: 3.033 ± 0.244
TyrSer: 2.34 ± 0.181
TyrThr: 2.829 ± 0.22
TyrVal: 2.652 ± 0.157
TyrTrp: 0.558 ± 0.076
TyrTyr: 1.755 ± 0.155
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 247 proteins (73520 amino acids)
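To work with the table entries programmatically, a minimal parsing sketch follows. It assumes entries of the form `AlaAla: 6.543 ± 0.47`, optionally preceded by a repeated copy of the value, and skips any line that does not match (such as the row labels). The names `ENTRY` and `parse_entries` are hypothetical. Since the values are per mille frequencies over all 441 dipeptides, the parsed values of the full table should sum to roughly 1000, which makes a convenient sanity check.

```python
import re

ENTRY = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?"           # optional repeated value before the label
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"   # first and second residue, three-letter codes
    r":\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)


def parse_entries(lines):
    """Map (first, second) residue codes to (per mille value, error)."""
    table = {}
    for line in lines:
        match = ENTRY.match(line)
        if match:  # row labels such as "Ala" do not match and are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


# A few lines copied from the table above, in both accepted forms.
sample = [
    "Ala",
    "6.543AlaAla: 6.543 ± 0.47",
    "AlaCys: 0.558 ± 0.087",
    "0.0AlaXaa: 0.0 ± 0.0",
]
parsed = parse_entries(sample)
```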

Note: The error has been estimated by bootstrapping (x100) at the protein level.
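A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is reported. This is only an illustration under assumptions: the source does not say which spread statistic is used (the sample standard deviation is assumed here), and the function names and toy sequences are made up.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `dipeptide` over `n_rounds` protein-level bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    totals = [sum(c.values()) for c in per_protein]
    indices = list(range(len(per_protein)))
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins, not individual residues or dipeptides.
        picked = rng.choices(indices, k=len(indices))
        hits = sum(per_protein[i][dipeptide] for i in picked)
        total = sum(totals[i] for i in picked)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy one-letter sequences, not taken from the phage proteome.
toy = ["MKLKLA", "MAAAA", "AKLM"]
mean_kl, err_kl = bootstrap_per_mille(toy, "KL")
```

Resampling at the protein level keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.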

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski