Amino acid dipepetide frequency for Enterobacteria phage phi92

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.613AlaAla: 4.613 ± 0.412
0.659AlaCys: 0.659 ± 0.111
3.646AlaAsp: 3.646 ± 0.323
4.195AlaGlu: 4.195 ± 0.359
2.482AlaPhe: 2.482 ± 0.224
4.415AlaGly: 4.415 ± 0.257
1.23AlaHis: 1.23 ± 0.169
4.042AlaIle: 4.042 ± 0.301
5.535AlaLys: 5.535 ± 0.434
5.667AlaLeu: 5.667 ± 0.366
2.043AlaMet: 2.043 ± 0.24
3.163AlaAsn: 3.163 ± 0.273
1.779AlaPro: 1.779 ± 0.208
2.548AlaGln: 2.548 ± 0.226
2.592AlaArg: 2.592 ± 0.247
4.042AlaSer: 4.042 ± 0.296
4.547AlaThr: 4.547 ± 0.336
3.822AlaVal: 3.822 ± 0.29
1.098AlaTrp: 1.098 ± 0.207
2.965AlaTyr: 2.965 ± 0.276
0.0AlaXaa: 0.0 ± 0.0
Cys
0.988CysAla: 0.988 ± 0.158
0.242CysCys: 0.242 ± 0.078
0.901CysAsp: 0.901 ± 0.15
0.725CysGlu: 0.725 ± 0.138
1.01CysPhe: 1.01 ± 0.16
1.054CysGly: 1.054 ± 0.135
0.22CysHis: 0.22 ± 0.067
0.923CysIle: 0.923 ± 0.144
0.988CysLys: 0.988 ± 0.163
1.252CysLeu: 1.252 ± 0.167
0.198CysMet: 0.198 ± 0.068
0.637CysAsn: 0.637 ± 0.108
0.637CysPro: 0.637 ± 0.176
0.373CysGln: 0.373 ± 0.082
0.615CysArg: 0.615 ± 0.111
0.791CysSer: 0.791 ± 0.147
0.615CysThr: 0.615 ± 0.113
1.01CysVal: 1.01 ± 0.143
0.264CysTrp: 0.264 ± 0.058
0.747CysTyr: 0.747 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
3.888AspAla: 3.888 ± 0.268
0.615AspCys: 0.615 ± 0.113
3.822AspAsp: 3.822 ± 0.247
3.954AspGlu: 3.954 ± 0.26
3.229AspPhe: 3.229 ± 0.287
4.92AspGly: 4.92 ± 0.361
1.494AspHis: 1.494 ± 0.189
4.635AspIle: 4.635 ± 0.301
4.525AspLys: 4.525 ± 0.353
5.777AspLeu: 5.777 ± 0.36
1.757AspMet: 1.757 ± 0.179
3.844AspAsn: 3.844 ± 0.299
2.526AspPro: 2.526 ± 0.258
2.087AspGln: 2.087 ± 0.209
2.504AspArg: 2.504 ± 0.218
2.79AspSer: 2.79 ± 0.264
3.471AspThr: 3.471 ± 0.289
4.767AspVal: 4.767 ± 0.318
1.164AspTrp: 1.164 ± 0.174
3.558AspTyr: 3.558 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
4.854GluAla: 4.854 ± 0.332
1.12GluCys: 1.12 ± 0.159
5.865GluAsp: 5.865 ± 0.363
7.93GluGlu: 7.93 ± 0.593
2.812GluPhe: 2.812 ± 0.279
4.151GluGly: 4.151 ± 0.322
1.406GluHis: 1.406 ± 0.169
4.81GluIle: 4.81 ± 0.314
5.228GluLys: 5.228 ± 0.319
5.074GluLeu: 5.074 ± 0.315
2.284GluMet: 2.284 ± 0.201
3.646GluAsn: 3.646 ± 0.306
1.538GluPro: 1.538 ± 0.187
2.768GluGln: 2.768 ± 0.27
2.987GluArg: 2.987 ± 0.235
3.405GluSer: 3.405 ± 0.271
3.976GluThr: 3.976 ± 0.281
4.745GluVal: 4.745 ± 0.345
1.603GluTrp: 1.603 ± 0.19
2.702GluTyr: 2.702 ± 0.273
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 0.225
0.835PheCys: 0.835 ± 0.132
3.339PheAsp: 3.339 ± 0.301
2.702PheGlu: 2.702 ± 0.244
2.197PhePhe: 2.197 ± 0.241
2.943PheGly: 2.943 ± 0.282
0.659PheHis: 0.659 ± 0.117
2.834PheIle: 2.834 ± 0.287
3.361PheLys: 3.361 ± 0.256
3.624PheLeu: 3.624 ± 0.303
1.098PheMet: 1.098 ± 0.157
2.548PheAsn: 2.548 ± 0.205
1.34PhePro: 1.34 ± 0.18
1.296PheGln: 1.296 ± 0.174
1.713PheArg: 1.713 ± 0.204
3.295PheSer: 3.295 ± 0.278
2.768PheThr: 2.768 ± 0.254
3.207PheVal: 3.207 ± 0.303
0.659PheTrp: 0.659 ± 0.117
1.867PheTyr: 1.867 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
3.69GlyAla: 3.69 ± 0.297
1.076GlyCys: 1.076 ± 0.153
4.942GlyAsp: 4.942 ± 0.412
4.481GlyGlu: 4.481 ± 0.291
3.361GlyPhe: 3.361 ± 0.274
4.283GlyGly: 4.283 ± 0.487
1.208GlyHis: 1.208 ± 0.145
3.624GlyIle: 3.624 ± 0.238
4.876GlyLys: 4.876 ± 0.287
4.964GlyLeu: 4.964 ± 0.309
1.691GlyMet: 1.691 ± 0.196
3.449GlyAsn: 3.449 ± 0.283
0.395GlyPro: 0.395 ± 0.124
1.933GlyGln: 1.933 ± 0.215
2.416GlyArg: 2.416 ± 0.194
3.8GlySer: 3.8 ± 0.283
3.888GlyThr: 3.888 ± 0.408
5.535GlyVal: 5.535 ± 0.382
1.208GlyTrp: 1.208 ± 0.2
3.383GlyTyr: 3.383 ± 0.284
0.0GlyXaa: 0.0 ± 0.0
His
1.032HisAla: 1.032 ± 0.155
0.417HisCys: 0.417 ± 0.076
1.076HisAsp: 1.076 ± 0.153
0.747HisGlu: 0.747 ± 0.119
1.208HisPhe: 1.208 ± 0.159
1.34HisGly: 1.34 ± 0.198
0.264HisHis: 0.264 ± 0.079
1.23HisIle: 1.23 ± 0.19
1.208HisLys: 1.208 ± 0.167
1.516HisLeu: 1.516 ± 0.192
0.593HisMet: 0.593 ± 0.112
1.142HisAsn: 1.142 ± 0.185
0.923HisPro: 0.923 ± 0.129
0.395HisGln: 0.395 ± 0.098
1.054HisArg: 1.054 ± 0.161
0.879HisSer: 0.879 ± 0.131
1.12HisThr: 1.12 ± 0.148
1.274HisVal: 1.274 ± 0.164
0.351HisTrp: 0.351 ± 0.087
1.098HisTyr: 1.098 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
4.481IleAla: 4.481 ± 0.291
1.01IleCys: 1.01 ± 0.158
4.239IleAsp: 4.239 ± 0.32
4.788IleGlu: 4.788 ± 0.335
2.306IlePhe: 2.306 ± 0.22
3.449IleGly: 3.449 ± 0.329
1.142IleHis: 1.142 ± 0.169
3.888IleIle: 3.888 ± 0.312
3.998IleLys: 3.998 ± 0.26
5.008IleLeu: 5.008 ± 0.354
1.274IleMet: 1.274 ± 0.164
3.932IleAsn: 3.932 ± 0.309
2.965IlePro: 2.965 ± 0.258
2.306IleGln: 2.306 ± 0.23
2.372IleArg: 2.372 ± 0.199
4.02IleSer: 4.02 ± 0.328
4.459IleThr: 4.459 ± 0.265
5.272IleVal: 5.272 ± 0.332
0.571IleTrp: 0.571 ± 0.127
2.592IleTyr: 2.592 ± 0.242
0.0IleXaa: 0.0 ± 0.0
Lys
5.404LysAla: 5.404 ± 0.356
0.813LysCys: 0.813 ± 0.136
4.635LysAsp: 4.635 ± 0.351
6.48LysGlu: 6.48 ± 0.413
2.482LysPhe: 2.482 ± 0.272
4.503LysGly: 4.503 ± 0.477
1.757LysHis: 1.757 ± 0.197
4.283LysIle: 4.283 ± 0.241
4.283LysLys: 4.283 ± 0.363
5.25LysLeu: 5.25 ± 0.307
2.35LysMet: 2.35 ± 0.214
3.471LysAsn: 3.471 ± 0.3
1.977LysPro: 1.977 ± 0.225
2.746LysGln: 2.746 ± 0.283
3.427LysArg: 3.427 ± 0.334
3.646LysSer: 3.646 ± 0.291
3.954LysThr: 3.954 ± 0.271
4.679LysVal: 4.679 ± 0.344
1.23LysTrp: 1.23 ± 0.153
3.141LysTyr: 3.141 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
5.601LeuAla: 5.601 ± 0.323
1.142LeuCys: 1.142 ± 0.176
5.36LeuAsp: 5.36 ± 0.352
5.909LeuGlu: 5.909 ± 0.412
3.207LeuPhe: 3.207 ± 0.308
4.832LeuGly: 4.832 ± 0.329
1.56LeuHis: 1.56 ± 0.2
4.613LeuIle: 4.613 ± 0.29
6.041LeuLys: 6.041 ± 0.355
6.48LeuLeu: 6.48 ± 0.368
1.801LeuMet: 1.801 ± 0.203
4.086LeuAsn: 4.086 ± 0.312
3.273LeuPro: 3.273 ± 0.278
2.702LeuGln: 2.702 ± 0.223
3.514LeuArg: 3.514 ± 0.352
5.821LeuSer: 5.821 ± 0.334
5.557LeuThr: 5.557 ± 0.327
5.184LeuVal: 5.184 ± 0.307
0.988LeuTrp: 0.988 ± 0.123
3.361LeuTyr: 3.361 ± 0.256
0.0LeuXaa: 0.0 ± 0.0
Met
1.911MetAla: 1.911 ± 0.207
0.417MetCys: 0.417 ± 0.094
1.076MetAsp: 1.076 ± 0.121
2.131MetGlu: 2.131 ± 0.218
1.208MetPhe: 1.208 ± 0.173
1.516MetGly: 1.516 ± 0.176
0.505MetHis: 0.505 ± 0.108
2.065MetIle: 2.065 ± 0.189
2.021MetLys: 2.021 ± 0.199
2.438MetLeu: 2.438 ± 0.229
0.659MetMet: 0.659 ± 0.129
0.923MetAsn: 0.923 ± 0.165
0.813MetPro: 0.813 ± 0.153
0.901MetGln: 0.901 ± 0.152
1.01MetArg: 1.01 ± 0.132
2.087MetSer: 2.087 ± 0.231
1.603MetThr: 1.603 ± 0.173
1.691MetVal: 1.691 ± 0.201
0.329MetTrp: 0.329 ± 0.097
0.945MetTyr: 0.945 ± 0.127
0.0MetXaa: 0.0 ± 0.0
Asn
3.097AsnAla: 3.097 ± 0.242
0.637AsnCys: 0.637 ± 0.106
2.284AsnAsp: 2.284 ± 0.25
2.834AsnGlu: 2.834 ± 0.304
2.658AsnPhe: 2.658 ± 0.244
4.064AsnGly: 4.064 ± 0.304
0.945AsnHis: 0.945 ± 0.126
4.569AsnIle: 4.569 ± 0.278
3.8AsnLys: 3.8 ± 0.298
3.888AsnLeu: 3.888 ± 0.256
1.669AsnMet: 1.669 ± 0.216
2.943AsnAsn: 2.943 ± 0.222
2.46AsnPro: 2.46 ± 0.257
1.406AsnGln: 1.406 ± 0.166
2.021AsnArg: 2.021 ± 0.213
3.229AsnSer: 3.229 ± 0.293
3.954AsnThr: 3.954 ± 0.296
3.339AsnVal: 3.339 ± 0.253
0.615AsnTrp: 0.615 ± 0.121
2.394AsnTyr: 2.394 ± 0.209
0.0AsnXaa: 0.0 ± 0.0
Pro
2.24ProAla: 2.24 ± 0.23
0.483ProCys: 0.483 ± 0.102
2.438ProAsp: 2.438 ± 0.23
3.273ProGlu: 3.273 ± 0.277
1.801ProPhe: 1.801 ± 0.168
1.845ProGly: 1.845 ± 0.211
0.549ProHis: 0.549 ± 0.12
1.735ProIle: 1.735 ± 0.182
2.087ProLys: 2.087 ± 0.214
1.955ProLeu: 1.955 ± 0.209
0.769ProMet: 0.769 ± 0.13
2.065ProAsn: 2.065 ± 0.216
0.835ProPro: 0.835 ± 0.132
0.703ProGln: 0.703 ± 0.124
1.098ProArg: 1.098 ± 0.168
2.482ProSer: 2.482 ± 0.218
2.372ProThr: 2.372 ± 0.255
2.79ProVal: 2.79 ± 0.261
0.395ProTrp: 0.395 ± 0.102
1.889ProTyr: 1.889 ± 0.221
0.0ProXaa: 0.0 ± 0.0
Gln
2.043GlnAla: 2.043 ± 0.219
0.461GlnCys: 0.461 ± 0.106
2.065GlnAsp: 2.065 ± 0.189
2.79GlnGlu: 2.79 ± 0.249
1.45GlnPhe: 1.45 ± 0.149
2.262GlnGly: 2.262 ± 0.253
0.615GlnHis: 0.615 ± 0.109
2.284GlnIle: 2.284 ± 0.206
1.955GlnLys: 1.955 ± 0.22
2.394GlnLeu: 2.394 ± 0.225
1.186GlnMet: 1.186 ± 0.146
1.428GlnAsn: 1.428 ± 0.173
1.01GlnPro: 1.01 ± 0.136
1.34GlnGln: 1.34 ± 0.168
1.274GlnArg: 1.274 ± 0.182
1.625GlnSer: 1.625 ± 0.187
1.779GlnThr: 1.779 ± 0.225
2.153GlnVal: 2.153 ± 0.218
0.615GlnTrp: 0.615 ± 0.111
1.823GlnTyr: 1.823 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
2.306ArgAla: 2.306 ± 0.251
0.725ArgCys: 0.725 ± 0.124
2.943ArgAsp: 2.943 ± 0.211
2.636ArgGlu: 2.636 ± 0.239
1.582ArgPhe: 1.582 ± 0.176
2.416ArgGly: 2.416 ± 0.252
0.681ArgHis: 0.681 ± 0.129
2.768ArgIle: 2.768 ± 0.221
3.053ArgLys: 3.053 ± 0.266
3.383ArgLeu: 3.383 ± 0.303
1.186ArgMet: 1.186 ± 0.139
2.197ArgAsn: 2.197 ± 0.205
1.208ArgPro: 1.208 ± 0.157
1.472ArgGln: 1.472 ± 0.245
1.757ArgArg: 1.757 ± 0.206
2.482ArgSer: 2.482 ± 0.206
2.043ArgThr: 2.043 ± 0.195
2.614ArgVal: 2.614 ± 0.226
0.505ArgTrp: 0.505 ± 0.108
1.801ArgTyr: 1.801 ± 0.168
0.0ArgXaa: 0.0 ± 0.0
Ser
4.108SerAla: 4.108 ± 0.365
0.725SerCys: 0.725 ± 0.114
3.383SerAsp: 3.383 ± 0.258
3.756SerGlu: 3.756 ± 0.299
3.009SerPhe: 3.009 ± 0.241
4.371SerGly: 4.371 ± 0.377
1.032SerHis: 1.032 ± 0.156
3.427SerIle: 3.427 ± 0.287
4.459SerLys: 4.459 ± 0.286
4.964SerLeu: 4.964 ± 0.367
1.494SerMet: 1.494 ± 0.165
3.295SerAsn: 3.295 ± 0.264
2.438SerPro: 2.438 ± 0.242
1.582SerGln: 1.582 ± 0.191
2.438SerArg: 2.438 ± 0.217
3.822SerSer: 3.822 ± 0.354
3.558SerThr: 3.558 ± 0.287
4.064SerVal: 4.064 ± 0.302
0.791SerTrp: 0.791 ± 0.123
2.768SerTyr: 2.768 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
3.976ThrAla: 3.976 ± 0.365
0.747ThrCys: 0.747 ± 0.128
3.207ThrAsp: 3.207 ± 0.323
4.635ThrGlu: 4.635 ± 0.321
3.229ThrPhe: 3.229 ± 0.271
4.305ThrGly: 4.305 ± 0.361
1.032ThrHis: 1.032 ± 0.143
3.976ThrIle: 3.976 ± 0.277
4.173ThrLys: 4.173 ± 0.276
5.777ThrLeu: 5.777 ± 0.361
1.252ThrMet: 1.252 ± 0.16
2.812ThrAsn: 2.812 ± 0.265
2.943ThrPro: 2.943 ± 0.206
2.109ThrGln: 2.109 ± 0.186
1.999ThrArg: 1.999 ± 0.183
3.075ThrSer: 3.075 ± 0.265
3.558ThrThr: 3.558 ± 0.381
4.92ThrVal: 4.92 ± 0.376
1.01ThrTrp: 1.01 ± 0.132
2.812ThrTyr: 2.812 ± 0.226
0.0ThrXaa: 0.0 ± 0.0
Val
4.042ValAla: 4.042 ± 0.29
1.054ValCys: 1.054 ± 0.156
5.162ValAsp: 5.162 ± 0.317
4.613ValGlu: 4.613 ± 0.388
2.921ValPhe: 2.921 ± 0.29
3.954ValGly: 3.954 ± 0.268
1.34ValHis: 1.34 ± 0.164
5.008ValIle: 5.008 ± 0.376
5.25ValLys: 5.25 ± 0.334
5.711ValLeu: 5.711 ± 0.323
1.45ValMet: 1.45 ± 0.151
3.998ValAsn: 3.998 ± 0.295
2.526ValPro: 2.526 ± 0.271
1.911ValGln: 1.911 ± 0.236
2.526ValArg: 2.526 ± 0.228
4.657ValSer: 4.657 ± 0.368
4.767ValThr: 4.767 ± 0.38
5.579ValVal: 5.579 ± 0.462
0.835ValTrp: 0.835 ± 0.148
3.624ValTyr: 3.624 ± 0.3
0.0ValXaa: 0.0 ± 0.0
Trp
0.571TrpAla: 0.571 ± 0.113
0.242TrpCys: 0.242 ± 0.074
1.428TrpAsp: 1.428 ± 0.175
1.23TrpGlu: 1.23 ± 0.166
0.988TrpPhe: 0.988 ± 0.144
0.747TrpGly: 0.747 ± 0.118
0.417TrpHis: 0.417 ± 0.082
1.032TrpIle: 1.032 ± 0.147
0.966TrpLys: 0.966 ± 0.179
1.647TrpLeu: 1.647 ± 0.206
0.417TrpMet: 0.417 ± 0.106
0.791TrpAsn: 0.791 ± 0.139
0.264TrpPro: 0.264 ± 0.079
0.417TrpGln: 0.417 ± 0.096
0.593TrpArg: 0.593 ± 0.108
0.879TrpSer: 0.879 ± 0.133
0.703TrpThr: 0.703 ± 0.116
1.054TrpVal: 1.054 ± 0.16
0.308TrpTrp: 0.308 ± 0.077
0.659TrpTyr: 0.659 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.427TyrAla: 3.427 ± 0.265
0.769TyrCys: 0.769 ± 0.134
3.514TyrAsp: 3.514 ± 0.301
3.119TyrGlu: 3.119 ± 0.265
1.955TyrPhe: 1.955 ± 0.228
2.834TyrGly: 2.834 ± 0.23
0.813TyrHis: 0.813 ± 0.133
2.328TyrIle: 2.328 ± 0.225
2.702TyrLys: 2.702 ± 0.279
4.261TyrLeu: 4.261 ± 0.317
1.054TyrMet: 1.054 ± 0.138
2.438TyrAsn: 2.438 ± 0.248
1.955TyrPro: 1.955 ± 0.193
1.56TyrGln: 1.56 ± 0.189
1.845TyrArg: 1.845 ± 0.182
2.658TyrSer: 2.658 ± 0.271
2.856TyrThr: 2.856 ± 0.23
3.229TyrVal: 3.229 ± 0.252
0.813TyrTrp: 0.813 ± 0.107
2.109TyrTyr: 2.109 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 250 proteins (45527 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski