Amino acid dipepetide frequency for Vibrio phage ICP1_2006_D

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.498AlaAla: 3.498 ± 0.424
0.813AlaCys: 0.813 ± 0.117
2.82AlaAsp: 2.82 ± 0.257
3.552AlaGlu: 3.552 ± 0.301
2.142AlaPhe: 2.142 ± 0.195
2.847AlaGly: 2.847 ± 0.319
1.003AlaHis: 1.003 ± 0.155
3.66AlaIle: 3.66 ± 0.322
3.877AlaLys: 3.877 ± 0.382
4.718AlaLeu: 4.718 ± 0.425
1.193AlaMet: 1.193 ± 0.212
2.305AlaAsn: 2.305 ± 0.268
1.247AlaPro: 1.247 ± 0.21
1.573AlaGln: 1.573 ± 0.223
1.844AlaArg: 1.844 ± 0.27
2.982AlaSer: 2.982 ± 0.35
2.82AlaThr: 2.82 ± 0.282
2.766AlaVal: 2.766 ± 0.268
0.895AlaTrp: 0.895 ± 0.157
1.979AlaTyr: 1.979 ± 0.224
0.0AlaXaa: 0.0 ± 0.0
Cys
0.813CysAla: 0.813 ± 0.127
0.407CysCys: 0.407 ± 0.109
0.895CysAsp: 0.895 ± 0.16
1.6CysGlu: 1.6 ± 0.216
0.868CysPhe: 0.868 ± 0.154
1.762CysGly: 1.762 ± 0.271
0.407CysHis: 0.407 ± 0.106
1.057CysIle: 1.057 ± 0.157
1.817CysLys: 1.817 ± 0.251
0.84CysLeu: 0.84 ± 0.154
0.542CysMet: 0.542 ± 0.122
0.949CysAsn: 0.949 ± 0.165
0.705CysPro: 0.705 ± 0.153
0.569CysGln: 0.569 ± 0.12
0.868CysArg: 0.868 ± 0.129
1.301CysSer: 1.301 ± 0.291
0.976CysThr: 0.976 ± 0.157
1.247CysVal: 1.247 ± 0.167
0.298CysTrp: 0.298 ± 0.085
0.786CysTyr: 0.786 ± 0.151
0.0CysXaa: 0.0 ± 0.0
Asp
2.549AspAla: 2.549 ± 0.26
1.437AspCys: 1.437 ± 0.17
3.335AspAsp: 3.335 ± 0.4
4.609AspGlu: 4.609 ± 0.374
3.118AspPhe: 3.118 ± 0.329
4.799AspGly: 4.799 ± 0.328
1.085AspHis: 1.085 ± 0.199
4.609AspIle: 4.609 ± 0.358
5.667AspLys: 5.667 ± 0.479
5.721AspLeu: 5.721 ± 0.402
1.817AspMet: 1.817 ± 0.225
3.823AspAsn: 3.823 ± 0.302
1.762AspPro: 1.762 ± 0.227
1.789AspGln: 1.789 ± 0.244
2.766AspArg: 2.766 ± 0.257
3.714AspSer: 3.714 ± 0.346
3.443AspThr: 3.443 ± 0.322
4.121AspVal: 4.121 ± 0.316
1.166AspTrp: 1.166 ± 0.18
3.823AspTyr: 3.823 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
4.067GluAla: 4.067 ± 0.32
1.274GluCys: 1.274 ± 0.186
6.155GluAsp: 6.155 ± 0.354
7.781GluGlu: 7.781 ± 0.543
3.579GluPhe: 3.579 ± 0.283
4.962GluGly: 4.962 ± 0.349
1.329GluHis: 1.329 ± 0.185
4.935GluIle: 4.935 ± 0.418
5.314GluLys: 5.314 ± 0.364
6.778GluLeu: 6.778 ± 0.426
2.142GluMet: 2.142 ± 0.23
3.931GluAsn: 3.931 ± 0.34
1.545GluPro: 1.545 ± 0.234
2.576GluGln: 2.576 ± 0.269
3.226GluArg: 3.226 ± 0.362
4.772GluSer: 4.772 ± 0.392
3.01GluThr: 3.01 ± 0.265
5.721GluVal: 5.721 ± 0.408
1.329GluTrp: 1.329 ± 0.202
4.013GluTyr: 4.013 ± 0.293
0.0GluXaa: 0.0 ± 0.0
Phe
1.898PheAla: 1.898 ± 0.218
0.84PheCys: 0.84 ± 0.16
3.904PheAsp: 3.904 ± 0.328
3.606PheGlu: 3.606 ± 0.29
1.356PhePhe: 1.356 ± 0.19
2.793PheGly: 2.793 ± 0.295
0.895PheHis: 0.895 ± 0.136
2.874PheIle: 2.874 ± 0.309
3.389PheLys: 3.389 ± 0.342
3.145PheLeu: 3.145 ± 0.279
0.84PheMet: 0.84 ± 0.14
2.088PheAsn: 2.088 ± 0.222
1.464PhePro: 1.464 ± 0.241
1.356PheGln: 1.356 ± 0.213
1.708PheArg: 1.708 ± 0.221
2.874PheSer: 2.874 ± 0.253
3.01PheThr: 3.01 ± 0.273
2.982PheVal: 2.982 ± 0.238
0.705PheTrp: 0.705 ± 0.165
2.006PheTyr: 2.006 ± 0.222
0.0PheXaa: 0.0 ± 0.0
Gly
2.657GlyAla: 2.657 ± 0.266
1.383GlyCys: 1.383 ± 0.208
4.419GlyAsp: 4.419 ± 0.321
4.989GlyGlu: 4.989 ± 0.329
3.118GlyPhe: 3.118 ± 0.32
3.904GlyGly: 3.904 ± 0.341
0.651GlyHis: 0.651 ± 0.122
3.633GlyIle: 3.633 ± 0.269
5.395GlyLys: 5.395 ± 0.422
4.446GlyLeu: 4.446 ± 0.431
1.301GlyMet: 1.301 ± 0.166
3.47GlyAsn: 3.47 ± 0.257
0.244GlyPro: 0.244 ± 0.083
1.735GlyGln: 1.735 ± 0.211
2.603GlyArg: 2.603 ± 0.293
3.742GlySer: 3.742 ± 0.372
3.498GlyThr: 3.498 ± 0.29
5.45GlyVal: 5.45 ± 0.459
1.545GlyTrp: 1.545 ± 0.172
3.823GlyTyr: 3.823 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.678HisAla: 0.678 ± 0.114
0.515HisCys: 0.515 ± 0.098
0.922HisAsp: 0.922 ± 0.146
1.356HisGlu: 1.356 ± 0.177
0.895HisPhe: 0.895 ± 0.165
1.003HisGly: 1.003 ± 0.145
0.542HisHis: 0.542 ± 0.119
1.545HisIle: 1.545 ± 0.203
1.979HisLys: 1.979 ± 0.239
1.383HisLeu: 1.383 ± 0.177
0.569HisMet: 0.569 ± 0.119
0.976HisAsn: 0.976 ± 0.154
0.705HisPro: 0.705 ± 0.142
0.895HisGln: 0.895 ± 0.182
1.003HisArg: 1.003 ± 0.169
1.057HisSer: 1.057 ± 0.172
1.518HisThr: 1.518 ± 0.195
0.949HisVal: 0.949 ± 0.154
0.244HisTrp: 0.244 ± 0.075
0.895HisTyr: 0.895 ± 0.15
0.0HisXaa: 0.0 ± 0.0
Ile
3.308IleAla: 3.308 ± 0.309
0.895IleCys: 0.895 ± 0.15
4.962IleAsp: 4.962 ± 0.351
4.555IleGlu: 4.555 ± 0.322
2.277IlePhe: 2.277 ± 0.252
3.552IleGly: 3.552 ± 0.334
1.139IleHis: 1.139 ± 0.183
4.528IleIle: 4.528 ± 0.357
5.883IleLys: 5.883 ± 0.393
4.989IleLeu: 4.989 ± 0.341
1.22IleMet: 1.22 ± 0.189
3.525IleAsn: 3.525 ± 0.287
2.549IlePro: 2.549 ± 0.28
1.789IleGln: 1.789 ± 0.233
3.037IleArg: 3.037 ± 0.288
3.742IleSer: 3.742 ± 0.301
3.66IleThr: 3.66 ± 0.407
4.338IleVal: 4.338 ± 0.378
0.488IleTrp: 0.488 ± 0.111
2.901IleTyr: 2.901 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
4.067LysAla: 4.067 ± 0.38
1.437LysCys: 1.437 ± 0.221
5.667LysAsp: 5.667 ± 0.385
6.453LysGlu: 6.453 ± 0.395
3.226LysPhe: 3.226 ± 0.342
5.423LysGly: 5.423 ± 0.43
2.142LysHis: 2.142 ± 0.189
4.962LysIle: 4.962 ± 0.374
4.663LysLys: 4.663 ± 0.333
7.537LysLeu: 7.537 ± 0.511
2.305LysMet: 2.305 ± 0.254
3.416LysAsn: 3.416 ± 0.266
2.033LysPro: 2.033 ± 0.216
3.226LysGln: 3.226 ± 0.358
4.094LysArg: 4.094 ± 0.335
5.45LysSer: 5.45 ± 0.397
3.606LysThr: 3.606 ± 0.325
5.694LysVal: 5.694 ± 0.319
1.03LysTrp: 1.03 ± 0.171
4.175LysTyr: 4.175 ± 0.361
0.0LysXaa: 0.0 ± 0.0
Leu
4.392LeuAla: 4.392 ± 0.42
1.573LeuCys: 1.573 ± 0.174
5.856LeuAsp: 5.856 ± 0.429
7.158LeuGlu: 7.158 ± 0.428
3.145LeuPhe: 3.145 ± 0.282
5.395LeuGly: 5.395 ± 0.482
1.545LeuHis: 1.545 ± 0.215
4.148LeuIle: 4.148 ± 0.32
6.643LeuLys: 6.643 ± 0.434
6.995LeuLeu: 6.995 ± 0.48
2.142LeuMet: 2.142 ± 0.235
4.501LeuAsn: 4.501 ± 0.29
2.847LeuPro: 2.847 ± 0.345
2.738LeuGln: 2.738 ± 0.258
3.796LeuArg: 3.796 ± 0.321
6.86LeuSer: 6.86 ± 0.462
4.555LeuThr: 4.555 ± 0.429
5.368LeuVal: 5.368 ± 0.377
1.301LeuTrp: 1.301 ± 0.185
3.714LeuTyr: 3.714 ± 0.329
0.0LeuXaa: 0.0 ± 0.0
Met
1.735MetAla: 1.735 ± 0.24
0.542MetCys: 0.542 ± 0.149
1.274MetAsp: 1.274 ± 0.201
1.6MetGlu: 1.6 ± 0.215
1.003MetPhe: 1.003 ± 0.155
1.057MetGly: 1.057 ± 0.161
0.434MetHis: 0.434 ± 0.109
1.898MetIle: 1.898 ± 0.214
2.521MetLys: 2.521 ± 0.226
1.979MetLeu: 1.979 ± 0.298
0.759MetMet: 0.759 ± 0.153
1.573MetAsn: 1.573 ± 0.219
0.705MetPro: 0.705 ± 0.124
1.329MetGln: 1.329 ± 0.187
1.22MetArg: 1.22 ± 0.168
2.115MetSer: 2.115 ± 0.222
1.491MetThr: 1.491 ± 0.222
1.274MetVal: 1.274 ± 0.169
0.136MetTrp: 0.136 ± 0.059
1.003MetTyr: 1.003 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
2.115AsnAla: 2.115 ± 0.258
0.569AsnCys: 0.569 ± 0.117
3.01AsnAsp: 3.01 ± 0.222
2.82AsnGlu: 2.82 ± 0.257
2.44AsnPhe: 2.44 ± 0.245
3.958AsnGly: 3.958 ± 0.311
1.437AsnHis: 1.437 ± 0.194
3.986AsnIle: 3.986 ± 0.367
4.528AsnLys: 4.528 ± 0.386
5.016AsnLeu: 5.016 ± 0.41
1.301AsnMet: 1.301 ± 0.195
3.172AsnAsn: 3.172 ± 0.317
2.25AsnPro: 2.25 ± 0.208
2.033AsnGln: 2.033 ± 0.253
1.871AsnArg: 1.871 ± 0.279
3.226AsnSer: 3.226 ± 0.282
3.199AsnThr: 3.199 ± 0.3
2.576AsnVal: 2.576 ± 0.25
0.759AsnTrp: 0.759 ± 0.155
2.44AsnTyr: 2.44 ± 0.272
0.0AsnXaa: 0.0 ± 0.0
Pro
1.6ProAla: 1.6 ± 0.221
0.596ProCys: 0.596 ± 0.119
2.196ProAsp: 2.196 ± 0.226
2.82ProGlu: 2.82 ± 0.241
1.545ProPhe: 1.545 ± 0.203
0.081ProGly: 0.081 ± 0.054
1.057ProHis: 1.057 ± 0.163
1.437ProIle: 1.437 ± 0.22
2.115ProLys: 2.115 ± 0.278
1.952ProLeu: 1.952 ± 0.251
0.84ProMet: 0.84 ± 0.132
1.383ProAsn: 1.383 ± 0.214
0.895ProPro: 0.895 ± 0.194
1.193ProGln: 1.193 ± 0.209
1.003ProArg: 1.003 ± 0.179
1.925ProSer: 1.925 ± 0.231
1.871ProThr: 1.871 ± 0.189
2.25ProVal: 2.25 ± 0.264
0.542ProTrp: 0.542 ± 0.103
1.41ProTyr: 1.41 ± 0.211
0.0ProXaa: 0.0 ± 0.0
Gln
1.844GlnAla: 1.844 ± 0.26
0.515GlnCys: 0.515 ± 0.098
2.033GlnAsp: 2.033 ± 0.211
3.498GlnGlu: 3.498 ± 0.317
1.057GlnPhe: 1.057 ± 0.17
2.033GlnGly: 2.033 ± 0.236
0.813GlnHis: 0.813 ± 0.146
1.925GlnIle: 1.925 ± 0.199
2.576GlnLys: 2.576 ± 0.281
2.684GlnLeu: 2.684 ± 0.281
0.895GlnMet: 0.895 ± 0.167
1.654GlnAsn: 1.654 ± 0.188
0.868GlnPro: 0.868 ± 0.155
1.627GlnGln: 1.627 ± 0.227
1.762GlnArg: 1.762 ± 0.231
2.277GlnSer: 2.277 ± 0.21
1.627GlnThr: 1.627 ± 0.227
2.305GlnVal: 2.305 ± 0.266
0.434GlnTrp: 0.434 ± 0.117
1.6GlnTyr: 1.6 ± 0.262
0.0GlnXaa: 0.0 ± 0.0
Arg
1.817ArgAla: 1.817 ± 0.234
0.895ArgCys: 0.895 ± 0.174
2.386ArgAsp: 2.386 ± 0.306
3.308ArgGlu: 3.308 ± 0.306
2.142ArgPhe: 2.142 ± 0.229
2.847ArgGly: 2.847 ± 0.284
0.732ArgHis: 0.732 ± 0.137
2.82ArgIle: 2.82 ± 0.278
3.606ArgLys: 3.606 ± 0.336
4.067ArgLeu: 4.067 ± 0.335
1.22ArgMet: 1.22 ± 0.186
2.25ArgAsn: 2.25 ± 0.224
1.139ArgPro: 1.139 ± 0.181
1.627ArgGln: 1.627 ± 0.174
1.627ArgArg: 1.627 ± 0.209
2.386ArgSer: 2.386 ± 0.241
2.033ArgThr: 2.033 ± 0.207
2.901ArgVal: 2.901 ± 0.263
0.732ArgTrp: 0.732 ± 0.133
2.169ArgTyr: 2.169 ± 0.269
0.0ArgXaa: 0.0 ± 0.0
Ser
2.901SerAla: 2.901 ± 0.298
1.057SerCys: 1.057 ± 0.191
4.013SerAsp: 4.013 ± 0.324
5.151SerGlu: 5.151 ± 0.346
3.118SerPhe: 3.118 ± 0.326
4.663SerGly: 4.663 ± 0.376
0.976SerHis: 0.976 ± 0.164
3.66SerIle: 3.66 ± 0.319
5.856SerLys: 5.856 ± 0.397
5.775SerLeu: 5.775 ± 0.418
1.491SerMet: 1.491 ± 0.174
3.47SerAsn: 3.47 ± 0.275
1.6SerPro: 1.6 ± 0.155
1.871SerGln: 1.871 ± 0.221
2.711SerArg: 2.711 ± 0.272
4.311SerSer: 4.311 ± 0.445
3.118SerThr: 3.118 ± 0.312
4.148SerVal: 4.148 ± 0.365
1.166SerTrp: 1.166 ± 0.185
3.362SerTyr: 3.362 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
3.172ThrAla: 3.172 ± 0.344
0.895ThrCys: 0.895 ± 0.17
2.277ThrAsp: 2.277 ± 0.268
3.579ThrGlu: 3.579 ± 0.355
2.874ThrPhe: 2.874 ± 0.264
3.769ThrGly: 3.769 ± 0.348
1.085ThrHis: 1.085 ± 0.155
4.148ThrIle: 4.148 ± 0.368
4.121ThrLys: 4.121 ± 0.344
5.504ThrLeu: 5.504 ± 0.356
1.437ThrMet: 1.437 ± 0.232
2.982ThrAsn: 2.982 ± 0.243
2.684ThrPro: 2.684 ± 0.261
1.573ThrGln: 1.573 ± 0.218
2.169ThrArg: 2.169 ± 0.252
3.416ThrSer: 3.416 ± 0.301
3.498ThrThr: 3.498 ± 0.354
3.687ThrVal: 3.687 ± 0.301
0.678ThrTrp: 0.678 ± 0.115
2.305ThrTyr: 2.305 ± 0.214
0.0ThrXaa: 0.0 ± 0.0
Val
3.254ValAla: 3.254 ± 0.277
1.383ValCys: 1.383 ± 0.24
4.691ValAsp: 4.691 ± 0.311
5.639ValGlu: 5.639 ± 0.412
2.793ValPhe: 2.793 ± 0.275
3.85ValGly: 3.85 ± 0.311
0.868ValHis: 0.868 ± 0.14
3.796ValIle: 3.796 ± 0.325
5.423ValLys: 5.423 ± 0.426
5.395ValLeu: 5.395 ± 0.293
1.817ValMet: 1.817 ± 0.236
3.823ValAsn: 3.823 ± 0.329
1.654ValPro: 1.654 ± 0.186
2.277ValGln: 2.277 ± 0.268
2.603ValArg: 2.603 ± 0.242
4.067ValSer: 4.067 ± 0.318
4.528ValThr: 4.528 ± 0.398
5.26ValVal: 5.26 ± 0.375
0.895ValTrp: 0.895 ± 0.163
3.199ValTyr: 3.199 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
0.325TrpAla: 0.325 ± 0.081
0.407TrpCys: 0.407 ± 0.102
1.193TrpAsp: 1.193 ± 0.201
1.22TrpGlu: 1.22 ± 0.206
0.542TrpPhe: 0.542 ± 0.119
0.515TrpGly: 0.515 ± 0.105
0.217TrpHis: 0.217 ± 0.08
0.813TrpIle: 0.813 ± 0.154
1.545TrpLys: 1.545 ± 0.203
1.464TrpLeu: 1.464 ± 0.223
0.596TrpMet: 0.596 ± 0.125
0.759TrpAsn: 0.759 ± 0.159
0.244TrpPro: 0.244 ± 0.073
0.569TrpGln: 0.569 ± 0.112
0.515TrpArg: 0.515 ± 0.118
0.976TrpSer: 0.976 ± 0.165
0.786TrpThr: 0.786 ± 0.159
1.491TrpVal: 1.491 ± 0.212
0.217TrpTrp: 0.217 ± 0.06
0.732TrpTyr: 0.732 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.844TyrAla: 1.844 ± 0.221
1.274TyrCys: 1.274 ± 0.188
3.145TyrAsp: 3.145 ± 0.301
3.145TyrGlu: 3.145 ± 0.257
2.494TyrPhe: 2.494 ± 0.264
2.793TyrGly: 2.793 ± 0.307
1.274TyrHis: 1.274 ± 0.169
2.901TyrIle: 2.901 ± 0.219
3.687TyrLys: 3.687 ± 0.373
4.311TyrLeu: 4.311 ± 0.355
1.193TyrMet: 1.193 ± 0.188
2.657TyrAsn: 2.657 ± 0.315
1.573TyrPro: 1.573 ± 0.189
1.681TyrGln: 1.681 ± 0.217
2.277TyrArg: 2.277 ± 0.235
3.226TyrSer: 3.226 ± 0.293
3.606TyrThr: 3.606 ± 0.311
2.684TyrVal: 2.684 ± 0.311
0.569TyrTrp: 0.569 ± 0.119
2.44TyrTyr: 2.44 ± 0.293
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 227 proteins (36884 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski