Amino acid dipepetide frequency for Vibrio phage YC

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.21AlaAla: 6.21 ± 0.445
0.666AlaCys: 0.666 ± 0.127
5.243AlaAsp: 5.243 ± 0.342
5.866AlaGlu: 5.866 ± 0.415
2.686AlaPhe: 2.686 ± 0.272
5.372AlaGly: 5.372 ± 0.383
1.697AlaHis: 1.697 ± 0.171
4.061AlaIle: 4.061 ± 0.378
5.049AlaLys: 5.049 ± 0.399
5.909AlaLeu: 5.909 ± 0.358
2.063AlaMet: 2.063 ± 0.236
3.889AlaAsn: 3.889 ± 0.319
2.686AlaPro: 2.686 ± 0.31
2.278AlaGln: 2.278 ± 0.224
2.965AlaArg: 2.965 ± 0.249
4.92AlaSer: 4.92 ± 0.318
4.254AlaThr: 4.254 ± 0.406
4.963AlaVal: 4.963 ± 0.353
0.817AlaTrp: 0.817 ± 0.136
2.514AlaTyr: 2.514 ± 0.213
0.0AlaXaa: 0.0 ± 0.0
Cys
0.602CysAla: 0.602 ± 0.116
0.129CysCys: 0.129 ± 0.05
0.688CysAsp: 0.688 ± 0.129
1.053CysGlu: 1.053 ± 0.187
0.387CysPhe: 0.387 ± 0.094
1.01CysGly: 1.01 ± 0.182
0.451CysHis: 0.451 ± 0.105
0.494CysIle: 0.494 ± 0.114
0.602CysLys: 0.602 ± 0.102
0.645CysLeu: 0.645 ± 0.125
0.15CysMet: 0.15 ± 0.063
0.559CysAsn: 0.559 ± 0.116
0.494CysPro: 0.494 ± 0.095
0.58CysGln: 0.58 ± 0.117
0.602CysArg: 0.602 ± 0.121
0.838CysSer: 0.838 ± 0.169
0.602CysThr: 0.602 ± 0.117
0.924CysVal: 0.924 ± 0.142
0.086CysTrp: 0.086 ± 0.046
0.365CysTyr: 0.365 ± 0.099
0.0CysXaa: 0.0 ± 0.0
Asp
4.641AspAla: 4.641 ± 0.319
1.074AspCys: 1.074 ± 0.13
3.975AspAsp: 3.975 ± 0.316
5.329AspGlu: 5.329 ± 0.4
2.793AspPhe: 2.793 ± 0.274
5.307AspGly: 5.307 ± 0.428
1.633AspHis: 1.633 ± 0.2
3.954AspIle: 3.954 ± 0.315
3.524AspLys: 3.524 ± 0.305
5.501AspLeu: 5.501 ± 0.37
2.342AspMet: 2.342 ± 0.255
3.567AspAsn: 3.567 ± 0.277
2.364AspPro: 2.364 ± 0.272
1.977AspGln: 1.977 ± 0.191
3.395AspArg: 3.395 ± 0.263
3.868AspSer: 3.868 ± 0.278
3.438AspThr: 3.438 ± 0.292
4.856AspVal: 4.856 ± 0.327
0.924AspTrp: 0.924 ± 0.13
2.729AspTyr: 2.729 ± 0.271
0.0AspXaa: 0.0 ± 0.0
Glu
6.339GluAla: 6.339 ± 0.545
0.967GluCys: 0.967 ± 0.197
4.426GluAsp: 4.426 ± 0.352
4.663GluGlu: 4.663 ± 0.396
2.793GluPhe: 2.793 ± 0.221
5.221GluGly: 5.221 ± 0.351
1.826GluHis: 1.826 ± 0.232
4.125GluIle: 4.125 ± 0.279
3.674GluLys: 3.674 ± 0.363
6.962GluLeu: 6.962 ± 0.465
2.729GluMet: 2.729 ± 0.294
3.33GluAsn: 3.33 ± 0.258
1.998GluPro: 1.998 ± 0.246
2.299GluGln: 2.299 ± 0.25
3.159GluArg: 3.159 ± 0.293
3.631GluSer: 3.631 ± 0.35
4.125GluThr: 4.125 ± 0.277
5.006GluVal: 5.006 ± 0.367
1.246GluTrp: 1.246 ± 0.168
3.245GluTyr: 3.245 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
2.235PheAla: 2.235 ± 0.22
0.537PheCys: 0.537 ± 0.111
3.008PheAsp: 3.008 ± 0.305
2.299PheGlu: 2.299 ± 0.207
1.031PhePhe: 1.031 ± 0.147
2.041PheGly: 2.041 ± 0.23
0.795PheHis: 0.795 ± 0.115
2.364PheIle: 2.364 ± 0.228
2.643PheLys: 2.643 ± 0.287
2.492PheLeu: 2.492 ± 0.234
1.311PheMet: 1.311 ± 0.175
1.719PheAsn: 1.719 ± 0.208
1.375PhePro: 1.375 ± 0.178
1.117PheGln: 1.117 ± 0.175
2.106PheArg: 2.106 ± 0.21
2.836PheSer: 2.836 ± 0.243
2.987PheThr: 2.987 ± 0.232
2.342PheVal: 2.342 ± 0.251
0.322PheTrp: 0.322 ± 0.105
1.01PheTyr: 1.01 ± 0.157
0.0PheXaa: 0.0 ± 0.0
Gly
4.577GlyAla: 4.577 ± 0.373
0.709GlyCys: 0.709 ± 0.145
5.458GlyAsp: 5.458 ± 0.312
4.985GlyGlu: 4.985 ± 0.339
2.664GlyPhe: 2.664 ± 0.219
4.856GlyGly: 4.856 ± 0.422
1.418GlyHis: 1.418 ± 0.166
3.868GlyIle: 3.868 ± 0.332
3.975GlyLys: 3.975 ± 0.29
5.716GlyLeu: 5.716 ± 0.3
1.783GlyMet: 1.783 ± 0.18
3.03GlyAsn: 3.03 ± 0.272
1.418GlyPro: 1.418 ± 0.156
2.664GlyGln: 2.664 ± 0.236
3.459GlyArg: 3.459 ± 0.257
4.663GlySer: 4.663 ± 0.388
3.696GlyThr: 3.696 ± 0.319
5.758GlyVal: 5.758 ± 0.406
1.074GlyTrp: 1.074 ± 0.158
2.578GlyTyr: 2.578 ± 0.258
0.0GlyXaa: 0.0 ± 0.0
His
1.289HisAla: 1.289 ± 0.164
0.279HisCys: 0.279 ± 0.081
0.881HisAsp: 0.881 ± 0.143
1.268HisGlu: 1.268 ± 0.153
0.752HisPhe: 0.752 ± 0.113
1.697HisGly: 1.697 ± 0.215
0.752HisHis: 0.752 ± 0.129
1.289HisIle: 1.289 ± 0.191
1.526HisLys: 1.526 ± 0.206
2.149HisLeu: 2.149 ± 0.23
0.924HisMet: 0.924 ± 0.154
1.117HisAsn: 1.117 ± 0.162
1.096HisPro: 1.096 ± 0.187
0.559HisGln: 0.559 ± 0.118
1.096HisArg: 1.096 ± 0.153
1.418HisSer: 1.418 ± 0.16
1.225HisThr: 1.225 ± 0.159
1.547HisVal: 1.547 ± 0.194
0.387HisTrp: 0.387 ± 0.094
1.246HisTyr: 1.246 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
3.825IleAla: 3.825 ± 0.304
0.602IleCys: 0.602 ± 0.115
4.426IleAsp: 4.426 ± 0.293
4.598IleGlu: 4.598 ± 0.331
1.74IlePhe: 1.74 ± 0.248
3.674IleGly: 3.674 ± 0.279
1.182IleHis: 1.182 ± 0.175
3.524IleIle: 3.524 ± 0.285
4.168IleLys: 4.168 ± 0.312
4.233IleLeu: 4.233 ± 0.271
1.311IleMet: 1.311 ± 0.152
3.803IleAsn: 3.803 ± 0.312
2.471IlePro: 2.471 ± 0.237
2.213IleGln: 2.213 ± 0.2
3.094IleArg: 3.094 ± 0.268
3.674IleSer: 3.674 ± 0.288
4.018IleThr: 4.018 ± 0.386
3.846IleVal: 3.846 ± 0.258
0.58IleTrp: 0.58 ± 0.116
1.547IleTyr: 1.547 ± 0.171
0.0IleXaa: 0.0 ± 0.0
Lys
5.522LysAla: 5.522 ± 0.468
0.623LysCys: 0.623 ± 0.148
3.911LysAsp: 3.911 ± 0.321
4.383LysGlu: 4.383 ± 0.347
2.106LysPhe: 2.106 ± 0.277
4.469LysGly: 4.469 ± 0.35
1.569LysHis: 1.569 ± 0.207
3.094LysIle: 3.094 ± 0.302
4.168LysLys: 4.168 ± 0.406
4.813LysLeu: 4.813 ± 0.373
2.02LysMet: 2.02 ± 0.206
2.02LysAsn: 2.02 ± 0.231
2.321LysPro: 2.321 ± 0.265
1.998LysGln: 1.998 ± 0.279
3.202LysArg: 3.202 ± 0.339
3.223LysSer: 3.223 ± 0.273
3.545LysThr: 3.545 ± 0.259
4.276LysVal: 4.276 ± 0.331
0.731LysTrp: 0.731 ± 0.126
2.407LysTyr: 2.407 ± 0.237
0.0LysXaa: 0.0 ± 0.0
Leu
6.446LeuAla: 6.446 ± 0.376
0.924LeuCys: 0.924 ± 0.159
5.436LeuAsp: 5.436 ± 0.336
6.059LeuGlu: 6.059 ± 0.352
2.793LeuPhe: 2.793 ± 0.255
5.93LeuGly: 5.93 ± 0.375
1.44LeuHis: 1.44 ± 0.164
4.297LeuIle: 4.297 ± 0.326
5.479LeuLys: 5.479 ± 0.439
5.909LeuLeu: 5.909 ± 0.407
2.643LeuMet: 2.643 ± 0.268
3.33LeuAsn: 3.33 ± 0.246
3.008LeuPro: 3.008 ± 0.266
2.578LeuGln: 2.578 ± 0.241
4.577LeuArg: 4.577 ± 0.346
5.178LeuSer: 5.178 ± 0.311
5.264LeuThr: 5.264 ± 0.342
5.823LeuVal: 5.823 ± 0.466
0.902LeuTrp: 0.902 ± 0.137
2.428LeuTyr: 2.428 ± 0.272
0.0LeuXaa: 0.0 ± 0.0
Met
2.321MetAla: 2.321 ± 0.29
0.408MetCys: 0.408 ± 0.099
1.783MetAsp: 1.783 ± 0.19
1.891MetGlu: 1.891 ± 0.213
0.902MetPhe: 0.902 ± 0.128
1.676MetGly: 1.676 ± 0.18
0.623MetHis: 0.623 ± 0.108
1.912MetIle: 1.912 ± 0.191
1.912MetLys: 1.912 ± 0.201
2.514MetLeu: 2.514 ± 0.278
0.967MetMet: 0.967 ± 0.144
1.569MetAsn: 1.569 ± 0.15
1.096MetPro: 1.096 ± 0.158
1.139MetGln: 1.139 ± 0.177
1.719MetArg: 1.719 ± 0.177
2.407MetSer: 2.407 ± 0.198
2.02MetThr: 2.02 ± 0.21
2.385MetVal: 2.385 ± 0.226
0.279MetTrp: 0.279 ± 0.076
1.031MetTyr: 1.031 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
3.287AsnAla: 3.287 ± 0.267
0.236AsnCys: 0.236 ± 0.086
2.578AsnAsp: 2.578 ± 0.27
2.901AsnGlu: 2.901 ± 0.264
2.02AsnPhe: 2.02 ± 0.205
4.083AsnGly: 4.083 ± 0.358
1.139AsnHis: 1.139 ± 0.158
3.287AsnIle: 3.287 ± 0.255
2.299AsnLys: 2.299 ± 0.23
3.61AsnLeu: 3.61 ± 0.286
1.59AsnMet: 1.59 ± 0.204
2.192AsnAsn: 2.192 ± 0.254
2.17AsnPro: 2.17 ± 0.216
1.977AsnGln: 1.977 ± 0.205
2.492AsnArg: 2.492 ± 0.209
3.008AsnSer: 3.008 ± 0.265
3.545AsnThr: 3.545 ± 0.369
3.975AsnVal: 3.975 ± 0.253
0.731AsnTrp: 0.731 ± 0.131
1.826AsnTyr: 1.826 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
2.213ProAla: 2.213 ± 0.237
0.473ProCys: 0.473 ± 0.102
2.621ProAsp: 2.621 ± 0.284
3.008ProGlu: 3.008 ± 0.341
1.311ProPhe: 1.311 ± 0.185
0.645ProGly: 0.645 ± 0.125
0.774ProHis: 0.774 ± 0.118
1.891ProIle: 1.891 ± 0.188
2.492ProLys: 2.492 ± 0.26
2.621ProLeu: 2.621 ± 0.241
1.139ProMet: 1.139 ± 0.15
1.998ProAsn: 1.998 ± 0.221
0.623ProPro: 0.623 ± 0.133
1.117ProGln: 1.117 ± 0.14
1.547ProArg: 1.547 ± 0.154
2.836ProSer: 2.836 ± 0.246
2.235ProThr: 2.235 ± 0.215
2.557ProVal: 2.557 ± 0.242
0.688ProTrp: 0.688 ± 0.125
1.375ProTyr: 1.375 ± 0.195
0.0ProXaa: 0.0 ± 0.0
Gln
3.202GlnAla: 3.202 ± 0.273
0.494GlnCys: 0.494 ± 0.101
1.654GlnAsp: 1.654 ± 0.19
2.235GlnGlu: 2.235 ± 0.249
1.16GlnPhe: 1.16 ± 0.177
2.45GlnGly: 2.45 ± 0.241
0.623GlnHis: 0.623 ± 0.114
2.063GlnIle: 2.063 ± 0.176
1.826GlnLys: 1.826 ± 0.244
3.502GlnLeu: 3.502 ± 0.303
1.01GlnMet: 1.01 ± 0.166
1.053GlnAsn: 1.053 ± 0.147
1.375GlnPro: 1.375 ± 0.191
1.182GlnGln: 1.182 ± 0.203
2.02GlnArg: 2.02 ± 0.23
2.041GlnSer: 2.041 ± 0.18
2.127GlnThr: 2.127 ± 0.205
2.213GlnVal: 2.213 ± 0.205
0.43GlnTrp: 0.43 ± 0.082
1.654GlnTyr: 1.654 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
3.545ArgAla: 3.545 ± 0.279
0.666ArgCys: 0.666 ± 0.117
3.653ArgAsp: 3.653 ± 0.343
4.018ArgGlu: 4.018 ± 0.35
2.17ArgPhe: 2.17 ± 0.167
3.159ArgGly: 3.159 ± 0.22
1.354ArgHis: 1.354 ± 0.18
3.094ArgIle: 3.094 ± 0.261
2.922ArgLys: 2.922 ± 0.299
4.663ArgLeu: 4.663 ± 0.321
1.654ArgMet: 1.654 ± 0.201
2.235ArgAsn: 2.235 ± 0.224
1.44ArgPro: 1.44 ± 0.17
2.235ArgGln: 2.235 ± 0.212
2.707ArgArg: 2.707 ± 0.289
2.45ArgSer: 2.45 ± 0.23
2.686ArgThr: 2.686 ± 0.26
3.825ArgVal: 3.825 ± 0.326
0.688ArgTrp: 0.688 ± 0.114
2.084ArgTyr: 2.084 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
4.534SerAla: 4.534 ± 0.331
0.494SerCys: 0.494 ± 0.103
3.911SerAsp: 3.911 ± 0.313
4.598SerGlu: 4.598 ± 0.296
2.75SerPhe: 2.75 ± 0.261
4.899SerGly: 4.899 ± 0.47
1.354SerHis: 1.354 ± 0.177
4.147SerIle: 4.147 ± 0.325
3.868SerLys: 3.868 ± 0.357
4.641SerLeu: 4.641 ± 0.347
1.977SerMet: 1.977 ± 0.248
3.395SerAsn: 3.395 ± 0.266
1.934SerPro: 1.934 ± 0.234
2.6SerGln: 2.6 ± 0.268
2.643SerArg: 2.643 ± 0.219
3.846SerSer: 3.846 ± 0.34
3.61SerThr: 3.61 ± 0.316
4.62SerVal: 4.62 ± 0.379
0.838SerTrp: 0.838 ± 0.121
2.578SerTyr: 2.578 ± 0.227
0.0SerXaa: 0.0 ± 0.0
Thr
4.254ThrAla: 4.254 ± 0.308
0.645ThrCys: 0.645 ± 0.123
3.911ThrAsp: 3.911 ± 0.297
3.674ThrGlu: 3.674 ± 0.297
2.664ThrPhe: 2.664 ± 0.263
4.104ThrGly: 4.104 ± 0.337
1.031ThrHis: 1.031 ± 0.168
4.018ThrIle: 4.018 ± 0.355
3.33ThrLys: 3.33 ± 0.308
4.899ThrLeu: 4.899 ± 0.397
1.569ThrMet: 1.569 ± 0.208
3.696ThrAsn: 3.696 ± 0.324
2.707ThrPro: 2.707 ± 0.224
1.654ThrGln: 1.654 ± 0.29
3.202ThrArg: 3.202 ± 0.227
4.297ThrSer: 4.297 ± 0.458
4.383ThrThr: 4.383 ± 0.452
5.286ThrVal: 5.286 ± 0.34
0.731ThrTrp: 0.731 ± 0.112
2.321ThrTyr: 2.321 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
5.415ValAla: 5.415 ± 0.36
0.709ValCys: 0.709 ± 0.162
5.93ValAsp: 5.93 ± 0.414
5.887ValGlu: 5.887 ± 0.429
2.106ValPhe: 2.106 ± 0.225
4.426ValGly: 4.426 ± 0.311
1.569ValHis: 1.569 ± 0.17
4.254ValIle: 4.254 ± 0.325
4.426ValLys: 4.426 ± 0.337
5.243ValLeu: 5.243 ± 0.388
1.955ValMet: 1.955 ± 0.179
3.954ValAsn: 3.954 ± 0.291
2.063ValPro: 2.063 ± 0.23
2.342ValGln: 2.342 ± 0.211
4.061ValArg: 4.061 ± 0.328
4.727ValSer: 4.727 ± 0.363
5.501ValThr: 5.501 ± 0.391
6.188ValVal: 6.188 ± 0.465
0.881ValTrp: 0.881 ± 0.142
2.235ValTyr: 2.235 ± 0.218
0.0ValXaa: 0.0 ± 0.0
Trp
0.902TrpAla: 0.902 ± 0.128
0.236TrpCys: 0.236 ± 0.079
0.817TrpAsp: 0.817 ± 0.112
0.924TrpGlu: 0.924 ± 0.135
0.602TrpPhe: 0.602 ± 0.097
0.817TrpGly: 0.817 ± 0.128
0.236TrpHis: 0.236 ± 0.073
0.731TrpIle: 0.731 ± 0.133
0.645TrpLys: 0.645 ± 0.121
1.418TrpLeu: 1.418 ± 0.186
0.172TrpMet: 0.172 ± 0.061
0.709TrpAsn: 0.709 ± 0.128
0.365TrpPro: 0.365 ± 0.105
0.58TrpGln: 0.58 ± 0.11
1.01TrpArg: 1.01 ± 0.131
0.924TrpSer: 0.924 ± 0.147
0.516TrpThr: 0.516 ± 0.126
0.988TrpVal: 0.988 ± 0.138
0.279TrpTrp: 0.279 ± 0.087
0.301TrpTyr: 0.301 ± 0.085
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.75TyrAla: 2.75 ± 0.24
0.387TyrCys: 0.387 ± 0.09
2.879TyrAsp: 2.879 ± 0.222
2.127TyrGlu: 2.127 ± 0.227
1.139TyrPhe: 1.139 ± 0.192
2.514TyrGly: 2.514 ± 0.193
1.031TyrHis: 1.031 ± 0.164
2.063TyrIle: 2.063 ± 0.248
1.869TyrLys: 1.869 ± 0.213
3.008TyrLeu: 3.008 ± 0.236
1.225TyrMet: 1.225 ± 0.144
1.783TyrAsn: 1.783 ± 0.197
1.16TyrPro: 1.16 ± 0.144
1.225TyrGln: 1.225 ± 0.183
2.192TyrArg: 2.192 ± 0.245
2.45TyrSer: 2.45 ± 0.238
2.6TyrThr: 2.6 ± 0.314
2.492TyrVal: 2.492 ± 0.242
0.559TyrTrp: 0.559 ± 0.097
1.547TyrTyr: 1.547 ± 0.164
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 195 proteins (46541 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski