Amino acid dipeptide frequency for Vibrio phage Chazly21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
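As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It is not the database's own code: the function name and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter labels in the table, and normalising by the total number of counted dipeptides is an assumption (the database may normalise differently).

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Chazly21 proteome.
toy_proteins = ["MKAAL", "AAK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Counting per protein rather than on one concatenated string avoids creating artificial dipeptides at protein boundaries.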

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
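The uniform expectation and the over/under comparison can be sketched as follows. The three observed values are copied from the table; the function name is hypothetical, and using exactly 1/400 as the colouring threshold is an assumption based on the paragraph above rather than a documented rule of the page.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / (20 * 20) = 0.0025 = 0.25 % = 2.5 per mille
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # would be shown in red
    if observed_per_mille < expected:
        return "under"   # would be shown in blue
    return "as expected"


# Observed values (per mille) taken from the table.
examples = {"AlaAla": 6.46, "CysCys": 0.14, "GlyPro": 0.475}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille, labels)
```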

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
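A minimal sketch of the unit conversion, using the AlaAla value from the table as the worked example; the helper names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent (multiply by 10^-1)."""
    return value * 1e-1


ala_ala = 6.46  # per mille, from the table
print(f"fraction: {per_mille_to_fraction(ala_ala):.5f}")
print(f"percent:  {per_mille_to_percent(ala_ala):.3f}")
```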

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.46AlaAla: 6.46 ± 0.851
0.811AlaCys: 0.811 ± 0.147
4.642AlaAsp: 4.642 ± 0.359
5.677AlaGlu: 5.677 ± 0.547
3.048AlaPhe: 3.048 ± 0.254
4.614AlaGly: 4.614 ± 0.349
1.23AlaHis: 1.23 ± 0.169
4.614AlaIle: 4.614 ± 0.343
6.823AlaLys: 6.823 ± 0.581
6.488AlaLeu: 6.488 ± 0.383
2.125AlaMet: 2.125 ± 0.299
4.027AlaAsn: 4.027 ± 0.358
2.517AlaPro: 2.517 ± 0.299
2.992AlaGln: 2.992 ± 0.355
3.607AlaArg: 3.607 ± 0.344
5.509AlaSer: 5.509 ± 0.585
4.782AlaThr: 4.782 ± 0.418
4.279AlaVal: 4.279 ± 0.392
0.923AlaTrp: 0.923 ± 0.164
2.796AlaTyr: 2.796 ± 0.234
0.0AlaXaa: 0.0 ± 0.0
Cys
0.839CysAla: 0.839 ± 0.155
0.14CysCys: 0.14 ± 0.068
0.587CysAsp: 0.587 ± 0.151
1.119CysGlu: 1.119 ± 0.166
0.447CysPhe: 0.447 ± 0.121
0.783CysGly: 0.783 ± 0.166
0.364CysHis: 0.364 ± 0.097
0.671CysIle: 0.671 ± 0.13
0.839CysLys: 0.839 ± 0.16
0.755CysLeu: 0.755 ± 0.16
0.28CysMet: 0.28 ± 0.104
0.699CysAsn: 0.699 ± 0.152
0.419CysPro: 0.419 ± 0.111
0.587CysGln: 0.587 ± 0.102
0.587CysArg: 0.587 ± 0.132
0.699CysSer: 0.699 ± 0.164
0.839CysThr: 0.839 ± 0.146
0.839CysVal: 0.839 ± 0.173
0.112CysTrp: 0.112 ± 0.051
0.531CysTyr: 0.531 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
4.838AspAla: 4.838 ± 0.325
0.727AspCys: 0.727 ± 0.159
3.412AspAsp: 3.412 ± 0.375
4.334AspGlu: 4.334 ± 0.413
3.356AspPhe: 3.356 ± 0.31
4.558AspGly: 4.558 ± 0.408
0.839AspHis: 0.839 ± 0.153
4.474AspIle: 4.474 ± 0.344
4.334AspLys: 4.334 ± 0.364
4.95AspLeu: 4.95 ± 0.395
1.678AspMet: 1.678 ± 0.227
3.468AspAsn: 3.468 ± 0.33
2.349AspPro: 2.349 ± 0.281
1.51AspGln: 1.51 ± 0.175
2.601AspArg: 2.601 ± 0.314
3.831AspSer: 3.831 ± 0.384
4.055AspThr: 4.055 ± 0.348
4.111AspVal: 4.111 ± 0.32
1.091AspTrp: 1.091 ± 0.182
2.321AspTyr: 2.321 ± 0.24
0.0AspXaa: 0.0 ± 0.0
Glu
6.124GluAla: 6.124 ± 0.52
1.035GluCys: 1.035 ± 0.195
3.887GluAsp: 3.887 ± 0.291
4.446GluGlu: 4.446 ± 0.401
2.629GluPhe: 2.629 ± 0.28
3.943GluGly: 3.943 ± 0.297
1.678GluHis: 1.678 ± 0.217
4.726GluIle: 4.726 ± 0.427
4.81GluLys: 4.81 ± 0.402
7.187GluLeu: 7.187 ± 0.508
2.013GluMet: 2.013 ± 0.22
3.551GluAsn: 3.551 ± 0.283
1.957GluPro: 1.957 ± 0.235
3.02GluGln: 3.02 ± 0.329
2.88GluArg: 2.88 ± 0.335
2.852GluSer: 2.852 ± 0.323
3.579GluThr: 3.579 ± 0.352
5.369GluVal: 5.369 ± 0.376
1.119GluTrp: 1.119 ± 0.178
3.16GluTyr: 3.16 ± 0.341
0.0GluXaa: 0.0 ± 0.0
Phe
2.824PheAla: 2.824 ± 0.326
0.615PheCys: 0.615 ± 0.121
3.188PheAsp: 3.188 ± 0.298
2.992PheGlu: 2.992 ± 0.276
1.454PhePhe: 1.454 ± 0.189
2.685PheGly: 2.685 ± 0.282
1.258PheHis: 1.258 ± 0.173
2.629PheIle: 2.629 ± 0.255
2.908PheLys: 2.908 ± 0.293
2.461PheLeu: 2.461 ± 0.264
1.091PheMet: 1.091 ± 0.184
2.265PheAsn: 2.265 ± 0.221
1.538PhePro: 1.538 ± 0.198
1.035PheGln: 1.035 ± 0.134
1.678PheArg: 1.678 ± 0.195
2.573PheSer: 2.573 ± 0.274
2.377PheThr: 2.377 ± 0.234
2.069PheVal: 2.069 ± 0.253
0.615PheTrp: 0.615 ± 0.099
1.426PheTyr: 1.426 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
4.279GlyAla: 4.279 ± 0.458
1.007GlyCys: 1.007 ± 0.207
4.866GlyAsp: 4.866 ± 0.457
3.999GlyGlu: 3.999 ± 0.344
3.216GlyPhe: 3.216 ± 0.306
3.691GlyGly: 3.691 ± 0.418
1.147GlyHis: 1.147 ± 0.191
4.195GlyIle: 4.195 ± 0.307
5.257GlyLys: 5.257 ± 0.462
5.201GlyLeu: 5.201 ± 0.422
1.93GlyMet: 1.93 ± 0.203
3.216GlyAsn: 3.216 ± 0.392
0.475GlyPro: 0.475 ± 0.119
1.957GlyGln: 1.957 ± 0.242
3.132GlyArg: 3.132 ± 0.271
4.167GlySer: 4.167 ± 0.457
4.754GlyThr: 4.754 ± 0.389
4.279GlyVal: 4.279 ± 0.331
1.454GlyTrp: 1.454 ± 0.238
3.048GlyTyr: 3.048 ± 0.349
0.0GlyXaa: 0.0 ± 0.0
His
1.147HisAla: 1.147 ± 0.179
0.391HisCys: 0.391 ± 0.108
1.566HisAsp: 1.566 ± 0.212
1.37HisGlu: 1.37 ± 0.208
0.867HisPhe: 0.867 ± 0.141
1.65HisGly: 1.65 ± 0.199
0.531HisHis: 0.531 ± 0.122
1.51HisIle: 1.51 ± 0.197
1.37HisLys: 1.37 ± 0.201
2.265HisLeu: 2.265 ± 0.248
0.587HisMet: 0.587 ± 0.14
0.979HisAsn: 0.979 ± 0.187
0.839HisPro: 0.839 ± 0.166
0.559HisGln: 0.559 ± 0.113
1.035HisArg: 1.035 ± 0.147
1.007HisSer: 1.007 ± 0.177
1.174HisThr: 1.174 ± 0.186
1.119HisVal: 1.119 ± 0.206
0.252HisTrp: 0.252 ± 0.085
0.951HisTyr: 0.951 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
4.055IleAla: 4.055 ± 0.313
0.615IleCys: 0.615 ± 0.141
3.887IleAsp: 3.887 ± 0.368
5.593IleGlu: 5.593 ± 0.437
1.818IlePhe: 1.818 ± 0.2
3.523IleGly: 3.523 ± 0.296
1.314IleHis: 1.314 ± 0.203
3.551IleIle: 3.551 ± 0.278
4.726IleLys: 4.726 ± 0.362
4.167IleLeu: 4.167 ± 0.283
1.79IleMet: 1.79 ± 0.209
3.579IleAsn: 3.579 ± 0.376
2.321IlePro: 2.321 ± 0.275
2.601IleGln: 2.601 ± 0.242
2.964IleArg: 2.964 ± 0.294
4.418IleSer: 4.418 ± 0.385
4.167IleThr: 4.167 ± 0.378
3.468IleVal: 3.468 ± 0.356
0.531IleTrp: 0.531 ± 0.116
2.293IleTyr: 2.293 ± 0.268
0.0IleXaa: 0.0 ± 0.0
Lys
6.767LysAla: 6.767 ± 0.591
0.671LysCys: 0.671 ± 0.135
4.027LysAsp: 4.027 ± 0.34
5.369LysGlu: 5.369 ± 0.436
3.02LysPhe: 3.02 ± 0.296
4.446LysGly: 4.446 ± 0.312
1.678LysHis: 1.678 ± 0.238
4.446LysIle: 4.446 ± 0.32
4.167LysLys: 4.167 ± 0.324
7.187LysLeu: 7.187 ± 0.551
2.293LysMet: 2.293 ± 0.271
2.796LysAsn: 2.796 ± 0.279
3.132LysPro: 3.132 ± 0.438
2.685LysGln: 2.685 ± 0.272
2.824LysArg: 2.824 ± 0.314
3.775LysSer: 3.775 ± 0.357
4.055LysThr: 4.055 ± 0.375
4.586LysVal: 4.586 ± 0.352
0.727LysTrp: 0.727 ± 0.13
2.88LysTyr: 2.88 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
7.578LeuAla: 7.578 ± 0.55
0.839LeuCys: 0.839 ± 0.147
6.264LeuAsp: 6.264 ± 0.438
6.544LeuGlu: 6.544 ± 0.441
2.377LeuPhe: 2.377 ± 0.25
5.621LeuGly: 5.621 ± 0.41
1.902LeuHis: 1.902 ± 0.231
4.558LeuIle: 4.558 ± 0.32
5.117LeuLys: 5.117 ± 0.4
6.432LeuLeu: 6.432 ± 0.435
2.153LeuMet: 2.153 ± 0.253
3.971LeuAsn: 3.971 ± 0.352
3.468LeuPro: 3.468 ± 0.346
2.992LeuGln: 2.992 ± 0.272
4.111LeuArg: 4.111 ± 0.373
4.81LeuSer: 4.81 ± 0.348
4.698LeuThr: 4.698 ± 0.308
6.264LeuVal: 6.264 ± 0.439
0.727LeuTrp: 0.727 ± 0.185
2.713LeuTyr: 2.713 ± 0.294
0.0LeuXaa: 0.0 ± 0.0
Met
2.293MetAla: 2.293 ± 0.28
0.364MetCys: 0.364 ± 0.114
1.51MetAsp: 1.51 ± 0.246
1.846MetGlu: 1.846 ± 0.204
1.063MetPhe: 1.063 ± 0.164
1.566MetGly: 1.566 ± 0.198
0.615MetHis: 0.615 ± 0.118
1.566MetIle: 1.566 ± 0.235
2.209MetLys: 2.209 ± 0.285
2.517MetLeu: 2.517 ± 0.27
0.643MetMet: 0.643 ± 0.155
1.37MetAsn: 1.37 ± 0.222
1.314MetPro: 1.314 ± 0.192
0.979MetGln: 0.979 ± 0.169
1.314MetArg: 1.314 ± 0.21
1.566MetSer: 1.566 ± 0.207
1.678MetThr: 1.678 ± 0.208
1.594MetVal: 1.594 ± 0.243
0.252MetTrp: 0.252 ± 0.073
0.839MetTyr: 0.839 ± 0.154
0.0MetXaa: 0.0 ± 0.0
Asn
3.775AsnAla: 3.775 ± 0.318
0.615AsnCys: 0.615 ± 0.144
2.209AsnAsp: 2.209 ± 0.241
2.852AsnGlu: 2.852 ± 0.253
1.538AsnPhe: 1.538 ± 0.223
3.803AsnGly: 3.803 ± 0.34
1.035AsnHis: 1.035 ± 0.193
3.132AsnIle: 3.132 ± 0.287
4.726AsnLys: 4.726 ± 0.411
4.251AsnLeu: 4.251 ± 0.307
1.342AsnMet: 1.342 ± 0.201
2.629AsnAsn: 2.629 ± 0.285
2.517AsnPro: 2.517 ± 0.27
1.51AsnGln: 1.51 ± 0.227
2.209AsnArg: 2.209 ± 0.284
3.803AsnSer: 3.803 ± 0.358
3.132AsnThr: 3.132 ± 0.36
3.719AsnVal: 3.719 ± 0.314
0.755AsnTrp: 0.755 ± 0.156
2.041AsnTyr: 2.041 ± 0.23
0.0AsnXaa: 0.0 ± 0.0
Pro
2.265ProAla: 2.265 ± 0.286
0.364ProCys: 0.364 ± 0.094
2.349ProAsp: 2.349 ± 0.279
3.216ProGlu: 3.216 ± 0.327
1.398ProPhe: 1.398 ± 0.163
1.902ProGly: 1.902 ± 0.329
0.783ProHis: 0.783 ± 0.144
2.153ProIle: 2.153 ± 0.238
2.069ProLys: 2.069 ± 0.26
2.461ProLeu: 2.461 ± 0.295
1.063ProMet: 1.063 ± 0.171
2.265ProAsn: 2.265 ± 0.3
0.839ProPro: 0.839 ± 0.155
1.035ProGln: 1.035 ± 0.16
1.454ProArg: 1.454 ± 0.192
2.125ProSer: 2.125 ± 0.314
2.237ProThr: 2.237 ± 0.289
2.517ProVal: 2.517 ± 0.29
0.447ProTrp: 0.447 ± 0.121
1.678ProTyr: 1.678 ± 0.258
0.0ProXaa: 0.0 ± 0.0
Gln
3.3GlnAla: 3.3 ± 0.319
0.559GlnCys: 0.559 ± 0.122
1.93GlnAsp: 1.93 ± 0.238
2.265GlnGlu: 2.265 ± 0.28
1.398GlnPhe: 1.398 ± 0.167
2.237GlnGly: 2.237 ± 0.249
0.699GlnHis: 0.699 ± 0.123
2.209GlnIle: 2.209 ± 0.237
1.985GlnLys: 1.985 ± 0.246
3.496GlnLeu: 3.496 ± 0.345
1.063GlnMet: 1.063 ± 0.171
1.174GlnAsn: 1.174 ± 0.219
1.035GlnPro: 1.035 ± 0.186
1.119GlnGln: 1.119 ± 0.236
1.594GlnArg: 1.594 ± 0.226
2.153GlnSer: 2.153 ± 0.36
2.013GlnThr: 2.013 ± 0.221
2.685GlnVal: 2.685 ± 0.278
0.531GlnTrp: 0.531 ± 0.126
1.538GlnTyr: 1.538 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
3.551ArgAla: 3.551 ± 0.357
0.447ArgCys: 0.447 ± 0.123
3.3ArgAsp: 3.3 ± 0.39
2.796ArgGlu: 2.796 ± 0.251
1.79ArgPhe: 1.79 ± 0.248
2.964ArgGly: 2.964 ± 0.269
0.923ArgHis: 0.923 ± 0.174
2.936ArgIle: 2.936 ± 0.272
3.328ArgLys: 3.328 ± 0.295
3.691ArgLeu: 3.691 ± 0.349
1.119ArgMet: 1.119 ± 0.187
2.433ArgAsn: 2.433 ± 0.281
1.37ArgPro: 1.37 ± 0.206
1.454ArgGln: 1.454 ± 0.203
2.125ArgArg: 2.125 ± 0.284
2.349ArgSer: 2.349 ± 0.294
2.517ArgThr: 2.517 ± 0.257
3.16ArgVal: 3.16 ± 0.306
0.615ArgTrp: 0.615 ± 0.135
1.734ArgTyr: 1.734 ± 0.246
0.0ArgXaa: 0.0 ± 0.0
Ser
5.034SerAla: 5.034 ± 0.546
0.531SerCys: 0.531 ± 0.118
3.496SerAsp: 3.496 ± 0.314
3.356SerGlu: 3.356 ± 0.385
2.405SerPhe: 2.405 ± 0.276
5.509SerGly: 5.509 ± 0.549
1.174SerHis: 1.174 ± 0.187
3.356SerIle: 3.356 ± 0.314
4.67SerLys: 4.67 ± 0.342
5.369SerLeu: 5.369 ± 0.452
1.762SerMet: 1.762 ± 0.192
3.16SerAsn: 3.16 ± 0.29
2.265SerPro: 2.265 ± 0.264
1.93SerGln: 1.93 ± 0.251
2.293SerArg: 2.293 ± 0.244
4.334SerSer: 4.334 ± 0.554
3.747SerThr: 3.747 ± 0.406
4.306SerVal: 4.306 ± 0.381
0.755SerTrp: 0.755 ± 0.172
2.377SerTyr: 2.377 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
4.67ThrAla: 4.67 ± 0.426
0.671ThrCys: 0.671 ± 0.137
3.887ThrAsp: 3.887 ± 0.337
3.579ThrGlu: 3.579 ± 0.326
2.852ThrPhe: 2.852 ± 0.327
4.279ThrGly: 4.279 ± 0.406
1.37ThrHis: 1.37 ± 0.205
4.167ThrIle: 4.167 ± 0.363
4.362ThrLys: 4.362 ± 0.415
4.81ThrLeu: 4.81 ± 0.353
1.119ThrMet: 1.119 ± 0.184
3.496ThrAsn: 3.496 ± 0.354
2.265ThrPro: 2.265 ± 0.272
2.041ThrGln: 2.041 ± 0.243
2.517ThrArg: 2.517 ± 0.188
3.971ThrSer: 3.971 ± 0.483
3.887ThrThr: 3.887 ± 0.374
3.775ThrVal: 3.775 ± 0.314
0.727ThrTrp: 0.727 ± 0.153
2.964ThrTyr: 2.964 ± 0.257
0.0ThrXaa: 0.0 ± 0.0
Val
4.67ValAla: 4.67 ± 0.433
0.811ValCys: 0.811 ± 0.158
4.195ValAsp: 4.195 ± 0.3
4.922ValGlu: 4.922 ± 0.331
3.048ValPhe: 3.048 ± 0.248
3.775ValGly: 3.775 ± 0.363
1.37ValHis: 1.37 ± 0.201
3.272ValIle: 3.272 ± 0.312
4.726ValLys: 4.726 ± 0.372
4.726ValLeu: 4.726 ± 0.349
1.426ValMet: 1.426 ± 0.204
3.44ValAsn: 3.44 ± 0.304
2.237ValPro: 2.237 ± 0.25
2.321ValGln: 2.321 ± 0.225
3.384ValArg: 3.384 ± 0.273
4.978ValSer: 4.978 ± 0.361
4.474ValThr: 4.474 ± 0.429
4.586ValVal: 4.586 ± 0.374
0.811ValTrp: 0.811 ± 0.157
2.489ValTyr: 2.489 ± 0.228
0.0ValXaa: 0.0 ± 0.0
Trp
1.174TrpAla: 1.174 ± 0.196
0.196TrpCys: 0.196 ± 0.072
1.007TrpAsp: 1.007 ± 0.161
0.727TrpGlu: 0.727 ± 0.13
0.531TrpPhe: 0.531 ± 0.119
0.895TrpGly: 0.895 ± 0.138
0.391TrpHis: 0.391 ± 0.091
0.531TrpIle: 0.531 ± 0.123
0.755TrpLys: 0.755 ± 0.142
1.23TrpLeu: 1.23 ± 0.173
0.391TrpMet: 0.391 ± 0.112
0.727TrpAsn: 0.727 ± 0.156
0.224TrpPro: 0.224 ± 0.083
0.587TrpGln: 0.587 ± 0.125
0.531TrpArg: 0.531 ± 0.115
0.839TrpSer: 0.839 ± 0.214
0.503TrpThr: 0.503 ± 0.121
1.007TrpVal: 1.007 ± 0.156
0.224TrpTrp: 0.224 ± 0.083
0.643TrpTyr: 0.643 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.321TyrAla: 2.321 ± 0.232
0.727TyrCys: 0.727 ± 0.144
2.629TyrAsp: 2.629 ± 0.305
2.964TyrGlu: 2.964 ± 0.302
1.426TyrPhe: 1.426 ± 0.159
2.824TyrGly: 2.824 ± 0.257
0.867TyrHis: 0.867 ± 0.144
2.685TyrIle: 2.685 ± 0.321
2.349TyrLys: 2.349 ± 0.264
3.523TyrLeu: 3.523 ± 0.389
1.202TyrMet: 1.202 ± 0.178
2.349TyrAsn: 2.349 ± 0.288
1.538TyrPro: 1.538 ± 0.152
2.069TyrGln: 2.069 ± 0.258
1.762TyrArg: 1.762 ± 0.195
2.069TyrSer: 2.069 ± 0.335
2.74TyrThr: 2.74 ± 0.29
1.93TyrVal: 1.93 ± 0.237
0.447TyrTrp: 0.447 ± 0.126
1.594TyrTyr: 1.594 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 180 proteins (35761 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
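Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is a sketch under assumptions, not the database's implementation: the function names and toy sequences are hypothetical, one-letter codes are used, and taking the sample standard deviation as the reported error is an assumption.

```python
import random
import statistics


def dipeptide_freq(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the Chazly21 proteome.
toy_proteins = ["MKAAL", "AAK", "MLLKA", "GAAS"]
print(bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual residues.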

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski