Amino acid dipepetide frequency for Vibrio phage vB_VpS_CA8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.312AlaAla: 8.312 ± 1.132
0.854AlaCys: 0.854 ± 0.245
5.067AlaAsp: 5.067 ± 0.612
6.149AlaGlu: 6.149 ± 0.921
2.676AlaPhe: 2.676 ± 0.341
5.067AlaGly: 5.067 ± 0.732
1.309AlaHis: 1.309 ± 0.269
4.042AlaIle: 4.042 ± 0.482
5.409AlaLys: 5.409 ± 0.91
7.629AlaLeu: 7.629 ± 0.793
3.245AlaMet: 3.245 ± 0.529
3.587AlaAsn: 3.587 ± 0.554
2.961AlaPro: 2.961 ± 0.495
3.188AlaGln: 3.188 ± 0.525
4.156AlaArg: 4.156 ± 0.572
3.928AlaSer: 3.928 ± 0.464
6.263AlaThr: 6.263 ± 0.642
5.864AlaVal: 5.864 ± 0.565
1.423AlaTrp: 1.423 ± 0.281
2.847AlaTyr: 2.847 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.626CysAla: 0.626 ± 0.222
0.228CysCys: 0.228 ± 0.129
1.025CysAsp: 1.025 ± 0.262
1.253CysGlu: 1.253 ± 0.292
0.399CysPhe: 0.399 ± 0.149
1.253CysGly: 1.253 ± 0.291
0.569CysHis: 0.569 ± 0.274
0.626CysIle: 0.626 ± 0.166
1.082CysLys: 1.082 ± 0.248
1.025CysLeu: 1.025 ± 0.21
0.683CysMet: 0.683 ± 0.174
1.139CysAsn: 1.139 ± 0.265
0.626CysPro: 0.626 ± 0.163
0.342CysGln: 0.342 ± 0.123
1.253CysArg: 1.253 ± 0.307
1.025CysSer: 1.025 ± 0.327
0.797CysThr: 0.797 ± 0.2
0.683CysVal: 0.683 ± 0.243
0.057CysTrp: 0.057 ± 0.067
0.285CysTyr: 0.285 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
4.783AspAla: 4.783 ± 0.469
0.797AspCys: 0.797 ± 0.213
3.644AspAsp: 3.644 ± 0.648
3.473AspGlu: 3.473 ± 0.437
2.22AspPhe: 2.22 ± 0.389
5.409AspGly: 5.409 ± 0.587
2.277AspHis: 2.277 ± 0.328
3.644AspIle: 3.644 ± 0.35
3.245AspLys: 3.245 ± 0.475
3.985AspLeu: 3.985 ± 0.457
2.22AspMet: 2.22 ± 0.346
2.22AspAsn: 2.22 ± 0.389
2.391AspPro: 2.391 ± 0.315
1.708AspGln: 1.708 ± 0.247
2.847AspArg: 2.847 ± 0.343
2.961AspSer: 2.961 ± 0.397
2.79AspThr: 2.79 ± 0.459
3.302AspVal: 3.302 ± 0.371
1.309AspTrp: 1.309 ± 0.291
2.164AspTyr: 2.164 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
6.775GluAla: 6.775 ± 0.881
0.911GluCys: 0.911 ± 0.217
2.676GluAsp: 2.676 ± 0.423
3.758GluGlu: 3.758 ± 0.507
3.188GluPhe: 3.188 ± 0.451
3.701GluGly: 3.701 ± 0.547
1.651GluHis: 1.651 ± 0.32
3.416GluIle: 3.416 ± 0.514
5.01GluLys: 5.01 ± 0.7
6.661GluLeu: 6.661 ± 0.577
1.708GluMet: 1.708 ± 0.328
3.131GluAsn: 3.131 ± 0.39
2.619GluPro: 2.619 ± 0.461
3.359GluGln: 3.359 ± 0.467
4.839GluArg: 4.839 ± 0.576
2.733GluSer: 2.733 ± 0.491
3.928GluThr: 3.928 ± 0.463
4.669GluVal: 4.669 ± 0.487
0.968GluTrp: 0.968 ± 0.205
2.05GluTyr: 2.05 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
3.074PheAla: 3.074 ± 0.399
0.626PheCys: 0.626 ± 0.215
3.018PheAsp: 3.018 ± 0.401
2.391PheGlu: 2.391 ± 0.359
0.968PhePhe: 0.968 ± 0.233
2.961PheGly: 2.961 ± 0.43
0.74PheHis: 0.74 ± 0.196
2.334PheIle: 2.334 ± 0.358
1.765PheLys: 1.765 ± 0.375
2.619PheLeu: 2.619 ± 0.445
1.196PheMet: 1.196 ± 0.273
1.879PheAsn: 1.879 ± 0.305
1.082PhePro: 1.082 ± 0.217
1.309PheGln: 1.309 ± 0.323
2.05PheArg: 2.05 ± 0.34
1.708PheSer: 1.708 ± 0.287
2.847PheThr: 2.847 ± 0.464
2.79PheVal: 2.79 ± 0.472
0.512PheTrp: 0.512 ± 0.198
1.082PheTyr: 1.082 ± 0.235
0.0PheXaa: 0.0 ± 0.0
Gly
5.807GlyAla: 5.807 ± 0.731
1.082GlyCys: 1.082 ± 0.295
3.473GlyAsp: 3.473 ± 0.432
4.953GlyGlu: 4.953 ± 0.575
2.847GlyPhe: 2.847 ± 0.397
6.661GlyGly: 6.661 ± 0.681
1.879GlyHis: 1.879 ± 0.383
4.384GlyIle: 4.384 ± 0.529
4.783GlyLys: 4.783 ± 0.396
5.58GlyLeu: 5.58 ± 0.543
2.164GlyMet: 2.164 ± 0.347
3.758GlyAsn: 3.758 ± 0.445
1.594GlyPro: 1.594 ± 0.331
3.188GlyGln: 3.188 ± 0.44
3.701GlyArg: 3.701 ± 0.412
4.498GlySer: 4.498 ± 0.599
4.953GlyThr: 4.953 ± 0.59
6.206GlyVal: 6.206 ± 0.592
1.423GlyTrp: 1.423 ± 0.319
2.904GlyTyr: 2.904 ± 0.436
0.0GlyXaa: 0.0 ± 0.0
His
2.164HisAla: 2.164 ± 0.4
0.512HisCys: 0.512 ± 0.203
0.854HisAsp: 0.854 ± 0.242
1.423HisGlu: 1.423 ± 0.356
0.968HisPhe: 0.968 ± 0.249
1.936HisGly: 1.936 ± 0.324
0.569HisHis: 0.569 ± 0.227
1.936HisIle: 1.936 ± 0.257
0.74HisLys: 0.74 ± 0.17
1.253HisLeu: 1.253 ± 0.324
0.74HisMet: 0.74 ± 0.166
1.253HisAsn: 1.253 ± 0.257
1.082HisPro: 1.082 ± 0.239
0.399HisGln: 0.399 ± 0.122
0.968HisArg: 0.968 ± 0.203
1.366HisSer: 1.366 ± 0.242
1.423HisThr: 1.423 ± 0.3
1.765HisVal: 1.765 ± 0.338
0.455HisTrp: 0.455 ± 0.147
0.455HisTyr: 0.455 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
4.612IleAla: 4.612 ± 0.557
0.854IleCys: 0.854 ± 0.243
3.815IleAsp: 3.815 ± 0.476
4.213IleGlu: 4.213 ± 0.572
1.139IlePhe: 1.139 ± 0.301
3.872IleGly: 3.872 ± 0.473
0.797IleHis: 0.797 ± 0.196
2.448IleIle: 2.448 ± 0.423
3.644IleLys: 3.644 ± 0.489
3.188IleLeu: 3.188 ± 0.378
2.05IleMet: 2.05 ± 0.342
3.245IleAsn: 3.245 ± 0.411
2.505IlePro: 2.505 ± 0.374
2.05IleGln: 2.05 ± 0.312
2.847IleArg: 2.847 ± 0.35
2.22IleSer: 2.22 ± 0.384
2.904IleThr: 2.904 ± 0.523
4.099IleVal: 4.099 ± 0.525
0.399IleTrp: 0.399 ± 0.194
1.423IleTyr: 1.423 ± 0.245
0.0IleXaa: 0.0 ± 0.0
Lys
5.523LysAla: 5.523 ± 0.713
0.911LysCys: 0.911 ± 0.278
2.961LysAsp: 2.961 ± 0.403
4.327LysGlu: 4.327 ± 0.5
2.505LysPhe: 2.505 ± 0.404
4.441LysGly: 4.441 ± 0.546
1.139LysHis: 1.139 ± 0.249
2.505LysIle: 2.505 ± 0.361
3.416LysLys: 3.416 ± 0.657
6.035LysLeu: 6.035 ± 0.564
1.822LysMet: 1.822 ± 0.306
2.22LysAsn: 2.22 ± 0.397
2.733LysPro: 2.733 ± 0.329
3.985LysGln: 3.985 ± 0.488
4.27LysArg: 4.27 ± 0.616
2.961LysSer: 2.961 ± 0.401
3.131LysThr: 3.131 ± 0.39
4.099LysVal: 4.099 ± 0.478
0.911LysTrp: 0.911 ± 0.202
1.594LysTyr: 1.594 ± 0.298
0.0LysXaa: 0.0 ± 0.0
Leu
6.889LeuAla: 6.889 ± 0.736
1.253LeuCys: 1.253 ± 0.307
4.839LeuAsp: 4.839 ± 0.528
6.547LeuGlu: 6.547 ± 0.606
3.074LeuPhe: 3.074 ± 0.318
5.978LeuGly: 5.978 ± 0.505
1.48LeuHis: 1.48 ± 0.272
3.701LeuIle: 3.701 ± 0.515
4.783LeuLys: 4.783 ± 0.483
5.75LeuLeu: 5.75 ± 0.568
2.334LeuMet: 2.334 ± 0.312
2.619LeuAsn: 2.619 ± 0.397
3.245LeuPro: 3.245 ± 0.393
3.131LeuGln: 3.131 ± 0.398
5.01LeuArg: 5.01 ± 0.524
4.156LeuSer: 4.156 ± 0.446
5.466LeuThr: 5.466 ± 0.498
4.441LeuVal: 4.441 ± 0.469
1.025LeuTrp: 1.025 ± 0.257
2.22LeuTyr: 2.22 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
1.936MetAla: 1.936 ± 0.335
0.74MetCys: 0.74 ± 0.199
1.879MetAsp: 1.879 ± 0.33
3.074MetGlu: 3.074 ± 0.717
1.139MetPhe: 1.139 ± 0.212
1.765MetGly: 1.765 ± 0.25
0.911MetHis: 0.911 ± 0.185
0.911MetIle: 0.911 ± 0.23
2.562MetLys: 2.562 ± 0.362
2.164MetLeu: 2.164 ± 0.504
0.968MetMet: 0.968 ± 0.262
1.253MetAsn: 1.253 ± 0.208
1.196MetPro: 1.196 ± 0.29
1.309MetGln: 1.309 ± 0.263
1.309MetArg: 1.309 ± 0.252
2.334MetSer: 2.334 ± 0.381
1.48MetThr: 1.48 ± 0.275
2.334MetVal: 2.334 ± 0.46
0.569MetTrp: 0.569 ± 0.195
0.455MetTyr: 0.455 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.815AsnAla: 3.815 ± 0.521
0.797AsnCys: 0.797 ± 0.223
2.164AsnAsp: 2.164 ± 0.332
1.822AsnGlu: 1.822 ± 0.279
1.309AsnPhe: 1.309 ± 0.231
4.156AsnGly: 4.156 ± 0.526
1.196AsnHis: 1.196 ± 0.301
3.131AsnIle: 3.131 ± 0.406
2.562AsnLys: 2.562 ± 0.393
3.245AsnLeu: 3.245 ± 0.342
1.139AsnMet: 1.139 ± 0.236
2.164AsnAsn: 2.164 ± 0.315
2.448AsnPro: 2.448 ± 0.322
1.879AsnGln: 1.879 ± 0.311
2.107AsnArg: 2.107 ± 0.356
2.107AsnSer: 2.107 ± 0.298
3.074AsnThr: 3.074 ± 0.441
2.79AsnVal: 2.79 ± 0.352
0.854AsnTrp: 0.854 ± 0.189
1.309AsnTyr: 1.309 ± 0.303
0.0AsnXaa: 0.0 ± 0.0
Pro
2.562ProAla: 2.562 ± 0.401
0.74ProCys: 0.74 ± 0.178
3.245ProAsp: 3.245 ± 0.403
3.359ProGlu: 3.359 ± 0.468
2.107ProPhe: 2.107 ± 0.269
2.277ProGly: 2.277 ± 0.399
0.797ProHis: 0.797 ± 0.245
2.164ProIle: 2.164 ± 0.32
2.619ProLys: 2.619 ± 0.429
2.391ProLeu: 2.391 ± 0.313
1.025ProMet: 1.025 ± 0.189
1.822ProAsn: 1.822 ± 0.455
1.879ProPro: 1.879 ± 0.332
1.936ProGln: 1.936 ± 0.409
1.879ProArg: 1.879 ± 0.328
2.334ProSer: 2.334 ± 0.368
2.448ProThr: 2.448 ± 0.355
2.847ProVal: 2.847 ± 0.405
0.74ProTrp: 0.74 ± 0.202
1.765ProTyr: 1.765 ± 0.289
0.0ProXaa: 0.0 ± 0.0
Gln
3.359GlnAla: 3.359 ± 0.554
0.854GlnCys: 0.854 ± 0.219
1.537GlnAsp: 1.537 ± 0.368
2.619GlnGlu: 2.619 ± 0.361
1.993GlnPhe: 1.993 ± 0.341
2.961GlnGly: 2.961 ± 0.587
1.082GlnHis: 1.082 ± 0.292
1.822GlnIle: 1.822 ± 0.274
1.651GlnLys: 1.651 ± 0.335
3.644GlnLeu: 3.644 ± 0.431
1.366GlnMet: 1.366 ± 0.282
1.594GlnAsn: 1.594 ± 0.345
1.993GlnPro: 1.993 ± 0.345
4.783GlnGln: 4.783 ± 2.79
2.391GlnArg: 2.391 ± 0.335
2.562GlnSer: 2.562 ± 0.395
1.765GlnThr: 1.765 ± 0.292
2.391GlnVal: 2.391 ± 0.429
0.797GlnTrp: 0.797 ± 0.258
1.537GlnTyr: 1.537 ± 0.308
0.0GlnXaa: 0.0 ± 0.0
Arg
4.555ArgAla: 4.555 ± 0.528
0.455ArgCys: 0.455 ± 0.151
3.018ArgAsp: 3.018 ± 0.413
4.156ArgGlu: 4.156 ± 0.566
2.22ArgPhe: 2.22 ± 0.313
4.213ArgGly: 4.213 ± 0.488
1.309ArgHis: 1.309 ± 0.244
3.53ArgIle: 3.53 ± 0.379
3.815ArgLys: 3.815 ± 0.49
4.555ArgLeu: 4.555 ± 0.628
1.366ArgMet: 1.366 ± 0.3
2.619ArgAsn: 2.619 ± 0.407
1.366ArgPro: 1.366 ± 0.283
2.05ArgGln: 2.05 ± 0.395
2.676ArgArg: 2.676 ± 0.499
2.733ArgSer: 2.733 ± 0.358
3.188ArgThr: 3.188 ± 0.368
4.156ArgVal: 4.156 ± 0.473
1.253ArgTrp: 1.253 ± 0.243
1.879ArgTyr: 1.879 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
4.441SerAla: 4.441 ± 0.505
0.968SerCys: 0.968 ± 0.226
2.619SerAsp: 2.619 ± 0.363
2.277SerGlu: 2.277 ± 0.408
1.936SerPhe: 1.936 ± 0.303
4.839SerGly: 4.839 ± 0.698
0.968SerHis: 0.968 ± 0.255
2.619SerIle: 2.619 ± 0.402
3.758SerLys: 3.758 ± 0.492
4.555SerLeu: 4.555 ± 0.443
1.708SerMet: 1.708 ± 0.327
2.448SerAsn: 2.448 ± 0.366
1.993SerPro: 1.993 ± 0.26
1.48SerGln: 1.48 ± 0.259
3.074SerArg: 3.074 ± 0.443
2.79SerSer: 2.79 ± 0.401
2.733SerThr: 2.733 ± 0.361
4.042SerVal: 4.042 ± 0.443
0.74SerTrp: 0.74 ± 0.182
2.334SerTyr: 2.334 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
5.466ThrAla: 5.466 ± 0.542
0.569ThrCys: 0.569 ± 0.205
3.473ThrAsp: 3.473 ± 0.399
4.213ThrGlu: 4.213 ± 0.482
2.164ThrPhe: 2.164 ± 0.306
6.206ThrGly: 6.206 ± 0.637
0.968ThrHis: 0.968 ± 0.231
3.587ThrIle: 3.587 ± 0.391
2.619ThrLys: 2.619 ± 0.321
5.352ThrLeu: 5.352 ± 0.606
0.968ThrMet: 0.968 ± 0.279
2.334ThrAsn: 2.334 ± 0.306
3.701ThrPro: 3.701 ± 0.466
2.448ThrGln: 2.448 ± 0.384
3.302ThrArg: 3.302 ± 0.562
3.188ThrSer: 3.188 ± 0.54
4.099ThrThr: 4.099 ± 0.573
4.327ThrVal: 4.327 ± 0.783
0.911ThrTrp: 0.911 ± 0.225
2.107ThrTyr: 2.107 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
5.693ValAla: 5.693 ± 0.551
0.74ValCys: 0.74 ± 0.186
4.327ValAsp: 4.327 ± 0.418
4.555ValGlu: 4.555 ± 0.415
2.22ValPhe: 2.22 ± 0.293
5.124ValGly: 5.124 ± 0.598
1.708ValHis: 1.708 ± 0.355
3.644ValIle: 3.644 ± 0.448
4.726ValLys: 4.726 ± 0.574
4.612ValLeu: 4.612 ± 0.467
2.05ValMet: 2.05 ± 0.355
2.676ValAsn: 2.676 ± 0.371
3.188ValPro: 3.188 ± 0.427
2.619ValGln: 2.619 ± 0.421
3.644ValArg: 3.644 ± 0.51
3.701ValSer: 3.701 ± 0.528
5.58ValThr: 5.58 ± 0.739
6.661ValVal: 6.661 ± 0.677
1.537ValTrp: 1.537 ± 0.282
2.391ValTyr: 2.391 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
1.139TrpAla: 1.139 ± 0.229
0.399TrpCys: 0.399 ± 0.155
1.253TrpAsp: 1.253 ± 0.373
1.025TrpGlu: 1.025 ± 0.242
0.797TrpPhe: 0.797 ± 0.253
0.455TrpGly: 0.455 ± 0.158
0.342TrpHis: 0.342 ± 0.133
0.399TrpIle: 0.399 ± 0.155
1.196TrpLys: 1.196 ± 0.244
1.253TrpLeu: 1.253 ± 0.301
0.626TrpMet: 0.626 ± 0.162
0.854TrpAsn: 0.854 ± 0.223
0.74TrpPro: 0.74 ± 0.17
0.797TrpGln: 0.797 ± 0.239
0.968TrpArg: 0.968 ± 0.202
1.366TrpSer: 1.366 ± 0.277
0.797TrpThr: 0.797 ± 0.18
1.082TrpVal: 1.082 ± 0.227
0.455TrpTrp: 0.455 ± 0.133
1.082TrpTyr: 1.082 ± 0.263
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.448TyrAla: 2.448 ± 0.373
0.683TyrCys: 0.683 ± 0.173
2.505TyrAsp: 2.505 ± 0.374
2.164TyrGlu: 2.164 ± 0.365
1.082TyrPhe: 1.082 ± 0.25
2.562TyrGly: 2.562 ± 0.321
0.569TyrHis: 0.569 ± 0.156
1.48TyrIle: 1.48 ± 0.284
2.107TyrLys: 2.107 ± 0.37
2.448TyrLeu: 2.448 ± 0.398
0.854TyrMet: 0.854 ± 0.198
1.253TyrAsn: 1.253 ± 0.217
1.594TyrPro: 1.594 ± 0.323
0.74TyrGln: 0.74 ± 0.259
1.822TyrArg: 1.822 ± 0.255
1.651TyrSer: 1.651 ± 0.301
2.334TyrThr: 2.334 ± 0.374
2.904TyrVal: 2.904 ± 0.387
0.683TyrTrp: 0.683 ± 0.181
1.366TyrTyr: 1.366 ± 0.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (17565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski