Amino acid dipeptide frequency for Bacillus phage v_B-Bak6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
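As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and normalizing by the total number of counted dipeptides is an assumption; the exact normalization used for this page is not stated.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: pairs are counted with overlapping windows inside each
    protein (never across protein boundaries) and normalized by the total
    number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes), not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

Here "MKKLA" contributes MK, KK, KL and LA, and "AKK" contributes AK and KK, so KK accounts for 2 of the 6 counted pairs.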

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
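The arithmetic behind the expected value can be sketched as follows. The helper `classify` and its labels are hypothetical, and comparing directly against the uniform expectation of 1/400 is an assumption about how the colouring threshold is chosen.

```python
# Uniform expectation for 20 standard amino acids, in per mille.
EXPECTED_PERMILLE_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
# Uniform expectation if Xaa is counted as a 21st amino acid.
EXPECTED_PERMILLE_441 = 1000.0 / 441  # about 2.27 per mille


def classify(freq_permille, expected=EXPECTED_PERMILLE_400):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
ala_ala = classify(5.152)
cys_cys = classify(0.043)
```

With these values, AlaAla falls above the 2.5‰ expectation and CysCys falls below it.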

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.152 ± 0.761 | 0.724 ± 0.194 | 3.406 ± 0.539 | 4.598 ± 0.635 | 2.384 ± 0.367 | 4.556 ± 0.533 | 0.639 ± 0.142 | 3.875 ± 0.482 | 5.961 ± 0.537 | 4.3 ± 0.555 | 1.405 ± 0.302 | 3.96 ± 0.6 | 1.192 ± 0.238 | 2.086 ± 0.407 | 1.959 ± 0.339 | 2.81 ± 0.43 | 3.704 ± 0.654 | 4.088 ± 0.451 | 0.937 ± 0.214 | 2.512 ± 0.427 | 0.0 ± 0.0 |
| Cys | 0.255 ± 0.108 | 0.043 ± 0.049 | 0.937 ± 0.19 | 0.554 ± 0.177 | 0.468 ± 0.167 | 0.213 ± 0.097 | 0.17 ± 0.09 | 0.639 ± 0.2 | 0.766 ± 0.223 | 0.511 ± 0.196 | 0.426 ± 0.129 | 0.511 ± 0.174 | 0.468 ± 0.15 | 0.213 ± 0.097 | 0.298 ± 0.104 | 0.426 ± 0.166 | 0.383 ± 0.139 | 0.383 ± 0.144 | 0.085 ± 0.056 | 0.298 ± 0.115 | 0.0 ± 0.0 |
| Asp | 3.108 ± 0.425 | 0.511 ± 0.165 | 3.491 ± 0.377 | 4.982 ± 0.554 | 2.895 ± 0.473 | 3.875 ± 0.456 | 1.277 ± 0.251 | 5.195 ± 0.496 | 6.216 ± 0.543 | 4.854 ± 0.421 | 1.405 ± 0.249 | 4.002 ± 0.529 | 1.916 ± 0.354 | 2.086 ± 0.288 | 2.895 ± 0.435 | 3.151 ± 0.399 | 3.321 ± 0.381 | 3.747 ± 0.369 | 0.937 ± 0.182 | 3.449 ± 0.411 | 0.0 ± 0.0 |
| Glu | 4.598 ± 0.636 | 0.596 ± 0.158 | 3.577 ± 0.454 | 6.131 ± 0.748 | 2.938 ± 0.364 | 4.513 ± 0.513 | 0.979 ± 0.229 | 5.706 ± 0.543 | 6.77 ± 0.633 | 6.727 ± 0.667 | 2.938 ± 0.327 | 4.513 ± 0.547 | 1.533 ± 0.279 | 3.406 ± 0.475 | 3.279 ± 0.435 | 4.258 ± 0.362 | 4.343 ± 0.462 | 5.322 ± 0.482 | 1.15 ± 0.221 | 3.023 ± 0.387 | 0.0 ± 0.0 |
| Phe | 2.725 ± 0.329 | 0.468 ± 0.168 | 3.364 ± 0.366 | 3.789 ± 0.427 | 1.533 ± 0.298 | 2.895 ± 0.393 | 0.554 ± 0.141 | 3.023 ± 0.35 | 3.747 ± 0.381 | 3.279 ± 0.49 | 1.15 ± 0.224 | 2.768 ± 0.294 | 1.15 ± 0.212 | 1.064 ± 0.197 | 1.618 ± 0.27 | 2.47 ± 0.335 | 3.364 ± 0.442 | 2.64 ± 0.324 | 0.639 ± 0.185 | 1.788 ± 0.279 | 0.0 ± 0.0 |
| Gly | 3.236 ± 0.597 | 0.596 ± 0.199 | 2.725 ± 0.356 | 4.3 ± 0.382 | 2.895 ± 0.421 | 5.024 ± 0.739 | 1.32 ± 0.257 | 4.258 ± 0.37 | 5.237 ± 0.457 | 5.663 ± 0.484 | 2.129 ± 0.368 | 3.789 ± 0.468 | 0.085 ± 0.05 | 2.895 ± 0.783 | 2.725 ± 0.308 | 3.875 ± 0.456 | 4.386 ± 0.688 | 5.024 ± 0.65 | 1.022 ± 0.194 | 3.151 ± 0.424 | 0.0 ± 0.0 |
| His | 0.766 ± 0.179 | 0.128 ± 0.083 | 1.064 ± 0.224 | 0.852 ± 0.218 | 0.937 ± 0.22 | 1.022 ± 0.161 | 0.17 ± 0.086 | 1.107 ± 0.227 | 1.405 ± 0.26 | 1.703 ± 0.317 | 0.383 ± 0.139 | 1.32 ± 0.284 | 0.511 ± 0.152 | 0.554 ± 0.145 | 0.341 ± 0.127 | 0.937 ± 0.204 | 0.937 ± 0.195 | 1.064 ± 0.222 | 0.255 ± 0.108 | 0.724 ± 0.193 | 0.0 ± 0.0 |
| Ile | 4.13 ± 0.447 | 0.681 ± 0.211 | 6.259 ± 0.593 | 5.62 ± 0.533 | 2.64 ± 0.291 | 4.854 ± 0.412 | 1.32 ± 0.265 | 4.045 ± 0.577 | 7.196 ± 0.676 | 4.258 ± 0.38 | 1.959 ± 0.299 | 4.173 ± 0.385 | 1.831 ± 0.243 | 2.384 ± 0.316 | 2.47 ± 0.325 | 3.832 ± 0.491 | 4.684 ± 0.602 | 4.726 ± 0.546 | 0.383 ± 0.116 | 2.682 ± 0.347 | 0.0 ± 0.0 |
| Lys | 4.939 ± 0.509 | 0.894 ± 0.188 | 5.918 ± 0.537 | 8.431 ± 0.792 | 3.832 ± 0.27 | 4.811 ± 0.426 | 1.064 ± 0.232 | 6.515 ± 0.516 | 6.94 ± 0.71 | 7.281 ± 0.586 | 3.108 ± 0.386 | 4.513 ± 0.478 | 2.895 ± 0.386 | 2.98 ± 0.364 | 3.491 ± 0.357 | 4.471 ± 0.386 | 4.386 ± 0.397 | 7.153 ± 0.666 | 1.064 ± 0.18 | 4.258 ± 0.396 | 0.0 ± 0.0 |
| Leu | 5.195 ± 0.525 | 0.468 ± 0.124 | 5.237 ± 0.429 | 6.515 ± 0.624 | 3.023 ± 0.415 | 4.343 ± 0.394 | 1.49 ± 0.222 | 4.684 ± 0.54 | 7.451 ± 0.516 | 4.811 ± 0.53 | 2.086 ± 0.331 | 5.918 ± 0.457 | 2.682 ± 0.442 | 2.853 ± 0.398 | 3.193 ± 0.354 | 4.045 ± 0.456 | 5.578 ± 0.481 | 3.917 ± 0.464 | 1.192 ± 0.238 | 2.427 ± 0.41 | 0.0 ± 0.0 |
| Met | 1.831 ± 0.3 | 0.255 ± 0.121 | 1.575 ± 0.259 | 2.129 ± 0.314 | 1.32 ± 0.262 | 1.575 ± 0.332 | 0.468 ± 0.145 | 2.597 ± 0.35 | 3.279 ± 0.344 | 2.001 ± 0.306 | 0.809 ± 0.209 | 1.703 ± 0.346 | 0.724 ± 0.209 | 1.064 ± 0.206 | 1.15 ± 0.197 | 2.001 ± 0.284 | 1.575 ± 0.232 | 2.257 ± 0.306 | 0.213 ± 0.087 | 1.49 ± 0.252 | 0.0 ± 0.0 |
| Asn | 2.98 ± 0.362 | 0.383 ± 0.17 | 3.704 ± 0.385 | 4.386 ± 0.471 | 2.384 ± 0.32 | 4.513 ± 0.493 | 0.894 ± 0.21 | 4.3 ± 0.349 | 6.004 ± 0.553 | 5.407 ± 0.395 | 2.129 ± 0.293 | 3.875 ± 0.443 | 2.044 ± 0.389 | 2.47 ± 0.51 | 2.597 ± 0.449 | 3.704 ± 0.545 | 2.81 ± 0.455 | 3.789 ± 0.442 | 0.681 ± 0.197 | 2.427 ± 0.346 | 0.0 ± 0.0 |
| Pro | 1.788 ± 0.274 | 0.17 ± 0.087 | 1.575 ± 0.347 | 2.214 ± 0.293 | 1.831 ± 0.334 | 0.0 ± 0.0 | 0.298 ± 0.092 | 2.129 ± 0.313 | 2.384 ± 0.356 | 1.831 ± 0.331 | 1.022 ± 0.217 | 2.086 ± 0.399 | 0.341 ± 0.15 | 1.064 ± 0.225 | 0.511 ± 0.12 | 1.831 ± 0.348 | 1.746 ± 0.375 | 1.916 ± 0.269 | 0.213 ± 0.085 | 1.235 ± 0.261 | 0.0 ± 0.0 |
| Gln | 2.129 ± 0.472 | 0.255 ± 0.103 | 1.32 ± 0.232 | 2.853 ± 0.424 | 1.192 ± 0.217 | 2.512 ± 0.456 | 0.468 ± 0.179 | 2.427 ± 0.335 | 2.81 ± 0.42 | 3.534 ± 0.384 | 1.363 ± 0.252 | 2.257 ± 0.395 | 1.064 ± 0.327 | 2.938 ± 1.35 | 1.703 ± 0.294 | 1.788 ± 0.271 | 2.001 ± 0.459 | 2.044 ± 0.292 | 0.426 ± 0.143 | 1.363 ± 0.216 | 0.0 ± 0.0 |
| Arg | 2.257 ± 0.363 | 0.255 ± 0.105 | 2.555 ± 0.339 | 2.555 ± 0.366 | 2.086 ± 0.292 | 2.512 ± 0.354 | 0.852 ± 0.219 | 2.853 ± 0.332 | 3.364 ± 0.358 | 2.895 ± 0.398 | 1.235 ± 0.222 | 2.299 ± 0.287 | 1.107 ± 0.238 | 1.363 ± 0.22 | 1.405 ± 0.27 | 1.873 ± 0.217 | 2.129 ± 0.249 | 2.512 ± 0.354 | 0.383 ± 0.124 | 1.703 ± 0.294 | 0.0 ± 0.0 |
| Ser | 3.789 ± 0.576 | 0.511 ± 0.149 | 3.151 ± 0.376 | 3.151 ± 0.451 | 3.534 ± 0.442 | 5.067 ± 0.532 | 1.022 ± 0.24 | 3.832 ± 0.465 | 4.3 ± 0.397 | 4.854 ± 0.442 | 1.873 ± 0.306 | 2.597 ± 0.355 | 1.49 ± 0.297 | 1.788 ± 0.314 | 1.235 ± 0.25 | 3.406 ± 0.539 | 3.108 ± 0.342 | 2.895 ± 0.369 | 0.766 ± 0.255 | 2.768 ± 0.342 | 0.0 ± 0.0 |
| Thr | 4.428 ± 0.881 | 0.255 ± 0.12 | 4.088 ± 0.4 | 3.875 ± 0.433 | 2.512 ± 0.356 | 4.939 ± 0.605 | 1.022 ± 0.207 | 5.322 ± 0.488 | 4.726 ± 0.429 | 4.258 ± 0.48 | 1.363 ± 0.274 | 3.449 ± 0.417 | 2.044 ± 0.352 | 1.746 ± 0.352 | 2.044 ± 0.25 | 3.449 ± 0.545 | 4.641 ± 0.686 | 4.556 ± 0.51 | 0.341 ± 0.117 | 2.512 ± 0.31 | 0.0 ± 0.0 |
| Val | 3.789 ± 0.485 | 0.426 ± 0.154 | 4.641 ± 0.463 | 4.854 ± 0.363 | 2.853 ± 0.373 | 3.789 ± 0.406 | 1.15 ± 0.254 | 4.045 ± 0.447 | 6.557 ± 0.57 | 4.513 ± 0.455 | 1.703 ± 0.396 | 4.045 ± 0.411 | 2.342 ± 0.3 | 2.129 ± 0.372 | 2.938 ± 0.358 | 3.577 ± 0.329 | 4.939 ± 0.681 | 4.726 ± 0.482 | 0.937 ± 0.208 | 2.64 ± 0.385 | 0.0 ± 0.0 |
| Trp | 0.511 ± 0.208 | 0.128 ± 0.075 | 1.107 ± 0.205 | 0.724 ± 0.167 | 0.809 ± 0.208 | 0.681 ± 0.176 | 0.298 ± 0.097 | 0.937 ± 0.258 | 0.681 ± 0.165 | 0.766 ± 0.235 | 0.213 ± 0.087 | 0.852 ± 0.194 | 0.043 ± 0.033 | 0.426 ± 0.148 | 0.426 ± 0.146 | 0.766 ± 0.265 | 0.937 ± 0.196 | 1.064 ± 0.21 | 0.085 ± 0.07 | 0.681 ± 0.176 | 0.0 ± 0.0 |
| Tyr | 2.64 ± 0.407 | 0.298 ± 0.107 | 3.619 ± 0.404 | 3.832 ± 0.413 | 1.959 ± 0.341 | 2.64 ± 0.326 | 0.724 ± 0.177 | 2.64 ± 0.313 | 3.151 ± 0.399 | 3.662 ± 0.464 | 1.277 ± 0.238 | 2.853 ± 0.394 | 0.724 ± 0.168 | 0.937 ± 0.182 | 1.916 ± 0.302 | 2.47 ± 0.322 | 2.597 ± 0.346 | 2.768 ± 0.348 | 0.426 ± 0.13 | 2.47 ± 0.477 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 120 proteins (23487 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
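A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions; the source only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across the bootstrap resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), not taken from the phage proteome.
toy = ["MKKLA", "AKK", "GGGG"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.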

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski