Amino acid dipepetide frequency for Bordetella phage vB_BbrM_PHB04

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.817AlaAla: 16.817 ± 0.891
0.878AlaCys: 0.878 ± 0.215
7.829AlaAsp: 7.829 ± 0.577
7.864AlaGlu: 7.864 ± 0.784
3.265AlaPhe: 3.265 ± 0.345
10.849AlaGly: 10.849 ± 0.773
2.387AlaHis: 2.387 ± 0.289
4.88AlaIle: 4.88 ± 0.379
5.547AlaLys: 5.547 ± 0.681
10.743AlaLeu: 10.743 ± 0.504
3.827AlaMet: 3.827 ± 0.379
3.3AlaAsn: 3.3 ± 0.308
6.039AlaPro: 6.039 ± 0.46
5.196AlaGln: 5.196 ± 0.458
7.759AlaArg: 7.759 ± 0.569
5.688AlaSer: 5.688 ± 0.462
5.056AlaThr: 5.056 ± 0.569
7.408AlaVal: 7.408 ± 0.549
1.194AlaTrp: 1.194 ± 0.222
2.528AlaTyr: 2.528 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
0.737CysAla: 0.737 ± 0.166
0.035CysCys: 0.035 ± 0.039
0.562CysAsp: 0.562 ± 0.149
0.492CysGlu: 0.492 ± 0.131
0.246CysPhe: 0.246 ± 0.091
1.088CysGly: 1.088 ± 0.194
0.176CysHis: 0.176 ± 0.083
0.421CysIle: 0.421 ± 0.122
0.246CysLys: 0.246 ± 0.115
0.667CysLeu: 0.667 ± 0.174
0.211CysMet: 0.211 ± 0.09
0.281CysAsn: 0.281 ± 0.097
0.527CysPro: 0.527 ± 0.184
0.281CysGln: 0.281 ± 0.104
0.667CysArg: 0.667 ± 0.167
0.632CysSer: 0.632 ± 0.191
0.386CysThr: 0.386 ± 0.144
0.667CysVal: 0.667 ± 0.171
0.176CysTrp: 0.176 ± 0.087
0.386CysTyr: 0.386 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
8.531AspAla: 8.531 ± 0.659
0.456AspCys: 0.456 ± 0.15
4.775AspAsp: 4.775 ± 0.557
4.95AspGlu: 4.95 ± 0.553
1.931AspPhe: 1.931 ± 0.271
6.987AspGly: 6.987 ± 0.59
0.772AspHis: 0.772 ± 0.212
2.528AspIle: 2.528 ± 0.308
2.528AspLys: 2.528 ± 0.39
5.968AspLeu: 5.968 ± 0.474
1.615AspMet: 1.615 ± 0.222
1.51AspAsn: 1.51 ± 0.223
3.651AspPro: 3.651 ± 0.368
2.177AspGln: 2.177 ± 0.294
4.424AspArg: 4.424 ± 0.483
2.142AspSer: 2.142 ± 0.267
3.16AspThr: 3.16 ± 0.329
4.002AspVal: 4.002 ± 0.5
1.369AspTrp: 1.369 ± 0.255
2.177AspTyr: 2.177 ± 0.254
0.0AspXaa: 0.0 ± 0.0
Glu
7.619GluAla: 7.619 ± 0.662
0.562GluCys: 0.562 ± 0.141
3.09GluAsp: 3.09 ± 0.333
2.458GluGlu: 2.458 ± 0.291
2.247GluPhe: 2.247 ± 0.328
4.002GluGly: 4.002 ± 0.432
1.896GluHis: 1.896 ± 0.289
3.406GluIle: 3.406 ± 0.396
2.809GluLys: 2.809 ± 0.286
6.39GluLeu: 6.39 ± 0.464
1.334GluMet: 1.334 ± 0.196
1.966GluAsn: 1.966 ± 0.32
2.633GluPro: 2.633 ± 0.299
3.019GluGln: 3.019 ± 0.305
5.337GluArg: 5.337 ± 0.46
3.476GluSer: 3.476 ± 0.299
2.809GluThr: 2.809 ± 0.382
3.546GluVal: 3.546 ± 0.354
1.123GluTrp: 1.123 ± 0.196
1.685GluTyr: 1.685 ± 0.236
0.0GluXaa: 0.0 ± 0.0
Phe
2.879PheAla: 2.879 ± 0.34
0.527PheCys: 0.527 ± 0.139
2.738PheAsp: 2.738 ± 0.3
1.791PheGlu: 1.791 ± 0.229
1.088PhePhe: 1.088 ± 0.203
3.3PheGly: 3.3 ± 0.375
0.386PheHis: 0.386 ± 0.106
1.299PheIle: 1.299 ± 0.21
1.439PheLys: 1.439 ± 0.196
2.282PheLeu: 2.282 ± 0.267
0.807PheMet: 0.807 ± 0.197
1.475PheAsn: 1.475 ± 0.199
1.791PhePro: 1.791 ± 0.325
0.983PheGln: 0.983 ± 0.181
2.493PheArg: 2.493 ± 0.313
1.861PheSer: 1.861 ± 0.266
2.247PheThr: 2.247 ± 0.281
2.352PheVal: 2.352 ± 0.27
0.386PheTrp: 0.386 ± 0.103
0.807PheTyr: 0.807 ± 0.16
0.0PheXaa: 0.0 ± 0.0
Gly
9.058GlyAla: 9.058 ± 0.635
0.597GlyCys: 0.597 ± 0.133
5.723GlyAsp: 5.723 ± 0.488
4.915GlyGlu: 4.915 ± 0.458
2.703GlyPhe: 2.703 ± 0.306
7.303GlyGly: 7.303 ± 0.816
1.51GlyHis: 1.51 ± 0.238
3.792GlyIle: 3.792 ± 0.323
4.424GlyLys: 4.424 ± 0.456
6.46GlyLeu: 6.46 ± 0.393
2.879GlyMet: 2.879 ± 0.266
3.054GlyAsn: 3.054 ± 0.416
2.809GlyPro: 2.809 ± 0.324
3.546GlyGln: 3.546 ± 0.395
5.898GlyArg: 5.898 ± 0.531
4.178GlySer: 4.178 ± 0.386
5.126GlyThr: 5.126 ± 0.546
5.372GlyVal: 5.372 ± 0.425
1.404GlyTrp: 1.404 ± 0.19
2.387GlyTyr: 2.387 ± 0.254
0.0GlyXaa: 0.0 ± 0.0
His
2.177HisAla: 2.177 ± 0.365
0.386HisCys: 0.386 ± 0.149
1.369HisAsp: 1.369 ± 0.225
1.475HisGlu: 1.475 ± 0.241
0.702HisPhe: 0.702 ± 0.172
1.685HisGly: 1.685 ± 0.266
0.702HisHis: 0.702 ± 0.175
0.983HisIle: 0.983 ± 0.178
0.983HisLys: 0.983 ± 0.185
1.615HisLeu: 1.615 ± 0.234
0.456HisMet: 0.456 ± 0.111
0.492HisAsn: 0.492 ± 0.123
1.053HisPro: 1.053 ± 0.177
0.702HisGln: 0.702 ± 0.186
1.369HisArg: 1.369 ± 0.275
1.053HisSer: 1.053 ± 0.185
1.018HisThr: 1.018 ± 0.167
1.475HisVal: 1.475 ± 0.306
0.105HisTrp: 0.105 ± 0.062
0.562HisTyr: 0.562 ± 0.128
0.0HisXaa: 0.0 ± 0.0
Ile
5.477IleAla: 5.477 ± 0.349
0.386IleCys: 0.386 ± 0.14
3.019IleAsp: 3.019 ± 0.34
3.3IleGlu: 3.3 ± 0.39
1.123IlePhe: 1.123 ± 0.156
3.827IleGly: 3.827 ± 0.347
1.334IleHis: 1.334 ± 0.224
1.299IleIle: 1.299 ± 0.256
2.071IleLys: 2.071 ± 0.31
2.598IleLeu: 2.598 ± 0.287
1.194IleMet: 1.194 ± 0.211
1.58IleAsn: 1.58 ± 0.249
1.966IlePro: 1.966 ± 0.249
2.036IleGln: 2.036 ± 0.3
2.879IleArg: 2.879 ± 0.361
2.142IleSer: 2.142 ± 0.301
3.019IleThr: 3.019 ± 0.352
3.722IleVal: 3.722 ± 0.311
0.562IleTrp: 0.562 ± 0.15
1.229IleTyr: 1.229 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
5.793LysAla: 5.793 ± 0.479
0.492LysCys: 0.492 ± 0.15
2.844LysAsp: 2.844 ± 0.37
2.352LysGlu: 2.352 ± 0.333
1.229LysPhe: 1.229 ± 0.208
2.984LysGly: 2.984 ± 0.342
1.159LysHis: 1.159 ± 0.216
2.001LysIle: 2.001 ± 0.252
2.703LysLys: 2.703 ± 0.348
3.406LysLeu: 3.406 ± 0.364
1.194LysMet: 1.194 ± 0.235
1.58LysAsn: 1.58 ± 0.284
3.195LysPro: 3.195 ± 0.321
1.896LysGln: 1.896 ± 0.308
3.476LysArg: 3.476 ± 0.305
2.142LysSer: 2.142 ± 0.25
2.422LysThr: 2.422 ± 0.326
3.406LysVal: 3.406 ± 0.409
0.843LysTrp: 0.843 ± 0.158
1.194LysTyr: 1.194 ± 0.229
0.0LysXaa: 0.0 ± 0.0
Leu
9.198LeuAla: 9.198 ± 0.6
0.597LeuCys: 0.597 ± 0.165
5.056LeuAsp: 5.056 ± 0.436
5.301LeuGlu: 5.301 ± 0.542
2.914LeuPhe: 2.914 ± 0.286
6.46LeuGly: 6.46 ± 0.54
1.439LeuHis: 1.439 ± 0.213
3.406LeuIle: 3.406 ± 0.39
4.002LeuLys: 4.002 ± 0.374
5.547LeuLeu: 5.547 ± 0.481
2.177LeuMet: 2.177 ± 0.237
2.984LeuAsn: 2.984 ± 0.345
4.073LeuPro: 4.073 ± 0.372
3.441LeuGln: 3.441 ± 0.368
6.846LeuArg: 6.846 ± 0.428
4.74LeuSer: 4.74 ± 0.413
4.564LeuThr: 4.564 ± 0.358
5.196LeuVal: 5.196 ± 0.373
1.123LeuTrp: 1.123 ± 0.204
1.755LeuTyr: 1.755 ± 0.277
0.0LeuXaa: 0.0 ± 0.0
Met
2.914MetAla: 2.914 ± 0.321
0.211MetCys: 0.211 ± 0.087
1.51MetAsp: 1.51 ± 0.222
1.439MetGlu: 1.439 ± 0.237
0.807MetPhe: 0.807 ± 0.153
1.896MetGly: 1.896 ± 0.284
0.316MetHis: 0.316 ± 0.092
1.439MetIle: 1.439 ± 0.227
0.983MetLys: 0.983 ± 0.172
2.528MetLeu: 2.528 ± 0.283
0.527MetMet: 0.527 ± 0.153
1.229MetAsn: 1.229 ± 0.202
1.826MetPro: 1.826 ± 0.263
1.299MetGln: 1.299 ± 0.214
2.563MetArg: 2.563 ± 0.254
1.58MetSer: 1.58 ± 0.234
2.071MetThr: 2.071 ± 0.277
1.439MetVal: 1.439 ± 0.261
0.176MetTrp: 0.176 ± 0.075
0.351MetTyr: 0.351 ± 0.11
0.0MetXaa: 0.0 ± 0.0
Asn
3.3AsnAla: 3.3 ± 0.31
0.281AsnCys: 0.281 ± 0.099
2.036AsnAsp: 2.036 ± 0.255
1.861AsnGlu: 1.861 ± 0.252
1.123AsnPhe: 1.123 ± 0.206
3.3AsnGly: 3.3 ± 0.379
0.562AsnHis: 0.562 ± 0.168
1.369AsnIle: 1.369 ± 0.3
1.369AsnLys: 1.369 ± 0.214
2.177AsnLeu: 2.177 ± 0.298
0.632AsnMet: 0.632 ± 0.13
0.913AsnAsn: 0.913 ± 0.145
2.598AsnPro: 2.598 ± 0.335
1.404AsnGln: 1.404 ± 0.23
2.774AsnArg: 2.774 ± 0.271
1.51AsnSer: 1.51 ± 0.223
1.404AsnThr: 1.404 ± 0.213
2.633AsnVal: 2.633 ± 0.365
0.702AsnTrp: 0.702 ± 0.156
0.913AsnTyr: 0.913 ± 0.205
0.0AsnXaa: 0.0 ± 0.0
Pro
7.935ProAla: 7.935 ± 0.507
0.351ProCys: 0.351 ± 0.108
3.686ProAsp: 3.686 ± 0.363
3.019ProGlu: 3.019 ± 0.365
1.545ProPhe: 1.545 ± 0.262
4.529ProGly: 4.529 ± 0.335
1.299ProHis: 1.299 ± 0.218
1.896ProIle: 1.896 ± 0.267
2.142ProLys: 2.142 ± 0.272
3.23ProLeu: 3.23 ± 0.363
0.983ProMet: 0.983 ± 0.158
1.334ProAsn: 1.334 ± 0.199
2.879ProPro: 2.879 ± 0.443
1.475ProGln: 1.475 ± 0.242
2.598ProArg: 2.598 ± 0.317
2.633ProSer: 2.633 ± 0.27
3.511ProThr: 3.511 ± 0.34
4.143ProVal: 4.143 ± 0.411
0.772ProTrp: 0.772 ± 0.157
1.088ProTyr: 1.088 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
4.74GlnAla: 4.74 ± 0.374
0.386GlnCys: 0.386 ± 0.146
2.036GlnAsp: 2.036 ± 0.26
2.036GlnGlu: 2.036 ± 0.296
1.861GlnPhe: 1.861 ± 0.261
2.177GlnGly: 2.177 ± 0.286
0.807GlnHis: 0.807 ± 0.153
2.036GlnIle: 2.036 ± 0.237
2.177GlnLys: 2.177 ± 0.308
2.598GlnLeu: 2.598 ± 0.308
1.264GlnMet: 1.264 ± 0.215
1.088GlnAsn: 1.088 ± 0.218
2.142GlnPro: 2.142 ± 0.283
2.668GlnGln: 2.668 ± 0.282
3.897GlnArg: 3.897 ± 0.607
2.001GlnSer: 2.001 ± 0.257
2.001GlnThr: 2.001 ± 0.275
2.458GlnVal: 2.458 ± 0.289
0.527GlnTrp: 0.527 ± 0.128
1.369GlnTyr: 1.369 ± 0.195
0.0GlnXaa: 0.0 ± 0.0
Arg
9.163ArgAla: 9.163 ± 0.56
0.702ArgCys: 0.702 ± 0.199
5.652ArgAsp: 5.652 ± 0.376
4.494ArgGlu: 4.494 ± 0.524
2.422ArgPhe: 2.422 ± 0.258
4.494ArgGly: 4.494 ± 0.415
1.72ArgHis: 1.72 ± 0.239
3.932ArgIle: 3.932 ± 0.357
3.792ArgLys: 3.792 ± 0.438
5.617ArgLeu: 5.617 ± 0.452
1.826ArgMet: 1.826 ± 0.281
2.352ArgAsn: 2.352 ± 0.276
3.23ArgPro: 3.23 ± 0.445
2.984ArgGln: 2.984 ± 0.377
5.547ArgArg: 5.547 ± 0.477
3.23ArgSer: 3.23 ± 0.276
3.441ArgThr: 3.441 ± 0.393
4.037ArgVal: 4.037 ± 0.353
1.439ArgTrp: 1.439 ± 0.237
2.107ArgTyr: 2.107 ± 0.272
0.0ArgXaa: 0.0 ± 0.0
Ser
5.301SerAla: 5.301 ± 0.477
0.386SerCys: 0.386 ± 0.139
3.019SerAsp: 3.019 ± 0.31
3.16SerGlu: 3.16 ± 0.386
1.685SerPhe: 1.685 ± 0.273
5.337SerGly: 5.337 ± 0.544
0.737SerHis: 0.737 ± 0.146
2.352SerIle: 2.352 ± 0.242
2.387SerLys: 2.387 ± 0.321
4.037SerLeu: 4.037 ± 0.349
1.615SerMet: 1.615 ± 0.218
1.861SerAsn: 1.861 ± 0.238
2.352SerPro: 2.352 ± 0.271
1.369SerGln: 1.369 ± 0.184
3.3SerArg: 3.3 ± 0.387
2.528SerSer: 2.528 ± 0.325
2.844SerThr: 2.844 ± 0.342
3.827SerVal: 3.827 ± 0.404
0.597SerTrp: 0.597 ± 0.152
1.123SerTyr: 1.123 ± 0.197
0.0SerXaa: 0.0 ± 0.0
Thr
6.6ThrAla: 6.6 ± 0.549
0.351ThrCys: 0.351 ± 0.125
3.511ThrAsp: 3.511 ± 0.358
2.809ThrGlu: 2.809 ± 0.348
2.142ThrPhe: 2.142 ± 0.245
5.301ThrGly: 5.301 ± 0.453
0.983ThrHis: 0.983 ± 0.204
2.387ThrIle: 2.387 ± 0.262
2.352ThrLys: 2.352 ± 0.343
5.161ThrLeu: 5.161 ± 0.424
1.615ThrMet: 1.615 ± 0.224
1.72ThrAsn: 1.72 ± 0.288
3.511ThrPro: 3.511 ± 0.408
1.615ThrGln: 1.615 ± 0.267
2.844ThrArg: 2.844 ± 0.334
2.774ThrSer: 2.774 ± 0.362
3.125ThrThr: 3.125 ± 0.431
4.353ThrVal: 4.353 ± 0.38
0.737ThrTrp: 0.737 ± 0.146
1.475ThrTyr: 1.475 ± 0.257
0.0ThrXaa: 0.0 ± 0.0
Val
7.232ValAla: 7.232 ± 0.56
0.807ValCys: 0.807 ± 0.199
4.845ValAsp: 4.845 ± 0.362
5.091ValGlu: 5.091 ± 0.454
2.142ValPhe: 2.142 ± 0.325
4.389ValGly: 4.389 ± 0.413
1.264ValHis: 1.264 ± 0.209
3.265ValIle: 3.265 ± 0.371
2.914ValLys: 2.914 ± 0.357
5.723ValLeu: 5.723 ± 0.452
1.826ValMet: 1.826 ± 0.251
2.177ValAsn: 2.177 ± 0.25
3.511ValPro: 3.511 ± 0.306
2.914ValGln: 2.914 ± 0.295
4.459ValArg: 4.459 ± 0.425
3.265ValSer: 3.265 ± 0.31
4.353ValThr: 4.353 ± 0.368
4.002ValVal: 4.002 ± 0.384
0.843ValTrp: 0.843 ± 0.207
1.931ValTyr: 1.931 ± 0.258
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.24
0.07TrpCys: 0.07 ± 0.051
0.737TrpAsp: 0.737 ± 0.126
1.053TrpGlu: 1.053 ± 0.181
0.632TrpPhe: 0.632 ± 0.168
0.878TrpGly: 0.878 ± 0.199
0.456TrpHis: 0.456 ± 0.113
0.807TrpIle: 0.807 ± 0.164
0.492TrpLys: 0.492 ± 0.152
1.229TrpLeu: 1.229 ± 0.213
0.632TrpMet: 0.632 ± 0.144
0.667TrpAsn: 0.667 ± 0.164
0.562TrpPro: 0.562 ± 0.121
0.316TrpGln: 0.316 ± 0.108
1.334TrpArg: 1.334 ± 0.196
0.983TrpSer: 0.983 ± 0.219
1.053TrpThr: 1.053 ± 0.193
0.913TrpVal: 0.913 ± 0.163
0.176TrpTrp: 0.176 ± 0.075
0.456TrpTyr: 0.456 ± 0.12
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.317TyrAla: 2.317 ± 0.271
0.421TyrCys: 0.421 ± 0.127
1.615TyrAsp: 1.615 ± 0.206
1.545TyrGlu: 1.545 ± 0.278
1.123TyrPhe: 1.123 ± 0.203
2.352TyrGly: 2.352 ± 0.331
0.456TyrHis: 0.456 ± 0.129
1.264TyrIle: 1.264 ± 0.19
0.948TyrLys: 0.948 ± 0.213
2.844TyrLeu: 2.844 ± 0.313
0.456TyrMet: 0.456 ± 0.139
1.299TyrAsn: 1.299 ± 0.24
0.702TyrPro: 0.702 ± 0.125
0.948TyrGln: 0.948 ± 0.169
1.755TyrArg: 1.755 ± 0.254
1.264TyrSer: 1.264 ± 0.182
1.755TyrThr: 1.755 ± 0.229
2.001TyrVal: 2.001 ± 0.281
0.456TyrTrp: 0.456 ± 0.16
0.843TyrTyr: 0.843 ± 0.16
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 124 proteins (28484 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski