Amino acid dipeptide frequency for Burkholderia virus BcepF1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
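As a minimal sketch of what such a calculation could look like, the Python snippet below counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_permille`, the toy sequences, and the exact normalization are assumptions made for illustration; they are not taken from the Proteome-pI pipeline, which may count or normalize differently.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not real BcepF1 sequences.
toy = ["MAAK", "MKAA"]
freqs = dipeptide_permille(toy)
```

Here the toy input yields six pairs in total, two of which are `AA`, so `freqs["AA"]` is one third expressed in per mille.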

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
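The uniform baseline and the over/under classification can be sketched in a few lines of Python. The `classify` helper and its strict greater-than/less-than rule are assumptions for illustration; the page does not state the exact threshold used for the red and blue marking. The three example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 %, i.e. 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "as expected"


# Values (in per mille) copied from the table for this proteome.
examples = {"AlaAla": 13.691, "CysCys": 0.177, "AspPhe": 2.65}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule AlaAla and AspPhe come out as overrepresented and CysCys as underrepresented.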

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 13.691‰ corresponds to 0.013691, or 1.3691%).

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Values in ‰ (value ± error).

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.691 ± 1.297 | 1.148 ± 0.257 | 6.095 ± 0.524 | 6.095 ± 0.833 | 3.754 ± 0.422 | 7.552 ± 0.709 | 1.502 ± 0.294 | 6.139 ± 0.446 | 6.492 ± 0.565 | 6.801 ± 0.562 | 2.871 ± 0.343 | 4.637 ± 0.644 | 4.151 ± 0.424 | 4.284 ± 0.43 | 6.448 ± 0.563 | 5.918 ± 0.562 | 5.255 ± 0.622 | 5.653 ± 0.472 | 1.899 ± 0.298 | 2.341 ± 0.29 | 0.0 ± 0.0 |
| Cys | 1.59 ± 0.307 | 0.177 ± 0.098 | 0.795 ± 0.201 | 0.795 ± 0.199 | 0.486 ± 0.152 | 0.927 ± 0.236 | 0.309 ± 0.14 | 0.442 ± 0.136 | 0.221 ± 0.109 | 0.53 ± 0.145 | 0.221 ± 0.107 | 0.397 ± 0.113 | 0.574 ± 0.147 | 0.265 ± 0.105 | 1.016 ± 0.256 | 0.442 ± 0.133 | 0.662 ± 0.163 | 0.751 ± 0.196 | 0.309 ± 0.129 | 0.309 ± 0.117 | 0.0 ± 0.0 |
| Asp | 7.243 ± 0.544 | 0.751 ± 0.181 | 3.931 ± 0.472 | 4.858 ± 0.502 | 2.65 ± 0.324 | 4.063 ± 0.368 | 1.413 ± 0.271 | 2.915 ± 0.315 | 3.136 ± 0.362 | 4.019 ± 0.454 | 2.12 ± 0.282 | 1.987 ± 0.258 | 3.047 ± 0.316 | 1.899 ± 0.275 | 3.136 ± 0.443 | 3.533 ± 0.337 | 2.959 ± 0.469 | 3.754 ± 0.365 | 0.839 ± 0.244 | 1.767 ± 0.278 | 0.0 ± 0.0 |
| Glu | 7.022 ± 0.742 | 0.486 ± 0.16 | 3.003 ± 0.437 | 3.489 ± 0.487 | 3.533 ± 0.361 | 4.681 ± 0.65 | 1.325 ± 0.24 | 4.77 ± 0.51 | 3.886 ± 0.539 | 5.741 ± 0.519 | 1.722 ± 0.263 | 3.224 ± 0.383 | 2.429 ± 0.292 | 2.517 ± 0.306 | 4.063 ± 0.539 | 3.003 ± 0.281 | 3.003 ± 0.434 | 4.196 ± 0.466 | 1.413 ± 0.31 | 1.546 ± 0.258 | 0.0 ± 0.0 |
| Phe | 3.312 ± 0.395 | 0.53 ± 0.148 | 3.533 ± 0.424 | 3.003 ± 0.348 | 1.546 ± 0.278 | 4.019 ± 0.459 | 0.707 ± 0.212 | 1.413 ± 0.228 | 1.899 ± 0.275 | 3.356 ± 0.381 | 1.06 ± 0.221 | 2.032 ± 0.3 | 1.237 ± 0.253 | 1.502 ± 0.234 | 2.076 ± 0.356 | 2.517 ± 0.342 | 2.429 ± 0.355 | 3.71 ± 0.404 | 0.618 ± 0.168 | 1.237 ± 0.23 | 0.0 ± 0.0 |
| Gly | 7.42 ± 0.914 | 0.707 ± 0.164 | 5.079 ± 0.593 | 4.991 ± 0.519 | 3.975 ± 0.351 | 5.83 ± 0.762 | 1.148 ± 0.253 | 4.593 ± 0.488 | 4.505 ± 0.467 | 3.975 ± 0.479 | 1.943 ± 0.302 | 2.959 ± 0.316 | 2.252 ± 0.366 | 3.356 ± 0.33 | 3.842 ± 0.376 | 4.372 ± 0.487 | 3.798 ± 0.663 | 5.211 ± 0.525 | 1.546 ± 0.209 | 1.943 ± 0.29 | 0.0 ± 0.0 |
| His | 1.855 ± 0.317 | 0.309 ± 0.118 | 0.839 ± 0.173 | 1.413 ± 0.278 | 0.662 ± 0.166 | 0.972 ± 0.227 | 0.486 ± 0.234 | 1.06 ± 0.261 | 0.751 ± 0.213 | 1.104 ± 0.217 | 0.53 ± 0.159 | 0.442 ± 0.141 | 1.016 ± 0.202 | 0.707 ± 0.173 | 1.502 ± 0.266 | 0.662 ± 0.152 | 0.883 ± 0.177 | 1.59 ± 0.303 | 0.265 ± 0.112 | 0.662 ± 0.166 | 0.0 ± 0.0 |
| Ile | 6.845 ± 0.478 | 0.486 ± 0.141 | 4.637 ± 0.418 | 3.621 ± 0.418 | 1.457 ± 0.276 | 3.886 ± 0.438 | 0.397 ± 0.124 | 2.826 ± 0.366 | 2.915 ± 0.382 | 4.372 ± 0.463 | 1.369 ± 0.219 | 1.855 ± 0.279 | 2.12 ± 0.315 | 2.606 ± 0.344 | 4.284 ± 0.431 | 3.931 ± 0.545 | 3.489 ± 0.417 | 4.593 ± 0.461 | 0.53 ± 0.151 | 1.59 ± 0.262 | 0.0 ± 0.0 |
| Lys | 5.697 ± 0.55 | 0.707 ± 0.182 | 3.224 ± 0.336 | 4.151 ± 0.532 | 2.297 ± 0.315 | 4.24 ± 0.538 | 0.972 ± 0.207 | 3.224 ± 0.405 | 3.489 ± 0.465 | 3.356 ± 0.353 | 2.517 ± 0.353 | 2.208 ± 0.35 | 1.899 ± 0.277 | 2.517 ± 0.284 | 3.842 ± 0.38 | 3.268 ± 0.406 | 3.18 ± 0.441 | 3.312 ± 0.413 | 0.839 ± 0.181 | 1.634 ± 0.26 | 0.0 ± 0.0 |
| Leu | 6.227 ± 0.63 | 0.883 ± 0.25 | 4.372 ± 0.467 | 4.196 ± 0.452 | 2.782 ± 0.376 | 4.063 ± 0.498 | 1.281 ± 0.282 | 3.931 ± 0.393 | 5.3 ± 0.457 | 4.284 ± 0.436 | 2.076 ± 0.281 | 3.754 ± 0.432 | 3.312 ± 0.351 | 3.224 ± 0.347 | 4.726 ± 0.61 | 5.874 ± 0.451 | 4.637 ± 0.471 | 4.372 ± 0.502 | 0.662 ± 0.169 | 1.59 ± 0.281 | 0.0 ± 0.0 |
| Met | 2.473 ± 0.315 | 0.221 ± 0.103 | 1.148 ± 0.222 | 1.104 ± 0.194 | 1.148 ± 0.237 | 1.237 ± 0.219 | 0.442 ± 0.121 | 1.855 ± 0.308 | 2.164 ± 0.34 | 1.634 ± 0.271 | 0.707 ± 0.166 | 1.811 ± 0.273 | 1.634 ± 0.288 | 0.795 ± 0.174 | 2.076 ± 0.292 | 1.855 ± 0.26 | 2.517 ± 0.331 | 1.678 ± 0.275 | 0.265 ± 0.107 | 0.972 ± 0.209 | 0.0 ± 0.0 |
| Asn | 4.151 ± 0.479 | 0.442 ± 0.135 | 2.473 ± 0.339 | 2.385 ± 0.347 | 1.767 ± 0.285 | 4.063 ± 0.466 | 0.883 ± 0.199 | 2.297 ± 0.394 | 1.59 ± 0.249 | 3.975 ± 0.465 | 1.016 ± 0.182 | 1.502 ± 0.305 | 2.915 ± 0.36 | 1.767 ± 0.363 | 3.224 ± 0.398 | 1.59 ± 0.285 | 2.473 ± 0.379 | 2.473 ± 0.324 | 0.618 ± 0.168 | 1.192 ± 0.234 | 0.0 ± 0.0 |
| Pro | 4.77 ± 0.49 | 0.442 ± 0.14 | 2.517 ± 0.272 | 3.047 ± 0.451 | 1.59 ± 0.251 | 3.047 ± 0.352 | 0.486 ± 0.159 | 2.826 ± 0.327 | 2.12 ± 0.332 | 3.268 ± 0.406 | 1.104 ± 0.207 | 1.943 ± 0.287 | 1.369 ± 0.306 | 1.59 ± 0.259 | 2.561 ± 0.376 | 2.208 ± 0.307 | 2.473 ± 0.32 | 2.826 ± 0.407 | 0.574 ± 0.187 | 1.325 ± 0.266 | 0.0 ± 0.0 |
| Gln | 3.577 ± 0.383 | 0.353 ± 0.161 | 1.104 ± 0.19 | 2.341 ± 0.273 | 2.341 ± 0.319 | 3.18 ± 0.466 | 0.839 ± 0.192 | 2.694 ± 0.356 | 2.694 ± 0.349 | 3.268 ± 0.377 | 1.104 ± 0.269 | 2.385 ± 0.354 | 1.413 ± 0.228 | 2.032 ± 0.326 | 2.164 ± 0.337 | 2.915 ± 0.447 | 2.076 ± 0.274 | 1.767 ± 0.247 | 0.662 ± 0.168 | 1.104 ± 0.23 | 0.0 ± 0.0 |
| Arg | 6.05 ± 0.617 | 0.927 ± 0.227 | 3.886 ± 0.414 | 5.035 ± 0.568 | 2.606 ± 0.377 | 3.71 ± 0.397 | 1.148 ± 0.226 | 3.577 ± 0.381 | 3.445 ± 0.451 | 4.858 ± 0.517 | 1.502 ± 0.238 | 3.047 ± 0.498 | 1.767 ± 0.285 | 2.65 ± 0.336 | 4.77 ± 0.741 | 3.356 ± 0.433 | 2.871 ± 0.379 | 5.079 ± 0.428 | 0.53 ± 0.156 | 2.208 ± 0.302 | 0.0 ± 0.0 |
| Ser | 4.593 ± 0.508 | 0.574 ± 0.152 | 2.782 ± 0.351 | 4.151 ± 0.619 | 2.385 ± 0.335 | 5.432 ± 0.59 | 1.06 ± 0.227 | 3.666 ± 0.436 | 3.489 ± 0.38 | 5.211 ± 0.486 | 1.546 ± 0.254 | 1.855 ± 0.328 | 2.915 ± 0.366 | 2.606 ± 0.316 | 3.356 ± 0.475 | 3.577 ± 0.437 | 3.754 ± 0.482 | 3.71 ± 0.375 | 0.442 ± 0.143 | 1.634 ± 0.326 | 0.0 ± 0.0 |
| Thr | 6.139 ± 0.76 | 0.397 ± 0.137 | 3.401 ± 0.401 | 2.65 ± 0.378 | 2.076 ± 0.365 | 5.167 ± 0.698 | 1.016 ± 0.226 | 4.019 ± 0.522 | 2.871 ± 0.442 | 4.416 ± 0.52 | 1.016 ± 0.208 | 2.385 ± 0.41 | 3.312 ± 0.386 | 2.032 ± 0.354 | 3.003 ± 0.331 | 3.047 ± 0.352 | 3.621 ± 0.625 | 4.461 ± 0.529 | 1.192 ± 0.242 | 1.325 ± 0.216 | 0.0 ± 0.0 |
| Val | 6.625 ± 0.471 | 1.104 ± 0.225 | 4.593 ± 0.425 | 4.858 ± 0.51 | 3.268 ± 0.316 | 3.71 ± 0.435 | 1.457 ± 0.255 | 2.959 ± 0.335 | 4.328 ± 0.513 | 4.063 ± 0.369 | 1.987 ± 0.24 | 2.606 ± 0.39 | 3.18 ± 0.427 | 2.164 ± 0.318 | 3.886 ± 0.458 | 4.328 ± 0.43 | 4.461 ± 0.445 | 4.902 ± 0.483 | 0.839 ± 0.212 | 2.208 ± 0.371 | 0.0 ± 0.0 |
| Trp | 1.281 ± 0.257 | 0.265 ± 0.102 | 0.751 ± 0.202 | 0.839 ± 0.164 | 0.618 ± 0.171 | 0.751 ± 0.179 | 0.442 ± 0.152 | 0.883 ± 0.193 | 0.751 ± 0.175 | 1.502 ± 0.359 | 0.353 ± 0.122 | 0.751 ± 0.167 | 0.442 ± 0.112 | 0.486 ± 0.182 | 1.016 ± 0.245 | 0.751 ± 0.166 | 1.148 ± 0.192 | 1.06 ± 0.213 | 0.221 ± 0.103 | 0.486 ± 0.143 | 0.0 ± 0.0 |
| Tyr | 2.164 ± 0.315 | 0.309 ± 0.128 | 1.855 ± 0.347 | 2.032 ± 0.273 | 0.839 ± 0.183 | 2.915 ± 0.396 | 0.397 ± 0.14 | 1.722 ± 0.304 | 0.53 ± 0.137 | 1.811 ± 0.287 | 0.927 ± 0.198 | 1.016 ± 0.229 | 1.104 ± 0.206 | 1.016 ± 0.229 | 2.032 ± 0.301 | 1.59 ± 0.282 | 1.855 ± 0.309 | 2.429 ± 0.34 | 0.53 ± 0.149 | 0.839 ± 0.209 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 127 proteins (22644 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
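A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread across replicates is reported. The helper names, the fixed seed, the toy sequences, and the choice of the sample standard deviation as the error measure are all assumptions; the page does not specify how the ± value is derived from the 100 replicates.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (in per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(replicates)


# Hypothetical toy proteome, not real BcepF1 sequences.
toy = ["MAAKL", "MKAAV", "MGGSA", "MAAAA"]
point = permille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so every replicate contains only dipeptides that actually occur in some protein.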

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski