Amino acid dipeptide frequency for Burkholderia virus phi1026b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
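The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: it assumes that dipeptides are counted as overlapping adjacent residue pairs within each protein (never across protein boundaries), that sequences use one-letter codes, and that "expected" means the uniform 1/400 baseline. The function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

# Uniform baseline for the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; the last residue of one
        # protein is never paired with the first residue of the next.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MKVAAL", "AAGK"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Under the same counting assumption, the 83 proteins and 17246 amino acids reported at the end of this page would give 17246 − 83 = 17163 dipeptides, so a single occurrence corresponds to about 0.058‰, which agrees with the smallest non-zero values in the table.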

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.976 ± 2.414
AlaCys: 1.45 ± 0.354
AlaAsp: 6.437 ± 0.688
AlaGlu: 8.176 ± 0.918
AlaPhe: 3.769 ± 0.327
AlaGly: 9.974 ± 0.91
AlaHis: 1.798 ± 0.308
AlaIle: 5.741 ± 0.621
AlaLys: 5.451 ± 0.536
AlaLeu: 10.728 ± 0.777
AlaMet: 2.667 ± 0.378
AlaAsn: 3.653 ± 0.469
AlaPro: 5.103 ± 0.775
AlaGln: 4.349 ± 1.02
AlaArg: 9.568 ± 1.211
AlaSer: 6.901 ± 0.788
AlaThr: 6.321 ± 0.776
AlaVal: 8.524 ± 0.769
AlaTrp: 2.088 ± 0.324
AlaTyr: 1.856 ± 0.285
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.45 ± 0.309
CysCys: 0.232 ± 0.12
CysAsp: 1.276 ± 0.285
CysGlu: 0.986 ± 0.261
CysPhe: 0.522 ± 0.14
CysGly: 1.218 ± 0.336
CysHis: 0.174 ± 0.127
CysIle: 0.58 ± 0.197
CysLys: 0.58 ± 0.177
CysLeu: 0.928 ± 0.234
CysMet: 0.174 ± 0.102
CysAsn: 0.522 ± 0.183
CysPro: 0.464 ± 0.159
CysGln: 0.58 ± 0.19
CysArg: 0.928 ± 0.279
CysSer: 0.87 ± 0.259
CysThr: 0.696 ± 0.17
CysVal: 1.218 ± 0.292
CysTrp: 0.116 ± 0.082
CysTyr: 0.522 ± 0.154
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.944 ± 0.731
AspCys: 1.218 ± 0.278
AspAsp: 4.407 ± 0.457
AspGlu: 5.335 ± 0.627
AspPhe: 1.334 ± 0.245
AspGly: 6.785 ± 0.832
AspHis: 0.928 ± 0.297
AspIle: 2.03 ± 0.336
AspLys: 2.088 ± 0.265
AspLeu: 4.291 ± 0.569
AspMet: 1.334 ± 0.26
AspAsn: 1.566 ± 0.348
AspPro: 2.493 ± 0.375
AspGln: 1.798 ± 0.304
AspArg: 4.639 ± 0.588
AspSer: 1.914 ± 0.364
AspThr: 2.146 ± 0.29
AspVal: 4.059 ± 0.52
AspTrp: 1.45 ± 0.358
AspTyr: 1.508 ± 0.27
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.263 ± 0.646
GluCys: 1.044 ± 0.32
GluAsp: 2.841 ± 0.455
GluGlu: 2.725 ± 0.47
GluPhe: 2.088 ± 0.303
GluGly: 3.711 ± 0.415
GluHis: 1.45 ± 0.309
GluIle: 3.421 ± 0.339
GluLys: 3.363 ± 0.454
GluLeu: 6.321 ± 0.651
GluMet: 1.102 ± 0.281
GluAsn: 1.566 ± 0.326
GluPro: 3.131 ± 0.441
GluGln: 2.957 ± 0.472
GluArg: 6.031 ± 0.613
GluSer: 4.291 ± 0.448
GluThr: 3.537 ± 0.506
GluVal: 4.059 ± 0.475
GluTrp: 1.392 ± 0.229
GluTyr: 1.508 ± 0.274
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.189 ± 0.477
PheCys: 0.348 ± 0.133
PheAsp: 2.03 ± 0.29
PheGlu: 2.204 ± 0.348
PhePhe: 0.754 ± 0.242
PheGly: 3.363 ± 0.469
PheHis: 0.522 ± 0.148
PheIle: 1.102 ± 0.273
PheLys: 1.392 ± 0.297
PheLeu: 2.493 ± 0.403
PheMet: 0.812 ± 0.237
PheAsn: 1.16 ± 0.265
PhePro: 1.798 ± 0.356
PheGln: 1.044 ± 0.235
PheArg: 2.378 ± 0.351
PheSer: 2.435 ± 0.404
PheThr: 1.45 ± 0.317
PheVal: 1.624 ± 0.285
PheTrp: 0.58 ± 0.185
PheTyr: 1.218 ± 0.257
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.466 ± 0.834
GlyCys: 1.16 ± 0.267
GlyAsp: 5.451 ± 0.582
GlyGlu: 4.697 ± 0.514
GlyPhe: 2.899 ± 0.642
GlyGly: 7.306 ± 0.838
GlyHis: 1.276 ± 0.32
GlyIle: 3.711 ± 0.408
GlyLys: 4.581 ± 0.623
GlyLeu: 6.553 ± 0.586
GlyMet: 2.146 ± 0.36
GlyAsn: 3.073 ± 0.433
GlyPro: 2.262 ± 0.286
GlyGln: 2.957 ± 0.368
GlyArg: 6.205 ± 0.664
GlySer: 3.885 ± 0.496
GlyThr: 3.827 ± 0.483
GlyVal: 6.669 ± 0.621
GlyTrp: 1.682 ± 0.331
GlyTyr: 2.088 ± 0.347
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.146 ± 0.456
HisCys: 0.29 ± 0.14
HisAsp: 0.986 ± 0.266
HisGlu: 0.928 ± 0.224
HisPhe: 0.58 ± 0.236
HisGly: 1.798 ± 0.383
HisHis: 0.522 ± 0.187
HisIle: 0.754 ± 0.281
HisLys: 0.522 ± 0.153
HisLeu: 1.682 ± 0.399
HisMet: 0.87 ± 0.232
HisAsn: 0.406 ± 0.157
HisPro: 1.044 ± 0.232
HisGln: 0.58 ± 0.193
HisArg: 1.45 ± 0.301
HisSer: 1.276 ± 0.288
HisThr: 1.16 ± 0.251
HisVal: 1.044 ± 0.216
HisTrp: 0.058 ± 0.056
HisTyr: 0.464 ± 0.135
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.553 ± 0.589
IleCys: 0.522 ± 0.177
IleAsp: 4.349 ± 0.447
IleGlu: 4.291 ± 0.673
IlePhe: 0.696 ± 0.207
IleGly: 4.291 ± 0.519
IleHis: 0.696 ± 0.215
IleIle: 1.624 ± 0.327
IleLys: 2.204 ± 0.34
IleLeu: 2.899 ± 0.473
IleMet: 0.754 ± 0.246
IleAsn: 1.508 ± 0.331
IlePro: 1.856 ± 0.307
IleGln: 1.45 ± 0.317
IleArg: 2.899 ± 0.4
IleSer: 2.667 ± 0.411
IleThr: 2.841 ± 0.498
IleVal: 4.465 ± 0.573
IleTrp: 0.58 ± 0.187
IleTyr: 0.928 ± 0.248
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.929 ± 0.555
LysCys: 0.348 ± 0.157
LysAsp: 1.566 ± 0.283
LysGlu: 1.914 ± 0.348
LysPhe: 0.928 ± 0.26
LysGly: 2.378 ± 0.459
LysHis: 1.16 ± 0.312
LysIle: 1.508 ± 0.249
LysLys: 1.972 ± 0.347
LysLeu: 4.291 ± 0.568
LysMet: 1.276 ± 0.227
LysAsn: 1.218 ± 0.265
LysPro: 1.798 ± 0.358
LysGln: 1.914 ± 0.335
LysArg: 3.885 ± 0.605
LysSer: 3.131 ± 0.501
LysThr: 2.551 ± 0.43
LysVal: 2.551 ± 0.528
LysTrp: 0.754 ± 0.207
LysTyr: 1.624 ± 0.334
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.916 ± 0.942
LeuCys: 1.218 ± 0.288
LeuAsp: 4.755 ± 0.459
LeuGlu: 5.509 ± 0.805
LeuPhe: 2.609 ± 0.428
LeuGly: 5.045 ± 0.674
LeuHis: 1.218 ± 0.275
LeuIle: 4.117 ± 0.473
LeuLys: 2.899 ± 0.636
LeuLeu: 5.973 ± 0.638
LeuMet: 1.74 ± 0.319
LeuAsn: 2.783 ± 0.366
LeuPro: 3.827 ± 0.487
LeuGln: 3.305 ± 0.447
LeuArg: 7.538 ± 0.742
LeuSer: 4.813 ± 0.516
LeuThr: 5.219 ± 0.501
LeuVal: 4.813 ± 0.572
LeuTrp: 1.102 ± 0.219
LeuTyr: 2.204 ± 0.335
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.305 ± 0.41
MetCys: 0.174 ± 0.088
MetAsp: 0.812 ± 0.236
MetGlu: 0.754 ± 0.186
MetPhe: 0.58 ± 0.212
MetGly: 1.624 ± 0.309
MetHis: 0.464 ± 0.148
MetIle: 1.218 ± 0.224
MetLys: 1.45 ± 0.283
MetLeu: 1.682 ± 0.289
MetMet: 0.29 ± 0.113
MetAsn: 1.218 ± 0.275
MetPro: 0.928 ± 0.245
MetGln: 0.986 ± 0.22
MetArg: 1.682 ± 0.352
MetSer: 1.856 ± 0.335
MetThr: 1.972 ± 0.375
MetVal: 1.276 ± 0.283
MetTrp: 0.348 ± 0.151
MetTyr: 0.116 ± 0.072
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.117 ± 0.521
AsnCys: 0.29 ± 0.172
AsnAsp: 2.146 ± 0.381
AsnGlu: 1.798 ± 0.338
AsnPhe: 0.812 ± 0.196
AsnGly: 3.595 ± 0.468
AsnHis: 0.696 ± 0.206
AsnIle: 1.218 ± 0.192
AsnLys: 0.638 ± 0.24
AsnLeu: 2.783 ± 0.378
AsnMet: 0.348 ± 0.165
AsnAsn: 0.986 ± 0.236
AsnPro: 1.798 ± 0.315
AsnGln: 0.754 ± 0.168
AsnArg: 1.856 ± 0.315
AsnSer: 1.798 ± 0.298
AsnThr: 1.392 ± 0.252
AsnVal: 2.493 ± 0.437
AsnTrp: 0.754 ± 0.184
AsnTyr: 0.87 ± 0.238
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.161 ± 0.765
ProCys: 0.696 ± 0.184
ProAsp: 2.493 ± 0.398
ProGlu: 2.899 ± 0.438
ProPhe: 1.508 ± 0.337
ProGly: 3.943 ± 0.384
ProHis: 0.696 ± 0.205
ProIle: 3.015 ± 0.503
ProLys: 1.682 ± 0.353
ProLeu: 2.783 ± 0.402
ProMet: 1.334 ± 0.259
ProAsn: 1.508 ± 0.298
ProPro: 3.189 ± 0.653
ProGln: 1.392 ± 0.323
ProArg: 2.435 ± 0.406
ProSer: 2.899 ± 0.38
ProThr: 2.667 ± 0.418
ProVal: 3.827 ± 0.469
ProTrp: 0.696 ± 0.172
ProTyr: 0.986 ± 0.24
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.161 ± 0.839
GlnCys: 0.58 ± 0.153
GlnAsp: 1.914 ± 0.329
GlnGlu: 1.45 ± 0.335
GlnPhe: 1.45 ± 0.314
GlnGly: 2.667 ± 0.329
GlnHis: 0.754 ± 0.192
GlnIle: 2.262 ± 0.388
GlnLys: 1.566 ± 0.287
GlnLeu: 3.189 ± 0.459
GlnMet: 0.812 ± 0.209
GlnAsn: 1.102 ± 0.264
GlnPro: 1.798 ± 0.247
GlnGln: 2.03 ± 0.347
GlnArg: 3.537 ± 0.45
GlnSer: 2.378 ± 0.3
GlnThr: 1.334 ± 0.301
GlnVal: 2.03 ± 0.39
GlnTrp: 0.696 ± 0.2
GlnTyr: 1.044 ± 0.308
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.554 ± 0.996
ArgCys: 0.754 ± 0.22
ArgAsp: 4.175 ± 0.537
ArgGlu: 5.451 ± 0.676
ArgPhe: 3.247 ± 0.443
ArgGly: 4.523 ± 0.714
ArgHis: 1.74 ± 0.333
ArgIle: 3.479 ± 0.422
ArgLys: 2.783 ± 0.509
ArgLeu: 6.611 ± 0.57
ArgMet: 1.972 ± 0.271
ArgAsn: 2.03 ± 0.327
ArgPro: 2.841 ± 0.415
ArgGln: 3.479 ± 0.35
ArgArg: 5.509 ± 0.639
ArgSer: 4.117 ± 0.538
ArgThr: 3.537 ± 0.378
ArgVal: 6.495 ± 0.636
ArgTrp: 1.334 ± 0.26
ArgTyr: 1.682 ± 0.328
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.901 ± 0.763
SerCys: 1.218 ± 0.302
SerAsp: 4.001 ± 0.494
SerGlu: 3.131 ± 0.46
SerPhe: 1.74 ± 0.342
SerGly: 5.045 ± 0.644
SerHis: 0.986 ± 0.214
SerIle: 3.189 ± 0.421
SerLys: 2.204 ± 0.394
SerLeu: 4.117 ± 0.53
SerMet: 1.914 ± 0.28
SerAsn: 2.146 ± 0.383
SerPro: 2.725 ± 0.596
SerGln: 1.74 ± 0.315
SerArg: 3.015 ± 0.503
SerSer: 3.595 ± 0.37
SerThr: 3.769 ± 0.541
SerVal: 4.465 ± 0.538
SerTrp: 0.638 ± 0.206
SerTyr: 1.914 ± 0.452
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.959 ± 0.868
ThrCys: 0.464 ± 0.154
ThrAsp: 2.725 ± 0.359
ThrGlu: 2.841 ± 0.371
ThrPhe: 2.32 ± 0.42
ThrGly: 3.653 ± 0.374
ThrHis: 1.218 ± 0.292
ThrIle: 2.899 ± 0.379
ThrLys: 1.798 ± 0.345
ThrLeu: 4.001 ± 0.505
ThrMet: 1.276 ± 0.249
ThrAsn: 1.74 ± 0.323
ThrPro: 3.537 ± 0.372
ThrGln: 1.972 ± 0.338
ThrArg: 2.841 ± 0.447
ThrSer: 3.015 ± 0.497
ThrThr: 2.609 ± 0.358
ThrVal: 4.523 ± 0.59
ThrTrp: 0.638 ± 0.196
ThrTyr: 1.624 ± 0.332
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.19 ± 0.596
ValCys: 0.986 ± 0.251
ValAsp: 4.755 ± 0.686
ValGlu: 5.277 ± 0.709
ValPhe: 2.204 ± 0.41
ValGly: 7.075 ± 0.595
ValHis: 1.508 ± 0.278
ValIle: 3.595 ± 0.507
ValLys: 3.305 ± 0.483
ValLeu: 5.451 ± 0.642
ValMet: 1.218 ± 0.243
ValAsn: 1.972 ± 0.368
ValPro: 3.537 ± 0.504
ValGln: 2.551 ± 0.523
ValArg: 5.915 ± 0.613
ValSer: 4.697 ± 0.466
ValThr: 3.363 ± 0.434
ValVal: 5.161 ± 0.473
ValTrp: 1.102 ± 0.223
ValTyr: 2.03 ± 0.365
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.392 ± 0.276
TrpCys: 0.522 ± 0.141
TrpAsp: 0.928 ± 0.294
TrpGlu: 0.87 ± 0.241
TrpPhe: 0.638 ± 0.166
TrpGly: 0.928 ± 0.258
TrpHis: 0.406 ± 0.182
TrpIle: 1.45 ± 0.276
TrpLys: 0.522 ± 0.175
TrpLeu: 2.088 ± 0.333
TrpMet: 0.348 ± 0.183
TrpAsn: 0.29 ± 0.131
TrpPro: 0.638 ± 0.203
TrpGln: 0.638 ± 0.207
TrpArg: 1.74 ± 0.311
TrpSer: 0.696 ± 0.218
TrpThr: 0.754 ± 0.221
TrpVal: 1.392 ± 0.253
TrpTrp: 0.348 ± 0.16
TrpTyr: 0.348 ± 0.123
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.957 ± 0.43
TyrCys: 0.522 ± 0.149
TyrAsp: 1.218 ± 0.259
TyrGlu: 1.566 ± 0.264
TyrPhe: 1.218 ± 0.277
TyrGly: 1.972 ± 0.344
TyrHis: 0.348 ± 0.117
TyrIle: 1.16 ± 0.272
TyrLys: 0.754 ± 0.208
TyrLeu: 1.74 ± 0.383
TyrMet: 0.29 ± 0.124
TyrAsn: 0.696 ± 0.165
TyrPro: 1.044 ± 0.289
TyrGln: 1.276 ± 0.28
TyrArg: 2.262 ± 0.356
TyrSer: 1.218 ± 0.273
TyrThr: 1.508 ± 0.242
TyrVal: 2.204 ± 0.352
TyrTrp: 0.58 ± 0.177
TyrTyr: 0.464 ± 0.151
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (17246 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
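Protein-level bootstrapping can be sketched as follows. This is an illustrative sketch under stated assumptions, not the database's actual code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of the replicates is taken as the error. The source does not say which spread measure is used, so the standard deviation is an assumption, and the names `pair_permille` and `bootstrap_error` as well as the toy sequences are hypothetical.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all within-protein pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(resample, pair))
    return stdev(replicates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MKVAAL", "AAGK", "GAAV", "MLLK"]
estimate = pair_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins in the proteome.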

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski