Amino acid dipeptide frequency for Burkholderia phage PE067

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
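The counting behind such a table can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the `per_residue` switch, and the toy sequences are all hypothetical. Overlapping pairs are counted within each protein and never across protein boundaries. By default the counts are divided by the number of pairs. The values in the table look consistent with dividing by the total number of residues instead (for example 2/13885 ≈ 0.144‰), but that is an inference from the numbers, so it is offered only as an option.

```python
from collections import Counter


def dipeptide_per_mille(proteins, per_residue=False):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only.
    per_residue=True divides by the total residue count instead of the
    total pair count (an inferred, unconfirmed convention).
    """
    counts = Counter()
    residues = 0
    for seq in proteins:
        residues += len(seq)
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = residues if per_residue else sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not from the phage proteome.
toy = ["MAAG", "AAK"]
freqs = dipeptide_per_mille(toy)
freqs_res = dipeptide_per_mille(toy, per_residue=True)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_expectation = 1000.0 / 400
```

In the toy input, "AA" occurs twice among five pairs, so it is far above the uniform expectation of 2.5‰. That is only because the example is tiny.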

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
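As a small worked example of the unit conversion, the sketch below turns two table values into plain fractions and compares them with the uniform expectation of 0.25% (2.5‰). The helper names `per_mille_to_fraction` and `classify` are hypothetical, and treating the uniform expectation as the red/blue threshold is an assumption based on the description above; the page may use a different cut-off.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"   # would be marked red under this assumption
    if value_per_mille < expected:
        return "under"  # would be marked blue under this assumption
    return "expected"


ala_ala = 20.383  # AlaAla value from the table, in per mille
cys_cys = 0.216   # CysCys value from the table, in per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.020383
ala_ala_label = classify(ala_ala)
cys_cys_label = classify(cys_cys)
```

Under this assumption AlaAla is classified as overrepresented and CysCys as underrepresented.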

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.383 ± 2.041
AlaCys: 1.296 ± 0.358
AlaAsp: 6.986 ± 0.704
AlaGlu: 8.643 ± 1.436
AlaPhe: 4.538 ± 0.655
AlaGly: 9.435 ± 0.829
AlaHis: 1.513 ± 0.275
AlaIle: 6.338 ± 0.735
AlaLys: 4.177 ± 0.593
AlaLeu: 11.596 ± 0.919
AlaMet: 4.249 ± 0.504
AlaAsn: 4.033 ± 0.545
AlaPro: 5.834 ± 0.78
AlaGln: 6.554 ± 1.052
AlaArg: 9.363 ± 1.054
AlaSer: 8.067 ± 0.971
AlaThr: 6.986 ± 0.906
AlaVal: 8.859 ± 0.87
AlaTrp: 1.801 ± 0.37
AlaTyr: 2.809 ± 0.466
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.008 ± 0.262
CysCys: 0.216 ± 0.161
CysAsp: 0.576 ± 0.21
CysGlu: 0.576 ± 0.166
CysPhe: 0.216 ± 0.128
CysGly: 1.296 ± 0.302
CysHis: 0.144 ± 0.106
CysIle: 0.36 ± 0.143
CysLys: 0.288 ± 0.134
CysLeu: 0.504 ± 0.206
CysMet: 0.504 ± 0.183
CysAsn: 0.216 ± 0.12
CysPro: 0.36 ± 0.161
CysGln: 0.504 ± 0.223
CysArg: 0.504 ± 0.203
CysSer: 0.36 ± 0.144
CysThr: 0.432 ± 0.174
CysVal: 0.576 ± 0.172
CysTrp: 0.216 ± 0.107
CysTyr: 0.36 ± 0.137
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.058 ± 0.772
AspCys: 0.648 ± 0.208
AspAsp: 3.169 ± 0.592
AspGlu: 4.033 ± 0.666
AspPhe: 1.513 ± 0.339
AspGly: 5.33 ± 0.589
AspHis: 1.224 ± 0.22
AspIle: 2.881 ± 0.416
AspLys: 1.585 ± 0.337
AspLeu: 4.466 ± 0.519
AspMet: 1.368 ± 0.278
AspAsn: 2.521 ± 0.435
AspPro: 3.169 ± 0.393
AspGln: 1.945 ± 0.468
AspArg: 4.682 ± 0.666
AspSer: 2.521 ± 0.473
AspThr: 2.809 ± 0.475
AspVal: 2.953 ± 0.414
AspTrp: 1.224 ± 0.321
AspTyr: 0.864 ± 0.298
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.058 ± 1.305
GluCys: 0.576 ± 0.248
GluAsp: 2.377 ± 0.329
GluGlu: 3.385 ± 0.771
GluPhe: 1.729 ± 0.29
GluGly: 3.169 ± 0.555
GluHis: 1.152 ± 0.312
GluIle: 3.889 ± 0.449
GluLys: 2.305 ± 0.529
GluLeu: 4.538 ± 0.611
GluMet: 1.513 ± 0.394
GluAsn: 1.513 ± 0.348
GluPro: 3.097 ± 0.435
GluGln: 3.097 ± 0.683
GluArg: 5.042 ± 1.063
GluSer: 2.449 ± 0.421
GluThr: 2.665 ± 0.456
GluVal: 3.169 ± 0.476
GluTrp: 1.008 ± 0.306
GluTyr: 1.152 ± 0.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.177 ± 0.554
PheCys: 0.144 ± 0.097
PheAsp: 2.737 ± 0.415
PheGlu: 1.873 ± 0.355
PhePhe: 0.936 ± 0.387
PheGly: 3.097 ± 0.442
PheHis: 0.288 ± 0.142
PheIle: 1.513 ± 0.347
PheLys: 0.864 ± 0.3
PheLeu: 1.657 ± 0.322
PheMet: 0.72 ± 0.223
PheAsn: 1.513 ± 0.393
PhePro: 1.585 ± 0.333
PheGln: 0.36 ± 0.175
PheArg: 2.161 ± 0.418
PheSer: 1.657 ± 0.363
PheThr: 2.017 ± 0.425
PheVal: 1.801 ± 0.43
PheTrp: 0.576 ± 0.246
PheTyr: 0.648 ± 0.395
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.012 ± 1.082
GlyCys: 1.152 ± 0.328
GlyAsp: 4.826 ± 0.582
GlyGlu: 3.601 ± 0.56
GlyPhe: 3.313 ± 0.481
GlyGly: 7.131 ± 1.137
GlyHis: 0.72 ± 0.235
GlyIle: 4.249 ± 0.542
GlyLys: 2.881 ± 0.487
GlyLeu: 5.402 ± 0.682
GlyMet: 2.377 ± 0.421
GlyAsn: 2.377 ± 0.38
GlyPro: 2.449 ± 0.346
GlyGln: 3.529 ± 0.621
GlyArg: 4.754 ± 0.639
GlySer: 4.394 ± 0.654
GlyThr: 4.754 ± 0.727
GlyVal: 7.058 ± 0.74
GlyTrp: 1.008 ± 0.225
GlyTyr: 2.809 ± 0.414
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.521 ± 0.316
HisCys: 0.576 ± 0.231
HisAsp: 0.504 ± 0.164
HisGlu: 0.72 ± 0.226
HisPhe: 0.504 ± 0.215
HisGly: 1.657 ± 0.333
HisHis: 0.288 ± 0.16
HisIle: 0.576 ± 0.171
HisLys: 0.576 ± 0.163
HisLeu: 1.152 ± 0.311
HisMet: 0.216 ± 0.123
HisAsn: 0.432 ± 0.203
HisPro: 1.152 ± 0.288
HisGln: 0.288 ± 0.171
HisArg: 1.08 ± 0.283
HisSer: 0.576 ± 0.193
HisThr: 0.648 ± 0.204
HisVal: 1.368 ± 0.326
HisTrp: 0.216 ± 0.116
HisTyr: 0.288 ± 0.188
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.914 ± 0.559
IleCys: 0.288 ± 0.143
IleAsp: 3.601 ± 0.511
IleGlu: 2.881 ± 0.415
IlePhe: 1.224 ± 0.364
IleGly: 4.394 ± 0.555
IleHis: 0.576 ± 0.207
IleIle: 2.377 ± 0.525
IleLys: 1.945 ± 0.33
IleLeu: 2.809 ± 0.377
IleMet: 0.432 ± 0.181
IleAsn: 1.873 ± 0.364
IlePro: 3.097 ± 0.464
IleGln: 2.449 ± 0.377
IleArg: 2.593 ± 0.508
IleSer: 3.313 ± 0.601
IleThr: 4.177 ± 0.457
IleVal: 3.817 ± 0.608
IleTrp: 0.936 ± 0.295
IleTyr: 1.224 ± 0.291
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.546 ± 0.772
LysCys: 0.216 ± 0.115
LysAsp: 1.368 ± 0.377
LysGlu: 1.296 ± 0.346
LysPhe: 0.936 ± 0.247
LysGly: 1.729 ± 0.338
LysHis: 0.792 ± 0.242
LysIle: 2.305 ± 0.488
LysLys: 2.089 ± 0.423
LysLeu: 3.025 ± 0.592
LysMet: 0.936 ± 0.247
LysAsn: 1.08 ± 0.32
LysPro: 2.377 ± 0.411
LysGln: 1.945 ± 0.397
LysArg: 2.233 ± 0.387
LysSer: 1.801 ± 0.396
LysThr: 2.521 ± 0.439
LysVal: 2.089 ± 0.478
LysTrp: 0.936 ± 0.317
LysTyr: 0.936 ± 0.264
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.588 ± 0.694
LeuCys: 0.648 ± 0.196
LeuAsp: 4.249 ± 0.536
LeuGlu: 3.817 ± 0.683
LeuPhe: 2.161 ± 0.464
LeuGly: 5.618 ± 0.67
LeuHis: 1.657 ± 0.396
LeuIle: 4.322 ± 0.605
LeuLys: 3.313 ± 0.798
LeuLeu: 6.41 ± 0.78
LeuMet: 1.296 ± 0.291
LeuAsn: 2.953 ± 0.522
LeuPro: 4.538 ± 0.566
LeuGln: 4.033 ± 0.61
LeuArg: 5.186 ± 0.594
LeuSer: 5.402 ± 0.787
LeuThr: 6.482 ± 0.662
LeuVal: 5.546 ± 0.635
LeuTrp: 0.72 ± 0.177
LeuTyr: 2.089 ± 0.372
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.313 ± 0.463
MetCys: 0.072 ± 0.081
MetAsp: 1.729 ± 0.346
MetGlu: 0.72 ± 0.22
MetPhe: 0.432 ± 0.225
MetGly: 1.224 ± 0.294
MetHis: 0.432 ± 0.143
MetIle: 0.576 ± 0.175
MetLys: 1.729 ± 0.319
MetLeu: 2.089 ± 0.336
MetMet: 0.432 ± 0.246
MetAsn: 1.008 ± 0.28
MetPro: 1.657 ± 0.4
MetGln: 1.224 ± 0.284
MetArg: 1.873 ± 0.36
MetSer: 2.449 ± 0.379
MetThr: 1.224 ± 0.308
MetVal: 1.08 ± 0.287
MetTrp: 0.0 ± 0.0
MetTyr: 0.144 ± 0.084
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.105 ± 0.594
AsnCys: 0.36 ± 0.138
AsnAsp: 1.801 ± 0.368
AsnGlu: 1.945 ± 0.317
AsnPhe: 1.008 ± 0.252
AsnGly: 3.817 ± 0.535
AsnHis: 0.504 ± 0.187
AsnIle: 1.657 ± 0.352
AsnLys: 0.792 ± 0.241
AsnLeu: 3.097 ± 0.569
AsnMet: 0.72 ± 0.229
AsnAsn: 1.513 ± 0.322
AsnPro: 2.449 ± 0.46
AsnGln: 1.296 ± 0.274
AsnArg: 1.513 ± 0.319
AsnSer: 2.593 ± 0.547
AsnThr: 2.161 ± 0.408
AsnVal: 3.313 ± 0.554
AsnTrp: 0.72 ± 0.236
AsnTyr: 0.936 ± 0.276
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.139 ± 0.813
ProCys: 0.432 ± 0.201
ProAsp: 3.097 ± 0.469
ProGlu: 3.025 ± 0.452
ProPhe: 1.224 ± 0.295
ProGly: 4.394 ± 0.569
ProHis: 0.432 ± 0.198
ProIle: 2.809 ± 0.55
ProLys: 1.513 ± 0.377
ProLeu: 3.889 ± 0.584
ProMet: 0.72 ± 0.253
ProAsn: 2.881 ± 0.505
ProPro: 2.593 ± 0.48
ProGln: 2.233 ± 0.451
ProArg: 3.025 ± 0.591
ProSer: 2.593 ± 0.636
ProThr: 3.601 ± 0.542
ProVal: 3.745 ± 0.632
ProTrp: 0.576 ± 0.215
ProTyr: 1.513 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.122 ± 0.83
GlnCys: 0.144 ± 0.102
GlnAsp: 1.441 ± 0.378
GlnGlu: 1.801 ± 0.443
GlnPhe: 1.873 ± 0.423
GlnGly: 3.529 ± 0.575
GlnHis: 0.72 ± 0.218
GlnIle: 2.953 ± 0.435
GlnLys: 1.513 ± 0.396
GlnLeu: 4.105 ± 0.603
GlnMet: 1.945 ± 0.469
GlnAsn: 2.233 ± 0.48
GlnPro: 2.089 ± 0.355
GlnGln: 3.241 ± 0.653
GlnArg: 2.665 ± 0.495
GlnSer: 2.881 ± 0.49
GlnThr: 2.161 ± 0.386
GlnVal: 3.169 ± 0.506
GlnTrp: 0.936 ± 0.293
GlnTyr: 1.441 ± 0.359
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.219 ± 1.179
ArgCys: 0.576 ± 0.283
ArgAsp: 4.033 ± 0.622
ArgGlu: 4.754 ± 1.27
ArgPhe: 1.945 ± 0.358
ArgGly: 3.817 ± 0.559
ArgHis: 1.296 ± 0.299
ArgIle: 2.665 ± 0.35
ArgLys: 2.017 ± 0.372
ArgLeu: 4.826 ± 0.54
ArgMet: 1.657 ± 0.313
ArgAsn: 2.161 ± 0.434
ArgPro: 2.809 ± 0.497
ArgGln: 2.305 ± 0.404
ArgArg: 4.898 ± 0.754
ArgSer: 2.953 ± 0.413
ArgThr: 3.529 ± 0.551
ArgVal: 4.466 ± 0.739
ArgTrp: 1.224 ± 0.254
ArgTyr: 2.881 ± 0.384
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.635 ± 1.27
SerCys: 0.648 ± 0.21
SerAsp: 3.601 ± 0.552
SerGlu: 2.593 ± 0.433
SerPhe: 1.657 ± 0.371
SerGly: 5.114 ± 1.098
SerHis: 0.936 ± 0.246
SerIle: 4.177 ± 0.551
SerLys: 2.017 ± 0.371
SerLeu: 4.033 ± 0.621
SerMet: 1.368 ± 0.356
SerAsn: 1.873 ± 0.472
SerPro: 2.521 ± 0.44
SerGln: 2.881 ± 0.481
SerArg: 3.097 ± 0.536
SerSer: 3.457 ± 0.804
SerThr: 4.322 ± 0.646
SerVal: 4.826 ± 0.81
SerTrp: 0.648 ± 0.274
SerTyr: 1.368 ± 0.326
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.707 ± 0.59
ThrCys: 0.36 ± 0.147
ThrAsp: 3.313 ± 0.47
ThrGlu: 2.665 ± 0.542
ThrPhe: 2.017 ± 0.477
ThrGly: 6.194 ± 0.757
ThrHis: 1.008 ± 0.319
ThrIle: 3.241 ± 0.535
ThrLys: 1.801 ± 0.363
ThrLeu: 5.546 ± 0.604
ThrMet: 0.864 ± 0.246
ThrAsn: 2.233 ± 0.473
ThrPro: 3.745 ± 0.545
ThrGln: 3.241 ± 0.587
ThrArg: 2.737 ± 0.419
ThrSer: 3.889 ± 0.677
ThrThr: 3.745 ± 0.58
ThrVal: 4.682 ± 0.71
ThrTrp: 1.08 ± 0.294
ThrTyr: 1.224 ± 0.317
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.851 ± 0.756
ValCys: 0.36 ± 0.151
ValAsp: 4.61 ± 0.563
ValGlu: 4.61 ± 0.551
ValPhe: 1.368 ± 0.291
ValGly: 4.322 ± 0.552
ValHis: 1.152 ± 0.27
ValIle: 2.593 ± 0.399
ValLys: 2.809 ± 0.441
ValLeu: 6.77 ± 0.88
ValMet: 1.224 ± 0.266
ValAsn: 2.233 ± 0.398
ValPro: 4.61 ± 0.863
ValGln: 4.177 ± 0.665
ValArg: 4.466 ± 0.694
ValSer: 4.754 ± 0.572
ValThr: 4.898 ± 0.618
ValVal: 4.322 ± 0.658
ValTrp: 0.72 ± 0.184
ValTyr: 1.873 ± 0.435
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.08 ± 0.287
TrpCys: 0.144 ± 0.102
TrpAsp: 0.864 ± 0.232
TrpGlu: 0.792 ± 0.26
TrpPhe: 0.504 ± 0.172
TrpGly: 1.08 ± 0.25
TrpHis: 0.216 ± 0.117
TrpIle: 0.648 ± 0.178
TrpLys: 1.152 ± 0.339
TrpLeu: 1.729 ± 0.323
TrpMet: 0.216 ± 0.1
TrpAsn: 0.432 ± 0.179
TrpPro: 0.648 ± 0.189
TrpGln: 0.792 ± 0.209
TrpArg: 0.72 ± 0.217
TrpSer: 1.08 ± 0.271
TrpThr: 1.224 ± 0.305
TrpVal: 0.792 ± 0.234
TrpTrp: 0.216 ± 0.121
TrpTyr: 0.72 ± 0.206
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.025 ± 0.46
TyrCys: 0.36 ± 0.158
TyrAsp: 1.08 ± 0.286
TyrGlu: 1.152 ± 0.253
TyrPhe: 1.224 ± 0.306
TyrGly: 2.449 ± 0.636
TyrHis: 0.288 ± 0.133
TyrIle: 0.792 ± 0.25
TyrLys: 0.792 ± 0.231
TyrLeu: 3.385 ± 0.731
TyrMet: 0.576 ± 0.209
TyrAsn: 1.296 ± 0.34
TyrPro: 1.657 ± 0.362
TyrGln: 0.864 ± 0.233
TyrArg: 1.441 ± 0.291
TyrSer: 1.585 ± 0.393
TyrThr: 0.936 ± 0.219
TyrVal: 2.089 ± 0.359
TyrTrp: 0.288 ± 0.14
TyrTyr: 0.648 ± 0.236
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13885 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
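A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the spread is reported as the population standard deviation of those estimates. The function names, the choice of standard deviation as the error measure, normalisation by pair count, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: each round draws len(proteins) proteins with replacement
    and the error is the standard deviation across rounds.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a missing key counts as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy sequences (one-letter codes), not from the phage proteome.
toy = ["MAAG", "AAK", "GGK"]
aa_error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps the residues of one protein together, so the error reflects variation between proteins.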

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski