Amino acid dipepetide frequency for Pectobacterium phage Zenivior

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.8AlaAla: 12.8 ± 1.536
0.78AlaCys: 0.78 ± 0.31
5.385AlaAsp: 5.385 ± 0.67
5.541AlaGlu: 5.541 ± 0.758
2.966AlaPhe: 2.966 ± 0.514
6.556AlaGly: 6.556 ± 0.797
2.263AlaHis: 2.263 ± 0.443
3.278AlaIle: 3.278 ± 0.429
4.995AlaLys: 4.995 ± 0.558
9.209AlaLeu: 9.209 ± 0.809
2.654AlaMet: 2.654 ± 0.456
3.512AlaAsn: 3.512 ± 0.735
3.668AlaPro: 3.668 ± 0.649
5.151AlaGln: 5.151 ± 0.676
4.839AlaArg: 4.839 ± 0.606
6.79AlaSer: 6.79 ± 0.824
5.931AlaThr: 5.931 ± 1.028
8.117AlaVal: 8.117 ± 0.832
1.171AlaTrp: 1.171 ± 0.298
3.278AlaTyr: 3.278 ± 0.485
0.0AlaXaa: 0.0 ± 0.0
Cys
0.702CysAla: 0.702 ± 0.263
0.234CysCys: 0.234 ± 0.13
0.624CysAsp: 0.624 ± 0.191
0.39CysGlu: 0.39 ± 0.165
0.156CysPhe: 0.156 ± 0.098
0.78CysGly: 0.78 ± 0.256
0.468CysHis: 0.468 ± 0.211
1.015CysIle: 1.015 ± 0.329
0.156CysLys: 0.156 ± 0.113
0.859CysLeu: 0.859 ± 0.251
0.624CysMet: 0.624 ± 0.21
0.312CysAsn: 0.312 ± 0.147
0.624CysPro: 0.624 ± 0.279
0.39CysGln: 0.39 ± 0.15
0.624CysArg: 0.624 ± 0.262
0.937CysSer: 0.937 ± 0.316
0.702CysThr: 0.702 ± 0.227
0.78CysVal: 0.78 ± 0.242
0.234CysTrp: 0.234 ± 0.131
0.546CysTyr: 0.546 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
7.414AspAla: 7.414 ± 0.874
0.546AspCys: 0.546 ± 0.226
3.59AspAsp: 3.59 ± 0.574
3.59AspGlu: 3.59 ± 0.609
1.639AspPhe: 1.639 ± 0.255
4.605AspGly: 4.605 ± 0.687
0.859AspHis: 0.859 ± 0.273
3.824AspIle: 3.824 ± 0.511
2.419AspLys: 2.419 ± 0.629
4.527AspLeu: 4.527 ± 0.697
2.341AspMet: 2.341 ± 0.377
2.732AspAsn: 2.732 ± 0.406
1.873AspPro: 1.873 ± 0.311
1.249AspGln: 1.249 ± 0.352
3.278AspArg: 3.278 ± 0.609
4.371AspSer: 4.371 ± 0.552
4.293AspThr: 4.293 ± 0.533
4.605AspVal: 4.605 ± 0.568
1.483AspTrp: 1.483 ± 0.315
2.185AspTyr: 2.185 ± 0.494
0.0AspXaa: 0.0 ± 0.0
Glu
4.761GluAla: 4.761 ± 0.713
0.546GluCys: 0.546 ± 0.245
4.214GluAsp: 4.214 ± 0.511
3.278GluGlu: 3.278 ± 0.841
2.654GluPhe: 2.654 ± 0.52
2.732GluGly: 2.732 ± 0.515
1.327GluHis: 1.327 ± 0.322
2.497GluIle: 2.497 ± 0.42
2.497GluLys: 2.497 ± 0.496
5.073GluLeu: 5.073 ± 0.448
1.717GluMet: 1.717 ± 0.332
2.029GluAsn: 2.029 ± 0.406
1.327GluPro: 1.327 ± 0.28
3.2GluGln: 3.2 ± 0.489
2.654GluArg: 2.654 ± 0.407
2.654GluSer: 2.654 ± 0.459
2.888GluThr: 2.888 ± 0.49
3.902GluVal: 3.902 ± 0.497
0.624GluTrp: 0.624 ± 0.222
2.185GluTyr: 2.185 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
2.419PheAla: 2.419 ± 0.402
0.156PheCys: 0.156 ± 0.099
2.654PheAsp: 2.654 ± 0.469
1.093PheGlu: 1.093 ± 0.226
0.78PhePhe: 0.78 ± 0.253
2.654PheGly: 2.654 ± 0.451
0.546PheHis: 0.546 ± 0.228
1.951PheIle: 1.951 ± 0.327
1.093PheLys: 1.093 ± 0.29
2.185PheLeu: 2.185 ± 0.421
0.546PheMet: 0.546 ± 0.149
1.873PheAsn: 1.873 ± 0.52
1.171PhePro: 1.171 ± 0.265
1.483PheGln: 1.483 ± 0.322
1.561PheArg: 1.561 ± 0.332
1.795PheSer: 1.795 ± 0.367
1.639PheThr: 1.639 ± 0.445
2.185PheVal: 2.185 ± 0.455
0.312PheTrp: 0.312 ± 0.119
0.859PheTyr: 0.859 ± 0.252
0.0PheXaa: 0.0 ± 0.0
Gly
7.102GlyAla: 7.102 ± 0.839
0.937GlyCys: 0.937 ± 0.319
4.293GlyAsp: 4.293 ± 0.748
2.888GlyGlu: 2.888 ± 0.443
2.732GlyPhe: 2.732 ± 0.344
5.697GlyGly: 5.697 ± 0.691
0.702GlyHis: 0.702 ± 0.228
4.683GlyIle: 4.683 ± 0.73
3.824GlyLys: 3.824 ± 0.619
5.931GlyLeu: 5.931 ± 0.657
2.029GlyMet: 2.029 ± 0.319
3.122GlyAsn: 3.122 ± 0.658
1.249GlyPro: 1.249 ± 0.369
2.185GlyGln: 2.185 ± 0.369
3.668GlyArg: 3.668 ± 0.447
5.307GlySer: 5.307 ± 0.555
7.57GlyThr: 7.57 ± 0.884
6.322GlyVal: 6.322 ± 0.883
0.859GlyTrp: 0.859 ± 0.277
3.746GlyTyr: 3.746 ± 0.615
0.0GlyXaa: 0.0 ± 0.0
His
1.717HisAla: 1.717 ± 0.39
0.312HisCys: 0.312 ± 0.129
1.249HisAsp: 1.249 ± 0.298
1.249HisGlu: 1.249 ± 0.398
0.468HisPhe: 0.468 ± 0.157
1.873HisGly: 1.873 ± 0.438
0.624HisHis: 0.624 ± 0.233
1.483HisIle: 1.483 ± 0.258
1.249HisLys: 1.249 ± 0.395
2.107HisLeu: 2.107 ± 0.424
0.546HisMet: 0.546 ± 0.266
0.937HisAsn: 0.937 ± 0.221
0.937HisPro: 0.937 ± 0.305
1.093HisGln: 1.093 ± 0.32
1.327HisArg: 1.327 ± 0.323
0.937HisSer: 0.937 ± 0.246
1.327HisThr: 1.327 ± 0.503
1.171HisVal: 1.171 ± 0.327
0.624HisTrp: 0.624 ± 0.225
0.78HisTyr: 0.78 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
3.824IleAla: 3.824 ± 0.606
0.702IleCys: 0.702 ± 0.222
4.293IleAsp: 4.293 ± 0.616
2.732IleGlu: 2.732 ± 0.546
0.937IlePhe: 0.937 ± 0.228
3.044IleGly: 3.044 ± 0.446
0.937IleHis: 0.937 ± 0.207
2.185IleIle: 2.185 ± 0.359
2.497IleLys: 2.497 ± 0.373
3.902IleLeu: 3.902 ± 0.538
1.015IleMet: 1.015 ± 0.297
2.654IleAsn: 2.654 ± 0.597
2.263IlePro: 2.263 ± 0.386
2.107IleGln: 2.107 ± 0.34
1.951IleArg: 1.951 ± 0.332
2.888IleSer: 2.888 ± 0.371
4.136IleThr: 4.136 ± 0.574
2.81IleVal: 2.81 ± 0.496
0.624IleTrp: 0.624 ± 0.226
1.249IleTyr: 1.249 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
5.541LysAla: 5.541 ± 1.021
0.234LysCys: 0.234 ± 0.106
2.81LysAsp: 2.81 ± 0.437
3.512LysGlu: 3.512 ± 0.701
0.546LysPhe: 0.546 ± 0.213
3.278LysGly: 3.278 ± 0.47
0.937LysHis: 0.937 ± 0.272
1.327LysIle: 1.327 ± 0.251
2.107LysLys: 2.107 ± 0.569
5.463LysLeu: 5.463 ± 0.696
0.937LysMet: 0.937 ± 0.281
1.405LysAsn: 1.405 ± 0.396
2.107LysPro: 2.107 ± 0.366
2.888LysGln: 2.888 ± 0.508
2.654LysArg: 2.654 ± 0.49
2.81LysSer: 2.81 ± 0.415
1.483LysThr: 1.483 ± 0.332
3.278LysVal: 3.278 ± 0.567
0.546LysTrp: 0.546 ± 0.216
2.341LysTyr: 2.341 ± 0.499
0.0LysXaa: 0.0 ± 0.0
Leu
7.57LeuAla: 7.57 ± 0.733
1.093LeuCys: 1.093 ± 0.312
4.839LeuAsp: 4.839 ± 0.557
5.073LeuGlu: 5.073 ± 0.599
2.419LeuPhe: 2.419 ± 0.352
6.088LeuGly: 6.088 ± 0.761
2.263LeuHis: 2.263 ± 0.417
3.278LeuIle: 3.278 ± 0.667
3.902LeuLys: 3.902 ± 0.567
7.805LeuLeu: 7.805 ± 0.905
1.951LeuMet: 1.951 ± 0.393
4.371LeuAsn: 4.371 ± 0.593
4.449LeuPro: 4.449 ± 0.487
3.278LeuGln: 3.278 ± 0.616
5.463LeuArg: 5.463 ± 0.773
7.024LeuSer: 7.024 ± 0.899
5.307LeuThr: 5.307 ± 0.769
6.946LeuVal: 6.946 ± 0.626
0.624LeuTrp: 0.624 ± 0.243
3.356LeuTyr: 3.356 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
2.576MetAla: 2.576 ± 0.442
0.234MetCys: 0.234 ± 0.12
1.093MetAsp: 1.093 ± 0.235
0.937MetGlu: 0.937 ± 0.255
1.327MetPhe: 1.327 ± 0.269
1.873MetGly: 1.873 ± 0.378
0.546MetHis: 0.546 ± 0.174
0.702MetIle: 0.702 ± 0.224
0.859MetLys: 0.859 ± 0.212
2.497MetLeu: 2.497 ± 0.501
0.624MetMet: 0.624 ± 0.216
1.093MetAsn: 1.093 ± 0.308
1.015MetPro: 1.015 ± 0.369
1.561MetGln: 1.561 ± 0.369
2.419MetArg: 2.419 ± 0.51
2.185MetSer: 2.185 ± 0.468
1.483MetThr: 1.483 ± 0.279
1.561MetVal: 1.561 ± 0.341
0.234MetTrp: 0.234 ± 0.123
1.249MetTyr: 1.249 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
3.434AsnAla: 3.434 ± 0.492
1.015AsnCys: 1.015 ± 0.324
2.107AsnAsp: 2.107 ± 0.407
1.873AsnGlu: 1.873 ± 0.433
1.015AsnPhe: 1.015 ± 0.315
3.746AsnGly: 3.746 ± 0.546
0.78AsnHis: 0.78 ± 0.242
1.639AsnIle: 1.639 ± 0.437
3.2AsnLys: 3.2 ± 0.433
4.605AsnLeu: 4.605 ± 0.771
1.015AsnMet: 1.015 ± 0.226
2.419AsnAsn: 2.419 ± 0.428
2.732AsnPro: 2.732 ± 0.483
1.873AsnGln: 1.873 ± 0.291
1.951AsnArg: 1.951 ± 0.369
2.654AsnSer: 2.654 ± 0.549
3.902AsnThr: 3.902 ± 0.732
2.654AsnVal: 2.654 ± 0.382
0.624AsnTrp: 0.624 ± 0.208
0.39AsnTyr: 0.39 ± 0.167
0.0AsnXaa: 0.0 ± 0.0
Pro
4.058ProAla: 4.058 ± 0.441
0.234ProCys: 0.234 ± 0.131
3.824ProAsp: 3.824 ± 0.495
3.512ProGlu: 3.512 ± 0.567
0.78ProPhe: 0.78 ± 0.233
2.263ProGly: 2.263 ± 0.306
0.39ProHis: 0.39 ± 0.161
1.639ProIle: 1.639 ± 0.378
1.717ProLys: 1.717 ± 0.378
2.419ProLeu: 2.419 ± 0.501
1.015ProMet: 1.015 ± 0.289
1.405ProAsn: 1.405 ± 0.403
1.327ProPro: 1.327 ± 0.352
1.171ProGln: 1.171 ± 0.332
1.795ProArg: 1.795 ± 0.263
2.576ProSer: 2.576 ± 0.404
2.966ProThr: 2.966 ± 0.507
3.902ProVal: 3.902 ± 0.451
0.78ProTrp: 0.78 ± 0.293
1.795ProTyr: 1.795 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
6.088GlnAla: 6.088 ± 0.702
0.312GlnCys: 0.312 ± 0.139
2.497GlnAsp: 2.497 ± 0.467
2.107GlnGlu: 2.107 ± 0.414
1.483GlnPhe: 1.483 ± 0.324
3.512GlnGly: 3.512 ± 0.549
1.717GlnHis: 1.717 ± 0.317
1.405GlnIle: 1.405 ± 0.397
1.717GlnLys: 1.717 ± 0.371
3.746GlnLeu: 3.746 ± 0.501
0.859GlnMet: 0.859 ± 0.219
2.107GlnAsn: 2.107 ± 0.45
1.561GlnPro: 1.561 ± 0.363
2.576GlnGln: 2.576 ± 0.44
2.419GlnArg: 2.419 ± 0.622
2.263GlnSer: 2.263 ± 0.37
1.873GlnThr: 1.873 ± 0.357
3.2GlnVal: 3.2 ± 0.473
0.312GlnTrp: 0.312 ± 0.16
2.888GlnTyr: 2.888 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
3.98ArgAla: 3.98 ± 0.571
0.78ArgCys: 0.78 ± 0.233
3.746ArgAsp: 3.746 ± 0.487
3.278ArgGlu: 3.278 ± 0.481
1.405ArgPhe: 1.405 ± 0.403
3.98ArgGly: 3.98 ± 0.588
1.483ArgHis: 1.483 ± 0.324
3.59ArgIle: 3.59 ± 0.617
2.419ArgLys: 2.419 ± 0.43
3.746ArgLeu: 3.746 ± 0.474
1.561ArgMet: 1.561 ± 0.375
2.654ArgAsn: 2.654 ± 0.527
1.483ArgPro: 1.483 ± 0.312
2.263ArgGln: 2.263 ± 0.326
4.995ArgArg: 4.995 ± 0.622
3.824ArgSer: 3.824 ± 0.672
3.512ArgThr: 3.512 ± 0.598
4.371ArgVal: 4.371 ± 0.558
0.859ArgTrp: 0.859 ± 0.221
2.81ArgTyr: 2.81 ± 0.494
0.0ArgXaa: 0.0 ± 0.0
Ser
7.18SerAla: 7.18 ± 0.84
0.702SerCys: 0.702 ± 0.238
3.044SerAsp: 3.044 ± 0.424
2.029SerGlu: 2.029 ± 0.365
1.951SerPhe: 1.951 ± 0.409
5.541SerGly: 5.541 ± 0.97
1.093SerHis: 1.093 ± 0.329
3.824SerIle: 3.824 ± 0.617
3.59SerLys: 3.59 ± 0.566
6.088SerLeu: 6.088 ± 0.621
2.029SerMet: 2.029 ± 0.389
3.356SerAsn: 3.356 ± 0.637
2.029SerPro: 2.029 ± 0.403
2.497SerGln: 2.497 ± 0.466
3.122SerArg: 3.122 ± 0.517
4.371SerSer: 4.371 ± 0.573
4.839SerThr: 4.839 ± 0.801
5.775SerVal: 5.775 ± 0.581
0.702SerTrp: 0.702 ± 0.23
1.795SerTyr: 1.795 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
7.57ThrAla: 7.57 ± 1.016
0.702ThrCys: 0.702 ± 0.29
3.356ThrAsp: 3.356 ± 0.582
3.746ThrGlu: 3.746 ± 0.444
1.405ThrPhe: 1.405 ± 0.368
6.322ThrGly: 6.322 ± 1.048
2.263ThrHis: 2.263 ± 0.482
2.185ThrIle: 2.185 ± 0.424
3.044ThrLys: 3.044 ± 0.401
5.463ThrLeu: 5.463 ± 0.652
1.093ThrMet: 1.093 ± 0.236
3.044ThrAsn: 3.044 ± 0.534
3.902ThrPro: 3.902 ± 0.681
1.405ThrGln: 1.405 ± 0.359
3.278ThrArg: 3.278 ± 0.432
4.371ThrSer: 4.371 ± 0.682
4.293ThrThr: 4.293 ± 1.118
5.463ThrVal: 5.463 ± 0.89
0.624ThrTrp: 0.624 ± 0.203
2.419ThrTyr: 2.419 ± 0.43
0.0ThrXaa: 0.0 ± 0.0
Val
6.634ValAla: 6.634 ± 0.689
0.78ValCys: 0.78 ± 0.226
4.527ValAsp: 4.527 ± 0.602
2.732ValGlu: 2.732 ± 0.397
2.185ValPhe: 2.185 ± 0.3
6.322ValGly: 6.322 ± 0.626
1.795ValHis: 1.795 ± 0.391
3.356ValIle: 3.356 ± 0.43
2.888ValLys: 2.888 ± 0.612
7.102ValLeu: 7.102 ± 0.835
1.873ValMet: 1.873 ± 0.419
2.576ValAsn: 2.576 ± 0.428
3.668ValPro: 3.668 ± 0.475
5.853ValGln: 5.853 ± 0.659
4.683ValArg: 4.683 ± 0.596
4.371ValSer: 4.371 ± 0.602
4.449ValThr: 4.449 ± 0.709
4.371ValVal: 4.371 ± 0.686
1.171ValTrp: 1.171 ± 0.319
3.044ValTyr: 3.044 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
0.859TrpAla: 0.859 ± 0.248
0.234TrpCys: 0.234 ± 0.13
0.624TrpAsp: 0.624 ± 0.196
0.937TrpGlu: 0.937 ± 0.251
0.702TrpPhe: 0.702 ± 0.28
1.171TrpGly: 1.171 ± 0.31
0.078TrpHis: 0.078 ± 0.07
0.39TrpIle: 0.39 ± 0.194
0.156TrpLys: 0.156 ± 0.114
1.327TrpLeu: 1.327 ± 0.392
0.39TrpMet: 0.39 ± 0.167
0.624TrpAsn: 0.624 ± 0.229
0.546TrpPro: 0.546 ± 0.196
0.624TrpGln: 0.624 ± 0.219
0.859TrpArg: 0.859 ± 0.25
0.702TrpSer: 0.702 ± 0.206
0.546TrpThr: 0.546 ± 0.189
1.171TrpVal: 1.171 ± 0.281
0.312TrpTrp: 0.312 ± 0.207
1.249TrpTyr: 1.249 ± 0.336
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.654TyrAla: 2.654 ± 0.456
0.624TyrCys: 0.624 ± 0.235
2.654TyrAsp: 2.654 ± 0.393
2.263TyrGlu: 2.263 ± 0.418
1.405TyrPhe: 1.405 ± 0.292
2.81TyrGly: 2.81 ± 0.613
0.937TyrHis: 0.937 ± 0.29
2.497TyrIle: 2.497 ± 0.367
1.873TyrLys: 1.873 ± 0.478
3.122TyrLeu: 3.122 ± 0.535
1.171TyrMet: 1.171 ± 0.342
1.405TyrAsn: 1.405 ± 0.271
1.639TyrPro: 1.639 ± 0.302
1.795TyrGln: 1.795 ± 0.3
3.2TyrArg: 3.2 ± 0.437
2.732TyrSer: 2.732 ± 0.442
2.81TyrThr: 2.81 ± 0.596
1.795TyrVal: 1.795 ± 0.342
0.78TyrTrp: 0.78 ± 0.305
1.717TyrTyr: 1.717 ± 0.455
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12814 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski