Amino acid dipepetide frequency for Pectobacterium phage Slant

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.989AlaAla: 11.989 ± 1.466
1.017AlaCys: 1.017 ± 0.295
5.377AlaAsp: 5.377 ± 0.668
5.449AlaGlu: 5.449 ± 0.719
3.27AlaPhe: 3.27 ± 0.554
6.394AlaGly: 6.394 ± 0.636
1.962AlaHis: 1.962 ± 0.381
2.834AlaIle: 2.834 ± 0.407
5.086AlaLys: 5.086 ± 0.595
9.446AlaLeu: 9.446 ± 0.793
2.616AlaMet: 2.616 ± 0.454
3.851AlaAsn: 3.851 ± 0.668
3.633AlaPro: 3.633 ± 0.591
5.667AlaGln: 5.667 ± 0.871
4.432AlaArg: 4.432 ± 0.613
6.394AlaSer: 6.394 ± 0.973
4.432AlaThr: 4.432 ± 0.722
7.048AlaVal: 7.048 ± 0.813
1.381AlaTrp: 1.381 ± 0.284
3.27AlaTyr: 3.27 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.654CysAla: 0.654 ± 0.244
0.436CysCys: 0.436 ± 0.201
0.654CysAsp: 0.654 ± 0.204
0.436CysGlu: 0.436 ± 0.174
0.363CysPhe: 0.363 ± 0.172
0.799CysGly: 0.799 ± 0.229
0.363CysHis: 0.363 ± 0.174
0.799CysIle: 0.799 ± 0.283
0.145CysLys: 0.145 ± 0.097
0.945CysLeu: 0.945 ± 0.241
0.654CysMet: 0.654 ± 0.219
0.799CysAsn: 0.799 ± 0.271
0.581CysPro: 0.581 ± 0.2
0.363CysGln: 0.363 ± 0.15
0.581CysArg: 0.581 ± 0.202
0.945CysSer: 0.945 ± 0.259
0.799CysThr: 0.799 ± 0.237
0.872CysVal: 0.872 ± 0.271
0.218CysTrp: 0.218 ± 0.128
0.799CysTyr: 0.799 ± 0.214
0.0CysXaa: 0.0 ± 0.0
Asp
7.121AspAla: 7.121 ± 0.602
0.363AspCys: 0.363 ± 0.174
4.214AspAsp: 4.214 ± 0.586
3.27AspGlu: 3.27 ± 0.45
1.889AspPhe: 1.889 ± 0.317
4.432AspGly: 4.432 ± 0.538
0.799AspHis: 0.799 ± 0.263
3.996AspIle: 3.996 ± 0.453
2.834AspLys: 2.834 ± 0.57
4.868AspLeu: 4.868 ± 0.656
2.252AspMet: 2.252 ± 0.392
2.688AspAsn: 2.688 ± 0.408
2.252AspPro: 2.252 ± 0.39
1.235AspGln: 1.235 ± 0.346
2.834AspArg: 2.834 ± 0.528
4.723AspSer: 4.723 ± 0.556
4.36AspThr: 4.36 ± 0.471
4.505AspVal: 4.505 ± 0.549
1.598AspTrp: 1.598 ± 0.324
2.107AspTyr: 2.107 ± 0.42
0.0AspXaa: 0.0 ± 0.0
Glu
4.577GluAla: 4.577 ± 0.546
0.581GluCys: 0.581 ± 0.28
3.851GluAsp: 3.851 ± 0.529
3.052GluGlu: 3.052 ± 0.674
2.834GluPhe: 2.834 ± 0.415
2.834GluGly: 2.834 ± 0.38
1.308GluHis: 1.308 ± 0.381
2.034GluIle: 2.034 ± 0.43
2.834GluLys: 2.834 ± 0.523
4.795GluLeu: 4.795 ± 0.507
1.598GluMet: 1.598 ± 0.366
2.034GluAsn: 2.034 ± 0.349
1.235GluPro: 1.235 ± 0.263
3.197GluGln: 3.197 ± 0.477
2.543GluArg: 2.543 ± 0.504
3.124GluSer: 3.124 ± 0.398
2.688GluThr: 2.688 ± 0.449
4.069GluVal: 4.069 ± 0.671
0.654GluTrp: 0.654 ± 0.215
2.761GluTyr: 2.761 ± 0.49
0.0GluXaa: 0.0 ± 0.0
Phe
2.688PheAla: 2.688 ± 0.427
0.291PheCys: 0.291 ± 0.14
2.616PheAsp: 2.616 ± 0.443
1.526PheGlu: 1.526 ± 0.297
0.872PhePhe: 0.872 ± 0.257
2.616PheGly: 2.616 ± 0.446
0.509PheHis: 0.509 ± 0.192
1.453PheIle: 1.453 ± 0.309
1.744PheLys: 1.744 ± 0.358
2.107PheLeu: 2.107 ± 0.388
0.727PheMet: 0.727 ± 0.171
1.816PheAsn: 1.816 ± 0.493
1.09PhePro: 1.09 ± 0.273
1.526PheGln: 1.526 ± 0.274
1.744PheArg: 1.744 ± 0.372
1.744PheSer: 1.744 ± 0.36
1.453PheThr: 1.453 ± 0.358
2.47PheVal: 2.47 ± 0.51
0.291PheTrp: 0.291 ± 0.139
0.799PheTyr: 0.799 ± 0.246
0.0PheXaa: 0.0 ± 0.0
Gly
6.757GlyAla: 6.757 ± 0.78
1.453GlyCys: 1.453 ± 0.443
4.069GlyAsp: 4.069 ± 0.631
3.197GlyGlu: 3.197 ± 0.473
2.834GlyPhe: 2.834 ± 0.373
5.159GlyGly: 5.159 ± 0.782
0.799GlyHis: 0.799 ± 0.242
5.159GlyIle: 5.159 ± 0.646
3.633GlyLys: 3.633 ± 0.498
6.249GlyLeu: 6.249 ± 0.66
1.816GlyMet: 1.816 ± 0.292
3.706GlyAsn: 3.706 ± 0.472
1.235GlyPro: 1.235 ± 0.382
2.252GlyGln: 2.252 ± 0.434
4.214GlyArg: 4.214 ± 0.565
5.377GlySer: 5.377 ± 0.531
7.266GlyThr: 7.266 ± 0.734
6.031GlyVal: 6.031 ± 0.637
0.799GlyTrp: 0.799 ± 0.258
3.996GlyTyr: 3.996 ± 0.642
0.0GlyXaa: 0.0 ± 0.0
His
1.816HisAla: 1.816 ± 0.412
0.291HisCys: 0.291 ± 0.131
1.381HisAsp: 1.381 ± 0.325
1.308HisGlu: 1.308 ± 0.335
0.218HisPhe: 0.218 ± 0.145
1.381HisGly: 1.381 ± 0.405
0.363HisHis: 0.363 ± 0.143
1.308HisIle: 1.308 ± 0.315
0.799HisLys: 0.799 ± 0.376
1.889HisLeu: 1.889 ± 0.443
0.509HisMet: 0.509 ± 0.228
0.654HisAsn: 0.654 ± 0.188
1.308HisPro: 1.308 ± 0.348
0.727HisGln: 0.727 ± 0.225
1.09HisArg: 1.09 ± 0.273
0.945HisSer: 0.945 ± 0.265
0.945HisThr: 0.945 ± 0.321
1.526HisVal: 1.526 ± 0.308
0.436HisTrp: 0.436 ± 0.183
0.799HisTyr: 0.799 ± 0.31
0.0HisXaa: 0.0 ± 0.0
Ile
3.851IleAla: 3.851 ± 0.487
0.436IleCys: 0.436 ± 0.207
3.778IleAsp: 3.778 ± 0.67
2.688IleGlu: 2.688 ± 0.457
0.799IlePhe: 0.799 ± 0.244
2.688IleGly: 2.688 ± 0.417
0.799IleHis: 0.799 ± 0.211
2.398IleIle: 2.398 ± 0.399
2.543IleLys: 2.543 ± 0.481
3.996IleLeu: 3.996 ± 0.714
0.872IleMet: 0.872 ± 0.267
2.47IleAsn: 2.47 ± 0.559
2.18IlePro: 2.18 ± 0.294
2.761IleGln: 2.761 ± 0.44
1.598IleArg: 1.598 ± 0.323
3.415IleSer: 3.415 ± 0.465
4.505IleThr: 4.505 ± 0.713
2.325IleVal: 2.325 ± 0.425
0.799IleTrp: 0.799 ± 0.306
1.163IleTyr: 1.163 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
5.231LysAla: 5.231 ± 0.787
0.291LysCys: 0.291 ± 0.127
3.124LysAsp: 3.124 ± 0.512
3.778LysGlu: 3.778 ± 0.697
0.727LysPhe: 0.727 ± 0.269
2.979LysGly: 2.979 ± 0.44
0.945LysHis: 0.945 ± 0.298
1.308LysIle: 1.308 ± 0.292
2.18LysLys: 2.18 ± 0.58
4.868LysLeu: 4.868 ± 0.657
1.163LysMet: 1.163 ± 0.369
1.453LysAsn: 1.453 ± 0.392
1.962LysPro: 1.962 ± 0.309
2.398LysGln: 2.398 ± 0.494
2.688LysArg: 2.688 ± 0.53
2.398LysSer: 2.398 ± 0.466
1.744LysThr: 1.744 ± 0.411
3.342LysVal: 3.342 ± 0.503
0.436LysTrp: 0.436 ± 0.182
2.688LysTyr: 2.688 ± 0.501
0.0LysXaa: 0.0 ± 0.0
Leu
7.193LeuAla: 7.193 ± 0.93
1.09LeuCys: 1.09 ± 0.256
5.159LeuAsp: 5.159 ± 0.69
5.086LeuGlu: 5.086 ± 0.681
2.398LeuPhe: 2.398 ± 0.448
6.394LeuGly: 6.394 ± 0.871
2.107LeuHis: 2.107 ± 0.38
3.56LeuIle: 3.56 ± 0.651
3.56LeuLys: 3.56 ± 0.577
7.193LeuLeu: 7.193 ± 0.707
2.034LeuMet: 2.034 ± 0.306
4.723LeuAsn: 4.723 ± 0.649
4.795LeuPro: 4.795 ± 0.545
3.56LeuGln: 3.56 ± 0.597
5.595LeuArg: 5.595 ± 0.764
6.176LeuSer: 6.176 ± 0.612
5.086LeuThr: 5.086 ± 0.644
6.975LeuVal: 6.975 ± 0.646
0.654LeuTrp: 0.654 ± 0.22
2.979LeuTyr: 2.979 ± 0.492
0.0LeuXaa: 0.0 ± 0.0
Met
2.398MetAla: 2.398 ± 0.464
0.363MetCys: 0.363 ± 0.16
1.017MetAsp: 1.017 ± 0.28
0.945MetGlu: 0.945 ± 0.222
1.017MetPhe: 1.017 ± 0.254
2.107MetGly: 2.107 ± 0.36
0.654MetHis: 0.654 ± 0.204
0.727MetIle: 0.727 ± 0.217
0.945MetLys: 0.945 ± 0.295
2.47MetLeu: 2.47 ± 0.498
0.654MetMet: 0.654 ± 0.185
0.945MetAsn: 0.945 ± 0.274
1.453MetPro: 1.453 ± 0.466
2.034MetGln: 2.034 ± 0.432
2.034MetArg: 2.034 ± 0.409
1.816MetSer: 1.816 ± 0.42
1.598MetThr: 1.598 ± 0.307
1.744MetVal: 1.744 ± 0.331
0.218MetTrp: 0.218 ± 0.118
1.308MetTyr: 1.308 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
3.706AsnAla: 3.706 ± 0.549
0.727AsnCys: 0.727 ± 0.262
1.744AsnAsp: 1.744 ± 0.316
2.18AsnGlu: 2.18 ± 0.414
1.453AsnPhe: 1.453 ± 0.309
4.214AsnGly: 4.214 ± 0.603
0.872AsnHis: 0.872 ± 0.304
1.889AsnIle: 1.889 ± 0.482
2.398AsnLys: 2.398 ± 0.375
4.941AsnLeu: 4.941 ± 0.78
1.526AsnMet: 1.526 ± 0.336
2.616AsnAsn: 2.616 ± 0.442
2.325AsnPro: 2.325 ± 0.41
2.252AsnGln: 2.252 ± 0.479
2.834AsnArg: 2.834 ± 0.503
2.761AsnSer: 2.761 ± 0.76
3.633AsnThr: 3.633 ± 0.61
2.616AsnVal: 2.616 ± 0.471
0.509AsnTrp: 0.509 ± 0.207
0.872AsnTyr: 0.872 ± 0.192
0.0AsnXaa: 0.0 ± 0.0
Pro
4.069ProAla: 4.069 ± 0.47
0.145ProCys: 0.145 ± 0.101
3.778ProAsp: 3.778 ± 0.572
3.197ProGlu: 3.197 ± 0.48
0.654ProPhe: 0.654 ± 0.232
2.834ProGly: 2.834 ± 0.385
0.363ProHis: 0.363 ± 0.128
1.598ProIle: 1.598 ± 0.426
1.598ProLys: 1.598 ± 0.355
2.47ProLeu: 2.47 ± 0.453
0.799ProMet: 0.799 ± 0.234
1.889ProAsn: 1.889 ± 0.427
1.308ProPro: 1.308 ± 0.351
1.453ProGln: 1.453 ± 0.387
1.381ProArg: 1.381 ± 0.274
2.543ProSer: 2.543 ± 0.419
2.906ProThr: 2.906 ± 0.477
4.214ProVal: 4.214 ± 0.509
0.872ProTrp: 0.872 ± 0.304
1.598ProTyr: 1.598 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
5.885GlnAla: 5.885 ± 0.911
0.436GlnCys: 0.436 ± 0.179
2.834GlnAsp: 2.834 ± 0.471
2.18GlnGlu: 2.18 ± 0.476
1.744GlnPhe: 1.744 ± 0.34
3.851GlnGly: 3.851 ± 0.663
1.381GlnHis: 1.381 ± 0.313
1.671GlnIle: 1.671 ± 0.432
1.816GlnLys: 1.816 ± 0.401
3.851GlnLeu: 3.851 ± 0.685
1.235GlnMet: 1.235 ± 0.314
2.543GlnAsn: 2.543 ± 0.515
1.235GlnPro: 1.235 ± 0.285
3.052GlnGln: 3.052 ± 0.839
2.325GlnArg: 2.325 ± 0.532
2.688GlnSer: 2.688 ± 0.664
1.962GlnThr: 1.962 ± 0.349
3.27GlnVal: 3.27 ± 0.499
0.291GlnTrp: 0.291 ± 0.132
2.906GlnTyr: 2.906 ± 0.561
0.0GlnXaa: 0.0 ± 0.0
Arg
4.142ArgAla: 4.142 ± 0.549
0.872ArgCys: 0.872 ± 0.251
3.633ArgAsp: 3.633 ± 0.494
2.761ArgGlu: 2.761 ± 0.363
1.381ArgPhe: 1.381 ± 0.325
4.142ArgGly: 4.142 ± 0.565
1.163ArgHis: 1.163 ± 0.213
3.415ArgIle: 3.415 ± 0.6
2.906ArgLys: 2.906 ± 0.475
3.851ArgLeu: 3.851 ± 0.535
1.671ArgMet: 1.671 ± 0.473
2.616ArgAsn: 2.616 ± 0.529
1.235ArgPro: 1.235 ± 0.307
2.107ArgGln: 2.107 ± 0.411
4.287ArgArg: 4.287 ± 0.511
3.197ArgSer: 3.197 ± 0.662
2.834ArgThr: 2.834 ± 0.43
4.723ArgVal: 4.723 ± 0.657
1.09ArgTrp: 1.09 ± 0.289
2.18ArgTyr: 2.18 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
7.048SerAla: 7.048 ± 0.886
0.727SerCys: 0.727 ± 0.211
3.415SerAsp: 3.415 ± 0.482
2.18SerGlu: 2.18 ± 0.377
1.816SerPhe: 1.816 ± 0.444
6.103SerGly: 6.103 ± 0.807
1.163SerHis: 1.163 ± 0.319
3.924SerIle: 3.924 ± 0.717
2.979SerLys: 2.979 ± 0.474
5.304SerLeu: 5.304 ± 0.505
1.962SerMet: 1.962 ± 0.401
3.124SerAsn: 3.124 ± 0.551
1.816SerPro: 1.816 ± 0.373
2.325SerGln: 2.325 ± 0.437
3.197SerArg: 3.197 ± 0.467
4.432SerSer: 4.432 ± 0.549
4.723SerThr: 4.723 ± 0.705
6.83SerVal: 6.83 ± 0.667
1.163SerTrp: 1.163 ± 0.305
1.526SerTyr: 1.526 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
6.757ThrAla: 6.757 ± 0.757
0.654ThrCys: 0.654 ± 0.25
3.706ThrAsp: 3.706 ± 0.585
3.56ThrGlu: 3.56 ± 0.462
1.598ThrPhe: 1.598 ± 0.306
6.757ThrGly: 6.757 ± 1.043
1.235ThrHis: 1.235 ± 0.258
1.962ThrIle: 1.962 ± 0.35
2.834ThrLys: 2.834 ± 0.385
5.595ThrLeu: 5.595 ± 0.57
0.945ThrMet: 0.945 ± 0.265
2.543ThrAsn: 2.543 ± 0.455
4.142ThrPro: 4.142 ± 0.658
1.962ThrGln: 1.962 ± 0.473
2.906ThrArg: 2.906 ± 0.479
4.577ThrSer: 4.577 ± 0.659
4.795ThrThr: 4.795 ± 1.195
5.013ThrVal: 5.013 ± 0.681
0.436ThrTrp: 0.436 ± 0.193
2.398ThrTyr: 2.398 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
5.958ValAla: 5.958 ± 0.583
1.017ValCys: 1.017 ± 0.332
4.36ValAsp: 4.36 ± 0.597
3.124ValGlu: 3.124 ± 0.453
2.543ValPhe: 2.543 ± 0.255
5.958ValGly: 5.958 ± 0.584
2.034ValHis: 2.034 ± 0.43
3.197ValIle: 3.197 ± 0.52
3.052ValLys: 3.052 ± 0.676
6.685ValLeu: 6.685 ± 0.596
1.744ValMet: 1.744 ± 0.369
3.27ValAsn: 3.27 ± 0.593
3.706ValPro: 3.706 ± 0.537
5.449ValGln: 5.449 ± 0.731
4.432ValArg: 4.432 ± 0.712
4.795ValSer: 4.795 ± 0.568
5.086ValThr: 5.086 ± 0.696
4.65ValVal: 4.65 ± 0.634
0.872ValTrp: 0.872 ± 0.266
3.56ValTyr: 3.56 ± 0.447
0.0ValXaa: 0.0 ± 0.0
Trp
1.017TrpAla: 1.017 ± 0.296
0.291TrpCys: 0.291 ± 0.15
0.799TrpAsp: 0.799 ± 0.289
0.799TrpGlu: 0.799 ± 0.257
0.581TrpPhe: 0.581 ± 0.241
1.163TrpGly: 1.163 ± 0.391
0.145TrpHis: 0.145 ± 0.098
0.363TrpIle: 0.363 ± 0.207
0.218TrpLys: 0.218 ± 0.113
1.453TrpLeu: 1.453 ± 0.391
0.363TrpMet: 0.363 ± 0.16
0.581TrpAsn: 0.581 ± 0.181
0.436TrpPro: 0.436 ± 0.168
0.799TrpGln: 0.799 ± 0.24
0.799TrpArg: 0.799 ± 0.237
0.799TrpSer: 0.799 ± 0.247
0.581TrpThr: 0.581 ± 0.235
1.09TrpVal: 1.09 ± 0.278
0.218TrpTrp: 0.218 ± 0.13
1.163TrpTyr: 1.163 ± 0.325
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.688TyrAla: 2.688 ± 0.402
0.727TyrCys: 0.727 ± 0.241
2.834TyrAsp: 2.834 ± 0.429
1.962TyrGlu: 1.962 ± 0.384
1.09TyrPhe: 1.09 ± 0.285
3.197TyrGly: 3.197 ± 0.678
0.727TyrHis: 0.727 ± 0.218
2.47TyrIle: 2.47 ± 0.475
1.598TyrLys: 1.598 ± 0.412
3.342TyrLeu: 3.342 ± 0.567
1.235TyrMet: 1.235 ± 0.318
1.744TyrAsn: 1.744 ± 0.427
1.744TyrPro: 1.744 ± 0.31
2.034TyrGln: 2.034 ± 0.424
2.688TyrArg: 2.688 ± 0.431
2.906TyrSer: 2.906 ± 0.451
2.979TyrThr: 2.979 ± 0.451
2.18TyrVal: 2.18 ± 0.346
0.727TyrTrp: 0.727 ± 0.283
1.744TyrTyr: 1.744 ± 0.53
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski