Amino acid dipepetide frequency for Pectobacterium phage DU_PP_II

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.672AlaAla: 8.672 ± 1.177
0.515AlaCys: 0.515 ± 0.204
4.722AlaAsp: 4.722 ± 0.646
5.152AlaGlu: 5.152 ± 0.636
3.263AlaPhe: 3.263 ± 0.452
7.813AlaGly: 7.813 ± 0.732
1.545AlaHis: 1.545 ± 0.299
4.207AlaIle: 4.207 ± 0.646
6.697AlaLys: 6.697 ± 0.806
7.985AlaLeu: 7.985 ± 0.814
2.662AlaMet: 2.662 ± 0.605
4.722AlaAsn: 4.722 ± 0.544
3.434AlaPro: 3.434 ± 0.474
4.121AlaGln: 4.121 ± 0.774
4.035AlaArg: 4.035 ± 0.738
6.01AlaSer: 6.01 ± 1.0
5.838AlaThr: 5.838 ± 0.749
5.237AlaVal: 5.237 ± 0.582
1.03AlaTrp: 1.03 ± 0.332
2.232AlaTyr: 2.232 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.429CysAla: 0.429 ± 0.156
0.086CysCys: 0.086 ± 0.084
0.601CysAsp: 0.601 ± 0.28
0.859CysGlu: 0.859 ± 0.394
0.515CysPhe: 0.515 ± 0.3
0.515CysGly: 0.515 ± 0.2
0.429CysHis: 0.429 ± 0.189
0.515CysIle: 0.515 ± 0.213
0.515CysLys: 0.515 ± 0.265
0.687CysLeu: 0.687 ± 0.223
0.086CysMet: 0.086 ± 0.084
0.258CysAsn: 0.258 ± 0.146
0.601CysPro: 0.601 ± 0.257
0.343CysGln: 0.343 ± 0.167
0.773CysArg: 0.773 ± 0.271
0.601CysSer: 0.601 ± 0.284
0.172CysThr: 0.172 ± 0.117
0.515CysVal: 0.515 ± 0.25
0.343CysTrp: 0.343 ± 0.167
0.258CysTyr: 0.258 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
5.237AspAla: 5.237 ± 0.657
0.687AspCys: 0.687 ± 0.302
4.035AspAsp: 4.035 ± 0.666
3.778AspGlu: 3.778 ± 0.488
2.747AspPhe: 2.747 ± 0.358
5.495AspGly: 5.495 ± 0.594
0.773AspHis: 0.773 ± 0.208
3.606AspIle: 3.606 ± 0.5
3.606AspLys: 3.606 ± 0.56
4.121AspLeu: 4.121 ± 0.717
1.803AspMet: 1.803 ± 0.332
2.318AspAsn: 2.318 ± 0.34
3.177AspPro: 3.177 ± 0.49
1.631AspGln: 1.631 ± 0.387
2.49AspArg: 2.49 ± 0.511
3.778AspSer: 3.778 ± 0.457
3.864AspThr: 3.864 ± 0.483
4.379AspVal: 4.379 ± 0.553
0.601AspTrp: 0.601 ± 0.198
2.404AspTyr: 2.404 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
8.843GluAla: 8.843 ± 0.84
0.944GluCys: 0.944 ± 0.352
3.692GluAsp: 3.692 ± 0.694
4.293GluGlu: 4.293 ± 1.196
2.833GluPhe: 2.833 ± 0.341
5.409GluGly: 5.409 ± 0.716
1.202GluHis: 1.202 ± 0.349
3.177GluIle: 3.177 ± 0.455
2.49GluLys: 2.49 ± 0.427
5.237GluLeu: 5.237 ± 0.49
1.46GluMet: 1.46 ± 0.336
1.975GluAsn: 1.975 ± 0.32
2.662GluPro: 2.662 ± 0.48
2.833GluGln: 2.833 ± 0.629
3.606GluArg: 3.606 ± 0.556
4.035GluSer: 4.035 ± 0.627
3.005GluThr: 3.005 ± 0.346
3.864GluVal: 3.864 ± 0.57
0.859GluTrp: 0.859 ± 0.257
2.919GluTyr: 2.919 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
2.662PheAla: 2.662 ± 0.359
0.601PheCys: 0.601 ± 0.275
2.833PheAsp: 2.833 ± 0.484
1.374PheGlu: 1.374 ± 0.258
1.288PhePhe: 1.288 ± 0.38
3.091PheGly: 3.091 ± 0.529
0.859PheHis: 0.859 ± 0.279
1.374PheIle: 1.374 ± 0.323
2.662PheLys: 2.662 ± 0.468
3.434PheLeu: 3.434 ± 0.525
1.116PheMet: 1.116 ± 0.268
1.545PheAsn: 1.545 ± 0.429
1.46PhePro: 1.46 ± 0.38
1.631PheGln: 1.631 ± 0.32
1.975PheArg: 1.975 ± 0.476
2.318PheSer: 2.318 ± 0.52
3.52PheThr: 3.52 ± 0.561
2.404PheVal: 2.404 ± 0.429
0.429PheTrp: 0.429 ± 0.176
0.773PheTyr: 0.773 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
6.182GlyAla: 6.182 ± 0.588
0.601GlyCys: 0.601 ± 0.317
5.323GlyAsp: 5.323 ± 0.904
4.551GlyGlu: 4.551 ± 0.64
3.091GlyPhe: 3.091 ± 0.414
5.581GlyGly: 5.581 ± 0.812
1.202GlyHis: 1.202 ± 0.33
5.152GlyIle: 5.152 ± 0.703
5.409GlyLys: 5.409 ± 0.919
7.384GlyLeu: 7.384 ± 0.874
1.975GlyMet: 1.975 ± 0.438
4.121GlyAsn: 4.121 ± 0.571
0.601GlyPro: 0.601 ± 0.251
3.434GlyGln: 3.434 ± 0.494
4.465GlyArg: 4.465 ± 0.499
6.611GlySer: 6.611 ± 0.794
4.293GlyThr: 4.293 ± 0.497
4.808GlyVal: 4.808 ± 0.666
1.889GlyTrp: 1.889 ± 0.506
2.833GlyTyr: 2.833 ± 0.59
0.0GlyXaa: 0.0 ± 0.0
His
1.288HisAla: 1.288 ± 0.37
0.172HisCys: 0.172 ± 0.108
1.03HisAsp: 1.03 ± 0.258
1.202HisGlu: 1.202 ± 0.258
0.859HisPhe: 0.859 ± 0.283
1.116HisGly: 1.116 ± 0.273
0.343HisHis: 0.343 ± 0.169
1.374HisIle: 1.374 ± 0.298
1.288HisLys: 1.288 ± 0.376
1.545HisLeu: 1.545 ± 0.418
0.687HisMet: 0.687 ± 0.223
0.515HisAsn: 0.515 ± 0.189
0.429HisPro: 0.429 ± 0.17
0.343HisGln: 0.343 ± 0.159
0.773HisArg: 0.773 ± 0.237
1.202HisSer: 1.202 ± 0.276
1.03HisThr: 1.03 ± 0.343
0.944HisVal: 0.944 ± 0.25
0.258HisTrp: 0.258 ± 0.122
1.03HisTyr: 1.03 ± 0.277
0.0HisXaa: 0.0 ± 0.0
Ile
4.121IleAla: 4.121 ± 0.578
0.601IleCys: 0.601 ± 0.259
3.864IleAsp: 3.864 ± 0.558
3.95IleGlu: 3.95 ± 0.462
1.03IlePhe: 1.03 ± 0.267
4.121IleGly: 4.121 ± 0.457
1.288IleHis: 1.288 ± 0.353
2.833IleIle: 2.833 ± 0.539
3.263IleLys: 3.263 ± 0.55
4.035IleLeu: 4.035 ± 0.525
1.631IleMet: 1.631 ± 0.387
1.631IleAsn: 1.631 ± 0.634
3.177IlePro: 3.177 ± 0.459
1.717IleGln: 1.717 ± 0.47
3.349IleArg: 3.349 ± 0.479
3.263IleSer: 3.263 ± 0.431
3.177IleThr: 3.177 ± 0.461
3.177IleVal: 3.177 ± 0.418
0.515IleTrp: 0.515 ± 0.217
1.889IleTyr: 1.889 ± 0.336
0.0IleXaa: 0.0 ± 0.0
Lys
6.697LysAla: 6.697 ± 0.75
0.515LysCys: 0.515 ± 0.188
3.349LysAsp: 3.349 ± 0.479
4.035LysGlu: 4.035 ± 0.577
2.318LysPhe: 2.318 ± 0.47
5.066LysGly: 5.066 ± 0.611
1.545LysHis: 1.545 ± 0.425
1.46LysIle: 1.46 ± 0.37
2.919LysLys: 2.919 ± 0.564
4.98LysLeu: 4.98 ± 0.838
1.46LysMet: 1.46 ± 0.371
1.889LysAsn: 1.889 ± 0.306
2.662LysPro: 2.662 ± 0.382
3.091LysGln: 3.091 ± 0.662
4.035LysArg: 4.035 ± 0.747
3.778LysSer: 3.778 ± 0.572
3.434LysThr: 3.434 ± 0.428
5.838LysVal: 5.838 ± 0.773
0.515LysTrp: 0.515 ± 0.209
1.975LysTyr: 1.975 ± 0.349
0.0LysXaa: 0.0 ± 0.0
Leu
6.268LeuAla: 6.268 ± 0.848
0.258LeuCys: 0.258 ± 0.14
4.379LeuAsp: 4.379 ± 0.57
5.495LeuGlu: 5.495 ± 1.041
2.146LeuPhe: 2.146 ± 0.458
5.581LeuGly: 5.581 ± 0.718
0.773LeuHis: 0.773 ± 0.235
4.379LeuIle: 4.379 ± 0.719
6.182LeuLys: 6.182 ± 0.771
5.667LeuLeu: 5.667 ± 0.634
2.747LeuMet: 2.747 ± 0.475
4.551LeuAsn: 4.551 ± 0.799
3.263LeuPro: 3.263 ± 0.62
3.692LeuGln: 3.692 ± 0.563
4.465LeuArg: 4.465 ± 0.574
4.98LeuSer: 4.98 ± 0.587
6.525LeuThr: 6.525 ± 0.898
5.152LeuVal: 5.152 ± 0.585
1.631LeuTrp: 1.631 ± 0.391
2.662LeuTyr: 2.662 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
2.747MetAla: 2.747 ± 0.398
0.172MetCys: 0.172 ± 0.132
1.374MetAsp: 1.374 ± 0.319
1.975MetGlu: 1.975 ± 0.522
1.46MetPhe: 1.46 ± 0.275
1.803MetGly: 1.803 ± 0.38
0.0MetHis: 0.0 ± 0.0
0.773MetIle: 0.773 ± 0.188
1.03MetLys: 1.03 ± 0.235
3.091MetLeu: 3.091 ± 0.451
0.429MetMet: 0.429 ± 0.198
1.288MetAsn: 1.288 ± 0.31
0.773MetPro: 0.773 ± 0.239
1.288MetGln: 1.288 ± 0.561
1.116MetArg: 1.116 ± 0.252
1.803MetSer: 1.803 ± 0.346
1.545MetThr: 1.545 ± 0.307
2.833MetVal: 2.833 ± 0.495
0.258MetTrp: 0.258 ± 0.141
0.429MetTyr: 0.429 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
3.692AsnAla: 3.692 ± 0.527
0.343AsnCys: 0.343 ± 0.193
2.232AsnAsp: 2.232 ± 0.446
2.576AsnGlu: 2.576 ± 0.468
1.889AsnPhe: 1.889 ± 0.324
6.01AsnGly: 6.01 ± 0.801
0.601AsnHis: 0.601 ± 0.268
2.232AsnIle: 2.232 ± 0.468
2.576AsnLys: 2.576 ± 0.448
3.434AsnLeu: 3.434 ± 0.699
1.374AsnMet: 1.374 ± 0.348
1.374AsnAsn: 1.374 ± 0.363
2.833AsnPro: 2.833 ± 0.447
1.803AsnGln: 1.803 ± 0.316
2.318AsnArg: 2.318 ± 0.416
2.232AsnSer: 2.232 ± 0.53
1.803AsnThr: 1.803 ± 0.366
3.778AsnVal: 3.778 ± 0.832
0.859AsnTrp: 0.859 ± 0.235
1.116AsnTyr: 1.116 ± 0.309
0.0AsnXaa: 0.0 ± 0.0
Pro
3.434ProAla: 3.434 ± 0.366
0.343ProCys: 0.343 ± 0.207
2.404ProAsp: 2.404 ± 0.483
4.636ProGlu: 4.636 ± 0.772
1.46ProPhe: 1.46 ± 0.332
0.0ProGly: 0.0 ± 0.0
0.515ProHis: 0.515 ± 0.174
1.975ProIle: 1.975 ± 0.352
3.005ProLys: 3.005 ± 0.562
2.747ProLeu: 2.747 ± 0.491
0.859ProMet: 0.859 ± 0.255
2.919ProAsn: 2.919 ± 0.44
0.944ProPro: 0.944 ± 0.275
1.288ProGln: 1.288 ± 0.284
1.46ProArg: 1.46 ± 0.278
2.919ProSer: 2.919 ± 0.405
1.975ProThr: 1.975 ± 0.319
3.177ProVal: 3.177 ± 0.39
1.03ProTrp: 1.03 ± 0.222
1.631ProTyr: 1.631 ± 0.463
0.0ProXaa: 0.0 ± 0.0
Gln
5.066GlnAla: 5.066 ± 0.653
0.343GlnCys: 0.343 ± 0.184
1.889GlnAsp: 1.889 ± 0.343
2.318GlnGlu: 2.318 ± 0.444
2.404GlnPhe: 2.404 ± 0.301
2.833GlnGly: 2.833 ± 0.481
0.429GlnHis: 0.429 ± 0.149
2.576GlnIle: 2.576 ± 0.367
2.404GlnLys: 2.404 ± 0.363
3.95GlnLeu: 3.95 ± 0.58
1.288GlnMet: 1.288 ± 0.402
1.374GlnAsn: 1.374 ± 0.374
1.116GlnPro: 1.116 ± 0.269
3.091GlnGln: 3.091 ± 0.551
2.146GlnArg: 2.146 ± 0.428
2.576GlnSer: 2.576 ± 0.512
2.146GlnThr: 2.146 ± 0.571
2.919GlnVal: 2.919 ± 0.571
0.773GlnTrp: 0.773 ± 0.247
1.374GlnTyr: 1.374 ± 0.443
0.0GlnXaa: 0.0 ± 0.0
Arg
3.52ArgAla: 3.52 ± 0.594
0.859ArgCys: 0.859 ± 0.29
3.349ArgAsp: 3.349 ± 0.555
3.52ArgGlu: 3.52 ± 0.544
1.545ArgPhe: 1.545 ± 0.414
3.692ArgGly: 3.692 ± 0.574
1.202ArgHis: 1.202 ± 0.26
1.975ArgIle: 1.975 ± 0.345
3.349ArgLys: 3.349 ± 0.556
5.237ArgLeu: 5.237 ± 0.66
1.374ArgMet: 1.374 ± 0.394
2.232ArgAsn: 2.232 ± 0.354
2.404ArgPro: 2.404 ± 0.436
2.576ArgGln: 2.576 ± 0.559
2.919ArgArg: 2.919 ± 0.467
4.035ArgSer: 4.035 ± 0.588
2.576ArgThr: 2.576 ± 0.438
3.263ArgVal: 3.263 ± 0.561
0.859ArgTrp: 0.859 ± 0.264
1.803ArgTyr: 1.803 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
6.182SerAla: 6.182 ± 0.609
0.601SerCys: 0.601 ± 0.222
5.323SerAsp: 5.323 ± 0.761
3.434SerGlu: 3.434 ± 0.384
2.576SerPhe: 2.576 ± 0.39
6.268SerGly: 6.268 ± 0.746
1.202SerHis: 1.202 ± 0.313
4.465SerIle: 4.465 ± 0.513
3.778SerLys: 3.778 ± 0.464
4.551SerLeu: 4.551 ± 0.489
1.288SerMet: 1.288 ± 0.304
3.177SerAsn: 3.177 ± 0.437
2.919SerPro: 2.919 ± 0.492
2.662SerGln: 2.662 ± 0.506
2.833SerArg: 2.833 ± 0.492
3.95SerSer: 3.95 ± 0.844
3.091SerThr: 3.091 ± 0.558
3.95SerVal: 3.95 ± 0.593
0.859SerTrp: 0.859 ± 0.269
2.747SerTyr: 2.747 ± 0.446
0.0SerXaa: 0.0 ± 0.0
Thr
4.551ThrAla: 4.551 ± 0.799
0.429ThrCys: 0.429 ± 0.253
3.52ThrAsp: 3.52 ± 0.589
4.035ThrGlu: 4.035 ± 0.594
2.576ThrPhe: 2.576 ± 0.481
5.323ThrGly: 5.323 ± 0.522
1.374ThrHis: 1.374 ± 0.293
3.606ThrIle: 3.606 ± 0.46
4.207ThrLys: 4.207 ± 0.513
5.323ThrLeu: 5.323 ± 0.836
1.03ThrMet: 1.03 ± 0.234
2.747ThrAsn: 2.747 ± 0.514
2.747ThrPro: 2.747 ± 0.375
3.005ThrGln: 3.005 ± 0.498
2.662ThrArg: 2.662 ± 0.429
3.606ThrSer: 3.606 ± 0.727
3.52ThrThr: 3.52 ± 0.598
3.091ThrVal: 3.091 ± 0.576
0.773ThrTrp: 0.773 ± 0.271
1.46ThrTyr: 1.46 ± 0.372
0.0ThrXaa: 0.0 ± 0.0
Val
6.354ValAla: 6.354 ± 0.722
0.258ValCys: 0.258 ± 0.141
3.263ValAsp: 3.263 ± 0.464
4.98ValGlu: 4.98 ± 0.549
2.061ValPhe: 2.061 ± 0.507
6.354ValGly: 6.354 ± 0.702
1.46ValHis: 1.46 ± 0.326
4.551ValIle: 4.551 ± 0.539
3.349ValLys: 3.349 ± 0.513
3.95ValLeu: 3.95 ± 0.49
1.374ValMet: 1.374 ± 0.247
3.692ValAsn: 3.692 ± 0.522
2.146ValPro: 2.146 ± 0.431
1.975ValGln: 1.975 ± 0.407
4.035ValArg: 4.035 ± 0.4
4.894ValSer: 4.894 ± 0.512
5.237ValThr: 5.237 ± 0.551
4.636ValVal: 4.636 ± 0.828
0.859ValTrp: 0.859 ± 0.287
1.975ValTyr: 1.975 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
1.03TrpAla: 1.03 ± 0.244
0.343TrpCys: 0.343 ± 0.17
0.773TrpAsp: 0.773 ± 0.258
0.773TrpGlu: 0.773 ± 0.222
0.343TrpPhe: 0.343 ± 0.163
1.116TrpGly: 1.116 ± 0.404
0.258TrpHis: 0.258 ± 0.138
0.515TrpIle: 0.515 ± 0.207
1.46TrpLys: 1.46 ± 0.314
1.46TrpLeu: 1.46 ± 0.38
0.515TrpMet: 0.515 ± 0.25
0.944TrpAsn: 0.944 ± 0.226
0.258TrpPro: 0.258 ± 0.139
0.859TrpGln: 0.859 ± 0.207
0.859TrpArg: 0.859 ± 0.257
0.944TrpSer: 0.944 ± 0.359
0.687TrpThr: 0.687 ± 0.237
1.288TrpVal: 1.288 ± 0.452
0.343TrpTrp: 0.343 ± 0.139
0.429TrpTyr: 0.429 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.919TyrAla: 2.919 ± 0.583
0.429TyrCys: 0.429 ± 0.175
2.576TyrAsp: 2.576 ± 0.395
2.404TyrGlu: 2.404 ± 0.416
0.944TyrPhe: 0.944 ± 0.285
2.318TyrGly: 2.318 ± 0.371
0.429TyrHis: 0.429 ± 0.234
1.889TyrIle: 1.889 ± 0.503
1.374TyrLys: 1.374 ± 0.327
1.975TyrLeu: 1.975 ± 0.299
0.944TyrMet: 0.944 ± 0.337
1.803TyrAsn: 1.803 ± 0.38
1.116TyrPro: 1.116 ± 0.243
1.631TyrGln: 1.631 ± 0.373
2.061TyrArg: 2.061 ± 0.492
2.318TyrSer: 2.318 ± 0.433
2.146TyrThr: 2.146 ± 0.413
2.146TyrVal: 2.146 ± 0.422
0.515TyrTrp: 0.515 ± 0.193
0.601TyrTyr: 0.601 ± 0.264
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (11648 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski