Amino acid dipeptide frequency for Pectobacterium phage Clickz_B6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
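As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter input format, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of counted pairs are all assumptions made for this example; the counting and normalization conventions used by the database are not stated on this page and may differ.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap (ABC -> AB, BC) and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is treated as X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not data from this proteome.
freqs = dipeptide_per_mille(["MKAAL", "AAGB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the input would be the protein sequences of the proteome, and the resulting values could then be compared with the 2.5‰ uniform expectation.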

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
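The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and comparing against exactly 1/400 is an assumption about how over- and underrepresentation is decided; the page does not state the threshold used for the colouring.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if value_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla (12.14) and TrpHis (0.082).
print(per_mille_to_fraction(12.14), classify(12.14))
print(per_mille_to_fraction(0.082), classify(0.082))
```

For example, AlaAla at 12.14‰ corresponds to a fraction of about 0.0121, well above the uniform 0.0025.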

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.14 ± 1.428
AlaCys: 0.984 ± 0.3
AlaAsp: 5.086 ± 0.654
AlaGlu: 5.25 ± 0.843
AlaPhe: 2.789 ± 0.497
AlaGly: 6.562 ± 0.694
AlaHis: 2.051 ± 0.536
AlaIle: 2.789 ± 0.484
AlaLys: 4.429 ± 0.552
AlaLeu: 9.187 ± 0.807
AlaMet: 2.461 ± 0.436
AlaAsn: 3.445 ± 0.612
AlaPro: 3.855 ± 0.772
AlaGln: 6.152 ± 0.994
AlaArg: 4.429 ± 0.585
AlaSer: 6.234 ± 0.783
AlaThr: 4.594 ± 0.543
AlaVal: 7.218 ± 0.827
AlaTrp: 1.066 ± 0.293
AlaTyr: 3.445 ± 0.494
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.656 ± 0.211
CysCys: 0.328 ± 0.157
CysAsp: 0.902 ± 0.328
CysGlu: 0.328 ± 0.141
CysPhe: 0.246 ± 0.132
CysGly: 0.902 ± 0.244
CysHis: 0.492 ± 0.212
CysIle: 0.902 ± 0.307
CysLys: 0.246 ± 0.139
CysLeu: 0.902 ± 0.247
CysMet: 0.738 ± 0.228
CysAsn: 0.738 ± 0.286
CysPro: 0.738 ± 0.284
CysGln: 0.328 ± 0.131
CysArg: 0.492 ± 0.2
CysSer: 0.902 ± 0.301
CysThr: 1.066 ± 0.328
CysVal: 0.984 ± 0.267
CysTrp: 0.246 ± 0.16
CysTyr: 0.574 ± 0.19
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.726 ± 0.612
AspCys: 0.738 ± 0.265
AspAsp: 3.773 ± 0.614
AspGlu: 3.199 ± 0.532
AspPhe: 2.051 ± 0.339
AspGly: 4.922 ± 0.686
AspHis: 0.574 ± 0.271
AspIle: 3.855 ± 0.392
AspLys: 2.871 ± 0.599
AspLeu: 5.004 ± 0.672
AspMet: 2.215 ± 0.359
AspAsn: 2.379 ± 0.373
AspPro: 1.969 ± 0.373
AspGln: 1.394 ± 0.399
AspArg: 3.281 ± 0.525
AspSer: 4.101 ± 0.598
AspThr: 4.676 ± 0.534
AspVal: 4.594 ± 0.634
AspTrp: 1.723 ± 0.317
AspTyr: 2.051 ± 0.441
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.594 ± 0.65
GluCys: 0.574 ± 0.272
GluAsp: 3.609 ± 0.602
GluGlu: 3.691 ± 0.662
GluPhe: 2.707 ± 0.474
GluGly: 2.707 ± 0.426
GluHis: 1.312 ± 0.385
GluIle: 2.543 ± 0.446
GluLys: 2.789 ± 0.476
GluLeu: 5.25 ± 0.565
GluMet: 1.559 ± 0.368
GluAsn: 1.805 ± 0.351
GluPro: 1.148 ± 0.368
GluGln: 3.199 ± 0.516
GluArg: 3.035 ± 0.495
GluSer: 2.625 ± 0.431
GluThr: 3.035 ± 0.549
GluVal: 3.937 ± 0.696
GluTrp: 0.738 ± 0.254
GluTyr: 2.543 ± 0.431
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.035 ± 0.403
PheCys: 0.164 ± 0.099
PheAsp: 2.789 ± 0.495
PheGlu: 1.559 ± 0.279
PhePhe: 1.066 ± 0.32
PheGly: 2.297 ± 0.435
PheHis: 0.574 ± 0.217
PheIle: 1.559 ± 0.307
PheLys: 1.641 ± 0.349
PheLeu: 2.051 ± 0.485
PheMet: 0.656 ± 0.212
PheAsn: 1.805 ± 0.426
PhePro: 1.23 ± 0.308
PheGln: 1.559 ± 0.321
PheArg: 1.805 ± 0.376
PheSer: 1.723 ± 0.357
PheThr: 1.559 ± 0.313
PheVal: 2.543 ± 0.507
PheTrp: 0.328 ± 0.148
PheTyr: 0.902 ± 0.268
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.726 ± 0.779
GlyCys: 1.476 ± 0.467
GlyAsp: 4.758 ± 0.868
GlyGlu: 3.117 ± 0.533
GlyPhe: 2.707 ± 0.352
GlyGly: 5.906 ± 0.856
GlyHis: 0.82 ± 0.254
GlyIle: 5.086 ± 0.566
GlyLys: 3.609 ± 0.528
GlyLeu: 6.562 ± 0.594
GlyMet: 1.969 ± 0.31
GlyAsn: 3.773 ± 0.727
GlyPro: 1.23 ± 0.375
GlyGln: 2.051 ± 0.382
GlyArg: 3.527 ± 0.495
GlySer: 5.168 ± 0.67
GlyThr: 7.382 ± 0.716
GlyVal: 6.398 ± 0.778
GlyTrp: 1.066 ± 0.325
GlyTyr: 3.937 ± 0.67
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.476 ± 0.339
HisCys: 0.492 ± 0.163
HisAsp: 1.394 ± 0.383
HisGlu: 0.902 ± 0.325
HisPhe: 0.328 ± 0.164
HisGly: 1.805 ± 0.407
HisHis: 0.164 ± 0.108
HisIle: 1.066 ± 0.293
HisLys: 0.984 ± 0.346
HisLeu: 1.805 ± 0.439
HisMet: 0.492 ± 0.233
HisAsn: 0.82 ± 0.215
HisPro: 1.066 ± 0.292
HisGln: 0.902 ± 0.258
HisArg: 1.066 ± 0.272
HisSer: 1.148 ± 0.285
HisThr: 1.641 ± 0.669
HisVal: 1.394 ± 0.42
HisTrp: 0.656 ± 0.207
HisTyr: 0.656 ± 0.288
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.199 ± 0.558
IleCys: 0.984 ± 0.278
IleAsp: 3.855 ± 0.76
IleGlu: 2.789 ± 0.551
IlePhe: 0.82 ± 0.22
IleGly: 2.625 ± 0.46
IleHis: 0.984 ± 0.335
IleIle: 2.133 ± 0.343
IleLys: 2.379 ± 0.437
IleLeu: 3.691 ± 0.637
IleMet: 1.148 ± 0.25
IleAsn: 2.871 ± 0.562
IlePro: 2.133 ± 0.312
IleGln: 2.379 ± 0.354
IleArg: 1.559 ± 0.381
IleSer: 2.789 ± 0.402
IleThr: 4.019 ± 0.565
IleVal: 2.215 ± 0.44
IleTrp: 0.656 ± 0.226
IleTyr: 1.312 ± 0.297
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.429 ± 0.758
LysCys: 0.328 ± 0.163
LysAsp: 3.281 ± 0.505
LysGlu: 3.691 ± 0.556
LysPhe: 0.902 ± 0.331
LysGly: 3.199 ± 0.512
LysHis: 0.902 ± 0.283
LysIle: 1.312 ± 0.319
LysLys: 2.625 ± 0.719
LysLeu: 5.004 ± 0.716
LysMet: 1.066 ± 0.345
LysAsn: 1.641 ± 0.372
LysPro: 1.969 ± 0.419
LysGln: 2.789 ± 0.58
LysArg: 3.117 ± 0.469
LysSer: 2.379 ± 0.359
LysThr: 1.394 ± 0.358
LysVal: 3.117 ± 0.575
LysTrp: 0.574 ± 0.215
LysTyr: 2.297 ± 0.446
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.382 ± 0.82
LeuCys: 1.312 ± 0.324
LeuAsp: 5.004 ± 0.653
LeuGlu: 4.676 ± 0.585
LeuPhe: 2.707 ± 0.438
LeuGly: 6.234 ± 0.897
LeuHis: 2.297 ± 0.442
LeuIle: 3.363 ± 0.711
LeuLys: 3.855 ± 0.559
LeuLeu: 7.875 ± 0.738
LeuMet: 2.215 ± 0.334
LeuAsn: 4.512 ± 0.679
LeuPro: 5.004 ± 0.575
LeuGln: 3.691 ± 0.649
LeuArg: 4.84 ± 0.818
LeuSer: 6.644 ± 0.925
LeuThr: 5.414 ± 0.602
LeuVal: 6.972 ± 0.668
LeuTrp: 0.738 ± 0.228
LeuTyr: 3.445 ± 0.498
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.871 ± 0.47
MetCys: 0.246 ± 0.138
MetAsp: 1.066 ± 0.319
MetGlu: 0.82 ± 0.233
MetPhe: 1.066 ± 0.291
MetGly: 2.215 ± 0.331
MetHis: 0.574 ± 0.25
MetIle: 0.738 ± 0.237
MetLys: 0.82 ± 0.258
MetLeu: 2.297 ± 0.523
MetMet: 0.492 ± 0.177
MetAsn: 0.902 ± 0.325
MetPro: 1.641 ± 0.53
MetGln: 2.051 ± 0.428
MetArg: 2.133 ± 0.566
MetSer: 1.559 ± 0.361
MetThr: 1.23 ± 0.313
MetVal: 1.969 ± 0.39
MetTrp: 0.246 ± 0.132
MetTyr: 1.559 ± 0.33
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.855 ± 0.616
AsnCys: 0.82 ± 0.317
AsnAsp: 1.559 ± 0.412
AsnGlu: 2.215 ± 0.434
AsnPhe: 1.394 ± 0.319
AsnGly: 3.609 ± 0.512
AsnHis: 0.574 ± 0.21
AsnIle: 1.476 ± 0.387
AsnLys: 2.707 ± 0.429
AsnLeu: 4.676 ± 0.83
AsnMet: 1.23 ± 0.289
AsnAsn: 1.723 ± 0.299
AsnPro: 2.133 ± 0.383
AsnGln: 2.051 ± 0.449
AsnArg: 2.051 ± 0.409
AsnSer: 2.871 ± 0.796
AsnThr: 3.855 ± 0.598
AsnVal: 3.199 ± 0.591
AsnTrp: 0.738 ± 0.246
AsnTyr: 0.492 ± 0.162
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.512 ± 0.499
ProCys: 0.164 ± 0.118
ProAsp: 3.773 ± 0.542
ProGlu: 3.117 ± 0.505
ProPhe: 0.82 ± 0.249
ProGly: 2.871 ± 0.389
ProHis: 0.574 ± 0.198
ProIle: 1.723 ± 0.428
ProLys: 1.394 ± 0.321
ProLeu: 2.461 ± 0.467
ProMet: 1.148 ± 0.324
ProAsn: 1.23 ± 0.266
ProPro: 1.476 ± 0.4
ProGln: 1.559 ± 0.363
ProArg: 1.723 ± 0.33
ProSer: 2.379 ± 0.469
ProThr: 2.871 ± 0.581
ProVal: 3.691 ± 0.498
ProTrp: 0.902 ± 0.283
ProTyr: 1.887 ± 0.306
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.234 ± 0.938
GlnCys: 0.41 ± 0.157
GlnAsp: 2.461 ± 0.463
GlnGlu: 2.871 ± 0.445
GlnPhe: 1.559 ± 0.328
GlnGly: 4.183 ± 0.686
GlnHis: 1.312 ± 0.302
GlnIle: 1.476 ± 0.371
GlnLys: 1.969 ± 0.429
GlnLeu: 4.101 ± 0.649
GlnMet: 1.23 ± 0.316
GlnAsn: 2.215 ± 0.598
GlnPro: 1.23 ± 0.279
GlnGln: 3.445 ± 0.878
GlnArg: 2.543 ± 0.461
GlnSer: 2.707 ± 0.528
GlnThr: 2.051 ± 0.375
GlnVal: 3.199 ± 0.554
GlnTrp: 0.574 ± 0.171
GlnTyr: 2.789 ± 0.527
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.019 ± 0.496
ArgCys: 0.738 ± 0.224
ArgAsp: 3.527 ± 0.622
ArgGlu: 3.363 ± 0.577
ArgPhe: 1.066 ± 0.274
ArgGly: 4.019 ± 0.6
ArgHis: 1.23 ± 0.315
ArgIle: 3.527 ± 0.609
ArgLys: 2.379 ± 0.422
ArgLeu: 3.855 ± 0.563
ArgMet: 1.559 ± 0.387
ArgAsn: 2.789 ± 0.624
ArgPro: 1.476 ± 0.302
ArgGln: 1.969 ± 0.342
ArgArg: 4.758 ± 0.613
ArgSer: 3.117 ± 0.697
ArgThr: 3.117 ± 0.536
ArgVal: 4.265 ± 0.525
ArgTrp: 1.066 ± 0.238
ArgTyr: 2.625 ± 0.433
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.89 ± 0.78
SerCys: 0.82 ± 0.226
SerAsp: 3.281 ± 0.502
SerGlu: 2.215 ± 0.492
SerPhe: 2.051 ± 0.524
SerGly: 5.988 ± 0.956
SerHis: 1.23 ± 0.374
SerIle: 3.117 ± 0.491
SerLys: 3.691 ± 0.568
SerLeu: 5.578 ± 0.738
SerMet: 1.723 ± 0.414
SerAsn: 2.543 ± 0.483
SerPro: 2.379 ± 0.435
SerGln: 2.297 ± 0.425
SerArg: 2.461 ± 0.437
SerSer: 3.937 ± 0.656
SerThr: 4.758 ± 0.776
SerVal: 5.578 ± 0.763
SerTrp: 0.82 ± 0.35
SerTyr: 1.641 ± 0.314
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.398 ± 0.806
ThrCys: 0.656 ± 0.258
ThrAsp: 4.347 ± 0.685
ThrGlu: 3.527 ± 0.379
ThrPhe: 1.723 ± 0.381
ThrGly: 6.644 ± 0.848
ThrHis: 1.641 ± 0.607
ThrIle: 2.051 ± 0.35
ThrLys: 2.789 ± 0.44
ThrLeu: 5.414 ± 0.689
ThrMet: 0.902 ± 0.271
ThrAsn: 2.707 ± 0.502
ThrPro: 3.937 ± 0.627
ThrGln: 1.969 ± 0.441
ThrArg: 3.281 ± 0.585
ThrSer: 4.758 ± 0.542
ThrThr: 4.347 ± 0.985
ThrVal: 5.168 ± 0.947
ThrTrp: 0.574 ± 0.22
ThrTyr: 2.461 ± 0.395
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.496 ± 0.553
ValCys: 0.738 ± 0.242
ValAsp: 4.183 ± 0.607
ValGlu: 2.871 ± 0.493
ValPhe: 2.789 ± 0.36
ValGly: 5.824 ± 0.695
ValHis: 2.133 ± 0.352
ValIle: 3.117 ± 0.543
ValLys: 3.035 ± 0.723
ValLeu: 7.629 ± 0.938
ValMet: 1.887 ± 0.39
ValAsn: 2.871 ± 0.609
ValPro: 3.445 ± 0.489
ValGln: 6.07 ± 0.709
ValArg: 4.594 ± 0.615
ValSer: 4.265 ± 0.59
ValThr: 4.758 ± 0.598
ValVal: 4.429 ± 0.74
ValTrp: 0.902 ± 0.265
ValTyr: 3.035 ± 0.429
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.656 ± 0.21
TrpCys: 0.164 ± 0.126
TrpAsp: 0.656 ± 0.229
TrpGlu: 0.984 ± 0.32
TrpPhe: 0.738 ± 0.224
TrpGly: 1.394 ± 0.386
TrpHis: 0.082 ± 0.074
TrpIle: 0.328 ± 0.168
TrpLys: 0.246 ± 0.14
TrpLeu: 1.969 ± 0.377
TrpMet: 0.328 ± 0.157
TrpAsn: 0.902 ± 0.28
TrpPro: 0.328 ± 0.139
TrpGln: 0.902 ± 0.269
TrpArg: 0.656 ± 0.202
TrpSer: 0.984 ± 0.281
TrpThr: 0.574 ± 0.229
TrpVal: 1.23 ± 0.257
TrpTrp: 0.328 ± 0.178
TrpTyr: 1.23 ± 0.347
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.871 ± 0.475
TyrCys: 0.574 ± 0.198
TyrAsp: 2.789 ± 0.377
TyrGlu: 2.215 ± 0.451
TyrPhe: 1.312 ± 0.34
TyrGly: 3.363 ± 0.636
TyrHis: 0.738 ± 0.23
TyrIle: 2.379 ± 0.347
TyrLys: 1.559 ± 0.475
TyrLeu: 2.953 ± 0.472
TyrMet: 1.394 ± 0.38
TyrAsn: 1.559 ± 0.388
TyrPro: 1.887 ± 0.379
TyrGln: 1.805 ± 0.37
TyrArg: 3.117 ± 0.419
TyrSer: 2.707 ± 0.428
TyrThr: 2.953 ± 0.587
TyrVal: 1.969 ± 0.395
TyrTrp: 0.738 ± 0.327
TyrTyr: 1.559 ± 0.439
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12192 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski