Amino acid dipepetide frequency for Pectobacterium phage vB_PatP_CB5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.009AlaAla: 12.009 ± 1.26
0.924AlaCys: 0.924 ± 0.284
4.903AlaAsp: 4.903 ± 0.605
5.329AlaGlu: 5.329 ± 0.868
3.269AlaPhe: 3.269 ± 0.554
6.608AlaGly: 6.608 ± 0.71
2.274AlaHis: 2.274 ± 0.515
2.842AlaIle: 2.842 ± 0.38
4.974AlaLys: 4.974 ± 0.577
9.593AlaLeu: 9.593 ± 0.84
2.487AlaMet: 2.487 ± 0.373
3.908AlaAsn: 3.908 ± 0.624
3.482AlaPro: 3.482 ± 0.509
5.898AlaGln: 5.898 ± 0.719
4.335AlaArg: 4.335 ± 0.492
6.395AlaSer: 6.395 ± 0.801
5.045AlaThr: 5.045 ± 0.715
7.248AlaVal: 7.248 ± 0.808
1.421AlaTrp: 1.421 ± 0.279
3.979AlaTyr: 3.979 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.711CysAla: 0.711 ± 0.235
0.213CysCys: 0.213 ± 0.132
0.711CysAsp: 0.711 ± 0.243
0.355CysGlu: 0.355 ± 0.138
0.284CysPhe: 0.284 ± 0.155
0.853CysGly: 0.853 ± 0.25
0.497CysHis: 0.497 ± 0.246
0.853CysIle: 0.853 ± 0.267
0.284CysLys: 0.284 ± 0.155
0.924CysLeu: 0.924 ± 0.293
0.64CysMet: 0.64 ± 0.213
0.782CysAsn: 0.782 ± 0.278
0.64CysPro: 0.64 ± 0.243
0.355CysGln: 0.355 ± 0.146
0.782CysArg: 0.782 ± 0.27
0.853CysSer: 0.853 ± 0.307
0.924CysThr: 0.924 ± 0.296
0.64CysVal: 0.64 ± 0.207
0.213CysTrp: 0.213 ± 0.133
0.568CysTyr: 0.568 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
7.177AspAla: 7.177 ± 0.658
0.497AspCys: 0.497 ± 0.202
3.908AspAsp: 3.908 ± 0.524
2.913AspGlu: 2.913 ± 0.396
1.99AspPhe: 1.99 ± 0.341
4.192AspGly: 4.192 ± 0.546
0.924AspHis: 0.924 ± 0.27
3.837AspIle: 3.837 ± 0.465
2.842AspLys: 2.842 ± 0.545
4.903AspLeu: 4.903 ± 0.547
2.345AspMet: 2.345 ± 0.376
2.7AspAsn: 2.7 ± 0.395
2.061AspPro: 2.061 ± 0.308
1.421AspGln: 1.421 ± 0.406
2.771AspArg: 2.771 ± 0.548
4.121AspSer: 4.121 ± 0.586
4.406AspThr: 4.406 ± 0.408
4.69AspVal: 4.69 ± 0.564
1.563AspTrp: 1.563 ± 0.358
1.99AspTyr: 1.99 ± 0.402
0.0AspXaa: 0.0 ± 0.0
Glu
4.974GluAla: 4.974 ± 0.719
0.711GluCys: 0.711 ± 0.281
3.979GluAsp: 3.979 ± 0.609
2.984GluGlu: 2.984 ± 0.554
2.7GluPhe: 2.7 ± 0.388
2.558GluGly: 2.558 ± 0.402
0.995GluHis: 0.995 ± 0.249
2.345GluIle: 2.345 ± 0.415
2.558GluLys: 2.558 ± 0.503
4.974GluLeu: 4.974 ± 0.496
1.848GluMet: 1.848 ± 0.382
1.99GluAsn: 1.99 ± 0.348
1.066GluPro: 1.066 ± 0.292
3.127GluGln: 3.127 ± 0.503
2.558GluArg: 2.558 ± 0.475
2.984GluSer: 2.984 ± 0.369
2.984GluThr: 2.984 ± 0.44
3.766GluVal: 3.766 ± 0.549
0.64GluTrp: 0.64 ± 0.223
2.203GluTyr: 2.203 ± 0.331
0.0GluXaa: 0.0 ± 0.0
Phe
2.771PheAla: 2.771 ± 0.407
0.213PheCys: 0.213 ± 0.109
2.629PheAsp: 2.629 ± 0.471
1.492PheGlu: 1.492 ± 0.324
1.208PhePhe: 1.208 ± 0.318
2.771PheGly: 2.771 ± 0.432
0.64PheHis: 0.64 ± 0.215
1.35PheIle: 1.35 ± 0.281
1.776PheLys: 1.776 ± 0.346
2.274PheLeu: 2.274 ± 0.47
0.568PheMet: 0.568 ± 0.178
1.492PheAsn: 1.492 ± 0.445
1.279PhePro: 1.279 ± 0.33
1.634PheGln: 1.634 ± 0.3
1.634PheArg: 1.634 ± 0.321
2.203PheSer: 2.203 ± 0.388
1.492PheThr: 1.492 ± 0.385
2.558PheVal: 2.558 ± 0.484
0.355PheTrp: 0.355 ± 0.137
0.853PheTyr: 0.853 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
7.248GlyAla: 7.248 ± 0.716
1.35GlyCys: 1.35 ± 0.479
4.477GlyAsp: 4.477 ± 0.656
2.771GlyGlu: 2.771 ± 0.426
2.416GlyPhe: 2.416 ± 0.359
5.471GlyGly: 5.471 ± 0.572
0.924GlyHis: 0.924 ± 0.24
5.4GlyIle: 5.4 ± 0.55
3.624GlyLys: 3.624 ± 0.518
6.466GlyLeu: 6.466 ± 0.566
1.99GlyMet: 1.99 ± 0.309
3.553GlyAsn: 3.553 ± 0.609
1.279GlyPro: 1.279 ± 0.357
2.416GlyGln: 2.416 ± 0.448
4.192GlyArg: 4.192 ± 0.536
5.329GlySer: 5.329 ± 0.572
6.893GlyThr: 6.893 ± 0.669
6.04GlyVal: 6.04 ± 0.731
0.711GlyTrp: 0.711 ± 0.211
4.406GlyTyr: 4.406 ± 0.627
0.0GlyXaa: 0.0 ± 0.0
His
1.492HisAla: 1.492 ± 0.378
0.568HisCys: 0.568 ± 0.193
1.208HisAsp: 1.208 ± 0.313
1.137HisGlu: 1.137 ± 0.379
0.355HisPhe: 0.355 ± 0.178
1.848HisGly: 1.848 ± 0.46
0.355HisHis: 0.355 ± 0.14
1.492HisIle: 1.492 ± 0.356
1.066HisLys: 1.066 ± 0.39
1.705HisLeu: 1.705 ± 0.348
0.355HisMet: 0.355 ± 0.147
0.782HisAsn: 0.782 ± 0.266
1.066HisPro: 1.066 ± 0.252
0.782HisGln: 0.782 ± 0.219
1.066HisArg: 1.066 ± 0.271
0.924HisSer: 0.924 ± 0.259
0.995HisThr: 0.995 ± 0.296
1.279HisVal: 1.279 ± 0.305
0.497HisTrp: 0.497 ± 0.184
0.711HisTyr: 0.711 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
3.766IleAla: 3.766 ± 0.484
0.568IleCys: 0.568 ± 0.202
3.269IleAsp: 3.269 ± 0.574
2.487IleGlu: 2.487 ± 0.542
0.853IlePhe: 0.853 ± 0.219
2.984IleGly: 2.984 ± 0.471
0.782IleHis: 0.782 ± 0.243
2.132IleIle: 2.132 ± 0.34
2.629IleLys: 2.629 ± 0.439
3.553IleLeu: 3.553 ± 0.608
1.35IleMet: 1.35 ± 0.362
2.771IleAsn: 2.771 ± 0.58
2.345IlePro: 2.345 ± 0.252
2.416IleGln: 2.416 ± 0.324
1.634IleArg: 1.634 ± 0.359
3.269IleSer: 3.269 ± 0.363
4.406IleThr: 4.406 ± 0.676
2.132IleVal: 2.132 ± 0.371
0.64IleTrp: 0.64 ± 0.267
1.279IleTyr: 1.279 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
4.974LysAla: 4.974 ± 0.82
0.355LysCys: 0.355 ± 0.147
3.411LysAsp: 3.411 ± 0.42
3.979LysGlu: 3.979 ± 0.653
0.782LysPhe: 0.782 ± 0.239
2.984LysGly: 2.984 ± 0.455
0.782LysHis: 0.782 ± 0.257
1.421LysIle: 1.421 ± 0.322
2.416LysLys: 2.416 ± 0.545
4.761LysLeu: 4.761 ± 0.623
1.066LysMet: 1.066 ± 0.29
1.421LysAsn: 1.421 ± 0.345
2.132LysPro: 2.132 ± 0.341
2.629LysGln: 2.629 ± 0.409
2.984LysArg: 2.984 ± 0.52
2.487LysSer: 2.487 ± 0.393
1.705LysThr: 1.705 ± 0.34
3.482LysVal: 3.482 ± 0.537
0.64LysTrp: 0.64 ± 0.23
2.629LysTyr: 2.629 ± 0.494
0.0LysXaa: 0.0 ± 0.0
Leu
7.319LeuAla: 7.319 ± 0.686
1.279LeuCys: 1.279 ± 0.367
4.832LeuAsp: 4.832 ± 0.546
4.761LeuGlu: 4.761 ± 0.586
2.274LeuPhe: 2.274 ± 0.454
6.751LeuGly: 6.751 ± 0.754
2.061LeuHis: 2.061 ± 0.367
3.198LeuIle: 3.198 ± 0.57
3.482LeuLys: 3.482 ± 0.49
7.959LeuLeu: 7.959 ± 0.763
2.132LeuMet: 2.132 ± 0.334
4.477LeuAsn: 4.477 ± 0.586
4.69LeuPro: 4.69 ± 0.493
3.695LeuGln: 3.695 ± 0.501
5.685LeuArg: 5.685 ± 0.869
6.324LeuSer: 6.324 ± 0.742
5.045LeuThr: 5.045 ± 0.683
6.608LeuVal: 6.608 ± 0.536
0.64LeuTrp: 0.64 ± 0.2
3.127LeuTyr: 3.127 ± 0.465
0.0LeuXaa: 0.0 ± 0.0
Met
2.771MetAla: 2.771 ± 0.479
0.284MetCys: 0.284 ± 0.132
1.492MetAsp: 1.492 ± 0.309
0.853MetGlu: 0.853 ± 0.234
1.208MetPhe: 1.208 ± 0.255
2.274MetGly: 2.274 ± 0.36
0.64MetHis: 0.64 ± 0.218
0.853MetIle: 0.853 ± 0.278
0.924MetLys: 0.924 ± 0.277
2.061MetLeu: 2.061 ± 0.449
0.355MetMet: 0.355 ± 0.127
0.924MetAsn: 0.924 ± 0.214
1.492MetPro: 1.492 ± 0.436
1.848MetGln: 1.848 ± 0.405
2.274MetArg: 2.274 ± 0.483
1.563MetSer: 1.563 ± 0.397
1.776MetThr: 1.776 ± 0.387
1.848MetVal: 1.848 ± 0.343
0.213MetTrp: 0.213 ± 0.113
1.279MetTyr: 1.279 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
3.269AsnAla: 3.269 ± 0.533
0.711AsnCys: 0.711 ± 0.256
1.634AsnAsp: 1.634 ± 0.32
1.705AsnGlu: 1.705 ± 0.355
1.492AsnPhe: 1.492 ± 0.391
3.837AsnGly: 3.837 ± 0.576
0.568AsnHis: 0.568 ± 0.196
1.421AsnIle: 1.421 ± 0.33
2.7AsnLys: 2.7 ± 0.391
4.619AsnLeu: 4.619 ± 0.879
1.279AsnMet: 1.279 ± 0.284
2.274AsnAsn: 2.274 ± 0.406
2.061AsnPro: 2.061 ± 0.348
2.345AsnGln: 2.345 ± 0.412
2.913AsnArg: 2.913 ± 0.491
2.984AsnSer: 2.984 ± 0.667
4.05AsnThr: 4.05 ± 0.599
2.984AsnVal: 2.984 ± 0.447
0.64AsnTrp: 0.64 ± 0.234
0.568AsnTyr: 0.568 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
4.335ProAla: 4.335 ± 0.497
0.142ProCys: 0.142 ± 0.11
3.979ProAsp: 3.979 ± 0.579
3.482ProGlu: 3.482 ± 0.462
0.64ProPhe: 0.64 ± 0.199
2.629ProGly: 2.629 ± 0.354
0.497ProHis: 0.497 ± 0.176
1.492ProIle: 1.492 ± 0.322
1.279ProLys: 1.279 ± 0.358
2.487ProLeu: 2.487 ± 0.463
1.137ProMet: 1.137 ± 0.29
1.137ProAsn: 1.137 ± 0.251
1.279ProPro: 1.279 ± 0.292
1.208ProGln: 1.208 ± 0.319
1.563ProArg: 1.563 ± 0.293
2.842ProSer: 2.842 ± 0.528
3.198ProThr: 3.198 ± 0.542
3.695ProVal: 3.695 ± 0.439
0.853ProTrp: 0.853 ± 0.351
1.421ProTyr: 1.421 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
5.898GlnAla: 5.898 ± 0.669
0.355GlnCys: 0.355 ± 0.171
2.7GlnAsp: 2.7 ± 0.428
2.487GlnGlu: 2.487 ± 0.42
1.776GlnPhe: 1.776 ± 0.307
3.979GlnGly: 3.979 ± 0.612
1.279GlnHis: 1.279 ± 0.278
1.919GlnIle: 1.919 ± 0.44
1.776GlnLys: 1.776 ± 0.365
3.979GlnLeu: 3.979 ± 0.644
1.421GlnMet: 1.421 ± 0.319
2.274GlnAsn: 2.274 ± 0.553
1.208GlnPro: 1.208 ± 0.304
3.482GlnGln: 3.482 ± 0.871
2.771GlnArg: 2.771 ± 0.525
2.487GlnSer: 2.487 ± 0.511
1.705GlnThr: 1.705 ± 0.361
3.127GlnVal: 3.127 ± 0.419
0.355GlnTrp: 0.355 ± 0.159
3.127GlnTyr: 3.127 ± 0.5
0.0GlnXaa: 0.0 ± 0.0
Arg
4.69ArgAla: 4.69 ± 0.597
0.853ArgCys: 0.853 ± 0.241
3.695ArgAsp: 3.695 ± 0.611
2.984ArgGlu: 2.984 ± 0.374
1.35ArgPhe: 1.35 ± 0.335
3.979ArgGly: 3.979 ± 0.505
1.066ArgHis: 1.066 ± 0.258
3.411ArgIle: 3.411 ± 0.53
3.055ArgLys: 3.055 ± 0.437
3.553ArgLeu: 3.553 ± 0.548
1.776ArgMet: 1.776 ± 0.388
3.055ArgAsn: 3.055 ± 0.509
1.705ArgPro: 1.705 ± 0.384
2.203ArgGln: 2.203 ± 0.321
4.903ArgArg: 4.903 ± 0.584
3.411ArgSer: 3.411 ± 0.656
2.984ArgThr: 2.984 ± 0.476
4.548ArgVal: 4.548 ± 0.549
1.066ArgTrp: 1.066 ± 0.268
2.274ArgTyr: 2.274 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
7.603SerAla: 7.603 ± 0.805
0.711SerCys: 0.711 ± 0.255
3.411SerAsp: 3.411 ± 0.569
2.558SerGlu: 2.558 ± 0.45
2.061SerPhe: 2.061 ± 0.411
6.324SerGly: 6.324 ± 0.824
1.137SerHis: 1.137 ± 0.311
3.482SerIle: 3.482 ± 0.579
3.979SerLys: 3.979 ± 0.57
4.974SerLeu: 4.974 ± 0.589
1.776SerMet: 1.776 ± 0.458
2.274SerAsn: 2.274 ± 0.576
2.132SerPro: 2.132 ± 0.426
2.345SerGln: 2.345 ± 0.407
3.198SerArg: 3.198 ± 0.502
4.192SerSer: 4.192 ± 0.71
4.477SerThr: 4.477 ± 0.665
5.543SerVal: 5.543 ± 0.707
1.066SerTrp: 1.066 ± 0.331
1.563SerTyr: 1.563 ± 0.279
0.0SerXaa: 0.0 ± 0.0
Thr
6.679ThrAla: 6.679 ± 0.906
0.497ThrCys: 0.497 ± 0.221
3.624ThrAsp: 3.624 ± 0.638
3.837ThrGlu: 3.837 ± 0.443
1.634ThrPhe: 1.634 ± 0.396
7.106ThrGly: 7.106 ± 0.975
1.492ThrHis: 1.492 ± 0.298
2.132ThrIle: 2.132 ± 0.307
2.913ThrLys: 2.913 ± 0.471
5.614ThrLeu: 5.614 ± 0.664
0.924ThrMet: 0.924 ± 0.27
2.7ThrAsn: 2.7 ± 0.459
4.05ThrPro: 4.05 ± 0.648
2.345ThrGln: 2.345 ± 0.473
3.055ThrArg: 3.055 ± 0.515
4.477ThrSer: 4.477 ± 0.569
4.335ThrThr: 4.335 ± 1.087
4.619ThrVal: 4.619 ± 0.754
0.497ThrTrp: 0.497 ± 0.234
2.274ThrTyr: 2.274 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
5.756ValAla: 5.756 ± 0.537
0.853ValCys: 0.853 ± 0.29
4.263ValAsp: 4.263 ± 0.579
2.984ValGlu: 2.984 ± 0.399
2.842ValPhe: 2.842 ± 0.4
5.756ValGly: 5.756 ± 0.534
1.919ValHis: 1.919 ± 0.405
3.198ValIle: 3.198 ± 0.421
2.984ValLys: 2.984 ± 0.454
6.751ValLeu: 6.751 ± 0.872
1.776ValMet: 1.776 ± 0.318
2.842ValAsn: 2.842 ± 0.578
3.055ValPro: 3.055 ± 0.484
5.329ValGln: 5.329 ± 0.656
4.263ValArg: 4.263 ± 0.537
4.548ValSer: 4.548 ± 0.543
4.903ValThr: 4.903 ± 0.56
4.477ValVal: 4.477 ± 0.582
0.853ValTrp: 0.853 ± 0.272
3.269ValTyr: 3.269 ± 0.495
0.0ValXaa: 0.0 ± 0.0
Trp
0.924TrpAla: 0.924 ± 0.259
0.142TrpCys: 0.142 ± 0.096
0.64TrpAsp: 0.64 ± 0.228
0.782TrpGlu: 0.782 ± 0.24
0.711TrpPhe: 0.711 ± 0.292
1.208TrpGly: 1.208 ± 0.276
0.071TrpHis: 0.071 ± 0.072
0.213TrpIle: 0.213 ± 0.164
0.284TrpLys: 0.284 ± 0.133
1.848TrpLeu: 1.848 ± 0.469
0.284TrpMet: 0.284 ± 0.155
0.853TrpAsn: 0.853 ± 0.318
0.497TrpPro: 0.497 ± 0.195
0.782TrpGln: 0.782 ± 0.211
0.497TrpArg: 0.497 ± 0.173
0.924TrpSer: 0.924 ± 0.249
0.497TrpThr: 0.497 ± 0.2
1.208TrpVal: 1.208 ± 0.303
0.284TrpTrp: 0.284 ± 0.204
1.208TrpTyr: 1.208 ± 0.4
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.558TyrAla: 2.558 ± 0.51
0.853TyrCys: 0.853 ± 0.263
2.487TyrAsp: 2.487 ± 0.34
2.061TyrGlu: 2.061 ± 0.399
1.35TyrPhe: 1.35 ± 0.342
2.842TyrGly: 2.842 ± 0.547
0.782TyrHis: 0.782 ± 0.222
2.274TyrIle: 2.274 ± 0.49
1.492TyrLys: 1.492 ± 0.465
3.34TyrLeu: 3.34 ± 0.54
1.35TyrMet: 1.35 ± 0.333
1.848TyrAsn: 1.848 ± 0.4
1.776TyrPro: 1.776 ± 0.32
1.848TyrGln: 1.848 ± 0.361
3.482TyrArg: 3.482 ± 0.54
2.487TyrSer: 2.487 ± 0.405
2.984TyrThr: 2.984 ± 0.46
2.203TyrVal: 2.203 ± 0.346
0.711TyrTrp: 0.711 ± 0.3
1.563TyrTyr: 1.563 ± 0.423
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (14074 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski