Amino acid dipeptide frequency for Providencia phage vB PstP PS3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
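As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the database may use a different denominator.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Dipeptides are overlapping pairs of adjacent residues. Pairs are never
    formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code, not the phage proteome.
toy = ["MKAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input, "AA" occurs twice among six dipeptides, so it gets one third of the total.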

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
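The expected value and the red/blue distinction can be sketched in a few lines of Python. This assumes that a dipeptide counts as overrepresented when its observed frequency is above the uniform expectation of 1/400, and underrepresented when below it. The page does not state the exact rule used for colouring, so the function names and the threshold are illustrative only.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation.

    Assumption: "over" would correspond to red and "under" to blue.
    """
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_uniform_per_mille(20))  # 2.5 (that is, 0.25%)
print(classify(9.092))                 # AlaAla from the table: over
print(classify(0.16))                  # CysCys from the table: under
```

With 21 symbols (including Xaa) the uniform expectation drops to 1000/441, roughly 2.27 per mille.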

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
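A short Python sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are made up for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 9.092  # AlaAla frequency in per mille, taken from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f}")  # 0.009092
print(f"{percent:.4f}")   # 0.9092
```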

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.092 ± 1.169
AlaCys: 0.798 ± 0.357
AlaAsp: 4.706 ± 0.725
AlaGlu: 5.104 ± 0.607
AlaPhe: 3.35 ± 0.504
AlaGly: 6.221 ± 0.828
AlaHis: 1.196 ± 0.337
AlaIle: 4.068 ± 0.625
AlaLys: 5.025 ± 0.718
AlaLeu: 8.375 ± 1.03
AlaMet: 1.914 ± 0.425
AlaAsn: 3.111 ± 0.731
AlaPro: 3.828 ± 0.98
AlaGln: 5.424 ± 0.869
AlaArg: 4.785 ± 1.062
AlaSer: 4.068 ± 0.829
AlaThr: 3.589 ± 0.569
AlaVal: 5.743 ± 0.927
AlaTrp: 0.638 ± 0.291
AlaTyr: 3.828 ± 0.613
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.319 ± 0.172
CysCys: 0.16 ± 0.132
CysAsp: 0.558 ± 0.209
CysGlu: 0.638 ± 0.214
CysPhe: 0.399 ± 0.267
CysGly: 0.957 ± 0.306
CysHis: 0.239 ± 0.172
CysIle: 0.877 ± 0.293
CysLys: 0.399 ± 0.179
CysLeu: 1.117 ± 0.326
CysMet: 0.479 ± 0.199
CysAsn: 0.399 ± 0.183
CysPro: 0.399 ± 0.202
CysGln: 0.399 ± 0.24
CysArg: 0.239 ± 0.141
CysSer: 0.479 ± 0.199
CysThr: 1.117 ± 0.282
CysVal: 0.798 ± 0.265
CysTrp: 0.239 ± 0.17
CysTyr: 0.638 ± 0.211
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.46 ± 0.785
AspCys: 0.877 ± 0.272
AspAsp: 3.908 ± 0.62
AspGlu: 4.466 ± 0.539
AspPhe: 2.552 ± 0.389
AspGly: 4.227 ± 0.578
AspHis: 0.558 ± 0.202
AspIle: 3.669 ± 0.474
AspLys: 2.393 ± 0.4
AspLeu: 6.141 ± 0.751
AspMet: 1.755 ± 0.397
AspAsn: 1.994 ± 0.375
AspPro: 3.031 ± 0.392
AspGln: 1.196 ± 0.39
AspArg: 2.074 ± 0.423
AspSer: 3.749 ± 0.585
AspThr: 5.025 ± 0.647
AspVal: 3.19 ± 0.455
AspTrp: 0.638 ± 0.244
AspTyr: 2.233 ± 0.377
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.859 ± 0.729
GluCys: 0.638 ± 0.303
GluAsp: 2.632 ± 0.483
GluGlu: 2.792 ± 0.577
GluPhe: 3.19 ± 0.413
GluGly: 3.43 ± 0.56
GluHis: 1.595 ± 0.491
GluIle: 2.153 ± 0.473
GluLys: 3.111 ± 0.704
GluLeu: 6.46 ± 0.828
GluMet: 2.233 ± 0.383
GluAsn: 1.196 ± 0.271
GluPro: 2.153 ± 0.706
GluGln: 3.669 ± 0.697
GluArg: 3.669 ± 0.653
GluSer: 4.147 ± 0.443
GluThr: 3.19 ± 0.625
GluVal: 5.025 ± 0.613
GluTrp: 0.798 ± 0.297
GluTyr: 3.27 ± 0.568
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.074 ± 0.335
PheCys: 0.399 ± 0.212
PheAsp: 2.153 ± 0.423
PheGlu: 1.356 ± 0.355
PhePhe: 1.196 ± 0.251
PheGly: 3.031 ± 0.424
PheHis: 0.638 ± 0.245
PheIle: 2.153 ± 0.483
PheLys: 2.632 ± 0.642
PheLeu: 2.472 ± 0.374
PheMet: 0.718 ± 0.225
PheAsn: 2.313 ± 0.406
PhePro: 0.957 ± 0.305
PheGln: 1.515 ± 0.281
PheArg: 1.675 ± 0.36
PheSer: 2.951 ± 0.567
PheThr: 2.313 ± 0.438
PheVal: 1.515 ± 0.348
PheTrp: 0.399 ± 0.174
PheTyr: 1.196 ± 0.394
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.344 ± 0.784
GlyCys: 1.037 ± 0.408
GlyAsp: 4.546 ± 0.504
GlyGlu: 4.147 ± 0.522
GlyPhe: 1.755 ± 0.388
GlyGly: 4.945 ± 0.585
GlyHis: 0.718 ± 0.277
GlyIle: 5.025 ± 0.487
GlyLys: 3.669 ± 0.507
GlyLeu: 6.062 ± 0.77
GlyMet: 2.632 ± 0.621
GlyAsn: 2.074 ± 0.44
GlyPro: 0.0 ± 0.0
GlyGln: 1.834 ± 0.354
GlyArg: 3.749 ± 0.521
GlySer: 6.54 ± 0.549
GlyThr: 5.822 ± 0.73
GlyVal: 5.104 ± 0.606
GlyTrp: 0.877 ± 0.216
GlyTyr: 3.589 ± 0.499
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.798 ± 0.197
HisCys: 0.399 ± 0.177
HisAsp: 1.356 ± 0.277
HisGlu: 1.356 ± 0.261
HisPhe: 0.877 ± 0.28
HisGly: 1.515 ± 0.367
HisHis: 0.558 ± 0.321
HisIle: 0.957 ± 0.296
HisLys: 1.356 ± 0.363
HisLeu: 2.472 ± 0.396
HisMet: 0.558 ± 0.212
HisAsn: 0.718 ± 0.272
HisPro: 0.718 ± 0.194
HisGln: 0.877 ± 0.303
HisArg: 0.558 ± 0.181
HisSer: 1.037 ± 0.27
HisThr: 1.515 ± 0.266
HisVal: 0.877 ± 0.256
HisTrp: 0.319 ± 0.195
HisTyr: 0.798 ± 0.215
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.712 ± 0.413
IleCys: 0.239 ± 0.129
IleAsp: 3.988 ± 0.536
IleGlu: 3.031 ± 0.628
IlePhe: 1.276 ± 0.28
IleGly: 3.35 ± 0.542
IleHis: 1.515 ± 0.409
IleIle: 2.393 ± 0.362
IleLys: 4.785 ± 0.523
IleLeu: 4.068 ± 0.682
IleMet: 1.356 ± 0.349
IleAsn: 2.552 ± 0.465
IlePro: 2.792 ± 0.371
IleGln: 3.19 ± 0.552
IleArg: 2.632 ± 0.575
IleSer: 2.712 ± 0.532
IleThr: 3.509 ± 0.575
IleVal: 3.509 ± 0.531
IleTrp: 0.479 ± 0.217
IleTyr: 0.638 ± 0.281
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.62 ± 1.009
LysCys: 0.558 ± 0.213
LysAsp: 4.227 ± 0.483
LysGlu: 4.785 ± 0.564
LysPhe: 1.755 ± 0.621
LysGly: 4.068 ± 0.499
LysHis: 1.276 ± 0.36
LysIle: 1.755 ± 0.326
LysLys: 2.632 ± 0.511
LysLeu: 6.301 ± 0.737
LysMet: 1.276 ± 0.372
LysAsn: 1.834 ± 0.383
LysPro: 2.712 ± 0.456
LysGln: 3.111 ± 0.502
LysArg: 2.792 ± 0.582
LysSer: 3.031 ± 0.387
LysThr: 2.951 ± 0.427
LysVal: 5.104 ± 0.686
LysTrp: 0.957 ± 0.322
LysTyr: 4.387 ± 0.58
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.859 ± 0.961
LeuCys: 0.877 ± 0.265
LeuAsp: 7.098 ± 0.704
LeuGlu: 5.743 ± 0.698
LeuPhe: 2.472 ± 0.497
LeuGly: 6.141 ± 0.872
LeuHis: 1.834 ± 0.425
LeuIle: 5.344 ± 0.753
LeuLys: 6.301 ± 0.575
LeuLeu: 7.338 ± 0.825
LeuMet: 3.031 ± 0.457
LeuAsn: 4.387 ± 0.573
LeuPro: 3.031 ± 0.449
LeuGln: 4.785 ± 0.814
LeuArg: 3.828 ± 0.568
LeuSer: 6.381 ± 0.743
LeuThr: 4.147 ± 0.742
LeuVal: 5.424 ± 0.87
LeuTrp: 1.037 ± 0.359
LeuTyr: 3.749 ± 0.424
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.074 ± 0.329
MetCys: 0.239 ± 0.145
MetAsp: 1.117 ± 0.33
MetGlu: 1.117 ± 0.301
MetPhe: 1.037 ± 0.214
MetGly: 1.595 ± 0.374
MetHis: 0.558 ± 0.196
MetIle: 1.356 ± 0.25
MetLys: 1.914 ± 0.394
MetLeu: 3.111 ± 0.581
MetMet: 0.877 ± 0.286
MetAsn: 1.595 ± 0.331
MetPro: 0.638 ± 0.321
MetGln: 2.393 ± 0.501
MetArg: 1.196 ± 0.445
MetSer: 2.472 ± 0.389
MetThr: 1.356 ± 0.387
MetVal: 1.994 ± 0.372
MetTrp: 0.239 ± 0.128
MetTyr: 0.877 ± 0.203
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.19 ± 0.674
AsnCys: 0.319 ± 0.175
AsnAsp: 1.595 ± 0.365
AsnGlu: 1.436 ± 0.24
AsnPhe: 1.356 ± 0.39
AsnGly: 2.552 ± 0.426
AsnHis: 0.638 ± 0.241
AsnIle: 2.552 ± 0.521
AsnLys: 3.27 ± 0.429
AsnLeu: 2.712 ± 0.467
AsnMet: 0.877 ± 0.182
AsnAsn: 1.834 ± 0.334
AsnPro: 2.552 ± 0.456
AsnGln: 1.914 ± 0.407
AsnArg: 1.994 ± 0.336
AsnSer: 3.031 ± 0.55
AsnThr: 3.509 ± 0.504
AsnVal: 3.031 ± 0.427
AsnTrp: 0.558 ± 0.203
AsnTyr: 1.276 ± 0.311
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.031 ± 0.579
ProCys: 0.399 ± 0.179
ProAsp: 2.792 ± 0.361
ProGlu: 4.546 ± 0.876
ProPhe: 1.276 ± 0.326
ProGly: 0.08 ± 0.079
ProHis: 0.558 ± 0.21
ProIle: 1.276 ± 0.281
ProLys: 2.632 ± 0.503
ProLeu: 2.792 ± 0.444
ProMet: 1.196 ± 0.379
ProAsn: 1.117 ± 0.292
ProPro: 0.638 ± 0.268
ProGln: 1.276 ± 0.287
ProArg: 1.276 ± 0.315
ProSer: 3.27 ± 0.456
ProThr: 2.951 ± 0.478
ProVal: 3.35 ± 0.515
ProTrp: 0.399 ± 0.188
ProTyr: 1.356 ± 0.276
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.743 ± 0.828
GlnCys: 0.16 ± 0.113
GlnAsp: 2.792 ± 0.403
GlnGlu: 4.147 ± 0.718
GlnPhe: 1.515 ± 0.371
GlnGly: 4.706 ± 0.687
GlnHis: 0.957 ± 0.313
GlnIle: 2.074 ± 0.394
GlnLys: 1.834 ± 0.318
GlnLeu: 4.068 ± 0.688
GlnMet: 1.276 ± 0.426
GlnAsn: 1.675 ± 0.341
GlnPro: 1.117 ± 0.325
GlnGln: 2.951 ± 0.693
GlnArg: 1.755 ± 0.47
GlnSer: 3.749 ± 0.498
GlnThr: 2.074 ± 0.308
GlnVal: 3.35 ± 0.424
GlnTrp: 0.798 ± 0.223
GlnTyr: 3.43 ± 0.476
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.945 ± 0.871
ArgCys: 0.319 ± 0.298
ArgAsp: 2.313 ± 0.392
ArgGlu: 3.27 ± 0.507
ArgPhe: 1.994 ± 0.255
ArgGly: 3.43 ± 0.502
ArgHis: 0.798 ± 0.192
ArgIle: 2.552 ± 0.347
ArgLys: 3.43 ± 0.487
ArgLeu: 5.104 ± 0.719
ArgMet: 1.117 ± 0.247
ArgAsn: 1.196 ± 0.302
ArgPro: 1.196 ± 0.317
ArgGln: 1.834 ± 0.345
ArgArg: 3.589 ± 0.482
ArgSer: 2.632 ± 0.6
ArgThr: 3.27 ± 0.389
ArgVal: 3.669 ± 0.54
ArgTrp: 0.558 ± 0.189
ArgTyr: 1.037 ± 0.298
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.104 ± 0.827
SerCys: 0.558 ± 0.23
SerAsp: 3.27 ± 0.536
SerGlu: 4.307 ± 0.622
SerPhe: 1.117 ± 0.264
SerGly: 5.344 ± 0.723
SerHis: 1.436 ± 0.301
SerIle: 4.227 ± 0.563
SerLys: 4.865 ± 0.57
SerLeu: 5.663 ± 0.714
SerMet: 2.313 ± 0.378
SerAsn: 2.472 ± 0.524
SerPro: 2.233 ± 0.449
SerGln: 2.552 ± 0.533
SerArg: 3.111 ± 0.479
SerSer: 3.988 ± 0.619
SerThr: 7.019 ± 0.973
SerVal: 4.546 ± 0.68
SerTrp: 0.718 ± 0.27
SerTyr: 1.595 ± 0.349
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.583 ± 0.862
ThrCys: 0.399 ± 0.177
ThrAsp: 3.749 ± 0.512
ThrGlu: 4.546 ± 0.739
ThrPhe: 2.313 ± 0.465
ThrGly: 6.062 ± 0.832
ThrHis: 1.436 ± 0.345
ThrIle: 2.393 ± 0.544
ThrLys: 4.626 ± 0.751
ThrLeu: 4.546 ± 0.595
ThrMet: 1.276 ± 0.223
ThrAsn: 2.951 ± 0.481
ThrPro: 3.111 ± 0.525
ThrGln: 2.552 ± 0.431
ThrArg: 2.632 ± 0.549
ThrSer: 4.387 ± 0.497
ThrThr: 3.509 ± 0.703
ThrVal: 6.381 ± 0.828
ThrTrp: 0.558 ± 0.161
ThrTyr: 2.393 ± 0.595
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.104 ± 0.744
ValCys: 1.595 ± 0.408
ValAsp: 3.988 ± 0.765
ValGlu: 2.712 ± 0.503
ValPhe: 2.393 ± 0.416
ValGly: 5.184 ± 0.65
ValHis: 2.074 ± 0.46
ValIle: 2.552 ± 0.396
ValLys: 4.466 ± 0.598
ValLeu: 6.301 ± 0.818
ValMet: 1.276 ± 0.295
ValAsn: 3.43 ± 0.546
ValPro: 3.509 ± 0.579
ValGln: 5.344 ± 0.65
ValArg: 3.589 ± 0.404
ValSer: 4.307 ± 0.67
ValThr: 4.785 ± 0.672
ValVal: 4.466 ± 0.681
ValTrp: 0.479 ± 0.162
ValTyr: 2.472 ± 0.542
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.479 ± 0.188
TrpCys: 0.239 ± 0.136
TrpAsp: 1.117 ± 0.27
TrpGlu: 0.558 ± 0.202
TrpPhe: 0.877 ± 0.208
TrpGly: 0.319 ± 0.152
TrpHis: 0.399 ± 0.189
TrpIle: 0.479 ± 0.184
TrpLys: 0.479 ± 0.199
TrpLeu: 1.196 ± 0.325
TrpMet: 0.239 ± 0.122
TrpAsn: 0.877 ± 0.244
TrpPro: 0.08 ± 0.076
TrpGln: 0.399 ± 0.171
TrpArg: 0.877 ± 0.271
TrpSer: 0.479 ± 0.16
TrpThr: 0.558 ± 0.192
TrpVal: 0.798 ± 0.42
TrpTrp: 0.479 ± 0.194
TrpTyr: 0.798 ± 0.264
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.472 ± 0.397
TyrCys: 0.638 ± 0.22
TyrAsp: 2.233 ± 0.44
TyrGlu: 2.153 ± 0.473
TyrPhe: 0.957 ± 0.329
TyrGly: 2.313 ± 0.47
TyrHis: 0.638 ± 0.241
TyrIle: 2.951 ± 0.565
TyrLys: 2.313 ± 0.348
TyrLeu: 3.509 ± 0.617
TyrMet: 1.276 ± 0.336
TyrAsn: 2.472 ± 0.506
TyrPro: 1.515 ± 0.299
TyrGln: 2.951 ± 0.486
TyrArg: 2.233 ± 0.382
TyrSer: 3.031 ± 0.499
TyrThr: 3.19 ± 0.472
TyrVal: 2.153 ± 0.344
TyrTrp: 0.558 ± 0.22
TyrTyr: 0.798 ± 0.275
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (12539 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
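Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The sketch assumes 100 resamples (matching the "x100" above) and uses the sample standard deviation as the error. The page does not say which spread measure is used, so that choice, along with the function names and toy sequences, is an assumption.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: report the sample standard deviation across resamples.
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["AAAA", "GGGG", "AAGG"]
print(bootstrap_dipeptide_error(toy, "AA"))
```

Resampling whole proteins keeps within-protein composition intact, so the error reflects variation between proteins rather than between single positions.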

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski