Amino acid dipepetide frequency for Pseudomonas phage O4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.955AlaAla: 7.955 ± 1.23
0.884AlaCys: 0.884 ± 0.247
4.672AlaAsp: 4.672 ± 0.438
5.808AlaGlu: 5.808 ± 0.884
2.904AlaPhe: 2.904 ± 0.377
5.619AlaGly: 5.619 ± 0.636
1.705AlaHis: 1.705 ± 0.311
3.409AlaIle: 3.409 ± 0.394
5.682AlaLys: 5.682 ± 0.612
6.061AlaLeu: 6.061 ± 0.574
2.778AlaMet: 2.778 ± 0.48
3.851AlaAsn: 3.851 ± 0.48
2.399AlaPro: 2.399 ± 0.359
3.914AlaGln: 3.914 ± 0.676
5.051AlaArg: 5.051 ± 0.844
4.672AlaSer: 4.672 ± 0.766
5.177AlaThr: 5.177 ± 0.702
4.609AlaVal: 4.609 ± 0.596
1.326AlaTrp: 1.326 ± 0.273
3.03AlaTyr: 3.03 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.694CysAla: 0.694 ± 0.196
0.189CysCys: 0.189 ± 0.118
0.568CysAsp: 0.568 ± 0.261
1.01CysGlu: 1.01 ± 0.303
0.505CysPhe: 0.505 ± 0.173
1.263CysGly: 1.263 ± 0.352
0.316CysHis: 0.316 ± 0.152
0.379CysIle: 0.379 ± 0.172
1.136CysLys: 1.136 ± 0.375
0.821CysLeu: 0.821 ± 0.283
0.253CysMet: 0.253 ± 0.128
0.379CysAsn: 0.379 ± 0.156
0.442CysPro: 0.442 ± 0.176
0.316CysGln: 0.316 ± 0.132
0.568CysArg: 0.568 ± 0.194
0.821CysSer: 0.821 ± 0.251
0.442CysThr: 0.442 ± 0.158
0.631CysVal: 0.631 ± 0.238
0.189CysTrp: 0.189 ± 0.1
0.316CysTyr: 0.316 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
4.483AspAla: 4.483 ± 0.506
0.694AspCys: 0.694 ± 0.228
4.546AspAsp: 4.546 ± 0.506
4.23AspGlu: 4.23 ± 0.464
3.157AspPhe: 3.157 ± 0.417
5.43AspGly: 5.43 ± 0.662
1.263AspHis: 1.263 ± 0.252
4.925AspIle: 4.925 ± 0.555
3.536AspLys: 3.536 ± 0.463
4.609AspLeu: 4.609 ± 0.484
1.136AspMet: 1.136 ± 0.233
3.536AspAsn: 3.536 ± 0.445
2.652AspPro: 2.652 ± 0.372
1.452AspGln: 1.452 ± 0.283
2.841AspArg: 2.841 ± 0.516
3.283AspSer: 3.283 ± 0.461
3.157AspThr: 3.157 ± 0.627
4.167AspVal: 4.167 ± 0.523
1.326AspTrp: 1.326 ± 0.274
2.904AspTyr: 2.904 ± 0.327
0.0AspXaa: 0.0 ± 0.0
Glu
6.629GluAla: 6.629 ± 0.773
0.947GluCys: 0.947 ± 0.277
4.293GluAsp: 4.293 ± 0.47
6.566GluGlu: 6.566 ± 0.835
3.03GluPhe: 3.03 ± 0.453
5.051GluGly: 5.051 ± 0.613
1.642GluHis: 1.642 ± 0.405
3.472GluIle: 3.472 ± 0.558
5.493GluLys: 5.493 ± 0.584
6.124GluLeu: 6.124 ± 0.664
2.083GluMet: 2.083 ± 0.335
2.336GluAsn: 2.336 ± 0.305
2.525GluPro: 2.525 ± 0.479
3.094GluGln: 3.094 ± 0.545
3.409GluArg: 3.409 ± 0.479
1.705GluSer: 1.705 ± 0.297
3.157GluThr: 3.157 ± 0.562
5.303GluVal: 5.303 ± 0.685
1.263GluTrp: 1.263 ± 0.302
2.967GluTyr: 2.967 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.463
0.694PheCys: 0.694 ± 0.276
3.283PheAsp: 3.283 ± 0.381
3.03PheGlu: 3.03 ± 0.429
1.578PhePhe: 1.578 ± 0.31
3.346PheGly: 3.346 ± 0.387
0.694PheHis: 0.694 ± 0.198
2.336PheIle: 2.336 ± 0.466
2.967PheLys: 2.967 ± 0.531
2.273PheLeu: 2.273 ± 0.3
0.947PheMet: 0.947 ± 0.211
2.336PheAsn: 2.336 ± 0.356
1.326PhePro: 1.326 ± 0.257
1.452PheGln: 1.452 ± 0.289
2.083PheArg: 2.083 ± 0.376
2.399PheSer: 2.399 ± 0.499
1.768PheThr: 1.768 ± 0.369
2.904PheVal: 2.904 ± 0.518
0.568PheTrp: 0.568 ± 0.186
1.263PheTyr: 1.263 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
5.619GlyAla: 5.619 ± 0.819
0.505GlyCys: 0.505 ± 0.157
4.861GlyAsp: 4.861 ± 0.638
4.798GlyGlu: 4.798 ± 0.489
3.409GlyPhe: 3.409 ± 0.542
5.051GlyGly: 5.051 ± 0.651
1.389GlyHis: 1.389 ± 0.31
3.346GlyIle: 3.346 ± 0.527
5.682GlyLys: 5.682 ± 0.558
5.367GlyLeu: 5.367 ± 0.55
2.083GlyMet: 2.083 ± 0.327
3.346GlyAsn: 3.346 ± 0.369
1.515GlyPro: 1.515 ± 0.375
1.831GlyGln: 1.831 ± 0.371
2.778GlyArg: 2.778 ± 0.415
4.293GlySer: 4.293 ± 0.526
4.167GlyThr: 4.167 ± 0.514
4.672GlyVal: 4.672 ± 0.513
2.147GlyTrp: 2.147 ± 0.395
3.22GlyTyr: 3.22 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.241
0.568HisCys: 0.568 ± 0.187
1.136HisAsp: 1.136 ± 0.219
1.642HisGlu: 1.642 ± 0.422
1.073HisPhe: 1.073 ± 0.208
1.515HisGly: 1.515 ± 0.322
0.442HisHis: 0.442 ± 0.225
1.515HisIle: 1.515 ± 0.306
1.326HisLys: 1.326 ± 0.31
1.642HisLeu: 1.642 ± 0.349
0.631HisMet: 0.631 ± 0.177
1.136HisAsn: 1.136 ± 0.306
0.821HisPro: 0.821 ± 0.302
0.631HisGln: 0.631 ± 0.178
0.758HisArg: 0.758 ± 0.293
0.947HisSer: 0.947 ± 0.239
0.821HisThr: 0.821 ± 0.264
1.389HisVal: 1.389 ± 0.268
0.189HisTrp: 0.189 ± 0.095
0.884HisTyr: 0.884 ± 0.269
0.0HisXaa: 0.0 ± 0.0
Ile
4.167IleAla: 4.167 ± 0.435
0.568IleCys: 0.568 ± 0.227
4.419IleAsp: 4.419 ± 0.409
3.346IleGlu: 3.346 ± 0.486
1.705IlePhe: 1.705 ± 0.297
3.409IleGly: 3.409 ± 0.459
1.2IleHis: 1.2 ± 0.31
3.157IleIle: 3.157 ± 0.351
4.672IleLys: 4.672 ± 0.61
3.851IleLeu: 3.851 ± 0.558
1.01IleMet: 1.01 ± 0.207
2.715IleAsn: 2.715 ± 0.429
1.957IlePro: 1.957 ± 0.319
2.589IleGln: 2.589 ± 0.432
3.536IleArg: 3.536 ± 0.444
2.967IleSer: 2.967 ± 0.425
3.283IleThr: 3.283 ± 0.521
3.662IleVal: 3.662 ± 0.471
0.316IleTrp: 0.316 ± 0.137
2.336IleTyr: 2.336 ± 0.403
0.0IleXaa: 0.0 ± 0.0
Lys
6.819LysAla: 6.819 ± 0.963
0.821LysCys: 0.821 ± 0.296
4.798LysAsp: 4.798 ± 0.754
6.124LysGlu: 6.124 ± 0.641
2.715LysPhe: 2.715 ± 0.467
4.735LysGly: 4.735 ± 0.826
1.452LysHis: 1.452 ± 0.271
3.914LysIle: 3.914 ± 0.533
4.419LysLys: 4.419 ± 0.734
4.988LysLeu: 4.988 ± 0.798
1.768LysMet: 1.768 ± 0.404
3.599LysAsn: 3.599 ± 0.491
2.904LysPro: 2.904 ± 0.579
2.841LysGln: 2.841 ± 0.401
3.409LysArg: 3.409 ± 0.46
3.094LysSer: 3.094 ± 0.405
3.536LysThr: 3.536 ± 0.522
4.735LysVal: 4.735 ± 0.678
1.073LysTrp: 1.073 ± 0.248
2.336LysTyr: 2.336 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
6.377LeuAla: 6.377 ± 0.641
0.947LeuCys: 0.947 ± 0.241
4.798LeuAsp: 4.798 ± 0.493
6.503LeuGlu: 6.503 ± 0.633
2.589LeuPhe: 2.589 ± 0.325
4.798LeuGly: 4.798 ± 0.571
1.389LeuHis: 1.389 ± 0.324
3.472LeuIle: 3.472 ± 0.552
5.177LeuLys: 5.177 ± 0.524
5.24LeuLeu: 5.24 ± 0.573
1.263LeuMet: 1.263 ± 0.346
4.861LeuAsn: 4.861 ± 0.456
2.967LeuPro: 2.967 ± 0.367
3.409LeuGln: 3.409 ± 0.56
4.925LeuArg: 4.925 ± 0.656
4.167LeuSer: 4.167 ± 0.596
3.409LeuThr: 3.409 ± 0.437
6.629LeuVal: 6.629 ± 0.725
0.947LeuTrp: 0.947 ± 0.238
3.409LeuTyr: 3.409 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
2.904MetAla: 2.904 ± 0.37
0.442MetCys: 0.442 ± 0.156
1.073MetAsp: 1.073 ± 0.287
1.452MetGlu: 1.452 ± 0.298
1.073MetPhe: 1.073 ± 0.276
1.389MetGly: 1.389 ± 0.272
0.189MetHis: 0.189 ± 0.106
1.2MetIle: 1.2 ± 0.229
1.957MetLys: 1.957 ± 0.391
1.894MetLeu: 1.894 ± 0.325
0.442MetMet: 0.442 ± 0.156
1.073MetAsn: 1.073 ± 0.23
1.452MetPro: 1.452 ± 0.267
0.758MetGln: 0.758 ± 0.249
1.831MetArg: 1.831 ± 0.307
2.147MetSer: 2.147 ± 0.34
1.768MetThr: 1.768 ± 0.388
2.02MetVal: 2.02 ± 0.333
0.316MetTrp: 0.316 ± 0.134
1.073MetTyr: 1.073 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
4.167AsnAla: 4.167 ± 0.458
0.379AsnCys: 0.379 ± 0.144
2.904AsnAsp: 2.904 ± 0.467
2.273AsnGlu: 2.273 ± 0.436
2.21AsnPhe: 2.21 ± 0.381
4.735AsnGly: 4.735 ± 0.472
0.758AsnHis: 0.758 ± 0.25
4.041AsnIle: 4.041 ± 0.544
2.525AsnLys: 2.525 ± 0.357
5.43AsnLeu: 5.43 ± 0.544
1.578AsnMet: 1.578 ± 0.295
2.525AsnAsn: 2.525 ± 0.344
2.02AsnPro: 2.02 ± 0.337
1.831AsnGln: 1.831 ± 0.408
2.083AsnArg: 2.083 ± 0.224
2.778AsnSer: 2.778 ± 0.447
3.283AsnThr: 3.283 ± 0.421
3.409AsnVal: 3.409 ± 0.473
0.568AsnTrp: 0.568 ± 0.213
1.957AsnTyr: 1.957 ± 0.336
0.0AsnXaa: 0.0 ± 0.0
Pro
2.083ProAla: 2.083 ± 0.377
0.126ProCys: 0.126 ± 0.092
2.778ProAsp: 2.778 ± 0.354
3.725ProGlu: 3.725 ± 0.579
1.263ProPhe: 1.263 ± 0.254
2.399ProGly: 2.399 ± 0.434
1.01ProHis: 1.01 ± 0.224
1.831ProIle: 1.831 ± 0.398
2.652ProLys: 2.652 ± 0.603
2.778ProLeu: 2.778 ± 0.413
1.073ProMet: 1.073 ± 0.314
1.831ProAsn: 1.831 ± 0.301
1.2ProPro: 1.2 ± 0.263
1.263ProGln: 1.263 ± 0.324
1.326ProArg: 1.326 ± 0.269
2.715ProSer: 2.715 ± 0.537
1.957ProThr: 1.957 ± 0.381
3.157ProVal: 3.157 ± 0.367
0.568ProTrp: 0.568 ± 0.183
1.2ProTyr: 1.2 ± 0.259
0.0ProXaa: 0.0 ± 0.0
Gln
4.798GlnAla: 4.798 ± 0.837
0.316GlnCys: 0.316 ± 0.168
1.894GlnAsp: 1.894 ± 0.32
2.904GlnGlu: 2.904 ± 0.421
1.578GlnPhe: 1.578 ± 0.323
1.894GlnGly: 1.894 ± 0.369
0.947GlnHis: 0.947 ± 0.234
1.894GlnIle: 1.894 ± 0.353
2.652GlnLys: 2.652 ± 0.424
2.589GlnLeu: 2.589 ± 0.411
1.263GlnMet: 1.263 ± 0.335
1.452GlnAsn: 1.452 ± 0.367
1.452GlnPro: 1.452 ± 0.315
2.147GlnGln: 2.147 ± 0.602
2.778GlnArg: 2.778 ± 0.354
1.642GlnSer: 1.642 ± 0.329
1.894GlnThr: 1.894 ± 0.434
2.462GlnVal: 2.462 ± 0.463
0.758GlnTrp: 0.758 ± 0.175
1.705GlnTyr: 1.705 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
4.041ArgAla: 4.041 ± 0.509
0.758ArgCys: 0.758 ± 0.237
2.967ArgAsp: 2.967 ± 0.307
3.725ArgGlu: 3.725 ± 0.525
2.02ArgPhe: 2.02 ± 0.319
3.22ArgGly: 3.22 ± 0.689
1.136ArgHis: 1.136 ± 0.258
2.273ArgIle: 2.273 ± 0.349
3.851ArgLys: 3.851 ± 0.655
4.356ArgLeu: 4.356 ± 0.487
2.147ArgMet: 2.147 ± 0.409
3.157ArgAsn: 3.157 ± 0.436
2.336ArgPro: 2.336 ± 0.361
2.336ArgGln: 2.336 ± 0.396
2.715ArgArg: 2.715 ± 0.326
2.778ArgSer: 2.778 ± 0.394
1.894ArgThr: 1.894 ± 0.372
3.409ArgVal: 3.409 ± 0.439
0.758ArgTrp: 0.758 ± 0.195
2.273ArgTyr: 2.273 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
3.662SerAla: 3.662 ± 0.489
0.631SerCys: 0.631 ± 0.21
2.715SerAsp: 2.715 ± 0.389
2.967SerGlu: 2.967 ± 0.431
2.336SerPhe: 2.336 ± 0.297
3.978SerGly: 3.978 ± 0.523
0.947SerHis: 0.947 ± 0.218
2.652SerIle: 2.652 ± 0.348
3.472SerLys: 3.472 ± 0.545
4.735SerLeu: 4.735 ± 0.467
1.515SerMet: 1.515 ± 0.291
2.715SerAsn: 2.715 ± 0.431
2.336SerPro: 2.336 ± 0.563
2.399SerGln: 2.399 ± 0.479
3.03SerArg: 3.03 ± 0.467
3.157SerSer: 3.157 ± 0.406
3.346SerThr: 3.346 ± 0.466
5.051SerVal: 5.051 ± 0.607
1.263SerTrp: 1.263 ± 0.256
1.642SerTyr: 1.642 ± 0.335
0.0SerXaa: 0.0 ± 0.0
Thr
3.599ThrAla: 3.599 ± 0.546
0.316ThrCys: 0.316 ± 0.137
2.715ThrAsp: 2.715 ± 0.442
3.094ThrGlu: 3.094 ± 0.566
2.525ThrPhe: 2.525 ± 0.516
3.788ThrGly: 3.788 ± 0.604
0.758ThrHis: 0.758 ± 0.229
4.419ThrIle: 4.419 ± 0.533
3.157ThrLys: 3.157 ± 0.446
4.798ThrLeu: 4.798 ± 0.634
0.821ThrMet: 0.821 ± 0.232
3.03ThrAsn: 3.03 ± 0.452
2.715ThrPro: 2.715 ± 0.468
1.705ThrGln: 1.705 ± 0.406
2.589ThrArg: 2.589 ± 0.351
3.22ThrSer: 3.22 ± 0.575
3.409ThrThr: 3.409 ± 0.535
3.283ThrVal: 3.283 ± 0.551
1.01ThrTrp: 1.01 ± 0.242
1.957ThrTyr: 1.957 ± 0.404
0.0ThrXaa: 0.0 ± 0.0
Val
6.124ValAla: 6.124 ± 0.589
0.884ValCys: 0.884 ± 0.27
5.619ValAsp: 5.619 ± 0.606
3.978ValGlu: 3.978 ± 0.458
2.778ValPhe: 2.778 ± 0.456
4.672ValGly: 4.672 ± 0.54
1.705ValHis: 1.705 ± 0.398
3.094ValIle: 3.094 ± 0.456
5.43ValLys: 5.43 ± 0.797
4.483ValLeu: 4.483 ± 0.452
1.768ValMet: 1.768 ± 0.281
4.104ValAsn: 4.104 ± 0.702
2.462ValPro: 2.462 ± 0.445
2.967ValGln: 2.967 ± 0.451
3.157ValArg: 3.157 ± 0.524
4.735ValSer: 4.735 ± 0.469
3.851ValThr: 3.851 ± 0.603
4.925ValVal: 4.925 ± 0.664
1.642ValTrp: 1.642 ± 0.333
2.02ValTyr: 2.02 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
1.389TrpAla: 1.389 ± 0.335
0.379TrpCys: 0.379 ± 0.155
0.821TrpAsp: 0.821 ± 0.197
1.263TrpGlu: 1.263 ± 0.327
0.631TrpPhe: 0.631 ± 0.18
0.884TrpGly: 0.884 ± 0.195
0.379TrpHis: 0.379 ± 0.176
1.136TrpIle: 1.136 ± 0.247
1.263TrpLys: 1.263 ± 0.323
1.957TrpLeu: 1.957 ± 0.328
0.694TrpMet: 0.694 ± 0.242
0.631TrpAsn: 0.631 ± 0.222
0.505TrpPro: 0.505 ± 0.162
0.568TrpGln: 0.568 ± 0.143
1.263TrpArg: 1.263 ± 0.275
0.884TrpSer: 0.884 ± 0.276
1.01TrpThr: 1.01 ± 0.235
1.01TrpVal: 1.01 ± 0.244
0.253TrpTrp: 0.253 ± 0.13
0.568TrpTyr: 0.568 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.083TyrAla: 2.083 ± 0.363
0.189TyrCys: 0.189 ± 0.116
2.462TyrAsp: 2.462 ± 0.41
2.652TyrGlu: 2.652 ± 0.386
1.136TyrPhe: 1.136 ± 0.266
2.967TyrGly: 2.967 ± 0.335
0.694TyrHis: 0.694 ± 0.196
2.399TyrIle: 2.399 ± 0.389
3.03TyrLys: 3.03 ± 0.432
3.283TyrLeu: 3.283 ± 0.425
1.136TyrMet: 1.136 ± 0.295
2.841TyrAsn: 2.841 ± 0.448
0.947TyrPro: 0.947 ± 0.259
1.515TyrGln: 1.515 ± 0.313
2.02TyrArg: 2.02 ± 0.381
2.147TyrSer: 2.147 ± 0.345
1.578TyrThr: 1.578 ± 0.271
2.967TyrVal: 2.967 ± 0.482
1.01TyrTrp: 1.01 ± 0.252
1.894TyrTyr: 1.894 ± 0.309
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski