Amino acid dipeptide frequency for Staphylococcus phage UPMK_2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
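As a rough illustration of how such a table could be computed, the Python sketch below counts overlapping residue pairs within each protein and reports them in per mille. The names `dipeptide_permille` and `representation`, the mapping of every non-standard letter to `X` (Xaa), the normalization by the total number of counted pairs, and the plain comparison against 2.5‰ are all assumptions made for this example; the page does not state the exact counting or coloring convention used by Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are counted inside each protein only, never across protein
    boundaries. Any letter outside STANDARD is treated as 'X' (assumption).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(value_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > EXPECTED_PERMILLE:
        return "over"
    if value_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["MKKLAE", "MKELLK"]  # made-up sequences, not phage data
    for pair, value in sorted(dipeptide_permille(toy_proteome).items()):
        # value * 1e-3 converts per mille to a plain fraction
        print(pair, round(value, 3), value * 1e-3, representation(value))
```

With this convention, a table value such as 7.785‰ corresponds to a fraction of 7.785 × 10⁻³ and would be labeled "over", while 0.389‰ would be labeled "under".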

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.389 ± 0.168
AlaCys: 0.467 ± 0.161
AlaAsp: 3.036 ± 0.43
AlaGlu: 3.503 ± 0.46
AlaPhe: 3.036 ± 0.565
AlaGly: 3.892 ± 0.707
AlaHis: 1.323 ± 0.304
AlaIle: 4.826 ± 0.645
AlaLys: 5.994 ± 0.683
AlaLeu: 4.281 ± 0.645
AlaMet: 1.557 ± 0.443
AlaAsn: 3.27 ± 0.459
AlaPro: 2.102 ± 0.435
AlaGln: 2.647 ± 0.489
AlaArg: 2.647 ± 0.465
AlaSer: 4.281 ± 0.756
AlaThr: 4.204 ± 0.646
AlaVal: 3.425 ± 0.728
AlaTrp: 0.856 ± 0.363
AlaTyr: 2.491 ± 0.473
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.156 ± 0.117
CysCys: 0.0 ± 0.0
CysAsp: 0.234 ± 0.132
CysGlu: 0.311 ± 0.197
CysPhe: 0.389 ± 0.18
CysGly: 0.311 ± 0.143
CysHis: 0.0 ± 0.0
CysIle: 0.311 ± 0.156
CysLys: 0.467 ± 0.179
CysLeu: 0.311 ± 0.146
CysMet: 0.156 ± 0.132
CysAsn: 0.467 ± 0.202
CysPro: 0.467 ± 0.203
CysGln: 0.311 ± 0.134
CysArg: 0.234 ± 0.145
CysSer: 0.467 ± 0.218
CysThr: 0.156 ± 0.104
CysVal: 0.234 ± 0.122
CysTrp: 0.078 ± 0.075
CysTyr: 0.234 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.97 ± 0.548
AspCys: 0.078 ± 0.073
AspAsp: 4.515 ± 0.97
AspGlu: 5.916 ± 0.73
AspPhe: 3.97 ± 0.572
AspGly: 4.904 ± 0.62
AspHis: 0.389 ± 0.185
AspIle: 5.449 ± 0.729
AspLys: 5.683 ± 0.744
AspLeu: 5.683 ± 0.639
AspMet: 1.479 ± 0.337
AspAsn: 3.503 ± 0.477
AspPro: 1.246 ± 0.268
AspGln: 1.323 ± 0.303
AspArg: 2.024 ± 0.403
AspSer: 4.126 ± 0.509
AspThr: 3.27 ± 0.513
AspVal: 4.126 ± 0.697
AspTrp: 0.701 ± 0.263
AspTyr: 3.114 ± 0.476
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.892 ± 0.618
GluCys: 0.623 ± 0.233
GluAsp: 4.437 ± 0.631
GluGlu: 5.138 ± 0.867
GluPhe: 2.647 ± 0.415
GluGly: 3.036 ± 0.464
GluHis: 1.557 ± 0.364
GluIle: 5.605 ± 0.723
GluLys: 5.138 ± 0.747
GluLeu: 7.785 ± 1.146
GluMet: 1.946 ± 0.521
GluAsn: 5.138 ± 0.655
GluPro: 1.479 ± 0.294
GluGln: 3.892 ± 0.719
GluArg: 3.27 ± 0.531
GluSer: 2.802 ± 0.435
GluThr: 3.503 ± 0.428
GluVal: 5.605 ± 0.622
GluTrp: 1.557 ± 0.336
GluTyr: 4.359 ± 0.735
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.557 ± 0.403
PheCys: 0.467 ± 0.156
PheAsp: 4.204 ± 0.439
PheGlu: 3.581 ± 0.603
PhePhe: 1.012 ± 0.249
PheGly: 2.413 ± 0.737
PheHis: 0.389 ± 0.146
PheIle: 3.659 ± 0.589
PheLys: 4.749 ± 0.614
PheLeu: 3.659 ± 0.514
PheMet: 0.856 ± 0.275
PheAsn: 2.413 ± 0.356
PhePro: 0.934 ± 0.304
PheGln: 0.934 ± 0.319
PheArg: 1.635 ± 0.316
PheSer: 2.024 ± 0.451
PheThr: 2.569 ± 0.433
PheVal: 2.958 ± 0.513
PheTrp: 0.467 ± 0.205
PheTyr: 1.635 ± 0.419
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.515 ± 0.741
GlyCys: 0.311 ± 0.157
GlyAsp: 3.737 ± 0.57
GlyGlu: 2.258 ± 0.378
GlyPhe: 2.802 ± 0.571
GlyGly: 2.958 ± 0.518
GlyHis: 1.557 ± 0.417
GlyIle: 4.593 ± 0.592
GlyLys: 5.216 ± 0.608
GlyLeu: 4.437 ± 0.698
GlyMet: 1.323 ± 0.233
GlyAsn: 3.503 ± 0.415
GlyPro: 0.389 ± 0.17
GlyGln: 2.569 ± 0.371
GlyArg: 2.647 ± 0.502
GlySer: 3.036 ± 0.52
GlyThr: 4.126 ± 0.494
GlyVal: 4.904 ± 0.788
GlyTrp: 1.168 ± 0.339
GlyTyr: 2.725 ± 0.454
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.09 ± 0.291
HisCys: 0.078 ± 0.085
HisAsp: 0.545 ± 0.197
HisGlu: 1.323 ± 0.3
HisPhe: 0.934 ± 0.239
HisGly: 1.168 ± 0.265
HisHis: 0.311 ± 0.15
HisIle: 1.479 ± 0.327
HisLys: 1.323 ± 0.305
HisLeu: 1.323 ± 0.322
HisMet: 0.234 ± 0.121
HisAsn: 1.168 ± 0.293
HisPro: 1.012 ± 0.312
HisGln: 0.934 ± 0.274
HisArg: 0.701 ± 0.245
HisSer: 0.934 ± 0.252
HisThr: 1.79 ± 0.331
HisVal: 0.856 ± 0.261
HisTrp: 0.078 ± 0.07
HisTyr: 0.856 ± 0.361
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.281 ± 0.607
IleCys: 0.234 ± 0.145
IleAsp: 6.228 ± 0.792
IleGlu: 6.15 ± 0.883
IlePhe: 2.88 ± 0.572
IleGly: 5.371 ± 0.776
IleHis: 1.012 ± 0.305
IleIle: 4.281 ± 0.606
IleLys: 7.94 ± 0.82
IleLeu: 3.503 ± 0.485
IleMet: 1.79 ± 0.361
IleAsn: 4.437 ± 0.655
IlePro: 2.18 ± 0.381
IleGln: 3.114 ± 0.51
IleArg: 2.88 ± 0.596
IleSer: 4.593 ± 0.626
IleThr: 4.982 ± 0.58
IleVal: 3.503 ± 0.455
IleTrp: 0.778 ± 0.303
IleTyr: 3.036 ± 0.613
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.904 ± 0.624
LysCys: 0.467 ± 0.166
LysAsp: 6.305 ± 0.737
LysGlu: 7.707 ± 0.773
LysPhe: 3.036 ± 0.48
LysGly: 5.916 ± 0.669
LysHis: 1.713 ± 0.414
LysIle: 6.072 ± 0.717
LysLys: 7.084 ± 0.838
LysLeu: 6.695 ± 0.766
LysMet: 2.491 ± 0.452
LysAsn: 6.072 ± 0.768
LysPro: 2.88 ± 0.505
LysGln: 4.982 ± 0.62
LysArg: 4.593 ± 0.555
LysSer: 4.826 ± 0.557
LysThr: 5.216 ± 0.675
LysVal: 5.527 ± 0.617
LysTrp: 0.778 ± 0.234
LysTyr: 3.814 ± 0.656
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.749 ± 0.693
LeuCys: 0.234 ± 0.185
LeuAsp: 5.06 ± 0.56
LeuGlu: 5.838 ± 0.989
LeuPhe: 3.659 ± 0.512
LeuGly: 3.347 ± 0.48
LeuHis: 1.479 ± 0.373
LeuIle: 4.281 ± 0.53
LeuLys: 6.305 ± 0.587
LeuLeu: 4.515 ± 0.609
LeuMet: 2.18 ± 0.456
LeuAsn: 5.761 ± 0.695
LeuPro: 2.335 ± 0.442
LeuGln: 2.88 ± 0.43
LeuArg: 2.725 ± 0.559
LeuSer: 4.281 ± 0.476
LeuThr: 5.605 ± 0.701
LeuVal: 4.749 ± 0.596
LeuTrp: 0.778 ± 0.248
LeuTyr: 3.503 ± 0.553
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.557 ± 0.474
MetCys: 0.0 ± 0.0
MetAsp: 1.635 ± 0.304
MetGlu: 1.79 ± 0.34
MetPhe: 1.168 ± 0.372
MetGly: 1.09 ± 0.265
MetHis: 0.545 ± 0.216
MetIle: 1.713 ± 0.341
MetLys: 1.79 ± 0.401
MetLeu: 1.946 ± 0.323
MetMet: 0.623 ± 0.211
MetAsn: 1.713 ± 0.394
MetPro: 0.934 ± 0.272
MetGln: 1.323 ± 0.412
MetArg: 0.545 ± 0.184
MetSer: 1.868 ± 0.511
MetThr: 1.868 ± 0.382
MetVal: 1.168 ± 0.241
MetTrp: 0.389 ± 0.159
MetTyr: 1.09 ± 0.335
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.749 ± 0.59
AsnCys: 0.623 ± 0.232
AsnAsp: 4.826 ± 0.643
AsnGlu: 4.515 ± 0.611
AsnPhe: 2.647 ± 0.468
AsnGly: 4.126 ± 0.622
AsnHis: 1.168 ± 0.387
AsnIle: 4.048 ± 0.563
AsnLys: 6.85 ± 0.816
AsnLeu: 3.347 ± 0.596
AsnMet: 1.557 ± 0.336
AsnAsn: 4.904 ± 0.936
AsnPro: 2.802 ± 0.458
AsnGln: 2.491 ± 0.497
AsnArg: 2.413 ± 0.368
AsnSer: 3.659 ± 0.519
AsnThr: 3.347 ± 0.469
AsnVal: 3.814 ± 0.57
AsnTrp: 0.934 ± 0.208
AsnTyr: 2.88 ± 0.474
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.246 ± 0.267
ProCys: 0.0 ± 0.0
ProAsp: 1.557 ± 0.336
ProGlu: 1.79 ± 0.372
ProPhe: 1.557 ± 0.297
ProGly: 1.79 ± 0.453
ProHis: 0.545 ± 0.213
ProIle: 2.102 ± 0.417
ProLys: 3.425 ± 0.564
ProLeu: 1.557 ± 0.352
ProMet: 0.701 ± 0.215
ProAsn: 2.024 ± 0.425
ProPro: 0.389 ± 0.143
ProGln: 0.701 ± 0.22
ProArg: 1.09 ± 0.26
ProSer: 2.024 ± 0.487
ProThr: 2.258 ± 0.407
ProVal: 1.713 ± 0.402
ProTrp: 0.078 ± 0.089
ProTyr: 1.401 ± 0.265
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.503 ± 0.579
GlnCys: 0.311 ± 0.164
GlnAsp: 1.79 ± 0.36
GlnGlu: 2.802 ± 0.528
GlnPhe: 2.258 ± 0.324
GlnGly: 2.258 ± 0.426
GlnHis: 1.012 ± 0.212
GlnIle: 2.802 ± 0.425
GlnLys: 3.659 ± 0.738
GlnLeu: 2.802 ± 0.518
GlnMet: 1.401 ± 0.377
GlnAsn: 2.413 ± 0.381
GlnPro: 1.868 ± 0.414
GlnGln: 1.868 ± 0.539
GlnArg: 2.102 ± 0.421
GlnSer: 1.946 ± 0.349
GlnThr: 1.946 ± 0.402
GlnVal: 2.569 ± 0.562
GlnTrp: 0.156 ± 0.113
GlnTyr: 1.323 ± 0.359
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.946 ± 0.389
ArgCys: 0.389 ± 0.169
ArgAsp: 2.88 ± 0.535
ArgGlu: 2.569 ± 0.413
ArgPhe: 2.258 ± 0.498
ArgGly: 1.79 ± 0.409
ArgHis: 1.323 ± 0.38
ArgIle: 3.425 ± 0.474
ArgLys: 3.659 ± 0.687
ArgLeu: 3.97 ± 0.644
ArgMet: 0.701 ± 0.225
ArgAsn: 2.491 ± 0.422
ArgPro: 1.246 ± 0.264
ArgGln: 1.868 ± 0.418
ArgArg: 1.246 ± 0.294
ArgSer: 1.713 ± 0.393
ArgThr: 2.258 ± 0.468
ArgVal: 2.18 ± 0.332
ArgTrp: 0.467 ± 0.167
ArgTyr: 2.647 ± 0.501
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.204 ± 0.515
SerCys: 0.156 ± 0.151
SerAsp: 4.204 ± 0.602
SerGlu: 3.581 ± 0.568
SerPhe: 1.868 ± 0.492
SerGly: 3.892 ± 0.639
SerHis: 0.778 ± 0.219
SerIle: 4.749 ± 0.715
SerLys: 5.683 ± 0.733
SerLeu: 3.97 ± 0.558
SerMet: 1.946 ± 0.288
SerAsn: 4.593 ± 0.608
SerPro: 0.778 ± 0.278
SerGln: 2.491 ± 0.427
SerArg: 2.024 ± 0.279
SerSer: 3.036 ± 0.577
SerThr: 3.347 ± 0.426
SerVal: 3.737 ± 0.601
SerTrp: 0.623 ± 0.214
SerTyr: 2.024 ± 0.31
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.126 ± 0.624
ThrCys: 0.0 ± 0.0
ThrAsp: 3.581 ± 0.519
ThrGlu: 4.204 ± 0.692
ThrPhe: 2.413 ± 0.428
ThrGly: 4.359 ± 0.658
ThrHis: 1.246 ± 0.284
ThrIle: 5.293 ± 0.76
ThrLys: 4.281 ± 0.583
ThrLeu: 4.904 ± 0.526
ThrMet: 0.778 ± 0.244
ThrAsn: 4.437 ± 0.71
ThrPro: 1.868 ± 0.388
ThrGln: 2.413 ± 0.515
ThrArg: 2.725 ± 0.522
ThrSer: 4.749 ± 0.932
ThrThr: 3.814 ± 0.535
ThrVal: 3.814 ± 0.598
ThrTrp: 1.012 ± 0.294
ThrTyr: 2.335 ± 0.403
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.593 ± 0.796
ValCys: 0.389 ± 0.166
ValAsp: 4.204 ± 0.813
ValGlu: 5.06 ± 0.666
ValPhe: 2.024 ± 0.36
ValGly: 3.192 ± 0.637
ValHis: 0.623 ± 0.203
ValIle: 4.126 ± 0.552
ValLys: 6.773 ± 0.562
ValLeu: 5.06 ± 0.548
ValMet: 1.635 ± 0.377
ValAsn: 3.659 ± 0.541
ValPro: 2.102 ± 0.41
ValGln: 1.479 ± 0.37
ValArg: 2.491 ± 0.425
ValSer: 3.737 ± 0.608
ValThr: 4.359 ± 0.664
ValVal: 3.97 ± 0.538
ValTrp: 0.856 ± 0.336
ValTyr: 2.491 ± 0.474
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.467 ± 0.217
TrpCys: 0.234 ± 0.126
TrpAsp: 0.389 ± 0.151
TrpGlu: 1.09 ± 0.269
TrpPhe: 0.389 ± 0.143
TrpGly: 0.934 ± 0.317
TrpHis: 0.311 ± 0.14
TrpIle: 0.856 ± 0.301
TrpLys: 1.246 ± 0.328
TrpLeu: 1.168 ± 0.312
TrpMet: 0.234 ± 0.14
TrpAsn: 0.856 ± 0.267
TrpPro: 0.0 ± 0.0
TrpGln: 0.778 ± 0.246
TrpArg: 0.389 ± 0.178
TrpSer: 0.701 ± 0.221
TrpThr: 0.934 ± 0.193
TrpVal: 0.934 ± 0.283
TrpTrp: 0.078 ± 0.073
TrpTyr: 0.545 ± 0.205
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.335 ± 0.452
TyrCys: 0.234 ± 0.129
TyrAsp: 2.102 ± 0.369
TyrGlu: 4.281 ± 0.546
TyrPhe: 1.246 ± 0.31
TyrGly: 2.024 ± 0.443
TyrHis: 0.856 ± 0.249
TyrIle: 3.503 ± 0.574
TyrLys: 3.814 ± 0.577
TyrLeu: 3.347 ± 0.556
TyrMet: 1.09 ± 0.294
TyrAsn: 2.88 ± 0.524
TyrPro: 0.934 ± 0.336
TyrGln: 1.79 ± 0.281
TyrArg: 2.647 ± 0.569
TyrSer: 2.88 ± 0.516
TyrThr: 2.802 ± 0.41
TyrVal: 3.036 ± 0.469
TyrTrp: 0.701 ± 0.241
TyrTyr: 1.946 ± 0.413
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12847 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
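One way such a protein-level bootstrap could look is sketched below in Python: whole proteins are resampled with replacement, the per mille frequency of a dipeptide is recomputed for each replicate, and the spread of those estimates is reported. The function names, the use of the sample standard deviation as the error, and the fixed random seed are assumptions for illustration only; the page states just that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Proteins, not individual residues, are the resampling unit, so each
    replicate draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported "±" is the standard deviation over replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKKLAE", "MKELLK", "MAKKAA"]  # made-up sequences
    print("KK error (per mille):", bootstrap_error(toy_proteome, "KK"))
```

Under this scheme a dipeptide that occurs in no protein has a frequency of zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.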

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski