Amino acid dipepetide frequency for Xanthomonas phage phiL7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.944AlaAla: 12.944 ± 1.64
0.853AlaCys: 0.853 ± 0.236
6.278AlaAsp: 6.278 ± 0.743
6.201AlaGlu: 6.201 ± 0.711
2.79AlaPhe: 2.79 ± 0.357
9.223AlaGly: 9.223 ± 1.048
2.48AlaHis: 2.48 ± 0.474
4.108AlaIle: 4.108 ± 0.583
4.495AlaLys: 4.495 ± 0.701
9.533AlaLeu: 9.533 ± 1.02
3.178AlaMet: 3.178 ± 0.479
3.488AlaAsn: 3.488 ± 0.485
4.495AlaPro: 4.495 ± 0.608
4.883AlaGln: 4.883 ± 0.798
7.053AlaArg: 7.053 ± 0.689
6.821AlaSer: 6.821 ± 1.019
6.898AlaThr: 6.898 ± 0.89
7.441AlaVal: 7.441 ± 0.813
2.015AlaTrp: 2.015 ± 0.342
3.72AlaTyr: 3.72 ± 0.494
0.0AlaXaa: 0.0 ± 0.0
Cys
0.853CysAla: 0.853 ± 0.295
0.078CysCys: 0.078 ± 0.085
0.543CysAsp: 0.543 ± 0.271
0.543CysGlu: 0.543 ± 0.206
0.543CysPhe: 0.543 ± 0.187
1.473CysGly: 1.473 ± 0.436
0.543CysHis: 0.543 ± 0.208
0.775CysIle: 0.775 ± 0.259
0.465CysLys: 0.465 ± 0.18
0.465CysLeu: 0.465 ± 0.209
0.233CysMet: 0.233 ± 0.149
0.465CysAsn: 0.465 ± 0.178
0.543CysPro: 0.543 ± 0.323
0.465CysGln: 0.465 ± 0.214
0.543CysArg: 0.543 ± 0.214
0.775CysSer: 0.775 ± 0.284
0.543CysThr: 0.543 ± 0.219
0.93CysVal: 0.93 ± 0.218
0.31CysTrp: 0.31 ± 0.15
0.233CysTyr: 0.233 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
5.503AspAla: 5.503 ± 0.591
0.775AspCys: 0.775 ± 0.355
3.178AspAsp: 3.178 ± 0.498
2.945AspGlu: 2.945 ± 0.426
2.248AspPhe: 2.248 ± 0.303
4.96AspGly: 4.96 ± 0.713
1.55AspHis: 1.55 ± 0.373
2.79AspIle: 2.79 ± 0.42
2.713AspLys: 2.713 ± 0.528
3.798AspLeu: 3.798 ± 0.419
1.628AspMet: 1.628 ± 0.276
1.318AspAsn: 1.318 ± 0.282
2.635AspPro: 2.635 ± 0.665
1.705AspGln: 1.705 ± 0.29
3.72AspArg: 3.72 ± 0.68
3.255AspSer: 3.255 ± 0.45
3.178AspThr: 3.178 ± 0.642
3.488AspVal: 3.488 ± 0.405
1.163AspTrp: 1.163 ± 0.274
2.093AspTyr: 2.093 ± 0.382
0.0AspXaa: 0.0 ± 0.0
Glu
6.511GluAla: 6.511 ± 0.936
0.853GluCys: 0.853 ± 0.328
3.488GluAsp: 3.488 ± 0.551
3.41GluGlu: 3.41 ± 0.553
1.705GluPhe: 1.705 ± 0.425
5.038GluGly: 5.038 ± 0.62
1.008GluHis: 1.008 ± 0.353
1.938GluIle: 1.938 ± 0.465
1.395GluLys: 1.395 ± 0.428
5.581GluLeu: 5.581 ± 0.754
1.55GluMet: 1.55 ± 0.317
2.093GluAsn: 2.093 ± 0.401
1.705GluPro: 1.705 ± 0.379
2.635GluGln: 2.635 ± 0.582
2.945GluArg: 2.945 ± 0.636
3.72GluSer: 3.72 ± 0.444
2.79GluThr: 2.79 ± 0.474
4.728GluVal: 4.728 ± 0.522
0.93GluTrp: 0.93 ± 0.356
1.86GluTyr: 1.86 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
2.403PheAla: 2.403 ± 0.39
0.31PheCys: 0.31 ± 0.104
2.325PheAsp: 2.325 ± 0.348
1.705PheGlu: 1.705 ± 0.293
1.085PhePhe: 1.085 ± 0.212
2.325PheGly: 2.325 ± 0.34
0.543PheHis: 0.543 ± 0.218
1.318PheIle: 1.318 ± 0.278
1.705PheLys: 1.705 ± 0.512
2.403PheLeu: 2.403 ± 0.452
0.62PheMet: 0.62 ± 0.2
1.163PheAsn: 1.163 ± 0.303
1.318PhePro: 1.318 ± 0.31
1.163PheGln: 1.163 ± 0.256
1.705PheArg: 1.705 ± 0.491
2.48PheSer: 2.48 ± 0.481
1.705PheThr: 1.705 ± 0.311
2.868PheVal: 2.868 ± 0.583
0.698PheTrp: 0.698 ± 0.215
0.465PheTyr: 0.465 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
8.448GlyAla: 8.448 ± 1.146
0.93GlyCys: 0.93 ± 0.295
4.263GlyAsp: 4.263 ± 0.568
4.96GlyGlu: 4.96 ± 0.585
2.558GlyPhe: 2.558 ± 0.44
7.751GlyGly: 7.751 ± 0.991
1.395GlyHis: 1.395 ± 0.411
3.798GlyIle: 3.798 ± 0.478
5.115GlyLys: 5.115 ± 0.651
7.131GlyLeu: 7.131 ± 0.669
2.713GlyMet: 2.713 ± 0.386
3.565GlyAsn: 3.565 ± 0.485
2.403GlyPro: 2.403 ± 0.46
3.798GlyGln: 3.798 ± 0.64
4.805GlyArg: 4.805 ± 0.627
4.805GlySer: 4.805 ± 0.685
4.96GlyThr: 4.96 ± 0.59
7.208GlyVal: 7.208 ± 0.886
1.628GlyTrp: 1.628 ± 0.425
3.255GlyTyr: 3.255 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
2.248HisAla: 2.248 ± 0.453
0.543HisCys: 0.543 ± 0.184
1.628HisAsp: 1.628 ± 0.336
1.008HisGlu: 1.008 ± 0.293
0.62HisPhe: 0.62 ± 0.244
2.17HisGly: 2.17 ± 0.783
0.775HisHis: 0.775 ± 0.325
0.93HisIle: 0.93 ± 0.257
1.008HisLys: 1.008 ± 0.251
1.628HisLeu: 1.628 ± 0.544
0.465HisMet: 0.465 ± 0.191
0.93HisAsn: 0.93 ± 0.303
1.55HisPro: 1.55 ± 0.369
0.698HisGln: 0.698 ± 0.274
1.938HisArg: 1.938 ± 0.419
0.775HisSer: 0.775 ± 0.234
1.628HisThr: 1.628 ± 0.405
1.473HisVal: 1.473 ± 0.395
0.233HisTrp: 0.233 ± 0.167
0.62HisTyr: 0.62 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
5.193IleAla: 5.193 ± 0.757
0.31IleCys: 0.31 ± 0.144
2.79IleAsp: 2.79 ± 0.449
2.79IleGlu: 2.79 ± 0.49
1.008IlePhe: 1.008 ± 0.236
2.713IleGly: 2.713 ± 0.396
0.62IleHis: 0.62 ± 0.215
1.705IleIle: 1.705 ± 0.409
2.093IleLys: 2.093 ± 0.394
2.79IleLeu: 2.79 ± 0.414
0.62IleMet: 0.62 ± 0.2
1.705IleAsn: 1.705 ± 0.393
2.093IlePro: 2.093 ± 0.431
1.55IleGln: 1.55 ± 0.372
3.1IleArg: 3.1 ± 0.508
2.248IleSer: 2.248 ± 0.295
3.565IleThr: 3.565 ± 0.475
3.023IleVal: 3.023 ± 0.564
0.388IleTrp: 0.388 ± 0.197
1.318IleTyr: 1.318 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
5.658LysAla: 5.658 ± 0.791
0.155LysCys: 0.155 ± 0.097
2.48LysAsp: 2.48 ± 0.416
2.093LysGlu: 2.093 ± 0.379
1.318LysPhe: 1.318 ± 0.336
3.41LysGly: 3.41 ± 0.579
0.853LysHis: 0.853 ± 0.269
1.938LysIle: 1.938 ± 0.421
1.783LysLys: 1.783 ± 0.379
4.03LysLeu: 4.03 ± 0.646
1.395LysMet: 1.395 ± 0.363
1.55LysAsn: 1.55 ± 0.333
2.403LysPro: 2.403 ± 0.464
2.093LysGln: 2.093 ± 0.466
3.178LysArg: 3.178 ± 0.479
2.868LysSer: 2.868 ± 0.544
2.558LysThr: 2.558 ± 0.481
3.333LysVal: 3.333 ± 0.476
1.395LysTrp: 1.395 ± 0.385
1.318LysTyr: 1.318 ± 0.319
0.0LysXaa: 0.0 ± 0.0
Leu
9.843LeuAla: 9.843 ± 1.043
1.085LeuCys: 1.085 ± 0.274
4.185LeuAsp: 4.185 ± 0.621
5.271LeuGlu: 5.271 ± 0.669
1.55LeuPhe: 1.55 ± 0.298
5.736LeuGly: 5.736 ± 0.641
2.79LeuHis: 2.79 ± 0.536
3.255LeuIle: 3.255 ± 0.545
3.875LeuLys: 3.875 ± 0.556
7.208LeuLeu: 7.208 ± 1.064
1.473LeuMet: 1.473 ± 0.327
4.108LeuAsn: 4.108 ± 0.463
3.953LeuPro: 3.953 ± 0.67
3.798LeuGln: 3.798 ± 0.707
6.433LeuArg: 6.433 ± 0.538
6.511LeuSer: 6.511 ± 0.698
5.581LeuThr: 5.581 ± 0.725
5.115LeuVal: 5.115 ± 0.762
1.085LeuTrp: 1.085 ± 0.281
1.86LeuTyr: 1.86 ± 0.419
0.0LeuXaa: 0.0 ± 0.0
Met
2.403MetAla: 2.403 ± 0.403
0.31MetCys: 0.31 ± 0.169
1.938MetAsp: 1.938 ± 0.462
1.628MetGlu: 1.628 ± 0.281
0.465MetPhe: 0.465 ± 0.187
2.403MetGly: 2.403 ± 0.589
0.62MetHis: 0.62 ± 0.248
0.62MetIle: 0.62 ± 0.198
0.775MetLys: 0.775 ± 0.251
2.79MetLeu: 2.79 ± 0.61
0.31MetMet: 0.31 ± 0.154
0.698MetAsn: 0.698 ± 0.238
1.395MetPro: 1.395 ± 0.325
1.085MetGln: 1.085 ± 0.326
1.705MetArg: 1.705 ± 0.33
2.558MetSer: 2.558 ± 0.393
1.395MetThr: 1.395 ± 0.296
1.395MetVal: 1.395 ± 0.313
0.31MetTrp: 0.31 ± 0.179
0.465MetTyr: 0.465 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
3.72AsnAla: 3.72 ± 0.714
0.155AsnCys: 0.155 ± 0.096
1.473AsnAsp: 1.473 ± 0.364
1.318AsnGlu: 1.318 ± 0.421
1.085AsnPhe: 1.085 ± 0.272
3.72AsnGly: 3.72 ± 0.509
0.775AsnHis: 0.775 ± 0.249
1.628AsnIle: 1.628 ± 0.27
1.55AsnLys: 1.55 ± 0.369
3.1AsnLeu: 3.1 ± 0.471
0.775AsnMet: 0.775 ± 0.217
1.55AsnAsn: 1.55 ± 0.377
2.015AsnPro: 2.015 ± 0.481
1.163AsnGln: 1.163 ± 0.276
2.635AsnArg: 2.635 ± 0.505
1.86AsnSer: 1.86 ± 0.369
2.635AsnThr: 2.635 ± 0.493
2.17AsnVal: 2.17 ± 0.354
1.085AsnTrp: 1.085 ± 0.269
0.93AsnTyr: 0.93 ± 0.255
0.0AsnXaa: 0.0 ± 0.0
Pro
4.418ProAla: 4.418 ± 0.434
0.465ProCys: 0.465 ± 0.206
2.945ProAsp: 2.945 ± 0.502
3.565ProGlu: 3.565 ± 0.644
0.93ProPhe: 0.93 ± 0.309
4.805ProGly: 4.805 ± 0.731
1.085ProHis: 1.085 ± 0.323
2.093ProIle: 2.093 ± 0.356
2.48ProLys: 2.48 ± 0.518
3.333ProLeu: 3.333 ± 0.587
1.24ProMet: 1.24 ± 0.387
1.55ProAsn: 1.55 ± 0.348
1.473ProPro: 1.473 ± 0.344
1.938ProGln: 1.938 ± 0.469
2.325ProArg: 2.325 ± 0.428
2.48ProSer: 2.48 ± 0.538
2.558ProThr: 2.558 ± 0.483
2.868ProVal: 2.868 ± 0.575
0.775ProTrp: 0.775 ± 0.228
0.93ProTyr: 0.93 ± 0.448
0.0ProXaa: 0.0 ± 0.0
Gln
5.426GlnAla: 5.426 ± 0.767
0.543GlnCys: 0.543 ± 0.214
1.705GlnAsp: 1.705 ± 0.276
2.093GlnGlu: 2.093 ± 0.412
2.558GlnPhe: 2.558 ± 0.46
3.333GlnGly: 3.333 ± 0.645
0.543GlnHis: 0.543 ± 0.203
2.17GlnIle: 2.17 ± 0.429
1.008GlnLys: 1.008 ± 0.252
3.72GlnLeu: 3.72 ± 0.63
1.24GlnMet: 1.24 ± 0.282
1.163GlnAsn: 1.163 ± 0.34
1.318GlnPro: 1.318 ± 0.356
2.325GlnGln: 2.325 ± 0.52
2.558GlnArg: 2.558 ± 0.47
2.635GlnSer: 2.635 ± 0.689
1.938GlnThr: 1.938 ± 0.396
3.1GlnVal: 3.1 ± 0.573
0.62GlnTrp: 0.62 ± 0.225
1.085GlnTyr: 1.085 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
7.518ArgAla: 7.518 ± 0.518
1.085ArgCys: 1.085 ± 0.315
3.255ArgAsp: 3.255 ± 0.553
3.875ArgGlu: 3.875 ± 0.748
1.938ArgPhe: 1.938 ± 0.353
5.115ArgGly: 5.115 ± 0.667
1.628ArgHis: 1.628 ± 0.469
3.41ArgIle: 3.41 ± 0.388
3.875ArgLys: 3.875 ± 0.564
6.666ArgLeu: 6.666 ± 0.893
1.628ArgMet: 1.628 ± 0.347
1.783ArgAsn: 1.783 ± 0.353
2.093ArgPro: 2.093 ± 0.426
2.093ArgGln: 2.093 ± 0.408
4.263ArgArg: 4.263 ± 0.789
2.325ArgSer: 2.325 ± 0.429
3.178ArgThr: 3.178 ± 0.483
5.115ArgVal: 5.115 ± 0.625
1.24ArgTrp: 1.24 ± 0.301
2.403ArgTyr: 2.403 ± 0.468
0.0ArgXaa: 0.0 ± 0.0
Ser
8.138SerAla: 8.138 ± 1.098
0.233SerCys: 0.233 ± 0.168
2.945SerAsp: 2.945 ± 0.468
2.868SerGlu: 2.868 ± 0.437
2.093SerPhe: 2.093 ± 0.293
4.96SerGly: 4.96 ± 0.803
0.853SerHis: 0.853 ± 0.249
2.635SerIle: 2.635 ± 0.583
2.558SerLys: 2.558 ± 0.445
5.813SerLeu: 5.813 ± 0.608
1.86SerMet: 1.86 ± 0.339
2.17SerAsn: 2.17 ± 0.409
3.333SerPro: 3.333 ± 0.51
2.558SerGln: 2.558 ± 0.434
3.255SerArg: 3.255 ± 0.663
4.108SerSer: 4.108 ± 0.702
3.72SerThr: 3.72 ± 0.494
4.65SerVal: 4.65 ± 0.707
1.24SerTrp: 1.24 ± 0.275
1.783SerTyr: 1.783 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
7.286ThrAla: 7.286 ± 0.95
1.085ThrCys: 1.085 ± 0.35
2.48ThrAsp: 2.48 ± 0.424
3.1ThrGlu: 3.1 ± 0.422
1.938ThrPhe: 1.938 ± 0.402
6.666ThrGly: 6.666 ± 0.735
1.008ThrHis: 1.008 ± 0.328
2.248ThrIle: 2.248 ± 0.415
2.79ThrLys: 2.79 ± 0.473
5.193ThrLeu: 5.193 ± 0.925
1.55ThrMet: 1.55 ± 0.363
2.015ThrAsn: 2.015 ± 0.608
4.108ThrPro: 4.108 ± 0.566
2.945ThrGln: 2.945 ± 0.411
3.41ThrArg: 3.41 ± 0.522
4.108ThrSer: 4.108 ± 0.603
5.271ThrThr: 5.271 ± 0.754
3.488ThrVal: 3.488 ± 0.57
1.008ThrTrp: 1.008 ± 0.267
2.635ThrTyr: 2.635 ± 0.488
0.0ThrXaa: 0.0 ± 0.0
Val
6.511ValAla: 6.511 ± 0.673
0.62ValCys: 0.62 ± 0.207
3.798ValAsp: 3.798 ± 0.522
4.34ValGlu: 4.34 ± 0.694
2.403ValPhe: 2.403 ± 0.424
5.968ValGly: 5.968 ± 0.741
2.248ValHis: 2.248 ± 0.604
2.325ValIle: 2.325 ± 0.393
3.875ValLys: 3.875 ± 0.665
5.348ValLeu: 5.348 ± 0.544
1.473ValMet: 1.473 ± 0.35
2.79ValAsn: 2.79 ± 0.395
3.333ValPro: 3.333 ± 0.518
2.635ValGln: 2.635 ± 0.458
4.34ValArg: 4.34 ± 0.44
4.34ValSer: 4.34 ± 0.621
6.511ValThr: 6.511 ± 0.738
6.201ValVal: 6.201 ± 0.813
1.163ValTrp: 1.163 ± 0.229
2.17ValTyr: 2.17 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
1.008TrpAla: 1.008 ± 0.296
0.388TrpCys: 0.388 ± 0.193
1.318TrpAsp: 1.318 ± 0.375
0.93TrpGlu: 0.93 ± 0.298
0.62TrpPhe: 0.62 ± 0.226
1.008TrpGly: 1.008 ± 0.263
0.775TrpHis: 0.775 ± 0.244
0.62TrpIle: 0.62 ± 0.238
0.93TrpLys: 0.93 ± 0.302
1.783TrpLeu: 1.783 ± 0.499
0.465TrpMet: 0.465 ± 0.206
0.465TrpAsn: 0.465 ± 0.176
0.775TrpPro: 0.775 ± 0.268
0.775TrpGln: 0.775 ± 0.246
1.628TrpArg: 1.628 ± 0.371
1.24TrpSer: 1.24 ± 0.31
1.24TrpThr: 1.24 ± 0.295
1.008TrpVal: 1.008 ± 0.3
0.155TrpTrp: 0.155 ± 0.101
0.853TrpTyr: 0.853 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.868TyrAla: 2.868 ± 0.566
0.543TyrCys: 0.543 ± 0.188
1.628TyrAsp: 1.628 ± 0.395
1.24TyrGlu: 1.24 ± 0.29
0.853TyrPhe: 0.853 ± 0.224
2.713TyrGly: 2.713 ± 0.436
0.775TyrHis: 0.775 ± 0.313
1.318TyrIle: 1.318 ± 0.424
1.318TyrLys: 1.318 ± 0.304
2.403TyrLeu: 2.403 ± 0.41
0.775TyrMet: 0.775 ± 0.212
0.775TyrAsn: 0.775 ± 0.296
1.318TyrPro: 1.318 ± 0.313
0.853TyrGln: 0.853 ± 0.196
2.945TyrArg: 2.945 ± 0.568
1.86TyrSer: 1.86 ± 0.648
2.48TyrThr: 2.48 ± 0.853
2.79TyrVal: 2.79 ± 0.464
0.543TyrTrp: 0.543 ± 0.229
1.55TyrTyr: 1.55 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12903 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski