Amino acid dipeptide frequency for Escherichia phage ST32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
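The comparison against a uniform expectation can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to normalize by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"


toy_proteins = ["AAAC", "ACB"]  # hypothetical sequences, not taken from ST32
freqs = dipeptide_permille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1), classify(freqs[pair]))
```

With real data, `classify` would correspond to the red ("over") and blue ("under") marking described above. The threshold actually used for the coloring is not stated on this page, so a plain comparison with 2.5‰ is only one plausible reading.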

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
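As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the AlaAla value is the one listed in the table.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


ala_ala = 7.686  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))   # fraction of all dipeptides
print(permille_to_percent(ala_ala))    # same value in percent
print(ala_ala / UNIFORM_PERMILLE)      # fold difference versus the uniform expectation
```

For AlaAla this gives a fraction of roughly 0.0077 (about 0.77%), which is about three times the uniform expectation.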

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.686 ± 0.862
AlaCys: 1.08 ± 0.318
AlaAsp: 4.256 ± 0.513
AlaGlu: 5.209 ± 0.688
AlaPhe: 3.176 ± 0.335
AlaGly: 6.543 ± 0.935
AlaHis: 1.461 ± 0.336
AlaIle: 5.781 ± 0.713
AlaLys: 5.273 ± 0.618
AlaLeu: 6.352 ± 0.797
AlaMet: 2.604 ± 0.42
AlaAsn: 3.621 ± 0.454
AlaPro: 2.096 ± 0.434
AlaGln: 2.922 ± 0.579
AlaArg: 4.383 ± 0.784
AlaSer: 4.955 ± 0.569
AlaThr: 5.145 ± 0.57
AlaVal: 5.527 ± 0.545
AlaTrp: 0.762 ± 0.192
AlaTyr: 2.795 ± 0.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.381 ± 0.151
CysCys: 0.127 ± 0.093
CysAsp: 0.699 ± 0.288
CysGlu: 0.762 ± 0.251
CysPhe: 0.635 ± 0.18
CysGly: 0.889 ± 0.246
CysHis: 0.318 ± 0.133
CysIle: 0.635 ± 0.233
CysLys: 1.08 ± 0.334
CysLeu: 0.762 ± 0.273
CysMet: 0.127 ± 0.095
CysAsn: 0.635 ± 0.177
CysPro: 0.508 ± 0.205
CysGln: 0.318 ± 0.164
CysArg: 0.635 ± 0.208
CysSer: 0.699 ± 0.195
CysThr: 1.08 ± 0.331
CysVal: 1.016 ± 0.305
CysTrp: 0.254 ± 0.116
CysTyr: 0.445 ± 0.171
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.891 ± 0.653
AspCys: 1.08 ± 0.258
AspAsp: 2.732 ± 0.39
AspGlu: 4.129 ± 0.552
AspPhe: 3.049 ± 0.426
AspGly: 5.209 ± 0.595
AspHis: 0.953 ± 0.205
AspIle: 3.367 ± 0.616
AspLys: 3.684 ± 0.585
AspLeu: 4.51 ± 0.592
AspMet: 1.207 ± 0.231
AspAsn: 2.795 ± 0.441
AspPro: 2.541 ± 0.442
AspGln: 0.762 ± 0.19
AspArg: 2.033 ± 0.442
AspSer: 5.018 ± 0.639
AspThr: 2.986 ± 0.359
AspVal: 4.51 ± 0.603
AspTrp: 1.08 ± 0.228
AspTyr: 2.223 ± 0.356
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.225 ± 0.737
GluCys: 0.826 ± 0.316
GluAsp: 3.748 ± 0.505
GluGlu: 5.463 ± 0.543
GluPhe: 2.986 ± 0.376
GluGly: 3.176 ± 0.44
GluHis: 1.588 ± 0.29
GluIle: 4.129 ± 0.603
GluLys: 3.24 ± 0.5
GluLeu: 6.988 ± 0.72
GluMet: 1.652 ± 0.33
GluAsn: 2.223 ± 0.353
GluPro: 1.652 ± 0.318
GluGln: 3.049 ± 0.441
GluArg: 3.494 ± 0.392
GluSer: 3.43 ± 0.545
GluThr: 3.748 ± 0.546
GluVal: 4.764 ± 0.628
GluTrp: 0.762 ± 0.174
GluTyr: 3.113 ± 0.485
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.43 ± 0.432
PheCys: 0.381 ± 0.163
PheAsp: 2.986 ± 0.381
PheGlu: 2.414 ± 0.351
PhePhe: 1.334 ± 0.34
PheGly: 3.176 ± 0.39
PheHis: 0.889 ± 0.255
PheIle: 2.477 ± 0.421
PheLys: 3.24 ± 0.508
PheLeu: 2.541 ± 0.383
PheMet: 1.143 ± 0.331
PheAsn: 2.604 ± 0.438
PhePro: 1.334 ± 0.258
PheGln: 1.016 ± 0.226
PheArg: 1.906 ± 0.361
PheSer: 2.732 ± 0.392
PheThr: 2.477 ± 0.386
PheVal: 2.604 ± 0.363
PheTrp: 0.445 ± 0.142
PheTyr: 1.207 ± 0.303
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.336 ± 0.775
GlyCys: 0.889 ± 0.266
GlyAsp: 4.256 ± 0.66
GlyGlu: 4.193 ± 0.59
GlyPhe: 3.494 ± 0.376
GlyGly: 6.289 ± 0.761
GlyHis: 1.334 ± 0.331
GlyIle: 4.891 ± 0.554
GlyLys: 4.51 ± 0.578
GlyLeu: 4.764 ± 0.533
GlyMet: 2.033 ± 0.44
GlyAsn: 3.811 ± 0.546
GlyPro: 1.842 ± 0.295
GlyGln: 1.969 ± 0.296
GlyArg: 3.557 ± 0.503
GlySer: 5.844 ± 0.918
GlyThr: 4.637 ± 0.708
GlyVal: 5.336 ± 0.494
GlyTrp: 1.588 ± 0.359
GlyTyr: 2.541 ± 0.389
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.016 ± 0.291
HisCys: 0.254 ± 0.132
HisAsp: 1.27 ± 0.283
HisGlu: 1.715 ± 0.33
HisPhe: 0.699 ± 0.218
HisGly: 1.525 ± 0.378
HisHis: 0.635 ± 0.211
HisIle: 1.27 ± 0.313
HisLys: 0.953 ± 0.286
HisLeu: 1.143 ± 0.332
HisMet: 0.762 ± 0.246
HisAsn: 1.016 ± 0.284
HisPro: 0.953 ± 0.276
HisGln: 0.826 ± 0.235
HisArg: 0.635 ± 0.225
HisSer: 1.398 ± 0.411
HisThr: 1.27 ± 0.551
HisVal: 1.27 ± 0.231
HisTrp: 0.254 ± 0.134
HisTyr: 0.953 ± 0.254
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.32 ± 0.662
IleCys: 1.016 ± 0.271
IleAsp: 4.066 ± 0.611
IleGlu: 4.002 ± 0.519
IlePhe: 2.033 ± 0.392
IleGly: 4.193 ± 0.612
IleHis: 1.08 ± 0.274
IleIle: 2.604 ± 0.502
IleLys: 4.828 ± 0.68
IleLeu: 4.32 ± 0.598
IleMet: 1.906 ± 0.361
IleAsn: 3.557 ± 0.431
IlePro: 2.604 ± 0.422
IleGln: 3.24 ± 0.436
IleArg: 3.113 ± 0.443
IleSer: 3.811 ± 0.481
IleThr: 4.32 ± 0.429
IleVal: 4.002 ± 0.579
IleTrp: 0.635 ± 0.189
IleTyr: 1.842 ± 0.395
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.527 ± 0.589
LysCys: 1.08 ± 0.302
LysAsp: 3.43 ± 0.575
LysGlu: 4.129 ± 0.446
LysPhe: 2.859 ± 0.566
LysGly: 3.684 ± 0.396
LysHis: 1.779 ± 0.369
LysIle: 3.24 ± 0.42
LysLys: 3.049 ± 0.474
LysLeu: 5.844 ± 0.736
LysMet: 2.16 ± 0.354
LysAsn: 2.477 ± 0.494
LysPro: 2.795 ± 0.536
LysGln: 2.541 ± 0.427
LysArg: 2.287 ± 0.354
LysSer: 4.637 ± 0.547
LysThr: 3.557 ± 0.49
LysVal: 4.129 ± 0.539
LysTrp: 0.762 ± 0.24
LysTyr: 1.652 ± 0.353
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.162 ± 0.565
LeuCys: 0.762 ± 0.246
LeuAsp: 4.574 ± 0.518
LeuGlu: 6.098 ± 0.734
LeuPhe: 2.604 ± 0.325
LeuGly: 4.891 ± 0.583
LeuHis: 1.461 ± 0.373
LeuIle: 4.066 ± 0.475
LeuLys: 5.527 ± 0.833
LeuLeu: 5.336 ± 0.722
LeuMet: 2.35 ± 0.376
LeuAsn: 4.828 ± 0.58
LeuPro: 3.367 ± 0.413
LeuGln: 3.621 ± 0.505
LeuArg: 3.875 ± 0.58
LeuSer: 5.082 ± 0.579
LeuThr: 4.955 ± 0.573
LeuVal: 5.145 ± 0.678
LeuTrp: 0.572 ± 0.174
LeuTyr: 2.16 ± 0.328
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.494 ± 0.696
MetCys: 0.127 ± 0.092
MetAsp: 1.398 ± 0.304
MetGlu: 1.588 ± 0.278
MetPhe: 1.27 ± 0.298
MetGly: 1.207 ± 0.239
MetHis: 0.635 ± 0.198
MetIle: 1.143 ± 0.241
MetLys: 2.287 ± 0.395
MetLeu: 2.033 ± 0.365
MetMet: 0.826 ± 0.291
MetAsn: 1.652 ± 0.338
MetPro: 1.334 ± 0.35
MetGln: 1.715 ± 0.338
MetArg: 1.207 ± 0.231
MetSer: 1.969 ± 0.321
MetThr: 1.398 ± 0.308
MetVal: 1.334 ± 0.294
MetTrp: 0.508 ± 0.178
MetTyr: 1.334 ± 0.351
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.193 ± 0.547
AsnCys: 0.572 ± 0.202
AsnAsp: 2.795 ± 0.506
AsnGlu: 2.541 ± 0.293
AsnPhe: 1.906 ± 0.375
AsnGly: 4.066 ± 0.599
AsnHis: 0.762 ± 0.249
AsnIle: 3.303 ± 0.414
AsnLys: 2.541 ± 0.364
AsnLeu: 4.066 ± 0.54
AsnMet: 1.334 ± 0.286
AsnAsn: 2.35 ± 0.363
AsnPro: 3.049 ± 0.395
AsnGln: 1.969 ± 0.308
AsnArg: 2.414 ± 0.388
AsnSer: 3.176 ± 0.428
AsnThr: 2.795 ± 0.423
AsnVal: 3.303 ± 0.495
AsnTrp: 0.699 ± 0.209
AsnTyr: 2.096 ± 0.316
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.24 ± 0.509
ProCys: 0.254 ± 0.114
ProAsp: 2.16 ± 0.415
ProGlu: 3.24 ± 0.434
ProPhe: 1.143 ± 0.23
ProGly: 2.732 ± 0.416
ProHis: 0.826 ± 0.213
ProIle: 1.652 ± 0.4
ProLys: 2.35 ± 0.394
ProLeu: 2.732 ± 0.459
ProMet: 0.953 ± 0.245
ProAsn: 2.223 ± 0.353
ProPro: 0.953 ± 0.248
ProGln: 1.398 ± 0.299
ProArg: 1.398 ± 0.33
ProSer: 2.986 ± 0.379
ProThr: 2.287 ± 0.353
ProVal: 3.176 ± 0.413
ProTrp: 0.699 ± 0.158
ProTyr: 1.525 ± 0.365
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.922 ± 0.421
GlnCys: 0.445 ± 0.183
GlnAsp: 1.461 ± 0.255
GlnGlu: 2.922 ± 0.486
GlnPhe: 1.969 ± 0.364
GlnGly: 1.906 ± 0.342
GlnHis: 0.381 ± 0.206
GlnIle: 3.24 ± 0.408
GlnLys: 1.525 ± 0.311
GlnLeu: 3.367 ± 0.478
GlnMet: 1.016 ± 0.258
GlnAsn: 1.461 ± 0.275
GlnPro: 1.652 ± 0.351
GlnGln: 2.414 ± 0.679
GlnArg: 2.795 ± 0.383
GlnSer: 1.969 ± 0.37
GlnThr: 2.033 ± 0.474
GlnVal: 2.477 ± 0.423
GlnTrp: 0.699 ± 0.208
GlnTyr: 1.779 ± 0.318
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.24 ± 0.523
ArgCys: 0.762 ± 0.26
ArgAsp: 2.668 ± 0.399
ArgGlu: 3.049 ± 0.442
ArgPhe: 1.906 ± 0.406
ArgGly: 3.113 ± 0.495
ArgHis: 0.762 ± 0.201
ArgIle: 3.43 ± 0.406
ArgLys: 3.049 ± 0.428
ArgLeu: 4.066 ± 0.54
ArgMet: 1.398 ± 0.331
ArgAsn: 2.477 ± 0.353
ArgPro: 1.842 ± 0.324
ArgGln: 1.525 ± 0.341
ArgArg: 1.969 ± 0.36
ArgSer: 2.732 ± 0.551
ArgThr: 2.35 ± 0.357
ArgVal: 3.684 ± 0.502
ArgTrp: 0.699 ± 0.182
ArgTyr: 2.096 ± 0.388
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.527 ± 0.895
SerCys: 0.445 ± 0.178
SerAsp: 4.383 ± 0.379
SerGlu: 4.066 ± 0.484
SerPhe: 2.541 ± 0.444
SerGly: 5.908 ± 0.663
SerHis: 1.461 ± 0.282
SerIle: 5.336 ± 0.804
SerLys: 3.303 ± 0.466
SerLeu: 5.463 ± 0.549
SerMet: 1.842 ± 0.346
SerAsn: 3.113 ± 0.514
SerPro: 2.223 ± 0.413
SerGln: 2.668 ± 0.616
SerArg: 3.24 ± 0.462
SerSer: 3.939 ± 0.56
SerThr: 3.811 ± 0.619
SerVal: 4.129 ± 0.532
SerTrp: 0.889 ± 0.256
SerTyr: 2.541 ± 0.568
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.209 ± 0.575
ThrCys: 0.381 ± 0.133
ThrAsp: 3.176 ± 0.39
ThrGlu: 3.24 ± 0.449
ThrPhe: 2.35 ± 0.391
ThrGly: 5.844 ± 0.636
ThrHis: 1.461 ± 0.488
ThrIle: 4.129 ± 0.553
ThrLys: 3.367 ± 0.412
ThrLeu: 4.32 ± 0.536
ThrMet: 1.334 ± 0.258
ThrAsn: 2.604 ± 0.479
ThrPro: 2.414 ± 0.327
ThrGln: 2.223 ± 0.34
ThrArg: 2.732 ± 0.435
ThrSer: 4.002 ± 0.691
ThrThr: 3.303 ± 0.515
ThrVal: 5.336 ± 0.691
ThrTrp: 0.699 ± 0.225
ThrTyr: 2.223 ± 0.383
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.908 ± 0.699
ValCys: 0.953 ± 0.294
ValAsp: 5.082 ± 0.571
ValGlu: 3.875 ± 0.679
ValPhe: 2.732 ± 0.452
ValGly: 4.955 ± 0.527
ValHis: 1.27 ± 0.288
ValIle: 4.193 ± 0.58
ValLys: 4.701 ± 0.668
ValLeu: 4.764 ± 0.576
ValMet: 1.969 ± 0.293
ValAsn: 3.494 ± 0.361
ValPro: 2.922 ± 0.37
ValGln: 2.287 ± 0.364
ValArg: 3.367 ± 0.564
ValSer: 4.002 ± 0.526
ValThr: 4.955 ± 0.694
ValVal: 4.764 ± 0.793
ValTrp: 1.207 ± 0.271
ValTyr: 1.715 ± 0.29
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.699 ± 0.192
TrpCys: 0.254 ± 0.11
TrpAsp: 0.699 ± 0.239
TrpGlu: 1.525 ± 0.358
TrpPhe: 0.572 ± 0.228
TrpGly: 0.508 ± 0.177
TrpHis: 0.254 ± 0.135
TrpIle: 0.635 ± 0.2
TrpLys: 0.889 ± 0.24
TrpLeu: 1.016 ± 0.247
TrpMet: 0.381 ± 0.133
TrpAsn: 0.635 ± 0.153
TrpPro: 0.254 ± 0.109
TrpGln: 0.762 ± 0.207
TrpArg: 0.508 ± 0.211
TrpSer: 1.461 ± 0.262
TrpThr: 0.953 ± 0.211
TrpVal: 0.762 ± 0.202
TrpTrp: 0.191 ± 0.105
TrpTyr: 0.762 ± 0.19
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.223 ± 0.343
TyrCys: 0.318 ± 0.127
TyrAsp: 2.922 ± 0.419
TyrGlu: 2.096 ± 0.337
TyrPhe: 1.08 ± 0.313
TyrGly: 3.303 ± 0.401
TyrHis: 0.508 ± 0.215
TyrIle: 1.969 ± 0.351
TyrLys: 2.223 ± 0.401
TyrLeu: 3.049 ± 0.565
TyrMet: 1.398 ± 0.38
TyrAsn: 2.35 ± 0.361
TyrPro: 1.588 ± 0.343
TyrGln: 1.207 ± 0.251
TyrArg: 1.27 ± 0.265
TyrSer: 2.986 ± 0.682
TyrThr: 2.287 ± 0.374
TyrVal: 1.842 ± 0.361
TyrTrp: 0.318 ± 0.134
TyrTyr: 1.016 ± 0.29
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (15743 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
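A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's procedure: it assumes that whole proteins are resampled with replacement, that the reported error is the standard deviation across replicates, and that frequencies are normalized by the number of pairs in each resample. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the spread reflects
    variation between proteins rather than between individual residues.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["AAAC", "ACDA", "CCAA", "DAAD"]  # hypothetical sequences
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
print(round(mean_aa, 1), "±", round(err_aa, 1))
```

Resampling proteins instead of individual dipeptides keeps pairs from the same protein together, which is one reasonable reading of "at the protein level".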

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski