Amino acid dipepetide frequency for Clostridium phage HM T

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.108AlaAla: 4.108 ± 0.89
0.625AlaCys: 0.625 ± 0.228
2.947AlaAsp: 2.947 ± 0.797
2.947AlaGlu: 2.947 ± 0.421
2.143AlaPhe: 2.143 ± 0.529
2.5AlaGly: 2.5 ± 0.587
0.447AlaHis: 0.447 ± 0.242
3.929AlaIle: 3.929 ± 0.561
4.376AlaLys: 4.376 ± 0.637
5.001AlaLeu: 5.001 ± 0.825
2.054AlaMet: 2.054 ± 0.48
4.376AlaAsn: 4.376 ± 0.739
0.714AlaPro: 0.714 ± 0.293
2.5AlaGln: 2.5 ± 0.568
1.697AlaArg: 1.697 ± 0.493
2.322AlaSer: 2.322 ± 0.483
3.483AlaThr: 3.483 ± 0.859
3.036AlaVal: 3.036 ± 0.49
0.625AlaTrp: 0.625 ± 0.233
1.965AlaTyr: 1.965 ± 0.274
0.0AlaXaa: 0.0 ± 0.0
Cys
0.625CysAla: 0.625 ± 0.313
0.268CysCys: 0.268 ± 0.127
0.714CysAsp: 0.714 ± 0.301
0.447CysGlu: 0.447 ± 0.167
0.625CysPhe: 0.625 ± 0.2
0.357CysGly: 0.357 ± 0.18
0.179CysHis: 0.179 ± 0.118
0.893CysIle: 0.893 ± 0.272
1.518CysLys: 1.518 ± 0.296
0.893CysLeu: 0.893 ± 0.305
0.089CysMet: 0.089 ± 0.082
0.714CysAsn: 0.714 ± 0.37
0.268CysPro: 0.268 ± 0.188
0.179CysGln: 0.179 ± 0.134
0.447CysArg: 0.447 ± 0.191
0.447CysSer: 0.447 ± 0.202
0.625CysThr: 0.625 ± 0.221
0.357CysVal: 0.357 ± 0.154
0.179CysTrp: 0.179 ± 0.123
0.893CysTyr: 0.893 ± 0.326
0.0CysXaa: 0.0 ± 0.0
Asp
2.858AspAla: 2.858 ± 0.68
0.625AspCys: 0.625 ± 0.264
4.554AspAsp: 4.554 ± 0.776
6.073AspGlu: 6.073 ± 0.769
2.768AspPhe: 2.768 ± 0.612
3.751AspGly: 3.751 ± 0.681
0.714AspHis: 0.714 ± 0.243
6.966AspIle: 6.966 ± 0.825
7.055AspLys: 7.055 ± 0.808
6.162AspLeu: 6.162 ± 0.742
1.429AspMet: 1.429 ± 0.365
4.108AspAsn: 4.108 ± 0.61
1.25AspPro: 1.25 ± 0.356
1.786AspGln: 1.786 ± 0.45
1.965AspArg: 1.965 ± 0.437
5.001AspSer: 5.001 ± 0.491
3.393AspThr: 3.393 ± 0.532
2.411AspVal: 2.411 ± 0.522
0.625AspTrp: 0.625 ± 0.22
2.858AspTyr: 2.858 ± 0.5
0.0AspXaa: 0.0 ± 0.0
Glu
3.661GluAla: 3.661 ± 0.574
0.714GluCys: 0.714 ± 0.249
4.465GluAsp: 4.465 ± 0.756
5.894GluGlu: 5.894 ± 0.886
3.304GluPhe: 3.304 ± 0.528
3.572GluGly: 3.572 ± 0.554
0.804GluHis: 0.804 ± 0.252
7.412GluIle: 7.412 ± 0.868
5.983GluLys: 5.983 ± 1.024
9.198GluLeu: 9.198 ± 0.988
1.518GluMet: 1.518 ± 0.332
6.073GluAsn: 6.073 ± 0.748
1.25GluPro: 1.25 ± 0.387
1.965GluGln: 1.965 ± 0.445
2.411GluArg: 2.411 ± 0.641
3.751GluSer: 3.751 ± 0.518
3.84GluThr: 3.84 ± 0.601
4.286GluVal: 4.286 ± 0.526
1.429GluTrp: 1.429 ± 0.393
4.197GluTyr: 4.197 ± 0.6
0.0GluXaa: 0.0 ± 0.0
Phe
2.5PheAla: 2.5 ± 0.455
0.714PheCys: 0.714 ± 0.237
2.143PheAsp: 2.143 ± 0.419
2.59PheGlu: 2.59 ± 0.43
1.25PhePhe: 1.25 ± 0.404
2.5PheGly: 2.5 ± 0.468
0.0PheHis: 0.0 ± 0.0
3.572PheIle: 3.572 ± 0.461
3.751PheLys: 3.751 ± 0.495
2.233PheLeu: 2.233 ± 0.496
0.804PheMet: 0.804 ± 0.288
5.179PheAsn: 5.179 ± 0.741
0.536PhePro: 0.536 ± 0.215
0.893PheGln: 0.893 ± 0.273
1.429PheArg: 1.429 ± 0.353
2.59PheSer: 2.59 ± 0.939
2.59PheThr: 2.59 ± 0.733
1.429PheVal: 1.429 ± 0.401
0.089PheTrp: 0.089 ± 0.094
2.858PheTyr: 2.858 ± 0.465
0.0PheXaa: 0.0 ± 0.0
Gly
2.143GlyAla: 2.143 ± 0.41
0.357GlyCys: 0.357 ± 0.162
4.733GlyAsp: 4.733 ± 0.669
4.019GlyGlu: 4.019 ± 0.631
2.679GlyPhe: 2.679 ± 0.512
3.304GlyGly: 3.304 ± 0.507
0.982GlyHis: 0.982 ± 0.36
5.001GlyIle: 5.001 ± 1.237
5.983GlyLys: 5.983 ± 0.595
5.179GlyLeu: 5.179 ± 0.578
1.518GlyMet: 1.518 ± 0.477
3.929GlyAsn: 3.929 ± 0.516
0.357GlyPro: 0.357 ± 0.167
1.786GlyGln: 1.786 ± 0.401
1.072GlyArg: 1.072 ± 0.28
3.572GlySer: 3.572 ± 0.629
3.483GlyThr: 3.483 ± 0.476
3.483GlyVal: 3.483 ± 0.435
0.804GlyTrp: 0.804 ± 0.332
2.947GlyTyr: 2.947 ± 0.528
0.0GlyXaa: 0.0 ± 0.0
His
0.268HisAla: 0.268 ± 0.152
0.089HisCys: 0.089 ± 0.096
0.268HisAsp: 0.268 ± 0.178
0.357HisGlu: 0.357 ± 0.189
0.268HisPhe: 0.268 ± 0.141
0.536HisGly: 0.536 ± 0.234
0.179HisHis: 0.179 ± 0.123
0.714HisIle: 0.714 ± 0.281
1.072HisLys: 1.072 ± 0.305
0.625HisLeu: 0.625 ± 0.312
0.357HisMet: 0.357 ± 0.216
0.714HisAsn: 0.714 ± 0.223
0.179HisPro: 0.179 ± 0.126
0.268HisGln: 0.268 ± 0.14
0.536HisArg: 0.536 ± 0.21
0.893HisSer: 0.893 ± 0.245
0.536HisThr: 0.536 ± 0.206
0.714HisVal: 0.714 ± 0.224
0.0HisTrp: 0.0 ± 0.0
0.536HisTyr: 0.536 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
4.197IleAla: 4.197 ± 0.825
0.625IleCys: 0.625 ± 0.235
7.144IleAsp: 7.144 ± 1.321
7.412IleGlu: 7.412 ± 0.921
2.411IlePhe: 2.411 ± 0.361
4.286IleGly: 4.286 ± 0.683
1.072IleHis: 1.072 ± 0.313
7.055IleIle: 7.055 ± 0.926
9.377IleLys: 9.377 ± 0.879
6.698IleLeu: 6.698 ± 1.049
1.607IleMet: 1.607 ± 0.321
6.43IleAsn: 6.43 ± 0.524
2.947IlePro: 2.947 ± 0.62
2.411IleGln: 2.411 ± 0.428
3.751IleArg: 3.751 ± 0.69
6.608IleSer: 6.608 ± 0.69
5.894IleThr: 5.894 ± 0.827
4.733IleVal: 4.733 ± 0.641
1.34IleTrp: 1.34 ± 0.496
4.912IleTyr: 4.912 ± 0.78
0.0IleXaa: 0.0 ± 0.0
Lys
5.894LysAla: 5.894 ± 0.832
0.804LysCys: 0.804 ± 0.352
5.983LysAsp: 5.983 ± 0.818
9.109LysGlu: 9.109 ± 1.041
2.947LysPhe: 2.947 ± 0.477
5.715LysGly: 5.715 ± 0.709
1.072LysHis: 1.072 ± 0.296
8.394LysIle: 8.394 ± 0.891
7.68LysLys: 7.68 ± 1.031
7.859LysLeu: 7.859 ± 0.897
2.679LysMet: 2.679 ± 0.556
5.179LysAsn: 5.179 ± 0.634
1.607LysPro: 1.607 ± 0.338
3.393LysGln: 3.393 ± 0.465
3.126LysArg: 3.126 ± 0.582
5.894LysSer: 5.894 ± 0.711
5.358LysThr: 5.358 ± 0.64
5.447LysVal: 5.447 ± 0.85
0.536LysTrp: 0.536 ± 0.252
4.197LysTyr: 4.197 ± 0.736
0.0LysXaa: 0.0 ± 0.0
Leu
4.465LeuAla: 4.465 ± 0.752
1.072LeuCys: 1.072 ± 0.214
6.698LeuAsp: 6.698 ± 0.678
6.162LeuGlu: 6.162 ± 0.9
3.215LeuPhe: 3.215 ± 0.498
4.286LeuGly: 4.286 ± 0.641
0.179LeuHis: 0.179 ± 0.121
5.447LeuIle: 5.447 ± 0.915
8.752LeuLys: 8.752 ± 0.714
6.251LeuLeu: 6.251 ± 0.586
1.786LeuMet: 1.786 ± 0.38
6.519LeuAsn: 6.519 ± 0.604
1.607LeuPro: 1.607 ± 0.691
2.858LeuGln: 2.858 ± 0.509
2.59LeuArg: 2.59 ± 0.449
6.966LeuSer: 6.966 ± 0.878
4.822LeuThr: 4.822 ± 0.597
4.019LeuVal: 4.019 ± 0.472
0.714LeuTrp: 0.714 ± 0.211
3.036LeuTyr: 3.036 ± 0.603
0.0LeuXaa: 0.0 ± 0.0
Met
1.161MetAla: 1.161 ± 0.293
0.179MetCys: 0.179 ± 0.105
2.411MetAsp: 2.411 ± 0.475
1.518MetGlu: 1.518 ± 0.317
0.982MetPhe: 0.982 ± 0.262
1.34MetGly: 1.34 ± 0.285
0.089MetHis: 0.089 ± 0.082
2.054MetIle: 2.054 ± 0.45
3.304MetLys: 3.304 ± 0.742
2.054MetLeu: 2.054 ± 0.423
0.447MetMet: 0.447 ± 0.295
1.518MetAsn: 1.518 ± 0.483
0.179MetPro: 0.179 ± 0.12
0.714MetGln: 0.714 ± 0.307
0.536MetArg: 0.536 ± 0.173
2.411MetSer: 2.411 ± 0.451
0.893MetThr: 0.893 ± 0.272
1.34MetVal: 1.34 ± 0.36
0.357MetTrp: 0.357 ± 0.193
0.536MetTyr: 0.536 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
3.751AsnAla: 3.751 ± 0.709
0.893AsnCys: 0.893 ± 0.422
3.126AsnAsp: 3.126 ± 0.465
6.251AsnGlu: 6.251 ± 0.834
3.572AsnPhe: 3.572 ± 0.561
6.698AsnGly: 6.698 ± 0.6
0.357AsnHis: 0.357 ± 0.162
8.93AsnIle: 8.93 ± 1.178
6.698AsnLys: 6.698 ± 0.675
6.34AsnLeu: 6.34 ± 0.829
2.054AsnMet: 2.054 ± 0.419
7.859AsnAsn: 7.859 ± 0.921
2.143AsnPro: 2.143 ± 0.308
1.607AsnGln: 1.607 ± 0.411
2.411AsnArg: 2.411 ± 0.543
5.894AsnSer: 5.894 ± 0.822
4.733AsnThr: 4.733 ± 0.98
4.019AsnVal: 4.019 ± 0.766
0.447AsnTrp: 0.447 ± 0.199
3.661AsnTyr: 3.661 ± 0.677
0.0AsnXaa: 0.0 ± 0.0
Pro
0.804ProAla: 0.804 ± 0.242
0.089ProCys: 0.089 ± 0.087
1.34ProAsp: 1.34 ± 0.28
1.429ProGlu: 1.429 ± 0.368
0.893ProPhe: 0.893 ± 0.207
1.25ProGly: 1.25 ± 0.395
0.268ProHis: 0.268 ± 0.188
2.143ProIle: 2.143 ± 0.437
1.34ProLys: 1.34 ± 0.418
1.34ProLeu: 1.34 ± 0.262
0.447ProMet: 0.447 ± 0.155
1.965ProAsn: 1.965 ± 0.363
0.714ProPro: 0.714 ± 0.254
0.536ProGln: 0.536 ± 0.173
0.893ProArg: 0.893 ± 0.222
1.429ProSer: 1.429 ± 0.396
1.25ProThr: 1.25 ± 0.327
1.518ProVal: 1.518 ± 0.44
0.357ProTrp: 0.357 ± 0.249
1.161ProTyr: 1.161 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
2.5GlnAla: 2.5 ± 0.512
0.447GlnCys: 0.447 ± 0.236
1.161GlnAsp: 1.161 ± 0.248
2.411GlnGlu: 2.411 ± 0.363
0.804GlnPhe: 0.804 ± 0.291
2.768GlnGly: 2.768 ± 0.555
0.357GlnHis: 0.357 ± 0.16
2.411GlnIle: 2.411 ± 0.451
2.322GlnLys: 2.322 ± 0.474
3.126GlnLeu: 3.126 ± 0.504
0.625GlnMet: 0.625 ± 0.238
1.875GlnAsn: 1.875 ± 0.382
0.447GlnPro: 0.447 ± 0.219
1.161GlnGln: 1.161 ± 0.306
0.714GlnArg: 0.714 ± 0.252
1.965GlnSer: 1.965 ± 0.295
1.161GlnThr: 1.161 ± 0.262
2.411GlnVal: 2.411 ± 0.353
0.268GlnTrp: 0.268 ± 0.179
1.25GlnTyr: 1.25 ± 0.366
0.0GlnXaa: 0.0 ± 0.0
Arg
1.429ArgAla: 1.429 ± 0.31
0.536ArgCys: 0.536 ± 0.214
2.411ArgAsp: 2.411 ± 0.527
3.215ArgGlu: 3.215 ± 0.631
1.607ArgPhe: 1.607 ± 0.408
1.161ArgGly: 1.161 ± 0.383
0.179ArgHis: 0.179 ± 0.114
3.929ArgIle: 3.929 ± 0.779
2.768ArgLys: 2.768 ± 0.524
2.322ArgLeu: 2.322 ± 0.448
0.804ArgMet: 0.804 ± 0.243
2.5ArgAsn: 2.5 ± 0.444
1.161ArgPro: 1.161 ± 0.327
0.804ArgGln: 0.804 ± 0.263
0.893ArgArg: 0.893 ± 0.329
1.161ArgSer: 1.161 ± 0.275
1.518ArgThr: 1.518 ± 0.382
0.893ArgVal: 0.893 ± 0.246
0.447ArgTrp: 0.447 ± 0.183
1.25ArgTyr: 1.25 ± 0.446
0.0ArgXaa: 0.0 ± 0.0
Ser
3.572SerAla: 3.572 ± 0.511
0.447SerCys: 0.447 ± 0.155
4.465SerAsp: 4.465 ± 0.751
4.822SerGlu: 4.822 ± 0.544
2.768SerPhe: 2.768 ± 0.458
3.572SerGly: 3.572 ± 0.901
0.536SerHis: 0.536 ± 0.186
6.251SerIle: 6.251 ± 1.061
6.162SerLys: 6.162 ± 0.791
4.912SerLeu: 4.912 ± 0.746
2.411SerMet: 2.411 ± 0.463
8.037SerAsn: 8.037 ± 1.124
1.34SerPro: 1.34 ± 0.287
1.786SerGln: 1.786 ± 0.359
1.518SerArg: 1.518 ± 0.35
5.179SerSer: 5.179 ± 0.697
3.572SerThr: 3.572 ± 0.667
3.304SerVal: 3.304 ± 0.659
0.893SerTrp: 0.893 ± 0.309
2.054SerTyr: 2.054 ± 0.48
0.0SerXaa: 0.0 ± 0.0
Thr
3.661ThrAla: 3.661 ± 0.834
0.536ThrCys: 0.536 ± 0.188
2.59ThrAsp: 2.59 ± 0.508
3.215ThrGlu: 3.215 ± 0.438
2.768ThrPhe: 2.768 ± 0.562
4.197ThrGly: 4.197 ± 0.779
0.536ThrHis: 0.536 ± 0.212
5.715ThrIle: 5.715 ± 0.839
4.465ThrLys: 4.465 ± 0.496
4.465ThrLeu: 4.465 ± 0.837
0.804ThrMet: 0.804 ± 0.276
4.733ThrAsn: 4.733 ± 0.779
1.965ThrPro: 1.965 ± 0.621
2.143ThrGln: 2.143 ± 0.525
1.072ThrArg: 1.072 ± 0.344
3.751ThrSer: 3.751 ± 0.856
5.09ThrThr: 5.09 ± 0.788
3.84ThrVal: 3.84 ± 0.679
0.982ThrTrp: 0.982 ± 0.473
2.054ThrTyr: 2.054 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
1.697ValAla: 1.697 ± 0.4
0.804ValCys: 0.804 ± 0.288
4.376ValAsp: 4.376 ± 0.712
4.286ValGlu: 4.286 ± 0.721
1.965ValPhe: 1.965 ± 0.397
2.5ValGly: 2.5 ± 0.649
0.447ValHis: 0.447 ± 0.18
4.465ValIle: 4.465 ± 0.528
5.715ValLys: 5.715 ± 0.717
3.393ValLeu: 3.393 ± 0.44
1.161ValMet: 1.161 ± 0.281
4.644ValAsn: 4.644 ± 0.738
1.161ValPro: 1.161 ± 0.329
1.875ValGln: 1.875 ± 0.385
2.054ValArg: 2.054 ± 0.443
3.929ValSer: 3.929 ± 0.479
3.751ValThr: 3.751 ± 0.674
4.197ValVal: 4.197 ± 0.621
0.536ValTrp: 0.536 ± 0.186
1.786ValTyr: 1.786 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
0.268TrpAla: 0.268 ± 0.138
0.268TrpCys: 0.268 ± 0.148
0.893TrpAsp: 0.893 ± 0.255
0.447TrpGlu: 0.447 ± 0.188
0.625TrpPhe: 0.625 ± 0.268
0.714TrpGly: 0.714 ± 0.337
0.089TrpHis: 0.089 ± 0.082
1.607TrpIle: 1.607 ± 0.396
0.714TrpLys: 0.714 ± 0.255
0.982TrpLeu: 0.982 ± 0.262
0.357TrpMet: 0.357 ± 0.214
1.25TrpAsn: 1.25 ± 0.376
0.179TrpPro: 0.179 ± 0.115
0.357TrpGln: 0.357 ± 0.194
0.447TrpArg: 0.447 ± 0.188
0.268TrpSer: 0.268 ± 0.132
0.625TrpThr: 0.625 ± 0.214
0.089TrpVal: 0.089 ± 0.094
0.0TrpTrp: 0.0 ± 0.0
0.447TrpTyr: 0.447 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.233TyrAla: 2.233 ± 0.497
0.625TyrCys: 0.625 ± 0.29
3.929TyrAsp: 3.929 ± 0.77
3.126TyrGlu: 3.126 ± 0.535
2.054TyrPhe: 2.054 ± 0.424
2.322TyrGly: 2.322 ± 0.484
0.625TyrHis: 0.625 ± 0.216
4.019TyrIle: 4.019 ± 0.64
3.84TyrLys: 3.84 ± 0.482
2.5TyrLeu: 2.5 ± 0.451
0.982TyrMet: 0.982 ± 0.281
3.84TyrAsn: 3.84 ± 0.615
1.161TyrPro: 1.161 ± 0.353
1.161TyrGln: 1.161 ± 0.446
1.429TyrArg: 1.429 ± 0.339
3.572TyrSer: 3.572 ± 0.507
1.965TyrThr: 1.965 ± 0.48
3.215TyrVal: 3.215 ± 0.612
0.0TyrTrp: 0.0 ± 0.0
2.143TyrTyr: 2.143 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11199 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski