Amino acid dipepetide frequency for Escherichia phage HK446

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.882AlaAla: 10.882 ± 1.483
1.012AlaCys: 1.012 ± 0.282
7.002AlaAsp: 7.002 ± 0.818
5.821AlaGlu: 5.821 ± 0.709
3.206AlaPhe: 3.206 ± 0.449
7.508AlaGly: 7.508 ± 0.68
1.603AlaHis: 1.603 ± 0.358
6.58AlaIle: 6.58 ± 0.661
4.64AlaLys: 4.64 ± 0.587
7.592AlaLeu: 7.592 ± 0.851
4.387AlaMet: 4.387 ± 0.553
5.062AlaAsn: 5.062 ± 0.632
1.772AlaPro: 1.772 ± 0.334
4.555AlaGln: 4.555 ± 0.59
4.809AlaArg: 4.809 ± 0.588
5.821AlaSer: 5.821 ± 1.128
5.315AlaThr: 5.315 ± 0.807
6.074AlaVal: 6.074 ± 0.72
1.687AlaTrp: 1.687 ± 0.385
2.531AlaTyr: 2.531 ± 0.43
0.0AlaXaa: 0.0 ± 0.0
Cys
1.265CysAla: 1.265 ± 0.407
0.169CysCys: 0.169 ± 0.133
1.012CysAsp: 1.012 ± 0.283
0.759CysGlu: 0.759 ± 0.239
0.169CysPhe: 0.169 ± 0.111
1.012CysGly: 1.012 ± 0.362
0.422CysHis: 0.422 ± 0.22
0.844CysIle: 0.844 ± 0.233
1.012CysLys: 1.012 ± 0.326
0.844CysLeu: 0.844 ± 0.342
0.169CysMet: 0.169 ± 0.116
0.844CysAsn: 0.844 ± 0.235
0.337CysPro: 0.337 ± 0.176
0.253CysGln: 0.253 ± 0.157
0.675CysArg: 0.675 ± 0.266
0.759CysSer: 0.759 ± 0.286
0.759CysThr: 0.759 ± 0.226
0.422CysVal: 0.422 ± 0.172
0.253CysTrp: 0.253 ± 0.204
0.253CysTyr: 0.253 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
6.58AspAla: 6.58 ± 0.672
0.337AspCys: 0.337 ± 0.216
2.953AspAsp: 2.953 ± 0.427
4.387AspGlu: 4.387 ± 0.698
1.856AspPhe: 1.856 ± 0.482
5.483AspGly: 5.483 ± 0.831
0.337AspHis: 0.337 ± 0.179
2.446AspIle: 2.446 ± 0.348
3.206AspLys: 3.206 ± 0.514
5.568AspLeu: 5.568 ± 0.623
1.518AspMet: 1.518 ± 0.328
2.784AspAsn: 2.784 ± 0.455
1.603AspPro: 1.603 ± 0.325
2.025AspGln: 2.025 ± 0.405
3.206AspArg: 3.206 ± 0.567
3.459AspSer: 3.459 ± 0.549
2.362AspThr: 2.362 ± 0.513
4.387AspVal: 4.387 ± 0.677
0.422AspTrp: 0.422 ± 0.183
2.531AspTyr: 2.531 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
5.315GluAla: 5.315 ± 0.84
1.012GluCys: 1.012 ± 0.401
1.856GluAsp: 1.856 ± 0.306
4.893GluGlu: 4.893 ± 0.825
2.109GluPhe: 2.109 ± 0.355
4.049GluGly: 4.049 ± 0.571
0.844GluHis: 0.844 ± 0.25
4.302GluIle: 4.302 ± 0.396
4.049GluLys: 4.049 ± 0.63
5.736GluLeu: 5.736 ± 0.713
1.518GluMet: 1.518 ± 0.359
3.881GluAsn: 3.881 ± 0.466
1.94GluPro: 1.94 ± 0.467
3.206GluGln: 3.206 ± 0.56
4.218GluArg: 4.218 ± 0.722
3.627GluSer: 3.627 ± 0.53
3.121GluThr: 3.121 ± 0.63
3.29GluVal: 3.29 ± 0.586
1.265GluTrp: 1.265 ± 0.31
1.603GluTyr: 1.603 ± 0.339
0.0GluXaa: 0.0 ± 0.0
Phe
3.121PheAla: 3.121 ± 0.444
0.422PheCys: 0.422 ± 0.177
1.772PheAsp: 1.772 ± 0.396
2.025PheGlu: 2.025 ± 0.433
0.675PhePhe: 0.675 ± 0.261
2.7PheGly: 2.7 ± 0.435
0.506PheHis: 0.506 ± 0.198
2.362PheIle: 2.362 ± 0.541
1.35PheLys: 1.35 ± 0.308
1.772PheLeu: 1.772 ± 0.388
0.506PheMet: 0.506 ± 0.161
1.687PheAsn: 1.687 ± 0.314
1.265PhePro: 1.265 ± 0.291
0.928PheGln: 0.928 ± 0.286
1.94PheArg: 1.94 ± 0.477
2.362PheSer: 2.362 ± 0.395
2.362PheThr: 2.362 ± 0.464
2.109PheVal: 2.109 ± 0.413
0.506PheTrp: 0.506 ± 0.181
1.181PheTyr: 1.181 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
6.243GlyAla: 6.243 ± 0.939
0.844GlyCys: 0.844 ± 0.301
4.977GlyAsp: 4.977 ± 0.606
4.134GlyGlu: 4.134 ± 0.486
2.868GlyPhe: 2.868 ± 0.535
6.411GlyGly: 6.411 ± 0.76
1.35GlyHis: 1.35 ± 0.326
3.796GlyIle: 3.796 ± 0.456
4.218GlyLys: 4.218 ± 0.496
6.158GlyLeu: 6.158 ± 0.835
2.615GlyMet: 2.615 ± 0.44
4.471GlyAsn: 4.471 ± 0.775
1.35GlyPro: 1.35 ± 0.385
3.206GlyGln: 3.206 ± 0.447
4.049GlyArg: 4.049 ± 0.546
3.965GlySer: 3.965 ± 0.649
5.821GlyThr: 5.821 ± 0.752
5.146GlyVal: 5.146 ± 0.596
1.012GlyTrp: 1.012 ± 0.22
2.278GlyTyr: 2.278 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.359
0.253HisCys: 0.253 ± 0.149
0.844HisAsp: 0.844 ± 0.245
0.844HisGlu: 0.844 ± 0.281
0.675HisPhe: 0.675 ± 0.271
1.097HisGly: 1.097 ± 0.316
0.422HisHis: 0.422 ± 0.188
0.759HisIle: 0.759 ± 0.287
1.181HisLys: 1.181 ± 0.373
1.012HisLeu: 1.012 ± 0.353
0.169HisMet: 0.169 ± 0.112
0.506HisAsn: 0.506 ± 0.17
1.012HisPro: 1.012 ± 0.281
1.012HisGln: 1.012 ± 0.307
1.687HisArg: 1.687 ± 0.375
0.928HisSer: 0.928 ± 0.296
0.844HisThr: 0.844 ± 0.26
0.675HisVal: 0.675 ± 0.294
0.253HisTrp: 0.253 ± 0.129
0.337HisTyr: 0.337 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.977IleAla: 4.977 ± 0.579
0.591IleCys: 0.591 ± 0.287
5.483IleAsp: 5.483 ± 0.644
3.206IleGlu: 3.206 ± 0.474
1.94IlePhe: 1.94 ± 0.458
3.459IleGly: 3.459 ± 0.551
0.928IleHis: 0.928 ± 0.257
3.712IleIle: 3.712 ± 0.575
2.615IleLys: 2.615 ± 0.459
3.965IleLeu: 3.965 ± 0.553
1.265IleMet: 1.265 ± 0.423
3.121IleAsn: 3.121 ± 0.462
2.193IlePro: 2.193 ± 0.441
2.953IleGln: 2.953 ± 0.607
3.543IleArg: 3.543 ± 0.483
5.23IleSer: 5.23 ± 0.549
3.796IleThr: 3.796 ± 0.419
3.037IleVal: 3.037 ± 0.568
0.928IleTrp: 0.928 ± 0.313
1.181IleTyr: 1.181 ± 0.372
0.0IleXaa: 0.0 ± 0.0
Lys
5.821LysAla: 5.821 ± 0.728
0.675LysCys: 0.675 ± 0.376
3.459LysAsp: 3.459 ± 0.644
3.881LysGlu: 3.881 ± 0.695
1.518LysPhe: 1.518 ± 0.383
3.121LysGly: 3.121 ± 0.448
0.675LysHis: 0.675 ± 0.235
2.531LysIle: 2.531 ± 0.451
3.459LysLys: 3.459 ± 0.594
3.796LysLeu: 3.796 ± 0.589
1.856LysMet: 1.856 ± 0.441
2.615LysAsn: 2.615 ± 0.431
3.459LysPro: 3.459 ± 0.659
3.627LysGln: 3.627 ± 0.603
2.868LysArg: 2.868 ± 0.639
4.64LysSer: 4.64 ± 0.719
3.965LysThr: 3.965 ± 0.556
3.206LysVal: 3.206 ± 0.47
0.844LysTrp: 0.844 ± 0.28
2.109LysTyr: 2.109 ± 0.51
0.0LysXaa: 0.0 ± 0.0
Leu
8.267LeuAla: 8.267 ± 0.788
1.434LeuCys: 1.434 ± 0.416
3.627LeuAsp: 3.627 ± 0.477
4.893LeuGlu: 4.893 ± 0.617
1.856LeuPhe: 1.856 ± 0.366
4.218LeuGly: 4.218 ± 0.837
1.265LeuHis: 1.265 ± 0.409
5.399LeuIle: 5.399 ± 0.681
5.568LeuLys: 5.568 ± 0.635
5.399LeuLeu: 5.399 ± 0.592
2.025LeuMet: 2.025 ± 0.474
4.387LeuAsn: 4.387 ± 0.639
3.121LeuPro: 3.121 ± 0.498
2.953LeuGln: 2.953 ± 0.47
5.905LeuArg: 5.905 ± 0.761
5.99LeuSer: 5.99 ± 0.757
4.977LeuThr: 4.977 ± 0.612
4.471LeuVal: 4.471 ± 0.627
0.337LeuTrp: 0.337 ± 0.174
1.687LeuTyr: 1.687 ± 0.348
0.0LeuXaa: 0.0 ± 0.0
Met
3.374MetAla: 3.374 ± 0.462
0.422MetCys: 0.422 ± 0.182
1.603MetAsp: 1.603 ± 0.359
1.265MetGlu: 1.265 ± 0.337
0.506MetPhe: 0.506 ± 0.201
1.434MetGly: 1.434 ± 0.251
0.506MetHis: 0.506 ± 0.245
1.181MetIle: 1.181 ± 0.311
2.193MetLys: 2.193 ± 0.382
2.025MetLeu: 2.025 ± 0.322
0.844MetMet: 0.844 ± 0.189
0.591MetAsn: 0.591 ± 0.235
1.265MetPro: 1.265 ± 0.292
1.265MetGln: 1.265 ± 0.235
2.025MetArg: 2.025 ± 0.346
2.615MetSer: 2.615 ± 0.566
2.109MetThr: 2.109 ± 0.406
1.265MetVal: 1.265 ± 0.314
0.337MetTrp: 0.337 ± 0.143
0.591MetTyr: 0.591 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
5.736AsnAla: 5.736 ± 0.719
0.422AsnCys: 0.422 ± 0.198
1.94AsnAsp: 1.94 ± 0.316
2.784AsnGlu: 2.784 ± 0.471
1.265AsnPhe: 1.265 ± 0.342
5.568AsnGly: 5.568 ± 0.616
1.181AsnHis: 1.181 ± 0.383
3.374AsnIle: 3.374 ± 0.55
1.856AsnLys: 1.856 ± 0.376
3.29AsnLeu: 3.29 ± 0.549
0.759AsnMet: 0.759 ± 0.216
2.278AsnAsn: 2.278 ± 0.462
2.784AsnPro: 2.784 ± 0.614
2.531AsnGln: 2.531 ± 0.413
2.278AsnArg: 2.278 ± 0.408
2.531AsnSer: 2.531 ± 0.487
3.374AsnThr: 3.374 ± 0.584
2.025AsnVal: 2.025 ± 0.46
0.844AsnTrp: 0.844 ± 0.198
1.518AsnTyr: 1.518 ± 0.399
0.0AsnXaa: 0.0 ± 0.0
Pro
3.459ProAla: 3.459 ± 0.45
0.591ProCys: 0.591 ± 0.243
1.518ProAsp: 1.518 ± 0.361
2.193ProGlu: 2.193 ± 0.418
1.35ProPhe: 1.35 ± 0.36
3.121ProGly: 3.121 ± 0.394
0.591ProHis: 0.591 ± 0.22
1.265ProIle: 1.265 ± 0.354
2.193ProLys: 2.193 ± 0.376
3.037ProLeu: 3.037 ± 0.64
1.181ProMet: 1.181 ± 0.354
1.097ProAsn: 1.097 ± 0.312
1.35ProPro: 1.35 ± 0.295
1.856ProGln: 1.856 ± 0.437
2.109ProArg: 2.109 ± 0.396
2.615ProSer: 2.615 ± 0.476
1.603ProThr: 1.603 ± 0.423
3.459ProVal: 3.459 ± 0.638
0.506ProTrp: 0.506 ± 0.262
1.012ProTyr: 1.012 ± 0.208
0.0ProXaa: 0.0 ± 0.0
Gln
4.809GlnAla: 4.809 ± 0.8
0.422GlnCys: 0.422 ± 0.203
1.687GlnAsp: 1.687 ± 0.374
2.278GlnGlu: 2.278 ± 0.317
1.181GlnPhe: 1.181 ± 0.301
2.193GlnGly: 2.193 ± 0.483
0.675GlnHis: 0.675 ± 0.223
2.615GlnIle: 2.615 ± 0.367
3.206GlnLys: 3.206 ± 0.565
4.302GlnLeu: 4.302 ± 0.667
1.35GlnMet: 1.35 ± 0.357
2.7GlnAsn: 2.7 ± 0.62
1.518GlnPro: 1.518 ± 0.359
3.29GlnGln: 3.29 ± 0.582
2.446GlnArg: 2.446 ± 0.601
3.796GlnSer: 3.796 ± 0.553
2.953GlnThr: 2.953 ± 0.541
3.459GlnVal: 3.459 ± 0.448
0.844GlnTrp: 0.844 ± 0.289
1.097GlnTyr: 1.097 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
4.64ArgAla: 4.64 ± 0.753
0.928ArgCys: 0.928 ± 0.347
3.543ArgAsp: 3.543 ± 0.582
4.134ArgGlu: 4.134 ± 0.67
2.193ArgPhe: 2.193 ± 0.425
3.627ArgGly: 3.627 ± 0.444
1.434ArgHis: 1.434 ± 0.359
2.953ArgIle: 2.953 ± 0.509
4.471ArgLys: 4.471 ± 0.653
5.652ArgLeu: 5.652 ± 0.634
1.518ArgMet: 1.518 ± 0.327
3.374ArgAsn: 3.374 ± 0.52
1.35ArgPro: 1.35 ± 0.331
2.868ArgGln: 2.868 ± 0.467
3.459ArgArg: 3.459 ± 0.603
2.868ArgSer: 2.868 ± 0.464
3.037ArgThr: 3.037 ± 0.382
3.796ArgVal: 3.796 ± 0.661
1.603ArgTrp: 1.603 ± 0.388
2.278ArgTyr: 2.278 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
7.002SerAla: 7.002 ± 0.748
0.506SerCys: 0.506 ± 0.186
4.302SerAsp: 4.302 ± 0.592
3.543SerGlu: 3.543 ± 0.515
2.531SerPhe: 2.531 ± 0.487
5.821SerGly: 5.821 ± 0.809
0.675SerHis: 0.675 ± 0.26
3.881SerIle: 3.881 ± 0.613
3.206SerLys: 3.206 ± 0.628
4.893SerLeu: 4.893 ± 0.712
1.94SerMet: 1.94 ± 0.396
2.953SerAsn: 2.953 ± 0.435
2.868SerPro: 2.868 ± 0.505
4.302SerGln: 4.302 ± 0.633
4.893SerArg: 4.893 ± 0.451
4.471SerSer: 4.471 ± 0.8
3.121SerThr: 3.121 ± 0.506
5.652SerVal: 5.652 ± 0.888
1.434SerTrp: 1.434 ± 0.292
0.928SerTyr: 0.928 ± 0.202
0.0SerXaa: 0.0 ± 0.0
Thr
5.821ThrAla: 5.821 ± 0.601
0.675ThrCys: 0.675 ± 0.21
3.881ThrAsp: 3.881 ± 0.522
3.459ThrGlu: 3.459 ± 0.578
2.531ThrPhe: 2.531 ± 0.583
5.652ThrGly: 5.652 ± 0.774
0.506ThrHis: 0.506 ± 0.186
3.965ThrIle: 3.965 ± 0.554
3.29ThrLys: 3.29 ± 0.574
3.881ThrLeu: 3.881 ± 0.497
1.434ThrMet: 1.434 ± 0.277
2.109ThrAsn: 2.109 ± 0.346
2.953ThrPro: 2.953 ± 0.526
2.109ThrGln: 2.109 ± 0.346
2.278ThrArg: 2.278 ± 0.352
4.471ThrSer: 4.471 ± 0.559
2.531ThrThr: 2.531 ± 0.512
4.049ThrVal: 4.049 ± 0.465
1.35ThrTrp: 1.35 ± 0.357
1.35ThrTyr: 1.35 ± 0.396
0.0ThrXaa: 0.0 ± 0.0
Val
5.568ValAla: 5.568 ± 0.619
0.506ValCys: 0.506 ± 0.217
3.29ValAsp: 3.29 ± 0.417
4.555ValGlu: 4.555 ± 0.48
2.193ValPhe: 2.193 ± 0.366
4.471ValGly: 4.471 ± 0.72
1.012ValHis: 1.012 ± 0.299
3.459ValIle: 3.459 ± 0.716
4.64ValLys: 4.64 ± 0.644
5.315ValLeu: 5.315 ± 0.774
1.687ValMet: 1.687 ± 0.359
2.784ValAsn: 2.784 ± 0.456
2.193ValPro: 2.193 ± 0.418
1.687ValGln: 1.687 ± 0.3
3.459ValArg: 3.459 ± 0.426
5.399ValSer: 5.399 ± 0.672
3.543ValThr: 3.543 ± 0.462
5.399ValVal: 5.399 ± 0.922
0.928ValTrp: 0.928 ± 0.29
2.193ValTyr: 2.193 ± 0.407
0.0ValXaa: 0.0 ± 0.0
Trp
0.759TrpAla: 0.759 ± 0.226
0.506TrpCys: 0.506 ± 0.188
0.844TrpAsp: 0.844 ± 0.257
1.012TrpGlu: 1.012 ± 0.274
0.591TrpPhe: 0.591 ± 0.271
1.35TrpGly: 1.35 ± 0.35
0.591TrpHis: 0.591 ± 0.193
1.012TrpIle: 1.012 ± 0.251
0.928TrpLys: 0.928 ± 0.297
1.687TrpLeu: 1.687 ± 0.433
0.422TrpMet: 0.422 ± 0.178
0.337TrpAsn: 0.337 ± 0.165
0.422TrpPro: 0.422 ± 0.179
0.506TrpGln: 0.506 ± 0.201
0.928TrpArg: 0.928 ± 0.245
1.097TrpSer: 1.097 ± 0.244
1.097TrpThr: 1.097 ± 0.371
1.012TrpVal: 1.012 ± 0.248
0.422TrpTrp: 0.422 ± 0.191
0.675TrpTyr: 0.675 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.953TyrAla: 2.953 ± 0.44
0.506TyrCys: 0.506 ± 0.203
1.856TyrAsp: 1.856 ± 0.313
1.772TyrGlu: 1.772 ± 0.494
0.337TyrPhe: 0.337 ± 0.21
2.615TyrGly: 2.615 ± 0.552
0.253TyrHis: 0.253 ± 0.12
1.687TyrIle: 1.687 ± 0.401
0.928TyrLys: 0.928 ± 0.236
1.856TyrLeu: 1.856 ± 0.338
0.169TyrMet: 0.169 ± 0.117
0.759TyrAsn: 0.759 ± 0.234
1.35TyrPro: 1.35 ± 0.401
1.687TyrGln: 1.687 ± 0.381
2.953TyrArg: 2.953 ± 0.52
2.278TyrSer: 2.278 ± 0.468
1.603TyrThr: 1.603 ± 0.392
1.35TyrVal: 1.35 ± 0.308
0.422TyrTrp: 0.422 ± 0.164
0.928TyrTyr: 0.928 ± 0.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11855 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski