Amino acid dipepetide frequency for Escherichia phage PHB10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.373AlaAla: 10.373 ± 1.005
0.784AlaCys: 0.784 ± 0.272
6.333AlaAsp: 6.333 ± 0.688
6.212AlaGlu: 6.212 ± 0.716
3.076AlaPhe: 3.076 ± 0.347
7.84AlaGly: 7.84 ± 0.659
1.146AlaHis: 1.146 ± 0.284
6.091AlaIle: 6.091 ± 0.645
4.403AlaLys: 4.403 ± 0.462
9.046AlaLeu: 9.046 ± 0.79
2.895AlaMet: 2.895 ± 0.425
4.101AlaAsn: 4.101 ± 0.541
2.774AlaPro: 2.774 ± 0.384
5.428AlaGln: 5.428 ± 0.753
4.041AlaArg: 4.041 ± 0.459
6.574AlaSer: 6.574 ± 0.509
5.126AlaThr: 5.126 ± 0.57
6.091AlaVal: 6.091 ± 0.555
1.749AlaTrp: 1.749 ± 0.293
2.714AlaTyr: 2.714 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.603CysAla: 0.603 ± 0.197
0.362CysCys: 0.362 ± 0.154
1.206CysAsp: 1.206 ± 0.354
0.663CysGlu: 0.663 ± 0.168
0.302CysPhe: 0.302 ± 0.153
0.603CysGly: 0.603 ± 0.243
0.181CysHis: 0.181 ± 0.099
0.241CysIle: 0.241 ± 0.109
0.784CysLys: 0.784 ± 0.223
0.844CysLeu: 0.844 ± 0.267
0.241CysMet: 0.241 ± 0.108
0.543CysAsn: 0.543 ± 0.163
1.025CysPro: 1.025 ± 0.325
0.302CysGln: 0.302 ± 0.144
1.146CysArg: 1.146 ± 0.259
1.025CysSer: 1.025 ± 0.305
0.482CysThr: 0.482 ± 0.173
0.543CysVal: 0.543 ± 0.231
0.422CysTrp: 0.422 ± 0.186
0.241CysTyr: 0.241 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
5.669AspAla: 5.669 ± 0.623
0.724AspCys: 0.724 ± 0.232
3.257AspAsp: 3.257 ± 0.413
5.066AspGlu: 5.066 ± 0.68
1.689AspPhe: 1.689 ± 0.258
5.85AspGly: 5.85 ± 0.716
0.844AspHis: 0.844 ± 0.277
4.222AspIle: 4.222 ± 0.472
2.412AspLys: 2.412 ± 0.454
4.825AspLeu: 4.825 ± 0.628
1.568AspMet: 1.568 ± 0.31
2.231AspAsn: 2.231 ± 0.359
1.508AspPro: 1.508 ± 0.343
1.809AspGln: 1.809 ± 0.326
3.498AspArg: 3.498 ± 0.574
4.222AspSer: 4.222 ± 0.57
3.196AspThr: 3.196 ± 0.461
3.015AspVal: 3.015 ± 0.517
0.905AspTrp: 0.905 ± 0.263
2.593AspTyr: 2.593 ± 0.557
0.0AspXaa: 0.0 ± 0.0
Glu
5.126GluAla: 5.126 ± 0.496
0.603GluCys: 0.603 ± 0.206
2.895GluAsp: 2.895 ± 0.359
3.679GluGlu: 3.679 ± 0.637
2.171GluPhe: 2.171 ± 0.288
4.403GluGly: 4.403 ± 0.563
1.327GluHis: 1.327 ± 0.292
3.739GluIle: 3.739 ± 0.495
3.257GluLys: 3.257 ± 0.441
6.936GluLeu: 6.936 ± 0.621
1.206GluMet: 1.206 ± 0.274
2.895GluAsn: 2.895 ± 0.432
2.412GluPro: 2.412 ± 0.399
2.835GluGln: 2.835 ± 0.56
3.558GluArg: 3.558 ± 0.496
3.076GluSer: 3.076 ± 0.552
3.377GluThr: 3.377 ± 0.621
3.377GluVal: 3.377 ± 0.565
1.749GluTrp: 1.749 ± 0.409
1.749GluTyr: 1.749 ± 0.388
0.0GluXaa: 0.0 ± 0.0
Phe
3.377PheAla: 3.377 ± 0.541
0.844PheCys: 0.844 ± 0.239
2.473PheAsp: 2.473 ± 0.387
1.447PheGlu: 1.447 ± 0.266
1.327PhePhe: 1.327 ± 0.288
2.955PheGly: 2.955 ± 0.456
0.543PheHis: 0.543 ± 0.182
1.99PheIle: 1.99 ± 0.334
1.387PheLys: 1.387 ± 0.337
2.593PheLeu: 2.593 ± 0.507
0.844PheMet: 0.844 ± 0.218
2.171PheAsn: 2.171 ± 0.402
1.809PhePro: 1.809 ± 0.416
0.905PheGln: 0.905 ± 0.236
1.93PheArg: 1.93 ± 0.303
2.955PheSer: 2.955 ± 0.426
2.774PheThr: 2.774 ± 0.48
2.051PheVal: 2.051 ± 0.281
0.302PheTrp: 0.302 ± 0.119
1.267PheTyr: 1.267 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
5.971GlyAla: 5.971 ± 0.664
0.905GlyCys: 0.905 ± 0.25
4.403GlyAsp: 4.403 ± 0.726
4.041GlyGlu: 4.041 ± 0.522
2.835GlyPhe: 2.835 ± 0.414
5.729GlyGly: 5.729 ± 0.938
1.086GlyHis: 1.086 ± 0.246
6.091GlyIle: 6.091 ± 0.565
4.704GlyLys: 4.704 ± 0.525
5.488GlyLeu: 5.488 ± 0.7
2.835GlyMet: 2.835 ± 0.522
3.558GlyAsn: 3.558 ± 0.581
0.905GlyPro: 0.905 ± 0.223
2.593GlyGln: 2.593 ± 0.367
4.342GlyArg: 4.342 ± 0.519
4.644GlySer: 4.644 ± 0.632
4.523GlyThr: 4.523 ± 0.816
5.729GlyVal: 5.729 ± 0.545
1.206GlyTrp: 1.206 ± 0.316
3.196GlyTyr: 3.196 ± 0.413
0.0GlyXaa: 0.0 ± 0.0
His
1.508HisAla: 1.508 ± 0.27
0.181HisCys: 0.181 ± 0.13
0.784HisAsp: 0.784 ± 0.18
0.905HisGlu: 0.905 ± 0.217
0.663HisPhe: 0.663 ± 0.191
1.387HisGly: 1.387 ± 0.301
0.482HisHis: 0.482 ± 0.17
0.965HisIle: 0.965 ± 0.204
0.663HisLys: 0.663 ± 0.187
1.568HisLeu: 1.568 ± 0.382
0.181HisMet: 0.181 ± 0.093
0.905HisAsn: 0.905 ± 0.198
0.724HisPro: 0.724 ± 0.221
0.784HisGln: 0.784 ± 0.211
1.086HisArg: 1.086 ± 0.288
0.965HisSer: 0.965 ± 0.245
0.844HisThr: 0.844 ± 0.214
0.422HisVal: 0.422 ± 0.16
0.241HisTrp: 0.241 ± 0.114
0.663HisTyr: 0.663 ± 0.205
0.0HisXaa: 0.0 ± 0.0
Ile
5.91IleAla: 5.91 ± 0.513
1.086IleCys: 1.086 ± 0.278
4.342IleAsp: 4.342 ± 0.478
3.377IleGlu: 3.377 ± 0.416
2.292IlePhe: 2.292 ± 0.365
3.317IleGly: 3.317 ± 0.508
0.482IleHis: 0.482 ± 0.168
2.895IleIle: 2.895 ± 0.376
2.654IleLys: 2.654 ± 0.374
3.739IleLeu: 3.739 ± 0.663
1.025IleMet: 1.025 ± 0.212
4.161IleAsn: 4.161 ± 0.384
2.352IlePro: 2.352 ± 0.342
2.352IleGln: 2.352 ± 0.442
3.317IleArg: 3.317 ± 0.466
3.8IleSer: 3.8 ± 0.637
3.98IleThr: 3.98 ± 0.481
3.8IleVal: 3.8 ± 0.524
0.844IleTrp: 0.844 ± 0.216
1.568IleTyr: 1.568 ± 0.284
0.0IleXaa: 0.0 ± 0.0
Lys
5.247LysAla: 5.247 ± 0.71
0.422LysCys: 0.422 ± 0.21
2.835LysAsp: 2.835 ± 0.485
3.076LysGlu: 3.076 ± 0.503
1.689LysPhe: 1.689 ± 0.287
3.498LysGly: 3.498 ± 0.47
0.784LysHis: 0.784 ± 0.209
2.714LysIle: 2.714 ± 0.419
3.015LysLys: 3.015 ± 0.565
3.86LysLeu: 3.86 ± 0.554
1.146LysMet: 1.146 ± 0.353
2.171LysAsn: 2.171 ± 0.387
3.015LysPro: 3.015 ± 0.484
3.076LysGln: 3.076 ± 0.525
3.015LysArg: 3.015 ± 0.48
3.076LysSer: 3.076 ± 0.473
3.558LysThr: 3.558 ± 0.519
3.257LysVal: 3.257 ± 0.419
0.482LysTrp: 0.482 ± 0.215
2.111LysTyr: 2.111 ± 0.385
0.0LysXaa: 0.0 ± 0.0
Leu
8.866LeuAla: 8.866 ± 0.894
1.025LeuCys: 1.025 ± 0.28
5.006LeuAsp: 5.006 ± 0.522
6.091LeuGlu: 6.091 ± 0.661
2.654LeuPhe: 2.654 ± 0.486
5.91LeuGly: 5.91 ± 0.777
1.086LeuHis: 1.086 ± 0.258
4.282LeuIle: 4.282 ± 0.733
5.187LeuLys: 5.187 ± 0.564
7.117LeuLeu: 7.117 ± 0.875
1.689LeuMet: 1.689 ± 0.32
3.8LeuAsn: 3.8 ± 0.478
4.342LeuPro: 4.342 ± 0.562
3.86LeuGln: 3.86 ± 0.63
5.368LeuArg: 5.368 ± 0.621
6.574LeuSer: 6.574 ± 0.827
4.885LeuThr: 4.885 ± 0.67
5.488LeuVal: 5.488 ± 0.495
0.844LeuTrp: 0.844 ± 0.222
2.292LeuTyr: 2.292 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
2.774MetAla: 2.774 ± 0.315
0.241MetCys: 0.241 ± 0.133
1.568MetAsp: 1.568 ± 0.258
0.905MetGlu: 0.905 ± 0.2
0.965MetPhe: 0.965 ± 0.247
1.447MetGly: 1.447 ± 0.305
0.603MetHis: 0.603 ± 0.172
1.327MetIle: 1.327 ± 0.281
2.473MetLys: 2.473 ± 0.368
2.051MetLeu: 2.051 ± 0.361
0.422MetMet: 0.422 ± 0.169
0.784MetAsn: 0.784 ± 0.207
0.724MetPro: 0.724 ± 0.212
0.965MetGln: 0.965 ± 0.212
1.387MetArg: 1.387 ± 0.278
1.749MetSer: 1.749 ± 0.347
1.447MetThr: 1.447 ± 0.294
1.327MetVal: 1.327 ± 0.236
0.482MetTrp: 0.482 ± 0.168
0.543MetTyr: 0.543 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
4.222AsnAla: 4.222 ± 0.483
0.663AsnCys: 0.663 ± 0.198
2.051AsnAsp: 2.051 ± 0.416
2.352AsnGlu: 2.352 ± 0.301
1.086AsnPhe: 1.086 ± 0.273
4.222AsnGly: 4.222 ± 0.492
0.603AsnHis: 0.603 ± 0.202
2.533AsnIle: 2.533 ± 0.472
2.714AsnLys: 2.714 ± 0.429
3.92AsnLeu: 3.92 ± 0.449
0.965AsnMet: 0.965 ± 0.266
1.99AsnAsn: 1.99 ± 0.346
2.231AsnPro: 2.231 ± 0.426
1.508AsnGln: 1.508 ± 0.354
2.352AsnArg: 2.352 ± 0.354
2.895AsnSer: 2.895 ± 0.338
3.619AsnThr: 3.619 ± 0.524
2.955AsnVal: 2.955 ± 0.435
0.844AsnTrp: 0.844 ± 0.263
1.749AsnTyr: 1.749 ± 0.414
0.0AsnXaa: 0.0 ± 0.0
Pro
4.282ProAla: 4.282 ± 0.436
0.241ProCys: 0.241 ± 0.116
3.498ProAsp: 3.498 ± 0.529
3.317ProGlu: 3.317 ± 0.673
1.568ProPhe: 1.568 ± 0.255
3.257ProGly: 3.257 ± 0.444
0.422ProHis: 0.422 ± 0.163
1.267ProIle: 1.267 ± 0.285
1.689ProLys: 1.689 ± 0.308
3.076ProLeu: 3.076 ± 0.558
0.905ProMet: 0.905 ± 0.256
1.93ProAsn: 1.93 ± 0.354
1.447ProPro: 1.447 ± 0.283
1.327ProGln: 1.327 ± 0.332
2.051ProArg: 2.051 ± 0.355
2.654ProSer: 2.654 ± 0.408
2.231ProThr: 2.231 ± 0.438
3.196ProVal: 3.196 ± 0.418
0.543ProTrp: 0.543 ± 0.19
1.146ProTyr: 1.146 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
4.342GlnAla: 4.342 ± 0.667
0.422GlnCys: 0.422 ± 0.163
1.508GlnAsp: 1.508 ± 0.266
2.292GlnGlu: 2.292 ± 0.353
1.267GlnPhe: 1.267 ± 0.222
1.689GlnGly: 1.689 ± 0.318
0.844GlnHis: 0.844 ± 0.232
2.051GlnIle: 2.051 ± 0.347
2.774GlnLys: 2.774 ± 0.529
4.945GlnLeu: 4.945 ± 0.642
1.327GlnMet: 1.327 ± 0.287
1.689GlnAsn: 1.689 ± 0.319
1.93GlnPro: 1.93 ± 0.308
2.412GlnGln: 2.412 ± 0.511
2.111GlnArg: 2.111 ± 0.33
3.196GlnSer: 3.196 ± 0.604
2.533GlnThr: 2.533 ± 0.362
2.533GlnVal: 2.533 ± 0.463
0.784GlnTrp: 0.784 ± 0.233
1.206GlnTyr: 1.206 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
5.006ArgAla: 5.006 ± 0.519
0.482ArgCys: 0.482 ± 0.169
2.955ArgAsp: 2.955 ± 0.491
3.86ArgGlu: 3.86 ± 0.568
2.473ArgPhe: 2.473 ± 0.403
3.739ArgGly: 3.739 ± 0.397
1.327ArgHis: 1.327 ± 0.375
3.136ArgIle: 3.136 ± 0.48
3.196ArgLys: 3.196 ± 0.435
4.764ArgLeu: 4.764 ± 0.656
1.628ArgMet: 1.628 ± 0.331
2.895ArgAsn: 2.895 ± 0.421
1.568ArgPro: 1.568 ± 0.32
2.231ArgGln: 2.231 ± 0.398
3.438ArgArg: 3.438 ± 0.775
2.835ArgSer: 2.835 ± 0.378
2.955ArgThr: 2.955 ± 0.456
4.463ArgVal: 4.463 ± 0.496
0.965ArgTrp: 0.965 ± 0.253
3.015ArgTyr: 3.015 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
6.875SerAla: 6.875 ± 0.609
0.482SerCys: 0.482 ± 0.186
4.222SerAsp: 4.222 ± 0.467
3.92SerGlu: 3.92 ± 0.416
2.593SerPhe: 2.593 ± 0.459
5.669SerGly: 5.669 ± 0.782
1.387SerHis: 1.387 ± 0.287
3.196SerIle: 3.196 ± 0.386
2.593SerLys: 2.593 ± 0.43
5.669SerLeu: 5.669 ± 0.781
1.508SerMet: 1.508 ± 0.246
2.895SerAsn: 2.895 ± 0.418
2.654SerPro: 2.654 ± 0.44
3.015SerGln: 3.015 ± 0.458
3.498SerArg: 3.498 ± 0.433
4.222SerSer: 4.222 ± 0.741
4.463SerThr: 4.463 ± 0.542
4.342SerVal: 4.342 ± 0.532
0.603SerTrp: 0.603 ± 0.143
1.93SerTyr: 1.93 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
7.358ThrAla: 7.358 ± 0.8
0.482ThrCys: 0.482 ± 0.19
3.92ThrAsp: 3.92 ± 0.532
3.438ThrGlu: 3.438 ± 0.381
2.593ThrPhe: 2.593 ± 0.411
5.428ThrGly: 5.428 ± 0.672
0.663ThrHis: 0.663 ± 0.235
3.257ThrIle: 3.257 ± 0.514
2.292ThrLys: 2.292 ± 0.443
6.031ThrLeu: 6.031 ± 0.58
1.447ThrMet: 1.447 ± 0.279
1.568ThrAsn: 1.568 ± 0.369
2.895ThrPro: 2.895 ± 0.395
2.051ThrGln: 2.051 ± 0.479
2.835ThrArg: 2.835 ± 0.358
3.92ThrSer: 3.92 ± 0.475
4.222ThrThr: 4.222 ± 0.574
5.066ThrVal: 5.066 ± 0.663
1.025ThrTrp: 1.025 ± 0.25
1.568ThrTyr: 1.568 ± 0.289
0.0ThrXaa: 0.0 ± 0.0
Val
5.488ValAla: 5.488 ± 0.523
0.784ValCys: 0.784 ± 0.265
3.679ValAsp: 3.679 ± 0.514
3.136ValGlu: 3.136 ± 0.555
2.714ValPhe: 2.714 ± 0.373
3.739ValGly: 3.739 ± 0.501
1.086ValHis: 1.086 ± 0.253
4.523ValIle: 4.523 ± 0.604
4.282ValLys: 4.282 ± 0.671
5.609ValLeu: 5.609 ± 0.643
1.628ValMet: 1.628 ± 0.325
3.317ValAsn: 3.317 ± 0.444
2.774ValPro: 2.774 ± 0.427
1.99ValGln: 1.99 ± 0.29
3.92ValArg: 3.92 ± 0.557
4.282ValSer: 4.282 ± 0.594
5.307ValThr: 5.307 ± 0.622
4.523ValVal: 4.523 ± 0.74
0.663ValTrp: 0.663 ± 0.193
1.689ValTyr: 1.689 ± 0.279
0.0ValXaa: 0.0 ± 0.0
Trp
1.327TrpAla: 1.327 ± 0.287
0.362TrpCys: 0.362 ± 0.13
0.663TrpAsp: 0.663 ± 0.17
0.784TrpGlu: 0.784 ± 0.226
0.603TrpPhe: 0.603 ± 0.196
1.387TrpGly: 1.387 ± 0.324
0.241TrpHis: 0.241 ± 0.114
1.146TrpIle: 1.146 ± 0.29
0.663TrpLys: 0.663 ± 0.187
1.086TrpLeu: 1.086 ± 0.263
0.06TrpMet: 0.06 ± 0.057
0.543TrpAsn: 0.543 ± 0.17
0.724TrpPro: 0.724 ± 0.18
0.965TrpGln: 0.965 ± 0.215
1.206TrpArg: 1.206 ± 0.318
0.784TrpSer: 0.784 ± 0.183
0.905TrpThr: 0.905 ± 0.294
1.327TrpVal: 1.327 ± 0.31
0.241TrpTrp: 0.241 ± 0.123
0.482TrpTyr: 0.482 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.473TyrAla: 2.473 ± 0.461
0.663TyrCys: 0.663 ± 0.183
1.508TyrAsp: 1.508 ± 0.296
1.809TyrGlu: 1.809 ± 0.331
1.387TyrPhe: 1.387 ± 0.241
2.714TyrGly: 2.714 ± 0.407
0.784TyrHis: 0.784 ± 0.232
1.628TyrIle: 1.628 ± 0.313
0.905TyrLys: 0.905 ± 0.257
3.196TyrLeu: 3.196 ± 0.543
0.603TyrMet: 0.603 ± 0.184
1.327TyrAsn: 1.327 ± 0.268
2.231TyrPro: 2.231 ± 0.397
1.508TyrGln: 1.508 ± 0.281
2.835TyrArg: 2.835 ± 0.424
2.292TyrSer: 2.292 ± 0.355
1.628TyrThr: 1.628 ± 0.379
1.628TyrVal: 1.628 ± 0.339
0.603TyrTrp: 0.603 ± 0.176
0.965TyrTyr: 0.965 ± 0.2
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (16582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski