Amino acid dipepetide frequency for Escherichia phage flopper

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.218AlaAla: 8.218 ± 0.853
0.986AlaCys: 0.986 ± 0.295
4.208AlaAsp: 4.208 ± 0.522
5.654AlaGlu: 5.654 ± 0.702
3.024AlaPhe: 3.024 ± 0.365
6.64AlaGly: 6.64 ± 1.002
1.249AlaHis: 1.249 ± 0.355
5.654AlaIle: 5.654 ± 0.678
5.325AlaLys: 5.325 ± 0.683
6.312AlaLeu: 6.312 ± 0.736
2.498AlaMet: 2.498 ± 0.497
3.682AlaAsn: 3.682 ± 0.417
2.564AlaPro: 2.564 ± 0.434
2.959AlaGln: 2.959 ± 0.561
4.339AlaArg: 4.339 ± 0.72
5.654AlaSer: 5.654 ± 0.651
5.786AlaThr: 5.786 ± 0.804
5.72AlaVal: 5.72 ± 0.611
0.855AlaTrp: 0.855 ± 0.232
2.696AlaTyr: 2.696 ± 0.32
0.0AlaXaa: 0.0 ± 0.0
Cys
0.46CysAla: 0.46 ± 0.154
0.066CysCys: 0.066 ± 0.058
0.855CysAsp: 0.855 ± 0.328
0.657CysGlu: 0.657 ± 0.198
0.592CysPhe: 0.592 ± 0.185
0.723CysGly: 0.723 ± 0.176
0.329CysHis: 0.329 ± 0.139
0.723CysIle: 0.723 ± 0.209
0.789CysLys: 0.789 ± 0.253
0.855CysLeu: 0.855 ± 0.255
0.066CysMet: 0.066 ± 0.066
0.723CysAsn: 0.723 ± 0.237
0.329CysPro: 0.329 ± 0.182
0.329CysGln: 0.329 ± 0.167
0.526CysArg: 0.526 ± 0.192
0.657CysSer: 0.657 ± 0.191
0.855CysThr: 0.855 ± 0.22
0.789CysVal: 0.789 ± 0.243
0.131CysTrp: 0.131 ± 0.091
0.329CysTyr: 0.329 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
5.194AspAla: 5.194 ± 0.664
0.92AspCys: 0.92 ± 0.225
2.498AspAsp: 2.498 ± 0.415
3.813AspGlu: 3.813 ± 0.593
3.024AspPhe: 3.024 ± 0.466
5.194AspGly: 5.194 ± 0.586
1.118AspHis: 1.118 ± 0.282
3.616AspIle: 3.616 ± 0.594
3.682AspLys: 3.682 ± 0.583
4.208AspLeu: 4.208 ± 0.543
1.183AspMet: 1.183 ± 0.232
2.564AspAsn: 2.564 ± 0.417
2.63AspPro: 2.63 ± 0.391
0.657AspGln: 0.657 ± 0.161
2.367AspArg: 2.367 ± 0.448
4.536AspSer: 4.536 ± 0.511
2.959AspThr: 2.959 ± 0.348
4.208AspVal: 4.208 ± 0.569
1.052AspTrp: 1.052 ± 0.219
2.367AspTyr: 2.367 ± 0.369
0.0AspXaa: 0.0 ± 0.0
Glu
6.443GluAla: 6.443 ± 0.775
0.592GluCys: 0.592 ± 0.233
3.748GluAsp: 3.748 ± 0.504
5.523GluGlu: 5.523 ± 0.591
2.827GluPhe: 2.827 ± 0.382
3.024GluGly: 3.024 ± 0.435
1.644GluHis: 1.644 ± 0.303
4.405GluIle: 4.405 ± 0.608
3.419GluLys: 3.419 ± 0.458
6.838GluLeu: 6.838 ± 0.669
1.512GluMet: 1.512 ± 0.302
2.433GluAsn: 2.433 ± 0.317
1.709GluPro: 1.709 ± 0.313
2.696GluGln: 2.696 ± 0.387
3.616GluArg: 3.616 ± 0.483
3.287GluSer: 3.287 ± 0.495
3.748GluThr: 3.748 ± 0.567
4.734GluVal: 4.734 ± 0.596
0.723GluTrp: 0.723 ± 0.186
2.959GluTyr: 2.959 ± 0.385
0.0GluXaa: 0.0 ± 0.0
Phe
3.09PheAla: 3.09 ± 0.405
0.394PheCys: 0.394 ± 0.155
3.024PheAsp: 3.024 ± 0.348
2.367PheGlu: 2.367 ± 0.359
1.315PhePhe: 1.315 ± 0.295
2.893PheGly: 2.893 ± 0.355
0.855PheHis: 0.855 ± 0.22
2.235PheIle: 2.235 ± 0.366
3.09PheLys: 3.09 ± 0.491
2.761PheLeu: 2.761 ± 0.462
1.052PheMet: 1.052 ± 0.25
2.564PheAsn: 2.564 ± 0.45
1.118PhePro: 1.118 ± 0.249
1.052PheGln: 1.052 ± 0.277
2.038PheArg: 2.038 ± 0.295
2.827PheSer: 2.827 ± 0.416
2.367PheThr: 2.367 ± 0.416
2.761PheVal: 2.761 ± 0.352
0.526PheTrp: 0.526 ± 0.18
1.315PheTyr: 1.315 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
5.457GlyAla: 5.457 ± 0.62
0.855GlyCys: 0.855 ± 0.237
4.274GlyAsp: 4.274 ± 0.61
4.274GlyGlu: 4.274 ± 0.592
3.419GlyPhe: 3.419 ± 0.419
6.575GlyGly: 6.575 ± 0.799
1.381GlyHis: 1.381 ± 0.332
4.668GlyIle: 4.668 ± 0.482
4.076GlyLys: 4.076 ± 0.618
5.26GlyLeu: 5.26 ± 0.515
1.578GlyMet: 1.578 ± 0.44
3.879GlyAsn: 3.879 ± 0.603
1.841GlyPro: 1.841 ± 0.328
2.17GlyGln: 2.17 ± 0.341
3.419GlyArg: 3.419 ± 0.634
5.325GlySer: 5.325 ± 0.827
5.128GlyThr: 5.128 ± 0.705
5.786GlyVal: 5.786 ± 0.659
1.446GlyTrp: 1.446 ± 0.342
2.63GlyTyr: 2.63 ± 0.358
0.0GlyXaa: 0.0 ± 0.0
His
1.118HisAla: 1.118 ± 0.288
0.197HisCys: 0.197 ± 0.109
1.249HisAsp: 1.249 ± 0.269
1.512HisGlu: 1.512 ± 0.322
0.789HisPhe: 0.789 ± 0.258
1.381HisGly: 1.381 ± 0.251
0.592HisHis: 0.592 ± 0.181
1.249HisIle: 1.249 ± 0.249
0.986HisLys: 0.986 ± 0.311
1.183HisLeu: 1.183 ± 0.314
0.855HisMet: 0.855 ± 0.251
0.92HisAsn: 0.92 ± 0.296
0.657HisPro: 0.657 ± 0.197
0.789HisGln: 0.789 ± 0.241
0.592HisArg: 0.592 ± 0.188
1.249HisSer: 1.249 ± 0.353
1.249HisThr: 1.249 ± 0.484
1.249HisVal: 1.249 ± 0.259
0.263HisTrp: 0.263 ± 0.126
0.986HisTyr: 0.986 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
4.734IleAla: 4.734 ± 0.639
0.855IleCys: 0.855 ± 0.268
3.813IleAsp: 3.813 ± 0.431
4.076IleGlu: 4.076 ± 0.601
1.972IlePhe: 1.972 ± 0.425
3.813IleGly: 3.813 ± 0.55
0.855IleHis: 0.855 ± 0.231
2.761IleIle: 2.761 ± 0.502
4.734IleLys: 4.734 ± 0.638
4.011IleLeu: 4.011 ± 0.594
1.841IleMet: 1.841 ± 0.33
3.616IleAsn: 3.616 ± 0.583
2.761IlePro: 2.761 ± 0.456
3.222IleGln: 3.222 ± 0.484
3.156IleArg: 3.156 ± 0.447
3.748IleSer: 3.748 ± 0.468
4.536IleThr: 4.536 ± 0.484
3.945IleVal: 3.945 ± 0.616
0.723IleTrp: 0.723 ± 0.175
1.578IleTyr: 1.578 ± 0.322
0.0IleXaa: 0.0 ± 0.0
Lys
5.391LysAla: 5.391 ± 0.648
0.986LysCys: 0.986 ± 0.296
3.748LysAsp: 3.748 ± 0.515
3.879LysGlu: 3.879 ± 0.513
2.696LysPhe: 2.696 ± 0.444
3.485LysGly: 3.485 ± 0.367
1.709LysHis: 1.709 ± 0.406
3.156LysIle: 3.156 ± 0.426
3.024LysLys: 3.024 ± 0.475
5.523LysLeu: 5.523 ± 0.707
2.038LysMet: 2.038 ± 0.292
2.498LysAsn: 2.498 ± 0.567
2.696LysPro: 2.696 ± 0.421
2.827LysGln: 2.827 ± 0.387
2.301LysArg: 2.301 ± 0.357
4.076LysSer: 4.076 ± 0.447
3.682LysThr: 3.682 ± 0.526
4.011LysVal: 4.011 ± 0.544
0.855LysTrp: 0.855 ± 0.217
1.709LysTyr: 1.709 ± 0.315
0.0LysXaa: 0.0 ± 0.0
Leu
6.575LeuAla: 6.575 ± 0.694
0.657LeuCys: 0.657 ± 0.198
4.536LeuAsp: 4.536 ± 0.499
6.18LeuGlu: 6.18 ± 0.783
2.564LeuPhe: 2.564 ± 0.374
5.391LeuGly: 5.391 ± 0.607
1.512LeuHis: 1.512 ± 0.312
3.616LeuIle: 3.616 ± 0.448
5.062LeuLys: 5.062 ± 0.72
5.194LeuLeu: 5.194 ± 0.679
2.498LeuMet: 2.498 ± 0.346
4.931LeuAsn: 4.931 ± 0.555
3.222LeuPro: 3.222 ± 0.429
3.55LeuGln: 3.55 ± 0.478
3.485LeuArg: 3.485 ± 0.501
5.391LeuSer: 5.391 ± 0.607
5.325LeuThr: 5.325 ± 0.584
4.865LeuVal: 4.865 ± 0.631
0.526LeuTrp: 0.526 ± 0.187
2.433LeuTyr: 2.433 ± 0.38
0.0LeuXaa: 0.0 ± 0.0
Met
3.287MetAla: 3.287 ± 0.613
0.131MetCys: 0.131 ± 0.078
1.249MetAsp: 1.249 ± 0.3
1.315MetGlu: 1.315 ± 0.241
1.249MetPhe: 1.249 ± 0.28
1.118MetGly: 1.118 ± 0.245
0.526MetHis: 0.526 ± 0.161
1.249MetIle: 1.249 ± 0.233
2.235MetLys: 2.235 ± 0.445
2.038MetLeu: 2.038 ± 0.343
0.855MetMet: 0.855 ± 0.309
1.446MetAsn: 1.446 ± 0.327
1.446MetPro: 1.446 ± 0.358
1.709MetGln: 1.709 ± 0.322
1.249MetArg: 1.249 ± 0.27
1.907MetSer: 1.907 ± 0.345
1.578MetThr: 1.578 ± 0.336
1.381MetVal: 1.381 ± 0.281
0.592MetTrp: 0.592 ± 0.191
0.986MetTyr: 0.986 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
4.208AsnAla: 4.208 ± 0.563
0.723AsnCys: 0.723 ± 0.224
2.959AsnAsp: 2.959 ± 0.481
2.498AsnGlu: 2.498 ± 0.337
1.907AsnPhe: 1.907 ± 0.412
4.339AsnGly: 4.339 ± 0.557
0.723AsnHis: 0.723 ± 0.228
2.959AsnIle: 2.959 ± 0.42
2.893AsnLys: 2.893 ± 0.469
4.142AsnLeu: 4.142 ± 0.515
1.118AsnMet: 1.118 ± 0.302
2.038AsnAsn: 2.038 ± 0.369
3.156AsnPro: 3.156 ± 0.389
2.17AsnGln: 2.17 ± 0.292
2.696AsnArg: 2.696 ± 0.473
3.156AsnSer: 3.156 ± 0.472
2.564AsnThr: 2.564 ± 0.43
3.419AsnVal: 3.419 ± 0.551
0.789AsnTrp: 0.789 ± 0.2
1.907AsnTyr: 1.907 ± 0.303
0.0AsnXaa: 0.0 ± 0.0
Pro
3.222ProAla: 3.222 ± 0.437
0.263ProCys: 0.263 ± 0.116
2.17ProAsp: 2.17 ± 0.343
3.353ProGlu: 3.353 ± 0.375
1.183ProPhe: 1.183 ± 0.254
2.893ProGly: 2.893 ± 0.457
0.92ProHis: 0.92 ± 0.246
1.644ProIle: 1.644 ± 0.386
2.038ProLys: 2.038 ± 0.321
2.959ProLeu: 2.959 ± 0.453
0.789ProMet: 0.789 ± 0.232
2.498ProAsn: 2.498 ± 0.316
0.92ProPro: 0.92 ± 0.251
1.118ProGln: 1.118 ± 0.267
1.578ProArg: 1.578 ± 0.348
3.156ProSer: 3.156 ± 0.411
2.301ProThr: 2.301 ± 0.358
3.024ProVal: 3.024 ± 0.407
0.723ProTrp: 0.723 ± 0.182
1.578ProTyr: 1.578 ± 0.324
0.0ProXaa: 0.0 ± 0.0
Gln
2.959GlnAla: 2.959 ± 0.377
0.394GlnCys: 0.394 ± 0.192
1.183GlnAsp: 1.183 ± 0.231
2.761GlnGlu: 2.761 ± 0.471
1.709GlnPhe: 1.709 ± 0.326
1.841GlnGly: 1.841 ± 0.33
0.46GlnHis: 0.46 ± 0.232
3.287GlnIle: 3.287 ± 0.374
1.644GlnLys: 1.644 ± 0.293
3.353GlnLeu: 3.353 ± 0.531
1.052GlnMet: 1.052 ± 0.221
1.512GlnAsn: 1.512 ± 0.268
1.512GlnPro: 1.512 ± 0.342
2.564GlnGln: 2.564 ± 0.793
2.761GlnArg: 2.761 ± 0.419
1.907GlnSer: 1.907 ± 0.365
2.17GlnThr: 2.17 ± 0.471
2.696GlnVal: 2.696 ± 0.479
0.657GlnTrp: 0.657 ± 0.188
1.841GlnTyr: 1.841 ± 0.263
0.0GlnXaa: 0.0 ± 0.0
Arg
3.419ArgAla: 3.419 ± 0.524
0.723ArgCys: 0.723 ± 0.263
2.564ArgAsp: 2.564 ± 0.39
3.353ArgGlu: 3.353 ± 0.508
2.104ArgPhe: 2.104 ± 0.35
2.959ArgGly: 2.959 ± 0.471
0.789ArgHis: 0.789 ± 0.181
3.287ArgIle: 3.287 ± 0.428
2.893ArgLys: 2.893 ± 0.36
4.142ArgLeu: 4.142 ± 0.541
1.446ArgMet: 1.446 ± 0.319
2.564ArgAsn: 2.564 ± 0.383
1.841ArgPro: 1.841 ± 0.315
1.709ArgGln: 1.709 ± 0.342
1.775ArgArg: 1.775 ± 0.371
2.63ArgSer: 2.63 ± 0.484
2.367ArgThr: 2.367 ± 0.363
3.879ArgVal: 3.879 ± 0.545
0.789ArgTrp: 0.789 ± 0.205
1.644ArgTyr: 1.644 ± 0.295
0.0ArgXaa: 0.0 ± 0.0
Ser
5.786SerAla: 5.786 ± 0.917
0.329SerCys: 0.329 ± 0.148
4.208SerAsp: 4.208 ± 0.416
4.076SerGlu: 4.076 ± 0.5
2.433SerPhe: 2.433 ± 0.433
6.377SerGly: 6.377 ± 0.628
1.381SerHis: 1.381 ± 0.249
5.062SerIle: 5.062 ± 0.603
3.419SerLys: 3.419 ± 0.477
5.128SerLeu: 5.128 ± 0.496
2.038SerMet: 2.038 ± 0.345
2.893SerAsn: 2.893 ± 0.44
2.235SerPro: 2.235 ± 0.386
2.696SerGln: 2.696 ± 0.485
3.024SerArg: 3.024 ± 0.443
4.734SerSer: 4.734 ± 0.66
3.945SerThr: 3.945 ± 0.595
4.471SerVal: 4.471 ± 0.496
0.789SerTrp: 0.789 ± 0.217
2.235SerTyr: 2.235 ± 0.432
0.0SerXaa: 0.0 ± 0.0
Thr
5.128ThrAla: 5.128 ± 0.632
0.394ThrCys: 0.394 ± 0.143
2.959ThrAsp: 2.959 ± 0.417
3.485ThrGlu: 3.485 ± 0.398
2.696ThrPhe: 2.696 ± 0.36
5.983ThrGly: 5.983 ± 0.716
1.118ThrHis: 1.118 ± 0.364
4.602ThrIle: 4.602 ± 0.571
3.682ThrLys: 3.682 ± 0.514
4.471ThrLeu: 4.471 ± 0.63
1.315ThrMet: 1.315 ± 0.256
2.893ThrAsn: 2.893 ± 0.425
2.761ThrPro: 2.761 ± 0.413
2.104ThrGln: 2.104 ± 0.334
2.827ThrArg: 2.827 ± 0.426
3.55ThrSer: 3.55 ± 0.53
3.682ThrThr: 3.682 ± 0.569
5.654ThrVal: 5.654 ± 0.796
0.92ThrTrp: 0.92 ± 0.281
2.498ThrTyr: 2.498 ± 0.462
0.0ThrXaa: 0.0 ± 0.0
Val
6.18ValAla: 6.18 ± 0.711
0.789ValCys: 0.789 ± 0.223
5.194ValAsp: 5.194 ± 0.567
3.616ValGlu: 3.616 ± 0.661
2.63ValPhe: 2.63 ± 0.427
5.26ValGly: 5.26 ± 0.502
1.183ValHis: 1.183 ± 0.22
3.879ValIle: 3.879 ± 0.559
4.208ValLys: 4.208 ± 0.564
5.194ValLeu: 5.194 ± 0.568
1.709ValMet: 1.709 ± 0.32
3.353ValAsn: 3.353 ± 0.342
3.09ValPro: 3.09 ± 0.428
2.235ValGln: 2.235 ± 0.373
3.287ValArg: 3.287 ± 0.487
5.128ValSer: 5.128 ± 0.677
4.931ValThr: 4.931 ± 0.832
4.668ValVal: 4.668 ± 0.664
1.052ValTrp: 1.052 ± 0.256
2.104ValTyr: 2.104 ± 0.44
0.0ValXaa: 0.0 ± 0.0
Trp
0.92TrpAla: 0.92 ± 0.234
0.263TrpCys: 0.263 ± 0.12
0.526TrpAsp: 0.526 ± 0.218
1.578TrpGlu: 1.578 ± 0.336
0.394TrpPhe: 0.394 ± 0.152
0.592TrpGly: 0.592 ± 0.205
0.197TrpHis: 0.197 ± 0.098
0.526TrpIle: 0.526 ± 0.158
0.986TrpLys: 0.986 ± 0.259
0.986TrpLeu: 0.986 ± 0.21
0.46TrpMet: 0.46 ± 0.128
0.855TrpAsn: 0.855 ± 0.197
0.263TrpPro: 0.263 ± 0.115
0.657TrpGln: 0.657 ± 0.215
0.592TrpArg: 0.592 ± 0.197
1.381TrpSer: 1.381 ± 0.253
0.986TrpThr: 0.986 ± 0.229
0.657TrpVal: 0.657 ± 0.197
0.131TrpTrp: 0.131 ± 0.076
0.92TrpTyr: 0.92 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.235TyrAla: 2.235 ± 0.341
0.263TyrCys: 0.263 ± 0.103
2.827TyrAsp: 2.827 ± 0.412
1.907TyrGlu: 1.907 ± 0.307
1.118TyrPhe: 1.118 ± 0.245
2.959TyrGly: 2.959 ± 0.34
0.526TyrHis: 0.526 ± 0.232
2.17TyrIle: 2.17 ± 0.369
2.17TyrLys: 2.17 ± 0.45
2.959TyrLeu: 2.959 ± 0.611
1.578TyrMet: 1.578 ± 0.396
2.498TyrAsn: 2.498 ± 0.391
1.578TyrPro: 1.578 ± 0.282
1.052TyrGln: 1.052 ± 0.232
1.249TyrArg: 1.249 ± 0.274
2.959TyrSer: 2.959 ± 0.664
2.564TyrThr: 2.564 ± 0.346
1.775TyrVal: 1.775 ± 0.38
0.329TyrTrp: 0.329 ± 0.129
1.183TyrTyr: 1.183 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (15211 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski