Amino acid dipeptide frequency for Escherichia phage C119

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
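As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used by Proteome-pI; the function name `dipeptide_frequencies`, the toy sequences, and the choices to count overlapping pairs within each protein only and to normalise by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Pairs are counted inside each protein only, never across proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Report all 441 ordered pairs, including those that never occur.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }

toy_proteome = ["MAAK", "MKAA"]  # made-up sequences, not phage C119 data
freqs = dipeptide_frequencies(toy_proteome)
print(round(freqs["AA"], 1))
```

The result always contains 441 entries, so pairs that never occur (such as those involving Xaa in the toy data) appear with a frequency of 0.0.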

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
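The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the simple threshold at 2.5‰ is an assumption: the page does not say whether its colouring also takes the error estimate into account.

```python
# Uniform expectation: 400 standard dipeptides sharing 1000 per mille equally.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25 %

def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

# Three values copied from the table on this page.
table_values = {"AlaAla": 8.431, "CysCys": 0.139, "AlaPhe": 2.578}
for name, value in table_values.items():
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} -> {classify(value)}")
```

If Xaa is counted as a 21st amino acid, the uniform expectation becomes 1000/441, roughly 2.27‰, which can be passed as the `expected` argument.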

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.431 ± 1.036
AlaCys: 0.488 ± 0.197
AlaAsp: 3.763 ± 0.58
AlaGlu: 5.156 ± 0.663
AlaPhe: 2.578 ± 0.419
AlaGly: 7.456 ± 1.035
AlaHis: 1.185 ± 0.308
AlaIle: 6.829 ± 0.761
AlaLys: 6.271 ± 1.04
AlaLeu: 6.689 ± 0.801
AlaMet: 2.509 ± 0.443
AlaAsn: 3.763 ± 0.585
AlaPro: 1.463 ± 0.384
AlaGln: 3.832 ± 0.408
AlaArg: 4.599 ± 0.62
AlaSer: 5.365 ± 0.666
AlaThr: 4.39 ± 0.611
AlaVal: 5.505 ± 0.596
AlaTrp: 1.324 ± 0.333
AlaTyr: 2.787 ± 0.389
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.557 ± 0.222
CysCys: 0.139 ± 0.097
CysAsp: 0.697 ± 0.167
CysGlu: 0.836 ± 0.231
CysPhe: 0.627 ± 0.229
CysGly: 1.324 ± 0.327
CysHis: 0.279 ± 0.164
CysIle: 0.627 ± 0.252
CysLys: 0.976 ± 0.237
CysLeu: 1.185 ± 0.271
CysMet: 0.418 ± 0.199
CysAsn: 0.557 ± 0.18
CysPro: 0.348 ± 0.185
CysGln: 0.418 ± 0.168
CysArg: 0.906 ± 0.241
CysSer: 1.185 ± 0.349
CysThr: 0.557 ± 0.19
CysVal: 1.115 ± 0.286
CysTrp: 0.209 ± 0.12
CysTyr: 0.348 ± 0.174
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.39 ± 0.668
AspCys: 0.348 ± 0.152
AspAsp: 4.042 ± 0.497
AspGlu: 5.226 ± 0.601
AspPhe: 2.439 ± 0.382
AspGly: 6.689 ± 0.786
AspHis: 0.906 ± 0.29
AspIle: 3.693 ± 0.524
AspLys: 5.087 ± 0.519
AspLeu: 3.832 ± 0.519
AspMet: 1.324 ± 0.307
AspAsn: 2.857 ± 0.393
AspPro: 1.812 ± 0.28
AspGln: 1.324 ± 0.354
AspArg: 2.021 ± 0.445
AspSer: 4.39 ± 0.648
AspThr: 3.136 ± 0.357
AspVal: 4.042 ± 0.479
AspTrp: 0.418 ± 0.16
AspTyr: 3.136 ± 0.49
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.784 ± 0.668
GluCys: 0.766 ± 0.249
GluAsp: 2.927 ± 0.448
GluGlu: 3.832 ± 0.652
GluPhe: 3.484 ± 0.538
GluGly: 3.623 ± 0.472
GluHis: 0.836 ± 0.234
GluIle: 4.878 ± 0.426
GluLys: 3.066 ± 0.581
GluLeu: 5.365 ± 0.527
GluMet: 2.718 ± 0.633
GluAsn: 3.345 ± 0.613
GluPro: 1.045 ± 0.285
GluGln: 2.578 ± 0.375
GluArg: 3.345 ± 0.55
GluSer: 3.972 ± 0.646
GluThr: 3.623 ± 0.492
GluVal: 5.017 ± 0.572
GluTrp: 0.488 ± 0.161
GluTyr: 2.439 ± 0.418
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.996 ± 0.487
PheCys: 0.766 ± 0.237
PheAsp: 3.623 ± 0.467
PheGlu: 2.299 ± 0.422
PhePhe: 0.976 ± 0.273
PheGly: 4.042 ± 0.632
PheHis: 0.488 ± 0.211
PheIle: 1.881 ± 0.261
PheLys: 2.16 ± 0.401
PheLeu: 2.09 ± 0.408
PheMet: 1.045 ± 0.267
PheAsn: 2.787 ± 0.441
PhePro: 1.185 ± 0.269
PheGln: 1.742 ± 0.375
PheArg: 1.812 ± 0.344
PheSer: 2.648 ± 0.381
PheThr: 2.369 ± 0.386
PheVal: 2.299 ± 0.338
PheTrp: 0.488 ± 0.166
PheTyr: 1.324 ± 0.307
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.738 ± 0.808
GlyCys: 1.394 ± 0.339
GlyAsp: 4.808 ± 0.681
GlyGlu: 5.296 ± 0.714
GlyPhe: 3.484 ± 0.666
GlyGly: 6.271 ± 1.007
GlyHis: 0.976 ± 0.336
GlyIle: 5.644 ± 0.548
GlyLys: 5.156 ± 0.595
GlyLeu: 6.689 ± 0.587
GlyMet: 2.16 ± 0.373
GlyAsn: 3.554 ± 0.451
GlyPro: 2.927 ± 2.182
GlyGln: 1.742 ± 0.344
GlyArg: 2.996 ± 0.417
GlySer: 5.714 ± 0.626
GlyThr: 4.669 ± 0.533
GlyVal: 6.411 ± 0.62
GlyTrp: 1.115 ± 0.328
GlyTyr: 3.832 ± 0.419
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.697 ± 0.208
HisCys: 0.279 ± 0.138
HisAsp: 0.976 ± 0.305
HisGlu: 0.906 ± 0.304
HisPhe: 0.418 ± 0.161
HisGly: 1.254 ± 0.404
HisHis: 0.348 ± 0.143
HisIle: 0.976 ± 0.319
HisLys: 1.742 ± 0.382
HisLeu: 1.254 ± 0.309
HisMet: 0.209 ± 0.106
HisAsn: 0.627 ± 0.245
HisPro: 0.279 ± 0.135
HisGln: 0.348 ± 0.154
HisArg: 1.115 ± 0.26
HisSer: 0.627 ± 0.222
HisThr: 0.557 ± 0.226
HisVal: 1.254 ± 0.331
HisTrp: 0.07 ± 0.067
HisTyr: 0.697 ± 0.241
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.087 ± 0.563
IleCys: 0.906 ± 0.242
IleAsp: 5.784 ± 0.728
IleGlu: 5.156 ± 0.615
IlePhe: 1.742 ± 0.27
IleGly: 3.763 ± 0.589
IleHis: 0.766 ± 0.198
IleIle: 3.414 ± 0.444
IleLys: 3.832 ± 0.448
IleLeu: 3.345 ± 0.495
IleMet: 1.463 ± 0.402
IleAsn: 3.902 ± 0.554
IlePro: 2.578 ± 0.438
IleGln: 2.299 ± 0.503
IleArg: 3.763 ± 0.463
IleSer: 4.32 ± 0.505
IleThr: 4.529 ± 0.508
IleVal: 3.484 ± 0.382
IleTrp: 0.557 ± 0.164
IleTyr: 2.369 ± 0.368
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.55 ± 0.843
LysCys: 0.627 ± 0.266
LysAsp: 3.693 ± 0.416
LysGlu: 3.484 ± 0.649
LysPhe: 2.509 ± 0.447
LysGly: 3.345 ± 0.624
LysHis: 1.185 ± 0.311
LysIle: 3.832 ± 0.485
LysLys: 3.623 ± 0.541
LysLeu: 5.853 ± 0.712
LysMet: 3.136 ± 0.611
LysAsn: 2.09 ± 0.371
LysPro: 1.324 ± 0.347
LysGln: 2.787 ± 0.42
LysArg: 2.648 ± 0.529
LysSer: 4.39 ± 0.534
LysThr: 3.623 ± 0.472
LysVal: 4.878 ± 0.462
LysTrp: 0.836 ± 0.269
LysTyr: 2.23 ± 0.343
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.62 ± 0.738
LeuCys: 0.906 ± 0.25
LeuAsp: 4.251 ± 0.572
LeuGlu: 3.902 ± 0.517
LeuPhe: 2.021 ± 0.377
LeuGly: 4.46 ± 0.647
LeuHis: 1.254 ± 0.303
LeuIle: 3.972 ± 0.608
LeuLys: 4.251 ± 0.579
LeuLeu: 4.042 ± 0.585
LeuMet: 1.603 ± 0.281
LeuAsn: 3.693 ± 0.4
LeuPro: 3.414 ± 0.461
LeuGln: 2.578 ± 0.576
LeuArg: 4.111 ± 0.559
LeuSer: 6.271 ± 0.723
LeuThr: 4.669 ± 0.558
LeuVal: 5.017 ± 0.632
LeuTrp: 0.418 ± 0.168
LeuTyr: 2.439 ± 0.446
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.554 ± 0.407
MetCys: 0.209 ± 0.125
MetAsp: 0.836 ± 0.291
MetGlu: 1.254 ± 0.299
MetPhe: 0.976 ± 0.3
MetGly: 0.976 ± 0.241
MetHis: 0.627 ± 0.227
MetIle: 1.951 ± 0.386
MetLys: 2.23 ± 0.414
MetLeu: 1.812 ± 0.435
MetMet: 0.976 ± 0.323
MetAsn: 1.324 ± 0.354
MetPro: 0.557 ± 0.186
MetGln: 0.906 ± 0.255
MetArg: 1.463 ± 0.352
MetSer: 1.533 ± 0.28
MetThr: 2.299 ± 0.419
MetVal: 2.299 ± 0.354
MetTrp: 0.348 ± 0.126
MetTyr: 0.488 ± 0.181
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.832 ± 0.484
AsnCys: 0.906 ± 0.269
AsnAsp: 3.414 ± 0.415
AsnGlu: 3.205 ± 0.515
AsnPhe: 1.533 ± 0.287
AsnGly: 6.062 ± 0.882
AsnHis: 1.045 ± 0.301
AsnIle: 2.648 ± 0.367
AsnLys: 2.648 ± 0.421
AsnLeu: 4.32 ± 0.47
AsnMet: 1.115 ± 0.322
AsnAsn: 3.693 ± 0.615
AsnPro: 1.394 ± 0.253
AsnGln: 1.672 ± 0.372
AsnArg: 1.881 ± 0.405
AsnSer: 3.972 ± 0.751
AsnThr: 2.648 ± 0.475
AsnVal: 3.832 ± 0.504
AsnTrp: 0.836 ± 0.19
AsnTyr: 1.812 ± 0.441
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.972 ± 0.767
ProCys: 0.488 ± 0.224
ProAsp: 1.812 ± 0.499
ProGlu: 2.439 ± 0.417
ProPhe: 1.672 ± 0.355
ProGly: 1.951 ± 0.405
ProHis: 0.418 ± 0.173
ProIle: 1.185 ± 0.27
ProLys: 1.254 ± 0.313
ProLeu: 1.603 ± 0.346
ProMet: 0.557 ± 0.188
ProAsn: 1.324 ± 0.3
ProPro: 0.836 ± 0.284
ProGln: 2.578 ± 0.743
ProArg: 1.394 ± 0.268
ProSer: 1.394 ± 0.306
ProThr: 1.672 ± 0.377
ProVal: 3.414 ± 0.458
ProTrp: 0.348 ± 0.14
ProTyr: 0.906 ± 0.242
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.181 ± 0.794
GlnCys: 0.557 ± 0.246
GlnAsp: 1.881 ± 0.341
GlnGlu: 2.439 ± 0.362
GlnPhe: 1.324 ± 0.277
GlnGly: 3.205 ± 1.334
GlnHis: 0.418 ± 0.179
GlnIle: 2.996 ± 0.66
GlnLys: 1.812 ± 0.322
GlnLeu: 2.927 ± 0.489
GlnMet: 0.766 ± 0.178
GlnAsn: 2.16 ± 0.507
GlnPro: 1.463 ± 0.269
GlnGln: 2.648 ± 0.856
GlnArg: 1.672 ± 0.319
GlnSer: 2.927 ± 0.423
GlnThr: 1.672 ± 0.367
GlnVal: 2.16 ± 0.327
GlnTrp: 0.557 ± 0.189
GlnTyr: 1.533 ± 0.461
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.902 ± 0.505
ArgCys: 0.906 ± 0.343
ArgAsp: 2.509 ± 0.384
ArgGlu: 3.136 ± 0.414
ArgPhe: 2.23 ± 0.345
ArgGly: 3.066 ± 0.471
ArgHis: 1.045 ± 0.325
ArgIle: 3.693 ± 0.527
ArgLys: 3.693 ± 0.606
ArgLeu: 3.414 ± 0.553
ArgMet: 0.697 ± 0.229
ArgAsn: 2.439 ± 0.501
ArgPro: 2.09 ± 0.37
ArgGln: 2.439 ± 0.394
ArgArg: 3.066 ± 0.446
ArgSer: 2.439 ± 0.532
ArgThr: 1.951 ± 0.398
ArgVal: 3.275 ± 0.501
ArgTrp: 0.418 ± 0.178
ArgTyr: 2.369 ± 0.371
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.271 ± 0.859
SerCys: 0.557 ± 0.209
SerAsp: 5.505 ± 0.647
SerGlu: 4.738 ± 0.602
SerPhe: 3.484 ± 0.492
SerGly: 6.968 ± 0.772
SerHis: 0.627 ± 0.19
SerIle: 3.832 ± 0.434
SerLys: 3.623 ± 0.57
SerLeu: 5.365 ± 0.598
SerMet: 1.324 ± 0.306
SerAsn: 3.554 ± 0.468
SerPro: 2.369 ± 0.431
SerGln: 2.996 ± 0.436
SerArg: 2.509 ± 0.421
SerSer: 5.435 ± 0.921
SerThr: 3.693 ± 0.557
SerVal: 5.087 ± 0.551
SerTrp: 0.976 ± 0.282
SerTyr: 2.648 ± 0.393
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.505 ± 0.654
ThrCys: 0.836 ± 0.224
ThrAsp: 2.857 ± 0.393
ThrGlu: 2.857 ± 0.408
ThrPhe: 1.951 ± 0.391
ThrGly: 5.714 ± 0.854
ThrHis: 0.627 ± 0.212
ThrIle: 3.763 ± 0.446
ThrLys: 2.578 ± 0.45
ThrLeu: 3.345 ± 0.539
ThrMet: 1.463 ± 0.316
ThrAsn: 3.902 ± 0.451
ThrPro: 2.578 ± 0.483
ThrGln: 2.16 ± 0.4
ThrArg: 1.951 ± 0.296
ThrSer: 4.181 ± 0.579
ThrThr: 3.414 ± 0.673
ThrVal: 3.972 ± 0.483
ThrTrp: 0.906 ± 0.218
ThrTyr: 2.16 ± 0.373
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.017 ± 0.665
ValCys: 1.115 ± 0.318
ValAsp: 4.251 ± 0.518
ValGlu: 3.484 ± 0.604
ValPhe: 2.996 ± 0.503
ValGly: 5.435 ± 0.638
ValHis: 0.836 ± 0.217
ValIle: 4.251 ± 0.505
ValLys: 5.784 ± 0.678
ValLeu: 4.042 ± 0.494
ValMet: 2.021 ± 0.332
ValAsn: 3.832 ± 0.598
ValPro: 2.16 ± 0.454
ValGln: 2.509 ± 0.423
ValArg: 4.042 ± 0.619
ValSer: 7.108 ± 0.624
ValThr: 3.832 ± 0.526
ValVal: 5.644 ± 0.682
ValTrp: 1.185 ± 0.248
ValTyr: 2.787 ± 0.492
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.418 ± 0.153
TrpCys: 0.488 ± 0.175
TrpAsp: 0.697 ± 0.202
TrpGlu: 0.697 ± 0.218
TrpPhe: 0.488 ± 0.211
TrpGly: 0.836 ± 0.27
TrpHis: 0.209 ± 0.098
TrpIle: 1.045 ± 0.251
TrpLys: 0.976 ± 0.266
TrpLeu: 1.045 ± 0.227
TrpMet: 0.139 ± 0.089
TrpAsn: 0.766 ± 0.258
TrpPro: 0.348 ± 0.154
TrpGln: 0.139 ± 0.097
TrpArg: 0.976 ± 0.243
TrpSer: 1.115 ± 0.395
TrpThr: 0.766 ± 0.272
TrpVal: 0.766 ± 0.216
TrpTrp: 0.279 ± 0.135
TrpTyr: 0.209 ± 0.108
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.881 ± 0.377
TyrCys: 0.627 ± 0.223
TyrAsp: 2.787 ± 0.498
TyrGlu: 2.787 ± 0.478
TyrPhe: 2.09 ± 0.425
TyrGly: 3.136 ± 0.471
TyrHis: 0.557 ± 0.195
TyrIle: 1.951 ± 0.321
TyrLys: 1.951 ± 0.431
TyrLeu: 1.672 ± 0.248
TyrMet: 0.906 ± 0.218
TyrAsn: 2.299 ± 0.338
TyrPro: 1.045 ± 0.253
TyrGln: 1.603 ± 0.282
TyrArg: 2.509 ± 0.406
TyrSer: 2.578 ± 0.481
TyrThr: 2.648 ± 0.428
TyrVal: 2.857 ± 0.406
TyrTrp: 0.627 ± 0.232
TyrTyr: 1.324 ± 0.35
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 75 proteins (14352 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
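A protein-level bootstrap of this kind might be sketched as follows. The page only states that bootstrapping was done 100 times at the protein level; taking the standard deviation of the replicates as the ± value, the function names, the fixed seed, and the toy sequences are assumptions made for this illustration.

```python
import random
import statistics

def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(replicates)

toy_proteome = ["MAAK", "MKAAG", "MGGS", "MAAAA"]  # made-up sequences
estimate = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a value depends on which proteins happen to be in the proteome.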

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski