Amino acid dipeptide frequency for Corynebacterium phage phi673

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
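A minimal sketch of how such a frequency could be computed is given here. It assumes overlapping pairs of adjacent residues counted within each protein and normalisation by the total number of pairs; the exact counting and normalisation used by the database are not described on this page. The function name dipeptide_permille and the one-letter input alphabet are illustrative assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    for pair, value in sorted(dipeptide_permille(["MAAG", "AAXG"]).items()):
        print(pair, round(value, 3))
```

Counting pairs within each protein separately avoids creating artificial dipeptides at the junction between two unrelated sequences.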

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
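The expectation above can be illustrated with a short sketch. It assumes that "expected" means the uniform value 1/400 (0.25%, i.e. 2.5 per mille) and that a simple comparison against this value decides the colour; the precise rule used for colouring the table is not stated here, so the classify function is only an illustration. The sample values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "expected"


if __name__ == "__main__":
    # Per-mille values copied from the table for this proteome.
    sample = {"AlaAla": 13.233, "AlaCys": 0.286, "GlyPro": 1.788, "LeuAla": 8.083}
    for pair, value in sample.items():
        print(pair, value, classify(value))
```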

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 13.233‰ = 0.013233).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.233 ± 2.718
AlaCys: 0.286 ± 0.17
AlaAsp: 5.365 ± 0.825
AlaGlu: 5.579 ± 0.748
AlaPhe: 2.146 ± 0.5
AlaGly: 10.372 ± 0.926
AlaHis: 2.217 ± 0.482
AlaIle: 5.079 ± 0.602
AlaLys: 4.077 ± 0.708
AlaLeu: 8.655 ± 0.767
AlaMet: 2.575 ± 0.446
AlaAsn: 2.933 ± 0.463
AlaPro: 4.936 ± 1.086
AlaGln: 4.578 ± 0.642
AlaArg: 5.293 ± 0.577
AlaSer: 8.226 ± 1.976
AlaThr: 8.655 ± 1.667
AlaVal: 6.867 ± 0.783
AlaTrp: 2.003 ± 0.351
AlaTyr: 2.146 ± 0.333
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.501 ± 0.189
CysCys: 0.0 ± 0.0
CysAsp: 0.572 ± 0.194
CysGlu: 0.358 ± 0.175
CysPhe: 0.143 ± 0.102
CysGly: 0.429 ± 0.214
CysHis: 0.143 ± 0.106
CysIle: 0.358 ± 0.14
CysLys: 0.215 ± 0.141
CysLeu: 0.286 ± 0.167
CysMet: 0.072 ± 0.062
CysAsn: 0.072 ± 0.063
CysPro: 0.072 ± 0.067
CysGln: 0.215 ± 0.114
CysArg: 0.429 ± 0.169
CysSer: 0.572 ± 0.203
CysThr: 0.572 ± 0.22
CysVal: 0.644 ± 0.246
CysTrp: 0.215 ± 0.122
CysTyr: 0.143 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.795 ± 0.631
AspCys: 0.358 ± 0.129
AspAsp: 4.077 ± 0.555
AspGlu: 3.29 ± 0.506
AspPhe: 2.217 ± 0.379
AspGly: 5.794 ± 0.609
AspHis: 1.431 ± 0.369
AspIle: 3.147 ± 0.514
AspLys: 3.934 ± 0.52
AspLeu: 6.867 ± 0.766
AspMet: 2.933 ± 0.582
AspAsn: 2.933 ± 0.479
AspPro: 3.577 ± 0.507
AspGln: 1.86 ± 0.354
AspArg: 2.432 ± 0.477
AspSer: 3.72 ± 0.656
AspThr: 3.934 ± 0.581
AspVal: 4.22 ± 0.551
AspTrp: 1.216 ± 0.284
AspTyr: 1.502 ± 0.291
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.366 ± 0.426
GluCys: 0.501 ± 0.199
GluAsp: 4.292 ± 0.699
GluGlu: 3.004 ± 0.588
GluPhe: 1.717 ± 0.374
GluGly: 3.577 ± 0.652
GluHis: 2.217 ± 0.348
GluIle: 3.219 ± 0.419
GluLys: 1.717 ± 0.368
GluLeu: 4.864 ± 0.584
GluMet: 1.216 ± 0.321
GluAsn: 1.931 ± 0.4
GluPro: 1.431 ± 0.283
GluGln: 3.147 ± 0.543
GluArg: 4.649 ± 0.617
GluSer: 3.72 ± 0.543
GluThr: 3.219 ± 0.42
GluVal: 3.433 ± 0.383
GluTrp: 1.216 ± 0.258
GluTyr: 1.717 ± 0.328
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.575 ± 0.385
PheCys: 0.072 ± 0.073
PheAsp: 2.217 ± 0.434
PheGlu: 1.431 ± 0.313
PhePhe: 0.644 ± 0.167
PheGly: 1.717 ± 0.345
PheHis: 0.429 ± 0.166
PheIle: 1.144 ± 0.34
PheLys: 1.073 ± 0.274
PheLeu: 2.217 ± 0.413
PheMet: 0.787 ± 0.205
PheAsn: 1.216 ± 0.294
PhePro: 1.288 ± 0.271
PheGln: 1.288 ± 0.292
PheArg: 1.216 ± 0.33
PheSer: 1.502 ± 0.26
PheThr: 2.146 ± 0.523
PheVal: 1.645 ± 0.348
PheTrp: 0.429 ± 0.239
PheTyr: 0.429 ± 0.152
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.868 ± 0.654
GlyCys: 0.429 ± 0.226
GlyAsp: 5.508 ± 0.631
GlyGlu: 4.077 ± 0.477
GlyPhe: 2.933 ± 0.494
GlyGly: 8.441 ± 1.408
GlyHis: 2.217 ± 0.396
GlyIle: 4.292 ± 0.657
GlyLys: 4.292 ± 0.51
GlyLeu: 6.438 ± 0.932
GlyMet: 2.504 ± 0.449
GlyAsn: 2.933 ± 0.351
GlyPro: 1.788 ± 0.265
GlyGln: 2.79 ± 0.431
GlyArg: 3.648 ± 0.412
GlySer: 5.508 ± 0.736
GlyThr: 6.295 ± 0.693
GlyVal: 5.222 ± 0.572
GlyTrp: 2.361 ± 0.504
GlyTyr: 2.432 ± 0.443
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.217 ± 0.421
HisCys: 0.215 ± 0.131
HisAsp: 1.359 ± 0.408
HisGlu: 1.001 ± 0.319
HisPhe: 0.501 ± 0.171
HisGly: 1.717 ± 0.443
HisHis: 0.572 ± 0.215
HisIle: 1.717 ± 0.462
HisLys: 0.501 ± 0.174
HisLeu: 1.86 ± 0.477
HisMet: 0.644 ± 0.264
HisAsn: 0.715 ± 0.247
HisPro: 1.931 ± 0.301
HisGln: 0.787 ± 0.23
HisArg: 1.431 ± 0.294
HisSer: 0.787 ± 0.183
HisThr: 2.361 ± 0.59
HisVal: 1.86 ± 0.439
HisTrp: 0.715 ± 0.237
HisTyr: 0.429 ± 0.181
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.721 ± 0.514
IleCys: 0.358 ± 0.165
IleAsp: 3.648 ± 0.431
IleGlu: 3.219 ± 0.575
IlePhe: 1.144 ± 0.281
IleGly: 3.505 ± 0.745
IleHis: 1.431 ± 0.302
IleIle: 2.289 ± 0.416
IleLys: 2.146 ± 0.417
IleLeu: 2.575 ± 0.396
IleMet: 1.001 ± 0.284
IleAsn: 2.074 ± 0.383
IlePro: 3.219 ± 0.701
IleGln: 2.432 ± 0.456
IleArg: 2.289 ± 0.342
IleSer: 3.004 ± 0.543
IleThr: 4.793 ± 0.633
IleVal: 2.504 ± 0.315
IleTrp: 0.715 ± 0.154
IleTyr: 1.073 ± 0.313
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.149 ± 0.595
LysCys: 0.286 ± 0.142
LysAsp: 2.718 ± 0.334
LysGlu: 1.717 ± 0.416
LysPhe: 1.073 ± 0.263
LysGly: 2.361 ± 0.375
LysHis: 0.93 ± 0.267
LysIle: 1.645 ± 0.358
LysLys: 1.359 ± 0.255
LysLeu: 3.29 ± 0.523
LysMet: 1.574 ± 0.41
LysAsn: 1.574 ± 0.41
LysPro: 1.788 ± 0.336
LysGln: 1.144 ± 0.284
LysArg: 1.788 ± 0.339
LysSer: 3.004 ± 0.553
LysThr: 3.433 ± 0.521
LysVal: 2.647 ± 0.517
LysTrp: 1.073 ± 0.306
LysTyr: 1.788 ± 0.379
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.083 ± 0.674
LeuCys: 0.644 ± 0.264
LeuAsp: 6.08 ± 0.701
LeuGlu: 4.864 ± 0.499
LeuPhe: 2.217 ± 0.379
LeuGly: 7.296 ± 0.873
LeuHis: 2.146 ± 0.402
LeuIle: 3.577 ± 0.583
LeuLys: 2.79 ± 0.351
LeuLeu: 5.15 ± 0.613
LeuMet: 2.146 ± 0.409
LeuAsn: 2.074 ± 0.438
LeuPro: 4.649 ± 0.501
LeuGln: 3.076 ± 0.493
LeuArg: 4.649 ± 0.651
LeuSer: 5.866 ± 0.637
LeuThr: 5.293 ± 0.476
LeuVal: 5.579 ± 0.706
LeuTrp: 1.86 ± 0.37
LeuTyr: 2.003 ± 0.541
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.004 ± 0.436
MetCys: 0.501 ± 0.196
MetAsp: 2.074 ± 0.452
MetGlu: 1.645 ± 0.314
MetPhe: 0.501 ± 0.182
MetGly: 1.645 ± 0.374
MetHis: 0.429 ± 0.138
MetIle: 0.93 ± 0.304
MetLys: 1.216 ± 0.342
MetLeu: 1.86 ± 0.314
MetMet: 0.787 ± 0.276
MetAsn: 0.787 ± 0.243
MetPro: 1.001 ± 0.274
MetGln: 1.144 ± 0.313
MetArg: 1.288 ± 0.236
MetSer: 2.504 ± 0.427
MetThr: 2.861 ± 0.554
MetVal: 2.289 ± 0.324
MetTrp: 0.501 ± 0.199
MetTyr: 0.644 ± 0.203
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.577 ± 0.466
AsnCys: 0.215 ± 0.108
AsnAsp: 1.574 ± 0.319
AsnGlu: 2.146 ± 0.363
AsnPhe: 0.715 ± 0.179
AsnGly: 3.29 ± 0.532
AsnHis: 0.715 ± 0.222
AsnIle: 1.788 ± 0.316
AsnLys: 1.645 ± 0.313
AsnLeu: 3.362 ± 0.515
AsnMet: 0.501 ± 0.173
AsnAsn: 1.502 ± 0.582
AsnPro: 3.076 ± 0.52
AsnGln: 1.931 ± 0.417
AsnArg: 2.074 ± 0.293
AsnSer: 2.003 ± 0.302
AsnThr: 2.146 ± 0.33
AsnVal: 2.003 ± 0.441
AsnTrp: 0.715 ± 0.321
AsnTyr: 1.001 ± 0.252
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.866 ± 1.024
ProCys: 0.143 ± 0.082
ProAsp: 3.362 ± 0.603
ProGlu: 4.864 ± 0.528
ProPhe: 1.144 ± 0.279
ProGly: 4.864 ± 0.535
ProHis: 1.502 ± 0.377
ProIle: 2.361 ± 0.403
ProLys: 2.504 ± 0.448
ProLeu: 3.004 ± 0.408
ProMet: 0.787 ± 0.225
ProAsn: 1.144 ± 0.308
ProPro: 2.361 ± 0.483
ProGln: 1.86 ± 0.616
ProArg: 1.788 ± 0.5
ProSer: 3.934 ± 0.606
ProThr: 3.004 ± 0.395
ProVal: 3.72 ± 0.531
ProTrp: 1.073 ± 0.289
ProTyr: 1.574 ± 0.285
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.365 ± 0.8
GlnCys: 0.358 ± 0.152
GlnAsp: 1.86 ± 0.302
GlnGlu: 1.788 ± 0.279
GlnPhe: 0.93 ± 0.215
GlnGly: 3.004 ± 0.514
GlnHis: 0.644 ± 0.242
GlnIle: 1.86 ± 0.424
GlnLys: 0.858 ± 0.288
GlnLeu: 3.362 ± 0.547
GlnMet: 1.073 ± 0.285
GlnAsn: 1.288 ± 0.44
GlnPro: 2.074 ± 0.545
GlnGln: 1.073 ± 0.35
GlnArg: 2.504 ± 0.368
GlnSer: 2.432 ± 0.392
GlnThr: 2.074 ± 0.435
GlnVal: 3.72 ± 0.565
GlnTrp: 1.359 ± 0.43
GlnTyr: 0.93 ± 0.25
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.794 ± 0.512
ArgCys: 0.072 ± 0.073
ArgAsp: 3.934 ± 0.562
ArgGlu: 3.863 ± 0.636
ArgPhe: 1.216 ± 0.24
ArgGly: 3.863 ± 0.512
ArgHis: 0.93 ± 0.292
ArgIle: 2.003 ± 0.338
ArgLys: 2.432 ± 0.491
ArgLeu: 3.648 ± 0.551
ArgMet: 2.003 ± 0.381
ArgAsn: 1.288 ± 0.339
ArgPro: 3.29 ± 0.537
ArgGln: 1.359 ± 0.359
ArgArg: 3.004 ± 0.51
ArgSer: 3.505 ± 0.651
ArgThr: 3.577 ± 0.496
ArgVal: 3.577 ± 0.396
ArgTrp: 0.286 ± 0.126
ArgTyr: 1.788 ± 0.371
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.155 ± 1.993
SerCys: 0.215 ± 0.123
SerAsp: 4.435 ± 0.442
SerGlu: 3.219 ± 0.44
SerPhe: 1.645 ± 0.314
SerGly: 5.508 ± 0.669
SerHis: 1.144 ± 0.253
SerIle: 2.861 ± 0.362
SerLys: 3.076 ± 0.423
SerLeu: 4.864 ± 0.823
SerMet: 1.86 ± 0.375
SerAsn: 2.432 ± 0.443
SerPro: 3.505 ± 0.705
SerGln: 2.074 ± 0.477
SerArg: 2.933 ± 0.544
SerSer: 4.506 ± 0.758
SerThr: 5.651 ± 1.144
SerVal: 5.866 ± 0.473
SerTrp: 1.216 ± 0.284
SerTyr: 1.502 ± 0.337
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.01 ± 1.259
ThrCys: 0.715 ± 0.259
ThrAsp: 4.292 ± 0.482
ThrGlu: 3.648 ± 0.498
ThrPhe: 2.003 ± 0.443
ThrGly: 5.365 ± 0.734
ThrHis: 1.645 ± 0.372
ThrIle: 3.505 ± 0.587
ThrLys: 2.289 ± 0.366
ThrLeu: 6.366 ± 0.621
ThrMet: 2.361 ± 0.535
ThrAsn: 3.004 ± 0.645
ThrPro: 5.508 ± 0.516
ThrGln: 2.718 ± 0.488
ThrArg: 3.791 ± 0.46
ThrSer: 4.793 ± 0.98
ThrThr: 6.295 ± 0.886
ThrVal: 6.581 ± 0.72
ThrTrp: 1.431 ± 0.38
ThrTyr: 2.79 ± 0.515
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.794 ± 0.739
ValCys: 0.358 ± 0.167
ValAsp: 5.722 ± 0.522
ValGlu: 5.079 ± 0.554
ValPhe: 1.645 ± 0.343
ValGly: 5.937 ± 0.508
ValHis: 1.717 ± 0.406
ValIle: 3.219 ± 0.474
ValLys: 2.146 ± 0.359
ValLeu: 6.438 ± 0.655
ValMet: 1.574 ± 0.294
ValAsn: 3.934 ± 0.585
ValPro: 3.72 ± 0.584
ValGln: 3.004 ± 0.455
ValArg: 3.004 ± 0.434
ValSer: 4.149 ± 0.433
ValThr: 6.295 ± 0.621
ValVal: 6.009 ± 0.692
ValTrp: 0.93 ± 0.228
ValTyr: 1.86 ± 0.447
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.502 ± 0.39
TrpCys: 0.0 ± 0.0
TrpAsp: 1.574 ± 0.572
TrpGlu: 0.858 ± 0.152
TrpPhe: 0.286 ± 0.155
TrpGly: 1.001 ± 0.269
TrpHis: 0.358 ± 0.163
TrpIle: 1.502 ± 0.376
TrpLys: 0.572 ± 0.203
TrpLeu: 1.788 ± 0.33
TrpMet: 0.429 ± 0.146
TrpAsn: 1.216 ± 0.528
TrpPro: 0.286 ± 0.142
TrpGln: 0.93 ± 0.211
TrpArg: 1.216 ± 0.276
TrpSer: 1.86 ± 0.457
TrpThr: 1.288 ± 0.29
TrpVal: 1.931 ± 0.435
TrpTrp: 0.358 ± 0.17
TrpTyr: 0.93 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.647 ± 0.416
TyrCys: 0.286 ± 0.148
TyrAsp: 1.574 ± 0.375
TyrGlu: 1.502 ± 0.339
TyrPhe: 0.644 ± 0.199
TyrGly: 2.289 ± 0.463
TyrHis: 0.501 ± 0.191
TyrIle: 1.645 ± 0.389
TyrLys: 0.572 ± 0.242
TyrLeu: 3.147 ± 0.538
TyrMet: 0.715 ± 0.234
TyrAsn: 0.93 ± 0.218
TyrPro: 1.359 ± 0.34
TyrGln: 1.144 ± 0.23
TyrArg: 2.003 ± 0.516
TyrSer: 1.073 ± 0.22
TyrThr: 2.217 ± 0.366
TyrVal: 2.217 ± 0.411
TyrTrp: 0.215 ± 0.116
TyrTyr: 0.787 ± 0.197
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (13981 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
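A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting estimates; the page does not state which spread measure is used, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    toy = ["MAAGAA", "MKLAAT", "MGGSAV", "MAAAAK"]
    print(round(pair_permille(toy, "AA"), 3), "±", round(bootstrap_error(toy, "AA"), 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.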

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski