Amino acid dipepetide frequency for Erwinia phage EtG

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.426AlaAla: 10.426 ± 1.704
1.239AlaCys: 1.239 ± 0.422
6.504AlaAsp: 6.504 ± 1.006
4.955AlaGlu: 4.955 ± 0.758
2.994AlaPhe: 2.994 ± 0.537
8.981AlaGly: 8.981 ± 1.314
1.239AlaHis: 1.239 ± 0.406
5.884AlaIle: 5.884 ± 0.86
5.471AlaLys: 5.471 ± 0.897
10.633AlaLeu: 10.633 ± 1.022
3.303AlaMet: 3.303 ± 0.62
2.374AlaAsn: 2.374 ± 0.565
5.265AlaPro: 5.265 ± 0.793
4.026AlaGln: 4.026 ± 0.842
5.058AlaArg: 5.058 ± 0.705
8.258AlaSer: 8.258 ± 1.165
5.265AlaThr: 5.265 ± 0.767
7.02AlaVal: 7.02 ± 0.796
1.652AlaTrp: 1.652 ± 0.403
2.994AlaTyr: 2.994 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.929CysAla: 0.929 ± 0.259
0.0CysCys: 0.0 ± 0.0
0.619CysAsp: 0.619 ± 0.235
0.619CysGlu: 0.619 ± 0.273
0.31CysPhe: 0.31 ± 0.17
0.206CysGly: 0.206 ± 0.133
0.31CysHis: 0.31 ± 0.174
0.826CysIle: 0.826 ± 0.276
0.413CysLys: 0.413 ± 0.2
0.723CysLeu: 0.723 ± 0.24
0.619CysMet: 0.619 ± 0.253
0.413CysAsn: 0.413 ± 0.188
0.413CysPro: 0.413 ± 0.221
0.619CysGln: 0.619 ± 0.259
0.929CysArg: 0.929 ± 0.341
0.413CysSer: 0.413 ± 0.276
1.136CysThr: 1.136 ± 0.284
0.929CysVal: 0.929 ± 0.299
0.31CysTrp: 0.31 ± 0.192
0.516CysTyr: 0.516 ± 0.233
0.0CysXaa: 0.0 ± 0.0
Asp
6.916AspAla: 6.916 ± 0.787
0.619AspCys: 0.619 ± 0.259
4.749AspAsp: 4.749 ± 0.686
4.439AspGlu: 4.439 ± 0.707
2.787AspPhe: 2.787 ± 0.636
5.471AspGly: 5.471 ± 0.484
0.723AspHis: 0.723 ± 0.311
3.613AspIle: 3.613 ± 0.559
4.129AspLys: 4.129 ± 0.616
4.026AspLeu: 4.026 ± 0.605
1.342AspMet: 1.342 ± 0.347
1.858AspAsn: 1.858 ± 0.48
2.478AspPro: 2.478 ± 0.574
1.755AspGln: 1.755 ± 0.421
3.097AspArg: 3.097 ± 0.689
2.581AspSer: 2.581 ± 0.476
4.439AspThr: 4.439 ± 0.644
3.716AspVal: 3.716 ± 0.68
0.516AspTrp: 0.516 ± 0.251
1.548AspTyr: 1.548 ± 0.465
0.0AspXaa: 0.0 ± 0.0
Glu
5.368GluAla: 5.368 ± 0.8
1.032GluCys: 1.032 ± 0.336
2.374GluAsp: 2.374 ± 0.641
3.303GluGlu: 3.303 ± 0.577
2.065GluPhe: 2.065 ± 0.483
2.787GluGly: 2.787 ± 0.623
0.929GluHis: 0.929 ± 0.318
3.82GluIle: 3.82 ± 0.752
2.787GluLys: 2.787 ± 0.524
8.465GluLeu: 8.465 ± 1.077
2.374GluMet: 2.374 ± 0.519
3.097GluAsn: 3.097 ± 0.759
3.2GluPro: 3.2 ± 0.517
2.581GluGln: 2.581 ± 0.499
3.407GluArg: 3.407 ± 0.546
4.336GluSer: 4.336 ± 0.637
3.716GluThr: 3.716 ± 0.577
2.89GluVal: 2.89 ± 0.527
0.619GluTrp: 0.619 ± 0.218
2.478GluTyr: 2.478 ± 0.534
0.0GluXaa: 0.0 ± 0.0
Phe
3.2PheAla: 3.2 ± 0.682
0.619PheCys: 0.619 ± 0.207
1.858PheAsp: 1.858 ± 0.388
2.478PheGlu: 2.478 ± 0.517
1.445PhePhe: 1.445 ± 0.381
1.548PheGly: 1.548 ± 0.397
0.723PheHis: 0.723 ± 0.246
1.136PheIle: 1.136 ± 0.442
1.858PheLys: 1.858 ± 0.479
2.271PheLeu: 2.271 ± 0.432
0.826PheMet: 0.826 ± 0.292
2.271PheAsn: 2.271 ± 0.431
1.342PhePro: 1.342 ± 0.316
1.136PheGln: 1.136 ± 0.344
2.374PheArg: 2.374 ± 0.494
2.994PheSer: 2.994 ± 0.61
2.168PheThr: 2.168 ± 0.564
1.858PheVal: 1.858 ± 0.501
1.032PheTrp: 1.032 ± 0.337
0.723PheTyr: 0.723 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
6.091GlyAla: 6.091 ± 0.703
1.548GlyCys: 1.548 ± 0.387
4.232GlyAsp: 4.232 ± 0.539
5.162GlyGlu: 5.162 ± 0.794
2.684GlyPhe: 2.684 ± 0.475
4.852GlyGly: 4.852 ± 0.837
0.723GlyHis: 0.723 ± 0.248
4.026GlyIle: 4.026 ± 0.623
4.232GlyLys: 4.232 ± 0.589
5.162GlyLeu: 5.162 ± 0.698
2.065GlyMet: 2.065 ± 0.48
2.684GlyAsn: 2.684 ± 0.382
0.619GlyPro: 0.619 ± 0.278
2.168GlyGln: 2.168 ± 0.489
3.613GlyArg: 3.613 ± 0.614
3.407GlySer: 3.407 ± 0.515
4.336GlyThr: 4.336 ± 0.738
5.987GlyVal: 5.987 ± 0.927
1.445GlyTrp: 1.445 ± 0.336
1.445GlyTyr: 1.445 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
1.445HisAla: 1.445 ± 0.485
0.413HisCys: 0.413 ± 0.197
1.032HisAsp: 1.032 ± 0.308
0.723HisGlu: 0.723 ± 0.237
0.619HisPhe: 0.619 ± 0.294
1.239HisGly: 1.239 ± 0.326
0.723HisHis: 0.723 ± 0.254
0.929HisIle: 0.929 ± 0.269
0.826HisLys: 0.826 ± 0.28
1.445HisLeu: 1.445 ± 0.411
0.413HisMet: 0.413 ± 0.186
0.826HisAsn: 0.826 ± 0.302
0.826HisPro: 0.826 ± 0.253
1.239HisGln: 1.239 ± 0.313
1.445HisArg: 1.445 ± 0.356
0.723HisSer: 0.723 ± 0.285
1.239HisThr: 1.239 ± 0.491
0.723HisVal: 0.723 ± 0.285
0.413HisTrp: 0.413 ± 0.256
0.413HisTyr: 0.413 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
6.194IleAla: 6.194 ± 0.713
0.723IleCys: 0.723 ± 0.343
4.955IleAsp: 4.955 ± 0.675
3.097IleGlu: 3.097 ± 0.582
1.755IlePhe: 1.755 ± 0.397
3.613IleGly: 3.613 ± 0.617
0.826IleHis: 0.826 ± 0.337
2.581IleIle: 2.581 ± 0.522
2.684IleLys: 2.684 ± 0.626
3.51IleLeu: 3.51 ± 0.453
0.723IleMet: 0.723 ± 0.29
1.961IleAsn: 1.961 ± 0.436
1.445IlePro: 1.445 ± 0.367
1.652IleGln: 1.652 ± 0.427
4.955IleArg: 4.955 ± 0.998
4.439IleSer: 4.439 ± 0.704
4.645IleThr: 4.645 ± 0.652
3.716IleVal: 3.716 ± 0.738
0.206IleTrp: 0.206 ± 0.138
0.826IleTyr: 0.826 ± 0.363
0.0IleXaa: 0.0 ± 0.0
Lys
5.368LysAla: 5.368 ± 0.918
0.413LysCys: 0.413 ± 0.178
2.374LysAsp: 2.374 ± 0.475
3.097LysGlu: 3.097 ± 0.779
0.929LysPhe: 0.929 ± 0.283
3.82LysGly: 3.82 ± 0.805
1.445LysHis: 1.445 ± 0.403
1.961LysIle: 1.961 ± 0.433
3.923LysLys: 3.923 ± 0.614
4.542LysLeu: 4.542 ± 0.827
0.826LysMet: 0.826 ± 0.25
2.168LysAsn: 2.168 ± 0.45
2.994LysPro: 2.994 ± 0.632
1.652LysGln: 1.652 ± 0.34
4.336LysArg: 4.336 ± 0.615
2.787LysSer: 2.787 ± 0.639
3.82LysThr: 3.82 ± 0.771
4.232LysVal: 4.232 ± 0.722
1.136LysTrp: 1.136 ± 0.347
2.787LysTyr: 2.787 ± 0.58
0.0LysXaa: 0.0 ± 0.0
Leu
10.839LeuAla: 10.839 ± 0.882
0.826LeuCys: 0.826 ± 0.347
4.852LeuAsp: 4.852 ± 0.63
5.265LeuGlu: 5.265 ± 0.632
2.478LeuPhe: 2.478 ± 0.661
5.574LeuGly: 5.574 ± 1.153
1.652LeuHis: 1.652 ± 0.292
5.678LeuIle: 5.678 ± 0.718
4.542LeuLys: 4.542 ± 0.933
6.194LeuLeu: 6.194 ± 0.902
2.581LeuMet: 2.581 ± 0.572
5.574LeuAsn: 5.574 ± 0.593
4.439LeuPro: 4.439 ± 0.687
4.026LeuGln: 4.026 ± 0.485
5.884LeuArg: 5.884 ± 0.65
6.091LeuSer: 6.091 ± 0.758
8.568LeuThr: 8.568 ± 1.007
3.716LeuVal: 3.716 ± 0.584
1.755LeuTrp: 1.755 ± 0.344
2.065LeuTyr: 2.065 ± 0.385
0.0LeuXaa: 0.0 ± 0.0
Met
3.303MetAla: 3.303 ± 0.601
0.103MetCys: 0.103 ± 0.104
0.929MetAsp: 0.929 ± 0.307
1.239MetGlu: 1.239 ± 0.307
1.342MetPhe: 1.342 ± 0.437
0.723MetGly: 0.723 ± 0.317
0.516MetHis: 0.516 ± 0.192
0.826MetIle: 0.826 ± 0.32
1.239MetLys: 1.239 ± 0.326
2.581MetLeu: 2.581 ± 0.466
0.619MetMet: 0.619 ± 0.251
1.652MetAsn: 1.652 ± 0.35
0.723MetPro: 0.723 ± 0.326
1.032MetGln: 1.032 ± 0.303
2.065MetArg: 2.065 ± 0.538
1.961MetSer: 1.961 ± 0.338
2.065MetThr: 2.065 ± 0.577
0.929MetVal: 0.929 ± 0.34
0.103MetTrp: 0.103 ± 0.093
0.516MetTyr: 0.516 ± 0.259
0.0MetXaa: 0.0 ± 0.0
Asn
3.51AsnAla: 3.51 ± 0.538
0.619AsnCys: 0.619 ± 0.258
1.445AsnAsp: 1.445 ± 0.433
2.787AsnGlu: 2.787 ± 0.501
1.445AsnPhe: 1.445 ± 0.28
3.613AsnGly: 3.613 ± 0.702
0.413AsnHis: 0.413 ± 0.229
2.168AsnIle: 2.168 ± 0.673
2.168AsnLys: 2.168 ± 0.516
3.303AsnLeu: 3.303 ± 0.53
1.239AsnMet: 1.239 ± 0.347
1.652AsnAsn: 1.652 ± 0.474
2.581AsnPro: 2.581 ± 0.609
1.239AsnGln: 1.239 ± 0.311
2.581AsnArg: 2.581 ± 0.591
2.065AsnSer: 2.065 ± 0.362
2.684AsnThr: 2.684 ± 0.604
2.478AsnVal: 2.478 ± 0.469
0.31AsnTrp: 0.31 ± 0.168
0.619AsnTyr: 0.619 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
4.645ProAla: 4.645 ± 0.645
0.206ProCys: 0.206 ± 0.16
3.716ProAsp: 3.716 ± 0.674
4.232ProGlu: 4.232 ± 0.74
0.929ProPhe: 0.929 ± 0.455
2.065ProGly: 2.065 ± 0.469
1.342ProHis: 1.342 ± 0.439
2.168ProIle: 2.168 ± 0.449
1.961ProLys: 1.961 ± 0.508
3.923ProLeu: 3.923 ± 0.601
0.31ProMet: 0.31 ± 0.176
1.548ProAsn: 1.548 ± 0.475
1.548ProPro: 1.548 ± 0.312
1.755ProGln: 1.755 ± 0.408
2.065ProArg: 2.065 ± 0.49
2.581ProSer: 2.581 ± 0.573
1.239ProThr: 1.239 ± 0.427
4.749ProVal: 4.749 ± 0.852
0.516ProTrp: 0.516 ± 0.22
1.136ProTyr: 1.136 ± 0.388
0.0ProXaa: 0.0 ± 0.0
Gln
4.749GlnAla: 4.749 ± 1.338
0.31GlnCys: 0.31 ± 0.196
1.548GlnAsp: 1.548 ± 0.568
2.168GlnGlu: 2.168 ± 0.456
1.445GlnPhe: 1.445 ± 0.371
1.652GlnGly: 1.652 ± 0.332
0.619GlnHis: 0.619 ± 0.238
2.168GlnIle: 2.168 ± 0.587
2.684GlnLys: 2.684 ± 0.455
4.336GlnLeu: 4.336 ± 0.754
0.929GlnMet: 0.929 ± 0.347
0.723GlnAsn: 0.723 ± 0.296
1.548GlnPro: 1.548 ± 0.342
1.342GlnGln: 1.342 ± 0.52
3.303GlnArg: 3.303 ± 0.602
2.994GlnSer: 2.994 ± 0.487
2.168GlnThr: 2.168 ± 0.529
1.548GlnVal: 1.548 ± 0.337
0.619GlnTrp: 0.619 ± 0.343
1.136GlnTyr: 1.136 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
5.162ArgAla: 5.162 ± 0.949
0.413ArgCys: 0.413 ± 0.195
2.581ArgAsp: 2.581 ± 0.421
4.852ArgGlu: 4.852 ± 0.748
1.858ArgPhe: 1.858 ± 0.424
2.994ArgGly: 2.994 ± 0.717
2.168ArgHis: 2.168 ± 0.456
3.51ArgIle: 3.51 ± 0.731
4.232ArgLys: 4.232 ± 0.522
6.504ArgLeu: 6.504 ± 0.895
1.652ArgMet: 1.652 ± 0.409
2.374ArgAsn: 2.374 ± 0.355
1.445ArgPro: 1.445 ± 0.361
2.787ArgGln: 2.787 ± 0.65
4.232ArgArg: 4.232 ± 0.991
3.51ArgSer: 3.51 ± 0.427
3.923ArgThr: 3.923 ± 0.601
5.058ArgVal: 5.058 ± 0.794
1.652ArgTrp: 1.652 ± 0.496
1.548ArgTyr: 1.548 ± 0.419
0.0ArgXaa: 0.0 ± 0.0
Ser
7.846SerAla: 7.846 ± 0.809
0.516SerCys: 0.516 ± 0.212
4.129SerAsp: 4.129 ± 0.667
3.303SerGlu: 3.303 ± 0.63
2.581SerPhe: 2.581 ± 0.543
4.749SerGly: 4.749 ± 0.838
1.136SerHis: 1.136 ± 0.46
2.478SerIle: 2.478 ± 0.502
3.303SerLys: 3.303 ± 0.508
7.536SerLeu: 7.536 ± 0.938
1.342SerMet: 1.342 ± 0.401
2.271SerAsn: 2.271 ± 0.465
2.994SerPro: 2.994 ± 0.588
2.581SerGln: 2.581 ± 0.464
3.82SerArg: 3.82 ± 0.775
2.684SerSer: 2.684 ± 0.565
3.2SerThr: 3.2 ± 0.651
3.82SerVal: 3.82 ± 0.499
1.032SerTrp: 1.032 ± 0.342
1.239SerTyr: 1.239 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
7.329ThrAla: 7.329 ± 1.192
0.619ThrCys: 0.619 ± 0.255
4.645ThrAsp: 4.645 ± 0.551
3.2ThrGlu: 3.2 ± 0.486
1.858ThrPhe: 1.858 ± 0.459
6.4ThrGly: 6.4 ± 0.726
0.413ThrHis: 0.413 ± 0.259
4.232ThrIle: 4.232 ± 0.716
2.89ThrLys: 2.89 ± 0.412
7.639ThrLeu: 7.639 ± 0.881
1.342ThrMet: 1.342 ± 0.352
1.445ThrAsn: 1.445 ± 0.32
2.89ThrPro: 2.89 ± 0.5
2.065ThrGln: 2.065 ± 0.467
3.2ThrArg: 3.2 ± 0.496
4.026ThrSer: 4.026 ± 0.592
4.026ThrThr: 4.026 ± 0.618
4.955ThrVal: 4.955 ± 0.739
0.929ThrTrp: 0.929 ± 0.305
1.652ThrTyr: 1.652 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
6.4ValAla: 6.4 ± 0.776
0.826ValCys: 0.826 ± 0.366
4.852ValAsp: 4.852 ± 0.657
4.336ValGlu: 4.336 ± 0.748
2.478ValPhe: 2.478 ± 0.517
4.542ValGly: 4.542 ± 0.645
0.619ValHis: 0.619 ± 0.245
3.51ValIle: 3.51 ± 0.661
3.716ValLys: 3.716 ± 0.724
5.265ValLeu: 5.265 ± 0.719
1.136ValMet: 1.136 ± 0.38
2.684ValAsn: 2.684 ± 0.518
3.51ValPro: 3.51 ± 0.629
1.961ValGln: 1.961 ± 0.516
2.684ValArg: 2.684 ± 0.534
4.955ValSer: 4.955 ± 0.637
5.368ValThr: 5.368 ± 0.592
4.542ValVal: 4.542 ± 0.715
0.723ValTrp: 0.723 ± 0.361
1.858ValTyr: 1.858 ± 0.459
0.0ValXaa: 0.0 ± 0.0
Trp
1.548TrpAla: 1.548 ± 0.377
0.103TrpCys: 0.103 ± 0.107
1.136TrpAsp: 1.136 ± 0.357
0.723TrpGlu: 0.723 ± 0.258
0.619TrpPhe: 0.619 ± 0.276
0.413TrpGly: 0.413 ± 0.221
0.619TrpHis: 0.619 ± 0.233
1.239TrpIle: 1.239 ± 0.343
0.826TrpLys: 0.826 ± 0.324
2.374TrpLeu: 2.374 ± 0.533
0.103TrpMet: 0.103 ± 0.086
0.619TrpAsn: 0.619 ± 0.232
0.929TrpPro: 0.929 ± 0.284
0.723TrpGln: 0.723 ± 0.251
1.136TrpArg: 1.136 ± 0.373
0.826TrpSer: 0.826 ± 0.274
0.206TrpThr: 0.206 ± 0.166
0.516TrpVal: 0.516 ± 0.227
0.413TrpTrp: 0.413 ± 0.218
0.619TrpTyr: 0.619 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.374TyrAla: 2.374 ± 0.489
0.103TyrCys: 0.103 ± 0.113
2.271TyrAsp: 2.271 ± 0.394
1.652TyrGlu: 1.652 ± 0.446
1.239TyrPhe: 1.239 ± 0.454
1.239TyrGly: 1.239 ± 0.332
0.516TyrHis: 0.516 ± 0.218
1.755TyrIle: 1.755 ± 0.438
0.723TyrLys: 0.723 ± 0.234
2.168TyrLeu: 2.168 ± 0.478
0.619TyrMet: 0.619 ± 0.197
0.723TyrAsn: 0.723 ± 0.258
1.445TyrPro: 1.445 ± 0.39
1.652TyrGln: 1.652 ± 0.443
2.271TyrArg: 2.271 ± 0.55
0.929TyrSer: 0.929 ± 0.334
1.548TyrThr: 1.548 ± 0.408
2.581TyrVal: 2.581 ± 0.509
0.31TyrTrp: 0.31 ± 0.175
0.516TyrTyr: 0.516 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (9688 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski