Amino acid dipepetide frequency for Escherichia phage vB_EcoP_SP7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.714AlaAla: 9.714 ± 1.12
0.913AlaCys: 0.913 ± 0.309
5.314AlaAsp: 5.314 ± 0.737
5.48AlaGlu: 5.48 ± 0.696
3.404AlaPhe: 3.404 ± 0.521
7.639AlaGly: 7.639 ± 1.157
0.83AlaHis: 0.83 ± 0.229
5.646AlaIle: 5.646 ± 0.796
5.231AlaLys: 5.231 ± 0.674
6.974AlaLeu: 6.974 ± 1.081
2.74AlaMet: 2.74 ± 0.486
3.404AlaAsn: 3.404 ± 0.507
2.823AlaPro: 2.823 ± 0.568
2.823AlaGln: 2.823 ± 0.475
3.57AlaArg: 3.57 ± 0.523
5.065AlaSer: 5.065 ± 0.529
3.985AlaThr: 3.985 ± 0.695
5.729AlaVal: 5.729 ± 1.049
1.495AlaTrp: 1.495 ± 0.432
2.574AlaTyr: 2.574 ± 0.48
0.0AlaXaa: 0.0 ± 0.0
Cys
0.498CysAla: 0.498 ± 0.171
0.083CysCys: 0.083 ± 0.072
0.664CysAsp: 0.664 ± 0.299
0.747CysGlu: 0.747 ± 0.246
0.498CysPhe: 0.498 ± 0.206
0.913CysGly: 0.913 ± 0.307
0.332CysHis: 0.332 ± 0.151
0.332CysIle: 0.332 ± 0.19
0.664CysLys: 0.664 ± 0.254
1.079CysLeu: 1.079 ± 0.288
0.415CysMet: 0.415 ± 0.24
0.415CysAsn: 0.415 ± 0.181
0.664CysPro: 0.664 ± 0.265
0.166CysGln: 0.166 ± 0.109
0.664CysArg: 0.664 ± 0.281
0.581CysSer: 0.581 ± 0.237
0.166CysThr: 0.166 ± 0.108
0.415CysVal: 0.415 ± 0.15
0.166CysTrp: 0.166 ± 0.118
0.249CysTyr: 0.249 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
5.895AspAla: 5.895 ± 0.743
0.415AspCys: 0.415 ± 0.223
4.151AspAsp: 4.151 ± 0.712
4.318AspGlu: 4.318 ± 0.675
2.325AspPhe: 2.325 ± 0.447
6.808AspGly: 6.808 ± 0.533
1.411AspHis: 1.411 ± 0.333
3.238AspIle: 3.238 ± 0.448
3.404AspLys: 3.404 ± 0.629
4.899AspLeu: 4.899 ± 0.543
1.993AspMet: 1.993 ± 0.399
2.408AspAsn: 2.408 ± 0.456
2.574AspPro: 2.574 ± 0.458
1.91AspGln: 1.91 ± 0.44
2.325AspArg: 2.325 ± 0.424
3.736AspSer: 3.736 ± 0.532
3.985AspThr: 3.985 ± 0.62
4.65AspVal: 4.65 ± 0.491
0.747AspTrp: 0.747 ± 0.238
2.076AspTyr: 2.076 ± 0.373
0.0AspXaa: 0.0 ± 0.0
Glu
6.31GluAla: 6.31 ± 0.939
0.581GluCys: 0.581 ± 0.195
5.397GluAsp: 5.397 ± 0.849
4.733GluGlu: 4.733 ± 0.804
2.491GluPhe: 2.491 ± 0.476
4.567GluGly: 4.567 ± 0.723
1.245GluHis: 1.245 ± 0.311
2.408GluIle: 2.408 ± 0.395
3.072GluLys: 3.072 ± 0.423
6.061GluLeu: 6.061 ± 0.638
1.993GluMet: 1.993 ± 0.468
2.574GluAsn: 2.574 ± 0.508
2.408GluPro: 2.408 ± 0.406
2.74GluGln: 2.74 ± 0.463
4.234GluArg: 4.234 ± 0.643
4.068GluSer: 4.068 ± 0.54
3.736GluThr: 3.736 ± 0.404
4.401GluVal: 4.401 ± 0.723
1.578GluTrp: 1.578 ± 0.256
2.989GluTyr: 2.989 ± 0.569
0.0GluXaa: 0.0 ± 0.0
Phe
2.989PheAla: 2.989 ± 0.382
0.498PheCys: 0.498 ± 0.228
2.657PheAsp: 2.657 ± 0.396
1.993PheGlu: 1.993 ± 0.327
1.245PhePhe: 1.245 ± 0.337
2.408PheGly: 2.408 ± 0.481
0.747PheHis: 0.747 ± 0.234
1.744PheIle: 1.744 ± 0.408
2.989PheLys: 2.989 ± 0.511
3.072PheLeu: 3.072 ± 0.413
1.079PheMet: 1.079 ± 0.354
2.574PheAsn: 2.574 ± 0.492
1.578PhePro: 1.578 ± 0.415
0.747PheGln: 0.747 ± 0.313
1.495PheArg: 1.495 ± 0.291
2.408PheSer: 2.408 ± 0.281
2.242PheThr: 2.242 ± 0.284
2.574PheVal: 2.574 ± 0.529
0.332PheTrp: 0.332 ± 0.128
1.328PheTyr: 1.328 ± 0.289
0.0PheXaa: 0.0 ± 0.0
Gly
6.476GlyAla: 6.476 ± 0.872
0.83GlyCys: 0.83 ± 0.319
4.899GlyAsp: 4.899 ± 0.904
5.397GlyGlu: 5.397 ± 0.53
2.159GlyPhe: 2.159 ± 0.388
5.563GlyGly: 5.563 ± 0.644
0.747GlyHis: 0.747 ± 0.213
4.484GlyIle: 4.484 ± 0.55
5.978GlyLys: 5.978 ± 0.797
5.646GlyLeu: 5.646 ± 0.768
2.657GlyMet: 2.657 ± 0.382
2.823GlyAsn: 2.823 ± 0.444
1.079GlyPro: 1.079 ± 0.31
2.74GlyGln: 2.74 ± 0.353
6.227GlyArg: 6.227 ± 0.802
6.393GlySer: 6.393 ± 0.726
4.899GlyThr: 4.899 ± 0.626
6.31GlyVal: 6.31 ± 0.935
1.328GlyTrp: 1.328 ± 0.342
3.819GlyTyr: 3.819 ± 0.57
0.0GlyXaa: 0.0 ± 0.0
His
0.664HisAla: 0.664 ± 0.259
0.166HisCys: 0.166 ± 0.11
1.245HisAsp: 1.245 ± 0.448
1.245HisGlu: 1.245 ± 0.418
0.747HisPhe: 0.747 ± 0.273
0.913HisGly: 0.913 ± 0.293
0.415HisHis: 0.415 ± 0.178
1.079HisIle: 1.079 ± 0.206
1.495HisLys: 1.495 ± 0.337
1.744HisLeu: 1.744 ± 0.355
0.581HisMet: 0.581 ± 0.206
0.581HisAsn: 0.581 ± 0.218
0.498HisPro: 0.498 ± 0.163
0.498HisGln: 0.498 ± 0.196
1.079HisArg: 1.079 ± 0.258
0.747HisSer: 0.747 ± 0.227
1.162HisThr: 1.162 ± 0.318
1.162HisVal: 1.162 ± 0.241
0.415HisTrp: 0.415 ± 0.195
0.498HisTyr: 0.498 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
3.487IleAla: 3.487 ± 0.55
0.664IleCys: 0.664 ± 0.285
3.321IleAsp: 3.321 ± 0.38
3.321IleGlu: 3.321 ± 0.405
1.245IlePhe: 1.245 ± 0.279
4.982IleGly: 4.982 ± 0.818
1.079IleHis: 1.079 ± 0.274
2.325IleIle: 2.325 ± 0.4
3.985IleLys: 3.985 ± 0.528
3.487IleLeu: 3.487 ± 0.434
0.664IleMet: 0.664 ± 0.189
2.491IleAsn: 2.491 ± 0.542
2.159IlePro: 2.159 ± 0.515
1.661IleGln: 1.661 ± 0.441
2.906IleArg: 2.906 ± 0.5
3.238IleSer: 3.238 ± 0.473
2.989IleThr: 2.989 ± 0.559
4.234IleVal: 4.234 ± 0.469
0.664IleTrp: 0.664 ± 0.211
1.328IleTyr: 1.328 ± 0.221
0.0IleXaa: 0.0 ± 0.0
Lys
7.39LysAla: 7.39 ± 0.757
0.83LysCys: 0.83 ± 0.269
3.736LysAsp: 3.736 ± 0.508
3.404LysGlu: 3.404 ± 0.423
2.408LysPhe: 2.408 ± 0.492
4.318LysGly: 4.318 ± 0.605
1.495LysHis: 1.495 ± 0.459
2.491LysIle: 2.491 ± 0.35
3.902LysLys: 3.902 ± 0.827
5.563LysLeu: 5.563 ± 0.693
1.744LysMet: 1.744 ± 0.382
2.242LysAsn: 2.242 ± 0.37
2.491LysPro: 2.491 ± 0.607
1.91LysGln: 1.91 ± 0.399
4.401LysArg: 4.401 ± 0.705
4.151LysSer: 4.151 ± 0.527
4.318LysThr: 4.318 ± 0.551
4.899LysVal: 4.899 ± 0.688
1.079LysTrp: 1.079 ± 0.297
2.325LysTyr: 2.325 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
5.978LeuAla: 5.978 ± 0.915
0.415LeuCys: 0.415 ± 0.161
4.234LeuAsp: 4.234 ± 0.501
6.725LeuGlu: 6.725 ± 0.626
2.242LeuPhe: 2.242 ± 0.368
4.982LeuGly: 4.982 ± 0.751
0.83LeuHis: 0.83 ± 0.231
3.321LeuIle: 3.321 ± 0.643
6.642LeuLys: 6.642 ± 0.633
5.397LeuLeu: 5.397 ± 0.662
2.989LeuMet: 2.989 ± 0.605
4.567LeuAsn: 4.567 ± 0.595
3.321LeuPro: 3.321 ± 0.423
4.318LeuGln: 4.318 ± 0.674
4.899LeuArg: 4.899 ± 0.47
5.895LeuSer: 5.895 ± 0.684
5.48LeuThr: 5.48 ± 0.655
5.231LeuVal: 5.231 ± 0.674
0.747LeuTrp: 0.747 ± 0.239
2.159LeuTyr: 2.159 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
2.989MetAla: 2.989 ± 0.519
0.332MetCys: 0.332 ± 0.191
1.328MetAsp: 1.328 ± 0.34
1.91MetGlu: 1.91 ± 0.351
1.411MetPhe: 1.411 ± 0.325
2.574MetGly: 2.574 ± 0.467
0.249MetHis: 0.249 ± 0.136
1.328MetIle: 1.328 ± 0.245
1.328MetLys: 1.328 ± 0.258
2.325MetLeu: 2.325 ± 0.396
0.83MetMet: 0.83 ± 0.229
1.162MetAsn: 1.162 ± 0.266
1.079MetPro: 1.079 ± 0.3
0.747MetGln: 0.747 ± 0.349
1.079MetArg: 1.079 ± 0.271
1.993MetSer: 1.993 ± 0.385
1.827MetThr: 1.827 ± 0.346
2.74MetVal: 2.74 ± 0.403
0.249MetTrp: 0.249 ± 0.137
1.245MetTyr: 1.245 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
4.484AsnAla: 4.484 ± 0.654
0.498AsnCys: 0.498 ± 0.205
2.242AsnAsp: 2.242 ± 0.481
2.574AsnGlu: 2.574 ± 0.476
1.827AsnPhe: 1.827 ± 0.299
4.899AsnGly: 4.899 ± 0.704
0.581AsnHis: 0.581 ± 0.177
2.74AsnIle: 2.74 ± 0.367
2.325AsnLys: 2.325 ± 0.408
3.57AsnLeu: 3.57 ± 0.64
1.162AsnMet: 1.162 ± 0.301
1.91AsnAsn: 1.91 ± 0.424
2.657AsnPro: 2.657 ± 0.514
1.827AsnGln: 1.827 ± 0.337
2.325AsnArg: 2.325 ± 0.575
2.159AsnSer: 2.159 ± 0.405
1.91AsnThr: 1.91 ± 0.333
3.155AsnVal: 3.155 ± 0.539
0.332AsnTrp: 0.332 ± 0.144
1.91AsnTyr: 1.91 ± 0.531
0.0AsnXaa: 0.0 ± 0.0
Pro
2.574ProAla: 2.574 ± 0.466
0.332ProCys: 0.332 ± 0.203
2.242ProAsp: 2.242 ± 0.369
3.072ProGlu: 3.072 ± 0.595
1.245ProPhe: 1.245 ± 0.257
1.578ProGly: 1.578 ± 0.325
0.664ProHis: 0.664 ± 0.203
1.91ProIle: 1.91 ± 0.335
3.321ProLys: 3.321 ± 0.635
2.242ProLeu: 2.242 ± 0.436
1.079ProMet: 1.079 ± 0.248
2.242ProAsn: 2.242 ± 0.373
0.747ProPro: 0.747 ± 0.216
1.411ProGln: 1.411 ± 0.333
1.661ProArg: 1.661 ± 0.432
2.408ProSer: 2.408 ± 0.315
3.155ProThr: 3.155 ± 0.336
2.657ProVal: 2.657 ± 0.331
0.664ProTrp: 0.664 ± 0.255
0.996ProTyr: 0.996 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
3.985GlnAla: 3.985 ± 0.418
0.332GlnCys: 0.332 ± 0.135
3.155GlnAsp: 3.155 ± 0.723
2.74GlnGlu: 2.74 ± 0.397
1.578GlnPhe: 1.578 ± 0.291
2.574GlnGly: 2.574 ± 0.44
0.664GlnHis: 0.664 ± 0.236
1.162GlnIle: 1.162 ± 0.289
2.076GlnLys: 2.076 ± 0.341
3.653GlnLeu: 3.653 ± 0.71
1.162GlnMet: 1.162 ± 0.345
1.495GlnAsn: 1.495 ± 0.415
0.913GlnPro: 0.913 ± 0.28
2.076GlnGln: 2.076 ± 0.632
2.408GlnArg: 2.408 ± 0.589
2.989GlnSer: 2.989 ± 0.477
1.91GlnThr: 1.91 ± 0.426
2.076GlnVal: 2.076 ± 0.329
0.664GlnTrp: 0.664 ± 0.196
1.162GlnTyr: 1.162 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
4.484ArgAla: 4.484 ± 0.725
0.498ArgCys: 0.498 ± 0.17
4.65ArgAsp: 4.65 ± 0.46
3.653ArgGlu: 3.653 ± 0.503
2.989ArgPhe: 2.989 ± 0.375
4.068ArgGly: 4.068 ± 0.421
0.83ArgHis: 0.83 ± 0.294
3.653ArgIle: 3.653 ± 0.723
3.072ArgLys: 3.072 ± 0.612
5.563ArgLeu: 5.563 ± 0.685
0.83ArgMet: 0.83 ± 0.263
2.989ArgAsn: 2.989 ± 0.454
1.661ArgPro: 1.661 ± 0.326
2.574ArgGln: 2.574 ± 0.366
2.159ArgArg: 2.159 ± 0.374
3.072ArgSer: 3.072 ± 0.616
2.408ArgThr: 2.408 ± 0.362
2.74ArgVal: 2.74 ± 0.611
1.079ArgTrp: 1.079 ± 0.241
1.744ArgTyr: 1.744 ± 0.31
0.0ArgXaa: 0.0 ± 0.0
Ser
4.234SerAla: 4.234 ± 0.613
0.913SerCys: 0.913 ± 0.341
5.065SerAsp: 5.065 ± 0.511
3.321SerGlu: 3.321 ± 0.558
2.574SerPhe: 2.574 ± 0.433
6.393SerGly: 6.393 ± 1.023
1.993SerHis: 1.993 ± 0.436
3.155SerIle: 3.155 ± 0.538
3.487SerLys: 3.487 ± 0.463
4.151SerLeu: 4.151 ± 0.565
1.91SerMet: 1.91 ± 0.327
3.321SerAsn: 3.321 ± 0.545
2.574SerPro: 2.574 ± 0.449
2.491SerGln: 2.491 ± 0.527
2.989SerArg: 2.989 ± 0.514
4.234SerSer: 4.234 ± 0.561
2.906SerThr: 2.906 ± 0.311
4.068SerVal: 4.068 ± 0.502
0.747SerTrp: 0.747 ± 0.228
2.906SerTyr: 2.906 ± 0.566
0.0SerXaa: 0.0 ± 0.0
Thr
3.653ThrAla: 3.653 ± 0.665
0.415ThrCys: 0.415 ± 0.193
2.906ThrAsp: 2.906 ± 0.47
4.567ThrGlu: 4.567 ± 0.601
2.491ThrPhe: 2.491 ± 0.358
5.646ThrGly: 5.646 ± 0.629
0.664ThrHis: 0.664 ± 0.232
3.653ThrIle: 3.653 ± 0.596
3.321ThrLys: 3.321 ± 0.525
4.65ThrLeu: 4.65 ± 0.501
1.661ThrMet: 1.661 ± 0.358
1.827ThrAsn: 1.827 ± 0.383
2.906ThrPro: 2.906 ± 0.414
2.74ThrGln: 2.74 ± 0.481
2.906ThrArg: 2.906 ± 0.486
2.491ThrSer: 2.491 ± 0.494
3.238ThrThr: 3.238 ± 0.492
4.899ThrVal: 4.899 ± 0.632
0.498ThrTrp: 0.498 ± 0.127
1.411ThrTyr: 1.411 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
5.729ValAla: 5.729 ± 0.629
0.664ValCys: 0.664 ± 0.21
3.404ValAsp: 3.404 ± 0.639
5.397ValGlu: 5.397 ± 0.86
2.076ValPhe: 2.076 ± 0.481
5.812ValGly: 5.812 ± 0.629
1.245ValHis: 1.245 ± 0.544
3.404ValIle: 3.404 ± 0.564
5.231ValLys: 5.231 ± 0.67
5.231ValLeu: 5.231 ± 0.604
2.159ValMet: 2.159 ± 0.424
3.321ValAsn: 3.321 ± 0.518
2.574ValPro: 2.574 ± 0.483
3.155ValGln: 3.155 ± 0.496
4.151ValArg: 4.151 ± 0.602
4.899ValSer: 4.899 ± 0.611
3.736ValThr: 3.736 ± 0.517
6.061ValVal: 6.061 ± 0.823
0.83ValTrp: 0.83 ± 0.256
2.491ValTyr: 2.491 ± 0.505
0.0ValXaa: 0.0 ± 0.0
Trp
0.415TrpAla: 0.415 ± 0.148
0.083TrpCys: 0.083 ± 0.074
0.747TrpAsp: 0.747 ± 0.194
0.996TrpGlu: 0.996 ± 0.217
0.664TrpPhe: 0.664 ± 0.19
1.079TrpGly: 1.079 ± 0.334
0.332TrpHis: 0.332 ± 0.155
0.415TrpIle: 0.415 ± 0.184
1.661TrpLys: 1.661 ± 0.399
1.91TrpLeu: 1.91 ± 0.413
0.249TrpMet: 0.249 ± 0.134
0.996TrpAsn: 0.996 ± 0.345
0.166TrpPro: 0.166 ± 0.126
0.581TrpGln: 0.581 ± 0.233
0.664TrpArg: 0.664 ± 0.202
0.913TrpSer: 0.913 ± 0.425
0.581TrpThr: 0.581 ± 0.212
1.245TrpVal: 1.245 ± 0.349
0.249TrpTrp: 0.249 ± 0.156
0.415TrpTyr: 0.415 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.404TyrAla: 3.404 ± 0.508
0.249TyrCys: 0.249 ± 0.145
1.91TyrAsp: 1.91 ± 0.32
1.993TyrGlu: 1.993 ± 0.502
1.079TyrPhe: 1.079 ± 0.296
2.74TyrGly: 2.74 ± 0.448
0.581TyrHis: 0.581 ± 0.265
1.744TyrIle: 1.744 ± 0.515
1.91TyrLys: 1.91 ± 0.44
3.155TyrLeu: 3.155 ± 0.501
0.747TyrMet: 0.747 ± 0.287
1.744TyrAsn: 1.744 ± 0.429
1.328TyrPro: 1.328 ± 0.44
1.661TyrGln: 1.661 ± 0.381
2.574TyrArg: 2.574 ± 0.492
1.993TyrSer: 1.993 ± 0.381
1.91TyrThr: 1.91 ± 0.304
2.408TyrVal: 2.408 ± 0.467
0.498TyrTrp: 0.498 ± 0.199
1.245TyrTyr: 1.245 ± 0.332
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12045 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski