Amino acid dipepetide frequency for Achromobacter phage vB_AxyS_19-32_Axy16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.422AlaAla: 8.422 ± 0.995
0.642AlaCys: 0.642 ± 0.233
5.282AlaAsp: 5.282 ± 0.688
7.494AlaGlu: 7.494 ± 0.965
3.283AlaPhe: 3.283 ± 0.46
7.637AlaGly: 7.637 ± 0.962
1.642AlaHis: 1.642 ± 0.355
5.21AlaIle: 5.21 ± 0.703
4.996AlaLys: 4.996 ± 0.732
7.922AlaLeu: 7.922 ± 0.831
2.998AlaMet: 2.998 ± 0.495
3.711AlaAsn: 3.711 ± 0.557
3.569AlaPro: 3.569 ± 0.611
4.14AlaGln: 4.14 ± 0.68
5.282AlaArg: 5.282 ± 0.566
5.21AlaSer: 5.21 ± 0.56
6.209AlaThr: 6.209 ± 0.751
6.995AlaVal: 6.995 ± 0.751
2.141AlaTrp: 2.141 ± 0.311
2.712AlaTyr: 2.712 ± 0.379
0.0AlaXaa: 0.0 ± 0.0
Cys
0.714CysAla: 0.714 ± 0.266
0.143CysCys: 0.143 ± 0.102
0.357CysAsp: 0.357 ± 0.164
0.714CysGlu: 0.714 ± 0.234
0.571CysPhe: 0.571 ± 0.202
0.428CysGly: 0.428 ± 0.198
0.214CysHis: 0.214 ± 0.124
0.285CysIle: 0.285 ± 0.141
0.357CysLys: 0.357 ± 0.162
0.571CysLeu: 0.571 ± 0.172
0.428CysMet: 0.428 ± 0.155
0.357CysAsn: 0.357 ± 0.195
0.714CysPro: 0.714 ± 0.279
0.071CysGln: 0.071 ± 0.085
0.642CysArg: 0.642 ± 0.193
0.571CysSer: 0.571 ± 0.225
0.214CysThr: 0.214 ± 0.124
0.642CysVal: 0.642 ± 0.164
0.357CysTrp: 0.357 ± 0.161
0.357CysTyr: 0.357 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
5.424AspAla: 5.424 ± 0.483
0.428AspCys: 0.428 ± 0.143
3.854AspAsp: 3.854 ± 0.669
3.783AspGlu: 3.783 ± 0.593
2.926AspPhe: 2.926 ± 0.503
5.139AspGly: 5.139 ± 0.708
0.856AspHis: 0.856 ± 0.259
2.784AspIle: 2.784 ± 0.464
2.998AspLys: 2.998 ± 0.49
5.139AspLeu: 5.139 ± 0.572
2.141AspMet: 2.141 ± 0.407
2.427AspAsn: 2.427 ± 0.402
2.855AspPro: 2.855 ± 0.496
1.713AspGln: 1.713 ± 0.492
1.784AspArg: 1.784 ± 0.381
2.926AspSer: 2.926 ± 0.42
2.355AspThr: 2.355 ± 0.405
4.14AspVal: 4.14 ± 0.825
0.856AspTrp: 0.856 ± 0.244
2.284AspTyr: 2.284 ± 0.426
0.0AspXaa: 0.0 ± 0.0
Glu
7.637GluAla: 7.637 ± 0.861
0.714GluCys: 0.714 ± 0.243
3.711GluAsp: 3.711 ± 0.495
5.71GluGlu: 5.71 ± 0.616
1.927GluPhe: 1.927 ± 0.364
3.497GluGly: 3.497 ± 0.564
0.714GluHis: 0.714 ± 0.215
2.569GluIle: 2.569 ± 0.417
3.997GluLys: 3.997 ± 0.588
6.424GluLeu: 6.424 ± 0.553
2.07GluMet: 2.07 ± 0.393
2.141GluAsn: 2.141 ± 0.431
3.355GluPro: 3.355 ± 0.42
3.854GluGln: 3.854 ± 0.588
3.997GluArg: 3.997 ± 0.554
2.784GluSer: 2.784 ± 0.447
3.283GluThr: 3.283 ± 0.438
5.282GluVal: 5.282 ± 0.662
1.427GluTrp: 1.427 ± 0.335
2.141GluTyr: 2.141 ± 0.482
0.0GluXaa: 0.0 ± 0.0
Phe
2.998PheAla: 2.998 ± 0.385
0.428PheCys: 0.428 ± 0.184
2.641PheAsp: 2.641 ± 0.398
2.712PheGlu: 2.712 ± 0.475
0.928PhePhe: 0.928 ± 0.194
3.355PheGly: 3.355 ± 0.5
0.285PheHis: 0.285 ± 0.163
1.927PheIle: 1.927 ± 0.336
2.355PheLys: 2.355 ± 0.358
2.141PheLeu: 2.141 ± 0.402
1.356PheMet: 1.356 ± 0.284
1.213PheAsn: 1.213 ± 0.271
1.998PhePro: 1.998 ± 0.369
1.427PheGln: 1.427 ± 0.297
2.284PheArg: 2.284 ± 0.419
2.784PheSer: 2.784 ± 0.485
1.856PheThr: 1.856 ± 0.402
2.498PheVal: 2.498 ± 0.463
0.642PheTrp: 0.642 ± 0.237
1.356PheTyr: 1.356 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
6.638GlyAla: 6.638 ± 0.893
0.285GlyCys: 0.285 ± 0.128
4.211GlyAsp: 4.211 ± 0.433
3.997GlyGlu: 3.997 ± 0.487
3.497GlyPhe: 3.497 ± 0.491
6.923GlyGly: 6.923 ± 0.815
0.714GlyHis: 0.714 ± 0.227
4.425GlyIle: 4.425 ± 0.53
4.853GlyLys: 4.853 ± 0.685
6.709GlyLeu: 6.709 ± 0.595
1.927GlyMet: 1.927 ± 0.354
3.854GlyAsn: 3.854 ± 0.589
3.283GlyPro: 3.283 ± 0.561
2.855GlyGln: 2.855 ± 0.393
4.425GlyArg: 4.425 ± 0.446
5.924GlySer: 5.924 ± 0.9
4.14GlyThr: 4.14 ± 0.615
5.282GlyVal: 5.282 ± 0.697
1.713GlyTrp: 1.713 ± 0.332
2.855GlyTyr: 2.855 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
1.642HisAla: 1.642 ± 0.416
0.357HisCys: 0.357 ± 0.142
0.714HisAsp: 0.714 ± 0.218
0.928HisGlu: 0.928 ± 0.268
0.714HisPhe: 0.714 ± 0.181
1.142HisGly: 1.142 ± 0.322
0.214HisHis: 0.214 ± 0.166
0.928HisIle: 0.928 ± 0.242
0.714HisLys: 0.714 ± 0.195
1.57HisLeu: 1.57 ± 0.337
0.357HisMet: 0.357 ± 0.17
0.785HisAsn: 0.785 ± 0.217
1.142HisPro: 1.142 ± 0.2
0.357HisGln: 0.357 ± 0.132
1.285HisArg: 1.285 ± 0.31
1.071HisSer: 1.071 ± 0.317
0.428HisThr: 0.428 ± 0.168
1.57HisVal: 1.57 ± 0.371
0.143HisTrp: 0.143 ± 0.104
0.714HisTyr: 0.714 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
4.925IleAla: 4.925 ± 0.618
0.5IleCys: 0.5 ± 0.217
3.854IleAsp: 3.854 ± 0.661
3.569IleGlu: 3.569 ± 0.479
1.213IlePhe: 1.213 ± 0.225
3.711IleGly: 3.711 ± 0.526
1.285IleHis: 1.285 ± 0.321
1.998IleIle: 1.998 ± 0.356
2.498IleLys: 2.498 ± 0.43
2.998IleLeu: 2.998 ± 0.439
0.856IleMet: 0.856 ± 0.232
2.855IleAsn: 2.855 ± 0.459
2.355IlePro: 2.355 ± 0.354
1.784IleGln: 1.784 ± 0.372
3.64IleArg: 3.64 ± 0.532
3.355IleSer: 3.355 ± 0.571
2.926IleThr: 2.926 ± 0.461
3.925IleVal: 3.925 ± 0.486
0.928IleTrp: 0.928 ± 0.238
1.499IleTyr: 1.499 ± 0.417
0.0IleXaa: 0.0 ± 0.0
Lys
5.995LysAla: 5.995 ± 0.772
0.357LysCys: 0.357 ± 0.147
2.355LysAsp: 2.355 ± 0.506
3.426LysGlu: 3.426 ± 0.537
2.569LysPhe: 2.569 ± 0.526
4.496LysGly: 4.496 ± 0.543
1.142LysHis: 1.142 ± 0.304
2.855LysIle: 2.855 ± 0.441
3.283LysLys: 3.283 ± 0.656
4.068LysLeu: 4.068 ± 0.665
1.427LysMet: 1.427 ± 0.309
1.285LysAsn: 1.285 ± 0.299
2.427LysPro: 2.427 ± 0.457
2.355LysGln: 2.355 ± 0.378
2.926LysArg: 2.926 ± 0.458
2.569LysSer: 2.569 ± 0.381
2.926LysThr: 2.926 ± 0.377
3.925LysVal: 3.925 ± 0.474
1.142LysTrp: 1.142 ± 0.289
1.642LysTyr: 1.642 ± 0.395
0.0LysXaa: 0.0 ± 0.0
Leu
9.136LeuAla: 9.136 ± 0.757
0.856LeuCys: 0.856 ± 0.216
4.211LeuAsp: 4.211 ± 0.572
6.424LeuGlu: 6.424 ± 0.74
2.213LeuPhe: 2.213 ± 0.426
5.139LeuGly: 5.139 ± 0.649
1.142LeuHis: 1.142 ± 0.191
3.426LeuIle: 3.426 ± 0.452
4.282LeuLys: 4.282 ± 0.614
5.995LeuLeu: 5.995 ± 0.694
2.355LeuMet: 2.355 ± 0.544
3.711LeuAsn: 3.711 ± 0.528
5.567LeuPro: 5.567 ± 0.756
4.639LeuGln: 4.639 ± 0.509
5.21LeuArg: 5.21 ± 0.648
4.639LeuSer: 4.639 ± 0.534
5.282LeuThr: 5.282 ± 0.704
4.496LeuVal: 4.496 ± 0.474
0.928LeuTrp: 0.928 ± 0.306
2.427LeuTyr: 2.427 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
2.998MetAla: 2.998 ± 0.552
0.285MetCys: 0.285 ± 0.15
1.642MetAsp: 1.642 ± 0.343
1.57MetGlu: 1.57 ± 0.329
1.142MetPhe: 1.142 ± 0.27
1.642MetGly: 1.642 ± 0.258
0.642MetHis: 0.642 ± 0.238
1.285MetIle: 1.285 ± 0.248
1.57MetLys: 1.57 ± 0.331
2.855MetLeu: 2.855 ± 0.457
0.714MetMet: 0.714 ± 0.202
1.642MetAsn: 1.642 ± 0.269
1.713MetPro: 1.713 ± 0.255
1.285MetGln: 1.285 ± 0.289
1.142MetArg: 1.142 ± 0.272
1.927MetSer: 1.927 ± 0.393
1.927MetThr: 1.927 ± 0.396
1.713MetVal: 1.713 ± 0.356
0.0MetTrp: 0.0 ± 0.0
0.642MetTyr: 0.642 ± 0.209
0.0MetXaa: 0.0 ± 0.0
Asn
4.354AsnAla: 4.354 ± 0.509
0.571AsnCys: 0.571 ± 0.181
1.998AsnAsp: 1.998 ± 0.32
2.284AsnGlu: 2.284 ± 0.376
1.642AsnPhe: 1.642 ± 0.273
4.282AsnGly: 4.282 ± 0.542
0.428AsnHis: 0.428 ± 0.177
2.07AsnIle: 2.07 ± 0.329
1.57AsnLys: 1.57 ± 0.358
3.997AsnLeu: 3.997 ± 0.573
1.213AsnMet: 1.213 ± 0.217
1.856AsnAsn: 1.856 ± 0.365
2.855AsnPro: 2.855 ± 0.434
1.642AsnGln: 1.642 ± 0.431
1.713AsnArg: 1.713 ± 0.329
2.141AsnSer: 2.141 ± 0.411
2.213AsnThr: 2.213 ± 0.417
2.284AsnVal: 2.284 ± 0.465
1.499AsnTrp: 1.499 ± 0.355
1.213AsnTyr: 1.213 ± 0.352
0.0AsnXaa: 0.0 ± 0.0
Pro
4.996ProAla: 4.996 ± 0.649
0.143ProCys: 0.143 ± 0.104
3.212ProAsp: 3.212 ± 0.552
3.783ProGlu: 3.783 ± 0.603
1.57ProPhe: 1.57 ± 0.4
5.067ProGly: 5.067 ± 0.775
0.928ProHis: 0.928 ± 0.173
2.855ProIle: 2.855 ± 0.545
3.283ProLys: 3.283 ± 0.524
2.926ProLeu: 2.926 ± 0.34
1.071ProMet: 1.071 ± 0.263
2.355ProAsn: 2.355 ± 0.428
1.927ProPro: 1.927 ± 0.341
2.427ProGln: 2.427 ± 0.367
2.498ProArg: 2.498 ± 0.444
3.069ProSer: 3.069 ± 0.412
3.283ProThr: 3.283 ± 0.391
3.283ProVal: 3.283 ± 0.419
0.571ProTrp: 0.571 ± 0.181
0.999ProTyr: 0.999 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
4.925GlnAla: 4.925 ± 0.604
0.285GlnCys: 0.285 ± 0.123
1.784GlnAsp: 1.784 ± 0.357
3.14GlnGlu: 3.14 ± 0.59
2.213GlnPhe: 2.213 ± 0.384
3.14GlnGly: 3.14 ± 0.395
0.785GlnHis: 0.785 ± 0.204
1.927GlnIle: 1.927 ± 0.36
1.713GlnLys: 1.713 ± 0.498
3.997GlnLeu: 3.997 ± 0.559
1.285GlnMet: 1.285 ± 0.304
1.57GlnAsn: 1.57 ± 0.389
1.642GlnPro: 1.642 ± 0.272
1.856GlnGln: 1.856 ± 0.472
2.784GlnArg: 2.784 ± 0.376
2.284GlnSer: 2.284 ± 0.324
2.07GlnThr: 2.07 ± 0.336
2.926GlnVal: 2.926 ± 0.506
0.714GlnTrp: 0.714 ± 0.215
1.285GlnTyr: 1.285 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
5.71ArgAla: 5.71 ± 0.744
0.214ArgCys: 0.214 ± 0.114
3.212ArgAsp: 3.212 ± 0.617
3.426ArgGlu: 3.426 ± 0.562
1.713ArgPhe: 1.713 ± 0.453
4.211ArgGly: 4.211 ± 0.646
1.285ArgHis: 1.285 ± 0.34
2.784ArgIle: 2.784 ± 0.473
3.569ArgLys: 3.569 ± 0.518
4.996ArgLeu: 4.996 ± 0.636
2.569ArgMet: 2.569 ± 0.425
2.427ArgAsn: 2.427 ± 0.306
2.926ArgPro: 2.926 ± 0.505
2.284ArgGln: 2.284 ± 0.386
4.14ArgArg: 4.14 ± 0.548
2.712ArgSer: 2.712 ± 0.468
3.212ArgThr: 3.212 ± 0.427
4.354ArgVal: 4.354 ± 0.433
0.999ArgTrp: 0.999 ± 0.244
1.784ArgTyr: 1.784 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
4.639SerAla: 4.639 ± 0.63
0.785SerCys: 0.785 ± 0.2
4.282SerAsp: 4.282 ± 0.529
3.283SerGlu: 3.283 ± 0.601
2.141SerPhe: 2.141 ± 0.424
4.711SerGly: 4.711 ± 0.635
1.142SerHis: 1.142 ± 0.315
3.426SerIle: 3.426 ± 0.409
2.498SerLys: 2.498 ± 0.427
4.425SerLeu: 4.425 ± 0.553
1.427SerMet: 1.427 ± 0.356
2.284SerAsn: 2.284 ± 0.39
1.998SerPro: 1.998 ± 0.432
2.284SerGln: 2.284 ± 0.42
3.64SerArg: 3.64 ± 0.429
2.998SerSer: 2.998 ± 0.458
3.283SerThr: 3.283 ± 0.527
4.425SerVal: 4.425 ± 0.607
0.999SerTrp: 0.999 ± 0.265
1.856SerTyr: 1.856 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
5.139ThrAla: 5.139 ± 0.79
0.357ThrCys: 0.357 ± 0.16
2.569ThrAsp: 2.569 ± 0.443
3.355ThrGlu: 3.355 ± 0.491
2.213ThrPhe: 2.213 ± 0.377
4.639ThrGly: 4.639 ± 0.575
0.999ThrHis: 0.999 ± 0.27
3.283ThrIle: 3.283 ± 0.376
2.569ThrLys: 2.569 ± 0.507
5.638ThrLeu: 5.638 ± 0.713
1.071ThrMet: 1.071 ± 0.292
2.712ThrAsn: 2.712 ± 0.573
3.14ThrPro: 3.14 ± 0.473
2.07ThrGln: 2.07 ± 0.418
2.569ThrArg: 2.569 ± 0.392
3.212ThrSer: 3.212 ± 0.503
3.64ThrThr: 3.64 ± 0.523
3.14ThrVal: 3.14 ± 0.379
1.57ThrTrp: 1.57 ± 0.351
1.927ThrTyr: 1.927 ± 0.415
0.0ThrXaa: 0.0 ± 0.0
Val
5.353ValAla: 5.353 ± 0.661
0.428ValCys: 0.428 ± 0.234
4.211ValAsp: 4.211 ± 0.507
4.425ValGlu: 4.425 ± 0.529
2.855ValPhe: 2.855 ± 0.46
5.496ValGly: 5.496 ± 0.483
1.427ValHis: 1.427 ± 0.319
3.355ValIle: 3.355 ± 0.44
3.569ValLys: 3.569 ± 0.62
5.781ValLeu: 5.781 ± 0.569
1.499ValMet: 1.499 ± 0.274
2.498ValAsn: 2.498 ± 0.545
4.211ValPro: 4.211 ± 0.56
3.355ValGln: 3.355 ± 0.471
4.782ValArg: 4.782 ± 0.701
3.355ValSer: 3.355 ± 0.398
3.925ValThr: 3.925 ± 0.476
4.568ValVal: 4.568 ± 0.664
1.427ValTrp: 1.427 ± 0.308
2.213ValTyr: 2.213 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
1.499TrpAla: 1.499 ± 0.395
0.428TrpCys: 0.428 ± 0.146
0.999TrpAsp: 0.999 ± 0.282
1.356TrpGlu: 1.356 ± 0.318
0.856TrpPhe: 0.856 ± 0.251
1.499TrpGly: 1.499 ± 0.319
0.143TrpHis: 0.143 ± 0.088
1.285TrpIle: 1.285 ± 0.326
0.999TrpLys: 0.999 ± 0.298
1.285TrpLeu: 1.285 ± 0.347
0.428TrpMet: 0.428 ± 0.135
0.642TrpAsn: 0.642 ± 0.203
0.999TrpPro: 0.999 ± 0.276
0.357TrpGln: 0.357 ± 0.145
1.213TrpArg: 1.213 ± 0.281
1.499TrpSer: 1.499 ± 0.272
0.999TrpThr: 0.999 ± 0.245
1.142TrpVal: 1.142 ± 0.283
0.357TrpTrp: 0.357 ± 0.13
1.071TrpTyr: 1.071 ± 0.302
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.355TyrAla: 2.355 ± 0.36
0.5TyrCys: 0.5 ± 0.218
1.713TyrAsp: 1.713 ± 0.303
1.927TyrGlu: 1.927 ± 0.34
0.999TyrPhe: 0.999 ± 0.23
2.355TyrGly: 2.355 ± 0.413
0.785TyrHis: 0.785 ± 0.225
1.998TyrIle: 1.998 ± 0.429
1.499TyrLys: 1.499 ± 0.369
2.641TyrLeu: 2.641 ± 0.406
1.071TyrMet: 1.071 ± 0.242
1.499TyrAsn: 1.499 ± 0.305
1.57TyrPro: 1.57 ± 0.317
1.57TyrGln: 1.57 ± 0.395
2.569TyrArg: 2.569 ± 0.452
1.499TyrSer: 1.499 ± 0.301
1.642TyrThr: 1.642 ± 0.351
2.141TyrVal: 2.141 ± 0.479
0.642TyrTrp: 0.642 ± 0.208
0.999TyrTyr: 0.999 ± 0.217
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (14012 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski