Amino acid dipepetide frequency for Aeromonas phage phiAS7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.879AlaAla: 12.879 ± 1.782
0.711AlaCys: 0.711 ± 0.254
5.926AlaAsp: 5.926 ± 0.674
7.98AlaGlu: 7.98 ± 0.988
3.635AlaPhe: 3.635 ± 0.384
7.664AlaGly: 7.664 ± 0.895
2.054AlaHis: 2.054 ± 0.346
5.373AlaIle: 5.373 ± 0.719
7.822AlaLys: 7.822 ± 0.792
9.561AlaLeu: 9.561 ± 0.947
2.765AlaMet: 2.765 ± 0.509
3.635AlaAsn: 3.635 ± 0.775
2.924AlaPro: 2.924 ± 0.559
5.215AlaGln: 5.215 ± 0.877
6.321AlaArg: 6.321 ± 0.599
4.899AlaSer: 4.899 ± 0.684
5.61AlaThr: 5.61 ± 0.777
7.19AlaVal: 7.19 ± 0.845
1.264AlaTrp: 1.264 ± 0.269
3.951AlaTyr: 3.951 ± 0.512
0.0AlaXaa: 0.0 ± 0.0
Cys
0.553CysAla: 0.553 ± 0.174
0.158CysCys: 0.158 ± 0.098
0.632CysAsp: 0.632 ± 0.212
0.553CysGlu: 0.553 ± 0.215
0.395CysPhe: 0.395 ± 0.194
0.553CysGly: 0.553 ± 0.206
0.316CysHis: 0.316 ± 0.135
0.553CysIle: 0.553 ± 0.221
0.316CysLys: 0.316 ± 0.156
0.79CysLeu: 0.79 ± 0.238
0.395CysMet: 0.395 ± 0.185
0.316CysAsn: 0.316 ± 0.148
0.79CysPro: 0.79 ± 0.22
0.237CysGln: 0.237 ± 0.123
0.632CysArg: 0.632 ± 0.306
0.632CysSer: 0.632 ± 0.183
0.711CysThr: 0.711 ± 0.246
0.553CysVal: 0.553 ± 0.205
0.079CysTrp: 0.079 ± 0.075
0.395CysTyr: 0.395 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
7.506AspAla: 7.506 ± 0.852
0.553AspCys: 0.553 ± 0.205
3.24AspAsp: 3.24 ± 0.52
4.583AspGlu: 4.583 ± 0.607
2.054AspPhe: 2.054 ± 0.435
4.978AspGly: 4.978 ± 0.605
1.027AspHis: 1.027 ± 0.38
2.686AspIle: 2.686 ± 0.4
3.872AspLys: 3.872 ± 0.531
4.741AspLeu: 4.741 ± 0.697
2.291AspMet: 2.291 ± 0.569
2.37AspAsn: 2.37 ± 0.405
2.212AspPro: 2.212 ± 0.421
1.264AspGln: 1.264 ± 0.346
2.449AspArg: 2.449 ± 0.615
3.714AspSer: 3.714 ± 0.485
3.714AspThr: 3.714 ± 0.645
3.556AspVal: 3.556 ± 0.597
0.711AspTrp: 0.711 ± 0.224
1.738AspTyr: 1.738 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
6.716GluAla: 6.716 ± 0.78
0.869GluCys: 0.869 ± 0.247
3.477GluAsp: 3.477 ± 0.592
5.373GluGlu: 5.373 ± 0.862
2.528GluPhe: 2.528 ± 0.305
4.267GluGly: 4.267 ± 0.45
1.659GluHis: 1.659 ± 0.369
2.449GluIle: 2.449 ± 0.432
3.951GluLys: 3.951 ± 0.56
6.242GluLeu: 6.242 ± 0.634
2.765GluMet: 2.765 ± 0.573
2.37GluAsn: 2.37 ± 0.398
2.133GluPro: 2.133 ± 0.51
6.163GluGln: 6.163 ± 0.711
4.109GluArg: 4.109 ± 0.551
2.686GluSer: 2.686 ± 0.358
2.686GluThr: 2.686 ± 0.499
4.425GluVal: 4.425 ± 0.552
1.422GluTrp: 1.422 ± 0.316
2.212GluTyr: 2.212 ± 0.316
0.0GluXaa: 0.0 ± 0.0
Phe
2.37PheAla: 2.37 ± 0.375
0.474PheCys: 0.474 ± 0.181
2.845PheAsp: 2.845 ± 0.475
1.659PheGlu: 1.659 ± 0.441
1.106PhePhe: 1.106 ± 0.336
2.37PheGly: 2.37 ± 0.518
0.632PheHis: 0.632 ± 0.178
2.054PheIle: 2.054 ± 0.426
2.449PheLys: 2.449 ± 0.417
2.449PheLeu: 2.449 ± 0.424
1.185PheMet: 1.185 ± 0.249
2.765PheAsn: 2.765 ± 0.451
1.027PhePro: 1.027 ± 0.256
1.343PheGln: 1.343 ± 0.224
2.133PheArg: 2.133 ± 0.474
1.659PheSer: 1.659 ± 0.278
1.975PheThr: 1.975 ± 0.35
2.133PheVal: 2.133 ± 0.389
0.474PheTrp: 0.474 ± 0.178
0.79PheTyr: 0.79 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
7.427GlyAla: 7.427 ± 0.982
1.106GlyCys: 1.106 ± 0.448
5.215GlyAsp: 5.215 ± 0.554
3.714GlyGlu: 3.714 ± 0.418
3.003GlyPhe: 3.003 ± 0.456
6.321GlyGly: 6.321 ± 0.818
1.58GlyHis: 1.58 ± 0.282
3.714GlyIle: 3.714 ± 0.528
5.452GlyLys: 5.452 ± 0.576
5.847GlyLeu: 5.847 ± 0.906
2.212GlyMet: 2.212 ± 0.352
3.24GlyAsn: 3.24 ± 0.402
1.896GlyPro: 1.896 ± 0.36
2.686GlyGln: 2.686 ± 0.481
4.82GlyArg: 4.82 ± 0.646
4.188GlySer: 4.188 ± 0.557
4.346GlyThr: 4.346 ± 0.633
4.267GlyVal: 4.267 ± 0.42
1.264GlyTrp: 1.264 ± 0.287
2.291GlyTyr: 2.291 ± 0.494
0.0GlyXaa: 0.0 ± 0.0
His
1.343HisAla: 1.343 ± 0.355
0.158HisCys: 0.158 ± 0.11
1.422HisAsp: 1.422 ± 0.316
1.501HisGlu: 1.501 ± 0.328
0.632HisPhe: 0.632 ± 0.198
1.817HisGly: 1.817 ± 0.38
0.632HisHis: 0.632 ± 0.248
1.501HisIle: 1.501 ± 0.407
1.027HisLys: 1.027 ± 0.264
1.896HisLeu: 1.896 ± 0.348
0.948HisMet: 0.948 ± 0.34
0.711HisAsn: 0.711 ± 0.205
1.106HisPro: 1.106 ± 0.31
0.711HisGln: 0.711 ± 0.218
1.264HisArg: 1.264 ± 0.316
0.79HisSer: 0.79 ± 0.297
1.106HisThr: 1.106 ± 0.289
1.343HisVal: 1.343 ± 0.311
0.474HisTrp: 0.474 ± 0.161
1.106HisTyr: 1.106 ± 0.326
0.0HisXaa: 0.0 ± 0.0
Ile
4.188IleAla: 4.188 ± 0.572
0.316IleCys: 0.316 ± 0.167
2.449IleAsp: 2.449 ± 0.399
3.714IleGlu: 3.714 ± 0.496
0.948IlePhe: 0.948 ± 0.384
3.082IleGly: 3.082 ± 0.508
1.264IleHis: 1.264 ± 0.287
2.37IleIle: 2.37 ± 0.588
2.924IleLys: 2.924 ± 0.429
3.556IleLeu: 3.556 ± 0.402
1.975IleMet: 1.975 ± 0.393
2.133IleAsn: 2.133 ± 0.314
1.58IlePro: 1.58 ± 0.386
1.659IleGln: 1.659 ± 0.39
3.951IleArg: 3.951 ± 0.364
2.686IleSer: 2.686 ± 0.547
2.37IleThr: 2.37 ± 0.487
2.607IleVal: 2.607 ± 0.37
0.553IleTrp: 0.553 ± 0.267
1.501IleTyr: 1.501 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
7.032LysAla: 7.032 ± 1.08
0.237LysCys: 0.237 ± 0.14
4.03LysAsp: 4.03 ± 0.547
4.109LysGlu: 4.109 ± 0.464
1.896LysPhe: 1.896 ± 0.346
4.662LysGly: 4.662 ± 0.557
1.58LysHis: 1.58 ± 0.328
1.896LysIle: 1.896 ± 0.623
3.872LysLys: 3.872 ± 0.636
5.926LysLeu: 5.926 ± 0.651
2.607LysMet: 2.607 ± 0.448
1.817LysAsn: 1.817 ± 0.366
3.24LysPro: 3.24 ± 0.357
2.686LysGln: 2.686 ± 0.486
3.635LysArg: 3.635 ± 0.507
3.082LysSer: 3.082 ± 0.483
2.765LysThr: 2.765 ± 0.407
4.267LysVal: 4.267 ± 0.554
1.185LysTrp: 1.185 ± 0.375
2.133LysTyr: 2.133 ± 0.322
0.0LysXaa: 0.0 ± 0.0
Leu
10.035LeuAla: 10.035 ± 0.969
0.948LeuCys: 0.948 ± 0.322
5.531LeuAsp: 5.531 ± 0.555
5.136LeuGlu: 5.136 ± 0.686
2.449LeuPhe: 2.449 ± 0.352
6.479LeuGly: 6.479 ± 0.661
1.422LeuHis: 1.422 ± 0.299
3.714LeuIle: 3.714 ± 0.539
5.057LeuLys: 5.057 ± 0.676
6.084LeuLeu: 6.084 ± 0.666
2.528LeuMet: 2.528 ± 0.511
2.765LeuAsn: 2.765 ± 0.277
3.714LeuPro: 3.714 ± 0.47
4.109LeuGln: 4.109 ± 0.567
4.109LeuArg: 4.109 ± 0.424
6.084LeuSer: 6.084 ± 0.855
5.531LeuThr: 5.531 ± 0.575
4.504LeuVal: 4.504 ± 0.677
1.027LeuTrp: 1.027 ± 0.23
2.133LeuTyr: 2.133 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
4.899MetAla: 4.899 ± 0.777
0.158MetCys: 0.158 ± 0.102
1.817MetAsp: 1.817 ± 0.346
2.133MetGlu: 2.133 ± 0.465
1.027MetPhe: 1.027 ± 0.261
1.738MetGly: 1.738 ± 0.357
0.948MetHis: 0.948 ± 0.226
0.711MetIle: 0.711 ± 0.188
1.975MetLys: 1.975 ± 0.437
3.082MetLeu: 3.082 ± 0.494
1.106MetMet: 1.106 ± 0.29
0.948MetAsn: 0.948 ± 0.31
1.343MetPro: 1.343 ± 0.318
1.422MetGln: 1.422 ± 0.412
2.37MetArg: 2.37 ± 0.363
2.528MetSer: 2.528 ± 0.346
1.659MetThr: 1.659 ± 0.261
2.686MetVal: 2.686 ± 0.413
0.474MetTrp: 0.474 ± 0.152
0.948MetTyr: 0.948 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
3.872AsnAla: 3.872 ± 0.506
0.474AsnCys: 0.474 ± 0.187
0.948AsnAsp: 0.948 ± 0.225
2.133AsnGlu: 2.133 ± 0.365
1.264AsnPhe: 1.264 ± 0.223
3.161AsnGly: 3.161 ± 0.382
1.027AsnHis: 1.027 ± 0.29
2.133AsnIle: 2.133 ± 0.467
2.291AsnLys: 2.291 ± 0.338
3.003AsnLeu: 3.003 ± 0.37
1.501AsnMet: 1.501 ± 0.384
0.948AsnAsn: 0.948 ± 0.254
2.528AsnPro: 2.528 ± 0.46
1.659AsnGln: 1.659 ± 0.337
2.528AsnArg: 2.528 ± 0.341
1.659AsnSer: 1.659 ± 0.283
2.449AsnThr: 2.449 ± 0.42
2.765AsnVal: 2.765 ± 0.424
0.79AsnTrp: 0.79 ± 0.21
1.343AsnTyr: 1.343 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
3.477ProAla: 3.477 ± 0.783
0.316ProCys: 0.316 ± 0.144
2.528ProAsp: 2.528 ± 0.456
3.714ProGlu: 3.714 ± 0.601
1.185ProPhe: 1.185 ± 0.266
2.37ProGly: 2.37 ± 0.469
0.474ProHis: 0.474 ± 0.171
1.422ProIle: 1.422 ± 0.229
2.449ProLys: 2.449 ± 0.39
3.477ProLeu: 3.477 ± 0.509
1.027ProMet: 1.027 ± 0.339
1.58ProAsn: 1.58 ± 0.36
1.185ProPro: 1.185 ± 0.327
1.659ProGln: 1.659 ± 0.295
1.106ProArg: 1.106 ± 0.238
3.003ProSer: 3.003 ± 0.394
1.738ProThr: 1.738 ± 0.484
3.477ProVal: 3.477 ± 0.511
0.474ProTrp: 0.474 ± 0.156
1.264ProTyr: 1.264 ± 0.367
0.0ProXaa: 0.0 ± 0.0
Gln
5.452GlnAla: 5.452 ± 0.8
0.316GlnCys: 0.316 ± 0.147
2.212GlnAsp: 2.212 ± 0.517
3.319GlnGlu: 3.319 ± 0.627
1.422GlnPhe: 1.422 ± 0.307
3.24GlnGly: 3.24 ± 0.45
1.264GlnHis: 1.264 ± 0.275
2.37GlnIle: 2.37 ± 0.386
2.37GlnLys: 2.37 ± 0.442
4.03GlnLeu: 4.03 ± 0.437
2.449GlnMet: 2.449 ± 0.442
1.817GlnAsn: 1.817 ± 0.288
1.106GlnPro: 1.106 ± 0.383
2.133GlnGln: 2.133 ± 0.668
2.845GlnArg: 2.845 ± 0.444
1.975GlnSer: 1.975 ± 0.344
1.659GlnThr: 1.659 ± 0.348
3.082GlnVal: 3.082 ± 0.439
1.027GlnTrp: 1.027 ± 0.223
1.58GlnTyr: 1.58 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
5.689ArgAla: 5.689 ± 0.605
0.553ArgCys: 0.553 ± 0.224
3.714ArgAsp: 3.714 ± 0.486
3.951ArgGlu: 3.951 ± 0.502
2.845ArgPhe: 2.845 ± 0.418
4.109ArgGly: 4.109 ± 0.456
1.343ArgHis: 1.343 ± 0.299
3.793ArgIle: 3.793 ± 0.558
3.003ArgLys: 3.003 ± 0.509
4.741ArgLeu: 4.741 ± 0.475
1.58ArgMet: 1.58 ± 0.414
2.765ArgAsn: 2.765 ± 0.41
1.343ArgPro: 1.343 ± 0.4
2.528ArgGln: 2.528 ± 0.528
2.845ArgArg: 2.845 ± 0.485
2.054ArgSer: 2.054 ± 0.305
2.686ArgThr: 2.686 ± 0.395
4.188ArgVal: 4.188 ± 0.613
1.106ArgTrp: 1.106 ± 0.378
2.528ArgTyr: 2.528 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
5.294SerAla: 5.294 ± 0.683
0.474SerCys: 0.474 ± 0.215
3.714SerAsp: 3.714 ± 0.52
3.319SerGlu: 3.319 ± 0.39
1.58SerPhe: 1.58 ± 0.334
4.267SerGly: 4.267 ± 0.602
0.711SerHis: 0.711 ± 0.206
2.37SerIle: 2.37 ± 0.373
4.267SerLys: 4.267 ± 0.624
4.109SerLeu: 4.109 ± 0.457
1.58SerMet: 1.58 ± 0.307
2.607SerAsn: 2.607 ± 0.502
2.845SerPro: 2.845 ± 0.473
2.37SerGln: 2.37 ± 0.421
3.556SerArg: 3.556 ± 0.693
2.686SerSer: 2.686 ± 0.521
2.686SerThr: 2.686 ± 0.454
3.635SerVal: 3.635 ± 0.547
0.948SerTrp: 0.948 ± 0.226
1.343SerTyr: 1.343 ± 0.29
0.0SerXaa: 0.0 ± 0.0
Thr
7.269ThrAla: 7.269 ± 0.921
0.632ThrCys: 0.632 ± 0.23
1.817ThrAsp: 1.817 ± 0.411
3.082ThrGlu: 3.082 ± 0.583
1.106ThrPhe: 1.106 ± 0.287
4.662ThrGly: 4.662 ± 0.772
1.422ThrHis: 1.422 ± 0.298
1.659ThrIle: 1.659 ± 0.412
3.003ThrLys: 3.003 ± 0.45
4.188ThrLeu: 4.188 ± 0.518
1.106ThrMet: 1.106 ± 0.246
1.501ThrAsn: 1.501 ± 0.348
2.686ThrPro: 2.686 ± 0.486
2.607ThrGln: 2.607 ± 0.399
2.291ThrArg: 2.291 ± 0.537
4.267ThrSer: 4.267 ± 0.563
1.896ThrThr: 1.896 ± 0.385
3.398ThrVal: 3.398 ± 0.443
0.711ThrTrp: 0.711 ± 0.2
1.896ThrTyr: 1.896 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
6.637ValAla: 6.637 ± 0.674
0.553ValCys: 0.553 ± 0.245
3.793ValAsp: 3.793 ± 0.494
5.61ValGlu: 5.61 ± 0.748
2.845ValPhe: 2.845 ± 0.448
5.531ValGly: 5.531 ± 0.703
0.948ValHis: 0.948 ± 0.221
3.161ValIle: 3.161 ± 0.597
3.714ValLys: 3.714 ± 0.552
4.82ValLeu: 4.82 ± 0.51
2.291ValMet: 2.291 ± 0.43
2.37ValAsn: 2.37 ± 0.388
2.686ValPro: 2.686 ± 0.482
3.398ValGln: 3.398 ± 0.444
4.109ValArg: 4.109 ± 0.737
3.082ValSer: 3.082 ± 0.432
3.556ValThr: 3.556 ± 0.655
4.504ValVal: 4.504 ± 0.691
0.632ValTrp: 0.632 ± 0.207
1.896ValTyr: 1.896 ± 0.584
0.0ValXaa: 0.0 ± 0.0
Trp
1.975TrpAla: 1.975 ± 0.342
0.079TrpCys: 0.079 ± 0.083
1.501TrpAsp: 1.501 ± 0.395
0.948TrpGlu: 0.948 ± 0.209
1.027TrpPhe: 1.027 ± 0.242
0.711TrpGly: 0.711 ± 0.268
0.316TrpHis: 0.316 ± 0.13
0.237TrpIle: 0.237 ± 0.131
0.948TrpLys: 0.948 ± 0.259
1.896TrpLeu: 1.896 ± 0.469
0.474TrpMet: 0.474 ± 0.191
0.316TrpAsn: 0.316 ± 0.169
0.553TrpPro: 0.553 ± 0.188
0.79TrpGln: 0.79 ± 0.275
0.474TrpArg: 0.474 ± 0.174
0.711TrpSer: 0.711 ± 0.181
0.711TrpThr: 0.711 ± 0.219
0.869TrpVal: 0.869 ± 0.238
0.79TrpTrp: 0.79 ± 0.185
0.474TrpTyr: 0.474 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.765TyrAla: 2.765 ± 0.605
0.474TyrCys: 0.474 ± 0.177
2.449TyrAsp: 2.449 ± 0.463
1.975TyrGlu: 1.975 ± 0.433
0.948TyrPhe: 0.948 ± 0.259
2.607TyrGly: 2.607 ± 0.539
0.869TyrHis: 0.869 ± 0.303
1.659TyrIle: 1.659 ± 0.374
1.896TyrLys: 1.896 ± 0.412
3.003TyrLeu: 3.003 ± 0.43
1.106TyrMet: 1.106 ± 0.36
1.501TyrAsn: 1.501 ± 0.321
1.027TyrPro: 1.027 ± 0.309
1.027TyrGln: 1.027 ± 0.265
1.817TyrArg: 1.817 ± 0.345
1.975TyrSer: 1.975 ± 0.376
1.343TyrThr: 1.343 ± 0.359
2.686TyrVal: 2.686 ± 0.441
0.395TyrTrp: 0.395 ± 0.162
1.027TyrTyr: 1.027 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12657 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski