Amino acid dipepetide frequency for Vibrio phage Va_90-11-286_p16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.97AlaAla: 7.97 ± 1.631
0.433AlaCys: 0.433 ± 0.155
4.678AlaAsp: 4.678 ± 0.553
5.978AlaGlu: 5.978 ± 0.628
3.032AlaPhe: 3.032 ± 0.523
4.505AlaGly: 4.505 ± 0.78
2.079AlaHis: 2.079 ± 0.458
5.025AlaIle: 5.025 ± 0.881
5.804AlaLys: 5.804 ± 0.776
9.789AlaLeu: 9.789 ± 1.232
2.426AlaMet: 2.426 ± 0.469
3.379AlaAsn: 3.379 ± 0.6
2.772AlaPro: 2.772 ± 0.417
3.292AlaGln: 3.292 ± 0.616
4.678AlaArg: 4.678 ± 0.621
5.718AlaSer: 5.718 ± 0.624
4.158AlaThr: 4.158 ± 0.656
4.765AlaVal: 4.765 ± 0.697
0.78AlaTrp: 0.78 ± 0.246
2.512AlaTyr: 2.512 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.606CysAla: 0.606 ± 0.193
0.433CysCys: 0.433 ± 0.217
0.606CysAsp: 0.606 ± 0.21
0.866CysGlu: 0.866 ± 0.285
0.52CysPhe: 0.52 ± 0.205
0.78CysGly: 0.78 ± 0.324
0.347CysHis: 0.347 ± 0.168
0.606CysIle: 0.606 ± 0.237
0.866CysLys: 0.866 ± 0.26
0.953CysLeu: 0.953 ± 0.295
0.26CysMet: 0.26 ± 0.168
0.26CysAsn: 0.26 ± 0.143
0.606CysPro: 0.606 ± 0.261
0.953CysGln: 0.953 ± 0.279
1.213CysArg: 1.213 ± 0.253
0.693CysSer: 0.693 ± 0.248
0.347CysThr: 0.347 ± 0.148
0.52CysVal: 0.52 ± 0.196
0.087CysTrp: 0.087 ± 0.072
0.433CysTyr: 0.433 ± 0.229
0.0CysXaa: 0.0 ± 0.0
Asp
4.332AspAla: 4.332 ± 0.512
0.693AspCys: 0.693 ± 0.259
2.772AspAsp: 2.772 ± 0.54
4.765AspGlu: 4.765 ± 0.569
1.993AspPhe: 1.993 ± 0.458
4.938AspGly: 4.938 ± 0.676
1.213AspHis: 1.213 ± 0.379
4.332AspIle: 4.332 ± 0.699
3.205AspLys: 3.205 ± 0.581
5.371AspLeu: 5.371 ± 0.744
1.04AspMet: 1.04 ± 0.207
1.386AspAsn: 1.386 ± 0.378
1.906AspPro: 1.906 ± 0.511
1.213AspGln: 1.213 ± 0.316
2.859AspArg: 2.859 ± 0.449
3.292AspSer: 3.292 ± 0.469
2.946AspThr: 2.946 ± 0.469
3.552AspVal: 3.552 ± 0.461
0.953AspTrp: 0.953 ± 0.264
1.733AspTyr: 1.733 ± 0.305
0.0AspXaa: 0.0 ± 0.0
Glu
5.285GluAla: 5.285 ± 0.7
0.866GluCys: 0.866 ± 0.23
2.772GluAsp: 2.772 ± 0.52
4.158GluGlu: 4.158 ± 0.677
4.158GluPhe: 4.158 ± 0.674
4.072GluGly: 4.072 ± 0.575
1.299GluHis: 1.299 ± 0.323
4.851GluIle: 4.851 ± 0.765
3.725GluLys: 3.725 ± 0.451
6.757GluLeu: 6.757 ± 1.109
2.079GluMet: 2.079 ± 0.365
2.512GluAsn: 2.512 ± 0.328
1.993GluPro: 1.993 ± 0.376
2.946GluGln: 2.946 ± 0.512
3.985GluArg: 3.985 ± 0.611
5.285GluSer: 5.285 ± 0.687
3.552GluThr: 3.552 ± 0.644
4.851GluVal: 4.851 ± 0.666
0.866GluTrp: 0.866 ± 0.317
2.512GluTyr: 2.512 ± 0.434
0.0GluXaa: 0.0 ± 0.0
Phe
2.772PheAla: 2.772 ± 0.451
0.433PheCys: 0.433 ± 0.205
2.079PheAsp: 2.079 ± 0.433
2.426PheGlu: 2.426 ± 0.537
0.953PhePhe: 0.953 ± 0.376
1.993PheGly: 1.993 ± 0.345
1.559PheHis: 1.559 ± 0.404
2.686PheIle: 2.686 ± 0.558
1.906PheLys: 1.906 ± 0.389
3.552PheLeu: 3.552 ± 0.681
0.866PheMet: 0.866 ± 0.293
2.252PheAsn: 2.252 ± 0.487
1.299PhePro: 1.299 ± 0.354
1.559PheGln: 1.559 ± 0.367
2.512PheArg: 2.512 ± 0.474
3.032PheSer: 3.032 ± 0.601
1.819PheThr: 1.819 ± 0.427
2.599PheVal: 2.599 ± 0.488
0.52PheTrp: 0.52 ± 0.189
1.559PheTyr: 1.559 ± 0.405
0.0PheXaa: 0.0 ± 0.0
Gly
5.285GlyAla: 5.285 ± 0.724
0.606GlyCys: 0.606 ± 0.267
4.678GlyAsp: 4.678 ± 0.674
4.678GlyGlu: 4.678 ± 0.541
2.686GlyPhe: 2.686 ± 0.476
4.592GlyGly: 4.592 ± 1.404
1.213GlyHis: 1.213 ± 0.284
3.205GlyIle: 3.205 ± 0.603
5.111GlyLys: 5.111 ± 0.642
5.371GlyLeu: 5.371 ± 0.891
1.559GlyMet: 1.559 ± 0.416
2.426GlyAsn: 2.426 ± 0.339
1.213GlyPro: 1.213 ± 0.424
1.473GlyGln: 1.473 ± 0.396
3.639GlyArg: 3.639 ± 0.716
4.245GlySer: 4.245 ± 0.488
2.599GlyThr: 2.599 ± 0.42
5.631GlyVal: 5.631 ± 0.765
0.866GlyTrp: 0.866 ± 0.231
1.646GlyTyr: 1.646 ± 0.369
0.0GlyXaa: 0.0 ± 0.0
His
1.473HisAla: 1.473 ± 0.318
0.173HisCys: 0.173 ± 0.113
1.213HisAsp: 1.213 ± 0.297
1.299HisGlu: 1.299 ± 0.365
0.866HisPhe: 0.866 ± 0.266
1.04HisGly: 1.04 ± 0.246
0.606HisHis: 0.606 ± 0.246
1.733HisIle: 1.733 ± 0.406
0.866HisLys: 0.866 ± 0.25
2.166HisLeu: 2.166 ± 0.428
0.26HisMet: 0.26 ± 0.126
0.693HisAsn: 0.693 ± 0.274
1.386HisPro: 1.386 ± 0.309
0.953HisGln: 0.953 ± 0.231
1.299HisArg: 1.299 ± 0.377
1.906HisSer: 1.906 ± 0.47
1.299HisThr: 1.299 ± 0.311
1.126HisVal: 1.126 ± 0.261
0.433HisTrp: 0.433 ± 0.201
0.953HisTyr: 0.953 ± 0.351
0.0HisXaa: 0.0 ± 0.0
Ile
5.718IleAla: 5.718 ± 0.681
0.693IleCys: 0.693 ± 0.271
4.332IleAsp: 4.332 ± 0.506
5.978IleGlu: 5.978 ± 0.935
1.646IlePhe: 1.646 ± 0.395
4.505IleGly: 4.505 ± 0.758
1.646IleHis: 1.646 ± 0.363
2.772IleIle: 2.772 ± 0.523
4.678IleLys: 4.678 ± 0.549
3.898IleLeu: 3.898 ± 0.567
0.953IleMet: 0.953 ± 0.271
2.339IleAsn: 2.339 ± 0.4
2.426IlePro: 2.426 ± 0.454
2.772IleGln: 2.772 ± 0.452
2.946IleArg: 2.946 ± 0.447
5.198IleSer: 5.198 ± 0.773
3.552IleThr: 3.552 ± 0.484
2.686IleVal: 2.686 ± 0.454
0.26IleTrp: 0.26 ± 0.13
1.473IleTyr: 1.473 ± 0.447
0.0IleXaa: 0.0 ± 0.0
Lys
6.238LysAla: 6.238 ± 0.771
0.26LysCys: 0.26 ± 0.123
3.379LysAsp: 3.379 ± 0.536
3.032LysGlu: 3.032 ± 0.577
2.859LysPhe: 2.859 ± 0.611
4.245LysGly: 4.245 ± 0.669
1.386LysHis: 1.386 ± 0.39
3.205LysIle: 3.205 ± 0.533
4.505LysLys: 4.505 ± 0.668
5.978LysLeu: 5.978 ± 0.904
1.733LysMet: 1.733 ± 0.352
2.859LysAsn: 2.859 ± 0.448
2.772LysPro: 2.772 ± 0.539
4.158LysGln: 4.158 ± 0.692
4.072LysArg: 4.072 ± 0.569
4.851LysSer: 4.851 ± 0.724
3.812LysThr: 3.812 ± 0.719
4.592LysVal: 4.592 ± 0.746
1.299LysTrp: 1.299 ± 0.309
1.473LysTyr: 1.473 ± 0.336
0.0LysXaa: 0.0 ± 0.0
Leu
8.143LeuAla: 8.143 ± 0.986
1.906LeuCys: 1.906 ± 0.503
4.765LeuAsp: 4.765 ± 0.644
7.104LeuGlu: 7.104 ± 0.801
2.859LeuPhe: 2.859 ± 0.486
5.111LeuGly: 5.111 ± 0.78
2.426LeuHis: 2.426 ± 0.4
5.198LeuIle: 5.198 ± 0.681
6.757LeuLys: 6.757 ± 0.75
7.624LeuLeu: 7.624 ± 0.714
2.599LeuMet: 2.599 ± 0.457
6.671LeuAsn: 6.671 ± 0.793
3.205LeuPro: 3.205 ± 0.533
4.592LeuGln: 4.592 ± 0.618
4.678LeuArg: 4.678 ± 0.582
7.191LeuSer: 7.191 ± 0.778
5.111LeuThr: 5.111 ± 0.689
6.324LeuVal: 6.324 ± 0.915
0.78LeuTrp: 0.78 ± 0.217
2.252LeuTyr: 2.252 ± 0.449
0.0LeuXaa: 0.0 ± 0.0
Met
2.772MetAla: 2.772 ± 0.445
0.087MetCys: 0.087 ± 0.076
0.953MetAsp: 0.953 ± 0.277
1.04MetGlu: 1.04 ± 0.369
0.866MetPhe: 0.866 ± 0.229
2.339MetGly: 2.339 ± 0.531
0.433MetHis: 0.433 ± 0.172
1.473MetIle: 1.473 ± 0.374
1.559MetLys: 1.559 ± 0.328
2.079MetLeu: 2.079 ± 0.38
0.606MetMet: 0.606 ± 0.191
1.126MetAsn: 1.126 ± 0.279
1.04MetPro: 1.04 ± 0.323
1.04MetGln: 1.04 ± 0.261
1.04MetArg: 1.04 ± 0.223
2.252MetSer: 2.252 ± 0.383
0.953MetThr: 0.953 ± 0.311
1.993MetVal: 1.993 ± 0.332
0.26MetTrp: 0.26 ± 0.151
0.347MetTyr: 0.347 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
3.639AsnAla: 3.639 ± 0.628
0.433AsnCys: 0.433 ± 0.144
2.166AsnAsp: 2.166 ± 0.452
3.639AsnGlu: 3.639 ± 0.423
0.693AsnPhe: 0.693 ± 0.231
2.946AsnGly: 2.946 ± 0.533
0.866AsnHis: 0.866 ± 0.362
2.512AsnIle: 2.512 ± 0.588
2.512AsnLys: 2.512 ± 0.474
4.418AsnLeu: 4.418 ± 0.666
0.693AsnMet: 0.693 ± 0.248
2.166AsnAsn: 2.166 ± 0.451
2.339AsnPro: 2.339 ± 0.437
1.473AsnGln: 1.473 ± 0.364
3.292AsnArg: 3.292 ± 0.572
2.946AsnSer: 2.946 ± 0.458
2.166AsnThr: 2.166 ± 0.43
2.772AsnVal: 2.772 ± 0.481
0.78AsnTrp: 0.78 ± 0.226
2.339AsnTyr: 2.339 ± 0.532
0.0AsnXaa: 0.0 ± 0.0
Pro
2.426ProAla: 2.426 ± 0.374
0.26ProCys: 0.26 ± 0.127
2.166ProAsp: 2.166 ± 0.386
2.426ProGlu: 2.426 ± 0.443
1.473ProPhe: 1.473 ± 0.378
1.473ProGly: 1.473 ± 0.413
0.78ProHis: 0.78 ± 0.22
1.733ProIle: 1.733 ± 0.462
2.859ProLys: 2.859 ± 0.49
2.946ProLeu: 2.946 ± 0.435
1.04ProMet: 1.04 ± 0.275
2.426ProAsn: 2.426 ± 0.448
1.126ProPro: 1.126 ± 0.287
1.646ProGln: 1.646 ± 0.36
1.733ProArg: 1.733 ± 0.444
2.339ProSer: 2.339 ± 0.354
2.686ProThr: 2.686 ± 0.533
2.252ProVal: 2.252 ± 0.394
0.433ProTrp: 0.433 ± 0.155
1.213ProTyr: 1.213 ± 0.369
0.0ProXaa: 0.0 ± 0.0
Gln
5.025GlnAla: 5.025 ± 0.657
0.606GlnCys: 0.606 ± 0.204
1.559GlnAsp: 1.559 ± 0.383
2.512GlnGlu: 2.512 ± 0.584
2.079GlnPhe: 2.079 ± 0.429
1.559GlnGly: 1.559 ± 0.358
0.433GlnHis: 0.433 ± 0.197
2.686GlnIle: 2.686 ± 0.477
2.772GlnLys: 2.772 ± 0.492
4.765GlnLeu: 4.765 ± 0.497
0.953GlnMet: 0.953 ± 0.298
2.426GlnAsn: 2.426 ± 0.401
1.819GlnPro: 1.819 ± 0.385
2.079GlnGln: 2.079 ± 0.309
2.252GlnArg: 2.252 ± 0.435
2.426GlnSer: 2.426 ± 0.463
1.906GlnThr: 1.906 ± 0.461
2.859GlnVal: 2.859 ± 0.604
0.52GlnTrp: 0.52 ± 0.201
0.953GlnTyr: 0.953 ± 0.263
0.0GlnXaa: 0.0 ± 0.0
Arg
4.072ArgAla: 4.072 ± 0.637
0.866ArgCys: 0.866 ± 0.281
2.686ArgAsp: 2.686 ± 0.447
3.465ArgGlu: 3.465 ± 0.42
2.079ArgPhe: 2.079 ± 0.479
3.205ArgGly: 3.205 ± 0.465
1.386ArgHis: 1.386 ± 0.364
4.332ArgIle: 4.332 ± 0.618
3.812ArgLys: 3.812 ± 0.488
6.584ArgLeu: 6.584 ± 0.593
1.04ArgMet: 1.04 ± 0.259
2.166ArgAsn: 2.166 ± 0.363
1.473ArgPro: 1.473 ± 0.337
2.686ArgGln: 2.686 ± 0.455
2.426ArgArg: 2.426 ± 0.504
4.158ArgSer: 4.158 ± 0.758
2.686ArgThr: 2.686 ± 0.436
3.985ArgVal: 3.985 ± 0.563
0.78ArgTrp: 0.78 ± 0.306
1.819ArgTyr: 1.819 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
5.891SerAla: 5.891 ± 0.739
1.126SerCys: 1.126 ± 0.325
4.158SerAsp: 4.158 ± 0.628
4.245SerGlu: 4.245 ± 0.657
3.292SerPhe: 3.292 ± 0.494
4.332SerGly: 4.332 ± 0.643
1.299SerHis: 1.299 ± 0.289
4.332SerIle: 4.332 ± 0.548
4.332SerLys: 4.332 ± 0.683
7.71SerLeu: 7.71 ± 0.947
2.079SerMet: 2.079 ± 0.398
2.772SerAsn: 2.772 ± 0.487
1.906SerPro: 1.906 ± 0.492
2.859SerGln: 2.859 ± 0.528
3.898SerArg: 3.898 ± 0.642
5.198SerSer: 5.198 ± 0.9
2.859SerThr: 2.859 ± 0.601
5.631SerVal: 5.631 ± 0.638
1.299SerTrp: 1.299 ± 0.321
1.906SerTyr: 1.906 ± 0.496
0.0SerXaa: 0.0 ± 0.0
Thr
5.025ThrAla: 5.025 ± 0.846
0.52ThrCys: 0.52 ± 0.236
3.032ThrAsp: 3.032 ± 0.614
3.292ThrGlu: 3.292 ± 0.567
1.213ThrPhe: 1.213 ± 0.296
3.812ThrGly: 3.812 ± 0.873
0.953ThrHis: 0.953 ± 0.299
3.205ThrIle: 3.205 ± 0.624
4.158ThrLys: 4.158 ± 0.703
4.592ThrLeu: 4.592 ± 0.409
1.04ThrMet: 1.04 ± 0.283
2.772ThrAsn: 2.772 ± 0.47
1.559ThrPro: 1.559 ± 0.338
1.819ThrGln: 1.819 ± 0.42
3.379ThrArg: 3.379 ± 0.517
2.946ThrSer: 2.946 ± 0.527
3.898ThrThr: 3.898 ± 0.629
2.686ThrVal: 2.686 ± 0.53
0.606ThrTrp: 0.606 ± 0.195
1.646ThrTyr: 1.646 ± 0.405
0.0ThrXaa: 0.0 ± 0.0
Val
3.985ValAla: 3.985 ± 0.585
1.299ValCys: 1.299 ± 0.343
3.985ValAsp: 3.985 ± 0.453
4.678ValGlu: 4.678 ± 0.781
2.946ValPhe: 2.946 ± 0.491
4.418ValGly: 4.418 ± 0.669
0.866ValHis: 0.866 ± 0.254
4.245ValIle: 4.245 ± 0.644
4.765ValLys: 4.765 ± 0.687
6.497ValLeu: 6.497 ± 0.975
1.993ValMet: 1.993 ± 0.346
2.079ValAsn: 2.079 ± 0.558
3.032ValPro: 3.032 ± 0.475
2.339ValGln: 2.339 ± 0.438
3.292ValArg: 3.292 ± 0.496
4.418ValSer: 4.418 ± 0.745
3.552ValThr: 3.552 ± 0.564
5.111ValVal: 5.111 ± 0.762
0.693ValTrp: 0.693 ± 0.276
2.599ValTyr: 2.599 ± 0.515
0.0ValXaa: 0.0 ± 0.0
Trp
0.78TrpAla: 0.78 ± 0.29
0.0TrpCys: 0.0 ± 0.0
0.78TrpAsp: 0.78 ± 0.245
0.52TrpGlu: 0.52 ± 0.215
1.126TrpPhe: 1.126 ± 0.375
0.693TrpGly: 0.693 ± 0.383
0.433TrpHis: 0.433 ± 0.182
0.606TrpIle: 0.606 ± 0.204
0.433TrpLys: 0.433 ± 0.197
1.473TrpLeu: 1.473 ± 0.364
0.26TrpMet: 0.26 ± 0.184
0.78TrpAsn: 0.78 ± 0.267
0.26TrpPro: 0.26 ± 0.215
1.126TrpGln: 1.126 ± 0.276
0.433TrpArg: 0.433 ± 0.214
0.866TrpSer: 0.866 ± 0.232
0.693TrpThr: 0.693 ± 0.283
1.04TrpVal: 1.04 ± 0.345
0.173TrpTrp: 0.173 ± 0.114
0.433TrpTyr: 0.433 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.339TyrAla: 2.339 ± 0.621
0.26TyrCys: 0.26 ± 0.15
1.733TyrAsp: 1.733 ± 0.321
2.426TyrGlu: 2.426 ± 0.543
1.04TyrPhe: 1.04 ± 0.276
2.166TyrGly: 2.166 ± 0.44
0.52TyrHis: 0.52 ± 0.229
1.733TyrIle: 1.733 ± 0.247
2.079TyrLys: 2.079 ± 0.463
2.772TyrLeu: 2.772 ± 0.462
0.693TyrMet: 0.693 ± 0.246
1.299TyrAsn: 1.299 ± 0.423
1.299TyrPro: 1.299 ± 0.327
1.299TyrGln: 1.299 ± 0.276
2.079TyrArg: 2.079 ± 0.464
2.166TyrSer: 2.166 ± 0.33
1.559TyrThr: 1.559 ± 0.394
1.819TyrVal: 1.819 ± 0.405
0.52TyrTrp: 0.52 ± 0.219
0.78TyrTyr: 0.78 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11544 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski