Amino acid dipepetide frequency for Yersinia phage phiYeO3-12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.681AlaAla: 10.681 ± 0.934
0.932AlaCys: 0.932 ± 0.287
4.946AlaAsp: 4.946 ± 0.679
5.95AlaGlu: 5.95 ± 0.773
3.297AlaPhe: 3.297 ± 0.586
7.67AlaGly: 7.67 ± 0.846
1.434AlaHis: 1.434 ± 0.274
5.52AlaIle: 5.52 ± 0.474
6.022AlaLys: 6.022 ± 0.593
7.885AlaLeu: 7.885 ± 0.683
3.082AlaMet: 3.082 ± 0.484
4.659AlaAsn: 4.659 ± 0.525
2.366AlaPro: 2.366 ± 0.368
3.871AlaGln: 3.871 ± 0.65
5.233AlaArg: 5.233 ± 0.682
5.878AlaSer: 5.878 ± 0.804
4.373AlaThr: 4.373 ± 0.613
6.093AlaVal: 6.093 ± 0.535
1.72AlaTrp: 1.72 ± 0.328
1.935AlaTyr: 1.935 ± 0.333
0.0AlaXaa: 0.0 ± 0.0
Cys
0.932CysAla: 0.932 ± 0.302
0.143CysCys: 0.143 ± 0.117
0.789CysAsp: 0.789 ± 0.224
0.358CysGlu: 0.358 ± 0.167
0.573CysPhe: 0.573 ± 0.271
0.717CysGly: 0.717 ± 0.253
0.645CysHis: 0.645 ± 0.2
0.717CysIle: 0.717 ± 0.258
0.789CysLys: 0.789 ± 0.229
1.004CysLeu: 1.004 ± 0.323
0.143CysMet: 0.143 ± 0.096
0.287CysAsn: 0.287 ± 0.144
0.43CysPro: 0.43 ± 0.188
0.789CysGln: 0.789 ± 0.247
0.789CysArg: 0.789 ± 0.289
0.932CysSer: 0.932 ± 0.407
0.215CysThr: 0.215 ± 0.134
0.86CysVal: 0.86 ± 0.304
0.215CysTrp: 0.215 ± 0.128
0.573CysTyr: 0.573 ± 0.251
0.0CysXaa: 0.0 ± 0.0
Asp
6.237AspAla: 6.237 ± 0.627
0.573AspCys: 0.573 ± 0.231
3.943AspAsp: 3.943 ± 0.479
4.373AspGlu: 4.373 ± 0.661
2.509AspPhe: 2.509 ± 0.453
6.237AspGly: 6.237 ± 0.665
1.362AspHis: 1.362 ± 0.265
2.581AspIle: 2.581 ± 0.432
4.444AspLys: 4.444 ± 0.627
3.871AspLeu: 3.871 ± 0.646
2.294AspMet: 2.294 ± 0.434
2.796AspAsn: 2.796 ± 0.354
2.652AspPro: 2.652 ± 0.364
2.007AspGln: 2.007 ± 0.54
2.939AspArg: 2.939 ± 0.512
2.509AspSer: 2.509 ± 0.407
3.513AspThr: 3.513 ± 0.426
3.728AspVal: 3.728 ± 0.372
0.717AspTrp: 0.717 ± 0.245
1.792AspTyr: 1.792 ± 0.342
0.0AspXaa: 0.0 ± 0.0
Glu
7.24GluAla: 7.24 ± 0.853
0.573GluCys: 0.573 ± 0.185
4.588GluAsp: 4.588 ± 0.553
4.803GluGlu: 4.803 ± 0.769
2.652GluPhe: 2.652 ± 0.412
4.731GluGly: 4.731 ± 0.566
1.864GluHis: 1.864 ± 0.445
2.867GluIle: 2.867 ± 0.435
4.014GluLys: 4.014 ± 0.707
5.878GluLeu: 5.878 ± 0.578
2.509GluMet: 2.509 ± 0.527
2.366GluAsn: 2.366 ± 0.398
1.577GluPro: 1.577 ± 0.383
3.799GluGln: 3.799 ± 0.658
4.014GluArg: 4.014 ± 0.422
4.086GluSer: 4.086 ± 0.466
2.939GluThr: 2.939 ± 0.552
4.444GluVal: 4.444 ± 0.548
1.147GluTrp: 1.147 ± 0.289
3.154GluTyr: 3.154 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
2.724PheAla: 2.724 ± 0.364
0.43PheCys: 0.43 ± 0.178
2.724PheAsp: 2.724 ± 0.421
1.72PheGlu: 1.72 ± 0.266
0.86PhePhe: 0.86 ± 0.257
2.867PheGly: 2.867 ± 0.467
0.717PheHis: 0.717 ± 0.295
2.007PheIle: 2.007 ± 0.443
3.297PheLys: 3.297 ± 0.64
3.082PheLeu: 3.082 ± 0.502
0.932PheMet: 0.932 ± 0.241
1.577PheAsn: 1.577 ± 0.321
1.505PhePro: 1.505 ± 0.35
1.434PheGln: 1.434 ± 0.315
1.649PheArg: 1.649 ± 0.284
1.72PheSer: 1.72 ± 0.326
2.724PheThr: 2.724 ± 0.471
1.864PheVal: 1.864 ± 0.306
0.287PheTrp: 0.287 ± 0.124
1.004PheTyr: 1.004 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
7.097GlyAla: 7.097 ± 0.881
1.434GlyCys: 1.434 ± 0.464
4.946GlyAsp: 4.946 ± 0.632
5.95GlyGlu: 5.95 ± 0.502
2.939GlyPhe: 2.939 ± 0.353
5.448GlyGly: 5.448 ± 0.974
1.147GlyHis: 1.147 ± 0.283
4.373GlyIle: 4.373 ± 0.664
5.95GlyLys: 5.95 ± 0.706
7.168GlyLeu: 7.168 ± 0.742
2.222GlyMet: 2.222 ± 0.331
3.154GlyAsn: 3.154 ± 0.635
0.932GlyPro: 0.932 ± 0.256
2.796GlyGln: 2.796 ± 0.377
4.875GlyArg: 4.875 ± 0.488
4.086GlySer: 4.086 ± 0.626
3.082GlyThr: 3.082 ± 0.748
4.444GlyVal: 4.444 ± 0.529
1.792GlyTrp: 1.792 ± 0.422
2.652GlyTyr: 2.652 ± 0.46
0.0GlyXaa: 0.0 ± 0.0
His
1.29HisAla: 1.29 ± 0.295
0.43HisCys: 0.43 ± 0.191
0.645HisAsp: 0.645 ± 0.253
1.219HisGlu: 1.219 ± 0.358
0.86HisPhe: 0.86 ± 0.22
1.792HisGly: 1.792 ± 0.28
0.43HisHis: 0.43 ± 0.163
1.72HisIle: 1.72 ± 0.331
1.29HisLys: 1.29 ± 0.274
2.294HisLeu: 2.294 ± 0.427
0.789HisMet: 0.789 ± 0.221
0.573HisAsn: 0.573 ± 0.171
0.645HisPro: 0.645 ± 0.232
0.072HisGln: 0.072 ± 0.073
1.004HisArg: 1.004 ± 0.276
1.505HisSer: 1.505 ± 0.422
1.004HisThr: 1.004 ± 0.214
1.362HisVal: 1.362 ± 0.292
0.502HisTrp: 0.502 ± 0.176
0.717HisTyr: 0.717 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
4.229IleAla: 4.229 ± 0.507
0.573IleCys: 0.573 ± 0.161
3.369IleAsp: 3.369 ± 0.484
4.014IleGlu: 4.014 ± 0.494
1.219IlePhe: 1.219 ± 0.279
3.513IleGly: 3.513 ± 0.51
1.434IleHis: 1.434 ± 0.327
2.796IleIle: 2.796 ± 0.35
3.799IleLys: 3.799 ± 0.559
3.943IleLeu: 3.943 ± 0.481
1.004IleMet: 1.004 ± 0.325
2.509IleAsn: 2.509 ± 0.608
2.509IlePro: 2.509 ± 0.408
2.079IleGln: 2.079 ± 0.395
3.011IleArg: 3.011 ± 0.466
2.796IleSer: 2.796 ± 0.44
2.796IleThr: 2.796 ± 0.452
2.939IleVal: 2.939 ± 0.443
0.645IleTrp: 0.645 ± 0.22
1.792IleTyr: 1.792 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
7.455LysAla: 7.455 ± 0.958
0.645LysCys: 0.645 ± 0.223
3.441LysAsp: 3.441 ± 0.434
4.803LysGlu: 4.803 ± 0.501
2.151LysPhe: 2.151 ± 0.392
4.875LysGly: 4.875 ± 0.563
1.864LysHis: 1.864 ± 0.384
1.72LysIle: 1.72 ± 0.312
4.588LysLys: 4.588 ± 0.924
5.233LysLeu: 5.233 ± 0.726
1.649LysMet: 1.649 ± 0.31
2.366LysAsn: 2.366 ± 0.328
3.011LysPro: 3.011 ± 0.609
2.939LysGln: 2.939 ± 0.584
4.516LysArg: 4.516 ± 0.6
3.297LysSer: 3.297 ± 0.566
3.226LysThr: 3.226 ± 0.43
6.093LysVal: 6.093 ± 0.601
0.717LysTrp: 0.717 ± 0.262
1.864LysTyr: 1.864 ± 0.259
0.0LysXaa: 0.0 ± 0.0
Leu
8.1LeuAla: 8.1 ± 1.289
0.502LeuCys: 0.502 ± 0.195
4.588LeuAsp: 4.588 ± 0.453
5.878LeuGlu: 5.878 ± 0.775
2.939LeuPhe: 2.939 ± 0.496
4.229LeuGly: 4.229 ± 0.489
1.075LeuHis: 1.075 ± 0.356
4.158LeuIle: 4.158 ± 0.497
5.95LeuLys: 5.95 ± 0.815
5.09LeuLeu: 5.09 ± 0.592
2.437LeuMet: 2.437 ± 0.322
4.014LeuAsn: 4.014 ± 0.521
3.584LeuPro: 3.584 ± 0.488
3.441LeuGln: 3.441 ± 0.543
5.95LeuArg: 5.95 ± 0.593
4.731LeuSer: 4.731 ± 0.689
5.663LeuThr: 5.663 ± 0.808
4.875LeuVal: 4.875 ± 0.572
1.577LeuTrp: 1.577 ± 0.431
2.509LeuTyr: 2.509 ± 0.459
0.0LeuXaa: 0.0 ± 0.0
Met
2.796MetAla: 2.796 ± 0.345
0.215MetCys: 0.215 ± 0.124
1.649MetAsp: 1.649 ± 0.472
1.792MetGlu: 1.792 ± 0.502
0.86MetPhe: 0.86 ± 0.228
2.581MetGly: 2.581 ± 0.574
0.215MetHis: 0.215 ± 0.126
1.505MetIle: 1.505 ± 0.228
0.932MetLys: 0.932 ± 0.22
3.369MetLeu: 3.369 ± 0.479
1.004MetMet: 1.004 ± 0.268
1.29MetAsn: 1.29 ± 0.257
1.649MetPro: 1.649 ± 0.312
1.219MetGln: 1.219 ± 0.381
1.362MetArg: 1.362 ± 0.285
1.29MetSer: 1.29 ± 0.278
2.294MetThr: 2.294 ± 0.412
2.079MetVal: 2.079 ± 0.391
0.072MetTrp: 0.072 ± 0.07
0.573MetTyr: 0.573 ± 0.185
0.0MetXaa: 0.0 ± 0.0
Asn
3.584AsnAla: 3.584 ± 0.387
0.43AsnCys: 0.43 ± 0.228
2.796AsnAsp: 2.796 ± 0.483
2.652AsnGlu: 2.652 ± 0.374
1.434AsnPhe: 1.434 ± 0.311
3.656AsnGly: 3.656 ± 0.492
0.573AsnHis: 0.573 ± 0.198
2.939AsnIle: 2.939 ± 0.601
2.222AsnLys: 2.222 ± 0.27
3.154AsnLeu: 3.154 ± 0.552
0.86AsnMet: 0.86 ± 0.217
2.079AsnAsn: 2.079 ± 0.459
2.939AsnPro: 2.939 ± 0.463
1.792AsnGln: 1.792 ± 0.321
2.939AsnArg: 2.939 ± 0.572
2.222AsnSer: 2.222 ± 0.503
1.792AsnThr: 1.792 ± 0.332
2.294AsnVal: 2.294 ± 0.503
0.645AsnTrp: 0.645 ± 0.205
2.222AsnTyr: 2.222 ± 0.607
0.0AsnXaa: 0.0 ± 0.0
Pro
2.724ProAla: 2.724 ± 0.397
0.502ProCys: 0.502 ± 0.204
2.724ProAsp: 2.724 ± 0.447
3.297ProGlu: 3.297 ± 0.683
1.147ProPhe: 1.147 ± 0.301
2.007ProGly: 2.007 ± 0.317
0.789ProHis: 0.789 ± 0.204
1.29ProIle: 1.29 ± 0.347
2.724ProLys: 2.724 ± 0.494
2.581ProLeu: 2.581 ± 0.542
1.147ProMet: 1.147 ± 0.264
2.366ProAsn: 2.366 ± 0.487
0.86ProPro: 0.86 ± 0.304
1.075ProGln: 1.075 ± 0.274
2.007ProArg: 2.007 ± 0.39
2.222ProSer: 2.222 ± 0.344
1.935ProThr: 1.935 ± 0.387
2.437ProVal: 2.437 ± 0.427
0.789ProTrp: 0.789 ± 0.159
1.362ProTyr: 1.362 ± 0.337
0.0ProXaa: 0.0 ± 0.0
Gln
4.373GlnAla: 4.373 ± 0.885
0.502GlnCys: 0.502 ± 0.231
2.222GlnAsp: 2.222 ± 0.471
3.369GlnGlu: 3.369 ± 0.461
1.434GlnPhe: 1.434 ± 0.348
2.939GlnGly: 2.939 ± 0.31
0.645GlnHis: 0.645 ± 0.154
2.079GlnIle: 2.079 ± 0.36
2.867GlnLys: 2.867 ± 0.497
3.154GlnLeu: 3.154 ± 0.42
1.004GlnMet: 1.004 ± 0.254
1.505GlnAsn: 1.505 ± 0.353
1.434GlnPro: 1.434 ± 0.397
1.72GlnGln: 1.72 ± 0.321
1.792GlnArg: 1.792 ± 0.366
2.509GlnSer: 2.509 ± 0.429
1.935GlnThr: 1.935 ± 0.374
2.294GlnVal: 2.294 ± 0.402
0.86GlnTrp: 0.86 ± 0.233
0.86GlnTyr: 0.86 ± 0.327
0.0GlnXaa: 0.0 ± 0.0
Arg
5.591ArgAla: 5.591 ± 0.723
1.29ArgCys: 1.29 ± 0.327
3.728ArgAsp: 3.728 ± 0.571
4.229ArgGlu: 4.229 ± 0.607
3.154ArgPhe: 3.154 ± 0.492
3.799ArgGly: 3.799 ± 0.463
1.434ArgHis: 1.434 ± 0.287
2.509ArgIle: 2.509 ± 0.523
3.799ArgLys: 3.799 ± 0.497
5.52ArgLeu: 5.52 ± 0.574
1.649ArgMet: 1.649 ± 0.36
2.724ArgAsn: 2.724 ± 0.43
1.362ArgPro: 1.362 ± 0.305
2.079ArgGln: 2.079 ± 0.368
3.226ArgArg: 3.226 ± 0.424
4.731ArgSer: 4.731 ± 0.698
3.369ArgThr: 3.369 ± 0.525
3.441ArgVal: 3.441 ± 0.433
0.932ArgTrp: 0.932 ± 0.314
1.075ArgTyr: 1.075 ± 0.223
0.0ArgXaa: 0.0 ± 0.0
Ser
5.591SerAla: 5.591 ± 0.727
0.789SerCys: 0.789 ± 0.237
5.09SerAsp: 5.09 ± 0.604
3.656SerGlu: 3.656 ± 0.639
2.222SerPhe: 2.222 ± 0.339
5.376SerGly: 5.376 ± 0.624
1.147SerHis: 1.147 ± 0.262
3.082SerIle: 3.082 ± 0.672
3.082SerLys: 3.082 ± 0.536
3.656SerLeu: 3.656 ± 0.426
1.147SerMet: 1.147 ± 0.31
1.792SerAsn: 1.792 ± 0.32
2.079SerPro: 2.079 ± 0.539
1.935SerGln: 1.935 ± 0.356
3.441SerArg: 3.441 ± 0.461
4.229SerSer: 4.229 ± 0.598
3.441SerThr: 3.441 ± 0.422
3.799SerVal: 3.799 ± 0.461
0.645SerTrp: 0.645 ± 0.151
2.581SerTyr: 2.581 ± 0.518
0.0SerXaa: 0.0 ± 0.0
Thr
4.444ThrAla: 4.444 ± 0.869
0.86ThrCys: 0.86 ± 0.279
2.294ThrAsp: 2.294 ± 0.385
3.799ThrGlu: 3.799 ± 0.472
1.935ThrPhe: 1.935 ± 0.318
5.663ThrGly: 5.663 ± 0.637
1.649ThrHis: 1.649 ± 0.256
3.226ThrIle: 3.226 ± 0.482
3.728ThrLys: 3.728 ± 0.426
4.946ThrLeu: 4.946 ± 0.584
1.792ThrMet: 1.792 ± 0.376
1.72ThrAsn: 1.72 ± 0.422
2.796ThrPro: 2.796 ± 0.46
1.72ThrGln: 1.72 ± 0.279
2.294ThrArg: 2.294 ± 0.301
3.369ThrSer: 3.369 ± 0.513
2.939ThrThr: 2.939 ± 0.511
3.871ThrVal: 3.871 ± 0.524
0.287ThrTrp: 0.287 ± 0.117
1.792ThrTyr: 1.792 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
4.373ValAla: 4.373 ± 0.555
0.573ValCys: 0.573 ± 0.207
3.584ValAsp: 3.584 ± 0.499
4.444ValGlu: 4.444 ± 0.615
1.649ValPhe: 1.649 ± 0.378
5.018ValGly: 5.018 ± 0.691
0.932ValHis: 0.932 ± 0.219
3.154ValIle: 3.154 ± 0.403
4.158ValLys: 4.158 ± 0.532
5.52ValLeu: 5.52 ± 0.696
1.434ValMet: 1.434 ± 0.338
2.867ValAsn: 2.867 ± 0.376
2.437ValPro: 2.437 ± 0.363
2.509ValGln: 2.509 ± 0.505
5.448ValArg: 5.448 ± 0.613
3.943ValSer: 3.943 ± 0.557
4.946ValThr: 4.946 ± 0.586
5.591ValVal: 5.591 ± 0.592
1.147ValTrp: 1.147 ± 0.332
2.007ValTyr: 2.007 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.932TrpAla: 0.932 ± 0.218
0.287TrpCys: 0.287 ± 0.152
0.573TrpAsp: 0.573 ± 0.202
0.932TrpGlu: 0.932 ± 0.243
0.358TrpPhe: 0.358 ± 0.149
1.075TrpGly: 1.075 ± 0.28
0.215TrpHis: 0.215 ± 0.117
0.86TrpIle: 0.86 ± 0.399
1.004TrpLys: 1.004 ± 0.317
1.649TrpLeu: 1.649 ± 0.383
0.645TrpMet: 0.645 ± 0.209
1.362TrpAsn: 1.362 ± 0.3
0.358TrpPro: 0.358 ± 0.176
0.645TrpGln: 0.645 ± 0.214
0.932TrpArg: 0.932 ± 0.26
1.004TrpSer: 1.004 ± 0.329
0.789TrpThr: 0.789 ± 0.26
1.362TrpVal: 1.362 ± 0.353
0.287TrpTrp: 0.287 ± 0.141
0.287TrpTyr: 0.287 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.154TyrAla: 3.154 ± 0.436
0.215TyrCys: 0.215 ± 0.11
2.437TyrAsp: 2.437 ± 0.616
2.007TyrGlu: 2.007 ± 0.442
0.932TyrPhe: 0.932 ± 0.31
2.294TyrGly: 2.294 ± 0.492
0.573TyrHis: 0.573 ± 0.181
1.864TyrIle: 1.864 ± 0.505
1.792TyrLys: 1.792 ± 0.319
2.294TyrLeu: 2.294 ± 0.398
0.932TyrMet: 0.932 ± 0.232
1.219TyrAsn: 1.219 ± 0.235
0.86TyrPro: 0.86 ± 0.202
1.649TyrGln: 1.649 ± 0.397
2.222TyrArg: 2.222 ± 0.361
1.72TyrSer: 1.72 ± 0.36
2.151TyrThr: 2.151 ± 0.439
1.864TyrVal: 1.864 ± 0.372
0.573TyrTrp: 0.573 ± 0.221
0.717TyrTyr: 0.717 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13951 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski