Amino acid dipepetide frequency for Streptomyces phage JXY1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.175AlaAla: 11.175 ± 1.308
0.648AlaCys: 0.648 ± 0.234
4.697AlaAsp: 4.697 ± 0.756
6.154AlaGlu: 6.154 ± 0.737
2.348AlaPhe: 2.348 ± 0.439
7.855AlaGly: 7.855 ± 0.812
1.539AlaHis: 1.539 ± 0.29
3.32AlaIle: 3.32 ± 0.646
5.102AlaLys: 5.102 ± 0.821
6.883AlaLeu: 6.883 ± 0.833
2.753AlaMet: 2.753 ± 0.417
3.725AlaAsn: 3.725 ± 0.463
3.563AlaPro: 3.563 ± 0.576
4.535AlaGln: 4.535 ± 1.034
4.859AlaArg: 4.859 ± 0.793
6.073AlaSer: 6.073 ± 0.685
6.802AlaThr: 6.802 ± 0.901
6.154AlaVal: 6.154 ± 0.688
2.105AlaTrp: 2.105 ± 0.366
3.887AlaTyr: 3.887 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.324CysAla: 0.324 ± 0.152
0.081CysCys: 0.081 ± 0.079
0.486CysAsp: 0.486 ± 0.217
0.486CysGlu: 0.486 ± 0.217
0.162CysPhe: 0.162 ± 0.105
0.81CysGly: 0.81 ± 0.279
0.162CysHis: 0.162 ± 0.123
0.405CysIle: 0.405 ± 0.193
0.486CysLys: 0.486 ± 0.207
0.648CysLeu: 0.648 ± 0.281
0.243CysMet: 0.243 ± 0.145
0.081CysAsn: 0.081 ± 0.081
0.81CysPro: 0.81 ± 0.273
0.243CysGln: 0.243 ± 0.135
0.567CysArg: 0.567 ± 0.164
0.486CysSer: 0.486 ± 0.206
0.405CysThr: 0.405 ± 0.202
0.891CysVal: 0.891 ± 0.299
0.243CysTrp: 0.243 ± 0.134
0.081CysTyr: 0.081 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
6.964AspAla: 6.964 ± 0.769
0.729AspCys: 0.729 ± 0.3
3.968AspAsp: 3.968 ± 0.539
3.725AspGlu: 3.725 ± 0.513
2.267AspPhe: 2.267 ± 0.553
5.264AspGly: 5.264 ± 0.616
0.891AspHis: 0.891 ± 0.248
4.13AspIle: 4.13 ± 0.493
2.996AspLys: 2.996 ± 0.51
4.697AspLeu: 4.697 ± 0.776
2.753AspMet: 2.753 ± 0.498
1.862AspAsn: 1.862 ± 0.404
4.454AspPro: 4.454 ± 0.588
2.105AspGln: 2.105 ± 0.412
3.32AspArg: 3.32 ± 0.551
3.644AspSer: 3.644 ± 0.692
3.401AspThr: 3.401 ± 0.662
4.211AspVal: 4.211 ± 0.698
1.377AspTrp: 1.377 ± 0.392
2.186AspTyr: 2.186 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
5.264GluAla: 5.264 ± 0.646
0.567GluCys: 0.567 ± 0.289
3.401GluAsp: 3.401 ± 0.543
3.401GluGlu: 3.401 ± 0.649
1.862GluPhe: 1.862 ± 0.47
4.292GluGly: 4.292 ± 0.503
0.972GluHis: 0.972 ± 0.249
3.482GluIle: 3.482 ± 0.477
2.51GluLys: 2.51 ± 0.47
5.83GluLeu: 5.83 ± 0.735
1.62GluMet: 1.62 ± 0.341
1.943GluAsn: 1.943 ± 0.343
1.62GluPro: 1.62 ± 0.365
2.915GluGln: 2.915 ± 0.577
3.077GluArg: 3.077 ± 0.528
3.482GluSer: 3.482 ± 0.594
3.482GluThr: 3.482 ± 0.602
3.725GluVal: 3.725 ± 0.568
1.053GluTrp: 1.053 ± 0.301
2.429GluTyr: 2.429 ± 0.541
0.0GluXaa: 0.0 ± 0.0
Phe
2.834PheAla: 2.834 ± 0.544
0.162PheCys: 0.162 ± 0.113
2.591PheAsp: 2.591 ± 0.402
2.105PheGlu: 2.105 ± 0.369
1.053PhePhe: 1.053 ± 0.224
2.834PheGly: 2.834 ± 0.606
0.729PheHis: 0.729 ± 0.225
1.215PheIle: 1.215 ± 0.307
1.134PheLys: 1.134 ± 0.306
2.105PheLeu: 2.105 ± 0.458
0.729PheMet: 0.729 ± 0.283
1.62PheAsn: 1.62 ± 0.332
1.62PhePro: 1.62 ± 0.401
1.782PheGln: 1.782 ± 0.354
1.862PheArg: 1.862 ± 0.462
2.267PheSer: 2.267 ± 0.385
1.701PheThr: 1.701 ± 0.281
2.024PheVal: 2.024 ± 0.408
0.243PheTrp: 0.243 ± 0.143
1.215PheTyr: 1.215 ± 0.335
0.0PheXaa: 0.0 ± 0.0
Gly
7.531GlyAla: 7.531 ± 0.819
0.891GlyCys: 0.891 ± 0.317
4.94GlyAsp: 4.94 ± 0.706
5.911GlyGlu: 5.911 ± 0.668
3.077GlyPhe: 3.077 ± 0.602
8.665GlyGly: 8.665 ± 1.233
1.458GlyHis: 1.458 ± 0.346
3.077GlyIle: 3.077 ± 0.621
5.426GlyLys: 5.426 ± 0.697
5.911GlyLeu: 5.911 ± 0.734
2.753GlyMet: 2.753 ± 0.41
3.158GlyAsn: 3.158 ± 0.532
3.077GlyPro: 3.077 ± 0.382
4.454GlyGln: 4.454 ± 0.647
4.049GlyArg: 4.049 ± 0.557
6.559GlySer: 6.559 ± 0.718
5.911GlyThr: 5.911 ± 0.857
5.426GlyVal: 5.426 ± 0.61
1.701GlyTrp: 1.701 ± 0.368
3.158GlyTyr: 3.158 ± 0.733
0.0GlyXaa: 0.0 ± 0.0
His
1.215HisAla: 1.215 ± 0.264
0.162HisCys: 0.162 ± 0.127
1.134HisAsp: 1.134 ± 0.323
0.729HisGlu: 0.729 ± 0.229
0.729HisPhe: 0.729 ± 0.218
1.539HisGly: 1.539 ± 0.354
0.405HisHis: 0.405 ± 0.205
1.134HisIle: 1.134 ± 0.335
0.324HisLys: 0.324 ± 0.168
1.134HisLeu: 1.134 ± 0.262
0.648HisMet: 0.648 ± 0.24
0.648HisAsn: 0.648 ± 0.262
0.567HisPro: 0.567 ± 0.272
0.648HisGln: 0.648 ± 0.256
1.053HisArg: 1.053 ± 0.312
1.215HisSer: 1.215 ± 0.28
1.215HisThr: 1.215 ± 0.36
1.296HisVal: 1.296 ± 0.318
0.243HisTrp: 0.243 ± 0.169
0.729HisTyr: 0.729 ± 0.246
0.0HisXaa: 0.0 ± 0.0
Ile
3.077IleAla: 3.077 ± 0.419
0.567IleCys: 0.567 ± 0.242
3.644IleAsp: 3.644 ± 0.661
2.348IleGlu: 2.348 ± 0.394
1.134IlePhe: 1.134 ± 0.346
3.158IleGly: 3.158 ± 0.388
0.972IleHis: 0.972 ± 0.281
1.943IleIle: 1.943 ± 0.363
3.32IleLys: 3.32 ± 0.72
3.239IleLeu: 3.239 ± 0.365
1.215IleMet: 1.215 ± 0.348
2.267IleAsn: 2.267 ± 0.446
1.701IlePro: 1.701 ± 0.325
1.377IleGln: 1.377 ± 0.403
2.834IleArg: 2.834 ± 0.453
2.024IleSer: 2.024 ± 0.442
3.644IleThr: 3.644 ± 0.547
2.753IleVal: 2.753 ± 0.497
0.405IleTrp: 0.405 ± 0.158
1.215IleTyr: 1.215 ± 0.437
0.0IleXaa: 0.0 ± 0.0
Lys
5.992LysAla: 5.992 ± 0.759
0.486LysCys: 0.486 ± 0.216
3.239LysAsp: 3.239 ± 0.605
2.591LysGlu: 2.591 ± 0.489
2.267LysPhe: 2.267 ± 0.382
4.211LysGly: 4.211 ± 0.751
0.972LysHis: 0.972 ± 0.239
2.186LysIle: 2.186 ± 0.368
4.13LysLys: 4.13 ± 0.743
4.859LysLeu: 4.859 ± 0.66
1.62LysMet: 1.62 ± 0.341
2.51LysAsn: 2.51 ± 0.485
2.834LysPro: 2.834 ± 0.427
1.943LysGln: 1.943 ± 0.357
2.834LysArg: 2.834 ± 0.453
2.915LysSer: 2.915 ± 0.471
2.834LysThr: 2.834 ± 0.522
4.697LysVal: 4.697 ± 0.671
1.053LysTrp: 1.053 ± 0.305
1.782LysTyr: 1.782 ± 0.393
0.0LysXaa: 0.0 ± 0.0
Leu
5.183LeuAla: 5.183 ± 0.757
0.405LeuCys: 0.405 ± 0.224
5.021LeuAsp: 5.021 ± 0.679
3.725LeuGlu: 3.725 ± 0.559
2.591LeuPhe: 2.591 ± 0.517
5.345LeuGly: 5.345 ± 0.665
1.296LeuHis: 1.296 ± 0.351
3.158LeuIle: 3.158 ± 0.506
3.968LeuLys: 3.968 ± 0.551
5.102LeuLeu: 5.102 ± 0.809
1.782LeuMet: 1.782 ± 0.375
2.51LeuAsn: 2.51 ± 0.432
3.077LeuPro: 3.077 ± 0.46
4.778LeuGln: 4.778 ± 0.702
5.264LeuArg: 5.264 ± 0.717
5.426LeuSer: 5.426 ± 0.7
3.563LeuThr: 3.563 ± 0.554
4.859LeuVal: 4.859 ± 0.617
0.81LeuTrp: 0.81 ± 0.308
2.51LeuTyr: 2.51 ± 0.459
0.0LeuXaa: 0.0 ± 0.0
Met
4.778MetAla: 4.778 ± 0.586
0.162MetCys: 0.162 ± 0.118
2.51MetAsp: 2.51 ± 0.44
1.701MetGlu: 1.701 ± 0.393
0.648MetPhe: 0.648 ± 0.226
3.563MetGly: 3.563 ± 0.536
0.243MetHis: 0.243 ± 0.153
1.134MetIle: 1.134 ± 0.283
1.296MetLys: 1.296 ± 0.469
1.215MetLeu: 1.215 ± 0.378
1.134MetMet: 1.134 ± 0.292
1.539MetAsn: 1.539 ± 0.363
1.782MetPro: 1.782 ± 0.361
0.729MetGln: 0.729 ± 0.294
1.458MetArg: 1.458 ± 0.33
2.591MetSer: 2.591 ± 0.458
1.943MetThr: 1.943 ± 0.369
2.105MetVal: 2.105 ± 0.43
0.405MetTrp: 0.405 ± 0.189
0.324MetTyr: 0.324 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
3.239AsnAla: 3.239 ± 0.602
0.405AsnCys: 0.405 ± 0.174
2.024AsnAsp: 2.024 ± 0.439
1.539AsnGlu: 1.539 ± 0.312
1.296AsnPhe: 1.296 ± 0.263
3.725AsnGly: 3.725 ± 0.683
0.648AsnHis: 0.648 ± 0.225
1.539AsnIle: 1.539 ± 0.349
2.591AsnLys: 2.591 ± 0.553
2.753AsnLeu: 2.753 ± 0.49
0.81AsnMet: 0.81 ± 0.293
0.81AsnAsn: 0.81 ± 0.248
2.672AsnPro: 2.672 ± 0.489
1.943AsnGln: 1.943 ± 0.457
1.862AsnArg: 1.862 ± 0.326
1.862AsnSer: 1.862 ± 0.401
2.834AsnThr: 2.834 ± 0.414
1.943AsnVal: 1.943 ± 0.357
0.81AsnTrp: 0.81 ± 0.246
1.377AsnTyr: 1.377 ± 0.35
0.0AsnXaa: 0.0 ± 0.0
Pro
4.859ProAla: 4.859 ± 0.487
0.405ProCys: 0.405 ± 0.187
3.077ProAsp: 3.077 ± 0.317
2.834ProGlu: 2.834 ± 0.453
1.377ProPhe: 1.377 ± 0.324
4.94ProGly: 4.94 ± 0.775
0.486ProHis: 0.486 ± 0.202
1.701ProIle: 1.701 ± 0.421
2.348ProLys: 2.348 ± 0.484
2.51ProLeu: 2.51 ± 0.397
1.377ProMet: 1.377 ± 0.431
1.458ProAsn: 1.458 ± 0.337
1.62ProPro: 1.62 ± 0.385
1.296ProGln: 1.296 ± 0.336
2.186ProArg: 2.186 ± 0.453
2.996ProSer: 2.996 ± 0.521
3.644ProThr: 3.644 ± 0.54
3.482ProVal: 3.482 ± 0.523
0.729ProTrp: 0.729 ± 0.262
1.458ProTyr: 1.458 ± 0.323
0.0ProXaa: 0.0 ± 0.0
Gln
4.94GlnAla: 4.94 ± 0.704
0.162GlnCys: 0.162 ± 0.105
2.51GlnAsp: 2.51 ± 0.399
2.591GlnGlu: 2.591 ± 0.511
1.296GlnPhe: 1.296 ± 0.359
3.158GlnGly: 3.158 ± 0.525
0.81GlnHis: 0.81 ± 0.212
2.105GlnIle: 2.105 ± 0.384
2.105GlnLys: 2.105 ± 0.41
3.644GlnLeu: 3.644 ± 0.495
1.62GlnMet: 1.62 ± 0.395
1.134GlnAsn: 1.134 ± 0.345
1.782GlnPro: 1.782 ± 0.426
2.672GlnGln: 2.672 ± 0.568
3.563GlnArg: 3.563 ± 0.725
2.51GlnSer: 2.51 ± 0.378
3.158GlnThr: 3.158 ± 0.569
2.753GlnVal: 2.753 ± 0.506
0.891GlnTrp: 0.891 ± 0.264
1.62GlnTyr: 1.62 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
5.587ArgAla: 5.587 ± 0.912
0.486ArgCys: 0.486 ± 0.199
3.887ArgAsp: 3.887 ± 0.514
3.725ArgGlu: 3.725 ± 0.577
1.862ArgPhe: 1.862 ± 0.462
4.373ArgGly: 4.373 ± 0.526
1.296ArgHis: 1.296 ± 0.286
2.186ArgIle: 2.186 ± 0.322
2.672ArgLys: 2.672 ± 0.554
4.292ArgLeu: 4.292 ± 0.679
2.267ArgMet: 2.267 ± 0.418
1.862ArgAsn: 1.862 ± 0.471
1.782ArgPro: 1.782 ± 0.357
2.591ArgGln: 2.591 ± 0.497
3.158ArgArg: 3.158 ± 0.686
4.535ArgSer: 4.535 ± 0.785
3.401ArgThr: 3.401 ± 0.562
5.102ArgVal: 5.102 ± 0.664
0.972ArgTrp: 0.972 ± 0.309
1.377ArgTyr: 1.377 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
5.749SerAla: 5.749 ± 0.778
0.486SerCys: 0.486 ± 0.231
4.535SerAsp: 4.535 ± 0.736
4.13SerGlu: 4.13 ± 0.816
1.377SerPhe: 1.377 ± 0.361
6.964SerGly: 6.964 ± 1.021
1.053SerHis: 1.053 ± 0.275
3.239SerIle: 3.239 ± 0.402
2.834SerLys: 2.834 ± 0.524
4.13SerLeu: 4.13 ± 0.462
1.701SerMet: 1.701 ± 0.35
3.239SerAsn: 3.239 ± 0.487
3.158SerPro: 3.158 ± 0.586
2.429SerGln: 2.429 ± 0.349
4.211SerArg: 4.211 ± 0.606
6.559SerSer: 6.559 ± 0.797
4.616SerThr: 4.616 ± 0.783
4.778SerVal: 4.778 ± 0.588
1.215SerTrp: 1.215 ± 0.333
2.51SerTyr: 2.51 ± 0.436
0.0SerXaa: 0.0 ± 0.0
Thr
5.183ThrAla: 5.183 ± 0.836
0.324ThrCys: 0.324 ± 0.168
4.13ThrAsp: 4.13 ± 0.493
2.591ThrGlu: 2.591 ± 0.557
2.672ThrPhe: 2.672 ± 0.413
6.316ThrGly: 6.316 ± 0.873
0.972ThrHis: 0.972 ± 0.257
3.158ThrIle: 3.158 ± 0.492
4.292ThrLys: 4.292 ± 0.726
4.697ThrLeu: 4.697 ± 0.754
2.348ThrMet: 2.348 ± 0.505
1.862ThrAsn: 1.862 ± 0.46
4.454ThrPro: 4.454 ± 0.55
3.158ThrGln: 3.158 ± 0.486
2.996ThrArg: 2.996 ± 0.438
4.94ThrSer: 4.94 ± 0.866
5.102ThrThr: 5.102 ± 0.817
4.211ThrVal: 4.211 ± 0.568
1.215ThrTrp: 1.215 ± 0.352
2.024ThrTyr: 2.024 ± 0.445
0.0ThrXaa: 0.0 ± 0.0
Val
5.83ValAla: 5.83 ± 0.672
0.648ValCys: 0.648 ± 0.215
5.911ValAsp: 5.911 ± 0.613
3.239ValGlu: 3.239 ± 0.539
2.024ValPhe: 2.024 ± 0.404
5.345ValGly: 5.345 ± 0.612
1.377ValHis: 1.377 ± 0.325
2.186ValIle: 2.186 ± 0.42
4.94ValLys: 4.94 ± 0.628
3.482ValLeu: 3.482 ± 0.482
2.267ValMet: 2.267 ± 0.383
2.51ValAsn: 2.51 ± 0.399
2.186ValPro: 2.186 ± 0.435
3.401ValGln: 3.401 ± 0.595
5.345ValArg: 5.345 ± 0.885
5.749ValSer: 5.749 ± 0.698
5.183ValThr: 5.183 ± 0.737
4.049ValVal: 4.049 ± 0.63
1.215ValTrp: 1.215 ± 0.294
2.186ValTyr: 2.186 ± 0.356
0.0ValXaa: 0.0 ± 0.0
Trp
1.458TrpAla: 1.458 ± 0.305
0.081TrpCys: 0.081 ± 0.087
1.296TrpAsp: 1.296 ± 0.325
1.134TrpGlu: 1.134 ± 0.331
0.81TrpPhe: 0.81 ± 0.268
1.053TrpGly: 1.053 ± 0.281
0.162TrpHis: 0.162 ± 0.124
0.567TrpIle: 0.567 ± 0.223
1.377TrpLys: 1.377 ± 0.314
1.215TrpLeu: 1.215 ± 0.314
0.567TrpMet: 0.567 ± 0.236
0.405TrpAsn: 0.405 ± 0.168
0.486TrpPro: 0.486 ± 0.205
0.729TrpGln: 0.729 ± 0.243
1.458TrpArg: 1.458 ± 0.34
0.81TrpSer: 0.81 ± 0.27
1.458TrpThr: 1.458 ± 0.368
1.377TrpVal: 1.377 ± 0.335
0.324TrpTrp: 0.324 ± 0.191
0.81TrpTyr: 0.81 ± 0.312
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.077TyrAla: 3.077 ± 0.508
0.243TyrCys: 0.243 ± 0.129
1.943TyrAsp: 1.943 ± 0.458
2.267TyrGlu: 2.267 ± 0.438
0.972TyrPhe: 0.972 ± 0.259
3.725TyrGly: 3.725 ± 0.475
0.324TyrHis: 0.324 ± 0.164
0.972TyrIle: 0.972 ± 0.292
2.105TyrLys: 2.105 ± 0.35
2.024TyrLeu: 2.024 ± 0.271
0.972TyrMet: 0.972 ± 0.318
1.539TyrAsn: 1.539 ± 0.305
1.539TyrPro: 1.539 ± 0.373
1.296TyrGln: 1.296 ± 0.307
1.458TyrArg: 1.458 ± 0.287
2.267TyrSer: 2.267 ± 0.406
2.348TyrThr: 2.348 ± 0.477
3.239TyrVal: 3.239 ± 0.542
0.567TyrTrp: 0.567 ± 0.218
0.729TyrTyr: 0.729 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (12350 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski