Amino acid dipepetide frequency for Streptococcus phage Javan405

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.652AlaAla: 3.652 ± 0.839
0.934AlaCys: 0.934 ± 0.274
3.652AlaAsp: 3.652 ± 0.556
4.586AlaGlu: 4.586 ± 0.72
2.548AlaPhe: 2.548 ± 0.432
4.332AlaGly: 4.332 ± 0.868
0.595AlaHis: 0.595 ± 0.212
6.115AlaIle: 6.115 ± 1.002
5.011AlaLys: 5.011 ± 0.547
6.54AlaLeu: 6.54 ± 1.048
1.953AlaMet: 1.953 ± 0.414
2.633AlaAsn: 2.633 ± 0.334
1.274AlaPro: 1.274 ± 0.311
2.208AlaGln: 2.208 ± 0.431
2.888AlaArg: 2.888 ± 0.434
4.332AlaSer: 4.332 ± 0.546
4.756AlaThr: 4.756 ± 0.766
4.332AlaVal: 4.332 ± 0.651
0.764AlaTrp: 0.764 ± 0.199
3.227AlaTyr: 3.227 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
0.425CysAla: 0.425 ± 0.201
0.17CysCys: 0.17 ± 0.119
0.425CysAsp: 0.425 ± 0.245
0.51CysGlu: 0.51 ± 0.197
0.255CysPhe: 0.255 ± 0.134
0.51CysGly: 0.51 ± 0.195
0.085CysHis: 0.085 ± 0.096
0.425CysIle: 0.425 ± 0.175
0.17CysLys: 0.17 ± 0.136
0.595CysLeu: 0.595 ± 0.202
0.085CysMet: 0.085 ± 0.089
0.255CysAsn: 0.255 ± 0.143
0.255CysPro: 0.255 ± 0.135
0.764CysGln: 0.764 ± 0.286
0.764CysArg: 0.764 ± 0.176
0.51CysSer: 0.51 ± 0.219
0.255CysThr: 0.255 ± 0.136
0.764CysVal: 0.764 ± 0.248
0.085CysTrp: 0.085 ± 0.093
0.595CysTyr: 0.595 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
3.482AspAla: 3.482 ± 0.593
0.34AspCys: 0.34 ± 0.208
3.143AspAsp: 3.143 ± 0.639
4.926AspGlu: 4.926 ± 0.539
3.143AspPhe: 3.143 ± 0.507
4.586AspGly: 4.586 ± 0.758
0.934AspHis: 0.934 ± 0.242
3.567AspIle: 3.567 ± 0.739
4.162AspLys: 4.162 ± 0.55
6.115AspLeu: 6.115 ± 0.919
2.123AspMet: 2.123 ± 0.432
2.208AspAsn: 2.208 ± 0.482
1.614AspPro: 1.614 ± 0.459
2.123AspGln: 2.123 ± 0.465
2.803AspArg: 2.803 ± 0.607
4.841AspSer: 4.841 ± 0.592
2.973AspThr: 2.973 ± 0.463
3.822AspVal: 3.822 ± 0.638
1.019AspTrp: 1.019 ± 0.27
2.038AspTyr: 2.038 ± 0.343
0.0AspXaa: 0.0 ± 0.0
Glu
5.011GluAla: 5.011 ± 0.769
0.34GluCys: 0.34 ± 0.149
4.247GluAsp: 4.247 ± 0.539
5.86GluGlu: 5.86 ± 0.891
2.548GluPhe: 2.548 ± 0.463
5.096GluGly: 5.096 ± 0.61
1.529GluHis: 1.529 ± 0.418
4.077GluIle: 4.077 ± 0.637
6.03GluLys: 6.03 ± 0.692
7.899GluLeu: 7.899 ± 0.762
1.953GluMet: 1.953 ± 0.43
3.822GluAsn: 3.822 ± 0.673
1.444GluPro: 1.444 ± 0.388
3.058GluGln: 3.058 ± 0.474
3.143GluArg: 3.143 ± 0.628
3.227GluSer: 3.227 ± 0.474
4.162GluThr: 4.162 ± 0.579
5.181GluVal: 5.181 ± 0.643
0.51GluTrp: 0.51 ± 0.221
1.274GluTyr: 1.274 ± 0.357
0.0GluXaa: 0.0 ± 0.0
Phe
1.444PheAla: 1.444 ± 0.417
0.51PheCys: 0.51 ± 0.189
3.312PheAsp: 3.312 ± 0.615
3.143PheGlu: 3.143 ± 0.419
1.529PhePhe: 1.529 ± 0.336
3.397PheGly: 3.397 ± 0.444
0.764PheHis: 0.764 ± 0.223
2.378PheIle: 2.378 ± 0.551
3.143PheLys: 3.143 ± 0.645
3.397PheLeu: 3.397 ± 0.727
1.019PheMet: 1.019 ± 0.25
2.123PheAsn: 2.123 ± 0.363
0.595PhePro: 0.595 ± 0.286
1.444PheGln: 1.444 ± 0.336
1.444PheArg: 1.444 ± 0.24
3.143PheSer: 3.143 ± 0.567
1.444PheThr: 1.444 ± 0.331
1.699PheVal: 1.699 ± 0.45
0.595PheTrp: 0.595 ± 0.212
1.529PheTyr: 1.529 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
3.143GlyAla: 3.143 ± 0.44
0.51GlyCys: 0.51 ± 0.212
4.162GlyAsp: 4.162 ± 0.611
3.312GlyGlu: 3.312 ± 0.632
2.293GlyPhe: 2.293 ± 0.296
4.926GlyGly: 4.926 ± 0.949
1.529GlyHis: 1.529 ± 0.456
5.606GlyIle: 5.606 ± 0.688
5.436GlyLys: 5.436 ± 0.581
5.86GlyLeu: 5.86 ± 0.78
1.529GlyMet: 1.529 ± 0.36
3.737GlyAsn: 3.737 ± 0.548
1.104GlyPro: 1.104 ± 0.278
3.227GlyGln: 3.227 ± 0.661
4.077GlyArg: 4.077 ± 0.543
5.096GlySer: 5.096 ± 1.139
4.586GlyThr: 4.586 ± 0.734
4.756GlyVal: 4.756 ± 0.641
0.849GlyTrp: 0.849 ± 0.226
2.633GlyTyr: 2.633 ± 0.419
0.0GlyXaa: 0.0 ± 0.0
His
1.529HisAla: 1.529 ± 0.424
0.085HisCys: 0.085 ± 0.081
1.359HisAsp: 1.359 ± 0.375
0.679HisGlu: 0.679 ± 0.279
0.679HisPhe: 0.679 ± 0.25
1.274HisGly: 1.274 ± 0.349
0.764HisHis: 0.764 ± 0.233
1.699HisIle: 1.699 ± 0.328
1.019HisLys: 1.019 ± 0.304
2.378HisLeu: 2.378 ± 0.405
0.255HisMet: 0.255 ± 0.165
1.104HisAsn: 1.104 ± 0.294
1.274HisPro: 1.274 ± 0.475
1.189HisGln: 1.189 ± 0.306
1.104HisArg: 1.104 ± 0.37
1.359HisSer: 1.359 ± 0.322
0.849HisThr: 0.849 ± 0.377
0.934HisVal: 0.934 ± 0.294
0.085HisTrp: 0.085 ± 0.07
0.679HisTyr: 0.679 ± 0.309
0.0HisXaa: 0.0 ± 0.0
Ile
4.756IleAla: 4.756 ± 0.468
0.51IleCys: 0.51 ± 0.211
5.945IleAsp: 5.945 ± 0.693
3.822IleGlu: 3.822 ± 0.546
2.208IlePhe: 2.208 ± 0.582
4.077IleGly: 4.077 ± 0.643
1.189IleHis: 1.189 ± 0.355
3.397IleIle: 3.397 ± 0.523
4.926IleLys: 4.926 ± 0.756
5.011IleLeu: 5.011 ± 0.522
0.764IleMet: 0.764 ± 0.259
2.463IleAsn: 2.463 ± 0.419
2.463IlePro: 2.463 ± 0.439
2.463IleGln: 2.463 ± 0.469
2.463IleArg: 2.463 ± 0.479
5.436IleSer: 5.436 ± 1.073
3.822IleThr: 3.822 ± 0.683
3.567IleVal: 3.567 ± 0.723
1.104IleTrp: 1.104 ± 0.414
2.973IleTyr: 2.973 ± 0.627
0.0IleXaa: 0.0 ± 0.0
Lys
5.86LysAla: 5.86 ± 0.788
0.34LysCys: 0.34 ± 0.162
4.077LysAsp: 4.077 ± 0.737
5.521LysGlu: 5.521 ± 0.63
2.378LysPhe: 2.378 ± 0.555
4.077LysGly: 4.077 ± 0.632
2.123LysHis: 2.123 ± 0.402
4.586LysIle: 4.586 ± 0.793
5.521LysLys: 5.521 ± 0.677
5.775LysLeu: 5.775 ± 0.665
1.614LysMet: 1.614 ± 0.35
2.633LysAsn: 2.633 ± 0.56
2.293LysPro: 2.293 ± 0.404
4.332LysGln: 4.332 ± 0.891
4.077LysArg: 4.077 ± 0.771
4.162LysSer: 4.162 ± 0.616
4.756LysThr: 4.756 ± 0.696
4.671LysVal: 4.671 ± 0.682
1.104LysTrp: 1.104 ± 0.342
1.699LysTyr: 1.699 ± 0.521
0.0LysXaa: 0.0 ± 0.0
Leu
6.71LeuAla: 6.71 ± 0.974
0.34LeuCys: 0.34 ± 0.202
4.926LeuAsp: 4.926 ± 0.653
8.238LeuGlu: 8.238 ± 0.766
2.378LeuPhe: 2.378 ± 0.485
4.756LeuGly: 4.756 ± 0.473
1.699LeuHis: 1.699 ± 0.337
4.162LeuIle: 4.162 ± 0.645
6.71LeuLys: 6.71 ± 0.619
8.069LeuLeu: 8.069 ± 0.692
2.633LeuMet: 2.633 ± 0.398
4.926LeuAsn: 4.926 ± 0.659
3.737LeuPro: 3.737 ± 0.612
3.312LeuGln: 3.312 ± 0.579
3.992LeuArg: 3.992 ± 0.588
7.984LeuSer: 7.984 ± 0.927
6.455LeuThr: 6.455 ± 0.914
6.285LeuVal: 6.285 ± 0.709
0.849LeuTrp: 0.849 ± 0.212
5.096LeuTyr: 5.096 ± 0.788
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.449
0.255MetCys: 0.255 ± 0.151
1.614MetAsp: 1.614 ± 0.406
1.529MetGlu: 1.529 ± 0.393
0.51MetPhe: 0.51 ± 0.214
1.953MetGly: 1.953 ± 0.37
0.085MetHis: 0.085 ± 0.084
1.019MetIle: 1.019 ± 0.265
1.529MetLys: 1.529 ± 0.345
1.529MetLeu: 1.529 ± 0.414
0.51MetMet: 0.51 ± 0.234
0.849MetAsn: 0.849 ± 0.33
0.34MetPro: 0.34 ± 0.153
0.764MetGln: 0.764 ± 0.256
0.849MetArg: 0.849 ± 0.24
1.699MetSer: 1.699 ± 0.344
2.718MetThr: 2.718 ± 0.488
1.953MetVal: 1.953 ± 0.473
0.17MetTrp: 0.17 ± 0.128
0.679MetTyr: 0.679 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
2.888AsnAla: 2.888 ± 0.519
0.085AsnCys: 0.085 ± 0.082
1.699AsnAsp: 1.699 ± 0.403
2.633AsnGlu: 2.633 ± 0.416
1.869AsnPhe: 1.869 ± 0.384
5.266AsnGly: 5.266 ± 0.756
0.679AsnHis: 0.679 ± 0.218
1.869AsnIle: 1.869 ± 0.386
2.803AsnLys: 2.803 ± 0.415
3.312AsnLeu: 3.312 ± 0.395
1.189AsnMet: 1.189 ± 0.316
1.529AsnAsn: 1.529 ± 0.609
2.038AsnPro: 2.038 ± 0.334
2.208AsnGln: 2.208 ± 0.408
2.208AsnArg: 2.208 ± 0.449
3.227AsnSer: 3.227 ± 0.599
2.038AsnThr: 2.038 ± 0.448
2.803AsnVal: 2.803 ± 0.482
1.359AsnTrp: 1.359 ± 0.331
0.934AsnTyr: 0.934 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
1.189ProAla: 1.189 ± 0.403
0.255ProCys: 0.255 ± 0.155
1.614ProAsp: 1.614 ± 0.344
1.699ProGlu: 1.699 ± 0.41
0.934ProPhe: 0.934 ± 0.321
1.614ProGly: 1.614 ± 0.414
1.104ProHis: 1.104 ± 0.29
2.378ProIle: 2.378 ± 0.424
1.869ProLys: 1.869 ± 0.373
3.397ProLeu: 3.397 ± 0.462
0.17ProMet: 0.17 ± 0.114
1.359ProAsn: 1.359 ± 0.34
0.679ProPro: 0.679 ± 0.293
1.359ProGln: 1.359 ± 0.357
1.869ProArg: 1.869 ± 0.365
2.633ProSer: 2.633 ± 0.519
1.953ProThr: 1.953 ± 0.445
1.869ProVal: 1.869 ± 0.387
0.51ProTrp: 0.51 ± 0.192
0.934ProTyr: 0.934 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
5.096GlnAla: 5.096 ± 0.774
0.255GlnCys: 0.255 ± 0.14
2.633GlnAsp: 2.633 ± 0.459
2.888GlnGlu: 2.888 ± 0.375
1.869GlnPhe: 1.869 ± 0.508
2.718GlnGly: 2.718 ± 0.555
0.849GlnHis: 0.849 ± 0.23
2.293GlnIle: 2.293 ± 0.544
3.058GlnLys: 3.058 ± 0.495
4.332GlnLeu: 4.332 ± 0.66
1.274GlnMet: 1.274 ± 0.363
2.208GlnAsn: 2.208 ± 0.473
1.444GlnPro: 1.444 ± 0.377
1.529GlnGln: 1.529 ± 0.36
1.104GlnArg: 1.104 ± 0.283
2.633GlnSer: 2.633 ± 0.527
2.463GlnThr: 2.463 ± 0.355
3.907GlnVal: 3.907 ± 0.627
0.764GlnTrp: 0.764 ± 0.319
1.359GlnTyr: 1.359 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.293ArgAla: 2.293 ± 0.539
0.51ArgCys: 0.51 ± 0.217
2.718ArgAsp: 2.718 ± 0.476
2.888ArgGlu: 2.888 ± 0.453
1.869ArgPhe: 1.869 ± 0.372
2.038ArgGly: 2.038 ± 0.413
0.934ArgHis: 0.934 ± 0.327
2.803ArgIle: 2.803 ± 0.388
3.227ArgLys: 3.227 ± 0.67
4.671ArgLeu: 4.671 ± 0.704
0.51ArgMet: 0.51 ± 0.176
1.614ArgAsn: 1.614 ± 0.361
1.444ArgPro: 1.444 ± 0.379
3.143ArgGln: 3.143 ± 0.527
2.123ArgArg: 2.123 ± 0.477
2.293ArgSer: 2.293 ± 0.327
2.803ArgThr: 2.803 ± 0.578
4.162ArgVal: 4.162 ± 0.609
0.849ArgTrp: 0.849 ± 0.25
1.444ArgTyr: 1.444 ± 0.309
0.0ArgXaa: 0.0 ± 0.0
Ser
4.756SerAla: 4.756 ± 0.758
0.34SerCys: 0.34 ± 0.174
4.247SerAsp: 4.247 ± 0.699
5.436SerGlu: 5.436 ± 0.897
2.888SerPhe: 2.888 ± 0.576
5.521SerGly: 5.521 ± 0.933
2.208SerHis: 2.208 ± 0.442
4.417SerIle: 4.417 ± 0.51
4.586SerLys: 4.586 ± 0.602
6.54SerLeu: 6.54 ± 0.605
1.444SerMet: 1.444 ± 0.404
2.888SerAsn: 2.888 ± 0.598
2.208SerPro: 2.208 ± 0.511
2.463SerGln: 2.463 ± 0.385
2.718SerArg: 2.718 ± 0.543
6.37SerSer: 6.37 ± 1.052
4.926SerThr: 4.926 ± 0.915
3.822SerVal: 3.822 ± 0.637
1.274SerTrp: 1.274 ± 0.24
3.143SerTyr: 3.143 ± 0.48
0.0SerXaa: 0.0 ± 0.0
Thr
4.586ThrAla: 4.586 ± 0.693
0.255ThrCys: 0.255 ± 0.161
2.548ThrAsp: 2.548 ± 0.54
3.822ThrGlu: 3.822 ± 0.56
3.143ThrPhe: 3.143 ± 0.687
4.756ThrGly: 4.756 ± 0.775
1.104ThrHis: 1.104 ± 0.297
5.181ThrIle: 5.181 ± 1.292
4.586ThrLys: 4.586 ± 0.603
6.2ThrLeu: 6.2 ± 0.701
0.849ThrMet: 0.849 ± 0.239
1.953ThrAsn: 1.953 ± 0.408
1.784ThrPro: 1.784 ± 0.554
2.208ThrGln: 2.208 ± 0.348
2.208ThrArg: 2.208 ± 0.372
5.096ThrSer: 5.096 ± 0.789
5.096ThrThr: 5.096 ± 0.666
6.03ThrVal: 6.03 ± 0.821
1.359ThrTrp: 1.359 ± 0.406
2.888ThrTyr: 2.888 ± 0.506
0.0ThrXaa: 0.0 ± 0.0
Val
4.501ValAla: 4.501 ± 0.639
0.934ValCys: 0.934 ± 0.289
4.501ValAsp: 4.501 ± 0.594
4.332ValGlu: 4.332 ± 0.618
2.718ValPhe: 2.718 ± 0.435
3.822ValGly: 3.822 ± 0.575
1.189ValHis: 1.189 ± 0.228
4.926ValIle: 4.926 ± 0.776
4.586ValLys: 4.586 ± 0.642
6.88ValLeu: 6.88 ± 0.844
1.529ValMet: 1.529 ± 0.387
2.293ValAsn: 2.293 ± 0.389
1.614ValPro: 1.614 ± 0.438
2.803ValGln: 2.803 ± 0.497
2.548ValArg: 2.548 ± 0.393
4.671ValSer: 4.671 ± 0.789
6.115ValThr: 6.115 ± 0.907
3.822ValVal: 3.822 ± 0.624
1.104ValTrp: 1.104 ± 0.296
2.293ValTyr: 2.293 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
0.934TrpAla: 0.934 ± 0.314
0.425TrpCys: 0.425 ± 0.178
0.425TrpAsp: 0.425 ± 0.221
1.359TrpGlu: 1.359 ± 0.348
0.934TrpPhe: 0.934 ± 0.279
0.425TrpGly: 0.425 ± 0.192
0.255TrpHis: 0.255 ± 0.162
0.764TrpIle: 0.764 ± 0.274
0.849TrpLys: 0.849 ± 0.336
1.274TrpLeu: 1.274 ± 0.234
0.425TrpMet: 0.425 ± 0.166
1.104TrpAsn: 1.104 ± 0.307
0.425TrpPro: 0.425 ± 0.225
1.444TrpGln: 1.444 ± 0.316
0.51TrpArg: 0.51 ± 0.212
1.019TrpSer: 1.019 ± 0.3
1.359TrpThr: 1.359 ± 0.309
0.595TrpVal: 0.595 ± 0.27
0.17TrpTrp: 0.17 ± 0.098
0.17TrpTyr: 0.17 ± 0.101
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.378TyrAla: 2.378 ± 0.504
0.51TyrCys: 0.51 ± 0.233
2.718TyrAsp: 2.718 ± 0.653
3.227TyrGlu: 3.227 ± 0.5
1.359TyrPhe: 1.359 ± 0.436
3.227TyrGly: 3.227 ± 0.579
0.679TyrHis: 0.679 ± 0.24
1.699TyrIle: 1.699 ± 0.404
2.378TyrLys: 2.378 ± 0.572
3.482TyrLeu: 3.482 ± 0.703
0.425TyrMet: 0.425 ± 0.176
0.934TyrAsn: 0.934 ± 0.286
1.359TyrPro: 1.359 ± 0.339
2.633TyrGln: 2.633 ± 0.445
1.444TyrArg: 1.444 ± 0.352
2.463TyrSer: 2.463 ± 0.492
2.123TyrThr: 2.123 ± 0.384
2.208TyrVal: 2.208 ± 0.507
0.34TyrTrp: 0.34 ± 0.172
1.614TyrTyr: 1.614 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (11775 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski