Amino acid dipeptide frequency for Bacillus phage Wip1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
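
The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function names (dipeptide_permille, classify) are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, and that frequencies are normalized by the total number of pairs. The page does not state the exact normalization, so small differences from the tabulated values are possible.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and the denominator is the total number of such pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues into X
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 dipeptides)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "expected"
```

For example, classify(4.541) gives "over" (AlaAla in the table), while classify(0.454) gives "under" (AlaCys).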

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.541 ± 2.26 | 0.454 ± 0.287 | 2.498 ± 0.691 | 2.725 ± 0.725 | 2.725 ± 0.702 | 5.45 ± 1.224 | 0.227 ± 0.184 | 4.768 ± 0.844 | 4.541 ± 0.958 | 3.633 ± 0.987 | 2.044 ± 0.696 | 2.952 ± 0.777 | 1.589 ± 0.61 | 1.817 ± 0.694 | 4.541 ± 0.857 | 3.86 ± 1.016 | 2.725 ± 1.032 | 2.044 ± 0.784 | 0.454 ± 0.321 | 2.271 ± 0.646 | 0.0 ± 0.0
Cys | 0.227 ± 0.165 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.454 ± 0.287 | 0.681 ± 0.494 | 0.227 ± 0.238 | 0.454 ± 0.329 | 0.681 ± 0.363 | 0.0 ± 0.0 | 0.227 ± 0.204 | 0.0 ± 0.0 | 0.454 ± 0.262 | 0.227 ± 0.264 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.681 ± 0.361 | 0.681 ± 0.469 | 0.681 ± 0.316 | 0.227 ± 0.238 | 0.227 ± 0.165 | 0.0 ± 0.0
Asp | 3.633 ± 0.975 | 0.681 ± 0.367 | 3.179 ± 1.154 | 2.498 ± 0.533 | 4.768 ± 1.069 | 3.179 ± 0.642 | 1.362 ± 0.536 | 5.223 ± 1.165 | 5.223 ± 0.91 | 4.541 ± 1.061 | 0.908 ± 0.426 | 2.498 ± 0.784 | 0.908 ± 0.452 | 2.271 ± 1.077 | 3.179 ± 0.672 | 3.406 ± 0.911 | 3.406 ± 1.251 | 4.314 ± 0.926 | 1.135 ± 0.397 | 1.362 ± 0.611 | 0.0 ± 0.0
Glu | 2.952 ± 0.631 | 0.227 ± 0.165 | 3.86 ± 1.013 | 8.174 ± 3.793 | 2.498 ± 0.895 | 3.633 ± 0.94 | 1.362 ± 0.578 | 3.86 ± 1.562 | 5.904 ± 1.098 | 6.358 ± 1.479 | 2.725 ± 0.502 | 4.541 ± 1.155 | 1.817 ± 0.509 | 4.087 ± 0.779 | 2.725 ± 0.885 | 3.406 ± 0.805 | 3.179 ± 0.995 | 5.677 ± 1.055 | 1.135 ± 0.497 | 2.952 ± 0.824 | 0.454 ± 0.363
Phe | 1.589 ± 0.567 | 0.227 ± 0.165 | 3.633 ± 1.208 | 4.087 ± 1.088 | 2.044 ± 0.517 | 1.589 ± 0.567 | 0.908 ± 0.399 | 3.406 ± 0.958 | 4.087 ± 1.034 | 3.406 ± 0.976 | 2.044 ± 0.952 | 2.044 ± 0.65 | 2.044 ± 0.497 | 2.271 ± 0.566 | 2.271 ± 0.678 | 3.179 ± 0.957 | 4.995 ± 1.182 | 2.725 ± 0.58 | 0.681 ± 0.308 | 1.135 ± 0.425 | 0.227 ± 0.228
Gly | 4.087 ± 0.841 | 0.681 ± 0.367 | 3.406 ± 1.023 | 3.86 ± 0.871 | 4.314 ± 0.769 | 9.537 ± 2.825 | 1.135 ± 0.453 | 5.45 ± 1.379 | 5.45 ± 0.86 | 5.677 ± 1.258 | 1.362 ± 0.652 | 4.541 ± 0.831 | 0.454 ± 0.33 | 1.817 ± 0.6 | 3.633 ± 1.097 | 4.541 ± 0.961 | 4.768 ± 0.918 | 2.952 ± 1.013 | 1.135 ± 0.45 | 3.633 ± 1.301 | 0.0 ± 0.0
His | 0.454 ± 0.294 | 0.454 ± 0.317 | 0.681 ± 0.363 | 0.227 ± 0.165 | 1.135 ± 0.525 | 0.454 ± 0.361 | 0.0 ± 0.0 | 0.681 ± 0.345 | 1.362 ± 0.518 | 0.908 ± 0.379 | 0.227 ± 0.228 | 0.908 ± 0.455 | 0.908 ± 0.379 | 0.0 ± 0.0 | 0.681 ± 0.428 | 0.908 ± 0.34 | 0.681 ± 0.387 | 1.589 ± 0.678 | 0.0 ± 0.0 | 1.362 ± 0.42 | 0.0 ± 0.0
Ile | 2.725 ± 0.969 | 0.908 ± 0.395 | 4.314 ± 1.029 | 4.314 ± 1.347 | 2.498 ± 0.588 | 3.86 ± 0.914 | 0.454 ± 0.366 | 6.358 ± 1.525 | 7.266 ± 1.729 | 5.223 ± 1.337 | 2.725 ± 0.72 | 4.541 ± 1.021 | 3.86 ± 0.7 | 1.362 ± 0.535 | 1.817 ± 0.533 | 3.179 ± 0.713 | 3.179 ± 0.809 | 4.314 ± 0.926 | 0.681 ± 0.576 | 2.952 ± 0.728 | 0.0 ± 0.0
Lys | 4.541 ± 1.434 | 0.454 ± 0.262 | 6.812 ± 1.821 | 8.174 ± 1.282 | 4.541 ± 0.953 | 5.223 ± 1.159 | 0.908 ± 0.391 | 4.314 ± 1.004 | 9.991 ± 1.941 | 5.677 ± 1.279 | 2.498 ± 0.894 | 4.087 ± 0.873 | 4.314 ± 1.441 | 3.406 ± 0.769 | 4.314 ± 0.943 | 5.223 ± 1.207 | 4.768 ± 1.08 | 6.585 ± 1.105 | 0.227 ± 0.28 | 2.498 ± 0.833 | 0.227 ± 0.231
Leu | 5.45 ± 1.37 | 0.454 ± 0.35 | 4.314 ± 0.957 | 4.087 ± 1.013 | 4.995 ± 1.137 | 6.131 ± 0.881 | 0.681 ± 0.394 | 4.314 ± 0.912 | 7.039 ± 1.575 | 6.358 ± 1.332 | 1.362 ± 0.772 | 5.45 ± 1.283 | 3.633 ± 1.437 | 3.179 ± 0.635 | 3.179 ± 0.835 | 7.493 ± 1.186 | 3.406 ± 0.711 | 4.314 ± 1.106 | 0.681 ± 0.342 | 2.271 ± 0.477 | 0.227 ± 0.228
Met | 2.498 ± 0.841 | 0.0 ± 0.0 | 1.589 ± 0.516 | 1.589 ± 1.101 | 1.817 ± 0.589 | 2.271 ± 0.913 | 0.227 ± 0.217 | 1.817 ± 0.643 | 4.087 ± 0.972 | 2.498 ± 0.736 | 1.135 ± 0.574 | 1.362 ± 0.48 | 0.681 ± 0.437 | 0.227 ± 0.264 | 0.454 ± 0.329 | 1.135 ± 0.548 | 2.044 ± 0.922 | 1.817 ± 0.997 | 1.135 ± 0.602 | 1.135 ± 0.413 | 0.0 ± 0.0
Asn | 4.541 ± 0.818 | 0.227 ± 0.238 | 5.223 ± 1.281 | 3.86 ± 1.235 | 1.817 ± 0.506 | 6.585 ± 1.196 | 0.908 ± 0.389 | 4.768 ± 1.12 | 3.633 ± 0.854 | 3.633 ± 0.743 | 1.817 ± 0.45 | 3.86 ± 0.915 | 2.725 ± 1.009 | 2.271 ± 0.686 | 1.817 ± 0.501 | 2.044 ± 0.52 | 4.087 ± 0.763 | 2.498 ± 0.679 | 0.227 ± 0.165 | 1.362 ± 0.608 | 0.0 ± 0.0
Pro | 3.179 ± 0.878 | 0.0 ± 0.0 | 2.725 ± 0.853 | 2.498 ± 0.778 | 1.817 ± 0.504 | 0.908 ± 0.444 | 0.227 ± 0.212 | 2.952 ± 0.735 | 5.45 ± 1.416 | 2.044 ± 0.659 | 0.227 ± 0.184 | 2.952 ± 0.873 | 2.044 ± 0.474 | 1.135 ± 0.505 | 0.454 ± 0.369 | 4.314 ± 1.143 | 1.135 ± 0.476 | 2.498 ± 0.858 | 0.227 ± 0.267 | 1.135 ± 0.553 | 0.227 ± 0.212
Gln | 2.044 ± 0.79 | 0.227 ± 0.165 | 1.135 ± 0.53 | 2.952 ± 0.8 | 0.681 ± 0.376 | 1.817 ± 0.562 | 0.454 ± 0.314 | 2.725 ± 0.718 | 2.952 ± 0.916 | 2.952 ± 0.775 | 1.362 ± 0.709 | 3.179 ± 1.064 | 1.362 ± 0.704 | 2.271 ± 0.863 | 1.589 ± 0.517 | 2.044 ± 0.899 | 3.633 ± 0.925 | 2.044 ± 0.761 | 0.681 ± 0.416 | 1.589 ± 0.569 | 0.0 ± 0.0
Arg | 1.362 ± 0.394 | 0.0 ± 0.0 | 2.271 ± 0.711 | 4.314 ± 0.957 | 0.908 ± 0.395 | 3.179 ± 0.91 | 0.454 ± 0.317 | 2.952 ± 0.866 | 4.314 ± 0.982 | 4.541 ± 1.029 | 0.681 ± 0.413 | 3.406 ± 1.058 | 1.362 ± 0.53 | 2.271 ± 0.567 | 2.271 ± 0.578 | 2.271 ± 0.615 | 1.589 ± 0.529 | 4.087 ± 1.151 | 0.227 ± 0.165 | 0.908 ± 0.421 | 0.0 ± 0.0
Ser | 2.952 ± 0.702 | 0.0 ± 0.0 | 2.498 ± 1.047 | 3.86 ± 0.831 | 2.952 ± 0.843 | 6.585 ± 1.886 | 0.908 ± 0.387 | 2.725 ± 0.507 | 6.585 ± 0.894 | 4.314 ± 0.828 | 2.725 ± 0.761 | 2.725 ± 0.576 | 3.86 ± 0.708 | 2.044 ± 0.558 | 2.952 ± 0.989 | 3.633 ± 0.855 | 3.406 ± 1.232 | 4.768 ± 0.953 | 0.908 ± 0.386 | 1.817 ± 0.538 | 0.0 ± 0.0
Thr | 2.498 ± 0.592 | 0.454 ± 0.244 | 2.725 ± 0.751 | 5.45 ± 1.062 | 2.952 ± 0.62 | 4.768 ± 1.479 | 0.681 ± 0.243 | 4.541 ± 1.046 | 4.541 ± 1.533 | 5.904 ± 1.538 | 1.135 ± 0.557 | 3.633 ± 0.906 | 2.044 ± 0.671 | 2.498 ± 1.045 | 1.362 ± 0.56 | 2.952 ± 0.897 | 3.179 ± 1.048 | 3.86 ± 0.754 | 0.454 ± 0.289 | 1.817 ± 0.562 | 0.227 ± 0.217
Val | 2.952 ± 1.329 | 0.227 ± 0.264 | 3.86 ± 0.77 | 5.904 ± 1.263 | 3.179 ± 1.175 | 4.541 ± 1.482 | 1.362 ± 0.481 | 3.179 ± 1.043 | 3.86 ± 0.798 | 6.131 ± 0.836 | 3.406 ± 0.965 | 1.589 ± 0.554 | 2.498 ± 1.018 | 1.817 ± 0.615 | 3.179 ± 0.946 | 4.541 ± 1.029 | 5.223 ± 1.463 | 7.266 ± 1.333 | 0.454 ± 0.528 | 1.589 ± 0.517 | 0.0 ± 0.0
Trp | 0.908 ± 0.464 | 0.0 ± 0.0 | 0.908 ± 0.562 | 0.908 ± 0.515 | 0.227 ± 0.27 | 0.454 ± 0.424 | 0.0 ± 0.0 | 0.227 ± 0.28 | 0.454 ± 0.318 | 1.135 ± 0.473 | 0.227 ± 0.232 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.135 ± 0.524 | 0.681 ± 0.373 | 0.908 ± 0.328 | 0.681 ± 0.336 | 1.362 ± 0.759 | 0.227 ± 0.28 | 0.681 ± 0.489 | 0.0 ± 0.0
Tyr | 2.952 ± 0.933 | 0.454 ± 0.265 | 2.044 ± 0.582 | 1.589 ± 0.483 | 2.044 ± 0.63 | 1.817 ± 1.005 | 0.908 ± 0.471 | 1.589 ± 0.676 | 2.044 ± 0.857 | 3.406 ± 0.808 | 0.681 ± 0.348 | 2.952 ± 0.696 | 1.817 ± 0.543 | 1.589 ± 0.482 | 2.044 ± 0.653 | 2.498 ± 0.695 | 0.681 ± 0.316 | 1.362 ± 0.424 | 0.227 ± 0.212 | 0.908 ± 0.497 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.227 ± 0.228 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.454 ± 0.318 | 0.0 ± 0.0 | 0.227 ± 0.231 | 0.0 ± 0.0 | 0.227 ± 0.238 | 0.227 ± 0.228 | 0.0 ± 0.0 | 0.227 ± 0.217 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.227 ± 0.228
Statistics based on 27 proteins (4405 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
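
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the error is taken to be the standard deviation over the resamples (the note above does not say which statistic is reported). The names permille and bootstrap_error are hypothetical.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread (sample standard deviation)
    of the bootstrap estimates; the seed is fixed only for reproducibility.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in a small proteome such as this one (27 proteins).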

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski