Amino acid dipeptide frequency for Streptococcus phage Javan629

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
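As an illustration of the idea, the following minimal Python sketch counts overlapping residue pairs in a set of sequences. The function name, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions of this sketch; the database may compute its values differently.

```python
from collections import Counter


# Hypothetical helper, not taken from the Proteome-pI code base.
def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Dipeptides are counted as overlapping windows of two residues and never
    span the boundary between two proteins. Normalising by the total number
    of dipeptides is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: made-up sequences in one-letter amino acid code.
toy_proteome = ["MKKLA", "AKKA"]
freqs = dipeptide_frequencies(toy_proteome)
```

In the toy input, "KK" occurs twice among seven dipeptides, and "AA" is not counted because the two A residues belong to different proteins.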

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
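The baseline described above can be worked through in a few lines of Python. This is only a sketch: the `classify` helper and its labels are hypothetical, and the exact rule the site uses for colouring cells is not stated beyond "more than expected", so a plain comparison against the uniform baseline is an assumption.

```python
# Uniform baseline: each of the 20 x 20 ordered pairs is equally likely.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%


def classify(observed_permille, baseline=expected_permille):
    """Label a dipeptide relative to the uniform baseline (hypothetical helper)."""
    if observed_permille > baseline:
        return "over"
    if observed_permille < baseline:
        return "under"
    return "expected"


# Values copied from the table on this page (in per mille).
examples = {"GluLeu": 7.942, "AlaAla": 2.174, "CysCys": 0.084}
labels = {name: classify(value) for name, value in examples.items()}
```

With these inputs, GluLeu is labelled as overrepresented, while AlaAla and CysCys fall below the 2.5‰ baseline.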

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
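For example, the unit conversion can be written as a short Python sketch. The helper names are hypothetical; the AlaLys value is taken from the table on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (hypothetical helper)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (hypothetical helper)."""
    return value_permille / 10.0


ala_lys_permille = 6.438  # AlaLys, from the table on this page
ala_lys_fraction = permille_to_fraction(ala_lys_permille)  # about 0.006438
ala_lys_percent = permille_to_percent(ala_lys_permille)    # about 0.6438 %
```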

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.174 ± 0.554
AlaCys: 0.334 ± 0.182
AlaAsp: 5.184 ± 0.491
AlaGlu: 5.351 ± 0.656
AlaPhe: 2.425 ± 0.514
AlaGly: 3.595 ± 0.674
AlaHis: 1.17 ± 0.307
AlaIle: 5.267 ± 0.669
AlaLys: 6.438 ± 1.146
AlaLeu: 5.434 ± 1.079
AlaMet: 2.09 ± 0.431
AlaAsn: 3.929 ± 0.621
AlaPro: 2.425 ± 0.54
AlaGln: 3.01 ± 0.605
AlaArg: 2.843 ± 0.39
AlaSer: 5.016 ± 0.824
AlaThr: 4.347 ± 0.924
AlaVal: 2.675 ± 0.503
AlaTrp: 1.003 ± 0.334
AlaTyr: 2.592 ± 0.376
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.418 ± 0.183
CysCys: 0.084 ± 0.093
CysAsp: 0.084 ± 0.076
CysGlu: 0.502 ± 0.264
CysPhe: 0.084 ± 0.083
CysGly: 0.585 ± 0.367
CysHis: 0.084 ± 0.083
CysIle: 0.167 ± 0.143
CysLys: 0.502 ± 0.247
CysLeu: 0.502 ± 0.375
CysMet: 0.167 ± 0.132
CysAsn: 0.418 ± 0.165
CysPro: 0.084 ± 0.08
CysGln: 0.084 ± 0.08
CysArg: 0.167 ± 0.121
CysSer: 0.167 ± 0.121
CysThr: 0.084 ± 0.078
CysVal: 0.334 ± 0.137
CysTrp: 0.0 ± 0.0
CysTyr: 0.334 ± 0.313
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.347 ± 0.66
AspCys: 0.334 ± 0.204
AspAsp: 3.929 ± 0.711
AspGlu: 5.016 ± 0.668
AspPhe: 3.344 ± 0.601
AspGly: 5.016 ± 0.822
AspHis: 0.334 ± 0.192
AspIle: 3.595 ± 0.642
AspLys: 6.27 ± 0.648
AspLeu: 5.769 ± 0.625
AspMet: 1.421 ± 0.382
AspAsn: 3.428 ± 0.509
AspPro: 1.756 ± 0.285
AspGln: 1.254 ± 0.393
AspArg: 2.759 ± 0.542
AspSer: 4.431 ± 0.417
AspThr: 4.347 ± 0.702
AspVal: 3.261 ± 0.457
AspTrp: 1.17 ± 0.327
AspTyr: 3.511 ± 0.832
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.515 ± 0.809
GluCys: 0.669 ± 0.304
GluAsp: 4.598 ± 0.657
GluGlu: 4.431 ± 0.482
GluPhe: 2.592 ± 0.635
GluGly: 3.679 ± 0.558
GluHis: 0.418 ± 0.219
GluIle: 6.605 ± 0.853
GluLys: 6.772 ± 0.992
GluLeu: 7.942 ± 0.996
GluMet: 2.257 ± 0.577
GluAsn: 4.765 ± 0.753
GluPro: 1.672 ± 0.412
GluGln: 3.344 ± 0.59
GluArg: 2.843 ± 0.512
GluSer: 4.347 ± 0.84
GluThr: 4.18 ± 0.623
GluVal: 5.1 ± 0.728
GluTrp: 1.254 ± 0.294
GluTyr: 2.508 ± 0.436
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.759 ± 0.432
PheCys: 0.0 ± 0.0
PheAsp: 2.759 ± 0.539
PheGlu: 3.428 ± 0.662
PhePhe: 1.338 ± 0.396
PheGly: 2.843 ± 0.446
PheHis: 0.502 ± 0.192
PheIle: 2.759 ± 0.662
PheLys: 4.264 ± 0.708
PheLeu: 3.177 ± 0.656
PheMet: 1.087 ± 0.338
PheAsn: 2.174 ± 0.346
PhePro: 0.836 ± 0.207
PheGln: 0.752 ± 0.244
PheArg: 1.421 ± 0.404
PheSer: 3.177 ± 0.429
PheThr: 1.923 ± 0.354
PheVal: 2.257 ± 0.455
PheTrp: 0.669 ± 0.332
PheTyr: 1.087 ± 0.314
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.097 ± 0.453
GlyCys: 0.502 ± 0.245
GlyAsp: 3.595 ± 0.411
GlyGlu: 2.843 ± 0.461
GlyPhe: 3.428 ± 0.689
GlyGly: 4.598 ± 1.116
GlyHis: 1.17 ± 0.344
GlyIle: 4.264 ± 0.603
GlyLys: 5.267 ± 0.84
GlyLeu: 6.354 ± 1.116
GlyMet: 1.003 ± 0.278
GlyAsn: 3.093 ± 0.499
GlyPro: 0.669 ± 0.194
GlyGln: 2.174 ± 0.386
GlyArg: 2.007 ± 0.437
GlySer: 4.431 ± 0.856
GlyThr: 5.685 ± 1.078
GlyVal: 4.598 ± 0.728
GlyTrp: 1.17 ± 0.285
GlyTyr: 2.675 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.836 ± 0.234
HisCys: 0.167 ± 0.185
HisAsp: 1.087 ± 0.312
HisGlu: 1.087 ± 0.255
HisPhe: 0.418 ± 0.18
HisGly: 0.669 ± 0.275
HisHis: 0.418 ± 0.26
HisIle: 0.669 ± 0.235
HisLys: 0.752 ± 0.26
HisLeu: 1.003 ± 0.349
HisMet: 0.0 ± 0.0
HisAsn: 0.418 ± 0.191
HisPro: 0.418 ± 0.197
HisGln: 0.585 ± 0.167
HisArg: 0.418 ± 0.156
HisSer: 0.752 ± 0.241
HisThr: 1.087 ± 0.357
HisVal: 0.669 ± 0.295
HisTrp: 0.084 ± 0.076
HisTyr: 0.334 ± 0.151
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.1 ± 0.68
IleCys: 0.251 ± 0.143
IleAsp: 6.02 ± 0.917
IleGlu: 7.106 ± 0.974
IlePhe: 2.09 ± 0.403
IleGly: 3.093 ± 0.479
IleHis: 0.502 ± 0.138
IleIle: 4.18 ± 0.645
IleLys: 6.438 ± 1.014
IleLeu: 4.431 ± 0.646
IleMet: 0.836 ± 0.273
IleAsn: 4.18 ± 0.537
IlePro: 2.508 ± 0.346
IleGln: 3.261 ± 0.499
IleArg: 2.843 ± 0.512
IleSer: 4.765 ± 0.733
IleThr: 4.765 ± 0.497
IleVal: 3.846 ± 0.599
IleTrp: 0.585 ± 0.228
IleTyr: 2.843 ± 0.535
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.521 ± 1.14
LysCys: 0.0 ± 0.0
LysAsp: 4.515 ± 0.553
LysGlu: 7.441 ± 1.153
LysPhe: 3.01 ± 0.529
LysGly: 4.849 ± 0.525
LysHis: 1.003 ± 0.298
LysIle: 5.602 ± 0.668
LysLys: 6.856 ± 1.065
LysLeu: 7.274 ± 1.408
LysMet: 2.425 ± 0.568
LysAsn: 5.852 ± 0.705
LysPro: 2.341 ± 0.486
LysGln: 3.261 ± 0.601
LysArg: 4.347 ± 0.721
LysSer: 4.849 ± 0.693
LysThr: 6.772 ± 0.894
LysVal: 6.187 ± 0.747
LysTrp: 1.087 ± 0.331
LysTyr: 2.174 ± 0.39
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.27 ± 0.829
LeuCys: 0.585 ± 0.281
LeuAsp: 6.02 ± 0.834
LeuGlu: 6.939 ± 0.907
LeuPhe: 2.843 ± 0.642
LeuGly: 5.016 ± 0.64
LeuHis: 1.17 ± 0.341
LeuIle: 6.605 ± 0.703
LeuLys: 7.19 ± 0.852
LeuLeu: 7.023 ± 0.829
LeuMet: 2.09 ± 0.451
LeuAsn: 4.765 ± 0.593
LeuPro: 2.759 ± 0.446
LeuGln: 2.926 ± 0.554
LeuArg: 2.508 ± 0.401
LeuSer: 5.184 ± 0.854
LeuThr: 6.187 ± 0.808
LeuVal: 4.18 ± 0.491
LeuTrp: 0.836 ± 0.261
LeuTyr: 2.425 ± 0.362
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.839 ± 0.438
MetCys: 0.084 ± 0.08
MetAsp: 1.923 ± 0.404
MetGlu: 2.007 ± 0.453
MetPhe: 0.585 ± 0.205
MetGly: 1.17 ± 0.355
MetHis: 0.251 ± 0.129
MetIle: 1.588 ± 0.387
MetLys: 1.756 ± 0.435
MetLeu: 1.254 ± 0.349
MetMet: 0.251 ± 0.169
MetAsn: 1.17 ± 0.343
MetPro: 0.752 ± 0.245
MetGln: 1.672 ± 0.365
MetArg: 1.087 ± 0.367
MetSer: 2.007 ± 0.369
MetThr: 2.257 ± 0.398
MetVal: 1.421 ± 0.27
MetTrp: 0.334 ± 0.14
MetTyr: 0.585 ± 0.217
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.261 ± 0.689
AsnCys: 0.084 ± 0.078
AsnAsp: 3.762 ± 0.546
AsnGlu: 4.264 ± 0.733
AsnPhe: 2.174 ± 0.471
AsnGly: 5.184 ± 0.601
AsnHis: 0.585 ± 0.197
AsnIle: 3.177 ± 0.534
AsnLys: 4.598 ± 0.634
AsnLeu: 4.933 ± 0.573
AsnMet: 1.672 ± 0.296
AsnAsn: 3.093 ± 0.515
AsnPro: 1.588 ± 0.498
AsnGln: 2.843 ± 0.459
AsnArg: 1.254 ± 0.332
AsnSer: 4.18 ± 0.657
AsnThr: 2.341 ± 0.501
AsnVal: 4.347 ± 0.516
AsnTrp: 0.92 ± 0.306
AsnTyr: 1.923 ± 0.365
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.505 ± 0.337
ProCys: 0.0 ± 0.0
ProAsp: 2.007 ± 0.39
ProGlu: 2.592 ± 0.595
ProPhe: 1.087 ± 0.355
ProGly: 1.254 ± 0.284
ProHis: 0.251 ± 0.131
ProIle: 2.341 ± 0.489
ProLys: 2.341 ± 0.487
ProLeu: 2.675 ± 0.523
ProMet: 0.585 ± 0.26
ProAsn: 1.17 ± 0.369
ProPro: 0.585 ± 0.22
ProGln: 1.672 ± 0.438
ProArg: 0.418 ± 0.167
ProSer: 1.923 ± 0.492
ProThr: 2.508 ± 0.391
ProVal: 2.174 ± 0.432
ProTrp: 0.167 ± 0.113
ProTyr: 1.003 ± 0.247
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.093 ± 0.531
GlnCys: 0.251 ± 0.125
GlnAsp: 1.254 ± 0.405
GlnGlu: 2.926 ± 0.518
GlnPhe: 2.09 ± 0.388
GlnGly: 2.843 ± 0.653
GlnHis: 1.087 ± 0.235
GlnIle: 2.174 ± 0.369
GlnLys: 3.846 ± 0.65
GlnLeu: 3.679 ± 0.663
GlnMet: 1.338 ± 0.311
GlnAsn: 2.09 ± 0.506
GlnPro: 1.672 ± 0.612
GlnGln: 2.592 ± 0.731
GlnArg: 2.174 ± 0.515
GlnSer: 2.926 ± 0.454
GlnThr: 1.923 ± 0.434
GlnVal: 1.839 ± 0.49
GlnTrp: 0.585 ± 0.216
GlnTyr: 1.672 ± 0.401
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.338 ± 0.351
ArgCys: 0.167 ± 0.117
ArgAsp: 2.174 ± 0.528
ArgGlu: 2.843 ± 0.367
ArgPhe: 1.505 ± 0.427
ArgGly: 2.174 ± 0.46
ArgHis: 0.418 ± 0.16
ArgIle: 2.675 ± 0.542
ArgLys: 3.595 ± 0.668
ArgLeu: 3.261 ± 0.58
ArgMet: 1.254 ± 0.407
ArgAsn: 2.09 ± 0.363
ArgPro: 1.17 ± 0.381
ArgGln: 1.421 ± 0.378
ArgArg: 1.588 ± 0.46
ArgSer: 1.756 ± 0.332
ArgThr: 2.675 ± 0.661
ArgVal: 2.341 ± 0.436
ArgTrp: 0.502 ± 0.193
ArgTyr: 1.923 ± 0.509
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.02 ± 0.869
SerCys: 0.251 ± 0.217
SerAsp: 4.682 ± 0.588
SerGlu: 4.264 ± 0.721
SerPhe: 2.843 ± 0.556
SerGly: 4.431 ± 0.699
SerHis: 0.836 ± 0.249
SerIle: 5.434 ± 0.786
SerLys: 4.18 ± 0.501
SerLeu: 4.515 ± 0.575
SerMet: 1.756 ± 0.392
SerAsn: 3.177 ± 0.487
SerPro: 1.839 ± 0.3
SerGln: 3.762 ± 0.611
SerArg: 2.09 ± 0.432
SerSer: 5.016 ± 1.048
SerThr: 5.184 ± 0.751
SerVal: 3.846 ± 0.509
SerTrp: 0.92 ± 0.355
SerTyr: 2.341 ± 0.499
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.351 ± 0.863
ThrCys: 0.084 ± 0.093
ThrAsp: 3.846 ± 0.899
ThrGlu: 3.929 ± 0.549
ThrPhe: 3.511 ± 0.639
ThrGly: 4.765 ± 0.473
ThrHis: 0.669 ± 0.229
ThrIle: 5.184 ± 0.699
ThrLys: 5.769 ± 0.666
ThrLeu: 5.518 ± 0.603
ThrMet: 1.505 ± 0.28
ThrAsn: 3.679 ± 0.496
ThrPro: 1.923 ± 0.449
ThrGln: 2.508 ± 0.535
ThrArg: 2.007 ± 0.292
ThrSer: 4.933 ± 0.803
ThrThr: 3.846 ± 0.662
ThrVal: 6.187 ± 0.871
ThrTrp: 0.836 ± 0.247
ThrTyr: 2.257 ± 0.704
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.765 ± 0.7
ValCys: 0.418 ± 0.162
ValAsp: 4.431 ± 0.483
ValGlu: 4.013 ± 0.448
ValPhe: 1.923 ± 0.321
ValGly: 4.765 ± 0.59
ValHis: 0.418 ± 0.214
ValIle: 5.016 ± 0.563
ValLys: 5.518 ± 0.589
ValLeu: 4.849 ± 0.669
ValMet: 1.338 ± 0.287
ValAsn: 3.679 ± 0.585
ValPro: 1.839 ± 0.415
ValGln: 2.257 ± 0.486
ValArg: 2.174 ± 0.627
ValSer: 4.264 ± 0.762
ValThr: 4.097 ± 0.442
ValVal: 3.762 ± 0.801
ValTrp: 0.502 ± 0.255
ValTyr: 1.839 ± 0.386
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.003 ± 0.302
TrpCys: 0.084 ± 0.078
TrpAsp: 1.17 ± 0.383
TrpGlu: 0.752 ± 0.234
TrpPhe: 0.669 ± 0.217
TrpGly: 1.003 ± 0.305
TrpHis: 0.167 ± 0.121
TrpIle: 0.752 ± 0.357
TrpLys: 0.92 ± 0.324
TrpLeu: 1.17 ± 0.405
TrpMet: 0.251 ± 0.162
TrpAsn: 0.92 ± 0.34
TrpPro: 0.084 ± 0.083
TrpGln: 0.334 ± 0.176
TrpArg: 0.502 ± 0.147
TrpSer: 1.003 ± 0.263
TrpThr: 1.087 ± 0.376
TrpVal: 0.502 ± 0.21
TrpTrp: 0.167 ± 0.109
TrpTyr: 0.502 ± 0.204
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.09 ± 0.516
TyrCys: 0.502 ± 0.23
TyrAsp: 2.257 ± 0.375
TyrGlu: 2.759 ± 0.357
TyrPhe: 1.254 ± 0.248
TyrGly: 2.09 ± 0.455
TyrHis: 0.418 ± 0.217
TyrIle: 1.588 ± 0.482
TyrLys: 3.093 ± 0.542
TyrLeu: 2.759 ± 0.633
TyrMet: 0.585 ± 0.25
TyrAsn: 2.09 ± 0.425
TyrPro: 1.338 ± 0.376
TyrGln: 2.341 ± 0.562
TyrArg: 1.421 ± 0.346
TyrSer: 2.341 ± 0.542
TyrThr: 3.01 ± 0.435
TyrVal: 2.341 ± 0.508
TyrTrp: 0.251 ± 0.167
TyrTyr: 1.338 ± 0.343
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11962 amino acids)
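To work with the table programmatically, the entries of the form `AlaAla: 2.174 ± 0.554` can be read into a dictionary. The sketch below is one possible way to do this in Python; the regular expression and the `parse_rows` helper are assumptions of this example and are not part of the Proteome-pI tooling.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ROW = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_rows(lines):
    """Map (first, second) residue pairs to (value, error) in per mille."""
    table = {}
    for line in lines:
        match = ROW.search(line)
        if match is None:
            continue  # row labels such as "Ala" carry no data
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


# A few lines copied from the table on this page.
rows = ["Ala", "AlaAla: 2.174 ± 0.554", "AlaCys: 0.334 ± 0.182"]
table = parse_rows(rows)
```

Keying the dictionary by the ordered pair keeps AlaCys and CysAla distinct, which matters because the two orders have different frequencies.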

Note: The error was estimated by bootstrapping (×100) at the protein level.
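A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the resulting estimates is reported. The source states only that 100 bootstrap rounds were run at the protein level; taking the standard deviation as the error, the helper names, the fixed seed, and the toy sequences are all assumptions of this sketch.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (e.g. "KK") in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input: made-up sequences in one-letter amino acid code.
toy_proteome = ["MKKLA", "AKKA", "MLLKK", "GAVKL"]
err = bootstrap_error(toy_proteome, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.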

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski