Amino acid dipeptide frequency for Bacillus phage Pascal

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
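The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to build the table: the function name `dipeptide_per_mille`, the one-letter input alphabet, and the choice to normalise by the total number of overlapping dipeptides counted within each protein are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids (assumed input format).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are overlapping adjacent pairs counted inside each protein,
    so no pair spans two proteins. Any non-standard letter is mapped to 'X'
    (Xaa). Normalising by the total pair count is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Expected value if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25 % = 2.5 per mille

# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MKKLA", "AKKX"])
over = sorted(d for d, f in freqs.items() if f > UNIFORM_PER_MILLE)
print(len(freqs), over)
```

Comparing each value with `UNIFORM_PER_MILLE` is one simple way to decide which dipeptides count as over- or underrepresented; the threshold actually used for the red and blue marking is not stated in the text.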

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.385 ± 0.723 | 0.473 ± 0.218 | 4.651 ± 0.575 | 5.518 ± 0.917 | 3.39 ± 0.459 | 6.07 ± 0.859 | 1.182 ± 0.299 | 3.311 ± 0.668 | 4.572 ± 0.855 | 5.833 ± 0.874 | 2.286 ± 0.425 | 4.651 ± 0.737 | 2.995 ± 0.659 | 3.784 ± 0.72 | 3.547 ± 0.645 | 3.547 ± 0.442 | 5.281 ± 0.612 | 4.651 ± 0.525 | 0.788 ± 0.233 | 2.995 ± 0.565 | 0.0 ± 0.0
Cys | 0.709 ± 0.24 | 0.079 ± 0.082 | 0.394 ± 0.171 | 0.631 ± 0.25 | 0.236 ± 0.143 | 1.025 ± 0.34 | 0.079 ± 0.095 | 0.315 ± 0.147 | 0.709 ± 0.266 | 0.473 ± 0.229 | 0.158 ± 0.108 | 0.788 ± 0.293 | 0.236 ± 0.136 | 0.236 ± 0.139 | 0.315 ± 0.153 | 0.631 ± 0.246 | 0.315 ± 0.133 | 0.315 ± 0.171 | 0.158 ± 0.117 | 0.079 ± 0.088 | 0.0 ± 0.0
Asp | 3.074 ± 0.391 | 0.552 ± 0.23 | 2.917 ± 0.734 | 3.705 ± 0.516 | 2.995 ± 0.609 | 4.02 ± 0.622 | 0.709 ± 0.199 | 4.572 ± 0.578 | 4.178 ± 0.556 | 3.941 ± 0.539 | 1.655 ± 0.37 | 3.784 ± 0.541 | 1.971 ± 0.43 | 2.05 ± 0.551 | 1.655 ± 0.387 | 3.39 ± 0.638 | 2.838 ± 0.451 | 3.153 ± 0.46 | 0.946 ± 0.345 | 3.39 ± 0.509 | 0.0 ± 0.0
Glu | 5.281 ± 0.842 | 0.788 ± 0.27 | 2.365 ± 0.463 | 6.07 ± 1.113 | 2.68 ± 0.469 | 4.257 ± 0.68 | 1.34 ± 0.469 | 4.414 ± 0.759 | 6.858 ± 0.891 | 5.833 ± 1.227 | 2.444 ± 0.434 | 4.335 ± 0.689 | 2.286 ± 0.457 | 4.178 ± 0.636 | 3.39 ± 0.575 | 2.601 ± 0.498 | 2.995 ± 0.675 | 3.941 ± 0.694 | 0.788 ± 0.244 | 2.68 ± 0.522 | 0.0 ± 0.0
Phe | 2.05 ± 0.384 | 0.394 ± 0.171 | 2.522 ± 0.387 | 3.074 ± 0.524 | 1.813 ± 0.389 | 2.917 ± 0.367 | 0.631 ± 0.273 | 3.311 ± 0.552 | 4.493 ± 0.585 | 3.074 ± 0.413 | 0.946 ± 0.268 | 2.601 ± 0.428 | 1.971 ± 0.493 | 1.892 ± 0.39 | 1.34 ± 0.458 | 2.68 ± 0.647 | 3.074 ± 0.512 | 1.498 ± 0.286 | 0.236 ± 0.125 | 1.025 ± 0.252 | 0.0 ± 0.0
Gly | 5.912 ± 0.725 | 0.158 ± 0.107 | 3.311 ± 0.489 | 3.311 ± 0.686 | 2.68 ± 0.373 | 5.833 ± 1.377 | 0.867 ± 0.265 | 4.73 ± 0.515 | 6.385 ± 1.203 | 5.676 ± 1.013 | 2.365 ± 0.393 | 4.335 ± 0.681 | 0.552 ± 0.202 | 2.759 ± 0.615 | 2.68 ± 0.481 | 3.468 ± 0.7 | 5.203 ± 0.827 | 4.966 ± 0.711 | 0.946 ± 0.317 | 3.153 ± 0.602 | 0.0 ± 0.0
His | 0.788 ± 0.254 | 0.315 ± 0.146 | 1.261 ± 0.298 | 1.104 ± 0.33 | 1.025 ± 0.281 | 0.473 ± 0.191 | 0.236 ± 0.155 | 0.709 ± 0.219 | 1.34 ± 0.314 | 1.104 ± 0.231 | 0.552 ± 0.241 | 0.946 ± 0.282 | 1.025 ± 0.342 | 0.631 ± 0.175 | 0.867 ± 0.334 | 1.025 ± 0.34 | 0.631 ± 0.213 | 0.552 ± 0.25 | 0.473 ± 0.179 | 0.946 ± 0.256 | 0.0 ± 0.0
Ile | 4.572 ± 0.795 | 0.236 ± 0.137 | 3.863 ± 0.719 | 4.257 ± 0.573 | 1.577 ± 0.498 | 4.335 ± 0.734 | 1.734 ± 0.378 | 3.468 ± 0.699 | 5.676 ± 0.906 | 3.547 ± 0.6 | 1.892 ± 0.39 | 4.808 ± 0.64 | 1.971 ± 0.5 | 2.838 ± 0.453 | 2.05 ± 0.447 | 2.522 ± 0.494 | 3.547 ± 0.696 | 3.153 ± 0.473 | 0.552 ± 0.198 | 1.419 ± 0.381 | 0.0 ± 0.0
Lys | 6.464 ± 1.15 | 1.104 ± 0.423 | 4.651 ± 0.897 | 6.937 ± 0.831 | 2.601 ± 0.382 | 5.912 ± 1.134 | 1.892 ± 0.414 | 4.966 ± 0.598 | 8.434 ± 0.982 | 5.281 ± 0.644 | 2.522 ± 0.436 | 6.07 ± 0.493 | 2.601 ± 0.596 | 4.966 ± 0.925 | 4.257 ± 0.598 | 4.73 ± 0.854 | 5.124 ± 0.607 | 4.257 ± 0.492 | 0.552 ± 0.202 | 3.705 ± 0.609 | 0.0 ± 0.0
Leu | 4.808 ± 0.592 | 0.552 ± 0.314 | 4.02 ± 0.368 | 5.597 ± 0.591 | 2.759 ± 0.413 | 4.335 ± 0.751 | 1.34 ± 0.332 | 4.651 ± 0.697 | 5.597 ± 0.902 | 4.572 ± 0.712 | 2.05 ± 0.378 | 5.597 ± 0.605 | 2.759 ± 0.588 | 3.705 ± 0.615 | 2.601 ± 0.358 | 5.754 ± 0.662 | 5.439 ± 0.869 | 4.257 ± 0.669 | 0.631 ± 0.237 | 2.68 ± 0.566 | 0.0 ± 0.0
Met | 2.05 ± 0.542 | 0.158 ± 0.108 | 1.892 ± 0.399 | 1.182 ± 0.245 | 1.419 ± 0.329 | 1.498 ± 0.524 | 0.158 ± 0.123 | 1.498 ± 0.374 | 2.759 ± 0.524 | 2.05 ± 0.421 | 0.788 ± 0.311 | 2.286 ± 0.329 | 1.892 ± 0.433 | 1.655 ± 0.431 | 1.025 ± 0.31 | 1.498 ± 0.388 | 1.419 ± 0.437 | 1.34 ± 0.338 | 0.394 ± 0.191 | 0.788 ± 0.299 | 0.0 ± 0.0
Asn | 6.385 ± 0.726 | 0.394 ± 0.193 | 4.178 ± 0.614 | 4.178 ± 0.795 | 2.286 ± 0.469 | 4.572 ± 0.578 | 1.34 ± 0.458 | 3.547 ± 0.493 | 5.045 ± 0.882 | 4.808 ± 0.896 | 1.734 ± 0.462 | 4.572 ± 1.061 | 2.601 ± 0.925 | 3.311 ± 0.437 | 2.444 ± 0.401 | 2.917 ± 0.328 | 3.941 ± 0.649 | 4.73 ± 0.9 | 1.025 ± 0.407 | 2.444 ± 0.506 | 0.0 ± 0.0
Pro | 2.68 ± 0.473 | 0.0 ± 0.0 | 2.365 ± 0.503 | 2.522 ± 0.471 | 2.207 ± 0.411 | 2.128 ± 0.493 | 0.236 ± 0.134 | 1.734 ± 0.414 | 2.917 ± 0.544 | 3.311 ± 0.609 | 0.709 ± 0.326 | 1.577 ± 0.43 | 1.813 ± 0.64 | 1.34 ± 0.438 | 1.104 ± 0.313 | 2.286 ± 0.346 | 1.813 ± 0.346 | 2.365 ± 0.377 | 0.079 ± 0.079 | 1.182 ± 0.49 | 0.0 ± 0.0
Gln | 4.02 ± 0.608 | 0.158 ± 0.097 | 2.128 ± 0.384 | 3.941 ± 0.781 | 2.522 ± 0.927 | 4.178 ± 0.701 | 0.394 ± 0.175 | 1.971 ± 0.572 | 4.335 ± 0.819 | 4.887 ± 0.601 | 0.946 ± 0.291 | 3.39 ± 0.727 | 1.419 ± 0.44 | 4.572 ± 1.152 | 2.207 ± 0.56 | 2.838 ± 0.639 | 1.813 ± 0.482 | 2.522 ± 0.497 | 0.946 ± 0.6 | 1.655 ± 0.412 | 0.0 ± 0.0
Arg | 3.547 ± 0.555 | 0.394 ± 0.152 | 1.655 ± 0.375 | 2.995 ± 0.487 | 1.655 ± 0.345 | 1.655 ± 0.442 | 0.709 ± 0.339 | 2.917 ± 0.392 | 3.784 ± 0.577 | 3.863 ± 0.575 | 1.261 ± 0.24 | 2.917 ± 0.459 | 0.394 ± 0.177 | 2.601 ± 0.577 | 2.05 ± 0.444 | 1.971 ± 0.335 | 1.892 ± 0.406 | 2.601 ± 0.399 | 0.473 ± 0.164 | 2.05 ± 0.418 | 0.0 ± 0.0
Ser | 4.02 ± 0.487 | 0.473 ± 0.2 | 2.917 ± 0.494 | 3.547 ± 0.638 | 2.444 ± 0.513 | 4.572 ± 0.746 | 0.631 ± 0.242 | 2.995 ± 0.463 | 4.73 ± 0.512 | 4.257 ± 0.499 | 1.419 ± 0.439 | 3.232 ± 0.565 | 1.655 ± 0.434 | 2.838 ± 0.605 | 2.601 ± 0.48 | 4.887 ± 0.835 | 2.68 ± 0.538 | 3.311 ± 0.492 | 0.473 ± 0.169 | 3.153 ± 0.482 | 0.0 ± 0.0
Thr | 5.36 ± 0.879 | 0.473 ± 0.247 | 3.311 ± 0.483 | 3.863 ± 0.624 | 3.39 ± 0.848 | 5.124 ± 0.737 | 0.631 ± 0.267 | 3.153 ± 0.571 | 4.493 ± 0.578 | 3.784 ± 0.525 | 1.261 ± 0.27 | 4.099 ± 0.662 | 2.838 ± 0.523 | 2.68 ± 0.455 | 2.128 ± 0.421 | 2.286 ± 0.403 | 3.547 ± 0.678 | 3.311 ± 0.532 | 0.867 ± 0.255 | 2.444 ± 0.631 | 0.0 ± 0.0
Val | 4.73 ± 0.888 | 0.473 ± 0.245 | 4.178 ± 0.582 | 3.626 ± 0.653 | 1.813 ± 0.525 | 3.311 ± 0.642 | 1.104 ± 0.32 | 2.917 ± 0.473 | 5.754 ± 0.633 | 3.941 ± 0.582 | 1.419 ± 0.32 | 2.917 ± 0.426 | 2.207 ± 0.341 | 2.286 ± 0.355 | 2.286 ± 0.542 | 3.547 ± 0.443 | 3.941 ± 0.437 | 3.311 ± 0.492 | 0.788 ± 0.27 | 2.05 ± 0.445 | 0.0 ± 0.0
Trp | 0.867 ± 0.261 | 0.158 ± 0.101 | 0.315 ± 0.162 | 0.473 ± 0.234 | 0.552 ± 0.158 | 0.394 ± 0.15 | 0.158 ± 0.122 | 0.552 ± 0.312 | 1.419 ± 0.37 | 1.182 ± 0.25 | 0.394 ± 0.172 | 0.473 ± 0.268 | 0.158 ± 0.174 | 1.104 ± 0.578 | 0.631 ± 0.277 | 1.182 ± 0.357 | 0.946 ± 0.345 | 0.236 ± 0.154 | 0.079 ± 0.079 | 0.709 ± 0.21 | 0.0 ± 0.0
Tyr | 2.207 ± 0.41 | 0.552 ± 0.269 | 2.601 ± 0.453 | 2.995 ± 0.631 | 1.577 ± 0.375 | 2.68 ± 0.476 | 0.631 ± 0.224 | 2.286 ± 0.416 | 3.547 ± 0.788 | 2.286 ± 0.362 | 0.867 ± 0.282 | 3.074 ± 0.498 | 1.025 ± 0.285 | 1.419 ± 0.372 | 2.207 ± 0.493 | 3.074 ± 0.527 | 2.601 ± 0.495 | 2.128 ± 0.49 | 0.788 ± 0.221 | 1.813 ± 0.418 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 50 proteins (12687 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
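A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the sample standard deviation of those values is reported as the error. The function names, the one-letter input alphabet, and the choice of standard deviation as the error measure are hypothetical and not confirmed by the text.

```python
import random
import statistics
from collections import Counter

def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Using the sample standard deviation as the error is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "AKKA", "GGGG", "MKLA"]
print(dipeptide_frequency(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the set. A dipeptide that never occurs gets an error of exactly zero in this sketch, which matches the form of the 0.0 ± 0.0 entries.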

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski