Amino acid dipepetide frequency for Cellulophaga phage phi46:1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.801AlaAla: 6.801 ± 1.527
0.559AlaCys: 0.559 ± 0.237
2.888AlaAsp: 2.888 ± 0.504
6.336AlaGlu: 6.336 ± 1.151
3.447AlaPhe: 3.447 ± 0.481
5.497AlaGly: 5.497 ± 0.814
1.398AlaHis: 1.398 ± 0.335
4.286AlaIle: 4.286 ± 0.886
4.938AlaLys: 4.938 ± 0.615
7.081AlaLeu: 7.081 ± 1.228
1.398AlaMet: 1.398 ± 0.411
3.634AlaAsn: 3.634 ± 0.719
1.025AlaPro: 1.025 ± 0.285
3.075AlaGln: 3.075 ± 0.65
2.702AlaArg: 2.702 ± 0.495
3.82AlaSer: 3.82 ± 0.774
3.075AlaThr: 3.075 ± 0.725
4.193AlaVal: 4.193 ± 0.593
1.025AlaTrp: 1.025 ± 0.301
2.05AlaTyr: 2.05 ± 0.49
0.0AlaXaa: 0.0 ± 0.0
Cys
0.466CysAla: 0.466 ± 0.195
0.093CysCys: 0.093 ± 0.093
0.932CysAsp: 0.932 ± 0.318
0.373CysGlu: 0.373 ± 0.179
0.28CysPhe: 0.28 ± 0.187
0.466CysGly: 0.466 ± 0.219
0.466CysHis: 0.466 ± 0.18
0.559CysIle: 0.559 ± 0.193
1.025CysLys: 1.025 ± 0.387
1.118CysLeu: 1.118 ± 0.384
0.093CysMet: 0.093 ± 0.095
0.652CysAsn: 0.652 ± 0.236
0.559CysPro: 0.559 ± 0.235
0.373CysGln: 0.373 ± 0.185
0.093CysArg: 0.093 ± 0.103
0.559CysSer: 0.559 ± 0.25
0.559CysThr: 0.559 ± 0.222
0.093CysVal: 0.093 ± 0.088
0.0CysTrp: 0.0 ± 0.0
0.186CysTyr: 0.186 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
3.54AspAla: 3.54 ± 0.775
0.466AspCys: 0.466 ± 0.205
3.913AspAsp: 3.913 ± 0.756
4.565AspGlu: 4.565 ± 0.785
4.938AspPhe: 4.938 ± 0.658
3.447AspGly: 3.447 ± 0.644
0.652AspHis: 0.652 ± 0.299
3.82AspIle: 3.82 ± 0.688
4.752AspLys: 4.752 ± 0.742
5.683AspLeu: 5.683 ± 0.563
1.304AspMet: 1.304 ± 0.339
2.981AspAsn: 2.981 ± 0.392
1.957AspPro: 1.957 ± 0.557
1.118AspGln: 1.118 ± 0.363
1.118AspArg: 1.118 ± 0.289
2.795AspSer: 2.795 ± 0.464
1.957AspThr: 1.957 ± 0.446
3.913AspVal: 3.913 ± 0.646
0.466AspTrp: 0.466 ± 0.212
2.981AspTyr: 2.981 ± 0.519
0.0AspXaa: 0.0 ± 0.0
Glu
6.149GluAla: 6.149 ± 1.293
0.839GluCys: 0.839 ± 0.282
3.447GluAsp: 3.447 ± 0.482
4.006GluGlu: 4.006 ± 0.599
3.727GluPhe: 3.727 ± 0.54
3.261GluGly: 3.261 ± 0.43
1.025GluHis: 1.025 ± 0.339
6.615GluIle: 6.615 ± 0.781
8.199GluLys: 8.199 ± 0.83
8.479GluLeu: 8.479 ± 1.294
1.211GluMet: 1.211 ± 0.349
6.056GluAsn: 6.056 ± 0.814
1.211GluPro: 1.211 ± 0.316
3.075GluGln: 3.075 ± 0.571
2.795GluArg: 2.795 ± 0.447
4.193GluSer: 4.193 ± 0.638
4.938GluThr: 4.938 ± 0.562
2.981GluVal: 2.981 ± 0.52
0.28GluTrp: 0.28 ± 0.161
2.516GluTyr: 2.516 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
2.05PheAla: 2.05 ± 0.421
0.373PheCys: 0.373 ± 0.175
3.168PheAsp: 3.168 ± 0.513
4.938PheGlu: 4.938 ± 0.92
2.422PhePhe: 2.422 ± 0.398
2.981PheGly: 2.981 ± 0.496
0.745PheHis: 0.745 ± 0.246
3.727PheIle: 3.727 ± 0.542
4.006PheLys: 4.006 ± 0.634
4.006PheLeu: 4.006 ± 0.608
1.584PheMet: 1.584 ± 0.308
3.354PheAsn: 3.354 ± 0.563
1.398PhePro: 1.398 ± 0.394
1.584PheGln: 1.584 ± 0.434
1.584PheArg: 1.584 ± 0.37
4.1PheSer: 4.1 ± 0.497
2.981PheThr: 2.981 ± 0.592
2.795PheVal: 2.795 ± 0.47
0.559PheTrp: 0.559 ± 0.197
1.957PheTyr: 1.957 ± 0.498
0.0PheXaa: 0.0 ± 0.0
Gly
5.124GlyAla: 5.124 ± 1.013
0.28GlyCys: 0.28 ± 0.15
1.491GlyAsp: 1.491 ± 0.407
2.888GlyGlu: 2.888 ± 0.574
5.031GlyPhe: 5.031 ± 0.717
4.565GlyGly: 4.565 ± 0.784
0.652GlyHis: 0.652 ± 0.242
3.168GlyIle: 3.168 ± 0.627
4.379GlyLys: 4.379 ± 0.727
7.267GlyLeu: 7.267 ± 0.878
1.957GlyMet: 1.957 ± 0.392
3.168GlyAsn: 3.168 ± 0.524
0.0GlyPro: 0.0 ± 0.0
1.677GlyGln: 1.677 ± 0.515
2.702GlyArg: 2.702 ± 0.49
4.565GlySer: 4.565 ± 0.843
2.702GlyThr: 2.702 ± 0.517
5.497GlyVal: 5.497 ± 0.74
0.373GlyTrp: 0.373 ± 0.185
2.05GlyTyr: 2.05 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
0.745HisAla: 0.745 ± 0.276
0.28HisCys: 0.28 ± 0.161
0.839HisAsp: 0.839 ± 0.257
1.304HisGlu: 1.304 ± 0.34
0.652HisPhe: 0.652 ± 0.223
0.745HisGly: 0.745 ± 0.233
0.28HisHis: 0.28 ± 0.157
1.118HisIle: 1.118 ± 0.325
1.491HisLys: 1.491 ± 0.45
1.863HisLeu: 1.863 ± 0.389
0.186HisMet: 0.186 ± 0.129
0.932HisAsn: 0.932 ± 0.313
0.186HisPro: 0.186 ± 0.165
0.466HisGln: 0.466 ± 0.207
0.652HisArg: 0.652 ± 0.235
1.491HisSer: 1.491 ± 0.42
1.118HisThr: 1.118 ± 0.295
0.839HisVal: 0.839 ± 0.259
0.093HisTrp: 0.093 ± 0.101
0.652HisTyr: 0.652 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
4.938IleAla: 4.938 ± 1.148
0.652IleCys: 0.652 ± 0.247
5.404IleAsp: 5.404 ± 0.667
6.895IleGlu: 6.895 ± 0.697
3.168IlePhe: 3.168 ± 0.511
2.888IleGly: 2.888 ± 0.495
1.584IleHis: 1.584 ± 0.328
5.218IleIle: 5.218 ± 0.542
8.479IleLys: 8.479 ± 0.848
4.938IleLeu: 4.938 ± 0.905
1.118IleMet: 1.118 ± 0.308
5.683IleAsn: 5.683 ± 0.685
1.584IlePro: 1.584 ± 0.452
1.863IleGln: 1.863 ± 0.363
2.795IleArg: 2.795 ± 0.46
4.379IleSer: 4.379 ± 0.812
3.727IleThr: 3.727 ± 0.628
3.447IleVal: 3.447 ± 0.531
0.186IleTrp: 0.186 ± 0.126
2.888IleTyr: 2.888 ± 0.579
0.0IleXaa: 0.0 ± 0.0
Lys
7.454LysAla: 7.454 ± 0.951
0.652LysCys: 0.652 ± 0.269
5.404LysAsp: 5.404 ± 0.796
7.36LysGlu: 7.36 ± 1.167
2.981LysPhe: 2.981 ± 0.608
5.031LysGly: 5.031 ± 0.723
1.584LysHis: 1.584 ± 0.548
6.801LysIle: 6.801 ± 0.819
8.665LysLys: 8.665 ± 1.023
7.081LysLeu: 7.081 ± 0.764
2.516LysMet: 2.516 ± 0.424
5.87LysAsn: 5.87 ± 0.772
2.702LysPro: 2.702 ± 0.621
2.702LysGln: 2.702 ± 0.532
5.497LysArg: 5.497 ± 0.825
5.218LysSer: 5.218 ± 0.633
7.267LysThr: 7.267 ± 0.757
5.031LysVal: 5.031 ± 0.678
1.118LysTrp: 1.118 ± 0.349
3.168LysTyr: 3.168 ± 0.571
0.0LysXaa: 0.0 ± 0.0
Leu
5.031LeuAla: 5.031 ± 0.619
0.932LeuCys: 0.932 ± 0.361
5.124LeuAsp: 5.124 ± 0.692
6.056LeuGlu: 6.056 ± 0.956
4.193LeuPhe: 4.193 ± 0.46
5.031LeuGly: 5.031 ± 0.841
1.398LeuHis: 1.398 ± 0.379
6.708LeuIle: 6.708 ± 0.759
10.528LeuLys: 10.528 ± 1.119
9.131LeuLeu: 9.131 ± 1.279
2.702LeuMet: 2.702 ± 0.458
5.683LeuAsn: 5.683 ± 0.694
3.168LeuPro: 3.168 ± 0.491
3.261LeuGln: 3.261 ± 0.503
3.447LeuArg: 3.447 ± 0.641
7.174LeuSer: 7.174 ± 0.739
6.336LeuThr: 6.336 ± 0.844
3.54LeuVal: 3.54 ± 0.615
0.466LeuTrp: 0.466 ± 0.202
3.075LeuTyr: 3.075 ± 0.519
0.0LeuXaa: 0.0 ± 0.0
Met
1.491MetAla: 1.491 ± 0.357
0.373MetCys: 0.373 ± 0.204
1.025MetAsp: 1.025 ± 0.281
1.677MetGlu: 1.677 ± 0.471
0.932MetPhe: 0.932 ± 0.292
1.398MetGly: 1.398 ± 0.385
0.559MetHis: 0.559 ± 0.249
1.211MetIle: 1.211 ± 0.417
2.236MetLys: 2.236 ± 0.456
1.118MetLeu: 1.118 ± 0.277
0.186MetMet: 0.186 ± 0.137
1.491MetAsn: 1.491 ± 0.442
0.373MetPro: 0.373 ± 0.178
0.186MetGln: 0.186 ± 0.136
0.839MetArg: 0.839 ± 0.303
1.957MetSer: 1.957 ± 0.355
1.211MetThr: 1.211 ± 0.29
0.932MetVal: 0.932 ± 0.263
0.186MetTrp: 0.186 ± 0.133
0.652MetTyr: 0.652 ± 0.241
0.0MetXaa: 0.0 ± 0.0
Asn
2.981AsnAla: 2.981 ± 0.648
0.559AsnCys: 0.559 ± 0.252
3.447AsnAsp: 3.447 ± 0.595
4.938AsnGlu: 4.938 ± 0.628
2.516AsnPhe: 2.516 ± 0.571
5.124AsnGly: 5.124 ± 0.679
0.839AsnHis: 0.839 ± 0.347
4.286AsnIle: 4.286 ± 0.588
5.683AsnLys: 5.683 ± 0.688
4.752AsnLeu: 4.752 ± 0.668
1.304AsnMet: 1.304 ± 0.361
2.981AsnAsn: 2.981 ± 0.551
2.516AsnPro: 2.516 ± 0.504
2.236AsnGln: 2.236 ± 0.564
3.075AsnArg: 3.075 ± 0.462
4.565AsnSer: 4.565 ± 0.717
2.236AsnThr: 2.236 ± 0.485
2.981AsnVal: 2.981 ± 0.622
0.466AsnTrp: 0.466 ± 0.202
2.981AsnTyr: 2.981 ± 0.685
0.0AsnXaa: 0.0 ± 0.0
Pro
2.702ProAla: 2.702 ± 0.474
0.093ProCys: 0.093 ± 0.077
1.863ProAsp: 1.863 ± 0.36
2.795ProGlu: 2.795 ± 0.542
1.677ProPhe: 1.677 ± 0.322
0.0ProGly: 0.0 ± 0.0
0.28ProHis: 0.28 ± 0.139
2.236ProIle: 2.236 ± 0.379
1.863ProLys: 1.863 ± 0.332
3.354ProLeu: 3.354 ± 0.659
0.186ProMet: 0.186 ± 0.12
1.491ProAsn: 1.491 ± 0.438
0.466ProPro: 0.466 ± 0.194
0.652ProGln: 0.652 ± 0.288
1.304ProArg: 1.304 ± 0.359
2.143ProSer: 2.143 ± 0.516
2.143ProThr: 2.143 ± 0.487
1.398ProVal: 1.398 ± 0.388
0.093ProTrp: 0.093 ± 0.09
0.932ProTyr: 0.932 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
2.236GlnAla: 2.236 ± 0.482
0.093GlnCys: 0.093 ± 0.104
1.957GlnAsp: 1.957 ± 0.409
2.236GlnGlu: 2.236 ± 0.639
1.211GlnPhe: 1.211 ± 0.303
1.77GlnGly: 1.77 ± 0.477
0.652GlnHis: 0.652 ± 0.297
3.168GlnIle: 3.168 ± 0.473
3.261GlnLys: 3.261 ± 0.513
2.888GlnLeu: 2.888 ± 0.535
0.559GlnMet: 0.559 ± 0.255
0.839GlnAsn: 0.839 ± 0.446
1.304GlnPro: 1.304 ± 0.298
1.304GlnGln: 1.304 ± 0.468
1.118GlnArg: 1.118 ± 0.293
1.491GlnSer: 1.491 ± 0.367
1.957GlnThr: 1.957 ± 0.488
1.77GlnVal: 1.77 ± 0.432
0.373GlnTrp: 0.373 ± 0.173
0.932GlnTyr: 0.932 ± 0.231
0.0GlnXaa: 0.0 ± 0.0
Arg
2.236ArgAla: 2.236 ± 0.463
0.373ArgCys: 0.373 ± 0.174
2.888ArgAsp: 2.888 ± 0.554
2.422ArgGlu: 2.422 ± 0.587
1.957ArgPhe: 1.957 ± 0.427
2.329ArgGly: 2.329 ± 0.569
0.559ArgHis: 0.559 ± 0.219
3.354ArgIle: 3.354 ± 0.496
3.54ArgLys: 3.54 ± 0.679
3.261ArgLeu: 3.261 ± 0.564
0.466ArgMet: 0.466 ± 0.184
2.702ArgAsn: 2.702 ± 0.49
0.652ArgPro: 0.652 ± 0.22
1.584ArgGln: 1.584 ± 0.332
2.329ArgArg: 2.329 ± 0.492
2.422ArgSer: 2.422 ± 0.61
2.516ArgThr: 2.516 ± 0.509
3.168ArgVal: 3.168 ± 0.52
0.559ArgTrp: 0.559 ± 0.197
2.143ArgTyr: 2.143 ± 0.453
0.0ArgXaa: 0.0 ± 0.0
Ser
4.006SerAla: 4.006 ± 0.657
0.559SerCys: 0.559 ± 0.226
3.168SerAsp: 3.168 ± 0.623
5.311SerGlu: 5.311 ± 0.656
2.422SerPhe: 2.422 ± 0.439
5.311SerGly: 5.311 ± 0.658
0.745SerHis: 0.745 ± 0.304
5.124SerIle: 5.124 ± 0.764
6.988SerLys: 6.988 ± 0.879
5.404SerLeu: 5.404 ± 0.777
0.652SerMet: 0.652 ± 0.261
4.938SerAsn: 4.938 ± 0.876
2.329SerPro: 2.329 ± 0.564
1.584SerGln: 1.584 ± 0.503
2.329SerArg: 2.329 ± 0.502
3.261SerSer: 3.261 ± 0.644
2.795SerThr: 2.795 ± 0.431
4.752SerVal: 4.752 ± 0.693
0.839SerTrp: 0.839 ± 0.262
2.236SerTyr: 2.236 ± 0.614
0.0SerXaa: 0.0 ± 0.0
Thr
4.659ThrAla: 4.659 ± 0.835
0.466ThrCys: 0.466 ± 0.197
2.981ThrAsp: 2.981 ± 0.566
3.913ThrGlu: 3.913 ± 0.684
3.354ThrPhe: 3.354 ± 0.463
3.447ThrGly: 3.447 ± 0.619
1.025ThrHis: 1.025 ± 0.301
3.913ThrIle: 3.913 ± 0.672
4.193ThrLys: 4.193 ± 0.586
5.683ThrLeu: 5.683 ± 0.731
0.932ThrMet: 0.932 ± 0.283
2.981ThrAsn: 2.981 ± 0.744
3.261ThrPro: 3.261 ± 0.502
1.398ThrGln: 1.398 ± 0.3
2.236ThrArg: 2.236 ± 0.546
2.795ThrSer: 2.795 ± 0.565
3.913ThrThr: 3.913 ± 0.524
3.727ThrVal: 3.727 ± 0.465
0.559ThrTrp: 0.559 ± 0.23
1.584ThrTyr: 1.584 ± 0.315
0.0ThrXaa: 0.0 ± 0.0
Val
3.913ValAla: 3.913 ± 0.662
0.28ValCys: 0.28 ± 0.223
3.727ValAsp: 3.727 ± 0.614
4.659ValGlu: 4.659 ± 0.558
2.422ValPhe: 2.422 ± 0.429
3.447ValGly: 3.447 ± 0.72
0.839ValHis: 0.839 ± 0.214
3.075ValIle: 3.075 ± 0.506
6.149ValLys: 6.149 ± 0.743
4.472ValLeu: 4.472 ± 0.585
1.025ValMet: 1.025 ± 0.292
2.516ValAsn: 2.516 ± 0.436
1.957ValPro: 1.957 ± 0.28
1.677ValGln: 1.677 ± 0.3
2.609ValArg: 2.609 ± 0.464
4.472ValSer: 4.472 ± 0.719
3.168ValThr: 3.168 ± 0.598
3.447ValVal: 3.447 ± 0.743
0.839ValTrp: 0.839 ± 0.226
2.05ValTyr: 2.05 ± 0.599
0.0ValXaa: 0.0 ± 0.0
Trp
0.466TrpAla: 0.466 ± 0.203
0.373TrpCys: 0.373 ± 0.166
0.652TrpAsp: 0.652 ± 0.265
0.745TrpGlu: 0.745 ± 0.259
0.559TrpPhe: 0.559 ± 0.235
0.373TrpGly: 0.373 ± 0.15
0.093TrpHis: 0.093 ± 0.097
0.652TrpIle: 0.652 ± 0.26
0.839TrpLys: 0.839 ± 0.268
1.025TrpLeu: 1.025 ± 0.367
0.093TrpMet: 0.093 ± 0.087
0.466TrpAsn: 0.466 ± 0.187
0.0TrpPro: 0.0 ± 0.0
0.466TrpGln: 0.466 ± 0.2
0.559TrpArg: 0.559 ± 0.213
0.28TrpSer: 0.28 ± 0.16
0.28TrpThr: 0.28 ± 0.155
0.186TrpVal: 0.186 ± 0.139
0.093TrpTrp: 0.093 ± 0.097
0.559TrpTyr: 0.559 ± 0.231
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.236TyrAla: 2.236 ± 0.352
0.652TyrCys: 0.652 ± 0.241
2.05TyrAsp: 2.05 ± 0.489
1.584TyrGlu: 1.584 ± 0.345
2.422TyrPhe: 2.422 ± 0.562
2.422TyrGly: 2.422 ± 0.498
0.559TyrHis: 0.559 ± 0.219
2.795TyrIle: 2.795 ± 0.477
2.981TyrLys: 2.981 ± 0.566
4.379TyrLeu: 4.379 ± 0.718
0.466TyrMet: 0.466 ± 0.221
2.05TyrAsn: 2.05 ± 0.469
1.025TyrPro: 1.025 ± 0.335
0.839TyrGln: 0.839 ± 0.27
1.677TyrArg: 1.677 ± 0.443
3.075TyrSer: 3.075 ± 0.546
2.143TyrThr: 2.143 ± 0.49
2.05TyrVal: 2.05 ± 0.406
0.186TyrTrp: 0.186 ± 0.136
2.143TyrTyr: 2.143 ± 0.513
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (10734 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski