Amino acid dipeptide frequency for Clostridium phage phiCD506

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
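As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the pooling of counts over all proteins, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for this example; the page does not state how the database computes its values.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (not taken from the phage proteome).
freqs = dipeptide_per_mille(["MKKLE", "MKE"])
```

Here the two toy sequences yield six dipeptides in total, two of which are `MK`, so `freqs["MK"]` is one third of 1000.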

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
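The expected value and the red/blue marking can be sketched as follows. This assumes the uniform baseline described above (1/400 of all dipeptides) is the threshold; the page does not say whether the colouring uses exactly this cut-off, and the names `EXPECTED_PER_MILLE` and `classify` are made up for the example. The observed values are copied from the table.

```python
N_STANDARD = 20
# 1000 / 400 = 2.5 per mille, i.e. 0.25 %.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2

def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_per_mille > expected:
        return "over"      # would be marked red
    if observed_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"

# A few observed frequencies (per mille) taken from the table.
examples = {"GluGlu": 9.997, "LysGlu": 12.78, "CysCys": 0.0, "AlaAla": 2.164}
labels = {name: classify(value) for name, value in examples.items()}
```

With this baseline, GluGlu and LysGlu come out as overrepresented, while CysCys and AlaAla fall below 2.5‰.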

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions (for example, 2.164‰ = 0.002164).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.164 ± 0.521
AlaCys: 0.928 ± 0.339
AlaAsp: 2.473 ± 0.45
AlaGlu: 4.019 ± 0.745
AlaPhe: 1.443 ± 0.322
AlaGly: 3.195 ± 0.755
AlaHis: 0.824 ± 0.252
AlaIle: 5.978 ± 0.804
AlaLys: 4.741 ± 0.739
AlaLeu: 4.329 ± 0.655
AlaMet: 1.34 ± 0.322
AlaAsn: 3.71 ± 0.517
AlaPro: 1.237 ± 0.381
AlaGln: 1.237 ± 0.423
AlaArg: 1.546 ± 0.407
AlaSer: 3.813 ± 0.89
AlaThr: 3.607 ± 0.666
AlaVal: 2.989 ± 0.628
AlaTrp: 0.928 ± 0.307
AlaTyr: 1.958 ± 0.455
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.618 ± 0.205
CysCys: 0.0 ± 0.0
CysAsp: 0.515 ± 0.197
CysGlu: 0.928 ± 0.304
CysPhe: 0.515 ± 0.254
CysGly: 0.721 ± 0.301
CysHis: 0.206 ± 0.151
CysIle: 0.928 ± 0.234
CysLys: 1.237 ± 0.33
CysLeu: 0.618 ± 0.206
CysMet: 0.309 ± 0.179
CysAsn: 0.928 ± 0.267
CysPro: 0.206 ± 0.127
CysGln: 0.0 ± 0.0
CysArg: 0.515 ± 0.295
CysSer: 0.721 ± 0.304
CysThr: 0.206 ± 0.138
CysVal: 0.412 ± 0.184
CysTrp: 0.206 ± 0.149
CysTyr: 0.618 ± 0.2
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.68 ± 0.502
AspCys: 1.134 ± 0.327
AspAsp: 3.298 ± 0.687
AspGlu: 5.565 ± 0.896
AspPhe: 2.989 ± 0.492
AspGly: 2.989 ± 0.687
AspHis: 0.103 ± 0.108
AspIle: 7.008 ± 0.728
AspLys: 6.39 ± 0.664
AspLeu: 5.978 ± 0.766
AspMet: 2.37 ± 0.381
AspAsn: 4.122 ± 0.74
AspPro: 1.031 ± 0.372
AspGln: 0.618 ± 0.222
AspArg: 2.37 ± 0.475
AspSer: 4.535 ± 0.82
AspThr: 2.37 ± 0.562
AspVal: 3.195 ± 0.673
AspTrp: 0.412 ± 0.198
AspTyr: 2.577 ± 0.509
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.741 ± 0.9
GluCys: 0.928 ± 0.315
GluAsp: 4.741 ± 0.722
GluGlu: 9.997 ± 1.626
GluPhe: 4.122 ± 0.476
GluGly: 4.947 ± 0.744
GluHis: 1.546 ± 0.361
GluIle: 9.172 ± 1.182
GluLys: 7.523 ± 1.088
GluLeu: 10.1 ± 1.334
GluMet: 2.164 ± 0.523
GluAsn: 7.214 ± 0.876
GluPro: 0.824 ± 0.32
GluGln: 2.886 ± 0.392
GluArg: 2.886 ± 0.441
GluSer: 3.195 ± 0.6
GluThr: 4.844 ± 0.618
GluVal: 5.05 ± 0.695
GluTrp: 0.824 ± 0.3
GluTyr: 3.504 ± 0.75
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.031 ± 0.367
PheCys: 0.412 ± 0.196
PheAsp: 2.577 ± 0.58
PheGlu: 4.432 ± 0.672
PhePhe: 1.134 ± 0.301
PheGly: 2.37 ± 0.478
PheHis: 0.206 ± 0.133
PheIle: 3.607 ± 0.54
PheLys: 4.225 ± 0.541
PheLeu: 3.092 ± 0.639
PheMet: 1.752 ± 0.387
PheAsn: 3.813 ± 0.511
PhePro: 0.928 ± 0.383
PheGln: 0.824 ± 0.258
PheArg: 0.928 ± 0.369
PheSer: 2.577 ± 0.599
PheThr: 2.267 ± 0.417
PheVal: 2.061 ± 0.497
PheTrp: 0.103 ± 0.11
PheTyr: 1.855 ± 0.426
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.989 ± 0.716
GlyCys: 0.721 ± 0.236
GlyAsp: 3.092 ± 0.751
GlyGlu: 4.329 ± 0.772
GlyPhe: 2.886 ± 0.554
GlyGly: 3.195 ± 0.7
GlyHis: 0.206 ± 0.138
GlyIle: 4.947 ± 0.712
GlyLys: 7.317 ± 0.981
GlyLeu: 4.225 ± 0.731
GlyMet: 1.546 ± 0.592
GlyAsn: 3.092 ± 0.615
GlyPro: 0.412 ± 0.224
GlyGln: 0.824 ± 0.292
GlyArg: 1.546 ± 0.444
GlySer: 1.752 ± 0.497
GlyThr: 3.71 ± 0.8
GlyVal: 3.607 ± 0.608
GlyTrp: 0.618 ± 0.262
GlyTyr: 2.37 ± 0.526
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.721 ± 0.243
HisCys: 0.0 ± 0.0
HisAsp: 0.618 ± 0.217
HisGlu: 1.031 ± 0.396
HisPhe: 0.515 ± 0.215
HisGly: 0.412 ± 0.174
HisHis: 0.206 ± 0.149
HisIle: 0.721 ± 0.191
HisLys: 1.134 ± 0.282
HisLeu: 0.721 ± 0.301
HisMet: 0.309 ± 0.27
HisAsn: 0.928 ± 0.331
HisPro: 0.0 ± 0.0
HisGln: 0.412 ± 0.198
HisArg: 0.309 ± 0.181
HisSer: 0.412 ± 0.253
HisThr: 0.721 ± 0.37
HisVal: 0.412 ± 0.186
HisTrp: 0.0 ± 0.0
HisTyr: 0.618 ± 0.254
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.978 ± 0.805
IleCys: 1.134 ± 0.331
IleAsp: 6.081 ± 0.824
IleGlu: 9.894 ± 1.027
IlePhe: 3.195 ± 0.598
IleGly: 4.741 ± 0.988
IleHis: 1.134 ± 0.362
IleIle: 6.081 ± 0.875
IleLys: 10.1 ± 1.125
IleLeu: 8.554 ± 0.871
IleMet: 2.37 ± 0.49
IleAsn: 5.462 ± 0.785
IlePro: 2.783 ± 0.611
IleGln: 2.164 ± 0.338
IleArg: 2.577 ± 0.456
IleSer: 5.565 ± 1.015
IleThr: 4.329 ± 0.689
IleVal: 5.359 ± 0.795
IleTrp: 0.412 ± 0.276
IleTyr: 5.359 ± 1.131
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.493 ± 0.712
LysCys: 0.824 ± 0.302
LysAsp: 7.111 ± 0.667
LysGlu: 12.78 ± 1.075
LysPhe: 3.401 ± 0.631
LysGly: 4.535 ± 0.742
LysHis: 1.752 ± 0.372
LysIle: 10.924 ± 0.92
LysLys: 10.306 ± 1.233
LysLeu: 9.688 ± 1.015
LysMet: 2.783 ± 0.661
LysAsn: 5.05 ± 0.899
LysPro: 1.958 ± 0.404
LysGln: 3.195 ± 0.679
LysArg: 3.298 ± 0.575
LysSer: 5.565 ± 1.108
LysThr: 5.978 ± 0.877
LysVal: 6.596 ± 0.823
LysTrp: 0.928 ± 0.286
LysTyr: 3.71 ± 0.466
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.462 ± 0.89
LeuCys: 0.721 ± 0.344
LeuAsp: 6.493 ± 0.802
LeuGlu: 7.523 ± 1.269
LeuPhe: 3.71 ± 0.53
LeuGly: 3.607 ± 0.673
LeuHis: 0.412 ± 0.197
LeuIle: 6.184 ± 0.878
LeuLys: 10.615 ± 1.192
LeuLeu: 5.978 ± 0.865
LeuMet: 1.855 ± 0.468
LeuAsn: 5.874 ± 0.844
LeuPro: 1.34 ± 0.396
LeuGln: 2.886 ± 0.515
LeuArg: 4.122 ± 0.699
LeuSer: 6.802 ± 1.034
LeuThr: 3.813 ± 0.721
LeuVal: 4.019 ± 0.502
LeuTrp: 0.824 ± 0.35
LeuTyr: 2.989 ± 0.594
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.855 ± 0.731
MetCys: 0.103 ± 0.1
MetAsp: 1.34 ± 0.33
MetGlu: 1.958 ± 0.505
MetPhe: 1.546 ± 0.443
MetGly: 1.855 ± 0.559
MetHis: 0.206 ± 0.151
MetIle: 2.37 ± 0.507
MetLys: 2.267 ± 0.55
MetLeu: 2.783 ± 0.549
MetMet: 0.206 ± 0.133
MetAsn: 2.061 ± 0.503
MetPro: 0.515 ± 0.226
MetGln: 0.824 ± 0.262
MetArg: 0.824 ± 0.437
MetSer: 1.958 ± 0.425
MetThr: 1.031 ± 0.295
MetVal: 1.134 ± 0.391
MetTrp: 0.103 ± 0.104
MetTyr: 0.824 ± 0.24
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.401 ± 0.652
AsnCys: 0.206 ± 0.135
AsnAsp: 3.298 ± 0.756
AsnGlu: 5.256 ± 0.811
AsnPhe: 2.473 ± 0.524
AsnGly: 3.813 ± 0.852
AsnHis: 0.309 ± 0.172
AsnIle: 8.348 ± 1.041
AsnLys: 8.554 ± 0.843
AsnLeu: 5.462 ± 0.761
AsnMet: 1.649 ± 0.363
AsnAsn: 4.741 ± 0.583
AsnPro: 1.752 ± 0.502
AsnGln: 0.824 ± 0.276
AsnArg: 2.164 ± 0.448
AsnSer: 4.019 ± 0.591
AsnThr: 2.989 ± 0.627
AsnVal: 4.638 ± 0.621
AsnTrp: 0.928 ± 0.294
AsnTyr: 3.195 ± 0.571
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.237 ± 0.354
ProCys: 0.206 ± 0.135
ProAsp: 0.824 ± 0.257
ProGlu: 1.031 ± 0.352
ProPhe: 0.515 ± 0.25
ProGly: 1.237 ± 0.398
ProHis: 0.515 ± 0.23
ProIle: 2.164 ± 0.494
ProLys: 1.855 ± 0.438
ProLeu: 1.34 ± 0.379
ProMet: 0.309 ± 0.164
ProAsn: 1.649 ± 0.439
ProPro: 0.103 ± 0.107
ProGln: 0.515 ± 0.203
ProArg: 0.928 ± 0.291
ProSer: 2.267 ± 0.52
ProThr: 1.34 ± 0.353
ProVal: 1.134 ± 0.539
ProTrp: 0.309 ± 0.179
ProTyr: 0.824 ± 0.342
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.855 ± 0.542
GlnCys: 0.206 ± 0.146
GlnAsp: 1.752 ± 0.379
GlnGlu: 2.37 ± 0.485
GlnPhe: 0.515 ± 0.228
GlnGly: 1.443 ± 0.342
GlnHis: 0.206 ± 0.142
GlnIle: 2.68 ± 0.47
GlnLys: 2.577 ± 0.483
GlnLeu: 2.061 ± 0.584
GlnMet: 0.618 ± 0.342
GlnAsn: 1.134 ± 0.329
GlnPro: 0.515 ± 0.252
GlnGln: 1.134 ± 0.42
GlnArg: 0.824 ± 0.3
GlnSer: 1.031 ± 0.308
GlnThr: 1.443 ± 0.327
GlnVal: 0.928 ± 0.236
GlnTrp: 0.103 ± 0.108
GlnTyr: 1.443 ± 0.319
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.34 ± 0.563
ArgCys: 0.309 ± 0.142
ArgAsp: 1.546 ± 0.349
ArgGlu: 3.607 ± 0.497
ArgPhe: 1.34 ± 0.295
ArgGly: 2.267 ± 0.38
ArgHis: 0.721 ± 0.242
ArgIle: 3.195 ± 0.596
ArgLys: 3.401 ± 0.668
ArgLeu: 1.855 ± 0.487
ArgMet: 1.031 ± 0.304
ArgAsn: 2.37 ± 0.498
ArgPro: 0.515 ± 0.235
ArgGln: 1.134 ± 0.26
ArgArg: 2.267 ± 0.555
ArgSer: 1.752 ± 0.477
ArgThr: 2.164 ± 0.509
ArgVal: 1.237 ± 0.316
ArgTrp: 0.309 ± 0.175
ArgTyr: 1.34 ± 0.331
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.68 ± 0.669
SerCys: 0.721 ± 0.278
SerAsp: 3.607 ± 0.613
SerGlu: 3.916 ± 0.565
SerPhe: 4.122 ± 0.716
SerGly: 2.989 ± 0.788
SerHis: 0.412 ± 0.198
SerIle: 4.741 ± 0.926
SerLys: 6.39 ± 0.844
SerLeu: 4.844 ± 0.618
SerMet: 1.546 ± 0.385
SerAsn: 5.462 ± 0.8
SerPro: 1.546 ± 0.411
SerGln: 1.134 ± 0.386
SerArg: 1.958 ± 0.401
SerSer: 3.71 ± 0.936
SerThr: 3.504 ± 0.814
SerVal: 2.68 ± 0.61
SerTrp: 0.824 ± 0.253
SerTyr: 2.783 ± 0.478
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.577 ± 0.541
ThrCys: 0.206 ± 0.216
ThrAsp: 3.916 ± 0.553
ThrGlu: 5.668 ± 0.81
ThrPhe: 1.34 ± 0.348
ThrGly: 3.298 ± 0.886
ThrHis: 0.412 ± 0.201
ThrIle: 5.462 ± 0.733
ThrLys: 5.978 ± 0.795
ThrLeu: 5.359 ± 0.735
ThrMet: 0.928 ± 0.328
ThrAsn: 3.092 ± 0.467
ThrPro: 1.443 ± 0.467
ThrGln: 1.443 ± 0.458
ThrArg: 1.443 ± 0.322
ThrSer: 3.092 ± 0.611
ThrThr: 3.916 ± 0.952
ThrVal: 2.886 ± 0.513
ThrTrp: 0.618 ± 0.232
ThrTyr: 1.855 ± 0.358
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.783 ± 0.729
ValCys: 0.721 ± 0.257
ValAsp: 3.813 ± 0.658
ValGlu: 3.195 ± 0.682
ValPhe: 2.164 ± 0.42
ValGly: 3.71 ± 0.569
ValHis: 0.206 ± 0.157
ValIle: 3.71 ± 0.744
ValLys: 6.39 ± 0.764
ValLeu: 5.05 ± 0.934
ValMet: 1.34 ± 0.337
ValAsn: 3.607 ± 0.715
ValPro: 1.855 ± 0.331
ValGln: 1.958 ± 0.37
ValArg: 1.546 ± 0.398
ValSer: 3.813 ± 0.828
ValThr: 3.195 ± 0.648
ValVal: 4.019 ± 0.661
ValTrp: 0.309 ± 0.196
ValTyr: 2.886 ± 0.411
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.515 ± 0.21
TrpCys: 0.412 ± 0.19
TrpAsp: 1.237 ± 0.362
TrpGlu: 0.412 ± 0.183
TrpPhe: 0.309 ± 0.167
TrpGly: 0.618 ± 0.261
TrpHis: 0.206 ± 0.149
TrpIle: 0.721 ± 0.277
TrpLys: 0.928 ± 0.278
TrpLeu: 0.824 ± 0.286
TrpMet: 0.309 ± 0.184
TrpAsn: 0.309 ± 0.178
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.103 ± 0.107
TrpSer: 0.618 ± 0.233
TrpThr: 0.412 ± 0.188
TrpVal: 0.824 ± 0.278
TrpTrp: 0.206 ± 0.144
TrpTyr: 0.412 ± 0.209
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.34 ± 0.348
TyrCys: 0.515 ± 0.234
TyrAsp: 3.401 ± 0.593
TyrGlu: 3.401 ± 0.69
TyrPhe: 1.958 ± 0.481
TyrGly: 1.443 ± 0.447
TyrHis: 0.412 ± 0.176
TyrIle: 4.329 ± 0.642
TyrLys: 4.947 ± 0.732
TyrLeu: 2.37 ± 0.51
TyrMet: 0.928 ± 0.258
TyrAsn: 3.401 ± 0.571
TyrPro: 1.34 ± 0.33
TyrGln: 0.824 ± 0.317
TyrArg: 1.546 ± 0.49
TyrSer: 2.37 ± 0.679
TyrThr: 3.092 ± 0.632
TyrVal: 3.092 ± 0.55
TyrTrp: 0.412 ± 0.185
TyrTyr: 1.134 ± 0.446
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (9704 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
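One plausible reading of a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Using the standard deviation as the reported error, the fixed seed, and the names `pair_counts` and `bootstrap_error` are all assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (not the phage proteome): three short sequences.
mean_kk, sd_kk = bootstrap_error(["MKKLE", "AAAA", "MKE"], "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.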

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski