Amino acid dipeptide frequency for Clostridium phage phiCT453B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
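As a rough illustration of the idea, the following minimal Python sketch counts overlapping adjacent pairs in a set of protein sequences. It uses one-letter codes and a made-up toy input, and it normalizes by the total number of dipeptides; the function name `dipeptide_frequencies` and this normalization are assumptions for illustration, not necessarily the procedure used by the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not real phage data.
toy = ["MKKLE", "MKE"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With the toy input there are six dipeptides in total, and MK occurs twice, once in each sequence.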

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
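The expected value quoted above follows directly from the number of possible pairs. The sketch below works it out and compares a few table entries against it. The simple greater-than or less-than rule in `classify` is an assumption; the page does not state the exact criterion behind the red and blue marking.

```python
def expected_permille(alphabet_size):
    """Uniform expectation (in per mille) for one ordered pair."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected):
    """Assumed rule: compare the observed value with the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


expected_20 = expected_permille(20)  # 400 dipeptides -> 2.5 per mille (0.25%)
expected_21 = expected_permille(21)  # 441 dipeptides when Xaa is included

# Three entries taken from the table (values in per mille).
observed = {"IleLys": 13.18, "GlyPro": 0.451, "TrpTrp": 0.0}
labels = {pair: classify(value, expected_20) for pair, value in observed.items()}
print(expected_20, round(expected_21, 3), labels)
```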

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
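For readers who prefer fractions or percentages, the unit conversion can be sketched as follows. The helper names are hypothetical, and the example value is the IleLys entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


ile_lys = 13.18  # IleLys entry from the table, in per mille
fraction = permille_to_fraction(ile_lys)
percent = permille_to_percent(ile_lys)
print(fraction, percent)
```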

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.625 ± 0.603 | 0.632 ± 0.229 | 2.708 ± 0.459 | 4.875 ± 0.694 | 1.535 ± 0.402 | 2.889 ± 0.815 | 0.812 ± 0.374 | 4.514 ± 0.704 | 6.139 ± 0.716 | 4.694 ± 0.707 | 0.812 ± 0.219 | 3.069 ± 0.496 | 1.625 ± 0.402 | 2.257 ± 0.512 | 2.257 ± 0.451 | 1.896 ± 0.33 | 2.799 ± 0.482 | 3.25 ± 0.733 | 0.903 ± 0.3 | 0.903 ± 0.242 | 0.0 ± 0.0
Cys | 0.361 ± 0.162 | 0.09 ± 0.096 | 0.542 ± 0.265 | 0.993 ± 0.333 | 0.542 ± 0.21 | 0.903 ± 0.326 | 0.271 ± 0.18 | 1.896 ± 0.407 | 1.625 ± 0.332 | 1.083 ± 0.385 | 0.542 ± 0.216 | 0.361 ± 0.172 | 0.632 ± 0.229 | 0.632 ± 0.201 | 0.542 ± 0.228 | 0.722 ± 0.265 | 0.722 ± 0.267 | 0.271 ± 0.224 | 0.09 ± 0.09 | 0.451 ± 0.19 | 0.0 ± 0.0
Asp | 3.16 ± 0.489 | 0.542 ± 0.277 | 2.257 ± 0.529 | 2.979 ± 0.488 | 2.618 ± 0.538 | 3.431 ± 0.485 | 0.09 ± 0.088 | 7.493 ± 0.843 | 5.597 ± 0.677 | 5.778 ± 0.866 | 0.993 ± 0.344 | 5.597 ± 0.583 | 1.354 ± 0.322 | 0.451 ± 0.177 | 2.618 ± 0.584 | 3.701 ± 0.641 | 3.25 ± 0.588 | 2.437 ± 0.39 | 0.632 ± 0.268 | 3.34 ± 0.592 | 0.0 ± 0.0
Glu | 3.069 ± 0.555 | 0.812 ± 0.248 | 5.958 ± 0.827 | 8.847 ± 1.505 | 3.34 ± 0.627 | 5.778 ± 0.77 | 1.535 ± 0.465 | 8.305 ± 0.922 | 7.493 ± 0.982 | 8.667 ± 0.796 | 3.16 ± 0.582 | 5.687 ± 0.962 | 1.083 ± 0.297 | 2.618 ± 0.458 | 2.889 ± 0.596 | 3.25 ± 0.534 | 3.431 ± 0.459 | 4.243 ± 0.581 | 1.174 ± 0.374 | 3.611 ± 0.585 | 0.0 ± 0.0
Phe | 2.076 ± 0.404 | 0.451 ± 0.201 | 1.896 ± 0.445 | 2.618 ± 0.433 | 1.083 ± 0.329 | 2.347 ± 0.481 | 0.361 ± 0.181 | 3.25 ± 0.664 | 4.333 ± 0.595 | 3.25 ± 0.574 | 0.542 ± 0.192 | 3.972 ± 0.589 | 0.993 ± 0.311 | 0.632 ± 0.237 | 2.076 ± 0.523 | 1.806 ± 0.433 | 1.806 ± 0.484 | 1.264 ± 0.326 | 0.271 ± 0.161 | 1.444 ± 0.299 | 0.0 ± 0.0
Gly | 2.347 ± 0.583 | 0.903 ± 0.26 | 3.611 ± 0.65 | 3.792 ± 0.715 | 1.444 ± 0.42 | 3.882 ± 0.534 | 0.542 ± 0.208 | 5.868 ± 0.815 | 5.778 ± 0.886 | 3.882 ± 0.615 | 0.993 ± 0.419 | 3.34 ± 0.497 | 0.451 ± 0.18 | 2.076 ± 0.471 | 1.806 ± 0.394 | 3.431 ± 0.757 | 3.521 ± 0.519 | 3.34 ± 0.59 | 0.993 ± 0.317 | 3.792 ± 0.615 | 0.0 ± 0.0
His | 0.542 ± 0.269 | 0.181 ± 0.146 | 0.181 ± 0.131 | 0.542 ± 0.175 | 0.632 ± 0.281 | 0.542 ± 0.231 | 0.361 ± 0.245 | 1.174 ± 0.32 | 1.715 ± 0.36 | 0.903 ± 0.257 | 0.09 ± 0.101 | 0.812 ± 0.279 | 0.271 ± 0.147 | 0.361 ± 0.179 | 0.542 ± 0.217 | 0.271 ± 0.137 | 0.722 ± 0.223 | 0.451 ± 0.261 | 0.0 ± 0.0 | 0.542 ± 0.208 | 0.0 ± 0.0
Ile | 5.687 ± 0.906 | 1.354 ± 0.366 | 6.319 ± 0.748 | 10.021 ± 0.799 | 2.889 ± 0.478 | 3.611 ± 0.688 | 0.903 ± 0.227 | 7.042 ± 0.598 | 13.18 ± 0.909 | 6.951 ± 0.66 | 2.076 ± 0.374 | 6.771 ± 0.731 | 2.257 ± 0.385 | 1.986 ± 0.506 | 3.792 ± 0.694 | 4.694 ± 0.514 | 5.507 ± 0.866 | 4.514 ± 0.614 | 0.903 ± 0.284 | 2.618 ± 0.478 | 0.0 ± 0.0
Lys | 5.597 ± 0.684 | 1.896 ± 0.41 | 7.674 ± 0.751 | 11.555 ± 1.067 | 3.34 ± 0.481 | 5.236 ± 0.839 | 1.444 ± 0.372 | 10.292 ± 0.793 | 9.84 ± 1.11 | 10.833 ± 0.918 | 3.701 ± 0.553 | 8.215 ± 1.125 | 1.444 ± 0.353 | 3.792 ± 0.751 | 3.16 ± 0.683 | 4.875 ± 0.571 | 5.417 ± 0.685 | 8.576 ± 0.987 | 0.632 ± 0.226 | 5.236 ± 0.958 | 0.0 ± 0.0
Leu | 5.146 ± 0.714 | 1.444 ± 0.453 | 5.326 ± 0.612 | 8.215 ± 0.951 | 2.889 ± 0.562 | 4.694 ± 0.725 | 0.722 ± 0.271 | 5.868 ± 0.873 | 10.021 ± 1.011 | 5.958 ± 1.002 | 1.444 ± 0.398 | 7.312 ± 0.706 | 1.264 ± 0.325 | 2.618 ± 0.537 | 4.153 ± 0.716 | 4.424 ± 0.608 | 6.319 ± 0.86 | 3.701 ± 0.518 | 0.361 ± 0.154 | 2.437 ± 0.493 | 0.0 ± 0.0
Met | 1.806 ± 0.436 | 0.181 ± 0.121 | 1.625 ± 0.384 | 2.979 ± 0.628 | 0.542 ± 0.187 | 1.083 ± 0.328 | 0.271 ± 0.15 | 2.437 ± 0.534 | 2.979 ± 0.415 | 1.715 ± 0.357 | 0.542 ± 0.228 | 1.806 ± 0.454 | 1.083 ± 0.264 | 0.722 ± 0.251 | 0.722 ± 0.256 | 1.806 ± 0.338 | 0.993 ± 0.292 | 1.174 ± 0.37 | 0.09 ± 0.088 | 1.264 ± 0.369 | 0.0 ± 0.0
Asn | 3.972 ± 0.63 | 0.632 ± 0.26 | 3.069 ± 0.487 | 5.236 ± 0.588 | 2.979 ± 0.53 | 5.326 ± 0.842 | 0.361 ± 0.179 | 7.132 ± 0.799 | 9.479 ± 1.051 | 5.597 ± 0.685 | 2.167 ± 0.523 | 4.875 ± 0.81 | 2.076 ± 0.506 | 2.076 ± 0.364 | 2.076 ± 0.448 | 4.514 ± 0.777 | 3.792 ± 0.638 | 3.792 ± 0.786 | 1.083 ± 0.315 | 2.618 ± 0.62 | 0.0 ± 0.0
Pro | 0.903 ± 0.259 | 0.271 ± 0.147 | 1.174 ± 0.332 | 1.444 ± 0.36 | 0.632 ± 0.273 | 0.722 ± 0.241 | 0.632 ± 0.219 | 2.076 ± 0.367 | 2.257 ± 0.493 | 1.715 ± 0.333 | 0.09 ± 0.088 | 1.806 ± 0.532 | 0.722 ± 0.315 | 0.632 ± 0.192 | 1.354 ± 0.471 | 1.535 ± 0.37 | 1.354 ± 0.386 | 1.264 ± 0.347 | 0.271 ± 0.145 | 1.444 ± 0.283 | 0.0 ± 0.0
Gln | 1.806 ± 0.393 | 0.632 ± 0.27 | 1.264 ± 0.315 | 2.257 ± 0.406 | 1.444 ± 0.288 | 1.264 ± 0.349 | 0.271 ± 0.155 | 2.708 ± 0.378 | 2.799 ± 0.474 | 2.799 ± 0.53 | 0.993 ± 0.328 | 1.535 ± 0.413 | 0.632 ± 0.209 | 1.264 ± 0.33 | 1.535 ± 0.335 | 0.993 ± 0.249 | 1.535 ± 0.44 | 1.174 ± 0.421 | 0.451 ± 0.173 | 1.354 ± 0.298 | 0.0 ± 0.0
Arg | 0.903 ± 0.319 | 0.722 ± 0.235 | 1.986 ± 0.39 | 3.701 ± 0.693 | 1.715 ± 0.415 | 1.896 ± 0.354 | 0.361 ± 0.149 | 2.889 ± 0.484 | 5.146 ± 0.681 | 2.799 ± 0.55 | 1.174 ± 0.331 | 2.347 ± 0.468 | 1.083 ± 0.286 | 1.806 ± 0.525 | 1.625 ± 0.36 | 2.167 ± 0.482 | 2.167 ± 0.473 | 1.444 ± 0.353 | 0.271 ± 0.146 | 1.625 ± 0.474 | 0.0 ± 0.0
Ser | 2.347 ± 0.459 | 0.542 ± 0.222 | 2.257 ± 0.433 | 3.069 ± 0.512 | 2.618 ± 0.582 | 2.437 ± 0.534 | 0.271 ± 0.146 | 4.875 ± 0.558 | 6.319 ± 0.695 | 3.701 ± 0.644 | 2.167 ± 0.481 | 3.972 ± 0.644 | 0.903 ± 0.303 | 0.993 ± 0.289 | 1.444 ± 0.295 | 2.528 ± 0.526 | 2.618 ± 0.431 | 3.972 ± 0.571 | 0.903 ± 0.252 | 3.16 ± 0.62 | 0.0 ± 0.0
Thr | 2.979 ± 0.597 | 0.361 ± 0.204 | 3.069 ± 0.586 | 4.694 ± 0.665 | 1.715 ± 0.331 | 4.333 ± 0.725 | 0.181 ± 0.117 | 5.417 ± 0.804 | 6.139 ± 0.772 | 5.146 ± 0.774 | 0.812 ± 0.23 | 3.069 ± 0.635 | 1.896 ± 0.549 | 1.535 ± 0.356 | 1.535 ± 0.405 | 2.799 ± 0.421 | 3.34 ± 0.616 | 3.16 ± 0.674 | 1.174 ± 0.293 | 1.625 ± 0.412 | 0.0 ± 0.0
Val | 3.069 ± 0.667 | 0.271 ± 0.155 | 3.882 ± 0.496 | 3.25 ± 0.456 | 2.347 ± 0.521 | 2.708 ± 0.619 | 0.632 ± 0.249 | 5.056 ± 0.699 | 5.326 ± 0.669 | 4.604 ± 0.739 | 1.535 ± 0.287 | 3.792 ± 0.565 | 1.535 ± 0.353 | 1.444 ± 0.461 | 1.896 ± 0.427 | 3.34 ± 0.773 | 3.34 ± 0.661 | 2.528 ± 0.573 | 0.451 ± 0.172 | 2.257 ± 0.525 | 0.0 ± 0.0
Trp | 0.451 ± 0.183 | 0.0 ± 0.0 | 0.812 ± 0.238 | 0.722 ± 0.212 | 0.271 ± 0.138 | 0.903 ± 0.308 | 0.361 ± 0.184 | 1.354 ± 0.299 | 0.903 ± 0.29 | 1.264 ± 0.284 | 0.451 ± 0.171 | 0.903 ± 0.298 | 0.0 ± 0.0 | 0.181 ± 0.135 | 0.09 ± 0.084 | 0.722 ± 0.226 | 0.361 ± 0.244 | 0.632 ± 0.264 | 0.0 ± 0.0 | 0.632 ± 0.229 | 0.0 ± 0.0
Tyr | 2.347 ± 0.56 | 1.354 ± 0.318 | 2.708 ± 0.461 | 3.16 ± 0.665 | 1.896 ± 0.422 | 1.806 ± 0.431 | 0.451 ± 0.172 | 3.431 ± 0.631 | 5.778 ± 0.733 | 2.979 ± 0.468 | 1.444 ± 0.391 | 3.611 ± 0.76 | 0.993 ± 0.341 | 0.632 ± 0.244 | 1.715 ± 0.375 | 1.625 ± 0.435 | 1.986 ± 0.452 | 2.076 ± 0.386 | 0.361 ± 0.162 | 1.806 ± 0.457 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 61 proteins (11078 amino acids)

Note: The errors were estimated by bootstrapping (×100) at the protein level.
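A protein-level bootstrap of this kind can be sketched roughly as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Treating the error as the sample standard deviation over 100 resamples is an assumption here, as are the function names and the toy sequences; the page does not spell out these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Assumed error estimate: standard deviation over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins, not individual residues or pairs.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not real phage data.
toy = ["MKKLE", "MKE", "AKKA", "MLE"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")  # a pair that never occurs
print(round(pair_permille(toy, "KK"), 3), round(err_kk, 3), err_ww)
```

A dipeptide that never occurs yields zero in every resample, so its estimated error is zero as well, which matches the 0.0 ± 0.0 entries in the table.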

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski