Amino acid dipepetide frequency for Clostridium phage phiCTC2B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.978AlaAla: 1.978 ± 0.514
0.258AlaCys: 0.258 ± 0.156
2.408AlaAsp: 2.408 ± 0.408
4.128AlaGlu: 4.128 ± 0.511
1.978AlaPhe: 1.978 ± 0.358
2.666AlaGly: 2.666 ± 0.504
0.344AlaHis: 0.344 ± 0.224
5.16AlaIle: 5.16 ± 1.342
6.277AlaLys: 6.277 ± 0.841
5.331AlaLeu: 5.331 ± 0.738
1.376AlaMet: 1.376 ± 0.383
2.752AlaAsn: 2.752 ± 0.48
1.634AlaPro: 1.634 ± 0.353
1.892AlaGln: 1.892 ± 0.479
1.29AlaArg: 1.29 ± 0.356
2.064AlaSer: 2.064 ± 0.468
3.612AlaThr: 3.612 ± 0.531
3.01AlaVal: 3.01 ± 0.645
0.688AlaTrp: 0.688 ± 0.233
1.978AlaTyr: 1.978 ± 0.381
0.0AlaXaa: 0.0 ± 0.0
Cys
0.258CysAla: 0.258 ± 0.187
0.43CysCys: 0.43 ± 0.197
0.516CysAsp: 0.516 ± 0.217
1.376CysGlu: 1.376 ± 0.391
0.516CysPhe: 0.516 ± 0.187
0.602CysGly: 0.602 ± 0.25
0.086CysHis: 0.086 ± 0.101
1.204CysIle: 1.204 ± 0.294
1.892CysLys: 1.892 ± 0.469
0.688CysLeu: 0.688 ± 0.248
0.43CysMet: 0.43 ± 0.186
1.29CysAsn: 1.29 ± 0.361
0.172CysPro: 0.172 ± 0.124
0.0CysGln: 0.0 ± 0.0
0.43CysArg: 0.43 ± 0.208
0.774CysSer: 0.774 ± 0.2
0.774CysThr: 0.774 ± 0.233
0.774CysVal: 0.774 ± 0.29
0.172CysTrp: 0.172 ± 0.123
0.258CysTyr: 0.258 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
2.838AspAla: 2.838 ± 0.36
0.43AspCys: 0.43 ± 0.222
1.978AspAsp: 1.978 ± 0.391
2.322AspGlu: 2.322 ± 0.483
2.15AspPhe: 2.15 ± 0.473
4.214AspGly: 4.214 ± 0.621
0.258AspHis: 0.258 ± 0.129
6.105AspIle: 6.105 ± 0.878
6.965AspLys: 6.965 ± 0.748
5.589AspLeu: 5.589 ± 0.512
1.204AspMet: 1.204 ± 0.311
4.128AspAsn: 4.128 ± 0.51
1.204AspPro: 1.204 ± 0.296
1.032AspGln: 1.032 ± 0.298
2.322AspArg: 2.322 ± 0.545
3.01AspSer: 3.01 ± 0.55
3.01AspThr: 3.01 ± 0.637
2.322AspVal: 2.322 ± 0.547
0.688AspTrp: 0.688 ± 0.213
3.612AspTyr: 3.612 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
4.214GluAla: 4.214 ± 0.62
1.032GluCys: 1.032 ± 0.301
4.386GluAsp: 4.386 ± 0.737
9.975GluGlu: 9.975 ± 1.262
2.838GluPhe: 2.838 ± 0.554
4.386GluGly: 4.386 ± 0.572
1.462GluHis: 1.462 ± 0.449
7.309GluIle: 7.309 ± 0.884
7.911GluLys: 7.911 ± 1.1
9.287GluLeu: 9.287 ± 0.885
2.236GluMet: 2.236 ± 0.458
5.675GluAsn: 5.675 ± 0.807
0.774GluPro: 0.774 ± 0.269
3.096GluGln: 3.096 ± 0.608
2.752GluArg: 2.752 ± 0.708
3.956GluSer: 3.956 ± 0.526
4.214GluThr: 4.214 ± 0.665
5.847GluVal: 5.847 ± 0.726
1.032GluTrp: 1.032 ± 0.279
3.526GluTyr: 3.526 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
2.15PheAla: 2.15 ± 0.359
0.43PheCys: 0.43 ± 0.196
1.978PheAsp: 1.978 ± 0.409
3.268PheGlu: 3.268 ± 0.561
1.29PhePhe: 1.29 ± 0.328
1.118PheGly: 1.118 ± 0.29
0.258PheHis: 0.258 ± 0.145
3.44PheIle: 3.44 ± 0.604
4.128PheLys: 4.128 ± 0.52
1.72PheLeu: 1.72 ± 0.405
0.774PheMet: 0.774 ± 0.277
2.838PheAsn: 2.838 ± 0.454
1.032PhePro: 1.032 ± 0.305
1.118PheGln: 1.118 ± 0.33
1.72PheArg: 1.72 ± 0.433
1.892PheSer: 1.892 ± 0.47
1.462PheThr: 1.462 ± 0.376
1.376PheVal: 1.376 ± 0.294
0.344PheTrp: 0.344 ± 0.184
1.204PheTyr: 1.204 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
2.236GlyAla: 2.236 ± 0.538
0.602GlyCys: 0.602 ± 0.197
3.096GlyAsp: 3.096 ± 0.567
4.042GlyGlu: 4.042 ± 0.685
1.72GlyPhe: 1.72 ± 0.386
3.44GlyGly: 3.44 ± 0.655
0.602GlyHis: 0.602 ± 0.223
4.73GlyIle: 4.73 ± 0.716
4.644GlyLys: 4.644 ± 0.637
4.042GlyLeu: 4.042 ± 0.462
1.29GlyMet: 1.29 ± 0.358
3.44GlyAsn: 3.44 ± 0.585
0.516GlyPro: 0.516 ± 0.21
2.064GlyGln: 2.064 ± 0.465
1.118GlyArg: 1.118 ± 0.294
3.354GlySer: 3.354 ± 0.622
3.526GlyThr: 3.526 ± 0.664
3.784GlyVal: 3.784 ± 0.528
1.376GlyTrp: 1.376 ± 0.404
4.386GlyTyr: 4.386 ± 0.729
0.0GlyXaa: 0.0 ± 0.0
His
0.258HisAla: 0.258 ± 0.13
0.172HisCys: 0.172 ± 0.115
0.602HisAsp: 0.602 ± 0.236
0.774HisGlu: 0.774 ± 0.242
0.688HisPhe: 0.688 ± 0.226
0.688HisGly: 0.688 ± 0.226
0.344HisHis: 0.344 ± 0.241
0.86HisIle: 0.86 ± 0.313
1.548HisLys: 1.548 ± 0.365
1.204HisLeu: 1.204 ± 0.298
0.258HisMet: 0.258 ± 0.145
0.86HisAsn: 0.86 ± 0.268
0.172HisPro: 0.172 ± 0.105
0.344HisGln: 0.344 ± 0.172
0.688HisArg: 0.688 ± 0.286
0.688HisSer: 0.688 ± 0.263
0.774HisThr: 0.774 ± 0.261
0.602HisVal: 0.602 ± 0.256
0.086HisTrp: 0.086 ± 0.098
0.688HisTyr: 0.688 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
4.214IleAla: 4.214 ± 0.745
0.86IleCys: 0.86 ± 0.261
6.363IleAsp: 6.363 ± 0.525
8.685IleGlu: 8.685 ± 0.869
2.408IlePhe: 2.408 ± 0.402
5.074IleGly: 5.074 ± 0.658
0.86IleHis: 0.86 ± 0.257
6.277IleIle: 6.277 ± 0.622
11.867IleLys: 11.867 ± 1.077
6.879IleLeu: 6.879 ± 0.697
2.064IleMet: 2.064 ± 0.498
8.685IleAsn: 8.685 ± 0.756
1.892IlePro: 1.892 ± 0.378
2.494IleGln: 2.494 ± 0.537
3.44IleArg: 3.44 ± 0.52
5.761IleSer: 5.761 ± 0.635
5.503IleThr: 5.503 ± 0.715
3.87IleVal: 3.87 ± 0.562
0.86IleTrp: 0.86 ± 0.239
2.666IleTyr: 2.666 ± 0.55
0.0IleXaa: 0.0 ± 0.0
Lys
5.331LysAla: 5.331 ± 0.883
1.634LysCys: 1.634 ± 0.304
7.051LysAsp: 7.051 ± 1.011
12.211LysGlu: 12.211 ± 1.111
2.838LysPhe: 2.838 ± 0.413
5.417LysGly: 5.417 ± 0.704
2.064LysHis: 2.064 ± 0.382
9.975LysIle: 9.975 ± 0.823
12.211LysLys: 12.211 ± 1.157
9.631LysLeu: 9.631 ± 0.938
3.526LysMet: 3.526 ± 0.637
7.911LysAsn: 7.911 ± 0.915
1.634LysPro: 1.634 ± 0.356
3.44LysGln: 3.44 ± 0.737
3.44LysArg: 3.44 ± 0.617
5.589LysSer: 5.589 ± 0.535
5.933LysThr: 5.933 ± 0.558
7.653LysVal: 7.653 ± 0.956
0.43LysTrp: 0.43 ± 0.23
5.589LysTyr: 5.589 ± 0.655
0.0LysXaa: 0.0 ± 0.0
Leu
4.816LeuAla: 4.816 ± 0.548
1.548LeuCys: 1.548 ± 0.481
4.988LeuAsp: 4.988 ± 0.69
8.427LeuGlu: 8.427 ± 1.067
2.064LeuPhe: 2.064 ± 0.488
4.472LeuGly: 4.472 ± 0.628
0.86LeuHis: 0.86 ± 0.257
7.309LeuIle: 7.309 ± 0.83
11.695LeuLys: 11.695 ± 1.012
7.223LeuLeu: 7.223 ± 1.098
1.892LeuMet: 1.892 ± 0.349
5.847LeuAsn: 5.847 ± 0.672
2.064LeuPro: 2.064 ± 0.446
2.58LeuGln: 2.58 ± 0.49
2.838LeuArg: 2.838 ± 0.462
5.589LeuSer: 5.589 ± 0.519
4.73LeuThr: 4.73 ± 0.602
3.612LeuVal: 3.612 ± 0.561
0.516LeuTrp: 0.516 ± 0.236
2.752LeuTyr: 2.752 ± 0.501
0.0LeuXaa: 0.0 ± 0.0
Met
2.064MetAla: 2.064 ± 0.51
0.258MetCys: 0.258 ± 0.148
1.118MetAsp: 1.118 ± 0.298
2.924MetGlu: 2.924 ± 0.527
0.602MetPhe: 0.602 ± 0.215
1.118MetGly: 1.118 ± 0.328
0.516MetHis: 0.516 ± 0.281
2.064MetIle: 2.064 ± 0.382
2.666MetLys: 2.666 ± 0.416
2.064MetLeu: 2.064 ± 0.394
0.774MetMet: 0.774 ± 0.271
2.236MetAsn: 2.236 ± 0.458
0.86MetPro: 0.86 ± 0.398
1.118MetGln: 1.118 ± 0.264
0.86MetArg: 0.86 ± 0.272
1.72MetSer: 1.72 ± 0.417
1.032MetThr: 1.032 ± 0.328
1.204MetVal: 1.204 ± 0.301
0.086MetTrp: 0.086 ± 0.083
1.376MetTyr: 1.376 ± 0.298
0.0MetXaa: 0.0 ± 0.0
Asn
3.784AsnAla: 3.784 ± 0.534
0.688AsnCys: 0.688 ± 0.297
3.354AsnAsp: 3.354 ± 0.541
5.246AsnGlu: 5.246 ± 0.701
2.15AsnPhe: 2.15 ± 0.354
4.472AsnGly: 4.472 ± 0.584
0.516AsnHis: 0.516 ± 0.214
7.825AsnIle: 7.825 ± 0.778
8.169AsnLys: 8.169 ± 1.033
5.417AsnLeu: 5.417 ± 0.608
2.15AsnMet: 2.15 ± 0.35
5.417AsnAsn: 5.417 ± 0.943
1.548AsnPro: 1.548 ± 0.43
1.72AsnGln: 1.72 ± 0.347
2.064AsnArg: 2.064 ± 0.476
4.816AsnSer: 4.816 ± 0.659
3.44AsnThr: 3.44 ± 0.641
3.526AsnVal: 3.526 ± 0.593
0.946AsnTrp: 0.946 ± 0.257
2.666AsnTyr: 2.666 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
1.118ProAla: 1.118 ± 0.221
0.258ProCys: 0.258 ± 0.139
0.86ProAsp: 0.86 ± 0.31
1.806ProGlu: 1.806 ± 0.41
0.602ProPhe: 0.602 ± 0.257
1.204ProGly: 1.204 ± 0.332
0.43ProHis: 0.43 ± 0.19
2.064ProIle: 2.064 ± 0.349
2.064ProLys: 2.064 ± 0.398
1.72ProLeu: 1.72 ± 0.404
0.086ProMet: 0.086 ± 0.094
1.376ProAsn: 1.376 ± 0.333
0.86ProPro: 0.86 ± 0.27
0.602ProGln: 0.602 ± 0.316
0.43ProArg: 0.43 ± 0.234
2.15ProSer: 2.15 ± 0.427
1.462ProThr: 1.462 ± 0.226
1.29ProVal: 1.29 ± 0.332
0.258ProTrp: 0.258 ± 0.133
1.462ProTyr: 1.462 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
1.892GlnAla: 1.892 ± 0.448
0.344GlnCys: 0.344 ± 0.214
1.72GlnAsp: 1.72 ± 0.384
2.15GlnGlu: 2.15 ± 0.374
1.462GlnPhe: 1.462 ± 0.351
0.946GlnGly: 0.946 ± 0.27
0.774GlnHis: 0.774 ± 0.223
2.752GlnIle: 2.752 ± 0.484
2.838GlnLys: 2.838 ± 0.483
3.096GlnLeu: 3.096 ± 0.499
0.86GlnMet: 0.86 ± 0.243
1.978GlnAsn: 1.978 ± 0.531
0.86GlnPro: 0.86 ± 0.245
1.548GlnGln: 1.548 ± 0.403
1.376GlnArg: 1.376 ± 0.351
1.376GlnSer: 1.376 ± 0.294
1.462GlnThr: 1.462 ± 0.381
1.806GlnVal: 1.806 ± 0.442
0.172GlnTrp: 0.172 ± 0.11
1.118GlnTyr: 1.118 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
1.72ArgAla: 1.72 ± 0.247
0.602ArgCys: 0.602 ± 0.245
1.978ArgAsp: 1.978 ± 0.419
3.784ArgGlu: 3.784 ± 0.587
1.29ArgPhe: 1.29 ± 0.315
2.064ArgGly: 2.064 ± 0.444
0.43ArgHis: 0.43 ± 0.181
1.978ArgIle: 1.978 ± 0.493
3.44ArgLys: 3.44 ± 0.607
2.58ArgLeu: 2.58 ± 0.532
1.634ArgMet: 1.634 ± 0.389
2.15ArgAsn: 2.15 ± 0.351
1.032ArgPro: 1.032 ± 0.23
1.29ArgGln: 1.29 ± 0.312
1.29ArgArg: 1.29 ± 0.455
1.462ArgSer: 1.462 ± 0.346
1.806ArgThr: 1.806 ± 0.549
1.892ArgVal: 1.892 ± 0.352
0.43ArgTrp: 0.43 ± 0.191
1.806ArgTyr: 1.806 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
2.666SerAla: 2.666 ± 0.397
0.774SerCys: 0.774 ± 0.239
3.612SerAsp: 3.612 ± 0.545
3.956SerGlu: 3.956 ± 0.485
2.408SerPhe: 2.408 ± 0.596
2.666SerGly: 2.666 ± 0.552
0.602SerHis: 0.602 ± 0.196
6.879SerIle: 6.879 ± 0.604
5.847SerLys: 5.847 ± 0.644
4.128SerLeu: 4.128 ± 0.509
1.72SerMet: 1.72 ± 0.371
4.472SerAsn: 4.472 ± 0.69
1.462SerPro: 1.462 ± 0.377
1.548SerGln: 1.548 ± 0.499
2.494SerArg: 2.494 ± 0.505
3.698SerSer: 3.698 ± 0.623
3.096SerThr: 3.096 ± 0.497
2.924SerVal: 2.924 ± 0.454
0.43SerTrp: 0.43 ± 0.173
2.58SerTyr: 2.58 ± 0.441
0.0SerXaa: 0.0 ± 0.0
Thr
3.526ThrAla: 3.526 ± 0.754
0.602ThrCys: 0.602 ± 0.23
2.408ThrAsp: 2.408 ± 0.448
4.042ThrGlu: 4.042 ± 0.568
2.408ThrPhe: 2.408 ± 0.405
4.386ThrGly: 4.386 ± 0.802
0.602ThrHis: 0.602 ± 0.261
4.386ThrIle: 4.386 ± 0.43
6.621ThrLys: 6.621 ± 0.655
5.16ThrLeu: 5.16 ± 0.713
1.376ThrMet: 1.376 ± 0.376
2.838ThrAsn: 2.838 ± 0.532
1.892ThrPro: 1.892 ± 0.475
1.892ThrGln: 1.892 ± 0.38
1.376ThrArg: 1.376 ± 0.39
2.838ThrSer: 2.838 ± 0.434
3.182ThrThr: 3.182 ± 0.611
3.354ThrVal: 3.354 ± 0.56
0.86ThrTrp: 0.86 ± 0.231
2.236ThrTyr: 2.236 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
3.01ValAla: 3.01 ± 0.597
0.688ValCys: 0.688 ± 0.269
3.784ValAsp: 3.784 ± 0.516
3.01ValGlu: 3.01 ± 0.497
2.408ValPhe: 2.408 ± 0.452
2.15ValGly: 2.15 ± 0.441
0.86ValHis: 0.86 ± 0.285
4.902ValIle: 4.902 ± 0.574
5.417ValLys: 5.417 ± 0.763
5.417ValLeu: 5.417 ± 0.656
1.462ValMet: 1.462 ± 0.413
2.58ValAsn: 2.58 ± 0.551
1.892ValPro: 1.892 ± 0.371
1.634ValGln: 1.634 ± 0.327
2.322ValArg: 2.322 ± 0.409
4.3ValSer: 4.3 ± 0.591
3.354ValThr: 3.354 ± 0.538
3.182ValVal: 3.182 ± 0.589
0.516ValTrp: 0.516 ± 0.206
1.376ValTyr: 1.376 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
0.43TrpAla: 0.43 ± 0.158
0.172TrpCys: 0.172 ± 0.129
0.602TrpAsp: 0.602 ± 0.206
0.688TrpGlu: 0.688 ± 0.233
0.516TrpPhe: 0.516 ± 0.21
0.946TrpGly: 0.946 ± 0.343
0.086TrpHis: 0.086 ± 0.082
1.118TrpIle: 1.118 ± 0.293
1.462TrpLys: 1.462 ± 0.438
0.946TrpLeu: 0.946 ± 0.288
0.43TrpMet: 0.43 ± 0.159
0.516TrpAsn: 0.516 ± 0.177
0.0TrpPro: 0.0 ± 0.0
0.086TrpGln: 0.086 ± 0.096
0.43TrpArg: 0.43 ± 0.184
0.516TrpSer: 0.516 ± 0.211
0.516TrpThr: 0.516 ± 0.192
0.43TrpVal: 0.43 ± 0.167
0.086TrpTrp: 0.086 ± 0.075
0.43TrpTyr: 0.43 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.322TyrAla: 2.322 ± 0.417
0.86TyrCys: 0.86 ± 0.276
2.58TyrAsp: 2.58 ± 0.511
3.096TyrGlu: 3.096 ± 0.595
1.204TyrPhe: 1.204 ± 0.304
1.978TyrGly: 1.978 ± 0.447
0.344TyrHis: 0.344 ± 0.198
4.386TyrIle: 4.386 ± 0.575
5.847TyrLys: 5.847 ± 0.786
3.698TyrLeu: 3.698 ± 0.684
1.032TyrMet: 1.032 ± 0.381
2.924TyrAsn: 2.924 ± 0.542
0.602TyrPro: 0.602 ± 0.191
1.032TyrGln: 1.032 ± 0.369
1.978TyrArg: 1.978 ± 0.478
2.408TyrSer: 2.408 ± 0.455
3.096TyrThr: 3.096 ± 0.514
1.892TyrVal: 1.892 ± 0.322
0.43TyrTrp: 0.43 ± 0.185
1.204TyrTyr: 1.204 ± 0.431
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (11630 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski