Amino acid dipeptide frequency for Clostridium phage CPD1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
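As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the normalization by the total number of counted dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the CPD1 proteome.
toy = ["MKEKE", "AKEU"]
freqs = dipeptide_per_mille(toy)
```

Here `U` is not one of the 20 standard residues, so it is counted as `X`, which is why an `EX` dipeptide appears in the result.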

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
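The uniform baseline can be checked with a few lines of arithmetic. The sketch below computes the expected per mille value for 20 and 21 residue types and labels three values taken from the table. The `classify` helper is hypothetical: the page does not state the exact rule used for colouring, so a plain comparison against the baseline (ignoring the error estimates) is assumed here.

```python
# Expected frequency if all ordered pairs were equally likely.
expected_20 = 1000.0 / 20**2   # 400 dipeptides -> 2.5 per mille
expected_21 = 1000.0 / 21**2   # 441 dipeptides, Xaa included

def classify(observed_per_mille, expected=expected_20):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table, in per mille.
examples = {"LysGlu": 12.424, "AlaAla": 1.091, "TrpTrp": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

With these inputs, LysGlu falls above the 2.5‰ baseline, while AlaAla and TrpTrp fall below it.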

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
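For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are made up for this example; the LysGlu value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0

lys_glu = 12.424  # per mille, from the table
fraction = per_mille_to_fraction(lys_glu)
percent = per_mille_to_percent(lys_glu)
```

So the LysGlu entry of 12.424‰ corresponds to a fraction of about 0.0124, or about 1.24%.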

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.091 ± 0.712
AlaCys: 0.252 ± 0.127
AlaAsp: 2.015 ± 0.38
AlaGlu: 3.442 ± 0.592
AlaPhe: 2.602 ± 0.544
AlaGly: 2.518 ± 0.686
AlaHis: 0.504 ± 0.186
AlaIle: 5.96 ± 0.701
AlaLys: 4.953 ± 0.711
AlaLeu: 5.373 ± 0.847
AlaMet: 1.007 ± 0.312
AlaAsn: 4.533 ± 0.768
AlaPro: 0.923 ± 0.292
AlaGln: 1.511 ± 0.336
AlaArg: 2.351 ± 0.379
AlaSer: 3.022 ± 0.471
AlaThr: 3.022 ± 0.503
AlaVal: 3.274 ± 0.694
AlaTrp: 0.672 ± 0.215
AlaTyr: 1.679 ± 0.332
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.168 ± 0.12
CysCys: 0.336 ± 0.188
CysAsp: 0.756 ± 0.246
CysGlu: 1.091 ± 0.278
CysPhe: 0.672 ± 0.247
CysGly: 1.091 ± 0.338
CysHis: 0.168 ± 0.11
CysIle: 0.923 ± 0.276
CysLys: 1.259 ± 0.296
CysLeu: 1.175 ± 0.351
CysMet: 0.168 ± 0.109
CysAsn: 1.007 ± 0.332
CysPro: 0.252 ± 0.143
CysGln: 0.252 ± 0.198
CysArg: 0.672 ± 0.232
CysSer: 1.007 ± 0.293
CysThr: 0.756 ± 0.222
CysVal: 0.504 ± 0.234
CysTrp: 0.084 ± 0.086
CysTyr: 0.252 ± 0.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.77 ± 0.412
AspCys: 1.091 ± 0.309
AspAsp: 2.015 ± 0.39
AspGlu: 4.617 ± 0.55
AspPhe: 2.602 ± 0.429
AspGly: 3.442 ± 0.51
AspHis: 0.336 ± 0.144
AspIle: 4.785 ± 0.753
AspLys: 5.457 ± 0.643
AspLeu: 5.792 ± 0.52
AspMet: 1.679 ± 0.342
AspAsn: 3.526 ± 0.465
AspPro: 0.672 ± 0.262
AspGln: 1.091 ± 0.283
AspArg: 1.343 ± 0.364
AspSer: 3.022 ± 0.347
AspThr: 2.938 ± 0.424
AspVal: 2.77 ± 0.46
AspTrp: 1.427 ± 0.35
AspTyr: 2.938 ± 0.549
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.533 ± 0.662
GluCys: 1.091 ± 0.299
GluAsp: 4.449 ± 0.648
GluGlu: 8.059 ± 1.036
GluPhe: 3.862 ± 0.444
GluGly: 5.792 ± 0.625
GluHis: 0.756 ± 0.257
GluIle: 8.647 ± 1.05
GluLys: 8.731 ± 1.125
GluLeu: 7.471 ± 0.853
GluMet: 3.19 ± 0.49
GluAsn: 6.464 ± 0.843
GluPro: 1.175 ± 0.309
GluGln: 2.854 ± 0.469
GluArg: 2.686 ± 0.499
GluSer: 3.694 ± 0.534
GluThr: 3.61 ± 0.465
GluVal: 4.869 ± 0.627
GluTrp: 0.756 ± 0.178
GluTyr: 3.526 ± 0.532
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.175 ± 0.276
PheCys: 0.084 ± 0.07
PheAsp: 2.686 ± 0.387
PheGlu: 3.694 ± 0.439
PhePhe: 1.259 ± 0.387
PheGly: 2.854 ± 0.65
PheHis: 0.168 ± 0.124
PheIle: 3.778 ± 0.547
PheLys: 3.862 ± 0.643
PheLeu: 3.274 ± 0.502
PheMet: 1.343 ± 0.343
PheAsn: 3.61 ± 0.462
PhePro: 0.756 ± 0.238
PheGln: 1.175 ± 0.276
PheArg: 1.427 ± 0.296
PheSer: 2.938 ± 0.426
PheThr: 2.77 ± 0.48
PheVal: 2.351 ± 0.382
PheTrp: 0.42 ± 0.221
PheTyr: 2.435 ± 0.456
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.358 ± 0.733
GlyCys: 0.923 ± 0.271
GlyAsp: 2.602 ± 0.408
GlyGlu: 5.205 ± 0.654
GlyPhe: 3.022 ± 0.508
GlyGly: 4.365 ± 0.866
GlyHis: 0.756 ± 0.242
GlyIle: 4.533 ± 0.882
GlyLys: 4.533 ± 0.616
GlyLeu: 5.625 ± 0.831
GlyMet: 1.091 ± 0.324
GlyAsn: 4.197 ± 0.759
GlyPro: 0.504 ± 0.412
GlyGln: 1.511 ± 0.445
GlyArg: 2.015 ± 0.281
GlySer: 3.61 ± 0.589
GlyThr: 3.526 ± 0.523
GlyVal: 4.113 ± 0.668
GlyTrp: 0.504 ± 0.215
GlyTyr: 3.19 ± 0.47
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.42 ± 0.169
HisCys: 0.168 ± 0.116
HisAsp: 0.588 ± 0.241
HisGlu: 0.672 ± 0.182
HisPhe: 0.672 ± 0.22
HisGly: 0.756 ± 0.195
HisHis: 0.252 ± 0.133
HisIle: 0.588 ± 0.214
HisLys: 1.259 ± 0.332
HisLeu: 0.923 ± 0.252
HisMet: 0.168 ± 0.13
HisAsn: 0.672 ± 0.225
HisPro: 0.252 ± 0.138
HisGln: 0.336 ± 0.139
HisArg: 0.336 ± 0.174
HisSer: 0.756 ± 0.263
HisThr: 0.672 ± 0.279
HisVal: 0.336 ± 0.133
HisTrp: 0.0 ± 0.0
HisTyr: 0.588 ± 0.27
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.121 ± 0.604
IleCys: 1.091 ± 0.37
IleAsp: 6.548 ± 0.612
IleGlu: 8.647 ± 0.986
IlePhe: 2.267 ± 0.529
IleGly: 3.358 ± 0.405
IleHis: 0.923 ± 0.235
IleIle: 7.052 ± 0.81
IleLys: 9.654 ± 0.964
IleLeu: 4.869 ± 0.631
IleMet: 2.183 ± 0.438
IleAsn: 7.555 ± 1.136
IlePro: 2.686 ± 0.393
IleGln: 2.854 ± 0.516
IleArg: 3.694 ± 0.573
IleSer: 5.373 ± 0.609
IleThr: 4.533 ± 0.679
IleVal: 4.281 ± 0.613
IleTrp: 0.672 ± 0.211
IleTyr: 2.938 ± 0.52
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.373 ± 0.745
LysCys: 0.756 ± 0.26
LysAsp: 4.953 ± 0.598
LysGlu: 12.424 ± 1.51
LysPhe: 4.03 ± 0.479
LysGly: 5.876 ± 0.747
LysHis: 1.427 ± 0.349
LysIle: 9.402 ± 0.941
LysLys: 7.555 ± 0.792
LysLeu: 6.884 ± 0.647
LysMet: 2.183 ± 0.472
LysAsn: 6.128 ± 0.755
LysPro: 2.854 ± 0.422
LysGln: 3.694 ± 0.573
LysArg: 5.037 ± 0.72
LysSer: 4.281 ± 0.517
LysThr: 4.449 ± 0.569
LysVal: 6.548 ± 0.695
LysTrp: 0.588 ± 0.213
LysTyr: 2.686 ± 0.408
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.289 ± 0.979
LeuCys: 0.672 ± 0.248
LeuAsp: 4.869 ± 0.531
LeuGlu: 7.471 ± 0.696
LeuPhe: 3.274 ± 0.503
LeuGly: 5.205 ± 0.921
LeuHis: 1.175 ± 0.309
LeuIle: 5.792 ± 0.903
LeuLys: 9.402 ± 0.918
LeuLeu: 5.037 ± 0.512
LeuMet: 2.518 ± 0.421
LeuAsn: 7.22 ± 0.798
LeuPro: 1.427 ± 0.315
LeuGln: 3.274 ± 0.538
LeuArg: 2.686 ± 0.432
LeuSer: 5.289 ± 0.633
LeuThr: 5.289 ± 0.89
LeuVal: 2.938 ± 0.491
LeuTrp: 0.504 ± 0.205
LeuTyr: 3.274 ± 0.614
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.931 ± 0.497
MetCys: 0.42 ± 0.173
MetAsp: 1.679 ± 0.393
MetGlu: 2.099 ± 0.491
MetPhe: 0.756 ± 0.21
MetGly: 1.427 ± 0.345
MetHis: 0.0 ± 0.0
MetIle: 2.183 ± 0.439
MetLys: 2.854 ± 0.484
MetLeu: 2.015 ± 0.415
MetMet: 0.672 ± 0.245
MetAsn: 2.435 ± 0.53
MetPro: 0.42 ± 0.169
MetGln: 1.427 ± 0.495
MetArg: 1.007 ± 0.244
MetSer: 1.343 ± 0.354
MetThr: 1.679 ± 0.34
MetVal: 1.259 ± 0.325
MetTrp: 0.252 ± 0.146
MetTyr: 0.588 ± 0.182
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.365 ± 0.596
AsnCys: 1.091 ± 0.25
AsnAsp: 4.03 ± 0.58
AsnGlu: 3.862 ± 0.549
AsnPhe: 3.526 ± 0.481
AsnGly: 5.876 ± 0.831
AsnHis: 1.091 ± 0.313
AsnIle: 6.212 ± 1.269
AsnLys: 7.22 ± 0.626
AsnLeu: 5.205 ± 0.725
AsnMet: 1.595 ± 0.35
AsnAsn: 6.296 ± 1.139
AsnPro: 2.77 ± 0.485
AsnGln: 2.183 ± 0.403
AsnArg: 3.022 ± 0.475
AsnSer: 4.617 ± 0.928
AsnThr: 4.449 ± 0.675
AsnVal: 4.365 ± 0.757
AsnTrp: 1.091 ± 0.3
AsnTyr: 3.946 ± 0.469
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.091 ± 0.358
ProCys: 0.252 ± 0.145
ProAsp: 1.259 ± 0.33
ProGlu: 0.839 ± 0.25
ProPhe: 1.511 ± 0.384
ProGly: 0.336 ± 0.132
ProHis: 0.42 ± 0.131
ProIle: 2.602 ± 0.391
ProLys: 1.931 ± 0.362
ProLeu: 2.686 ± 0.473
ProMet: 0.756 ± 0.341
ProAsn: 2.015 ± 0.6
ProPro: 0.336 ± 0.14
ProGln: 1.259 ± 0.353
ProArg: 0.336 ± 0.169
ProSer: 1.679 ± 0.456
ProThr: 1.595 ± 0.328
ProVal: 1.259 ± 0.314
ProTrp: 0.0 ± 0.0
ProTyr: 1.343 ± 0.295
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.183 ± 0.489
GlnCys: 0.336 ± 0.249
GlnAsp: 1.427 ± 0.316
GlnGlu: 2.602 ± 0.495
GlnPhe: 1.679 ± 0.442
GlnGly: 2.518 ± 0.342
GlnHis: 0.168 ± 0.103
GlnIle: 2.015 ± 0.422
GlnLys: 2.518 ± 0.484
GlnLeu: 3.694 ± 0.466
GlnMet: 1.343 ± 0.365
GlnAsn: 2.099 ± 0.442
GlnPro: 1.175 ± 0.303
GlnGln: 2.518 ± 0.705
GlnArg: 1.175 ± 0.352
GlnSer: 1.847 ± 0.467
GlnThr: 2.267 ± 0.444
GlnVal: 1.259 ± 0.281
GlnTrp: 0.168 ± 0.11
GlnTyr: 1.595 ± 0.397
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.175 ± 0.267
ArgCys: 1.175 ± 0.322
ArgAsp: 1.595 ± 0.326
ArgGlu: 4.113 ± 0.773
ArgPhe: 1.511 ± 0.394
ArgGly: 2.099 ± 0.349
ArgHis: 0.252 ± 0.144
ArgIle: 3.274 ± 0.512
ArgLys: 4.449 ± 0.723
ArgLeu: 2.854 ± 0.431
ArgMet: 1.091 ± 0.32
ArgAsn: 3.106 ± 0.423
ArgPro: 1.175 ± 0.314
ArgGln: 1.679 ± 0.4
ArgArg: 2.015 ± 0.504
ArgSer: 1.343 ± 0.338
ArgThr: 1.763 ± 0.352
ArgVal: 1.931 ± 0.443
ArgTrp: 0.588 ± 0.253
ArgTyr: 1.595 ± 0.372
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.77 ± 0.676
SerCys: 0.504 ± 0.206
SerAsp: 3.862 ± 0.609
SerGlu: 3.526 ± 0.524
SerPhe: 2.267 ± 0.465
SerGly: 2.854 ± 0.673
SerHis: 0.504 ± 0.145
SerIle: 5.876 ± 0.75
SerLys: 6.464 ± 0.773
SerLeu: 4.785 ± 0.59
SerMet: 1.343 ± 0.277
SerAsn: 4.449 ± 0.629
SerPro: 1.175 ± 0.279
SerGln: 1.931 ± 0.332
SerArg: 2.602 ± 0.355
SerSer: 2.267 ± 0.429
SerThr: 3.358 ± 0.606
SerVal: 2.77 ± 0.477
SerTrp: 0.839 ± 0.218
SerTyr: 2.854 ± 0.454
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.602 ± 0.518
ThrCys: 0.756 ± 0.292
ThrAsp: 2.77 ± 0.55
ThrGlu: 4.617 ± 0.556
ThrPhe: 2.435 ± 0.421
ThrGly: 3.106 ± 0.552
ThrHis: 0.756 ± 0.191
ThrIle: 4.281 ± 0.752
ThrLys: 4.113 ± 0.517
ThrLeu: 6.044 ± 0.733
ThrMet: 2.099 ± 0.438
ThrAsn: 4.113 ± 0.697
ThrPro: 2.267 ± 0.424
ThrGln: 1.847 ± 0.344
ThrArg: 1.931 ± 0.388
ThrSer: 3.862 ± 0.607
ThrThr: 2.938 ± 0.51
ThrVal: 2.435 ± 0.384
ThrTrp: 0.588 ± 0.185
ThrTyr: 1.931 ± 0.44
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.106 ± 0.533
ValCys: 0.672 ± 0.267
ValAsp: 3.19 ± 0.565
ValGlu: 3.862 ± 0.569
ValPhe: 2.015 ± 0.352
ValGly: 2.938 ± 0.611
ValHis: 0.252 ± 0.123
ValIle: 4.701 ± 0.644
ValLys: 5.709 ± 0.751
ValLeu: 3.946 ± 0.441
ValMet: 1.175 ± 0.235
ValAsn: 2.938 ± 0.447
ValPro: 1.595 ± 0.323
ValGln: 1.427 ± 0.265
ValArg: 2.267 ± 0.433
ValSer: 3.106 ± 0.455
ValThr: 3.106 ± 0.464
ValVal: 4.03 ± 0.743
ValTrp: 0.672 ± 0.215
ValTyr: 2.351 ± 0.518
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.504 ± 0.229
TrpCys: 0.168 ± 0.109
TrpAsp: 0.756 ± 0.226
TrpGlu: 1.343 ± 0.349
TrpPhe: 0.42 ± 0.245
TrpGly: 0.588 ± 0.194
TrpHis: 0.168 ± 0.127
TrpIle: 0.923 ± 0.272
TrpLys: 0.168 ± 0.11
TrpLeu: 1.259 ± 0.33
TrpMet: 0.084 ± 0.075
TrpAsn: 0.672 ± 0.215
TrpPro: 0.0 ± 0.0
TrpGln: 0.504 ± 0.174
TrpArg: 0.252 ± 0.13
TrpSer: 0.839 ± 0.264
TrpThr: 0.504 ± 0.2
TrpVal: 0.588 ± 0.216
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.504 ± 0.269
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.679 ± 0.357
TyrCys: 0.756 ± 0.212
TyrAsp: 2.267 ± 0.372
TyrGlu: 4.03 ± 0.598
TyrPhe: 1.763 ± 0.359
TyrGly: 2.183 ± 0.476
TyrHis: 0.336 ± 0.186
TyrIle: 2.77 ± 0.378
TyrLys: 5.037 ± 0.676
TyrLeu: 3.694 ± 0.486
TyrMet: 0.672 ± 0.209
TyrAsn: 3.778 ± 0.641
TyrPro: 1.091 ± 0.345
TyrGln: 1.259 ± 0.334
TyrArg: 1.931 ± 0.47
TyrSer: 3.106 ± 0.441
TyrThr: 2.267 ± 0.453
TyrVal: 1.175 ± 0.298
TyrTrp: 0.42 ± 0.173
TyrTyr: 2.351 ± 0.495
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (11913 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
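A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation as the error measure are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def per_mille(proteins):
    """Dipeptide frequencies in per mille; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(replicates)

# Toy sequences, not taken from the CPD1 proteome.
toy = ["MKEKEL", "AKELLK", "GGKEAA", "LLLLKE"]
err = bootstrap_error(toy, "KE")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.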

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski