Amino acid dipepetide frequency for Clostridium phage phiCD505

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.986AlaAla: 1.986 ± 0.543
0.411AlaCys: 0.411 ± 0.202
2.397AlaAsp: 2.397 ± 0.413
4.315AlaGlu: 4.315 ± 0.607
1.781AlaPhe: 1.781 ± 0.441
2.671AlaGly: 2.671 ± 0.705
0.479AlaHis: 0.479 ± 0.182
4.11AlaIle: 4.11 ± 0.676
5.479AlaLys: 5.479 ± 0.528
5.616AlaLeu: 5.616 ± 0.674
1.918AlaMet: 1.918 ± 0.318
2.466AlaAsn: 2.466 ± 0.407
0.89AlaPro: 0.89 ± 0.26
1.301AlaGln: 1.301 ± 0.25
1.849AlaArg: 1.849 ± 0.378
4.247AlaSer: 4.247 ± 0.747
4.384AlaThr: 4.384 ± 0.586
3.425AlaVal: 3.425 ± 0.57
0.411AlaTrp: 0.411 ± 0.143
1.986AlaTyr: 1.986 ± 0.386
0.0AlaXaa: 0.0 ± 0.0
Cys
0.479CysAla: 0.479 ± 0.172
0.342CysCys: 0.342 ± 0.161
0.959CysAsp: 0.959 ± 0.294
1.027CysGlu: 1.027 ± 0.331
0.274CysPhe: 0.274 ± 0.16
0.822CysGly: 0.822 ± 0.205
0.068CysHis: 0.068 ± 0.069
1.438CysIle: 1.438 ± 0.291
1.507CysLys: 1.507 ± 0.374
0.685CysLeu: 0.685 ± 0.223
0.479CysMet: 0.479 ± 0.2
0.342CysAsn: 0.342 ± 0.167
0.274CysPro: 0.274 ± 0.131
0.205CysGln: 0.205 ± 0.103
0.616CysArg: 0.616 ± 0.212
0.548CysSer: 0.548 ± 0.191
0.411CysThr: 0.411 ± 0.16
0.548CysVal: 0.548 ± 0.176
0.274CysTrp: 0.274 ± 0.148
0.685CysTyr: 0.685 ± 0.216
0.0CysXaa: 0.0 ± 0.0
Asp
2.466AspAla: 2.466 ± 0.326
0.685AspCys: 0.685 ± 0.239
3.63AspAsp: 3.63 ± 0.552
5.205AspGlu: 5.205 ± 0.505
2.808AspPhe: 2.808 ± 0.503
3.699AspGly: 3.699 ± 0.573
0.068AspHis: 0.068 ± 0.06
6.164AspIle: 6.164 ± 0.599
6.164AspLys: 6.164 ± 0.805
4.11AspLeu: 4.11 ± 0.458
1.781AspMet: 1.781 ± 0.294
4.452AspAsn: 4.452 ± 0.606
0.959AspPro: 0.959 ± 0.277
0.411AspGln: 0.411 ± 0.171
2.329AspArg: 2.329 ± 0.394
3.493AspSer: 3.493 ± 0.425
2.808AspThr: 2.808 ± 0.406
3.151AspVal: 3.151 ± 0.437
0.411AspTrp: 0.411 ± 0.171
2.123AspTyr: 2.123 ± 0.458
0.0AspXaa: 0.0 ± 0.0
Glu
4.795GluAla: 4.795 ± 0.708
1.164GluCys: 1.164 ± 0.307
4.247GluAsp: 4.247 ± 0.619
7.192GluGlu: 7.192 ± 0.797
3.767GluPhe: 3.767 ± 0.551
4.384GluGly: 4.384 ± 0.498
0.822GluHis: 0.822 ± 0.23
7.945GluIle: 7.945 ± 0.838
9.041GluLys: 9.041 ± 1.065
8.493GluLeu: 8.493 ± 0.758
3.219GluMet: 3.219 ± 0.656
6.575GluAsn: 6.575 ± 0.914
1.096GluPro: 1.096 ± 0.242
2.808GluGln: 2.808 ± 0.39
3.356GluArg: 3.356 ± 0.544
3.288GluSer: 3.288 ± 0.43
4.589GluThr: 4.589 ± 0.595
5.0GluVal: 5.0 ± 0.619
1.027GluTrp: 1.027 ± 0.306
3.562GluTyr: 3.562 ± 0.579
0.0GluXaa: 0.0 ± 0.0
Phe
1.438PheAla: 1.438 ± 0.379
0.274PheCys: 0.274 ± 0.138
2.329PheAsp: 2.329 ± 0.303
3.699PheGlu: 3.699 ± 0.578
1.37PhePhe: 1.37 ± 0.339
2.534PheGly: 2.534 ± 0.399
0.548PheHis: 0.548 ± 0.189
3.904PheIle: 3.904 ± 0.501
4.315PheLys: 4.315 ± 0.558
2.74PheLeu: 2.74 ± 0.377
0.685PheMet: 0.685 ± 0.222
3.014PheAsn: 3.014 ± 0.432
0.959PhePro: 0.959 ± 0.297
0.753PheGln: 0.753 ± 0.229
1.644PheArg: 1.644 ± 0.355
2.534PheSer: 2.534 ± 0.405
2.466PheThr: 2.466 ± 0.365
2.397PheVal: 2.397 ± 0.602
0.342PheTrp: 0.342 ± 0.214
1.575PheTyr: 1.575 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
3.014GlyAla: 3.014 ± 0.703
0.753GlyCys: 0.753 ± 0.263
2.603GlyAsp: 2.603 ± 0.395
5.822GlyGlu: 5.822 ± 0.784
3.63GlyPhe: 3.63 ± 0.532
4.932GlyGly: 4.932 ± 1.505
0.822GlyHis: 0.822 ± 0.263
4.384GlyIle: 4.384 ± 0.747
5.0GlyLys: 5.0 ± 0.451
4.178GlyLeu: 4.178 ± 0.658
2.055GlyMet: 2.055 ± 0.303
3.356GlyAsn: 3.356 ± 0.461
0.548GlyPro: 0.548 ± 0.206
1.712GlyGln: 1.712 ± 0.413
2.055GlyArg: 2.055 ± 0.35
3.425GlySer: 3.425 ± 0.386
3.836GlyThr: 3.836 ± 0.681
4.178GlyVal: 4.178 ± 0.626
0.685GlyTrp: 0.685 ± 0.182
2.466GlyTyr: 2.466 ± 0.532
0.0GlyXaa: 0.0 ± 0.0
His
0.274HisAla: 0.274 ± 0.156
0.342HisCys: 0.342 ± 0.14
0.411HisAsp: 0.411 ± 0.197
0.685HisGlu: 0.685 ± 0.22
0.616HisPhe: 0.616 ± 0.201
0.411HisGly: 0.411 ± 0.187
0.137HisHis: 0.137 ± 0.094
0.616HisIle: 0.616 ± 0.244
1.164HisLys: 1.164 ± 0.339
0.89HisLeu: 0.89 ± 0.267
0.205HisMet: 0.205 ± 0.124
0.822HisAsn: 0.822 ± 0.24
0.479HisPro: 0.479 ± 0.244
0.411HisGln: 0.411 ± 0.181
0.137HisArg: 0.137 ± 0.081
0.685HisSer: 0.685 ± 0.199
0.685HisThr: 0.685 ± 0.219
0.274HisVal: 0.274 ± 0.129
0.068HisTrp: 0.068 ± 0.067
0.479HisTyr: 0.479 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
5.068IleAla: 5.068 ± 0.576
1.233IleCys: 1.233 ± 0.406
5.959IleAsp: 5.959 ± 0.704
6.986IleGlu: 6.986 ± 0.878
2.466IlePhe: 2.466 ± 0.438
5.137IleGly: 5.137 ± 0.524
1.027IleHis: 1.027 ± 0.268
6.027IleIle: 6.027 ± 0.672
10.137IleLys: 10.137 ± 1.0
7.26IleLeu: 7.26 ± 0.76
1.918IleMet: 1.918 ± 0.337
6.712IleAsn: 6.712 ± 0.691
2.329IlePro: 2.329 ± 0.338
2.671IleGln: 2.671 ± 0.433
3.562IleArg: 3.562 ± 0.635
5.959IleSer: 5.959 ± 1.041
4.521IleThr: 4.521 ± 0.54
4.863IleVal: 4.863 ± 0.618
0.685IleTrp: 0.685 ± 0.227
3.219IleTyr: 3.219 ± 0.58
0.0IleXaa: 0.0 ± 0.0
Lys
5.959LysAla: 5.959 ± 0.754
1.233LysCys: 1.233 ± 0.309
7.26LysAsp: 7.26 ± 0.801
11.644LysGlu: 11.644 ± 0.963
3.425LysPhe: 3.425 ± 0.395
5.753LysGly: 5.753 ± 0.457
0.616LysHis: 0.616 ± 0.226
8.973LysIle: 8.973 ± 0.759
10.548LysLys: 10.548 ± 1.351
7.671LysLeu: 7.671 ± 0.756
2.123LysMet: 2.123 ± 0.48
7.945LysAsn: 7.945 ± 0.855
2.192LysPro: 2.192 ± 0.378
4.11LysGln: 4.11 ± 0.576
4.11LysArg: 4.11 ± 0.628
6.575LysSer: 6.575 ± 0.737
4.452LysThr: 4.452 ± 0.436
7.055LysVal: 7.055 ± 0.574
1.164LysTrp: 1.164 ± 0.301
4.863LysTyr: 4.863 ± 0.665
0.0LysXaa: 0.0 ± 0.0
Leu
4.247LeuAla: 4.247 ± 0.52
0.959LeuCys: 0.959 ± 0.269
5.342LeuAsp: 5.342 ± 0.525
8.493LeuGlu: 8.493 ± 0.957
2.603LeuPhe: 2.603 ± 0.427
4.932LeuGly: 4.932 ± 0.748
0.479LeuHis: 0.479 ± 0.16
6.027LeuIle: 6.027 ± 0.621
11.37LeuLys: 11.37 ± 0.837
6.164LeuLeu: 6.164 ± 0.73
1.233LeuMet: 1.233 ± 0.26
5.89LeuAsn: 5.89 ± 0.687
1.438LeuPro: 1.438 ± 0.34
2.534LeuGln: 2.534 ± 0.452
3.151LeuArg: 3.151 ± 0.37
5.479LeuSer: 5.479 ± 0.596
5.548LeuThr: 5.548 ± 0.658
4.11LeuVal: 4.11 ± 0.472
0.342LeuTrp: 0.342 ± 0.164
3.219LeuTyr: 3.219 ± 0.492
0.0LeuXaa: 0.0 ± 0.0
Met
2.055MetAla: 2.055 ± 0.394
0.205MetCys: 0.205 ± 0.141
1.301MetAsp: 1.301 ± 0.31
1.507MetGlu: 1.507 ± 0.309
0.753MetPhe: 0.753 ± 0.165
1.027MetGly: 1.027 ± 0.289
0.274MetHis: 0.274 ± 0.128
1.781MetIle: 1.781 ± 0.367
2.671MetLys: 2.671 ± 0.487
2.534MetLeu: 2.534 ± 0.491
0.137MetMet: 0.137 ± 0.092
1.644MetAsn: 1.644 ± 0.389
0.685MetPro: 0.685 ± 0.202
0.685MetGln: 0.685 ± 0.193
0.685MetArg: 0.685 ± 0.161
1.781MetSer: 1.781 ± 0.39
1.37MetThr: 1.37 ± 0.294
0.959MetVal: 0.959 ± 0.252
0.479MetTrp: 0.479 ± 0.171
0.822MetTyr: 0.822 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
4.178AsnAla: 4.178 ± 0.51
0.685AsnCys: 0.685 ± 0.242
3.356AsnAsp: 3.356 ± 0.655
5.068AsnGlu: 5.068 ± 0.666
2.74AsnPhe: 2.74 ± 0.386
4.452AsnGly: 4.452 ± 0.6
0.342AsnHis: 0.342 ± 0.16
7.055AsnIle: 7.055 ± 0.673
7.192AsnLys: 7.192 ± 0.866
5.959AsnLeu: 5.959 ± 0.553
1.575AsnMet: 1.575 ± 0.328
5.0AsnAsn: 5.0 ± 0.782
1.781AsnPro: 1.781 ± 0.378
1.301AsnGln: 1.301 ± 0.285
2.534AsnArg: 2.534 ± 0.399
4.658AsnSer: 4.658 ± 0.573
2.945AsnThr: 2.945 ± 0.45
4.11AsnVal: 4.11 ± 0.545
0.753AsnTrp: 0.753 ± 0.241
1.849AsnTyr: 1.849 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
0.89ProAla: 0.89 ± 0.258
0.205ProCys: 0.205 ± 0.124
0.685ProAsp: 0.685 ± 0.234
1.233ProGlu: 1.233 ± 0.29
1.027ProPhe: 1.027 ± 0.223
0.89ProGly: 0.89 ± 0.212
0.411ProHis: 0.411 ± 0.204
2.534ProIle: 2.534 ± 0.397
2.26ProLys: 2.26 ± 0.421
1.712ProLeu: 1.712 ± 0.397
0.411ProMet: 0.411 ± 0.152
1.233ProAsn: 1.233 ± 0.325
0.274ProPro: 0.274 ± 0.102
0.959ProGln: 0.959 ± 0.22
1.027ProArg: 1.027 ± 0.263
1.096ProSer: 1.096 ± 0.255
2.055ProThr: 2.055 ± 0.35
1.096ProVal: 1.096 ± 0.256
0.068ProTrp: 0.068 ± 0.067
0.616ProTyr: 0.616 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
2.26GlnAla: 2.26 ± 0.401
0.068GlnCys: 0.068 ± 0.061
1.575GlnAsp: 1.575 ± 0.327
2.671GlnGlu: 2.671 ± 0.489
0.89GlnPhe: 0.89 ± 0.185
1.781GlnGly: 1.781 ± 0.304
0.342GlnHis: 0.342 ± 0.144
2.603GlnIle: 2.603 ± 0.383
2.671GlnLys: 2.671 ± 0.483
2.877GlnLeu: 2.877 ± 0.45
0.411GlnMet: 0.411 ± 0.126
1.781GlnAsn: 1.781 ± 0.3
0.411GlnPro: 0.411 ± 0.165
0.89GlnGln: 0.89 ± 0.267
0.616GlnArg: 0.616 ± 0.231
1.849GlnSer: 1.849 ± 0.421
1.986GlnThr: 1.986 ± 0.275
1.37GlnVal: 1.37 ± 0.304
0.205GlnTrp: 0.205 ± 0.11
0.959GlnTyr: 0.959 ± 0.327
0.0GlnXaa: 0.0 ± 0.0
Arg
1.644ArgAla: 1.644 ± 0.311
0.548ArgCys: 0.548 ± 0.214
1.507ArgAsp: 1.507 ± 0.33
3.699ArgGlu: 3.699 ± 0.512
1.096ArgPhe: 1.096 ± 0.267
2.055ArgGly: 2.055 ± 0.386
0.548ArgHis: 0.548 ± 0.189
4.178ArgIle: 4.178 ± 0.554
3.288ArgLys: 3.288 ± 0.449
3.493ArgLeu: 3.493 ± 0.642
1.575ArgMet: 1.575 ± 0.285
1.301ArgAsn: 1.301 ± 0.287
0.959ArgPro: 0.959 ± 0.243
0.822ArgGln: 0.822 ± 0.217
1.301ArgArg: 1.301 ± 0.395
1.507ArgSer: 1.507 ± 0.28
2.123ArgThr: 2.123 ± 0.401
2.877ArgVal: 2.877 ± 0.338
0.548ArgTrp: 0.548 ± 0.194
1.096ArgTyr: 1.096 ± 0.237
0.0ArgXaa: 0.0 ± 0.0
Ser
2.945SerAla: 2.945 ± 0.638
0.616SerCys: 0.616 ± 0.175
3.836SerAsp: 3.836 ± 0.535
4.247SerGlu: 4.247 ± 0.533
3.425SerPhe: 3.425 ± 0.489
3.836SerGly: 3.836 ± 0.763
0.822SerHis: 0.822 ± 0.355
5.959SerIle: 5.959 ± 0.703
6.438SerLys: 6.438 ± 0.598
4.795SerLeu: 4.795 ± 0.626
1.164SerMet: 1.164 ± 0.25
5.137SerAsn: 5.137 ± 0.75
1.233SerPro: 1.233 ± 0.261
1.918SerGln: 1.918 ± 0.377
1.644SerArg: 1.644 ± 0.236
5.205SerSer: 5.205 ± 0.78
3.425SerThr: 3.425 ± 0.392
3.082SerVal: 3.082 ± 0.478
0.342SerTrp: 0.342 ± 0.174
3.014SerTyr: 3.014 ± 0.503
0.0SerXaa: 0.0 ± 0.0
Thr
3.767ThrAla: 3.767 ± 0.581
0.411ThrCys: 0.411 ± 0.204
3.151ThrAsp: 3.151 ± 0.515
3.836ThrGlu: 3.836 ± 0.607
2.603ThrPhe: 2.603 ± 0.403
3.356ThrGly: 3.356 ± 0.487
0.822ThrHis: 0.822 ± 0.187
5.205ThrIle: 5.205 ± 0.64
7.055ThrLys: 7.055 ± 0.617
5.89ThrLeu: 5.89 ± 0.627
0.822ThrMet: 0.822 ± 0.295
2.534ThrAsn: 2.534 ± 0.349
2.26ThrPro: 2.26 ± 0.5
1.644ThrGln: 1.644 ± 0.3
1.644ThrArg: 1.644 ± 0.272
4.11ThrSer: 4.11 ± 0.476
3.562ThrThr: 3.562 ± 0.585
3.425ThrVal: 3.425 ± 0.577
0.479ThrTrp: 0.479 ± 0.173
1.849ThrTyr: 1.849 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
2.877ValAla: 2.877 ± 0.428
0.753ValCys: 0.753 ± 0.226
3.356ValAsp: 3.356 ± 0.585
4.863ValGlu: 4.863 ± 0.576
2.192ValPhe: 2.192 ± 0.47
4.247ValGly: 4.247 ± 0.76
0.411ValHis: 0.411 ± 0.143
4.726ValIle: 4.726 ± 0.724
5.89ValLys: 5.89 ± 0.584
4.658ValLeu: 4.658 ± 0.546
0.822ValMet: 0.822 ± 0.195
4.178ValAsn: 4.178 ± 0.467
0.685ValPro: 0.685 ± 0.199
1.37ValGln: 1.37 ± 0.316
2.397ValArg: 2.397 ± 0.343
3.63ValSer: 3.63 ± 0.594
4.041ValThr: 4.041 ± 0.508
4.315ValVal: 4.315 ± 0.636
0.822ValTrp: 0.822 ± 0.251
2.329ValTyr: 2.329 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
0.548TrpAla: 0.548 ± 0.179
0.205TrpCys: 0.205 ± 0.118
0.616TrpAsp: 0.616 ± 0.186
0.959TrpGlu: 0.959 ± 0.235
0.342TrpPhe: 0.342 ± 0.16
0.685TrpGly: 0.685 ± 0.165
0.068TrpHis: 0.068 ± 0.07
0.959TrpIle: 0.959 ± 0.268
0.411TrpLys: 0.411 ± 0.154
0.616TrpLeu: 0.616 ± 0.193
0.137TrpMet: 0.137 ± 0.093
0.616TrpAsn: 0.616 ± 0.178
0.205TrpPro: 0.205 ± 0.132
0.479TrpGln: 0.479 ± 0.153
0.479TrpArg: 0.479 ± 0.242
0.342TrpSer: 0.342 ± 0.136
0.616TrpThr: 0.616 ± 0.258
0.822TrpVal: 0.822 ± 0.216
0.274TrpTrp: 0.274 ± 0.15
0.342TrpTyr: 0.342 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.233TyrAla: 1.233 ± 0.357
0.89TyrCys: 0.89 ± 0.244
2.397TyrAsp: 2.397 ± 0.457
3.219TyrGlu: 3.219 ± 0.48
1.712TyrPhe: 1.712 ± 0.298
1.849TyrGly: 1.849 ± 0.395
0.685TyrHis: 0.685 ± 0.215
3.219TyrIle: 3.219 ± 0.474
5.068TyrLys: 5.068 ± 0.745
3.219TyrLeu: 3.219 ± 0.542
0.616TyrMet: 0.616 ± 0.195
2.534TyrAsn: 2.534 ± 0.428
1.027TyrPro: 1.027 ± 0.317
1.233TyrGln: 1.233 ± 0.253
1.027TyrArg: 1.027 ± 0.241
2.671TyrSer: 2.671 ± 0.428
2.603TyrThr: 2.603 ± 0.529
1.507TyrVal: 1.507 ± 0.286
0.342TyrTrp: 0.342 ± 0.185
2.055TyrTyr: 2.055 ± 0.426
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (14601 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski