Amino acid dipepetide frequency for Clostridium phage phiCTP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.04AlaAla: 3.04 ± 0.78
0.326AlaCys: 0.326 ± 0.154
4.885AlaAsp: 4.885 ± 0.589
3.746AlaGlu: 3.746 ± 0.47
2.008AlaPhe: 2.008 ± 0.484
3.691AlaGly: 3.691 ± 0.596
1.086AlaHis: 1.086 ± 0.222
4.994AlaIle: 4.994 ± 0.766
5.754AlaLys: 5.754 ± 0.62
6.568AlaLeu: 6.568 ± 0.732
1.628AlaMet: 1.628 ± 0.314
3.908AlaAsn: 3.908 ± 0.628
2.008AlaPro: 2.008 ± 0.339
3.094AlaGln: 3.094 ± 0.455
1.737AlaArg: 1.737 ± 0.268
4.18AlaSer: 4.18 ± 0.689
4.017AlaThr: 4.017 ± 0.584
4.451AlaVal: 4.451 ± 0.596
0.76AlaTrp: 0.76 ± 0.231
2.931AlaTyr: 2.931 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
0.543CysAla: 0.543 ± 0.186
0.597CysCys: 0.597 ± 0.232
1.194CysAsp: 1.194 ± 0.353
0.814CysGlu: 0.814 ± 0.248
0.597CysPhe: 0.597 ± 0.201
0.977CysGly: 0.977 ± 0.267
0.38CysHis: 0.38 ± 0.195
0.814CysIle: 0.814 ± 0.241
1.249CysLys: 1.249 ± 0.294
1.14CysLeu: 1.14 ± 0.287
0.597CysMet: 0.597 ± 0.147
1.14CysAsn: 1.14 ± 0.254
0.597CysPro: 0.597 ± 0.206
0.271CysGln: 0.271 ± 0.131
0.38CysArg: 0.38 ± 0.154
0.597CysSer: 0.597 ± 0.202
0.814CysThr: 0.814 ± 0.175
0.977CysVal: 0.977 ± 0.256
0.38CysTrp: 0.38 ± 0.127
0.76CysTyr: 0.76 ± 0.265
0.0CysXaa: 0.0 ± 0.0
Asp
5.483AspAla: 5.483 ± 1.002
1.086AspCys: 1.086 ± 0.236
5.428AspAsp: 5.428 ± 0.627
3.691AspGlu: 3.691 ± 0.539
3.148AspPhe: 3.148 ± 0.452
4.614AspGly: 4.614 ± 0.507
0.543AspHis: 0.543 ± 0.177
6.025AspIle: 6.025 ± 0.637
6.568AspLys: 6.568 ± 0.552
5.971AspLeu: 5.971 ± 0.556
1.737AspMet: 1.737 ± 0.359
4.18AspAsn: 4.18 ± 0.442
0.923AspPro: 0.923 ± 0.253
1.086AspGln: 1.086 ± 0.197
2.443AspArg: 2.443 ± 0.299
3.366AspSer: 3.366 ± 0.446
4.071AspThr: 4.071 ± 0.539
4.885AspVal: 4.885 ± 0.504
0.597AspTrp: 0.597 ± 0.18
3.094AspTyr: 3.094 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
3.257GluAla: 3.257 ± 0.354
0.869GluCys: 0.869 ± 0.205
4.56GluAsp: 4.56 ± 0.524
3.366GluGlu: 3.366 ± 0.449
2.931GluPhe: 2.931 ± 0.491
2.388GluGly: 2.388 ± 0.4
0.923GluHis: 0.923 ± 0.196
4.071GluIle: 4.071 ± 0.506
4.885GluLys: 4.885 ± 0.715
5.754GluLeu: 5.754 ± 0.521
1.846GluMet: 1.846 ± 0.281
2.768GluAsn: 2.768 ± 0.487
1.303GluPro: 1.303 ± 0.253
1.9GluGln: 1.9 ± 0.341
1.411GluArg: 1.411 ± 0.26
2.497GluSer: 2.497 ± 0.378
2.334GluThr: 2.334 ± 0.331
4.017GluVal: 4.017 ± 0.439
0.434GluTrp: 0.434 ± 0.2
2.443GluTyr: 2.443 ± 0.466
0.0GluXaa: 0.0 ± 0.0
Phe
1.683PheAla: 1.683 ± 0.384
0.814PheCys: 0.814 ± 0.219
3.203PheAsp: 3.203 ± 0.387
1.954PheGlu: 1.954 ± 0.363
1.52PhePhe: 1.52 ± 0.378
3.04PheGly: 3.04 ± 0.553
0.38PheHis: 0.38 ± 0.157
3.04PheIle: 3.04 ± 0.422
4.397PheLys: 4.397 ± 0.559
2.497PheLeu: 2.497 ± 0.41
1.031PheMet: 1.031 ± 0.241
3.366PheAsn: 3.366 ± 0.422
0.814PhePro: 0.814 ± 0.236
0.869PheGln: 0.869 ± 0.268
1.303PheArg: 1.303 ± 0.242
3.094PheSer: 3.094 ± 0.449
2.551PheThr: 2.551 ± 0.386
2.117PheVal: 2.117 ± 0.386
0.326PheTrp: 0.326 ± 0.127
1.357PheTyr: 1.357 ± 0.285
0.0PheXaa: 0.0 ± 0.0
Gly
4.017GlyAla: 4.017 ± 0.676
0.814GlyCys: 0.814 ± 0.199
4.017GlyAsp: 4.017 ± 0.527
3.366GlyGlu: 3.366 ± 0.473
1.9GlyPhe: 1.9 ± 0.337
4.288GlyGly: 4.288 ± 0.594
0.814GlyHis: 0.814 ± 0.315
5.048GlyIle: 5.048 ± 0.783
5.645GlyLys: 5.645 ± 0.503
5.103GlyLeu: 5.103 ± 0.658
1.737GlyMet: 1.737 ± 0.357
3.963GlyAsn: 3.963 ± 0.372
0.651GlyPro: 0.651 ± 0.172
1.683GlyGln: 1.683 ± 0.379
1.411GlyArg: 1.411 ± 0.298
3.963GlySer: 3.963 ± 0.535
6.46GlyThr: 6.46 ± 1.013
3.854GlyVal: 3.854 ± 0.398
0.543GlyTrp: 0.543 ± 0.177
3.257GlyTyr: 3.257 ± 0.399
0.0GlyXaa: 0.0 ± 0.0
His
0.597HisAla: 0.597 ± 0.176
0.38HisCys: 0.38 ± 0.131
0.977HisAsp: 0.977 ± 0.245
0.869HisGlu: 0.869 ± 0.207
0.651HisPhe: 0.651 ± 0.187
0.977HisGly: 0.977 ± 0.238
0.109HisHis: 0.109 ± 0.073
0.869HisIle: 0.869 ± 0.253
1.954HisLys: 1.954 ± 0.448
1.303HisLeu: 1.303 ± 0.241
0.38HisMet: 0.38 ± 0.127
1.249HisAsn: 1.249 ± 0.273
0.38HisPro: 0.38 ± 0.141
0.434HisGln: 0.434 ± 0.165
0.271HisArg: 0.271 ± 0.109
0.651HisSer: 0.651 ± 0.215
0.923HisThr: 0.923 ± 0.294
0.76HisVal: 0.76 ± 0.202
0.163HisTrp: 0.163 ± 0.088
0.706HisTyr: 0.706 ± 0.216
0.0HisXaa: 0.0 ± 0.0
Ile
4.94IleAla: 4.94 ± 0.589
1.249IleCys: 1.249 ± 0.344
5.32IleAsp: 5.32 ± 0.529
3.528IleGlu: 3.528 ± 0.553
2.823IlePhe: 2.823 ± 0.418
5.754IleGly: 5.754 ± 0.58
0.869IleHis: 0.869 ± 0.221
5.917IleIle: 5.917 ± 0.636
7.545IleLys: 7.545 ± 0.508
5.157IleLeu: 5.157 ± 0.595
1.737IleMet: 1.737 ± 0.331
5.211IleAsn: 5.211 ± 0.661
2.551IlePro: 2.551 ± 0.414
2.606IleGln: 2.606 ± 0.439
3.474IleArg: 3.474 ± 0.527
4.234IleSer: 4.234 ± 0.68
5.917IleThr: 5.917 ± 0.88
4.126IleVal: 4.126 ± 0.476
0.597IleTrp: 0.597 ± 0.208
3.474IleTyr: 3.474 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
5.157LysAla: 5.157 ± 0.7
1.303LysCys: 1.303 ± 0.308
5.917LysAsp: 5.917 ± 0.635
5.645LysGlu: 5.645 ± 0.675
2.823LysPhe: 2.823 ± 0.437
4.397LysGly: 4.397 ± 0.448
1.628LysHis: 1.628 ± 0.28
6.46LysIle: 6.46 ± 0.733
7.491LysLys: 7.491 ± 0.727
7.98LysLeu: 7.98 ± 0.732
4.017LysMet: 4.017 ± 0.501
6.134LysAsn: 6.134 ± 0.552
2.986LysPro: 2.986 ± 0.46
3.908LysGln: 3.908 ± 0.448
3.474LysArg: 3.474 ± 0.51
3.746LysSer: 3.746 ± 0.442
4.831LysThr: 4.831 ± 0.499
6.514LysVal: 6.514 ± 0.647
1.357LysTrp: 1.357 ± 0.324
5.211LysTyr: 5.211 ± 0.557
0.0LysXaa: 0.0 ± 0.0
Leu
6.568LeuAla: 6.568 ± 0.589
0.869LeuCys: 0.869 ± 0.23
7.654LeuAsp: 7.654 ± 0.827
5.211LeuGlu: 5.211 ± 0.72
3.583LeuPhe: 3.583 ± 0.43
5.048LeuGly: 5.048 ± 0.592
1.194LeuHis: 1.194 ± 0.241
7.274LeuIle: 7.274 ± 0.571
7.925LeuLys: 7.925 ± 0.624
6.568LeuLeu: 6.568 ± 0.658
1.791LeuMet: 1.791 ± 0.311
6.623LeuAsn: 6.623 ± 0.573
2.986LeuPro: 2.986 ± 0.382
2.66LeuGln: 2.66 ± 0.398
2.66LeuArg: 2.66 ± 0.378
4.505LeuSer: 4.505 ± 0.451
5.211LeuThr: 5.211 ± 0.463
4.614LeuVal: 4.614 ± 0.5
0.434LeuTrp: 0.434 ± 0.169
3.637LeuTyr: 3.637 ± 0.47
0.0LeuXaa: 0.0 ± 0.0
Met
1.574MetAla: 1.574 ± 0.28
0.434MetCys: 0.434 ± 0.186
1.737MetAsp: 1.737 ± 0.318
1.466MetGlu: 1.466 ± 0.337
0.706MetPhe: 0.706 ± 0.215
1.574MetGly: 1.574 ± 0.253
0.38MetHis: 0.38 ± 0.164
1.846MetIle: 1.846 ± 0.326
2.443MetLys: 2.443 ± 0.458
2.714MetLeu: 2.714 ± 0.455
0.76MetMet: 0.76 ± 0.226
2.063MetAsn: 2.063 ± 0.363
1.031MetPro: 1.031 ± 0.231
1.086MetGln: 1.086 ± 0.285
1.086MetArg: 1.086 ± 0.265
1.954MetSer: 1.954 ± 0.327
1.14MetThr: 1.14 ± 0.227
1.303MetVal: 1.303 ± 0.315
0.217MetTrp: 0.217 ± 0.103
1.411MetTyr: 1.411 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
4.723AsnAla: 4.723 ± 0.602
1.194AsnCys: 1.194 ± 0.312
3.366AsnAsp: 3.366 ± 0.399
3.691AsnGlu: 3.691 ± 0.372
2.877AsnPhe: 2.877 ± 0.328
4.56AsnGly: 4.56 ± 0.542
0.76AsnHis: 0.76 ± 0.239
6.134AsnIle: 6.134 ± 0.685
6.731AsnLys: 6.731 ± 0.786
6.297AsnLeu: 6.297 ± 0.561
1.14AsnMet: 1.14 ± 0.298
4.885AsnAsn: 4.885 ± 0.432
2.117AsnPro: 2.117 ± 0.325
1.846AsnGln: 1.846 ± 0.376
2.388AsnArg: 2.388 ± 0.398
4.505AsnSer: 4.505 ± 0.594
3.8AsnThr: 3.8 ± 0.523
3.637AsnVal: 3.637 ± 0.475
0.706AsnTrp: 0.706 ± 0.187
3.04AsnTyr: 3.04 ± 0.44
0.0AsnXaa: 0.0 ± 0.0
Pro
2.388ProAla: 2.388 ± 0.393
0.271ProCys: 0.271 ± 0.137
1.9ProAsp: 1.9 ± 0.333
1.303ProGlu: 1.303 ± 0.294
1.411ProPhe: 1.411 ± 0.286
0.706ProGly: 0.706 ± 0.203
0.923ProHis: 0.923 ± 0.202
2.334ProIle: 2.334 ± 0.29
2.443ProLys: 2.443 ± 0.427
2.931ProLeu: 2.931 ± 0.431
0.434ProMet: 0.434 ± 0.152
1.249ProAsn: 1.249 ± 0.293
0.597ProPro: 0.597 ± 0.206
1.194ProGln: 1.194 ± 0.259
0.869ProArg: 0.869 ± 0.287
1.628ProSer: 1.628 ± 0.273
1.737ProThr: 1.737 ± 0.292
1.954ProVal: 1.954 ± 0.335
0.109ProTrp: 0.109 ± 0.073
1.466ProTyr: 1.466 ± 0.31
0.0ProXaa: 0.0 ± 0.0
Gln
2.551GlnAla: 2.551 ± 0.408
0.651GlnCys: 0.651 ± 0.207
1.954GlnAsp: 1.954 ± 0.298
1.249GlnGlu: 1.249 ± 0.276
1.411GlnPhe: 1.411 ± 0.278
1.791GlnGly: 1.791 ± 0.3
0.38GlnHis: 0.38 ± 0.162
1.9GlnIle: 1.9 ± 0.409
2.443GlnLys: 2.443 ± 0.364
3.094GlnLeu: 3.094 ± 0.498
0.869GlnMet: 0.869 ± 0.214
2.063GlnAsn: 2.063 ± 0.389
0.76GlnPro: 0.76 ± 0.186
0.869GlnGln: 0.869 ± 0.222
1.031GlnArg: 1.031 ± 0.279
1.52GlnSer: 1.52 ± 0.324
2.226GlnThr: 2.226 ± 0.336
2.171GlnVal: 2.171 ± 0.348
0.271GlnTrp: 0.271 ± 0.141
1.846GlnTyr: 1.846 ± 0.344
0.0GlnXaa: 0.0 ± 0.0
Arg
2.008ArgAla: 2.008 ± 0.367
0.597ArgCys: 0.597 ± 0.209
1.791ArgAsp: 1.791 ± 0.348
1.14ArgGlu: 1.14 ± 0.242
2.171ArgPhe: 2.171 ± 0.348
1.9ArgGly: 1.9 ± 0.341
0.706ArgHis: 0.706 ± 0.221
2.334ArgIle: 2.334 ± 0.461
2.768ArgLys: 2.768 ± 0.493
3.203ArgLeu: 3.203 ± 0.533
1.357ArgMet: 1.357 ± 0.27
2.117ArgAsn: 2.117 ± 0.329
0.543ArgPro: 0.543 ± 0.17
1.086ArgGln: 1.086 ± 0.232
1.683ArgArg: 1.683 ± 0.324
1.194ArgSer: 1.194 ± 0.227
1.683ArgThr: 1.683 ± 0.369
2.226ArgVal: 2.226 ± 0.354
0.434ArgTrp: 0.434 ± 0.131
1.846ArgTyr: 1.846 ± 0.363
0.0ArgXaa: 0.0 ± 0.0
Ser
3.474SerAla: 3.474 ± 0.784
0.814SerCys: 0.814 ± 0.209
2.931SerAsp: 2.931 ± 0.376
2.443SerGlu: 2.443 ± 0.381
2.063SerPhe: 2.063 ± 0.323
4.56SerGly: 4.56 ± 0.709
0.543SerHis: 0.543 ± 0.161
4.614SerIle: 4.614 ± 0.56
4.614SerLys: 4.614 ± 0.451
4.288SerLeu: 4.288 ± 0.482
1.52SerMet: 1.52 ± 0.294
3.746SerAsn: 3.746 ± 0.35
1.574SerPro: 1.574 ± 0.327
1.52SerGln: 1.52 ± 0.315
1.683SerArg: 1.683 ± 0.268
2.66SerSer: 2.66 ± 0.462
3.474SerThr: 3.474 ± 0.56
3.528SerVal: 3.528 ± 0.351
0.38SerTrp: 0.38 ± 0.129
2.66SerTyr: 2.66 ± 0.389
0.0SerXaa: 0.0 ± 0.0
Thr
4.343ThrAla: 4.343 ± 0.681
0.434ThrCys: 0.434 ± 0.197
4.126ThrAsp: 4.126 ± 0.757
3.8ThrGlu: 3.8 ± 0.403
2.823ThrPhe: 2.823 ± 0.377
6.134ThrGly: 6.134 ± 0.701
0.76ThrHis: 0.76 ± 0.226
5.374ThrIle: 5.374 ± 0.74
5.265ThrLys: 5.265 ± 0.526
5.645ThrLeu: 5.645 ± 0.514
1.303ThrMet: 1.303 ± 0.218
4.288ThrAsn: 4.288 ± 0.534
1.954ThrPro: 1.954 ± 0.303
1.357ThrGln: 1.357 ± 0.25
1.683ThrArg: 1.683 ± 0.301
2.986ThrSer: 2.986 ± 0.427
5.917ThrThr: 5.917 ± 1.055
5.157ThrVal: 5.157 ± 0.633
0.543ThrTrp: 0.543 ± 0.176
1.52ThrTyr: 1.52 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
5.428ValAla: 5.428 ± 0.598
1.086ValCys: 1.086 ± 0.296
4.343ValAsp: 4.343 ± 0.524
3.366ValGlu: 3.366 ± 0.43
1.846ValPhe: 1.846 ± 0.344
3.203ValGly: 3.203 ± 0.562
1.194ValHis: 1.194 ± 0.28
3.908ValIle: 3.908 ± 0.586
6.134ValLys: 6.134 ± 0.785
5.428ValLeu: 5.428 ± 0.564
1.411ValMet: 1.411 ± 0.318
4.831ValAsn: 4.831 ± 0.503
2.877ValPro: 2.877 ± 0.453
2.443ValGln: 2.443 ± 0.331
1.628ValArg: 1.628 ± 0.299
2.551ValSer: 2.551 ± 0.425
4.288ValThr: 4.288 ± 0.451
5.32ValVal: 5.32 ± 0.534
0.543ValTrp: 0.543 ± 0.204
2.768ValTyr: 2.768 ± 0.372
0.0ValXaa: 0.0 ± 0.0
Trp
0.597TrpAla: 0.597 ± 0.191
0.109TrpCys: 0.109 ± 0.067
0.923TrpAsp: 0.923 ± 0.201
0.76TrpGlu: 0.76 ± 0.218
0.489TrpPhe: 0.489 ± 0.171
0.271TrpGly: 0.271 ± 0.116
0.217TrpHis: 0.217 ± 0.127
0.543TrpIle: 0.543 ± 0.179
0.489TrpLys: 0.489 ± 0.183
1.14TrpLeu: 1.14 ± 0.314
0.271TrpMet: 0.271 ± 0.115
0.977TrpAsn: 0.977 ± 0.397
0.0TrpPro: 0.0 ± 0.0
0.38TrpGln: 0.38 ± 0.154
0.217TrpArg: 0.217 ± 0.119
0.651TrpSer: 0.651 ± 0.173
0.597TrpThr: 0.597 ± 0.148
0.271TrpVal: 0.271 ± 0.138
0.163TrpTrp: 0.163 ± 0.093
0.489TrpTyr: 0.489 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.443TyrAla: 2.443 ± 0.3
0.923TyrCys: 0.923 ± 0.277
2.714TyrAsp: 2.714 ± 0.321
2.551TyrGlu: 2.551 ± 0.388
1.628TyrPhe: 1.628 ± 0.278
2.66TyrGly: 2.66 ± 0.36
0.814TyrHis: 0.814 ± 0.204
3.528TyrIle: 3.528 ± 0.428
4.234TyrLys: 4.234 ± 0.598
4.234TyrLeu: 4.234 ± 0.452
1.303TyrMet: 1.303 ± 0.307
3.637TyrAsn: 3.637 ± 0.565
1.194TyrPro: 1.194 ± 0.295
0.76TyrGln: 0.76 ± 0.198
1.954TyrArg: 1.954 ± 0.298
2.606TyrSer: 2.606 ± 0.366
3.42TyrThr: 3.42 ± 0.505
2.714TyrVal: 2.714 ± 0.53
0.597TyrTrp: 0.597 ± 0.167
2.388TyrTyr: 2.388 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (18423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski