Amino acid dipepetide frequency for Cellulophaga phage phi10:1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.663AlaAla: 2.663 ± 0.688
0.545AlaCys: 0.545 ± 0.17
2.179AlaAsp: 2.179 ± 0.387
3.934AlaGlu: 3.934 ± 0.958
2.36AlaPhe: 2.36 ± 0.302
2.36AlaGly: 2.36 ± 0.409
0.666AlaHis: 0.666 ± 0.204
4.539AlaIle: 4.539 ± 0.591
3.692AlaLys: 3.692 ± 0.675
5.084AlaLeu: 5.084 ± 0.714
1.089AlaMet: 1.089 ± 0.253
3.208AlaAsn: 3.208 ± 0.496
1.15AlaPro: 1.15 ± 0.299
1.513AlaGln: 1.513 ± 0.316
2.118AlaArg: 2.118 ± 0.306
3.934AlaSer: 3.934 ± 0.484
2.966AlaThr: 2.966 ± 0.417
3.268AlaVal: 3.268 ± 0.478
0.303AlaTrp: 0.303 ± 0.115
2.058AlaTyr: 2.058 ± 0.399
0.0AlaXaa: 0.0 ± 0.0
Cys
0.605CysAla: 0.605 ± 0.192
0.242CysCys: 0.242 ± 0.145
0.908CysAsp: 0.908 ± 0.236
0.847CysGlu: 0.847 ± 0.189
0.726CysPhe: 0.726 ± 0.191
1.15CysGly: 1.15 ± 0.218
0.121CysHis: 0.121 ± 0.088
0.726CysIle: 0.726 ± 0.202
1.089CysLys: 1.089 ± 0.23
0.666CysLeu: 0.666 ± 0.179
0.303CysMet: 0.303 ± 0.121
1.089CysAsn: 1.089 ± 0.3
0.182CysPro: 0.182 ± 0.1
0.424CysGln: 0.424 ± 0.173
0.787CysArg: 0.787 ± 0.226
0.908CysSer: 0.908 ± 0.251
0.484CysThr: 0.484 ± 0.198
0.605CysVal: 0.605 ± 0.189
0.0CysTrp: 0.0 ± 0.0
0.726CysTyr: 0.726 ± 0.216
0.0CysXaa: 0.0 ± 0.0
Asp
3.752AspAla: 3.752 ± 0.578
0.908AspCys: 0.908 ± 0.237
4.115AspAsp: 4.115 ± 0.509
4.721AspGlu: 4.721 ± 0.607
4.358AspPhe: 4.358 ± 0.528
4.6AspGly: 4.6 ± 0.676
1.029AspHis: 1.029 ± 0.206
3.873AspIle: 3.873 ± 0.518
5.205AspLys: 5.205 ± 0.479
6.173AspLeu: 6.173 ± 0.673
1.453AspMet: 1.453 ± 0.301
4.721AspAsn: 4.721 ± 0.44
1.634AspPro: 1.634 ± 0.272
0.787AspGln: 0.787 ± 0.203
1.876AspArg: 1.876 ± 0.356
3.692AspSer: 3.692 ± 0.546
3.389AspThr: 3.389 ± 0.395
4.358AspVal: 4.358 ± 0.499
0.968AspTrp: 0.968 ± 0.275
3.389AspTyr: 3.389 ± 0.472
0.0AspXaa: 0.0 ± 0.0
Glu
3.51GluAla: 3.51 ± 0.741
0.787GluCys: 0.787 ± 0.238
3.873GluAsp: 3.873 ± 0.687
5.75GluGlu: 5.75 ± 0.735
3.813GluPhe: 3.813 ± 0.464
4.176GluGly: 4.176 ± 0.607
0.847GluHis: 0.847 ± 0.211
6.476GluIle: 6.476 ± 0.648
6.355GluLys: 6.355 ± 0.946
8.352GluLeu: 8.352 ± 0.806
1.634GluMet: 1.634 ± 0.353
4.479GluAsn: 4.479 ± 0.536
1.876GluPro: 1.876 ± 0.313
2.784GluGln: 2.784 ± 0.474
2.723GluArg: 2.723 ± 0.645
4.6GluSer: 4.6 ± 0.557
4.539GluThr: 4.539 ± 0.481
3.873GluVal: 3.873 ± 0.494
1.271GluTrp: 1.271 ± 0.319
2.905GluTyr: 2.905 ± 0.484
0.0GluXaa: 0.0 ± 0.0
Phe
2.118PheAla: 2.118 ± 0.361
0.605PheCys: 0.605 ± 0.198
3.208PheAsp: 3.208 ± 0.395
3.45PheGlu: 3.45 ± 0.396
1.755PhePhe: 1.755 ± 0.385
2.602PheGly: 2.602 ± 0.394
0.908PheHis: 0.908 ± 0.209
3.389PheIle: 3.389 ± 0.432
4.176PheLys: 4.176 ± 0.552
4.115PheLeu: 4.115 ± 0.49
1.089PheMet: 1.089 ± 0.284
3.692PheAsn: 3.692 ± 0.484
1.21PhePro: 1.21 ± 0.25
1.271PheGln: 1.271 ± 0.245
1.937PheArg: 1.937 ± 0.381
2.784PheSer: 2.784 ± 0.444
3.147PheThr: 3.147 ± 0.44
2.663PheVal: 2.663 ± 0.38
0.424PheTrp: 0.424 ± 0.228
2.058PheTyr: 2.058 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
2.36GlyAla: 2.36 ± 0.409
0.787GlyCys: 0.787 ± 0.21
3.752GlyAsp: 3.752 ± 0.487
4.055GlyGlu: 4.055 ± 0.489
2.36GlyPhe: 2.36 ± 0.367
3.752GlyGly: 3.752 ± 0.459
0.787GlyHis: 0.787 ± 0.285
3.873GlyIle: 3.873 ± 0.475
4.237GlyLys: 4.237 ± 0.548
5.144GlyLeu: 5.144 ± 0.521
0.968GlyMet: 0.968 ± 0.211
3.389GlyAsn: 3.389 ± 0.613
0.0GlyPro: 0.0 ± 0.0
1.15GlyGln: 1.15 ± 0.279
2.058GlyArg: 2.058 ± 0.433
4.539GlySer: 4.539 ± 0.614
3.994GlyThr: 3.994 ± 0.733
4.358GlyVal: 4.358 ± 0.54
1.392GlyTrp: 1.392 ± 0.285
3.087GlyTyr: 3.087 ± 0.371
0.0GlyXaa: 0.0 ± 0.0
His
0.666HisAla: 0.666 ± 0.219
0.182HisCys: 0.182 ± 0.112
0.545HisAsp: 0.545 ± 0.171
1.029HisGlu: 1.029 ± 0.286
1.029HisPhe: 1.029 ± 0.304
0.666HisGly: 0.666 ± 0.186
0.545HisHis: 0.545 ± 0.212
1.392HisIle: 1.392 ± 0.264
1.15HisLys: 1.15 ± 0.285
1.21HisLeu: 1.21 ± 0.276
0.242HisMet: 0.242 ± 0.111
0.666HisAsn: 0.666 ± 0.221
0.787HisPro: 0.787 ± 0.205
0.726HisGln: 0.726 ± 0.199
0.303HisArg: 0.303 ± 0.117
0.968HisSer: 0.968 ± 0.25
0.787HisThr: 0.787 ± 0.189
0.666HisVal: 0.666 ± 0.207
0.182HisTrp: 0.182 ± 0.095
0.908HisTyr: 0.908 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
3.873IleAla: 3.873 ± 0.492
1.089IleCys: 1.089 ± 0.176
6.113IleAsp: 6.113 ± 0.629
6.173IleGlu: 6.173 ± 0.676
2.845IlePhe: 2.845 ± 0.335
4.237IleGly: 4.237 ± 0.458
1.271IleHis: 1.271 ± 0.257
3.813IleIle: 3.813 ± 0.612
7.868IleLys: 7.868 ± 0.826
6.96IleLeu: 6.96 ± 0.668
2.421IleMet: 2.421 ± 0.38
6.718IleAsn: 6.718 ± 0.968
2.3IlePro: 2.3 ± 0.358
2.663IleGln: 2.663 ± 0.447
3.087IleArg: 3.087 ± 0.404
5.992IleSer: 5.992 ± 0.671
4.237IleThr: 4.237 ± 0.607
4.237IleVal: 4.237 ± 0.487
0.726IleTrp: 0.726 ± 0.189
2.966IleTyr: 2.966 ± 0.408
0.0IleXaa: 0.0 ± 0.0
Lys
5.144LysAla: 5.144 ± 1.08
1.574LysCys: 1.574 ± 0.33
5.931LysAsp: 5.931 ± 0.701
9.441LysGlu: 9.441 ± 1.184
2.663LysPhe: 2.663 ± 0.404
4.539LysGly: 4.539 ± 0.62
1.392LysHis: 1.392 ± 0.324
8.17LysIle: 8.17 ± 0.592
7.505LysLys: 7.505 ± 0.834
7.505LysLeu: 7.505 ± 0.801
2.421LysMet: 2.421 ± 0.338
4.963LysAsn: 4.963 ± 0.63
2.542LysPro: 2.542 ± 0.428
3.208LysGln: 3.208 ± 0.403
3.026LysArg: 3.026 ± 0.38
4.479LysSer: 4.479 ± 0.477
5.992LysThr: 5.992 ± 0.515
4.902LysVal: 4.902 ± 0.58
1.15LysTrp: 1.15 ± 0.283
4.297LysTyr: 4.297 ± 0.541
0.0LysXaa: 0.0 ± 0.0
Leu
4.176LeuAla: 4.176 ± 0.546
0.484LeuCys: 0.484 ± 0.21
6.778LeuAsp: 6.778 ± 0.619
7.081LeuGlu: 7.081 ± 0.847
3.692LeuPhe: 3.692 ± 0.406
4.115LeuGly: 4.115 ± 0.434
0.968LeuHis: 0.968 ± 0.277
7.263LeuIle: 7.263 ± 0.71
8.957LeuLys: 8.957 ± 0.831
6.657LeuLeu: 6.657 ± 0.788
1.755LeuMet: 1.755 ± 0.364
4.902LeuAsn: 4.902 ± 0.631
3.026LeuPro: 3.026 ± 0.412
1.876LeuGln: 1.876 ± 0.326
3.873LeuArg: 3.873 ± 0.537
7.202LeuSer: 7.202 ± 0.665
6.355LeuThr: 6.355 ± 0.702
5.144LeuVal: 5.144 ± 0.752
0.484LeuTrp: 0.484 ± 0.158
3.026LeuTyr: 3.026 ± 0.421
0.0LeuXaa: 0.0 ± 0.0
Met
1.331MetAla: 1.331 ± 0.282
0.424MetCys: 0.424 ± 0.151
1.453MetAsp: 1.453 ± 0.332
1.937MetGlu: 1.937 ± 0.335
0.545MetPhe: 0.545 ± 0.138
0.968MetGly: 0.968 ± 0.229
0.182MetHis: 0.182 ± 0.091
1.997MetIle: 1.997 ± 0.331
2.723MetLys: 2.723 ± 0.36
1.15MetLeu: 1.15 ± 0.257
0.605MetMet: 0.605 ± 0.199
1.997MetAsn: 1.997 ± 0.363
0.908MetPro: 0.908 ± 0.202
0.847MetGln: 0.847 ± 0.243
0.605MetArg: 0.605 ± 0.187
1.21MetSer: 1.21 ± 0.286
1.089MetThr: 1.089 ± 0.236
0.908MetVal: 0.908 ± 0.263
0.303MetTrp: 0.303 ± 0.148
0.666MetTyr: 0.666 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
3.51AsnAla: 3.51 ± 0.45
0.726AsnCys: 0.726 ± 0.182
4.176AsnAsp: 4.176 ± 0.479
4.418AsnGlu: 4.418 ± 0.511
2.845AsnPhe: 2.845 ± 0.405
4.297AsnGly: 4.297 ± 0.478
1.029AsnHis: 1.029 ± 0.249
6.294AsnIle: 6.294 ± 0.72
7.202AsnLys: 7.202 ± 0.697
4.902AsnLeu: 4.902 ± 0.464
2.3AsnMet: 2.3 ± 0.442
5.386AsnAsn: 5.386 ± 0.832
1.997AsnPro: 1.997 ± 0.29
1.755AsnGln: 1.755 ± 0.306
2.118AsnArg: 2.118 ± 0.378
3.268AsnSer: 3.268 ± 0.359
4.418AsnThr: 4.418 ± 0.521
3.208AsnVal: 3.208 ± 0.531
1.331AsnTrp: 1.331 ± 0.262
2.723AsnTyr: 2.723 ± 0.444
0.0AsnXaa: 0.0 ± 0.0
Pro
0.787ProAla: 0.787 ± 0.168
0.424ProCys: 0.424 ± 0.183
1.695ProAsp: 1.695 ± 0.346
1.513ProGlu: 1.513 ± 0.263
1.634ProPhe: 1.634 ± 0.299
0.363ProGly: 0.363 ± 0.132
0.424ProHis: 0.424 ± 0.193
2.36ProIle: 2.36 ± 0.344
2.602ProLys: 2.602 ± 0.479
2.542ProLeu: 2.542 ± 0.443
0.545ProMet: 0.545 ± 0.182
1.513ProAsn: 1.513 ± 0.348
0.605ProPro: 0.605 ± 0.213
0.787ProGln: 0.787 ± 0.187
0.545ProArg: 0.545 ± 0.178
2.421ProSer: 2.421 ± 0.395
2.179ProThr: 2.179 ± 0.412
1.331ProVal: 1.331 ± 0.279
0.121ProTrp: 0.121 ± 0.085
1.755ProTyr: 1.755 ± 0.377
0.0ProXaa: 0.0 ± 0.0
Gln
1.634GlnAla: 1.634 ± 0.33
0.182GlnCys: 0.182 ± 0.093
1.695GlnAsp: 1.695 ± 0.26
2.058GlnGlu: 2.058 ± 0.414
0.968GlnPhe: 0.968 ± 0.231
1.21GlnGly: 1.21 ± 0.293
0.726GlnHis: 0.726 ± 0.213
2.421GlnIle: 2.421 ± 0.355
2.239GlnLys: 2.239 ± 0.348
2.36GlnLeu: 2.36 ± 0.408
0.545GlnMet: 0.545 ± 0.159
2.179GlnAsn: 2.179 ± 0.4
0.726GlnPro: 0.726 ± 0.171
1.029GlnGln: 1.029 ± 0.28
0.908GlnArg: 0.908 ± 0.275
2.3GlnSer: 2.3 ± 0.341
2.542GlnThr: 2.542 ± 0.335
1.331GlnVal: 1.331 ± 0.244
0.121GlnTrp: 0.121 ± 0.09
0.908GlnTyr: 0.908 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
1.029ArgAla: 1.029 ± 0.247
0.484ArgCys: 0.484 ± 0.173
2.118ArgAsp: 2.118 ± 0.379
1.695ArgGlu: 1.695 ± 0.352
2.118ArgPhe: 2.118 ± 0.302
1.755ArgGly: 1.755 ± 0.314
0.303ArgHis: 0.303 ± 0.128
3.329ArgIle: 3.329 ± 0.454
3.692ArgLys: 3.692 ± 0.608
3.51ArgLeu: 3.51 ± 0.514
0.968ArgMet: 0.968 ± 0.263
3.026ArgAsn: 3.026 ± 0.421
1.029ArgPro: 1.029 ± 0.292
1.21ArgGln: 1.21 ± 0.237
1.816ArgArg: 1.816 ± 0.33
1.089ArgSer: 1.089 ± 0.221
2.118ArgThr: 2.118 ± 0.345
1.997ArgVal: 1.997 ± 0.359
0.484ArgTrp: 0.484 ± 0.159
1.937ArgTyr: 1.937 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
3.389SerAla: 3.389 ± 0.574
0.847SerCys: 0.847 ± 0.249
4.297SerAsp: 4.297 ± 0.451
4.418SerGlu: 4.418 ± 0.465
3.873SerPhe: 3.873 ± 0.476
4.902SerGly: 4.902 ± 0.844
0.908SerHis: 0.908 ± 0.251
5.507SerIle: 5.507 ± 0.609
5.81SerLys: 5.81 ± 0.574
5.144SerLeu: 5.144 ± 0.638
0.666SerMet: 0.666 ± 0.192
4.721SerAsn: 4.721 ± 0.62
1.331SerPro: 1.331 ± 0.275
1.21SerGln: 1.21 ± 0.343
2.179SerArg: 2.179 ± 0.359
4.902SerSer: 4.902 ± 0.922
5.144SerThr: 5.144 ± 0.636
3.813SerVal: 3.813 ± 0.506
0.545SerTrp: 0.545 ± 0.162
2.905SerTyr: 2.905 ± 0.381
0.0SerXaa: 0.0 ± 0.0
Thr
3.934ThrAla: 3.934 ± 0.51
0.605ThrCys: 0.605 ± 0.184
4.176ThrAsp: 4.176 ± 0.59
4.842ThrGlu: 4.842 ± 0.54
2.845ThrPhe: 2.845 ± 0.411
4.418ThrGly: 4.418 ± 0.714
0.787ThrHis: 0.787 ± 0.21
5.931ThrIle: 5.931 ± 0.688
5.084ThrLys: 5.084 ± 0.507
5.931ThrLeu: 5.931 ± 0.717
0.605ThrMet: 0.605 ± 0.23
3.571ThrAsn: 3.571 ± 0.46
1.997ThrPro: 1.997 ± 0.379
1.695ThrGln: 1.695 ± 0.275
1.695ThrArg: 1.695 ± 0.324
4.6ThrSer: 4.6 ± 0.586
4.721ThrThr: 4.721 ± 0.807
3.268ThrVal: 3.268 ± 0.468
0.545ThrTrp: 0.545 ± 0.215
2.602ThrTyr: 2.602 ± 0.447
0.0ThrXaa: 0.0 ± 0.0
Val
2.481ValAla: 2.481 ± 0.428
1.029ValCys: 1.029 ± 0.265
4.115ValAsp: 4.115 ± 0.502
3.752ValGlu: 3.752 ± 0.496
3.147ValPhe: 3.147 ± 0.419
2.784ValGly: 2.784 ± 0.372
0.726ValHis: 0.726 ± 0.224
4.297ValIle: 4.297 ± 0.528
5.507ValLys: 5.507 ± 0.58
4.479ValLeu: 4.479 ± 0.452
0.726ValMet: 0.726 ± 0.207
3.692ValAsn: 3.692 ± 0.488
1.634ValPro: 1.634 ± 0.276
1.513ValGln: 1.513 ± 0.268
2.118ValArg: 2.118 ± 0.312
4.176ValSer: 4.176 ± 0.519
3.268ValThr: 3.268 ± 0.388
3.51ValVal: 3.51 ± 0.372
0.666ValTrp: 0.666 ± 0.211
2.481ValTyr: 2.481 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.605TrpAla: 0.605 ± 0.229
0.121TrpCys: 0.121 ± 0.095
0.666TrpAsp: 0.666 ± 0.204
0.787TrpGlu: 0.787 ± 0.226
0.484TrpPhe: 0.484 ± 0.231
0.424TrpGly: 0.424 ± 0.173
0.363TrpHis: 0.363 ± 0.146
0.908TrpIle: 0.908 ± 0.256
1.089TrpLys: 1.089 ± 0.239
1.15TrpLeu: 1.15 ± 0.281
0.424TrpMet: 0.424 ± 0.141
1.15TrpAsn: 1.15 ± 0.28
0.0TrpPro: 0.0 ± 0.0
0.726TrpGln: 0.726 ± 0.18
0.424TrpArg: 0.424 ± 0.18
0.666TrpSer: 0.666 ± 0.219
0.484TrpThr: 0.484 ± 0.165
0.605TrpVal: 0.605 ± 0.19
0.303TrpTrp: 0.303 ± 0.119
0.545TrpTyr: 0.545 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.058TyrAla: 2.058 ± 0.392
0.545TyrCys: 0.545 ± 0.169
3.208TyrAsp: 3.208 ± 0.471
2.723TyrGlu: 2.723 ± 0.411
2.542TyrPhe: 2.542 ± 0.399
2.723TyrGly: 2.723 ± 0.332
0.666TyrHis: 0.666 ± 0.193
3.026TyrIle: 3.026 ± 0.444
4.66TyrLys: 4.66 ± 0.505
4.781TyrLeu: 4.781 ± 0.503
1.029TyrMet: 1.029 ± 0.202
2.845TyrAsn: 2.845 ± 0.408
1.21TyrPro: 1.21 ± 0.292
0.908TyrGln: 0.908 ± 0.235
1.513TyrArg: 1.513 ± 0.318
2.784TyrSer: 2.784 ± 0.448
1.997TyrThr: 1.997 ± 0.376
2.118TyrVal: 2.118 ± 0.364
0.605TyrTrp: 0.605 ± 0.188
2.602TyrTyr: 2.602 ± 0.374
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (16524 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski