Amino acid dipeptide frequency for Yersinia phage phiR8-01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
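As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping adjacent pairs in a list of protein sequences and reports them in per mille. It assumes one-letter amino acid codes (the table uses three-letter codes), and the function name dipeptide_permille and the toy sequences are hypothetical; the exact counting procedure used by the database is not stated here.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input for illustration only, not taken from the phiR8-01 proteome.
freqs = dipeptide_permille(["AAC", "ACA"])
for dipeptide, value in sorted(freqs.items()):
    print(dipeptide, value)
```

Counting within each protein separately avoids creating artificial dipeptides at the boundary between two unrelated sequences.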

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
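The arithmetic behind the expected value can be sketched as follows. The page does not state the exact colouring rule, so a plain comparison against the uniform expectation is an assumption here, and the helper name classify is hypothetical. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 20 * 20 = 400 ordered pairs, so 1/400 = 0.25% = 2.5 per mille each.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 11.6, "CysCys": 0.229, "AlaPhe": 2.519}
labels = {dp: classify(value) for dp, value in examples.items()}

# Per mille to fraction: multiply by 1e-3.
ala_ala_fraction = examples["AlaAla"] * 1e-3

print(expected_permille, labels, ala_ala_fraction)
```

Note that this baseline ignores the uneven abundance of single amino acids; a stricter expectation would multiply the two single-residue frequencies, which this sketch does not attempt.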

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.6 ± 1.581
AlaCys: 0.992 ± 0.38
AlaAsp: 5.419 ± 0.732
AlaGlu: 5.266 ± 0.781
AlaPhe: 2.519 ± 0.577
AlaGly: 6.792 ± 1.018
AlaHis: 1.908 ± 0.426
AlaIle: 3.053 ± 0.502
AlaLys: 6.258 ± 0.889
AlaLeu: 8.166 ± 0.969
AlaMet: 3.969 ± 0.639
AlaAsn: 3.74 ± 0.546
AlaPro: 3.053 ± 0.509
AlaGln: 4.655 ± 0.913
AlaArg: 6.334 ± 0.849
AlaSer: 4.045 ± 0.544
AlaThr: 5.266 ± 0.645
AlaVal: 7.021 ± 0.64
AlaTrp: 2.137 ± 0.407
AlaTyr: 2.824 ± 0.491
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.611 ± 0.233
CysCys: 0.229 ± 0.133
CysAsp: 0.534 ± 0.221
CysGlu: 0.458 ± 0.246
CysPhe: 0.458 ± 0.163
CysGly: 0.687 ± 0.224
CysHis: 0.305 ± 0.154
CysIle: 0.305 ± 0.142
CysLys: 0.076 ± 0.088
CysLeu: 0.611 ± 0.239
CysMet: 0.611 ± 0.175
CysAsn: 0.382 ± 0.152
CysPro: 0.382 ± 0.188
CysGln: 0.458 ± 0.199
CysArg: 0.763 ± 0.273
CysSer: 0.534 ± 0.2
CysThr: 0.763 ± 0.301
CysVal: 1.068 ± 0.337
CysTrp: 0.382 ± 0.191
CysTyr: 0.458 ± 0.204
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.655 ± 0.672
AspCys: 1.068 ± 0.263
AspAsp: 4.426 ± 0.581
AspGlu: 3.434 ± 0.628
AspPhe: 2.976 ± 0.477
AspGly: 5.953 ± 0.774
AspHis: 0.84 ± 0.307
AspIle: 3.053 ± 0.593
AspLys: 3.816 ± 0.468
AspLeu: 4.732 ± 0.893
AspMet: 2.213 ± 0.391
AspAsn: 2.9 ± 0.383
AspPro: 2.519 ± 0.392
AspGln: 1.603 ± 0.407
AspArg: 3.053 ± 0.422
AspSer: 4.961 ± 0.651
AspThr: 2.976 ± 0.454
AspVal: 4.579 ± 0.548
AspTrp: 1.145 ± 0.389
AspTyr: 2.366 ± 0.466
AspXaa: 0.076 ± 0.071
Glu
GluAla: 6.411 ± 0.68
GluCys: 0.687 ± 0.274
GluAsp: 3.511 ± 0.531
GluGlu: 4.961 ± 0.68
GluPhe: 2.519 ± 0.339
GluGly: 4.503 ± 0.633
GluHis: 1.297 ± 0.265
GluIle: 2.442 ± 0.408
GluLys: 3.587 ± 0.497
GluLeu: 5.037 ± 0.603
GluMet: 1.832 ± 0.429
GluAsn: 2.366 ± 0.51
GluPro: 1.908 ± 0.431
GluGln: 4.35 ± 0.718
GluArg: 2.9 ± 0.491
GluSer: 2.9 ± 0.47
GluThr: 3.205 ± 0.426
GluVal: 4.121 ± 0.499
GluTrp: 1.908 ± 0.553
GluTyr: 1.832 ± 0.368
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.213 ± 0.357
PheCys: 0.763 ± 0.278
PheAsp: 2.671 ± 0.466
PheGlu: 2.061 ± 0.437
PhePhe: 1.374 ± 0.375
PheGly: 2.747 ± 0.634
PheHis: 0.382 ± 0.195
PheIle: 2.366 ± 0.335
PheLys: 2.366 ± 0.428
PheLeu: 2.747 ± 0.354
PheMet: 1.679 ± 0.347
PheAsn: 2.061 ± 0.4
PhePro: 0.992 ± 0.301
PheGln: 1.603 ± 0.285
PheArg: 1.832 ± 0.363
PheSer: 2.671 ± 0.513
PheThr: 2.29 ± 0.375
PheVal: 2.213 ± 0.388
PheTrp: 0.458 ± 0.189
PheTyr: 1.221 ± 0.351
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.563 ± 1.344
GlyCys: 0.763 ± 0.285
GlyAsp: 6.029 ± 0.786
GlyGlu: 4.045 ± 0.417
GlyPhe: 2.442 ± 0.346
GlyGly: 5.953 ± 0.764
GlyHis: 1.374 ± 0.348
GlyIle: 3.816 ± 0.52
GlyLys: 5.8 ± 0.662
GlyLeu: 6.029 ± 0.853
GlyMet: 2.595 ± 0.464
GlyAsn: 3.282 ± 0.605
GlyPro: 1.832 ± 0.439
GlyGln: 2.9 ± 0.502
GlyArg: 5.113 ± 0.692
GlySer: 4.503 ± 0.639
GlyThr: 4.045 ± 0.483
GlyVal: 4.808 ± 0.724
GlyTrp: 1.068 ± 0.33
GlyTyr: 2.824 ± 0.611
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.45 ± 0.317
HisCys: 0.229 ± 0.13
HisAsp: 1.679 ± 0.301
HisGlu: 0.916 ± 0.28
HisPhe: 0.611 ± 0.272
HisGly: 1.45 ± 0.391
HisHis: 0.458 ± 0.22
HisIle: 0.916 ± 0.305
HisLys: 1.068 ± 0.193
HisLeu: 1.221 ± 0.292
HisMet: 0.992 ± 0.264
HisAsn: 1.068 ± 0.294
HisPro: 1.221 ± 0.331
HisGln: 0.611 ± 0.198
HisArg: 1.297 ± 0.396
HisSer: 0.763 ± 0.223
HisThr: 0.534 ± 0.242
HisVal: 1.221 ± 0.285
HisTrp: 0.382 ± 0.129
HisTyr: 0.992 ± 0.281
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.045 ± 0.613
IleCys: 0.305 ± 0.151
IleAsp: 2.824 ± 0.439
IleGlu: 2.442 ± 0.512
IlePhe: 1.374 ± 0.308
IleGly: 4.045 ± 0.528
IleHis: 0.992 ± 0.294
IleIle: 2.9 ± 0.477
IleLys: 3.205 ± 0.569
IleLeu: 2.976 ± 0.368
IleMet: 1.374 ± 0.254
IleAsn: 2.595 ± 0.372
IlePro: 1.755 ± 0.325
IleGln: 1.603 ± 0.368
IleArg: 2.976 ± 0.533
IleSer: 2.595 ± 0.362
IleThr: 2.824 ± 0.554
IleVal: 2.366 ± 0.569
IleTrp: 0.534 ± 0.196
IleTyr: 1.374 ± 0.345
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.258 ± 0.734
LysCys: 0.229 ± 0.129
LysAsp: 3.74 ± 0.611
LysGlu: 4.121 ± 0.594
LysPhe: 2.137 ± 0.4
LysGly: 4.35 ± 0.46
LysHis: 1.45 ± 0.303
LysIle: 2.29 ± 0.496
LysLys: 3.969 ± 0.495
LysLeu: 5.877 ± 0.786
LysMet: 1.908 ± 0.391
LysAsn: 1.755 ± 0.355
LysPro: 3.663 ± 0.601
LysGln: 2.747 ± 0.393
LysArg: 3.816 ± 0.5
LysSer: 2.595 ± 0.403
LysThr: 3.892 ± 0.506
LysVal: 3.74 ± 0.434
LysTrp: 0.916 ± 0.237
LysTyr: 1.984 ± 0.281
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.853 ± 1.151
LeuCys: 0.916 ± 0.318
LeuAsp: 5.648 ± 0.463
LeuGlu: 4.579 ± 0.66
LeuPhe: 2.213 ± 0.406
LeuGly: 5.266 ± 0.735
LeuHis: 1.374 ± 0.297
LeuIle: 3.282 ± 0.579
LeuLys: 4.579 ± 0.487
LeuLeu: 5.724 ± 0.691
LeuMet: 2.9 ± 0.548
LeuAsn: 4.274 ± 0.584
LeuPro: 4.045 ± 0.502
LeuGln: 3.358 ± 0.638
LeuArg: 4.732 ± 0.69
LeuSer: 5.648 ± 0.785
LeuThr: 5.724 ± 0.653
LeuVal: 5.342 ± 0.693
LeuTrp: 0.534 ± 0.192
LeuTyr: 2.519 ± 0.417
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.426 ± 0.662
MetCys: 0.076 ± 0.071
MetAsp: 2.137 ± 0.414
MetGlu: 2.137 ± 0.435
MetPhe: 1.832 ± 0.33
MetGly: 1.603 ± 0.537
MetHis: 0.84 ± 0.258
MetIle: 0.992 ± 0.257
MetLys: 1.45 ± 0.387
MetLeu: 3.587 ± 0.535
MetMet: 0.611 ± 0.289
MetAsn: 1.374 ± 0.273
MetPro: 1.45 ± 0.279
MetGln: 1.603 ± 0.346
MetArg: 2.137 ± 0.356
MetSer: 1.755 ± 0.422
MetThr: 1.908 ± 0.346
MetVal: 1.908 ± 0.413
MetTrp: 0.687 ± 0.144
MetTyr: 1.145 ± 0.309
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.503 ± 0.715
AsnCys: 0.534 ± 0.22
AsnAsp: 1.984 ± 0.413
AsnGlu: 2.9 ± 0.561
AsnPhe: 2.137 ± 0.362
AsnGly: 3.129 ± 0.56
AsnHis: 0.992 ± 0.25
AsnIle: 2.213 ± 0.476
AsnLys: 3.74 ± 0.538
AsnLeu: 2.976 ± 0.438
AsnMet: 1.221 ± 0.268
AsnAsn: 2.29 ± 0.435
AsnPro: 2.061 ± 0.43
AsnGln: 2.061 ± 0.513
AsnArg: 2.9 ± 0.358
AsnSer: 2.061 ± 0.374
AsnThr: 2.824 ± 0.422
AsnVal: 2.29 ± 0.422
AsnTrp: 0.534 ± 0.322
AsnTyr: 1.297 ± 0.292
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.511 ± 0.812
ProCys: 0.382 ± 0.172
ProAsp: 2.442 ± 0.459
ProGlu: 4.121 ± 0.651
ProPhe: 1.45 ± 0.39
ProGly: 3.434 ± 0.589
ProHis: 0.611 ± 0.188
ProIle: 1.374 ± 0.359
ProLys: 2.137 ± 0.383
ProLeu: 3.663 ± 0.487
ProMet: 0.763 ± 0.263
ProAsn: 2.137 ± 0.438
ProPro: 1.221 ± 0.338
ProGln: 1.832 ± 0.396
ProArg: 1.526 ± 0.235
ProSer: 2.519 ± 0.37
ProThr: 1.755 ± 0.314
ProVal: 3.587 ± 0.622
ProTrp: 0.534 ± 0.195
ProTyr: 1.755 ± 0.374
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.579 ± 0.91
GlnCys: 0.534 ± 0.22
GlnAsp: 2.29 ± 0.368
GlnGlu: 2.519 ± 0.493
GlnPhe: 1.45 ± 0.363
GlnGly: 3.053 ± 0.679
GlnHis: 0.992 ± 0.242
GlnIle: 1.297 ± 0.317
GlnLys: 2.366 ± 0.473
GlnLeu: 3.282 ± 0.449
GlnMet: 2.595 ± 0.428
GlnAsn: 1.984 ± 0.41
GlnPro: 1.526 ± 0.367
GlnGln: 2.747 ± 0.569
GlnArg: 2.519 ± 0.354
GlnSer: 2.061 ± 0.489
GlnThr: 1.832 ± 0.305
GlnVal: 2.671 ± 0.395
GlnTrp: 0.611 ± 0.227
GlnTyr: 2.29 ± 0.439
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.8 ± 0.706
ArgCys: 0.534 ± 0.222
ArgAsp: 3.663 ± 0.466
ArgGlu: 3.358 ± 0.475
ArgPhe: 2.366 ± 0.454
ArgGly: 4.121 ± 0.638
ArgHis: 1.145 ± 0.328
ArgIle: 2.824 ± 0.507
ArgLys: 3.663 ± 0.539
ArgLeu: 5.419 ± 0.689
ArgMet: 1.755 ± 0.381
ArgAsn: 2.9 ± 0.437
ArgPro: 1.755 ± 0.419
ArgGln: 1.679 ± 0.351
ArgArg: 3.282 ± 0.421
ArgSer: 2.824 ± 0.408
ArgThr: 2.747 ± 0.456
ArgVal: 3.892 ± 0.603
ArgTrp: 1.526 ± 0.374
ArgTyr: 1.679 ± 0.352
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.579 ± 0.573
SerCys: 0.229 ± 0.123
SerAsp: 3.511 ± 0.55
SerGlu: 3.434 ± 0.675
SerPhe: 1.603 ± 0.412
SerGly: 4.732 ± 0.613
SerHis: 0.84 ± 0.24
SerIle: 3.053 ± 0.385
SerLys: 3.358 ± 0.543
SerLeu: 4.808 ± 0.798
SerMet: 1.832 ± 0.349
SerAsn: 2.137 ± 0.453
SerPro: 2.747 ± 0.506
SerGln: 2.595 ± 0.578
SerArg: 2.747 ± 0.437
SerSer: 2.9 ± 0.647
SerThr: 3.587 ± 0.586
SerVal: 4.274 ± 0.566
SerTrp: 0.916 ± 0.161
SerTyr: 2.366 ± 0.373
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.724 ± 1.013
ThrCys: 0.229 ± 0.166
ThrAsp: 3.816 ± 0.568
ThrGlu: 3.129 ± 0.649
ThrPhe: 1.908 ± 0.466
ThrGly: 5.19 ± 0.748
ThrHis: 1.297 ± 0.326
ThrIle: 2.519 ± 0.519
ThrLys: 3.129 ± 0.421
ThrLeu: 5.113 ± 0.528
ThrMet: 1.526 ± 0.287
ThrAsn: 1.984 ± 0.409
ThrPro: 2.824 ± 0.382
ThrGln: 1.526 ± 0.297
ThrArg: 3.205 ± 0.457
ThrSer: 3.663 ± 0.455
ThrThr: 3.053 ± 0.539
ThrVal: 4.579 ± 0.656
ThrTrp: 0.687 ± 0.242
ThrTyr: 1.679 ± 0.431
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.037 ± 0.66
ValCys: 0.763 ± 0.274
ValAsp: 3.587 ± 0.557
ValGlu: 4.884 ± 0.611
ValPhe: 2.519 ± 0.392
ValGly: 4.732 ± 0.634
ValHis: 1.145 ± 0.32
ValIle: 3.969 ± 0.735
ValLys: 4.274 ± 0.548
ValLeu: 4.961 ± 0.587
ValMet: 2.061 ± 0.502
ValAsn: 3.053 ± 0.414
ValPro: 3.816 ± 0.622
ValGln: 3.282 ± 0.432
ValArg: 3.358 ± 0.547
ValSer: 4.121 ± 0.685
ValThr: 4.732 ± 0.958
ValVal: 4.274 ± 0.739
ValTrp: 0.458 ± 0.163
ValTyr: 1.984 ± 0.42
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.297 ± 0.36
TrpCys: 0.153 ± 0.092
TrpAsp: 1.374 ± 0.48
TrpGlu: 1.068 ± 0.307
TrpPhe: 1.145 ± 0.188
TrpGly: 1.145 ± 0.47
TrpHis: 0.382 ± 0.227
TrpIle: 0.229 ± 0.13
TrpLys: 0.84 ± 0.228
TrpLeu: 1.984 ± 0.451
TrpMet: 0.153 ± 0.104
TrpAsn: 0.534 ± 0.27
TrpPro: 0.763 ± 0.238
TrpGln: 0.458 ± 0.257
TrpArg: 0.534 ± 0.166
TrpSer: 0.992 ± 0.409
TrpThr: 0.763 ± 0.224
TrpVal: 1.297 ± 0.342
TrpTrp: 0.534 ± 0.165
TrpTyr: 0.687 ± 0.283
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.9 ± 0.361
TyrCys: 0.458 ± 0.169
TyrAsp: 1.984 ± 0.395
TyrGlu: 2.442 ± 0.494
TyrPhe: 1.679 ± 0.391
TyrGly: 2.9 ± 0.396
TyrHis: 0.458 ± 0.161
TyrIle: 2.213 ± 0.413
TyrLys: 1.755 ± 0.325
TyrLeu: 2.595 ± 0.54
TyrMet: 0.992 ± 0.245
TyrAsn: 1.832 ± 0.372
TyrPro: 1.221 ± 0.347
TyrGln: 1.603 ± 0.306
TyrArg: 1.908 ± 0.37
TyrSer: 2.137 ± 0.311
TyrThr: 2.061 ± 0.361
TyrVal: 1.755 ± 0.38
TyrTrp: 0.382 ± 0.17
TyrTyr: 0.763 ± 0.312
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.076 ± 0.071
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13104 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
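A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates is reported. Using the sample standard deviation as the error is an assumption, as are the function names, the fixed seed, the one-letter codes, and the toy sequences; the source only states that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Draw len(proteins) whole proteins with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not the phiR8-01 proteome.
proteins = ["MAAKLAA", "MKCAAG", "MGLLAAK", "MSTAV"]
err = bootstrap_error(proteins, "AA")
print(f"AA: {dipeptide_permille(proteins, 'AA'):.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the within-protein composition intact, so the error reflects variation between proteins.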

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski