Amino acid dipepetide frequency for Vibrio phage douglas 12A4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.685AlaAla: 6.685 ± 0.878
0.658AlaCys: 0.658 ± 0.186
6.082AlaAsp: 6.082 ± 1.437
6.685AlaGlu: 6.685 ± 1.122
2.137AlaPhe: 2.137 ± 0.351
5.699AlaGly: 5.699 ± 0.769
1.096AlaHis: 1.096 ± 0.268
6.959AlaIle: 6.959 ± 0.782
5.534AlaLys: 5.534 ± 0.558
7.123AlaLeu: 7.123 ± 0.715
2.356AlaMet: 2.356 ± 0.331
2.575AlaAsn: 2.575 ± 0.354
2.027AlaPro: 2.027 ± 0.332
4.438AlaGln: 4.438 ± 0.788
4.0AlaArg: 4.0 ± 0.653
5.096AlaSer: 5.096 ± 0.506
4.822AlaThr: 4.822 ± 0.631
4.11AlaVal: 4.11 ± 0.482
0.877AlaTrp: 0.877 ± 0.203
2.685AlaTyr: 2.685 ± 0.337
0.0AlaXaa: 0.0 ± 0.0
Cys
0.603CysAla: 0.603 ± 0.185
0.219CysCys: 0.219 ± 0.128
0.658CysAsp: 0.658 ± 0.214
0.493CysGlu: 0.493 ± 0.183
0.329CysPhe: 0.329 ± 0.135
0.932CysGly: 0.932 ± 0.298
0.329CysHis: 0.329 ± 0.168
0.712CysIle: 0.712 ± 0.26
0.932CysLys: 0.932 ± 0.298
0.822CysLeu: 0.822 ± 0.246
0.055CysMet: 0.055 ± 0.051
0.493CysAsn: 0.493 ± 0.193
0.329CysPro: 0.329 ± 0.165
0.219CysGln: 0.219 ± 0.113
0.219CysArg: 0.219 ± 0.131
0.767CysSer: 0.767 ± 0.283
0.384CysThr: 0.384 ± 0.134
0.603CysVal: 0.603 ± 0.206
0.11CysTrp: 0.11 ± 0.073
0.548CysTyr: 0.548 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
5.753AspAla: 5.753 ± 0.584
0.877AspCys: 0.877 ± 0.266
4.603AspAsp: 4.603 ± 0.563
4.822AspGlu: 4.822 ± 0.523
2.74AspPhe: 2.74 ± 0.267
4.658AspGly: 4.658 ± 0.668
1.26AspHis: 1.26 ± 0.253
4.055AspIle: 4.055 ± 0.429
4.986AspLys: 4.986 ± 0.559
4.986AspLeu: 4.986 ± 0.435
2.027AspMet: 2.027 ± 0.357
3.288AspAsn: 3.288 ± 0.43
2.411AspPro: 2.411 ± 0.657
2.192AspGln: 2.192 ± 0.424
2.959AspArg: 2.959 ± 0.417
4.219AspSer: 4.219 ± 0.625
3.342AspThr: 3.342 ± 0.396
4.548AspVal: 4.548 ± 0.541
1.096AspTrp: 1.096 ± 0.231
2.466AspTyr: 2.466 ± 0.326
0.0AspXaa: 0.0 ± 0.0
Glu
6.521GluAla: 6.521 ± 0.745
0.603GluCys: 0.603 ± 0.221
3.726GluAsp: 3.726 ± 0.506
5.644GluGlu: 5.644 ± 0.75
2.356GluPhe: 2.356 ± 0.319
3.507GluGly: 3.507 ± 0.413
1.205GluHis: 1.205 ± 0.242
5.37GluIle: 5.37 ± 0.648
5.37GluLys: 5.37 ± 0.758
6.74GluLeu: 6.74 ± 0.833
2.192GluMet: 2.192 ± 0.333
3.068GluAsn: 3.068 ± 0.332
1.918GluPro: 1.918 ± 0.411
3.616GluGln: 3.616 ± 0.574
4.11GluArg: 4.11 ± 0.559
4.384GluSer: 4.384 ± 0.646
4.11GluThr: 4.11 ± 0.468
4.877GluVal: 4.877 ± 0.713
1.26GluTrp: 1.26 ± 0.243
1.753GluTyr: 1.753 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.685PheAla: 2.685 ± 0.338
0.548PheCys: 0.548 ± 0.208
2.575PheAsp: 2.575 ± 0.412
2.192PheGlu: 2.192 ± 0.348
1.205PhePhe: 1.205 ± 0.208
2.795PheGly: 2.795 ± 0.402
0.603PheHis: 0.603 ± 0.187
2.521PheIle: 2.521 ± 0.337
2.904PheLys: 2.904 ± 0.553
2.027PheLeu: 2.027 ± 0.366
0.822PheMet: 0.822 ± 0.195
2.082PheAsn: 2.082 ± 0.442
0.712PhePro: 0.712 ± 0.16
0.986PheGln: 0.986 ± 0.21
1.26PheArg: 1.26 ± 0.221
3.068PheSer: 3.068 ± 0.361
1.808PheThr: 1.808 ± 0.381
1.753PheVal: 1.753 ± 0.304
0.438PheTrp: 0.438 ± 0.168
1.644PheTyr: 1.644 ± 0.466
0.0PheXaa: 0.0 ± 0.0
Gly
5.37GlyAla: 5.37 ± 0.696
0.438GlyCys: 0.438 ± 0.203
5.151GlyAsp: 5.151 ± 0.507
4.548GlyGlu: 4.548 ± 0.436
2.959GlyPhe: 2.959 ± 0.435
5.753GlyGly: 5.753 ± 0.508
0.986GlyHis: 0.986 ± 0.192
3.726GlyIle: 3.726 ± 0.416
5.26GlyLys: 5.26 ± 0.526
4.822GlyLeu: 4.822 ± 0.606
1.534GlyMet: 1.534 ± 0.262
3.068GlyAsn: 3.068 ± 0.429
1.753GlyPro: 1.753 ± 0.363
1.699GlyGln: 1.699 ± 0.311
2.63GlyArg: 2.63 ± 0.377
3.89GlySer: 3.89 ± 0.685
4.712GlyThr: 4.712 ± 0.665
5.644GlyVal: 5.644 ± 0.641
0.712GlyTrp: 0.712 ± 0.197
2.466GlyTyr: 2.466 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
1.151HisAla: 1.151 ± 0.2
0.219HisCys: 0.219 ± 0.099
1.151HisAsp: 1.151 ± 0.316
1.151HisGlu: 1.151 ± 0.265
0.767HisPhe: 0.767 ± 0.223
1.26HisGly: 1.26 ± 0.313
0.438HisHis: 0.438 ± 0.188
0.877HisIle: 0.877 ± 0.223
1.096HisLys: 1.096 ± 0.218
1.479HisLeu: 1.479 ± 0.387
0.274HisMet: 0.274 ± 0.105
1.041HisAsn: 1.041 ± 0.299
0.658HisPro: 0.658 ± 0.213
0.658HisGln: 0.658 ± 0.163
0.329HisArg: 0.329 ± 0.118
0.986HisSer: 0.986 ± 0.264
0.658HisThr: 0.658 ± 0.207
0.658HisVal: 0.658 ± 0.225
0.219HisTrp: 0.219 ± 0.119
0.767HisTyr: 0.767 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
5.918IleAla: 5.918 ± 0.525
0.603IleCys: 0.603 ± 0.209
5.753IleAsp: 5.753 ± 0.759
5.096IleGlu: 5.096 ± 0.552
1.753IlePhe: 1.753 ± 0.381
4.384IleGly: 4.384 ± 0.386
0.877IleHis: 0.877 ± 0.233
4.274IleIle: 4.274 ± 0.688
5.753IleLys: 5.753 ± 0.725
3.233IleLeu: 3.233 ± 0.446
1.425IleMet: 1.425 ± 0.344
4.658IleAsn: 4.658 ± 0.502
2.685IlePro: 2.685 ± 0.36
2.795IleGln: 2.795 ± 0.389
2.356IleArg: 2.356 ± 0.364
4.877IleSer: 4.877 ± 0.608
4.0IleThr: 4.0 ± 0.544
3.123IleVal: 3.123 ± 0.401
0.329IleTrp: 0.329 ± 0.129
2.027IleTyr: 2.027 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
7.781LysAla: 7.781 ± 0.919
0.548LysCys: 0.548 ± 0.22
4.603LysAsp: 4.603 ± 0.542
5.973LysGlu: 5.973 ± 0.883
1.973LysPhe: 1.973 ± 0.351
4.11LysGly: 4.11 ± 0.427
1.479LysHis: 1.479 ± 0.36
5.151LysIle: 5.151 ± 0.586
5.863LysLys: 5.863 ± 0.821
6.192LysLeu: 6.192 ± 0.544
2.137LysMet: 2.137 ± 0.293
3.507LysAsn: 3.507 ± 0.39
3.671LysPro: 3.671 ± 0.486
3.397LysGln: 3.397 ± 0.412
3.781LysArg: 3.781 ± 0.524
5.37LysSer: 5.37 ± 0.646
3.945LysThr: 3.945 ± 0.401
4.658LysVal: 4.658 ± 0.607
1.205LysTrp: 1.205 ± 0.29
2.301LysTyr: 2.301 ± 0.365
0.0LysXaa: 0.0 ± 0.0
Leu
5.918LeuAla: 5.918 ± 0.434
0.712LeuCys: 0.712 ± 0.248
4.986LeuAsp: 4.986 ± 0.475
5.096LeuGlu: 5.096 ± 0.583
2.411LeuPhe: 2.411 ± 0.45
4.986LeuGly: 4.986 ± 0.467
1.479LeuHis: 1.479 ± 0.299
4.219LeuIle: 4.219 ± 0.738
5.699LeuLys: 5.699 ± 0.632
5.425LeuLeu: 5.425 ± 0.698
2.301LeuMet: 2.301 ± 0.394
5.205LeuAsn: 5.205 ± 0.617
3.671LeuPro: 3.671 ± 0.471
2.904LeuGln: 2.904 ± 0.39
2.959LeuArg: 2.959 ± 0.575
6.082LeuSer: 6.082 ± 0.526
4.986LeuThr: 4.986 ± 0.503
4.603LeuVal: 4.603 ± 0.544
0.438LeuTrp: 0.438 ± 0.152
1.644LeuTyr: 1.644 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
3.233MetAla: 3.233 ± 0.57
0.274MetCys: 0.274 ± 0.131
1.151MetAsp: 1.151 ± 0.284
1.096MetGlu: 1.096 ± 0.231
0.932MetPhe: 0.932 ± 0.295
1.699MetGly: 1.699 ± 0.242
0.493MetHis: 0.493 ± 0.201
1.37MetIle: 1.37 ± 0.289
2.137MetLys: 2.137 ± 0.34
2.082MetLeu: 2.082 ± 0.362
0.603MetMet: 0.603 ± 0.174
1.808MetAsn: 1.808 ± 0.371
1.151MetPro: 1.151 ± 0.279
1.041MetGln: 1.041 ± 0.216
1.151MetArg: 1.151 ± 0.254
1.973MetSer: 1.973 ± 0.304
1.863MetThr: 1.863 ± 0.23
1.644MetVal: 1.644 ± 0.327
0.274MetTrp: 0.274 ± 0.108
0.658MetTyr: 0.658 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
3.342AsnAla: 3.342 ± 0.628
0.274AsnCys: 0.274 ± 0.132
3.781AsnAsp: 3.781 ± 0.47
3.836AsnGlu: 3.836 ± 0.416
1.534AsnPhe: 1.534 ± 0.386
4.055AsnGly: 4.055 ± 0.502
0.932AsnHis: 0.932 ± 0.201
3.014AsnIle: 3.014 ± 0.361
4.877AsnLys: 4.877 ± 0.549
4.055AsnLeu: 4.055 ± 0.344
1.37AsnMet: 1.37 ± 0.296
3.068AsnAsn: 3.068 ± 0.671
2.685AsnPro: 2.685 ± 0.401
2.247AsnGln: 2.247 ± 0.451
1.644AsnArg: 1.644 ± 0.3
3.781AsnSer: 3.781 ± 0.456
2.356AsnThr: 2.356 ± 0.312
3.178AsnVal: 3.178 ± 0.368
0.767AsnTrp: 0.767 ± 0.234
1.918AsnTyr: 1.918 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
1.699ProAla: 1.699 ± 0.314
0.274ProCys: 0.274 ± 0.146
2.685ProAsp: 2.685 ± 0.403
3.452ProGlu: 3.452 ± 0.488
1.26ProPhe: 1.26 ± 0.329
1.918ProGly: 1.918 ± 0.372
0.603ProHis: 0.603 ± 0.192
2.356ProIle: 2.356 ± 0.369
2.301ProLys: 2.301 ± 0.347
2.356ProLeu: 2.356 ± 0.455
0.548ProMet: 0.548 ± 0.153
1.973ProAsn: 1.973 ± 0.452
1.151ProPro: 1.151 ± 0.269
1.37ProGln: 1.37 ± 0.313
1.534ProArg: 1.534 ± 0.347
2.63ProSer: 2.63 ± 0.448
2.63ProThr: 2.63 ± 0.461
2.904ProVal: 2.904 ± 0.382
0.329ProTrp: 0.329 ± 0.118
1.315ProTyr: 1.315 ± 0.403
0.0ProXaa: 0.0 ± 0.0
Gln
4.164GlnAla: 4.164 ± 0.651
0.219GlnCys: 0.219 ± 0.114
2.795GlnAsp: 2.795 ± 0.5
2.301GlnGlu: 2.301 ± 0.465
1.589GlnPhe: 1.589 ± 0.274
2.192GlnGly: 2.192 ± 0.321
0.384GlnHis: 0.384 ± 0.143
2.63GlnIle: 2.63 ± 0.358
2.795GlnLys: 2.795 ± 0.347
3.507GlnLeu: 3.507 ± 0.419
0.877GlnMet: 0.877 ± 0.216
2.082GlnAsn: 2.082 ± 0.324
1.37GlnPro: 1.37 ± 0.236
2.301GlnGln: 2.301 ± 0.565
2.027GlnArg: 2.027 ± 0.372
2.356GlnSer: 2.356 ± 0.399
2.137GlnThr: 2.137 ± 0.334
2.795GlnVal: 2.795 ± 0.406
0.603GlnTrp: 0.603 ± 0.219
1.644GlnTyr: 1.644 ± 0.332
0.0GlnXaa: 0.0 ± 0.0
Arg
2.795ArgAla: 2.795 ± 0.388
0.438ArgCys: 0.438 ± 0.208
2.301ArgAsp: 2.301 ± 0.352
3.452ArgGlu: 3.452 ± 0.651
1.699ArgPhe: 1.699 ± 0.282
2.192ArgGly: 2.192 ± 0.362
0.712ArgHis: 0.712 ± 0.192
2.849ArgIle: 2.849 ± 0.365
3.342ArgLys: 3.342 ± 0.506
3.89ArgLeu: 3.89 ± 0.614
1.589ArgMet: 1.589 ± 0.318
1.918ArgAsn: 1.918 ± 0.339
0.986ArgPro: 0.986 ± 0.269
2.137ArgGln: 2.137 ± 0.482
1.808ArgArg: 1.808 ± 0.543
2.575ArgSer: 2.575 ± 0.325
1.918ArgThr: 1.918 ± 0.322
2.356ArgVal: 2.356 ± 0.375
0.712ArgTrp: 0.712 ± 0.213
1.096ArgTyr: 1.096 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
4.438SerAla: 4.438 ± 0.682
0.877SerCys: 0.877 ± 0.314
4.986SerAsp: 4.986 ± 0.462
5.096SerGlu: 5.096 ± 0.641
2.521SerPhe: 2.521 ± 0.339
5.479SerGly: 5.479 ± 0.461
0.493SerHis: 0.493 ± 0.203
4.932SerIle: 4.932 ± 0.549
5.589SerLys: 5.589 ± 0.465
5.589SerLeu: 5.589 ± 0.499
2.192SerMet: 2.192 ± 0.372
3.397SerAsn: 3.397 ± 0.535
1.808SerPro: 1.808 ± 0.29
2.411SerGln: 2.411 ± 0.364
2.082SerArg: 2.082 ± 0.384
5.151SerSer: 5.151 ± 1.104
4.274SerThr: 4.274 ± 0.453
4.11SerVal: 4.11 ± 0.619
0.767SerTrp: 0.767 ± 0.152
2.247SerTyr: 2.247 ± 0.449
0.0SerXaa: 0.0 ± 0.0
Thr
4.0ThrAla: 4.0 ± 0.548
0.658ThrCys: 0.658 ± 0.247
3.562ThrAsp: 3.562 ± 0.665
3.945ThrGlu: 3.945 ± 0.597
2.027ThrPhe: 2.027 ± 0.279
4.712ThrGly: 4.712 ± 0.402
0.986ThrHis: 0.986 ± 0.224
4.164ThrIle: 4.164 ± 0.421
5.699ThrLys: 5.699 ± 0.721
4.11ThrLeu: 4.11 ± 0.39
1.315ThrMet: 1.315 ± 0.25
3.616ThrAsn: 3.616 ± 0.469
2.959ThrPro: 2.959 ± 0.371
3.233ThrGln: 3.233 ± 0.355
1.534ThrArg: 1.534 ± 0.33
3.671ThrSer: 3.671 ± 0.561
4.603ThrThr: 4.603 ± 0.611
3.616ThrVal: 3.616 ± 0.444
0.712ThrTrp: 0.712 ± 0.199
1.753ThrTyr: 1.753 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
6.356ValAla: 6.356 ± 0.583
0.822ValCys: 0.822 ± 0.273
3.836ValAsp: 3.836 ± 0.414
4.274ValGlu: 4.274 ± 0.506
2.027ValPhe: 2.027 ± 0.378
4.11ValGly: 4.11 ± 0.503
0.767ValHis: 0.767 ± 0.196
4.384ValIle: 4.384 ± 0.508
4.274ValLys: 4.274 ± 0.367
3.89ValLeu: 3.89 ± 0.509
1.973ValMet: 1.973 ± 0.288
3.89ValAsn: 3.89 ± 0.404
1.589ValPro: 1.589 ± 0.283
1.644ValGln: 1.644 ± 0.298
2.082ValArg: 2.082 ± 0.326
5.151ValSer: 5.151 ± 0.624
4.493ValThr: 4.493 ± 0.535
3.726ValVal: 3.726 ± 0.511
0.493ValTrp: 0.493 ± 0.14
1.973ValTyr: 1.973 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.877TrpAla: 0.877 ± 0.192
0.11TrpCys: 0.11 ± 0.073
0.603TrpAsp: 0.603 ± 0.157
0.767TrpGlu: 0.767 ± 0.181
0.658TrpPhe: 0.658 ± 0.192
0.658TrpGly: 0.658 ± 0.224
0.219TrpHis: 0.219 ± 0.092
0.658TrpIle: 0.658 ± 0.184
0.877TrpLys: 0.877 ± 0.198
1.425TrpLeu: 1.425 ± 0.249
0.11TrpMet: 0.11 ± 0.077
0.603TrpAsn: 0.603 ± 0.179
0.493TrpPro: 0.493 ± 0.163
0.603TrpGln: 0.603 ± 0.192
0.822TrpArg: 0.822 ± 0.227
0.384TrpSer: 0.384 ± 0.133
0.712TrpThr: 0.712 ± 0.214
0.877TrpVal: 0.877 ± 0.211
0.329TrpTrp: 0.329 ± 0.134
0.274TrpTyr: 0.274 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.356TyrAla: 2.356 ± 0.331
0.384TyrCys: 0.384 ± 0.16
2.301TyrAsp: 2.301 ± 0.316
2.411TyrGlu: 2.411 ± 0.342
1.753TyrPhe: 1.753 ± 0.279
2.027TyrGly: 2.027 ± 0.362
0.493TyrHis: 0.493 ± 0.194
1.699TyrIle: 1.699 ± 0.275
2.356TyrLys: 2.356 ± 0.346
1.973TyrLeu: 1.973 ± 0.387
0.877TyrMet: 0.877 ± 0.22
1.37TyrAsn: 1.37 ± 0.297
1.26TyrPro: 1.26 ± 0.338
0.877TyrGln: 0.877 ± 0.239
1.479TyrArg: 1.479 ± 0.268
2.027TyrSer: 2.027 ± 0.313
3.123TyrThr: 3.123 ± 0.969
2.027TyrVal: 2.027 ± 0.283
0.384TyrTrp: 0.384 ± 0.131
1.644TyrTyr: 1.644 ± 0.55
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (18251 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski