Amino acid dipepetide frequency for Pseudomonas phage vB_PaeP_YA3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.779AlaAla: 15.779 ± 2.603
0.991AlaCys: 0.991 ± 0.315
4.67AlaAsp: 4.67 ± 0.501
9.552AlaGlu: 9.552 ± 1.557
3.538AlaPhe: 3.538 ± 0.531
7.642AlaGly: 7.642 ± 0.863
1.274AlaHis: 1.274 ± 0.288
5.731AlaIle: 5.731 ± 0.767
4.67AlaLys: 4.67 ± 0.746
9.057AlaLeu: 9.057 ± 1.038
4.104AlaMet: 4.104 ± 0.703
3.962AlaAsn: 3.962 ± 0.482
4.033AlaPro: 4.033 ± 0.607
5.307AlaGln: 5.307 ± 0.995
7.146AlaArg: 7.146 ± 1.024
8.845AlaSer: 8.845 ± 2.412
6.651AlaThr: 6.651 ± 0.761
5.802AlaVal: 5.802 ± 0.661
2.052AlaTrp: 2.052 ± 0.484
2.193AlaTyr: 2.193 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
1.415CysAla: 1.415 ± 0.327
0.283CysCys: 0.283 ± 0.149
0.849CysAsp: 0.849 ± 0.243
1.274CysGlu: 1.274 ± 0.3
0.212CysPhe: 0.212 ± 0.119
1.415CysGly: 1.415 ± 0.38
0.283CysHis: 0.283 ± 0.148
0.637CysIle: 0.637 ± 0.182
0.566CysLys: 0.566 ± 0.188
0.495CysLeu: 0.495 ± 0.181
0.142CysMet: 0.142 ± 0.1
0.566CysAsn: 0.566 ± 0.212
1.203CysPro: 1.203 ± 0.367
0.566CysGln: 0.566 ± 0.184
0.637CysArg: 0.637 ± 0.221
0.778CysSer: 0.778 ± 0.254
0.425CysThr: 0.425 ± 0.162
0.778CysVal: 0.778 ± 0.242
0.354CysTrp: 0.354 ± 0.166
0.708CysTyr: 0.708 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
6.156AspAla: 6.156 ± 0.88
1.061AspCys: 1.061 ± 0.301
2.972AspAsp: 2.972 ± 0.568
4.528AspGlu: 4.528 ± 0.591
1.981AspPhe: 1.981 ± 0.471
5.448AspGly: 5.448 ± 0.641
1.061AspHis: 1.061 ± 0.301
3.396AspIle: 3.396 ± 0.501
1.981AspLys: 1.981 ± 0.324
5.165AspLeu: 5.165 ± 0.532
1.132AspMet: 1.132 ± 0.306
1.486AspAsn: 1.486 ± 0.331
2.618AspPro: 2.618 ± 0.397
3.255AspGln: 3.255 ± 0.594
3.609AspArg: 3.609 ± 0.472
2.759AspSer: 2.759 ± 0.386
2.335AspThr: 2.335 ± 0.478
3.75AspVal: 3.75 ± 0.583
1.486AspTrp: 1.486 ± 0.37
2.193AspTyr: 2.193 ± 0.553
0.0AspXaa: 0.0 ± 0.0
Glu
9.057GluAla: 9.057 ± 1.628
0.849GluCys: 0.849 ± 0.276
3.679GluAsp: 3.679 ± 0.534
2.972GluGlu: 2.972 ± 0.645
1.91GluPhe: 1.91 ± 0.279
4.387GluGly: 4.387 ± 0.643
0.991GluHis: 0.991 ± 0.269
4.104GluIle: 4.104 ± 0.627
2.618GluLys: 2.618 ± 0.476
6.51GluLeu: 6.51 ± 0.742
1.981GluMet: 1.981 ± 0.387
2.123GluAsn: 2.123 ± 0.319
2.689GluPro: 2.689 ± 0.417
4.104GluGln: 4.104 ± 0.672
6.297GluArg: 6.297 ± 1.075
4.245GluSer: 4.245 ± 0.561
2.476GluThr: 2.476 ± 0.324
3.962GluVal: 3.962 ± 0.576
1.769GluTrp: 1.769 ± 0.338
1.84GluTyr: 1.84 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.264PheAla: 2.264 ± 0.33
0.142PheCys: 0.142 ± 0.108
2.052PheAsp: 2.052 ± 0.39
2.193PheGlu: 2.193 ± 0.348
0.637PhePhe: 0.637 ± 0.202
2.759PheGly: 2.759 ± 0.33
0.778PheHis: 0.778 ± 0.172
1.769PheIle: 1.769 ± 0.373
2.264PheLys: 2.264 ± 0.378
1.84PheLeu: 1.84 ± 0.374
0.637PheMet: 0.637 ± 0.221
1.344PheAsn: 1.344 ± 0.217
1.132PhePro: 1.132 ± 0.323
0.991PheGln: 0.991 ± 0.33
1.486PheArg: 1.486 ± 0.299
3.255PheSer: 3.255 ± 0.674
1.486PheThr: 1.486 ± 0.461
2.123PheVal: 2.123 ± 0.368
0.778PheTrp: 0.778 ± 0.25
0.991PheTyr: 0.991 ± 0.342
0.0PheXaa: 0.0 ± 0.0
Gly
7.712GlyAla: 7.712 ± 1.047
1.274GlyCys: 1.274 ± 0.371
5.873GlyAsp: 5.873 ± 0.808
4.811GlyGlu: 4.811 ± 0.533
2.759GlyPhe: 2.759 ± 0.391
6.651GlyGly: 6.651 ± 0.945
1.344GlyHis: 1.344 ± 0.299
4.458GlyIle: 4.458 ± 0.543
4.104GlyLys: 4.104 ± 0.565
5.519GlyLeu: 5.519 ± 0.454
2.193GlyMet: 2.193 ± 0.44
3.962GlyAsn: 3.962 ± 1.063
2.264GlyPro: 2.264 ± 0.401
2.759GlyGln: 2.759 ± 0.508
5.236GlyArg: 5.236 ± 0.653
5.307GlySer: 5.307 ± 0.494
3.75GlyThr: 3.75 ± 0.486
5.873GlyVal: 5.873 ± 0.648
1.274GlyTrp: 1.274 ± 0.271
3.113GlyTyr: 3.113 ± 0.722
0.0GlyXaa: 0.0 ± 0.0
His
1.769HisAla: 1.769 ± 0.424
0.071HisCys: 0.071 ± 0.067
1.061HisAsp: 1.061 ± 0.282
1.627HisGlu: 1.627 ± 0.345
0.354HisPhe: 0.354 ± 0.158
0.991HisGly: 0.991 ± 0.269
0.354HisHis: 0.354 ± 0.151
0.991HisIle: 0.991 ± 0.265
1.132HisLys: 1.132 ± 0.33
1.557HisLeu: 1.557 ± 0.412
0.354HisMet: 0.354 ± 0.17
0.354HisAsn: 0.354 ± 0.143
1.132HisPro: 1.132 ± 0.313
0.637HisGln: 0.637 ± 0.198
1.274HisArg: 1.274 ± 0.278
1.061HisSer: 1.061 ± 0.229
0.566HisThr: 0.566 ± 0.265
0.991HisVal: 0.991 ± 0.268
0.212HisTrp: 0.212 ± 0.112
0.495HisTyr: 0.495 ± 0.15
0.0HisXaa: 0.0 ± 0.0
Ile
6.58IleAla: 6.58 ± 0.883
0.425IleCys: 0.425 ± 0.172
4.528IleAsp: 4.528 ± 0.54
4.245IleGlu: 4.245 ± 0.554
1.415IlePhe: 1.415 ± 0.324
4.316IleGly: 4.316 ± 0.514
0.495IleHis: 0.495 ± 0.155
1.769IleIle: 1.769 ± 0.407
2.335IleLys: 2.335 ± 0.438
3.538IleLeu: 3.538 ± 0.687
1.061IleMet: 1.061 ± 0.245
2.406IleAsn: 2.406 ± 0.392
2.476IlePro: 2.476 ± 0.37
1.84IleGln: 1.84 ± 0.307
3.962IleArg: 3.962 ± 0.438
4.811IleSer: 4.811 ± 0.809
2.901IleThr: 2.901 ± 0.459
2.618IleVal: 2.618 ± 0.548
0.495IleTrp: 0.495 ± 0.167
0.778IleTyr: 0.778 ± 0.222
0.0IleXaa: 0.0 ± 0.0
Lys
5.024LysAla: 5.024 ± 0.696
0.778LysCys: 0.778 ± 0.293
2.901LysAsp: 2.901 ± 0.394
2.759LysGlu: 2.759 ± 0.47
1.627LysPhe: 1.627 ± 0.313
3.326LysGly: 3.326 ± 0.612
0.849LysHis: 0.849 ± 0.254
1.981LysIle: 1.981 ± 0.362
1.274LysLys: 1.274 ± 0.249
3.892LysLeu: 3.892 ± 0.63
0.849LysMet: 0.849 ± 0.235
1.344LysAsn: 1.344 ± 0.313
2.335LysPro: 2.335 ± 0.443
1.91LysGln: 1.91 ± 0.449
3.255LysArg: 3.255 ± 0.541
3.184LysSer: 3.184 ± 0.467
1.91LysThr: 1.91 ± 0.35
2.264LysVal: 2.264 ± 0.386
0.708LysTrp: 0.708 ± 0.207
0.849LysTyr: 0.849 ± 0.236
0.0LysXaa: 0.0 ± 0.0
Leu
8.491LeuAla: 8.491 ± 0.839
0.991LeuCys: 0.991 ± 0.265
4.175LeuAsp: 4.175 ± 0.629
4.599LeuGlu: 4.599 ± 0.498
1.84LeuPhe: 1.84 ± 0.387
6.085LeuGly: 6.085 ± 0.678
1.415LeuHis: 1.415 ± 0.337
3.609LeuIle: 3.609 ± 0.672
3.75LeuLys: 3.75 ± 0.632
6.722LeuLeu: 6.722 ± 0.734
1.84LeuMet: 1.84 ± 0.405
2.547LeuAsn: 2.547 ± 0.572
3.326LeuPro: 3.326 ± 0.416
1.981LeuGln: 1.981 ± 0.544
6.793LeuArg: 6.793 ± 0.651
6.156LeuSer: 6.156 ± 0.697
4.316LeuThr: 4.316 ± 0.697
4.387LeuVal: 4.387 ± 0.607
0.92LeuTrp: 0.92 ± 0.246
2.052LeuTyr: 2.052 ± 0.407
0.0LeuXaa: 0.0 ± 0.0
Met
2.759MetAla: 2.759 ± 0.415
0.283MetCys: 0.283 ± 0.158
1.415MetAsp: 1.415 ± 0.246
1.132MetGlu: 1.132 ± 0.277
0.708MetPhe: 0.708 ± 0.234
1.627MetGly: 1.627 ± 0.372
0.708MetHis: 0.708 ± 0.247
1.274MetIle: 1.274 ± 0.25
1.203MetLys: 1.203 ± 0.289
2.052MetLeu: 2.052 ± 0.383
0.849MetMet: 0.849 ± 0.244
0.637MetAsn: 0.637 ± 0.219
2.052MetPro: 2.052 ± 0.32
0.991MetGln: 0.991 ± 0.284
1.84MetArg: 1.84 ± 0.404
2.264MetSer: 2.264 ± 0.354
1.769MetThr: 1.769 ± 0.333
0.637MetVal: 0.637 ± 0.226
0.283MetTrp: 0.283 ± 0.12
0.991MetTyr: 0.991 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
3.043AsnAla: 3.043 ± 0.543
1.061AsnCys: 1.061 ± 0.266
1.557AsnAsp: 1.557 ± 0.319
1.84AsnGlu: 1.84 ± 0.343
0.708AsnPhe: 0.708 ± 0.241
3.326AsnGly: 3.326 ± 0.441
0.495AsnHis: 0.495 ± 0.192
1.274AsnIle: 1.274 ± 0.302
1.203AsnLys: 1.203 ± 0.273
1.981AsnLeu: 1.981 ± 0.318
0.637AsnMet: 0.637 ± 0.222
0.92AsnAsn: 0.92 ± 0.231
2.052AsnPro: 2.052 ± 0.356
1.274AsnGln: 1.274 ± 0.275
3.113AsnArg: 3.113 ± 0.975
1.84AsnSer: 1.84 ± 0.394
3.467AsnThr: 3.467 ± 2.046
2.193AsnVal: 2.193 ± 0.45
0.849AsnTrp: 0.849 ± 0.214
1.061AsnTyr: 1.061 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
4.104ProAla: 4.104 ± 0.436
0.778ProCys: 0.778 ± 0.306
3.467ProAsp: 3.467 ± 0.589
4.953ProGlu: 4.953 ± 0.685
1.415ProPhe: 1.415 ± 0.364
3.326ProGly: 3.326 ± 0.549
0.637ProHis: 0.637 ± 0.212
2.335ProIle: 2.335 ± 0.395
1.769ProLys: 1.769 ± 0.389
3.255ProLeu: 3.255 ± 0.453
0.637ProMet: 0.637 ± 0.209
1.557ProAsn: 1.557 ± 0.349
1.344ProPro: 1.344 ± 0.355
1.769ProGln: 1.769 ± 0.355
2.052ProArg: 2.052 ± 0.362
2.335ProSer: 2.335 ± 0.588
2.335ProThr: 2.335 ± 0.511
3.255ProVal: 3.255 ± 0.603
0.637ProTrp: 0.637 ± 0.22
1.274ProTyr: 1.274 ± 0.331
0.0ProXaa: 0.0 ± 0.0
Gln
6.793GlnAla: 6.793 ± 1.145
0.708GlnCys: 0.708 ± 0.215
1.274GlnAsp: 1.274 ± 0.294
2.476GlnGlu: 2.476 ± 0.427
1.627GlnPhe: 1.627 ± 0.343
3.184GlnGly: 3.184 ± 0.588
0.991GlnHis: 0.991 ± 0.259
2.123GlnIle: 2.123 ± 0.371
1.274GlnLys: 1.274 ± 0.563
2.547GlnLeu: 2.547 ± 0.521
0.849GlnMet: 0.849 ± 0.262
0.991GlnAsn: 0.991 ± 0.242
2.052GlnPro: 2.052 ± 0.448
3.538GlnGln: 3.538 ± 0.859
3.821GlnArg: 3.821 ± 0.616
2.759GlnSer: 2.759 ± 0.693
1.91GlnThr: 1.91 ± 0.356
2.689GlnVal: 2.689 ± 0.402
0.637GlnTrp: 0.637 ± 0.278
1.557GlnTyr: 1.557 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
7.995ArgAla: 7.995 ± 1.031
0.778ArgCys: 0.778 ± 0.285
4.387ArgAsp: 4.387 ± 0.593
5.448ArgGlu: 5.448 ± 1.002
2.193ArgPhe: 2.193 ± 0.381
4.458ArgGly: 4.458 ± 0.6
1.627ArgHis: 1.627 ± 0.304
4.316ArgIle: 4.316 ± 0.567
3.467ArgLys: 3.467 ± 0.752
5.448ArgLeu: 5.448 ± 0.595
2.193ArgMet: 2.193 ± 0.361
1.981ArgAsn: 1.981 ± 0.306
2.759ArgPro: 2.759 ± 0.523
3.821ArgGln: 3.821 ± 0.595
5.802ArgArg: 5.802 ± 0.759
4.811ArgSer: 4.811 ± 0.967
3.184ArgThr: 3.184 ± 0.423
3.679ArgVal: 3.679 ± 0.397
1.203ArgTrp: 1.203 ± 0.259
2.264ArgTyr: 2.264 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
8.845SerAla: 8.845 ± 2.091
1.203SerCys: 1.203 ± 0.307
4.316SerAsp: 4.316 ± 0.555
3.467SerGlu: 3.467 ± 0.523
2.052SerPhe: 2.052 ± 0.385
6.58SerGly: 6.58 ± 0.72
1.203SerHis: 1.203 ± 0.325
3.75SerIle: 3.75 ± 0.524
2.406SerLys: 2.406 ± 0.324
4.245SerLeu: 4.245 ± 0.507
1.981SerMet: 1.981 ± 0.291
3.892SerAsn: 3.892 ± 1.978
2.759SerPro: 2.759 ± 0.454
2.264SerGln: 2.264 ± 0.417
5.377SerArg: 5.377 ± 0.588
4.458SerSer: 4.458 ± 0.59
3.679SerThr: 3.679 ± 0.639
4.316SerVal: 4.316 ± 0.54
0.849SerTrp: 0.849 ± 0.22
2.193SerTyr: 2.193 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
6.934ThrAla: 6.934 ± 0.676
0.495ThrCys: 0.495 ± 0.222
2.476ThrAsp: 2.476 ± 0.393
2.759ThrGlu: 2.759 ± 0.425
1.769ThrPhe: 1.769 ± 0.413
7.076ThrGly: 7.076 ± 1.922
0.637ThrHis: 0.637 ± 0.208
3.255ThrIle: 3.255 ± 0.574
2.123ThrLys: 2.123 ± 0.417
4.387ThrLeu: 4.387 ± 0.716
1.203ThrMet: 1.203 ± 0.227
0.92ThrAsn: 0.92 ± 0.237
2.193ThrPro: 2.193 ± 0.368
1.84ThrGln: 1.84 ± 0.299
2.618ThrArg: 2.618 ± 0.482
2.901ThrSer: 2.901 ± 0.368
3.75ThrThr: 3.75 ± 0.641
3.396ThrVal: 3.396 ± 0.534
0.708ThrTrp: 0.708 ± 0.282
1.344ThrTyr: 1.344 ± 0.302
0.0ThrXaa: 0.0 ± 0.0
Val
4.741ValAla: 4.741 ± 0.45
0.708ValCys: 0.708 ± 0.286
4.67ValAsp: 4.67 ± 0.655
4.316ValGlu: 4.316 ± 0.569
2.123ValPhe: 2.123 ± 0.413
4.67ValGly: 4.67 ± 0.543
0.778ValHis: 0.778 ± 0.222
3.679ValIle: 3.679 ± 0.634
2.406ValLys: 2.406 ± 0.389
4.316ValLeu: 4.316 ± 0.721
1.274ValMet: 1.274 ± 0.352
1.627ValAsn: 1.627 ± 0.258
2.83ValPro: 2.83 ± 0.398
2.335ValGln: 2.335 ± 0.349
3.892ValArg: 3.892 ± 0.58
5.094ValSer: 5.094 ± 0.771
3.326ValThr: 3.326 ± 0.492
4.033ValVal: 4.033 ± 0.712
0.849ValTrp: 0.849 ± 0.229
1.698ValTyr: 1.698 ± 0.41
0.0ValXaa: 0.0 ± 0.0
Trp
1.486TrpAla: 1.486 ± 0.338
0.142TrpCys: 0.142 ± 0.101
0.849TrpAsp: 0.849 ± 0.245
1.061TrpGlu: 1.061 ± 0.264
0.849TrpPhe: 0.849 ± 0.234
1.132TrpGly: 1.132 ± 0.279
0.495TrpHis: 0.495 ± 0.203
0.778TrpIle: 0.778 ± 0.223
1.274TrpLys: 1.274 ± 0.271
1.557TrpLeu: 1.557 ± 0.272
0.778TrpMet: 0.778 ± 0.29
0.212TrpAsn: 0.212 ± 0.124
0.354TrpPro: 0.354 ± 0.188
1.061TrpGln: 1.061 ± 0.33
1.415TrpArg: 1.415 ± 0.311
1.061TrpSer: 1.061 ± 0.265
0.708TrpThr: 0.708 ± 0.265
1.132TrpVal: 1.132 ± 0.232
0.212TrpTrp: 0.212 ± 0.11
0.425TrpTyr: 0.425 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.052TyrAla: 2.052 ± 0.291
0.566TyrCys: 0.566 ± 0.213
1.203TyrAsp: 1.203 ± 0.318
2.547TyrGlu: 2.547 ± 0.452
1.203TyrPhe: 1.203 ± 0.343
1.84TyrGly: 1.84 ± 0.314
0.566TyrHis: 0.566 ± 0.222
1.769TyrIle: 1.769 ± 0.402
1.203TyrLys: 1.203 ± 0.289
1.981TyrLeu: 1.981 ± 0.483
0.708TyrMet: 0.708 ± 0.209
1.061TyrAsn: 1.061 ± 0.305
1.627TyrPro: 1.627 ± 0.436
1.557TyrGln: 1.557 ± 0.722
2.193TyrArg: 2.193 ± 0.43
1.91TyrSer: 1.91 ± 0.553
1.84TyrThr: 1.84 ± 0.322
1.486TyrVal: 1.486 ± 0.39
0.708TyrTrp: 0.708 ± 0.264
1.132TyrTyr: 1.132 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (14134 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski