Amino acid dipepetide frequency for Pseudomonas phage YuA

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.036AlaAla: 13.036 ± 1.196
0.986AlaCys: 0.986 ± 0.232
6.189AlaAsp: 6.189 ± 0.595
8.216AlaGlu: 8.216 ± 0.678
4.327AlaPhe: 4.327 ± 0.422
8.161AlaGly: 8.161 ± 0.893
1.972AlaHis: 1.972 ± 0.304
4.984AlaIle: 4.984 ± 0.56
5.258AlaLys: 5.258 ± 0.656
8.928AlaLeu: 8.928 ± 0.609
2.191AlaMet: 2.191 ± 0.338
3.122AlaAsn: 3.122 ± 0.544
4.711AlaPro: 4.711 ± 0.558
4.382AlaGln: 4.382 ± 0.733
8.106AlaArg: 8.106 ± 0.736
4.053AlaSer: 4.053 ± 0.452
5.861AlaThr: 5.861 ± 0.59
7.23AlaVal: 7.23 ± 0.682
1.643AlaTrp: 1.643 ± 0.31
2.684AlaTyr: 2.684 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
1.041CysAla: 1.041 ± 0.256
0.219CysCys: 0.219 ± 0.099
0.548CysAsp: 0.548 ± 0.182
0.767CysGlu: 0.767 ± 0.23
0.274CysPhe: 0.274 ± 0.12
0.767CysGly: 0.767 ± 0.203
0.438CysHis: 0.438 ± 0.151
0.11CysIle: 0.11 ± 0.069
0.274CysLys: 0.274 ± 0.117
1.095CysLeu: 1.095 ± 0.211
0.11CysMet: 0.11 ± 0.067
0.493CysAsn: 0.493 ± 0.156
0.493CysPro: 0.493 ± 0.174
0.219CysGln: 0.219 ± 0.114
1.041CysArg: 1.041 ± 0.217
0.438CysSer: 0.438 ± 0.145
0.329CysThr: 0.329 ± 0.124
0.329CysVal: 0.329 ± 0.127
0.274CysTrp: 0.274 ± 0.146
0.438CysTyr: 0.438 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
5.532AspAla: 5.532 ± 0.698
0.493AspCys: 0.493 ± 0.175
2.684AspAsp: 2.684 ± 0.432
3.998AspGlu: 3.998 ± 0.476
2.3AspPhe: 2.3 ± 0.415
5.642AspGly: 5.642 ± 0.57
1.479AspHis: 1.479 ± 0.288
3.232AspIle: 3.232 ± 0.45
2.574AspLys: 2.574 ± 0.411
6.682AspLeu: 6.682 ± 0.598
1.534AspMet: 1.534 ± 0.31
1.698AspAsn: 1.698 ± 0.249
3.834AspPro: 3.834 ± 0.461
2.574AspGln: 2.574 ± 0.323
3.451AspArg: 3.451 ± 0.402
2.793AspSer: 2.793 ± 0.329
2.246AspThr: 2.246 ± 0.288
2.903AspVal: 2.903 ± 0.4
1.862AspTrp: 1.862 ± 0.301
2.136AspTyr: 2.136 ± 0.33
0.0AspXaa: 0.0 ± 0.0
Glu
8.161GluAla: 8.161 ± 0.771
0.548GluCys: 0.548 ± 0.146
3.834GluAsp: 3.834 ± 0.448
5.696GluGlu: 5.696 ± 0.778
2.574GluPhe: 2.574 ± 0.334
5.423GluGly: 5.423 ± 0.633
1.424GluHis: 1.424 ± 0.301
2.081GluIle: 2.081 ± 0.374
2.903GluLys: 2.903 ± 0.422
6.682GluLeu: 6.682 ± 0.678
1.369GluMet: 1.369 ± 0.216
2.081GluAsn: 2.081 ± 0.311
3.177GluPro: 3.177 ± 0.452
3.232GluGln: 3.232 ± 0.516
5.313GluArg: 5.313 ± 0.605
2.355GluSer: 2.355 ± 0.301
3.341GluThr: 3.341 ± 0.43
5.477GluVal: 5.477 ± 0.611
1.917GluTrp: 1.917 ± 0.354
1.972GluTyr: 1.972 ± 0.332
0.0GluXaa: 0.0 ± 0.0
Phe
4.218PheAla: 4.218 ± 0.423
0.219PheCys: 0.219 ± 0.093
2.684PheAsp: 2.684 ± 0.365
3.122PheGlu: 3.122 ± 0.371
1.424PhePhe: 1.424 ± 0.299
3.013PheGly: 3.013 ± 0.451
0.493PheHis: 0.493 ± 0.197
1.095PheIle: 1.095 ± 0.206
1.315PheLys: 1.315 ± 0.284
3.341PheLeu: 3.341 ± 0.411
1.15PheMet: 1.15 ± 0.266
2.136PheAsn: 2.136 ± 0.332
1.205PhePro: 1.205 ± 0.276
1.424PheGln: 1.424 ± 0.299
2.903PheArg: 2.903 ± 0.377
1.643PheSer: 1.643 ± 0.267
1.808PheThr: 1.808 ± 0.322
1.862PheVal: 1.862 ± 0.321
0.767PheTrp: 0.767 ± 0.205
0.986PheTyr: 0.986 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
6.956GlyAla: 6.956 ± 0.726
0.986GlyCys: 0.986 ± 0.223
4.327GlyAsp: 4.327 ± 0.544
5.642GlyGlu: 5.642 ± 0.62
3.013GlyPhe: 3.013 ± 0.455
7.285GlyGly: 7.285 ± 1.001
1.479GlyHis: 1.479 ± 0.309
3.725GlyIle: 3.725 ± 0.477
4.382GlyLys: 4.382 ± 0.489
6.628GlyLeu: 6.628 ± 0.625
1.534GlyMet: 1.534 ± 0.279
2.52GlyAsn: 2.52 ± 0.371
3.506GlyPro: 3.506 ± 0.423
3.286GlyGln: 3.286 ± 0.459
5.258GlyArg: 5.258 ± 0.512
5.258GlySer: 5.258 ± 0.824
5.203GlyThr: 5.203 ± 0.529
4.656GlyVal: 4.656 ± 0.53
1.315GlyTrp: 1.315 ± 0.243
3.396GlyTyr: 3.396 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
1.753HisAla: 1.753 ± 0.292
0.274HisCys: 0.274 ± 0.112
0.876HisAsp: 0.876 ± 0.265
1.369HisGlu: 1.369 ± 0.273
0.876HisPhe: 0.876 ± 0.23
1.424HisGly: 1.424 ± 0.27
0.219HisHis: 0.219 ± 0.106
1.095HisIle: 1.095 ± 0.214
1.15HisLys: 1.15 ± 0.268
1.643HisLeu: 1.643 ± 0.362
0.438HisMet: 0.438 ± 0.173
0.438HisAsn: 0.438 ± 0.151
1.15HisPro: 1.15 ± 0.187
0.548HisGln: 0.548 ± 0.154
1.205HisArg: 1.205 ± 0.283
0.767HisSer: 0.767 ± 0.201
0.657HisThr: 0.657 ± 0.17
1.753HisVal: 1.753 ± 0.349
0.493HisTrp: 0.493 ± 0.15
0.931HisTyr: 0.931 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
4.601IleAla: 4.601 ± 0.501
0.274IleCys: 0.274 ± 0.113
3.779IleAsp: 3.779 ± 0.524
3.341IleGlu: 3.341 ± 0.397
1.479IlePhe: 1.479 ± 0.34
2.191IleGly: 2.191 ± 0.372
1.15IleHis: 1.15 ± 0.309
1.862IleIle: 1.862 ± 0.28
2.355IleLys: 2.355 ± 0.454
2.52IleLeu: 2.52 ± 0.369
1.205IleMet: 1.205 ± 0.241
1.862IleAsn: 1.862 ± 0.346
2.246IlePro: 2.246 ± 0.403
1.917IleGln: 1.917 ± 0.327
2.684IleArg: 2.684 ± 0.341
2.136IleSer: 2.136 ± 0.368
3.286IleThr: 3.286 ± 0.417
2.903IleVal: 2.903 ± 0.404
0.712IleTrp: 0.712 ± 0.182
1.369IleTyr: 1.369 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
6.354LysAla: 6.354 ± 0.734
0.274LysCys: 0.274 ± 0.114
2.574LysAsp: 2.574 ± 0.488
2.739LysGlu: 2.739 ± 0.38
1.588LysPhe: 1.588 ± 0.255
2.465LysGly: 2.465 ± 0.435
0.986LysHis: 0.986 ± 0.293
1.479LysIle: 1.479 ± 0.325
2.355LysLys: 2.355 ± 0.318
3.56LysLeu: 3.56 ± 0.397
0.986LysMet: 0.986 ± 0.222
1.424LysAsn: 1.424 ± 0.274
3.122LysPro: 3.122 ± 0.481
1.315LysGln: 1.315 ± 0.274
3.122LysArg: 3.122 ± 0.421
2.465LysSer: 2.465 ± 0.396
2.958LysThr: 2.958 ± 0.459
3.56LysVal: 3.56 ± 0.482
0.767LysTrp: 0.767 ± 0.192
1.643LysTyr: 1.643 ± 0.286
0.0LysXaa: 0.0 ± 0.0
Leu
8.106LeuAla: 8.106 ± 0.81
1.15LeuCys: 1.15 ± 0.251
5.094LeuAsp: 5.094 ± 0.493
4.984LeuGlu: 4.984 ± 0.533
2.574LeuPhe: 2.574 ± 0.338
6.956LeuGly: 6.956 ± 0.703
1.315LeuHis: 1.315 ± 0.281
3.615LeuIle: 3.615 ± 0.411
4.218LeuLys: 4.218 ± 0.579
7.23LeuLeu: 7.23 ± 0.966
1.808LeuMet: 1.808 ± 0.365
3.834LeuAsn: 3.834 ± 0.52
5.477LeuPro: 5.477 ± 0.613
3.725LeuGln: 3.725 ± 0.373
7.285LeuArg: 7.285 ± 0.608
4.546LeuSer: 4.546 ± 0.582
5.806LeuThr: 5.806 ± 0.571
4.82LeuVal: 4.82 ± 0.507
1.643LeuTrp: 1.643 ± 0.253
2.246LeuTyr: 2.246 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
2.52MetAla: 2.52 ± 0.417
0.219MetCys: 0.219 ± 0.104
1.643MetAsp: 1.643 ± 0.329
1.26MetGlu: 1.26 ± 0.288
0.822MetPhe: 0.822 ± 0.197
1.534MetGly: 1.534 ± 0.36
0.164MetHis: 0.164 ± 0.091
1.479MetIle: 1.479 ± 0.289
0.767MetLys: 0.767 ± 0.184
1.15MetLeu: 1.15 ± 0.194
1.095MetMet: 1.095 ± 0.278
1.205MetAsn: 1.205 ± 0.24
0.822MetPro: 0.822 ± 0.201
0.274MetGln: 0.274 ± 0.108
1.808MetArg: 1.808 ± 0.296
1.808MetSer: 1.808 ± 0.269
1.369MetThr: 1.369 ± 0.28
1.041MetVal: 1.041 ± 0.187
0.274MetTrp: 0.274 ± 0.142
0.383MetTyr: 0.383 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
3.834AsnAla: 3.834 ± 0.532
0.274AsnCys: 0.274 ± 0.126
1.808AsnAsp: 1.808 ± 0.323
2.41AsnGlu: 2.41 ± 0.425
0.986AsnPhe: 0.986 ± 0.22
4.108AsnGly: 4.108 ± 0.565
0.548AsnHis: 0.548 ± 0.154
1.588AsnIle: 1.588 ± 0.259
1.479AsnLys: 1.479 ± 0.301
3.067AsnLeu: 3.067 ± 0.407
0.657AsnMet: 0.657 ± 0.151
1.643AsnAsn: 1.643 ± 0.439
2.574AsnPro: 2.574 ± 0.378
1.917AsnGln: 1.917 ± 0.358
2.081AsnArg: 2.081 ± 0.29
1.917AsnSer: 1.917 ± 0.317
1.588AsnThr: 1.588 ± 0.261
3.506AsnVal: 3.506 ± 0.623
0.822AsnTrp: 0.822 ± 0.202
1.095AsnTyr: 1.095 ± 0.321
0.0AsnXaa: 0.0 ± 0.0
Pro
5.258ProAla: 5.258 ± 0.526
0.603ProCys: 0.603 ± 0.166
3.615ProAsp: 3.615 ± 0.423
4.437ProGlu: 4.437 ± 0.563
2.3ProPhe: 2.3 ± 0.337
4.82ProGly: 4.82 ± 0.649
0.767ProHis: 0.767 ± 0.222
2.739ProIle: 2.739 ± 0.273
2.465ProLys: 2.465 ± 0.422
3.944ProLeu: 3.944 ± 0.427
0.931ProMet: 0.931 ± 0.193
2.081ProAsn: 2.081 ± 0.377
2.465ProPro: 2.465 ± 0.392
1.479ProGln: 1.479 ± 0.257
2.793ProArg: 2.793 ± 0.497
2.465ProSer: 2.465 ± 0.317
2.684ProThr: 2.684 ± 0.373
3.286ProVal: 3.286 ± 0.476
1.205ProTrp: 1.205 ± 0.332
1.588ProTyr: 1.588 ± 0.349
0.0ProXaa: 0.0 ± 0.0
Gln
4.272GlnAla: 4.272 ± 0.491
0.164GlnCys: 0.164 ± 0.086
2.081GlnAsp: 2.081 ± 0.33
2.739GlnGlu: 2.739 ± 0.454
1.424GlnPhe: 1.424 ± 0.254
3.122GlnGly: 3.122 ± 0.432
1.205GlnHis: 1.205 ± 0.261
1.643GlnIle: 1.643 ± 0.255
1.369GlnLys: 1.369 ± 0.219
3.67GlnLeu: 3.67 ± 0.666
1.205GlnMet: 1.205 ± 0.228
1.26GlnAsn: 1.26 ± 0.347
2.027GlnPro: 2.027 ± 0.332
2.684GlnGln: 2.684 ± 0.452
2.793GlnArg: 2.793 ± 0.38
1.424GlnSer: 1.424 ± 0.285
2.355GlnThr: 2.355 ± 0.298
2.848GlnVal: 2.848 ± 0.424
0.603GlnTrp: 0.603 ± 0.18
0.767GlnTyr: 0.767 ± 0.2
0.0GlnXaa: 0.0 ± 0.0
Arg
7.121ArgAla: 7.121 ± 0.704
0.603ArgCys: 0.603 ± 0.178
5.368ArgAsp: 5.368 ± 0.54
5.258ArgGlu: 5.258 ± 0.589
2.629ArgPhe: 2.629 ± 0.384
4.546ArgGly: 4.546 ± 0.523
1.534ArgHis: 1.534 ± 0.331
2.629ArgIle: 2.629 ± 0.38
2.903ArgLys: 2.903 ± 0.424
6.682ArgLeu: 6.682 ± 0.66
1.315ArgMet: 1.315 ± 0.3
2.465ArgAsn: 2.465 ± 0.324
3.067ArgPro: 3.067 ± 0.438
3.286ArgGln: 3.286 ± 0.447
5.313ArgArg: 5.313 ± 0.783
3.286ArgSer: 3.286 ± 0.475
3.506ArgThr: 3.506 ± 0.411
5.039ArgVal: 5.039 ± 0.488
1.479ArgTrp: 1.479 ± 0.305
1.917ArgTyr: 1.917 ± 0.355
0.0ArgXaa: 0.0 ± 0.0
Ser
4.711SerAla: 4.711 ± 0.506
0.274SerCys: 0.274 ± 0.13
2.465SerAsp: 2.465 ± 0.33
2.958SerGlu: 2.958 ± 0.41
2.739SerPhe: 2.739 ± 0.389
5.423SerGly: 5.423 ± 0.582
0.767SerHis: 0.767 ± 0.196
1.972SerIle: 1.972 ± 0.341
2.903SerLys: 2.903 ± 0.385
3.451SerLeu: 3.451 ± 0.433
0.931SerMet: 0.931 ± 0.208
2.41SerAsn: 2.41 ± 0.332
2.3SerPro: 2.3 ± 0.334
1.588SerGln: 1.588 ± 0.307
3.067SerArg: 3.067 ± 0.353
2.684SerSer: 2.684 ± 0.344
1.917SerThr: 1.917 ± 0.342
2.793SerVal: 2.793 ± 0.319
1.315SerTrp: 1.315 ± 0.32
1.808SerTyr: 1.808 ± 0.345
0.0SerXaa: 0.0 ± 0.0
Thr
6.299ThrAla: 6.299 ± 0.686
0.548ThrCys: 0.548 ± 0.22
2.793ThrAsp: 2.793 ± 0.442
2.739ThrGlu: 2.739 ± 0.307
2.136ThrPhe: 2.136 ± 0.335
5.039ThrGly: 5.039 ± 0.459
0.603ThrHis: 0.603 ± 0.182
3.177ThrIle: 3.177 ± 0.377
2.136ThrLys: 2.136 ± 0.368
4.93ThrLeu: 4.93 ± 0.526
1.041ThrMet: 1.041 ± 0.29
1.917ThrAsn: 1.917 ± 0.313
2.848ThrPro: 2.848 ± 0.368
1.534ThrGln: 1.534 ± 0.294
3.067ThrArg: 3.067 ± 0.464
2.684ThrSer: 2.684 ± 0.407
3.615ThrThr: 3.615 ± 0.504
5.149ThrVal: 5.149 ± 0.499
0.822ThrTrp: 0.822 ± 0.212
1.917ThrTyr: 1.917 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
7.011ValAla: 7.011 ± 0.635
0.767ValCys: 0.767 ± 0.223
4.491ValAsp: 4.491 ± 0.527
4.711ValGlu: 4.711 ± 0.444
1.808ValPhe: 1.808 ± 0.261
4.218ValGly: 4.218 ± 0.447
1.479ValHis: 1.479 ± 0.268
2.958ValIle: 2.958 ± 0.415
2.684ValLys: 2.684 ± 0.401
6.463ValLeu: 6.463 ± 0.644
1.26ValMet: 1.26 ± 0.242
2.629ValAsn: 2.629 ± 0.462
3.889ValPro: 3.889 ± 0.444
2.246ValGln: 2.246 ± 0.349
5.477ValArg: 5.477 ± 0.454
3.067ValSer: 3.067 ± 0.483
3.506ValThr: 3.506 ± 0.445
4.711ValVal: 4.711 ± 0.549
1.479ValTrp: 1.479 ± 0.286
2.081ValTyr: 2.081 ± 0.315
0.0ValXaa: 0.0 ± 0.0
Trp
2.41TrpAla: 2.41 ± 0.292
0.329TrpCys: 0.329 ± 0.141
1.369TrpAsp: 1.369 ± 0.264
1.753TrpGlu: 1.753 ± 0.274
0.603TrpPhe: 0.603 ± 0.177
0.931TrpGly: 0.931 ± 0.226
0.383TrpHis: 0.383 ± 0.134
0.876TrpIle: 0.876 ± 0.23
1.15TrpLys: 1.15 ± 0.229
2.081TrpLeu: 2.081 ± 0.402
0.438TrpMet: 0.438 ± 0.146
1.15TrpAsn: 1.15 ± 0.267
0.822TrpPro: 0.822 ± 0.219
0.603TrpGln: 0.603 ± 0.189
1.095TrpArg: 1.095 ± 0.26
0.876TrpSer: 0.876 ± 0.253
1.315TrpThr: 1.315 ± 0.289
1.15TrpVal: 1.15 ± 0.242
0.274TrpTrp: 0.274 ± 0.148
0.931TrpTyr: 0.931 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.848TyrAla: 2.848 ± 0.446
0.493TyrCys: 0.493 ± 0.16
1.424TyrAsp: 1.424 ± 0.27
1.424TyrGlu: 1.424 ± 0.269
0.931TyrPhe: 0.931 ± 0.242
3.286TyrGly: 3.286 ± 0.462
0.657TyrHis: 0.657 ± 0.191
1.534TyrIle: 1.534 ± 0.347
1.095TyrLys: 1.095 ± 0.304
2.684TyrLeu: 2.684 ± 0.353
0.383TyrMet: 0.383 ± 0.153
1.534TyrAsn: 1.534 ± 0.245
2.191TyrPro: 2.191 ± 0.425
1.424TyrGln: 1.424 ± 0.222
2.081TyrArg: 2.081 ± 0.321
1.808TyrSer: 1.808 ± 0.305
1.698TyrThr: 1.698 ± 0.303
1.917TyrVal: 1.917 ± 0.287
0.876TyrTrp: 0.876 ± 0.193
0.493TyrTyr: 0.493 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (18258 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski