Amino acid dipepetide frequency for Pseudomonas phage Henu5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.444AlaAla: 6.444 ± 0.633
1.44AlaCys: 1.44 ± 0.258
4.207AlaAsp: 4.207 ± 0.42
5.458AlaGlu: 5.458 ± 0.538
2.994AlaPhe: 2.994 ± 0.33
5.648AlaGly: 5.648 ± 0.555
1.744AlaHis: 1.744 ± 0.244
5.079AlaIle: 5.079 ± 0.356
4.927AlaLys: 4.927 ± 0.568
6.519AlaLeu: 6.519 ± 0.574
2.729AlaMet: 2.729 ± 0.323
3.222AlaAsn: 3.222 ± 0.437
2.54AlaPro: 2.54 ± 0.391
3.222AlaGln: 3.222 ± 0.455
5.269AlaArg: 5.269 ± 0.44
4.131AlaSer: 4.131 ± 0.451
5.382AlaThr: 5.382 ± 0.553
5.534AlaVal: 5.534 ± 0.465
1.289AlaTrp: 1.289 ± 0.237
2.426AlaTyr: 2.426 ± 0.335
0.038AlaXaa: 0.038 ± 0.036
Cys
0.872CysAla: 0.872 ± 0.189
0.227CysCys: 0.227 ± 0.096
0.872CysAsp: 0.872 ± 0.195
0.91CysGlu: 0.91 ± 0.176
0.606CysPhe: 0.606 ± 0.132
0.834CysGly: 0.834 ± 0.192
0.265CysHis: 0.265 ± 0.117
0.644CysIle: 0.644 ± 0.141
0.872CysLys: 0.872 ± 0.194
0.644CysLeu: 0.644 ± 0.158
0.644CysMet: 0.644 ± 0.155
0.948CysAsn: 0.948 ± 0.208
0.606CysPro: 0.606 ± 0.183
0.796CysGln: 0.796 ± 0.18
0.985CysArg: 0.985 ± 0.213
0.985CysSer: 0.985 ± 0.188
0.606CysThr: 0.606 ± 0.149
0.72CysVal: 0.72 ± 0.205
0.379CysTrp: 0.379 ± 0.121
0.531CysTyr: 0.531 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
4.245AspAla: 4.245 ± 0.439
0.758AspCys: 0.758 ± 0.21
3.26AspAsp: 3.26 ± 0.434
3.411AspGlu: 3.411 ± 0.422
2.464AspPhe: 2.464 ± 0.281
4.094AspGly: 4.094 ± 0.386
1.175AspHis: 1.175 ± 0.217
3.449AspIle: 3.449 ± 0.366
3.298AspLys: 3.298 ± 0.333
5.079AspLeu: 5.079 ± 0.416
1.706AspMet: 1.706 ± 0.273
2.009AspAsn: 2.009 ± 0.301
3.108AspPro: 3.108 ± 0.318
1.933AspGln: 1.933 ± 0.294
3.487AspArg: 3.487 ± 0.423
2.881AspSer: 2.881 ± 0.324
3.373AspThr: 3.373 ± 0.384
4.018AspVal: 4.018 ± 0.453
1.365AspTrp: 1.365 ± 0.288
2.426AspTyr: 2.426 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
6.785GluAla: 6.785 ± 0.529
1.251GluCys: 1.251 ± 0.175
4.207GluAsp: 4.207 ± 0.383
6.444GluGlu: 6.444 ± 0.623
3.222GluPhe: 3.222 ± 0.334
4.738GluGly: 4.738 ± 0.425
1.137GluHis: 1.137 ± 0.226
3.601GluIle: 3.601 ± 0.362
3.487GluLys: 3.487 ± 0.375
6.254GluLeu: 6.254 ± 0.668
1.971GluMet: 1.971 ± 0.274
2.464GluAsn: 2.464 ± 0.308
1.706GluPro: 1.706 ± 0.252
2.691GluGln: 2.691 ± 0.358
3.942GluArg: 3.942 ± 0.382
3.79GluSer: 3.79 ± 0.414
3.26GluThr: 3.26 ± 0.36
5.003GluVal: 5.003 ± 0.498
1.592GluTrp: 1.592 ± 0.21
3.26GluTyr: 3.26 ± 0.325
0.038GluXaa: 0.038 ± 0.041
Phe
2.919PheAla: 2.919 ± 0.332
0.644PheCys: 0.644 ± 0.163
2.274PheAsp: 2.274 ± 0.339
2.994PheGlu: 2.994 ± 0.299
1.933PhePhe: 1.933 ± 0.291
2.843PheGly: 2.843 ± 0.297
1.061PheHis: 1.061 ± 0.199
2.198PheIle: 2.198 ± 0.301
3.032PheLys: 3.032 ± 0.382
3.487PheLeu: 3.487 ± 0.384
1.099PheMet: 1.099 ± 0.182
2.236PheAsn: 2.236 ± 0.238
1.706PhePro: 1.706 ± 0.221
1.478PheGln: 1.478 ± 0.209
2.16PheArg: 2.16 ± 0.286
2.919PheSer: 2.919 ± 0.351
2.919PheThr: 2.919 ± 0.336
3.184PheVal: 3.184 ± 0.318
0.796PheTrp: 0.796 ± 0.167
1.289PheTyr: 1.289 ± 0.234
0.038PheXaa: 0.038 ± 0.041
Gly
4.814GlyAla: 4.814 ± 0.489
1.099GlyCys: 1.099 ± 0.201
4.473GlyAsp: 4.473 ± 0.336
4.7GlyGlu: 4.7 ± 0.447
3.487GlyPhe: 3.487 ± 0.352
5.306GlyGly: 5.306 ± 0.584
1.365GlyHis: 1.365 ± 0.274
3.98GlyIle: 3.98 ± 0.471
3.942GlyLys: 3.942 ± 0.488
5.723GlyLeu: 5.723 ± 0.474
2.047GlyMet: 2.047 ± 0.294
2.919GlyAsn: 2.919 ± 0.352
1.478GlyPro: 1.478 ± 0.241
2.805GlyGln: 2.805 ± 0.34
4.056GlyArg: 4.056 ± 0.364
4.548GlySer: 4.548 ± 0.415
3.222GlyThr: 3.222 ± 0.372
5.572GlyVal: 5.572 ± 0.421
1.402GlyTrp: 1.402 ± 0.205
3.07GlyTyr: 3.07 ± 0.316
0.0GlyXaa: 0.0 ± 0.0
His
1.744HisAla: 1.744 ± 0.262
0.341HisCys: 0.341 ± 0.13
0.985HisAsp: 0.985 ± 0.167
0.872HisGlu: 0.872 ± 0.184
1.251HisPhe: 1.251 ± 0.27
1.478HisGly: 1.478 ± 0.326
0.379HisHis: 0.379 ± 0.137
1.365HisIle: 1.365 ± 0.21
1.251HisLys: 1.251 ± 0.251
1.781HisLeu: 1.781 ± 0.279
0.606HisMet: 0.606 ± 0.121
0.872HisAsn: 0.872 ± 0.191
1.061HisPro: 1.061 ± 0.19
0.644HisGln: 0.644 ± 0.154
1.137HisArg: 1.137 ± 0.201
1.213HisSer: 1.213 ± 0.24
1.44HisThr: 1.44 ± 0.24
1.251HisVal: 1.251 ± 0.192
0.379HisTrp: 0.379 ± 0.129
0.796HisTyr: 0.796 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
4.662IleAla: 4.662 ± 0.41
0.455IleCys: 0.455 ± 0.139
3.828IleAsp: 3.828 ± 0.352
3.752IleGlu: 3.752 ± 0.382
1.857IlePhe: 1.857 ± 0.241
3.866IleGly: 3.866 ± 0.383
1.706IleHis: 1.706 ± 0.233
2.312IleIle: 2.312 ± 0.308
3.26IleLys: 3.26 ± 0.365
4.662IleLeu: 4.662 ± 0.448
1.402IleMet: 1.402 ± 0.231
2.388IleAsn: 2.388 ± 0.32
3.108IlePro: 3.108 ± 0.337
1.971IleGln: 1.971 ± 0.276
3.79IleArg: 3.79 ± 0.438
3.677IleSer: 3.677 ± 0.389
2.577IleThr: 2.577 ± 0.35
3.98IleVal: 3.98 ± 0.379
0.796IleTrp: 0.796 ± 0.2
1.402IleTyr: 1.402 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
5.685LysAla: 5.685 ± 0.619
0.72LysCys: 0.72 ± 0.165
3.411LysAsp: 3.411 ± 0.373
5.041LysGlu: 5.041 ± 0.561
1.744LysPhe: 1.744 ± 0.293
4.056LysGly: 4.056 ± 0.418
0.985LysHis: 0.985 ± 0.203
3.222LysIle: 3.222 ± 0.347
3.26LysLys: 3.26 ± 0.384
4.927LysLeu: 4.927 ± 0.398
1.516LysMet: 1.516 ± 0.242
2.085LysAsn: 2.085 ± 0.29
1.706LysPro: 1.706 ± 0.285
1.744LysGln: 1.744 ± 0.231
3.108LysArg: 3.108 ± 0.425
3.373LysSer: 3.373 ± 0.353
2.881LysThr: 2.881 ± 0.254
4.056LysVal: 4.056 ± 0.423
1.061LysTrp: 1.061 ± 0.217
2.085LysTyr: 2.085 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
6.254LeuAla: 6.254 ± 0.568
0.91LeuCys: 0.91 ± 0.177
5.723LeuAsp: 5.723 ± 0.472
5.837LeuGlu: 5.837 ± 0.543
3.26LeuPhe: 3.26 ± 0.378
5.913LeuGly: 5.913 ± 0.509
2.047LeuHis: 2.047 ± 0.297
4.51LeuIle: 4.51 ± 0.444
5.344LeuLys: 5.344 ± 0.507
6.254LeuLeu: 6.254 ± 0.563
2.54LeuMet: 2.54 ± 0.284
3.26LeuAsn: 3.26 ± 0.326
4.018LeuPro: 4.018 ± 0.34
2.994LeuGln: 2.994 ± 0.296
4.7LeuArg: 4.7 ± 0.405
5.799LeuSer: 5.799 ± 0.611
4.927LeuThr: 4.927 ± 0.4
5.534LeuVal: 5.534 ± 0.531
1.289LeuTrp: 1.289 ± 0.178
3.108LeuTyr: 3.108 ± 0.331
0.038LeuXaa: 0.038 ± 0.039
Met
3.222MetAla: 3.222 ± 0.384
0.417MetCys: 0.417 ± 0.135
1.516MetAsp: 1.516 ± 0.276
2.426MetGlu: 2.426 ± 0.298
0.872MetPhe: 0.872 ± 0.192
1.895MetGly: 1.895 ± 0.286
0.227MetHis: 0.227 ± 0.09
1.365MetIle: 1.365 ± 0.209
2.047MetLys: 2.047 ± 0.245
2.388MetLeu: 2.388 ± 0.324
1.251MetMet: 1.251 ± 0.251
1.137MetAsn: 1.137 ± 0.224
1.289MetPro: 1.289 ± 0.213
1.365MetGln: 1.365 ± 0.265
1.365MetArg: 1.365 ± 0.209
1.857MetSer: 1.857 ± 0.198
2.085MetThr: 2.085 ± 0.283
1.554MetVal: 1.554 ± 0.241
0.417MetTrp: 0.417 ± 0.113
0.985MetTyr: 0.985 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.298AsnAla: 3.298 ± 0.47
0.265AsnCys: 0.265 ± 0.092
2.009AsnAsp: 2.009 ± 0.233
1.971AsnGlu: 1.971 ± 0.277
1.781AsnPhe: 1.781 ± 0.281
3.411AsnGly: 3.411 ± 0.407
0.91AsnHis: 0.91 ± 0.178
3.07AsnIle: 3.07 ± 0.339
2.123AsnLys: 2.123 ± 0.247
4.056AsnLeu: 4.056 ± 0.446
1.099AsnMet: 1.099 ± 0.205
1.781AsnAsn: 1.781 ± 0.25
1.706AsnPro: 1.706 ± 0.269
1.137AsnGln: 1.137 ± 0.298
1.857AsnArg: 1.857 ± 0.237
2.843AsnSer: 2.843 ± 0.394
2.236AsnThr: 2.236 ± 0.325
3.146AsnVal: 3.146 ± 0.382
0.606AsnTrp: 0.606 ± 0.171
1.289AsnTyr: 1.289 ± 0.223
0.0AsnXaa: 0.0 ± 0.0
Pro
3.184ProAla: 3.184 ± 0.322
0.644ProCys: 0.644 ± 0.16
2.16ProAsp: 2.16 ± 0.368
3.904ProGlu: 3.904 ± 0.344
1.365ProPhe: 1.365 ± 0.23
2.54ProGly: 2.54 ± 0.338
1.061ProHis: 1.061 ± 0.24
1.895ProIle: 1.895 ± 0.297
1.706ProLys: 1.706 ± 0.264
2.919ProLeu: 2.919 ± 0.335
1.023ProMet: 1.023 ± 0.204
1.213ProAsn: 1.213 ± 0.22
1.516ProPro: 1.516 ± 0.249
0.985ProGln: 0.985 ± 0.199
1.895ProArg: 1.895 ± 0.261
2.919ProSer: 2.919 ± 0.355
2.577ProThr: 2.577 ± 0.349
3.563ProVal: 3.563 ± 0.335
0.303ProTrp: 0.303 ± 0.099
1.213ProTyr: 1.213 ± 0.205
0.0ProXaa: 0.0 ± 0.0
Gln
3.146GlnAla: 3.146 ± 0.336
0.455GlnCys: 0.455 ± 0.143
1.706GlnAsp: 1.706 ± 0.261
2.388GlnGlu: 2.388 ± 0.28
1.706GlnPhe: 1.706 ± 0.224
2.123GlnGly: 2.123 ± 0.318
1.023GlnHis: 1.023 ± 0.199
2.236GlnIle: 2.236 ± 0.301
1.402GlnLys: 1.402 ± 0.239
3.032GlnLeu: 3.032 ± 0.321
1.478GlnMet: 1.478 ± 0.275
1.099GlnAsn: 1.099 ± 0.19
0.834GlnPro: 0.834 ± 0.2
0.834GlnGln: 0.834 ± 0.157
2.236GlnArg: 2.236 ± 0.255
2.236GlnSer: 2.236 ± 0.293
2.085GlnThr: 2.085 ± 0.262
3.032GlnVal: 3.032 ± 0.316
0.606GlnTrp: 0.606 ± 0.15
1.365GlnTyr: 1.365 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
4.283ArgAla: 4.283 ± 0.419
0.796ArgCys: 0.796 ± 0.156
3.146ArgAsp: 3.146 ± 0.369
3.828ArgGlu: 3.828 ± 0.312
2.881ArgPhe: 2.881 ± 0.356
4.321ArgGly: 4.321 ± 0.49
1.099ArgHis: 1.099 ± 0.186
2.729ArgIle: 2.729 ± 0.272
3.942ArgLys: 3.942 ± 0.379
5.685ArgLeu: 5.685 ± 0.463
1.592ArgMet: 1.592 ± 0.245
2.047ArgAsn: 2.047 ± 0.279
2.426ArgPro: 2.426 ± 0.27
2.35ArgGln: 2.35 ± 0.298
3.828ArgArg: 3.828 ± 0.46
3.146ArgSer: 3.146 ± 0.338
2.956ArgThr: 2.956 ± 0.307
4.056ArgVal: 4.056 ± 0.31
0.985ArgTrp: 0.985 ± 0.234
1.857ArgTyr: 1.857 ± 0.239
0.0ArgXaa: 0.0 ± 0.0
Ser
4.359SerAla: 4.359 ± 0.415
0.834SerCys: 0.834 ± 0.184
3.184SerAsp: 3.184 ± 0.372
3.942SerGlu: 3.942 ± 0.35
3.222SerPhe: 3.222 ± 0.359
4.435SerGly: 4.435 ± 0.451
1.402SerHis: 1.402 ± 0.269
2.956SerIle: 2.956 ± 0.308
3.184SerLys: 3.184 ± 0.36
5.648SerLeu: 5.648 ± 0.583
1.592SerMet: 1.592 ± 0.255
2.729SerAsn: 2.729 ± 0.37
2.464SerPro: 2.464 ± 0.292
1.933SerGln: 1.933 ± 0.299
3.866SerArg: 3.866 ± 0.413
3.79SerSer: 3.79 ± 0.487
3.715SerThr: 3.715 ± 0.372
5.534SerVal: 5.534 ± 0.43
0.834SerTrp: 0.834 ± 0.167
1.971SerTyr: 1.971 ± 0.232
0.0SerXaa: 0.0 ± 0.0
Thr
4.51ThrAla: 4.51 ± 0.436
0.72ThrCys: 0.72 ± 0.157
3.07ThrAsp: 3.07 ± 0.383
4.359ThrGlu: 4.359 ± 0.417
2.843ThrPhe: 2.843 ± 0.287
4.586ThrGly: 4.586 ± 0.517
1.061ThrHis: 1.061 ± 0.182
3.335ThrIle: 3.335 ± 0.316
2.502ThrLys: 2.502 ± 0.301
5.306ThrLeu: 5.306 ± 0.48
1.175ThrMet: 1.175 ± 0.206
2.236ThrAsn: 2.236 ± 0.294
2.312ThrPro: 2.312 ± 0.279
2.009ThrGln: 2.009 ± 0.283
2.994ThrArg: 2.994 ± 0.335
3.373ThrSer: 3.373 ± 0.363
2.994ThrThr: 2.994 ± 0.352
4.738ThrVal: 4.738 ± 0.478
0.758ThrTrp: 0.758 ± 0.153
1.971ThrTyr: 1.971 ± 0.251
0.0ThrXaa: 0.0 ± 0.0
Val
6.216ValAla: 6.216 ± 0.513
0.872ValCys: 0.872 ± 0.202
4.359ValAsp: 4.359 ± 0.443
5.155ValGlu: 5.155 ± 0.558
3.335ValPhe: 3.335 ± 0.339
4.435ValGly: 4.435 ± 0.457
1.327ValHis: 1.327 ± 0.27
4.397ValIle: 4.397 ± 0.448
4.018ValLys: 4.018 ± 0.39
6.102ValLeu: 6.102 ± 0.503
2.426ValMet: 2.426 ± 0.308
3.373ValAsn: 3.373 ± 0.337
3.108ValPro: 3.108 ± 0.456
2.312ValGln: 2.312 ± 0.258
3.98ValArg: 3.98 ± 0.36
4.89ValSer: 4.89 ± 0.396
4.435ValThr: 4.435 ± 0.398
6.065ValVal: 6.065 ± 0.674
1.251ValTrp: 1.251 ± 0.245
2.464ValTyr: 2.464 ± 0.311
0.0ValXaa: 0.0 ± 0.0
Trp
0.91TrpAla: 0.91 ± 0.182
0.379TrpCys: 0.379 ± 0.124
1.023TrpAsp: 1.023 ± 0.181
1.402TrpGlu: 1.402 ± 0.22
0.758TrpPhe: 0.758 ± 0.178
0.872TrpGly: 0.872 ± 0.192
0.341TrpHis: 0.341 ± 0.119
0.796TrpIle: 0.796 ± 0.18
0.985TrpLys: 0.985 ± 0.2
1.365TrpLeu: 1.365 ± 0.203
0.606TrpMet: 0.606 ± 0.15
0.682TrpAsn: 0.682 ± 0.156
0.644TrpPro: 0.644 ± 0.153
0.493TrpGln: 0.493 ± 0.116
1.289TrpArg: 1.289 ± 0.245
0.872TrpSer: 0.872 ± 0.198
0.91TrpThr: 0.91 ± 0.179
1.402TrpVal: 1.402 ± 0.235
0.341TrpTrp: 0.341 ± 0.11
0.569TrpTyr: 0.569 ± 0.142
0.038TrpXaa: 0.038 ± 0.033
Tyr
2.729TyrAla: 2.729 ± 0.365
0.72TyrCys: 0.72 ± 0.162
2.123TyrAsp: 2.123 ± 0.27
2.047TyrGlu: 2.047 ± 0.354
1.63TyrPhe: 1.63 ± 0.217
2.577TyrGly: 2.577 ± 0.354
0.531TyrHis: 0.531 ± 0.165
2.312TyrIle: 2.312 ± 0.319
2.085TyrLys: 2.085 ± 0.273
2.35TyrLeu: 2.35 ± 0.277
1.137TyrMet: 1.137 ± 0.196
1.895TyrAsn: 1.895 ± 0.249
1.213TyrPro: 1.213 ± 0.193
1.327TyrGln: 1.327 ± 0.227
2.009TyrArg: 2.009 ± 0.247
2.35TyrSer: 2.35 ± 0.324
2.274TyrThr: 2.274 ± 0.313
2.54TyrVal: 2.54 ± 0.306
0.227TyrTrp: 0.227 ± 0.081
1.251TyrTyr: 1.251 ± 0.251
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.038XaaGly: 0.038 ± 0.041
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.076XaaLeu: 0.076 ± 0.05
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.038XaaPro: 0.038 ± 0.036
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.038XaaVal: 0.038 ± 0.039
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 145 proteins (26384 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski