Amino acid dipeptide frequency for Pseudomonas phage YMC11/06/C171_PPU_BP

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.
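As a rough illustration of the calculation described above, the sketch below counts overlapping dipeptides in a set of protein sequences, converts the counts to per mille, and compares each value with the uniform expectation of 1/400 (2.5‰). The function names and the toy sequences are hypothetical, and this is not the Proteome-pI code: mapping every non-standard letter to Xaa and normalizing by the total number of dipeptides are assumptions, since the exact procedure used by the database is not stated here.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000 / 400          # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are counted with overlap and never span two proteins.
    Any letter outside the 20 standard ones is mapped to 'X' (assumption).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency as over- or under-represented vs. 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked in blue
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAALAG", "MKAA"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))

# Converting a tabulated per mille value to a fraction (AlaAla from the table).
print(15.854 * 1e-3)
```

With real data, the input would be the protein sequences of the proteome; dipeptides that never occur are simply absent from the returned dictionary and correspond to a frequency of 0.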

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
15.854AlaAla: 15.854 ± 1.414
0.972AlaCys: 0.972 ± 0.256
6.581AlaAsp: 6.581 ± 0.769
5.235AlaGlu: 5.235 ± 0.791
3.365AlaPhe: 3.365 ± 0.512
9.722AlaGly: 9.722 ± 0.883
2.169AlaHis: 2.169 ± 0.392
4.711AlaIle: 4.711 ± 0.633
4.786AlaLys: 4.786 ± 0.693
11.666AlaLeu: 11.666 ± 1.157
3.141AlaMet: 3.141 ± 0.364
5.384AlaAsn: 5.384 ± 0.851
3.739AlaPro: 3.739 ± 0.483
5.534AlaGln: 5.534 ± 0.768
4.861AlaArg: 4.861 ± 0.725
6.656AlaSer: 6.656 ± 0.592
5.908AlaThr: 5.908 ± 0.59
8.151AlaVal: 8.151 ± 0.834
1.271AlaTrp: 1.271 ± 0.341
3.814AlaTyr: 3.814 ± 0.541
0.0AlaXaa: 0.0 ± 0.0
Cys
1.122CysAla: 1.122 ± 0.298
0.075CysCys: 0.075 ± 0.082
0.897CysAsp: 0.897 ± 0.234
0.374CysGlu: 0.374 ± 0.176
0.075CysPhe: 0.075 ± 0.084
0.897CysGly: 0.897 ± 0.246
0.598CysHis: 0.598 ± 0.215
0.224CysIle: 0.224 ± 0.122
0.224CysLys: 0.224 ± 0.137
0.972CysLeu: 0.972 ± 0.261
0.449CysMet: 0.449 ± 0.17
0.748CysAsn: 0.748 ± 0.257
0.598CysPro: 0.598 ± 0.204
0.299CysGln: 0.299 ± 0.137
0.673CysArg: 0.673 ± 0.252
0.598CysSer: 0.598 ± 0.226
0.897CysThr: 0.897 ± 0.236
0.374CysVal: 0.374 ± 0.157
0.15CysTrp: 0.15 ± 0.103
0.449CysTyr: 0.449 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
7.553AspAla: 7.553 ± 0.83
0.897AspCys: 0.897 ± 0.238
3.365AspAsp: 3.365 ± 0.499
4.637AspGlu: 4.637 ± 0.545
2.767AspPhe: 2.767 ± 0.411
5.684AspGly: 5.684 ± 0.831
0.972AspHis: 0.972 ± 0.347
3.515AspIle: 3.515 ± 0.402
2.617AspLys: 2.617 ± 0.331
5.609AspLeu: 5.609 ± 0.736
2.468AspMet: 2.468 ± 0.402
2.991AspAsn: 2.991 ± 0.346
2.617AspPro: 2.617 ± 0.379
1.421AspGln: 1.421 ± 0.251
2.243AspArg: 2.243 ± 0.489
3.59AspSer: 3.59 ± 0.689
4.113AspThr: 4.113 ± 0.333
4.188AspVal: 4.188 ± 0.702
1.421AspTrp: 1.421 ± 0.366
1.72AspTyr: 1.72 ± 0.329
0.0AspXaa: 0.0 ± 0.0
Glu
5.609GluAla: 5.609 ± 0.681
0.449GluCys: 0.449 ± 0.207
3.365GluAsp: 3.365 ± 0.477
3.814GluGlu: 3.814 ± 0.783
2.468GluPhe: 2.468 ± 0.425
4.412GluGly: 4.412 ± 0.494
1.197GluHis: 1.197 ± 0.325
1.795GluIle: 1.795 ± 0.274
1.795GluLys: 1.795 ± 0.393
4.711GluLeu: 4.711 ± 0.685
2.617GluMet: 2.617 ± 0.571
1.944GluAsn: 1.944 ± 0.331
2.169GluPro: 2.169 ± 0.376
2.767GluGln: 2.767 ± 0.459
3.365GluArg: 3.365 ± 0.577
2.767GluSer: 2.767 ± 0.425
2.767GluThr: 2.767 ± 0.38
3.29GluVal: 3.29 ± 0.681
0.673GluTrp: 0.673 ± 0.239
2.019GluTyr: 2.019 ± 0.295
0.0GluXaa: 0.0 ± 0.0
Phe
3.814PheAla: 3.814 ± 0.447
0.224PheCys: 0.224 ± 0.128
2.617PheAsp: 2.617 ± 0.396
2.318PheGlu: 2.318 ± 0.409
1.197PhePhe: 1.197 ± 0.414
2.393PheGly: 2.393 ± 0.445
0.299PheHis: 0.299 ± 0.146
1.346PheIle: 1.346 ± 0.276
1.197PheLys: 1.197 ± 0.293
2.917PheLeu: 2.917 ± 0.377
0.523PheMet: 0.523 ± 0.171
1.645PheAsn: 1.645 ± 0.362
1.57PhePro: 1.57 ± 0.307
1.795PheGln: 1.795 ± 0.435
1.944PheArg: 1.944 ± 0.299
1.795PheSer: 1.795 ± 0.286
2.169PheThr: 2.169 ± 0.404
2.842PheVal: 2.842 ± 0.417
0.224PheTrp: 0.224 ± 0.122
0.598PheTyr: 0.598 ± 0.182
0.0PheXaa: 0.0 ± 0.0
Gly
9.497GlyAla: 9.497 ± 0.947
1.122GlyCys: 1.122 ± 0.352
4.936GlyAsp: 4.936 ± 0.583
3.515GlyGlu: 3.515 ± 0.547
2.543GlyPhe: 2.543 ± 0.315
6.955GlyGly: 6.955 ± 0.857
1.047GlyHis: 1.047 ± 0.317
3.59GlyIle: 3.59 ± 0.596
3.814GlyLys: 3.814 ± 0.521
7.478GlyLeu: 7.478 ± 0.466
2.991GlyMet: 2.991 ± 0.512
3.664GlyAsn: 3.664 ± 0.605
2.617GlyPro: 2.617 ± 0.558
3.365GlyGln: 3.365 ± 0.467
5.31GlyArg: 5.31 ± 0.482
4.337GlySer: 4.337 ± 0.704
6.73GlyThr: 6.73 ± 1.207
5.684GlyVal: 5.684 ± 0.75
0.897GlyTrp: 0.897 ± 0.27
2.318GlyTyr: 2.318 ± 0.434
0.0GlyXaa: 0.0 ± 0.0
His
1.87HisAla: 1.87 ± 0.372
0.449HisCys: 0.449 ± 0.146
0.748HisAsp: 0.748 ± 0.26
1.197HisGlu: 1.197 ± 0.313
0.15HisPhe: 0.15 ± 0.108
1.57HisGly: 1.57 ± 0.371
0.299HisHis: 0.299 ± 0.131
1.047HisIle: 1.047 ± 0.241
0.598HisLys: 0.598 ± 0.231
1.496HisLeu: 1.496 ± 0.437
0.449HisMet: 0.449 ± 0.2
0.972HisAsn: 0.972 ± 0.265
1.421HisPro: 1.421 ± 0.274
0.673HisGln: 0.673 ± 0.189
1.346HisArg: 1.346 ± 0.394
0.748HisSer: 0.748 ± 0.203
0.673HisThr: 0.673 ± 0.21
1.197HisVal: 1.197 ± 0.383
0.598HisTrp: 0.598 ± 0.235
0.299HisTyr: 0.299 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
4.861IleAla: 4.861 ± 0.653
0.598IleCys: 0.598 ± 0.203
4.188IleAsp: 4.188 ± 0.638
3.365IleGlu: 3.365 ± 0.434
0.673IlePhe: 0.673 ± 0.228
2.917IleGly: 2.917 ± 0.526
0.374IleHis: 0.374 ± 0.181
1.72IleIle: 1.72 ± 0.327
1.72IleLys: 1.72 ± 0.384
3.141IleLeu: 3.141 ± 0.417
1.346IleMet: 1.346 ± 0.35
1.645IleAsn: 1.645 ± 0.407
2.243IlePro: 2.243 ± 0.46
1.944IleGln: 1.944 ± 0.442
2.692IleArg: 2.692 ± 0.495
2.169IleSer: 2.169 ± 0.488
3.59IleThr: 3.59 ± 0.598
2.169IleVal: 2.169 ± 0.476
0.374IleTrp: 0.374 ± 0.154
1.197IleTyr: 1.197 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
4.861LysAla: 4.861 ± 0.67
0.449LysCys: 0.449 ± 0.199
2.991LysAsp: 2.991 ± 0.597
2.318LysGlu: 2.318 ± 0.395
1.122LysPhe: 1.122 ± 0.345
2.692LysGly: 2.692 ± 0.52
0.075LysHis: 0.075 ± 0.078
1.87LysIle: 1.87 ± 0.281
1.645LysLys: 1.645 ± 0.484
4.188LysLeu: 4.188 ± 0.549
0.972LysMet: 0.972 ± 0.257
1.496LysAsn: 1.496 ± 0.337
2.393LysPro: 2.393 ± 0.514
1.57LysGln: 1.57 ± 0.374
2.243LysArg: 2.243 ± 0.479
2.468LysSer: 2.468 ± 0.456
2.169LysThr: 2.169 ± 0.417
2.543LysVal: 2.543 ± 0.401
0.673LysTrp: 0.673 ± 0.294
1.57LysTyr: 1.57 ± 0.415
0.0LysXaa: 0.0 ± 0.0
Leu
9.049LeuAla: 9.049 ± 0.96
0.972LeuCys: 0.972 ± 0.27
6.132LeuAsp: 6.132 ± 0.591
3.664LeuGlu: 3.664 ± 0.469
3.216LeuPhe: 3.216 ± 0.506
7.777LeuGly: 7.777 ± 1.06
1.496LeuHis: 1.496 ± 0.333
3.29LeuIle: 3.29 ± 0.473
4.038LeuLys: 4.038 ± 0.629
7.254LeuLeu: 7.254 ± 0.769
2.468LeuMet: 2.468 ± 0.378
3.216LeuAsn: 3.216 ± 0.536
4.861LeuPro: 4.861 ± 0.532
4.637LeuGln: 4.637 ± 0.621
6.282LeuArg: 6.282 ± 0.824
4.562LeuSer: 4.562 ± 0.572
5.758LeuThr: 5.758 ± 0.628
5.908LeuVal: 5.908 ± 0.767
0.972LeuTrp: 0.972 ± 0.215
2.543LeuTyr: 2.543 ± 0.39
0.0LeuXaa: 0.0 ± 0.0
Met
3.141MetAla: 3.141 ± 0.469
0.224MetCys: 0.224 ± 0.112
1.122MetAsp: 1.122 ± 0.279
0.972MetGlu: 0.972 ± 0.389
1.645MetPhe: 1.645 ± 0.386
1.87MetGly: 1.87 ± 0.413
0.598MetHis: 0.598 ± 0.215
0.972MetIle: 0.972 ± 0.29
0.897MetLys: 0.897 ± 0.278
2.991MetLeu: 2.991 ± 0.462
1.047MetMet: 1.047 ± 0.29
0.673MetAsn: 0.673 ± 0.25
1.795MetPro: 1.795 ± 0.392
0.972MetGln: 0.972 ± 0.242
2.169MetArg: 2.169 ± 0.341
2.543MetSer: 2.543 ± 0.429
1.795MetThr: 1.795 ± 0.486
1.421MetVal: 1.421 ± 0.309
0.449MetTrp: 0.449 ± 0.16
1.197MetTyr: 1.197 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
4.412AsnAla: 4.412 ± 0.849
0.374AsnCys: 0.374 ± 0.169
1.57AsnAsp: 1.57 ± 0.376
2.243AsnGlu: 2.243 ± 0.397
1.197AsnPhe: 1.197 ± 0.262
3.44AsnGly: 3.44 ± 0.593
0.598AsnHis: 0.598 ± 0.179
2.243AsnIle: 2.243 ± 0.515
2.318AsnLys: 2.318 ± 0.375
2.767AsnLeu: 2.767 ± 0.369
1.271AsnMet: 1.271 ± 0.345
1.57AsnAsn: 1.57 ± 0.269
1.795AsnPro: 1.795 ± 0.279
1.197AsnGln: 1.197 ± 0.244
1.87AsnArg: 1.87 ± 0.255
2.917AsnSer: 2.917 ± 0.544
3.29AsnThr: 3.29 ± 0.664
4.337AsnVal: 4.337 ± 0.558
0.972AsnTrp: 0.972 ± 0.286
0.673AsnTyr: 0.673 ± 0.278
0.0AsnXaa: 0.0 ± 0.0
Pro
4.337ProAla: 4.337 ± 0.422
0.299ProCys: 0.299 ± 0.146
4.263ProAsp: 4.263 ± 0.886
3.44ProGlu: 3.44 ± 0.425
0.897ProPhe: 0.897 ± 0.28
3.365ProGly: 3.365 ± 0.532
0.598ProHis: 0.598 ± 0.189
1.795ProIle: 1.795 ± 0.477
1.87ProLys: 1.87 ± 0.451
2.243ProLeu: 2.243 ± 0.312
0.897ProMet: 0.897 ± 0.307
1.271ProAsn: 1.271 ± 0.306
1.496ProPro: 1.496 ± 0.333
1.57ProGln: 1.57 ± 0.335
2.318ProArg: 2.318 ± 0.393
2.543ProSer: 2.543 ± 0.449
3.216ProThr: 3.216 ± 0.639
3.964ProVal: 3.964 ± 0.601
0.598ProTrp: 0.598 ± 0.24
1.496ProTyr: 1.496 ± 0.337
0.0ProXaa: 0.0 ± 0.0
Gln
4.637GlnAla: 4.637 ± 0.735
0.224GlnCys: 0.224 ± 0.111
1.87GlnAsp: 1.87 ± 0.466
2.692GlnGlu: 2.692 ± 0.509
1.496GlnPhe: 1.496 ± 0.352
3.889GlnGly: 3.889 ± 0.507
1.047GlnHis: 1.047 ± 0.281
1.795GlnIle: 1.795 ± 0.41
1.346GlnLys: 1.346 ± 0.306
3.739GlnLeu: 3.739 ± 0.637
1.346GlnMet: 1.346 ± 0.419
1.57GlnAsn: 1.57 ± 0.373
1.122GlnPro: 1.122 ± 0.302
2.842GlnGln: 2.842 ± 0.709
2.917GlnArg: 2.917 ± 0.632
2.543GlnSer: 2.543 ± 0.499
2.019GlnThr: 2.019 ± 0.449
3.44GlnVal: 3.44 ± 0.697
0.449GlnTrp: 0.449 ± 0.169
1.795GlnTyr: 1.795 ± 0.47
0.0GlnXaa: 0.0 ± 0.0
Arg
7.254ArgAla: 7.254 ± 1.444
0.748ArgCys: 0.748 ± 0.277
4.263ArgAsp: 4.263 ± 0.847
3.365ArgGlu: 3.365 ± 0.636
2.468ArgPhe: 2.468 ± 0.522
4.412ArgGly: 4.412 ± 0.643
1.346ArgHis: 1.346 ± 0.36
3.365ArgIle: 3.365 ± 0.458
1.87ArgLys: 1.87 ± 0.431
5.609ArgLeu: 5.609 ± 0.657
1.346ArgMet: 1.346 ± 0.359
2.094ArgAsn: 2.094 ± 0.386
2.094ArgPro: 2.094 ± 0.365
2.318ArgGln: 2.318 ± 0.473
4.337ArgArg: 4.337 ± 0.62
3.216ArgSer: 3.216 ± 0.586
3.066ArgThr: 3.066 ± 0.658
3.59ArgVal: 3.59 ± 0.57
1.346ArgTrp: 1.346 ± 0.329
1.496ArgTyr: 1.496 ± 0.393
0.0ArgXaa: 0.0 ± 0.0
Ser
6.282SerAla: 6.282 ± 0.56
0.374SerCys: 0.374 ± 0.156
3.59SerAsp: 3.59 ± 0.507
1.72SerGlu: 1.72 ± 0.32
2.019SerPhe: 2.019 ± 0.368
6.805SerGly: 6.805 ± 1.079
0.823SerHis: 0.823 ± 0.224
2.543SerIle: 2.543 ± 0.463
2.393SerLys: 2.393 ± 0.382
6.057SerLeu: 6.057 ± 0.436
1.87SerMet: 1.87 ± 0.365
2.094SerAsn: 2.094 ± 0.405
1.496SerPro: 1.496 ± 0.23
1.944SerGln: 1.944 ± 0.328
3.44SerArg: 3.44 ± 0.513
3.515SerSer: 3.515 ± 0.518
4.113SerThr: 4.113 ± 0.624
4.188SerVal: 4.188 ± 0.621
0.972SerTrp: 0.972 ± 0.301
1.87SerTyr: 1.87 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
7.553ThrAla: 7.553 ± 1.02
0.673ThrCys: 0.673 ± 0.239
3.964ThrAsp: 3.964 ± 0.51
2.917ThrGlu: 2.917 ± 0.447
2.169ThrPhe: 2.169 ± 0.469
6.581ThrGly: 6.581 ± 1.012
0.897ThrHis: 0.897 ± 0.246
2.019ThrIle: 2.019 ± 0.471
2.468ThrLys: 2.468 ± 0.369
5.384ThrLeu: 5.384 ± 0.513
0.748ThrMet: 0.748 ± 0.298
2.543ThrAsn: 2.543 ± 0.529
3.141ThrPro: 3.141 ± 0.449
2.468ThrGln: 2.468 ± 0.644
3.59ThrArg: 3.59 ± 0.523
4.337ThrSer: 4.337 ± 0.647
3.59ThrThr: 3.59 ± 0.861
5.758ThrVal: 5.758 ± 0.903
1.047ThrTrp: 1.047 ± 0.304
1.795ThrTyr: 1.795 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
8.002ValAla: 8.002 ± 0.8
0.823ValCys: 0.823 ± 0.246
4.861ValAsp: 4.861 ± 0.531
3.216ValGlu: 3.216 ± 0.507
2.169ValPhe: 2.169 ± 0.372
3.964ValGly: 3.964 ± 0.537
1.72ValHis: 1.72 ± 0.375
3.515ValIle: 3.515 ± 0.481
3.066ValLys: 3.066 ± 0.733
5.758ValLeu: 5.758 ± 0.697
1.496ValMet: 1.496 ± 0.247
3.515ValAsn: 3.515 ± 0.52
3.365ValPro: 3.365 ± 0.523
3.664ValGln: 3.664 ± 0.636
4.861ValArg: 4.861 ± 0.572
3.44ValSer: 3.44 ± 0.474
5.085ValThr: 5.085 ± 1.015
4.188ValVal: 4.188 ± 0.74
0.972ValTrp: 0.972 ± 0.227
2.767ValTyr: 2.767 ± 0.52
0.0ValXaa: 0.0 ± 0.0
Trp
0.897TrpAla: 0.897 ± 0.298
0.224TrpCys: 0.224 ± 0.129
0.972TrpAsp: 0.972 ± 0.251
1.122TrpGlu: 1.122 ± 0.379
0.673TrpPhe: 0.673 ± 0.352
0.823TrpGly: 0.823 ± 0.219
0.374TrpHis: 0.374 ± 0.176
0.449TrpIle: 0.449 ± 0.217
0.673TrpLys: 0.673 ± 0.229
1.645TrpLeu: 1.645 ± 0.322
0.299TrpMet: 0.299 ± 0.141
0.748TrpAsn: 0.748 ± 0.239
0.673TrpPro: 0.673 ± 0.214
0.523TrpGln: 0.523 ± 0.223
1.122TrpArg: 1.122 ± 0.263
0.673TrpSer: 0.673 ± 0.211
0.972TrpThr: 0.972 ± 0.264
1.122TrpVal: 1.122 ± 0.382
0.15TrpTrp: 0.15 ± 0.124
0.823TrpTyr: 0.823 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.917TyrAla: 2.917 ± 0.433
0.523TyrCys: 0.523 ± 0.185
2.094TyrAsp: 2.094 ± 0.391
1.87TyrGlu: 1.87 ± 0.329
1.122TyrPhe: 1.122 ± 0.341
2.243TyrGly: 2.243 ± 0.41
1.271TyrHis: 1.271 ± 0.279
1.122TyrIle: 1.122 ± 0.248
0.972TyrLys: 0.972 ± 0.303
2.468TyrLeu: 2.468 ± 0.684
0.748TyrMet: 0.748 ± 0.215
1.271TyrAsn: 1.271 ± 0.309
1.271TyrPro: 1.271 ± 0.298
1.271TyrGln: 1.271 ± 0.403
2.019TyrArg: 2.019 ± 0.4
2.692TyrSer: 2.692 ± 0.396
1.72TyrThr: 1.72 ± 0.385
2.094TyrVal: 2.094 ± 0.347
0.748TyrTrp: 0.748 ± 0.21
0.449TyrTyr: 0.449 ± 0.147
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13373 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
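A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The helper names are hypothetical, and using the sample standard deviation as the error measure is an assumption; the note above does not specify which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues within one protein are never shuffled or split.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MAALAG", "MKAA", "GGGG"]
print(dipeptide_frequency(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptide context of each sequence intact, which is presumably why the error is estimated at the protein level.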

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski