Amino acid dipepetide frequency for Pseudomonas phage TC7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.602AlaAla: 17.602 ± 1.883
1.421AlaCys: 1.421 ± 0.336
6.946AlaAsp: 6.946 ± 0.855
8.13AlaGlu: 8.13 ± 0.971
3.947AlaPhe: 3.947 ± 0.515
10.893AlaGly: 10.893 ± 1.329
2.21AlaHis: 2.21 ± 0.427
5.683AlaIle: 5.683 ± 0.704
5.288AlaLys: 5.288 ± 0.887
12.314AlaLeu: 12.314 ± 1.383
3.157AlaMet: 3.157 ± 0.477
2.763AlaAsn: 2.763 ± 0.502
6.157AlaPro: 6.157 ± 0.805
5.367AlaGln: 5.367 ± 0.899
8.367AlaArg: 8.367 ± 0.846
6.946AlaSer: 6.946 ± 1.113
5.762AlaThr: 5.762 ± 1.046
7.972AlaVal: 7.972 ± 0.767
2.289AlaTrp: 2.289 ± 0.399
2.605AlaTyr: 2.605 ± 0.452
0.0AlaXaa: 0.0 ± 0.0
Cys
1.184CysAla: 1.184 ± 0.29
0.237CysCys: 0.237 ± 0.126
0.631CysAsp: 0.631 ± 0.214
1.263CysGlu: 1.263 ± 0.38
0.395CysPhe: 0.395 ± 0.213
1.579CysGly: 1.579 ± 0.397
0.316CysHis: 0.316 ± 0.168
0.71CysIle: 0.71 ± 0.28
0.553CysLys: 0.553 ± 0.204
0.553CysLeu: 0.553 ± 0.205
0.079CysMet: 0.079 ± 0.077
0.395CysAsn: 0.395 ± 0.185
1.105CysPro: 1.105 ± 0.342
0.158CysGln: 0.158 ± 0.095
1.105CysArg: 1.105 ± 0.304
0.789CysSer: 0.789 ± 0.214
0.631CysThr: 0.631 ± 0.227
0.868CysVal: 0.868 ± 0.301
0.631CysTrp: 0.631 ± 0.262
0.316CysTyr: 0.316 ± 0.215
0.0CysXaa: 0.0 ± 0.0
Asp
6.157AspAla: 6.157 ± 0.631
0.789AspCys: 0.789 ± 0.239
3.157AspAsp: 3.157 ± 0.559
2.999AspGlu: 2.999 ± 0.574
1.421AspPhe: 1.421 ± 0.305
6.551AspGly: 6.551 ± 0.583
0.947AspHis: 0.947 ± 0.315
1.973AspIle: 1.973 ± 0.302
1.894AspLys: 1.894 ± 0.382
5.525AspLeu: 5.525 ± 0.852
0.868AspMet: 0.868 ± 0.224
0.71AspAsn: 0.71 ± 0.237
3.236AspPro: 3.236 ± 0.546
2.289AspGln: 2.289 ± 0.417
2.921AspArg: 2.921 ± 0.438
1.973AspSer: 1.973 ± 0.391
1.894AspThr: 1.894 ± 0.341
3.473AspVal: 3.473 ± 0.473
1.263AspTrp: 1.263 ± 0.304
1.5AspTyr: 1.5 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
7.578GluAla: 7.578 ± 0.91
1.184GluCys: 1.184 ± 0.347
2.052GluAsp: 2.052 ± 0.395
3.71GluGlu: 3.71 ± 0.517
2.763GluPhe: 2.763 ± 0.485
4.499GluGly: 4.499 ± 0.688
0.631GluHis: 0.631 ± 0.192
3.236GluIle: 3.236 ± 0.398
2.921GluLys: 2.921 ± 0.417
4.499GluLeu: 4.499 ± 0.682
1.342GluMet: 1.342 ± 0.31
2.447GluAsn: 2.447 ± 0.45
3.947GluPro: 3.947 ± 0.637
4.341GluGln: 4.341 ± 0.582
6.078GluArg: 6.078 ± 0.814
2.684GluSer: 2.684 ± 0.558
2.605GluThr: 2.605 ± 0.4
4.026GluVal: 4.026 ± 0.584
1.105GluTrp: 1.105 ± 0.31
1.263GluTyr: 1.263 ± 0.337
0.0GluXaa: 0.0 ± 0.0
Phe
2.763PheAla: 2.763 ± 0.427
0.237PheCys: 0.237 ± 0.169
1.737PheAsp: 1.737 ± 0.365
2.21PheGlu: 2.21 ± 0.373
1.342PhePhe: 1.342 ± 0.318
3.315PheGly: 3.315 ± 0.634
0.631PheHis: 0.631 ± 0.227
1.263PheIle: 1.263 ± 0.326
1.105PheLys: 1.105 ± 0.285
1.658PheLeu: 1.658 ± 0.333
0.947PheMet: 0.947 ± 0.306
1.5PheAsn: 1.5 ± 0.404
1.658PhePro: 1.658 ± 0.329
0.395PheGln: 0.395 ± 0.174
1.658PheArg: 1.658 ± 0.372
1.658PheSer: 1.658 ± 0.346
1.894PheThr: 1.894 ± 0.359
2.447PheVal: 2.447 ± 0.455
0.631PheTrp: 0.631 ± 0.199
1.105PheTyr: 1.105 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
9.156GlyAla: 9.156 ± 0.945
1.105GlyCys: 1.105 ± 0.391
4.341GlyAsp: 4.341 ± 0.509
5.131GlyGlu: 5.131 ± 0.611
2.684GlyPhe: 2.684 ± 0.458
6.867GlyGly: 6.867 ± 0.993
1.5GlyHis: 1.5 ± 0.33
3.236GlyIle: 3.236 ± 0.588
3.473GlyLys: 3.473 ± 0.463
6.63GlyLeu: 6.63 ± 0.67
3.078GlyMet: 3.078 ± 0.424
2.447GlyAsn: 2.447 ± 0.565
2.605GlyPro: 2.605 ± 0.401
3.631GlyGln: 3.631 ± 0.641
5.446GlyArg: 5.446 ± 0.737
4.183GlySer: 4.183 ± 0.616
4.973GlyThr: 4.973 ± 0.688
5.21GlyVal: 5.21 ± 0.713
1.5GlyTrp: 1.5 ± 0.333
2.131GlyTyr: 2.131 ± 0.421
0.0GlyXaa: 0.0 ± 0.0
His
1.737HisAla: 1.737 ± 0.331
0.158HisCys: 0.158 ± 0.116
0.789HisAsp: 0.789 ± 0.234
1.342HisGlu: 1.342 ± 0.275
0.395HisPhe: 0.395 ± 0.158
1.579HisGly: 1.579 ± 0.522
0.474HisHis: 0.474 ± 0.212
0.868HisIle: 0.868 ± 0.247
0.71HisLys: 0.71 ± 0.25
1.5HisLeu: 1.5 ± 0.35
0.789HisMet: 0.789 ± 0.223
0.395HisAsn: 0.395 ± 0.14
1.579HisPro: 1.579 ± 0.348
0.789HisGln: 0.789 ± 0.219
0.868HisArg: 0.868 ± 0.295
0.553HisSer: 0.553 ± 0.207
0.789HisThr: 0.789 ± 0.306
1.184HisVal: 1.184 ± 0.318
0.316HisTrp: 0.316 ± 0.147
0.553HisTyr: 0.553 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
4.499IleAla: 4.499 ± 0.557
0.631IleCys: 0.631 ± 0.216
2.921IleAsp: 2.921 ± 0.424
3.789IleGlu: 3.789 ± 0.598
0.947IlePhe: 0.947 ± 0.248
2.842IleGly: 2.842 ± 0.544
1.026IleHis: 1.026 ± 0.31
0.947IleIle: 0.947 ± 0.253
1.658IleLys: 1.658 ± 0.329
2.605IleLeu: 2.605 ± 0.461
0.71IleMet: 0.71 ± 0.236
1.658IleAsn: 1.658 ± 0.449
2.131IlePro: 2.131 ± 0.392
1.737IleGln: 1.737 ± 0.388
3.631IleArg: 3.631 ± 0.543
2.763IleSer: 2.763 ± 0.503
2.921IleThr: 2.921 ± 0.474
2.684IleVal: 2.684 ± 0.393
0.316IleTrp: 0.316 ± 0.187
1.894IleTyr: 1.894 ± 0.372
0.0IleXaa: 0.0 ± 0.0
Lys
6.472LysAla: 6.472 ± 0.775
0.474LysCys: 0.474 ± 0.184
1.105LysAsp: 1.105 ± 0.26
2.131LysGlu: 2.131 ± 0.472
0.947LysPhe: 0.947 ± 0.261
1.737LysGly: 1.737 ± 0.371
0.631LysHis: 0.631 ± 0.265
1.658LysIle: 1.658 ± 0.39
2.447LysLys: 2.447 ± 0.512
3.71LysLeu: 3.71 ± 0.56
0.474LysMet: 0.474 ± 0.177
0.631LysAsn: 0.631 ± 0.223
1.894LysPro: 1.894 ± 0.376
3.157LysGln: 3.157 ± 0.505
3.315LysArg: 3.315 ± 0.594
2.526LysSer: 2.526 ± 0.535
1.263LysThr: 1.263 ± 0.306
3.394LysVal: 3.394 ± 0.505
1.026LysTrp: 1.026 ± 0.303
0.789LysTyr: 0.789 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
11.366LeuAla: 11.366 ± 1.246
0.631LeuCys: 0.631 ± 0.211
4.341LeuAsp: 4.341 ± 0.484
5.683LeuGlu: 5.683 ± 0.586
2.447LeuPhe: 2.447 ± 0.377
4.578LeuGly: 4.578 ± 0.683
1.421LeuHis: 1.421 ± 0.341
3.394LeuIle: 3.394 ± 0.412
4.578LeuLys: 4.578 ± 0.698
6.236LeuLeu: 6.236 ± 0.773
1.973LeuMet: 1.973 ± 0.39
2.921LeuAsn: 2.921 ± 0.433
5.052LeuPro: 5.052 ± 0.689
4.026LeuGln: 4.026 ± 0.899
6.551LeuArg: 6.551 ± 0.812
5.367LeuSer: 5.367 ± 0.595
4.578LeuThr: 4.578 ± 0.605
5.999LeuVal: 5.999 ± 0.533
0.71LeuTrp: 0.71 ± 0.252
2.052LeuTyr: 2.052 ± 0.421
0.0LeuXaa: 0.0 ± 0.0
Met
2.21MetAla: 2.21 ± 0.421
0.316MetCys: 0.316 ± 0.137
1.184MetAsp: 1.184 ± 0.298
0.868MetGlu: 0.868 ± 0.283
0.474MetPhe: 0.474 ± 0.188
1.5MetGly: 1.5 ± 0.333
1.105MetHis: 1.105 ± 0.23
1.184MetIle: 1.184 ± 0.346
0.947MetLys: 0.947 ± 0.265
2.052MetLeu: 2.052 ± 0.347
0.237MetMet: 0.237 ± 0.13
1.263MetAsn: 1.263 ± 0.262
1.342MetPro: 1.342 ± 0.26
1.184MetGln: 1.184 ± 0.298
1.894MetArg: 1.894 ± 0.389
1.421MetSer: 1.421 ± 0.281
1.658MetThr: 1.658 ± 0.318
1.737MetVal: 1.737 ± 0.46
0.237MetTrp: 0.237 ± 0.14
0.237MetTyr: 0.237 ± 0.113
0.0MetXaa: 0.0 ± 0.0
Asn
3.473AsnAla: 3.473 ± 0.461
0.237AsnCys: 0.237 ± 0.129
1.894AsnAsp: 1.894 ± 0.366
1.658AsnGlu: 1.658 ± 0.343
0.395AsnPhe: 0.395 ± 0.183
3.315AsnGly: 3.315 ± 0.621
0.395AsnHis: 0.395 ± 0.25
1.105AsnIle: 1.105 ± 0.291
0.553AsnLys: 0.553 ± 0.228
2.21AsnLeu: 2.21 ± 0.428
0.395AsnMet: 0.395 ± 0.183
0.553AsnAsn: 0.553 ± 0.273
2.131AsnPro: 2.131 ± 0.527
1.026AsnGln: 1.026 ± 0.278
2.763AsnArg: 2.763 ± 0.726
1.973AsnSer: 1.973 ± 0.392
2.368AsnThr: 2.368 ± 0.651
1.815AsnVal: 1.815 ± 0.395
0.631AsnTrp: 0.631 ± 0.278
0.553AsnTyr: 0.553 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
8.367ProAla: 8.367 ± 1.277
1.184ProCys: 1.184 ± 0.367
3.71ProAsp: 3.71 ± 0.611
4.183ProGlu: 4.183 ± 0.538
1.658ProPhe: 1.658 ± 0.375
4.262ProGly: 4.262 ± 0.544
0.395ProHis: 0.395 ± 0.173
2.447ProIle: 2.447 ± 0.514
1.421ProLys: 1.421 ± 0.274
4.105ProLeu: 4.105 ± 0.546
1.263ProMet: 1.263 ± 0.219
1.5ProAsn: 1.5 ± 0.354
3.473ProPro: 3.473 ± 0.641
1.737ProGln: 1.737 ± 0.351
3.631ProArg: 3.631 ± 0.506
4.026ProSer: 4.026 ± 0.664
3.473ProThr: 3.473 ± 0.541
3.473ProVal: 3.473 ± 0.543
0.947ProTrp: 0.947 ± 0.251
1.184ProTyr: 1.184 ± 0.359
0.0ProXaa: 0.0 ± 0.0
Gln
7.104GlnAla: 7.104 ± 1.121
0.553GlnCys: 0.553 ± 0.229
1.579GlnAsp: 1.579 ± 0.385
2.605GlnGlu: 2.605 ± 0.496
1.658GlnPhe: 1.658 ± 0.29
3.631GlnGly: 3.631 ± 0.544
0.868GlnHis: 0.868 ± 0.277
2.21GlnIle: 2.21 ± 0.427
0.868GlnLys: 0.868 ± 0.249
4.815GlnLeu: 4.815 ± 0.751
1.342GlnMet: 1.342 ± 0.323
0.631GlnAsn: 0.631 ± 0.273
2.368GlnPro: 2.368 ± 0.471
2.526GlnGln: 2.526 ± 0.514
3.315GlnArg: 3.315 ± 1.09
1.973GlnSer: 1.973 ± 0.334
2.289GlnThr: 2.289 ± 0.401
2.763GlnVal: 2.763 ± 0.389
0.474GlnTrp: 0.474 ± 0.178
1.184GlnTyr: 1.184 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
8.446ArgAla: 8.446 ± 1.056
0.789ArgCys: 0.789 ± 0.31
3.157ArgAsp: 3.157 ± 0.427
4.499ArgGlu: 4.499 ± 0.653
2.131ArgPhe: 2.131 ± 0.491
5.131ArgGly: 5.131 ± 0.723
1.342ArgHis: 1.342 ± 0.288
3.236ArgIle: 3.236 ± 0.514
3.789ArgLys: 3.789 ± 0.675
7.893ArgLeu: 7.893 ± 0.797
1.5ArgMet: 1.5 ± 0.336
1.894ArgAsn: 1.894 ± 0.4
3.71ArgPro: 3.71 ± 0.593
4.815ArgGln: 4.815 ± 1.2
8.13ArgArg: 8.13 ± 1.353
3.789ArgSer: 3.789 ± 0.565
4.026ArgThr: 4.026 ± 1.035
4.42ArgVal: 4.42 ± 0.62
1.894ArgTrp: 1.894 ± 0.355
1.658ArgTyr: 1.658 ± 0.284
0.0ArgXaa: 0.0 ± 0.0
Ser
8.288SerAla: 8.288 ± 0.957
1.026SerCys: 1.026 ± 0.223
2.921SerAsp: 2.921 ± 0.487
2.763SerGlu: 2.763 ± 0.59
1.658SerPhe: 1.658 ± 0.501
4.341SerGly: 4.341 ± 0.6
0.868SerHis: 0.868 ± 0.254
2.842SerIle: 2.842 ± 0.464
1.658SerLys: 1.658 ± 0.392
3.868SerLeu: 3.868 ± 0.588
1.184SerMet: 1.184 ± 0.262
1.658SerAsn: 1.658 ± 0.597
2.447SerPro: 2.447 ± 0.358
1.737SerGln: 1.737 ± 0.279
3.473SerArg: 3.473 ± 0.496
3.552SerSer: 3.552 ± 0.509
4.026SerThr: 4.026 ± 0.427
4.105SerVal: 4.105 ± 0.577
1.026SerTrp: 1.026 ± 0.363
1.737SerTyr: 1.737 ± 0.359
0.0SerXaa: 0.0 ± 0.0
Thr
7.183ThrAla: 7.183 ± 0.684
0.71ThrCys: 0.71 ± 0.184
2.368ThrAsp: 2.368 ± 0.475
2.368ThrGlu: 2.368 ± 0.53
1.579ThrPhe: 1.579 ± 0.274
5.604ThrGly: 5.604 ± 0.761
0.868ThrHis: 0.868 ± 0.254
2.21ThrIle: 2.21 ± 0.332
1.973ThrLys: 1.973 ± 0.488
4.657ThrLeu: 4.657 ± 0.669
1.105ThrMet: 1.105 ± 0.283
2.131ThrAsn: 2.131 ± 0.425
5.052ThrPro: 5.052 ± 1.104
1.5ThrGln: 1.5 ± 0.323
3.473ThrArg: 3.473 ± 0.728
3.078ThrSer: 3.078 ± 0.626
4.262ThrThr: 4.262 ± 0.845
4.42ThrVal: 4.42 ± 0.699
1.342ThrTrp: 1.342 ± 0.356
1.342ThrTyr: 1.342 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
7.341ValAla: 7.341 ± 0.698
0.868ValCys: 0.868 ± 0.294
4.499ValAsp: 4.499 ± 0.623
5.683ValGlu: 5.683 ± 0.685
2.052ValPhe: 2.052 ± 0.386
4.657ValGly: 4.657 ± 0.617
1.026ValHis: 1.026 ± 0.316
2.447ValIle: 2.447 ± 0.367
2.999ValLys: 2.999 ± 0.428
4.894ValLeu: 4.894 ± 0.68
1.658ValMet: 1.658 ± 0.353
2.289ValAsn: 2.289 ± 0.42
4.026ValPro: 4.026 ± 0.675
2.763ValGln: 2.763 ± 0.455
5.21ValArg: 5.21 ± 0.601
3.868ValSer: 3.868 ± 0.691
5.131ValThr: 5.131 ± 0.897
5.21ValVal: 5.21 ± 0.672
0.71ValTrp: 0.71 ± 0.229
1.105ValTyr: 1.105 ± 0.29
0.0ValXaa: 0.0 ± 0.0
Trp
2.052TrpAla: 2.052 ± 0.443
0.474TrpCys: 0.474 ± 0.177
0.868TrpAsp: 0.868 ± 0.3
0.631TrpGlu: 0.631 ± 0.21
0.947TrpPhe: 0.947 ± 0.239
0.947TrpGly: 0.947 ± 0.286
0.158TrpHis: 0.158 ± 0.112
0.789TrpIle: 0.789 ± 0.207
0.316TrpLys: 0.316 ± 0.151
1.579TrpLeu: 1.579 ± 0.369
0.553TrpMet: 0.553 ± 0.233
0.71TrpAsn: 0.71 ± 0.297
1.026TrpPro: 1.026 ± 0.289
0.553TrpGln: 0.553 ± 0.194
1.342TrpArg: 1.342 ± 0.328
1.105TrpSer: 1.105 ± 0.328
1.184TrpThr: 1.184 ± 0.265
1.815TrpVal: 1.815 ± 0.436
0.71TrpTrp: 0.71 ± 0.23
0.395TrpTyr: 0.395 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.842TyrAla: 2.842 ± 0.533
0.474TyrCys: 0.474 ± 0.187
1.342TyrAsp: 1.342 ± 0.301
1.263TyrGlu: 1.263 ± 0.289
0.316TyrPhe: 0.316 ± 0.152
1.5TyrGly: 1.5 ± 0.388
0.553TyrHis: 0.553 ± 0.202
0.868TyrIle: 0.868 ± 0.237
0.631TyrLys: 0.631 ± 0.227
2.605TyrLeu: 2.605 ± 0.51
0.316TyrMet: 0.316 ± 0.184
1.026TyrAsn: 1.026 ± 0.239
1.5TyrPro: 1.5 ± 0.296
1.026TyrGln: 1.026 ± 0.285
3.078TyrArg: 3.078 ± 0.558
0.947TyrSer: 0.947 ± 0.244
1.5TyrThr: 1.5 ± 0.348
1.421TyrVal: 1.421 ± 0.352
0.395TyrTrp: 0.395 ± 0.172
0.868TyrTyr: 0.868 ± 0.291
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (12670 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski