Amino acid dipepetide frequency for Pseudomonas phage TC6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.05AlaAla: 8.05 ± 1.24
1.086AlaCys: 1.086 ± 0.266
4.344AlaAsp: 4.344 ± 0.452
6.325AlaGlu: 6.325 ± 0.646
3.003AlaPhe: 3.003 ± 0.385
5.75AlaGly: 5.75 ± 0.611
1.533AlaHis: 1.533 ± 0.344
3.258AlaIle: 3.258 ± 0.374
5.366AlaLys: 5.366 ± 0.632
6.325AlaLeu: 6.325 ± 0.489
2.939AlaMet: 2.939 ± 0.453
3.961AlaAsn: 3.961 ± 0.483
2.364AlaPro: 2.364 ± 0.379
4.153AlaGln: 4.153 ± 0.613
5.239AlaArg: 5.239 ± 0.742
4.791AlaSer: 4.791 ± 0.697
4.855AlaThr: 4.855 ± 0.684
4.791AlaVal: 4.791 ± 0.536
1.533AlaTrp: 1.533 ± 0.321
3.578AlaTyr: 3.578 ± 0.374
0.0AlaXaa: 0.0 ± 0.0
Cys
0.958CysAla: 0.958 ± 0.232
0.319CysCys: 0.319 ± 0.149
0.894CysAsp: 0.894 ± 0.288
0.894CysGlu: 0.894 ± 0.22
0.447CysPhe: 0.447 ± 0.165
1.214CysGly: 1.214 ± 0.33
0.319CysHis: 0.319 ± 0.171
0.575CysIle: 0.575 ± 0.211
1.022CysLys: 1.022 ± 0.29
0.894CysLeu: 0.894 ± 0.25
0.319CysMet: 0.319 ± 0.143
0.511CysAsn: 0.511 ± 0.156
0.319CysPro: 0.319 ± 0.142
0.256CysGln: 0.256 ± 0.119
0.767CysArg: 0.767 ± 0.242
0.958CysSer: 0.958 ± 0.229
0.447CysThr: 0.447 ± 0.152
0.894CysVal: 0.894 ± 0.297
0.128CysTrp: 0.128 ± 0.087
0.383CysTyr: 0.383 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
4.536AspAla: 4.536 ± 0.588
0.639AspCys: 0.639 ± 0.196
4.089AspAsp: 4.089 ± 0.39
4.025AspGlu: 4.025 ± 0.58
2.939AspPhe: 2.939 ± 0.444
5.494AspGly: 5.494 ± 0.674
1.342AspHis: 1.342 ± 0.274
4.153AspIle: 4.153 ± 0.496
3.258AspLys: 3.258 ± 0.4
4.728AspLeu: 4.728 ± 0.443
1.278AspMet: 1.278 ± 0.229
3.322AspAsn: 3.322 ± 0.484
2.555AspPro: 2.555 ± 0.419
1.533AspGln: 1.533 ± 0.287
3.13AspArg: 3.13 ± 0.538
3.897AspSer: 3.897 ± 0.531
3.322AspThr: 3.322 ± 0.597
3.833AspVal: 3.833 ± 0.464
1.342AspTrp: 1.342 ± 0.22
2.875AspTyr: 2.875 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
6.325GluAla: 6.325 ± 0.761
1.469GluCys: 1.469 ± 0.334
4.216GluAsp: 4.216 ± 0.425
6.133GluGlu: 6.133 ± 0.722
2.811GluPhe: 2.811 ± 0.396
5.239GluGly: 5.239 ± 0.519
1.533GluHis: 1.533 ± 0.418
3.641GluIle: 3.641 ± 0.501
5.111GluLys: 5.111 ± 0.643
5.75GluLeu: 5.75 ± 0.657
1.853GluMet: 1.853 ± 0.411
1.917GluAsn: 1.917 ± 0.253
2.172GluPro: 2.172 ± 0.464
3.45GluGln: 3.45 ± 0.502
3.705GluArg: 3.705 ± 0.6
1.98GluSer: 1.98 ± 0.38
3.13GluThr: 3.13 ± 0.594
5.366GluVal: 5.366 ± 0.779
1.342GluTrp: 1.342 ± 0.368
2.939GluTyr: 2.939 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
3.322PheAla: 3.322 ± 0.471
0.703PheCys: 0.703 ± 0.282
3.13PheAsp: 3.13 ± 0.348
2.683PheGlu: 2.683 ± 0.465
1.789PhePhe: 1.789 ± 0.352
3.322PheGly: 3.322 ± 0.354
0.703PheHis: 0.703 ± 0.193
2.364PheIle: 2.364 ± 0.455
2.875PheLys: 2.875 ± 0.421
2.044PheLeu: 2.044 ± 0.338
0.958PheMet: 0.958 ± 0.218
2.619PheAsn: 2.619 ± 0.4
1.342PhePro: 1.342 ± 0.225
1.533PheGln: 1.533 ± 0.327
1.98PheArg: 1.98 ± 0.351
2.364PheSer: 2.364 ± 0.395
1.853PheThr: 1.853 ± 0.377
2.555PheVal: 2.555 ± 0.426
0.575PheTrp: 0.575 ± 0.176
1.278PheTyr: 1.278 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
5.43GlyAla: 5.43 ± 0.948
0.447GlyCys: 0.447 ± 0.168
4.919GlyAsp: 4.919 ± 0.58
4.855GlyGlu: 4.855 ± 0.51
3.705GlyPhe: 3.705 ± 0.56
5.239GlyGly: 5.239 ± 0.588
1.405GlyHis: 1.405 ± 0.295
3.258GlyIle: 3.258 ± 0.447
5.43GlyLys: 5.43 ± 0.549
6.005GlyLeu: 6.005 ± 0.609
1.725GlyMet: 1.725 ± 0.275
3.705GlyAsn: 3.705 ± 0.529
1.789GlyPro: 1.789 ± 0.323
2.172GlyGln: 2.172 ± 0.411
3.067GlyArg: 3.067 ± 0.47
4.216GlySer: 4.216 ± 0.524
3.961GlyThr: 3.961 ± 0.559
5.111GlyVal: 5.111 ± 0.506
1.917GlyTrp: 1.917 ± 0.363
3.322GlyTyr: 3.322 ± 0.444
0.0GlyXaa: 0.0 ± 0.0
His
1.022HisAla: 1.022 ± 0.274
0.447HisCys: 0.447 ± 0.145
1.214HisAsp: 1.214 ± 0.28
1.597HisGlu: 1.597 ± 0.332
0.894HisPhe: 0.894 ± 0.238
1.214HisGly: 1.214 ± 0.246
0.192HisHis: 0.192 ± 0.13
1.469HisIle: 1.469 ± 0.266
1.278HisLys: 1.278 ± 0.274
2.044HisLeu: 2.044 ± 0.375
0.575HisMet: 0.575 ± 0.139
0.894HisAsn: 0.894 ± 0.292
0.767HisPro: 0.767 ± 0.308
0.575HisGln: 0.575 ± 0.2
0.639HisArg: 0.639 ± 0.235
1.086HisSer: 1.086 ± 0.255
0.575HisThr: 0.575 ± 0.215
1.661HisVal: 1.661 ± 0.304
0.256HisTrp: 0.256 ± 0.126
1.022HisTyr: 1.022 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
3.641IleAla: 3.641 ± 0.384
0.447IleCys: 0.447 ± 0.233
4.408IleAsp: 4.408 ± 0.451
3.322IleGlu: 3.322 ± 0.416
1.661IlePhe: 1.661 ± 0.35
3.641IleGly: 3.641 ± 0.48
1.214IleHis: 1.214 ± 0.285
3.003IleIle: 3.003 ± 0.445
4.28IleLys: 4.28 ± 0.575
3.322IleLeu: 3.322 ± 0.547
1.086IleMet: 1.086 ± 0.283
2.811IleAsn: 2.811 ± 0.501
1.917IlePro: 1.917 ± 0.265
2.236IleGln: 2.236 ± 0.36
3.322IleArg: 3.322 ± 0.433
2.811IleSer: 2.811 ± 0.35
3.067IleThr: 3.067 ± 0.468
3.578IleVal: 3.578 ± 0.508
0.575IleTrp: 0.575 ± 0.181
2.108IleTyr: 2.108 ± 0.456
0.0IleXaa: 0.0 ± 0.0
Lys
6.452LysAla: 6.452 ± 0.766
0.767LysCys: 0.767 ± 0.288
4.408LysAsp: 4.408 ± 0.802
5.622LysGlu: 5.622 ± 0.65
2.747LysPhe: 2.747 ± 0.377
4.983LysGly: 4.983 ± 0.772
1.214LysHis: 1.214 ± 0.288
3.578LysIle: 3.578 ± 0.497
4.6LysLys: 4.6 ± 0.742
5.047LysLeu: 5.047 ± 0.663
1.853LysMet: 1.853 ± 0.37
3.769LysAsn: 3.769 ± 0.513
2.619LysPro: 2.619 ± 0.5
2.939LysGln: 2.939 ± 0.421
3.322LysArg: 3.322 ± 0.358
3.322LysSer: 3.322 ± 0.433
3.45LysThr: 3.45 ± 0.406
4.6LysVal: 4.6 ± 0.632
1.214LysTrp: 1.214 ± 0.219
1.725LysTyr: 1.725 ± 0.297
0.0LysXaa: 0.0 ± 0.0
Leu
6.772LeuAla: 6.772 ± 0.658
1.086LeuCys: 1.086 ± 0.272
4.728LeuAsp: 4.728 ± 0.48
6.197LeuGlu: 6.197 ± 0.727
2.428LeuPhe: 2.428 ± 0.339
5.111LeuGly: 5.111 ± 0.634
1.725LeuHis: 1.725 ± 0.352
2.811LeuIle: 2.811 ± 0.487
5.494LeuLys: 5.494 ± 0.477
5.047LeuLeu: 5.047 ± 0.658
1.405LeuMet: 1.405 ± 0.353
5.111LeuAsn: 5.111 ± 0.625
2.939LeuPro: 2.939 ± 0.378
3.833LeuGln: 3.833 ± 0.615
4.855LeuArg: 4.855 ± 0.664
4.28LeuSer: 4.28 ± 0.541
3.13LeuThr: 3.13 ± 0.456
6.197LeuVal: 6.197 ± 0.844
0.767LeuTrp: 0.767 ± 0.268
3.194LeuTyr: 3.194 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
2.875MetAla: 2.875 ± 0.365
0.511MetCys: 0.511 ± 0.225
1.086MetAsp: 1.086 ± 0.304
1.086MetGlu: 1.086 ± 0.29
1.022MetPhe: 1.022 ± 0.21
1.342MetGly: 1.342 ± 0.237
0.256MetHis: 0.256 ± 0.136
1.15MetIle: 1.15 ± 0.245
1.98MetLys: 1.98 ± 0.389
2.044MetLeu: 2.044 ± 0.329
0.383MetMet: 0.383 ± 0.145
1.022MetAsn: 1.022 ± 0.266
1.661MetPro: 1.661 ± 0.248
0.767MetGln: 0.767 ± 0.239
1.789MetArg: 1.789 ± 0.337
2.172MetSer: 2.172 ± 0.325
1.469MetThr: 1.469 ± 0.335
1.917MetVal: 1.917 ± 0.313
0.319MetTrp: 0.319 ± 0.118
1.022MetTyr: 1.022 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
4.344AsnAla: 4.344 ± 0.544
0.447AsnCys: 0.447 ± 0.161
2.683AsnAsp: 2.683 ± 0.464
2.236AsnGlu: 2.236 ± 0.381
2.044AsnPhe: 2.044 ± 0.418
4.6AsnGly: 4.6 ± 0.534
0.767AsnHis: 0.767 ± 0.231
4.344AsnIle: 4.344 ± 0.596
2.811AsnLys: 2.811 ± 0.322
5.558AsnLeu: 5.558 ± 0.518
1.469AsnMet: 1.469 ± 0.24
2.236AsnAsn: 2.236 ± 0.33
2.428AsnPro: 2.428 ± 0.362
1.98AsnGln: 1.98 ± 0.383
2.044AsnArg: 2.044 ± 0.279
2.939AsnSer: 2.939 ± 0.405
3.003AsnThr: 3.003 ± 0.5
3.194AsnVal: 3.194 ± 0.46
0.703AsnTrp: 0.703 ± 0.222
1.98AsnTyr: 1.98 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.683ProAla: 2.683 ± 0.407
0.319ProCys: 0.319 ± 0.143
2.939ProAsp: 2.939 ± 0.438
3.386ProGlu: 3.386 ± 0.53
1.15ProPhe: 1.15 ± 0.234
2.364ProGly: 2.364 ± 0.444
1.15ProHis: 1.15 ± 0.219
1.661ProIle: 1.661 ± 0.342
2.875ProLys: 2.875 ± 0.557
2.3ProLeu: 2.3 ± 0.358
1.086ProMet: 1.086 ± 0.266
1.661ProAsn: 1.661 ± 0.275
1.342ProPro: 1.342 ± 0.299
1.405ProGln: 1.405 ± 0.295
1.469ProArg: 1.469 ± 0.278
2.619ProSer: 2.619 ± 0.479
2.236ProThr: 2.236 ± 0.347
3.705ProVal: 3.705 ± 0.502
0.511ProTrp: 0.511 ± 0.162
1.15ProTyr: 1.15 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
4.791GlnAla: 4.791 ± 0.829
0.256GlnCys: 0.256 ± 0.111
1.98GlnAsp: 1.98 ± 0.304
3.258GlnGlu: 3.258 ± 0.443
1.725GlnPhe: 1.725 ± 0.353
1.789GlnGly: 1.789 ± 0.351
1.022GlnHis: 1.022 ± 0.249
1.853GlnIle: 1.853 ± 0.293
2.747GlnLys: 2.747 ± 0.426
2.811GlnLeu: 2.811 ± 0.442
1.022GlnMet: 1.022 ± 0.329
1.533GlnAsn: 1.533 ± 0.346
1.533GlnPro: 1.533 ± 0.338
2.3GlnGln: 2.3 ± 0.561
2.619GlnArg: 2.619 ± 0.334
1.725GlnSer: 1.725 ± 0.342
2.044GlnThr: 2.044 ± 0.497
2.811GlnVal: 2.811 ± 0.556
0.767GlnTrp: 0.767 ± 0.187
1.917GlnTyr: 1.917 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
4.153ArgAla: 4.153 ± 0.621
0.767ArgCys: 0.767 ± 0.263
2.619ArgAsp: 2.619 ± 0.313
3.514ArgGlu: 3.514 ± 0.497
2.044ArgPhe: 2.044 ± 0.406
3.641ArgGly: 3.641 ± 0.619
1.278ArgHis: 1.278 ± 0.377
2.492ArgIle: 2.492 ± 0.436
3.578ArgLys: 3.578 ± 0.578
3.897ArgLeu: 3.897 ± 0.479
2.044ArgMet: 2.044 ± 0.402
3.258ArgAsn: 3.258 ± 0.393
2.364ArgPro: 2.364 ± 0.369
2.3ArgGln: 2.3 ± 0.327
2.236ArgArg: 2.236 ± 0.304
2.875ArgSer: 2.875 ± 0.337
2.044ArgThr: 2.044 ± 0.35
3.45ArgVal: 3.45 ± 0.481
0.831ArgTrp: 0.831 ± 0.174
1.853ArgTyr: 1.853 ± 0.316
0.0ArgXaa: 0.0 ± 0.0
Ser
3.641SerAla: 3.641 ± 0.51
0.831SerCys: 0.831 ± 0.249
2.428SerAsp: 2.428 ± 0.414
3.13SerGlu: 3.13 ± 0.492
2.428SerPhe: 2.428 ± 0.354
4.472SerGly: 4.472 ± 0.598
0.894SerHis: 0.894 ± 0.222
3.003SerIle: 3.003 ± 0.358
3.514SerLys: 3.514 ± 0.493
4.728SerLeu: 4.728 ± 0.55
1.405SerMet: 1.405 ± 0.296
2.875SerAsn: 2.875 ± 0.375
2.875SerPro: 2.875 ± 0.489
2.236SerGln: 2.236 ± 0.46
3.003SerArg: 3.003 ± 0.458
2.875SerSer: 2.875 ± 0.405
3.578SerThr: 3.578 ± 0.537
4.855SerVal: 4.855 ± 0.571
1.022SerTrp: 1.022 ± 0.253
1.917SerTyr: 1.917 ± 0.353
0.0SerXaa: 0.0 ± 0.0
Thr
3.769ThrAla: 3.769 ± 0.442
0.511ThrCys: 0.511 ± 0.183
2.875ThrAsp: 2.875 ± 0.396
3.194ThrGlu: 3.194 ± 0.485
2.619ThrPhe: 2.619 ± 0.483
3.705ThrGly: 3.705 ± 0.556
0.511ThrHis: 0.511 ± 0.178
3.578ThrIle: 3.578 ± 0.495
3.003ThrLys: 3.003 ± 0.408
4.728ThrLeu: 4.728 ± 0.686
0.958ThrMet: 0.958 ± 0.25
2.683ThrAsn: 2.683 ± 0.364
3.003ThrPro: 3.003 ± 0.465
1.853ThrGln: 1.853 ± 0.427
2.172ThrArg: 2.172 ± 0.315
3.514ThrSer: 3.514 ± 0.657
3.641ThrThr: 3.641 ± 0.57
3.45ThrVal: 3.45 ± 0.58
1.022ThrTrp: 1.022 ± 0.244
2.108ThrTyr: 2.108 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
6.836ValAla: 6.836 ± 0.686
0.767ValCys: 0.767 ± 0.218
5.877ValAsp: 5.877 ± 0.505
4.536ValGlu: 4.536 ± 0.517
3.003ValPhe: 3.003 ± 0.459
4.216ValGly: 4.216 ± 0.579
1.214ValHis: 1.214 ± 0.334
2.939ValIle: 2.939 ± 0.448
4.6ValLys: 4.6 ± 0.478
4.6ValLeu: 4.6 ± 0.596
1.853ValMet: 1.853 ± 0.27
4.472ValAsn: 4.472 ± 0.721
2.428ValPro: 2.428 ± 0.422
2.939ValGln: 2.939 ± 0.474
3.003ValArg: 3.003 ± 0.479
4.472ValSer: 4.472 ± 0.499
4.664ValThr: 4.664 ± 0.732
5.111ValVal: 5.111 ± 0.603
1.342ValTrp: 1.342 ± 0.3
2.044ValTyr: 2.044 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
1.533TrpAla: 1.533 ± 0.304
0.319TrpCys: 0.319 ± 0.131
0.894TrpAsp: 0.894 ± 0.23
1.342TrpGlu: 1.342 ± 0.271
0.575TrpPhe: 0.575 ± 0.156
0.958TrpGly: 0.958 ± 0.207
0.383TrpHis: 0.383 ± 0.155
0.894TrpIle: 0.894 ± 0.237
1.342TrpLys: 1.342 ± 0.337
1.98TrpLeu: 1.98 ± 0.302
0.639TrpMet: 0.639 ± 0.192
0.831TrpAsn: 0.831 ± 0.274
0.447TrpPro: 0.447 ± 0.139
0.511TrpGln: 0.511 ± 0.129
1.278TrpArg: 1.278 ± 0.273
0.894TrpSer: 0.894 ± 0.252
0.958TrpThr: 0.958 ± 0.26
1.086TrpVal: 1.086 ± 0.281
0.192TrpTrp: 0.192 ± 0.106
0.447TrpTyr: 0.447 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.172TyrAla: 2.172 ± 0.387
0.383TyrCys: 0.383 ± 0.126
2.555TyrAsp: 2.555 ± 0.37
2.747TyrGlu: 2.747 ± 0.508
0.958TyrPhe: 0.958 ± 0.267
3.386TyrGly: 3.386 ± 0.375
0.767TyrHis: 0.767 ± 0.242
2.428TyrIle: 2.428 ± 0.388
3.067TyrLys: 3.067 ± 0.51
3.258TyrLeu: 3.258 ± 0.446
0.958TyrMet: 0.958 ± 0.208
2.619TyrAsn: 2.619 ± 0.392
1.15TyrPro: 1.15 ± 0.309
1.533TyrGln: 1.533 ± 0.306
1.597TyrArg: 1.597 ± 0.285
1.917TyrSer: 1.917 ± 0.314
1.469TyrThr: 1.469 ± 0.28
2.747TyrVal: 2.747 ± 0.412
1.15TyrTrp: 1.15 ± 0.277
1.661TyrTyr: 1.661 ± 0.271
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (15654 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski