Amino acid dipepetide frequency for Pseudomonas phage TL

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.049AlaAla: 8.049 ± 1.502
0.827AlaCys: 0.827 ± 0.239
4.363AlaAsp: 4.363 ± 0.541
6.319AlaGlu: 6.319 ± 0.982
2.934AlaPhe: 2.934 ± 0.47
6.77AlaGly: 6.77 ± 1.301
0.978AlaHis: 0.978 ± 0.242
4.664AlaIle: 4.664 ± 0.698
4.889AlaLys: 4.889 ± 0.624
7.597AlaLeu: 7.597 ± 0.761
3.159AlaMet: 3.159 ± 0.507
3.46AlaAsn: 3.46 ± 0.645
3.385AlaPro: 3.385 ± 0.47
4.889AlaGln: 4.889 ± 1.079
3.009AlaArg: 3.009 ± 0.605
5.341AlaSer: 5.341 ± 0.634
3.385AlaThr: 3.385 ± 0.591
6.093AlaVal: 6.093 ± 0.824
0.903AlaTrp: 0.903 ± 0.255
2.558AlaTyr: 2.558 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
0.677CysAla: 0.677 ± 0.265
0.226CysCys: 0.226 ± 0.158
0.752CysAsp: 0.752 ± 0.307
1.053CysGlu: 1.053 ± 0.263
0.15CysPhe: 0.15 ± 0.11
0.827CysGly: 0.827 ± 0.258
0.752CysHis: 0.752 ± 0.234
0.602CysIle: 0.602 ± 0.225
0.376CysLys: 0.376 ± 0.154
0.301CysLeu: 0.301 ± 0.134
0.301CysMet: 0.301 ± 0.147
0.527CysAsn: 0.527 ± 0.199
0.451CysPro: 0.451 ± 0.229
0.301CysGln: 0.301 ± 0.125
0.451CysArg: 0.451 ± 0.149
0.903CysSer: 0.903 ± 0.264
0.677CysThr: 0.677 ± 0.209
0.451CysVal: 0.451 ± 0.262
0.451CysTrp: 0.451 ± 0.221
0.376CysTyr: 0.376 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
4.889AspAla: 4.889 ± 0.563
0.376AspCys: 0.376 ± 0.163
3.159AspAsp: 3.159 ± 0.496
4.288AspGlu: 4.288 ± 0.756
2.031AspPhe: 2.031 ± 0.433
5.04AspGly: 5.04 ± 0.745
1.655AspHis: 1.655 ± 0.447
3.686AspIle: 3.686 ± 0.441
2.633AspLys: 2.633 ± 0.459
6.243AspLeu: 6.243 ± 0.707
1.655AspMet: 1.655 ± 0.305
2.181AspAsn: 2.181 ± 0.325
3.535AspPro: 3.535 ± 0.61
1.655AspGln: 1.655 ± 0.399
3.535AspArg: 3.535 ± 0.496
2.708AspSer: 2.708 ± 0.521
3.611AspThr: 3.611 ± 0.587
2.934AspVal: 2.934 ± 0.456
1.73AspTrp: 1.73 ± 0.331
1.73AspTyr: 1.73 ± 0.317
0.0AspXaa: 0.0 ± 0.0
Glu
6.845GluAla: 6.845 ± 1.019
0.752GluCys: 0.752 ± 0.246
4.363GluAsp: 4.363 ± 0.701
7.372GluGlu: 7.372 ± 1.372
2.783GluPhe: 2.783 ± 0.416
5.416GluGly: 5.416 ± 0.742
1.204GluHis: 1.204 ± 0.289
3.836GluIle: 3.836 ± 0.452
3.761GluLys: 3.761 ± 0.595
6.92GluLeu: 6.92 ± 0.806
2.558GluMet: 2.558 ± 0.448
2.332GluAsn: 2.332 ± 0.411
1.956GluPro: 1.956 ± 0.3
2.332GluGln: 2.332 ± 0.472
4.664GluArg: 4.664 ± 0.692
2.858GluSer: 2.858 ± 0.518
3.235GluThr: 3.235 ± 0.446
6.469GluVal: 6.469 ± 0.65
0.903GluTrp: 0.903 ± 0.224
2.106GluTyr: 2.106 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
1.956PheAla: 1.956 ± 0.4
0.451PheCys: 0.451 ± 0.183
2.407PheAsp: 2.407 ± 0.571
2.257PheGlu: 2.257 ± 0.462
1.204PhePhe: 1.204 ± 0.304
4.288PheGly: 4.288 ± 0.477
0.752PheHis: 0.752 ± 0.213
2.407PheIle: 2.407 ± 0.442
2.031PheLys: 2.031 ± 0.425
3.535PheLeu: 3.535 ± 0.398
0.903PheMet: 0.903 ± 0.215
1.805PheAsn: 1.805 ± 0.348
1.805PhePro: 1.805 ± 0.399
1.655PheGln: 1.655 ± 0.306
1.805PheArg: 1.805 ± 0.412
2.633PheSer: 2.633 ± 0.407
2.106PheThr: 2.106 ± 0.303
2.332PheVal: 2.332 ± 0.356
0.978PheTrp: 0.978 ± 0.296
1.053PheTyr: 1.053 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
7.597GlyAla: 7.597 ± 1.282
0.827GlyCys: 0.827 ± 0.232
5.115GlyAsp: 5.115 ± 0.548
4.739GlyGlu: 4.739 ± 0.571
3.159GlyPhe: 3.159 ± 0.458
7.071GlyGly: 7.071 ± 1.161
1.655GlyHis: 1.655 ± 0.398
4.363GlyIle: 4.363 ± 0.513
5.341GlyLys: 5.341 ± 0.722
6.168GlyLeu: 6.168 ± 0.851
2.558GlyMet: 2.558 ± 0.429
3.46GlyAsn: 3.46 ± 0.596
2.934GlyPro: 2.934 ± 0.476
3.159GlyGln: 3.159 ± 0.554
4.739GlyArg: 4.739 ± 0.638
5.717GlySer: 5.717 ± 0.833
4.965GlyThr: 4.965 ± 0.717
6.394GlyVal: 6.394 ± 0.686
1.429GlyTrp: 1.429 ± 0.343
2.407GlyTyr: 2.407 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
1.504HisAla: 1.504 ± 0.392
0.527HisCys: 0.527 ± 0.218
0.827HisAsp: 0.827 ± 0.325
1.504HisGlu: 1.504 ± 0.502
1.204HisPhe: 1.204 ± 0.337
1.279HisGly: 1.279 ± 0.326
0.602HisHis: 0.602 ± 0.2
1.204HisIle: 1.204 ± 0.314
1.279HisLys: 1.279 ± 0.292
3.084HisLeu: 3.084 ± 0.49
0.226HisMet: 0.226 ± 0.113
0.602HisAsn: 0.602 ± 0.195
0.677HisPro: 0.677 ± 0.198
0.677HisGln: 0.677 ± 0.216
1.354HisArg: 1.354 ± 0.34
0.903HisSer: 0.903 ± 0.298
0.677HisThr: 0.677 ± 0.227
0.978HisVal: 0.978 ± 0.29
0.527HisTrp: 0.527 ± 0.226
0.752HisTyr: 0.752 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
3.761IleAla: 3.761 ± 0.541
0.602IleCys: 0.602 ± 0.193
2.633IleAsp: 2.633 ± 0.51
3.009IleGlu: 3.009 ± 0.447
2.106IlePhe: 2.106 ± 0.335
4.965IleGly: 4.965 ± 0.559
0.752IleHis: 0.752 ± 0.252
2.482IleIle: 2.482 ± 0.455
3.761IleLys: 3.761 ± 0.536
3.987IleLeu: 3.987 ± 0.53
1.128IleMet: 1.128 ± 0.331
1.956IleAsn: 1.956 ± 0.407
3.235IlePro: 3.235 ± 0.605
2.558IleGln: 2.558 ± 0.501
3.084IleArg: 3.084 ± 0.456
2.482IleSer: 2.482 ± 0.399
2.934IleThr: 2.934 ± 0.491
2.558IleVal: 2.558 ± 0.533
0.677IleTrp: 0.677 ± 0.185
1.73IleTyr: 1.73 ± 0.29
0.0IleXaa: 0.0 ± 0.0
Lys
4.513LysAla: 4.513 ± 0.729
0.527LysCys: 0.527 ± 0.209
4.363LysAsp: 4.363 ± 0.532
4.062LysGlu: 4.062 ± 0.514
2.181LysPhe: 2.181 ± 0.406
4.137LysGly: 4.137 ± 0.616
1.354LysHis: 1.354 ± 0.334
2.783LysIle: 2.783 ± 0.347
3.535LysLys: 3.535 ± 0.528
4.513LysLeu: 4.513 ± 0.558
1.053LysMet: 1.053 ± 0.273
2.181LysAsn: 2.181 ± 0.39
2.708LysPro: 2.708 ± 0.582
1.504LysGln: 1.504 ± 0.326
3.46LysArg: 3.46 ± 0.676
3.31LysSer: 3.31 ± 0.506
2.934LysThr: 2.934 ± 0.476
4.513LysVal: 4.513 ± 0.539
1.58LysTrp: 1.58 ± 0.284
2.783LysTyr: 2.783 ± 0.473
0.0LysXaa: 0.0 ± 0.0
Leu
7.071LeuAla: 7.071 ± 0.714
0.752LeuCys: 0.752 ± 0.212
5.642LeuAsp: 5.642 ± 0.554
6.168LeuGlu: 6.168 ± 0.825
2.783LeuPhe: 2.783 ± 0.493
7.522LeuGly: 7.522 ± 1.017
1.053LeuHis: 1.053 ± 0.332
2.934LeuIle: 2.934 ± 0.354
5.642LeuLys: 5.642 ± 0.677
7.297LeuLeu: 7.297 ± 0.702
2.257LeuMet: 2.257 ± 0.552
3.385LeuAsn: 3.385 ± 0.465
3.385LeuPro: 3.385 ± 0.626
3.761LeuGln: 3.761 ± 0.691
6.093LeuArg: 6.093 ± 0.602
4.814LeuSer: 4.814 ± 0.654
3.761LeuThr: 3.761 ± 0.51
5.115LeuVal: 5.115 ± 0.555
0.903LeuTrp: 0.903 ± 0.357
2.106LeuTyr: 2.106 ± 0.463
0.0LeuXaa: 0.0 ± 0.0
Met
3.46MetAla: 3.46 ± 0.5
0.15MetCys: 0.15 ± 0.115
1.881MetAsp: 1.881 ± 0.404
1.279MetGlu: 1.279 ± 0.24
1.204MetPhe: 1.204 ± 0.274
1.956MetGly: 1.956 ± 0.472
0.451MetHis: 0.451 ± 0.178
1.58MetIle: 1.58 ± 0.365
2.181MetLys: 2.181 ± 0.387
1.805MetLeu: 1.805 ± 0.293
0.602MetMet: 0.602 ± 0.184
1.204MetAsn: 1.204 ± 0.313
1.354MetPro: 1.354 ± 0.285
1.279MetGln: 1.279 ± 0.293
1.429MetArg: 1.429 ± 0.306
2.106MetSer: 2.106 ± 0.43
1.805MetThr: 1.805 ± 0.373
1.053MetVal: 1.053 ± 0.275
0.451MetTrp: 0.451 ± 0.171
0.677MetTyr: 0.677 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
3.535AsnAla: 3.535 ± 0.647
0.827AsnCys: 0.827 ± 0.271
2.031AsnAsp: 2.031 ± 0.353
3.084AsnGlu: 3.084 ± 0.466
1.73AsnPhe: 1.73 ± 0.334
3.235AsnGly: 3.235 ± 0.562
1.128AsnHis: 1.128 ± 0.397
3.009AsnIle: 3.009 ± 0.518
1.58AsnLys: 1.58 ± 0.304
2.783AsnLeu: 2.783 ± 0.474
1.053AsnMet: 1.053 ± 0.303
1.429AsnAsn: 1.429 ± 0.316
2.708AsnPro: 2.708 ± 0.467
2.181AsnGln: 2.181 ± 0.478
2.633AsnArg: 2.633 ± 0.36
1.805AsnSer: 1.805 ± 0.377
2.858AsnThr: 2.858 ± 0.43
2.407AsnVal: 2.407 ± 0.465
0.827AsnTrp: 0.827 ± 0.179
1.354AsnTyr: 1.354 ± 0.342
0.0AsnXaa: 0.0 ± 0.0
Pro
3.46ProAla: 3.46 ± 0.535
0.451ProCys: 0.451 ± 0.171
3.159ProAsp: 3.159 ± 0.45
4.739ProGlu: 4.739 ± 0.599
2.332ProPhe: 2.332 ± 0.386
3.46ProGly: 3.46 ± 0.584
0.677ProHis: 0.677 ± 0.2
1.504ProIle: 1.504 ± 0.272
3.084ProLys: 3.084 ± 0.529
2.407ProLeu: 2.407 ± 0.335
0.903ProMet: 0.903 ± 0.242
1.956ProAsn: 1.956 ± 0.434
1.58ProPro: 1.58 ± 0.56
2.181ProGln: 2.181 ± 0.463
1.504ProArg: 1.504 ± 0.336
2.257ProSer: 2.257 ± 0.523
2.858ProThr: 2.858 ± 0.461
2.407ProVal: 2.407 ± 0.385
0.827ProTrp: 0.827 ± 0.195
1.805ProTyr: 1.805 ± 0.385
0.0ProXaa: 0.0 ± 0.0
Gln
4.664GlnAla: 4.664 ± 0.882
0.301GlnCys: 0.301 ± 0.155
2.257GlnAsp: 2.257 ± 0.411
3.46GlnGlu: 3.46 ± 0.488
1.204GlnPhe: 1.204 ± 0.275
3.009GlnGly: 3.009 ± 0.626
0.376GlnHis: 0.376 ± 0.139
1.504GlnIle: 1.504 ± 0.319
1.73GlnLys: 1.73 ± 0.344
3.159GlnLeu: 3.159 ± 0.572
2.031GlnMet: 2.031 ± 0.365
2.181GlnAsn: 2.181 ± 0.527
1.204GlnPro: 1.204 ± 0.303
2.181GlnGln: 2.181 ± 0.524
2.934GlnArg: 2.934 ± 0.488
2.482GlnSer: 2.482 ± 0.417
1.73GlnThr: 1.73 ± 0.397
3.535GlnVal: 3.535 ± 0.433
0.527GlnTrp: 0.527 ± 0.216
0.827GlnTyr: 0.827 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
4.438ArgAla: 4.438 ± 0.783
0.527ArgCys: 0.527 ± 0.161
3.084ArgAsp: 3.084 ± 0.461
3.385ArgGlu: 3.385 ± 0.553
2.181ArgPhe: 2.181 ± 0.45
4.739ArgGly: 4.739 ± 0.621
1.354ArgHis: 1.354 ± 0.328
3.385ArgIle: 3.385 ± 0.542
3.46ArgLys: 3.46 ± 0.668
5.19ArgLeu: 5.19 ± 0.555
2.181ArgMet: 2.181 ± 0.363
3.009ArgAsn: 3.009 ± 0.494
2.031ArgPro: 2.031 ± 0.441
2.934ArgGln: 2.934 ± 0.451
3.987ArgArg: 3.987 ± 0.556
3.159ArgSer: 3.159 ± 0.488
1.956ArgThr: 1.956 ± 0.348
5.266ArgVal: 5.266 ± 0.631
0.978ArgTrp: 0.978 ± 0.288
1.204ArgTyr: 1.204 ± 0.291
0.0ArgXaa: 0.0 ± 0.0
Ser
4.438SerAla: 4.438 ± 0.606
0.527SerCys: 0.527 ± 0.219
3.009SerAsp: 3.009 ± 0.503
4.589SerGlu: 4.589 ± 0.548
2.407SerPhe: 2.407 ± 0.58
6.018SerGly: 6.018 ± 0.663
1.58SerHis: 1.58 ± 0.363
3.235SerIle: 3.235 ± 0.49
3.611SerLys: 3.611 ± 0.608
3.686SerLeu: 3.686 ± 0.708
1.053SerMet: 1.053 ± 0.295
2.558SerAsn: 2.558 ± 0.397
2.633SerPro: 2.633 ± 0.549
2.106SerGln: 2.106 ± 0.355
2.257SerArg: 2.257 ± 0.423
4.212SerSer: 4.212 ± 0.707
2.708SerThr: 2.708 ± 0.463
3.761SerVal: 3.761 ± 0.569
1.354SerTrp: 1.354 ± 0.324
2.031SerTyr: 2.031 ± 0.459
0.0SerXaa: 0.0 ± 0.0
Thr
4.664ThrAla: 4.664 ± 0.596
0.527ThrCys: 0.527 ± 0.174
1.956ThrAsp: 1.956 ± 0.425
3.235ThrGlu: 3.235 ± 0.413
1.956ThrPhe: 1.956 ± 0.394
4.889ThrGly: 4.889 ± 0.816
1.504ThrHis: 1.504 ± 0.279
2.257ThrIle: 2.257 ± 0.496
2.181ThrLys: 2.181 ± 0.379
5.04ThrLeu: 5.04 ± 0.626
1.279ThrMet: 1.279 ± 0.305
2.031ThrAsn: 2.031 ± 0.375
3.535ThrPro: 3.535 ± 0.57
2.783ThrGln: 2.783 ± 0.387
3.31ThrArg: 3.31 ± 0.447
2.934ThrSer: 2.934 ± 0.567
2.558ThrThr: 2.558 ± 0.531
3.385ThrVal: 3.385 ± 0.43
0.677ThrTrp: 0.677 ± 0.271
1.204ThrTyr: 1.204 ± 0.273
0.0ThrXaa: 0.0 ± 0.0
Val
5.19ValAla: 5.19 ± 0.675
0.677ValCys: 0.677 ± 0.221
4.363ValAsp: 4.363 ± 0.502
5.04ValGlu: 5.04 ± 0.577
3.009ValPhe: 3.009 ± 0.455
5.115ValGly: 5.115 ± 0.535
1.805ValHis: 1.805 ± 0.375
2.708ValIle: 2.708 ± 0.489
3.385ValLys: 3.385 ± 0.475
4.965ValLeu: 4.965 ± 0.67
1.805ValMet: 1.805 ± 0.387
3.686ValAsn: 3.686 ± 0.403
2.181ValPro: 2.181 ± 0.346
1.73ValGln: 1.73 ± 0.429
5.115ValArg: 5.115 ± 0.53
4.363ValSer: 4.363 ± 0.532
4.212ValThr: 4.212 ± 0.502
5.792ValVal: 5.792 ± 0.704
0.903ValTrp: 0.903 ± 0.269
2.783ValTyr: 2.783 ± 0.501
0.0ValXaa: 0.0 ± 0.0
Trp
1.204TrpAla: 1.204 ± 0.32
0.226TrpCys: 0.226 ± 0.118
1.204TrpAsp: 1.204 ± 0.276
1.504TrpGlu: 1.504 ± 0.28
0.602TrpPhe: 0.602 ± 0.191
1.429TrpGly: 1.429 ± 0.266
0.0TrpHis: 0.0 ± 0.0
0.527TrpIle: 0.527 ± 0.215
1.805TrpLys: 1.805 ± 0.383
1.354TrpLeu: 1.354 ± 0.353
0.677TrpMet: 0.677 ± 0.214
0.677TrpAsn: 0.677 ± 0.229
0.677TrpPro: 0.677 ± 0.264
0.376TrpGln: 0.376 ± 0.153
1.053TrpArg: 1.053 ± 0.252
0.978TrpSer: 0.978 ± 0.302
1.204TrpThr: 1.204 ± 0.263
0.827TrpVal: 0.827 ± 0.278
0.301TrpTrp: 0.301 ± 0.148
0.602TrpTyr: 0.602 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.655TyrAla: 1.655 ± 0.36
0.527TyrCys: 0.527 ± 0.18
2.708TyrAsp: 2.708 ± 0.429
1.58TyrGlu: 1.58 ± 0.314
1.204TyrPhe: 1.204 ± 0.261
2.633TyrGly: 2.633 ± 0.415
0.752TyrHis: 0.752 ± 0.251
1.73TyrIle: 1.73 ± 0.471
1.73TyrLys: 1.73 ± 0.329
2.407TyrLeu: 2.407 ± 0.388
0.451TyrMet: 0.451 ± 0.202
1.58TyrAsn: 1.58 ± 0.383
1.58TyrPro: 1.58 ± 0.332
1.053TyrGln: 1.053 ± 0.255
2.031TyrArg: 2.031 ± 0.339
1.881TyrSer: 1.881 ± 0.374
1.73TyrThr: 1.73 ± 0.424
2.558TyrVal: 2.558 ± 0.411
0.301TyrTrp: 0.301 ± 0.194
1.279TyrTyr: 1.279 ± 0.324
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13295 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski