Amino acid dipepetide frequency for Pseudomonas phage vB_PaeS-Yazdi-M

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.192AlaAla: 17.192 ± 3.061
0.789AlaCys: 0.789 ± 0.193
3.943AlaAsp: 3.943 ± 0.583
7.334AlaGlu: 7.334 ± 0.891
3.391AlaPhe: 3.391 ± 0.544
8.044AlaGly: 8.044 ± 0.742
1.656AlaHis: 1.656 ± 0.378
6.782AlaIle: 6.782 ± 0.736
6.782AlaLys: 6.782 ± 0.838
8.281AlaLeu: 8.281 ± 1.008
2.366AlaMet: 2.366 ± 0.554
6.782AlaAsn: 6.782 ± 0.939
4.811AlaPro: 4.811 ± 0.762
4.811AlaGln: 4.811 ± 1.493
5.915AlaArg: 5.915 ± 0.633
7.098AlaSer: 7.098 ± 1.019
7.177AlaThr: 7.177 ± 1.135
7.571AlaVal: 7.571 ± 0.951
1.498AlaTrp: 1.498 ± 0.399
3.549AlaTyr: 3.549 ± 0.403
0.0AlaXaa: 0.0 ± 0.0
Cys
1.183CysAla: 1.183 ± 0.374
0.158CysCys: 0.158 ± 0.095
0.71CysAsp: 0.71 ± 0.236
0.868CysGlu: 0.868 ± 0.278
0.473CysPhe: 0.473 ± 0.181
0.789CysGly: 0.789 ± 0.311
0.315CysHis: 0.315 ± 0.149
0.315CysIle: 0.315 ± 0.152
0.631CysLys: 0.631 ± 0.232
0.631CysLeu: 0.631 ± 0.208
0.079CysMet: 0.079 ± 0.091
0.315CysAsn: 0.315 ± 0.14
0.552CysPro: 0.552 ± 0.249
0.237CysGln: 0.237 ± 0.138
0.868CysArg: 0.868 ± 0.256
0.158CysSer: 0.158 ± 0.093
0.394CysThr: 0.394 ± 0.176
0.868CysVal: 0.868 ± 0.262
0.0CysTrp: 0.0 ± 0.0
0.473CysTyr: 0.473 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
5.205AspAla: 5.205 ± 0.694
0.631AspCys: 0.631 ± 0.221
3.391AspAsp: 3.391 ± 0.34
4.89AspGlu: 4.89 ± 0.787
2.445AspPhe: 2.445 ± 0.422
4.811AspGly: 4.811 ± 0.787
0.394AspHis: 0.394 ± 0.192
2.76AspIle: 2.76 ± 0.525
2.445AspLys: 2.445 ± 0.541
4.101AspLeu: 4.101 ± 0.46
1.025AspMet: 1.025 ± 0.298
2.603AspAsn: 2.603 ± 0.314
2.681AspPro: 2.681 ± 0.518
0.868AspGln: 0.868 ± 0.311
2.445AspArg: 2.445 ± 0.531
2.76AspSer: 2.76 ± 0.477
2.287AspThr: 2.287 ± 0.374
3.549AspVal: 3.549 ± 0.488
0.552AspTrp: 0.552 ± 0.193
2.129AspTyr: 2.129 ± 0.482
0.0AspXaa: 0.0 ± 0.0
Glu
8.044GluAla: 8.044 ± 0.834
0.71GluCys: 0.71 ± 0.23
3.155GluAsp: 3.155 ± 0.538
3.943GluGlu: 3.943 ± 0.481
2.366GluPhe: 2.366 ± 0.375
3.549GluGly: 3.549 ± 0.543
0.868GluHis: 0.868 ± 0.273
3.785GluIle: 3.785 ± 0.545
3.076GluLys: 3.076 ± 0.608
6.309GluLeu: 6.309 ± 0.749
1.814GluMet: 1.814 ± 0.399
2.287GluAsn: 2.287 ± 0.374
2.603GluPro: 2.603 ± 0.565
2.918GluGln: 2.918 ± 0.489
3.47GluArg: 3.47 ± 0.536
2.287GluSer: 2.287 ± 0.344
3.312GluThr: 3.312 ± 0.442
4.811GluVal: 4.811 ± 0.586
1.42GluTrp: 1.42 ± 0.391
1.735GluTyr: 1.735 ± 0.456
0.0GluXaa: 0.0 ± 0.0
Phe
3.233PheAla: 3.233 ± 0.584
0.71PheCys: 0.71 ± 0.283
3.785PheAsp: 3.785 ± 0.539
2.681PheGlu: 2.681 ± 0.399
1.735PhePhe: 1.735 ± 0.355
2.997PheGly: 2.997 ± 0.34
0.158PheHis: 0.158 ± 0.115
2.603PheIle: 2.603 ± 0.452
1.42PheLys: 1.42 ± 0.313
1.42PheLeu: 1.42 ± 0.293
1.183PheMet: 1.183 ± 0.271
2.287PheAsn: 2.287 ± 0.488
1.42PhePro: 1.42 ± 0.365
1.262PheGln: 1.262 ± 0.335
2.129PheArg: 2.129 ± 0.335
1.577PheSer: 1.577 ± 0.384
2.524PheThr: 2.524 ± 0.491
3.233PheVal: 3.233 ± 0.463
0.868PheTrp: 0.868 ± 0.378
1.656PheTyr: 1.656 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
7.729GlyAla: 7.729 ± 0.985
0.789GlyCys: 0.789 ± 0.332
4.495GlyAsp: 4.495 ± 0.718
5.442GlyGlu: 5.442 ± 0.601
3.391GlyPhe: 3.391 ± 0.461
6.625GlyGly: 6.625 ± 1.137
0.71GlyHis: 0.71 ± 0.291
3.628GlyIle: 3.628 ± 0.487
4.101GlyLys: 4.101 ± 0.605
6.467GlyLeu: 6.467 ± 0.732
1.104GlyMet: 1.104 ± 0.304
2.839GlyAsn: 2.839 ± 0.561
3.233GlyPro: 3.233 ± 0.411
4.574GlyGln: 4.574 ± 1.032
2.839GlyArg: 2.839 ± 0.433
4.495GlySer: 4.495 ± 1.181
5.284GlyThr: 5.284 ± 0.746
5.363GlyVal: 5.363 ± 0.569
1.341GlyTrp: 1.341 ± 0.287
2.287GlyTyr: 2.287 ± 0.353
0.0GlyXaa: 0.0 ± 0.0
His
1.341HisAla: 1.341 ± 0.389
0.315HisCys: 0.315 ± 0.219
0.394HisAsp: 0.394 ± 0.17
0.631HisGlu: 0.631 ± 0.25
0.71HisPhe: 0.71 ± 0.247
0.868HisGly: 0.868 ± 0.224
0.0HisHis: 0.0 ± 0.0
0.71HisIle: 0.71 ± 0.222
0.71HisLys: 0.71 ± 0.196
1.262HisLeu: 1.262 ± 0.388
0.237HisMet: 0.237 ± 0.153
0.473HisAsn: 0.473 ± 0.206
0.552HisPro: 0.552 ± 0.221
0.158HisGln: 0.158 ± 0.121
1.104HisArg: 1.104 ± 0.335
0.868HisSer: 0.868 ± 0.247
0.552HisThr: 0.552 ± 0.263
1.42HisVal: 1.42 ± 0.38
0.079HisTrp: 0.079 ± 0.085
0.315HisTyr: 0.315 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
7.098IleAla: 7.098 ± 0.632
0.631IleCys: 0.631 ± 0.216
4.338IleAsp: 4.338 ± 0.565
4.89IleGlu: 4.89 ± 0.684
1.183IlePhe: 1.183 ± 0.313
3.864IleGly: 3.864 ± 0.466
0.868IleHis: 0.868 ± 0.266
3.076IleIle: 3.076 ± 0.604
2.839IleLys: 2.839 ± 0.445
2.603IleLeu: 2.603 ± 0.326
1.183IleMet: 1.183 ± 0.289
3.076IleAsn: 3.076 ± 0.564
2.681IlePro: 2.681 ± 0.519
1.498IleGln: 1.498 ± 0.375
2.445IleArg: 2.445 ± 0.455
3.312IleSer: 3.312 ± 0.637
2.997IleThr: 2.997 ± 0.534
4.18IleVal: 4.18 ± 0.544
0.631IleTrp: 0.631 ± 0.248
1.341IleTyr: 1.341 ± 0.377
0.0IleXaa: 0.0 ± 0.0
Lys
7.65LysAla: 7.65 ± 0.953
0.394LysCys: 0.394 ± 0.138
2.997LysAsp: 2.997 ± 0.526
2.76LysGlu: 2.76 ± 0.45
2.208LysPhe: 2.208 ± 0.384
3.233LysGly: 3.233 ± 0.455
0.868LysHis: 0.868 ± 0.255
2.839LysIle: 2.839 ± 0.486
4.101LysLys: 4.101 ± 0.754
5.599LysLeu: 5.599 ± 0.716
1.577LysMet: 1.577 ± 0.378
2.366LysAsn: 2.366 ± 0.426
2.208LysPro: 2.208 ± 0.502
1.183LysGln: 1.183 ± 0.233
2.997LysArg: 2.997 ± 0.504
3.391LysSer: 3.391 ± 0.557
3.233LysThr: 3.233 ± 0.401
3.864LysVal: 3.864 ± 0.479
0.71LysTrp: 0.71 ± 0.219
1.262LysTyr: 1.262 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
8.202LeuAla: 8.202 ± 1.075
0.315LeuCys: 0.315 ± 0.174
3.864LeuAsp: 3.864 ± 0.502
3.943LeuGlu: 3.943 ± 0.543
3.312LeuPhe: 3.312 ± 0.423
4.495LeuGly: 4.495 ± 0.597
1.262LeuHis: 1.262 ± 0.351
3.864LeuIle: 3.864 ± 0.513
3.943LeuLys: 3.943 ± 0.536
4.968LeuLeu: 4.968 ± 0.743
1.42LeuMet: 1.42 ± 0.295
3.785LeuAsn: 3.785 ± 0.637
4.653LeuPro: 4.653 ± 0.721
3.628LeuGln: 3.628 ± 0.799
4.968LeuArg: 4.968 ± 0.761
6.151LeuSer: 6.151 ± 0.639
4.18LeuThr: 4.18 ± 0.565
3.785LeuVal: 3.785 ± 0.662
1.025LeuTrp: 1.025 ± 0.24
2.524LeuTyr: 2.524 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
3.155MetAla: 3.155 ± 0.455
0.158MetCys: 0.158 ± 0.106
0.789MetAsp: 0.789 ± 0.193
0.631MetGlu: 0.631 ± 0.21
0.868MetPhe: 0.868 ± 0.27
1.814MetGly: 1.814 ± 0.367
0.394MetHis: 0.394 ± 0.215
1.025MetIle: 1.025 ± 0.288
1.498MetLys: 1.498 ± 0.304
1.972MetLeu: 1.972 ± 0.482
0.237MetMet: 0.237 ± 0.151
1.262MetAsn: 1.262 ± 0.289
1.893MetPro: 1.893 ± 0.343
0.946MetGln: 0.946 ± 0.277
1.104MetArg: 1.104 ± 0.32
1.341MetSer: 1.341 ± 0.323
1.341MetThr: 1.341 ± 0.288
0.946MetVal: 0.946 ± 0.281
0.315MetTrp: 0.315 ± 0.148
0.631MetTyr: 0.631 ± 0.213
0.0MetXaa: 0.0 ± 0.0
Asn
5.836AsnAla: 5.836 ± 0.924
0.552AsnCys: 0.552 ± 0.219
1.893AsnAsp: 1.893 ± 0.411
2.839AsnGlu: 2.839 ± 0.516
1.498AsnPhe: 1.498 ± 0.341
4.653AsnGly: 4.653 ± 0.691
0.315AsnHis: 0.315 ± 0.142
2.129AsnIle: 2.129 ± 0.497
2.681AsnLys: 2.681 ± 0.424
2.839AsnLeu: 2.839 ± 0.403
0.631AsnMet: 0.631 ± 0.195
1.972AsnAsn: 1.972 ± 0.576
3.391AsnPro: 3.391 ± 0.492
1.341AsnGln: 1.341 ± 0.273
2.997AsnArg: 2.997 ± 0.461
2.208AsnSer: 2.208 ± 0.438
2.918AsnThr: 2.918 ± 0.464
4.732AsnVal: 4.732 ± 0.489
0.552AsnTrp: 0.552 ± 0.207
1.183AsnTyr: 1.183 ± 0.399
0.0AsnXaa: 0.0 ± 0.0
Pro
5.363ProAla: 5.363 ± 0.706
0.315ProCys: 0.315 ± 0.15
2.681ProAsp: 2.681 ± 0.546
3.312ProGlu: 3.312 ± 0.597
1.735ProPhe: 1.735 ± 0.333
3.864ProGly: 3.864 ± 0.679
1.104ProHis: 1.104 ± 0.322
2.918ProIle: 2.918 ± 0.431
2.366ProLys: 2.366 ± 0.45
2.918ProLeu: 2.918 ± 0.6
1.104ProMet: 1.104 ± 0.3
3.233ProAsn: 3.233 ± 0.654
3.233ProPro: 3.233 ± 0.523
3.155ProGln: 3.155 ± 1.138
1.814ProArg: 1.814 ± 0.435
2.76ProSer: 2.76 ± 0.568
3.864ProThr: 3.864 ± 0.661
3.785ProVal: 3.785 ± 0.512
0.552ProTrp: 0.552 ± 0.222
1.577ProTyr: 1.577 ± 0.363
0.0ProXaa: 0.0 ± 0.0
Gln
4.653GlnAla: 4.653 ± 0.88
0.473GlnCys: 0.473 ± 0.2
1.341GlnAsp: 1.341 ± 0.375
1.104GlnGlu: 1.104 ± 0.359
0.789GlnPhe: 0.789 ± 0.272
3.312GlnGly: 3.312 ± 0.887
0.473GlnHis: 0.473 ± 0.249
2.208GlnIle: 2.208 ± 0.557
2.445GlnLys: 2.445 ± 0.514
4.338GlnLeu: 4.338 ± 0.783
1.262GlnMet: 1.262 ± 0.294
2.129GlnAsn: 2.129 ± 0.446
2.445GlnPro: 2.445 ± 1.204
4.574GlnGln: 4.574 ± 1.981
2.918GlnArg: 2.918 ± 0.643
2.366GlnSer: 2.366 ± 0.492
2.839GlnThr: 2.839 ± 0.822
2.208GlnVal: 2.208 ± 0.364
0.394GlnTrp: 0.394 ± 0.158
1.104GlnTyr: 1.104 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
5.284ArgAla: 5.284 ± 0.876
0.473ArgCys: 0.473 ± 0.194
3.47ArgAsp: 3.47 ± 0.527
2.839ArgGlu: 2.839 ± 0.558
2.366ArgPhe: 2.366 ± 0.425
2.997ArgGly: 2.997 ± 0.472
0.552ArgHis: 0.552 ± 0.202
3.943ArgIle: 3.943 ± 0.582
3.233ArgLys: 3.233 ± 0.558
4.338ArgLeu: 4.338 ± 0.555
2.129ArgMet: 2.129 ± 0.352
1.262ArgAsn: 1.262 ± 0.276
2.603ArgPro: 2.603 ± 0.555
2.445ArgGln: 2.445 ± 0.386
2.524ArgArg: 2.524 ± 0.488
2.129ArgSer: 2.129 ± 0.403
2.76ArgThr: 2.76 ± 0.41
2.997ArgVal: 2.997 ± 0.458
1.104ArgTrp: 1.104 ± 0.285
2.05ArgTyr: 2.05 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
5.047SerAla: 5.047 ± 0.634
0.473SerCys: 0.473 ± 0.173
3.233SerAsp: 3.233 ± 0.583
3.076SerGlu: 3.076 ± 0.539
2.524SerPhe: 2.524 ± 0.503
5.836SerGly: 5.836 ± 0.906
0.473SerHis: 0.473 ± 0.24
2.918SerIle: 2.918 ± 0.55
4.101SerLys: 4.101 ± 0.571
4.18SerLeu: 4.18 ± 0.585
0.631SerMet: 0.631 ± 0.276
2.208SerAsn: 2.208 ± 0.561
2.287SerPro: 2.287 ± 0.41
3.076SerGln: 3.076 ± 0.624
2.839SerArg: 2.839 ± 0.379
3.312SerSer: 3.312 ± 0.572
3.155SerThr: 3.155 ± 0.585
3.47SerVal: 3.47 ± 0.376
0.71SerTrp: 0.71 ± 0.259
2.129SerTyr: 2.129 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
6.388ThrAla: 6.388 ± 1.464
0.237ThrCys: 0.237 ± 0.17
2.524ThrAsp: 2.524 ± 0.414
5.047ThrGlu: 5.047 ± 0.79
2.603ThrPhe: 2.603 ± 0.535
6.467ThrGly: 6.467 ± 0.691
0.631ThrHis: 0.631 ± 0.203
2.839ThrIle: 2.839 ± 0.515
3.47ThrLys: 3.47 ± 0.561
5.047ThrLeu: 5.047 ± 0.631
1.104ThrMet: 1.104 ± 0.266
2.76ThrAsn: 2.76 ± 0.549
3.707ThrPro: 3.707 ± 0.581
1.893ThrGln: 1.893 ± 0.419
2.208ThrArg: 2.208 ± 0.464
2.997ThrSer: 2.997 ± 0.531
3.628ThrThr: 3.628 ± 0.565
3.628ThrVal: 3.628 ± 0.672
0.71ThrTrp: 0.71 ± 0.185
2.287ThrTyr: 2.287 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
9.227ValAla: 9.227 ± 0.893
0.868ValCys: 0.868 ± 0.268
2.287ValAsp: 2.287 ± 0.318
3.864ValGlu: 3.864 ± 0.784
3.312ValPhe: 3.312 ± 0.547
4.811ValGly: 4.811 ± 0.587
0.868ValHis: 0.868 ± 0.171
3.864ValIle: 3.864 ± 0.436
4.022ValLys: 4.022 ± 0.457
3.943ValLeu: 3.943 ± 0.507
1.972ValMet: 1.972 ± 0.319
2.839ValAsn: 2.839 ± 0.469
4.416ValPro: 4.416 ± 0.728
2.76ValGln: 2.76 ± 0.559
2.997ValArg: 2.997 ± 0.549
3.312ValSer: 3.312 ± 0.505
5.284ValThr: 5.284 ± 0.927
4.338ValVal: 4.338 ± 0.566
0.946ValTrp: 0.946 ± 0.362
2.287ValTyr: 2.287 ± 0.572
0.0ValXaa: 0.0 ± 0.0
Trp
1.262TrpAla: 1.262 ± 0.318
0.315TrpCys: 0.315 ± 0.182
0.315TrpAsp: 0.315 ± 0.142
0.394TrpGlu: 0.394 ± 0.154
0.71TrpPhe: 0.71 ± 0.211
1.025TrpGly: 1.025 ± 0.401
0.394TrpHis: 0.394 ± 0.176
0.71TrpIle: 0.71 ± 0.235
0.473TrpLys: 0.473 ± 0.219
1.025TrpLeu: 1.025 ± 0.232
0.315TrpMet: 0.315 ± 0.138
0.394TrpAsn: 0.394 ± 0.173
0.946TrpPro: 0.946 ± 0.286
0.789TrpGln: 0.789 ± 0.304
1.262TrpArg: 1.262 ± 0.369
0.473TrpSer: 0.473 ± 0.198
0.868TrpThr: 0.868 ± 0.256
1.104TrpVal: 1.104 ± 0.254
0.315TrpTrp: 0.315 ± 0.143
0.946TrpTyr: 0.946 ± 0.296
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.681TyrAla: 2.681 ± 0.608
0.71TyrCys: 0.71 ± 0.307
2.366TyrAsp: 2.366 ± 0.459
1.972TyrGlu: 1.972 ± 0.362
1.341TyrPhe: 1.341 ± 0.312
2.681TyrGly: 2.681 ± 0.504
0.394TyrHis: 0.394 ± 0.153
1.814TyrIle: 1.814 ± 0.322
1.104TyrLys: 1.104 ± 0.398
2.05TyrLeu: 2.05 ± 0.367
0.868TyrMet: 0.868 ± 0.34
2.287TyrAsn: 2.287 ± 0.525
1.341TyrPro: 1.341 ± 0.331
1.262TyrGln: 1.262 ± 0.391
1.656TyrArg: 1.656 ± 0.339
2.681TyrSer: 2.681 ± 0.486
1.577TyrThr: 1.577 ± 0.337
2.366TyrVal: 2.366 ± 0.397
0.315TyrTrp: 0.315 ± 0.16
1.42TyrTyr: 1.42 ± 0.396
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12681 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski