Amino acid dipeptide frequency for Bifidobacterium phage PMBT6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
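As an illustration of what such a calculation might look like, the sketch below counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. It is only a sketch under stated assumptions: the function name `dipeptide_per_mille`, the one-letter input alphabet, and the choice to count pairs within each protein and normalise by the total number of pairs are all assumptions made here, and the database may count or normalise differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (no pair spans two proteins) and normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # "MKVA" yields the overlapping pairs "MK", "KV", "VA".
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not data from this proteome.
freqs = dipeptide_per_mille(["MKVA", "AKV"])
```

In this toy input there are five pairs in total and "KV" occurs twice, so `freqs["KV"]` is 400 ‰.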

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
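The arithmetic behind the uniform expectation can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and comparing against the plain uniform baseline is an assumption: the page does not state the exact threshold used for the red and blue marking, nor whether the error estimate is taken into account.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected=expected_per_mille()):
    """Label an observed per-mille frequency relative to the uniform baseline."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


baseline_20 = expected_per_mille(20)  # 1000 / 400 = 2.5
baseline_21 = expected_per_mille(21)  # 1000 / 441, about 2.27
ala_ala = classify(12.646)            # AlaAla value from the table
cys_cys = classify(0.094)             # CysCys value from the table
```

With the 20-letter baseline of 2.5 ‰, AlaAla (12.646 ‰) comes out as "over" and CysCys (0.094 ‰) as "under".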

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.646AlaAla: 12.646 ± 1.646
1.311AlaCys: 1.311 ± 0.342
7.681AlaAsp: 7.681 ± 0.991
5.621AlaGlu: 5.621 ± 1.027
2.904AlaPhe: 2.904 ± 0.603
9.18AlaGly: 9.18 ± 1.059
1.874AlaHis: 1.874 ± 0.442
4.122AlaIle: 4.122 ± 0.827
4.122AlaLys: 4.122 ± 0.641
8.431AlaLeu: 8.431 ± 1.297
3.372AlaMet: 3.372 ± 0.515
3.56AlaAsn: 3.56 ± 0.539
2.81AlaPro: 2.81 ± 0.597
2.81AlaGln: 2.81 ± 0.629
5.902AlaArg: 5.902 ± 0.864
6.838AlaSer: 6.838 ± 1.108
5.527AlaThr: 5.527 ± 0.877
6.932AlaVal: 6.932 ± 1.353
2.623AlaTrp: 2.623 ± 0.44
2.623AlaTyr: 2.623 ± 0.506
0.0AlaXaa: 0.0 ± 0.0
Cys
1.124CysAla: 1.124 ± 0.359
0.094CysCys: 0.094 ± 0.087
0.843CysAsp: 0.843 ± 0.274
0.749CysGlu: 0.749 ± 0.283
0.375CysPhe: 0.375 ± 0.183
1.218CysGly: 1.218 ± 0.42
0.375CysHis: 0.375 ± 0.203
0.094CysIle: 0.094 ± 0.088
0.187CysLys: 0.187 ± 0.132
0.656CysLeu: 0.656 ± 0.239
0.749CysMet: 0.749 ± 0.283
0.375CysAsn: 0.375 ± 0.18
0.843CysPro: 0.843 ± 0.313
0.281CysGln: 0.281 ± 0.144
1.218CysArg: 1.218 ± 0.305
0.562CysSer: 0.562 ± 0.302
0.468CysThr: 0.468 ± 0.219
0.562CysVal: 0.562 ± 0.225
0.375CysTrp: 0.375 ± 0.175
0.187CysTyr: 0.187 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
8.244AspAla: 8.244 ± 0.794
0.749AspCys: 0.749 ± 0.37
3.934AspAsp: 3.934 ± 0.71
4.684AspGlu: 4.684 ± 0.713
1.03AspPhe: 1.03 ± 0.241
7.494AspGly: 7.494 ± 1.034
1.124AspHis: 1.124 ± 0.313
3.372AspIle: 3.372 ± 0.589
3.185AspLys: 3.185 ± 0.479
7.213AspLeu: 7.213 ± 0.943
3.091AspMet: 3.091 ± 0.507
1.874AspAsn: 1.874 ± 0.343
3.466AspPro: 3.466 ± 0.546
2.342AspGln: 2.342 ± 0.589
3.653AspArg: 3.653 ± 0.462
4.028AspSer: 4.028 ± 0.56
3.841AspThr: 3.841 ± 0.682
4.122AspVal: 4.122 ± 0.852
1.311AspTrp: 1.311 ± 0.323
2.717AspTyr: 2.717 ± 0.543
0.0AspXaa: 0.0 ± 0.0
Glu
4.215GluAla: 4.215 ± 0.687
0.468GluCys: 0.468 ± 0.213
3.841GluAsp: 3.841 ± 0.779
2.529GluGlu: 2.529 ± 0.631
0.749GluPhe: 0.749 ± 0.217
3.185GluGly: 3.185 ± 0.383
1.218GluHis: 1.218 ± 0.33
2.81GluIle: 2.81 ± 0.518
1.78GluLys: 1.78 ± 0.43
4.684GluLeu: 4.684 ± 0.813
1.78GluMet: 1.78 ± 0.421
1.311GluAsn: 1.311 ± 0.314
2.81GluPro: 2.81 ± 0.595
2.061GluGln: 2.061 ± 0.544
5.433GluArg: 5.433 ± 0.971
3.091GluSer: 3.091 ± 0.619
5.059GluThr: 5.059 ± 0.598
2.529GluVal: 2.529 ± 0.464
0.749GluTrp: 0.749 ± 0.231
1.593GluTyr: 1.593 ± 0.416
0.0GluXaa: 0.0 ± 0.0
Phe
2.061PheAla: 2.061 ± 0.378
0.094PheCys: 0.094 ± 0.084
1.967PheAsp: 1.967 ± 0.42
1.124PheGlu: 1.124 ± 0.406
0.468PhePhe: 0.468 ± 0.224
3.279PheGly: 3.279 ± 0.535
0.281PheHis: 0.281 ± 0.155
0.749PheIle: 0.749 ± 0.255
1.03PheLys: 1.03 ± 0.269
1.874PheLeu: 1.874 ± 0.517
0.749PheMet: 0.749 ± 0.287
1.499PheAsn: 1.499 ± 0.323
0.749PhePro: 0.749 ± 0.27
0.749PheGln: 0.749 ± 0.264
1.593PheArg: 1.593 ± 0.358
1.405PheSer: 1.405 ± 0.295
2.061PheThr: 2.061 ± 0.54
0.656PheVal: 0.656 ± 0.275
0.094PheTrp: 0.094 ± 0.098
0.468PheTyr: 0.468 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
6.183GlyAla: 6.183 ± 0.981
0.375GlyCys: 0.375 ± 0.178
6.464GlyAsp: 6.464 ± 0.884
3.279GlyGlu: 3.279 ± 0.669
2.436GlyPhe: 2.436 ± 0.498
6.089GlyGly: 6.089 ± 1.055
1.78GlyHis: 1.78 ± 0.457
5.808GlyIle: 5.808 ± 1.094
4.778GlyLys: 4.778 ± 0.805
7.494GlyLeu: 7.494 ± 1.09
2.342GlyMet: 2.342 ± 0.454
3.372GlyAsn: 3.372 ± 0.627
2.436GlyPro: 2.436 ± 0.602
2.623GlyGln: 2.623 ± 0.682
5.808GlyArg: 5.808 ± 0.807
5.995GlySer: 5.995 ± 0.913
7.588GlyThr: 7.588 ± 1.159
6.37GlyVal: 6.37 ± 0.751
1.686GlyTrp: 1.686 ± 0.396
1.874GlyTyr: 1.874 ± 0.614
0.0GlyXaa: 0.0 ± 0.0
His
2.248HisAla: 2.248 ± 0.373
0.281HisCys: 0.281 ± 0.152
2.155HisAsp: 2.155 ± 0.518
1.03HisGlu: 1.03 ± 0.299
0.281HisPhe: 0.281 ± 0.162
1.78HisGly: 1.78 ± 0.468
0.562HisHis: 0.562 ± 0.233
1.03HisIle: 1.03 ± 0.336
0.468HisLys: 0.468 ± 0.21
1.218HisLeu: 1.218 ± 0.255
0.0HisMet: 0.0 ± 0.0
0.562HisAsn: 0.562 ± 0.238
1.499HisPro: 1.499 ± 0.393
0.375HisGln: 0.375 ± 0.24
1.311HisArg: 1.311 ± 0.398
1.405HisSer: 1.405 ± 0.364
1.686HisThr: 1.686 ± 0.389
2.248HisVal: 2.248 ± 0.485
0.562HisTrp: 0.562 ± 0.34
0.562HisTyr: 0.562 ± 0.285
0.0HisXaa: 0.0 ± 0.0
Ile
4.496IleAla: 4.496 ± 0.764
0.562IleCys: 0.562 ± 0.236
3.372IleAsp: 3.372 ± 0.64
2.717IleGlu: 2.717 ± 0.503
0.749IlePhe: 0.749 ± 0.246
5.433IleGly: 5.433 ± 0.99
1.405IleHis: 1.405 ± 0.354
3.279IleIle: 3.279 ± 0.708
2.904IleLys: 2.904 ± 0.464
3.653IleLeu: 3.653 ± 0.505
1.03IleMet: 1.03 ± 0.275
1.686IleAsn: 1.686 ± 0.4
2.717IlePro: 2.717 ± 0.564
2.342IleGln: 2.342 ± 0.721
3.653IleArg: 3.653 ± 0.709
4.403IleSer: 4.403 ± 0.858
4.871IleThr: 4.871 ± 0.868
3.279IleVal: 3.279 ± 0.484
0.656IleTrp: 0.656 ± 0.278
0.843IleTyr: 0.843 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
5.902LysAla: 5.902 ± 0.876
0.187LysCys: 0.187 ± 0.118
4.028LysAsp: 4.028 ± 0.854
2.248LysGlu: 2.248 ± 0.447
0.656LysPhe: 0.656 ± 0.266
3.934LysGly: 3.934 ± 0.972
1.03LysHis: 1.03 ± 0.249
1.967LysIle: 1.967 ± 0.555
1.967LysLys: 1.967 ± 0.521
2.998LysLeu: 2.998 ± 0.45
1.311LysMet: 1.311 ± 0.406
0.749LysAsn: 0.749 ± 0.256
2.623LysPro: 2.623 ± 0.623
2.061LysGln: 2.061 ± 0.432
2.717LysArg: 2.717 ± 0.539
2.061LysSer: 2.061 ± 0.464
4.59LysThr: 4.59 ± 0.875
1.874LysVal: 1.874 ± 0.495
0.749LysTrp: 0.749 ± 0.239
1.124LysTyr: 1.124 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
9.274LeuAla: 9.274 ± 0.844
0.843LeuCys: 0.843 ± 0.284
6.276LeuAsp: 6.276 ± 0.748
4.403LeuGlu: 4.403 ± 0.694
2.155LeuPhe: 2.155 ± 0.501
5.714LeuGly: 5.714 ± 0.814
1.686LeuHis: 1.686 ± 0.373
4.965LeuIle: 4.965 ± 0.594
4.59LeuLys: 4.59 ± 0.678
5.714LeuLeu: 5.714 ± 0.736
1.218LeuMet: 1.218 ± 0.311
2.998LeuAsn: 2.998 ± 0.646
4.122LeuPro: 4.122 ± 0.632
2.904LeuGln: 2.904 ± 0.538
6.745LeuArg: 6.745 ± 0.848
4.59LeuSer: 4.59 ± 0.616
5.34LeuThr: 5.34 ± 0.575
4.215LeuVal: 4.215 ± 0.661
1.405LeuTrp: 1.405 ± 0.32
1.874LeuTyr: 1.874 ± 0.425
0.0LeuXaa: 0.0 ± 0.0
Met
2.904MetAla: 2.904 ± 0.449
0.375MetCys: 0.375 ± 0.191
1.499MetAsp: 1.499 ± 0.378
0.843MetGlu: 0.843 ± 0.245
0.375MetPhe: 0.375 ± 0.171
2.155MetGly: 2.155 ± 0.41
0.562MetHis: 0.562 ± 0.217
1.03MetIle: 1.03 ± 0.352
1.311MetLys: 1.311 ± 0.367
2.529MetLeu: 2.529 ± 0.494
0.656MetMet: 0.656 ± 0.262
0.468MetAsn: 0.468 ± 0.192
1.124MetPro: 1.124 ± 0.328
1.311MetGln: 1.311 ± 0.314
2.342MetArg: 2.342 ± 0.523
2.623MetSer: 2.623 ± 0.474
2.061MetThr: 2.061 ± 0.534
1.78MetVal: 1.78 ± 0.473
0.375MetTrp: 0.375 ± 0.194
0.468MetTyr: 0.468 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
3.185AsnAla: 3.185 ± 0.432
0.375AsnCys: 0.375 ± 0.187
1.593AsnAsp: 1.593 ± 0.374
1.593AsnGlu: 1.593 ± 0.465
0.843AsnPhe: 0.843 ± 0.289
3.934AsnGly: 3.934 ± 0.842
1.03AsnHis: 1.03 ± 0.28
1.686AsnIle: 1.686 ± 0.443
1.311AsnLys: 1.311 ± 0.358
2.248AsnLeu: 2.248 ± 0.429
0.562AsnMet: 0.562 ± 0.249
1.218AsnAsn: 1.218 ± 0.328
2.904AsnPro: 2.904 ± 0.529
1.405AsnGln: 1.405 ± 0.342
1.874AsnArg: 1.874 ± 0.506
1.686AsnSer: 1.686 ± 0.414
1.686AsnThr: 1.686 ± 0.312
2.529AsnVal: 2.529 ± 0.517
0.468AsnTrp: 0.468 ± 0.215
0.187AsnTyr: 0.187 ± 0.14
0.0AsnXaa: 0.0 ± 0.0
Pro
3.841ProAla: 3.841 ± 0.689
0.468ProCys: 0.468 ± 0.196
4.59ProAsp: 4.59 ± 0.731
2.998ProGlu: 2.998 ± 0.567
0.656ProPhe: 0.656 ± 0.228
2.998ProGly: 2.998 ± 0.615
1.593ProHis: 1.593 ± 0.367
2.155ProIle: 2.155 ± 0.637
2.155ProLys: 2.155 ± 0.451
2.529ProLeu: 2.529 ± 0.477
0.562ProMet: 0.562 ± 0.266
1.405ProAsn: 1.405 ± 0.374
1.967ProPro: 1.967 ± 0.447
2.061ProGln: 2.061 ± 0.371
2.81ProArg: 2.81 ± 0.692
2.904ProSer: 2.904 ± 0.417
3.56ProThr: 3.56 ± 0.543
4.403ProVal: 4.403 ± 0.733
0.843ProTrp: 0.843 ± 0.271
0.843ProTyr: 0.843 ± 0.262
0.0ProXaa: 0.0 ± 0.0
Gln
4.59GlnAla: 4.59 ± 0.592
0.562GlnCys: 0.562 ± 0.269
1.499GlnAsp: 1.499 ± 0.396
1.593GlnGlu: 1.593 ± 0.409
0.843GlnPhe: 0.843 ± 0.303
2.529GlnGly: 2.529 ± 0.594
0.749GlnHis: 0.749 ± 0.228
2.81GlnIle: 2.81 ± 0.633
1.03GlnLys: 1.03 ± 0.275
2.904GlnLeu: 2.904 ± 0.635
1.03GlnMet: 1.03 ± 0.312
1.218GlnAsn: 1.218 ± 0.381
1.874GlnPro: 1.874 ± 0.335
1.686GlnGln: 1.686 ± 0.488
1.967GlnArg: 1.967 ± 0.55
1.874GlnSer: 1.874 ± 0.465
2.998GlnThr: 2.998 ± 0.708
2.248GlnVal: 2.248 ± 0.47
0.749GlnTrp: 0.749 ± 0.286
0.656GlnTyr: 0.656 ± 0.243
0.0GlnXaa: 0.0 ± 0.0
Arg
4.684ArgAla: 4.684 ± 0.757
1.593ArgCys: 1.593 ± 0.442
4.028ArgAsp: 4.028 ± 0.534
4.028ArgGlu: 4.028 ± 0.741
1.78ArgPhe: 1.78 ± 0.413
4.403ArgGly: 4.403 ± 0.707
1.499ArgHis: 1.499 ± 0.485
4.122ArgIle: 4.122 ± 0.698
2.436ArgLys: 2.436 ± 0.382
8.431ArgLeu: 8.431 ± 1.044
2.436ArgMet: 2.436 ± 0.54
1.967ArgAsn: 1.967 ± 0.47
2.342ArgPro: 2.342 ± 0.514
2.061ArgGln: 2.061 ± 0.656
5.995ArgArg: 5.995 ± 1.247
3.091ArgSer: 3.091 ± 0.462
4.028ArgThr: 4.028 ± 0.714
4.496ArgVal: 4.496 ± 0.765
1.686ArgTrp: 1.686 ± 0.449
2.436ArgTyr: 2.436 ± 0.502
0.0ArgXaa: 0.0 ± 0.0
Ser
5.995SerAla: 5.995 ± 1.471
0.562SerCys: 0.562 ± 0.198
5.059SerAsp: 5.059 ± 0.536
3.56SerGlu: 3.56 ± 0.732
2.342SerPhe: 2.342 ± 0.388
7.213SerGly: 7.213 ± 1.058
0.749SerHis: 0.749 ± 0.248
3.185SerIle: 3.185 ± 0.487
2.436SerLys: 2.436 ± 0.554
5.152SerLeu: 5.152 ± 0.696
1.874SerMet: 1.874 ± 0.427
1.874SerAsn: 1.874 ± 0.479
2.904SerPro: 2.904 ± 0.481
2.248SerGln: 2.248 ± 0.547
3.372SerArg: 3.372 ± 0.617
3.466SerSer: 3.466 ± 0.582
4.028SerThr: 4.028 ± 0.612
4.309SerVal: 4.309 ± 0.7
1.499SerTrp: 1.499 ± 0.379
1.593SerTyr: 1.593 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
9.18ThrAla: 9.18 ± 1.429
0.749ThrCys: 0.749 ± 0.255
4.403ThrAsp: 4.403 ± 0.852
3.747ThrGlu: 3.747 ± 0.73
1.78ThrPhe: 1.78 ± 0.446
6.37ThrGly: 6.37 ± 0.877
1.405ThrHis: 1.405 ± 0.42
6.464ThrIle: 6.464 ± 0.906
3.747ThrLys: 3.747 ± 0.566
5.34ThrLeu: 5.34 ± 0.845
1.686ThrMet: 1.686 ± 0.382
2.436ThrAsn: 2.436 ± 0.488
3.934ThrPro: 3.934 ± 0.679
2.061ThrGln: 2.061 ± 0.399
3.279ThrArg: 3.279 ± 0.611
3.841ThrSer: 3.841 ± 0.799
6.183ThrThr: 6.183 ± 1.109
6.276ThrVal: 6.276 ± 1.007
0.843ThrTrp: 0.843 ± 0.223
1.686ThrTyr: 1.686 ± 0.495
0.0ThrXaa: 0.0 ± 0.0
Val
5.808ValAla: 5.808 ± 0.686
0.843ValCys: 0.843 ± 0.355
5.34ValAsp: 5.34 ± 0.733
3.185ValGlu: 3.185 ± 0.516
1.405ValPhe: 1.405 ± 0.321
4.215ValGly: 4.215 ± 0.883
1.03ValHis: 1.03 ± 0.258
2.998ValIle: 2.998 ± 0.491
4.028ValLys: 4.028 ± 0.53
3.934ValLeu: 3.934 ± 0.66
1.405ValMet: 1.405 ± 0.325
1.686ValAsn: 1.686 ± 0.394
2.436ValPro: 2.436 ± 0.455
2.248ValGln: 2.248 ± 0.564
4.309ValArg: 4.309 ± 0.575
6.838ValSer: 6.838 ± 1.182
6.557ValThr: 6.557 ± 1.019
3.934ValVal: 3.934 ± 0.71
2.342ValTrp: 2.342 ± 0.538
1.124ValTyr: 1.124 ± 0.264
0.0ValXaa: 0.0 ± 0.0
Trp
1.499TrpAla: 1.499 ± 0.398
0.562TrpCys: 0.562 ± 0.232
1.218TrpAsp: 1.218 ± 0.323
0.468TrpGlu: 0.468 ± 0.196
0.468TrpPhe: 0.468 ± 0.232
1.03TrpGly: 1.03 ± 0.313
0.749TrpHis: 0.749 ± 0.255
0.937TrpIle: 0.937 ± 0.281
0.843TrpLys: 0.843 ± 0.335
2.248TrpLeu: 2.248 ± 0.5
0.468TrpMet: 0.468 ± 0.211
1.405TrpAsn: 1.405 ± 0.616
0.937TrpPro: 0.937 ± 0.274
1.124TrpGln: 1.124 ± 0.295
1.593TrpArg: 1.593 ± 0.393
1.499TrpSer: 1.499 ± 0.43
1.686TrpThr: 1.686 ± 0.354
0.937TrpVal: 0.937 ± 0.36
0.094TrpTrp: 0.094 ± 0.095
0.094TrpTyr: 0.094 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.81TyrAla: 2.81 ± 0.46
0.281TyrCys: 0.281 ± 0.149
1.686TyrAsp: 1.686 ± 0.417
1.499TyrGlu: 1.499 ± 0.388
0.656TyrPhe: 0.656 ± 0.267
2.623TyrGly: 2.623 ± 0.639
0.562TyrHis: 0.562 ± 0.229
0.468TyrIle: 0.468 ± 0.226
0.468TyrLys: 0.468 ± 0.182
1.78TyrLeu: 1.78 ± 0.378
0.375TyrMet: 0.375 ± 0.173
0.656TyrAsn: 0.656 ± 0.227
0.843TyrPro: 0.843 ± 0.235
0.749TyrGln: 0.749 ± 0.227
1.967TyrArg: 1.967 ± 0.568
1.218TyrSer: 1.218 ± 0.316
1.593TyrThr: 1.593 ± 0.366
1.967TyrVal: 1.967 ± 0.367
0.749TyrTrp: 0.749 ± 0.273
0.843TyrTyr: 0.843 ± 0.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (10676 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
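A protein-level bootstrap of this kind might be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples is taken as the error. This is only a sketch under assumptions. The function names, the fixed seed, the use of the sample standard deviation as the reported error, and the overlapping-pair counting are all choices made here, not details confirmed by the page.

```python
import random
from statistics import mean, stdev


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_dipeptide(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of `pair`.

    Each round resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input, not data from this proteome: identical proteins give zero spread.
m, sd = bootstrap_dipeptide(["AAK"] * 5, "AA")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual positions.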

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski