Amino acid dipepetide frequency for Staphylococcus phage TEM126

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.564AlaAla: 1.564 ± 0.534
0.695AlaCys: 0.695 ± 0.235
2.519AlaAsp: 2.519 ± 0.437
3.735AlaGlu: 3.735 ± 0.669
2.953AlaPhe: 2.953 ± 0.52
3.735AlaGly: 3.735 ± 0.672
0.695AlaHis: 0.695 ± 0.306
5.038AlaIle: 5.038 ± 1.252
4.256AlaLys: 4.256 ± 0.64
4.604AlaLeu: 4.604 ± 0.776
1.998AlaMet: 1.998 ± 0.489
3.996AlaAsn: 3.996 ± 0.592
1.477AlaPro: 1.477 ± 0.296
1.998AlaGln: 1.998 ± 0.395
3.127AlaArg: 3.127 ± 0.607
3.648AlaSer: 3.648 ± 0.665
3.475AlaThr: 3.475 ± 0.55
3.214AlaVal: 3.214 ± 0.737
0.782AlaTrp: 0.782 ± 0.308
3.127AlaTyr: 3.127 ± 0.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.174CysAla: 0.174 ± 0.125
0.0CysCys: 0.0 ± 0.0
0.261CysAsp: 0.261 ± 0.132
0.261CysGlu: 0.261 ± 0.151
0.347CysPhe: 0.347 ± 0.16
0.087CysGly: 0.087 ± 0.087
0.0CysHis: 0.0 ± 0.0
0.608CysIle: 0.608 ± 0.219
0.695CysLys: 0.695 ± 0.284
0.434CysLeu: 0.434 ± 0.209
0.0CysMet: 0.0 ± 0.0
0.434CysAsn: 0.434 ± 0.256
0.261CysPro: 0.261 ± 0.115
0.261CysGln: 0.261 ± 0.147
0.434CysArg: 0.434 ± 0.223
0.434CysSer: 0.434 ± 0.214
0.261CysThr: 0.261 ± 0.142
0.0CysVal: 0.0 ± 0.0
0.087CysTrp: 0.087 ± 0.079
0.347CysTyr: 0.347 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
2.345AspAla: 2.345 ± 0.371
0.347AspCys: 0.347 ± 0.137
3.735AspAsp: 3.735 ± 0.773
5.994AspGlu: 5.994 ± 0.929
3.127AspPhe: 3.127 ± 0.615
3.996AspGly: 3.996 ± 0.606
0.521AspHis: 0.521 ± 0.24
4.083AspIle: 4.083 ± 0.612
5.994AspLys: 5.994 ± 0.668
4.951AspLeu: 4.951 ± 0.848
1.737AspMet: 1.737 ± 0.38
4.343AspAsn: 4.343 ± 0.71
1.303AspPro: 1.303 ± 0.368
0.782AspGln: 0.782 ± 0.255
2.085AspArg: 2.085 ± 0.458
3.648AspSer: 3.648 ± 0.659
3.735AspThr: 3.735 ± 0.538
3.648AspVal: 3.648 ± 0.543
0.695AspTrp: 0.695 ± 0.17
2.519AspTyr: 2.519 ± 0.539
0.0AspXaa: 0.0 ± 0.0
Glu
5.125GluAla: 5.125 ± 0.865
0.347GluCys: 0.347 ± 0.162
2.78GluAsp: 2.78 ± 0.532
6.341GluGlu: 6.341 ± 0.952
3.562GluPhe: 3.562 ± 0.554
3.301GluGly: 3.301 ± 0.576
1.303GluHis: 1.303 ± 0.306
6.167GluIle: 6.167 ± 0.982
5.646GluLys: 5.646 ± 1.016
6.167GluLeu: 6.167 ± 1.007
2.085GluMet: 2.085 ± 0.458
4.43GluAsn: 4.43 ± 0.643
1.737GluPro: 1.737 ± 0.379
3.562GluGln: 3.562 ± 0.565
3.735GluArg: 3.735 ± 0.652
4.256GluSer: 4.256 ± 0.751
3.127GluThr: 3.127 ± 0.526
5.125GluVal: 5.125 ± 0.66
0.608GluTrp: 0.608 ± 0.247
5.125GluTyr: 5.125 ± 0.698
0.0GluXaa: 0.0 ± 0.0
Phe
1.911PheAla: 1.911 ± 0.359
0.347PheCys: 0.347 ± 0.13
2.867PheAsp: 2.867 ± 0.471
4.083PheGlu: 4.083 ± 0.654
1.303PhePhe: 1.303 ± 0.264
2.78PheGly: 2.78 ± 0.505
0.695PheHis: 0.695 ± 0.467
3.475PheIle: 3.475 ± 0.607
4.083PheLys: 4.083 ± 0.592
3.127PheLeu: 3.127 ± 0.602
1.216PheMet: 1.216 ± 0.375
3.562PheAsn: 3.562 ± 0.527
0.869PhePro: 0.869 ± 0.316
1.477PheGln: 1.477 ± 0.617
1.042PheArg: 1.042 ± 0.316
2.953PheSer: 2.953 ± 0.539
2.953PheThr: 2.953 ± 0.49
2.867PheVal: 2.867 ± 0.633
0.261PheTrp: 0.261 ± 0.144
1.737PheTyr: 1.737 ± 0.328
0.0PheXaa: 0.0 ± 0.0
Gly
3.301GlyAla: 3.301 ± 0.559
0.261GlyCys: 0.261 ± 0.145
3.214GlyAsp: 3.214 ± 0.522
3.301GlyGlu: 3.301 ± 0.574
2.085GlyPhe: 2.085 ± 0.484
3.562GlyGly: 3.562 ± 0.644
1.737GlyHis: 1.737 ± 0.496
4.17GlyIle: 4.17 ± 0.559
5.125GlyLys: 5.125 ± 0.606
5.386GlyLeu: 5.386 ± 0.866
1.477GlyMet: 1.477 ± 0.344
3.301GlyAsn: 3.301 ± 0.605
0.608GlyPro: 0.608 ± 0.25
1.824GlyGln: 1.824 ± 0.395
2.345GlyArg: 2.345 ± 0.457
2.519GlySer: 2.519 ± 0.442
3.909GlyThr: 3.909 ± 0.665
4.517GlyVal: 4.517 ± 0.594
1.129GlyTrp: 1.129 ± 0.321
2.78GlyTyr: 2.78 ± 0.399
0.0GlyXaa: 0.0 ± 0.0
His
1.737HisAla: 1.737 ± 0.446
0.174HisCys: 0.174 ± 0.118
1.129HisAsp: 1.129 ± 0.262
1.129HisGlu: 1.129 ± 0.308
0.521HisPhe: 0.521 ± 0.194
1.477HisGly: 1.477 ± 0.323
0.347HisHis: 0.347 ± 0.18
1.216HisIle: 1.216 ± 0.452
0.782HisLys: 0.782 ± 0.273
1.216HisLeu: 1.216 ± 0.252
0.521HisMet: 0.521 ± 0.262
1.042HisAsn: 1.042 ± 0.357
0.434HisPro: 0.434 ± 0.159
0.434HisGln: 0.434 ± 0.22
0.261HisArg: 0.261 ± 0.152
1.042HisSer: 1.042 ± 0.257
0.608HisThr: 0.608 ± 0.238
1.216HisVal: 1.216 ± 0.365
0.0HisTrp: 0.0 ± 0.0
0.869HisTyr: 0.869 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
5.82IleAla: 5.82 ± 1.005
0.087IleCys: 0.087 ± 0.09
5.646IleAsp: 5.646 ± 0.662
5.646IleGlu: 5.646 ± 0.907
3.127IlePhe: 3.127 ± 0.494
4.083IleGly: 4.083 ± 0.596
1.911IleHis: 1.911 ± 0.399
4.517IleIle: 4.517 ± 0.905
6.776IleLys: 6.776 ± 0.753
4.951IleLeu: 4.951 ± 0.58
1.824IleMet: 1.824 ± 0.348
4.951IleAsn: 4.951 ± 0.543
1.737IlePro: 1.737 ± 0.281
2.172IleGln: 2.172 ± 0.426
2.867IleArg: 2.867 ± 0.583
4.517IleSer: 4.517 ± 0.654
5.559IleThr: 5.559 ± 0.877
5.733IleVal: 5.733 ± 1.531
1.477IleTrp: 1.477 ± 0.639
2.953IleTyr: 2.953 ± 0.502
0.0IleXaa: 0.0 ± 0.0
Lys
4.691LysAla: 4.691 ± 0.493
0.174LysCys: 0.174 ± 0.137
6.081LysAsp: 6.081 ± 0.832
7.036LysGlu: 7.036 ± 1.02
2.867LysPhe: 2.867 ± 0.562
5.125LysGly: 5.125 ± 0.742
1.303LysHis: 1.303 ± 0.375
5.646LysIle: 5.646 ± 0.775
8.773LysLys: 8.773 ± 1.007
7.036LysLeu: 7.036 ± 0.982
2.519LysMet: 2.519 ± 0.481
6.515LysAsn: 6.515 ± 1.008
2.867LysPro: 2.867 ± 0.555
4.778LysGln: 4.778 ± 0.809
4.343LysArg: 4.343 ± 0.796
5.125LysSer: 5.125 ± 0.732
5.299LysThr: 5.299 ± 0.826
5.907LysVal: 5.907 ± 0.755
0.695LysTrp: 0.695 ± 0.231
3.909LysTyr: 3.909 ± 0.783
0.0LysXaa: 0.0 ± 0.0
Leu
2.78LeuAla: 2.78 ± 0.606
0.434LeuCys: 0.434 ± 0.201
3.735LeuAsp: 3.735 ± 0.564
5.82LeuGlu: 5.82 ± 0.7
3.648LeuPhe: 3.648 ± 0.663
4.256LeuGly: 4.256 ± 0.529
1.042LeuHis: 1.042 ± 0.291
4.691LeuIle: 4.691 ± 0.627
7.731LeuLys: 7.731 ± 0.612
5.125LeuLeu: 5.125 ± 0.616
1.998LeuMet: 1.998 ± 0.394
5.559LeuAsn: 5.559 ± 0.506
2.606LeuPro: 2.606 ± 0.482
3.648LeuGln: 3.648 ± 0.579
3.822LeuArg: 3.822 ± 0.667
4.343LeuSer: 4.343 ± 0.641
5.125LeuThr: 5.125 ± 0.719
4.517LeuVal: 4.517 ± 0.91
0.695LeuTrp: 0.695 ± 0.247
3.475LeuTyr: 3.475 ± 0.677
0.0LeuXaa: 0.0 ± 0.0
Met
1.477MetAla: 1.477 ± 0.349
0.087MetCys: 0.087 ± 0.081
1.129MetAsp: 1.129 ± 0.271
2.259MetGlu: 2.259 ± 0.499
0.695MetPhe: 0.695 ± 0.203
1.042MetGly: 1.042 ± 0.316
0.261MetHis: 0.261 ± 0.204
1.737MetIle: 1.737 ± 0.329
1.824MetLys: 1.824 ± 0.407
2.519MetLeu: 2.519 ± 0.367
0.782MetMet: 0.782 ± 0.208
1.824MetAsn: 1.824 ± 0.36
0.608MetPro: 0.608 ± 0.221
1.564MetGln: 1.564 ± 0.388
1.042MetArg: 1.042 ± 0.287
1.39MetSer: 1.39 ± 0.371
2.78MetThr: 2.78 ± 0.534
1.042MetVal: 1.042 ± 0.301
0.521MetTrp: 0.521 ± 0.22
1.303MetTyr: 1.303 ± 0.341
0.0MetXaa: 0.0 ± 0.0
Asn
5.733AsnAla: 5.733 ± 0.924
0.261AsnCys: 0.261 ± 0.196
4.864AsnAsp: 4.864 ± 0.67
4.778AsnGlu: 4.778 ± 0.628
3.909AsnPhe: 3.909 ± 0.67
5.125AsnGly: 5.125 ± 0.67
0.869AsnHis: 0.869 ± 0.27
4.691AsnIle: 4.691 ± 0.528
6.081AsnLys: 6.081 ± 0.732
4.604AsnLeu: 4.604 ± 0.65
1.564AsnMet: 1.564 ± 0.337
4.951AsnAsn: 4.951 ± 0.885
2.78AsnPro: 2.78 ± 0.501
2.432AsnGln: 2.432 ± 0.446
2.78AsnArg: 2.78 ± 0.517
3.214AsnSer: 3.214 ± 0.482
3.648AsnThr: 3.648 ± 0.521
4.778AsnVal: 4.778 ± 0.608
0.782AsnTrp: 0.782 ± 0.21
3.214AsnTyr: 3.214 ± 0.579
0.0AsnXaa: 0.0 ± 0.0
Pro
1.303ProAla: 1.303 ± 0.295
0.261ProCys: 0.261 ± 0.151
1.477ProAsp: 1.477 ± 0.307
1.477ProGlu: 1.477 ± 0.255
1.39ProPhe: 1.39 ± 0.359
1.65ProGly: 1.65 ± 0.43
0.174ProHis: 0.174 ± 0.111
2.085ProIle: 2.085 ± 0.422
2.953ProLys: 2.953 ± 0.595
1.129ProLeu: 1.129 ± 0.288
0.956ProMet: 0.956 ± 0.292
2.259ProAsn: 2.259 ± 0.358
0.608ProPro: 0.608 ± 0.251
1.042ProGln: 1.042 ± 0.328
1.303ProArg: 1.303 ± 0.377
1.129ProSer: 1.129 ± 0.337
1.477ProThr: 1.477 ± 0.371
2.172ProVal: 2.172 ± 0.463
0.087ProTrp: 0.087 ± 0.102
1.216ProTyr: 1.216 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
2.345GlnAla: 2.345 ± 0.397
0.434GlnCys: 0.434 ± 0.183
2.606GlnAsp: 2.606 ± 0.497
2.78GlnGlu: 2.78 ± 0.576
1.998GlnPhe: 1.998 ± 0.366
2.172GlnGly: 2.172 ± 0.364
0.521GlnHis: 0.521 ± 0.161
2.867GlnIle: 2.867 ± 0.814
3.127GlnLys: 3.127 ± 0.481
2.693GlnLeu: 2.693 ± 0.523
1.042GlnMet: 1.042 ± 0.293
2.78GlnAsn: 2.78 ± 0.735
1.216GlnPro: 1.216 ± 0.27
1.477GlnGln: 1.477 ± 0.445
1.65GlnArg: 1.65 ± 0.441
2.345GlnSer: 2.345 ± 0.462
2.172GlnThr: 2.172 ± 0.491
1.911GlnVal: 1.911 ± 0.431
0.347GlnTrp: 0.347 ± 0.178
0.956GlnTyr: 0.956 ± 0.318
0.0GlnXaa: 0.0 ± 0.0
Arg
1.564ArgAla: 1.564 ± 0.363
0.434ArgCys: 0.434 ± 0.192
1.998ArgAsp: 1.998 ± 0.535
2.519ArgGlu: 2.519 ± 0.391
1.737ArgPhe: 1.737 ± 0.447
2.085ArgGly: 2.085 ± 0.573
1.129ArgHis: 1.129 ± 0.324
3.301ArgIle: 3.301 ± 0.463
5.038ArgLys: 5.038 ± 0.805
4.17ArgLeu: 4.17 ± 0.585
0.869ArgMet: 0.869 ± 0.257
3.648ArgAsn: 3.648 ± 0.654
0.782ArgPro: 0.782 ± 0.208
1.65ArgGln: 1.65 ± 0.428
2.085ArgArg: 2.085 ± 0.488
2.085ArgSer: 2.085 ± 0.488
1.998ArgThr: 1.998 ± 0.438
2.606ArgVal: 2.606 ± 0.55
0.434ArgTrp: 0.434 ± 0.204
2.345ArgTyr: 2.345 ± 0.53
0.0ArgXaa: 0.0 ± 0.0
Ser
4.691SerAla: 4.691 ± 0.68
0.174SerCys: 0.174 ± 0.112
4.43SerAsp: 4.43 ± 0.794
3.301SerGlu: 3.301 ± 0.557
2.085SerPhe: 2.085 ± 0.44
3.301SerGly: 3.301 ± 0.602
1.216SerHis: 1.216 ± 0.371
5.733SerIle: 5.733 ± 0.601
5.212SerLys: 5.212 ± 0.603
4.256SerLeu: 4.256 ± 0.679
1.564SerMet: 1.564 ± 0.37
3.735SerAsn: 3.735 ± 0.507
1.564SerPro: 1.564 ± 0.401
1.824SerGln: 1.824 ± 0.468
2.172SerArg: 2.172 ± 0.35
3.909SerSer: 3.909 ± 0.653
3.648SerThr: 3.648 ± 0.473
2.867SerVal: 2.867 ± 0.495
0.347SerTrp: 0.347 ± 0.21
2.606SerTyr: 2.606 ± 0.411
0.0SerXaa: 0.0 ± 0.0
Thr
3.735ThrAla: 3.735 ± 0.649
0.087ThrCys: 0.087 ± 0.076
3.214ThrAsp: 3.214 ± 0.705
4.691ThrGlu: 4.691 ± 0.763
2.432ThrPhe: 2.432 ± 0.459
3.562ThrGly: 3.562 ± 0.61
0.695ThrHis: 0.695 ± 0.191
5.907ThrIle: 5.907 ± 1.191
4.951ThrLys: 4.951 ± 0.616
5.212ThrLeu: 5.212 ± 0.584
0.695ThrMet: 0.695 ± 0.273
4.691ThrAsn: 4.691 ± 0.654
1.564ThrPro: 1.564 ± 0.323
2.606ThrGln: 2.606 ± 0.569
2.519ThrArg: 2.519 ± 0.397
4.083ThrSer: 4.083 ± 0.67
4.517ThrThr: 4.517 ± 0.892
4.43ThrVal: 4.43 ± 0.958
0.521ThrTrp: 0.521 ± 0.212
2.085ThrTyr: 2.085 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
3.648ValAla: 3.648 ± 0.821
0.347ValCys: 0.347 ± 0.174
4.17ValAsp: 4.17 ± 0.655
5.559ValGlu: 5.559 ± 0.582
3.04ValPhe: 3.04 ± 0.69
2.345ValGly: 2.345 ± 0.488
0.869ValHis: 0.869 ± 0.236
6.167ValIle: 6.167 ± 0.905
5.733ValLys: 5.733 ± 0.523
4.517ValLeu: 4.517 ± 0.735
1.911ValMet: 1.911 ± 0.396
4.604ValAsn: 4.604 ± 0.676
1.998ValPro: 1.998 ± 0.483
1.824ValGln: 1.824 ± 0.417
2.085ValArg: 2.085 ± 0.464
4.17ValSer: 4.17 ± 0.645
3.301ValThr: 3.301 ± 0.538
4.604ValVal: 4.604 ± 1.041
1.129ValTrp: 1.129 ± 0.357
2.693ValTyr: 2.693 ± 0.617
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.311
0.087TrpCys: 0.087 ± 0.079
0.347TrpAsp: 0.347 ± 0.162
0.608TrpGlu: 0.608 ± 0.259
0.347TrpPhe: 0.347 ± 0.19
0.434TrpGly: 0.434 ± 0.328
0.174TrpHis: 0.174 ± 0.117
0.782TrpIle: 0.782 ± 0.248
0.869TrpLys: 0.869 ± 0.298
0.347TrpLeu: 0.347 ± 0.184
0.087TrpMet: 0.087 ± 0.091
1.824TrpAsn: 1.824 ± 0.989
0.174TrpPro: 0.174 ± 0.128
0.521TrpGln: 0.521 ± 0.2
0.261TrpArg: 0.261 ± 0.166
0.869TrpSer: 0.869 ± 0.328
1.216TrpThr: 1.216 ± 0.344
0.869TrpVal: 0.869 ± 0.287
0.0TrpTrp: 0.0 ± 0.0
0.521TrpTyr: 0.521 ± 0.199
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.737TyrAla: 1.737 ± 0.432
0.347TyrCys: 0.347 ± 0.15
3.388TyrAsp: 3.388 ± 0.608
3.214TyrGlu: 3.214 ± 0.563
2.432TyrPhe: 2.432 ± 0.397
2.432TyrGly: 2.432 ± 0.638
0.695TyrHis: 0.695 ± 0.285
3.475TyrIle: 3.475 ± 0.681
5.038TyrLys: 5.038 ± 0.761
2.693TyrLeu: 2.693 ± 0.478
0.956TyrMet: 0.956 ± 0.287
2.606TyrAsn: 2.606 ± 0.35
0.956TyrPro: 0.956 ± 0.312
1.564TyrGln: 1.564 ± 0.341
2.519TyrArg: 2.519 ± 0.645
2.953TyrSer: 2.953 ± 0.51
3.388TyrThr: 3.388 ± 0.449
2.693TyrVal: 2.693 ± 0.589
0.695TyrTrp: 0.695 ± 0.234
1.824TyrTyr: 1.824 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11513 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski