Amino acid dipepetide frequency for Streptococcus phage Javan290

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.959AlaAla: 2.959 ± 0.739
0.573AlaCys: 0.573 ± 0.276
4.581AlaAsp: 4.581 ± 0.713
6.299AlaGlu: 6.299 ± 1.005
2.577AlaPhe: 2.577 ± 0.564
5.249AlaGly: 5.249 ± 1.042
0.477AlaHis: 0.477 ± 0.21
6.013AlaIle: 6.013 ± 1.166
5.058AlaLys: 5.058 ± 0.738
7.73AlaLeu: 7.73 ± 0.835
2.291AlaMet: 2.291 ± 0.442
2.863AlaAsn: 2.863 ± 0.54
1.241AlaPro: 1.241 ± 0.436
2.386AlaGln: 2.386 ± 0.465
3.436AlaArg: 3.436 ± 0.583
4.486AlaSer: 4.486 ± 0.644
4.486AlaThr: 4.486 ± 0.625
5.249AlaVal: 5.249 ± 0.733
1.145AlaTrp: 1.145 ± 0.375
1.909AlaTyr: 1.909 ± 0.444
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.175
0.095CysCys: 0.095 ± 0.09
0.573CysAsp: 0.573 ± 0.224
0.286CysGlu: 0.286 ± 0.169
0.286CysPhe: 0.286 ± 0.163
0.477CysGly: 0.477 ± 0.227
0.191CysHis: 0.191 ± 0.121
0.191CysIle: 0.191 ± 0.135
0.191CysLys: 0.191 ± 0.133
0.382CysLeu: 0.382 ± 0.17
0.095CysMet: 0.095 ± 0.095
0.286CysAsn: 0.286 ± 0.173
0.095CysPro: 0.095 ± 0.105
0.286CysGln: 0.286 ± 0.152
0.382CysArg: 0.382 ± 0.213
0.286CysSer: 0.286 ± 0.154
0.286CysThr: 0.286 ± 0.162
0.191CysVal: 0.191 ± 0.126
0.095CysTrp: 0.095 ± 0.101
0.286CysTyr: 0.286 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
3.149AspAla: 3.149 ± 0.459
0.286AspCys: 0.286 ± 0.142
4.104AspAsp: 4.104 ± 0.841
5.535AspGlu: 5.535 ± 0.894
3.436AspPhe: 3.436 ± 0.545
5.249AspGly: 5.249 ± 0.717
0.668AspHis: 0.668 ± 0.262
4.963AspIle: 4.963 ± 0.787
5.249AspLys: 5.249 ± 0.697
4.963AspLeu: 4.963 ± 0.778
1.432AspMet: 1.432 ± 0.384
2.291AspAsn: 2.291 ± 0.594
1.05AspPro: 1.05 ± 0.258
1.622AspGln: 1.622 ± 0.404
1.813AspArg: 1.813 ± 0.45
3.531AspSer: 3.531 ± 0.521
3.531AspThr: 3.531 ± 0.602
4.295AspVal: 4.295 ± 0.703
1.241AspTrp: 1.241 ± 0.306
2.195AspTyr: 2.195 ± 0.645
0.0AspXaa: 0.0 ± 0.0
Glu
6.681GluAla: 6.681 ± 0.664
0.095GluCys: 0.095 ± 0.088
3.627GluAsp: 3.627 ± 0.598
5.917GluGlu: 5.917 ± 1.29
2.291GluPhe: 2.291 ± 0.469
3.818GluGly: 3.818 ± 0.513
1.527GluHis: 1.527 ± 0.404
6.013GluIle: 6.013 ± 0.714
7.349GluLys: 7.349 ± 0.94
7.253GluLeu: 7.253 ± 1.2
2.577GluMet: 2.577 ± 0.479
4.963GluAsn: 4.963 ± 0.851
1.813GluPro: 1.813 ± 0.386
4.104GluGln: 4.104 ± 0.624
6.013GluArg: 6.013 ± 0.823
4.104GluSer: 4.104 ± 0.586
3.245GluThr: 3.245 ± 0.541
5.726GluVal: 5.726 ± 0.728
0.859GluTrp: 0.859 ± 0.341
2.1GluTyr: 2.1 ± 0.406
0.0GluXaa: 0.0 ± 0.0
Phe
3.054PheAla: 3.054 ± 0.512
0.0PheCys: 0.0 ± 0.0
3.149PheAsp: 3.149 ± 0.501
3.818PheGlu: 3.818 ± 0.528
1.05PhePhe: 1.05 ± 0.275
3.054PheGly: 3.054 ± 0.443
0.382PheHis: 0.382 ± 0.215
1.813PheIle: 1.813 ± 0.445
3.054PheLys: 3.054 ± 0.551
2.1PheLeu: 2.1 ± 0.448
1.145PheMet: 1.145 ± 0.377
2.004PheAsn: 2.004 ± 0.497
1.813PhePro: 1.813 ± 0.445
1.145PheGln: 1.145 ± 0.341
1.909PheArg: 1.909 ± 0.461
3.054PheSer: 3.054 ± 0.972
2.481PheThr: 2.481 ± 0.415
2.004PheVal: 2.004 ± 0.359
0.191PheTrp: 0.191 ± 0.13
1.241PheTyr: 1.241 ± 0.322
0.0PheXaa: 0.0 ± 0.0
Gly
4.104GlyAla: 4.104 ± 1.032
0.382GlyCys: 0.382 ± 0.159
5.535GlyAsp: 5.535 ± 0.692
5.058GlyGlu: 5.058 ± 0.776
3.436GlyPhe: 3.436 ± 0.705
4.486GlyGly: 4.486 ± 0.807
1.432GlyHis: 1.432 ± 0.374
5.44GlyIle: 5.44 ± 0.807
6.108GlyLys: 6.108 ± 0.696
5.822GlyLeu: 5.822 ± 1.299
2.768GlyMet: 2.768 ± 0.47
3.149GlyAsn: 3.149 ± 0.575
1.05GlyPro: 1.05 ± 0.447
3.34GlyGln: 3.34 ± 0.72
3.245GlyArg: 3.245 ± 0.536
3.818GlySer: 3.818 ± 0.596
3.818GlyThr: 3.818 ± 0.593
4.963GlyVal: 4.963 ± 0.799
0.668GlyTrp: 0.668 ± 0.223
3.627GlyTyr: 3.627 ± 0.546
0.0GlyXaa: 0.0 ± 0.0
His
1.05HisAla: 1.05 ± 0.3
0.191HisCys: 0.191 ± 0.118
0.477HisAsp: 0.477 ± 0.177
1.432HisGlu: 1.432 ± 0.37
0.859HisPhe: 0.859 ± 0.274
1.336HisGly: 1.336 ± 0.252
0.477HisHis: 0.477 ± 0.208
0.859HisIle: 0.859 ± 0.283
0.573HisLys: 0.573 ± 0.235
1.813HisLeu: 1.813 ± 0.392
0.382HisMet: 0.382 ± 0.175
0.477HisAsn: 0.477 ± 0.199
0.668HisPro: 0.668 ± 0.288
0.859HisGln: 0.859 ± 0.248
0.954HisArg: 0.954 ± 0.239
1.241HisSer: 1.241 ± 0.232
0.954HisThr: 0.954 ± 0.361
0.764HisVal: 0.764 ± 0.31
0.286HisTrp: 0.286 ± 0.141
0.573HisTyr: 0.573 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
4.104IleAla: 4.104 ± 0.726
0.286IleCys: 0.286 ± 0.174
4.581IleAsp: 4.581 ± 0.75
6.013IleGlu: 6.013 ± 0.748
1.718IlePhe: 1.718 ± 0.379
4.676IleGly: 4.676 ± 0.721
1.05IleHis: 1.05 ± 0.29
2.768IleIle: 2.768 ± 0.423
5.154IleLys: 5.154 ± 0.711
4.963IleLeu: 4.963 ± 0.761
0.954IleMet: 0.954 ± 0.325
2.672IleAsn: 2.672 ± 0.488
2.577IlePro: 2.577 ± 0.491
2.863IleGln: 2.863 ± 0.623
3.245IleArg: 3.245 ± 0.499
4.486IleSer: 4.486 ± 0.694
4.008IleThr: 4.008 ± 0.724
4.676IleVal: 4.676 ± 0.648
0.477IleTrp: 0.477 ± 0.205
2.195IleTyr: 2.195 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
5.535LysAla: 5.535 ± 0.781
0.191LysCys: 0.191 ± 0.134
4.581LysAsp: 4.581 ± 0.734
6.013LysGlu: 6.013 ± 1.077
2.291LysPhe: 2.291 ± 0.401
6.108LysGly: 6.108 ± 0.71
0.954LysHis: 0.954 ± 0.252
4.104LysIle: 4.104 ± 0.505
6.013LysLys: 6.013 ± 1.055
5.726LysLeu: 5.726 ± 0.694
1.813LysMet: 1.813 ± 0.358
4.39LysAsn: 4.39 ± 0.619
1.527LysPro: 1.527 ± 0.357
3.818LysGln: 3.818 ± 0.592
3.722LysArg: 3.722 ± 0.486
4.581LysSer: 4.581 ± 0.726
4.295LysThr: 4.295 ± 0.722
7.349LysVal: 7.349 ± 0.94
1.336LysTrp: 1.336 ± 0.333
3.245LysTyr: 3.245 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
7.444LeuAla: 7.444 ± 0.807
0.382LeuCys: 0.382 ± 0.181
5.345LeuAsp: 5.345 ± 0.639
7.062LeuGlu: 7.062 ± 0.758
4.199LeuPhe: 4.199 ± 0.478
5.822LeuGly: 5.822 ± 0.932
1.336LeuHis: 1.336 ± 0.313
3.531LeuIle: 3.531 ± 0.415
7.062LeuLys: 7.062 ± 1.204
7.444LeuLeu: 7.444 ± 0.941
1.336LeuMet: 1.336 ± 0.382
3.627LeuAsn: 3.627 ± 0.522
2.291LeuPro: 2.291 ± 0.456
3.818LeuGln: 3.818 ± 0.619
3.818LeuArg: 3.818 ± 0.557
6.013LeuSer: 6.013 ± 0.628
5.345LeuThr: 5.345 ± 0.725
5.154LeuVal: 5.154 ± 0.7
0.859LeuTrp: 0.859 ± 0.316
2.768LeuTyr: 2.768 ± 0.543
0.0LeuXaa: 0.0 ± 0.0
Met
2.195MetAla: 2.195 ± 0.606
0.191MetCys: 0.191 ± 0.114
1.05MetAsp: 1.05 ± 0.294
1.432MetGlu: 1.432 ± 0.299
0.859MetPhe: 0.859 ± 0.309
1.241MetGly: 1.241 ± 0.375
0.286MetHis: 0.286 ± 0.176
1.909MetIle: 1.909 ± 0.345
2.959MetLys: 2.959 ± 0.593
1.718MetLeu: 1.718 ± 0.335
0.286MetMet: 0.286 ± 0.148
1.527MetAsn: 1.527 ± 0.343
0.573MetPro: 0.573 ± 0.214
1.05MetGln: 1.05 ± 0.291
1.241MetArg: 1.241 ± 0.384
1.527MetSer: 1.527 ± 0.401
2.386MetThr: 2.386 ± 0.419
1.336MetVal: 1.336 ± 0.314
0.477MetTrp: 0.477 ± 0.198
1.05MetTyr: 1.05 ± 0.369
0.0MetXaa: 0.0 ± 0.0
Asn
3.054AsnAla: 3.054 ± 0.671
0.477AsnCys: 0.477 ± 0.224
1.622AsnAsp: 1.622 ± 0.371
3.436AsnGlu: 3.436 ± 0.586
1.718AsnPhe: 1.718 ± 0.38
5.726AsnGly: 5.726 ± 0.795
0.764AsnHis: 0.764 ± 0.226
3.245AsnIle: 3.245 ± 0.472
3.436AsnLys: 3.436 ± 0.505
3.722AsnLeu: 3.722 ± 0.579
0.382AsnMet: 0.382 ± 0.184
1.622AsnAsn: 1.622 ± 0.521
1.813AsnPro: 1.813 ± 0.434
2.195AsnGln: 2.195 ± 0.53
2.863AsnArg: 2.863 ± 0.658
2.672AsnSer: 2.672 ± 0.613
2.195AsnThr: 2.195 ± 0.611
3.34AsnVal: 3.34 ± 0.498
0.859AsnTrp: 0.859 ± 0.322
1.813AsnTyr: 1.813 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
1.909ProAla: 1.909 ± 0.492
0.286ProCys: 0.286 ± 0.156
1.813ProAsp: 1.813 ± 0.494
1.718ProGlu: 1.718 ± 0.36
1.336ProPhe: 1.336 ± 0.377
1.527ProGly: 1.527 ± 0.366
0.668ProHis: 0.668 ± 0.278
1.813ProIle: 1.813 ± 0.438
2.195ProLys: 2.195 ± 0.398
2.195ProLeu: 2.195 ± 0.417
0.859ProMet: 0.859 ± 0.203
1.145ProAsn: 1.145 ± 0.273
0.477ProPro: 0.477 ± 0.188
0.954ProGln: 0.954 ± 0.292
0.859ProArg: 0.859 ± 0.267
2.195ProSer: 2.195 ± 0.413
1.622ProThr: 1.622 ± 0.417
2.291ProVal: 2.291 ± 0.493
0.095ProTrp: 0.095 ± 0.09
1.241ProTyr: 1.241 ± 0.324
0.0ProXaa: 0.0 ± 0.0
Gln
3.436GlnAla: 3.436 ± 0.599
0.191GlnCys: 0.191 ± 0.139
2.004GlnAsp: 2.004 ± 0.317
3.818GlnGlu: 3.818 ± 0.709
1.718GlnPhe: 1.718 ± 0.403
1.909GlnGly: 1.909 ± 0.456
0.764GlnHis: 0.764 ± 0.261
1.813GlnIle: 1.813 ± 0.371
2.768GlnLys: 2.768 ± 0.64
3.818GlnLeu: 3.818 ± 0.555
0.573GlnMet: 0.573 ± 0.217
3.627GlnAsn: 3.627 ± 0.553
1.05GlnPro: 1.05 ± 0.306
1.527GlnGln: 1.527 ± 0.359
2.1GlnArg: 2.1 ± 0.478
2.768GlnSer: 2.768 ± 0.615
3.245GlnThr: 3.245 ± 0.528
2.481GlnVal: 2.481 ± 0.517
0.382GlnTrp: 0.382 ± 0.161
1.813GlnTyr: 1.813 ± 0.39
0.0GlnXaa: 0.0 ± 0.0
Arg
3.054ArgAla: 3.054 ± 0.628
0.286ArgCys: 0.286 ± 0.173
2.577ArgAsp: 2.577 ± 0.588
2.959ArgGlu: 2.959 ± 0.586
1.432ArgPhe: 1.432 ± 0.338
2.291ArgGly: 2.291 ± 0.573
0.573ArgHis: 0.573 ± 0.218
3.149ArgIle: 3.149 ± 0.587
4.772ArgLys: 4.772 ± 0.918
4.772ArgLeu: 4.772 ± 0.616
1.05ArgMet: 1.05 ± 0.338
2.195ArgAsn: 2.195 ± 0.407
1.432ArgPro: 1.432 ± 0.404
2.863ArgGln: 2.863 ± 0.548
1.527ArgArg: 1.527 ± 0.487
2.959ArgSer: 2.959 ± 0.563
2.959ArgThr: 2.959 ± 0.476
2.863ArgVal: 2.863 ± 0.513
0.764ArgTrp: 0.764 ± 0.249
2.386ArgTyr: 2.386 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
5.535SerAla: 5.535 ± 1.164
0.286SerCys: 0.286 ± 0.161
3.913SerAsp: 3.913 ± 0.585
4.39SerGlu: 4.39 ± 0.651
3.054SerPhe: 3.054 ± 0.54
5.822SerGly: 5.822 ± 1.095
0.954SerHis: 0.954 ± 0.261
3.627SerIle: 3.627 ± 0.647
5.726SerLys: 5.726 ± 0.702
5.631SerLeu: 5.631 ± 0.834
2.1SerMet: 2.1 ± 0.542
2.577SerAsn: 2.577 ± 0.463
2.291SerPro: 2.291 ± 0.493
1.432SerGln: 1.432 ± 0.346
1.718SerArg: 1.718 ± 0.447
4.39SerSer: 4.39 ± 0.561
2.672SerThr: 2.672 ± 0.535
4.199SerVal: 4.199 ± 0.78
1.145SerTrp: 1.145 ± 0.392
2.768SerTyr: 2.768 ± 0.481
0.0SerXaa: 0.0 ± 0.0
Thr
4.39ThrAla: 4.39 ± 0.578
0.286ThrCys: 0.286 ± 0.167
3.34ThrAsp: 3.34 ± 0.521
4.676ThrGlu: 4.676 ± 0.592
2.672ThrPhe: 2.672 ± 0.427
5.345ThrGly: 5.345 ± 0.896
1.05ThrHis: 1.05 ± 0.351
5.535ThrIle: 5.535 ± 0.93
3.149ThrLys: 3.149 ± 0.593
4.39ThrLeu: 4.39 ± 0.56
1.527ThrMet: 1.527 ± 0.305
2.481ThrAsn: 2.481 ± 0.482
1.622ThrPro: 1.622 ± 0.345
1.718ThrGln: 1.718 ± 0.283
2.195ThrArg: 2.195 ± 0.399
3.436ThrSer: 3.436 ± 0.445
3.245ThrThr: 3.245 ± 0.469
5.249ThrVal: 5.249 ± 0.584
0.477ThrTrp: 0.477 ± 0.275
1.527ThrTyr: 1.527 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
5.345ValAla: 5.345 ± 0.784
0.286ValCys: 0.286 ± 0.153
4.581ValAsp: 4.581 ± 0.546
6.013ValGlu: 6.013 ± 0.807
1.813ValPhe: 1.813 ± 0.477
4.486ValGly: 4.486 ± 0.739
1.527ValHis: 1.527 ± 0.343
4.104ValIle: 4.104 ± 0.663
4.676ValLys: 4.676 ± 0.538
6.108ValLeu: 6.108 ± 0.671
2.291ValMet: 2.291 ± 0.404
3.054ValAsn: 3.054 ± 0.51
2.386ValPro: 2.386 ± 0.604
2.768ValGln: 2.768 ± 0.508
3.054ValArg: 3.054 ± 0.474
5.44ValSer: 5.44 ± 0.604
5.154ValThr: 5.154 ± 0.602
5.249ValVal: 5.249 ± 0.833
0.477ValTrp: 0.477 ± 0.219
1.813ValTyr: 1.813 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
0.764TrpAla: 0.764 ± 0.269
0.191TrpCys: 0.191 ± 0.12
0.477TrpAsp: 0.477 ± 0.218
1.05TrpGlu: 1.05 ± 0.353
0.859TrpPhe: 0.859 ± 0.292
1.336TrpGly: 1.336 ± 0.337
0.0TrpHis: 0.0 ± 0.0
0.668TrpIle: 0.668 ± 0.278
0.573TrpLys: 0.573 ± 0.275
1.241TrpLeu: 1.241 ± 0.359
0.286TrpMet: 0.286 ± 0.13
0.477TrpAsn: 0.477 ± 0.213
0.095TrpPro: 0.095 ± 0.095
0.954TrpGln: 0.954 ± 0.306
0.668TrpArg: 0.668 ± 0.226
1.05TrpSer: 1.05 ± 0.282
0.477TrpThr: 0.477 ± 0.181
0.573TrpVal: 0.573 ± 0.219
0.0TrpTrp: 0.0 ± 0.0
0.668TrpTyr: 0.668 ± 0.234
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.672TyrAla: 2.672 ± 0.423
0.573TyrCys: 0.573 ± 0.214
2.672TyrAsp: 2.672 ± 0.534
3.149TyrGlu: 3.149 ± 0.524
0.859TyrPhe: 0.859 ± 0.255
2.291TyrGly: 2.291 ± 0.357
1.145TyrHis: 1.145 ± 0.333
1.909TyrIle: 1.909 ± 0.525
1.622TyrLys: 1.622 ± 0.429
2.863TyrLeu: 2.863 ± 0.491
1.05TyrMet: 1.05 ± 0.321
1.527TyrAsn: 1.527 ± 0.387
1.336TyrPro: 1.336 ± 0.336
2.1TyrGln: 2.1 ± 0.361
2.1TyrArg: 2.1 ± 0.35
2.195TyrSer: 2.195 ± 0.395
1.909TyrThr: 1.909 ± 0.409
2.672TyrVal: 2.672 ± 0.47
0.573TyrTrp: 0.573 ± 0.203
1.241TyrTyr: 1.241 ± 0.322
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (10479 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski