Amino acid dipeptide frequency for Streptococcus phage Javan464

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
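To make the definition concrete, here is a minimal Python sketch of how such frequencies could be computed from a set of protein sequences. It is an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_per_mille` is hypothetical, pairs are counted as overlapping windows within each protein, and the counts are normalised by the total number of pairs (the normalisation used for the table may differ).

```python
from collections import Counter
from itertools import product

# 20 standard residues (one-letter codes); anything else is treated as X (Xaa).
RESIDUES = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs, normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard letters to X so every pair falls in the 21 x 21 grid.
        clean = "".join(c if c in RESIDUES else "X" for c in seq.upper())
        # Adjacent pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    alphabet = RESIDUES + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real phage proteins: 8 pairs in total, "MK" occurs twice.
freqs = dipeptide_per_mille(["MKLLKA", "MKAX"])
print(freqs["MK"])
```

The result always has 441 entries, so dipeptides that never occur (such as every Xaa pair in the table) show up explicitly as 0.0.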

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
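The uniform expectation and the red/blue rule can be sketched in a few lines of Python. The names `EXPECTED_400`, `EXPECTED_441` and `classify` are hypothetical, and the comparison against a flat 1/400 baseline is an assumption based on the 0.25% figure above; the three sample values are taken from the table.

```python
# Uniform expectation per dipeptide, in per mille to match the table.
EXPECTED_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
EXPECTED_441 = 1000.0 / 441  # about 2.268 per mille when Xaa is included


def classify(per_mille, expected=EXPECTED_400):
    """Label a table value relative to the uniform expectation."""
    if per_mille > expected:
        return "over"   # would be marked red
    if per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Three values from the table: LeuLys, AlaAla and CysCys.
for name, value in [("LeuLys", 9.944), ("AlaAla", 2.887), ("CysCys", 0.08)]:
    print(name, value, classify(value))
```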

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
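As a quick worked example of the unit conversion, the snippet below turns the AlaAla entry from the table into a plain fraction and a percentage. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(2.887)  # AlaAla entry from the table
print(f"{ala_ala:.6f}")          # 0.002887
print(f"{ala_ala * 100:.4f} %")  # 0.2887 %
```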

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.887 ± 0.632
AlaCys: 0.481 ± 0.179
AlaAsp: 4.812 ± 0.639
AlaGlu: 6.736 ± 0.683
AlaPhe: 2.807 ± 0.466
AlaGly: 4.651 ± 1.153
AlaHis: 0.08 ± 0.077
AlaIle: 5.854 ± 0.783
AlaLys: 7.057 ± 0.652
AlaLeu: 6.095 ± 1.122
AlaMet: 1.443 ± 0.341
AlaAsn: 3.609 ± 0.449
AlaPro: 1.363 ± 0.343
AlaGln: 2.566 ± 0.354
AlaArg: 3.208 ± 0.484
AlaSer: 3.448 ± 0.49
AlaThr: 3.849 ± 0.589
AlaVal: 4.812 ± 0.546
AlaTrp: 0.962 ± 0.314
AlaTyr: 2.085 ± 0.417
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.401 ± 0.165
CysCys: 0.08 ± 0.074
CysAsp: 0.401 ± 0.142
CysGlu: 0.561 ± 0.23
CysPhe: 0.401 ± 0.171
CysGly: 0.321 ± 0.139
CysHis: 0.321 ± 0.183
CysIle: 0.321 ± 0.15
CysLys: 0.481 ± 0.174
CysLeu: 0.642 ± 0.239
CysMet: 0.16 ± 0.103
CysAsn: 0.321 ± 0.151
CysPro: 0.08 ± 0.082
CysGln: 0.241 ± 0.123
CysArg: 0.481 ± 0.173
CysSer: 0.561 ± 0.199
CysThr: 0.08 ± 0.073
CysVal: 0.401 ± 0.169
CysTrp: 0.08 ± 0.077
CysTyr: 0.321 ± 0.158
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.731 ± 0.498
AspCys: 0.642 ± 0.207
AspAsp: 4.17 ± 0.634
AspGlu: 5.533 ± 0.635
AspPhe: 3.288 ± 0.384
AspGly: 4.411 ± 0.69
AspHis: 0.962 ± 0.264
AspIle: 4.411 ± 0.566
AspLys: 6.415 ± 0.723
AspLeu: 6.014 ± 0.679
AspMet: 1.844 ± 0.327
AspAsn: 3.128 ± 0.675
AspPro: 1.443 ± 0.336
AspGln: 1.203 ± 0.328
AspArg: 3.047 ± 0.557
AspSer: 4.01 ± 0.642
AspThr: 3.528 ± 0.516
AspVal: 4.892 ± 0.642
AspTrp: 0.962 ± 0.354
AspTyr: 3.448 ± 0.585
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.892 ± 0.688
GluCys: 0.401 ± 0.194
GluAsp: 4.01 ± 0.548
GluGlu: 6.095 ± 0.756
GluPhe: 2.727 ± 0.492
GluGly: 3.128 ± 0.504
GluHis: 1.604 ± 0.394
GluIle: 7.298 ± 0.818
GluLys: 5.373 ± 0.739
GluLeu: 9.062 ± 0.869
GluMet: 2.165 ± 0.46
GluAsn: 4.33 ± 0.725
GluPro: 2.245 ± 0.416
GluGln: 3.128 ± 0.562
GluArg: 3.448 ± 0.62
GluSer: 3.288 ± 0.54
GluThr: 4.01 ± 0.543
GluVal: 4.17 ± 0.723
GluTrp: 0.642 ± 0.232
GluTyr: 3.368 ± 0.661
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.925 ± 0.354
PheCys: 0.321 ± 0.147
PheAsp: 3.929 ± 0.548
PheGlu: 2.326 ± 0.472
PhePhe: 1.684 ± 0.361
PheGly: 2.967 ± 0.586
PheHis: 0.401 ± 0.138
PheIle: 3.128 ± 0.497
PheLys: 3.929 ± 0.46
PheLeu: 3.368 ± 0.538
PheMet: 1.043 ± 0.289
PheAsn: 2.005 ± 0.39
PhePro: 0.401 ± 0.209
PheGln: 1.203 ± 0.332
PheArg: 1.604 ± 0.368
PheSer: 2.887 ± 0.662
PheThr: 2.566 ± 0.494
PheVal: 3.288 ± 0.564
PheTrp: 0.722 ± 0.238
PheTyr: 1.604 ± 0.305
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.288 ± 0.537
GlyCys: 0.722 ± 0.252
GlyAsp: 3.448 ± 0.648
GlyGlu: 3.929 ± 0.497
GlyPhe: 3.128 ± 0.603
GlyGly: 3.528 ± 0.652
GlyHis: 1.123 ± 0.285
GlyIle: 5.293 ± 0.877
GlyLys: 5.373 ± 0.663
GlyLeu: 4.09 ± 0.567
GlyMet: 1.443 ± 0.328
GlyAsn: 4.17 ± 0.501
GlyPro: 1.123 ± 0.481
GlyGln: 2.085 ± 0.446
GlyArg: 1.764 ± 0.453
GlySer: 2.646 ± 0.425
GlyThr: 3.288 ± 0.559
GlyVal: 4.09 ± 0.565
GlyTrp: 0.722 ± 0.445
GlyTyr: 2.646 ± 0.544
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.123 ± 0.296
HisCys: 0.08 ± 0.075
HisAsp: 0.722 ± 0.227
HisGlu: 1.363 ± 0.322
HisPhe: 0.722 ± 0.219
HisGly: 1.203 ± 0.275
HisHis: 0.241 ± 0.137
HisIle: 1.043 ± 0.301
HisLys: 0.722 ± 0.244
HisLeu: 1.283 ± 0.315
HisMet: 0.16 ± 0.109
HisAsn: 0.722 ± 0.195
HisPro: 0.241 ± 0.158
HisGln: 0.561 ± 0.209
HisArg: 0.642 ± 0.221
HisSer: 0.802 ± 0.312
HisThr: 1.123 ± 0.263
HisVal: 1.123 ± 0.331
HisTrp: 0.241 ± 0.148
HisTyr: 0.481 ± 0.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.373 ± 0.613
IleCys: 0.561 ± 0.2
IleAsp: 5.533 ± 0.733
IleGlu: 5.132 ± 0.836
IlePhe: 2.085 ± 0.445
IleGly: 3.849 ± 0.539
IleHis: 0.642 ± 0.248
IleIle: 5.694 ± 0.862
IleLys: 7.538 ± 0.717
IleLeu: 4.731 ± 0.524
IleMet: 2.245 ± 0.518
IleAsn: 4.892 ± 0.774
IlePro: 2.005 ± 0.432
IleGln: 2.085 ± 0.409
IleArg: 2.727 ± 0.461
IleSer: 4.972 ± 0.752
IleThr: 4.892 ± 0.588
IleVal: 4.491 ± 0.677
IleTrp: 0.962 ± 0.42
IleTyr: 2.326 ± 0.447
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.618 ± 0.647
LysCys: 0.481 ± 0.18
LysAsp: 5.453 ± 0.661
LysGlu: 6.415 ± 0.707
LysPhe: 1.925 ± 0.375
LysGly: 3.849 ± 0.55
LysHis: 1.203 ± 0.315
LysIle: 6.335 ± 0.712
LysLys: 7.698 ± 0.85
LysLeu: 6.736 ± 0.714
LysMet: 3.128 ± 0.475
LysAsn: 5.293 ± 0.6
LysPro: 3.128 ± 0.521
LysGln: 5.533 ± 0.671
LysArg: 4.17 ± 0.734
LysSer: 5.694 ± 0.57
LysThr: 5.213 ± 0.549
LysVal: 5.934 ± 0.862
LysTrp: 1.443 ± 0.313
LysTyr: 3.047 ± 0.441
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.934 ± 0.896
LeuCys: 0.0 ± 0.0
LeuAsp: 5.934 ± 0.791
LeuGlu: 6.897 ± 0.822
LeuPhe: 3.929 ± 0.654
LeuGly: 3.528 ± 0.66
LeuHis: 1.443 ± 0.386
LeuIle: 5.694 ± 0.681
LeuLys: 9.944 ± 0.868
LeuLeu: 6.977 ± 0.887
LeuMet: 1.684 ± 0.481
LeuAsn: 4.411 ± 0.552
LeuPro: 2.326 ± 0.34
LeuGln: 2.967 ± 0.43
LeuArg: 4.25 ± 0.539
LeuSer: 4.411 ± 0.771
LeuThr: 5.213 ± 0.661
LeuVal: 4.571 ± 0.6
LeuTrp: 0.882 ± 0.241
LeuTyr: 2.406 ± 0.438
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.128 ± 0.626
MetCys: 0.0 ± 0.0
MetAsp: 1.123 ± 0.329
MetGlu: 1.684 ± 0.37
MetPhe: 1.123 ± 0.295
MetGly: 1.123 ± 0.302
MetHis: 0.401 ± 0.175
MetIle: 1.283 ± 0.266
MetLys: 1.363 ± 0.304
MetLeu: 2.085 ± 0.398
MetMet: 0.561 ± 0.189
MetAsn: 1.123 ± 0.248
MetPro: 1.123 ± 0.268
MetGln: 1.524 ± 0.355
MetArg: 2.165 ± 0.357
MetSer: 1.363 ± 0.283
MetThr: 1.524 ± 0.311
MetVal: 0.481 ± 0.184
MetTrp: 0.401 ± 0.203
MetTyr: 0.722 ± 0.218
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.25 ± 0.804
AsnCys: 0.401 ± 0.138
AsnAsp: 2.807 ± 0.42
AsnGlu: 3.128 ± 0.593
AsnPhe: 1.925 ± 0.355
AsnGly: 4.01 ± 0.613
AsnHis: 1.043 ± 0.33
AsnIle: 4.571 ± 0.549
AsnLys: 4.33 ± 0.544
AsnLeu: 4.411 ± 0.556
AsnMet: 1.604 ± 0.333
AsnAsn: 2.967 ± 0.612
AsnPro: 2.165 ± 0.393
AsnGln: 2.406 ± 0.546
AsnArg: 2.326 ± 0.371
AsnSer: 3.849 ± 0.662
AsnThr: 3.849 ± 0.6
AsnVal: 2.566 ± 0.431
AsnTrp: 0.722 ± 0.246
AsnTyr: 2.406 ± 0.423
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.043 ± 0.261
ProCys: 0.321 ± 0.166
ProAsp: 2.646 ± 0.477
ProGlu: 1.684 ± 0.334
ProPhe: 1.123 ± 0.338
ProGly: 0.722 ± 0.294
ProHis: 0.561 ± 0.217
ProIle: 1.363 ± 0.401
ProLys: 2.566 ± 0.433
ProLeu: 1.684 ± 0.326
ProMet: 0.401 ± 0.174
ProAsn: 1.844 ± 0.432
ProPro: 0.642 ± 0.246
ProGln: 1.363 ± 0.374
ProArg: 1.363 ± 0.287
ProSer: 2.005 ± 0.414
ProThr: 1.684 ± 0.409
ProVal: 1.684 ± 0.373
ProTrp: 0.241 ± 0.151
ProTyr: 1.123 ± 0.29
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.406 ± 0.441
GlnCys: 0.401 ± 0.224
GlnAsp: 2.005 ± 0.335
GlnGlu: 2.807 ± 0.502
GlnPhe: 2.005 ± 0.499
GlnGly: 2.486 ± 0.44
GlnHis: 0.241 ± 0.16
GlnIle: 2.646 ± 0.417
GlnLys: 3.849 ± 0.531
GlnLeu: 3.609 ± 0.668
GlnMet: 0.882 ± 0.23
GlnAsn: 2.486 ± 0.442
GlnPro: 0.962 ± 0.275
GlnGln: 1.844 ± 0.391
GlnArg: 0.882 ± 0.278
GlnSer: 3.208 ± 0.534
GlnThr: 3.047 ± 0.546
GlnVal: 2.486 ± 0.434
GlnTrp: 0.802 ± 0.331
GlnTyr: 0.962 ± 0.307
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.528 ± 0.458
ArgCys: 0.401 ± 0.166
ArgAsp: 2.165 ± 0.399
ArgGlu: 3.208 ± 0.48
ArgPhe: 1.764 ± 0.395
ArgGly: 2.566 ± 0.462
ArgHis: 0.561 ± 0.271
ArgIle: 3.288 ± 0.45
ArgLys: 3.849 ± 0.555
ArgLeu: 4.01 ± 0.624
ArgMet: 1.283 ± 0.279
ArgAsn: 2.486 ± 0.432
ArgPro: 0.962 ± 0.294
ArgGln: 2.566 ± 0.465
ArgArg: 2.807 ± 0.469
ArgSer: 1.443 ± 0.315
ArgThr: 1.604 ± 0.281
ArgVal: 2.406 ± 0.488
ArgTrp: 0.16 ± 0.111
ArgTyr: 2.245 ± 0.461
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.571 ± 0.845
SerCys: 0.321 ± 0.16
SerAsp: 5.052 ± 0.528
SerGlu: 4.571 ± 0.64
SerPhe: 2.646 ± 0.565
SerGly: 4.25 ± 0.682
SerHis: 1.043 ± 0.276
SerIle: 4.01 ± 0.442
SerLys: 5.213 ± 0.611
SerLeu: 3.929 ± 0.681
SerMet: 1.123 ± 0.311
SerAsn: 3.288 ± 0.571
SerPro: 0.802 ± 0.249
SerGln: 2.406 ± 0.4
SerArg: 2.326 ± 0.492
SerSer: 4.17 ± 0.63
SerThr: 2.967 ± 0.474
SerVal: 4.571 ± 0.651
SerTrp: 0.722 ± 0.218
SerTyr: 1.844 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.651 ± 1.07
ThrCys: 0.16 ± 0.107
ThrAsp: 4.731 ± 0.549
ThrGlu: 3.609 ± 0.48
ThrPhe: 2.727 ± 0.541
ThrGly: 4.25 ± 0.727
ThrHis: 0.882 ± 0.279
ThrIle: 4.25 ± 0.583
ThrLys: 5.293 ± 0.911
ThrLeu: 6.095 ± 0.649
ThrMet: 0.722 ± 0.186
ThrAsn: 3.528 ± 0.551
ThrPro: 1.925 ± 0.416
ThrGln: 2.406 ± 0.349
ThrArg: 1.363 ± 0.368
ThrSer: 3.448 ± 0.447
ThrThr: 4.411 ± 0.609
ThrVal: 3.368 ± 0.473
ThrTrp: 0.561 ± 0.222
ThrTyr: 2.406 ± 0.472
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.25 ± 0.599
ValCys: 0.561 ± 0.228
ValAsp: 4.411 ± 0.522
ValGlu: 6.656 ± 0.833
ValPhe: 2.807 ± 0.413
ValGly: 3.769 ± 0.688
ValHis: 0.802 ± 0.217
ValIle: 3.609 ± 0.505
ValLys: 4.01 ± 0.535
ValLeu: 4.731 ± 0.637
ValMet: 1.043 ± 0.354
ValAsn: 2.486 ± 0.389
ValPro: 1.764 ± 0.473
ValGln: 1.844 ± 0.32
ValArg: 2.887 ± 0.474
ValSer: 4.411 ± 0.537
ValThr: 4.651 ± 0.624
ValVal: 3.689 ± 0.413
ValTrp: 0.802 ± 0.282
ValTyr: 2.165 ± 0.48
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.882 ± 0.214
TrpCys: 0.08 ± 0.071
TrpAsp: 0.802 ± 0.25
TrpGlu: 0.882 ± 0.429
TrpPhe: 0.722 ± 0.256
TrpGly: 0.882 ± 0.273
TrpHis: 0.08 ± 0.073
TrpIle: 0.561 ± 0.181
TrpLys: 1.123 ± 0.325
TrpLeu: 1.043 ± 0.314
TrpMet: 0.481 ± 0.186
TrpAsn: 1.043 ± 0.379
TrpPro: 0.241 ± 0.131
TrpGln: 0.722 ± 0.213
TrpArg: 0.561 ± 0.23
TrpSer: 1.043 ± 0.222
TrpThr: 0.561 ± 0.25
TrpVal: 0.321 ± 0.16
TrpTrp: 0.241 ± 0.129
TrpTyr: 0.16 ± 0.118
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.925 ± 0.48
TyrCys: 0.241 ± 0.142
TyrAsp: 3.929 ± 0.541
TyrGlu: 2.165 ± 0.401
TyrPhe: 1.764 ± 0.47
TyrGly: 2.406 ± 0.584
TyrHis: 0.962 ± 0.247
TyrIle: 2.085 ± 0.461
TyrLys: 4.17 ± 0.678
TyrLeu: 2.807 ± 0.549
TyrMet: 0.802 ± 0.247
TyrAsn: 1.604 ± 0.442
TyrPro: 1.123 ± 0.299
TyrGln: 1.363 ± 0.357
TyrArg: 1.283 ± 0.433
TyrSer: 2.165 ± 0.494
TyrThr: 2.727 ± 0.399
TyrVal: 2.085 ± 0.383
TyrTrp: 0.16 ± 0.125
TyrTyr: 1.283 ± 0.346
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12471 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
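A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's code: the function names are hypothetical, sequences use one-letter codes, and taking the standard deviation across replicates as the reported error is an assumption, since the note only states that 100 bootstrap rounds were run over proteins.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real phage proteins.
toy = ["MKLLKA", "MKAGG", "LLKKAA", "GAGAK"]
print(bootstrap_error(toy, "LK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.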

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski