Amino acid dipepetide frequency for Streptococcus phage Javan93

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.226AlaAla: 4.226 ± 0.893
0.845AlaCys: 0.845 ± 0.279
4.649AlaAsp: 4.649 ± 0.612
3.55AlaGlu: 3.55 ± 0.525
2.198AlaPhe: 2.198 ± 0.381
4.818AlaGly: 4.818 ± 0.742
0.93AlaHis: 0.93 ± 0.256
5.071AlaIle: 5.071 ± 0.997
5.832AlaLys: 5.832 ± 0.658
5.832AlaLeu: 5.832 ± 0.95
2.282AlaMet: 2.282 ± 0.422
3.635AlaAsn: 3.635 ± 0.483
1.606AlaPro: 1.606 ± 0.314
3.212AlaGln: 3.212 ± 0.558
3.719AlaArg: 3.719 ± 0.554
4.142AlaSer: 4.142 ± 0.542
5.41AlaThr: 5.41 ± 0.853
4.48AlaVal: 4.48 ± 0.574
0.676AlaTrp: 0.676 ± 0.158
3.212AlaTyr: 3.212 ± 0.509
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.208
0.169CysCys: 0.169 ± 0.115
0.423CysAsp: 0.423 ± 0.191
0.761CysGlu: 0.761 ± 0.207
0.169CysPhe: 0.169 ± 0.109
1.099CysGly: 1.099 ± 0.335
0.254CysHis: 0.254 ± 0.155
0.338CysIle: 0.338 ± 0.14
0.507CysLys: 0.507 ± 0.248
0.845CysLeu: 0.845 ± 0.269
0.169CysMet: 0.169 ± 0.102
0.254CysAsn: 0.254 ± 0.118
0.592CysPro: 0.592 ± 0.247
0.676CysGln: 0.676 ± 0.191
0.93CysArg: 0.93 ± 0.312
0.676CysSer: 0.676 ± 0.233
0.085CysThr: 0.085 ± 0.079
0.507CysVal: 0.507 ± 0.211
0.0CysTrp: 0.0 ± 0.0
0.761CysTyr: 0.761 ± 0.223
0.0CysXaa: 0.0 ± 0.0
Asp
3.888AspAla: 3.888 ± 0.435
0.592AspCys: 0.592 ± 0.19
3.973AspAsp: 3.973 ± 0.762
4.987AspGlu: 4.987 ± 0.642
3.043AspPhe: 3.043 ± 0.446
4.733AspGly: 4.733 ± 0.733
1.437AspHis: 1.437 ± 0.385
3.381AspIle: 3.381 ± 0.492
4.311AspLys: 4.311 ± 0.621
6.762AspLeu: 6.762 ± 0.828
2.029AspMet: 2.029 ± 0.321
2.113AspAsn: 2.113 ± 0.52
1.437AspPro: 1.437 ± 0.394
2.198AspGln: 2.198 ± 0.496
2.282AspArg: 2.282 ± 0.598
3.043AspSer: 3.043 ± 0.498
3.127AspThr: 3.127 ± 0.423
3.888AspVal: 3.888 ± 0.599
0.592AspTrp: 0.592 ± 0.165
3.381AspTyr: 3.381 ± 0.706
0.0AspXaa: 0.0 ± 0.0
Glu
5.579GluAla: 5.579 ± 0.798
0.592GluCys: 0.592 ± 0.215
4.733GluAsp: 4.733 ± 0.692
6.339GluGlu: 6.339 ± 0.954
2.367GluPhe: 2.367 ± 0.391
5.325GluGly: 5.325 ± 0.589
1.268GluHis: 1.268 ± 0.343
3.127GluIle: 3.127 ± 0.55
6.17GluLys: 6.17 ± 0.647
8.537GluLeu: 8.537 ± 0.735
1.944GluMet: 1.944 ± 0.445
3.212GluAsn: 3.212 ± 0.441
1.944GluPro: 1.944 ± 0.492
3.381GluGln: 3.381 ± 0.499
2.451GluArg: 2.451 ± 0.489
2.705GluSer: 2.705 ± 0.455
4.564GluThr: 4.564 ± 0.687
4.649GluVal: 4.649 ± 0.63
0.845GluTrp: 0.845 ± 0.278
1.69GluTyr: 1.69 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
2.029PheAla: 2.029 ± 0.405
0.761PheCys: 0.761 ± 0.274
2.874PheAsp: 2.874 ± 0.492
2.874PheGlu: 2.874 ± 0.456
1.437PhePhe: 1.437 ± 0.388
3.212PheGly: 3.212 ± 0.505
0.761PheHis: 0.761 ± 0.258
1.86PheIle: 1.86 ± 0.416
3.296PheLys: 3.296 ± 0.749
2.705PheLeu: 2.705 ± 0.581
0.93PheMet: 0.93 ± 0.328
1.775PheAsn: 1.775 ± 0.261
0.592PhePro: 0.592 ± 0.244
1.606PheGln: 1.606 ± 0.332
1.521PheArg: 1.521 ± 0.292
2.029PheSer: 2.029 ± 0.354
2.451PheThr: 2.451 ± 0.378
1.521PheVal: 1.521 ± 0.406
0.761PheTrp: 0.761 ± 0.25
1.69PheTyr: 1.69 ± 0.391
0.0PheXaa: 0.0 ± 0.0
Gly
3.55GlyAla: 3.55 ± 0.73
0.169GlyCys: 0.169 ± 0.128
4.057GlyAsp: 4.057 ± 0.577
4.142GlyGlu: 4.142 ± 0.564
2.451GlyPhe: 2.451 ± 0.38
5.325GlyGly: 5.325 ± 0.774
1.944GlyHis: 1.944 ± 0.389
5.24GlyIle: 5.24 ± 0.711
5.41GlyLys: 5.41 ± 0.635
5.325GlyLeu: 5.325 ± 0.62
1.86GlyMet: 1.86 ± 0.298
3.465GlyAsn: 3.465 ± 0.522
0.761GlyPro: 0.761 ± 0.204
2.789GlyGln: 2.789 ± 0.421
4.142GlyArg: 4.142 ± 0.523
4.818GlySer: 4.818 ± 0.973
5.156GlyThr: 5.156 ± 0.623
4.649GlyVal: 4.649 ± 0.77
0.676GlyTrp: 0.676 ± 0.207
2.705GlyTyr: 2.705 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.93HisAla: 0.93 ± 0.28
0.169HisCys: 0.169 ± 0.12
1.606HisAsp: 1.606 ± 0.324
0.845HisGlu: 0.845 ± 0.323
0.845HisPhe: 0.845 ± 0.269
1.352HisGly: 1.352 ± 0.349
0.676HisHis: 0.676 ± 0.233
1.099HisIle: 1.099 ± 0.268
1.099HisLys: 1.099 ± 0.26
1.775HisLeu: 1.775 ± 0.379
0.423HisMet: 0.423 ± 0.185
1.183HisAsn: 1.183 ± 0.255
1.183HisPro: 1.183 ± 0.412
0.592HisGln: 0.592 ± 0.228
1.352HisArg: 1.352 ± 0.314
1.437HisSer: 1.437 ± 0.332
0.761HisThr: 0.761 ± 0.325
1.437HisVal: 1.437 ± 0.366
0.169HisTrp: 0.169 ± 0.123
0.845HisTyr: 0.845 ± 0.323
0.0HisXaa: 0.0 ± 0.0
Ile
4.311IleAla: 4.311 ± 0.533
0.592IleCys: 0.592 ± 0.243
4.987IleAsp: 4.987 ± 0.544
3.719IleGlu: 3.719 ± 0.61
1.69IlePhe: 1.69 ± 0.513
4.057IleGly: 4.057 ± 0.557
0.93IleHis: 0.93 ± 0.221
3.043IleIle: 3.043 ± 0.591
4.733IleLys: 4.733 ± 0.758
4.649IleLeu: 4.649 ± 0.569
1.099IleMet: 1.099 ± 0.273
2.029IleAsn: 2.029 ± 0.408
2.113IlePro: 2.113 ± 0.421
2.367IleGln: 2.367 ± 0.421
1.606IleArg: 1.606 ± 0.38
3.804IleSer: 3.804 ± 0.614
4.057IleThr: 4.057 ± 0.622
3.719IleVal: 3.719 ± 0.698
0.93IleTrp: 0.93 ± 0.285
2.029IleTyr: 2.029 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
8.199LysAla: 8.199 ± 0.79
0.507LysCys: 0.507 ± 0.164
3.888LysAsp: 3.888 ± 0.54
5.24LysGlu: 5.24 ± 0.579
1.86LysPhe: 1.86 ± 0.351
4.987LysGly: 4.987 ± 0.488
1.86LysHis: 1.86 ± 0.391
4.057LysIle: 4.057 ± 0.56
4.818LysLys: 4.818 ± 0.761
6.086LysLeu: 6.086 ± 0.86
1.268LysMet: 1.268 ± 0.362
2.62LysAsn: 2.62 ± 0.455
2.705LysPro: 2.705 ± 0.439
3.55LysGln: 3.55 ± 0.475
5.325LysArg: 5.325 ± 0.832
4.395LysSer: 4.395 ± 0.439
4.733LysThr: 4.733 ± 0.576
4.902LysVal: 4.902 ± 0.69
0.845LysTrp: 0.845 ± 0.252
2.029LysTyr: 2.029 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
6.677LeuAla: 6.677 ± 0.866
0.592LeuCys: 0.592 ± 0.26
5.41LeuAsp: 5.41 ± 0.494
7.607LeuGlu: 7.607 ± 0.726
2.62LeuPhe: 2.62 ± 0.513
5.748LeuGly: 5.748 ± 0.625
1.014LeuHis: 1.014 ± 0.358
4.311LeuIle: 4.311 ± 0.576
7.354LeuLys: 7.354 ± 0.826
7.776LeuLeu: 7.776 ± 0.756
2.282LeuMet: 2.282 ± 0.514
3.719LeuAsn: 3.719 ± 0.599
3.719LeuPro: 3.719 ± 0.638
3.381LeuGln: 3.381 ± 0.54
3.804LeuArg: 3.804 ± 0.645
8.452LeuSer: 8.452 ± 1.102
7.354LeuThr: 7.354 ± 0.814
6.001LeuVal: 6.001 ± 0.731
0.845LeuTrp: 0.845 ± 0.202
3.888LeuTyr: 3.888 ± 0.764
0.0LeuXaa: 0.0 ± 0.0
Met
2.367MetAla: 2.367 ± 0.416
0.085MetCys: 0.085 ± 0.081
1.183MetAsp: 1.183 ± 0.345
1.437MetGlu: 1.437 ± 0.321
0.93MetPhe: 0.93 ± 0.265
1.86MetGly: 1.86 ± 0.487
0.0MetHis: 0.0 ± 0.0
1.099MetIle: 1.099 ± 0.307
1.775MetLys: 1.775 ± 0.343
1.352MetLeu: 1.352 ± 0.404
0.507MetMet: 0.507 ± 0.176
0.761MetAsn: 0.761 ± 0.218
0.423MetPro: 0.423 ± 0.175
0.761MetGln: 0.761 ± 0.207
1.268MetArg: 1.268 ± 0.296
1.775MetSer: 1.775 ± 0.41
3.043MetThr: 3.043 ± 0.406
1.521MetVal: 1.521 ± 0.344
0.085MetTrp: 0.085 ± 0.081
0.423MetTyr: 0.423 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
3.635AsnAla: 3.635 ± 0.528
0.423AsnCys: 0.423 ± 0.188
2.029AsnAsp: 2.029 ± 0.443
3.043AsnGlu: 3.043 ± 0.409
1.606AsnPhe: 1.606 ± 0.487
4.733AsnGly: 4.733 ± 0.735
1.268AsnHis: 1.268 ± 0.346
1.606AsnIle: 1.606 ± 0.385
2.789AsnLys: 2.789 ± 0.464
3.55AsnLeu: 3.55 ± 0.492
1.099AsnMet: 1.099 ± 0.253
1.437AsnAsn: 1.437 ± 0.459
1.944AsnPro: 1.944 ± 0.351
2.029AsnGln: 2.029 ± 0.424
1.437AsnArg: 1.437 ± 0.269
3.212AsnSer: 3.212 ± 0.53
2.536AsnThr: 2.536 ± 0.519
2.367AsnVal: 2.367 ± 0.523
1.014AsnTrp: 1.014 ± 0.325
0.592AsnTyr: 0.592 ± 0.251
0.0AsnXaa: 0.0 ± 0.0
Pro
1.268ProAla: 1.268 ± 0.275
0.507ProCys: 0.507 ± 0.201
1.86ProAsp: 1.86 ± 0.43
2.198ProGlu: 2.198 ± 0.497
1.014ProPhe: 1.014 ± 0.327
1.099ProGly: 1.099 ± 0.381
1.014ProHis: 1.014 ± 0.247
1.521ProIle: 1.521 ± 0.377
2.874ProLys: 2.874 ± 0.413
2.958ProLeu: 2.958 ± 0.369
0.592ProMet: 0.592 ± 0.193
1.437ProAsn: 1.437 ± 0.379
1.099ProPro: 1.099 ± 0.343
1.521ProGln: 1.521 ± 0.326
1.099ProArg: 1.099 ± 0.269
2.705ProSer: 2.705 ± 0.525
2.536ProThr: 2.536 ± 0.548
1.69ProVal: 1.69 ± 0.326
0.423ProTrp: 0.423 ± 0.21
1.521ProTyr: 1.521 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
3.973GlnAla: 3.973 ± 0.559
0.254GlnCys: 0.254 ± 0.144
1.69GlnAsp: 1.69 ± 0.334
3.888GlnGlu: 3.888 ± 0.661
2.029GlnPhe: 2.029 ± 0.489
2.113GlnGly: 2.113 ± 0.408
0.845GlnHis: 0.845 ± 0.212
2.282GlnIle: 2.282 ± 0.467
2.958GlnLys: 2.958 ± 0.397
4.987GlnLeu: 4.987 ± 0.693
1.099GlnMet: 1.099 ± 0.325
2.282GlnAsn: 2.282 ± 0.499
1.352GlnPro: 1.352 ± 0.332
2.029GlnGln: 2.029 ± 0.523
1.521GlnArg: 1.521 ± 0.305
2.451GlnSer: 2.451 ± 0.394
2.958GlnThr: 2.958 ± 0.434
3.465GlnVal: 3.465 ± 0.478
0.676GlnTrp: 0.676 ± 0.276
0.845GlnTyr: 0.845 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
2.367ArgAla: 2.367 ± 0.516
0.676ArgCys: 0.676 ± 0.246
2.367ArgAsp: 2.367 ± 0.522
2.789ArgGlu: 2.789 ± 0.438
2.029ArgPhe: 2.029 ± 0.453
2.367ArgGly: 2.367 ± 0.438
1.099ArgHis: 1.099 ± 0.318
1.944ArgIle: 1.944 ± 0.437
4.057ArgLys: 4.057 ± 0.657
5.41ArgLeu: 5.41 ± 0.667
0.761ArgMet: 0.761 ± 0.244
1.775ArgAsn: 1.775 ± 0.38
1.352ArgPro: 1.352 ± 0.354
3.635ArgGln: 3.635 ± 0.549
2.367ArgArg: 2.367 ± 0.524
2.367ArgSer: 2.367 ± 0.342
2.029ArgThr: 2.029 ± 0.37
3.804ArgVal: 3.804 ± 0.543
0.845ArgTrp: 0.845 ± 0.276
1.69ArgTyr: 1.69 ± 0.463
0.0ArgXaa: 0.0 ± 0.0
Ser
4.987SerAla: 4.987 ± 0.747
0.338SerCys: 0.338 ± 0.178
4.395SerAsp: 4.395 ± 0.675
4.226SerGlu: 4.226 ± 0.711
3.043SerPhe: 3.043 ± 0.486
4.818SerGly: 4.818 ± 0.777
1.437SerHis: 1.437 ± 0.267
4.142SerIle: 4.142 ± 0.842
3.888SerLys: 3.888 ± 0.561
6.17SerLeu: 6.17 ± 0.821
0.761SerMet: 0.761 ± 0.27
2.367SerAsn: 2.367 ± 0.432
2.451SerPro: 2.451 ± 0.379
2.874SerGln: 2.874 ± 0.613
2.62SerArg: 2.62 ± 0.416
5.156SerSer: 5.156 ± 0.918
4.48SerThr: 4.48 ± 0.794
5.24SerVal: 5.24 ± 0.5
1.352SerTrp: 1.352 ± 0.228
2.536SerTyr: 2.536 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
4.818ThrAla: 4.818 ± 0.698
0.423ThrCys: 0.423 ± 0.195
3.719ThrAsp: 3.719 ± 0.671
4.987ThrGlu: 4.987 ± 0.516
3.127ThrPhe: 3.127 ± 0.701
3.973ThrGly: 3.973 ± 0.601
1.014ThrHis: 1.014 ± 0.249
5.663ThrIle: 5.663 ± 1.038
5.24ThrLys: 5.24 ± 0.537
6.001ThrLeu: 6.001 ± 0.847
1.352ThrMet: 1.352 ± 0.309
3.212ThrAsn: 3.212 ± 0.412
2.62ThrPro: 2.62 ± 0.586
2.029ThrGln: 2.029 ± 0.433
2.282ThrArg: 2.282 ± 0.399
6.17ThrSer: 6.17 ± 0.995
4.818ThrThr: 4.818 ± 0.542
5.24ThrVal: 5.24 ± 0.636
1.437ThrTrp: 1.437 ± 0.404
2.198ThrTyr: 2.198 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
3.381ValAla: 3.381 ± 0.522
0.507ValCys: 0.507 ± 0.199
3.804ValAsp: 3.804 ± 0.527
5.494ValGlu: 5.494 ± 0.76
2.198ValPhe: 2.198 ± 0.504
3.635ValGly: 3.635 ± 0.486
0.93ValHis: 0.93 ± 0.293
4.311ValIle: 4.311 ± 0.679
4.311ValLys: 4.311 ± 0.606
7.1ValLeu: 7.1 ± 0.899
1.099ValMet: 1.099 ± 0.327
2.451ValAsn: 2.451 ± 0.367
1.775ValPro: 1.775 ± 0.405
1.944ValGln: 1.944 ± 0.319
3.127ValArg: 3.127 ± 0.591
5.071ValSer: 5.071 ± 0.837
6.677ValThr: 6.677 ± 0.755
4.057ValVal: 4.057 ± 0.664
1.268ValTrp: 1.268 ± 0.375
2.958ValTyr: 2.958 ± 0.543
0.0ValXaa: 0.0 ± 0.0
Trp
0.93TrpAla: 0.93 ± 0.26
0.169TrpCys: 0.169 ± 0.121
0.676TrpAsp: 0.676 ± 0.192
1.014TrpGlu: 1.014 ± 0.255
0.845TrpPhe: 0.845 ± 0.285
0.592TrpGly: 0.592 ± 0.183
0.254TrpHis: 0.254 ± 0.116
0.507TrpIle: 0.507 ± 0.212
0.676TrpLys: 0.676 ± 0.251
1.268TrpLeu: 1.268 ± 0.219
0.507TrpMet: 0.507 ± 0.17
1.521TrpAsn: 1.521 ± 0.347
0.085TrpPro: 0.085 ± 0.075
1.014TrpGln: 1.014 ± 0.22
0.93TrpArg: 0.93 ± 0.277
0.761TrpSer: 0.761 ± 0.25
1.014TrpThr: 1.014 ± 0.27
0.845TrpVal: 0.845 ± 0.237
0.254TrpTrp: 0.254 ± 0.128
0.169TrpTyr: 0.169 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.874TyrAla: 2.874 ± 0.457
1.183TyrCys: 1.183 ± 0.393
2.874TyrAsp: 2.874 ± 0.585
2.282TyrGlu: 2.282 ± 0.367
1.183TyrPhe: 1.183 ± 0.3
2.789TyrGly: 2.789 ± 0.551
0.845TyrHis: 0.845 ± 0.294
1.86TyrIle: 1.86 ± 0.444
1.775TyrLys: 1.775 ± 0.356
3.55TyrLeu: 3.55 ± 0.707
0.507TyrMet: 0.507 ± 0.2
1.099TyrAsn: 1.099 ± 0.318
1.183TyrPro: 1.183 ± 0.226
2.113TyrGln: 2.113 ± 0.449
1.944TyrArg: 1.944 ± 0.338
2.113TyrSer: 2.113 ± 0.397
2.536TyrThr: 2.536 ± 0.363
2.113TyrVal: 2.113 ± 0.391
0.338TyrTrp: 0.338 ± 0.153
1.268TyrTyr: 1.268 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11832 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski