Amino acid dipepetide frequency for Streptococcus phage Javan393

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.19AlaAla: 1.19 ± 0.364
0.555AlaCys: 0.555 ± 0.216
5.157AlaAsp: 5.157 ± 0.698
4.284AlaGlu: 4.284 ± 0.523
1.825AlaPhe: 1.825 ± 0.363
4.046AlaGly: 4.046 ± 0.793
0.714AlaHis: 0.714 ± 0.283
5.315AlaIle: 5.315 ± 0.723
6.664AlaLys: 6.664 ± 1.067
5.157AlaLeu: 5.157 ± 1.167
2.221AlaMet: 2.221 ± 0.725
3.649AlaAsn: 3.649 ± 0.702
1.587AlaPro: 1.587 ± 0.336
2.459AlaGln: 2.459 ± 0.717
2.856AlaArg: 2.856 ± 0.529
4.284AlaSer: 4.284 ± 1.114
4.681AlaThr: 4.681 ± 0.782
3.253AlaVal: 3.253 ± 0.558
0.793AlaTrp: 0.793 ± 0.281
2.777AlaTyr: 2.777 ± 0.519
0.0AlaXaa: 0.0 ± 0.0
Cys
0.159CysAla: 0.159 ± 0.122
0.079CysCys: 0.079 ± 0.084
0.397CysAsp: 0.397 ± 0.211
0.476CysGlu: 0.476 ± 0.236
0.238CysPhe: 0.238 ± 0.139
0.238CysGly: 0.238 ± 0.146
0.317CysHis: 0.317 ± 0.162
0.238CysIle: 0.238 ± 0.161
0.476CysLys: 0.476 ± 0.213
0.317CysLeu: 0.317 ± 0.147
0.079CysMet: 0.079 ± 0.1
0.476CysAsn: 0.476 ± 0.203
0.238CysPro: 0.238 ± 0.14
0.0CysGln: 0.0 ± 0.0
0.317CysArg: 0.317 ± 0.19
0.238CysSer: 0.238 ± 0.123
0.238CysThr: 0.238 ± 0.139
0.555CysVal: 0.555 ± 0.237
0.238CysTrp: 0.238 ± 0.158
0.238CysTyr: 0.238 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
3.57AspAla: 3.57 ± 0.504
0.555AspCys: 0.555 ± 0.215
3.887AspAsp: 3.887 ± 0.789
4.443AspGlu: 4.443 ± 0.681
3.332AspPhe: 3.332 ± 0.481
5.871AspGly: 5.871 ± 1.049
0.397AspHis: 0.397 ± 0.176
4.76AspIle: 4.76 ± 0.848
6.664AspLys: 6.664 ± 0.797
5.633AspLeu: 5.633 ± 0.847
1.269AspMet: 1.269 ± 0.33
3.491AspAsn: 3.491 ± 0.396
1.587AspPro: 1.587 ± 0.366
1.269AspGln: 1.269 ± 0.582
2.459AspArg: 2.459 ± 0.42
4.125AspSer: 4.125 ± 0.502
3.411AspThr: 3.411 ± 0.539
3.332AspVal: 3.332 ± 0.529
0.952AspTrp: 0.952 ± 0.373
3.967AspTyr: 3.967 ± 0.627
0.0AspXaa: 0.0 ± 0.0
Glu
3.729GluAla: 3.729 ± 0.457
0.159GluCys: 0.159 ± 0.105
4.046GluAsp: 4.046 ± 0.76
7.378GluGlu: 7.378 ± 0.985
2.697GluPhe: 2.697 ± 0.404
2.856GluGly: 2.856 ± 0.525
0.793GluHis: 0.793 ± 0.266
6.664GluIle: 6.664 ± 0.924
5.95GluLys: 5.95 ± 0.92
6.505GluLeu: 6.505 ± 0.767
2.221GluMet: 2.221 ± 0.507
3.887GluAsn: 3.887 ± 0.671
1.19GluPro: 1.19 ± 0.365
3.094GluGln: 3.094 ± 0.424
2.459GluArg: 2.459 ± 0.458
4.363GluSer: 4.363 ± 0.596
3.967GluThr: 3.967 ± 0.537
5.871GluVal: 5.871 ± 0.842
1.031GluTrp: 1.031 ± 0.355
3.173GluTyr: 3.173 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
1.983PheAla: 1.983 ± 0.375
0.397PheCys: 0.397 ± 0.158
3.649PheAsp: 3.649 ± 0.574
3.015PheGlu: 3.015 ± 0.575
1.349PhePhe: 1.349 ± 0.468
2.697PheGly: 2.697 ± 0.397
0.793PheHis: 0.793 ± 0.185
2.063PheIle: 2.063 ± 0.463
3.094PheLys: 3.094 ± 0.4
2.856PheLeu: 2.856 ± 0.606
1.745PheMet: 1.745 ± 0.452
3.253PheAsn: 3.253 ± 0.457
1.031PhePro: 1.031 ± 0.229
1.031PheGln: 1.031 ± 0.293
1.111PheArg: 1.111 ± 0.336
2.38PheSer: 2.38 ± 0.495
2.142PheThr: 2.142 ± 0.513
2.618PheVal: 2.618 ± 0.568
0.476PheTrp: 0.476 ± 0.191
1.19PheTyr: 1.19 ± 0.317
0.0PheXaa: 0.0 ± 0.0
Gly
4.363GlyAla: 4.363 ± 0.925
0.079GlyCys: 0.079 ± 0.091
3.411GlyAsp: 3.411 ± 0.495
3.094GlyGlu: 3.094 ± 0.447
2.539GlyPhe: 2.539 ± 0.473
5.236GlyGly: 5.236 ± 1.028
0.714GlyHis: 0.714 ± 0.268
3.967GlyIle: 3.967 ± 0.462
5.553GlyLys: 5.553 ± 0.632
6.188GlyLeu: 6.188 ± 1.138
2.777GlyMet: 2.777 ± 0.622
3.015GlyAsn: 3.015 ± 0.418
0.873GlyPro: 0.873 ± 0.26
2.539GlyGln: 2.539 ± 0.42
2.063GlyArg: 2.063 ± 0.376
3.808GlySer: 3.808 ± 0.923
5.95GlyThr: 5.95 ± 1.426
4.125GlyVal: 4.125 ± 0.472
1.031GlyTrp: 1.031 ± 0.3
2.935GlyTyr: 2.935 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.476HisAla: 0.476 ± 0.179
0.159HisCys: 0.159 ± 0.109
0.714HisAsp: 0.714 ± 0.272
0.793HisGlu: 0.793 ± 0.329
0.555HisPhe: 0.555 ± 0.225
1.349HisGly: 1.349 ± 0.393
0.159HisHis: 0.159 ± 0.149
1.031HisIle: 1.031 ± 0.302
0.952HisLys: 0.952 ± 0.241
1.111HisLeu: 1.111 ± 0.282
0.317HisMet: 0.317 ± 0.159
0.714HisAsn: 0.714 ± 0.233
0.317HisPro: 0.317 ± 0.161
0.317HisGln: 0.317 ± 0.151
0.476HisArg: 0.476 ± 0.206
0.952HisSer: 0.952 ± 0.316
1.428HisThr: 1.428 ± 0.374
0.952HisVal: 0.952 ± 0.435
0.079HisTrp: 0.079 ± 0.079
0.397HisTyr: 0.397 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
4.76IleAla: 4.76 ± 0.634
0.476IleCys: 0.476 ± 0.195
5.474IleAsp: 5.474 ± 0.686
6.347IleGlu: 6.347 ± 0.976
2.142IlePhe: 2.142 ± 0.551
3.253IleGly: 3.253 ± 0.554
0.873IleHis: 0.873 ± 0.247
4.522IleIle: 4.522 ± 0.861
6.347IleLys: 6.347 ± 0.691
5.633IleLeu: 5.633 ± 0.673
1.269IleMet: 1.269 ± 0.412
4.522IleAsn: 4.522 ± 0.813
2.459IlePro: 2.459 ± 0.45
2.38IleGln: 2.38 ± 0.424
2.063IleArg: 2.063 ± 0.419
4.601IleSer: 4.601 ± 0.767
4.601IleThr: 4.601 ± 0.516
3.808IleVal: 3.808 ± 0.688
0.555IleTrp: 0.555 ± 0.179
2.539IleTyr: 2.539 ± 0.45
0.0IleXaa: 0.0 ± 0.0
Lys
6.505LysAla: 6.505 ± 1.038
0.238LysCys: 0.238 ± 0.13
4.839LysAsp: 4.839 ± 0.641
8.33LysGlu: 8.33 ± 1.043
2.935LysPhe: 2.935 ± 0.607
5.236LysGly: 5.236 ± 0.579
1.587LysHis: 1.587 ± 0.365
4.919LysIle: 4.919 ± 0.965
5.871LysLys: 5.871 ± 0.853
6.664LysLeu: 6.664 ± 0.796
1.904LysMet: 1.904 ± 0.381
6.267LysAsn: 6.267 ± 0.626
2.38LysPro: 2.38 ± 0.483
3.173LysGln: 3.173 ± 0.682
2.459LysArg: 2.459 ± 0.437
5.791LysSer: 5.791 ± 0.664
7.457LysThr: 7.457 ± 0.853
5.871LysVal: 5.871 ± 0.652
1.269LysTrp: 1.269 ± 0.316
4.046LysTyr: 4.046 ± 0.607
0.0LysXaa: 0.0 ± 0.0
Leu
5.077LeuAla: 5.077 ± 0.591
0.555LeuCys: 0.555 ± 0.214
5.95LeuAsp: 5.95 ± 0.62
6.109LeuGlu: 6.109 ± 0.928
2.38LeuPhe: 2.38 ± 0.573
4.363LeuGly: 4.363 ± 0.815
0.635LeuHis: 0.635 ± 0.262
5.791LeuIle: 5.791 ± 0.677
7.299LeuLys: 7.299 ± 1.078
7.061LeuLeu: 7.061 ± 0.77
1.745LeuMet: 1.745 ± 0.357
5.871LeuAsn: 5.871 ± 0.553
2.697LeuPro: 2.697 ± 0.516
3.015LeuGln: 3.015 ± 0.421
3.253LeuArg: 3.253 ± 0.518
6.664LeuSer: 6.664 ± 0.945
6.347LeuThr: 6.347 ± 0.72
4.443LeuVal: 4.443 ± 0.727
0.714LeuTrp: 0.714 ± 0.35
2.539LeuTyr: 2.539 ± 0.599
0.0LeuXaa: 0.0 ± 0.0
Met
1.904MetAla: 1.904 ± 0.486
0.159MetCys: 0.159 ± 0.116
1.269MetAsp: 1.269 ± 0.309
2.142MetGlu: 2.142 ± 0.445
0.793MetPhe: 0.793 ± 0.295
1.428MetGly: 1.428 ± 0.332
0.238MetHis: 0.238 ± 0.112
2.063MetIle: 2.063 ± 0.416
2.777MetLys: 2.777 ± 0.513
1.983MetLeu: 1.983 ± 0.417
0.635MetMet: 0.635 ± 0.25
1.269MetAsn: 1.269 ± 0.388
1.111MetPro: 1.111 ± 0.275
1.269MetGln: 1.269 ± 0.378
1.19MetArg: 1.19 ± 0.342
1.904MetSer: 1.904 ± 0.498
1.745MetThr: 1.745 ± 0.421
1.428MetVal: 1.428 ± 0.383
0.238MetTrp: 0.238 ± 0.139
0.635MetTyr: 0.635 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
4.919AsnAla: 4.919 ± 1.444
0.159AsnCys: 0.159 ± 0.118
3.967AsnAsp: 3.967 ± 0.566
4.205AsnGlu: 4.205 ± 0.683
2.459AsnPhe: 2.459 ± 0.557
5.791AsnGly: 5.791 ± 0.829
0.555AsnHis: 0.555 ± 0.247
3.332AsnIle: 3.332 ± 0.465
4.76AsnLys: 4.76 ± 0.63
3.967AsnLeu: 3.967 ± 0.589
1.745AsnMet: 1.745 ± 0.366
3.729AsnAsn: 3.729 ± 0.514
2.221AsnPro: 2.221 ± 0.447
3.015AsnGln: 3.015 ± 0.51
1.666AsnArg: 1.666 ± 0.327
4.205AsnSer: 4.205 ± 0.556
2.935AsnThr: 2.935 ± 0.538
3.808AsnVal: 3.808 ± 0.557
0.952AsnTrp: 0.952 ± 0.371
2.539AsnTyr: 2.539 ± 0.649
0.0AsnXaa: 0.0 ± 0.0
Pro
1.666ProAla: 1.666 ± 0.336
0.0ProCys: 0.0 ± 0.0
1.983ProAsp: 1.983 ± 0.331
1.825ProGlu: 1.825 ± 0.396
1.825ProPhe: 1.825 ± 0.412
0.714ProGly: 0.714 ± 0.302
0.397ProHis: 0.397 ± 0.179
1.904ProIle: 1.904 ± 0.344
2.142ProLys: 2.142 ± 0.444
3.015ProLeu: 3.015 ± 0.698
0.397ProMet: 0.397 ± 0.188
1.666ProAsn: 1.666 ± 0.39
0.793ProPro: 0.793 ± 0.244
1.19ProGln: 1.19 ± 0.287
0.873ProArg: 0.873 ± 0.284
1.904ProSer: 1.904 ± 0.585
2.063ProThr: 2.063 ± 0.359
2.063ProVal: 2.063 ± 0.388
0.079ProTrp: 0.079 ± 0.066
1.587ProTyr: 1.587 ± 0.389
0.0ProXaa: 0.0 ± 0.0
Gln
2.777GlnAla: 2.777 ± 0.521
0.476GlnCys: 0.476 ± 0.209
1.269GlnAsp: 1.269 ± 0.34
2.301GlnGlu: 2.301 ± 0.477
2.063GlnPhe: 2.063 ± 0.549
2.063GlnGly: 2.063 ± 0.555
0.555GlnHis: 0.555 ± 0.198
2.459GlnIle: 2.459 ± 0.441
3.094GlnLys: 3.094 ± 0.608
3.967GlnLeu: 3.967 ± 0.658
1.031GlnMet: 1.031 ± 0.311
2.459GlnAsn: 2.459 ± 0.389
0.952GlnPro: 0.952 ± 0.297
1.19GlnGln: 1.19 ± 0.348
1.269GlnArg: 1.269 ± 0.292
2.697GlnSer: 2.697 ± 0.457
3.015GlnThr: 3.015 ± 0.604
1.587GlnVal: 1.587 ± 0.338
0.397GlnTrp: 0.397 ± 0.192
1.111GlnTyr: 1.111 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
1.825ArgAla: 1.825 ± 0.302
0.0ArgCys: 0.0 ± 0.0
2.301ArgAsp: 2.301 ± 0.482
2.221ArgGlu: 2.221 ± 0.412
1.269ArgPhe: 1.269 ± 0.303
1.587ArgGly: 1.587 ± 0.33
0.873ArgHis: 0.873 ± 0.263
2.697ArgIle: 2.697 ± 0.47
3.411ArgLys: 3.411 ± 0.602
2.935ArgLeu: 2.935 ± 0.498
1.111ArgMet: 1.111 ± 0.287
2.063ArgAsn: 2.063 ± 0.322
0.952ArgPro: 0.952 ± 0.262
1.745ArgGln: 1.745 ± 0.405
1.111ArgArg: 1.111 ± 0.383
1.587ArgSer: 1.587 ± 0.354
2.38ArgThr: 2.38 ± 0.569
2.063ArgVal: 2.063 ± 0.408
0.397ArgTrp: 0.397 ± 0.156
2.063ArgTyr: 2.063 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
4.681SerAla: 4.681 ± 0.964
0.159SerCys: 0.159 ± 0.13
5.315SerAsp: 5.315 ± 0.869
3.411SerGlu: 3.411 ± 0.514
3.094SerPhe: 3.094 ± 0.485
4.998SerGly: 4.998 ± 0.976
1.111SerHis: 1.111 ± 0.391
4.601SerIle: 4.601 ± 0.503
6.505SerLys: 6.505 ± 0.899
4.363SerLeu: 4.363 ± 0.846
1.349SerMet: 1.349 ± 0.572
4.363SerAsn: 4.363 ± 0.649
1.666SerPro: 1.666 ± 0.347
2.935SerGln: 2.935 ± 0.484
2.618SerArg: 2.618 ± 0.489
4.125SerSer: 4.125 ± 0.847
4.522SerThr: 4.522 ± 0.817
4.205SerVal: 4.205 ± 0.546
0.635SerTrp: 0.635 ± 0.318
2.697SerTyr: 2.697 ± 0.592
0.0SerXaa: 0.0 ± 0.0
Thr
5.791ThrAla: 5.791 ± 1.23
0.397ThrCys: 0.397 ± 0.184
3.887ThrAsp: 3.887 ± 0.553
3.887ThrGlu: 3.887 ± 0.672
2.301ThrPhe: 2.301 ± 0.389
5.077ThrGly: 5.077 ± 0.666
1.031ThrHis: 1.031 ± 0.321
5.553ThrIle: 5.553 ± 1.093
6.664ThrLys: 6.664 ± 0.809
6.109ThrLeu: 6.109 ± 0.943
1.745ThrMet: 1.745 ± 0.439
2.935ThrAsn: 2.935 ± 0.453
2.459ThrPro: 2.459 ± 0.333
2.697ThrGln: 2.697 ± 0.599
2.063ThrArg: 2.063 ± 0.407
4.205ThrSer: 4.205 ± 0.955
5.077ThrThr: 5.077 ± 0.795
4.839ThrVal: 4.839 ± 0.832
1.031ThrTrp: 1.031 ± 0.317
3.491ThrTyr: 3.491 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
4.76ValAla: 4.76 ± 0.621
0.159ValCys: 0.159 ± 0.129
4.443ValAsp: 4.443 ± 0.503
4.205ValGlu: 4.205 ± 0.751
2.618ValPhe: 2.618 ± 0.461
3.967ValGly: 3.967 ± 0.48
0.635ValHis: 0.635 ± 0.228
2.539ValIle: 2.539 ± 0.48
5.236ValLys: 5.236 ± 0.787
5.157ValLeu: 5.157 ± 0.665
1.587ValMet: 1.587 ± 0.398
4.284ValAsn: 4.284 ± 0.698
2.38ValPro: 2.38 ± 0.4
0.873ValGln: 0.873 ± 0.203
2.142ValArg: 2.142 ± 0.472
5.95ValSer: 5.95 ± 0.683
4.443ValThr: 4.443 ± 0.488
3.332ValVal: 3.332 ± 0.53
0.476ValTrp: 0.476 ± 0.235
1.666ValTyr: 1.666 ± 0.295
0.0ValXaa: 0.0 ± 0.0
Trp
0.714TrpAla: 0.714 ± 0.217
0.159TrpCys: 0.159 ± 0.126
0.555TrpAsp: 0.555 ± 0.193
1.031TrpGlu: 1.031 ± 0.332
0.397TrpPhe: 0.397 ± 0.181
0.635TrpGly: 0.635 ± 0.273
0.317TrpHis: 0.317 ± 0.169
1.111TrpIle: 1.111 ± 0.293
0.873TrpLys: 0.873 ± 0.307
0.873TrpLeu: 0.873 ± 0.236
0.079TrpMet: 0.079 ± 0.065
0.555TrpAsn: 0.555 ± 0.257
0.159TrpPro: 0.159 ± 0.133
0.476TrpGln: 0.476 ± 0.24
0.793TrpArg: 0.793 ± 0.313
0.873TrpSer: 0.873 ± 0.381
1.269TrpThr: 1.269 ± 0.298
0.635TrpVal: 0.635 ± 0.239
0.079TrpTrp: 0.079 ± 0.084
0.397TrpTyr: 0.397 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.618TyrAla: 2.618 ± 0.491
0.635TyrCys: 0.635 ± 0.241
2.459TyrAsp: 2.459 ± 0.438
2.539TyrGlu: 2.539 ± 0.485
2.063TyrPhe: 2.063 ± 0.447
2.856TyrGly: 2.856 ± 0.446
0.555TyrHis: 0.555 ± 0.3
3.173TyrIle: 3.173 ± 0.657
3.411TyrLys: 3.411 ± 0.461
3.015TyrLeu: 3.015 ± 0.616
0.873TyrMet: 0.873 ± 0.249
2.459TyrAsn: 2.459 ± 0.326
1.031TyrPro: 1.031 ± 0.296
2.063TyrGln: 2.063 ± 0.37
1.349TyrArg: 1.349 ± 0.333
2.697TyrSer: 2.697 ± 0.522
3.57TyrThr: 3.57 ± 0.62
2.063TyrVal: 2.063 ± 0.4
0.555TyrTrp: 0.555 ± 0.224
0.793TyrTyr: 0.793 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12606 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski