Amino acid dipeptide frequency for Streptococcus phage Javan616

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
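The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function names, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of dipeptides are all hypothetical choices.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (assumed convention)

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences shorter than 2?)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


# Toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKAZ"]
freqs = dipeptide_per_mille(toy)
```

With values from the table, `classify(3.262)` (AlaAla) gives "over" and `classify(0.167)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.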

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
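As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 0.25%. The helper names are made up for illustration; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille to percent: divide by 10."""
    return value / 10.0


ala_ala = 3.262              # AlaAla frequency from the table, in per mille
uniform_per_mille = 0.25 * 10.0  # 0.25% expressed in per mille

fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
# Ratio above 1 means the dipeptide is more frequent than the uniform expectation.
ratio = ala_ala / uniform_per_mille
```

Here `ratio` is about 1.3, so AlaAla occurs roughly 30% more often than a uniform distribution over 400 dipeptides would predict.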

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.262 ± 1.038
AlaCys: 0.167 ± 0.123
AlaAsp: 3.68 ± 0.555
AlaGlu: 6.942 ± 0.812
AlaPhe: 2.676 ± 0.832
AlaGly: 4.433 ± 1.053
AlaHis: 0.92 ± 0.276
AlaIle: 5.353 ± 0.848
AlaLys: 6.774 ± 0.569
AlaLeu: 6.189 ± 1.359
AlaMet: 2.76 ± 0.551
AlaAsn: 4.767 ± 1.093
AlaPro: 2.007 ± 0.416
AlaGln: 4.265 ± 0.816
AlaArg: 2.091 ± 0.477
AlaSer: 4.433 ± 1.316
AlaThr: 4.683 ± 1.045
AlaVal: 6.022 ± 0.825
AlaTrp: 0.669 ± 0.221
AlaTyr: 2.509 ± 0.494
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.084 ± 0.092
CysCys: 0.0 ± 0.0
CysAsp: 0.251 ± 0.147
CysGlu: 0.084 ± 0.092
CysPhe: 0.167 ± 0.181
CysGly: 0.502 ± 0.34
CysHis: 0.167 ± 0.111
CysIle: 0.167 ± 0.128
CysLys: 0.335 ± 0.201
CysLeu: 0.335 ± 0.252
CysMet: 0.167 ± 0.143
CysAsn: 0.167 ± 0.118
CysPro: 0.167 ± 0.112
CysGln: 0.502 ± 0.265
CysArg: 0.502 ± 0.228
CysSer: 0.502 ± 0.368
CysThr: 0.167 ± 0.126
CysVal: 0.251 ± 0.148
CysTrp: 0.084 ± 0.066
CysTyr: 0.167 ± 0.135
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.433 ± 0.839
AspCys: 0.084 ± 0.095
AspAsp: 3.094 ± 0.578
AspGlu: 5.018 ± 0.769
AspPhe: 3.429 ± 0.62
AspGly: 3.345 ± 0.596
AspHis: 0.335 ± 0.139
AspIle: 4.182 ± 0.616
AspLys: 5.854 ± 1.012
AspLeu: 5.018 ± 0.752
AspMet: 1.505 ± 0.367
AspAsn: 3.931 ± 0.772
AspPro: 1.087 ± 0.333
AspGln: 1.673 ± 0.415
AspArg: 1.84 ± 0.448
AspSer: 3.763 ± 0.484
AspThr: 2.593 ± 0.563
AspVal: 2.927 ± 0.616
AspTrp: 1.254 ± 0.313
AspTyr: 3.178 ± 0.532
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.018 ± 0.808
GluCys: 0.418 ± 0.285
GluAsp: 3.345 ± 0.627
GluGlu: 6.607 ± 1.458
GluPhe: 2.76 ± 0.514
GluGly: 3.68 ± 0.62
GluHis: 1.422 ± 0.356
GluIle: 4.683 ± 0.622
GluLys: 5.687 ± 1.047
GluLeu: 7.025 ± 1.183
GluMet: 2.676 ± 0.62
GluAsn: 4.433 ± 0.933
GluPro: 1.505 ± 0.34
GluGln: 3.429 ± 0.552
GluArg: 4.433 ± 1.001
GluSer: 3.596 ± 0.538
GluThr: 4.6 ± 0.822
GluVal: 4.265 ± 0.819
GluTrp: 0.585 ± 0.26
GluTyr: 3.262 ± 0.55
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.924 ± 0.515
PheCys: 0.251 ± 0.207
PheAsp: 2.342 ± 0.432
PheGlu: 3.596 ± 0.547
PhePhe: 1.171 ± 0.282
PheGly: 2.676 ± 0.414
PheHis: 0.251 ± 0.14
PheIle: 3.262 ± 0.63
PheLys: 2.844 ± 0.599
PheLeu: 2.676 ± 0.558
PheMet: 0.836 ± 0.345
PheAsn: 3.094 ± 0.456
PhePro: 0.585 ± 0.178
PheGln: 1.004 ± 0.336
PheArg: 1.087 ± 0.281
PheSer: 2.676 ± 0.532
PheThr: 2.174 ± 0.36
PheVal: 2.342 ± 0.329
PheTrp: 0.335 ± 0.174
PheTyr: 1.087 ± 0.38
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.182 ± 0.971
GlyCys: 0.084 ± 0.091
GlyAsp: 4.265 ± 0.816
GlyGlu: 4.182 ± 0.564
GlyPhe: 3.262 ± 0.745
GlyGly: 2.509 ± 0.455
GlyHis: 0.585 ± 0.197
GlyIle: 4.098 ± 0.95
GlyLys: 4.683 ± 0.712
GlyLeu: 5.687 ± 1.062
GlyMet: 1.338 ± 0.303
GlyAsn: 2.593 ± 0.555
GlyPro: 0.502 ± 0.235
GlyGln: 2.342 ± 0.685
GlyArg: 2.927 ± 0.6
GlySer: 4.014 ± 0.686
GlyThr: 3.931 ± 0.651
GlyVal: 4.683 ± 0.918
GlyTrp: 0.335 ± 0.158
GlyTyr: 2.593 ± 0.379
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.502 ± 0.247
HisCys: 0.167 ± 0.152
HisAsp: 0.669 ± 0.277
HisGlu: 0.585 ± 0.249
HisPhe: 0.92 ± 0.288
HisGly: 1.087 ± 0.386
HisHis: 0.167 ± 0.129
HisIle: 1.505 ± 0.349
HisLys: 1.422 ± 0.404
HisLeu: 1.254 ± 0.393
HisMet: 0.502 ± 0.25
HisAsn: 0.669 ± 0.201
HisPro: 0.753 ± 0.277
HisGln: 0.502 ± 0.267
HisArg: 0.418 ± 0.206
HisSer: 1.087 ± 0.329
HisThr: 0.753 ± 0.266
HisVal: 0.502 ± 0.222
HisTrp: 0.084 ± 0.066
HisTyr: 1.004 ± 0.346
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.934 ± 0.735
IleCys: 0.167 ± 0.126
IleAsp: 5.018 ± 0.676
IleGlu: 5.603 ± 0.795
IlePhe: 2.174 ± 0.373
IleGly: 4.014 ± 0.784
IleHis: 0.836 ± 0.247
IleIle: 3.847 ± 0.619
IleLys: 5.938 ± 0.872
IleLeu: 5.436 ± 1.039
IleMet: 1.171 ± 0.272
IleAsn: 4.014 ± 0.443
IlePro: 2.844 ± 0.381
IleGln: 3.011 ± 0.589
IleArg: 3.011 ± 0.539
IleSer: 5.52 ± 0.985
IleThr: 5.603 ± 0.949
IleVal: 4.349 ± 0.701
IleTrp: 0.418 ± 0.21
IleTyr: 2.509 ± 0.513
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.022 ± 0.797
LysCys: 0.502 ± 0.227
LysAsp: 4.014 ± 0.63
LysGlu: 6.44 ± 0.998
LysPhe: 2.007 ± 0.463
LysGly: 4.851 ± 0.719
LysHis: 1.756 ± 0.524
LysIle: 6.691 ± 0.683
LysLys: 5.52 ± 1.213
LysLeu: 6.858 ± 0.71
LysMet: 1.756 ± 0.553
LysAsn: 3.429 ± 0.678
LysPro: 1.589 ± 0.375
LysGln: 3.094 ± 0.612
LysArg: 4.516 ± 0.925
LysSer: 6.356 ± 0.825
LysThr: 7.025 ± 0.768
LysVal: 5.018 ± 0.777
LysTrp: 1.087 ± 0.283
LysTyr: 2.927 ± 0.802
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.436 ± 0.632
LeuCys: 0.335 ± 0.229
LeuAsp: 6.523 ± 0.918
LeuGlu: 5.269 ± 0.974
LeuPhe: 2.509 ± 0.524
LeuGly: 5.603 ± 1.261
LeuHis: 1.422 ± 0.573
LeuIle: 4.851 ± 0.656
LeuLys: 5.603 ± 0.951
LeuLeu: 5.938 ± 0.779
LeuMet: 2.844 ± 0.485
LeuAsn: 5.771 ± 0.655
LeuPro: 2.844 ± 0.632
LeuGln: 2.425 ± 0.501
LeuArg: 3.429 ± 0.76
LeuSer: 7.694 ± 1.136
LeuThr: 5.436 ± 0.643
LeuVal: 5.102 ± 0.56
LeuTrp: 0.418 ± 0.22
LeuTyr: 2.342 ± 0.649
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.767 ± 1.169
MetCys: 0.0 ± 0.0
MetAsp: 1.422 ± 0.504
MetGlu: 2.091 ± 0.643
MetPhe: 0.669 ± 0.294
MetGly: 0.753 ± 0.221
MetHis: 0.502 ± 0.247
MetIle: 1.589 ± 0.385
MetLys: 2.007 ± 0.391
MetLeu: 1.422 ± 0.378
MetMet: 0.669 ± 0.252
MetAsn: 2.174 ± 0.608
MetPro: 0.335 ± 0.148
MetGln: 0.836 ± 0.213
MetArg: 1.004 ± 0.3
MetSer: 2.425 ± 0.534
MetThr: 2.007 ± 0.434
MetVal: 1.171 ± 0.24
MetTrp: 0.084 ± 0.091
MetTyr: 0.502 ± 0.232
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.353 ± 1.064
AsnCys: 0.335 ± 0.166
AsnAsp: 3.262 ± 0.62
AsnGlu: 4.349 ± 0.734
AsnPhe: 2.174 ± 0.413
AsnGly: 4.934 ± 0.803
AsnHis: 0.753 ± 0.28
AsnIle: 3.847 ± 0.566
AsnLys: 4.433 ± 0.728
AsnLeu: 5.185 ± 0.63
AsnMet: 1.338 ± 0.436
AsnAsn: 3.345 ± 0.448
AsnPro: 2.593 ± 0.56
AsnGln: 3.011 ± 0.526
AsnArg: 2.676 ± 0.494
AsnSer: 4.098 ± 0.725
AsnThr: 3.011 ± 0.496
AsnVal: 3.345 ± 0.555
AsnTrp: 0.669 ± 0.271
AsnTyr: 2.593 ± 0.571
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.007 ± 0.528
ProCys: 0.084 ± 0.113
ProAsp: 1.924 ± 0.381
ProGlu: 1.254 ± 0.309
ProPhe: 1.004 ± 0.378
ProGly: 0.669 ± 0.211
ProHis: 0.418 ± 0.195
ProIle: 2.593 ± 0.445
ProLys: 2.342 ± 0.459
ProLeu: 1.505 ± 0.309
ProMet: 0.418 ± 0.216
ProAsn: 1.254 ± 0.406
ProPro: 0.418 ± 0.226
ProGln: 1.505 ± 0.422
ProArg: 0.585 ± 0.209
ProSer: 1.84 ± 0.468
ProThr: 1.756 ± 0.443
ProVal: 2.007 ± 0.44
ProTrp: 0.167 ± 0.12
ProTyr: 0.753 ± 0.317
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.349 ± 0.989
GlnCys: 0.335 ± 0.185
GlnAsp: 1.589 ± 0.38
GlnGlu: 2.676 ± 0.58
GlnPhe: 1.254 ± 0.301
GlnGly: 1.84 ± 0.465
GlnHis: 0.335 ± 0.174
GlnIle: 2.676 ± 0.591
GlnLys: 4.182 ± 0.683
GlnLeu: 5.102 ± 0.76
GlnMet: 1.004 ± 0.253
GlnAsn: 2.76 ± 0.499
GlnPro: 0.753 ± 0.233
GlnGln: 1.505 ± 0.324
GlnArg: 2.342 ± 0.369
GlnSer: 3.429 ± 0.853
GlnThr: 2.593 ± 0.43
GlnVal: 1.422 ± 0.293
GlnTrp: 0.251 ± 0.139
GlnTyr: 1.505 ± 0.45
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.509 ± 0.388
ArgCys: 0.335 ± 0.168
ArgAsp: 1.254 ± 0.366
ArgGlu: 2.342 ± 0.527
ArgPhe: 1.422 ± 0.309
ArgGly: 2.342 ± 0.402
ArgHis: 0.669 ± 0.237
ArgIle: 4.014 ± 0.599
ArgLys: 3.513 ± 0.801
ArgLeu: 5.52 ± 0.71
ArgMet: 1.673 ± 0.523
ArgAsn: 3.429 ± 0.529
ArgPro: 0.836 ± 0.292
ArgGln: 1.171 ± 0.321
ArgArg: 1.422 ± 0.328
ArgSer: 1.924 ± 0.452
ArgThr: 2.509 ± 0.515
ArgVal: 2.844 ± 0.564
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.258 ± 0.525
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.44 ± 2.311
SerCys: 0.167 ± 0.152
SerAsp: 3.513 ± 0.5
SerGlu: 4.934 ± 0.92
SerPhe: 2.76 ± 0.517
SerGly: 4.934 ± 1.07
SerHis: 0.669 ± 0.331
SerIle: 5.269 ± 0.833
SerLys: 6.272 ± 0.861
SerLeu: 4.851 ± 0.841
SerMet: 1.004 ± 0.27
SerAsn: 4.265 ± 0.803
SerPro: 1.673 ± 0.351
SerGln: 2.844 ± 0.538
SerArg: 2.927 ± 0.419
SerSer: 6.272 ± 1.676
SerThr: 5.102 ± 0.85
SerVal: 4.433 ± 0.446
SerTrp: 0.585 ± 0.203
SerTyr: 3.178 ± 0.597
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.52 ± 1.015
ThrCys: 0.502 ± 0.197
ThrAsp: 3.68 ± 0.567
ThrGlu: 4.516 ± 0.764
ThrPhe: 2.425 ± 0.437
ThrGly: 4.265 ± 0.778
ThrHis: 0.753 ± 0.284
ThrIle: 4.851 ± 0.657
ThrLys: 4.098 ± 0.599
ThrLeu: 5.436 ± 0.772
ThrMet: 1.673 ± 0.416
ThrAsn: 3.763 ± 0.542
ThrPro: 1.254 ± 0.414
ThrGln: 4.265 ± 0.669
ThrArg: 1.924 ± 0.459
ThrSer: 4.851 ± 0.882
ThrThr: 5.687 ± 0.862
ThrVal: 5.102 ± 0.793
ThrTrp: 0.335 ± 0.199
ThrTyr: 2.676 ± 0.48
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.763 ± 0.499
ValCys: 0.418 ± 0.275
ValAsp: 4.683 ± 0.678
ValGlu: 4.433 ± 0.842
ValPhe: 1.756 ± 0.399
ValGly: 3.763 ± 0.612
ValHis: 0.92 ± 0.213
ValIle: 3.429 ± 0.586
ValLys: 6.189 ± 0.654
ValLeu: 3.345 ± 0.545
ValMet: 1.589 ± 0.328
ValAsn: 4.349 ± 0.654
ValPro: 1.673 ± 0.451
ValGln: 2.844 ± 0.549
ValArg: 2.342 ± 0.502
ValSer: 4.934 ± 0.952
ValThr: 4.516 ± 0.689
ValVal: 4.265 ± 0.667
ValTrp: 0.502 ± 0.176
ValTyr: 2.342 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.753 ± 0.22
TrpCys: 0.0 ± 0.0
TrpAsp: 0.585 ± 0.193
TrpGlu: 0.669 ± 0.206
TrpPhe: 0.167 ± 0.097
TrpGly: 0.585 ± 0.245
TrpHis: 0.335 ± 0.178
TrpIle: 0.836 ± 0.233
TrpLys: 0.585 ± 0.224
TrpLeu: 0.418 ± 0.214
TrpMet: 0.418 ± 0.174
TrpAsn: 0.585 ± 0.217
TrpPro: 0.084 ± 0.098
TrpGln: 0.335 ± 0.184
TrpArg: 0.251 ± 0.144
TrpSer: 0.669 ± 0.29
TrpThr: 0.418 ± 0.183
TrpVal: 0.335 ± 0.18
TrpTrp: 0.084 ± 0.091
TrpTyr: 0.084 ± 0.092
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.345 ± 0.62
TyrCys: 0.418 ± 0.238
TyrAsp: 3.429 ± 0.83
TyrGlu: 1.84 ± 0.37
TyrPhe: 1.505 ± 0.392
TyrGly: 2.007 ± 0.354
TyrHis: 1.338 ± 0.395
TyrIle: 2.676 ± 0.614
TyrLys: 3.178 ± 0.665
TyrLeu: 2.676 ± 0.536
TyrMet: 1.004 ± 0.253
TyrAsn: 2.509 ± 0.556
TyrPro: 1.004 ± 0.313
TyrGln: 1.338 ± 0.426
TyrArg: 2.258 ± 0.45
TyrSer: 2.007 ± 0.423
TyrThr: 3.011 ± 0.575
TyrVal: 1.673 ± 0.343
TyrTrp: 0.251 ± 0.114
TyrTyr: 1.924 ± 0.511
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11958 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
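A protein-level bootstrap of this kind might look roughly like the sketch below. The note only states that 100 bootstrap rounds were run at the protein level; everything else here is an assumption made for illustration, including the function names, the use of the sample standard deviation as the reported error, and the normalisation by the number of dipeptides in each resample.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (protein-level bootstrap).
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKAZ", "AAAA"]
mean_mk, err_mk = bootstrap_error(toy, "MK")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between residues of the same protein, which is one plausible reason why dipeptides concentrated in a few proteins show larger errors.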

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski