Amino acid dipepetide frequency for Streptococcus phage MM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.829AlaAla: 3.829 ± 1.42
0.469AlaCys: 0.469 ± 0.219
6.407AlaAsp: 6.407 ± 2.102
5.626AlaGlu: 5.626 ± 1.166
3.438AlaPhe: 3.438 ± 0.433
4.454AlaGly: 4.454 ± 0.618
0.547AlaHis: 0.547 ± 0.202
4.845AlaIle: 4.845 ± 0.93
5.079AlaLys: 5.079 ± 0.726
5.86AlaLeu: 5.86 ± 1.155
1.719AlaMet: 1.719 ± 0.643
3.829AlaAsn: 3.829 ± 0.566
1.563AlaPro: 1.563 ± 0.254
2.422AlaGln: 2.422 ± 0.723
3.204AlaArg: 3.204 ± 0.605
4.141AlaSer: 4.141 ± 0.737
4.688AlaThr: 4.688 ± 0.747
4.219AlaVal: 4.219 ± 0.807
0.547AlaTrp: 0.547 ± 0.215
3.047AlaTyr: 3.047 ± 0.946
0.0AlaXaa: 0.0 ± 0.0
Cys
0.156CysAla: 0.156 ± 0.106
0.078CysCys: 0.078 ± 0.076
0.078CysAsp: 0.078 ± 0.076
0.781CysGlu: 0.781 ± 0.264
0.469CysPhe: 0.469 ± 0.198
0.391CysGly: 0.391 ± 0.16
0.0CysHis: 0.0 ± 0.0
0.313CysIle: 0.313 ± 0.15
0.469CysLys: 0.469 ± 0.196
0.313CysLeu: 0.313 ± 0.167
0.234CysMet: 0.234 ± 0.165
0.0CysAsn: 0.0 ± 0.0
0.234CysPro: 0.234 ± 0.147
0.234CysGln: 0.234 ± 0.152
0.0CysArg: 0.0 ± 0.0
0.313CysSer: 0.313 ± 0.173
0.078CysThr: 0.078 ± 0.085
0.391CysVal: 0.391 ± 0.185
0.156CysTrp: 0.156 ± 0.117
0.156CysTyr: 0.156 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
3.438AspAla: 3.438 ± 0.519
0.234AspCys: 0.234 ± 0.144
3.672AspAsp: 3.672 ± 0.819
3.829AspGlu: 3.829 ± 0.676
4.219AspPhe: 4.219 ± 0.559
7.267AspGly: 7.267 ± 1.525
0.391AspHis: 0.391 ± 0.176
4.298AspIle: 4.298 ± 0.553
6.485AspLys: 6.485 ± 0.723
3.985AspLeu: 3.985 ± 0.604
1.406AspMet: 1.406 ± 0.355
3.751AspAsn: 3.751 ± 0.598
1.641AspPro: 1.641 ± 0.412
1.094AspGln: 1.094 ± 0.291
2.735AspArg: 2.735 ± 0.507
4.141AspSer: 4.141 ± 1.008
3.751AspThr: 3.751 ± 0.483
3.829AspVal: 3.829 ± 0.643
0.469AspTrp: 0.469 ± 0.221
4.454AspTyr: 4.454 ± 0.742
0.0AspXaa: 0.0 ± 0.0
Glu
3.751GluAla: 3.751 ± 0.597
0.234GluCys: 0.234 ± 0.181
4.063GluAsp: 4.063 ± 0.919
5.938GluGlu: 5.938 ± 1.025
2.657GluPhe: 2.657 ± 0.404
2.422GluGly: 2.422 ± 0.514
1.094GluHis: 1.094 ± 0.313
6.485GluIle: 6.485 ± 1.075
6.564GluLys: 6.564 ± 0.793
7.657GluLeu: 7.657 ± 1.076
2.344GluMet: 2.344 ± 0.472
3.672GluAsn: 3.672 ± 0.563
1.641GluPro: 1.641 ± 0.483
4.766GluGln: 4.766 ± 0.578
3.516GluArg: 3.516 ± 0.603
3.204GluSer: 3.204 ± 0.819
3.516GluThr: 3.516 ± 0.518
4.688GluVal: 4.688 ± 0.648
0.547GluTrp: 0.547 ± 0.188
2.657GluTyr: 2.657 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
2.188PheAla: 2.188 ± 0.41
0.234PheCys: 0.234 ± 0.138
4.141PheAsp: 4.141 ± 0.524
4.454PheGlu: 4.454 ± 0.828
1.641PhePhe: 1.641 ± 0.438
2.11PheGly: 2.11 ± 0.476
0.547PheHis: 0.547 ± 0.214
2.032PheIle: 2.032 ± 0.276
2.032PheLys: 2.032 ± 0.391
2.188PheLeu: 2.188 ± 0.473
0.938PheMet: 0.938 ± 0.337
3.204PheAsn: 3.204 ± 0.481
0.781PhePro: 0.781 ± 0.313
1.797PheGln: 1.797 ± 0.491
1.719PheArg: 1.719 ± 0.368
3.594PheSer: 3.594 ± 0.77
2.969PheThr: 2.969 ± 0.612
2.5PheVal: 2.5 ± 0.532
0.234PheTrp: 0.234 ± 0.146
1.641PheTyr: 1.641 ± 0.386
0.0PheXaa: 0.0 ± 0.0
Gly
5.626GlyAla: 5.626 ± 1.38
0.313GlyCys: 0.313 ± 0.192
2.969GlyAsp: 2.969 ± 0.599
4.219GlyGlu: 4.219 ± 1.032
3.907GlyPhe: 3.907 ± 0.789
5.548GlyGly: 5.548 ± 0.577
1.094GlyHis: 1.094 ± 0.347
4.923GlyIle: 4.923 ± 0.537
4.923GlyLys: 4.923 ± 0.546
6.017GlyLeu: 6.017 ± 1.064
2.266GlyMet: 2.266 ± 0.473
3.751GlyAsn: 3.751 ± 0.73
0.703GlyPro: 0.703 ± 0.295
2.188GlyGln: 2.188 ± 0.507
5.157GlyArg: 5.157 ± 1.566
5.157GlySer: 5.157 ± 1.07
4.923GlyThr: 4.923 ± 0.865
5.938GlyVal: 5.938 ± 0.797
0.938GlyTrp: 0.938 ± 0.467
1.953GlyTyr: 1.953 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
0.781HisAla: 0.781 ± 0.24
0.156HisCys: 0.156 ± 0.117
0.86HisAsp: 0.86 ± 0.328
0.86HisGlu: 0.86 ± 0.221
0.625HisPhe: 0.625 ± 0.199
0.781HisGly: 0.781 ± 0.297
0.391HisHis: 0.391 ± 0.164
1.328HisIle: 1.328 ± 0.385
0.703HisLys: 0.703 ± 0.265
1.016HisLeu: 1.016 ± 0.228
0.547HisMet: 0.547 ± 0.318
0.547HisAsn: 0.547 ± 0.23
0.703HisPro: 0.703 ± 0.287
0.469HisGln: 0.469 ± 0.163
0.547HisArg: 0.547 ± 0.182
1.485HisSer: 1.485 ± 0.395
0.938HisThr: 0.938 ± 0.247
0.547HisVal: 0.547 ± 0.242
0.234HisTrp: 0.234 ± 0.133
0.703HisTyr: 0.703 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
5.157IleAla: 5.157 ± 0.787
0.313IleCys: 0.313 ± 0.136
4.923IleAsp: 4.923 ± 0.559
6.876IleGlu: 6.876 ± 1.017
2.032IlePhe: 2.032 ± 0.507
5.157IleGly: 5.157 ± 0.869
0.86IleHis: 0.86 ± 0.221
3.985IleIle: 3.985 ± 0.741
6.72IleLys: 6.72 ± 0.937
3.594IleLeu: 3.594 ± 0.738
1.094IleMet: 1.094 ± 0.254
3.672IleAsn: 3.672 ± 0.455
2.735IlePro: 2.735 ± 0.432
2.5IleGln: 2.5 ± 0.371
1.797IleArg: 1.797 ± 0.529
4.376IleSer: 4.376 ± 0.454
3.829IleThr: 3.829 ± 0.435
3.438IleVal: 3.438 ± 0.551
0.547IleTrp: 0.547 ± 0.184
1.953IleTyr: 1.953 ± 0.412
0.0IleXaa: 0.0 ± 0.0
Lys
6.72LysAla: 6.72 ± 0.937
0.547LysCys: 0.547 ± 0.206
3.438LysAsp: 3.438 ± 0.698
5.391LysGlu: 5.391 ± 0.765
2.188LysPhe: 2.188 ± 0.361
7.267LysGly: 7.267 ± 1.591
1.172LysHis: 1.172 ± 0.356
5.313LysIle: 5.313 ± 1.035
7.11LysLys: 7.11 ± 1.059
5.782LysLeu: 5.782 ± 0.799
1.328LysMet: 1.328 ± 0.361
5.782LysAsn: 5.782 ± 0.681
1.953LysPro: 1.953 ± 0.354
3.751LysGln: 3.751 ± 0.768
2.735LysArg: 2.735 ± 0.435
5.626LysSer: 5.626 ± 0.692
5.626LysThr: 5.626 ± 0.624
5.626LysVal: 5.626 ± 0.554
1.485LysTrp: 1.485 ± 0.37
3.204LysTyr: 3.204 ± 0.571
0.0LysXaa: 0.0 ± 0.0
Leu
5.548LeuAla: 5.548 ± 1.058
0.156LeuCys: 0.156 ± 0.12
5.001LeuAsp: 5.001 ± 0.693
6.485LeuGlu: 6.485 ± 1.072
2.344LeuPhe: 2.344 ± 0.499
4.063LeuGly: 4.063 ± 0.747
1.172LeuHis: 1.172 ± 0.369
2.969LeuIle: 2.969 ± 0.617
7.657LeuLys: 7.657 ± 1.125
4.766LeuLeu: 4.766 ± 0.858
1.328LeuMet: 1.328 ± 0.341
4.845LeuAsn: 4.845 ± 0.779
2.5LeuPro: 2.5 ± 0.443
4.688LeuGln: 4.688 ± 0.943
2.735LeuArg: 2.735 ± 0.523
5.235LeuSer: 5.235 ± 0.712
5.391LeuThr: 5.391 ± 0.912
3.985LeuVal: 3.985 ± 0.83
0.703LeuTrp: 0.703 ± 0.211
2.266LeuTyr: 2.266 ± 0.343
0.0LeuXaa: 0.0 ± 0.0
Met
2.344MetAla: 2.344 ± 0.415
0.234MetCys: 0.234 ± 0.136
1.172MetAsp: 1.172 ± 0.285
1.563MetGlu: 1.563 ± 0.388
0.625MetPhe: 0.625 ± 0.224
0.781MetGly: 0.781 ± 0.274
0.156MetHis: 0.156 ± 0.112
1.406MetIle: 1.406 ± 0.336
1.953MetLys: 1.953 ± 0.28
1.328MetLeu: 1.328 ± 0.281
0.625MetMet: 0.625 ± 0.246
1.328MetAsn: 1.328 ± 0.353
0.86MetPro: 0.86 ± 0.243
1.406MetGln: 1.406 ± 0.365
1.485MetArg: 1.485 ± 0.45
1.016MetSer: 1.016 ± 0.283
2.344MetThr: 2.344 ± 0.452
0.938MetVal: 0.938 ± 0.32
0.313MetTrp: 0.313 ± 0.162
0.547MetTyr: 0.547 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
4.376AsnAla: 4.376 ± 0.584
0.391AsnCys: 0.391 ± 0.245
3.985AsnAsp: 3.985 ± 0.596
3.907AsnGlu: 3.907 ± 0.626
2.266AsnPhe: 2.266 ± 0.542
5.47AsnGly: 5.47 ± 1.008
0.86AsnHis: 0.86 ± 0.302
3.672AsnIle: 3.672 ± 0.5
4.766AsnLys: 4.766 ± 0.57
4.454AsnLeu: 4.454 ± 1.037
1.406AsnMet: 1.406 ± 0.316
4.063AsnAsn: 4.063 ± 0.709
2.813AsnPro: 2.813 ± 0.801
2.344AsnGln: 2.344 ± 0.551
1.953AsnArg: 1.953 ± 0.411
3.516AsnSer: 3.516 ± 0.702
2.969AsnThr: 2.969 ± 0.391
2.813AsnVal: 2.813 ± 0.554
0.86AsnTrp: 0.86 ± 0.244
1.953AsnTyr: 1.953 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
1.875ProAla: 1.875 ± 0.426
0.156ProCys: 0.156 ± 0.124
2.266ProAsp: 2.266 ± 0.607
1.797ProGlu: 1.797 ± 0.319
1.25ProPhe: 1.25 ± 0.383
1.406ProGly: 1.406 ± 0.587
0.234ProHis: 0.234 ± 0.176
1.797ProIle: 1.797 ± 0.371
2.422ProLys: 2.422 ± 0.482
1.641ProLeu: 1.641 ± 0.391
0.313ProMet: 0.313 ± 0.205
1.875ProAsn: 1.875 ± 0.524
0.313ProPro: 0.313 ± 0.156
1.172ProGln: 1.172 ± 0.362
1.328ProArg: 1.328 ± 0.318
1.875ProSer: 1.875 ± 0.344
1.875ProThr: 1.875 ± 0.507
1.875ProVal: 1.875 ± 0.317
0.156ProTrp: 0.156 ± 0.165
1.953ProTyr: 1.953 ± 0.522
0.0ProXaa: 0.0 ± 0.0
Gln
3.516GlnAla: 3.516 ± 0.568
0.078GlnCys: 0.078 ± 0.096
1.406GlnAsp: 1.406 ± 0.357
3.438GlnGlu: 3.438 ± 0.769
1.485GlnPhe: 1.485 ± 0.386
5.235GlnGly: 5.235 ± 1.811
0.391GlnHis: 0.391 ± 0.18
2.579GlnIle: 2.579 ± 0.426
3.751GlnLys: 3.751 ± 0.843
2.657GlnLeu: 2.657 ± 0.428
1.094GlnMet: 1.094 ± 0.342
2.032GlnAsn: 2.032 ± 0.499
1.172GlnPro: 1.172 ± 0.301
2.188GlnGln: 2.188 ± 0.58
2.266GlnArg: 2.266 ± 0.482
2.5GlnSer: 2.5 ± 0.568
2.266GlnThr: 2.266 ± 0.537
2.891GlnVal: 2.891 ± 0.626
0.469GlnTrp: 0.469 ± 0.198
2.11GlnTyr: 2.11 ± 0.422
0.0GlnXaa: 0.0 ± 0.0
Arg
3.36ArgAla: 3.36 ± 0.468
0.078ArgCys: 0.078 ± 0.084
2.5ArgAsp: 2.5 ± 0.425
2.266ArgGlu: 2.266 ± 0.569
2.032ArgPhe: 2.032 ± 0.36
4.063ArgGly: 4.063 ± 0.97
0.703ArgHis: 0.703 ± 0.276
2.344ArgIle: 2.344 ± 0.392
2.969ArgLys: 2.969 ± 0.483
3.672ArgLeu: 3.672 ± 0.566
0.86ArgMet: 0.86 ± 0.288
2.579ArgAsn: 2.579 ± 0.519
1.641ArgPro: 1.641 ± 0.54
1.641ArgGln: 1.641 ± 0.36
1.641ArgArg: 1.641 ± 0.322
1.485ArgSer: 1.485 ± 0.296
3.047ArgThr: 3.047 ± 0.882
2.5ArgVal: 2.5 ± 0.385
1.485ArgTrp: 1.485 ± 0.722
1.953ArgTyr: 1.953 ± 0.289
0.0ArgXaa: 0.0 ± 0.0
Ser
4.219SerAla: 4.219 ± 0.694
0.156SerCys: 0.156 ± 0.108
4.454SerAsp: 4.454 ± 0.647
3.985SerGlu: 3.985 ± 0.677
2.422SerPhe: 2.422 ± 0.527
3.829SerGly: 3.829 ± 0.969
1.328SerHis: 1.328 ± 0.341
3.125SerIle: 3.125 ± 0.489
3.829SerLys: 3.829 ± 0.564
4.923SerLeu: 4.923 ± 0.708
1.797SerMet: 1.797 ± 0.345
3.985SerAsn: 3.985 ± 0.601
1.406SerPro: 1.406 ± 0.344
3.438SerGln: 3.438 ± 0.657
2.11SerArg: 2.11 ± 0.352
4.063SerSer: 4.063 ± 0.967
6.017SerThr: 6.017 ± 1.291
4.298SerVal: 4.298 ± 0.869
0.938SerTrp: 0.938 ± 0.279
1.719SerTyr: 1.719 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
3.594ThrAla: 3.594 ± 1.107
0.391ThrCys: 0.391 ± 0.161
6.251ThrAsp: 6.251 ± 1.702
3.672ThrGlu: 3.672 ± 0.593
2.891ThrPhe: 2.891 ± 0.42
5.235ThrGly: 5.235 ± 0.664
0.938ThrHis: 0.938 ± 0.517
5.391ThrIle: 5.391 ± 0.542
5.001ThrLys: 5.001 ± 0.596
5.86ThrLeu: 5.86 ± 0.809
1.094ThrMet: 1.094 ± 0.321
4.141ThrAsn: 4.141 ± 0.736
2.735ThrPro: 2.735 ± 0.659
2.891ThrGln: 2.891 ± 0.714
2.813ThrArg: 2.813 ± 0.653
3.047ThrSer: 3.047 ± 0.523
5.235ThrThr: 5.235 ± 0.837
4.219ThrVal: 4.219 ± 0.66
1.094ThrTrp: 1.094 ± 0.324
2.422ThrTyr: 2.422 ± 0.786
0.0ThrXaa: 0.0 ± 0.0
Val
5.235ValAla: 5.235 ± 0.664
0.313ValCys: 0.313 ± 0.174
4.532ValAsp: 4.532 ± 0.64
3.829ValGlu: 3.829 ± 0.638
2.188ValPhe: 2.188 ± 0.429
4.219ValGly: 4.219 ± 0.564
1.406ValHis: 1.406 ± 0.436
4.766ValIle: 4.766 ± 0.655
5.391ValLys: 5.391 ± 0.676
3.516ValLeu: 3.516 ± 0.647
0.781ValMet: 0.781 ± 0.26
3.282ValAsn: 3.282 ± 0.521
0.703ValPro: 0.703 ± 0.257
2.032ValGln: 2.032 ± 0.471
1.953ValArg: 1.953 ± 0.37
4.141ValSer: 4.141 ± 0.634
5.079ValThr: 5.079 ± 0.534
4.063ValVal: 4.063 ± 0.679
0.781ValTrp: 0.781 ± 0.346
2.579ValTyr: 2.579 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
1.875TrpAla: 1.875 ± 0.457
0.0TrpCys: 0.0 ± 0.0
0.469TrpAsp: 0.469 ± 0.187
0.469TrpGlu: 0.469 ± 0.171
0.469TrpPhe: 0.469 ± 0.197
0.391TrpGly: 0.391 ± 0.169
0.391TrpHis: 0.391 ± 0.189
0.703TrpIle: 0.703 ± 0.28
1.094TrpLys: 1.094 ± 0.346
0.86TrpLeu: 0.86 ± 0.273
0.234TrpMet: 0.234 ± 0.157
0.625TrpAsn: 0.625 ± 0.169
0.156TrpPro: 0.156 ± 0.145
0.703TrpGln: 0.703 ± 0.248
0.547TrpArg: 0.547 ± 0.221
0.547TrpSer: 0.547 ± 0.148
1.172TrpThr: 1.172 ± 0.65
0.547TrpVal: 0.547 ± 0.208
0.234TrpTrp: 0.234 ± 0.115
1.016TrpTyr: 1.016 ± 0.585
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.344TyrAla: 2.344 ± 0.679
0.313TyrCys: 0.313 ± 0.196
2.5TyrAsp: 2.5 ± 0.541
2.032TyrGlu: 2.032 ± 0.504
1.797TyrPhe: 1.797 ± 0.373
2.422TyrGly: 2.422 ± 0.42
0.547TyrHis: 0.547 ± 0.248
3.282TyrIle: 3.282 ± 0.88
3.204TyrLys: 3.204 ± 0.687
3.751TyrLeu: 3.751 ± 0.593
0.938TyrMet: 0.938 ± 0.333
1.875TyrAsn: 1.875 ± 0.392
1.328TyrPro: 1.328 ± 0.338
1.797TyrGln: 1.797 ± 0.396
2.579TyrArg: 2.579 ± 0.517
2.579TyrSer: 2.579 ± 0.448
3.204TyrThr: 3.204 ± 1.113
1.485TyrVal: 1.485 ± 0.349
0.313TyrTrp: 0.313 ± 0.222
1.875TyrTyr: 1.875 ± 0.678
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski