Amino acid dipepetide frequency for Streptococcus phage Javan355

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.369AlaAla: 2.369 ± 0.414
0.254AlaCys: 0.254 ± 0.134
5.162AlaAsp: 5.162 ± 0.568
5.246AlaGlu: 5.246 ± 0.846
2.115AlaPhe: 2.115 ± 0.68
4.992AlaGly: 4.992 ± 0.849
1.692AlaHis: 1.692 ± 0.487
6.515AlaIle: 6.515 ± 0.946
6.262AlaLys: 6.262 ± 0.777
6.008AlaLeu: 6.008 ± 0.832
1.523AlaMet: 1.523 ± 0.342
4.315AlaAsn: 4.315 ± 0.565
3.469AlaPro: 3.469 ± 0.856
2.877AlaGln: 2.877 ± 0.503
3.046AlaArg: 3.046 ± 0.577
2.877AlaSer: 2.877 ± 0.551
3.723AlaThr: 3.723 ± 0.574
4.315AlaVal: 4.315 ± 0.957
1.1AlaTrp: 1.1 ± 0.269
1.946AlaTyr: 1.946 ± 0.441
0.0AlaXaa: 0.0 ± 0.0
Cys
0.254CysAla: 0.254 ± 0.139
0.085CysCys: 0.085 ± 0.086
0.254CysAsp: 0.254 ± 0.164
0.592CysGlu: 0.592 ± 0.21
0.338CysPhe: 0.338 ± 0.154
0.338CysGly: 0.338 ± 0.179
0.169CysHis: 0.169 ± 0.104
0.338CysIle: 0.338 ± 0.198
0.338CysLys: 0.338 ± 0.174
0.169CysLeu: 0.169 ± 0.119
0.085CysMet: 0.085 ± 0.092
0.677CysAsn: 0.677 ± 0.243
0.169CysPro: 0.169 ± 0.116
0.423CysGln: 0.423 ± 0.174
0.254CysArg: 0.254 ± 0.152
0.592CysSer: 0.592 ± 0.238
0.169CysThr: 0.169 ± 0.095
0.338CysVal: 0.338 ± 0.149
0.254CysTrp: 0.254 ± 0.148
0.254CysTyr: 0.254 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
4.146AspAla: 4.146 ± 0.606
0.762AspCys: 0.762 ± 0.274
3.554AspAsp: 3.554 ± 0.614
5.585AspGlu: 5.585 ± 0.71
3.469AspPhe: 3.469 ± 0.472
3.892AspGly: 3.892 ± 0.855
0.677AspHis: 0.677 ± 0.212
4.823AspIle: 4.823 ± 0.748
5.754AspLys: 5.754 ± 0.749
4.992AspLeu: 4.992 ± 0.612
1.946AspMet: 1.946 ± 0.567
3.046AspAsn: 3.046 ± 0.462
1.692AspPro: 1.692 ± 0.278
1.777AspGln: 1.777 ± 0.306
3.215AspArg: 3.215 ± 0.684
3.977AspSer: 3.977 ± 0.52
3.131AspThr: 3.131 ± 0.404
3.385AspVal: 3.385 ± 0.519
1.354AspTrp: 1.354 ± 0.485
3.046AspTyr: 3.046 ± 0.582
0.0AspXaa: 0.0 ± 0.0
Glu
5.077GluAla: 5.077 ± 0.851
0.677GluCys: 0.677 ± 0.261
3.215GluAsp: 3.215 ± 0.621
5.923GluGlu: 5.923 ± 0.9
2.369GluPhe: 2.369 ± 0.443
4.485GluGly: 4.485 ± 0.669
1.1GluHis: 1.1 ± 0.342
5.331GluIle: 5.331 ± 0.85
7.192GluLys: 7.192 ± 0.791
9.9GluLeu: 9.9 ± 1.015
2.369GluMet: 2.369 ± 0.548
4.569GluAsn: 4.569 ± 0.518
1.354GluPro: 1.354 ± 0.353
4.569GluGln: 4.569 ± 0.763
4.062GluArg: 4.062 ± 0.763
3.469GluSer: 3.469 ± 0.583
4.062GluThr: 4.062 ± 0.73
5.415GluVal: 5.415 ± 0.771
1.1GluTrp: 1.1 ± 0.226
2.031GluTyr: 2.031 ± 0.377
0.0GluXaa: 0.0 ± 0.0
Phe
2.962PheAla: 2.962 ± 0.532
0.254PheCys: 0.254 ± 0.145
3.469PheAsp: 3.469 ± 0.5
3.046PheGlu: 3.046 ± 0.461
1.354PhePhe: 1.354 ± 0.355
2.539PheGly: 2.539 ± 0.545
0.254PheHis: 0.254 ± 0.172
1.692PheIle: 1.692 ± 0.383
2.792PheLys: 2.792 ± 0.543
2.877PheLeu: 2.877 ± 0.567
0.846PheMet: 0.846 ± 0.374
2.877PheAsn: 2.877 ± 0.587
1.015PhePro: 1.015 ± 0.501
1.269PheGln: 1.269 ± 0.376
1.354PheArg: 1.354 ± 0.395
3.046PheSer: 3.046 ± 0.526
1.946PheThr: 1.946 ± 0.436
2.454PheVal: 2.454 ± 0.363
0.508PheTrp: 0.508 ± 0.181
1.269PheTyr: 1.269 ± 0.347
0.0PheXaa: 0.0 ± 0.0
Gly
3.215GlyAla: 3.215 ± 0.535
0.169GlyCys: 0.169 ± 0.123
4.146GlyAsp: 4.146 ± 0.715
4.992GlyGlu: 4.992 ± 0.682
2.539GlyPhe: 2.539 ± 0.716
4.146GlyGly: 4.146 ± 1.261
0.931GlyHis: 0.931 ± 0.246
4.485GlyIle: 4.485 ± 0.698
5.754GlyLys: 5.754 ± 0.841
5.585GlyLeu: 5.585 ± 0.802
2.454GlyMet: 2.454 ± 0.359
2.962GlyAsn: 2.962 ± 0.566
0.846GlyPro: 0.846 ± 0.277
3.385GlyGln: 3.385 ± 0.574
2.454GlyArg: 2.454 ± 0.398
3.892GlySer: 3.892 ± 0.794
2.623GlyThr: 2.623 ± 0.427
3.469GlyVal: 3.469 ± 0.649
1.269GlyTrp: 1.269 ± 0.429
3.469GlyTyr: 3.469 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
1.438HisAla: 1.438 ± 0.44
0.085HisCys: 0.085 ± 0.088
1.015HisAsp: 1.015 ± 0.33
1.185HisGlu: 1.185 ± 0.331
1.015HisPhe: 1.015 ± 0.307
1.1HisGly: 1.1 ± 0.348
0.508HisHis: 0.508 ± 0.211
1.269HisIle: 1.269 ± 0.338
1.015HisLys: 1.015 ± 0.231
1.1HisLeu: 1.1 ± 0.32
0.085HisMet: 0.085 ± 0.097
0.931HisAsn: 0.931 ± 0.31
0.254HisPro: 0.254 ± 0.146
0.423HisGln: 0.423 ± 0.182
0.677HisArg: 0.677 ± 0.248
1.185HisSer: 1.185 ± 0.35
1.015HisThr: 1.015 ± 0.308
0.508HisVal: 0.508 ± 0.29
0.254HisTrp: 0.254 ± 0.134
0.677HisTyr: 0.677 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
5.754IleAla: 5.754 ± 0.831
0.508IleCys: 0.508 ± 0.176
4.739IleAsp: 4.739 ± 0.777
5.754IleGlu: 5.754 ± 0.578
2.623IlePhe: 2.623 ± 0.625
4.062IleGly: 4.062 ± 1.069
1.185IleHis: 1.185 ± 0.332
4.654IleIle: 4.654 ± 0.584
6.769IleLys: 6.769 ± 0.824
4.4IleLeu: 4.4 ± 0.765
1.523IleMet: 1.523 ± 0.396
3.808IleAsn: 3.808 ± 0.673
1.777IlePro: 1.777 ± 0.312
2.285IleGln: 2.285 ± 0.4
2.962IleArg: 2.962 ± 0.442
4.231IleSer: 4.231 ± 0.645
4.739IleThr: 4.739 ± 0.862
3.808IleVal: 3.808 ± 0.682
0.592IleTrp: 0.592 ± 0.253
2.539IleTyr: 2.539 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
5.5LysAla: 5.5 ± 0.828
0.169LysCys: 0.169 ± 0.095
5.246LysAsp: 5.246 ± 0.549
7.616LysGlu: 7.616 ± 0.952
2.792LysPhe: 2.792 ± 0.532
5.5LysGly: 5.5 ± 0.567
1.1LysHis: 1.1 ± 0.362
5.754LysIle: 5.754 ± 0.839
7.192LysLys: 7.192 ± 1.012
6.515LysLeu: 6.515 ± 0.786
2.708LysMet: 2.708 ± 0.503
4.739LysAsn: 4.739 ± 0.796
2.792LysPro: 2.792 ± 0.57
3.215LysGln: 3.215 ± 0.519
3.639LysArg: 3.639 ± 0.71
4.908LysSer: 4.908 ± 0.87
5.415LysThr: 5.415 ± 0.8
5.754LysVal: 5.754 ± 0.79
1.438LysTrp: 1.438 ± 0.318
3.131LysTyr: 3.131 ± 0.565
0.0LysXaa: 0.0 ± 0.0
Leu
6.854LeuAla: 6.854 ± 0.922
0.592LeuCys: 0.592 ± 0.325
6.177LeuAsp: 6.177 ± 0.703
7.7LeuGlu: 7.7 ± 0.978
2.877LeuPhe: 2.877 ± 0.446
5.162LeuGly: 5.162 ± 1.094
1.185LeuHis: 1.185 ± 0.307
4.823LeuIle: 4.823 ± 0.658
6.346LeuLys: 6.346 ± 0.819
7.616LeuLeu: 7.616 ± 0.862
1.862LeuMet: 1.862 ± 0.507
3.892LeuAsn: 3.892 ± 0.506
4.146LeuPro: 4.146 ± 0.653
3.892LeuGln: 3.892 ± 0.703
4.062LeuArg: 4.062 ± 0.595
4.146LeuSer: 4.146 ± 0.561
5.162LeuThr: 5.162 ± 0.621
4.4LeuVal: 4.4 ± 0.574
0.592LeuTrp: 0.592 ± 0.203
3.046LeuTyr: 3.046 ± 0.53
0.0LeuXaa: 0.0 ± 0.0
Met
2.623MetAla: 2.623 ± 0.377
0.0MetCys: 0.0 ± 0.0
1.692MetAsp: 1.692 ± 0.417
1.523MetGlu: 1.523 ± 0.413
1.1MetPhe: 1.1 ± 0.378
1.354MetGly: 1.354 ± 0.401
0.508MetHis: 0.508 ± 0.179
1.608MetIle: 1.608 ± 0.573
1.692MetLys: 1.692 ± 0.532
2.031MetLeu: 2.031 ± 0.504
0.169MetMet: 0.169 ± 0.123
1.438MetAsn: 1.438 ± 0.365
1.015MetPro: 1.015 ± 0.4
0.931MetGln: 0.931 ± 0.286
1.438MetArg: 1.438 ± 0.359
2.369MetSer: 2.369 ± 0.387
1.354MetThr: 1.354 ± 0.336
1.185MetVal: 1.185 ± 0.33
0.254MetTrp: 0.254 ± 0.146
0.846MetTyr: 0.846 ± 0.269
0.0MetXaa: 0.0 ± 0.0
Asn
4.4AsnAla: 4.4 ± 0.702
0.423AsnCys: 0.423 ± 0.223
3.131AsnAsp: 3.131 ± 0.547
3.385AsnGlu: 3.385 ± 0.588
1.438AsnPhe: 1.438 ± 0.402
3.892AsnGly: 3.892 ± 0.646
0.762AsnHis: 0.762 ± 0.344
3.469AsnIle: 3.469 ± 0.662
4.654AsnLys: 4.654 ± 0.559
4.231AsnLeu: 4.231 ± 0.659
1.692AsnMet: 1.692 ± 0.347
2.539AsnAsn: 2.539 ± 0.493
2.031AsnPro: 2.031 ± 0.413
2.792AsnGln: 2.792 ± 0.404
2.454AsnArg: 2.454 ± 0.525
2.454AsnSer: 2.454 ± 0.479
2.792AsnThr: 2.792 ± 0.379
3.554AsnVal: 3.554 ± 0.543
0.931AsnTrp: 0.931 ± 0.257
2.285AsnTyr: 2.285 ± 0.414
0.0AsnXaa: 0.0 ± 0.0
Pro
2.454ProAla: 2.454 ± 0.656
0.338ProCys: 0.338 ± 0.185
2.792ProAsp: 2.792 ± 0.618
2.792ProGlu: 2.792 ± 0.536
1.1ProPhe: 1.1 ± 0.321
1.608ProGly: 1.608 ± 0.358
0.085ProHis: 0.085 ± 0.098
1.692ProIle: 1.692 ± 0.324
2.2ProLys: 2.2 ± 0.446
2.708ProLeu: 2.708 ± 0.572
0.677ProMet: 0.677 ± 0.254
1.354ProAsn: 1.354 ± 0.284
0.931ProPro: 0.931 ± 0.326
1.777ProGln: 1.777 ± 0.523
1.354ProArg: 1.354 ± 0.404
2.115ProSer: 2.115 ± 0.497
1.862ProThr: 1.862 ± 0.542
1.438ProVal: 1.438 ± 0.414
0.592ProTrp: 0.592 ± 0.23
1.862ProTyr: 1.862 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
3.892GlnAla: 3.892 ± 0.591
0.169GlnCys: 0.169 ± 0.126
1.354GlnAsp: 1.354 ± 0.349
4.231GlnGlu: 4.231 ± 0.56
1.946GlnPhe: 1.946 ± 0.366
2.369GlnGly: 2.369 ± 0.391
0.423GlnHis: 0.423 ± 0.163
3.977GlnIle: 3.977 ± 0.604
4.146GlnLys: 4.146 ± 0.7
3.3GlnLeu: 3.3 ± 0.518
1.523GlnMet: 1.523 ± 0.33
2.539GlnAsn: 2.539 ± 0.503
1.354GlnPro: 1.354 ± 0.51
2.708GlnGln: 2.708 ± 0.584
1.862GlnArg: 1.862 ± 0.422
2.539GlnSer: 2.539 ± 0.476
2.031GlnThr: 2.031 ± 0.425
2.115GlnVal: 2.115 ± 0.341
0.169GlnTrp: 0.169 ± 0.112
1.523GlnTyr: 1.523 ± 0.305
0.0GlnXaa: 0.0 ± 0.0
Arg
2.454ArgAla: 2.454 ± 0.541
0.085ArgCys: 0.085 ± 0.086
2.285ArgAsp: 2.285 ± 0.393
2.792ArgGlu: 2.792 ± 0.566
1.692ArgPhe: 1.692 ± 0.33
2.623ArgGly: 2.623 ± 0.478
1.015ArgHis: 1.015 ± 0.297
3.131ArgIle: 3.131 ± 0.445
3.385ArgLys: 3.385 ± 0.65
4.654ArgLeu: 4.654 ± 0.71
1.438ArgMet: 1.438 ± 0.493
2.454ArgAsn: 2.454 ± 0.598
1.354ArgPro: 1.354 ± 0.414
1.354ArgGln: 1.354 ± 0.305
1.608ArgArg: 1.608 ± 0.467
3.131ArgSer: 3.131 ± 0.601
2.2ArgThr: 2.2 ± 0.428
3.215ArgVal: 3.215 ± 0.539
0.423ArgTrp: 0.423 ± 0.191
1.946ArgTyr: 1.946 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
3.977SerAla: 3.977 ± 0.76
0.254SerCys: 0.254 ± 0.181
4.823SerAsp: 4.823 ± 0.787
4.062SerGlu: 4.062 ± 0.692
1.862SerPhe: 1.862 ± 0.38
3.808SerGly: 3.808 ± 0.744
1.185SerHis: 1.185 ± 0.391
4.146SerIle: 4.146 ± 0.535
5.585SerLys: 5.585 ± 0.796
4.654SerLeu: 4.654 ± 0.719
1.015SerMet: 1.015 ± 0.323
2.877SerAsn: 2.877 ± 0.496
1.354SerPro: 1.354 ± 0.37
3.3SerGln: 3.3 ± 0.544
2.285SerArg: 2.285 ± 0.486
3.723SerSer: 3.723 ± 0.663
4.231SerThr: 4.231 ± 0.644
3.3SerVal: 3.3 ± 0.508
1.015SerTrp: 1.015 ± 0.241
2.877SerTyr: 2.877 ± 0.691
0.0SerXaa: 0.0 ± 0.0
Thr
4.146ThrAla: 4.146 ± 0.526
0.169ThrCys: 0.169 ± 0.167
2.962ThrAsp: 2.962 ± 0.411
4.146ThrGlu: 4.146 ± 0.567
2.539ThrPhe: 2.539 ± 0.704
4.4ThrGly: 4.4 ± 0.609
1.1ThrHis: 1.1 ± 0.27
4.569ThrIle: 4.569 ± 0.644
3.639ThrLys: 3.639 ± 0.583
4.654ThrLeu: 4.654 ± 0.721
0.931ThrMet: 0.931 ± 0.297
2.369ThrAsn: 2.369 ± 0.418
2.115ThrPro: 2.115 ± 0.45
2.623ThrGln: 2.623 ± 0.527
1.608ThrArg: 1.608 ± 0.368
3.639ThrSer: 3.639 ± 0.619
2.708ThrThr: 2.708 ± 0.606
4.485ThrVal: 4.485 ± 0.656
0.762ThrTrp: 0.762 ± 0.253
1.777ThrTyr: 1.777 ± 0.462
0.0ThrXaa: 0.0 ± 0.0
Val
4.992ValAla: 4.992 ± 0.894
0.423ValCys: 0.423 ± 0.19
3.977ValAsp: 3.977 ± 0.523
4.823ValGlu: 4.823 ± 0.715
1.692ValPhe: 1.692 ± 0.458
4.315ValGly: 4.315 ± 0.723
0.846ValHis: 0.846 ± 0.28
3.554ValIle: 3.554 ± 0.484
4.908ValLys: 4.908 ± 0.598
5.246ValLeu: 5.246 ± 0.886
1.438ValMet: 1.438 ± 0.468
3.046ValAsn: 3.046 ± 0.617
1.862ValPro: 1.862 ± 0.393
2.031ValGln: 2.031 ± 0.474
2.2ValArg: 2.2 ± 0.408
4.908ValSer: 4.908 ± 0.699
3.554ValThr: 3.554 ± 0.524
3.808ValVal: 3.808 ± 0.724
0.592ValTrp: 0.592 ± 0.199
2.115ValTyr: 2.115 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
1.015TrpAla: 1.015 ± 0.337
0.169TrpCys: 0.169 ± 0.141
1.438TrpAsp: 1.438 ± 0.492
0.423TrpGlu: 0.423 ± 0.182
1.185TrpPhe: 1.185 ± 0.372
0.677TrpGly: 0.677 ± 0.222
0.254TrpHis: 0.254 ± 0.15
0.677TrpIle: 0.677 ± 0.231
1.354TrpLys: 1.354 ± 0.408
1.269TrpLeu: 1.269 ± 0.339
0.254TrpMet: 0.254 ± 0.154
0.931TrpAsn: 0.931 ± 0.359
0.338TrpPro: 0.338 ± 0.225
0.423TrpGln: 0.423 ± 0.195
0.762TrpArg: 0.762 ± 0.206
0.254TrpSer: 0.254 ± 0.131
0.677TrpThr: 0.677 ± 0.25
1.1TrpVal: 1.1 ± 0.357
0.0TrpTrp: 0.0 ± 0.0
0.846TrpTyr: 0.846 ± 0.391
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.285TyrAla: 2.285 ± 0.488
0.508TyrCys: 0.508 ± 0.169
2.792TyrAsp: 2.792 ± 0.523
2.454TyrGlu: 2.454 ± 0.52
1.777TyrPhe: 1.777 ± 0.408
1.523TyrGly: 1.523 ± 0.317
0.762TyrHis: 0.762 ± 0.276
2.2TyrIle: 2.2 ± 0.47
4.231TyrLys: 4.231 ± 0.642
2.623TyrLeu: 2.623 ± 0.423
0.508TyrMet: 0.508 ± 0.283
1.946TyrAsn: 1.946 ± 0.48
1.862TyrPro: 1.862 ± 0.35
2.115TyrGln: 2.115 ± 0.384
1.946TyrArg: 1.946 ± 0.381
2.877TyrSer: 2.877 ± 0.626
1.946TyrThr: 1.946 ± 0.461
2.285TyrVal: 2.285 ± 0.432
0.846TyrTrp: 0.846 ± 0.212
1.777TyrTyr: 1.777 ± 0.816
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11819 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski