Amino acid dipepetide frequency for Streptococcus phage Javan491

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.04AlaAla: 5.04 ± 0.988
0.0AlaCys: 0.0 ± 0.0
5.706AlaAsp: 5.706 ± 0.618
6.942AlaGlu: 6.942 ± 0.82
2.758AlaPhe: 2.758 ± 0.5
4.374AlaGly: 4.374 ± 0.981
0.951AlaHis: 0.951 ± 0.371
6.657AlaIle: 6.657 ± 1.025
8.273AlaLys: 8.273 ± 1.103
5.706AlaLeu: 5.706 ± 0.631
1.807AlaMet: 1.807 ± 0.439
4.945AlaAsn: 4.945 ± 0.699
0.571AlaPro: 0.571 ± 0.205
3.233AlaGln: 3.233 ± 0.409
2.568AlaArg: 2.568 ± 0.502
4.469AlaSer: 4.469 ± 0.731
4.089AlaThr: 4.089 ± 0.639
5.135AlaVal: 5.135 ± 0.677
1.141AlaTrp: 1.141 ± 0.368
2.187AlaTyr: 2.187 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.38CysAla: 0.38 ± 0.181
0.19CysCys: 0.19 ± 0.118
0.285CysAsp: 0.285 ± 0.154
0.666CysGlu: 0.666 ± 0.249
0.38CysPhe: 0.38 ± 0.148
0.666CysGly: 0.666 ± 0.269
0.095CysHis: 0.095 ± 0.103
0.095CysIle: 0.095 ± 0.084
0.666CysLys: 0.666 ± 0.226
0.571CysLeu: 0.571 ± 0.26
0.095CysMet: 0.095 ± 0.102
0.285CysAsn: 0.285 ± 0.15
0.0CysPro: 0.0 ± 0.0
0.19CysGln: 0.19 ± 0.14
0.285CysArg: 0.285 ± 0.188
0.475CysSer: 0.475 ± 0.212
0.285CysThr: 0.285 ± 0.204
0.475CysVal: 0.475 ± 0.216
0.095CysTrp: 0.095 ± 0.093
0.38CysTyr: 0.38 ± 0.244
0.0CysXaa: 0.0 ± 0.0
Asp
4.184AspAla: 4.184 ± 0.653
0.761AspCys: 0.761 ± 0.289
4.564AspAsp: 4.564 ± 0.644
4.755AspGlu: 4.755 ± 0.815
2.568AspPhe: 2.568 ± 0.423
5.135AspGly: 5.135 ± 0.734
0.666AspHis: 0.666 ± 0.232
4.564AspIle: 4.564 ± 0.577
5.325AspLys: 5.325 ± 0.783
5.991AspLeu: 5.991 ± 0.778
1.236AspMet: 1.236 ± 0.254
3.994AspAsn: 3.994 ± 0.482
2.472AspPro: 2.472 ± 0.563
1.141AspGln: 1.141 ± 0.259
2.853AspArg: 2.853 ± 0.614
3.233AspSer: 3.233 ± 0.517
3.994AspThr: 3.994 ± 0.879
3.709AspVal: 3.709 ± 0.502
1.141AspTrp: 1.141 ± 0.381
2.853AspTyr: 2.853 ± 0.51
0.0AspXaa: 0.0 ± 0.0
Glu
6.086GluAla: 6.086 ± 0.826
0.19GluCys: 0.19 ± 0.128
3.899GluAsp: 3.899 ± 0.677
4.564GluGlu: 4.564 ± 0.79
3.043GluPhe: 3.043 ± 0.586
3.518GluGly: 3.518 ± 0.6
1.426GluHis: 1.426 ± 0.317
5.135GluIle: 5.135 ± 0.671
7.417GluLys: 7.417 ± 0.841
8.273GluLeu: 8.273 ± 1.039
2.758GluMet: 2.758 ± 0.591
3.423GluAsn: 3.423 ± 0.569
1.997GluPro: 1.997 ± 0.495
3.804GluGln: 3.804 ± 0.763
2.472GluArg: 2.472 ± 0.499
3.043GluSer: 3.043 ± 0.521
4.089GluThr: 4.089 ± 0.614
4.85GluVal: 4.85 ± 1.059
1.046GluTrp: 1.046 ± 0.273
2.282GluTyr: 2.282 ± 0.626
0.0GluXaa: 0.0 ± 0.0
Phe
3.804PheAla: 3.804 ± 0.721
0.0PheCys: 0.0 ± 0.0
2.948PheAsp: 2.948 ± 0.402
3.994PheGlu: 3.994 ± 0.698
1.141PhePhe: 1.141 ± 0.321
2.853PheGly: 2.853 ± 0.427
0.38PheHis: 0.38 ± 0.179
2.187PheIle: 2.187 ± 0.429
3.614PheLys: 3.614 ± 0.491
1.902PheLeu: 1.902 ± 0.405
1.046PheMet: 1.046 ± 0.341
1.997PheAsn: 1.997 ± 0.457
0.761PhePro: 0.761 ± 0.286
0.761PheGln: 0.761 ± 0.259
1.617PheArg: 1.617 ± 0.363
2.758PheSer: 2.758 ± 0.595
1.902PheThr: 1.902 ± 0.421
2.282PheVal: 2.282 ± 0.467
0.38PheTrp: 0.38 ± 0.198
1.141PheTyr: 1.141 ± 0.266
0.0PheXaa: 0.0 ± 0.0
Gly
5.04GlyAla: 5.04 ± 0.988
0.19GlyCys: 0.19 ± 0.116
4.279GlyAsp: 4.279 ± 0.561
4.374GlyGlu: 4.374 ± 0.589
3.423GlyPhe: 3.423 ± 0.568
4.945GlyGly: 4.945 ± 0.841
1.141GlyHis: 1.141 ± 0.342
6.086GlyIle: 6.086 ± 0.83
4.85GlyLys: 4.85 ± 0.76
6.086GlyLeu: 6.086 ± 0.879
2.472GlyMet: 2.472 ± 0.448
3.899GlyAsn: 3.899 ± 0.551
0.856GlyPro: 0.856 ± 0.304
2.282GlyGln: 2.282 ± 0.553
1.712GlyArg: 1.712 ± 0.425
4.089GlySer: 4.089 ± 0.612
3.994GlyThr: 3.994 ± 0.635
2.853GlyVal: 2.853 ± 0.559
1.046GlyTrp: 1.046 ± 0.305
3.709GlyTyr: 3.709 ± 0.559
0.0GlyXaa: 0.0 ± 0.0
His
0.666HisAla: 0.666 ± 0.222
0.095HisCys: 0.095 ± 0.076
0.856HisAsp: 0.856 ± 0.26
1.807HisGlu: 1.807 ± 0.474
0.666HisPhe: 0.666 ± 0.203
0.951HisGly: 0.951 ± 0.287
0.19HisHis: 0.19 ± 0.122
0.666HisIle: 0.666 ± 0.197
0.571HisLys: 0.571 ± 0.216
1.617HisLeu: 1.617 ± 0.346
0.095HisMet: 0.095 ± 0.088
0.666HisAsn: 0.666 ± 0.265
0.666HisPro: 0.666 ± 0.286
0.571HisGln: 0.571 ± 0.186
0.856HisArg: 0.856 ± 0.263
0.571HisSer: 0.571 ± 0.196
0.856HisThr: 0.856 ± 0.25
0.666HisVal: 0.666 ± 0.254
0.19HisTrp: 0.19 ± 0.14
0.38HisTyr: 0.38 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.04IleAla: 5.04 ± 0.568
0.475IleCys: 0.475 ± 0.24
6.942IleAsp: 6.942 ± 0.704
4.374IleGlu: 4.374 ± 0.679
2.092IlePhe: 2.092 ± 0.431
4.374IleGly: 4.374 ± 1.065
1.141IleHis: 1.141 ± 0.288
3.994IleIle: 3.994 ± 0.562
8.368IleLys: 8.368 ± 1.157
3.899IleLeu: 3.899 ± 0.508
1.807IleMet: 1.807 ± 0.573
4.089IleAsn: 4.089 ± 0.787
1.902IlePro: 1.902 ± 0.564
2.282IleGln: 2.282 ± 0.459
2.377IleArg: 2.377 ± 0.378
5.135IleSer: 5.135 ± 1.126
4.755IleThr: 4.755 ± 0.675
4.279IleVal: 4.279 ± 0.736
0.38IleTrp: 0.38 ± 0.159
1.426IleTyr: 1.426 ± 0.394
0.0IleXaa: 0.0 ± 0.0
Lys
7.703LysAla: 7.703 ± 1.002
0.571LysCys: 0.571 ± 0.261
5.135LysAsp: 5.135 ± 0.716
7.703LysGlu: 7.703 ± 0.911
2.472LysPhe: 2.472 ± 0.387
5.325LysGly: 5.325 ± 0.651
1.236LysHis: 1.236 ± 0.348
5.135LysIle: 5.135 ± 0.874
9.509LysLys: 9.509 ± 1.304
6.466LysLeu: 6.466 ± 0.824
2.663LysMet: 2.663 ± 0.434
4.564LysAsn: 4.564 ± 0.713
3.328LysPro: 3.328 ± 0.51
4.469LysGln: 4.469 ± 0.83
4.469LysArg: 4.469 ± 0.682
6.752LysSer: 6.752 ± 0.794
5.135LysThr: 5.135 ± 0.717
6.847LysVal: 6.847 ± 0.727
1.141LysTrp: 1.141 ± 0.316
3.328LysTyr: 3.328 ± 0.465
0.0LysXaa: 0.0 ± 0.0
Leu
7.132LeuAla: 7.132 ± 0.78
0.666LeuCys: 0.666 ± 0.229
5.991LeuAsp: 5.991 ± 0.694
6.181LeuGlu: 6.181 ± 1.08
3.138LeuPhe: 3.138 ± 0.372
4.66LeuGly: 4.66 ± 0.82
0.951LeuHis: 0.951 ± 0.344
5.23LeuIle: 5.23 ± 0.747
9.224LeuLys: 9.224 ± 1.142
6.276LeuLeu: 6.276 ± 0.685
1.141LeuMet: 1.141 ± 0.321
4.374LeuAsn: 4.374 ± 0.585
2.377LeuPro: 2.377 ± 0.448
3.138LeuGln: 3.138 ± 0.577
3.709LeuArg: 3.709 ± 0.554
6.657LeuSer: 6.657 ± 1.048
4.564LeuThr: 4.564 ± 0.629
4.564LeuVal: 4.564 ± 0.622
0.666LeuTrp: 0.666 ± 0.285
2.472LeuTyr: 2.472 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
1.997MetAla: 1.997 ± 0.574
0.19MetCys: 0.19 ± 0.131
1.521MetAsp: 1.521 ± 0.449
1.617MetGlu: 1.617 ± 0.387
0.761MetPhe: 0.761 ± 0.237
2.092MetGly: 2.092 ± 0.577
0.475MetHis: 0.475 ± 0.19
1.521MetIle: 1.521 ± 0.378
1.521MetLys: 1.521 ± 0.259
1.617MetLeu: 1.617 ± 0.415
0.19MetMet: 0.19 ± 0.138
1.617MetAsn: 1.617 ± 0.308
0.951MetPro: 0.951 ± 0.28
1.141MetGln: 1.141 ± 0.31
0.951MetArg: 0.951 ± 0.298
2.187MetSer: 2.187 ± 0.361
2.092MetThr: 2.092 ± 0.467
1.521MetVal: 1.521 ± 0.384
0.285MetTrp: 0.285 ± 0.161
0.761MetTyr: 0.761 ± 0.315
0.0MetXaa: 0.0 ± 0.0
Asn
3.328AsnAla: 3.328 ± 0.646
0.38AsnCys: 0.38 ± 0.157
1.521AsnAsp: 1.521 ± 0.356
3.709AsnGlu: 3.709 ± 0.556
2.282AsnPhe: 2.282 ± 0.415
5.23AsnGly: 5.23 ± 0.712
0.951AsnHis: 0.951 ± 0.308
3.614AsnIle: 3.614 ± 0.604
4.564AsnLys: 4.564 ± 0.615
4.469AsnLeu: 4.469 ± 0.574
1.521AsnMet: 1.521 ± 0.362
2.377AsnAsn: 2.377 ± 0.574
1.997AsnPro: 1.997 ± 0.54
2.472AsnGln: 2.472 ± 0.451
2.568AsnArg: 2.568 ± 0.522
3.043AsnSer: 3.043 ± 0.549
2.472AsnThr: 2.472 ± 0.417
3.233AsnVal: 3.233 ± 0.528
0.38AsnTrp: 0.38 ± 0.173
2.853AsnTyr: 2.853 ± 0.546
0.0AsnXaa: 0.0 ± 0.0
Pro
1.712ProAla: 1.712 ± 0.425
0.19ProCys: 0.19 ± 0.134
2.377ProAsp: 2.377 ± 0.529
1.426ProGlu: 1.426 ± 0.348
1.046ProPhe: 1.046 ± 0.313
0.761ProGly: 0.761 ± 0.245
0.19ProHis: 0.19 ± 0.124
2.187ProIle: 2.187 ± 0.423
3.709ProLys: 3.709 ± 0.627
2.187ProLeu: 2.187 ± 0.472
0.571ProMet: 0.571 ± 0.193
1.141ProAsn: 1.141 ± 0.421
0.38ProPro: 0.38 ± 0.15
1.331ProGln: 1.331 ± 0.388
0.761ProArg: 0.761 ± 0.291
2.377ProSer: 2.377 ± 0.543
1.712ProThr: 1.712 ± 0.454
1.331ProVal: 1.331 ± 0.459
0.285ProTrp: 0.285 ± 0.154
0.856ProTyr: 0.856 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
2.948GlnAla: 2.948 ± 0.683
0.571GlnCys: 0.571 ± 0.228
1.617GlnAsp: 1.617 ± 0.336
3.233GlnGlu: 3.233 ± 0.869
1.046GlnPhe: 1.046 ± 0.308
2.663GlnGly: 2.663 ± 0.416
0.475GlnHis: 0.475 ± 0.194
3.233GlnIle: 3.233 ± 0.651
3.614GlnLys: 3.614 ± 0.783
3.614GlnLeu: 3.614 ± 0.632
0.38GlnMet: 0.38 ± 0.163
1.807GlnAsn: 1.807 ± 0.469
1.331GlnPro: 1.331 ± 0.386
1.521GlnGln: 1.521 ± 0.287
1.712GlnArg: 1.712 ± 0.412
2.663GlnSer: 2.663 ± 0.602
2.282GlnThr: 2.282 ± 0.747
2.377GlnVal: 2.377 ± 0.585
0.761GlnTrp: 0.761 ± 0.345
0.856GlnTyr: 0.856 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
2.948ArgAla: 2.948 ± 0.63
0.666ArgCys: 0.666 ± 0.265
2.282ArgAsp: 2.282 ± 0.556
3.233ArgGlu: 3.233 ± 0.543
1.141ArgPhe: 1.141 ± 0.322
1.807ArgGly: 1.807 ± 0.382
0.666ArgHis: 0.666 ± 0.246
3.233ArgIle: 3.233 ± 0.467
3.043ArgLys: 3.043 ± 0.567
3.994ArgLeu: 3.994 ± 0.5
0.856ArgMet: 0.856 ± 0.253
2.663ArgAsn: 2.663 ± 0.484
1.046ArgPro: 1.046 ± 0.277
1.331ArgGln: 1.331 ± 0.416
1.046ArgArg: 1.046 ± 0.369
1.997ArgSer: 1.997 ± 0.445
1.902ArgThr: 1.902 ± 0.423
2.092ArgVal: 2.092 ± 0.399
0.285ArgTrp: 0.285 ± 0.171
2.092ArgTyr: 2.092 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
4.755SerAla: 4.755 ± 1.472
0.571SerCys: 0.571 ± 0.256
4.184SerAsp: 4.184 ± 0.703
3.614SerGlu: 3.614 ± 0.514
3.328SerPhe: 3.328 ± 0.717
6.466SerGly: 6.466 ± 1.035
0.666SerHis: 0.666 ± 0.241
3.423SerIle: 3.423 ± 0.603
5.42SerLys: 5.42 ± 0.634
6.086SerLeu: 6.086 ± 0.752
1.521SerMet: 1.521 ± 0.342
3.233SerAsn: 3.233 ± 0.565
1.426SerPro: 1.426 ± 0.316
2.568SerGln: 2.568 ± 0.621
2.092SerArg: 2.092 ± 0.682
4.755SerSer: 4.755 ± 0.846
3.328SerThr: 3.328 ± 0.521
3.804SerVal: 3.804 ± 0.596
0.856SerTrp: 0.856 ± 0.334
3.043SerTyr: 3.043 ± 0.54
0.0SerXaa: 0.0 ± 0.0
Thr
5.515ThrAla: 5.515 ± 0.742
0.285ThrCys: 0.285 ± 0.167
3.614ThrAsp: 3.614 ± 0.692
3.138ThrGlu: 3.138 ± 0.589
2.282ThrPhe: 2.282 ± 0.321
4.66ThrGly: 4.66 ± 0.728
0.951ThrHis: 0.951 ± 0.254
4.755ThrIle: 4.755 ± 0.602
4.85ThrLys: 4.85 ± 0.592
5.42ThrLeu: 5.42 ± 0.874
1.521ThrMet: 1.521 ± 0.297
2.377ThrAsn: 2.377 ± 0.441
1.236ThrPro: 1.236 ± 0.289
2.187ThrGln: 2.187 ± 0.571
0.951ThrArg: 0.951 ± 0.239
4.184ThrSer: 4.184 ± 0.691
2.758ThrThr: 2.758 ± 0.448
4.66ThrVal: 4.66 ± 0.508
0.951ThrTrp: 0.951 ± 0.285
1.521ThrTyr: 1.521 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
3.994ValAla: 3.994 ± 0.529
0.38ValCys: 0.38 ± 0.179
3.804ValAsp: 3.804 ± 0.573
5.04ValGlu: 5.04 ± 0.72
2.092ValPhe: 2.092 ± 0.426
3.899ValGly: 3.899 ± 0.638
0.475ValHis: 0.475 ± 0.229
4.374ValIle: 4.374 ± 0.596
5.325ValLys: 5.325 ± 0.515
5.42ValLeu: 5.42 ± 0.783
1.712ValMet: 1.712 ± 0.314
3.138ValAsn: 3.138 ± 0.478
1.617ValPro: 1.617 ± 0.397
2.092ValGln: 2.092 ± 0.453
2.187ValArg: 2.187 ± 0.523
4.089ValSer: 4.089 ± 0.448
5.04ValThr: 5.04 ± 0.679
4.374ValVal: 4.374 ± 0.626
0.19ValTrp: 0.19 ± 0.126
2.092ValTyr: 2.092 ± 0.453
0.0ValXaa: 0.0 ± 0.0
Trp
1.141TrpAla: 1.141 ± 0.364
0.095TrpCys: 0.095 ± 0.088
0.285TrpAsp: 0.285 ± 0.147
0.856TrpGlu: 0.856 ± 0.279
0.19TrpPhe: 0.19 ± 0.126
1.141TrpGly: 1.141 ± 0.331
0.19TrpHis: 0.19 ± 0.168
0.761TrpIle: 0.761 ± 0.284
0.856TrpLys: 0.856 ± 0.295
0.856TrpLeu: 0.856 ± 0.388
0.38TrpMet: 0.38 ± 0.186
0.38TrpAsn: 0.38 ± 0.155
0.19TrpPro: 0.19 ± 0.137
0.571TrpGln: 0.571 ± 0.222
1.141TrpArg: 1.141 ± 0.346
0.856TrpSer: 0.856 ± 0.254
0.571TrpThr: 0.571 ± 0.181
0.38TrpVal: 0.38 ± 0.169
0.095TrpTrp: 0.095 ± 0.089
0.571TrpTyr: 0.571 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.138TyrAla: 3.138 ± 0.475
0.095TyrCys: 0.095 ± 0.093
3.328TyrAsp: 3.328 ± 0.654
2.092TyrGlu: 2.092 ± 0.539
1.617TyrPhe: 1.617 ± 0.404
1.807TyrGly: 1.807 ± 0.327
0.38TyrHis: 0.38 ± 0.219
2.377TyrIle: 2.377 ± 0.37
3.233TyrLys: 3.233 ± 0.558
2.663TyrLeu: 2.663 ± 0.554
1.141TyrMet: 1.141 ± 0.362
1.997TyrAsn: 1.997 ± 0.513
1.426TyrPro: 1.426 ± 0.375
1.617TyrGln: 1.617 ± 0.507
1.997TyrArg: 1.997 ± 0.56
1.807TyrSer: 1.807 ± 0.452
1.997TyrThr: 1.997 ± 0.404
1.902TyrVal: 1.902 ± 0.336
0.19TyrTrp: 0.19 ± 0.146
1.712TyrTyr: 1.712 ± 0.462
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10517 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski