Amino acid dipepetide frequency for Streptococcus phage Javan35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.604AlaAla: 2.604 ± 0.744
0.09AlaCys: 0.09 ± 0.087
4.22AlaAsp: 4.22 ± 0.515
4.489AlaGlu: 4.489 ± 0.674
2.065AlaPhe: 2.065 ± 0.454
4.22AlaGly: 4.22 ± 0.861
0.988AlaHis: 0.988 ± 0.328
6.195AlaIle: 6.195 ± 0.848
6.105AlaLys: 6.105 ± 0.695
4.669AlaLeu: 4.669 ± 0.575
1.167AlaMet: 1.167 ± 0.331
4.22AlaAsn: 4.22 ± 0.597
1.526AlaPro: 1.526 ± 0.434
2.604AlaGln: 2.604 ± 0.411
2.873AlaArg: 2.873 ± 0.516
3.771AlaSer: 3.771 ± 0.598
4.31AlaThr: 4.31 ± 0.709
3.322AlaVal: 3.322 ± 0.712
0.808AlaTrp: 0.808 ± 0.237
2.693AlaTyr: 2.693 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.359CysAla: 0.359 ± 0.197
0.18CysCys: 0.18 ± 0.128
0.718CysAsp: 0.718 ± 0.322
0.269CysGlu: 0.269 ± 0.139
0.18CysPhe: 0.18 ± 0.129
0.628CysGly: 0.628 ± 0.238
0.18CysHis: 0.18 ± 0.13
0.449CysIle: 0.449 ± 0.228
0.269CysLys: 0.269 ± 0.184
0.808CysLeu: 0.808 ± 0.283
0.09CysMet: 0.09 ± 0.084
0.359CysAsn: 0.359 ± 0.163
0.18CysPro: 0.18 ± 0.113
0.269CysGln: 0.269 ± 0.162
0.18CysArg: 0.18 ± 0.133
0.359CysSer: 0.359 ± 0.173
0.359CysThr: 0.359 ± 0.223
0.539CysVal: 0.539 ± 0.229
0.0CysTrp: 0.0 ± 0.0
0.449CysTyr: 0.449 ± 0.28
0.0CysXaa: 0.0 ± 0.0
Asp
2.604AspAla: 2.604 ± 0.497
0.628AspCys: 0.628 ± 0.221
3.95AspAsp: 3.95 ± 0.603
4.399AspGlu: 4.399 ± 0.561
3.053AspPhe: 3.053 ± 0.386
5.297AspGly: 5.297 ± 0.806
0.808AspHis: 0.808 ± 0.25
5.656AspIle: 5.656 ± 0.661
6.195AspLys: 6.195 ± 0.902
5.207AspLeu: 5.207 ± 0.785
1.347AspMet: 1.347 ± 0.302
4.758AspAsn: 4.758 ± 0.622
1.526AspPro: 1.526 ± 0.359
1.616AspGln: 1.616 ± 0.406
2.514AspArg: 2.514 ± 0.388
3.591AspSer: 3.591 ± 0.757
2.783AspThr: 2.783 ± 0.454
3.412AspVal: 3.412 ± 0.653
0.808AspTrp: 0.808 ± 0.288
3.681AspTyr: 3.681 ± 0.726
0.0AspXaa: 0.0 ± 0.0
Glu
5.118GluAla: 5.118 ± 0.85
0.449GluCys: 0.449 ± 0.228
3.861GluAsp: 3.861 ± 0.575
5.656GluGlu: 5.656 ± 0.805
2.873GluPhe: 2.873 ± 0.39
1.616GluGly: 1.616 ± 0.28
1.347GluHis: 1.347 ± 0.422
5.836GluIle: 5.836 ± 0.698
6.734GluLys: 6.734 ± 1.275
6.375GluLeu: 6.375 ± 0.903
1.975GluMet: 1.975 ± 0.465
4.31GluAsn: 4.31 ± 0.709
1.437GluPro: 1.437 ± 0.381
3.142GluGln: 3.142 ± 0.61
3.142GluArg: 3.142 ± 0.549
4.31GluSer: 4.31 ± 0.716
4.489GluThr: 4.489 ± 0.688
3.95GluVal: 3.95 ± 0.675
1.257GluTrp: 1.257 ± 0.364
2.604GluTyr: 2.604 ± 0.455
0.0GluXaa: 0.0 ± 0.0
Phe
2.873PheAla: 2.873 ± 0.425
0.09PheCys: 0.09 ± 0.1
3.681PheAsp: 3.681 ± 0.621
2.963PheGlu: 2.963 ± 0.669
1.257PhePhe: 1.257 ± 0.286
2.963PheGly: 2.963 ± 0.468
0.269PheHis: 0.269 ± 0.155
3.053PheIle: 3.053 ± 0.583
3.502PheLys: 3.502 ± 0.522
2.245PheLeu: 2.245 ± 0.376
1.257PheMet: 1.257 ± 0.411
2.155PheAsn: 2.155 ± 0.471
0.808PhePro: 0.808 ± 0.342
0.808PheGln: 0.808 ± 0.254
1.616PheArg: 1.616 ± 0.409
2.604PheSer: 2.604 ± 0.486
1.975PheThr: 1.975 ± 0.384
3.95PheVal: 3.95 ± 0.568
0.359PheTrp: 0.359 ± 0.135
1.167PheTyr: 1.167 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
4.22GlyAla: 4.22 ± 0.9
0.269GlyCys: 0.269 ± 0.204
3.861GlyAsp: 3.861 ± 0.496
3.681GlyGlu: 3.681 ± 0.697
3.771GlyPhe: 3.771 ± 0.723
3.502GlyGly: 3.502 ± 0.719
0.808GlyHis: 0.808 ± 0.233
5.118GlyIle: 5.118 ± 0.745
5.926GlyLys: 5.926 ± 0.944
4.04GlyLeu: 4.04 ± 0.491
2.245GlyMet: 2.245 ± 0.52
3.95GlyAsn: 3.95 ± 0.8
1.347GlyPro: 1.347 ± 0.464
1.885GlyGln: 1.885 ± 0.423
2.334GlyArg: 2.334 ± 0.504
3.412GlySer: 3.412 ± 0.684
3.861GlyThr: 3.861 ± 0.815
3.502GlyVal: 3.502 ± 0.561
1.347GlyTrp: 1.347 ± 0.326
3.591GlyTyr: 3.591 ± 0.516
0.0GlyXaa: 0.0 ± 0.0
His
0.628HisAla: 0.628 ± 0.215
0.359HisCys: 0.359 ± 0.189
0.628HisAsp: 0.628 ± 0.231
0.808HisGlu: 0.808 ± 0.305
0.628HisPhe: 0.628 ± 0.305
0.628HisGly: 0.628 ± 0.192
0.18HisHis: 0.18 ± 0.192
1.257HisIle: 1.257 ± 0.375
1.257HisLys: 1.257 ± 0.266
1.526HisLeu: 1.526 ± 0.594
0.269HisMet: 0.269 ± 0.15
0.988HisAsn: 0.988 ± 0.309
0.449HisPro: 0.449 ± 0.239
0.718HisGln: 0.718 ± 0.246
0.718HisArg: 0.718 ± 0.275
0.539HisSer: 0.539 ± 0.303
1.257HisThr: 1.257 ± 0.353
1.167HisVal: 1.167 ± 0.299
0.269HisTrp: 0.269 ± 0.143
0.898HisTyr: 0.898 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
5.387IleAla: 5.387 ± 0.713
0.359IleCys: 0.359 ± 0.211
5.656IleAsp: 5.656 ± 0.633
5.926IleGlu: 5.926 ± 1.139
2.424IlePhe: 2.424 ± 0.589
3.861IleGly: 3.861 ± 0.705
1.257IleHis: 1.257 ± 0.311
4.22IleIle: 4.22 ± 0.604
6.015IleLys: 6.015 ± 0.764
5.387IleLeu: 5.387 ± 0.687
1.706IleMet: 1.706 ± 0.325
5.118IleAsn: 5.118 ± 0.766
2.783IlePro: 2.783 ± 0.517
3.142IleGln: 3.142 ± 0.485
2.783IleArg: 2.783 ± 0.624
6.464IleSer: 6.464 ± 0.896
4.848IleThr: 4.848 ± 0.715
4.13IleVal: 4.13 ± 0.767
0.808IleTrp: 0.808 ± 0.284
2.604IleTyr: 2.604 ± 0.495
0.0IleXaa: 0.0 ± 0.0
Lys
6.015LysAla: 6.015 ± 0.975
0.628LysCys: 0.628 ± 0.286
5.297LysAsp: 5.297 ± 0.727
5.567LysGlu: 5.567 ± 1.026
2.155LysPhe: 2.155 ± 0.362
5.207LysGly: 5.207 ± 0.653
1.257LysHis: 1.257 ± 0.38
5.297LysIle: 5.297 ± 0.914
7.721LysLys: 7.721 ± 1.141
6.375LysLeu: 6.375 ± 0.686
2.873LysMet: 2.873 ± 0.726
6.015LysAsn: 6.015 ± 0.688
3.142LysPro: 3.142 ± 0.513
3.861LysGln: 3.861 ± 0.647
4.489LysArg: 4.489 ± 0.598
5.118LysSer: 5.118 ± 0.901
6.015LysThr: 6.015 ± 0.677
6.105LysVal: 6.105 ± 0.771
1.347LysTrp: 1.347 ± 0.406
4.22LysTyr: 4.22 ± 0.489
0.0LysXaa: 0.0 ± 0.0
Leu
5.567LeuAla: 5.567 ± 0.722
0.269LeuCys: 0.269 ± 0.143
4.579LeuAsp: 4.579 ± 0.679
6.554LeuGlu: 6.554 ± 0.836
2.963LeuPhe: 2.963 ± 0.451
4.579LeuGly: 4.579 ± 0.897
0.988LeuHis: 0.988 ± 0.295
6.375LeuIle: 6.375 ± 0.785
9.068LeuLys: 9.068 ± 1.133
6.734LeuLeu: 6.734 ± 0.876
2.424LeuMet: 2.424 ± 0.406
4.758LeuAsn: 4.758 ± 0.56
2.604LeuPro: 2.604 ± 0.49
3.232LeuGln: 3.232 ± 0.516
3.232LeuArg: 3.232 ± 0.499
5.387LeuSer: 5.387 ± 0.672
5.207LeuThr: 5.207 ± 0.794
4.399LeuVal: 4.399 ± 0.64
0.898LeuTrp: 0.898 ± 0.336
2.424LeuTyr: 2.424 ± 0.494
0.0LeuXaa: 0.0 ± 0.0
Met
2.155MetAla: 2.155 ± 0.554
0.18MetCys: 0.18 ± 0.128
1.706MetAsp: 1.706 ± 0.519
1.347MetGlu: 1.347 ± 0.341
1.347MetPhe: 1.347 ± 0.363
1.077MetGly: 1.077 ± 0.315
0.449MetHis: 0.449 ± 0.211
1.526MetIle: 1.526 ± 0.446
1.616MetLys: 1.616 ± 0.423
1.796MetLeu: 1.796 ± 0.345
0.269MetMet: 0.269 ± 0.154
1.167MetAsn: 1.167 ± 0.312
0.628MetPro: 0.628 ± 0.243
1.437MetGln: 1.437 ± 0.4
0.539MetArg: 0.539 ± 0.211
2.065MetSer: 2.065 ± 0.451
2.155MetThr: 2.155 ± 0.567
1.257MetVal: 1.257 ± 0.313
0.18MetTrp: 0.18 ± 0.119
1.167MetTyr: 1.167 ± 0.374
0.0MetXaa: 0.0 ± 0.0
Asn
2.693AsnAla: 2.693 ± 0.575
0.359AsnCys: 0.359 ± 0.183
3.412AsnAsp: 3.412 ± 0.698
3.771AsnGlu: 3.771 ± 0.534
3.322AsnPhe: 3.322 ± 0.587
5.207AsnGly: 5.207 ± 0.87
0.988AsnHis: 0.988 ± 0.291
4.31AsnIle: 4.31 ± 0.463
4.04AsnLys: 4.04 ± 0.572
6.195AsnLeu: 6.195 ± 0.944
1.077AsnMet: 1.077 ± 0.272
3.502AsnAsn: 3.502 ± 0.38
2.514AsnPro: 2.514 ± 0.451
2.245AsnGln: 2.245 ± 0.594
2.783AsnArg: 2.783 ± 0.528
4.31AsnSer: 4.31 ± 0.703
2.873AsnThr: 2.873 ± 0.652
3.502AsnVal: 3.502 ± 0.537
0.988AsnTrp: 0.988 ± 0.267
2.693AsnTyr: 2.693 ± 0.45
0.0AsnXaa: 0.0 ± 0.0
Pro
1.347ProAla: 1.347 ± 0.376
0.0ProCys: 0.0 ± 0.0
1.975ProAsp: 1.975 ± 0.482
1.706ProGlu: 1.706 ± 0.442
1.437ProPhe: 1.437 ± 0.351
1.167ProGly: 1.167 ± 0.44
0.18ProHis: 0.18 ± 0.153
1.975ProIle: 1.975 ± 0.443
3.232ProLys: 3.232 ± 0.591
2.783ProLeu: 2.783 ± 0.481
0.718ProMet: 0.718 ± 0.24
1.347ProAsn: 1.347 ± 0.419
0.988ProPro: 0.988 ± 0.29
1.257ProGln: 1.257 ± 0.287
0.808ProArg: 0.808 ± 0.276
2.783ProSer: 2.783 ± 0.418
1.796ProThr: 1.796 ± 0.465
1.257ProVal: 1.257 ± 0.377
0.09ProTrp: 0.09 ± 0.11
1.167ProTyr: 1.167 ± 0.314
0.0ProXaa: 0.0 ± 0.0
Gln
3.142GlnAla: 3.142 ± 0.541
0.269GlnCys: 0.269 ± 0.146
2.693GlnAsp: 2.693 ± 0.598
2.783GlnGlu: 2.783 ± 0.73
1.347GlnPhe: 1.347 ± 0.297
2.065GlnGly: 2.065 ± 0.795
0.449GlnHis: 0.449 ± 0.211
2.783GlnIle: 2.783 ± 0.435
3.681GlnLys: 3.681 ± 0.387
3.771GlnLeu: 3.771 ± 0.641
0.898GlnMet: 0.898 ± 0.253
1.975GlnAsn: 1.975 ± 0.431
1.077GlnPro: 1.077 ± 0.335
1.616GlnGln: 1.616 ± 0.491
1.796GlnArg: 1.796 ± 0.443
3.053GlnSer: 3.053 ± 0.622
2.155GlnThr: 2.155 ± 0.359
1.796GlnVal: 1.796 ± 0.357
0.449GlnTrp: 0.449 ± 0.189
0.808GlnTyr: 0.808 ± 0.222
0.0GlnXaa: 0.0 ± 0.0
Arg
2.873ArgAla: 2.873 ± 0.433
0.449ArgCys: 0.449 ± 0.222
2.334ArgAsp: 2.334 ± 0.478
2.604ArgGlu: 2.604 ± 0.547
0.988ArgPhe: 0.988 ± 0.26
1.706ArgGly: 1.706 ± 0.388
0.898ArgHis: 0.898 ± 0.362
3.681ArgIle: 3.681 ± 0.606
3.502ArgLys: 3.502 ± 0.585
4.399ArgLeu: 4.399 ± 0.742
0.988ArgMet: 0.988 ± 0.255
3.053ArgAsn: 3.053 ± 0.589
1.437ArgPro: 1.437 ± 0.39
1.616ArgGln: 1.616 ± 0.388
1.437ArgArg: 1.437 ± 0.401
1.706ArgSer: 1.706 ± 0.526
2.155ArgThr: 2.155 ± 0.406
1.257ArgVal: 1.257 ± 0.315
0.628ArgTrp: 0.628 ± 0.196
2.514ArgTyr: 2.514 ± 0.466
0.0ArgXaa: 0.0 ± 0.0
Ser
3.412SerAla: 3.412 ± 0.943
0.449SerCys: 0.449 ± 0.202
4.31SerAsp: 4.31 ± 0.607
4.669SerGlu: 4.669 ± 0.816
3.861SerPhe: 3.861 ± 0.627
5.207SerGly: 5.207 ± 0.688
1.257SerHis: 1.257 ± 0.338
5.387SerIle: 5.387 ± 0.84
4.758SerLys: 4.758 ± 0.687
4.848SerLeu: 4.848 ± 0.639
1.526SerMet: 1.526 ± 0.39
4.489SerAsn: 4.489 ± 0.615
0.539SerPro: 0.539 ± 0.198
2.604SerGln: 2.604 ± 0.454
2.065SerArg: 2.065 ± 0.475
4.938SerSer: 4.938 ± 0.691
3.412SerThr: 3.412 ± 0.566
4.04SerVal: 4.04 ± 0.729
0.988SerTrp: 0.988 ± 0.31
2.693SerTyr: 2.693 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
3.861ThrAla: 3.861 ± 0.56
0.18ThrCys: 0.18 ± 0.121
3.053ThrAsp: 3.053 ± 0.41
4.22ThrGlu: 4.22 ± 0.7
2.693ThrPhe: 2.693 ± 0.439
5.746ThrGly: 5.746 ± 0.858
0.808ThrHis: 0.808 ± 0.249
4.399ThrIle: 4.399 ± 0.91
5.746ThrLys: 5.746 ± 0.668
5.477ThrLeu: 5.477 ± 0.746
0.898ThrMet: 0.898 ± 0.3
3.053ThrAsn: 3.053 ± 0.45
2.065ThrPro: 2.065 ± 0.39
2.155ThrGln: 2.155 ± 0.471
1.706ThrArg: 1.706 ± 0.356
3.681ThrSer: 3.681 ± 0.7
3.502ThrThr: 3.502 ± 0.555
4.489ThrVal: 4.489 ± 0.669
0.718ThrTrp: 0.718 ± 0.269
2.514ThrTyr: 2.514 ± 0.502
0.0ThrXaa: 0.0 ± 0.0
Val
3.861ValAla: 3.861 ± 0.543
0.718ValCys: 0.718 ± 0.249
4.669ValAsp: 4.669 ± 0.57
5.567ValGlu: 5.567 ± 0.699
1.437ValPhe: 1.437 ± 0.369
4.13ValGly: 4.13 ± 0.607
0.539ValHis: 0.539 ± 0.239
3.412ValIle: 3.412 ± 0.577
4.938ValLys: 4.938 ± 0.647
4.489ValLeu: 4.489 ± 0.538
0.988ValMet: 0.988 ± 0.363
2.693ValAsn: 2.693 ± 0.669
1.796ValPro: 1.796 ± 0.485
1.975ValGln: 1.975 ± 0.361
2.424ValArg: 2.424 ± 0.425
4.758ValSer: 4.758 ± 0.601
4.04ValThr: 4.04 ± 0.607
4.399ValVal: 4.399 ± 0.535
0.628ValTrp: 0.628 ± 0.256
1.706ValTyr: 1.706 ± 0.403
0.0ValXaa: 0.0 ± 0.0
Trp
0.988TrpAla: 0.988 ± 0.338
0.09TrpCys: 0.09 ± 0.098
0.18TrpAsp: 0.18 ± 0.14
0.718TrpGlu: 0.718 ± 0.269
0.18TrpPhe: 0.18 ± 0.115
1.167TrpGly: 1.167 ± 0.338
0.359TrpHis: 0.359 ± 0.177
1.437TrpIle: 1.437 ± 0.32
0.718TrpLys: 0.718 ± 0.265
1.257TrpLeu: 1.257 ± 0.443
0.359TrpMet: 0.359 ± 0.176
0.359TrpAsn: 0.359 ± 0.139
0.18TrpPro: 0.18 ± 0.137
0.718TrpGln: 0.718 ± 0.259
0.898TrpArg: 0.898 ± 0.292
1.167TrpSer: 1.167 ± 0.448
0.898TrpThr: 0.898 ± 0.27
0.449TrpVal: 0.449 ± 0.245
0.09TrpTrp: 0.09 ± 0.096
0.808TrpTyr: 0.808 ± 0.264
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.232TyrAla: 3.232 ± 0.59
0.808TyrCys: 0.808 ± 0.273
3.053TyrAsp: 3.053 ± 0.609
2.783TyrGlu: 2.783 ± 0.505
1.616TyrPhe: 1.616 ± 0.378
2.693TyrGly: 2.693 ± 0.366
1.167TyrHis: 1.167 ± 0.371
2.514TyrIle: 2.514 ± 0.42
3.771TyrLys: 3.771 ± 0.715
3.861TyrLeu: 3.861 ± 0.594
0.898TyrMet: 0.898 ± 0.261
2.514TyrAsn: 2.514 ± 0.581
0.988TyrPro: 0.988 ± 0.32
1.706TyrGln: 1.706 ± 0.275
1.885TyrArg: 1.885 ± 0.415
1.437TyrSer: 1.437 ± 0.376
2.873TyrThr: 2.873 ± 0.506
2.245TyrVal: 2.245 ± 0.369
0.359TyrTrp: 0.359 ± 0.145
1.975TyrTyr: 1.975 ± 0.442
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11139 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski