Amino acid dipepetide frequency for Streptococcus phage Javan597

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.358AlaAla: 4.358 ± 0.72
0.419AlaCys: 0.419 ± 0.226
6.034AlaAsp: 6.034 ± 0.815
5.447AlaGlu: 5.447 ± 0.843
3.352AlaPhe: 3.352 ± 0.564
4.106AlaGly: 4.106 ± 0.872
0.754AlaHis: 0.754 ± 0.347
5.866AlaIle: 5.866 ± 0.934
6.536AlaLys: 6.536 ± 0.861
5.698AlaLeu: 5.698 ± 0.819
1.425AlaMet: 1.425 ± 0.432
3.52AlaAsn: 3.52 ± 0.446
1.173AlaPro: 1.173 ± 0.4
2.849AlaGln: 2.849 ± 0.575
2.43AlaArg: 2.43 ± 0.491
3.939AlaSer: 3.939 ± 0.594
3.855AlaThr: 3.855 ± 0.549
5.531AlaVal: 5.531 ± 0.948
1.341AlaTrp: 1.341 ± 0.541
2.682AlaTyr: 2.682 ± 0.555
0.0AlaXaa: 0.0 ± 0.0
Cys
0.587CysAla: 0.587 ± 0.213
0.251CysCys: 0.251 ± 0.14
0.419CysAsp: 0.419 ± 0.202
0.419CysGlu: 0.419 ± 0.235
0.084CysPhe: 0.084 ± 0.088
0.419CysGly: 0.419 ± 0.173
0.084CysHis: 0.084 ± 0.094
0.335CysIle: 0.335 ± 0.163
0.587CysLys: 0.587 ± 0.253
0.419CysLeu: 0.419 ± 0.178
0.168CysMet: 0.168 ± 0.119
0.168CysAsn: 0.168 ± 0.116
0.168CysPro: 0.168 ± 0.116
0.251CysGln: 0.251 ± 0.145
0.251CysArg: 0.251 ± 0.132
0.419CysSer: 0.419 ± 0.182
0.251CysThr: 0.251 ± 0.171
0.419CysVal: 0.419 ± 0.169
0.0CysTrp: 0.0 ± 0.0
0.168CysTyr: 0.168 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
2.598AspAla: 2.598 ± 0.631
0.335AspCys: 0.335 ± 0.177
4.274AspAsp: 4.274 ± 0.735
5.028AspGlu: 5.028 ± 1.028
3.352AspPhe: 3.352 ± 0.564
4.441AspGly: 4.441 ± 0.817
0.587AspHis: 0.587 ± 0.237
4.274AspIle: 4.274 ± 0.64
5.112AspLys: 5.112 ± 0.802
4.525AspLeu: 4.525 ± 0.672
1.676AspMet: 1.676 ± 0.322
3.52AspAsn: 3.52 ± 0.632
1.341AspPro: 1.341 ± 0.313
1.676AspGln: 1.676 ± 0.483
1.844AspArg: 1.844 ± 0.379
3.771AspSer: 3.771 ± 0.499
2.682AspThr: 2.682 ± 0.437
5.615AspVal: 5.615 ± 0.691
1.341AspTrp: 1.341 ± 0.413
2.598AspTyr: 2.598 ± 0.457
0.0AspXaa: 0.0 ± 0.0
Glu
6.117GluAla: 6.117 ± 0.897
0.754GluCys: 0.754 ± 0.289
3.52GluAsp: 3.52 ± 0.598
6.034GluGlu: 6.034 ± 0.923
2.765GluPhe: 2.765 ± 0.531
3.101GluGly: 3.101 ± 0.606
1.006GluHis: 1.006 ± 0.269
5.782GluIle: 5.782 ± 0.664
5.95GluLys: 5.95 ± 0.878
7.458GluLeu: 7.458 ± 1.073
2.346GluMet: 2.346 ± 0.54
4.358GluAsn: 4.358 ± 0.515
2.346GluPro: 2.346 ± 0.492
3.603GluGln: 3.603 ± 0.611
3.436GluArg: 3.436 ± 0.679
3.771GluSer: 3.771 ± 0.545
4.441GluThr: 4.441 ± 0.654
5.028GluVal: 5.028 ± 0.706
0.503GluTrp: 0.503 ± 0.256
3.352GluTyr: 3.352 ± 0.692
0.0GluXaa: 0.0 ± 0.0
Phe
3.52PheAla: 3.52 ± 0.78
0.084PheCys: 0.084 ± 0.084
2.765PheAsp: 2.765 ± 0.559
3.687PheGlu: 3.687 ± 0.544
2.179PhePhe: 2.179 ± 0.42
2.765PheGly: 2.765 ± 0.398
0.754PheHis: 0.754 ± 0.271
1.844PheIle: 1.844 ± 0.498
2.933PheLys: 2.933 ± 0.565
2.346PheLeu: 2.346 ± 0.469
1.006PheMet: 1.006 ± 0.255
3.184PheAsn: 3.184 ± 0.447
1.508PhePro: 1.508 ± 0.363
1.425PheGln: 1.425 ± 0.279
2.011PheArg: 2.011 ± 0.341
3.017PheSer: 3.017 ± 0.413
3.268PheThr: 3.268 ± 0.465
2.514PheVal: 2.514 ± 0.489
0.419PheTrp: 0.419 ± 0.214
1.089PheTyr: 1.089 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
3.268GlyAla: 3.268 ± 0.549
0.335GlyCys: 0.335 ± 0.165
3.687GlyAsp: 3.687 ± 0.829
4.106GlyGlu: 4.106 ± 0.633
2.765GlyPhe: 2.765 ± 0.354
4.358GlyGly: 4.358 ± 0.742
0.754GlyHis: 0.754 ± 0.213
5.028GlyIle: 5.028 ± 0.628
4.693GlyLys: 4.693 ± 0.638
5.698GlyLeu: 5.698 ± 0.712
1.927GlyMet: 1.927 ± 0.532
4.274GlyAsn: 4.274 ± 0.525
1.425GlyPro: 1.425 ± 0.632
2.765GlyGln: 2.765 ± 0.748
2.933GlyArg: 2.933 ± 0.6
4.022GlySer: 4.022 ± 0.808
4.022GlyThr: 4.022 ± 0.661
3.352GlyVal: 3.352 ± 0.629
1.089GlyTrp: 1.089 ± 0.281
3.101GlyTyr: 3.101 ± 0.503
0.0GlyXaa: 0.0 ± 0.0
His
1.425HisAla: 1.425 ± 0.401
0.251HisCys: 0.251 ± 0.148
0.587HisAsp: 0.587 ± 0.243
0.67HisGlu: 0.67 ± 0.271
1.341HisPhe: 1.341 ± 0.315
1.089HisGly: 1.089 ± 0.317
0.084HisHis: 0.084 ± 0.07
1.257HisIle: 1.257 ± 0.397
0.838HisLys: 0.838 ± 0.382
1.508HisLeu: 1.508 ± 0.418
0.251HisMet: 0.251 ± 0.14
1.173HisAsn: 1.173 ± 0.282
0.67HisPro: 0.67 ± 0.207
0.335HisGln: 0.335 ± 0.142
0.67HisArg: 0.67 ± 0.266
1.089HisSer: 1.089 ± 0.327
0.754HisThr: 0.754 ± 0.254
0.754HisVal: 0.754 ± 0.229
0.335HisTrp: 0.335 ± 0.147
0.67HisTyr: 0.67 ± 0.328
0.0HisXaa: 0.0 ± 0.0
Ile
4.86IleAla: 4.86 ± 0.719
0.503IleCys: 0.503 ± 0.177
4.609IleAsp: 4.609 ± 0.61
6.369IleGlu: 6.369 ± 0.972
2.011IlePhe: 2.011 ± 0.372
4.106IleGly: 4.106 ± 0.612
1.508IleHis: 1.508 ± 0.373
2.765IleIle: 2.765 ± 0.539
4.944IleLys: 4.944 ± 0.798
5.279IleLeu: 5.279 ± 0.687
2.263IleMet: 2.263 ± 0.394
4.777IleAsn: 4.777 ± 0.735
3.101IlePro: 3.101 ± 0.506
2.682IleGln: 2.682 ± 0.546
3.687IleArg: 3.687 ± 0.482
4.441IleSer: 4.441 ± 0.82
3.771IleThr: 3.771 ± 0.604
3.687IleVal: 3.687 ± 0.635
0.922IleTrp: 0.922 ± 0.339
2.095IleTyr: 2.095 ± 0.445
0.0IleXaa: 0.0 ± 0.0
Lys
5.112LysAla: 5.112 ± 0.602
0.67LysCys: 0.67 ± 0.192
3.939LysAsp: 3.939 ± 0.737
6.62LysGlu: 6.62 ± 0.714
2.43LysPhe: 2.43 ± 0.549
4.358LysGly: 4.358 ± 0.645
1.676LysHis: 1.676 ± 0.438
6.62LysIle: 6.62 ± 0.594
6.453LysLys: 6.453 ± 0.963
6.453LysLeu: 6.453 ± 0.945
1.927LysMet: 1.927 ± 0.442
3.771LysAsn: 3.771 ± 0.417
1.844LysPro: 1.844 ± 0.328
3.52LysGln: 3.52 ± 0.614
4.022LysArg: 4.022 ± 0.699
4.525LysSer: 4.525 ± 0.626
5.112LysThr: 5.112 ± 0.702
5.531LysVal: 5.531 ± 0.784
1.089LysTrp: 1.089 ± 0.322
2.598LysTyr: 2.598 ± 0.549
0.0LysXaa: 0.0 ± 0.0
Leu
7.207LeuAla: 7.207 ± 0.942
0.168LeuCys: 0.168 ± 0.12
4.777LeuAsp: 4.777 ± 0.738
7.039LeuGlu: 7.039 ± 0.825
3.603LeuPhe: 3.603 ± 0.685
4.944LeuGly: 4.944 ± 0.834
1.257LeuHis: 1.257 ± 0.333
4.777LeuIle: 4.777 ± 0.719
6.872LeuLys: 6.872 ± 0.804
6.034LeuLeu: 6.034 ± 0.747
1.76LeuMet: 1.76 ± 0.426
3.101LeuAsn: 3.101 ± 0.49
3.268LeuPro: 3.268 ± 0.604
2.765LeuGln: 2.765 ± 0.499
4.609LeuArg: 4.609 ± 0.58
5.112LeuSer: 5.112 ± 0.727
5.866LeuThr: 5.866 ± 0.857
5.112LeuVal: 5.112 ± 0.857
1.006LeuTrp: 1.006 ± 0.335
2.598LeuTyr: 2.598 ± 0.545
0.0LeuXaa: 0.0 ± 0.0
Met
2.346MetAla: 2.346 ± 0.539
0.084MetCys: 0.084 ± 0.072
1.257MetAsp: 1.257 ± 0.332
1.089MetGlu: 1.089 ± 0.342
1.089MetPhe: 1.089 ± 0.325
1.173MetGly: 1.173 ± 0.339
0.587MetHis: 0.587 ± 0.235
1.592MetIle: 1.592 ± 0.328
2.598MetLys: 2.598 ± 0.501
1.508MetLeu: 1.508 ± 0.345
0.922MetMet: 0.922 ± 0.29
1.592MetAsn: 1.592 ± 0.373
0.754MetPro: 0.754 ± 0.197
1.089MetGln: 1.089 ± 0.331
0.587MetArg: 0.587 ± 0.208
2.095MetSer: 2.095 ± 0.456
1.927MetThr: 1.927 ± 0.476
1.425MetVal: 1.425 ± 0.314
0.335MetTrp: 0.335 ± 0.173
0.922MetTyr: 0.922 ± 0.289
0.0MetXaa: 0.0 ± 0.0
Asn
4.358AsnAla: 4.358 ± 0.566
0.251AsnCys: 0.251 ± 0.131
2.514AsnAsp: 2.514 ± 0.449
3.352AsnGlu: 3.352 ± 0.575
2.765AsnPhe: 2.765 ± 0.455
5.698AsnGly: 5.698 ± 1.036
0.587AsnHis: 0.587 ± 0.201
3.855AsnIle: 3.855 ± 0.413
3.771AsnLys: 3.771 ± 0.589
4.022AsnLeu: 4.022 ± 0.641
1.257AsnMet: 1.257 ± 0.38
2.849AsnAsn: 2.849 ± 0.534
2.263AsnPro: 2.263 ± 0.44
2.43AsnGln: 2.43 ± 0.534
2.598AsnArg: 2.598 ± 0.481
3.017AsnSer: 3.017 ± 0.657
3.101AsnThr: 3.101 ± 0.689
3.771AsnVal: 3.771 ± 0.479
1.173AsnTrp: 1.173 ± 0.395
2.179AsnTyr: 2.179 ± 0.462
0.0AsnXaa: 0.0 ± 0.0
Pro
2.011ProAla: 2.011 ± 0.406
0.084ProCys: 0.084 ± 0.088
1.592ProAsp: 1.592 ± 0.341
2.682ProGlu: 2.682 ± 0.619
1.508ProPhe: 1.508 ± 0.293
1.76ProGly: 1.76 ± 0.521
0.419ProHis: 0.419 ± 0.233
1.676ProIle: 1.676 ± 0.293
2.514ProLys: 2.514 ± 0.492
3.017ProLeu: 3.017 ± 0.454
0.587ProMet: 0.587 ± 0.22
1.676ProAsn: 1.676 ± 0.34
0.67ProPro: 0.67 ± 0.195
1.173ProGln: 1.173 ± 0.344
1.089ProArg: 1.089 ± 0.308
1.676ProSer: 1.676 ± 0.574
1.425ProThr: 1.425 ± 0.452
2.933ProVal: 2.933 ± 0.485
0.838ProTrp: 0.838 ± 0.257
1.173ProTyr: 1.173 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.855GlnAla: 3.855 ± 0.586
0.084GlnCys: 0.084 ± 0.09
1.844GlnAsp: 1.844 ± 0.371
3.184GlnGlu: 3.184 ± 0.553
1.257GlnPhe: 1.257 ± 0.359
1.844GlnGly: 1.844 ± 0.479
1.006GlnHis: 1.006 ± 0.38
2.849GlnIle: 2.849 ± 0.509
2.849GlnLys: 2.849 ± 0.546
4.022GlnLeu: 4.022 ± 0.438
0.754GlnMet: 0.754 ± 0.323
2.011GlnAsn: 2.011 ± 0.401
1.089GlnPro: 1.089 ± 0.374
2.179GlnGln: 2.179 ± 0.454
1.844GlnArg: 1.844 ± 0.446
2.598GlnSer: 2.598 ± 0.571
2.011GlnThr: 2.011 ± 0.368
2.514GlnVal: 2.514 ± 0.389
0.084GlnTrp: 0.084 ± 0.068
1.173GlnTyr: 1.173 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
2.598ArgAla: 2.598 ± 0.397
0.503ArgCys: 0.503 ± 0.216
3.017ArgAsp: 3.017 ± 0.638
2.514ArgGlu: 2.514 ± 0.431
1.76ArgPhe: 1.76 ± 0.385
2.346ArgGly: 2.346 ± 0.493
0.754ArgHis: 0.754 ± 0.222
3.352ArgIle: 3.352 ± 0.617
3.939ArgLys: 3.939 ± 0.617
4.106ArgLeu: 4.106 ± 0.756
1.425ArgMet: 1.425 ± 0.415
3.268ArgAsn: 3.268 ± 0.487
1.592ArgPro: 1.592 ± 0.4
1.425ArgGln: 1.425 ± 0.417
2.263ArgArg: 2.263 ± 0.556
1.927ArgSer: 1.927 ± 0.397
2.011ArgThr: 2.011 ± 0.509
2.514ArgVal: 2.514 ± 0.36
1.257ArgTrp: 1.257 ± 0.295
2.263ArgTyr: 2.263 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
4.022SerAla: 4.022 ± 0.814
0.335SerCys: 0.335 ± 0.165
4.525SerAsp: 4.525 ± 0.531
5.112SerGlu: 5.112 ± 0.818
2.514SerPhe: 2.514 ± 0.409
4.944SerGly: 4.944 ± 0.898
0.838SerHis: 0.838 ± 0.227
3.184SerIle: 3.184 ± 0.509
5.196SerLys: 5.196 ± 0.665
4.944SerLeu: 4.944 ± 0.565
1.173SerMet: 1.173 ± 0.292
3.52SerAsn: 3.52 ± 0.754
1.592SerPro: 1.592 ± 0.457
2.765SerGln: 2.765 ± 0.494
2.598SerArg: 2.598 ± 0.372
4.358SerSer: 4.358 ± 0.856
4.106SerThr: 4.106 ± 0.759
2.933SerVal: 2.933 ± 0.467
0.754SerTrp: 0.754 ± 0.225
2.682SerTyr: 2.682 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
4.19ThrAla: 4.19 ± 0.536
0.251ThrCys: 0.251 ± 0.116
3.687ThrAsp: 3.687 ± 0.507
3.352ThrGlu: 3.352 ± 0.647
3.268ThrPhe: 3.268 ± 0.551
5.196ThrGly: 5.196 ± 1.01
1.173ThrHis: 1.173 ± 0.294
4.944ThrIle: 4.944 ± 1.079
4.525ThrLys: 4.525 ± 0.523
4.441ThrLeu: 4.441 ± 0.589
1.089ThrMet: 1.089 ± 0.289
3.101ThrAsn: 3.101 ± 0.577
1.592ThrPro: 1.592 ± 0.36
2.011ThrGln: 2.011 ± 0.522
1.927ThrArg: 1.927 ± 0.366
4.022ThrSer: 4.022 ± 0.749
4.19ThrThr: 4.19 ± 0.72
5.279ThrVal: 5.279 ± 0.83
1.006ThrTrp: 1.006 ± 0.436
1.844ThrTyr: 1.844 ± 0.456
0.0ThrXaa: 0.0 ± 0.0
Val
5.531ValAla: 5.531 ± 0.66
0.251ValCys: 0.251 ± 0.136
4.358ValAsp: 4.358 ± 0.605
4.693ValGlu: 4.693 ± 0.597
2.179ValPhe: 2.179 ± 0.459
4.358ValGly: 4.358 ± 0.58
1.006ValHis: 1.006 ± 0.358
4.693ValIle: 4.693 ± 0.623
4.106ValLys: 4.106 ± 0.543
5.196ValLeu: 5.196 ± 0.773
2.095ValMet: 2.095 ± 0.356
3.184ValAsn: 3.184 ± 0.484
2.346ValPro: 2.346 ± 0.44
1.927ValGln: 1.927 ± 0.398
2.598ValArg: 2.598 ± 0.409
5.279ValSer: 5.279 ± 0.646
4.693ValThr: 4.693 ± 0.602
3.603ValVal: 3.603 ± 0.665
1.425ValTrp: 1.425 ± 0.396
1.676ValTyr: 1.676 ± 0.378
0.0ValXaa: 0.0 ± 0.0
Trp
1.508TrpAla: 1.508 ± 0.386
0.0TrpCys: 0.0 ± 0.0
0.838TrpAsp: 0.838 ± 0.346
1.592TrpGlu: 1.592 ± 0.338
0.503TrpPhe: 0.503 ± 0.199
0.67TrpGly: 0.67 ± 0.308
0.419TrpHis: 0.419 ± 0.198
1.006TrpIle: 1.006 ± 0.307
1.006TrpLys: 1.006 ± 0.324
1.341TrpLeu: 1.341 ± 0.334
0.503TrpMet: 0.503 ± 0.177
0.922TrpAsn: 0.922 ± 0.32
0.084TrpPro: 0.084 ± 0.083
0.67TrpGln: 0.67 ± 0.285
0.67TrpArg: 0.67 ± 0.261
0.67TrpSer: 0.67 ± 0.264
1.676TrpThr: 1.676 ± 0.725
0.67TrpVal: 0.67 ± 0.204
0.084TrpTrp: 0.084 ± 0.092
0.754TrpTyr: 0.754 ± 0.216
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.844TyrAla: 1.844 ± 0.399
0.251TyrCys: 0.251 ± 0.15
2.346TyrAsp: 2.346 ± 0.423
2.682TyrGlu: 2.682 ± 0.513
1.592TyrPhe: 1.592 ± 0.386
1.927TyrGly: 1.927 ± 0.401
0.503TyrHis: 0.503 ± 0.212
2.682TyrIle: 2.682 ± 0.571
2.849TyrLys: 2.849 ± 0.678
3.855TyrLeu: 3.855 ± 0.556
0.335TyrMet: 0.335 ± 0.21
1.844TyrAsn: 1.844 ± 0.361
1.676TyrPro: 1.676 ± 0.363
1.508TyrGln: 1.508 ± 0.43
2.849TyrArg: 2.849 ± 0.546
2.346TyrSer: 2.346 ± 0.495
1.927TyrThr: 1.927 ± 0.562
2.011TyrVal: 2.011 ± 0.431
0.587TyrTrp: 0.587 ± 0.223
1.592TyrTyr: 1.592 ± 0.479
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (11934 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski