Amino acid dipepetide frequency for Streptococcus phage Javan499

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.026AlaAla: 4.026 ± 0.821
0.537AlaCys: 0.537 ± 0.214
3.489AlaAsp: 3.489 ± 0.594
4.026AlaGlu: 4.026 ± 0.649
2.595AlaPhe: 2.595 ± 0.403
4.205AlaGly: 4.205 ± 0.838
0.537AlaHis: 0.537 ± 0.218
4.742AlaIle: 4.742 ± 0.999
4.921AlaLys: 4.921 ± 0.569
6.173AlaLeu: 6.173 ± 0.958
1.7AlaMet: 1.7 ± 0.342
3.668AlaAsn: 3.668 ± 0.522
1.879AlaPro: 1.879 ± 0.318
2.774AlaGln: 2.774 ± 0.474
2.416AlaArg: 2.416 ± 0.482
4.742AlaSer: 4.742 ± 0.68
4.116AlaThr: 4.116 ± 0.662
4.831AlaVal: 4.831 ± 0.669
0.716AlaTrp: 0.716 ± 0.299
3.31AlaTyr: 3.31 ± 0.614
0.0AlaXaa: 0.0 ± 0.0
Cys
0.089CysAla: 0.089 ± 0.08
0.268CysCys: 0.268 ± 0.161
0.447CysAsp: 0.447 ± 0.171
0.537CysGlu: 0.537 ± 0.191
0.268CysPhe: 0.268 ± 0.142
0.626CysGly: 0.626 ± 0.277
0.089CysHis: 0.089 ± 0.094
0.537CysIle: 0.537 ± 0.251
0.358CysLys: 0.358 ± 0.217
0.895CysLeu: 0.895 ± 0.302
0.089CysMet: 0.089 ± 0.09
0.268CysAsn: 0.268 ± 0.18
0.447CysPro: 0.447 ± 0.193
0.626CysGln: 0.626 ± 0.229
0.626CysArg: 0.626 ± 0.216
0.537CysSer: 0.537 ± 0.196
0.268CysThr: 0.268 ± 0.165
0.447CysVal: 0.447 ± 0.177
0.0CysTrp: 0.0 ± 0.0
0.716CysTyr: 0.716 ± 0.278
0.0CysXaa: 0.0 ± 0.0
Asp
2.684AspAla: 2.684 ± 0.56
0.268AspCys: 0.268 ± 0.144
3.042AspAsp: 3.042 ± 0.78
5.547AspGlu: 5.547 ± 0.713
3.131AspPhe: 3.131 ± 0.495
5.458AspGly: 5.458 ± 0.682
0.895AspHis: 0.895 ± 0.343
4.026AspIle: 4.026 ± 0.452
3.758AspLys: 3.758 ± 0.512
5.189AspLeu: 5.189 ± 0.589
1.968AspMet: 1.968 ± 0.464
2.326AspAsn: 2.326 ± 0.464
1.253AspPro: 1.253 ± 0.359
2.147AspGln: 2.147 ± 0.454
2.863AspArg: 2.863 ± 0.511
3.131AspSer: 3.131 ± 0.656
2.416AspThr: 2.416 ± 0.497
3.131AspVal: 3.131 ± 0.522
0.716AspTrp: 0.716 ± 0.242
2.774AspTyr: 2.774 ± 0.663
0.0AspXaa: 0.0 ± 0.0
Glu
5.1GluAla: 5.1 ± 0.697
0.716GluCys: 0.716 ± 0.23
4.295GluAsp: 4.295 ± 0.735
5.816GluGlu: 5.816 ± 1.006
2.684GluPhe: 2.684 ± 0.369
4.473GluGly: 4.473 ± 0.457
1.163GluHis: 1.163 ± 0.29
3.847GluIle: 3.847 ± 0.592
6.71GluLys: 6.71 ± 0.764
8.321GluLeu: 8.321 ± 0.939
1.7GluMet: 1.7 ± 0.488
3.221GluAsn: 3.221 ± 0.55
1.61GluPro: 1.61 ± 0.49
3.579GluGln: 3.579 ± 0.581
3.31GluArg: 3.31 ± 0.583
4.295GluSer: 4.295 ± 0.619
5.458GluThr: 5.458 ± 0.673
4.742GluVal: 4.742 ± 0.564
0.716GluTrp: 0.716 ± 0.322
1.879GluTyr: 1.879 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.326PheAla: 2.326 ± 0.469
0.447PheCys: 0.447 ± 0.183
3.131PheAsp: 3.131 ± 0.586
2.774PheGlu: 2.774 ± 0.575
1.879PhePhe: 1.879 ± 0.476
3.489PheGly: 3.489 ± 0.47
0.716PheHis: 0.716 ± 0.323
2.147PheIle: 2.147 ± 0.498
3.489PheLys: 3.489 ± 0.564
2.863PheLeu: 2.863 ± 0.461
0.984PheMet: 0.984 ± 0.403
1.879PheAsn: 1.879 ± 0.327
0.805PhePro: 0.805 ± 0.247
1.432PheGln: 1.432 ± 0.348
1.432PheArg: 1.432 ± 0.305
2.684PheSer: 2.684 ± 0.455
1.879PheThr: 1.879 ± 0.415
2.416PheVal: 2.416 ± 0.415
0.537PheTrp: 0.537 ± 0.228
1.432PheTyr: 1.432 ± 0.432
0.0PheXaa: 0.0 ± 0.0
Gly
3.758GlyAla: 3.758 ± 0.54
0.358GlyCys: 0.358 ± 0.195
4.026GlyAsp: 4.026 ± 0.591
4.295GlyGlu: 4.295 ± 0.663
2.505GlyPhe: 2.505 ± 0.36
5.189GlyGly: 5.189 ± 0.848
1.879GlyHis: 1.879 ± 0.491
5.01GlyIle: 5.01 ± 0.749
5.726GlyLys: 5.726 ± 0.524
6.084GlyLeu: 6.084 ± 1.007
2.326GlyMet: 2.326 ± 0.45
4.026GlyAsn: 4.026 ± 0.694
1.074GlyPro: 1.074 ± 0.26
2.595GlyGln: 2.595 ± 0.526
3.937GlyArg: 3.937 ± 0.713
4.026GlySer: 4.026 ± 0.724
4.205GlyThr: 4.205 ± 0.512
5.368GlyVal: 5.368 ± 0.923
0.805GlyTrp: 0.805 ± 0.217
3.131GlyTyr: 3.131 ± 0.529
0.0GlyXaa: 0.0 ± 0.0
His
0.447HisAla: 0.447 ± 0.23
0.179HisCys: 0.179 ± 0.129
1.074HisAsp: 1.074 ± 0.279
0.716HisGlu: 0.716 ± 0.279
0.984HisPhe: 0.984 ± 0.331
1.074HisGly: 1.074 ± 0.247
0.805HisHis: 0.805 ± 0.279
1.253HisIle: 1.253 ± 0.358
0.984HisLys: 0.984 ± 0.265
1.879HisLeu: 1.879 ± 0.356
0.447HisMet: 0.447 ± 0.215
1.163HisAsn: 1.163 ± 0.288
1.163HisPro: 1.163 ± 0.358
0.984HisGln: 0.984 ± 0.363
0.626HisArg: 0.626 ± 0.228
0.895HisSer: 0.895 ± 0.263
1.163HisThr: 1.163 ± 0.339
0.984HisVal: 0.984 ± 0.323
0.447HisTrp: 0.447 ± 0.186
0.716HisTyr: 0.716 ± 0.299
0.0HisXaa: 0.0 ± 0.0
Ile
4.742IleAla: 4.742 ± 0.581
0.716IleCys: 0.716 ± 0.303
4.921IleAsp: 4.921 ± 0.576
4.026IleGlu: 4.026 ± 0.646
1.163IlePhe: 1.163 ± 0.473
4.921IleGly: 4.921 ± 0.807
0.984IleHis: 0.984 ± 0.262
3.579IleIle: 3.579 ± 0.538
5.368IleLys: 5.368 ± 0.842
4.563IleLeu: 4.563 ± 0.521
1.432IleMet: 1.432 ± 0.37
2.147IleAsn: 2.147 ± 0.541
1.968IlePro: 1.968 ± 0.434
3.31IleGln: 3.31 ± 0.5
1.879IleArg: 1.879 ± 0.38
5.01IleSer: 5.01 ± 1.018
4.384IleThr: 4.384 ± 0.529
3.847IleVal: 3.847 ± 0.669
1.163IleTrp: 1.163 ± 0.37
2.147IleTyr: 2.147 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
6.263LysAla: 6.263 ± 0.633
0.447LysCys: 0.447 ± 0.191
3.579LysAsp: 3.579 ± 0.572
6.621LysGlu: 6.621 ± 0.821
2.416LysPhe: 2.416 ± 0.365
4.652LysGly: 4.652 ± 0.748
1.7LysHis: 1.7 ± 0.44
4.295LysIle: 4.295 ± 0.561
4.742LysLys: 4.742 ± 0.767
5.458LysLeu: 5.458 ± 0.701
1.432LysMet: 1.432 ± 0.373
4.205LysAsn: 4.205 ± 0.641
3.042LysPro: 3.042 ± 0.598
3.131LysGln: 3.131 ± 0.529
4.384LysArg: 4.384 ± 0.675
4.384LysSer: 4.384 ± 0.621
4.116LysThr: 4.116 ± 0.516
5.726LysVal: 5.726 ± 0.856
0.895LysTrp: 0.895 ± 0.272
1.879LysTyr: 1.879 ± 0.462
0.0LysXaa: 0.0 ± 0.0
Leu
5.368LeuAla: 5.368 ± 0.783
0.358LeuCys: 0.358 ± 0.153
5.01LeuAsp: 5.01 ± 0.532
7.784LeuGlu: 7.784 ± 1.111
2.684LeuPhe: 2.684 ± 0.394
4.921LeuGly: 4.921 ± 0.761
1.61LeuHis: 1.61 ± 0.411
5.1LeuIle: 5.1 ± 0.631
6.531LeuLys: 6.531 ± 0.732
6.8LeuLeu: 6.8 ± 0.847
2.326LeuMet: 2.326 ± 0.456
4.384LeuAsn: 4.384 ± 0.589
3.4LeuPro: 3.4 ± 0.686
3.668LeuGln: 3.668 ± 0.433
4.473LeuArg: 4.473 ± 0.677
7.605LeuSer: 7.605 ± 0.762
6.352LeuThr: 6.352 ± 0.798
5.905LeuVal: 5.905 ± 0.7
0.805LeuTrp: 0.805 ± 0.21
3.221LeuTyr: 3.221 ± 0.749
0.0LeuXaa: 0.0 ± 0.0
Met
1.521MetAla: 1.521 ± 0.323
0.089MetCys: 0.089 ± 0.082
1.521MetAsp: 1.521 ± 0.477
1.61MetGlu: 1.61 ± 0.396
0.716MetPhe: 0.716 ± 0.272
1.968MetGly: 1.968 ± 0.406
0.179MetHis: 0.179 ± 0.133
2.058MetIle: 2.058 ± 0.473
2.058MetLys: 2.058 ± 0.454
1.074MetLeu: 1.074 ± 0.281
1.074MetMet: 1.074 ± 0.323
1.163MetAsn: 1.163 ± 0.346
0.358MetPro: 0.358 ± 0.171
0.626MetGln: 0.626 ± 0.233
1.342MetArg: 1.342 ± 0.329
1.968MetSer: 1.968 ± 0.356
2.595MetThr: 2.595 ± 0.655
1.61MetVal: 1.61 ± 0.394
0.179MetTrp: 0.179 ± 0.138
0.626MetTyr: 0.626 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
4.295AsnAla: 4.295 ± 0.651
0.268AsnCys: 0.268 ± 0.168
2.416AsnAsp: 2.416 ± 0.375
3.221AsnGlu: 3.221 ± 0.588
1.968AsnPhe: 1.968 ± 0.49
5.01AsnGly: 5.01 ± 0.627
0.895AsnHis: 0.895 ± 0.288
2.237AsnIle: 2.237 ± 0.474
2.952AsnLys: 2.952 ± 0.595
3.847AsnLeu: 3.847 ± 0.581
1.342AsnMet: 1.342 ± 0.351
1.61AsnAsn: 1.61 ± 0.363
2.326AsnPro: 2.326 ± 0.511
2.326AsnGln: 2.326 ± 0.351
2.147AsnArg: 2.147 ± 0.344
2.952AsnSer: 2.952 ± 0.527
2.595AsnThr: 2.595 ± 0.627
3.31AsnVal: 3.31 ± 0.563
0.716AsnTrp: 0.716 ± 0.186
0.895AsnTyr: 0.895 ± 0.25
0.0AsnXaa: 0.0 ± 0.0
Pro
0.805ProAla: 0.805 ± 0.254
0.268ProCys: 0.268 ± 0.15
1.789ProAsp: 1.789 ± 0.532
1.789ProGlu: 1.789 ± 0.446
0.984ProPhe: 0.984 ± 0.36
1.163ProGly: 1.163 ± 0.378
0.716ProHis: 0.716 ± 0.239
2.505ProIle: 2.505 ± 0.474
2.952ProLys: 2.952 ± 0.613
3.131ProLeu: 3.131 ± 0.529
0.626ProMet: 0.626 ± 0.27
1.253ProAsn: 1.253 ± 0.448
1.074ProPro: 1.074 ± 0.4
1.879ProGln: 1.879 ± 0.379
1.432ProArg: 1.432 ± 0.271
3.042ProSer: 3.042 ± 0.485
2.058ProThr: 2.058 ± 0.381
2.147ProVal: 2.147 ± 0.559
0.358ProTrp: 0.358 ± 0.181
1.879ProTyr: 1.879 ± 0.465
0.0ProXaa: 0.0 ± 0.0
Gln
3.937GlnAla: 3.937 ± 0.823
0.358GlnCys: 0.358 ± 0.204
2.147GlnAsp: 2.147 ± 0.42
3.4GlnGlu: 3.4 ± 0.663
1.879GlnPhe: 1.879 ± 0.334
2.595GlnGly: 2.595 ± 0.447
0.537GlnHis: 0.537 ± 0.217
2.147GlnIle: 2.147 ± 0.487
3.131GlnLys: 3.131 ± 0.629
4.831GlnLeu: 4.831 ± 0.58
0.984GlnMet: 0.984 ± 0.344
2.684GlnAsn: 2.684 ± 0.442
1.253GlnPro: 1.253 ± 0.391
1.789GlnGln: 1.789 ± 0.601
1.253GlnArg: 1.253 ± 0.397
2.595GlnSer: 2.595 ± 0.465
2.505GlnThr: 2.505 ± 0.536
3.937GlnVal: 3.937 ± 0.55
0.447GlnTrp: 0.447 ± 0.239
0.537GlnTyr: 0.537 ± 0.196
0.0GlnXaa: 0.0 ± 0.0
Arg
2.863ArgAla: 2.863 ± 0.607
0.537ArgCys: 0.537 ± 0.228
2.147ArgAsp: 2.147 ± 0.49
3.221ArgGlu: 3.221 ± 0.535
1.7ArgPhe: 1.7 ± 0.361
2.416ArgGly: 2.416 ± 0.518
0.626ArgHis: 0.626 ± 0.239
2.952ArgIle: 2.952 ± 0.567
3.758ArgLys: 3.758 ± 0.682
4.384ArgLeu: 4.384 ± 0.612
0.895ArgMet: 0.895 ± 0.332
1.789ArgAsn: 1.789 ± 0.431
1.253ArgPro: 1.253 ± 0.333
2.684ArgGln: 2.684 ± 0.477
2.416ArgArg: 2.416 ± 0.519
2.774ArgSer: 2.774 ± 0.4
2.058ArgThr: 2.058 ± 0.417
4.295ArgVal: 4.295 ± 0.77
0.895ArgTrp: 0.895 ± 0.366
1.7ArgTyr: 1.7 ± 0.486
0.0ArgXaa: 0.0 ± 0.0
Ser
3.937SerAla: 3.937 ± 0.715
0.537SerCys: 0.537 ± 0.233
3.937SerAsp: 3.937 ± 0.534
5.189SerGlu: 5.189 ± 0.636
2.774SerPhe: 2.774 ± 0.706
6.352SerGly: 6.352 ± 0.745
1.879SerHis: 1.879 ± 0.421
4.205SerIle: 4.205 ± 0.763
4.473SerLys: 4.473 ± 0.499
4.563SerLeu: 4.563 ± 0.709
1.253SerMet: 1.253 ± 0.356
3.4SerAsn: 3.4 ± 0.588
2.684SerPro: 2.684 ± 0.547
2.595SerGln: 2.595 ± 0.596
3.042SerArg: 3.042 ± 0.602
5.458SerSer: 5.458 ± 0.773
5.816SerThr: 5.816 ± 0.563
4.563SerVal: 4.563 ± 0.538
1.521SerTrp: 1.521 ± 0.374
2.147SerTyr: 2.147 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
4.205ThrAla: 4.205 ± 0.562
0.447ThrCys: 0.447 ± 0.247
2.952ThrAsp: 2.952 ± 0.514
3.937ThrGlu: 3.937 ± 0.581
2.952ThrPhe: 2.952 ± 0.465
4.295ThrGly: 4.295 ± 0.835
0.716ThrHis: 0.716 ± 0.259
5.01ThrIle: 5.01 ± 0.922
4.831ThrLys: 4.831 ± 0.751
6.889ThrLeu: 6.889 ± 0.626
1.521ThrMet: 1.521 ± 0.349
2.416ThrAsn: 2.416 ± 0.484
2.774ThrPro: 2.774 ± 0.707
1.61ThrGln: 1.61 ± 0.463
2.505ThrArg: 2.505 ± 0.529
4.921ThrSer: 4.921 ± 0.683
3.4ThrThr: 3.4 ± 0.607
5.01ThrVal: 5.01 ± 0.713
1.253ThrTrp: 1.253 ± 0.3
1.789ThrTyr: 1.789 ± 0.267
0.0ThrXaa: 0.0 ± 0.0
Val
5.01ValAla: 5.01 ± 0.688
0.537ValCys: 0.537 ± 0.225
3.668ValAsp: 3.668 ± 0.605
5.905ValGlu: 5.905 ± 0.781
2.774ValPhe: 2.774 ± 0.46
4.742ValGly: 4.742 ± 0.691
1.163ValHis: 1.163 ± 0.276
4.384ValIle: 4.384 ± 0.616
3.221ValLys: 3.221 ± 0.585
6.979ValLeu: 6.979 ± 0.722
1.432ValMet: 1.432 ± 0.31
2.863ValAsn: 2.863 ± 0.598
2.147ValPro: 2.147 ± 0.426
2.147ValGln: 2.147 ± 0.358
2.774ValArg: 2.774 ± 0.595
5.905ValSer: 5.905 ± 0.875
5.189ValThr: 5.189 ± 0.659
5.189ValVal: 5.189 ± 0.791
1.253ValTrp: 1.253 ± 0.371
2.774ValTyr: 2.774 ± 0.608
0.0ValXaa: 0.0 ± 0.0
Trp
0.984TrpAla: 0.984 ± 0.305
0.089TrpCys: 0.089 ± 0.094
0.537TrpAsp: 0.537 ± 0.207
1.163TrpGlu: 1.163 ± 0.253
0.805TrpPhe: 0.805 ± 0.276
0.537TrpGly: 0.537 ± 0.237
0.179TrpHis: 0.179 ± 0.122
0.626TrpIle: 0.626 ± 0.229
0.626TrpLys: 0.626 ± 0.325
1.61TrpLeu: 1.61 ± 0.311
0.268TrpMet: 0.268 ± 0.132
1.432TrpAsn: 1.432 ± 0.339
0.089TrpPro: 0.089 ± 0.078
0.984TrpGln: 0.984 ± 0.235
0.626TrpArg: 0.626 ± 0.282
1.432TrpSer: 1.432 ± 0.395
0.984TrpThr: 0.984 ± 0.345
0.447TrpVal: 0.447 ± 0.224
0.179TrpTrp: 0.179 ± 0.107
0.447TrpTyr: 0.447 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.042TyrAla: 3.042 ± 0.463
0.716TyrCys: 0.716 ± 0.26
2.595TyrAsp: 2.595 ± 0.696
2.058TyrGlu: 2.058 ± 0.281
1.879TyrPhe: 1.879 ± 0.383
2.237TyrGly: 2.237 ± 0.459
0.716TyrHis: 0.716 ± 0.296
1.61TyrIle: 1.61 ± 0.448
2.774TyrLys: 2.774 ± 0.556
2.774TyrLeu: 2.774 ± 0.533
0.447TyrMet: 0.447 ± 0.215
1.432TyrAsn: 1.432 ± 0.477
1.253TyrPro: 1.253 ± 0.29
2.058TyrGln: 2.058 ± 0.479
1.879TyrArg: 1.879 ± 0.445
1.968TyrSer: 1.968 ± 0.496
1.968TyrThr: 1.968 ± 0.425
2.147TyrVal: 2.147 ± 0.465
0.447TyrTrp: 0.447 ± 0.241
1.432TyrTyr: 1.432 ± 0.414
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (11178 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski