Amino acid dipeptide frequency for Streptococcus phage Javan255

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
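The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build the table: it assumes overlapping dipeptides are counted within each protein and normalised by the total number of dipeptides, which the page does not spell out. The names `dipeptide_per_mille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs, never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    # Report all 441 combinations, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille, n_residues=20):
    """Label a frequency relative to the uniform expectation."""
    expected = 1000.0 / n_residues ** 2  # 2.5 per mille for 20 residues
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MKKLA", "MKLAA"]  # made-up sequences, not from the phage proteome
freqs = dipeptide_per_mille(toy)
print(freqs["MK"], classify(freqs["MK"]))
```

With this convention, AlaAla at 3.263‰ would count as overrepresented and AlaCys at 0.489‰ as underrepresented relative to the uniform 2.5‰.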

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
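As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage, so it can be compared with the 0.25% uniform expectation. The helper names are hypothetical; the GluLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


glu_leu = 8.646  # GluLeu frequency from the table, in per mille
uniform_percent = 100.0 / 400  # uniform expectation for 400 dipeptides

print(per_mille_to_fraction(glu_leu))  # fraction of all dipeptides
print(per_mille_to_percent(glu_leu))   # same value in percent
print(uniform_percent)                 # expected percent if uniform
```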

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.263 ± 0.533
AlaCys: 0.489 ± 0.18
AlaAsp: 3.67 ± 0.541
AlaGlu: 5.22 ± 0.535
AlaPhe: 3.181 ± 0.586
AlaGly: 4.894 ± 0.757
AlaHis: 0.979 ± 0.274
AlaIle: 5.22 ± 0.665
AlaLys: 5.628 ± 0.84
AlaLeu: 6.688 ± 1.089
AlaMet: 1.876 ± 0.327
AlaAsn: 3.507 ± 0.402
AlaPro: 1.387 ± 0.305
AlaGln: 3.018 ± 0.54
AlaArg: 3.344 ± 0.408
AlaSer: 4.16 ± 0.707
AlaThr: 3.915 ± 0.585
AlaVal: 4.976 ± 0.691
AlaTrp: 0.653 ± 0.236
AlaTyr: 3.507 ± 0.504
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.408 ± 0.185
CysCys: 0.326 ± 0.161
CysAsp: 0.489 ± 0.248
CysGlu: 0.571 ± 0.19
CysPhe: 0.408 ± 0.184
CysGly: 0.653 ± 0.204
CysHis: 0.163 ± 0.122
CysIle: 0.571 ± 0.209
CysLys: 0.653 ± 0.379
CysLeu: 0.734 ± 0.252
CysMet: 0.163 ± 0.121
CysAsn: 0.408 ± 0.203
CysPro: 0.408 ± 0.187
CysGln: 0.653 ± 0.225
CysArg: 0.571 ± 0.27
CysSer: 0.816 ± 0.365
CysThr: 0.082 ± 0.073
CysVal: 0.653 ± 0.243
CysTrp: 0.082 ± 0.086
CysTyr: 0.979 ± 0.305
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.426 ± 0.628
AspCys: 0.653 ± 0.199
AspAsp: 2.936 ± 0.718
AspGlu: 4.976 ± 0.681
AspPhe: 3.344 ± 0.474
AspGly: 4.731 ± 0.539
AspHis: 1.223 ± 0.337
AspIle: 3.589 ± 0.461
AspLys: 4.323 ± 0.529
AspLeu: 4.568 ± 0.728
AspMet: 1.958 ± 0.457
AspAsn: 1.958 ± 0.358
AspPro: 1.55 ± 0.4
AspGln: 1.794 ± 0.343
AspArg: 2.039 ± 0.531
AspSer: 3.263 ± 0.432
AspThr: 2.773 ± 0.393
AspVal: 3.834 ± 0.496
AspTrp: 0.897 ± 0.236
AspTyr: 2.121 ± 0.464
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.71 ± 0.579
GluCys: 0.571 ± 0.242
GluAsp: 3.344 ± 0.577
GluGlu: 6.77 ± 0.93
GluPhe: 2.692 ± 0.589
GluGly: 4.731 ± 0.539
GluHis: 1.06 ± 0.328
GluIle: 3.997 ± 0.535
GluLys: 7.504 ± 0.94
GluLeu: 8.646 ± 0.969
GluMet: 2.202 ± 0.441
GluAsn: 3.263 ± 0.531
GluPro: 1.794 ± 0.493
GluGln: 4.241 ± 0.567
GluArg: 3.1 ± 0.495
GluSer: 3.263 ± 0.523
GluThr: 5.302 ± 0.693
GluVal: 4.241 ± 0.635
GluTrp: 0.653 ± 0.179
GluTyr: 1.468 ± 0.434
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.365 ± 0.42
PheCys: 0.571 ± 0.226
PheAsp: 2.855 ± 0.493
PheGlu: 2.447 ± 0.501
PhePhe: 1.794 ± 0.507
PheGly: 3.344 ± 0.527
PheHis: 0.897 ± 0.254
PheIle: 2.529 ± 0.453
PheLys: 3.263 ± 0.653
PheLeu: 2.773 ± 0.576
PheMet: 0.897 ± 0.287
PheAsn: 1.794 ± 0.408
PhePro: 0.734 ± 0.261
PheGln: 1.223 ± 0.3
PheArg: 1.876 ± 0.445
PheSer: 2.284 ± 0.322
PheThr: 2.202 ± 0.419
PheVal: 2.284 ± 0.569
PheTrp: 0.653 ± 0.243
PheTyr: 1.713 ± 0.426
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.344 ± 0.538
GlyCys: 0.245 ± 0.16
GlyAsp: 4.405 ± 0.638
GlyGlu: 3.834 ± 0.523
GlyPhe: 2.529 ± 0.396
GlyGly: 4.405 ± 0.598
GlyHis: 1.876 ± 0.42
GlyIle: 5.546 ± 0.893
GlyLys: 5.139 ± 0.673
GlyLeu: 6.117 ± 0.657
GlyMet: 1.713 ± 0.435
GlyAsn: 3.1 ± 0.534
GlyPro: 0.571 ± 0.265
GlyGln: 2.855 ± 0.569
GlyArg: 3.997 ± 0.582
GlySer: 3.344 ± 0.409
GlyThr: 4.405 ± 0.77
GlyVal: 3.997 ± 0.648
GlyTrp: 0.653 ± 0.171
GlyTyr: 2.692 ± 0.395
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.06 ± 0.222
HisCys: 0.163 ± 0.109
HisAsp: 0.816 ± 0.241
HisGlu: 0.979 ± 0.333
HisPhe: 1.142 ± 0.337
HisGly: 1.468 ± 0.283
HisHis: 0.734 ± 0.31
HisIle: 1.223 ± 0.374
HisLys: 0.979 ± 0.3
HisLeu: 1.876 ± 0.326
HisMet: 0.489 ± 0.214
HisAsn: 1.142 ± 0.298
HisPro: 1.142 ± 0.283
HisGln: 1.958 ± 0.528
HisArg: 0.979 ± 0.212
HisSer: 1.06 ± 0.23
HisThr: 0.979 ± 0.361
HisVal: 0.816 ± 0.237
HisTrp: 0.163 ± 0.109
HisTyr: 0.408 ± 0.241
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.731 ± 0.443
IleCys: 0.816 ± 0.254
IleAsp: 4.241 ± 0.502
IleGlu: 4.241 ± 0.55
IlePhe: 1.387 ± 0.386
IleGly: 3.752 ± 0.504
IleHis: 0.816 ± 0.256
IleIle: 3.426 ± 0.672
IleLys: 3.997 ± 0.703
IleLeu: 5.546 ± 0.629
IleMet: 0.979 ± 0.292
IleAsn: 2.039 ± 0.346
IlePro: 1.876 ± 0.433
IleGln: 2.936 ± 0.443
IleArg: 3.181 ± 0.541
IleSer: 5.302 ± 1.043
IleThr: 4.894 ± 1.02
IleVal: 3.589 ± 0.658
IleTrp: 1.06 ± 0.299
IleTyr: 1.142 ± 0.247
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.749 ± 0.799
LysCys: 0.653 ± 0.268
LysAsp: 3.67 ± 0.619
LysGlu: 5.873 ± 0.882
LysPhe: 1.876 ± 0.334
LysGly: 4.078 ± 0.517
LysHis: 1.305 ± 0.314
LysIle: 4.16 ± 0.602
LysLys: 5.546 ± 0.921
LysLeu: 6.688 ± 0.736
LysMet: 1.713 ± 0.408
LysAsn: 2.202 ± 0.395
LysPro: 2.202 ± 0.478
LysGln: 4.078 ± 0.644
LysArg: 4.486 ± 0.774
LysSer: 4.486 ± 0.756
LysThr: 4.894 ± 0.399
LysVal: 5.383 ± 1.02
LysTrp: 1.223 ± 0.365
LysTyr: 2.121 ± 0.599
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.688 ± 1.089
LeuCys: 0.489 ± 0.199
LeuAsp: 5.302 ± 0.65
LeuGlu: 7.83 ± 0.759
LeuPhe: 2.773 ± 0.493
LeuGly: 5.71 ± 0.717
LeuHis: 1.55 ± 0.388
LeuIle: 3.997 ± 0.634
LeuLys: 7.259 ± 0.678
LeuLeu: 7.667 ± 0.772
LeuMet: 2.284 ± 0.486
LeuAsn: 4.078 ± 0.656
LeuPro: 3.752 ± 0.727
LeuGln: 3.1 ± 0.594
LeuArg: 3.507 ± 0.468
LeuSer: 8.238 ± 0.858
LeuThr: 7.423 ± 0.818
LeuVal: 6.77 ± 0.751
LeuTrp: 0.816 ± 0.241
LeuTyr: 3.915 ± 0.864
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.284 ± 0.407
MetCys: 0.082 ± 0.072
MetAsp: 1.223 ± 0.375
MetGlu: 1.876 ± 0.581
MetPhe: 0.816 ± 0.28
MetGly: 2.121 ± 0.352
MetHis: 0.163 ± 0.098
MetIle: 1.468 ± 0.325
MetLys: 1.876 ± 0.431
MetLeu: 1.387 ± 0.364
MetMet: 0.571 ± 0.213
MetAsn: 0.571 ± 0.242
MetPro: 0.326 ± 0.156
MetGln: 0.653 ± 0.262
MetArg: 1.142 ± 0.275
MetSer: 1.631 ± 0.358
MetThr: 2.447 ± 0.451
MetVal: 1.55 ± 0.444
MetTrp: 0.082 ± 0.077
MetTyr: 0.326 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.67 ± 0.629
AsnCys: 0.489 ± 0.267
AsnAsp: 2.365 ± 0.391
AsnGlu: 3.018 ± 0.584
AsnPhe: 1.631 ± 0.421
AsnGly: 4.078 ± 0.619
AsnHis: 1.55 ± 0.334
AsnIle: 1.958 ± 0.405
AsnLys: 1.876 ± 0.318
AsnLeu: 4.241 ± 0.486
AsnMet: 0.571 ± 0.2
AsnAsn: 1.468 ± 0.431
AsnPro: 2.039 ± 0.386
AsnGln: 1.958 ± 0.362
AsnArg: 2.365 ± 0.531
AsnSer: 2.692 ± 0.578
AsnThr: 3.181 ± 0.742
AsnVal: 2.284 ± 0.518
AsnTrp: 0.816 ± 0.275
AsnTyr: 0.897 ± 0.295
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.979 ± 0.214
ProCys: 0.408 ± 0.184
ProAsp: 1.713 ± 0.455
ProGlu: 1.958 ± 0.475
ProPhe: 1.387 ± 0.29
ProGly: 0.734 ± 0.289
ProHis: 0.979 ± 0.24
ProIle: 1.55 ± 0.394
ProLys: 2.773 ± 0.494
ProLeu: 2.773 ± 0.454
ProMet: 0.489 ± 0.15
ProAsn: 1.468 ± 0.424
ProPro: 0.979 ± 0.305
ProGln: 1.55 ± 0.456
ProArg: 1.223 ± 0.243
ProSer: 2.855 ± 0.518
ProThr: 2.447 ± 0.495
ProVal: 2.284 ± 0.45
ProTrp: 0.408 ± 0.178
ProTyr: 1.305 ± 0.406
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.323 ± 0.65
GlnCys: 0.571 ± 0.23
GlnAsp: 1.631 ± 0.31
GlnGlu: 4.323 ± 0.608
GlnPhe: 1.876 ± 0.379
GlnGly: 2.365 ± 0.415
GlnHis: 0.489 ± 0.212
GlnIle: 2.855 ± 0.678
GlnLys: 2.773 ± 0.519
GlnLeu: 4.812 ± 0.498
GlnMet: 1.06 ± 0.267
GlnAsn: 2.365 ± 0.295
GlnPro: 1.713 ± 0.444
GlnGln: 2.447 ± 0.52
GlnArg: 1.631 ± 0.375
GlnSer: 2.692 ± 0.42
GlnThr: 3.426 ± 0.806
GlnVal: 3.915 ± 0.481
GlnTrp: 0.734 ± 0.307
GlnTyr: 0.897 ± 0.279
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.773 ± 0.589
ArgCys: 1.223 ± 0.36
ArgAsp: 2.039 ± 0.446
ArgGlu: 3.507 ± 0.51
ArgPhe: 1.794 ± 0.314
ArgGly: 2.855 ± 0.388
ArgHis: 0.734 ± 0.225
ArgIle: 2.121 ± 0.481
ArgLys: 4.323 ± 1.015
ArgLeu: 5.628 ± 0.615
ArgMet: 0.816 ± 0.198
ArgAsn: 1.958 ± 0.426
ArgPro: 1.55 ± 0.266
ArgGln: 3.507 ± 0.503
ArgArg: 1.958 ± 0.484
ArgSer: 2.447 ± 0.436
ArgThr: 2.936 ± 0.473
ArgVal: 2.855 ± 0.57
ArgTrp: 0.897 ± 0.234
ArgTyr: 1.713 ± 0.418
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.915 ± 0.929
SerCys: 0.408 ± 0.151
SerAsp: 4.241 ± 0.699
SerGlu: 4.486 ± 0.588
SerPhe: 2.855 ± 0.609
SerGly: 3.997 ± 0.696
SerHis: 1.794 ± 0.383
SerIle: 4.731 ± 0.667
SerLys: 5.139 ± 0.871
SerLeu: 6.199 ± 0.716
SerMet: 1.223 ± 0.43
SerAsn: 3.1 ± 0.502
SerPro: 3.1 ± 0.49
SerGln: 2.692 ± 0.404
SerArg: 2.855 ± 0.511
SerSer: 5.873 ± 1.039
SerThr: 4.649 ± 0.753
SerVal: 3.834 ± 0.56
SerTrp: 1.223 ± 0.227
SerTyr: 2.365 ± 0.454
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.302 ± 0.732
ThrCys: 0.326 ± 0.186
ThrAsp: 3.67 ± 0.62
ThrGlu: 3.997 ± 0.596
ThrPhe: 2.61 ± 0.568
ThrGly: 4.241 ± 0.51
ThrHis: 0.979 ± 0.305
ThrIle: 3.997 ± 0.808
ThrLys: 4.976 ± 0.625
ThrLeu: 6.444 ± 0.874
ThrMet: 1.06 ± 0.289
ThrAsn: 3.018 ± 0.754
ThrPro: 2.039 ± 0.424
ThrGln: 3.344 ± 0.872
ThrArg: 2.61 ± 0.513
ThrSer: 5.628 ± 1.253
ThrThr: 4.976 ± 1.055
ThrVal: 6.036 ± 0.736
ThrTrp: 1.142 ± 0.396
ThrTyr: 2.529 ± 0.852
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.568 ± 0.796
ValCys: 0.245 ± 0.201
ValAsp: 3.426 ± 0.574
ValGlu: 4.976 ± 0.775
ValPhe: 2.365 ± 0.523
ValGly: 3.67 ± 0.517
ValHis: 1.06 ± 0.242
ValIle: 4.323 ± 0.663
ValLys: 4.486 ± 0.661
ValLeu: 6.525 ± 0.695
ValMet: 1.142 ± 0.31
ValAsn: 2.855 ± 0.607
ValPro: 2.121 ± 0.447
ValGln: 1.958 ± 0.269
ValArg: 4.078 ± 0.518
ValSer: 4.976 ± 0.679
ValThr: 4.812 ± 0.537
ValVal: 3.018 ± 0.59
ValTrp: 1.142 ± 0.333
ValTyr: 2.447 ± 0.562
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.979 ± 0.302
TrpCys: 0.326 ± 0.162
TrpAsp: 0.653 ± 0.24
TrpGlu: 1.223 ± 0.333
TrpPhe: 0.897 ± 0.288
TrpGly: 0.734 ± 0.255
TrpHis: 0.326 ± 0.154
TrpIle: 0.571 ± 0.229
TrpLys: 0.326 ± 0.199
TrpLeu: 0.897 ± 0.22
TrpMet: 0.571 ± 0.231
TrpAsn: 1.387 ± 0.362
TrpPro: 0.082 ± 0.078
TrpGln: 0.734 ± 0.292
TrpArg: 0.734 ± 0.354
TrpSer: 1.142 ± 0.358
TrpThr: 0.979 ± 0.299
TrpVal: 0.653 ± 0.206
TrpTrp: 0.245 ± 0.131
TrpTyr: 0.245 ± 0.121
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.039 ± 0.345
TyrCys: 0.734 ± 0.311
TyrAsp: 3.263 ± 0.708
TyrGlu: 2.447 ± 0.369
TyrPhe: 1.142 ± 0.351
TyrGly: 2.202 ± 0.378
TyrHis: 0.897 ± 0.289
TyrIle: 2.121 ± 0.611
TyrLys: 1.55 ± 0.313
TyrLeu: 2.936 ± 0.53
TyrMet: 0.734 ± 0.278
TyrAsn: 1.55 ± 0.432
TyrPro: 0.897 ± 0.268
TyrGln: 2.202 ± 0.549
TyrArg: 1.958 ± 0.41
TyrSer: 2.529 ± 0.576
TyrThr: 2.202 ± 0.566
TyrVal: 1.223 ± 0.263
TyrTrp: 0.163 ± 0.121
TyrTyr: 0.979 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (12261 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
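A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates (the page does not say which spread measure is reported). The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKKLA", "MKLAA", "GGAKK", "AKAKA"]  # made-up sequences
print(dipeptide_per_mille(toy, "AK"), bootstrap_error(toy, "AK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.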

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski