Amino acid dipepetide frequency for Streptococcus phage Javan558

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.739AlaAla: 3.739 ± 1.048
0.499AlaCys: 0.499 ± 0.189
3.906AlaAsp: 3.906 ± 0.506
4.072AlaGlu: 4.072 ± 0.505
2.992AlaPhe: 2.992 ± 0.361
5.069AlaGly: 5.069 ± 0.663
0.665AlaHis: 0.665 ± 0.284
5.9AlaIle: 5.9 ± 0.735
5.734AlaLys: 5.734 ± 0.638
5.484AlaLeu: 5.484 ± 0.797
1.579AlaMet: 1.579 ± 0.385
3.407AlaAsn: 3.407 ± 0.583
1.413AlaPro: 1.413 ± 0.29
3.158AlaGln: 3.158 ± 0.782
3.075AlaArg: 3.075 ± 0.481
5.318AlaSer: 5.318 ± 0.949
4.57AlaThr: 4.57 ± 0.626
3.407AlaVal: 3.407 ± 0.555
0.831AlaTrp: 0.831 ± 0.306
3.324AlaTyr: 3.324 ± 0.601
0.0AlaXaa: 0.0 ± 0.0
Cys
0.415CysAla: 0.415 ± 0.194
0.166CysCys: 0.166 ± 0.106
0.332CysAsp: 0.332 ± 0.159
0.665CysGlu: 0.665 ± 0.201
0.415CysPhe: 0.415 ± 0.189
0.914CysGly: 0.914 ± 0.219
0.166CysHis: 0.166 ± 0.117
0.415CysIle: 0.415 ± 0.212
0.415CysLys: 0.415 ± 0.215
0.831CysLeu: 0.831 ± 0.276
0.083CysMet: 0.083 ± 0.09
0.332CysAsn: 0.332 ± 0.161
0.332CysPro: 0.332 ± 0.175
0.582CysGln: 0.582 ± 0.182
0.249CysArg: 0.249 ± 0.197
0.415CysSer: 0.415 ± 0.245
0.166CysThr: 0.166 ± 0.13
0.582CysVal: 0.582 ± 0.241
0.0CysTrp: 0.0 ± 0.0
0.582CysTyr: 0.582 ± 0.229
0.0CysXaa: 0.0 ± 0.0
Asp
3.075AspAla: 3.075 ± 0.509
0.582AspCys: 0.582 ± 0.237
3.324AspAsp: 3.324 ± 0.705
4.737AspGlu: 4.737 ± 0.885
3.324AspPhe: 3.324 ± 0.424
4.903AspGly: 4.903 ± 0.655
1.08AspHis: 1.08 ± 0.325
3.906AspIle: 3.906 ± 0.469
4.238AspLys: 4.238 ± 0.39
5.318AspLeu: 5.318 ± 0.854
2.077AspMet: 2.077 ± 0.44
2.244AspAsn: 2.244 ± 0.427
1.662AspPro: 1.662 ± 0.495
1.496AspGln: 1.496 ± 0.339
2.742AspArg: 2.742 ± 0.54
3.823AspSer: 3.823 ± 0.655
2.908AspThr: 2.908 ± 0.495
3.407AspVal: 3.407 ± 0.507
1.08AspTrp: 1.08 ± 0.266
2.825AspTyr: 2.825 ± 0.652
0.0AspXaa: 0.0 ± 0.0
Glu
5.152GluAla: 5.152 ± 0.569
0.499GluCys: 0.499 ± 0.219
4.82GluAsp: 4.82 ± 0.83
6.066GluGlu: 6.066 ± 1.05
1.994GluPhe: 1.994 ± 0.477
4.487GluGly: 4.487 ± 0.517
0.831GluHis: 0.831 ± 0.342
4.155GluIle: 4.155 ± 0.478
6.399GluLys: 6.399 ± 1.052
7.728GluLeu: 7.728 ± 0.851
2.327GluMet: 2.327 ± 0.517
4.072GluAsn: 4.072 ± 0.6
1.496GluPro: 1.496 ± 0.469
4.487GluGln: 4.487 ± 0.556
2.825GluArg: 2.825 ± 0.489
3.407GluSer: 3.407 ± 0.53
4.82GluThr: 4.82 ± 0.65
3.656GluVal: 3.656 ± 0.578
0.831GluTrp: 0.831 ± 0.273
1.745GluTyr: 1.745 ± 0.369
0.0GluXaa: 0.0 ± 0.0
Phe
2.244PheAla: 2.244 ± 0.516
0.332PheCys: 0.332 ± 0.159
2.908PheAsp: 2.908 ± 0.529
2.742PheGlu: 2.742 ± 0.553
1.745PhePhe: 1.745 ± 0.379
2.992PheGly: 2.992 ± 0.495
0.831PheHis: 0.831 ± 0.266
1.579PheIle: 1.579 ± 0.442
3.656PheLys: 3.656 ± 0.822
2.742PheLeu: 2.742 ± 0.532
0.665PheMet: 0.665 ± 0.246
1.828PheAsn: 1.828 ± 0.317
0.582PhePro: 0.582 ± 0.244
1.08PheGln: 1.08 ± 0.3
1.994PheArg: 1.994 ± 0.368
2.327PheSer: 2.327 ± 0.468
1.911PheThr: 1.911 ± 0.369
2.077PheVal: 2.077 ± 0.413
0.582PheTrp: 0.582 ± 0.261
2.077PheTyr: 2.077 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
3.656GlyAla: 3.656 ± 0.592
0.332GlyCys: 0.332 ± 0.159
4.487GlyAsp: 4.487 ± 0.583
3.739GlyGlu: 3.739 ± 0.563
2.576GlyPhe: 2.576 ± 0.409
4.155GlyGly: 4.155 ± 0.77
1.828GlyHis: 1.828 ± 0.47
5.817GlyIle: 5.817 ± 0.657
5.069GlyLys: 5.069 ± 0.585
6.399GlyLeu: 6.399 ± 0.875
1.828GlyMet: 1.828 ± 0.408
3.739GlyAsn: 3.739 ± 0.601
0.831GlyPro: 0.831 ± 0.231
3.158GlyGln: 3.158 ± 0.463
4.155GlyArg: 4.155 ± 0.444
3.739GlySer: 3.739 ± 0.526
3.823GlyThr: 3.823 ± 0.688
4.238GlyVal: 4.238 ± 0.654
0.582GlyTrp: 0.582 ± 0.189
2.742GlyTyr: 2.742 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
0.997HisAla: 0.997 ± 0.196
0.083HisCys: 0.083 ± 0.085
0.997HisAsp: 0.997 ± 0.275
1.163HisGlu: 1.163 ± 0.341
0.914HisPhe: 0.914 ± 0.3
1.579HisGly: 1.579 ± 0.332
0.665HisHis: 0.665 ± 0.238
1.33HisIle: 1.33 ± 0.27
0.748HisLys: 0.748 ± 0.266
2.077HisLeu: 2.077 ± 0.322
0.332HisMet: 0.332 ± 0.177
0.914HisAsn: 0.914 ± 0.293
1.33HisPro: 1.33 ± 0.346
1.163HisGln: 1.163 ± 0.381
0.831HisArg: 0.831 ± 0.221
0.665HisSer: 0.665 ± 0.215
1.08HisThr: 1.08 ± 0.326
1.08HisVal: 1.08 ± 0.272
0.332HisTrp: 0.332 ± 0.149
0.582HisTyr: 0.582 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.069IleAla: 5.069 ± 0.477
0.332IleCys: 0.332 ± 0.154
4.903IleAsp: 4.903 ± 0.499
3.823IleGlu: 3.823 ± 0.567
1.33IlePhe: 1.33 ± 0.367
4.653IleGly: 4.653 ± 0.588
0.748IleHis: 0.748 ± 0.211
3.158IleIle: 3.158 ± 0.47
4.737IleLys: 4.737 ± 0.544
4.986IleLeu: 4.986 ± 0.702
1.08IleMet: 1.08 ± 0.278
2.742IleAsn: 2.742 ± 0.38
2.327IlePro: 2.327 ± 0.363
2.992IleGln: 2.992 ± 0.399
3.158IleArg: 3.158 ± 0.611
4.986IleSer: 4.986 ± 0.843
5.318IleThr: 5.318 ± 0.92
4.903IleVal: 4.903 ± 0.78
1.163IleTrp: 1.163 ± 0.336
2.161IleTyr: 2.161 ± 0.436
0.0IleXaa: 0.0 ± 0.0
Lys
6.482LysAla: 6.482 ± 0.683
0.499LysCys: 0.499 ± 0.195
3.823LysAsp: 3.823 ± 0.568
5.318LysGlu: 5.318 ± 0.629
1.911LysPhe: 1.911 ± 0.34
4.82LysGly: 4.82 ± 0.678
1.994LysHis: 1.994 ± 0.328
4.737LysIle: 4.737 ± 0.481
4.487LysLys: 4.487 ± 0.679
6.814LysLeu: 6.814 ± 0.785
1.496LysMet: 1.496 ± 0.356
3.075LysAsn: 3.075 ± 0.534
2.161LysPro: 2.161 ± 0.407
4.072LysGln: 4.072 ± 0.632
4.155LysArg: 4.155 ± 0.568
4.321LysSer: 4.321 ± 0.675
4.072LysThr: 4.072 ± 0.565
4.986LysVal: 4.986 ± 0.736
1.163LysTrp: 1.163 ± 0.313
2.41LysTyr: 2.41 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
6.482LeuAla: 6.482 ± 0.831
0.332LeuCys: 0.332 ± 0.19
5.401LeuAsp: 5.401 ± 0.647
7.063LeuGlu: 7.063 ± 1.125
2.244LeuPhe: 2.244 ± 0.464
5.069LeuGly: 5.069 ± 0.631
1.579LeuHis: 1.579 ± 0.317
4.986LeuIle: 4.986 ± 0.501
6.98LeuLys: 6.98 ± 0.639
7.063LeuLeu: 7.063 ± 1.054
2.327LeuMet: 2.327 ± 0.486
4.737LeuAsn: 4.737 ± 0.709
3.49LeuPro: 3.49 ± 0.585
4.072LeuGln: 4.072 ± 0.629
3.324LeuArg: 3.324 ± 0.537
7.23LeuSer: 7.23 ± 0.784
6.731LeuThr: 6.731 ± 0.693
6.399LeuVal: 6.399 ± 0.981
0.748LeuTrp: 0.748 ± 0.278
3.823LeuTyr: 3.823 ± 0.691
0.0LeuXaa: 0.0 ± 0.0
Met
1.828MetAla: 1.828 ± 0.371
0.083MetCys: 0.083 ± 0.085
1.496MetAsp: 1.496 ± 0.361
1.662MetGlu: 1.662 ± 0.416
0.831MetPhe: 0.831 ± 0.266
1.496MetGly: 1.496 ± 0.433
0.166MetHis: 0.166 ± 0.147
1.33MetIle: 1.33 ± 0.32
1.745MetLys: 1.745 ± 0.391
0.997MetLeu: 0.997 ± 0.262
0.914MetMet: 0.914 ± 0.348
0.748MetAsn: 0.748 ± 0.259
0.332MetPro: 0.332 ± 0.17
0.831MetGln: 0.831 ± 0.314
1.579MetArg: 1.579 ± 0.406
1.994MetSer: 1.994 ± 0.512
1.911MetThr: 1.911 ± 0.481
2.077MetVal: 2.077 ± 0.456
0.083MetTrp: 0.083 ± 0.077
0.582MetTyr: 0.582 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
4.737AsnAla: 4.737 ± 0.69
0.249AsnCys: 0.249 ± 0.136
2.493AsnAsp: 2.493 ± 0.421
2.742AsnGlu: 2.742 ± 0.508
1.828AsnPhe: 1.828 ± 0.366
4.653AsnGly: 4.653 ± 0.603
1.08AsnHis: 1.08 ± 0.247
2.576AsnIle: 2.576 ± 0.427
2.992AsnLys: 2.992 ± 0.521
4.737AsnLeu: 4.737 ± 0.925
0.665AsnMet: 0.665 ± 0.234
1.994AsnAsn: 1.994 ± 0.48
1.994AsnPro: 1.994 ± 0.417
1.828AsnGln: 1.828 ± 0.335
2.161AsnArg: 2.161 ± 0.402
2.825AsnSer: 2.825 ± 0.547
2.077AsnThr: 2.077 ± 0.474
2.41AsnVal: 2.41 ± 0.494
0.914AsnTrp: 0.914 ± 0.288
1.163AsnTyr: 1.163 ± 0.242
0.0AsnXaa: 0.0 ± 0.0
Pro
1.163ProAla: 1.163 ± 0.299
0.499ProCys: 0.499 ± 0.166
1.911ProAsp: 1.911 ± 0.377
2.077ProGlu: 2.077 ± 0.493
1.08ProPhe: 1.08 ± 0.332
0.997ProGly: 0.997 ± 0.378
0.748ProHis: 0.748 ± 0.2
2.077ProIle: 2.077 ± 0.461
2.825ProLys: 2.825 ± 0.626
2.742ProLeu: 2.742 ± 0.394
0.332ProMet: 0.332 ± 0.156
1.413ProAsn: 1.413 ± 0.311
1.08ProPro: 1.08 ± 0.374
0.914ProGln: 0.914 ± 0.341
1.579ProArg: 1.579 ± 0.344
2.659ProSer: 2.659 ± 0.513
1.994ProThr: 1.994 ± 0.474
2.244ProVal: 2.244 ± 0.429
0.415ProTrp: 0.415 ± 0.167
1.413ProTyr: 1.413 ± 0.381
0.0ProXaa: 0.0 ± 0.0
Gln
4.238GlnAla: 4.238 ± 0.682
0.415GlnCys: 0.415 ± 0.175
2.493GlnAsp: 2.493 ± 0.515
3.656GlnGlu: 3.656 ± 0.661
2.161GlnPhe: 2.161 ± 0.404
2.244GlnGly: 2.244 ± 0.441
0.831GlnHis: 0.831 ± 0.288
2.908GlnIle: 2.908 ± 0.446
2.576GlnLys: 2.576 ± 0.477
4.903GlnLeu: 4.903 ± 0.576
1.246GlnMet: 1.246 ± 0.313
2.077GlnAsn: 2.077 ± 0.423
1.745GlnPro: 1.745 ± 0.416
2.244GlnGln: 2.244 ± 0.391
1.579GlnArg: 1.579 ± 0.466
2.908GlnSer: 2.908 ± 0.516
3.324GlnThr: 3.324 ± 0.905
3.989GlnVal: 3.989 ± 0.63
0.665GlnTrp: 0.665 ± 0.286
0.582GlnTyr: 0.582 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
2.244ArgAla: 2.244 ± 0.462
0.831ArgCys: 0.831 ± 0.23
2.161ArgAsp: 2.161 ± 0.487
3.324ArgGlu: 3.324 ± 0.44
1.828ArgPhe: 1.828 ± 0.533
2.576ArgGly: 2.576 ± 0.495
0.748ArgHis: 0.748 ± 0.208
3.075ArgIle: 3.075 ± 0.568
3.656ArgLys: 3.656 ± 0.718
5.069ArgLeu: 5.069 ± 0.702
0.582ArgMet: 0.582 ± 0.238
2.742ArgAsn: 2.742 ± 0.56
1.246ArgPro: 1.246 ± 0.288
2.992ArgGln: 2.992 ± 0.391
2.244ArgArg: 2.244 ± 0.536
2.327ArgSer: 2.327 ± 0.362
3.656ArgThr: 3.656 ± 0.706
3.49ArgVal: 3.49 ± 0.546
1.163ArgTrp: 1.163 ± 0.27
1.745ArgTyr: 1.745 ± 0.471
0.0ArgXaa: 0.0 ± 0.0
Ser
3.49SerAla: 3.49 ± 0.657
0.582SerCys: 0.582 ± 0.343
4.238SerAsp: 4.238 ± 0.561
4.321SerGlu: 4.321 ± 0.606
2.41SerPhe: 2.41 ± 0.711
4.903SerGly: 4.903 ± 0.614
1.662SerHis: 1.662 ± 0.286
4.82SerIle: 4.82 ± 0.746
4.653SerLys: 4.653 ± 0.561
5.651SerLeu: 5.651 ± 0.665
1.08SerMet: 1.08 ± 0.223
2.576SerAsn: 2.576 ± 0.6
2.41SerPro: 2.41 ± 0.36
3.823SerGln: 3.823 ± 0.787
2.493SerArg: 2.493 ± 0.484
5.484SerSer: 5.484 ± 1.042
4.57SerThr: 4.57 ± 0.555
4.653SerVal: 4.653 ± 0.462
1.08SerTrp: 1.08 ± 0.243
2.327SerTyr: 2.327 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
5.318ThrAla: 5.318 ± 0.562
0.166ThrCys: 0.166 ± 0.133
2.742ThrAsp: 2.742 ± 0.579
4.986ThrGlu: 4.986 ± 0.653
2.742ThrPhe: 2.742 ± 0.645
4.487ThrGly: 4.487 ± 0.851
0.665ThrHis: 0.665 ± 0.211
5.152ThrIle: 5.152 ± 0.747
4.57ThrLys: 4.57 ± 0.518
5.817ThrLeu: 5.817 ± 0.561
1.163ThrMet: 1.163 ± 0.26
2.493ThrAsn: 2.493 ± 0.512
1.911ThrPro: 1.911 ± 0.476
2.825ThrGln: 2.825 ± 0.876
2.825ThrArg: 2.825 ± 0.582
5.235ThrSer: 5.235 ± 1.145
5.983ThrThr: 5.983 ± 0.95
5.484ThrVal: 5.484 ± 0.774
0.831ThrTrp: 0.831 ± 0.227
2.077ThrTyr: 2.077 ± 0.488
0.0ThrXaa: 0.0 ± 0.0
Val
4.404ValAla: 4.404 ± 0.64
0.748ValCys: 0.748 ± 0.293
3.324ValAsp: 3.324 ± 0.556
5.568ValGlu: 5.568 ± 0.753
2.161ValPhe: 2.161 ± 0.464
3.739ValGly: 3.739 ± 0.691
1.163ValHis: 1.163 ± 0.274
4.072ValIle: 4.072 ± 0.64
4.653ValLys: 4.653 ± 0.582
6.232ValLeu: 6.232 ± 0.595
1.745ValMet: 1.745 ± 0.394
2.161ValAsn: 2.161 ± 0.382
2.659ValPro: 2.659 ± 0.406
2.244ValGln: 2.244 ± 0.326
3.989ValArg: 3.989 ± 0.679
4.487ValSer: 4.487 ± 0.871
4.986ValThr: 4.986 ± 0.538
3.324ValVal: 3.324 ± 0.514
0.997ValTrp: 0.997 ± 0.323
2.244ValTyr: 2.244 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
0.997TrpAla: 0.997 ± 0.269
0.083TrpCys: 0.083 ± 0.097
0.499TrpAsp: 0.499 ± 0.171
1.246TrpGlu: 1.246 ± 0.289
0.831TrpPhe: 0.831 ± 0.291
0.831TrpGly: 0.831 ± 0.183
0.332TrpHis: 0.332 ± 0.148
0.665TrpIle: 0.665 ± 0.211
0.665TrpLys: 0.665 ± 0.281
1.163TrpLeu: 1.163 ± 0.318
0.415TrpMet: 0.415 ± 0.171
1.33TrpAsn: 1.33 ± 0.35
0.083TrpPro: 0.083 ± 0.067
0.582TrpGln: 0.582 ± 0.215
0.582TrpArg: 0.582 ± 0.257
0.997TrpSer: 0.997 ± 0.336
1.33TrpThr: 1.33 ± 0.401
0.997TrpVal: 0.997 ± 0.26
0.166TrpTrp: 0.166 ± 0.104
0.249TrpTyr: 0.249 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.41TyrAla: 2.41 ± 0.433
0.748TyrCys: 0.748 ± 0.27
2.576TyrAsp: 2.576 ± 0.619
2.825TyrGlu: 2.825 ± 0.55
1.579TyrPhe: 1.579 ± 0.432
2.327TyrGly: 2.327 ± 0.503
1.08TyrHis: 1.08 ± 0.333
1.911TyrIle: 1.911 ± 0.398
1.994TyrLys: 1.994 ± 0.496
3.324TyrLeu: 3.324 ± 0.567
0.665TyrMet: 0.665 ± 0.283
1.496TyrAsn: 1.496 ± 0.411
1.08TyrPro: 1.08 ± 0.227
2.244TyrGln: 2.244 ± 0.379
2.161TyrArg: 2.161 ± 0.434
2.077TyrSer: 2.077 ± 0.483
2.244TyrThr: 2.244 ± 0.454
1.496TyrVal: 1.496 ± 0.385
0.415TyrTrp: 0.415 ± 0.163
1.413TyrTyr: 1.413 ± 0.491
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (12035 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski