Amino acid dipeptide frequency for Streptococcus phage Javan172

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
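The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the mapping of non-standard residues to X, the counting of overlapping pairs within each protein, and the use of the uniform 2.5‰ expectation as the red/blue threshold are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency of each dipeptide if all 400 were equally likely, in per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    normalised by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency as over- ("red") or under-represented ("blue")."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "none"


if __name__ == "__main__":
    toy = ["MKLAEELK", "MAEKLLE"]  # hypothetical toy sequences
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the values from the table, AlaAla (3.551‰) would be labelled red and AlaCys (0.433‰) blue under this assumed threshold.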

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
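As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


if __name__ == "__main__":
    # AlaAla from the table: 3.551 per mille
    print(per_mille_to_fraction(3.551))
    print(per_mille_to_percent(3.551))
```

In these units the uniform expectation of 0.25% corresponds to 2.5‰, so table values can be compared against 2.5 directly.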

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.551 ± 1.005
AlaCys: 0.433 ± 0.168
AlaAsp: 3.637 ± 0.597
AlaGlu: 5.196 ± 0.809
AlaPhe: 2.425 ± 0.433
AlaGly: 4.07 ± 0.799
AlaHis: 0.606 ± 0.278
AlaIle: 5.716 ± 0.581
AlaLys: 5.976 ± 0.946
AlaLeu: 6.668 ± 1.41
AlaMet: 1.472 ± 0.374
AlaAsn: 5.11 ± 0.835
AlaPro: 1.126 ± 0.397
AlaGln: 3.291 ± 0.653
AlaArg: 2.598 ± 0.468
AlaSer: 5.023 ± 1.007
AlaThr: 2.771 ± 0.367
AlaVal: 3.897 ± 0.577
AlaTrp: 0.346 ± 0.166
AlaTyr: 1.992 ± 0.396
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.346 ± 0.181
CysCys: 0.0 ± 0.0
CysAsp: 0.173 ± 0.129
CysGlu: 0.52 ± 0.211
CysPhe: 0.173 ± 0.119
CysGly: 0.606 ± 0.236
CysHis: 0.173 ± 0.122
CysIle: 0.866 ± 0.241
CysLys: 0.52 ± 0.213
CysLeu: 0.26 ± 0.145
CysMet: 0.087 ± 0.088
CysAsn: 0.346 ± 0.182
CysPro: 0.26 ± 0.142
CysGln: 0.26 ± 0.2
CysArg: 0.346 ± 0.186
CysSer: 0.26 ± 0.134
CysThr: 0.087 ± 0.089
CysVal: 0.433 ± 0.217
CysTrp: 0.173 ± 0.112
CysTyr: 0.346 ± 0.191
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.897 ± 0.574
AspCys: 0.606 ± 0.213
AspAsp: 2.944 ± 0.548
AspGlu: 5.11 ± 0.794
AspPhe: 3.291 ± 0.402
AspGly: 5.11 ± 0.827
AspHis: 0.433 ± 0.186
AspIle: 4.417 ± 0.848
AspLys: 6.582 ± 0.609
AspLeu: 5.889 ± 0.788
AspMet: 1.645 ± 0.402
AspAsn: 3.464 ± 0.516
AspPro: 1.472 ± 0.396
AspGln: 1.819 ± 0.314
AspArg: 1.299 ± 0.399
AspSer: 4.244 ± 0.508
AspThr: 3.637 ± 0.654
AspVal: 3.724 ± 0.451
AspTrp: 1.472 ± 0.308
AspTyr: 3.984 ± 0.614
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.976 ± 1.148
GluCys: 0.52 ± 0.225
GluAsp: 3.551 ± 0.466
GluGlu: 7.361 ± 1.095
GluPhe: 2.858 ± 0.476
GluGly: 3.031 ± 0.47
GluHis: 0.953 ± 0.23
GluIle: 7.101 ± 0.803
GluLys: 7.534 ± 0.832
GluLeu: 9.526 ± 0.927
GluMet: 1.905 ± 0.48
GluAsn: 3.464 ± 0.53
GluPro: 1.905 ± 0.441
GluGln: 3.291 ± 0.781
GluArg: 3.464 ± 0.473
GluSer: 4.59 ± 0.608
GluThr: 4.417 ± 0.48
GluVal: 5.716 ± 0.867
GluTrp: 1.039 ± 0.308
GluTyr: 2.078 ± 0.441
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.252 ± 0.389
PheCys: 0.346 ± 0.173
PheAsp: 3.811 ± 0.543
PheGlu: 3.464 ± 0.616
PhePhe: 1.732 ± 0.421
PheGly: 3.118 ± 0.554
PheHis: 0.52 ± 0.225
PheIle: 3.118 ± 0.418
PheLys: 3.897 ± 0.503
PheLeu: 2.425 ± 0.456
PheMet: 0.953 ± 0.264
PheAsn: 2.165 ± 0.477
PhePro: 0.866 ± 0.331
PheGln: 1.126 ± 0.241
PheArg: 1.386 ± 0.288
PheSer: 1.992 ± 0.466
PheThr: 2.598 ± 0.335
PheVal: 2.771 ± 0.453
PheTrp: 0.346 ± 0.179
PheTyr: 1.472 ± 0.324
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.858 ± 0.614
GlyCys: 0.346 ± 0.177
GlyAsp: 4.07 ± 0.746
GlyGlu: 4.677 ± 0.673
GlyPhe: 2.252 ± 0.552
GlyGly: 2.944 ± 0.39
GlyHis: 0.779 ± 0.214
GlyIle: 5.543 ± 0.696
GlyLys: 6.409 ± 0.737
GlyLeu: 6.322 ± 0.827
GlyMet: 1.819 ± 0.357
GlyAsn: 3.031 ± 0.49
GlyPro: 2.511 ± 1.338
GlyGln: 2.338 ± 0.536
GlyArg: 2.771 ± 0.376
GlySer: 2.338 ± 0.55
GlyThr: 3.378 ± 0.719
GlyVal: 3.291 ± 0.676
GlyTrp: 0.953 ± 0.424
GlyTyr: 2.685 ± 0.647
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.433 ± 0.232
HisCys: 0.087 ± 0.088
HisAsp: 1.126 ± 0.372
HisGlu: 1.386 ± 0.382
HisPhe: 0.779 ± 0.274
HisGly: 1.039 ± 0.232
HisHis: 0.433 ± 0.201
HisIle: 0.866 ± 0.309
HisLys: 0.866 ± 0.258
HisLeu: 0.606 ± 0.198
HisMet: 0.693 ± 0.248
HisAsn: 0.866 ± 0.289
HisPro: 0.953 ± 0.258
HisGln: 0.779 ± 0.276
HisArg: 0.606 ± 0.202
HisSer: 0.606 ± 0.247
HisThr: 0.779 ± 0.258
HisVal: 0.346 ± 0.16
HisTrp: 0.173 ± 0.111
HisTyr: 0.606 ± 0.307
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.677 ± 0.636
IleCys: 0.433 ± 0.178
IleAsp: 6.582 ± 0.606
IleGlu: 7.101 ± 0.745
IlePhe: 2.858 ± 0.459
IleGly: 4.85 ± 0.602
IleHis: 0.693 ± 0.249
IleIle: 3.464 ± 0.479
IleLys: 8.054 ± 1.039
IleLeu: 4.244 ± 0.681
IleMet: 1.299 ± 0.342
IleAsn: 5.023 ± 0.738
IlePro: 2.252 ± 0.422
IleGln: 1.732 ± 0.335
IleArg: 2.858 ± 0.386
IleSer: 4.33 ± 0.43
IleThr: 4.157 ± 0.57
IleVal: 3.897 ± 0.561
IleTrp: 0.346 ± 0.164
IleTyr: 2.944 ± 0.478
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.188 ± 1.074
LysCys: 0.52 ± 0.199
LysAsp: 6.409 ± 0.604
LysGlu: 9.093 ± 0.979
LysPhe: 3.204 ± 0.596
LysGly: 4.936 ± 0.791
LysHis: 1.386 ± 0.304
LysIle: 7.967 ± 0.849
LysLys: 7.881 ± 0.773
LysLeu: 8.4 ± 0.871
LysMet: 3.204 ± 0.53
LysAsn: 5.889 ± 0.722
LysPro: 2.078 ± 0.441
LysGln: 4.07 ± 0.738
LysArg: 3.984 ± 0.624
LysSer: 5.196 ± 0.56
LysThr: 4.936 ± 0.644
LysVal: 5.976 ± 0.796
LysTrp: 1.126 ± 0.349
LysTyr: 2.685 ± 0.504
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.889 ± 0.813
LeuCys: 0.26 ± 0.148
LeuAsp: 6.928 ± 0.759
LeuGlu: 7.188 ± 0.952
LeuPhe: 2.252 ± 0.433
LeuGly: 4.59 ± 0.799
LeuHis: 0.866 ± 0.259
LeuIle: 4.763 ± 0.649
LeuLys: 10.392 ± 1.036
LeuLeu: 6.495 ± 0.669
LeuMet: 2.252 ± 0.497
LeuAsn: 5.023 ± 0.747
LeuPro: 2.771 ± 0.474
LeuGln: 2.771 ± 0.496
LeuArg: 2.944 ± 0.534
LeuSer: 6.668 ± 0.696
LeuThr: 5.369 ± 0.849
LeuVal: 4.07 ± 0.488
LeuTrp: 1.039 ± 0.258
LeuTyr: 2.252 ± 0.443
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.472 ± 0.323
MetCys: 0.26 ± 0.144
MetAsp: 1.126 ± 0.351
MetGlu: 2.078 ± 0.458
MetPhe: 1.126 ± 0.345
MetGly: 1.126 ± 0.294
MetHis: 0.26 ± 0.132
MetIle: 2.425 ± 0.342
MetLys: 1.819 ± 0.387
MetLeu: 2.858 ± 0.474
MetMet: 1.039 ± 0.341
MetAsn: 0.866 ± 0.213
MetPro: 0.779 ± 0.266
MetGln: 0.693 ± 0.255
MetArg: 1.732 ± 0.418
MetSer: 1.645 ± 0.386
MetThr: 2.771 ± 0.439
MetVal: 1.126 ± 0.245
MetTrp: 0.346 ± 0.176
MetTyr: 0.953 ± 0.295
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.811 ± 0.625
AsnCys: 0.52 ± 0.205
AsnAsp: 3.031 ± 0.435
AsnGlu: 2.425 ± 0.415
AsnPhe: 2.078 ± 0.441
AsnGly: 4.33 ± 0.646
AsnHis: 0.779 ± 0.277
AsnIle: 3.031 ± 0.481
AsnLys: 4.503 ± 0.755
AsnLeu: 4.763 ± 0.674
AsnMet: 1.645 ± 0.365
AsnAsn: 3.118 ± 0.479
AsnPro: 2.425 ± 0.443
AsnGln: 3.291 ± 0.629
AsnArg: 3.031 ± 0.49
AsnSer: 3.378 ± 0.586
AsnThr: 2.511 ± 0.469
AsnVal: 3.378 ± 0.575
AsnTrp: 0.779 ± 0.25
AsnTyr: 2.338 ± 0.469
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.559 ± 0.312
ProCys: 0.26 ± 0.153
ProAsp: 1.992 ± 0.459
ProGlu: 2.511 ± 0.407
ProPhe: 1.645 ± 0.415
ProGly: 1.559 ± 0.466
ProHis: 0.866 ± 0.285
ProIle: 1.732 ± 0.454
ProLys: 3.291 ± 0.528
ProLeu: 1.472 ± 0.37
ProMet: 0.779 ± 0.229
ProAsn: 1.126 ± 0.362
ProPro: 0.693 ± 0.23
ProGln: 1.126 ± 0.358
ProArg: 1.732 ± 0.4
ProSer: 1.819 ± 0.465
ProThr: 1.905 ± 0.382
ProVal: 1.645 ± 0.317
ProTrp: 0.26 ± 0.18
ProTyr: 0.606 ± 0.243
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.464 ± 0.443
GlnCys: 0.173 ± 0.163
GlnAsp: 1.732 ± 0.429
GlnGlu: 3.378 ± 0.465
GlnPhe: 1.645 ± 0.45
GlnGly: 1.819 ± 0.5
GlnHis: 0.606 ± 0.224
GlnIle: 2.425 ± 0.405
GlnLys: 4.157 ± 0.568
GlnLeu: 3.984 ± 0.619
GlnMet: 0.606 ± 0.218
GlnAsn: 1.559 ± 0.3
GlnPro: 1.299 ± 0.361
GlnGln: 0.866 ± 0.241
GlnArg: 2.078 ± 0.463
GlnSer: 3.118 ± 0.528
GlnThr: 2.598 ± 0.489
GlnVal: 2.252 ± 0.534
GlnTrp: 0.173 ± 0.124
GlnTyr: 1.126 ± 0.393
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.252 ± 0.512
ArgCys: 0.26 ± 0.164
ArgAsp: 3.118 ± 0.45
ArgGlu: 2.858 ± 0.492
ArgPhe: 1.732 ± 0.337
ArgGly: 2.944 ± 0.839
ArgHis: 0.866 ± 0.331
ArgIle: 2.511 ± 0.367
ArgLys: 3.897 ± 0.69
ArgLeu: 3.637 ± 0.543
ArgMet: 1.212 ± 0.304
ArgAsn: 2.165 ± 0.473
ArgPro: 0.606 ± 0.206
ArgGln: 1.645 ± 0.306
ArgArg: 2.165 ± 0.551
ArgSer: 1.819 ± 0.361
ArgThr: 2.252 ± 0.387
ArgVal: 2.685 ± 0.37
ArgTrp: 1.039 ± 0.27
ArgTyr: 2.078 ± 0.421
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.897 ± 0.742
SerCys: 0.26 ± 0.175
SerAsp: 3.897 ± 0.647
SerGlu: 4.157 ± 0.495
SerPhe: 2.858 ± 0.461
SerGly: 4.59 ± 0.756
SerHis: 0.866 ± 0.292
SerIle: 4.417 ± 0.702
SerLys: 5.369 ± 0.759
SerLeu: 4.503 ± 0.623
SerMet: 1.126 ± 0.376
SerAsn: 3.897 ± 0.636
SerPro: 1.645 ± 0.387
SerGln: 3.118 ± 0.429
SerArg: 2.338 ± 0.423
SerSer: 3.031 ± 0.638
SerThr: 3.378 ± 0.59
SerVal: 4.417 ± 0.665
SerTrp: 0.606 ± 0.231
SerTyr: 1.386 ± 0.26
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.936 ± 0.874
ThrCys: 0.433 ± 0.22
ThrAsp: 3.464 ± 0.568
ThrGlu: 3.031 ± 0.544
ThrPhe: 2.771 ± 0.486
ThrGly: 4.417 ± 0.702
ThrHis: 0.693 ± 0.255
ThrIle: 3.811 ± 0.474
ThrLys: 5.456 ± 0.681
ThrLeu: 4.33 ± 0.526
ThrMet: 1.386 ± 0.358
ThrAsn: 2.858 ± 0.443
ThrPro: 1.992 ± 0.362
ThrGln: 2.338 ± 0.391
ThrArg: 1.732 ± 0.407
ThrSer: 3.897 ± 0.506
ThrThr: 3.464 ± 0.844
ThrVal: 3.118 ± 0.603
ThrTrp: 0.693 ± 0.254
ThrTyr: 1.645 ± 0.335
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.85 ± 0.629
ValCys: 0.26 ± 0.153
ValAsp: 4.157 ± 0.53
ValGlu: 4.59 ± 0.533
ValPhe: 2.425 ± 0.473
ValGly: 3.291 ± 0.54
ValHis: 0.866 ± 0.264
ValIle: 4.417 ± 0.608
ValLys: 5.11 ± 0.7
ValLeu: 4.59 ± 0.638
ValMet: 1.645 ± 0.358
ValAsn: 2.944 ± 0.471
ValPro: 1.299 ± 0.323
ValGln: 1.819 ± 0.398
ValArg: 2.944 ± 0.458
ValSer: 3.897 ± 0.5
ValThr: 3.551 ± 0.477
ValVal: 3.984 ± 0.499
ValTrp: 0.779 ± 0.246
ValTyr: 1.905 ± 0.473
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.693 ± 0.216
TrpCys: 0.173 ± 0.122
TrpAsp: 0.606 ± 0.188
TrpGlu: 1.299 ± 0.275
TrpPhe: 0.866 ± 0.317
TrpGly: 1.039 ± 0.329
TrpHis: 0.346 ± 0.159
TrpIle: 0.866 ± 0.244
TrpLys: 0.866 ± 0.239
TrpLeu: 1.212 ± 0.409
TrpMet: 0.346 ± 0.187
TrpAsn: 0.52 ± 0.234
TrpPro: 0.346 ± 0.188
TrpGln: 0.52 ± 0.198
TrpArg: 0.52 ± 0.197
TrpSer: 0.433 ± 0.184
TrpThr: 0.779 ± 0.272
TrpVal: 0.346 ± 0.178
TrpTrp: 0.087 ± 0.09
TrpTyr: 0.433 ± 0.183
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.905 ± 0.462
TyrCys: 0.087 ± 0.087
TyrAsp: 2.685 ± 0.44
TyrGlu: 2.858 ± 0.522
TyrPhe: 1.386 ± 0.344
TyrGly: 1.905 ± 0.4
TyrHis: 1.039 ± 0.311
TyrIle: 2.338 ± 0.641
TyrLys: 3.551 ± 0.611
TyrLeu: 2.598 ± 0.58
TyrMet: 1.299 ± 0.336
TyrAsn: 1.732 ± 0.435
TyrPro: 1.299 ± 0.325
TyrGln: 2.165 ± 0.529
TyrArg: 1.212 ± 0.308
TyrSer: 1.472 ± 0.316
TyrThr: 1.126 ± 0.296
TyrVal: 2.252 ± 0.518
TyrTrp: 0.52 ± 0.208
TyrTyr: 1.819 ± 0.31
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (11548 amino acids)

Note: The error was estimated by bootstrapping (100 rounds) at the protein level.
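A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the sample standard deviation of the replicates is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the reported ± value are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. "AK") in per mille over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so whole
    proteins (not single residues) are the resampling unit.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.stdev(replicates)


if __name__ == "__main__":
    toy = ["MKLAEELK", "MAEKLLE", "MLKKAE"]  # hypothetical toy sequences
    print(pair_per_mille(toy, "LK"), "±", bootstrap_error(toy, "LK"))
```

Resampling whole proteins keeps the dependence between neighbouring residues inside a protein intact, which is presumably why the error is estimated at the protein level rather than per residue.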

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski