Amino acid dipepetide frequency for Streptococcus phage Javan447

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.006AlaAla: 5.006 ± 1.382
0.41AlaCys: 0.41 ± 0.178
4.514AlaAsp: 4.514 ± 0.622
6.565AlaGlu: 6.565 ± 0.921
2.872AlaPhe: 2.872 ± 0.392
3.529AlaGly: 3.529 ± 0.649
0.41AlaHis: 0.41 ± 0.133
5.581AlaIle: 5.581 ± 0.636
7.222AlaLys: 7.222 ± 0.809
6.73AlaLeu: 6.73 ± 0.902
1.641AlaMet: 1.641 ± 0.423
3.529AlaAsn: 3.529 ± 0.535
2.134AlaPro: 2.134 ± 0.457
3.119AlaGln: 3.119 ± 0.574
2.872AlaArg: 2.872 ± 0.595
3.857AlaSer: 3.857 ± 1.075
4.185AlaThr: 4.185 ± 0.805
4.678AlaVal: 4.678 ± 0.656
0.821AlaTrp: 0.821 ± 0.242
2.216AlaTyr: 2.216 ± 0.332
0.0AlaXaa: 0.0 ± 0.0
Cys
0.328CysAla: 0.328 ± 0.166
0.0CysCys: 0.0 ± 0.0
0.41CysAsp: 0.41 ± 0.185
0.574CysGlu: 0.574 ± 0.246
0.492CysPhe: 0.492 ± 0.199
0.082CysGly: 0.082 ± 0.087
0.164CysHis: 0.164 ± 0.12
0.0CysIle: 0.0 ± 0.0
0.492CysLys: 0.492 ± 0.191
0.985CysLeu: 0.985 ± 0.276
0.082CysMet: 0.082 ± 0.086
0.164CysAsn: 0.164 ± 0.106
0.164CysPro: 0.164 ± 0.112
0.0CysGln: 0.0 ± 0.0
0.492CysArg: 0.492 ± 0.201
0.246CysSer: 0.246 ± 0.151
0.0CysThr: 0.0 ± 0.0
0.328CysVal: 0.328 ± 0.144
0.082CysTrp: 0.082 ± 0.079
0.246CysTyr: 0.246 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
4.596AspAla: 4.596 ± 0.656
0.328AspCys: 0.328 ± 0.172
5.006AspAsp: 5.006 ± 0.874
5.334AspGlu: 5.334 ± 0.681
3.037AspPhe: 3.037 ± 0.481
5.252AspGly: 5.252 ± 0.655
0.821AspHis: 0.821 ± 0.221
4.103AspIle: 4.103 ± 0.623
6.237AspLys: 6.237 ± 0.621
7.14AspLeu: 7.14 ± 0.928
1.477AspMet: 1.477 ± 0.301
4.35AspAsn: 4.35 ± 0.554
1.067AspPro: 1.067 ± 0.282
1.641AspGln: 1.641 ± 0.417
2.134AspArg: 2.134 ± 0.407
3.529AspSer: 3.529 ± 0.666
3.283AspThr: 3.283 ± 0.475
5.663AspVal: 5.663 ± 0.534
0.821AspTrp: 0.821 ± 0.282
2.708AspTyr: 2.708 ± 0.574
0.0AspXaa: 0.0 ± 0.0
Glu
5.499GluAla: 5.499 ± 0.652
0.246GluCys: 0.246 ± 0.146
3.447GluAsp: 3.447 ± 0.677
6.237GluGlu: 6.237 ± 0.803
3.037GluPhe: 3.037 ± 0.53
2.462GluGly: 2.462 ± 0.463
1.067GluHis: 1.067 ± 0.293
7.222GluIle: 7.222 ± 1.097
5.991GluLys: 5.991 ± 0.757
8.863GluLeu: 8.863 ± 0.854
1.477GluMet: 1.477 ± 0.342
3.939GluAsn: 3.939 ± 0.586
1.723GluPro: 1.723 ± 0.466
3.365GluGln: 3.365 ± 0.623
3.119GluArg: 3.119 ± 0.544
5.581GluSer: 5.581 ± 0.654
4.021GluThr: 4.021 ± 0.599
5.088GluVal: 5.088 ± 0.677
0.903GluTrp: 0.903 ± 0.225
3.037GluTyr: 3.037 ± 0.57
0.0GluXaa: 0.0 ± 0.0
Phe
2.216PheAla: 2.216 ± 0.423
0.492PheCys: 0.492 ± 0.263
4.103PheAsp: 4.103 ± 0.519
2.544PheGlu: 2.544 ± 0.577
1.149PhePhe: 1.149 ± 0.362
2.38PheGly: 2.38 ± 0.466
0.082PheHis: 0.082 ± 0.088
2.298PheIle: 2.298 ± 0.458
2.544PheLys: 2.544 ± 0.438
2.38PheLeu: 2.38 ± 0.507
0.739PheMet: 0.739 ± 0.245
2.38PheAsn: 2.38 ± 0.375
0.574PhePro: 0.574 ± 0.22
0.574PheGln: 0.574 ± 0.229
1.888PheArg: 1.888 ± 0.461
2.79PheSer: 2.79 ± 0.473
2.626PheThr: 2.626 ± 0.397
2.38PheVal: 2.38 ± 0.407
0.328PheTrp: 0.328 ± 0.158
1.641PheTyr: 1.641 ± 0.344
0.0PheXaa: 0.0 ± 0.0
Gly
4.35GlyAla: 4.35 ± 0.746
0.41GlyCys: 0.41 ± 0.208
3.611GlyAsp: 3.611 ± 0.464
3.447GlyGlu: 3.447 ± 0.515
1.97GlyPhe: 1.97 ± 0.391
3.037GlyGly: 3.037 ± 0.54
1.149GlyHis: 1.149 ± 0.268
5.006GlyIle: 5.006 ± 0.735
5.006GlyLys: 5.006 ± 0.545
4.76GlyLeu: 4.76 ± 0.716
1.395GlyMet: 1.395 ± 0.341
2.954GlyAsn: 2.954 ± 0.484
0.903GlyPro: 0.903 ± 0.307
2.544GlyGln: 2.544 ± 0.416
2.216GlyArg: 2.216 ± 0.352
3.119GlySer: 3.119 ± 0.399
2.708GlyThr: 2.708 ± 0.505
5.991GlyVal: 5.991 ± 0.964
0.903GlyTrp: 0.903 ± 0.371
2.216GlyTyr: 2.216 ± 0.555
0.0GlyXaa: 0.0 ± 0.0
His
1.149HisAla: 1.149 ± 0.308
0.082HisCys: 0.082 ± 0.085
0.739HisAsp: 0.739 ± 0.245
0.739HisGlu: 0.739 ± 0.261
1.067HisPhe: 1.067 ± 0.279
0.985HisGly: 0.985 ± 0.268
0.082HisHis: 0.082 ± 0.09
1.805HisIle: 1.805 ± 0.434
1.231HisLys: 1.231 ± 0.294
1.395HisLeu: 1.395 ± 0.353
0.246HisMet: 0.246 ± 0.162
0.574HisAsn: 0.574 ± 0.192
0.41HisPro: 0.41 ± 0.183
0.574HisGln: 0.574 ± 0.214
0.821HisArg: 0.821 ± 0.261
0.821HisSer: 0.821 ± 0.351
1.067HisThr: 1.067 ± 0.325
0.574HisVal: 0.574 ± 0.234
0.246HisTrp: 0.246 ± 0.145
0.492HisTyr: 0.492 ± 0.205
0.0HisXaa: 0.0 ± 0.0
Ile
5.416IleAla: 5.416 ± 0.682
0.164IleCys: 0.164 ± 0.116
5.827IleAsp: 5.827 ± 0.704
6.648IleGlu: 6.648 ± 0.992
1.723IlePhe: 1.723 ± 0.341
3.037IleGly: 3.037 ± 0.406
0.821IleHis: 0.821 ± 0.303
3.857IleIle: 3.857 ± 0.64
8.617IleLys: 8.617 ± 1.02
4.76IleLeu: 4.76 ± 0.749
1.231IleMet: 1.231 ± 0.427
5.745IleAsn: 5.745 ± 0.755
1.149IlePro: 1.149 ± 0.302
2.052IleGln: 2.052 ± 0.353
2.626IleArg: 2.626 ± 0.539
4.514IleSer: 4.514 ± 0.587
4.432IleThr: 4.432 ± 0.5
4.268IleVal: 4.268 ± 0.756
0.657IleTrp: 0.657 ± 0.284
2.462IleTyr: 2.462 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
7.386LysAla: 7.386 ± 0.764
0.492LysCys: 0.492 ± 0.171
5.334LysAsp: 5.334 ± 0.601
6.894LysGlu: 6.894 ± 0.935
1.559LysPhe: 1.559 ± 0.351
4.924LysGly: 4.924 ± 0.673
1.641LysHis: 1.641 ± 0.381
6.565LysIle: 6.565 ± 0.821
7.304LysLys: 7.304 ± 0.826
7.14LysLeu: 7.14 ± 0.771
2.298LysMet: 2.298 ± 0.431
6.073LysAsn: 6.073 ± 0.759
3.037LysPro: 3.037 ± 0.669
4.596LysGln: 4.596 ± 0.742
4.268LysArg: 4.268 ± 0.727
4.103LysSer: 4.103 ± 0.599
4.924LysThr: 4.924 ± 0.508
6.565LysVal: 6.565 ± 0.757
1.231LysTrp: 1.231 ± 0.264
4.185LysTyr: 4.185 ± 0.533
0.0LysXaa: 0.0 ± 0.0
Leu
6.319LeuAla: 6.319 ± 0.973
0.328LeuCys: 0.328 ± 0.153
7.304LeuAsp: 7.304 ± 0.94
6.483LeuGlu: 6.483 ± 0.698
2.872LeuPhe: 2.872 ± 0.524
5.499LeuGly: 5.499 ± 0.609
1.231LeuHis: 1.231 ± 0.414
5.006LeuIle: 5.006 ± 0.829
8.371LeuLys: 8.371 ± 0.804
6.812LeuLeu: 6.812 ± 0.999
1.641LeuMet: 1.641 ± 0.48
5.827LeuAsn: 5.827 ± 0.729
1.97LeuPro: 1.97 ± 0.421
2.626LeuGln: 2.626 ± 0.51
3.611LeuArg: 3.611 ± 0.537
6.073LeuSer: 6.073 ± 0.662
5.006LeuThr: 5.006 ± 0.553
4.678LeuVal: 4.678 ± 0.532
0.903LeuTrp: 0.903 ± 0.208
3.037LeuTyr: 3.037 ± 0.628
0.0LeuXaa: 0.0 ± 0.0
Met
1.805MetAla: 1.805 ± 0.311
0.164MetCys: 0.164 ± 0.12
1.641MetAsp: 1.641 ± 0.392
1.641MetGlu: 1.641 ± 0.358
0.41MetPhe: 0.41 ± 0.202
0.985MetGly: 0.985 ± 0.379
0.492MetHis: 0.492 ± 0.164
1.477MetIle: 1.477 ± 0.24
1.805MetLys: 1.805 ± 0.313
1.477MetLeu: 1.477 ± 0.36
0.328MetMet: 0.328 ± 0.155
1.149MetAsn: 1.149 ± 0.27
0.903MetPro: 0.903 ± 0.259
1.067MetGln: 1.067 ± 0.43
1.723MetArg: 1.723 ± 0.355
2.052MetSer: 2.052 ± 0.513
1.805MetThr: 1.805 ± 0.324
0.985MetVal: 0.985 ± 0.283
0.328MetTrp: 0.328 ± 0.183
0.492MetTyr: 0.492 ± 0.194
0.0MetXaa: 0.0 ± 0.0
Asn
4.842AsnAla: 4.842 ± 0.824
0.492AsnCys: 0.492 ± 0.178
3.447AsnAsp: 3.447 ± 0.449
3.939AsnGlu: 3.939 ± 0.698
1.805AsnPhe: 1.805 ± 0.434
4.596AsnGly: 4.596 ± 0.647
0.985AsnHis: 0.985 ± 0.281
3.693AsnIle: 3.693 ± 0.505
4.924AsnLys: 4.924 ± 0.791
4.596AsnLeu: 4.596 ± 0.441
1.477AsnMet: 1.477 ± 0.314
3.119AsnAsn: 3.119 ± 0.476
2.38AsnPro: 2.38 ± 0.475
2.954AsnGln: 2.954 ± 0.496
2.134AsnArg: 2.134 ± 0.397
3.939AsnSer: 3.939 ± 0.663
2.79AsnThr: 2.79 ± 0.514
2.544AsnVal: 2.544 ± 0.406
0.574AsnTrp: 0.574 ± 0.255
1.888AsnTyr: 1.888 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
1.067ProAla: 1.067 ± 0.297
0.164ProCys: 0.164 ± 0.114
1.477ProAsp: 1.477 ± 0.312
1.723ProGlu: 1.723 ± 0.389
1.723ProPhe: 1.723 ± 0.4
1.067ProGly: 1.067 ± 0.289
0.492ProHis: 0.492 ± 0.196
1.723ProIle: 1.723 ± 0.533
2.298ProLys: 2.298 ± 0.436
2.298ProLeu: 2.298 ± 0.537
0.657ProMet: 0.657 ± 0.22
1.477ProAsn: 1.477 ± 0.368
0.328ProPro: 0.328 ± 0.15
0.985ProGln: 0.985 ± 0.281
1.149ProArg: 1.149 ± 0.314
1.723ProSer: 1.723 ± 0.328
1.395ProThr: 1.395 ± 0.388
1.888ProVal: 1.888 ± 0.362
0.164ProTrp: 0.164 ± 0.108
1.805ProTyr: 1.805 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
3.201GlnAla: 3.201 ± 0.664
0.328GlnCys: 0.328 ± 0.194
2.052GlnAsp: 2.052 ± 0.458
3.283GlnGlu: 3.283 ± 0.59
1.805GlnPhe: 1.805 ± 0.445
2.052GlnGly: 2.052 ± 0.544
0.328GlnHis: 0.328 ± 0.199
2.708GlnIle: 2.708 ± 0.441
4.678GlnLys: 4.678 ± 0.53
3.365GlnLeu: 3.365 ± 0.624
1.231GlnMet: 1.231 ± 0.295
2.38GlnAsn: 2.38 ± 0.551
0.821GlnPro: 0.821 ± 0.232
2.052GlnGln: 2.052 ± 0.647
2.216GlnArg: 2.216 ± 0.442
2.708GlnSer: 2.708 ± 0.501
2.544GlnThr: 2.544 ± 0.535
1.888GlnVal: 1.888 ± 0.387
0.492GlnTrp: 0.492 ± 0.165
0.985GlnTyr: 0.985 ± 0.249
0.0GlnXaa: 0.0 ± 0.0
Arg
3.365ArgAla: 3.365 ± 0.562
0.164ArgCys: 0.164 ± 0.159
2.298ArgAsp: 2.298 ± 0.548
3.283ArgGlu: 3.283 ± 0.552
1.395ArgPhe: 1.395 ± 0.33
2.462ArgGly: 2.462 ± 0.501
0.903ArgHis: 0.903 ± 0.332
3.119ArgIle: 3.119 ± 0.387
4.021ArgLys: 4.021 ± 0.624
3.529ArgLeu: 3.529 ± 0.524
0.985ArgMet: 0.985 ± 0.245
1.97ArgAsn: 1.97 ± 0.31
0.739ArgPro: 0.739 ± 0.302
2.134ArgGln: 2.134 ± 0.476
2.38ArgArg: 2.38 ± 0.423
2.052ArgSer: 2.052 ± 0.373
2.954ArgThr: 2.954 ± 0.546
1.97ArgVal: 1.97 ± 0.519
0.574ArgTrp: 0.574 ± 0.229
1.805ArgTyr: 1.805 ± 0.542
0.0ArgXaa: 0.0 ± 0.0
Ser
4.185SerAla: 4.185 ± 0.819
0.246SerCys: 0.246 ± 0.133
3.939SerAsp: 3.939 ± 0.563
4.76SerGlu: 4.76 ± 0.49
2.954SerPhe: 2.954 ± 0.469
3.775SerGly: 3.775 ± 0.652
1.067SerHis: 1.067 ± 0.28
4.678SerIle: 4.678 ± 0.58
5.581SerLys: 5.581 ± 0.598
5.499SerLeu: 5.499 ± 0.633
1.477SerMet: 1.477 ± 0.359
3.119SerAsn: 3.119 ± 0.612
1.477SerPro: 1.477 ± 0.284
3.365SerGln: 3.365 ± 0.552
1.805SerArg: 1.805 ± 0.466
3.283SerSer: 3.283 ± 0.686
3.529SerThr: 3.529 ± 0.724
3.447SerVal: 3.447 ± 0.533
0.821SerTrp: 0.821 ± 0.3
1.477SerTyr: 1.477 ± 0.411
0.0SerXaa: 0.0 ± 0.0
Thr
4.268ThrAla: 4.268 ± 0.979
0.082ThrCys: 0.082 ± 0.073
4.35ThrAsp: 4.35 ± 0.461
3.529ThrGlu: 3.529 ± 0.42
2.216ThrPhe: 2.216 ± 0.456
4.185ThrGly: 4.185 ± 0.793
0.821ThrHis: 0.821 ± 0.227
3.447ThrIle: 3.447 ± 0.56
4.514ThrLys: 4.514 ± 0.515
5.17ThrLeu: 5.17 ± 0.605
0.985ThrMet: 0.985 ± 0.413
3.119ThrAsn: 3.119 ± 0.544
2.216ThrPro: 2.216 ± 0.337
2.216ThrGln: 2.216 ± 0.415
2.216ThrArg: 2.216 ± 0.344
3.611ThrSer: 3.611 ± 0.438
4.35ThrThr: 4.35 ± 0.616
4.76ThrVal: 4.76 ± 0.631
0.739ThrTrp: 0.739 ± 0.224
1.97ThrTyr: 1.97 ± 0.448
0.0ThrXaa: 0.0 ± 0.0
Val
3.857ValAla: 3.857 ± 0.604
0.41ValCys: 0.41 ± 0.229
4.924ValAsp: 4.924 ± 0.486
5.745ValGlu: 5.745 ± 0.719
2.052ValPhe: 2.052 ± 0.638
4.596ValGly: 4.596 ± 0.744
1.313ValHis: 1.313 ± 0.339
4.35ValIle: 4.35 ± 0.639
5.499ValLys: 5.499 ± 0.707
4.678ValLeu: 4.678 ± 0.615
2.216ValMet: 2.216 ± 0.447
3.119ValAsn: 3.119 ± 0.418
1.888ValPro: 1.888 ± 0.465
1.888ValGln: 1.888 ± 0.33
2.954ValArg: 2.954 ± 0.421
4.021ValSer: 4.021 ± 0.483
4.185ValThr: 4.185 ± 0.568
4.678ValVal: 4.678 ± 0.451
0.492ValTrp: 0.492 ± 0.204
2.544ValTyr: 2.544 ± 0.595
0.0ValXaa: 0.0 ± 0.0
Trp
0.574TrpAla: 0.574 ± 0.189
0.082TrpCys: 0.082 ± 0.087
0.903TrpAsp: 0.903 ± 0.281
0.574TrpGlu: 0.574 ± 0.207
0.739TrpPhe: 0.739 ± 0.237
0.739TrpGly: 0.739 ± 0.296
0.328TrpHis: 0.328 ± 0.157
0.821TrpIle: 0.821 ± 0.284
0.985TrpLys: 0.985 ± 0.293
0.985TrpLeu: 0.985 ± 0.35
0.492TrpMet: 0.492 ± 0.196
0.492TrpAsn: 0.492 ± 0.173
0.164TrpPro: 0.164 ± 0.13
0.821TrpGln: 0.821 ± 0.212
0.574TrpArg: 0.574 ± 0.244
0.574TrpSer: 0.574 ± 0.187
0.739TrpThr: 0.739 ± 0.285
0.657TrpVal: 0.657 ± 0.221
0.0TrpTrp: 0.0 ± 0.0
0.574TrpTyr: 0.574 ± 0.201
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.462TyrAla: 2.462 ± 0.523
0.246TyrCys: 0.246 ± 0.139
3.201TyrAsp: 3.201 ± 0.55
2.38TyrGlu: 2.38 ± 0.428
1.231TyrPhe: 1.231 ± 0.38
2.134TyrGly: 2.134 ± 0.331
0.903TyrHis: 0.903 ± 0.327
2.544TyrIle: 2.544 ± 0.5
3.201TyrLys: 3.201 ± 0.626
3.037TyrLeu: 3.037 ± 0.605
0.574TyrMet: 0.574 ± 0.244
1.641TyrAsn: 1.641 ± 0.332
1.559TyrPro: 1.559 ± 0.358
2.544TyrGln: 2.544 ± 0.424
0.903TyrArg: 0.903 ± 0.283
1.888TyrSer: 1.888 ± 0.498
2.216TyrThr: 2.216 ± 0.432
2.38TyrVal: 2.38 ± 0.449
0.739TyrTrp: 0.739 ± 0.238
1.97TyrTyr: 1.97 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12186 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski