Amino acid dipepetide frequency for Streptococcus phage TP-J34

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.513AlaAla: 4.513 ± 1.409
0.353AlaCys: 0.353 ± 0.187
5.218AlaAsp: 5.218 ± 0.929
5.077AlaGlu: 5.077 ± 0.732
2.468AlaPhe: 2.468 ± 0.746
5.218AlaGly: 5.218 ± 0.949
0.635AlaHis: 0.635 ± 0.227
5.712AlaIle: 5.712 ± 1.235
5.5AlaLys: 5.5 ± 0.779
6.276AlaLeu: 6.276 ± 0.845
1.974AlaMet: 1.974 ± 0.821
3.878AlaAsn: 3.878 ± 0.565
1.974AlaPro: 1.974 ± 0.477
3.032AlaGln: 3.032 ± 0.811
2.75AlaArg: 2.75 ± 0.599
5.289AlaSer: 5.289 ± 0.985
3.314AlaThr: 3.314 ± 0.582
4.795AlaVal: 4.795 ± 0.845
1.41AlaTrp: 1.41 ± 0.603
3.244AlaTyr: 3.244 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
0.141CysAla: 0.141 ± 0.096
0.071CysCys: 0.071 ± 0.071
0.494CysAsp: 0.494 ± 0.229
0.635CysGlu: 0.635 ± 0.221
0.141CysPhe: 0.141 ± 0.117
0.353CysGly: 0.353 ± 0.166
0.071CysHis: 0.071 ± 0.091
0.282CysIle: 0.282 ± 0.135
0.423CysLys: 0.423 ± 0.168
0.353CysLeu: 0.353 ± 0.175
0.071CysMet: 0.071 ± 0.067
0.423CysAsn: 0.423 ± 0.205
0.141CysPro: 0.141 ± 0.116
0.071CysGln: 0.071 ± 0.079
0.141CysArg: 0.141 ± 0.099
0.635CysSer: 0.635 ± 0.173
0.282CysThr: 0.282 ± 0.15
0.282CysVal: 0.282 ± 0.133
0.141CysTrp: 0.141 ± 0.101
0.353CysTyr: 0.353 ± 0.275
0.0CysXaa: 0.0 ± 0.0
Asp
3.173AspAla: 3.173 ± 0.51
0.494AspCys: 0.494 ± 0.189
4.372AspAsp: 4.372 ± 0.547
3.314AspGlu: 3.314 ± 0.734
2.891AspPhe: 2.891 ± 0.516
7.334AspGly: 7.334 ± 1.888
1.128AspHis: 1.128 ± 0.261
4.09AspIle: 4.09 ± 0.51
4.866AspLys: 4.866 ± 0.792
4.443AspLeu: 4.443 ± 0.701
1.41AspMet: 1.41 ± 0.335
4.513AspAsn: 4.513 ± 0.869
1.128AspPro: 1.128 ± 0.296
1.481AspGln: 1.481 ± 0.306
3.103AspArg: 3.103 ± 0.728
4.936AspSer: 4.936 ± 0.865
3.103AspThr: 3.103 ± 0.41
2.962AspVal: 2.962 ± 0.575
1.199AspTrp: 1.199 ± 0.314
3.596AspTyr: 3.596 ± 0.724
0.0AspXaa: 0.0 ± 0.0
Glu
4.513GluAla: 4.513 ± 0.708
0.353GluCys: 0.353 ± 0.192
3.244GluAsp: 3.244 ± 0.512
4.725GluGlu: 4.725 ± 1.076
2.821GluPhe: 2.821 ± 0.418
3.032GluGly: 3.032 ± 0.468
1.128GluHis: 1.128 ± 0.329
5.5GluIle: 5.5 ± 1.1
5.007GluLys: 5.007 ± 1.058
6.911GluLeu: 6.911 ± 1.117
1.974GluMet: 1.974 ± 0.416
3.667GluAsn: 3.667 ± 0.703
1.833GluPro: 1.833 ± 0.472
2.891GluGln: 2.891 ± 0.54
4.019GluArg: 4.019 ± 0.788
3.455GluSer: 3.455 ± 0.477
3.032GluThr: 3.032 ± 0.638
5.43GluVal: 5.43 ± 0.869
0.846GluTrp: 0.846 ± 0.264
3.526GluTyr: 3.526 ± 0.674
0.0GluXaa: 0.0 ± 0.0
Phe
2.257PheAla: 2.257 ± 0.418
0.212PheCys: 0.212 ± 0.108
2.68PheAsp: 2.68 ± 0.421
3.032PheGlu: 3.032 ± 0.774
0.987PhePhe: 0.987 ± 0.312
3.032PheGly: 3.032 ± 0.62
0.635PheHis: 0.635 ± 0.246
1.904PheIle: 1.904 ± 0.442
4.16PheLys: 4.16 ± 0.685
2.186PheLeu: 2.186 ± 0.642
1.058PheMet: 1.058 ± 0.273
3.103PheAsn: 3.103 ± 0.393
0.564PhePro: 0.564 ± 0.232
1.269PheGln: 1.269 ± 0.336
1.41PheArg: 1.41 ± 0.296
4.09PheSer: 4.09 ± 0.765
2.68PheThr: 2.68 ± 0.529
1.763PheVal: 1.763 ± 0.381
0.423PheTrp: 0.423 ± 0.17
1.481PheTyr: 1.481 ± 0.406
0.0PheXaa: 0.0 ± 0.0
Gly
5.148GlyAla: 5.148 ± 0.778
0.071GlyCys: 0.071 ± 0.086
3.455GlyAsp: 3.455 ± 0.623
4.372GlyGlu: 4.372 ± 1.05
2.68GlyPhe: 2.68 ± 0.377
3.667GlyGly: 3.667 ± 0.607
1.199GlyHis: 1.199 ± 0.354
6.488GlyIle: 6.488 ± 1.497
5.782GlyLys: 5.782 ± 0.815
5.148GlyLeu: 5.148 ± 0.7
1.904GlyMet: 1.904 ± 0.562
4.372GlyAsn: 4.372 ± 0.854
2.468GlyPro: 2.468 ± 1.152
3.314GlyGln: 3.314 ± 0.544
3.385GlyArg: 3.385 ± 0.611
4.372GlySer: 4.372 ± 0.76
5.359GlyThr: 5.359 ± 0.674
3.949GlyVal: 3.949 ± 0.562
0.846GlyTrp: 0.846 ± 0.271
2.609GlyTyr: 2.609 ± 0.533
0.0GlyXaa: 0.0 ± 0.0
His
0.987HisAla: 0.987 ± 0.231
0.0HisCys: 0.0 ± 0.0
1.199HisAsp: 1.199 ± 0.235
0.564HisGlu: 0.564 ± 0.195
0.564HisPhe: 0.564 ± 0.176
0.776HisGly: 0.776 ± 0.266
0.282HisHis: 0.282 ± 0.15
0.987HisIle: 0.987 ± 0.417
0.846HisLys: 0.846 ± 0.224
0.987HisLeu: 0.987 ± 0.315
0.212HisMet: 0.212 ± 0.122
0.705HisAsn: 0.705 ± 0.236
0.282HisPro: 0.282 ± 0.152
0.635HisGln: 0.635 ± 0.229
0.564HisArg: 0.564 ± 0.226
0.564HisSer: 0.564 ± 0.238
1.058HisThr: 1.058 ± 0.272
0.987HisVal: 0.987 ± 0.257
0.141HisTrp: 0.141 ± 0.109
0.635HisTyr: 0.635 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
5.641IleAla: 5.641 ± 1.097
0.564IleCys: 0.564 ± 0.205
5.641IleAsp: 5.641 ± 0.589
4.936IleGlu: 4.936 ± 0.979
1.481IlePhe: 1.481 ± 0.334
5.359IleGly: 5.359 ± 0.952
0.987IleHis: 0.987 ± 0.218
3.878IleIle: 3.878 ± 0.634
5.359IleLys: 5.359 ± 0.614
3.667IleLeu: 3.667 ± 0.603
0.917IleMet: 0.917 ± 0.209
3.949IleAsn: 3.949 ± 0.632
2.75IlePro: 2.75 ± 0.572
2.821IleGln: 2.821 ± 0.488
2.468IleArg: 2.468 ± 0.388
5.359IleSer: 5.359 ± 1.043
4.866IleThr: 4.866 ± 0.744
3.596IleVal: 3.596 ± 0.613
0.635IleTrp: 0.635 ± 0.255
2.539IleTyr: 2.539 ± 0.563
0.0IleXaa: 0.0 ± 0.0
Lys
6.629LysAla: 6.629 ± 0.726
0.212LysCys: 0.212 ± 0.142
4.09LysAsp: 4.09 ± 0.613
5.853LysGlu: 5.853 ± 1.119
2.398LysPhe: 2.398 ± 0.632
4.936LysGly: 4.936 ± 0.76
0.846LysHis: 0.846 ± 0.24
5.148LysIle: 5.148 ± 0.738
4.866LysLys: 4.866 ± 0.956
6.77LysLeu: 6.77 ± 0.9
1.833LysMet: 1.833 ± 0.527
4.16LysAsn: 4.16 ± 0.574
3.103LysPro: 3.103 ± 0.589
3.385LysGln: 3.385 ± 0.64
4.513LysArg: 4.513 ± 0.845
4.372LysSer: 4.372 ± 0.544
5.148LysThr: 5.148 ± 0.629
4.019LysVal: 4.019 ± 0.499
1.34LysTrp: 1.34 ± 0.438
3.244LysTyr: 3.244 ± 0.756
0.0LysXaa: 0.0 ± 0.0
Leu
5.923LeuAla: 5.923 ± 0.641
0.494LeuCys: 0.494 ± 0.162
4.584LeuAsp: 4.584 ± 0.678
6.417LeuGlu: 6.417 ± 1.026
3.032LeuPhe: 3.032 ± 0.403
5.148LeuGly: 5.148 ± 0.735
0.635LeuHis: 0.635 ± 0.231
4.09LeuIle: 4.09 ± 0.541
6.488LeuLys: 6.488 ± 0.976
4.443LeuLeu: 4.443 ± 0.68
1.974LeuMet: 1.974 ± 0.262
4.866LeuAsn: 4.866 ± 0.475
1.974LeuPro: 1.974 ± 0.403
2.609LeuGln: 2.609 ± 0.397
2.68LeuArg: 2.68 ± 0.456
5.712LeuSer: 5.712 ± 0.719
6.699LeuThr: 6.699 ± 0.901
4.019LeuVal: 4.019 ± 0.48
0.423LeuTrp: 0.423 ± 0.136
3.173LeuTyr: 3.173 ± 0.629
0.0LeuXaa: 0.0 ± 0.0
Met
2.257MetAla: 2.257 ± 0.595
0.0MetCys: 0.0 ± 0.0
0.917MetAsp: 0.917 ± 0.306
1.41MetGlu: 1.41 ± 0.288
0.776MetPhe: 0.776 ± 0.236
1.269MetGly: 1.269 ± 0.436
0.423MetHis: 0.423 ± 0.179
1.763MetIle: 1.763 ± 0.403
2.116MetLys: 2.116 ± 0.458
1.974MetLeu: 1.974 ± 0.466
1.058MetMet: 1.058 ± 0.4
0.917MetAsn: 0.917 ± 0.235
0.917MetPro: 0.917 ± 0.369
1.34MetGln: 1.34 ± 0.516
0.987MetArg: 0.987 ± 0.231
2.045MetSer: 2.045 ± 0.461
1.481MetThr: 1.481 ± 0.312
0.917MetVal: 0.917 ± 0.255
0.071MetTrp: 0.071 ± 0.067
0.776MetTyr: 0.776 ± 0.299
0.0MetXaa: 0.0 ± 0.0
Asn
4.231AsnAla: 4.231 ± 0.593
0.282AsnCys: 0.282 ± 0.145
3.103AsnAsp: 3.103 ± 0.683
4.443AsnGlu: 4.443 ± 0.764
2.327AsnPhe: 2.327 ± 0.426
5.359AsnGly: 5.359 ± 0.922
1.058AsnHis: 1.058 ± 0.368
3.173AsnIle: 3.173 ± 0.36
3.808AsnLys: 3.808 ± 0.514
4.795AsnLeu: 4.795 ± 0.616
0.987AsnMet: 0.987 ± 0.272
3.244AsnAsn: 3.244 ± 0.603
2.68AsnPro: 2.68 ± 0.698
1.833AsnGln: 1.833 ± 0.355
2.398AsnArg: 2.398 ± 0.482
3.455AsnSer: 3.455 ± 0.51
3.455AsnThr: 3.455 ± 0.487
4.372AsnVal: 4.372 ± 0.642
1.34AsnTrp: 1.34 ± 0.317
2.327AsnTyr: 2.327 ± 0.474
0.0AsnXaa: 0.0 ± 0.0
Pro
1.269ProAla: 1.269 ± 0.281
0.071ProCys: 0.071 ± 0.073
1.763ProAsp: 1.763 ± 0.371
1.904ProGlu: 1.904 ± 0.5
1.128ProPhe: 1.128 ± 0.35
1.622ProGly: 1.622 ± 0.671
0.282ProHis: 0.282 ± 0.134
2.327ProIle: 2.327 ± 0.481
3.103ProLys: 3.103 ± 0.614
1.622ProLeu: 1.622 ± 0.462
0.212ProMet: 0.212 ± 0.143
2.468ProAsn: 2.468 ± 0.68
0.846ProPro: 0.846 ± 0.24
2.68ProGln: 2.68 ± 1.116
1.622ProArg: 1.622 ± 0.294
2.327ProSer: 2.327 ± 0.451
1.269ProThr: 1.269 ± 0.306
2.257ProVal: 2.257 ± 0.449
0.564ProTrp: 0.564 ± 0.259
1.481ProTyr: 1.481 ± 0.566
0.0ProXaa: 0.0 ± 0.0
Gln
3.596GlnAla: 3.596 ± 0.702
0.282GlnCys: 0.282 ± 0.139
2.609GlnAsp: 2.609 ± 0.532
2.891GlnGlu: 2.891 ± 0.593
1.551GlnPhe: 1.551 ± 0.317
3.173GlnGly: 3.173 ± 1.336
0.353GlnHis: 0.353 ± 0.164
2.398GlnIle: 2.398 ± 0.467
2.398GlnLys: 2.398 ± 0.528
3.526GlnLeu: 3.526 ± 0.517
1.199GlnMet: 1.199 ± 0.318
2.045GlnAsn: 2.045 ± 0.429
1.128GlnPro: 1.128 ± 0.296
1.622GlnGln: 1.622 ± 0.474
1.481GlnArg: 1.481 ± 0.343
2.75GlnSer: 2.75 ± 0.711
3.173GlnThr: 3.173 ± 0.394
2.327GlnVal: 2.327 ± 0.39
0.846GlnTrp: 0.846 ± 0.218
2.398GlnTyr: 2.398 ± 0.673
0.0GlnXaa: 0.0 ± 0.0
Arg
2.75ArgAla: 2.75 ± 0.507
0.564ArgCys: 0.564 ± 0.203
3.596ArgAsp: 3.596 ± 0.871
2.539ArgGlu: 2.539 ± 0.655
2.257ArgPhe: 2.257 ± 0.313
3.878ArgGly: 3.878 ± 1.265
0.635ArgHis: 0.635 ± 0.216
2.891ArgIle: 2.891 ± 0.536
3.314ArgLys: 3.314 ± 0.756
3.314ArgLeu: 3.314 ± 0.543
1.41ArgMet: 1.41 ± 0.357
2.116ArgAsn: 2.116 ± 0.424
0.917ArgPro: 0.917 ± 0.274
1.551ArgGln: 1.551 ± 0.344
1.41ArgArg: 1.41 ± 0.466
2.116ArgSer: 2.116 ± 0.426
1.833ArgThr: 1.833 ± 0.39
2.257ArgVal: 2.257 ± 0.426
0.705ArgTrp: 0.705 ± 0.347
1.904ArgTyr: 1.904 ± 0.296
0.0ArgXaa: 0.0 ± 0.0
Ser
6.417SerAla: 6.417 ± 1.659
0.635SerCys: 0.635 ± 0.222
5.571SerAsp: 5.571 ± 0.759
3.385SerGlu: 3.385 ± 0.458
2.962SerPhe: 2.962 ± 0.518
5.571SerGly: 5.571 ± 0.866
0.423SerHis: 0.423 ± 0.248
4.443SerIle: 4.443 ± 0.582
4.795SerLys: 4.795 ± 0.568
5.007SerLeu: 5.007 ± 0.591
1.058SerMet: 1.058 ± 0.281
3.808SerAsn: 3.808 ± 0.472
2.327SerPro: 2.327 ± 0.402
3.949SerGln: 3.949 ± 0.653
2.539SerArg: 2.539 ± 0.484
4.302SerSer: 4.302 ± 1.068
4.725SerThr: 4.725 ± 0.625
4.654SerVal: 4.654 ± 0.866
0.353SerTrp: 0.353 ± 0.149
2.68SerTyr: 2.68 ± 0.528
0.0SerXaa: 0.0 ± 0.0
Thr
4.866ThrAla: 4.866 ± 0.895
0.282ThrCys: 0.282 ± 0.205
3.808ThrAsp: 3.808 ± 0.564
3.949ThrGlu: 3.949 ± 0.576
3.526ThrPhe: 3.526 ± 0.573
3.667ThrGly: 3.667 ± 0.551
0.776ThrHis: 0.776 ± 0.263
5.43ThrIle: 5.43 ± 1.142
5.148ThrLys: 5.148 ± 0.633
5.5ThrLeu: 5.5 ± 0.633
0.846ThrMet: 0.846 ± 0.346
3.526ThrAsn: 3.526 ± 0.724
2.327ThrPro: 2.327 ± 0.523
2.75ThrGln: 2.75 ± 0.554
1.974ThrArg: 1.974 ± 0.415
4.019ThrSer: 4.019 ± 0.823
4.795ThrThr: 4.795 ± 0.777
4.443ThrVal: 4.443 ± 0.553
0.987ThrTrp: 0.987 ± 0.478
2.75ThrTyr: 2.75 ± 0.487
0.0ThrXaa: 0.0 ± 0.0
Val
3.526ValAla: 3.526 ± 0.726
0.212ValCys: 0.212 ± 0.127
3.808ValAsp: 3.808 ± 0.574
5.077ValGlu: 5.077 ± 0.878
2.257ValPhe: 2.257 ± 0.378
4.019ValGly: 4.019 ± 0.745
0.846ValHis: 0.846 ± 0.339
3.244ValIle: 3.244 ± 0.451
4.936ValLys: 4.936 ± 0.54
4.584ValLeu: 4.584 ± 0.605
1.199ValMet: 1.199 ± 0.297
4.019ValAsn: 4.019 ± 0.76
1.41ValPro: 1.41 ± 0.274
2.186ValGln: 2.186 ± 0.48
1.622ValArg: 1.622 ± 0.444
5.148ValSer: 5.148 ± 0.636
4.654ValThr: 4.654 ± 0.606
4.302ValVal: 4.302 ± 0.7
0.987ValTrp: 0.987 ± 0.282
1.904ValTyr: 1.904 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.249
0.071TrpCys: 0.071 ± 0.067
0.564TrpAsp: 0.564 ± 0.161
1.128TrpGlu: 1.128 ± 0.28
0.635TrpPhe: 0.635 ± 0.226
0.917TrpGly: 0.917 ± 0.262
0.212TrpHis: 0.212 ± 0.132
0.564TrpIle: 0.564 ± 0.189
0.635TrpLys: 0.635 ± 0.247
0.776TrpLeu: 0.776 ± 0.233
0.423TrpMet: 0.423 ± 0.264
0.987TrpAsn: 0.987 ± 0.324
0.353TrpPro: 0.353 ± 0.197
0.494TrpGln: 0.494 ± 0.197
0.705TrpArg: 0.705 ± 0.246
1.833TrpSer: 1.833 ± 0.467
1.269TrpThr: 1.269 ± 0.502
0.846TrpVal: 0.846 ± 0.239
0.282TrpTrp: 0.282 ± 0.149
0.423TrpTyr: 0.423 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.09TyrAla: 4.09 ± 0.658
0.282TyrCys: 0.282 ± 0.123
2.821TyrAsp: 2.821 ± 0.675
2.257TyrGlu: 2.257 ± 0.562
2.045TyrPhe: 2.045 ± 0.385
2.257TyrGly: 2.257 ± 0.546
0.423TyrHis: 0.423 ± 0.185
3.103TyrIle: 3.103 ± 0.492
3.385TyrLys: 3.385 ± 0.481
3.103TyrLeu: 3.103 ± 0.636
1.622TyrMet: 1.622 ± 0.449
1.904TyrAsn: 1.904 ± 0.403
1.692TyrPro: 1.692 ± 0.543
1.833TyrGln: 1.833 ± 0.426
2.186TyrArg: 2.186 ± 0.488
2.821TyrSer: 2.821 ± 0.538
3.173TyrThr: 3.173 ± 0.993
1.763TyrVal: 1.763 ± 0.356
0.282TyrTrp: 0.282 ± 0.159
1.904TyrTyr: 1.904 ± 0.566
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (14182 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski