Amino acid dipeptide frequency for Streptococcus phage vB_SthS_VA214

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
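The comparison against the uniform expectation can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names and the toy sequences are hypothetical, and normalizing by the total number of overlapping dipeptides (counted within each protein, never across protein boundaries) is an assumption.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000 / len(STANDARD) ** 2  # 1/400 of all dipeptides = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and every non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With this labelling, a value such as AlaAla at 4.016‰ falls above the 2.5‰ expectation, while CysCys at 0.085‰ falls below it.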

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions (for example, 4.016‰ = 0.004016).

For more information, see the sequence space article on Wikipedia.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.016 ± 0.95 | 0.171 ± 0.116 | 3.759 ± 0.509 | 5.383 ± 0.618 | 2.734 ± 0.636 | 3.93 ± 0.703 | 0.598 ± 0.304 | 4.87 ± 0.69 | 5.041 ± 0.827 | 6.237 ± 0.57 | 1.623 ± 0.337 | 4.699 ± 0.867 | 1.452 ± 0.362 | 3.161 ± 0.677 | 2.563 ± 0.48 | 4.187 ± 0.699 | 3.076 ± 0.636 | 4.357 ± 0.695 | 1.452 ± 0.329 | 2.734 ± 0.4 | 0.0 ± 0.0 |
| Cys | 0.171 ± 0.116 | 0.085 ± 0.081 | 0.684 ± 0.256 | 0.513 ± 0.248 | 0.256 ± 0.183 | 0.342 ± 0.164 | 0.171 ± 0.11 | 0.0 ± 0.0 | 0.513 ± 0.238 | 0.427 ± 0.2 | 0.085 ± 0.084 | 0.171 ± 0.116 | 0.256 ± 0.136 | 0.171 ± 0.133 | 0.513 ± 0.281 | 0.684 ± 0.289 | 0.342 ± 0.195 | 0.256 ± 0.139 | 0.171 ± 0.124 | 0.085 ± 0.075 | 0.0 ± 0.0 |
| Asp | 3.589 ± 0.557 | 0.94 ± 0.315 | 4.614 ± 0.612 | 3.845 ± 0.638 | 3.93 ± 0.525 | 6.664 ± 1.329 | 1.111 ± 0.355 | 4.528 ± 0.532 | 4.785 ± 0.729 | 4.785 ± 0.805 | 2.221 ± 0.518 | 4.016 ± 0.78 | 1.794 ± 0.4 | 1.538 ± 0.38 | 2.392 ± 0.704 | 3.589 ± 0.585 | 3.674 ± 0.553 | 3.332 ± 0.477 | 0.854 ± 0.303 | 2.734 ± 0.444 | 0.0 ± 0.0 |
| Glu | 3.759 ± 0.577 | 0.171 ± 0.104 | 3.759 ± 0.64 | 5.895 ± 1.254 | 2.221 ± 0.534 | 3.076 ± 0.513 | 1.282 ± 0.392 | 7.092 ± 0.876 | 5.468 ± 1.235 | 6.835 ± 1.011 | 2.478 ± 0.486 | 3.674 ± 0.587 | 1.623 ± 0.421 | 3.076 ± 0.661 | 3.247 ± 0.575 | 3.589 ± 0.647 | 4.187 ± 0.581 | 4.187 ± 0.626 | 1.111 ± 0.308 | 3.93 ± 0.678 | 0.0 ± 0.0 |
| Phe | 3.247 ± 0.466 | 0.171 ± 0.162 | 4.101 ± 0.602 | 2.734 ± 0.479 | 1.623 ± 0.335 | 3.076 ± 0.542 | 0.342 ± 0.163 | 2.307 ± 0.357 | 4.357 ± 0.645 | 2.734 ± 0.451 | 0.598 ± 0.247 | 2.905 ± 0.622 | 0.769 ± 0.221 | 1.025 ± 0.272 | 1.367 ± 0.316 | 3.076 ± 0.518 | 2.478 ± 0.479 | 2.392 ± 0.375 | 0.854 ± 0.244 | 1.965 ± 0.395 | 0.0 ± 0.0 |
| Gly | 2.649 ± 0.518 | 0.427 ± 0.265 | 3.759 ± 0.645 | 3.845 ± 0.568 | 2.734 ± 0.526 | 4.016 ± 0.633 | 0.427 ± 0.183 | 4.357 ± 0.528 | 6.408 ± 1.069 | 6.408 ± 0.843 | 1.709 ± 0.322 | 3.93 ± 0.616 | 1.623 ± 0.56 | 3.674 ± 0.686 | 2.99 ± 0.605 | 4.699 ± 0.865 | 4.443 ± 0.68 | 3.076 ± 0.617 | 1.111 ± 0.412 | 2.99 ± 0.436 | 0.0 ± 0.0 |
| His | 0.598 ± 0.254 | 0.0 ± 0.0 | 1.196 ± 0.341 | 0.342 ± 0.183 | 0.427 ± 0.186 | 0.684 ± 0.19 | 0.598 ± 0.217 | 0.769 ± 0.286 | 1.111 ± 0.321 | 1.196 ± 0.298 | 0.513 ± 0.2 | 0.94 ± 0.297 | 0.342 ± 0.161 | 0.769 ± 0.263 | 0.94 ± 0.343 | 0.854 ± 0.246 | 0.769 ± 0.261 | 1.111 ± 0.255 | 0.0 ± 0.0 | 0.769 ± 0.332 | 0.0 ± 0.0 |
| Ile | 4.956 ± 0.662 | 0.256 ± 0.151 | 4.614 ± 0.571 | 6.152 ± 0.969 | 2.051 ± 0.397 | 4.187 ± 0.503 | 0.94 ± 0.257 | 3.161 ± 0.505 | 7.519 ± 0.676 | 4.101 ± 0.627 | 1.794 ± 0.472 | 4.443 ± 0.536 | 3.161 ± 0.445 | 3.332 ± 0.405 | 2.307 ± 0.466 | 3.589 ± 0.61 | 3.845 ± 0.53 | 2.734 ± 0.516 | 0.854 ± 0.236 | 1.965 ± 0.381 | 0.0 ± 0.0 |
| Lys | 5.981 ± 0.746 | 0.342 ± 0.166 | 5.297 ± 0.709 | 7.433 ± 1.121 | 3.93 ± 0.878 | 5.468 ± 0.739 | 1.452 ± 0.383 | 4.956 ± 0.574 | 7.775 ± 1.497 | 7.177 ± 1.179 | 2.563 ± 0.552 | 5.725 ± 0.782 | 3.076 ± 0.385 | 4.272 ± 0.666 | 3.845 ± 0.578 | 3.93 ± 0.437 | 5.383 ± 0.801 | 4.187 ± 0.629 | 1.111 ± 0.274 | 4.357 ± 0.808 | 0.0 ± 0.0 |
| Leu | 5.895 ± 0.805 | 0.513 ± 0.217 | 6.579 ± 0.924 | 7.092 ± 1.04 | 2.905 ± 0.34 | 5.468 ± 1.069 | 0.94 ± 0.316 | 5.041 ± 0.495 | 8.031 ± 0.937 | 5.212 ± 0.748 | 2.478 ± 0.38 | 5.468 ± 0.55 | 2.307 ± 0.435 | 2.563 ± 0.507 | 2.99 ± 0.67 | 5.468 ± 0.534 | 5.383 ± 0.795 | 4.187 ± 0.516 | 0.684 ± 0.224 | 2.136 ± 0.605 | 0.0 ± 0.0 |
| Met | 1.709 ± 0.394 | 0.0 ± 0.0 | 1.025 ± 0.363 | 1.623 ± 0.378 | 1.025 ± 0.284 | 1.025 ± 0.285 | 0.256 ± 0.157 | 2.221 ± 0.447 | 2.905 ± 0.617 | 2.051 ± 0.282 | 0.684 ± 0.239 | 1.623 ± 0.323 | 0.769 ± 0.196 | 0.854 ± 0.242 | 0.94 ± 0.253 | 2.136 ± 0.376 | 1.623 ± 0.395 | 1.452 ± 0.311 | 0.085 ± 0.067 | 1.111 ± 0.365 | 0.0 ± 0.0 |
| Asn | 4.443 ± 0.923 | 0.342 ± 0.185 | 3.93 ± 0.594 | 3.589 ± 0.701 | 3.076 ± 0.665 | 6.152 ± 1.03 | 0.769 ± 0.218 | 3.674 ± 0.546 | 5.041 ± 0.6 | 5.554 ± 0.567 | 1.025 ± 0.272 | 3.845 ± 0.538 | 3.247 ± 0.626 | 2.392 ± 0.477 | 2.392 ± 0.569 | 4.187 ± 0.683 | 2.905 ± 0.701 | 3.845 ± 0.58 | 1.452 ± 0.31 | 2.392 ± 0.408 | 0.0 ± 0.0 |
| Pro | 2.051 ± 0.387 | 0.085 ± 0.081 | 1.196 ± 0.348 | 1.538 ± 0.421 | 1.025 ± 0.398 | 1.196 ± 0.49 | 0.342 ± 0.154 | 1.623 ± 0.342 | 3.759 ± 0.545 | 2.221 ± 0.433 | 0.513 ± 0.181 | 2.649 ± 0.498 | 0.342 ± 0.196 | 1.88 ± 0.302 | 0.94 ± 0.34 | 2.221 ± 0.4 | 2.051 ± 0.394 | 1.623 ± 0.348 | 0.342 ± 0.15 | 0.94 ± 0.299 | 0.0 ± 0.0 |
| Gln | 4.016 ± 0.649 | 0.256 ± 0.138 | 1.794 ± 0.378 | 3.759 ± 0.741 | 1.452 ± 0.299 | 2.905 ± 0.797 | 0.684 ± 0.21 | 2.136 ± 0.412 | 3.332 ± 0.578 | 3.93 ± 0.48 | 1.452 ± 0.313 | 2.478 ± 0.415 | 0.769 ± 0.244 | 2.905 ± 0.685 | 1.367 ± 0.319 | 2.136 ± 0.347 | 2.99 ± 0.468 | 1.965 ± 0.431 | 0.598 ± 0.177 | 1.965 ± 0.353 | 0.0 ± 0.0 |
| Arg | 2.307 ± 0.372 | 0.171 ± 0.121 | 2.051 ± 0.409 | 2.734 ± 0.606 | 2.136 ± 0.395 | 1.965 ± 0.381 | 0.513 ± 0.188 | 2.905 ± 0.589 | 3.161 ± 0.639 | 4.101 ± 0.614 | 1.025 ± 0.359 | 2.478 ± 0.374 | 0.94 ± 0.242 | 2.051 ± 0.467 | 1.367 ± 0.312 | 1.709 ± 0.385 | 2.734 ± 0.753 | 2.221 ± 0.338 | 1.111 ± 0.32 | 1.965 ± 0.471 | 0.0 ± 0.0 |
| Ser | 3.845 ± 0.571 | 0.427 ± 0.194 | 4.016 ± 0.606 | 4.187 ± 0.512 | 3.161 ± 0.798 | 4.528 ± 0.554 | 0.769 ± 0.257 | 4.357 ± 0.708 | 5.212 ± 0.791 | 4.614 ± 0.569 | 1.538 ± 0.314 | 4.443 ± 0.754 | 1.965 ± 0.46 | 2.649 ± 0.575 | 3.076 ± 0.632 | 4.699 ± 0.863 | 3.247 ± 0.626 | 4.785 ± 0.768 | 0.513 ± 0.208 | 1.538 ± 0.377 | 0.0 ± 0.0 |
| Thr | 4.443 ± 0.679 | 0.427 ± 0.187 | 4.443 ± 0.651 | 3.247 ± 0.604 | 2.478 ± 0.544 | 3.759 ± 0.673 | 0.854 ± 0.232 | 5.041 ± 0.885 | 5.212 ± 0.722 | 5.468 ± 0.762 | 0.598 ± 0.229 | 3.503 ± 0.403 | 1.367 ± 0.479 | 1.965 ± 0.473 | 2.051 ± 0.395 | 3.418 ± 0.427 | 2.82 ± 0.608 | 4.272 ± 0.675 | 1.025 ± 0.318 | 3.076 ± 0.619 | 0.0 ± 0.0 |
| Val | 4.699 ± 0.867 | 0.427 ± 0.177 | 4.443 ± 0.578 | 3.247 ± 0.673 | 2.392 ± 0.458 | 4.101 ± 0.568 | 0.684 ± 0.219 | 3.589 ± 0.547 | 4.443 ± 0.678 | 3.161 ± 0.584 | 0.94 ± 0.314 | 4.016 ± 0.677 | 1.709 ± 0.391 | 1.794 ± 0.408 | 1.538 ± 0.522 | 5.041 ± 0.8 | 4.699 ± 0.619 | 3.076 ± 0.525 | 0.769 ± 0.243 | 1.965 ± 0.421 | 0.0 ± 0.0 |
| Trp | 0.854 ± 0.271 | 0.0 ± 0.0 | 0.854 ± 0.418 | 0.854 ± 0.236 | 0.769 ± 0.204 | 0.513 ± 0.187 | 0.427 ± 0.203 | 0.769 ± 0.245 | 1.025 ± 0.333 | 1.367 ± 0.349 | 0.171 ± 0.123 | 0.94 ± 0.274 | 0.085 ± 0.09 | 0.684 ± 0.223 | 0.94 ± 0.282 | 1.623 ± 0.458 | 0.854 ± 0.394 | 1.111 ± 0.265 | 0.256 ± 0.177 | 0.256 ± 0.135 | 0.0 ± 0.0 |
| Tyr | 2.82 ± 0.437 | 0.684 ± 0.328 | 2.82 ± 0.491 | 2.649 ± 0.5 | 1.794 ± 0.339 | 2.221 ± 0.576 | 0.769 ± 0.227 | 2.563 ± 0.503 | 3.161 ± 0.556 | 3.759 ± 0.42 | 1.025 ± 0.308 | 2.307 ± 0.398 | 0.94 ± 0.277 | 2.136 ± 0.401 | 1.88 ± 0.355 | 2.734 ± 0.706 | 2.051 ± 0.453 | 2.563 ± 0.569 | 0.0 ± 0.0 | 2.136 ± 0.472 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 53 proteins (11705 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
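Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is a hedged illustration, not the code behind the table. The function names are hypothetical, and taking the standard deviation of the resampled estimates as the reported error is an assumption, since the source does not say which spread measure it uses.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of protein sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):  # overlapping pairs within one protein
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.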

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski