Amino acid dipepetide frequency for Staphylococcus phage StauST398-3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.866AlaAla: 0.866 ± 0.308
0.315AlaCys: 0.315 ± 0.162
2.755AlaAsp: 2.755 ± 0.45
3.7AlaGlu: 3.7 ± 0.633
2.677AlaPhe: 2.677 ± 0.643
3.464AlaGly: 3.464 ± 0.682
1.181AlaHis: 1.181 ± 0.284
5.117AlaIle: 5.117 ± 0.663
4.959AlaLys: 4.959 ± 0.552
4.566AlaLeu: 4.566 ± 0.719
1.653AlaMet: 1.653 ± 0.423
3.464AlaAsn: 3.464 ± 0.471
1.889AlaPro: 1.889 ± 0.466
2.519AlaGln: 2.519 ± 0.576
2.598AlaArg: 2.598 ± 0.457
4.172AlaSer: 4.172 ± 0.642
3.936AlaThr: 3.936 ± 0.688
3.779AlaVal: 3.779 ± 0.725
0.708AlaTrp: 0.708 ± 0.267
2.677AlaTyr: 2.677 ± 0.423
0.0AlaXaa: 0.0 ± 0.0
Cys
0.157CysAla: 0.157 ± 0.128
0.079CysCys: 0.079 ± 0.077
0.236CysAsp: 0.236 ± 0.122
0.079CysGlu: 0.079 ± 0.081
0.315CysPhe: 0.315 ± 0.169
0.394CysGly: 0.394 ± 0.186
0.0CysHis: 0.0 ± 0.0
0.079CysIle: 0.079 ± 0.073
0.315CysLys: 0.315 ± 0.151
0.394CysLeu: 0.394 ± 0.152
0.236CysMet: 0.236 ± 0.135
0.394CysAsn: 0.394 ± 0.175
0.236CysPro: 0.236 ± 0.196
0.236CysGln: 0.236 ± 0.154
0.236CysArg: 0.236 ± 0.142
0.315CysSer: 0.315 ± 0.167
0.157CysThr: 0.157 ± 0.101
0.394CysVal: 0.394 ± 0.199
0.0CysTrp: 0.0 ± 0.0
0.315CysTyr: 0.315 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
4.094AspAla: 4.094 ± 0.639
0.157AspCys: 0.157 ± 0.124
6.062AspAsp: 6.062 ± 0.937
4.881AspGlu: 4.881 ± 0.717
3.149AspPhe: 3.149 ± 0.588
4.959AspGly: 4.959 ± 0.669
0.63AspHis: 0.63 ± 0.239
4.723AspIle: 4.723 ± 0.516
5.747AspLys: 5.747 ± 0.865
4.881AspLeu: 4.881 ± 0.442
1.181AspMet: 1.181 ± 0.268
3.621AspAsn: 3.621 ± 0.643
1.889AspPro: 1.889 ± 0.274
1.023AspGln: 1.023 ± 0.316
2.991AspArg: 2.991 ± 0.417
4.408AspSer: 4.408 ± 0.431
2.991AspThr: 2.991 ± 0.438
5.274AspVal: 5.274 ± 0.747
0.945AspTrp: 0.945 ± 0.319
4.251AspTyr: 4.251 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
4.251GluAla: 4.251 ± 0.623
0.63GluCys: 0.63 ± 0.196
3.464GluAsp: 3.464 ± 0.617
5.196GluGlu: 5.196 ± 0.821
3.306GluPhe: 3.306 ± 0.575
2.991GluGly: 2.991 ± 0.454
1.574GluHis: 1.574 ± 0.323
5.668GluIle: 5.668 ± 0.767
5.904GluLys: 5.904 ± 0.965
7.164GluLeu: 7.164 ± 1.093
2.834GluMet: 2.834 ± 0.7
4.172GluAsn: 4.172 ± 0.7
1.574GluPro: 1.574 ± 0.276
3.621GluGln: 3.621 ± 0.509
2.677GluArg: 2.677 ± 0.473
4.015GluSer: 4.015 ± 0.541
3.857GluThr: 3.857 ± 0.491
5.511GluVal: 5.511 ± 0.785
0.866GluTrp: 0.866 ± 0.247
4.566GluTyr: 4.566 ± 0.671
0.0GluXaa: 0.0 ± 0.0
Phe
1.889PheAla: 1.889 ± 0.375
0.079PheCys: 0.079 ± 0.068
4.959PheAsp: 4.959 ± 0.451
3.385PheGlu: 3.385 ± 0.616
1.26PhePhe: 1.26 ± 0.371
2.913PheGly: 2.913 ± 0.637
0.394PheHis: 0.394 ± 0.184
3.542PheIle: 3.542 ± 0.558
4.645PheLys: 4.645 ± 0.653
2.519PheLeu: 2.519 ± 0.465
1.023PheMet: 1.023 ± 0.268
3.306PheAsn: 3.306 ± 0.451
0.866PhePro: 0.866 ± 0.311
1.023PheGln: 1.023 ± 0.393
1.496PheArg: 1.496 ± 0.316
1.811PheSer: 1.811 ± 0.388
2.834PheThr: 2.834 ± 0.43
2.047PheVal: 2.047 ± 0.375
0.472PheTrp: 0.472 ± 0.223
2.125PheTyr: 2.125 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
4.172GlyAla: 4.172 ± 0.729
0.236GlyCys: 0.236 ± 0.141
4.094GlyAsp: 4.094 ± 0.58
2.362GlyGlu: 2.362 ± 0.43
2.519GlyPhe: 2.519 ± 0.459
2.991GlyGly: 2.991 ± 0.564
1.496GlyHis: 1.496 ± 0.377
4.251GlyIle: 4.251 ± 0.646
4.408GlyLys: 4.408 ± 0.431
4.802GlyLeu: 4.802 ± 0.755
1.653GlyMet: 1.653 ± 0.411
3.306GlyAsn: 3.306 ± 0.601
0.472GlyPro: 0.472 ± 0.202
2.913GlyGln: 2.913 ± 0.496
2.834GlyArg: 2.834 ± 0.535
2.519GlySer: 2.519 ± 0.581
3.936GlyThr: 3.936 ± 0.55
5.274GlyVal: 5.274 ± 0.765
1.26GlyTrp: 1.26 ± 0.606
3.149GlyTyr: 3.149 ± 0.667
0.0GlyXaa: 0.0 ± 0.0
His
1.338HisAla: 1.338 ± 0.312
0.0HisCys: 0.0 ± 0.0
0.394HisAsp: 0.394 ± 0.145
0.866HisGlu: 0.866 ± 0.241
0.708HisPhe: 0.708 ± 0.254
1.26HisGly: 1.26 ± 0.25
0.315HisHis: 0.315 ± 0.152
1.181HisIle: 1.181 ± 0.283
1.26HisLys: 1.26 ± 0.274
1.023HisLeu: 1.023 ± 0.323
0.236HisMet: 0.236 ± 0.126
0.866HisAsn: 0.866 ± 0.302
0.866HisPro: 0.866 ± 0.336
1.102HisGln: 1.102 ± 0.304
0.708HisArg: 0.708 ± 0.227
1.023HisSer: 1.023 ± 0.284
1.26HisThr: 1.26 ± 0.373
0.866HisVal: 0.866 ± 0.291
0.079HisTrp: 0.079 ± 0.08
0.866HisTyr: 0.866 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
4.094IleAla: 4.094 ± 0.717
0.157IleCys: 0.157 ± 0.107
6.376IleAsp: 6.376 ± 0.831
7.872IleGlu: 7.872 ± 1.093
2.519IlePhe: 2.519 ± 0.391
5.432IleGly: 5.432 ± 0.849
0.787IleHis: 0.787 ± 0.249
3.936IleIle: 3.936 ± 0.584
7.006IleLys: 7.006 ± 0.685
4.566IleLeu: 4.566 ± 0.745
2.204IleMet: 2.204 ± 0.38
4.094IleAsn: 4.094 ± 0.579
1.968IlePro: 1.968 ± 0.329
2.991IleGln: 2.991 ± 0.441
3.228IleArg: 3.228 ± 0.659
3.936IleSer: 3.936 ± 0.554
5.432IleThr: 5.432 ± 0.686
3.149IleVal: 3.149 ± 0.464
0.63IleTrp: 0.63 ± 0.297
2.047IleTyr: 2.047 ± 0.559
0.0IleXaa: 0.0 ± 0.0
Lys
5.825LysAla: 5.825 ± 0.553
0.394LysCys: 0.394 ± 0.155
5.825LysAsp: 5.825 ± 0.602
8.581LysGlu: 8.581 ± 0.87
3.621LysPhe: 3.621 ± 0.488
4.408LysGly: 4.408 ± 0.584
1.26LysHis: 1.26 ± 0.296
6.298LysIle: 6.298 ± 0.758
8.896LysLys: 8.896 ± 1.452
6.927LysLeu: 6.927 ± 0.799
3.07LysMet: 3.07 ± 0.467
4.802LysAsn: 4.802 ± 0.617
2.204LysPro: 2.204 ± 0.481
4.566LysGln: 4.566 ± 0.647
5.117LysArg: 5.117 ± 0.752
5.196LysSer: 5.196 ± 0.628
5.274LysThr: 5.274 ± 0.717
4.172LysVal: 4.172 ± 0.571
1.023LysTrp: 1.023 ± 0.266
3.7LysTyr: 3.7 ± 0.525
0.0LysXaa: 0.0 ± 0.0
Leu
4.723LeuAla: 4.723 ± 0.697
0.394LeuCys: 0.394 ± 0.222
5.825LeuAsp: 5.825 ± 0.593
5.117LeuGlu: 5.117 ± 0.799
2.991LeuPhe: 2.991 ± 0.523
2.991LeuGly: 2.991 ± 0.561
1.181LeuHis: 1.181 ± 0.287
5.668LeuIle: 5.668 ± 0.677
7.872LeuLys: 7.872 ± 0.625
5.511LeuLeu: 5.511 ± 0.669
1.968LeuMet: 1.968 ± 0.386
4.881LeuAsn: 4.881 ± 0.52
2.44LeuPro: 2.44 ± 0.506
2.598LeuGln: 2.598 ± 0.401
3.385LeuArg: 3.385 ± 0.575
5.511LeuSer: 5.511 ± 0.6
4.566LeuThr: 4.566 ± 0.624
3.621LeuVal: 3.621 ± 0.473
0.708LeuTrp: 0.708 ± 0.305
4.094LeuTyr: 4.094 ± 0.6
0.0LeuXaa: 0.0 ± 0.0
Met
2.047MetAla: 2.047 ± 0.465
0.0MetCys: 0.0 ± 0.0
1.023MetAsp: 1.023 ± 0.359
1.338MetGlu: 1.338 ± 0.278
1.181MetPhe: 1.181 ± 0.254
1.102MetGly: 1.102 ± 0.281
0.472MetHis: 0.472 ± 0.19
1.181MetIle: 1.181 ± 0.353
2.125MetLys: 2.125 ± 0.421
2.677MetLeu: 2.677 ± 0.419
0.787MetMet: 0.787 ± 0.258
2.047MetAsn: 2.047 ± 0.367
0.787MetPro: 0.787 ± 0.228
1.338MetGln: 1.338 ± 0.363
1.26MetArg: 1.26 ± 0.266
1.653MetSer: 1.653 ± 0.49
3.07MetThr: 3.07 ± 0.568
0.708MetVal: 0.708 ± 0.283
0.551MetTrp: 0.551 ± 0.191
0.708MetTyr: 0.708 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
3.779AsnAla: 3.779 ± 0.54
0.472AsnCys: 0.472 ± 0.251
4.33AsnAsp: 4.33 ± 0.65
5.668AsnGlu: 5.668 ± 0.653
3.149AsnPhe: 3.149 ± 0.533
3.936AsnGly: 3.936 ± 0.582
0.63AsnHis: 0.63 ± 0.166
3.857AsnIle: 3.857 ± 0.537
6.455AsnLys: 6.455 ± 0.858
4.487AsnLeu: 4.487 ± 0.67
0.787AsnMet: 0.787 ± 0.222
5.747AsnAsn: 5.747 ± 1.025
2.519AsnPro: 2.519 ± 0.436
2.362AsnGln: 2.362 ± 0.478
2.047AsnArg: 2.047 ± 0.373
3.306AsnSer: 3.306 ± 0.349
3.779AsnThr: 3.779 ± 0.509
4.251AsnVal: 4.251 ± 0.656
1.023AsnTrp: 1.023 ± 0.269
2.598AsnTyr: 2.598 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
1.26ProAla: 1.26 ± 0.284
0.0ProCys: 0.0 ± 0.0
1.338ProAsp: 1.338 ± 0.353
2.047ProGlu: 2.047 ± 0.419
1.417ProPhe: 1.417 ± 0.318
1.732ProGly: 1.732 ± 0.473
0.63ProHis: 0.63 ± 0.224
2.362ProIle: 2.362 ± 0.446
3.385ProLys: 3.385 ± 0.539
2.125ProLeu: 2.125 ± 0.429
0.866ProMet: 0.866 ± 0.265
1.732ProAsn: 1.732 ± 0.39
0.551ProPro: 0.551 ± 0.187
1.102ProGln: 1.102 ± 0.338
0.787ProArg: 0.787 ± 0.273
1.574ProSer: 1.574 ± 0.402
2.047ProThr: 2.047 ± 0.371
1.732ProVal: 1.732 ± 0.361
0.157ProTrp: 0.157 ± 0.116
1.338ProTyr: 1.338 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
3.7GlnAla: 3.7 ± 0.556
0.394GlnCys: 0.394 ± 0.171
2.047GlnAsp: 2.047 ± 0.398
2.755GlnGlu: 2.755 ± 0.424
1.968GlnPhe: 1.968 ± 0.344
2.125GlnGly: 2.125 ± 0.367
0.945GlnHis: 0.945 ± 0.236
2.755GlnIle: 2.755 ± 0.436
2.991GlnLys: 2.991 ± 0.49
2.834GlnLeu: 2.834 ± 0.478
1.338GlnMet: 1.338 ± 0.299
3.464GlnAsn: 3.464 ± 0.612
1.417GlnPro: 1.417 ± 0.437
2.125GlnGln: 2.125 ± 0.46
1.811GlnArg: 1.811 ± 0.353
2.204GlnSer: 2.204 ± 0.396
1.968GlnThr: 1.968 ± 0.415
2.755GlnVal: 2.755 ± 0.458
0.472GlnTrp: 0.472 ± 0.229
1.496GlnTyr: 1.496 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
1.653ArgAla: 1.653 ± 0.376
0.236ArgCys: 0.236 ± 0.133
2.677ArgAsp: 2.677 ± 0.549
2.44ArgGlu: 2.44 ± 0.479
1.968ArgPhe: 1.968 ± 0.451
2.598ArgGly: 2.598 ± 0.438
1.102ArgHis: 1.102 ± 0.262
3.07ArgIle: 3.07 ± 0.513
3.7ArgLys: 3.7 ± 0.553
4.015ArgLeu: 4.015 ± 0.618
1.023ArgMet: 1.023 ± 0.28
3.464ArgAsn: 3.464 ± 0.459
1.417ArgPro: 1.417 ± 0.274
1.889ArgGln: 1.889 ± 0.418
1.811ArgArg: 1.811 ± 0.482
1.574ArgSer: 1.574 ± 0.374
2.44ArgThr: 2.44 ± 0.52
2.283ArgVal: 2.283 ± 0.341
0.315ArgTrp: 0.315 ± 0.148
2.44ArgTyr: 2.44 ± 0.475
0.0ArgXaa: 0.0 ± 0.0
Ser
3.779SerAla: 3.779 ± 0.592
0.236SerCys: 0.236 ± 0.224
4.645SerAsp: 4.645 ± 0.601
4.251SerGlu: 4.251 ± 0.716
2.834SerPhe: 2.834 ± 0.427
3.779SerGly: 3.779 ± 0.571
0.708SerHis: 0.708 ± 0.21
4.33SerIle: 4.33 ± 0.592
5.668SerLys: 5.668 ± 0.769
3.936SerLeu: 3.936 ± 0.465
1.338SerMet: 1.338 ± 0.374
3.779SerAsn: 3.779 ± 0.641
1.023SerPro: 1.023 ± 0.28
2.834SerGln: 2.834 ± 0.57
2.125SerArg: 2.125 ± 0.351
2.834SerSer: 2.834 ± 0.544
3.149SerThr: 3.149 ± 0.412
3.542SerVal: 3.542 ± 0.696
1.102SerTrp: 1.102 ± 0.302
2.125SerTyr: 2.125 ± 0.426
0.0SerXaa: 0.0 ± 0.0
Thr
3.542ThrAla: 3.542 ± 0.466
0.079ThrCys: 0.079 ± 0.086
3.385ThrAsp: 3.385 ± 0.474
4.015ThrGlu: 4.015 ± 0.574
3.07ThrPhe: 3.07 ± 0.571
4.802ThrGly: 4.802 ± 0.684
1.023ThrHis: 1.023 ± 0.23
4.881ThrIle: 4.881 ± 0.756
5.196ThrLys: 5.196 ± 0.659
4.33ThrLeu: 4.33 ± 0.589
1.023ThrMet: 1.023 ± 0.275
4.487ThrAsn: 4.487 ± 0.532
1.968ThrPro: 1.968 ± 0.413
2.834ThrGln: 2.834 ± 0.424
2.519ThrArg: 2.519 ± 0.452
4.408ThrSer: 4.408 ± 0.789
3.228ThrThr: 3.228 ± 0.657
3.542ThrVal: 3.542 ± 0.633
0.708ThrTrp: 0.708 ± 0.232
2.047ThrTyr: 2.047 ± 0.401
0.0ThrXaa: 0.0 ± 0.0
Val
3.306ValAla: 3.306 ± 0.761
0.315ValCys: 0.315 ± 0.172
5.038ValAsp: 5.038 ± 0.715
4.487ValGlu: 4.487 ± 0.648
1.811ValPhe: 1.811 ± 0.373
3.306ValGly: 3.306 ± 0.539
0.472ValHis: 0.472 ± 0.196
5.353ValIle: 5.353 ± 0.575
6.376ValLys: 6.376 ± 0.766
4.645ValLeu: 4.645 ± 0.692
1.653ValMet: 1.653 ± 0.321
3.07ValAsn: 3.07 ± 0.574
2.519ValPro: 2.519 ± 0.52
1.653ValGln: 1.653 ± 0.332
2.204ValArg: 2.204 ± 0.349
3.936ValSer: 3.936 ± 0.701
3.779ValThr: 3.779 ± 0.518
2.991ValVal: 2.991 ± 0.499
0.866ValTrp: 0.866 ± 0.345
1.889ValTyr: 1.889 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
0.866TrpAla: 0.866 ± 0.29
0.079TrpCys: 0.079 ± 0.081
0.551TrpAsp: 0.551 ± 0.177
0.866TrpGlu: 0.866 ± 0.203
0.708TrpPhe: 0.708 ± 0.177
0.551TrpGly: 0.551 ± 0.339
0.551TrpHis: 0.551 ± 0.197
0.787TrpIle: 0.787 ± 0.264
0.945TrpLys: 0.945 ± 0.308
1.338TrpLeu: 1.338 ± 0.319
0.157TrpMet: 0.157 ± 0.105
0.787TrpAsn: 0.787 ± 0.255
0.079TrpPro: 0.079 ± 0.065
0.787TrpGln: 0.787 ± 0.341
0.315TrpArg: 0.315 ± 0.152
0.787TrpSer: 0.787 ± 0.263
0.866TrpThr: 0.866 ± 0.209
1.102TrpVal: 1.102 ± 0.331
0.079TrpTrp: 0.079 ± 0.072
0.63TrpTyr: 0.63 ± 0.222
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.653TyrAla: 1.653 ± 0.35
0.315TyrCys: 0.315 ± 0.153
2.834TyrAsp: 2.834 ± 0.546
3.936TyrGlu: 3.936 ± 0.626
1.417TyrPhe: 1.417 ± 0.333
2.913TyrGly: 2.913 ± 0.828
0.866TyrHis: 0.866 ± 0.256
3.385TyrIle: 3.385 ± 0.522
3.306TyrLys: 3.306 ± 0.523
3.306TyrLeu: 3.306 ± 0.502
0.945TyrMet: 0.945 ± 0.295
3.7TyrAsn: 3.7 ± 0.546
1.496TyrPro: 1.496 ± 0.404
2.047TyrGln: 2.047 ± 0.382
1.968TyrArg: 1.968 ± 0.455
2.913TyrSer: 2.913 ± 0.487
2.519TyrThr: 2.519 ± 0.506
2.677TyrVal: 2.677 ± 0.429
0.866TyrTrp: 0.866 ± 0.263
1.889TyrTyr: 1.889 ± 0.455
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12704 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski