Amino acid dipepetide frequency for Staphylococcus phage DW2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.032AlaAla: 1.032 ± 0.358
0.238AlaCys: 0.238 ± 0.132
2.62AlaAsp: 2.62 ± 0.461
4.129AlaGlu: 4.129 ± 0.62
2.779AlaPhe: 2.779 ± 0.706
4.129AlaGly: 4.129 ± 0.725
1.191AlaHis: 1.191 ± 0.311
5.241AlaIle: 5.241 ± 0.584
5.796AlaLys: 5.796 ± 0.621
3.97AlaLeu: 3.97 ± 0.695
1.588AlaMet: 1.588 ± 0.347
3.811AlaAsn: 3.811 ± 0.554
1.906AlaPro: 1.906 ± 0.421
3.017AlaGln: 3.017 ± 0.535
2.461AlaArg: 2.461 ± 0.552
3.494AlaSer: 3.494 ± 0.469
4.526AlaThr: 4.526 ± 0.667
3.573AlaVal: 3.573 ± 0.797
0.873AlaTrp: 0.873 ± 0.387
2.382AlaTyr: 2.382 ± 0.469
0.0AlaXaa: 0.0 ± 0.0
Cys
0.397CysAla: 0.397 ± 0.184
0.0CysCys: 0.0 ± 0.0
0.318CysAsp: 0.318 ± 0.145
0.476CysGlu: 0.476 ± 0.275
0.159CysPhe: 0.159 ± 0.121
0.397CysGly: 0.397 ± 0.187
0.079CysHis: 0.079 ± 0.075
0.0CysIle: 0.0 ± 0.0
0.318CysLys: 0.318 ± 0.171
0.159CysLeu: 0.159 ± 0.115
0.238CysMet: 0.238 ± 0.14
0.238CysAsn: 0.238 ± 0.132
0.318CysPro: 0.318 ± 0.192
0.238CysGln: 0.238 ± 0.141
0.238CysArg: 0.238 ± 0.136
0.397CysSer: 0.397 ± 0.183
0.079CysThr: 0.079 ± 0.081
0.318CysVal: 0.318 ± 0.177
0.0CysTrp: 0.0 ± 0.0
0.397CysTyr: 0.397 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
4.208AspAla: 4.208 ± 0.715
0.079AspCys: 0.079 ± 0.08
4.923AspAsp: 4.923 ± 1.0
4.526AspGlu: 4.526 ± 0.773
3.017AspPhe: 3.017 ± 0.597
4.367AspGly: 4.367 ± 0.625
0.476AspHis: 0.476 ± 0.177
4.447AspIle: 4.447 ± 0.491
5.479AspLys: 5.479 ± 1.05
5.479AspLeu: 5.479 ± 0.647
1.35AspMet: 1.35 ± 0.276
3.97AspAsn: 3.97 ± 0.641
1.588AspPro: 1.588 ± 0.216
0.873AspGln: 0.873 ± 0.268
2.461AspArg: 2.461 ± 0.441
4.208AspSer: 4.208 ± 0.575
2.938AspThr: 2.938 ± 0.425
4.685AspVal: 4.685 ± 0.597
0.794AspTrp: 0.794 ± 0.269
3.891AspTyr: 3.891 ± 0.499
0.0AspXaa: 0.0 ± 0.0
Glu
4.764GluAla: 4.764 ± 0.705
0.635GluCys: 0.635 ± 0.225
4.05GluAsp: 4.05 ± 0.543
6.035GluGlu: 6.035 ± 0.947
3.256GluPhe: 3.256 ± 0.496
2.779GluGly: 2.779 ± 0.419
1.667GluHis: 1.667 ± 0.36
4.605GluIle: 4.605 ± 0.591
6.035GluLys: 6.035 ± 1.026
6.829GluLeu: 6.829 ± 0.777
3.256GluMet: 3.256 ± 0.595
4.605GluAsn: 4.605 ± 0.632
1.35GluPro: 1.35 ± 0.254
4.208GluGln: 4.208 ± 0.543
4.05GluArg: 4.05 ± 0.555
4.208GluSer: 4.208 ± 0.652
3.891GluThr: 3.891 ± 0.473
6.035GluVal: 6.035 ± 0.926
1.191GluTrp: 1.191 ± 0.258
4.288GluTyr: 4.288 ± 0.596
0.0GluXaa: 0.0 ± 0.0
Phe
1.985PheAla: 1.985 ± 0.297
0.238PheCys: 0.238 ± 0.16
4.923PheAsp: 4.923 ± 0.478
3.732PheGlu: 3.732 ± 0.677
1.032PhePhe: 1.032 ± 0.233
3.017PheGly: 3.017 ± 0.86
0.556PheHis: 0.556 ± 0.214
3.335PheIle: 3.335 ± 0.603
4.764PheLys: 4.764 ± 0.533
2.303PheLeu: 2.303 ± 0.337
1.35PheMet: 1.35 ± 0.337
3.017PheAsn: 3.017 ± 0.488
0.635PhePro: 0.635 ± 0.226
1.27PheGln: 1.27 ± 0.352
1.35PheArg: 1.35 ± 0.283
2.144PheSer: 2.144 ± 0.445
3.176PheThr: 3.176 ± 0.472
1.191PheVal: 1.191 ± 0.232
0.397PheTrp: 0.397 ± 0.152
1.985PheTyr: 1.985 ± 0.442
0.0PheXaa: 0.0 ± 0.0
Gly
4.526GlyAla: 4.526 ± 0.749
0.318GlyCys: 0.318 ± 0.14
4.129GlyAsp: 4.129 ± 0.677
3.017GlyGlu: 3.017 ± 0.571
2.303GlyPhe: 2.303 ± 0.418
3.256GlyGly: 3.256 ± 0.499
1.27GlyHis: 1.27 ± 0.356
3.811GlyIle: 3.811 ± 0.502
5.161GlyLys: 5.161 ± 0.647
4.605GlyLeu: 4.605 ± 0.757
1.747GlyMet: 1.747 ± 0.391
3.335GlyAsn: 3.335 ± 0.446
0.318GlyPro: 0.318 ± 0.142
2.779GlyGln: 2.779 ± 0.605
2.541GlyArg: 2.541 ± 0.452
2.7GlySer: 2.7 ± 0.483
4.208GlyThr: 4.208 ± 0.59
5.876GlyVal: 5.876 ± 0.67
1.112GlyTrp: 1.112 ± 0.432
3.256GlyTyr: 3.256 ± 0.625
0.0GlyXaa: 0.0 ± 0.0
His
1.112HisAla: 1.112 ± 0.265
0.0HisCys: 0.0 ± 0.0
0.556HisAsp: 0.556 ± 0.163
0.953HisGlu: 0.953 ± 0.24
0.635HisPhe: 0.635 ± 0.179
1.429HisGly: 1.429 ± 0.303
0.318HisHis: 0.318 ± 0.148
1.35HisIle: 1.35 ± 0.332
1.27HisLys: 1.27 ± 0.288
1.191HisLeu: 1.191 ± 0.32
0.159HisMet: 0.159 ± 0.098
0.794HisAsn: 0.794 ± 0.225
0.715HisPro: 0.715 ± 0.3
0.635HisGln: 0.635 ± 0.22
0.635HisArg: 0.635 ± 0.214
1.191HisSer: 1.191 ± 0.285
0.873HisThr: 0.873 ± 0.281
1.032HisVal: 1.032 ± 0.275
0.079HisTrp: 0.079 ± 0.079
0.715HisTyr: 0.715 ± 0.269
0.0HisXaa: 0.0 ± 0.0
Ile
3.732IleAla: 3.732 ± 0.804
0.238IleCys: 0.238 ± 0.138
5.241IleAsp: 5.241 ± 0.694
7.702IleGlu: 7.702 ± 0.874
2.7IlePhe: 2.7 ± 0.374
5.241IleGly: 5.241 ± 0.691
0.794IleHis: 0.794 ± 0.238
3.732IleIle: 3.732 ± 0.589
7.464IleLys: 7.464 ± 0.82
4.288IleLeu: 4.288 ± 0.647
2.303IleMet: 2.303 ± 0.43
4.605IleAsn: 4.605 ± 0.85
2.144IlePro: 2.144 ± 0.383
2.7IleGln: 2.7 ± 0.467
3.494IleArg: 3.494 ± 0.562
3.256IleSer: 3.256 ± 0.596
4.923IleThr: 4.923 ± 0.511
2.859IleVal: 2.859 ± 0.437
0.635IleTrp: 0.635 ± 0.357
1.826IleTyr: 1.826 ± 0.496
0.0IleXaa: 0.0 ± 0.0
Lys
5.082LysAla: 5.082 ± 0.779
0.397LysCys: 0.397 ± 0.159
5.002LysAsp: 5.002 ± 0.541
8.417LysGlu: 8.417 ± 0.79
3.414LysPhe: 3.414 ± 0.419
5.479LysGly: 5.479 ± 0.761
1.747LysHis: 1.747 ± 0.381
5.558LysIle: 5.558 ± 0.622
6.987LysLys: 6.987 ± 1.212
8.178LysLeu: 8.178 ± 0.797
2.859LysMet: 2.859 ± 0.409
5.32LysAsn: 5.32 ± 0.744
2.064LysPro: 2.064 ± 0.464
4.605LysGln: 4.605 ± 0.731
4.208LysArg: 4.208 ± 0.669
4.923LysSer: 4.923 ± 0.594
5.241LysThr: 5.241 ± 0.638
4.844LysVal: 4.844 ± 0.545
1.191LysTrp: 1.191 ± 0.243
4.129LysTyr: 4.129 ± 0.724
0.0LysXaa: 0.0 ± 0.0
Leu
4.208LeuAla: 4.208 ± 0.558
0.556LeuCys: 0.556 ± 0.259
5.558LeuAsp: 5.558 ± 0.618
5.479LeuGlu: 5.479 ± 0.757
3.256LeuPhe: 3.256 ± 0.5
3.653LeuGly: 3.653 ± 0.572
1.032LeuHis: 1.032 ± 0.335
5.876LeuIle: 5.876 ± 0.714
7.781LeuLys: 7.781 ± 0.633
5.32LeuLeu: 5.32 ± 0.736
2.064LeuMet: 2.064 ± 0.445
4.129LeuAsn: 4.129 ± 0.567
2.62LeuPro: 2.62 ± 0.488
3.017LeuGln: 3.017 ± 0.42
3.176LeuArg: 3.176 ± 0.557
4.685LeuSer: 4.685 ± 0.488
5.082LeuThr: 5.082 ± 0.684
4.288LeuVal: 4.288 ± 0.558
0.715LeuTrp: 0.715 ± 0.325
3.176LeuTyr: 3.176 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
2.461MetAla: 2.461 ± 0.633
0.0MetCys: 0.0 ± 0.0
1.032MetAsp: 1.032 ± 0.206
1.112MetGlu: 1.112 ± 0.293
1.27MetPhe: 1.27 ± 0.278
1.27MetGly: 1.27 ± 0.247
0.318MetHis: 0.318 ± 0.157
1.27MetIle: 1.27 ± 0.381
2.144MetLys: 2.144 ± 0.428
1.826MetLeu: 1.826 ± 0.36
0.476MetMet: 0.476 ± 0.181
1.985MetAsn: 1.985 ± 0.502
0.873MetPro: 0.873 ± 0.225
1.509MetGln: 1.509 ± 0.371
0.953MetArg: 0.953 ± 0.284
1.985MetSer: 1.985 ± 0.531
2.859MetThr: 2.859 ± 0.443
1.35MetVal: 1.35 ± 0.321
0.476MetTrp: 0.476 ± 0.18
0.635MetTyr: 0.635 ± 0.204
0.0MetXaa: 0.0 ± 0.0
Asn
4.129AsnAla: 4.129 ± 0.658
0.397AsnCys: 0.397 ± 0.229
4.526AsnAsp: 4.526 ± 0.666
5.082AsnGlu: 5.082 ± 0.541
2.779AsnPhe: 2.779 ± 0.518
5.32AsnGly: 5.32 ± 0.656
0.794AsnHis: 0.794 ± 0.21
3.573AsnIle: 3.573 ± 0.508
5.796AsnLys: 5.796 ± 0.747
4.208AsnLeu: 4.208 ± 0.634
1.191AsnMet: 1.191 ± 0.28
5.558AsnAsn: 5.558 ± 0.806
2.461AsnPro: 2.461 ± 0.469
2.938AsnGln: 2.938 ± 0.555
2.303AsnArg: 2.303 ± 0.416
2.541AsnSer: 2.541 ± 0.356
3.573AsnThr: 3.573 ± 0.567
5.32AsnVal: 5.32 ± 0.777
0.715AsnTrp: 0.715 ± 0.212
2.461AsnTyr: 2.461 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
1.032ProAla: 1.032 ± 0.257
0.0ProCys: 0.0 ± 0.0
1.112ProAsp: 1.112 ± 0.291
2.303ProGlu: 2.303 ± 0.471
1.588ProPhe: 1.588 ± 0.359
1.588ProGly: 1.588 ± 0.411
0.476ProHis: 0.476 ± 0.206
2.7ProIle: 2.7 ± 0.48
2.859ProLys: 2.859 ± 0.433
1.826ProLeu: 1.826 ± 0.404
0.556ProMet: 0.556 ± 0.196
1.985ProAsn: 1.985 ± 0.323
0.715ProPro: 0.715 ± 0.296
1.112ProGln: 1.112 ± 0.308
0.794ProArg: 0.794 ± 0.216
1.429ProSer: 1.429 ± 0.382
1.826ProThr: 1.826 ± 0.349
1.747ProVal: 1.747 ± 0.418
0.159ProTrp: 0.159 ± 0.12
1.35ProTyr: 1.35 ± 0.342
0.0ProXaa: 0.0 ± 0.0
Gln
3.97GlnAla: 3.97 ± 0.708
0.238GlnCys: 0.238 ± 0.129
1.826GlnAsp: 1.826 ± 0.368
2.779GlnGlu: 2.779 ± 0.508
1.826GlnPhe: 1.826 ± 0.332
2.064GlnGly: 2.064 ± 0.303
0.794GlnHis: 0.794 ± 0.242
2.7GlnIle: 2.7 ± 0.411
3.573GlnLys: 3.573 ± 0.471
2.859GlnLeu: 2.859 ± 0.485
0.873GlnMet: 0.873 ± 0.273
2.938GlnAsn: 2.938 ± 0.466
1.747GlnPro: 1.747 ± 0.468
2.303GlnGln: 2.303 ± 0.627
2.144GlnArg: 2.144 ± 0.483
2.859GlnSer: 2.859 ± 0.412
1.906GlnThr: 1.906 ± 0.371
3.097GlnVal: 3.097 ± 0.525
0.397GlnTrp: 0.397 ± 0.179
1.588GlnTyr: 1.588 ± 0.36
0.0GlnXaa: 0.0 ± 0.0
Arg
2.144ArgAla: 2.144 ± 0.409
0.318ArgCys: 0.318 ± 0.147
2.461ArgAsp: 2.461 ± 0.554
3.097ArgGlu: 3.097 ± 0.657
2.144ArgPhe: 2.144 ± 0.429
2.382ArgGly: 2.382 ± 0.401
0.794ArgHis: 0.794 ± 0.235
3.494ArgIle: 3.494 ± 0.563
3.732ArgLys: 3.732 ± 0.492
3.891ArgLeu: 3.891 ± 0.609
0.715ArgMet: 0.715 ± 0.274
3.414ArgAsn: 3.414 ± 0.494
1.27ArgPro: 1.27 ± 0.244
1.35ArgGln: 1.35 ± 0.419
2.382ArgArg: 2.382 ± 0.501
1.906ArgSer: 1.906 ± 0.379
2.223ArgThr: 2.223 ± 0.435
2.62ArgVal: 2.62 ± 0.387
0.318ArgTrp: 0.318 ± 0.15
2.223ArgTyr: 2.223 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
4.208SerAla: 4.208 ± 0.704
0.397SerCys: 0.397 ± 0.234
3.335SerAsp: 3.335 ± 0.628
4.605SerGlu: 4.605 ± 0.714
2.859SerPhe: 2.859 ± 0.497
4.05SerGly: 4.05 ± 0.709
0.476SerHis: 0.476 ± 0.181
5.32SerIle: 5.32 ± 0.697
5.32SerLys: 5.32 ± 0.656
4.05SerLeu: 4.05 ± 0.507
1.191SerMet: 1.191 ± 0.281
3.653SerAsn: 3.653 ± 0.542
0.476SerPro: 0.476 ± 0.152
3.017SerGln: 3.017 ± 0.503
2.303SerArg: 2.303 ± 0.387
3.891SerSer: 3.891 ± 0.697
3.017SerThr: 3.017 ± 0.488
3.176SerVal: 3.176 ± 0.569
0.794SerTrp: 0.794 ± 0.252
2.303SerTyr: 2.303 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
3.653ThrAla: 3.653 ± 0.577
0.159ThrCys: 0.159 ± 0.149
3.653ThrAsp: 3.653 ± 0.552
4.367ThrGlu: 4.367 ± 0.601
2.7ThrPhe: 2.7 ± 0.534
3.573ThrGly: 3.573 ± 0.532
1.191ThrHis: 1.191 ± 0.283
4.208ThrIle: 4.208 ± 0.606
4.605ThrLys: 4.605 ± 0.674
5.241ThrLeu: 5.241 ± 0.51
1.112ThrMet: 1.112 ± 0.308
4.685ThrAsn: 4.685 ± 0.679
1.906ThrPro: 1.906 ± 0.438
2.62ThrGln: 2.62 ± 0.43
2.7ThrArg: 2.7 ± 0.477
4.685ThrSer: 4.685 ± 0.828
3.176ThrThr: 3.176 ± 0.564
3.414ThrVal: 3.414 ± 0.562
0.635ThrTrp: 0.635 ± 0.22
2.223ThrTyr: 2.223 ± 0.491
0.0ThrXaa: 0.0 ± 0.0
Val
3.414ValAla: 3.414 ± 0.803
0.238ValCys: 0.238 ± 0.143
5.32ValAsp: 5.32 ± 0.599
4.844ValGlu: 4.844 ± 0.745
2.382ValPhe: 2.382 ± 0.374
3.414ValGly: 3.414 ± 0.415
0.238ValHis: 0.238 ± 0.117
4.923ValIle: 4.923 ± 0.504
6.273ValLys: 6.273 ± 0.901
4.764ValLeu: 4.764 ± 0.602
1.35ValMet: 1.35 ± 0.304
3.811ValAsn: 3.811 ± 0.624
2.779ValPro: 2.779 ± 0.501
1.985ValGln: 1.985 ± 0.397
2.461ValArg: 2.461 ± 0.444
4.844ValSer: 4.844 ± 0.681
3.811ValThr: 3.811 ± 0.506
4.129ValVal: 4.129 ± 0.59
0.715ValTrp: 0.715 ± 0.259
1.985ValTyr: 1.985 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
0.635TrpAla: 0.635 ± 0.25
0.0TrpCys: 0.0 ± 0.0
0.476TrpAsp: 0.476 ± 0.177
1.112TrpGlu: 1.112 ± 0.227
0.635TrpPhe: 0.635 ± 0.179
0.476TrpGly: 0.476 ± 0.221
0.397TrpHis: 0.397 ± 0.167
0.635TrpIle: 0.635 ± 0.232
0.794TrpLys: 0.794 ± 0.261
1.112TrpLeu: 1.112 ± 0.325
0.318TrpMet: 0.318 ± 0.154
0.635TrpAsn: 0.635 ± 0.262
0.079TrpPro: 0.079 ± 0.071
0.715TrpGln: 0.715 ± 0.339
0.476TrpArg: 0.476 ± 0.211
0.794TrpSer: 0.794 ± 0.253
0.794TrpThr: 0.794 ± 0.182
1.27TrpVal: 1.27 ± 0.299
0.079TrpTrp: 0.079 ± 0.091
0.476TrpTyr: 0.476 ± 0.202
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.064TyrAla: 2.064 ± 0.393
0.318TyrCys: 0.318 ± 0.164
2.7TyrAsp: 2.7 ± 0.506
3.97TyrGlu: 3.97 ± 0.661
1.667TyrPhe: 1.667 ± 0.398
2.382TyrGly: 2.382 ± 0.615
1.032TyrHis: 1.032 ± 0.283
3.097TyrIle: 3.097 ± 0.587
3.494TyrLys: 3.494 ± 0.474
3.573TyrLeu: 3.573 ± 0.612
1.032TyrMet: 1.032 ± 0.237
3.176TyrAsn: 3.176 ± 0.603
1.191TyrPro: 1.191 ± 0.35
1.588TyrGln: 1.588 ± 0.326
1.747TyrArg: 1.747 ± 0.406
2.382TyrSer: 2.382 ± 0.455
2.382TyrThr: 2.382 ± 0.385
2.859TyrVal: 2.859 ± 0.511
0.635TyrTrp: 0.635 ± 0.2
1.667TyrTyr: 1.667 ± 0.303
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12595 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski