Amino acid dipeptide frequency for Staphylococcus phage IME1365_01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
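The counting behind such a table can be sketched in a few lines of Python. This is only a minimal illustration, not the database's own code: it assumes overlapping dipeptides counted within each protein (never across protein boundaries), normalisation by the total number of dipeptides counted, and one-letter amino acid codes instead of the three-letter codes used in the table. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for the 20 standard amino acids:
# 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (20 * 20)

# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

Here `freqs["KK"]` is 2 out of 6 dipeptides, far above the uniform expectation, which is the kind of deviation the red and blue marking highlights.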

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
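As a small worked example of this unit conversion, the sketch below turns a table value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and the comparison against 2.5‰ is an assumption about what "expected" means here; the page does not state the exact rule used for the colouring.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value from the table into a plain fraction."""
    return value * 1e-3


def fold_over_uniform(value, expected=2.5):
    """Ratio of an observed per mille value to the uniform expectation.

    Assumption: 'expected' is 1/400 of all dipeptides, i.e. 2.5 per mille.
    """
    return value / expected


# Two values taken from the table: LeuLys and CysCys.
leu_lys = 8.046
cys_cys = 0.072

leu_lys_fraction = per_mille_to_fraction(leu_lys)  # about 0.008046
leu_lys_fold = fold_over_uniform(leu_lys)          # above 1: overrepresented
cys_cys_fold = fold_over_uniform(cys_cys)          # below 1: underrepresented
```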

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.365AlaAla: 1.365 ± 0.337
0.144AlaCys: 0.144 ± 0.1
2.945AlaAsp: 2.945 ± 0.597
4.741AlaGlu: 4.741 ± 0.73
2.586AlaPhe: 2.586 ± 0.779
3.951AlaGly: 3.951 ± 0.86
1.293AlaHis: 1.293 ± 0.301
5.172AlaIle: 5.172 ± 0.569
6.25AlaLys: 6.25 ± 0.786
4.741AlaLeu: 4.741 ± 0.59
1.437AlaMet: 1.437 ± 0.309
4.382AlaAsn: 4.382 ± 0.525
1.868AlaPro: 1.868 ± 0.288
2.874AlaGln: 2.874 ± 0.771
2.73AlaArg: 2.73 ± 0.45
3.448AlaSer: 3.448 ± 0.63
3.305AlaThr: 3.305 ± 0.524
2.945AlaVal: 2.945 ± 0.651
0.575AlaTrp: 0.575 ± 0.253
2.443AlaTyr: 2.443 ± 0.442
0.0AlaXaa: 0.0 ± 0.0
Cys
0.359CysAla: 0.359 ± 0.185
0.072CysCys: 0.072 ± 0.071
0.144CysAsp: 0.144 ± 0.096
0.647CysGlu: 0.647 ± 0.228
0.575CysPhe: 0.575 ± 0.228
0.503CysGly: 0.503 ± 0.201
0.0CysHis: 0.0 ± 0.0
0.431CysIle: 0.431 ± 0.181
0.287CysLys: 0.287 ± 0.145
0.287CysLeu: 0.287 ± 0.186
0.0CysMet: 0.0 ± 0.0
0.647CysAsn: 0.647 ± 0.204
0.216CysPro: 0.216 ± 0.167
0.072CysGln: 0.072 ± 0.082
0.287CysArg: 0.287 ± 0.16
0.287CysSer: 0.287 ± 0.122
0.287CysThr: 0.287 ± 0.14
0.359CysVal: 0.359 ± 0.173
0.216CysTrp: 0.216 ± 0.12
0.287CysTyr: 0.287 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
2.658AspAla: 2.658 ± 0.45
0.144AspCys: 0.144 ± 0.109
5.532AspAsp: 5.532 ± 0.73
4.957AspGlu: 4.957 ± 0.731
3.376AspPhe: 3.376 ± 0.376
3.807AspGly: 3.807 ± 0.523
0.359AspHis: 0.359 ± 0.138
5.172AspIle: 5.172 ± 0.722
6.25AspLys: 6.25 ± 0.704
5.101AspLeu: 5.101 ± 0.733
1.724AspMet: 1.724 ± 0.364
5.029AspAsn: 5.029 ± 0.761
1.796AspPro: 1.796 ± 0.354
1.221AspGln: 1.221 ± 0.271
1.365AspArg: 1.365 ± 0.29
3.161AspSer: 3.161 ± 0.478
4.095AspThr: 4.095 ± 0.566
4.598AspVal: 4.598 ± 0.518
1.006AspTrp: 1.006 ± 0.27
3.161AspTyr: 3.161 ± 0.52
0.0AspXaa: 0.0 ± 0.0
Glu
3.233GluAla: 3.233 ± 0.666
0.79GluCys: 0.79 ± 0.304
2.73GluAsp: 2.73 ± 0.406
4.167GluGlu: 4.167 ± 0.589
2.443GluPhe: 2.443 ± 0.496
2.802GluGly: 2.802 ± 0.436
1.509GluHis: 1.509 ± 0.379
4.598GluIle: 4.598 ± 0.616
4.67GluLys: 4.67 ± 0.614
7.471GluLeu: 7.471 ± 1.075
1.868GluMet: 1.868 ± 0.427
4.023GluAsn: 4.023 ± 0.528
1.652GluPro: 1.652 ± 0.422
3.736GluGln: 3.736 ± 0.514
3.52GluArg: 3.52 ± 0.708
3.592GluSer: 3.592 ± 0.674
4.167GluThr: 4.167 ± 0.538
3.879GluVal: 3.879 ± 0.59
1.724GluTrp: 1.724 ± 0.336
3.592GluTyr: 3.592 ± 0.564
0.0GluXaa: 0.0 ± 0.0
Phe
2.658PheAla: 2.658 ± 0.46
0.144PheCys: 0.144 ± 0.095
3.161PheAsp: 3.161 ± 0.413
2.371PheGlu: 2.371 ± 0.541
1.149PhePhe: 1.149 ± 0.327
2.155PheGly: 2.155 ± 0.435
0.503PheHis: 0.503 ± 0.178
3.52PheIle: 3.52 ± 0.502
4.239PheLys: 4.239 ± 0.574
2.299PheLeu: 2.299 ± 0.477
1.078PheMet: 1.078 ± 0.264
2.802PheAsn: 2.802 ± 0.533
1.006PhePro: 1.006 ± 0.304
1.006PheGln: 1.006 ± 0.406
2.083PheArg: 2.083 ± 0.349
2.083PheSer: 2.083 ± 0.446
3.017PheThr: 3.017 ± 0.479
2.802PheVal: 2.802 ± 0.502
0.287PheTrp: 0.287 ± 0.169
1.58PheTyr: 1.58 ± 0.355
0.0PheXaa: 0.0 ± 0.0
Gly
3.52GlyAla: 3.52 ± 0.768
0.503GlyCys: 0.503 ± 0.201
3.664GlyAsp: 3.664 ± 0.657
2.658GlyGlu: 2.658 ± 0.561
2.586GlyPhe: 2.586 ± 0.446
2.945GlyGly: 2.945 ± 0.571
1.149GlyHis: 1.149 ± 0.273
5.244GlyIle: 5.244 ± 0.777
5.603GlyLys: 5.603 ± 0.615
4.813GlyLeu: 4.813 ± 0.719
1.724GlyMet: 1.724 ± 0.513
3.664GlyAsn: 3.664 ± 0.665
0.934GlyPro: 0.934 ± 0.276
3.017GlyGln: 3.017 ± 0.408
2.443GlyArg: 2.443 ± 0.434
2.945GlySer: 2.945 ± 0.549
4.526GlyThr: 4.526 ± 0.615
4.167GlyVal: 4.167 ± 0.578
0.79GlyTrp: 0.79 ± 0.321
3.305GlyTyr: 3.305 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
0.934HisAla: 0.934 ± 0.246
0.216HisCys: 0.216 ± 0.114
1.006HisAsp: 1.006 ± 0.222
0.934HisGlu: 0.934 ± 0.258
0.503HisPhe: 0.503 ± 0.206
1.293HisGly: 1.293 ± 0.38
0.647HisHis: 0.647 ± 0.285
1.365HisIle: 1.365 ± 0.308
1.437HisLys: 1.437 ± 0.313
1.365HisLeu: 1.365 ± 0.306
0.287HisMet: 0.287 ± 0.137
0.718HisAsn: 0.718 ± 0.191
0.647HisPro: 0.647 ± 0.187
0.647HisGln: 0.647 ± 0.266
0.718HisArg: 0.718 ± 0.238
1.365HisSer: 1.365 ± 0.254
0.934HisThr: 0.934 ± 0.284
1.293HisVal: 1.293 ± 0.247
0.072HisTrp: 0.072 ± 0.064
0.934HisTyr: 0.934 ± 0.288
0.0HisXaa: 0.0 ± 0.0
Ile
4.885IleAla: 4.885 ± 0.621
0.575IleCys: 0.575 ± 0.236
5.675IleAsp: 5.675 ± 0.799
5.819IleGlu: 5.819 ± 0.698
2.371IlePhe: 2.371 ± 0.458
5.029IleGly: 5.029 ± 0.854
0.934IleHis: 0.934 ± 0.214
4.382IleIle: 4.382 ± 0.551
6.968IleLys: 6.968 ± 0.857
3.807IleLeu: 3.807 ± 0.727
1.58IleMet: 1.58 ± 0.449
4.741IleAsn: 4.741 ± 0.663
2.299IlePro: 2.299 ± 0.328
2.083IleGln: 2.083 ± 0.364
2.945IleArg: 2.945 ± 0.396
4.598IleSer: 4.598 ± 0.88
4.095IleThr: 4.095 ± 0.612
4.598IleVal: 4.598 ± 0.595
1.509IleTrp: 1.509 ± 0.31
3.161IleTyr: 3.161 ± 0.481
0.0IleXaa: 0.0 ± 0.0
Lys
5.029LysAla: 5.029 ± 0.725
0.287LysCys: 0.287 ± 0.142
6.537LysAsp: 6.537 ± 0.614
5.819LysGlu: 5.819 ± 0.923
3.736LysPhe: 3.736 ± 0.461
5.747LysGly: 5.747 ± 0.634
1.652LysHis: 1.652 ± 0.309
4.957LysIle: 4.957 ± 0.751
7.615LysLys: 7.615 ± 1.128
5.747LysLeu: 5.747 ± 0.68
2.227LysMet: 2.227 ± 0.456
5.747LysAsn: 5.747 ± 0.627
3.233LysPro: 3.233 ± 0.795
4.023LysGln: 4.023 ± 0.716
5.244LysArg: 5.244 ± 0.959
5.891LysSer: 5.891 ± 0.694
5.747LysThr: 5.747 ± 0.739
4.885LysVal: 4.885 ± 0.531
1.221LysTrp: 1.221 ± 0.237
4.095LysTyr: 4.095 ± 0.59
0.0LysXaa: 0.0 ± 0.0
Leu
4.598LeuAla: 4.598 ± 0.766
0.359LeuCys: 0.359 ± 0.177
4.885LeuAsp: 4.885 ± 0.611
4.454LeuGlu: 4.454 ± 0.689
3.233LeuPhe: 3.233 ± 0.371
4.095LeuGly: 4.095 ± 0.695
1.006LeuHis: 1.006 ± 0.3
4.741LeuIle: 4.741 ± 0.493
8.046LeuLys: 8.046 ± 0.785
4.813LeuLeu: 4.813 ± 0.613
1.796LeuMet: 1.796 ± 0.36
4.813LeuAsn: 4.813 ± 0.433
2.802LeuPro: 2.802 ± 0.431
3.161LeuGln: 3.161 ± 0.574
3.448LeuArg: 3.448 ± 0.505
4.454LeuSer: 4.454 ± 0.461
3.951LeuThr: 3.951 ± 0.524
4.167LeuVal: 4.167 ± 0.517
0.79LeuTrp: 0.79 ± 0.275
3.089LeuTyr: 3.089 ± 0.559
0.0LeuXaa: 0.0 ± 0.0
Met
2.083MetAla: 2.083 ± 0.551
0.072MetCys: 0.072 ± 0.073
1.221MetAsp: 1.221 ± 0.278
1.006MetGlu: 1.006 ± 0.283
0.862MetPhe: 0.862 ± 0.236
0.862MetGly: 0.862 ± 0.235
0.359MetHis: 0.359 ± 0.165
1.868MetIle: 1.868 ± 0.381
1.94MetLys: 1.94 ± 0.365
1.652MetLeu: 1.652 ± 0.288
0.862MetMet: 0.862 ± 0.306
1.365MetAsn: 1.365 ± 0.33
0.718MetPro: 0.718 ± 0.191
1.509MetGln: 1.509 ± 0.329
1.078MetArg: 1.078 ± 0.258
1.94MetSer: 1.94 ± 0.398
2.371MetThr: 2.371 ± 0.376
1.221MetVal: 1.221 ± 0.319
0.359MetTrp: 0.359 ± 0.142
0.862MetTyr: 0.862 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
5.029AsnAla: 5.029 ± 0.572
0.647AsnCys: 0.647 ± 0.229
5.029AsnAsp: 5.029 ± 0.64
4.741AsnGlu: 4.741 ± 0.604
2.514AsnPhe: 2.514 ± 0.485
5.532AsnGly: 5.532 ± 0.654
1.293AsnHis: 1.293 ± 0.268
4.67AsnIle: 4.67 ± 0.657
5.029AsnLys: 5.029 ± 0.775
5.029AsnLeu: 5.029 ± 0.647
1.149AsnMet: 1.149 ± 0.29
5.532AsnAsn: 5.532 ± 0.776
2.227AsnPro: 2.227 ± 0.474
2.443AsnGln: 2.443 ± 0.433
2.945AsnArg: 2.945 ± 0.477
3.305AsnSer: 3.305 ± 0.469
3.52AsnThr: 3.52 ± 0.589
4.813AsnVal: 4.813 ± 0.729
0.431AsnTrp: 0.431 ± 0.15
2.299AsnTyr: 2.299 ± 0.442
0.0AsnXaa: 0.0 ± 0.0
Pro
1.652ProAla: 1.652 ± 0.31
0.216ProCys: 0.216 ± 0.116
2.443ProAsp: 2.443 ± 0.433
2.227ProGlu: 2.227 ± 0.378
1.437ProPhe: 1.437 ± 0.34
1.58ProGly: 1.58 ± 0.401
0.647ProHis: 0.647 ± 0.243
2.658ProIle: 2.658 ± 0.495
2.514ProLys: 2.514 ± 0.503
1.437ProLeu: 1.437 ± 0.317
0.359ProMet: 0.359 ± 0.218
1.58ProAsn: 1.58 ± 0.442
1.006ProPro: 1.006 ± 0.296
1.006ProGln: 1.006 ± 0.248
1.437ProArg: 1.437 ± 0.361
1.796ProSer: 1.796 ± 0.477
2.802ProThr: 2.802 ± 0.676
2.227ProVal: 2.227 ± 0.393
0.431ProTrp: 0.431 ± 0.205
1.58ProTyr: 1.58 ± 0.359
0.0ProXaa: 0.0 ± 0.0
Gln
3.017GlnAla: 3.017 ± 0.524
0.287GlnCys: 0.287 ± 0.135
2.227GlnAsp: 2.227 ± 0.445
1.868GlnGlu: 1.868 ± 0.399
1.94GlnPhe: 1.94 ± 0.352
2.083GlnGly: 2.083 ± 0.393
0.718GlnHis: 0.718 ± 0.237
2.514GlnIle: 2.514 ± 0.427
3.448GlnLys: 3.448 ± 0.649
2.945GlnLeu: 2.945 ± 0.519
1.293GlnMet: 1.293 ± 0.343
3.592GlnAsn: 3.592 ± 0.591
1.437GlnPro: 1.437 ± 0.274
2.155GlnGln: 2.155 ± 0.723
1.724GlnArg: 1.724 ± 0.368
3.161GlnSer: 3.161 ± 0.516
2.514GlnThr: 2.514 ± 0.41
2.083GlnVal: 2.083 ± 0.428
0.575GlnTrp: 0.575 ± 0.246
1.365GlnTyr: 1.365 ± 0.389
0.0GlnXaa: 0.0 ± 0.0
Arg
3.233ArgAla: 3.233 ± 0.554
0.359ArgCys: 0.359 ± 0.148
2.514ArgAsp: 2.514 ± 0.471
3.089ArgGlu: 3.089 ± 0.503
1.509ArgPhe: 1.509 ± 0.361
2.586ArgGly: 2.586 ± 0.383
0.79ArgHis: 0.79 ± 0.196
2.874ArgIle: 2.874 ± 0.493
3.448ArgLys: 3.448 ± 0.587
3.376ArgLeu: 3.376 ± 0.407
1.078ArgMet: 1.078 ± 0.214
3.376ArgAsn: 3.376 ± 0.483
1.365ArgPro: 1.365 ± 0.231
1.652ArgGln: 1.652 ± 0.436
1.365ArgArg: 1.365 ± 0.305
2.227ArgSer: 2.227 ± 0.496
2.011ArgThr: 2.011 ± 0.41
2.658ArgVal: 2.658 ± 0.496
1.006ArgTrp: 1.006 ± 0.31
2.874ArgTyr: 2.874 ± 0.565
0.0ArgXaa: 0.0 ± 0.0
Ser
3.879SerAla: 3.879 ± 0.681
0.144SerCys: 0.144 ± 0.102
2.945SerAsp: 2.945 ± 0.436
4.239SerGlu: 4.239 ± 0.619
1.58SerPhe: 1.58 ± 0.395
4.239SerGly: 4.239 ± 0.555
1.078SerHis: 1.078 ± 0.277
4.239SerIle: 4.239 ± 0.694
5.316SerLys: 5.316 ± 0.703
4.31SerLeu: 4.31 ± 0.517
1.437SerMet: 1.437 ± 0.274
3.879SerAsn: 3.879 ± 0.501
1.365SerPro: 1.365 ± 0.33
2.083SerGln: 2.083 ± 0.4
3.161SerArg: 3.161 ± 0.544
4.095SerSer: 4.095 ± 0.572
4.526SerThr: 4.526 ± 0.522
4.095SerVal: 4.095 ± 0.772
0.647SerTrp: 0.647 ± 0.196
2.586SerTyr: 2.586 ± 0.385
0.0SerXaa: 0.0 ± 0.0
Thr
3.592ThrAla: 3.592 ± 0.503
0.144ThrCys: 0.144 ± 0.105
4.741ThrAsp: 4.741 ± 0.617
3.017ThrGlu: 3.017 ± 0.472
2.371ThrPhe: 2.371 ± 0.501
4.598ThrGly: 4.598 ± 0.604
1.365ThrHis: 1.365 ± 0.261
5.388ThrIle: 5.388 ± 0.569
5.891ThrLys: 5.891 ± 0.844
4.598ThrLeu: 4.598 ± 0.529
1.149ThrMet: 1.149 ± 0.308
4.31ThrAsn: 4.31 ± 0.687
2.658ThrPro: 2.658 ± 0.427
3.233ThrGln: 3.233 ± 0.534
1.724ThrArg: 1.724 ± 0.304
3.376ThrSer: 3.376 ± 0.538
5.172ThrThr: 5.172 ± 0.858
4.741ThrVal: 4.741 ± 0.578
0.431ThrTrp: 0.431 ± 0.14
2.371ThrTyr: 2.371 ± 0.446
0.0ThrXaa: 0.0 ± 0.0
Val
3.807ValAla: 3.807 ± 0.631
0.287ValCys: 0.287 ± 0.143
3.951ValAsp: 3.951 ± 0.547
4.167ValGlu: 4.167 ± 0.67
2.658ValPhe: 2.658 ± 0.425
3.161ValGly: 3.161 ± 0.618
0.934ValHis: 0.934 ± 0.271
4.382ValIle: 4.382 ± 0.739
4.885ValLys: 4.885 ± 0.548
4.382ValLeu: 4.382 ± 0.544
1.724ValMet: 1.724 ± 0.337
3.951ValAsn: 3.951 ± 0.663
2.371ValPro: 2.371 ± 0.398
2.73ValGln: 2.73 ± 0.483
2.443ValArg: 2.443 ± 0.568
4.382ValSer: 4.382 ± 0.878
4.885ValThr: 4.885 ± 0.627
3.305ValVal: 3.305 ± 0.479
1.006ValTrp: 1.006 ± 0.266
2.371ValTyr: 2.371 ± 0.516
0.0ValXaa: 0.0 ± 0.0
Trp
1.006TrpAla: 1.006 ± 0.295
0.359TrpCys: 0.359 ± 0.175
0.718TrpAsp: 0.718 ± 0.242
0.862TrpGlu: 0.862 ± 0.274
0.718TrpPhe: 0.718 ± 0.247
0.575TrpGly: 0.575 ± 0.232
0.216TrpHis: 0.216 ± 0.118
1.293TrpIle: 1.293 ± 0.283
0.718TrpLys: 0.718 ± 0.254
1.149TrpLeu: 1.149 ± 0.251
0.144TrpMet: 0.144 ± 0.121
1.078TrpAsn: 1.078 ± 0.32
0.072TrpPro: 0.072 ± 0.067
0.647TrpGln: 0.647 ± 0.21
0.718TrpArg: 0.718 ± 0.187
0.862TrpSer: 0.862 ± 0.246
0.934TrpThr: 0.934 ± 0.272
0.862TrpVal: 0.862 ± 0.323
0.072TrpTrp: 0.072 ± 0.083
0.575TrpTyr: 0.575 ± 0.338
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.658TyrAla: 2.658 ± 0.515
0.216TyrCys: 0.216 ± 0.122
2.443TyrAsp: 2.443 ± 0.438
4.382TyrGlu: 4.382 ± 0.761
1.796TyrPhe: 1.796 ± 0.369
2.658TyrGly: 2.658 ± 0.503
0.862TyrHis: 0.862 ± 0.247
2.586TyrIle: 2.586 ± 0.456
4.957TyrLys: 4.957 ± 0.739
3.376TyrLeu: 3.376 ± 0.47
1.221TyrMet: 1.221 ± 0.391
2.945TyrAsn: 2.945 ± 0.526
1.365TyrPro: 1.365 ± 0.368
1.652TyrGln: 1.652 ± 0.331
2.011TyrArg: 2.011 ± 0.36
2.874TyrSer: 2.874 ± 0.441
2.011TyrThr: 2.011 ± 0.347
2.083TyrVal: 2.083 ± 0.433
0.431TyrTrp: 0.431 ± 0.162
2.155TyrTyr: 2.155 ± 0.52
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13921 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
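One way such an error could be obtained is sketched below. It is an assumption-laden illustration rather than the procedure actually used by the database: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported. All function names, the fixed seed, and the one-letter sequence encoding are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        n = len(seq) - 1
        if n <= 0:
            continue
        total += n
        count += sum(1 for i in range(n) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLA", "AKK", "GGGKK", "MLA"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.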

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski