Amino acid dipepetide frequency for Staphylococcus phage IME1323_01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.785AlaAla: 2.785 ± 0.824
0.073AlaCys: 0.073 ± 0.061
3.445AlaAsp: 3.445 ± 0.55
4.031AlaGlu: 4.031 ± 0.516
2.272AlaPhe: 2.272 ± 0.437
2.639AlaGly: 2.639 ± 0.809
1.466AlaHis: 1.466 ± 0.333
4.471AlaIle: 4.471 ± 1.199
5.13AlaLys: 5.13 ± 0.631
4.691AlaLeu: 4.691 ± 0.69
2.272AlaMet: 2.272 ± 0.345
3.591AlaAsn: 3.591 ± 0.634
1.686AlaPro: 1.686 ± 0.454
3.005AlaGln: 3.005 ± 0.705
2.272AlaArg: 2.272 ± 0.341
3.078AlaSer: 3.078 ± 0.642
4.324AlaThr: 4.324 ± 0.493
3.445AlaVal: 3.445 ± 0.85
0.586AlaTrp: 0.586 ± 0.335
2.125AlaTyr: 2.125 ± 0.361
0.0AlaXaa: 0.0 ± 0.0
Cys
0.44CysAla: 0.44 ± 0.22
0.073CysCys: 0.073 ± 0.066
1.026CysAsp: 1.026 ± 0.283
0.366CysGlu: 0.366 ± 0.157
0.147CysPhe: 0.147 ± 0.108
0.66CysGly: 0.66 ± 0.204
0.147CysHis: 0.147 ± 0.135
0.586CysIle: 0.586 ± 0.164
0.513CysLys: 0.513 ± 0.183
0.366CysLeu: 0.366 ± 0.156
0.147CysMet: 0.147 ± 0.103
0.147CysAsn: 0.147 ± 0.095
0.073CysPro: 0.073 ± 0.078
0.0CysGln: 0.0 ± 0.0
0.44CysArg: 0.44 ± 0.196
0.513CysSer: 0.513 ± 0.202
0.0CysThr: 0.0 ± 0.0
0.366CysVal: 0.366 ± 0.141
0.147CysTrp: 0.147 ± 0.098
0.366CysTyr: 0.366 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
4.251AspAla: 4.251 ± 0.907
0.22AspCys: 0.22 ± 0.129
4.031AspAsp: 4.031 ± 0.739
4.984AspGlu: 4.984 ± 0.947
3.884AspPhe: 3.884 ± 0.482
4.544AspGly: 4.544 ± 0.816
0.44AspHis: 0.44 ± 0.189
4.837AspIle: 4.837 ± 0.51
4.911AspLys: 4.911 ± 0.733
5.204AspLeu: 5.204 ± 0.568
2.199AspMet: 2.199 ± 0.372
3.811AspAsn: 3.811 ± 0.488
1.612AspPro: 1.612 ± 0.327
0.88AspGln: 0.88 ± 0.24
2.345AspArg: 2.345 ± 0.445
3.298AspSer: 3.298 ± 0.575
3.078AspThr: 3.078 ± 0.503
4.617AspVal: 4.617 ± 0.541
0.586AspTrp: 0.586 ± 0.195
2.858AspTyr: 2.858 ± 0.476
0.0AspXaa: 0.0 ± 0.0
Glu
3.445GluAla: 3.445 ± 0.653
0.366GluCys: 0.366 ± 0.138
4.251GluAsp: 4.251 ± 0.773
4.324GluGlu: 4.324 ± 0.741
3.884GluPhe: 3.884 ± 0.569
2.932GluGly: 2.932 ± 0.377
1.832GluHis: 1.832 ± 0.394
5.644GluIle: 5.644 ± 0.739
6.596GluLys: 6.596 ± 0.759
6.157GluLeu: 6.157 ± 0.74
2.125GluMet: 2.125 ± 0.383
4.911GluAsn: 4.911 ± 0.761
1.173GluPro: 1.173 ± 0.291
3.445GluGln: 3.445 ± 0.474
3.152GluArg: 3.152 ± 0.68
4.031GluSer: 4.031 ± 0.503
3.958GluThr: 3.958 ± 0.559
4.471GluVal: 4.471 ± 0.553
1.026GluTrp: 1.026 ± 0.343
3.591GluTyr: 3.591 ± 0.519
0.0GluXaa: 0.0 ± 0.0
Phe
2.052PheAla: 2.052 ± 0.423
0.073PheCys: 0.073 ± 0.065
2.639PheAsp: 2.639 ± 0.374
3.371PheGlu: 3.371 ± 0.413
1.979PhePhe: 1.979 ± 0.486
2.932PheGly: 2.932 ± 0.444
0.66PheHis: 0.66 ± 0.206
4.104PheIle: 4.104 ± 0.51
3.665PheLys: 3.665 ± 0.51
2.272PheLeu: 2.272 ± 0.4
1.246PheMet: 1.246 ± 0.277
3.371PheAsn: 3.371 ± 0.444
0.88PhePro: 0.88 ± 0.303
1.246PheGln: 1.246 ± 0.274
1.393PheArg: 1.393 ± 0.319
3.371PheSer: 3.371 ± 0.504
3.152PheThr: 3.152 ± 0.491
2.272PheVal: 2.272 ± 0.407
0.22PheTrp: 0.22 ± 0.116
2.052PheTyr: 2.052 ± 0.404
0.0PheXaa: 0.0 ± 0.0
Gly
3.225GlyAla: 3.225 ± 0.639
0.44GlyCys: 0.44 ± 0.181
2.858GlyAsp: 2.858 ± 0.441
3.078GlyGlu: 3.078 ± 0.555
2.639GlyPhe: 2.639 ± 0.485
2.785GlyGly: 2.785 ± 0.464
1.393GlyHis: 1.393 ± 0.372
5.13GlyIle: 5.13 ± 0.825
5.497GlyLys: 5.497 ± 0.844
3.884GlyLeu: 3.884 ± 0.411
1.246GlyMet: 1.246 ± 0.438
3.738GlyAsn: 3.738 ± 0.423
0.953GlyPro: 0.953 ± 0.345
2.419GlyGln: 2.419 ± 0.425
2.125GlyArg: 2.125 ± 0.316
4.031GlySer: 4.031 ± 0.628
3.005GlyThr: 3.005 ± 0.421
4.764GlyVal: 4.764 ± 0.54
0.586GlyTrp: 0.586 ± 0.235
3.152GlyTyr: 3.152 ± 0.66
0.0GlyXaa: 0.0 ± 0.0
His
1.319HisAla: 1.319 ± 0.295
0.22HisCys: 0.22 ± 0.119
1.099HisAsp: 1.099 ± 0.302
0.88HisGlu: 0.88 ± 0.248
0.88HisPhe: 0.88 ± 0.265
1.319HisGly: 1.319 ± 0.271
0.66HisHis: 0.66 ± 0.31
1.979HisIle: 1.979 ± 0.431
1.173HisLys: 1.173 ± 0.28
1.979HisLeu: 1.979 ± 0.349
0.293HisMet: 0.293 ± 0.133
1.099HisAsn: 1.099 ± 0.341
0.513HisPro: 0.513 ± 0.172
0.586HisGln: 0.586 ± 0.203
0.366HisArg: 0.366 ± 0.141
1.099HisSer: 1.099 ± 0.367
1.686HisThr: 1.686 ± 0.393
1.393HisVal: 1.393 ± 0.353
0.44HisTrp: 0.44 ± 0.176
0.806HisTyr: 0.806 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
4.617IleAla: 4.617 ± 0.709
0.293IleCys: 0.293 ± 0.137
4.984IleAsp: 4.984 ± 0.581
5.13IleGlu: 5.13 ± 0.676
2.712IlePhe: 2.712 ± 0.521
4.178IleGly: 4.178 ± 0.694
1.319IleHis: 1.319 ± 0.304
4.324IleIle: 4.324 ± 0.517
7.916IleLys: 7.916 ± 0.712
5.35IleLeu: 5.35 ± 0.587
1.612IleMet: 1.612 ± 0.287
5.937IleAsn: 5.937 ± 0.859
3.078IlePro: 3.078 ± 0.427
1.979IleGln: 1.979 ± 0.343
2.785IleArg: 2.785 ± 0.401
4.617IleSer: 4.617 ± 0.457
4.544IleThr: 4.544 ± 0.598
5.79IleVal: 5.79 ± 1.227
1.759IleTrp: 1.759 ± 0.776
2.419IleTyr: 2.419 ± 0.615
0.0IleXaa: 0.0 ± 0.0
Lys
5.863LysAla: 5.863 ± 0.582
0.66LysCys: 0.66 ± 0.298
5.79LysAsp: 5.79 ± 0.824
7.109LysGlu: 7.109 ± 1.126
3.884LysPhe: 3.884 ± 0.591
6.23LysGly: 6.23 ± 0.822
2.125LysHis: 2.125 ± 0.373
6.157LysIle: 6.157 ± 0.729
7.842LysLys: 7.842 ± 1.022
6.596LysLeu: 6.596 ± 0.754
2.125LysMet: 2.125 ± 0.36
6.303LysAsn: 6.303 ± 0.783
2.345LysPro: 2.345 ± 0.564
3.958LysGln: 3.958 ± 0.436
4.837LysArg: 4.837 ± 0.68
5.717LysSer: 5.717 ± 0.51
4.617LysThr: 4.617 ± 0.508
3.884LysVal: 3.884 ± 0.524
0.88LysTrp: 0.88 ± 0.231
4.544LysTyr: 4.544 ± 0.76
0.0LysXaa: 0.0 ± 0.0
Leu
4.104LeuAla: 4.104 ± 0.875
0.293LeuCys: 0.293 ± 0.123
4.837LeuAsp: 4.837 ± 0.546
6.816LeuGlu: 6.816 ± 0.686
3.152LeuPhe: 3.152 ± 0.367
4.031LeuGly: 4.031 ± 0.548
1.246LeuHis: 1.246 ± 0.396
5.424LeuIle: 5.424 ± 0.524
6.67LeuLys: 6.67 ± 0.691
5.13LeuLeu: 5.13 ± 0.619
1.906LeuMet: 1.906 ± 0.408
6.157LeuAsn: 6.157 ± 0.63
2.125LeuPro: 2.125 ± 0.373
2.932LeuGln: 2.932 ± 0.39
2.932LeuArg: 2.932 ± 0.37
5.35LeuSer: 5.35 ± 0.529
3.738LeuThr: 3.738 ± 0.514
4.178LeuVal: 4.178 ± 0.585
0.513LeuTrp: 0.513 ± 0.208
3.005LeuTyr: 3.005 ± 0.591
0.0LeuXaa: 0.0 ± 0.0
Met
2.125MetAla: 2.125 ± 0.668
0.366MetCys: 0.366 ± 0.146
1.026MetAsp: 1.026 ± 0.248
2.125MetGlu: 2.125 ± 0.4
0.66MetPhe: 0.66 ± 0.197
1.246MetGly: 1.246 ± 0.492
0.44MetHis: 0.44 ± 0.2
1.393MetIle: 1.393 ± 0.306
2.419MetLys: 2.419 ± 0.397
1.759MetLeu: 1.759 ± 0.455
0.366MetMet: 0.366 ± 0.134
1.906MetAsn: 1.906 ± 0.36
0.586MetPro: 0.586 ± 0.164
1.026MetGln: 1.026 ± 0.284
1.173MetArg: 1.173 ± 0.276
1.906MetSer: 1.906 ± 0.364
1.832MetThr: 1.832 ± 0.363
0.513MetVal: 0.513 ± 0.192
0.366MetTrp: 0.366 ± 0.178
1.026MetTyr: 1.026 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
3.665AsnAla: 3.665 ± 0.598
0.88AsnCys: 0.88 ± 0.369
4.617AsnAsp: 4.617 ± 0.619
4.691AsnGlu: 4.691 ± 0.698
2.785AsnPhe: 2.785 ± 0.466
4.837AsnGly: 4.837 ± 0.6
1.466AsnHis: 1.466 ± 0.334
4.691AsnIle: 4.691 ± 0.558
6.963AsnLys: 6.963 ± 0.739
4.837AsnLeu: 4.837 ± 0.553
1.319AsnMet: 1.319 ± 0.257
5.717AsnAsn: 5.717 ± 1.005
2.345AsnPro: 2.345 ± 0.447
2.492AsnGln: 2.492 ± 0.374
2.858AsnArg: 2.858 ± 0.47
4.104AsnSer: 4.104 ± 0.587
3.445AsnThr: 3.445 ± 0.411
4.617AsnVal: 4.617 ± 0.692
0.806AsnTrp: 0.806 ± 0.269
3.152AsnTyr: 3.152 ± 0.534
0.0AsnXaa: 0.0 ± 0.0
Pro
0.806ProAla: 0.806 ± 0.19
0.44ProCys: 0.44 ± 0.197
1.539ProAsp: 1.539 ± 0.311
1.612ProGlu: 1.612 ± 0.341
1.393ProPhe: 1.393 ± 0.296
0.953ProGly: 0.953 ± 0.343
0.44ProHis: 0.44 ± 0.178
2.565ProIle: 2.565 ± 0.481
2.785ProLys: 2.785 ± 0.43
1.759ProLeu: 1.759 ± 0.347
0.586ProMet: 0.586 ± 0.188
1.393ProAsn: 1.393 ± 0.227
0.733ProPro: 0.733 ± 0.154
1.246ProGln: 1.246 ± 0.339
1.393ProArg: 1.393 ± 0.303
1.759ProSer: 1.759 ± 0.4
1.539ProThr: 1.539 ± 0.413
2.052ProVal: 2.052 ± 0.345
0.147ProTrp: 0.147 ± 0.091
1.686ProTyr: 1.686 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
2.052GlnAla: 2.052 ± 0.408
0.22GlnCys: 0.22 ± 0.131
2.052GlnAsp: 2.052 ± 0.378
2.419GlnGlu: 2.419 ± 0.535
1.539GlnPhe: 1.539 ± 0.316
1.466GlnGly: 1.466 ± 0.371
0.22GlnHis: 0.22 ± 0.127
2.125GlnIle: 2.125 ± 0.369
3.152GlnLys: 3.152 ± 0.473
3.518GlnLeu: 3.518 ± 0.404
0.733GlnMet: 0.733 ± 0.24
3.078GlnAsn: 3.078 ± 0.48
1.246GlnPro: 1.246 ± 0.292
2.052GlnGln: 2.052 ± 0.557
1.832GlnArg: 1.832 ± 0.386
2.345GlnSer: 2.345 ± 0.43
2.565GlnThr: 2.565 ± 0.427
2.199GlnVal: 2.199 ± 0.349
0.293GlnTrp: 0.293 ± 0.149
1.832GlnTyr: 1.832 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
2.345ArgAla: 2.345 ± 0.484
0.073ArgCys: 0.073 ± 0.07
2.932ArgAsp: 2.932 ± 0.477
3.005ArgGlu: 3.005 ± 0.445
1.832ArgPhe: 1.832 ± 0.393
2.419ArgGly: 2.419 ± 0.372
1.246ArgHis: 1.246 ± 0.273
2.785ArgIle: 2.785 ± 0.423
4.471ArgLys: 4.471 ± 0.642
3.371ArgLeu: 3.371 ± 0.685
0.733ArgMet: 0.733 ± 0.214
2.712ArgAsn: 2.712 ± 0.369
0.586ArgPro: 0.586 ± 0.21
1.393ArgGln: 1.393 ± 0.302
1.832ArgArg: 1.832 ± 0.488
2.052ArgSer: 2.052 ± 0.427
1.979ArgThr: 1.979 ± 0.316
1.979ArgVal: 1.979 ± 0.315
0.586ArgTrp: 0.586 ± 0.22
2.419ArgTyr: 2.419 ± 0.463
0.0ArgXaa: 0.0 ± 0.0
Ser
3.884SerAla: 3.884 ± 1.228
0.44SerCys: 0.44 ± 0.183
4.544SerAsp: 4.544 ± 0.559
3.518SerGlu: 3.518 ± 0.492
2.272SerPhe: 2.272 ± 0.389
4.398SerGly: 4.398 ± 0.58
1.686SerHis: 1.686 ± 0.338
5.057SerIle: 5.057 ± 0.61
6.01SerLys: 6.01 ± 0.684
4.984SerLeu: 4.984 ± 0.496
0.953SerMet: 0.953 ± 0.232
4.471SerAsn: 4.471 ± 0.497
1.759SerPro: 1.759 ± 0.354
2.199SerGln: 2.199 ± 0.348
2.125SerArg: 2.125 ± 0.292
4.031SerSer: 4.031 ± 0.794
3.225SerThr: 3.225 ± 0.438
3.738SerVal: 3.738 ± 0.569
0.586SerTrp: 0.586 ± 0.166
2.199SerTyr: 2.199 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
2.858ThrAla: 2.858 ± 0.444
0.147ThrCys: 0.147 ± 0.103
3.445ThrAsp: 3.445 ± 0.438
4.178ThrGlu: 4.178 ± 0.657
2.932ThrPhe: 2.932 ± 0.432
2.785ThrGly: 2.785 ± 0.374
1.393ThrHis: 1.393 ± 0.272
5.57ThrIle: 5.57 ± 1.167
4.544ThrLys: 4.544 ± 0.651
4.764ThrLeu: 4.764 ± 0.765
1.539ThrMet: 1.539 ± 0.345
3.225ThrAsn: 3.225 ± 0.42
2.199ThrPro: 2.199 ± 0.314
1.832ThrGln: 1.832 ± 0.385
1.832ThrArg: 1.832 ± 0.319
3.884ThrSer: 3.884 ± 0.559
5.277ThrThr: 5.277 ± 0.915
5.204ThrVal: 5.204 ± 0.809
0.513ThrTrp: 0.513 ± 0.181
2.712ThrTyr: 2.712 ± 0.525
0.0ThrXaa: 0.0 ± 0.0
Val
4.251ValAla: 4.251 ± 0.946
0.733ValCys: 0.733 ± 0.202
4.837ValAsp: 4.837 ± 0.536
5.057ValGlu: 5.057 ± 0.705
1.686ValPhe: 1.686 ± 0.268
3.225ValGly: 3.225 ± 0.493
0.88ValHis: 0.88 ± 0.284
4.691ValIle: 4.691 ± 0.657
6.01ValLys: 6.01 ± 0.762
3.225ValLeu: 3.225 ± 0.611
1.466ValMet: 1.466 ± 0.342
4.837ValAsn: 4.837 ± 0.74
2.125ValPro: 2.125 ± 0.384
1.979ValGln: 1.979 ± 0.338
2.565ValArg: 2.565 ± 0.334
3.665ValSer: 3.665 ± 0.883
5.13ValThr: 5.13 ± 1.051
4.398ValVal: 4.398 ± 0.598
0.66ValTrp: 0.66 ± 0.244
1.686ValTyr: 1.686 ± 0.398
0.0ValXaa: 0.0 ± 0.0
Trp
0.953TrpAla: 0.953 ± 0.288
0.147TrpCys: 0.147 ± 0.106
0.366TrpAsp: 0.366 ± 0.128
0.806TrpGlu: 0.806 ± 0.24
0.586TrpPhe: 0.586 ± 0.244
0.733TrpGly: 0.733 ± 0.202
0.147TrpHis: 0.147 ± 0.126
0.806TrpIle: 0.806 ± 0.297
0.88TrpLys: 0.88 ± 0.29
1.319TrpLeu: 1.319 ± 0.422
0.147TrpMet: 0.147 ± 0.116
1.173TrpAsn: 1.173 ± 0.285
0.0TrpPro: 0.0 ± 0.0
0.66TrpGln: 0.66 ± 0.196
0.366TrpArg: 0.366 ± 0.155
0.88TrpSer: 0.88 ± 0.413
1.173TrpThr: 1.173 ± 0.676
0.66TrpVal: 0.66 ± 0.165
0.073TrpTrp: 0.073 ± 0.062
0.293TrpTyr: 0.293 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.345TyrAla: 2.345 ± 0.427
0.366TyrCys: 0.366 ± 0.179
2.272TyrAsp: 2.272 ± 0.495
3.811TyrGlu: 3.811 ± 0.593
1.832TyrPhe: 1.832 ± 0.426
2.639TyrGly: 2.639 ± 0.563
0.586TyrHis: 0.586 ± 0.205
3.005TyrIle: 3.005 ± 0.578
4.398TyrLys: 4.398 ± 0.736
3.371TyrLeu: 3.371 ± 0.671
1.173TyrMet: 1.173 ± 0.339
2.712TyrAsn: 2.712 ± 0.555
0.806TyrPro: 0.806 ± 0.258
1.612TyrGln: 1.612 ± 0.3
2.125TyrArg: 2.125 ± 0.498
2.345TyrSer: 2.345 ± 0.378
2.565TyrThr: 2.565 ± 0.45
2.565TyrVal: 2.565 ± 0.37
1.393TyrTrp: 1.393 ± 0.515
1.979TyrTyr: 1.979 ± 0.472
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski