Amino acid dipeptide frequency for Staphylococcus phage PMBT8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
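The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_per_mille` is hypothetical, sequences are given in one-letter code (the table uses three-letter codes), pairs are counted as overlapping windows within each protein, and any letter outside the 20 standard amino acids is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter code


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted inside each protein only, so the last
    residue of one protein never pairs with the first residue of the next.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input, not real PMBT8 sequences.
freqs = dipeptide_per_mille(["MKKLE", "MKE"])

# Uniform expectation if all 400 standard dipeptides were equally likely.
uniform = 1000 / 400  # 2.5 per mille, i.e. 0.25%
```

On a real proteome, each value in `freqs` can be compared with `uniform` to see which dipeptides are over- or underrepresented.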

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
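As a small worked example of the unit conversion, the sketch below turns two values from the table (LysGlu at 9.617‰ and CysCys at 0.04‰) into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and the simple comparison against 2.5‰ is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
UNIFORM_PER_MILLE = 1000 / 400  # 2.5 per mille for 400 equally likely dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


lys_glu = 9.617  # from the table
cys_cys = 0.04   # from the table

lys_glu_fraction = per_mille_to_fraction(lys_glu)  # about 0.0096, i.e. roughly 0.96%
lys_glu_label = classify(lys_glu)
cys_cys_label = classify(cys_cys)
```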

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.247 ± 0.45
AlaCys: 0.443 ± 0.164
AlaAsp: 2.253 ± 0.272
AlaGlu: 2.937 ± 0.393
AlaPhe: 1.931 ± 0.258
AlaGly: 2.776 ± 0.542
AlaHis: 0.885 ± 0.192
AlaIle: 3.782 ± 0.368
AlaLys: 3.822 ± 0.486
AlaLeu: 4.305 ± 0.409
AlaMet: 1.972 ± 0.362
AlaAsn: 2.696 ± 0.504
AlaPro: 0.724 ± 0.21
AlaGln: 1.811 ± 0.304
AlaArg: 2.535 ± 0.371
AlaSer: 3.179 ± 0.461
AlaThr: 3.299 ± 0.424
AlaVal: 2.454 ± 0.448
AlaTrp: 0.322 ± 0.116
AlaTyr: 2.334 ± 0.291
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.201 ± 0.084
CysCys: 0.04 ± 0.037
CysAsp: 0.402 ± 0.111
CysGlu: 0.08 ± 0.053
CysPhe: 0.282 ± 0.105
CysGly: 0.724 ± 0.235
CysHis: 0.201 ± 0.118
CysIle: 0.845 ± 0.211
CysLys: 0.684 ± 0.199
CysLeu: 0.523 ± 0.184
CysMet: 0.241 ± 0.104
CysAsn: 0.764 ± 0.231
CysPro: 0.604 ± 0.197
CysGln: 0.402 ± 0.142
CysArg: 0.402 ± 0.151
CysSer: 0.443 ± 0.171
CysThr: 0.322 ± 0.131
CysVal: 0.402 ± 0.141
CysTrp: 0.201 ± 0.1
CysTyr: 0.483 ± 0.13
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.259 ± 0.311
AspCys: 0.523 ± 0.152
AspAsp: 4.104 ± 0.48
AspGlu: 5.432 ± 0.503
AspPhe: 3.662 ± 0.424
AspGly: 3.822 ± 0.465
AspHis: 0.443 ± 0.142
AspIle: 6.478 ± 0.721
AspLys: 6.76 ± 0.568
AspLeu: 5.271 ± 0.539
AspMet: 2.374 ± 0.352
AspAsn: 4.225 ± 0.372
AspPro: 0.966 ± 0.239
AspGln: 0.684 ± 0.142
AspArg: 2.173 ± 0.284
AspSer: 4.466 ± 0.482
AspThr: 2.897 ± 0.33
AspVal: 3.662 ± 0.379
AspTrp: 0.604 ± 0.129
AspTyr: 3.621 ± 0.404
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.575 ± 0.363
GluCys: 0.724 ± 0.162
GluAsp: 4.949 ± 0.671
GluGlu: 7.202 ± 0.868
GluPhe: 3.42 ± 0.376
GluGly: 3.581 ± 0.312
GluHis: 1.69 ± 0.262
GluIle: 5.834 ± 0.54
GluLys: 7.001 ± 0.533
GluLeu: 7.404 ± 0.591
GluMet: 2.052 ± 0.311
GluAsn: 4.667 ± 0.359
GluPro: 1.167 ± 0.256
GluGln: 3.662 ± 0.408
GluArg: 3.179 ± 0.324
GluSer: 4.627 ± 0.496
GluThr: 3.863 ± 0.404
GluVal: 4.506 ± 0.469
GluTrp: 1.006 ± 0.232
GluTyr: 4.748 ± 0.59
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.609 ± 0.261
PheCys: 0.362 ± 0.122
PheAsp: 3.299 ± 0.378
PheGlu: 2.092 ± 0.29
PhePhe: 2.173 ± 0.307
PheGly: 2.414 ± 0.348
PheHis: 0.764 ± 0.182
PheIle: 3.702 ± 0.439
PheLys: 4.506 ± 0.467
PheLeu: 3.621 ± 0.477
PheMet: 0.764 ± 0.184
PheAsn: 2.615 ± 0.409
PhePro: 0.443 ± 0.14
PheGln: 1.288 ± 0.222
PheArg: 1.851 ± 0.287
PheSer: 2.575 ± 0.327
PheThr: 1.972 ± 0.317
PheVal: 2.374 ± 0.391
PheTrp: 0.362 ± 0.133
PheTyr: 2.575 ± 0.318
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.018 ± 0.764
GlyCys: 0.523 ± 0.183
GlyAsp: 3.42 ± 0.387
GlyGlu: 3.581 ± 0.359
GlyPhe: 2.334 ± 0.346
GlyGly: 3.018 ± 0.859
GlyHis: 0.805 ± 0.195
GlyIle: 4.989 ± 0.534
GlyLys: 5.995 ± 0.519
GlyLeu: 5.271 ± 0.545
GlyMet: 1.891 ± 0.268
GlyAsn: 3.179 ± 0.485
GlyPro: 0.563 ± 0.193
GlyGln: 1.972 ± 0.312
GlyArg: 2.575 ± 0.311
GlySer: 3.219 ± 0.475
GlyThr: 3.179 ± 0.541
GlyVal: 3.219 ± 0.369
GlyTrp: 1.046 ± 0.379
GlyTyr: 3.219 ± 0.334
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.724 ± 0.169
HisCys: 0.201 ± 0.089
HisAsp: 1.046 ± 0.231
HisGlu: 1.207 ± 0.247
HisPhe: 0.845 ± 0.17
HisGly: 1.449 ± 0.225
HisHis: 0.483 ± 0.147
HisIle: 1.368 ± 0.277
HisLys: 1.449 ± 0.251
HisLeu: 1.529 ± 0.239
HisMet: 0.322 ± 0.125
HisAsn: 1.207 ± 0.231
HisPro: 0.443 ± 0.128
HisGln: 0.644 ± 0.148
HisArg: 0.805 ± 0.213
HisSer: 1.288 ± 0.237
HisThr: 0.885 ± 0.183
HisVal: 1.006 ± 0.19
HisTrp: 0.04 ± 0.042
HisTyr: 1.167 ± 0.222
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.098 ± 0.38
IleCys: 0.764 ± 0.2
IleAsp: 5.432 ± 0.499
IleGlu: 6.639 ± 0.534
IlePhe: 2.736 ± 0.343
IleGly: 4.024 ± 0.505
IleHis: 0.925 ± 0.193
IleIle: 6.277 ± 0.568
IleLys: 7.645 ± 0.514
IleLeu: 6.076 ± 0.549
IleMet: 2.293 ± 0.329
IleAsn: 5.512 ± 0.469
IlePro: 2.293 ± 0.291
IleGln: 3.179 ± 0.393
IleArg: 2.857 ± 0.316
IleSer: 5.231 ± 0.454
IleThr: 4.627 ± 0.379
IleVal: 3.903 ± 0.385
IleTrp: 0.845 ± 0.232
IleTyr: 4.185 ± 0.466
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.466 ± 0.444
LysCys: 0.523 ± 0.177
LysAsp: 6.317 ± 0.568
LysGlu: 9.617 ± 0.73
LysPhe: 2.978 ± 0.346
LysGly: 5.351 ± 0.493
LysHis: 2.213 ± 0.298
LysIle: 6.8 ± 0.5
LysLys: 9.456 ± 0.706
LysLeu: 6.88 ± 0.547
LysMet: 2.696 ± 0.265
LysAsn: 6.559 ± 0.505
LysPro: 2.173 ± 0.34
LysGln: 3.702 ± 0.395
LysArg: 4.346 ± 0.433
LysSer: 4.828 ± 0.586
LysThr: 4.949 ± 0.439
LysVal: 5.15 ± 0.366
LysTrp: 0.885 ± 0.2
LysTyr: 5.432 ± 0.518
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.782 ± 0.513
LeuCys: 0.443 ± 0.137
LeuAsp: 5.995 ± 0.681
LeuGlu: 6.398 ± 0.678
LeuPhe: 3.138 ± 0.46
LeuGly: 4.587 ± 0.553
LeuHis: 1.449 ± 0.254
LeuIle: 5.231 ± 0.448
LeuLys: 7.202 ± 0.522
LeuLeu: 6.961 ± 0.628
LeuMet: 1.972 ± 0.257
LeuAsn: 5.673 ± 0.455
LeuPro: 2.092 ± 0.341
LeuGln: 3.058 ± 0.355
LeuArg: 3.782 ± 0.448
LeuSer: 6.398 ± 0.379
LeuThr: 4.225 ± 0.368
LeuVal: 4.305 ± 0.487
LeuTrp: 1.006 ± 0.254
LeuTyr: 3.822 ± 0.462
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.253 ± 0.441
MetCys: 0.08 ± 0.058
MetAsp: 1.569 ± 0.226
MetGlu: 2.173 ± 0.269
MetPhe: 0.885 ± 0.196
MetGly: 1.449 ± 0.264
MetHis: 0.362 ± 0.12
MetIle: 2.293 ± 0.293
MetLys: 2.897 ± 0.261
MetLeu: 1.529 ± 0.206
MetMet: 0.443 ± 0.132
MetAsn: 1.891 ± 0.295
MetPro: 1.006 ± 0.226
MetGln: 0.805 ± 0.195
MetArg: 1.006 ± 0.161
MetSer: 1.69 ± 0.26
MetThr: 1.247 ± 0.22
MetVal: 1.77 ± 0.255
MetTrp: 0.402 ± 0.145
MetTyr: 1.247 ± 0.217
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.058 ± 0.413
AsnCys: 0.443 ± 0.153
AsnAsp: 3.662 ± 0.348
AsnGlu: 4.667 ± 0.472
AsnPhe: 2.293 ± 0.299
AsnGly: 4.788 ± 0.465
AsnHis: 0.966 ± 0.215
AsnIle: 7.001 ± 0.576
AsnLys: 8.651 ± 0.571
AsnLeu: 4.828 ± 0.635
AsnMet: 1.77 ± 0.295
AsnAsn: 5.03 ± 0.363
AsnPro: 1.972 ± 0.28
AsnGln: 2.293 ± 0.306
AsnArg: 2.776 ± 0.361
AsnSer: 3.138 ± 0.38
AsnThr: 3.782 ± 0.398
AsnVal: 3.299 ± 0.365
AsnTrp: 0.684 ± 0.159
AsnTyr: 2.817 ± 0.336
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.684 ± 0.174
ProCys: 0.121 ± 0.065
ProAsp: 1.569 ± 0.258
ProGlu: 2.334 ± 0.291
ProPhe: 1.046 ± 0.246
ProGly: 0.684 ± 0.185
ProHis: 0.282 ± 0.09
ProIle: 1.65 ± 0.259
ProLys: 1.73 ± 0.305
ProLeu: 1.77 ± 0.249
ProMet: 0.483 ± 0.125
ProAsn: 1.65 ± 0.32
ProPro: 0.322 ± 0.151
ProGln: 0.604 ± 0.136
ProArg: 0.764 ± 0.166
ProSer: 1.65 ± 0.254
ProThr: 1.288 ± 0.226
ProVal: 2.052 ± 0.3
ProTrp: 0.08 ± 0.047
ProTyr: 1.127 ± 0.212
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.328 ± 0.315
GlnCys: 0.322 ± 0.127
GlnAsp: 2.656 ± 0.312
GlnGlu: 3.38 ± 0.383
GlnPhe: 1.328 ± 0.208
GlnGly: 1.73 ± 0.236
GlnHis: 1.086 ± 0.205
GlnIle: 2.535 ± 0.322
GlnLys: 3.098 ± 0.346
GlnLeu: 2.656 ± 0.344
GlnMet: 0.805 ± 0.206
GlnAsn: 2.334 ± 0.343
GlnPro: 0.644 ± 0.175
GlnGln: 2.173 ± 0.434
GlnArg: 2.012 ± 0.255
GlnSer: 2.495 ± 0.288
GlnThr: 1.207 ± 0.212
GlnVal: 2.012 ± 0.257
GlnTrp: 0.402 ± 0.113
GlnTyr: 2.012 ± 0.262
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.495 ± 0.308
ArgCys: 0.483 ± 0.166
ArgAsp: 2.656 ± 0.337
ArgGlu: 3.46 ± 0.37
ArgPhe: 1.931 ± 0.227
ArgGly: 3.018 ± 0.383
ArgHis: 0.845 ± 0.18
ArgIle: 2.293 ± 0.28
ArgLys: 4.024 ± 0.417
ArgLeu: 3.138 ± 0.327
ArgMet: 1.207 ± 0.244
ArgAsn: 3.299 ± 0.331
ArgPro: 0.604 ± 0.13
ArgGln: 1.328 ± 0.222
ArgArg: 2.173 ± 0.283
ArgSer: 1.891 ± 0.254
ArgThr: 2.133 ± 0.272
ArgVal: 2.495 ± 0.29
ArgTrp: 0.523 ± 0.145
ArgTyr: 1.972 ± 0.236
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.736 ± 0.393
SerCys: 0.604 ± 0.179
SerAsp: 3.299 ± 0.392
SerGlu: 4.064 ± 0.429
SerPhe: 2.978 ± 0.275
SerGly: 3.662 ± 0.491
SerHis: 1.529 ± 0.257
SerIle: 5.191 ± 0.461
SerLys: 5.955 ± 0.608
SerLeu: 5.311 ± 0.414
SerMet: 1.449 ± 0.292
SerAsn: 4.667 ± 0.463
SerPro: 1.569 ± 0.262
SerGln: 2.173 ± 0.328
SerArg: 2.334 ± 0.237
SerSer: 3.943 ± 0.457
SerThr: 3.501 ± 0.458
SerVal: 3.822 ± 0.388
SerTrp: 0.724 ± 0.215
SerTyr: 3.058 ± 0.298
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.857 ± 0.58
ThrCys: 0.322 ± 0.116
ThrAsp: 3.621 ± 0.395
ThrGlu: 3.179 ± 0.373
ThrPhe: 2.334 ± 0.328
ThrGly: 3.621 ± 0.665
ThrHis: 0.845 ± 0.201
ThrIle: 3.903 ± 0.353
ThrLys: 4.024 ± 0.388
ThrLeu: 4.104 ± 0.42
ThrMet: 1.288 ± 0.205
ThrAsn: 2.937 ± 0.412
ThrPro: 1.529 ± 0.29
ThrGln: 2.535 ± 0.371
ThrArg: 1.851 ± 0.277
ThrSer: 2.817 ± 0.373
ThrThr: 4.869 ± 1.242
ThrVal: 3.742 ± 0.356
ThrTrp: 0.443 ± 0.185
ThrTyr: 2.535 ± 0.345
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.299 ± 0.34
ValCys: 0.443 ± 0.158
ValAsp: 4.305 ± 0.431
ValGlu: 4.989 ± 0.654
ValPhe: 2.414 ± 0.276
ValGly: 3.138 ± 0.333
ValHis: 1.167 ± 0.213
ValIle: 3.943 ± 0.329
ValLys: 4.627 ± 0.416
ValLeu: 4.989 ± 0.454
ValMet: 1.328 ± 0.276
ValAsn: 4.144 ± 0.404
ValPro: 1.489 ± 0.256
ValGln: 1.73 ± 0.236
ValArg: 2.133 ± 0.274
ValSer: 4.305 ± 0.441
ValThr: 2.052 ± 0.294
ValVal: 3.138 ± 0.41
ValTrp: 0.644 ± 0.167
ValTyr: 2.535 ± 0.336
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.402 ± 0.169
TrpCys: 0.201 ± 0.096
TrpAsp: 0.684 ± 0.195
TrpGlu: 0.845 ± 0.178
TrpPhe: 0.483 ± 0.233
TrpGly: 0.483 ± 0.148
TrpHis: 0.241 ± 0.096
TrpIle: 0.604 ± 0.144
TrpLys: 1.046 ± 0.26
TrpLeu: 0.966 ± 0.227
TrpMet: 0.201 ± 0.087
TrpAsn: 1.167 ± 0.351
TrpPro: 0.0 ± 0.0
TrpGln: 0.322 ± 0.132
TrpArg: 0.483 ± 0.116
TrpSer: 0.724 ± 0.21
TrpThr: 0.563 ± 0.199
TrpVal: 0.805 ± 0.178
TrpTrp: 0.08 ± 0.053
TrpTyr: 0.604 ± 0.168
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.615 ± 0.404
TyrCys: 0.684 ± 0.235
TyrAsp: 4.185 ± 0.401
TyrGlu: 3.581 ± 0.43
TyrPhe: 2.173 ± 0.304
TyrGly: 2.696 ± 0.377
TyrHis: 0.925 ± 0.172
TyrIle: 3.822 ± 0.433
TyrLys: 4.748 ± 0.473
TyrLeu: 4.386 ± 0.471
TyrMet: 1.408 ± 0.244
TyrAsn: 3.702 ± 0.47
TyrPro: 1.288 ± 0.175
TyrGln: 1.851 ± 0.281
TyrArg: 1.972 ± 0.265
TyrSer: 3.662 ± 0.42
TyrThr: 2.495 ± 0.331
TyrVal: 2.696 ± 0.329
TyrTrp: 0.604 ± 0.153
TyrTyr: 2.776 ± 0.46
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 122 proteins (24854 amino acids)
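Because the proteome is small, the per mille values can be read back as approximate occurrence counts. The sketch below assumes the denominator is close to the total residue count of 24854; the page does not state the exact denominator (residues or counted pairs), so the results are estimates only, and the helper name `approx_count` is hypothetical.

```python
TOTAL_RESIDUES = 24854  # from the statistics line above


def approx_count(per_mille, denominator=TOTAL_RESIDUES):
    """Estimate how many times a dipeptide occurs, given its per mille value.

    Assumption: the denominator is roughly the total residue count.
    """
    return round(per_mille / 1000 * denominator)


cys_cys_count = approx_count(0.04)   # smallest non-zero value in the table
ala_ala_count = approx_count(1.247)
lys_glu_count = approx_count(9.617)  # largest value in the table
```

Under this assumption, the smallest non-zero entries (0.04‰) correspond to a single occurrence in the whole proteome, which helps explain why their bootstrap errors are almost as large as the values themselves.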

Note: The error has been estimated by bootstrapping (x100) at the protein level.
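A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under assumptions rather than the procedure actually used by Proteome-pI: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the sample standard deviation of the replicate estimates. The function names and the choice of standard deviation are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter code) in a protein list."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


# Toy input, not real PMBT8 sequences.
toy = ["MKKLE", "MKE", "AAKK"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in one or two proteins shows a correspondingly large error.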

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski