Amino acid dipeptide frequency for Staphylococcus phage f2b1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
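As an illustration, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is a minimal sketch, not the database's actual pipeline: the function name `dipeptide_frequencies`, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: windows overlap (positions 1-2, 2-3, ...) and never span
    two different proteins; any non-standard letter is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
```

Here `freqs["MK"]` is the per mille share of the Met-Lys pair among all six dipeptides found in the two toy sequences.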

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
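The expected value and the over/under-representation rule can be sketched as follows. This is a hedged example: the helper names are hypothetical, and using the uniform 1/400 baseline as the colouring threshold is an assumption based on the paragraph above, not a confirmed description of how the table was coloured.

```python
def expected_uniform_per_mille(n_amino_acids=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)


def classify(value_per_mille, expected=None):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_uniform_per_mille()
    if value_per_mille > expected:
        return "over"    # shown in red on the original page
    if value_per_mille < expected:
        return "under"   # shown in blue on the original page
    return "expected"


baseline_20 = expected_uniform_per_mille(20)  # 400 dipeptides
baseline_21 = expected_uniform_per_mille(21)  # 441 dipeptides, with Xaa

# Two values taken from the table: AlaAla and AlaCys.
ala_ala = classify(5.545)
ala_cys = classify(0.327)
```

With the 20-letter alphabet the baseline is 2.5‰, so AlaAla (5.545‰) falls above it and AlaCys (0.327‰) below it.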

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
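For example, the unit conversion can be written as a small sketch; the helper names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 5.545 per mille.
ala_ala_fraction = per_mille_to_fraction(5.545)
ala_ala_percent = per_mille_to_percent(5.545)
```

So a table entry of 5.545‰ corresponds to a fraction of about 0.005545, or about 0.5545%.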

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.545AlaAla: 5.545 ± 0.499
0.327AlaCys: 0.327 ± 0.077
4.685AlaAsp: 4.685 ± 0.352
5.033AlaGlu: 5.033 ± 0.335
2.557AlaPhe: 2.557 ± 0.226
5.258AlaGly: 5.258 ± 0.416
0.982AlaHis: 0.982 ± 0.155
4.256AlaIle: 4.256 ± 0.284
4.89AlaLys: 4.89 ± 0.329
5.647AlaLeu: 5.647 ± 0.357
1.596AlaMet: 1.596 ± 0.163
3.028AlaAsn: 3.028 ± 0.224
2.455AlaPro: 2.455 ± 0.26
2.823AlaGln: 2.823 ± 0.29
2.762AlaArg: 2.762 ± 0.262
3.58AlaSer: 3.58 ± 0.255
4.46AlaThr: 4.46 ± 0.289
4.235AlaVal: 4.235 ± 0.304
0.675AlaTrp: 0.675 ± 0.122
3.048AlaTyr: 3.048 ± 0.235
0.0AlaXaa: 0.0 ± 0.0
Cys
0.389CysAla: 0.389 ± 0.095
0.205CysCys: 0.205 ± 0.076
0.675CysAsp: 0.675 ± 0.124
0.368CysGlu: 0.368 ± 0.074
0.389CysPhe: 0.389 ± 0.079
0.593CysGly: 0.593 ± 0.109
0.164CysHis: 0.164 ± 0.053
0.327CysIle: 0.327 ± 0.085
0.696CysLys: 0.696 ± 0.145
0.348CysLeu: 0.348 ± 0.08
0.246CysMet: 0.246 ± 0.067
0.409CysAsn: 0.409 ± 0.092
0.368CysPro: 0.368 ± 0.096
0.143CysGln: 0.143 ± 0.052
0.327CysArg: 0.327 ± 0.084
0.43CysSer: 0.43 ± 0.106
0.552CysThr: 0.552 ± 0.109
0.348CysVal: 0.348 ± 0.08
0.102CysTrp: 0.102 ± 0.058
0.286CysTyr: 0.286 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
3.99AspAla: 3.99 ± 0.251
0.818AspCys: 0.818 ± 0.14
2.496AspAsp: 2.496 ± 0.251
4.256AspGlu: 4.256 ± 0.375
2.885AspPhe: 2.885 ± 0.238
4.337AspGly: 4.337 ± 0.368
0.614AspHis: 0.614 ± 0.118
4.501AspIle: 4.501 ± 0.326
4.276AspLys: 4.276 ± 0.343
5.033AspLeu: 5.033 ± 0.311
1.698AspMet: 1.698 ± 0.16
3.437AspAsn: 3.437 ± 0.239
1.882AspPro: 1.882 ± 0.204
1.678AspGln: 1.678 ± 0.17
2.926AspArg: 2.926 ± 0.274
3.437AspSer: 3.437 ± 0.26
4.399AspThr: 4.399 ± 0.374
4.624AspVal: 4.624 ± 0.231
0.655AspTrp: 0.655 ± 0.119
3.335AspTyr: 3.335 ± 0.275
0.0AspXaa: 0.0 ± 0.0
Glu
5.319GluAla: 5.319 ± 0.349
0.43GluCys: 0.43 ± 0.09
4.972GluAsp: 4.972 ± 0.4
7.304GluGlu: 7.304 ± 0.66
2.823GluPhe: 2.823 ± 0.258
4.624GluGly: 4.624 ± 0.362
1.371GluHis: 1.371 ± 0.204
4.522GluIle: 4.522 ± 0.291
5.585GluLys: 5.585 ± 0.503
6.547GluLeu: 6.547 ± 0.471
2.087GluMet: 2.087 ± 0.211
3.642GluAsn: 3.642 ± 0.273
2.107GluPro: 2.107 ± 0.305
3.458GluGln: 3.458 ± 0.297
3.028GluArg: 3.028 ± 0.273
3.437GluSer: 3.437 ± 0.284
3.458GluThr: 3.458 ± 0.291
5.422GluVal: 5.422 ± 0.344
0.9GluTrp: 0.9 ± 0.122
2.946GluTyr: 2.946 ± 0.249
0.0GluXaa: 0.0 ± 0.0
Phe
2.271PheAla: 2.271 ± 0.213
0.286PheCys: 0.286 ± 0.078
2.517PheAsp: 2.517 ± 0.242
2.496PheGlu: 2.496 ± 0.204
1.453PhePhe: 1.453 ± 0.169
2.701PheGly: 2.701 ± 0.22
0.798PheHis: 0.798 ± 0.123
3.171PheIle: 3.171 ± 0.28
2.823PheLys: 2.823 ± 0.235
2.844PheLeu: 2.844 ± 0.271
1.064PheMet: 1.064 ± 0.162
2.557PheAsn: 2.557 ± 0.263
1.166PhePro: 1.166 ± 0.154
1.228PheGln: 1.228 ± 0.162
1.719PheArg: 1.719 ± 0.204
3.171PheSer: 3.171 ± 0.278
2.803PheThr: 2.803 ± 0.259
2.435PheVal: 2.435 ± 0.198
0.286PheTrp: 0.286 ± 0.068
2.046PheTyr: 2.046 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
4.194GlyAla: 4.194 ± 0.37
0.634GlyCys: 0.634 ± 0.121
3.765GlyAsp: 3.765 ± 0.247
4.726GlyGlu: 4.726 ± 0.343
2.66GlyPhe: 2.66 ± 0.198
5.872GlyGly: 5.872 ± 0.633
1.105GlyHis: 1.105 ± 0.168
4.276GlyIle: 4.276 ± 0.4
4.296GlyLys: 4.296 ± 0.285
5.381GlyLeu: 5.381 ± 0.333
1.903GlyMet: 1.903 ± 0.201
4.051GlyAsn: 4.051 ± 0.369
0.246GlyPro: 0.246 ± 0.067
2.291GlyGln: 2.291 ± 0.232
2.926GlyArg: 2.926 ± 0.255
4.562GlySer: 4.562 ± 0.471
5.729GlyThr: 5.729 ± 0.512
5.545GlyVal: 5.545 ± 0.373
1.023GlyTrp: 1.023 ± 0.176
3.376GlyTyr: 3.376 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
1.064HisAla: 1.064 ± 0.14
0.164HisCys: 0.164 ± 0.054
0.88HisAsp: 0.88 ± 0.11
1.084HisGlu: 1.084 ± 0.166
0.818HisPhe: 0.818 ± 0.136
1.268HisGly: 1.268 ± 0.151
0.491HisHis: 0.491 ± 0.114
1.534HisIle: 1.534 ± 0.198
1.453HisLys: 1.453 ± 0.186
1.371HisLeu: 1.371 ± 0.16
0.389HisMet: 0.389 ± 0.102
1.207HisAsn: 1.207 ± 0.133
0.614HisPro: 0.614 ± 0.097
0.511HisGln: 0.511 ± 0.101
0.818HisArg: 0.818 ± 0.114
1.043HisSer: 1.043 ± 0.165
0.982HisThr: 0.982 ± 0.132
1.105HisVal: 1.105 ± 0.157
0.205HisTrp: 0.205 ± 0.066
0.777HisTyr: 0.777 ± 0.122
0.0HisXaa: 0.0 ± 0.0
Ile
4.583IleAla: 4.583 ± 0.311
0.655IleCys: 0.655 ± 0.125
4.44IleAsp: 4.44 ± 0.267
4.808IleGlu: 4.808 ± 0.367
2.21IlePhe: 2.21 ± 0.228
3.437IleGly: 3.437 ± 0.269
1.514IleHis: 1.514 ± 0.194
3.99IleIle: 3.99 ± 0.29
4.849IleLys: 4.849 ± 0.268
4.44IleLeu: 4.44 ± 0.315
1.412IleMet: 1.412 ± 0.157
3.744IleAsn: 3.744 ± 0.25
2.312IlePro: 2.312 ± 0.238
2.517IleGln: 2.517 ± 0.23
3.192IleArg: 3.192 ± 0.263
3.969IleSer: 3.969 ± 0.282
4.726IleThr: 4.726 ± 0.31
4.317IleVal: 4.317 ± 0.283
0.327IleTrp: 0.327 ± 0.08
2.517IleTyr: 2.517 ± 0.2
0.0IleXaa: 0.0 ± 0.0
Lys
5.299LysAla: 5.299 ± 0.308
0.532LysCys: 0.532 ± 0.14
4.44LysAsp: 4.44 ± 0.308
6.69LysGlu: 6.69 ± 0.535
2.782LysPhe: 2.782 ± 0.248
4.665LysGly: 4.665 ± 0.294
1.391LysHis: 1.391 ± 0.204
4.317LysIle: 4.317 ± 0.247
6.097LysLys: 6.097 ± 0.434
6.036LysLeu: 6.036 ± 0.355
2.557LysMet: 2.557 ± 0.223
3.662LysAsn: 3.662 ± 0.239
2.312LysPro: 2.312 ± 0.24
3.192LysGln: 3.192 ± 0.272
3.253LysArg: 3.253 ± 0.26
3.928LysSer: 3.928 ± 0.35
3.928LysThr: 3.928 ± 0.261
5.176LysVal: 5.176 ± 0.283
0.737LysTrp: 0.737 ± 0.127
2.803LysTyr: 2.803 ± 0.249
0.0LysXaa: 0.0 ± 0.0
Leu
5.299LeuAla: 5.299 ± 0.371
0.43LeuCys: 0.43 ± 0.088
5.585LeuAsp: 5.585 ± 0.289
6.199LeuGlu: 6.199 ± 0.451
2.844LeuPhe: 2.844 ± 0.264
4.849LeuGly: 4.849 ± 0.305
1.371LeuHis: 1.371 ± 0.165
4.828LeuIle: 4.828 ± 0.372
5.851LeuLys: 5.851 ± 0.285
5.504LeuLeu: 5.504 ± 0.46
2.128LeuMet: 2.128 ± 0.217
4.603LeuAsn: 4.603 ± 0.321
3.171LeuPro: 3.171 ± 0.29
3.089LeuGln: 3.089 ± 0.222
4.092LeuArg: 4.092 ± 0.295
4.828LeuSer: 4.828 ± 0.299
5.074LeuThr: 5.074 ± 0.332
5.401LeuVal: 5.401 ± 0.359
0.593LeuTrp: 0.593 ± 0.102
2.967LeuTyr: 2.967 ± 0.213
0.0LeuXaa: 0.0 ± 0.0
Met
2.005MetAla: 2.005 ± 0.185
0.143MetCys: 0.143 ± 0.053
1.678MetAsp: 1.678 ± 0.168
2.23MetGlu: 2.23 ± 0.226
1.105MetPhe: 1.105 ± 0.143
1.473MetGly: 1.473 ± 0.185
0.43MetHis: 0.43 ± 0.098
1.514MetIle: 1.514 ± 0.177
2.68MetLys: 2.68 ± 0.217
1.841MetLeu: 1.841 ± 0.224
0.593MetMet: 0.593 ± 0.103
1.248MetAsn: 1.248 ± 0.155
0.941MetPro: 0.941 ± 0.15
0.982MetGln: 0.982 ± 0.152
1.207MetArg: 1.207 ± 0.143
1.678MetSer: 1.678 ± 0.167
1.76MetThr: 1.76 ± 0.147
1.494MetVal: 1.494 ± 0.183
0.266MetTrp: 0.266 ± 0.073
1.391MetTyr: 1.391 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
3.642AsnAla: 3.642 ± 0.267
0.593AsnCys: 0.593 ± 0.18
2.251AsnAsp: 2.251 ± 0.226
3.008AsnGlu: 3.008 ± 0.212
2.271AsnPhe: 2.271 ± 0.182
4.624AsnGly: 4.624 ± 0.447
0.941AsnHis: 0.941 ± 0.123
3.56AsnIle: 3.56 ± 0.269
3.928AsnLys: 3.928 ± 0.24
4.317AsnLeu: 4.317 ± 0.267
1.268AsnMet: 1.268 ± 0.137
2.885AsnAsn: 2.885 ± 0.295
2.598AsnPro: 2.598 ± 0.265
1.739AsnGln: 1.739 ± 0.199
2.905AsnArg: 2.905 ± 0.201
3.376AsnSer: 3.376 ± 0.265
4.01AsnThr: 4.01 ± 0.308
3.887AsnVal: 3.887 ± 0.343
0.614AsnTrp: 0.614 ± 0.115
2.312AsnTyr: 2.312 ± 0.228
0.0AsnXaa: 0.0 ± 0.0
Pro
2.414ProAla: 2.414 ± 0.198
0.143ProCys: 0.143 ± 0.047
2.517ProAsp: 2.517 ± 0.246
3.151ProGlu: 3.151 ± 0.319
1.166ProPhe: 1.166 ± 0.144
0.962ProGly: 0.962 ± 0.134
0.471ProHis: 0.471 ± 0.086
2.189ProIle: 2.189 ± 0.198
2.291ProLys: 2.291 ± 0.195
2.619ProLeu: 2.619 ± 0.243
0.859ProMet: 0.859 ± 0.14
1.923ProAsn: 1.923 ± 0.225
0.859ProPro: 0.859 ± 0.136
0.88ProGln: 0.88 ± 0.162
1.228ProArg: 1.228 ± 0.146
2.025ProSer: 2.025 ± 0.23
2.803ProThr: 2.803 ± 0.316
2.455ProVal: 2.455 ± 0.277
0.266ProTrp: 0.266 ± 0.075
1.412ProTyr: 1.412 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
2.455GlnAla: 2.455 ± 0.258
0.246GlnCys: 0.246 ± 0.07
2.005GlnAsp: 2.005 ± 0.233
2.885GlnGlu: 2.885 ± 0.274
1.412GlnPhe: 1.412 ± 0.18
2.476GlnGly: 2.476 ± 0.245
0.737GlnHis: 0.737 ± 0.119
2.066GlnIle: 2.066 ± 0.214
2.312GlnLys: 2.312 ± 0.244
3.458GlnLeu: 3.458 ± 0.236
1.207GlnMet: 1.207 ± 0.167
1.555GlnAsn: 1.555 ± 0.176
1.514GlnPro: 1.514 ± 0.317
1.78GlnGln: 1.78 ± 0.299
1.739GlnArg: 1.739 ± 0.198
2.353GlnSer: 2.353 ± 0.193
2.025GlnThr: 2.025 ± 0.253
2.782GlnVal: 2.782 ± 0.207
0.491GlnTrp: 0.491 ± 0.113
1.78GlnTyr: 1.78 ± 0.171
0.0GlnXaa: 0.0 ± 0.0
Arg
2.701ArgAla: 2.701 ± 0.263
0.225ArgCys: 0.225 ± 0.074
2.946ArgAsp: 2.946 ± 0.24
3.805ArgGlu: 3.805 ± 0.379
1.78ArgPhe: 1.78 ± 0.224
3.13ArgGly: 3.13 ± 0.265
0.839ArgHis: 0.839 ± 0.113
2.639ArgIle: 2.639 ± 0.25
3.396ArgLys: 3.396 ± 0.263
4.174ArgLeu: 4.174 ± 0.275
1.494ArgMet: 1.494 ± 0.179
2.312ArgAsn: 2.312 ± 0.235
1.105ArgPro: 1.105 ± 0.153
1.616ArgGln: 1.616 ± 0.195
1.739ArgArg: 1.739 ± 0.193
2.107ArgSer: 2.107 ± 0.211
2.353ArgThr: 2.353 ± 0.183
3.151ArgVal: 3.151 ± 0.242
0.532ArgTrp: 0.532 ± 0.091
2.005ArgTyr: 2.005 ± 0.217
0.0ArgXaa: 0.0 ± 0.0
Ser
3.887SerAla: 3.887 ± 0.258
0.389SerCys: 0.389 ± 0.087
3.089SerAsp: 3.089 ± 0.268
2.905SerGlu: 2.905 ± 0.248
2.844SerPhe: 2.844 ± 0.255
5.094SerGly: 5.094 ± 0.416
1.105SerHis: 1.105 ± 0.163
4.071SerIle: 4.071 ± 0.297
4.194SerLys: 4.194 ± 0.313
4.747SerLeu: 4.747 ± 0.326
1.76SerMet: 1.76 ± 0.195
3.396SerAsn: 3.396 ± 0.317
2.087SerPro: 2.087 ± 0.252
2.373SerGln: 2.373 ± 0.2
2.21SerArg: 2.21 ± 0.255
4.235SerSer: 4.235 ± 0.461
4.071SerThr: 4.071 ± 0.331
4.071SerVal: 4.071 ± 0.359
0.675SerTrp: 0.675 ± 0.128
2.517SerTyr: 2.517 ± 0.231
0.0SerXaa: 0.0 ± 0.0
Thr
5.013ThrAla: 5.013 ± 0.412
0.286ThrCys: 0.286 ± 0.079
4.071ThrAsp: 4.071 ± 0.349
4.419ThrGlu: 4.419 ± 0.336
3.171ThrPhe: 3.171 ± 0.282
5.176ThrGly: 5.176 ± 0.446
1.207ThrHis: 1.207 ± 0.145
4.358ThrIle: 4.358 ± 0.281
4.522ThrLys: 4.522 ± 0.301
5.258ThrLeu: 5.258 ± 0.358
1.105ThrMet: 1.105 ± 0.148
3.56ThrAsn: 3.56 ± 0.282
3.048ThrPro: 3.048 ± 0.286
2.291ThrGln: 2.291 ± 0.254
2.435ThrArg: 2.435 ± 0.254
3.805ThrSer: 3.805 ± 0.325
4.358ThrThr: 4.358 ± 0.422
5.688ThrVal: 5.688 ± 0.366
0.593ThrTrp: 0.593 ± 0.121
2.967ThrTyr: 2.967 ± 0.251
0.0ThrXaa: 0.0 ± 0.0
Val
4.562ValAla: 4.562 ± 0.326
0.348ValCys: 0.348 ± 0.099
5.094ValAsp: 5.094 ± 0.34
5.135ValGlu: 5.135 ± 0.357
2.782ValPhe: 2.782 ± 0.228
4.399ValGly: 4.399 ± 0.284
1.166ValHis: 1.166 ± 0.133
4.358ValIle: 4.358 ± 0.318
5.524ValLys: 5.524 ± 0.391
4.624ValLeu: 4.624 ± 0.349
1.841ValMet: 1.841 ± 0.167
4.44ValAsn: 4.44 ± 0.302
2.537ValPro: 2.537 ± 0.284
2.762ValGln: 2.762 ± 0.242
3.089ValArg: 3.089 ± 0.279
3.867ValSer: 3.867 ± 0.259
5.954ValThr: 5.954 ± 0.455
4.256ValVal: 4.256 ± 0.35
0.573ValTrp: 0.573 ± 0.11
3.233ValTyr: 3.233 ± 0.287
0.0ValXaa: 0.0 ± 0.0
Trp
0.593TrpAla: 0.593 ± 0.107
0.102TrpCys: 0.102 ± 0.05
0.716TrpAsp: 0.716 ± 0.127
0.859TrpGlu: 0.859 ± 0.12
0.491TrpPhe: 0.491 ± 0.123
0.696TrpGly: 0.696 ± 0.121
0.184TrpHis: 0.184 ± 0.063
0.573TrpIle: 0.573 ± 0.116
0.839TrpLys: 0.839 ± 0.118
0.757TrpLeu: 0.757 ± 0.124
0.205TrpMet: 0.205 ± 0.064
0.45TrpAsn: 0.45 ± 0.097
0.0TrpPro: 0.0 ± 0.0
0.307TrpGln: 0.307 ± 0.067
0.368TrpArg: 0.368 ± 0.077
0.818TrpSer: 0.818 ± 0.121
0.471TrpThr: 0.471 ± 0.097
0.9TrpVal: 0.9 ± 0.127
0.143TrpTrp: 0.143 ± 0.071
0.532TrpTyr: 0.532 ± 0.104
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.619TyrAla: 2.619 ± 0.247
0.409TyrCys: 0.409 ± 0.089
2.455TyrAsp: 2.455 ± 0.213
2.619TyrGlu: 2.619 ± 0.214
1.391TyrPhe: 1.391 ± 0.163
2.926TyrGly: 2.926 ± 0.27
0.9TyrHis: 0.9 ± 0.146
2.905TyrIle: 2.905 ± 0.217
3.294TyrLys: 3.294 ± 0.254
3.765TyrLeu: 3.765 ± 0.308
1.228TyrMet: 1.228 ± 0.178
2.639TyrAsn: 2.639 ± 0.25
1.309TyrPro: 1.309 ± 0.146
1.616TyrGln: 1.616 ± 0.159
2.087TyrArg: 2.087 ± 0.167
3.069TyrSer: 3.069 ± 0.24
3.417TyrThr: 3.417 ± 0.263
3.253TyrVal: 3.253 ± 0.252
0.348TyrTrp: 0.348 ± 0.089
1.698TyrTyr: 1.698 ± 0.183
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 224 proteins (48878 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
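A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is summarised with the sample standard deviation. The function names, the fixed seed, and the choice of standard deviation as the reported error are assumptions, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from this proteome).
proteins = ["MKKLAEE", "MAEELKK", "MGGSAAK", "MKEELLA"]
point = dipeptide_per_mille(proteins, "EE")
error = bootstrap_error(proteins, "EE")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.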

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski