Amino acid dipepetide frequency for Erwinia phage phiEa104

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.932AlaAla: 6.932 ± 0.8
1.142AlaCys: 1.142 ± 0.207
4.2AlaAsp: 4.2 ± 0.437
5.546AlaGlu: 5.546 ± 0.514
2.732AlaPhe: 2.732 ± 0.32
5.913AlaGly: 5.913 ± 0.673
1.468AlaHis: 1.468 ± 0.261
4.649AlaIle: 4.649 ± 0.419
5.342AlaLys: 5.342 ± 0.488
6.157AlaLeu: 6.157 ± 0.464
2.895AlaMet: 2.895 ± 0.403
3.507AlaAsn: 3.507 ± 0.438
1.916AlaPro: 1.916 ± 0.229
2.854AlaGln: 2.854 ± 0.397
3.221AlaArg: 3.221 ± 0.365
5.097AlaSer: 5.097 ± 0.507
5.138AlaThr: 5.138 ± 0.668
5.872AlaVal: 5.872 ± 0.472
0.979AlaTrp: 0.979 ± 0.233
3.384AlaTyr: 3.384 ± 0.289
0.0AlaXaa: 0.0 ± 0.0
Cys
0.693CysAla: 0.693 ± 0.177
0.163CysCys: 0.163 ± 0.075
0.775CysAsp: 0.775 ± 0.149
0.938CysGlu: 0.938 ± 0.226
0.734CysPhe: 0.734 ± 0.177
0.652CysGly: 0.652 ± 0.18
0.408CysHis: 0.408 ± 0.113
0.571CysIle: 0.571 ± 0.169
0.816CysLys: 0.816 ± 0.173
0.897CysLeu: 0.897 ± 0.171
0.204CysMet: 0.204 ± 0.084
0.571CysAsn: 0.571 ± 0.144
0.53CysPro: 0.53 ± 0.167
0.489CysGln: 0.489 ± 0.115
0.449CysArg: 0.449 ± 0.127
0.693CysSer: 0.693 ± 0.163
0.449CysThr: 0.449 ± 0.156
0.734CysVal: 0.734 ± 0.164
0.0CysTrp: 0.0 ± 0.0
0.652CysTyr: 0.652 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
5.546AspAla: 5.546 ± 0.385
0.571AspCys: 0.571 ± 0.132
4.363AspAsp: 4.363 ± 0.578
5.138AspGlu: 5.138 ± 0.4
3.221AspPhe: 3.221 ± 0.341
4.771AspGly: 4.771 ± 0.512
0.775AspHis: 0.775 ± 0.163
3.384AspIle: 3.384 ± 0.334
4.037AspLys: 4.037 ± 0.341
4.893AspLeu: 4.893 ± 0.491
1.631AspMet: 1.631 ± 0.24
3.874AspAsn: 3.874 ± 0.459
1.713AspPro: 1.713 ± 0.307
1.101AspGln: 1.101 ± 0.21
2.08AspArg: 2.08 ± 0.35
5.056AspSer: 5.056 ± 0.415
3.425AspThr: 3.425 ± 0.443
4.526AspVal: 4.526 ± 0.382
1.06AspTrp: 1.06 ± 0.232
3.017AspTyr: 3.017 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
4.771GluAla: 4.771 ± 0.523
0.775GluCys: 0.775 ± 0.185
3.466GluAsp: 3.466 ± 0.35
4.649GluGlu: 4.649 ± 0.533
3.466GluPhe: 3.466 ± 0.385
4.078GluGly: 4.078 ± 0.346
1.509GluHis: 1.509 ± 0.233
4.159GluIle: 4.159 ± 0.434
4.689GluLys: 4.689 ± 0.396
5.342GluLeu: 5.342 ± 0.452
1.957GluMet: 1.957 ± 0.334
3.181GluAsn: 3.181 ± 0.358
1.55GluPro: 1.55 ± 0.321
2.487GluGln: 2.487 ± 0.334
3.099GluArg: 3.099 ± 0.372
4.322GluSer: 4.322 ± 0.382
3.384GluThr: 3.384 ± 0.335
4.037GluVal: 4.037 ± 0.401
0.775GluTrp: 0.775 ± 0.174
1.998GluTyr: 1.998 ± 0.317
0.0GluXaa: 0.0 ± 0.0
Phe
3.466PheAla: 3.466 ± 0.424
0.285PheCys: 0.285 ± 0.106
2.895PheAsp: 2.895 ± 0.348
2.447PheGlu: 2.447 ± 0.309
1.998PhePhe: 1.998 ± 0.239
2.936PheGly: 2.936 ± 0.305
0.326PheHis: 0.326 ± 0.113
2.61PheIle: 2.61 ± 0.327
2.936PheLys: 2.936 ± 0.312
2.977PheLeu: 2.977 ± 0.329
1.427PheMet: 1.427 ± 0.251
2.161PheAsn: 2.161 ± 0.323
1.916PhePro: 1.916 ± 0.324
1.183PheGln: 1.183 ± 0.182
1.753PheArg: 1.753 ± 0.269
2.61PheSer: 2.61 ± 0.317
3.262PheThr: 3.262 ± 0.403
2.08PheVal: 2.08 ± 0.3
0.449PheTrp: 0.449 ± 0.117
1.468PheTyr: 1.468 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
5.505GlyAla: 5.505 ± 0.476
0.897GlyCys: 0.897 ± 0.203
4.363GlyAsp: 4.363 ± 0.462
4.404GlyGlu: 4.404 ± 0.453
3.099GlyPhe: 3.099 ± 0.42
4.118GlyGly: 4.118 ± 0.543
1.386GlyHis: 1.386 ± 0.203
4.485GlyIle: 4.485 ± 0.424
5.831GlyLys: 5.831 ± 0.534
6.28GlyLeu: 6.28 ± 0.5
1.631GlyMet: 1.631 ± 0.246
3.181GlyAsn: 3.181 ± 0.445
0.204GlyPro: 0.204 ± 0.112
2.487GlyGln: 2.487 ± 0.34
2.854GlyArg: 2.854 ± 0.318
4.404GlySer: 4.404 ± 0.461
3.751GlyThr: 3.751 ± 0.444
5.709GlyVal: 5.709 ± 0.474
0.856GlyTrp: 0.856 ± 0.211
3.384GlyTyr: 3.384 ± 0.362
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.248
0.245HisCys: 0.245 ± 0.093
1.468HisAsp: 1.468 ± 0.255
1.06HisGlu: 1.06 ± 0.164
0.652HisPhe: 0.652 ± 0.171
1.509HisGly: 1.509 ± 0.271
0.449HisHis: 0.449 ± 0.134
0.856HisIle: 0.856 ± 0.186
1.55HisLys: 1.55 ± 0.238
1.346HisLeu: 1.346 ± 0.233
0.612HisMet: 0.612 ± 0.137
1.183HisAsn: 1.183 ± 0.275
0.734HisPro: 0.734 ± 0.166
0.856HisGln: 0.856 ± 0.184
1.101HisArg: 1.101 ± 0.214
1.427HisSer: 1.427 ± 0.224
1.019HisThr: 1.019 ± 0.215
1.305HisVal: 1.305 ± 0.24
0.204HisTrp: 0.204 ± 0.078
0.856HisTyr: 0.856 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
5.097IleAla: 5.097 ± 0.467
0.571IleCys: 0.571 ± 0.15
3.996IleAsp: 3.996 ± 0.361
4.2IleGlu: 4.2 ± 0.424
2.039IlePhe: 2.039 ± 0.247
3.996IleGly: 3.996 ± 0.397
1.142IleHis: 1.142 ± 0.209
3.303IleIle: 3.303 ± 0.367
4.445IleLys: 4.445 ± 0.415
4.241IleLeu: 4.241 ± 0.463
1.346IleMet: 1.346 ± 0.29
3.711IleAsn: 3.711 ± 0.399
1.916IlePro: 1.916 ± 0.275
2.528IleGln: 2.528 ± 0.341
2.447IleArg: 2.447 ± 0.28
3.915IleSer: 3.915 ± 0.397
3.507IleThr: 3.507 ± 0.401
3.751IleVal: 3.751 ± 0.289
0.734IleTrp: 0.734 ± 0.146
1.998IleTyr: 1.998 ± 0.226
0.0IleXaa: 0.0 ± 0.0
Lys
6.932LysAla: 6.932 ± 0.593
0.612LysCys: 0.612 ± 0.184
4.608LysAsp: 4.608 ± 0.46
3.915LysGlu: 3.915 ± 0.411
2.039LysPhe: 2.039 ± 0.244
5.342LysGly: 5.342 ± 0.505
0.938LysHis: 0.938 ± 0.163
3.425LysIle: 3.425 ± 0.365
4.771LysLys: 4.771 ± 0.415
5.097LysLeu: 5.097 ± 0.389
1.794LysMet: 1.794 ± 0.27
3.588LysAsn: 3.588 ± 0.361
2.365LysPro: 2.365 ± 0.393
2.569LysGln: 2.569 ± 0.324
3.548LysArg: 3.548 ± 0.38
5.097LysSer: 5.097 ± 0.448
4.485LysThr: 4.485 ± 0.393
5.382LysVal: 5.382 ± 0.445
0.612LysTrp: 0.612 ± 0.148
2.691LysTyr: 2.691 ± 0.339
0.0LysXaa: 0.0 ± 0.0
Leu
6.728LeuAla: 6.728 ± 0.427
0.938LeuCys: 0.938 ± 0.243
5.627LeuAsp: 5.627 ± 0.42
5.505LeuGlu: 5.505 ± 0.584
3.058LeuPhe: 3.058 ± 0.372
3.711LeuGly: 3.711 ± 0.352
1.55LeuHis: 1.55 ± 0.263
4.2LeuIle: 4.2 ± 0.414
5.872LeuLys: 5.872 ± 0.586
5.994LeuLeu: 5.994 ± 0.519
2.528LeuMet: 2.528 ± 0.378
4.282LeuAsn: 4.282 ± 0.405
3.262LeuPro: 3.262 ± 0.369
2.691LeuGln: 2.691 ± 0.332
3.833LeuArg: 3.833 ± 0.514
5.26LeuSer: 5.26 ± 0.476
5.709LeuThr: 5.709 ± 0.377
4.608LeuVal: 4.608 ± 0.401
1.305LeuTrp: 1.305 ± 0.199
2.487LeuTyr: 2.487 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
2.65MetAla: 2.65 ± 0.333
0.285MetCys: 0.285 ± 0.114
1.55MetAsp: 1.55 ± 0.274
1.55MetGlu: 1.55 ± 0.236
0.652MetPhe: 0.652 ± 0.164
1.957MetGly: 1.957 ± 0.304
0.285MetHis: 0.285 ± 0.114
1.794MetIle: 1.794 ± 0.253
2.406MetLys: 2.406 ± 0.275
2.161MetLeu: 2.161 ± 0.317
0.693MetMet: 0.693 ± 0.182
1.916MetAsn: 1.916 ± 0.298
0.856MetPro: 0.856 ± 0.181
1.183MetGln: 1.183 ± 0.219
1.101MetArg: 1.101 ± 0.19
2.283MetSer: 2.283 ± 0.324
2.243MetThr: 2.243 ± 0.238
1.427MetVal: 1.427 ± 0.26
0.0MetTrp: 0.0 ± 0.0
1.06MetTyr: 1.06 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.466AsnAla: 3.466 ± 0.424
0.53AsnCys: 0.53 ± 0.131
2.732AsnAsp: 2.732 ± 0.346
2.324AsnGlu: 2.324 ± 0.249
1.794AsnPhe: 1.794 ± 0.244
4.73AsnGly: 4.73 ± 0.51
1.305AsnHis: 1.305 ± 0.282
3.874AsnIle: 3.874 ± 0.42
2.936AsnLys: 2.936 ± 0.311
4.689AsnLeu: 4.689 ± 0.364
1.305AsnMet: 1.305 ± 0.208
2.406AsnAsn: 2.406 ± 0.296
3.181AsnPro: 3.181 ± 0.357
1.998AsnGln: 1.998 ± 0.266
2.365AsnArg: 2.365 ± 0.295
3.792AsnSer: 3.792 ± 0.392
2.65AsnThr: 2.65 ± 0.338
2.936AsnVal: 2.936 ± 0.271
0.775AsnTrp: 0.775 ± 0.146
1.794AsnTyr: 1.794 ± 0.266
0.0AsnXaa: 0.0 ± 0.0
Pro
2.202ProAla: 2.202 ± 0.278
0.204ProCys: 0.204 ± 0.088
2.569ProAsp: 2.569 ± 0.324
2.936ProGlu: 2.936 ± 0.385
2.12ProPhe: 2.12 ± 0.324
0.489ProGly: 0.489 ± 0.195
0.652ProHis: 0.652 ± 0.189
1.386ProIle: 1.386 ± 0.208
1.998ProLys: 1.998 ± 0.275
2.569ProLeu: 2.569 ± 0.321
0.693ProMet: 0.693 ± 0.155
1.916ProAsn: 1.916 ± 0.255
0.775ProPro: 0.775 ± 0.172
1.06ProGln: 1.06 ± 0.23
1.305ProArg: 1.305 ± 0.266
1.957ProSer: 1.957 ± 0.317
2.283ProThr: 2.283 ± 0.347
2.569ProVal: 2.569 ± 0.387
0.408ProTrp: 0.408 ± 0.15
1.386ProTyr: 1.386 ± 0.208
0.0ProXaa: 0.0 ± 0.0
Gln
2.773GlnAla: 2.773 ± 0.369
0.449GlnCys: 0.449 ± 0.124
1.386GlnAsp: 1.386 ± 0.22
1.876GlnGlu: 1.876 ± 0.254
1.916GlnPhe: 1.916 ± 0.296
2.528GlnGly: 2.528 ± 0.418
0.693GlnHis: 0.693 ± 0.209
2.569GlnIle: 2.569 ± 0.281
2.365GlnLys: 2.365 ± 0.294
3.221GlnLeu: 3.221 ± 0.349
1.427GlnMet: 1.427 ± 0.244
1.998GlnAsn: 1.998 ± 0.311
1.468GlnPro: 1.468 ± 0.221
1.631GlnGln: 1.631 ± 0.259
1.427GlnArg: 1.427 ± 0.22
2.202GlnSer: 2.202 ± 0.268
2.732GlnThr: 2.732 ± 0.342
1.794GlnVal: 1.794 ± 0.248
0.612GlnTrp: 0.612 ± 0.147
1.631GlnTyr: 1.631 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
2.936ArgAla: 2.936 ± 0.316
0.652ArgCys: 0.652 ± 0.136
2.61ArgAsp: 2.61 ± 0.385
2.406ArgGlu: 2.406 ± 0.293
1.794ArgPhe: 1.794 ± 0.265
2.61ArgGly: 2.61 ± 0.366
1.509ArgHis: 1.509 ± 0.236
3.017ArgIle: 3.017 ± 0.304
3.181ArgLys: 3.181 ± 0.387
3.425ArgLeu: 3.425 ± 0.365
1.59ArgMet: 1.59 ± 0.237
1.916ArgAsn: 1.916 ± 0.262
0.979ArgPro: 0.979 ± 0.148
2.039ArgGln: 2.039 ± 0.293
2.283ArgArg: 2.283 ± 0.38
2.161ArgSer: 2.161 ± 0.316
1.957ArgThr: 1.957 ± 0.27
3.629ArgVal: 3.629 ± 0.419
1.019ArgTrp: 1.019 ± 0.212
2.161ArgTyr: 2.161 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
4.934SerAla: 4.934 ± 0.559
0.734SerCys: 0.734 ± 0.191
4.689SerAsp: 4.689 ± 0.42
3.833SerGlu: 3.833 ± 0.45
2.814SerPhe: 2.814 ± 0.289
5.913SerGly: 5.913 ± 0.521
1.346SerHis: 1.346 ± 0.254
3.384SerIle: 3.384 ± 0.393
4.445SerLys: 4.445 ± 0.463
5.138SerLeu: 5.138 ± 0.473
1.876SerMet: 1.876 ± 0.276
2.977SerAsn: 2.977 ± 0.401
2.161SerPro: 2.161 ± 0.322
2.895SerGln: 2.895 ± 0.295
2.691SerArg: 2.691 ± 0.354
4.934SerSer: 4.934 ± 0.532
3.425SerThr: 3.425 ± 0.409
5.546SerVal: 5.546 ± 0.364
0.693SerTrp: 0.693 ± 0.165
3.14SerTyr: 3.14 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
5.015ThrAla: 5.015 ± 0.475
0.734ThrCys: 0.734 ± 0.167
4.282ThrAsp: 4.282 ± 0.427
3.017ThrGlu: 3.017 ± 0.37
2.447ThrPhe: 2.447 ± 0.342
5.382ThrGly: 5.382 ± 0.556
1.305ThrHis: 1.305 ± 0.248
4.159ThrIle: 4.159 ± 0.421
3.262ThrLys: 3.262 ± 0.332
5.546ThrLeu: 5.546 ± 0.507
1.019ThrMet: 1.019 ± 0.159
2.732ThrAsn: 2.732 ± 0.368
2.202ThrPro: 2.202 ± 0.282
2.283ThrGln: 2.283 ± 0.299
2.528ThrArg: 2.528 ± 0.298
4.037ThrSer: 4.037 ± 0.431
4.526ThrThr: 4.526 ± 0.595
5.138ThrVal: 5.138 ± 0.494
0.693ThrTrp: 0.693 ± 0.235
2.365ThrTyr: 2.365 ± 0.31
0.0ThrXaa: 0.0 ± 0.0
Val
4.771ValAla: 4.771 ± 0.495
0.856ValCys: 0.856 ± 0.223
4.852ValAsp: 4.852 ± 0.391
4.689ValGlu: 4.689 ± 0.477
2.406ValPhe: 2.406 ± 0.227
4.241ValGly: 4.241 ± 0.388
1.142ValHis: 1.142 ± 0.193
4.485ValIle: 4.485 ± 0.453
5.097ValLys: 5.097 ± 0.403
4.445ValLeu: 4.445 ± 0.481
2.202ValMet: 2.202 ± 0.273
3.874ValAsn: 3.874 ± 0.507
2.08ValPro: 2.08 ± 0.333
2.202ValGln: 2.202 ± 0.268
3.221ValArg: 3.221 ± 0.388
4.689ValSer: 4.689 ± 0.474
5.056ValThr: 5.056 ± 0.53
5.26ValVal: 5.26 ± 0.585
0.612ValTrp: 0.612 ± 0.196
2.977ValTyr: 2.977 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
0.938TrpAla: 0.938 ± 0.197
0.122TrpCys: 0.122 ± 0.07
0.938TrpAsp: 0.938 ± 0.258
0.775TrpGlu: 0.775 ± 0.178
0.53TrpPhe: 0.53 ± 0.124
0.489TrpGly: 0.489 ± 0.146
0.326TrpHis: 0.326 ± 0.119
0.53TrpIle: 0.53 ± 0.236
1.346TrpLys: 1.346 ± 0.238
1.183TrpLeu: 1.183 ± 0.268
0.367TrpMet: 0.367 ± 0.122
0.775TrpAsn: 0.775 ± 0.148
0.245TrpPro: 0.245 ± 0.105
0.367TrpGln: 0.367 ± 0.119
0.53TrpArg: 0.53 ± 0.155
0.816TrpSer: 0.816 ± 0.248
0.734TrpThr: 0.734 ± 0.201
0.775TrpVal: 0.775 ± 0.157
0.326TrpTrp: 0.326 ± 0.109
0.53TrpTyr: 0.53 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.324TyrAla: 2.324 ± 0.318
0.652TyrCys: 0.652 ± 0.176
2.814TyrAsp: 2.814 ± 0.329
2.487TyrGlu: 2.487 ± 0.313
1.672TyrPhe: 1.672 ± 0.269
3.466TyrGly: 3.466 ± 0.363
1.183TyrHis: 1.183 ± 0.182
2.039TyrIle: 2.039 ± 0.341
2.487TyrLys: 2.487 ± 0.332
3.548TyrLeu: 3.548 ± 0.368
0.816TyrMet: 0.816 ± 0.216
1.835TyrAsn: 1.835 ± 0.281
1.427TyrPro: 1.427 ± 0.223
1.794TyrGln: 1.794 ± 0.257
2.039TyrArg: 2.039 ± 0.237
2.854TyrSer: 2.854 ± 0.346
2.936TyrThr: 2.936 ± 0.397
2.08TyrVal: 2.08 ± 0.285
0.53TyrTrp: 0.53 ± 0.15
1.509TyrTyr: 1.509 ± 0.245
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 118 proteins (24525 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski