Amino acid dipepetide frequency for Escherichia phage IME11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.486AlaAla: 8.486 ± 1.32
0.722AlaCys: 0.722 ± 0.201
5.101AlaAsp: 5.101 ± 0.511
5.327AlaGlu: 5.327 ± 0.472
2.754AlaPhe: 2.754 ± 0.295
6.816AlaGly: 6.816 ± 0.761
0.948AlaHis: 0.948 ± 0.181
4.604AlaIle: 4.604 ± 0.472
5.597AlaLys: 5.597 ± 0.617
7.177AlaLeu: 7.177 ± 0.988
2.754AlaMet: 2.754 ± 0.558
4.83AlaAsn: 4.83 ± 0.473
2.708AlaPro: 2.708 ± 0.503
4.514AlaGln: 4.514 ± 0.984
3.566AlaArg: 3.566 ± 0.352
5.281AlaSer: 5.281 ± 0.525
5.778AlaThr: 5.778 ± 0.617
5.327AlaVal: 5.327 ± 0.512
0.903AlaTrp: 0.903 ± 0.183
3.837AlaTyr: 3.837 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.677CysAla: 0.677 ± 0.199
0.045CysCys: 0.045 ± 0.045
0.406CysAsp: 0.406 ± 0.133
0.767CysGlu: 0.767 ± 0.238
0.361CysPhe: 0.361 ± 0.132
0.406CysGly: 0.406 ± 0.151
0.271CysHis: 0.271 ± 0.11
0.677CysIle: 0.677 ± 0.194
0.813CysLys: 0.813 ± 0.203
0.677CysLeu: 0.677 ± 0.207
0.226CysMet: 0.226 ± 0.116
0.451CysAsn: 0.451 ± 0.155
0.361CysPro: 0.361 ± 0.152
0.0CysGln: 0.0 ± 0.0
0.226CysArg: 0.226 ± 0.103
0.497CysSer: 0.497 ± 0.155
0.542CysThr: 0.542 ± 0.199
0.632CysVal: 0.632 ± 0.202
0.09CysTrp: 0.09 ± 0.063
0.361CysTyr: 0.361 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
4.469AspAla: 4.469 ± 0.499
0.858AspCys: 0.858 ± 0.242
3.205AspAsp: 3.205 ± 0.347
3.566AspGlu: 3.566 ± 0.471
2.799AspPhe: 2.799 ± 0.338
3.024AspGly: 3.024 ± 0.466
0.813AspHis: 0.813 ± 0.221
4.604AspIle: 4.604 ± 0.581
3.115AspLys: 3.115 ± 0.301
4.695AspLeu: 4.695 ± 0.508
1.67AspMet: 1.67 ± 0.28
2.167AspAsn: 2.167 ± 0.256
2.799AspPro: 2.799 ± 0.324
1.941AspGln: 1.941 ± 0.334
2.302AspArg: 2.302 ± 0.27
4.379AspSer: 4.379 ± 0.387
4.108AspThr: 4.108 ± 0.451
3.386AspVal: 3.386 ± 0.375
0.813AspTrp: 0.813 ± 0.2
2.257AspTyr: 2.257 ± 0.433
0.0AspXaa: 0.0 ± 0.0
Glu
7.177GluAla: 7.177 ± 0.856
0.361GluCys: 0.361 ± 0.133
4.153GluAsp: 4.153 ± 0.542
5.868GluGlu: 5.868 ± 0.835
2.347GluPhe: 2.347 ± 0.411
4.018GluGly: 4.018 ± 0.46
0.767GluHis: 0.767 ± 0.171
3.25GluIle: 3.25 ± 0.372
3.747GluLys: 3.747 ± 0.508
6.049GluLeu: 6.049 ± 0.517
2.257GluMet: 2.257 ± 0.293
3.34GluAsn: 3.34 ± 0.357
3.16GluPro: 3.16 ± 0.482
2.844GluGln: 2.844 ± 0.41
2.076GluArg: 2.076 ± 0.361
3.34GluSer: 3.34 ± 0.434
3.476GluThr: 3.476 ± 0.36
4.469GluVal: 4.469 ± 0.623
0.722GluTrp: 0.722 ± 0.164
2.889GluTyr: 2.889 ± 0.413
0.0GluXaa: 0.0 ± 0.0
Phe
2.844PheAla: 2.844 ± 0.346
0.497PheCys: 0.497 ± 0.173
2.212PheAsp: 2.212 ± 0.394
1.941PheGlu: 1.941 ± 0.329
1.174PhePhe: 1.174 ± 0.272
3.07PheGly: 3.07 ± 0.348
0.497PheHis: 0.497 ± 0.127
3.295PheIle: 3.295 ± 0.477
1.986PheLys: 1.986 ± 0.357
2.438PheLeu: 2.438 ± 0.333
1.625PheMet: 1.625 ± 0.337
2.212PheAsn: 2.212 ± 0.364
1.219PhePro: 1.219 ± 0.214
1.535PheGln: 1.535 ± 0.273
1.941PheArg: 1.941 ± 0.287
2.438PheSer: 2.438 ± 0.298
2.708PheThr: 2.708 ± 0.404
2.302PheVal: 2.302 ± 0.389
0.406PheTrp: 0.406 ± 0.148
1.219PheTyr: 1.219 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
5.281GlyAla: 5.281 ± 0.633
0.813GlyCys: 0.813 ± 0.236
3.07GlyAsp: 3.07 ± 0.384
3.882GlyGlu: 3.882 ± 0.443
3.25GlyPhe: 3.25 ± 0.529
3.566GlyGly: 3.566 ± 0.508
1.309GlyHis: 1.309 ± 0.22
3.34GlyIle: 3.34 ± 0.423
5.372GlyLys: 5.372 ± 0.573
5.733GlyLeu: 5.733 ± 0.549
2.212GlyMet: 2.212 ± 0.296
4.875GlyAsn: 4.875 ± 0.598
0.993GlyPro: 0.993 ± 0.206
2.392GlyGln: 2.392 ± 0.425
2.483GlyArg: 2.483 ± 0.418
5.281GlySer: 5.281 ± 0.612
4.108GlyThr: 4.108 ± 0.406
4.83GlyVal: 4.83 ± 0.66
1.219GlyTrp: 1.219 ± 0.254
2.528GlyTyr: 2.528 ± 0.304
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.235
0.135HisCys: 0.135 ± 0.078
1.038HisAsp: 1.038 ± 0.257
0.993HisGlu: 0.993 ± 0.231
0.451HisPhe: 0.451 ± 0.184
0.858HisGly: 0.858 ± 0.232
0.497HisHis: 0.497 ± 0.177
1.038HisIle: 1.038 ± 0.24
1.399HisLys: 1.399 ± 0.3
1.986HisLeu: 1.986 ± 0.311
0.406HisMet: 0.406 ± 0.127
0.858HisAsn: 0.858 ± 0.151
0.813HisPro: 0.813 ± 0.231
0.542HisGln: 0.542 ± 0.156
0.632HisArg: 0.632 ± 0.169
1.219HisSer: 1.219 ± 0.254
0.632HisThr: 0.632 ± 0.158
1.038HisVal: 1.038 ± 0.208
0.316HisTrp: 0.316 ± 0.155
0.993HisTyr: 0.993 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
4.695IleAla: 4.695 ± 0.431
0.451IleCys: 0.451 ± 0.133
3.882IleAsp: 3.882 ± 0.569
3.927IleGlu: 3.927 ± 0.454
1.535IlePhe: 1.535 ± 0.272
3.566IleGly: 3.566 ± 0.393
0.948IleHis: 0.948 ± 0.196
3.16IleIle: 3.16 ± 0.478
4.063IleLys: 4.063 ± 0.521
3.837IleLeu: 3.837 ± 0.454
1.354IleMet: 1.354 ± 0.234
3.386IleAsn: 3.386 ± 0.449
2.979IlePro: 2.979 ± 0.403
2.212IleGln: 2.212 ± 0.297
3.024IleArg: 3.024 ± 0.381
2.889IleSer: 2.889 ± 0.35
4.965IleThr: 4.965 ± 0.466
3.386IleVal: 3.386 ± 0.416
0.497IleTrp: 0.497 ± 0.166
2.528IleTyr: 2.528 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
6.591LysAla: 6.591 ± 0.582
0.316LysCys: 0.316 ± 0.132
3.07LysAsp: 3.07 ± 0.321
4.785LysGlu: 4.785 ± 0.555
2.392LysPhe: 2.392 ± 0.39
3.792LysGly: 3.792 ± 0.504
1.535LysHis: 1.535 ± 0.372
2.754LysIle: 2.754 ± 0.389
3.205LysLys: 3.205 ± 0.465
6.816LysLeu: 6.816 ± 0.561
1.941LysMet: 1.941 ± 0.287
3.205LysAsn: 3.205 ± 0.373
2.799LysPro: 2.799 ± 0.48
2.844LysGln: 2.844 ± 0.369
2.708LysArg: 2.708 ± 0.37
4.153LysSer: 4.153 ± 0.451
4.559LysThr: 4.559 ± 0.408
3.882LysVal: 3.882 ± 0.33
0.767LysTrp: 0.767 ± 0.181
1.941LysTyr: 1.941 ± 0.359
0.0LysXaa: 0.0 ± 0.0
Leu
7.358LeuAla: 7.358 ± 0.625
0.632LeuCys: 0.632 ± 0.18
4.333LeuAsp: 4.333 ± 0.449
4.514LeuGlu: 4.514 ± 0.471
3.386LeuPhe: 3.386 ± 0.325
6.32LeuGly: 6.32 ± 0.562
1.625LeuHis: 1.625 ± 0.294
4.379LeuIle: 4.379 ± 0.464
5.191LeuLys: 5.191 ± 0.412
6.365LeuLeu: 6.365 ± 0.731
2.663LeuMet: 2.663 ± 0.317
4.92LeuAsn: 4.92 ± 0.447
4.153LeuPro: 4.153 ± 0.438
3.611LeuGln: 3.611 ± 0.444
3.792LeuArg: 3.792 ± 0.567
5.056LeuSer: 5.056 ± 0.467
5.643LeuThr: 5.643 ± 0.512
5.688LeuVal: 5.688 ± 0.501
0.948LeuTrp: 0.948 ± 0.178
2.618LeuTyr: 2.618 ± 0.317
0.0LeuXaa: 0.0 ± 0.0
Met
2.483MetAla: 2.483 ± 0.475
0.135MetCys: 0.135 ± 0.073
1.219MetAsp: 1.219 ± 0.306
2.212MetGlu: 2.212 ± 0.336
0.361MetPhe: 0.361 ± 0.148
1.896MetGly: 1.896 ± 0.26
0.271MetHis: 0.271 ± 0.106
1.715MetIle: 1.715 ± 0.263
2.483MetLys: 2.483 ± 0.317
2.483MetLeu: 2.483 ± 0.346
0.677MetMet: 0.677 ± 0.186
2.076MetAsn: 2.076 ± 0.284
1.174MetPro: 1.174 ± 0.277
1.444MetGln: 1.444 ± 0.285
1.264MetArg: 1.264 ± 0.192
2.799MetSer: 2.799 ± 0.33
2.076MetThr: 2.076 ± 0.316
1.535MetVal: 1.535 ± 0.277
0.361MetTrp: 0.361 ± 0.113
0.993MetTyr: 0.993 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
4.333AsnAla: 4.333 ± 0.649
0.226AsnCys: 0.226 ± 0.098
2.754AsnAsp: 2.754 ± 0.341
3.16AsnGlu: 3.16 ± 0.38
1.986AsnPhe: 1.986 ± 0.312
4.063AsnGly: 4.063 ± 0.45
1.264AsnHis: 1.264 ± 0.262
3.07AsnIle: 3.07 ± 0.417
3.566AsnLys: 3.566 ± 0.349
4.875AsnLeu: 4.875 ± 0.562
1.354AsnMet: 1.354 ± 0.248
3.386AsnAsn: 3.386 ± 0.426
3.611AsnPro: 3.611 ± 0.326
3.16AsnGln: 3.16 ± 0.411
3.115AsnArg: 3.115 ± 0.318
3.295AsnSer: 3.295 ± 0.499
3.34AsnThr: 3.34 ± 0.509
3.25AsnVal: 3.25 ± 0.347
0.813AsnTrp: 0.813 ± 0.221
1.896AsnTyr: 1.896 ± 0.42
0.0AsnXaa: 0.0 ± 0.0
Pro
3.566ProAla: 3.566 ± 0.451
0.226ProCys: 0.226 ± 0.101
2.708ProAsp: 2.708 ± 0.417
3.927ProGlu: 3.927 ± 0.505
1.941ProPhe: 1.941 ± 0.304
2.618ProGly: 2.618 ± 0.342
0.451ProHis: 0.451 ± 0.161
2.347ProIle: 2.347 ± 0.336
2.167ProLys: 2.167 ± 0.285
2.934ProLeu: 2.934 ± 0.312
1.129ProMet: 1.129 ± 0.169
1.625ProAsn: 1.625 ± 0.318
0.858ProPro: 0.858 ± 0.281
1.625ProGln: 1.625 ± 0.263
1.219ProArg: 1.219 ± 0.3
2.663ProSer: 2.663 ± 0.294
3.386ProThr: 3.386 ± 0.408
3.521ProVal: 3.521 ± 0.437
0.542ProTrp: 0.542 ± 0.19
1.219ProTyr: 1.219 ± 0.211
0.0ProXaa: 0.0 ± 0.0
Gln
5.236GlnAla: 5.236 ± 0.979
0.226GlnCys: 0.226 ± 0.116
2.076GlnAsp: 2.076 ± 0.372
3.386GlnGlu: 3.386 ± 0.349
1.49GlnPhe: 1.49 ± 0.238
2.663GlnGly: 2.663 ± 0.623
0.587GlnHis: 0.587 ± 0.171
2.257GlnIle: 2.257 ± 0.272
2.934GlnLys: 2.934 ± 0.434
3.07GlnLeu: 3.07 ± 0.41
1.129GlnMet: 1.129 ± 0.226
2.257GlnAsn: 2.257 ± 0.392
1.083GlnPro: 1.083 ± 0.282
1.806GlnGln: 1.806 ± 0.328
1.625GlnArg: 1.625 ± 0.336
2.663GlnSer: 2.663 ± 0.388
2.618GlnThr: 2.618 ± 0.323
3.386GlnVal: 3.386 ± 0.37
0.497GlnTrp: 0.497 ± 0.146
1.986GlnTyr: 1.986 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
3.386ArgAla: 3.386 ± 0.554
0.451ArgCys: 0.451 ± 0.165
2.212ArgAsp: 2.212 ± 0.27
2.844ArgGlu: 2.844 ± 0.364
1.67ArgPhe: 1.67 ± 0.259
2.708ArgGly: 2.708 ± 0.324
0.722ArgHis: 0.722 ± 0.174
2.708ArgIle: 2.708 ± 0.349
3.205ArgLys: 3.205 ± 0.424
3.837ArgLeu: 3.837 ± 0.489
1.49ArgMet: 1.49 ± 0.265
3.16ArgAsn: 3.16 ± 0.425
1.58ArgPro: 1.58 ± 0.279
2.076ArgGln: 2.076 ± 0.394
1.986ArgArg: 1.986 ± 0.368
2.663ArgSer: 2.663 ± 0.357
2.122ArgThr: 2.122 ± 0.257
2.528ArgVal: 2.528 ± 0.37
0.542ArgTrp: 0.542 ± 0.143
1.219ArgTyr: 1.219 ± 0.183
0.0ArgXaa: 0.0 ± 0.0
Ser
4.875SerAla: 4.875 ± 0.496
0.722SerCys: 0.722 ± 0.193
3.476SerAsp: 3.476 ± 0.392
4.018SerGlu: 4.018 ± 0.442
2.122SerPhe: 2.122 ± 0.378
4.649SerGly: 4.649 ± 0.487
0.993SerHis: 0.993 ± 0.199
3.972SerIle: 3.972 ± 0.375
3.882SerLys: 3.882 ± 0.462
6.32SerLeu: 6.32 ± 0.521
1.941SerMet: 1.941 ± 0.332
3.205SerAsn: 3.205 ± 0.505
2.573SerPro: 2.573 ± 0.471
2.392SerGln: 2.392 ± 0.41
2.708SerArg: 2.708 ± 0.394
3.656SerSer: 3.656 ± 0.576
3.972SerThr: 3.972 ± 0.589
4.108SerVal: 4.108 ± 0.457
0.722SerTrp: 0.722 ± 0.226
2.257SerTyr: 2.257 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
4.604ThrAla: 4.604 ± 0.618
0.271ThrCys: 0.271 ± 0.115
4.198ThrAsp: 4.198 ± 0.419
3.476ThrGlu: 3.476 ± 0.39
3.07ThrPhe: 3.07 ± 0.303
4.514ThrGly: 4.514 ± 0.439
1.174ThrHis: 1.174 ± 0.325
3.927ThrIle: 3.927 ± 0.479
4.198ThrLys: 4.198 ± 0.423
5.417ThrLeu: 5.417 ± 0.501
1.354ThrMet: 1.354 ± 0.208
4.153ThrAsn: 4.153 ± 0.478
3.34ThrPro: 3.34 ± 0.369
2.302ThrGln: 2.302 ± 0.306
2.708ThrArg: 2.708 ± 0.292
3.837ThrSer: 3.837 ± 0.481
3.34ThrThr: 3.34 ± 0.566
5.146ThrVal: 5.146 ± 0.457
0.813ThrTrp: 0.813 ± 0.242
1.941ThrTyr: 1.941 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
6.094ValAla: 6.094 ± 0.663
0.677ValCys: 0.677 ± 0.216
3.882ValAsp: 3.882 ± 0.459
4.74ValGlu: 4.74 ± 0.55
2.302ValPhe: 2.302 ± 0.405
4.785ValGly: 4.785 ± 0.592
1.399ValHis: 1.399 ± 0.213
3.476ValIle: 3.476 ± 0.584
4.018ValLys: 4.018 ± 0.441
4.785ValLeu: 4.785 ± 0.471
2.302ValMet: 2.302 ± 0.282
3.611ValAsn: 3.611 ± 0.388
2.573ValPro: 2.573 ± 0.331
3.476ValGln: 3.476 ± 0.392
3.702ValArg: 3.702 ± 0.346
3.521ValSer: 3.521 ± 0.476
4.469ValThr: 4.469 ± 0.582
4.649ValVal: 4.649 ± 0.478
0.632ValTrp: 0.632 ± 0.161
2.392ValTyr: 2.392 ± 0.321
0.0ValXaa: 0.0 ± 0.0
Trp
0.903TrpAla: 0.903 ± 0.205
0.226TrpCys: 0.226 ± 0.095
1.309TrpAsp: 1.309 ± 0.305
0.632TrpGlu: 0.632 ± 0.209
0.451TrpPhe: 0.451 ± 0.149
0.722TrpGly: 0.722 ± 0.155
0.226TrpHis: 0.226 ± 0.089
0.451TrpIle: 0.451 ± 0.137
0.767TrpLys: 0.767 ± 0.18
1.038TrpLeu: 1.038 ± 0.186
0.226TrpMet: 0.226 ± 0.109
0.451TrpAsn: 0.451 ± 0.218
0.361TrpPro: 0.361 ± 0.111
0.587TrpGln: 0.587 ± 0.126
0.542TrpArg: 0.542 ± 0.127
0.587TrpSer: 0.587 ± 0.142
0.406TrpThr: 0.406 ± 0.126
1.49TrpVal: 1.49 ± 0.287
0.045TrpTrp: 0.045 ± 0.042
0.542TrpTyr: 0.542 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.708TyrAla: 2.708 ± 0.334
0.497TyrCys: 0.497 ± 0.167
2.663TyrAsp: 2.663 ± 0.261
2.347TyrGlu: 2.347 ± 0.339
1.715TyrPhe: 1.715 ± 0.272
2.483TyrGly: 2.483 ± 0.356
0.813TyrHis: 0.813 ± 0.202
2.302TyrIle: 2.302 ± 0.372
2.347TyrLys: 2.347 ± 0.449
2.663TyrLeu: 2.663 ± 0.359
0.903TyrMet: 0.903 ± 0.175
2.347TyrAsn: 2.347 ± 0.37
1.535TyrPro: 1.535 ± 0.34
1.67TyrGln: 1.67 ± 0.287
1.49TyrArg: 1.49 ± 0.29
2.347TyrSer: 2.347 ± 0.389
1.535TyrThr: 1.535 ± 0.291
2.934TyrVal: 2.934 ± 0.401
0.316TyrTrp: 0.316 ± 0.095
1.129TyrTyr: 1.129 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (22154 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski