Amino acid dipeptide frequency for Mycobacterium phage Delton

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
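The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), the counting of overlapping pairs within each protein only, and the choice of denominator (total number of pairs counted) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 pairs.

    Assumptions (not taken from the source): overlapping pairs are counted
    within each protein only, any non-standard letter is mapped to 'X' (Xaa),
    and the denominator is the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # zip(clean, clean[1:]) yields every overlapping pair of neighbours.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Label a frequency against the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not real phage Delton sequences.
toy = ["MAAAK", "MAKX"]
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

Applied to values from the table, `classify(10.633)` labels AlaAla as overrepresented and `classify(0.722)` labels AlaCys as underrepresented relative to the uniform 2.5‰ expectation.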

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
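As a small worked example of this conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name is hypothetical, and the final step rests on an assumption: that the frequencies are normalised by roughly the total number of amino acids reported in the statistics line (20785), which the page does not state explicitly.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala = 10.633  # AlaAla value from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = fraction * 100

# Assumption: the denominator is about the 20785 amino acids in the proteome.
approx_count = fraction * 20785

print(f"fraction={fraction:.6f} percent={percent:.4f}% count~{approx_count:.0f}")
```

Under that assumption, the AlaAla frequency corresponds to roughly 221 occurrences in the proteome.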

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.633 ± 0.835
AlaCys: 0.722 ± 0.165
AlaAsp: 5.677 ± 0.499
AlaGlu: 7.939 ± 0.835
AlaPhe: 2.117 ± 0.31
AlaGly: 7.217 ± 0.652
AlaHis: 1.54 ± 0.272
AlaIle: 4.09 ± 0.48
AlaLys: 5.774 ± 0.719
AlaLeu: 8.035 ± 0.732
AlaMet: 2.454 ± 0.295
AlaAsn: 3.512 ± 0.467
AlaPro: 3.753 ± 0.381
AlaGln: 3.464 ± 0.421
AlaArg: 4.763 ± 0.513
AlaSer: 5.389 ± 0.427
AlaThr: 4.956 ± 0.526
AlaVal: 6.207 ± 0.671
AlaTrp: 1.54 ± 0.265
AlaTyr: 2.839 ± 0.348
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.01 ± 0.221
CysCys: 0.144 ± 0.083
CysAsp: 0.481 ± 0.172
CysGlu: 0.674 ± 0.207
CysPhe: 0.289 ± 0.115
CysGly: 0.625 ± 0.165
CysHis: 0.241 ± 0.113
CysIle: 0.337 ± 0.117
CysLys: 0.385 ± 0.121
CysLeu: 0.577 ± 0.18
CysMet: 0.192 ± 0.103
CysAsn: 0.577 ± 0.17
CysPro: 0.962 ± 0.321
CysGln: 0.241 ± 0.109
CysArg: 0.577 ± 0.179
CysSer: 0.722 ± 0.176
CysThr: 0.577 ± 0.149
CysVal: 0.481 ± 0.158
CysTrp: 0.144 ± 0.08
CysTyr: 0.385 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.677 ± 0.452
AspCys: 0.337 ± 0.145
AspAsp: 6.928 ± 1.598
AspGlu: 5.822 ± 1.103
AspPhe: 2.358 ± 0.309
AspGly: 5.485 ± 0.619
AspHis: 1.395 ± 0.307
AspIle: 2.935 ± 0.36
AspLys: 3.32 ± 0.456
AspLeu: 6.303 ± 0.72
AspMet: 1.395 ± 0.22
AspAsn: 2.165 ± 0.294
AspPro: 4.138 ± 0.438
AspGln: 2.117 ± 0.295
AspArg: 4.763 ± 0.516
AspSer: 3.32 ± 0.424
AspThr: 3.079 ± 0.41
AspVal: 4.33 ± 0.405
AspTrp: 1.395 ± 0.294
AspTyr: 2.213 ± 0.279
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.592 ± 0.615
GluCys: 0.577 ± 0.239
GluAsp: 6.351 ± 1.265
GluGlu: 5.966 ± 0.51
GluPhe: 2.742 ± 0.359
GluGly: 4.186 ± 0.494
GluHis: 1.395 ± 0.309
GluIle: 3.416 ± 0.377
GluLys: 2.598 ± 0.436
GluLeu: 7.025 ± 0.523
GluMet: 1.684 ± 0.268
GluAsn: 2.117 ± 0.242
GluPro: 2.454 ± 0.408
GluGln: 2.935 ± 0.495
GluArg: 5.148 ± 0.698
GluSer: 3.897 ± 0.38
GluThr: 2.694 ± 0.353
GluVal: 4.234 ± 0.552
GluTrp: 1.54 ± 0.267
GluTyr: 1.828 ± 0.294
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.309 ± 0.403
PheCys: 0.385 ± 0.135
PheAsp: 2.213 ± 0.278
PheGlu: 2.021 ± 0.33
PhePhe: 0.77 ± 0.187
PheGly: 3.705 ± 0.422
PheHis: 0.529 ± 0.16
PheIle: 1.395 ± 0.328
PheLys: 1.925 ± 0.325
PheLeu: 2.646 ± 0.442
PheMet: 0.577 ± 0.168
PheAsn: 1.443 ± 0.331
PhePro: 1.492 ± 0.283
PheGln: 1.443 ± 0.274
PheArg: 2.261 ± 0.358
PheSer: 1.78 ± 0.308
PheThr: 1.636 ± 0.209
PheVal: 1.54 ± 0.233
PheTrp: 0.433 ± 0.135
PheTyr: 1.107 ± 0.228
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.495 ± 0.715
GlyCys: 1.107 ± 0.218
GlyAsp: 5.1 ± 0.447
GlyGlu: 5.1 ± 0.415
GlyPhe: 2.598 ± 0.357
GlyGly: 6.159 ± 0.68
GlyHis: 1.636 ± 0.296
GlyIle: 4.475 ± 0.492
GlyLys: 5.341 ± 0.443
GlyLeu: 6.207 ± 0.675
GlyMet: 2.309 ± 0.302
GlyAsn: 2.55 ± 0.271
GlyPro: 4.282 ± 0.441
GlyGln: 2.694 ± 0.374
GlyArg: 3.945 ± 0.443
GlySer: 5.629 ± 0.617
GlyThr: 6.543 ± 0.714
GlyVal: 5.148 ± 0.405
GlyTrp: 1.732 ± 0.266
GlyTyr: 3.127 ± 0.394
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.636 ± 0.286
HisCys: 0.241 ± 0.106
HisAsp: 1.059 ± 0.254
HisGlu: 1.107 ± 0.26
HisPhe: 0.914 ± 0.215
HisGly: 1.78 ± 0.305
HisHis: 0.433 ± 0.155
HisIle: 0.866 ± 0.184
HisLys: 1.059 ± 0.229
HisLeu: 1.828 ± 0.325
HisMet: 0.529 ± 0.146
HisAsn: 0.962 ± 0.24
HisPro: 1.443 ± 0.29
HisGln: 0.722 ± 0.214
HisArg: 1.347 ± 0.289
HisSer: 0.625 ± 0.182
HisThr: 0.818 ± 0.187
HisVal: 1.107 ± 0.268
HisTrp: 0.192 ± 0.109
HisTyr: 0.818 ± 0.204
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.475 ± 0.582
IleCys: 0.481 ± 0.155
IleAsp: 3.416 ± 0.343
IleGlu: 2.983 ± 0.423
IlePhe: 1.347 ± 0.207
IleGly: 3.801 ± 0.461
IleHis: 0.77 ± 0.185
IleIle: 2.694 ± 0.362
IleLys: 2.454 ± 0.346
IleLeu: 3.079 ± 0.41
IleMet: 1.299 ± 0.248
IleAsn: 2.598 ± 0.418
IlePro: 2.983 ± 0.437
IleGln: 2.069 ± 0.469
IleArg: 3.56 ± 0.373
IleSer: 2.406 ± 0.311
IleThr: 3.224 ± 0.368
IleVal: 3.32 ± 0.492
IleTrp: 0.866 ± 0.169
IleTyr: 1.01 ± 0.268
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.918 ± 0.828
LysCys: 0.433 ± 0.136
LysAsp: 3.657 ± 0.409
LysGlu: 3.705 ± 0.416
LysPhe: 1.395 ± 0.197
LysGly: 3.416 ± 0.468
LysHis: 1.155 ± 0.256
LysIle: 2.887 ± 0.289
LysLys: 3.272 ± 0.514
LysLeu: 4.426 ± 0.376
LysMet: 0.914 ± 0.192
LysAsn: 2.021 ± 0.344
LysPro: 3.224 ± 0.362
LysGln: 1.78 ± 0.276
LysArg: 4.571 ± 0.515
LysSer: 2.983 ± 0.279
LysThr: 3.127 ± 0.432
LysVal: 3.993 ± 0.487
LysTrp: 1.059 ± 0.181
LysTyr: 1.443 ± 0.262
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.169 ± 0.52
LeuCys: 0.577 ± 0.213
LeuAsp: 5.244 ± 0.556
LeuGlu: 5.148 ± 0.608
LeuPhe: 2.406 ± 0.324
LeuGly: 6.159 ± 0.623
LeuHis: 1.588 ± 0.271
LeuIle: 2.935 ± 0.426
LeuLys: 4.763 ± 0.459
LeuLeu: 5.87 ± 0.555
LeuMet: 1.828 ± 0.28
LeuAsn: 3.32 ± 0.378
LeuPro: 4.234 ± 0.476
LeuGln: 2.598 ± 0.309
LeuArg: 5.533 ± 0.553
LeuSer: 5.389 ± 0.549
LeuThr: 5.004 ± 0.427
LeuVal: 4.426 ± 0.468
LeuTrp: 1.059 ± 0.269
LeuTyr: 1.251 ± 0.252
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.502 ± 0.299
MetCys: 0.241 ± 0.091
MetAsp: 1.299 ± 0.234
MetGlu: 1.155 ± 0.189
MetPhe: 0.625 ± 0.156
MetGly: 1.588 ± 0.281
MetHis: 0.337 ± 0.119
MetIle: 1.155 ± 0.212
MetLys: 1.347 ± 0.245
MetLeu: 1.251 ± 0.262
MetMet: 0.625 ± 0.196
MetAsn: 1.01 ± 0.247
MetPro: 1.443 ± 0.233
MetGln: 0.914 ± 0.195
MetArg: 1.492 ± 0.232
MetSer: 1.78 ± 0.298
MetThr: 1.54 ± 0.239
MetVal: 1.01 ± 0.251
MetTrp: 0.433 ± 0.145
MetTyr: 1.01 ± 0.24
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.186 ± 0.506
AsnCys: 0.144 ± 0.081
AsnAsp: 2.646 ± 0.421
AsnGlu: 2.406 ± 0.353
AsnPhe: 1.251 ± 0.278
AsnGly: 3.56 ± 0.479
AsnHis: 0.77 ± 0.168
AsnIle: 1.876 ± 0.311
AsnLys: 1.973 ± 0.313
AsnLeu: 2.309 ± 0.294
AsnMet: 0.77 ± 0.201
AsnAsn: 1.395 ± 0.332
AsnPro: 3.127 ± 0.511
AsnGln: 1.395 ± 0.291
AsnArg: 2.887 ± 0.394
AsnSer: 2.117 ± 0.326
AsnThr: 2.069 ± 0.285
AsnVal: 2.117 ± 0.305
AsnTrp: 0.674 ± 0.162
AsnTyr: 1.443 ± 0.229
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.993 ± 0.47
ProCys: 0.385 ± 0.145
ProAsp: 3.32 ± 0.543
ProGlu: 3.945 ± 0.46
ProPhe: 1.876 ± 0.291
ProGly: 5.148 ± 0.488
ProHis: 1.299 ± 0.19
ProIle: 2.117 ± 0.292
ProLys: 3.224 ± 0.49
ProLeu: 3.32 ± 0.385
ProMet: 0.866 ± 0.215
ProAsn: 2.694 ± 0.376
ProPro: 2.358 ± 0.411
ProGln: 1.684 ± 0.284
ProArg: 2.406 ± 0.371
ProSer: 3.368 ± 0.38
ProThr: 3.272 ± 0.364
ProVal: 3.272 ± 0.387
ProTrp: 1.155 ± 0.249
ProTyr: 1.54 ± 0.276
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.079 ± 0.434
GlnCys: 0.241 ± 0.116
GlnAsp: 2.358 ± 0.286
GlnGlu: 2.069 ± 0.314
GlnPhe: 1.588 ± 0.233
GlnGly: 2.694 ± 0.405
GlnHis: 0.722 ± 0.18
GlnIle: 2.646 ± 0.278
GlnLys: 2.117 ± 0.398
GlnLeu: 2.694 ± 0.331
GlnMet: 1.059 ± 0.192
GlnAsn: 1.828 ± 0.245
GlnPro: 1.395 ± 0.245
GlnGln: 1.299 ± 0.257
GlnArg: 2.839 ± 0.368
GlnSer: 2.117 ± 0.37
GlnThr: 1.876 ± 0.29
GlnVal: 2.309 ± 0.32
GlnTrp: 0.577 ± 0.161
GlnTyr: 0.914 ± 0.236
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.966 ± 0.661
ArgCys: 0.481 ± 0.163
ArgAsp: 3.849 ± 0.46
ArgGlu: 4.234 ± 0.536
ArgPhe: 1.78 ± 0.283
ArgGly: 5.052 ± 0.443
ArgHis: 0.962 ± 0.223
ArgIle: 3.609 ± 0.452
ArgLys: 4.523 ± 0.68
ArgLeu: 4.234 ± 0.482
ArgMet: 1.828 ± 0.345
ArgAsn: 2.742 ± 0.413
ArgPro: 2.694 ± 0.411
ArgGln: 2.598 ± 0.254
ArgArg: 5.148 ± 0.51
ArgSer: 3.464 ± 0.408
ArgThr: 3.32 ± 0.37
ArgVal: 5.196 ± 0.536
ArgTrp: 1.492 ± 0.272
ArgTyr: 2.213 ± 0.385
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.207 ± 0.574
SerCys: 0.866 ± 0.256
SerAsp: 4.667 ± 0.544
SerGlu: 3.079 ± 0.384
SerPhe: 2.358 ± 0.363
SerGly: 6.495 ± 0.58
SerHis: 1.01 ± 0.219
SerIle: 2.598 ± 0.427
SerLys: 2.742 ± 0.36
SerLeu: 3.897 ± 0.451
SerMet: 1.443 ± 0.278
SerAsn: 1.732 ± 0.292
SerPro: 2.406 ± 0.364
SerGln: 2.309 ± 0.32
SerArg: 3.079 ± 0.384
SerSer: 3.897 ± 0.634
SerThr: 4.234 ± 0.498
SerVal: 2.598 ± 0.316
SerTrp: 1.588 ± 0.319
SerTyr: 1.684 ± 0.251
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.485 ± 0.768
ThrCys: 0.77 ± 0.178
ThrAsp: 3.609 ± 0.456
ThrGlu: 4.282 ± 0.48
ThrPhe: 2.213 ± 0.331
ThrGly: 5.677 ± 0.57
ThrHis: 1.347 ± 0.256
ThrIle: 3.464 ± 0.527
ThrLys: 2.791 ± 0.357
ThrLeu: 3.657 ± 0.491
ThrMet: 1.01 ± 0.21
ThrAsn: 2.261 ± 0.368
ThrPro: 3.368 ± 0.297
ThrGln: 1.684 ± 0.327
ThrArg: 2.646 ± 0.353
ThrSer: 3.945 ± 0.512
ThrThr: 3.416 ± 0.548
ThrVal: 3.705 ± 0.515
ThrTrp: 1.395 ± 0.223
ThrTyr: 1.684 ± 0.212
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.629 ± 0.55
ValCys: 0.77 ± 0.241
ValAsp: 4.042 ± 0.412
ValGlu: 4.667 ± 0.555
ValPhe: 1.78 ± 0.33
ValGly: 5.629 ± 0.556
ValHis: 1.251 ± 0.312
ValIle: 2.839 ± 0.353
ValLys: 3.56 ± 0.502
ValLeu: 5.293 ± 0.571
ValMet: 1.059 ± 0.227
ValAsn: 1.925 ± 0.307
ValPro: 3.368 ± 0.361
ValGln: 2.213 ± 0.345
ValArg: 3.945 ± 0.462
ValSer: 3.079 ± 0.374
ValThr: 3.753 ± 0.38
ValVal: 4.33 ± 0.486
ValTrp: 1.107 ± 0.279
ValTyr: 2.694 ± 0.268
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.443 ± 0.278
TrpCys: 0.241 ± 0.116
TrpAsp: 1.155 ± 0.274
TrpGlu: 1.107 ± 0.226
TrpPhe: 0.481 ± 0.212
TrpGly: 1.251 ± 0.272
TrpHis: 0.481 ± 0.142
TrpIle: 1.155 ± 0.259
TrpLys: 1.01 ± 0.215
TrpLeu: 1.492 ± 0.251
TrpMet: 0.289 ± 0.112
TrpAsn: 0.818 ± 0.213
TrpPro: 0.722 ± 0.211
TrpGln: 0.962 ± 0.179
TrpArg: 1.588 ± 0.306
TrpSer: 1.155 ± 0.248
TrpThr: 1.636 ± 0.253
TrpVal: 1.54 ± 0.24
TrpTrp: 0.192 ± 0.091
TrpTyr: 0.433 ± 0.158
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.213 ± 0.379
TyrCys: 0.481 ± 0.152
TyrAsp: 2.406 ± 0.409
TyrGlu: 1.973 ± 0.338
TyrPhe: 0.722 ± 0.181
TyrGly: 2.55 ± 0.342
TyrHis: 0.625 ± 0.175
TyrIle: 1.347 ± 0.261
TyrLys: 1.203 ± 0.226
TyrLeu: 2.358 ± 0.314
TyrMet: 0.577 ± 0.14
TyrAsn: 1.443 ± 0.224
TyrPro: 1.395 ± 0.284
TyrGln: 1.299 ± 0.245
TyrArg: 2.935 ± 0.395
TyrSer: 1.828 ± 0.367
TyrThr: 1.636 ± 0.331
TyrVal: 2.069 ± 0.343
TyrTrp: 0.577 ± 0.168
TyrTyr: 0.962 ± 0.221
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (20785 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
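A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation of the frequency across the 100 replicates, which the note above does not spell out.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. 'AA')."""
    hits = total = 0
    for seq in proteins:
        # Overlapping neighbouring pairs within a single protein.
        pairs = [a + b for a, b in zip(seq, seq[1:])]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the spread of the estimate across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input for illustration only, not real phage Delton sequences.
toy = ["MAAAK", "MKKL", "AAGAA", "MLLV"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.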

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski