Amino acid dipepetide frequency for Mycobacterium phage Antsirabe

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.912AlaAla: 18.912 ± 1.482
0.825AlaCys: 0.825 ± 0.264
7.634AlaAsp: 7.634 ± 0.812
8.528AlaGlu: 8.528 ± 1.412
4.676AlaPhe: 4.676 ± 0.662
13.754AlaGly: 13.754 ± 1.382
2.063AlaHis: 2.063 ± 0.395
7.771AlaIle: 7.771 ± 0.641
5.433AlaLys: 5.433 ± 0.641
11.485AlaLeu: 11.485 ± 1.199
3.163AlaMet: 3.163 ± 0.492
3.507AlaAsn: 3.507 ± 0.49
7.083AlaPro: 7.083 ± 0.683
5.983AlaGln: 5.983 ± 0.641
6.946AlaArg: 6.946 ± 0.703
4.195AlaSer: 4.195 ± 0.821
7.977AlaThr: 7.977 ± 0.688
7.771AlaVal: 7.771 ± 0.879
1.994AlaTrp: 1.994 ± 0.414
2.201AlaTyr: 2.201 ± 0.409
0.0AlaXaa: 0.0 ± 0.0
Cys
0.825CysAla: 0.825 ± 0.275
0.138CysCys: 0.138 ± 0.086
0.413CysAsp: 0.413 ± 0.213
0.55CysGlu: 0.55 ± 0.223
0.481CysPhe: 0.481 ± 0.206
1.444CysGly: 1.444 ± 0.477
0.0CysHis: 0.0 ± 0.0
0.413CysIle: 0.413 ± 0.147
0.069CysLys: 0.069 ± 0.061
0.275CysLeu: 0.275 ± 0.138
0.0CysMet: 0.0 ± 0.0
0.481CysAsn: 0.481 ± 0.182
0.825CysPro: 0.825 ± 0.311
0.413CysGln: 0.413 ± 0.179
1.307CysArg: 1.307 ± 0.389
0.756CysSer: 0.756 ± 0.23
0.344CysThr: 0.344 ± 0.223
0.275CysVal: 0.275 ± 0.154
0.275CysTrp: 0.275 ± 0.132
0.206CysTyr: 0.206 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
9.559AspAla: 9.559 ± 1.182
1.307AspCys: 1.307 ± 0.422
6.396AspAsp: 6.396 ± 0.858
5.364AspGlu: 5.364 ± 0.832
2.063AspPhe: 2.063 ± 0.456
8.528AspGly: 8.528 ± 0.825
1.444AspHis: 1.444 ± 0.334
1.032AspIle: 1.032 ± 0.308
1.582AspLys: 1.582 ± 0.385
4.195AspLeu: 4.195 ± 0.625
1.032AspMet: 1.032 ± 0.203
1.857AspAsn: 1.857 ± 0.383
5.295AspPro: 5.295 ± 0.806
3.507AspGln: 3.507 ± 0.618
4.333AspArg: 4.333 ± 0.501
1.926AspSer: 1.926 ± 0.406
3.851AspThr: 3.851 ± 0.392
5.914AspVal: 5.914 ± 0.731
1.582AspTrp: 1.582 ± 0.291
2.132AspTyr: 2.132 ± 0.442
0.0AspXaa: 0.0 ± 0.0
Glu
5.846GluAla: 5.846 ± 0.972
0.275GluCys: 0.275 ± 0.131
4.333GluAsp: 4.333 ± 0.574
2.545GluGlu: 2.545 ± 0.484
1.582GluPhe: 1.582 ± 0.343
2.613GluGly: 2.613 ± 0.399
1.857GluHis: 1.857 ± 0.375
2.063GluIle: 2.063 ± 0.442
1.651GluLys: 1.651 ± 0.398
4.676GluLeu: 4.676 ± 0.667
1.032GluMet: 1.032 ± 0.273
1.651GluAsn: 1.651 ± 0.457
4.195GluPro: 4.195 ± 0.75
3.163GluGln: 3.163 ± 0.522
5.02GluArg: 5.02 ± 0.762
1.994GluSer: 1.994 ± 0.406
2.476GluThr: 2.476 ± 0.39
4.539GluVal: 4.539 ± 0.613
1.307GluTrp: 1.307 ± 0.274
1.375GluTyr: 1.375 ± 0.293
0.0GluXaa: 0.0 ± 0.0
Phe
4.333PheAla: 4.333 ± 0.501
0.481PheCys: 0.481 ± 0.193
2.751PheAsp: 2.751 ± 0.426
2.613PheGlu: 2.613 ± 0.469
0.481PhePhe: 0.481 ± 0.175
3.645PheGly: 3.645 ± 0.505
0.413PheHis: 0.413 ± 0.173
0.688PheIle: 0.688 ± 0.167
1.307PheLys: 1.307 ± 0.278
1.719PheLeu: 1.719 ± 0.434
0.55PheMet: 0.55 ± 0.19
0.894PheAsn: 0.894 ± 0.294
1.307PhePro: 1.307 ± 0.33
0.825PheGln: 0.825 ± 0.24
1.169PheArg: 1.169 ± 0.252
1.926PheSer: 1.926 ± 0.363
1.307PheThr: 1.307 ± 0.318
2.063PheVal: 2.063 ± 0.396
0.481PheTrp: 0.481 ± 0.18
0.825PheTyr: 0.825 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
10.935GlyAla: 10.935 ± 1.436
0.688GlyCys: 0.688 ± 0.217
6.946GlyAsp: 6.946 ± 0.786
4.814GlyGlu: 4.814 ± 0.63
3.026GlyPhe: 3.026 ± 0.421
11.622GlyGly: 11.622 ± 3.069
1.719GlyHis: 1.719 ± 0.429
4.333GlyIle: 4.333 ± 0.702
3.507GlyLys: 3.507 ± 0.592
6.946GlyLeu: 6.946 ± 0.599
1.994GlyMet: 1.994 ± 0.31
3.301GlyAsn: 3.301 ± 0.509
5.227GlyPro: 5.227 ± 0.77
2.751GlyGln: 2.751 ± 0.465
6.808GlyArg: 6.808 ± 0.687
5.02GlySer: 5.02 ± 0.84
6.671GlyThr: 6.671 ± 0.773
6.327GlyVal: 6.327 ± 0.886
1.926GlyTrp: 1.926 ± 0.404
1.926GlyTyr: 1.926 ± 0.382
0.0GlyXaa: 0.0 ± 0.0
His
2.476HisAla: 2.476 ± 0.602
0.138HisCys: 0.138 ± 0.109
0.825HisAsp: 0.825 ± 0.288
0.481HisGlu: 0.481 ± 0.199
0.481HisPhe: 0.481 ± 0.18
1.582HisGly: 1.582 ± 0.295
0.344HisHis: 0.344 ± 0.148
0.894HisIle: 0.894 ± 0.3
0.413HisLys: 0.413 ± 0.178
0.963HisLeu: 0.963 ± 0.272
0.619HisMet: 0.619 ± 0.233
0.344HisAsn: 0.344 ± 0.152
1.651HisPro: 1.651 ± 0.385
1.238HisGln: 1.238 ± 0.313
2.132HisArg: 2.132 ± 0.381
0.963HisSer: 0.963 ± 0.285
0.963HisThr: 0.963 ± 0.26
2.063HisVal: 2.063 ± 0.367
0.275HisTrp: 0.275 ± 0.17
0.069HisTyr: 0.069 ± 0.074
0.0HisXaa: 0.0 ± 0.0
Ile
6.808IleAla: 6.808 ± 0.658
0.344IleCys: 0.344 ± 0.195
3.095IleAsp: 3.095 ± 0.536
2.132IleGlu: 2.132 ± 0.356
1.238IlePhe: 1.238 ± 0.311
5.433IleGly: 5.433 ± 1.067
0.894IleHis: 0.894 ± 0.23
1.307IleIle: 1.307 ± 0.34
0.963IleLys: 0.963 ± 0.343
2.201IleLeu: 2.201 ± 0.353
0.688IleMet: 0.688 ± 0.242
0.688IleAsn: 0.688 ± 0.224
3.232IlePro: 3.232 ± 0.681
1.169IleGln: 1.169 ± 0.301
3.851IleArg: 3.851 ± 0.461
1.719IleSer: 1.719 ± 0.327
1.926IleThr: 1.926 ± 0.323
3.232IleVal: 3.232 ± 0.453
0.894IleTrp: 0.894 ± 0.255
0.55IleTyr: 0.55 ± 0.164
0.0IleXaa: 0.0 ± 0.0
Lys
5.364LysAla: 5.364 ± 0.77
0.481LysCys: 0.481 ± 0.241
1.513LysAsp: 1.513 ± 0.372
1.032LysGlu: 1.032 ± 0.202
1.1LysPhe: 1.1 ± 0.242
1.857LysGly: 1.857 ± 0.409
0.344LysHis: 0.344 ± 0.175
1.582LysIle: 1.582 ± 0.363
1.238LysLys: 1.238 ± 0.388
2.682LysLeu: 2.682 ± 0.439
1.1LysMet: 1.1 ± 0.279
1.444LysAsn: 1.444 ± 0.267
1.926LysPro: 1.926 ± 0.38
1.238LysGln: 1.238 ± 0.347
2.888LysArg: 2.888 ± 0.505
1.857LysSer: 1.857 ± 0.418
2.201LysThr: 2.201 ± 0.445
2.063LysVal: 2.063 ± 0.386
0.688LysTrp: 0.688 ± 0.226
0.413LysTyr: 0.413 ± 0.168
0.0LysXaa: 0.0 ± 0.0
Leu
8.046LeuAla: 8.046 ± 0.72
0.481LeuCys: 0.481 ± 0.195
7.29LeuAsp: 7.29 ± 0.901
3.645LeuGlu: 3.645 ± 0.529
2.338LeuPhe: 2.338 ± 0.439
7.152LeuGly: 7.152 ± 1.375
1.444LeuHis: 1.444 ± 0.293
3.782LeuIle: 3.782 ± 0.51
2.545LeuLys: 2.545 ± 0.516
5.914LeuLeu: 5.914 ± 0.659
1.375LeuMet: 1.375 ± 0.393
2.201LeuAsn: 2.201 ± 0.482
4.401LeuPro: 4.401 ± 0.674
1.307LeuGln: 1.307 ± 0.334
5.089LeuArg: 5.089 ± 0.651
4.401LeuSer: 4.401 ± 0.58
4.883LeuThr: 4.883 ± 0.543
5.708LeuVal: 5.708 ± 0.567
1.238LeuTrp: 1.238 ± 0.396
1.444LeuTyr: 1.444 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
3.714MetAla: 3.714 ± 0.487
0.138MetCys: 0.138 ± 0.083
1.238MetAsp: 1.238 ± 0.281
0.688MetGlu: 0.688 ± 0.227
0.756MetPhe: 0.756 ± 0.178
1.375MetGly: 1.375 ± 0.341
0.206MetHis: 0.206 ± 0.103
1.1MetIle: 1.1 ± 0.332
0.413MetLys: 0.413 ± 0.145
1.444MetLeu: 1.444 ± 0.32
0.344MetMet: 0.344 ± 0.161
0.413MetAsn: 0.413 ± 0.15
0.963MetPro: 0.963 ± 0.2
0.688MetGln: 0.688 ± 0.2
1.513MetArg: 1.513 ± 0.41
2.132MetSer: 2.132 ± 0.364
2.201MetThr: 2.201 ± 0.376
1.169MetVal: 1.169 ± 0.225
0.344MetTrp: 0.344 ± 0.153
0.138MetTyr: 0.138 ± 0.088
0.0MetXaa: 0.0 ± 0.0
Asn
3.92AsnAla: 3.92 ± 0.533
0.344AsnCys: 0.344 ± 0.186
1.651AsnAsp: 1.651 ± 0.391
1.307AsnGlu: 1.307 ± 0.264
0.894AsnPhe: 0.894 ± 0.223
3.645AsnGly: 3.645 ± 0.534
0.481AsnHis: 0.481 ± 0.165
0.756AsnIle: 0.756 ± 0.227
0.894AsnLys: 0.894 ± 0.234
1.788AsnLeu: 1.788 ± 0.327
0.55AsnMet: 0.55 ± 0.184
0.756AsnAsn: 0.756 ± 0.255
3.163AsnPro: 3.163 ± 0.419
1.444AsnGln: 1.444 ± 0.312
2.888AsnArg: 2.888 ± 0.447
0.894AsnSer: 0.894 ± 0.255
1.307AsnThr: 1.307 ± 0.226
1.788AsnVal: 1.788 ± 0.366
0.963AsnTrp: 0.963 ± 0.237
0.619AsnTyr: 0.619 ± 0.2
0.0AsnXaa: 0.0 ± 0.0
Pro
9.903ProAla: 9.903 ± 1.068
0.206ProCys: 0.206 ± 0.115
7.702ProAsp: 7.702 ± 1.062
3.782ProGlu: 3.782 ± 0.742
0.825ProPhe: 0.825 ± 0.21
4.883ProGly: 4.883 ± 0.612
1.169ProHis: 1.169 ± 0.314
2.888ProIle: 2.888 ± 0.365
2.888ProLys: 2.888 ± 0.583
4.676ProLeu: 4.676 ± 0.521
1.032ProMet: 1.032 ± 0.303
1.651ProAsn: 1.651 ± 0.383
3.37ProPro: 3.37 ± 0.508
2.201ProGln: 2.201 ± 0.371
3.507ProArg: 3.507 ± 0.583
2.201ProSer: 2.201 ± 0.502
4.401ProThr: 4.401 ± 0.558
3.095ProVal: 3.095 ± 0.554
1.169ProTrp: 1.169 ± 0.341
1.169ProTyr: 1.169 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
5.502GlnAla: 5.502 ± 0.677
0.206GlnCys: 0.206 ± 0.128
2.338GlnAsp: 2.338 ± 0.502
1.032GlnGlu: 1.032 ± 0.249
1.651GlnPhe: 1.651 ± 0.317
2.888GlnGly: 2.888 ± 0.429
0.619GlnHis: 0.619 ± 0.154
2.063GlnIle: 2.063 ± 0.326
0.894GlnLys: 0.894 ± 0.207
3.92GlnLeu: 3.92 ± 0.501
1.169GlnMet: 1.169 ± 0.274
1.307GlnAsn: 1.307 ± 0.326
2.476GlnPro: 2.476 ± 0.498
1.238GlnGln: 1.238 ± 0.374
2.82GlnArg: 2.82 ± 0.458
1.307GlnSer: 1.307 ± 0.288
1.513GlnThr: 1.513 ± 0.285
2.407GlnVal: 2.407 ± 0.455
0.55GlnTrp: 0.55 ± 0.178
0.688GlnTyr: 0.688 ± 0.264
0.0GlnXaa: 0.0 ± 0.0
Arg
9.422ArgAla: 9.422 ± 1.13
1.169ArgCys: 1.169 ± 0.379
4.47ArgAsp: 4.47 ± 0.475
3.782ArgGlu: 3.782 ± 0.514
1.651ArgPhe: 1.651 ± 0.34
4.883ArgGly: 4.883 ± 0.685
1.651ArgHis: 1.651 ± 0.35
2.957ArgIle: 2.957 ± 0.416
2.957ArgLys: 2.957 ± 0.563
6.533ArgLeu: 6.533 ± 0.61
1.651ArgMet: 1.651 ± 0.405
2.269ArgAsn: 2.269 ± 0.348
4.47ArgPro: 4.47 ± 0.552
2.613ArgGln: 2.613 ± 0.491
6.946ArgArg: 6.946 ± 1.055
2.957ArgSer: 2.957 ± 0.402
3.92ArgThr: 3.92 ± 0.514
3.026ArgVal: 3.026 ± 0.375
1.719ArgTrp: 1.719 ± 0.41
1.926ArgTyr: 1.926 ± 0.409
0.0ArgXaa: 0.0 ± 0.0
Ser
4.745SerAla: 4.745 ± 0.525
0.413SerCys: 0.413 ± 0.204
2.957SerAsp: 2.957 ± 0.431
2.201SerGlu: 2.201 ± 0.439
1.582SerPhe: 1.582 ± 0.33
4.676SerGly: 4.676 ± 0.922
0.619SerHis: 0.619 ± 0.223
2.269SerIle: 2.269 ± 0.497
1.719SerLys: 1.719 ± 0.329
3.163SerLeu: 3.163 ± 0.729
1.032SerMet: 1.032 ± 0.262
1.719SerAsn: 1.719 ± 0.354
2.545SerPro: 2.545 ± 0.327
0.894SerGln: 0.894 ± 0.248
2.82SerArg: 2.82 ± 0.414
3.026SerSer: 3.026 ± 0.788
2.957SerThr: 2.957 ± 0.461
3.095SerVal: 3.095 ± 0.514
1.375SerTrp: 1.375 ± 0.324
0.963SerTyr: 0.963 ± 0.245
0.0SerXaa: 0.0 ± 0.0
Thr
7.977ThrAla: 7.977 ± 0.723
0.55ThrCys: 0.55 ± 0.212
3.782ThrAsp: 3.782 ± 0.57
3.232ThrGlu: 3.232 ± 0.406
1.719ThrPhe: 1.719 ± 0.437
6.808ThrGly: 6.808 ± 0.578
1.238ThrHis: 1.238 ± 0.265
2.613ThrIle: 2.613 ± 0.404
1.719ThrLys: 1.719 ± 0.324
3.782ThrLeu: 3.782 ± 0.461
1.651ThrMet: 1.651 ± 0.31
2.063ThrAsn: 2.063 ± 0.307
5.502ThrPro: 5.502 ± 0.732
1.513ThrGln: 1.513 ± 0.36
3.37ThrArg: 3.37 ± 0.475
2.682ThrSer: 2.682 ± 0.433
3.301ThrThr: 3.301 ± 0.435
5.089ThrVal: 5.089 ± 0.721
0.894ThrTrp: 0.894 ± 0.236
0.619ThrTyr: 0.619 ± 0.196
0.0ThrXaa: 0.0 ± 0.0
Val
8.871ValAla: 8.871 ± 0.984
0.963ValCys: 0.963 ± 0.367
5.089ValAsp: 5.089 ± 0.491
4.608ValGlu: 4.608 ± 0.668
1.994ValPhe: 1.994 ± 0.31
6.052ValGly: 6.052 ± 0.651
0.894ValHis: 0.894 ± 0.371
2.682ValIle: 2.682 ± 0.428
1.857ValLys: 1.857 ± 0.43
4.745ValLeu: 4.745 ± 0.6
1.375ValMet: 1.375 ± 0.376
2.063ValAsn: 2.063 ± 0.352
3.163ValPro: 3.163 ± 0.443
2.751ValGln: 2.751 ± 0.501
4.676ValArg: 4.676 ± 0.626
2.751ValSer: 2.751 ± 0.504
4.814ValThr: 4.814 ± 0.48
5.914ValVal: 5.914 ± 0.599
2.063ValTrp: 2.063 ± 0.38
2.063ValTyr: 2.063 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
2.201TrpAla: 2.201 ± 0.383
0.138TrpCys: 0.138 ± 0.107
1.444TrpAsp: 1.444 ± 0.259
0.756TrpGlu: 0.756 ± 0.253
0.894TrpPhe: 0.894 ± 0.239
1.307TrpGly: 1.307 ± 0.342
0.825TrpHis: 0.825 ± 0.283
0.688TrpIle: 0.688 ± 0.232
0.619TrpLys: 0.619 ± 0.248
1.994TrpLeu: 1.994 ± 0.498
0.275TrpMet: 0.275 ± 0.11
1.032TrpAsn: 1.032 ± 0.243
1.032TrpPro: 1.032 ± 0.226
1.032TrpGln: 1.032 ± 0.254
1.651TrpArg: 1.651 ± 0.33
0.825TrpSer: 0.825 ± 0.218
1.857TrpThr: 1.857 ± 0.432
1.1TrpVal: 1.1 ± 0.25
0.413TrpTrp: 0.413 ± 0.157
0.481TrpTyr: 0.481 ± 0.191
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.545TyrAla: 2.545 ± 0.488
0.206TyrCys: 0.206 ± 0.115
1.032TyrAsp: 1.032 ± 0.291
1.1TyrGlu: 1.1 ± 0.304
0.344TyrPhe: 0.344 ± 0.171
2.338TyrGly: 2.338 ± 0.5
0.688TyrHis: 0.688 ± 0.26
0.481TyrIle: 0.481 ± 0.157
0.344TyrLys: 0.344 ± 0.16
1.582TyrLeu: 1.582 ± 0.291
0.275TyrMet: 0.275 ± 0.149
0.413TyrAsn: 0.413 ± 0.147
0.894TyrPro: 0.894 ± 0.263
0.619TyrGln: 0.619 ± 0.176
1.307TyrArg: 1.307 ± 0.265
1.238TyrSer: 1.238 ± 0.322
1.238TyrThr: 1.238 ± 0.299
2.751TyrVal: 2.751 ± 0.515
0.413TyrTrp: 0.413 ± 0.192
0.344TyrTyr: 0.344 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (14542 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski