Amino acid dipepetide frequency for Mycobacterium phage Llama

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.606AlaAla: 14.606 ± 1.816
1.066AlaCys: 1.066 ± 0.223
7.09AlaAsp: 7.09 ± 0.562
7.783AlaGlu: 7.783 ± 0.705
3.305AlaPhe: 3.305 ± 0.48
9.861AlaGly: 9.861 ± 1.314
2.239AlaHis: 2.239 ± 0.323
4.211AlaIle: 4.211 ± 0.539
4.104AlaLys: 4.104 ± 0.439
8.475AlaLeu: 8.475 ± 0.763
2.772AlaMet: 2.772 ± 0.411
2.399AlaAsn: 2.399 ± 0.41
5.277AlaPro: 5.277 ± 0.556
3.998AlaGln: 3.998 ± 0.406
6.983AlaArg: 6.983 ± 0.68
5.544AlaSer: 5.544 ± 0.587
6.343AlaThr: 6.343 ± 0.555
6.823AlaVal: 6.823 ± 0.614
2.345AlaTrp: 2.345 ± 0.363
2.559AlaTyr: 2.559 ± 0.295
0.0AlaXaa: 0.0 ± 0.0
Cys
1.119CysAla: 1.119 ± 0.241
0.053CysCys: 0.053 ± 0.047
1.066CysAsp: 1.066 ± 0.285
0.746CysGlu: 0.746 ± 0.204
0.267CysPhe: 0.267 ± 0.104
1.599CysGly: 1.599 ± 0.336
0.213CysHis: 0.213 ± 0.108
0.16CysIle: 0.16 ± 0.086
0.373CysLys: 0.373 ± 0.157
0.8CysLeu: 0.8 ± 0.224
0.0CysMet: 0.0 ± 0.0
0.32CysAsn: 0.32 ± 0.145
1.066CysPro: 1.066 ± 0.267
0.48CysGln: 0.48 ± 0.164
0.8CysArg: 0.8 ± 0.208
0.8CysSer: 0.8 ± 0.263
0.693CysThr: 0.693 ± 0.209
0.586CysVal: 0.586 ± 0.178
0.267CysTrp: 0.267 ± 0.107
0.16CysTyr: 0.16 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
6.823AspAla: 6.823 ± 0.593
0.853AspCys: 0.853 ± 0.217
3.945AspAsp: 3.945 ± 0.51
3.731AspGlu: 3.731 ± 0.452
1.493AspPhe: 1.493 ± 0.259
6.716AspGly: 6.716 ± 0.605
1.333AspHis: 1.333 ± 0.261
2.239AspIle: 2.239 ± 0.35
1.919AspLys: 1.919 ± 0.256
5.757AspLeu: 5.757 ± 0.532
1.013AspMet: 1.013 ± 0.239
1.599AspAsn: 1.599 ± 0.358
4.211AspPro: 4.211 ± 0.468
2.292AspGln: 2.292 ± 0.326
5.49AspArg: 5.49 ± 0.59
3.092AspSer: 3.092 ± 0.418
4.264AspThr: 4.264 ± 0.546
4.264AspVal: 4.264 ± 0.466
1.652AspTrp: 1.652 ± 0.275
1.866AspTyr: 1.866 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
6.823GluAla: 6.823 ± 0.638
0.959GluCys: 0.959 ± 0.267
3.145GluAsp: 3.145 ± 0.386
3.305GluGlu: 3.305 ± 0.6
2.079GluPhe: 2.079 ± 0.331
3.305GluGly: 3.305 ± 0.412
1.919GluHis: 1.919 ± 0.337
2.505GluIle: 2.505 ± 0.319
1.599GluLys: 1.599 ± 0.268
5.544GluLeu: 5.544 ± 0.556
1.333GluMet: 1.333 ± 0.28
2.132GluAsn: 2.132 ± 0.288
3.198GluPro: 3.198 ± 0.475
2.932GluGln: 2.932 ± 0.357
4.797GluArg: 4.797 ± 0.556
3.518GluSer: 3.518 ± 0.467
3.838GluThr: 3.838 ± 0.582
3.731GluVal: 3.731 ± 0.494
1.546GluTrp: 1.546 ± 0.274
2.026GluTyr: 2.026 ± 0.378
0.0GluXaa: 0.0 ± 0.0
Phe
3.092PheAla: 3.092 ± 0.416
0.16PheCys: 0.16 ± 0.094
2.452PheAsp: 2.452 ± 0.433
2.079PheGlu: 2.079 ± 0.326
0.906PhePhe: 0.906 ± 0.249
2.825PheGly: 2.825 ± 0.641
0.533PheHis: 0.533 ± 0.167
1.493PheIle: 1.493 ± 0.319
0.8PheLys: 0.8 ± 0.201
1.493PheLeu: 1.493 ± 0.225
0.64PheMet: 0.64 ± 0.191
1.333PheAsn: 1.333 ± 0.311
1.866PhePro: 1.866 ± 0.338
1.226PheGln: 1.226 ± 0.324
1.706PheArg: 1.706 ± 0.286
1.493PheSer: 1.493 ± 0.272
1.972PheThr: 1.972 ± 0.275
2.399PheVal: 2.399 ± 0.324
0.533PheTrp: 0.533 ± 0.158
1.013PheTyr: 1.013 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
8.795GlyAla: 8.795 ± 1.07
1.226GlyCys: 1.226 ± 0.31
5.544GlyAsp: 5.544 ± 0.564
4.424GlyGlu: 4.424 ± 0.562
3.092GlyPhe: 3.092 ± 0.435
11.034GlyGly: 11.034 ± 1.752
1.652GlyHis: 1.652 ± 0.224
3.678GlyIle: 3.678 ± 0.585
2.612GlyLys: 2.612 ± 0.329
6.023GlyLeu: 6.023 ± 0.509
2.559GlyMet: 2.559 ± 0.462
3.038GlyAsn: 3.038 ± 0.367
3.838GlyPro: 3.838 ± 0.568
2.292GlyGln: 2.292 ± 0.578
5.224GlyArg: 5.224 ± 0.528
6.183GlySer: 6.183 ± 0.778
6.823GlyThr: 6.823 ± 0.804
6.343GlyVal: 6.343 ± 0.545
2.772GlyTrp: 2.772 ± 0.374
1.759GlyTyr: 1.759 ± 0.307
0.0GlyXaa: 0.0 ± 0.0
His
2.186HisAla: 2.186 ± 0.375
0.426HisCys: 0.426 ± 0.185
0.853HisAsp: 0.853 ± 0.197
0.853HisGlu: 0.853 ± 0.195
0.64HisPhe: 0.64 ± 0.17
1.652HisGly: 1.652 ± 0.296
0.693HisHis: 0.693 ± 0.243
1.333HisIle: 1.333 ± 0.32
0.64HisLys: 0.64 ± 0.162
1.706HisLeu: 1.706 ± 0.284
0.533HisMet: 0.533 ± 0.151
0.693HisAsn: 0.693 ± 0.158
1.173HisPro: 1.173 ± 0.251
0.959HisGln: 0.959 ± 0.238
2.292HisArg: 2.292 ± 0.386
1.226HisSer: 1.226 ± 0.268
1.759HisThr: 1.759 ± 0.327
1.493HisVal: 1.493 ± 0.3
0.693HisTrp: 0.693 ± 0.182
0.906HisTyr: 0.906 ± 0.218
0.0HisXaa: 0.0 ± 0.0
Ile
4.904IleAla: 4.904 ± 0.561
0.586IleCys: 0.586 ± 0.187
3.465IleAsp: 3.465 ± 0.445
3.358IleGlu: 3.358 ± 0.417
0.64IlePhe: 0.64 ± 0.192
3.731IleGly: 3.731 ± 0.395
1.333IleHis: 1.333 ± 0.274
1.599IleIle: 1.599 ± 0.29
1.119IleLys: 1.119 ± 0.244
2.239IleLeu: 2.239 ± 0.41
0.48IleMet: 0.48 ± 0.127
1.812IleAsn: 1.812 ± 0.252
2.719IlePro: 2.719 ± 0.349
1.599IleGln: 1.599 ± 0.29
2.665IleArg: 2.665 ± 0.348
2.559IleSer: 2.559 ± 0.479
3.891IleThr: 3.891 ± 0.44
2.665IleVal: 2.665 ± 0.356
0.853IleTrp: 0.853 ± 0.206
0.8IleTyr: 0.8 ± 0.186
0.0IleXaa: 0.0 ± 0.0
Lys
3.945LysAla: 3.945 ± 0.523
0.373LysCys: 0.373 ± 0.139
1.652LysAsp: 1.652 ± 0.285
1.493LysGlu: 1.493 ± 0.326
1.439LysPhe: 1.439 ± 0.219
2.505LysGly: 2.505 ± 0.307
1.119LysHis: 1.119 ± 0.291
1.013LysIle: 1.013 ± 0.212
1.386LysLys: 1.386 ± 0.403
2.452LysLeu: 2.452 ± 0.472
0.64LysMet: 0.64 ± 0.17
0.906LysAsn: 0.906 ± 0.21
2.026LysPro: 2.026 ± 0.394
1.866LysGln: 1.866 ± 0.283
2.399LysArg: 2.399 ± 0.376
2.239LysSer: 2.239 ± 0.361
1.972LysThr: 1.972 ± 0.35
2.612LysVal: 2.612 ± 0.439
0.906LysTrp: 0.906 ± 0.249
0.8LysTyr: 0.8 ± 0.223
0.0LysXaa: 0.0 ± 0.0
Leu
7.783LeuAla: 7.783 ± 0.699
0.693LeuCys: 0.693 ± 0.207
4.957LeuAsp: 4.957 ± 0.601
4.211LeuGlu: 4.211 ± 0.444
1.919LeuPhe: 1.919 ± 0.285
5.171LeuGly: 5.171 ± 0.508
1.333LeuHis: 1.333 ± 0.27
3.092LeuIle: 3.092 ± 0.356
2.292LeuLys: 2.292 ± 0.372
4.584LeuLeu: 4.584 ± 0.513
1.226LeuMet: 1.226 ± 0.239
2.505LeuAsn: 2.505 ± 0.341
5.544LeuPro: 5.544 ± 0.662
2.665LeuGln: 2.665 ± 0.359
5.171LeuArg: 5.171 ± 0.58
5.224LeuSer: 5.224 ± 0.556
5.117LeuThr: 5.117 ± 0.46
5.544LeuVal: 5.544 ± 0.52
1.226LeuTrp: 1.226 ± 0.237
2.079LeuTyr: 2.079 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
2.239MetAla: 2.239 ± 0.358
0.107MetCys: 0.107 ± 0.077
1.173MetAsp: 1.173 ± 0.254
1.013MetGlu: 1.013 ± 0.199
0.8MetPhe: 0.8 ± 0.209
1.652MetGly: 1.652 ± 0.255
0.107MetHis: 0.107 ± 0.071
0.746MetIle: 0.746 ± 0.219
0.906MetLys: 0.906 ± 0.204
1.812MetLeu: 1.812 ± 0.28
0.48MetMet: 0.48 ± 0.189
0.853MetAsn: 0.853 ± 0.19
1.119MetPro: 1.119 ± 0.218
0.586MetGln: 0.586 ± 0.161
1.279MetArg: 1.279 ± 0.269
2.932MetSer: 2.932 ± 0.378
2.239MetThr: 2.239 ± 0.287
1.226MetVal: 1.226 ± 0.29
0.32MetTrp: 0.32 ± 0.13
0.373MetTyr: 0.373 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
3.252AsnAla: 3.252 ± 0.372
0.267AsnCys: 0.267 ± 0.117
1.546AsnAsp: 1.546 ± 0.262
1.599AsnGlu: 1.599 ± 0.384
0.693AsnPhe: 0.693 ± 0.22
4.051AsnGly: 4.051 ± 0.522
0.906AsnHis: 0.906 ± 0.229
1.599AsnIle: 1.599 ± 0.415
1.119AsnLys: 1.119 ± 0.263
2.186AsnLeu: 2.186 ± 0.307
0.48AsnMet: 0.48 ± 0.142
1.652AsnAsn: 1.652 ± 0.306
2.345AsnPro: 2.345 ± 0.291
1.119AsnGln: 1.119 ± 0.252
1.652AsnArg: 1.652 ± 0.295
1.812AsnSer: 1.812 ± 0.341
1.972AsnThr: 1.972 ± 0.289
1.972AsnVal: 1.972 ± 0.343
0.64AsnTrp: 0.64 ± 0.166
0.906AsnTyr: 0.906 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
5.65ProAla: 5.65 ± 0.613
0.693ProCys: 0.693 ± 0.204
3.998ProAsp: 3.998 ± 0.466
4.851ProGlu: 4.851 ± 0.477
1.652ProPhe: 1.652 ± 0.265
6.397ProGly: 6.397 ± 0.61
1.279ProHis: 1.279 ± 0.275
1.972ProIle: 1.972 ± 0.344
2.452ProLys: 2.452 ± 0.347
3.998ProLeu: 3.998 ± 0.503
1.706ProMet: 1.706 ± 0.306
2.026ProAsn: 2.026 ± 0.285
3.678ProPro: 3.678 ± 0.577
2.186ProGln: 2.186 ± 0.351
3.358ProArg: 3.358 ± 0.462
3.305ProSer: 3.305 ± 0.396
3.038ProThr: 3.038 ± 0.414
4.371ProVal: 4.371 ± 0.441
1.119ProTrp: 1.119 ± 0.217
1.386ProTyr: 1.386 ± 0.255
0.0ProXaa: 0.0 ± 0.0
Gln
4.478GlnAla: 4.478 ± 0.518
0.426GlnCys: 0.426 ± 0.178
1.546GlnAsp: 1.546 ± 0.271
1.599GlnGlu: 1.599 ± 0.241
1.066GlnPhe: 1.066 ± 0.211
2.825GlnGly: 2.825 ± 0.431
1.066GlnHis: 1.066 ± 0.247
2.452GlnIle: 2.452 ± 0.357
1.386GlnLys: 1.386 ± 0.287
2.825GlnLeu: 2.825 ± 0.409
0.853GlnMet: 0.853 ± 0.227
0.746GlnAsn: 0.746 ± 0.243
2.452GlnPro: 2.452 ± 0.341
1.119GlnGln: 1.119 ± 0.257
2.452GlnArg: 2.452 ± 0.37
2.559GlnSer: 2.559 ± 0.351
1.706GlnThr: 1.706 ± 0.326
2.345GlnVal: 2.345 ± 0.261
0.746GlnTrp: 0.746 ± 0.259
0.959GlnTyr: 0.959 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
6.663ArgAla: 6.663 ± 0.629
0.8ArgCys: 0.8 ± 0.231
5.064ArgAsp: 5.064 ± 0.541
5.171ArgGlu: 5.171 ± 0.661
2.239ArgPhe: 2.239 ± 0.396
3.838ArgGly: 3.838 ± 0.364
1.652ArgHis: 1.652 ± 0.277
3.838ArgIle: 3.838 ± 0.516
2.505ArgLys: 2.505 ± 0.381
5.277ArgLeu: 5.277 ± 0.635
2.665ArgMet: 2.665 ± 0.412
2.186ArgAsn: 2.186 ± 0.381
3.891ArgPro: 3.891 ± 0.428
1.812ArgGln: 1.812 ± 0.346
6.023ArgArg: 6.023 ± 0.887
3.731ArgSer: 3.731 ± 0.426
3.465ArgThr: 3.465 ± 0.511
4.904ArgVal: 4.904 ± 0.63
1.972ArgTrp: 1.972 ± 0.35
1.759ArgTyr: 1.759 ± 0.29
0.0ArgXaa: 0.0 ± 0.0
Ser
6.716SerAla: 6.716 ± 0.953
0.48SerCys: 0.48 ± 0.162
4.158SerAsp: 4.158 ± 0.548
2.878SerGlu: 2.878 ± 0.383
2.026SerPhe: 2.026 ± 0.385
6.93SerGly: 6.93 ± 0.896
1.279SerHis: 1.279 ± 0.299
2.878SerIle: 2.878 ± 0.427
2.132SerLys: 2.132 ± 0.359
4.478SerLeu: 4.478 ± 0.581
1.013SerMet: 1.013 ± 0.189
2.132SerAsn: 2.132 ± 0.4
3.731SerPro: 3.731 ± 0.403
1.759SerGln: 1.759 ± 0.28
3.891SerArg: 3.891 ± 0.414
3.945SerSer: 3.945 ± 0.677
3.838SerThr: 3.838 ± 0.384
4.371SerVal: 4.371 ± 0.481
1.066SerTrp: 1.066 ± 0.239
1.493SerTyr: 1.493 ± 0.228
0.0SerXaa: 0.0 ± 0.0
Thr
7.036ThrAla: 7.036 ± 0.614
0.746ThrCys: 0.746 ± 0.245
4.051ThrAsp: 4.051 ± 0.541
4.158ThrGlu: 4.158 ± 0.392
2.132ThrPhe: 2.132 ± 0.325
6.237ThrGly: 6.237 ± 0.567
1.652ThrHis: 1.652 ± 0.382
3.252ThrIle: 3.252 ± 0.354
2.345ThrLys: 2.345 ± 0.336
3.945ThrLeu: 3.945 ± 0.504
1.386ThrMet: 1.386 ± 0.287
2.186ThrAsn: 2.186 ± 0.422
4.158ThrPro: 4.158 ± 0.403
1.866ThrGln: 1.866 ± 0.326
4.051ThrArg: 4.051 ± 0.562
3.571ThrSer: 3.571 ± 0.468
4.691ThrThr: 4.691 ± 0.697
6.183ThrVal: 6.183 ± 0.723
1.226ThrTrp: 1.226 ± 0.291
1.546ThrTyr: 1.546 ± 0.316
0.0ThrXaa: 0.0 ± 0.0
Val
7.303ValAla: 7.303 ± 0.528
1.333ValCys: 1.333 ± 0.252
5.33ValAsp: 5.33 ± 0.544
4.424ValGlu: 4.424 ± 0.49
2.292ValPhe: 2.292 ± 0.368
5.757ValGly: 5.757 ± 0.681
1.386ValHis: 1.386 ± 0.277
2.772ValIle: 2.772 ± 0.447
2.612ValLys: 2.612 ± 0.414
5.224ValLeu: 5.224 ± 0.541
1.013ValMet: 1.013 ± 0.183
1.759ValAsn: 1.759 ± 0.258
4.211ValPro: 4.211 ± 0.457
2.878ValGln: 2.878 ± 0.33
4.584ValArg: 4.584 ± 0.616
5.011ValSer: 5.011 ± 0.628
5.011ValThr: 5.011 ± 0.611
5.97ValVal: 5.97 ± 0.704
1.706ValTrp: 1.706 ± 0.373
1.226ValTyr: 1.226 ± 0.271
0.0ValXaa: 0.0 ± 0.0
Trp
1.812TrpAla: 1.812 ± 0.268
0.053TrpCys: 0.053 ± 0.055
1.599TrpAsp: 1.599 ± 0.284
1.066TrpGlu: 1.066 ± 0.29
0.746TrpPhe: 0.746 ± 0.21
0.853TrpGly: 0.853 ± 0.176
0.64TrpHis: 0.64 ± 0.197
1.066TrpIle: 1.066 ± 0.196
0.746TrpLys: 0.746 ± 0.173
1.812TrpLeu: 1.812 ± 0.347
1.013TrpMet: 1.013 ± 0.274
0.64TrpAsn: 0.64 ± 0.188
1.439TrpPro: 1.439 ± 0.289
1.173TrpGln: 1.173 ± 0.269
2.345TrpArg: 2.345 ± 0.433
1.279TrpSer: 1.279 ± 0.31
1.812TrpThr: 1.812 ± 0.321
1.546TrpVal: 1.546 ± 0.354
0.906TrpTrp: 0.906 ± 0.19
0.586TrpTyr: 0.586 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.559TyrAla: 2.559 ± 0.378
0.267TyrCys: 0.267 ± 0.109
1.919TyrAsp: 1.919 ± 0.408
1.546TyrGlu: 1.546 ± 0.284
0.586TyrPhe: 0.586 ± 0.165
1.972TyrGly: 1.972 ± 0.37
0.373TyrHis: 0.373 ± 0.109
1.119TyrIle: 1.119 ± 0.223
0.693TyrLys: 0.693 ± 0.189
1.706TyrLeu: 1.706 ± 0.302
0.107TyrMet: 0.107 ± 0.071
0.853TyrAsn: 0.853 ± 0.225
1.173TyrPro: 1.173 ± 0.198
0.8TyrGln: 0.8 ± 0.207
2.292TyrArg: 2.292 ± 0.37
1.173TyrSer: 1.173 ± 0.23
1.972TyrThr: 1.972 ± 0.368
2.452TyrVal: 2.452 ± 0.335
0.64TyrTrp: 0.64 ± 0.158
0.48TyrTyr: 0.48 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (18761 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski