Amino acid dipepetide frequency for Mycobacterium phage Leogania

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.183AlaAla: 11.183 ± 0.909
0.526AlaCys: 0.526 ± 0.187
5.46AlaAsp: 5.46 ± 0.74
6.973AlaGlu: 6.973 ± 0.947
4.407AlaPhe: 4.407 ± 0.483
8.091AlaGly: 8.091 ± 0.808
1.645AlaHis: 1.645 ± 0.337
4.078AlaIle: 4.078 ± 0.468
5.131AlaLys: 5.131 ± 0.647
8.025AlaLeu: 8.025 ± 0.992
2.894AlaMet: 2.894 ± 0.401
3.289AlaAsn: 3.289 ± 0.529
4.67AlaPro: 4.67 ± 0.716
3.684AlaGln: 3.684 ± 0.66
6.052AlaArg: 6.052 ± 0.681
4.67AlaSer: 4.67 ± 0.568
5.986AlaThr: 5.986 ± 0.736
7.499AlaVal: 7.499 ± 0.73
1.776AlaTrp: 1.776 ± 0.293
2.763AlaTyr: 2.763 ± 0.443
0.0AlaXaa: 0.0 ± 0.0
Cys
0.526CysAla: 0.526 ± 0.173
0.0CysCys: 0.0 ± 0.0
0.592CysAsp: 0.592 ± 0.196
0.329CysGlu: 0.329 ± 0.148
0.329CysPhe: 0.329 ± 0.169
0.46CysGly: 0.46 ± 0.179
0.263CysHis: 0.263 ± 0.178
0.132CysIle: 0.132 ± 0.095
0.197CysLys: 0.197 ± 0.101
0.724CysLeu: 0.724 ± 0.234
0.0CysMet: 0.0 ± 0.0
0.658CysAsn: 0.658 ± 0.196
0.395CysPro: 0.395 ± 0.2
0.132CysGln: 0.132 ± 0.105
0.395CysArg: 0.395 ± 0.145
0.329CysSer: 0.329 ± 0.173
0.46CysThr: 0.46 ± 0.217
0.592CysVal: 0.592 ± 0.195
0.395CysTrp: 0.395 ± 0.176
0.263CysTyr: 0.263 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
5.986AspAla: 5.986 ± 0.655
0.46AspCys: 0.46 ± 0.192
3.75AspAsp: 3.75 ± 0.52
5.394AspGlu: 5.394 ± 0.787
2.829AspPhe: 2.829 ± 0.451
5.986AspGly: 5.986 ± 0.785
1.381AspHis: 1.381 ± 0.397
3.289AspIle: 3.289 ± 0.464
1.973AspLys: 1.973 ± 0.356
5.591AspLeu: 5.591 ± 0.763
1.842AspMet: 1.842 ± 0.35
1.447AspAsn: 1.447 ± 0.305
5.131AspPro: 5.131 ± 0.684
2.302AspGln: 2.302 ± 0.356
2.565AspArg: 2.565 ± 0.394
2.96AspSer: 2.96 ± 0.416
3.421AspThr: 3.421 ± 0.552
4.276AspVal: 4.276 ± 0.517
1.052AspTrp: 1.052 ± 0.248
2.039AspTyr: 2.039 ± 0.326
0.0AspXaa: 0.0 ± 0.0
Glu
6.447GluAla: 6.447 ± 0.771
0.066GluCys: 0.066 ± 0.06
4.605GluAsp: 4.605 ± 0.543
4.67GluGlu: 4.67 ± 0.709
2.565GluPhe: 2.565 ± 0.435
4.802GluGly: 4.802 ± 0.501
1.645GluHis: 1.645 ± 0.365
3.684GluIle: 3.684 ± 0.51
2.302GluLys: 2.302 ± 0.422
8.025GluLeu: 8.025 ± 0.757
2.5GluMet: 2.5 ± 0.271
2.105GluAsn: 2.105 ± 0.33
2.96GluPro: 2.96 ± 0.532
2.434GluGln: 2.434 ± 0.33
4.407GluArg: 4.407 ± 0.526
2.894GluSer: 2.894 ± 0.342
3.815GluThr: 3.815 ± 0.438
4.473GluVal: 4.473 ± 0.588
1.513GluTrp: 1.513 ± 0.269
2.105GluTyr: 2.105 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
3.157PheAla: 3.157 ± 0.551
0.329PheCys: 0.329 ± 0.137
2.434PheAsp: 2.434 ± 0.507
2.171PheGlu: 2.171 ± 0.348
0.855PhePhe: 0.855 ± 0.269
3.289PheGly: 3.289 ± 0.485
0.724PheHis: 0.724 ± 0.328
1.447PheIle: 1.447 ± 0.316
1.184PheLys: 1.184 ± 0.278
3.092PheLeu: 3.092 ± 0.518
0.526PheMet: 0.526 ± 0.166
2.171PheAsn: 2.171 ± 0.347
1.842PhePro: 1.842 ± 0.379
1.184PheGln: 1.184 ± 0.32
2.434PheArg: 2.434 ± 0.341
1.842PheSer: 1.842 ± 0.363
2.039PheThr: 2.039 ± 0.36
2.5PheVal: 2.5 ± 0.417
0.526PheTrp: 0.526 ± 0.226
0.658PheTyr: 0.658 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
7.367GlyAla: 7.367 ± 1.168
0.789GlyCys: 0.789 ± 0.23
6.71GlyAsp: 6.71 ± 1.098
4.868GlyGlu: 4.868 ± 0.582
3.289GlyPhe: 3.289 ± 0.48
7.367GlyGly: 7.367 ± 1.478
1.71GlyHis: 1.71 ± 0.3
3.881GlyIle: 3.881 ± 0.521
3.618GlyLys: 3.618 ± 0.473
7.236GlyLeu: 7.236 ± 0.979
2.105GlyMet: 2.105 ± 0.336
3.552GlyAsn: 3.552 ± 0.657
5.854GlyPro: 5.854 ± 2.081
3.618GlyGln: 3.618 ± 0.551
3.618GlyArg: 3.618 ± 0.424
4.407GlySer: 4.407 ± 0.688
4.999GlyThr: 4.999 ± 0.716
6.71GlyVal: 6.71 ± 0.749
1.118GlyTrp: 1.118 ± 0.231
2.631GlyTyr: 2.631 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
1.118HisAla: 1.118 ± 0.3
0.329HisCys: 0.329 ± 0.139
1.579HisAsp: 1.579 ± 0.33
1.25HisGlu: 1.25 ± 0.307
0.46HisPhe: 0.46 ± 0.176
1.645HisGly: 1.645 ± 0.413
0.592HisHis: 0.592 ± 0.174
1.052HisIle: 1.052 ± 0.256
1.25HisLys: 1.25 ± 0.321
0.921HisLeu: 0.921 ± 0.248
0.329HisMet: 0.329 ± 0.152
0.526HisAsn: 0.526 ± 0.201
1.381HisPro: 1.381 ± 0.317
1.118HisGln: 1.118 ± 0.288
1.579HisArg: 1.579 ± 0.33
0.724HisSer: 0.724 ± 0.217
0.921HisThr: 0.921 ± 0.23
1.25HisVal: 1.25 ± 0.285
0.46HisTrp: 0.46 ± 0.189
0.789HisTyr: 0.789 ± 0.3
0.0HisXaa: 0.0 ± 0.0
Ile
5.46IleAla: 5.46 ± 0.622
0.526IleCys: 0.526 ± 0.163
3.552IleAsp: 3.552 ± 0.456
4.342IleGlu: 4.342 ± 0.417
1.052IlePhe: 1.052 ± 0.266
4.276IleGly: 4.276 ± 0.717
0.921IleHis: 0.921 ± 0.209
1.842IleIle: 1.842 ± 0.352
2.368IleLys: 2.368 ± 0.462
4.276IleLeu: 4.276 ± 0.465
0.46IleMet: 0.46 ± 0.164
2.302IleAsn: 2.302 ± 0.373
3.355IlePro: 3.355 ± 0.483
1.447IleGln: 1.447 ± 0.316
3.289IleArg: 3.289 ± 0.54
3.026IleSer: 3.026 ± 0.512
2.829IleThr: 2.829 ± 0.351
2.763IleVal: 2.763 ± 0.432
0.526IleTrp: 0.526 ± 0.154
1.316IleTyr: 1.316 ± 0.324
0.0IleXaa: 0.0 ± 0.0
Lys
5.328LysAla: 5.328 ± 0.697
0.197LysCys: 0.197 ± 0.13
1.842LysAsp: 1.842 ± 0.36
2.171LysGlu: 2.171 ± 0.433
0.921LysPhe: 0.921 ± 0.25
5.131LysGly: 5.131 ± 1.041
0.46LysHis: 0.46 ± 0.163
2.5LysIle: 2.5 ± 0.388
2.894LysLys: 2.894 ± 0.504
3.881LysLeu: 3.881 ± 0.602
1.118LysMet: 1.118 ± 0.259
1.513LysAsn: 1.513 ± 0.289
3.223LysPro: 3.223 ± 0.59
1.842LysGln: 1.842 ± 0.392
2.829LysArg: 2.829 ± 0.438
2.039LysSer: 2.039 ± 0.396
2.5LysThr: 2.5 ± 0.399
3.684LysVal: 3.684 ± 0.571
1.052LysTrp: 1.052 ± 0.248
1.316LysTyr: 1.316 ± 0.282
0.0LysXaa: 0.0 ± 0.0
Leu
8.88LeuAla: 8.88 ± 0.964
0.724LeuCys: 0.724 ± 0.248
5.394LeuAsp: 5.394 ± 0.621
5.328LeuGlu: 5.328 ± 0.684
2.368LeuPhe: 2.368 ± 0.296
6.512LeuGly: 6.512 ± 0.963
1.71LeuHis: 1.71 ± 0.442
4.407LeuIle: 4.407 ± 0.593
3.486LeuLys: 3.486 ± 0.465
4.736LeuLeu: 4.736 ± 0.579
2.894LeuMet: 2.894 ± 0.561
2.171LeuAsn: 2.171 ± 0.42
5.197LeuPro: 5.197 ± 0.588
2.96LeuGln: 2.96 ± 0.545
6.447LeuArg: 6.447 ± 0.745
5.657LeuSer: 5.657 ± 0.625
5.46LeuThr: 5.46 ± 0.631
4.144LeuVal: 4.144 ± 0.596
1.25LeuTrp: 1.25 ± 0.259
2.302LeuTyr: 2.302 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
2.829MetAla: 2.829 ± 0.489
0.066MetCys: 0.066 ± 0.066
1.118MetAsp: 1.118 ± 0.298
1.645MetGlu: 1.645 ± 0.315
0.789MetPhe: 0.789 ± 0.2
1.842MetGly: 1.842 ± 0.328
0.46MetHis: 0.46 ± 0.164
1.447MetIle: 1.447 ± 0.257
1.447MetLys: 1.447 ± 0.333
1.447MetLeu: 1.447 ± 0.367
0.658MetMet: 0.658 ± 0.213
0.724MetAsn: 0.724 ± 0.198
1.25MetPro: 1.25 ± 0.316
0.789MetGln: 0.789 ± 0.224
1.513MetArg: 1.513 ± 0.331
2.039MetSer: 2.039 ± 0.287
2.302MetThr: 2.302 ± 0.361
1.579MetVal: 1.579 ± 0.324
0.197MetTrp: 0.197 ± 0.104
0.921MetTyr: 0.921 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
3.289AsnAla: 3.289 ± 0.579
0.46AsnCys: 0.46 ± 0.173
1.908AsnAsp: 1.908 ± 0.357
2.302AsnGlu: 2.302 ± 0.416
1.25AsnPhe: 1.25 ± 0.332
3.815AsnGly: 3.815 ± 0.565
0.921AsnHis: 0.921 ± 0.203
1.842AsnIle: 1.842 ± 0.299
1.25AsnLys: 1.25 ± 0.304
2.829AsnLeu: 2.829 ± 0.422
0.526AsnMet: 0.526 ± 0.178
0.329AsnAsn: 0.329 ± 0.132
2.105AsnPro: 2.105 ± 0.334
0.592AsnGln: 0.592 ± 0.189
2.368AsnArg: 2.368 ± 0.396
2.039AsnSer: 2.039 ± 0.38
1.579AsnThr: 1.579 ± 0.339
2.105AsnVal: 2.105 ± 0.325
0.724AsnTrp: 0.724 ± 0.236
1.118AsnTyr: 1.118 ± 0.237
0.0AsnXaa: 0.0 ± 0.0
Pro
5.591ProAla: 5.591 ± 0.699
0.263ProCys: 0.263 ± 0.155
4.144ProAsp: 4.144 ± 0.596
5.197ProGlu: 5.197 ± 0.592
1.71ProPhe: 1.71 ± 0.343
4.999ProGly: 4.999 ± 0.827
1.052ProHis: 1.052 ± 0.259
2.96ProIle: 2.96 ± 0.436
3.421ProLys: 3.421 ± 0.842
3.486ProLeu: 3.486 ± 0.556
1.052ProMet: 1.052 ± 0.297
2.105ProAsn: 2.105 ± 0.407
3.092ProPro: 3.092 ± 0.476
2.434ProGln: 2.434 ± 0.71
3.618ProArg: 3.618 ± 0.591
2.697ProSer: 2.697 ± 0.486
3.881ProThr: 3.881 ± 0.512
3.947ProVal: 3.947 ± 0.447
1.316ProTrp: 1.316 ± 0.429
1.513ProTyr: 1.513 ± 0.29
0.0ProXaa: 0.0 ± 0.0
Gln
4.407GlnAla: 4.407 ± 0.676
0.197GlnCys: 0.197 ± 0.117
1.447GlnAsp: 1.447 ± 0.267
1.776GlnGlu: 1.776 ± 0.417
1.513GlnPhe: 1.513 ± 0.243
3.75GlnGly: 3.75 ± 1.045
0.526GlnHis: 0.526 ± 0.204
2.565GlnIle: 2.565 ± 0.414
1.513GlnLys: 1.513 ± 0.324
3.355GlnLeu: 3.355 ± 0.58
0.987GlnMet: 0.987 ± 0.264
0.987GlnAsn: 0.987 ± 0.326
1.381GlnPro: 1.381 ± 0.381
2.039GlnGln: 2.039 ± 0.416
2.171GlnArg: 2.171 ± 0.367
1.776GlnSer: 1.776 ± 0.338
2.105GlnThr: 2.105 ± 0.434
3.421GlnVal: 3.421 ± 0.435
0.724GlnTrp: 0.724 ± 0.222
0.921GlnTyr: 0.921 ± 0.206
0.0GlnXaa: 0.0 ± 0.0
Arg
5.723ArgAla: 5.723 ± 0.753
0.526ArgCys: 0.526 ± 0.235
4.013ArgAsp: 4.013 ± 0.531
4.736ArgGlu: 4.736 ± 0.825
2.434ArgPhe: 2.434 ± 0.484
4.802ArgGly: 4.802 ± 0.735
1.513ArgHis: 1.513 ± 0.358
3.618ArgIle: 3.618 ± 0.497
3.421ArgLys: 3.421 ± 0.555
5.92ArgLeu: 5.92 ± 0.592
1.71ArgMet: 1.71 ± 0.345
1.973ArgAsn: 1.973 ± 0.31
2.763ArgPro: 2.763 ± 0.425
2.237ArgGln: 2.237 ± 0.464
5.131ArgArg: 5.131 ± 0.641
2.763ArgSer: 2.763 ± 0.509
2.631ArgThr: 2.631 ± 0.415
4.276ArgVal: 4.276 ± 0.558
0.987ArgTrp: 0.987 ± 0.243
1.513ArgTyr: 1.513 ± 0.364
0.0ArgXaa: 0.0 ± 0.0
Ser
5.131SerAla: 5.131 ± 0.617
0.329SerCys: 0.329 ± 0.137
3.552SerAsp: 3.552 ± 0.558
3.289SerGlu: 3.289 ± 0.426
1.776SerPhe: 1.776 ± 0.373
4.342SerGly: 4.342 ± 0.593
0.658SerHis: 0.658 ± 0.227
2.434SerIle: 2.434 ± 0.391
2.368SerLys: 2.368 ± 0.515
4.67SerLeu: 4.67 ± 0.704
1.579SerMet: 1.579 ± 0.306
1.184SerAsn: 1.184 ± 0.313
3.157SerPro: 3.157 ± 0.527
2.237SerGln: 2.237 ± 0.397
3.618SerArg: 3.618 ± 0.512
2.368SerSer: 2.368 ± 0.418
3.355SerThr: 3.355 ± 0.43
3.421SerVal: 3.421 ± 0.448
1.118SerTrp: 1.118 ± 0.247
1.184SerTyr: 1.184 ± 0.305
0.0SerXaa: 0.0 ± 0.0
Thr
5.131ThrAla: 5.131 ± 0.628
0.395ThrCys: 0.395 ± 0.137
3.486ThrAsp: 3.486 ± 0.52
3.815ThrGlu: 3.815 ± 0.505
2.105ThrPhe: 2.105 ± 0.432
5.92ThrGly: 5.92 ± 1.26
0.855ThrHis: 0.855 ± 0.265
2.631ThrIle: 2.631 ± 0.445
3.092ThrLys: 3.092 ± 0.608
4.868ThrLeu: 4.868 ± 0.862
1.447ThrMet: 1.447 ± 0.275
1.776ThrAsn: 1.776 ± 0.289
5.065ThrPro: 5.065 ± 0.672
1.973ThrGln: 1.973 ± 0.303
2.763ThrArg: 2.763 ± 0.442
3.157ThrSer: 3.157 ± 0.406
3.157ThrThr: 3.157 ± 0.548
4.934ThrVal: 4.934 ± 0.617
1.25ThrTrp: 1.25 ± 0.256
1.513ThrTyr: 1.513 ± 0.263
0.0ThrXaa: 0.0 ± 0.0
Val
6.841ValAla: 6.841 ± 0.969
0.592ValCys: 0.592 ± 0.188
5.262ValAsp: 5.262 ± 0.457
4.605ValGlu: 4.605 ± 0.646
2.171ValPhe: 2.171 ± 0.441
5.065ValGly: 5.065 ± 0.555
1.316ValHis: 1.316 ± 0.313
3.421ValIle: 3.421 ± 0.53
3.881ValLys: 3.881 ± 0.41
5.526ValLeu: 5.526 ± 0.641
1.447ValMet: 1.447 ± 0.34
2.829ValAsn: 2.829 ± 0.512
3.289ValPro: 3.289 ± 0.557
2.171ValGln: 2.171 ± 0.454
4.67ValArg: 4.67 ± 0.695
3.881ValSer: 3.881 ± 0.588
4.736ValThr: 4.736 ± 0.547
5.394ValVal: 5.394 ± 0.566
1.447ValTrp: 1.447 ± 0.321
1.908ValTyr: 1.908 ± 0.417
0.0ValXaa: 0.0 ± 0.0
Trp
1.645TrpAla: 1.645 ± 0.419
0.263TrpCys: 0.263 ± 0.158
0.921TrpAsp: 0.921 ± 0.229
0.921TrpGlu: 0.921 ± 0.24
0.789TrpPhe: 0.789 ± 0.217
1.381TrpGly: 1.381 ± 0.278
0.46TrpHis: 0.46 ± 0.165
1.25TrpIle: 1.25 ± 0.236
0.526TrpLys: 0.526 ± 0.196
1.052TrpLeu: 1.052 ± 0.267
0.395TrpMet: 0.395 ± 0.142
0.855TrpAsn: 0.855 ± 0.27
0.724TrpPro: 0.724 ± 0.26
1.184TrpGln: 1.184 ± 0.293
1.184TrpArg: 1.184 ± 0.19
0.987TrpSer: 0.987 ± 0.307
1.71TrpThr: 1.71 ± 0.341
1.184TrpVal: 1.184 ± 0.213
0.395TrpTrp: 0.395 ± 0.19
0.46TrpTyr: 0.46 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.763TyrAla: 2.763 ± 0.487
0.132TyrCys: 0.132 ± 0.09
1.908TyrAsp: 1.908 ± 0.338
2.237TyrGlu: 2.237 ± 0.333
0.789TyrPhe: 0.789 ± 0.217
1.973TyrGly: 1.973 ± 0.341
0.46TyrHis: 0.46 ± 0.18
1.447TyrIle: 1.447 ± 0.278
1.118TyrLys: 1.118 ± 0.253
2.368TyrLeu: 2.368 ± 0.358
0.526TyrMet: 0.526 ± 0.212
0.855TyrAsn: 0.855 ± 0.223
1.645TyrPro: 1.645 ± 0.332
1.184TyrGln: 1.184 ± 0.318
2.039TyrArg: 2.039 ± 0.395
1.579TyrSer: 1.579 ± 0.28
1.447TyrThr: 1.447 ± 0.268
2.302TyrVal: 2.302 ± 0.418
0.46TyrTrp: 0.46 ± 0.17
0.592TyrTyr: 0.592 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (15203 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski