Amino acid dipepetide frequency for Mycobacterium phage Phlei

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.562AlaAla: 8.562 ± 0.784
0.654AlaCys: 0.654 ± 0.194
6.275AlaAsp: 6.275 ± 0.856
7.516AlaGlu: 7.516 ± 0.635
3.66AlaPhe: 3.66 ± 0.515
7.647AlaGly: 7.647 ± 0.698
1.307AlaHis: 1.307 ± 0.242
4.902AlaIle: 4.902 ± 0.479
4.641AlaLys: 4.641 ± 0.562
7.974AlaLeu: 7.974 ± 0.708
2.222AlaMet: 2.222 ± 0.398
3.595AlaAsn: 3.595 ± 0.701
4.575AlaPro: 4.575 ± 0.735
3.529AlaGln: 3.529 ± 0.512
5.098AlaArg: 5.098 ± 0.589
5.621AlaSer: 5.621 ± 0.584
5.098AlaThr: 5.098 ± 0.725
6.797AlaVal: 6.797 ± 0.77
1.307AlaTrp: 1.307 ± 0.263
2.81AlaTyr: 2.81 ± 0.381
0.0AlaXaa: 0.0 ± 0.0
Cys
0.523CysAla: 0.523 ± 0.176
0.0CysCys: 0.0 ± 0.0
0.719CysAsp: 0.719 ± 0.234
0.392CysGlu: 0.392 ± 0.144
0.327CysPhe: 0.327 ± 0.17
0.915CysGly: 0.915 ± 0.232
0.196CysHis: 0.196 ± 0.099
0.392CysIle: 0.392 ± 0.146
0.261CysLys: 0.261 ± 0.131
0.654CysLeu: 0.654 ± 0.189
0.327CysMet: 0.327 ± 0.146
0.458CysAsn: 0.458 ± 0.157
0.654CysPro: 0.654 ± 0.187
0.392CysGln: 0.392 ± 0.151
0.458CysArg: 0.458 ± 0.168
0.523CysSer: 0.523 ± 0.208
0.392CysThr: 0.392 ± 0.168
0.327CysVal: 0.327 ± 0.128
0.327CysTrp: 0.327 ± 0.156
0.261CysTyr: 0.261 ± 0.099
0.0CysXaa: 0.0 ± 0.0
Asp
5.686AspAla: 5.686 ± 0.61
0.458AspCys: 0.458 ± 0.157
4.575AspAsp: 4.575 ± 0.808
5.033AspGlu: 5.033 ± 0.594
2.353AspPhe: 2.353 ± 0.388
5.948AspGly: 5.948 ± 0.664
1.176AspHis: 1.176 ± 0.273
3.987AspIle: 3.987 ± 0.547
3.072AspLys: 3.072 ± 0.487
6.667AspLeu: 6.667 ± 0.681
1.699AspMet: 1.699 ± 0.329
1.961AspAsn: 1.961 ± 0.365
4.51AspPro: 4.51 ± 0.47
1.83AspGln: 1.83 ± 0.335
3.203AspArg: 3.203 ± 0.526
3.007AspSer: 3.007 ± 0.345
4.706AspThr: 4.706 ± 0.568
5.882AspVal: 5.882 ± 0.68
1.895AspTrp: 1.895 ± 0.307
2.81AspTyr: 2.81 ± 0.384
0.0AspXaa: 0.0 ± 0.0
Glu
7.386GluAla: 7.386 ± 0.623
0.327GluCys: 0.327 ± 0.145
4.902GluAsp: 4.902 ± 0.522
5.425GluGlu: 5.425 ± 0.691
3.137GluPhe: 3.137 ± 0.479
5.49GluGly: 5.49 ± 0.558
1.046GluHis: 1.046 ± 0.22
5.556GluIle: 5.556 ± 0.573
3.333GluLys: 3.333 ± 0.414
6.993GluLeu: 6.993 ± 0.878
2.157GluMet: 2.157 ± 0.36
2.157GluAsn: 2.157 ± 0.353
2.68GluPro: 2.68 ± 0.496
2.484GluGln: 2.484 ± 0.369
4.641GluArg: 4.641 ± 0.629
3.203GluSer: 3.203 ± 0.537
3.791GluThr: 3.791 ± 0.631
5.621GluVal: 5.621 ± 0.622
1.242GluTrp: 1.242 ± 0.261
2.157GluTyr: 2.157 ± 0.317
0.0GluXaa: 0.0 ± 0.0
Phe
2.81PheAla: 2.81 ± 0.422
0.261PheCys: 0.261 ± 0.134
2.68PheAsp: 2.68 ± 0.364
2.288PheGlu: 2.288 ± 0.38
0.784PhePhe: 0.784 ± 0.24
3.268PheGly: 3.268 ± 0.501
0.719PheHis: 0.719 ± 0.212
1.83PheIle: 1.83 ± 0.29
1.242PheLys: 1.242 ± 0.236
1.961PheLeu: 1.961 ± 0.32
0.588PheMet: 0.588 ± 0.176
2.288PheAsn: 2.288 ± 0.363
1.634PhePro: 1.634 ± 0.305
1.111PheGln: 1.111 ± 0.209
1.503PheArg: 1.503 ± 0.329
2.484PheSer: 2.484 ± 0.277
2.484PheThr: 2.484 ± 0.367
2.418PheVal: 2.418 ± 0.362
0.327PheTrp: 0.327 ± 0.144
1.111PheTyr: 1.111 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
5.948GlyAla: 5.948 ± 0.703
0.85GlyCys: 0.85 ± 0.215
6.34GlyAsp: 6.34 ± 0.727
6.013GlyGlu: 6.013 ± 0.56
3.399GlyPhe: 3.399 ± 0.487
6.536GlyGly: 6.536 ± 1.334
1.699GlyHis: 1.699 ± 0.297
3.464GlyIle: 3.464 ± 0.471
3.203GlyLys: 3.203 ± 0.554
7.059GlyLeu: 7.059 ± 0.8
2.092GlyMet: 2.092 ± 0.288
3.007GlyAsn: 3.007 ± 0.548
3.268GlyPro: 3.268 ± 0.394
2.484GlyGln: 2.484 ± 0.337
2.876GlyArg: 2.876 ± 0.343
4.379GlySer: 4.379 ± 0.723
4.771GlyThr: 4.771 ± 0.604
6.013GlyVal: 6.013 ± 0.651
2.288GlyTrp: 2.288 ± 0.449
2.418GlyTyr: 2.418 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
0.98HisAla: 0.98 ± 0.229
0.131HisCys: 0.131 ± 0.086
1.242HisAsp: 1.242 ± 0.23
1.242HisGlu: 1.242 ± 0.253
0.392HisPhe: 0.392 ± 0.126
1.242HisGly: 1.242 ± 0.332
0.523HisHis: 0.523 ± 0.179
0.915HisIle: 0.915 ± 0.275
0.915HisLys: 0.915 ± 0.251
1.307HisLeu: 1.307 ± 0.282
0.261HisMet: 0.261 ± 0.113
0.392HisAsn: 0.392 ± 0.157
1.046HisPro: 1.046 ± 0.25
1.046HisGln: 1.046 ± 0.237
1.569HisArg: 1.569 ± 0.256
0.392HisSer: 0.392 ± 0.15
0.915HisThr: 0.915 ± 0.228
1.242HisVal: 1.242 ± 0.406
0.261HisTrp: 0.261 ± 0.159
0.392HisTyr: 0.392 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
5.686IleAla: 5.686 ± 0.514
0.523IleCys: 0.523 ± 0.192
4.314IleAsp: 4.314 ± 0.486
5.621IleGlu: 5.621 ± 0.531
1.373IlePhe: 1.373 ± 0.317
3.987IleGly: 3.987 ± 0.501
0.98IleHis: 0.98 ± 0.262
1.961IleIle: 1.961 ± 0.338
2.026IleLys: 2.026 ± 0.355
4.248IleLeu: 4.248 ± 0.455
0.98IleMet: 0.98 ± 0.227
2.157IleAsn: 2.157 ± 0.433
3.203IlePro: 3.203 ± 0.435
1.373IleGln: 1.373 ± 0.372
4.118IleArg: 4.118 ± 0.458
2.418IleSer: 2.418 ± 0.386
3.725IleThr: 3.725 ± 0.367
3.529IleVal: 3.529 ± 0.522
0.915IleTrp: 0.915 ± 0.27
1.242IleTyr: 1.242 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
5.556LysAla: 5.556 ± 0.7
0.261LysCys: 0.261 ± 0.136
2.81LysAsp: 2.81 ± 0.403
2.157LysGlu: 2.157 ± 0.432
1.699LysPhe: 1.699 ± 0.325
3.464LysGly: 3.464 ± 0.464
0.458LysHis: 0.458 ± 0.146
2.68LysIle: 2.68 ± 0.411
2.876LysLys: 2.876 ± 0.436
3.268LysLeu: 3.268 ± 0.468
1.046LysMet: 1.046 ± 0.259
1.176LysAsn: 1.176 ± 0.289
2.745LysPro: 2.745 ± 0.459
1.895LysGln: 1.895 ± 0.298
3.268LysArg: 3.268 ± 0.583
2.745LysSer: 2.745 ± 0.423
2.353LysThr: 2.353 ± 0.384
4.314LysVal: 4.314 ± 0.421
0.915LysTrp: 0.915 ± 0.211
1.307LysTyr: 1.307 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
8.562LeuAla: 8.562 ± 0.687
0.719LeuCys: 0.719 ± 0.215
5.425LeuAsp: 5.425 ± 0.653
6.732LeuGlu: 6.732 ± 0.628
2.484LeuPhe: 2.484 ± 0.369
5.882LeuGly: 5.882 ± 0.554
1.242LeuHis: 1.242 ± 0.295
4.641LeuIle: 4.641 ± 0.407
4.641LeuLys: 4.641 ± 0.536
6.078LeuLeu: 6.078 ± 0.645
2.484LeuMet: 2.484 ± 0.399
3.203LeuAsn: 3.203 ± 0.354
4.052LeuPro: 4.052 ± 0.509
1.699LeuGln: 1.699 ± 0.394
4.902LeuArg: 4.902 ± 0.484
5.425LeuSer: 5.425 ± 0.477
4.314LeuThr: 4.314 ± 0.542
5.163LeuVal: 5.163 ± 0.482
1.242LeuTrp: 1.242 ± 0.268
2.092LeuTyr: 2.092 ± 0.347
0.0LeuXaa: 0.0 ± 0.0
Met
2.81MetAla: 2.81 ± 0.402
0.0MetCys: 0.0 ± 0.0
1.569MetAsp: 1.569 ± 0.367
1.438MetGlu: 1.438 ± 0.245
0.588MetPhe: 0.588 ± 0.177
1.634MetGly: 1.634 ± 0.334
0.261MetHis: 0.261 ± 0.105
1.373MetIle: 1.373 ± 0.262
1.503MetLys: 1.503 ± 0.314
1.373MetLeu: 1.373 ± 0.269
0.392MetMet: 0.392 ± 0.164
1.373MetAsn: 1.373 ± 0.285
0.85MetPro: 0.85 ± 0.224
0.98MetGln: 0.98 ± 0.266
1.176MetArg: 1.176 ± 0.296
1.895MetSer: 1.895 ± 0.268
2.418MetThr: 2.418 ± 0.316
0.915MetVal: 0.915 ± 0.193
0.196MetTrp: 0.196 ± 0.105
0.85MetTyr: 0.85 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
3.791AsnAla: 3.791 ± 0.407
0.588AsnCys: 0.588 ± 0.181
2.418AsnAsp: 2.418 ± 0.35
2.68AsnGlu: 2.68 ± 0.438
0.915AsnPhe: 0.915 ± 0.247
3.856AsnGly: 3.856 ± 0.593
1.438AsnHis: 1.438 ± 0.281
1.895AsnIle: 1.895 ± 0.325
1.569AsnLys: 1.569 ± 0.33
2.745AsnLeu: 2.745 ± 0.403
0.915AsnMet: 0.915 ± 0.254
1.176AsnAsn: 1.176 ± 0.222
2.288AsnPro: 2.288 ± 0.414
0.654AsnGln: 0.654 ± 0.224
2.157AsnArg: 2.157 ± 0.382
1.895AsnSer: 1.895 ± 0.349
1.765AsnThr: 1.765 ± 0.265
1.895AsnVal: 1.895 ± 0.328
0.784AsnTrp: 0.784 ± 0.193
0.98AsnTyr: 0.98 ± 0.268
0.0AsnXaa: 0.0 ± 0.0
Pro
3.66ProAla: 3.66 ± 0.437
0.588ProCys: 0.588 ± 0.195
3.66ProAsp: 3.66 ± 0.426
3.987ProGlu: 3.987 ± 0.449
2.092ProPhe: 2.092 ± 0.349
4.314ProGly: 4.314 ± 0.733
0.98ProHis: 0.98 ± 0.24
2.941ProIle: 2.941 ± 0.362
2.745ProLys: 2.745 ± 0.453
3.007ProLeu: 3.007 ± 0.48
0.654ProMet: 0.654 ± 0.246
2.418ProAsn: 2.418 ± 0.393
2.484ProPro: 2.484 ± 0.439
1.765ProGln: 1.765 ± 0.384
2.222ProArg: 2.222 ± 0.447
2.81ProSer: 2.81 ± 0.391
3.399ProThr: 3.399 ± 0.531
4.118ProVal: 4.118 ± 0.531
1.373ProTrp: 1.373 ± 0.436
1.111ProTyr: 1.111 ± 0.223
0.0ProXaa: 0.0 ± 0.0
Gln
3.66GlnAla: 3.66 ± 0.404
0.327GlnCys: 0.327 ± 0.142
1.83GlnAsp: 1.83 ± 0.325
2.092GlnGlu: 2.092 ± 0.387
1.307GlnPhe: 1.307 ± 0.236
1.765GlnGly: 1.765 ± 0.27
0.523GlnHis: 0.523 ± 0.178
2.157GlnIle: 2.157 ± 0.342
1.373GlnLys: 1.373 ± 0.317
3.007GlnLeu: 3.007 ± 0.491
0.98GlnMet: 0.98 ± 0.236
0.85GlnAsn: 0.85 ± 0.283
1.242GlnPro: 1.242 ± 0.277
2.157GlnGln: 2.157 ± 0.413
1.961GlnArg: 1.961 ± 0.387
1.895GlnSer: 1.895 ± 0.366
1.634GlnThr: 1.634 ± 0.296
2.81GlnVal: 2.81 ± 0.375
0.784GlnTrp: 0.784 ± 0.22
1.503GlnTyr: 1.503 ± 0.289
0.0GlnXaa: 0.0 ± 0.0
Arg
5.817ArgAla: 5.817 ± 0.669
0.784ArgCys: 0.784 ± 0.325
3.007ArgAsp: 3.007 ± 0.39
3.987ArgGlu: 3.987 ± 0.557
2.484ArgPhe: 2.484 ± 0.361
3.464ArgGly: 3.464 ± 0.522
0.98ArgHis: 0.98 ± 0.264
3.464ArgIle: 3.464 ± 0.469
3.268ArgLys: 3.268 ± 0.456
4.706ArgLeu: 4.706 ± 0.564
1.634ArgMet: 1.634 ± 0.318
2.484ArgAsn: 2.484 ± 0.388
2.418ArgPro: 2.418 ± 0.388
1.961ArgGln: 1.961 ± 0.311
5.425ArgArg: 5.425 ± 0.748
2.745ArgSer: 2.745 ± 0.392
2.941ArgThr: 2.941 ± 0.328
3.856ArgVal: 3.856 ± 0.436
0.98ArgTrp: 0.98 ± 0.29
2.353ArgTyr: 2.353 ± 0.39
0.0ArgXaa: 0.0 ± 0.0
Ser
5.294SerAla: 5.294 ± 0.576
0.458SerCys: 0.458 ± 0.208
4.052SerAsp: 4.052 ± 0.462
3.791SerGlu: 3.791 ± 0.574
1.503SerPhe: 1.503 ± 0.326
4.902SerGly: 4.902 ± 0.706
0.588SerHis: 0.588 ± 0.168
3.007SerIle: 3.007 ± 0.376
1.961SerLys: 1.961 ± 0.37
4.314SerLeu: 4.314 ± 0.479
1.373SerMet: 1.373 ± 0.254
2.222SerAsn: 2.222 ± 0.33
3.203SerPro: 3.203 ± 0.466
1.961SerGln: 1.961 ± 0.316
4.444SerArg: 4.444 ± 0.508
3.268SerSer: 3.268 ± 0.523
2.222SerThr: 2.222 ± 0.403
3.399SerVal: 3.399 ± 0.399
1.111SerTrp: 1.111 ± 0.289
2.484SerTyr: 2.484 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
6.013ThrAla: 6.013 ± 0.826
0.458ThrCys: 0.458 ± 0.149
4.052ThrAsp: 4.052 ± 0.553
3.66ThrGlu: 3.66 ± 0.431
2.157ThrPhe: 2.157 ± 0.421
5.033ThrGly: 5.033 ± 0.566
0.654ThrHis: 0.654 ± 0.225
2.614ThrIle: 2.614 ± 0.409
2.68ThrLys: 2.68 ± 0.296
6.013ThrLeu: 6.013 ± 0.51
1.111ThrMet: 1.111 ± 0.247
1.634ThrAsn: 1.634 ± 0.293
3.66ThrPro: 3.66 ± 0.506
2.026ThrGln: 2.026 ± 0.338
2.222ThrArg: 2.222 ± 0.461
3.268ThrSer: 3.268 ± 0.483
3.203ThrThr: 3.203 ± 0.407
4.837ThrVal: 4.837 ± 0.585
1.046ThrTrp: 1.046 ± 0.253
2.092ThrTyr: 2.092 ± 0.373
0.0ThrXaa: 0.0 ± 0.0
Val
6.601ValAla: 6.601 ± 0.61
0.654ValCys: 0.654 ± 0.201
6.471ValAsp: 6.471 ± 0.667
5.425ValGlu: 5.425 ± 0.608
1.699ValPhe: 1.699 ± 0.267
5.294ValGly: 5.294 ± 0.542
0.85ValHis: 0.85 ± 0.196
3.072ValIle: 3.072 ± 0.388
4.052ValLys: 4.052 ± 0.577
5.752ValLeu: 5.752 ± 0.653
1.307ValMet: 1.307 ± 0.337
2.222ValAsn: 2.222 ± 0.482
3.725ValPro: 3.725 ± 0.624
2.549ValGln: 2.549 ± 0.417
3.922ValArg: 3.922 ± 0.5
4.118ValSer: 4.118 ± 0.571
5.033ValThr: 5.033 ± 0.571
6.275ValVal: 6.275 ± 0.758
1.83ValTrp: 1.83 ± 0.314
2.353ValTyr: 2.353 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
1.895TrpAla: 1.895 ± 0.456
0.261TrpCys: 0.261 ± 0.113
1.307TrpAsp: 1.307 ± 0.249
1.307TrpGlu: 1.307 ± 0.255
0.458TrpPhe: 0.458 ± 0.156
1.569TrpGly: 1.569 ± 0.303
0.261TrpHis: 0.261 ± 0.154
1.634TrpIle: 1.634 ± 0.336
0.588TrpLys: 0.588 ± 0.191
0.98TrpLeu: 0.98 ± 0.173
0.719TrpMet: 0.719 ± 0.269
0.85TrpAsn: 0.85 ± 0.215
0.654TrpPro: 0.654 ± 0.226
0.98TrpGln: 0.98 ± 0.255
1.373TrpArg: 1.373 ± 0.265
1.438TrpSer: 1.438 ± 0.323
1.307TrpThr: 1.307 ± 0.272
1.373TrpVal: 1.373 ± 0.29
0.654TrpTrp: 0.654 ± 0.209
0.85TrpTyr: 0.85 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.549TyrAla: 2.549 ± 0.463
0.196TyrCys: 0.196 ± 0.114
2.81TyrAsp: 2.81 ± 0.345
2.941TyrGlu: 2.941 ± 0.4
0.98TyrPhe: 0.98 ± 0.25
1.895TyrGly: 1.895 ± 0.332
0.523TyrHis: 0.523 ± 0.145
1.895TyrIle: 1.895 ± 0.368
1.046TyrLys: 1.046 ± 0.21
2.941TyrLeu: 2.941 ± 0.38
0.588TyrMet: 0.588 ± 0.185
0.784TyrAsn: 0.784 ± 0.206
1.438TyrPro: 1.438 ± 0.243
0.98TyrGln: 0.98 ± 0.23
2.222TyrArg: 2.222 ± 0.384
2.026TyrSer: 2.026 ± 0.343
1.961TyrThr: 1.961 ± 0.396
2.418TyrVal: 2.418 ± 0.414
0.98TyrTrp: 0.98 ± 0.29
1.111TyrTyr: 1.111 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (15301 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski