Amino acid dipepetide frequency for Microbacterium phage Jacko

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.613AlaAla: 12.613 ± 1.117
0.491AlaCys: 0.491 ± 0.165
6.279AlaAsp: 6.279 ± 0.569
8.081AlaGlu: 8.081 ± 0.805
3.331AlaPhe: 3.331 ± 0.43
6.771AlaGly: 6.771 ± 0.918
1.911AlaHis: 1.911 ± 0.301
5.351AlaIle: 5.351 ± 0.583
5.133AlaLys: 5.133 ± 0.636
9.392AlaLeu: 9.392 ± 0.993
3.331AlaMet: 3.331 ± 0.48
3.822AlaAsn: 3.822 ± 0.535
4.314AlaPro: 4.314 ± 0.432
3.222AlaGln: 3.222 ± 0.492
7.699AlaArg: 7.699 ± 0.714
7.098AlaSer: 7.098 ± 0.567
5.57AlaThr: 5.57 ± 0.657
7.262AlaVal: 7.262 ± 0.564
2.184AlaTrp: 2.184 ± 0.384
2.403AlaTyr: 2.403 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
0.273CysAla: 0.273 ± 0.119
0.055CysCys: 0.055 ± 0.051
0.218CysAsp: 0.218 ± 0.089
0.491CysGlu: 0.491 ± 0.165
0.273CysPhe: 0.273 ± 0.144
0.382CysGly: 0.382 ± 0.17
0.109CysHis: 0.109 ± 0.073
0.164CysIle: 0.164 ± 0.141
0.328CysLys: 0.328 ± 0.149
0.218CysLeu: 0.218 ± 0.1
0.109CysMet: 0.109 ± 0.089
0.109CysAsn: 0.109 ± 0.068
0.273CysPro: 0.273 ± 0.127
0.0CysGln: 0.0 ± 0.0
0.655CysArg: 0.655 ± 0.211
0.273CysSer: 0.273 ± 0.126
0.218CysThr: 0.218 ± 0.118
0.218CysVal: 0.218 ± 0.109
0.109CysTrp: 0.109 ± 0.078
0.055CysTyr: 0.055 ± 0.05
0.0CysXaa: 0.0 ± 0.0
Asp
7.371AspAla: 7.371 ± 0.709
0.437AspCys: 0.437 ± 0.175
4.696AspAsp: 4.696 ± 0.493
4.86AspGlu: 4.86 ± 0.613
2.348AspPhe: 2.348 ± 0.296
4.587AspGly: 4.587 ± 0.413
1.092AspHis: 1.092 ± 0.221
3.877AspIle: 3.877 ± 0.452
2.403AspLys: 2.403 ± 0.365
5.242AspLeu: 5.242 ± 0.617
1.474AspMet: 1.474 ± 0.299
1.911AspAsn: 1.911 ± 0.312
3.658AspPro: 3.658 ± 0.559
1.802AspGln: 1.802 ± 0.365
3.495AspArg: 3.495 ± 0.381
3.986AspSer: 3.986 ± 0.401
3.058AspThr: 3.058 ± 0.44
3.931AspVal: 3.931 ± 0.406
1.638AspTrp: 1.638 ± 0.37
1.911AspTyr: 1.911 ± 0.405
0.0AspXaa: 0.0 ± 0.0
Glu
9.556GluAla: 9.556 ± 0.929
0.382GluCys: 0.382 ± 0.127
4.423GluAsp: 4.423 ± 0.479
5.897GluGlu: 5.897 ± 0.657
2.02GluPhe: 2.02 ± 0.311
4.75GluGly: 4.75 ± 0.484
1.474GluHis: 1.474 ± 0.307
4.477GluIle: 4.477 ± 0.586
3.713GluLys: 3.713 ± 0.515
5.406GluLeu: 5.406 ± 0.551
2.239GluMet: 2.239 ± 0.396
2.839GluAsn: 2.839 ± 0.426
3.058GluPro: 3.058 ± 0.413
3.331GluGln: 3.331 ± 0.41
6.389GluArg: 6.389 ± 0.617
3.003GluSer: 3.003 ± 0.436
4.15GluThr: 4.15 ± 0.432
5.624GluVal: 5.624 ± 0.604
1.911GluTrp: 1.911 ± 0.296
2.403GluTyr: 2.403 ± 0.399
0.0GluXaa: 0.0 ± 0.0
Phe
3.222PheAla: 3.222 ± 0.392
0.109PheCys: 0.109 ± 0.085
2.512PheAsp: 2.512 ± 0.366
2.239PheGlu: 2.239 ± 0.445
0.655PhePhe: 0.655 ± 0.171
3.604PheGly: 3.604 ± 0.397
0.601PheHis: 0.601 ± 0.185
1.42PheIle: 1.42 ± 0.286
1.147PheLys: 1.147 ± 0.278
1.365PheLeu: 1.365 ± 0.318
1.037PheMet: 1.037 ± 0.215
0.874PheAsn: 0.874 ± 0.244
1.037PhePro: 1.037 ± 0.235
1.037PheGln: 1.037 ± 0.213
2.403PheArg: 2.403 ± 0.363
1.693PheSer: 1.693 ± 0.477
2.02PheThr: 2.02 ± 0.311
3.058PheVal: 3.058 ± 0.449
0.437PheTrp: 0.437 ± 0.145
1.147PheTyr: 1.147 ± 0.268
0.0PheXaa: 0.0 ± 0.0
Gly
7.699GlyAla: 7.699 ± 0.831
0.273GlyCys: 0.273 ± 0.106
5.187GlyAsp: 5.187 ± 0.535
6.771GlyGlu: 6.771 ± 0.652
3.713GlyPhe: 3.713 ± 0.463
7.262GlyGly: 7.262 ± 0.908
1.147GlyHis: 1.147 ± 0.29
4.095GlyIle: 4.095 ± 0.608
3.167GlyLys: 3.167 ± 0.446
6.116GlyLeu: 6.116 ± 0.547
1.747GlyMet: 1.747 ± 0.311
2.839GlyAsn: 2.839 ± 0.341
3.877GlyPro: 3.877 ± 1.41
3.385GlyGln: 3.385 ± 0.42
5.897GlyArg: 5.897 ± 0.535
4.095GlySer: 4.095 ± 0.56
5.788GlyThr: 5.788 ± 0.583
6.607GlyVal: 6.607 ± 0.603
1.857GlyTrp: 1.857 ± 0.318
2.457GlyTyr: 2.457 ± 0.336
0.0GlyXaa: 0.0 ± 0.0
His
1.802HisAla: 1.802 ± 0.306
0.218HisCys: 0.218 ± 0.097
1.201HisAsp: 1.201 ± 0.289
1.42HisGlu: 1.42 ± 0.326
0.655HisPhe: 0.655 ± 0.175
1.857HisGly: 1.857 ± 0.415
0.273HisHis: 0.273 ± 0.119
0.546HisIle: 0.546 ± 0.153
0.491HisLys: 0.491 ± 0.141
1.966HisLeu: 1.966 ± 0.343
0.382HisMet: 0.382 ± 0.147
0.546HisAsn: 0.546 ± 0.176
0.983HisPro: 0.983 ± 0.218
0.437HisGln: 0.437 ± 0.173
1.365HisArg: 1.365 ± 0.28
0.983HisSer: 0.983 ± 0.282
0.764HisThr: 0.764 ± 0.241
1.201HisVal: 1.201 ± 0.253
0.218HisTrp: 0.218 ± 0.09
0.874HisTyr: 0.874 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.351IleAla: 5.351 ± 0.642
0.109IleCys: 0.109 ± 0.078
3.495IleAsp: 3.495 ± 0.459
5.788IleGlu: 5.788 ± 0.568
1.365IlePhe: 1.365 ± 0.258
3.058IleGly: 3.058 ± 0.373
0.874IleHis: 0.874 ± 0.252
2.02IleIle: 2.02 ± 0.339
1.911IleLys: 1.911 ± 0.369
2.894IleLeu: 2.894 ± 0.494
0.928IleMet: 0.928 ± 0.213
1.256IleAsn: 1.256 ± 0.262
2.566IlePro: 2.566 ± 0.44
1.583IleGln: 1.583 ± 0.273
3.549IleArg: 3.549 ± 0.406
3.003IleSer: 3.003 ± 0.366
3.986IleThr: 3.986 ± 0.532
3.385IleVal: 3.385 ± 0.441
0.655IleTrp: 0.655 ± 0.191
1.256IleTyr: 1.256 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
5.023LysAla: 5.023 ± 0.593
0.055LysCys: 0.055 ± 0.051
2.184LysAsp: 2.184 ± 0.369
2.457LysGlu: 2.457 ± 0.371
1.147LysPhe: 1.147 ± 0.329
3.713LysGly: 3.713 ± 0.595
0.764LysHis: 0.764 ± 0.228
1.911LysIle: 1.911 ± 0.274
2.403LysLys: 2.403 ± 0.424
3.331LysLeu: 3.331 ± 0.495
1.31LysMet: 1.31 ± 0.268
1.529LysAsn: 1.529 ± 0.319
2.184LysPro: 2.184 ± 0.455
1.201LysGln: 1.201 ± 0.209
4.204LysArg: 4.204 ± 0.487
2.239LysSer: 2.239 ± 0.374
2.13LysThr: 2.13 ± 0.356
3.112LysVal: 3.112 ± 0.44
0.546LysTrp: 0.546 ± 0.181
1.201LysTyr: 1.201 ± 0.331
0.0LysXaa: 0.0 ± 0.0
Leu
7.972LeuAla: 7.972 ± 0.669
0.382LeuCys: 0.382 ± 0.159
5.023LeuAsp: 5.023 ± 0.489
5.515LeuGlu: 5.515 ± 0.575
2.075LeuPhe: 2.075 ± 0.295
5.952LeuGly: 5.952 ± 0.58
1.583LeuHis: 1.583 ± 0.271
3.495LeuIle: 3.495 ± 0.473
2.403LeuLys: 2.403 ± 0.318
4.095LeuLeu: 4.095 ± 0.612
1.529LeuMet: 1.529 ± 0.287
3.276LeuAsn: 3.276 ± 0.49
3.658LeuPro: 3.658 ± 0.517
1.857LeuGln: 1.857 ± 0.346
6.116LeuArg: 6.116 ± 0.638
4.368LeuSer: 4.368 ± 0.512
4.368LeuThr: 4.368 ± 0.476
4.423LeuVal: 4.423 ± 0.438
1.529LeuTrp: 1.529 ± 0.275
1.693LeuTyr: 1.693 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
2.949MetAla: 2.949 ± 0.4
0.055MetCys: 0.055 ± 0.064
1.42MetAsp: 1.42 ± 0.289
1.42MetGlu: 1.42 ± 0.308
0.764MetPhe: 0.764 ± 0.224
1.42MetGly: 1.42 ± 0.25
0.764MetHis: 0.764 ± 0.197
1.037MetIle: 1.037 ± 0.315
1.092MetLys: 1.092 ± 0.256
0.983MetLeu: 0.983 ± 0.25
0.764MetMet: 0.764 ± 0.245
0.983MetAsn: 0.983 ± 0.27
0.71MetPro: 0.71 ± 0.211
0.71MetGln: 0.71 ± 0.185
1.42MetArg: 1.42 ± 0.284
2.512MetSer: 2.512 ± 0.394
2.403MetThr: 2.403 ± 0.313
1.31MetVal: 1.31 ± 0.324
0.328MetTrp: 0.328 ± 0.109
0.546MetTyr: 0.546 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
4.15AsnAla: 4.15 ± 0.589
0.055AsnCys: 0.055 ± 0.051
1.638AsnAsp: 1.638 ± 0.257
2.13AsnGlu: 2.13 ± 0.391
1.256AsnPhe: 1.256 ± 0.267
4.095AsnGly: 4.095 ± 0.55
0.764AsnHis: 0.764 ± 0.206
1.201AsnIle: 1.201 ± 0.272
1.037AsnLys: 1.037 ± 0.291
3.604AsnLeu: 3.604 ± 0.414
0.71AsnMet: 0.71 ± 0.182
1.747AsnAsn: 1.747 ± 0.302
2.676AsnPro: 2.676 ± 0.405
0.819AsnGln: 0.819 ± 0.213
2.239AsnArg: 2.239 ± 0.413
2.348AsnSer: 2.348 ± 0.396
1.365AsnThr: 1.365 ± 0.302
2.184AsnVal: 2.184 ± 0.376
0.382AsnTrp: 0.382 ± 0.126
1.802AsnTyr: 1.802 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
3.986ProAla: 3.986 ± 0.416
0.164ProCys: 0.164 ± 0.08
3.276ProAsp: 3.276 ± 0.624
3.931ProGlu: 3.931 ± 0.583
1.31ProPhe: 1.31 ± 0.279
4.805ProGly: 4.805 ± 0.559
1.037ProHis: 1.037 ± 0.253
2.348ProIle: 2.348 ± 0.363
2.839ProLys: 2.839 ± 0.452
2.839ProLeu: 2.839 ± 0.344
0.928ProMet: 0.928 ± 0.243
1.583ProAsn: 1.583 ± 0.281
1.966ProPro: 1.966 ± 0.35
0.983ProGln: 0.983 ± 0.427
2.839ProArg: 2.839 ± 0.42
3.003ProSer: 3.003 ± 0.377
3.058ProThr: 3.058 ± 0.433
3.877ProVal: 3.877 ± 0.424
1.092ProTrp: 1.092 ± 0.236
1.31ProTyr: 1.31 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
3.495GlnAla: 3.495 ± 0.472
0.273GlnCys: 0.273 ± 0.134
1.474GlnAsp: 1.474 ± 0.32
2.839GlnGlu: 2.839 ± 0.379
0.819GlnPhe: 0.819 ± 0.259
3.549GlnGly: 3.549 ± 1.054
0.328GlnHis: 0.328 ± 0.145
1.911GlnIle: 1.911 ± 0.369
1.638GlnLys: 1.638 ± 0.361
1.966GlnLeu: 1.966 ± 0.388
1.092GlnMet: 1.092 ± 0.227
0.928GlnAsn: 0.928 ± 0.236
1.092GlnPro: 1.092 ± 0.267
1.092GlnGln: 1.092 ± 0.3
2.13GlnArg: 2.13 ± 0.343
1.693GlnSer: 1.693 ± 0.444
1.256GlnThr: 1.256 ± 0.271
2.403GlnVal: 2.403 ± 0.362
0.437GlnTrp: 0.437 ± 0.14
1.037GlnTyr: 1.037 ± 0.19
0.0GlnXaa: 0.0 ± 0.0
Arg
7.972ArgAla: 7.972 ± 0.696
0.328ArgCys: 0.328 ± 0.124
4.368ArgAsp: 4.368 ± 0.479
5.843ArgGlu: 5.843 ± 0.681
2.02ArgPhe: 2.02 ± 0.313
5.843ArgGly: 5.843 ± 0.43
1.42ArgHis: 1.42 ± 0.285
4.15ArgIle: 4.15 ± 0.575
3.44ArgLys: 3.44 ± 0.537
5.733ArgLeu: 5.733 ± 0.634
1.529ArgMet: 1.529 ± 0.364
2.566ArgAsn: 2.566 ± 0.386
2.621ArgPro: 2.621 ± 0.32
2.02ArgGln: 2.02 ± 0.52
5.187ArgArg: 5.187 ± 0.558
4.15ArgSer: 4.15 ± 0.691
3.986ArgThr: 3.986 ± 0.528
5.897ArgVal: 5.897 ± 0.606
1.802ArgTrp: 1.802 ± 0.369
2.403ArgTyr: 2.403 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
6.116SerAla: 6.116 ± 0.719
0.0SerCys: 0.0 ± 0.0
4.368SerAsp: 4.368 ± 0.455
3.877SerGlu: 3.877 ± 0.459
1.802SerPhe: 1.802 ± 0.299
5.843SerGly: 5.843 ± 0.634
1.31SerHis: 1.31 ± 0.31
2.894SerIle: 2.894 ± 0.414
2.13SerLys: 2.13 ± 0.339
4.641SerLeu: 4.641 ± 0.543
1.092SerMet: 1.092 ± 0.29
2.457SerAsn: 2.457 ± 0.416
2.73SerPro: 2.73 ± 0.396
1.802SerGln: 1.802 ± 0.288
4.095SerArg: 4.095 ± 0.457
3.385SerSer: 3.385 ± 0.558
3.003SerThr: 3.003 ± 0.468
4.423SerVal: 4.423 ± 0.513
1.092SerTrp: 1.092 ± 0.297
2.184SerTyr: 2.184 ± 0.415
0.0SerXaa: 0.0 ± 0.0
Thr
4.532ThrAla: 4.532 ± 0.662
0.437ThrCys: 0.437 ± 0.239
3.112ThrAsp: 3.112 ± 0.416
3.658ThrGlu: 3.658 ± 0.429
2.785ThrPhe: 2.785 ± 0.326
6.225ThrGly: 6.225 ± 0.952
0.874ThrHis: 0.874 ± 0.194
2.457ThrIle: 2.457 ± 0.458
2.403ThrLys: 2.403 ± 0.414
4.095ThrLeu: 4.095 ± 0.47
0.71ThrMet: 0.71 ± 0.2
2.184ThrAsn: 2.184 ± 0.358
3.768ThrPro: 3.768 ± 0.518
1.638ThrGln: 1.638 ± 0.307
3.44ThrArg: 3.44 ± 0.465
3.167ThrSer: 3.167 ± 0.474
3.058ThrThr: 3.058 ± 0.388
5.133ThrVal: 5.133 ± 0.753
1.256ThrTrp: 1.256 ± 0.28
1.966ThrTyr: 1.966 ± 0.344
0.0ThrXaa: 0.0 ± 0.0
Val
6.935ValAla: 6.935 ± 0.631
0.437ValCys: 0.437 ± 0.186
4.423ValAsp: 4.423 ± 0.491
5.897ValGlu: 5.897 ± 0.555
2.02ValPhe: 2.02 ± 0.38
6.498ValGly: 6.498 ± 0.609
1.147ValHis: 1.147 ± 0.302
3.822ValIle: 3.822 ± 0.53
3.385ValLys: 3.385 ± 0.411
4.314ValLeu: 4.314 ± 0.527
1.31ValMet: 1.31 ± 0.313
2.949ValAsn: 2.949 ± 0.382
3.604ValPro: 3.604 ± 0.46
2.621ValGln: 2.621 ± 0.325
6.17ValArg: 6.17 ± 0.6
4.805ValSer: 4.805 ± 0.566
4.15ValThr: 4.15 ± 0.424
5.515ValVal: 5.515 ± 0.603
1.256ValTrp: 1.256 ± 0.206
2.293ValTyr: 2.293 ± 0.398
0.0ValXaa: 0.0 ± 0.0
Trp
1.638TrpAla: 1.638 ± 0.289
0.109TrpCys: 0.109 ± 0.081
2.02TrpAsp: 2.02 ± 0.362
1.474TrpGlu: 1.474 ± 0.344
0.491TrpPhe: 0.491 ± 0.15
1.583TrpGly: 1.583 ± 0.304
0.273TrpHis: 0.273 ± 0.115
0.983TrpIle: 0.983 ± 0.286
0.601TrpLys: 0.601 ± 0.229
0.819TrpLeu: 0.819 ± 0.218
0.874TrpMet: 0.874 ± 0.208
0.874TrpAsn: 0.874 ± 0.219
0.655TrpPro: 0.655 ± 0.173
0.601TrpGln: 0.601 ± 0.185
1.966TrpArg: 1.966 ± 0.338
1.365TrpSer: 1.365 ± 0.266
0.764TrpThr: 0.764 ± 0.236
1.529TrpVal: 1.529 ± 0.247
0.437TrpTrp: 0.437 ± 0.138
0.71TrpTyr: 0.71 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.658TyrAla: 3.658 ± 0.432
0.164TyrCys: 0.164 ± 0.09
2.457TyrAsp: 2.457 ± 0.451
2.457TyrGlu: 2.457 ± 0.426
0.764TyrPhe: 0.764 ± 0.192
2.184TyrGly: 2.184 ± 0.355
0.437TyrHis: 0.437 ± 0.144
0.874TyrIle: 0.874 ± 0.227
0.983TyrLys: 0.983 ± 0.233
2.293TyrLeu: 2.293 ± 0.422
0.437TyrMet: 0.437 ± 0.155
0.928TyrAsn: 0.928 ± 0.209
1.747TyrPro: 1.747 ± 0.297
1.31TyrGln: 1.31 ± 0.292
2.02TyrArg: 2.02 ± 0.361
2.075TyrSer: 2.075 ± 0.421
1.802TyrThr: 1.802 ± 0.316
2.457TyrVal: 2.457 ± 0.343
0.601TyrTrp: 0.601 ± 0.17
0.819TyrTyr: 0.819 ± 0.252
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (18315 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski