Amino acid dipeptide frequency for Microbacterium phage Johann

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
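As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes, the overlapping two-residue window, and the normalisation by the total number of windows are all assumptions made for this example, and the input sequences are made up.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: counts overlapping residue pairs inside each protein
    and normalises by the total number of pairs (an assumption; the database
    may normalise differently).
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not the phage Johann proteome).
freqs = dipeptide_per_mille(["MAAK", "AAG"])
```

In this toy input there are five windows (MA, AA, AK, AA, AG), so `freqs["AA"]` is 2 out of 5 pairs expressed in per mille.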

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
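The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The function names and the constant are hypothetical, the baseline is the 1/400 figure quoted above, and the comparison ignores the reported error margins, so it may not match the page's colour marking exactly.

```python
# Uniform baseline from the text: 20 x 20 standard dipeptides, equal probability.
EXPECTED_UNIFORM = 1.0 / 400  # 0.0025, i.e. 0.25% or 2.5 per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a table value given in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def compare_to_uniform(value_per_mille):
    """Classify a table value as 'over', 'under', or 'equal' versus 1/400."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > EXPECTED_UNIFORM:
        return "over"
    if fraction < EXPECTED_UNIFORM:
        return "under"
    return "equal"


# Two values taken from the table on this page.
ala_ala = per_mille_to_fraction(11.119)  # AlaAla
cys_cys = per_mille_to_fraction(0.147)   # CysCys
```

With these inputs, AlaAla (11.119‰) lies above the 2.5‰ baseline and CysCys (0.147‰) lies below it.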

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.119 ± 1.31
AlaCys: 0.81 ± 0.309
AlaAsp: 5.67 ± 0.586
AlaGlu: 7.879 ± 0.797
AlaPhe: 2.651 ± 0.412
AlaGly: 9.941 ± 1.028
AlaHis: 2.135 ± 0.373
AlaIle: 4.86 ± 0.807
AlaLys: 6.996 ± 0.805
AlaLeu: 8.321 ± 0.707
AlaMet: 3.019 ± 0.471
AlaAsn: 3.682 ± 0.514
AlaPro: 6.406 ± 0.924
AlaGln: 4.639 ± 0.6
AlaArg: 6.922 ± 1.033
AlaSer: 5.67 ± 0.637
AlaThr: 6.259 ± 0.925
AlaVal: 7.585 ± 0.903
AlaTrp: 2.577 ± 0.492
AlaTyr: 3.093 ± 0.46
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.295 ± 0.161
CysCys: 0.147 ± 0.095
CysAsp: 0.368 ± 0.21
CysGlu: 0.147 ± 0.093
CysPhe: 0.0 ± 0.0
CysGly: 0.81 ± 0.36
CysHis: 0.221 ± 0.135
CysIle: 0.295 ± 0.152
CysLys: 0.147 ± 0.115
CysLeu: 0.884 ± 0.261
CysMet: 0.074 ± 0.07
CysAsn: 0.074 ± 0.072
CysPro: 0.295 ± 0.146
CysGln: 0.0 ± 0.0
CysArg: 0.368 ± 0.164
CysSer: 0.221 ± 0.151
CysThr: 0.589 ± 0.27
CysVal: 0.442 ± 0.215
CysTrp: 0.074 ± 0.079
CysTyr: 0.147 ± 0.102
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.585 ± 0.744
AspCys: 0.221 ± 0.108
AspAsp: 4.271 ± 0.583
AspGlu: 4.418 ± 0.611
AspPhe: 2.209 ± 0.302
AspGly: 5.081 ± 0.726
AspHis: 0.663 ± 0.186
AspIle: 2.356 ± 0.491
AspLys: 2.209 ± 0.4
AspLeu: 5.67 ± 0.903
AspMet: 1.399 ± 0.258
AspAsn: 1.546 ± 0.328
AspPro: 4.934 ± 0.669
AspGln: 3.682 ± 0.603
AspArg: 3.166 ± 0.519
AspSer: 3.387 ± 0.391
AspThr: 4.197 ± 0.573
AspVal: 3.535 ± 0.519
AspTrp: 1.325 ± 0.265
AspTyr: 1.988 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.542 ± 0.725
GluCys: 0.147 ± 0.097
GluAsp: 4.271 ± 0.521
GluGlu: 4.271 ± 0.526
GluPhe: 2.798 ± 0.623
GluGly: 6.259 ± 0.643
GluHis: 1.546 ± 0.41
GluIle: 3.093 ± 0.413
GluLys: 2.43 ± 0.445
GluLeu: 5.007 ± 0.589
GluMet: 1.399 ± 0.311
GluAsn: 1.399 ± 0.405
GluPro: 3.535 ± 0.555
GluGln: 2.651 ± 0.503
GluArg: 4.124 ± 0.696
GluSer: 3.166 ± 0.603
GluThr: 4.197 ± 0.527
GluVal: 4.639 ± 0.587
GluTrp: 2.135 ± 0.373
GluTyr: 1.399 ± 0.289
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.946 ± 0.331
PheCys: 0.074 ± 0.064
PheAsp: 2.43 ± 0.473
PheGlu: 2.135 ± 0.331
PhePhe: 0.515 ± 0.217
PheGly: 3.24 ± 0.513
PheHis: 0.368 ± 0.144
PheIle: 1.546 ± 0.525
PheLys: 1.105 ± 0.33
PheLeu: 1.915 ± 0.374
PheMet: 0.295 ± 0.141
PheAsn: 0.81 ± 0.248
PhePro: 1.399 ± 0.334
PheGln: 1.105 ± 0.383
PheArg: 1.767 ± 0.363
PheSer: 1.399 ± 0.346
PheThr: 1.988 ± 0.398
PheVal: 1.325 ± 0.361
PheTrp: 0.589 ± 0.251
PheTyr: 0.515 ± 0.207
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.057 ± 0.989
GlyCys: 0.663 ± 0.248
GlyAsp: 4.713 ± 0.739
GlyGlu: 5.449 ± 0.582
GlyPhe: 3.093 ± 0.774
GlyGly: 7.364 ± 0.784
GlyHis: 1.694 ± 0.479
GlyIle: 4.86 ± 0.54
GlyLys: 4.492 ± 0.689
GlyLeu: 7.216 ± 0.938
GlyMet: 1.252 ± 0.273
GlyAsn: 3.093 ± 0.497
GlyPro: 3.756 ± 0.686
GlyGln: 4.713 ± 0.552
GlyArg: 4.713 ± 0.617
GlySer: 4.639 ± 0.654
GlyThr: 5.523 ± 0.645
GlyVal: 7.216 ± 0.601
GlyTrp: 2.504 ± 0.427
GlyTyr: 2.651 ± 0.458
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.031 ± 0.248
HisCys: 0.074 ± 0.072
HisAsp: 1.546 ± 0.337
HisGlu: 1.105 ± 0.28
HisPhe: 0.442 ± 0.178
HisGly: 1.915 ± 0.415
HisHis: 0.147 ± 0.112
HisIle: 0.442 ± 0.193
HisLys: 0.663 ± 0.229
HisLeu: 1.325 ± 0.307
HisMet: 0.221 ± 0.107
HisAsn: 0.81 ± 0.204
HisPro: 1.988 ± 0.339
HisGln: 0.368 ± 0.15
HisArg: 1.325 ± 0.305
HisSer: 0.515 ± 0.184
HisThr: 0.589 ± 0.249
HisVal: 1.473 ± 0.373
HisTrp: 0.368 ± 0.168
HisTyr: 1.105 ± 0.293
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.007 ± 0.523
IleCys: 0.295 ± 0.171
IleAsp: 3.535 ± 0.581
IleGlu: 3.093 ± 0.499
IlePhe: 0.81 ± 0.244
IleGly: 4.124 ± 0.766
IleHis: 0.884 ± 0.235
IleIle: 1.841 ± 0.532
IleLys: 2.062 ± 0.418
IleLeu: 3.093 ± 0.525
IleMet: 0.663 ± 0.279
IleAsn: 0.957 ± 0.251
IlePro: 2.872 ± 0.482
IleGln: 2.504 ± 0.503
IleArg: 2.798 ± 0.437
IleSer: 1.546 ± 0.417
IleThr: 3.608 ± 0.535
IleVal: 2.356 ± 0.401
IleTrp: 1.399 ± 0.394
IleTyr: 0.884 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.965 ± 1.089
LysCys: 0.295 ± 0.135
LysAsp: 2.43 ± 0.401
LysGlu: 2.135 ± 0.368
LysPhe: 1.399 ± 0.338
LysGly: 3.608 ± 0.54
LysHis: 0.736 ± 0.245
LysIle: 3.093 ± 0.533
LysLys: 2.43 ± 0.503
LysLeu: 4.418 ± 0.489
LysMet: 1.105 ± 0.29
LysAsn: 1.694 ± 0.325
LysPro: 2.725 ± 0.604
LysGln: 0.663 ± 0.226
LysArg: 2.577 ± 0.514
LysSer: 2.651 ± 0.402
LysThr: 2.504 ± 0.502
LysVal: 3.829 ± 0.542
LysTrp: 0.81 ± 0.188
LysTyr: 1.105 ± 0.446
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.027 ± 0.663
LeuCys: 0.515 ± 0.241
LeuAsp: 5.965 ± 0.678
LeuGlu: 4.786 ± 0.484
LeuPhe: 2.577 ± 0.385
LeuGly: 6.038 ± 0.677
LeuHis: 0.957 ± 0.372
LeuIle: 4.124 ± 0.446
LeuLys: 2.798 ± 0.455
LeuLeu: 5.67 ± 0.687
LeuMet: 1.62 ± 0.467
LeuAsn: 2.356 ± 0.494
LeuPro: 3.093 ± 0.464
LeuGln: 2.43 ± 0.447
LeuArg: 5.155 ± 0.597
LeuSer: 4.271 ± 0.565
LeuThr: 5.817 ± 0.603
LeuVal: 5.596 ± 0.564
LeuTrp: 1.546 ± 0.653
LeuTyr: 2.135 ± 0.269
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.43 ± 0.455
MetCys: 0.074 ± 0.07
MetAsp: 1.767 ± 0.317
MetGlu: 1.178 ± 0.289
MetPhe: 0.0 ± 0.0
MetGly: 0.884 ± 0.322
MetHis: 0.368 ± 0.135
MetIle: 0.957 ± 0.204
MetLys: 0.589 ± 0.276
MetLeu: 1.694 ± 0.348
MetMet: 0.368 ± 0.178
MetAsn: 0.884 ± 0.26
MetPro: 1.841 ± 0.38
MetGln: 0.515 ± 0.216
MetArg: 1.841 ± 0.398
MetSer: 1.546 ± 0.325
MetThr: 1.62 ± 0.289
MetVal: 1.767 ± 0.378
MetTrp: 0.147 ± 0.099
MetTyr: 0.221 ± 0.185
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.345 ± 0.577
AsnCys: 0.295 ± 0.125
AsnAsp: 1.252 ± 0.273
AsnGlu: 2.725 ± 0.384
AsnPhe: 0.736 ± 0.199
AsnGly: 3.976 ± 0.746
AsnHis: 0.368 ± 0.229
AsnIle: 0.589 ± 0.304
AsnLys: 1.252 ± 0.333
AsnLeu: 3.019 ± 0.363
AsnMet: 0.147 ± 0.096
AsnAsn: 0.663 ± 0.223
AsnPro: 1.988 ± 0.251
AsnGln: 1.546 ± 0.334
AsnArg: 2.283 ± 0.461
AsnSer: 1.62 ± 0.383
AsnThr: 1.694 ± 0.416
AsnVal: 1.62 ± 0.322
AsnTrp: 0.957 ± 0.315
AsnTyr: 0.884 ± 0.35
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.259 ± 1.169
ProCys: 0.221 ± 0.145
ProAsp: 3.682 ± 0.546
ProGlu: 5.376 ± 1.089
ProPhe: 1.325 ± 0.277
ProGly: 6.112 ± 0.828
ProHis: 0.736 ± 0.212
ProIle: 2.577 ± 0.507
ProLys: 3.093 ± 0.799
ProLeu: 2.577 ± 0.582
ProMet: 1.252 ± 0.282
ProAsn: 2.43 ± 0.377
ProPro: 3.314 ± 0.64
ProGln: 1.841 ± 0.361
ProArg: 2.504 ± 0.374
ProSer: 3.387 ± 0.541
ProThr: 4.124 ± 0.625
ProVal: 3.387 ± 0.431
ProTrp: 0.884 ± 0.315
ProTyr: 1.325 ± 0.262
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.965 ± 0.72
GlnCys: 0.368 ± 0.191
GlnAsp: 2.062 ± 0.415
GlnGlu: 3.461 ± 0.566
GlnPhe: 1.178 ± 0.232
GlnGly: 3.829 ± 0.422
GlnHis: 0.884 ± 0.248
GlnIle: 1.841 ± 0.335
GlnLys: 1.841 ± 0.443
GlnLeu: 3.019 ± 0.422
GlnMet: 0.589 ± 0.193
GlnAsn: 1.62 ± 0.366
GlnPro: 1.252 ± 0.213
GlnGln: 1.325 ± 0.328
GlnArg: 1.62 ± 0.339
GlnSer: 2.209 ± 0.31
GlnThr: 2.504 ± 0.424
GlnVal: 2.946 ± 0.47
GlnTrp: 1.325 ± 0.314
GlnTyr: 0.884 ± 0.204
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.817 ± 0.872
ArgCys: 0.589 ± 0.234
ArgAsp: 3.461 ± 0.43
ArgGlu: 4.639 ± 0.577
ArgPhe: 1.399 ± 0.321
ArgGly: 3.535 ± 0.488
ArgHis: 1.767 ± 0.345
ArgIle: 2.283 ± 0.439
ArgLys: 3.535 ± 0.569
ArgLeu: 3.829 ± 0.688
ArgMet: 1.473 ± 0.289
ArgAsn: 1.841 ± 0.414
ArgPro: 3.093 ± 0.474
ArgGln: 3.019 ± 0.528
ArgArg: 3.756 ± 0.445
ArgSer: 3.093 ± 0.534
ArgThr: 2.209 ± 0.344
ArgVal: 5.523 ± 0.651
ArgTrp: 1.767 ± 0.331
ArgTyr: 2.135 ± 0.326
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.228 ± 0.631
SerCys: 0.147 ± 0.093
SerAsp: 3.387 ± 0.53
SerGlu: 2.872 ± 0.57
SerPhe: 1.473 ± 0.294
SerGly: 6.775 ± 0.679
SerHis: 0.589 ± 0.201
SerIle: 1.767 ± 0.367
SerLys: 2.283 ± 0.335
SerLeu: 3.24 ± 0.487
SerMet: 1.62 ± 0.366
SerAsn: 1.325 ± 0.285
SerPro: 3.314 ± 0.606
SerGln: 1.546 ± 0.244
SerArg: 2.798 ± 0.544
SerSer: 2.872 ± 0.588
SerThr: 3.756 ± 0.539
SerVal: 4.713 ± 0.693
SerTrp: 2.062 ± 0.582
SerTyr: 1.325 ± 0.321
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.069 ± 0.847
ThrCys: 0.295 ± 0.132
ThrAsp: 4.271 ± 0.58
ThrGlu: 3.976 ± 0.534
ThrPhe: 1.694 ± 0.354
ThrGly: 5.596 ± 0.67
ThrHis: 0.81 ± 0.241
ThrIle: 2.43 ± 0.517
ThrLys: 2.725 ± 0.484
ThrLeu: 5.007 ± 0.648
ThrMet: 1.473 ± 0.274
ThrAsn: 2.135 ± 0.437
ThrPro: 4.86 ± 0.837
ThrGln: 2.651 ± 0.429
ThrArg: 4.197 ± 0.685
ThrSer: 3.535 ± 0.576
ThrThr: 4.566 ± 0.767
ThrVal: 5.081 ± 0.532
ThrTrp: 1.767 ± 0.37
ThrTyr: 1.325 ± 0.355
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.879 ± 0.795
ValCys: 0.221 ± 0.161
ValAsp: 5.155 ± 0.583
ValGlu: 4.566 ± 0.582
ValPhe: 1.399 ± 0.338
ValGly: 5.817 ± 0.845
ValHis: 1.546 ± 0.336
ValIle: 3.019 ± 0.436
ValLys: 3.535 ± 0.434
ValLeu: 5.67 ± 0.747
ValMet: 1.62 ± 0.307
ValAsn: 2.283 ± 0.338
ValPro: 3.756 ± 0.6
ValGln: 3.24 ± 0.579
ValArg: 3.682 ± 0.497
ValSer: 4.786 ± 0.71
ValThr: 5.155 ± 0.615
ValVal: 6.922 ± 0.831
ValTrp: 2.135 ± 0.632
ValTyr: 1.62 ± 0.298
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.093 ± 0.665
TrpCys: 0.074 ± 0.079
TrpAsp: 2.135 ± 0.566
TrpGlu: 1.178 ± 0.251
TrpPhe: 0.884 ± 0.256
TrpGly: 1.694 ± 0.382
TrpHis: 0.515 ± 0.171
TrpIle: 1.694 ± 0.646
TrpLys: 0.957 ± 0.239
TrpLeu: 1.62 ± 0.337
TrpMet: 0.515 ± 0.218
TrpAsn: 1.473 ± 0.752
TrpPro: 0.663 ± 0.222
TrpGln: 0.736 ± 0.238
TrpArg: 1.694 ± 0.426
TrpSer: 1.399 ± 0.432
TrpThr: 2.283 ± 0.409
TrpVal: 1.546 ± 0.338
TrpTrp: 0.663 ± 0.274
TrpTyr: 0.957 ± 0.268
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.577 ± 0.49
TyrCys: 0.074 ± 0.071
TyrAsp: 1.473 ± 0.305
TyrGlu: 1.62 ± 0.392
TyrPhe: 0.884 ± 0.193
TyrGly: 2.062 ± 0.387
TyrHis: 0.515 ± 0.207
TyrIle: 0.663 ± 0.245
TyrLys: 1.031 ± 0.327
TyrLeu: 2.283 ± 0.324
TyrMet: 0.515 ± 0.192
TyrAsn: 0.884 ± 0.204
TyrPro: 1.325 ± 0.293
TyrGln: 1.546 ± 0.336
TyrArg: 1.546 ± 0.32
TyrSer: 1.178 ± 0.287
TyrThr: 2.283 ± 0.42
TyrVal: 2.504 ± 0.442
TyrTrp: 0.736 ± 0.159
TyrTyr: 1.178 ± 0.343
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13581 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
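A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: the function names, the fixed seed, the use of the sample standard deviation as the error, and the toy sequences are all assumptions. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is reported.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all two-residue windows."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, not the phage Johann proteome).
toy = ["MAAK", "AAG", "MKGA", "GAAA"]
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins instead of individual residues keeps each protein's internal composition intact, so the estimated error reflects variation between proteins.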

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski