Amino acid dipepetide frequency for Methylophilaceae phage P19250A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.087AlaAla: 5.087 ± 1.089
0.574AlaCys: 0.574 ± 0.205
4.349AlaAsp: 4.349 ± 0.616
3.282AlaGlu: 3.282 ± 0.627
2.379AlaPhe: 2.379 ± 0.599
5.825AlaGly: 5.825 ± 1.5
0.328AlaHis: 0.328 ± 0.16
6.236AlaIle: 6.236 ± 0.715
5.169AlaLys: 5.169 ± 0.986
6.564AlaLeu: 6.564 ± 0.787
1.805AlaMet: 1.805 ± 0.352
5.989AlaAsn: 5.989 ± 0.687
2.297AlaPro: 2.297 ± 0.388
3.856AlaGln: 3.856 ± 0.457
3.2AlaArg: 3.2 ± 0.641
6.81AlaSer: 6.81 ± 1.415
6.154AlaThr: 6.154 ± 0.906
3.938AlaVal: 3.938 ± 0.497
1.559AlaTrp: 1.559 ± 0.304
3.528AlaTyr: 3.528 ± 0.49
0.0AlaXaa: 0.0 ± 0.0
Cys
0.738CysAla: 0.738 ± 0.271
0.164CysCys: 0.164 ± 0.098
0.492CysAsp: 0.492 ± 0.252
0.492CysGlu: 0.492 ± 0.203
0.41CysPhe: 0.41 ± 0.146
0.574CysGly: 0.574 ± 0.225
0.082CysHis: 0.082 ± 0.082
0.492CysIle: 0.492 ± 0.229
0.492CysLys: 0.492 ± 0.213
0.492CysLeu: 0.492 ± 0.203
0.082CysMet: 0.082 ± 0.084
0.738CysAsn: 0.738 ± 0.246
0.492CysPro: 0.492 ± 0.242
0.246CysGln: 0.246 ± 0.128
0.246CysArg: 0.246 ± 0.126
0.82CysSer: 0.82 ± 0.366
0.328CysThr: 0.328 ± 0.19
0.574CysVal: 0.574 ± 0.165
0.0CysTrp: 0.0 ± 0.0
0.246CysTyr: 0.246 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
4.102AspAla: 4.102 ± 0.658
0.492AspCys: 0.492 ± 0.222
2.215AspAsp: 2.215 ± 0.519
3.364AspGlu: 3.364 ± 0.87
2.133AspPhe: 2.133 ± 0.446
3.528AspGly: 3.528 ± 0.465
0.574AspHis: 0.574 ± 0.22
3.61AspIle: 3.61 ± 0.616
4.349AspLys: 4.349 ± 0.846
3.282AspLeu: 3.282 ± 0.472
1.477AspMet: 1.477 ± 0.376
3.528AspAsn: 3.528 ± 0.542
2.379AspPro: 2.379 ± 0.493
2.708AspGln: 2.708 ± 0.591
2.543AspArg: 2.543 ± 0.522
2.954AspSer: 2.954 ± 0.642
2.79AspThr: 2.79 ± 0.406
2.708AspVal: 2.708 ± 0.47
0.738AspTrp: 0.738 ± 0.182
1.723AspTyr: 1.723 ± 0.454
0.0AspXaa: 0.0 ± 0.0
Glu
5.251GluAla: 5.251 ± 0.975
0.246GluCys: 0.246 ± 0.185
2.708GluAsp: 2.708 ± 0.644
2.461GluGlu: 2.461 ± 0.582
2.708GluPhe: 2.708 ± 0.514
2.133GluGly: 2.133 ± 0.513
0.82GluHis: 0.82 ± 0.279
4.595GluIle: 4.595 ± 0.536
3.774GluLys: 3.774 ± 0.798
3.61GluLeu: 3.61 ± 0.709
1.231GluMet: 1.231 ± 0.379
2.215GluAsn: 2.215 ± 0.401
1.559GluPro: 1.559 ± 0.316
2.79GluGln: 2.79 ± 0.574
1.395GluArg: 1.395 ± 0.391
3.282GluSer: 3.282 ± 0.687
2.297GluThr: 2.297 ± 0.426
3.364GluVal: 3.364 ± 0.638
1.149GluTrp: 1.149 ± 0.324
2.379GluTyr: 2.379 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
3.364PheAla: 3.364 ± 0.666
0.246PheCys: 0.246 ± 0.138
2.708PheAsp: 2.708 ± 0.553
2.79PheGlu: 2.79 ± 0.534
1.887PhePhe: 1.887 ± 0.516
2.872PheGly: 2.872 ± 0.448
0.492PheHis: 0.492 ± 0.182
1.641PheIle: 1.641 ± 0.399
2.133PheLys: 2.133 ± 0.564
3.036PheLeu: 3.036 ± 0.441
1.149PheMet: 1.149 ± 0.293
3.692PheAsn: 3.692 ± 0.519
1.231PhePro: 1.231 ± 0.297
1.559PheGln: 1.559 ± 0.359
1.231PheArg: 1.231 ± 0.402
5.579PheSer: 5.579 ± 2.403
2.954PheThr: 2.954 ± 0.675
2.379PheVal: 2.379 ± 0.528
0.656PheTrp: 0.656 ± 0.273
1.313PheTyr: 1.313 ± 0.332
0.0PheXaa: 0.0 ± 0.0
Gly
8.533GlyAla: 8.533 ± 2.375
0.656GlyCys: 0.656 ± 0.27
4.184GlyAsp: 4.184 ± 0.912
4.266GlyGlu: 4.266 ± 1.624
6.236GlyPhe: 6.236 ± 2.712
8.533GlyGly: 8.533 ± 4.244
1.149GlyHis: 1.149 ± 0.469
7.877GlyIle: 7.877 ± 2.8
2.872GlyLys: 2.872 ± 0.683
6.154GlyLeu: 6.154 ± 0.845
2.051GlyMet: 2.051 ± 0.529
3.774GlyAsn: 3.774 ± 0.729
1.395GlyPro: 1.395 ± 0.565
3.2GlyGln: 3.2 ± 0.478
2.297GlyArg: 2.297 ± 0.45
5.661GlySer: 5.661 ± 1.081
6.974GlyThr: 6.974 ± 1.796
3.938GlyVal: 3.938 ± 0.476
2.379GlyTrp: 2.379 ± 0.754
9.107GlyTyr: 9.107 ± 5.054
0.0GlyXaa: 0.0 ± 0.0
His
0.492HisAla: 0.492 ± 0.171
0.41HisCys: 0.41 ± 0.19
0.574HisAsp: 0.574 ± 0.238
0.82HisGlu: 0.82 ± 0.294
0.492HisPhe: 0.492 ± 0.239
0.985HisGly: 0.985 ± 0.34
0.574HisHis: 0.574 ± 0.262
1.641HisIle: 1.641 ± 0.436
0.738HisLys: 0.738 ± 0.216
1.067HisLeu: 1.067 ± 0.266
0.0HisMet: 0.0 ± 0.0
0.492HisAsn: 0.492 ± 0.188
0.656HisPro: 0.656 ± 0.263
0.328HisGln: 0.328 ± 0.196
0.574HisArg: 0.574 ± 0.214
1.067HisSer: 1.067 ± 0.416
0.985HisThr: 0.985 ± 0.271
0.82HisVal: 0.82 ± 0.272
0.41HisTrp: 0.41 ± 0.21
0.492HisTyr: 0.492 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
4.759IleAla: 4.759 ± 0.617
0.492IleCys: 0.492 ± 0.193
4.266IleAsp: 4.266 ± 0.568
4.102IleGlu: 4.102 ± 0.761
1.969IlePhe: 1.969 ± 0.411
4.102IleGly: 4.102 ± 0.644
0.985IleHis: 0.985 ± 0.308
4.349IleIle: 4.349 ± 0.595
4.595IleLys: 4.595 ± 0.916
3.938IleLeu: 3.938 ± 0.632
1.641IleMet: 1.641 ± 0.37
5.661IleAsn: 5.661 ± 0.717
3.528IlePro: 3.528 ± 0.543
3.282IleGln: 3.282 ± 0.533
2.543IleArg: 2.543 ± 0.673
7.548IleSer: 7.548 ± 2.242
5.743IleThr: 5.743 ± 1.019
4.349IleVal: 4.349 ± 0.722
0.41IleTrp: 0.41 ± 0.178
2.133IleTyr: 2.133 ± 0.504
0.0IleXaa: 0.0 ± 0.0
Lys
4.266LysAla: 4.266 ± 0.856
0.492LysCys: 0.492 ± 0.177
3.2LysAsp: 3.2 ± 0.748
4.184LysGlu: 4.184 ± 1.013
2.79LysPhe: 2.79 ± 0.508
2.79LysGly: 2.79 ± 0.608
1.067LysHis: 1.067 ± 0.44
3.692LysIle: 3.692 ± 0.654
3.692LysLys: 3.692 ± 0.77
5.333LysLeu: 5.333 ± 0.81
1.969LysMet: 1.969 ± 0.465
2.872LysAsn: 2.872 ± 0.744
2.051LysPro: 2.051 ± 0.42
2.626LysGln: 2.626 ± 0.576
2.543LysArg: 2.543 ± 0.52
3.692LysSer: 3.692 ± 0.738
2.872LysThr: 2.872 ± 0.603
2.461LysVal: 2.461 ± 0.536
0.492LysTrp: 0.492 ± 0.196
2.215LysTyr: 2.215 ± 0.49
0.0LysXaa: 0.0 ± 0.0
Leu
5.497LeuAla: 5.497 ± 0.607
1.231LeuCys: 1.231 ± 0.346
4.349LeuAsp: 4.349 ± 0.701
3.774LeuGlu: 3.774 ± 0.725
3.036LeuPhe: 3.036 ± 0.508
5.087LeuGly: 5.087 ± 0.911
1.395LeuHis: 1.395 ± 0.381
4.677LeuIle: 4.677 ± 0.941
4.266LeuLys: 4.266 ± 0.849
5.087LeuLeu: 5.087 ± 0.874
1.723LeuMet: 1.723 ± 0.386
4.759LeuAsn: 4.759 ± 1.038
3.2LeuPro: 3.2 ± 0.443
3.446LeuGln: 3.446 ± 0.616
2.133LeuArg: 2.133 ± 0.515
5.825LeuSer: 5.825 ± 0.704
6.974LeuThr: 6.974 ± 1.08
3.2LeuVal: 3.2 ± 0.472
0.903LeuTrp: 0.903 ± 0.255
3.528LeuTyr: 3.528 ± 0.628
0.0LeuXaa: 0.0 ± 0.0
Met
3.2MetAla: 3.2 ± 0.533
0.246MetCys: 0.246 ± 0.14
1.313MetAsp: 1.313 ± 0.393
1.231MetGlu: 1.231 ± 0.399
0.574MetPhe: 0.574 ± 0.192
0.82MetGly: 0.82 ± 0.209
0.328MetHis: 0.328 ± 0.202
1.313MetIle: 1.313 ± 0.29
1.559MetLys: 1.559 ± 0.36
1.887MetLeu: 1.887 ± 0.584
0.574MetMet: 0.574 ± 0.216
1.313MetAsn: 1.313 ± 0.309
1.395MetPro: 1.395 ± 0.314
0.574MetGln: 0.574 ± 0.25
1.231MetArg: 1.231 ± 0.341
2.297MetSer: 2.297 ± 0.42
1.559MetThr: 1.559 ± 0.328
1.149MetVal: 1.149 ± 0.298
0.164MetTrp: 0.164 ± 0.137
0.492MetTyr: 0.492 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
4.841AsnAla: 4.841 ± 0.747
0.574AsnCys: 0.574 ± 0.206
2.543AsnAsp: 2.543 ± 0.451
2.708AsnGlu: 2.708 ± 0.49
2.79AsnPhe: 2.79 ± 0.615
6.892AsnGly: 6.892 ± 1.115
0.903AsnHis: 0.903 ± 0.271
4.431AsnIle: 4.431 ± 0.737
3.446AsnLys: 3.446 ± 0.516
5.087AsnLeu: 5.087 ± 0.741
1.067AsnMet: 1.067 ± 0.274
4.266AsnAsn: 4.266 ± 0.798
3.938AsnPro: 3.938 ± 0.645
2.708AsnGln: 2.708 ± 0.436
2.133AsnArg: 2.133 ± 0.36
3.364AsnSer: 3.364 ± 0.609
4.923AsnThr: 4.923 ± 1.154
3.364AsnVal: 3.364 ± 0.425
0.903AsnTrp: 0.903 ± 0.333
2.708AsnTyr: 2.708 ± 0.504
0.0AsnXaa: 0.0 ± 0.0
Pro
2.79ProAla: 2.79 ± 0.498
0.082ProCys: 0.082 ± 0.088
2.215ProAsp: 2.215 ± 0.522
1.969ProGlu: 1.969 ± 0.431
1.477ProPhe: 1.477 ± 0.388
1.805ProGly: 1.805 ± 0.435
0.738ProHis: 0.738 ± 0.23
3.036ProIle: 3.036 ± 0.588
1.477ProLys: 1.477 ± 0.438
2.954ProLeu: 2.954 ± 0.578
0.82ProMet: 0.82 ± 0.264
3.528ProAsn: 3.528 ± 0.501
0.985ProPro: 0.985 ± 0.298
1.313ProGln: 1.313 ± 0.311
0.656ProArg: 0.656 ± 0.251
3.118ProSer: 3.118 ± 0.599
3.61ProThr: 3.61 ± 0.644
2.215ProVal: 2.215 ± 0.461
0.164ProTrp: 0.164 ± 0.097
1.477ProTyr: 1.477 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
3.2GlnAla: 3.2 ± 0.725
0.492GlnCys: 0.492 ± 0.201
2.626GlnAsp: 2.626 ± 0.524
1.805GlnGlu: 1.805 ± 0.381
2.133GlnPhe: 2.133 ± 0.407
3.528GlnGly: 3.528 ± 0.796
0.738GlnHis: 0.738 ± 0.261
3.036GlnIle: 3.036 ± 0.499
2.297GlnLys: 2.297 ± 0.407
3.774GlnLeu: 3.774 ± 0.661
0.985GlnMet: 0.985 ± 0.286
1.887GlnAsn: 1.887 ± 0.463
1.067GlnPro: 1.067 ± 0.289
1.887GlnGln: 1.887 ± 0.409
1.231GlnArg: 1.231 ± 0.318
2.79GlnSer: 2.79 ± 0.528
2.79GlnThr: 2.79 ± 0.588
1.805GlnVal: 1.805 ± 0.34
0.903GlnTrp: 0.903 ± 0.39
2.215GlnTyr: 2.215 ± 0.492
0.0GlnXaa: 0.0 ± 0.0
Arg
2.215ArgAla: 2.215 ± 0.443
0.246ArgCys: 0.246 ± 0.111
1.477ArgAsp: 1.477 ± 0.408
2.379ArgGlu: 2.379 ± 0.595
1.067ArgPhe: 1.067 ± 0.262
1.477ArgGly: 1.477 ± 0.469
0.82ArgHis: 0.82 ± 0.338
2.543ArgIle: 2.543 ± 0.494
2.708ArgLys: 2.708 ± 0.552
2.708ArgLeu: 2.708 ± 0.526
1.149ArgMet: 1.149 ± 0.311
2.297ArgAsn: 2.297 ± 0.426
1.067ArgPro: 1.067 ± 0.289
1.067ArgGln: 1.067 ± 0.328
0.738ArgArg: 0.738 ± 0.244
2.133ArgSer: 2.133 ± 0.497
1.641ArgThr: 1.641 ± 0.3
2.051ArgVal: 2.051 ± 0.433
0.328ArgTrp: 0.328 ± 0.218
1.805ArgTyr: 1.805 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
5.907SerAla: 5.907 ± 1.097
0.328SerCys: 0.328 ± 0.179
2.379SerAsp: 2.379 ± 0.468
2.626SerGlu: 2.626 ± 0.39
3.036SerPhe: 3.036 ± 0.605
23.384SerGly: 23.384 ± 15.407
0.492SerHis: 0.492 ± 0.192
4.841SerIle: 4.841 ± 0.627
3.938SerLys: 3.938 ± 0.721
5.251SerLeu: 5.251 ± 0.634
1.641SerMet: 1.641 ± 0.375
4.02SerAsn: 4.02 ± 0.884
2.708SerPro: 2.708 ± 0.513
2.708SerGln: 2.708 ± 0.5
1.887SerArg: 1.887 ± 0.324
4.102SerSer: 4.102 ± 0.778
5.579SerThr: 5.579 ± 1.052
5.333SerVal: 5.333 ± 0.701
0.492SerTrp: 0.492 ± 0.286
1.805SerTyr: 1.805 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
6.81ThrAla: 6.81 ± 1.608
0.41ThrCys: 0.41 ± 0.335
3.036ThrAsp: 3.036 ± 0.581
2.379ThrGlu: 2.379 ± 0.422
3.282ThrPhe: 3.282 ± 0.807
7.384ThrGly: 7.384 ± 1.718
0.492ThrHis: 0.492 ± 0.207
4.431ThrIle: 4.431 ± 0.975
2.626ThrLys: 2.626 ± 0.508
6.646ThrLeu: 6.646 ± 0.965
1.805ThrMet: 1.805 ± 0.45
5.743ThrAsn: 5.743 ± 0.986
3.364ThrPro: 3.364 ± 0.682
2.626ThrGln: 2.626 ± 0.486
1.723ThrArg: 1.723 ± 0.373
5.825ThrSer: 5.825 ± 0.785
6.728ThrThr: 6.728 ± 1.435
3.774ThrVal: 3.774 ± 0.92
1.395ThrTrp: 1.395 ± 0.34
3.036ThrTyr: 3.036 ± 0.524
0.0ThrXaa: 0.0 ± 0.0
Val
4.02ValAla: 4.02 ± 0.612
0.574ValCys: 0.574 ± 0.242
2.872ValAsp: 2.872 ± 0.631
2.872ValGlu: 2.872 ± 0.531
2.626ValPhe: 2.626 ± 0.625
5.087ValGly: 5.087 ± 0.809
0.82ValHis: 0.82 ± 0.272
3.774ValIle: 3.774 ± 0.52
2.133ValLys: 2.133 ± 0.561
3.528ValLeu: 3.528 ± 0.772
1.067ValMet: 1.067 ± 0.315
3.446ValAsn: 3.446 ± 0.461
1.641ValPro: 1.641 ± 0.365
1.887ValGln: 1.887 ± 0.384
1.149ValArg: 1.149 ± 0.329
4.431ValSer: 4.431 ± 0.605
4.431ValThr: 4.431 ± 0.801
2.379ValVal: 2.379 ± 0.443
0.82ValTrp: 0.82 ± 0.217
2.379ValTyr: 2.379 ± 0.426
0.0ValXaa: 0.0 ± 0.0
Trp
1.149TrpAla: 1.149 ± 0.299
0.082TrpCys: 0.082 ± 0.081
0.82TrpAsp: 0.82 ± 0.259
0.492TrpGlu: 0.492 ± 0.257
0.492TrpPhe: 0.492 ± 0.204
0.985TrpGly: 0.985 ± 0.257
0.41TrpHis: 0.41 ± 0.182
1.067TrpIle: 1.067 ± 0.277
0.656TrpLys: 0.656 ± 0.225
0.656TrpLeu: 0.656 ± 0.226
0.328TrpMet: 0.328 ± 0.166
0.985TrpAsn: 0.985 ± 0.258
0.0TrpPro: 0.0 ± 0.0
0.82TrpGln: 0.82 ± 0.315
0.574TrpArg: 0.574 ± 0.226
2.461TrpSer: 2.461 ± 0.782
1.149TrpThr: 1.149 ± 0.332
0.656TrpVal: 0.656 ± 0.282
0.328TrpTrp: 0.328 ± 0.14
0.82TrpTyr: 0.82 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.708TyrAla: 2.708 ± 0.472
0.164TyrCys: 0.164 ± 0.108
2.954TyrAsp: 2.954 ± 0.565
1.969TyrGlu: 1.969 ± 0.491
1.395TyrPhe: 1.395 ± 0.325
3.446TyrGly: 3.446 ± 0.575
0.41TyrHis: 0.41 ± 0.191
3.118TyrIle: 3.118 ± 0.534
2.379TyrLys: 2.379 ± 0.448
3.036TyrLeu: 3.036 ± 0.53
0.574TyrMet: 0.574 ± 0.233
2.379TyrAsn: 2.379 ± 0.437
1.477TyrPro: 1.477 ± 0.314
1.559TyrGln: 1.559 ± 0.325
1.887TyrArg: 1.887 ± 0.449
8.697TyrSer: 8.697 ± 5.254
3.036TyrThr: 3.036 ± 0.774
1.477TyrVal: 1.477 ± 0.271
0.738TyrTrp: 0.738 ± 0.192
1.559TyrTyr: 1.559 ± 0.519
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12189 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski