Amino acid dipepetide frequency for Methanobacterium virus PhiF3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.705AlaAla: 0.705 ± 0.309
0.705AlaCys: 0.705 ± 0.243
2.82AlaAsp: 2.82 ± 0.791
3.927AlaGlu: 3.927 ± 1.018
2.316AlaPhe: 2.316 ± 1.081
5.841AlaGly: 5.841 ± 1.312
0.604AlaHis: 0.604 ± 0.263
3.122AlaIle: 3.122 ± 0.93
2.417AlaLys: 2.417 ± 0.64
6.848AlaLeu: 6.848 ± 1.104
1.913AlaMet: 1.913 ± 0.587
1.41AlaAsn: 1.41 ± 0.63
3.021AlaPro: 3.021 ± 0.539
1.511AlaGln: 1.511 ± 0.396
3.525AlaArg: 3.525 ± 0.647
5.337AlaSer: 5.337 ± 1.157
3.625AlaThr: 3.625 ± 0.538
5.035AlaVal: 5.035 ± 0.951
1.712AlaTrp: 1.712 ± 0.538
2.417AlaTyr: 2.417 ± 0.458
0.0AlaXaa: 0.0 ± 0.0
Cys
0.403CysAla: 0.403 ± 0.162
0.101CysCys: 0.101 ± 0.119
0.403CysAsp: 0.403 ± 0.233
0.604CysGlu: 0.604 ± 0.28
0.201CysPhe: 0.201 ± 0.133
1.712CysGly: 1.712 ± 0.748
0.302CysHis: 0.302 ± 0.186
1.108CysIle: 1.108 ± 0.554
1.813CysLys: 1.813 ± 0.708
1.309CysLeu: 1.309 ± 0.384
0.0CysMet: 0.0 ± 0.0
0.302CysAsn: 0.302 ± 0.195
0.604CysPro: 0.604 ± 0.293
0.201CysGln: 0.201 ± 0.151
0.806CysArg: 0.806 ± 0.47
0.504CysSer: 0.504 ± 0.23
0.705CysThr: 0.705 ± 0.29
0.705CysVal: 0.705 ± 0.306
0.101CysTrp: 0.101 ± 0.098
0.604CysTyr: 0.604 ± 0.287
0.0CysXaa: 0.0 ± 0.0
Asp
3.323AspAla: 3.323 ± 0.849
1.309AspCys: 1.309 ± 0.479
4.431AspAsp: 4.431 ± 0.953
5.74AspGlu: 5.74 ± 1.196
3.424AspPhe: 3.424 ± 0.796
4.935AspGly: 4.935 ± 0.905
1.208AspHis: 1.208 ± 0.391
2.82AspIle: 2.82 ± 0.712
4.129AspLys: 4.129 ± 0.544
4.431AspLeu: 4.431 ± 0.814
1.208AspMet: 1.208 ± 0.29
1.41AspAsn: 1.41 ± 0.388
3.927AspPro: 3.927 ± 0.892
0.806AspGln: 0.806 ± 0.246
2.518AspArg: 2.518 ± 0.504
3.827AspSer: 3.827 ± 0.576
2.618AspThr: 2.618 ± 0.538
3.021AspVal: 3.021 ± 0.517
0.906AspTrp: 0.906 ± 0.333
3.424AspTyr: 3.424 ± 0.612
0.0AspXaa: 0.0 ± 0.0
Glu
5.237GluAla: 5.237 ± 0.628
0.604GluCys: 0.604 ± 0.327
3.424GluAsp: 3.424 ± 0.808
6.143GluGlu: 6.143 ± 1.199
3.223GluPhe: 3.223 ± 0.55
6.042GluGly: 6.042 ± 0.722
0.302GluHis: 0.302 ± 0.187
3.726GluIle: 3.726 ± 0.858
3.827GluLys: 3.827 ± 0.887
6.848GluLeu: 6.848 ± 1.004
1.511GluMet: 1.511 ± 0.393
1.712GluAsn: 1.712 ± 0.515
4.129GluPro: 4.129 ± 1.103
1.41GluGln: 1.41 ± 0.457
3.021GluArg: 3.021 ± 0.594
5.136GluSer: 5.136 ± 0.781
3.827GluThr: 3.827 ± 0.558
5.74GluVal: 5.74 ± 0.755
1.41GluTrp: 1.41 ± 0.328
4.028GluTyr: 4.028 ± 0.529
0.0GluXaa: 0.0 ± 0.0
Phe
0.906PheAla: 0.906 ± 0.389
0.403PheCys: 0.403 ± 0.28
3.323PheAsp: 3.323 ± 0.578
1.611PheGlu: 1.611 ± 0.389
1.108PhePhe: 1.108 ± 0.336
1.813PheGly: 1.813 ± 0.542
0.705PheHis: 0.705 ± 0.229
2.316PheIle: 2.316 ± 0.465
3.927PheLys: 3.927 ± 0.672
2.417PheLeu: 2.417 ± 0.503
1.511PheMet: 1.511 ± 0.366
3.122PheAsn: 3.122 ± 0.38
1.108PhePro: 1.108 ± 0.42
1.913PheGln: 1.913 ± 0.518
2.216PheArg: 2.216 ± 0.463
2.014PheSer: 2.014 ± 0.456
3.625PheThr: 3.625 ± 0.465
1.712PheVal: 1.712 ± 0.448
0.403PheTrp: 0.403 ± 0.188
1.611PheTyr: 1.611 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
6.042GlyAla: 6.042 ± 1.711
0.705GlyCys: 0.705 ± 0.258
4.935GlyAsp: 4.935 ± 0.702
6.143GlyGlu: 6.143 ± 0.856
3.927GlyPhe: 3.927 ± 0.901
8.459GlyGly: 8.459 ± 0.973
1.007GlyHis: 1.007 ± 0.389
3.625GlyIle: 3.625 ± 0.766
4.129GlyLys: 4.129 ± 0.58
7.15GlyLeu: 7.15 ± 1.382
1.813GlyMet: 1.813 ± 0.402
2.618GlyAsn: 2.618 ± 0.508
2.417GlyPro: 2.417 ± 0.635
1.611GlyGln: 1.611 ± 0.432
3.223GlyArg: 3.223 ± 0.609
5.841GlySer: 5.841 ± 0.983
4.733GlyThr: 4.733 ± 0.962
7.956GlyVal: 7.956 ± 0.974
1.41GlyTrp: 1.41 ± 0.38
4.532GlyTyr: 4.532 ± 0.559
0.0GlyXaa: 0.0 ± 0.0
His
0.604HisAla: 0.604 ± 0.271
0.201HisCys: 0.201 ± 0.116
0.201HisAsp: 0.201 ± 0.146
1.108HisGlu: 1.108 ± 0.405
0.403HisPhe: 0.403 ± 0.247
0.906HisGly: 0.906 ± 0.307
0.604HisHis: 0.604 ± 0.241
1.309HisIle: 1.309 ± 0.426
0.705HisLys: 0.705 ± 0.311
1.913HisLeu: 1.913 ± 0.687
0.806HisMet: 0.806 ± 0.388
0.705HisAsn: 0.705 ± 0.362
1.41HisPro: 1.41 ± 0.511
0.403HisGln: 0.403 ± 0.144
0.604HisArg: 0.604 ± 0.232
0.906HisSer: 0.906 ± 0.408
1.007HisThr: 1.007 ± 0.314
0.604HisVal: 0.604 ± 0.302
0.302HisTrp: 0.302 ± 0.183
1.108HisTyr: 1.108 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
2.618IleAla: 2.618 ± 0.618
1.007IleCys: 1.007 ± 0.583
3.726IleAsp: 3.726 ± 0.494
4.028IleGlu: 4.028 ± 0.803
1.309IlePhe: 1.309 ± 0.356
3.323IleGly: 3.323 ± 0.654
1.007IleHis: 1.007 ± 0.355
3.625IleIle: 3.625 ± 0.734
4.431IleLys: 4.431 ± 0.837
5.237IleLeu: 5.237 ± 0.62
0.806IleMet: 0.806 ± 0.227
2.518IleAsn: 2.518 ± 0.515
2.014IlePro: 2.014 ± 0.387
2.216IleGln: 2.216 ± 0.293
5.337IleArg: 5.337 ± 0.758
3.927IleSer: 3.927 ± 1.283
3.927IleThr: 3.927 ± 0.599
2.92IleVal: 2.92 ± 0.53
0.504IleTrp: 0.504 ± 0.247
2.115IleTyr: 2.115 ± 0.499
0.0IleXaa: 0.0 ± 0.0
Lys
3.424LysAla: 3.424 ± 0.631
0.806LysCys: 0.806 ± 0.309
5.237LysAsp: 5.237 ± 1.044
5.337LysGlu: 5.337 ± 1.27
1.913LysPhe: 1.913 ± 0.382
6.344LysGly: 6.344 ± 1.031
1.007LysHis: 1.007 ± 0.295
3.223LysIle: 3.223 ± 0.715
4.028LysLys: 4.028 ± 0.869
5.337LysLeu: 5.337 ± 0.825
1.712LysMet: 1.712 ± 0.369
2.618LysAsn: 2.618 ± 0.458
3.525LysPro: 3.525 ± 0.73
1.813LysGln: 1.813 ± 0.47
3.223LysArg: 3.223 ± 0.665
4.028LysSer: 4.028 ± 0.861
3.021LysThr: 3.021 ± 0.563
4.632LysVal: 4.632 ± 0.782
0.604LysTrp: 0.604 ± 0.289
2.82LysTyr: 2.82 ± 0.73
0.0LysXaa: 0.0 ± 0.0
Leu
4.431LeuAla: 4.431 ± 0.548
0.604LeuCys: 0.604 ± 0.347
4.834LeuAsp: 4.834 ± 0.715
5.74LeuGlu: 5.74 ± 0.695
3.323LeuPhe: 3.323 ± 0.753
5.639LeuGly: 5.639 ± 0.813
1.309LeuHis: 1.309 ± 0.495
6.445LeuIle: 6.445 ± 0.618
7.553LeuLys: 7.553 ± 0.971
7.049LeuLeu: 7.049 ± 0.915
2.316LeuMet: 2.316 ± 0.788
3.827LeuAsn: 3.827 ± 0.571
3.827LeuPro: 3.827 ± 0.568
3.525LeuGln: 3.525 ± 0.719
6.244LeuArg: 6.244 ± 0.946
6.848LeuSer: 6.848 ± 0.84
5.639LeuThr: 5.639 ± 0.645
4.431LeuVal: 4.431 ± 0.6
1.208LeuTrp: 1.208 ± 0.355
3.625LeuTyr: 3.625 ± 0.679
0.0LeuXaa: 0.0 ± 0.0
Met
1.611MetAla: 1.611 ± 0.452
0.403MetCys: 0.403 ± 0.21
1.41MetAsp: 1.41 ± 0.336
1.511MetGlu: 1.511 ± 0.418
0.403MetPhe: 0.403 ± 0.211
1.611MetGly: 1.611 ± 0.331
0.0MetHis: 0.0 ± 0.0
1.208MetIle: 1.208 ± 0.487
3.424MetLys: 3.424 ± 0.505
1.712MetLeu: 1.712 ± 0.456
0.806MetMet: 0.806 ± 0.26
1.007MetAsn: 1.007 ± 0.288
0.806MetPro: 0.806 ± 0.344
0.705MetGln: 0.705 ± 0.258
1.712MetArg: 1.712 ± 0.418
2.316MetSer: 2.316 ± 0.607
0.906MetThr: 0.906 ± 0.352
2.216MetVal: 2.216 ± 0.462
0.101MetTrp: 0.101 ± 0.084
0.504MetTyr: 0.504 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
1.611AsnAla: 1.611 ± 0.557
0.504AsnCys: 0.504 ± 0.285
1.712AsnAsp: 1.712 ± 0.358
2.115AsnGlu: 2.115 ± 0.419
0.504AsnPhe: 0.504 ± 0.27
2.92AsnGly: 2.92 ± 0.617
0.806AsnHis: 0.806 ± 0.289
2.216AsnIle: 2.216 ± 0.49
1.208AsnLys: 1.208 ± 0.392
3.927AsnLeu: 3.927 ± 0.584
0.504AsnMet: 0.504 ± 0.285
1.309AsnAsn: 1.309 ± 0.247
3.625AsnPro: 3.625 ± 0.526
1.41AsnGln: 1.41 ± 0.381
1.813AsnArg: 1.813 ± 0.38
2.316AsnSer: 2.316 ± 0.487
3.424AsnThr: 3.424 ± 0.727
3.021AsnVal: 3.021 ± 0.604
0.604AsnTrp: 0.604 ± 0.28
0.906AsnTyr: 0.906 ± 0.407
0.0AsnXaa: 0.0 ± 0.0
Pro
3.525ProAla: 3.525 ± 0.541
0.403ProCys: 0.403 ± 0.162
2.115ProAsp: 2.115 ± 0.584
4.935ProGlu: 4.935 ± 1.144
1.712ProPhe: 1.712 ± 0.376
3.424ProGly: 3.424 ± 0.612
1.41ProHis: 1.41 ± 0.49
2.014ProIle: 2.014 ± 0.314
2.014ProLys: 2.014 ± 0.531
4.431ProLeu: 4.431 ± 0.741
0.806ProMet: 0.806 ± 0.417
1.007ProAsn: 1.007 ± 0.29
2.719ProPro: 2.719 ± 0.79
0.906ProGln: 0.906 ± 0.379
2.216ProArg: 2.216 ± 0.531
3.625ProSer: 3.625 ± 0.724
2.216ProThr: 2.216 ± 0.619
3.927ProVal: 3.927 ± 0.566
0.604ProTrp: 0.604 ± 0.204
1.813ProTyr: 1.813 ± 0.591
0.0ProXaa: 0.0 ± 0.0
Gln
2.316GlnAla: 2.316 ± 0.577
0.504GlnCys: 0.504 ± 0.242
2.014GlnAsp: 2.014 ± 0.621
1.511GlnGlu: 1.511 ± 0.315
1.41GlnPhe: 1.41 ± 0.353
2.316GlnGly: 2.316 ± 0.444
0.101GlnHis: 0.101 ± 0.084
1.309GlnIle: 1.309 ± 0.268
0.806GlnLys: 0.806 ± 0.274
3.323GlnLeu: 3.323 ± 0.644
0.504GlnMet: 0.504 ± 0.201
0.403GlnAsn: 0.403 ± 0.19
0.906GlnPro: 0.906 ± 0.243
0.705GlnGln: 0.705 ± 0.228
1.712GlnArg: 1.712 ± 0.535
1.913GlnSer: 1.913 ± 0.421
1.41GlnThr: 1.41 ± 0.518
2.618GlnVal: 2.618 ± 0.516
0.504GlnTrp: 0.504 ± 0.173
1.309GlnTyr: 1.309 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
3.625ArgAla: 3.625 ± 0.616
0.705ArgCys: 0.705 ± 0.299
2.417ArgAsp: 2.417 ± 0.482
4.632ArgGlu: 4.632 ± 0.853
2.518ArgPhe: 2.518 ± 0.477
3.625ArgGly: 3.625 ± 0.934
1.108ArgHis: 1.108 ± 0.35
3.525ArgIle: 3.525 ± 0.652
3.726ArgLys: 3.726 ± 0.889
5.237ArgLeu: 5.237 ± 0.632
2.014ArgMet: 2.014 ± 0.611
2.417ArgAsn: 2.417 ± 0.547
1.712ArgPro: 1.712 ± 0.442
1.712ArgGln: 1.712 ± 0.492
4.935ArgArg: 4.935 ± 0.913
2.92ArgSer: 2.92 ± 0.492
3.525ArgThr: 3.525 ± 0.691
5.136ArgVal: 5.136 ± 0.925
1.007ArgTrp: 1.007 ± 0.351
2.115ArgTyr: 2.115 ± 0.47
0.0ArgXaa: 0.0 ± 0.0
Ser
6.143SerAla: 6.143 ± 1.564
0.705SerCys: 0.705 ± 0.343
4.129SerAsp: 4.129 ± 0.705
4.028SerGlu: 4.028 ± 0.748
3.625SerPhe: 3.625 ± 0.519
7.351SerGly: 7.351 ± 1.125
0.705SerHis: 0.705 ± 0.258
3.525SerIle: 3.525 ± 0.8
4.129SerLys: 4.129 ± 0.597
5.539SerLeu: 5.539 ± 0.685
1.813SerMet: 1.813 ± 0.364
1.913SerAsn: 1.913 ± 0.408
1.611SerPro: 1.611 ± 0.419
2.014SerGln: 2.014 ± 0.447
4.028SerArg: 4.028 ± 0.793
4.129SerSer: 4.129 ± 0.701
5.639SerThr: 5.639 ± 1.139
4.834SerVal: 4.834 ± 0.891
0.906SerTrp: 0.906 ± 0.405
2.417SerTyr: 2.417 ± 0.876
0.0SerXaa: 0.0 ± 0.0
Thr
4.935ThrAla: 4.935 ± 0.751
1.208ThrCys: 1.208 ± 0.507
3.726ThrAsp: 3.726 ± 0.82
3.424ThrGlu: 3.424 ± 0.786
2.115ThrPhe: 2.115 ± 0.454
8.359ThrGly: 8.359 ± 1.163
0.806ThrHis: 0.806 ± 0.29
2.719ThrIle: 2.719 ± 0.48
2.014ThrLys: 2.014 ± 0.416
5.942ThrLeu: 5.942 ± 0.904
1.208ThrMet: 1.208 ± 0.502
1.913ThrAsn: 1.913 ± 0.393
2.82ThrPro: 2.82 ± 0.596
1.913ThrGln: 1.913 ± 0.445
3.223ThrArg: 3.223 ± 0.458
3.525ThrSer: 3.525 ± 0.819
3.323ThrThr: 3.323 ± 0.62
5.639ThrVal: 5.639 ± 1.191
0.906ThrTrp: 0.906 ± 0.326
2.216ThrTyr: 2.216 ± 0.766
0.0ThrXaa: 0.0 ± 0.0
Val
3.726ValAla: 3.726 ± 0.932
0.705ValCys: 0.705 ± 0.337
4.834ValAsp: 4.834 ± 0.519
6.042ValGlu: 6.042 ± 0.846
2.518ValPhe: 2.518 ± 0.406
3.726ValGly: 3.726 ± 0.619
1.309ValHis: 1.309 ± 0.396
4.532ValIle: 4.532 ± 0.699
7.654ValLys: 7.654 ± 0.823
5.136ValLeu: 5.136 ± 0.667
1.913ValMet: 1.913 ± 0.45
3.726ValAsn: 3.726 ± 0.708
3.021ValPro: 3.021 ± 0.774
1.41ValGln: 1.41 ± 0.35
4.632ValArg: 4.632 ± 0.891
5.438ValSer: 5.438 ± 0.855
4.532ValThr: 4.532 ± 0.558
4.632ValVal: 4.632 ± 0.723
0.806ValTrp: 0.806 ± 0.264
2.014ValTyr: 2.014 ± 0.603
0.0ValXaa: 0.0 ± 0.0
Trp
1.007TrpAla: 1.007 ± 0.367
0.101TrpCys: 0.101 ± 0.128
0.705TrpAsp: 0.705 ± 0.252
0.806TrpGlu: 0.806 ± 0.255
0.302TrpPhe: 0.302 ± 0.15
0.604TrpGly: 0.604 ± 0.176
0.302TrpHis: 0.302 ± 0.147
1.208TrpIle: 1.208 ± 0.314
0.806TrpLys: 0.806 ± 0.284
1.309TrpLeu: 1.309 ± 0.299
0.201TrpMet: 0.201 ± 0.125
0.403TrpAsn: 0.403 ± 0.193
0.604TrpPro: 0.604 ± 0.257
0.604TrpGln: 0.604 ± 0.222
0.705TrpArg: 0.705 ± 0.273
1.913TrpSer: 1.913 ± 0.478
1.208TrpThr: 1.208 ± 0.413
1.007TrpVal: 1.007 ± 0.473
0.101TrpTrp: 0.101 ± 0.089
0.403TrpTyr: 0.403 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.618TyrAla: 2.618 ± 0.389
0.806TyrCys: 0.806 ± 0.382
3.021TyrAsp: 3.021 ± 0.833
1.913TyrGlu: 1.913 ± 0.462
1.511TyrPhe: 1.511 ± 0.437
3.323TyrGly: 3.323 ± 0.599
1.208TyrHis: 1.208 ± 0.419
3.021TyrIle: 3.021 ± 0.523
1.913TyrLys: 1.913 ± 0.612
3.323TyrLeu: 3.323 ± 0.771
0.906TyrMet: 0.906 ± 0.223
2.115TyrAsn: 2.115 ± 0.746
2.014TyrPro: 2.014 ± 0.547
0.806TyrGln: 0.806 ± 0.345
2.92TyrArg: 2.92 ± 0.703
2.719TyrSer: 2.719 ± 0.667
3.122TyrThr: 3.122 ± 0.927
2.518TyrVal: 2.518 ± 0.507
0.201TyrTrp: 0.201 ± 0.178
1.913TyrTyr: 1.913 ± 0.371
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (9931 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski