Amino acid dipeptide frequency for Hydrogenobaculum phage 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
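As a minimal sketch of the calculation described above, the following Python snippet counts overlapping residue pairs and reports them in per mille, alongside the uniform expectation of 1/400. The function name `dipeptide_per_mille` and the toy sequences are illustrative assumptions and are not taken from Proteome-pI; in particular, the sketch counts pairs within each protein only, which may differ from how the database handles protein boundaries.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 20 * 20 = 400 equally likely dipeptides.
n_pairs = len(list(product(AMINO_ACIDS, repeat=2)))
expected = 1000.0 / n_pairs  # 2.5 per mille, i.e. 0.25%

# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKKLEE", "MEEK"]
freqs = dipeptide_per_mille(toy)
print(n_pairs, expected, freqs["EE"])
```

In the toy input, the pair EE occurs twice among eight pairs, so it is far above the uniform expectation; with real proteomes the deviations are smaller but follow the same logic.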

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
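The unit conversion and the comparison against the uniform expectation can be sketched as follows. The two values are copied from the table below; the helper names and the simple greater-than or less-than rule are assumptions for illustration, since the page does not state the exact threshold used for the colouring.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table: GluGlu and AlaAla.
for name, value in [("GluGlu", 13.241), ("AlaAla", 0.331)]:
    print(name, per_mille_to_fraction(value), representation(value))
```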

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as dipeptide: frequency ± error and grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.331 ± 0.211
AlaCys: 0.662 ± 0.371
AlaAsp: 2.317 ± 0.607
AlaGlu: 4.303 ± 0.774
AlaPhe: 3.31 ± 0.625
AlaGly: 4.965 ± 1.563
AlaHis: 1.324 ± 0.536
AlaIle: 4.8 ± 0.883
AlaLys: 4.138 ± 0.726
AlaLeu: 5.958 ± 0.643
AlaMet: 2.483 ± 0.795
AlaAsn: 2.317 ± 0.686
AlaPro: 2.814 ± 0.819
AlaGln: 3.641 ± 0.874
AlaArg: 4.138 ± 0.674
AlaSer: 1.986 ± 0.54
AlaThr: 2.648 ± 0.682
AlaVal: 3.476 ± 0.985
AlaTrp: 1.159 ± 0.57
AlaTyr: 3.972 ± 0.803
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.497 ± 0.463
CysCys: 0.0 ± 0.0
CysAsp: 0.331 ± 0.309
CysGlu: 0.662 ± 0.334
CysPhe: 0.0 ± 0.0
CysGly: 0.331 ± 0.263
CysHis: 0.166 ± 0.186
CysIle: 0.331 ± 0.253
CysLys: 0.497 ± 0.382
CysLeu: 0.828 ± 0.461
CysMet: 0.331 ± 0.256
CysAsn: 0.497 ± 0.357
CysPro: 0.166 ± 0.164
CysGln: 0.166 ± 0.165
CysArg: 0.331 ± 0.253
CysSer: 0.331 ± 0.295
CysThr: 0.0 ± 0.0
CysVal: 0.662 ± 0.467
CysTrp: 0.166 ± 0.184
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.979 ± 0.691
AspCys: 0.166 ± 0.154
AspAsp: 2.648 ± 0.708
AspGlu: 4.138 ± 1.259
AspPhe: 3.31 ± 0.827
AspGly: 2.483 ± 0.946
AspHis: 0.331 ± 0.309
AspIle: 3.807 ± 0.837
AspLys: 3.807 ± 0.816
AspLeu: 6.62 ± 1.246
AspMet: 0.662 ± 0.262
AspAsn: 1.655 ± 0.345
AspPro: 2.483 ± 0.664
AspGln: 1.324 ± 0.35
AspArg: 2.483 ± 0.683
AspSer: 1.655 ± 0.582
AspThr: 0.662 ± 0.3
AspVal: 2.979 ± 0.601
AspTrp: 0.993 ± 0.498
AspTyr: 2.152 ± 0.789
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.455 ± 0.992
GluCys: 0.331 ± 0.224
GluAsp: 5.296 ± 1.273
GluGlu: 13.241 ± 3.757
GluPhe: 4.303 ± 0.766
GluGly: 3.641 ± 0.813
GluHis: 1.159 ± 0.343
GluIle: 7.779 ± 0.993
GluLys: 9.93 ± 1.722
GluLeu: 10.427 ± 1.385
GluMet: 2.317 ± 0.476
GluAsn: 2.814 ± 0.598
GluPro: 1.821 ± 0.517
GluGln: 3.145 ± 1.039
GluArg: 4.634 ± 1.303
GluSer: 2.483 ± 0.724
GluThr: 3.807 ± 1.058
GluVal: 6.289 ± 1.072
GluTrp: 0.331 ± 0.186
GluTyr: 3.807 ± 0.748
GluXaa: 0.331 ± 0.168

Phe
PheAla: 1.986 ± 0.514
PheCys: 0.0 ± 0.0
PheAsp: 2.814 ± 0.678
PheGlu: 5.627 ± 0.915
PhePhe: 1.655 ± 0.673
PheGly: 1.821 ± 0.404
PheHis: 1.159 ± 0.492
PheIle: 4.138 ± 0.837
PheLys: 3.476 ± 0.873
PheLeu: 5.793 ± 1.48
PheMet: 0.331 ± 0.168
PheAsn: 2.483 ± 0.377
PhePro: 1.49 ± 0.394
PheGln: 1.324 ± 0.318
PheArg: 2.648 ± 0.447
PheSer: 2.648 ± 0.954
PheThr: 3.145 ± 0.495
PheVal: 2.317 ± 0.551
PheTrp: 0.993 ± 0.441
PheTyr: 3.641 ± 0.729
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.476 ± 0.767
GlyCys: 0.497 ± 0.263
GlyAsp: 2.317 ± 0.785
GlyGlu: 2.979 ± 0.534
GlyPhe: 3.145 ± 0.888
GlyGly: 1.986 ± 0.545
GlyHis: 1.655 ± 0.482
GlyIle: 3.476 ± 0.792
GlyLys: 3.972 ± 0.757
GlyLeu: 5.627 ± 1.125
GlyMet: 1.159 ± 0.573
GlyAsn: 1.986 ± 0.388
GlyPro: 0.828 ± 0.247
GlyGln: 2.317 ± 1.057
GlyArg: 5.296 ± 1.053
GlySer: 2.979 ± 0.787
GlyThr: 1.986 ± 0.558
GlyVal: 3.641 ± 0.968
GlyTrp: 1.49 ± 0.373
GlyTyr: 1.655 ± 0.463
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.324 ± 0.78
HisCys: 0.166 ± 0.15
HisAsp: 0.497 ± 0.364
HisGlu: 1.655 ± 0.445
HisPhe: 1.324 ± 0.502
HisGly: 0.497 ± 0.382
HisHis: 0.662 ± 0.279
HisIle: 0.828 ± 0.511
HisLys: 0.497 ± 0.279
HisLeu: 1.49 ± 0.764
HisMet: 0.497 ± 0.351
HisAsn: 0.828 ± 0.474
HisPro: 1.324 ± 0.289
HisGln: 1.655 ± 0.547
HisArg: 0.166 ± 0.131
HisSer: 0.828 ± 0.356
HisThr: 0.331 ± 0.223
HisVal: 0.497 ± 0.238
HisTrp: 0.497 ± 0.223
HisTyr: 1.159 ± 0.317
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.131 ± 0.806
IleCys: 0.662 ± 0.324
IleAsp: 3.641 ± 0.62
IleGlu: 8.11 ± 1.027
IlePhe: 2.483 ± 0.665
IleGly: 3.476 ± 0.742
IleHis: 1.159 ± 0.398
IleIle: 3.476 ± 0.899
IleLys: 7.117 ± 1.137
IleLeu: 7.282 ± 0.887
IleMet: 0.662 ± 0.463
IleAsn: 2.483 ± 0.532
IlePro: 3.476 ± 0.709
IleGln: 2.317 ± 0.651
IleArg: 4.634 ± 1.044
IleSer: 3.807 ± 0.716
IleThr: 3.476 ± 0.543
IleVal: 4.965 ± 0.856
IleTrp: 0.993 ± 0.427
IleTyr: 1.324 ± 0.699
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.786 ± 1.267
LysCys: 0.331 ± 0.295
LysAsp: 4.303 ± 0.751
LysGlu: 10.924 ± 1.307
LysPhe: 3.972 ± 0.737
LysGly: 3.641 ± 0.669
LysHis: 1.49 ± 0.334
LysIle: 8.275 ± 1.091
LysLys: 9.434 ± 1.033
LysLeu: 7.282 ± 1.562
LysMet: 2.648 ± 0.749
LysAsn: 1.986 ± 0.701
LysPro: 3.145 ± 0.794
LysGln: 3.972 ± 0.856
LysArg: 3.476 ± 0.945
LysSer: 2.979 ± 0.666
LysThr: 5.627 ± 1.072
LysVal: 5.296 ± 1.119
LysTrp: 1.159 ± 0.499
LysTyr: 3.641 ± 0.55
LysXaa: 0.166 ± 0.131

Leu
LeuAla: 6.455 ± 1.132
LeuCys: 1.159 ± 0.438
LeuAsp: 3.972 ± 1.02
LeuGlu: 8.772 ± 1.592
LeuPhe: 5.793 ± 1.303
LeuGly: 5.296 ± 0.774
LeuHis: 1.49 ± 0.37
LeuIle: 6.124 ± 0.952
LeuLys: 10.427 ± 1.543
LeuLeu: 8.606 ± 1.215
LeuMet: 3.31 ± 0.594
LeuAsn: 4.469 ± 0.959
LeuPro: 3.145 ± 0.805
LeuGln: 4.8 ± 0.574
LeuArg: 5.793 ± 0.86
LeuSer: 8.11 ± 1.137
LeuThr: 4.634 ± 0.808
LeuVal: 3.31 ± 0.674
LeuTrp: 1.49 ± 0.665
LeuTyr: 5.131 ± 1.147
LeuXaa: 0.331 ± 0.183

Met
MetAla: 2.483 ± 0.884
MetCys: 0.166 ± 0.164
MetAsp: 0.331 ± 0.25
MetGlu: 1.655 ± 0.841
MetPhe: 1.655 ± 0.552
MetGly: 1.49 ± 0.472
MetHis: 0.166 ± 0.187
MetIle: 0.993 ± 0.423
MetLys: 2.317 ± 0.597
MetLeu: 3.145 ± 0.733
MetMet: 0.828 ± 0.57
MetAsn: 0.497 ± 0.375
MetPro: 1.49 ± 0.522
MetGln: 2.483 ± 0.772
MetArg: 1.324 ± 0.482
MetSer: 1.655 ± 0.425
MetThr: 1.655 ± 0.325
MetVal: 0.662 ± 0.364
MetTrp: 0.497 ± 0.316
MetTyr: 0.993 ± 0.473
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.483 ± 0.538
AsnCys: 0.166 ± 0.154
AsnAsp: 0.828 ± 0.314
AsnGlu: 3.641 ± 0.544
AsnPhe: 1.49 ± 0.419
AsnGly: 1.986 ± 0.667
AsnHis: 0.497 ± 0.247
AsnIle: 2.317 ± 0.632
AsnLys: 2.648 ± 0.67
AsnLeu: 4.8 ± 0.83
AsnMet: 1.159 ± 0.534
AsnAsn: 1.655 ± 0.423
AsnPro: 1.986 ± 0.629
AsnGln: 1.49 ± 0.562
AsnArg: 1.986 ± 0.654
AsnSer: 1.655 ± 0.703
AsnThr: 0.993 ± 0.422
AsnVal: 2.317 ± 0.548
AsnTrp: 0.331 ± 0.247
AsnTyr: 2.317 ± 0.705
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.159 ± 0.354
ProCys: 0.166 ± 0.164
ProAsp: 1.655 ± 0.326
ProGlu: 3.476 ± 0.619
ProPhe: 1.159 ± 0.241
ProGly: 1.159 ± 0.382
ProHis: 0.828 ± 0.384
ProIle: 2.648 ± 0.956
ProLys: 4.965 ± 1.199
ProLeu: 3.476 ± 0.594
ProMet: 0.497 ± 0.331
ProAsn: 1.159 ± 0.457
ProPro: 1.821 ± 0.656
ProGln: 2.152 ± 0.934
ProArg: 0.993 ± 0.442
ProSer: 1.49 ± 0.448
ProThr: 2.317 ± 0.462
ProVal: 2.814 ± 0.562
ProTrp: 0.497 ± 0.271
ProTyr: 1.49 ± 0.481
ProXaa: 0.166 ± 0.131

Gln
GlnAla: 3.145 ± 0.964
GlnCys: 0.0 ± 0.0
GlnAsp: 1.821 ± 0.687
GlnGlu: 2.979 ± 0.887
GlnPhe: 1.986 ± 0.53
GlnGly: 2.648 ± 0.642
GlnHis: 0.662 ± 0.387
GlnIle: 3.807 ± 0.961
GlnLys: 3.807 ± 1.066
GlnLeu: 4.965 ± 1.173
GlnMet: 1.49 ± 0.473
GlnAsn: 1.986 ± 0.755
GlnPro: 1.49 ± 0.584
GlnGln: 4.138 ± 2.23
GlnArg: 1.986 ± 0.498
GlnSer: 1.324 ± 0.536
GlnThr: 1.655 ± 0.518
GlnVal: 2.814 ± 0.77
GlnTrp: 0.662 ± 0.505
GlnTyr: 0.662 ± 0.315
GlnXaa: 0.331 ± 0.195

Arg
ArgAla: 3.972 ± 0.726
ArgCys: 0.662 ± 0.431
ArgAsp: 1.986 ± 0.769
ArgGlu: 4.634 ± 0.661
ArgPhe: 2.814 ± 0.769
ArgGly: 5.296 ± 0.844
ArgHis: 0.828 ± 0.246
ArgIle: 3.476 ± 1.014
ArgLys: 4.8 ± 1.008
ArgLeu: 5.627 ± 0.803
ArgMet: 1.655 ± 0.554
ArgAsn: 1.655 ± 0.676
ArgPro: 0.662 ± 0.319
ArgGln: 1.49 ± 0.662
ArgArg: 3.807 ± 1.201
ArgSer: 2.648 ± 0.512
ArgThr: 2.979 ± 0.644
ArgVal: 2.152 ± 0.609
ArgTrp: 0.828 ± 0.321
ArgTyr: 3.145 ± 0.851
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.979 ± 0.475
SerCys: 0.166 ± 0.187
SerAsp: 2.648 ± 0.553
SerGlu: 5.296 ± 0.787
SerPhe: 2.317 ± 0.568
SerGly: 2.814 ± 0.52
SerHis: 0.828 ± 0.356
SerIle: 2.648 ± 0.571
SerLys: 3.807 ± 0.657
SerLeu: 4.965 ± 0.822
SerMet: 1.159 ± 0.585
SerAsn: 1.821 ± 0.707
SerPro: 1.821 ± 0.504
SerGln: 1.49 ± 0.425
SerArg: 2.648 ± 0.544
SerSer: 2.483 ± 0.716
SerThr: 2.152 ± 0.512
SerVal: 1.821 ± 0.618
SerTrp: 0.828 ± 0.365
SerTyr: 1.821 ± 0.664
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.31 ± 0.61
ThrCys: 0.0 ± 0.0
ThrAsp: 2.648 ± 0.654
ThrGlu: 3.807 ± 0.924
ThrPhe: 1.986 ± 0.645
ThrGly: 2.979 ± 0.733
ThrHis: 0.331 ± 0.262
ThrIle: 3.972 ± 0.672
ThrLys: 3.476 ± 0.657
ThrLeu: 4.303 ± 0.814
ThrMet: 1.655 ± 0.48
ThrAsn: 1.49 ± 0.439
ThrPro: 1.49 ± 0.641
ThrGln: 1.821 ± 0.638
ThrArg: 1.49 ± 0.627
ThrSer: 2.648 ± 0.777
ThrThr: 1.655 ± 0.54
ThrVal: 2.979 ± 0.554
ThrTrp: 1.159 ± 0.31
ThrTyr: 1.49 ± 0.367
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.986 ± 0.494
ValCys: 0.166 ± 0.164
ValAsp: 2.648 ± 0.512
ValGlu: 5.462 ± 1.549
ValPhe: 3.641 ± 1.062
ValGly: 1.986 ± 0.657
ValHis: 0.828 ± 0.597
ValIle: 4.469 ± 0.869
ValLys: 6.786 ± 1.07
ValLeu: 4.303 ± 1.249
ValMet: 1.821 ± 0.66
ValAsn: 1.821 ± 0.553
ValPro: 2.814 ± 0.72
ValGln: 1.655 ± 0.409
ValArg: 3.145 ± 0.552
ValSer: 2.814 ± 0.715
ValThr: 2.483 ± 0.694
ValVal: 4.965 ± 0.854
ValTrp: 0.828 ± 0.435
ValTyr: 2.648 ± 0.883
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.828 ± 0.475
TrpCys: 0.166 ± 0.231
TrpAsp: 1.49 ± 0.547
TrpGlu: 0.497 ± 0.254
TrpPhe: 1.159 ± 0.335
TrpGly: 1.324 ± 0.408
TrpHis: 0.166 ± 0.207
TrpIle: 0.166 ± 0.15
TrpLys: 1.655 ± 0.506
TrpLeu: 2.317 ± 0.628
TrpMet: 0.331 ± 0.185
TrpAsn: 0.497 ± 0.281
TrpPro: 0.166 ± 0.131
TrpGln: 0.993 ± 0.36
TrpArg: 0.828 ± 0.397
TrpSer: 0.497 ± 0.248
TrpThr: 0.497 ± 0.28
TrpVal: 1.324 ± 0.447
TrpTrp: 0.497 ± 0.45
TrpTyr: 0.993 ± 0.441
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.814 ± 0.672
TyrCys: 0.662 ± 0.373
TyrAsp: 3.145 ± 0.601
TyrGlu: 2.483 ± 0.567
TyrPhe: 1.986 ± 0.577
TyrGly: 2.648 ± 0.463
TyrHis: 0.828 ± 0.302
TyrIle: 2.979 ± 0.629
TyrLys: 2.979 ± 0.7
TyrLeu: 4.469 ± 1.052
TyrMet: 1.324 ± 0.307
TyrAsn: 2.648 ± 0.765
TyrPro: 1.655 ± 0.638
TyrGln: 1.821 ± 0.606
TyrArg: 2.979 ± 0.807
TyrSer: 1.655 ± 0.444
TyrThr: 1.986 ± 0.437
TyrVal: 1.986 ± 0.72
TyrTrp: 0.993 ± 0.387
TyrTyr: 1.821 ± 0.435
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.166 ± 0.131
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.166 ± 0.131
XaaIle: 0.166 ± 0.167
XaaLys: 0.0 ± 0.0
XaaLeu: 0.331 ± 0.262
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.166 ± 0.131
XaaArg: 0.331 ± 0.234
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 26 proteins (6043 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
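A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the replicates is taken as the error. This is a minimal sketch under assumptions: the page does not say which spread measure is reported (the sample standard deviation is used here), and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKKLEE", "MEEK", "MAEEL", "MKLAV"]
err = bootstrap_error(toy, "EE")
print(err)
```

Resampling at the protein level rather than the residue level keeps each sequence intact, so dipeptides that cluster in a few proteins yield a larger error estimate.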

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski