Amino acid dipepetide frequency for Arthrobacter phage Mendel

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.434AlaAla: 12.434 ± 1.635
1.022AlaCys: 1.022 ± 0.357
5.962AlaAsp: 5.962 ± 1.142
4.769AlaGlu: 4.769 ± 1.061
4.599AlaPhe: 4.599 ± 0.702
11.412AlaGly: 11.412 ± 1.513
1.874AlaHis: 1.874 ± 0.562
6.302AlaIle: 6.302 ± 0.975
4.769AlaLys: 4.769 ± 1.005
7.154AlaLeu: 7.154 ± 1.504
1.533AlaMet: 1.533 ± 0.495
5.451AlaAsn: 5.451 ± 1.028
5.451AlaPro: 5.451 ± 0.95
3.918AlaGln: 3.918 ± 1.007
4.088AlaArg: 4.088 ± 0.735
4.94AlaSer: 4.94 ± 0.969
8.687AlaThr: 8.687 ± 1.894
6.813AlaVal: 6.813 ± 1.086
1.533AlaTrp: 1.533 ± 0.52
4.599AlaTyr: 4.599 ± 1.262
0.0AlaXaa: 0.0 ± 0.0
Cys
0.17CysAla: 0.17 ± 0.23
0.0CysCys: 0.0 ± 0.0
0.341CysAsp: 0.341 ± 0.228
0.511CysGlu: 0.511 ± 0.277
0.17CysPhe: 0.17 ± 0.158
0.511CysGly: 0.511 ± 0.279
0.341CysHis: 0.341 ± 0.251
0.17CysIle: 0.17 ± 0.158
0.0CysLys: 0.0 ± 0.0
0.681CysLeu: 0.681 ± 0.402
0.17CysMet: 0.17 ± 0.177
0.0CysAsn: 0.0 ± 0.0
0.852CysPro: 0.852 ± 0.318
0.0CysGln: 0.0 ± 0.0
1.022CysArg: 1.022 ± 0.396
0.0CysSer: 0.0 ± 0.0
0.341CysThr: 0.341 ± 0.216
0.17CysVal: 0.17 ± 0.177
0.17CysTrp: 0.17 ± 0.179
0.511CysTyr: 0.511 ± 0.277
0.0CysXaa: 0.0 ± 0.0
Asp
7.494AspAla: 7.494 ± 1.379
0.341AspCys: 0.341 ± 0.317
3.066AspAsp: 3.066 ± 0.643
3.918AspGlu: 3.918 ± 0.861
2.555AspPhe: 2.555 ± 0.488
4.429AspGly: 4.429 ± 0.905
1.192AspHis: 1.192 ± 0.583
3.236AspIle: 3.236 ± 0.739
3.747AspLys: 3.747 ± 0.995
2.385AspLeu: 2.385 ± 0.714
1.022AspMet: 1.022 ± 0.387
4.088AspAsn: 4.088 ± 0.69
5.28AspPro: 5.28 ± 0.969
2.044AspGln: 2.044 ± 0.685
2.725AspArg: 2.725 ± 0.93
2.896AspSer: 2.896 ± 0.67
5.11AspThr: 5.11 ± 0.771
3.918AspVal: 3.918 ± 1.048
1.363AspTrp: 1.363 ± 0.43
1.533AspTyr: 1.533 ± 0.423
0.0AspXaa: 0.0 ± 0.0
Glu
3.577GluAla: 3.577 ± 0.873
0.681GluCys: 0.681 ± 0.404
1.874GluAsp: 1.874 ± 0.604
2.725GluGlu: 2.725 ± 0.865
3.747GluPhe: 3.747 ± 0.768
2.725GluGly: 2.725 ± 0.7
0.511GluHis: 0.511 ± 0.232
3.577GluIle: 3.577 ± 0.935
2.214GluLys: 2.214 ± 0.641
5.621GluLeu: 5.621 ± 1.091
1.533GluMet: 1.533 ± 0.33
2.214GluAsn: 2.214 ± 0.614
1.533GluPro: 1.533 ± 0.558
2.385GluGln: 2.385 ± 0.809
4.088GluArg: 4.088 ± 1.071
1.533GluSer: 1.533 ± 0.559
4.429GluThr: 4.429 ± 0.585
2.044GluVal: 2.044 ± 0.51
1.022GluTrp: 1.022 ± 0.429
3.066GluTyr: 3.066 ± 0.657
0.0GluXaa: 0.0 ± 0.0
Phe
4.088PheAla: 4.088 ± 0.727
0.0PheCys: 0.0 ± 0.0
2.385PheAsp: 2.385 ± 0.637
2.896PheGlu: 2.896 ± 0.751
1.363PhePhe: 1.363 ± 0.46
2.044PheGly: 2.044 ± 0.504
0.511PheHis: 0.511 ± 0.246
3.066PheIle: 3.066 ± 0.907
2.555PheLys: 2.555 ± 0.606
2.555PheLeu: 2.555 ± 0.626
1.363PheMet: 1.363 ± 0.508
2.896PheAsn: 2.896 ± 0.651
1.192PhePro: 1.192 ± 0.34
1.022PheGln: 1.022 ± 0.358
2.044PheArg: 2.044 ± 0.743
0.852PheSer: 0.852 ± 0.285
3.747PheThr: 3.747 ± 0.733
2.725PheVal: 2.725 ± 0.736
0.511PheTrp: 0.511 ± 0.246
1.022PheTyr: 1.022 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
8.687GlyAla: 8.687 ± 1.313
0.0GlyCys: 0.0 ± 0.0
6.813GlyAsp: 6.813 ± 1.855
4.769GlyGlu: 4.769 ± 1.006
3.236GlyPhe: 3.236 ± 0.731
6.472GlyGly: 6.472 ± 1.16
1.874GlyHis: 1.874 ± 0.398
5.621GlyIle: 5.621 ± 1.159
3.577GlyLys: 3.577 ± 0.873
4.258GlyLeu: 4.258 ± 0.93
1.022GlyMet: 1.022 ± 0.356
4.769GlyAsn: 4.769 ± 0.903
2.555GlyPro: 2.555 ± 0.77
2.214GlyGln: 2.214 ± 0.485
4.258GlyArg: 4.258 ± 1.188
4.769GlySer: 4.769 ± 0.966
8.346GlyThr: 8.346 ± 1.104
5.28GlyVal: 5.28 ± 1.007
1.022GlyTrp: 1.022 ± 0.386
3.918GlyTyr: 3.918 ± 0.926
0.0GlyXaa: 0.0 ± 0.0
His
1.192HisAla: 1.192 ± 0.499
0.341HisCys: 0.341 ± 0.221
1.022HisAsp: 1.022 ± 0.399
0.511HisGlu: 0.511 ± 0.304
1.533HisPhe: 1.533 ± 0.477
1.703HisGly: 1.703 ± 0.388
0.341HisHis: 0.341 ± 0.262
0.852HisIle: 0.852 ± 0.425
0.852HisLys: 0.852 ± 0.365
2.044HisLeu: 2.044 ± 0.597
0.341HisMet: 0.341 ± 0.213
0.852HisAsn: 0.852 ± 0.357
0.511HisPro: 0.511 ± 0.374
0.341HisGln: 0.341 ± 0.278
2.044HisArg: 2.044 ± 0.596
0.852HisSer: 0.852 ± 0.469
0.681HisThr: 0.681 ± 0.372
1.363HisVal: 1.363 ± 0.586
0.341HisTrp: 0.341 ± 0.237
0.681HisTyr: 0.681 ± 0.42
0.0HisXaa: 0.0 ± 0.0
Ile
4.94IleAla: 4.94 ± 0.826
0.511IleCys: 0.511 ± 0.304
3.407IleAsp: 3.407 ± 0.676
3.407IleGlu: 3.407 ± 0.659
1.192IlePhe: 1.192 ± 0.393
5.28IleGly: 5.28 ± 0.899
0.681IleHis: 0.681 ± 0.29
2.555IleIle: 2.555 ± 0.573
2.555IleLys: 2.555 ± 0.66
2.214IleLeu: 2.214 ± 0.458
1.703IleMet: 1.703 ± 0.569
4.258IleAsn: 4.258 ± 1.265
2.555IlePro: 2.555 ± 0.604
2.896IleGln: 2.896 ± 0.635
2.555IleArg: 2.555 ± 0.739
2.555IleSer: 2.555 ± 0.649
3.747IleThr: 3.747 ± 0.729
4.258IleVal: 4.258 ± 0.742
0.852IleTrp: 0.852 ± 0.389
1.703IleTyr: 1.703 ± 0.67
0.0IleXaa: 0.0 ± 0.0
Lys
3.236LysAla: 3.236 ± 0.903
0.0LysCys: 0.0 ± 0.0
1.703LysAsp: 1.703 ± 0.449
3.236LysGlu: 3.236 ± 0.785
1.022LysPhe: 1.022 ± 0.388
3.747LysGly: 3.747 ± 1.304
0.511LysHis: 0.511 ± 0.28
1.533LysIle: 1.533 ± 0.579
1.874LysLys: 1.874 ± 0.63
4.088LysLeu: 4.088 ± 1.232
1.533LysMet: 1.533 ± 0.451
1.533LysAsn: 1.533 ± 0.418
2.555LysPro: 2.555 ± 0.569
2.214LysGln: 2.214 ± 0.782
3.407LysArg: 3.407 ± 0.964
1.022LysSer: 1.022 ± 0.422
3.577LysThr: 3.577 ± 0.971
3.066LysVal: 3.066 ± 0.718
1.022LysTrp: 1.022 ± 0.379
0.681LysTyr: 0.681 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
7.835LeuAla: 7.835 ± 0.995
0.17LeuCys: 0.17 ± 0.168
5.11LeuAsp: 5.11 ± 0.973
3.066LeuGlu: 3.066 ± 0.513
3.577LeuPhe: 3.577 ± 0.897
4.429LeuGly: 4.429 ± 1.112
0.852LeuHis: 0.852 ± 0.437
2.725LeuIle: 2.725 ± 0.693
4.258LeuLys: 4.258 ± 0.917
3.066LeuLeu: 3.066 ± 0.6
2.385LeuMet: 2.385 ± 0.609
4.599LeuAsn: 4.599 ± 1.376
4.258LeuPro: 4.258 ± 0.959
3.918LeuGln: 3.918 ± 0.702
2.385LeuArg: 2.385 ± 0.664
4.599LeuSer: 4.599 ± 0.962
5.791LeuThr: 5.791 ± 0.978
4.599LeuVal: 4.599 ± 0.703
1.533LeuTrp: 1.533 ± 0.507
1.703LeuTyr: 1.703 ± 0.581
0.0LeuXaa: 0.0 ± 0.0
Met
3.407MetAla: 3.407 ± 0.554
0.341MetCys: 0.341 ± 0.207
0.681MetAsp: 0.681 ± 0.373
0.681MetGlu: 0.681 ± 0.411
0.852MetPhe: 0.852 ± 0.721
1.703MetGly: 1.703 ± 0.557
0.17MetHis: 0.17 ± 0.23
0.511MetIle: 0.511 ± 0.354
0.681MetLys: 0.681 ± 0.338
2.555MetLeu: 2.555 ± 0.608
0.681MetMet: 0.681 ± 0.378
1.192MetAsn: 1.192 ± 0.411
1.192MetPro: 1.192 ± 0.435
0.511MetGln: 0.511 ± 0.293
1.533MetArg: 1.533 ± 0.689
1.533MetSer: 1.533 ± 0.511
1.874MetThr: 1.874 ± 0.469
1.192MetVal: 1.192 ± 0.4
0.341MetTrp: 0.341 ± 0.259
1.022MetTyr: 1.022 ± 0.419
0.0MetXaa: 0.0 ± 0.0
Asn
6.302AsnAla: 6.302 ± 1.008
0.341AsnCys: 0.341 ± 0.213
3.066AsnAsp: 3.066 ± 0.67
1.533AsnGlu: 1.533 ± 0.596
1.363AsnPhe: 1.363 ± 0.395
5.451AsnGly: 5.451 ± 0.881
0.852AsnHis: 0.852 ± 0.463
3.066AsnIle: 3.066 ± 0.992
1.022AsnLys: 1.022 ± 0.283
3.407AsnLeu: 3.407 ± 0.794
0.511AsnMet: 0.511 ± 0.283
3.747AsnAsn: 3.747 ± 0.83
3.236AsnPro: 3.236 ± 0.689
2.044AsnGln: 2.044 ± 0.522
2.896AsnArg: 2.896 ± 0.668
2.385AsnSer: 2.385 ± 0.547
4.258AsnThr: 4.258 ± 1.207
2.896AsnVal: 2.896 ± 0.662
1.192AsnTrp: 1.192 ± 0.513
2.555AsnTyr: 2.555 ± 0.794
0.0AsnXaa: 0.0 ± 0.0
Pro
5.621ProAla: 5.621 ± 1.295
0.341ProCys: 0.341 ± 0.241
3.918ProAsp: 3.918 ± 0.861
4.599ProGlu: 4.599 ± 0.932
0.341ProPhe: 0.341 ± 0.202
5.451ProGly: 5.451 ± 1.164
1.363ProHis: 1.363 ± 0.521
2.725ProIle: 2.725 ± 0.642
2.214ProLys: 2.214 ± 0.662
3.747ProLeu: 3.747 ± 0.815
0.341ProMet: 0.341 ± 0.243
2.385ProAsn: 2.385 ± 0.833
2.725ProPro: 2.725 ± 1.084
1.192ProGln: 1.192 ± 0.422
1.022ProArg: 1.022 ± 0.362
2.555ProSer: 2.555 ± 0.81
4.088ProThr: 4.088 ± 0.672
3.747ProVal: 3.747 ± 0.813
0.852ProTrp: 0.852 ± 0.279
2.044ProTyr: 2.044 ± 0.444
0.0ProXaa: 0.0 ± 0.0
Gln
5.11GlnAla: 5.11 ± 0.731
0.17GlnCys: 0.17 ± 0.165
2.725GlnAsp: 2.725 ± 0.831
2.385GlnGlu: 2.385 ± 0.815
2.385GlnPhe: 2.385 ± 0.655
2.555GlnGly: 2.555 ± 0.513
0.852GlnHis: 0.852 ± 0.396
1.533GlnIle: 1.533 ± 0.515
0.681GlnLys: 0.681 ± 0.384
3.236GlnLeu: 3.236 ± 0.798
0.511GlnMet: 0.511 ± 0.328
1.192GlnAsn: 1.192 ± 0.453
2.214GlnPro: 2.214 ± 0.619
2.725GlnGln: 2.725 ± 1.133
1.703GlnArg: 1.703 ± 0.426
1.703GlnSer: 1.703 ± 0.553
2.555GlnThr: 2.555 ± 0.601
2.214GlnVal: 2.214 ± 0.53
0.511GlnTrp: 0.511 ± 0.33
1.192GlnTyr: 1.192 ± 0.368
0.0GlnXaa: 0.0 ± 0.0
Arg
4.599ArgAla: 4.599 ± 1.236
0.341ArgCys: 0.341 ± 0.46
2.896ArgAsp: 2.896 ± 0.646
2.896ArgGlu: 2.896 ± 0.71
2.555ArgPhe: 2.555 ± 0.911
3.577ArgGly: 3.577 ± 0.707
1.192ArgHis: 1.192 ± 0.467
2.555ArgIle: 2.555 ± 0.702
2.044ArgLys: 2.044 ± 0.718
4.258ArgLeu: 4.258 ± 1.097
1.703ArgMet: 1.703 ± 0.539
2.725ArgAsn: 2.725 ± 0.705
2.385ArgPro: 2.385 ± 0.552
2.555ArgGln: 2.555 ± 0.889
2.896ArgArg: 2.896 ± 0.969
2.044ArgSer: 2.044 ± 0.516
2.385ArgThr: 2.385 ± 0.643
2.725ArgVal: 2.725 ± 0.753
0.511ArgTrp: 0.511 ± 0.223
2.214ArgTyr: 2.214 ± 0.66
0.0ArgXaa: 0.0 ± 0.0
Ser
6.302SerAla: 6.302 ± 1.199
0.511SerCys: 0.511 ± 0.216
3.407SerAsp: 3.407 ± 0.781
2.385SerGlu: 2.385 ± 0.86
1.363SerPhe: 1.363 ± 0.506
3.407SerGly: 3.407 ± 0.962
0.852SerHis: 0.852 ± 0.397
3.236SerIle: 3.236 ± 0.534
1.022SerLys: 1.022 ± 0.492
4.258SerLeu: 4.258 ± 1.015
1.703SerMet: 1.703 ± 0.482
1.703SerAsn: 1.703 ± 0.653
2.044SerPro: 2.044 ± 0.641
1.192SerGln: 1.192 ± 0.482
2.555SerArg: 2.555 ± 0.729
2.555SerSer: 2.555 ± 0.765
4.94SerThr: 4.94 ± 0.9
4.599SerVal: 4.599 ± 1.302
0.511SerTrp: 0.511 ± 0.231
1.192SerTyr: 1.192 ± 0.331
0.0SerXaa: 0.0 ± 0.0
Thr
10.049ThrAla: 10.049 ± 1.603
0.341ThrCys: 0.341 ± 0.248
4.088ThrAsp: 4.088 ± 0.773
3.066ThrGlu: 3.066 ± 0.677
3.236ThrPhe: 3.236 ± 0.832
8.687ThrGly: 8.687 ± 1.521
1.703ThrHis: 1.703 ± 0.583
4.94ThrIle: 4.94 ± 0.708
2.385ThrLys: 2.385 ± 0.617
6.302ThrLeu: 6.302 ± 1.081
2.044ThrMet: 2.044 ± 0.38
2.725ThrAsn: 2.725 ± 0.684
5.11ThrPro: 5.11 ± 0.96
1.874ThrGln: 1.874 ± 0.425
3.066ThrArg: 3.066 ± 0.749
4.769ThrSer: 4.769 ± 0.812
5.962ThrThr: 5.962 ± 1.309
7.154ThrVal: 7.154 ± 1.314
1.533ThrTrp: 1.533 ± 0.711
4.258ThrTyr: 4.258 ± 0.727
0.0ThrXaa: 0.0 ± 0.0
Val
5.962ValAla: 5.962 ± 0.896
0.341ValCys: 0.341 ± 0.232
5.11ValAsp: 5.11 ± 0.95
2.385ValGlu: 2.385 ± 0.742
2.214ValPhe: 2.214 ± 0.641
5.28ValGly: 5.28 ± 0.7
1.363ValHis: 1.363 ± 0.444
3.236ValIle: 3.236 ± 0.572
2.385ValLys: 2.385 ± 0.562
5.11ValLeu: 5.11 ± 1.103
0.681ValMet: 0.681 ± 0.371
3.066ValAsn: 3.066 ± 0.748
3.407ValPro: 3.407 ± 0.74
3.236ValGln: 3.236 ± 0.868
2.044ValArg: 2.044 ± 0.69
5.451ValSer: 5.451 ± 0.874
7.835ValThr: 7.835 ± 2.189
4.088ValVal: 4.088 ± 0.992
1.192ValTrp: 1.192 ± 0.46
1.363ValTyr: 1.363 ± 0.347
0.0ValXaa: 0.0 ± 0.0
Trp
2.044TrpAla: 2.044 ± 0.644
0.17TrpCys: 0.17 ± 0.17
2.214TrpAsp: 2.214 ± 0.461
0.681TrpGlu: 0.681 ± 0.35
0.341TrpPhe: 0.341 ± 0.362
1.192TrpGly: 1.192 ± 0.386
0.341TrpHis: 0.341 ± 0.237
0.852TrpIle: 0.852 ± 0.399
1.363TrpLys: 1.363 ± 0.46
1.533TrpLeu: 1.533 ± 0.412
0.511TrpMet: 0.511 ± 0.238
0.681TrpAsn: 0.681 ± 0.26
0.852TrpPro: 0.852 ± 0.356
0.681TrpGln: 0.681 ± 0.268
0.341TrpArg: 0.341 ± 0.2
0.681TrpSer: 0.681 ± 0.279
1.022TrpThr: 1.022 ± 0.421
0.681TrpVal: 0.681 ± 0.404
0.17TrpTrp: 0.17 ± 0.168
0.511TrpTyr: 0.511 ± 0.309
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.429TyrAla: 4.429 ± 1.033
0.17TyrCys: 0.17 ± 0.17
2.555TyrAsp: 2.555 ± 0.798
0.681TyrGlu: 0.681 ± 0.307
1.192TyrPhe: 1.192 ± 0.444
3.407TyrGly: 3.407 ± 0.94
1.022TyrHis: 1.022 ± 0.428
1.533TyrIle: 1.533 ± 0.511
1.363TyrLys: 1.363 ± 0.609
2.385TyrLeu: 2.385 ± 0.569
1.192TyrMet: 1.192 ± 0.436
2.044TyrAsn: 2.044 ± 0.432
1.363TyrPro: 1.363 ± 0.537
1.192TyrGln: 1.192 ± 0.392
2.385TyrArg: 2.385 ± 0.567
2.044TyrSer: 2.044 ± 0.692
3.918TyrThr: 3.918 ± 0.837
2.214TyrVal: 2.214 ± 0.615
0.681TyrTrp: 0.681 ± 0.317
1.533TyrTyr: 1.533 ± 0.383
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (5872 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski