Amino acid dipepetide frequency for Microbacterium phage TeddyBoy

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
23.028AlaAla: 23.028 ± 2.868
0.181AlaCys: 0.181 ± 0.147
8.16AlaAsp: 8.16 ± 1.388
8.522AlaGlu: 8.522 ± 1.441
5.258AlaPhe: 5.258 ± 0.701
16.863AlaGly: 16.863 ± 1.828
1.632AlaHis: 1.632 ± 0.591
5.621AlaIle: 5.621 ± 0.973
4.17AlaLys: 4.17 ± 0.874
12.874AlaLeu: 12.874 ± 1.749
3.626AlaMet: 3.626 ± 1.162
2.357AlaAsn: 2.357 ± 0.876
8.522AlaPro: 8.522 ± 1.306
4.533AlaGln: 4.533 ± 1.151
8.341AlaArg: 8.341 ± 1.199
5.621AlaSer: 5.621 ± 1.105
6.709AlaThr: 6.709 ± 1.29
8.522AlaVal: 8.522 ± 1.759
1.995AlaTrp: 1.995 ± 0.521
2.72AlaTyr: 2.72 ± 0.708
0.0AlaXaa: 0.0 ± 0.0
Cys
0.181CysAla: 0.181 ± 0.197
0.0CysCys: 0.0 ± 0.0
0.181CysAsp: 0.181 ± 0.149
0.181CysGlu: 0.181 ± 0.149
0.181CysPhe: 0.181 ± 0.149
0.181CysGly: 0.181 ± 0.216
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.181CysLys: 0.181 ± 0.183
0.363CysLeu: 0.363 ± 0.276
0.0CysMet: 0.0 ± 0.0
0.363CysAsn: 0.363 ± 0.244
0.0CysPro: 0.0 ± 0.0
0.181CysGln: 0.181 ± 0.216
0.363CysArg: 0.363 ± 0.305
0.0CysSer: 0.0 ± 0.0
0.544CysThr: 0.544 ± 0.436
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.363CysTyr: 0.363 ± 0.243
0.0CysXaa: 0.0 ± 0.0
Asp
10.154AspAla: 10.154 ± 1.568
0.181AspCys: 0.181 ± 0.208
3.445AspAsp: 3.445 ± 1.044
3.264AspGlu: 3.264 ± 1.148
0.725AspPhe: 0.725 ± 0.515
5.44AspGly: 5.44 ± 1.044
1.269AspHis: 1.269 ± 0.508
0.363AspIle: 0.363 ± 0.226
0.181AspLys: 0.181 ± 0.192
7.797AspLeu: 7.797 ± 1.655
0.544AspMet: 0.544 ± 0.337
1.088AspAsn: 1.088 ± 0.381
2.901AspPro: 2.901 ± 0.733
1.269AspGln: 1.269 ± 0.643
3.445AspArg: 3.445 ± 1.046
2.357AspSer: 2.357 ± 0.583
2.901AspThr: 2.901 ± 0.902
4.896AspVal: 4.896 ± 1.127
1.088AspTrp: 1.088 ± 0.432
1.451AspTyr: 1.451 ± 0.512
0.0AspXaa: 0.0 ± 0.0
Glu
6.89GluAla: 6.89 ± 1.361
0.181GluCys: 0.181 ± 0.216
2.72GluAsp: 2.72 ± 0.662
0.725GluGlu: 0.725 ± 0.483
0.725GluPhe: 0.725 ± 0.314
3.989GluGly: 3.989 ± 0.877
0.363GluHis: 0.363 ± 0.268
3.989GluIle: 3.989 ± 0.735
0.363GluLys: 0.363 ± 0.239
8.522GluLeu: 8.522 ± 1.2
0.363GluMet: 0.363 ± 0.263
3.626GluAsn: 3.626 ± 0.906
1.632GluPro: 1.632 ± 0.706
3.445GluGln: 3.445 ± 0.853
5.077GluArg: 5.077 ± 1.589
3.264GluSer: 3.264 ± 0.692
3.445GluThr: 3.445 ± 0.811
2.357GluVal: 2.357 ± 0.528
1.088GluTrp: 1.088 ± 0.522
1.451GluTyr: 1.451 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
4.17PheAla: 4.17 ± 0.974
0.0PheCys: 0.0 ± 0.0
2.357PheAsp: 2.357 ± 0.594
1.269PheGlu: 1.269 ± 0.645
0.181PhePhe: 0.181 ± 0.147
3.989PheGly: 3.989 ± 0.918
0.181PheHis: 0.181 ± 0.197
1.088PheIle: 1.088 ± 0.322
0.544PheLys: 0.544 ± 0.241
2.72PheLeu: 2.72 ± 0.697
0.907PheMet: 0.907 ± 0.299
1.451PheAsn: 1.451 ± 0.469
0.725PhePro: 0.725 ± 0.29
1.088PheGln: 1.088 ± 0.561
1.632PheArg: 1.632 ± 0.476
0.907PheSer: 0.907 ± 0.459
3.445PheThr: 3.445 ± 0.856
2.539PheVal: 2.539 ± 0.555
0.181PheTrp: 0.181 ± 0.211
0.544PheTyr: 0.544 ± 0.357
0.0PheXaa: 0.0 ± 0.0
Gly
11.967GlyAla: 11.967 ± 2.744
0.181GlyCys: 0.181 ± 0.183
4.714GlyAsp: 4.714 ± 1.251
3.626GlyGlu: 3.626 ± 0.856
2.72GlyPhe: 2.72 ± 0.472
6.709GlyGly: 6.709 ± 1.307
1.088GlyHis: 1.088 ± 0.487
5.44GlyIle: 5.44 ± 0.94
2.357GlyLys: 2.357 ± 0.554
7.978GlyLeu: 7.978 ± 2.316
1.632GlyMet: 1.632 ± 0.432
2.539GlyAsn: 2.539 ± 0.797
2.901GlyPro: 2.901 ± 1.178
3.808GlyGln: 3.808 ± 0.962
6.346GlyArg: 6.346 ± 0.816
7.797GlySer: 7.797 ± 1.19
4.17GlyThr: 4.17 ± 0.818
9.791GlyVal: 9.791 ± 1.349
2.357GlyTrp: 2.357 ± 0.673
1.995GlyTyr: 1.995 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
1.813HisAla: 1.813 ± 0.406
0.0HisCys: 0.0 ± 0.0
1.451HisAsp: 1.451 ± 0.505
0.181HisGlu: 0.181 ± 0.209
0.544HisPhe: 0.544 ± 0.358
1.632HisGly: 1.632 ± 0.691
0.181HisHis: 0.181 ± 0.209
0.181HisIle: 0.181 ± 0.149
0.363HisLys: 0.363 ± 0.261
1.088HisLeu: 1.088 ± 0.4
0.181HisMet: 0.181 ± 0.147
0.363HisAsn: 0.363 ± 0.268
0.725HisPro: 0.725 ± 0.34
0.181HisGln: 0.181 ± 0.149
1.269HisArg: 1.269 ± 0.547
0.725HisSer: 0.725 ± 0.332
1.088HisThr: 1.088 ± 0.469
2.357HisVal: 2.357 ± 0.679
0.363HisTrp: 0.363 ± 0.224
0.907HisTyr: 0.907 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
4.533IleAla: 4.533 ± 0.875
0.181IleCys: 0.181 ± 0.193
3.808IleAsp: 3.808 ± 0.731
5.621IleGlu: 5.621 ± 1.12
0.725IlePhe: 0.725 ± 0.324
3.626IleGly: 3.626 ± 0.871
1.451IleHis: 1.451 ± 0.471
0.907IleIle: 0.907 ± 0.478
0.544IleLys: 0.544 ± 0.257
3.445IleLeu: 3.445 ± 1.407
0.181IleMet: 0.181 ± 0.192
0.725IleAsn: 0.725 ± 0.331
2.901IlePro: 2.901 ± 1.115
1.088IleGln: 1.088 ± 0.454
3.808IleArg: 3.808 ± 0.633
3.989IleSer: 3.989 ± 0.797
2.901IleThr: 2.901 ± 0.66
3.808IleVal: 3.808 ± 0.825
0.363IleTrp: 0.363 ± 0.222
0.363IleTyr: 0.363 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
1.632LysAla: 1.632 ± 0.43
0.0LysCys: 0.0 ± 0.0
1.269LysAsp: 1.269 ± 0.597
0.544LysGlu: 0.544 ± 0.306
0.725LysPhe: 0.725 ± 0.524
1.088LysGly: 1.088 ± 0.388
0.363LysHis: 0.363 ± 0.229
1.269LysIle: 1.269 ± 0.478
0.544LysLys: 0.544 ± 0.323
1.632LysLeu: 1.632 ± 0.651
0.544LysMet: 0.544 ± 0.255
0.907LysAsn: 0.907 ± 0.516
0.544LysPro: 0.544 ± 0.443
0.181LysGln: 0.181 ± 0.182
2.176LysArg: 2.176 ± 0.89
2.176LysSer: 2.176 ± 0.631
1.995LysThr: 1.995 ± 0.594
0.544LysVal: 0.544 ± 0.489
0.363LysTrp: 0.363 ± 0.258
0.363LysTyr: 0.363 ± 0.224
0.0LysXaa: 0.0 ± 0.0
Leu
9.61LeuAla: 9.61 ± 1.176
0.544LeuCys: 0.544 ± 0.327
6.165LeuAsp: 6.165 ± 1.104
2.176LeuGlu: 2.176 ± 0.543
3.808LeuPhe: 3.808 ± 1.06
8.341LeuGly: 8.341 ± 1.037
1.632LeuHis: 1.632 ± 0.599
4.714LeuIle: 4.714 ± 1.449
1.088LeuLys: 1.088 ± 0.483
7.434LeuLeu: 7.434 ± 2.279
2.539LeuMet: 2.539 ± 0.677
3.626LeuAsn: 3.626 ± 1.057
7.978LeuPro: 7.978 ± 1.627
5.621LeuGln: 5.621 ± 0.958
6.709LeuArg: 6.709 ± 0.942
5.802LeuSer: 5.802 ± 1.055
8.341LeuThr: 8.341 ± 1.275
6.165LeuVal: 6.165 ± 1.319
0.544LeuTrp: 0.544 ± 0.259
1.813LeuTyr: 1.813 ± 0.643
0.0LeuXaa: 0.0 ± 0.0
Met
2.72MetAla: 2.72 ± 0.591
0.0MetCys: 0.0 ± 0.0
1.269MetAsp: 1.269 ± 0.49
0.725MetGlu: 0.725 ± 0.372
0.544MetPhe: 0.544 ± 0.277
2.357MetGly: 2.357 ± 0.731
0.363MetHis: 0.363 ± 0.268
1.451MetIle: 1.451 ± 0.786
0.181MetLys: 0.181 ± 0.149
0.544MetLeu: 0.544 ± 0.336
0.0MetMet: 0.0 ± 0.0
1.088MetAsn: 1.088 ± 0.444
1.088MetPro: 1.088 ± 0.435
0.363MetGln: 0.363 ± 0.335
1.632MetArg: 1.632 ± 0.889
2.539MetSer: 2.539 ± 0.991
1.632MetThr: 1.632 ± 0.572
1.451MetVal: 1.451 ± 0.528
0.181MetTrp: 0.181 ± 0.149
0.181MetTyr: 0.181 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
7.072AsnAla: 7.072 ± 1.024
0.181AsnCys: 0.181 ± 0.149
0.181AsnAsp: 0.181 ± 0.183
1.088AsnGlu: 1.088 ± 0.305
0.363AsnPhe: 0.363 ± 0.182
3.808AsnGly: 3.808 ± 0.587
0.544AsnHis: 0.544 ± 0.259
0.907AsnIle: 0.907 ± 0.457
0.725AsnLys: 0.725 ± 0.355
2.901AsnLeu: 2.901 ± 0.89
0.544AsnMet: 0.544 ± 0.418
1.269AsnAsn: 1.269 ± 0.479
1.451AsnPro: 1.451 ± 0.478
0.0AsnGln: 0.0 ± 0.0
2.357AsnArg: 2.357 ± 0.668
2.176AsnSer: 2.176 ± 0.588
1.451AsnThr: 1.451 ± 0.465
2.357AsnVal: 2.357 ± 0.688
0.0AsnTrp: 0.0 ± 0.0
0.544AsnTyr: 0.544 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
9.248ProAla: 9.248 ± 1.355
0.363ProCys: 0.363 ± 0.288
2.901ProAsp: 2.901 ± 0.92
2.901ProGlu: 2.901 ± 0.828
1.088ProPhe: 1.088 ± 0.382
3.808ProGly: 3.808 ± 0.858
0.725ProHis: 0.725 ± 0.563
2.176ProIle: 2.176 ± 0.436
1.995ProLys: 1.995 ± 0.793
4.17ProLeu: 4.17 ± 0.917
1.088ProMet: 1.088 ± 0.548
1.451ProAsn: 1.451 ± 0.477
1.632ProPro: 1.632 ± 0.902
1.995ProGln: 1.995 ± 0.544
3.083ProArg: 3.083 ± 0.892
3.445ProSer: 3.445 ± 0.453
3.808ProThr: 3.808 ± 0.792
3.626ProVal: 3.626 ± 0.848
0.363ProTrp: 0.363 ± 0.267
1.451ProTyr: 1.451 ± 0.459
0.0ProXaa: 0.0 ± 0.0
Gln
4.533GlnAla: 4.533 ± 1.272
0.0GlnCys: 0.0 ± 0.0
1.451GlnAsp: 1.451 ± 0.442
1.269GlnGlu: 1.269 ± 0.405
1.813GlnPhe: 1.813 ± 0.552
2.72GlnGly: 2.72 ± 0.603
1.269GlnHis: 1.269 ± 0.444
1.813GlnIle: 1.813 ± 0.56
0.544GlnLys: 0.544 ± 0.306
9.61GlnLeu: 9.61 ± 1.57
0.725GlnMet: 0.725 ± 0.515
1.451GlnAsn: 1.451 ± 0.477
2.901GlnPro: 2.901 ± 0.945
2.539GlnGln: 2.539 ± 0.657
3.445GlnArg: 3.445 ± 0.63
1.632GlnSer: 1.632 ± 0.493
1.269GlnThr: 1.269 ± 0.471
1.632GlnVal: 1.632 ± 0.473
0.181GlnTrp: 0.181 ± 0.168
0.725GlnTyr: 0.725 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
9.066ArgAla: 9.066 ± 1.638
0.544ArgCys: 0.544 ± 0.344
5.258ArgAsp: 5.258 ± 0.929
6.709ArgGlu: 6.709 ± 1.65
2.539ArgPhe: 2.539 ± 0.7
4.533ArgGly: 4.533 ± 0.878
0.907ArgHis: 0.907 ± 0.551
3.264ArgIle: 3.264 ± 0.652
1.269ArgLys: 1.269 ± 0.505
7.253ArgLeu: 7.253 ± 1.097
1.088ArgMet: 1.088 ± 0.393
1.813ArgAsn: 1.813 ± 0.553
2.539ArgPro: 2.539 ± 0.775
4.533ArgGln: 4.533 ± 1.03
8.522ArgArg: 8.522 ± 1.814
2.357ArgSer: 2.357 ± 0.615
2.72ArgThr: 2.72 ± 0.664
5.984ArgVal: 5.984 ± 1.134
2.176ArgTrp: 2.176 ± 0.705
1.995ArgTyr: 1.995 ± 0.427
0.0ArgXaa: 0.0 ± 0.0
Ser
9.066SerAla: 9.066 ± 1.004
0.363SerCys: 0.363 ± 0.386
2.176SerAsp: 2.176 ± 0.503
3.989SerGlu: 3.989 ± 0.885
1.088SerPhe: 1.088 ± 0.345
4.352SerGly: 4.352 ± 0.759
1.269SerHis: 1.269 ± 0.4
3.445SerIle: 3.445 ± 0.807
1.269SerLys: 1.269 ± 0.499
4.17SerLeu: 4.17 ± 1.142
2.176SerMet: 2.176 ± 0.684
1.451SerAsn: 1.451 ± 0.63
2.176SerPro: 2.176 ± 0.647
2.72SerGln: 2.72 ± 0.774
3.626SerArg: 3.626 ± 0.806
4.533SerSer: 4.533 ± 0.977
3.989SerThr: 3.989 ± 0.803
4.714SerVal: 4.714 ± 0.872
1.088SerTrp: 1.088 ± 0.445
1.451SerTyr: 1.451 ± 0.599
0.0SerXaa: 0.0 ± 0.0
Thr
8.704ThrAla: 8.704 ± 1.48
0.0ThrCys: 0.0 ± 0.0
2.357ThrAsp: 2.357 ± 0.709
3.626ThrGlu: 3.626 ± 0.858
3.989ThrPhe: 3.989 ± 0.905
7.072ThrGly: 7.072 ± 1.736
0.907ThrHis: 0.907 ± 0.456
3.264ThrIle: 3.264 ± 1.106
0.725ThrLys: 0.725 ± 0.386
4.533ThrLeu: 4.533 ± 1.217
1.813ThrMet: 1.813 ± 0.665
1.632ThrAsn: 1.632 ± 0.523
4.533ThrPro: 4.533 ± 1.127
1.451ThrGln: 1.451 ± 0.6
4.352ThrArg: 4.352 ± 1.018
2.901ThrSer: 2.901 ± 0.773
5.984ThrThr: 5.984 ± 1.474
5.44ThrVal: 5.44 ± 1.197
1.088ThrTrp: 1.088 ± 0.461
1.088ThrTyr: 1.088 ± 0.457
0.0ThrXaa: 0.0 ± 0.0
Val
10.879ValAla: 10.879 ± 1.49
0.0ValCys: 0.0 ± 0.0
3.445ValAsp: 3.445 ± 0.862
5.44ValGlu: 5.44 ± 1.357
1.269ValPhe: 1.269 ± 0.613
5.802ValGly: 5.802 ± 1.12
0.544ValHis: 0.544 ± 0.296
3.445ValIle: 3.445 ± 1.037
1.269ValLys: 1.269 ± 0.555
4.714ValLeu: 4.714 ± 1.08
1.269ValMet: 1.269 ± 0.709
2.176ValAsn: 2.176 ± 0.962
3.989ValPro: 3.989 ± 0.739
4.533ValGln: 4.533 ± 1.293
6.165ValArg: 6.165 ± 1.377
5.44ValSer: 5.44 ± 1.139
7.072ValThr: 7.072 ± 1.317
5.984ValVal: 5.984 ± 1.461
0.544ValTrp: 0.544 ± 0.332
1.451ValTyr: 1.451 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
1.632TrpAla: 1.632 ± 0.418
0.0TrpCys: 0.0 ± 0.0
0.544TrpAsp: 0.544 ± 0.299
0.725TrpGlu: 0.725 ± 0.38
1.088TrpPhe: 1.088 ± 0.335
0.725TrpGly: 0.725 ± 0.244
0.907TrpHis: 0.907 ± 0.461
1.269TrpIle: 1.269 ± 0.46
0.0TrpLys: 0.0 ± 0.0
2.357TrpLeu: 2.357 ± 0.685
0.181TrpMet: 0.181 ± 0.208
0.363TrpAsn: 0.363 ± 0.224
0.363TrpPro: 0.363 ± 0.299
0.725TrpGln: 0.725 ± 0.347
0.544TrpArg: 0.544 ± 0.338
0.725TrpSer: 0.725 ± 0.367
1.451TrpThr: 1.451 ± 0.596
0.363TrpVal: 0.363 ± 0.263
0.0TrpTrp: 0.0 ± 0.0
0.363TrpTyr: 0.363 ± 0.226
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.357TyrAla: 2.357 ± 0.696
0.363TyrCys: 0.363 ± 0.236
0.907TyrAsp: 0.907 ± 0.378
1.632TyrGlu: 1.632 ± 0.443
0.544TyrPhe: 0.544 ± 0.343
3.083TyrGly: 3.083 ± 0.621
0.0TyrHis: 0.0 ± 0.0
0.363TyrIle: 0.363 ± 0.294
0.544TyrLys: 0.544 ± 0.307
0.544TyrLeu: 0.544 ± 0.302
0.725TyrMet: 0.725 ± 0.449
0.363TyrAsn: 0.363 ± 0.235
1.813TyrPro: 1.813 ± 0.592
1.269TyrGln: 1.269 ± 0.474
2.176TyrArg: 2.176 ± 0.74
0.725TyrSer: 0.725 ± 0.417
0.544TyrThr: 0.544 ± 0.338
2.901TyrVal: 2.901 ± 0.713
0.363TyrTrp: 0.363 ± 0.294
0.725TyrTyr: 0.725 ± 0.374
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (5516 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski