Amino acid dipepetide frequency for TM7 phage DolZOral124_53_65

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.407AlaAla: 8.407 ± 1.256
0.168AlaCys: 0.168 ± 0.119
5.969AlaAsp: 5.969 ± 0.693
5.549AlaGlu: 5.549 ± 1.24
2.018AlaPhe: 2.018 ± 0.359
6.978AlaGly: 6.978 ± 0.958
0.673AlaHis: 0.673 ± 0.259
4.708AlaIle: 4.708 ± 0.661
6.221AlaLys: 6.221 ± 0.696
6.978AlaLeu: 6.978 ± 0.848
1.093AlaMet: 1.093 ± 0.241
4.792AlaAsn: 4.792 ± 0.627
4.456AlaPro: 4.456 ± 0.465
5.044AlaGln: 5.044 ± 0.877
4.96AlaArg: 4.96 ± 1.033
4.288AlaSer: 4.288 ± 0.62
4.372AlaThr: 4.372 ± 0.497
4.96AlaVal: 4.96 ± 0.601
1.009AlaTrp: 1.009 ± 0.271
3.867AlaTyr: 3.867 ± 0.444
0.0AlaXaa: 0.0 ± 0.0
Cys
0.336CysAla: 0.336 ± 0.165
0.084CysCys: 0.084 ± 0.093
0.336CysAsp: 0.336 ± 0.143
0.336CysGlu: 0.336 ± 0.148
0.42CysPhe: 0.42 ± 0.232
0.504CysGly: 0.504 ± 0.197
0.168CysHis: 0.168 ± 0.101
0.336CysIle: 0.336 ± 0.17
0.168CysLys: 0.168 ± 0.105
0.168CysLeu: 0.168 ± 0.099
0.168CysMet: 0.168 ± 0.106
0.084CysAsn: 0.084 ± 0.084
0.0CysPro: 0.0 ± 0.0
0.336CysGln: 0.336 ± 0.213
0.757CysArg: 0.757 ± 0.24
0.336CysSer: 0.336 ± 0.193
0.504CysThr: 0.504 ± 0.223
0.42CysVal: 0.42 ± 0.177
0.084CysTrp: 0.084 ± 0.09
0.252CysTyr: 0.252 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
6.137AspAla: 6.137 ± 0.666
0.252AspCys: 0.252 ± 0.14
5.969AspAsp: 5.969 ± 0.756
3.699AspGlu: 3.699 ± 0.886
2.186AspPhe: 2.186 ± 0.386
5.549AspGly: 5.549 ± 0.758
0.757AspHis: 0.757 ± 0.298
3.867AspIle: 3.867 ± 0.49
4.708AspLys: 4.708 ± 0.584
4.792AspLeu: 4.792 ± 0.579
1.765AspMet: 1.765 ± 0.381
2.774AspAsn: 2.774 ± 0.43
2.942AspPro: 2.942 ± 0.543
2.606AspGln: 2.606 ± 0.503
4.288AspArg: 4.288 ± 0.765
4.792AspSer: 4.792 ± 0.67
4.372AspThr: 4.372 ± 0.649
4.288AspVal: 4.288 ± 0.694
0.925AspTrp: 0.925 ± 0.322
3.447AspTyr: 3.447 ± 0.635
0.0AspXaa: 0.0 ± 0.0
Glu
6.053GluAla: 6.053 ± 1.13
0.42GluCys: 0.42 ± 0.151
2.942GluAsp: 2.942 ± 0.611
4.54GluGlu: 4.54 ± 1.233
2.438GluPhe: 2.438 ± 0.513
2.522GluGly: 2.522 ± 0.575
0.925GluHis: 0.925 ± 0.243
3.783GluIle: 3.783 ± 0.571
5.549GluLys: 5.549 ± 1.217
4.708GluLeu: 4.708 ± 0.71
0.841GluMet: 0.841 ± 0.258
3.363GluAsn: 3.363 ± 0.596
2.27GluPro: 2.27 ± 0.456
2.606GluGln: 2.606 ± 0.653
4.203GluArg: 4.203 ± 0.682
3.279GluSer: 3.279 ± 0.553
2.438GluThr: 2.438 ± 0.499
3.951GluVal: 3.951 ± 0.542
1.009GluTrp: 1.009 ± 0.353
1.85GluTyr: 1.85 ± 0.425
0.0GluXaa: 0.0 ± 0.0
Phe
2.69PheAla: 2.69 ± 0.586
0.084PheCys: 0.084 ± 0.08
3.615PheAsp: 3.615 ± 0.519
1.261PheGlu: 1.261 ± 0.31
0.673PhePhe: 0.673 ± 0.224
2.69PheGly: 2.69 ± 0.612
0.841PheHis: 0.841 ± 0.28
1.934PheIle: 1.934 ± 0.404
1.934PheLys: 1.934 ± 0.336
1.681PheLeu: 1.681 ± 0.353
0.757PheMet: 0.757 ± 0.245
1.429PheAsn: 1.429 ± 0.366
0.925PhePro: 0.925 ± 0.379
1.093PheGln: 1.093 ± 0.279
2.186PheArg: 2.186 ± 0.488
1.765PheSer: 1.765 ± 0.359
1.934PheThr: 1.934 ± 0.372
2.438PheVal: 2.438 ± 0.581
0.841PheTrp: 0.841 ± 0.266
1.009PheTyr: 1.009 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
5.801GlyAla: 5.801 ± 0.924
0.504GlyCys: 0.504 ± 0.234
5.38GlyAsp: 5.38 ± 0.691
4.035GlyGlu: 4.035 ± 0.485
2.69GlyPhe: 2.69 ± 0.607
8.827GlyGly: 8.827 ± 1.325
1.681GlyHis: 1.681 ± 0.371
3.447GlyIle: 3.447 ± 0.58
3.867GlyLys: 3.867 ± 0.638
4.96GlyLeu: 4.96 ± 0.641
2.522GlyMet: 2.522 ± 0.473
3.783GlyAsn: 3.783 ± 0.538
1.429GlyPro: 1.429 ± 0.394
3.363GlyGln: 3.363 ± 0.605
3.531GlyArg: 3.531 ± 0.506
5.38GlySer: 5.38 ± 0.707
6.894GlyThr: 6.894 ± 1.111
5.38GlyVal: 5.38 ± 0.665
0.673GlyTrp: 0.673 ± 0.258
2.606GlyTyr: 2.606 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
1.597HisAla: 1.597 ± 0.458
0.084HisCys: 0.084 ± 0.072
1.765HisAsp: 1.765 ± 0.305
0.841HisGlu: 0.841 ± 0.269
0.336HisPhe: 0.336 ± 0.162
2.102HisGly: 2.102 ± 0.373
0.252HisHis: 0.252 ± 0.181
0.841HisIle: 0.841 ± 0.275
0.925HisLys: 0.925 ± 0.275
0.757HisLeu: 0.757 ± 0.283
0.252HisMet: 0.252 ± 0.149
1.009HisAsn: 1.009 ± 0.263
1.009HisPro: 1.009 ± 0.273
0.673HisGln: 0.673 ± 0.257
1.009HisArg: 1.009 ± 0.303
0.757HisSer: 0.757 ± 0.239
0.504HisThr: 0.504 ± 0.207
0.841HisVal: 0.841 ± 0.276
0.336HisTrp: 0.336 ± 0.179
1.009HisTyr: 1.009 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
4.96IleAla: 4.96 ± 0.692
0.084IleCys: 0.084 ± 0.076
5.296IleAsp: 5.296 ± 0.513
4.372IleGlu: 4.372 ± 0.702
1.261IlePhe: 1.261 ± 0.308
4.624IleGly: 4.624 ± 0.797
1.093IleHis: 1.093 ± 0.258
2.354IleIle: 2.354 ± 0.486
3.783IleLys: 3.783 ± 0.595
3.699IleLeu: 3.699 ± 0.525
1.009IleMet: 1.009 ± 0.307
3.195IleAsn: 3.195 ± 0.508
2.186IlePro: 2.186 ± 0.361
1.934IleGln: 1.934 ± 0.358
4.203IleArg: 4.203 ± 0.666
3.783IleSer: 3.783 ± 0.486
3.699IleThr: 3.699 ± 0.671
3.951IleVal: 3.951 ± 0.535
0.504IleTrp: 0.504 ± 0.211
1.765IleTyr: 1.765 ± 0.395
0.0IleXaa: 0.0 ± 0.0
Lys
5.128LysAla: 5.128 ± 0.89
0.42LysCys: 0.42 ± 0.193
3.699LysAsp: 3.699 ± 0.523
3.531LysGlu: 3.531 ± 0.747
1.681LysPhe: 1.681 ± 0.384
3.195LysGly: 3.195 ± 0.607
1.681LysHis: 1.681 ± 0.43
4.035LysIle: 4.035 ± 0.561
4.54LysLys: 4.54 ± 0.779
4.624LysLeu: 4.624 ± 0.574
1.934LysMet: 1.934 ± 0.462
3.363LysAsn: 3.363 ± 0.671
2.354LysPro: 2.354 ± 0.668
2.438LysGln: 2.438 ± 0.456
4.288LysArg: 4.288 ± 0.647
3.447LysSer: 3.447 ± 0.525
3.615LysThr: 3.615 ± 0.524
2.942LysVal: 2.942 ± 0.487
1.177LysTrp: 1.177 ± 0.303
2.354LysTyr: 2.354 ± 0.433
0.0LysXaa: 0.0 ± 0.0
Leu
6.305LeuAla: 6.305 ± 0.847
0.336LeuCys: 0.336 ± 0.157
4.119LeuAsp: 4.119 ± 0.566
3.699LeuGlu: 3.699 ± 0.609
2.186LeuPhe: 2.186 ± 0.385
5.549LeuGly: 5.549 ± 0.701
1.009LeuHis: 1.009 ± 0.327
3.699LeuIle: 3.699 ± 0.662
3.951LeuLys: 3.951 ± 0.647
4.96LeuLeu: 4.96 ± 0.747
2.186LeuMet: 2.186 ± 0.431
3.195LeuAsn: 3.195 ± 0.413
4.035LeuPro: 4.035 ± 0.536
3.195LeuGln: 3.195 ± 0.563
3.447LeuArg: 3.447 ± 0.521
5.296LeuSer: 5.296 ± 0.58
4.288LeuThr: 4.288 ± 0.697
4.876LeuVal: 4.876 ± 0.682
0.673LeuTrp: 0.673 ± 0.256
2.69LeuTyr: 2.69 ± 0.39
0.0LeuXaa: 0.0 ± 0.0
Met
2.858MetAla: 2.858 ± 0.434
0.168MetCys: 0.168 ± 0.105
1.513MetAsp: 1.513 ± 0.325
0.757MetGlu: 0.757 ± 0.223
1.009MetPhe: 1.009 ± 0.339
1.429MetGly: 1.429 ± 0.281
0.504MetHis: 0.504 ± 0.203
1.513MetIle: 1.513 ± 0.374
1.261MetLys: 1.261 ± 0.335
0.841MetLeu: 0.841 ± 0.252
1.345MetMet: 1.345 ± 0.39
1.429MetAsn: 1.429 ± 0.382
1.934MetPro: 1.934 ± 0.49
1.093MetGln: 1.093 ± 0.313
1.513MetArg: 1.513 ± 0.427
1.597MetSer: 1.597 ± 0.456
1.681MetThr: 1.681 ± 0.354
1.093MetVal: 1.093 ± 0.428
0.084MetTrp: 0.084 ± 0.097
0.673MetTyr: 0.673 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
3.447AsnAla: 3.447 ± 0.654
0.757AsnCys: 0.757 ± 0.354
3.195AsnAsp: 3.195 ± 0.58
2.27AsnGlu: 2.27 ± 0.422
1.177AsnPhe: 1.177 ± 0.265
4.456AsnGly: 4.456 ± 0.838
1.009AsnHis: 1.009 ± 0.283
3.026AsnIle: 3.026 ± 0.5
2.858AsnLys: 2.858 ± 0.457
2.69AsnLeu: 2.69 ± 0.559
1.261AsnMet: 1.261 ± 0.325
1.597AsnAsn: 1.597 ± 0.437
3.026AsnPro: 3.026 ± 0.521
2.354AsnGln: 2.354 ± 0.507
2.522AsnArg: 2.522 ± 0.376
4.624AsnSer: 4.624 ± 0.704
3.531AsnThr: 3.531 ± 0.604
2.438AsnVal: 2.438 ± 0.491
0.841AsnTrp: 0.841 ± 0.305
1.934AsnTyr: 1.934 ± 0.415
0.0AsnXaa: 0.0 ± 0.0
Pro
4.203ProAla: 4.203 ± 0.546
0.336ProCys: 0.336 ± 0.179
3.279ProAsp: 3.279 ± 0.642
2.69ProGlu: 2.69 ± 0.451
1.597ProPhe: 1.597 ± 0.385
2.522ProGly: 2.522 ± 0.439
0.841ProHis: 0.841 ± 0.275
2.438ProIle: 2.438 ± 0.564
2.69ProLys: 2.69 ± 0.585
2.438ProLeu: 2.438 ± 0.445
1.513ProMet: 1.513 ± 0.418
2.27ProAsn: 2.27 ± 0.515
2.606ProPro: 2.606 ± 0.803
2.018ProGln: 2.018 ± 0.326
2.018ProArg: 2.018 ± 0.383
2.69ProSer: 2.69 ± 0.494
2.774ProThr: 2.774 ± 0.345
3.363ProVal: 3.363 ± 0.5
0.336ProTrp: 0.336 ± 0.223
1.093ProTyr: 1.093 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
3.951GlnAla: 3.951 ± 0.556
0.252GlnCys: 0.252 ± 0.163
2.018GlnAsp: 2.018 ± 0.565
3.111GlnGlu: 3.111 ± 0.663
1.765GlnPhe: 1.765 ± 0.428
2.27GlnGly: 2.27 ± 0.422
0.673GlnHis: 0.673 ± 0.237
2.354GlnIle: 2.354 ± 0.368
2.606GlnLys: 2.606 ± 0.59
3.867GlnLeu: 3.867 ± 0.538
1.009GlnMet: 1.009 ± 0.293
1.513GlnAsn: 1.513 ± 0.396
1.597GlnPro: 1.597 ± 0.395
3.447GlnGln: 3.447 ± 0.909
3.195GlnArg: 3.195 ± 0.592
3.195GlnSer: 3.195 ± 0.445
3.615GlnThr: 3.615 ± 0.499
2.522GlnVal: 2.522 ± 0.502
0.588GlnTrp: 0.588 ± 0.263
1.934GlnTyr: 1.934 ± 0.448
0.0GlnXaa: 0.0 ± 0.0
Arg
5.633ArgAla: 5.633 ± 1.064
0.168ArgCys: 0.168 ± 0.099
3.699ArgAsp: 3.699 ± 0.483
4.54ArgGlu: 4.54 ± 0.698
1.345ArgPhe: 1.345 ± 0.345
3.615ArgGly: 3.615 ± 0.465
1.261ArgHis: 1.261 ± 0.269
4.035ArgIle: 4.035 ± 0.687
3.783ArgLys: 3.783 ± 0.571
4.119ArgLeu: 4.119 ± 0.593
1.681ArgMet: 1.681 ± 0.424
1.765ArgAsn: 1.765 ± 0.38
2.354ArgPro: 2.354 ± 0.443
2.858ArgGln: 2.858 ± 0.565
3.111ArgArg: 3.111 ± 0.791
2.522ArgSer: 2.522 ± 0.623
2.942ArgThr: 2.942 ± 0.382
3.699ArgVal: 3.699 ± 0.566
1.345ArgTrp: 1.345 ± 0.28
2.186ArgTyr: 2.186 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
5.549SerAla: 5.549 ± 0.886
0.673SerCys: 0.673 ± 0.223
3.699SerAsp: 3.699 ± 0.437
4.288SerGlu: 4.288 ± 0.857
3.111SerPhe: 3.111 ± 0.556
5.717SerGly: 5.717 ± 0.746
1.177SerHis: 1.177 ± 0.284
4.372SerIle: 4.372 ± 0.619
2.438SerLys: 2.438 ± 0.379
5.464SerLeu: 5.464 ± 0.602
1.093SerMet: 1.093 ± 0.232
2.942SerAsn: 2.942 ± 0.563
2.27SerPro: 2.27 ± 0.403
3.279SerGln: 3.279 ± 0.421
2.522SerArg: 2.522 ± 0.441
4.54SerSer: 4.54 ± 0.773
4.54SerThr: 4.54 ± 0.875
4.792SerVal: 4.792 ± 0.679
0.925SerTrp: 0.925 ± 0.246
2.102SerTyr: 2.102 ± 0.44
0.0SerXaa: 0.0 ± 0.0
Thr
5.296ThrAla: 5.296 ± 0.834
0.252ThrCys: 0.252 ± 0.127
4.54ThrAsp: 4.54 ± 0.602
2.606ThrGlu: 2.606 ± 0.462
2.186ThrPhe: 2.186 ± 0.437
6.473ThrGly: 6.473 ± 0.821
0.841ThrHis: 0.841 ± 0.239
4.288ThrIle: 4.288 ± 0.682
3.026ThrLys: 3.026 ± 0.518
4.876ThrLeu: 4.876 ± 0.708
1.177ThrMet: 1.177 ± 0.301
3.951ThrAsn: 3.951 ± 0.45
3.699ThrPro: 3.699 ± 0.609
2.522ThrGln: 2.522 ± 0.455
2.858ThrArg: 2.858 ± 0.49
4.372ThrSer: 4.372 ± 0.709
5.38ThrThr: 5.38 ± 0.841
2.942ThrVal: 2.942 ± 0.527
0.42ThrTrp: 0.42 ± 0.193
3.111ThrTyr: 3.111 ± 0.636
0.0ThrXaa: 0.0 ± 0.0
Val
5.044ValAla: 5.044 ± 0.687
0.252ValCys: 0.252 ± 0.132
5.549ValAsp: 5.549 ± 0.643
4.792ValGlu: 4.792 ± 0.639
1.85ValPhe: 1.85 ± 0.484
3.615ValGly: 3.615 ± 0.514
0.588ValHis: 0.588 ± 0.216
3.783ValIle: 3.783 ± 0.663
3.195ValLys: 3.195 ± 0.413
4.876ValLeu: 4.876 ± 0.646
1.345ValMet: 1.345 ± 0.328
3.026ValAsn: 3.026 ± 0.405
3.111ValPro: 3.111 ± 0.499
1.934ValGln: 1.934 ± 0.407
3.026ValArg: 3.026 ± 0.544
5.801ValSer: 5.801 ± 0.958
4.203ValThr: 4.203 ± 0.631
4.119ValVal: 4.119 ± 0.64
0.42ValTrp: 0.42 ± 0.167
2.354ValTyr: 2.354 ± 0.439
0.0ValXaa: 0.0 ± 0.0
Trp
0.841TrpAla: 0.841 ± 0.241
0.252TrpCys: 0.252 ± 0.114
0.925TrpAsp: 0.925 ± 0.265
0.757TrpGlu: 0.757 ± 0.237
0.588TrpPhe: 0.588 ± 0.273
0.673TrpGly: 0.673 ± 0.247
0.336TrpHis: 0.336 ± 0.164
0.588TrpIle: 0.588 ± 0.223
0.588TrpLys: 0.588 ± 0.253
1.513TrpLeu: 1.513 ± 0.343
0.084TrpMet: 0.084 ± 0.08
1.093TrpAsn: 1.093 ± 0.335
0.336TrpPro: 0.336 ± 0.164
0.757TrpGln: 0.757 ± 0.333
0.588TrpArg: 0.588 ± 0.25
0.673TrpSer: 0.673 ± 0.21
0.841TrpThr: 0.841 ± 0.274
0.925TrpVal: 0.925 ± 0.279
0.42TrpTrp: 0.42 ± 0.217
0.673TrpTyr: 0.673 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.27TyrAla: 2.27 ± 0.397
0.252TyrCys: 0.252 ± 0.142
2.606TyrAsp: 2.606 ± 0.514
2.018TyrGlu: 2.018 ± 0.387
1.261TyrPhe: 1.261 ± 0.265
3.531TyrGly: 3.531 ± 0.499
0.588TyrHis: 0.588 ± 0.22
2.354TyrIle: 2.354 ± 0.434
2.354TyrLys: 2.354 ± 0.486
1.681TyrLeu: 1.681 ± 0.281
1.177TyrMet: 1.177 ± 0.32
2.438TyrAsn: 2.438 ± 0.504
1.177TyrPro: 1.177 ± 0.279
1.934TyrGln: 1.934 ± 0.336
2.438TyrArg: 2.438 ± 0.448
2.354TyrSer: 2.354 ± 0.478
2.69TyrThr: 2.69 ± 0.554
2.942TyrVal: 2.942 ± 0.623
0.841TyrTrp: 0.841 ± 0.252
1.85TyrTyr: 1.85 ± 0.376
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (11896 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski