Amino acid dipepetide frequency for Nonlabens phage P12024S

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.404AlaAla: 6.404 ± 1.634
0.361AlaCys: 0.361 ± 0.184
3.608AlaAsp: 3.608 ± 0.658
4.24AlaGlu: 4.24 ± 0.877
2.345AlaPhe: 2.345 ± 0.497
3.247AlaGly: 3.247 ± 0.692
0.992AlaHis: 0.992 ± 0.324
5.322AlaIle: 5.322 ± 0.665
5.593AlaLys: 5.593 ± 0.884
5.773AlaLeu: 5.773 ± 0.873
1.624AlaMet: 1.624 ± 0.342
3.157AlaAsn: 3.157 ± 0.485
1.263AlaPro: 1.263 ± 0.238
2.345AlaGln: 2.345 ± 0.535
1.984AlaArg: 1.984 ± 0.511
2.706AlaSer: 2.706 ± 0.553
4.691AlaThr: 4.691 ± 0.924
3.428AlaVal: 3.428 ± 0.512
0.631AlaTrp: 0.631 ± 0.323
3.067AlaTyr: 3.067 ± 0.478
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.082
0.0CysCys: 0.0 ± 0.0
0.631CysAsp: 0.631 ± 0.227
1.173CysGlu: 1.173 ± 0.313
0.18CysPhe: 0.18 ± 0.139
0.902CysGly: 0.902 ± 0.27
0.271CysHis: 0.271 ± 0.133
0.722CysIle: 0.722 ± 0.279
1.263CysLys: 1.263 ± 0.326
0.541CysLeu: 0.541 ± 0.242
0.09CysMet: 0.09 ± 0.085
0.812CysAsn: 0.812 ± 0.283
0.361CysPro: 0.361 ± 0.157
0.18CysGln: 0.18 ± 0.133
0.271CysArg: 0.271 ± 0.161
0.902CysSer: 0.902 ± 0.285
0.361CysThr: 0.361 ± 0.149
0.18CysVal: 0.18 ± 0.109
0.09CysTrp: 0.09 ± 0.08
0.541CysTyr: 0.541 ± 0.252
0.0CysXaa: 0.0 ± 0.0
Asp
3.879AspAla: 3.879 ± 0.51
0.631AspCys: 0.631 ± 0.253
3.247AspAsp: 3.247 ± 0.59
5.051AspGlu: 5.051 ± 0.634
3.247AspPhe: 3.247 ± 0.676
4.781AspGly: 4.781 ± 0.81
0.631AspHis: 0.631 ± 0.244
5.232AspIle: 5.232 ± 0.724
5.593AspLys: 5.593 ± 0.606
8.118AspLeu: 8.118 ± 0.756
1.804AspMet: 1.804 ± 0.399
3.608AspAsn: 3.608 ± 0.686
1.984AspPro: 1.984 ± 0.417
1.804AspGln: 1.804 ± 0.323
2.526AspArg: 2.526 ± 0.464
4.6AspSer: 4.6 ± 0.656
3.518AspThr: 3.518 ± 0.646
4.42AspVal: 4.42 ± 0.749
1.714AspTrp: 1.714 ± 0.371
2.255AspTyr: 2.255 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
3.969GluAla: 3.969 ± 0.796
0.992GluCys: 0.992 ± 0.316
4.51GluAsp: 4.51 ± 0.479
5.863GluGlu: 5.863 ± 1.294
4.149GluPhe: 4.149 ± 0.631
3.067GluGly: 3.067 ± 0.529
0.541GluHis: 0.541 ± 0.198
5.232GluIle: 5.232 ± 0.773
5.412GluLys: 5.412 ± 1.12
7.307GluLeu: 7.307 ± 1.087
1.894GluMet: 1.894 ± 0.473
3.247GluAsn: 3.247 ± 0.49
2.436GluPro: 2.436 ± 0.462
2.977GluGln: 2.977 ± 0.482
3.428GluArg: 3.428 ± 0.483
3.608GluSer: 3.608 ± 0.504
4.33GluThr: 4.33 ± 0.696
4.149GluVal: 4.149 ± 0.674
1.263GluTrp: 1.263 ± 0.341
3.157GluTyr: 3.157 ± 0.663
0.0GluXaa: 0.0 ± 0.0
Phe
2.616PheAla: 2.616 ± 0.365
0.451PheCys: 0.451 ± 0.171
4.24PheAsp: 4.24 ± 0.571
4.42PheGlu: 4.42 ± 0.741
1.443PhePhe: 1.443 ± 0.324
2.887PheGly: 2.887 ± 0.503
0.541PheHis: 0.541 ± 0.188
2.977PheIle: 2.977 ± 0.509
4.059PheLys: 4.059 ± 0.643
3.789PheLeu: 3.789 ± 0.698
1.082PheMet: 1.082 ± 0.25
3.698PheAsn: 3.698 ± 0.455
1.173PhePro: 1.173 ± 0.328
1.173PheGln: 1.173 ± 0.284
1.533PheArg: 1.533 ± 0.271
3.067PheSer: 3.067 ± 0.517
4.33PheThr: 4.33 ± 0.881
1.353PheVal: 1.353 ± 0.329
0.271PheTrp: 0.271 ± 0.143
1.263PheTyr: 1.263 ± 0.325
0.0PheXaa: 0.0 ± 0.0
Gly
3.608GlyAla: 3.608 ± 0.619
0.722GlyCys: 0.722 ± 0.253
2.706GlyAsp: 2.706 ± 0.538
3.067GlyGlu: 3.067 ± 0.64
3.247GlyPhe: 3.247 ± 0.495
5.142GlyGly: 5.142 ± 0.746
0.812GlyHis: 0.812 ± 0.278
4.33GlyIle: 4.33 ± 0.565
5.142GlyLys: 5.142 ± 0.629
5.051GlyLeu: 5.051 ± 0.557
0.722GlyMet: 0.722 ± 0.251
3.789GlyAsn: 3.789 ± 0.631
0.18GlyPro: 0.18 ± 0.122
1.624GlyGln: 1.624 ± 0.348
2.616GlyArg: 2.616 ± 0.379
4.6GlySer: 4.6 ± 0.792
4.6GlyThr: 4.6 ± 0.712
5.683GlyVal: 5.683 ± 0.587
0.902GlyTrp: 0.902 ± 0.252
3.247GlyTyr: 3.247 ± 0.558
0.0GlyXaa: 0.0 ± 0.0
His
0.271HisAla: 0.271 ± 0.162
0.271HisCys: 0.271 ± 0.213
0.992HisAsp: 0.992 ± 0.305
0.271HisGlu: 0.271 ± 0.148
0.631HisPhe: 0.631 ± 0.241
0.541HisGly: 0.541 ± 0.232
0.09HisHis: 0.09 ± 0.076
1.082HisIle: 1.082 ± 0.329
1.173HisLys: 1.173 ± 0.331
1.714HisLeu: 1.714 ± 0.455
0.0HisMet: 0.0 ± 0.0
0.722HisAsn: 0.722 ± 0.237
0.451HisPro: 0.451 ± 0.213
0.361HisGln: 0.361 ± 0.174
0.992HisArg: 0.992 ± 0.349
0.451HisSer: 0.451 ± 0.234
1.173HisThr: 1.173 ± 0.358
0.992HisVal: 0.992 ± 0.332
0.18HisTrp: 0.18 ± 0.146
0.992HisTyr: 0.992 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
4.871IleAla: 4.871 ± 0.801
0.631IleCys: 0.631 ± 0.256
7.487IleAsp: 7.487 ± 0.706
6.675IleGlu: 6.675 ± 0.719
2.616IlePhe: 2.616 ± 0.476
4.059IleGly: 4.059 ± 0.755
1.173IleHis: 1.173 ± 0.284
4.42IleIle: 4.42 ± 0.688
7.036IleLys: 7.036 ± 0.853
5.683IleLeu: 5.683 ± 0.683
0.541IleMet: 0.541 ± 0.204
4.691IleAsn: 4.691 ± 0.792
2.075IlePro: 2.075 ± 0.351
2.436IleGln: 2.436 ± 0.354
2.345IleArg: 2.345 ± 0.559
4.42IleSer: 4.42 ± 0.516
5.142IleThr: 5.142 ± 0.595
4.059IleVal: 4.059 ± 0.51
0.451IleTrp: 0.451 ± 0.2
2.436IleTyr: 2.436 ± 0.552
0.0IleXaa: 0.0 ± 0.0
Lys
6.044LysAla: 6.044 ± 0.991
0.722LysCys: 0.722 ± 0.213
5.051LysAsp: 5.051 ± 0.748
7.216LysGlu: 7.216 ± 0.803
3.157LysPhe: 3.157 ± 0.626
5.502LysGly: 5.502 ± 0.753
1.533LysHis: 1.533 ± 0.372
5.322LysIle: 5.322 ± 0.758
6.585LysLys: 6.585 ± 0.816
5.953LysLeu: 5.953 ± 0.837
1.894LysMet: 1.894 ± 0.435
5.232LysAsn: 5.232 ± 0.778
3.608LysPro: 3.608 ± 0.824
3.789LysGln: 3.789 ± 0.691
4.781LysArg: 4.781 ± 0.735
4.149LysSer: 4.149 ± 0.601
4.33LysThr: 4.33 ± 0.642
5.051LysVal: 5.051 ± 0.667
1.443LysTrp: 1.443 ± 0.476
3.518LysTyr: 3.518 ± 0.525
0.0LysXaa: 0.0 ± 0.0
Leu
5.051LeuAla: 5.051 ± 0.586
0.812LeuCys: 0.812 ± 0.24
6.765LeuAsp: 6.765 ± 0.673
6.044LeuGlu: 6.044 ± 0.791
4.149LeuPhe: 4.149 ± 0.529
5.502LeuGly: 5.502 ± 0.573
0.451LeuHis: 0.451 ± 0.226
6.855LeuIle: 6.855 ± 0.766
7.036LeuLys: 7.036 ± 0.89
6.946LeuLeu: 6.946 ± 1.016
2.165LeuMet: 2.165 ± 0.474
5.322LeuAsn: 5.322 ± 0.879
3.157LeuPro: 3.157 ± 0.463
3.879LeuGln: 3.879 ± 0.574
3.247LeuArg: 3.247 ± 0.567
6.044LeuSer: 6.044 ± 0.67
7.036LeuThr: 7.036 ± 0.751
3.067LeuVal: 3.067 ± 0.526
0.812LeuTrp: 0.812 ± 0.272
3.067LeuTyr: 3.067 ± 0.62
0.0LeuXaa: 0.0 ± 0.0
Met
1.353MetAla: 1.353 ± 0.281
0.09MetCys: 0.09 ± 0.104
1.082MetAsp: 1.082 ± 0.324
1.624MetGlu: 1.624 ± 0.451
0.361MetPhe: 0.361 ± 0.172
1.533MetGly: 1.533 ± 0.42
0.18MetHis: 0.18 ± 0.121
1.173MetIle: 1.173 ± 0.338
2.165MetLys: 2.165 ± 0.53
1.173MetLeu: 1.173 ± 0.349
0.361MetMet: 0.361 ± 0.157
1.173MetAsn: 1.173 ± 0.228
0.722MetPro: 0.722 ± 0.246
0.812MetGln: 0.812 ± 0.228
0.812MetArg: 0.812 ± 0.271
1.804MetSer: 1.804 ± 0.387
0.631MetThr: 0.631 ± 0.269
0.902MetVal: 0.902 ± 0.365
0.18MetTrp: 0.18 ± 0.133
0.361MetTyr: 0.361 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
3.338AsnAla: 3.338 ± 0.628
0.541AsnCys: 0.541 ± 0.194
4.691AsnAsp: 4.691 ± 0.641
4.51AsnGlu: 4.51 ± 0.646
2.436AsnPhe: 2.436 ± 0.547
5.051AsnGly: 5.051 ± 0.753
1.082AsnHis: 1.082 ± 0.325
4.149AsnIle: 4.149 ± 0.707
5.683AsnLys: 5.683 ± 0.794
4.42AsnLeu: 4.42 ± 0.708
0.992AsnMet: 0.992 ± 0.28
4.059AsnAsn: 4.059 ± 0.728
2.526AsnPro: 2.526 ± 0.478
1.984AsnGln: 1.984 ± 0.427
2.255AsnArg: 2.255 ± 0.512
3.338AsnSer: 3.338 ± 0.566
3.067AsnThr: 3.067 ± 0.568
3.518AsnVal: 3.518 ± 0.537
0.812AsnTrp: 0.812 ± 0.281
2.255AsnTyr: 2.255 ± 0.529
0.0AsnXaa: 0.0 ± 0.0
Pro
2.255ProAla: 2.255 ± 0.487
0.451ProCys: 0.451 ± 0.273
2.436ProAsp: 2.436 ± 0.451
1.533ProGlu: 1.533 ± 0.29
1.263ProPhe: 1.263 ± 0.307
0.09ProGly: 0.09 ± 0.076
0.541ProHis: 0.541 ± 0.19
2.345ProIle: 2.345 ± 0.559
3.067ProLys: 3.067 ± 0.589
3.067ProLeu: 3.067 ± 0.598
0.361ProMet: 0.361 ± 0.157
2.706ProAsn: 2.706 ± 0.49
0.812ProPro: 0.812 ± 0.274
1.714ProGln: 1.714 ± 0.484
0.902ProArg: 0.902 ± 0.231
2.616ProSer: 2.616 ± 0.454
1.984ProThr: 1.984 ± 0.419
1.263ProVal: 1.263 ± 0.363
0.0ProTrp: 0.0 ± 0.0
0.722ProTyr: 0.722 ± 0.219
0.0ProXaa: 0.0 ± 0.0
Gln
2.887GlnAla: 2.887 ± 0.51
0.18GlnCys: 0.18 ± 0.163
2.255GlnAsp: 2.255 ± 0.428
2.345GlnGlu: 2.345 ± 0.622
1.714GlnPhe: 1.714 ± 0.59
2.436GlnGly: 2.436 ± 0.452
0.631GlnHis: 0.631 ± 0.243
2.796GlnIle: 2.796 ± 0.484
2.887GlnLys: 2.887 ± 0.552
3.518GlnLeu: 3.518 ± 0.57
0.451GlnMet: 0.451 ± 0.165
1.714GlnAsn: 1.714 ± 0.324
0.992GlnPro: 0.992 ± 0.355
1.804GlnGln: 1.804 ± 0.391
1.443GlnArg: 1.443 ± 0.567
2.526GlnSer: 2.526 ± 0.495
1.894GlnThr: 1.894 ± 0.428
1.624GlnVal: 1.624 ± 0.478
0.361GlnTrp: 0.361 ± 0.174
2.165GlnTyr: 2.165 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
2.796ArgAla: 2.796 ± 0.524
0.541ArgCys: 0.541 ± 0.212
2.345ArgAsp: 2.345 ± 0.506
2.706ArgGlu: 2.706 ± 0.624
2.526ArgPhe: 2.526 ± 0.525
1.804ArgGly: 1.804 ± 0.42
0.451ArgHis: 0.451 ± 0.187
3.157ArgIle: 3.157 ± 0.445
3.247ArgLys: 3.247 ± 0.523
4.24ArgLeu: 4.24 ± 0.686
0.812ArgMet: 0.812 ± 0.263
1.984ArgAsn: 1.984 ± 0.446
1.263ArgPro: 1.263 ± 0.322
0.902ArgGln: 0.902 ± 0.234
2.345ArgArg: 2.345 ± 0.45
2.165ArgSer: 2.165 ± 0.548
1.894ArgThr: 1.894 ± 0.507
2.436ArgVal: 2.436 ± 0.399
0.451ArgTrp: 0.451 ± 0.192
2.075ArgTyr: 2.075 ± 0.378
0.0ArgXaa: 0.0 ± 0.0
Ser
3.879SerAla: 3.879 ± 0.666
0.992SerCys: 0.992 ± 0.274
4.42SerAsp: 4.42 ± 0.618
3.608SerGlu: 3.608 ± 0.55
3.338SerPhe: 3.338 ± 0.639
5.863SerGly: 5.863 ± 0.955
0.722SerHis: 0.722 ± 0.283
4.059SerIle: 4.059 ± 0.475
4.871SerLys: 4.871 ± 0.868
5.863SerLeu: 5.863 ± 0.855
0.541SerMet: 0.541 ± 0.176
3.789SerAsn: 3.789 ± 0.519
1.173SerPro: 1.173 ± 0.333
2.345SerGln: 2.345 ± 0.46
2.526SerArg: 2.526 ± 0.393
4.51SerSer: 4.51 ± 0.613
2.796SerThr: 2.796 ± 0.383
3.518SerVal: 3.518 ± 0.669
1.082SerTrp: 1.082 ± 0.324
2.436SerTyr: 2.436 ± 0.464
0.0SerXaa: 0.0 ± 0.0
Thr
3.969ThrAla: 3.969 ± 0.622
0.361ThrCys: 0.361 ± 0.172
4.24ThrAsp: 4.24 ± 0.888
4.51ThrGlu: 4.51 ± 0.605
2.887ThrPhe: 2.887 ± 0.401
3.698ThrGly: 3.698 ± 0.485
1.173ThrHis: 1.173 ± 0.385
4.781ThrIle: 4.781 ± 0.734
4.059ThrLys: 4.059 ± 0.517
5.322ThrLeu: 5.322 ± 1.034
0.992ThrMet: 0.992 ± 0.28
3.247ThrAsn: 3.247 ± 0.525
3.067ThrPro: 3.067 ± 0.568
2.436ThrGln: 2.436 ± 0.452
2.255ThrArg: 2.255 ± 0.516
3.879ThrSer: 3.879 ± 0.599
4.149ThrThr: 4.149 ± 0.936
3.518ThrVal: 3.518 ± 0.763
0.541ThrTrp: 0.541 ± 0.277
2.526ThrTyr: 2.526 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
2.977ValAla: 2.977 ± 0.5
0.18ValCys: 0.18 ± 0.132
3.789ValAsp: 3.789 ± 0.622
3.518ValGlu: 3.518 ± 0.387
3.067ValPhe: 3.067 ± 0.523
3.518ValGly: 3.518 ± 0.565
0.451ValHis: 0.451 ± 0.212
4.961ValIle: 4.961 ± 0.646
5.142ValLys: 5.142 ± 0.646
4.51ValLeu: 4.51 ± 0.669
1.263ValMet: 1.263 ± 0.368
4.149ValAsn: 4.149 ± 0.658
1.984ValPro: 1.984 ± 0.385
1.984ValGln: 1.984 ± 0.387
1.443ValArg: 1.443 ± 0.376
3.698ValSer: 3.698 ± 0.656
2.887ValThr: 2.887 ± 0.505
3.789ValVal: 3.789 ± 0.546
0.631ValTrp: 0.631 ± 0.279
2.345ValTyr: 2.345 ± 0.412
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.276
0.0TrpCys: 0.0 ± 0.0
0.812TrpAsp: 0.812 ± 0.283
0.992TrpGlu: 0.992 ± 0.286
0.902TrpPhe: 0.902 ± 0.289
0.18TrpGly: 0.18 ± 0.119
0.0TrpHis: 0.0 ± 0.0
1.443TrpIle: 1.443 ± 0.361
0.992TrpLys: 0.992 ± 0.233
1.353TrpLeu: 1.353 ± 0.355
0.09TrpMet: 0.09 ± 0.082
0.902TrpAsn: 0.902 ± 0.286
0.0TrpPro: 0.0 ± 0.0
0.902TrpGln: 0.902 ± 0.32
0.451TrpArg: 0.451 ± 0.23
0.992TrpSer: 0.992 ± 0.279
0.541TrpThr: 0.541 ± 0.192
0.812TrpVal: 0.812 ± 0.235
0.361TrpTrp: 0.361 ± 0.244
0.722TrpTyr: 0.722 ± 0.211
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.984TyrAla: 1.984 ± 0.359
0.631TyrCys: 0.631 ± 0.249
3.338TyrAsp: 3.338 ± 0.54
2.165TyrGlu: 2.165 ± 0.522
2.706TyrPhe: 2.706 ± 0.495
1.984TyrGly: 1.984 ± 0.494
1.082TyrHis: 1.082 ± 0.319
2.887TyrIle: 2.887 ± 0.43
3.879TyrLys: 3.879 ± 0.645
3.067TyrLeu: 3.067 ± 0.527
0.722TyrMet: 0.722 ± 0.212
2.706TyrAsn: 2.706 ± 0.523
0.992TyrPro: 0.992 ± 0.281
1.173TyrGln: 1.173 ± 0.305
1.984TyrArg: 1.984 ± 0.406
2.255TyrSer: 2.255 ± 0.383
2.255TyrThr: 2.255 ± 0.564
2.526TyrVal: 2.526 ± 0.488
0.812TyrTrp: 0.812 ± 0.241
1.984TyrTyr: 1.984 ± 0.4
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11087 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski