Amino acid dipeptide frequency for Streptococcus phage P4761

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
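As an illustration, dipeptide frequencies could be computed as sketched below. This is a minimal sketch and not necessarily the procedure used by the database: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the normalisation by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Slide an overlapping window of length 2 along each protein;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter amino acid codes).
toy_proteome = ["MKKLA", "MAKK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input, KK occurs twice among seven dipeptides, so it gets the highest frequency.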

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
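The arithmetic behind these numbers can be checked with a short sketch. The function `representation` and its plain threshold comparison are assumptions for illustration; the exact rule used to colour the table (for example, whether the error is taken into account) is not stated on this page.

```python
STANDARD_AMINO_ACIDS = 20
WITH_XAA = 21  # 20 standard amino acids plus Xaa

n_pairs = STANDARD_AMINO_ACIDS ** 2      # 400 ordered pairs
n_pairs_with_xaa = WITH_XAA ** 2         # 441 ordered pairs
expected_per_mille = 1000.0 / n_pairs    # 2.5 per mille, i.e. 0.25%


def representation(value_per_mille, expected=expected_per_mille):
    """Classify a dipeptide frequency against the uniform expectation.

    Assumption: a simple comparison with the expected value; the page's
    actual colouring rule may differ.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table.
print("AlaLeu", representation(7.539))
print("CysCys", representation(0.085))
```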

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
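A small sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 5.421  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```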

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.421 ± 1.622 | 0.169 ± 0.119 | 4.066 ± 0.617 | 5.252 ± 0.736 | 3.049 ± 1.188 | 5.336 ± 1.16 | 0.424 ± 0.139 | 5.59 ± 1.342 | 6.099 ± 0.644 | 7.539 ± 0.942 | 2.033 ± 1.062 | 4.574 ± 0.687 | 2.033 ± 0.568 | 2.965 ± 0.818 | 3.134 ± 0.551 | 5.59 ± 1.459 | 3.558 ± 0.872 | 3.642 ± 0.852 | 0.678 ± 0.247 | 2.88 ± 0.879 | 0.0 ± 0.0
Cys | 0.169 ± 0.129 | 0.085 ± 0.089 | 0.593 ± 0.291 | 0.508 ± 0.3 | 0.254 ± 0.156 | 0.339 ± 0.242 | 0.169 ± 0.133 | 0.169 ± 0.11 | 0.424 ± 0.202 | 0.339 ± 0.225 | 0.085 ± 0.094 | 0.169 ± 0.138 | 0.085 ± 0.099 | 0.169 ± 0.131 | 0.339 ± 0.214 | 0.762 ± 0.372 | 0.0 ± 0.0 | 0.254 ± 0.172 | 0.085 ± 0.087 | 0.508 ± 0.329 | 0.0 ± 0.0
Asp | 3.303 ± 0.52 | 0.424 ± 0.258 | 4.743 ± 0.55 | 3.642 ± 0.638 | 3.558 ± 0.493 | 5.844 ± 1.76 | 0.847 ± 0.335 | 3.896 ± 0.578 | 4.997 ± 0.776 | 4.405 ± 0.849 | 1.694 ± 0.419 | 4.066 ± 0.581 | 0.762 ± 0.241 | 1.44 ± 0.34 | 3.049 ± 0.474 | 4.743 ± 1.111 | 2.795 ± 0.486 | 3.049 ± 0.482 | 1.186 ± 0.408 | 2.71 ± 0.553 | 0.0 ± 0.0
Glu | 4.913 ± 0.842 | 0.169 ± 0.132 | 2.626 ± 0.497 | 5.082 ± 1.178 | 2.71 ± 0.464 | 3.134 ± 0.494 | 1.016 ± 0.386 | 5.421 ± 0.975 | 5.082 ± 1.363 | 6.353 ± 1.141 | 2.541 ± 0.652 | 3.812 ± 0.651 | 1.609 ± 0.556 | 3.134 ± 0.677 | 3.558 ± 0.728 | 3.219 ± 0.672 | 4.066 ± 0.762 | 5.336 ± 0.792 | 1.101 ± 0.38 | 3.049 ± 0.638 | 0.0 ± 0.0
Phe | 2.626 ± 0.444 | 0.339 ± 0.209 | 3.219 ± 0.483 | 4.15 ± 0.665 | 1.44 ± 0.38 | 3.134 ± 0.873 | 0.424 ± 0.156 | 2.71 ± 0.425 | 4.997 ± 0.52 | 1.948 ± 0.415 | 0.847 ± 0.256 | 2.965 ± 0.446 | 0.678 ± 0.279 | 0.593 ± 0.259 | 1.271 ± 0.322 | 3.049 ± 0.469 | 2.88 ± 0.568 | 1.694 ± 0.303 | 0.678 ± 0.212 | 1.186 ± 0.301 | 0.0 ± 0.0
Gly | 4.828 ± 0.851 | 0.169 ± 0.13 | 2.795 ± 0.442 | 3.642 ± 0.478 | 2.541 ± 0.418 | 3.049 ± 0.523 | 0.762 ± 0.28 | 7.03 ± 1.786 | 5.929 ± 1.171 | 5.59 ± 0.933 | 1.694 ± 0.666 | 3.642 ± 0.687 | 1.948 ± 0.779 | 3.049 ± 0.569 | 3.558 ± 0.92 | 4.405 ± 0.732 | 6.099 ± 1.078 | 3.896 ± 0.676 | 0.762 ± 0.337 | 2.287 ± 0.394 | 0.0 ± 0.0
His | 0.678 ± 0.238 | 0.085 ± 0.093 | 1.016 ± 0.275 | 0.424 ± 0.256 | 0.339 ± 0.148 | 1.016 ± 0.369 | 0.424 ± 0.199 | 1.186 ± 0.264 | 0.932 ± 0.337 | 1.101 ± 0.299 | 0.339 ± 0.255 | 0.762 ± 0.256 | 0.254 ± 0.163 | 0.424 ± 0.207 | 0.424 ± 0.214 | 0.847 ± 0.316 | 1.016 ± 0.239 | 0.847 ± 0.315 | 0.085 ± 0.095 | 0.932 ± 0.301 | 0.0 ± 0.0
Ile | 4.743 ± 1.362 | 0.339 ± 0.178 | 4.405 ± 0.475 | 4.743 ± 1.059 | 1.863 ± 0.342 | 5.506 ± 0.969 | 1.016 ± 0.253 | 3.896 ± 0.731 | 5.844 ± 0.674 | 3.981 ± 0.521 | 1.525 ± 0.36 | 3.812 ± 0.777 | 2.287 ± 0.492 | 3.558 ± 0.483 | 2.965 ± 0.621 | 5.929 ± 1.133 | 4.574 ± 0.717 | 3.558 ± 0.914 | 0.678 ± 0.249 | 2.965 ± 0.589 | 0.0 ± 0.0
Lys | 7.284 ± 0.752 | 0.593 ± 0.331 | 5.167 ± 0.868 | 6.692 ± 1.219 | 2.287 ± 0.506 | 5.76 ± 1.072 | 1.44 ± 0.472 | 4.32 ± 0.874 | 7.369 ± 1.482 | 6.946 ± 1.068 | 2.033 ± 0.575 | 2.795 ± 0.524 | 2.795 ± 0.466 | 2.71 ± 0.531 | 5.167 ± 0.901 | 4.574 ± 0.725 | 5.167 ± 0.785 | 3.981 ± 0.673 | 0.847 ± 0.291 | 4.659 ± 1.105 | 0.0 ± 0.0
Leu | 6.353 ± 0.897 | 0.508 ± 0.226 | 4.828 ± 0.863 | 6.607 ± 1.086 | 2.456 ± 0.4 | 5.082 ± 0.936 | 0.593 ± 0.277 | 3.812 ± 0.511 | 6.692 ± 0.987 | 4.828 ± 0.67 | 1.863 ± 0.414 | 5.167 ± 0.517 | 2.118 ± 0.456 | 2.626 ± 0.484 | 2.626 ± 0.64 | 6.183 ± 0.787 | 5.675 ± 0.974 | 4.574 ± 0.693 | 0.847 ± 0.349 | 3.303 ± 0.685 | 0.0 ± 0.0
Met | 2.626 ± 0.838 | 0.0 ± 0.0 | 1.101 ± 0.344 | 1.355 ± 0.355 | 1.016 ± 0.367 | 1.271 ± 0.459 | 0.254 ± 0.126 | 1.863 ± 0.491 | 2.287 ± 0.49 | 1.694 ± 0.354 | 1.186 ± 0.532 | 1.271 ± 0.349 | 0.339 ± 0.177 | 1.863 ± 0.574 | 0.847 ± 0.279 | 1.609 ± 0.638 | 1.694 ± 0.368 | 1.101 ± 0.408 | 0.0 ± 0.0 | 0.932 ± 0.36 | 0.0 ± 0.0
Asn | 3.727 ± 0.736 | 0.424 ± 0.174 | 3.727 ± 0.858 | 4.235 ± 1.015 | 3.219 ± 0.681 | 5.336 ± 0.871 | 1.186 ± 0.404 | 2.965 ± 0.491 | 4.32 ± 0.729 | 4.15 ± 0.44 | 0.762 ± 0.258 | 2.88 ± 0.479 | 2.287 ± 0.481 | 1.863 ± 0.497 | 1.948 ± 0.477 | 3.642 ± 0.482 | 3.303 ± 0.553 | 3.727 ± 0.513 | 1.271 ± 0.382 | 2.287 ± 0.533 | 0.0 ± 0.0
Pro | 1.694 ± 0.422 | 0.254 ± 0.157 | 2.033 ± 0.532 | 1.186 ± 0.378 | 1.186 ± 0.394 | 1.44 ± 0.582 | 0.085 ± 0.087 | 1.863 ± 0.39 | 2.626 ± 0.505 | 1.694 ± 0.486 | 0.254 ± 0.188 | 2.202 ± 0.445 | 0.932 ± 0.272 | 1.525 ± 0.331 | 1.44 ± 0.463 | 2.372 ± 0.414 | 1.101 ± 0.419 | 1.948 ± 0.322 | 0.424 ± 0.173 | 1.271 ± 0.361 | 0.0 ± 0.0
Gln | 3.896 ± 0.964 | 0.254 ± 0.119 | 2.118 ± 0.385 | 2.88 ± 0.699 | 2.118 ± 0.397 | 2.795 ± 0.948 | 0.424 ± 0.219 | 2.541 ± 0.654 | 2.456 ± 0.633 | 3.642 ± 0.565 | 1.271 ± 0.391 | 2.118 ± 0.424 | 1.101 ± 0.525 | 1.948 ± 0.726 | 1.44 ± 0.372 | 2.88 ± 0.637 | 2.626 ± 0.436 | 2.372 ± 0.479 | 0.424 ± 0.189 | 1.609 ± 0.528 | 0.0 ± 0.0
Arg | 3.134 ± 0.451 | 0.508 ± 0.254 | 2.372 ± 0.369 | 2.541 ± 0.727 | 1.863 ± 0.415 | 2.287 ± 0.399 | 0.339 ± 0.183 | 3.303 ± 0.738 | 2.88 ± 0.675 | 3.812 ± 0.755 | 1.609 ± 0.379 | 2.118 ± 0.363 | 0.762 ± 0.232 | 2.202 ± 0.493 | 2.033 ± 0.474 | 2.287 ± 0.439 | 2.456 ± 0.578 | 2.795 ± 0.623 | 0.678 ± 0.225 | 2.118 ± 0.411 | 0.0 ± 0.0
Ser | 6.099 ± 2.165 | 0.169 ± 0.136 | 4.489 ± 0.856 | 3.388 ± 0.611 | 3.219 ± 0.669 | 4.913 ± 0.848 | 0.847 ± 0.304 | 5.421 ± 0.908 | 5.421 ± 0.73 | 4.743 ± 0.792 | 1.44 ± 0.308 | 4.489 ± 0.706 | 2.033 ± 0.346 | 3.558 ± 0.681 | 2.71 ± 0.571 | 4.828 ± 1.355 | 3.896 ± 0.62 | 6.776 ± 0.874 | 0.593 ± 0.167 | 2.118 ± 0.506 | 0.0 ± 0.0
Thr | 4.405 ± 1.204 | 0.169 ± 0.133 | 3.388 ± 0.715 | 3.219 ± 0.644 | 2.88 ± 0.502 | 4.489 ± 0.794 | 1.016 ± 0.302 | 5.421 ± 0.892 | 5.252 ± 0.672 | 5.929 ± 0.609 | 0.847 ± 0.402 | 3.219 ± 0.511 | 1.948 ± 0.443 | 2.88 ± 0.499 | 2.033 ± 0.407 | 4.574 ± 1.035 | 3.896 ± 0.794 | 4.997 ± 0.831 | 0.424 ± 0.247 | 2.71 ± 0.75 | 0.0 ± 0.0
Val | 4.235 ± 0.805 | 0.339 ± 0.19 | 4.235 ± 0.614 | 5.675 ± 0.972 | 2.71 ± 0.511 | 3.642 ± 0.622 | 0.932 ± 0.289 | 3.558 ± 0.654 | 5.167 ± 0.692 | 3.727 ± 0.562 | 0.932 ± 0.278 | 3.981 ± 0.858 | 2.033 ± 0.471 | 2.372 ± 0.404 | 1.525 ± 0.465 | 5.844 ± 0.726 | 4.828 ± 0.724 | 4.405 ± 0.589 | 0.847 ± 0.258 | 1.355 ± 0.423 | 0.0 ± 0.0
Trp | 0.678 ± 0.237 | 0.0 ± 0.0 | 0.508 ± 0.229 | 0.932 ± 0.304 | 0.424 ± 0.209 | 0.847 ± 0.241 | 0.169 ± 0.128 | 0.508 ± 0.309 | 0.678 ± 0.206 | 0.847 ± 0.264 | 0.169 ± 0.118 | 0.847 ± 0.391 | 0.169 ± 0.129 | 0.508 ± 0.199 | 0.593 ± 0.201 | 1.355 ± 0.568 | 1.355 ± 0.839 | 1.016 ± 0.392 | 0.254 ± 0.192 | 0.424 ± 0.151 | 0.0 ± 0.0
Tyr | 3.219 ± 0.727 | 0.508 ± 0.255 | 3.727 ± 0.923 | 1.779 ± 0.515 | 1.694 ± 0.379 | 2.456 ± 0.492 | 0.762 ± 0.317 | 2.71 ± 0.731 | 3.134 ± 0.672 | 3.473 ± 0.762 | 1.101 ± 0.326 | 2.372 ± 0.468 | 1.355 ± 0.524 | 1.609 ± 0.404 | 1.44 ± 0.386 | 2.456 ± 0.563 | 2.71 ± 0.752 | 2.372 ± 0.468 | 0.424 ± 0.164 | 1.355 ± 0.514 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 55 proteins (11807 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
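A protein-level bootstrap could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. Only the resampling unit (proteins) and the number of resamples (100) come from the note above. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates; the database may use a different statistic.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter amino acid codes).
toy_proteome = ["MKKLA", "MAKK", "AAAA"]
value = pair_frequency(toy_proteome, "KK")
error = bootstrap_error(toy_proteome, "KK")
print(f"KK: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins are in the set.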

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski