Amino acid dipepetide frequency for Lactococcus phage asccphi28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.685AlaAla: 0.685 ± 0.435
0.171AlaCys: 0.171 ± 0.177
4.282AlaAsp: 4.282 ± 0.954
2.911AlaGlu: 2.911 ± 0.677
2.911AlaPhe: 2.911 ± 0.732
3.768AlaGly: 3.768 ± 1.081
0.343AlaHis: 0.343 ± 0.27
3.425AlaIle: 3.425 ± 0.813
4.11AlaLys: 4.11 ± 0.804
3.254AlaLeu: 3.254 ± 0.563
0.685AlaMet: 0.685 ± 0.319
4.624AlaAsn: 4.624 ± 1.114
1.37AlaPro: 1.37 ± 0.552
2.74AlaGln: 2.74 ± 0.728
2.226AlaArg: 2.226 ± 0.66
3.254AlaSer: 3.254 ± 0.745
4.11AlaThr: 4.11 ± 1.142
3.768AlaVal: 3.768 ± 0.879
1.199AlaTrp: 1.199 ± 0.369
3.768AlaTyr: 3.768 ± 0.957
0.0AlaXaa: 0.0 ± 0.0
Cys
0.343CysAla: 0.343 ± 0.224
0.514CysCys: 0.514 ± 0.234
0.343CysAsp: 0.343 ± 0.232
0.343CysGlu: 0.343 ± 0.209
0.343CysPhe: 0.343 ± 0.207
0.685CysGly: 0.685 ± 0.361
0.171CysHis: 0.171 ± 0.185
0.514CysIle: 0.514 ± 0.287
0.685CysLys: 0.685 ± 0.302
0.856CysLeu: 0.856 ± 0.558
0.0CysMet: 0.0 ± 0.0
0.343CysAsn: 0.343 ± 0.246
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.343CysArg: 0.343 ± 0.222
0.343CysSer: 0.343 ± 0.257
0.343CysThr: 0.343 ± 0.307
0.343CysVal: 0.343 ± 0.208
0.171CysTrp: 0.171 ± 0.179
0.343CysTyr: 0.343 ± 0.223
0.0CysXaa: 0.0 ± 0.0
Asp
2.055AspAla: 2.055 ± 0.696
0.171AspCys: 0.171 ± 0.179
3.597AspAsp: 3.597 ± 0.6
4.11AspGlu: 4.11 ± 1.162
4.453AspPhe: 4.453 ± 0.619
3.939AspGly: 3.939 ± 0.913
1.028AspHis: 1.028 ± 0.356
4.795AspIle: 4.795 ± 0.741
3.939AspLys: 3.939 ± 1.007
3.425AspLeu: 3.425 ± 0.564
1.884AspMet: 1.884 ± 0.564
3.254AspAsn: 3.254 ± 0.91
1.884AspPro: 1.884 ± 0.629
0.685AspGln: 0.685 ± 0.4
1.884AspArg: 1.884 ± 0.773
2.911AspSer: 2.911 ± 0.877
4.282AspThr: 4.282 ± 0.991
3.254AspVal: 3.254 ± 0.878
1.37AspTrp: 1.37 ± 0.54
3.254AspTyr: 3.254 ± 0.896
0.0AspXaa: 0.0 ± 0.0
Glu
3.083GluAla: 3.083 ± 0.994
0.343GluCys: 0.343 ± 0.238
3.254GluAsp: 3.254 ± 0.94
4.453GluGlu: 4.453 ± 1.139
3.254GluPhe: 3.254 ± 0.878
1.199GluGly: 1.199 ± 0.433
1.541GluHis: 1.541 ± 0.529
5.823GluIle: 5.823 ± 0.952
7.022GluLys: 7.022 ± 2.001
7.193GluLeu: 7.193 ± 1.094
3.083GluMet: 3.083 ± 0.676
4.795GluAsn: 4.795 ± 0.953
0.856GluPro: 0.856 ± 0.311
3.254GluGln: 3.254 ± 0.759
1.713GluArg: 1.713 ± 0.434
2.055GluSer: 2.055 ± 0.623
4.282GluThr: 4.282 ± 1.029
3.083GluVal: 3.083 ± 0.857
1.028GluTrp: 1.028 ± 0.311
3.768GluTyr: 3.768 ± 0.986
0.0GluXaa: 0.0 ± 0.0
Phe
2.911PheAla: 2.911 ± 0.634
0.0PheCys: 0.0 ± 0.0
3.083PheAsp: 3.083 ± 0.717
2.398PheGlu: 2.398 ± 0.58
1.541PhePhe: 1.541 ± 0.493
2.398PheGly: 2.398 ± 0.611
0.171PheHis: 0.171 ± 0.155
2.74PheIle: 2.74 ± 0.763
3.768PheLys: 3.768 ± 0.763
2.74PheLeu: 2.74 ± 0.837
1.541PheMet: 1.541 ± 0.629
5.138PheAsn: 5.138 ± 0.856
1.884PhePro: 1.884 ± 0.779
1.028PheGln: 1.028 ± 0.452
0.514PheArg: 0.514 ± 0.282
4.11PheSer: 4.11 ± 0.692
4.282PheThr: 4.282 ± 0.99
2.569PheVal: 2.569 ± 0.692
0.856PheTrp: 0.856 ± 0.368
2.569PheTyr: 2.569 ± 1.079
0.0PheXaa: 0.0 ± 0.0
Gly
4.795GlyAla: 4.795 ± 0.961
0.171GlyCys: 0.171 ± 0.179
2.398GlyAsp: 2.398 ± 0.885
2.911GlyGlu: 2.911 ± 0.582
1.884GlyPhe: 1.884 ± 0.445
3.083GlyGly: 3.083 ± 0.809
1.37GlyHis: 1.37 ± 0.5
3.597GlyIle: 3.597 ± 0.903
3.939GlyLys: 3.939 ± 0.668
4.11GlyLeu: 4.11 ± 0.813
1.884GlyMet: 1.884 ± 0.695
3.254GlyAsn: 3.254 ± 0.818
0.171GlyPro: 0.171 ± 0.15
2.226GlyGln: 2.226 ± 0.885
1.199GlyArg: 1.199 ± 0.389
3.425GlySer: 3.425 ± 0.97
4.795GlyThr: 4.795 ± 0.818
2.74GlyVal: 2.74 ± 0.649
2.055GlyTrp: 2.055 ± 1.101
3.254GlyTyr: 3.254 ± 1.09
0.0GlyXaa: 0.0 ± 0.0
His
0.514HisAla: 0.514 ± 0.253
0.343HisCys: 0.343 ± 0.241
1.199HisAsp: 1.199 ± 0.378
2.055HisGlu: 2.055 ± 0.565
0.856HisPhe: 0.856 ± 0.342
0.685HisGly: 0.685 ± 0.313
0.171HisHis: 0.171 ± 0.179
2.055HisIle: 2.055 ± 0.567
1.37HisLys: 1.37 ± 0.6
1.884HisLeu: 1.884 ± 0.538
0.514HisMet: 0.514 ± 0.26
0.685HisAsn: 0.685 ± 0.384
0.343HisPro: 0.343 ± 0.207
0.343HisGln: 0.343 ± 0.241
0.171HisArg: 0.171 ± 0.179
0.856HisSer: 0.856 ± 0.337
1.028HisThr: 1.028 ± 0.506
1.199HisVal: 1.199 ± 0.44
0.343HisTrp: 0.343 ± 0.236
1.028HisTyr: 1.028 ± 0.345
0.0HisXaa: 0.0 ± 0.0
Ile
5.138IleAla: 5.138 ± 0.663
0.171IleCys: 0.171 ± 0.212
4.624IleAsp: 4.624 ± 0.789
5.994IleGlu: 5.994 ± 1.078
3.083IlePhe: 3.083 ± 0.781
3.939IleGly: 3.939 ± 0.645
0.856IleHis: 0.856 ± 0.369
2.74IleIle: 2.74 ± 0.568
6.85IleLys: 6.85 ± 1.529
4.453IleLeu: 4.453 ± 1.114
2.055IleMet: 2.055 ± 0.644
4.624IleAsn: 4.624 ± 1.14
2.226IlePro: 2.226 ± 0.42
3.083IleGln: 3.083 ± 0.698
2.74IleArg: 2.74 ± 0.764
2.911IleSer: 2.911 ± 0.765
6.508IleThr: 6.508 ± 1.288
4.967IleVal: 4.967 ± 0.756
0.856IleTrp: 0.856 ± 0.361
2.74IleTyr: 2.74 ± 0.806
0.0IleXaa: 0.0 ± 0.0
Lys
4.624LysAla: 4.624 ± 0.929
0.514LysCys: 0.514 ± 0.39
4.282LysAsp: 4.282 ± 0.912
8.049LysGlu: 8.049 ± 1.586
3.768LysPhe: 3.768 ± 0.723
2.569LysGly: 2.569 ± 0.652
1.199LysHis: 1.199 ± 0.431
5.994LysIle: 5.994 ± 0.891
5.48LysLys: 5.48 ± 0.98
7.193LysLeu: 7.193 ± 0.937
2.569LysMet: 2.569 ± 0.707
7.707LysAsn: 7.707 ± 0.984
1.541LysPro: 1.541 ± 0.459
2.226LysGln: 2.226 ± 0.657
4.11LysArg: 4.11 ± 1.03
5.309LysSer: 5.309 ± 1.065
4.967LysThr: 4.967 ± 1.008
5.138LysVal: 5.138 ± 1.81
1.199LysTrp: 1.199 ± 0.548
3.425LysTyr: 3.425 ± 0.682
0.0LysXaa: 0.0 ± 0.0
Leu
3.083LeuAla: 3.083 ± 0.854
0.685LeuCys: 0.685 ± 0.355
4.795LeuAsp: 4.795 ± 1.061
5.652LeuGlu: 5.652 ± 1.271
3.939LeuPhe: 3.939 ± 0.66
4.453LeuGly: 4.453 ± 0.885
1.028LeuHis: 1.028 ± 0.491
4.624LeuIle: 4.624 ± 1.019
7.707LeuLys: 7.707 ± 1.132
7.878LeuLeu: 7.878 ± 1.66
3.254LeuMet: 3.254 ± 1.199
4.624LeuAsn: 4.624 ± 0.769
2.911LeuPro: 2.911 ± 0.842
4.795LeuGln: 4.795 ± 1.001
3.425LeuArg: 3.425 ± 0.9
5.138LeuSer: 5.138 ± 1.005
6.337LeuThr: 6.337 ± 0.992
5.138LeuVal: 5.138 ± 1.086
1.028LeuTrp: 1.028 ± 0.417
2.74LeuTyr: 2.74 ± 0.641
0.0LeuXaa: 0.0 ± 0.0
Met
1.713MetAla: 1.713 ± 0.479
0.0MetCys: 0.0 ± 0.0
1.028MetAsp: 1.028 ± 0.453
1.884MetGlu: 1.884 ± 0.625
0.856MetPhe: 0.856 ± 0.307
1.541MetGly: 1.541 ± 0.474
0.343MetHis: 0.343 ± 0.193
1.541MetIle: 1.541 ± 0.481
2.398MetLys: 2.398 ± 0.657
2.911MetLeu: 2.911 ± 0.78
0.856MetMet: 0.856 ± 0.489
1.199MetAsn: 1.199 ± 0.55
1.713MetPro: 1.713 ± 0.538
1.028MetGln: 1.028 ± 0.405
0.685MetArg: 0.685 ± 0.28
1.541MetSer: 1.541 ± 0.452
2.569MetThr: 2.569 ± 0.693
1.713MetVal: 1.713 ± 0.675
0.343MetTrp: 0.343 ± 0.333
1.199MetTyr: 1.199 ± 0.42
0.0MetXaa: 0.0 ± 0.0
Asn
3.768AsnAla: 3.768 ± 0.754
1.199AsnCys: 1.199 ± 0.58
3.254AsnAsp: 3.254 ± 0.717
4.624AsnGlu: 4.624 ± 1.013
4.11AsnPhe: 4.11 ± 0.876
5.309AsnGly: 5.309 ± 1.475
1.541AsnHis: 1.541 ± 0.727
5.309AsnIle: 5.309 ± 0.843
7.022AsnLys: 7.022 ± 1.225
6.85AsnLeu: 6.85 ± 1.111
1.199AsnMet: 1.199 ± 0.503
5.652AsnAsn: 5.652 ± 0.846
3.254AsnPro: 3.254 ± 0.698
3.083AsnGln: 3.083 ± 0.959
2.226AsnArg: 2.226 ± 0.694
5.48AsnSer: 5.48 ± 1.131
4.624AsnThr: 4.624 ± 1.229
2.74AsnVal: 2.74 ± 0.641
0.856AsnTrp: 0.856 ± 0.345
2.911AsnTyr: 2.911 ± 0.859
0.0AsnXaa: 0.0 ± 0.0
Pro
0.856ProAla: 0.856 ± 0.359
0.0ProCys: 0.0 ± 0.0
2.911ProAsp: 2.911 ± 0.561
2.226ProGlu: 2.226 ± 0.753
1.37ProPhe: 1.37 ± 0.415
0.171ProGly: 0.171 ± 0.149
0.171ProHis: 0.171 ± 0.185
2.911ProIle: 2.911 ± 0.699
2.569ProLys: 2.569 ± 0.643
2.569ProLeu: 2.569 ± 0.8
1.028ProMet: 1.028 ± 0.366
3.425ProAsn: 3.425 ± 0.825
1.028ProPro: 1.028 ± 0.27
0.514ProGln: 0.514 ± 0.323
1.028ProArg: 1.028 ± 0.337
2.055ProSer: 2.055 ± 0.392
2.398ProThr: 2.398 ± 0.611
1.713ProVal: 1.713 ± 0.45
0.171ProTrp: 0.171 ± 0.15
1.199ProTyr: 1.199 ± 0.376
0.0ProXaa: 0.0 ± 0.0
Gln
2.74GlnAla: 2.74 ± 0.729
0.343GlnCys: 0.343 ± 0.211
1.37GlnAsp: 1.37 ± 0.576
1.884GlnGlu: 1.884 ± 0.506
2.398GlnPhe: 2.398 ± 0.675
3.254GlnGly: 3.254 ± 0.96
0.856GlnHis: 0.856 ± 0.395
3.083GlnIle: 3.083 ± 0.872
2.74GlnLys: 2.74 ± 1.018
4.453GlnLeu: 4.453 ± 1.183
1.199GlnMet: 1.199 ± 0.477
4.624GlnAsn: 4.624 ± 0.991
1.884GlnPro: 1.884 ± 0.543
4.11GlnGln: 4.11 ± 1.199
1.713GlnArg: 1.713 ± 0.568
2.569GlnSer: 2.569 ± 0.778
2.398GlnThr: 2.398 ± 0.819
2.74GlnVal: 2.74 ± 0.741
0.514GlnTrp: 0.514 ± 0.268
1.37GlnTyr: 1.37 ± 0.613
0.0GlnXaa: 0.0 ± 0.0
Arg
2.911ArgAla: 2.911 ± 0.507
0.171ArgCys: 0.171 ± 0.177
0.685ArgAsp: 0.685 ± 0.329
2.74ArgGlu: 2.74 ± 0.539
1.713ArgPhe: 1.713 ± 0.548
1.199ArgGly: 1.199 ± 0.418
0.856ArgHis: 0.856 ± 0.349
1.541ArgIle: 1.541 ± 0.494
4.624ArgLys: 4.624 ± 0.919
3.425ArgLeu: 3.425 ± 0.825
0.343ArgMet: 0.343 ± 0.258
1.713ArgAsn: 1.713 ± 0.539
0.856ArgPro: 0.856 ± 0.351
1.713ArgGln: 1.713 ± 0.39
0.856ArgArg: 0.856 ± 0.272
1.028ArgSer: 1.028 ± 0.442
1.884ArgThr: 1.884 ± 0.604
1.713ArgVal: 1.713 ± 0.542
0.171ArgTrp: 0.171 ± 0.174
1.028ArgTyr: 1.028 ± 0.457
0.0ArgXaa: 0.0 ± 0.0
Ser
3.254SerAla: 3.254 ± 0.847
0.343SerCys: 0.343 ± 0.273
4.11SerAsp: 4.11 ± 0.865
2.74SerGlu: 2.74 ± 0.786
1.884SerPhe: 1.884 ± 0.555
3.768SerGly: 3.768 ± 1.251
1.541SerHis: 1.541 ± 0.485
5.309SerIle: 5.309 ± 0.809
4.967SerLys: 4.967 ± 1.108
4.11SerLeu: 4.11 ± 0.907
2.226SerMet: 2.226 ± 0.679
4.967SerAsn: 4.967 ± 1.211
2.398SerPro: 2.398 ± 0.542
3.768SerGln: 3.768 ± 0.856
1.37SerArg: 1.37 ± 0.408
4.282SerSer: 4.282 ± 0.821
4.795SerThr: 4.795 ± 1.078
2.74SerVal: 2.74 ± 0.558
0.514SerTrp: 0.514 ± 0.368
2.398SerTyr: 2.398 ± 0.471
0.0SerXaa: 0.0 ± 0.0
Thr
5.652ThrAla: 5.652 ± 1.175
0.514ThrCys: 0.514 ± 0.328
4.624ThrAsp: 4.624 ± 0.951
4.624ThrGlu: 4.624 ± 0.701
3.083ThrPhe: 3.083 ± 0.526
4.11ThrGly: 4.11 ± 0.991
1.541ThrHis: 1.541 ± 0.42
7.022ThrIle: 7.022 ± 1.066
4.453ThrLys: 4.453 ± 0.705
7.022ThrLeu: 7.022 ± 1.334
0.343ThrMet: 0.343 ± 0.306
3.083ThrAsn: 3.083 ± 0.72
3.425ThrPro: 3.425 ± 0.882
4.453ThrGln: 4.453 ± 0.996
1.541ThrArg: 1.541 ± 0.592
4.967ThrSer: 4.967 ± 0.994
4.624ThrThr: 4.624 ± 0.965
4.282ThrVal: 4.282 ± 0.937
0.343ThrTrp: 0.343 ± 0.215
3.939ThrTyr: 3.939 ± 0.944
0.0ThrXaa: 0.0 ± 0.0
Val
2.398ValAla: 2.398 ± 0.71
0.514ValCys: 0.514 ± 0.426
3.597ValAsp: 3.597 ± 1.015
2.398ValGlu: 2.398 ± 1.076
2.398ValPhe: 2.398 ± 0.715
2.569ValGly: 2.569 ± 0.79
1.884ValHis: 1.884 ± 0.458
4.282ValIle: 4.282 ± 0.807
4.282ValLys: 4.282 ± 0.942
3.597ValLeu: 3.597 ± 0.667
1.028ValMet: 1.028 ± 0.439
5.652ValAsn: 5.652 ± 1.269
1.199ValPro: 1.199 ± 0.455
3.254ValGln: 3.254 ± 0.568
2.226ValArg: 2.226 ± 0.513
4.11ValSer: 4.11 ± 0.939
5.48ValThr: 5.48 ± 1.39
3.939ValVal: 3.939 ± 0.904
0.0ValTrp: 0.0 ± 0.0
3.254ValTyr: 3.254 ± 1.142
0.0ValXaa: 0.0 ± 0.0
Trp
0.514TrpAla: 0.514 ± 0.311
0.171TrpCys: 0.171 ± 0.16
0.685TrpAsp: 0.685 ± 0.297
0.343TrpGlu: 0.343 ± 0.268
0.343TrpPhe: 0.343 ± 0.246
0.514TrpGly: 0.514 ± 0.292
0.343TrpHis: 0.343 ± 0.264
1.028TrpIle: 1.028 ± 0.369
0.343TrpLys: 0.343 ± 0.221
1.713TrpLeu: 1.713 ± 0.895
0.171TrpMet: 0.171 ± 0.177
1.028TrpAsn: 1.028 ± 0.454
0.0TrpPro: 0.0 ± 0.0
1.199TrpGln: 1.199 ± 0.474
0.514TrpArg: 0.514 ± 0.301
1.713TrpSer: 1.713 ± 0.579
1.028TrpThr: 1.028 ± 0.358
0.856TrpVal: 0.856 ± 0.465
0.343TrpTrp: 0.343 ± 0.208
0.856TrpTyr: 0.856 ± 0.458
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.569TyrAla: 2.569 ± 0.554
0.685TyrCys: 0.685 ± 0.348
2.055TyrAsp: 2.055 ± 0.901
2.74TyrGlu: 2.74 ± 0.7
2.055TyrPhe: 2.055 ± 0.798
4.11TyrGly: 4.11 ± 0.992
0.685TyrHis: 0.685 ± 0.32
2.226TyrIle: 2.226 ± 0.555
3.597TyrLys: 3.597 ± 1.143
3.768TyrLeu: 3.768 ± 1.058
1.028TyrMet: 1.028 ± 0.423
4.624TyrAsn: 4.624 ± 1.051
1.37TyrPro: 1.37 ± 0.564
2.74TyrGln: 2.74 ± 0.607
0.685TyrArg: 0.685 ± 0.418
3.425TyrSer: 3.425 ± 0.868
2.911TyrThr: 2.911 ± 0.881
3.425TyrVal: 3.425 ± 0.801
0.343TyrTrp: 0.343 ± 0.211
3.597TyrTyr: 3.597 ± 0.78
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (5840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski