Amino acid dipeptide frequency for Lactococcus phage jm2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
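As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. It is a minimal sketch under stated assumptions, not the database's own pipeline: the function name, the toy sequences, the use of one-letter codes, and the normalization by the total number of counted pairs are all assumptions made for illustration.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Hypothetical helper: normalizing by the total number of pairs is an
    assumption, the source page does not state its exact normalization.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein; pairs never span
        # the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code (made up, not taken from the proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example LysLys rather than KK); mapping between the two notations is left out of the sketch.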

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
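The comparison against the uniform expectation can be sketched as follows. This assumes the threshold is exactly 1/400 (2.5 ‰ on the scale used in the table) and that a simple greater-than or less-than test decides the label; the constant and function names are hypothetical, and the page's actual colouring rule may differ. The three example values are taken from the table.

```python
# Uniform expectation for 400 dipeptides, in per mille (0.25 % = 2.5 per mille).
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400

def classify(freq_per_mille, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "as expected"

# Values copied from the table (per mille).
examples = {"LysLys": 9.242, "AlaAla": 0.531, "CysCys": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```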

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
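A small sketch of the unit conversion, using one value from the table; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

glu_leu = 9.135  # GluLeu value from the table, in per mille
print(per_mille_to_fraction(glu_leu), per_mille_to_percent(glu_leu))
```

So 9.135 ‰ corresponds to a fraction of about 0.009135, or roughly 0.91%.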

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.531 ± 0.26
AlaCys: 0.319 ± 0.182
AlaAsp: 2.549 ± 0.485
AlaGlu: 4.249 ± 0.834
AlaPhe: 3.505 ± 0.802
AlaGly: 3.718 ± 0.684
AlaHis: 0.85 ± 0.276
AlaIle: 5.311 ± 0.871
AlaLys: 7.011 ± 1.058
AlaLeu: 6.055 ± 0.965
AlaMet: 2.018 ± 0.534
AlaAsn: 4.461 ± 0.956
AlaPro: 1.062 ± 0.356
AlaGln: 2.443 ± 0.53
AlaArg: 2.018 ± 0.42
AlaSer: 2.443 ± 0.651
AlaThr: 2.868 ± 0.678
AlaVal: 3.293 ± 0.749
AlaTrp: 1.381 ± 0.497
AlaTyr: 2.124 ± 0.453
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.212 ± 0.154
CysCys: 0.0 ± 0.0
CysAsp: 0.212 ± 0.144
CysGlu: 0.425 ± 0.199
CysPhe: 0.212 ± 0.147
CysGly: 0.85 ± 0.352
CysHis: 0.106 ± 0.114
CysIle: 0.425 ± 0.226
CysLys: 0.744 ± 0.371
CysLeu: 0.212 ± 0.15
CysMet: 0.106 ± 0.121
CysAsn: 0.637 ± 0.27
CysPro: 0.106 ± 0.096
CysGln: 0.319 ± 0.19
CysArg: 0.425 ± 0.197
CysSer: 0.637 ± 0.335
CysThr: 0.106 ± 0.088
CysVal: 0.531 ± 0.246
CysTrp: 0.0 ± 0.0
CysTyr: 0.106 ± 0.092
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.912 ± 0.503
AspCys: 0.212 ± 0.142
AspAsp: 3.718 ± 0.896
AspGlu: 3.293 ± 0.775
AspPhe: 3.824 ± 0.666
AspGly: 3.399 ± 0.603
AspHis: 1.062 ± 0.361
AspIle: 4.037 ± 0.651
AspLys: 5.524 ± 0.761
AspLeu: 6.905 ± 0.742
AspMet: 1.168 ± 0.411
AspAsn: 4.249 ± 0.709
AspPro: 1.275 ± 0.389
AspGln: 0.637 ± 0.273
AspArg: 1.487 ± 0.554
AspSer: 2.974 ± 0.555
AspThr: 4.461 ± 0.684
AspVal: 2.656 ± 0.521
AspTrp: 0.956 ± 0.365
AspTyr: 3.081 ± 0.526
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.824 ± 0.629
GluCys: 0.425 ± 0.235
GluAsp: 3.505 ± 0.761
GluGlu: 3.824 ± 0.777
GluPhe: 4.037 ± 0.569
GluGly: 2.337 ± 0.42
GluHis: 0.637 ± 0.258
GluIle: 5.524 ± 0.953
GluLys: 5.949 ± 1.149
GluLeu: 9.135 ± 1.77
GluMet: 2.762 ± 0.579
GluAsn: 6.373 ± 0.667
GluPro: 1.381 ± 0.338
GluGln: 3.824 ± 0.684
GluArg: 2.974 ± 0.639
GluSer: 2.974 ± 0.57
GluThr: 4.249 ± 0.733
GluVal: 3.824 ± 0.644
GluTrp: 0.956 ± 0.323
GluTyr: 3.293 ± 0.661
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.549 ± 0.543
PheCys: 0.212 ± 0.169
PheAsp: 3.187 ± 0.657
PheGlu: 2.868 ± 0.67
PhePhe: 1.806 ± 0.483
PheGly: 1.806 ± 0.503
PheHis: 0.85 ± 0.328
PheIle: 2.974 ± 0.616
PheLys: 4.143 ± 0.634
PheLeu: 2.762 ± 0.511
PheMet: 0.956 ± 0.332
PheAsn: 2.868 ± 0.608
PhePro: 0.744 ± 0.268
PheGln: 1.275 ± 0.431
PheArg: 1.487 ± 0.339
PheSer: 3.505 ± 0.667
PheThr: 2.974 ± 0.558
PheVal: 2.762 ± 0.448
PheTrp: 0.531 ± 0.221
PheTyr: 1.912 ± 0.487
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.824 ± 1.115
GlyCys: 0.425 ± 0.276
GlyAsp: 2.974 ± 0.598
GlyGlu: 3.93 ± 0.534
GlyPhe: 1.7 ± 0.387
GlyGly: 4.037 ± 0.734
GlyHis: 1.062 ± 0.344
GlyIle: 4.143 ± 1.205
GlyLys: 6.905 ± 0.655
GlyLeu: 6.161 ± 0.939
GlyMet: 1.062 ± 0.428
GlyAsn: 4.037 ± 0.668
GlyPro: 0.319 ± 0.183
GlyGln: 1.912 ± 0.554
GlyArg: 1.381 ± 0.267
GlySer: 4.568 ± 1.025
GlyThr: 3.399 ± 0.716
GlyVal: 4.461 ± 0.756
GlyTrp: 1.062 ± 0.315
GlyTyr: 3.399 ± 0.649
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.062 ± 0.363
HisCys: 0.531 ± 0.314
HisAsp: 0.637 ± 0.29
HisGlu: 0.637 ± 0.245
HisPhe: 0.425 ± 0.212
HisGly: 1.168 ± 0.395
HisHis: 0.106 ± 0.101
HisIle: 1.062 ± 0.334
HisLys: 0.956 ± 0.301
HisLeu: 1.168 ± 0.332
HisMet: 0.319 ± 0.215
HisAsn: 1.7 ± 0.379
HisPro: 0.106 ± 0.117
HisGln: 0.212 ± 0.166
HisArg: 0.319 ± 0.174
HisSer: 0.319 ± 0.174
HisThr: 0.85 ± 0.306
HisVal: 0.85 ± 0.368
HisTrp: 0.106 ± 0.123
HisTyr: 0.744 ± 0.301
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.718 ± 0.563
IleCys: 0.106 ± 0.1
IleAsp: 4.674 ± 0.595
IleGlu: 6.055 ± 0.988
IlePhe: 2.762 ± 0.501
IleGly: 3.718 ± 0.959
IleHis: 1.381 ± 0.395
IleIle: 4.993 ± 0.658
IleLys: 7.754 ± 1.139
IleLeu: 4.78 ± 0.785
IleMet: 1.487 ± 0.346
IleAsn: 4.993 ± 0.627
IlePro: 1.7 ± 0.464
IleGln: 2.443 ± 0.437
IleArg: 1.912 ± 0.4
IleSer: 4.78 ± 1.072
IleThr: 5.524 ± 0.975
IleVal: 5.099 ± 0.721
IleTrp: 0.956 ± 0.366
IleTyr: 2.231 ± 0.439
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.692 ± 0.986
LysCys: 0.531 ± 0.304
LysAsp: 4.886 ± 0.691
LysGlu: 9.029 ± 1.808
LysPhe: 2.231 ± 0.476
LysGly: 6.48 ± 1.01
LysHis: 1.275 ± 0.485
LysIle: 5.311 ± 0.823
LysLys: 9.242 ± 1.231
LysLeu: 7.223 ± 0.828
LysMet: 3.293 ± 0.555
LysAsn: 5.099 ± 0.72
LysPro: 2.018 ± 0.495
LysGln: 3.718 ± 0.802
LysArg: 3.718 ± 0.806
LysSer: 5.736 ± 0.926
LysThr: 4.993 ± 0.695
LysVal: 7.33 ± 0.744
LysTrp: 1.062 ± 0.274
LysTyr: 3.93 ± 0.726
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.461 ± 0.656
LeuCys: 0.319 ± 0.182
LeuAsp: 4.886 ± 0.63
LeuGlu: 5.736 ± 0.841
LeuPhe: 3.187 ± 0.658
LeuGly: 4.037 ± 0.68
LeuHis: 0.956 ± 0.329
LeuIle: 7.648 ± 1.03
LeuLys: 7.967 ± 0.864
LeuLeu: 6.798 ± 1.219
LeuMet: 2.018 ± 0.531
LeuAsn: 6.48 ± 0.808
LeuPro: 2.868 ± 0.625
LeuGln: 3.824 ± 0.614
LeuArg: 2.868 ± 0.487
LeuSer: 4.674 ± 0.613
LeuThr: 6.48 ± 0.695
LeuVal: 6.692 ± 0.689
LeuTrp: 1.062 ± 0.308
LeuTyr: 4.461 ± 0.845
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.443 ± 0.471
MetCys: 0.106 ± 0.097
MetAsp: 1.381 ± 0.409
MetGlu: 1.806 ± 0.628
MetPhe: 0.531 ± 0.201
MetGly: 0.956 ± 0.251
MetHis: 0.319 ± 0.203
MetIle: 2.018 ± 0.454
MetLys: 2.231 ± 0.51
MetLeu: 1.7 ± 0.584
MetMet: 0.212 ± 0.157
MetAsn: 2.549 ± 0.545
MetPro: 0.637 ± 0.232
MetGln: 1.487 ± 0.343
MetArg: 0.531 ± 0.239
MetSer: 1.806 ± 0.45
MetThr: 1.487 ± 0.359
MetVal: 1.806 ± 0.396
MetTrp: 0.106 ± 0.1
MetTyr: 1.168 ± 0.367
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.099 ± 0.96
AsnCys: 0.106 ± 0.109
AsnAsp: 4.674 ± 0.753
AsnGlu: 4.993 ± 0.928
AsnPhe: 2.124 ± 0.565
AsnGly: 7.117 ± 0.784
AsnHis: 0.85 ± 0.255
AsnIle: 4.461 ± 0.812
AsnLys: 6.48 ± 1.044
AsnLeu: 6.373 ± 0.74
AsnMet: 1.912 ± 0.495
AsnAsn: 4.249 ± 0.853
AsnPro: 2.018 ± 0.473
AsnGln: 2.337 ± 0.543
AsnArg: 1.593 ± 0.381
AsnSer: 5.205 ± 0.543
AsnThr: 4.674 ± 1.094
AsnVal: 3.718 ± 0.743
AsnTrp: 0.956 ± 0.322
AsnTyr: 2.868 ± 0.706
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.487 ± 0.424
ProCys: 0.106 ± 0.106
ProAsp: 1.381 ± 0.412
ProGlu: 1.275 ± 0.34
ProPhe: 1.381 ± 0.374
ProGly: 0.425 ± 0.202
ProHis: 0.106 ± 0.1
ProIle: 2.231 ± 0.484
ProLys: 1.7 ± 0.534
ProLeu: 1.806 ± 0.374
ProMet: 0.425 ± 0.193
ProAsn: 2.124 ± 0.674
ProPro: 0.85 ± 0.33
ProGln: 0.85 ± 0.418
ProArg: 0.531 ± 0.205
ProSer: 0.956 ± 0.307
ProThr: 2.124 ± 0.428
ProVal: 1.7 ± 0.471
ProTrp: 0.212 ± 0.139
ProTyr: 0.85 ± 0.383
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.868 ± 0.635
GlnCys: 0.212 ± 0.142
GlnAsp: 1.912 ± 0.526
GlnGlu: 3.081 ± 0.562
GlnPhe: 1.275 ± 0.383
GlnGly: 2.443 ± 0.519
GlnHis: 0.212 ± 0.132
GlnIle: 1.593 ± 0.32
GlnLys: 2.443 ± 0.552
GlnLeu: 3.824 ± 0.813
GlnMet: 0.744 ± 0.24
GlnAsn: 2.231 ± 0.37
GlnPro: 1.487 ± 0.437
GlnGln: 1.806 ± 0.518
GlnArg: 1.593 ± 0.442
GlnSer: 2.443 ± 0.507
GlnThr: 3.187 ± 0.716
GlnVal: 2.231 ± 0.485
GlnTrp: 0.744 ± 0.234
GlnTyr: 1.912 ± 0.396
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.231 ± 0.614
ArgCys: 0.319 ± 0.19
ArgAsp: 1.168 ± 0.372
ArgGlu: 2.549 ± 0.536
ArgPhe: 0.956 ± 0.323
ArgGly: 2.124 ± 0.379
ArgHis: 0.531 ± 0.236
ArgIle: 1.912 ± 0.512
ArgLys: 3.399 ± 0.839
ArgLeu: 3.93 ± 0.73
ArgMet: 0.637 ± 0.29
ArgAsn: 2.231 ± 0.539
ArgPro: 0.637 ± 0.22
ArgGln: 1.593 ± 0.375
ArgArg: 1.381 ± 0.379
ArgSer: 1.7 ± 0.347
ArgThr: 1.912 ± 0.422
ArgVal: 1.806 ± 0.421
ArgTrp: 0.319 ± 0.148
ArgTyr: 1.806 ± 0.47
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.612 ± 0.929
SerCys: 0.744 ± 0.317
SerAsp: 4.355 ± 0.568
SerGlu: 3.081 ± 0.456
SerPhe: 3.081 ± 0.678
SerGly: 5.417 ± 1.256
SerHis: 0.744 ± 0.261
SerIle: 4.249 ± 0.788
SerLys: 5.205 ± 0.87
SerLeu: 5.311 ± 0.586
SerMet: 2.018 ± 0.407
SerAsn: 3.187 ± 0.551
SerPro: 0.85 ± 0.314
SerGln: 2.124 ± 0.675
SerArg: 2.549 ± 0.434
SerSer: 4.143 ± 0.615
SerThr: 2.868 ± 0.526
SerVal: 3.612 ± 0.544
SerTrp: 0.425 ± 0.188
SerTyr: 2.868 ± 0.553
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.886 ± 0.729
ThrCys: 0.425 ± 0.191
ThrAsp: 3.187 ± 0.695
ThrGlu: 6.161 ± 0.769
ThrPhe: 2.549 ± 0.405
ThrGly: 3.824 ± 0.618
ThrHis: 0.0 ± 0.0
ThrIle: 4.886 ± 0.857
ThrLys: 4.886 ± 0.531
ThrLeu: 5.524 ± 0.778
ThrMet: 1.275 ± 0.299
ThrAsn: 5.311 ± 0.729
ThrPro: 1.593 ± 0.283
ThrGln: 2.762 ± 0.531
ThrArg: 1.806 ± 0.429
ThrSer: 4.037 ± 0.788
ThrThr: 3.612 ± 0.538
ThrVal: 4.568 ± 0.871
ThrTrp: 1.062 ± 0.433
ThrTyr: 2.231 ± 0.511
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.93 ± 0.663
ValCys: 0.531 ± 0.204
ValAsp: 4.461 ± 0.724
ValGlu: 4.461 ± 0.625
ValPhe: 3.081 ± 0.621
ValGly: 3.399 ± 0.618
ValHis: 0.744 ± 0.271
ValIle: 4.461 ± 0.537
ValLys: 7.117 ± 0.607
ValLeu: 2.974 ± 0.52
ValMet: 1.7 ± 0.431
ValAsn: 3.612 ± 0.944
ValPro: 1.806 ± 0.44
ValGln: 2.018 ± 0.576
ValArg: 2.762 ± 0.616
ValSer: 4.568 ± 1.046
ValThr: 5.205 ± 0.868
ValVal: 3.718 ± 0.57
ValTrp: 0.531 ± 0.202
ValTyr: 3.081 ± 0.554
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.744 ± 0.266
TrpCys: 0.212 ± 0.146
TrpAsp: 0.744 ± 0.263
TrpGlu: 0.531 ± 0.274
TrpPhe: 0.85 ± 0.367
TrpGly: 0.744 ± 0.28
TrpHis: 0.319 ± 0.213
TrpIle: 0.319 ± 0.191
TrpLys: 0.956 ± 0.327
TrpLeu: 1.275 ± 0.354
TrpMet: 0.212 ± 0.143
TrpAsn: 1.487 ± 0.533
TrpPro: 0.0 ± 0.0
TrpGln: 0.85 ± 0.267
TrpArg: 0.319 ± 0.246
TrpSer: 0.85 ± 0.29
TrpThr: 0.531 ± 0.267
TrpVal: 0.85 ± 0.349
TrpTrp: 0.212 ± 0.143
TrpTyr: 0.85 ± 0.288
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.124 ± 0.487
TyrCys: 0.531 ± 0.278
TyrAsp: 2.337 ± 0.664
TyrGlu: 3.93 ± 0.657
TyrPhe: 2.656 ± 0.497
TyrGly: 2.762 ± 0.569
TyrHis: 1.168 ± 0.375
TyrIle: 3.505 ± 0.607
TyrLys: 3.187 ± 0.755
TyrLeu: 3.505 ± 0.741
TyrMet: 0.85 ± 0.27
TyrAsn: 3.93 ± 0.6
TyrPro: 1.062 ± 0.386
TyrGln: 1.806 ± 0.436
TyrArg: 1.487 ± 0.489
TyrSer: 2.124 ± 0.592
TyrThr: 3.081 ± 0.632
TyrVal: 2.656 ± 0.512
TyrTrp: 0.212 ± 0.149
TyrTyr: 2.337 ± 0.665
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 59 proteins (9415 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
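One common way to bootstrap at the protein level is to resample whole proteins with replacement, recompute the frequency for each resample, and take the spread of those estimates as the error. The sketch below follows that idea with 100 resamples. It is an assumption about the general technique rather than a description of the exact procedure used here: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all illustrative choices.

```python
import random
import statistics
from collections import Counter

def pair_freq(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freq(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome in one-letter code (made up, not taken from the phage).
toy = ["MKKLA", "MKA", "AKKKM", "MLLA"]
err = bootstrap_error(toy, "KK")
print(round(pair_freq(toy, "KK"), 3), "±", round(err, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs that cluster in a few proteins show a correspondingly larger error.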

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski