Amino acid dipeptide frequency for Lactococcus phage GE1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
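The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa), the choice to count overlapping adjacent pairs within each protein, and the normalization by the total number of pairs are all assumptions made here, and the input sequences are made up.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report every possible pair, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

toy = ["MKKLA", "MKXA"]      # made-up sequences, not from the GE1 proteome
freqs = dipeptide_per_mille(toy)
print(len(freqs))            # 441 possible dipeptides once Xaa is included
print(freqs["MK"])           # MK occurs in 2 of the 7 windows
print(1000.0 / 400)          # uniform expectation without Xaa, in per mille
```

Returning all 441 keys, including pairs with zero counts, mirrors the layout of the table, where absent dipeptides are listed as 0.0.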

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
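As a small worked example of the unit conversion: the uniform expectation of 0.25% corresponds to 2.5‰, so table entries can be compared against that number directly. The helper name `per_mille_to_fraction` is made up for this sketch, and treating 2.5‰ as the threshold for over- or underrepresentation is an assumption based on the text above.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

uniform_400 = 1000.0 / 400   # uniform expectation in per mille, 20 amino acids
uniform_441 = 1000.0 / 441   # same, with Xaa counted as a 21st amino acid

ala_leu = 7.632              # AlaLeu entry from the table, in per mille
fraction = per_mille_to_fraction(ala_leu)
fold = ala_leu / uniform_400 # ratio to the uniform expectation

print(fraction, fraction * 100)  # as a fraction and as a percentage
print(fold)                      # a value above 1 means more frequent than uniform
```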

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error, in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.409 ± 0.263 | 0.545 ± 0.277 | 2.317 ± 0.423 | 3.407 ± 0.549 | 2.862 ± 0.664 | 5.315 ± 1.507 | 0.545 ± 0.242 | 4.633 ± 0.674 | 5.587 ± 0.972 | 7.632 ± 0.965 | 3.679 ± 0.587 | 2.862 ± 0.573 | 1.635 ± 0.541 | 3.271 ± 0.881 | 1.908 ± 0.521 | 5.724 ± 1.309 | 4.361 ± 0.797 | 5.042 ± 0.845 | 1.363 ± 0.441 | 2.044 ± 0.587 | 0.0 ± 0.0 |
| Cys | 0.273 ± 0.184 | 0.136 ± 0.138 | 0.273 ± 0.187 | 0.273 ± 0.235 | 0.0 ± 0.0 | 0.818 ± 0.42 | 0.409 ± 0.418 | 0.136 ± 0.138 | 0.954 ± 0.553 | 0.273 ± 0.2 | 0.0 ± 0.0 | 0.545 ± 0.257 | 0.273 ± 0.241 | 0.273 ± 0.184 | 0.273 ± 0.207 | 0.136 ± 0.13 | 0.136 ± 0.138 | 0.136 ± 0.125 | 0.0 ± 0.0 | 0.273 ± 0.196 | 0.0 ± 0.0 |
| Asp | 2.862 ± 0.691 | 0.136 ± 0.146 | 3.407 ± 0.607 | 3.816 ± 0.815 | 3.679 ± 0.76 | 4.906 ± 0.912 | 0.545 ± 0.31 | 6.132 ± 0.951 | 4.633 ± 0.594 | 4.361 ± 0.711 | 2.589 ± 0.624 | 4.361 ± 0.571 | 2.453 ± 0.551 | 1.363 ± 0.372 | 1.635 ± 0.491 | 3.679 ± 0.585 | 2.18 ± 0.471 | 2.862 ± 0.578 | 0.409 ± 0.238 | 4.225 ± 0.85 | 0.0 ± 0.0 |
| Glu | 6.269 ± 0.882 | 0.818 ± 0.468 | 4.225 ± 0.894 | 6.405 ± 1.571 | 3.271 ± 0.62 | 3.816 ± 0.863 | 1.363 ± 0.381 | 4.906 ± 0.96 | 6.269 ± 1.31 | 5.996 ± 0.977 | 1.635 ± 0.432 | 4.088 ± 0.654 | 1.499 ± 0.559 | 2.862 ± 0.67 | 2.589 ± 0.702 | 3.134 ± 0.522 | 2.044 ± 0.487 | 6.269 ± 1.012 | 0.681 ± 0.317 | 3.816 ± 0.72 | 0.0 ± 0.0 |
| Phe | 3.407 ± 0.774 | 0.273 ± 0.182 | 3.271 ± 0.605 | 2.317 ± 0.403 | 1.772 ± 0.42 | 3.134 ± 0.844 | 0.273 ± 0.195 | 2.726 ± 0.644 | 3.816 ± 0.608 | 3.271 ± 0.736 | 0.954 ± 0.394 | 2.862 ± 0.577 | 1.226 ± 0.323 | 0.954 ± 0.425 | 0.818 ± 0.323 | 2.18 ± 0.479 | 2.998 ± 0.646 | 3.271 ± 0.874 | 0.409 ± 0.26 | 0.954 ± 0.325 | 0.0 ± 0.0 |
| Gly | 4.906 ± 1.549 | 0.136 ± 0.125 | 2.998 ± 0.594 | 4.361 ± 0.963 | 2.726 ± 0.431 | 3.952 ± 0.679 | 0.954 ± 0.428 | 4.361 ± 1.14 | 7.086 ± 1.057 | 7.632 ± 0.866 | 2.317 ± 0.632 | 2.726 ± 0.673 | 0.818 ± 0.385 | 2.862 ± 0.769 | 2.998 ± 0.703 | 3.134 ± 0.779 | 6.405 ± 0.84 | 5.451 ± 0.766 | 1.09 ± 0.396 | 2.18 ± 0.532 | 0.0 ± 0.0 |
| His | 0.409 ± 0.269 | 0.273 ± 0.181 | 0.818 ± 0.277 | 0.681 ± 0.271 | 0.136 ± 0.139 | 1.226 ± 0.382 | 0.273 ± 0.215 | 0.954 ± 0.341 | 1.09 ± 0.424 | 0.818 ± 0.506 | 0.954 ± 0.352 | 0.954 ± 0.301 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.545 ± 0.286 | 0.954 ± 0.377 | 0.136 ± 0.13 | 0.681 ± 0.271 | 0.0 ± 0.0 | 0.954 ± 0.361 | 0.0 ± 0.0 |
| Ile | 4.225 ± 0.734 | 0.409 ± 0.197 | 3.543 ± 0.667 | 5.996 ± 1.166 | 2.589 ± 0.656 | 4.225 ± 1.099 | 0.136 ± 0.117 | 2.589 ± 0.599 | 6.678 ± 1.062 | 4.088 ± 0.745 | 2.044 ± 0.461 | 5.315 ± 0.785 | 1.226 ± 0.415 | 2.589 ± 0.566 | 3.271 ± 0.663 | 3.543 ± 0.6 | 6.405 ± 1.007 | 4.497 ± 0.746 | 0.681 ± 0.311 | 2.726 ± 0.754 | 0.0 ± 0.0 |
| Lys | 6.405 ± 1.196 | 0.409 ± 0.271 | 4.497 ± 0.785 | 8.313 ± 1.496 | 1.908 ± 0.438 | 6.678 ± 1.076 | 1.363 ± 0.433 | 4.088 ± 0.564 | 6.814 ± 1.422 | 5.724 ± 0.987 | 3.679 ± 0.826 | 3.543 ± 0.538 | 2.317 ± 0.483 | 4.633 ± 0.784 | 5.996 ± 0.932 | 7.086 ± 1.243 | 4.225 ± 0.694 | 3.952 ± 0.653 | 0.681 ± 0.263 | 5.315 ± 1.019 | 0.0 ± 0.0 |
| Leu | 5.86 ± 0.829 | 0.273 ± 0.21 | 6.95 ± 0.853 | 6.405 ± 0.965 | 2.453 ± 0.565 | 4.906 ± 0.907 | 0.818 ± 0.306 | 5.996 ± 0.87 | 7.768 ± 0.984 | 4.633 ± 0.61 | 1.908 ± 0.435 | 4.225 ± 0.789 | 2.726 ± 0.7 | 3.271 ± 0.724 | 3.543 ± 0.73 | 5.587 ± 1.024 | 4.77 ± 0.729 | 4.225 ± 0.601 | 0.409 ± 0.265 | 2.862 ± 0.648 | 0.0 ± 0.0 |
| Met | 3.134 ± 0.831 | 0.273 ± 0.195 | 2.589 ± 0.662 | 1.908 ± 0.575 | 1.499 ± 0.52 | 1.908 ± 0.498 | 0.136 ± 0.138 | 2.453 ± 0.689 | 2.998 ± 0.727 | 2.18 ± 0.662 | 1.09 ± 0.345 | 2.589 ± 0.48 | 0.681 ± 0.287 | 1.499 ± 0.482 | 2.044 ± 0.58 | 2.18 ± 0.385 | 2.044 ± 0.523 | 1.226 ± 0.285 | 0.136 ± 0.151 | 0.818 ± 0.318 | 0.0 ± 0.0 |
| Asn | 2.862 ± 0.538 | 0.273 ± 0.191 | 3.407 ± 0.773 | 4.088 ± 0.783 | 2.18 ± 0.663 | 4.361 ± 0.798 | 0.273 ± 0.203 | 3.271 ± 0.722 | 5.724 ± 0.982 | 4.361 ± 0.767 | 2.18 ± 0.579 | 2.589 ± 0.639 | 1.908 ± 0.562 | 1.772 ± 0.484 | 2.044 ± 0.478 | 4.088 ± 0.77 | 3.407 ± 0.704 | 4.633 ± 0.639 | 0.409 ± 0.246 | 2.862 ± 0.968 | 0.0 ± 0.0 |
| Pro | 1.772 ± 0.682 | 0.0 ± 0.0 | 2.044 ± 0.679 | 2.589 ± 0.729 | 1.635 ± 0.473 | 0.409 ± 0.251 | 0.0 ± 0.0 | 2.862 ± 0.789 | 2.317 ± 0.667 | 2.317 ± 0.545 | 0.545 ± 0.246 | 2.589 ± 0.514 | 0.954 ± 0.426 | 1.226 ± 0.379 | 1.09 ± 0.508 | 1.363 ± 0.45 | 1.499 ± 0.557 | 1.635 ± 0.415 | 0.273 ± 0.192 | 0.954 ± 0.321 | 0.0 ± 0.0 |
| Gln | 3.543 ± 0.554 | 0.0 ± 0.0 | 1.499 ± 0.377 | 1.772 ± 0.537 | 1.908 ± 0.453 | 2.998 ± 0.588 | 0.954 ± 0.399 | 2.18 ± 0.568 | 2.726 ± 0.532 | 4.361 ± 0.832 | 1.226 ± 0.404 | 1.499 ± 0.326 | 0.818 ± 0.308 | 1.772 ± 0.465 | 2.453 ± 0.785 | 2.044 ± 0.404 | 1.226 ± 0.396 | 2.453 ± 0.482 | 0.681 ± 0.295 | 1.226 ± 0.46 | 0.0 ± 0.0 |
| Arg | 3.543 ± 0.636 | 0.545 ± 0.285 | 2.589 ± 0.761 | 2.453 ± 0.578 | 2.18 ± 0.592 | 2.998 ± 0.551 | 0.954 ± 0.411 | 1.499 ± 0.583 | 3.134 ± 0.712 | 2.18 ± 0.583 | 0.681 ± 0.298 | 2.044 ± 0.425 | 0.818 ± 0.444 | 1.908 ± 0.578 | 1.363 ± 0.504 | 3.679 ± 0.825 | 2.18 ± 0.626 | 3.271 ± 0.778 | 1.09 ± 0.378 | 1.499 ± 0.651 | 0.0 ± 0.0 |
| Ser | 3.543 ± 1.156 | 0.0 ± 0.0 | 3.543 ± 0.77 | 4.633 ± 0.89 | 3.816 ± 0.586 | 6.269 ± 0.995 | 0.545 ± 0.262 | 4.225 ± 1.005 | 4.633 ± 0.974 | 5.86 ± 1.091 | 1.635 ± 0.504 | 3.543 ± 0.658 | 2.862 ± 0.634 | 1.363 ± 0.423 | 1.772 ± 0.48 | 3.952 ± 0.834 | 3.952 ± 0.664 | 4.633 ± 0.614 | 0.818 ± 0.299 | 2.044 ± 0.598 | 0.0 ± 0.0 |
| Thr | 5.042 ± 0.746 | 0.136 ± 0.158 | 3.816 ± 0.688 | 5.042 ± 0.964 | 1.772 ± 0.364 | 3.952 ± 0.917 | 0.409 ± 0.226 | 5.179 ± 1.046 | 4.906 ± 0.903 | 5.451 ± 0.784 | 1.499 ± 0.479 | 2.726 ± 0.75 | 2.317 ± 0.531 | 2.18 ± 0.456 | 1.499 ± 0.42 | 3.271 ± 0.785 | 3.134 ± 0.776 | 4.225 ± 0.761 | 0.681 ± 0.363 | 2.862 ± 0.676 | 0.0 ± 0.0 |
| Val | 3.271 ± 0.81 | 0.409 ± 0.254 | 5.042 ± 0.821 | 4.088 ± 0.865 | 2.726 ± 0.715 | 4.77 ± 1.026 | 1.09 ± 0.303 | 3.952 ± 0.749 | 5.451 ± 0.922 | 4.906 ± 0.954 | 2.453 ± 0.642 | 3.679 ± 0.648 | 2.317 ± 0.698 | 2.317 ± 0.445 | 2.862 ± 0.582 | 5.042 ± 0.745 | 4.77 ± 0.698 | 5.451 ± 0.754 | 0.545 ± 0.281 | 2.453 ± 0.7 | 0.0 ± 0.0 |
| Trp | 1.09 ± 0.426 | 0.273 ± 0.205 | 0.818 ± 0.327 | 0.818 ± 0.307 | 0.273 ± 0.209 | 0.409 ± 0.248 | 0.0 ± 0.0 | 0.818 ± 0.354 | 0.409 ± 0.242 | 0.681 ± 0.336 | 0.273 ± 0.17 | 0.681 ± 0.22 | 0.0 ± 0.0 | 0.273 ± 0.192 | 0.545 ± 0.268 | 0.818 ± 0.369 | 0.954 ± 0.329 | 0.818 ± 0.311 | 0.273 ± 0.198 | 1.09 ± 0.37 | 0.0 ± 0.0 |
| Tyr | 2.317 ± 0.54 | 0.273 ± 0.277 | 2.589 ± 0.776 | 2.589 ± 0.632 | 1.499 ± 0.561 | 2.044 ± 0.598 | 0.818 ± 0.327 | 3.816 ± 0.928 | 4.77 ± 0.911 | 2.862 ± 0.724 | 1.908 ± 0.512 | 3.271 ± 0.648 | 1.226 ± 0.396 | 0.954 ± 0.326 | 1.363 ± 0.644 | 2.044 ± 0.525 | 3.271 ± 0.84 | 2.862 ± 0.826 | 0.681 ± 0.274 | 1.363 ± 0.449 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 48 proteins (7339 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
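A protein-level bootstrap of this kind can be sketched as follows. The details are assumptions, since the page does not spell out the procedure: here whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The function names and the toy sequences are made up for illustration.

```python
import random
import statistics

def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein list."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)

toy = ["MKKLAE", "MKTAYIAK", "MSLLKE", "MAKKE"]  # made-up sequences
print(dipeptide_freq(toy, "MK"), bootstrap_error(toy, "MK"))
print(dipeptide_freq(toy, "WW"), bootstrap_error(toy, "WW"))
```

Under this scheme a dipeptide that never occurs has zero frequency in every resample, so its error is also zero, which matches the 0.0 ± 0.0 entries in the table.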

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski