Amino acid dipepetide frequency for Lactococcus phage 936 group phage Phi129

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.91AlaAla: 0.91 ± 0.428
0.228AlaCys: 0.228 ± 0.16
3.868AlaAsp: 3.868 ± 0.694
4.55AlaGlu: 4.55 ± 0.796
3.868AlaPhe: 3.868 ± 0.858
4.55AlaGly: 4.55 ± 0.851
0.91AlaHis: 0.91 ± 0.338
4.778AlaIle: 4.778 ± 0.904
6.484AlaLys: 6.484 ± 1.033
6.939AlaLeu: 6.939 ± 1.174
2.503AlaMet: 2.503 ± 0.74
4.436AlaAsn: 4.436 ± 0.785
0.796AlaPro: 0.796 ± 0.371
2.958AlaGln: 2.958 ± 0.631
2.275AlaArg: 2.275 ± 0.485
3.185AlaSer: 3.185 ± 0.8
2.958AlaThr: 2.958 ± 0.608
4.436AlaVal: 4.436 ± 1.121
2.275AlaTrp: 2.275 ± 0.886
1.479AlaTyr: 1.479 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.228CysAla: 0.228 ± 0.156
0.0CysCys: 0.0 ± 0.0
0.228CysAsp: 0.228 ± 0.16
0.228CysGlu: 0.228 ± 0.152
0.114CysPhe: 0.114 ± 0.105
0.91CysGly: 0.91 ± 0.325
0.228CysHis: 0.228 ± 0.141
0.341CysIle: 0.341 ± 0.184
0.683CysLys: 0.683 ± 0.336
0.114CysLeu: 0.114 ± 0.111
0.228CysMet: 0.228 ± 0.159
0.569CysAsn: 0.569 ± 0.263
0.114CysPro: 0.114 ± 0.095
0.228CysGln: 0.228 ± 0.144
0.683CysArg: 0.683 ± 0.283
0.114CysSer: 0.114 ± 0.123
0.341CysThr: 0.341 ± 0.193
0.341CysVal: 0.341 ± 0.204
0.341CysTrp: 0.341 ± 0.194
0.228CysTyr: 0.228 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
2.275AspAla: 2.275 ± 0.609
0.114AspCys: 0.114 ± 0.111
3.185AspAsp: 3.185 ± 0.735
3.526AspGlu: 3.526 ± 0.759
3.754AspPhe: 3.754 ± 0.574
4.095AspGly: 4.095 ± 0.695
1.024AspHis: 1.024 ± 0.357
3.64AspIle: 3.64 ± 0.72
5.574AspLys: 5.574 ± 0.727
6.256AspLeu: 6.256 ± 0.886
1.138AspMet: 1.138 ± 0.248
3.754AspAsn: 3.754 ± 0.639
1.593AspPro: 1.593 ± 0.457
0.683AspGln: 0.683 ± 0.27
1.138AspArg: 1.138 ± 0.385
3.299AspSer: 3.299 ± 0.609
3.981AspThr: 3.981 ± 0.718
3.071AspVal: 3.071 ± 0.682
0.796AspTrp: 0.796 ± 0.223
2.844AspTyr: 2.844 ± 0.594
0.0AspXaa: 0.0 ± 0.0
Glu
4.664GluAla: 4.664 ± 0.681
0.569GluCys: 0.569 ± 0.271
3.413GluAsp: 3.413 ± 0.744
4.891GluGlu: 4.891 ± 0.889
3.526GluPhe: 3.526 ± 0.539
2.161GluGly: 2.161 ± 0.452
1.024GluHis: 1.024 ± 0.349
6.256GluIle: 6.256 ± 0.797
5.233GluLys: 5.233 ± 1.061
10.01GluLeu: 10.01 ± 1.495
2.048GluMet: 2.048 ± 0.395
4.891GluAsn: 4.891 ± 0.934
1.024GluPro: 1.024 ± 0.315
3.754GluGln: 3.754 ± 0.765
3.071GluArg: 3.071 ± 0.615
3.981GluSer: 3.981 ± 0.58
3.868GluThr: 3.868 ± 0.577
4.664GluVal: 4.664 ± 0.719
0.91GluTrp: 0.91 ± 0.276
2.958GluTyr: 2.958 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
2.616PheAla: 2.616 ± 0.524
0.341PheCys: 0.341 ± 0.186
2.844PheAsp: 2.844 ± 0.501
2.73PheGlu: 2.73 ± 0.709
1.706PhePhe: 1.706 ± 0.522
2.161PheGly: 2.161 ± 0.55
0.341PheHis: 0.341 ± 0.282
3.185PheIle: 3.185 ± 0.666
3.868PheLys: 3.868 ± 0.537
2.844PheLeu: 2.844 ± 0.49
0.796PheMet: 0.796 ± 0.293
2.73PheAsn: 2.73 ± 0.712
1.024PhePro: 1.024 ± 0.348
1.706PheGln: 1.706 ± 0.421
1.251PheArg: 1.251 ± 0.365
4.209PheSer: 4.209 ± 0.852
3.185PheThr: 3.185 ± 0.517
2.275PheVal: 2.275 ± 0.458
0.228PheTrp: 0.228 ± 0.133
1.593PheTyr: 1.593 ± 0.426
0.0PheXaa: 0.0 ± 0.0
Gly
3.981GlyAla: 3.981 ± 1.137
0.569GlyCys: 0.569 ± 0.242
3.526GlyAsp: 3.526 ± 0.659
4.095GlyGlu: 4.095 ± 0.574
2.616GlyPhe: 2.616 ± 0.525
4.436GlyGly: 4.436 ± 0.748
0.91GlyHis: 0.91 ± 0.314
4.55GlyIle: 4.55 ± 1.139
6.825GlyLys: 6.825 ± 0.882
6.143GlyLeu: 6.143 ± 1.093
1.365GlyMet: 1.365 ± 0.45
3.868GlyAsn: 3.868 ± 0.594
0.228GlyPro: 0.228 ± 0.159
2.275GlyGln: 2.275 ± 0.417
1.934GlyArg: 1.934 ± 0.321
5.119GlySer: 5.119 ± 0.863
4.095GlyThr: 4.095 ± 0.954
5.574GlyVal: 5.574 ± 1.174
1.251GlyTrp: 1.251 ± 0.37
2.958GlyTyr: 2.958 ± 0.579
0.0GlyXaa: 0.0 ± 0.0
His
0.91HisAla: 0.91 ± 0.268
0.455HisCys: 0.455 ± 0.239
0.683HisAsp: 0.683 ± 0.248
0.569HisGlu: 0.569 ± 0.217
0.455HisPhe: 0.455 ± 0.219
1.706HisGly: 1.706 ± 0.566
0.114HisHis: 0.114 ± 0.106
1.138HisIle: 1.138 ± 0.319
1.024HisLys: 1.024 ± 0.351
0.91HisLeu: 0.91 ± 0.328
0.0HisMet: 0.0 ± 0.0
1.251HisAsn: 1.251 ± 0.393
0.0HisPro: 0.0 ± 0.0
0.228HisGln: 0.228 ± 0.139
0.683HisArg: 0.683 ± 0.285
0.341HisSer: 0.341 ± 0.217
1.251HisThr: 1.251 ± 0.353
0.569HisVal: 0.569 ± 0.207
0.0HisTrp: 0.0 ± 0.0
0.569HisTyr: 0.569 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
5.574IleAla: 5.574 ± 1.046
0.0IleCys: 0.0 ± 0.0
3.64IleAsp: 3.64 ± 0.557
7.28IleGlu: 7.28 ± 0.899
3.185IlePhe: 3.185 ± 0.563
3.981IleGly: 3.981 ± 0.984
0.796IleHis: 0.796 ± 0.333
4.55IleIle: 4.55 ± 0.708
6.939IleLys: 6.939 ± 0.998
4.55IleLeu: 4.55 ± 0.729
1.934IleMet: 1.934 ± 0.446
5.005IleAsn: 5.005 ± 0.564
2.048IlePro: 2.048 ± 0.472
2.844IleGln: 2.844 ± 0.442
1.479IleArg: 1.479 ± 0.365
3.64IleSer: 3.64 ± 0.657
5.119IleThr: 5.119 ± 0.794
4.436IleVal: 4.436 ± 0.615
1.138IleTrp: 1.138 ± 0.314
2.503IleTyr: 2.503 ± 0.543
0.0IleXaa: 0.0 ± 0.0
Lys
6.711LysAla: 6.711 ± 0.996
0.683LysCys: 0.683 ± 0.267
4.436LysAsp: 4.436 ± 0.604
8.19LysGlu: 8.19 ± 1.518
2.161LysPhe: 2.161 ± 0.499
6.37LysGly: 6.37 ± 0.787
1.251LysHis: 1.251 ± 0.433
5.801LysIle: 5.801 ± 0.847
9.669LysLys: 9.669 ± 1.162
7.621LysLeu: 7.621 ± 0.675
2.958LysMet: 2.958 ± 0.482
5.801LysAsn: 5.801 ± 0.725
1.251LysPro: 1.251 ± 0.41
3.526LysGln: 3.526 ± 0.71
3.185LysArg: 3.185 ± 0.725
4.891LysSer: 4.891 ± 0.819
5.46LysThr: 5.46 ± 0.737
5.801LysVal: 5.801 ± 0.812
1.365LysTrp: 1.365 ± 0.346
3.185LysTyr: 3.185 ± 0.674
0.0LysXaa: 0.0 ± 0.0
Leu
5.574LeuAla: 5.574 ± 0.668
0.341LeuCys: 0.341 ± 0.169
4.778LeuAsp: 4.778 ± 0.613
5.574LeuGlu: 5.574 ± 0.73
3.185LeuPhe: 3.185 ± 0.559
5.005LeuGly: 5.005 ± 0.943
1.138LeuHis: 1.138 ± 0.354
7.28LeuIle: 7.28 ± 0.818
7.963LeuLys: 7.963 ± 0.764
6.711LeuLeu: 6.711 ± 1.075
1.706LeuMet: 1.706 ± 0.494
5.005LeuAsn: 5.005 ± 0.975
3.071LeuPro: 3.071 ± 0.567
3.185LeuGln: 3.185 ± 0.452
3.071LeuArg: 3.071 ± 0.681
5.005LeuSer: 5.005 ± 0.651
6.256LeuThr: 6.256 ± 0.775
6.143LeuVal: 6.143 ± 0.73
1.593LeuTrp: 1.593 ± 0.407
4.436LeuTyr: 4.436 ± 0.831
0.0LeuXaa: 0.0 ± 0.0
Met
2.161MetAla: 2.161 ± 0.566
0.114MetCys: 0.114 ± 0.123
1.251MetAsp: 1.251 ± 0.381
1.934MetGlu: 1.934 ± 0.535
0.455MetPhe: 0.455 ± 0.19
1.138MetGly: 1.138 ± 0.367
0.228MetHis: 0.228 ± 0.175
2.275MetIle: 2.275 ± 0.447
2.958MetLys: 2.958 ± 0.547
1.024MetLeu: 1.024 ± 0.304
0.455MetMet: 0.455 ± 0.252
2.048MetAsn: 2.048 ± 0.535
0.341MetPro: 0.341 ± 0.211
1.934MetGln: 1.934 ± 0.395
0.341MetArg: 0.341 ± 0.182
1.479MetSer: 1.479 ± 0.408
1.82MetThr: 1.82 ± 0.499
0.91MetVal: 0.91 ± 0.292
0.114MetTrp: 0.114 ± 0.115
0.91MetTyr: 0.91 ± 0.374
0.0MetXaa: 0.0 ± 0.0
Asn
5.801AsnAla: 5.801 ± 0.982
0.228AsnCys: 0.228 ± 0.152
4.55AsnAsp: 4.55 ± 0.795
4.778AsnGlu: 4.778 ± 0.887
1.82AsnPhe: 1.82 ± 0.47
6.029AsnGly: 6.029 ± 0.977
0.569AsnHis: 0.569 ± 0.251
4.323AsnIle: 4.323 ± 0.687
5.005AsnLys: 5.005 ± 1.036
6.143AsnLeu: 6.143 ± 0.75
1.138AsnMet: 1.138 ± 0.313
4.323AsnAsn: 4.323 ± 1.019
2.161AsnPro: 2.161 ± 0.477
1.934AsnGln: 1.934 ± 0.508
1.934AsnArg: 1.934 ± 0.38
4.664AsnSer: 4.664 ± 0.539
3.981AsnThr: 3.981 ± 0.735
3.64AsnVal: 3.64 ± 0.605
0.91AsnTrp: 0.91 ± 0.365
2.73AsnTyr: 2.73 ± 0.705
0.0AsnXaa: 0.0 ± 0.0
Pro
1.251ProAla: 1.251 ± 0.352
0.114ProCys: 0.114 ± 0.131
1.479ProAsp: 1.479 ± 0.469
1.934ProGlu: 1.934 ± 0.473
0.91ProPhe: 0.91 ± 0.279
0.228ProGly: 0.228 ± 0.165
0.0ProHis: 0.0 ± 0.0
1.593ProIle: 1.593 ± 0.45
1.706ProLys: 1.706 ± 0.416
1.82ProLeu: 1.82 ± 0.343
0.569ProMet: 0.569 ± 0.256
2.389ProAsn: 2.389 ± 0.836
0.683ProPro: 0.683 ± 0.277
0.683ProGln: 0.683 ± 0.332
0.455ProArg: 0.455 ± 0.178
0.91ProSer: 0.91 ± 0.295
2.389ProThr: 2.389 ± 0.496
1.593ProVal: 1.593 ± 0.357
0.114ProTrp: 0.114 ± 0.131
0.91ProTyr: 0.91 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
3.413GlnAla: 3.413 ± 0.74
0.228GlnCys: 0.228 ± 0.162
2.048GlnAsp: 2.048 ± 0.556
2.73GlnGlu: 2.73 ± 0.485
1.365GlnPhe: 1.365 ± 0.455
2.844GlnGly: 2.844 ± 0.567
0.455GlnHis: 0.455 ± 0.202
1.251GlnIle: 1.251 ± 0.271
2.958GlnLys: 2.958 ± 0.568
2.958GlnLeu: 2.958 ± 0.486
1.138GlnMet: 1.138 ± 0.312
1.593GlnAsn: 1.593 ± 0.338
1.138GlnPro: 1.138 ± 0.3
1.593GlnGln: 1.593 ± 0.499
1.934GlnArg: 1.934 ± 0.499
2.616GlnSer: 2.616 ± 0.486
2.844GlnThr: 2.844 ± 0.603
2.275GlnVal: 2.275 ± 0.512
0.796GlnTrp: 0.796 ± 0.23
1.365GlnTyr: 1.365 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
2.048ArgAla: 2.048 ± 0.489
0.228ArgCys: 0.228 ± 0.167
1.934ArgAsp: 1.934 ± 0.503
1.82ArgGlu: 1.82 ± 0.344
0.569ArgPhe: 0.569 ± 0.301
2.161ArgGly: 2.161 ± 0.37
0.91ArgHis: 0.91 ± 0.276
1.934ArgIle: 1.934 ± 0.446
3.64ArgLys: 3.64 ± 0.759
3.64ArgLeu: 3.64 ± 0.702
0.796ArgMet: 0.796 ± 0.397
2.503ArgAsn: 2.503 ± 0.591
0.569ArgPro: 0.569 ± 0.235
1.593ArgGln: 1.593 ± 0.384
2.275ArgArg: 2.275 ± 0.591
1.593ArgSer: 1.593 ± 0.443
1.82ArgThr: 1.82 ± 0.409
1.82ArgVal: 1.82 ± 0.472
0.341ArgTrp: 0.341 ± 0.234
1.82ArgTyr: 1.82 ± 0.504
0.0ArgXaa: 0.0 ± 0.0
Ser
4.55SerAla: 4.55 ± 1.201
0.683SerCys: 0.683 ± 0.323
3.413SerAsp: 3.413 ± 0.542
3.754SerGlu: 3.754 ± 0.615
2.844SerPhe: 2.844 ± 0.495
6.143SerGly: 6.143 ± 1.609
0.683SerHis: 0.683 ± 0.239
5.005SerIle: 5.005 ± 0.824
4.209SerLys: 4.209 ± 0.689
6.143SerLeu: 6.143 ± 0.94
1.365SerMet: 1.365 ± 0.335
3.754SerAsn: 3.754 ± 0.622
1.479SerPro: 1.479 ± 0.428
1.706SerGln: 1.706 ± 0.368
2.389SerArg: 2.389 ± 0.514
5.801SerSer: 5.801 ± 1.305
2.844SerThr: 2.844 ± 0.597
4.209SerVal: 4.209 ± 0.903
1.251SerTrp: 1.251 ± 0.321
1.82SerTyr: 1.82 ± 0.48
0.0SerXaa: 0.0 ± 0.0
Thr
5.801ThrAla: 5.801 ± 0.865
0.228ThrCys: 0.228 ± 0.17
3.185ThrAsp: 3.185 ± 0.571
5.574ThrGlu: 5.574 ± 0.741
2.844ThrPhe: 2.844 ± 0.539
4.891ThrGly: 4.891 ± 0.676
0.114ThrHis: 0.114 ± 0.115
4.209ThrIle: 4.209 ± 0.693
5.46ThrLys: 5.46 ± 0.636
5.005ThrLeu: 5.005 ± 0.693
1.024ThrMet: 1.024 ± 0.307
4.436ThrAsn: 4.436 ± 0.619
1.593ThrPro: 1.593 ± 0.297
2.844ThrGln: 2.844 ± 0.492
1.593ThrArg: 1.593 ± 0.443
4.095ThrSer: 4.095 ± 0.633
4.209ThrThr: 4.209 ± 0.719
5.005ThrVal: 5.005 ± 0.892
1.251ThrTrp: 1.251 ± 0.317
1.706ThrTyr: 1.706 ± 0.423
0.0ThrXaa: 0.0 ± 0.0
Val
3.754ValAla: 3.754 ± 0.599
0.569ValCys: 0.569 ± 0.277
4.323ValAsp: 4.323 ± 0.646
4.095ValGlu: 4.095 ± 0.603
2.73ValPhe: 2.73 ± 0.458
3.413ValGly: 3.413 ± 0.534
0.569ValHis: 0.569 ± 0.217
5.233ValIle: 5.233 ± 0.568
6.143ValLys: 6.143 ± 0.809
3.64ValLeu: 3.64 ± 0.489
1.706ValMet: 1.706 ± 0.527
3.299ValAsn: 3.299 ± 0.661
1.593ValPro: 1.593 ± 0.42
1.82ValGln: 1.82 ± 0.511
2.958ValArg: 2.958 ± 0.683
6.37ValSer: 6.37 ± 1.496
5.119ValThr: 5.119 ± 0.869
3.185ValVal: 3.185 ± 0.575
0.341ValTrp: 0.341 ± 0.193
2.73ValTyr: 2.73 ± 0.552
0.0ValXaa: 0.0 ± 0.0
Trp
0.683TrpAla: 0.683 ± 0.25
0.228TrpCys: 0.228 ± 0.166
0.91TrpAsp: 0.91 ± 0.387
0.796TrpGlu: 0.796 ± 0.301
1.251TrpPhe: 1.251 ± 0.636
1.024TrpGly: 1.024 ± 0.31
0.228TrpHis: 0.228 ± 0.144
0.683TrpIle: 0.683 ± 0.328
1.251TrpLys: 1.251 ± 0.315
1.138TrpLeu: 1.138 ± 0.346
0.341TrpMet: 0.341 ± 0.169
1.82TrpAsn: 1.82 ± 0.481
0.0TrpPro: 0.0 ± 0.0
0.91TrpGln: 0.91 ± 0.303
0.341TrpArg: 0.341 ± 0.291
1.251TrpSer: 1.251 ± 0.299
0.91TrpThr: 0.91 ± 0.331
0.569TrpVal: 0.569 ± 0.217
0.228TrpTrp: 0.228 ± 0.14
1.024TrpTyr: 1.024 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.479TyrAla: 1.479 ± 0.5
0.341TyrCys: 0.341 ± 0.319
2.048TyrAsp: 2.048 ± 0.632
3.868TyrGlu: 3.868 ± 0.814
2.275TyrPhe: 2.275 ± 0.565
3.071TyrGly: 3.071 ± 0.53
1.365TyrHis: 1.365 ± 0.416
2.616TyrIle: 2.616 ± 0.637
2.844TyrLys: 2.844 ± 0.826
3.413TyrLeu: 3.413 ± 0.803
0.569TyrMet: 0.569 ± 0.257
3.185TyrAsn: 3.185 ± 0.532
1.024TyrPro: 1.024 ± 0.403
1.138TyrGln: 1.138 ± 0.593
1.024TyrArg: 1.024 ± 0.335
1.479TyrSer: 1.479 ± 0.455
2.616TyrThr: 2.616 ± 0.595
3.071TyrVal: 3.071 ± 0.582
0.341TyrTrp: 0.341 ± 0.184
2.048TyrTyr: 2.048 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (8792 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski