Amino acid dipeptide frequency for Lactococcus phage CHPC958

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
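As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences. The function name dipeptide_frequencies, the one-letter input sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the page does not state how its values were normalised.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: normalising by the total number of pairs is an
    assumption, not necessarily the convention used for the table.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (the table uses three-letter codes).
freqs = dipeptide_frequencies(["MKKLA", "MKA"])
```

Here the two toy sequences contain six pairs in total, two of which are MK, so freqs["MK"] is one third of 1000‰.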

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
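The arithmetic behind the expected value can be sketched as follows. The helper names and the simple above/below comparison are assumptions for illustration; the exact rule the page uses to colour cells is not stated. The three sample values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def expected_per_mille(n_residues=N_STANDARD):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000 / n_residues ** 2


def classify(observed_per_mille, expected):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()  # 1000 / 400 = 2.5 per mille, i.e. 0.25%
sample = {"GluLeu": 8.849, "AlaAla": 1.132, "HisHis": 0.0}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
```

With 21 residue types the expectation drops to 1000 / 441, roughly 2.27‰.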

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
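A minimal sketch of the unit conversion, using the GluLeu value from the table. The function names are hypothetical.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


glu_leu = 8.849  # per mille, from the table
fraction = per_mille_to_fraction(glu_leu)
percent = per_mille_to_percent(glu_leu)

# Compare with a tolerance, since floating-point products are not exact.
assert math.isclose(fraction, 0.008849)
assert math.isclose(percent, 0.8849)
```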

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.132 ± 0.488
AlaCys: 0.206 ± 0.159
AlaAsp: 3.293 ± 0.515
AlaGlu: 4.836 ± 0.86
AlaPhe: 3.19 ± 0.666
AlaGly: 4.116 ± 0.584
AlaHis: 0.514 ± 0.248
AlaIle: 4.424 ± 0.88
AlaLys: 5.042 ± 0.856
AlaLeu: 6.276 ± 1.091
AlaMet: 2.058 ± 0.514
AlaAsn: 4.321 ± 0.647
AlaPro: 0.926 ± 0.318
AlaGln: 2.058 ± 0.456
AlaArg: 1.749 ± 0.342
AlaSer: 2.984 ± 0.716
AlaThr: 2.984 ± 0.551
AlaVal: 4.219 ± 1.241
AlaTrp: 2.058 ± 0.742
AlaTyr: 2.264 ± 0.405
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.206 ± 0.151
CysCys: 0.103 ± 0.121
CysAsp: 0.309 ± 0.175
CysGlu: 0.514 ± 0.245
CysPhe: 0.206 ± 0.154
CysGly: 0.926 ± 0.292
CysHis: 0.206 ± 0.157
CysIle: 0.309 ± 0.188
CysLys: 0.926 ± 0.288
CysLeu: 0.309 ± 0.181
CysMet: 0.103 ± 0.104
CysAsn: 0.514 ± 0.271
CysPro: 0.206 ± 0.158
CysGln: 0.412 ± 0.23
CysArg: 0.412 ± 0.186
CysSer: 0.412 ± 0.213
CysThr: 0.103 ± 0.121
CysVal: 0.206 ± 0.123
CysTrp: 0.309 ± 0.173
CysTyr: 0.206 ± 0.15
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.235 ± 0.384
AspCys: 0.103 ± 0.104
AspAsp: 3.498 ± 0.628
AspGlu: 4.116 ± 0.616
AspPhe: 4.116 ± 0.515
AspGly: 4.63 ± 0.697
AspHis: 0.617 ± 0.22
AspIle: 3.91 ± 0.769
AspLys: 5.762 ± 0.705
AspLeu: 6.071 ± 0.808
AspMet: 0.617 ± 0.175
AspAsn: 4.836 ± 0.636
AspPro: 1.44 ± 0.353
AspGln: 0.823 ± 0.254
AspArg: 1.749 ± 0.489
AspSer: 2.984 ± 0.529
AspThr: 4.116 ± 0.63
AspVal: 2.984 ± 0.65
AspTrp: 1.132 ± 0.276
AspTyr: 2.366 ± 0.434
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.601 ± 0.577
GluCys: 0.103 ± 0.108
GluAsp: 3.498 ± 0.584
GluGlu: 5.042 ± 0.741
GluPhe: 3.704 ± 0.606
GluGly: 2.161 ± 0.434
GluHis: 0.823 ± 0.289
GluIle: 6.379 ± 0.993
GluLys: 7.511 ± 1.08
GluLeu: 8.849 ± 1.225
GluMet: 2.572 ± 0.556
GluAsn: 4.939 ± 0.795
GluPro: 2.161 ± 0.638
GluGln: 3.498 ± 0.783
GluArg: 2.881 ± 0.64
GluSer: 3.807 ± 0.487
GluThr: 4.321 ± 0.663
GluVal: 4.63 ± 0.685
GluTrp: 1.338 ± 0.334
GluTyr: 2.984 ± 0.563
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.498 ± 0.609
PheCys: 0.206 ± 0.138
PheAsp: 3.19 ± 0.539
PheGlu: 2.572 ± 0.593
PhePhe: 1.543 ± 0.563
PheGly: 2.366 ± 0.573
PheHis: 0.412 ± 0.188
PheIle: 3.293 ± 0.656
PheLys: 4.219 ± 0.594
PheLeu: 2.881 ± 0.52
PheMet: 1.029 ± 0.261
PheAsn: 2.675 ± 0.71
PhePro: 0.617 ± 0.232
PheGln: 1.029 ± 0.268
PheArg: 1.543 ± 0.394
PheSer: 4.116 ± 0.835
PheThr: 3.087 ± 0.47
PheVal: 2.161 ± 0.428
PheTrp: 0.412 ± 0.207
PheTyr: 1.646 ± 0.322
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.91 ± 0.895
GlyCys: 0.617 ± 0.249
GlyAsp: 2.675 ± 0.69
GlyGlu: 4.116 ± 0.642
GlyPhe: 2.161 ± 0.477
GlyGly: 3.807 ± 0.685
GlyHis: 1.338 ± 0.369
GlyIle: 4.219 ± 1.018
GlyLys: 6.276 ± 0.698
GlyLeu: 5.865 ± 1.052
GlyMet: 1.029 ± 0.291
GlyAsn: 3.395 ± 0.605
GlyPro: 0.206 ± 0.136
GlyGln: 2.058 ± 0.482
GlyArg: 1.749 ± 0.408
GlySer: 5.247 ± 0.962
GlyThr: 3.807 ± 0.754
GlyVal: 5.247 ± 1.205
GlyTrp: 1.235 ± 0.344
GlyTyr: 3.19 ± 0.463
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.72 ± 0.27
HisCys: 0.72 ± 0.366
HisAsp: 0.309 ± 0.171
HisGlu: 0.617 ± 0.279
HisPhe: 0.617 ± 0.276
HisGly: 1.338 ± 0.394
HisHis: 0.0 ± 0.0
HisIle: 1.132 ± 0.375
HisLys: 1.338 ± 0.349
HisLeu: 0.926 ± 0.297
HisMet: 0.103 ± 0.104
HisAsn: 1.44 ± 0.368
HisPro: 0.0 ± 0.0
HisGln: 0.412 ± 0.22
HisArg: 0.514 ± 0.225
HisSer: 0.103 ± 0.095
HisThr: 0.514 ± 0.247
HisVal: 0.72 ± 0.246
HisTrp: 0.103 ± 0.117
HisTyr: 0.617 ± 0.203
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.939 ± 0.694
IleCys: 0.309 ± 0.174
IleAsp: 4.321 ± 0.658
IleGlu: 6.585 ± 0.942
IlePhe: 2.881 ± 0.488
IleGly: 4.116 ± 0.835
IleHis: 1.235 ± 0.306
IleIle: 5.865 ± 0.799
IleLys: 6.688 ± 0.849
IleLeu: 4.939 ± 0.907
IleMet: 1.646 ± 0.504
IleAsn: 5.35 ± 0.668
IlePro: 1.955 ± 0.534
IleGln: 2.469 ± 0.372
IleArg: 2.161 ± 0.448
IleSer: 4.733 ± 0.638
IleThr: 5.556 ± 0.932
IleVal: 4.321 ± 0.754
IleTrp: 1.235 ± 0.439
IleTyr: 2.675 ± 0.537
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.202 ± 0.923
LysCys: 0.617 ± 0.275
LysAsp: 5.042 ± 0.804
LysGlu: 8.437 ± 1.064
LysPhe: 2.058 ± 0.454
LysGly: 6.276 ± 1.014
LysHis: 1.132 ± 0.411
LysIle: 5.865 ± 0.621
LysLys: 8.746 ± 1.098
LysLeu: 6.585 ± 0.707
LysMet: 3.293 ± 0.463
LysAsn: 5.042 ± 0.725
LysPro: 1.852 ± 0.482
LysGln: 3.498 ± 0.726
LysArg: 4.013 ± 0.702
LysSer: 4.733 ± 0.751
LysThr: 6.482 ± 0.951
LysVal: 6.688 ± 0.926
LysTrp: 1.646 ± 0.317
LysTyr: 3.704 ± 0.792
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.219 ± 0.65
LeuCys: 0.412 ± 0.227
LeuAsp: 5.042 ± 0.591
LeuGlu: 6.276 ± 0.609
LeuPhe: 3.601 ± 0.579
LeuGly: 4.939 ± 0.694
LeuHis: 1.235 ± 0.394
LeuIle: 7.202 ± 0.81
LeuLys: 7.717 ± 0.752
LeuLeu: 6.791 ± 1.289
LeuMet: 1.646 ± 0.393
LeuAsn: 4.836 ± 0.778
LeuPro: 2.881 ± 0.571
LeuGln: 2.881 ± 0.486
LeuArg: 2.778 ± 0.458
LeuSer: 5.042 ± 0.835
LeuThr: 5.762 ± 0.645
LeuVal: 5.762 ± 0.72
LeuTrp: 1.543 ± 0.32
LeuTyr: 3.601 ± 0.713
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.955 ± 0.483
MetCys: 0.0 ± 0.0
MetAsp: 1.235 ± 0.411
MetGlu: 1.852 ± 0.507
MetPhe: 0.617 ± 0.345
MetGly: 1.029 ± 0.311
MetHis: 0.309 ± 0.185
MetIle: 2.572 ± 0.491
MetLys: 3.293 ± 0.558
MetLeu: 1.029 ± 0.287
MetMet: 0.309 ± 0.194
MetAsn: 2.161 ± 0.479
MetPro: 0.514 ± 0.239
MetGln: 1.44 ± 0.444
MetArg: 0.412 ± 0.235
MetSer: 1.235 ± 0.308
MetThr: 1.44 ± 0.353
MetVal: 1.029 ± 0.267
MetTrp: 0.206 ± 0.17
MetTyr: 1.132 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.836 ± 1.133
AsnCys: 0.206 ± 0.154
AsnAsp: 4.013 ± 0.666
AsnGlu: 4.63 ± 0.578
AsnPhe: 1.235 ± 0.509
AsnGly: 5.762 ± 0.675
AsnHis: 0.926 ± 0.292
AsnIle: 4.424 ± 0.573
AsnLys: 5.556 ± 0.977
AsnLeu: 6.997 ± 0.878
AsnMet: 1.235 ± 0.312
AsnAsn: 3.807 ± 0.764
AsnPro: 2.572 ± 0.528
AsnGln: 2.161 ± 0.427
AsnArg: 1.749 ± 0.392
AsnSer: 4.733 ± 0.641
AsnThr: 3.91 ± 0.65
AsnVal: 3.395 ± 0.612
AsnTrp: 1.132 ± 0.327
AsnTyr: 2.881 ± 0.651
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.338 ± 0.351
ProCys: 0.206 ± 0.135
ProAsp: 2.366 ± 0.625
ProGlu: 1.646 ± 0.433
ProPhe: 0.823 ± 0.259
ProGly: 0.206 ± 0.141
ProHis: 0.0 ± 0.0
ProIle: 1.44 ± 0.371
ProLys: 2.675 ± 0.647
ProLeu: 1.955 ± 0.386
ProMet: 0.514 ± 0.232
ProAsn: 2.058 ± 0.582
ProPro: 0.617 ± 0.261
ProGln: 0.72 ± 0.248
ProArg: 0.412 ± 0.177
ProSer: 1.543 ± 0.685
ProThr: 2.469 ± 0.429
ProVal: 1.543 ± 0.36
ProTrp: 0.412 ± 0.212
ProTyr: 1.029 ± 0.244
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.19 ± 0.671
GlnCys: 0.206 ± 0.161
GlnAsp: 1.749 ± 0.529
GlnGlu: 2.778 ± 0.535
GlnPhe: 1.029 ± 0.288
GlnGly: 1.955 ± 0.473
GlnHis: 0.309 ± 0.164
GlnIle: 2.058 ± 0.38
GlnLys: 2.469 ± 0.599
GlnLeu: 3.498 ± 0.662
GlnMet: 0.926 ± 0.251
GlnAsn: 2.469 ± 0.511
GlnPro: 1.029 ± 0.306
GlnGln: 2.058 ± 0.464
GlnArg: 1.338 ± 0.347
GlnSer: 2.058 ± 0.402
GlnThr: 2.572 ± 0.475
GlnVal: 2.264 ± 0.46
GlnTrp: 0.72 ± 0.223
GlnTyr: 1.44 ± 0.31
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.749 ± 0.425
ArgCys: 0.412 ± 0.221
ArgAsp: 1.852 ± 0.376
ArgGlu: 2.366 ± 0.525
ArgPhe: 1.029 ± 0.262
ArgGly: 1.543 ± 0.379
ArgHis: 0.309 ± 0.153
ArgIle: 2.984 ± 0.553
ArgLys: 3.91 ± 0.678
ArgLeu: 3.395 ± 0.52
ArgMet: 0.617 ± 0.284
ArgAsn: 2.058 ± 0.508
ArgPro: 0.617 ± 0.232
ArgGln: 1.749 ± 0.315
ArgArg: 1.852 ± 0.492
ArgSer: 1.543 ± 0.398
ArgThr: 1.338 ± 0.378
ArgVal: 1.646 ± 0.401
ArgTrp: 0.412 ± 0.18
ArgTyr: 2.058 ± 0.49
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.527 ± 1.035
SerCys: 0.617 ± 0.308
SerAsp: 3.293 ± 0.573
SerGlu: 4.219 ± 0.561
SerPhe: 3.293 ± 0.635
SerGly: 5.762 ± 1.221
SerHis: 0.617 ± 0.268
SerIle: 4.321 ± 0.625
SerLys: 4.527 ± 0.637
SerLeu: 4.836 ± 0.641
SerMet: 1.955 ± 0.364
SerAsn: 3.498 ± 0.545
SerPro: 1.44 ± 0.37
SerGln: 1.955 ± 0.442
SerArg: 2.675 ± 0.391
SerSer: 4.836 ± 1.2
SerThr: 3.395 ± 0.617
SerVal: 3.91 ± 0.574
SerTrp: 1.029 ± 0.371
SerTyr: 2.264 ± 0.517
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.042 ± 0.761
ThrCys: 0.412 ± 0.241
ThrAsp: 3.601 ± 0.662
ThrGlu: 4.836 ± 0.606
ThrPhe: 3.704 ± 0.627
ThrGly: 5.042 ± 0.718
ThrHis: 0.309 ± 0.216
ThrIle: 4.424 ± 0.707
ThrLys: 5.145 ± 0.593
ThrLeu: 4.836 ± 0.526
ThrMet: 0.823 ± 0.308
ThrAsn: 4.424 ± 0.649
ThrPro: 1.852 ± 0.362
ThrGln: 2.469 ± 0.395
ThrArg: 1.749 ± 0.429
ThrSer: 3.91 ± 0.678
ThrThr: 4.527 ± 0.712
ThrVal: 4.733 ± 0.812
ThrTrp: 1.132 ± 0.252
ThrTyr: 2.264 ± 0.516
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.293 ± 0.442
ValCys: 0.514 ± 0.211
ValAsp: 4.219 ± 0.731
ValGlu: 4.63 ± 0.564
ValPhe: 3.087 ± 0.595
ValGly: 2.881 ± 0.438
ValHis: 0.617 ± 0.233
ValIle: 4.939 ± 0.604
ValLys: 6.997 ± 0.728
ValLeu: 3.293 ± 0.616
ValMet: 1.955 ± 0.374
ValAsn: 2.881 ± 0.522
ValPro: 1.955 ± 0.461
ValGln: 1.749 ± 0.354
ValArg: 2.366 ± 0.481
ValSer: 4.939 ± 1.349
ValThr: 4.527 ± 0.832
ValVal: 3.704 ± 0.66
ValTrp: 0.823 ± 0.392
ValTyr: 3.293 ± 0.571
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.617 ± 0.291
TrpCys: 0.412 ± 0.214
TrpAsp: 0.823 ± 0.392
TrpGlu: 1.029 ± 0.36
TrpPhe: 1.132 ± 0.488
TrpGly: 0.926 ± 0.302
TrpHis: 0.309 ± 0.171
TrpIle: 0.617 ± 0.223
TrpLys: 1.132 ± 0.357
TrpLeu: 1.44 ± 0.43
TrpMet: 0.514 ± 0.201
TrpAsn: 1.852 ± 0.572
TrpPro: 0.103 ± 0.094
TrpGln: 0.926 ± 0.296
TrpArg: 0.617 ± 0.29
TrpSer: 1.646 ± 0.303
TrpThr: 1.338 ± 0.365
TrpVal: 0.514 ± 0.304
TrpTrp: 0.412 ± 0.191
TrpTyr: 1.029 ± 0.305
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.44 ± 0.471
TyrCys: 0.514 ± 0.246
TyrAsp: 2.778 ± 0.628
TyrGlu: 3.704 ± 0.543
TyrPhe: 2.469 ± 0.533
TyrGly: 2.366 ± 0.472
TyrHis: 1.029 ± 0.35
TyrIle: 3.601 ± 0.657
TyrLys: 2.984 ± 0.598
TyrLeu: 3.395 ± 0.702
TyrMet: 0.72 ± 0.231
TyrAsn: 3.601 ± 0.521
TyrPro: 1.029 ± 0.361
TyrGln: 1.749 ± 0.545
TyrArg: 0.823 ± 0.235
TyrSer: 2.366 ± 0.489
TyrThr: 2.984 ± 0.709
TyrVal: 2.984 ± 0.508
TyrTrp: 0.103 ± 0.117
TyrTyr: 2.264 ± 0.478
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (9720 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
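A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are assumptions for this example; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: sample std dev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return statistics.stdev(estimates)


# Toy proteome in one-letter code, for illustration only.
proteins = ["MKKLAEEL", "MSKELLK", "MAEKKLE", "MKLEEKA"]
error = bootstrap_error(proteins, "KL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so neighbouring residues within a protein stay together in every resample.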

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski