Amino acid dipeptide frequency for Lactococcus phage 62502

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
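The counting described above can be sketched in Python as follows. This is only a minimal illustration: the function names `dipeptide_frequencies` and `classify`, the use of one-letter codes, and the choice to normalise by the total number of overlapping dipeptides are assumptions made for this example and are not taken from the database's own code.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 dipeptides.

    Assumption: dipeptides are counted as overlapping pairs inside each
    protein and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=1 / 400):
    """Compare a frequency with the uniform expectation of 0.25%."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"
```

For example, `dipeptide_frequencies(["AAK", "KKA"])` sees the four pairs AA, AK, KK and KA once each, so each of them gets a fraction of 0.25 and every other dipeptide gets 0.0.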

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
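A short Python sketch of this unit conversion, using the LysLys value from the table (9.686‰) as input. The helper names `per_mille_to_fraction` and `fold_vs_uniform` are hypothetical and exist only for this illustration; the uniform reference of 2.5‰ is simply 1/400 expressed in per mille.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 1/400 of all dipeptides = 2.5 per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def fold_vs_uniform(value_per_mille):
    """Ratio of an observed per mille value to the uniform 2.5 per mille."""
    return value_per_mille / UNIFORM_PER_MILLE


lys_lys = 9.686  # LysLys frequency from the table, in per mille
print(per_mille_to_fraction(lys_lys))  # fraction of all dipeptides
print(fold_vs_uniform(lys_lys))        # > 1 means more frequent than uniform
```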

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide. Each cell is frequency±error in ‰.

     Ala          Cys          Asp          Glu          Phe          Gly          His          Ile          Lys          Leu          Met          Asn          Pro          Gln          Arg          Ser          Thr          Val          Trp          Tyr          Xaa
Ala  4.791±0.894  0.208±0.145  4.062±0.69   3.333±0.653  3.021±0.519  3.229±1.106  1.354±0.374  5.729±1.612  5.208±0.831  5.104±0.651  2.083±0.572  3.645±0.574  1.666±0.429  2.812±0.561  2.5±0.477    4.999±0.818  4.791±1.52   4.062±0.857  0.833±0.251  1.979±0.413  0.0±0.0
Cys  0.208±0.142  0.312±0.181  0.312±0.162  0.833±0.269  0.312±0.176  0.729±0.297  0.0±0.0      0.208±0.135  0.312±0.172  0.521±0.233  0.0±0.0      0.208±0.14   0.104±0.103  0.0±0.0      0.208±0.135  0.312±0.18   0.312±0.182  0.208±0.127  0.104±0.098  0.417±0.181  0.0±0.0
Asp  4.062±0.755  0.521±0.243  5.208±1.089  4.999±0.815  3.541±0.487  4.479±0.79   0.625±0.204  3.854±0.658  6.666±1.033  6.041±0.812  2.083±0.509  3.645±0.601  1.458±0.331  0.937±0.292  2.291±0.478  3.541±0.561  2.5±0.513    2.708±0.452  1.146±0.289  3.021±0.568  0.0±0.0
Glu  4.791±0.737  0.417±0.195  3.645±0.775  5.729±1.117  3.541±0.622  2.396±0.432  0.833±0.343  5.833±0.862  6.145±0.954  7.499±1.208  1.458±0.363  5.208±0.62   1.146±0.332  3.645±0.729  2.396±0.665  3.021±0.571  4.583±0.758  5.52±0.758   1.042±0.297  2.187±0.455  0.0±0.0
Phe  2.5±0.513    0.104±0.102  2.812±0.71   3.437±0.63   2.916±1.01   3.645±0.503  0.417±0.186  2.5±0.569    3.541±0.642  3.021±0.527  1.354±0.371  4.062±0.65   1.25±0.367   1.771±0.48   1.562±0.418  3.229±0.547  2.916±0.638  2.396±0.474  0.208±0.136  2.396±0.463  0.0±0.0
Gly  3.333±0.867  0.208±0.133  3.229±0.736  4.791±0.791  4.062±0.879  5.937±1.003  0.417±0.172  6.041±1.077  6.978±0.849  4.791±0.627  1.666±0.375  3.541±0.739  0.833±0.296  2.916±0.586  2.812±0.541  3.437±0.811  5.208±0.779  3.229±0.573  1.042±0.365  2.396±0.475  0.0±0.0
His  1.146±0.394  0.0±0.0      0.833±0.277  1.458±0.41   0.208±0.155  0.833±0.281  0.312±0.24   0.833±0.244  1.458±0.346  0.937±0.316  0.312±0.157  0.937±0.304  0.521±0.227  0.312±0.216  0.312±0.185  0.833±0.322  0.417±0.178  0.625±0.236  0.0±0.0      0.521±0.225  0.0±0.0
Ile  5.833±0.605  0.312±0.174  6.041±0.81   5.208±0.983  3.021±0.65   5.416±1.782  1.354±0.525  4.791±0.778  6.041±0.81   3.333±0.53   1.875±0.404  4.062±0.729  2.604±0.557  2.916±0.461  1.875±0.446  4.895±0.731  5.416±0.878  3.854±0.625  0.729±0.296  2.291±0.393  0.0±0.0
Lys  5.52±0.808   0.312±0.168  6.041±0.67   7.708±0.997  3.437±0.649  5.416±0.58   1.979±0.572  4.791±0.714  9.686±1.028  7.603±1.089  3.125±0.472  4.999±0.762  2.187±0.449  3.125±0.564  3.854±0.648  4.583±0.63   5.312±0.621  5.416±0.855  1.354±0.321  4.895±0.757  0.0±0.0
Leu  5.52±0.687   0.312±0.194  6.041±0.981  6.458±1.216  2.916±0.525  4.687±0.594  0.625±0.229  5.416±0.622  7.708±1.1    4.27±0.739   1.562±0.471  5.833±0.726  2.5±0.655    4.791±0.607  2.916±0.551  4.895±0.651  4.479±0.693  3.645±0.541  0.833±0.285  1.979±0.452  0.0±0.0
Met  1.458±0.312  0.104±0.092  1.354±0.377  1.666±0.376  0.937±0.355  1.25±0.454   0.104±0.102  1.562±0.298  2.187±0.572  1.666±0.386  0.417±0.197  2.187±0.557  1.042±0.284  1.042±0.305  1.458±0.376  1.25±0.355   3.021±0.537  1.354±0.408  0.208±0.128  0.312±0.174  0.0±0.0
Asn  4.791±1.284  0.521±0.225  2.708±0.554  3.645±0.573  2.708±0.464  4.687±0.685  0.833±0.376  4.27±0.697   4.895±0.553  3.958±0.596  1.458±0.323  5.416±0.783  2.291±0.473  4.166±0.715  2.083±0.424  3.125±0.41   2.5±0.437    4.375±0.505  1.042±0.321  3.125±0.759  0.0±0.0
Pro  1.25±0.379   0.104±0.1    2.5±0.469    1.562±0.411  1.458±0.466  0.729±0.427  0.625±0.229  1.666±0.341  2.708±0.572  1.979±0.405  0.521±0.186  1.458±0.49   0.521±0.227  1.666±0.522  0.521±0.197  1.979±0.411  1.458±0.345  2.812±0.413  0.417±0.244  0.729±0.232  0.0±0.0
Gln  3.958±0.513  0.104±0.099  1.146±0.328  2.708±0.618  2.5±0.426    2.708±0.679  0.729±0.251  2.5±0.668    3.021±0.493  4.375±0.693  1.042±0.316  3.229±0.727  1.562±0.353  4.375±1.232  1.979±0.549  2.708±0.625  2.291±0.465  3.229±0.602  0.625±0.217  1.458±0.333  0.0±0.0
Arg  2.291±0.505  0.625±0.259  2.187±0.557  2.291±0.512  1.042±0.325  2.812±0.484  0.0±0.0      3.333±0.519  3.229±0.858  3.437±0.569  1.042±0.353  2.5±0.51     0.625±0.245  1.25±0.415   2.083±0.492  1.771±0.378  2.396±0.374  1.875±0.422  0.625±0.271  2.083±0.426  0.0±0.0
Ser  2.916±0.868  0.104±0.09   4.583±0.757  4.479±0.762  3.333±0.708  5.937±0.878  0.104±0.095  4.583±0.769  6.041±0.715  4.479±0.394  1.354±0.378  3.229±0.54   0.937±0.305  2.916±0.511  2.396±0.402  3.125±0.73   3.125±0.561  2.916±0.47   0.312±0.161  2.5±0.42     0.0±0.0
Thr  4.166±1.043  0.417±0.182  3.645±0.744  4.062±0.647  2.396±0.504  4.791±0.665  0.937±0.257  5.52±0.657   4.375±0.691  5.104±0.815  0.625±0.272  3.854±0.685  2.604±0.747  2.396±0.554  1.875±0.484  3.541±0.498  3.645±0.499  3.854±0.638  0.521±0.205  2.812±0.863  0.0±0.0
Val  3.958±0.688  0.521±0.24   3.541±0.713  4.166±0.596  2.083±0.429  4.166±0.519  0.833±0.272  4.999±0.693  5.937±0.772  4.166±0.649  1.042±0.424  2.083±0.4    1.875±0.395  1.875±0.366  2.604±0.543  3.958±0.64   4.375±0.778  4.27±0.782   0.417±0.212  1.771±0.345  0.0±0.0
Trp  0.417±0.204  0.208±0.142  0.833±0.321  0.417±0.187  0.521±0.244  0.417±0.179  0.104±0.091  1.042±0.33   1.354±0.498  1.666±0.417  0.417±0.209  0.729±0.309  0.0±0.0      1.042±0.413  0.625±0.262  1.146±0.328  0.417±0.18   0.729±0.233  0.104±0.092  0.312±0.142  0.0±0.0
Tyr  2.5±0.512    0.208±0.146  2.604±0.589  2.083±0.436  2.083±0.415  2.604±0.603  0.625±0.232  2.396±0.485  3.854±0.618  3.437±0.609  1.042±0.292  1.875±0.526  0.937±0.36   2.291±0.566  1.146±0.509  3.021±0.466  1.979±0.457  1.562±0.399  0.937±0.382  1.875±0.433  0.0±0.0
Xaa  0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0

Statistics based on 52 proteins (9602 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
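One way such a protein-level bootstrap could be carried out is sketched below in Python. This is an assumption-laden illustration rather than the database's actual procedure: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names `dipeptide_per_mille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation are all choices made for this example.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs.
    """
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Under this scheme a dipeptide that never occurs in any protein gets an error of exactly 0, which is consistent with the 0.0±0.0 entries in the table.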

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski