Amino acid dipepetide frequency for Lactococcus phage 5205F

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.319AlaAla: 0.319 ± 0.222
0.159AlaCys: 0.159 ± 0.162
3.984AlaAsp: 3.984 ± 0.524
5.578AlaGlu: 5.578 ± 1.071
2.55AlaPhe: 2.55 ± 0.503
3.028AlaGly: 3.028 ± 0.736
0.319AlaHis: 0.319 ± 0.218
4.303AlaIle: 4.303 ± 0.913
5.737AlaLys: 5.737 ± 1.163
7.649AlaLeu: 7.649 ± 1.568
1.753AlaMet: 1.753 ± 0.407
5.259AlaAsn: 5.259 ± 0.793
0.956AlaPro: 0.956 ± 0.391
3.347AlaGln: 3.347 ± 1.064
2.072AlaArg: 2.072 ± 0.561
3.506AlaSer: 3.506 ± 0.678
4.143AlaThr: 4.143 ± 1.141
4.143AlaVal: 4.143 ± 1.3
1.116AlaTrp: 1.116 ± 0.417
3.347AlaTyr: 3.347 ± 0.977
0.0AlaXaa: 0.0 ± 0.0
Cys
0.478CysAla: 0.478 ± 0.282
0.159CysCys: 0.159 ± 0.149
0.159CysAsp: 0.159 ± 0.157
0.637CysGlu: 0.637 ± 0.319
0.319CysPhe: 0.319 ± 0.222
0.797CysGly: 0.797 ± 0.355
0.637CysHis: 0.637 ± 0.507
0.159CysIle: 0.159 ± 0.176
1.116CysLys: 1.116 ± 0.504
1.116CysLeu: 1.116 ± 0.56
0.0CysMet: 0.0 ± 0.0
0.478CysAsn: 0.478 ± 0.338
0.319CysPro: 0.319 ± 0.2
0.159CysGln: 0.159 ± 0.161
0.159CysArg: 0.159 ± 0.133
0.637CysSer: 0.637 ± 0.298
0.797CysThr: 0.797 ± 0.436
0.478CysVal: 0.478 ± 0.341
0.319CysTrp: 0.319 ± 0.206
0.478CysTyr: 0.478 ± 0.257
0.0CysXaa: 0.0 ± 0.0
Asp
3.665AspAla: 3.665 ± 0.94
0.159AspCys: 0.159 ± 0.133
3.665AspAsp: 3.665 ± 0.868
4.303AspGlu: 4.303 ± 0.851
4.462AspPhe: 4.462 ± 0.86
4.781AspGly: 4.781 ± 1.125
0.159AspHis: 0.159 ± 0.167
4.462AspIle: 4.462 ± 0.868
5.259AspLys: 5.259 ± 0.887
5.737AspLeu: 5.737 ± 0.876
2.231AspMet: 2.231 ± 0.608
3.506AspAsn: 3.506 ± 0.745
1.434AspPro: 1.434 ± 0.336
0.956AspGln: 0.956 ± 0.421
1.594AspArg: 1.594 ± 0.451
3.506AspSer: 3.506 ± 0.636
3.347AspThr: 3.347 ± 0.688
2.231AspVal: 2.231 ± 0.515
1.434AspTrp: 1.434 ± 0.503
4.143AspTyr: 4.143 ± 1.006
0.0AspXaa: 0.0 ± 0.0
Glu
3.187GluAla: 3.187 ± 0.577
1.275GluCys: 1.275 ± 0.461
3.506GluAsp: 3.506 ± 0.573
5.737GluGlu: 5.737 ± 1.26
4.462GluPhe: 4.462 ± 0.961
3.187GluGly: 3.187 ± 0.529
1.434GluHis: 1.434 ± 0.542
4.622GluIle: 4.622 ± 0.712
4.94GluLys: 4.94 ± 0.987
6.534GluLeu: 6.534 ± 0.865
3.506GluMet: 3.506 ± 0.74
3.665GluAsn: 3.665 ± 0.616
0.956GluPro: 0.956 ± 0.424
4.143GluGln: 4.143 ± 0.671
3.347GluArg: 3.347 ± 0.957
2.231GluSer: 2.231 ± 0.548
5.737GluThr: 5.737 ± 0.932
5.1GluVal: 5.1 ± 0.958
1.275GluTrp: 1.275 ± 0.411
2.39GluTyr: 2.39 ± 0.758
0.0GluXaa: 0.0 ± 0.0
Phe
2.231PheAla: 2.231 ± 0.506
0.797PheCys: 0.797 ± 0.383
3.187PheAsp: 3.187 ± 0.656
2.55PheGlu: 2.55 ± 0.684
1.116PhePhe: 1.116 ± 0.401
3.347PheGly: 3.347 ± 0.48
0.319PheHis: 0.319 ± 0.18
3.665PheIle: 3.665 ± 0.976
4.781PheLys: 4.781 ± 0.917
2.39PheLeu: 2.39 ± 0.439
1.116PheMet: 1.116 ± 0.408
4.303PheAsn: 4.303 ± 0.805
1.116PhePro: 1.116 ± 0.442
0.797PheGln: 0.797 ± 0.306
2.072PheArg: 2.072 ± 0.537
2.55PheSer: 2.55 ± 0.569
3.347PheThr: 3.347 ± 0.68
2.709PheVal: 2.709 ± 0.58
0.797PheTrp: 0.797 ± 0.346
2.39PheTyr: 2.39 ± 0.649
0.0PheXaa: 0.0 ± 0.0
Gly
5.259GlyAla: 5.259 ± 1.052
0.159GlyCys: 0.159 ± 0.149
3.825GlyAsp: 3.825 ± 0.932
2.869GlyGlu: 2.869 ± 0.714
4.622GlyPhe: 4.622 ± 1.164
4.781GlyGly: 4.781 ± 0.75
0.478GlyHis: 0.478 ± 0.323
4.143GlyIle: 4.143 ± 0.873
4.781GlyLys: 4.781 ± 0.846
5.896GlyLeu: 5.896 ± 1.026
1.912GlyMet: 1.912 ± 0.572
3.347GlyAsn: 3.347 ± 0.658
0.637GlyPro: 0.637 ± 0.241
2.55GlyGln: 2.55 ± 0.789
3.028GlyArg: 3.028 ± 0.565
5.578GlySer: 5.578 ± 0.914
5.737GlyThr: 5.737 ± 0.955
5.259GlyVal: 5.259 ± 0.983
1.116GlyTrp: 1.116 ± 0.509
3.187GlyTyr: 3.187 ± 0.743
0.0GlyXaa: 0.0 ± 0.0
His
0.797HisAla: 0.797 ± 0.402
0.159HisCys: 0.159 ± 0.171
0.637HisAsp: 0.637 ± 0.285
0.478HisGlu: 0.478 ± 0.336
0.797HisPhe: 0.797 ± 0.419
0.478HisGly: 0.478 ± 0.265
0.159HisHis: 0.159 ± 0.171
0.956HisIle: 0.956 ± 0.37
1.275HisLys: 1.275 ± 0.443
1.434HisLeu: 1.434 ± 0.5
0.159HisMet: 0.159 ± 0.165
0.637HisAsn: 0.637 ± 0.3
0.319HisPro: 0.319 ± 0.207
0.319HisGln: 0.319 ± 0.234
0.478HisArg: 0.478 ± 0.258
0.797HisSer: 0.797 ± 0.335
0.159HisThr: 0.159 ± 0.148
0.637HisVal: 0.637 ± 0.315
0.319HisTrp: 0.319 ± 0.217
0.797HisTyr: 0.797 ± 0.352
0.0HisXaa: 0.0 ± 0.0
Ile
3.825IleAla: 3.825 ± 0.621
0.319IleCys: 0.319 ± 0.206
5.737IleAsp: 5.737 ± 0.997
6.375IleGlu: 6.375 ± 0.876
2.231IlePhe: 2.231 ± 0.704
4.622IleGly: 4.622 ± 0.745
0.956IleHis: 0.956 ± 0.457
3.984IleIle: 3.984 ± 0.863
4.781IleLys: 4.781 ± 0.682
3.984IleLeu: 3.984 ± 1.009
1.753IleMet: 1.753 ± 0.518
5.418IleAsn: 5.418 ± 0.934
1.912IlePro: 1.912 ± 0.381
2.39IleGln: 2.39 ± 0.581
1.912IleArg: 1.912 ± 0.58
2.709IleSer: 2.709 ± 0.522
3.984IleThr: 3.984 ± 0.77
2.231IleVal: 2.231 ± 0.535
0.478IleTrp: 0.478 ± 0.249
2.55IleTyr: 2.55 ± 0.548
0.0IleXaa: 0.0 ± 0.0
Lys
7.968LysAla: 7.968 ± 1.003
0.956LysCys: 0.956 ± 0.37
5.259LysAsp: 5.259 ± 1.313
8.287LysGlu: 8.287 ± 1.456
2.709LysPhe: 2.709 ± 0.531
7.331LysGly: 7.331 ± 0.916
0.797LysHis: 0.797 ± 0.317
4.622LysIle: 4.622 ± 1.368
6.853LysLys: 6.853 ± 1.284
6.215LysLeu: 6.215 ± 1.078
3.347LysMet: 3.347 ± 0.674
4.462LysAsn: 4.462 ± 0.654
2.869LysPro: 2.869 ± 0.732
3.984LysGln: 3.984 ± 0.709
3.506LysArg: 3.506 ± 0.788
3.825LysSer: 3.825 ± 0.622
4.143LysThr: 4.143 ± 0.902
7.49LysVal: 7.49 ± 1.14
0.956LysTrp: 0.956 ± 0.4
3.506LysTyr: 3.506 ± 0.793
0.0LysXaa: 0.0 ± 0.0
Leu
4.462LeuAla: 4.462 ± 1.098
1.116LeuCys: 1.116 ± 0.391
3.506LeuAsp: 3.506 ± 0.715
7.809LeuGlu: 7.809 ± 1.277
3.028LeuPhe: 3.028 ± 0.723
6.056LeuGly: 6.056 ± 1.485
1.434LeuHis: 1.434 ± 0.587
4.94LeuIle: 4.94 ± 1.015
8.127LeuLys: 8.127 ± 1.057
6.853LeuLeu: 6.853 ± 1.042
2.55LeuMet: 2.55 ± 1.152
6.056LeuAsn: 6.056 ± 1.09
2.869LeuPro: 2.869 ± 0.585
4.462LeuGln: 4.462 ± 0.871
1.753LeuArg: 1.753 ± 0.715
5.1LeuSer: 5.1 ± 1.032
5.578LeuThr: 5.578 ± 0.921
3.506LeuVal: 3.506 ± 0.762
0.319LeuTrp: 0.319 ± 0.208
2.709LeuTyr: 2.709 ± 0.693
0.0LeuXaa: 0.0 ± 0.0
Met
3.347MetAla: 3.347 ± 0.746
0.159MetCys: 0.159 ± 0.157
1.275MetAsp: 1.275 ± 0.428
2.072MetGlu: 2.072 ± 0.653
1.116MetPhe: 1.116 ± 0.466
1.116MetGly: 1.116 ± 0.345
0.0MetHis: 0.0 ± 0.0
2.709MetIle: 2.709 ± 0.587
3.347MetLys: 3.347 ± 0.859
2.39MetLeu: 2.39 ± 0.549
0.797MetMet: 0.797 ± 0.27
1.434MetAsn: 1.434 ± 0.564
0.637MetPro: 0.637 ± 0.341
1.434MetGln: 1.434 ± 0.486
0.797MetArg: 0.797 ± 0.45
1.594MetSer: 1.594 ± 0.43
1.912MetThr: 1.912 ± 0.716
1.753MetVal: 1.753 ± 0.526
0.159MetTrp: 0.159 ± 0.166
0.956MetTyr: 0.956 ± 0.651
0.0MetXaa: 0.0 ± 0.0
Asn
4.94AsnAla: 4.94 ± 1.331
0.956AsnCys: 0.956 ± 0.387
4.143AsnAsp: 4.143 ± 1.069
5.418AsnGlu: 5.418 ± 0.908
2.869AsnPhe: 2.869 ± 0.724
5.259AsnGly: 5.259 ± 0.734
0.319AsnHis: 0.319 ± 0.223
3.506AsnIle: 3.506 ± 0.826
6.853AsnLys: 6.853 ± 1.186
4.622AsnLeu: 4.622 ± 0.733
1.753AsnMet: 1.753 ± 0.425
4.622AsnAsn: 4.622 ± 0.931
2.709AsnPro: 2.709 ± 0.945
2.55AsnGln: 2.55 ± 0.656
1.753AsnArg: 1.753 ± 0.472
2.709AsnSer: 2.709 ± 0.545
3.984AsnThr: 3.984 ± 0.877
4.462AsnVal: 4.462 ± 0.678
1.116AsnTrp: 1.116 ± 0.429
2.231AsnTyr: 2.231 ± 0.485
0.0AsnXaa: 0.0 ± 0.0
Pro
0.956ProAla: 0.956 ± 0.359
0.0ProCys: 0.0 ± 0.0
2.231ProAsp: 2.231 ± 0.47
0.797ProGlu: 0.797 ± 0.286
1.434ProPhe: 1.434 ± 0.523
0.319ProGly: 0.319 ± 0.267
0.0ProHis: 0.0 ± 0.0
1.434ProIle: 1.434 ± 0.423
2.709ProLys: 2.709 ± 0.617
2.869ProLeu: 2.869 ± 0.536
0.319ProMet: 0.319 ± 0.262
2.39ProAsn: 2.39 ± 0.851
0.956ProPro: 0.956 ± 0.394
1.912ProGln: 1.912 ± 0.593
0.797ProArg: 0.797 ± 0.337
1.594ProSer: 1.594 ± 0.544
1.594ProThr: 1.594 ± 0.418
1.912ProVal: 1.912 ± 0.708
0.159ProTrp: 0.159 ± 0.209
1.594ProTyr: 1.594 ± 0.442
0.0ProXaa: 0.0 ± 0.0
Gln
3.984GlnAla: 3.984 ± 0.964
0.319GlnCys: 0.319 ± 0.218
1.434GlnAsp: 1.434 ± 0.576
2.072GlnGlu: 2.072 ± 0.511
2.072GlnPhe: 2.072 ± 0.578
2.55GlnGly: 2.55 ± 0.63
0.478GlnHis: 0.478 ± 0.254
3.506GlnIle: 3.506 ± 0.558
3.984GlnLys: 3.984 ± 1.012
2.709GlnLeu: 2.709 ± 0.803
1.753GlnMet: 1.753 ± 0.525
2.39GlnAsn: 2.39 ± 0.784
0.637GlnPro: 0.637 ± 0.276
2.072GlnGln: 2.072 ± 0.61
1.753GlnArg: 1.753 ± 0.567
3.187GlnSer: 3.187 ± 0.652
2.55GlnThr: 2.55 ± 0.547
1.275GlnVal: 1.275 ± 0.384
0.478GlnTrp: 0.478 ± 0.27
1.594GlnTyr: 1.594 ± 0.492
0.0GlnXaa: 0.0 ± 0.0
Arg
1.434ArgAla: 1.434 ± 0.332
0.478ArgCys: 0.478 ± 0.32
2.072ArgAsp: 2.072 ± 0.509
3.665ArgGlu: 3.665 ± 0.99
1.594ArgPhe: 1.594 ± 0.543
2.39ArgGly: 2.39 ± 0.512
0.319ArgHis: 0.319 ± 0.222
1.434ArgIle: 1.434 ± 0.383
3.347ArgLys: 3.347 ± 0.789
2.39ArgLeu: 2.39 ± 0.566
1.116ArgMet: 1.116 ± 0.516
2.072ArgAsn: 2.072 ± 0.732
1.116ArgPro: 1.116 ± 0.597
0.797ArgGln: 0.797 ± 0.314
0.956ArgArg: 0.956 ± 0.526
1.753ArgSer: 1.753 ± 0.515
2.231ArgThr: 2.231 ± 0.608
2.55ArgVal: 2.55 ± 0.719
0.956ArgTrp: 0.956 ± 0.462
1.275ArgTyr: 1.275 ± 0.469
0.0ArgXaa: 0.0 ± 0.0
Ser
3.347SerAla: 3.347 ± 0.696
0.319SerCys: 0.319 ± 0.322
3.825SerAsp: 3.825 ± 0.769
3.187SerGlu: 3.187 ± 0.74
2.55SerPhe: 2.55 ± 0.493
4.622SerGly: 4.622 ± 1.055
0.956SerHis: 0.956 ± 0.395
2.869SerIle: 2.869 ± 0.738
5.259SerLys: 5.259 ± 0.941
5.259SerLeu: 5.259 ± 0.671
1.116SerMet: 1.116 ± 0.334
3.187SerAsn: 3.187 ± 0.787
0.797SerPro: 0.797 ± 0.362
1.594SerGln: 1.594 ± 0.584
1.753SerArg: 1.753 ± 0.411
4.303SerSer: 4.303 ± 0.702
3.506SerThr: 3.506 ± 0.865
3.984SerVal: 3.984 ± 0.692
0.797SerTrp: 0.797 ± 0.563
2.55SerTyr: 2.55 ± 0.614
0.0SerXaa: 0.0 ± 0.0
Thr
4.303ThrAla: 4.303 ± 0.846
0.159ThrCys: 0.159 ± 0.142
5.259ThrAsp: 5.259 ± 1.291
3.028ThrGlu: 3.028 ± 0.829
2.869ThrPhe: 2.869 ± 0.719
6.375ThrGly: 6.375 ± 1.179
0.637ThrHis: 0.637 ± 0.328
4.143ThrIle: 4.143 ± 0.925
6.693ThrLys: 6.693 ± 0.776
7.012ThrLeu: 7.012 ± 1.197
0.956ThrMet: 0.956 ± 0.421
3.028ThrAsn: 3.028 ± 0.624
2.39ThrPro: 2.39 ± 0.604
2.709ThrGln: 2.709 ± 0.6
1.434ThrArg: 1.434 ± 0.423
3.187ThrSer: 3.187 ± 0.758
3.028ThrThr: 3.028 ± 0.7
3.984ThrVal: 3.984 ± 0.805
0.797ThrTrp: 0.797 ± 0.41
2.39ThrTyr: 2.39 ± 0.699
0.0ThrXaa: 0.0 ± 0.0
Val
2.869ValAla: 2.869 ± 0.595
0.478ValCys: 0.478 ± 0.289
4.781ValAsp: 4.781 ± 0.663
3.187ValGlu: 3.187 ± 0.794
1.912ValPhe: 1.912 ± 0.569
2.869ValGly: 2.869 ± 0.808
0.956ValHis: 0.956 ± 0.306
3.984ValIle: 3.984 ± 0.734
5.259ValLys: 5.259 ± 0.868
3.665ValLeu: 3.665 ± 0.71
1.594ValMet: 1.594 ± 0.526
4.622ValAsn: 4.622 ± 0.949
2.072ValPro: 2.072 ± 0.563
2.55ValGln: 2.55 ± 0.602
3.187ValArg: 3.187 ± 0.667
4.462ValSer: 4.462 ± 1.181
4.303ValThr: 4.303 ± 0.922
2.55ValVal: 2.55 ± 0.7
1.594ValTrp: 1.594 ± 0.409
2.709ValTyr: 2.709 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
1.912TrpAla: 1.912 ± 0.529
0.478TrpCys: 0.478 ± 0.233
0.637TrpAsp: 0.637 ± 0.263
0.637TrpGlu: 0.637 ± 0.297
0.956TrpPhe: 0.956 ± 0.346
1.594TrpGly: 1.594 ± 0.544
0.319TrpHis: 0.319 ± 0.201
0.637TrpIle: 0.637 ± 0.289
0.637TrpLys: 0.637 ± 0.265
1.594TrpLeu: 1.594 ± 0.394
0.159TrpMet: 0.159 ± 0.153
1.434TrpAsn: 1.434 ± 0.662
0.0TrpPro: 0.0 ± 0.0
0.797TrpGln: 0.797 ± 0.346
0.319TrpArg: 0.319 ± 0.211
0.478TrpSer: 0.478 ± 0.302
1.594TrpThr: 1.594 ± 0.581
0.797TrpVal: 0.797 ± 0.417
0.319TrpTrp: 0.319 ± 0.201
0.159TrpTyr: 0.159 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.984TyrAla: 3.984 ± 0.578
0.797TyrCys: 0.797 ± 0.29
1.912TyrAsp: 1.912 ± 0.517
2.709TyrGlu: 2.709 ± 0.543
1.912TyrPhe: 1.912 ± 0.556
3.665TyrGly: 3.665 ± 0.815
1.116TyrHis: 1.116 ± 0.426
2.072TyrIle: 2.072 ± 0.539
3.347TyrLys: 3.347 ± 0.768
1.912TyrLeu: 1.912 ± 0.683
0.956TyrMet: 0.956 ± 0.454
4.462TyrAsn: 4.462 ± 0.937
1.275TyrPro: 1.275 ± 0.487
1.275TyrGln: 1.275 ± 0.355
1.275TyrArg: 1.275 ± 0.525
2.072TyrSer: 2.072 ± 0.521
2.709TyrThr: 2.709 ± 0.531
2.55TyrVal: 2.55 ± 0.552
0.956TyrTrp: 0.956 ± 0.437
2.39TyrTyr: 2.39 ± 0.776
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (6276 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski