Amino acid dipepetide frequency for Lactococcus phage ul36.t1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.343AlaAla: 4.343 ± 1.026
0.271AlaCys: 0.271 ± 0.184
4.705AlaAsp: 4.705 ± 0.507
3.167AlaGlu: 3.167 ± 0.551
2.352AlaPhe: 2.352 ± 0.434
5.247AlaGly: 5.247 ± 0.999
1.086AlaHis: 1.086 ± 0.292
4.795AlaIle: 4.795 ± 0.972
4.433AlaLys: 4.433 ± 0.559
6.605AlaLeu: 6.605 ± 1.321
1.719AlaMet: 1.719 ± 0.321
4.162AlaAsn: 4.162 ± 0.664
2.352AlaPro: 2.352 ± 0.533
2.986AlaGln: 2.986 ± 0.391
2.081AlaArg: 2.081 ± 0.505
3.528AlaSer: 3.528 ± 0.575
4.343AlaThr: 4.343 ± 0.811
4.162AlaVal: 4.162 ± 0.558
0.905AlaTrp: 0.905 ± 0.265
2.171AlaTyr: 2.171 ± 0.454
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.103
0.0CysCys: 0.0 ± 0.0
0.452CysAsp: 0.452 ± 0.218
0.814CysGlu: 0.814 ± 0.376
0.452CysPhe: 0.452 ± 0.19
0.814CysGly: 0.814 ± 0.303
0.452CysHis: 0.452 ± 0.315
0.271CysIle: 0.271 ± 0.132
0.362CysLys: 0.362 ± 0.182
0.181CysLeu: 0.181 ± 0.119
0.09CysMet: 0.09 ± 0.094
0.09CysAsn: 0.09 ± 0.104
0.09CysPro: 0.09 ± 0.09
0.0CysGln: 0.0 ± 0.0
0.09CysArg: 0.09 ± 0.093
0.543CysSer: 0.543 ± 0.211
0.181CysThr: 0.181 ± 0.18
0.181CysVal: 0.181 ± 0.115
0.09CysTrp: 0.09 ± 0.094
0.09CysTyr: 0.09 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
3.257AspAla: 3.257 ± 0.647
0.543AspCys: 0.543 ± 0.275
3.619AspAsp: 3.619 ± 0.531
5.247AspGlu: 5.247 ± 0.841
3.89AspPhe: 3.89 ± 0.628
7.147AspGly: 7.147 ± 1.99
0.362AspHis: 0.362 ± 0.178
3.8AspIle: 3.8 ± 0.482
5.79AspLys: 5.79 ± 0.794
4.705AspLeu: 4.705 ± 0.74
1.9AspMet: 1.9 ± 0.559
3.8AspAsn: 3.8 ± 0.428
1.448AspPro: 1.448 ± 0.371
0.633AspGln: 0.633 ± 0.266
2.081AspArg: 2.081 ± 0.425
4.614AspSer: 4.614 ± 0.748
3.076AspThr: 3.076 ± 0.553
3.528AspVal: 3.528 ± 0.664
0.905AspTrp: 0.905 ± 0.237
2.986AspTyr: 2.986 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
4.343GluAla: 4.343 ± 0.794
0.633GluCys: 0.633 ± 0.241
3.076GluAsp: 3.076 ± 0.576
6.062GluGlu: 6.062 ± 1.308
3.438GluPhe: 3.438 ± 0.579
2.171GluGly: 2.171 ± 0.486
0.905GluHis: 0.905 ± 0.287
5.519GluIle: 5.519 ± 0.819
6.333GluLys: 6.333 ± 1.232
7.238GluLeu: 7.238 ± 1.1
1.9GluMet: 1.9 ± 0.466
4.433GluAsn: 4.433 ± 0.836
1.719GluPro: 1.719 ± 0.371
3.528GluGln: 3.528 ± 0.774
2.714GluArg: 2.714 ± 0.647
3.709GluSer: 3.709 ± 0.707
4.252GluThr: 4.252 ± 0.632
4.343GluVal: 4.343 ± 0.644
1.357GluTrp: 1.357 ± 0.419
2.352GluTyr: 2.352 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
2.352PheAla: 2.352 ± 0.452
0.09PheCys: 0.09 ± 0.094
2.895PheAsp: 2.895 ± 0.548
4.162PheGlu: 4.162 ± 0.747
1.719PhePhe: 1.719 ± 0.364
2.081PheGly: 2.081 ± 0.457
0.271PheHis: 0.271 ± 0.134
3.348PheIle: 3.348 ± 0.604
3.528PheLys: 3.528 ± 0.469
2.533PheLeu: 2.533 ± 0.483
1.448PheMet: 1.448 ± 0.385
2.714PheAsn: 2.714 ± 0.428
0.995PhePro: 0.995 ± 0.276
1.176PheGln: 1.176 ± 0.344
1.357PheArg: 1.357 ± 0.497
3.167PheSer: 3.167 ± 0.478
3.528PheThr: 3.528 ± 0.626
2.895PheVal: 2.895 ± 0.587
0.724PheTrp: 0.724 ± 0.248
2.262PheTyr: 2.262 ± 0.511
0.0PheXaa: 0.0 ± 0.0
Gly
2.714GlyAla: 2.714 ± 0.615
0.271GlyCys: 0.271 ± 0.174
3.89GlyAsp: 3.89 ± 0.518
4.162GlyGlu: 4.162 ± 0.661
3.528GlyPhe: 3.528 ± 0.514
4.252GlyGly: 4.252 ± 0.653
0.814GlyHis: 0.814 ± 0.26
6.152GlyIle: 6.152 ± 0.804
5.79GlyLys: 5.79 ± 0.748
4.886GlyLeu: 4.886 ± 0.926
1.267GlyMet: 1.267 ± 0.368
3.709GlyAsn: 3.709 ± 0.632
1.176GlyPro: 1.176 ± 0.403
3.438GlyGln: 3.438 ± 0.475
2.262GlyArg: 2.262 ± 0.423
5.971GlySer: 5.971 ± 0.994
6.333GlyThr: 6.333 ± 1.205
3.89GlyVal: 3.89 ± 0.498
1.176GlyTrp: 1.176 ± 0.46
2.081GlyTyr: 2.081 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
0.905HisAla: 0.905 ± 0.382
0.181HisCys: 0.181 ± 0.128
0.995HisAsp: 0.995 ± 0.321
1.267HisGlu: 1.267 ± 0.368
0.452HisPhe: 0.452 ± 0.184
1.086HisGly: 1.086 ± 0.32
0.362HisHis: 0.362 ± 0.19
0.905HisIle: 0.905 ± 0.271
0.633HisLys: 0.633 ± 0.235
0.905HisLeu: 0.905 ± 0.252
0.271HisMet: 0.271 ± 0.118
0.633HisAsn: 0.633 ± 0.236
0.362HisPro: 0.362 ± 0.186
0.452HisGln: 0.452 ± 0.202
0.362HisArg: 0.362 ± 0.183
0.995HisSer: 0.995 ± 0.327
0.724HisThr: 0.724 ± 0.217
0.633HisVal: 0.633 ± 0.29
0.09HisTrp: 0.09 ± 0.077
0.452HisTyr: 0.452 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
5.247IleAla: 5.247 ± 0.634
0.543IleCys: 0.543 ± 0.201
4.614IleAsp: 4.614 ± 0.676
4.433IleGlu: 4.433 ± 0.804
1.99IlePhe: 1.99 ± 0.432
5.338IleGly: 5.338 ± 0.866
1.086IleHis: 1.086 ± 0.357
5.79IleIle: 5.79 ± 1.661
6.514IleLys: 6.514 ± 0.804
5.066IleLeu: 5.066 ± 0.663
1.448IleMet: 1.448 ± 0.541
4.252IleAsn: 4.252 ± 0.567
3.167IlePro: 3.167 ± 0.747
2.986IleGln: 2.986 ± 0.446
2.624IleArg: 2.624 ± 0.551
6.333IleSer: 6.333 ± 0.864
4.343IleThr: 4.343 ± 0.823
3.709IleVal: 3.709 ± 0.712
0.271IleTrp: 0.271 ± 0.149
1.086IleTyr: 1.086 ± 0.271
0.0IleXaa: 0.0 ± 0.0
Lys
5.7LysAla: 5.7 ± 0.934
0.452LysCys: 0.452 ± 0.234
5.519LysAsp: 5.519 ± 1.022
6.514LysGlu: 6.514 ± 1.216
3.981LysPhe: 3.981 ± 0.707
5.247LysGly: 5.247 ± 0.501
0.814LysHis: 0.814 ± 0.241
5.519LysIle: 5.519 ± 0.79
7.509LysLys: 7.509 ± 1.39
7.147LysLeu: 7.147 ± 0.978
2.805LysMet: 2.805 ± 0.544
5.247LysAsn: 5.247 ± 0.852
2.443LysPro: 2.443 ± 0.365
3.89LysGln: 3.89 ± 0.586
2.714LysArg: 2.714 ± 0.454
4.705LysSer: 4.705 ± 0.824
4.614LysThr: 4.614 ± 0.674
5.428LysVal: 5.428 ± 0.633
1.719LysTrp: 1.719 ± 0.377
2.533LysTyr: 2.533 ± 0.497
0.0LysXaa: 0.0 ± 0.0
Leu
5.519LeuAla: 5.519 ± 0.815
0.271LeuCys: 0.271 ± 0.147
4.343LeuAsp: 4.343 ± 0.686
5.7LeuGlu: 5.7 ± 0.913
3.076LeuPhe: 3.076 ± 0.516
5.338LeuGly: 5.338 ± 0.924
0.543LeuHis: 0.543 ± 0.235
5.066LeuIle: 5.066 ± 1.005
6.695LeuLys: 6.695 ± 0.908
5.7LeuLeu: 5.7 ± 1.057
2.262LeuMet: 2.262 ± 0.578
3.438LeuAsn: 3.438 ± 0.509
4.162LeuPro: 4.162 ± 0.976
2.986LeuGln: 2.986 ± 0.617
2.624LeuArg: 2.624 ± 0.542
6.333LeuSer: 6.333 ± 0.695
6.062LeuThr: 6.062 ± 0.73
3.619LeuVal: 3.619 ± 0.684
1.267LeuTrp: 1.267 ± 0.386
2.171LeuTyr: 2.171 ± 0.514
0.0LeuXaa: 0.0 ± 0.0
Met
1.9MetAla: 1.9 ± 0.443
0.09MetCys: 0.09 ± 0.09
1.629MetAsp: 1.629 ± 0.449
1.719MetGlu: 1.719 ± 0.355
0.814MetPhe: 0.814 ± 0.281
1.538MetGly: 1.538 ± 0.412
0.181MetHis: 0.181 ± 0.119
1.538MetIle: 1.538 ± 0.496
1.99MetLys: 1.99 ± 0.517
1.176MetLeu: 1.176 ± 0.309
0.452MetMet: 0.452 ± 0.219
1.719MetAsn: 1.719 ± 0.366
0.633MetPro: 0.633 ± 0.198
1.176MetGln: 1.176 ± 0.356
1.267MetArg: 1.267 ± 0.423
2.171MetSer: 2.171 ± 0.498
2.533MetThr: 2.533 ± 0.403
1.086MetVal: 1.086 ± 0.258
0.271MetTrp: 0.271 ± 0.164
0.271MetTyr: 0.271 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
3.438AsnAla: 3.438 ± 0.815
0.181AsnCys: 0.181 ± 0.127
4.524AsnAsp: 4.524 ± 0.731
2.714AsnGlu: 2.714 ± 0.532
2.986AsnPhe: 2.986 ± 0.553
6.243AsnGly: 6.243 ± 1.031
0.905AsnHis: 0.905 ± 0.403
4.252AsnIle: 4.252 ± 0.52
3.709AsnLys: 3.709 ± 0.549
3.619AsnLeu: 3.619 ± 0.516
0.905AsnMet: 0.905 ± 0.272
4.071AsnAsn: 4.071 ± 0.739
2.443AsnPro: 2.443 ± 0.519
1.99AsnGln: 1.99 ± 0.45
2.443AsnArg: 2.443 ± 0.499
4.071AsnSer: 4.071 ± 0.711
2.714AsnThr: 2.714 ± 0.431
3.619AsnVal: 3.619 ± 0.562
0.543AsnTrp: 0.543 ± 0.215
2.262AsnTyr: 2.262 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
1.448ProAla: 1.448 ± 0.443
0.0ProCys: 0.0 ± 0.0
2.262ProAsp: 2.262 ± 0.486
1.719ProGlu: 1.719 ± 0.405
0.905ProPhe: 0.905 ± 0.312
0.905ProGly: 0.905 ± 0.383
0.633ProHis: 0.633 ± 0.212
1.809ProIle: 1.809 ± 0.31
2.171ProLys: 2.171 ± 0.419
2.262ProLeu: 2.262 ± 0.656
0.995ProMet: 0.995 ± 0.223
1.086ProAsn: 1.086 ± 0.32
1.086ProPro: 1.086 ± 0.306
2.986ProGln: 2.986 ± 0.617
1.086ProArg: 1.086 ± 0.437
3.348ProSer: 3.348 ± 0.503
3.438ProThr: 3.438 ± 0.678
1.809ProVal: 1.809 ± 0.447
0.452ProTrp: 0.452 ± 0.191
0.995ProTyr: 0.995 ± 0.296
0.0ProXaa: 0.0 ± 0.0
Gln
3.89GlnAla: 3.89 ± 0.858
0.09GlnCys: 0.09 ± 0.094
1.809GlnAsp: 1.809 ± 0.408
3.438GlnGlu: 3.438 ± 0.663
1.9GlnPhe: 1.9 ± 0.436
2.805GlnGly: 2.805 ± 0.509
0.543GlnHis: 0.543 ± 0.224
2.443GlnIle: 2.443 ± 0.545
3.528GlnLys: 3.528 ± 0.525
3.709GlnLeu: 3.709 ± 0.711
1.357GlnMet: 1.357 ± 0.37
2.352GlnAsn: 2.352 ± 0.515
1.176GlnPro: 1.176 ± 0.306
2.262GlnGln: 2.262 ± 0.45
1.809GlnArg: 1.809 ± 0.456
2.081GlnSer: 2.081 ± 0.352
2.443GlnThr: 2.443 ± 0.426
2.624GlnVal: 2.624 ± 0.461
0.452GlnTrp: 0.452 ± 0.254
1.267GlnTyr: 1.267 ± 0.352
0.0GlnXaa: 0.0 ± 0.0
Arg
2.443ArgAla: 2.443 ± 0.477
0.271ArgCys: 0.271 ± 0.158
2.262ArgAsp: 2.262 ± 0.552
2.533ArgGlu: 2.533 ± 0.64
0.995ArgPhe: 0.995 ± 0.306
1.719ArgGly: 1.719 ± 0.39
0.543ArgHis: 0.543 ± 0.259
2.895ArgIle: 2.895 ± 0.494
3.528ArgLys: 3.528 ± 0.553
3.076ArgLeu: 3.076 ± 0.587
0.905ArgMet: 0.905 ± 0.291
1.629ArgAsn: 1.629 ± 0.384
0.814ArgPro: 0.814 ± 0.298
1.719ArgGln: 1.719 ± 0.429
1.448ArgArg: 1.448 ± 0.394
1.176ArgSer: 1.176 ± 0.332
1.538ArgThr: 1.538 ± 0.293
2.533ArgVal: 2.533 ± 0.499
0.543ArgTrp: 0.543 ± 0.222
1.629ArgTyr: 1.629 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
4.614SerAla: 4.614 ± 0.662
0.452SerCys: 0.452 ± 0.181
5.609SerAsp: 5.609 ± 0.846
4.343SerGlu: 4.343 ± 0.586
3.89SerPhe: 3.89 ± 0.473
5.338SerGly: 5.338 ± 0.879
0.633SerHis: 0.633 ± 0.241
4.524SerIle: 4.524 ± 0.726
6.333SerLys: 6.333 ± 0.818
4.795SerLeu: 4.795 ± 0.668
1.176SerMet: 1.176 ± 0.272
5.066SerAsn: 5.066 ± 0.618
1.357SerPro: 1.357 ± 0.445
2.262SerGln: 2.262 ± 0.445
1.99SerArg: 1.99 ± 0.462
5.971SerSer: 5.971 ± 1.086
3.89SerThr: 3.89 ± 0.906
4.433SerVal: 4.433 ± 0.646
1.086SerTrp: 1.086 ± 0.332
3.8SerTyr: 3.8 ± 0.644
0.0SerXaa: 0.0 ± 0.0
Thr
5.79ThrAla: 5.79 ± 1.127
0.633ThrCys: 0.633 ± 0.32
3.619ThrAsp: 3.619 ± 0.781
4.795ThrGlu: 4.795 ± 0.678
1.99ThrPhe: 1.99 ± 0.507
4.976ThrGly: 4.976 ± 0.668
1.448ThrHis: 1.448 ± 0.358
4.343ThrIle: 4.343 ± 0.73
5.7ThrLys: 5.7 ± 0.931
5.519ThrLeu: 5.519 ± 0.768
0.814ThrMet: 0.814 ± 0.195
4.071ThrAsn: 4.071 ± 0.743
3.438ThrPro: 3.438 ± 0.611
2.533ThrGln: 2.533 ± 0.393
1.99ThrArg: 1.99 ± 0.35
4.614ThrSer: 4.614 ± 0.797
6.062ThrThr: 6.062 ± 1.483
4.976ThrVal: 4.976 ± 1.145
0.724ThrTrp: 0.724 ± 0.249
2.262ThrTyr: 2.262 ± 0.887
0.0ThrXaa: 0.0 ± 0.0
Val
3.981ValAla: 3.981 ± 0.486
0.271ValCys: 0.271 ± 0.163
4.343ValAsp: 4.343 ± 0.551
4.524ValGlu: 4.524 ± 0.741
2.171ValPhe: 2.171 ± 0.367
3.167ValGly: 3.167 ± 0.469
0.452ValHis: 0.452 ± 0.193
4.433ValIle: 4.433 ± 0.569
5.7ValLys: 5.7 ± 0.869
3.348ValLeu: 3.348 ± 0.685
1.176ValMet: 1.176 ± 0.264
2.895ValAsn: 2.895 ± 0.614
1.357ValPro: 1.357 ± 0.535
3.257ValGln: 3.257 ± 0.864
1.629ValArg: 1.629 ± 0.397
4.252ValSer: 4.252 ± 0.545
5.79ValThr: 5.79 ± 0.635
3.89ValVal: 3.89 ± 0.719
0.814ValTrp: 0.814 ± 0.33
2.081ValTyr: 2.081 ± 0.487
0.0ValXaa: 0.0 ± 0.0
Trp
0.905TrpAla: 0.905 ± 0.407
0.09TrpCys: 0.09 ± 0.093
0.905TrpAsp: 0.905 ± 0.268
0.633TrpGlu: 0.633 ± 0.241
0.633TrpPhe: 0.633 ± 0.245
0.452TrpGly: 0.452 ± 0.185
0.09TrpHis: 0.09 ± 0.09
0.995TrpIle: 0.995 ± 0.226
1.538TrpLys: 1.538 ± 0.348
1.357TrpLeu: 1.357 ± 0.344
0.452TrpMet: 0.452 ± 0.209
1.086TrpAsn: 1.086 ± 0.335
0.181TrpPro: 0.181 ± 0.106
0.452TrpGln: 0.452 ± 0.202
0.543TrpArg: 0.543 ± 0.195
1.086TrpSer: 1.086 ± 0.469
1.176TrpThr: 1.176 ± 0.773
0.724TrpVal: 0.724 ± 0.248
0.271TrpTrp: 0.271 ± 0.147
0.452TrpTyr: 0.452 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.533TyrAla: 2.533 ± 0.674
0.0TyrCys: 0.0 ± 0.0
2.352TyrAsp: 2.352 ± 0.439
2.081TyrGlu: 2.081 ± 0.438
2.081TyrPhe: 2.081 ± 0.434
1.9TyrGly: 1.9 ± 0.454
0.543TyrHis: 0.543 ± 0.224
2.443TyrIle: 2.443 ± 0.622
2.714TyrLys: 2.714 ± 0.501
3.348TyrLeu: 3.348 ± 0.589
0.724TyrMet: 0.724 ± 0.245
1.176TyrAsn: 1.176 ± 0.246
0.995TyrPro: 0.995 ± 0.349
1.267TyrGln: 1.267 ± 0.295
1.086TyrArg: 1.086 ± 0.294
2.895TyrSer: 2.895 ± 0.645
3.257TyrThr: 3.257 ± 0.91
1.448TyrVal: 1.448 ± 0.355
0.362TyrTrp: 0.362 ± 0.204
1.719TyrTyr: 1.719 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11054 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski