Amino acid dipeptide frequency for Lactococcus phage bIL309

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
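As a rough illustration of what such a calculation can look like, the sketch below counts adjacent residue pairs in a list of sequences and reports them in per mille. It is only a sketch under stated assumptions: the function name dipeptide_frequencies and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes in the table, and it is assumed that overlapping pairs are counted within each protein and never across protein boundaries. The page does not state how its own counts were normalised.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption (not confirmed by the source): overlapping pairs are counted
    within each protein and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair of one protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not taken from the bIL309 proteome
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
```

Here the pair "AM" is never counted, because the last residue of one protein and the first residue of the next are not treated as neighbours.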

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
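The arithmetic behind these numbers can be sketched as follows. The three example values are copied from the table on this page; the function name representation is hypothetical, and the plain comparison against the uniform expectation is an assumption, since the page does not state the exact rule used for the colouring.

```python
STANDARD = 20            # standard amino acids
WITH_XAA = STANDARD + 1  # plus Xaa for non-standard or unknown residues

n_dipeptides = STANDARD ** 2       # 400 ordered pairs
n_dipeptides_xaa = WITH_XAA ** 2   # 441 ordered pairs when Xaa is included

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille
expected_permille = 1000.0 / n_dipeptides


def representation(observed_permille, expected=expected_permille):
    """Classify a value against the uniform expectation.

    Assumption: a simple greater/less comparison; the site's own
    colouring rule is not documented on this page.
    """
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table
examples = {"LysLys": 8.899, "AlaAla": 3.269, "CysCys": 0.0}
labels = {pair: representation(value) for pair, value in examples.items()}
```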

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
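A minimal sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical and only illustrate the scaling.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 3.269  # AlaAla, in per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
```

For AlaAla this gives a fraction of about 0.003269, or about 0.3269%.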

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.269 ± 0.707
AlaCys: 0.363 ± 0.18
AlaAsp: 4.358 ± 0.562
AlaGlu: 3.541 ± 0.661
AlaPhe: 2.542 ± 0.511
AlaGly: 4.994 ± 0.595
AlaHis: 0.908 ± 0.26
AlaIle: 4.177 ± 0.558
AlaLys: 5.357 ± 0.863
AlaLeu: 5.811 ± 0.817
AlaMet: 1.816 ± 0.385
AlaAsn: 4.358 ± 0.692
AlaPro: 1.725 ± 0.509
AlaGln: 2.452 ± 0.47
AlaArg: 1.816 ± 0.38
AlaSer: 4.358 ± 0.549
AlaThr: 3.814 ± 0.582
AlaVal: 4.722 ± 0.663
AlaTrp: 0.817 ± 0.245
AlaTyr: 2.179 ± 0.447
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.182 ± 0.13
CysCys: 0.0 ± 0.0
CysAsp: 0.363 ± 0.194
CysGlu: 0.454 ± 0.323
CysPhe: 0.0 ± 0.0
CysGly: 0.545 ± 0.341
CysHis: 0.363 ± 0.264
CysIle: 0.0 ± 0.0
CysLys: 0.363 ± 0.189
CysLeu: 0.272 ± 0.157
CysMet: 0.182 ± 0.133
CysAsn: 0.091 ± 0.1
CysPro: 0.182 ± 0.139
CysGln: 0.182 ± 0.136
CysArg: 0.454 ± 0.198
CysSer: 0.636 ± 0.275
CysThr: 0.272 ± 0.161
CysVal: 0.091 ± 0.077
CysTrp: 0.091 ± 0.086
CysTyr: 0.091 ± 0.107
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.632 ± 0.446
AspCys: 0.636 ± 0.348
AspAsp: 3.814 ± 0.593
AspGlu: 4.086 ± 1.056
AspPhe: 2.542 ± 0.513
AspGly: 6.992 ± 1.855
AspHis: 0.908 ± 0.235
AspIle: 3.995 ± 0.58
AspLys: 4.812 ± 0.682
AspLeu: 4.177 ± 0.59
AspMet: 1.362 ± 0.349
AspAsn: 3.995 ± 0.544
AspPro: 1.362 ± 0.305
AspGln: 1.725 ± 0.413
AspArg: 1.453 ± 0.354
AspSer: 5.267 ± 0.552
AspThr: 3.178 ± 0.57
AspVal: 3.45 ± 0.442
AspTrp: 1.09 ± 0.316
AspTyr: 2.633 ± 0.587
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.995 ± 0.593
GluCys: 0.272 ± 0.188
GluAsp: 2.906 ± 0.659
GluGlu: 4.722 ± 0.856
GluPhe: 2.452 ± 0.481
GluGly: 2.815 ± 0.55
GluHis: 0.908 ± 0.287
GluIle: 6.175 ± 0.828
GluLys: 7.264 ± 1.222
GluLeu: 7.083 ± 1.272
GluMet: 2.361 ± 0.535
GluAsn: 3.269 ± 0.628
GluPro: 1.816 ± 0.489
GluGln: 2.996 ± 0.512
GluArg: 1.907 ± 0.477
GluSer: 3.995 ± 0.524
GluThr: 2.542 ± 0.467
GluVal: 3.995 ± 0.688
GluTrp: 0.817 ± 0.339
GluTyr: 2.724 ± 0.483
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.27 ± 0.423
PheCys: 0.272 ± 0.184
PheAsp: 2.27 ± 0.383
PheGlu: 2.542 ± 0.565
PhePhe: 0.908 ± 0.273
PheGly: 2.996 ± 0.547
PheHis: 0.636 ± 0.243
PheIle: 4.177 ± 0.806
PheLys: 3.995 ± 0.539
PheLeu: 2.906 ± 0.461
PheMet: 0.817 ± 0.239
PheAsn: 3.632 ± 0.515
PhePro: 1.18 ± 0.285
PheGln: 1.09 ± 0.269
PheArg: 0.726 ± 0.222
PheSer: 3.723 ± 0.555
PheThr: 2.452 ± 0.548
PheVal: 2.452 ± 0.566
PheTrp: 0.454 ± 0.191
PheTyr: 1.816 ± 0.38
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.995 ± 0.636
GlyCys: 0.272 ± 0.171
GlyAsp: 3.45 ± 0.511
GlyGlu: 3.087 ± 0.503
GlyPhe: 4.358 ± 0.684
GlyGly: 3.632 ± 0.804
GlyHis: 1.362 ± 0.351
GlyIle: 5.176 ± 0.622
GlyLys: 5.993 ± 0.923
GlyLeu: 5.721 ± 0.904
GlyMet: 1.998 ± 0.292
GlyAsn: 4.086 ± 0.883
GlyPro: 0.726 ± 0.208
GlyGln: 3.269 ± 0.445
GlyArg: 2.179 ± 0.363
GlySer: 4.994 ± 0.789
GlyThr: 7.264 ± 1.795
GlyVal: 4.54 ± 0.982
GlyTrp: 1.453 ± 0.391
GlyTyr: 2.542 ± 0.467
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.636 ± 0.243
HisCys: 0.182 ± 0.14
HisAsp: 0.726 ± 0.301
HisGlu: 1.362 ± 0.485
HisPhe: 0.999 ± 0.299
HisGly: 0.817 ± 0.34
HisHis: 0.182 ± 0.131
HisIle: 0.999 ± 0.273
HisLys: 0.454 ± 0.216
HisLeu: 1.18 ± 0.371
HisMet: 0.272 ± 0.173
HisAsn: 0.908 ± 0.348
HisPro: 0.636 ± 0.227
HisGln: 0.545 ± 0.25
HisArg: 0.272 ± 0.161
HisSer: 1.362 ± 0.371
HisThr: 0.636 ± 0.218
HisVal: 0.636 ± 0.228
HisTrp: 0.091 ± 0.077
HisTyr: 0.817 ± 0.245
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.448 ± 0.576
IleCys: 0.0 ± 0.0
IleAsp: 4.54 ± 0.629
IleGlu: 5.539 ± 0.765
IlePhe: 2.452 ± 0.585
IleGly: 4.903 ± 0.767
IleHis: 0.726 ± 0.341
IleIle: 4.177 ± 0.765
IleLys: 7.537 ± 0.819
IleLeu: 3.36 ± 0.588
IleMet: 1.271 ± 0.425
IleAsn: 5.902 ± 0.847
IlePro: 2.724 ± 0.392
IleGln: 2.724 ± 0.44
IleArg: 2.542 ± 0.508
IleSer: 5.811 ± 0.59
IleThr: 5.902 ± 1.122
IleVal: 3.178 ± 0.577
IleTrp: 0.908 ± 0.319
IleTyr: 2.906 ± 0.559
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.356 ± 0.906
LysCys: 0.454 ± 0.218
LysAsp: 5.902 ± 0.585
LysGlu: 6.356 ± 1.008
LysPhe: 3.36 ± 0.471
LysGly: 5.902 ± 0.926
LysHis: 1.816 ± 0.416
LysIle: 5.993 ± 0.7
LysLys: 8.899 ± 1.435
LysLeu: 7.264 ± 1.405
LysMet: 2.088 ± 0.568
LysAsn: 6.356 ± 0.832
LysPro: 1.453 ± 0.315
LysGln: 2.996 ± 0.495
LysArg: 3.087 ± 0.652
LysSer: 4.358 ± 0.738
LysThr: 6.175 ± 0.966
LysVal: 4.54 ± 0.578
LysTrp: 1.453 ± 0.487
LysTyr: 3.178 ± 0.648
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.721 ± 0.829
LeuCys: 0.091 ± 0.092
LeuAsp: 5.448 ± 0.93
LeuGlu: 4.449 ± 0.707
LeuPhe: 2.815 ± 0.497
LeuGly: 5.357 ± 0.874
LeuHis: 0.454 ± 0.268
LeuIle: 6.356 ± 0.891
LeuLys: 8.081 ± 1.373
LeuLeu: 5.721 ± 0.857
LeuMet: 1.998 ± 0.495
LeuAsn: 5.721 ± 0.806
LeuPro: 2.088 ± 0.486
LeuGln: 2.452 ± 0.656
LeuArg: 2.542 ± 0.408
LeuSer: 6.719 ± 0.861
LeuThr: 6.719 ± 0.809
LeuVal: 3.904 ± 0.525
LeuTrp: 1.453 ± 0.481
LeuTyr: 2.27 ± 0.419
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.27 ± 0.524
MetCys: 0.182 ± 0.145
MetAsp: 0.817 ± 0.28
MetGlu: 2.542 ± 0.541
MetPhe: 0.545 ± 0.221
MetGly: 1.634 ± 0.365
MetHis: 0.091 ± 0.09
MetIle: 1.725 ± 0.473
MetLys: 2.452 ± 0.391
MetLeu: 1.453 ± 0.391
MetMet: 0.636 ± 0.307
MetAsn: 1.453 ± 0.348
MetPro: 0.726 ± 0.267
MetGln: 1.18 ± 0.342
MetArg: 0.999 ± 0.35
MetSer: 1.816 ± 0.429
MetThr: 1.816 ± 0.519
MetVal: 1.362 ± 0.422
MetTrp: 0.272 ± 0.131
MetTyr: 0.636 ± 0.245
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.449 ± 0.652
AsnCys: 0.363 ± 0.203
AsnAsp: 3.632 ± 0.907
AsnGlu: 2.996 ± 0.448
AsnPhe: 3.178 ± 0.542
AsnGly: 6.538 ± 1.062
AsnHis: 0.817 ± 0.367
AsnIle: 4.812 ± 0.588
AsnLys: 3.087 ± 0.641
AsnLeu: 4.994 ± 0.522
AsnMet: 1.453 ± 0.343
AsnAsn: 3.995 ± 0.663
AsnPro: 2.27 ± 0.377
AsnGln: 2.452 ± 0.509
AsnArg: 2.27 ± 0.435
AsnSer: 5.63 ± 0.945
AsnThr: 3.723 ± 0.593
AsnVal: 4.086 ± 0.598
AsnTrp: 0.726 ± 0.256
AsnTyr: 1.998 ± 0.439
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.634 ± 0.407
ProCys: 0.0 ± 0.0
ProAsp: 1.725 ± 0.535
ProGlu: 2.088 ± 0.53
ProPhe: 1.09 ± 0.299
ProGly: 1.271 ± 0.45
ProHis: 0.636 ± 0.234
ProIle: 1.544 ± 0.417
ProLys: 1.998 ± 0.376
ProLeu: 2.724 ± 0.426
ProMet: 0.636 ± 0.237
ProAsn: 1.271 ± 0.483
ProPro: 0.636 ± 0.296
ProGln: 1.09 ± 0.338
ProArg: 0.545 ± 0.219
ProSer: 1.907 ± 0.546
ProThr: 2.633 ± 0.997
ProVal: 1.907 ± 0.562
ProTrp: 0.272 ± 0.134
ProTyr: 1.09 ± 0.313
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.815 ± 0.355
GlnCys: 0.091 ± 0.1
GlnAsp: 1.907 ± 0.417
GlnGlu: 3.36 ± 0.573
GlnPhe: 2.27 ± 0.404
GlnGly: 3.269 ± 0.842
GlnHis: 0.636 ± 0.238
GlnIle: 2.724 ± 0.475
GlnLys: 3.269 ± 0.525
GlnLeu: 3.36 ± 0.553
GlnMet: 1.09 ± 0.373
GlnAsn: 1.816 ± 0.394
GlnPro: 0.636 ± 0.274
GlnGln: 1.634 ± 0.466
GlnArg: 1.453 ± 0.523
GlnSer: 2.633 ± 0.459
GlnThr: 2.452 ± 0.371
GlnVal: 2.361 ± 0.419
GlnTrp: 0.545 ± 0.192
GlnTyr: 1.453 ± 0.422
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.725 ± 0.311
ArgCys: 0.363 ± 0.205
ArgAsp: 2.906 ± 0.692
ArgGlu: 1.362 ± 0.473
ArgPhe: 1.998 ± 0.405
ArgGly: 1.09 ± 0.393
ArgHis: 0.272 ± 0.14
ArgIle: 2.27 ± 0.566
ArgLys: 3.178 ± 0.536
ArgLeu: 3.45 ± 0.619
ArgMet: 0.363 ± 0.202
ArgAsn: 1.725 ± 0.395
ArgPro: 0.726 ± 0.261
ArgGln: 1.271 ± 0.333
ArgArg: 1.544 ± 0.511
ArgSer: 2.179 ± 0.415
ArgThr: 2.542 ± 0.492
ArgVal: 1.634 ± 0.322
ArgTrp: 0.182 ± 0.125
ArgTyr: 1.18 ± 0.267
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.449 ± 0.597
SerCys: 0.091 ± 0.117
SerAsp: 5.357 ± 0.772
SerGlu: 5.448 ± 0.511
SerPhe: 3.178 ± 0.43
SerGly: 5.539 ± 0.932
SerHis: 0.817 ± 0.21
SerIle: 4.812 ± 0.732
SerLys: 6.356 ± 0.945
SerLeu: 5.63 ± 0.83
SerMet: 2.088 ± 0.503
SerAsn: 3.904 ± 0.65
SerPro: 2.179 ± 0.507
SerGln: 3.541 ± 0.549
SerArg: 2.27 ± 0.518
SerSer: 4.903 ± 0.81
SerThr: 4.812 ± 0.792
SerVal: 5.902 ± 0.704
SerTrp: 0.908 ± 0.279
SerTyr: 2.633 ± 0.427
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.722 ± 0.671
ThrCys: 0.545 ± 0.204
ThrAsp: 4.177 ± 0.786
ThrGlu: 4.268 ± 0.591
ThrPhe: 1.816 ± 0.392
ThrGly: 5.085 ± 0.946
ThrHis: 1.18 ± 0.287
ThrIle: 5.993 ± 1.138
ThrLys: 5.176 ± 0.705
ThrLeu: 5.539 ± 0.614
ThrMet: 1.544 ± 0.411
ThrAsn: 4.722 ± 1.156
ThrPro: 1.998 ± 0.644
ThrGln: 2.633 ± 0.479
ThrArg: 2.361 ± 0.508
ThrSer: 4.268 ± 0.813
ThrThr: 5.267 ± 1.934
ThrVal: 4.358 ± 0.938
ThrTrp: 1.453 ± 0.431
ThrTyr: 2.361 ± 0.605
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.996 ± 0.623
ValCys: 0.091 ± 0.1
ValAsp: 3.45 ± 0.473
ValGlu: 3.995 ± 0.688
ValPhe: 2.633 ± 0.556
ValGly: 4.086 ± 0.602
ValHis: 0.726 ± 0.192
ValIle: 4.268 ± 0.718
ValLys: 5.357 ± 0.733
ValLeu: 4.994 ± 0.661
ValMet: 1.362 ± 0.368
ValAsn: 3.904 ± 0.598
ValPro: 2.27 ± 0.54
ValGln: 2.633 ± 0.487
ValArg: 1.362 ± 0.349
ValSer: 5.721 ± 0.749
ValThr: 3.723 ± 0.621
ValVal: 3.36 ± 0.603
ValTrp: 0.817 ± 0.278
ValTyr: 1.18 ± 0.318
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.726 ± 0.273
TrpCys: 0.0 ± 0.0
TrpAsp: 1.09 ± 0.321
TrpGlu: 0.545 ± 0.231
TrpPhe: 0.545 ± 0.181
TrpGly: 0.726 ± 0.291
TrpHis: 0.091 ± 0.077
TrpIle: 0.908 ± 0.213
TrpLys: 0.999 ± 0.296
TrpLeu: 1.544 ± 0.407
TrpMet: 0.363 ± 0.149
TrpAsn: 0.908 ± 0.269
TrpPro: 0.182 ± 0.118
TrpGln: 0.636 ± 0.17
TrpArg: 0.636 ± 0.24
TrpSer: 1.816 ± 0.515
TrpThr: 1.271 ± 0.644
TrpVal: 0.636 ± 0.295
TrpTrp: 0.454 ± 0.214
TrpTyr: 0.636 ± 0.243
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.088 ± 0.367
TyrCys: 0.545 ± 0.215
TyrAsp: 2.27 ± 0.547
TyrGlu: 2.361 ± 0.433
TyrPhe: 1.634 ± 0.347
TyrGly: 1.907 ± 0.318
TyrHis: 0.182 ± 0.117
TyrIle: 2.27 ± 0.429
TyrLys: 3.36 ± 0.618
TyrLeu: 3.178 ± 0.442
TyrMet: 0.817 ± 0.223
TyrAsn: 1.453 ± 0.319
TyrPro: 1.18 ± 0.325
TyrGln: 2.27 ± 0.42
TyrArg: 1.544 ± 0.47
TyrSer: 2.724 ± 0.593
TyrThr: 2.27 ± 0.666
TyrVal: 1.816 ± 0.365
TyrTrp: 0.454 ± 0.162
TyrTyr: 0.908 ± 0.258
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11014 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
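One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed on each resample. The sketch below follows that reading only. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions; the page does not say which spread measure stands behind the ± values.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of a dipeptide frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    resampled estimates; the source does not specify this.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return stdev(estimates)


# Toy input for illustration only, not taken from the bIL309 proteome
toy_proteome = ["MKKLA", "MKA", "AKKM"]
error_mk = bootstrap_error(toy_proteome, "MK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set.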

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski