Amino acid dipepetide frequency for Bacillus phage DK2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.403AlaAla: 0.403 ± 0.258
0.537AlaCys: 0.537 ± 0.28
3.489AlaAsp: 3.489 ± 0.714
4.16AlaGlu: 4.16 ± 0.706
3.221AlaPhe: 3.221 ± 0.626
2.818AlaGly: 2.818 ± 0.676
0.403AlaHis: 0.403 ± 0.265
3.489AlaIle: 3.489 ± 0.797
4.16AlaLys: 4.16 ± 0.788
2.281AlaLeu: 2.281 ± 0.449
1.476AlaMet: 1.476 ± 0.372
2.013AlaAsn: 2.013 ± 0.714
1.342AlaPro: 1.342 ± 0.637
1.61AlaGln: 1.61 ± 0.503
1.342AlaArg: 1.342 ± 0.424
2.013AlaSer: 2.013 ± 0.527
2.55AlaThr: 2.55 ± 0.676
2.415AlaVal: 2.415 ± 0.68
0.403AlaTrp: 0.403 ± 0.192
2.013AlaTyr: 2.013 ± 0.493
0.0AlaXaa: 0.0 ± 0.0
Cys
0.671CysAla: 0.671 ± 0.283
0.134CysCys: 0.134 ± 0.145
1.208CysAsp: 1.208 ± 0.492
1.61CysGlu: 1.61 ± 0.618
0.268CysPhe: 0.268 ± 0.188
1.074CysGly: 1.074 ± 0.449
0.0CysHis: 0.0 ± 0.0
0.268CysIle: 0.268 ± 0.165
1.476CysLys: 1.476 ± 0.364
0.537CysLeu: 0.537 ± 0.257
0.403CysMet: 0.403 ± 0.193
0.403CysAsn: 0.403 ± 0.21
0.403CysPro: 0.403 ± 0.218
0.134CysGln: 0.134 ± 0.132
0.134CysArg: 0.134 ± 0.129
0.268CysSer: 0.268 ± 0.19
0.403CysThr: 0.403 ± 0.216
1.074CysVal: 1.074 ± 0.407
0.268CysTrp: 0.268 ± 0.188
0.537CysTyr: 0.537 ± 0.234
0.0CysXaa: 0.0 ± 0.0
Asp
3.086AspAla: 3.086 ± 0.502
0.671AspCys: 0.671 ± 0.243
4.563AspAsp: 4.563 ± 0.849
5.099AspGlu: 5.099 ± 0.725
4.026AspPhe: 4.026 ± 0.51
4.697AspGly: 4.697 ± 1.052
0.805AspHis: 0.805 ± 0.264
4.16AspIle: 4.16 ± 0.746
6.844AspLys: 6.844 ± 1.075
4.697AspLeu: 4.697 ± 0.809
2.684AspMet: 2.684 ± 0.578
4.294AspAsn: 4.294 ± 0.719
2.818AspPro: 2.818 ± 0.67
2.684AspGln: 2.684 ± 0.543
2.415AspArg: 2.415 ± 0.54
3.623AspSer: 3.623 ± 0.809
2.952AspThr: 2.952 ± 0.871
5.904AspVal: 5.904 ± 0.826
0.537AspTrp: 0.537 ± 0.339
1.744AspTyr: 1.744 ± 0.397
0.0AspXaa: 0.0 ± 0.0
Glu
3.489GluAla: 3.489 ± 0.597
1.074GluCys: 1.074 ± 0.414
4.026GluAsp: 4.026 ± 0.532
5.099GluGlu: 5.099 ± 1.198
4.16GluPhe: 4.16 ± 0.889
4.428GluGly: 4.428 ± 0.707
0.939GluHis: 0.939 ± 0.292
6.575GluIle: 6.575 ± 0.874
6.978GluLys: 6.978 ± 1.326
8.052GluLeu: 8.052 ± 1.421
2.415GluMet: 2.415 ± 0.677
7.246GluAsn: 7.246 ± 1.181
1.744GluPro: 1.744 ± 0.478
2.281GluGln: 2.281 ± 0.545
4.294GluArg: 4.294 ± 1.097
3.489GluSer: 3.489 ± 0.51
3.086GluThr: 3.086 ± 0.846
5.099GluVal: 5.099 ± 0.925
1.208GluTrp: 1.208 ± 0.409
3.757GluTyr: 3.757 ± 0.518
0.0GluXaa: 0.0 ± 0.0
Phe
0.939PheAla: 0.939 ± 0.306
0.671PheCys: 0.671 ± 0.329
4.026PheAsp: 4.026 ± 0.648
2.952PheGlu: 2.952 ± 0.546
1.208PhePhe: 1.208 ± 0.361
1.879PheGly: 1.879 ± 0.514
1.744PheHis: 1.744 ± 0.573
3.355PheIle: 3.355 ± 0.663
4.965PheLys: 4.965 ± 0.997
2.818PheLeu: 2.818 ± 0.511
1.744PheMet: 1.744 ± 0.507
3.489PheAsn: 3.489 ± 0.637
1.208PhePro: 1.208 ± 0.392
1.342PheGln: 1.342 ± 0.352
0.805PheArg: 0.805 ± 0.379
2.281PheSer: 2.281 ± 0.521
3.086PheThr: 3.086 ± 0.579
3.221PheVal: 3.221 ± 0.618
0.268PheTrp: 0.268 ± 0.181
2.684PheTyr: 2.684 ± 0.488
0.0PheXaa: 0.0 ± 0.0
Gly
3.086GlyAla: 3.086 ± 0.881
0.537GlyCys: 0.537 ± 0.298
3.489GlyAsp: 3.489 ± 0.857
4.965GlyGlu: 4.965 ± 0.619
2.952GlyPhe: 2.952 ± 0.51
2.818GlyGly: 2.818 ± 0.575
0.671GlyHis: 0.671 ± 0.265
5.502GlyIle: 5.502 ± 0.77
5.904GlyLys: 5.904 ± 1.2
5.502GlyLeu: 5.502 ± 0.63
2.147GlyMet: 2.147 ± 0.624
4.16GlyAsn: 4.16 ± 0.871
0.134GlyPro: 0.134 ± 0.114
2.415GlyGln: 2.415 ± 0.596
2.415GlyArg: 2.415 ± 0.545
4.697GlySer: 4.697 ± 0.55
3.757GlyThr: 3.757 ± 0.779
3.489GlyVal: 3.489 ± 0.893
0.805GlyTrp: 0.805 ± 0.362
3.892GlyTyr: 3.892 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
0.537HisAla: 0.537 ± 0.256
0.537HisCys: 0.537 ± 0.288
0.939HisAsp: 0.939 ± 0.3
0.939HisGlu: 0.939 ± 0.34
0.537HisPhe: 0.537 ± 0.214
0.671HisGly: 0.671 ± 0.286
0.403HisHis: 0.403 ± 0.199
2.147HisIle: 2.147 ± 0.453
0.939HisLys: 0.939 ± 0.326
2.147HisLeu: 2.147 ± 0.538
0.268HisMet: 0.268 ± 0.175
1.208HisAsn: 1.208 ± 0.36
0.134HisPro: 0.134 ± 0.122
0.537HisGln: 0.537 ± 0.221
0.939HisArg: 0.939 ± 0.324
1.074HisSer: 1.074 ± 0.394
1.208HisThr: 1.208 ± 0.471
0.939HisVal: 0.939 ± 0.276
0.134HisTrp: 0.134 ± 0.141
1.208HisTyr: 1.208 ± 0.351
0.0HisXaa: 0.0 ± 0.0
Ile
3.355IleAla: 3.355 ± 0.621
0.671IleCys: 0.671 ± 0.293
5.636IleAsp: 5.636 ± 0.718
5.77IleGlu: 5.77 ± 0.957
2.013IlePhe: 2.013 ± 0.436
4.697IleGly: 4.697 ± 0.684
1.476IleHis: 1.476 ± 0.407
4.428IleIle: 4.428 ± 1.062
7.381IleLys: 7.381 ± 1.186
3.355IleLeu: 3.355 ± 0.649
2.55IleMet: 2.55 ± 0.591
4.697IleAsn: 4.697 ± 0.822
1.879IlePro: 1.879 ± 0.505
2.684IleGln: 2.684 ± 0.485
3.489IleArg: 3.489 ± 0.67
3.623IleSer: 3.623 ± 0.654
4.16IleThr: 4.16 ± 0.698
4.16IleVal: 4.16 ± 0.836
0.805IleTrp: 0.805 ± 0.302
3.355IleTyr: 3.355 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
3.355LysAla: 3.355 ± 0.884
0.939LysCys: 0.939 ± 0.358
6.71LysAsp: 6.71 ± 1.081
9.528LysGlu: 9.528 ± 1.887
4.563LysPhe: 4.563 ± 0.808
4.563LysGly: 4.563 ± 0.62
1.61LysHis: 1.61 ± 0.464
4.563LysIle: 4.563 ± 0.736
9.125LysLys: 9.125 ± 1.325
6.844LysLeu: 6.844 ± 0.86
4.026LysMet: 4.026 ± 0.812
5.904LysAsn: 5.904 ± 1.004
2.013LysPro: 2.013 ± 0.5
3.086LysGln: 3.086 ± 0.802
4.026LysArg: 4.026 ± 0.476
3.623LysSer: 3.623 ± 0.642
5.904LysThr: 5.904 ± 0.883
7.246LysVal: 7.246 ± 0.754
1.476LysTrp: 1.476 ± 0.423
3.623LysTyr: 3.623 ± 0.909
0.0LysXaa: 0.0 ± 0.0
Leu
2.818LeuAla: 2.818 ± 0.677
0.671LeuCys: 0.671 ± 0.305
6.441LeuAsp: 6.441 ± 0.961
4.563LeuGlu: 4.563 ± 0.771
2.281LeuPhe: 2.281 ± 0.49
4.16LeuGly: 4.16 ± 0.622
2.147LeuHis: 2.147 ± 0.618
3.623LeuIle: 3.623 ± 0.589
7.112LeuLys: 7.112 ± 0.972
4.16LeuLeu: 4.16 ± 0.787
3.221LeuMet: 3.221 ± 0.592
5.636LeuAsn: 5.636 ± 0.837
2.013LeuPro: 2.013 ± 0.447
2.818LeuGln: 2.818 ± 0.67
4.697LeuArg: 4.697 ± 0.701
3.623LeuSer: 3.623 ± 0.805
4.697LeuThr: 4.697 ± 0.782
3.892LeuVal: 3.892 ± 0.585
0.805LeuTrp: 0.805 ± 0.356
2.415LeuTyr: 2.415 ± 0.575
0.0LeuXaa: 0.0 ± 0.0
Met
1.342MetAla: 1.342 ± 0.339
0.403MetCys: 0.403 ± 0.207
1.476MetAsp: 1.476 ± 0.411
2.818MetGlu: 2.818 ± 0.767
1.744MetPhe: 1.744 ± 0.515
2.684MetGly: 2.684 ± 0.574
0.268MetHis: 0.268 ± 0.179
2.55MetIle: 2.55 ± 0.68
3.489MetLys: 3.489 ± 0.849
2.818MetLeu: 2.818 ± 0.609
1.342MetMet: 1.342 ± 0.498
3.086MetAsn: 3.086 ± 0.773
0.671MetPro: 0.671 ± 0.292
0.939MetGln: 0.939 ± 0.309
1.476MetArg: 1.476 ± 0.411
1.61MetSer: 1.61 ± 0.365
1.879MetThr: 1.879 ± 0.639
1.342MetVal: 1.342 ± 0.478
0.671MetTrp: 0.671 ± 0.313
1.61MetTyr: 1.61 ± 0.434
0.0MetXaa: 0.0 ± 0.0
Asn
2.281AsnAla: 2.281 ± 0.707
0.537AsnCys: 0.537 ± 0.323
4.16AsnAsp: 4.16 ± 0.741
5.636AsnGlu: 5.636 ± 0.702
2.818AsnPhe: 2.818 ± 0.582
4.428AsnGly: 4.428 ± 0.769
1.074AsnHis: 1.074 ± 0.327
5.233AsnIle: 5.233 ± 0.927
6.307AsnLys: 6.307 ± 1.085
4.831AsnLeu: 4.831 ± 0.861
2.147AsnMet: 2.147 ± 0.525
4.428AsnAsn: 4.428 ± 0.93
2.281AsnPro: 2.281 ± 0.474
2.684AsnGln: 2.684 ± 0.645
3.355AsnArg: 3.355 ± 0.821
4.831AsnSer: 4.831 ± 0.666
2.952AsnThr: 2.952 ± 0.654
3.623AsnVal: 3.623 ± 0.806
1.208AsnTrp: 1.208 ± 0.256
3.489AsnTyr: 3.489 ± 0.765
0.0AsnXaa: 0.0 ± 0.0
Pro
0.671ProAla: 0.671 ± 0.254
0.134ProCys: 0.134 ± 0.129
1.208ProAsp: 1.208 ± 0.449
2.684ProGlu: 2.684 ± 0.567
1.476ProPhe: 1.476 ± 0.426
1.342ProGly: 1.342 ± 0.414
0.403ProHis: 0.403 ± 0.186
2.147ProIle: 2.147 ± 0.443
1.744ProLys: 1.744 ± 0.436
1.61ProLeu: 1.61 ± 0.421
0.805ProMet: 0.805 ± 0.308
1.61ProAsn: 1.61 ± 0.531
1.208ProPro: 1.208 ± 0.265
1.074ProGln: 1.074 ± 0.313
1.074ProArg: 1.074 ± 0.336
1.61ProSer: 1.61 ± 0.601
2.013ProThr: 2.013 ± 0.57
1.208ProVal: 1.208 ± 0.435
0.268ProTrp: 0.268 ± 0.167
1.744ProTyr: 1.744 ± 0.433
0.0ProXaa: 0.0 ± 0.0
Gln
1.744GlnAla: 1.744 ± 0.486
0.403GlnCys: 0.403 ± 0.255
2.013GlnAsp: 2.013 ± 0.616
2.013GlnGlu: 2.013 ± 0.687
1.074GlnPhe: 1.074 ± 0.4
2.55GlnGly: 2.55 ± 0.854
0.403GlnHis: 0.403 ± 0.242
2.952GlnIle: 2.952 ± 0.621
2.952GlnLys: 2.952 ± 0.608
2.818GlnLeu: 2.818 ± 0.614
0.939GlnMet: 0.939 ± 0.43
2.013GlnAsn: 2.013 ± 0.496
1.074GlnPro: 1.074 ± 0.353
1.476GlnGln: 1.476 ± 0.466
2.147GlnArg: 2.147 ± 0.619
1.342GlnSer: 1.342 ± 0.373
2.281GlnThr: 2.281 ± 0.458
2.013GlnVal: 2.013 ± 0.577
0.268GlnTrp: 0.268 ± 0.173
2.281GlnTyr: 2.281 ± 0.457
0.0GlnXaa: 0.0 ± 0.0
Arg
2.684ArgAla: 2.684 ± 0.445
0.671ArgCys: 0.671 ± 0.268
2.147ArgAsp: 2.147 ± 0.534
4.294ArgGlu: 4.294 ± 0.771
1.879ArgPhe: 1.879 ± 0.437
2.684ArgGly: 2.684 ± 0.862
0.403ArgHis: 0.403 ± 0.273
2.281ArgIle: 2.281 ± 0.59
5.368ArgLys: 5.368 ± 0.886
3.086ArgLeu: 3.086 ± 0.663
1.476ArgMet: 1.476 ± 0.497
2.818ArgAsn: 2.818 ± 0.625
1.074ArgPro: 1.074 ± 0.372
2.281ArgGln: 2.281 ± 0.589
1.61ArgArg: 1.61 ± 0.398
1.342ArgSer: 1.342 ± 0.365
2.281ArgThr: 2.281 ± 0.606
1.744ArgVal: 1.744 ± 0.527
0.537ArgTrp: 0.537 ± 0.188
2.55ArgTyr: 2.55 ± 0.565
0.0ArgXaa: 0.0 ± 0.0
Ser
2.684SerAla: 2.684 ± 0.69
0.805SerCys: 0.805 ± 0.241
2.415SerAsp: 2.415 ± 0.748
3.355SerGlu: 3.355 ± 0.605
2.147SerPhe: 2.147 ± 0.602
3.355SerGly: 3.355 ± 0.54
1.208SerHis: 1.208 ± 0.362
4.428SerIle: 4.428 ± 0.683
2.818SerLys: 2.818 ± 0.568
4.026SerLeu: 4.026 ± 0.615
1.61SerMet: 1.61 ± 0.276
3.892SerAsn: 3.892 ± 0.701
1.074SerPro: 1.074 ± 0.346
1.744SerGln: 1.744 ± 0.464
1.61SerArg: 1.61 ± 0.571
1.879SerSer: 1.879 ± 0.597
2.013SerThr: 2.013 ± 0.752
4.026SerVal: 4.026 ± 0.56
0.403SerTrp: 0.403 ± 0.188
3.355SerTyr: 3.355 ± 0.573
0.0SerXaa: 0.0 ± 0.0
Thr
3.355ThrAla: 3.355 ± 1.22
0.671ThrCys: 0.671 ± 0.272
4.428ThrAsp: 4.428 ± 0.855
3.892ThrGlu: 3.892 ± 0.766
2.415ThrPhe: 2.415 ± 0.539
4.428ThrGly: 4.428 ± 0.92
0.671ThrHis: 0.671 ± 0.288
4.16ThrIle: 4.16 ± 0.629
4.563ThrLys: 4.563 ± 0.569
4.294ThrLeu: 4.294 ± 0.871
1.61ThrMet: 1.61 ± 0.512
3.623ThrAsn: 3.623 ± 0.674
0.805ThrPro: 0.805 ± 0.306
1.61ThrGln: 1.61 ± 0.475
2.55ThrArg: 2.55 ± 0.634
2.281ThrSer: 2.281 ± 0.637
4.697ThrThr: 4.697 ± 1.189
5.099ThrVal: 5.099 ± 0.801
0.268ThrTrp: 0.268 ± 0.163
2.55ThrTyr: 2.55 ± 0.511
0.0ThrXaa: 0.0 ± 0.0
Val
2.55ValAla: 2.55 ± 0.572
0.537ValCys: 0.537 ± 0.26
5.636ValAsp: 5.636 ± 0.826
5.233ValGlu: 5.233 ± 0.654
2.55ValPhe: 2.55 ± 0.493
5.77ValGly: 5.77 ± 0.647
1.61ValHis: 1.61 ± 0.479
4.563ValIle: 4.563 ± 0.813
5.904ValLys: 5.904 ± 0.801
4.026ValLeu: 4.026 ± 0.646
1.208ValMet: 1.208 ± 0.427
3.623ValAsn: 3.623 ± 0.553
2.281ValPro: 2.281 ± 0.55
1.744ValGln: 1.744 ± 0.557
2.147ValArg: 2.147 ± 0.399
2.818ValSer: 2.818 ± 0.594
5.233ValThr: 5.233 ± 1.195
4.16ValVal: 4.16 ± 0.886
1.744ValTrp: 1.744 ± 0.437
1.342ValTyr: 1.342 ± 0.456
0.0ValXaa: 0.0 ± 0.0
Trp
0.537TrpAla: 0.537 ± 0.26
0.0TrpCys: 0.0 ± 0.0
0.671TrpAsp: 0.671 ± 0.281
0.939TrpGlu: 0.939 ± 0.413
1.476TrpPhe: 1.476 ± 0.371
1.074TrpGly: 1.074 ± 0.385
0.537TrpHis: 0.537 ± 0.202
0.671TrpIle: 0.671 ± 0.261
1.208TrpLys: 1.208 ± 0.445
0.939TrpLeu: 0.939 ± 0.319
0.939TrpMet: 0.939 ± 0.314
0.805TrpAsn: 0.805 ± 0.334
0.0TrpPro: 0.0 ± 0.0
0.403TrpGln: 0.403 ± 0.225
0.537TrpArg: 0.537 ± 0.266
0.403TrpSer: 0.403 ± 0.181
0.134TrpThr: 0.134 ± 0.132
0.939TrpVal: 0.939 ± 0.285
0.268TrpTrp: 0.268 ± 0.194
0.671TrpTyr: 0.671 ± 0.292
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.55TyrAla: 2.55 ± 0.495
0.671TyrCys: 0.671 ± 0.28
3.623TyrAsp: 3.623 ± 0.61
3.892TyrGlu: 3.892 ± 0.641
1.879TyrPhe: 1.879 ± 0.389
3.221TyrGly: 3.221 ± 0.568
0.671TyrHis: 0.671 ± 0.309
3.086TyrIle: 3.086 ± 0.652
3.623TyrLys: 3.623 ± 0.662
2.952TyrLeu: 2.952 ± 0.572
1.074TyrMet: 1.074 ± 0.35
3.355TyrAsn: 3.355 ± 0.545
1.879TyrPro: 1.879 ± 0.421
1.208TyrGln: 1.208 ± 0.412
2.147TyrArg: 2.147 ± 0.619
2.415TyrSer: 2.415 ± 0.532
2.684TyrThr: 2.684 ± 0.487
3.086TyrVal: 3.086 ± 0.738
0.805TyrTrp: 0.805 ± 0.339
2.684TyrTyr: 2.684 ± 0.521
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (7453 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski