Amino acid dipepetide frequency for Bacillus phage PfEFR-5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.507AlaAla: 4.507 ± 0.717
0.656AlaCys: 0.656 ± 0.269
2.295AlaAsp: 2.295 ± 0.474
5.245AlaGlu: 5.245 ± 0.758
2.049AlaPhe: 2.049 ± 0.472
4.016AlaGly: 4.016 ± 0.99
0.82AlaHis: 0.82 ± 0.277
4.999AlaIle: 4.999 ± 0.588
5.573AlaLys: 5.573 ± 0.477
6.31AlaLeu: 6.31 ± 1.137
1.721AlaMet: 1.721 ± 0.392
3.524AlaAsn: 3.524 ± 0.709
1.393AlaPro: 1.393 ± 0.49
2.131AlaGln: 2.131 ± 0.462
2.377AlaArg: 2.377 ± 0.618
2.95AlaSer: 2.95 ± 0.656
3.934AlaThr: 3.934 ± 0.69
4.344AlaVal: 4.344 ± 0.895
1.393AlaTrp: 1.393 ± 0.397
1.885AlaTyr: 1.885 ± 0.425
0.0AlaXaa: 0.0 ± 0.0
Cys
0.492CysAla: 0.492 ± 0.208
0.0CysCys: 0.0 ± 0.0
0.41CysAsp: 0.41 ± 0.168
1.065CysGlu: 1.065 ± 0.399
0.492CysPhe: 0.492 ± 0.24
0.41CysGly: 0.41 ± 0.211
0.164CysHis: 0.164 ± 0.12
0.246CysIle: 0.246 ± 0.151
0.41CysLys: 0.41 ± 0.273
0.738CysLeu: 0.738 ± 0.272
0.492CysMet: 0.492 ± 0.208
0.41CysAsn: 0.41 ± 0.183
0.41CysPro: 0.41 ± 0.213
0.246CysGln: 0.246 ± 0.15
0.164CysArg: 0.164 ± 0.118
0.328CysSer: 0.328 ± 0.163
0.246CysThr: 0.246 ± 0.15
0.41CysVal: 0.41 ± 0.199
0.0CysTrp: 0.0 ± 0.0
0.164CysTyr: 0.164 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
3.688AspAla: 3.688 ± 0.673
0.328AspCys: 0.328 ± 0.192
3.524AspAsp: 3.524 ± 0.637
4.344AspGlu: 4.344 ± 0.76
1.885AspPhe: 1.885 ± 0.371
3.442AspGly: 3.442 ± 0.533
1.065AspHis: 1.065 ± 0.269
3.934AspIle: 3.934 ± 0.599
4.999AspLys: 4.999 ± 0.566
4.671AspLeu: 4.671 ± 0.615
1.475AspMet: 1.475 ± 0.372
2.623AspAsn: 2.623 ± 0.482
1.475AspPro: 1.475 ± 0.272
1.557AspGln: 1.557 ± 0.325
1.885AspArg: 1.885 ± 0.52
2.95AspSer: 2.95 ± 0.474
3.278AspThr: 3.278 ± 0.539
3.688AspVal: 3.688 ± 0.504
1.065AspTrp: 1.065 ± 0.325
1.803AspTyr: 1.803 ± 0.391
0.0AspXaa: 0.0 ± 0.0
Glu
4.917GluAla: 4.917 ± 0.671
0.983GluCys: 0.983 ± 0.377
2.95GluAsp: 2.95 ± 0.418
6.228GluGlu: 6.228 ± 0.757
3.114GluPhe: 3.114 ± 0.479
5.573GluGly: 5.573 ± 0.695
1.229GluHis: 1.229 ± 0.318
7.13GluIle: 7.13 ± 0.631
6.966GluLys: 6.966 ± 0.726
9.097GluLeu: 9.097 ± 0.862
3.278GluMet: 3.278 ± 0.622
3.852GluAsn: 3.852 ± 0.587
1.557GluPro: 1.557 ± 0.437
3.934GluGln: 3.934 ± 0.581
4.753GluArg: 4.753 ± 0.801
5.163GluSer: 5.163 ± 0.794
4.344GluThr: 4.344 ± 0.743
5.983GluVal: 5.983 ± 0.685
1.065GluTrp: 1.065 ± 0.341
2.704GluTyr: 2.704 ± 0.473
0.0GluXaa: 0.0 ± 0.0
Phe
1.803PheAla: 1.803 ± 0.361
0.492PheCys: 0.492 ± 0.188
2.541PheAsp: 2.541 ± 0.555
2.623PheGlu: 2.623 ± 0.334
1.311PhePhe: 1.311 ± 0.362
2.377PheGly: 2.377 ± 0.361
0.901PheHis: 0.901 ± 0.26
2.95PheIle: 2.95 ± 0.605
4.753PheLys: 4.753 ± 0.753
3.032PheLeu: 3.032 ± 0.558
1.229PheMet: 1.229 ± 0.29
2.704PheAsn: 2.704 ± 0.446
0.983PhePro: 0.983 ± 0.346
1.803PheGln: 1.803 ± 0.352
2.623PheArg: 2.623 ± 0.474
1.803PheSer: 1.803 ± 0.374
2.295PheThr: 2.295 ± 0.476
2.131PheVal: 2.131 ± 0.43
0.164PheTrp: 0.164 ± 0.127
1.803PheTyr: 1.803 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
3.524GlyAla: 3.524 ± 1.18
0.246GlyCys: 0.246 ± 0.214
3.77GlyAsp: 3.77 ± 0.438
4.18GlyGlu: 4.18 ± 0.749
3.278GlyPhe: 3.278 ± 0.729
3.032GlyGly: 3.032 ± 0.632
0.656GlyHis: 0.656 ± 0.163
5.409GlyIle: 5.409 ± 0.963
6.228GlyLys: 6.228 ± 0.68
4.835GlyLeu: 4.835 ± 0.733
1.885GlyMet: 1.885 ± 0.354
3.278GlyAsn: 3.278 ± 0.477
0.492GlyPro: 0.492 ± 0.189
2.786GlyGln: 2.786 ± 0.768
2.459GlyArg: 2.459 ± 0.465
4.344GlySer: 4.344 ± 0.689
2.541GlyThr: 2.541 ± 0.434
4.426GlyVal: 4.426 ± 0.587
1.639GlyTrp: 1.639 ± 0.332
3.196GlyTyr: 3.196 ± 0.592
0.0GlyXaa: 0.0 ± 0.0
His
0.738HisAla: 0.738 ± 0.226
0.164HisCys: 0.164 ± 0.126
0.82HisAsp: 0.82 ± 0.256
0.738HisGlu: 0.738 ± 0.248
1.147HisPhe: 1.147 ± 0.294
0.901HisGly: 0.901 ± 0.275
0.328HisHis: 0.328 ± 0.154
1.065HisIle: 1.065 ± 0.213
1.475HisLys: 1.475 ± 0.395
1.475HisLeu: 1.475 ± 0.434
0.328HisMet: 0.328 ± 0.151
0.574HisAsn: 0.574 ± 0.168
0.492HisPro: 0.492 ± 0.245
0.656HisGln: 0.656 ± 0.227
0.574HisArg: 0.574 ± 0.176
1.311HisSer: 1.311 ± 0.383
1.065HisThr: 1.065 ± 0.314
1.147HisVal: 1.147 ± 0.306
0.082HisTrp: 0.082 ± 0.08
0.901HisTyr: 0.901 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
4.753IleAla: 4.753 ± 0.476
0.492IleCys: 0.492 ± 0.168
4.917IleAsp: 4.917 ± 0.602
7.13IleGlu: 7.13 ± 0.723
2.623IlePhe: 2.623 ± 0.592
3.606IleGly: 3.606 ± 0.504
0.983IleHis: 0.983 ± 0.255
4.098IleIle: 4.098 ± 0.554
6.474IleLys: 6.474 ± 0.646
4.344IleLeu: 4.344 ± 0.48
1.393IleMet: 1.393 ± 0.358
3.852IleAsn: 3.852 ± 0.709
2.377IlePro: 2.377 ± 0.332
3.524IleGln: 3.524 ± 0.604
3.606IleArg: 3.606 ± 0.601
4.999IleSer: 4.999 ± 0.725
4.589IleThr: 4.589 ± 0.522
3.032IleVal: 3.032 ± 0.511
1.147IleTrp: 1.147 ± 0.41
2.049IleTyr: 2.049 ± 0.436
0.0IleXaa: 0.0 ± 0.0
Lys
6.474LysAla: 6.474 ± 0.716
0.656LysCys: 0.656 ± 0.225
4.917LysAsp: 4.917 ± 0.728
9.343LysGlu: 9.343 ± 0.952
3.36LysPhe: 3.36 ± 0.506
5.737LysGly: 5.737 ± 0.512
1.311LysHis: 1.311 ± 0.323
6.065LysIle: 6.065 ± 0.59
7.458LysLys: 7.458 ± 1.229
6.556LysLeu: 6.556 ± 0.742
3.852LysMet: 3.852 ± 0.67
5.573LysAsn: 5.573 ± 0.66
2.213LysPro: 2.213 ± 0.351
5.491LysGln: 5.491 ± 0.644
3.934LysArg: 3.934 ± 0.524
4.507LysSer: 4.507 ± 0.541
5.983LysThr: 5.983 ± 0.76
5.737LysVal: 5.737 ± 0.768
1.229LysTrp: 1.229 ± 0.336
3.278LysTyr: 3.278 ± 0.536
0.0LysXaa: 0.0 ± 0.0
Leu
4.262LeuAla: 4.262 ± 0.934
0.656LeuCys: 0.656 ± 0.243
4.426LeuAsp: 4.426 ± 0.608
6.474LeuGlu: 6.474 ± 0.681
2.459LeuPhe: 2.459 ± 0.496
5.491LeuGly: 5.491 ± 0.591
0.983LeuHis: 0.983 ± 0.286
4.753LeuIle: 4.753 ± 0.701
9.015LeuLys: 9.015 ± 0.923
5.573LeuLeu: 5.573 ± 0.895
1.721LeuMet: 1.721 ± 0.343
4.917LeuAsn: 4.917 ± 0.567
2.704LeuPro: 2.704 ± 0.591
4.262LeuGln: 4.262 ± 0.843
4.344LeuArg: 4.344 ± 0.956
4.835LeuSer: 4.835 ± 0.8
4.507LeuThr: 4.507 ± 0.532
3.77LeuVal: 3.77 ± 0.459
0.656LeuTrp: 0.656 ± 0.244
2.786LeuTyr: 2.786 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.295MetAla: 2.295 ± 0.565
0.246MetCys: 0.246 ± 0.144
1.803MetAsp: 1.803 ± 0.377
2.541MetGlu: 2.541 ± 0.404
0.983MetPhe: 0.983 ± 0.31
1.639MetGly: 1.639 ± 0.35
0.0MetHis: 0.0 ± 0.0
2.213MetIle: 2.213 ± 0.441
3.852MetLys: 3.852 ± 0.526
2.131MetLeu: 2.131 ± 0.462
0.983MetMet: 0.983 ± 0.236
2.049MetAsn: 2.049 ± 0.371
0.656MetPro: 0.656 ± 0.321
1.393MetGln: 1.393 ± 0.415
2.213MetArg: 2.213 ± 0.415
1.393MetSer: 1.393 ± 0.397
1.967MetThr: 1.967 ± 0.377
1.311MetVal: 1.311 ± 0.338
0.41MetTrp: 0.41 ± 0.177
0.492MetTyr: 0.492 ± 0.175
0.0MetXaa: 0.0 ± 0.0
Asn
3.606AsnAla: 3.606 ± 0.511
0.082AsnCys: 0.082 ± 0.088
2.377AsnAsp: 2.377 ± 0.41
5.163AsnGlu: 5.163 ± 0.653
1.393AsnPhe: 1.393 ± 0.268
4.589AsnGly: 4.589 ± 0.686
1.147AsnHis: 1.147 ± 0.398
2.95AsnIle: 2.95 ± 0.693
5.655AsnLys: 5.655 ± 0.628
3.278AsnLeu: 3.278 ± 0.511
2.131AsnMet: 2.131 ± 0.386
2.377AsnAsn: 2.377 ± 0.621
2.049AsnPro: 2.049 ± 0.376
2.868AsnGln: 2.868 ± 0.582
2.704AsnArg: 2.704 ± 0.472
2.131AsnSer: 2.131 ± 0.472
3.278AsnThr: 3.278 ± 0.542
3.524AsnVal: 3.524 ± 0.546
0.574AsnTrp: 0.574 ± 0.247
1.803AsnTyr: 1.803 ± 0.4
0.0AsnXaa: 0.0 ± 0.0
Pro
0.983ProAla: 0.983 ± 0.26
0.0ProCys: 0.0 ± 0.0
1.311ProAsp: 1.311 ± 0.349
2.377ProGlu: 2.377 ± 0.372
1.065ProPhe: 1.065 ± 0.243
1.229ProGly: 1.229 ± 0.293
0.983ProHis: 0.983 ± 0.35
2.131ProIle: 2.131 ± 0.526
2.868ProLys: 2.868 ± 0.703
1.721ProLeu: 1.721 ± 0.465
0.738ProMet: 0.738 ± 0.32
0.574ProAsn: 0.574 ± 0.201
1.557ProPro: 1.557 ± 0.463
1.147ProGln: 1.147 ± 0.355
0.983ProArg: 0.983 ± 0.299
2.704ProSer: 2.704 ± 0.662
1.639ProThr: 1.639 ± 0.436
2.295ProVal: 2.295 ± 0.496
0.164ProTrp: 0.164 ± 0.163
1.065ProTyr: 1.065 ± 0.227
0.0ProXaa: 0.0 ± 0.0
Gln
3.606GlnAla: 3.606 ± 0.874
0.164GlnCys: 0.164 ± 0.111
1.393GlnAsp: 1.393 ± 0.286
3.77GlnGlu: 3.77 ± 0.621
2.049GlnPhe: 2.049 ± 0.471
3.032GlnGly: 3.032 ± 0.926
0.901GlnHis: 0.901 ± 0.299
1.803GlnIle: 1.803 ± 0.308
4.589GlnLys: 4.589 ± 0.726
4.016GlnLeu: 4.016 ± 0.532
1.639GlnMet: 1.639 ± 0.285
2.704GlnAsn: 2.704 ± 0.721
1.639GlnPro: 1.639 ± 0.365
2.95GlnGln: 2.95 ± 0.54
2.786GlnArg: 2.786 ± 0.45
1.967GlnSer: 1.967 ± 0.389
2.377GlnThr: 2.377 ± 0.564
2.786GlnVal: 2.786 ± 0.445
0.983GlnTrp: 0.983 ± 0.274
1.967GlnTyr: 1.967 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
2.213ArgAla: 2.213 ± 0.442
0.328ArgCys: 0.328 ± 0.183
3.688ArgAsp: 3.688 ± 0.511
4.016ArgGlu: 4.016 ± 0.662
2.95ArgPhe: 2.95 ± 0.421
2.868ArgGly: 2.868 ± 0.534
0.901ArgHis: 0.901 ± 0.309
4.262ArgIle: 4.262 ± 0.754
4.917ArgLys: 4.917 ± 0.946
3.77ArgLeu: 3.77 ± 0.556
1.229ArgMet: 1.229 ± 0.265
2.868ArgAsn: 2.868 ± 0.483
0.901ArgPro: 0.901 ± 0.296
1.885ArgGln: 1.885 ± 0.399
2.049ArgArg: 2.049 ± 0.481
1.967ArgSer: 1.967 ± 0.395
2.131ArgThr: 2.131 ± 0.42
3.278ArgVal: 3.278 ± 0.602
0.82ArgTrp: 0.82 ± 0.272
1.639ArgTyr: 1.639 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
3.934SerAla: 3.934 ± 0.605
0.328SerCys: 0.328 ± 0.153
2.704SerAsp: 2.704 ± 0.457
4.098SerGlu: 4.098 ± 0.462
2.049SerPhe: 2.049 ± 0.34
4.098SerGly: 4.098 ± 0.7
0.492SerHis: 0.492 ± 0.184
3.934SerIle: 3.934 ± 0.534
4.753SerLys: 4.753 ± 0.635
4.18SerLeu: 4.18 ± 0.402
2.213SerMet: 2.213 ± 0.525
2.868SerAsn: 2.868 ± 0.439
1.475SerPro: 1.475 ± 0.417
1.967SerGln: 1.967 ± 0.515
2.95SerArg: 2.95 ± 0.597
3.196SerSer: 3.196 ± 0.64
3.442SerThr: 3.442 ± 0.511
4.18SerVal: 4.18 ± 0.744
0.574SerTrp: 0.574 ± 0.209
1.967SerTyr: 1.967 ± 0.565
0.0SerXaa: 0.0 ± 0.0
Thr
3.688ThrAla: 3.688 ± 0.594
0.41ThrCys: 0.41 ± 0.25
2.623ThrAsp: 2.623 ± 0.409
4.917ThrGlu: 4.917 ± 0.579
3.278ThrPhe: 3.278 ± 0.356
4.344ThrGly: 4.344 ± 0.652
1.311ThrHis: 1.311 ± 0.316
4.262ThrIle: 4.262 ± 0.558
4.999ThrLys: 4.999 ± 0.716
5.081ThrLeu: 5.081 ± 0.856
1.475ThrMet: 1.475 ± 0.357
2.541ThrAsn: 2.541 ± 0.593
2.049ThrPro: 2.049 ± 0.446
2.213ThrGln: 2.213 ± 0.586
1.475ThrArg: 1.475 ± 0.302
2.95ThrSer: 2.95 ± 0.408
4.344ThrThr: 4.344 ± 0.768
3.278ThrVal: 3.278 ± 0.67
0.574ThrTrp: 0.574 ± 0.205
1.803ThrTyr: 1.803 ± 0.396
0.0ThrXaa: 0.0 ± 0.0
Val
3.852ValAla: 3.852 ± 0.687
0.41ValCys: 0.41 ± 0.281
3.852ValAsp: 3.852 ± 0.499
6.228ValGlu: 6.228 ± 0.959
2.377ValPhe: 2.377 ± 0.403
3.278ValGly: 3.278 ± 0.393
0.738ValHis: 0.738 ± 0.292
3.688ValIle: 3.688 ± 0.51
5.573ValLys: 5.573 ± 0.614
4.671ValLeu: 4.671 ± 0.674
1.475ValMet: 1.475 ± 0.472
3.606ValAsn: 3.606 ± 0.436
2.131ValPro: 2.131 ± 0.478
3.524ValGln: 3.524 ± 0.464
3.688ValArg: 3.688 ± 0.513
3.606ValSer: 3.606 ± 0.383
3.524ValThr: 3.524 ± 0.564
3.688ValVal: 3.688 ± 0.532
0.164ValTrp: 0.164 ± 0.122
1.967ValTyr: 1.967 ± 0.367
0.0ValXaa: 0.0 ± 0.0
Trp
0.738TrpAla: 0.738 ± 0.302
0.164TrpCys: 0.164 ± 0.14
0.983TrpAsp: 0.983 ± 0.401
0.738TrpGlu: 0.738 ± 0.288
1.065TrpPhe: 1.065 ± 0.32
0.328TrpGly: 0.328 ± 0.149
0.164TrpHis: 0.164 ± 0.114
1.311TrpIle: 1.311 ± 0.385
0.983TrpLys: 0.983 ± 0.29
1.065TrpLeu: 1.065 ± 0.326
0.246TrpMet: 0.246 ± 0.157
0.738TrpAsn: 0.738 ± 0.225
0.082TrpPro: 0.082 ± 0.089
0.656TrpGln: 0.656 ± 0.197
1.065TrpArg: 1.065 ± 0.36
0.738TrpSer: 0.738 ± 0.272
0.41TrpThr: 0.41 ± 0.185
0.82TrpVal: 0.82 ± 0.291
0.082TrpTrp: 0.082 ± 0.08
0.82TrpTyr: 0.82 ± 0.311
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.049TyrAla: 2.049 ± 0.556
0.492TyrCys: 0.492 ± 0.208
2.377TyrAsp: 2.377 ± 0.646
3.606TyrGlu: 3.606 ± 0.61
1.393TyrPhe: 1.393 ± 0.354
2.213TyrGly: 2.213 ± 0.358
0.901TyrHis: 0.901 ± 0.362
2.786TyrIle: 2.786 ± 0.446
2.131TyrLys: 2.131 ± 0.359
2.049TyrLeu: 2.049 ± 0.448
0.983TyrMet: 0.983 ± 0.334
2.049TyrAsn: 2.049 ± 0.434
0.901TyrPro: 0.901 ± 0.343
2.131TyrGln: 2.131 ± 0.593
2.049TyrArg: 2.049 ± 0.424
1.557TyrSer: 1.557 ± 0.333
1.721TyrThr: 1.721 ± 0.323
2.295TyrVal: 2.295 ± 0.408
0.41TyrTrp: 0.41 ± 0.183
1.393TyrTyr: 1.393 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12203 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski