Amino acid dipeptide frequency for Bacillus phage vB_Bpu_PumA1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
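As an illustration of what such a frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to normalise by the total number of dipeptides are assumptions made here, not necessarily the procedure used to produce the table.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: pairs are overlapping, never span two proteins, and are
    normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter codes).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

For the toy input, six pairs are counted in total and KK (LysLys) occurs twice, once in each sequence.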

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
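The uniform expectation and the over/under comparison can be sketched as follows. This is a hedged illustration: the helper names `expected_per_mille` and `classify` are hypothetical, and the exact threshold used for the red and blue colouring on the page is an assumption (here, a plain comparison against the uniform expectation).

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille (1000 / n^2)."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

# Values taken from the table, in per mille.
lyslys = classify(7.482)   # LysLys
alaala = classify(0.174)   # AlaAla
# Converting a per mille value to a plain fraction.
lyslys_fraction = 7.482 * 1e-3
```

With 20 amino acids the expectation is 2.5‰, so LysLys (7.482‰) falls above it and AlaAla (0.174‰) below it; with Xaa included (21 symbols) the expectation drops to about 2.27‰.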

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.174AlaAla: 0.174 ± 0.157
0.696AlaCys: 0.696 ± 0.386
2.784AlaAsp: 2.784 ± 0.681
2.436AlaGlu: 2.436 ± 0.641
1.914AlaPhe: 1.914 ± 0.61
4.002AlaGly: 4.002 ± 0.99
1.566AlaHis: 1.566 ± 0.541
4.176AlaIle: 4.176 ± 0.787
3.654AlaLys: 3.654 ± 0.776
4.524AlaLeu: 4.524 ± 1.173
1.914AlaMet: 1.914 ± 0.51
4.524AlaAsn: 4.524 ± 1.303
1.74AlaPro: 1.74 ± 0.501
1.914AlaGln: 1.914 ± 0.661
1.914AlaArg: 1.914 ± 0.567
3.654AlaSer: 3.654 ± 1.244
5.22AlaThr: 5.22 ± 1.646
4.524AlaVal: 4.524 ± 0.676
0.174AlaTrp: 0.174 ± 0.161
2.436AlaTyr: 2.436 ± 0.636
0.0AlaXaa: 0.0 ± 0.0
Cys
0.87CysAla: 0.87 ± 0.439
0.174CysCys: 0.174 ± 0.23
0.696CysAsp: 0.696 ± 0.314
0.522CysGlu: 0.522 ± 0.396
0.348CysPhe: 0.348 ± 0.226
1.044CysGly: 1.044 ± 0.82
0.0CysHis: 0.0 ± 0.0
0.348CysIle: 0.348 ± 0.237
0.348CysLys: 0.348 ± 0.212
0.348CysLeu: 0.348 ± 0.285
0.174CysMet: 0.174 ± 0.152
0.696CysAsn: 0.696 ± 0.276
0.174CysPro: 0.174 ± 0.166
0.348CysGln: 0.348 ± 0.274
0.174CysArg: 0.174 ± 0.152
0.696CysSer: 0.696 ± 0.329
0.522CysThr: 0.522 ± 0.31
0.87CysVal: 0.87 ± 0.377
0.0CysTrp: 0.0 ± 0.0
0.87CysTyr: 0.87 ± 0.397
0.0CysXaa: 0.0 ± 0.0
Asp
3.306AspAla: 3.306 ± 1.083
0.696AspCys: 0.696 ± 0.379
4.872AspAsp: 4.872 ± 0.732
3.828AspGlu: 3.828 ± 0.593
2.61AspPhe: 2.61 ± 0.683
4.002AspGly: 4.002 ± 1.092
1.392AspHis: 1.392 ± 0.604
5.742AspIle: 5.742 ± 0.784
5.394AspLys: 5.394 ± 0.9
2.958AspLeu: 2.958 ± 0.714
1.566AspMet: 1.566 ± 0.549
3.828AspAsn: 3.828 ± 0.774
2.436AspPro: 2.436 ± 0.565
1.044AspGln: 1.044 ± 0.422
2.088AspArg: 2.088 ± 0.715
4.698AspSer: 4.698 ± 0.926
3.132AspThr: 3.132 ± 0.644
4.872AspVal: 4.872 ± 0.692
0.522AspTrp: 0.522 ± 0.28
3.48AspTyr: 3.48 ± 1.054
0.0AspXaa: 0.0 ± 0.0
Glu
4.698GluAla: 4.698 ± 0.782
0.87GluCys: 0.87 ± 0.603
3.48GluAsp: 3.48 ± 0.877
5.046GluGlu: 5.046 ± 1.213
4.35GluPhe: 4.35 ± 0.849
4.176GluGly: 4.176 ± 0.751
1.218GluHis: 1.218 ± 0.484
5.22GluIle: 5.22 ± 0.863
4.35GluLys: 4.35 ± 0.832
6.612GluLeu: 6.612 ± 1.097
3.132GluMet: 3.132 ± 0.848
4.35GluAsn: 4.35 ± 0.947
1.914GluPro: 1.914 ± 0.776
2.436GluGln: 2.436 ± 0.616
1.914GluArg: 1.914 ± 0.644
3.48GluSer: 3.48 ± 0.706
4.524GluThr: 4.524 ± 1.071
5.568GluVal: 5.568 ± 1.029
0.696GluTrp: 0.696 ± 0.31
4.35GluTyr: 4.35 ± 1.184
0.0GluXaa: 0.0 ± 0.0
Phe
1.74PheAla: 1.74 ± 0.478
0.174PheCys: 0.174 ± 0.203
4.176PheAsp: 4.176 ± 1.001
3.828PheGlu: 3.828 ± 1.084
1.914PhePhe: 1.914 ± 0.739
2.61PheGly: 2.61 ± 0.632
1.566PheHis: 1.566 ± 0.643
4.002PheIle: 4.002 ± 0.892
5.742PheLys: 5.742 ± 1.2
3.654PheLeu: 3.654 ± 0.888
1.566PheMet: 1.566 ± 0.543
4.002PheAsn: 4.002 ± 1.008
0.87PhePro: 0.87 ± 0.334
1.392PheGln: 1.392 ± 0.754
1.392PheArg: 1.392 ± 0.426
1.914PheSer: 1.914 ± 0.662
2.436PheThr: 2.436 ± 0.768
2.784PheVal: 2.784 ± 0.687
0.174PheTrp: 0.174 ± 0.161
1.392PheTyr: 1.392 ± 0.756
0.0PheXaa: 0.0 ± 0.0
Gly
4.35GlyAla: 4.35 ± 1.496
0.174GlyCys: 0.174 ± 0.23
2.088GlyAsp: 2.088 ± 0.867
4.698GlyGlu: 4.698 ± 0.744
2.61GlyPhe: 2.61 ± 0.837
4.35GlyGly: 4.35 ± 1.429
0.87GlyHis: 0.87 ± 0.317
3.48GlyIle: 3.48 ± 1.087
5.394GlyLys: 5.394 ± 1.058
4.002GlyLeu: 4.002 ± 0.787
1.566GlyMet: 1.566 ± 0.483
3.306GlyAsn: 3.306 ± 1.096
0.0GlyPro: 0.0 ± 0.0
3.132GlyGln: 3.132 ± 0.548
2.784GlyArg: 2.784 ± 0.657
5.568GlySer: 5.568 ± 1.647
4.35GlyThr: 4.35 ± 0.772
4.35GlyVal: 4.35 ± 0.637
0.696GlyTrp: 0.696 ± 0.468
4.176GlyTyr: 4.176 ± 0.962
0.0GlyXaa: 0.0 ± 0.0
His
0.522HisAla: 0.522 ± 0.376
0.522HisCys: 0.522 ± 0.345
0.348HisAsp: 0.348 ± 0.229
1.392HisGlu: 1.392 ± 0.466
1.044HisPhe: 1.044 ± 0.37
1.218HisGly: 1.218 ± 0.603
0.348HisHis: 0.348 ± 0.262
1.392HisIle: 1.392 ± 0.539
1.392HisLys: 1.392 ± 0.584
1.74HisLeu: 1.74 ± 0.634
0.174HisMet: 0.174 ± 0.167
1.044HisAsn: 1.044 ± 0.389
0.696HisPro: 0.696 ± 0.352
0.348HisGln: 0.348 ± 0.208
0.0HisArg: 0.0 ± 0.0
1.218HisSer: 1.218 ± 0.377
0.87HisThr: 0.87 ± 0.314
1.392HisVal: 1.392 ± 0.446
0.174HisTrp: 0.174 ± 0.184
1.044HisTyr: 1.044 ± 0.65
0.0HisXaa: 0.0 ± 0.0
Ile
3.306IleAla: 3.306 ± 0.832
0.522IleCys: 0.522 ± 0.287
6.438IleAsp: 6.438 ± 0.953
5.394IleGlu: 5.394 ± 1.231
4.002IlePhe: 4.002 ± 0.792
4.176IleGly: 4.176 ± 0.88
0.696IleHis: 0.696 ± 0.613
3.48IleIle: 3.48 ± 0.767
4.872IleLys: 4.872 ± 1.093
4.002IleLeu: 4.002 ± 1.164
1.392IleMet: 1.392 ± 0.526
5.046IleAsn: 5.046 ± 0.786
1.914IlePro: 1.914 ± 0.476
2.61IleGln: 2.61 ± 0.472
3.132IleArg: 3.132 ± 0.62
2.262IleSer: 2.262 ± 0.51
4.524IleThr: 4.524 ± 0.802
5.22IleVal: 5.22 ± 0.966
0.87IleTrp: 0.87 ± 0.366
3.132IleTyr: 3.132 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
3.132LysAla: 3.132 ± 0.72
0.348LysCys: 0.348 ± 0.229
6.264LysAsp: 6.264 ± 0.93
6.612LysGlu: 6.612 ± 1.056
5.046LysPhe: 5.046 ± 1.339
5.22LysGly: 5.22 ± 0.979
0.87LysHis: 0.87 ± 0.348
5.568LysIle: 5.568 ± 1.115
7.482LysLys: 7.482 ± 1.548
5.394LysLeu: 5.394 ± 0.976
2.784LysMet: 2.784 ± 0.637
4.524LysAsn: 4.524 ± 1.065
2.61LysPro: 2.61 ± 0.7
2.61LysGln: 2.61 ± 0.549
2.784LysArg: 2.784 ± 0.67
2.436LysSer: 2.436 ± 0.7
5.742LysThr: 5.742 ± 1.088
4.002LysVal: 4.002 ± 0.891
1.218LysTrp: 1.218 ± 0.489
4.698LysTyr: 4.698 ± 0.994
0.0LysXaa: 0.0 ± 0.0
Leu
3.654LeuAla: 3.654 ± 0.719
0.522LeuCys: 0.522 ± 0.282
4.35LeuAsp: 4.35 ± 0.748
4.524LeuGlu: 4.524 ± 0.773
3.306LeuPhe: 3.306 ± 0.712
3.654LeuGly: 3.654 ± 0.89
1.566LeuHis: 1.566 ± 0.558
2.784LeuIle: 2.784 ± 0.697
6.612LeuLys: 6.612 ± 1.039
5.22LeuLeu: 5.22 ± 1.078
2.262LeuMet: 2.262 ± 0.378
6.438LeuAsn: 6.438 ± 0.882
2.436LeuPro: 2.436 ± 0.515
3.48LeuGln: 3.48 ± 0.822
4.002LeuArg: 4.002 ± 0.749
5.046LeuSer: 5.046 ± 0.936
6.786LeuThr: 6.786 ± 0.94
4.35LeuVal: 4.35 ± 0.84
0.522LeuTrp: 0.522 ± 0.276
3.132LeuTyr: 3.132 ± 0.636
0.0LeuXaa: 0.0 ± 0.0
Met
1.566MetAla: 1.566 ± 0.489
0.348MetCys: 0.348 ± 0.258
1.392MetAsp: 1.392 ± 0.505
2.088MetGlu: 2.088 ± 0.687
1.566MetPhe: 1.566 ± 0.564
1.392MetGly: 1.392 ± 0.582
0.348MetHis: 0.348 ± 0.218
1.566MetIle: 1.566 ± 0.595
2.262MetLys: 2.262 ± 0.603
1.74MetLeu: 1.74 ± 0.635
1.392MetMet: 1.392 ± 0.681
2.088MetAsn: 2.088 ± 0.435
1.218MetPro: 1.218 ± 0.342
0.348MetGln: 0.348 ± 0.252
1.044MetArg: 1.044 ± 0.545
2.436MetSer: 2.436 ± 0.853
3.306MetThr: 3.306 ± 0.819
2.088MetVal: 2.088 ± 0.784
0.0MetTrp: 0.0 ± 0.0
1.74MetTyr: 1.74 ± 0.536
0.0MetXaa: 0.0 ± 0.0
Asn
5.394AsnAla: 5.394 ± 1.379
0.522AsnCys: 0.522 ± 0.337
3.48AsnAsp: 3.48 ± 0.795
6.09AsnGlu: 6.09 ± 1.292
1.566AsnPhe: 1.566 ± 0.465
5.22AsnGly: 5.22 ± 1.242
0.696AsnHis: 0.696 ± 0.327
4.872AsnIle: 4.872 ± 0.622
4.176AsnLys: 4.176 ± 0.84
6.438AsnLeu: 6.438 ± 1.411
2.262AsnMet: 2.262 ± 0.699
4.002AsnAsn: 4.002 ± 1.05
1.914AsnPro: 1.914 ± 0.658
1.566AsnGln: 1.566 ± 0.427
4.176AsnArg: 4.176 ± 1.091
5.394AsnSer: 5.394 ± 0.575
2.262AsnThr: 2.262 ± 0.984
4.176AsnVal: 4.176 ± 0.656
0.348AsnTrp: 0.348 ± 0.231
3.132AsnTyr: 3.132 ± 0.522
0.0AsnXaa: 0.0 ± 0.0
Pro
2.262ProAla: 2.262 ± 0.533
0.348ProCys: 0.348 ± 0.237
2.436ProAsp: 2.436 ± 0.795
2.262ProGlu: 2.262 ± 0.744
1.044ProPhe: 1.044 ± 0.422
0.174ProGly: 0.174 ± 0.205
0.174ProHis: 0.174 ± 0.176
1.566ProIle: 1.566 ± 0.492
1.566ProLys: 1.566 ± 0.536
2.784ProLeu: 2.784 ± 0.65
0.87ProMet: 0.87 ± 0.372
1.74ProAsn: 1.74 ± 0.497
0.696ProPro: 0.696 ± 0.428
0.522ProGln: 0.522 ± 0.378
0.522ProArg: 0.522 ± 0.336
0.87ProSer: 0.87 ± 0.507
1.566ProThr: 1.566 ± 0.564
3.306ProVal: 3.306 ± 0.878
0.174ProTrp: 0.174 ± 0.205
3.306ProTyr: 3.306 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
2.784GlnAla: 2.784 ± 0.636
0.174GlnCys: 0.174 ± 0.169
1.566GlnAsp: 1.566 ± 0.48
2.088GlnGlu: 2.088 ± 0.659
2.262GlnPhe: 2.262 ± 0.652
3.306GlnGly: 3.306 ± 0.774
0.87GlnHis: 0.87 ± 0.474
2.436GlnIle: 2.436 ± 0.584
2.436GlnLys: 2.436 ± 0.661
2.958GlnLeu: 2.958 ± 0.605
0.348GlnMet: 0.348 ± 0.243
2.262GlnAsn: 2.262 ± 0.638
1.566GlnPro: 1.566 ± 0.687
1.566GlnGln: 1.566 ± 0.664
1.218GlnArg: 1.218 ± 0.579
1.566GlnSer: 1.566 ± 0.65
2.262GlnThr: 2.262 ± 0.636
2.958GlnVal: 2.958 ± 0.776
0.348GlnTrp: 0.348 ± 0.41
0.348GlnTyr: 0.348 ± 0.235
0.0GlnXaa: 0.0 ± 0.0
Arg
2.088ArgAla: 2.088 ± 0.562
0.348ArgCys: 0.348 ± 0.2
1.392ArgAsp: 1.392 ± 0.411
3.48ArgGlu: 3.48 ± 0.693
2.262ArgPhe: 2.262 ± 0.688
3.48ArgGly: 3.48 ± 1.295
1.044ArgHis: 1.044 ± 0.364
1.914ArgIle: 1.914 ± 0.556
3.828ArgLys: 3.828 ± 0.944
2.436ArgLeu: 2.436 ± 0.568
1.566ArgMet: 1.566 ± 0.481
2.61ArgAsn: 2.61 ± 0.697
1.044ArgPro: 1.044 ± 0.4
1.914ArgGln: 1.914 ± 0.62
1.218ArgArg: 1.218 ± 0.329
2.958ArgSer: 2.958 ± 0.585
2.784ArgThr: 2.784 ± 0.845
2.088ArgVal: 2.088 ± 0.452
0.0ArgTrp: 0.0 ± 0.0
2.61ArgTyr: 2.61 ± 0.796
0.0ArgXaa: 0.0 ± 0.0
Ser
3.654SerAla: 3.654 ± 1.118
1.218SerCys: 1.218 ± 0.467
3.654SerAsp: 3.654 ± 0.934
4.002SerGlu: 4.002 ± 0.614
2.262SerPhe: 2.262 ± 0.487
3.828SerGly: 3.828 ± 1.321
1.218SerHis: 1.218 ± 0.763
4.698SerIle: 4.698 ± 0.767
4.698SerLys: 4.698 ± 1.004
4.698SerLeu: 4.698 ± 1.007
2.088SerMet: 2.088 ± 0.545
3.828SerAsn: 3.828 ± 0.871
1.044SerPro: 1.044 ± 0.404
2.61SerGln: 2.61 ± 0.779
2.61SerArg: 2.61 ± 0.587
4.176SerSer: 4.176 ± 0.827
3.306SerThr: 3.306 ± 0.825
2.436SerVal: 2.436 ± 0.746
0.174SerTrp: 0.174 ± 0.169
2.784SerTyr: 2.784 ± 0.623
0.0SerXaa: 0.0 ± 0.0
Thr
4.35ThrAla: 4.35 ± 0.978
0.348ThrCys: 0.348 ± 0.244
3.654ThrAsp: 3.654 ± 0.938
3.306ThrGlu: 3.306 ± 0.763
4.002ThrPhe: 4.002 ± 0.981
3.48ThrGly: 3.48 ± 0.677
0.696ThrHis: 0.696 ± 0.384
5.568ThrIle: 5.568 ± 1.14
3.828ThrLys: 3.828 ± 0.744
5.568ThrLeu: 5.568 ± 0.708
2.262ThrMet: 2.262 ± 0.944
4.176ThrAsn: 4.176 ± 0.7
1.74ThrPro: 1.74 ± 0.452
3.306ThrGln: 3.306 ± 0.614
4.176ThrArg: 4.176 ± 0.591
3.48ThrSer: 3.48 ± 0.797
4.524ThrThr: 4.524 ± 1.155
4.872ThrVal: 4.872 ± 0.769
0.696ThrTrp: 0.696 ± 0.292
2.958ThrTyr: 2.958 ± 0.602
0.0ThrXaa: 0.0 ± 0.0
Val
2.61ValAla: 2.61 ± 0.581
0.522ValCys: 0.522 ± 0.34
5.742ValAsp: 5.742 ± 1.463
6.612ValGlu: 6.612 ± 0.832
2.436ValPhe: 2.436 ± 0.618
3.306ValGly: 3.306 ± 0.71
0.696ValHis: 0.696 ± 0.324
5.22ValIle: 5.22 ± 1.112
5.568ValLys: 5.568 ± 0.871
3.828ValLeu: 3.828 ± 0.88
1.392ValMet: 1.392 ± 0.471
5.046ValAsn: 5.046 ± 1.042
1.74ValPro: 1.74 ± 0.638
2.61ValGln: 2.61 ± 0.702
3.132ValArg: 3.132 ± 0.768
4.176ValSer: 4.176 ± 1.031
5.394ValThr: 5.394 ± 1.193
4.002ValVal: 4.002 ± 0.818
1.218ValTrp: 1.218 ± 0.444
2.436ValTyr: 2.436 ± 0.725
0.0ValXaa: 0.0 ± 0.0
Trp
0.348TrpAla: 0.348 ± 0.237
0.174TrpCys: 0.174 ± 0.161
0.348TrpAsp: 0.348 ± 0.22
0.174TrpGlu: 0.174 ± 0.205
0.348TrpPhe: 0.348 ± 0.273
0.348TrpGly: 0.348 ± 0.229
0.174TrpHis: 0.174 ± 0.169
0.348TrpIle: 0.348 ± 0.229
0.696TrpLys: 0.696 ± 0.267
1.392TrpLeu: 1.392 ± 0.433
0.174TrpMet: 0.174 ± 0.152
0.87TrpAsn: 0.87 ± 0.383
0.0TrpPro: 0.0 ± 0.0
0.522TrpGln: 0.522 ± 0.249
1.044TrpArg: 1.044 ± 0.444
1.218TrpSer: 1.218 ± 0.317
0.348TrpThr: 0.348 ± 0.265
0.348TrpVal: 0.348 ± 0.231
0.0TrpTrp: 0.0 ± 0.0
0.174TrpTyr: 0.174 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.132TyrAla: 3.132 ± 1.235
0.522TyrCys: 0.522 ± 0.312
3.306TyrAsp: 3.306 ± 0.846
4.002TyrGlu: 4.002 ± 0.966
2.436TyrPhe: 2.436 ± 0.721
2.958TyrGly: 2.958 ± 0.784
0.87TyrHis: 0.87 ± 0.392
2.784TyrIle: 2.784 ± 0.557
5.046TyrLys: 5.046 ± 1.175
4.35TyrLeu: 4.35 ± 0.816
0.87TyrMet: 0.87 ± 0.372
3.306TyrAsn: 3.306 ± 0.477
2.436TyrPro: 2.436 ± 0.614
1.044TyrGln: 1.044 ± 0.524
1.74TyrArg: 1.74 ± 0.698
1.74TyrSer: 1.74 ± 0.447
3.132TyrThr: 3.132 ± 0.698
3.48TyrVal: 3.48 ± 0.74
1.044TyrTrp: 1.044 ± 0.34
1.74TyrTyr: 1.74 ± 0.549
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (5748 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
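A protein-level bootstrap of this kind can be sketched as below. This is an assumption-laden illustration, not the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The names `pair_per_mille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation are all choices made here.

```python
import random
from statistics import stdev

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs within proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_per_mille(resampled, pair))
    return stdev(samples)

# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKKLA", "AKKT", "MLLA", "KKKK"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.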

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski