Amino acid dipeptide frequency for Bacillus phage Stitch

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.
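
The arithmetic behind the expected value can be sketched as follows. This is only an illustration: the function names are hypothetical, and comparing each value to the uniform expectation is an assumption about how over- and underrepresentation is decided, since the page does not state the exact rule used for colouring.

```python
def expected_uniform_permille(alphabet_size: int = 20) -> float:
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_permille: float, expected_permille: float) -> str:
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = expected_uniform_permille(20)  # 400 dipeptides
expected_21 = expected_uniform_permille(21)  # 441 dipeptides, including Xaa

# Two values taken from the table on this page.
print(classify(10.976, expected_20))  # LysLys
print(classify(0.146, expected_20))   # CysCys
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to roughly 2.27‰.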

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
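
A minimal sketch of how such per-mille dipeptide frequencies could be computed from a set of protein sequences is given below. It is not the code used to build this page: the function name, the toy sequences, and in particular the normalisation by the total number of dipeptides are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, ..., Tyr),
# with X standing in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within a protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    # Assumed normalisation: divide by the total number of dipeptides.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


toy = ["MKKLAE", "MKB"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_permille(toy)
print(len(freqs))              # number of dipeptides considered
print(freqs["MK"])             # frequency in per mille
print(freqs["MK"] * 1e-3)      # the same value as a fraction
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where absent dipeptides are listed as 0.0.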

For more information, see the sequence space article on Wikipedia.

Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.293 ± 0.168
AlaCys: 1.024 ± 0.331
AlaAsp: 2.781 ± 0.715
AlaGlu: 3.366 ± 0.876
AlaPhe: 3.22 ± 0.726
AlaGly: 3.805 ± 0.877
AlaHis: 0.0 ± 0.0
AlaIle: 2.634 ± 0.624
AlaLys: 4.83 ± 0.766
AlaLeu: 2.634 ± 0.607
AlaMet: 2.195 ± 0.505
AlaAsn: 2.488 ± 0.444
AlaPro: 1.61 ± 0.66
AlaGln: 2.195 ± 0.463
AlaArg: 1.317 ± 0.307
AlaSer: 2.049 ± 0.421
AlaThr: 3.22 ± 0.662
AlaVal: 2.634 ± 0.774
AlaTrp: 0.732 ± 0.368
AlaTyr: 2.195 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.585 ± 0.312
CysCys: 0.146 ± 0.147
CysAsp: 1.61 ± 0.448
CysGlu: 0.878 ± 0.312
CysPhe: 0.293 ± 0.191
CysGly: 0.878 ± 0.483
CysHis: 0.146 ± 0.126
CysIle: 0.585 ± 0.252
CysLys: 1.171 ± 0.366
CysLeu: 0.293 ± 0.2
CysMet: 0.439 ± 0.229
CysAsn: 0.293 ± 0.187
CysPro: 0.732 ± 0.336
CysGln: 0.146 ± 0.144
CysArg: 0.439 ± 0.3
CysSer: 0.439 ± 0.229
CysThr: 0.146 ± 0.167
CysVal: 0.439 ± 0.287
CysTrp: 0.293 ± 0.186
CysTyr: 0.585 ± 0.245
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.073 ± 0.509
AspCys: 1.463 ± 0.639
AspAsp: 3.366 ± 0.624
AspGlu: 5.122 ± 0.836
AspPhe: 4.244 ± 0.622
AspGly: 5.708 ± 1.274
AspHis: 0.878 ± 0.356
AspIle: 5.269 ± 0.836
AspLys: 6.586 ± 0.877
AspLeu: 3.805 ± 0.793
AspMet: 2.634 ± 0.606
AspAsn: 4.537 ± 0.78
AspPro: 2.342 ± 0.595
AspGln: 2.195 ± 0.379
AspArg: 1.903 ± 0.554
AspSer: 2.195 ± 0.745
AspThr: 3.366 ± 0.724
AspVal: 5.269 ± 0.907
AspTrp: 0.293 ± 0.176
AspTyr: 3.073 ± 0.543
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.927 ± 0.704
GluCys: 0.585 ± 0.268
GluAsp: 3.659 ± 0.644
GluGlu: 5.708 ± 1.518
GluPhe: 4.098 ± 0.969
GluGly: 4.244 ± 0.919
GluHis: 0.732 ± 0.245
GluIle: 5.854 ± 0.915
GluLys: 6.439 ± 1.169
GluLeu: 9.22 ± 1.634
GluMet: 2.634 ± 0.703
GluAsn: 5.708 ± 0.718
GluPro: 0.878 ± 0.456
GluGln: 3.22 ± 0.648
GluArg: 3.805 ± 0.952
GluSer: 4.976 ± 0.761
GluThr: 4.537 ± 1.071
GluVal: 5.122 ± 0.867
GluTrp: 1.171 ± 0.364
GluTyr: 3.366 ± 0.674
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.049 ± 0.415
PheCys: 0.439 ± 0.218
PheAsp: 3.659 ± 0.54
PheGlu: 4.39 ± 0.64
PhePhe: 1.756 ± 0.427
PheGly: 2.049 ± 0.647
PheHis: 1.463 ± 0.427
PheIle: 4.39 ± 0.74
PheLys: 5.269 ± 1.146
PheLeu: 2.634 ± 0.522
PheMet: 2.195 ± 0.582
PheAsn: 2.781 ± 0.813
PhePro: 0.878 ± 0.361
PheGln: 0.878 ± 0.392
PheArg: 0.878 ± 0.37
PheSer: 4.098 ± 0.763
PheThr: 2.342 ± 0.594
PheVal: 2.634 ± 0.716
PheTrp: 0.439 ± 0.221
PheTyr: 2.342 ± 0.562
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.22 ± 0.997
GlyCys: 0.146 ± 0.155
GlyAsp: 2.488 ± 0.802
GlyGlu: 6.147 ± 0.804
GlyPhe: 3.073 ± 0.563
GlyGly: 4.098 ± 0.865
GlyHis: 0.878 ± 0.39
GlyIle: 4.976 ± 0.89
GlyLys: 6.586 ± 1.245
GlyLeu: 4.83 ± 0.776
GlyMet: 2.634 ± 0.678
GlyAsn: 5.122 ± 1.06
GlyPro: 0.293 ± 0.175
GlyGln: 1.903 ± 0.575
GlyArg: 2.195 ± 0.455
GlySer: 4.098 ± 0.646
GlyThr: 5.854 ± 1.314
GlyVal: 3.659 ± 0.63
GlyTrp: 0.293 ± 0.218
GlyTyr: 3.22 ± 0.682
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.585 ± 0.24
HisCys: 0.146 ± 0.128
HisAsp: 1.317 ± 0.506
HisGlu: 0.439 ± 0.275
HisPhe: 0.732 ± 0.292
HisGly: 0.439 ± 0.231
HisHis: 0.585 ± 0.323
HisIle: 1.756 ± 0.422
HisLys: 0.878 ± 0.326
HisLeu: 1.317 ± 0.501
HisMet: 0.293 ± 0.179
HisAsn: 0.732 ± 0.256
HisPro: 0.0 ± 0.0
HisGln: 0.439 ± 0.221
HisArg: 0.585 ± 0.233
HisSer: 0.293 ± 0.212
HisThr: 1.463 ± 0.402
HisVal: 1.171 ± 0.48
HisTrp: 0.146 ± 0.161
HisTyr: 0.878 ± 0.381
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.512 ± 0.625
IleCys: 0.293 ± 0.194
IleAsp: 6.147 ± 0.701
IleGlu: 6.147 ± 1.337
IlePhe: 2.634 ± 0.568
IleGly: 4.976 ± 0.601
IleHis: 1.024 ± 0.459
IleIle: 4.098 ± 0.988
IleLys: 7.025 ± 1.233
IleLeu: 2.049 ± 0.667
IleMet: 2.195 ± 0.466
IleAsn: 5.269 ± 0.753
IlePro: 1.463 ± 0.473
IleGln: 2.488 ± 0.4
IleArg: 3.805 ± 0.72
IleSer: 2.927 ± 0.608
IleThr: 5.122 ± 0.82
IleVal: 3.805 ± 0.782
IleTrp: 0.878 ± 0.375
IleTyr: 2.634 ± 0.574
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.683 ± 1.126
LysCys: 1.317 ± 0.471
LysAsp: 8.049 ± 1.133
LysGlu: 8.781 ± 1.833
LysPhe: 4.39 ± 0.82
LysGly: 6.732 ± 1.05
LysHis: 1.463 ± 0.462
LysIle: 6.439 ± 0.885
LysLys: 10.976 ± 1.193
LysLeu: 6.586 ± 0.837
LysMet: 3.366 ± 0.852
LysAsn: 4.683 ± 0.741
LysPro: 1.317 ± 0.548
LysGln: 3.366 ± 0.888
LysArg: 3.805 ± 0.595
LysSer: 3.951 ± 0.815
LysThr: 4.976 ± 0.803
LysVal: 6.878 ± 0.907
LysTrp: 0.878 ± 0.304
LysTyr: 4.537 ± 0.821
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.781 ± 0.549
LeuCys: 0.146 ± 0.147
LeuAsp: 5.708 ± 0.846
LeuGlu: 5.561 ± 1.067
LeuPhe: 2.342 ± 0.582
LeuGly: 4.39 ± 0.845
LeuHis: 1.317 ± 0.402
LeuIle: 3.512 ± 0.811
LeuLys: 8.488 ± 1.387
LeuLeu: 3.659 ± 0.668
LeuMet: 1.756 ± 0.452
LeuAsn: 4.244 ± 0.504
LeuPro: 2.342 ± 0.469
LeuGln: 3.512 ± 0.81
LeuArg: 2.781 ± 0.591
LeuSer: 3.512 ± 0.634
LeuThr: 3.512 ± 0.556
LeuVal: 3.659 ± 0.709
LeuTrp: 0.732 ± 0.283
LeuTyr: 2.927 ± 0.569
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.024 ± 0.324
MetCys: 0.878 ± 0.372
MetAsp: 2.049 ± 0.499
MetGlu: 1.903 ± 0.555
MetPhe: 2.195 ± 0.611
MetGly: 1.903 ± 0.473
MetHis: 0.146 ± 0.128
MetIle: 2.049 ± 0.727
MetLys: 2.342 ± 0.522
MetLeu: 2.049 ± 0.467
MetMet: 1.463 ± 0.599
MetAsn: 3.659 ± 0.572
MetPro: 0.585 ± 0.291
MetGln: 1.463 ± 0.444
MetArg: 1.317 ± 0.402
MetSer: 2.342 ± 0.493
MetThr: 1.756 ± 0.41
MetVal: 1.463 ± 0.429
MetTrp: 1.317 ± 0.507
MetTyr: 1.463 ± 0.369
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.634 ± 0.717
AsnCys: 0.585 ± 0.369
AsnAsp: 4.976 ± 0.577
AsnGlu: 4.683 ± 0.76
AsnPhe: 3.22 ± 0.671
AsnGly: 4.537 ± 1.102
AsnHis: 0.878 ± 0.293
AsnIle: 4.83 ± 0.639
AsnLys: 5.415 ± 0.78
AsnLeu: 4.39 ± 0.784
AsnMet: 1.171 ± 0.459
AsnAsn: 6.439 ± 1.327
AsnPro: 2.634 ± 0.592
AsnGln: 2.927 ± 0.619
AsnArg: 3.073 ± 0.793
AsnSer: 4.537 ± 0.844
AsnThr: 3.073 ± 1.069
AsnVal: 4.976 ± 0.898
AsnTrp: 0.878 ± 0.354
AsnTyr: 3.366 ± 0.624
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.024 ± 0.45
ProCys: 0.293 ± 0.194
ProAsp: 1.61 ± 0.407
ProGlu: 2.195 ± 0.657
ProPhe: 1.317 ± 0.418
ProGly: 0.878 ± 0.324
ProHis: 0.146 ± 0.148
ProIle: 1.463 ± 0.41
ProLys: 2.049 ± 0.603
ProLeu: 1.317 ± 0.369
ProMet: 0.878 ± 0.497
ProAsn: 1.756 ± 0.617
ProPro: 0.732 ± 0.25
ProGln: 0.439 ± 0.219
ProArg: 0.732 ± 0.372
ProSer: 1.61 ± 0.596
ProThr: 1.903 ± 0.56
ProVal: 2.195 ± 0.649
ProTrp: 0.293 ± 0.202
ProTyr: 1.61 ± 0.539
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.024 ± 0.316
GlnCys: 0.146 ± 0.126
GlnAsp: 1.903 ± 0.623
GlnGlu: 1.61 ± 0.623
GlnPhe: 1.317 ± 0.372
GlnGly: 2.342 ± 0.818
GlnHis: 0.146 ± 0.123
GlnIle: 1.756 ± 0.541
GlnLys: 3.22 ± 0.651
GlnLeu: 4.683 ± 1.041
GlnMet: 0.585 ± 0.269
GlnAsn: 3.22 ± 0.45
GlnPro: 1.024 ± 0.349
GlnGln: 0.732 ± 0.264
GlnArg: 1.61 ± 0.467
GlnSer: 1.61 ± 0.349
GlnThr: 2.488 ± 0.607
GlnVal: 2.049 ± 0.494
GlnTrp: 0.293 ± 0.194
GlnTyr: 2.634 ± 0.518
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.512 ± 0.627
ArgCys: 0.293 ± 0.196
ArgAsp: 2.488 ± 0.634
ArgGlu: 3.659 ± 0.873
ArgPhe: 2.488 ± 0.498
ArgGly: 2.195 ± 0.658
ArgHis: 0.585 ± 0.3
ArgIle: 1.756 ± 0.477
ArgLys: 4.39 ± 0.741
ArgLeu: 1.903 ± 0.473
ArgMet: 1.317 ± 0.431
ArgAsn: 1.903 ± 0.502
ArgPro: 0.878 ± 0.394
ArgGln: 1.61 ± 0.448
ArgArg: 2.342 ± 0.712
ArgSer: 1.756 ± 0.344
ArgThr: 2.049 ± 0.708
ArgVal: 1.756 ± 0.738
ArgTrp: 0.293 ± 0.139
ArgTyr: 2.634 ± 0.76
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.634 ± 0.525
SerCys: 0.585 ± 0.26
SerAsp: 3.073 ± 0.667
SerGlu: 3.805 ± 0.61
SerPhe: 2.634 ± 0.601
SerGly: 3.073 ± 0.588
SerHis: 1.171 ± 0.371
SerIle: 3.366 ± 0.8
SerLys: 4.098 ± 0.793
SerLeu: 4.537 ± 0.72
SerMet: 1.756 ± 0.8
SerAsn: 4.39 ± 0.887
SerPro: 0.878 ± 0.39
SerGln: 2.049 ± 0.558
SerArg: 2.488 ± 0.667
SerSer: 2.634 ± 0.73
SerThr: 2.781 ± 0.683
SerVal: 3.659 ± 0.739
SerTrp: 0.293 ± 0.222
SerTyr: 2.927 ± 0.673
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.659 ± 0.897
ThrCys: 0.878 ± 0.362
ThrAsp: 3.22 ± 0.721
ThrGlu: 4.244 ± 0.917
ThrPhe: 2.049 ± 0.536
ThrGly: 4.683 ± 1.138
ThrHis: 0.585 ± 0.329
ThrIle: 6.586 ± 0.8
ThrLys: 4.83 ± 0.812
ThrLeu: 4.976 ± 0.98
ThrMet: 2.342 ± 0.479
ThrAsn: 3.22 ± 0.64
ThrPro: 1.756 ± 0.455
ThrGln: 1.317 ± 0.394
ThrArg: 1.903 ± 0.457
ThrSer: 3.951 ± 0.813
ThrThr: 4.39 ± 1.518
ThrVal: 3.805 ± 0.619
ThrTrp: 0.293 ± 0.183
ThrTyr: 1.463 ± 0.42
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.634 ± 0.533
ValCys: 0.146 ± 0.134
ValAsp: 5.269 ± 0.81
ValGlu: 5.561 ± 0.817
ValPhe: 2.342 ± 0.601
ValGly: 5.269 ± 0.6
ValHis: 1.024 ± 0.399
ValIle: 3.805 ± 0.689
ValLys: 6.586 ± 0.959
ValLeu: 3.366 ± 0.621
ValMet: 1.61 ± 0.468
ValAsn: 3.512 ± 0.748
ValPro: 3.073 ± 0.749
ValGln: 1.463 ± 0.479
ValArg: 2.195 ± 0.451
ValSer: 3.366 ± 0.441
ValThr: 4.244 ± 1.121
ValVal: 4.683 ± 1.022
ValTrp: 1.024 ± 0.424
ValTyr: 2.781 ± 0.605
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.146 ± 0.167
TrpCys: 0.146 ± 0.123
TrpAsp: 0.439 ± 0.35
TrpGlu: 0.585 ± 0.32
TrpPhe: 1.61 ± 0.403
TrpGly: 0.585 ± 0.259
TrpHis: 0.585 ± 0.228
TrpIle: 0.293 ± 0.207
TrpLys: 0.878 ± 0.313
TrpLeu: 0.878 ± 0.457
TrpMet: 0.585 ± 0.297
TrpAsn: 1.61 ± 0.498
TrpPro: 0.0 ± 0.0
TrpGln: 0.585 ± 0.243
TrpArg: 0.878 ± 0.341
TrpSer: 0.439 ± 0.265
TrpThr: 0.293 ± 0.215
TrpVal: 0.732 ± 0.332
TrpTrp: 0.293 ± 0.227
TrpTyr: 0.585 ± 0.321
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.22 ± 0.607
TyrCys: 0.878 ± 0.374
TyrAsp: 3.512 ± 0.571
TyrGlu: 4.098 ± 0.738
TyrPhe: 1.61 ± 0.388
TyrGly: 2.927 ± 0.564
TyrHis: 0.439 ± 0.222
TyrIle: 3.073 ± 0.84
TyrLys: 5.122 ± 0.874
TyrLeu: 1.756 ± 0.523
TyrMet: 1.171 ± 0.421
TyrAsn: 3.659 ± 0.541
TyrPro: 1.171 ± 0.292
TyrGln: 1.463 ± 0.356
TyrArg: 1.756 ± 0.471
TyrSer: 2.049 ± 0.526
TyrThr: 2.634 ± 0.539
TyrVal: 3.366 ± 0.693
TyrTrp: 1.317 ± 0.576
TyrTyr: 2.488 ± 0.625
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 36 proteins (6834 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
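
A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the procedure actually used for this page: the function names, the fixed seed, the toy sequences, and the use of the population standard deviation as the reported error are all hypothetical choices.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return pstdev(estimates)


toy = ["MKKLAE", "MKKKD", "MAEL", "MLKKE"]  # hypothetical sequences
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, which is what "at the protein level" suggests.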

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski