Amino acid dipepetide frequency for Streptococcus phage Javan346

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.258AlaAla: 3.258 ± 0.76
0.296AlaCys: 0.296 ± 0.172
3.851AlaAsp: 3.851 ± 0.633
4.641AlaGlu: 4.641 ± 0.719
2.172AlaPhe: 2.172 ± 0.346
4.147AlaGly: 4.147 ± 0.803
0.691AlaHis: 0.691 ± 0.223
5.233AlaIle: 5.233 ± 0.998
5.628AlaLys: 5.628 ± 0.585
6.023AlaLeu: 6.023 ± 0.92
1.679AlaMet: 1.679 ± 0.452
3.258AlaAsn: 3.258 ± 0.519
1.58AlaPro: 1.58 ± 0.443
1.876AlaGln: 1.876 ± 0.455
2.666AlaArg: 2.666 ± 0.651
4.739AlaSer: 4.739 ± 0.892
3.752AlaThr: 3.752 ± 0.618
3.949AlaVal: 3.949 ± 0.657
0.494AlaTrp: 0.494 ± 0.215
3.752AlaTyr: 3.752 ± 0.528
0.0AlaXaa: 0.0 ± 0.0
Cys
0.494CysAla: 0.494 ± 0.312
0.099CysCys: 0.099 ± 0.105
0.395CysAsp: 0.395 ± 0.209
0.592CysGlu: 0.592 ± 0.237
0.296CysPhe: 0.296 ± 0.194
0.395CysGly: 0.395 ± 0.165
0.494CysHis: 0.494 ± 0.205
0.592CysIle: 0.592 ± 0.288
0.296CysLys: 0.296 ± 0.2
0.691CysLeu: 0.691 ± 0.341
0.0CysMet: 0.0 ± 0.0
0.099CysAsn: 0.099 ± 0.096
0.296CysPro: 0.296 ± 0.172
0.592CysGln: 0.592 ± 0.263
0.197CysArg: 0.197 ± 0.14
0.296CysSer: 0.296 ± 0.188
0.197CysThr: 0.197 ± 0.151
0.395CysVal: 0.395 ± 0.187
0.0CysTrp: 0.0 ± 0.0
0.691CysTyr: 0.691 ± 0.287
0.0CysXaa: 0.0 ± 0.0
Asp
3.258AspAla: 3.258 ± 0.506
0.197AspCys: 0.197 ± 0.13
3.555AspAsp: 3.555 ± 0.811
5.924AspGlu: 5.924 ± 0.785
3.752AspPhe: 3.752 ± 0.665
3.357AspGly: 3.357 ± 0.873
0.79AspHis: 0.79 ± 0.265
4.739AspIle: 4.739 ± 0.589
5.727AspLys: 5.727 ± 0.624
5.529AspLeu: 5.529 ± 0.712
1.481AspMet: 1.481 ± 0.35
2.468AspAsn: 2.468 ± 0.491
2.271AspPro: 2.271 ± 0.58
1.382AspGln: 1.382 ± 0.362
2.468AspArg: 2.468 ± 0.586
4.641AspSer: 4.641 ± 0.81
2.863AspThr: 2.863 ± 0.4
3.357AspVal: 3.357 ± 0.519
0.592AspTrp: 0.592 ± 0.163
2.765AspTyr: 2.765 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
4.246GluAla: 4.246 ± 0.589
0.592GluCys: 0.592 ± 0.281
3.851GluAsp: 3.851 ± 0.768
5.529GluGlu: 5.529 ± 1.016
2.666GluPhe: 2.666 ± 0.543
4.542GluGly: 4.542 ± 0.532
1.284GluHis: 1.284 ± 0.363
4.838GluIle: 4.838 ± 0.643
6.22GluLys: 6.22 ± 0.779
8.195GluLeu: 8.195 ± 0.994
2.567GluMet: 2.567 ± 0.644
4.542GluAsn: 4.542 ± 0.598
1.975GluPro: 1.975 ± 0.381
4.344GluGln: 4.344 ± 0.717
1.876GluArg: 1.876 ± 0.432
4.443GluSer: 4.443 ± 0.646
5.233GluThr: 5.233 ± 0.747
4.443GluVal: 4.443 ± 0.635
0.889GluTrp: 0.889 ± 0.371
1.876GluTyr: 1.876 ± 0.419
0.0GluXaa: 0.0 ± 0.0
Phe
2.073PheAla: 2.073 ± 0.457
0.592PheCys: 0.592 ± 0.226
2.962PheAsp: 2.962 ± 0.46
2.271PheGlu: 2.271 ± 0.407
1.679PhePhe: 1.679 ± 0.55
2.37PheGly: 2.37 ± 0.365
1.086PheHis: 1.086 ± 0.313
2.962PheIle: 2.962 ± 0.594
3.16PheLys: 3.16 ± 0.676
2.863PheLeu: 2.863 ± 0.556
1.086PheMet: 1.086 ± 0.311
2.468PheAsn: 2.468 ± 0.35
0.691PhePro: 0.691 ± 0.333
1.58PheGln: 1.58 ± 0.32
1.481PheArg: 1.481 ± 0.301
2.666PheSer: 2.666 ± 0.405
2.271PheThr: 2.271 ± 0.594
2.073PheVal: 2.073 ± 0.485
0.592PheTrp: 0.592 ± 0.264
1.876PheTyr: 1.876 ± 0.503
0.0PheXaa: 0.0 ± 0.0
Gly
4.344GlyAla: 4.344 ± 0.538
0.296GlyCys: 0.296 ± 0.171
4.246GlyAsp: 4.246 ± 0.684
3.357GlyGlu: 3.357 ± 0.403
2.666GlyPhe: 2.666 ± 0.61
4.542GlyGly: 4.542 ± 0.776
1.481GlyHis: 1.481 ± 0.357
5.134GlyIle: 5.134 ± 0.727
4.641GlyLys: 4.641 ± 0.685
7.01GlyLeu: 7.01 ± 0.644
2.271GlyMet: 2.271 ± 0.433
4.048GlyAsn: 4.048 ± 0.743
0.494GlyPro: 0.494 ± 0.208
1.975GlyGln: 1.975 ± 0.432
3.653GlyArg: 3.653 ± 0.68
3.555GlySer: 3.555 ± 0.512
4.048GlyThr: 4.048 ± 0.642
3.456GlyVal: 3.456 ± 0.789
1.284GlyTrp: 1.284 ± 0.552
2.172GlyTyr: 2.172 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
0.889HisAla: 0.889 ± 0.276
0.197HisCys: 0.197 ± 0.2
1.185HisAsp: 1.185 ± 0.411
0.889HisGlu: 0.889 ± 0.272
0.691HisPhe: 0.691 ± 0.233
1.284HisGly: 1.284 ± 0.307
0.296HisHis: 0.296 ± 0.188
1.481HisIle: 1.481 ± 0.365
0.79HisLys: 0.79 ± 0.267
2.271HisLeu: 2.271 ± 0.526
0.494HisMet: 0.494 ± 0.257
0.987HisAsn: 0.987 ± 0.285
0.889HisPro: 0.889 ± 0.336
0.889HisGln: 0.889 ± 0.297
0.987HisArg: 0.987 ± 0.28
0.987HisSer: 0.987 ± 0.31
0.889HisThr: 0.889 ± 0.384
0.987HisVal: 0.987 ± 0.334
0.296HisTrp: 0.296 ± 0.143
0.691HisTyr: 0.691 ± 0.407
0.0HisXaa: 0.0 ± 0.0
Ile
4.937IleAla: 4.937 ± 0.509
0.691IleCys: 0.691 ± 0.251
5.036IleAsp: 5.036 ± 0.631
6.319IleGlu: 6.319 ± 0.588
1.58IlePhe: 1.58 ± 0.254
5.233IleGly: 5.233 ± 0.64
1.185IleHis: 1.185 ± 0.343
4.048IleIle: 4.048 ± 0.866
7.01IleLys: 7.01 ± 0.973
5.628IleLeu: 5.628 ± 0.728
1.185IleMet: 1.185 ± 0.278
3.555IleAsn: 3.555 ± 0.71
1.975IlePro: 1.975 ± 0.42
2.172IleGln: 2.172 ± 0.399
1.876IleArg: 1.876 ± 0.384
6.22IleSer: 6.22 ± 1.331
4.937IleThr: 4.937 ± 0.883
4.443IleVal: 4.443 ± 0.858
1.185IleTrp: 1.185 ± 0.391
1.975IleTyr: 1.975 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
6.22LysAla: 6.22 ± 0.628
0.395LysCys: 0.395 ± 0.2
5.529LysAsp: 5.529 ± 0.71
6.714LysGlu: 6.714 ± 0.821
2.37LysPhe: 2.37 ± 0.385
3.851LysGly: 3.851 ± 0.529
1.086LysHis: 1.086 ± 0.334
6.122LysIle: 6.122 ± 0.733
4.937LysLys: 4.937 ± 0.801
6.912LysLeu: 6.912 ± 0.858
1.086LysMet: 1.086 ± 0.305
3.456LysAsn: 3.456 ± 0.636
3.061LysPro: 3.061 ± 0.515
4.246LysGln: 4.246 ± 0.848
3.949LysArg: 3.949 ± 0.671
6.122LysSer: 6.122 ± 0.733
5.727LysThr: 5.727 ± 0.75
4.048LysVal: 4.048 ± 0.47
1.185LysTrp: 1.185 ± 0.309
1.679LysTyr: 1.679 ± 0.497
0.0LysXaa: 0.0 ± 0.0
Leu
7.603LeuAla: 7.603 ± 1.057
0.494LeuCys: 0.494 ± 0.217
5.036LeuAsp: 5.036 ± 0.57
7.01LeuGlu: 7.01 ± 1.0
2.666LeuPhe: 2.666 ± 0.531
4.542LeuGly: 4.542 ± 0.698
1.481LeuHis: 1.481 ± 0.366
5.233LeuIle: 5.233 ± 0.813
7.701LeuLys: 7.701 ± 0.687
7.306LeuLeu: 7.306 ± 0.964
2.172LeuMet: 2.172 ± 0.481
4.937LeuAsn: 4.937 ± 0.743
3.456LeuPro: 3.456 ± 0.553
3.555LeuGln: 3.555 ± 0.581
4.542LeuArg: 4.542 ± 0.714
7.306LeuSer: 7.306 ± 1.028
6.813LeuThr: 6.813 ± 0.682
5.233LeuVal: 5.233 ± 0.681
0.889LeuTrp: 0.889 ± 0.215
4.147LeuTyr: 4.147 ± 0.979
0.0LeuXaa: 0.0 ± 0.0
Met
1.382MetAla: 1.382 ± 0.326
0.099MetCys: 0.099 ± 0.1
2.073MetAsp: 2.073 ± 0.582
1.777MetGlu: 1.777 ± 0.433
0.889MetPhe: 0.889 ± 0.254
1.679MetGly: 1.679 ± 0.456
0.099MetHis: 0.099 ± 0.11
1.679MetIle: 1.679 ± 0.342
1.777MetLys: 1.777 ± 0.449
1.382MetLeu: 1.382 ± 0.386
0.592MetMet: 0.592 ± 0.262
0.889MetAsn: 0.889 ± 0.353
0.296MetPro: 0.296 ± 0.136
0.79MetGln: 0.79 ± 0.201
1.382MetArg: 1.382 ± 0.396
1.481MetSer: 1.481 ± 0.404
2.37MetThr: 2.37 ± 0.603
1.086MetVal: 1.086 ± 0.325
0.296MetTrp: 0.296 ± 0.189
0.197MetTyr: 0.197 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
3.949AsnAla: 3.949 ± 0.642
0.197AsnCys: 0.197 ± 0.13
2.37AsnAsp: 2.37 ± 0.572
4.443AsnGlu: 4.443 ± 0.548
2.567AsnPhe: 2.567 ± 0.507
5.727AsnGly: 5.727 ± 0.937
1.481AsnHis: 1.481 ± 0.419
2.765AsnIle: 2.765 ± 0.492
2.863AsnLys: 2.863 ± 0.426
5.43AsnLeu: 5.43 ± 0.841
0.987AsnMet: 0.987 ± 0.318
1.284AsnAsn: 1.284 ± 0.334
1.876AsnPro: 1.876 ± 0.401
1.876AsnGln: 1.876 ± 0.324
2.567AsnArg: 2.567 ± 0.43
4.048AsnSer: 4.048 ± 0.831
2.468AsnThr: 2.468 ± 0.594
3.258AsnVal: 3.258 ± 0.455
0.889AsnTrp: 0.889 ± 0.271
1.382AsnTyr: 1.382 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
0.79ProAla: 0.79 ± 0.272
0.296ProCys: 0.296 ± 0.177
1.481ProAsp: 1.481 ± 0.288
1.481ProGlu: 1.481 ± 0.51
0.987ProPhe: 0.987 ± 0.354
0.987ProGly: 0.987 ± 0.416
0.592ProHis: 0.592 ± 0.284
2.271ProIle: 2.271 ± 0.458
3.258ProLys: 3.258 ± 0.653
2.863ProLeu: 2.863 ± 0.404
0.395ProMet: 0.395 ± 0.153
2.271ProAsn: 2.271 ± 0.584
1.185ProPro: 1.185 ± 0.409
0.987ProGln: 0.987 ± 0.392
1.481ProArg: 1.481 ± 0.353
1.975ProSer: 1.975 ± 0.571
2.666ProThr: 2.666 ± 0.569
1.777ProVal: 1.777 ± 0.343
0.494ProTrp: 0.494 ± 0.192
1.58ProTyr: 1.58 ± 0.425
0.0ProXaa: 0.0 ± 0.0
Gln
3.653GlnAla: 3.653 ± 0.844
0.099GlnCys: 0.099 ± 0.111
1.086GlnAsp: 1.086 ± 0.281
3.456GlnGlu: 3.456 ± 0.579
2.666GlnPhe: 2.666 ± 0.483
2.073GlnGly: 2.073 ± 0.45
0.987GlnHis: 0.987 ± 0.279
3.16GlnIle: 3.16 ± 0.578
2.271GlnLys: 2.271 ± 0.506
3.258GlnLeu: 3.258 ± 0.644
1.382GlnMet: 1.382 ± 0.343
2.765GlnAsn: 2.765 ± 0.51
1.777GlnPro: 1.777 ± 0.503
1.382GlnGln: 1.382 ± 0.319
1.481GlnArg: 1.481 ± 0.398
1.876GlnSer: 1.876 ± 0.442
2.468GlnThr: 2.468 ± 0.533
2.567GlnVal: 2.567 ± 0.495
0.79GlnTrp: 0.79 ± 0.355
0.987GlnTyr: 0.987 ± 0.356
0.0GlnXaa: 0.0 ± 0.0
Arg
1.284ArgAla: 1.284 ± 0.287
0.691ArgCys: 0.691 ± 0.256
2.666ArgAsp: 2.666 ± 0.726
3.258ArgGlu: 3.258 ± 0.597
1.086ArgPhe: 1.086 ± 0.293
2.666ArgGly: 2.666 ± 0.482
0.79ArgHis: 0.79 ± 0.342
3.357ArgIle: 3.357 ± 0.644
3.752ArgLys: 3.752 ± 0.737
3.851ArgLeu: 3.851 ± 0.575
0.987ArgMet: 0.987 ± 0.363
2.468ArgAsn: 2.468 ± 0.449
1.185ArgPro: 1.185 ± 0.379
2.666ArgGln: 2.666 ± 0.52
1.284ArgArg: 1.284 ± 0.374
1.679ArgSer: 1.679 ± 0.474
2.962ArgThr: 2.962 ± 0.571
2.765ArgVal: 2.765 ± 0.564
0.691ArgTrp: 0.691 ± 0.26
1.777ArgTyr: 1.777 ± 0.528
0.0ArgXaa: 0.0 ± 0.0
Ser
3.851SerAla: 3.851 ± 0.882
0.296SerCys: 0.296 ± 0.231
4.641SerAsp: 4.641 ± 0.631
4.641SerGlu: 4.641 ± 0.951
2.468SerPhe: 2.468 ± 0.427
5.628SerGly: 5.628 ± 0.885
1.382SerHis: 1.382 ± 0.311
4.739SerIle: 4.739 ± 0.825
4.937SerLys: 4.937 ± 0.563
6.122SerLeu: 6.122 ± 0.61
0.889SerMet: 0.889 ± 0.333
3.752SerAsn: 3.752 ± 0.521
2.172SerPro: 2.172 ± 0.481
2.765SerGln: 2.765 ± 0.638
2.567SerArg: 2.567 ± 0.433
5.036SerSer: 5.036 ± 0.877
4.838SerThr: 4.838 ± 1.035
4.542SerVal: 4.542 ± 0.69
1.185SerTrp: 1.185 ± 0.252
2.863SerTyr: 2.863 ± 0.516
0.0SerXaa: 0.0 ± 0.0
Thr
4.641ThrAla: 4.641 ± 0.625
0.197ThrCys: 0.197 ± 0.109
3.061ThrAsp: 3.061 ± 0.557
3.653ThrGlu: 3.653 ± 0.561
3.16ThrPhe: 3.16 ± 0.551
4.937ThrGly: 4.937 ± 0.749
0.592ThrHis: 0.592 ± 0.23
5.134ThrIle: 5.134 ± 0.929
5.628ThrLys: 5.628 ± 0.849
5.924ThrLeu: 5.924 ± 0.697
0.987ThrMet: 0.987 ± 0.322
3.851ThrAsn: 3.851 ± 0.847
1.679ThrPro: 1.679 ± 0.409
1.876ThrGln: 1.876 ± 0.386
2.271ThrArg: 2.271 ± 0.416
5.036ThrSer: 5.036 ± 0.881
4.739ThrThr: 4.739 ± 0.854
5.134ThrVal: 5.134 ± 0.729
0.987ThrTrp: 0.987 ± 0.263
1.777ThrTyr: 1.777 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
3.16ValAla: 3.16 ± 0.585
0.395ValCys: 0.395 ± 0.218
4.641ValAsp: 4.641 ± 0.722
3.949ValGlu: 3.949 ± 0.63
1.975ValPhe: 1.975 ± 0.516
3.357ValGly: 3.357 ± 0.847
0.987ValHis: 0.987 ± 0.238
4.443ValIle: 4.443 ± 0.847
4.443ValLys: 4.443 ± 0.628
7.01ValLeu: 7.01 ± 0.571
1.086ValMet: 1.086 ± 0.307
1.975ValAsn: 1.975 ± 0.332
1.58ValPro: 1.58 ± 0.381
2.468ValGln: 2.468 ± 0.517
2.37ValArg: 2.37 ± 0.482
4.147ValSer: 4.147 ± 0.682
3.851ValThr: 3.851 ± 0.492
3.555ValVal: 3.555 ± 0.439
1.185ValTrp: 1.185 ± 0.365
2.073ValTyr: 2.073 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
0.889TrpAla: 0.889 ± 0.317
0.296TrpCys: 0.296 ± 0.177
0.79TrpAsp: 0.79 ± 0.306
1.185TrpGlu: 1.185 ± 0.336
1.086TrpPhe: 1.086 ± 0.429
0.592TrpGly: 0.592 ± 0.214
0.0TrpHis: 0.0 ± 0.0
0.987TrpIle: 0.987 ± 0.261
1.185TrpLys: 1.185 ± 0.397
1.284TrpLeu: 1.284 ± 0.235
0.494TrpMet: 0.494 ± 0.181
1.58TrpAsn: 1.58 ± 0.384
0.099TrpPro: 0.099 ± 0.088
0.79TrpGln: 0.79 ± 0.292
0.691TrpArg: 0.691 ± 0.3
0.79TrpSer: 0.79 ± 0.291
0.691TrpThr: 0.691 ± 0.211
0.592TrpVal: 0.592 ± 0.28
0.395TrpTrp: 0.395 ± 0.224
0.395TrpTyr: 0.395 ± 0.327
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.765TyrAla: 2.765 ± 0.432
0.691TyrCys: 0.691 ± 0.255
2.666TyrAsp: 2.666 ± 0.694
3.061TyrGlu: 3.061 ± 0.331
1.382TyrPhe: 1.382 ± 0.535
2.765TyrGly: 2.765 ± 0.46
1.284TyrHis: 1.284 ± 0.336
2.271TyrIle: 2.271 ± 0.447
2.468TyrLys: 2.468 ± 0.594
2.567TyrLeu: 2.567 ± 0.513
0.296TyrMet: 0.296 ± 0.166
1.876TyrAsn: 1.876 ± 0.363
1.284TyrPro: 1.284 ± 0.326
1.975TyrGln: 1.975 ± 0.547
2.073TyrArg: 2.073 ± 0.498
2.271TyrSer: 2.271 ± 0.359
1.481TyrThr: 1.481 ± 0.442
0.987TyrVal: 0.987 ± 0.331
0.494TyrTrp: 0.494 ± 0.183
1.185TyrTyr: 1.185 ± 0.51
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (10129 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski