Amino acid dipepetide frequency for Streptococcus phage A25

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.828AlaAla: 5.828 ± 1.874
0.564AlaCys: 0.564 ± 0.237
3.76AlaAsp: 3.76 ± 0.658
5.076AlaGlu: 5.076 ± 0.876
3.196AlaPhe: 3.196 ± 0.761
7.05AlaGly: 7.05 ± 1.754
1.034AlaHis: 1.034 ± 0.324
5.734AlaIle: 5.734 ± 1.294
6.392AlaLys: 6.392 ± 0.79
7.238AlaLeu: 7.238 ± 1.491
3.102AlaMet: 3.102 ± 0.976
3.29AlaAsn: 3.29 ± 0.568
1.88AlaPro: 1.88 ± 0.339
2.914AlaGln: 2.914 ± 0.95
2.914AlaArg: 2.914 ± 0.517
5.076AlaSer: 5.076 ± 1.075
4.136AlaThr: 4.136 ± 0.759
5.358AlaVal: 5.358 ± 1.267
0.376AlaTrp: 0.376 ± 0.173
2.632AlaTyr: 2.632 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.188CysAla: 0.188 ± 0.118
0.094CysCys: 0.094 ± 0.087
0.376CysAsp: 0.376 ± 0.16
0.564CysGlu: 0.564 ± 0.203
0.282CysPhe: 0.282 ± 0.161
0.282CysGly: 0.282 ± 0.174
0.094CysHis: 0.094 ± 0.087
0.188CysIle: 0.188 ± 0.151
0.376CysLys: 0.376 ± 0.212
0.47CysLeu: 0.47 ± 0.189
0.094CysMet: 0.094 ± 0.098
0.188CysAsn: 0.188 ± 0.121
0.188CysPro: 0.188 ± 0.144
0.188CysGln: 0.188 ± 0.136
0.282CysArg: 0.282 ± 0.173
0.282CysSer: 0.282 ± 0.17
0.188CysThr: 0.188 ± 0.122
0.188CysVal: 0.188 ± 0.12
0.0CysTrp: 0.0 ± 0.0
0.282CysTyr: 0.282 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
4.23AspAla: 4.23 ± 0.61
0.282AspCys: 0.282 ± 0.136
5.734AspAsp: 5.734 ± 0.821
4.7AspGlu: 4.7 ± 0.785
2.82AspPhe: 2.82 ± 0.516
6.486AspGly: 6.486 ± 0.857
0.846AspHis: 0.846 ± 0.252
3.572AspIle: 3.572 ± 0.65
5.264AspLys: 5.264 ± 0.809
4.606AspLeu: 4.606 ± 0.691
1.316AspMet: 1.316 ± 0.362
3.478AspAsn: 3.478 ± 0.503
0.846AspPro: 0.846 ± 0.354
1.316AspGln: 1.316 ± 0.354
2.538AspArg: 2.538 ± 0.441
3.572AspSer: 3.572 ± 0.476
4.23AspThr: 4.23 ± 0.734
3.854AspVal: 3.854 ± 0.646
0.658AspTrp: 0.658 ± 0.309
3.854AspTyr: 3.854 ± 0.782
0.0AspXaa: 0.0 ± 0.0
Glu
3.572GluAla: 3.572 ± 0.585
0.188GluCys: 0.188 ± 0.137
2.82GluAsp: 2.82 ± 0.569
4.136GluGlu: 4.136 ± 0.627
2.726GluPhe: 2.726 ± 0.459
2.068GluGly: 2.068 ± 0.427
0.94GluHis: 0.94 ± 0.355
4.136GluIle: 4.136 ± 0.749
4.606GluLys: 4.606 ± 0.613
6.392GluLeu: 6.392 ± 1.067
2.726GluMet: 2.726 ± 0.559
4.136GluAsn: 4.136 ± 0.599
1.504GluPro: 1.504 ± 0.574
3.384GluGln: 3.384 ± 0.488
3.572GluArg: 3.572 ± 0.567
2.162GluSer: 2.162 ± 0.545
4.136GluThr: 4.136 ± 0.778
4.324GluVal: 4.324 ± 0.616
1.128GluTrp: 1.128 ± 0.304
2.726GluTyr: 2.726 ± 0.534
0.0GluXaa: 0.0 ± 0.0
Phe
2.538PheAla: 2.538 ± 0.401
0.188PheCys: 0.188 ± 0.134
4.7PheAsp: 4.7 ± 0.546
2.538PheGlu: 2.538 ± 0.48
1.128PhePhe: 1.128 ± 0.315
2.35PheGly: 2.35 ± 0.49
0.564PheHis: 0.564 ± 0.228
2.256PheIle: 2.256 ± 0.381
4.418PheLys: 4.418 ± 0.648
2.914PheLeu: 2.914 ± 0.564
0.94PheMet: 0.94 ± 0.267
2.632PheAsn: 2.632 ± 0.598
0.658PhePro: 0.658 ± 0.292
1.41PheGln: 1.41 ± 0.411
1.222PheArg: 1.222 ± 0.339
3.008PheSer: 3.008 ± 0.411
3.666PheThr: 3.666 ± 0.741
2.538PheVal: 2.538 ± 0.527
0.376PheTrp: 0.376 ± 0.157
1.41PheTyr: 1.41 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
5.734GlyAla: 5.734 ± 2.168
0.376GlyCys: 0.376 ± 0.201
3.572GlyAsp: 3.572 ± 0.509
2.444GlyGlu: 2.444 ± 0.467
4.23GlyPhe: 4.23 ± 0.802
4.418GlyGly: 4.418 ± 0.6
0.94GlyHis: 0.94 ± 0.259
6.392GlyIle: 6.392 ± 1.006
6.298GlyLys: 6.298 ± 0.879
6.204GlyLeu: 6.204 ± 1.182
2.162GlyMet: 2.162 ± 0.393
4.512GlyAsn: 4.512 ± 0.869
0.188GlyPro: 0.188 ± 0.113
2.35GlyGln: 2.35 ± 0.365
2.538GlyArg: 2.538 ± 0.5
4.418GlySer: 4.418 ± 1.169
4.7GlyThr: 4.7 ± 0.734
5.828GlyVal: 5.828 ± 0.88
1.034GlyTrp: 1.034 ± 0.3
2.068GlyTyr: 2.068 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
0.564HisAla: 0.564 ± 0.211
0.188HisCys: 0.188 ± 0.134
0.752HisAsp: 0.752 ± 0.229
0.752HisGlu: 0.752 ± 0.244
0.846HisPhe: 0.846 ± 0.278
0.846HisGly: 0.846 ± 0.291
0.47HisHis: 0.47 ± 0.232
0.94HisIle: 0.94 ± 0.277
1.316HisLys: 1.316 ± 0.375
1.222HisLeu: 1.222 ± 0.306
0.47HisMet: 0.47 ± 0.263
1.128HisAsn: 1.128 ± 0.363
0.376HisPro: 0.376 ± 0.281
0.564HisGln: 0.564 ± 0.233
1.034HisArg: 1.034 ± 0.354
0.658HisSer: 0.658 ± 0.237
0.658HisThr: 0.658 ± 0.218
0.846HisVal: 0.846 ± 0.199
0.094HisTrp: 0.094 ± 0.105
1.128HisTyr: 1.128 ± 0.421
0.0HisXaa: 0.0 ± 0.0
Ile
5.828IleAla: 5.828 ± 1.225
0.188IleCys: 0.188 ± 0.116
5.734IleAsp: 5.734 ± 0.806
4.7IleGlu: 4.7 ± 0.635
1.974IlePhe: 1.974 ± 0.429
5.17IleGly: 5.17 ± 0.881
1.222IleHis: 1.222 ± 0.274
3.478IleIle: 3.478 ± 0.676
6.298IleLys: 6.298 ± 0.77
3.854IleLeu: 3.854 ± 0.589
1.316IleMet: 1.316 ± 0.276
4.7IleAsn: 4.7 ± 0.812
2.256IlePro: 2.256 ± 0.714
2.256IleGln: 2.256 ± 0.429
1.692IleArg: 1.692 ± 0.406
4.324IleSer: 4.324 ± 0.64
4.794IleThr: 4.794 ± 0.774
3.478IleVal: 3.478 ± 0.527
1.034IleTrp: 1.034 ± 0.261
3.948IleTyr: 3.948 ± 0.501
0.0IleXaa: 0.0 ± 0.0
Lys
7.708LysAla: 7.708 ± 1.062
0.47LysCys: 0.47 ± 0.209
4.042LysAsp: 4.042 ± 0.619
5.64LysGlu: 5.64 ± 1.095
2.632LysPhe: 2.632 ± 0.433
5.076LysGly: 5.076 ± 0.775
1.692LysHis: 1.692 ± 0.403
4.794LysIle: 4.794 ± 0.804
7.332LysLys: 7.332 ± 1.111
4.888LysLeu: 4.888 ± 0.734
2.444LysMet: 2.444 ± 0.587
4.888LysAsn: 4.888 ± 0.709
3.196LysPro: 3.196 ± 0.745
2.632LysGln: 2.632 ± 0.656
4.418LysArg: 4.418 ± 0.807
6.486LysSer: 6.486 ± 0.732
6.016LysThr: 6.016 ± 0.722
5.264LysVal: 5.264 ± 0.755
0.846LysTrp: 0.846 ± 0.305
1.88LysTyr: 1.88 ± 0.4
0.0LysXaa: 0.0 ± 0.0
Leu
7.144LeuAla: 7.144 ± 1.101
0.47LeuCys: 0.47 ± 0.17
6.392LeuAsp: 6.392 ± 0.855
4.606LeuGlu: 4.606 ± 0.796
2.35LeuPhe: 2.35 ± 0.424
5.452LeuGly: 5.452 ± 1.085
0.752LeuHis: 0.752 ± 0.338
3.948LeuIle: 3.948 ± 0.537
7.144LeuLys: 7.144 ± 0.895
5.358LeuLeu: 5.358 ± 0.69
1.316LeuMet: 1.316 ± 0.358
4.794LeuAsn: 4.794 ± 0.719
3.196LeuPro: 3.196 ± 0.594
3.572LeuGln: 3.572 ± 0.556
3.102LeuArg: 3.102 ± 0.769
6.486LeuSer: 6.486 ± 0.856
4.982LeuThr: 4.982 ± 0.649
4.324LeuVal: 4.324 ± 0.682
0.752LeuTrp: 0.752 ± 0.358
2.162LeuTyr: 2.162 ± 0.462
0.0LeuXaa: 0.0 ± 0.0
Met
2.632MetAla: 2.632 ± 0.562
0.094MetCys: 0.094 ± 0.105
1.504MetAsp: 1.504 ± 0.428
1.034MetGlu: 1.034 ± 0.322
1.222MetPhe: 1.222 ± 0.315
1.598MetGly: 1.598 ± 0.387
0.282MetHis: 0.282 ± 0.209
1.88MetIle: 1.88 ± 0.36
1.786MetLys: 1.786 ± 0.317
2.256MetLeu: 2.256 ± 0.309
0.564MetMet: 0.564 ± 0.325
1.504MetAsn: 1.504 ± 0.333
0.94MetPro: 0.94 ± 0.213
1.692MetGln: 1.692 ± 0.374
1.41MetArg: 1.41 ± 0.326
1.504MetSer: 1.504 ± 0.3
2.444MetThr: 2.444 ± 0.379
1.504MetVal: 1.504 ± 0.439
0.282MetTrp: 0.282 ± 0.168
0.846MetTyr: 0.846 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
4.606AsnAla: 4.606 ± 0.674
0.188AsnCys: 0.188 ± 0.128
3.478AsnAsp: 3.478 ± 0.629
3.572AsnGlu: 3.572 ± 0.589
1.786AsnPhe: 1.786 ± 0.457
5.076AsnGly: 5.076 ± 1.061
0.752AsnHis: 0.752 ± 0.244
3.384AsnIle: 3.384 ± 0.65
4.23AsnLys: 4.23 ± 0.746
4.7AsnLeu: 4.7 ± 0.774
1.222AsnMet: 1.222 ± 0.284
4.042AsnAsn: 4.042 ± 0.686
2.444AsnPro: 2.444 ± 0.469
2.162AsnGln: 2.162 ± 0.42
2.068AsnArg: 2.068 ± 0.418
3.196AsnSer: 3.196 ± 0.765
3.102AsnThr: 3.102 ± 0.774
3.384AsnVal: 3.384 ± 0.504
0.658AsnTrp: 0.658 ± 0.263
1.88AsnTyr: 1.88 ± 0.474
0.0AsnXaa: 0.0 ± 0.0
Pro
1.88ProAla: 1.88 ± 0.426
0.094ProCys: 0.094 ± 0.089
2.35ProAsp: 2.35 ± 0.552
1.88ProGlu: 1.88 ± 0.487
1.41ProPhe: 1.41 ± 0.328
1.222ProGly: 1.222 ± 0.456
0.47ProHis: 0.47 ± 0.166
1.974ProIle: 1.974 ± 0.525
1.786ProLys: 1.786 ± 0.513
2.35ProLeu: 2.35 ± 0.509
0.188ProMet: 0.188 ± 0.134
1.598ProAsn: 1.598 ± 0.498
0.94ProPro: 0.94 ± 0.292
0.658ProGln: 0.658 ± 0.251
0.564ProArg: 0.564 ± 0.185
2.444ProSer: 2.444 ± 0.564
2.162ProThr: 2.162 ± 0.401
2.068ProVal: 2.068 ± 0.397
0.0ProTrp: 0.0 ± 0.0
1.504ProTyr: 1.504 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
4.136GlnAla: 4.136 ± 0.873
0.0GlnCys: 0.0 ± 0.0
1.41GlnAsp: 1.41 ± 0.306
1.974GlnGlu: 1.974 ± 0.481
1.316GlnPhe: 1.316 ± 0.32
3.29GlnGly: 3.29 ± 0.881
0.376GlnHis: 0.376 ± 0.186
2.256GlnIle: 2.256 ± 0.533
3.008GlnLys: 3.008 ± 0.404
3.854GlnLeu: 3.854 ± 0.515
1.598GlnMet: 1.598 ± 0.4
1.598GlnAsn: 1.598 ± 0.407
0.846GlnPro: 0.846 ± 0.261
1.504GlnGln: 1.504 ± 0.441
1.504GlnArg: 1.504 ± 0.358
3.478GlnSer: 3.478 ± 0.809
1.692GlnThr: 1.692 ± 0.426
2.444GlnVal: 2.444 ± 0.515
0.564GlnTrp: 0.564 ± 0.168
1.692GlnTyr: 1.692 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
2.632ArgAla: 2.632 ± 0.775
0.376ArgCys: 0.376 ± 0.259
1.504ArgAsp: 1.504 ± 0.357
1.974ArgGlu: 1.974 ± 0.511
1.786ArgPhe: 1.786 ± 0.348
2.538ArgGly: 2.538 ± 0.426
0.658ArgHis: 0.658 ± 0.265
3.008ArgIle: 3.008 ± 0.553
3.948ArgLys: 3.948 ± 0.769
3.666ArgLeu: 3.666 ± 0.551
0.658ArgMet: 0.658 ± 0.218
1.786ArgAsn: 1.786 ± 0.322
1.598ArgPro: 1.598 ± 0.336
1.504ArgGln: 1.504 ± 0.44
1.786ArgArg: 1.786 ± 0.4
2.068ArgSer: 2.068 ± 0.494
2.068ArgThr: 2.068 ± 0.522
3.196ArgVal: 3.196 ± 0.573
0.846ArgTrp: 0.846 ± 0.41
1.598ArgTyr: 1.598 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
5.734SerAla: 5.734 ± 1.119
0.282SerCys: 0.282 ± 0.162
4.512SerAsp: 4.512 ± 0.7
3.478SerGlu: 3.478 ± 0.524
2.726SerPhe: 2.726 ± 0.556
4.794SerGly: 4.794 ± 1.109
0.846SerHis: 0.846 ± 0.298
4.324SerIle: 4.324 ± 0.636
3.572SerLys: 3.572 ± 0.499
4.982SerLeu: 4.982 ± 0.452
2.632SerMet: 2.632 ± 0.462
2.914SerAsn: 2.914 ± 0.52
1.88SerPro: 1.88 ± 0.52
3.008SerGln: 3.008 ± 0.714
1.41SerArg: 1.41 ± 0.345
4.7SerSer: 4.7 ± 0.928
4.982SerThr: 4.982 ± 0.566
4.982SerVal: 4.982 ± 0.653
0.846SerTrp: 0.846 ± 0.257
2.444SerTyr: 2.444 ± 0.712
0.0SerXaa: 0.0 ± 0.0
Thr
4.324ThrAla: 4.324 ± 0.765
0.094ThrCys: 0.094 ± 0.087
3.384ThrAsp: 3.384 ± 0.592
4.7ThrGlu: 4.7 ± 0.761
3.76ThrPhe: 3.76 ± 0.481
4.418ThrGly: 4.418 ± 0.568
0.94ThrHis: 0.94 ± 0.281
6.862ThrIle: 6.862 ± 0.798
6.016ThrLys: 6.016 ± 0.639
4.418ThrLeu: 4.418 ± 0.668
1.316ThrMet: 1.316 ± 0.289
2.162ThrAsn: 2.162 ± 0.627
2.256ThrPro: 2.256 ± 0.362
2.82ThrGln: 2.82 ± 0.473
1.786ThrArg: 1.786 ± 0.375
3.854ThrSer: 3.854 ± 0.752
4.418ThrThr: 4.418 ± 0.769
5.17ThrVal: 5.17 ± 0.714
0.94ThrTrp: 0.94 ± 0.292
2.538ThrTyr: 2.538 ± 0.465
0.0ThrXaa: 0.0 ± 0.0
Val
5.922ValAla: 5.922 ± 1.431
0.376ValCys: 0.376 ± 0.244
4.606ValAsp: 4.606 ± 0.554
5.17ValGlu: 5.17 ± 0.743
2.632ValPhe: 2.632 ± 0.437
5.264ValGly: 5.264 ± 1.012
0.752ValHis: 0.752 ± 0.288
5.64ValIle: 5.64 ± 0.623
4.794ValLys: 4.794 ± 0.688
4.136ValLeu: 4.136 ± 0.577
1.128ValMet: 1.128 ± 0.366
3.29ValAsn: 3.29 ± 0.444
1.222ValPro: 1.222 ± 0.321
1.786ValGln: 1.786 ± 0.372
2.538ValArg: 2.538 ± 0.402
3.948ValSer: 3.948 ± 0.428
4.606ValThr: 4.606 ± 0.755
3.948ValVal: 3.948 ± 0.811
0.564ValTrp: 0.564 ± 0.238
2.726ValTyr: 2.726 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
0.188TrpAla: 0.188 ± 0.134
0.094TrpCys: 0.094 ± 0.099
0.564TrpAsp: 0.564 ± 0.327
0.752TrpGlu: 0.752 ± 0.23
0.658TrpPhe: 0.658 ± 0.213
0.94TrpGly: 0.94 ± 0.344
0.47TrpHis: 0.47 ± 0.193
0.564TrpIle: 0.564 ± 0.213
1.316TrpLys: 1.316 ± 0.356
1.316TrpLeu: 1.316 ± 0.364
0.376TrpMet: 0.376 ± 0.219
0.47TrpAsn: 0.47 ± 0.158
0.0TrpPro: 0.0 ± 0.0
0.752TrpGln: 0.752 ± 0.283
0.47TrpArg: 0.47 ± 0.2
1.034TrpSer: 1.034 ± 0.358
0.846TrpThr: 0.846 ± 0.299
0.188TrpVal: 0.188 ± 0.12
0.188TrpTrp: 0.188 ± 0.204
0.658TrpTyr: 0.658 ± 0.351
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.162TyrAla: 2.162 ± 0.531
0.188TyrCys: 0.188 ± 0.159
3.008TyrAsp: 3.008 ± 0.6
1.692TyrGlu: 1.692 ± 0.482
1.974TyrPhe: 1.974 ± 0.568
2.068TyrGly: 2.068 ± 0.369
0.752TyrHis: 0.752 ± 0.245
2.914TyrIle: 2.914 ± 0.607
2.444TyrLys: 2.444 ± 0.422
3.478TyrLeu: 3.478 ± 0.613
1.316TyrMet: 1.316 ± 0.322
2.82TyrAsn: 2.82 ± 0.548
1.128TyrPro: 1.128 ± 0.276
2.068TyrGln: 2.068 ± 0.474
2.35TyrArg: 2.35 ± 0.497
2.35TyrSer: 2.35 ± 0.459
2.35TyrThr: 2.35 ± 0.423
2.256TyrVal: 2.256 ± 0.388
0.658TyrTrp: 0.658 ± 0.218
2.162TyrTyr: 2.162 ± 0.461
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10639 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski