Amino acid dipepetide frequency for Streptococcus phage IPP16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.471AlaAla: 2.471 ± 0.622
0.38AlaCys: 0.38 ± 0.159
4.277AlaAsp: 4.277 ± 0.544
5.607AlaGlu: 5.607 ± 0.557
2.186AlaPhe: 2.186 ± 0.464
3.707AlaGly: 3.707 ± 0.717
1.616AlaHis: 1.616 ± 0.443
5.512AlaIle: 5.512 ± 0.632
4.562AlaLys: 4.562 ± 0.565
5.322AlaLeu: 5.322 ± 0.771
1.14AlaMet: 1.14 ± 0.342
2.946AlaAsn: 2.946 ± 0.489
0.855AlaPro: 0.855 ± 0.32
2.471AlaGln: 2.471 ± 0.476
2.756AlaArg: 2.756 ± 0.542
3.516AlaSer: 3.516 ± 0.502
4.182AlaThr: 4.182 ± 0.669
3.516AlaVal: 3.516 ± 0.812
0.57AlaTrp: 0.57 ± 0.249
2.281AlaTyr: 2.281 ± 0.393
0.0AlaXaa: 0.0 ± 0.0
Cys
0.19CysAla: 0.19 ± 0.125
0.0CysCys: 0.0 ± 0.0
0.38CysAsp: 0.38 ± 0.195
0.38CysGlu: 0.38 ± 0.144
0.095CysPhe: 0.095 ± 0.099
0.38CysGly: 0.38 ± 0.203
0.095CysHis: 0.095 ± 0.095
0.095CysIle: 0.095 ± 0.082
0.57CysLys: 0.57 ± 0.229
1.045CysLeu: 1.045 ± 0.374
0.0CysMet: 0.0 ± 0.0
0.285CysAsn: 0.285 ± 0.155
0.19CysPro: 0.19 ± 0.14
0.38CysGln: 0.38 ± 0.206
0.855CysArg: 0.855 ± 0.282
0.38CysSer: 0.38 ± 0.164
0.19CysThr: 0.19 ± 0.125
0.095CysVal: 0.095 ± 0.093
0.095CysTrp: 0.095 ± 0.094
0.475CysTyr: 0.475 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
3.516AspAla: 3.516 ± 0.542
0.38AspCys: 0.38 ± 0.161
4.942AspAsp: 4.942 ± 0.713
5.512AspGlu: 5.512 ± 0.581
4.372AspPhe: 4.372 ± 0.572
5.322AspGly: 5.322 ± 0.882
0.665AspHis: 0.665 ± 0.223
5.417AspIle: 5.417 ± 0.691
4.752AspLys: 4.752 ± 0.758
6.748AspLeu: 6.748 ± 0.689
1.806AspMet: 1.806 ± 0.328
3.611AspAsn: 3.611 ± 0.605
1.426AspPro: 1.426 ± 0.423
0.665AspGln: 0.665 ± 0.24
1.806AspArg: 1.806 ± 0.422
3.136AspSer: 3.136 ± 0.392
2.851AspThr: 2.851 ± 0.575
3.707AspVal: 3.707 ± 0.746
0.475AspTrp: 0.475 ± 0.205
3.041AspTyr: 3.041 ± 0.41
0.0AspXaa: 0.0 ± 0.0
Glu
5.132GluAla: 5.132 ± 0.551
0.0GluCys: 0.0 ± 0.0
3.136GluAsp: 3.136 ± 0.509
6.368GluGlu: 6.368 ± 0.867
2.946GluPhe: 2.946 ± 0.466
4.562GluGly: 4.562 ± 0.761
1.521GluHis: 1.521 ± 0.489
6.463GluIle: 6.463 ± 0.887
6.273GluLys: 6.273 ± 0.844
8.458GluLeu: 8.458 ± 0.858
2.091GluMet: 2.091 ± 0.548
4.467GluAsn: 4.467 ± 0.717
1.616GluPro: 1.616 ± 0.462
3.992GluGln: 3.992 ± 0.548
3.992GluArg: 3.992 ± 0.669
4.277GluSer: 4.277 ± 0.548
5.037GluThr: 5.037 ± 0.677
5.227GluVal: 5.227 ± 0.867
1.045GluTrp: 1.045 ± 0.283
3.041GluTyr: 3.041 ± 0.511
0.0GluXaa: 0.0 ± 0.0
Phe
2.376PheAla: 2.376 ± 0.499
0.285PheCys: 0.285 ± 0.21
2.851PheAsp: 2.851 ± 0.399
3.421PheGlu: 3.421 ± 0.668
3.136PhePhe: 3.136 ± 0.553
2.471PheGly: 2.471 ± 0.473
0.855PheHis: 0.855 ± 0.275
2.471PheIle: 2.471 ± 0.544
3.231PheLys: 3.231 ± 0.395
3.231PheLeu: 3.231 ± 0.667
0.855PheMet: 0.855 ± 0.293
2.946PheAsn: 2.946 ± 0.442
1.045PhePro: 1.045 ± 0.33
1.806PheGln: 1.806 ± 0.402
1.711PheArg: 1.711 ± 0.392
2.186PheSer: 2.186 ± 0.428
1.806PheThr: 1.806 ± 0.341
2.661PheVal: 2.661 ± 0.434
0.665PheTrp: 0.665 ± 0.213
1.045PheTyr: 1.045 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
3.231GlyAla: 3.231 ± 0.929
0.475GlyCys: 0.475 ± 0.211
3.707GlyAsp: 3.707 ± 0.629
3.992GlyGlu: 3.992 ± 0.598
2.091GlyPhe: 2.091 ± 0.375
3.041GlyGly: 3.041 ± 0.622
1.045GlyHis: 1.045 ± 0.226
4.752GlyIle: 4.752 ± 0.764
4.942GlyLys: 4.942 ± 0.871
5.417GlyLeu: 5.417 ± 0.781
2.091GlyMet: 2.091 ± 0.542
3.611GlyAsn: 3.611 ± 0.577
0.57GlyPro: 0.57 ± 0.253
2.091GlyGln: 2.091 ± 0.437
1.996GlyArg: 1.996 ± 0.464
3.516GlySer: 3.516 ± 0.687
3.611GlyThr: 3.611 ± 0.533
3.516GlyVal: 3.516 ± 0.568
1.426GlyTrp: 1.426 ± 0.383
3.326GlyTyr: 3.326 ± 0.559
0.0GlyXaa: 0.0 ± 0.0
His
0.95HisAla: 0.95 ± 0.262
0.095HisCys: 0.095 ± 0.091
1.236HisAsp: 1.236 ± 0.327
1.14HisGlu: 1.14 ± 0.33
1.045HisPhe: 1.045 ± 0.35
1.14HisGly: 1.14 ± 0.374
0.38HisHis: 0.38 ± 0.265
1.426HisIle: 1.426 ± 0.337
1.331HisLys: 1.331 ± 0.389
1.045HisLeu: 1.045 ± 0.312
0.475HisMet: 0.475 ± 0.168
0.665HisAsn: 0.665 ± 0.262
0.76HisPro: 0.76 ± 0.264
0.57HisGln: 0.57 ± 0.187
0.475HisArg: 0.475 ± 0.254
0.76HisSer: 0.76 ± 0.249
0.95HisThr: 0.95 ± 0.221
0.76HisVal: 0.76 ± 0.266
0.19HisTrp: 0.19 ± 0.123
0.855HisTyr: 0.855 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
4.942IleAla: 4.942 ± 0.572
0.19IleCys: 0.19 ± 0.126
5.892IleAsp: 5.892 ± 0.76
7.033IleGlu: 7.033 ± 0.69
1.996IlePhe: 1.996 ± 0.393
3.992IleGly: 3.992 ± 0.659
1.331IleHis: 1.331 ± 0.373
4.562IleIle: 4.562 ± 0.676
6.843IleLys: 6.843 ± 0.832
6.748IleLeu: 6.748 ± 0.807
2.091IleMet: 2.091 ± 0.376
4.372IleAsn: 4.372 ± 0.697
2.566IlePro: 2.566 ± 0.596
2.376IleGln: 2.376 ± 0.56
2.946IleArg: 2.946 ± 0.527
5.987IleSer: 5.987 ± 0.772
3.992IleThr: 3.992 ± 0.632
3.421IleVal: 3.421 ± 0.52
0.855IleTrp: 0.855 ± 0.389
2.851IleTyr: 2.851 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
6.843LysAla: 6.843 ± 0.775
0.475LysCys: 0.475 ± 0.19
4.942LysAsp: 4.942 ± 0.781
7.983LysGlu: 7.983 ± 0.896
2.376LysPhe: 2.376 ± 0.424
4.657LysGly: 4.657 ± 0.646
1.236LysHis: 1.236 ± 0.315
6.653LysIle: 6.653 ± 0.808
7.983LysLys: 7.983 ± 1.378
9.219LysLeu: 9.219 ± 0.864
1.616LysMet: 1.616 ± 0.377
4.277LysAsn: 4.277 ± 0.459
2.471LysPro: 2.471 ± 0.413
3.992LysGln: 3.992 ± 0.658
4.277LysArg: 4.277 ± 0.821
5.512LysSer: 5.512 ± 0.768
5.702LysThr: 5.702 ± 1.015
4.562LysVal: 4.562 ± 0.476
1.901LysTrp: 1.901 ± 0.421
2.471LysTyr: 2.471 ± 0.407
0.0LysXaa: 0.0 ± 0.0
Leu
5.607LeuAla: 5.607 ± 0.612
0.95LeuCys: 0.95 ± 0.365
8.554LeuAsp: 8.554 ± 0.986
8.173LeuGlu: 8.173 ± 0.964
3.326LeuPhe: 3.326 ± 0.688
4.847LeuGly: 4.847 ± 0.718
0.76LeuHis: 0.76 ± 0.256
5.892LeuIle: 5.892 ± 0.679
9.124LeuLys: 9.124 ± 1.209
8.744LeuLeu: 8.744 ± 0.885
1.806LeuMet: 1.806 ± 0.445
4.752LeuAsn: 4.752 ± 0.666
2.661LeuPro: 2.661 ± 0.505
3.136LeuGln: 3.136 ± 0.498
3.326LeuArg: 3.326 ± 0.625
7.033LeuSer: 7.033 ± 0.756
5.037LeuThr: 5.037 ± 0.647
4.562LeuVal: 4.562 ± 0.599
1.14LeuTrp: 1.14 ± 0.263
2.946LeuTyr: 2.946 ± 0.668
0.0LeuXaa: 0.0 ± 0.0
Met
1.806MetAla: 1.806 ± 0.345
0.095MetCys: 0.095 ± 0.111
1.426MetAsp: 1.426 ± 0.474
1.521MetGlu: 1.521 ± 0.335
0.57MetPhe: 0.57 ± 0.198
1.236MetGly: 1.236 ± 0.304
0.475MetHis: 0.475 ± 0.198
1.616MetIle: 1.616 ± 0.379
2.091MetLys: 2.091 ± 0.578
1.806MetLeu: 1.806 ± 0.374
0.38MetMet: 0.38 ± 0.186
1.426MetAsn: 1.426 ± 0.341
0.57MetPro: 0.57 ± 0.202
1.14MetGln: 1.14 ± 0.296
1.14MetArg: 1.14 ± 0.359
1.996MetSer: 1.996 ± 0.535
1.901MetThr: 1.901 ± 0.376
1.521MetVal: 1.521 ± 0.35
0.285MetTrp: 0.285 ± 0.14
0.665MetTyr: 0.665 ± 0.302
0.0MetXaa: 0.0 ± 0.0
Asn
2.756AsnAla: 2.756 ± 0.58
0.57AsnCys: 0.57 ± 0.278
3.421AsnAsp: 3.421 ± 0.564
5.132AsnGlu: 5.132 ± 0.688
2.186AsnPhe: 2.186 ± 0.429
4.847AsnGly: 4.847 ± 0.643
0.95AsnHis: 0.95 ± 0.33
4.657AsnIle: 4.657 ± 0.57
4.372AsnLys: 4.372 ± 0.591
5.322AsnLeu: 5.322 ± 0.56
1.045AsnMet: 1.045 ± 0.295
3.611AsnAsn: 3.611 ± 0.483
2.661AsnPro: 2.661 ± 0.477
1.426AsnGln: 1.426 ± 0.311
2.281AsnArg: 2.281 ± 0.553
3.611AsnSer: 3.611 ± 0.456
3.421AsnThr: 3.421 ± 0.438
3.421AsnVal: 3.421 ± 0.576
0.57AsnTrp: 0.57 ± 0.234
2.281AsnTyr: 2.281 ± 0.403
0.0AsnXaa: 0.0 ± 0.0
Pro
1.426ProAla: 1.426 ± 0.311
0.285ProCys: 0.285 ± 0.154
2.471ProAsp: 2.471 ± 0.459
2.376ProGlu: 2.376 ± 0.439
1.14ProPhe: 1.14 ± 0.356
1.426ProGly: 1.426 ± 0.377
0.475ProHis: 0.475 ± 0.236
2.756ProIle: 2.756 ± 0.501
3.041ProLys: 3.041 ± 0.535
0.95ProLeu: 0.95 ± 0.23
0.095ProMet: 0.095 ± 0.094
1.901ProAsn: 1.901 ± 0.479
1.045ProPro: 1.045 ± 0.326
0.855ProGln: 0.855 ± 0.255
0.665ProArg: 0.665 ± 0.223
1.426ProSer: 1.426 ± 0.438
1.14ProThr: 1.14 ± 0.339
1.711ProVal: 1.711 ± 0.471
0.0ProTrp: 0.0 ± 0.0
1.045ProTyr: 1.045 ± 0.262
0.0ProXaa: 0.0 ± 0.0
Gln
2.851GlnAla: 2.851 ± 0.433
0.38GlnCys: 0.38 ± 0.214
1.711GlnAsp: 1.711 ± 0.381
2.186GlnGlu: 2.186 ± 0.478
1.521GlnPhe: 1.521 ± 0.371
1.521GlnGly: 1.521 ± 0.279
0.38GlnHis: 0.38 ± 0.179
2.471GlnIle: 2.471 ± 0.412
3.707GlnLys: 3.707 ± 0.578
3.516GlnLeu: 3.516 ± 0.545
0.855GlnMet: 0.855 ± 0.285
2.471GlnAsn: 2.471 ± 0.396
0.57GlnPro: 0.57 ± 0.219
1.331GlnGln: 1.331 ± 0.394
1.806GlnArg: 1.806 ± 0.351
2.566GlnSer: 2.566 ± 0.483
2.091GlnThr: 2.091 ± 0.357
2.471GlnVal: 2.471 ± 0.522
0.285GlnTrp: 0.285 ± 0.171
1.426GlnTyr: 1.426 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
1.806ArgAla: 1.806 ± 0.397
0.57ArgCys: 0.57 ± 0.216
2.091ArgAsp: 2.091 ± 0.426
2.376ArgGlu: 2.376 ± 0.51
1.806ArgPhe: 1.806 ± 0.397
1.806ArgGly: 1.806 ± 0.437
0.76ArgHis: 0.76 ± 0.239
2.376ArgIle: 2.376 ± 0.467
4.372ArgLys: 4.372 ± 0.738
5.132ArgLeu: 5.132 ± 0.891
1.426ArgMet: 1.426 ± 0.35
2.186ArgAsn: 2.186 ± 0.59
0.855ArgPro: 0.855 ± 0.292
1.806ArgGln: 1.806 ± 0.495
2.091ArgArg: 2.091 ± 0.53
1.996ArgSer: 1.996 ± 0.372
3.136ArgThr: 3.136 ± 0.593
2.946ArgVal: 2.946 ± 0.507
0.665ArgTrp: 0.665 ± 0.344
1.616ArgTyr: 1.616 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
3.516SerAla: 3.516 ± 0.646
0.0SerCys: 0.0 ± 0.0
4.277SerAsp: 4.277 ± 0.605
5.227SerGlu: 5.227 ± 0.926
2.471SerPhe: 2.471 ± 0.542
3.041SerGly: 3.041 ± 0.611
0.95SerHis: 0.95 ± 0.26
4.847SerIle: 4.847 ± 0.596
6.938SerLys: 6.938 ± 1.04
6.368SerLeu: 6.368 ± 0.792
2.091SerMet: 2.091 ± 0.492
3.136SerAsn: 3.136 ± 0.443
1.236SerPro: 1.236 ± 0.309
1.901SerGln: 1.901 ± 0.39
2.566SerArg: 2.566 ± 0.405
3.897SerSer: 3.897 ± 0.834
3.611SerThr: 3.611 ± 0.49
3.421SerVal: 3.421 ± 0.587
0.475SerTrp: 0.475 ± 0.234
2.281SerTyr: 2.281 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
4.087ThrAla: 4.087 ± 0.652
0.38ThrCys: 0.38 ± 0.181
3.041ThrAsp: 3.041 ± 0.418
3.611ThrGlu: 3.611 ± 0.782
2.091ThrPhe: 2.091 ± 0.473
4.752ThrGly: 4.752 ± 0.734
0.855ThrHis: 0.855 ± 0.329
5.322ThrIle: 5.322 ± 0.542
5.417ThrLys: 5.417 ± 0.815
4.942ThrLeu: 4.942 ± 0.579
1.331ThrMet: 1.331 ± 0.307
3.802ThrAsn: 3.802 ± 0.53
1.616ThrPro: 1.616 ± 0.377
1.996ThrGln: 1.996 ± 0.462
1.806ThrArg: 1.806 ± 0.402
3.231ThrSer: 3.231 ± 0.51
3.992ThrThr: 3.992 ± 0.753
4.372ThrVal: 4.372 ± 0.788
0.38ThrTrp: 0.38 ± 0.185
1.901ThrTyr: 1.901 ± 0.46
0.0ThrXaa: 0.0 ± 0.0
Val
3.326ValAla: 3.326 ± 0.654
0.19ValCys: 0.19 ± 0.112
3.421ValAsp: 3.421 ± 0.528
4.182ValGlu: 4.182 ± 0.663
2.661ValPhe: 2.661 ± 0.323
3.326ValGly: 3.326 ± 0.712
0.38ValHis: 0.38 ± 0.15
4.847ValIle: 4.847 ± 0.617
5.132ValLys: 5.132 ± 0.679
3.041ValLeu: 3.041 ± 0.423
1.521ValMet: 1.521 ± 0.392
4.657ValAsn: 4.657 ± 0.768
1.616ValPro: 1.616 ± 0.378
2.186ValGln: 2.186 ± 0.43
2.281ValArg: 2.281 ± 0.396
4.182ValSer: 4.182 ± 0.674
4.182ValThr: 4.182 ± 0.511
3.992ValVal: 3.992 ± 0.736
0.665ValTrp: 0.665 ± 0.208
2.471ValTyr: 2.471 ± 0.552
0.0ValXaa: 0.0 ± 0.0
Trp
0.57TrpAla: 0.57 ± 0.241
0.0TrpCys: 0.0 ± 0.0
0.38TrpAsp: 0.38 ± 0.202
0.855TrpGlu: 0.855 ± 0.233
0.57TrpPhe: 0.57 ± 0.229
0.19TrpGly: 0.19 ± 0.136
0.285TrpHis: 0.285 ± 0.181
0.475TrpIle: 0.475 ± 0.202
1.045TrpLys: 1.045 ± 0.34
1.236TrpLeu: 1.236 ± 0.393
0.38TrpMet: 0.38 ± 0.173
1.236TrpAsn: 1.236 ± 0.384
0.285TrpPro: 0.285 ± 0.173
0.57TrpGln: 0.57 ± 0.205
0.76TrpArg: 0.76 ± 0.281
0.665TrpSer: 0.665 ± 0.337
1.045TrpThr: 1.045 ± 0.253
0.665TrpVal: 0.665 ± 0.217
0.285TrpTrp: 0.285 ± 0.187
0.95TrpTyr: 0.95 ± 0.416
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.471TyrAla: 2.471 ± 0.406
0.475TyrCys: 0.475 ± 0.192
1.996TyrAsp: 1.996 ± 0.524
2.566TyrGlu: 2.566 ± 0.5
2.471TyrPhe: 2.471 ± 0.447
2.186TyrGly: 2.186 ± 0.517
1.236TyrHis: 1.236 ± 0.424
2.471TyrIle: 2.471 ± 0.492
3.421TyrLys: 3.421 ± 0.455
4.087TyrLeu: 4.087 ± 0.707
0.475TyrMet: 0.475 ± 0.23
2.186TyrAsn: 2.186 ± 0.382
1.616TyrPro: 1.616 ± 0.38
1.426TyrGln: 1.426 ± 0.331
2.281TyrArg: 2.281 ± 0.526
2.376TyrSer: 2.376 ± 0.497
1.045TyrThr: 1.045 ± 0.279
1.711TyrVal: 1.711 ± 0.379
0.475TyrTrp: 0.475 ± 0.188
2.661TyrTyr: 2.661 ± 0.639
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10523 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski