Amino acid dipepetide frequency for Streptococcus phage Javan584

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.415AlaAla: 3.415 ± 0.592
0.0AlaCys: 0.0 ± 0.0
4.853AlaAsp: 4.853 ± 0.857
6.561AlaGlu: 6.561 ± 1.013
1.797AlaPhe: 1.797 ± 0.485
4.044AlaGly: 4.044 ± 0.764
0.629AlaHis: 0.629 ± 0.236
4.763AlaIle: 4.763 ± 0.764
5.482AlaLys: 5.482 ± 0.673
5.123AlaLeu: 5.123 ± 0.922
2.247AlaMet: 2.247 ± 0.503
4.314AlaAsn: 4.314 ± 0.465
2.067AlaPro: 2.067 ± 0.573
2.247AlaGln: 2.247 ± 0.625
3.595AlaArg: 3.595 ± 0.567
5.572AlaSer: 5.572 ± 0.829
4.583AlaThr: 4.583 ± 0.682
4.673AlaVal: 4.673 ± 0.728
0.719AlaTrp: 0.719 ± 0.231
2.966AlaTyr: 2.966 ± 0.694
0.0AlaXaa: 0.0 ± 0.0
Cys
0.18CysAla: 0.18 ± 0.122
0.0CysCys: 0.0 ± 0.0
0.27CysAsp: 0.27 ± 0.187
0.809CysGlu: 0.809 ± 0.268
0.0CysPhe: 0.0 ± 0.0
0.27CysGly: 0.27 ± 0.144
0.27CysHis: 0.27 ± 0.168
0.359CysIle: 0.359 ± 0.211
0.539CysLys: 0.539 ± 0.246
0.27CysLeu: 0.27 ± 0.153
0.0CysMet: 0.0 ± 0.0
0.27CysAsn: 0.27 ± 0.151
0.27CysPro: 0.27 ± 0.165
0.18CysGln: 0.18 ± 0.132
0.27CysArg: 0.27 ± 0.148
0.539CysSer: 0.539 ± 0.22
0.18CysThr: 0.18 ± 0.15
0.449CysVal: 0.449 ± 0.235
0.09CysTrp: 0.09 ± 0.082
0.09CysTyr: 0.09 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
2.876AspAla: 2.876 ± 0.641
0.719AspCys: 0.719 ± 0.288
4.044AspAsp: 4.044 ± 0.673
7.01AspGlu: 7.01 ± 1.122
2.876AspPhe: 2.876 ± 0.407
6.111AspGly: 6.111 ± 1.283
1.078AspHis: 1.078 ± 0.39
4.943AspIle: 4.943 ± 0.538
5.123AspLys: 5.123 ± 0.457
4.673AspLeu: 4.673 ± 0.588
0.809AspMet: 0.809 ± 0.251
4.134AspAsn: 4.134 ± 0.616
1.168AspPro: 1.168 ± 0.326
1.977AspGln: 1.977 ± 0.454
2.786AspArg: 2.786 ± 0.536
3.415AspSer: 3.415 ± 0.461
3.325AspThr: 3.325 ± 0.582
3.595AspVal: 3.595 ± 0.605
1.168AspTrp: 1.168 ± 0.334
2.606AspTyr: 2.606 ± 0.539
0.0AspXaa: 0.0 ± 0.0
Glu
6.381GluAla: 6.381 ± 1.016
0.539GluCys: 0.539 ± 0.261
3.864GluAsp: 3.864 ± 0.666
5.662GluGlu: 5.662 ± 0.964
3.595GluPhe: 3.595 ± 0.491
3.595GluGly: 3.595 ± 0.717
1.078GluHis: 1.078 ± 0.317
6.92GluIle: 6.92 ± 0.971
6.381GluLys: 6.381 ± 1.058
7.549GluLeu: 7.549 ± 0.99
2.516GluMet: 2.516 ± 0.518
3.775GluAsn: 3.775 ± 0.532
1.887GluPro: 1.887 ± 0.381
3.325GluGln: 3.325 ± 0.668
4.224GluArg: 4.224 ± 0.561
5.123GluSer: 5.123 ± 0.679
4.404GluThr: 4.404 ± 0.433
4.853GluVal: 4.853 ± 0.688
1.258GluTrp: 1.258 ± 0.275
1.977GluTyr: 1.977 ± 0.313
0.0GluXaa: 0.0 ± 0.0
Phe
2.876PheAla: 2.876 ± 0.534
0.18PheCys: 0.18 ± 0.132
2.786PheAsp: 2.786 ± 0.708
3.056PheGlu: 3.056 ± 0.531
1.078PhePhe: 1.078 ± 0.297
2.876PheGly: 2.876 ± 0.508
0.629PheHis: 0.629 ± 0.217
2.157PheIle: 2.157 ± 0.545
3.056PheLys: 3.056 ± 0.397
2.337PheLeu: 2.337 ± 0.397
0.809PheMet: 0.809 ± 0.248
2.516PheAsn: 2.516 ± 0.626
1.258PhePro: 1.258 ± 0.362
1.078PheGln: 1.078 ± 0.312
1.797PheArg: 1.797 ± 0.37
2.516PheSer: 2.516 ± 0.679
2.966PheThr: 2.966 ± 0.609
3.056PheVal: 3.056 ± 0.609
0.359PheTrp: 0.359 ± 0.203
2.157PheTyr: 2.157 ± 0.499
0.0PheXaa: 0.0 ± 0.0
Gly
3.595GlyAla: 3.595 ± 0.838
0.359GlyCys: 0.359 ± 0.166
3.685GlyAsp: 3.685 ± 0.519
4.853GlyGlu: 4.853 ± 0.674
3.325GlyPhe: 3.325 ± 0.591
4.404GlyGly: 4.404 ± 0.824
1.348GlyHis: 1.348 ± 0.404
4.673GlyIle: 4.673 ± 0.689
5.752GlyLys: 5.752 ± 0.66
5.213GlyLeu: 5.213 ± 0.653
3.056GlyMet: 3.056 ± 0.526
4.583GlyAsn: 4.583 ± 0.836
1.528GlyPro: 1.528 ± 0.733
2.516GlyGln: 2.516 ± 0.384
3.775GlyArg: 3.775 ± 0.772
2.966GlySer: 2.966 ± 0.718
4.044GlyThr: 4.044 ± 1.128
4.224GlyVal: 4.224 ± 0.655
1.168GlyTrp: 1.168 ± 0.361
3.146GlyTyr: 3.146 ± 0.522
0.0GlyXaa: 0.0 ± 0.0
His
0.899HisAla: 0.899 ± 0.302
0.18HisCys: 0.18 ± 0.117
1.708HisAsp: 1.708 ± 0.533
0.899HisGlu: 0.899 ± 0.267
0.719HisPhe: 0.719 ± 0.364
1.258HisGly: 1.258 ± 0.322
0.359HisHis: 0.359 ± 0.183
0.899HisIle: 0.899 ± 0.237
0.539HisLys: 0.539 ± 0.219
1.438HisLeu: 1.438 ± 0.427
0.27HisMet: 0.27 ± 0.156
0.719HisAsn: 0.719 ± 0.217
0.629HisPro: 0.629 ± 0.19
0.449HisGln: 0.449 ± 0.165
0.719HisArg: 0.719 ± 0.214
0.629HisSer: 0.629 ± 0.203
0.719HisThr: 0.719 ± 0.16
0.809HisVal: 0.809 ± 0.3
0.359HisTrp: 0.359 ± 0.159
0.629HisTyr: 0.629 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
4.044IleAla: 4.044 ± 0.517
0.449IleCys: 0.449 ± 0.267
4.583IleAsp: 4.583 ± 0.559
7.01IleGlu: 7.01 ± 0.927
2.966IlePhe: 2.966 ± 0.474
5.033IleGly: 5.033 ± 0.964
1.618IleHis: 1.618 ± 0.311
3.595IleIle: 3.595 ± 0.723
6.92IleLys: 6.92 ± 0.775
3.954IleLeu: 3.954 ± 0.57
1.078IleMet: 1.078 ± 0.266
3.685IleAsn: 3.685 ± 0.608
2.786IlePro: 2.786 ± 0.588
1.797IleGln: 1.797 ± 0.33
3.954IleArg: 3.954 ± 0.652
2.516IleSer: 2.516 ± 0.433
4.583IleThr: 4.583 ± 0.715
3.954IleVal: 3.954 ± 0.627
0.539IleTrp: 0.539 ± 0.209
2.516IleTyr: 2.516 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
6.381LysAla: 6.381 ± 0.898
0.18LysCys: 0.18 ± 0.129
4.943LysAsp: 4.943 ± 0.689
7.19LysGlu: 7.19 ± 1.346
2.696LysPhe: 2.696 ± 0.619
4.853LysGly: 4.853 ± 0.881
0.989LysHis: 0.989 ± 0.344
6.201LysIle: 6.201 ± 0.836
5.842LysLys: 5.842 ± 1.05
7.369LysLeu: 7.369 ± 0.831
1.977LysMet: 1.977 ± 0.394
4.314LysAsn: 4.314 ± 0.703
2.067LysPro: 2.067 ± 0.388
3.595LysGln: 3.595 ± 0.665
3.595LysArg: 3.595 ± 0.735
3.505LysSer: 3.505 ± 0.499
4.583LysThr: 4.583 ± 0.643
5.932LysVal: 5.932 ± 0.941
1.078LysTrp: 1.078 ± 0.357
2.067LysTyr: 2.067 ± 0.382
0.0LysXaa: 0.0 ± 0.0
Leu
7.28LeuAla: 7.28 ± 1.055
0.629LeuCys: 0.629 ± 0.245
5.572LeuAsp: 5.572 ± 0.629
6.92LeuGlu: 6.92 ± 1.056
2.067LeuPhe: 2.067 ± 0.536
4.494LeuGly: 4.494 ± 0.906
0.899LeuHis: 0.899 ± 0.269
3.954LeuIle: 3.954 ± 0.545
6.471LeuLys: 6.471 ± 0.861
6.021LeuLeu: 6.021 ± 0.773
1.887LeuMet: 1.887 ± 0.411
4.853LeuAsn: 4.853 ± 0.667
2.247LeuPro: 2.247 ± 0.326
2.067LeuGln: 2.067 ± 0.428
3.415LeuArg: 3.415 ± 0.566
5.033LeuSer: 5.033 ± 0.696
6.021LeuThr: 6.021 ± 0.865
4.673LeuVal: 4.673 ± 0.745
0.719LeuTrp: 0.719 ± 0.215
2.606LeuTyr: 2.606 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
2.786MetAla: 2.786 ± 0.565
0.0MetCys: 0.0 ± 0.0
1.258MetAsp: 1.258 ± 0.312
1.618MetGlu: 1.618 ± 0.382
0.719MetPhe: 0.719 ± 0.225
1.438MetGly: 1.438 ± 0.324
0.359MetHis: 0.359 ± 0.194
1.887MetIle: 1.887 ± 0.418
2.786MetLys: 2.786 ± 0.592
1.618MetLeu: 1.618 ± 0.381
0.809MetMet: 0.809 ± 0.294
1.887MetAsn: 1.887 ± 0.377
0.27MetPro: 0.27 ± 0.145
0.989MetGln: 0.989 ± 0.312
0.989MetArg: 0.989 ± 0.323
2.247MetSer: 2.247 ± 0.553
1.797MetThr: 1.797 ± 0.305
1.618MetVal: 1.618 ± 0.402
0.18MetTrp: 0.18 ± 0.125
1.078MetTyr: 1.078 ± 0.305
0.0MetXaa: 0.0 ± 0.0
Asn
4.134AsnAla: 4.134 ± 0.705
0.359AsnCys: 0.359 ± 0.166
2.966AsnAsp: 2.966 ± 0.618
4.134AsnGlu: 4.134 ± 0.576
2.606AsnPhe: 2.606 ± 0.367
6.201AsnGly: 6.201 ± 0.67
1.258AsnHis: 1.258 ± 0.296
3.595AsnIle: 3.595 ± 0.494
3.235AsnLys: 3.235 ± 0.421
3.056AsnLeu: 3.056 ± 0.491
1.438AsnMet: 1.438 ± 0.405
2.247AsnAsn: 2.247 ± 0.373
2.247AsnPro: 2.247 ± 0.525
2.786AsnGln: 2.786 ± 0.538
2.427AsnArg: 2.427 ± 0.439
3.235AsnSer: 3.235 ± 0.565
3.595AsnThr: 3.595 ± 0.712
3.415AsnVal: 3.415 ± 0.463
1.168AsnTrp: 1.168 ± 0.438
2.427AsnTyr: 2.427 ± 0.407
0.0AsnXaa: 0.0 ± 0.0
Pro
1.618ProAla: 1.618 ± 0.329
0.09ProCys: 0.09 ± 0.082
1.887ProAsp: 1.887 ± 0.45
1.708ProGlu: 1.708 ± 0.452
1.708ProPhe: 1.708 ± 0.339
1.258ProGly: 1.258 ± 0.546
0.0ProHis: 0.0 ± 0.0
1.438ProIle: 1.438 ± 0.403
3.146ProLys: 3.146 ± 0.681
2.067ProLeu: 2.067 ± 0.477
0.539ProMet: 0.539 ± 0.227
2.427ProAsn: 2.427 ± 0.401
0.629ProPro: 0.629 ± 0.25
0.899ProGln: 0.899 ± 0.274
1.348ProArg: 1.348 ± 0.425
2.247ProSer: 2.247 ± 0.633
2.606ProThr: 2.606 ± 0.647
1.797ProVal: 1.797 ± 0.292
0.09ProTrp: 0.09 ± 0.085
0.899ProTyr: 0.899 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
3.505GlnAla: 3.505 ± 0.62
0.09GlnCys: 0.09 ± 0.082
1.887GlnAsp: 1.887 ± 0.513
2.606GlnGlu: 2.606 ± 0.536
0.989GlnPhe: 0.989 ± 0.265
2.067GlnGly: 2.067 ± 0.689
0.539GlnHis: 0.539 ± 0.246
2.337GlnIle: 2.337 ± 0.452
3.415GlnLys: 3.415 ± 0.617
2.337GlnLeu: 2.337 ± 0.751
1.348GlnMet: 1.348 ± 0.307
1.258GlnAsn: 1.258 ± 0.243
1.258GlnPro: 1.258 ± 0.358
1.438GlnGln: 1.438 ± 0.337
2.337GlnArg: 2.337 ± 0.59
2.157GlnSer: 2.157 ± 0.406
1.887GlnThr: 1.887 ± 0.474
2.247GlnVal: 2.247 ± 0.467
0.449GlnTrp: 0.449 ± 0.18
1.348GlnTyr: 1.348 ± 0.355
0.0GlnXaa: 0.0 ± 0.0
Arg
3.056ArgAla: 3.056 ± 0.562
0.09ArgCys: 0.09 ± 0.091
2.696ArgAsp: 2.696 ± 0.714
3.775ArgGlu: 3.775 ± 0.603
2.247ArgPhe: 2.247 ± 0.446
2.516ArgGly: 2.516 ± 0.482
0.539ArgHis: 0.539 ± 0.171
3.056ArgIle: 3.056 ± 0.534
3.775ArgLys: 3.775 ± 0.597
4.224ArgLeu: 4.224 ± 0.665
1.618ArgMet: 1.618 ± 0.386
2.247ArgAsn: 2.247 ± 0.434
1.708ArgPro: 1.708 ± 0.412
1.887ArgGln: 1.887 ± 0.447
2.427ArgArg: 2.427 ± 0.482
1.618ArgSer: 1.618 ± 0.395
2.516ArgThr: 2.516 ± 0.632
3.685ArgVal: 3.685 ± 0.584
1.348ArgTrp: 1.348 ± 0.376
2.427ArgTyr: 2.427 ± 0.355
0.0ArgXaa: 0.0 ± 0.0
Ser
4.404SerAla: 4.404 ± 0.775
0.27SerCys: 0.27 ± 0.194
4.853SerAsp: 4.853 ± 0.501
4.224SerGlu: 4.224 ± 0.601
2.966SerPhe: 2.966 ± 0.553
3.954SerGly: 3.954 ± 0.961
0.539SerHis: 0.539 ± 0.239
3.146SerIle: 3.146 ± 0.573
4.044SerLys: 4.044 ± 0.769
5.392SerLeu: 5.392 ± 0.556
1.887SerMet: 1.887 ± 0.345
3.235SerAsn: 3.235 ± 0.476
1.348SerPro: 1.348 ± 0.336
2.516SerGln: 2.516 ± 0.51
1.797SerArg: 1.797 ± 0.423
5.302SerSer: 5.302 ± 1.055
4.494SerThr: 4.494 ± 0.873
3.775SerVal: 3.775 ± 0.601
0.989SerTrp: 0.989 ± 0.274
2.427SerTyr: 2.427 ± 0.504
0.0SerXaa: 0.0 ± 0.0
Thr
3.954ThrAla: 3.954 ± 0.549
0.449ThrCys: 0.449 ± 0.211
4.224ThrAsp: 4.224 ± 0.696
3.775ThrGlu: 3.775 ± 0.623
2.427ThrPhe: 2.427 ± 0.454
5.302ThrGly: 5.302 ± 0.988
1.078ThrHis: 1.078 ± 0.347
5.482ThrIle: 5.482 ± 0.819
4.224ThrLys: 4.224 ± 0.652
5.482ThrLeu: 5.482 ± 0.761
1.258ThrMet: 1.258 ± 0.259
3.505ThrAsn: 3.505 ± 0.709
1.438ThrPro: 1.438 ± 0.435
1.528ThrGln: 1.528 ± 0.628
2.157ThrArg: 2.157 ± 0.421
3.864ThrSer: 3.864 ± 0.798
2.966ThrThr: 2.966 ± 0.617
5.752ThrVal: 5.752 ± 1.169
0.899ThrTrp: 0.899 ± 0.34
1.977ThrTyr: 1.977 ± 0.471
0.0ThrXaa: 0.0 ± 0.0
Val
4.673ValAla: 4.673 ± 0.598
0.539ValCys: 0.539 ± 0.225
4.673ValAsp: 4.673 ± 0.602
3.685ValGlu: 3.685 ± 0.614
2.516ValPhe: 2.516 ± 0.472
4.853ValGly: 4.853 ± 0.594
0.629ValHis: 0.629 ± 0.22
5.033ValIle: 5.033 ± 0.648
4.853ValLys: 4.853 ± 0.838
5.302ValLeu: 5.302 ± 0.838
1.797ValMet: 1.797 ± 0.342
3.056ValAsn: 3.056 ± 0.628
2.696ValPro: 2.696 ± 0.534
1.708ValGln: 1.708 ± 0.403
2.606ValArg: 2.606 ± 0.539
4.673ValSer: 4.673 ± 0.631
3.864ValThr: 3.864 ± 0.704
4.044ValVal: 4.044 ± 0.65
0.989ValTrp: 0.989 ± 0.251
2.696ValTyr: 2.696 ± 0.53
0.0ValXaa: 0.0 ± 0.0
Trp
1.258TrpAla: 1.258 ± 0.356
0.0TrpCys: 0.0 ± 0.0
0.989TrpAsp: 0.989 ± 0.27
1.168TrpGlu: 1.168 ± 0.363
0.989TrpPhe: 0.989 ± 0.24
1.078TrpGly: 1.078 ± 0.338
0.09TrpHis: 0.09 ± 0.102
0.809TrpIle: 0.809 ± 0.298
0.719TrpLys: 0.719 ± 0.226
0.809TrpLeu: 0.809 ± 0.246
0.359TrpMet: 0.359 ± 0.149
1.078TrpAsn: 1.078 ± 0.337
0.0TrpPro: 0.0 ± 0.0
0.629TrpGln: 0.629 ± 0.269
0.629TrpArg: 0.629 ± 0.236
0.899TrpSer: 0.899 ± 0.347
1.438TrpThr: 1.438 ± 0.541
0.629TrpVal: 0.629 ± 0.321
0.18TrpTrp: 0.18 ± 0.144
0.27TrpTyr: 0.27 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.337TyrAla: 2.337 ± 0.442
0.18TyrCys: 0.18 ± 0.152
2.696TyrAsp: 2.696 ± 0.651
2.247TyrGlu: 2.247 ± 0.463
1.258TyrPhe: 1.258 ± 0.457
2.606TyrGly: 2.606 ± 0.49
0.899TyrHis: 0.899 ± 0.289
2.427TyrIle: 2.427 ± 0.526
3.056TyrLys: 3.056 ± 0.585
3.595TyrLeu: 3.595 ± 0.628
0.539TyrMet: 0.539 ± 0.249
2.427TyrAsn: 2.427 ± 0.527
0.899TyrPro: 0.899 ± 0.234
1.887TyrGln: 1.887 ± 0.371
2.606TyrArg: 2.606 ± 0.397
3.505TyrSer: 3.505 ± 0.648
1.168TyrThr: 1.168 ± 0.403
1.708TyrVal: 1.708 ± 0.384
0.27TyrTrp: 0.27 ± 0.229
1.618TyrTyr: 1.618 ± 0.426
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (11128 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski