Amino acid dipepetide frequency for Streptococcus phage CHPC1084

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.26AlaAla: 6.26 ± 2.221
0.363AlaCys: 0.363 ± 0.198
5.352AlaAsp: 5.352 ± 0.817
4.627AlaGlu: 4.627 ± 0.745
3.266AlaPhe: 3.266 ± 1.11
4.808AlaGly: 4.808 ± 1.341
0.907AlaHis: 0.907 ± 0.298
5.715AlaIle: 5.715 ± 1.498
4.355AlaLys: 4.355 ± 0.478
5.987AlaLeu: 5.987 ± 0.983
2.268AlaMet: 2.268 ± 0.956
3.992AlaAsn: 3.992 ± 0.75
2.268AlaPro: 2.268 ± 0.474
3.175AlaGln: 3.175 ± 0.84
3.629AlaArg: 3.629 ± 0.761
6.169AlaSer: 6.169 ± 1.431
3.81AlaThr: 3.81 ± 1.001
5.806AlaVal: 5.806 ± 1.229
0.454AlaTrp: 0.454 ± 0.145
1.996AlaTyr: 1.996 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.272CysAla: 0.272 ± 0.152
0.0CysCys: 0.0 ± 0.0
0.726CysAsp: 0.726 ± 0.292
0.544CysGlu: 0.544 ± 0.225
0.181CysPhe: 0.181 ± 0.136
0.363CysGly: 0.363 ± 0.247
0.091CysHis: 0.091 ± 0.093
0.181CysIle: 0.181 ± 0.121
0.454CysLys: 0.454 ± 0.223
0.272CysLeu: 0.272 ± 0.162
0.091CysMet: 0.091 ± 0.093
0.181CysAsn: 0.181 ± 0.144
0.091CysPro: 0.091 ± 0.094
0.0CysGln: 0.0 ± 0.0
0.181CysArg: 0.181 ± 0.145
0.635CysSer: 0.635 ± 0.227
0.0CysThr: 0.0 ± 0.0
0.454CysVal: 0.454 ± 0.183
0.181CysTrp: 0.181 ± 0.14
0.091CysTyr: 0.091 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
3.084AspAla: 3.084 ± 0.564
0.363AspCys: 0.363 ± 0.204
4.536AspAsp: 4.536 ± 0.772
4.173AspGlu: 4.173 ± 0.928
3.81AspPhe: 3.81 ± 0.736
6.26AspGly: 6.26 ± 1.146
0.272AspHis: 0.272 ± 0.179
3.719AspIle: 3.719 ± 0.716
4.445AspLys: 4.445 ± 0.743
5.171AspLeu: 5.171 ± 0.617
1.452AspMet: 1.452 ± 0.371
4.445AspAsn: 4.445 ± 0.825
0.816AspPro: 0.816 ± 0.29
1.179AspGln: 1.179 ± 0.302
2.722AspArg: 2.722 ± 0.66
4.445AspSer: 4.445 ± 0.923
3.447AspThr: 3.447 ± 0.677
2.994AspVal: 2.994 ± 0.558
1.089AspTrp: 1.089 ± 0.342
3.175AspTyr: 3.175 ± 0.589
0.0AspXaa: 0.0 ± 0.0
Glu
5.171GluAla: 5.171 ± 0.899
0.181GluCys: 0.181 ± 0.122
2.812GluAsp: 2.812 ± 0.424
4.445GluGlu: 4.445 ± 1.036
2.54GluPhe: 2.54 ± 0.436
3.084GluGly: 3.084 ± 0.485
1.452GluHis: 1.452 ± 0.46
4.445GluIle: 4.445 ± 0.789
5.534GluLys: 5.534 ± 1.17
7.167GluLeu: 7.167 ± 1.343
2.812GluMet: 2.812 ± 0.6
4.264GluAsn: 4.264 ± 0.737
1.452GluPro: 1.452 ± 0.414
2.812GluGln: 2.812 ± 0.564
4.445GluArg: 4.445 ± 0.887
2.631GluSer: 2.631 ± 0.647
3.266GluThr: 3.266 ± 0.725
5.715GluVal: 5.715 ± 0.873
0.907GluTrp: 0.907 ± 0.406
3.175GluTyr: 3.175 ± 0.723
0.0GluXaa: 0.0 ± 0.0
Phe
2.268PheAla: 2.268 ± 0.355
0.091PheCys: 0.091 ± 0.092
2.54PheAsp: 2.54 ± 0.567
4.264PheGlu: 4.264 ± 0.766
1.27PhePhe: 1.27 ± 0.467
4.082PheGly: 4.082 ± 0.807
0.363PheHis: 0.363 ± 0.172
2.994PheIle: 2.994 ± 0.46
5.262PheLys: 5.262 ± 0.667
2.087PheLeu: 2.087 ± 0.562
0.635PheMet: 0.635 ± 0.244
3.357PheAsn: 3.357 ± 0.459
0.635PhePro: 0.635 ± 0.288
1.27PheGln: 1.27 ± 0.34
1.361PheArg: 1.361 ± 0.357
4.082PheSer: 4.082 ± 0.664
2.631PheThr: 2.631 ± 0.631
1.905PheVal: 1.905 ± 0.429
0.726PheTrp: 0.726 ± 0.258
1.452PheTyr: 1.452 ± 0.449
0.0PheXaa: 0.0 ± 0.0
Gly
4.99GlyAla: 4.99 ± 1.016
0.363GlyCys: 0.363 ± 0.17
3.538GlyAsp: 3.538 ± 0.504
2.903GlyGlu: 2.903 ± 0.452
3.357GlyPhe: 3.357 ± 0.48
2.994GlyGly: 2.994 ± 0.616
1.27GlyHis: 1.27 ± 0.438
6.713GlyIle: 6.713 ± 1.556
6.26GlyLys: 6.26 ± 0.933
6.35GlyLeu: 6.35 ± 0.955
1.542GlyMet: 1.542 ± 0.761
3.629GlyAsn: 3.629 ± 0.627
1.361GlyPro: 1.361 ± 0.643
3.266GlyGln: 3.266 ± 0.494
2.994GlyArg: 2.994 ± 0.61
3.538GlySer: 3.538 ± 0.806
4.627GlyThr: 4.627 ± 0.856
5.171GlyVal: 5.171 ± 0.692
0.635GlyTrp: 0.635 ± 0.241
2.812GlyTyr: 2.812 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
1.089HisAla: 1.089 ± 0.298
0.091HisCys: 0.091 ± 0.109
0.816HisAsp: 0.816 ± 0.237
0.635HisGlu: 0.635 ± 0.315
0.635HisPhe: 0.635 ± 0.202
0.998HisGly: 0.998 ± 0.316
0.454HisHis: 0.454 ± 0.197
0.816HisIle: 0.816 ± 0.254
1.179HisLys: 1.179 ± 0.344
1.27HisLeu: 1.27 ± 0.352
0.363HisMet: 0.363 ± 0.161
0.635HisAsn: 0.635 ± 0.313
0.272HisPro: 0.272 ± 0.14
0.363HisGln: 0.363 ± 0.168
0.816HisArg: 0.816 ± 0.258
1.089HisSer: 1.089 ± 0.373
0.998HisThr: 0.998 ± 0.256
1.27HisVal: 1.27 ± 0.392
0.091HisTrp: 0.091 ± 0.092
0.454HisTyr: 0.454 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
4.899IleAla: 4.899 ± 1.035
0.272IleCys: 0.272 ± 0.14
5.625IleAsp: 5.625 ± 0.636
4.355IleGlu: 4.355 ± 0.861
1.452IlePhe: 1.452 ± 0.445
5.897IleGly: 5.897 ± 1.076
0.907IleHis: 0.907 ± 0.276
2.903IleIle: 2.903 ± 0.627
4.536IleLys: 4.536 ± 0.604
3.538IleLeu: 3.538 ± 0.693
1.905IleMet: 1.905 ± 0.362
3.719IleAsn: 3.719 ± 0.869
2.631IlePro: 2.631 ± 0.659
3.538IleGln: 3.538 ± 0.548
2.631IleArg: 2.631 ± 0.675
5.625IleSer: 5.625 ± 1.665
4.355IleThr: 4.355 ± 0.787
4.173IleVal: 4.173 ± 0.801
0.726IleTrp: 0.726 ± 0.314
2.449IleTyr: 2.449 ± 0.719
0.0IleXaa: 0.0 ± 0.0
Lys
6.895LysAla: 6.895 ± 0.838
0.272LysCys: 0.272 ± 0.243
4.717LysAsp: 4.717 ± 0.699
7.076LysGlu: 7.076 ± 1.647
1.996LysPhe: 1.996 ± 0.447
5.08LysGly: 5.08 ± 0.814
1.27LysHis: 1.27 ± 0.42
5.443LysIle: 5.443 ± 0.613
5.08LysLys: 5.08 ± 1.127
7.076LysLeu: 7.076 ± 1.147
1.724LysMet: 1.724 ± 0.522
2.903LysAsn: 2.903 ± 0.529
3.719LysPro: 3.719 ± 0.693
2.812LysGln: 2.812 ± 0.565
4.808LysArg: 4.808 ± 0.745
4.627LysSer: 4.627 ± 0.596
4.717LysThr: 4.717 ± 0.672
3.266LysVal: 3.266 ± 0.598
1.089LysTrp: 1.089 ± 0.317
3.992LysTyr: 3.992 ± 0.769
0.0LysXaa: 0.0 ± 0.0
Leu
5.806LeuAla: 5.806 ± 0.964
0.181LeuCys: 0.181 ± 0.136
4.173LeuAsp: 4.173 ± 0.597
6.078LeuGlu: 6.078 ± 1.016
3.357LeuPhe: 3.357 ± 0.52
6.078LeuGly: 6.078 ± 0.969
0.816LeuHis: 0.816 ± 0.306
4.355LeuIle: 4.355 ± 0.607
6.441LeuLys: 6.441 ± 0.823
4.536LeuLeu: 4.536 ± 0.69
1.542LeuMet: 1.542 ± 0.322
5.352LeuAsn: 5.352 ± 0.593
1.905LeuPro: 1.905 ± 0.476
2.631LeuGln: 2.631 ± 0.435
2.903LeuArg: 2.903 ± 0.638
6.078LeuSer: 6.078 ± 0.642
6.26LeuThr: 6.26 ± 0.895
5.171LeuVal: 5.171 ± 0.547
0.454LeuTrp: 0.454 ± 0.227
2.994LeuTyr: 2.994 ± 0.574
0.0LeuXaa: 0.0 ± 0.0
Met
2.722MetAla: 2.722 ± 0.944
0.0MetCys: 0.0 ± 0.0
1.179MetAsp: 1.179 ± 0.277
0.998MetGlu: 0.998 ± 0.313
1.361MetPhe: 1.361 ± 0.265
1.089MetGly: 1.089 ± 0.38
0.363MetHis: 0.363 ± 0.198
1.452MetIle: 1.452 ± 0.392
2.087MetLys: 2.087 ± 0.544
1.452MetLeu: 1.452 ± 0.344
1.089MetMet: 1.089 ± 0.451
1.179MetAsn: 1.179 ± 0.335
0.544MetPro: 0.544 ± 0.223
1.361MetGln: 1.361 ± 0.445
0.998MetArg: 0.998 ± 0.313
2.359MetSer: 2.359 ± 0.438
1.633MetThr: 1.633 ± 0.366
1.633MetVal: 1.633 ± 0.503
0.0MetTrp: 0.0 ± 0.0
0.635MetTyr: 0.635 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
3.901AsnAla: 3.901 ± 0.604
0.272AsnCys: 0.272 ± 0.149
3.084AsnAsp: 3.084 ± 0.734
4.355AsnGlu: 4.355 ± 0.907
2.54AsnPhe: 2.54 ± 0.456
5.806AsnGly: 5.806 ± 1.018
1.27AsnHis: 1.27 ± 0.518
2.631AsnIle: 2.631 ± 0.579
4.717AsnLys: 4.717 ± 0.865
4.173AsnLeu: 4.173 ± 0.606
0.907AsnMet: 0.907 ± 0.271
3.629AsnAsn: 3.629 ± 0.735
2.631AsnPro: 2.631 ± 0.62
2.177AsnGln: 2.177 ± 0.44
2.54AsnArg: 2.54 ± 0.591
3.81AsnSer: 3.81 ± 0.647
2.994AsnThr: 2.994 ± 0.548
2.903AsnVal: 2.903 ± 0.482
1.179AsnTrp: 1.179 ± 0.378
1.814AsnTyr: 1.814 ± 0.419
0.0AsnXaa: 0.0 ± 0.0
Pro
1.724ProAla: 1.724 ± 0.338
0.181ProCys: 0.181 ± 0.134
1.724ProAsp: 1.724 ± 0.443
1.633ProGlu: 1.633 ± 0.418
1.27ProPhe: 1.27 ± 0.421
1.179ProGly: 1.179 ± 0.396
0.363ProHis: 0.363 ± 0.163
1.724ProIle: 1.724 ± 0.388
2.903ProLys: 2.903 ± 0.535
1.814ProLeu: 1.814 ± 0.39
0.091ProMet: 0.091 ± 0.081
2.177ProAsn: 2.177 ± 0.491
0.816ProPro: 0.816 ± 0.209
2.177ProGln: 2.177 ± 0.646
1.179ProArg: 1.179 ± 0.413
1.814ProSer: 1.814 ± 0.351
1.633ProThr: 1.633 ± 0.559
1.724ProVal: 1.724 ± 0.368
0.363ProTrp: 0.363 ± 0.157
1.089ProTyr: 1.089 ± 0.417
0.0ProXaa: 0.0 ± 0.0
Gln
3.901GlnAla: 3.901 ± 0.819
0.272GlnCys: 0.272 ± 0.157
2.812GlnAsp: 2.812 ± 0.513
2.903GlnGlu: 2.903 ± 0.703
2.449GlnPhe: 2.449 ± 0.61
2.812GlnGly: 2.812 ± 0.885
0.363GlnHis: 0.363 ± 0.154
2.449GlnIle: 2.449 ± 0.585
2.722GlnLys: 2.722 ± 0.466
4.264GlnLeu: 4.264 ± 0.451
1.452GlnMet: 1.452 ± 0.412
1.542GlnAsn: 1.542 ± 0.415
1.27GlnPro: 1.27 ± 0.32
1.633GlnGln: 1.633 ± 0.612
0.816GlnArg: 0.816 ± 0.281
2.54GlnSer: 2.54 ± 0.658
2.994GlnThr: 2.994 ± 0.425
2.359GlnVal: 2.359 ± 0.367
0.726GlnTrp: 0.726 ± 0.286
1.452GlnTyr: 1.452 ± 0.384
0.0GlnXaa: 0.0 ± 0.0
Arg
3.719ArgAla: 3.719 ± 0.526
0.726ArgCys: 0.726 ± 0.277
2.54ArgAsp: 2.54 ± 0.697
3.719ArgGlu: 3.719 ± 0.766
2.177ArgPhe: 2.177 ± 0.486
2.812ArgGly: 2.812 ± 0.51
0.454ArgHis: 0.454 ± 0.203
2.722ArgIle: 2.722 ± 0.615
3.447ArgLys: 3.447 ± 0.814
3.719ArgLeu: 3.719 ± 0.565
1.179ArgMet: 1.179 ± 0.284
1.452ArgAsn: 1.452 ± 0.303
0.816ArgPro: 0.816 ± 0.276
2.087ArgGln: 2.087 ± 0.482
1.452ArgArg: 1.452 ± 0.463
2.722ArgSer: 2.722 ± 0.469
2.087ArgThr: 2.087 ± 0.487
2.359ArgVal: 2.359 ± 0.513
0.635ArgTrp: 0.635 ± 0.278
2.177ArgTyr: 2.177 ± 0.47
0.0ArgXaa: 0.0 ± 0.0
Ser
6.985SerAla: 6.985 ± 2.788
0.544SerCys: 0.544 ± 0.241
4.808SerAsp: 4.808 ± 0.717
3.357SerGlu: 3.357 ± 0.632
2.631SerPhe: 2.631 ± 0.542
4.264SerGly: 4.264 ± 0.667
0.907SerHis: 0.907 ± 0.279
5.534SerIle: 5.534 ± 0.648
5.08SerLys: 5.08 ± 0.579
4.627SerLeu: 4.627 ± 0.864
1.27SerMet: 1.27 ± 0.274
4.082SerAsn: 4.082 ± 0.629
1.542SerPro: 1.542 ± 0.31
4.173SerGln: 4.173 ± 1.031
2.268SerArg: 2.268 ± 0.46
4.082SerSer: 4.082 ± 1.173
5.262SerThr: 5.262 ± 0.698
5.171SerVal: 5.171 ± 0.766
0.454SerTrp: 0.454 ± 0.215
1.996SerTyr: 1.996 ± 0.41
0.0SerXaa: 0.0 ± 0.0
Thr
4.627ThrAla: 4.627 ± 1.536
0.0ThrCys: 0.0 ± 0.0
3.084ThrAsp: 3.084 ± 0.562
3.992ThrGlu: 3.992 ± 0.651
3.538ThrPhe: 3.538 ± 0.418
4.082ThrGly: 4.082 ± 0.486
1.089ThrHis: 1.089 ± 0.394
4.717ThrIle: 4.717 ± 0.761
6.169ThrLys: 6.169 ± 0.855
5.262ThrLeu: 5.262 ± 0.635
1.452ThrMet: 1.452 ± 0.776
3.629ThrAsn: 3.629 ± 0.639
1.542ThrPro: 1.542 ± 0.357
2.994ThrGln: 2.994 ± 0.51
2.087ThrArg: 2.087 ± 0.522
3.266ThrSer: 3.266 ± 0.9
4.627ThrThr: 4.627 ± 0.73
4.99ThrVal: 4.99 ± 0.519
0.454ThrTrp: 0.454 ± 0.267
2.722ThrTyr: 2.722 ± 0.606
0.0ThrXaa: 0.0 ± 0.0
Val
4.082ValAla: 4.082 ± 1.107
0.363ValCys: 0.363 ± 0.207
4.173ValAsp: 4.173 ± 0.676
5.443ValGlu: 5.443 ± 0.749
2.631ValPhe: 2.631 ± 0.455
3.357ValGly: 3.357 ± 0.753
1.089ValHis: 1.089 ± 0.334
4.264ValIle: 4.264 ± 0.605
4.899ValLys: 4.899 ± 0.571
4.445ValLeu: 4.445 ± 0.47
0.816ValMet: 0.816 ± 0.344
4.536ValAsn: 4.536 ± 0.955
1.905ValPro: 1.905 ± 0.323
2.359ValGln: 2.359 ± 0.697
2.359ValArg: 2.359 ± 0.506
5.534ValSer: 5.534 ± 0.671
5.08ValThr: 5.08 ± 0.797
4.536ValVal: 4.536 ± 0.64
0.907ValTrp: 0.907 ± 0.292
1.905ValTyr: 1.905 ± 0.421
0.0ValXaa: 0.0 ± 0.0
Trp
0.363TrpAla: 0.363 ± 0.182
0.0TrpCys: 0.0 ± 0.0
0.544TrpAsp: 0.544 ± 0.223
0.816TrpGlu: 0.816 ± 0.275
0.544TrpPhe: 0.544 ± 0.247
0.816TrpGly: 0.816 ± 0.25
0.091TrpHis: 0.091 ± 0.067
0.544TrpIle: 0.544 ± 0.244
0.635TrpLys: 0.635 ± 0.267
0.726TrpLeu: 0.726 ± 0.258
0.181TrpMet: 0.181 ± 0.107
0.998TrpAsn: 0.998 ± 0.336
0.091TrpPro: 0.091 ± 0.094
0.181TrpGln: 0.181 ± 0.133
0.544TrpArg: 0.544 ± 0.237
1.542TrpSer: 1.542 ± 0.51
1.361TrpThr: 1.361 ± 0.628
1.089TrpVal: 1.089 ± 0.289
0.272TrpTrp: 0.272 ± 0.172
0.454TrpTyr: 0.454 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.449TyrAla: 2.449 ± 0.424
0.454TyrCys: 0.454 ± 0.195
2.812TyrAsp: 2.812 ± 0.745
1.996TyrGlu: 1.996 ± 0.546
2.268TyrPhe: 2.268 ± 0.612
2.449TyrGly: 2.449 ± 0.487
0.544TyrHis: 0.544 ± 0.28
2.812TyrIle: 2.812 ± 0.617
2.359TyrLys: 2.359 ± 0.598
3.266TyrLeu: 3.266 ± 0.626
1.27TyrMet: 1.27 ± 0.467
1.633TyrAsn: 1.633 ± 0.574
1.27TyrPro: 1.27 ± 0.352
1.542TyrGln: 1.542 ± 0.4
2.268TyrArg: 2.268 ± 0.521
2.449TyrSer: 2.449 ± 0.483
2.54TyrThr: 2.54 ± 0.762
1.996TyrVal: 1.996 ± 0.336
0.454TyrTrp: 0.454 ± 0.187
1.724TyrTyr: 1.724 ± 0.671
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (11024 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski