Amino acid dipepetide frequency for Streptococcus phage CHPC1151

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.882AlaAla: 5.882 ± 2.563
0.196AlaCys: 0.196 ± 0.134
3.725AlaAsp: 3.725 ± 0.642
4.607AlaGlu: 4.607 ± 0.859
2.843AlaPhe: 2.843 ± 0.71
4.803AlaGly: 4.803 ± 1.27
0.588AlaHis: 0.588 ± 0.222
6.568AlaIle: 6.568 ± 1.52
5.882AlaLys: 5.882 ± 0.788
6.274AlaLeu: 6.274 ± 0.87
3.039AlaMet: 3.039 ± 0.927
5.196AlaAsn: 5.196 ± 0.724
2.745AlaPro: 2.745 ± 0.499
3.627AlaGln: 3.627 ± 0.75
3.039AlaArg: 3.039 ± 0.614
3.921AlaSer: 3.921 ± 0.854
5.0AlaThr: 5.0 ± 1.26
4.803AlaVal: 4.803 ± 1.42
0.392AlaTrp: 0.392 ± 0.189
2.059AlaTyr: 2.059 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
0.49CysAla: 0.49 ± 0.24
0.196CysCys: 0.196 ± 0.181
0.49CysAsp: 0.49 ± 0.232
0.784CysGlu: 0.784 ± 0.258
0.196CysPhe: 0.196 ± 0.129
0.49CysGly: 0.49 ± 0.21
0.294CysHis: 0.294 ± 0.184
0.294CysIle: 0.294 ± 0.145
0.588CysLys: 0.588 ± 0.261
0.392CysLeu: 0.392 ± 0.203
0.098CysMet: 0.098 ± 0.116
0.0CysAsn: 0.0 ± 0.0
0.196CysPro: 0.196 ± 0.146
0.196CysGln: 0.196 ± 0.149
0.196CysArg: 0.196 ± 0.142
0.784CysSer: 0.784 ± 0.26
0.098CysThr: 0.098 ± 0.08
0.294CysVal: 0.294 ± 0.156
0.196CysTrp: 0.196 ± 0.161
0.294CysTyr: 0.294 ± 0.174
0.0CysXaa: 0.0 ± 0.0
Asp
3.921AspAla: 3.921 ± 0.708
0.294AspCys: 0.294 ± 0.179
4.705AspAsp: 4.705 ± 1.079
4.019AspGlu: 4.019 ± 0.806
3.921AspPhe: 3.921 ± 0.687
4.411AspGly: 4.411 ± 0.662
0.49AspHis: 0.49 ± 0.206
4.215AspIle: 4.215 ± 0.64
5.0AspLys: 5.0 ± 0.867
4.019AspLeu: 4.019 ± 0.889
1.667AspMet: 1.667 ± 0.339
3.137AspAsn: 3.137 ± 0.602
1.078AspPro: 1.078 ± 0.27
1.568AspGln: 1.568 ± 0.352
3.039AspArg: 3.039 ± 0.882
2.843AspSer: 2.843 ± 0.801
3.333AspThr: 3.333 ± 0.528
3.333AspVal: 3.333 ± 0.665
0.588AspTrp: 0.588 ± 0.26
2.941AspTyr: 2.941 ± 0.597
0.0AspXaa: 0.0 ± 0.0
Glu
4.019GluAla: 4.019 ± 0.573
0.392GluCys: 0.392 ± 0.293
3.627GluAsp: 3.627 ± 0.731
7.45GluGlu: 7.45 ± 1.712
2.745GluPhe: 2.745 ± 0.548
2.843GluGly: 2.843 ± 0.405
1.078GluHis: 1.078 ± 0.353
5.098GluIle: 5.098 ± 1.111
5.196GluLys: 5.196 ± 0.846
7.45GluLeu: 7.45 ± 1.189
2.059GluMet: 2.059 ± 0.532
4.705GluAsn: 4.705 ± 0.83
1.765GluPro: 1.765 ± 0.469
3.725GluGln: 3.725 ± 0.917
3.921GluArg: 3.921 ± 0.792
3.333GluSer: 3.333 ± 0.464
4.215GluThr: 4.215 ± 0.781
6.372GluVal: 6.372 ± 0.964
0.784GluTrp: 0.784 ± 0.274
2.941GluTyr: 2.941 ± 0.8
0.0GluXaa: 0.0 ± 0.0
Phe
1.961PheAla: 1.961 ± 0.47
0.49PheCys: 0.49 ± 0.227
3.431PheAsp: 3.431 ± 0.645
4.019PheGlu: 4.019 ± 0.619
1.176PhePhe: 1.176 ± 0.265
3.431PheGly: 3.431 ± 0.71
0.784PheHis: 0.784 ± 0.202
2.745PheIle: 2.745 ± 0.426
3.333PheLys: 3.333 ± 0.649
1.568PheLeu: 1.568 ± 0.442
1.176PheMet: 1.176 ± 0.32
3.921PheAsn: 3.921 ± 0.589
0.784PhePro: 0.784 ± 0.381
1.176PheGln: 1.176 ± 0.29
0.98PheArg: 0.98 ± 0.302
3.725PheSer: 3.725 ± 0.687
2.059PheThr: 2.059 ± 0.309
2.059PheVal: 2.059 ± 0.368
0.196PheTrp: 0.196 ± 0.127
1.765PheTyr: 1.765 ± 0.561
0.0PheXaa: 0.0 ± 0.0
Gly
3.921GlyAla: 3.921 ± 1.159
0.49GlyCys: 0.49 ± 0.219
3.235GlyAsp: 3.235 ± 0.731
3.921GlyGlu: 3.921 ± 0.647
3.725GlyPhe: 3.725 ± 0.723
3.137GlyGly: 3.137 ± 0.562
1.372GlyHis: 1.372 ± 0.445
7.058GlyIle: 7.058 ± 1.787
4.705GlyLys: 4.705 ± 0.731
5.392GlyLeu: 5.392 ± 0.995
2.157GlyMet: 2.157 ± 0.71
3.333GlyAsn: 3.333 ± 0.645
0.392GlyPro: 0.392 ± 0.185
2.843GlyGln: 2.843 ± 0.414
1.961GlyArg: 1.961 ± 0.455
3.627GlySer: 3.627 ± 0.96
5.196GlyThr: 5.196 ± 1.137
2.255GlyVal: 2.255 ± 0.372
0.588GlyTrp: 0.588 ± 0.153
2.549GlyTyr: 2.549 ± 0.515
0.0GlyXaa: 0.0 ± 0.0
His
0.392HisAla: 0.392 ± 0.183
0.098HisCys: 0.098 ± 0.091
0.686HisAsp: 0.686 ± 0.327
0.784HisGlu: 0.784 ± 0.277
0.588HisPhe: 0.588 ± 0.202
0.686HisGly: 0.686 ± 0.281
0.392HisHis: 0.392 ± 0.201
0.392HisIle: 0.392 ± 0.217
1.176HisLys: 1.176 ± 0.384
1.47HisLeu: 1.47 ± 0.349
0.098HisMet: 0.098 ± 0.076
0.49HisAsn: 0.49 ± 0.204
0.588HisPro: 0.588 ± 0.227
0.49HisGln: 0.49 ± 0.205
0.784HisArg: 0.784 ± 0.222
0.686HisSer: 0.686 ± 0.226
0.686HisThr: 0.686 ± 0.222
0.588HisVal: 0.588 ± 0.247
0.294HisTrp: 0.294 ± 0.159
0.294HisTyr: 0.294 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
7.058IleAla: 7.058 ± 1.27
0.392IleCys: 0.392 ± 0.19
4.901IleAsp: 4.901 ± 0.678
5.392IleGlu: 5.392 ± 0.915
1.078IlePhe: 1.078 ± 0.269
5.49IleGly: 5.49 ± 1.376
0.784IleHis: 0.784 ± 0.338
3.823IleIle: 3.823 ± 1.013
7.45IleLys: 7.45 ± 0.677
4.607IleLeu: 4.607 ± 0.744
1.47IleMet: 1.47 ± 0.395
4.803IleAsn: 4.803 ± 0.743
2.157IlePro: 2.157 ± 0.438
2.353IleGln: 2.353 ± 0.394
2.843IleArg: 2.843 ± 0.608
6.666IleSer: 6.666 ± 1.243
5.196IleThr: 5.196 ± 0.756
3.921IleVal: 3.921 ± 0.483
0.588IleTrp: 0.588 ± 0.254
2.451IleTyr: 2.451 ± 0.569
0.0IleXaa: 0.0 ± 0.0
Lys
6.568LysAla: 6.568 ± 0.764
0.882LysCys: 0.882 ± 0.338
4.019LysAsp: 4.019 ± 0.605
6.372LysGlu: 6.372 ± 1.183
2.549LysPhe: 2.549 ± 0.457
4.607LysGly: 4.607 ± 0.497
0.98LysHis: 0.98 ± 0.327
5.294LysIle: 5.294 ± 0.81
5.0LysLys: 5.0 ± 0.862
6.568LysLeu: 6.568 ± 0.973
2.549LysMet: 2.549 ± 0.515
3.431LysAsn: 3.431 ± 0.5
3.235LysPro: 3.235 ± 0.551
4.019LysGln: 4.019 ± 0.666
4.509LysArg: 4.509 ± 0.888
6.372LysSer: 6.372 ± 0.759
5.784LysThr: 5.784 ± 0.799
4.607LysVal: 4.607 ± 0.717
0.882LysTrp: 0.882 ± 0.312
4.019LysTyr: 4.019 ± 0.812
0.0LysXaa: 0.0 ± 0.0
Leu
5.882LeuAla: 5.882 ± 0.833
0.294LeuCys: 0.294 ± 0.183
5.196LeuAsp: 5.196 ± 0.755
6.274LeuGlu: 6.274 ± 0.991
2.745LeuPhe: 2.745 ± 0.54
5.196LeuGly: 5.196 ± 0.883
0.784LeuHis: 0.784 ± 0.297
4.019LeuIle: 4.019 ± 0.706
7.94LeuLys: 7.94 ± 1.032
3.235LeuLeu: 3.235 ± 0.567
1.863LeuMet: 1.863 ± 0.486
5.0LeuAsn: 5.0 ± 0.741
2.353LeuPro: 2.353 ± 0.623
3.333LeuGln: 3.333 ± 0.631
3.627LeuArg: 3.627 ± 0.615
5.294LeuSer: 5.294 ± 0.861
5.196LeuThr: 5.196 ± 0.551
4.607LeuVal: 4.607 ± 0.766
0.392LeuTrp: 0.392 ± 0.191
2.941LeuTyr: 2.941 ± 0.644
0.0LeuXaa: 0.0 ± 0.0
Met
2.157MetAla: 2.157 ± 0.888
0.098MetCys: 0.098 ± 0.097
1.274MetAsp: 1.274 ± 0.402
1.274MetGlu: 1.274 ± 0.38
0.98MetPhe: 0.98 ± 0.244
1.667MetGly: 1.667 ± 0.404
0.392MetHis: 0.392 ± 0.176
2.745MetIle: 2.745 ± 0.523
2.843MetLys: 2.843 ± 0.523
2.353MetLeu: 2.353 ± 0.587
1.176MetMet: 1.176 ± 0.429
1.372MetAsn: 1.372 ± 0.32
0.294MetPro: 0.294 ± 0.116
1.667MetGln: 1.667 ± 0.575
1.47MetArg: 1.47 ± 0.432
2.157MetSer: 2.157 ± 0.453
2.157MetThr: 2.157 ± 0.45
1.863MetVal: 1.863 ± 0.619
0.196MetTrp: 0.196 ± 0.128
0.784MetTyr: 0.784 ± 0.368
0.0MetXaa: 0.0 ± 0.0
Asn
3.725AsnAla: 3.725 ± 0.531
0.49AsnCys: 0.49 ± 0.191
4.215AsnAsp: 4.215 ± 0.821
3.529AsnGlu: 3.529 ± 0.581
2.941AsnPhe: 2.941 ± 0.667
3.823AsnGly: 3.823 ± 0.613
0.686AsnHis: 0.686 ± 0.255
4.019AsnIle: 4.019 ± 0.723
4.313AsnLys: 4.313 ± 0.782
3.823AsnLeu: 3.823 ± 0.563
1.274AsnMet: 1.274 ± 0.32
2.451AsnAsn: 2.451 ± 0.382
2.647AsnPro: 2.647 ± 0.517
2.647AsnGln: 2.647 ± 0.536
1.47AsnArg: 1.47 ± 0.393
4.117AsnSer: 4.117 ± 0.714
3.039AsnThr: 3.039 ± 0.626
2.647AsnVal: 2.647 ± 0.511
1.274AsnTrp: 1.274 ± 0.347
1.863AsnTyr: 1.863 ± 0.368
0.0AsnXaa: 0.0 ± 0.0
Pro
1.372ProAla: 1.372 ± 0.425
0.196ProCys: 0.196 ± 0.252
1.176ProAsp: 1.176 ± 0.279
1.863ProGlu: 1.863 ± 0.428
1.47ProPhe: 1.47 ± 0.357
1.47ProGly: 1.47 ± 0.367
0.196ProHis: 0.196 ± 0.122
2.353ProIle: 2.353 ± 0.412
3.137ProLys: 3.137 ± 0.567
1.765ProLeu: 1.765 ± 0.519
0.196ProMet: 0.196 ± 0.139
1.176ProAsn: 1.176 ± 0.436
0.588ProPro: 0.588 ± 0.166
1.863ProGln: 1.863 ± 0.383
1.078ProArg: 1.078 ± 0.37
1.765ProSer: 1.765 ± 0.461
1.568ProThr: 1.568 ± 0.345
2.157ProVal: 2.157 ± 0.491
0.196ProTrp: 0.196 ± 0.134
1.961ProTyr: 1.961 ± 0.559
0.0ProXaa: 0.0 ± 0.0
Gln
5.392GlnAla: 5.392 ± 1.044
0.196GlnCys: 0.196 ± 0.149
1.961GlnAsp: 1.961 ± 0.451
2.647GlnGlu: 2.647 ± 0.758
1.765GlnPhe: 1.765 ± 0.402
2.647GlnGly: 2.647 ± 0.64
0.392GlnHis: 0.392 ± 0.187
3.529GlnIle: 3.529 ± 0.747
3.039GlnLys: 3.039 ± 0.478
4.607GlnLeu: 4.607 ± 0.828
2.353GlnMet: 2.353 ± 0.463
1.863GlnAsn: 1.863 ± 0.388
0.588GlnPro: 0.588 ± 0.231
3.039GlnGln: 3.039 ± 0.658
1.47GlnArg: 1.47 ± 0.371
3.235GlnSer: 3.235 ± 0.639
2.353GlnThr: 2.353 ± 0.394
3.039GlnVal: 3.039 ± 0.388
0.882GlnTrp: 0.882 ± 0.28
0.98GlnTyr: 0.98 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
2.549ArgAla: 2.549 ± 0.342
0.196ArgCys: 0.196 ± 0.112
3.431ArgAsp: 3.431 ± 0.9
3.627ArgGlu: 3.627 ± 0.841
1.47ArgPhe: 1.47 ± 0.461
1.961ArgGly: 1.961 ± 0.489
0.588ArgHis: 0.588 ± 0.243
2.451ArgIle: 2.451 ± 0.471
4.019ArgLys: 4.019 ± 0.946
3.431ArgLeu: 3.431 ± 0.613
1.274ArgMet: 1.274 ± 0.405
2.451ArgAsn: 2.451 ± 0.488
1.078ArgPro: 1.078 ± 0.399
1.765ArgGln: 1.765 ± 0.413
1.667ArgArg: 1.667 ± 0.465
2.255ArgSer: 2.255 ± 0.378
2.059ArgThr: 2.059 ± 0.562
1.961ArgVal: 1.961 ± 0.605
0.49ArgTrp: 0.49 ± 0.206
2.157ArgTyr: 2.157 ± 0.493
0.0ArgXaa: 0.0 ± 0.0
Ser
7.352SerAla: 7.352 ± 2.661
0.49SerCys: 0.49 ± 0.216
2.941SerAsp: 2.941 ± 0.573
4.019SerGlu: 4.019 ± 0.788
3.039SerPhe: 3.039 ± 0.551
4.901SerGly: 4.901 ± 0.856
0.392SerHis: 0.392 ± 0.17
5.49SerIle: 5.49 ± 0.776
5.392SerLys: 5.392 ± 0.678
6.862SerLeu: 6.862 ± 0.856
2.157SerMet: 2.157 ± 0.505
2.941SerAsn: 2.941 ± 0.552
1.863SerPro: 1.863 ± 0.363
3.725SerGln: 3.725 ± 0.762
1.961SerArg: 1.961 ± 0.596
4.607SerSer: 4.607 ± 0.893
3.529SerThr: 3.529 ± 0.829
5.0SerVal: 5.0 ± 0.66
0.49SerTrp: 0.49 ± 0.2
1.47SerTyr: 1.47 ± 0.422
0.0SerXaa: 0.0 ± 0.0
Thr
5.098ThrAla: 5.098 ± 1.165
0.294ThrCys: 0.294 ± 0.175
3.627ThrAsp: 3.627 ± 0.726
4.117ThrGlu: 4.117 ± 0.614
2.647ThrPhe: 2.647 ± 0.509
3.235ThrGly: 3.235 ± 0.767
0.686ThrHis: 0.686 ± 0.242
4.901ThrIle: 4.901 ± 0.609
5.686ThrLys: 5.686 ± 0.87
4.901ThrLeu: 4.901 ± 0.661
1.47ThrMet: 1.47 ± 0.551
2.745ThrAsn: 2.745 ± 0.653
2.353ThrPro: 2.353 ± 0.468
3.235ThrGln: 3.235 ± 0.744
2.059ThrArg: 2.059 ± 0.452
4.313ThrSer: 4.313 ± 1.121
3.823ThrThr: 3.823 ± 0.854
4.705ThrVal: 4.705 ± 0.51
0.294ThrTrp: 0.294 ± 0.129
2.647ThrTyr: 2.647 ± 0.615
0.0ThrXaa: 0.0 ± 0.0
Val
5.196ValAla: 5.196 ± 0.737
0.294ValCys: 0.294 ± 0.167
2.353ValAsp: 2.353 ± 0.517
4.901ValGlu: 4.901 ± 1.114
2.843ValPhe: 2.843 ± 0.499
3.921ValGly: 3.921 ± 0.744
0.294ValHis: 0.294 ± 0.152
5.196ValIle: 5.196 ± 0.575
4.019ValLys: 4.019 ± 0.656
2.745ValLeu: 2.745 ± 0.487
1.372ValMet: 1.372 ± 0.324
3.627ValAsn: 3.627 ± 0.689
1.667ValPro: 1.667 ± 0.327
2.255ValGln: 2.255 ± 0.585
1.863ValArg: 1.863 ± 0.293
4.803ValSer: 4.803 ± 0.839
4.901ValThr: 4.901 ± 0.603
3.529ValVal: 3.529 ± 0.672
0.98ValTrp: 0.98 ± 0.347
2.941ValTyr: 2.941 ± 0.704
0.0ValXaa: 0.0 ± 0.0
Trp
0.588TrpAla: 0.588 ± 0.249
0.098TrpCys: 0.098 ± 0.103
0.784TrpAsp: 0.784 ± 0.264
0.784TrpGlu: 0.784 ± 0.29
0.294TrpPhe: 0.294 ± 0.158
0.294TrpGly: 0.294 ± 0.191
0.196TrpHis: 0.196 ± 0.137
0.686TrpIle: 0.686 ± 0.224
0.49TrpLys: 0.49 ± 0.196
0.686TrpLeu: 0.686 ± 0.254
0.392TrpMet: 0.392 ± 0.179
0.49TrpAsn: 0.49 ± 0.172
0.196TrpPro: 0.196 ± 0.127
0.686TrpGln: 0.686 ± 0.25
0.784TrpArg: 0.784 ± 0.309
0.588TrpSer: 0.588 ± 0.204
0.882TrpThr: 0.882 ± 0.357
0.392TrpVal: 0.392 ± 0.194
0.196TrpTrp: 0.196 ± 0.121
0.686TrpTyr: 0.686 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.667TyrAla: 1.667 ± 0.439
0.49TyrCys: 0.49 ± 0.229
2.745TyrAsp: 2.745 ± 0.586
3.333TyrGlu: 3.333 ± 0.691
1.765TyrPhe: 1.765 ± 0.445
2.843TyrGly: 2.843 ± 0.609
0.294TyrHis: 0.294 ± 0.19
2.549TyrIle: 2.549 ± 0.438
3.039TyrLys: 3.039 ± 0.68
4.117TyrLeu: 4.117 ± 0.936
0.98TyrMet: 0.98 ± 0.34
1.863TyrAsn: 1.863 ± 0.418
0.98TyrPro: 0.98 ± 0.374
1.765TyrGln: 1.765 ± 0.529
2.157TyrArg: 2.157 ± 0.585
3.529TyrSer: 3.529 ± 0.616
1.765TyrThr: 1.765 ± 0.425
1.568TyrVal: 1.568 ± 0.488
0.294TyrTrp: 0.294 ± 0.138
2.451TyrTyr: 2.451 ± 0.647
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10202 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski