Amino acid dipepetide frequency for Streptococcus phage Javan345

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.553AlaAla: 4.553 ± 1.346
0.372AlaCys: 0.372 ± 0.156
5.017AlaAsp: 5.017 ± 0.71
5.668AlaGlu: 5.668 ± 0.806
3.066AlaPhe: 3.066 ± 0.795
4.924AlaGly: 4.924 ± 1.183
0.65AlaHis: 0.65 ± 0.205
5.389AlaIle: 5.389 ± 1.736
6.225AlaLys: 6.225 ± 0.789
7.247AlaLeu: 7.247 ± 1.532
2.973AlaMet: 2.973 ± 0.819
3.624AlaAsn: 3.624 ± 0.65
1.765AlaPro: 1.765 ± 0.435
3.066AlaGln: 3.066 ± 1.003
2.787AlaArg: 2.787 ± 0.585
5.575AlaSer: 5.575 ± 1.277
4.088AlaThr: 4.088 ± 0.881
3.531AlaVal: 3.531 ± 0.605
0.836AlaTrp: 0.836 ± 0.279
2.602AlaTyr: 2.602 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
0.279CysAla: 0.279 ± 0.158
0.0CysCys: 0.0 ± 0.0
0.279CysAsp: 0.279 ± 0.162
0.65CysGlu: 0.65 ± 0.257
0.372CysPhe: 0.372 ± 0.172
0.65CysGly: 0.65 ± 0.279
0.093CysHis: 0.093 ± 0.086
0.372CysIle: 0.372 ± 0.193
0.557CysLys: 0.557 ± 0.215
0.372CysLeu: 0.372 ± 0.166
0.0CysMet: 0.0 ± 0.0
0.279CysAsn: 0.279 ± 0.134
0.372CysPro: 0.372 ± 0.245
0.093CysGln: 0.093 ± 0.085
0.279CysArg: 0.279 ± 0.159
0.372CysSer: 0.372 ± 0.179
0.465CysThr: 0.465 ± 0.172
0.465CysVal: 0.465 ± 0.241
0.0CysTrp: 0.0 ± 0.0
0.465CysTyr: 0.465 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
3.345AspAla: 3.345 ± 0.739
0.65AspCys: 0.65 ± 0.236
3.531AspAsp: 3.531 ± 0.835
5.017AspGlu: 5.017 ± 0.772
3.438AspPhe: 3.438 ± 0.683
5.203AspGly: 5.203 ± 0.903
0.465AspHis: 0.465 ± 0.222
4.924AspIle: 4.924 ± 0.537
5.76AspLys: 5.76 ± 0.839
5.668AspLeu: 5.668 ± 0.816
1.858AspMet: 1.858 ± 0.313
3.995AspAsn: 3.995 ± 0.605
0.743AspPro: 0.743 ± 0.275
1.301AspGln: 1.301 ± 0.305
1.951AspArg: 1.951 ± 0.404
2.602AspSer: 2.602 ± 0.431
4.367AspThr: 4.367 ± 0.641
4.553AspVal: 4.553 ± 0.597
0.465AspTrp: 0.465 ± 0.216
2.787AspTyr: 2.787 ± 0.639
0.0AspXaa: 0.0 ± 0.0
Glu
5.575GluAla: 5.575 ± 0.756
0.186GluCys: 0.186 ± 0.133
4.553GluAsp: 4.553 ± 0.71
5.76GluGlu: 5.76 ± 1.092
3.438GluPhe: 3.438 ± 0.585
4.553GluGly: 4.553 ± 0.584
0.557GluHis: 0.557 ± 0.251
4.646GluIle: 4.646 ± 0.799
7.433GluLys: 7.433 ± 1.11
8.269GluLeu: 8.269 ± 1.174
2.23GluMet: 2.23 ± 0.495
3.716GluAsn: 3.716 ± 0.674
1.394GluPro: 1.394 ± 0.378
4.46GluGln: 4.46 ± 0.731
3.159GluArg: 3.159 ± 0.62
3.438GluSer: 3.438 ± 0.548
3.438GluThr: 3.438 ± 0.626
5.76GluVal: 5.76 ± 0.799
1.115GluTrp: 1.115 ± 0.28
2.694GluTyr: 2.694 ± 0.637
0.0GluXaa: 0.0 ± 0.0
Phe
3.159PheAla: 3.159 ± 1.021
0.279PheCys: 0.279 ± 0.153
3.716PheAsp: 3.716 ± 0.445
3.809PheGlu: 3.809 ± 0.644
1.579PhePhe: 1.579 ± 0.536
2.323PheGly: 2.323 ± 0.634
0.465PheHis: 0.465 ± 0.207
2.323PheIle: 2.323 ± 0.427
3.809PheLys: 3.809 ± 0.441
2.509PheLeu: 2.509 ± 0.414
1.208PheMet: 1.208 ± 0.339
2.509PheAsn: 2.509 ± 0.371
1.022PhePro: 1.022 ± 0.263
1.394PheGln: 1.394 ± 0.354
1.301PheArg: 1.301 ± 0.338
3.624PheSer: 3.624 ± 0.584
2.602PheThr: 2.602 ± 0.551
2.602PheVal: 2.602 ± 0.51
0.465PheTrp: 0.465 ± 0.202
1.487PheTyr: 1.487 ± 0.4
0.0PheXaa: 0.0 ± 0.0
Gly
3.995GlyAla: 3.995 ± 1.128
0.465GlyCys: 0.465 ± 0.304
3.995GlyAsp: 3.995 ± 0.529
4.367GlyGlu: 4.367 ± 0.541
2.787GlyPhe: 2.787 ± 0.41
2.602GlyGly: 2.602 ± 0.546
0.743GlyHis: 0.743 ± 0.237
5.946GlyIle: 5.946 ± 1.005
3.995GlyLys: 3.995 ± 0.688
5.668GlyLeu: 5.668 ± 0.856
1.487GlyMet: 1.487 ± 0.394
2.88GlyAsn: 2.88 ± 0.56
1.487GlyPro: 1.487 ± 0.303
3.438GlyGln: 3.438 ± 0.567
3.066GlyArg: 3.066 ± 0.47
4.367GlySer: 4.367 ± 0.984
3.809GlyThr: 3.809 ± 0.838
5.946GlyVal: 5.946 ± 1.192
1.487GlyTrp: 1.487 ± 0.542
2.23GlyTyr: 2.23 ± 0.506
0.0GlyXaa: 0.0 ± 0.0
His
0.929HisAla: 0.929 ± 0.298
0.372HisCys: 0.372 ± 0.202
0.929HisAsp: 0.929 ± 0.316
1.022HisGlu: 1.022 ± 0.305
0.65HisPhe: 0.65 ± 0.222
0.929HisGly: 0.929 ± 0.267
0.186HisHis: 0.186 ± 0.12
0.836HisIle: 0.836 ± 0.258
0.65HisLys: 0.65 ± 0.223
0.65HisLeu: 0.65 ± 0.283
0.465HisMet: 0.465 ± 0.228
0.557HisAsn: 0.557 ± 0.208
0.465HisPro: 0.465 ± 0.186
0.557HisGln: 0.557 ± 0.19
0.186HisArg: 0.186 ± 0.128
0.836HisSer: 0.836 ± 0.282
1.115HisThr: 1.115 ± 0.348
1.301HisVal: 1.301 ± 0.426
0.279HisTrp: 0.279 ± 0.151
0.372HisTyr: 0.372 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
5.203IleAla: 5.203 ± 1.261
0.372IleCys: 0.372 ± 0.165
4.831IleAsp: 4.831 ± 0.604
5.76IleGlu: 5.76 ± 0.89
1.858IlePhe: 1.858 ± 0.425
3.624IleGly: 3.624 ± 0.766
0.929IleHis: 0.929 ± 0.27
3.438IleIle: 3.438 ± 0.636
7.247IleLys: 7.247 ± 0.748
4.181IleLeu: 4.181 ± 0.675
0.929IleMet: 0.929 ± 0.339
4.738IleAsn: 4.738 ± 0.695
1.858IlePro: 1.858 ± 0.382
3.809IleGln: 3.809 ± 0.72
2.509IleArg: 2.509 ± 0.527
5.575IleSer: 5.575 ± 0.924
2.787IleThr: 2.787 ± 0.58
3.716IleVal: 3.716 ± 0.601
0.186IleTrp: 0.186 ± 0.126
1.765IleTyr: 1.765 ± 0.417
0.0IleXaa: 0.0 ± 0.0
Lys
6.69LysAla: 6.69 ± 1.008
0.557LysCys: 0.557 ± 0.28
4.646LysAsp: 4.646 ± 0.685
6.968LysGlu: 6.968 ± 1.281
2.694LysPhe: 2.694 ± 0.525
5.017LysGly: 5.017 ± 0.658
1.765LysHis: 1.765 ± 0.465
5.946LysIle: 5.946 ± 0.805
6.782LysLys: 6.782 ± 1.204
5.853LysLeu: 5.853 ± 0.774
1.765LysMet: 1.765 ± 0.354
4.646LysAsn: 4.646 ± 0.807
1.951LysPro: 1.951 ± 0.436
3.809LysGln: 3.809 ± 0.676
4.274LysArg: 4.274 ± 0.802
5.853LysSer: 5.853 ± 0.794
5.296LysThr: 5.296 ± 0.627
5.203LysVal: 5.203 ± 0.906
1.022LysTrp: 1.022 ± 0.354
3.438LysTyr: 3.438 ± 0.55
0.0LysXaa: 0.0 ± 0.0
Leu
6.782LeuAla: 6.782 ± 0.929
0.279LeuCys: 0.279 ± 0.149
5.575LeuAsp: 5.575 ± 0.636
6.225LeuGlu: 6.225 ± 0.714
2.88LeuPhe: 2.88 ± 0.443
5.946LeuGly: 5.946 ± 1.034
1.208LeuHis: 1.208 ± 0.306
4.274LeuIle: 4.274 ± 0.509
8.269LeuLys: 8.269 ± 0.882
4.924LeuLeu: 4.924 ± 0.664
1.208LeuMet: 1.208 ± 0.36
4.274LeuAsn: 4.274 ± 0.822
1.858LeuPro: 1.858 ± 0.478
3.624LeuGln: 3.624 ± 0.529
2.323LeuArg: 2.323 ± 0.492
6.411LeuSer: 6.411 ± 0.57
5.482LeuThr: 5.482 ± 0.929
4.738LeuVal: 4.738 ± 0.599
0.557LeuTrp: 0.557 ± 0.279
2.416LeuTyr: 2.416 ± 0.463
0.0LeuXaa: 0.0 ± 0.0
Met
2.602MetAla: 2.602 ± 0.994
0.0MetCys: 0.0 ± 0.0
1.858MetAsp: 1.858 ± 0.419
0.929MetGlu: 0.929 ± 0.283
0.65MetPhe: 0.65 ± 0.2
1.394MetGly: 1.394 ± 0.428
0.557MetHis: 0.557 ± 0.232
1.487MetIle: 1.487 ± 0.39
1.765MetLys: 1.765 ± 0.457
2.323MetLeu: 2.323 ± 0.389
0.465MetMet: 0.465 ± 0.345
1.115MetAsn: 1.115 ± 0.324
0.743MetPro: 0.743 ± 0.404
1.022MetGln: 1.022 ± 0.259
1.672MetArg: 1.672 ± 0.35
1.301MetSer: 1.301 ± 0.354
1.487MetThr: 1.487 ± 0.34
1.487MetVal: 1.487 ± 0.342
0.0MetTrp: 0.0 ± 0.0
0.836MetTyr: 0.836 ± 0.268
0.0MetXaa: 0.0 ± 0.0
Asn
3.995AsnAla: 3.995 ± 0.652
0.465AsnCys: 0.465 ± 0.204
3.066AsnAsp: 3.066 ± 0.632
3.716AsnGlu: 3.716 ± 0.828
2.416AsnPhe: 2.416 ± 0.368
4.738AsnGly: 4.738 ± 0.817
0.65AsnHis: 0.65 ± 0.24
3.066AsnIle: 3.066 ± 0.534
2.973AsnLys: 2.973 ± 0.667
3.809AsnLeu: 3.809 ± 0.43
1.022AsnMet: 1.022 ± 0.277
2.323AsnAsn: 2.323 ± 0.437
1.672AsnPro: 1.672 ± 0.407
3.159AsnGln: 3.159 ± 0.578
2.509AsnArg: 2.509 ± 0.502
3.902AsnSer: 3.902 ± 0.564
2.509AsnThr: 2.509 ± 0.421
4.088AsnVal: 4.088 ± 0.601
1.394AsnTrp: 1.394 ± 0.385
1.765AsnTyr: 1.765 ± 0.407
0.0AsnXaa: 0.0 ± 0.0
Pro
2.137ProAla: 2.137 ± 0.411
0.186ProCys: 0.186 ± 0.129
1.672ProAsp: 1.672 ± 0.439
2.23ProGlu: 2.23 ± 0.437
1.208ProPhe: 1.208 ± 0.344
1.115ProGly: 1.115 ± 0.287
0.279ProHis: 0.279 ± 0.143
1.951ProIle: 1.951 ± 0.372
2.23ProLys: 2.23 ± 0.469
1.579ProLeu: 1.579 ± 0.393
0.372ProMet: 0.372 ± 0.167
1.301ProAsn: 1.301 ± 0.307
0.557ProPro: 0.557 ± 0.202
1.579ProGln: 1.579 ± 0.442
1.208ProArg: 1.208 ± 0.37
1.208ProSer: 1.208 ± 0.381
1.487ProThr: 1.487 ± 0.402
1.765ProVal: 1.765 ± 0.34
0.372ProTrp: 0.372 ± 0.188
1.301ProTyr: 1.301 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
4.274GlnAla: 4.274 ± 1.115
0.557GlnCys: 0.557 ± 0.262
1.858GlnAsp: 1.858 ± 0.459
4.738GlnGlu: 4.738 ± 0.811
1.765GlnPhe: 1.765 ± 0.509
2.787GlnGly: 2.787 ± 0.52
0.557GlnHis: 0.557 ± 0.216
2.88GlnIle: 2.88 ± 0.636
3.252GlnLys: 3.252 ± 0.58
3.624GlnLeu: 3.624 ± 0.553
1.765GlnMet: 1.765 ± 0.379
3.159GlnAsn: 3.159 ± 0.53
0.743GlnPro: 0.743 ± 0.221
1.765GlnGln: 1.765 ± 0.426
1.579GlnArg: 1.579 ± 0.488
3.624GlnSer: 3.624 ± 0.772
2.044GlnThr: 2.044 ± 0.421
2.509GlnVal: 2.509 ± 0.38
0.186GlnTrp: 0.186 ± 0.128
1.022GlnTyr: 1.022 ± 0.249
0.0GlnXaa: 0.0 ± 0.0
Arg
2.602ArgAla: 2.602 ± 0.413
0.279ArgCys: 0.279 ± 0.186
2.137ArgAsp: 2.137 ± 0.35
3.531ArgGlu: 3.531 ± 0.587
1.672ArgPhe: 1.672 ± 0.334
2.602ArgGly: 2.602 ± 0.464
0.372ArgHis: 0.372 ± 0.195
2.137ArgIle: 2.137 ± 0.392
3.252ArgLys: 3.252 ± 0.653
3.902ArgLeu: 3.902 ± 0.672
1.022ArgMet: 1.022 ± 0.323
1.951ArgAsn: 1.951 ± 0.376
1.579ArgPro: 1.579 ± 0.369
1.301ArgGln: 1.301 ± 0.452
1.579ArgArg: 1.579 ± 0.445
2.137ArgSer: 2.137 ± 0.45
2.137ArgThr: 2.137 ± 0.43
2.23ArgVal: 2.23 ± 0.581
1.208ArgTrp: 1.208 ± 0.387
2.323ArgTyr: 2.323 ± 0.644
0.0ArgXaa: 0.0 ± 0.0
Ser
5.575SerAla: 5.575 ± 1.426
0.372SerCys: 0.372 ± 0.154
4.274SerAsp: 4.274 ± 0.523
4.46SerGlu: 4.46 ± 0.578
2.787SerPhe: 2.787 ± 0.62
4.831SerGly: 4.831 ± 1.025
1.022SerHis: 1.022 ± 0.343
4.367SerIle: 4.367 ± 0.682
4.553SerLys: 4.553 ± 0.687
5.482SerLeu: 5.482 ± 1.084
1.672SerMet: 1.672 ± 0.402
4.924SerAsn: 4.924 ± 0.735
1.951SerPro: 1.951 ± 0.527
3.531SerGln: 3.531 ± 0.368
1.765SerArg: 1.765 ± 0.384
4.924SerSer: 4.924 ± 1.192
4.46SerThr: 4.46 ± 1.087
3.902SerVal: 3.902 ± 0.835
0.743SerTrp: 0.743 ± 0.307
2.602SerTyr: 2.602 ± 0.593
0.0SerXaa: 0.0 ± 0.0
Thr
4.367ThrAla: 4.367 ± 0.997
0.186ThrCys: 0.186 ± 0.102
3.438ThrAsp: 3.438 ± 0.585
3.716ThrGlu: 3.716 ± 0.768
3.438ThrPhe: 3.438 ± 0.748
4.181ThrGly: 4.181 ± 0.633
0.929ThrHis: 0.929 ± 0.331
4.367ThrIle: 4.367 ± 0.973
4.738ThrLys: 4.738 ± 0.742
5.203ThrLeu: 5.203 ± 0.712
0.743ThrMet: 0.743 ± 0.285
2.23ThrAsn: 2.23 ± 0.47
1.951ThrPro: 1.951 ± 0.561
2.137ThrGln: 2.137 ± 0.459
2.694ThrArg: 2.694 ± 0.705
3.438ThrSer: 3.438 ± 0.691
3.252ThrThr: 3.252 ± 0.628
4.738ThrVal: 4.738 ± 0.747
1.022ThrTrp: 1.022 ± 0.358
2.694ThrTyr: 2.694 ± 0.646
0.0ThrXaa: 0.0 ± 0.0
Val
5.203ValAla: 5.203 ± 1.022
0.093ValCys: 0.093 ± 0.076
4.831ValAsp: 4.831 ± 0.702
4.738ValGlu: 4.738 ± 0.606
2.509ValPhe: 2.509 ± 0.424
4.831ValGly: 4.831 ± 0.8
0.372ValHis: 0.372 ± 0.209
3.716ValIle: 3.716 ± 0.514
5.946ValLys: 5.946 ± 0.568
3.159ValLeu: 3.159 ± 0.504
1.672ValMet: 1.672 ± 0.302
2.973ValAsn: 2.973 ± 0.39
1.951ValPro: 1.951 ± 0.534
2.602ValGln: 2.602 ± 0.454
2.23ValArg: 2.23 ± 0.504
6.039ValSer: 6.039 ± 0.786
5.11ValThr: 5.11 ± 0.562
4.924ValVal: 4.924 ± 0.766
0.929ValTrp: 0.929 ± 0.271
2.323ValTyr: 2.323 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.557TrpAla: 0.557 ± 0.186
0.186TrpCys: 0.186 ± 0.125
0.65TrpAsp: 0.65 ± 0.227
1.115TrpGlu: 1.115 ± 0.28
0.465TrpPhe: 0.465 ± 0.281
0.465TrpGly: 0.465 ± 0.208
0.093TrpHis: 0.093 ± 0.092
0.836TrpIle: 0.836 ± 0.267
1.672TrpLys: 1.672 ± 0.393
1.115TrpLeu: 1.115 ± 0.323
0.279TrpMet: 0.279 ± 0.142
0.279TrpAsn: 0.279 ± 0.146
0.186TrpPro: 0.186 ± 0.141
0.743TrpGln: 0.743 ± 0.259
0.557TrpArg: 0.557 ± 0.194
0.929TrpSer: 0.929 ± 0.26
1.115TrpThr: 1.115 ± 0.372
0.743TrpVal: 0.743 ± 0.206
0.279TrpTrp: 0.279 ± 0.191
0.743TrpTyr: 0.743 ± 0.412
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.137TyrAla: 2.137 ± 0.34
0.557TyrCys: 0.557 ± 0.228
1.765TyrAsp: 1.765 ± 0.376
2.044TyrGlu: 2.044 ± 0.43
2.509TyrPhe: 2.509 ± 0.547
2.044TyrGly: 2.044 ± 0.32
1.394TyrHis: 1.394 ± 0.392
2.416TyrIle: 2.416 ± 0.547
3.066TyrLys: 3.066 ± 0.738
3.716TyrLeu: 3.716 ± 0.709
0.465TyrMet: 0.465 ± 0.175
1.765TyrAsn: 1.765 ± 0.386
1.487TyrPro: 1.487 ± 0.382
1.301TyrGln: 1.301 ± 0.281
2.323TyrArg: 2.323 ± 0.503
2.044TyrSer: 2.044 ± 0.371
2.416TyrThr: 2.416 ± 0.434
1.951TyrVal: 1.951 ± 0.402
0.465TyrTrp: 0.465 ± 0.187
1.672TyrTyr: 1.672 ± 0.474
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski