Amino acid dipepetide frequency for Streptococcus phage Javan241

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.117AlaAla: 5.117 ± 1.557
0.457AlaCys: 0.457 ± 0.162
4.02AlaAsp: 4.02 ± 0.62
5.391AlaGlu: 5.391 ± 0.648
2.741AlaPhe: 2.741 ± 1.144
5.939AlaGly: 5.939 ± 1.567
0.914AlaHis: 0.914 ± 0.257
5.3AlaIle: 5.3 ± 0.944
6.488AlaLys: 6.488 ± 0.699
9.046AlaLeu: 9.046 ± 2.039
2.193AlaMet: 2.193 ± 0.586
4.477AlaAsn: 4.477 ± 0.665
1.919AlaPro: 1.919 ± 0.364
2.924AlaGln: 2.924 ± 0.889
2.741AlaArg: 2.741 ± 0.487
5.026AlaSer: 5.026 ± 1.183
3.746AlaThr: 3.746 ± 0.797
6.67AlaVal: 6.67 ± 1.751
0.457AlaTrp: 0.457 ± 0.167
2.193AlaTyr: 2.193 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
0.365CysAla: 0.365 ± 0.185
0.091CysCys: 0.091 ± 0.095
0.548CysAsp: 0.548 ± 0.189
0.365CysGlu: 0.365 ± 0.162
0.183CysPhe: 0.183 ± 0.114
0.548CysGly: 0.548 ± 0.23
0.091CysHis: 0.091 ± 0.093
0.183CysIle: 0.183 ± 0.113
0.274CysLys: 0.274 ± 0.163
0.274CysLeu: 0.274 ± 0.117
0.091CysMet: 0.091 ± 0.108
0.091CysAsn: 0.091 ± 0.078
0.274CysPro: 0.274 ± 0.174
0.183CysGln: 0.183 ± 0.122
0.091CysArg: 0.091 ± 0.108
0.365CysSer: 0.365 ± 0.163
0.091CysThr: 0.091 ± 0.079
0.183CysVal: 0.183 ± 0.13
0.0CysTrp: 0.0 ± 0.0
0.274CysTyr: 0.274 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
5.117AspAla: 5.117 ± 0.879
0.457AspCys: 0.457 ± 0.269
4.751AspAsp: 4.751 ± 0.781
5.574AspGlu: 5.574 ± 0.842
2.833AspPhe: 2.833 ± 0.681
6.213AspGly: 6.213 ± 0.824
0.457AspHis: 0.457 ± 0.201
3.929AspIle: 3.929 ± 0.8
5.574AspLys: 5.574 ± 0.81
5.665AspLeu: 5.665 ± 0.725
1.919AspMet: 1.919 ± 0.386
2.833AspAsn: 2.833 ± 0.59
1.553AspPro: 1.553 ± 0.37
2.01AspGln: 2.01 ± 0.398
3.107AspArg: 3.107 ± 0.602
2.924AspSer: 2.924 ± 0.627
4.203AspThr: 4.203 ± 0.619
4.112AspVal: 4.112 ± 0.612
0.365AspTrp: 0.365 ± 0.175
3.655AspTyr: 3.655 ± 0.651
0.0AspXaa: 0.0 ± 0.0
Glu
4.934GluAla: 4.934 ± 0.941
0.274GluCys: 0.274 ± 0.158
3.746GluAsp: 3.746 ± 0.499
5.026GluGlu: 5.026 ± 0.983
3.015GluPhe: 3.015 ± 0.659
2.65GluGly: 2.65 ± 0.495
1.188GluHis: 1.188 ± 0.308
5.3GluIle: 5.3 ± 0.876
5.848GluLys: 5.848 ± 1.002
7.95GluLeu: 7.95 ± 1.313
1.462GluMet: 1.462 ± 0.431
4.386GluAsn: 4.386 ± 0.786
1.371GluPro: 1.371 ± 0.378
3.746GluGln: 3.746 ± 0.652
3.746GluArg: 3.746 ± 0.491
3.289GluSer: 3.289 ± 0.372
3.929GluThr: 3.929 ± 0.524
4.386GluVal: 4.386 ± 0.66
1.005GluTrp: 1.005 ± 0.252
2.467GluTyr: 2.467 ± 0.515
0.0GluXaa: 0.0 ± 0.0
Phe
2.741PheAla: 2.741 ± 0.63
0.183PheCys: 0.183 ± 0.144
4.569PheAsp: 4.569 ± 0.71
4.112PheGlu: 4.112 ± 0.771
1.005PhePhe: 1.005 ± 0.259
3.472PheGly: 3.472 ± 0.733
0.548PheHis: 0.548 ± 0.213
2.01PheIle: 2.01 ± 0.351
4.386PheLys: 4.386 ± 0.708
2.376PheLeu: 2.376 ± 0.376
1.005PheMet: 1.005 ± 0.247
2.193PheAsn: 2.193 ± 0.317
0.914PhePro: 0.914 ± 0.208
1.188PheGln: 1.188 ± 0.388
1.827PheArg: 1.827 ± 0.429
3.198PheSer: 3.198 ± 0.563
1.736PheThr: 1.736 ± 0.381
2.376PheVal: 2.376 ± 0.557
0.365PheTrp: 0.365 ± 0.203
0.914PheTyr: 0.914 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
5.026GlyAla: 5.026 ± 1.745
0.183GlyCys: 0.183 ± 0.135
3.655GlyAsp: 3.655 ± 0.702
3.015GlyGlu: 3.015 ± 0.501
3.655GlyPhe: 3.655 ± 0.627
3.472GlyGly: 3.472 ± 0.548
0.822GlyHis: 0.822 ± 0.223
5.482GlyIle: 5.482 ± 1.274
5.482GlyLys: 5.482 ± 0.664
5.848GlyLeu: 5.848 ± 1.152
2.193GlyMet: 2.193 ± 0.417
3.472GlyAsn: 3.472 ± 0.643
0.64GlyPro: 0.64 ± 0.226
3.381GlyGln: 3.381 ± 0.544
3.564GlyArg: 3.564 ± 0.548
4.02GlySer: 4.02 ± 0.754
4.02GlyThr: 4.02 ± 0.708
5.3GlyVal: 5.3 ± 0.945
0.365GlyTrp: 0.365 ± 0.165
2.741GlyTyr: 2.741 ± 0.57
0.0GlyXaa: 0.0 ± 0.0
His
0.822HisAla: 0.822 ± 0.235
0.091HisCys: 0.091 ± 0.086
0.457HisAsp: 0.457 ± 0.21
0.822HisGlu: 0.822 ± 0.208
0.64HisPhe: 0.64 ± 0.307
0.183HisGly: 0.183 ± 0.143
0.0HisHis: 0.0 ± 0.0
1.371HisIle: 1.371 ± 0.403
0.731HisLys: 0.731 ± 0.303
0.822HisLeu: 0.822 ± 0.32
0.457HisMet: 0.457 ± 0.226
0.64HisAsn: 0.64 ± 0.288
0.64HisPro: 0.64 ± 0.291
0.548HisGln: 0.548 ± 0.246
0.548HisArg: 0.548 ± 0.239
0.457HisSer: 0.457 ± 0.175
0.914HisThr: 0.914 ± 0.346
0.64HisVal: 0.64 ± 0.245
0.091HisTrp: 0.091 ± 0.083
0.64HisTyr: 0.64 ± 0.284
0.0HisXaa: 0.0 ± 0.0
Ile
6.122IleAla: 6.122 ± 0.959
0.091IleCys: 0.091 ± 0.088
4.751IleAsp: 4.751 ± 0.589
5.757IleGlu: 5.757 ± 0.817
1.553IlePhe: 1.553 ± 0.275
5.482IleGly: 5.482 ± 0.881
0.914IleHis: 0.914 ± 0.27
3.746IleIle: 3.746 ± 0.504
5.3IleLys: 5.3 ± 0.755
5.117IleLeu: 5.117 ± 0.632
1.371IleMet: 1.371 ± 0.27
4.569IleAsn: 4.569 ± 0.602
1.919IlePro: 1.919 ± 0.36
2.284IleGln: 2.284 ± 0.401
3.015IleArg: 3.015 ± 0.759
5.117IleSer: 5.117 ± 0.822
4.843IleThr: 4.843 ± 0.697
3.198IleVal: 3.198 ± 0.372
0.731IleTrp: 0.731 ± 0.321
2.924IleTyr: 2.924 ± 0.517
0.0IleXaa: 0.0 ± 0.0
Lys
7.127LysAla: 7.127 ± 0.955
0.548LysCys: 0.548 ± 0.208
4.02LysAsp: 4.02 ± 0.673
6.67LysGlu: 6.67 ± 0.932
2.741LysPhe: 2.741 ± 0.478
4.295LysGly: 4.295 ± 0.71
1.371LysHis: 1.371 ± 0.377
5.3LysIle: 5.3 ± 0.792
5.482LysLys: 5.482 ± 0.744
6.305LysLeu: 6.305 ± 0.88
2.65LysMet: 2.65 ± 0.562
5.117LysAsn: 5.117 ± 0.755
2.467LysPro: 2.467 ± 0.525
2.924LysGln: 2.924 ± 0.612
3.107LysArg: 3.107 ± 0.624
6.213LysSer: 6.213 ± 0.624
3.655LysThr: 3.655 ± 0.546
4.295LysVal: 4.295 ± 0.587
1.188LysTrp: 1.188 ± 0.29
2.284LysTyr: 2.284 ± 0.496
0.0LysXaa: 0.0 ± 0.0
Leu
6.579LeuAla: 6.579 ± 1.235
0.274LeuCys: 0.274 ± 0.15
6.396LeuAsp: 6.396 ± 0.679
5.939LeuGlu: 5.939 ± 0.975
2.284LeuPhe: 2.284 ± 0.437
5.026LeuGly: 5.026 ± 1.08
0.822LeuHis: 0.822 ± 0.289
4.66LeuIle: 4.66 ± 0.556
8.315LeuLys: 8.315 ± 0.869
5.482LeuLeu: 5.482 ± 0.676
1.827LeuMet: 1.827 ± 0.565
5.574LeuAsn: 5.574 ± 0.759
2.741LeuPro: 2.741 ± 0.652
3.929LeuGln: 3.929 ± 0.618
2.924LeuArg: 2.924 ± 0.659
6.67LeuSer: 6.67 ± 0.765
5.3LeuThr: 5.3 ± 0.663
4.843LeuVal: 4.843 ± 0.582
0.274LeuTrp: 0.274 ± 0.151
3.015LeuTyr: 3.015 ± 0.388
0.0LeuXaa: 0.0 ± 0.0
Met
2.193MetAla: 2.193 ± 0.573
0.091MetCys: 0.091 ± 0.091
1.827MetAsp: 1.827 ± 0.419
1.005MetGlu: 1.005 ± 0.285
0.731MetPhe: 0.731 ± 0.206
1.462MetGly: 1.462 ± 0.517
0.183MetHis: 0.183 ± 0.117
1.827MetIle: 1.827 ± 0.394
1.553MetLys: 1.553 ± 0.309
1.005MetLeu: 1.005 ± 0.247
1.096MetMet: 1.096 ± 0.372
1.188MetAsn: 1.188 ± 0.289
0.914MetPro: 0.914 ± 0.294
2.193MetGln: 2.193 ± 0.442
1.188MetArg: 1.188 ± 0.265
2.467MetSer: 2.467 ± 0.361
1.462MetThr: 1.462 ± 0.288
1.462MetVal: 1.462 ± 0.372
0.274MetTrp: 0.274 ± 0.145
0.914MetTyr: 0.914 ± 0.296
0.0MetXaa: 0.0 ± 0.0
Asn
4.569AsnAla: 4.569 ± 0.651
0.274AsnCys: 0.274 ± 0.153
4.295AsnAsp: 4.295 ± 0.561
4.295AsnGlu: 4.295 ± 0.69
2.65AsnPhe: 2.65 ± 0.439
4.66AsnGly: 4.66 ± 0.659
0.274AsnHis: 0.274 ± 0.189
3.198AsnIle: 3.198 ± 0.574
4.477AsnLys: 4.477 ± 0.753
4.295AsnLeu: 4.295 ± 0.786
1.279AsnMet: 1.279 ± 0.264
4.295AsnAsn: 4.295 ± 0.823
2.741AsnPro: 2.741 ± 0.605
2.558AsnGln: 2.558 ± 0.566
2.558AsnArg: 2.558 ± 0.594
2.102AsnSer: 2.102 ± 0.454
3.015AsnThr: 3.015 ± 0.515
3.381AsnVal: 3.381 ± 0.684
0.64AsnTrp: 0.64 ± 0.223
2.376AsnTyr: 2.376 ± 0.565
0.0AsnXaa: 0.0 ± 0.0
Pro
2.284ProAla: 2.284 ± 0.44
0.091ProCys: 0.091 ± 0.09
2.01ProAsp: 2.01 ± 0.446
2.558ProGlu: 2.558 ± 0.585
1.462ProPhe: 1.462 ± 0.402
1.462ProGly: 1.462 ± 0.381
0.365ProHis: 0.365 ± 0.145
1.827ProIle: 1.827 ± 0.359
1.827ProLys: 1.827 ± 0.356
2.467ProLeu: 2.467 ± 0.617
0.457ProMet: 0.457 ± 0.233
1.279ProAsn: 1.279 ± 0.292
0.731ProPro: 0.731 ± 0.222
0.914ProGln: 0.914 ± 0.245
0.822ProArg: 0.822 ± 0.28
1.736ProSer: 1.736 ± 0.335
1.919ProThr: 1.919 ± 0.439
2.467ProVal: 2.467 ± 0.551
0.183ProTrp: 0.183 ± 0.104
1.188ProTyr: 1.188 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
3.929GlnAla: 3.929 ± 0.93
0.091GlnCys: 0.091 ± 0.082
2.284GlnAsp: 2.284 ± 0.565
2.924GlnGlu: 2.924 ± 0.722
1.827GlnPhe: 1.827 ± 0.4
3.472GlnGly: 3.472 ± 0.889
0.64GlnHis: 0.64 ± 0.252
3.015GlnIle: 3.015 ± 0.502
3.838GlnLys: 3.838 ± 0.612
3.838GlnLeu: 3.838 ± 0.546
1.371GlnMet: 1.371 ± 0.337
3.015GlnAsn: 3.015 ± 0.503
0.822GlnPro: 0.822 ± 0.267
2.284GlnGln: 2.284 ± 0.58
1.736GlnArg: 1.736 ± 0.492
2.741GlnSer: 2.741 ± 0.525
1.645GlnThr: 1.645 ± 0.351
2.376GlnVal: 2.376 ± 0.574
0.365GlnTrp: 0.365 ± 0.153
1.371GlnTyr: 1.371 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
2.833ArgAla: 2.833 ± 0.492
0.365ArgCys: 0.365 ± 0.17
2.924ArgAsp: 2.924 ± 0.447
2.102ArgGlu: 2.102 ± 0.429
2.193ArgPhe: 2.193 ± 0.369
2.284ArgGly: 2.284 ± 0.452
0.457ArgHis: 0.457 ± 0.213
3.107ArgIle: 3.107 ± 0.571
3.381ArgLys: 3.381 ± 0.615
3.838ArgLeu: 3.838 ± 0.792
1.096ArgMet: 1.096 ± 0.319
2.741ArgAsn: 2.741 ± 0.629
1.371ArgPro: 1.371 ± 0.262
1.279ArgGln: 1.279 ± 0.304
1.279ArgArg: 1.279 ± 0.378
2.467ArgSer: 2.467 ± 0.504
2.65ArgThr: 2.65 ± 0.542
2.65ArgVal: 2.65 ± 0.402
0.64ArgTrp: 0.64 ± 0.266
1.919ArgTyr: 1.919 ± 0.366
0.0ArgXaa: 0.0 ± 0.0
Ser
4.751SerAla: 4.751 ± 1.296
0.183SerCys: 0.183 ± 0.127
4.66SerAsp: 4.66 ± 0.665
3.929SerGlu: 3.929 ± 0.759
2.924SerPhe: 2.924 ± 0.609
4.203SerGly: 4.203 ± 0.684
0.548SerHis: 0.548 ± 0.197
4.569SerIle: 4.569 ± 0.645
3.929SerLys: 3.929 ± 0.613
6.396SerLeu: 6.396 ± 0.593
1.645SerMet: 1.645 ± 0.471
3.381SerAsn: 3.381 ± 0.59
1.919SerPro: 1.919 ± 0.33
3.564SerGln: 3.564 ± 0.602
2.558SerArg: 2.558 ± 0.38
4.569SerSer: 4.569 ± 0.672
4.569SerThr: 4.569 ± 0.649
4.386SerVal: 4.386 ± 0.888
0.457SerTrp: 0.457 ± 0.193
2.65SerTyr: 2.65 ± 0.756
0.0SerXaa: 0.0 ± 0.0
Thr
4.386ThrAla: 4.386 ± 1.468
0.091ThrCys: 0.091 ± 0.095
3.198ThrAsp: 3.198 ± 0.614
3.198ThrGlu: 3.198 ± 0.63
3.381ThrPhe: 3.381 ± 0.566
4.203ThrGly: 4.203 ± 0.489
1.005ThrHis: 1.005 ± 0.315
5.848ThrIle: 5.848 ± 0.898
3.564ThrLys: 3.564 ± 0.727
4.295ThrLeu: 4.295 ± 0.551
1.005ThrMet: 1.005 ± 0.381
2.284ThrAsn: 2.284 ± 0.507
2.376ThrPro: 2.376 ± 0.508
2.558ThrGln: 2.558 ± 0.469
2.102ThrArg: 2.102 ± 0.42
4.02ThrSer: 4.02 ± 0.595
3.198ThrThr: 3.198 ± 0.501
3.929ThrVal: 3.929 ± 0.755
0.731ThrTrp: 0.731 ± 0.264
2.102ThrTyr: 2.102 ± 0.444
0.0ThrXaa: 0.0 ± 0.0
Val
6.305ValAla: 6.305 ± 1.163
0.183ValCys: 0.183 ± 0.109
4.66ValAsp: 4.66 ± 0.506
4.386ValGlu: 4.386 ± 0.649
2.741ValPhe: 2.741 ± 0.433
4.295ValGly: 4.295 ± 1.123
0.365ValHis: 0.365 ± 0.157
4.569ValIle: 4.569 ± 0.63
4.203ValLys: 4.203 ± 0.696
3.289ValLeu: 3.289 ± 0.463
1.005ValMet: 1.005 ± 0.248
4.02ValAsn: 4.02 ± 0.799
1.279ValPro: 1.279 ± 0.343
2.65ValGln: 2.65 ± 0.653
2.01ValArg: 2.01 ± 0.519
5.574ValSer: 5.574 ± 0.647
3.929ValThr: 3.929 ± 0.54
4.477ValVal: 4.477 ± 0.829
1.279ValTrp: 1.279 ± 0.383
3.015ValTyr: 3.015 ± 0.596
0.0ValXaa: 0.0 ± 0.0
Trp
0.548TrpAla: 0.548 ± 0.236
0.0TrpCys: 0.0 ± 0.0
0.548TrpAsp: 0.548 ± 0.246
0.548TrpGlu: 0.548 ± 0.234
0.457TrpPhe: 0.457 ± 0.22
0.548TrpGly: 0.548 ± 0.201
0.183TrpHis: 0.183 ± 0.123
0.822TrpIle: 0.822 ± 0.28
0.548TrpLys: 0.548 ± 0.213
0.822TrpLeu: 0.822 ± 0.302
0.091TrpMet: 0.091 ± 0.091
0.365TrpAsn: 0.365 ± 0.166
0.365TrpPro: 0.365 ± 0.176
0.457TrpGln: 0.457 ± 0.217
0.64TrpArg: 0.64 ± 0.224
1.005TrpSer: 1.005 ± 0.265
0.365TrpThr: 0.365 ± 0.255
0.64TrpVal: 0.64 ± 0.273
0.183TrpTrp: 0.183 ± 0.123
0.64TrpTyr: 0.64 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.01TyrAla: 2.01 ± 0.572
0.548TyrCys: 0.548 ± 0.336
3.929TyrAsp: 3.929 ± 0.763
2.102TyrGlu: 2.102 ± 0.467
2.102TyrPhe: 2.102 ± 0.619
2.467TyrGly: 2.467 ± 0.506
0.365TyrHis: 0.365 ± 0.135
2.741TyrIle: 2.741 ± 0.448
2.65TyrLys: 2.65 ± 0.541
3.746TyrLeu: 3.746 ± 0.818
0.731TyrMet: 0.731 ± 0.234
2.01TyrAsn: 2.01 ± 0.424
1.096TyrPro: 1.096 ± 0.329
2.01TyrGln: 2.01 ± 0.345
2.01TyrArg: 2.01 ± 0.457
1.827TyrSer: 1.827 ± 0.411
2.284TyrThr: 2.284 ± 0.458
2.467TyrVal: 2.467 ± 0.508
0.183TyrTrp: 0.183 ± 0.133
1.553TyrTyr: 1.553 ± 0.482
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10945 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski