Amino acid dipeptide frequency for Streptococcus phage IC1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
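A minimal sketch of how such a frequency could be computed from protein sequences. The function name `dipeptide_permille` is hypothetical, sequences are assumed to be in one-letter code (the table uses three-letter names), and normalising by the number of overlapping pairs is an assumption; the database may use a different denominator.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein, and
    the result is normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not real phage sequences.
    print(dipeptide_permille(["AAC", "AC"]))
```

Counting per protein keeps the last residue of one sequence from being paired with the first residue of the next.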

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
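The baseline above can be checked with a few lines of Python. This is only a sketch: the names `expected_permille` and `classify` are hypothetical, and it assumes that "more than expected" means "above the uniform baseline", which may not be exactly how the page assigns colours. The two example values are taken from the table (GluLeu and AlaAla).

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def expected_permille(n_residues=N_STANDARD):
    """Uniform baseline in per mille: every ordered pair equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_permille, n_residues=N_STANDARD):
    """Label a per-mille frequency relative to the uniform baseline."""
    expected = expected_permille(n_residues)
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    print(expected_permille())   # 2.5
    print(classify(9.748))       # GluLeu: over
    print(classify(2.369))       # AlaAla: under
```

With Xaa included the baseline drops to 1000/441, roughly 2.27‰.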

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (value ± error); rows give the first residue, columns the second.
1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.369 ± 0.782 | 0.273 ± 0.193 | 6.013 ± 0.647 | 7.379 ± 0.893 | 2.551 ± 0.774 | 5.375 ± 1.147 | 0.364 ± 0.206 | 4.646 ± 1.003 | 5.739 ± 0.843 | 6.468 ± 0.824 | 2.186 ± 0.437 | 3.553 ± 0.793 | 2.369 ± 0.389 | 2.915 ± 0.517 | 2.915 ± 0.635 | 3.006 ± 1.124 | 4.737 ± 0.661 | 5.375 ± 0.898 | 1.549 ± 0.432 | 1.822 ± 0.376 | 0.0 ± 0.0
Cys | 0.455 ± 0.214 | 0.273 ± 0.183 | 0.364 ± 0.187 | 0.455 ± 0.185 | 0.364 ± 0.173 | 0.547 ± 0.349 | 0.182 ± 0.093 | 0.364 ± 0.275 | 0.729 ± 0.243 | 0.364 ± 0.175 | 0.0 ± 0.0 | 0.182 ± 0.209 | 0.091 ± 0.078 | 0.182 ± 0.12 | 0.455 ± 0.175 | 0.455 ± 0.244 | 0.273 ± 0.181 | 0.0 ± 0.0 | 0.273 ± 0.152 | 0.547 ± 0.209 | 0.0 ± 0.0
Asp | 3.553 ± 0.712 | 0.729 ± 0.32 | 2.824 ± 0.772 | 5.375 ± 1.305 | 3.188 ± 0.595 | 5.193 ± 0.706 | 0.273 ± 0.164 | 5.739 ± 0.506 | 4.828 ± 0.924 | 4.919 ± 0.61 | 1.822 ± 0.457 | 2.915 ± 0.486 | 2.095 ± 0.591 | 1.731 ± 0.451 | 2.551 ± 0.437 | 3.371 ± 0.565 | 3.644 ± 0.557 | 4.099 ± 0.504 | 1.275 ± 0.307 | 3.371 ± 0.563 | 0.0 ± 0.0
Glu | 6.468 ± 1.106 | 0.455 ± 0.198 | 4.373 ± 0.747 | 6.195 ± 1.075 | 4.008 ± 0.773 | 3.735 ± 0.634 | 0.729 ± 0.251 | 6.741 ± 0.667 | 5.921 ± 1.06 | 9.748 ± 1.028 | 1.913 ± 0.535 | 4.646 ± 0.717 | 1.64 ± 0.46 | 3.553 ± 0.65 | 4.373 ± 0.771 | 5.102 ± 0.544 | 3.826 ± 0.612 | 5.739 ± 0.815 | 0.729 ± 0.23 | 2.642 ± 0.563 | 0.0 ± 0.0
Phe | 2.186 ± 0.704 | 0.182 ± 0.156 | 4.555 ± 0.609 | 4.191 ± 0.889 | 1.275 ± 0.409 | 2.369 ± 0.638 | 0.364 ± 0.254 | 1.913 ± 0.431 | 3.553 ± 0.575 | 2.095 ± 0.364 | 1.275 ± 0.455 | 2.642 ± 0.589 | 0.455 ± 0.175 | 1.731 ± 0.401 | 1.002 ± 0.237 | 3.735 ± 0.745 | 2.642 ± 0.425 | 1.458 ± 0.372 | 0.547 ± 0.223 | 1.822 ± 0.444 | 0.0 ± 0.0
Gly | 3.28 ± 0.506 | 0.182 ± 0.114 | 4.099 ± 0.54 | 4.646 ± 0.798 | 2.824 ± 0.64 | 5.648 ± 1.267 | 0.547 ± 0.209 | 3.644 ± 0.688 | 4.646 ± 0.636 | 5.648 ± 1.258 | 2.004 ± 0.407 | 4.008 ± 0.621 | 1.002 ± 0.34 | 4.008 ± 0.459 | 4.099 ± 0.636 | 4.464 ± 0.748 | 3.462 ± 0.549 | 4.464 ± 0.704 | 1.184 ± 0.474 | 3.371 ± 0.66 | 0.0 ± 0.0
His | 0.638 ± 0.217 | 0.0 ± 0.0 | 0.82 ± 0.346 | 1.093 ± 0.296 | 0.911 ± 0.215 | 0.82 ± 0.261 | 0.091 ± 0.097 | 0.547 ± 0.29 | 0.638 ± 0.222 | 0.911 ± 0.277 | 0.182 ± 0.125 | 1.002 ± 0.294 | 0.455 ± 0.144 | 0.547 ± 0.241 | 0.364 ± 0.16 | 0.911 ± 0.426 | 0.364 ± 0.182 | 0.82 ± 0.233 | 0.364 ± 0.21 | 0.273 ± 0.171 | 0.0 ± 0.0
Ile | 5.102 ± 0.826 | 0.455 ± 0.195 | 2.915 ± 0.534 | 5.83 ± 0.752 | 2.642 ± 0.581 | 5.01 ± 0.812 | 0.547 ± 0.261 | 3.371 ± 0.519 | 6.013 ± 0.533 | 3.735 ± 0.511 | 1.458 ± 0.437 | 3.735 ± 0.562 | 1.458 ± 0.383 | 2.277 ± 0.337 | 3.644 ± 0.662 | 5.375 ± 0.937 | 4.555 ± 0.561 | 2.915 ± 0.68 | 0.638 ± 0.236 | 1.731 ± 0.59 | 0.0 ± 0.0
Lys | 5.648 ± 0.876 | 0.364 ± 0.198 | 5.648 ± 0.696 | 6.832 ± 0.843 | 2.369 ± 0.449 | 4.099 ± 0.48 | 0.911 ± 0.205 | 5.193 ± 0.765 | 6.013 ± 0.772 | 6.832 ± 0.769 | 2.551 ± 0.429 | 5.01 ± 0.534 | 2.46 ± 0.551 | 3.462 ± 0.756 | 3.371 ± 0.407 | 4.555 ± 0.654 | 4.646 ± 0.522 | 6.377 ± 0.759 | 1.184 ± 0.43 | 3.097 ± 0.443 | 0.0 ± 0.0
Leu | 6.741 ± 1.0 | 0.638 ± 0.304 | 6.195 ± 0.971 | 7.926 ± 0.892 | 2.824 ± 0.421 | 6.104 ± 1.422 | 1.093 ± 0.281 | 3.28 ± 0.395 | 6.559 ± 0.828 | 6.468 ± 0.945 | 1.731 ± 0.406 | 4.373 ± 0.708 | 2.551 ± 0.691 | 2.642 ± 0.672 | 4.646 ± 0.77 | 5.284 ± 1.05 | 4.282 ± 0.818 | 3.826 ± 0.492 | 0.638 ± 0.268 | 2.186 ± 0.318 | 0.0 ± 0.0
Met | 2.277 ± 0.586 | 0.0 ± 0.0 | 1.366 ± 0.277 | 1.458 ± 0.35 | 0.911 ± 0.228 | 1.366 ± 0.449 | 0.273 ± 0.181 | 1.913 ± 0.454 | 2.095 ± 0.538 | 1.64 ± 0.365 | 0.455 ± 0.245 | 1.275 ± 0.472 | 0.82 ± 0.263 | 0.911 ± 0.373 | 0.911 ± 0.273 | 1.275 ± 0.367 | 1.913 ± 0.44 | 1.275 ± 0.304 | 0.182 ± 0.136 | 0.638 ± 0.198 | 0.0 ± 0.0
Asn | 4.919 ± 0.924 | 0.364 ± 0.174 | 2.915 ± 0.434 | 3.917 ± 0.674 | 1.731 ± 0.405 | 4.555 ± 0.71 | 1.184 ± 0.339 | 3.188 ± 0.427 | 4.191 ± 0.455 | 4.099 ± 0.802 | 1.093 ± 0.313 | 1.64 ± 0.447 | 2.186 ± 0.462 | 2.733 ± 0.741 | 2.551 ± 0.542 | 3.097 ± 0.736 | 3.006 ± 0.593 | 3.553 ± 0.546 | 1.002 ± 0.199 | 1.549 ± 0.376 | 0.0 ± 0.0
Pro | 2.004 ± 0.419 | 0.182 ± 0.142 | 2.46 ± 0.58 | 3.188 ± 0.451 | 1.458 ± 0.551 | 1.093 ± 0.319 | 0.364 ± 0.157 | 1.093 ± 0.477 | 2.369 ± 0.418 | 1.275 ± 0.355 | 0.273 ± 0.137 | 1.366 ± 0.437 | 0.82 ± 0.261 | 1.093 ± 0.41 | 1.458 ± 0.318 | 0.82 ± 0.293 | 0.911 ± 0.273 | 1.822 ± 0.367 | 0.547 ± 0.259 | 1.184 ± 0.46 | 0.0 ± 0.0
Gln | 3.735 ± 0.588 | 0.273 ± 0.173 | 2.46 ± 0.412 | 3.917 ± 0.86 | 1.275 ± 0.357 | 2.186 ± 0.404 | 0.091 ± 0.111 | 3.644 ± 0.54 | 3.735 ± 0.542 | 3.006 ± 0.451 | 0.547 ± 0.171 | 2.004 ± 0.383 | 0.911 ± 0.319 | 2.004 ± 0.524 | 2.46 ± 0.441 | 2.915 ± 0.424 | 2.915 ± 0.462 | 3.644 ± 0.554 | 0.547 ± 0.181 | 0.911 ± 0.262 | 0.0 ± 0.0
Arg | 3.644 ± 0.516 | 0.455 ± 0.165 | 2.46 ± 0.483 | 3.188 ± 0.654 | 1.549 ± 0.421 | 2.004 ± 0.556 | 0.729 ± 0.237 | 3.097 ± 0.514 | 3.644 ± 0.738 | 4.737 ± 0.765 | 1.731 ± 0.424 | 2.824 ± 0.543 | 0.729 ± 0.206 | 3.553 ± 0.593 | 2.277 ± 0.628 | 2.277 ± 0.396 | 2.915 ± 0.786 | 2.642 ± 0.423 | 0.364 ± 0.169 | 1.731 ± 0.449 | 0.0 ± 0.0
Ser | 5.102 ± 0.989 | 0.273 ± 0.152 | 3.826 ± 0.585 | 4.555 ± 0.728 | 1.913 ± 0.432 | 5.193 ± 0.866 | 1.184 ± 0.35 | 4.099 ± 0.637 | 4.828 ± 0.708 | 5.193 ± 0.652 | 1.093 ± 0.395 | 2.915 ± 0.512 | 1.64 ± 0.321 | 2.46 ± 0.427 | 3.371 ± 0.644 | 4.099 ± 0.924 | 4.464 ± 0.635 | 3.462 ± 0.882 | 1.002 ± 0.403 | 2.46 ± 0.558 | 0.0 ± 0.0
Thr | 5.557 ± 1.11 | 0.182 ± 0.139 | 4.373 ± 0.758 | 3.644 ± 0.475 | 3.097 ± 0.656 | 4.282 ± 0.751 | 1.002 ± 0.414 | 4.555 ± 0.723 | 4.191 ± 0.686 | 4.919 ± 0.727 | 0.638 ± 0.251 | 3.006 ± 0.512 | 1.002 ± 0.35 | 3.006 ± 0.86 | 1.275 ± 0.349 | 4.008 ± 0.626 | 4.099 ± 0.866 | 4.919 ± 0.707 | 0.911 ± 0.307 | 2.46 ± 0.57 | 0.0 ± 0.0
Val | 4.737 ± 0.619 | 0.364 ± 0.212 | 3.097 ± 0.503 | 5.284 ± 0.649 | 2.46 ± 0.508 | 4.919 ± 0.799 | 0.911 ± 0.337 | 3.097 ± 0.51 | 5.921 ± 0.713 | 4.191 ± 0.683 | 0.911 ± 0.293 | 3.735 ± 0.881 | 1.64 ± 0.33 | 1.913 ± 0.403 | 2.824 ± 0.374 | 5.01 ± 0.586 | 5.557 ± 0.607 | 4.919 ± 0.924 | 1.002 ± 0.341 | 2.824 ± 0.699 | 0.0 ± 0.0
Trp | 1.275 ± 0.361 | 0.182 ± 0.11 | 0.729 ± 0.296 | 0.638 ± 0.295 | 1.275 ± 0.532 | 0.729 ± 0.226 | 0.091 ± 0.082 | 0.638 ± 0.258 | 1.275 ± 0.384 | 0.82 ± 0.435 | 0.547 ± 0.227 | 1.002 ± 0.315 | 0.091 ± 0.097 | 0.729 ± 0.323 | 0.547 ± 0.275 | 0.82 ± 0.296 | 1.002 ± 0.293 | 1.093 ± 0.283 | 0.182 ± 0.099 | 0.729 ± 0.613 | 0.0 ± 0.0
Tyr | 1.913 ± 0.367 | 0.729 ± 0.302 | 2.004 ± 0.394 | 2.824 ± 0.544 | 1.366 ± 0.443 | 2.004 ± 0.364 | 0.82 ± 0.231 | 2.642 ± 0.442 | 3.735 ± 0.684 | 2.915 ± 0.584 | 0.455 ± 0.253 | 1.64 ± 0.314 | 1.366 ± 0.366 | 1.822 ± 0.43 | 1.458 ± 0.434 | 2.551 ± 0.529 | 1.913 ± 0.329 | 2.915 ± 0.586 | 0.273 ± 0.138 | 1.549 ± 0.586 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 49 proteins (10978 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
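A rough illustration of what a protein-level bootstrap could look like, under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates. The function names and the choice of standard deviation are hypothetical; the page does not say which spread statistic was used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, same sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAL", "MAAKL", "MKLLA"]  # toy sequences, not the real proteome
    print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.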

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski