Amino acid dipepetide frequency for Streptococcus phage CHPC1148

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.245AlaAla: 2.245 ± 0.726
0.416AlaCys: 0.416 ± 0.202
4.241AlaAsp: 4.241 ± 0.614
3.576AlaGlu: 3.576 ± 0.538
2.328AlaPhe: 2.328 ± 0.595
4.906AlaGly: 4.906 ± 1.021
0.665AlaHis: 0.665 ± 0.226
4.573AlaIle: 4.573 ± 0.684
5.821AlaLys: 5.821 ± 1.071
5.821AlaLeu: 5.821 ± 0.686
1.663AlaMet: 1.663 ± 0.402
3.825AlaAsn: 3.825 ± 0.696
1.33AlaPro: 1.33 ± 0.329
1.58AlaGln: 1.58 ± 0.421
2.411AlaArg: 2.411 ± 0.486
4.906AlaSer: 4.906 ± 0.901
4.158AlaThr: 4.158 ± 0.654
4.241AlaVal: 4.241 ± 0.677
0.832AlaTrp: 0.832 ± 0.203
1.996AlaTyr: 1.996 ± 0.374
0.0AlaXaa: 0.0 ± 0.0
Cys
0.166CysAla: 0.166 ± 0.132
0.0CysCys: 0.0 ± 0.0
0.665CysAsp: 0.665 ± 0.229
0.416CysGlu: 0.416 ± 0.18
0.166CysPhe: 0.166 ± 0.111
0.249CysGly: 0.249 ± 0.152
0.0CysHis: 0.0 ± 0.0
0.249CysIle: 0.249 ± 0.211
0.998CysLys: 0.998 ± 0.298
0.665CysLeu: 0.665 ± 0.26
0.083CysMet: 0.083 ± 0.093
0.333CysAsn: 0.333 ± 0.151
0.249CysPro: 0.249 ± 0.14
0.166CysGln: 0.166 ± 0.131
0.333CysArg: 0.333 ± 0.166
0.249CysSer: 0.249 ± 0.126
0.333CysThr: 0.333 ± 0.172
0.333CysVal: 0.333 ± 0.162
0.249CysTrp: 0.249 ± 0.149
0.665CysTyr: 0.665 ± 0.249
0.0CysXaa: 0.0 ± 0.0
Asp
3.742AspAla: 3.742 ± 0.499
0.582AspCys: 0.582 ± 0.229
3.576AspAsp: 3.576 ± 0.522
4.657AspGlu: 4.657 ± 0.719
2.994AspPhe: 2.994 ± 0.434
5.654AspGly: 5.654 ± 0.861
0.998AspHis: 0.998 ± 0.293
5.322AspIle: 5.322 ± 0.733
5.904AspLys: 5.904 ± 0.818
4.657AspLeu: 4.657 ± 0.769
1.58AspMet: 1.58 ± 0.334
3.16AspAsn: 3.16 ± 0.683
1.663AspPro: 1.663 ± 0.343
1.414AspGln: 1.414 ± 0.284
2.162AspArg: 2.162 ± 0.447
3.409AspSer: 3.409 ± 0.461
3.742AspThr: 3.742 ± 0.596
3.825AspVal: 3.825 ± 0.657
1.58AspTrp: 1.58 ± 0.35
2.411AspTyr: 2.411 ± 0.405
0.0AspXaa: 0.0 ± 0.0
Glu
3.825GluAla: 3.825 ± 0.635
0.416GluCys: 0.416 ± 0.233
3.742GluAsp: 3.742 ± 0.478
4.573GluGlu: 4.573 ± 0.902
3.077GluPhe: 3.077 ± 0.632
3.243GluGly: 3.243 ± 0.444
1.081GluHis: 1.081 ± 0.337
6.403GluIle: 6.403 ± 0.77
4.407GluLys: 4.407 ± 0.925
6.985GluLeu: 6.985 ± 0.646
1.996GluMet: 1.996 ± 0.416
3.742GluAsn: 3.742 ± 0.558
1.996GluPro: 1.996 ± 0.474
3.16GluGln: 3.16 ± 0.585
3.991GluArg: 3.991 ± 0.662
3.659GluSer: 3.659 ± 0.487
3.16GluThr: 3.16 ± 0.502
4.657GluVal: 4.657 ± 0.492
1.33GluTrp: 1.33 ± 0.321
2.827GluTyr: 2.827 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
3.243PheAla: 3.243 ± 0.453
0.499PheCys: 0.499 ± 0.265
3.659PheAsp: 3.659 ± 0.544
2.661PheGlu: 2.661 ± 0.428
1.58PhePhe: 1.58 ± 0.326
3.659PheGly: 3.659 ± 0.692
0.333PheHis: 0.333 ± 0.172
2.827PheIle: 2.827 ± 0.508
4.241PheLys: 4.241 ± 0.738
3.16PheLeu: 3.16 ± 0.602
0.499PheMet: 0.499 ± 0.167
2.744PheAsn: 2.744 ± 0.698
0.499PhePro: 0.499 ± 0.305
0.915PheGln: 0.915 ± 0.26
1.497PheArg: 1.497 ± 0.381
3.742PheSer: 3.742 ± 0.512
2.495PheThr: 2.495 ± 0.5
3.326PheVal: 3.326 ± 0.507
0.582PheTrp: 0.582 ± 0.203
1.996PheTyr: 1.996 ± 0.531
0.0PheXaa: 0.0 ± 0.0
Gly
3.825GlyAla: 3.825 ± 0.716
0.333GlyCys: 0.333 ± 0.198
4.324GlyAsp: 4.324 ± 0.916
3.742GlyGlu: 3.742 ± 0.719
3.742GlyPhe: 3.742 ± 0.58
4.241GlyGly: 4.241 ± 0.754
0.998GlyHis: 0.998 ± 0.323
5.322GlyIle: 5.322 ± 0.766
5.155GlyLys: 5.155 ± 0.659
5.405GlyLeu: 5.405 ± 0.59
2.328GlyMet: 2.328 ± 0.548
3.825GlyAsn: 3.825 ± 0.687
1.081GlyPro: 1.081 ± 0.371
2.744GlyGln: 2.744 ± 0.776
2.91GlyArg: 2.91 ± 0.542
4.324GlySer: 4.324 ± 0.877
3.991GlyThr: 3.991 ± 0.552
3.077GlyVal: 3.077 ± 0.552
1.247GlyTrp: 1.247 ± 0.3
3.16GlyTyr: 3.16 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
0.249HisAla: 0.249 ± 0.134
0.166HisCys: 0.166 ± 0.122
0.832HisAsp: 0.832 ± 0.295
0.665HisGlu: 0.665 ± 0.355
0.665HisPhe: 0.665 ± 0.27
0.748HisGly: 0.748 ± 0.259
0.499HisHis: 0.499 ± 0.172
0.832HisIle: 0.832 ± 0.25
1.58HisLys: 1.58 ± 0.435
0.998HisLeu: 0.998 ± 0.273
0.333HisMet: 0.333 ± 0.144
1.081HisAsn: 1.081 ± 0.362
0.499HisPro: 0.499 ± 0.213
0.499HisGln: 0.499 ± 0.221
0.665HisArg: 0.665 ± 0.189
0.748HisSer: 0.748 ± 0.242
0.499HisThr: 0.499 ± 0.25
1.164HisVal: 1.164 ± 0.28
0.333HisTrp: 0.333 ± 0.16
1.33HisTyr: 1.33 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
4.74IleAla: 4.74 ± 0.718
0.416IleCys: 0.416 ± 0.219
6.403IleAsp: 6.403 ± 0.905
5.155IleGlu: 5.155 ± 0.851
2.079IlePhe: 2.079 ± 0.444
3.991IleGly: 3.991 ± 0.54
0.665IleHis: 0.665 ± 0.214
3.492IleIle: 3.492 ± 0.703
6.32IleLys: 6.32 ± 0.64
4.657IleLeu: 4.657 ± 0.744
1.746IleMet: 1.746 ± 0.409
4.49IleAsn: 4.49 ± 0.464
3.492IlePro: 3.492 ± 0.545
2.744IleGln: 2.744 ± 0.524
2.827IleArg: 2.827 ± 0.505
4.49IleSer: 4.49 ± 0.596
4.407IleThr: 4.407 ± 0.599
3.326IleVal: 3.326 ± 0.65
1.33IleTrp: 1.33 ± 0.3
2.411IleTyr: 2.411 ± 0.453
0.0IleXaa: 0.0 ± 0.0
Lys
5.488LysAla: 5.488 ± 0.641
0.166LysCys: 0.166 ± 0.118
4.49LysAsp: 4.49 ± 0.663
6.902LysGlu: 6.902 ± 0.757
3.742LysPhe: 3.742 ± 0.589
5.072LysGly: 5.072 ± 0.583
1.58LysHis: 1.58 ± 0.497
6.153LysIle: 6.153 ± 0.659
7.983LysLys: 7.983 ± 1.201
7.567LysLeu: 7.567 ± 0.706
1.829LysMet: 1.829 ± 0.512
6.32LysAsn: 6.32 ± 0.869
2.328LysPro: 2.328 ± 0.376
3.576LysGln: 3.576 ± 0.498
4.075LysArg: 4.075 ± 0.491
5.571LysSer: 5.571 ± 0.605
5.322LysThr: 5.322 ± 0.724
4.075LysVal: 4.075 ± 0.633
1.33LysTrp: 1.33 ± 0.293
3.243LysTyr: 3.243 ± 0.592
0.0LysXaa: 0.0 ± 0.0
Leu
6.153LeuAla: 6.153 ± 1.009
0.499LeuCys: 0.499 ± 0.171
5.654LeuAsp: 5.654 ± 0.811
5.738LeuGlu: 5.738 ± 0.787
3.326LeuPhe: 3.326 ± 0.452
4.573LeuGly: 4.573 ± 0.848
0.998LeuHis: 0.998 ± 0.264
4.407LeuIle: 4.407 ± 0.638
6.902LeuLys: 6.902 ± 0.708
5.654LeuLeu: 5.654 ± 0.628
2.661LeuMet: 2.661 ± 0.387
5.405LeuAsn: 5.405 ± 0.669
3.077LeuPro: 3.077 ± 0.474
2.744LeuGln: 2.744 ± 0.595
3.243LeuArg: 3.243 ± 0.587
5.654LeuSer: 5.654 ± 0.69
6.32LeuThr: 6.32 ± 0.848
4.075LeuVal: 4.075 ± 0.59
0.665LeuTrp: 0.665 ± 0.222
2.162LeuTyr: 2.162 ± 0.351
0.0LeuXaa: 0.0 ± 0.0
Met
1.829MetAla: 1.829 ± 0.454
0.083MetCys: 0.083 ± 0.093
0.832MetAsp: 0.832 ± 0.273
1.829MetGlu: 1.829 ± 0.387
1.081MetPhe: 1.081 ± 0.324
1.081MetGly: 1.081 ± 0.362
0.166MetHis: 0.166 ± 0.139
1.746MetIle: 1.746 ± 0.404
2.91MetLys: 2.91 ± 0.542
1.58MetLeu: 1.58 ± 0.296
0.416MetMet: 0.416 ± 0.204
0.665MetAsn: 0.665 ± 0.214
0.915MetPro: 0.915 ± 0.223
0.915MetGln: 0.915 ± 0.298
0.832MetArg: 0.832 ± 0.232
1.996MetSer: 1.996 ± 0.402
1.58MetThr: 1.58 ± 0.343
1.829MetVal: 1.829 ± 0.438
0.083MetTrp: 0.083 ± 0.074
0.915MetTyr: 0.915 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
3.659AsnAla: 3.659 ± 0.916
0.249AsnCys: 0.249 ± 0.188
2.994AsnAsp: 2.994 ± 0.633
3.742AsnGlu: 3.742 ± 0.657
2.661AsnPhe: 2.661 ± 0.426
6.652AsnGly: 6.652 ± 0.978
1.33AsnHis: 1.33 ± 0.275
3.908AsnIle: 3.908 ± 0.593
4.075AsnLys: 4.075 ± 0.496
4.241AsnLeu: 4.241 ± 0.531
0.915AsnMet: 0.915 ± 0.253
3.908AsnAsn: 3.908 ± 0.739
2.994AsnPro: 2.994 ± 0.547
2.578AsnGln: 2.578 ± 0.456
2.744AsnArg: 2.744 ± 0.487
3.991AsnSer: 3.991 ± 0.583
3.576AsnThr: 3.576 ± 0.703
3.409AsnVal: 3.409 ± 0.473
1.33AsnTrp: 1.33 ± 0.357
1.829AsnTyr: 1.829 ± 0.354
0.0AsnXaa: 0.0 ± 0.0
Pro
1.58ProAla: 1.58 ± 0.326
0.166ProCys: 0.166 ± 0.196
1.497ProAsp: 1.497 ± 0.341
2.245ProGlu: 2.245 ± 0.535
1.414ProPhe: 1.414 ± 0.366
1.247ProGly: 1.247 ± 0.32
0.499ProHis: 0.499 ± 0.194
1.829ProIle: 1.829 ± 0.384
3.742ProLys: 3.742 ± 0.704
2.162ProLeu: 2.162 ± 0.422
0.582ProMet: 0.582 ± 0.235
2.328ProAsn: 2.328 ± 0.403
0.665ProPro: 0.665 ± 0.241
1.247ProGln: 1.247 ± 0.27
1.247ProArg: 1.247 ± 0.374
2.744ProSer: 2.744 ± 0.535
2.079ProThr: 2.079 ± 0.321
1.414ProVal: 1.414 ± 0.406
0.333ProTrp: 0.333 ± 0.152
1.081ProTyr: 1.081 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
2.91GlnAla: 2.91 ± 0.397
0.333GlnCys: 0.333 ± 0.15
1.414GlnAsp: 1.414 ± 0.373
2.245GlnGlu: 2.245 ± 0.453
1.58GlnPhe: 1.58 ± 0.327
2.578GlnGly: 2.578 ± 0.548
0.249GlnHis: 0.249 ± 0.176
2.245GlnIle: 2.245 ± 0.462
2.411GlnLys: 2.411 ± 0.403
3.991GlnLeu: 3.991 ± 0.415
0.832GlnMet: 0.832 ± 0.209
2.411GlnAsn: 2.411 ± 0.454
0.665GlnPro: 0.665 ± 0.255
2.162GlnGln: 2.162 ± 0.389
1.414GlnArg: 1.414 ± 0.3
2.495GlnSer: 2.495 ± 0.436
2.578GlnThr: 2.578 ± 0.443
2.328GlnVal: 2.328 ± 0.565
0.665GlnTrp: 0.665 ± 0.338
1.829GlnTyr: 1.829 ± 0.365
0.0GlnXaa: 0.0 ± 0.0
Arg
2.245ArgAla: 2.245 ± 0.392
0.249ArgCys: 0.249 ± 0.135
2.245ArgAsp: 2.245 ± 0.5
3.492ArgGlu: 3.492 ± 0.484
2.328ArgPhe: 2.328 ± 0.527
2.578ArgGly: 2.578 ± 0.611
0.748ArgHis: 0.748 ± 0.254
2.578ArgIle: 2.578 ± 0.409
3.825ArgLys: 3.825 ± 0.613
3.908ArgLeu: 3.908 ± 0.499
0.748ArgMet: 0.748 ± 0.243
2.079ArgAsn: 2.079 ± 0.348
1.33ArgPro: 1.33 ± 0.312
1.497ArgGln: 1.497 ± 0.359
1.164ArgArg: 1.164 ± 0.336
2.328ArgSer: 2.328 ± 0.44
2.661ArgThr: 2.661 ± 0.614
3.077ArgVal: 3.077 ± 0.484
0.832ArgTrp: 0.832 ± 0.221
2.162ArgTyr: 2.162 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
3.409SerAla: 3.409 ± 0.588
0.582SerCys: 0.582 ± 0.194
4.158SerAsp: 4.158 ± 0.493
5.072SerGlu: 5.072 ± 0.524
3.16SerPhe: 3.16 ± 0.567
4.324SerGly: 4.324 ± 0.612
0.748SerHis: 0.748 ± 0.251
4.241SerIle: 4.241 ± 0.503
5.904SerLys: 5.904 ± 1.204
5.072SerLeu: 5.072 ± 0.612
2.079SerMet: 2.079 ± 0.345
3.742SerAsn: 3.742 ± 0.576
1.913SerPro: 1.913 ± 0.405
2.661SerGln: 2.661 ± 0.576
2.245SerArg: 2.245 ± 0.472
3.576SerSer: 3.576 ± 0.649
4.573SerThr: 4.573 ± 0.455
5.405SerVal: 5.405 ± 0.63
1.247SerTrp: 1.247 ± 0.568
2.495SerTyr: 2.495 ± 0.414
0.0SerXaa: 0.0 ± 0.0
Thr
4.49ThrAla: 4.49 ± 0.796
0.249ThrCys: 0.249 ± 0.135
3.576ThrAsp: 3.576 ± 0.606
3.576ThrGlu: 3.576 ± 0.547
3.326ThrPhe: 3.326 ± 0.503
3.825ThrGly: 3.825 ± 0.506
1.164ThrHis: 1.164 ± 0.401
4.989ThrIle: 4.989 ± 0.7
4.823ThrLys: 4.823 ± 0.624
5.821ThrLeu: 5.821 ± 0.699
1.247ThrMet: 1.247 ± 0.369
4.158ThrAsn: 4.158 ± 0.811
1.829ThrPro: 1.829 ± 0.393
1.829ThrGln: 1.829 ± 0.409
2.162ThrArg: 2.162 ± 0.41
3.742ThrSer: 3.742 ± 0.664
3.077ThrThr: 3.077 ± 0.736
4.407ThrVal: 4.407 ± 0.639
1.164ThrTrp: 1.164 ± 0.403
2.91ThrTyr: 2.91 ± 0.526
0.0ThrXaa: 0.0 ± 0.0
Val
4.158ValAla: 4.158 ± 0.659
0.499ValCys: 0.499 ± 0.19
5.072ValAsp: 5.072 ± 0.568
4.158ValGlu: 4.158 ± 0.594
2.328ValPhe: 2.328 ± 0.53
4.573ValGly: 4.573 ± 0.521
0.333ValHis: 0.333 ± 0.18
4.823ValIle: 4.823 ± 0.545
5.654ValLys: 5.654 ± 0.581
3.243ValLeu: 3.243 ± 0.52
1.081ValMet: 1.081 ± 0.32
3.991ValAsn: 3.991 ± 0.664
2.245ValPro: 2.245 ± 0.494
1.58ValGln: 1.58 ± 0.382
2.328ValArg: 2.328 ± 0.523
5.155ValSer: 5.155 ± 0.695
4.158ValThr: 4.158 ± 0.718
3.326ValVal: 3.326 ± 0.534
0.998ValTrp: 0.998 ± 0.316
1.913ValTyr: 1.913 ± 0.349
0.0ValXaa: 0.0 ± 0.0
Trp
0.832TrpAla: 0.832 ± 0.25
0.083TrpCys: 0.083 ± 0.071
1.247TrpAsp: 1.247 ± 0.472
1.164TrpGlu: 1.164 ± 0.286
0.832TrpPhe: 0.832 ± 0.274
0.582TrpGly: 0.582 ± 0.174
0.499TrpHis: 0.499 ± 0.247
0.832TrpIle: 0.832 ± 0.223
0.915TrpLys: 0.915 ± 0.228
1.33TrpLeu: 1.33 ± 0.349
0.083TrpMet: 0.083 ± 0.08
1.081TrpAsn: 1.081 ± 0.278
0.166TrpPro: 0.166 ± 0.106
0.665TrpGln: 0.665 ± 0.292
0.998TrpArg: 0.998 ± 0.301
1.58TrpSer: 1.58 ± 0.623
1.663TrpThr: 1.663 ± 0.35
1.414TrpVal: 1.414 ± 0.284
0.499TrpTrp: 0.499 ± 0.244
0.249TrpTyr: 0.249 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.744TyrAla: 2.744 ± 0.439
0.582TyrCys: 0.582 ± 0.25
2.411TyrAsp: 2.411 ± 0.448
2.661TyrGlu: 2.661 ± 0.562
1.746TyrPhe: 1.746 ± 0.278
1.996TyrGly: 1.996 ± 0.568
0.832TyrHis: 0.832 ± 0.264
2.411TyrIle: 2.411 ± 0.425
3.243TyrLys: 3.243 ± 0.559
3.077TyrLeu: 3.077 ± 0.387
0.499TyrMet: 0.499 ± 0.189
1.497TyrAsn: 1.497 ± 0.338
1.164TyrPro: 1.164 ± 0.312
2.661TyrGln: 2.661 ± 0.446
2.827TyrArg: 2.827 ± 0.387
2.245TyrSer: 2.245 ± 0.485
1.996TyrThr: 1.996 ± 0.442
2.91TyrVal: 2.91 ± 0.517
0.083TyrTrp: 0.083 ± 0.079
1.913TyrTyr: 1.913 ± 0.532
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12027 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski