Amino acid dipepetide frequency for Streptococcus phage CHPC1246

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.307AlaAla: 6.307 ± 2.544
0.27AlaCys: 0.27 ± 0.167
5.136AlaAsp: 5.136 ± 0.874
5.136AlaGlu: 5.136 ± 0.837
2.974AlaPhe: 2.974 ± 1.187
5.046AlaGly: 5.046 ± 1.451
0.541AlaHis: 0.541 ± 0.175
5.857AlaIle: 5.857 ± 1.568
5.496AlaLys: 5.496 ± 0.797
7.209AlaLeu: 7.209 ± 0.859
2.793AlaMet: 2.793 ± 1.225
4.595AlaAsn: 4.595 ± 0.736
2.163AlaPro: 2.163 ± 0.51
3.514AlaGln: 3.514 ± 0.935
3.875AlaArg: 3.875 ± 0.713
6.217AlaSer: 6.217 ± 1.781
4.055AlaThr: 4.055 ± 0.828
4.686AlaVal: 4.686 ± 1.028
0.631AlaTrp: 0.631 ± 0.237
2.163AlaTyr: 2.163 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.36CysAla: 0.36 ± 0.182
0.0CysCys: 0.0 ± 0.0
0.451CysAsp: 0.451 ± 0.242
0.811CysGlu: 0.811 ± 0.32
0.09CysPhe: 0.09 ± 0.098
0.27CysGly: 0.27 ± 0.206
0.18CysHis: 0.18 ± 0.114
0.18CysIle: 0.18 ± 0.141
0.721CysLys: 0.721 ± 0.254
0.451CysLeu: 0.451 ± 0.283
0.09CysMet: 0.09 ± 0.088
0.36CysAsn: 0.36 ± 0.199
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.36CysArg: 0.36 ± 0.192
0.541CysSer: 0.541 ± 0.241
0.18CysThr: 0.18 ± 0.134
0.451CysVal: 0.451 ± 0.211
0.09CysTrp: 0.09 ± 0.082
0.36CysTyr: 0.36 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
3.514AspAla: 3.514 ± 0.466
0.541AspCys: 0.541 ± 0.267
4.415AspAsp: 4.415 ± 0.513
4.055AspGlu: 4.055 ± 0.626
3.424AspPhe: 3.424 ± 0.454
5.406AspGly: 5.406 ± 0.796
0.631AspHis: 0.631 ± 0.292
3.424AspIle: 3.424 ± 0.727
4.686AspLys: 4.686 ± 0.81
4.505AspLeu: 4.505 ± 0.793
1.442AspMet: 1.442 ± 0.355
3.424AspAsn: 3.424 ± 0.639
0.811AspPro: 0.811 ± 0.297
0.901AspGln: 0.901 ± 0.283
3.604AspArg: 3.604 ± 0.818
4.145AspSer: 4.145 ± 0.675
3.875AspThr: 3.875 ± 0.711
3.604AspVal: 3.604 ± 0.581
0.991AspTrp: 0.991 ± 0.365
2.793AspTyr: 2.793 ± 0.555
0.0AspXaa: 0.0 ± 0.0
Glu
5.046GluAla: 5.046 ± 0.675
0.27GluCys: 0.27 ± 0.167
2.974GluAsp: 2.974 ± 0.74
4.956GluGlu: 4.956 ± 1.189
2.793GluPhe: 2.793 ± 0.589
3.064GluGly: 3.064 ± 0.583
0.991GluHis: 0.991 ± 0.281
5.406GluIle: 5.406 ± 0.951
5.316GluLys: 5.316 ± 1.214
7.479GluLeu: 7.479 ± 1.225
2.163GluMet: 2.163 ± 0.485
4.956GluAsn: 4.956 ± 0.751
1.802GluPro: 1.802 ± 0.53
3.244GluGln: 3.244 ± 0.582
3.965GluArg: 3.965 ± 0.666
2.523GluSer: 2.523 ± 0.674
3.694GluThr: 3.694 ± 0.723
5.677GluVal: 5.677 ± 0.94
0.901GluTrp: 0.901 ± 0.297
3.604GluTyr: 3.604 ± 0.784
0.0GluXaa: 0.0 ± 0.0
Phe
2.883PheAla: 2.883 ± 0.421
0.18PheCys: 0.18 ± 0.142
2.613PheAsp: 2.613 ± 0.566
4.235PheGlu: 4.235 ± 0.695
0.901PhePhe: 0.901 ± 0.286
3.604PheGly: 3.604 ± 0.625
0.451PheHis: 0.451 ± 0.21
3.154PheIle: 3.154 ± 0.43
5.136PheLys: 5.136 ± 0.608
1.802PheLeu: 1.802 ± 0.585
0.631PheMet: 0.631 ± 0.273
2.613PheAsn: 2.613 ± 0.37
0.451PhePro: 0.451 ± 0.248
1.081PheGln: 1.081 ± 0.285
0.901PheArg: 0.901 ± 0.25
3.514PheSer: 3.514 ± 0.761
2.613PheThr: 2.613 ± 0.532
2.072PheVal: 2.072 ± 0.534
0.721PheTrp: 0.721 ± 0.246
1.261PheTyr: 1.261 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
5.406GlyAla: 5.406 ± 0.898
0.36GlyCys: 0.36 ± 0.184
3.064GlyAsp: 3.064 ± 0.359
2.974GlyGlu: 2.974 ± 0.529
2.523GlyPhe: 2.523 ± 0.502
2.793GlyGly: 2.793 ± 0.436
0.901GlyHis: 0.901 ± 0.396
6.398GlyIle: 6.398 ± 1.827
5.767GlyLys: 5.767 ± 0.789
5.677GlyLeu: 5.677 ± 1.093
1.712GlyMet: 1.712 ± 0.726
2.883GlyAsn: 2.883 ± 0.495
0.451GlyPro: 0.451 ± 0.189
2.974GlyGln: 2.974 ± 0.441
2.793GlyArg: 2.793 ± 0.481
4.415GlySer: 4.415 ± 0.743
5.136GlyThr: 5.136 ± 0.984
4.415GlyVal: 4.415 ± 0.74
0.631GlyTrp: 0.631 ± 0.279
3.514GlyTyr: 3.514 ± 0.601
0.0GlyXaa: 0.0 ± 0.0
His
0.811HisAla: 0.811 ± 0.254
0.09HisCys: 0.09 ± 0.098
0.991HisAsp: 0.991 ± 0.3
0.541HisGlu: 0.541 ± 0.212
0.811HisPhe: 0.811 ± 0.26
0.811HisGly: 0.811 ± 0.275
0.541HisHis: 0.541 ± 0.248
0.901HisIle: 0.901 ± 0.29
0.901HisLys: 0.901 ± 0.269
1.081HisLeu: 1.081 ± 0.314
0.36HisMet: 0.36 ± 0.172
0.811HisAsn: 0.811 ± 0.348
0.09HisPro: 0.09 ± 0.087
0.36HisGln: 0.36 ± 0.172
0.991HisArg: 0.991 ± 0.295
0.991HisSer: 0.991 ± 0.317
1.171HisThr: 1.171 ± 0.355
0.811HisVal: 0.811 ± 0.362
0.27HisTrp: 0.27 ± 0.164
0.631HisTyr: 0.631 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
5.316IleAla: 5.316 ± 1.374
0.541IleCys: 0.541 ± 0.278
4.866IleAsp: 4.866 ± 0.704
5.046IleGlu: 5.046 ± 0.878
1.622IlePhe: 1.622 ± 0.393
5.767IleGly: 5.767 ± 0.98
1.171IleHis: 1.171 ± 0.344
2.974IleIle: 2.974 ± 0.757
6.037IleLys: 6.037 ± 0.734
3.965IleLeu: 3.965 ± 0.722
1.982IleMet: 1.982 ± 0.375
3.784IleAsn: 3.784 ± 0.865
2.163IlePro: 2.163 ± 0.645
3.064IleGln: 3.064 ± 0.631
2.613IleArg: 2.613 ± 0.617
5.136IleSer: 5.136 ± 1.232
4.505IleThr: 4.505 ± 0.91
4.325IleVal: 4.325 ± 0.738
0.541IleTrp: 0.541 ± 0.2
2.793IleTyr: 2.793 ± 0.533
0.0IleXaa: 0.0 ± 0.0
Lys
6.938LysAla: 6.938 ± 0.975
0.451LysCys: 0.451 ± 0.196
4.145LysAsp: 4.145 ± 0.739
7.569LysGlu: 7.569 ± 1.463
2.703LysPhe: 2.703 ± 0.574
4.866LysGly: 4.866 ± 0.524
1.622LysHis: 1.622 ± 0.511
6.217LysIle: 6.217 ± 0.856
6.307LysLys: 6.307 ± 1.286
6.398LysLeu: 6.398 ± 0.839
2.613LysMet: 2.613 ± 0.606
2.883LysAsn: 2.883 ± 0.682
2.974LysPro: 2.974 ± 0.532
3.244LysGln: 3.244 ± 0.667
5.046LysArg: 5.046 ± 0.894
4.776LysSer: 4.776 ± 0.592
5.136LysThr: 5.136 ± 0.797
3.604LysVal: 3.604 ± 0.541
1.171LysTrp: 1.171 ± 0.287
3.965LysTyr: 3.965 ± 0.731
0.0LysXaa: 0.0 ± 0.0
Leu
6.488LeuAla: 6.488 ± 0.859
0.18LeuCys: 0.18 ± 0.108
4.776LeuAsp: 4.776 ± 0.809
7.118LeuGlu: 7.118 ± 1.096
2.974LeuPhe: 2.974 ± 0.457
5.587LeuGly: 5.587 ± 1.174
0.721LeuHis: 0.721 ± 0.256
4.145LeuIle: 4.145 ± 0.568
6.938LeuLys: 6.938 ± 0.986
4.415LeuLeu: 4.415 ± 0.631
2.072LeuMet: 2.072 ± 0.388
5.316LeuAsn: 5.316 ± 0.836
2.163LeuPro: 2.163 ± 0.407
2.433LeuGln: 2.433 ± 0.461
3.154LeuArg: 3.154 ± 0.612
5.677LeuSer: 5.677 ± 0.762
5.046LeuThr: 5.046 ± 0.911
4.055LeuVal: 4.055 ± 0.535
0.27LeuTrp: 0.27 ± 0.208
3.154LeuTyr: 3.154 ± 0.702
0.0LeuXaa: 0.0 ± 0.0
Met
2.703MetAla: 2.703 ± 1.04
0.09MetCys: 0.09 ± 0.098
1.081MetAsp: 1.081 ± 0.277
0.991MetGlu: 0.991 ± 0.288
1.442MetPhe: 1.442 ± 0.292
1.352MetGly: 1.352 ± 0.414
0.18MetHis: 0.18 ± 0.124
1.532MetIle: 1.532 ± 0.351
2.703MetLys: 2.703 ± 0.537
1.892MetLeu: 1.892 ± 0.422
0.811MetMet: 0.811 ± 0.451
1.712MetAsn: 1.712 ± 0.419
0.541MetPro: 0.541 ± 0.24
1.532MetGln: 1.532 ± 0.556
0.901MetArg: 0.901 ± 0.312
2.072MetSer: 2.072 ± 0.584
1.261MetThr: 1.261 ± 0.31
2.253MetVal: 2.253 ± 0.567
0.0MetTrp: 0.0 ± 0.0
0.811MetTyr: 0.811 ± 0.237
0.0MetXaa: 0.0 ± 0.0
Asn
4.235AsnAla: 4.235 ± 0.539
0.18AsnCys: 0.18 ± 0.126
3.784AsnAsp: 3.784 ± 0.738
3.784AsnGlu: 3.784 ± 0.892
3.154AsnPhe: 3.154 ± 0.597
5.226AsnGly: 5.226 ± 0.959
1.171AsnHis: 1.171 ± 0.368
3.334AsnIle: 3.334 ± 0.71
4.776AsnLys: 4.776 ± 0.77
4.145AsnLeu: 4.145 ± 0.673
1.081AsnMet: 1.081 ± 0.276
3.694AsnAsn: 3.694 ± 0.724
1.982AsnPro: 1.982 ± 0.425
2.163AsnGln: 2.163 ± 0.456
1.982AsnArg: 1.982 ± 0.46
4.145AsnSer: 4.145 ± 0.715
3.064AsnThr: 3.064 ± 0.462
2.883AsnVal: 2.883 ± 0.466
0.991AsnTrp: 0.991 ± 0.31
1.532AsnTyr: 1.532 ± 0.449
0.0AsnXaa: 0.0 ± 0.0
Pro
1.622ProAla: 1.622 ± 0.337
0.27ProCys: 0.27 ± 0.196
1.532ProAsp: 1.532 ± 0.51
1.442ProGlu: 1.442 ± 0.399
1.081ProPhe: 1.081 ± 0.329
0.811ProGly: 0.811 ± 0.307
0.451ProHis: 0.451 ± 0.188
1.712ProIle: 1.712 ± 0.378
2.883ProLys: 2.883 ± 0.512
1.622ProLeu: 1.622 ± 0.44
0.0ProMet: 0.0 ± 0.0
1.532ProAsn: 1.532 ± 0.469
1.081ProPro: 1.081 ± 0.225
1.352ProGln: 1.352 ± 0.321
1.532ProArg: 1.532 ± 0.498
1.802ProSer: 1.802 ± 0.332
0.991ProThr: 0.991 ± 0.343
1.712ProVal: 1.712 ± 0.307
0.36ProTrp: 0.36 ± 0.19
1.081ProTyr: 1.081 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
4.235GlnAla: 4.235 ± 0.864
0.18GlnCys: 0.18 ± 0.135
1.982GlnAsp: 1.982 ± 0.444
2.883GlnGlu: 2.883 ± 0.674
1.892GlnPhe: 1.892 ± 0.503
1.712GlnGly: 1.712 ± 0.646
0.811GlnHis: 0.811 ± 0.307
2.343GlnIle: 2.343 ± 0.487
2.793GlnLys: 2.793 ± 0.613
3.875GlnLeu: 3.875 ± 0.528
1.532GlnMet: 1.532 ± 0.272
2.072GlnAsn: 2.072 ± 0.413
0.811GlnPro: 0.811 ± 0.287
1.802GlnGln: 1.802 ± 0.438
1.081GlnArg: 1.081 ± 0.337
2.703GlnSer: 2.703 ± 0.701
2.433GlnThr: 2.433 ± 0.451
2.523GlnVal: 2.523 ± 0.35
0.721GlnTrp: 0.721 ± 0.247
1.352GlnTyr: 1.352 ± 0.394
0.0GlnXaa: 0.0 ± 0.0
Arg
3.875ArgAla: 3.875 ± 0.446
0.721ArgCys: 0.721 ± 0.256
3.154ArgAsp: 3.154 ± 0.816
3.064ArgGlu: 3.064 ± 0.606
1.712ArgPhe: 1.712 ± 0.422
2.523ArgGly: 2.523 ± 0.446
0.631ArgHis: 0.631 ± 0.239
3.424ArgIle: 3.424 ± 0.692
3.694ArgLys: 3.694 ± 0.742
4.415ArgLeu: 4.415 ± 0.534
1.261ArgMet: 1.261 ± 0.353
1.532ArgAsn: 1.532 ± 0.397
1.261ArgPro: 1.261 ± 0.405
1.982ArgGln: 1.982 ± 0.487
1.532ArgArg: 1.532 ± 0.42
2.253ArgSer: 2.253 ± 0.433
1.802ArgThr: 1.802 ± 0.499
2.613ArgVal: 2.613 ± 0.569
0.451ArgTrp: 0.451 ± 0.211
2.343ArgTyr: 2.343 ± 0.533
0.0ArgXaa: 0.0 ± 0.0
Ser
6.488SerAla: 6.488 ± 2.438
0.541SerCys: 0.541 ± 0.229
4.325SerAsp: 4.325 ± 0.762
3.694SerGlu: 3.694 ± 0.732
2.343SerPhe: 2.343 ± 0.432
4.956SerGly: 4.956 ± 0.702
0.631SerHis: 0.631 ± 0.256
5.496SerIle: 5.496 ± 1.066
4.776SerLys: 4.776 ± 0.756
5.136SerLeu: 5.136 ± 0.699
1.352SerMet: 1.352 ± 0.316
3.965SerAsn: 3.965 ± 0.655
1.622SerPro: 1.622 ± 0.393
3.334SerGln: 3.334 ± 1.142
2.613SerArg: 2.613 ± 0.423
3.875SerSer: 3.875 ± 1.025
4.686SerThr: 4.686 ± 1.002
5.226SerVal: 5.226 ± 0.837
0.811SerTrp: 0.811 ± 0.238
2.343SerTyr: 2.343 ± 0.385
0.0SerXaa: 0.0 ± 0.0
Thr
4.595ThrAla: 4.595 ± 1.522
0.27ThrCys: 0.27 ± 0.187
2.703ThrAsp: 2.703 ± 0.584
3.965ThrGlu: 3.965 ± 0.806
3.334ThrPhe: 3.334 ± 0.573
4.145ThrGly: 4.145 ± 0.811
1.081ThrHis: 1.081 ± 0.339
4.595ThrIle: 4.595 ± 0.686
5.857ThrLys: 5.857 ± 0.78
4.325ThrLeu: 4.325 ± 0.728
1.442ThrMet: 1.442 ± 0.948
3.514ThrAsn: 3.514 ± 0.531
1.622ThrPro: 1.622 ± 0.439
2.883ThrGln: 2.883 ± 0.516
2.613ThrArg: 2.613 ± 0.465
3.694ThrSer: 3.694 ± 0.888
3.875ThrThr: 3.875 ± 0.714
4.415ThrVal: 4.415 ± 0.602
0.27ThrTrp: 0.27 ± 0.168
2.253ThrTyr: 2.253 ± 0.565
0.0ThrXaa: 0.0 ± 0.0
Val
3.965ValAla: 3.965 ± 1.119
0.18ValCys: 0.18 ± 0.131
4.595ValAsp: 4.595 ± 0.826
5.496ValGlu: 5.496 ± 1.037
2.523ValPhe: 2.523 ± 0.322
3.424ValGly: 3.424 ± 0.597
0.631ValHis: 0.631 ± 0.248
3.514ValIle: 3.514 ± 0.594
4.686ValLys: 4.686 ± 0.524
4.325ValLeu: 4.325 ± 0.574
1.352ValMet: 1.352 ± 0.29
4.776ValAsn: 4.776 ± 0.811
1.442ValPro: 1.442 ± 0.323
2.253ValGln: 2.253 ± 0.57
1.982ValArg: 1.982 ± 0.389
5.677ValSer: 5.677 ± 0.612
4.866ValThr: 4.866 ± 0.625
4.956ValVal: 4.956 ± 0.629
0.811ValTrp: 0.811 ± 0.249
2.072ValTyr: 2.072 ± 0.561
0.0ValXaa: 0.0 ± 0.0
Trp
0.631TrpAla: 0.631 ± 0.239
0.09TrpCys: 0.09 ± 0.096
0.631TrpAsp: 0.631 ± 0.218
1.081TrpGlu: 1.081 ± 0.341
0.631TrpPhe: 0.631 ± 0.276
0.901TrpGly: 0.901 ± 0.319
0.27TrpHis: 0.27 ± 0.15
0.541TrpIle: 0.541 ± 0.239
0.631TrpLys: 0.631 ± 0.234
0.991TrpLeu: 0.991 ± 0.297
0.09TrpMet: 0.09 ± 0.101
0.541TrpAsn: 0.541 ± 0.219
0.09TrpPro: 0.09 ± 0.087
0.36TrpGln: 0.36 ± 0.164
0.451TrpArg: 0.451 ± 0.212
1.352TrpSer: 1.352 ± 0.497
0.36TrpThr: 0.36 ± 0.22
1.171TrpVal: 1.171 ± 0.306
0.36TrpTrp: 0.36 ± 0.217
0.27TrpTyr: 0.27 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.244TyrAla: 3.244 ± 0.564
0.541TyrCys: 0.541 ± 0.199
2.974TyrAsp: 2.974 ± 0.851
1.982TyrGlu: 1.982 ± 0.483
1.892TyrPhe: 1.892 ± 0.444
2.523TyrGly: 2.523 ± 0.547
0.36TyrHis: 0.36 ± 0.18
3.064TyrIle: 3.064 ± 0.583
2.613TyrLys: 2.613 ± 0.525
2.974TyrLeu: 2.974 ± 0.636
0.991TyrMet: 0.991 ± 0.378
2.343TyrAsn: 2.343 ± 0.527
1.352TyrPro: 1.352 ± 0.356
1.261TyrGln: 1.261 ± 0.323
2.253TyrArg: 2.253 ± 0.637
2.703TyrSer: 2.703 ± 0.595
2.703TyrThr: 2.703 ± 0.645
2.072TyrVal: 2.072 ± 0.323
0.451TyrTrp: 0.451 ± 0.187
2.343TyrTyr: 2.343 ± 0.655
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11099 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski