Amino acid dipeptide frequency for Streptococcus phage phiSS12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
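As an illustration of what a dipeptide frequency is, here is a minimal sketch in Python. It assumes protein sequences are given as one-letter strings, that overlapping adjacent pairs are counted within each protein (never across protein boundaries), and that counts are normalized by the total number of dipeptides. The page does not state the exact normalization used, so the function name `dipeptide_frequencies` and these choices are assumptions, not the database's actual procedure.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides observed in all proteins together.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so no pair spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MKAAL", "AAK"]
toy_freqs = dipeptide_frequencies(toy_proteins)
```

For the toy input there are six dipeptides in total (MK, KA, AA, AL, AA, AK), so `toy_freqs["AA"]` is two sixths expressed in per mille.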

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides were distributed uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
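The uniform expectation and the over/under labelling can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the simple comparison against the uniform expectation (ignoring the error margin) is an assumption, since the page does not say exactly how the red and blue marks are assigned.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected=None):
    """Label a frequency relative to the uniform expectation.

    Assumption: a plain threshold comparison, without using the
    reported error estimate.
    """
    if expected is None:
        expected = uniform_expectation_permille()
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


expected_20 = uniform_expectation_permille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 if Xaa is included

# Two values taken from the table: AlaLys (6.984) and AlaCys (0.181).
label_ala_lys = classify(6.984)
label_ala_cys = classify(0.181)
```

With 20 amino acids the expectation is 2.5‰, so AlaLys at 6.984‰ falls above it and AlaCys at 0.181‰ falls below it.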

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.993 ± 0.629
AlaCys: 0.181 ± 0.117
AlaAsp: 5.624 ± 0.73
AlaGlu: 5.805 ± 0.895
AlaPhe: 2.177 ± 0.542
AlaGly: 5.17 ± 0.935
AlaHis: 0.454 ± 0.194
AlaIle: 5.986 ± 0.552
AlaLys: 6.984 ± 0.731
AlaLeu: 5.896 ± 0.592
AlaMet: 2.268 ± 0.658
AlaAsn: 4.172 ± 0.581
AlaPro: 1.995 ± 0.48
AlaGln: 3.628 ± 0.444
AlaArg: 3.356 ± 0.662
AlaSer: 3.447 ± 0.638
AlaThr: 5.624 ± 0.567
AlaVal: 5.079 ± 0.867
AlaTrp: 0.544 ± 0.261
AlaTyr: 2.812 ± 0.492
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.363 ± 0.195
CysCys: 0.091 ± 0.078
CysAsp: 0.454 ± 0.244
CysGlu: 0.363 ± 0.168
CysPhe: 0.0 ± 0.0
CysGly: 0.454 ± 0.199
CysHis: 0.181 ± 0.112
CysIle: 0.272 ± 0.183
CysLys: 0.726 ± 0.296
CysLeu: 0.454 ± 0.224
CysMet: 0.181 ± 0.135
CysAsn: 0.181 ± 0.117
CysPro: 0.272 ± 0.157
CysGln: 0.181 ± 0.153
CysArg: 0.272 ± 0.198
CysSer: 0.363 ± 0.173
CysThr: 0.0 ± 0.0
CysVal: 0.181 ± 0.103
CysTrp: 0.0 ± 0.0
CysTyr: 0.272 ± 0.148
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.354 ± 0.617
AspCys: 0.091 ± 0.105
AspAsp: 4.082 ± 0.732
AspGlu: 4.989 ± 0.736
AspPhe: 2.902 ± 0.514
AspGly: 5.17 ± 0.716
AspHis: 0.635 ± 0.282
AspIle: 4.172 ± 0.581
AspLys: 5.624 ± 0.767
AspLeu: 5.442 ± 0.602
AspMet: 1.361 ± 0.41
AspAsn: 3.265 ± 0.493
AspPro: 1.542 ± 0.42
AspGln: 1.542 ± 0.392
AspArg: 2.54 ± 0.377
AspSer: 3.719 ± 0.59
AspThr: 4.444 ± 0.649
AspVal: 4.807 ± 0.514
AspTrp: 1.179 ± 0.293
AspTyr: 2.721 ± 0.471
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.349 ± 0.84
GluCys: 0.363 ± 0.189
GluAsp: 3.084 ± 0.612
GluGlu: 5.533 ± 1.065
GluPhe: 2.902 ± 0.597
GluGly: 3.81 ± 0.592
GluHis: 1.451 ± 0.385
GluIle: 6.077 ± 0.677
GluLys: 5.805 ± 0.981
GluLeu: 7.256 ± 0.728
GluMet: 1.905 ± 0.57
GluAsn: 4.807 ± 0.706
GluPro: 1.814 ± 0.511
GluGln: 4.354 ± 0.772
GluArg: 3.81 ± 0.507
GluSer: 3.356 ± 0.399
GluThr: 3.719 ± 0.616
GluVal: 4.626 ± 0.7
GluTrp: 1.088 ± 0.242
GluTyr: 3.265 ± 0.648
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.54 ± 0.5
PheCys: 0.363 ± 0.168
PheAsp: 3.719 ± 0.555
PheGlu: 3.447 ± 0.583
PhePhe: 1.088 ± 0.333
PheGly: 2.086 ± 0.626
PheHis: 0.816 ± 0.275
PheIle: 2.268 ± 0.532
PheLys: 2.993 ± 0.395
PheLeu: 2.086 ± 0.486
PheMet: 0.816 ± 0.209
PheAsn: 2.63 ± 0.678
PhePro: 1.361 ± 0.427
PheGln: 1.451 ± 0.388
PheArg: 1.451 ± 0.321
PheSer: 2.812 ± 0.555
PheThr: 1.542 ± 0.371
PheVal: 2.902 ± 0.5
PheTrp: 0.635 ± 0.222
PheTyr: 1.905 ± 0.355
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.082 ± 0.721
GlyCys: 0.363 ± 0.225
GlyAsp: 3.537 ± 0.574
GlyGlu: 4.535 ± 0.644
GlyPhe: 2.902 ± 0.534
GlyGly: 5.079 ± 0.952
GlyHis: 1.179 ± 0.309
GlyIle: 4.263 ± 0.806
GlyLys: 5.079 ± 0.671
GlyLeu: 4.535 ± 0.764
GlyMet: 1.814 ± 0.431
GlyAsn: 2.812 ± 0.586
GlyPro: 0.998 ± 0.38
GlyGln: 3.175 ± 0.573
GlyArg: 4.263 ± 0.598
GlySer: 3.628 ± 0.634
GlyThr: 3.447 ± 0.663
GlyVal: 3.991 ± 0.386
GlyTrp: 1.088 ± 0.312
GlyTyr: 2.54 ± 0.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.998 ± 0.361
HisCys: 0.0 ± 0.0
HisAsp: 0.998 ± 0.308
HisGlu: 1.27 ± 0.327
HisPhe: 0.998 ± 0.453
HisGly: 0.998 ± 0.299
HisHis: 0.181 ± 0.11
HisIle: 1.179 ± 0.283
HisLys: 0.726 ± 0.278
HisLeu: 1.088 ± 0.335
HisMet: 0.091 ± 0.083
HisAsn: 0.635 ± 0.217
HisPro: 0.454 ± 0.169
HisGln: 0.544 ± 0.218
HisArg: 0.816 ± 0.274
HisSer: 0.907 ± 0.299
HisThr: 0.363 ± 0.159
HisVal: 0.635 ± 0.244
HisTrp: 0.091 ± 0.089
HisTyr: 0.454 ± 0.224
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.442 ± 0.753
IleCys: 0.544 ± 0.196
IleAsp: 4.626 ± 0.468
IleGlu: 6.259 ± 0.948
IlePhe: 2.449 ± 0.774
IleGly: 4.626 ± 0.956
IleHis: 0.726 ± 0.221
IleIle: 5.351 ± 0.65
IleLys: 5.986 ± 0.639
IleLeu: 4.626 ± 0.733
IleMet: 0.907 ± 0.295
IleAsn: 3.537 ± 0.722
IlePro: 1.814 ± 0.339
IleGln: 2.268 ± 0.376
IleArg: 3.537 ± 0.45
IleSer: 3.9 ± 0.554
IleThr: 4.807 ± 0.958
IleVal: 3.628 ± 0.665
IleTrp: 0.907 ± 0.269
IleTyr: 2.721 ± 0.551
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.714 ± 0.859
LysCys: 0.272 ± 0.165
LysAsp: 4.807 ± 0.666
LysGlu: 4.898 ± 0.769
LysPhe: 1.995 ± 0.368
LysGly: 4.626 ± 0.541
LysHis: 1.088 ± 0.357
LysIle: 5.17 ± 0.659
LysLys: 5.986 ± 1.056
LysLeu: 6.531 ± 0.665
LysMet: 1.905 ± 0.423
LysAsn: 3.719 ± 0.501
LysPro: 3.084 ± 0.579
LysGln: 4.989 ± 0.865
LysArg: 4.807 ± 0.882
LysSer: 4.444 ± 0.594
LysThr: 5.533 ± 1.066
LysVal: 4.807 ± 0.667
LysTrp: 0.907 ± 0.263
LysTyr: 3.265 ± 0.664
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.893 ± 0.897
LeuCys: 0.635 ± 0.285
LeuAsp: 5.986 ± 0.742
LeuGlu: 6.531 ± 0.897
LeuPhe: 3.447 ± 0.804
LeuGly: 5.261 ± 0.97
LeuHis: 0.816 ± 0.268
LeuIle: 3.991 ± 0.581
LeuLys: 7.256 ± 0.792
LeuLeu: 5.714 ± 0.953
LeuMet: 1.905 ± 0.452
LeuAsn: 3.719 ± 0.571
LeuPro: 2.54 ± 0.748
LeuGln: 3.537 ± 0.602
LeuArg: 3.991 ± 0.658
LeuSer: 5.261 ± 0.535
LeuThr: 5.351 ± 1.301
LeuVal: 3.81 ± 0.53
LeuTrp: 0.272 ± 0.117
LeuTyr: 2.086 ± 0.302
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.177 ± 0.528
MetCys: 0.181 ± 0.153
MetAsp: 1.451 ± 0.287
MetGlu: 1.361 ± 0.347
MetPhe: 0.816 ± 0.297
MetGly: 0.998 ± 0.367
MetHis: 0.272 ± 0.163
MetIle: 1.542 ± 0.36
MetLys: 1.361 ± 0.393
MetLeu: 1.814 ± 0.358
MetMet: 0.454 ± 0.175
MetAsn: 1.27 ± 0.348
MetPro: 0.998 ± 0.293
MetGln: 0.726 ± 0.267
MetArg: 0.907 ± 0.391
MetSer: 2.177 ± 0.519
MetThr: 2.086 ± 0.388
MetVal: 0.726 ± 0.271
MetTrp: 0.091 ± 0.086
MetTyr: 0.635 ± 0.21
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.263 ± 0.698
AsnCys: 0.272 ± 0.157
AsnAsp: 2.902 ± 0.482
AsnGlu: 3.356 ± 0.612
AsnPhe: 2.086 ± 0.369
AsnGly: 4.172 ± 0.693
AsnHis: 0.998 ± 0.241
AsnIle: 3.81 ± 0.608
AsnLys: 2.993 ± 0.512
AsnLeu: 4.354 ± 0.674
AsnMet: 0.998 ± 0.223
AsnAsn: 2.812 ± 0.402
AsnPro: 1.633 ± 0.487
AsnGln: 2.721 ± 0.534
AsnArg: 2.721 ± 0.507
AsnSer: 3.175 ± 0.534
AsnThr: 3.81 ± 0.772
AsnVal: 3.81 ± 0.65
AsnTrp: 0.907 ± 0.262
AsnTyr: 2.358 ± 0.527
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.358 ± 0.449
ProCys: 0.181 ± 0.123
ProAsp: 2.177 ± 0.428
ProGlu: 2.268 ± 0.374
ProPhe: 1.814 ± 0.396
ProGly: 1.361 ± 0.402
ProHis: 0.272 ± 0.154
ProIle: 2.54 ± 0.537
ProLys: 1.723 ± 0.529
ProLeu: 2.177 ± 0.497
ProMet: 0.454 ± 0.173
ProAsn: 1.27 ± 0.393
ProPro: 0.544 ± 0.23
ProGln: 1.27 ± 0.352
ProArg: 1.361 ± 0.444
ProSer: 1.451 ± 0.307
ProThr: 1.723 ± 0.338
ProVal: 1.905 ± 0.3
ProTrp: 0.635 ± 0.262
ProTyr: 1.179 ± 0.301
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.442 ± 0.667
GlnCys: 0.272 ± 0.15
GlnAsp: 1.814 ± 0.417
GlnGlu: 2.721 ± 0.569
GlnPhe: 0.998 ± 0.232
GlnGly: 1.814 ± 0.407
GlnHis: 0.272 ± 0.155
GlnIle: 3.356 ± 0.588
GlnLys: 3.265 ± 0.627
GlnLeu: 4.172 ± 0.599
GlnMet: 1.088 ± 0.313
GlnAsn: 3.628 ± 0.454
GlnPro: 1.451 ± 0.311
GlnGln: 2.63 ± 0.524
GlnArg: 2.268 ± 0.508
GlnSer: 3.265 ± 0.713
GlnThr: 3.265 ± 0.546
GlnVal: 3.719 ± 0.548
GlnTrp: 0.726 ± 0.224
GlnTyr: 1.088 ± 0.334
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.175 ± 0.537
ArgCys: 0.454 ± 0.205
ArgAsp: 2.54 ± 0.458
ArgGlu: 4.172 ± 0.744
ArgPhe: 2.177 ± 0.521
ArgGly: 1.905 ± 0.322
ArgHis: 0.907 ± 0.263
ArgIle: 3.537 ± 0.532
ArgLys: 4.172 ± 0.742
ArgLeu: 4.717 ± 0.707
ArgMet: 1.179 ± 0.345
ArgAsn: 3.175 ± 0.527
ArgPro: 1.179 ± 0.423
ArgGln: 2.54 ± 0.471
ArgArg: 2.268 ± 0.6
ArgSer: 2.086 ± 0.51
ArgThr: 2.812 ± 0.562
ArgVal: 1.995 ± 0.334
ArgTrp: 0.635 ± 0.273
ArgTyr: 2.358 ± 0.503
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.263 ± 0.788
SerCys: 0.181 ± 0.123
SerAsp: 4.989 ± 0.571
SerGlu: 4.354 ± 0.649
SerPhe: 2.449 ± 0.582
SerGly: 4.263 ± 0.645
SerHis: 1.179 ± 0.362
SerIle: 3.537 ± 0.523
SerLys: 4.717 ± 0.827
SerLeu: 5.624 ± 0.94
SerMet: 1.179 ± 0.281
SerAsn: 2.721 ± 0.457
SerPro: 1.542 ± 0.265
SerGln: 2.993 ± 0.563
SerArg: 2.54 ± 0.492
SerSer: 3.265 ± 0.504
SerThr: 3.084 ± 0.592
SerVal: 2.54 ± 0.326
SerTrp: 0.726 ± 0.304
SerTyr: 3.719 ± 0.702
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.624 ± 1.306
ThrCys: 0.272 ± 0.11
ThrAsp: 4.717 ± 0.588
ThrGlu: 5.261 ± 0.964
ThrPhe: 2.449 ± 0.476
ThrGly: 4.898 ± 0.561
ThrHis: 0.454 ± 0.188
ThrIle: 3.719 ± 0.594
ThrLys: 5.17 ± 0.799
ThrLeu: 4.717 ± 0.624
ThrMet: 0.907 ± 0.318
ThrAsn: 2.449 ± 0.527
ThrPro: 1.542 ± 0.375
ThrGln: 3.537 ± 1.012
ThrArg: 1.633 ± 0.519
ThrSer: 4.172 ± 0.656
ThrThr: 4.354 ± 1.448
ThrVal: 4.626 ± 0.932
ThrTrp: 0.635 ± 0.198
ThrTyr: 2.812 ± 0.497
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.989 ± 0.692
ValCys: 0.181 ± 0.156
ValAsp: 4.082 ± 0.477
ValGlu: 4.354 ± 0.462
ValPhe: 2.268 ± 0.487
ValGly: 3.81 ± 0.574
ValHis: 0.998 ± 0.338
ValIle: 2.993 ± 0.413
ValLys: 3.81 ± 0.474
ValLeu: 3.81 ± 0.737
ValMet: 1.451 ± 0.427
ValAsn: 4.354 ± 0.512
ValPro: 1.905 ± 0.335
ValGln: 2.177 ± 0.398
ValArg: 2.449 ± 0.473
ValSer: 5.624 ± 0.685
ValThr: 4.535 ± 0.74
ValVal: 4.354 ± 1.092
ValTrp: 0.816 ± 0.274
ValTyr: 2.177 ± 0.517
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.816 ± 0.263
TrpCys: 0.091 ± 0.095
TrpAsp: 0.635 ± 0.34
TrpGlu: 1.088 ± 0.365
TrpPhe: 0.816 ± 0.407
TrpGly: 0.544 ± 0.213
TrpHis: 0.091 ± 0.089
TrpIle: 0.726 ± 0.201
TrpLys: 0.998 ± 0.233
TrpLeu: 1.27 ± 0.404
TrpMet: 0.272 ± 0.143
TrpAsn: 1.27 ± 0.398
TrpPro: 0.091 ± 0.096
TrpGln: 0.907 ± 0.238
TrpArg: 0.454 ± 0.204
TrpSer: 0.635 ± 0.23
TrpThr: 0.726 ± 0.25
TrpVal: 0.816 ± 0.309
TrpTrp: 0.091 ± 0.089
TrpTyr: 0.635 ± 0.21
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.361 ± 0.386
TyrCys: 0.272 ± 0.117
TyrAsp: 2.358 ± 0.442
TyrGlu: 2.812 ± 0.646
TyrPhe: 2.177 ± 0.428
TyrGly: 2.268 ± 0.34
TyrHis: 0.454 ± 0.229
TyrIle: 4.172 ± 0.69
TyrLys: 3.175 ± 0.7
TyrLeu: 2.721 ± 0.545
TyrMet: 0.816 ± 0.313
TyrAsn: 1.723 ± 0.373
TyrPro: 1.905 ± 0.429
TyrGln: 2.086 ± 0.489
TyrArg: 2.54 ± 0.606
TyrSer: 2.268 ± 0.349
TyrThr: 2.721 ± 0.436
TyrVal: 2.177 ± 0.424
TyrTrp: 0.998 ± 0.305
TyrTyr: 1.723 ± 0.685
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (11026 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
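A protein-level bootstrap of this kind can be sketched as below. It is a hedged illustration only: it assumes that "x100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported error is the standard deviation of the resampled frequencies. None of these details is confirmed by the page, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across
    resamples; each resample draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy_set = ["AAAA", "AKAK", "KKKK", "MAAK"]
toy_error = bootstrap_error(toy_set, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.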

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski