Amino acid dipepetide frequency for Streptococcus phage D1024

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.006AlaAla: 3.006 ± 1.016
0.354AlaCys: 0.354 ± 0.201
3.89AlaAsp: 3.89 ± 0.733
4.244AlaGlu: 4.244 ± 0.643
2.299AlaPhe: 2.299 ± 0.565
4.597AlaGly: 4.597 ± 0.771
0.619AlaHis: 0.619 ± 0.207
4.863AlaIle: 4.863 ± 0.727
6.189AlaLys: 6.189 ± 0.701
5.57AlaLeu: 5.57 ± 0.611
1.591AlaMet: 1.591 ± 0.427
4.509AlaAsn: 4.509 ± 0.788
1.415AlaPro: 1.415 ± 0.332
2.741AlaGln: 2.741 ± 0.587
2.475AlaArg: 2.475 ± 0.531
3.802AlaSer: 3.802 ± 0.647
4.067AlaThr: 4.067 ± 0.872
4.067AlaVal: 4.067 ± 0.661
0.973AlaTrp: 0.973 ± 0.256
2.918AlaTyr: 2.918 ± 0.539
0.0AlaXaa: 0.0 ± 0.0
Cys
0.265CysAla: 0.265 ± 0.129
0.0CysCys: 0.0 ± 0.0
0.619CysAsp: 0.619 ± 0.281
0.265CysGlu: 0.265 ± 0.132
0.442CysPhe: 0.442 ± 0.246
0.442CysGly: 0.442 ± 0.198
0.088CysHis: 0.088 ± 0.097
0.177CysIle: 0.177 ± 0.115
0.796CysLys: 0.796 ± 0.293
0.796CysLeu: 0.796 ± 0.311
0.088CysMet: 0.088 ± 0.098
0.619CysAsn: 0.619 ± 0.321
0.442CysPro: 0.442 ± 0.225
0.088CysGln: 0.088 ± 0.09
0.53CysArg: 0.53 ± 0.316
0.53CysSer: 0.53 ± 0.272
0.265CysThr: 0.265 ± 0.172
0.088CysVal: 0.088 ± 0.072
0.177CysTrp: 0.177 ± 0.156
0.265CysTyr: 0.265 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.271AspAla: 3.271 ± 0.627
0.354AspCys: 0.354 ± 0.189
4.863AspAsp: 4.863 ± 0.711
4.686AspGlu: 4.686 ± 0.764
3.536AspPhe: 3.536 ± 0.488
6.631AspGly: 6.631 ± 1.017
1.149AspHis: 1.149 ± 0.328
5.57AspIle: 5.57 ± 0.768
4.597AspLys: 4.597 ± 0.463
3.713AspLeu: 3.713 ± 0.738
1.857AspMet: 1.857 ± 0.469
3.625AspAsn: 3.625 ± 0.747
2.033AspPro: 2.033 ± 0.456
1.503AspGln: 1.503 ± 0.307
2.387AspArg: 2.387 ± 0.421
3.978AspSer: 3.978 ± 0.788
4.244AspThr: 4.244 ± 0.625
3.36AspVal: 3.36 ± 0.532
1.149AspTrp: 1.149 ± 0.313
2.387AspTyr: 2.387 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
3.271GluAla: 3.271 ± 0.416
0.265GluCys: 0.265 ± 0.132
3.978GluAsp: 3.978 ± 0.734
4.067GluGlu: 4.067 ± 1.025
2.741GluPhe: 2.741 ± 0.514
2.918GluGly: 2.918 ± 0.562
1.061GluHis: 1.061 ± 0.346
5.835GluIle: 5.835 ± 0.761
4.155GluLys: 4.155 ± 0.97
6.808GluLeu: 6.808 ± 0.955
1.945GluMet: 1.945 ± 0.37
3.713GluAsn: 3.713 ± 0.589
1.503GluPro: 1.503 ± 0.539
4.067GluGln: 4.067 ± 0.618
3.094GluArg: 3.094 ± 0.62
3.713GluSer: 3.713 ± 0.55
3.271GluThr: 3.271 ± 0.456
4.597GluVal: 4.597 ± 0.617
1.326GluTrp: 1.326 ± 0.364
3.802GluTyr: 3.802 ± 0.649
0.0GluXaa: 0.0 ± 0.0
Phe
3.271PheAla: 3.271 ± 0.563
0.265PheCys: 0.265 ± 0.186
3.271PheAsp: 3.271 ± 0.407
2.387PheGlu: 2.387 ± 0.561
1.857PhePhe: 1.857 ± 0.481
3.183PheGly: 3.183 ± 0.773
0.53PheHis: 0.53 ± 0.158
2.652PheIle: 2.652 ± 0.462
3.89PheLys: 3.89 ± 0.601
2.829PheLeu: 2.829 ± 0.467
0.442PheMet: 0.442 ± 0.188
3.536PheAsn: 3.536 ± 0.64
0.354PhePro: 0.354 ± 0.175
1.061PheGln: 1.061 ± 0.29
1.768PheArg: 1.768 ± 0.328
3.183PheSer: 3.183 ± 0.527
2.918PheThr: 2.918 ± 0.557
3.094PheVal: 3.094 ± 0.425
0.53PheTrp: 0.53 ± 0.164
2.033PheTyr: 2.033 ± 0.418
0.0PheXaa: 0.0 ± 0.0
Gly
3.536GlyAla: 3.536 ± 0.571
0.707GlyCys: 0.707 ± 0.289
4.155GlyAsp: 4.155 ± 0.635
3.802GlyGlu: 3.802 ± 0.669
3.271GlyPhe: 3.271 ± 0.553
4.597GlyGly: 4.597 ± 0.848
0.707GlyHis: 0.707 ± 0.228
4.863GlyIle: 4.863 ± 0.632
6.808GlyLys: 6.808 ± 0.842
5.481GlyLeu: 5.481 ± 0.879
1.945GlyMet: 1.945 ± 0.497
4.509GlyAsn: 4.509 ± 0.852
1.415GlyPro: 1.415 ± 0.701
2.741GlyGln: 2.741 ± 0.573
3.183GlyArg: 3.183 ± 0.497
4.42GlySer: 4.42 ± 0.784
4.42GlyThr: 4.42 ± 0.722
4.155GlyVal: 4.155 ± 0.791
1.238GlyTrp: 1.238 ± 0.275
2.918GlyTyr: 2.918 ± 0.481
0.0GlyXaa: 0.0 ± 0.0
His
0.53HisAla: 0.53 ± 0.262
0.0HisCys: 0.0 ± 0.0
0.973HisAsp: 0.973 ± 0.289
0.707HisGlu: 0.707 ± 0.28
0.619HisPhe: 0.619 ± 0.231
0.796HisGly: 0.796 ± 0.253
0.354HisHis: 0.354 ± 0.144
1.061HisIle: 1.061 ± 0.267
1.415HisLys: 1.415 ± 0.333
1.149HisLeu: 1.149 ± 0.247
0.265HisMet: 0.265 ± 0.172
0.796HisAsn: 0.796 ± 0.27
0.53HisPro: 0.53 ± 0.181
0.619HisGln: 0.619 ± 0.223
0.707HisArg: 0.707 ± 0.203
0.973HisSer: 0.973 ± 0.306
0.53HisThr: 0.53 ± 0.187
0.973HisVal: 0.973 ± 0.234
0.177HisTrp: 0.177 ± 0.141
1.061HisTyr: 1.061 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
5.039IleAla: 5.039 ± 0.714
0.354IleCys: 0.354 ± 0.206
5.57IleAsp: 5.57 ± 0.726
4.067IleGlu: 4.067 ± 0.573
1.68IlePhe: 1.68 ± 0.368
4.509IleGly: 4.509 ± 0.622
0.884IleHis: 0.884 ± 0.23
2.564IleIle: 2.564 ± 0.41
7.868IleLys: 7.868 ± 0.791
4.42IleLeu: 4.42 ± 0.759
2.033IleMet: 2.033 ± 0.489
3.978IleAsn: 3.978 ± 0.59
3.536IlePro: 3.536 ± 0.465
2.918IleGln: 2.918 ± 0.571
3.183IleArg: 3.183 ± 0.462
3.802IleSer: 3.802 ± 0.66
3.536IleThr: 3.536 ± 0.492
3.713IleVal: 3.713 ± 0.601
1.149IleTrp: 1.149 ± 0.256
2.033IleTyr: 2.033 ± 0.381
0.0IleXaa: 0.0 ± 0.0
Lys
5.923LysAla: 5.923 ± 0.597
0.442LysCys: 0.442 ± 0.283
3.89LysAsp: 3.89 ± 0.778
7.25LysGlu: 7.25 ± 0.874
3.536LysPhe: 3.536 ± 0.783
6.189LysGly: 6.189 ± 0.699
1.591LysHis: 1.591 ± 0.382
6.1LysIle: 6.1 ± 0.816
6.984LysLys: 6.984 ± 1.241
6.189LysLeu: 6.189 ± 0.728
1.857LysMet: 1.857 ± 0.365
6.1LysAsn: 6.1 ± 1.039
2.475LysPro: 2.475 ± 0.431
3.89LysGln: 3.89 ± 0.534
4.332LysArg: 4.332 ± 0.524
4.244LysSer: 4.244 ± 0.481
6.1LysThr: 6.1 ± 0.866
4.509LysVal: 4.509 ± 0.585
0.884LysTrp: 0.884 ± 0.255
3.094LysTyr: 3.094 ± 0.609
0.0LysXaa: 0.0 ± 0.0
Leu
6.365LeuAla: 6.365 ± 0.676
0.619LeuCys: 0.619 ± 0.23
5.835LeuAsp: 5.835 ± 0.889
6.1LeuGlu: 6.1 ± 1.1
3.006LeuPhe: 3.006 ± 0.367
4.951LeuGly: 4.951 ± 0.896
0.884LeuHis: 0.884 ± 0.306
3.802LeuIle: 3.802 ± 0.584
6.808LeuLys: 6.808 ± 0.734
5.57LeuLeu: 5.57 ± 0.602
2.475LeuMet: 2.475 ± 0.46
4.774LeuAsn: 4.774 ± 0.562
2.829LeuPro: 2.829 ± 0.395
2.918LeuGln: 2.918 ± 0.526
3.625LeuArg: 3.625 ± 0.645
4.951LeuSer: 4.951 ± 0.79
5.658LeuThr: 5.658 ± 0.846
3.89LeuVal: 3.89 ± 0.631
1.061LeuTrp: 1.061 ± 0.257
1.591LeuTyr: 1.591 ± 0.445
0.0LeuXaa: 0.0 ± 0.0
Met
1.945MetAla: 1.945 ± 0.369
0.0MetCys: 0.0 ± 0.0
0.796MetAsp: 0.796 ± 0.25
1.68MetGlu: 1.68 ± 0.419
1.238MetPhe: 1.238 ± 0.278
0.707MetGly: 0.707 ± 0.233
0.354MetHis: 0.354 ± 0.163
1.945MetIle: 1.945 ± 0.368
3.006MetLys: 3.006 ± 0.494
1.945MetLeu: 1.945 ± 0.311
0.53MetMet: 0.53 ± 0.232
0.884MetAsn: 0.884 ± 0.305
0.973MetPro: 0.973 ± 0.291
0.53MetGln: 0.53 ± 0.19
0.884MetArg: 0.884 ± 0.244
1.945MetSer: 1.945 ± 0.479
1.503MetThr: 1.503 ± 0.374
2.21MetVal: 2.21 ± 0.523
0.088MetTrp: 0.088 ± 0.072
0.442MetTyr: 0.442 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
4.155AsnAla: 4.155 ± 1.049
0.354AsnCys: 0.354 ± 0.181
3.802AsnAsp: 3.802 ± 0.482
4.42AsnGlu: 4.42 ± 0.842
2.652AsnPhe: 2.652 ± 0.385
6.719AsnGly: 6.719 ± 1.121
0.884AsnHis: 0.884 ± 0.222
3.802AsnIle: 3.802 ± 0.599
4.42AsnLys: 4.42 ± 0.508
4.774AsnLeu: 4.774 ± 0.553
0.884AsnMet: 0.884 ± 0.259
4.597AsnAsn: 4.597 ± 0.634
3.006AsnPro: 3.006 ± 0.644
2.829AsnGln: 2.829 ± 0.489
2.741AsnArg: 2.741 ± 0.577
3.625AsnSer: 3.625 ± 0.517
3.094AsnThr: 3.094 ± 0.517
3.36AsnVal: 3.36 ± 0.546
1.238AsnTrp: 1.238 ± 0.286
2.652AsnTyr: 2.652 ± 0.406
0.0AsnXaa: 0.0 ± 0.0
Pro
1.857ProAla: 1.857 ± 0.401
0.0ProCys: 0.0 ± 0.0
1.503ProAsp: 1.503 ± 0.364
2.299ProGlu: 2.299 ± 0.503
1.326ProPhe: 1.326 ± 0.261
1.503ProGly: 1.503 ± 0.382
0.442ProHis: 0.442 ± 0.16
1.768ProIle: 1.768 ± 0.34
3.625ProLys: 3.625 ± 0.527
2.033ProLeu: 2.033 ± 0.343
0.177ProMet: 0.177 ± 0.181
2.741ProAsn: 2.741 ± 0.403
0.707ProPro: 0.707 ± 0.238
1.415ProGln: 1.415 ± 0.319
0.707ProArg: 0.707 ± 0.206
2.387ProSer: 2.387 ± 0.417
2.475ProThr: 2.475 ± 0.516
1.326ProVal: 1.326 ± 0.399
0.442ProTrp: 0.442 ± 0.167
1.149ProTyr: 1.149 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
3.978GlnAla: 3.978 ± 0.734
0.177GlnCys: 0.177 ± 0.109
1.945GlnAsp: 1.945 ± 0.546
2.387GlnGlu: 2.387 ± 0.453
1.768GlnPhe: 1.768 ± 0.338
3.36GlnGly: 3.36 ± 0.799
0.442GlnHis: 0.442 ± 0.161
2.475GlnIle: 2.475 ± 0.515
3.094GlnLys: 3.094 ± 0.477
3.271GlnLeu: 3.271 ± 0.404
1.415GlnMet: 1.415 ± 0.351
2.21GlnAsn: 2.21 ± 0.43
0.796GlnPro: 0.796 ± 0.3
2.741GlnGln: 2.741 ± 0.59
2.122GlnArg: 2.122 ± 0.402
2.652GlnSer: 2.652 ± 0.471
2.564GlnThr: 2.564 ± 0.371
2.21GlnVal: 2.21 ± 0.437
0.619GlnTrp: 0.619 ± 0.261
2.21GlnTyr: 2.21 ± 0.394
0.0GlnXaa: 0.0 ± 0.0
Arg
2.299ArgAla: 2.299 ± 0.421
0.354ArgCys: 0.354 ± 0.263
2.387ArgAsp: 2.387 ± 0.388
2.299ArgGlu: 2.299 ± 0.416
2.564ArgPhe: 2.564 ± 0.614
2.918ArgGly: 2.918 ± 0.541
0.707ArgHis: 0.707 ± 0.228
2.918ArgIle: 2.918 ± 0.524
3.271ArgLys: 3.271 ± 0.619
4.332ArgLeu: 4.332 ± 0.759
0.973ArgMet: 0.973 ± 0.295
2.652ArgAsn: 2.652 ± 0.428
1.149ArgPro: 1.149 ± 0.271
1.945ArgGln: 1.945 ± 0.44
0.884ArgArg: 0.884 ± 0.236
1.68ArgSer: 1.68 ± 0.483
2.299ArgThr: 2.299 ± 0.66
3.271ArgVal: 3.271 ± 0.465
1.238ArgTrp: 1.238 ± 0.267
2.387ArgTyr: 2.387 ± 0.529
0.0ArgXaa: 0.0 ± 0.0
Ser
3.713SerAla: 3.713 ± 0.491
0.796SerCys: 0.796 ± 0.312
4.155SerAsp: 4.155 ± 0.626
4.42SerGlu: 4.42 ± 0.6
3.006SerPhe: 3.006 ± 0.434
4.509SerGly: 4.509 ± 0.448
0.53SerHis: 0.53 ± 0.202
4.42SerIle: 4.42 ± 0.586
4.42SerLys: 4.42 ± 0.821
4.863SerLeu: 4.863 ± 0.505
1.945SerMet: 1.945 ± 0.307
3.536SerAsn: 3.536 ± 0.559
1.857SerPro: 1.857 ± 0.412
2.918SerGln: 2.918 ± 0.625
2.829SerArg: 2.829 ± 0.637
3.271SerSer: 3.271 ± 0.569
3.625SerThr: 3.625 ± 0.442
4.42SerVal: 4.42 ± 0.619
0.884SerTrp: 0.884 ± 0.412
1.503SerTyr: 1.503 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
4.332ThrAla: 4.332 ± 0.683
0.442ThrCys: 0.442 ± 0.164
5.039ThrAsp: 5.039 ± 0.816
2.829ThrGlu: 2.829 ± 0.462
2.299ThrPhe: 2.299 ± 0.457
3.713ThrGly: 3.713 ± 0.583
1.238ThrHis: 1.238 ± 0.263
4.332ThrIle: 4.332 ± 0.637
4.774ThrLys: 4.774 ± 0.721
5.923ThrLeu: 5.923 ± 0.904
0.973ThrMet: 0.973 ± 0.249
4.42ThrAsn: 4.42 ± 0.489
1.326ThrPro: 1.326 ± 0.356
2.564ThrGln: 2.564 ± 0.509
1.945ThrArg: 1.945 ± 0.495
3.536ThrSer: 3.536 ± 0.554
2.918ThrThr: 2.918 ± 0.602
4.155ThrVal: 4.155 ± 0.828
0.973ThrTrp: 0.973 ± 0.259
3.448ThrTyr: 3.448 ± 0.657
0.0ThrXaa: 0.0 ± 0.0
Val
4.244ValAla: 4.244 ± 0.694
0.442ValCys: 0.442 ± 0.194
4.42ValAsp: 4.42 ± 0.51
4.597ValGlu: 4.597 ± 0.79
2.741ValPhe: 2.741 ± 0.475
4.155ValGly: 4.155 ± 0.558
0.619ValHis: 0.619 ± 0.171
3.713ValIle: 3.713 ± 0.569
5.393ValLys: 5.393 ± 0.635
3.36ValLeu: 3.36 ± 0.602
1.149ValMet: 1.149 ± 0.317
3.713ValAsn: 3.713 ± 0.651
2.122ValPro: 2.122 ± 0.343
1.768ValGln: 1.768 ± 0.391
1.945ValArg: 1.945 ± 0.437
4.863ValSer: 4.863 ± 0.835
4.774ValThr: 4.774 ± 0.645
3.094ValVal: 3.094 ± 0.605
1.238ValTrp: 1.238 ± 0.262
1.68ValTyr: 1.68 ± 0.374
0.0ValXaa: 0.0 ± 0.0
Trp
0.707TrpAla: 0.707 ± 0.242
0.088TrpCys: 0.088 ± 0.089
1.149TrpAsp: 1.149 ± 0.355
0.884TrpGlu: 0.884 ± 0.197
0.973TrpPhe: 0.973 ± 0.268
0.354TrpGly: 0.354 ± 0.166
0.442TrpHis: 0.442 ± 0.207
0.707TrpIle: 0.707 ± 0.213
1.149TrpLys: 1.149 ± 0.374
1.503TrpLeu: 1.503 ± 0.424
0.088TrpMet: 0.088 ± 0.096
1.503TrpAsn: 1.503 ± 0.345
0.177TrpPro: 0.177 ± 0.12
0.707TrpGln: 0.707 ± 0.267
0.884TrpArg: 0.884 ± 0.264
1.591TrpSer: 1.591 ± 0.541
0.973TrpThr: 0.973 ± 0.344
1.149TrpVal: 1.149 ± 0.217
0.265TrpTrp: 0.265 ± 0.123
0.354TrpTyr: 0.354 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.475TyrAla: 2.475 ± 0.407
0.973TyrCys: 0.973 ± 0.377
2.652TyrAsp: 2.652 ± 0.49
2.741TyrGlu: 2.741 ± 0.608
1.503TyrPhe: 1.503 ± 0.348
2.122TyrGly: 2.122 ± 0.505
0.707TyrHis: 0.707 ± 0.264
3.271TyrIle: 3.271 ± 0.494
2.918TyrLys: 2.918 ± 0.65
3.271TyrLeu: 3.271 ± 0.438
0.884TyrMet: 0.884 ± 0.335
1.591TyrAsn: 1.591 ± 0.354
1.238TyrPro: 1.238 ± 0.297
2.475TyrGln: 2.475 ± 0.427
2.299TyrArg: 2.299 ± 0.427
2.475TyrSer: 2.475 ± 0.483
1.945TyrThr: 1.945 ± 0.368
2.387TyrVal: 2.387 ± 0.426
0.0TyrTrp: 0.0 ± 0.0
2.652TyrTyr: 2.652 ± 0.6
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski