Amino acid dipepetide frequency for Streptomyces phage Omar

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.111AlaAla: 12.111 ± 1.309
0.74AlaCys: 0.74 ± 0.251
8.679AlaAsp: 8.679 ± 0.66
9.419AlaGlu: 9.419 ± 0.905
2.96AlaPhe: 2.96 ± 0.519
9.621AlaGly: 9.621 ± 0.888
1.682AlaHis: 1.682 ± 0.335
4.441AlaIle: 4.441 ± 0.51
5.853AlaLys: 5.853 ± 0.849
11.572AlaLeu: 11.572 ± 1.238
2.893AlaMet: 2.893 ± 0.516
2.489AlaAsn: 2.489 ± 0.377
4.575AlaPro: 4.575 ± 0.548
4.037AlaGln: 4.037 ± 0.542
7.199AlaArg: 7.199 ± 0.634
5.517AlaSer: 5.517 ± 0.77
5.113AlaThr: 5.113 ± 0.517
8.343AlaVal: 8.343 ± 0.905
2.22AlaTrp: 2.22 ± 0.34
2.96AlaTyr: 2.96 ± 0.512
0.0AlaXaa: 0.0 ± 0.0
Cys
0.807CysAla: 0.807 ± 0.25
0.067CysCys: 0.067 ± 0.074
0.538CysAsp: 0.538 ± 0.18
0.404CysGlu: 0.404 ± 0.2
0.269CysPhe: 0.269 ± 0.121
0.875CysGly: 0.875 ± 0.279
0.336CysHis: 0.336 ± 0.154
0.269CysIle: 0.269 ± 0.13
0.135CysLys: 0.135 ± 0.106
0.74CysLeu: 0.74 ± 0.206
0.135CysMet: 0.135 ± 0.11
0.269CysAsn: 0.269 ± 0.124
0.606CysPro: 0.606 ± 0.318
0.269CysGln: 0.269 ± 0.142
0.807CysArg: 0.807 ± 0.31
0.336CysSer: 0.336 ± 0.132
0.807CysThr: 0.807 ± 0.29
0.404CysVal: 0.404 ± 0.194
0.269CysTrp: 0.269 ± 0.12
0.135CysTyr: 0.135 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
7.401AspAla: 7.401 ± 0.778
0.606AspCys: 0.606 ± 0.211
4.642AspAsp: 4.642 ± 0.622
5.248AspGlu: 5.248 ± 0.726
1.749AspPhe: 1.749 ± 0.334
7.065AspGly: 7.065 ± 0.544
1.413AspHis: 1.413 ± 0.338
2.624AspIle: 2.624 ± 0.367
2.355AspLys: 2.355 ± 0.441
5.921AspLeu: 5.921 ± 0.683
1.615AspMet: 1.615 ± 0.252
1.278AspAsn: 1.278 ± 0.282
4.037AspPro: 4.037 ± 0.546
1.884AspGln: 1.884 ± 0.342
3.095AspArg: 3.095 ± 0.366
3.229AspSer: 3.229 ± 0.61
3.902AspThr: 3.902 ± 0.524
3.835AspVal: 3.835 ± 0.553
1.884AspTrp: 1.884 ± 0.342
1.615AspTyr: 1.615 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
8.477GluAla: 8.477 ± 1.072
1.009GluCys: 1.009 ± 0.424
4.844GluAsp: 4.844 ± 0.601
5.113GluGlu: 5.113 ± 0.98
2.086GluPhe: 2.086 ± 0.413
5.584GluGly: 5.584 ± 0.598
1.682GluHis: 1.682 ± 0.393
3.297GluIle: 3.297 ± 0.462
2.288GluLys: 2.288 ± 0.429
6.661GluLeu: 6.661 ± 0.779
0.74GluMet: 0.74 ± 0.189
1.413GluAsn: 1.413 ± 0.3
2.759GluPro: 2.759 ± 0.595
3.499GluGln: 3.499 ± 0.47
3.768GluArg: 3.768 ± 0.672
3.633GluSer: 3.633 ± 0.562
4.306GluThr: 4.306 ± 0.878
5.315GluVal: 5.315 ± 0.6
1.211GluTrp: 1.211 ± 0.301
2.826GluTyr: 2.826 ± 0.465
0.0GluXaa: 0.0 ± 0.0
Phe
3.633PheAla: 3.633 ± 0.508
0.336PheCys: 0.336 ± 0.143
2.422PheAsp: 2.422 ± 0.396
2.489PheGlu: 2.489 ± 0.44
0.942PhePhe: 0.942 ± 0.437
2.893PheGly: 2.893 ± 0.472
0.807PheHis: 0.807 ± 0.238
1.144PheIle: 1.144 ± 0.309
0.807PheLys: 0.807 ± 0.257
1.749PheLeu: 1.749 ± 0.298
0.673PheMet: 0.673 ± 0.237
1.009PheAsn: 1.009 ± 0.258
1.076PhePro: 1.076 ± 0.246
1.278PheGln: 1.278 ± 0.29
1.547PheArg: 1.547 ± 0.307
1.547PheSer: 1.547 ± 0.362
2.826PheThr: 2.826 ± 0.622
2.018PheVal: 2.018 ± 0.392
0.538PheTrp: 0.538 ± 0.184
0.74PheTyr: 0.74 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
8.612GlyAla: 8.612 ± 1.012
0.606GlyCys: 0.606 ± 0.232
4.373GlyAsp: 4.373 ± 0.623
5.113GlyGlu: 5.113 ± 0.596
3.229GlyPhe: 3.229 ± 0.412
6.257GlyGly: 6.257 ± 0.8
2.422GlyHis: 2.422 ± 0.498
3.902GlyIle: 3.902 ± 1.114
5.248GlyLys: 5.248 ± 0.653
7.065GlyLeu: 7.065 ± 0.827
1.009GlyMet: 1.009 ± 0.266
2.624GlyAsn: 2.624 ± 0.454
3.499GlyPro: 3.499 ± 0.416
2.557GlyGln: 2.557 ± 0.374
5.382GlyArg: 5.382 ± 0.667
5.181GlySer: 5.181 ± 0.605
4.844GlyThr: 4.844 ± 0.657
6.526GlyVal: 6.526 ± 0.732
2.153GlyTrp: 2.153 ± 0.413
2.96GlyTyr: 2.96 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
1.749HisAla: 1.749 ± 0.377
0.202HisCys: 0.202 ± 0.11
1.009HisAsp: 1.009 ± 0.223
1.615HisGlu: 1.615 ± 0.395
1.48HisPhe: 1.48 ± 0.303
1.615HisGly: 1.615 ± 0.349
1.144HisHis: 1.144 ± 0.407
0.673HisIle: 0.673 ± 0.235
0.269HisLys: 0.269 ± 0.129
1.884HisLeu: 1.884 ± 0.371
0.606HisMet: 0.606 ± 0.189
0.404HisAsn: 0.404 ± 0.124
0.942HisPro: 0.942 ± 0.227
0.74HisGln: 0.74 ± 0.255
1.413HisArg: 1.413 ± 0.373
1.346HisSer: 1.346 ± 0.43
1.144HisThr: 1.144 ± 0.25
1.278HisVal: 1.278 ± 0.298
0.807HisTrp: 0.807 ± 0.207
0.538HisTyr: 0.538 ± 0.202
0.0HisXaa: 0.0 ± 0.0
Ile
4.912IleAla: 4.912 ± 0.589
0.336IleCys: 0.336 ± 0.146
3.229IleAsp: 3.229 ± 0.52
4.037IleGlu: 4.037 ± 0.625
1.278IlePhe: 1.278 ± 0.304
3.566IleGly: 3.566 ± 0.793
0.875IleHis: 0.875 ± 0.237
1.48IleIle: 1.48 ± 0.424
1.951IleLys: 1.951 ± 0.52
2.96IleLeu: 2.96 ± 0.558
0.471IleMet: 0.471 ± 0.248
1.144IleAsn: 1.144 ± 0.304
2.086IlePro: 2.086 ± 0.336
1.615IleGln: 1.615 ± 0.348
3.095IleArg: 3.095 ± 0.404
2.018IleSer: 2.018 ± 0.494
2.826IleThr: 2.826 ± 0.392
3.431IleVal: 3.431 ± 0.457
0.135IleTrp: 0.135 ± 0.09
1.076IleTyr: 1.076 ± 0.24
0.0IleXaa: 0.0 ± 0.0
Lys
5.315LysAla: 5.315 ± 1.028
0.202LysCys: 0.202 ± 0.138
2.557LysAsp: 2.557 ± 0.503
2.288LysGlu: 2.288 ± 0.404
1.076LysPhe: 1.076 ± 0.212
4.171LysGly: 4.171 ± 0.55
0.538LysHis: 0.538 ± 0.197
1.682LysIle: 1.682 ± 0.391
2.893LysLys: 2.893 ± 0.622
4.037LysLeu: 4.037 ± 0.694
0.673LysMet: 0.673 ± 0.254
1.076LysAsn: 1.076 ± 0.286
2.624LysPro: 2.624 ± 0.621
1.682LysGln: 1.682 ± 0.393
3.835LysArg: 3.835 ± 0.566
2.288LysSer: 2.288 ± 0.395
2.826LysThr: 2.826 ± 0.471
3.229LysVal: 3.229 ± 0.576
0.471LysTrp: 0.471 ± 0.174
1.547LysTyr: 1.547 ± 0.399
0.0LysXaa: 0.0 ± 0.0
Leu
11.034LeuAla: 11.034 ± 1.013
0.606LeuCys: 0.606 ± 0.198
5.248LeuAsp: 5.248 ± 0.617
4.104LeuGlu: 4.104 ± 0.488
2.086LeuPhe: 2.086 ± 0.439
7.468LeuGly: 7.468 ± 0.879
1.615LeuHis: 1.615 ± 0.317
3.768LeuIle: 3.768 ± 0.536
3.364LeuLys: 3.364 ± 0.686
5.584LeuLeu: 5.584 ± 0.803
2.086LeuMet: 2.086 ± 0.412
3.229LeuAsn: 3.229 ± 0.585
3.902LeuPro: 3.902 ± 0.612
2.691LeuGln: 2.691 ± 0.405
5.45LeuArg: 5.45 ± 0.786
5.584LeuSer: 5.584 ± 0.687
5.652LeuThr: 5.652 ± 0.464
5.921LeuVal: 5.921 ± 0.727
0.942LeuTrp: 0.942 ± 0.236
1.951LeuTyr: 1.951 ± 0.351
0.0LeuXaa: 0.0 ± 0.0
Met
3.229MetAla: 3.229 ± 0.476
0.135MetCys: 0.135 ± 0.097
1.009MetAsp: 1.009 ± 0.228
1.48MetGlu: 1.48 ± 0.306
0.673MetPhe: 0.673 ± 0.261
1.076MetGly: 1.076 ± 0.276
0.606MetHis: 0.606 ± 0.216
1.009MetIle: 1.009 ± 0.252
0.74MetLys: 0.74 ± 0.248
1.346MetLeu: 1.346 ± 0.282
0.471MetMet: 0.471 ± 0.165
0.336MetAsn: 0.336 ± 0.136
0.942MetPro: 0.942 ± 0.226
0.74MetGln: 0.74 ± 0.2
1.615MetArg: 1.615 ± 0.399
1.749MetSer: 1.749 ± 0.421
1.682MetThr: 1.682 ± 0.409
1.278MetVal: 1.278 ± 0.255
0.336MetTrp: 0.336 ± 0.151
0.538MetTyr: 0.538 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.028AsnAla: 3.028 ± 0.492
0.471AsnCys: 0.471 ± 0.233
1.547AsnAsp: 1.547 ± 0.35
1.547AsnGlu: 1.547 ± 0.308
0.673AsnPhe: 0.673 ± 0.235
2.96AsnGly: 2.96 ± 0.304
0.606AsnHis: 0.606 ± 0.222
1.413AsnIle: 1.413 ± 0.319
0.875AsnLys: 0.875 ± 0.255
2.018AsnLeu: 2.018 ± 0.409
0.471AsnMet: 0.471 ± 0.185
0.673AsnAsn: 0.673 ± 0.22
1.346AsnPro: 1.346 ± 0.288
1.144AsnGln: 1.144 ± 0.252
1.48AsnArg: 1.48 ± 0.264
1.211AsnSer: 1.211 ± 0.261
2.288AsnThr: 2.288 ± 0.357
1.951AsnVal: 1.951 ± 0.307
0.673AsnTrp: 0.673 ± 0.219
0.673AsnTyr: 0.673 ± 0.166
0.0AsnXaa: 0.0 ± 0.0
Pro
6.19ProAla: 6.19 ± 0.707
0.673ProCys: 0.673 ± 0.248
3.566ProAsp: 3.566 ± 0.459
3.835ProGlu: 3.835 ± 0.671
1.144ProPhe: 1.144 ± 0.276
4.037ProGly: 4.037 ± 0.507
0.404ProHis: 0.404 ± 0.154
1.884ProIle: 1.884 ± 0.359
2.893ProLys: 2.893 ± 0.533
2.893ProLeu: 2.893 ± 0.504
1.076ProMet: 1.076 ± 0.295
1.144ProAsn: 1.144 ± 0.253
2.489ProPro: 2.489 ± 0.553
1.211ProGln: 1.211 ± 0.292
2.96ProArg: 2.96 ± 0.406
3.499ProSer: 3.499 ± 0.639
2.759ProThr: 2.759 ± 0.481
3.499ProVal: 3.499 ± 0.528
0.673ProTrp: 0.673 ± 0.228
1.144ProTyr: 1.144 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
4.777GlnAla: 4.777 ± 0.595
0.202GlnCys: 0.202 ± 0.122
1.547GlnAsp: 1.547 ± 0.315
2.086GlnGlu: 2.086 ± 0.454
0.673GlnPhe: 0.673 ± 0.187
2.018GlnGly: 2.018 ± 0.336
0.673GlnHis: 0.673 ± 0.217
1.346GlnIle: 1.346 ± 0.258
1.817GlnLys: 1.817 ± 0.317
2.691GlnLeu: 2.691 ± 0.865
1.278GlnMet: 1.278 ± 0.334
0.74GlnAsn: 0.74 ± 0.191
1.211GlnPro: 1.211 ± 0.262
1.211GlnGln: 1.211 ± 0.308
3.162GlnArg: 3.162 ± 0.508
2.288GlnSer: 2.288 ± 0.343
1.615GlnThr: 1.615 ± 0.338
3.028GlnVal: 3.028 ± 0.386
0.74GlnTrp: 0.74 ± 0.216
0.74GlnTyr: 0.74 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
5.921ArgAla: 5.921 ± 0.933
0.538ArgCys: 0.538 ± 0.19
4.171ArgAsp: 4.171 ± 0.497
5.113ArgGlu: 5.113 ± 0.763
2.489ArgPhe: 2.489 ± 0.363
5.113ArgGly: 5.113 ± 0.715
1.48ArgHis: 1.48 ± 0.324
3.095ArgIle: 3.095 ± 0.418
3.229ArgLys: 3.229 ± 0.46
5.786ArgLeu: 5.786 ± 0.623
1.413ArgMet: 1.413 ± 0.338
1.48ArgAsn: 1.48 ± 0.27
3.499ArgPro: 3.499 ± 0.541
2.288ArgGln: 2.288 ± 0.543
5.315ArgArg: 5.315 ± 0.781
3.297ArgSer: 3.297 ± 0.476
3.499ArgThr: 3.499 ± 0.535
4.642ArgVal: 4.642 ± 0.638
1.144ArgTrp: 1.144 ± 0.258
2.422ArgTyr: 2.422 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
5.584SerAla: 5.584 ± 0.757
0.067SerCys: 0.067 ± 0.078
3.162SerAsp: 3.162 ± 0.524
4.306SerGlu: 4.306 ± 0.398
1.615SerPhe: 1.615 ± 0.317
5.719SerGly: 5.719 ± 0.853
1.076SerHis: 1.076 ± 0.479
2.557SerIle: 2.557 ± 0.423
2.826SerLys: 2.826 ± 0.452
4.844SerLeu: 4.844 ± 0.619
1.346SerMet: 1.346 ± 0.348
1.951SerAsn: 1.951 ± 0.328
2.557SerPro: 2.557 ± 0.396
2.018SerGln: 2.018 ± 0.378
3.902SerArg: 3.902 ± 0.675
4.239SerSer: 4.239 ± 0.72
4.373SerThr: 4.373 ± 0.828
3.499SerVal: 3.499 ± 0.505
1.144SerTrp: 1.144 ± 0.252
1.884SerTyr: 1.884 ± 0.332
0.0SerXaa: 0.0 ± 0.0
Thr
5.719ThrAla: 5.719 ± 0.529
0.471ThrCys: 0.471 ± 0.17
4.441ThrAsp: 4.441 ± 0.437
4.104ThrGlu: 4.104 ± 0.587
2.557ThrPhe: 2.557 ± 0.378
4.912ThrGly: 4.912 ± 0.756
1.076ThrHis: 1.076 ± 0.317
2.691ThrIle: 2.691 ± 0.467
1.682ThrLys: 1.682 ± 0.357
4.777ThrLeu: 4.777 ± 0.69
1.278ThrMet: 1.278 ± 0.301
1.547ThrAsn: 1.547 ± 0.318
3.902ThrPro: 3.902 ± 0.873
1.413ThrGln: 1.413 ± 0.266
3.768ThrArg: 3.768 ± 0.562
4.912ThrSer: 4.912 ± 0.846
3.566ThrThr: 3.566 ± 0.524
6.19ThrVal: 6.19 ± 0.729
1.211ThrTrp: 1.211 ± 0.295
2.691ThrTyr: 2.691 ± 0.42
0.0ThrXaa: 0.0 ± 0.0
Val
8.612ValAla: 8.612 ± 0.687
0.471ValCys: 0.471 ± 0.172
4.575ValAsp: 4.575 ± 0.514
4.71ValGlu: 4.71 ± 0.725
1.951ValPhe: 1.951 ± 0.363
4.306ValGly: 4.306 ± 0.517
1.48ValHis: 1.48 ± 0.427
3.7ValIle: 3.7 ± 0.481
4.104ValLys: 4.104 ± 0.701
6.257ValLeu: 6.257 ± 0.657
1.547ValMet: 1.547 ± 0.262
2.489ValAsn: 2.489 ± 0.382
3.633ValPro: 3.633 ± 0.481
2.759ValGln: 2.759 ± 0.382
4.373ValArg: 4.373 ± 0.631
3.633ValSer: 3.633 ± 0.539
5.719ValThr: 5.719 ± 0.569
4.306ValVal: 4.306 ± 0.623
1.413ValTrp: 1.413 ± 0.291
2.355ValTyr: 2.355 ± 0.554
0.0ValXaa: 0.0 ± 0.0
Trp
2.153TrpAla: 2.153 ± 0.351
0.336TrpCys: 0.336 ± 0.145
1.615TrpAsp: 1.615 ± 0.361
1.278TrpGlu: 1.278 ± 0.262
0.807TrpPhe: 0.807 ± 0.224
1.278TrpGly: 1.278 ± 0.28
0.404TrpHis: 0.404 ± 0.166
0.269TrpIle: 0.269 ± 0.121
0.875TrpLys: 0.875 ± 0.23
1.278TrpLeu: 1.278 ± 0.334
0.606TrpMet: 0.606 ± 0.179
0.606TrpAsn: 0.606 ± 0.201
0.74TrpPro: 0.74 ± 0.233
0.336TrpGln: 0.336 ± 0.21
1.413TrpArg: 1.413 ± 0.26
1.211TrpSer: 1.211 ± 0.29
1.413TrpThr: 1.413 ± 0.342
1.413TrpVal: 1.413 ± 0.34
0.067TrpTrp: 0.067 ± 0.064
0.404TrpTyr: 0.404 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.162TyrAla: 3.162 ± 0.498
0.404TyrCys: 0.404 ± 0.235
2.22TyrAsp: 2.22 ± 0.47
2.22TyrGlu: 2.22 ± 0.513
0.74TyrPhe: 0.74 ± 0.213
3.028TyrGly: 3.028 ± 0.615
0.538TyrHis: 0.538 ± 0.226
1.278TyrIle: 1.278 ± 0.268
0.807TyrLys: 0.807 ± 0.244
2.22TyrLeu: 2.22 ± 0.365
0.538TyrMet: 0.538 ± 0.189
1.144TyrAsn: 1.144 ± 0.283
1.547TyrPro: 1.547 ± 0.461
0.471TyrGln: 0.471 ± 0.185
2.355TyrArg: 2.355 ± 0.392
1.951TyrSer: 1.951 ± 0.416
1.749TyrThr: 1.749 ± 0.382
2.355TyrVal: 2.355 ± 0.375
0.471TyrTrp: 0.471 ± 0.168
1.009TyrTyr: 1.009 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (14864 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski