Amino acid dipeptide frequency for Gordonia phage Opie

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
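As a rough illustration of how such a table could be produced, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter input alphabet, the choice of denominator (total number of pairs), and the toy sequences are all assumptions made for this example; the source does not state the exact counting convention used by the database.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping, counted inside each protein only,
    and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X before counting.
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MAAGL", "AAXK"]  # made-up sequences, not taken from the phage proteome
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE)
print(len(freqs), over)
```

Multiplying any returned value by 10^-3 gives the plain fraction; comparing it with `UNIFORM_PERMILLE` is one simple way to decide whether a dipeptide is over- or underrepresented relative to the uniform baseline.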

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Rows: first residue; columns: second residue.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 19.266 ± 1.297 | 0.718 ± 0.195 | 7.772 ± 0.658 | 7.38 ± 0.778 | 3.527 ± 0.492 | 11.625 ± 0.901 | 2.025 ± 0.422 | 4.768 ± 0.59 | 3.657 ± 0.514 | 11.037 ± 0.976 | 2.155 ± 0.382 | 2.743 ± 0.357 | 6.727 ± 0.577 | 5.094 ± 0.757 | 8.294 ± 0.788 | 6.139 ± 0.783 | 8.947 ± 0.895 | 8.686 ± 1.081 | 1.633 ± 0.3 | 2.286 ± 0.331 | 0.0 ± 0.0
Cys | 0.588 ± 0.221 | 0.131 ± 0.075 | 0.718 ± 0.224 | 0.522 ± 0.199 | 0.065 ± 0.064 | 0.914 ± 0.229 | 0.131 ± 0.09 | 0.131 ± 0.077 | 0.196 ± 0.121 | 0.653 ± 0.228 | 0.065 ± 0.069 | 0.392 ± 0.151 | 0.588 ± 0.232 | 0.196 ± 0.11 | 0.784 ± 0.243 | 0.588 ± 0.174 | 0.522 ± 0.192 | 0.653 ± 0.2 | 0.392 ± 0.186 | 0.131 ± 0.097 | 0.0 ± 0.0
Asp | 7.576 ± 0.692 | 0.588 ± 0.247 | 5.029 ± 0.754 | 5.812 ± 0.684 | 2.155 ± 0.336 | 5.486 ± 0.77 | 1.633 ± 0.355 | 1.502 ± 0.39 | 1.241 ± 0.344 | 6.857 ± 0.686 | 0.849 ± 0.214 | 1.241 ± 0.264 | 4.441 ± 0.689 | 2.286 ± 0.358 | 5.159 ± 0.665 | 3.527 ± 0.445 | 2.547 ± 0.377 | 4.702 ± 0.59 | 1.437 ± 0.25 | 1.502 ± 0.311 | 0.0 ± 0.0
Glu | 6.596 ± 0.712 | 0.457 ± 0.173 | 3.853 ± 0.54 | 2.286 ± 0.412 | 1.894 ± 0.419 | 3.984 ± 0.694 | 1.11 ± 0.292 | 3.331 ± 0.539 | 1.371 ± 0.222 | 5.225 ± 0.558 | 1.176 ± 0.26 | 1.306 ± 0.322 | 3.984 ± 0.707 | 2.808 ± 0.364 | 4.898 ± 0.608 | 3.135 ± 0.391 | 3.396 ± 0.435 | 3.853 ± 0.505 | 1.698 ± 0.41 | 2.155 ± 0.41 | 0.0 ± 0.0
Phe | 3.527 ± 0.494 | 0.196 ± 0.115 | 2.09 ± 0.329 | 2.025 ± 0.382 | 0.522 ± 0.143 | 2.286 ± 0.371 | 0.653 ± 0.163 | 0.914 ± 0.254 | 0.588 ± 0.209 | 2.155 ± 0.498 | 0.522 ± 0.164 | 0.392 ± 0.229 | 1.371 ± 0.243 | 0.849 ± 0.21 | 1.633 ± 0.332 | 1.829 ± 0.357 | 2.155 ± 0.39 | 2.155 ± 0.315 | 0.522 ± 0.182 | 0.522 ± 0.167 | 0.0 ± 0.0
Gly | 7.837 ± 0.801 | 0.588 ± 0.222 | 5.486 ± 0.608 | 4.702 ± 0.335 | 2.743 ± 0.482 | 7.641 ± 0.845 | 2.155 ± 0.355 | 3.788 ± 0.898 | 2.547 ± 0.41 | 8.033 ± 0.79 | 2.025 ± 0.377 | 2.09 ± 0.346 | 3.2 ± 0.466 | 3.461 ± 0.475 | 7.249 ± 0.821 | 4.637 ± 0.51 | 6.008 ± 0.608 | 7.249 ± 0.513 | 1.633 ± 0.304 | 1.894 ± 0.274 | 0.0 ± 0.0
His | 2.547 ± 0.431 | 0.392 ± 0.183 | 1.11 ± 0.255 | 1.11 ± 0.308 | 0.588 ± 0.215 | 1.959 ± 0.456 | 0.718 ± 0.245 | 0.784 ± 0.212 | 0.457 ± 0.163 | 1.763 ± 0.471 | 0.457 ± 0.164 | 0.588 ± 0.229 | 1.371 ± 0.248 | 0.522 ± 0.189 | 1.176 ± 0.282 | 1.371 ± 0.315 | 0.98 ± 0.307 | 1.045 ± 0.304 | 0.392 ± 0.16 | 0.653 ± 0.199 | 0.0 ± 0.0
Ile | 7.053 ± 0.696 | 0.392 ± 0.169 | 3.396 ± 0.592 | 3.135 ± 0.381 | 1.045 ± 0.243 | 3.331 ± 0.449 | 0.588 ± 0.18 | 1.763 ± 0.377 | 1.371 ± 0.403 | 2.155 ± 0.377 | 0.327 ± 0.154 | 1.045 ± 0.253 | 2.09 ± 0.311 | 1.698 ± 0.35 | 3.461 ± 0.478 | 2.416 ± 0.396 | 3.265 ± 0.505 | 3.788 ± 0.484 | 0.784 ± 0.236 | 0.522 ± 0.183 | 0.0 ± 0.0
Lys | 3.461 ± 0.461 | 0.457 ± 0.153 | 1.894 ± 0.432 | 0.849 ± 0.206 | 0.98 ± 0.303 | 2.286 ± 0.382 | 0.522 ± 0.217 | 1.829 ± 0.394 | 1.502 ± 0.378 | 2.874 ± 0.448 | 0.522 ± 0.183 | 0.327 ± 0.111 | 1.176 ± 0.309 | 0.588 ± 0.253 | 2.743 ± 0.391 | 1.567 ± 0.272 | 2.482 ± 0.486 | 2.286 ± 0.423 | 0.784 ± 0.218 | 0.522 ± 0.18 | 0.0 ± 0.0
Leu | 11.102 ± 0.733 | 0.588 ± 0.182 | 4.702 ± 0.627 | 4.833 ± 0.852 | 2.22 ± 0.358 | 6.857 ± 0.631 | 1.633 ± 0.406 | 3.2 ± 0.57 | 2.416 ± 0.42 | 6.727 ± 0.828 | 1.306 ± 0.266 | 2.09 ± 0.372 | 5.421 ± 0.665 | 2.547 ± 0.394 | 6.531 ± 0.76 | 5.225 ± 0.534 | 5.551 ± 0.622 | 6.727 ± 0.824 | 1.176 ± 0.264 | 1.567 ± 0.32 | 0.0 ± 0.0
Met | 2.612 ± 0.534 | 0.131 ± 0.096 | 0.914 ± 0.257 | 0.392 ± 0.167 | 0.457 ± 0.167 | 1.241 ± 0.348 | 0.457 ± 0.167 | 1.437 ± 0.342 | 0.653 ± 0.187 | 1.633 ± 0.329 | 0.392 ± 0.152 | 1.11 ± 0.365 | 1.176 ± 0.247 | 0.588 ± 0.162 | 1.371 ± 0.33 | 1.698 ± 0.381 | 2.22 ± 0.434 | 0.914 ± 0.294 | 0.522 ± 0.182 | 0.196 ± 0.122 | 0.0 ± 0.0
Asn | 3.2 ± 0.481 | 0.065 ± 0.063 | 2.025 ± 0.333 | 1.11 ± 0.236 | 0.718 ± 0.219 | 3.069 ± 0.498 | 0.718 ± 0.207 | 0.588 ± 0.211 | 0.588 ± 0.22 | 1.959 ± 0.332 | 0.327 ± 0.119 | 0.718 ± 0.31 | 1.959 ± 0.352 | 1.176 ± 0.329 | 2.22 ± 0.374 | 0.98 ± 0.241 | 1.176 ± 0.269 | 2.09 ± 0.503 | 0.718 ± 0.213 | 0.718 ± 0.259 | 0.0 ± 0.0
Pro | 8.098 ± 0.822 | 0.522 ± 0.193 | 3.788 ± 0.531 | 4.18 ± 0.528 | 1.698 ± 0.334 | 4.963 ± 0.787 | 0.98 ± 0.322 | 2.416 ± 0.35 | 1.698 ± 0.281 | 3.592 ± 0.446 | 1.371 ± 0.303 | 1.829 ± 0.43 | 4.114 ± 0.818 | 2.155 ± 0.347 | 3.788 ± 0.711 | 3.657 ± 0.596 | 3.918 ± 0.633 | 4.572 ± 0.604 | 1.829 ± 0.463 | 0.392 ± 0.161 | 0.0 ± 0.0
Gln | 3.461 ± 0.742 | 0.588 ± 0.19 | 1.698 ± 0.303 | 1.567 ± 0.286 | 0.98 ± 0.22 | 2.743 ± 0.471 | 0.914 ± 0.275 | 2.09 ± 0.398 | 1.176 ± 0.328 | 3.461 ± 0.408 | 1.371 ± 0.355 | 0.849 ± 0.24 | 1.633 ± 0.349 | 2.22 ± 0.419 | 2.612 ± 0.515 | 1.894 ± 0.349 | 1.763 ± 0.31 | 2.874 ± 0.391 | 0.784 ± 0.201 | 0.522 ± 0.186 | 0.0 ± 0.0
Arg | 7.968 ± 0.902 | 0.653 ± 0.192 | 5.355 ± 0.492 | 4.441 ± 0.574 | 1.437 ± 0.26 | 5.682 ± 0.687 | 1.437 ± 0.31 | 3.723 ± 0.53 | 3.396 ± 0.415 | 5.551 ± 0.638 | 2.874 ± 0.436 | 2.547 ± 0.488 | 4.441 ± 0.804 | 3.004 ± 0.43 | 7.772 ± 0.955 | 4.049 ± 0.526 | 4.506 ± 0.569 | 4.768 ± 0.505 | 0.784 ± 0.222 | 1.829 ± 0.395 | 0.0 ± 0.0
Ser | 6.531 ± 0.884 | 0.327 ± 0.139 | 3.657 ± 0.458 | 3.135 ± 0.51 | 1.045 ± 0.28 | 6.661 ± 0.834 | 1.241 ± 0.288 | 3.069 ± 0.352 | 1.502 ± 0.276 | 4.245 ± 0.468 | 1.241 ± 0.319 | 1.829 ± 0.286 | 3.984 ± 0.652 | 1.045 ± 0.276 | 3.265 ± 0.543 | 3.723 ± 0.533 | 3.657 ± 0.559 | 4.31 ± 0.622 | 1.176 ± 0.265 | 1.045 ± 0.274 | 0.0 ± 0.0
Thr | 7.706 ± 0.787 | 0.522 ± 0.179 | 4.245 ± 0.602 | 3.853 ± 0.574 | 1.698 ± 0.388 | 5.682 ± 0.611 | 0.718 ± 0.224 | 2.939 ± 0.546 | 1.437 ± 0.358 | 5.486 ± 0.662 | 0.588 ± 0.229 | 2.025 ± 0.412 | 5.029 ± 0.599 | 1.894 ± 0.362 | 4.114 ± 0.478 | 4.245 ± 0.515 | 4.702 ± 0.597 | 5.812 ± 0.688 | 1.437 ± 0.283 | 1.567 ± 0.425 | 0.0 ± 0.0
Val | 10.711 ± 1.138 | 0.327 ± 0.158 | 5.486 ± 0.743 | 4.768 ± 0.694 | 1.698 ± 0.328 | 5.682 ± 0.785 | 1.176 ± 0.28 | 3.592 ± 0.452 | 2.547 ± 0.422 | 5.029 ± 0.673 | 1.502 ± 0.303 | 1.763 ± 0.288 | 5.029 ± 0.602 | 1.959 ± 0.439 | 6.008 ± 0.651 | 4.18 ± 0.527 | 5.225 ± 0.695 | 5.878 ± 0.819 | 1.502 ± 0.294 | 1.437 ± 0.269 | 0.0 ± 0.0
Trp | 2.22 ± 0.408 | 0.392 ± 0.176 | 1.176 ± 0.289 | 0.98 ± 0.283 | 0.718 ± 0.169 | 1.241 ± 0.258 | 0.653 ± 0.253 | 1.176 ± 0.307 | 0.522 ± 0.17 | 2.286 ± 0.447 | 0.718 ± 0.228 | 0.784 ± 0.341 | 0.849 ± 0.24 | 0.784 ± 0.211 | 1.241 ± 0.297 | 1.045 ± 0.267 | 1.306 ± 0.316 | 1.437 ± 0.285 | 0.522 ± 0.206 | 0.131 ± 0.092 | 0.0 ± 0.0
Tyr | 2.416 ± 0.404 | 0.327 ± 0.136 | 1.176 ± 0.276 | 1.437 ± 0.372 | 0.718 ± 0.214 | 1.633 ± 0.295 | 0.522 ± 0.187 | 0.588 ± 0.239 | 0.784 ± 0.227 | 1.698 ± 0.418 | 0.261 ± 0.101 | 0.457 ± 0.152 | 0.98 ± 0.275 | 0.392 ± 0.148 | 1.894 ± 0.393 | 0.784 ± 0.195 | 1.371 ± 0.343 | 1.698 ± 0.373 | 0.457 ± 0.169 | 0.653 ± 0.148 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 73 proteins (15313 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
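A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. Using the sample standard deviation as the error measure, the helper names `permille` and `bootstrap_error`, the fixed seed, and the toy sequences are assumptions for illustration; the source only states that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics

def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille (overlapping pairs, per protein)."""
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == dipeptide
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(resampled, dipeptide))
    return statistics.stdev(estimates)

toy = ["MAAGL", "AAXK", "GLGL", "MKAA"]  # made-up sequences for illustration
print(f"AA: {permille(toy, 'AA'):.3f} ± {bootstrap_error(toy, 'AA'):.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.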

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski