Amino acid dipeptide frequency for Gordonia phage EMoore

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
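As a minimal sketch of what such a calculation might look like, the Python below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter input sequences, and the choice to normalize by the total number of counted pairs are assumptions made for illustration; they are not taken from the Proteome-pI pipeline, whose exact normalization is not stated here.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein (never across protein boundaries) and normalized by the
    total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy = ["MAAG", "AAK"]
freqs = dipeptide_permille(toy)
```

With the toy input, five pairs are counted in total (MA, AA, AG, AA, AK), so `freqs["AA"]` is 2/5 of 1000.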

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
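The expected value and the over/under-representation call can be written out as a short calculation. The sketch below assumes the uniform baseline described above (1/400, or 1/441 with Xaa) and a plain above/below comparison; the function names are hypothetical, and whether the site applies any tolerance or significance threshold before marking a cell is not stated here.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def representation(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 20.015, CysCys = 0.16 (per mille).
print(expected_permille(20))   # 2.5
print(representation(20.015))  # over
print(representation(0.16))    # under
```

Passing `alphabet_size=21` gives the baseline for the 441-dipeptide case, roughly 2.27‰.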

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions (for example, 20.015‰ corresponds to 0.020015, or about 2%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.015 ± 1.86
AlaCys: 1.065 ± 0.256
AlaAsp: 8.57 ± 1.163
AlaGlu: 7.931 ± 0.868
AlaPhe: 2.981 ± 0.518
AlaGly: 10.007 ± 0.826
AlaHis: 2.449 ± 0.421
AlaIle: 5.483 ± 0.646
AlaLys: 3.407 ± 0.52
AlaLeu: 10.699 ± 0.928
AlaMet: 2.502 ± 0.397
AlaAsn: 2.502 ± 0.444
AlaPro: 6.601 ± 0.577
AlaGln: 5.376 ± 0.669
AlaArg: 9.049 ± 0.762
AlaSer: 4.897 ± 0.589
AlaThr: 8.57 ± 0.73
AlaVal: 7.985 ± 0.726
AlaTrp: 2.395 ± 0.382
AlaTyr: 2.821 ± 0.398
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.958 ± 0.215
CysCys: 0.16 ± 0.114
CysAsp: 1.065 ± 0.296
CysGlu: 0.532 ± 0.22
CysPhe: 0.213 ± 0.104
CysGly: 1.278 ± 0.327
CysHis: 0.16 ± 0.1
CysIle: 0.373 ± 0.158
CysLys: 0.053 ± 0.051
CysLeu: 0.319 ± 0.16
CysMet: 0.16 ± 0.077
CysAsn: 0.426 ± 0.132
CysPro: 0.905 ± 0.231
CysGln: 0.213 ± 0.105
CysArg: 0.958 ± 0.236
CysSer: 0.319 ± 0.154
CysThr: 0.532 ± 0.21
CysVal: 0.373 ± 0.131
CysTrp: 0.106 ± 0.083
CysTyr: 0.213 ± 0.13
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.89 ± 0.944
AspCys: 0.798 ± 0.195
AspAsp: 7.346 ± 1.042
AspGlu: 5.536 ± 0.86
AspPhe: 1.065 ± 0.22
AspGly: 6.441 ± 0.61
AspHis: 1.703 ± 0.359
AspIle: 1.597 ± 0.249
AspLys: 1.544 ± 0.283
AspLeu: 4.95 ± 0.614
AspMet: 1.597 ± 0.309
AspAsn: 1.757 ± 0.343
AspPro: 5.749 ± 0.652
AspGln: 2.981 ± 0.445
AspArg: 4.738 ± 0.6
AspSer: 2.555 ± 0.503
AspThr: 4.205 ± 0.437
AspVal: 4.844 ± 0.603
AspTrp: 1.81 ± 0.298
AspTyr: 1.171 ± 0.267
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.441 ± 0.649
GluCys: 0.532 ± 0.187
GluAsp: 2.608 ± 0.386
GluGlu: 1.544 ± 0.374
GluPhe: 2.023 ± 0.279
GluGly: 3.407 ± 0.437
GluHis: 1.597 ± 0.363
GluIle: 2.182 ± 0.408
GluLys: 0.586 ± 0.237
GluLeu: 5.217 ± 0.753
GluMet: 1.171 ± 0.224
GluAsn: 1.49 ± 0.293
GluPro: 3.354 ± 0.445
GluGln: 3.939 ± 0.565
GluArg: 4.258 ± 0.698
GluSer: 2.874 ± 0.368
GluThr: 3.194 ± 0.383
GluVal: 4.578 ± 0.54
GluTrp: 1.65 ± 0.355
GluTyr: 1.757 ± 0.381
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.821 ± 0.537
PheCys: 0.319 ± 0.171
PheAsp: 2.342 ± 0.42
PheGlu: 1.544 ± 0.293
PhePhe: 0.639 ± 0.251
PheGly: 2.608 ± 0.39
PheHis: 0.532 ± 0.181
PheIle: 0.479 ± 0.179
PheLys: 0.745 ± 0.22
PheLeu: 1.49 ± 0.242
PheMet: 0.266 ± 0.11
PheAsn: 0.798 ± 0.29
PhePro: 1.118 ± 0.271
PheGln: 0.692 ± 0.202
PheArg: 1.224 ± 0.26
PheSer: 0.905 ± 0.2
PheThr: 1.703 ± 0.293
PheVal: 1.97 ± 0.339
PheTrp: 0.532 ± 0.188
PheTyr: 0.426 ± 0.16
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.007 ± 1.289
GlyCys: 0.479 ± 0.188
GlyAsp: 5.802 ± 0.634
GlyGlu: 4.631 ± 0.381
GlyPhe: 1.703 ± 0.284
GlyGly: 10.22 ± 2.215
GlyHis: 1.49 ± 0.287
GlyIle: 4.684 ± 0.601
GlyLys: 2.449 ± 0.361
GlyLeu: 5.483 ± 0.66
GlyMet: 2.076 ± 0.363
GlyAsn: 3.354 ± 0.562
GlyPro: 3.833 ± 0.611
GlyGln: 3.247 ± 0.493
GlyArg: 5.057 ± 0.471
GlySer: 4.844 ± 0.622
GlyThr: 6.068 ± 0.89
GlyVal: 7.133 ± 0.62
GlyTrp: 1.544 ± 0.262
GlyTyr: 2.449 ± 0.469
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.916 ± 0.354
HisCys: 0.16 ± 0.098
HisAsp: 1.703 ± 0.445
HisGlu: 1.331 ± 0.286
HisPhe: 0.426 ± 0.151
HisGly: 1.544 ± 0.302
HisHis: 0.639 ± 0.219
HisIle: 0.798 ± 0.203
HisLys: 0.266 ± 0.1
HisLeu: 2.289 ± 0.369
HisMet: 0.532 ± 0.176
HisAsn: 0.426 ± 0.152
HisPro: 1.49 ± 0.295
HisGln: 0.905 ± 0.189
HisArg: 2.449 ± 0.431
HisSer: 0.586 ± 0.189
HisThr: 1.81 ± 0.343
HisVal: 1.65 ± 0.353
HisTrp: 0.213 ± 0.109
HisTyr: 0.479 ± 0.215
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.483 ± 0.619
IleCys: 0.266 ± 0.121
IleAsp: 3.513 ± 0.411
IleGlu: 3.354 ± 0.386
IlePhe: 0.639 ± 0.24
IleGly: 4.312 ± 1.024
IleHis: 0.745 ± 0.212
IleIle: 1.544 ± 0.37
IleLys: 1.384 ± 0.365
IleLeu: 2.076 ± 0.328
IleMet: 0.532 ± 0.148
IleAsn: 1.597 ± 0.269
IlePro: 2.928 ± 0.313
IleGln: 1.065 ± 0.307
IleArg: 1.863 ± 0.26
IleSer: 1.703 ± 0.348
IleThr: 3.566 ± 0.345
IleVal: 3.513 ± 0.468
IleTrp: 0.639 ± 0.166
IleTyr: 0.426 ± 0.151
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.513 ± 0.485
LysCys: 0.266 ± 0.111
LysAsp: 0.958 ± 0.254
LysGlu: 0.639 ± 0.162
LysPhe: 1.065 ± 0.257
LysGly: 1.49 ± 0.243
LysHis: 0.586 ± 0.177
LysIle: 1.011 ± 0.283
LysLys: 0.426 ± 0.127
LysLeu: 2.874 ± 0.427
LysMet: 0.586 ± 0.147
LysAsn: 0.745 ± 0.231
LysPro: 1.597 ± 0.28
LysGln: 1.224 ± 0.281
LysArg: 2.289 ± 0.311
LysSer: 1.49 ± 0.333
LysThr: 1.81 ± 0.354
LysVal: 2.236 ± 0.414
LysTrp: 0.213 ± 0.116
LysTyr: 0.852 ± 0.207
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.688 ± 0.625
LeuCys: 0.532 ± 0.178
LeuAsp: 6.76 ± 0.798
LeuGlu: 4.471 ± 0.436
LeuPhe: 1.597 ± 0.348
LeuGly: 7.186 ± 1.067
LeuHis: 1.65 ± 0.381
LeuIle: 3.354 ± 0.404
LeuLys: 1.81 ± 0.343
LeuLeu: 5.855 ± 0.523
LeuMet: 1.65 ± 0.297
LeuAsn: 2.395 ± 0.345
LeuPro: 5.323 ± 0.571
LeuGln: 2.023 ± 0.572
LeuArg: 5.217 ± 0.59
LeuSer: 3.3 ± 0.344
LeuThr: 6.068 ± 0.574
LeuVal: 6.707 ± 0.614
LeuTrp: 1.81 ± 0.413
LeuTyr: 1.384 ± 0.284
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.502 ± 0.374
MetCys: 0.266 ± 0.108
MetAsp: 0.639 ± 0.169
MetGlu: 0.479 ± 0.137
MetPhe: 0.266 ± 0.099
MetGly: 1.597 ± 0.276
MetHis: 0.426 ± 0.157
MetIle: 0.905 ± 0.259
MetLys: 0.586 ± 0.18
MetLeu: 1.703 ± 0.327
MetMet: 0.639 ± 0.175
MetAsn: 1.011 ± 0.197
MetPro: 2.502 ± 0.341
MetGln: 0.639 ± 0.174
MetArg: 1.544 ± 0.275
MetSer: 1.49 ± 0.314
MetThr: 3.194 ± 0.373
MetVal: 1.171 ± 0.274
MetTrp: 0.319 ± 0.136
MetTyr: 0.373 ± 0.162
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.726 ± 0.659
AsnCys: 0.213 ± 0.114
AsnAsp: 2.023 ± 0.376
AsnGlu: 1.49 ± 0.255
AsnPhe: 0.639 ± 0.162
AsnGly: 3.886 ± 0.495
AsnHis: 0.532 ± 0.157
AsnIle: 1.011 ± 0.365
AsnLys: 0.958 ± 0.222
AsnLeu: 2.874 ± 0.352
AsnMet: 0.266 ± 0.112
AsnAsn: 0.639 ± 0.23
AsnPro: 2.395 ± 0.398
AsnGln: 0.532 ± 0.183
AsnArg: 1.757 ± 0.344
AsnSer: 1.597 ± 0.244
AsnThr: 1.171 ± 0.211
AsnVal: 2.821 ± 0.548
AsnTrp: 0.745 ± 0.203
AsnTyr: 0.586 ± 0.193
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.091 ± 0.705
ProCys: 0.479 ± 0.166
ProAsp: 5.483 ± 0.575
ProGlu: 3.939 ± 0.504
ProPhe: 1.384 ± 0.313
ProGly: 5.483 ± 0.638
ProHis: 1.703 ± 0.31
ProIle: 2.449 ± 0.39
ProLys: 1.703 ± 0.297
ProLeu: 3.513 ± 0.437
ProMet: 1.49 ± 0.256
ProAsn: 2.289 ± 0.318
ProPro: 4.791 ± 0.618
ProGln: 1.863 ± 0.293
ProArg: 4.471 ± 0.671
ProSer: 2.342 ± 0.331
ProThr: 4.578 ± 0.506
ProVal: 4.791 ± 0.452
ProTrp: 1.544 ± 0.375
ProTyr: 1.544 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.631 ± 0.465
GlnCys: 0.692 ± 0.177
GlnAsp: 1.278 ± 0.271
GlnGlu: 1.171 ± 0.242
GlnPhe: 1.331 ± 0.274
GlnGly: 2.449 ± 0.455
GlnHis: 0.479 ± 0.134
GlnIle: 1.49 ± 0.263
GlnLys: 0.852 ± 0.25
GlnLeu: 3.992 ± 0.486
GlnMet: 1.597 ± 0.297
GlnAsn: 1.065 ± 0.195
GlnPro: 2.236 ± 0.384
GlnGln: 2.874 ± 0.504
GlnArg: 3.141 ± 0.402
GlnSer: 1.544 ± 0.351
GlnThr: 2.129 ± 0.3
GlnVal: 3.247 ± 0.387
GlnTrp: 0.692 ± 0.213
GlnTyr: 0.745 ± 0.218
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.623 ± 0.848
ArgCys: 1.118 ± 0.326
ArgAsp: 4.312 ± 0.61
ArgGlu: 3.726 ± 0.614
ArgPhe: 1.011 ± 0.239
ArgGly: 4.791 ± 0.583
ArgHis: 1.597 ± 0.391
ArgIle: 3.833 ± 0.467
ArgLys: 2.874 ± 0.462
ArgLeu: 5.696 ± 0.558
ArgMet: 2.076 ± 0.346
ArgAsn: 2.076 ± 0.278
ArgPro: 3.407 ± 0.617
ArgGln: 3.141 ± 0.419
ArgArg: 7.399 ± 1.006
ArgSer: 2.928 ± 0.39
ArgThr: 4.046 ± 0.527
ArgVal: 5.589 ± 0.637
ArgTrp: 1.544 ± 0.342
ArgTyr: 2.023 ± 0.353
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.855 ± 0.874
SerCys: 0.16 ± 0.097
SerAsp: 2.555 ± 0.355
SerGlu: 1.65 ± 0.33
SerPhe: 0.745 ± 0.163
SerGly: 5.27 ± 0.544
SerHis: 1.171 ± 0.246
SerIle: 1.81 ± 0.331
SerLys: 1.011 ± 0.271
SerLeu: 3.779 ± 0.516
SerMet: 1.224 ± 0.249
SerAsn: 1.49 ± 0.339
SerPro: 2.715 ± 0.386
SerGln: 1.437 ± 0.301
SerArg: 2.395 ± 0.372
SerSer: 3.3 ± 0.501
SerThr: 4.418 ± 0.561
SerVal: 3.939 ± 0.417
SerTrp: 1.118 ± 0.205
SerTyr: 0.905 ± 0.212
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.89 ± 0.93
ThrCys: 0.798 ± 0.175
ThrAsp: 5.483 ± 0.706
ThrGlu: 4.099 ± 0.568
ThrPhe: 2.076 ± 0.293
ThrGly: 6.228 ± 0.759
ThrHis: 1.544 ± 0.382
ThrIle: 3.886 ± 0.488
ThrLys: 1.916 ± 0.334
ThrLeu: 5.004 ± 0.581
ThrMet: 1.703 ± 0.29
ThrAsn: 2.023 ± 0.39
ThrPro: 5.696 ± 0.56
ThrGln: 1.118 ± 0.245
ThrArg: 4.897 ± 0.518
ThrSer: 3.779 ± 0.522
ThrThr: 5.11 ± 0.516
ThrVal: 4.631 ± 0.454
ThrTrp: 1.171 ± 0.225
ThrTyr: 0.905 ± 0.168
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.517 ± 0.733
ValCys: 0.426 ± 0.161
ValAsp: 5.483 ± 0.525
ValGlu: 4.525 ± 0.445
ValPhe: 2.023 ± 0.39
ValGly: 5.589 ± 0.583
ValHis: 1.597 ± 0.299
ValIle: 3.087 ± 0.505
ValLys: 2.555 ± 0.376
ValLeu: 6.707 ± 0.713
ValMet: 1.171 ± 0.287
ValAsn: 2.608 ± 0.342
ValPro: 4.631 ± 0.555
ValGln: 2.715 ± 0.389
ValArg: 5.323 ± 0.516
ValSer: 3.566 ± 0.526
ValThr: 5.642 ± 0.536
ValVal: 6.441 ± 0.65
ValTrp: 1.703 ± 0.344
ValTyr: 2.076 ± 0.433
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.129 ± 0.354
TrpCys: 0.479 ± 0.173
TrpAsp: 1.011 ± 0.236
TrpGlu: 0.798 ± 0.236
TrpPhe: 0.692 ± 0.202
TrpGly: 0.798 ± 0.242
TrpHis: 0.639 ± 0.158
TrpIle: 0.639 ± 0.157
TrpLys: 0.373 ± 0.144
TrpLeu: 2.395 ± 0.409
TrpMet: 0.532 ± 0.166
TrpAsn: 0.426 ± 0.147
TrpPro: 1.278 ± 0.251
TrpGln: 0.905 ± 0.209
TrpArg: 1.97 ± 0.352
TrpSer: 1.863 ± 0.431
TrpThr: 1.49 ± 0.29
TrpVal: 1.437 ± 0.262
TrpTrp: 0.532 ± 0.228
TrpTyr: 0.319 ± 0.122
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.608 ± 0.321
TyrCys: 0.106 ± 0.062
TyrAsp: 2.342 ± 0.493
TyrGlu: 1.011 ± 0.236
TyrPhe: 0.639 ± 0.147
TyrGly: 1.97 ± 0.284
TyrHis: 0.373 ± 0.167
TyrIle: 0.532 ± 0.175
TyrLys: 0.373 ± 0.127
TyrLeu: 1.863 ± 0.392
TyrMet: 0.426 ± 0.175
TyrAsn: 0.745 ± 0.178
TyrPro: 1.384 ± 0.293
TyrGln: 0.745 ± 0.192
TyrArg: 1.863 ± 0.356
TyrSer: 1.065 ± 0.197
TyrThr: 1.65 ± 0.347
TyrVal: 1.331 ± 0.283
TyrTrp: 0.426 ± 0.12
TyrTyr: 0.639 ± 0.233
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
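Each table entry pairs a dipeptide with its frequency and error, as in `AlaAla: 20.015 ± 1.86`. For readers who want to work with the listing programmatically, the following is a small parsing sketch; the function name `parse_entry` and the regular expression are illustrative assumptions, not part of Proteome-pI. The pattern also tolerates a leading copy of the value in front of the dipeptide name, in case entries are copied from a source that fuses the two.

```python
import re

ENTRY = re.compile(
    r"^(?:\d+(?:\.\d+)?)?"             # optional leading copy of the value
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"  # first and second residue (3-letter codes)
    r":\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)$"
)


def parse_entry(line):
    """Return (first, second, frequency, error), or None for other lines."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    first, second, freq, err = match.groups()
    return first, second, float(freq), float(err)


print(parse_entry("AlaAla: 20.015 ± 1.86"))
# ('Ala', 'Ala', 20.015, 1.86)
```

Row labels such as `Ala` do not match the pattern and yield `None`, so they can be skipped when reading the listing line by line.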
Statistics based on 82 proteins (18787 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
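A protein-level bootstrap of this kind could be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the results is reported. The function names, the use of the sample standard deviation as the error, and the normalization by the number of counted pairs are assumptions for illustration; only the 100 resamples and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MAAG", "AAK", "MKAA", "GAAAL"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the variation between proteins is what drives the estimated error.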

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski