Amino acid dipepetide frequency for Gordonia phage Easley

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.375AlaAla: 17.375 ± 1.455
1.11AlaCys: 1.11 ± 0.313
7.577AlaAsp: 7.577 ± 0.719
9.406AlaGlu: 9.406 ± 0.792
2.548AlaPhe: 2.548 ± 0.411
8.492AlaGly: 8.492 ± 0.73
2.352AlaHis: 2.352 ± 0.483
5.356AlaIle: 5.356 ± 0.588
5.356AlaLys: 5.356 ± 0.648
9.472AlaLeu: 9.472 ± 0.876
3.005AlaMet: 3.005 ± 0.619
3.985AlaAsn: 3.985 ± 0.591
5.814AlaPro: 5.814 ± 0.69
5.356AlaGln: 5.356 ± 0.765
7.969AlaArg: 7.969 ± 0.845
5.552AlaSer: 5.552 ± 0.768
7.708AlaThr: 7.708 ± 1.098
7.577AlaVal: 7.577 ± 1.177
1.698AlaTrp: 1.698 ± 0.326
2.156AlaTyr: 2.156 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
0.653CysAla: 0.653 ± 0.239
0.131CysCys: 0.131 ± 0.077
0.719CysAsp: 0.719 ± 0.23
0.523CysGlu: 0.523 ± 0.198
0.196CysPhe: 0.196 ± 0.114
1.437CysGly: 1.437 ± 0.425
0.392CysHis: 0.392 ± 0.179
0.131CysIle: 0.131 ± 0.094
0.065CysLys: 0.065 ± 0.068
0.392CysLeu: 0.392 ± 0.194
0.0CysMet: 0.0 ± 0.0
0.392CysAsn: 0.392 ± 0.185
0.653CysPro: 0.653 ± 0.256
0.065CysGln: 0.065 ± 0.069
1.045CysArg: 1.045 ± 0.359
0.719CysSer: 0.719 ± 0.283
0.784CysThr: 0.784 ± 0.302
0.327CysVal: 0.327 ± 0.128
0.392CysTrp: 0.392 ± 0.192
0.261CysTyr: 0.261 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
7.055AspAla: 7.055 ± 0.702
0.588AspCys: 0.588 ± 0.266
6.206AspAsp: 6.206 ± 0.873
4.507AspGlu: 4.507 ± 0.664
1.698AspPhe: 1.698 ± 0.33
5.683AspGly: 5.683 ± 0.759
2.025AspHis: 2.025 ± 0.412
2.286AspIle: 2.286 ± 0.377
1.96AspLys: 1.96 ± 0.298
6.989AspLeu: 6.989 ± 0.528
1.176AspMet: 1.176 ± 0.301
1.241AspAsn: 1.241 ± 0.413
5.422AspPro: 5.422 ± 0.622
1.764AspGln: 1.764 ± 0.356
4.115AspArg: 4.115 ± 0.666
2.221AspSer: 2.221 ± 0.457
3.527AspThr: 3.527 ± 0.498
4.964AspVal: 4.964 ± 0.542
1.502AspTrp: 1.502 ± 0.292
1.176AspTyr: 1.176 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
7.12GluAla: 7.12 ± 0.748
0.457GluCys: 0.457 ± 0.202
2.939GluAsp: 2.939 ± 0.497
3.005GluGlu: 3.005 ± 0.552
1.894GluPhe: 1.894 ± 0.386
3.658GluGly: 3.658 ± 0.456
1.11GluHis: 1.11 ± 0.291
3.201GluIle: 3.201 ± 0.463
2.156GluLys: 2.156 ± 0.426
6.597GluLeu: 6.597 ± 0.63
1.306GluMet: 1.306 ± 0.326
1.372GluAsn: 1.372 ± 0.347
3.201GluPro: 3.201 ± 0.594
3.462GluGln: 3.462 ± 0.54
4.899GluArg: 4.899 ± 0.778
2.743GluSer: 2.743 ± 0.405
3.201GluThr: 3.201 ± 0.38
4.377GluVal: 4.377 ± 0.584
1.11GluTrp: 1.11 ± 0.257
1.437GluTyr: 1.437 ± 0.342
0.0GluXaa: 0.0 ± 0.0
Phe
2.809PheAla: 2.809 ± 0.521
0.065PheCys: 0.065 ± 0.063
2.286PheAsp: 2.286 ± 0.319
1.045PheGlu: 1.045 ± 0.327
0.784PhePhe: 0.784 ± 0.276
2.613PheGly: 2.613 ± 0.368
0.523PheHis: 0.523 ± 0.189
1.045PheIle: 1.045 ± 0.278
0.457PheLys: 0.457 ± 0.185
1.633PheLeu: 1.633 ± 0.341
0.588PheMet: 0.588 ± 0.187
1.176PheAsn: 1.176 ± 0.34
1.502PhePro: 1.502 ± 0.33
0.784PheGln: 0.784 ± 0.263
1.437PheArg: 1.437 ± 0.266
1.176PheSer: 1.176 ± 0.227
2.221PheThr: 2.221 ± 0.336
1.568PheVal: 1.568 ± 0.428
0.457PheTrp: 0.457 ± 0.175
0.588PheTyr: 0.588 ± 0.205
0.0PheXaa: 0.0 ± 0.0
Gly
7.904GlyAla: 7.904 ± 0.954
0.719GlyCys: 0.719 ± 0.343
5.356GlyAsp: 5.356 ± 0.553
4.703GlyGlu: 4.703 ± 0.437
2.09GlyPhe: 2.09 ± 0.345
7.643GlyGly: 7.643 ± 0.897
1.829GlyHis: 1.829 ± 0.396
3.658GlyIle: 3.658 ± 0.507
3.07GlyLys: 3.07 ± 0.459
6.467GlyLeu: 6.467 ± 0.735
2.417GlyMet: 2.417 ± 0.462
3.005GlyAsn: 3.005 ± 0.473
4.377GlyPro: 4.377 ± 0.583
3.135GlyGln: 3.135 ± 0.438
6.989GlyArg: 6.989 ± 0.702
4.638GlySer: 4.638 ± 0.536
5.226GlyThr: 5.226 ± 0.916
7.381GlyVal: 7.381 ± 0.732
1.502GlyTrp: 1.502 ± 0.307
2.221GlyTyr: 2.221 ± 0.322
0.0GlyXaa: 0.0 ± 0.0
His
2.286HisAla: 2.286 ± 0.338
0.196HisCys: 0.196 ± 0.136
1.176HisAsp: 1.176 ± 0.344
1.306HisGlu: 1.306 ± 0.303
0.327HisPhe: 0.327 ± 0.15
1.894HisGly: 1.894 ± 0.376
0.457HisHis: 0.457 ± 0.23
1.11HisIle: 1.11 ± 0.39
0.327HisLys: 0.327 ± 0.119
1.372HisLeu: 1.372 ± 0.272
0.457HisMet: 0.457 ± 0.157
0.392HisAsn: 0.392 ± 0.158
2.286HisPro: 2.286 ± 0.515
0.98HisGln: 0.98 ± 0.221
1.372HisArg: 1.372 ± 0.331
0.719HisSer: 0.719 ± 0.217
2.09HisThr: 2.09 ± 0.358
1.372HisVal: 1.372 ± 0.273
0.653HisTrp: 0.653 ± 0.199
0.719HisTyr: 0.719 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
6.14IleAla: 6.14 ± 0.695
0.261IleCys: 0.261 ± 0.182
4.442IleAsp: 4.442 ± 0.551
3.593IleGlu: 3.593 ± 0.633
0.719IlePhe: 0.719 ± 0.199
3.527IleGly: 3.527 ± 0.554
0.98IleHis: 0.98 ± 0.198
1.372IleIle: 1.372 ± 0.26
0.653IleLys: 0.653 ± 0.202
2.417IleLeu: 2.417 ± 0.428
0.784IleMet: 0.784 ± 0.262
1.568IleAsn: 1.568 ± 0.238
3.07IlePro: 3.07 ± 0.605
1.372IleGln: 1.372 ± 0.332
3.854IleArg: 3.854 ± 0.459
2.482IleSer: 2.482 ± 0.459
2.939IleThr: 2.939 ± 0.547
3.135IleVal: 3.135 ± 0.487
0.392IleTrp: 0.392 ± 0.168
1.764IleTyr: 1.764 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
4.115LysAla: 4.115 ± 0.528
0.196LysCys: 0.196 ± 0.114
1.764LysAsp: 1.764 ± 0.346
1.698LysGlu: 1.698 ± 0.374
0.523LysPhe: 0.523 ± 0.188
2.417LysGly: 2.417 ± 0.544
0.719LysHis: 0.719 ± 0.203
1.176LysIle: 1.176 ± 0.253
1.633LysLys: 1.633 ± 0.486
3.201LysLeu: 3.201 ± 0.409
0.523LysMet: 0.523 ± 0.162
0.849LysAsn: 0.849 ± 0.236
2.352LysPro: 2.352 ± 0.398
1.306LysGln: 1.306 ± 0.339
2.352LysArg: 2.352 ± 0.46
2.156LysSer: 2.156 ± 0.523
2.352LysThr: 2.352 ± 0.41
2.221LysVal: 2.221 ± 0.36
0.849LysTrp: 0.849 ± 0.263
0.849LysTyr: 0.849 ± 0.202
0.0LysXaa: 0.0 ± 0.0
Leu
10.909LeuAla: 10.909 ± 1.057
0.914LeuCys: 0.914 ± 0.265
5.683LeuAsp: 5.683 ± 0.626
4.181LeuGlu: 4.181 ± 0.484
2.548LeuPhe: 2.548 ± 0.563
7.643LeuGly: 7.643 ± 0.629
1.372LeuHis: 1.372 ± 0.283
3.397LeuIle: 3.397 ± 0.513
2.025LeuLys: 2.025 ± 0.413
5.618LeuLeu: 5.618 ± 0.558
1.568LeuMet: 1.568 ± 0.311
1.96LeuAsn: 1.96 ± 0.372
4.05LeuPro: 4.05 ± 0.62
2.874LeuGln: 2.874 ± 0.457
7.447LeuArg: 7.447 ± 0.819
4.181LeuSer: 4.181 ± 0.483
5.944LeuThr: 5.944 ± 0.646
5.226LeuVal: 5.226 ± 0.635
1.176LeuTrp: 1.176 ± 0.237
1.502LeuTyr: 1.502 ± 0.316
0.0LeuXaa: 0.0 ± 0.0
Met
3.07MetAla: 3.07 ± 0.579
0.131MetCys: 0.131 ± 0.098
0.653MetAsp: 0.653 ± 0.198
0.457MetGlu: 0.457 ± 0.239
0.588MetPhe: 0.588 ± 0.167
1.568MetGly: 1.568 ± 0.376
0.523MetHis: 0.523 ± 0.172
0.457MetIle: 0.457 ± 0.164
0.719MetLys: 0.719 ± 0.203
1.96MetLeu: 1.96 ± 0.391
0.327MetMet: 0.327 ± 0.161
0.719MetAsn: 0.719 ± 0.206
1.764MetPro: 1.764 ± 0.346
0.588MetGln: 0.588 ± 0.175
1.633MetArg: 1.633 ± 0.35
1.894MetSer: 1.894 ± 0.334
2.548MetThr: 2.548 ± 0.416
1.176MetVal: 1.176 ± 0.261
0.588MetTrp: 0.588 ± 0.193
0.065MetTyr: 0.065 ± 0.06
0.0MetXaa: 0.0 ± 0.0
Asn
3.07AsnAla: 3.07 ± 0.649
0.196AsnCys: 0.196 ± 0.116
1.568AsnAsp: 1.568 ± 0.333
1.502AsnGlu: 1.502 ± 0.299
0.653AsnPhe: 0.653 ± 0.177
2.548AsnGly: 2.548 ± 0.505
0.98AsnHis: 0.98 ± 0.194
1.437AsnIle: 1.437 ± 0.299
0.98AsnLys: 0.98 ± 0.257
2.09AsnLeu: 2.09 ± 0.4
0.327AsnMet: 0.327 ± 0.128
1.176AsnAsn: 1.176 ± 0.309
2.874AsnPro: 2.874 ± 0.429
1.241AsnGln: 1.241 ± 0.265
2.809AsnArg: 2.809 ± 0.479
1.698AsnSer: 1.698 ± 0.296
1.96AsnThr: 1.96 ± 0.423
1.306AsnVal: 1.306 ± 0.293
0.588AsnTrp: 0.588 ± 0.184
0.784AsnTyr: 0.784 ± 0.227
0.0AsnXaa: 0.0 ± 0.0
Pro
7.316ProAla: 7.316 ± 0.66
0.784ProCys: 0.784 ± 0.231
4.703ProAsp: 4.703 ± 0.78
4.181ProGlu: 4.181 ± 0.53
1.502ProPhe: 1.502 ± 0.294
6.467ProGly: 6.467 ± 0.764
0.914ProHis: 0.914 ± 0.314
2.482ProIle: 2.482 ± 0.382
2.09ProLys: 2.09 ± 0.336
3.789ProLeu: 3.789 ± 0.452
1.306ProMet: 1.306 ± 0.249
1.96ProAsn: 1.96 ± 0.388
2.743ProPro: 2.743 ± 0.453
1.633ProGln: 1.633 ± 0.385
3.462ProArg: 3.462 ± 0.605
3.266ProSer: 3.266 ± 0.397
4.703ProThr: 4.703 ± 0.746
4.572ProVal: 4.572 ± 0.59
1.372ProTrp: 1.372 ± 0.296
0.719ProTyr: 0.719 ± 0.204
0.0ProXaa: 0.0 ± 0.0
Gln
4.572GlnAla: 4.572 ± 0.694
0.065GlnCys: 0.065 ± 0.064
1.698GlnAsp: 1.698 ± 0.335
1.698GlnGlu: 1.698 ± 0.331
1.11GlnPhe: 1.11 ± 0.248
2.156GlnGly: 2.156 ± 0.571
0.914GlnHis: 0.914 ± 0.255
1.829GlnIle: 1.829 ± 0.403
1.11GlnLys: 1.11 ± 0.322
4.507GlnLeu: 4.507 ± 0.58
1.241GlnMet: 1.241 ± 0.251
0.784GlnAsn: 0.784 ± 0.233
1.502GlnPro: 1.502 ± 0.276
1.437GlnGln: 1.437 ± 0.342
3.527GlnArg: 3.527 ± 0.628
1.698GlnSer: 1.698 ± 0.315
2.939GlnThr: 2.939 ± 0.559
3.201GlnVal: 3.201 ± 0.491
0.914GlnTrp: 0.914 ± 0.21
0.719GlnTyr: 0.719 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
9.21ArgAla: 9.21 ± 0.889
1.241ArgCys: 1.241 ± 0.391
4.507ArgAsp: 4.507 ± 0.718
4.572ArgGlu: 4.572 ± 0.613
1.764ArgPhe: 1.764 ± 0.323
5.03ArgGly: 5.03 ± 0.578
1.306ArgHis: 1.306 ± 0.331
3.985ArgIle: 3.985 ± 0.497
2.809ArgLys: 2.809 ± 0.348
6.271ArgLeu: 6.271 ± 0.683
1.568ArgMet: 1.568 ± 0.281
2.352ArgAsn: 2.352 ± 0.418
3.919ArgPro: 3.919 ± 0.651
2.743ArgGln: 2.743 ± 0.379
6.14ArgArg: 6.14 ± 1.05
3.593ArgSer: 3.593 ± 0.505
5.095ArgThr: 5.095 ± 0.671
4.899ArgVal: 4.899 ± 0.61
2.025ArgTrp: 2.025 ± 0.306
1.829ArgTyr: 1.829 ± 0.307
0.0ArgXaa: 0.0 ± 0.0
Ser
5.16SerAla: 5.16 ± 0.659
0.392SerCys: 0.392 ± 0.161
3.527SerAsp: 3.527 ± 0.413
2.613SerGlu: 2.613 ± 0.38
1.437SerPhe: 1.437 ± 0.324
5.291SerGly: 5.291 ± 0.788
1.045SerHis: 1.045 ± 0.231
3.201SerIle: 3.201 ± 0.402
1.829SerLys: 1.829 ± 0.34
3.658SerLeu: 3.658 ± 0.452
1.437SerMet: 1.437 ± 0.251
1.568SerAsn: 1.568 ± 0.347
2.286SerPro: 2.286 ± 0.368
2.809SerGln: 2.809 ± 0.525
2.482SerArg: 2.482 ± 0.414
2.939SerSer: 2.939 ± 0.467
3.985SerThr: 3.985 ± 0.47
3.854SerVal: 3.854 ± 0.492
0.914SerTrp: 0.914 ± 0.27
1.11SerTyr: 1.11 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
8.753ThrAla: 8.753 ± 1.085
0.653ThrCys: 0.653 ± 0.215
3.201ThrAsp: 3.201 ± 0.372
3.397ThrGlu: 3.397 ± 0.569
1.502ThrPhe: 1.502 ± 0.324
7.185ThrGly: 7.185 ± 0.781
1.764ThrHis: 1.764 ± 0.337
4.246ThrIle: 4.246 ± 0.527
1.96ThrLys: 1.96 ± 0.366
5.683ThrLeu: 5.683 ± 0.568
0.849ThrMet: 0.849 ± 0.219
1.633ThrAsn: 1.633 ± 0.269
5.487ThrPro: 5.487 ± 0.802
1.96ThrGln: 1.96 ± 0.434
4.572ThrArg: 4.572 ± 0.592
3.593ThrSer: 3.593 ± 0.479
5.16ThrThr: 5.16 ± 0.874
5.618ThrVal: 5.618 ± 0.648
1.502ThrTrp: 1.502 ± 0.406
1.764ThrTyr: 1.764 ± 0.347
0.0ThrXaa: 0.0 ± 0.0
Val
8.034ValAla: 8.034 ± 0.754
0.719ValCys: 0.719 ± 0.246
5.487ValAsp: 5.487 ± 0.601
4.572ValGlu: 4.572 ± 0.455
1.829ValPhe: 1.829 ± 0.305
5.226ValGly: 5.226 ± 0.664
0.914ValHis: 0.914 ± 0.247
3.919ValIle: 3.919 ± 0.494
2.809ValLys: 2.809 ± 0.5
4.964ValLeu: 4.964 ± 0.539
1.633ValMet: 1.633 ± 0.313
2.221ValAsn: 2.221 ± 0.319
4.246ValPro: 4.246 ± 0.461
2.678ValGln: 2.678 ± 0.565
5.03ValArg: 5.03 ± 0.485
3.919ValSer: 3.919 ± 0.54
5.03ValThr: 5.03 ± 0.573
4.442ValVal: 4.442 ± 0.512
0.588ValTrp: 0.588 ± 0.212
1.241ValTyr: 1.241 ± 0.299
0.0ValXaa: 0.0 ± 0.0
Trp
1.437TrpAla: 1.437 ± 0.266
0.261TrpCys: 0.261 ± 0.144
1.306TrpAsp: 1.306 ± 0.311
0.784TrpGlu: 0.784 ± 0.239
0.719TrpPhe: 0.719 ± 0.232
1.764TrpGly: 1.764 ± 0.323
0.784TrpHis: 0.784 ± 0.26
0.719TrpIle: 0.719 ± 0.197
0.523TrpLys: 0.523 ± 0.194
1.372TrpLeu: 1.372 ± 0.277
0.457TrpMet: 0.457 ± 0.154
0.523TrpAsn: 0.523 ± 0.15
1.306TrpPro: 1.306 ± 0.268
0.523TrpGln: 0.523 ± 0.172
2.156TrpArg: 2.156 ± 0.366
0.784TrpSer: 0.784 ± 0.233
1.502TrpThr: 1.502 ± 0.299
1.11TrpVal: 1.11 ± 0.246
0.653TrpTrp: 0.653 ± 0.18
0.392TrpTyr: 0.392 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.939TyrAla: 2.939 ± 0.426
0.196TyrCys: 0.196 ± 0.111
1.306TyrAsp: 1.306 ± 0.238
1.502TyrGlu: 1.502 ± 0.324
0.327TyrPhe: 0.327 ± 0.134
2.09TyrGly: 2.09 ± 0.341
0.588TyrHis: 0.588 ± 0.232
0.457TyrIle: 0.457 ± 0.205
0.719TyrLys: 0.719 ± 0.27
1.764TyrLeu: 1.764 ± 0.381
0.392TyrMet: 0.392 ± 0.159
0.914TyrAsn: 0.914 ± 0.286
1.11TyrPro: 1.11 ± 0.331
0.914TyrGln: 0.914 ± 0.206
1.568TyrArg: 1.568 ± 0.372
1.568TyrSer: 1.568 ± 0.301
1.568TyrThr: 1.568 ± 0.29
1.176TyrVal: 1.176 ± 0.246
0.196TyrTrp: 0.196 ± 0.105
0.457TyrTyr: 0.457 ± 0.139
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (15310 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski