Amino acid dipeptide frequency for Gordonia phage Doggs

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
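The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function names, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences, alphabet=STANDARD):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping dipeptides are counted within each protein
    (never across protein boundaries), and any letter outside `alphabet`
    is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in alphabet else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    symbols = alphabet + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(symbols, repeat=2)
    }


def uniform_expectation_permille(n_symbols=20):
    """Frequency expected if every dipeptide were equally likely."""
    return 1000.0 / (n_symbols ** 2)
```

With such a function, a dipeptide whose value lies above `uniform_expectation_permille()` (2.5‰ for 20 amino acids) would fall in the overrepresented group, and one below it in the underrepresented group.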

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 20.824‰ for AlaAla corresponds to 0.020824).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
20.824AlaAla: 20.824 ± 2.24
0.809AlaCys: 0.809 ± 0.266
7.211AlaAsp: 7.211 ± 0.943
8.424AlaGlu: 8.424 ± 1.076
2.965AlaPhe: 2.965 ± 0.522
10.58AlaGly: 10.58 ± 1.115
2.493AlaHis: 2.493 ± 0.428
5.122AlaIle: 5.122 ± 0.586
3.976AlaLys: 3.976 ± 0.549
9.165AlaLeu: 9.165 ± 0.705
4.178AlaMet: 4.178 ± 0.573
3.572AlaAsn: 3.572 ± 0.635
6.941AlaPro: 6.941 ± 0.789
5.593AlaGln: 5.593 ± 0.872
7.885AlaArg: 7.885 ± 0.881
6.941AlaSer: 6.941 ± 0.917
8.154AlaThr: 8.154 ± 0.838
7.952AlaVal: 7.952 ± 0.727
1.55AlaTrp: 1.55 ± 0.237
2.628AlaTyr: 2.628 ± 0.363
0.0AlaXaa: 0.0 ± 0.0
Cys
0.943CysAla: 0.943 ± 0.258
0.337CysCys: 0.337 ± 0.165
0.472CysAsp: 0.472 ± 0.186
0.337CysGlu: 0.337 ± 0.179
0.135CysPhe: 0.135 ± 0.104
0.674CysGly: 0.674 ± 0.298
0.202CysHis: 0.202 ± 0.125
0.539CysIle: 0.539 ± 0.225
0.27CysLys: 0.27 ± 0.131
0.607CysLeu: 0.607 ± 0.245
0.135CysMet: 0.135 ± 0.104
0.404CysAsn: 0.404 ± 0.148
0.607CysPro: 0.607 ± 0.203
0.27CysGln: 0.27 ± 0.153
1.078CysArg: 1.078 ± 0.252
0.202CysSer: 0.202 ± 0.111
0.472CysThr: 0.472 ± 0.15
0.27CysVal: 0.27 ± 0.128
0.27CysTrp: 0.27 ± 0.121
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.874AspAla: 6.874 ± 0.649
0.674AspCys: 0.674 ± 0.263
3.774AspAsp: 3.774 ± 0.746
4.919AspGlu: 4.919 ± 0.67
1.55AspPhe: 1.55 ± 0.399
5.661AspGly: 5.661 ± 0.611
1.483AspHis: 1.483 ± 0.446
2.426AspIle: 2.426 ± 0.416
1.483AspLys: 1.483 ± 0.257
5.459AspLeu: 5.459 ± 0.751
0.876AspMet: 0.876 ± 0.208
2.022AspAsn: 2.022 ± 0.283
3.639AspPro: 3.639 ± 0.515
2.561AspGln: 2.561 ± 0.423
4.448AspArg: 4.448 ± 0.632
3.572AspSer: 3.572 ± 0.452
3.976AspThr: 3.976 ± 0.413
4.178AspVal: 4.178 ± 0.555
1.348AspTrp: 1.348 ± 0.342
1.011AspTyr: 1.011 ± 0.208
0.0AspXaa: 0.0 ± 0.0
Glu
7.076GluAla: 7.076 ± 0.855
0.539GluCys: 0.539 ± 0.194
3.909GluAsp: 3.909 ± 0.548
2.291GluGlu: 2.291 ± 0.541
2.089GluPhe: 2.089 ± 0.485
4.313GluGly: 4.313 ± 0.458
0.943GluHis: 0.943 ± 0.284
3.167GluIle: 3.167 ± 0.462
2.022GluLys: 2.022 ± 0.318
5.593GluLeu: 5.593 ± 0.566
1.752GluMet: 1.752 ± 0.289
1.213GluAsn: 1.213 ± 0.234
2.359GluPro: 2.359 ± 0.431
2.965GluGln: 2.965 ± 0.412
5.459GluArg: 5.459 ± 0.723
3.774GluSer: 3.774 ± 0.457
3.167GluThr: 3.167 ± 0.445
4.043GluVal: 4.043 ± 0.54
1.146GluTrp: 1.146 ± 0.307
1.483GluTyr: 1.483 ± 0.269
0.0GluXaa: 0.0 ± 0.0
Phe
2.763PheAla: 2.763 ± 0.55
0.135PheCys: 0.135 ± 0.1
1.887PheAsp: 1.887 ± 0.461
2.022PheGlu: 2.022 ± 0.522
0.809PhePhe: 0.809 ± 0.306
2.83PheGly: 2.83 ± 0.496
0.607PheHis: 0.607 ± 0.217
1.011PheIle: 1.011 ± 0.256
0.404PheLys: 0.404 ± 0.131
2.696PheLeu: 2.696 ± 0.437
0.337PheMet: 0.337 ± 0.161
1.078PheAsn: 1.078 ± 0.23
1.213PhePro: 1.213 ± 0.361
0.741PheGln: 0.741 ± 0.214
1.415PheArg: 1.415 ± 0.299
1.415PheSer: 1.415 ± 0.316
1.887PheThr: 1.887 ± 0.374
2.291PheVal: 2.291 ± 0.468
0.674PheTrp: 0.674 ± 0.217
0.404PheTyr: 0.404 ± 0.122
0.0PheXaa: 0.0 ± 0.0
Gly
8.019GlyAla: 8.019 ± 0.739
0.809GlyCys: 0.809 ± 0.262
6.537GlyAsp: 6.537 ± 0.554
3.235GlyGlu: 3.235 ± 0.452
2.628GlyPhe: 2.628 ± 0.483
7.413GlyGly: 7.413 ± 0.891
2.291GlyHis: 2.291 ± 0.363
3.841GlyIle: 3.841 ± 0.503
2.426GlyLys: 2.426 ± 0.385
7.413GlyLeu: 7.413 ± 0.707
2.022GlyMet: 2.022 ± 0.299
2.426GlyAsn: 2.426 ± 0.487
4.515GlyPro: 4.515 ± 0.551
3.706GlyGln: 3.706 ± 0.513
6.335GlyArg: 6.335 ± 0.619
5.189GlySer: 5.189 ± 0.543
6.065GlyThr: 6.065 ± 0.941
6.537GlyVal: 6.537 ± 0.691
1.28GlyTrp: 1.28 ± 0.25
2.089GlyTyr: 2.089 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
2.291HisAla: 2.291 ± 0.317
0.135HisCys: 0.135 ± 0.084
1.078HisAsp: 1.078 ± 0.298
1.213HisGlu: 1.213 ± 0.327
0.27HisPhe: 0.27 ± 0.152
1.55HisGly: 1.55 ± 0.332
0.674HisHis: 0.674 ± 0.247
1.078HisIle: 1.078 ± 0.26
0.404HisLys: 0.404 ± 0.167
2.359HisLeu: 2.359 ± 0.503
0.135HisMet: 0.135 ± 0.092
0.27HisAsn: 0.27 ± 0.108
1.348HisPro: 1.348 ± 0.302
1.078HisGln: 1.078 ± 0.269
1.415HisArg: 1.415 ± 0.298
1.146HisSer: 1.146 ± 0.287
1.146HisThr: 1.146 ± 0.275
0.943HisVal: 0.943 ± 0.259
0.27HisTrp: 0.27 ± 0.117
0.607HisTyr: 0.607 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
5.526IleAla: 5.526 ± 0.564
0.135IleCys: 0.135 ± 0.095
3.639IleAsp: 3.639 ± 0.526
3.504IleGlu: 3.504 ± 0.68
0.809IlePhe: 0.809 ± 0.29
3.706IleGly: 3.706 ± 0.472
0.809IleHis: 0.809 ± 0.281
1.28IleIle: 1.28 ± 0.289
1.213IleLys: 1.213 ± 0.251
2.696IleLeu: 2.696 ± 0.334
0.674IleMet: 0.674 ± 0.213
1.685IleAsn: 1.685 ± 0.3
2.561IlePro: 2.561 ± 0.426
1.483IleGln: 1.483 ± 0.235
3.167IleArg: 3.167 ± 0.505
2.224IleSer: 2.224 ± 0.556
3.572IleThr: 3.572 ± 0.557
2.561IleVal: 2.561 ± 0.463
0.876IleTrp: 0.876 ± 0.301
1.28IleTyr: 1.28 ± 0.41
0.0IleXaa: 0.0 ± 0.0
Lys
4.246LysAla: 4.246 ± 0.735
0.135LysCys: 0.135 ± 0.093
1.752LysAsp: 1.752 ± 0.356
1.415LysGlu: 1.415 ± 0.343
0.404LysPhe: 0.404 ± 0.146
1.954LysGly: 1.954 ± 0.379
0.337LysHis: 0.337 ± 0.141
1.28LysIle: 1.28 ± 0.283
1.348LysLys: 1.348 ± 0.335
2.628LysLeu: 2.628 ± 0.421
0.404LysMet: 0.404 ± 0.141
0.809LysAsn: 0.809 ± 0.282
1.685LysPro: 1.685 ± 0.282
1.28LysGln: 1.28 ± 0.273
2.156LysArg: 2.156 ± 0.355
2.022LysSer: 2.022 ± 0.319
2.493LysThr: 2.493 ± 0.518
2.628LysVal: 2.628 ± 0.363
0.809LysTrp: 0.809 ± 0.227
0.539LysTyr: 0.539 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
11.254LeuAla: 11.254 ± 0.976
0.539LeuCys: 0.539 ± 0.23
4.987LeuAsp: 4.987 ± 0.657
4.178LeuGlu: 4.178 ± 0.549
2.965LeuPhe: 2.965 ± 0.555
6.065LeuGly: 6.065 ± 0.639
1.011LeuHis: 1.011 ± 0.261
2.965LeuIle: 2.965 ± 0.436
2.628LeuLys: 2.628 ± 0.373
6.469LeuLeu: 6.469 ± 0.747
1.483LeuMet: 1.483 ± 0.375
2.089LeuAsn: 2.089 ± 0.299
4.043LeuPro: 4.043 ± 0.562
2.696LeuGln: 2.696 ± 0.382
6.065LeuArg: 6.065 ± 0.726
4.852LeuSer: 4.852 ± 0.682
6.267LeuThr: 6.267 ± 0.592
7.076LeuVal: 7.076 ± 0.749
1.617LeuTrp: 1.617 ± 0.312
1.348LeuTyr: 1.348 ± 0.259
0.0LeuXaa: 0.0 ± 0.0
Met
2.426MetAla: 2.426 ± 0.335
0.202MetCys: 0.202 ± 0.125
0.876MetAsp: 0.876 ± 0.246
0.809MetGlu: 0.809 ± 0.222
0.943MetPhe: 0.943 ± 0.29
1.82MetGly: 1.82 ± 0.452
0.27MetHis: 0.27 ± 0.143
0.876MetIle: 0.876 ± 0.204
0.337MetLys: 0.337 ± 0.164
1.954MetLeu: 1.954 ± 0.333
0.472MetMet: 0.472 ± 0.199
0.539MetAsn: 0.539 ± 0.201
1.28MetPro: 1.28 ± 0.208
0.741MetGln: 0.741 ± 0.258
1.617MetArg: 1.617 ± 0.263
1.954MetSer: 1.954 ± 0.348
2.561MetThr: 2.561 ± 0.406
1.28MetVal: 1.28 ± 0.257
0.27MetTrp: 0.27 ± 0.126
0.27MetTyr: 0.27 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
3.504AsnAla: 3.504 ± 0.524
0.067AsnCys: 0.067 ± 0.07
1.55AsnAsp: 1.55 ± 0.412
1.078AsnGlu: 1.078 ± 0.3
0.539AsnPhe: 0.539 ± 0.211
3.1AsnGly: 3.1 ± 0.557
0.741AsnHis: 0.741 ± 0.234
1.28AsnIle: 1.28 ± 0.278
0.607AsnLys: 0.607 ± 0.198
2.156AsnLeu: 2.156 ± 0.389
0.27AsnMet: 0.27 ± 0.117
0.741AsnAsn: 0.741 ± 0.197
2.83AsnPro: 2.83 ± 0.448
0.876AsnGln: 0.876 ± 0.188
2.022AsnArg: 2.022 ± 0.422
1.348AsnSer: 1.348 ± 0.316
1.617AsnThr: 1.617 ± 0.354
1.617AsnVal: 1.617 ± 0.334
0.674AsnTrp: 0.674 ± 0.191
0.472AsnTyr: 0.472 ± 0.183
0.0AsnXaa: 0.0 ± 0.0
Pro
7.885ProAla: 7.885 ± 0.611
0.539ProCys: 0.539 ± 0.167
3.976ProAsp: 3.976 ± 0.563
3.774ProGlu: 3.774 ± 0.395
1.348ProPhe: 1.348 ± 0.299
5.796ProGly: 5.796 ± 0.734
0.809ProHis: 0.809 ± 0.241
2.628ProIle: 2.628 ± 0.547
1.752ProLys: 1.752 ± 0.437
3.437ProLeu: 3.437 ± 0.485
1.146ProMet: 1.146 ± 0.278
1.82ProAsn: 1.82 ± 0.323
3.167ProPro: 3.167 ± 0.686
1.685ProGln: 1.685 ± 0.35
4.919ProArg: 4.919 ± 0.811
3.167ProSer: 3.167 ± 0.467
3.572ProThr: 3.572 ± 0.521
3.774ProVal: 3.774 ± 0.587
1.28ProTrp: 1.28 ± 0.271
0.809ProTyr: 0.809 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
4.313GlnAla: 4.313 ± 0.981
0.27GlnCys: 0.27 ± 0.14
1.685GlnAsp: 1.685 ± 0.29
2.022GlnGlu: 2.022 ± 0.366
1.617GlnPhe: 1.617 ± 0.349
2.898GlnGly: 2.898 ± 0.457
0.809GlnHis: 0.809 ± 0.241
1.752GlnIle: 1.752 ± 0.334
1.28GlnLys: 1.28 ± 0.368
3.976GlnLeu: 3.976 ± 0.49
0.943GlnMet: 0.943 ± 0.275
0.337GlnAsn: 0.337 ± 0.147
2.493GlnPro: 2.493 ± 0.354
2.022GlnGln: 2.022 ± 0.583
3.437GlnArg: 3.437 ± 0.449
2.493GlnSer: 2.493 ± 0.435
1.685GlnThr: 1.685 ± 0.377
2.359GlnVal: 2.359 ± 0.473
0.876GlnTrp: 0.876 ± 0.278
1.213GlnTyr: 1.213 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
7.952ArgAla: 7.952 ± 0.677
1.078ArgCys: 1.078 ± 0.31
4.246ArgAsp: 4.246 ± 0.69
4.785ArgGlu: 4.785 ± 0.578
1.617ArgPhe: 1.617 ± 0.341
4.785ArgGly: 4.785 ± 0.544
1.213ArgHis: 1.213 ± 0.297
3.774ArgIle: 3.774 ± 0.687
3.1ArgLys: 3.1 ± 0.558
5.93ArgLeu: 5.93 ± 0.638
2.156ArgMet: 2.156 ± 0.326
2.224ArgAsn: 2.224 ± 0.312
3.909ArgPro: 3.909 ± 0.839
2.898ArgGln: 2.898 ± 0.523
7.682ArgArg: 7.682 ± 0.798
4.313ArgSer: 4.313 ± 0.595
4.717ArgThr: 4.717 ± 0.43
5.526ArgVal: 5.526 ± 0.605
1.483ArgTrp: 1.483 ± 0.322
1.752ArgTyr: 1.752 ± 0.304
0.0ArgXaa: 0.0 ± 0.0
Ser
7.548SerAla: 7.548 ± 0.75
0.337SerCys: 0.337 ± 0.174
2.763SerAsp: 2.763 ± 0.442
3.909SerGlu: 3.909 ± 0.475
1.213SerPhe: 1.213 ± 0.285
6.065SerGly: 6.065 ± 0.66
1.213SerHis: 1.213 ± 0.318
3.033SerIle: 3.033 ± 0.446
2.022SerLys: 2.022 ± 0.405
3.706SerLeu: 3.706 ± 0.594
1.415SerMet: 1.415 ± 0.318
1.55SerAsn: 1.55 ± 0.315
3.572SerPro: 3.572 ± 0.491
2.022SerGln: 2.022 ± 0.398
4.043SerArg: 4.043 ± 0.552
3.909SerSer: 3.909 ± 0.565
3.841SerThr: 3.841 ± 0.482
4.38SerVal: 4.38 ± 0.363
1.28SerTrp: 1.28 ± 0.279
1.011SerTyr: 1.011 ± 0.258
0.0SerXaa: 0.0 ± 0.0
Thr
8.828ThrAla: 8.828 ± 1.107
0.607ThrCys: 0.607 ± 0.163
4.583ThrAsp: 4.583 ± 0.608
4.043ThrGlu: 4.043 ± 0.514
1.28ThrPhe: 1.28 ± 0.273
6.739ThrGly: 6.739 ± 0.668
1.213ThrHis: 1.213 ± 0.231
2.426ThrIle: 2.426 ± 0.4
1.752ThrLys: 1.752 ± 0.416
5.863ThrLeu: 5.863 ± 0.481
0.943ThrMet: 0.943 ± 0.287
1.483ThrAsn: 1.483 ± 0.34
5.324ThrPro: 5.324 ± 0.629
1.752ThrGln: 1.752 ± 0.325
3.437ThrArg: 3.437 ± 0.477
3.639ThrSer: 3.639 ± 0.466
4.987ThrThr: 4.987 ± 0.622
6.2ThrVal: 6.2 ± 0.683
1.146ThrTrp: 1.146 ± 0.292
1.82ThrTyr: 1.82 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
9.974ValAla: 9.974 ± 0.959
0.337ValCys: 0.337 ± 0.189
4.111ValAsp: 4.111 ± 0.508
4.448ValGlu: 4.448 ± 0.671
1.954ValPhe: 1.954 ± 0.341
5.796ValGly: 5.796 ± 0.611
1.213ValHis: 1.213 ± 0.261
3.235ValIle: 3.235 ± 0.5
2.089ValLys: 2.089 ± 0.295
5.324ValLeu: 5.324 ± 0.82
1.348ValMet: 1.348 ± 0.25
1.954ValAsn: 1.954 ± 0.287
3.706ValPro: 3.706 ± 0.516
2.493ValGln: 2.493 ± 0.392
5.593ValArg: 5.593 ± 0.611
5.122ValSer: 5.122 ± 0.507
5.256ValThr: 5.256 ± 0.565
5.324ValVal: 5.324 ± 0.547
1.483ValTrp: 1.483 ± 0.298
1.078ValTyr: 1.078 ± 0.261
0.0ValXaa: 0.0 ± 0.0
Trp
1.685TrpAla: 1.685 ± 0.314
0.472TrpCys: 0.472 ± 0.175
1.617TrpAsp: 1.617 ± 0.335
1.348TrpGlu: 1.348 ± 0.381
0.741TrpPhe: 0.741 ± 0.214
1.28TrpGly: 1.28 ± 0.257
0.539TrpHis: 0.539 ± 0.184
0.674TrpIle: 0.674 ± 0.191
0.607TrpLys: 0.607 ± 0.17
1.483TrpLeu: 1.483 ± 0.309
0.472TrpMet: 0.472 ± 0.164
0.27TrpAsn: 0.27 ± 0.145
0.809TrpPro: 0.809 ± 0.227
0.876TrpGln: 0.876 ± 0.246
1.617TrpArg: 1.617 ± 0.263
1.011TrpSer: 1.011 ± 0.264
1.28TrpThr: 1.28 ± 0.321
1.617TrpVal: 1.617 ± 0.294
0.472TrpTrp: 0.472 ± 0.18
0.472TrpTyr: 0.472 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.033TyrAla: 3.033 ± 0.436
0.067TyrCys: 0.067 ± 0.061
1.28TyrAsp: 1.28 ± 0.31
1.55TyrGlu: 1.55 ± 0.326
0.741TyrPhe: 0.741 ± 0.216
1.752TyrGly: 1.752 ± 0.334
0.539TyrHis: 0.539 ± 0.215
1.011TyrIle: 1.011 ± 0.336
0.607TyrLys: 0.607 ± 0.184
1.685TyrLeu: 1.685 ± 0.333
0.27TyrMet: 0.27 ± 0.122
0.607TyrAsn: 0.607 ± 0.193
1.28TyrPro: 1.28 ± 0.284
0.741TyrGln: 0.741 ± 0.237
1.348TyrArg: 1.348 ± 0.306
0.607TyrSer: 0.607 ± 0.174
1.348TyrThr: 1.348 ± 0.321
1.28TyrVal: 1.28 ± 0.351
0.539TyrTrp: 0.539 ± 0.239
0.607TyrTyr: 0.607 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (14840 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
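A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the procedure actually used for this table: the function names are hypothetical, whole proteins are resampled with replacement, and the error is taken as the standard deviation across replicates.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide's frequency.

    Assumption: each replicate draws len(sequences) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is why the spread tends to be larger for dipeptides concentrated in a few proteins.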

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski