Amino acid dipepetide frequency for Gordonia phage Dardanus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.325AlaAla: 17.325 ± 1.488
0.747AlaCys: 0.747 ± 0.248
7.841AlaAsp: 7.841 ± 0.853
7.542AlaGlu: 7.542 ± 0.853
3.136AlaPhe: 3.136 ± 0.474
15.757AlaGly: 15.757 ± 1.182
2.315AlaHis: 2.315 ± 0.422
4.929AlaIle: 4.929 ± 0.582
5.153AlaLys: 5.153 ± 0.821
10.081AlaLeu: 10.081 ± 1.297
2.912AlaMet: 2.912 ± 0.485
2.912AlaAsn: 2.912 ± 0.414
5.003AlaPro: 5.003 ± 0.703
5.153AlaGln: 5.153 ± 0.724
10.38AlaArg: 10.38 ± 1.155
5.003AlaSer: 5.003 ± 0.662
7.393AlaThr: 7.393 ± 0.929
9.26AlaVal: 9.26 ± 0.935
2.763AlaTrp: 2.763 ± 0.405
3.286AlaTyr: 3.286 ± 0.573
0.0AlaXaa: 0.0 ± 0.0
Cys
0.672CysAla: 0.672 ± 0.249
0.0CysCys: 0.0 ± 0.0
0.597CysAsp: 0.597 ± 0.187
0.597CysGlu: 0.597 ± 0.23
0.224CysPhe: 0.224 ± 0.151
0.747CysGly: 0.747 ± 0.252
0.299CysHis: 0.299 ± 0.169
0.373CysIle: 0.373 ± 0.162
0.224CysLys: 0.224 ± 0.133
0.299CysLeu: 0.299 ± 0.156
0.075CysMet: 0.075 ± 0.072
0.299CysAsn: 0.299 ± 0.144
0.373CysPro: 0.373 ± 0.221
0.448CysGln: 0.448 ± 0.181
0.299CysArg: 0.299 ± 0.146
0.299CysSer: 0.299 ± 0.133
0.075CysThr: 0.075 ± 0.06
0.672CysVal: 0.672 ± 0.225
0.299CysTrp: 0.299 ± 0.149
0.075CysTyr: 0.075 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
8.588AspAla: 8.588 ± 0.951
0.821AspCys: 0.821 ± 0.253
5.974AspAsp: 5.974 ± 0.809
6.422AspGlu: 6.422 ± 0.721
1.792AspPhe: 1.792 ± 0.349
6.422AspGly: 6.422 ± 0.676
1.344AspHis: 1.344 ± 0.363
1.27AspIle: 1.27 ± 0.336
1.494AspLys: 1.494 ± 0.334
5.825AspLeu: 5.825 ± 0.575
1.568AspMet: 1.568 ± 0.27
1.792AspAsn: 1.792 ± 0.341
3.659AspPro: 3.659 ± 0.544
2.39AspGln: 2.39 ± 0.356
4.257AspArg: 4.257 ± 0.75
3.286AspSer: 3.286 ± 0.505
3.136AspThr: 3.136 ± 0.551
5.601AspVal: 5.601 ± 0.528
1.12AspTrp: 1.12 ± 0.27
1.718AspTyr: 1.718 ± 0.428
0.0AspXaa: 0.0 ± 0.0
Glu
8.289GluAla: 8.289 ± 0.814
0.597GluCys: 0.597 ± 0.255
3.659GluAsp: 3.659 ± 0.608
2.614GluGlu: 2.614 ± 0.437
2.838GluPhe: 2.838 ± 0.383
4.929GluGly: 4.929 ± 0.653
1.195GluHis: 1.195 ± 0.268
1.27GluIle: 1.27 ± 0.283
2.091GluLys: 2.091 ± 0.451
5.899GluLeu: 5.899 ± 0.758
1.643GluMet: 1.643 ± 0.356
1.568GluAsn: 1.568 ± 0.343
2.838GluPro: 2.838 ± 0.493
2.912GluGln: 2.912 ± 0.437
5.302GluArg: 5.302 ± 0.634
3.51GluSer: 3.51 ± 0.43
3.211GluThr: 3.211 ± 0.45
4.854GluVal: 4.854 ± 0.528
1.643GluTrp: 1.643 ± 0.35
1.942GluTyr: 1.942 ± 0.374
0.0GluXaa: 0.0 ± 0.0
Phe
4.257PheAla: 4.257 ± 0.557
0.075PheCys: 0.075 ± 0.07
2.091PheAsp: 2.091 ± 0.356
2.987PheGlu: 2.987 ± 0.476
0.672PhePhe: 0.672 ± 0.22
2.688PheGly: 2.688 ± 0.483
0.523PheHis: 0.523 ± 0.2
0.896PheIle: 0.896 ± 0.22
1.195PheLys: 1.195 ± 0.306
1.344PheLeu: 1.344 ± 0.329
0.597PheMet: 0.597 ± 0.24
0.821PheAsn: 0.821 ± 0.241
0.971PhePro: 0.971 ± 0.282
0.747PheGln: 0.747 ± 0.292
3.51PheArg: 3.51 ± 0.587
1.27PheSer: 1.27 ± 0.329
2.763PheThr: 2.763 ± 0.469
2.091PheVal: 2.091 ± 0.377
0.373PheTrp: 0.373 ± 0.167
0.747PheTyr: 0.747 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
11.202GlyAla: 11.202 ± 1.626
0.821GlyCys: 0.821 ± 0.258
6.945GlyAsp: 6.945 ± 0.564
6.796GlyGlu: 6.796 ± 0.545
2.912GlyPhe: 2.912 ± 0.487
9.633GlyGly: 9.633 ± 1.248
2.091GlyHis: 2.091 ± 0.519
2.838GlyIle: 2.838 ± 0.441
3.958GlyLys: 3.958 ± 0.557
6.646GlyLeu: 6.646 ± 0.854
3.286GlyMet: 3.286 ± 0.487
3.584GlyAsn: 3.584 ± 0.519
4.033GlyPro: 4.033 ± 0.905
3.958GlyGln: 3.958 ± 0.63
8.289GlyArg: 8.289 ± 0.822
4.331GlySer: 4.331 ± 0.54
4.406GlyThr: 4.406 ± 0.689
6.348GlyVal: 6.348 ± 0.751
2.24GlyTrp: 2.24 ± 0.518
2.614GlyTyr: 2.614 ± 0.579
0.0GlyXaa: 0.0 ± 0.0
His
2.614HisAla: 2.614 ± 0.401
0.075HisCys: 0.075 ± 0.079
1.643HisAsp: 1.643 ± 0.421
1.27HisGlu: 1.27 ± 0.338
0.821HisPhe: 0.821 ± 0.217
2.166HisGly: 2.166 ± 0.468
0.597HisHis: 0.597 ± 0.226
0.597HisIle: 0.597 ± 0.232
0.523HisLys: 0.523 ± 0.189
1.195HisLeu: 1.195 ± 0.316
0.597HisMet: 0.597 ± 0.208
0.747HisAsn: 0.747 ± 0.241
1.344HisPro: 1.344 ± 0.405
0.597HisGln: 0.597 ± 0.247
1.12HisArg: 1.12 ± 0.379
0.448HisSer: 0.448 ± 0.164
0.896HisThr: 0.896 ± 0.239
1.718HisVal: 1.718 ± 0.344
0.299HisTrp: 0.299 ± 0.165
0.448HisTyr: 0.448 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
4.63IleAla: 4.63 ± 0.674
0.075IleCys: 0.075 ± 0.068
2.539IleAsp: 2.539 ± 0.467
5.003IleGlu: 5.003 ± 0.592
0.672IlePhe: 0.672 ± 0.203
4.406IleGly: 4.406 ± 0.615
0.597IleHis: 0.597 ± 0.223
1.12IleIle: 1.12 ± 0.271
0.672IleLys: 0.672 ± 0.217
2.688IleLeu: 2.688 ± 0.389
0.299IleMet: 0.299 ± 0.171
0.747IleAsn: 0.747 ± 0.251
1.718IlePro: 1.718 ± 0.31
0.448IleGln: 0.448 ± 0.16
2.763IleArg: 2.763 ± 0.536
1.568IleSer: 1.568 ± 0.346
1.718IleThr: 1.718 ± 0.372
3.062IleVal: 3.062 ± 0.522
0.224IleTrp: 0.224 ± 0.167
0.448IleTyr: 0.448 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
5.227LysAla: 5.227 ± 0.936
0.224LysCys: 0.224 ± 0.148
2.614LysAsp: 2.614 ± 0.469
0.896LysGlu: 0.896 ± 0.314
0.821LysPhe: 0.821 ± 0.239
3.659LysGly: 3.659 ± 0.465
0.523LysHis: 0.523 ± 0.21
1.045LysIle: 1.045 ± 0.274
0.747LysLys: 0.747 ± 0.272
2.987LysLeu: 2.987 ± 0.437
0.523LysMet: 0.523 ± 0.17
1.12LysAsn: 1.12 ± 0.256
2.464LysPro: 2.464 ± 0.43
1.045LysGln: 1.045 ± 0.275
3.36LysArg: 3.36 ± 0.469
2.464LysSer: 2.464 ± 0.4
2.464LysThr: 2.464 ± 0.501
3.062LysVal: 3.062 ± 0.47
0.672LysTrp: 0.672 ± 0.239
0.747LysTyr: 0.747 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
11.276LeuAla: 11.276 ± 0.959
0.896LeuCys: 0.896 ± 0.301
6.348LeuAsp: 6.348 ± 0.705
3.36LeuGlu: 3.36 ± 0.468
2.315LeuPhe: 2.315 ± 0.357
7.468LeuGly: 7.468 ± 0.714
1.12LeuHis: 1.12 ± 0.337
2.39LeuIle: 2.39 ± 0.406
3.584LeuLys: 3.584 ± 0.777
4.182LeuLeu: 4.182 ± 0.607
0.971LeuMet: 0.971 ± 0.292
2.763LeuAsn: 2.763 ± 0.5
3.584LeuPro: 3.584 ± 0.477
1.419LeuGln: 1.419 ± 0.302
4.779LeuArg: 4.779 ± 0.571
3.734LeuSer: 3.734 ± 0.497
5.377LeuThr: 5.377 ± 0.67
5.377LeuVal: 5.377 ± 0.676
1.568LeuTrp: 1.568 ± 0.345
1.942LeuTyr: 1.942 ± 0.307
0.0LeuXaa: 0.0 ± 0.0
Met
3.809MetAla: 3.809 ± 0.473
0.0MetCys: 0.0 ± 0.0
0.971MetAsp: 0.971 ± 0.289
0.672MetGlu: 0.672 ± 0.222
0.224MetPhe: 0.224 ± 0.111
2.091MetGly: 2.091 ± 0.43
0.299MetHis: 0.299 ± 0.147
0.672MetIle: 0.672 ± 0.205
1.27MetLys: 1.27 ± 0.291
0.971MetLeu: 0.971 ± 0.32
0.672MetMet: 0.672 ± 0.224
1.12MetAsn: 1.12 ± 0.282
1.344MetPro: 1.344 ± 0.331
0.672MetGln: 0.672 ± 0.171
1.27MetArg: 1.27 ± 0.363
1.718MetSer: 1.718 ± 0.443
1.718MetThr: 1.718 ± 0.309
1.27MetVal: 1.27 ± 0.32
0.448MetTrp: 0.448 ± 0.201
0.299MetTyr: 0.299 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.809AsnAla: 3.809 ± 0.768
0.075AsnCys: 0.075 ± 0.069
2.24AsnAsp: 2.24 ± 0.461
1.792AsnGlu: 1.792 ± 0.276
1.12AsnPhe: 1.12 ± 0.291
4.033AsnGly: 4.033 ± 0.549
0.597AsnHis: 0.597 ± 0.228
1.27AsnIle: 1.27 ± 0.299
0.672AsnLys: 0.672 ± 0.216
2.39AsnLeu: 2.39 ± 0.465
0.597AsnMet: 0.597 ± 0.191
0.821AsnAsn: 0.821 ± 0.221
2.539AsnPro: 2.539 ± 0.443
0.821AsnGln: 0.821 ± 0.261
1.942AsnArg: 1.942 ± 0.524
1.344AsnSer: 1.344 ± 0.282
1.494AsnThr: 1.494 ± 0.483
2.39AsnVal: 2.39 ± 0.477
0.448AsnTrp: 0.448 ± 0.217
0.373AsnTyr: 0.373 ± 0.169
0.0AsnXaa: 0.0 ± 0.0
Pro
6.497ProAla: 6.497 ± 0.91
0.373ProCys: 0.373 ± 0.186
3.958ProAsp: 3.958 ± 0.493
3.286ProGlu: 3.286 ± 0.511
1.792ProPhe: 1.792 ± 0.309
3.062ProGly: 3.062 ± 0.487
1.12ProHis: 1.12 ± 0.308
2.24ProIle: 2.24 ± 0.445
1.792ProLys: 1.792 ± 0.457
3.136ProLeu: 3.136 ± 0.405
0.523ProMet: 0.523 ± 0.166
1.419ProAsn: 1.419 ± 0.315
1.344ProPro: 1.344 ± 0.33
1.195ProGln: 1.195 ± 0.272
3.51ProArg: 3.51 ± 0.517
1.568ProSer: 1.568 ± 0.361
3.286ProThr: 3.286 ± 0.555
4.033ProVal: 4.033 ± 0.66
1.045ProTrp: 1.045 ± 0.248
1.27ProTyr: 1.27 ± 0.249
0.0ProXaa: 0.0 ± 0.0
Gln
4.406GlnAla: 4.406 ± 0.594
0.149GlnCys: 0.149 ± 0.096
1.344GlnAsp: 1.344 ± 0.327
0.747GlnGlu: 0.747 ± 0.241
0.747GlnPhe: 0.747 ± 0.231
2.763GlnGly: 2.763 ± 0.415
0.597GlnHis: 0.597 ± 0.191
1.12GlnIle: 1.12 ± 0.286
0.971GlnLys: 0.971 ± 0.272
3.286GlnLeu: 3.286 ± 0.444
0.821GlnMet: 0.821 ± 0.258
0.597GlnAsn: 0.597 ± 0.187
1.344GlnPro: 1.344 ± 0.257
1.045GlnGln: 1.045 ± 0.364
2.838GlnArg: 2.838 ± 0.498
1.718GlnSer: 1.718 ± 0.383
2.166GlnThr: 2.166 ± 0.339
2.763GlnVal: 2.763 ± 0.403
0.821GlnTrp: 0.821 ± 0.253
1.27GlnTyr: 1.27 ± 0.346
0.0GlnXaa: 0.0 ± 0.0
Arg
8.214ArgAla: 8.214 ± 0.902
0.299ArgCys: 0.299 ± 0.174
4.331ArgAsp: 4.331 ± 0.644
5.153ArgGlu: 5.153 ± 0.691
2.987ArgPhe: 2.987 ± 0.41
7.02ArgGly: 7.02 ± 0.812
2.24ArgHis: 2.24 ± 0.516
3.435ArgIle: 3.435 ± 0.571
3.36ArgLys: 3.36 ± 0.644
5.675ArgLeu: 5.675 ± 0.623
1.792ArgMet: 1.792 ± 0.348
2.315ArgAsn: 2.315 ± 0.339
2.838ArgPro: 2.838 ± 0.612
2.091ArgGln: 2.091 ± 0.394
6.572ArgArg: 6.572 ± 0.849
3.286ArgSer: 3.286 ± 0.426
4.182ArgThr: 4.182 ± 0.645
6.796ArgVal: 6.796 ± 0.694
1.419ArgTrp: 1.419 ± 0.328
2.39ArgTyr: 2.39 ± 0.477
0.0ArgXaa: 0.0 ± 0.0
Ser
6.124SerAla: 6.124 ± 0.691
0.448SerCys: 0.448 ± 0.175
3.435SerAsp: 3.435 ± 0.506
2.091SerGlu: 2.091 ± 0.386
1.718SerPhe: 1.718 ± 0.399
5.003SerGly: 5.003 ± 0.71
0.523SerHis: 0.523 ± 0.211
2.24SerIle: 2.24 ± 0.377
1.792SerLys: 1.792 ± 0.319
3.734SerLeu: 3.734 ± 0.462
0.896SerMet: 0.896 ± 0.261
1.792SerAsn: 1.792 ± 0.37
1.867SerPro: 1.867 ± 0.331
1.12SerGln: 1.12 ± 0.249
3.958SerArg: 3.958 ± 0.837
3.435SerSer: 3.435 ± 0.709
2.912SerThr: 2.912 ± 0.521
4.705SerVal: 4.705 ± 0.578
1.045SerTrp: 1.045 ± 0.304
0.896SerTyr: 0.896 ± 0.298
0.0SerXaa: 0.0 ± 0.0
Thr
5.974ThrAla: 5.974 ± 1.036
0.373ThrCys: 0.373 ± 0.175
3.734ThrAsp: 3.734 ± 0.49
2.838ThrGlu: 2.838 ± 0.567
2.614ThrPhe: 2.614 ± 0.477
5.302ThrGly: 5.302 ± 0.619
1.045ThrHis: 1.045 ± 0.291
3.136ThrIle: 3.136 ± 0.488
2.614ThrLys: 2.614 ± 0.458
4.779ThrLeu: 4.779 ± 0.639
0.597ThrMet: 0.597 ± 0.23
1.792ThrAsn: 1.792 ± 0.365
3.36ThrPro: 3.36 ± 0.422
1.195ThrGln: 1.195 ± 0.301
3.734ThrArg: 3.734 ± 0.671
3.958ThrSer: 3.958 ± 0.611
3.062ThrThr: 3.062 ± 0.541
5.675ThrVal: 5.675 ± 0.736
0.971ThrTrp: 0.971 ± 0.239
1.195ThrTyr: 1.195 ± 0.367
0.0ThrXaa: 0.0 ± 0.0
Val
11.127ValAla: 11.127 ± 1.142
0.597ValCys: 0.597 ± 0.322
5.227ValAsp: 5.227 ± 0.646
6.124ValGlu: 6.124 ± 0.712
1.867ValPhe: 1.867 ± 0.362
5.75ValGly: 5.75 ± 0.721
1.792ValHis: 1.792 ± 0.358
2.838ValIle: 2.838 ± 0.465
2.987ValLys: 2.987 ± 0.53
6.049ValLeu: 6.049 ± 0.595
1.867ValMet: 1.867 ± 0.346
2.614ValAsn: 2.614 ± 0.385
3.809ValPro: 3.809 ± 0.651
2.464ValGln: 2.464 ± 0.423
4.779ValArg: 4.779 ± 0.704
4.481ValSer: 4.481 ± 0.45
5.601ValThr: 5.601 ± 0.684
6.796ValVal: 6.796 ± 0.68
1.419ValTrp: 1.419 ± 0.389
2.091ValTyr: 2.091 ± 0.532
0.0ValXaa: 0.0 ± 0.0
Trp
2.166TrpAla: 2.166 ± 0.47
0.149TrpCys: 0.149 ± 0.112
1.12TrpAsp: 1.12 ± 0.201
0.747TrpGlu: 0.747 ± 0.21
0.597TrpPhe: 0.597 ± 0.199
1.27TrpGly: 1.27 ± 0.363
0.224TrpHis: 0.224 ± 0.131
1.045TrpIle: 1.045 ± 0.259
0.971TrpLys: 0.971 ± 0.287
1.867TrpLeu: 1.867 ± 0.387
0.448TrpMet: 0.448 ± 0.161
1.045TrpAsn: 1.045 ± 0.305
0.821TrpPro: 0.821 ± 0.323
0.971TrpGln: 0.971 ± 0.267
1.718TrpArg: 1.718 ± 0.398
1.344TrpSer: 1.344 ± 0.352
0.896TrpThr: 0.896 ± 0.272
1.344TrpVal: 1.344 ± 0.333
0.523TrpTrp: 0.523 ± 0.181
0.299TrpTyr: 0.299 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.688TyrAla: 2.688 ± 0.42
0.149TyrCys: 0.149 ± 0.097
1.867TyrAsp: 1.867 ± 0.421
1.867TyrGlu: 1.867 ± 0.346
0.747TyrPhe: 0.747 ± 0.262
2.763TyrGly: 2.763 ± 0.474
0.672TyrHis: 0.672 ± 0.254
0.672TyrIle: 0.672 ± 0.198
0.597TyrLys: 0.597 ± 0.177
1.419TyrLeu: 1.419 ± 0.361
0.747TyrMet: 0.747 ± 0.266
0.896TyrAsn: 0.896 ± 0.243
1.195TyrPro: 1.195 ± 0.349
0.821TyrGln: 0.821 ± 0.21
2.091TyrArg: 2.091 ± 0.546
0.971TyrSer: 0.971 ± 0.276
1.195TyrThr: 1.195 ± 0.201
2.464TyrVal: 2.464 ± 0.406
0.224TyrTrp: 0.224 ± 0.132
0.896TyrTyr: 0.896 ± 0.26
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13392 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski