Amino acid dipepetide frequency for Gordonia phage Apricot

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.831AlaAla: 13.831 ± 1.304
0.477AlaCys: 0.477 ± 0.167
6.856AlaAsp: 6.856 ± 0.752
7.75AlaGlu: 7.75 ± 0.67
2.206AlaPhe: 2.206 ± 0.365
8.406AlaGly: 8.406 ± 0.891
1.61AlaHis: 1.61 ± 0.286
4.471AlaIle: 4.471 ± 0.539
4.352AlaLys: 4.352 ± 0.711
9.002AlaLeu: 9.002 ± 0.862
3.339AlaMet: 3.339 ± 0.525
3.339AlaAsn: 3.339 ± 0.582
4.292AlaPro: 4.292 ± 0.448
4.889AlaGln: 4.889 ± 0.601
7.81AlaArg: 7.81 ± 0.703
5.604AlaSer: 5.604 ± 0.531
6.081AlaThr: 6.081 ± 0.627
8.465AlaVal: 8.465 ± 1.079
1.908AlaTrp: 1.908 ± 0.323
3.04AlaTyr: 3.04 ± 0.411
0.0AlaXaa: 0.0 ± 0.0
Cys
0.894CysAla: 0.894 ± 0.22
0.119CysCys: 0.119 ± 0.089
0.715CysAsp: 0.715 ± 0.208
0.596CysGlu: 0.596 ± 0.219
0.179CysPhe: 0.179 ± 0.098
1.312CysGly: 1.312 ± 0.361
0.417CysHis: 0.417 ± 0.17
0.596CysIle: 0.596 ± 0.235
0.596CysLys: 0.596 ± 0.205
0.537CysLeu: 0.537 ± 0.199
0.119CysMet: 0.119 ± 0.089
0.179CysAsn: 0.179 ± 0.127
0.715CysPro: 0.715 ± 0.248
0.238CysGln: 0.238 ± 0.124
0.775CysArg: 0.775 ± 0.226
0.715CysSer: 0.715 ± 0.199
0.477CysThr: 0.477 ± 0.197
0.537CysVal: 0.537 ± 0.183
0.238CysTrp: 0.238 ± 0.11
0.238CysTyr: 0.238 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
7.094AspAla: 7.094 ± 0.681
0.596AspCys: 0.596 ± 0.168
3.994AspAsp: 3.994 ± 0.548
4.829AspGlu: 4.829 ± 0.669
1.669AspPhe: 1.669 ± 0.348
6.021AspGly: 6.021 ± 0.583
1.431AspHis: 1.431 ± 0.298
2.444AspIle: 2.444 ± 0.423
1.49AspLys: 1.49 ± 0.279
5.544AspLeu: 5.544 ± 0.59
1.312AspMet: 1.312 ± 0.347
1.371AspAsn: 1.371 ± 0.3
5.723AspPro: 5.723 ± 0.48
2.385AspGln: 2.385 ± 0.413
4.889AspArg: 4.889 ± 0.587
3.637AspSer: 3.637 ± 0.376
3.219AspThr: 3.219 ± 0.456
4.412AspVal: 4.412 ± 0.755
1.55AspTrp: 1.55 ± 0.298
1.967AspTyr: 1.967 ± 0.327
0.0AspXaa: 0.0 ± 0.0
Glu
7.094GluAla: 7.094 ± 0.76
0.596GluCys: 0.596 ± 0.213
3.458GluAsp: 3.458 ± 0.546
3.517GluGlu: 3.517 ± 0.624
2.444GluPhe: 2.444 ± 0.346
3.339GluGly: 3.339 ± 0.381
1.788GluHis: 1.788 ± 0.468
2.683GluIle: 2.683 ± 0.335
1.967GluLys: 1.967 ± 0.409
5.246GluLeu: 5.246 ± 0.573
1.252GluMet: 1.252 ± 0.297
1.133GluAsn: 1.133 ± 0.216
3.756GluPro: 3.756 ± 0.571
2.027GluGln: 2.027 ± 0.369
5.485GluArg: 5.485 ± 0.623
3.696GluSer: 3.696 ± 0.403
4.054GluThr: 4.054 ± 0.419
4.531GluVal: 4.531 ± 0.692
1.788GluTrp: 1.788 ± 0.342
2.087GluTyr: 2.087 ± 0.322
0.0GluXaa: 0.0 ± 0.0
Phe
2.206PheAla: 2.206 ± 0.29
0.238PheCys: 0.238 ± 0.13
1.967PheAsp: 1.967 ± 0.319
1.55PheGlu: 1.55 ± 0.229
0.537PhePhe: 0.537 ± 0.197
2.683PheGly: 2.683 ± 0.387
0.417PheHis: 0.417 ± 0.158
1.252PheIle: 1.252 ± 0.257
1.013PheLys: 1.013 ± 0.266
1.61PheLeu: 1.61 ± 0.32
0.298PheMet: 0.298 ± 0.151
0.656PheAsn: 0.656 ± 0.215
1.133PhePro: 1.133 ± 0.265
0.894PheGln: 0.894 ± 0.265
1.371PheArg: 1.371 ± 0.257
0.954PheSer: 0.954 ± 0.191
1.431PheThr: 1.431 ± 0.279
2.087PheVal: 2.087 ± 0.345
0.298PheTrp: 0.298 ± 0.134
0.775PheTyr: 0.775 ± 0.217
0.0PheXaa: 0.0 ± 0.0
Gly
6.975GlyAla: 6.975 ± 1.275
1.013GlyCys: 1.013 ± 0.296
5.067GlyAsp: 5.067 ± 0.719
4.71GlyGlu: 4.71 ± 0.494
2.563GlyPhe: 2.563 ± 0.33
7.81GlyGly: 7.81 ± 1.245
1.908GlyHis: 1.908 ± 0.342
3.935GlyIle: 3.935 ± 0.544
2.563GlyLys: 2.563 ± 0.424
7.333GlyLeu: 7.333 ± 0.586
1.908GlyMet: 1.908 ± 0.344
2.385GlyAsn: 2.385 ± 0.459
3.696GlyPro: 3.696 ± 0.428
2.265GlyGln: 2.265 ± 0.384
6.737GlyArg: 6.737 ± 0.495
4.889GlySer: 4.889 ± 0.625
5.604GlyThr: 5.604 ± 0.662
6.617GlyVal: 6.617 ± 0.648
2.742GlyTrp: 2.742 ± 0.459
1.788GlyTyr: 1.788 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.967HisAla: 1.967 ± 0.295
0.298HisCys: 0.298 ± 0.11
1.133HisAsp: 1.133 ± 0.303
1.431HisGlu: 1.431 ± 0.349
0.537HisPhe: 0.537 ± 0.193
1.431HisGly: 1.431 ± 0.305
0.537HisHis: 0.537 ± 0.165
1.371HisIle: 1.371 ± 0.284
0.596HisLys: 0.596 ± 0.207
1.848HisLeu: 1.848 ± 0.431
0.537HisMet: 0.537 ± 0.215
0.715HisAsn: 0.715 ± 0.217
1.133HisPro: 1.133 ± 0.288
0.835HisGln: 0.835 ± 0.234
2.027HisArg: 2.027 ± 0.387
1.073HisSer: 1.073 ± 0.277
1.431HisThr: 1.431 ± 0.306
2.206HisVal: 2.206 ± 0.43
0.775HisTrp: 0.775 ± 0.257
0.715HisTyr: 0.715 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
5.604IleAla: 5.604 ± 0.614
0.298IleCys: 0.298 ± 0.125
4.114IleAsp: 4.114 ± 0.447
2.981IleGlu: 2.981 ± 0.531
0.715IlePhe: 0.715 ± 0.22
3.756IleGly: 3.756 ± 0.526
0.894IleHis: 0.894 ± 0.234
0.894IleIle: 0.894 ± 0.274
1.669IleLys: 1.669 ± 0.31
2.981IleLeu: 2.981 ± 0.348
0.596IleMet: 0.596 ± 0.203
1.431IleAsn: 1.431 ± 0.26
2.325IlePro: 2.325 ± 0.348
1.55IleGln: 1.55 ± 0.292
3.398IleArg: 3.398 ± 0.378
2.027IleSer: 2.027 ± 0.327
3.279IleThr: 3.279 ± 0.409
3.637IleVal: 3.637 ± 0.497
0.715IleTrp: 0.715 ± 0.189
1.013IleTyr: 1.013 ± 0.286
0.0IleXaa: 0.0 ± 0.0
Lys
4.233LysAla: 4.233 ± 0.765
0.477LysCys: 0.477 ± 0.191
1.49LysAsp: 1.49 ± 0.385
1.431LysGlu: 1.431 ± 0.288
1.252LysPhe: 1.252 ± 0.257
2.563LysGly: 2.563 ± 0.385
1.073LysHis: 1.073 ± 0.243
1.729LysIle: 1.729 ± 0.405
1.788LysLys: 1.788 ± 0.302
3.219LysLeu: 3.219 ± 0.421
0.298LysMet: 0.298 ± 0.117
0.835LysAsn: 0.835 ± 0.207
1.55LysPro: 1.55 ± 0.458
1.073LysGln: 1.073 ± 0.378
2.087LysArg: 2.087 ± 0.411
1.788LysSer: 1.788 ± 0.279
2.623LysThr: 2.623 ± 0.353
2.265LysVal: 2.265 ± 0.367
1.133LysTrp: 1.133 ± 0.314
0.656LysTyr: 0.656 ± 0.167
0.0LysXaa: 0.0 ± 0.0
Leu
8.525LeuAla: 8.525 ± 0.635
0.715LeuCys: 0.715 ± 0.229
4.948LeuAsp: 4.948 ± 0.566
4.889LeuGlu: 4.889 ± 0.535
1.073LeuPhe: 1.073 ± 0.226
6.14LeuGly: 6.14 ± 0.872
1.669LeuHis: 1.669 ± 0.298
3.398LeuIle: 3.398 ± 0.364
2.742LeuLys: 2.742 ± 0.482
6.021LeuLeu: 6.021 ± 0.554
1.61LeuMet: 1.61 ± 0.314
3.16LeuAsn: 3.16 ± 0.539
5.306LeuPro: 5.306 ± 0.527
1.967LeuGln: 1.967 ± 0.367
6.081LeuArg: 6.081 ± 0.679
5.485LeuSer: 5.485 ± 0.525
5.187LeuThr: 5.187 ± 0.595
6.677LeuVal: 6.677 ± 0.581
1.55LeuTrp: 1.55 ± 0.353
1.729LeuTyr: 1.729 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
2.265MetAla: 2.265 ± 0.374
0.238MetCys: 0.238 ± 0.123
0.954MetAsp: 0.954 ± 0.195
0.775MetGlu: 0.775 ± 0.208
0.298MetPhe: 0.298 ± 0.117
2.265MetGly: 2.265 ± 0.683
0.358MetHis: 0.358 ± 0.159
0.894MetIle: 0.894 ± 0.275
0.656MetLys: 0.656 ± 0.2
1.49MetLeu: 1.49 ± 0.216
0.238MetMet: 0.238 ± 0.101
0.656MetAsn: 0.656 ± 0.187
1.371MetPro: 1.371 ± 0.298
0.596MetGln: 0.596 ± 0.199
1.908MetArg: 1.908 ± 0.375
3.04MetSer: 3.04 ± 0.409
3.279MetThr: 3.279 ± 0.401
0.715MetVal: 0.715 ± 0.216
0.119MetTrp: 0.119 ± 0.089
0.358MetTyr: 0.358 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
3.696AsnAla: 3.696 ± 0.502
0.417AsnCys: 0.417 ± 0.174
2.325AsnAsp: 2.325 ± 0.404
1.073AsnGlu: 1.073 ± 0.24
0.477AsnPhe: 0.477 ± 0.149
2.981AsnGly: 2.981 ± 0.422
0.537AsnHis: 0.537 ± 0.195
1.133AsnIle: 1.133 ± 0.257
1.073AsnLys: 1.073 ± 0.239
1.848AsnLeu: 1.848 ± 0.342
0.477AsnMet: 0.477 ± 0.16
1.133AsnAsn: 1.133 ± 0.294
2.265AsnPro: 2.265 ± 0.329
0.954AsnGln: 0.954 ± 0.279
2.385AsnArg: 2.385 ± 0.49
1.61AsnSer: 1.61 ± 0.33
1.61AsnThr: 1.61 ± 0.369
2.504AsnVal: 2.504 ± 0.367
0.417AsnTrp: 0.417 ± 0.15
0.537AsnTyr: 0.537 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
5.842ProAla: 5.842 ± 0.5
0.835ProCys: 0.835 ± 0.253
4.292ProAsp: 4.292 ± 0.575
3.16ProGlu: 3.16 ± 0.411
1.55ProPhe: 1.55 ± 0.324
5.067ProGly: 5.067 ± 0.524
1.133ProHis: 1.133 ± 0.26
2.742ProIle: 2.742 ± 0.425
2.027ProLys: 2.027 ± 0.378
3.637ProLeu: 3.637 ± 0.509
1.133ProMet: 1.133 ± 0.293
1.55ProAsn: 1.55 ± 0.417
2.981ProPro: 2.981 ± 0.523
1.729ProGln: 1.729 ± 0.293
3.339ProArg: 3.339 ± 0.509
4.233ProSer: 4.233 ± 0.48
3.875ProThr: 3.875 ± 0.554
4.352ProVal: 4.352 ± 0.553
1.133ProTrp: 1.133 ± 0.233
1.61ProTyr: 1.61 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
3.458GlnAla: 3.458 ± 0.487
0.417GlnCys: 0.417 ± 0.184
1.49GlnAsp: 1.49 ± 0.28
2.385GlnGlu: 2.385 ± 0.328
0.835GlnPhe: 0.835 ± 0.243
1.967GlnGly: 1.967 ± 0.388
0.835GlnHis: 0.835 ± 0.233
1.848GlnIle: 1.848 ± 0.316
0.954GlnLys: 0.954 ± 0.228
2.683GlnLeu: 2.683 ± 0.522
1.133GlnMet: 1.133 ± 0.225
0.894GlnAsn: 0.894 ± 0.233
2.325GlnPro: 2.325 ± 0.346
2.206GlnGln: 2.206 ± 0.376
2.742GlnArg: 2.742 ± 0.454
2.027GlnSer: 2.027 ± 0.295
2.087GlnThr: 2.087 ± 0.395
2.444GlnVal: 2.444 ± 0.407
0.656GlnTrp: 0.656 ± 0.259
0.715GlnTyr: 0.715 ± 0.253
0.0GlnXaa: 0.0 ± 0.0
Arg
7.75ArgAla: 7.75 ± 0.676
1.252ArgCys: 1.252 ± 0.301
5.485ArgAsp: 5.485 ± 0.675
4.65ArgGlu: 4.65 ± 0.532
1.729ArgPhe: 1.729 ± 0.301
5.127ArgGly: 5.127 ± 0.556
2.683ArgHis: 2.683 ± 0.55
4.59ArgIle: 4.59 ± 0.642
2.563ArgLys: 2.563 ± 0.33
6.2ArgLeu: 6.2 ± 0.658
1.848ArgMet: 1.848 ± 0.308
2.146ArgAsn: 2.146 ± 0.42
2.921ArgPro: 2.921 ± 0.473
2.504ArgGln: 2.504 ± 0.372
7.75ArgArg: 7.75 ± 0.941
3.637ArgSer: 3.637 ± 0.627
4.769ArgThr: 4.769 ± 0.5
5.127ArgVal: 5.127 ± 0.69
1.908ArgTrp: 1.908 ± 0.376
2.206ArgTyr: 2.206 ± 0.313
0.0ArgXaa: 0.0 ± 0.0
Ser
6.796SerAla: 6.796 ± 0.716
0.298SerCys: 0.298 ± 0.151
3.696SerAsp: 3.696 ± 0.39
3.219SerGlu: 3.219 ± 0.419
1.312SerPhe: 1.312 ± 0.298
7.214SerGly: 7.214 ± 0.882
1.55SerHis: 1.55 ± 0.295
2.325SerIle: 2.325 ± 0.424
2.087SerLys: 2.087 ± 0.349
5.365SerLeu: 5.365 ± 0.535
1.848SerMet: 1.848 ± 0.344
2.027SerAsn: 2.027 ± 0.337
2.504SerPro: 2.504 ± 0.327
1.967SerGln: 1.967 ± 0.366
4.352SerArg: 4.352 ± 0.574
3.815SerSer: 3.815 ± 0.518
3.219SerThr: 3.219 ± 0.635
4.471SerVal: 4.471 ± 0.591
1.431SerTrp: 1.431 ± 0.22
1.013SerTyr: 1.013 ± 0.22
0.0SerXaa: 0.0 ± 0.0
Thr
8.644ThrAla: 8.644 ± 0.76
0.775ThrCys: 0.775 ± 0.242
4.292ThrAsp: 4.292 ± 0.487
3.398ThrGlu: 3.398 ± 0.382
1.312ThrPhe: 1.312 ± 0.224
5.127ThrGly: 5.127 ± 0.53
1.848ThrHis: 1.848 ± 0.386
3.279ThrIle: 3.279 ± 0.53
1.371ThrLys: 1.371 ± 0.262
5.842ThrLeu: 5.842 ± 0.528
0.954ThrMet: 0.954 ± 0.232
1.848ThrAsn: 1.848 ± 0.322
5.187ThrPro: 5.187 ± 0.56
2.146ThrGln: 2.146 ± 0.324
4.054ThrArg: 4.054 ± 0.441
3.756ThrSer: 3.756 ± 0.56
4.71ThrThr: 4.71 ± 0.564
5.842ThrVal: 5.842 ± 0.551
0.894ThrTrp: 0.894 ± 0.241
1.431ThrTyr: 1.431 ± 0.268
0.0ThrXaa: 0.0 ± 0.0
Val
7.214ValAla: 7.214 ± 0.626
0.477ValCys: 0.477 ± 0.178
5.306ValAsp: 5.306 ± 0.579
6.26ValGlu: 6.26 ± 0.85
2.027ValPhe: 2.027 ± 0.357
5.604ValGly: 5.604 ± 0.728
1.133ValHis: 1.133 ± 0.242
2.265ValIle: 2.265 ± 0.334
2.683ValLys: 2.683 ± 0.376
5.365ValLeu: 5.365 ± 0.643
1.61ValMet: 1.61 ± 0.309
2.504ValAsn: 2.504 ± 0.334
4.65ValPro: 4.65 ± 0.607
2.385ValGln: 2.385 ± 0.401
5.842ValArg: 5.842 ± 0.63
5.962ValSer: 5.962 ± 0.797
5.425ValThr: 5.425 ± 0.555
6.081ValVal: 6.081 ± 0.891
1.669ValTrp: 1.669 ± 0.338
1.967ValTyr: 1.967 ± 0.405
0.0ValXaa: 0.0 ± 0.0
Trp
1.431TrpAla: 1.431 ± 0.39
0.358TrpCys: 0.358 ± 0.146
1.788TrpAsp: 1.788 ± 0.322
1.133TrpGlu: 1.133 ± 0.226
0.238TrpPhe: 0.238 ± 0.131
1.013TrpGly: 1.013 ± 0.202
0.358TrpHis: 0.358 ± 0.141
1.312TrpIle: 1.312 ± 0.299
0.417TrpLys: 0.417 ± 0.21
1.967TrpLeu: 1.967 ± 0.327
1.371TrpMet: 1.371 ± 0.301
0.894TrpAsn: 0.894 ± 0.192
1.133TrpPro: 1.133 ± 0.283
0.835TrpGln: 0.835 ± 0.218
1.669TrpArg: 1.669 ± 0.373
1.073TrpSer: 1.073 ± 0.234
2.325TrpThr: 2.325 ± 0.414
1.55TrpVal: 1.55 ± 0.299
0.537TrpTrp: 0.537 ± 0.194
0.656TrpTyr: 0.656 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.206TyrAla: 2.206 ± 0.332
0.298TyrCys: 0.298 ± 0.153
2.206TyrAsp: 2.206 ± 0.447
2.206TyrGlu: 2.206 ± 0.389
0.477TyrPhe: 0.477 ± 0.16
2.683TyrGly: 2.683 ± 0.384
0.417TyrHis: 0.417 ± 0.147
0.715TyrIle: 0.715 ± 0.202
0.954TyrLys: 0.954 ± 0.246
1.133TyrLeu: 1.133 ± 0.34
0.537TyrMet: 0.537 ± 0.144
0.596TyrAsn: 0.596 ± 0.178
1.252TyrPro: 1.252 ± 0.333
0.656TyrGln: 0.656 ± 0.221
2.146TyrArg: 2.146 ± 0.357
1.49TyrSer: 1.49 ± 0.342
2.027TyrThr: 2.027 ± 0.409
1.908TyrVal: 1.908 ± 0.281
0.596TyrTrp: 0.596 ± 0.225
0.477TyrTyr: 0.477 ± 0.147
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (16775 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski