Amino acid dipepetide frequency for Escherichia phage Lidtsur

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.323AlaAla: 13.323 ± 1.9
1.438AlaCys: 1.438 ± 0.615
5.148AlaAsp: 5.148 ± 0.808
6.132AlaGlu: 6.132 ± 0.65
3.255AlaPhe: 3.255 ± 0.542
7.116AlaGly: 7.116 ± 0.772
1.893AlaHis: 1.893 ± 0.345
5.223AlaIle: 5.223 ± 0.719
4.845AlaLys: 4.845 ± 0.68
9.084AlaLeu: 9.084 ± 1.089
3.482AlaMet: 3.482 ± 0.713
4.921AlaAsn: 4.921 ± 0.598
3.482AlaPro: 3.482 ± 0.622
4.618AlaGln: 4.618 ± 0.762
6.435AlaArg: 6.435 ± 0.687
5.905AlaSer: 5.905 ± 0.879
6.207AlaThr: 6.207 ± 0.814
7.419AlaVal: 7.419 ± 0.837
1.211AlaTrp: 1.211 ± 0.283
3.785AlaTyr: 3.785 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
1.211CysAla: 1.211 ± 0.443
0.151CysCys: 0.151 ± 0.164
0.379CysAsp: 0.379 ± 0.179
0.757CysGlu: 0.757 ± 0.274
0.151CysPhe: 0.151 ± 0.091
0.908CysGly: 0.908 ± 0.336
0.379CysHis: 0.379 ± 0.178
0.606CysIle: 0.606 ± 0.22
0.606CysLys: 0.606 ± 0.218
0.681CysLeu: 0.681 ± 0.262
0.379CysMet: 0.379 ± 0.167
0.303CysAsn: 0.303 ± 0.153
0.227CysPro: 0.227 ± 0.168
0.454CysGln: 0.454 ± 0.241
0.454CysArg: 0.454 ± 0.188
0.53CysSer: 0.53 ± 0.198
0.606CysThr: 0.606 ± 0.266
0.833CysVal: 0.833 ± 0.331
0.151CysTrp: 0.151 ± 0.103
0.53CysTyr: 0.53 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
5.98AspAla: 5.98 ± 0.629
0.606AspCys: 0.606 ± 0.291
4.921AspAsp: 4.921 ± 0.769
3.709AspGlu: 3.709 ± 0.69
1.968AspPhe: 1.968 ± 0.548
5.299AspGly: 5.299 ± 0.736
0.757AspHis: 0.757 ± 0.302
2.12AspIle: 2.12 ± 0.458
2.725AspLys: 2.725 ± 0.446
4.164AspLeu: 4.164 ± 0.469
1.968AspMet: 1.968 ± 0.681
2.498AspAsn: 2.498 ± 0.37
3.179AspPro: 3.179 ± 0.436
1.136AspGln: 1.136 ± 0.473
2.65AspArg: 2.65 ± 0.395
3.861AspSer: 3.861 ± 0.522
3.861AspThr: 3.861 ± 0.5
3.558AspVal: 3.558 ± 0.443
1.06AspTrp: 1.06 ± 0.305
2.271AspTyr: 2.271 ± 0.424
0.0AspXaa: 0.0 ± 0.0
Glu
6.435GluAla: 6.435 ± 0.863
0.908GluCys: 0.908 ± 0.32
3.179GluAsp: 3.179 ± 0.456
4.391GluGlu: 4.391 ± 0.72
2.725GluPhe: 2.725 ± 0.402
4.088GluGly: 4.088 ± 0.621
1.363GluHis: 1.363 ± 0.27
2.65GluIle: 2.65 ± 0.379
2.65GluLys: 2.65 ± 0.514
4.921GluLeu: 4.921 ± 0.65
1.817GluMet: 1.817 ± 0.389
2.12GluAsn: 2.12 ± 0.421
2.12GluPro: 2.12 ± 0.481
3.861GluGln: 3.861 ± 0.593
3.407GluArg: 3.407 ± 0.426
3.331GluSer: 3.331 ± 0.43
3.179GluThr: 3.179 ± 0.536
4.164GluVal: 4.164 ± 0.699
0.984GluTrp: 0.984 ± 0.221
2.801GluTyr: 2.801 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
2.347PheAla: 2.347 ± 0.424
0.379PheCys: 0.379 ± 0.145
2.65PheAsp: 2.65 ± 0.515
2.12PheGlu: 2.12 ± 0.434
0.833PhePhe: 0.833 ± 0.236
2.271PheGly: 2.271 ± 0.33
0.681PheHis: 0.681 ± 0.182
2.12PheIle: 2.12 ± 0.37
2.044PheLys: 2.044 ± 0.527
2.347PheLeu: 2.347 ± 0.349
1.136PheMet: 1.136 ± 0.267
2.044PheAsn: 2.044 ± 0.468
1.893PhePro: 1.893 ± 0.332
0.984PheGln: 0.984 ± 0.272
1.741PheArg: 1.741 ± 0.455
1.968PheSer: 1.968 ± 0.49
2.044PheThr: 2.044 ± 0.461
2.574PheVal: 2.574 ± 0.386
0.454PheTrp: 0.454 ± 0.184
0.984PheTyr: 0.984 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
7.494GlyAla: 7.494 ± 1.008
0.757GlyCys: 0.757 ± 0.262
4.315GlyAsp: 4.315 ± 0.596
4.164GlyGlu: 4.164 ± 0.577
1.893GlyPhe: 1.893 ± 0.376
6.737GlyGly: 6.737 ± 1.071
1.741GlyHis: 1.741 ± 0.393
4.996GlyIle: 4.996 ± 0.571
3.785GlyLys: 3.785 ± 0.791
6.207GlyLeu: 6.207 ± 0.569
2.044GlyMet: 2.044 ± 0.508
2.574GlyAsn: 2.574 ± 0.408
1.363GlyPro: 1.363 ± 0.32
3.936GlyGln: 3.936 ± 0.563
4.693GlyArg: 4.693 ± 0.499
4.391GlySer: 4.391 ± 0.774
5.753GlyThr: 5.753 ± 1.131
6.662GlyVal: 6.662 ± 0.759
1.363GlyTrp: 1.363 ± 0.284
3.028GlyTyr: 3.028 ± 0.552
0.0GlyXaa: 0.0 ± 0.0
His
1.514HisAla: 1.514 ± 0.36
0.227HisCys: 0.227 ± 0.118
1.514HisAsp: 1.514 ± 0.393
1.06HisGlu: 1.06 ± 0.282
0.681HisPhe: 0.681 ± 0.187
1.741HisGly: 1.741 ± 0.319
0.757HisHis: 0.757 ± 0.305
0.757HisIle: 0.757 ± 0.209
1.438HisLys: 1.438 ± 0.358
1.817HisLeu: 1.817 ± 0.444
0.908HisMet: 0.908 ± 0.246
0.908HisAsn: 0.908 ± 0.227
0.681HisPro: 0.681 ± 0.275
0.681HisGln: 0.681 ± 0.211
0.454HisArg: 0.454 ± 0.189
1.06HisSer: 1.06 ± 0.24
0.757HisThr: 0.757 ± 0.171
1.741HisVal: 1.741 ± 0.506
0.53HisTrp: 0.53 ± 0.225
0.606HisTyr: 0.606 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.618IleAla: 4.618 ± 0.607
0.606IleCys: 0.606 ± 0.315
3.104IleAsp: 3.104 ± 0.469
3.179IleGlu: 3.179 ± 0.448
1.06IlePhe: 1.06 ± 0.234
3.785IleGly: 3.785 ± 0.663
1.06IleHis: 1.06 ± 0.229
2.498IleIle: 2.498 ± 0.555
2.195IleLys: 2.195 ± 0.465
2.498IleLeu: 2.498 ± 0.425
1.514IleMet: 1.514 ± 0.285
2.498IleAsn: 2.498 ± 0.391
2.347IlePro: 2.347 ± 0.547
2.195IleGln: 2.195 ± 0.483
3.407IleArg: 3.407 ± 0.683
2.498IleSer: 2.498 ± 0.38
2.422IleThr: 2.422 ± 0.367
3.331IleVal: 3.331 ± 0.58
0.303IleTrp: 0.303 ± 0.158
1.968IleTyr: 1.968 ± 0.409
0.0IleXaa: 0.0 ± 0.0
Lys
5.753LysAla: 5.753 ± 0.81
0.757LysCys: 0.757 ± 0.262
2.725LysAsp: 2.725 ± 0.599
3.179LysGlu: 3.179 ± 0.485
1.665LysPhe: 1.665 ± 0.31
2.877LysGly: 2.877 ± 0.587
0.606LysHis: 0.606 ± 0.281
0.833LysIle: 0.833 ± 0.256
1.59LysLys: 1.59 ± 0.448
4.088LysLeu: 4.088 ± 0.687
1.136LysMet: 1.136 ± 0.275
0.908LysAsn: 0.908 ± 0.263
2.12LysPro: 2.12 ± 0.371
2.498LysGln: 2.498 ± 0.553
3.104LysArg: 3.104 ± 0.455
3.482LysSer: 3.482 ± 0.527
2.725LysThr: 2.725 ± 0.6
3.785LysVal: 3.785 ± 0.661
0.757LysTrp: 0.757 ± 0.305
1.211LysTyr: 1.211 ± 0.327
0.0LysXaa: 0.0 ± 0.0
Leu
10.144LeuAla: 10.144 ± 0.862
0.379LeuCys: 0.379 ± 0.167
4.921LeuAsp: 4.921 ± 0.586
4.542LeuGlu: 4.542 ± 0.486
3.407LeuPhe: 3.407 ± 0.544
5.45LeuGly: 5.45 ± 0.598
1.968LeuHis: 1.968 ± 0.502
3.255LeuIle: 3.255 ± 0.431
3.331LeuLys: 3.331 ± 0.514
6.435LeuLeu: 6.435 ± 0.872
2.044LeuMet: 2.044 ± 0.458
2.952LeuAsn: 2.952 ± 0.461
4.618LeuPro: 4.618 ± 0.827
4.315LeuGln: 4.315 ± 0.619
4.921LeuArg: 4.921 ± 0.694
5.45LeuSer: 5.45 ± 0.526
5.375LeuThr: 5.375 ± 0.709
4.315LeuVal: 4.315 ± 0.489
1.438LeuTrp: 1.438 ± 0.344
2.12LeuTyr: 2.12 ± 0.399
0.0LeuXaa: 0.0 ± 0.0
Met
3.255MetAla: 3.255 ± 0.529
0.076MetCys: 0.076 ± 0.081
1.59MetAsp: 1.59 ± 0.402
0.833MetGlu: 0.833 ± 0.242
0.833MetPhe: 0.833 ± 0.253
2.12MetGly: 2.12 ± 0.439
0.908MetHis: 0.908 ± 0.35
0.908MetIle: 0.908 ± 0.206
1.287MetLys: 1.287 ± 0.288
2.725MetLeu: 2.725 ± 0.489
1.514MetMet: 1.514 ± 0.464
1.287MetAsn: 1.287 ± 0.412
1.363MetPro: 1.363 ± 0.292
2.12MetGln: 2.12 ± 0.398
1.968MetArg: 1.968 ± 0.437
1.59MetSer: 1.59 ± 0.357
2.65MetThr: 2.65 ± 0.385
1.968MetVal: 1.968 ± 0.475
0.454MetTrp: 0.454 ± 0.24
1.287MetTyr: 1.287 ± 0.219
0.0MetXaa: 0.0 ± 0.0
Asn
4.012AsnAla: 4.012 ± 0.599
0.303AsnCys: 0.303 ± 0.154
1.59AsnAsp: 1.59 ± 0.368
2.044AsnGlu: 2.044 ± 0.342
1.59AsnPhe: 1.59 ± 0.313
3.104AsnGly: 3.104 ± 0.557
0.984AsnHis: 0.984 ± 0.285
3.028AsnIle: 3.028 ± 0.62
2.422AsnLys: 2.422 ± 0.382
3.028AsnLeu: 3.028 ± 0.413
1.514AsnMet: 1.514 ± 0.354
2.271AsnAsn: 2.271 ± 0.428
2.195AsnPro: 2.195 ± 0.479
2.195AsnGln: 2.195 ± 0.534
2.044AsnArg: 2.044 ± 0.291
2.574AsnSer: 2.574 ± 0.531
2.347AsnThr: 2.347 ± 0.418
3.028AsnVal: 3.028 ± 0.573
0.53AsnTrp: 0.53 ± 0.201
0.757AsnTyr: 0.757 ± 0.265
0.0AsnXaa: 0.0 ± 0.0
Pro
4.542ProAla: 4.542 ± 0.482
0.53ProCys: 0.53 ± 0.191
3.331ProAsp: 3.331 ± 0.591
4.164ProGlu: 4.164 ± 0.783
1.136ProPhe: 1.136 ± 0.249
2.952ProGly: 2.952 ± 0.417
0.833ProHis: 0.833 ± 0.271
1.893ProIle: 1.893 ± 0.315
1.287ProLys: 1.287 ± 0.334
2.877ProLeu: 2.877 ± 0.709
0.606ProMet: 0.606 ± 0.209
1.817ProAsn: 1.817 ± 0.523
1.741ProPro: 1.741 ± 0.385
3.028ProGln: 3.028 ± 1.205
1.817ProArg: 1.817 ± 0.253
2.271ProSer: 2.271 ± 0.477
3.104ProThr: 3.104 ± 0.445
2.725ProVal: 2.725 ± 0.436
0.53ProTrp: 0.53 ± 0.191
1.741ProTyr: 1.741 ± 0.471
0.0ProXaa: 0.0 ± 0.0
Gln
5.223GlnAla: 5.223 ± 1.336
0.151GlnCys: 0.151 ± 0.11
2.422GlnAsp: 2.422 ± 0.412
2.422GlnGlu: 2.422 ± 0.475
2.422GlnPhe: 2.422 ± 0.473
3.331GlnGly: 3.331 ± 0.5
0.908GlnHis: 0.908 ± 0.254
2.195GlnIle: 2.195 ± 0.483
2.271GlnLys: 2.271 ± 0.426
4.618GlnLeu: 4.618 ± 0.611
1.817GlnMet: 1.817 ± 0.356
1.59GlnAsn: 1.59 ± 0.356
2.347GlnPro: 2.347 ± 0.971
3.482GlnGln: 3.482 ± 1.14
3.331GlnArg: 3.331 ± 0.613
1.817GlnSer: 1.817 ± 0.425
2.574GlnThr: 2.574 ± 0.557
3.634GlnVal: 3.634 ± 0.749
0.757GlnTrp: 0.757 ± 0.201
1.741GlnTyr: 1.741 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
4.693ArgAla: 4.693 ± 0.976
0.681ArgCys: 0.681 ± 0.243
3.255ArgAsp: 3.255 ± 0.59
3.785ArgGlu: 3.785 ± 0.539
1.893ArgPhe: 1.893 ± 0.293
5.299ArgGly: 5.299 ± 0.445
0.984ArgHis: 0.984 ± 0.323
2.65ArgIle: 2.65 ± 0.453
2.725ArgLys: 2.725 ± 0.39
3.861ArgLeu: 3.861 ± 0.375
1.893ArgMet: 1.893 ± 0.436
1.893ArgAsn: 1.893 ± 0.543
1.741ArgPro: 1.741 ± 0.406
2.725ArgGln: 2.725 ± 0.394
3.709ArgArg: 3.709 ± 0.622
2.498ArgSer: 2.498 ± 0.322
3.255ArgThr: 3.255 ± 0.469
4.618ArgVal: 4.618 ± 0.527
0.681ArgTrp: 0.681 ± 0.254
2.65ArgTyr: 2.65 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
6.662SerAla: 6.662 ± 0.584
0.227SerCys: 0.227 ± 0.128
2.801SerAsp: 2.801 ± 0.526
2.952SerGlu: 2.952 ± 0.402
1.817SerPhe: 1.817 ± 0.446
5.526SerGly: 5.526 ± 0.48
0.833SerHis: 0.833 ± 0.266
3.331SerIle: 3.331 ± 0.484
2.271SerLys: 2.271 ± 0.495
4.921SerLeu: 4.921 ± 0.761
1.741SerMet: 1.741 ± 0.442
3.407SerAsn: 3.407 ± 0.749
3.255SerPro: 3.255 ± 0.524
2.65SerGln: 2.65 ± 0.686
2.498SerArg: 2.498 ± 0.361
3.028SerSer: 3.028 ± 0.768
2.725SerThr: 2.725 ± 0.552
3.482SerVal: 3.482 ± 0.644
0.53SerTrp: 0.53 ± 0.19
2.65SerTyr: 2.65 ± 0.699
0.0SerXaa: 0.0 ± 0.0
Thr
7.494ThrAla: 7.494 ± 0.948
0.681ThrCys: 0.681 ± 0.237
3.709ThrAsp: 3.709 ± 0.614
2.574ThrGlu: 2.574 ± 0.491
2.195ThrPhe: 2.195 ± 0.491
5.678ThrGly: 5.678 ± 0.73
0.681ThrHis: 0.681 ± 0.185
2.952ThrIle: 2.952 ± 0.451
2.952ThrLys: 2.952 ± 0.426
5.526ThrLeu: 5.526 ± 0.686
1.287ThrMet: 1.287 ± 0.413
2.271ThrAsn: 2.271 ± 0.531
2.877ThrPro: 2.877 ± 0.52
2.271ThrGln: 2.271 ± 0.37
2.801ThrArg: 2.801 ± 0.386
3.407ThrSer: 3.407 ± 0.687
3.255ThrThr: 3.255 ± 0.83
4.693ThrVal: 4.693 ± 0.844
0.757ThrTrp: 0.757 ± 0.204
2.574ThrTyr: 2.574 ± 0.485
0.0ThrXaa: 0.0 ± 0.0
Val
6.435ValAla: 6.435 ± 0.752
0.606ValCys: 0.606 ± 0.202
3.709ValAsp: 3.709 ± 0.535
5.678ValGlu: 5.678 ± 0.789
2.271ValPhe: 2.271 ± 0.518
5.526ValGly: 5.526 ± 0.634
1.59ValHis: 1.59 ± 0.39
2.801ValIle: 2.801 ± 0.451
3.558ValLys: 3.558 ± 0.65
6.435ValLeu: 6.435 ± 0.764
1.893ValMet: 1.893 ± 0.418
3.255ValAsn: 3.255 ± 0.573
3.104ValPro: 3.104 ± 0.432
3.634ValGln: 3.634 ± 0.694
3.104ValArg: 3.104 ± 0.54
3.634ValSer: 3.634 ± 0.589
5.223ValThr: 5.223 ± 0.838
5.526ValVal: 5.526 ± 0.893
0.984ValTrp: 0.984 ± 0.255
2.422ValTyr: 2.422 ± 0.378
0.0ValXaa: 0.0 ± 0.0
Trp
0.757TrpAla: 0.757 ± 0.214
0.379TrpCys: 0.379 ± 0.167
1.06TrpAsp: 1.06 ± 0.206
0.833TrpGlu: 0.833 ± 0.295
0.908TrpPhe: 0.908 ± 0.333
0.833TrpGly: 0.833 ± 0.272
0.227TrpHis: 0.227 ± 0.107
0.606TrpIle: 0.606 ± 0.208
0.379TrpLys: 0.379 ± 0.18
1.968TrpLeu: 1.968 ± 0.446
0.227TrpMet: 0.227 ± 0.147
0.379TrpAsn: 0.379 ± 0.146
0.681TrpPro: 0.681 ± 0.213
0.53TrpGln: 0.53 ± 0.177
0.606TrpArg: 0.606 ± 0.209
1.287TrpSer: 1.287 ± 0.291
0.53TrpThr: 0.53 ± 0.171
0.833TrpVal: 0.833 ± 0.246
0.454TrpTrp: 0.454 ± 0.207
0.454TrpTyr: 0.454 ± 0.181
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.028TyrAla: 3.028 ± 0.522
0.454TyrCys: 0.454 ± 0.185
1.968TyrAsp: 1.968 ± 0.361
2.574TyrGlu: 2.574 ± 0.575
0.984TyrPhe: 0.984 ± 0.277
3.255TyrGly: 3.255 ± 0.471
0.681TyrHis: 0.681 ± 0.27
1.665TyrIle: 1.665 ± 0.512
1.438TyrLys: 1.438 ± 0.327
3.179TyrLeu: 3.179 ± 0.608
1.741TyrMet: 1.741 ± 0.315
1.665TyrAsn: 1.665 ± 0.305
1.59TyrPro: 1.59 ± 0.303
1.741TyrGln: 1.741 ± 0.283
2.347TyrArg: 2.347 ± 0.468
2.725TyrSer: 2.725 ± 0.496
2.044TyrThr: 2.044 ± 0.502
2.498TyrVal: 2.498 ± 0.559
0.0TyrTrp: 0.0 ± 0.0
1.211TyrTyr: 1.211 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13211 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski