Amino acid dipepetide frequency for Dickeya phage BF25/12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.083AlaAla: 13.083 ± 1.204
0.756AlaCys: 0.756 ± 0.228
6.579AlaAsp: 6.579 ± 0.663
5.974AlaGlu: 5.974 ± 0.815
2.571AlaPhe: 2.571 ± 0.401
7.26AlaGly: 7.26 ± 0.824
2.42AlaHis: 2.42 ± 0.431
3.933AlaIle: 3.933 ± 0.529
3.933AlaLys: 3.933 ± 0.655
9.151AlaLeu: 9.151 ± 0.917
2.949AlaMet: 2.949 ± 0.446
4.084AlaAsn: 4.084 ± 0.563
3.252AlaPro: 3.252 ± 0.672
4.84AlaGln: 4.84 ± 0.544
5.369AlaArg: 5.369 ± 0.645
4.386AlaSer: 4.386 ± 0.6
6.428AlaThr: 6.428 ± 0.806
6.126AlaVal: 6.126 ± 0.638
1.361AlaTrp: 1.361 ± 0.284
2.798AlaTyr: 2.798 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
0.756CysAla: 0.756 ± 0.249
0.303CysCys: 0.303 ± 0.194
1.134CysAsp: 1.134 ± 0.394
0.454CysGlu: 0.454 ± 0.174
0.303CysPhe: 0.303 ± 0.128
0.756CysGly: 0.756 ± 0.345
0.378CysHis: 0.378 ± 0.202
0.529CysIle: 0.529 ± 0.214
0.681CysLys: 0.681 ± 0.295
0.756CysLeu: 0.756 ± 0.23
0.454CysMet: 0.454 ± 0.176
0.832CysAsn: 0.832 ± 0.264
0.832CysPro: 0.832 ± 0.292
0.454CysGln: 0.454 ± 0.182
0.378CysArg: 0.378 ± 0.162
0.681CysSer: 0.681 ± 0.228
0.756CysThr: 0.756 ± 0.245
0.756CysVal: 0.756 ± 0.237
0.151CysTrp: 0.151 ± 0.107
0.605CysTyr: 0.605 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
6.277AspAla: 6.277 ± 0.76
0.529AspCys: 0.529 ± 0.242
4.386AspAsp: 4.386 ± 0.628
3.176AspGlu: 3.176 ± 0.51
1.966AspPhe: 1.966 ± 0.346
5.218AspGly: 5.218 ± 0.771
0.529AspHis: 0.529 ± 0.194
3.101AspIle: 3.101 ± 0.453
3.328AspLys: 3.328 ± 0.625
5.218AspLeu: 5.218 ± 0.575
2.269AspMet: 2.269 ± 0.398
4.084AspAsn: 4.084 ± 0.533
2.496AspPro: 2.496 ± 0.417
1.664AspGln: 1.664 ± 0.382
2.496AspArg: 2.496 ± 0.481
4.084AspSer: 4.084 ± 0.641
3.933AspThr: 3.933 ± 0.506
4.991AspVal: 4.991 ± 0.529
1.286AspTrp: 1.286 ± 0.256
2.193AspTyr: 2.193 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
5.596GluAla: 5.596 ± 0.759
0.605GluCys: 0.605 ± 0.289
2.798GluAsp: 2.798 ± 0.441
3.403GluGlu: 3.403 ± 0.709
2.874GluPhe: 2.874 ± 0.499
3.403GluGly: 3.403 ± 0.45
1.588GluHis: 1.588 ± 0.442
1.891GluIle: 1.891 ± 0.299
2.042GluLys: 2.042 ± 0.391
4.462GluLeu: 4.462 ± 0.549
1.513GluMet: 1.513 ± 0.291
1.891GluAsn: 1.891 ± 0.292
1.134GluPro: 1.134 ± 0.303
2.496GluGln: 2.496 ± 0.487
3.328GluArg: 3.328 ± 0.517
3.403GluSer: 3.403 ± 0.465
2.874GluThr: 2.874 ± 0.601
3.857GluVal: 3.857 ± 0.612
0.832GluTrp: 0.832 ± 0.248
2.042GluTyr: 2.042 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
2.949PheAla: 2.949 ± 0.36
0.227PheCys: 0.227 ± 0.12
2.723PheAsp: 2.723 ± 0.442
1.059PheGlu: 1.059 ± 0.249
1.059PhePhe: 1.059 ± 0.276
3.328PheGly: 3.328 ± 0.43
0.454PheHis: 0.454 ± 0.165
1.815PheIle: 1.815 ± 0.385
2.193PheLys: 2.193 ± 0.368
2.193PheLeu: 2.193 ± 0.315
0.605PheMet: 0.605 ± 0.21
1.739PheAsn: 1.739 ± 0.285
1.664PhePro: 1.664 ± 0.373
1.059PheGln: 1.059 ± 0.218
1.588PheArg: 1.588 ± 0.336
2.723PheSer: 2.723 ± 0.609
1.437PheThr: 1.437 ± 0.312
1.815PheVal: 1.815 ± 0.349
0.378PheTrp: 0.378 ± 0.151
0.756PheTyr: 0.756 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
6.958GlyAla: 6.958 ± 0.695
1.739GlyCys: 1.739 ± 0.54
4.991GlyAsp: 4.991 ± 0.593
2.949GlyGlu: 2.949 ± 0.421
2.571GlyPhe: 2.571 ± 0.386
5.823GlyGly: 5.823 ± 0.76
0.983GlyHis: 0.983 ± 0.255
5.369GlyIle: 5.369 ± 0.608
4.159GlyLys: 4.159 ± 0.602
5.521GlyLeu: 5.521 ± 0.518
2.118GlyMet: 2.118 ± 0.346
3.252GlyAsn: 3.252 ± 0.412
1.588GlyPro: 1.588 ± 0.435
2.949GlyGln: 2.949 ± 0.594
4.159GlyArg: 4.159 ± 0.614
5.143GlySer: 5.143 ± 0.61
6.428GlyThr: 6.428 ± 1.162
5.596GlyVal: 5.596 ± 0.728
0.832GlyTrp: 0.832 ± 0.24
3.857GlyTyr: 3.857 ± 0.785
0.0GlyXaa: 0.0 ± 0.0
His
1.437HisAla: 1.437 ± 0.336
0.529HisCys: 0.529 ± 0.186
1.21HisAsp: 1.21 ± 0.294
1.21HisGlu: 1.21 ± 0.37
0.454HisPhe: 0.454 ± 0.146
1.891HisGly: 1.891 ± 0.456
0.832HisHis: 0.832 ± 0.383
0.832HisIle: 0.832 ± 0.192
0.908HisLys: 0.908 ± 0.307
2.798HisLeu: 2.798 ± 0.475
0.681HisMet: 0.681 ± 0.271
1.134HisAsn: 1.134 ± 0.283
1.361HisPro: 1.361 ± 0.313
0.756HisGln: 0.756 ± 0.164
1.815HisArg: 1.815 ± 0.356
0.832HisSer: 0.832 ± 0.243
0.605HisThr: 0.605 ± 0.252
1.361HisVal: 1.361 ± 0.454
0.605HisTrp: 0.605 ± 0.209
0.529HisTyr: 0.529 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
3.554IleAla: 3.554 ± 0.562
0.605IleCys: 0.605 ± 0.209
2.496IleAsp: 2.496 ± 0.384
2.269IleGlu: 2.269 ± 0.334
1.059IlePhe: 1.059 ± 0.282
3.252IleGly: 3.252 ± 0.463
1.059IleHis: 1.059 ± 0.326
1.664IleIle: 1.664 ± 0.36
2.42IleLys: 2.42 ± 0.383
4.538IleLeu: 4.538 ± 0.632
0.908IleMet: 0.908 ± 0.263
2.344IleAsn: 2.344 ± 0.363
2.647IlePro: 2.647 ± 0.381
2.344IleGln: 2.344 ± 0.386
2.344IleArg: 2.344 ± 0.451
3.252IleSer: 3.252 ± 0.454
3.781IleThr: 3.781 ± 0.637
3.101IleVal: 3.101 ± 0.419
0.681IleTrp: 0.681 ± 0.246
1.059IleTyr: 1.059 ± 0.268
0.0IleXaa: 0.0 ± 0.0
Lys
5.672LysAla: 5.672 ± 0.815
0.227LysCys: 0.227 ± 0.117
3.252LysAsp: 3.252 ± 0.428
3.479LysGlu: 3.479 ± 0.684
0.529LysPhe: 0.529 ± 0.23
2.874LysGly: 2.874 ± 0.558
1.059LysHis: 1.059 ± 0.321
1.739LysIle: 1.739 ± 0.318
1.361LysLys: 1.361 ± 0.423
5.218LysLeu: 5.218 ± 0.677
1.21LysMet: 1.21 ± 0.28
1.588LysAsn: 1.588 ± 0.379
2.496LysPro: 2.496 ± 0.301
2.496LysGln: 2.496 ± 0.515
2.949LysArg: 2.949 ± 0.699
1.815LysSer: 1.815 ± 0.326
1.966LysThr: 1.966 ± 0.477
3.025LysVal: 3.025 ± 0.449
0.529LysTrp: 0.529 ± 0.241
1.664LysTyr: 1.664 ± 0.348
0.0LysXaa: 0.0 ± 0.0
Leu
8.319LeuAla: 8.319 ± 0.827
1.361LeuCys: 1.361 ± 0.295
5.067LeuAsp: 5.067 ± 0.764
4.764LeuGlu: 4.764 ± 0.578
1.966LeuPhe: 1.966 ± 0.334
6.579LeuGly: 6.579 ± 0.636
2.118LeuHis: 2.118 ± 0.503
3.403LeuIle: 3.403 ± 0.673
4.008LeuLys: 4.008 ± 0.481
6.958LeuLeu: 6.958 ± 0.796
2.496LeuMet: 2.496 ± 0.389
4.538LeuAsn: 4.538 ± 0.537
4.613LeuPro: 4.613 ± 0.597
3.252LeuGln: 3.252 ± 0.559
5.596LeuArg: 5.596 ± 0.742
6.504LeuSer: 6.504 ± 0.73
6.277LeuThr: 6.277 ± 0.758
5.823LeuVal: 5.823 ± 0.63
0.605LeuTrp: 0.605 ± 0.198
2.949LeuTyr: 2.949 ± 0.398
0.0LeuXaa: 0.0 ± 0.0
Met
2.193MetAla: 2.193 ± 0.348
0.151MetCys: 0.151 ± 0.133
1.513MetAsp: 1.513 ± 0.258
0.908MetGlu: 0.908 ± 0.241
0.983MetPhe: 0.983 ± 0.254
2.042MetGly: 2.042 ± 0.314
0.605MetHis: 0.605 ± 0.188
0.681MetIle: 0.681 ± 0.222
1.286MetLys: 1.286 ± 0.352
2.42MetLeu: 2.42 ± 0.399
0.605MetMet: 0.605 ± 0.184
0.832MetAsn: 0.832 ± 0.223
1.891MetPro: 1.891 ± 0.336
1.815MetGln: 1.815 ± 0.481
2.193MetArg: 2.193 ± 0.476
2.118MetSer: 2.118 ± 0.413
1.134MetThr: 1.134 ± 0.258
1.966MetVal: 1.966 ± 0.435
0.303MetTrp: 0.303 ± 0.199
1.513MetTyr: 1.513 ± 0.379
0.0MetXaa: 0.0 ± 0.0
Asn
3.933AsnAla: 3.933 ± 0.552
0.454AsnCys: 0.454 ± 0.203
1.513AsnAsp: 1.513 ± 0.338
1.437AsnGlu: 1.437 ± 0.304
1.891AsnPhe: 1.891 ± 0.39
4.764AsnGly: 4.764 ± 0.605
0.756AsnHis: 0.756 ± 0.207
2.193AsnIle: 2.193 ± 0.341
2.496AsnLys: 2.496 ± 0.371
5.067AsnLeu: 5.067 ± 0.599
1.361AsnMet: 1.361 ± 0.237
1.815AsnAsn: 1.815 ± 0.277
2.42AsnPro: 2.42 ± 0.469
2.118AsnGln: 2.118 ± 0.684
2.571AsnArg: 2.571 ± 0.442
2.571AsnSer: 2.571 ± 0.466
4.008AsnThr: 4.008 ± 0.557
2.949AsnVal: 2.949 ± 0.422
0.454AsnTrp: 0.454 ± 0.218
1.059AsnTyr: 1.059 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
4.311ProAla: 4.311 ± 0.521
0.227ProCys: 0.227 ± 0.135
3.252ProAsp: 3.252 ± 0.577
3.63ProGlu: 3.63 ± 0.517
1.134ProPhe: 1.134 ± 0.323
2.874ProGly: 2.874 ± 0.329
0.378ProHis: 0.378 ± 0.152
1.513ProIle: 1.513 ± 0.356
1.588ProLys: 1.588 ± 0.373
3.025ProLeu: 3.025 ± 0.505
1.513ProMet: 1.513 ± 0.303
1.588ProAsn: 1.588 ± 0.339
1.361ProPro: 1.361 ± 0.315
1.815ProGln: 1.815 ± 0.357
1.739ProArg: 1.739 ± 0.329
2.344ProSer: 2.344 ± 0.422
2.496ProThr: 2.496 ± 0.463
4.008ProVal: 4.008 ± 0.569
0.529ProTrp: 0.529 ± 0.202
1.815ProTyr: 1.815 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
4.386GlnAla: 4.386 ± 0.588
0.076GlnCys: 0.076 ± 0.079
1.966GlnAsp: 1.966 ± 0.442
2.344GlnGlu: 2.344 ± 0.455
1.815GlnPhe: 1.815 ± 0.348
4.084GlnGly: 4.084 ± 0.672
1.361GlnHis: 1.361 ± 0.382
1.891GlnIle: 1.891 ± 0.322
1.21GlnLys: 1.21 ± 0.311
4.235GlnLeu: 4.235 ± 0.526
1.134GlnMet: 1.134 ± 0.304
2.118GlnAsn: 2.118 ± 0.613
1.286GlnPro: 1.286 ± 0.3
2.647GlnGln: 2.647 ± 0.754
2.949GlnArg: 2.949 ± 0.514
3.176GlnSer: 3.176 ± 0.732
2.344GlnThr: 2.344 ± 0.39
3.479GlnVal: 3.479 ± 0.514
0.529GlnTrp: 0.529 ± 0.191
2.344GlnTyr: 2.344 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
4.764ArgAla: 4.764 ± 0.444
0.832ArgCys: 0.832 ± 0.325
4.386ArgAsp: 4.386 ± 0.618
3.328ArgGlu: 3.328 ± 0.625
1.664ArgPhe: 1.664 ± 0.409
3.857ArgGly: 3.857 ± 0.559
1.588ArgHis: 1.588 ± 0.361
3.857ArgIle: 3.857 ± 0.604
2.723ArgLys: 2.723 ± 0.54
3.554ArgLeu: 3.554 ± 0.575
1.664ArgMet: 1.664 ± 0.367
2.647ArgAsn: 2.647 ± 0.496
1.513ArgPro: 1.513 ± 0.289
3.403ArgGln: 3.403 ± 0.517
4.084ArgArg: 4.084 ± 0.514
3.479ArgSer: 3.479 ± 0.572
3.781ArgThr: 3.781 ± 0.422
3.328ArgVal: 3.328 ± 0.514
0.983ArgTrp: 0.983 ± 0.221
1.966ArgTyr: 1.966 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
7.487SerAla: 7.487 ± 0.878
0.529SerCys: 0.529 ± 0.22
2.798SerAsp: 2.798 ± 0.451
2.42SerGlu: 2.42 ± 0.304
2.042SerPhe: 2.042 ± 0.547
5.521SerGly: 5.521 ± 0.61
0.908SerHis: 0.908 ± 0.209
2.723SerIle: 2.723 ± 0.474
3.176SerLys: 3.176 ± 0.548
4.916SerLeu: 4.916 ± 0.542
1.513SerMet: 1.513 ± 0.301
2.949SerAsn: 2.949 ± 0.684
2.269SerPro: 2.269 ± 0.474
2.42SerGln: 2.42 ± 0.484
3.101SerArg: 3.101 ± 0.475
3.479SerSer: 3.479 ± 0.609
4.916SerThr: 4.916 ± 0.535
5.899SerVal: 5.899 ± 0.681
1.059SerTrp: 1.059 ± 0.269
1.664SerTyr: 1.664 ± 0.384
0.0SerXaa: 0.0 ± 0.0
Thr
6.958ThrAla: 6.958 ± 1.048
0.605ThrCys: 0.605 ± 0.2
4.538ThrAsp: 4.538 ± 0.826
2.874ThrGlu: 2.874 ± 0.5
2.193ThrPhe: 2.193 ± 0.372
5.067ThrGly: 5.067 ± 0.744
1.664ThrHis: 1.664 ± 0.355
2.344ThrIle: 2.344 ± 0.462
2.42ThrLys: 2.42 ± 0.484
6.731ThrLeu: 6.731 ± 0.769
0.983ThrMet: 0.983 ± 0.228
2.723ThrAsn: 2.723 ± 0.681
2.798ThrPro: 2.798 ± 0.494
2.344ThrGln: 2.344 ± 0.363
2.798ThrArg: 2.798 ± 0.506
3.781ThrSer: 3.781 ± 0.577
4.386ThrThr: 4.386 ± 1.002
6.201ThrVal: 6.201 ± 0.988
0.756ThrTrp: 0.756 ± 0.219
2.798ThrTyr: 2.798 ± 0.659
0.0ThrXaa: 0.0 ± 0.0
Val
5.823ValAla: 5.823 ± 0.675
1.286ValCys: 1.286 ± 0.381
5.067ValAsp: 5.067 ± 0.561
3.101ValGlu: 3.101 ± 0.5
2.42ValPhe: 2.42 ± 0.378
5.294ValGly: 5.294 ± 0.567
2.042ValHis: 2.042 ± 0.418
3.252ValIle: 3.252 ± 0.51
3.63ValLys: 3.63 ± 0.695
5.974ValLeu: 5.974 ± 0.528
1.513ValMet: 1.513 ± 0.303
3.403ValAsn: 3.403 ± 0.541
3.403ValPro: 3.403 ± 0.426
4.008ValGln: 4.008 ± 0.761
4.613ValArg: 4.613 ± 0.547
4.689ValSer: 4.689 ± 0.617
4.84ValThr: 4.84 ± 0.747
5.596ValVal: 5.596 ± 0.642
0.756ValTrp: 0.756 ± 0.261
2.798ValTyr: 2.798 ± 0.46
0.0ValXaa: 0.0 ± 0.0
Trp
0.908TrpAla: 0.908 ± 0.265
0.151TrpCys: 0.151 ± 0.1
0.832TrpAsp: 0.832 ± 0.321
1.21TrpGlu: 1.21 ± 0.328
0.832TrpPhe: 0.832 ± 0.336
0.756TrpGly: 0.756 ± 0.187
0.227TrpHis: 0.227 ± 0.125
0.529TrpIle: 0.529 ± 0.183
0.605TrpLys: 0.605 ± 0.227
1.21TrpLeu: 1.21 ± 0.311
0.454TrpMet: 0.454 ± 0.198
0.378TrpAsn: 0.378 ± 0.171
0.378TrpPro: 0.378 ± 0.168
0.756TrpGln: 0.756 ± 0.216
0.756TrpArg: 0.756 ± 0.222
0.454TrpSer: 0.454 ± 0.173
0.681TrpThr: 0.681 ± 0.211
0.983TrpVal: 0.983 ± 0.278
0.227TrpTrp: 0.227 ± 0.177
1.059TrpTyr: 1.059 ± 0.307
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.891TyrAla: 1.891 ± 0.354
0.756TyrCys: 0.756 ± 0.309
2.723TyrAsp: 2.723 ± 0.374
2.042TyrGlu: 2.042 ± 0.373
1.437TyrPhe: 1.437 ± 0.321
2.344TyrGly: 2.344 ± 0.423
0.832TyrHis: 0.832 ± 0.217
2.193TyrIle: 2.193 ± 0.333
0.983TyrLys: 0.983 ± 0.269
3.252TyrLeu: 3.252 ± 0.575
1.134TyrMet: 1.134 ± 0.379
1.891TyrAsn: 1.891 ± 0.437
1.739TyrPro: 1.739 ± 0.295
1.739TyrGln: 1.739 ± 0.289
2.571TyrArg: 2.571 ± 0.401
2.874TyrSer: 2.874 ± 0.403
1.891TyrThr: 1.891 ± 0.432
2.647TyrVal: 2.647 ± 0.441
0.529TyrTrp: 0.529 ± 0.226
1.286TyrTyr: 1.286 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (13224 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski