Amino acid dipepetide frequency for Campylobacter phage DA10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.812AlaAla: 0.812 ± 0.281
0.361AlaCys: 0.361 ± 0.18
2.345AlaAsp: 2.345 ± 0.483
2.976AlaGlu: 2.976 ± 0.633
2.976AlaPhe: 2.976 ± 0.534
3.878AlaGly: 3.878 ± 0.599
0.271AlaHis: 0.271 ± 0.155
5.051AlaIle: 5.051 ± 0.633
6.133AlaLys: 6.133 ± 0.947
5.682AlaLeu: 5.682 ± 1.094
1.714AlaMet: 1.714 ± 0.451
4.6AlaAsn: 4.6 ± 0.877
0.992AlaPro: 0.992 ± 0.297
1.443AlaGln: 1.443 ± 0.362
1.443AlaArg: 1.443 ± 0.4
3.788AlaSer: 3.788 ± 0.761
3.067AlaThr: 3.067 ± 0.586
2.616AlaVal: 2.616 ± 0.51
0.361AlaTrp: 0.361 ± 0.177
1.353AlaTyr: 1.353 ± 0.312
0.0AlaXaa: 0.0 ± 0.0
Cys
0.18CysAla: 0.18 ± 0.12
0.0CysCys: 0.0 ± 0.0
1.082CysAsp: 1.082 ± 0.336
0.812CysGlu: 0.812 ± 0.295
0.902CysPhe: 0.902 ± 0.302
0.722CysGly: 0.722 ± 0.202
0.09CysHis: 0.09 ± 0.096
0.631CysIle: 0.631 ± 0.272
1.624CysLys: 1.624 ± 0.322
1.533CysLeu: 1.533 ± 0.345
0.18CysMet: 0.18 ± 0.112
0.902CysAsn: 0.902 ± 0.26
0.18CysPro: 0.18 ± 0.12
0.09CysGln: 0.09 ± 0.086
0.361CysArg: 0.361 ± 0.158
0.541CysSer: 0.541 ± 0.262
0.631CysThr: 0.631 ± 0.237
0.902CysVal: 0.902 ± 0.268
0.09CysTrp: 0.09 ± 0.077
0.722CysTyr: 0.722 ± 0.241
0.0CysXaa: 0.0 ± 0.0
Asp
2.165AspAla: 2.165 ± 0.473
0.631AspCys: 0.631 ± 0.267
5.953AspAsp: 5.953 ± 0.876
4.239AspGlu: 4.239 ± 0.815
4.78AspPhe: 4.78 ± 0.647
2.255AspGly: 2.255 ± 0.465
0.271AspHis: 0.271 ± 0.171
5.412AspIle: 5.412 ± 0.779
4.871AspLys: 4.871 ± 0.783
7.667AspLeu: 7.667 ± 0.774
1.804AspMet: 1.804 ± 0.385
5.682AspAsn: 5.682 ± 0.617
0.361AspPro: 0.361 ± 0.236
0.271AspGln: 0.271 ± 0.135
1.533AspArg: 1.533 ± 0.455
3.788AspSer: 3.788 ± 0.569
2.796AspThr: 2.796 ± 0.578
2.345AspVal: 2.345 ± 0.447
0.631AspTrp: 0.631 ± 0.225
2.886AspTyr: 2.886 ± 0.547
0.0AspXaa: 0.0 ± 0.0
Glu
3.969GluAla: 3.969 ± 0.688
0.631GluCys: 0.631 ± 0.232
3.878GluAsp: 3.878 ± 0.622
2.796GluGlu: 2.796 ± 0.555
4.51GluPhe: 4.51 ± 0.577
1.984GluGly: 1.984 ± 0.441
0.451GluHis: 0.451 ± 0.236
6.494GluIle: 6.494 ± 0.641
7.306GluLys: 7.306 ± 1.041
7.486GluLeu: 7.486 ± 0.878
1.173GluMet: 1.173 ± 0.352
7.757GluAsn: 7.757 ± 1.069
1.263GluPro: 1.263 ± 0.321
1.624GluGln: 1.624 ± 0.319
2.616GluArg: 2.616 ± 0.697
3.067GluSer: 3.067 ± 0.444
4.059GluThr: 4.059 ± 0.534
4.239GluVal: 4.239 ± 0.554
0.812GluTrp: 0.812 ± 0.241
3.788GluTyr: 3.788 ± 0.453
0.0GluXaa: 0.0 ± 0.0
Phe
2.345PheAla: 2.345 ± 0.514
0.992PheCys: 0.992 ± 0.301
3.608PheAsp: 3.608 ± 0.631
4.78PheGlu: 4.78 ± 0.667
2.616PhePhe: 2.616 ± 0.459
3.608PheGly: 3.608 ± 0.662
0.271PheHis: 0.271 ± 0.183
4.059PheIle: 4.059 ± 0.468
6.133PheLys: 6.133 ± 0.714
5.141PheLeu: 5.141 ± 0.775
1.714PheMet: 1.714 ± 0.354
6.404PheAsn: 6.404 ± 0.822
0.812PhePro: 0.812 ± 0.251
1.533PheGln: 1.533 ± 0.381
1.984PheArg: 1.984 ± 0.463
3.788PheSer: 3.788 ± 0.635
2.886PheThr: 2.886 ± 0.531
2.255PheVal: 2.255 ± 0.567
0.361PheTrp: 0.361 ± 0.181
2.886PheTyr: 2.886 ± 0.58
0.0PheXaa: 0.0 ± 0.0
Gly
2.976GlyAla: 2.976 ± 0.553
1.082GlyCys: 1.082 ± 0.302
2.886GlyAsp: 2.886 ± 0.533
2.886GlyGlu: 2.886 ± 0.451
2.886GlyPhe: 2.886 ± 0.576
2.345GlyGly: 2.345 ± 0.548
0.631GlyHis: 0.631 ± 0.276
3.427GlyIle: 3.427 ± 0.42
3.878GlyLys: 3.878 ± 0.567
4.78GlyLeu: 4.78 ± 0.644
1.533GlyMet: 1.533 ± 0.381
3.698GlyAsn: 3.698 ± 0.613
0.18GlyPro: 0.18 ± 0.131
1.804GlyGln: 1.804 ± 0.506
1.533GlyArg: 1.533 ± 0.335
3.878GlySer: 3.878 ± 0.774
1.894GlyThr: 1.894 ± 0.45
3.518GlyVal: 3.518 ± 0.656
0.451GlyTrp: 0.451 ± 0.196
2.435GlyTyr: 2.435 ± 0.419
0.0GlyXaa: 0.0 ± 0.0
His
0.09HisAla: 0.09 ± 0.079
0.0HisCys: 0.0 ± 0.0
0.451HisAsp: 0.451 ± 0.199
0.361HisGlu: 0.361 ± 0.173
0.271HisPhe: 0.271 ± 0.191
0.18HisGly: 0.18 ± 0.139
0.0HisHis: 0.0 ± 0.0
0.541HisIle: 0.541 ± 0.229
0.631HisLys: 0.631 ± 0.206
0.812HisLeu: 0.812 ± 0.306
0.0HisMet: 0.0 ± 0.0
0.812HisAsn: 0.812 ± 0.279
0.271HisPro: 0.271 ± 0.161
0.09HisGln: 0.09 ± 0.1
0.18HisArg: 0.18 ± 0.115
0.451HisSer: 0.451 ± 0.178
0.361HisThr: 0.361 ± 0.162
0.18HisVal: 0.18 ± 0.11
0.09HisTrp: 0.09 ± 0.085
0.631HisTyr: 0.631 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
3.608IleAla: 3.608 ± 0.578
1.173IleCys: 1.173 ± 0.255
4.149IleAsp: 4.149 ± 0.518
6.945IleGlu: 6.945 ± 0.715
4.149IlePhe: 4.149 ± 0.639
2.976IleGly: 2.976 ± 0.499
0.812IleHis: 0.812 ± 0.244
7.667IleIle: 7.667 ± 0.877
8.929IleLys: 8.929 ± 0.89
8.478IleLeu: 8.478 ± 0.845
1.714IleMet: 1.714 ± 0.398
8.569IleAsn: 8.569 ± 1.151
2.525IlePro: 2.525 ± 0.571
3.608IleGln: 3.608 ± 0.58
2.075IleArg: 2.075 ± 0.399
7.486IleSer: 7.486 ± 0.7
5.773IleThr: 5.773 ± 0.859
4.871IleVal: 4.871 ± 0.935
0.451IleTrp: 0.451 ± 0.209
3.247IleTyr: 3.247 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
5.773LysAla: 5.773 ± 0.863
1.173LysCys: 1.173 ± 0.347
7.757LysAsp: 7.757 ± 0.945
8.478LysGlu: 8.478 ± 1.201
4.149LysPhe: 4.149 ± 0.617
4.059LysGly: 4.059 ± 0.575
1.082LysHis: 1.082 ± 0.296
10.733LysIle: 10.733 ± 1.262
8.208LysLys: 8.208 ± 1.037
7.847LysLeu: 7.847 ± 0.843
1.533LysMet: 1.533 ± 0.322
9.29LysAsn: 9.29 ± 1.068
2.976LysPro: 2.976 ± 0.435
3.608LysGln: 3.608 ± 0.57
2.345LysArg: 2.345 ± 0.451
4.329LysSer: 4.329 ± 0.556
4.51LysThr: 4.51 ± 0.653
4.69LysVal: 4.69 ± 0.662
0.722LysTrp: 0.722 ± 0.249
4.329LysTyr: 4.329 ± 0.638
0.0LysXaa: 0.0 ± 0.0
Leu
5.231LeuAla: 5.231 ± 0.843
1.173LeuCys: 1.173 ± 0.383
4.961LeuAsp: 4.961 ± 0.734
7.576LeuGlu: 7.576 ± 0.98
4.329LeuPhe: 4.329 ± 0.582
4.961LeuGly: 4.961 ± 0.645
0.361LeuHis: 0.361 ± 0.165
8.478LeuIle: 8.478 ± 0.778
11.545LeuLys: 11.545 ± 1.026
7.125LeuLeu: 7.125 ± 0.889
2.435LeuMet: 2.435 ± 0.414
8.569LeuAsn: 8.569 ± 0.812
2.345LeuPro: 2.345 ± 0.448
3.157LeuGln: 3.157 ± 0.637
2.796LeuArg: 2.796 ± 0.506
6.404LeuSer: 6.404 ± 0.673
4.961LeuThr: 4.961 ± 0.982
3.788LeuVal: 3.788 ± 0.612
0.722LeuTrp: 0.722 ± 0.329
3.247LeuTyr: 3.247 ± 0.634
0.0LeuXaa: 0.0 ± 0.0
Met
1.804MetAla: 1.804 ± 0.406
0.18MetCys: 0.18 ± 0.131
0.722MetAsp: 0.722 ± 0.242
1.714MetGlu: 1.714 ± 0.36
1.263MetPhe: 1.263 ± 0.337
0.902MetGly: 0.902 ± 0.318
0.09MetHis: 0.09 ± 0.081
1.353MetIle: 1.353 ± 0.366
2.616MetLys: 2.616 ± 0.498
1.894MetLeu: 1.894 ± 0.4
0.18MetMet: 0.18 ± 0.13
0.992MetAsn: 0.992 ± 0.303
0.902MetPro: 0.902 ± 0.3
1.173MetGln: 1.173 ± 0.383
0.18MetArg: 0.18 ± 0.134
1.804MetSer: 1.804 ± 0.366
0.631MetThr: 0.631 ± 0.241
1.443MetVal: 1.443 ± 0.345
0.09MetTrp: 0.09 ± 0.086
0.992MetTyr: 0.992 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
4.961AsnAla: 4.961 ± 0.814
1.173AsnCys: 1.173 ± 0.318
3.878AsnAsp: 3.878 ± 0.674
6.855AsnGlu: 6.855 ± 0.848
4.6AsnPhe: 4.6 ± 0.8
4.961AsnGly: 4.961 ± 0.763
0.451AsnHis: 0.451 ± 0.25
8.478AsnIle: 8.478 ± 1.264
8.208AsnLys: 8.208 ± 0.982
8.478AsnLeu: 8.478 ± 1.059
1.624AsnMet: 1.624 ± 0.364
8.388AsnAsn: 8.388 ± 1.145
2.435AsnPro: 2.435 ± 0.455
4.78AsnGln: 4.78 ± 0.747
2.165AsnArg: 2.165 ± 0.431
6.314AsnSer: 6.314 ± 0.84
3.969AsnThr: 3.969 ± 0.562
3.969AsnVal: 3.969 ± 0.592
0.631AsnTrp: 0.631 ± 0.269
3.698AsnTyr: 3.698 ± 0.598
0.0AsnXaa: 0.0 ± 0.0
Pro
1.263ProAla: 1.263 ± 0.347
0.361ProCys: 0.361 ± 0.183
0.812ProAsp: 0.812 ± 0.267
1.082ProGlu: 1.082 ± 0.261
2.345ProPhe: 2.345 ± 0.452
0.361ProGly: 0.361 ± 0.165
0.451ProHis: 0.451 ± 0.189
2.165ProIle: 2.165 ± 0.563
1.624ProLys: 1.624 ± 0.356
2.886ProLeu: 2.886 ± 0.542
0.271ProMet: 0.271 ± 0.147
1.894ProAsn: 1.894 ± 0.534
0.722ProPro: 0.722 ± 0.26
0.631ProGln: 0.631 ± 0.183
0.541ProArg: 0.541 ± 0.229
2.616ProSer: 2.616 ± 0.505
0.902ProThr: 0.902 ± 0.258
0.722ProVal: 0.722 ± 0.264
0.09ProTrp: 0.09 ± 0.099
1.443ProTyr: 1.443 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
2.525GlnAla: 2.525 ± 0.581
0.18GlnCys: 0.18 ± 0.119
3.788GlnAsp: 3.788 ± 0.644
2.616GlnGlu: 2.616 ± 0.466
1.894GlnPhe: 1.894 ± 0.437
1.984GlnGly: 1.984 ± 0.433
0.18GlnHis: 0.18 ± 0.127
2.616GlnIle: 2.616 ± 0.521
2.525GlnLys: 2.525 ± 0.451
2.435GlnLeu: 2.435 ± 0.466
0.812GlnMet: 0.812 ± 0.307
2.796GlnAsn: 2.796 ± 0.414
0.631GlnPro: 0.631 ± 0.222
0.541GlnGln: 0.541 ± 0.178
1.082GlnArg: 1.082 ± 0.273
2.255GlnSer: 2.255 ± 0.45
2.255GlnThr: 2.255 ± 0.388
1.894GlnVal: 1.894 ± 0.377
0.361GlnTrp: 0.361 ± 0.17
0.902GlnTyr: 0.902 ± 0.29
0.0GlnXaa: 0.0 ± 0.0
Arg
1.263ArgAla: 1.263 ± 0.279
0.361ArgCys: 0.361 ± 0.181
1.533ArgAsp: 1.533 ± 0.302
2.435ArgGlu: 2.435 ± 0.563
1.984ArgPhe: 1.984 ± 0.396
1.353ArgGly: 1.353 ± 0.317
0.271ArgHis: 0.271 ± 0.152
2.886ArgIle: 2.886 ± 0.47
2.796ArgLys: 2.796 ± 0.532
2.075ArgLeu: 2.075 ± 0.475
0.631ArgMet: 0.631 ± 0.34
1.173ArgAsn: 1.173 ± 0.312
0.812ArgPro: 0.812 ± 0.255
1.263ArgGln: 1.263 ± 0.297
0.812ArgArg: 0.812 ± 0.284
1.173ArgSer: 1.173 ± 0.312
1.714ArgThr: 1.714 ± 0.397
1.533ArgVal: 1.533 ± 0.321
0.0ArgTrp: 0.0 ± 0.0
1.173ArgTyr: 1.173 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
3.878SerAla: 3.878 ± 0.698
0.812SerCys: 0.812 ± 0.245
4.239SerAsp: 4.239 ± 0.737
4.51SerGlu: 4.51 ± 0.487
4.239SerPhe: 4.239 ± 0.54
3.969SerGly: 3.969 ± 0.569
0.09SerHis: 0.09 ± 0.083
6.314SerIle: 6.314 ± 0.771
7.035SerLys: 7.035 ± 0.772
6.494SerLeu: 6.494 ± 0.84
0.722SerMet: 0.722 ± 0.293
4.059SerAsn: 4.059 ± 0.592
1.714SerPro: 1.714 ± 0.391
2.525SerGln: 2.525 ± 0.441
1.804SerArg: 1.804 ± 0.411
4.059SerSer: 4.059 ± 0.853
2.616SerThr: 2.616 ± 0.602
4.059SerVal: 4.059 ± 0.462
0.361SerTrp: 0.361 ± 0.167
2.886SerTyr: 2.886 ± 0.443
0.0SerXaa: 0.0 ± 0.0
Thr
3.337ThrAla: 3.337 ± 0.531
0.361ThrCys: 0.361 ± 0.2
2.616ThrAsp: 2.616 ± 0.503
2.345ThrGlu: 2.345 ± 0.514
3.337ThrPhe: 3.337 ± 0.526
3.337ThrGly: 3.337 ± 0.481
0.271ThrHis: 0.271 ± 0.132
4.78ThrIle: 4.78 ± 0.715
3.698ThrLys: 3.698 ± 0.567
4.149ThrLeu: 4.149 ± 0.663
0.902ThrMet: 0.902 ± 0.306
4.51ThrAsn: 4.51 ± 0.737
1.804ThrPro: 1.804 ± 0.434
2.886ThrGln: 2.886 ± 0.629
1.353ThrArg: 1.353 ± 0.375
3.608ThrSer: 3.608 ± 0.778
2.706ThrThr: 2.706 ± 0.661
1.443ThrVal: 1.443 ± 0.409
0.361ThrTrp: 0.361 ± 0.167
2.075ThrTyr: 2.075 ± 0.341
0.0ThrXaa: 0.0 ± 0.0
Val
3.157ValAla: 3.157 ± 0.636
0.541ValCys: 0.541 ± 0.196
3.427ValAsp: 3.427 ± 0.519
3.157ValGlu: 3.157 ± 0.571
3.518ValPhe: 3.518 ± 0.648
1.984ValGly: 1.984 ± 0.373
0.271ValHis: 0.271 ± 0.151
4.149ValIle: 4.149 ± 0.537
4.42ValLys: 4.42 ± 0.624
3.969ValLeu: 3.969 ± 0.458
1.173ValMet: 1.173 ± 0.303
4.871ValAsn: 4.871 ± 0.693
1.082ValPro: 1.082 ± 0.284
0.722ValGln: 0.722 ± 0.215
0.902ValArg: 0.902 ± 0.263
3.878ValSer: 3.878 ± 0.684
2.345ValThr: 2.345 ± 0.72
2.345ValVal: 2.345 ± 0.451
0.271ValTrp: 0.271 ± 0.136
2.525ValTyr: 2.525 ± 0.62
0.0ValXaa: 0.0 ± 0.0
Trp
0.631TrpAla: 0.631 ± 0.193
0.18TrpCys: 0.18 ± 0.115
0.541TrpAsp: 0.541 ± 0.254
0.18TrpGlu: 0.18 ± 0.155
0.271TrpPhe: 0.271 ± 0.145
0.451TrpGly: 0.451 ± 0.192
0.0TrpHis: 0.0 ± 0.0
0.451TrpIle: 0.451 ± 0.2
0.271TrpLys: 0.271 ± 0.159
1.173TrpLeu: 1.173 ± 0.325
0.0TrpMet: 0.0 ± 0.0
0.631TrpAsn: 0.631 ± 0.232
0.0TrpPro: 0.0 ± 0.0
0.361TrpGln: 0.361 ± 0.178
0.271TrpArg: 0.271 ± 0.122
0.631TrpSer: 0.631 ± 0.238
0.09TrpThr: 0.09 ± 0.079
0.541TrpVal: 0.541 ± 0.18
0.18TrpTrp: 0.18 ± 0.128
0.271TrpTyr: 0.271 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.894TyrAla: 1.894 ± 0.449
0.812TyrCys: 0.812 ± 0.234
1.714TyrAsp: 1.714 ± 0.422
3.157TyrGlu: 3.157 ± 0.729
3.157TyrPhe: 3.157 ± 0.474
2.435TyrGly: 2.435 ± 0.614
0.09TyrHis: 0.09 ± 0.081
3.518TyrIle: 3.518 ± 0.542
5.141TyrLys: 5.141 ± 0.918
4.059TyrLeu: 4.059 ± 0.602
0.722TyrMet: 0.722 ± 0.267
4.42TyrAsn: 4.42 ± 0.484
1.173TyrPro: 1.173 ± 0.326
2.165TyrGln: 2.165 ± 0.462
1.353TyrArg: 1.353 ± 0.345
2.525TyrSer: 2.525 ± 0.582
1.714TyrThr: 1.714 ± 0.361
1.353TyrVal: 1.353 ± 0.316
0.09TyrTrp: 0.09 ± 0.081
2.255TyrTyr: 2.255 ± 0.478
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11088 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski