Amino acid dipepetide frequency for Burkholderia phage AP3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.186AlaAla: 16.186 ± 2.814
0.866AlaCys: 0.866 ± 0.274
7.79AlaAsp: 7.79 ± 0.963
5.54AlaGlu: 5.54 ± 0.736
3.462AlaPhe: 3.462 ± 0.614
11.166AlaGly: 11.166 ± 1.233
2.077AlaHis: 2.077 ± 0.628
5.107AlaIle: 5.107 ± 0.641
5.193AlaLys: 5.193 ± 0.576
11.858AlaLeu: 11.858 ± 1.367
2.683AlaMet: 2.683 ± 0.456
4.155AlaAsn: 4.155 ± 0.582
5.367AlaPro: 5.367 ± 0.881
5.626AlaGln: 5.626 ± 0.885
10.041AlaArg: 10.041 ± 1.005
6.059AlaSer: 6.059 ± 0.971
7.444AlaThr: 7.444 ± 0.821
6.751AlaVal: 6.751 ± 0.712
2.25AlaTrp: 2.25 ± 0.331
2.943AlaTyr: 2.943 ± 0.591
0.0AlaXaa: 0.0 ± 0.0
Cys
0.866CysAla: 0.866 ± 0.27
0.087CysCys: 0.087 ± 0.086
0.433CysAsp: 0.433 ± 0.226
0.519CysGlu: 0.519 ± 0.195
0.26CysPhe: 0.26 ± 0.156
0.779CysGly: 0.779 ± 0.305
0.26CysHis: 0.26 ± 0.169
0.433CysIle: 0.433 ± 0.158
0.087CysLys: 0.087 ± 0.091
0.692CysLeu: 0.692 ± 0.305
0.433CysMet: 0.433 ± 0.223
0.26CysAsn: 0.26 ± 0.143
0.779CysPro: 0.779 ± 0.278
0.519CysGln: 0.519 ± 0.186
0.433CysArg: 0.433 ± 0.173
0.26CysSer: 0.26 ± 0.147
0.519CysThr: 0.519 ± 0.21
1.039CysVal: 1.039 ± 0.321
0.346CysTrp: 0.346 ± 0.147
0.26CysTyr: 0.26 ± 0.211
0.0CysXaa: 0.0 ± 0.0
Asp
9.175AspAla: 9.175 ± 0.772
0.519AspCys: 0.519 ± 0.199
3.722AspAsp: 3.722 ± 0.868
3.809AspGlu: 3.809 ± 0.628
2.25AspPhe: 2.25 ± 0.359
6.578AspGly: 6.578 ± 1.234
1.125AspHis: 1.125 ± 0.363
2.683AspIle: 2.683 ± 0.503
1.904AspLys: 1.904 ± 0.89
4.414AspLeu: 4.414 ± 0.791
1.039AspMet: 1.039 ± 0.278
1.471AspAsn: 1.471 ± 0.345
3.549AspPro: 3.549 ± 0.545
2.683AspGln: 2.683 ± 0.458
5.193AspArg: 5.193 ± 0.867
3.116AspSer: 3.116 ± 0.487
3.635AspThr: 3.635 ± 0.696
5.107AspVal: 5.107 ± 0.768
1.039AspTrp: 1.039 ± 0.301
1.039AspTyr: 1.039 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
4.155GluAla: 4.155 ± 0.586
0.519GluCys: 0.519 ± 0.235
2.077GluAsp: 2.077 ± 0.385
2.337GluGlu: 2.337 ± 0.514
2.25GluPhe: 2.25 ± 0.415
2.077GluGly: 2.077 ± 0.395
0.952GluHis: 0.952 ± 0.292
2.25GluIle: 2.25 ± 0.513
3.289GluLys: 3.289 ± 0.487
6.751GluLeu: 6.751 ± 0.818
1.385GluMet: 1.385 ± 0.311
1.298GluAsn: 1.298 ± 0.341
3.03GluPro: 3.03 ± 0.71
1.991GluGln: 1.991 ± 0.494
4.761GluArg: 4.761 ± 0.609
2.077GluSer: 2.077 ± 0.406
2.077GluThr: 2.077 ± 0.441
3.116GluVal: 3.116 ± 0.684
0.779GluTrp: 0.779 ± 0.316
2.164GluTyr: 2.164 ± 0.449
0.0GluXaa: 0.0 ± 0.0
Phe
4.674PheAla: 4.674 ± 0.723
0.433PheCys: 0.433 ± 0.177
1.904PheAsp: 1.904 ± 0.444
1.298PheGlu: 1.298 ± 0.375
1.212PhePhe: 1.212 ± 0.3
2.683PheGly: 2.683 ± 0.684
0.779PheHis: 0.779 ± 0.282
1.039PheIle: 1.039 ± 0.247
1.731PheLys: 1.731 ± 0.414
2.424PheLeu: 2.424 ± 0.486
0.692PheMet: 0.692 ± 0.298
1.039PheAsn: 1.039 ± 0.25
1.298PhePro: 1.298 ± 0.405
1.039PheGln: 1.039 ± 0.307
1.991PheArg: 1.991 ± 0.48
2.337PheSer: 2.337 ± 0.549
1.818PheThr: 1.818 ± 0.394
2.943PheVal: 2.943 ± 0.494
0.952PheTrp: 0.952 ± 0.247
0.779PheTyr: 0.779 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
10.733GlyAla: 10.733 ± 1.824
0.779GlyCys: 0.779 ± 0.24
4.761GlyAsp: 4.761 ± 0.625
3.549GlyGlu: 3.549 ± 0.499
3.462GlyPhe: 3.462 ± 0.462
6.665GlyGly: 6.665 ± 1.309
1.558GlyHis: 1.558 ± 0.479
4.328GlyIle: 4.328 ± 0.548
3.462GlyLys: 3.462 ± 0.41
6.405GlyLeu: 6.405 ± 0.847
1.904GlyMet: 1.904 ± 0.432
2.424GlyAsn: 2.424 ± 0.811
3.116GlyPro: 3.116 ± 0.617
2.424GlyGln: 2.424 ± 0.502
5.107GlyArg: 5.107 ± 0.649
3.809GlySer: 3.809 ± 0.751
5.626GlyThr: 5.626 ± 1.013
6.405GlyVal: 6.405 ± 0.776
1.385GlyTrp: 1.385 ± 0.301
2.164GlyTyr: 2.164 ± 0.505
0.0GlyXaa: 0.0 ± 0.0
His
2.337HisAla: 2.337 ± 0.356
0.26HisCys: 0.26 ± 0.151
1.385HisAsp: 1.385 ± 0.48
0.952HisGlu: 0.952 ± 0.311
0.519HisPhe: 0.519 ± 0.251
1.991HisGly: 1.991 ± 0.505
0.692HisHis: 0.692 ± 0.249
1.039HisIle: 1.039 ± 0.282
0.606HisLys: 0.606 ± 0.213
1.558HisLeu: 1.558 ± 0.537
0.433HisMet: 0.433 ± 0.165
0.173HisAsn: 0.173 ± 0.177
0.866HisPro: 0.866 ± 0.425
0.692HisGln: 0.692 ± 0.288
1.645HisArg: 1.645 ± 0.737
1.125HisSer: 1.125 ± 0.264
0.952HisThr: 0.952 ± 0.316
1.991HisVal: 1.991 ± 0.561
0.346HisTrp: 0.346 ± 0.168
0.433HisTyr: 0.433 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
6.578IleAla: 6.578 ± 0.848
0.173IleCys: 0.173 ± 0.116
4.501IleAsp: 4.501 ± 0.671
3.116IleGlu: 3.116 ± 0.768
0.779IlePhe: 0.779 ± 0.277
4.414IleGly: 4.414 ± 0.836
1.298IleHis: 1.298 ± 0.299
0.779IleIle: 0.779 ± 0.3
1.818IleLys: 1.818 ± 0.389
1.991IleLeu: 1.991 ± 0.362
0.692IleMet: 0.692 ± 0.238
1.471IleAsn: 1.471 ± 0.395
1.298IlePro: 1.298 ± 0.39
1.991IleGln: 1.991 ± 0.428
3.289IleArg: 3.289 ± 0.544
2.856IleSer: 2.856 ± 0.535
3.03IleThr: 3.03 ± 0.688
3.376IleVal: 3.376 ± 0.499
0.433IleTrp: 0.433 ± 0.173
1.212IleTyr: 1.212 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
4.761LysAla: 4.761 ± 0.7
0.173LysCys: 0.173 ± 0.145
1.645LysAsp: 1.645 ± 0.399
1.991LysGlu: 1.991 ± 0.439
1.731LysPhe: 1.731 ± 0.372
2.77LysGly: 2.77 ± 0.472
0.606LysHis: 0.606 ± 0.295
1.385LysIle: 1.385 ± 0.334
3.549LysLys: 3.549 ± 0.676
3.462LysLeu: 3.462 ± 0.639
1.039LysMet: 1.039 ± 0.279
1.471LysAsn: 1.471 ± 0.334
2.337LysPro: 2.337 ± 0.509
1.904LysGln: 1.904 ± 0.409
4.501LysArg: 4.501 ± 0.691
1.904LysSer: 1.904 ± 0.48
2.51LysThr: 2.51 ± 0.391
2.77LysVal: 2.77 ± 0.48
0.779LysTrp: 0.779 ± 0.252
1.039LysTyr: 1.039 ± 0.319
0.0LysXaa: 0.0 ± 0.0
Leu
11.599LeuAla: 11.599 ± 1.086
1.039LeuCys: 1.039 ± 0.365
6.578LeuAsp: 6.578 ± 0.59
4.501LeuGlu: 4.501 ± 0.622
3.03LeuPhe: 3.03 ± 0.663
5.886LeuGly: 5.886 ± 1.009
1.298LeuHis: 1.298 ± 0.369
4.068LeuIle: 4.068 ± 0.577
3.549LeuLys: 3.549 ± 0.589
5.972LeuLeu: 5.972 ± 0.641
2.424LeuMet: 2.424 ± 0.378
3.289LeuAsn: 3.289 ± 0.507
5.28LeuPro: 5.28 ± 0.751
3.549LeuGln: 3.549 ± 0.551
7.444LeuArg: 7.444 ± 0.731
4.761LeuSer: 4.761 ± 0.564
5.107LeuThr: 5.107 ± 0.659
5.193LeuVal: 5.193 ± 0.557
0.692LeuTrp: 0.692 ± 0.282
2.25LeuTyr: 2.25 ± 0.637
0.0LeuXaa: 0.0 ± 0.0
Met
2.25MetAla: 2.25 ± 0.378
0.173MetCys: 0.173 ± 0.2
1.125MetAsp: 1.125 ± 0.407
1.125MetGlu: 1.125 ± 0.275
1.039MetPhe: 1.039 ± 0.257
0.866MetGly: 0.866 ± 0.3
0.519MetHis: 0.519 ± 0.221
0.779MetIle: 0.779 ± 0.255
1.039MetLys: 1.039 ± 0.317
1.818MetLeu: 1.818 ± 0.353
0.346MetMet: 0.346 ± 0.232
1.298MetAsn: 1.298 ± 0.271
0.952MetPro: 0.952 ± 0.316
0.866MetGln: 0.866 ± 0.27
2.424MetArg: 2.424 ± 0.534
1.645MetSer: 1.645 ± 0.39
2.683MetThr: 2.683 ± 0.314
1.385MetVal: 1.385 ± 0.279
0.26MetTrp: 0.26 ± 0.158
0.433MetTyr: 0.433 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
3.809AsnAla: 3.809 ± 0.72
0.26AsnCys: 0.26 ± 0.129
2.51AsnAsp: 2.51 ± 0.41
1.558AsnGlu: 1.558 ± 0.291
1.212AsnPhe: 1.212 ± 0.27
3.895AsnGly: 3.895 ± 0.917
0.606AsnHis: 0.606 ± 0.241
1.558AsnIle: 1.558 ± 0.33
0.606AsnLys: 0.606 ± 0.217
2.337AsnLeu: 2.337 ± 0.523
0.606AsnMet: 0.606 ± 0.271
0.692AsnAsn: 0.692 ± 0.395
1.731AsnPro: 1.731 ± 0.456
1.039AsnGln: 1.039 ± 0.349
2.164AsnArg: 2.164 ± 0.514
1.471AsnSer: 1.471 ± 0.413
1.298AsnThr: 1.298 ± 0.279
2.77AsnVal: 2.77 ± 0.427
0.692AsnTrp: 0.692 ± 0.221
0.952AsnTyr: 0.952 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
6.492ProAla: 6.492 ± 1.192
0.519ProCys: 0.519 ± 0.254
3.809ProAsp: 3.809 ± 0.621
1.904ProGlu: 1.904 ± 0.42
1.471ProPhe: 1.471 ± 0.314
3.376ProGly: 3.376 ± 0.372
0.866ProHis: 0.866 ± 0.309
2.943ProIle: 2.943 ± 0.45
1.731ProLys: 1.731 ± 0.358
4.501ProLeu: 4.501 ± 0.642
0.779ProMet: 0.779 ± 0.202
1.645ProAsn: 1.645 ± 0.517
2.51ProPro: 2.51 ± 0.658
1.298ProGln: 1.298 ± 0.387
3.722ProArg: 3.722 ± 0.824
3.203ProSer: 3.203 ± 0.624
1.991ProThr: 1.991 ± 0.382
3.376ProVal: 3.376 ± 0.681
0.606ProTrp: 0.606 ± 0.246
1.039ProTyr: 1.039 ± 0.325
0.0ProXaa: 0.0 ± 0.0
Gln
4.414GlnAla: 4.414 ± 0.739
0.606GlnCys: 0.606 ± 0.196
1.731GlnAsp: 1.731 ± 0.364
1.471GlnGlu: 1.471 ± 0.344
0.866GlnPhe: 0.866 ± 0.318
2.597GlnGly: 2.597 ± 0.468
1.385GlnHis: 1.385 ± 0.396
1.558GlnIle: 1.558 ± 0.52
1.471GlnLys: 1.471 ± 0.494
4.414GlnLeu: 4.414 ± 0.626
0.692GlnMet: 0.692 ± 0.316
0.952GlnAsn: 0.952 ± 0.236
1.125GlnPro: 1.125 ± 0.321
1.904GlnGln: 1.904 ± 0.499
2.943GlnArg: 2.943 ± 0.562
2.25GlnSer: 2.25 ± 0.366
2.337GlnThr: 2.337 ± 0.299
2.337GlnVal: 2.337 ± 0.421
0.779GlnTrp: 0.779 ± 0.249
1.645GlnTyr: 1.645 ± 0.36
0.0GlnXaa: 0.0 ± 0.0
Arg
11.079ArgAla: 11.079 ± 1.239
0.779ArgCys: 0.779 ± 0.358
5.453ArgAsp: 5.453 ± 0.738
4.241ArgGlu: 4.241 ± 0.55
2.51ArgPhe: 2.51 ± 0.452
5.799ArgGly: 5.799 ± 0.796
1.991ArgHis: 1.991 ± 0.574
4.068ArgIle: 4.068 ± 0.593
3.03ArgLys: 3.03 ± 0.636
7.617ArgLeu: 7.617 ± 0.749
1.385ArgMet: 1.385 ± 0.314
2.597ArgAsn: 2.597 ± 0.577
3.462ArgPro: 3.462 ± 0.543
3.549ArgGln: 3.549 ± 0.616
8.05ArgArg: 8.05 ± 1.938
3.289ArgSer: 3.289 ± 0.588
4.761ArgThr: 4.761 ± 0.812
6.232ArgVal: 6.232 ± 0.813
0.866ArgTrp: 0.866 ± 0.333
2.424ArgTyr: 2.424 ± 0.525
0.0ArgXaa: 0.0 ± 0.0
Ser
5.799SerAla: 5.799 ± 0.703
0.26SerCys: 0.26 ± 0.13
2.25SerAsp: 2.25 ± 0.379
2.077SerGlu: 2.077 ± 0.308
1.558SerPhe: 1.558 ± 0.392
5.02SerGly: 5.02 ± 0.658
1.039SerHis: 1.039 ± 0.358
2.77SerIle: 2.77 ± 0.534
1.991SerLys: 1.991 ± 0.348
5.54SerLeu: 5.54 ± 0.754
1.125SerMet: 1.125 ± 0.279
1.904SerAsn: 1.904 ± 0.491
2.424SerPro: 2.424 ± 0.482
1.385SerGln: 1.385 ± 0.419
4.328SerArg: 4.328 ± 0.851
2.597SerSer: 2.597 ± 0.488
3.203SerThr: 3.203 ± 0.471
3.635SerVal: 3.635 ± 0.517
0.952SerTrp: 0.952 ± 0.312
0.952SerTyr: 0.952 ± 0.338
0.0SerXaa: 0.0 ± 0.0
Thr
6.146ThrAla: 6.146 ± 1.018
0.519ThrCys: 0.519 ± 0.207
3.895ThrAsp: 3.895 ± 0.881
2.597ThrGlu: 2.597 ± 0.335
1.385ThrPhe: 1.385 ± 0.341
6.146ThrGly: 6.146 ± 1.067
0.866ThrHis: 0.866 ± 0.312
3.03ThrIle: 3.03 ± 0.555
2.077ThrLys: 2.077 ± 0.496
4.847ThrLeu: 4.847 ± 0.553
1.818ThrMet: 1.818 ± 0.434
2.077ThrAsn: 2.077 ± 0.466
3.549ThrPro: 3.549 ± 0.716
1.298ThrGln: 1.298 ± 0.321
4.414ThrArg: 4.414 ± 0.463
3.722ThrSer: 3.722 ± 0.55
4.328ThrThr: 4.328 ± 0.693
4.501ThrVal: 4.501 ± 0.571
1.645ThrTrp: 1.645 ± 0.389
1.212ThrTyr: 1.212 ± 0.352
0.0ThrXaa: 0.0 ± 0.0
Val
7.617ValAla: 7.617 ± 0.753
0.606ValCys: 0.606 ± 0.243
5.972ValAsp: 5.972 ± 0.576
4.414ValGlu: 4.414 ± 0.737
2.337ValPhe: 2.337 ± 0.411
4.155ValGly: 4.155 ± 0.504
0.692ValHis: 0.692 ± 0.28
3.116ValIle: 3.116 ± 0.537
3.462ValLys: 3.462 ± 0.617
6.665ValLeu: 6.665 ± 1.028
2.683ValMet: 2.683 ± 0.542
2.25ValAsn: 2.25 ± 0.59
3.289ValPro: 3.289 ± 0.525
2.164ValGln: 2.164 ± 0.525
5.713ValArg: 5.713 ± 0.822
2.597ValSer: 2.597 ± 0.5
4.674ValThr: 4.674 ± 0.938
4.501ValVal: 4.501 ± 0.586
1.125ValTrp: 1.125 ± 0.293
2.164ValTyr: 2.164 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
1.731TrpAla: 1.731 ± 0.473
0.26TrpCys: 0.26 ± 0.149
1.125TrpAsp: 1.125 ± 0.35
0.779TrpGlu: 0.779 ± 0.226
0.433TrpPhe: 0.433 ± 0.159
0.952TrpGly: 0.952 ± 0.268
0.692TrpHis: 0.692 ± 0.244
1.298TrpIle: 1.298 ± 0.233
0.692TrpLys: 0.692 ± 0.256
1.558TrpLeu: 1.558 ± 0.375
0.26TrpMet: 0.26 ± 0.153
0.692TrpAsn: 0.692 ± 0.266
0.519TrpPro: 0.519 ± 0.227
0.606TrpGln: 0.606 ± 0.19
2.337TrpArg: 2.337 ± 0.485
0.519TrpSer: 0.519 ± 0.179
0.779TrpThr: 0.779 ± 0.265
0.606TrpVal: 0.606 ± 0.242
0.0TrpTrp: 0.0 ± 0.0
0.519TrpTyr: 0.519 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.164TyrAla: 2.164 ± 0.412
0.433TyrCys: 0.433 ± 0.215
1.385TyrAsp: 1.385 ± 0.402
1.558TyrGlu: 1.558 ± 0.47
1.298TyrPhe: 1.298 ± 0.495
2.25TyrGly: 2.25 ± 0.377
0.692TyrHis: 0.692 ± 0.247
0.866TyrIle: 0.866 ± 0.266
1.039TyrLys: 1.039 ± 0.251
2.683TyrLeu: 2.683 ± 0.535
0.519TyrMet: 0.519 ± 0.197
0.692TyrAsn: 0.692 ± 0.25
1.298TyrPro: 1.298 ± 0.38
0.952TyrGln: 0.952 ± 0.264
2.683TyrArg: 2.683 ± 0.583
1.212TyrSer: 1.212 ± 0.338
1.298TyrThr: 1.298 ± 0.321
2.164TyrVal: 2.164 ± 0.444
0.433TyrTrp: 0.433 ± 0.176
0.433TyrTyr: 0.433 ± 0.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11554 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski