Amino acid dipepetide frequency for Burkholderia virus phiE122

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.665AlaAla: 22.665 ± 2.319
1.151AlaCys: 1.151 ± 0.26
8.322AlaAsp: 8.322 ± 1.034
7.171AlaGlu: 7.171 ± 1.03
3.453AlaPhe: 3.453 ± 0.588
10.093AlaGly: 10.093 ± 0.897
2.568AlaHis: 2.568 ± 0.376
5.666AlaIle: 5.666 ± 0.751
5.932AlaLys: 5.932 ± 0.904
14.254AlaLeu: 14.254 ± 1.623
3.896AlaMet: 3.896 ± 0.457
3.364AlaAsn: 3.364 ± 0.472
7.88AlaPro: 7.88 ± 0.894
4.869AlaGln: 4.869 ± 0.67
12.041AlaArg: 12.041 ± 1.118
7.437AlaSer: 7.437 ± 0.772
7.437AlaThr: 7.437 ± 1.139
7.614AlaVal: 7.614 ± 0.742
2.656AlaTrp: 2.656 ± 0.588
3.453AlaTyr: 3.453 ± 0.505
0.0AlaXaa: 0.0 ± 0.0
Cys
0.974CysAla: 0.974 ± 0.477
0.177CysCys: 0.177 ± 0.109
0.62CysAsp: 0.62 ± 0.217
0.797CysGlu: 0.797 ± 0.266
0.177CysPhe: 0.177 ± 0.128
1.594CysGly: 1.594 ± 0.444
0.177CysHis: 0.177 ± 0.122
0.266CysIle: 0.266 ± 0.151
0.089CysLys: 0.089 ± 0.086
0.531CysLeu: 0.531 ± 0.177
0.354CysMet: 0.354 ± 0.153
0.266CysAsn: 0.266 ± 0.147
0.62CysPro: 0.62 ± 0.239
0.443CysGln: 0.443 ± 0.182
0.797CysArg: 0.797 ± 0.28
0.531CysSer: 0.531 ± 0.199
0.797CysThr: 0.797 ± 0.301
0.708CysVal: 0.708 ± 0.201
0.266CysTrp: 0.266 ± 0.153
0.089CysTyr: 0.089 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
10.624AspAla: 10.624 ± 1.154
0.443AspCys: 0.443 ± 0.171
4.604AspAsp: 4.604 ± 0.738
3.984AspGlu: 3.984 ± 0.624
1.859AspPhe: 1.859 ± 0.354
7.171AspGly: 7.171 ± 1.03
1.151AspHis: 1.151 ± 0.389
2.656AspIle: 2.656 ± 0.632
1.771AspLys: 1.771 ± 0.358
3.807AspLeu: 3.807 ± 0.571
2.213AspMet: 2.213 ± 0.472
1.151AspAsn: 1.151 ± 0.346
3.099AspPro: 3.099 ± 0.584
1.594AspGln: 1.594 ± 0.281
3.718AspArg: 3.718 ± 0.514
2.479AspSer: 2.479 ± 0.512
3.276AspThr: 3.276 ± 0.65
4.338AspVal: 4.338 ± 0.707
0.974AspTrp: 0.974 ± 0.318
2.036AspTyr: 2.036 ± 0.422
0.0AspXaa: 0.0 ± 0.0
Glu
6.729GluAla: 6.729 ± 0.776
0.797GluCys: 0.797 ± 0.337
1.771GluAsp: 1.771 ± 0.373
1.771GluGlu: 1.771 ± 0.472
2.479GluPhe: 2.479 ± 0.367
3.099GluGly: 3.099 ± 0.567
1.771GluHis: 1.771 ± 0.484
3.453GluIle: 3.453 ± 0.541
2.745GluLys: 2.745 ± 0.581
6.463GluLeu: 6.463 ± 0.682
1.151GluMet: 1.151 ± 0.308
1.859GluAsn: 1.859 ± 0.371
2.833GluPro: 2.833 ± 0.603
2.39GluGln: 2.39 ± 0.396
5.224GluArg: 5.224 ± 0.499
2.125GluSer: 2.125 ± 0.417
2.656GluThr: 2.656 ± 0.502
3.276GluVal: 3.276 ± 0.525
1.062GluTrp: 1.062 ± 0.253
1.151GluTyr: 1.151 ± 0.27
0.0GluXaa: 0.0 ± 0.0
Phe
5.578PheAla: 5.578 ± 0.768
0.266PheCys: 0.266 ± 0.145
2.745PheAsp: 2.745 ± 0.466
2.568PheGlu: 2.568 ± 0.397
1.062PhePhe: 1.062 ± 0.271
2.922PheGly: 2.922 ± 0.594
0.708PheHis: 0.708 ± 0.35
1.151PheIle: 1.151 ± 0.259
0.974PheLys: 0.974 ± 0.288
1.948PheLeu: 1.948 ± 0.385
0.443PheMet: 0.443 ± 0.145
0.531PheAsn: 0.531 ± 0.239
1.328PhePro: 1.328 ± 0.348
0.974PheGln: 0.974 ± 0.317
2.479PheArg: 2.479 ± 0.473
1.771PheSer: 1.771 ± 0.362
1.594PheThr: 1.594 ± 0.377
2.125PheVal: 2.125 ± 0.46
0.531PheTrp: 0.531 ± 0.243
0.443PheTyr: 0.443 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
10.536GlyAla: 10.536 ± 1.413
0.797GlyCys: 0.797 ± 0.255
4.427GlyAsp: 4.427 ± 0.681
4.781GlyGlu: 4.781 ± 0.616
2.302GlyPhe: 2.302 ± 0.468
7.703GlyGly: 7.703 ± 1.387
1.505GlyHis: 1.505 ± 0.346
3.63GlyIle: 3.63 ± 0.755
3.099GlyLys: 3.099 ± 0.591
6.906GlyLeu: 6.906 ± 1.042
2.656GlyMet: 2.656 ± 0.476
2.479GlyAsn: 2.479 ± 0.61
2.125GlyPro: 2.125 ± 0.289
2.213GlyGln: 2.213 ± 0.434
5.843GlyArg: 5.843 ± 0.771
2.833GlySer: 2.833 ± 0.441
4.869GlyThr: 4.869 ± 0.715
6.906GlyVal: 6.906 ± 0.789
1.771GlyTrp: 1.771 ± 0.418
2.568GlyTyr: 2.568 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
4.073HisAla: 4.073 ± 0.618
0.177HisCys: 0.177 ± 0.11
1.062HisAsp: 1.062 ± 0.389
1.594HisGlu: 1.594 ± 0.282
0.266HisPhe: 0.266 ± 0.132
1.948HisGly: 1.948 ± 0.493
0.62HisHis: 0.62 ± 0.196
1.062HisIle: 1.062 ± 0.368
0.531HisLys: 0.531 ± 0.217
1.151HisLeu: 1.151 ± 0.372
0.089HisMet: 0.089 ± 0.09
0.266HisAsn: 0.266 ± 0.132
0.62HisPro: 0.62 ± 0.159
0.797HisGln: 0.797 ± 0.229
1.328HisArg: 1.328 ± 0.441
1.151HisSer: 1.151 ± 0.322
1.771HisThr: 1.771 ± 0.475
1.505HisVal: 1.505 ± 0.39
0.443HisTrp: 0.443 ± 0.207
0.797HisTyr: 0.797 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
6.64IleAla: 6.64 ± 0.763
0.354IleCys: 0.354 ± 0.142
4.604IleAsp: 4.604 ± 0.516
3.718IleGlu: 3.718 ± 0.604
0.62IlePhe: 0.62 ± 0.183
4.25IleGly: 4.25 ± 0.644
0.797IleHis: 0.797 ± 0.198
1.062IleIle: 1.062 ± 0.398
1.505IleLys: 1.505 ± 0.307
2.568IleLeu: 2.568 ± 0.337
0.708IleMet: 0.708 ± 0.262
1.328IleAsn: 1.328 ± 0.286
1.594IlePro: 1.594 ± 0.433
1.239IleGln: 1.239 ± 0.273
3.187IleArg: 3.187 ± 0.631
2.125IleSer: 2.125 ± 0.53
1.594IleThr: 1.594 ± 0.317
3.364IleVal: 3.364 ± 0.661
0.443IleTrp: 0.443 ± 0.233
0.797IleTyr: 0.797 ± 0.299
0.0IleXaa: 0.0 ± 0.0
Lys
4.604LysAla: 4.604 ± 0.694
0.089LysCys: 0.089 ± 0.084
2.39LysAsp: 2.39 ± 0.377
1.505LysGlu: 1.505 ± 0.354
1.328LysPhe: 1.328 ± 0.378
3.541LysGly: 3.541 ± 0.48
0.62LysHis: 0.62 ± 0.226
1.682LysIle: 1.682 ± 0.462
1.771LysLys: 1.771 ± 0.399
3.099LysLeu: 3.099 ± 0.74
0.974LysMet: 0.974 ± 0.24
1.239LysAsn: 1.239 ± 0.356
1.239LysPro: 1.239 ± 0.341
1.948LysGln: 1.948 ± 0.529
3.541LysArg: 3.541 ± 0.673
2.036LysSer: 2.036 ± 0.413
2.745LysThr: 2.745 ± 0.412
1.771LysVal: 1.771 ± 0.505
0.354LysTrp: 0.354 ± 0.187
1.505LysTyr: 1.505 ± 0.341
0.0LysXaa: 0.0 ± 0.0
Leu
12.041LeuAla: 12.041 ± 1.145
0.885LeuCys: 0.885 ± 0.294
6.02LeuAsp: 6.02 ± 0.555
3.63LeuGlu: 3.63 ± 0.476
3.276LeuPhe: 3.276 ± 0.529
6.286LeuGly: 6.286 ± 1.737
1.771LeuHis: 1.771 ± 0.566
3.276LeuIle: 3.276 ± 0.49
2.745LeuLys: 2.745 ± 0.526
5.843LeuLeu: 5.843 ± 0.75
1.948LeuMet: 1.948 ± 0.358
2.833LeuAsn: 2.833 ± 0.512
4.515LeuPro: 4.515 ± 0.913
2.656LeuGln: 2.656 ± 0.358
6.817LeuArg: 6.817 ± 0.771
5.666LeuSer: 5.666 ± 0.581
5.046LeuThr: 5.046 ± 0.609
5.932LeuVal: 5.932 ± 0.639
0.62LeuTrp: 0.62 ± 0.228
2.213LeuTyr: 2.213 ± 0.54
0.0LeuXaa: 0.0 ± 0.0
Met
2.922MetAla: 2.922 ± 0.519
0.177MetCys: 0.177 ± 0.127
1.062MetAsp: 1.062 ± 0.303
1.062MetGlu: 1.062 ± 0.261
0.62MetPhe: 0.62 ± 0.311
1.594MetGly: 1.594 ± 0.407
0.443MetHis: 0.443 ± 0.2
0.797MetIle: 0.797 ± 0.276
1.505MetLys: 1.505 ± 0.393
1.239MetLeu: 1.239 ± 0.288
0.354MetMet: 0.354 ± 0.156
0.797MetAsn: 0.797 ± 0.217
1.859MetPro: 1.859 ± 0.416
0.62MetGln: 0.62 ± 0.359
2.656MetArg: 2.656 ± 0.476
1.505MetSer: 1.505 ± 0.403
1.948MetThr: 1.948 ± 0.356
1.062MetVal: 1.062 ± 0.255
0.266MetTrp: 0.266 ± 0.191
0.531MetTyr: 0.531 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
3.364AsnAla: 3.364 ± 0.557
0.089AsnCys: 0.089 ± 0.091
2.656AsnAsp: 2.656 ± 0.563
2.568AsnGlu: 2.568 ± 0.419
0.62AsnPhe: 0.62 ± 0.191
3.099AsnGly: 3.099 ± 0.593
0.797AsnHis: 0.797 ± 0.369
1.062AsnIle: 1.062 ± 0.321
0.62AsnLys: 0.62 ± 0.234
2.302AsnLeu: 2.302 ± 0.456
0.62AsnMet: 0.62 ± 0.224
0.708AsnAsn: 0.708 ± 0.33
1.417AsnPro: 1.417 ± 0.388
0.885AsnGln: 0.885 ± 0.265
1.417AsnArg: 1.417 ± 0.274
1.151AsnSer: 1.151 ± 0.299
1.239AsnThr: 1.239 ± 0.318
2.568AsnVal: 2.568 ± 0.466
0.177AsnTrp: 0.177 ± 0.111
0.62AsnTyr: 0.62 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
7.525ProAla: 7.525 ± 1.143
0.797ProCys: 0.797 ± 0.333
3.63ProAsp: 3.63 ± 0.657
2.745ProGlu: 2.745 ± 0.661
1.151ProPhe: 1.151 ± 0.319
2.302ProGly: 2.302 ± 0.528
1.062ProHis: 1.062 ± 0.284
2.302ProIle: 2.302 ± 0.642
2.213ProLys: 2.213 ± 0.544
3.984ProLeu: 3.984 ± 0.657
0.443ProMet: 0.443 ± 0.204
1.594ProAsn: 1.594 ± 0.447
3.453ProPro: 3.453 ± 0.798
1.239ProGln: 1.239 ± 0.291
3.541ProArg: 3.541 ± 0.573
2.922ProSer: 2.922 ± 0.527
2.39ProThr: 2.39 ± 0.504
3.63ProVal: 3.63 ± 0.553
0.797ProTrp: 0.797 ± 0.259
0.974ProTyr: 0.974 ± 0.318
0.0ProXaa: 0.0 ± 0.0
Gln
4.515GlnAla: 4.515 ± 0.696
0.266GlnCys: 0.266 ± 0.153
1.417GlnAsp: 1.417 ± 0.264
1.239GlnGlu: 1.239 ± 0.392
1.859GlnPhe: 1.859 ± 0.399
2.036GlnGly: 2.036 ± 0.351
0.62GlnHis: 0.62 ± 0.206
1.948GlnIle: 1.948 ± 0.455
1.594GlnLys: 1.594 ± 0.329
3.099GlnLeu: 3.099 ± 0.521
0.974GlnMet: 0.974 ± 0.348
0.62GlnAsn: 0.62 ± 0.211
1.505GlnPro: 1.505 ± 0.406
2.036GlnGln: 2.036 ± 0.485
3.718GlnArg: 3.718 ± 0.569
2.036GlnSer: 2.036 ± 0.525
1.859GlnThr: 1.859 ± 0.501
1.948GlnVal: 1.948 ± 0.403
0.354GlnTrp: 0.354 ± 0.17
0.708GlnTyr: 0.708 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
11.421ArgAla: 11.421 ± 1.277
1.239ArgCys: 1.239 ± 0.269
3.453ArgAsp: 3.453 ± 0.469
6.109ArgGlu: 6.109 ± 0.745
2.39ArgPhe: 2.39 ± 0.424
5.312ArgGly: 5.312 ± 0.583
1.948ArgHis: 1.948 ± 0.464
3.896ArgIle: 3.896 ± 0.64
3.453ArgLys: 3.453 ± 0.422
6.817ArgLeu: 6.817 ± 0.995
1.417ArgMet: 1.417 ± 0.313
2.479ArgAsn: 2.479 ± 0.58
3.718ArgPro: 3.718 ± 0.677
3.276ArgGln: 3.276 ± 0.465
7.171ArgArg: 7.171 ± 1.059
3.718ArgSer: 3.718 ± 0.628
3.63ArgThr: 3.63 ± 0.503
7.26ArgVal: 7.26 ± 0.942
1.328ArgTrp: 1.328 ± 0.301
1.328ArgTyr: 1.328 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
6.552SerAla: 6.552 ± 0.822
0.62SerCys: 0.62 ± 0.238
3.541SerAsp: 3.541 ± 0.502
1.859SerGlu: 1.859 ± 0.403
1.771SerPhe: 1.771 ± 0.327
4.515SerGly: 4.515 ± 0.545
0.974SerHis: 0.974 ± 0.32
2.036SerIle: 2.036 ± 0.372
1.771SerLys: 1.771 ± 0.288
4.25SerLeu: 4.25 ± 0.549
1.062SerMet: 1.062 ± 0.346
2.036SerAsn: 2.036 ± 0.454
2.922SerPro: 2.922 ± 0.64
1.682SerGln: 1.682 ± 0.396
3.276SerArg: 3.276 ± 0.612
3.541SerSer: 3.541 ± 0.592
3.63SerThr: 3.63 ± 0.536
2.479SerVal: 2.479 ± 0.413
1.151SerTrp: 1.151 ± 0.273
0.797SerTyr: 0.797 ± 0.201
0.0SerXaa: 0.0 ± 0.0
Thr
6.286ThrAla: 6.286 ± 0.878
0.708ThrCys: 0.708 ± 0.239
3.807ThrAsp: 3.807 ± 0.627
2.479ThrGlu: 2.479 ± 0.445
2.479ThrPhe: 2.479 ± 0.666
4.427ThrGly: 4.427 ± 0.695
0.974ThrHis: 0.974 ± 0.288
2.479ThrIle: 2.479 ± 0.581
1.771ThrLys: 1.771 ± 0.362
5.046ThrLeu: 5.046 ± 0.559
1.239ThrMet: 1.239 ± 0.371
1.771ThrAsn: 1.771 ± 0.348
4.25ThrPro: 4.25 ± 0.596
1.771ThrGln: 1.771 ± 0.338
4.427ThrArg: 4.427 ± 0.582
2.036ThrSer: 2.036 ± 0.391
3.984ThrThr: 3.984 ± 0.669
5.401ThrVal: 5.401 ± 0.585
0.531ThrTrp: 0.531 ± 0.19
0.885ThrTyr: 0.885 ± 0.333
0.0ThrXaa: 0.0 ± 0.0
Val
9.827ValAla: 9.827 ± 1.044
0.708ValCys: 0.708 ± 0.221
4.692ValAsp: 4.692 ± 0.783
3.984ValGlu: 3.984 ± 0.437
2.213ValPhe: 2.213 ± 0.462
5.135ValGly: 5.135 ± 0.647
1.239ValHis: 1.239 ± 0.358
2.922ValIle: 2.922 ± 0.467
2.568ValLys: 2.568 ± 0.416
6.109ValLeu: 6.109 ± 0.924
1.771ValMet: 1.771 ± 0.35
1.948ValAsn: 1.948 ± 0.363
2.656ValPro: 2.656 ± 0.444
2.125ValGln: 2.125 ± 0.505
6.02ValArg: 6.02 ± 0.871
3.718ValSer: 3.718 ± 0.588
3.984ValThr: 3.984 ± 0.512
5.755ValVal: 5.755 ± 0.802
0.885ValTrp: 0.885 ± 0.361
2.213ValTyr: 2.213 ± 0.426
0.0ValXaa: 0.0 ± 0.0
Trp
1.948TrpAla: 1.948 ± 0.391
0.354TrpCys: 0.354 ± 0.156
0.354TrpAsp: 0.354 ± 0.208
0.443TrpGlu: 0.443 ± 0.202
0.708TrpPhe: 0.708 ± 0.235
0.885TrpGly: 0.885 ± 0.428
0.62TrpHis: 0.62 ± 0.22
0.62TrpIle: 0.62 ± 0.201
0.62TrpLys: 0.62 ± 0.218
2.568TrpLeu: 2.568 ± 0.471
0.266TrpMet: 0.266 ± 0.147
0.266TrpAsn: 0.266 ± 0.133
0.443TrpPro: 0.443 ± 0.249
0.443TrpGln: 0.443 ± 0.167
1.417TrpArg: 1.417 ± 0.495
0.797TrpSer: 0.797 ± 0.27
1.062TrpThr: 1.062 ± 0.266
0.797TrpVal: 0.797 ± 0.238
0.177TrpTrp: 0.177 ± 0.108
0.354TrpTyr: 0.354 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.099TyrAla: 3.099 ± 0.52
0.177TyrCys: 0.177 ± 0.128
1.771TyrAsp: 1.771 ± 0.326
0.797TyrGlu: 0.797 ± 0.312
1.505TyrPhe: 1.505 ± 0.501
2.036TyrGly: 2.036 ± 0.463
0.885TyrHis: 0.885 ± 0.288
0.62TyrIle: 0.62 ± 0.229
0.708TyrLys: 0.708 ± 0.198
2.036TyrLeu: 2.036 ± 0.396
0.354TyrMet: 0.354 ± 0.15
0.443TyrAsn: 0.443 ± 0.168
0.62TyrPro: 0.62 ± 0.211
1.151TyrGln: 1.151 ± 0.33
2.745TyrArg: 2.745 ± 0.46
0.885TyrSer: 0.885 ± 0.314
1.239TyrThr: 1.239 ± 0.314
2.036TyrVal: 2.036 ± 0.516
0.354TyrTrp: 0.354 ± 0.159
0.531TyrTyr: 0.531 ± 0.243
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (11296 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski