Amino acid dipepetide frequency for Burkholderia phage FLC5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.078AlaAla: 19.078 ± 1.868
1.237AlaCys: 1.237 ± 0.347
9.384AlaAsp: 9.384 ± 1.076
7.941AlaGlu: 7.941 ± 0.839
4.125AlaPhe: 4.125 ± 0.674
12.066AlaGly: 12.066 ± 1.159
2.784AlaHis: 2.784 ± 0.611
7.012AlaIle: 7.012 ± 1.155
5.466AlaLys: 5.466 ± 0.946
14.437AlaLeu: 14.437 ± 1.447
2.991AlaMet: 2.991 ± 0.608
3.403AlaAsn: 3.403 ± 0.496
5.466AlaPro: 5.466 ± 0.583
5.156AlaGln: 5.156 ± 0.86
9.178AlaArg: 9.178 ± 1.449
7.219AlaSer: 7.219 ± 0.955
6.909AlaThr: 6.909 ± 1.344
7.116AlaVal: 7.116 ± 0.719
2.681AlaTrp: 2.681 ± 0.552
4.125AlaTyr: 4.125 ± 0.877
0.0AlaXaa: 0.0 ± 0.0
Cys
0.928CysAla: 0.928 ± 0.477
0.0CysCys: 0.0 ± 0.0
0.619CysAsp: 0.619 ± 0.289
0.722CysGlu: 0.722 ± 0.277
0.412CysPhe: 0.412 ± 0.207
0.412CysGly: 0.412 ± 0.174
0.0CysHis: 0.0 ± 0.0
0.309CysIle: 0.309 ± 0.19
0.309CysLys: 0.309 ± 0.19
0.619CysLeu: 0.619 ± 0.298
0.206CysMet: 0.206 ± 0.135
0.0CysAsn: 0.0 ± 0.0
0.309CysPro: 0.309 ± 0.167
0.412CysGln: 0.412 ± 0.255
0.309CysArg: 0.309 ± 0.168
0.309CysSer: 0.309 ± 0.174
0.516CysThr: 0.516 ± 0.272
0.619CysVal: 0.619 ± 0.282
0.103CysTrp: 0.103 ± 0.098
0.103CysTyr: 0.103 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
9.075AspAla: 9.075 ± 1.053
0.516AspCys: 0.516 ± 0.246
4.641AspAsp: 4.641 ± 0.605
4.228AspGlu: 4.228 ± 0.664
2.578AspPhe: 2.578 ± 0.544
4.847AspGly: 4.847 ± 0.781
1.134AspHis: 1.134 ± 0.339
2.372AspIle: 2.372 ± 0.57
2.887AspLys: 2.887 ± 0.618
5.981AspLeu: 5.981 ± 0.744
2.166AspMet: 2.166 ± 0.647
1.547AspAsn: 1.547 ± 0.369
2.991AspPro: 2.991 ± 0.579
1.753AspGln: 1.753 ± 0.351
4.537AspArg: 4.537 ± 0.583
1.959AspSer: 1.959 ± 0.471
4.331AspThr: 4.331 ± 0.679
3.816AspVal: 3.816 ± 0.564
0.825AspTrp: 0.825 ± 0.331
1.856AspTyr: 1.856 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
7.012GluAla: 7.012 ± 0.924
0.309GluCys: 0.309 ± 0.182
1.134GluAsp: 1.134 ± 0.277
1.959GluGlu: 1.959 ± 0.337
2.166GluPhe: 2.166 ± 0.517
2.578GluGly: 2.578 ± 0.527
1.341GluHis: 1.341 ± 0.334
3.403GluIle: 3.403 ± 0.407
2.578GluLys: 2.578 ± 0.619
8.044GluLeu: 8.044 ± 1.087
1.444GluMet: 1.444 ± 0.361
2.269GluAsn: 2.269 ± 0.543
2.578GluPro: 2.578 ± 0.518
2.062GluGln: 2.062 ± 0.452
4.228GluArg: 4.228 ± 0.918
3.712GluSer: 3.712 ± 0.672
3.3GluThr: 3.3 ± 0.631
3.197GluVal: 3.197 ± 0.71
1.134GluTrp: 1.134 ± 0.336
1.753GluTyr: 1.753 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
5.053PheAla: 5.053 ± 0.693
0.0PheCys: 0.0 ± 0.0
2.887PheAsp: 2.887 ± 0.374
2.166PheGlu: 2.166 ± 0.387
0.722PhePhe: 0.722 ± 0.252
2.784PheGly: 2.784 ± 0.512
0.309PheHis: 0.309 ± 0.176
1.444PheIle: 1.444 ± 0.37
1.341PheLys: 1.341 ± 0.419
2.269PheLeu: 2.269 ± 0.423
0.619PheMet: 0.619 ± 0.257
0.928PheAsn: 0.928 ± 0.301
1.444PhePro: 1.444 ± 0.426
0.619PheGln: 0.619 ± 0.244
2.578PheArg: 2.578 ± 0.525
2.062PheSer: 2.062 ± 0.372
2.269PheThr: 2.269 ± 0.471
1.65PheVal: 1.65 ± 0.529
0.516PheTrp: 0.516 ± 0.256
1.134PheTyr: 1.134 ± 0.345
0.0PheXaa: 0.0 ± 0.0
Gly
9.075GlyAla: 9.075 ± 1.252
0.516GlyCys: 0.516 ± 0.178
5.053GlyAsp: 5.053 ± 0.629
4.95GlyGlu: 4.95 ± 0.78
2.784GlyPhe: 2.784 ± 0.385
5.569GlyGly: 5.569 ± 0.768
1.134GlyHis: 1.134 ± 0.365
3.919GlyIle: 3.919 ± 0.713
4.125GlyLys: 4.125 ± 0.594
7.012GlyLeu: 7.012 ± 0.893
2.166GlyMet: 2.166 ± 0.473
3.403GlyAsn: 3.403 ± 0.674
2.372GlyPro: 2.372 ± 0.552
1.031GlyGln: 1.031 ± 0.338
4.537GlyArg: 4.537 ± 0.754
3.609GlySer: 3.609 ± 0.584
5.156GlyThr: 5.156 ± 0.754
4.95GlyVal: 4.95 ± 0.772
1.547GlyTrp: 1.547 ± 0.35
2.784GlyTyr: 2.784 ± 0.546
0.0GlyXaa: 0.0 ± 0.0
His
2.166HisAla: 2.166 ± 0.561
0.206HisCys: 0.206 ± 0.144
1.031HisAsp: 1.031 ± 0.347
1.856HisGlu: 1.856 ± 0.38
0.516HisPhe: 0.516 ± 0.221
2.372HisGly: 2.372 ± 0.599
0.516HisHis: 0.516 ± 0.26
1.031HisIle: 1.031 ± 0.316
0.722HisLys: 0.722 ± 0.224
2.062HisLeu: 2.062 ± 0.425
0.516HisMet: 0.516 ± 0.197
0.516HisAsn: 0.516 ± 0.236
0.309HisPro: 0.309 ± 0.196
0.825HisGln: 0.825 ± 0.258
1.031HisArg: 1.031 ± 0.334
0.516HisSer: 0.516 ± 0.178
1.031HisThr: 1.031 ± 0.405
1.341HisVal: 1.341 ± 0.367
0.206HisTrp: 0.206 ± 0.147
0.619HisTyr: 0.619 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
8.662IleAla: 8.662 ± 1.196
0.206IleCys: 0.206 ± 0.135
5.466IleAsp: 5.466 ± 0.711
4.847IleGlu: 4.847 ± 0.558
0.619IlePhe: 0.619 ± 0.252
5.259IleGly: 5.259 ± 0.853
0.619IleHis: 0.619 ± 0.197
0.619IleIle: 0.619 ± 0.244
1.856IleLys: 1.856 ± 0.36
2.475IleLeu: 2.475 ± 0.453
0.928IleMet: 0.928 ± 0.267
2.062IleAsn: 2.062 ± 0.471
1.856IlePro: 1.856 ± 0.607
1.237IleGln: 1.237 ± 0.36
1.547IleArg: 1.547 ± 0.377
1.959IleSer: 1.959 ± 0.413
3.712IleThr: 3.712 ± 0.626
3.197IleVal: 3.197 ± 0.37
0.309IleTrp: 0.309 ± 0.165
0.722IleTyr: 0.722 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
6.703LysAla: 6.703 ± 0.71
0.206LysCys: 0.206 ± 0.129
1.856LysAsp: 1.856 ± 0.403
2.372LysGlu: 2.372 ± 0.44
0.825LysPhe: 0.825 ± 0.352
3.609LysGly: 3.609 ± 0.708
1.031LysHis: 1.031 ± 0.401
1.237LysIle: 1.237 ± 0.393
2.372LysLys: 2.372 ± 0.414
3.919LysLeu: 3.919 ± 0.694
1.031LysMet: 1.031 ± 0.443
0.619LysAsn: 0.619 ± 0.305
2.475LysPro: 2.475 ± 0.536
1.959LysGln: 1.959 ± 0.486
4.847LysArg: 4.847 ± 0.661
2.578LysSer: 2.578 ± 0.461
2.372LysThr: 2.372 ± 0.487
2.166LysVal: 2.166 ± 0.53
0.412LysTrp: 0.412 ± 0.179
1.031LysTyr: 1.031 ± 0.343
0.0LysXaa: 0.0 ± 0.0
Leu
13.819LeuAla: 13.819 ± 1.065
0.825LeuCys: 0.825 ± 0.326
5.878LeuAsp: 5.878 ± 0.81
3.403LeuGlu: 3.403 ± 0.489
1.959LeuPhe: 1.959 ± 0.457
6.6LeuGly: 6.6 ± 1.213
1.856LeuHis: 1.856 ± 0.443
3.3LeuIle: 3.3 ± 0.503
3.506LeuLys: 3.506 ± 0.534
5.981LeuLeu: 5.981 ± 0.862
2.578LeuMet: 2.578 ± 0.537
3.816LeuAsn: 3.816 ± 0.634
4.744LeuPro: 4.744 ± 0.789
3.506LeuGln: 3.506 ± 0.542
5.259LeuArg: 5.259 ± 0.702
6.6LeuSer: 6.6 ± 0.787
6.806LeuThr: 6.806 ± 0.736
7.425LeuVal: 7.425 ± 0.788
0.516LeuTrp: 0.516 ± 0.223
2.166LeuTyr: 2.166 ± 0.381
0.0LeuXaa: 0.0 ± 0.0
Met
3.506MetAla: 3.506 ± 0.541
0.206MetCys: 0.206 ± 0.131
1.237MetAsp: 1.237 ± 0.293
0.928MetGlu: 0.928 ± 0.279
0.928MetPhe: 0.928 ± 0.268
0.619MetGly: 0.619 ± 0.238
0.722MetHis: 0.722 ± 0.285
1.031MetIle: 1.031 ± 0.346
1.341MetLys: 1.341 ± 0.366
2.578MetLeu: 2.578 ± 0.528
0.722MetMet: 0.722 ± 0.306
1.341MetAsn: 1.341 ± 0.276
0.619MetPro: 0.619 ± 0.243
1.341MetGln: 1.341 ± 0.352
2.578MetArg: 2.578 ± 0.539
1.341MetSer: 1.341 ± 0.314
2.475MetThr: 2.475 ± 0.501
1.134MetVal: 1.134 ± 0.29
0.0MetTrp: 0.0 ± 0.0
0.309MetTyr: 0.309 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
4.537AsnAla: 4.537 ± 0.595
0.0AsnCys: 0.0 ± 0.0
2.372AsnAsp: 2.372 ± 0.448
2.681AsnGlu: 2.681 ± 0.46
0.928AsnPhe: 0.928 ± 0.24
4.641AsnGly: 4.641 ± 0.91
0.619AsnHis: 0.619 ± 0.226
1.753AsnIle: 1.753 ± 0.327
1.547AsnLys: 1.547 ± 0.466
2.475AsnLeu: 2.475 ± 0.455
0.825AsnMet: 0.825 ± 0.205
0.928AsnAsn: 0.928 ± 0.223
1.031AsnPro: 1.031 ± 0.228
0.516AsnGln: 0.516 ± 0.197
1.134AsnArg: 1.134 ± 0.327
0.619AsnSer: 0.619 ± 0.233
2.062AsnThr: 2.062 ± 0.647
1.959AsnVal: 1.959 ± 0.412
0.206AsnTrp: 0.206 ± 0.13
1.031AsnTyr: 1.031 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
8.559ProAla: 8.559 ± 1.298
0.103ProCys: 0.103 ± 0.104
3.506ProAsp: 3.506 ± 0.809
2.475ProGlu: 2.475 ± 0.429
1.031ProPhe: 1.031 ± 0.276
2.062ProGly: 2.062 ± 0.336
0.619ProHis: 0.619 ± 0.31
2.784ProIle: 2.784 ± 0.421
1.134ProLys: 1.134 ± 0.332
2.578ProLeu: 2.578 ± 0.58
0.825ProMet: 0.825 ± 0.319
1.444ProAsn: 1.444 ± 0.366
1.959ProPro: 1.959 ± 0.44
0.825ProGln: 0.825 ± 0.318
3.403ProArg: 3.403 ± 0.688
2.887ProSer: 2.887 ± 0.512
2.062ProThr: 2.062 ± 0.456
2.887ProVal: 2.887 ± 0.617
0.412ProTrp: 0.412 ± 0.189
0.825ProTyr: 0.825 ± 0.346
0.0ProXaa: 0.0 ± 0.0
Gln
4.125GlnAla: 4.125 ± 0.715
0.412GlnCys: 0.412 ± 0.179
1.547GlnAsp: 1.547 ± 0.426
1.031GlnGlu: 1.031 ± 0.242
1.856GlnPhe: 1.856 ± 0.675
3.094GlnGly: 3.094 ± 0.643
0.412GlnHis: 0.412 ± 0.141
1.547GlnIle: 1.547 ± 0.478
1.856GlnLys: 1.856 ± 0.432
4.847GlnLeu: 4.847 ± 0.76
0.309GlnMet: 0.309 ± 0.17
0.516GlnAsn: 0.516 ± 0.196
0.619GlnPro: 0.619 ± 0.237
1.753GlnGln: 1.753 ± 0.371
3.3GlnArg: 3.3 ± 0.651
2.062GlnSer: 2.062 ± 0.387
1.444GlnThr: 1.444 ± 0.445
1.753GlnVal: 1.753 ± 0.397
0.619GlnTrp: 0.619 ± 0.246
1.547GlnTyr: 1.547 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
7.941ArgAla: 7.941 ± 1.182
0.825ArgCys: 0.825 ± 0.308
4.537ArgAsp: 4.537 ± 0.576
4.434ArgGlu: 4.434 ± 0.753
1.341ArgPhe: 1.341 ± 0.434
4.125ArgGly: 4.125 ± 0.622
1.753ArgHis: 1.753 ± 0.513
4.228ArgIle: 4.228 ± 0.728
4.331ArgLys: 4.331 ± 0.65
5.569ArgLeu: 5.569 ± 0.736
2.166ArgMet: 2.166 ± 0.556
2.578ArgAsn: 2.578 ± 0.524
2.269ArgPro: 2.269 ± 0.358
2.166ArgGln: 2.166 ± 0.585
3.712ArgArg: 3.712 ± 0.794
3.816ArgSer: 3.816 ± 0.652
4.434ArgThr: 4.434 ± 0.608
6.806ArgVal: 6.806 ± 0.799
0.206ArgTrp: 0.206 ± 0.159
1.341ArgTyr: 1.341 ± 0.343
0.0ArgXaa: 0.0 ± 0.0
Ser
7.425SerAla: 7.425 ± 0.781
0.412SerCys: 0.412 ± 0.211
2.475SerAsp: 2.475 ± 0.578
2.475SerGlu: 2.475 ± 0.41
2.269SerPhe: 2.269 ± 0.442
3.403SerGly: 3.403 ± 0.559
1.444SerHis: 1.444 ± 0.397
3.506SerIle: 3.506 ± 0.566
1.753SerLys: 1.753 ± 0.501
4.537SerLeu: 4.537 ± 0.75
1.031SerMet: 1.031 ± 0.365
1.444SerAsn: 1.444 ± 0.372
2.991SerPro: 2.991 ± 0.48
2.269SerGln: 2.269 ± 0.425
4.125SerArg: 4.125 ± 0.613
2.681SerSer: 2.681 ± 0.554
4.537SerThr: 4.537 ± 0.832
2.991SerVal: 2.991 ± 0.533
1.341SerTrp: 1.341 ± 0.374
0.516SerTyr: 0.516 ± 0.241
0.0SerXaa: 0.0 ± 0.0
Thr
6.909ThrAla: 6.909 ± 0.778
0.619ThrCys: 0.619 ± 0.245
4.022ThrAsp: 4.022 ± 0.725
2.578ThrGlu: 2.578 ± 0.466
2.991ThrPhe: 2.991 ± 0.655
5.259ThrGly: 5.259 ± 0.989
0.928ThrHis: 0.928 ± 0.265
3.712ThrIle: 3.712 ± 0.677
2.887ThrLys: 2.887 ± 0.521
5.878ThrLeu: 5.878 ± 0.958
1.856ThrMet: 1.856 ± 0.368
1.959ThrAsn: 1.959 ± 0.405
3.816ThrPro: 3.816 ± 0.515
2.166ThrGln: 2.166 ± 0.352
4.022ThrArg: 4.022 ± 0.538
3.712ThrSer: 3.712 ± 0.6
4.95ThrThr: 4.95 ± 1.03
5.156ThrVal: 5.156 ± 0.69
0.206ThrTrp: 0.206 ± 0.17
1.341ThrTyr: 1.341 ± 0.368
0.0ThrXaa: 0.0 ± 0.0
Val
9.075ValAla: 9.075 ± 1.09
0.309ValCys: 0.309 ± 0.174
4.434ValAsp: 4.434 ± 0.587
2.784ValGlu: 2.784 ± 0.624
2.269ValPhe: 2.269 ± 0.505
4.125ValGly: 4.125 ± 0.735
0.722ValHis: 0.722 ± 0.325
3.919ValIle: 3.919 ± 0.501
2.681ValLys: 2.681 ± 0.446
4.744ValLeu: 4.744 ± 0.469
2.166ValMet: 2.166 ± 0.517
2.062ValAsn: 2.062 ± 0.411
2.784ValPro: 2.784 ± 0.449
3.094ValGln: 3.094 ± 0.6
4.95ValArg: 4.95 ± 0.595
4.125ValSer: 4.125 ± 0.565
3.816ValThr: 3.816 ± 0.699
4.847ValVal: 4.847 ± 0.737
1.134ValTrp: 1.134 ± 0.384
1.444ValTyr: 1.444 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
1.237TrpAla: 1.237 ± 0.309
0.0TrpCys: 0.0 ± 0.0
0.412TrpAsp: 0.412 ± 0.204
0.928TrpGlu: 0.928 ± 0.295
1.031TrpPhe: 1.031 ± 0.294
0.722TrpGly: 0.722 ± 0.287
0.619TrpHis: 0.619 ± 0.273
0.206TrpIle: 0.206 ± 0.127
0.619TrpLys: 0.619 ± 0.228
1.959TrpLeu: 1.959 ± 0.357
0.0TrpMet: 0.0 ± 0.0
0.309TrpAsn: 0.309 ± 0.189
0.825TrpPro: 0.825 ± 0.352
0.309TrpGln: 0.309 ± 0.163
1.237TrpArg: 1.237 ± 0.353
0.722TrpSer: 0.722 ± 0.298
0.722TrpThr: 0.722 ± 0.26
0.722TrpVal: 0.722 ± 0.272
0.0TrpTrp: 0.0 ± 0.0
0.206TrpTyr: 0.206 ± 0.14
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.887TyrAla: 2.887 ± 0.615
0.309TyrCys: 0.309 ± 0.198
1.65TyrAsp: 1.65 ± 0.384
1.444TyrGlu: 1.444 ± 0.445
1.444TyrPhe: 1.444 ± 0.403
1.547TyrGly: 1.547 ± 0.38
0.825TyrHis: 0.825 ± 0.247
0.825TyrIle: 0.825 ± 0.23
0.309TyrLys: 0.309 ± 0.177
2.578TyrLeu: 2.578 ± 0.519
0.309TyrMet: 0.309 ± 0.151
0.722TyrAsn: 0.722 ± 0.263
0.928TyrPro: 0.928 ± 0.297
1.65TyrGln: 1.65 ± 0.343
2.062TyrArg: 2.062 ± 0.481
1.134TyrSer: 1.134 ± 0.315
2.062TyrThr: 2.062 ± 0.352
1.753TyrVal: 1.753 ± 0.444
0.412TyrTrp: 0.412 ± 0.156
0.412TyrTyr: 0.412 ± 0.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (9698 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski