Amino acid dipeptide frequency for Burkholderia phage BcepGomr

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
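As an illustration, dipeptide frequencies can be computed by counting overlapping pairs of adjacent residues. The sketch below is a minimal example, not the procedure used by Proteome-pI: the function names, the toy sequences, the choice to never count a pair across two proteins, and the normalisation by the total number of dipeptides are all assumptions.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter code to its three-letter code; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs (assumption: pairs never span two proteins)."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return the frequency of each observed dipeptide in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAG", "AAX"]
freqs = dipeptide_per_mille(toy)
```

For the toy input there are five dipeptides in total (MetAla, AlaAla, AlaGly, AlaAla, AlaXaa), so AlaAla accounts for two of the five.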

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
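The uniform expectation and the over/under-representation labels can be sketched as follows. This is only an illustration: the function names are hypothetical, and the page does not state whether the colouring uses the 400-dipeptide or the 441-dipeptide expectation, or whether the error estimate is taken into account, so the plain comparison against 1/400 is an assumption.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation in per mille: every ordered pair is equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed, expected):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 400 possible dipeptides
expected_21 = expected_per_mille(21)  # 441 possible dipeptides, Xaa included

# Three values copied from the table on this page (per mille).
examples = {"AlaAla": 14.438, "CysHis": 0.061, "IleIle": 2.559}
labels = {pair: classify(value, expected_20) for pair, value in examples.items()}
```

With the 400-dipeptide expectation of 2.5‰, AlaAla and IleIle come out as overrepresented and CysHis as underrepresented.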

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
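The unit conversion can be written out as a short sketch; the helper names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla value copied from the table on this page.
ala_ala_fraction = per_mille_to_fraction(14.438)
ala_ala_percent = per_mille_to_percent(14.438)
```

For example, the AlaAla value of 14.438‰ corresponds to a fraction of about 0.014438, or roughly 1.44%.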

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.438 ± 2.039
AlaCys: 1.645 ± 0.297
AlaAsp: 6.458 ± 0.559
AlaGlu: 7.006 ± 1.243
AlaPhe: 4.325 ± 0.535
AlaGly: 8.712 ± 1.043
AlaHis: 1.584 ± 0.351
AlaIle: 6.275 ± 0.65
AlaLys: 5.422 ± 0.817
AlaLeu: 7.189 ± 0.762
AlaMet: 3.838 ± 0.448
AlaAsn: 4.63 ± 0.492
AlaPro: 6.275 ± 1.293
AlaGln: 4.143 ± 0.557
AlaArg: 5.909 ± 0.688
AlaSer: 5.726 ± 0.668
AlaThr: 7.006 ± 0.999
AlaVal: 5.605 ± 0.623
AlaTrp: 1.767 ± 0.39
AlaTyr: 2.376 ± 0.437
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.731 ± 0.237
CysCys: 0.366 ± 0.142
CysAsp: 0.731 ± 0.231
CysGlu: 1.036 ± 0.293
CysPhe: 0.366 ± 0.151
CysGly: 0.487 ± 0.219
CysHis: 0.061 ± 0.063
CysIle: 0.731 ± 0.194
CysLys: 0.609 ± 0.217
CysLeu: 0.975 ± 0.226
CysMet: 0.366 ± 0.141
CysAsn: 0.366 ± 0.129
CysPro: 0.853 ± 0.241
CysGln: 0.183 ± 0.122
CysArg: 0.487 ± 0.166
CysSer: 0.487 ± 0.159
CysThr: 0.792 ± 0.244
CysVal: 0.792 ± 0.223
CysTrp: 0.122 ± 0.081
CysTyr: 0.244 ± 0.149
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.579 ± 0.618
AspCys: 0.609 ± 0.177
AspAsp: 2.68 ± 0.512
AspGlu: 3.594 ± 0.43
AspPhe: 2.68 ± 0.375
AspGly: 5.605 ± 0.627
AspHis: 0.975 ± 0.248
AspIle: 2.68 ± 0.344
AspLys: 2.193 ± 0.397
AspLeu: 4.021 ± 0.419
AspMet: 1.401 ± 0.321
AspAsn: 1.828 ± 0.352
AspPro: 2.802 ± 0.394
AspGln: 2.741 ± 0.431
AspArg: 3.29 ± 0.462
AspSer: 2.985 ± 0.42
AspThr: 2.68 ± 0.344
AspVal: 3.777 ± 0.495
AspTrp: 1.889 ± 0.353
AspTyr: 1.523 ± 0.341
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.31 ± 1.054
GluCys: 0.914 ± 0.283
GluAsp: 2.62 ± 0.514
GluGlu: 3.472 ± 0.54
GluPhe: 2.315 ± 0.309
GluGly: 3.777 ± 0.481
GluHis: 1.097 ± 0.272
GluIle: 3.168 ± 0.354
GluLys: 3.168 ± 0.607
GluLeu: 4.935 ± 0.564
GluMet: 2.802 ± 0.459
GluAsn: 3.046 ± 0.521
GluPro: 1.401 ± 0.279
GluGln: 3.533 ± 0.544
GluArg: 4.082 ± 0.58
GluSer: 3.046 ± 0.419
GluThr: 1.949 ± 0.39
GluVal: 4.082 ± 0.621
GluTrp: 0.975 ± 0.223
GluTyr: 2.68 ± 0.336
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.655 ± 0.57
PheCys: 0.366 ± 0.149
PheAsp: 2.559 ± 0.453
PheGlu: 2.376 ± 0.373
PhePhe: 0.914 ± 0.235
PheGly: 3.655 ± 0.444
PheHis: 0.731 ± 0.206
PheIle: 2.132 ± 0.307
PheLys: 1.889 ± 0.34
PheLeu: 2.071 ± 0.388
PheMet: 0.67 ± 0.19
PheAsn: 2.01 ± 0.365
PhePro: 1.828 ± 0.3
PheGln: 0.792 ± 0.254
PheArg: 2.559 ± 0.568
PheSer: 2.01 ± 0.387
PheThr: 2.315 ± 0.377
PheVal: 2.863 ± 0.339
PheTrp: 0.67 ± 0.197
PheTyr: 1.279 ± 0.264
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.371 ± 0.878
GlyCys: 0.914 ± 0.276
GlyAsp: 3.533 ± 0.537
GlyGlu: 4.995 ± 0.597
GlyPhe: 2.559 ± 0.37
GlyGly: 6.945 ± 0.958
GlyHis: 1.523 ± 0.324
GlyIle: 4.264 ± 0.507
GlyLys: 4.569 ± 0.403
GlyLeu: 6.336 ± 0.666
GlyMet: 1.828 ± 0.313
GlyAsn: 4.935 ± 0.994
GlyPro: 2.68 ± 0.507
GlyGln: 4.508 ± 0.469
GlyArg: 4.021 ± 0.627
GlySer: 5.361 ± 0.636
GlyThr: 4.874 ± 0.644
GlyVal: 5.422 ± 0.673
GlyTrp: 1.157 ± 0.272
GlyTyr: 3.168 ± 0.423
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.889 ± 0.363
HisCys: 0.366 ± 0.16
HisAsp: 0.67 ± 0.16
HisGlu: 1.036 ± 0.231
HisPhe: 0.853 ± 0.268
HisGly: 1.401 ± 0.308
HisHis: 0.426 ± 0.171
HisIle: 1.036 ± 0.292
HisLys: 0.731 ± 0.229
HisLeu: 1.097 ± 0.27
HisMet: 0.609 ± 0.197
HisAsn: 0.426 ± 0.156
HisPro: 0.975 ± 0.255
HisGln: 0.426 ± 0.189
HisArg: 0.853 ± 0.214
HisSer: 1.097 ± 0.276
HisThr: 0.548 ± 0.217
HisVal: 1.523 ± 0.287
HisTrp: 0.305 ± 0.174
HisTyr: 0.853 ± 0.279
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.483 ± 0.575
IleCys: 0.426 ± 0.202
IleAsp: 4.63 ± 0.496
IleGlu: 3.899 ± 0.499
IlePhe: 2.254 ± 0.354
IleGly: 4.447 ± 0.49
IleHis: 1.036 ± 0.255
IleIle: 2.559 ± 0.461
IleLys: 3.412 ± 0.54
IleLeu: 3.533 ± 0.419
IleMet: 1.401 ± 0.246
IleAsn: 2.863 ± 0.444
IlePro: 3.046 ± 0.483
IleGln: 1.889 ± 0.286
IleArg: 2.741 ± 0.391
IleSer: 2.071 ± 0.428
IleThr: 2.924 ± 0.524
IleVal: 3.107 ± 0.383
IleTrp: 0.792 ± 0.212
IleTyr: 1.828 ± 0.319
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.676 ± 0.953
LysCys: 0.244 ± 0.139
LysAsp: 2.559 ± 0.507
LysGlu: 3.472 ± 0.51
LysPhe: 1.645 ± 0.349
LysGly: 4.203 ± 0.579
LysHis: 1.462 ± 0.308
LysIle: 2.559 ± 0.347
LysLys: 2.68 ± 0.644
LysLeu: 3.716 ± 0.485
LysMet: 1.523 ± 0.377
LysAsn: 1.706 ± 0.286
LysPro: 2.315 ± 0.388
LysGln: 2.437 ± 0.406
LysArg: 2.376 ± 0.355
LysSer: 2.376 ± 0.407
LysThr: 2.741 ± 0.432
LysVal: 3.716 ± 0.406
LysTrp: 1.097 ± 0.218
LysTyr: 1.34 ± 0.261
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.945 ± 0.703
LeuCys: 0.792 ± 0.248
LeuAsp: 5.239 ± 0.52
LeuGlu: 3.96 ± 0.558
LeuPhe: 2.924 ± 0.482
LeuGly: 4.447 ± 0.577
LeuHis: 0.975 ± 0.232
LeuIle: 3.229 ± 0.542
LeuLys: 3.899 ± 0.558
LeuLeu: 4.021 ± 0.725
LeuMet: 1.767 ± 0.29
LeuAsn: 3.96 ± 0.723
LeuPro: 4.082 ± 0.501
LeuGln: 2.863 ± 0.563
LeuArg: 4.325 ± 0.488
LeuSer: 4.447 ± 0.461
LeuThr: 4.447 ± 0.464
LeuVal: 3.899 ± 0.357
LeuTrp: 0.792 ± 0.185
LeuTyr: 2.254 ± 0.385
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.229 ± 0.435
MetCys: 0.183 ± 0.113
MetAsp: 2.254 ± 0.345
MetGlu: 2.193 ± 0.449
MetPhe: 0.548 ± 0.172
MetGly: 1.767 ± 0.316
MetHis: 0.548 ± 0.167
MetIle: 1.157 ± 0.26
MetLys: 1.706 ± 0.287
MetLeu: 1.889 ± 0.363
MetMet: 1.218 ± 0.323
MetAsn: 1.218 ± 0.283
MetPro: 1.584 ± 0.309
MetGln: 1.401 ± 0.368
MetArg: 1.828 ± 0.322
MetSer: 2.01 ± 0.304
MetThr: 2.132 ± 0.354
MetVal: 2.01 ± 0.353
MetTrp: 0.426 ± 0.15
MetTyr: 0.426 ± 0.158
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.752 ± 0.583
AsnCys: 0.426 ± 0.146
AsnAsp: 2.254 ± 0.421
AsnGlu: 2.437 ± 0.351
AsnPhe: 1.828 ± 0.435
AsnGly: 5.3 ± 0.733
AsnHis: 0.67 ± 0.164
AsnIle: 2.437 ± 0.281
AsnLys: 1.706 ± 0.358
AsnLeu: 3.046 ± 0.441
AsnMet: 1.401 ± 0.253
AsnAsn: 2.071 ± 0.566
AsnPro: 2.863 ± 0.445
AsnGln: 1.706 ± 0.3
AsnArg: 2.498 ± 0.542
AsnSer: 2.071 ± 0.358
AsnThr: 2.437 ± 0.343
AsnVal: 3.351 ± 0.546
AsnTrp: 1.218 ± 0.256
AsnTyr: 1.34 ± 0.34
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.726 ± 1.134
ProCys: 0.426 ± 0.147
ProAsp: 3.229 ± 0.51
ProGlu: 3.229 ± 0.522
ProPhe: 2.254 ± 0.397
ProGly: 3.96 ± 0.463
ProHis: 0.67 ± 0.224
ProIle: 2.071 ± 0.362
ProLys: 2.132 ± 0.321
ProLeu: 3.107 ± 0.408
ProMet: 0.914 ± 0.217
ProAsn: 2.62 ± 0.389
ProPro: 3.351 ± 0.646
ProGln: 1.706 ± 0.325
ProArg: 2.437 ± 0.548
ProSer: 2.741 ± 0.395
ProThr: 3.168 ± 0.423
ProVal: 4.143 ± 0.552
ProTrp: 0.731 ± 0.245
ProTyr: 1.218 ± 0.304
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.361 ± 0.787
GlnCys: 0.426 ± 0.163
GlnAsp: 1.889 ± 0.358
GlnGlu: 2.254 ± 0.385
GlnPhe: 2.071 ± 0.351
GlnGly: 3.472 ± 0.594
GlnHis: 0.487 ± 0.164
GlnIle: 2.437 ± 0.472
GlnLys: 2.01 ± 0.476
GlnLeu: 3.96 ± 0.643
GlnMet: 1.523 ± 0.372
GlnAsn: 1.34 ± 0.288
GlnPro: 1.889 ± 0.317
GlnGln: 3.29 ± 0.556
GlnArg: 1.767 ± 0.325
GlnSer: 2.132 ± 0.373
GlnThr: 2.437 ± 0.387
GlnVal: 3.107 ± 0.34
GlnTrp: 1.097 ± 0.283
GlnTyr: 1.645 ± 0.306
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.726 ± 0.538
ArgCys: 0.244 ± 0.138
ArgAsp: 2.741 ± 0.354
ArgGlu: 3.533 ± 0.493
ArgPhe: 2.071 ± 0.374
ArgGly: 4.021 ± 0.467
ArgHis: 0.853 ± 0.3
ArgIle: 3.655 ± 0.527
ArgLys: 2.62 ± 0.388
ArgLeu: 4.752 ± 0.485
ArgMet: 2.193 ± 0.36
ArgAsn: 2.437 ± 0.359
ArgPro: 2.863 ± 0.391
ArgGln: 2.254 ± 0.424
ArgArg: 3.046 ± 0.452
ArgSer: 3.168 ± 0.531
ArgThr: 2.68 ± 0.441
ArgVal: 3.472 ± 0.385
ArgTrp: 1.34 ± 0.369
ArgTyr: 2.071 ± 0.405
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.64 ± 0.718
SerCys: 0.426 ± 0.206
SerAsp: 2.802 ± 0.383
SerGlu: 2.559 ± 0.418
SerPhe: 1.584 ± 0.317
SerGly: 6.153 ± 1.022
SerHis: 1.036 ± 0.27
SerIle: 3.351 ± 0.589
SerLys: 3.107 ± 0.416
SerLeu: 3.533 ± 0.422
SerMet: 1.34 ± 0.265
SerAsn: 2.498 ± 0.399
SerPro: 2.376 ± 0.367
SerGln: 2.498 ± 0.379
SerArg: 3.351 ± 0.43
SerSer: 2.437 ± 0.416
SerThr: 2.802 ± 0.414
SerVal: 3.472 ± 0.504
SerTrp: 0.853 ± 0.226
SerTyr: 1.097 ± 0.281
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.483 ± 0.68
ThrCys: 0.609 ± 0.208
ThrAsp: 3.472 ± 0.463
ThrGlu: 3.29 ± 0.468
ThrPhe: 1.949 ± 0.321
ThrGly: 4.082 ± 0.491
ThrHis: 0.914 ± 0.217
ThrIle: 3.412 ± 0.553
ThrLys: 2.802 ± 0.465
ThrLeu: 3.594 ± 0.495
ThrMet: 1.401 ± 0.285
ThrAsn: 2.741 ± 0.544
ThrPro: 3.046 ± 0.44
ThrGln: 2.376 ± 0.418
ThrArg: 2.741 ± 0.397
ThrSer: 3.229 ± 0.384
ThrThr: 3.29 ± 0.486
ThrVal: 4.203 ± 0.573
ThrTrp: 1.036 ± 0.253
ThrTyr: 1.889 ± 0.296
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.067 ± 0.553
ValCys: 0.853 ± 0.292
ValAsp: 4.325 ± 0.41
ValGlu: 3.533 ± 0.505
ValPhe: 1.889 ± 0.281
ValGly: 5.117 ± 0.593
ValHis: 1.157 ± 0.371
ValIle: 4.143 ± 0.496
ValLys: 4.569 ± 0.553
ValLeu: 3.777 ± 0.527
ValMet: 2.01 ± 0.405
ValAsn: 2.559 ± 0.364
ValPro: 2.802 ± 0.475
ValGln: 3.351 ± 0.379
ValArg: 3.838 ± 0.588
ValSer: 4.082 ± 0.629
ValThr: 3.838 ± 0.641
ValVal: 4.508 ± 0.606
ValTrp: 1.157 ± 0.283
ValTyr: 1.889 ± 0.454
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.401 ± 0.258
TrpCys: 0.183 ± 0.099
TrpAsp: 0.914 ± 0.262
TrpGlu: 0.914 ± 0.23
TrpPhe: 1.097 ± 0.282
TrpGly: 0.914 ± 0.223
TrpHis: 0.305 ± 0.131
TrpIle: 0.914 ± 0.205
TrpLys: 1.218 ± 0.286
TrpLeu: 1.401 ± 0.294
TrpMet: 0.731 ± 0.207
TrpAsn: 0.975 ± 0.275
TrpPro: 0.67 ± 0.223
TrpGln: 1.157 ± 0.267
TrpArg: 1.462 ± 0.39
TrpSer: 1.157 ± 0.354
TrpThr: 0.853 ± 0.204
TrpVal: 1.462 ± 0.272
TrpTrp: 0.183 ± 0.104
TrpTyr: 0.731 ± 0.227
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.863 ± 0.559
TyrCys: 0.366 ± 0.205
TyrAsp: 0.975 ± 0.259
TyrGlu: 1.584 ± 0.28
TyrPhe: 1.218 ± 0.276
TyrGly: 2.498 ± 0.43
TyrHis: 0.609 ± 0.197
TyrIle: 2.132 ± 0.485
TyrLys: 1.462 ± 0.307
TyrLeu: 2.315 ± 0.385
TyrMet: 0.731 ± 0.227
TyrAsn: 1.584 ± 0.321
TyrPro: 2.132 ± 0.373
TyrGln: 1.34 ± 0.308
TyrArg: 2.132 ± 0.38
TyrSer: 1.279 ± 0.28
TyrThr: 1.645 ± 0.34
TyrVal: 1.949 ± 0.34
TyrTrp: 1.036 ± 0.266
TyrTyr: 0.975 ± 0.322
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (16416 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
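A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is a minimal sketch under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, one-letter sequences are used for brevity, and treating the reported ± value as a standard deviation over replicates is an assumption.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Mean and standard deviation (per mille) of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(counts[pair] for counts in sample)
        total = sum(sum(counts.values()) for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the phage data.
toy_proteins = ["MAAGAA", "MKLAA", "MGGSAAK", "MAVLK"]
mean_aa, sd_aa = bootstrap_error(toy_proteins, "AA")
```

Resampling at the protein level rather than at the residue level keeps each protein's internal composition intact, so the error reflects variation between proteins.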

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski