Amino acid dipepetide frequency for Mycobacterium phage B1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.786AlaAla: 10.786 ± 1.321
0.729AlaCys: 0.729 ± 0.248
5.539AlaAsp: 5.539 ± 0.792
6.778AlaGlu: 6.778 ± 0.76
3.061AlaPhe: 3.061 ± 0.576
7.288AlaGly: 7.288 ± 0.911
2.041AlaHis: 2.041 ± 0.443
3.717AlaIle: 3.717 ± 0.605
4.664AlaLys: 4.664 ± 0.775
9.693AlaLeu: 9.693 ± 0.99
2.405AlaMet: 2.405 ± 0.39
2.332AlaAsn: 2.332 ± 0.444
5.539AlaPro: 5.539 ± 0.87
4.519AlaGln: 4.519 ± 0.574
5.393AlaArg: 5.393 ± 0.532
5.466AlaSer: 5.466 ± 0.602
5.393AlaThr: 5.393 ± 0.589
7.288AlaVal: 7.288 ± 0.784
2.041AlaTrp: 2.041 ± 0.408
2.114AlaTyr: 2.114 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.437CysAla: 0.437 ± 0.195
0.0CysCys: 0.0 ± 0.0
0.437CysAsp: 0.437 ± 0.173
0.802CysGlu: 0.802 ± 0.237
0.146CysPhe: 0.146 ± 0.099
0.656CysGly: 0.656 ± 0.166
0.219CysHis: 0.219 ± 0.12
0.292CysIle: 0.292 ± 0.16
0.364CysLys: 0.364 ± 0.161
0.51CysLeu: 0.51 ± 0.195
0.292CysMet: 0.292 ± 0.157
0.437CysAsn: 0.437 ± 0.168
0.51CysPro: 0.51 ± 0.261
0.219CysGln: 0.219 ± 0.13
0.729CysArg: 0.729 ± 0.189
0.292CysSer: 0.292 ± 0.154
0.219CysThr: 0.219 ± 0.154
0.437CysVal: 0.437 ± 0.177
0.073CysTrp: 0.073 ± 0.071
0.364CysTyr: 0.364 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
6.341AspAla: 6.341 ± 0.713
0.583AspCys: 0.583 ± 0.206
4.3AspAsp: 4.3 ± 0.679
4.592AspGlu: 4.592 ± 0.62
2.624AspPhe: 2.624 ± 0.415
5.175AspGly: 5.175 ± 0.522
1.603AspHis: 1.603 ± 0.382
2.624AspIle: 2.624 ± 0.454
1.603AspLys: 1.603 ± 0.295
5.758AspLeu: 5.758 ± 0.598
1.458AspMet: 1.458 ± 0.413
2.041AspAsn: 2.041 ± 0.436
4.446AspPro: 4.446 ± 0.703
1.968AspGln: 1.968 ± 0.389
4.008AspArg: 4.008 ± 0.748
3.207AspSer: 3.207 ± 0.527
4.008AspThr: 4.008 ± 0.558
3.936AspVal: 3.936 ± 0.375
1.312AspTrp: 1.312 ± 0.295
2.259AspTyr: 2.259 ± 0.403
0.0AspXaa: 0.0 ± 0.0
Glu
7.142GluAla: 7.142 ± 0.753
0.437GluCys: 0.437 ± 0.144
4.592GluAsp: 4.592 ± 0.663
4.008GluGlu: 4.008 ± 0.501
2.915GluPhe: 2.915 ± 0.414
5.029GluGly: 5.029 ± 0.694
1.458GluHis: 1.458 ± 0.35
2.551GluIle: 2.551 ± 0.426
2.478GluLys: 2.478 ± 0.342
6.486GluLeu: 6.486 ± 0.808
1.822GluMet: 1.822 ± 0.364
2.478GluAsn: 2.478 ± 0.424
3.061GluPro: 3.061 ± 0.482
2.624GluGln: 2.624 ± 0.403
4.154GluArg: 4.154 ± 0.641
4.154GluSer: 4.154 ± 0.636
3.936GluThr: 3.936 ± 0.482
5.393GluVal: 5.393 ± 0.808
1.166GluTrp: 1.166 ± 0.262
2.551GluTyr: 2.551 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
3.498PheAla: 3.498 ± 0.688
0.292PheCys: 0.292 ± 0.178
3.061PheAsp: 3.061 ± 0.437
2.624PheGlu: 2.624 ± 0.366
0.51PhePhe: 0.51 ± 0.197
2.842PheGly: 2.842 ± 0.489
0.51PheHis: 0.51 ± 0.187
1.531PheIle: 1.531 ± 0.337
1.093PheLys: 1.093 ± 0.312
3.061PheLeu: 3.061 ± 0.461
0.656PheMet: 0.656 ± 0.22
1.822PheAsn: 1.822 ± 0.471
1.749PhePro: 1.749 ± 0.374
1.093PheGln: 1.093 ± 0.254
1.895PheArg: 1.895 ± 0.28
1.749PheSer: 1.749 ± 0.381
1.749PheThr: 1.749 ± 0.392
2.405PheVal: 2.405 ± 0.429
0.51PheTrp: 0.51 ± 0.172
0.802PheTyr: 0.802 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
6.705GlyAla: 6.705 ± 0.958
0.875GlyCys: 0.875 ± 0.269
4.737GlyAsp: 4.737 ± 0.703
4.446GlyGlu: 4.446 ± 0.565
3.717GlyPhe: 3.717 ± 0.527
7.653GlyGly: 7.653 ± 1.309
1.822GlyHis: 1.822 ± 0.354
4.227GlyIle: 4.227 ± 0.572
3.207GlyLys: 3.207 ± 0.472
6.049GlyLeu: 6.049 ± 0.985
2.041GlyMet: 2.041 ± 0.397
3.717GlyAsn: 3.717 ± 0.67
5.466GlyPro: 5.466 ± 2.265
3.207GlyGln: 3.207 ± 0.525
4.737GlyArg: 4.737 ± 0.57
4.373GlySer: 4.373 ± 0.604
5.247GlyThr: 5.247 ± 0.65
5.247GlyVal: 5.247 ± 0.528
1.458GlyTrp: 1.458 ± 0.304
2.842GlyTyr: 2.842 ± 0.385
0.0GlyXaa: 0.0 ± 0.0
His
1.822HisAla: 1.822 ± 0.354
0.146HisCys: 0.146 ± 0.101
1.385HisAsp: 1.385 ± 0.339
1.312HisGlu: 1.312 ± 0.28
0.656HisPhe: 0.656 ± 0.199
2.405HisGly: 2.405 ± 0.531
0.656HisHis: 0.656 ± 0.209
1.093HisIle: 1.093 ± 0.235
0.729HisLys: 0.729 ± 0.224
1.458HisLeu: 1.458 ± 0.351
0.292HisMet: 0.292 ± 0.132
0.656HisAsn: 0.656 ± 0.181
1.385HisPro: 1.385 ± 0.292
1.02HisGln: 1.02 ± 0.291
1.822HisArg: 1.822 ± 0.351
0.583HisSer: 0.583 ± 0.251
1.166HisThr: 1.166 ± 0.28
1.166HisVal: 1.166 ± 0.277
0.364HisTrp: 0.364 ± 0.203
0.802HisTyr: 0.802 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
5.029IleAla: 5.029 ± 0.612
0.364IleCys: 0.364 ± 0.137
3.644IleAsp: 3.644 ± 0.5
4.008IleGlu: 4.008 ± 0.612
1.093IlePhe: 1.093 ± 0.302
4.008IleGly: 4.008 ± 0.425
1.093IleHis: 1.093 ± 0.306
1.749IleIle: 1.749 ± 0.378
1.093IleLys: 1.093 ± 0.263
2.624IleLeu: 2.624 ± 0.49
0.583IleMet: 0.583 ± 0.254
1.749IleAsn: 1.749 ± 0.34
3.498IlePro: 3.498 ± 0.401
1.822IleGln: 1.822 ± 0.414
3.644IleArg: 3.644 ± 0.514
2.332IleSer: 2.332 ± 0.564
3.717IleThr: 3.717 ± 0.592
3.134IleVal: 3.134 ± 0.481
0.802IleTrp: 0.802 ± 0.238
1.239IleTyr: 1.239 ± 0.314
0.0IleXaa: 0.0 ± 0.0
Lys
4.081LysAla: 4.081 ± 0.525
0.219LysCys: 0.219 ± 0.135
2.551LysAsp: 2.551 ± 0.406
3.061LysGlu: 3.061 ± 0.492
0.875LysPhe: 0.875 ± 0.254
3.571LysGly: 3.571 ± 0.593
0.947LysHis: 0.947 ± 0.263
1.968LysIle: 1.968 ± 0.376
2.551LysLys: 2.551 ± 0.54
3.498LysLeu: 3.498 ± 0.36
0.875LysMet: 0.875 ± 0.189
1.093LysAsn: 1.093 ± 0.266
2.624LysPro: 2.624 ± 0.575
1.312LysGln: 1.312 ± 0.288
2.624LysArg: 2.624 ± 0.504
2.551LysSer: 2.551 ± 0.523
2.697LysThr: 2.697 ± 0.553
3.936LysVal: 3.936 ± 0.646
0.875LysTrp: 0.875 ± 0.3
1.166LysTyr: 1.166 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
8.308LeuAla: 8.308 ± 0.828
0.292LeuCys: 0.292 ± 0.177
6.122LeuAsp: 6.122 ± 0.563
6.049LeuGlu: 6.049 ± 0.66
2.041LeuPhe: 2.041 ± 0.344
6.414LeuGly: 6.414 ± 0.929
1.458LeuHis: 1.458 ± 0.44
5.102LeuIle: 5.102 ± 0.551
3.863LeuLys: 3.863 ± 0.404
6.195LeuLeu: 6.195 ± 0.667
2.478LeuMet: 2.478 ± 0.411
2.769LeuAsn: 2.769 ± 0.404
3.936LeuPro: 3.936 ± 0.485
2.259LeuGln: 2.259 ± 0.819
7.142LeuArg: 7.142 ± 0.574
4.081LeuSer: 4.081 ± 0.502
5.175LeuThr: 5.175 ± 0.667
5.029LeuVal: 5.029 ± 0.524
1.385LeuTrp: 1.385 ± 0.314
2.697LeuTyr: 2.697 ± 0.528
0.0LeuXaa: 0.0 ± 0.0
Met
3.28MetAla: 3.28 ± 0.566
0.073MetCys: 0.073 ± 0.068
1.895MetAsp: 1.895 ± 0.392
1.312MetGlu: 1.312 ± 0.307
0.802MetPhe: 0.802 ± 0.25
1.822MetGly: 1.822 ± 0.339
0.219MetHis: 0.219 ± 0.15
1.166MetIle: 1.166 ± 0.302
1.603MetLys: 1.603 ± 0.305
1.458MetLeu: 1.458 ± 0.349
0.583MetMet: 0.583 ± 0.216
1.166MetAsn: 1.166 ± 0.296
1.603MetPro: 1.603 ± 0.368
0.875MetGln: 0.875 ± 0.281
1.531MetArg: 1.531 ± 0.445
1.676MetSer: 1.676 ± 0.351
1.458MetThr: 1.458 ± 0.368
1.312MetVal: 1.312 ± 0.323
0.219MetTrp: 0.219 ± 0.125
0.583MetTyr: 0.583 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
3.498AsnAla: 3.498 ± 0.383
0.219AsnCys: 0.219 ± 0.097
2.114AsnAsp: 2.114 ± 0.372
1.822AsnGlu: 1.822 ± 0.393
1.385AsnPhe: 1.385 ± 0.448
3.717AsnGly: 3.717 ± 0.778
1.02AsnHis: 1.02 ± 0.239
1.895AsnIle: 1.895 ± 0.499
1.166AsnLys: 1.166 ± 0.27
2.332AsnLeu: 2.332 ± 0.328
0.802AsnMet: 0.802 ± 0.253
1.093AsnAsn: 1.093 ± 0.279
2.697AsnPro: 2.697 ± 0.609
1.312AsnGln: 1.312 ± 0.32
2.041AsnArg: 2.041 ± 0.471
1.531AsnSer: 1.531 ± 0.273
1.749AsnThr: 1.749 ± 0.289
3.134AsnVal: 3.134 ± 0.565
0.802AsnTrp: 0.802 ± 0.21
1.385AsnTyr: 1.385 ± 0.394
0.0AsnXaa: 0.0 ± 0.0
Pro
5.466ProAla: 5.466 ± 0.567
0.364ProCys: 0.364 ± 0.183
3.571ProAsp: 3.571 ± 0.465
4.154ProGlu: 4.154 ± 0.53
1.676ProPhe: 1.676 ± 0.32
4.956ProGly: 4.956 ± 0.786
1.166ProHis: 1.166 ± 0.252
2.478ProIle: 2.478 ± 0.338
3.28ProLys: 3.28 ± 0.666
4.592ProLeu: 4.592 ± 0.65
0.875ProMet: 0.875 ± 0.288
2.842ProAsn: 2.842 ± 0.655
3.134ProPro: 3.134 ± 0.645
3.353ProGln: 3.353 ± 1.344
2.842ProArg: 2.842 ± 0.503
2.842ProSer: 2.842 ± 0.432
3.717ProThr: 3.717 ± 0.575
4.227ProVal: 4.227 ± 0.616
0.947ProTrp: 0.947 ± 0.423
1.166ProTyr: 1.166 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
4.373GlnAla: 4.373 ± 0.71
0.146GlnCys: 0.146 ± 0.09
1.458GlnAsp: 1.458 ± 0.295
2.405GlnGlu: 2.405 ± 0.552
1.531GlnPhe: 1.531 ± 0.387
4.227GlnGly: 4.227 ± 1.754
0.729GlnHis: 0.729 ± 0.203
1.968GlnIle: 1.968 ± 0.405
1.312GlnLys: 1.312 ± 0.294
3.863GlnLeu: 3.863 ± 0.775
1.312GlnMet: 1.312 ± 0.384
0.802GlnAsn: 0.802 ± 0.244
1.676GlnPro: 1.676 ± 0.359
2.259GlnGln: 2.259 ± 0.643
2.769GlnArg: 2.769 ± 0.515
1.458GlnSer: 1.458 ± 0.344
1.895GlnThr: 1.895 ± 0.469
2.624GlnVal: 2.624 ± 0.468
0.947GlnTrp: 0.947 ± 0.265
0.729GlnTyr: 0.729 ± 0.19
0.0GlnXaa: 0.0 ± 0.0
Arg
5.393ArgAla: 5.393 ± 0.598
0.656ArgCys: 0.656 ± 0.272
3.571ArgAsp: 3.571 ± 0.473
4.81ArgGlu: 4.81 ± 0.584
2.405ArgPhe: 2.405 ± 0.398
4.227ArgGly: 4.227 ± 0.53
1.385ArgHis: 1.385 ± 0.316
4.008ArgIle: 4.008 ± 0.466
3.644ArgLys: 3.644 ± 0.643
5.466ArgLeu: 5.466 ± 0.614
2.332ArgMet: 2.332 ± 0.407
2.478ArgAsn: 2.478 ± 0.442
3.061ArgPro: 3.061 ± 0.469
1.968ArgGln: 1.968 ± 0.411
5.83ArgArg: 5.83 ± 0.773
3.644ArgSer: 3.644 ± 0.55
2.915ArgThr: 2.915 ± 0.446
5.685ArgVal: 5.685 ± 0.537
1.603ArgTrp: 1.603 ± 0.415
1.822ArgTyr: 1.822 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
3.717SerAla: 3.717 ± 0.54
0.146SerCys: 0.146 ± 0.102
2.842SerAsp: 2.842 ± 0.418
4.081SerGlu: 4.081 ± 0.494
2.697SerPhe: 2.697 ± 0.574
5.029SerGly: 5.029 ± 0.842
1.166SerHis: 1.166 ± 0.305
2.478SerIle: 2.478 ± 0.355
2.405SerLys: 2.405 ± 0.463
4.008SerLeu: 4.008 ± 0.636
1.385SerMet: 1.385 ± 0.309
1.968SerAsn: 1.968 ± 0.335
2.259SerPro: 2.259 ± 0.385
1.822SerGln: 1.822 ± 0.496
3.717SerArg: 3.717 ± 0.551
2.405SerSer: 2.405 ± 0.421
2.405SerThr: 2.405 ± 0.449
3.863SerVal: 3.863 ± 0.531
1.239SerTrp: 1.239 ± 0.318
1.458SerTyr: 1.458 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
5.32ThrAla: 5.32 ± 0.587
0.729ThrCys: 0.729 ± 0.249
3.425ThrAsp: 3.425 ± 0.601
3.717ThrGlu: 3.717 ± 0.551
1.749ThrPhe: 1.749 ± 0.378
4.592ThrGly: 4.592 ± 0.608
1.239ThrHis: 1.239 ± 0.385
2.332ThrIle: 2.332 ± 0.476
2.478ThrLys: 2.478 ± 0.385
5.758ThrLeu: 5.758 ± 0.742
1.093ThrMet: 1.093 ± 0.266
1.603ThrAsn: 1.603 ± 0.419
4.081ThrPro: 4.081 ± 0.521
2.405ThrGln: 2.405 ± 0.502
3.498ThrArg: 3.498 ± 0.563
2.405ThrSer: 2.405 ± 0.543
3.134ThrThr: 3.134 ± 0.504
5.029ThrVal: 5.029 ± 0.689
1.093ThrTrp: 1.093 ± 0.267
1.603ThrTyr: 1.603 ± 0.409
0.0ThrXaa: 0.0 ± 0.0
Val
7.215ValAla: 7.215 ± 0.804
0.51ValCys: 0.51 ± 0.178
5.029ValAsp: 5.029 ± 0.684
5.466ValGlu: 5.466 ± 0.665
2.186ValPhe: 2.186 ± 0.485
4.592ValGly: 4.592 ± 0.762
1.312ValHis: 1.312 ± 0.367
3.207ValIle: 3.207 ± 0.536
4.008ValLys: 4.008 ± 0.699
5.685ValLeu: 5.685 ± 0.566
1.458ValMet: 1.458 ± 0.302
2.988ValAsn: 2.988 ± 0.504
4.446ValPro: 4.446 ± 0.574
2.551ValGln: 2.551 ± 0.467
5.247ValArg: 5.247 ± 0.769
4.227ValSer: 4.227 ± 0.542
3.936ValThr: 3.936 ± 0.598
6.341ValVal: 6.341 ± 0.541
1.239ValTrp: 1.239 ± 0.333
2.259ValTyr: 2.259 ± 0.451
0.0ValXaa: 0.0 ± 0.0
Trp
1.822TrpAla: 1.822 ± 0.433
0.364TrpCys: 0.364 ± 0.215
1.239TrpAsp: 1.239 ± 0.383
1.093TrpGlu: 1.093 ± 0.243
0.875TrpPhe: 0.875 ± 0.217
1.822TrpGly: 1.822 ± 0.326
0.292TrpHis: 0.292 ± 0.142
1.749TrpIle: 1.749 ± 0.359
0.656TrpLys: 0.656 ± 0.177
1.166TrpLeu: 1.166 ± 0.283
0.802TrpMet: 0.802 ± 0.252
0.51TrpAsn: 0.51 ± 0.185
0.875TrpPro: 0.875 ± 0.31
0.802TrpGln: 0.802 ± 0.206
0.947TrpArg: 0.947 ± 0.259
0.802TrpSer: 0.802 ± 0.209
1.166TrpThr: 1.166 ± 0.353
1.385TrpVal: 1.385 ± 0.267
0.656TrpTrp: 0.656 ± 0.174
0.437TrpTyr: 0.437 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.551TyrAla: 2.551 ± 0.412
0.292TyrCys: 0.292 ± 0.153
2.405TyrAsp: 2.405 ± 0.5
1.895TyrGlu: 1.895 ± 0.429
0.51TyrPhe: 0.51 ± 0.182
1.749TyrGly: 1.749 ± 0.394
0.583TyrHis: 0.583 ± 0.19
1.093TyrIle: 1.093 ± 0.308
0.875TyrLys: 0.875 ± 0.298
3.134TyrLeu: 3.134 ± 0.494
1.02TyrMet: 1.02 ± 0.301
1.166TyrAsn: 1.166 ± 0.285
1.603TyrPro: 1.603 ± 0.31
1.093TyrGln: 1.093 ± 0.245
2.332TyrArg: 2.332 ± 0.456
1.312TyrSer: 1.312 ± 0.304
1.603TyrThr: 1.603 ± 0.345
2.332TyrVal: 2.332 ± 0.393
0.729TyrTrp: 0.729 ± 0.244
0.875TyrTyr: 0.875 ± 0.325
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13722 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski