Amino acid dipepetide frequency for Rhizobium phage RHEph06

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.732AlaAla: 13.732 ± 1.526
0.663AlaCys: 0.663 ± 0.212
5.782AlaAsp: 5.782 ± 0.562
6.987AlaGlu: 6.987 ± 0.875
3.614AlaPhe: 3.614 ± 0.618
8.673AlaGly: 8.673 ± 0.861
1.506AlaHis: 1.506 ± 0.34
5.963AlaIle: 5.963 ± 0.478
6.866AlaLys: 6.866 ± 0.88
9.516AlaLeu: 9.516 ± 0.775
2.65AlaMet: 2.65 ± 0.386
3.975AlaAsn: 3.975 ± 0.499
3.674AlaPro: 3.674 ± 0.518
4.035AlaGln: 4.035 ± 0.519
5.481AlaArg: 5.481 ± 0.733
5.601AlaSer: 5.601 ± 0.677
6.565AlaThr: 6.565 ± 0.677
6.987AlaVal: 6.987 ± 0.683
1.265AlaTrp: 1.265 ± 0.291
3.192AlaTyr: 3.192 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.542CysAla: 0.542 ± 0.215
0.301CysCys: 0.301 ± 0.13
1.144CysAsp: 1.144 ± 0.35
0.843CysGlu: 0.843 ± 0.223
0.422CysPhe: 0.422 ± 0.17
0.964CysGly: 0.964 ± 0.285
0.301CysHis: 0.301 ± 0.111
0.663CysIle: 0.663 ± 0.201
0.783CysLys: 0.783 ± 0.252
0.964CysLeu: 0.964 ± 0.252
0.422CysMet: 0.422 ± 0.141
0.12CysAsn: 0.12 ± 0.073
0.542CysPro: 0.542 ± 0.172
0.361CysGln: 0.361 ± 0.165
0.843CysArg: 0.843 ± 0.27
0.723CysSer: 0.723 ± 0.199
0.602CysThr: 0.602 ± 0.189
0.422CysVal: 0.422 ± 0.175
0.12CysTrp: 0.12 ± 0.069
0.12CysTyr: 0.12 ± 0.079
0.0CysXaa: 0.0 ± 0.0
Asp
6.686AspAla: 6.686 ± 0.67
0.482AspCys: 0.482 ± 0.152
3.433AspAsp: 3.433 ± 0.532
4.457AspGlu: 4.457 ± 0.55
2.771AspPhe: 2.771 ± 0.403
5.421AspGly: 5.421 ± 0.634
1.446AspHis: 1.446 ± 0.333
3.433AspIle: 3.433 ± 0.548
2.469AspLys: 2.469 ± 0.475
5.421AspLeu: 5.421 ± 0.492
1.686AspMet: 1.686 ± 0.298
2.229AspAsn: 2.229 ± 0.406
3.252AspPro: 3.252 ± 0.439
2.108AspGln: 2.108 ± 0.344
2.65AspArg: 2.65 ± 0.385
2.891AspSer: 2.891 ± 0.459
2.59AspThr: 2.59 ± 0.364
3.614AspVal: 3.614 ± 0.474
1.506AspTrp: 1.506 ± 0.266
1.506AspTyr: 1.506 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
6.264GluAla: 6.264 ± 0.705
0.843GluCys: 0.843 ± 0.209
2.891GluAsp: 2.891 ± 0.42
3.192GluGlu: 3.192 ± 0.532
2.289GluPhe: 2.289 ± 0.317
4.457GluGly: 4.457 ± 0.626
1.385GluHis: 1.385 ± 0.319
3.794GluIle: 3.794 ± 0.436
3.313GluLys: 3.313 ± 0.441
5.782GluLeu: 5.782 ± 0.665
1.626GluMet: 1.626 ± 0.293
2.048GluAsn: 2.048 ± 0.287
1.807GluPro: 1.807 ± 0.342
2.048GluGln: 2.048 ± 0.444
4.698GluArg: 4.698 ± 0.542
3.072GluSer: 3.072 ± 0.46
3.794GluThr: 3.794 ± 0.463
4.096GluVal: 4.096 ± 0.584
0.903GluTrp: 0.903 ± 0.234
1.385GluTyr: 1.385 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
3.192PheAla: 3.192 ± 0.45
0.663PheCys: 0.663 ± 0.201
2.469PheAsp: 2.469 ± 0.374
2.048PheGlu: 2.048 ± 0.324
0.964PhePhe: 0.964 ± 0.209
3.313PheGly: 3.313 ± 0.484
0.542PheHis: 0.542 ± 0.16
2.168PheIle: 2.168 ± 0.322
2.048PheLys: 2.048 ± 0.337
2.831PheLeu: 2.831 ± 0.336
1.325PheMet: 1.325 ± 0.298
1.747PheAsn: 1.747 ± 0.349
2.168PhePro: 2.168 ± 0.27
1.084PheGln: 1.084 ± 0.239
2.349PheArg: 2.349 ± 0.389
1.988PheSer: 1.988 ± 0.37
2.469PheThr: 2.469 ± 0.391
2.229PheVal: 2.229 ± 0.433
0.542PheTrp: 0.542 ± 0.173
0.964PheTyr: 0.964 ± 0.2
0.0PheXaa: 0.0 ± 0.0
Gly
6.083GlyAla: 6.083 ± 0.765
1.084GlyCys: 1.084 ± 0.263
4.698GlyAsp: 4.698 ± 0.549
4.879GlyGlu: 4.879 ± 0.488
3.855GlyPhe: 3.855 ± 0.436
6.384GlyGly: 6.384 ± 1.025
1.446GlyHis: 1.446 ± 0.352
4.457GlyIle: 4.457 ± 0.438
5.3GlyLys: 5.3 ± 0.745
6.204GlyLeu: 6.204 ± 0.58
2.289GlyMet: 2.289 ± 0.361
2.831GlyAsn: 2.831 ± 0.533
2.289GlyPro: 2.289 ± 0.378
3.192GlyGln: 3.192 ± 0.492
4.096GlyArg: 4.096 ± 0.522
3.614GlySer: 3.614 ± 0.551
4.337GlyThr: 4.337 ± 0.463
5.722GlyVal: 5.722 ± 0.794
1.265GlyTrp: 1.265 ± 0.278
2.53GlyTyr: 2.53 ± 0.403
0.0GlyXaa: 0.0 ± 0.0
His
1.325HisAla: 1.325 ± 0.258
0.301HisCys: 0.301 ± 0.129
0.964HisAsp: 0.964 ± 0.293
1.024HisGlu: 1.024 ± 0.201
0.843HisPhe: 0.843 ± 0.235
1.566HisGly: 1.566 ± 0.401
0.422HisHis: 0.422 ± 0.194
0.964HisIle: 0.964 ± 0.259
1.024HisLys: 1.024 ± 0.226
1.506HisLeu: 1.506 ± 0.331
0.602HisMet: 0.602 ± 0.17
0.361HisAsn: 0.361 ± 0.136
1.144HisPro: 1.144 ± 0.328
0.361HisGln: 0.361 ± 0.138
1.566HisArg: 1.566 ± 0.312
1.144HisSer: 1.144 ± 0.218
0.843HisThr: 0.843 ± 0.24
1.446HisVal: 1.446 ± 0.325
0.301HisTrp: 0.301 ± 0.11
0.783HisTyr: 0.783 ± 0.287
0.0HisXaa: 0.0 ± 0.0
Ile
6.143IleAla: 6.143 ± 0.58
0.843IleCys: 0.843 ± 0.205
4.939IleAsp: 4.939 ± 0.567
3.855IleGlu: 3.855 ± 0.449
2.349IlePhe: 2.349 ± 0.318
3.794IleGly: 3.794 ± 0.409
1.265IleHis: 1.265 ± 0.314
3.072IleIle: 3.072 ± 0.44
2.65IleLys: 2.65 ± 0.373
3.554IleLeu: 3.554 ± 0.414
1.325IleMet: 1.325 ± 0.238
2.71IleAsn: 2.71 ± 0.354
3.132IlePro: 3.132 ± 0.511
2.048IleGln: 2.048 ± 0.42
3.915IleArg: 3.915 ± 0.504
2.951IleSer: 2.951 ± 0.449
4.035IleThr: 4.035 ± 0.414
3.192IleVal: 3.192 ± 0.368
0.602IleTrp: 0.602 ± 0.194
1.265IleTyr: 1.265 ± 0.255
0.0IleXaa: 0.0 ± 0.0
Lys
7.469LysAla: 7.469 ± 0.928
0.482LysCys: 0.482 ± 0.182
3.313LysAsp: 3.313 ± 0.624
3.012LysGlu: 3.012 ± 0.431
1.807LysPhe: 1.807 ± 0.354
3.975LysGly: 3.975 ± 0.576
0.964LysHis: 0.964 ± 0.29
3.252LysIle: 3.252 ± 0.447
2.831LysLys: 2.831 ± 0.374
3.975LysLeu: 3.975 ± 0.447
1.205LysMet: 1.205 ± 0.282
1.747LysAsn: 1.747 ± 0.323
2.048LysPro: 2.048 ± 0.4
2.108LysGln: 2.108 ± 0.277
2.71LysArg: 2.71 ± 0.403
3.072LysSer: 3.072 ± 0.41
3.554LysThr: 3.554 ± 0.414
4.096LysVal: 4.096 ± 0.492
0.903LysTrp: 0.903 ± 0.198
1.566LysTyr: 1.566 ± 0.294
0.0LysXaa: 0.0 ± 0.0
Leu
9.396LeuAla: 9.396 ± 0.736
0.723LeuCys: 0.723 ± 0.266
5.059LeuAsp: 5.059 ± 0.601
4.577LeuGlu: 4.577 ± 0.644
1.867LeuPhe: 1.867 ± 0.367
6.806LeuGly: 6.806 ± 0.87
1.144LeuHis: 1.144 ± 0.201
4.517LeuIle: 4.517 ± 0.516
4.457LeuLys: 4.457 ± 0.567
6.987LeuLeu: 6.987 ± 0.736
2.229LeuMet: 2.229 ± 0.394
2.831LeuAsn: 2.831 ± 0.363
3.614LeuPro: 3.614 ± 0.488
2.469LeuGln: 2.469 ± 0.329
4.939LeuArg: 4.939 ± 0.451
5.18LeuSer: 5.18 ± 0.529
5.722LeuThr: 5.722 ± 0.668
4.397LeuVal: 4.397 ± 0.509
1.024LeuTrp: 1.024 ± 0.222
2.048LeuTyr: 2.048 ± 0.391
0.0LeuXaa: 0.0 ± 0.0
Met
1.807MetAla: 1.807 ± 0.401
0.241MetCys: 0.241 ± 0.122
1.144MetAsp: 1.144 ± 0.248
0.903MetGlu: 0.903 ± 0.183
0.783MetPhe: 0.783 ± 0.228
1.566MetGly: 1.566 ± 0.29
0.482MetHis: 0.482 ± 0.152
1.988MetIle: 1.988 ± 0.34
1.506MetLys: 1.506 ± 0.337
2.229MetLeu: 2.229 ± 0.343
0.422MetMet: 0.422 ± 0.19
1.024MetAsn: 1.024 ± 0.22
1.686MetPro: 1.686 ± 0.315
0.964MetGln: 0.964 ± 0.253
1.807MetArg: 1.807 ± 0.322
1.807MetSer: 1.807 ± 0.379
2.71MetThr: 2.71 ± 0.421
1.747MetVal: 1.747 ± 0.31
0.361MetTrp: 0.361 ± 0.184
0.663MetTyr: 0.663 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
4.879AsnAla: 4.879 ± 0.562
0.422AsnCys: 0.422 ± 0.203
2.048AsnAsp: 2.048 ± 0.343
1.807AsnGlu: 1.807 ± 0.3
2.108AsnPhe: 2.108 ± 0.324
3.855AsnGly: 3.855 ± 0.564
0.964AsnHis: 0.964 ± 0.266
1.927AsnIle: 1.927 ± 0.366
1.807AsnLys: 1.807 ± 0.366
2.289AsnLeu: 2.289 ± 0.381
1.325AsnMet: 1.325 ± 0.277
1.446AsnAsn: 1.446 ± 0.247
2.349AsnPro: 2.349 ± 0.413
1.144AsnGln: 1.144 ± 0.234
2.048AsnArg: 2.048 ± 0.359
2.108AsnSer: 2.108 ± 0.353
2.229AsnThr: 2.229 ± 0.361
2.59AsnVal: 2.59 ± 0.316
0.903AsnTrp: 0.903 ± 0.261
1.506AsnTyr: 1.506 ± 0.342
0.0AsnXaa: 0.0 ± 0.0
Pro
4.758ProAla: 4.758 ± 0.524
0.723ProCys: 0.723 ± 0.193
3.252ProAsp: 3.252 ± 0.441
3.252ProGlu: 3.252 ± 0.492
1.927ProPhe: 1.927 ± 0.313
3.252ProGly: 3.252 ± 0.373
0.843ProHis: 0.843 ± 0.226
2.469ProIle: 2.469 ± 0.392
1.867ProLys: 1.867 ± 0.293
3.012ProLeu: 3.012 ± 0.402
0.783ProMet: 0.783 ± 0.201
2.469ProAsn: 2.469 ± 0.359
2.108ProPro: 2.108 ± 0.324
1.747ProGln: 1.747 ± 0.421
2.349ProArg: 2.349 ± 0.449
2.71ProSer: 2.71 ± 0.491
3.012ProThr: 3.012 ± 0.524
3.674ProVal: 3.674 ± 0.452
0.422ProTrp: 0.422 ± 0.162
1.084ProTyr: 1.084 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
4.216GlnAla: 4.216 ± 0.59
0.422GlnCys: 0.422 ± 0.172
1.446GlnAsp: 1.446 ± 0.335
1.686GlnGlu: 1.686 ± 0.301
1.084GlnPhe: 1.084 ± 0.3
2.409GlnGly: 2.409 ± 0.396
1.024GlnHis: 1.024 ± 0.221
2.59GlnIle: 2.59 ± 0.316
1.084GlnLys: 1.084 ± 0.223
2.771GlnLeu: 2.771 ± 0.436
1.084GlnMet: 1.084 ± 0.238
1.626GlnAsn: 1.626 ± 0.313
1.927GlnPro: 1.927 ± 0.528
2.229GlnGln: 2.229 ± 0.59
2.71GlnArg: 2.71 ± 0.347
2.771GlnSer: 2.771 ± 0.374
2.048GlnThr: 2.048 ± 0.374
2.59GlnVal: 2.59 ± 0.321
0.783GlnTrp: 0.783 ± 0.218
0.964GlnTyr: 0.964 ± 0.281
0.0GlnXaa: 0.0 ± 0.0
Arg
5.3ArgAla: 5.3 ± 0.494
0.542ArgCys: 0.542 ± 0.163
3.433ArgAsp: 3.433 ± 0.368
3.072ArgGlu: 3.072 ± 0.545
2.108ArgPhe: 2.108 ± 0.321
3.313ArgGly: 3.313 ± 0.48
1.325ArgHis: 1.325 ± 0.288
3.554ArgIle: 3.554 ± 0.393
4.096ArgLys: 4.096 ± 0.619
4.156ArgLeu: 4.156 ± 0.602
1.686ArgMet: 1.686 ± 0.379
2.53ArgAsn: 2.53 ± 0.434
3.614ArgPro: 3.614 ± 0.517
2.771ArgGln: 2.771 ± 0.497
3.252ArgArg: 3.252 ± 0.624
2.951ArgSer: 2.951 ± 0.427
2.53ArgThr: 2.53 ± 0.468
3.554ArgVal: 3.554 ± 0.447
1.626ArgTrp: 1.626 ± 0.337
2.349ArgTyr: 2.349 ± 0.419
0.0ArgXaa: 0.0 ± 0.0
Ser
5.903SerAla: 5.903 ± 0.552
0.843SerCys: 0.843 ± 0.244
3.855SerAsp: 3.855 ± 0.485
3.855SerGlu: 3.855 ± 0.482
2.229SerPhe: 2.229 ± 0.393
4.216SerGly: 4.216 ± 0.583
0.663SerHis: 0.663 ± 0.198
3.072SerIle: 3.072 ± 0.575
3.072SerLys: 3.072 ± 0.454
4.818SerLeu: 4.818 ± 0.558
1.024SerMet: 1.024 ± 0.23
2.469SerAsn: 2.469 ± 0.419
2.409SerPro: 2.409 ± 0.392
1.325SerGln: 1.325 ± 0.279
3.192SerArg: 3.192 ± 0.507
3.373SerSer: 3.373 ± 0.412
4.096SerThr: 4.096 ± 0.624
4.397SerVal: 4.397 ± 0.416
0.542SerTrp: 0.542 ± 0.199
1.867SerTyr: 1.867 ± 0.339
0.0SerXaa: 0.0 ± 0.0
Thr
5.903ThrAla: 5.903 ± 0.578
0.542ThrCys: 0.542 ± 0.17
4.337ThrAsp: 4.337 ± 0.58
3.734ThrGlu: 3.734 ± 0.404
2.289ThrPhe: 2.289 ± 0.3
5.782ThrGly: 5.782 ± 0.617
0.542ThrHis: 0.542 ± 0.187
3.373ThrIle: 3.373 ± 0.499
2.53ThrLys: 2.53 ± 0.345
5.059ThrLeu: 5.059 ± 0.571
0.843ThrMet: 0.843 ± 0.217
2.59ThrAsn: 2.59 ± 0.427
3.373ThrPro: 3.373 ± 0.391
2.469ThrGln: 2.469 ± 0.387
3.012ThrArg: 3.012 ± 0.442
3.975ThrSer: 3.975 ± 0.45
3.433ThrThr: 3.433 ± 0.546
4.577ThrVal: 4.577 ± 0.544
1.566ThrTrp: 1.566 ± 0.323
1.988ThrTyr: 1.988 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
7.469ValAla: 7.469 ± 0.73
0.723ValCys: 0.723 ± 0.201
3.192ValAsp: 3.192 ± 0.39
4.216ValGlu: 4.216 ± 0.499
2.349ValPhe: 2.349 ± 0.351
3.493ValGly: 3.493 ± 0.372
1.084ValHis: 1.084 ± 0.272
3.614ValIle: 3.614 ± 0.413
4.457ValLys: 4.457 ± 0.544
5.722ValLeu: 5.722 ± 0.638
2.048ValMet: 2.048 ± 0.312
3.072ValAsn: 3.072 ± 0.417
2.771ValPro: 2.771 ± 0.37
3.373ValGln: 3.373 ± 0.4
2.891ValArg: 2.891 ± 0.422
3.915ValSer: 3.915 ± 0.434
4.818ValThr: 4.818 ± 0.485
4.638ValVal: 4.638 ± 0.528
0.964ValTrp: 0.964 ± 0.233
2.349ValTyr: 2.349 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
2.53TrpAla: 2.53 ± 0.437
0.12TrpCys: 0.12 ± 0.077
0.903TrpAsp: 0.903 ± 0.3
0.964TrpGlu: 0.964 ± 0.271
0.482TrpPhe: 0.482 ± 0.161
1.024TrpGly: 1.024 ± 0.215
0.482TrpHis: 0.482 ± 0.165
1.144TrpIle: 1.144 ± 0.279
0.602TrpLys: 0.602 ± 0.246
1.084TrpLeu: 1.084 ± 0.209
0.241TrpMet: 0.241 ± 0.128
1.024TrpAsn: 1.024 ± 0.227
0.422TrpPro: 0.422 ± 0.16
0.542TrpGln: 0.542 ± 0.2
1.265TrpArg: 1.265 ± 0.276
1.024TrpSer: 1.024 ± 0.277
0.723TrpThr: 0.723 ± 0.2
1.144TrpVal: 1.144 ± 0.228
0.06TrpTrp: 0.06 ± 0.051
0.422TrpTyr: 0.422 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.132TyrAla: 3.132 ± 0.385
0.241TyrCys: 0.241 ± 0.11
1.747TyrAsp: 1.747 ± 0.313
1.325TyrGlu: 1.325 ± 0.267
1.024TyrPhe: 1.024 ± 0.231
2.289TyrGly: 2.289 ± 0.396
0.663TyrHis: 0.663 ± 0.21
1.626TyrIle: 1.626 ± 0.37
1.325TyrLys: 1.325 ± 0.216
2.108TyrLeu: 2.108 ± 0.414
0.964TyrMet: 0.964 ± 0.236
0.903TyrAsn: 0.903 ± 0.239
1.385TyrPro: 1.385 ± 0.357
1.084TyrGln: 1.084 ± 0.244
1.988TyrArg: 1.988 ± 0.314
2.349TyrSer: 2.349 ± 0.425
1.807TyrThr: 1.807 ± 0.324
2.048TyrVal: 2.048 ± 0.441
0.542TyrTrp: 0.542 ± 0.164
0.903TyrTyr: 0.903 ± 0.302
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (16604 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski