Amino acid dipepetide frequency for Arthrobacter phage Elsa

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.601AlaAla: 7.601 ± 1.513
0.109AlaCys: 0.109 ± 0.08
5.468AlaAsp: 5.468 ± 0.556
5.304AlaGlu: 5.304 ± 0.594
3.664AlaPhe: 3.664 ± 0.501
5.195AlaGly: 5.195 ± 0.897
1.586AlaHis: 1.586 ± 0.293
5.086AlaIle: 5.086 ± 0.849
5.961AlaLys: 5.961 ± 0.708
7.328AlaLeu: 7.328 ± 0.904
3.062AlaMet: 3.062 ± 0.636
3.5AlaAsn: 3.5 ± 0.397
2.242AlaPro: 2.242 ± 0.371
3.172AlaGln: 3.172 ± 0.491
4.047AlaArg: 4.047 ± 0.568
4.867AlaSer: 4.867 ± 0.582
4.32AlaThr: 4.32 ± 0.436
5.961AlaVal: 5.961 ± 0.776
0.766AlaTrp: 0.766 ± 0.214
2.679AlaTyr: 2.679 ± 0.321
0.0AlaXaa: 0.0 ± 0.0
Cys
0.437CysAla: 0.437 ± 0.172
0.055CysCys: 0.055 ± 0.055
0.82CysAsp: 0.82 ± 0.193
0.383CysGlu: 0.383 ± 0.134
0.0CysPhe: 0.0 ± 0.0
0.547CysGly: 0.547 ± 0.216
0.328CysHis: 0.328 ± 0.146
0.273CysIle: 0.273 ± 0.115
0.383CysLys: 0.383 ± 0.13
0.547CysLeu: 0.547 ± 0.162
0.109CysMet: 0.109 ± 0.084
0.437CysAsn: 0.437 ± 0.133
0.383CysPro: 0.383 ± 0.123
0.219CysGln: 0.219 ± 0.113
0.164CysArg: 0.164 ± 0.086
0.437CysSer: 0.437 ± 0.135
0.219CysThr: 0.219 ± 0.129
0.164CysVal: 0.164 ± 0.091
0.164CysTrp: 0.164 ± 0.097
0.219CysTyr: 0.219 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
5.468AspAla: 5.468 ± 0.431
0.383AspCys: 0.383 ± 0.169
3.226AspAsp: 3.226 ± 0.512
6.234AspGlu: 6.234 ± 0.643
3.664AspPhe: 3.664 ± 0.465
4.812AspGly: 4.812 ± 0.497
1.203AspHis: 1.203 ± 0.258
4.101AspIle: 4.101 ± 0.455
2.789AspLys: 2.789 ± 0.412
5.086AspLeu: 5.086 ± 0.723
2.187AspMet: 2.187 ± 0.383
2.625AspAsn: 2.625 ± 0.46
2.844AspPro: 2.844 ± 0.484
2.187AspGln: 2.187 ± 0.306
3.445AspArg: 3.445 ± 0.497
3.445AspSer: 3.445 ± 0.426
3.062AspThr: 3.062 ± 0.397
4.047AspVal: 4.047 ± 0.469
0.984AspTrp: 0.984 ± 0.266
1.969AspTyr: 1.969 ± 0.324
0.0AspXaa: 0.0 ± 0.0
Glu
5.414GluAla: 5.414 ± 0.63
0.492GluCys: 0.492 ± 0.192
4.867GluAsp: 4.867 ± 0.568
6.671GluGlu: 6.671 ± 0.732
2.351GluPhe: 2.351 ± 0.326
3.883GluGly: 3.883 ± 0.456
1.258GluHis: 1.258 ± 0.285
4.265GluIle: 4.265 ± 0.492
4.757GluLys: 4.757 ± 0.542
5.414GluLeu: 5.414 ± 0.564
2.187GluMet: 2.187 ± 0.329
3.445GluAsn: 3.445 ± 0.445
1.586GluPro: 1.586 ± 0.312
3.281GluGln: 3.281 ± 0.331
4.375GluArg: 4.375 ± 0.605
3.336GluSer: 3.336 ± 0.395
3.937GluThr: 3.937 ± 0.586
4.976GluVal: 4.976 ± 0.549
1.476GluTrp: 1.476 ± 0.324
3.281GluTyr: 3.281 ± 0.385
0.0GluXaa: 0.0 ± 0.0
Phe
2.297PheAla: 2.297 ± 0.428
0.492PheCys: 0.492 ± 0.192
2.734PheAsp: 2.734 ± 0.337
3.008PheGlu: 3.008 ± 0.354
1.75PhePhe: 1.75 ± 0.297
2.789PheGly: 2.789 ± 0.415
0.492PheHis: 0.492 ± 0.15
2.133PheIle: 2.133 ± 0.35
3.062PheLys: 3.062 ± 0.431
2.461PheLeu: 2.461 ± 0.326
0.602PheMet: 0.602 ± 0.179
2.461PheAsn: 2.461 ± 0.331
1.641PhePro: 1.641 ± 0.333
1.203PheGln: 1.203 ± 0.225
1.75PheArg: 1.75 ± 0.42
2.297PheSer: 2.297 ± 0.389
2.515PheThr: 2.515 ± 0.466
2.898PheVal: 2.898 ± 0.433
0.547PheTrp: 0.547 ± 0.201
1.422PheTyr: 1.422 ± 0.292
0.0PheXaa: 0.0 ± 0.0
Gly
5.742GlyAla: 5.742 ± 1.043
0.602GlyCys: 0.602 ± 0.198
4.648GlyAsp: 4.648 ± 0.475
4.32GlyGlu: 4.32 ± 0.364
3.226GlyPhe: 3.226 ± 0.562
4.156GlyGly: 4.156 ± 0.739
1.094GlyHis: 1.094 ± 0.261
4.757GlyIle: 4.757 ± 0.675
3.062GlyLys: 3.062 ± 0.519
6.234GlyLeu: 6.234 ± 1.048
1.914GlyMet: 1.914 ± 0.504
2.679GlyAsn: 2.679 ± 0.459
1.805GlyPro: 1.805 ± 0.265
1.859GlyGln: 1.859 ± 0.369
3.39GlyArg: 3.39 ± 0.429
3.773GlySer: 3.773 ± 0.479
4.812GlyThr: 4.812 ± 0.443
5.906GlyVal: 5.906 ± 0.652
1.312GlyTrp: 1.312 ± 0.255
2.625GlyTyr: 2.625 ± 0.446
0.0GlyXaa: 0.0 ± 0.0
His
1.312HisAla: 1.312 ± 0.326
0.055HisCys: 0.055 ± 0.057
1.039HisAsp: 1.039 ± 0.227
1.258HisGlu: 1.258 ± 0.272
0.492HisPhe: 0.492 ± 0.163
1.203HisGly: 1.203 ± 0.277
0.656HisHis: 0.656 ± 0.185
1.531HisIle: 1.531 ± 0.287
1.148HisLys: 1.148 ± 0.244
1.75HisLeu: 1.75 ± 0.303
0.273HisMet: 0.273 ± 0.101
0.492HisAsn: 0.492 ± 0.162
0.93HisPro: 0.93 ± 0.233
0.656HisGln: 0.656 ± 0.231
0.93HisArg: 0.93 ± 0.228
1.203HisSer: 1.203 ± 0.288
0.984HisThr: 0.984 ± 0.232
1.476HisVal: 1.476 ± 0.381
0.383HisTrp: 0.383 ± 0.148
1.367HisTyr: 1.367 ± 0.288
0.0HisXaa: 0.0 ± 0.0
Ile
5.851IleAla: 5.851 ± 0.855
0.219IleCys: 0.219 ± 0.118
4.812IleAsp: 4.812 ± 0.542
4.703IleGlu: 4.703 ± 0.475
1.914IlePhe: 1.914 ± 0.354
4.812IleGly: 4.812 ± 1.085
1.039IleHis: 1.039 ± 0.246
3.718IleIle: 3.718 ± 0.494
3.992IleLys: 3.992 ± 0.511
4.757IleLeu: 4.757 ± 0.462
1.695IleMet: 1.695 ± 0.434
3.062IleAsn: 3.062 ± 0.433
2.679IlePro: 2.679 ± 0.334
1.859IleGln: 1.859 ± 0.323
3.718IleArg: 3.718 ± 0.475
2.679IleSer: 2.679 ± 0.313
4.156IleThr: 4.156 ± 0.409
4.703IleVal: 4.703 ± 0.475
0.82IleTrp: 0.82 ± 0.219
1.805IleTyr: 1.805 ± 0.285
0.0IleXaa: 0.0 ± 0.0
Lys
5.359LysAla: 5.359 ± 0.546
0.383LysCys: 0.383 ± 0.148
3.281LysAsp: 3.281 ± 0.462
4.867LysGlu: 4.867 ± 0.531
2.625LysPhe: 2.625 ± 0.509
3.281LysGly: 3.281 ± 0.562
1.805LysHis: 1.805 ± 0.387
3.828LysIle: 3.828 ± 0.516
4.539LysLys: 4.539 ± 0.559
6.125LysLeu: 6.125 ± 0.636
1.969LysMet: 1.969 ± 0.268
3.008LysAsn: 3.008 ± 0.429
2.461LysPro: 2.461 ± 0.386
2.297LysGln: 2.297 ± 0.381
4.156LysArg: 4.156 ± 0.591
3.828LysSer: 3.828 ± 0.507
3.664LysThr: 3.664 ± 0.429
4.593LysVal: 4.593 ± 0.493
0.82LysTrp: 0.82 ± 0.199
2.351LysTyr: 2.351 ± 0.46
0.0LysXaa: 0.0 ± 0.0
Leu
6.89LeuAla: 6.89 ± 0.786
0.656LeuCys: 0.656 ± 0.196
5.14LeuAsp: 5.14 ± 0.482
5.906LeuGlu: 5.906 ± 0.601
2.679LeuPhe: 2.679 ± 0.54
5.961LeuGly: 5.961 ± 0.888
1.641LeuHis: 1.641 ± 0.291
5.632LeuIle: 5.632 ± 0.718
5.414LeuLys: 5.414 ± 0.457
6.179LeuLeu: 6.179 ± 0.831
1.531LeuMet: 1.531 ± 0.276
3.883LeuAsn: 3.883 ± 0.425
2.734LeuPro: 2.734 ± 0.329
1.641LeuGln: 1.641 ± 0.351
4.211LeuArg: 4.211 ± 0.529
5.906LeuSer: 5.906 ± 0.607
4.867LeuThr: 4.867 ± 0.73
5.523LeuVal: 5.523 ± 0.744
1.367LeuTrp: 1.367 ± 0.246
2.57LeuTyr: 2.57 ± 0.482
0.0LeuXaa: 0.0 ± 0.0
Met
2.625MetAla: 2.625 ± 0.508
0.219MetCys: 0.219 ± 0.114
1.641MetAsp: 1.641 ± 0.295
1.75MetGlu: 1.75 ± 0.409
1.203MetPhe: 1.203 ± 0.302
1.859MetGly: 1.859 ± 0.384
0.492MetHis: 0.492 ± 0.156
1.312MetIle: 1.312 ± 0.332
1.75MetLys: 1.75 ± 0.375
1.859MetLeu: 1.859 ± 0.338
0.547MetMet: 0.547 ± 0.158
1.586MetAsn: 1.586 ± 0.267
0.82MetPro: 0.82 ± 0.17
0.875MetGln: 0.875 ± 0.23
1.367MetArg: 1.367 ± 0.264
2.297MetSer: 2.297 ± 0.423
2.187MetThr: 2.187 ± 0.338
1.586MetVal: 1.586 ± 0.3
0.219MetTrp: 0.219 ± 0.095
0.766MetTyr: 0.766 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.992AsnAla: 3.992 ± 0.38
0.055AsnCys: 0.055 ± 0.049
3.062AsnAsp: 3.062 ± 0.509
2.789AsnGlu: 2.789 ± 0.352
1.969AsnPhe: 1.969 ± 0.27
3.773AsnGly: 3.773 ± 0.43
0.984AsnHis: 0.984 ± 0.236
3.172AsnIle: 3.172 ± 0.431
2.187AsnLys: 2.187 ± 0.321
4.375AsnLeu: 4.375 ± 0.417
1.641AsnMet: 1.641 ± 0.301
2.133AsnAsn: 2.133 ± 0.378
2.734AsnPro: 2.734 ± 0.471
1.859AsnGln: 1.859 ± 0.265
1.969AsnArg: 1.969 ± 0.355
2.679AsnSer: 2.679 ± 0.312
2.734AsnThr: 2.734 ± 0.392
2.789AsnVal: 2.789 ± 0.378
0.766AsnTrp: 0.766 ± 0.287
1.969AsnTyr: 1.969 ± 0.308
0.0AsnXaa: 0.0 ± 0.0
Pro
2.898ProAla: 2.898 ± 0.35
0.219ProCys: 0.219 ± 0.112
2.242ProAsp: 2.242 ± 0.374
4.265ProGlu: 4.265 ± 0.582
1.258ProPhe: 1.258 ± 0.253
3.008ProGly: 3.008 ± 0.342
0.547ProHis: 0.547 ± 0.179
2.133ProIle: 2.133 ± 0.244
2.515ProLys: 2.515 ± 0.351
2.187ProLeu: 2.187 ± 0.324
0.766ProMet: 0.766 ± 0.206
2.023ProAsn: 2.023 ± 0.325
1.805ProPro: 1.805 ± 0.363
0.984ProGln: 0.984 ± 0.245
1.859ProArg: 1.859 ± 0.395
2.734ProSer: 2.734 ± 0.368
2.242ProThr: 2.242 ± 0.362
2.734ProVal: 2.734 ± 0.396
0.328ProTrp: 0.328 ± 0.164
1.476ProTyr: 1.476 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
3.062GlnAla: 3.062 ± 0.504
0.273GlnCys: 0.273 ± 0.136
1.75GlnAsp: 1.75 ± 0.33
1.531GlnGlu: 1.531 ± 0.291
1.367GlnPhe: 1.367 ± 0.273
2.515GlnGly: 2.515 ± 0.363
0.328GlnHis: 0.328 ± 0.135
2.625GlnIle: 2.625 ± 0.358
2.406GlnLys: 2.406 ± 0.39
3.609GlnLeu: 3.609 ± 0.449
0.766GlnMet: 0.766 ± 0.185
1.367GlnAsn: 1.367 ± 0.29
1.203GlnPro: 1.203 ± 0.26
1.367GlnGln: 1.367 ± 0.277
1.586GlnArg: 1.586 ± 0.265
1.586GlnSer: 1.586 ± 0.322
1.859GlnThr: 1.859 ± 0.348
2.406GlnVal: 2.406 ± 0.308
0.492GlnTrp: 0.492 ± 0.172
1.312GlnTyr: 1.312 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
2.953ArgAla: 2.953 ± 0.466
0.383ArgCys: 0.383 ± 0.151
2.953ArgAsp: 2.953 ± 0.339
3.39ArgGlu: 3.39 ± 0.432
1.641ArgPhe: 1.641 ± 0.316
2.242ArgGly: 2.242 ± 0.385
1.367ArgHis: 1.367 ± 0.346
3.445ArgIle: 3.445 ± 0.437
4.922ArgLys: 4.922 ± 0.701
3.609ArgLeu: 3.609 ± 0.413
1.148ArgMet: 1.148 ± 0.241
3.172ArgAsn: 3.172 ± 0.336
1.641ArgPro: 1.641 ± 0.239
1.914ArgGln: 1.914 ± 0.311
3.664ArgArg: 3.664 ± 0.553
4.429ArgSer: 4.429 ± 0.54
3.336ArgThr: 3.336 ± 0.453
3.554ArgVal: 3.554 ± 0.53
0.328ArgTrp: 0.328 ± 0.133
1.969ArgTyr: 1.969 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
4.757SerAla: 4.757 ± 0.568
0.164SerCys: 0.164 ± 0.086
3.554SerAsp: 3.554 ± 0.525
3.828SerGlu: 3.828 ± 0.457
2.133SerPhe: 2.133 ± 0.374
4.976SerGly: 4.976 ± 0.49
1.039SerHis: 1.039 ± 0.263
4.484SerIle: 4.484 ± 0.439
4.32SerLys: 4.32 ± 0.357
4.922SerLeu: 4.922 ± 0.494
1.586SerMet: 1.586 ± 0.246
3.609SerAsn: 3.609 ± 0.379
2.461SerPro: 2.461 ± 0.308
2.297SerGln: 2.297 ± 0.352
2.844SerArg: 2.844 ± 0.387
4.703SerSer: 4.703 ± 0.63
3.5SerThr: 3.5 ± 0.442
3.281SerVal: 3.281 ± 0.41
1.367SerTrp: 1.367 ± 0.223
2.351SerTyr: 2.351 ± 0.443
0.0SerXaa: 0.0 ± 0.0
Thr
5.578ThrAla: 5.578 ± 0.734
0.492ThrCys: 0.492 ± 0.161
3.828ThrAsp: 3.828 ± 0.414
3.883ThrGlu: 3.883 ± 0.575
2.133ThrPhe: 2.133 ± 0.305
3.992ThrGly: 3.992 ± 0.357
1.312ThrHis: 1.312 ± 0.307
3.5ThrIle: 3.5 ± 0.375
3.828ThrLys: 3.828 ± 0.396
4.32ThrLeu: 4.32 ± 0.688
0.984ThrMet: 0.984 ± 0.263
2.789ThrAsn: 2.789 ± 0.406
4.265ThrPro: 4.265 ± 0.574
2.242ThrGln: 2.242 ± 0.378
2.242ThrArg: 2.242 ± 0.394
3.718ThrSer: 3.718 ± 0.447
3.718ThrThr: 3.718 ± 0.449
3.828ThrVal: 3.828 ± 0.536
0.766ThrTrp: 0.766 ± 0.159
2.844ThrTyr: 2.844 ± 0.372
0.0ThrXaa: 0.0 ± 0.0
Val
5.578ValAla: 5.578 ± 0.656
0.273ValCys: 0.273 ± 0.102
5.031ValAsp: 5.031 ± 0.482
4.101ValGlu: 4.101 ± 0.598
2.297ValPhe: 2.297 ± 0.442
5.195ValGly: 5.195 ± 0.519
1.039ValHis: 1.039 ± 0.242
4.757ValIle: 4.757 ± 0.575
4.648ValLys: 4.648 ± 0.481
5.468ValLeu: 5.468 ± 0.636
2.406ValMet: 2.406 ± 0.312
2.953ValAsn: 2.953 ± 0.348
2.187ValPro: 2.187 ± 0.366
2.023ValGln: 2.023 ± 0.226
3.336ValArg: 3.336 ± 0.448
4.703ValSer: 4.703 ± 0.497
4.867ValThr: 4.867 ± 0.54
5.359ValVal: 5.359 ± 0.501
0.711ValTrp: 0.711 ± 0.216
2.898ValTyr: 2.898 ± 0.457
0.0ValXaa: 0.0 ± 0.0
Trp
0.711TrpAla: 0.711 ± 0.144
0.328TrpCys: 0.328 ± 0.124
0.711TrpAsp: 0.711 ± 0.25
0.93TrpGlu: 0.93 ± 0.196
0.547TrpPhe: 0.547 ± 0.209
0.875TrpGly: 0.875 ± 0.187
0.273TrpHis: 0.273 ± 0.11
0.437TrpIle: 0.437 ± 0.151
1.148TrpLys: 1.148 ± 0.231
1.367TrpLeu: 1.367 ± 0.317
0.273TrpMet: 0.273 ± 0.114
0.82TrpAsn: 0.82 ± 0.25
0.437TrpPro: 0.437 ± 0.141
0.437TrpGln: 0.437 ± 0.163
1.148TrpArg: 1.148 ± 0.37
0.766TrpSer: 0.766 ± 0.245
1.148TrpThr: 1.148 ± 0.195
1.039TrpVal: 1.039 ± 0.281
0.219TrpTrp: 0.219 ± 0.113
0.547TrpTyr: 0.547 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.39TyrAla: 3.39 ± 0.396
0.437TyrCys: 0.437 ± 0.172
2.953TyrAsp: 2.953 ± 0.55
2.133TyrGlu: 2.133 ± 0.294
1.641TyrPhe: 1.641 ± 0.337
2.898TyrGly: 2.898 ± 0.472
0.656TyrHis: 0.656 ± 0.172
1.805TyrIle: 1.805 ± 0.321
2.406TyrLys: 2.406 ± 0.45
2.734TyrLeu: 2.734 ± 0.426
1.258TyrMet: 1.258 ± 0.298
1.586TyrAsn: 1.586 ± 0.282
1.367TyrPro: 1.367 ± 0.244
1.039TyrGln: 1.039 ± 0.243
1.859TyrArg: 1.859 ± 0.335
2.789TyrSer: 2.789 ± 0.311
2.133TyrThr: 2.133 ± 0.335
3.008TyrVal: 3.008 ± 0.425
0.328TyrTrp: 0.328 ± 0.134
1.476TyrTyr: 1.476 ± 0.371
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (18288 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski