Amino acid dipepetide frequency for Aeromonas phage 4_4572

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.369AlaAla: 5.369 ± 0.802
0.762AlaCys: 0.762 ± 0.171
3.413AlaAsp: 3.413 ± 0.319
4.905AlaGlu: 4.905 ± 0.73
2.883AlaPhe: 2.883 ± 0.338
5.468AlaGly: 5.468 ± 0.637
0.994AlaHis: 0.994 ± 0.167
3.844AlaIle: 3.844 ± 0.333
5.137AlaLys: 5.137 ± 0.677
5.369AlaLeu: 5.369 ± 0.435
1.524AlaMet: 1.524 ± 0.282
3.48AlaAsn: 3.48 ± 0.394
2.916AlaPro: 2.916 ± 0.456
2.717AlaGln: 2.717 ± 0.37
3.679AlaArg: 3.679 ± 0.648
4.308AlaSer: 4.308 ± 0.448
4.043AlaThr: 4.043 ± 0.469
4.606AlaVal: 4.606 ± 0.513
0.862AlaTrp: 0.862 ± 0.224
1.955AlaTyr: 1.955 ± 0.232
0.0AlaXaa: 0.0 ± 0.0
Cys
0.696CysAla: 0.696 ± 0.146
0.133CysCys: 0.133 ± 0.063
0.829CysAsp: 0.829 ± 0.185
0.961CysGlu: 0.961 ± 0.242
0.431CysPhe: 0.431 ± 0.102
0.994CysGly: 0.994 ± 0.196
0.398CysHis: 0.398 ± 0.132
0.696CysIle: 0.696 ± 0.155
0.829CysLys: 0.829 ± 0.215
0.862CysLeu: 0.862 ± 0.152
0.365CysMet: 0.365 ± 0.107
0.696CysAsn: 0.696 ± 0.162
0.431CysPro: 0.431 ± 0.133
0.365CysGln: 0.365 ± 0.109
0.431CysArg: 0.431 ± 0.108
0.862CysSer: 0.862 ± 0.177
0.331CysThr: 0.331 ± 0.105
0.928CysVal: 0.928 ± 0.183
0.133CysTrp: 0.133 ± 0.059
0.497CysTyr: 0.497 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
4.209AspAla: 4.209 ± 0.423
0.762AspCys: 0.762 ± 0.164
3.181AspAsp: 3.181 ± 0.541
4.341AspGlu: 4.341 ± 0.464
3.215AspPhe: 3.215 ± 0.338
4.838AspGly: 4.838 ± 0.5
1.027AspHis: 1.027 ± 0.22
3.546AspIle: 3.546 ± 0.331
4.109AspLys: 4.109 ± 0.338
5.137AspLeu: 5.137 ± 0.411
2.088AspMet: 2.088 ± 0.268
3.082AspAsn: 3.082 ± 0.323
2.386AspPro: 2.386 ± 0.316
1.922AspGln: 1.922 ± 0.242
2.552AspArg: 2.552 ± 0.289
4.308AspSer: 4.308 ± 0.453
3.48AspThr: 3.48 ± 0.43
4.01AspVal: 4.01 ± 0.345
1.127AspTrp: 1.127 ± 0.176
2.817AspTyr: 2.817 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
5.137GluAla: 5.137 ± 0.65
0.994GluCys: 0.994 ± 0.213
4.673GluAsp: 4.673 ± 0.407
5.998GluGlu: 5.998 ± 0.82
2.983GluPhe: 2.983 ± 0.305
4.938GluGly: 4.938 ± 0.434
1.292GluHis: 1.292 ± 0.218
4.408GluIle: 4.408 ± 0.342
5.899GluLys: 5.899 ± 0.571
5.766GluLeu: 5.766 ± 0.374
2.651GluMet: 2.651 ± 0.391
2.353GluAsn: 2.353 ± 0.251
1.988GluPro: 1.988 ± 0.403
2.121GluGln: 2.121 ± 0.293
3.447GluArg: 3.447 ± 0.415
4.242GluSer: 4.242 ± 0.422
3.413GluThr: 3.413 ± 0.381
6.131GluVal: 6.131 ± 0.599
1.458GluTrp: 1.458 ± 0.239
2.751GluTyr: 2.751 ± 0.28
0.0GluXaa: 0.0 ± 0.0
Phe
2.154PheAla: 2.154 ± 0.252
0.696PheCys: 0.696 ± 0.152
2.916PheAsp: 2.916 ± 0.356
3.579PheGlu: 3.579 ± 0.33
1.856PhePhe: 1.856 ± 0.308
3.215PheGly: 3.215 ± 0.394
1.193PheHis: 1.193 ± 0.201
2.552PheIle: 2.552 ± 0.357
2.916PheLys: 2.916 ± 0.365
2.983PheLeu: 2.983 ± 0.384
1.06PheMet: 1.06 ± 0.189
2.552PheAsn: 2.552 ± 0.29
1.524PhePro: 1.524 ± 0.239
1.723PheGln: 1.723 ± 0.266
2.154PheArg: 2.154 ± 0.299
2.916PheSer: 2.916 ± 0.272
3.049PheThr: 3.049 ± 0.35
2.983PheVal: 2.983 ± 0.343
0.729PheTrp: 0.729 ± 0.162
1.524PheTyr: 1.524 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
4.507GlyAla: 4.507 ± 0.494
0.895GlyCys: 0.895 ± 0.211
3.977GlyAsp: 3.977 ± 0.329
4.573GlyGlu: 4.573 ± 0.302
3.977GlyPhe: 3.977 ± 0.407
4.872GlyGly: 4.872 ± 0.729
1.127GlyHis: 1.127 ± 0.193
4.143GlyIle: 4.143 ± 0.299
5.269GlyLys: 5.269 ± 0.515
5.369GlyLeu: 5.369 ± 0.426
2.055GlyMet: 2.055 ± 0.216
3.513GlyAsn: 3.513 ± 0.344
1.16GlyPro: 1.16 ± 0.185
2.187GlyGln: 2.187 ± 0.288
2.585GlyArg: 2.585 ± 0.227
5.468GlySer: 5.468 ± 0.456
3.877GlyThr: 3.877 ± 0.509
5.336GlyVal: 5.336 ± 0.543
1.458GlyTrp: 1.458 ± 0.202
3.082GlyTyr: 3.082 ± 0.337
0.0GlyXaa: 0.0 ± 0.0
His
0.862HisAla: 0.862 ± 0.159
0.265HisCys: 0.265 ± 0.088
1.392HisAsp: 1.392 ± 0.226
0.928HisGlu: 0.928 ± 0.179
1.127HisPhe: 1.127 ± 0.211
1.458HisGly: 1.458 ± 0.226
0.431HisHis: 0.431 ± 0.11
1.094HisIle: 1.094 ± 0.193
1.193HisLys: 1.193 ± 0.207
1.723HisLeu: 1.723 ± 0.278
0.563HisMet: 0.563 ± 0.117
0.928HisAsn: 0.928 ± 0.16
0.63HisPro: 0.63 ± 0.156
0.597HisGln: 0.597 ± 0.137
0.994HisArg: 0.994 ± 0.198
0.994HisSer: 0.994 ± 0.17
0.829HisThr: 0.829 ± 0.186
1.226HisVal: 1.226 ± 0.145
0.331HisTrp: 0.331 ± 0.107
0.961HisTyr: 0.961 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
4.242IleAla: 4.242 ± 0.368
0.53IleCys: 0.53 ± 0.134
3.513IleAsp: 3.513 ± 0.36
4.341IleGlu: 4.341 ± 0.292
1.955IlePhe: 1.955 ± 0.301
3.215IleGly: 3.215 ± 0.383
1.359IleHis: 1.359 ± 0.265
3.811IleIle: 3.811 ± 0.404
4.308IleLys: 4.308 ± 0.39
4.209IleLeu: 4.209 ± 0.344
1.16IleMet: 1.16 ± 0.176
3.082IleAsn: 3.082 ± 0.363
2.684IlePro: 2.684 ± 0.33
2.585IleGln: 2.585 ± 0.265
3.215IleArg: 3.215 ± 0.383
4.109IleSer: 4.109 ± 0.357
3.546IleThr: 3.546 ± 0.383
3.347IleVal: 3.347 ± 0.304
0.829IleTrp: 0.829 ± 0.167
1.723IleTyr: 1.723 ± 0.288
0.0IleXaa: 0.0 ± 0.0
Lys
5.965LysAla: 5.965 ± 1.058
0.795LysCys: 0.795 ± 0.18
5.07LysAsp: 5.07 ± 0.382
5.402LysGlu: 5.402 ± 0.527
2.85LysPhe: 2.85 ± 0.278
4.64LysGly: 4.64 ± 0.412
1.326LysHis: 1.326 ± 0.229
3.877LysIle: 3.877 ± 0.355
5.534LysLys: 5.534 ± 0.632
5.402LysLeu: 5.402 ± 0.427
2.353LysMet: 2.353 ± 0.28
2.949LysAsn: 2.949 ± 0.339
2.817LysPro: 2.817 ± 0.351
2.154LysGln: 2.154 ± 0.257
3.248LysArg: 3.248 ± 0.429
4.341LysSer: 4.341 ± 0.453
3.679LysThr: 3.679 ± 0.322
5.203LysVal: 5.203 ± 0.422
1.326LysTrp: 1.326 ± 0.222
2.353LysTyr: 2.353 ± 0.282
0.0LysXaa: 0.0 ± 0.0
Leu
5.07LeuAla: 5.07 ± 0.482
1.06LeuCys: 1.06 ± 0.207
4.772LeuAsp: 4.772 ± 0.384
5.766LeuGlu: 5.766 ± 0.452
2.916LeuPhe: 2.916 ± 0.363
5.468LeuGly: 5.468 ± 0.549
1.259LeuHis: 1.259 ± 0.227
4.938LeuIle: 4.938 ± 0.427
5.965LeuLys: 5.965 ± 0.46
5.634LeuLeu: 5.634 ± 0.711
2.287LeuMet: 2.287 ± 0.287
3.413LeuAsn: 3.413 ± 0.318
2.751LeuPro: 2.751 ± 0.371
2.585LeuGln: 2.585 ± 0.361
3.944LeuArg: 3.944 ± 0.461
6.197LeuSer: 6.197 ± 0.439
3.811LeuThr: 3.811 ± 0.427
4.872LeuVal: 4.872 ± 0.352
1.06LeuTrp: 1.06 ± 0.222
2.983LeuTyr: 2.983 ± 0.376
0.0LeuXaa: 0.0 ± 0.0
Met
2.055MetAla: 2.055 ± 0.253
0.365MetCys: 0.365 ± 0.129
1.69MetAsp: 1.69 ± 0.258
1.823MetGlu: 1.823 ± 0.25
1.226MetPhe: 1.226 ± 0.175
1.823MetGly: 1.823 ± 0.287
0.298MetHis: 0.298 ± 0.097
1.657MetIle: 1.657 ± 0.205
2.817MetLys: 2.817 ± 0.359
2.254MetLeu: 2.254 ± 0.291
1.027MetMet: 1.027 ± 0.241
1.889MetAsn: 1.889 ± 0.267
0.895MetPro: 0.895 ± 0.187
0.762MetGln: 0.762 ± 0.168
1.756MetArg: 1.756 ± 0.214
2.651MetSer: 2.651 ± 0.327
1.624MetThr: 1.624 ± 0.254
1.955MetVal: 1.955 ± 0.28
0.398MetTrp: 0.398 ± 0.126
1.16MetTyr: 1.16 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
3.314AsnAla: 3.314 ± 0.4
0.497AsnCys: 0.497 ± 0.115
2.519AsnAsp: 2.519 ± 0.28
2.353AsnGlu: 2.353 ± 0.29
2.419AsnPhe: 2.419 ± 0.317
3.944AsnGly: 3.944 ± 0.457
1.06AsnHis: 1.06 ± 0.203
3.248AsnIle: 3.248 ± 0.277
3.447AsnLys: 3.447 ± 0.286
4.706AsnLeu: 4.706 ± 0.413
1.458AsnMet: 1.458 ± 0.295
2.254AsnAsn: 2.254 ± 0.296
3.115AsnPro: 3.115 ± 0.326
1.79AsnGln: 1.79 ± 0.299
2.552AsnArg: 2.552 ± 0.293
3.215AsnSer: 3.215 ± 0.381
2.717AsnThr: 2.717 ± 0.313
3.314AsnVal: 3.314 ± 0.35
0.762AsnTrp: 0.762 ± 0.172
1.955AsnTyr: 1.955 ± 0.214
0.0AsnXaa: 0.0 ± 0.0
Pro
2.22ProAla: 2.22 ± 0.335
0.464ProCys: 0.464 ± 0.129
2.552ProAsp: 2.552 ± 0.257
3.546ProGlu: 3.546 ± 0.506
1.624ProPhe: 1.624 ± 0.296
1.591ProGly: 1.591 ± 0.199
0.696ProHis: 0.696 ± 0.157
1.79ProIle: 1.79 ± 0.291
1.955ProLys: 1.955 ± 0.286
2.287ProLeu: 2.287 ± 0.307
1.326ProMet: 1.326 ± 0.204
1.69ProAsn: 1.69 ± 0.261
1.524ProPro: 1.524 ± 0.368
1.756ProGln: 1.756 ± 0.269
1.326ProArg: 1.326 ± 0.234
2.055ProSer: 2.055 ± 0.255
3.049ProThr: 3.049 ± 0.478
3.447ProVal: 3.447 ± 0.324
0.663ProTrp: 0.663 ± 0.126
1.392ProTyr: 1.392 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
2.983GlnAla: 2.983 ± 0.382
0.365GlnCys: 0.365 ± 0.114
1.889GlnAsp: 1.889 ± 0.243
2.883GlnGlu: 2.883 ± 0.335
1.723GlnPhe: 1.723 ± 0.239
2.154GlnGly: 2.154 ± 0.335
0.497GlnHis: 0.497 ± 0.12
1.591GlnIle: 1.591 ± 0.252
2.32GlnLys: 2.32 ± 0.381
2.684GlnLeu: 2.684 ± 0.477
1.359GlnMet: 1.359 ± 0.227
2.121GlnAsn: 2.121 ± 0.284
1.06GlnPro: 1.06 ± 0.198
1.524GlnGln: 1.524 ± 0.361
1.558GlnArg: 1.558 ± 0.241
1.856GlnSer: 1.856 ± 0.233
1.657GlnThr: 1.657 ± 0.25
3.148GlnVal: 3.148 ± 0.316
0.663GlnTrp: 0.663 ± 0.161
0.994GlnTyr: 0.994 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
3.447ArgAla: 3.447 ± 0.455
0.398ArgCys: 0.398 ± 0.108
2.916ArgAsp: 2.916 ± 0.298
3.115ArgGlu: 3.115 ± 0.457
2.486ArgPhe: 2.486 ± 0.272
3.248ArgGly: 3.248 ± 0.306
0.829ArgHis: 0.829 ± 0.161
2.916ArgIle: 2.916 ± 0.382
3.612ArgLys: 3.612 ± 0.364
3.314ArgLeu: 3.314 ± 0.331
1.756ArgMet: 1.756 ± 0.259
2.452ArgAsn: 2.452 ± 0.283
1.723ArgPro: 1.723 ± 0.268
1.723ArgGln: 1.723 ± 0.284
2.486ArgArg: 2.486 ± 0.378
2.817ArgSer: 2.817 ± 0.308
1.988ArgThr: 1.988 ± 0.289
3.977ArgVal: 3.977 ± 0.347
0.729ArgTrp: 0.729 ± 0.159
1.723ArgTyr: 1.723 ± 0.226
0.0ArgXaa: 0.0 ± 0.0
Ser
3.844SerAla: 3.844 ± 0.433
0.762SerCys: 0.762 ± 0.18
4.374SerAsp: 4.374 ± 0.417
5.236SerGlu: 5.236 ± 0.394
3.281SerPhe: 3.281 ± 0.393
5.336SerGly: 5.336 ± 0.518
1.591SerHis: 1.591 ± 0.249
3.612SerIle: 3.612 ± 0.345
4.143SerLys: 4.143 ± 0.42
5.236SerLeu: 5.236 ± 0.471
1.922SerMet: 1.922 ± 0.245
4.109SerAsn: 4.109 ± 0.474
2.121SerPro: 2.121 ± 0.332
2.22SerGln: 2.22 ± 0.291
3.645SerArg: 3.645 ± 0.344
4.838SerSer: 4.838 ± 0.57
3.612SerThr: 3.612 ± 0.405
4.275SerVal: 4.275 ± 0.387
1.193SerTrp: 1.193 ± 0.191
1.856SerTyr: 1.856 ± 0.26
0.0SerXaa: 0.0 ± 0.0
Thr
3.944ThrAla: 3.944 ± 0.489
0.431ThrCys: 0.431 ± 0.139
3.281ThrAsp: 3.281 ± 0.384
3.347ThrGlu: 3.347 ± 0.333
2.651ThrPhe: 2.651 ± 0.295
4.109ThrGly: 4.109 ± 0.401
0.795ThrHis: 0.795 ± 0.169
2.983ThrIle: 2.983 ± 0.31
3.248ThrLys: 3.248 ± 0.403
4.54ThrLeu: 4.54 ± 0.49
0.928ThrMet: 0.928 ± 0.173
2.784ThrAsn: 2.784 ± 0.46
3.148ThrPro: 3.148 ± 0.35
2.055ThrGln: 2.055 ± 0.294
2.287ThrArg: 2.287 ± 0.308
3.612ThrSer: 3.612 ± 0.378
3.148ThrThr: 3.148 ± 0.432
4.507ThrVal: 4.507 ± 0.426
0.961ThrTrp: 0.961 ± 0.206
1.79ThrTyr: 1.79 ± 0.268
0.0ThrXaa: 0.0 ± 0.0
Val
4.971ValAla: 4.971 ± 0.511
0.961ValCys: 0.961 ± 0.175
4.64ValAsp: 4.64 ± 0.391
5.733ValGlu: 5.733 ± 0.464
2.883ValPhe: 2.883 ± 0.346
4.706ValGly: 4.706 ± 0.433
0.961ValHis: 0.961 ± 0.194
4.076ValIle: 4.076 ± 0.385
5.236ValLys: 5.236 ± 0.465
5.07ValLeu: 5.07 ± 0.45
2.552ValMet: 2.552 ± 0.311
4.076ValAsn: 4.076 ± 0.436
2.254ValPro: 2.254 ± 0.278
2.618ValGln: 2.618 ± 0.302
3.513ValArg: 3.513 ± 0.378
4.872ValSer: 4.872 ± 0.398
3.911ValThr: 3.911 ± 0.363
5.568ValVal: 5.568 ± 0.561
1.392ValTrp: 1.392 ± 0.273
2.353ValTyr: 2.353 ± 0.32
0.0ValXaa: 0.0 ± 0.0
Trp
1.226TrpAla: 1.226 ± 0.2
0.265TrpCys: 0.265 ± 0.09
1.69TrpAsp: 1.69 ± 0.255
1.193TrpGlu: 1.193 ± 0.232
0.795TrpPhe: 0.795 ± 0.155
1.16TrpGly: 1.16 ± 0.193
0.331TrpHis: 0.331 ± 0.104
0.862TrpIle: 0.862 ± 0.129
1.027TrpLys: 1.027 ± 0.213
1.127TrpLeu: 1.127 ± 0.191
0.696TrpMet: 0.696 ± 0.179
1.16TrpAsn: 1.16 ± 0.226
0.365TrpPro: 0.365 ± 0.105
0.398TrpGln: 0.398 ± 0.11
0.696TrpArg: 0.696 ± 0.16
1.292TrpSer: 1.292 ± 0.311
0.762TrpThr: 0.762 ± 0.179
1.292TrpVal: 1.292 ± 0.167
0.199TrpTrp: 0.199 ± 0.074
0.597TrpTyr: 0.597 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.022TyrAla: 2.022 ± 0.238
0.464TyrCys: 0.464 ± 0.132
2.85TyrAsp: 2.85 ± 0.378
2.618TyrGlu: 2.618 ± 0.27
0.994TyrPhe: 0.994 ± 0.185
2.254TyrGly: 2.254 ± 0.309
1.06TyrHis: 1.06 ± 0.176
2.121TyrIle: 2.121 ± 0.307
2.353TyrLys: 2.353 ± 0.3
3.082TyrLeu: 3.082 ± 0.391
0.961TyrMet: 0.961 ± 0.218
2.055TyrAsn: 2.055 ± 0.269
1.458TyrPro: 1.458 ± 0.169
1.226TyrGln: 1.226 ± 0.251
1.591TyrArg: 1.591 ± 0.211
2.254TyrSer: 2.254 ± 0.252
2.022TyrThr: 2.022 ± 0.35
2.22TyrVal: 2.22 ± 0.289
0.862TyrTrp: 0.862 ± 0.197
0.994TyrTyr: 0.994 ± 0.207
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 154 proteins (30176 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski