Amino acid dipeptide frequency for Gordonia phage Orchid

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
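As a minimal sketch of how such frequencies can be computed, the Python function below counts overlapping pairs of adjacent residues inside each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to normalize by the total number of counted dipeptides are assumptions made for illustration; the page does not state the exact normalization it uses.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are counted with a sliding window of width 2 inside each protein,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of dipeptides that were counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical data, one-letter codes).
freqs = dipeptide_frequencies(["MAAK", "AAG"])
```

In this toy input there are five dipeptides in total (MA, AA, AK, AA, AG), so `freqs["AA"]` is 2 out of 5, i.e. 400 per mille.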

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
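The arithmetic behind the uniform expectation can be sketched in a few lines of Python. The helper names `expected_per_mille` and `classify` are hypothetical, and comparing each value directly against the uniform baseline is an assumption about how the colouring is decided, based only on the description above.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


baseline_20 = expected_per_mille(20)   # 1000 / 400
baseline_21 = expected_per_mille(21)   # 1000 / 441, when Xaa is included
label_ala_ala = classify(10.127)       # AlaAla value from the table
label_ala_cys = classify(0.374)        # AlaCys value from the table
```

With 20 residue types the baseline is 2.5‰, so AlaAla (10.127‰) falls on the overrepresented side and AlaCys (0.374‰) on the underrepresented side.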

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
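As a small illustration of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The function names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


ala_ala_fraction = per_mille_to_fraction(10.127)  # AlaAla value from the table
ala_ala_percent = per_mille_to_percent(10.127)
```

For AlaAla this gives a fraction of about 0.010127, or about 1.0127%.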

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.127 ± 2.757
AlaCys: 0.374 ± 0.139
AlaAsp: 4.565 ± 0.375
AlaGlu: 4.067 ± 0.343
AlaPhe: 2.905 ± 0.312
AlaGly: 6.475 ± 1.279
AlaHis: 1.287 ± 0.24
AlaIle: 4.69 ± 0.446
AlaLys: 4.69 ± 0.614
AlaLeu: 6.848 ± 0.766
AlaMet: 2.2 ± 0.357
AlaAsn: 3.735 ± 0.337
AlaPro: 5.188 ± 0.861
AlaGln: 3.735 ± 0.592
AlaArg: 4.233 ± 0.345
AlaSer: 5.479 ± 0.629
AlaThr: 5.354 ± 0.708
AlaVal: 6.101 ± 0.578
AlaTrp: 1.536 ± 0.286
AlaTyr: 3.071 ± 0.475
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.623 ± 0.207
CysCys: 0.0 ± 0.0
CysAsp: 0.623 ± 0.191
CysGlu: 0.664 ± 0.178
CysPhe: 0.498 ± 0.174
CysGly: 0.83 ± 0.191
CysHis: 0.125 ± 0.067
CysIle: 0.623 ± 0.189
CysLys: 0.581 ± 0.19
CysLeu: 0.83 ± 0.213
CysMet: 0.291 ± 0.106
CysAsn: 0.498 ± 0.159
CysPro: 0.706 ± 0.205
CysGln: 0.166 ± 0.079
CysArg: 0.54 ± 0.18
CysSer: 0.83 ± 0.184
CysThr: 0.457 ± 0.14
CysVal: 0.498 ± 0.161
CysTrp: 0.083 ± 0.06
CysTyr: 0.332 ± 0.126
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.396 ± 0.652
AspCys: 0.747 ± 0.188
AspAsp: 4.15 ± 0.74
AspGlu: 4.939 ± 0.484
AspPhe: 2.615 ± 0.315
AspGly: 4.524 ± 0.499
AspHis: 1.162 ± 0.271
AspIle: 4.482 ± 0.42
AspLys: 3.901 ± 0.374
AspLeu: 3.777 ± 0.405
AspMet: 1.785 ± 0.276
AspAsn: 3.362 ± 0.43
AspPro: 3.735 ± 0.464
AspGln: 2.075 ± 0.27
AspArg: 2.864 ± 0.381
AspSer: 4.067 ± 0.457
AspThr: 4.026 ± 0.535
AspVal: 3.652 ± 0.29
AspTrp: 1.245 ± 0.228
AspTyr: 2.615 ± 0.42
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.818 ± 0.419
GluCys: 0.789 ± 0.226
GluAsp: 3.32 ± 0.379
GluGlu: 3.735 ± 0.545
GluPhe: 3.154 ± 0.401
GluGly: 3.113 ± 0.374
GluHis: 1.287 ± 0.229
GluIle: 4.524 ± 0.363
GluLys: 3.86 ± 0.401
GluLeu: 5.645 ± 0.519
GluMet: 1.66 ± 0.375
GluAsn: 3.569 ± 0.33
GluPro: 2.615 ± 0.446
GluGln: 2.49 ± 0.433
GluArg: 3.154 ± 0.375
GluSer: 3.528 ± 0.402
GluThr: 3.652 ± 0.45
GluVal: 4.026 ± 0.466
GluTrp: 1.37 ± 0.203
GluTyr: 2.2 ± 0.341
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.113 ± 0.344
PheCys: 0.457 ± 0.144
PheAsp: 2.656 ± 0.361
PheGlu: 2.366 ± 0.338
PhePhe: 1.287 ± 0.246
PheGly: 2.573 ± 0.298
PheHis: 0.664 ± 0.188
PheIle: 2.449 ± 0.306
PheLys: 2.49 ± 0.267
PheLeu: 2.158 ± 0.336
PheMet: 1.162 ± 0.211
PheAsn: 1.909 ± 0.288
PhePro: 1.785 ± 0.263
PheGln: 1.121 ± 0.207
PheArg: 2.117 ± 0.303
PheSer: 2.283 ± 0.353
PheThr: 3.735 ± 0.344
PheVal: 1.951 ± 0.305
PheTrp: 0.374 ± 0.142
PheTyr: 0.913 ± 0.236
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.935 ± 1.012
GlyCys: 0.374 ± 0.121
GlyAsp: 3.86 ± 0.361
GlyGlu: 4.275 ± 0.359
GlyPhe: 3.279 ± 0.366
GlyGly: 5.479 ± 0.858
GlyHis: 1.536 ± 0.29
GlyIle: 5.396 ± 0.484
GlyLys: 5.022 ± 0.394
GlyLeu: 5.188 ± 0.735
GlyMet: 2.864 ± 0.498
GlyAsn: 3.735 ± 0.371
GlyPro: 2.656 ± 0.307
GlyGln: 2.324 ± 0.281
GlyArg: 3.362 ± 0.376
GlySer: 4.524 ± 0.471
GlyThr: 4.814 ± 0.508
GlyVal: 4.648 ± 0.511
GlyTrp: 1.245 ± 0.205
GlyTyr: 2.698 ± 0.33
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.162 ± 0.306
HisCys: 0.208 ± 0.102
HisAsp: 1.37 ± 0.278
HisGlu: 0.913 ± 0.198
HisPhe: 0.789 ± 0.196
HisGly: 1.37 ± 0.244
HisHis: 0.374 ± 0.137
HisIle: 1.079 ± 0.23
HisLys: 0.83 ± 0.22
HisLeu: 1.162 ± 0.195
HisMet: 0.374 ± 0.12
HisAsn: 0.747 ± 0.173
HisPro: 0.706 ± 0.188
HisGln: 0.664 ± 0.204
HisArg: 1.204 ± 0.265
HisSer: 1.162 ± 0.214
HisThr: 1.038 ± 0.217
HisVal: 1.204 ± 0.251
HisTrp: 0.291 ± 0.088
HisTyr: 0.913 ± 0.212
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.856 ± 0.496
IleCys: 0.747 ± 0.183
IleAsp: 4.939 ± 0.432
IleGlu: 4.939 ± 0.476
IlePhe: 1.951 ± 0.325
IleGly: 5.105 ± 0.427
IleHis: 0.872 ± 0.255
IleIle: 3.445 ± 0.341
IleLys: 3.86 ± 0.38
IleLeu: 3.818 ± 0.37
IleMet: 0.955 ± 0.191
IleAsn: 3.237 ± 0.398
IlePro: 3.279 ± 0.345
IleGln: 1.785 ± 0.226
IleArg: 2.988 ± 0.335
IleSer: 4.067 ± 0.387
IleThr: 4.399 ± 0.476
IleVal: 4.358 ± 0.425
IleTrp: 0.747 ± 0.18
IleTyr: 1.826 ± 0.24
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.773 ± 0.533
LysCys: 0.457 ± 0.167
LysAsp: 3.403 ± 0.452
LysGlu: 3.943 ± 0.461
LysPhe: 1.743 ± 0.235
LysGly: 4.026 ± 0.53
LysHis: 1.079 ± 0.229
LysIle: 3.652 ± 0.462
LysLys: 4.607 ± 0.628
LysLeu: 4.607 ± 0.46
LysMet: 1.951 ± 0.301
LysAsn: 2.781 ± 0.339
LysPro: 2.656 ± 0.367
LysGln: 2.656 ± 0.297
LysArg: 3.196 ± 0.409
LysSer: 3.445 ± 0.381
LysThr: 3.279 ± 0.32
LysVal: 4.607 ± 0.419
LysTrp: 1.079 ± 0.207
LysTyr: 1.785 ± 0.308
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.724 ± 0.646
LeuCys: 0.913 ± 0.228
LeuAsp: 4.856 ± 0.388
LeuGlu: 5.105 ± 0.479
LeuPhe: 2.573 ± 0.385
LeuGly: 4.814 ± 0.556
LeuHis: 0.872 ± 0.203
LeuIle: 3.735 ± 0.407
LeuLys: 3.984 ± 0.426
LeuLeu: 4.233 ± 0.369
LeuMet: 1.785 ± 0.304
LeuAsn: 4.109 ± 0.381
LeuPro: 3.652 ± 0.404
LeuGln: 2.324 ± 0.289
LeuArg: 3.237 ± 0.382
LeuSer: 3.943 ± 0.502
LeuThr: 5.23 ± 0.466
LeuVal: 5.064 ± 0.529
LeuTrp: 0.955 ± 0.185
LeuTyr: 2.283 ± 0.335
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.366 ± 0.458
MetCys: 0.083 ± 0.072
MetAsp: 1.453 ± 0.203
MetGlu: 1.245 ± 0.183
MetPhe: 0.83 ± 0.217
MetGly: 2.615 ± 0.323
MetHis: 0.457 ± 0.188
MetIle: 1.577 ± 0.208
MetLys: 1.743 ± 0.281
MetLeu: 1.785 ± 0.286
MetMet: 0.498 ± 0.131
MetAsn: 1.868 ± 0.332
MetPro: 1.328 ± 0.26
MetGln: 1.038 ± 0.209
MetArg: 1.328 ± 0.195
MetSer: 2.366 ± 0.327
MetThr: 1.37 ± 0.222
MetVal: 1.536 ± 0.259
MetTrp: 0.249 ± 0.122
MetTyr: 1.162 ± 0.242
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.362 ± 0.404
AsnCys: 0.457 ± 0.136
AsnAsp: 3.403 ± 0.353
AsnGlu: 2.905 ± 0.393
AsnPhe: 2.117 ± 0.273
AsnGly: 3.403 ± 0.389
AsnHis: 0.872 ± 0.21
AsnIle: 3.403 ± 0.396
AsnLys: 3.901 ± 0.386
AsnLeu: 3.694 ± 0.478
AsnMet: 1.121 ± 0.192
AsnAsn: 2.905 ± 0.306
AsnPro: 3.279 ± 0.402
AsnGln: 1.785 ± 0.262
AsnArg: 2.698 ± 0.317
AsnSer: 3.777 ± 0.38
AsnThr: 3.652 ± 0.398
AsnVal: 3.362 ± 0.404
AsnTrp: 0.747 ± 0.159
AsnTyr: 1.868 ± 0.301
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.271 ± 0.714
ProCys: 0.415 ± 0.174
ProAsp: 3.32 ± 0.436
ProGlu: 3.984 ± 0.437
ProPhe: 1.411 ± 0.177
ProGly: 4.192 ± 0.348
ProHis: 1.038 ± 0.246
ProIle: 3.071 ± 0.338
ProLys: 2.034 ± 0.432
ProLeu: 2.532 ± 0.297
ProMet: 1.453 ± 0.235
ProAsn: 2.947 ± 0.312
ProPro: 2.656 ± 0.752
ProGln: 1.287 ± 0.215
ProArg: 1.66 ± 0.382
ProSer: 3.362 ± 0.48
ProThr: 3.113 ± 0.469
ProVal: 3.777 ± 0.46
ProTrp: 0.872 ± 0.184
ProTyr: 1.66 ± 0.307
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.026 ± 0.496
GlnCys: 0.332 ± 0.125
GlnAsp: 1.785 ± 0.26
GlnGlu: 2.2 ± 0.325
GlnPhe: 1.37 ± 0.274
GlnGly: 2.2 ± 0.305
GlnHis: 0.623 ± 0.193
GlnIle: 2.49 ± 0.293
GlnLys: 2.366 ± 0.295
GlnLeu: 2.822 ± 0.356
GlnMet: 1.162 ± 0.205
GlnAsn: 1.37 ± 0.271
GlnPro: 1.079 ± 0.282
GlnGln: 1.37 ± 0.282
GlnArg: 1.577 ± 0.207
GlnSer: 2.366 ± 0.33
GlnThr: 2.117 ± 0.306
GlnVal: 2.2 ± 0.333
GlnTrp: 0.996 ± 0.167
GlnTyr: 1.245 ± 0.224
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.648 ± 0.617
ArgCys: 0.623 ± 0.194
ArgAsp: 3.32 ± 0.392
ArgGlu: 2.49 ± 0.349
ArgPhe: 1.992 ± 0.232
ArgGly: 3.362 ± 0.423
ArgHis: 1.038 ± 0.205
ArgIle: 3.403 ± 0.424
ArgLys: 3.279 ± 0.346
ArgLeu: 3.611 ± 0.416
ArgMet: 1.785 ± 0.265
ArgAsn: 1.992 ± 0.283
ArgPro: 2.698 ± 0.388
ArgGln: 1.37 ± 0.205
ArgArg: 2.739 ± 0.429
ArgSer: 2.656 ± 0.324
ArgThr: 2.49 ± 0.34
ArgVal: 2.407 ± 0.336
ArgTrp: 1.079 ± 0.196
ArgTyr: 1.992 ± 0.272
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.23 ± 0.681
SerCys: 0.623 ± 0.184
SerAsp: 4.524 ± 0.505
SerGlu: 3.528 ± 0.46
SerPhe: 2.407 ± 0.356
SerGly: 5.147 ± 0.509
SerHis: 0.83 ± 0.197
SerIle: 3.528 ± 0.426
SerLys: 3.611 ± 0.42
SerLeu: 4.856 ± 0.451
SerMet: 2.117 ± 0.313
SerAsn: 3.528 ± 0.392
SerPro: 3.154 ± 0.318
SerGln: 1.992 ± 0.262
SerArg: 2.822 ± 0.32
SerSer: 3.611 ± 0.592
SerThr: 4.067 ± 0.463
SerVal: 4.358 ± 0.567
SerTrp: 0.955 ± 0.212
SerTyr: 1.785 ± 0.283
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.226 ± 0.767
ThrCys: 0.581 ± 0.18
ThrAsp: 4.939 ± 0.52
ThrGlu: 3.154 ± 0.465
ThrPhe: 2.366 ± 0.271
ThrGly: 5.769 ± 0.584
ThrHis: 1.162 ± 0.238
ThrIle: 3.528 ± 0.416
ThrLys: 3.32 ± 0.33
ThrLeu: 4.358 ± 0.349
ThrMet: 1.204 ± 0.249
ThrAsn: 3.403 ± 0.415
ThrPro: 3.777 ± 0.467
ThrGln: 2.656 ± 0.28
ThrArg: 2.739 ± 0.262
ThrSer: 4.026 ± 0.548
ThrThr: 3.362 ± 0.544
ThrVal: 4.316 ± 0.488
ThrTrp: 0.54 ± 0.14
ThrTyr: 2.324 ± 0.302
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.147 ± 0.41
ValCys: 0.789 ± 0.199
ValAsp: 5.354 ± 0.645
ValGlu: 4.15 ± 0.393
ValPhe: 3.03 ± 0.432
ValGly: 4.441 ± 0.578
ValHis: 1.079 ± 0.198
ValIle: 4.358 ± 0.433
ValLys: 2.905 ± 0.337
ValLeu: 4.109 ± 0.43
ValMet: 1.536 ± 0.228
ValAsn: 3.777 ± 0.391
ValPro: 3.237 ± 0.337
ValGln: 2.324 ± 0.306
ValArg: 3.694 ± 0.375
ValSer: 3.86 ± 0.522
ValThr: 4.69 ± 0.436
ValVal: 3.943 ± 0.451
ValTrp: 0.789 ± 0.187
ValTyr: 1.951 ± 0.237
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.577 ± 0.266
TrpCys: 0.166 ± 0.079
TrpAsp: 0.955 ± 0.18
TrpGlu: 0.996 ± 0.18
TrpPhe: 0.415 ± 0.135
TrpGly: 0.747 ± 0.158
TrpHis: 0.374 ± 0.136
TrpIle: 0.955 ± 0.184
TrpLys: 0.913 ± 0.149
TrpLeu: 1.536 ± 0.236
TrpMet: 0.374 ± 0.122
TrpAsn: 1.204 ± 0.201
TrpPro: 0.291 ± 0.117
TrpGln: 0.872 ± 0.171
TrpArg: 0.955 ± 0.193
TrpSer: 0.83 ± 0.204
TrpThr: 0.747 ± 0.186
TrpVal: 0.83 ± 0.172
TrpTrp: 0.208 ± 0.112
TrpTyr: 0.789 ± 0.176
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.283 ± 0.273
TyrCys: 0.664 ± 0.166
TyrAsp: 2.366 ± 0.374
TyrGlu: 1.909 ± 0.343
TyrPhe: 0.83 ± 0.187
TyrGly: 3.03 ± 0.4
TyrHis: 0.83 ± 0.182
TyrIle: 1.785 ± 0.22
TyrLys: 1.826 ± 0.37
TyrLeu: 2.822 ± 0.432
TyrMet: 0.581 ± 0.142
TyrAsn: 2.034 ± 0.293
TyrPro: 1.577 ± 0.316
TyrGln: 1.66 ± 0.254
TyrArg: 1.868 ± 0.315
TyrSer: 2.49 ± 0.434
TyrThr: 2.158 ± 0.251
TyrVal: 2.324 ± 0.349
TyrTrp: 0.374 ± 0.114
TyrTyr: 1.245 ± 0.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 114 proteins (24095 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
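A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all counted dipeptides."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["AAAA", "GGGG", "AAGG"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps adjacent residues together, so each replicate contains only dipeptides that actually occur in the proteome.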

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski