Amino acid dipepetide frequency for Gordonia phage SheckWes

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.135AlaAla: 11.135 ± 1.525
0.994AlaCys: 0.994 ± 0.263
6.495AlaAsp: 6.495 ± 0.757
5.435AlaGlu: 5.435 ± 0.586
4.374AlaPhe: 4.374 ± 0.798
7.688AlaGly: 7.688 ± 0.836
1.922AlaHis: 1.922 ± 0.35
5.17AlaIle: 5.17 ± 0.474
3.844AlaLys: 3.844 ± 0.416
7.291AlaLeu: 7.291 ± 0.768
4.374AlaMet: 4.374 ± 0.682
2.651AlaAsn: 2.651 ± 0.403
5.037AlaPro: 5.037 ± 0.608
3.778AlaGln: 3.778 ± 0.742
6.694AlaArg: 6.694 ± 0.859
5.037AlaSer: 5.037 ± 0.896
6.23AlaThr: 6.23 ± 0.688
6.031AlaVal: 6.031 ± 0.717
2.055AlaTrp: 2.055 ± 0.367
2.187AlaTyr: 2.187 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
0.928CysAla: 0.928 ± 0.285
0.0CysCys: 0.0 ± 0.0
0.994CysAsp: 0.994 ± 0.35
0.597CysGlu: 0.597 ± 0.209
0.265CysPhe: 0.265 ± 0.148
0.994CysGly: 0.994 ± 0.276
0.199CysHis: 0.199 ± 0.102
0.265CysIle: 0.265 ± 0.159
0.464CysLys: 0.464 ± 0.171
0.331CysLeu: 0.331 ± 0.145
0.133CysMet: 0.133 ± 0.082
0.597CysAsn: 0.597 ± 0.178
0.398CysPro: 0.398 ± 0.17
0.398CysGln: 0.398 ± 0.158
0.795CysArg: 0.795 ± 0.212
0.729CysSer: 0.729 ± 0.258
0.795CysThr: 0.795 ± 0.323
0.663CysVal: 0.663 ± 0.214
0.066CysTrp: 0.066 ± 0.059
0.331CysTyr: 0.331 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
5.236AspAla: 5.236 ± 0.407
0.53AspCys: 0.53 ± 0.182
4.838AspAsp: 4.838 ± 0.912
6.164AspGlu: 6.164 ± 0.8
1.79AspPhe: 1.79 ± 0.351
4.772AspGly: 4.772 ± 0.679
1.524AspHis: 1.524 ± 0.29
4.242AspIle: 4.242 ± 0.559
2.253AspLys: 2.253 ± 0.385
5.832AspLeu: 5.832 ± 0.736
1.79AspMet: 1.79 ± 0.314
2.253AspAsn: 2.253 ± 0.337
4.441AspPro: 4.441 ± 0.688
2.187AspGln: 2.187 ± 0.394
4.109AspArg: 4.109 ± 0.463
2.651AspSer: 2.651 ± 0.477
3.977AspThr: 3.977 ± 0.665
4.706AspVal: 4.706 ± 0.49
1.326AspTrp: 1.326 ± 0.286
2.585AspTyr: 2.585 ± 0.463
0.0AspXaa: 0.0 ± 0.0
Glu
6.098GluAla: 6.098 ± 0.529
0.795GluCys: 0.795 ± 0.2
4.971GluAsp: 4.971 ± 0.802
4.706GluGlu: 4.706 ± 0.75
1.06GluPhe: 1.06 ± 0.305
4.905GluGly: 4.905 ± 0.662
1.193GluHis: 1.193 ± 0.376
2.519GluIle: 2.519 ± 0.415
2.717GluLys: 2.717 ± 0.578
5.435GluLeu: 5.435 ± 0.569
2.055GluMet: 2.055 ± 0.431
2.121GluAsn: 2.121 ± 0.387
2.784GluPro: 2.784 ± 0.458
3.181GluGln: 3.181 ± 0.505
4.242GluArg: 4.242 ± 0.635
2.519GluSer: 2.519 ± 0.382
3.181GluThr: 3.181 ± 0.45
3.513GluVal: 3.513 ± 0.512
1.193GluTrp: 1.193 ± 0.237
2.055GluTyr: 2.055 ± 0.447
0.0GluXaa: 0.0 ± 0.0
Phe
2.916PheAla: 2.916 ± 0.48
0.663PheCys: 0.663 ± 0.248
3.38PheAsp: 3.38 ± 0.474
2.055PheGlu: 2.055 ± 0.347
1.06PhePhe: 1.06 ± 0.252
2.253PheGly: 2.253 ± 0.374
0.663PheHis: 0.663 ± 0.231
1.591PheIle: 1.591 ± 0.329
1.458PheLys: 1.458 ± 0.39
2.187PheLeu: 2.187 ± 0.348
0.53PheMet: 0.53 ± 0.182
0.994PheAsn: 0.994 ± 0.224
1.524PhePro: 1.524 ± 0.339
1.06PheGln: 1.06 ± 0.295
2.187PheArg: 2.187 ± 0.333
1.591PheSer: 1.591 ± 0.283
2.055PheThr: 2.055 ± 0.337
2.386PheVal: 2.386 ± 0.407
0.265PheTrp: 0.265 ± 0.121
0.265PheTyr: 0.265 ± 0.159
0.0PheXaa: 0.0 ± 0.0
Gly
7.622GlyAla: 7.622 ± 1.24
0.398GlyCys: 0.398 ± 0.175
6.164GlyAsp: 6.164 ± 1.345
4.176GlyGlu: 4.176 ± 0.563
2.452GlyPhe: 2.452 ± 0.387
5.435GlyGly: 5.435 ± 0.886
1.723GlyHis: 1.723 ± 0.362
4.971GlyIle: 4.971 ± 1.02
4.043GlyLys: 4.043 ± 0.473
5.369GlyLeu: 5.369 ± 0.565
1.326GlyMet: 1.326 ± 0.317
1.922GlyAsn: 1.922 ± 0.423
4.308GlyPro: 4.308 ± 1.98
2.916GlyGln: 2.916 ± 0.56
5.435GlyArg: 5.435 ± 0.611
5.302GlySer: 5.302 ± 0.867
5.634GlyThr: 5.634 ± 0.69
6.23GlyVal: 6.23 ± 0.674
1.922GlyTrp: 1.922 ± 0.361
2.055GlyTyr: 2.055 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
1.392HisAla: 1.392 ± 0.289
0.265HisCys: 0.265 ± 0.122
1.988HisAsp: 1.988 ± 0.349
1.259HisGlu: 1.259 ± 0.307
0.795HisPhe: 0.795 ± 0.286
1.524HisGly: 1.524 ± 0.338
0.729HisHis: 0.729 ± 0.224
0.795HisIle: 0.795 ± 0.228
0.994HisLys: 0.994 ± 0.269
0.994HisLeu: 0.994 ± 0.295
0.928HisMet: 0.928 ± 0.274
0.862HisAsn: 0.862 ± 0.259
1.193HisPro: 1.193 ± 0.288
0.729HisGln: 0.729 ± 0.202
1.259HisArg: 1.259 ± 0.249
1.392HisSer: 1.392 ± 0.268
1.326HisThr: 1.326 ± 0.28
1.458HisVal: 1.458 ± 0.302
0.464HisTrp: 0.464 ± 0.201
0.331HisTyr: 0.331 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
5.766IleAla: 5.766 ± 0.667
0.331IleCys: 0.331 ± 0.141
3.977IleAsp: 3.977 ± 0.465
3.712IleGlu: 3.712 ± 0.482
1.392IlePhe: 1.392 ± 0.435
3.712IleGly: 3.712 ± 0.628
1.259IleHis: 1.259 ± 0.298
2.651IleIle: 2.651 ± 0.467
2.32IleLys: 2.32 ± 0.552
3.115IleLeu: 3.115 ± 0.443
0.862IleMet: 0.862 ± 0.189
1.392IleAsn: 1.392 ± 0.267
2.519IlePro: 2.519 ± 0.389
1.458IleGln: 1.458 ± 0.678
3.38IleArg: 3.38 ± 0.482
2.519IleSer: 2.519 ± 0.395
2.784IleThr: 2.784 ± 0.404
3.91IleVal: 3.91 ± 0.425
0.994IleTrp: 0.994 ± 0.275
1.259IleTyr: 1.259 ± 0.351
0.0IleXaa: 0.0 ± 0.0
Lys
5.369LysAla: 5.369 ± 0.715
0.331LysCys: 0.331 ± 0.157
2.519LysAsp: 2.519 ± 0.391
2.585LysGlu: 2.585 ± 0.409
1.856LysPhe: 1.856 ± 0.446
4.043LysGly: 4.043 ± 0.609
0.795LysHis: 0.795 ± 0.192
2.253LysIle: 2.253 ± 0.353
3.314LysLys: 3.314 ± 0.489
3.977LysLeu: 3.977 ± 0.683
1.458LysMet: 1.458 ± 0.292
1.657LysAsn: 1.657 ± 0.33
2.585LysPro: 2.585 ± 0.324
2.452LysGln: 2.452 ± 0.414
3.38LysArg: 3.38 ± 0.471
2.452LysSer: 2.452 ± 0.532
2.717LysThr: 2.717 ± 0.479
2.585LysVal: 2.585 ± 0.471
0.928LysTrp: 0.928 ± 0.229
1.723LysTyr: 1.723 ± 0.327
0.0LysXaa: 0.0 ± 0.0
Leu
8.285LeuAla: 8.285 ± 1.016
0.729LeuCys: 0.729 ± 0.212
4.507LeuAsp: 4.507 ± 0.577
5.501LeuGlu: 5.501 ± 0.757
2.121LeuPhe: 2.121 ± 0.392
6.164LeuGly: 6.164 ± 0.891
1.458LeuHis: 1.458 ± 0.327
3.977LeuIle: 3.977 ± 0.528
3.712LeuLys: 3.712 ± 0.578
6.628LeuLeu: 6.628 ± 0.616
1.259LeuMet: 1.259 ± 0.347
2.121LeuAsn: 2.121 ± 0.423
4.838LeuPro: 4.838 ± 0.523
2.519LeuGln: 2.519 ± 0.442
6.23LeuArg: 6.23 ± 0.713
4.242LeuSer: 4.242 ± 0.475
5.17LeuThr: 5.17 ± 0.62
5.037LeuVal: 5.037 ± 0.603
1.79LeuTrp: 1.79 ± 0.335
2.585LeuTyr: 2.585 ± 0.353
0.0LeuXaa: 0.0 ± 0.0
Met
2.85MetAla: 2.85 ± 0.363
0.133MetCys: 0.133 ± 0.093
1.193MetAsp: 1.193 ± 0.235
0.994MetGlu: 0.994 ± 0.228
0.928MetPhe: 0.928 ± 0.237
1.988MetGly: 1.988 ± 0.456
0.53MetHis: 0.53 ± 0.171
1.524MetIle: 1.524 ± 0.41
1.922MetLys: 1.922 ± 0.318
1.591MetLeu: 1.591 ± 0.332
0.795MetMet: 0.795 ± 0.244
0.994MetAsn: 0.994 ± 0.258
1.524MetPro: 1.524 ± 0.294
0.928MetGln: 0.928 ± 0.195
1.458MetArg: 1.458 ± 0.277
2.253MetSer: 2.253 ± 0.304
2.055MetThr: 2.055 ± 0.324
1.988MetVal: 1.988 ± 0.392
0.464MetTrp: 0.464 ± 0.207
0.663MetTyr: 0.663 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
3.38AsnAla: 3.38 ± 0.816
0.331AsnCys: 0.331 ± 0.156
1.79AsnAsp: 1.79 ± 0.378
1.856AsnGlu: 1.856 ± 0.321
0.928AsnPhe: 0.928 ± 0.269
2.651AsnGly: 2.651 ± 0.385
0.729AsnHis: 0.729 ± 0.26
1.591AsnIle: 1.591 ± 0.347
1.326AsnLys: 1.326 ± 0.365
2.585AsnLeu: 2.585 ± 0.434
0.663AsnMet: 0.663 ± 0.222
0.994AsnAsn: 0.994 ± 0.245
2.983AsnPro: 2.983 ± 0.44
0.862AsnGln: 0.862 ± 0.328
2.651AsnArg: 2.651 ± 0.379
1.524AsnSer: 1.524 ± 0.288
1.458AsnThr: 1.458 ± 0.255
1.988AsnVal: 1.988 ± 0.312
0.795AsnTrp: 0.795 ± 0.194
0.597AsnTyr: 0.597 ± 0.2
0.0AsnXaa: 0.0 ± 0.0
Pro
6.098ProAla: 6.098 ± 0.819
0.928ProCys: 0.928 ± 0.273
3.115ProAsp: 3.115 ± 0.469
3.513ProGlu: 3.513 ± 0.505
1.856ProPhe: 1.856 ± 0.405
4.706ProGly: 4.706 ± 0.673
0.729ProHis: 0.729 ± 0.189
2.519ProIle: 2.519 ± 0.42
3.248ProLys: 3.248 ± 0.552
3.181ProLeu: 3.181 ± 0.436
1.326ProMet: 1.326 ± 0.272
2.121ProAsn: 2.121 ± 0.34
2.916ProPro: 2.916 ± 0.483
3.049ProGln: 3.049 ± 1.106
3.115ProArg: 3.115 ± 0.475
2.983ProSer: 2.983 ± 0.572
3.712ProThr: 3.712 ± 0.52
3.778ProVal: 3.778 ± 0.525
1.259ProTrp: 1.259 ± 0.306
1.326ProTyr: 1.326 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
3.115GlnAla: 3.115 ± 0.73
0.199GlnCys: 0.199 ± 0.122
2.585GlnAsp: 2.585 ± 0.399
2.187GlnGlu: 2.187 ± 0.306
1.657GlnPhe: 1.657 ± 0.353
4.838GlnGly: 4.838 ± 2.445
0.663GlnHis: 0.663 ± 0.2
1.591GlnIle: 1.591 ± 0.346
1.524GlnLys: 1.524 ± 0.303
3.446GlnLeu: 3.446 ± 0.56
0.795GlnMet: 0.795 ± 0.243
0.928GlnAsn: 0.928 ± 0.268
1.591GlnPro: 1.591 ± 0.349
1.856GlnGln: 1.856 ± 0.402
3.181GlnArg: 3.181 ± 0.42
1.988GlnSer: 1.988 ± 0.325
1.259GlnThr: 1.259 ± 0.225
2.916GlnVal: 2.916 ± 0.397
0.862GlnTrp: 0.862 ± 0.2
1.591GlnTyr: 1.591 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
7.489ArgAla: 7.489 ± 0.776
0.729ArgCys: 0.729 ± 0.269
3.645ArgAsp: 3.645 ± 0.606
3.645ArgGlu: 3.645 ± 0.592
1.922ArgPhe: 1.922 ± 0.457
4.905ArgGly: 4.905 ± 0.737
1.524ArgHis: 1.524 ± 0.296
2.519ArgIle: 2.519 ± 0.36
4.905ArgLys: 4.905 ± 0.519
6.429ArgLeu: 6.429 ± 0.608
1.723ArgMet: 1.723 ± 0.382
2.452ArgAsn: 2.452 ± 0.45
2.784ArgPro: 2.784 ± 0.542
2.717ArgGln: 2.717 ± 0.359
6.098ArgArg: 6.098 ± 0.753
3.91ArgSer: 3.91 ± 0.414
2.85ArgThr: 2.85 ± 0.54
5.369ArgVal: 5.369 ± 0.605
1.657ArgTrp: 1.657 ± 0.395
1.591ArgTyr: 1.591 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
5.037SerAla: 5.037 ± 0.694
0.729SerCys: 0.729 ± 0.207
3.645SerAsp: 3.645 ± 0.431
2.717SerGlu: 2.717 ± 0.449
1.259SerPhe: 1.259 ± 0.334
5.634SerGly: 5.634 ± 0.748
1.458SerHis: 1.458 ± 0.35
2.651SerIle: 2.651 ± 0.506
2.585SerLys: 2.585 ± 0.381
4.905SerLeu: 4.905 ± 0.737
1.79SerMet: 1.79 ± 0.364
1.524SerAsn: 1.524 ± 0.286
3.645SerPro: 3.645 ± 0.565
1.657SerGln: 1.657 ± 0.363
2.916SerArg: 2.916 ± 0.348
3.446SerSer: 3.446 ± 0.479
3.579SerThr: 3.579 ± 0.622
2.121SerVal: 2.121 ± 0.274
1.657SerTrp: 1.657 ± 0.349
1.326SerTyr: 1.326 ± 0.252
0.0SerXaa: 0.0 ± 0.0
Thr
5.236ThrAla: 5.236 ± 0.69
0.663ThrCys: 0.663 ± 0.249
3.446ThrAsp: 3.446 ± 0.47
2.85ThrGlu: 2.85 ± 0.419
1.591ThrPhe: 1.591 ± 0.306
5.567ThrGly: 5.567 ± 0.914
1.06ThrHis: 1.06 ± 0.225
2.983ThrIle: 2.983 ± 0.674
3.181ThrLys: 3.181 ± 0.507
5.369ThrLeu: 5.369 ± 0.561
1.458ThrMet: 1.458 ± 0.351
1.723ThrAsn: 1.723 ± 0.313
4.176ThrPro: 4.176 ± 0.424
1.988ThrGln: 1.988 ± 0.292
4.441ThrArg: 4.441 ± 0.575
3.314ThrSer: 3.314 ± 0.518
4.176ThrThr: 4.176 ± 0.674
5.103ThrVal: 5.103 ± 0.716
1.259ThrTrp: 1.259 ± 0.356
1.326ThrTyr: 1.326 ± 0.284
0.0ThrXaa: 0.0 ± 0.0
Val
6.429ValAla: 6.429 ± 0.607
0.53ValCys: 0.53 ± 0.167
4.639ValAsp: 4.639 ± 0.592
4.176ValGlu: 4.176 ± 0.583
2.187ValPhe: 2.187 ± 0.334
4.308ValGly: 4.308 ± 0.553
1.326ValHis: 1.326 ± 0.261
3.91ValIle: 3.91 ± 0.533
2.85ValLys: 2.85 ± 0.33
5.965ValLeu: 5.965 ± 0.751
1.856ValMet: 1.856 ± 0.412
2.253ValAsn: 2.253 ± 0.409
3.844ValPro: 3.844 ± 0.48
2.85ValGln: 2.85 ± 0.458
3.778ValArg: 3.778 ± 0.463
3.446ValSer: 3.446 ± 0.577
5.17ValThr: 5.17 ± 0.583
6.363ValVal: 6.363 ± 0.709
1.657ValTrp: 1.657 ± 0.335
1.988ValTyr: 1.988 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
1.988TrpAla: 1.988 ± 0.413
0.398TrpCys: 0.398 ± 0.181
1.458TrpAsp: 1.458 ± 0.251
0.928TrpGlu: 0.928 ± 0.298
0.729TrpPhe: 0.729 ± 0.242
0.928TrpGly: 0.928 ± 0.289
0.464TrpHis: 0.464 ± 0.179
0.331TrpIle: 0.331 ± 0.151
0.928TrpLys: 0.928 ± 0.283
2.121TrpLeu: 2.121 ± 0.447
0.398TrpMet: 0.398 ± 0.16
1.458TrpAsn: 1.458 ± 0.296
0.928TrpPro: 0.928 ± 0.234
1.193TrpGln: 1.193 ± 0.331
1.79TrpArg: 1.79 ± 0.27
1.259TrpSer: 1.259 ± 0.347
0.994TrpThr: 0.994 ± 0.229
1.723TrpVal: 1.723 ± 0.358
0.398TrpTrp: 0.398 ± 0.202
0.795TrpTyr: 0.795 ± 0.246
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.055TyrAla: 2.055 ± 0.365
0.265TyrCys: 0.265 ± 0.151
1.856TyrAsp: 1.856 ± 0.404
2.187TyrGlu: 2.187 ± 0.42
0.862TyrPhe: 0.862 ± 0.216
2.386TyrGly: 2.386 ± 0.353
0.729TyrHis: 0.729 ± 0.212
1.259TyrIle: 1.259 ± 0.219
1.392TyrLys: 1.392 ± 0.411
2.121TyrLeu: 2.121 ± 0.371
1.193TyrMet: 1.193 ± 0.271
0.663TyrAsn: 0.663 ± 0.209
1.458TyrPro: 1.458 ± 0.287
0.994TyrGln: 0.994 ± 0.219
1.723TyrArg: 1.723 ± 0.375
1.723TyrSer: 1.723 ± 0.347
1.79TyrThr: 1.79 ± 0.297
1.723TyrVal: 1.723 ± 0.412
0.199TyrTrp: 0.199 ± 0.126
0.795TyrTyr: 0.795 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (15089 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski