Amino acid dipeptide frequency for Gordonia phage Lucky10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
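As a rough illustration of this calculation, the Python sketch below counts every overlapping pair of adjacent residues and divides by the total number of pairs. It uses one-letter codes and made-up toy sequences. The function name and the choice not to count pairs across protein boundaries are assumptions, since this page does not state how the database handles them.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Assumption: pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not taken from the Lucky10 proteome).
toy_proteins = ["MAAG", "AAK"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, fraction in sorted(freqs.items()):
    print(pair, round(fraction, 3))
```

The fractions sum to 1 over all observed dipeptides; pairs that never occur are simply absent from the result and correspond to a frequency of 0.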

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
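The unit conversion and the comparison against the uniform expectation can be sketched as follows. The three example values are copied from the table on this page; the function names and the strict comparison against 2.5‰ (without any tolerance or use of the error estimate) are assumptions, because the exact colouring rule is not stated here.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"   # would be shown in red
    if value_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


# Values taken from the table (per mille).
examples = {"AlaAla": 16.221, "CysCys": 0.145, "GlyMet": 2.535}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

With this rule AlaAla and GlyMet come out as overrepresented and CysCys as underrepresented.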

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.221 ± 1.464
AlaCys: 0.652 ± 0.278
AlaAsp: 7.966 ± 0.792
AlaGlu: 7.676 ± 0.919
AlaPhe: 2.39 ± 0.419
AlaGly: 10.935 ± 0.926
AlaHis: 1.955 ± 0.374
AlaIle: 4.779 ± 0.52
AlaLys: 3.983 ± 0.829
AlaLeu: 9.124 ± 1.096
AlaMet: 3.114 ± 0.504
AlaAsn: 4.2 ± 0.639
AlaPro: 5.504 ± 0.542
AlaGln: 3.259 ± 0.426
AlaArg: 9.052 ± 0.864
AlaSer: 5.504 ± 0.544
AlaThr: 7.821 ± 0.934
AlaVal: 6.88 ± 0.743
AlaTrp: 2.317 ± 0.446
AlaTyr: 2.172 ± 0.39
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.869 ± 0.292
CysCys: 0.145 ± 0.112
CysAsp: 0.579 ± 0.246
CysGlu: 0.434 ± 0.208
CysPhe: 0.145 ± 0.097
CysGly: 1.231 ± 0.326
CysHis: 0.29 ± 0.155
CysIle: 0.362 ± 0.174
CysLys: 0.072 ± 0.091
CysLeu: 0.362 ± 0.147
CysMet: 0.145 ± 0.124
CysAsn: 0.145 ± 0.118
CysPro: 0.941 ± 0.359
CysGln: 0.362 ± 0.172
CysArg: 0.724 ± 0.226
CysSer: 0.652 ± 0.216
CysThr: 0.941 ± 0.353
CysVal: 0.507 ± 0.196
CysTrp: 0.362 ± 0.179
CysTyr: 0.217 ± 0.126
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.097 ± 0.615
AspCys: 0.29 ± 0.129
AspAsp: 6.59 ± 0.801
AspGlu: 4.417 ± 0.673
AspPhe: 1.593 ± 0.348
AspGly: 6.373 ± 0.624
AspHis: 1.666 ± 0.355
AspIle: 2.752 ± 0.326
AspLys: 1.81 ± 0.327
AspLeu: 7.459 ± 0.655
AspMet: 1.521 ± 0.362
AspAsn: 1.666 ± 0.413
AspPro: 5.142 ± 0.603
AspGln: 2.607 ± 0.517
AspArg: 4.345 ± 0.676
AspSer: 3.114 ± 0.386
AspThr: 3.693 ± 0.49
AspVal: 3.621 ± 0.668
AspTrp: 1.666 ± 0.4
AspTyr: 1.738 ± 0.403
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.155 ± 0.814
GluCys: 1.014 ± 0.37
GluAsp: 3.404 ± 0.563
GluGlu: 3.404 ± 0.497
GluPhe: 1.666 ± 0.32
GluGly: 3.331 ± 0.436
GluHis: 1.81 ± 0.384
GluIle: 2.679 ± 0.493
GluLys: 2.462 ± 0.42
GluLeu: 4.924 ± 0.733
GluMet: 1.303 ± 0.329
GluAsn: 1.303 ± 0.231
GluPro: 3.186 ± 0.605
GluGln: 2.245 ± 0.474
GluArg: 4.779 ± 0.706
GluSer: 3.114 ± 0.431
GluThr: 3.983 ± 0.5
GluVal: 4.128 ± 0.508
GluTrp: 1.376 ± 0.339
GluTyr: 1.159 ± 0.269
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.752 ± 0.423
PheCys: 0.507 ± 0.25
PheAsp: 2.1 ± 0.371
PheGlu: 1.231 ± 0.276
PhePhe: 0.797 ± 0.374
PheGly: 3.476 ± 0.394
PheHis: 0.145 ± 0.101
PheIle: 0.652 ± 0.251
PheLys: 1.014 ± 0.289
PheLeu: 1.521 ± 0.461
PheMet: 0.362 ± 0.16
PheAsn: 1.014 ± 0.32
PhePro: 0.941 ± 0.257
PheGln: 0.652 ± 0.248
PheArg: 2.317 ± 0.372
PheSer: 1.666 ± 0.429
PheThr: 1.666 ± 0.326
PheVal: 2.028 ± 0.413
PheTrp: 0.507 ± 0.216
PheTyr: 0.29 ± 0.163
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.255 ± 1.398
GlyCys: 1.303 ± 0.419
GlyAsp: 5.504 ± 0.634
GlyGlu: 4.128 ± 0.652
GlyPhe: 2.172 ± 0.375
GlyGly: 9.487 ± 1.678
GlyHis: 1.666 ± 0.43
GlyIle: 3.259 ± 0.452
GlyLys: 2.897 ± 0.473
GlyLeu: 6.3 ± 0.7
GlyMet: 2.535 ± 0.562
GlyAsn: 2.535 ± 0.333
GlyPro: 3.838 ± 0.5
GlyGln: 3.114 ± 0.403
GlyArg: 5.721 ± 0.574
GlySer: 4.707 ± 0.598
GlyThr: 7.024 ± 0.752
GlyVal: 6.083 ± 0.714
GlyTrp: 2.752 ± 0.354
GlyTyr: 2.535 ± 0.389
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.955 ± 0.339
HisCys: 0.145 ± 0.106
HisAsp: 0.724 ± 0.219
HisGlu: 1.303 ± 0.335
HisPhe: 0.362 ± 0.166
HisGly: 1.303 ± 0.276
HisHis: 0.507 ± 0.185
HisIle: 1.666 ± 0.356
HisLys: 0.29 ± 0.151
HisLeu: 1.955 ± 0.464
HisMet: 0.434 ± 0.195
HisAsn: 0.941 ± 0.273
HisPro: 1.376 ± 0.328
HisGln: 0.652 ± 0.252
HisArg: 1.666 ± 0.387
HisSer: 1.593 ± 0.329
HisThr: 1.521 ± 0.409
HisVal: 1.376 ± 0.333
HisTrp: 0.941 ± 0.277
HisTyr: 0.507 ± 0.166
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.445 ± 0.628
IleCys: 0.145 ± 0.139
IleAsp: 3.259 ± 0.419
IleGlu: 3.693 ± 0.744
IlePhe: 0.797 ± 0.243
IleGly: 3.983 ± 0.598
IleHis: 0.941 ± 0.305
IleIle: 1.955 ± 0.317
IleLys: 1.448 ± 0.356
IleLeu: 2.245 ± 0.429
IleMet: 0.652 ± 0.201
IleAsn: 1.014 ± 0.251
IlePro: 2.172 ± 0.433
IleGln: 1.376 ± 0.354
IleArg: 3.621 ± 0.533
IleSer: 2.317 ± 0.361
IleThr: 2.969 ± 0.424
IleVal: 3.91 ± 0.528
IleTrp: 0.652 ± 0.233
IleTyr: 0.724 ± 0.243
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.548 ± 0.677
LysCys: 0.434 ± 0.183
LysAsp: 2.028 ± 0.414
LysGlu: 1.303 ± 0.269
LysPhe: 0.724 ± 0.212
LysGly: 1.448 ± 0.299
LysHis: 0.724 ± 0.209
LysIle: 0.724 ± 0.236
LysLys: 1.448 ± 0.407
LysLeu: 3.041 ± 0.45
LysMet: 0.507 ± 0.177
LysAsn: 1.159 ± 0.307
LysPro: 2.679 ± 0.442
LysGln: 1.738 ± 0.41
LysArg: 2.172 ± 0.421
LysSer: 1.81 ± 0.368
LysThr: 2.535 ± 0.453
LysVal: 2.245 ± 0.373
LysTrp: 0.507 ± 0.183
LysTyr: 0.797 ± 0.228
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.342 ± 0.807
LeuCys: 0.797 ± 0.241
LeuAsp: 5.576 ± 0.581
LeuGlu: 4.779 ± 0.577
LeuPhe: 2.317 ± 0.35
LeuGly: 5.938 ± 0.92
LeuHis: 1.231 ± 0.37
LeuIle: 3.114 ± 0.471
LeuLys: 2.462 ± 0.54
LeuLeu: 5.793 ± 0.661
LeuMet: 1.955 ± 0.4
LeuAsn: 1.955 ± 0.462
LeuPro: 4.345 ± 0.576
LeuGln: 2.1 ± 0.332
LeuArg: 5.866 ± 0.639
LeuSer: 5.069 ± 0.75
LeuThr: 6.59 ± 0.653
LeuVal: 5.576 ± 0.745
LeuTrp: 1.303 ± 0.293
LeuTyr: 1.303 ± 0.284
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.824 ± 0.526
MetCys: 0.145 ± 0.12
MetAsp: 1.376 ± 0.345
MetGlu: 0.507 ± 0.204
MetPhe: 0.579 ± 0.206
MetGly: 1.883 ± 0.686
MetHis: 0.434 ± 0.177
MetIle: 1.303 ± 0.277
MetLys: 0.362 ± 0.144
MetLeu: 1.593 ± 0.348
MetMet: 0.362 ± 0.158
MetAsn: 0.869 ± 0.21
MetPro: 1.593 ± 0.348
MetGln: 1.376 ± 0.344
MetArg: 1.738 ± 0.379
MetSer: 2.245 ± 0.338
MetThr: 2.824 ± 0.414
MetVal: 0.724 ± 0.224
MetTrp: 0.434 ± 0.189
MetTyr: 0.362 ± 0.189
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.186 ± 0.467
AsnCys: 0.29 ± 0.143
AsnAsp: 2.028 ± 0.405
AsnGlu: 1.231 ± 0.207
AsnPhe: 0.941 ± 0.252
AsnGly: 3.041 ± 0.652
AsnHis: 0.869 ± 0.316
AsnIle: 1.014 ± 0.25
AsnLys: 0.724 ± 0.193
AsnLeu: 2.39 ± 0.584
AsnMet: 0.29 ± 0.167
AsnAsn: 1.014 ± 0.298
AsnPro: 2.897 ± 0.441
AsnGln: 0.941 ± 0.26
AsnArg: 2.245 ± 0.511
AsnSer: 1.376 ± 0.351
AsnThr: 2.1 ± 0.325
AsnVal: 1.81 ± 0.34
AsnTrp: 0.724 ± 0.238
AsnTyr: 0.941 ± 0.287
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.59 ± 0.546
ProCys: 0.579 ± 0.224
ProAsp: 4.779 ± 0.743
ProGlu: 3.693 ± 0.404
ProPhe: 1.738 ± 0.311
ProGly: 4.852 ± 0.5
ProHis: 1.159 ± 0.271
ProIle: 2.462 ± 0.443
ProLys: 2.607 ± 0.37
ProLeu: 3.983 ± 0.444
ProMet: 1.448 ± 0.284
ProAsn: 1.955 ± 0.373
ProPro: 2.607 ± 0.497
ProGln: 1.738 ± 0.318
ProArg: 3.548 ± 0.604
ProSer: 3.476 ± 0.477
ProThr: 2.607 ± 0.581
ProVal: 4.635 ± 0.697
ProTrp: 1.521 ± 0.364
ProTyr: 1.231 ± 0.328
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.345 ± 0.592
GlnCys: 0.072 ± 0.067
GlnAsp: 1.666 ± 0.361
GlnGlu: 1.883 ± 0.334
GlnPhe: 1.159 ± 0.217
GlnGly: 2.535 ± 0.427
GlnHis: 0.797 ± 0.233
GlnIle: 1.666 ± 0.28
GlnLys: 1.159 ± 0.275
GlnLeu: 3.041 ± 0.507
GlnMet: 1.231 ± 0.279
GlnAsn: 0.797 ± 0.219
GlnPro: 2.245 ± 0.341
GlnGln: 2.028 ± 0.348
GlnArg: 3.404 ± 0.436
GlnSer: 2.028 ± 0.441
GlnThr: 2.1 ± 0.405
GlnVal: 2.028 ± 0.318
GlnTrp: 0.507 ± 0.18
GlnTyr: 0.724 ± 0.227
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.604 ± 0.785
ArgCys: 0.797 ± 0.242
ArgAsp: 4.707 ± 0.756
ArgGlu: 4.055 ± 0.609
ArgPhe: 2.679 ± 0.326
ArgGly: 5.142 ± 0.6
ArgHis: 2.607 ± 0.453
ArgIle: 3.983 ± 0.627
ArgLys: 2.824 ± 0.416
ArgLeu: 5.576 ± 0.72
ArgMet: 1.81 ± 0.371
ArgAsn: 2.172 ± 0.373
ArgPro: 4.562 ± 0.635
ArgGln: 3.041 ± 0.559
ArgArg: 7.749 ± 1.237
ArgSer: 4.345 ± 0.518
ArgThr: 4.49 ± 0.548
ArgVal: 4.128 ± 0.495
ArgTrp: 2.1 ± 0.52
ArgTyr: 1.521 ± 0.29
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.445 ± 0.93
SerCys: 0.362 ± 0.223
SerAsp: 4.273 ± 0.56
SerGlu: 3.548 ± 0.662
SerPhe: 1.883 ± 0.315
SerGly: 5.648 ± 0.713
SerHis: 1.159 ± 0.303
SerIle: 3.041 ± 0.526
SerLys: 1.014 ± 0.267
SerLeu: 4.417 ± 0.613
SerMet: 2.245 ± 0.347
SerAsn: 1.231 ± 0.316
SerPro: 3.331 ± 0.603
SerGln: 2.028 ± 0.397
SerArg: 4.707 ± 0.627
SerSer: 4.128 ± 0.815
SerThr: 3.838 ± 0.692
SerVal: 3.404 ± 0.483
SerTrp: 1.231 ± 0.344
SerTyr: 1.014 ± 0.342
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.269 ± 0.905
ThrCys: 0.869 ± 0.293
ThrAsp: 4.128 ± 0.47
ThrGlu: 3.476 ± 0.503
ThrPhe: 1.231 ± 0.309
ThrGly: 7.024 ± 0.812
ThrHis: 1.521 ± 0.364
ThrIle: 3.766 ± 0.709
ThrLys: 1.883 ± 0.258
ThrLeu: 5.576 ± 0.64
ThrMet: 1.086 ± 0.272
ThrAsn: 2.317 ± 0.473
ThrPro: 4.273 ± 0.544
ThrGln: 2.245 ± 0.384
ThrArg: 3.838 ± 0.59
ThrSer: 3.983 ± 0.655
ThrThr: 5.359 ± 0.684
ThrVal: 5.359 ± 0.769
ThrTrp: 1.666 ± 0.319
ThrTyr: 1.593 ± 0.326
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.111 ± 0.901
ValCys: 0.652 ± 0.227
ValAsp: 4.707 ± 0.623
ValGlu: 4.49 ± 0.629
ValPhe: 1.666 ± 0.323
ValGly: 4.49 ± 0.49
ValHis: 0.941 ± 0.281
ValIle: 2.824 ± 0.483
ValLys: 1.955 ± 0.406
ValLeu: 4.707 ± 0.612
ValMet: 1.593 ± 0.303
ValAsn: 1.883 ± 0.377
ValPro: 3.766 ± 0.532
ValGln: 1.955 ± 0.417
ValArg: 4.997 ± 0.658
ValSer: 4.635 ± 0.544
ValThr: 5.648 ± 0.673
ValVal: 4.924 ± 0.684
ValTrp: 1.448 ± 0.325
ValTyr: 1.521 ± 0.371
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.172 ± 0.489
TrpCys: 0.072 ± 0.084
TrpAsp: 2.028 ± 0.384
TrpGlu: 0.724 ± 0.227
TrpPhe: 0.29 ± 0.156
TrpGly: 1.376 ± 0.243
TrpHis: 0.29 ± 0.153
TrpIle: 1.521 ± 0.307
TrpLys: 0.652 ± 0.255
TrpLeu: 1.955 ± 0.413
TrpMet: 0.434 ± 0.198
TrpAsn: 1.086 ± 0.252
TrpPro: 1.086 ± 0.287
TrpGln: 1.159 ± 0.314
TrpArg: 2.1 ± 0.46
TrpSer: 1.81 ± 0.412
TrpThr: 1.521 ± 0.377
TrpVal: 1.448 ± 0.267
TrpTrp: 0.507 ± 0.221
TrpTyr: 0.507 ± 0.172
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.317 ± 0.358
TyrCys: 0.217 ± 0.127
TyrAsp: 1.738 ± 0.372
TyrGlu: 1.448 ± 0.361
TyrPhe: 0.507 ± 0.165
TyrGly: 2.317 ± 0.462
TyrHis: 0.507 ± 0.171
TyrIle: 0.797 ± 0.253
TyrLys: 0.507 ± 0.216
TyrLeu: 1.448 ± 0.26
TyrMet: 0.434 ± 0.156
TyrAsn: 0.797 ± 0.244
TyrPro: 0.724 ± 0.244
TyrGln: 0.797 ± 0.299
TyrArg: 1.303 ± 0.351
TyrSer: 1.376 ± 0.264
TyrThr: 1.231 ± 0.353
TyrVal: 2.1 ± 0.394
TyrTrp: 0.29 ± 0.136
TyrTyr: 0.434 ± 0.176
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13810 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
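A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The toy sequences, the function names, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions; the page does not say which spread statistic the database uses.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent residue pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences for illustration only (not taken from the Lucky10 proteome).
toy = ["MAAG", "AAK", "MKKL", "GAAL"]
value = pair_frequency(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

A dipeptide that never occurs has a frequency of 0 in every resample, so its estimated error is also 0, which is consistent with the 0.0 ± 0.0 entries for Xaa in the table.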

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski