Amino acid dipeptide frequency for Bacillus phage Troll

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
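As an illustration of the calculation described above, the following is a minimal sketch of how dipeptide frequencies in per mille could be computed from a list of protein sequences and compared with the uniform 2.5‰ baseline. The function names, the toy sequences, and the normalization by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not stated here.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Hypothetical helper: normalizes by the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input, not real phage proteins.
toy = ["MKKLAE", "MKEE"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille ({classify(value)})")
```

With the table values, `classify(9.429)` (GluGlu) gives "over" and `classify(0.159)` (CysCys) gives "under" relative to this baseline.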

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.09 ± 0.424
AlaCys: 0.458 ± 0.113
AlaAsp: 3.489 ± 0.276
AlaGlu: 3.967 ± 0.267
AlaPhe: 2.452 ± 0.205
AlaGly: 3.489 ± 0.38
AlaHis: 1.076 ± 0.159
AlaIle: 3.688 ± 0.279
AlaLys: 4.406 ± 0.306
AlaLeu: 4.964 ± 0.371
AlaMet: 1.854 ± 0.241
AlaAsn: 3.19 ± 0.343
AlaPro: 2.273 ± 0.284
AlaGln: 2.193 ± 0.232
AlaArg: 2.631 ± 0.233
AlaSer: 3.369 ± 0.333
AlaThr: 3.847 ± 0.397
AlaVal: 3.608 ± 0.283
AlaTrp: 0.698 ± 0.114
AlaTyr: 2.572 ± 0.226
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.359 ± 0.093
CysCys: 0.159 ± 0.055
CysAsp: 0.797 ± 0.147
CysGlu: 0.618 ± 0.122
CysPhe: 0.259 ± 0.068
CysGly: 0.797 ± 0.137
CysHis: 0.199 ± 0.058
CysIle: 0.478 ± 0.093
CysLys: 0.897 ± 0.147
CysLeu: 0.578 ± 0.115
CysMet: 0.239 ± 0.062
CysAsn: 0.439 ± 0.105
CysPro: 0.379 ± 0.09
CysGln: 0.199 ± 0.062
CysArg: 0.299 ± 0.093
CysSer: 0.458 ± 0.102
CysThr: 0.478 ± 0.09
CysVal: 0.538 ± 0.112
CysTrp: 0.14 ± 0.06
CysTyr: 0.578 ± 0.102
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.489 ± 0.28
AspCys: 0.498 ± 0.114
AspAsp: 3.788 ± 0.318
AspGlu: 5.063 ± 0.392
AspPhe: 3.01 ± 0.236
AspGly: 3.827 ± 0.364
AspHis: 0.618 ± 0.131
AspIle: 5.362 ± 0.351
AspLys: 5.98 ± 0.404
AspLeu: 4.904 ± 0.297
AspMet: 2.133 ± 0.248
AspAsn: 3.409 ± 0.328
AspPro: 1.495 ± 0.213
AspGln: 0.758 ± 0.135
AspArg: 2.791 ± 0.267
AspSer: 3.309 ± 0.238
AspThr: 3.508 ± 0.28
AspVal: 4.545 ± 0.302
AspTrp: 1.037 ± 0.143
AspTyr: 4.126 ± 0.338
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.366 ± 0.305
GluCys: 0.638 ± 0.115
GluAsp: 5.143 ± 0.403
GluGlu: 9.429 ± 0.808
GluPhe: 3.389 ± 0.255
GluGly: 4.744 ± 0.28
GluHis: 1.575 ± 0.23
GluIle: 5.582 ± 0.405
GluLys: 6.299 ± 0.405
GluLeu: 9.19 ± 0.529
GluMet: 2.751 ± 0.266
GluAsn: 4.326 ± 0.326
GluPro: 1.993 ± 0.296
GluGln: 3.209 ± 0.337
GluArg: 3.469 ± 0.285
GluSer: 3.628 ± 0.337
GluThr: 3.907 ± 0.282
GluVal: 6.259 ± 0.472
GluTrp: 1.176 ± 0.149
GluTyr: 3.548 ± 0.261
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.193 ± 0.199
PheCys: 0.439 ± 0.107
PheAsp: 2.97 ± 0.243
PheGlu: 2.751 ± 0.24
PhePhe: 1.296 ± 0.167
PheGly: 2.053 ± 0.246
PheHis: 0.817 ± 0.124
PheIle: 2.99 ± 0.277
PheLys: 3.03 ± 0.221
PheLeu: 3.09 ± 0.244
PheMet: 0.977 ± 0.145
PheAsn: 2.512 ± 0.221
PhePro: 1.276 ± 0.169
PheGln: 1.196 ± 0.149
PheArg: 1.575 ± 0.176
PheSer: 2.512 ± 0.22
PheThr: 2.831 ± 0.251
PheVal: 2.731 ± 0.257
PheTrp: 0.359 ± 0.086
PheTyr: 1.834 ± 0.219
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.588 ± 0.476
GlyCys: 0.738 ± 0.148
GlyAsp: 3.827 ± 0.32
GlyGlu: 4.984 ± 0.341
GlyPhe: 2.492 ± 0.235
GlyGly: 5.402 ± 0.812
GlyHis: 1.296 ± 0.18
GlyIle: 4.067 ± 0.346
GlyLys: 5.422 ± 0.351
GlyLeu: 4.944 ± 0.329
GlyMet: 1.774 ± 0.206
GlyAsn: 3.229 ± 0.27
GlyPro: 0.0 ± 0.0
GlyGln: 1.894 ± 0.359
GlyArg: 2.591 ± 0.226
GlySer: 3.867 ± 0.368
GlyThr: 3.967 ± 0.381
GlyVal: 4.784 ± 0.406
GlyTrp: 0.917 ± 0.135
GlyTyr: 3.548 ± 0.296
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.897 ± 0.141
HisCys: 0.14 ± 0.051
HisAsp: 1.236 ± 0.154
HisGlu: 1.395 ± 0.158
HisPhe: 0.857 ± 0.136
HisGly: 0.797 ± 0.129
HisHis: 0.439 ± 0.117
HisIle: 1.076 ± 0.148
HisLys: 1.415 ± 0.199
HisLeu: 1.675 ± 0.198
HisMet: 0.518 ± 0.102
HisAsn: 1.296 ± 0.18
HisPro: 0.598 ± 0.112
HisGln: 0.578 ± 0.115
HisArg: 0.817 ± 0.127
HisSer: 0.937 ± 0.171
HisThr: 1.096 ± 0.136
HisVal: 1.535 ± 0.206
HisTrp: 0.199 ± 0.065
HisTyr: 0.817 ± 0.154
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.286 ± 0.313
IleCys: 0.518 ± 0.104
IleAsp: 4.764 ± 0.322
IleGlu: 5.941 ± 0.387
IlePhe: 2.013 ± 0.165
IleGly: 3.768 ± 0.29
IleHis: 0.997 ± 0.136
IleIle: 4.047 ± 0.336
IleLys: 6.16 ± 0.35
IleLeu: 4.824 ± 0.297
IleMet: 1.934 ± 0.218
IleAsn: 4.166 ± 0.352
IlePro: 2.153 ± 0.186
IleGln: 2.133 ± 0.203
IleArg: 2.591 ± 0.232
IleSer: 3.768 ± 0.277
IleThr: 4.465 ± 0.382
IleVal: 4.146 ± 0.266
IleTrp: 0.439 ± 0.088
IleTyr: 2.631 ± 0.235
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.362 ± 0.345
LysCys: 0.698 ± 0.132
LysAsp: 5.362 ± 0.354
LysGlu: 8.652 ± 0.547
LysPhe: 2.93 ± 0.228
LysGly: 5.821 ± 0.38
LysHis: 1.734 ± 0.202
LysIle: 4.186 ± 0.269
LysLys: 7.535 ± 0.454
LysLeu: 6.22 ± 0.324
LysMet: 2.671 ± 0.218
LysAsn: 4.645 ± 0.322
LysPro: 2.412 ± 0.245
LysGln: 3.588 ± 0.262
LysArg: 3.15 ± 0.282
LysSer: 4.206 ± 0.351
LysThr: 4.146 ± 0.296
LysVal: 5.821 ± 0.411
LysTrp: 1.017 ± 0.129
LysTyr: 3.528 ± 0.301
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.605 ± 0.328
LeuCys: 0.698 ± 0.112
LeuAsp: 5.781 ± 0.357
LeuGlu: 7.515 ± 0.454
LeuPhe: 3.209 ± 0.264
LeuGly: 4.804 ± 0.318
LeuHis: 1.774 ± 0.186
LeuIle: 4.665 ± 0.396
LeuLys: 5.841 ± 0.349
LeuLeu: 5.741 ± 0.416
LeuMet: 2.173 ± 0.24
LeuAsn: 4.186 ± 0.239
LeuPro: 2.831 ± 0.263
LeuGln: 2.97 ± 0.272
LeuArg: 3.748 ± 0.376
LeuSer: 5.143 ± 0.339
LeuThr: 5.362 ± 0.312
LeuVal: 5.661 ± 0.376
LeuTrp: 0.897 ± 0.137
LeuTyr: 3.389 ± 0.315
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.834 ± 0.181
MetCys: 0.199 ± 0.062
MetAsp: 1.555 ± 0.154
MetGlu: 2.253 ± 0.208
MetPhe: 1.216 ± 0.178
MetGly: 1.435 ± 0.202
MetHis: 0.458 ± 0.099
MetIle: 1.694 ± 0.205
MetLys: 3.209 ± 0.275
MetLeu: 2.352 ± 0.185
MetMet: 0.598 ± 0.112
MetAsn: 1.894 ± 0.203
MetPro: 0.478 ± 0.101
MetGln: 1.037 ± 0.148
MetArg: 1.256 ± 0.146
MetSer: 1.974 ± 0.204
MetThr: 2.292 ± 0.257
MetVal: 1.575 ± 0.194
MetTrp: 0.399 ± 0.089
MetTyr: 1.256 ± 0.162
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.528 ± 0.268
AsnCys: 0.538 ± 0.153
AsnAsp: 2.97 ± 0.214
AsnGlu: 3.867 ± 0.301
AsnPhe: 1.455 ± 0.177
AsnGly: 4.406 ± 0.359
AsnHis: 0.917 ± 0.137
AsnIle: 3.469 ± 0.255
AsnLys: 4.984 ± 0.314
AsnLeu: 4.107 ± 0.234
AsnMet: 2.033 ± 0.176
AsnAsn: 3.13 ± 0.326
AsnPro: 2.213 ± 0.263
AsnGln: 1.834 ± 0.188
AsnArg: 2.811 ± 0.24
AsnSer: 2.691 ± 0.263
AsnThr: 3.349 ± 0.338
AsnVal: 3.847 ± 0.279
AsnTrp: 0.758 ± 0.139
AsnTyr: 2.552 ± 0.199
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.834 ± 0.295
ProCys: 0.199 ± 0.054
ProAsp: 1.774 ± 0.215
ProGlu: 2.891 ± 0.268
ProPhe: 1.395 ± 0.179
ProGly: 0.758 ± 0.122
ProHis: 0.638 ± 0.109
ProIle: 1.894 ± 0.198
ProLys: 2.093 ± 0.2
ProLeu: 2.193 ± 0.2
ProMet: 0.777 ± 0.145
ProAsn: 1.635 ± 0.19
ProPro: 0.877 ± 0.197
ProGln: 1.017 ± 0.18
ProArg: 1.116 ± 0.14
ProSer: 1.495 ± 0.198
ProThr: 2.213 ± 0.208
ProVal: 2.193 ± 0.24
ProTrp: 0.259 ± 0.069
ProTyr: 1.655 ± 0.177
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.651 ± 0.314
GlnCys: 0.239 ± 0.074
GlnAsp: 2.013 ± 0.207
GlnGlu: 2.691 ± 0.213
GlnPhe: 0.937 ± 0.132
GlnGly: 2.352 ± 0.252
GlnHis: 0.698 ± 0.117
GlnIle: 2.193 ± 0.207
GlnLys: 2.591 ± 0.257
GlnLeu: 2.831 ± 0.275
GlnMet: 0.957 ± 0.151
GlnAsn: 1.655 ± 0.215
GlnPro: 1.136 ± 0.175
GlnGln: 1.595 ± 0.312
GlnArg: 1.356 ± 0.177
GlnSer: 1.914 ± 0.223
GlnThr: 1.854 ± 0.208
GlnVal: 2.133 ± 0.184
GlnTrp: 0.458 ± 0.092
GlnTyr: 1.435 ± 0.177
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.073 ± 0.196
ArgCys: 0.359 ± 0.101
ArgAsp: 2.631 ± 0.278
ArgGlu: 3.987 ± 0.297
ArgPhe: 2.213 ± 0.217
ArgGly: 2.851 ± 0.243
ArgHis: 0.618 ± 0.128
ArgIle: 3.09 ± 0.284
ArgLys: 2.871 ± 0.286
ArgLeu: 4.087 ± 0.28
ArgMet: 1.236 ± 0.143
ArgAsn: 2.053 ± 0.173
ArgPro: 0.957 ± 0.112
ArgGln: 1.595 ± 0.167
ArgArg: 1.575 ± 0.198
ArgSer: 1.694 ± 0.169
ArgThr: 2.093 ± 0.175
ArgVal: 2.831 ± 0.222
ArgTrp: 0.498 ± 0.12
ArgTyr: 2.213 ± 0.231
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.289 ± 0.306
SerCys: 0.439 ± 0.112
SerAsp: 3.289 ± 0.278
SerGlu: 4.067 ± 0.313
SerPhe: 2.831 ± 0.274
SerGly: 3.588 ± 0.303
SerHis: 0.897 ± 0.143
SerIle: 3.907 ± 0.265
SerLys: 4.485 ± 0.339
SerLeu: 4.565 ± 0.262
SerMet: 1.435 ± 0.159
SerAsn: 2.851 ± 0.299
SerPro: 1.754 ± 0.204
SerGln: 1.356 ± 0.166
SerArg: 2.153 ± 0.214
SerSer: 3.07 ± 0.319
SerThr: 3.668 ± 0.347
SerVal: 3.508 ± 0.303
SerTrp: 0.638 ± 0.115
SerTyr: 2.831 ± 0.259
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.13 ± 0.29
ThrCys: 0.478 ± 0.101
ThrAsp: 3.588 ± 0.277
ThrGlu: 4.007 ± 0.243
ThrPhe: 2.671 ± 0.277
ThrGly: 4.346 ± 0.377
ThrHis: 1.037 ± 0.121
ThrIle: 4.346 ± 0.32
ThrLys: 5.063 ± 0.294
ThrLeu: 5.063 ± 0.34
ThrMet: 1.356 ± 0.183
ThrAsn: 3.449 ± 0.336
ThrPro: 2.292 ± 0.213
ThrGln: 2.193 ± 0.186
ThrArg: 2.532 ± 0.233
ThrSer: 3.429 ± 0.36
ThrThr: 3.449 ± 0.28
ThrVal: 5.123 ± 0.437
ThrTrp: 0.678 ± 0.123
ThrTyr: 3.209 ± 0.262
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.887 ± 0.293
ValCys: 0.658 ± 0.115
ValAsp: 4.665 ± 0.298
ValGlu: 6.04 ± 0.44
ValPhe: 2.871 ± 0.232
ValGly: 4.306 ± 0.282
ValHis: 1.276 ± 0.182
ValIle: 4.306 ± 0.292
ValLys: 6.06 ± 0.354
ValLeu: 5.402 ± 0.366
ValMet: 1.754 ± 0.204
ValAsn: 3.728 ± 0.383
ValPro: 2.332 ± 0.204
ValGln: 2.093 ± 0.183
ValArg: 2.512 ± 0.22
ValSer: 3.887 ± 0.342
ValThr: 5.143 ± 0.409
ValVal: 4.744 ± 0.313
ValTrp: 0.658 ± 0.11
ValTyr: 3.369 ± 0.29
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.478 ± 0.109
TrpCys: 0.199 ± 0.062
TrpAsp: 0.877 ± 0.128
TrpGlu: 1.017 ± 0.132
TrpPhe: 0.478 ± 0.106
TrpGly: 0.837 ± 0.143
TrpHis: 0.399 ± 0.089
TrpIle: 0.758 ± 0.122
TrpLys: 0.837 ± 0.142
TrpLeu: 0.797 ± 0.152
TrpMet: 0.399 ± 0.092
TrpAsn: 0.718 ± 0.149
TrpPro: 0.0 ± 0.0
TrpGln: 0.259 ± 0.082
TrpArg: 0.498 ± 0.101
TrpSer: 0.738 ± 0.139
TrpThr: 0.638 ± 0.121
TrpVal: 1.116 ± 0.153
TrpTrp: 0.14 ± 0.055
TrpTyr: 0.638 ± 0.103
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.372 ± 0.209
TyrCys: 0.598 ± 0.132
TyrAsp: 3.13 ± 0.281
TyrGlu: 3.508 ± 0.259
TyrPhe: 1.455 ± 0.174
TyrGly: 2.95 ± 0.281
TyrHis: 0.857 ± 0.128
TyrIle: 3.947 ± 0.287
TyrLys: 4.505 ± 0.328
TyrLeu: 3.469 ± 0.288
TyrMet: 1.375 ± 0.161
TyrAsn: 2.95 ± 0.236
TyrPro: 1.455 ± 0.194
TyrGln: 1.993 ± 0.229
TyrArg: 2.053 ± 0.169
TyrSer: 2.532 ± 0.237
TyrThr: 3.13 ± 0.274
TyrVal: 2.99 ± 0.255
TyrTrp: 0.458 ± 0.101
TyrTyr: 2.173 ± 0.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 289 proteins (50165 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
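To make the note concrete, here is a minimal sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the fixed seed, the use of the standard deviation as the error measure, and the toy sequences are assumptions for illustration; the exact procedure used by the database may differ.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide's frequency in per mille.

    Hypothetical helper: resamples whole proteins, not individual residues,
    so each replicate contains as many proteins as the original set.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKKLAE", "MKEE", "AAKK", "MKLV"]
avg, err = bootstrap_permille(toy, "MK")
print(f"MK: {avg:.1f} ± {err:.1f} per mille")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual positions.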

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski