Amino acid dipepetide frequency for Bacillus phage pW2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.588AlaCys: 0.588 ± 0.137
3.308AlaAsp: 3.308 ± 0.356
4.754AlaGlu: 4.754 ± 0.439
2.352AlaPhe: 2.352 ± 0.253
3.455AlaGly: 3.455 ± 0.474
1.25AlaHis: 1.25 ± 0.183
4.484AlaIle: 4.484 ± 0.344
5.489AlaLys: 5.489 ± 0.491
4.876AlaLeu: 4.876 ± 0.334
1.936AlaMet: 1.936 ± 0.296
3.774AlaAsn: 3.774 ± 0.379
1.47AlaPro: 1.47 ± 0.182
2.156AlaGln: 2.156 ± 0.43
1.862AlaArg: 1.862 ± 0.177
3.651AlaSer: 3.651 ± 0.521
3.896AlaThr: 3.896 ± 0.478
3.651AlaVal: 3.651 ± 0.315
0.539AlaTrp: 0.539 ± 0.107
2.279AlaTyr: 2.279 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.515CysAla: 0.515 ± 0.112
0.049CysCys: 0.049 ± 0.04
0.686CysAsp: 0.686 ± 0.163
0.515CysGlu: 0.515 ± 0.122
0.221CysPhe: 0.221 ± 0.076
0.613CysGly: 0.613 ± 0.118
0.196CysHis: 0.196 ± 0.066
0.49CysIle: 0.49 ± 0.107
0.784CysLys: 0.784 ± 0.156
0.392CysLeu: 0.392 ± 0.142
0.343CysMet: 0.343 ± 0.1
0.319CysAsn: 0.319 ± 0.081
0.368CysPro: 0.368 ± 0.109
0.245CysGln: 0.245 ± 0.1
0.515CysArg: 0.515 ± 0.122
0.294CysSer: 0.294 ± 0.093
0.441CysThr: 0.441 ± 0.092
0.613CysVal: 0.613 ± 0.135
0.147CysTrp: 0.147 ± 0.058
0.539CysTyr: 0.539 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.994AspAla: 3.994 ± 0.378
0.858AspCys: 0.858 ± 0.132
3.749AspAsp: 3.749 ± 0.326
6.469AspGlu: 6.469 ± 0.389
3.088AspPhe: 3.088 ± 0.244
5.023AspGly: 5.023 ± 0.358
0.637AspHis: 0.637 ± 0.122
4.778AspIle: 4.778 ± 0.347
5.71AspLys: 5.71 ± 0.378
5.195AspLeu: 5.195 ± 0.377
1.911AspMet: 1.911 ± 0.232
3.725AspAsn: 3.725 ± 0.275
0.907AspPro: 0.907 ± 0.168
1.152AspGln: 1.152 ± 0.181
2.254AspArg: 2.254 ± 0.3
3.308AspSer: 3.308 ± 0.271
3.823AspThr: 3.823 ± 0.345
4.411AspVal: 4.411 ± 0.319
0.784AspTrp: 0.784 ± 0.141
3.357AspTyr: 3.357 ± 0.305
0.0AspXaa: 0.0 ± 0.0
Glu
4.558GluAla: 4.558 ± 0.485
0.613GluCys: 0.613 ± 0.153
5.759GluAsp: 5.759 ± 0.436
7.67GluGlu: 7.67 ± 0.609
3.21GluPhe: 3.21 ± 0.26
4.117GluGly: 4.117 ± 0.348
1.397GluHis: 1.397 ± 0.179
6.861GluIle: 6.861 ± 0.437
6.886GluLys: 6.886 ± 0.429
7.67GluLeu: 7.67 ± 0.382
2.597GluMet: 2.597 ± 0.192
4.656GluAsn: 4.656 ± 0.377
1.274GluPro: 1.274 ± 0.185
3.455GluGln: 3.455 ± 0.323
3.406GluArg: 3.406 ± 0.308
3.97GluSer: 3.97 ± 0.299
4.166GluThr: 4.166 ± 0.285
5.661GluVal: 5.661 ± 0.456
1.103GluTrp: 1.103 ± 0.164
3.676GluTyr: 3.676 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
2.181PheAla: 2.181 ± 0.184
0.245PheCys: 0.245 ± 0.08
3.455PheAsp: 3.455 ± 0.316
3.651PheGlu: 3.651 ± 0.348
1.887PhePhe: 1.887 ± 0.288
3.088PheGly: 3.088 ± 0.293
0.98PheHis: 0.98 ± 0.158
2.818PheIle: 2.818 ± 0.354
3.7PheLys: 3.7 ± 0.322
2.941PheLeu: 2.941 ± 0.298
1.152PheMet: 1.152 ± 0.195
2.401PheAsn: 2.401 ± 0.236
0.711PhePro: 0.711 ± 0.146
1.078PheGln: 1.078 ± 0.153
1.617PheArg: 1.617 ± 0.214
2.597PheSer: 2.597 ± 0.234
2.867PheThr: 2.867 ± 0.304
2.573PheVal: 2.573 ± 0.292
0.539PheTrp: 0.539 ± 0.151
1.691PheTyr: 1.691 ± 0.244
0.0PheXaa: 0.0 ± 0.0
Gly
3.725GlyAla: 3.725 ± 0.515
0.564GlyCys: 0.564 ± 0.118
3.627GlyAsp: 3.627 ± 0.301
4.239GlyGlu: 4.239 ± 0.298
2.843GlyPhe: 2.843 ± 0.291
4.19GlyGly: 4.19 ± 0.447
0.98GlyHis: 0.98 ± 0.179
4.68GlyIle: 4.68 ± 0.348
6.028GlyLys: 6.028 ± 0.347
5.146GlyLeu: 5.146 ± 0.316
1.838GlyMet: 1.838 ± 0.238
3.7GlyAsn: 3.7 ± 0.404
0.0GlyPro: 0.0 ± 0.0
2.254GlyGln: 2.254 ± 0.325
2.843GlyArg: 2.843 ± 0.296
3.749GlySer: 3.749 ± 0.402
3.872GlyThr: 3.872 ± 0.407
4.411GlyVal: 4.411 ± 0.322
0.858GlyTrp: 0.858 ± 0.162
3.21GlyTyr: 3.21 ± 0.278
0.0GlyXaa: 0.0 ± 0.0
His
0.784HisAla: 0.784 ± 0.141
0.319HisCys: 0.319 ± 0.093
1.421HisAsp: 1.421 ± 0.227
1.519HisGlu: 1.519 ± 0.247
0.76HisPhe: 0.76 ± 0.154
1.054HisGly: 1.054 ± 0.146
0.539HisHis: 0.539 ± 0.14
1.372HisIle: 1.372 ± 0.216
1.544HisLys: 1.544 ± 0.26
1.544HisLeu: 1.544 ± 0.184
0.539HisMet: 0.539 ± 0.127
0.98HisAsn: 0.98 ± 0.169
0.76HisPro: 0.76 ± 0.13
0.441HisGln: 0.441 ± 0.095
0.858HisArg: 0.858 ± 0.14
1.029HisSer: 1.029 ± 0.173
1.176HisThr: 1.176 ± 0.173
1.421HisVal: 1.421 ± 0.228
0.049HisTrp: 0.049 ± 0.037
0.833HisTyr: 0.833 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
3.725IleAla: 3.725 ± 0.292
0.294IleCys: 0.294 ± 0.085
5.685IleAsp: 5.685 ± 0.379
6.102IleGlu: 6.102 ± 0.446
2.205IlePhe: 2.205 ± 0.258
3.749IleGly: 3.749 ± 0.3
1.593IleHis: 1.593 ± 0.196
4.264IleIle: 4.264 ± 0.394
6.69IleLys: 6.69 ± 0.337
4.778IleLeu: 4.778 ± 0.414
1.715IleMet: 1.715 ± 0.229
3.945IleAsn: 3.945 ± 0.376
2.009IlePro: 2.009 ± 0.236
2.524IleGln: 2.524 ± 0.254
2.965IleArg: 2.965 ± 0.284
4.117IleSer: 4.117 ± 0.338
4.509IleThr: 4.509 ± 0.387
4.533IleVal: 4.533 ± 0.436
0.515IleTrp: 0.515 ± 0.14
2.843IleTyr: 2.843 ± 0.297
0.0IleXaa: 0.0 ± 0.0
Lys
5.489LysAla: 5.489 ± 0.619
0.76LysCys: 0.76 ± 0.137
5.661LysAsp: 5.661 ± 0.401
8.43LysGlu: 8.43 ± 0.563
4.019LysPhe: 4.019 ± 0.302
5.342LysGly: 5.342 ± 0.336
1.715LysHis: 1.715 ± 0.234
5.562LysIle: 5.562 ± 0.308
9.312LysLys: 9.312 ± 0.737
7.327LysLeu: 7.327 ± 0.428
3.137LysMet: 3.137 ± 0.315
4.607LysAsn: 4.607 ± 0.336
2.377LysPro: 2.377 ± 0.296
3.284LysGln: 3.284 ± 0.327
4.019LysArg: 4.019 ± 0.371
4.631LysSer: 4.631 ± 0.371
4.656LysThr: 4.656 ± 0.295
6.028LysVal: 6.028 ± 0.363
1.054LysTrp: 1.054 ± 0.15
3.798LysTyr: 3.798 ± 0.281
0.0LysXaa: 0.0 ± 0.0
Leu
4.778LeuAla: 4.778 ± 0.391
0.564LeuCys: 0.564 ± 0.117
5.562LeuAsp: 5.562 ± 0.421
6.004LeuGlu: 6.004 ± 0.426
2.965LeuPhe: 2.965 ± 0.27
5.121LeuGly: 5.121 ± 0.339
1.691LeuHis: 1.691 ± 0.229
4.435LeuIle: 4.435 ± 0.411
7.474LeuLys: 7.474 ± 0.427
5.881LeuLeu: 5.881 ± 0.587
2.426LeuMet: 2.426 ± 0.268
4.705LeuAsn: 4.705 ± 0.324
2.352LeuPro: 2.352 ± 0.228
3.21LeuGln: 3.21 ± 0.295
3.014LeuArg: 3.014 ± 0.312
4.386LeuSer: 4.386 ± 0.314
5.097LeuThr: 5.097 ± 0.334
4.754LeuVal: 4.754 ± 0.386
1.029LeuTrp: 1.029 ± 0.166
3.21LeuTyr: 3.21 ± 0.29
0.0LeuXaa: 0.0 ± 0.0
Met
1.985MetAla: 1.985 ± 0.26
0.196MetCys: 0.196 ± 0.063
1.642MetAsp: 1.642 ± 0.21
2.009MetGlu: 2.009 ± 0.265
1.348MetPhe: 1.348 ± 0.211
1.789MetGly: 1.789 ± 0.26
0.539MetHis: 0.539 ± 0.139
1.96MetIle: 1.96 ± 0.24
3.431MetLys: 3.431 ± 0.314
2.426MetLeu: 2.426 ± 0.289
1.029MetMet: 1.029 ± 0.184
2.181MetAsn: 2.181 ± 0.232
0.662MetPro: 0.662 ± 0.133
1.127MetGln: 1.127 ± 0.166
1.029MetArg: 1.029 ± 0.154
1.96MetSer: 1.96 ± 0.299
2.058MetThr: 2.058 ± 0.248
1.421MetVal: 1.421 ± 0.188
0.221MetTrp: 0.221 ± 0.098
1.201MetTyr: 1.201 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.48AsnAla: 3.48 ± 0.271
0.392AsnCys: 0.392 ± 0.1
2.843AsnAsp: 2.843 ± 0.319
4.264AsnGlu: 4.264 ± 0.338
2.279AsnPhe: 2.279 ± 0.252
4.901AsnGly: 4.901 ± 0.443
1.078AsnHis: 1.078 ± 0.155
4.043AsnIle: 4.043 ± 0.319
5.366AsnLys: 5.366 ± 0.403
4.729AsnLeu: 4.729 ± 0.366
1.936AsnMet: 1.936 ± 0.246
3.578AsnAsn: 3.578 ± 0.325
1.74AsnPro: 1.74 ± 0.275
2.328AsnGln: 2.328 ± 0.191
2.818AsnArg: 2.818 ± 0.248
2.99AsnSer: 2.99 ± 0.264
3.357AsnThr: 3.357 ± 0.254
3.48AsnVal: 3.48 ± 0.28
0.588AsnTrp: 0.588 ± 0.116
2.622AsnTyr: 2.622 ± 0.264
0.0AsnXaa: 0.0 ± 0.0
Pro
1.274ProAla: 1.274 ± 0.182
0.196ProCys: 0.196 ± 0.087
1.642ProAsp: 1.642 ± 0.193
2.23ProGlu: 2.23 ± 0.299
1.005ProPhe: 1.005 ± 0.189
0.0ProGly: 0.0 ± 0.0
0.637ProHis: 0.637 ± 0.132
1.691ProIle: 1.691 ± 0.236
1.936ProLys: 1.936 ± 0.242
1.862ProLeu: 1.862 ± 0.241
0.294ProMet: 0.294 ± 0.1
1.446ProAsn: 1.446 ± 0.189
0.515ProPro: 0.515 ± 0.122
1.201ProGln: 1.201 ± 0.149
0.637ProArg: 0.637 ± 0.124
1.299ProSer: 1.299 ± 0.211
1.74ProThr: 1.74 ± 0.262
1.789ProVal: 1.789 ± 0.165
0.147ProTrp: 0.147 ± 0.057
1.078ProTyr: 1.078 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
2.744GlnAla: 2.744 ± 0.5
0.196GlnCys: 0.196 ± 0.062
1.764GlnAsp: 1.764 ± 0.198
3.161GlnGlu: 3.161 ± 0.25
1.323GlnPhe: 1.323 ± 0.19
1.691GlnGly: 1.691 ± 0.252
0.441GlnHis: 0.441 ± 0.121
2.548GlnIle: 2.548 ± 0.278
3.014GlnLys: 3.014 ± 0.326
3.259GlnLeu: 3.259 ± 0.299
1.176GlnMet: 1.176 ± 0.185
1.911GlnAsn: 1.911 ± 0.268
0.735GlnPro: 0.735 ± 0.113
1.372GlnGln: 1.372 ± 0.2
1.838GlnArg: 1.838 ± 0.243
2.058GlnSer: 2.058 ± 0.441
1.862GlnThr: 1.862 ± 0.195
2.279GlnVal: 2.279 ± 0.263
0.613GlnTrp: 0.613 ± 0.105
1.519GlnTyr: 1.519 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
2.107ArgAla: 2.107 ± 0.248
0.392ArgCys: 0.392 ± 0.101
2.597ArgAsp: 2.597 ± 0.305
3.186ArgGlu: 3.186 ± 0.278
1.838ArgPhe: 1.838 ± 0.239
2.328ArgGly: 2.328 ± 0.264
1.005ArgHis: 1.005 ± 0.178
3.112ArgIle: 3.112 ± 0.247
3.945ArgLys: 3.945 ± 0.325
2.843ArgLeu: 2.843 ± 0.345
1.544ArgMet: 1.544 ± 0.178
2.794ArgAsn: 2.794 ± 0.274
0.833ArgPro: 0.833 ± 0.141
1.568ArgGln: 1.568 ± 0.226
1.862ArgArg: 1.862 ± 0.248
1.397ArgSer: 1.397 ± 0.181
2.303ArgThr: 2.303 ± 0.221
2.597ArgVal: 2.597 ± 0.328
0.613ArgTrp: 0.613 ± 0.122
2.107ArgTyr: 2.107 ± 0.284
0.0ArgXaa: 0.0 ± 0.0
Ser
3.627SerAla: 3.627 ± 0.632
0.441SerCys: 0.441 ± 0.116
3.039SerAsp: 3.039 ± 0.264
3.945SerGlu: 3.945 ± 0.308
2.965SerPhe: 2.965 ± 0.254
4.092SerGly: 4.092 ± 0.475
0.833SerHis: 0.833 ± 0.145
3.553SerIle: 3.553 ± 0.296
4.778SerLys: 4.778 ± 0.385
4.166SerLeu: 4.166 ± 0.319
1.274SerMet: 1.274 ± 0.173
2.867SerAsn: 2.867 ± 0.227
1.642SerPro: 1.642 ± 0.216
2.132SerGln: 2.132 ± 0.326
2.205SerArg: 2.205 ± 0.219
3.235SerSer: 3.235 ± 0.346
3.798SerThr: 3.798 ± 0.391
3.308SerVal: 3.308 ± 0.294
0.637SerTrp: 0.637 ± 0.15
2.303SerTyr: 2.303 ± 0.259
0.0SerXaa: 0.0 ± 0.0
Thr
3.627ThrAla: 3.627 ± 0.38
0.515ThrCys: 0.515 ± 0.109
3.48ThrAsp: 3.48 ± 0.308
4.043ThrGlu: 4.043 ± 0.223
2.794ThrPhe: 2.794 ± 0.294
4.68ThrGly: 4.68 ± 0.404
1.054ThrHis: 1.054 ± 0.154
4.166ThrIle: 4.166 ± 0.4
4.999ThrLys: 4.999 ± 0.402
5.293ThrLeu: 5.293 ± 0.44
1.764ThrMet: 1.764 ± 0.234
3.455ThrAsn: 3.455 ± 0.346
1.593ThrPro: 1.593 ± 0.254
2.107ThrGln: 2.107 ± 0.25
2.794ThrArg: 2.794 ± 0.222
2.965ThrSer: 2.965 ± 0.341
3.578ThrThr: 3.578 ± 0.521
4.46ThrVal: 4.46 ± 0.383
0.49ThrTrp: 0.49 ± 0.093
3.014ThrTyr: 3.014 ± 0.344
0.0ThrXaa: 0.0 ± 0.0
Val
3.921ValAla: 3.921 ± 0.288
0.343ValCys: 0.343 ± 0.102
5.17ValAsp: 5.17 ± 0.353
6.273ValGlu: 6.273 ± 0.402
2.965ValPhe: 2.965 ± 0.333
3.921ValGly: 3.921 ± 0.306
1.103ValHis: 1.103 ± 0.177
4.288ValIle: 4.288 ± 0.336
5.342ValLys: 5.342 ± 0.38
4.533ValLeu: 4.533 ± 0.324
2.132ValMet: 2.132 ± 0.286
3.921ValAsn: 3.921 ± 0.347
1.323ValPro: 1.323 ± 0.158
2.034ValGln: 2.034 ± 0.277
2.303ValArg: 2.303 ± 0.257
3.872ValSer: 3.872 ± 0.373
4.117ValThr: 4.117 ± 0.361
5.391ValVal: 5.391 ± 0.421
0.809ValTrp: 0.809 ± 0.159
2.426ValTyr: 2.426 ± 0.333
0.0ValXaa: 0.0 ± 0.0
Trp
0.662TrpAla: 0.662 ± 0.123
0.27TrpCys: 0.27 ± 0.086
0.956TrpAsp: 0.956 ± 0.169
0.809TrpGlu: 0.809 ± 0.146
0.466TrpPhe: 0.466 ± 0.12
0.76TrpGly: 0.76 ± 0.157
0.27TrpHis: 0.27 ± 0.094
0.809TrpIle: 0.809 ± 0.156
1.005TrpLys: 1.005 ± 0.168
0.833TrpLeu: 0.833 ± 0.158
0.368TrpMet: 0.368 ± 0.116
0.76TrpAsn: 0.76 ± 0.116
0.0TrpPro: 0.0 ± 0.0
0.368TrpGln: 0.368 ± 0.093
0.392TrpArg: 0.392 ± 0.096
0.735TrpSer: 0.735 ± 0.161
0.711TrpThr: 0.711 ± 0.147
0.564TrpVal: 0.564 ± 0.135
0.147TrpTrp: 0.147 ± 0.057
0.49TrpTyr: 0.49 ± 0.129
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.597TyrAla: 2.597 ± 0.25
0.49TyrCys: 0.49 ± 0.117
3.088TyrAsp: 3.088 ± 0.285
3.431TyrGlu: 3.431 ± 0.357
1.642TyrPhe: 1.642 ± 0.23
2.843TyrGly: 2.843 ± 0.337
1.005TyrHis: 1.005 ± 0.167
2.843TyrIle: 2.843 ± 0.257
3.774TyrLys: 3.774 ± 0.327
2.965TyrLeu: 2.965 ± 0.312
1.078TyrMet: 1.078 ± 0.146
3.112TyrAsn: 3.112 ± 0.333
1.348TyrPro: 1.348 ± 0.245
1.47TyrGln: 1.47 ± 0.189
1.74TyrArg: 1.74 ± 0.239
2.597TyrSer: 2.597 ± 0.269
2.843TyrThr: 2.843 ± 0.331
2.843TyrVal: 2.843 ± 0.313
0.49TyrTrp: 0.49 ± 0.118
2.034TyrTyr: 2.034 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 172 proteins (40810 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski