Amino acid dipepetide frequency for Bacillus phage PBS1 (Bacteriophage PBS1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.16AlaAla: 1.16 ± 0.2
0.224AlaCys: 0.224 ± 0.052
2.202AlaAsp: 2.202 ± 0.275
2.096AlaGlu: 2.096 ± 0.209
1.819AlaPhe: 1.819 ± 0.143
2.004AlaGly: 2.004 ± 0.32
0.382AlaHis: 0.382 ± 0.089
3.836AlaIle: 3.836 ± 0.262
3.56AlaLys: 3.56 ± 0.313
3.23AlaLeu: 3.23 ± 0.268
0.831AlaMet: 0.831 ± 0.111
2.808AlaAsn: 2.808 ± 0.202
0.738AlaPro: 0.738 ± 0.108
0.765AlaGln: 0.765 ± 0.119
1.384AlaArg: 1.384 ± 0.14
2.215AlaSer: 2.215 ± 0.243
2.307AlaThr: 2.307 ± 0.239
2.202AlaVal: 2.202 ± 0.189
0.277AlaTrp: 0.277 ± 0.056
1.384AlaTyr: 1.384 ± 0.122
0.0AlaXaa: 0.0 ± 0.0
Cys
0.29CysAla: 0.29 ± 0.074
0.04CysCys: 0.04 ± 0.021
0.396CysAsp: 0.396 ± 0.068
0.475CysGlu: 0.475 ± 0.078
0.303CysPhe: 0.303 ± 0.064
0.62CysGly: 0.62 ± 0.114
0.145CysHis: 0.145 ± 0.04
0.527CysIle: 0.527 ± 0.07
0.751CysLys: 0.751 ± 0.121
0.567CysLeu: 0.567 ± 0.089
0.132CysMet: 0.132 ± 0.033
0.857CysAsn: 0.857 ± 0.161
0.369CysPro: 0.369 ± 0.084
0.198CysGln: 0.198 ± 0.056
0.316CysArg: 0.316 ± 0.069
0.422CysSer: 0.422 ± 0.069
0.448CysThr: 0.448 ± 0.069
0.237CysVal: 0.237 ± 0.06
0.053CysTrp: 0.053 ± 0.029
0.475CysTyr: 0.475 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
2.175AspAla: 2.175 ± 0.203
0.593AspCys: 0.593 ± 0.097
4.258AspAsp: 4.258 ± 0.318
5.867AspGlu: 5.867 ± 0.363
3.401AspPhe: 3.401 ± 0.189
3.362AspGly: 3.362 ± 0.24
0.804AspHis: 0.804 ± 0.113
7.541AspIle: 7.541 ± 0.372
6.17AspLys: 6.17 ± 0.338
6.13AspLeu: 6.13 ± 0.332
1.345AspMet: 1.345 ± 0.119
5.221AspAsn: 5.221 ± 0.262
2.136AspPro: 2.136 ± 0.184
1.477AspGln: 1.477 ± 0.116
2.109AspArg: 2.109 ± 0.138
4.298AspSer: 4.298 ± 0.221
3.362AspThr: 3.362 ± 0.243
3.296AspVal: 3.296 ± 0.216
0.659AspTrp: 0.659 ± 0.095
3.863AspTyr: 3.863 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
2.399GluAla: 2.399 ± 0.188
0.58GluCys: 0.58 ± 0.102
4.852GluAsp: 4.852 ± 0.298
7.449GluGlu: 7.449 ± 0.484
3.836GluPhe: 3.836 ± 0.251
3.296GluGly: 3.296 ± 0.193
1.081GluHis: 1.081 ± 0.118
7.277GluIle: 7.277 ± 0.414
7.923GluLys: 7.923 ± 0.412
7.251GluLeu: 7.251 ± 0.364
2.057GluMet: 2.057 ± 0.166
5.682GluAsn: 5.682 ± 0.272
1.16GluPro: 1.16 ± 0.159
2.096GluGln: 2.096 ± 0.16
2.795GluArg: 2.795 ± 0.248
4.72GluSer: 4.72 ± 0.291
3.467GluThr: 3.467 ± 0.216
4.126GluVal: 4.126 ± 0.284
0.593GluTrp: 0.593 ± 0.102
4.416GluTyr: 4.416 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
1.582PheAla: 1.582 ± 0.113
0.356PheCys: 0.356 ± 0.068
3.375PheAsp: 3.375 ± 0.234
2.94PheGlu: 2.94 ± 0.215
1.767PhePhe: 1.767 ± 0.163
2.175PheGly: 2.175 ± 0.239
0.791PheHis: 0.791 ± 0.112
4.601PheIle: 4.601 ± 0.225
4.627PheLys: 4.627 ± 0.284
4.258PheLeu: 4.258 ± 0.246
1.173PheMet: 1.173 ± 0.122
4.324PheAsn: 4.324 ± 0.21
1.2PhePro: 1.2 ± 0.113
1.015PheGln: 1.015 ± 0.136
1.437PheArg: 1.437 ± 0.125
4.1PheSer: 4.1 ± 0.246
2.426PheThr: 2.426 ± 0.204
1.912PheVal: 1.912 ± 0.139
0.29PheTrp: 0.29 ± 0.067
2.254PheTyr: 2.254 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
1.701GlyAla: 1.701 ± 0.286
0.303GlyCys: 0.303 ± 0.079
2.544GlyAsp: 2.544 ± 0.228
3.296GlyGlu: 3.296 ± 0.318
2.281GlyPhe: 2.281 ± 0.173
2.268GlyGly: 2.268 ± 0.412
0.725GlyHis: 0.725 ± 0.092
4.311GlyIle: 4.311 ± 0.308
5.643GlyLys: 5.643 ± 0.308
3.916GlyLeu: 3.916 ± 0.306
1.107GlyMet: 1.107 ± 0.142
3.731GlyAsn: 3.731 ± 0.289
0.277GlyPro: 0.277 ± 0.055
1.318GlyGln: 1.318 ± 0.151
1.885GlyArg: 1.885 ± 0.262
3.243GlySer: 3.243 ± 0.269
2.624GlyThr: 2.624 ± 0.214
2.61GlyVal: 2.61 ± 0.223
0.33GlyTrp: 0.33 ± 0.061
2.373GlyTyr: 2.373 ± 0.17
0.0GlyXaa: 0.0 ± 0.0
His
0.567HisAla: 0.567 ± 0.104
0.119HisCys: 0.119 ± 0.043
0.857HisAsp: 0.857 ± 0.116
0.817HisGlu: 0.817 ± 0.109
0.989HisPhe: 0.989 ± 0.116
0.554HisGly: 0.554 ± 0.105
0.264HisHis: 0.264 ± 0.07
1.529HisIle: 1.529 ± 0.166
1.173HisLys: 1.173 ± 0.1
1.424HisLeu: 1.424 ± 0.147
0.343HisMet: 0.343 ± 0.071
1.081HisAsn: 1.081 ± 0.117
0.554HisPro: 0.554 ± 0.101
0.303HisGln: 0.303 ± 0.063
0.606HisArg: 0.606 ± 0.088
0.949HisSer: 0.949 ± 0.131
0.554HisThr: 0.554 ± 0.1
0.765HisVal: 0.765 ± 0.121
0.066HisTrp: 0.066 ± 0.026
0.844HisTyr: 0.844 ± 0.104
0.0HisXaa: 0.0 ± 0.0
Ile
3.454IleAla: 3.454 ± 0.228
0.725IleCys: 0.725 ± 0.089
7.581IleAsp: 7.581 ± 0.361
7.633IleGlu: 7.633 ± 0.403
3.599IlePhe: 3.599 ± 0.255
4.126IleGly: 4.126 ± 0.323
1.226IleHis: 1.226 ± 0.135
8.82IleIle: 8.82 ± 0.438
9.413IleLys: 9.413 ± 0.37
7.488IleLeu: 7.488 ± 0.392
1.885IleMet: 1.885 ± 0.165
8.266IleAsn: 8.266 ± 0.389
2.9IlePro: 2.9 ± 0.255
2.689IleGln: 2.689 ± 0.185
3.705IleArg: 3.705 ± 0.217
6.776IleSer: 6.776 ± 0.285
5.458IleThr: 5.458 ± 0.275
4.496IleVal: 4.496 ± 0.274
0.501IleTrp: 0.501 ± 0.078
3.902IleTyr: 3.902 ± 0.248
0.0IleXaa: 0.0 ± 0.0
Lys
3.217LysAla: 3.217 ± 0.243
0.857LysCys: 0.857 ± 0.14
7.277LysAsp: 7.277 ± 0.273
9.149LysGlu: 9.149 ± 0.444
4.377LysPhe: 4.377 ± 0.216
4.667LysGly: 4.667 ± 0.334
1.318LysHis: 1.318 ± 0.167
8.556LysIle: 8.556 ± 0.335
9.519LysLys: 9.519 ± 0.521
8.345LysLeu: 8.345 ± 0.377
2.294LysMet: 2.294 ± 0.172
8.002LysAsn: 8.002 ± 0.365
2.32LysPro: 2.32 ± 0.196
2.373LysGln: 2.373 ± 0.174
4.39LysArg: 4.39 ± 0.26
5.392LysSer: 5.392 ± 0.27
4.917LysThr: 4.917 ± 0.266
5.366LysVal: 5.366 ± 0.271
0.765LysTrp: 0.765 ± 0.124
4.931LysTyr: 4.931 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
3.415LeuAla: 3.415 ± 0.24
0.58LeuCys: 0.58 ± 0.092
6.038LeuAsp: 6.038 ± 0.279
6.987LeuGlu: 6.987 ± 0.356
3.981LeuPhe: 3.981 ± 0.268
3.823LeuGly: 3.823 ± 0.268
1.107LeuHis: 1.107 ± 0.133
7.277LeuIle: 7.277 ± 0.387
8.227LeuLys: 8.227 ± 0.315
8.002LeuLeu: 8.002 ± 0.428
2.333LeuMet: 2.333 ± 0.175
7.488LeuAsn: 7.488 ± 0.347
2.399LeuPro: 2.399 ± 0.205
2.03LeuGln: 2.03 ± 0.155
3.045LeuArg: 3.045 ± 0.202
6.869LeuSer: 6.869 ± 0.37
4.403LeuThr: 4.403 ± 0.207
4.443LeuVal: 4.443 ± 0.278
0.475LeuTrp: 0.475 ± 0.075
3.929LeuTyr: 3.929 ± 0.233
0.0LeuXaa: 0.0 ± 0.0
Met
1.028MetAla: 1.028 ± 0.132
0.119MetCys: 0.119 ± 0.041
1.819MetAsp: 1.819 ± 0.152
1.569MetGlu: 1.569 ± 0.137
0.91MetPhe: 0.91 ± 0.115
0.989MetGly: 0.989 ± 0.121
0.277MetHis: 0.277 ± 0.069
2.017MetIle: 2.017 ± 0.171
2.558MetLys: 2.558 ± 0.187
1.595MetLeu: 1.595 ± 0.161
0.461MetMet: 0.461 ± 0.087
1.753MetAsn: 1.753 ± 0.144
0.659MetPro: 0.659 ± 0.086
0.672MetGln: 0.672 ± 0.105
0.857MetArg: 0.857 ± 0.094
1.846MetSer: 1.846 ± 0.144
1.305MetThr: 1.305 ± 0.11
1.121MetVal: 1.121 ± 0.107
0.158MetTrp: 0.158 ± 0.044
0.896MetTyr: 0.896 ± 0.1
0.0MetXaa: 0.0 ± 0.0
Asn
2.782AsnAla: 2.782 ± 0.255
0.646AsnCys: 0.646 ± 0.107
5.656AsnAsp: 5.656 ± 0.265
5.893AsnGlu: 5.893 ± 0.336
3.639AsnPhe: 3.639 ± 0.206
4.351AsnGly: 4.351 ± 0.248
1.173AsnHis: 1.173 ± 0.126
8.24AsnIle: 8.24 ± 0.372
8.741AsnLys: 8.741 ± 0.435
6.803AsnLeu: 6.803 ± 0.335
1.516AsnMet: 1.516 ± 0.137
7.712AsnAsn: 7.712 ± 0.364
2.281AsnPro: 2.281 ± 0.198
2.136AsnGln: 2.136 ± 0.154
2.663AsnArg: 2.663 ± 0.199
5.999AsnSer: 5.999 ± 0.329
4.39AsnThr: 4.39 ± 0.23
3.48AsnVal: 3.48 ± 0.226
0.514AsnTrp: 0.514 ± 0.083
4.153AsnTyr: 4.153 ± 0.254
0.0AsnXaa: 0.0 ± 0.0
Pro
0.87ProAla: 0.87 ± 0.158
0.171ProCys: 0.171 ± 0.045
1.898ProAsp: 1.898 ± 0.154
1.727ProGlu: 1.727 ± 0.174
1.332ProPhe: 1.332 ± 0.138
0.343ProGly: 0.343 ± 0.067
0.382ProHis: 0.382 ± 0.07
2.413ProIle: 2.413 ± 0.201
2.096ProLys: 2.096 ± 0.171
2.294ProLeu: 2.294 ± 0.163
0.646ProMet: 0.646 ± 0.114
2.123ProAsn: 2.123 ± 0.174
0.541ProPro: 0.541 ± 0.12
0.62ProGln: 0.62 ± 0.105
0.672ProArg: 0.672 ± 0.104
2.175ProSer: 2.175 ± 0.176
1.622ProThr: 1.622 ± 0.172
1.595ProVal: 1.595 ± 0.153
0.119ProTrp: 0.119 ± 0.035
1.437ProTyr: 1.437 ± 0.167
0.0ProXaa: 0.0 ± 0.0
Gln
1.068GlnAla: 1.068 ± 0.117
0.145GlnCys: 0.145 ± 0.05
1.318GlnAsp: 1.318 ± 0.124
2.043GlnGlu: 2.043 ± 0.167
1.279GlnPhe: 1.279 ± 0.126
1.266GlnGly: 1.266 ± 0.148
0.396GlnHis: 0.396 ± 0.07
2.228GlnIle: 2.228 ± 0.177
2.584GlnLys: 2.584 ± 0.2
2.624GlnLeu: 2.624 ± 0.184
0.58GlnMet: 0.58 ± 0.092
1.938GlnAsn: 1.938 ± 0.184
0.646GlnPro: 0.646 ± 0.092
0.778GlnGln: 0.778 ± 0.127
0.857GlnArg: 0.857 ± 0.099
1.477GlnSer: 1.477 ± 0.146
1.384GlnThr: 1.384 ± 0.156
1.279GlnVal: 1.279 ± 0.139
0.277GlnTrp: 0.277 ± 0.071
1.49GlnTyr: 1.49 ± 0.143
0.0GlnXaa: 0.0 ± 0.0
Arg
1.2ArgAla: 1.2 ± 0.114
0.409ArgCys: 0.409 ± 0.1
2.663ArgAsp: 2.663 ± 0.202
2.676ArgGlu: 2.676 ± 0.224
1.78ArgPhe: 1.78 ± 0.155
1.793ArgGly: 1.793 ± 0.249
0.475ArgHis: 0.475 ± 0.079
3.612ArgIle: 3.612 ± 0.206
3.705ArgLys: 3.705 ± 0.22
3.006ArgLeu: 3.006 ± 0.256
1.2ArgMet: 1.2 ± 0.128
2.993ArgAsn: 2.993 ± 0.24
0.896ArgPro: 0.896 ± 0.107
1.028ArgGln: 1.028 ± 0.114
1.608ArgArg: 1.608 ± 0.192
2.268ArgSer: 2.268 ± 0.174
1.516ArgThr: 1.516 ± 0.135
1.885ArgVal: 1.885 ± 0.146
0.33ArgTrp: 0.33 ± 0.065
1.753ArgTyr: 1.753 ± 0.14
0.0ArgXaa: 0.0 ± 0.0
Ser
2.61SerAla: 2.61 ± 0.326
0.396SerCys: 0.396 ± 0.072
4.68SerAsp: 4.68 ± 0.241
4.944SerGlu: 4.944 ± 0.286
3.309SerPhe: 3.309 ± 0.189
3.494SerGly: 3.494 ± 0.329
1.015SerHis: 1.015 ± 0.109
6.908SerIle: 6.908 ± 0.307
6.499SerLys: 6.499 ± 0.354
6.249SerLeu: 6.249 ± 0.233
1.358SerMet: 1.358 ± 0.148
5.669SerAsn: 5.669 ± 0.321
1.912SerPro: 1.912 ± 0.152
1.938SerGln: 1.938 ± 0.164
2.479SerArg: 2.479 ± 0.21
4.917SerSer: 4.917 ± 0.488
3.916SerThr: 3.916 ± 0.297
3.678SerVal: 3.678 ± 0.231
0.475SerTrp: 0.475 ± 0.078
3.309SerTyr: 3.309 ± 0.22
0.0SerXaa: 0.0 ± 0.0
Thr
1.925ThrAla: 1.925 ± 0.21
0.264ThrCys: 0.264 ± 0.069
3.757ThrAsp: 3.757 ± 0.347
4.1ThrGlu: 4.1 ± 0.265
2.61ThrPhe: 2.61 ± 0.183
2.689ThrGly: 2.689 ± 0.275
0.936ThrHis: 0.936 ± 0.108
5.102ThrIle: 5.102 ± 0.329
4.904ThrLys: 4.904 ± 0.253
4.878ThrLeu: 4.878 ± 0.241
1.015ThrMet: 1.015 ± 0.116
3.889ThrAsn: 3.889 ± 0.244
1.411ThrPro: 1.411 ± 0.163
1.055ThrGln: 1.055 ± 0.116
1.793ThrArg: 1.793 ± 0.169
3.81ThrSer: 3.81 ± 0.239
3.388ThrThr: 3.388 ± 0.242
3.52ThrVal: 3.52 ± 0.265
0.422ThrTrp: 0.422 ± 0.085
2.742ThrTyr: 2.742 ± 0.205
0.0ThrXaa: 0.0 ± 0.0
Val
1.978ValAla: 1.978 ± 0.185
0.461ValCys: 0.461 ± 0.085
3.098ValAsp: 3.098 ± 0.242
3.665ValGlu: 3.665 ± 0.201
2.36ValPhe: 2.36 ± 0.179
1.938ValGly: 1.938 ± 0.179
0.857ValHis: 0.857 ± 0.12
4.733ValIle: 4.733 ± 0.279
5.128ValLys: 5.128 ± 0.301
4.351ValLeu: 4.351 ± 0.247
1.121ValMet: 1.121 ± 0.112
4.087ValAsn: 4.087 ± 0.223
1.305ValPro: 1.305 ± 0.137
1.582ValGln: 1.582 ± 0.162
1.938ValArg: 1.938 ± 0.134
4.206ValSer: 4.206 ± 0.258
3.138ValThr: 3.138 ± 0.236
2.808ValVal: 2.808 ± 0.21
0.277ValTrp: 0.277 ± 0.057
2.268ValTyr: 2.268 ± 0.165
0.0ValXaa: 0.0 ± 0.0
Trp
0.211TrpAla: 0.211 ± 0.055
0.066TrpCys: 0.066 ± 0.028
0.488TrpAsp: 0.488 ± 0.079
0.712TrpGlu: 0.712 ± 0.119
0.224TrpPhe: 0.224 ± 0.052
0.33TrpGly: 0.33 ± 0.065
0.145TrpHis: 0.145 ± 0.042
0.475TrpIle: 0.475 ± 0.074
0.686TrpLys: 0.686 ± 0.1
0.712TrpLeu: 0.712 ± 0.087
0.105TrpMet: 0.105 ± 0.037
0.672TrpAsn: 0.672 ± 0.187
0.0TrpPro: 0.0 ± 0.0
0.185TrpGln: 0.185 ± 0.05
0.224TrpArg: 0.224 ± 0.055
0.527TrpSer: 0.527 ± 0.092
0.435TrpThr: 0.435 ± 0.078
0.382TrpVal: 0.382 ± 0.068
0.119TrpTrp: 0.119 ± 0.048
0.396TrpTyr: 0.396 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.793TyrAla: 1.793 ± 0.17
0.567TyrCys: 0.567 ± 0.078
3.309TyrAsp: 3.309 ± 0.222
3.019TyrGlu: 3.019 ± 0.206
2.808TyrPhe: 2.808 ± 0.242
2.399TyrGly: 2.399 ± 0.212
0.857TyrHis: 0.857 ± 0.115
4.641TyrIle: 4.641 ± 0.305
4.126TyrLys: 4.126 ± 0.271
3.744TyrLeu: 3.744 ± 0.269
1.094TyrMet: 1.094 ± 0.122
4.469TyrAsn: 4.469 ± 0.26
1.371TyrPro: 1.371 ± 0.145
1.397TyrGln: 1.397 ± 0.128
2.03TyrArg: 2.03 ± 0.199
3.546TyrSer: 3.546 ± 0.236
3.151TyrThr: 3.151 ± 0.217
2.096TyrVal: 2.096 ± 0.175
0.396TyrTrp: 0.396 ± 0.074
2.782TyrTyr: 2.782 ± 0.209
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 311 proteins (75853 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski