Amino acid dipepetide frequency for Bacillus phage JBP901

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.962AlaAla: 2.962 ± 0.398
0.379AlaCys: 0.379 ± 0.098
3.675AlaAsp: 3.675 ± 0.269
4.922AlaGlu: 4.922 ± 0.413
2.405AlaPhe: 2.405 ± 0.191
4.343AlaGly: 4.343 ± 0.393
0.935AlaHis: 0.935 ± 0.125
4.276AlaIle: 4.276 ± 0.314
4.833AlaLys: 4.833 ± 0.303
5.055AlaLeu: 5.055 ± 0.337
1.67AlaMet: 1.67 ± 0.239
3.274AlaAsn: 3.274 ± 0.382
2.049AlaPro: 2.049 ± 0.321
2.494AlaGln: 2.494 ± 0.25
2.94AlaArg: 2.94 ± 0.258
3.251AlaSer: 3.251 ± 0.392
4.922AlaThr: 4.922 ± 0.402
4.164AlaVal: 4.164 ± 0.32
0.913AlaTrp: 0.913 ± 0.142
3.073AlaTyr: 3.073 ± 0.265
0.0AlaXaa: 0.0 ± 0.0
Cys
0.245CysAla: 0.245 ± 0.077
0.089CysCys: 0.089 ± 0.044
0.423CysAsp: 0.423 ± 0.111
0.534CysGlu: 0.534 ± 0.109
0.267CysPhe: 0.267 ± 0.083
0.445CysGly: 0.445 ± 0.106
0.2CysHis: 0.2 ± 0.064
0.29CysIle: 0.29 ± 0.083
0.557CysLys: 0.557 ± 0.123
0.49CysLeu: 0.49 ± 0.121
0.178CysMet: 0.178 ± 0.076
0.423CysAsn: 0.423 ± 0.109
0.267CysPro: 0.267 ± 0.084
0.156CysGln: 0.156 ± 0.054
0.156CysArg: 0.156 ± 0.058
0.423CysSer: 0.423 ± 0.09
0.401CysThr: 0.401 ± 0.089
0.512CysVal: 0.512 ± 0.103
0.134CysTrp: 0.134 ± 0.066
0.534CysTyr: 0.534 ± 0.105
0.0CysXaa: 0.0 ± 0.0
Asp
3.875AspAla: 3.875 ± 0.279
0.423AspCys: 0.423 ± 0.111
3.118AspAsp: 3.118 ± 0.342
4.788AspGlu: 4.788 ± 0.35
2.361AspPhe: 2.361 ± 0.207
4.409AspGly: 4.409 ± 0.329
0.846AspHis: 0.846 ± 0.134
5.144AspIle: 5.144 ± 0.316
5.322AspLys: 5.322 ± 0.341
5.256AspLeu: 5.256 ± 0.288
1.47AspMet: 1.47 ± 0.191
3.296AspAsn: 3.296 ± 0.269
2.182AspPro: 2.182 ± 0.238
1.603AspGln: 1.603 ± 0.178
2.851AspArg: 2.851 ± 0.28
3.363AspSer: 3.363 ± 0.276
3.162AspThr: 3.162 ± 0.25
4.498AspVal: 4.498 ± 0.335
0.913AspTrp: 0.913 ± 0.136
3.63AspTyr: 3.63 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
4.209GluAla: 4.209 ± 0.306
0.401GluCys: 0.401 ± 0.095
5.077GluAsp: 5.077 ± 0.461
8.128GluGlu: 8.128 ± 0.783
3.051GluPhe: 3.051 ± 0.279
4.788GluGly: 4.788 ± 0.397
1.492GluHis: 1.492 ± 0.205
5.1GluIle: 5.1 ± 0.334
6.08GluLys: 6.08 ± 0.416
7.639GluLeu: 7.639 ± 0.442
2.294GluMet: 2.294 ± 0.215
3.719GluAsn: 3.719 ± 0.289
1.982GluPro: 1.982 ± 0.388
3.519GluGln: 3.519 ± 0.309
3.363GluArg: 3.363 ± 0.289
3.43GluSer: 3.43 ± 0.257
3.274GluThr: 3.274 ± 0.287
5.612GluVal: 5.612 ± 0.376
0.935GluTrp: 0.935 ± 0.153
3.652GluTyr: 3.652 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.182PheAla: 2.182 ± 0.196
0.29PheCys: 0.29 ± 0.077
2.739PheAsp: 2.739 ± 0.239
2.895PheGlu: 2.895 ± 0.242
1.381PhePhe: 1.381 ± 0.184
2.294PheGly: 2.294 ± 0.215
0.757PheHis: 0.757 ± 0.134
2.45PheIle: 2.45 ± 0.273
2.65PheLys: 2.65 ± 0.234
2.516PheLeu: 2.516 ± 0.235
1.314PheMet: 1.314 ± 0.168
2.338PheAsn: 2.338 ± 0.238
1.113PhePro: 1.113 ± 0.17
1.247PheGln: 1.247 ± 0.155
1.358PheArg: 1.358 ± 0.168
3.474PheSer: 3.474 ± 0.373
2.65PheThr: 2.65 ± 0.268
2.65PheVal: 2.65 ± 0.259
0.334PheTrp: 0.334 ± 0.089
1.893PheTyr: 1.893 ± 0.197
0.0PheXaa: 0.0 ± 0.0
Gly
4.009GlyAla: 4.009 ± 0.529
0.49GlyCys: 0.49 ± 0.116
3.608GlyAsp: 3.608 ± 0.308
4.031GlyGlu: 4.031 ± 0.348
2.695GlyPhe: 2.695 ± 0.263
5.523GlyGly: 5.523 ± 0.722
1.18GlyHis: 1.18 ± 0.164
4.231GlyIle: 4.231 ± 0.34
5.434GlyLys: 5.434 ± 0.35
4.699GlyLeu: 4.699 ± 0.304
1.96GlyMet: 1.96 ± 0.256
3.875GlyAsn: 3.875 ± 0.308
0.089GlyPro: 0.089 ± 0.051
2.583GlyGln: 2.583 ± 0.29
2.628GlyArg: 2.628 ± 0.273
4.743GlySer: 4.743 ± 0.383
5.612GlyThr: 5.612 ± 0.423
4.788GlyVal: 4.788 ± 0.356
0.98GlyTrp: 0.98 ± 0.156
3.274GlyTyr: 3.274 ± 0.296
0.0GlyXaa: 0.0 ± 0.0
His
1.113HisAla: 1.113 ± 0.152
0.089HisCys: 0.089 ± 0.048
0.913HisAsp: 0.913 ± 0.16
1.269HisGlu: 1.269 ± 0.181
0.601HisPhe: 0.601 ± 0.114
0.913HisGly: 0.913 ± 0.139
0.356HisHis: 0.356 ± 0.093
1.247HisIle: 1.247 ± 0.204
1.136HisLys: 1.136 ± 0.183
1.559HisLeu: 1.559 ± 0.246
0.468HisMet: 0.468 ± 0.108
1.069HisAsn: 1.069 ± 0.177
0.579HisPro: 0.579 ± 0.1
0.49HisGln: 0.49 ± 0.126
1.136HisArg: 1.136 ± 0.183
0.869HisSer: 0.869 ± 0.134
1.18HisThr: 1.18 ± 0.176
1.559HisVal: 1.559 ± 0.183
0.178HisTrp: 0.178 ± 0.071
0.69HisTyr: 0.69 ± 0.136
0.0HisXaa: 0.0 ± 0.0
Ile
4.677IleAla: 4.677 ± 0.359
0.312IleCys: 0.312 ± 0.083
4.164IleAsp: 4.164 ± 0.349
5.456IleGlu: 5.456 ± 0.297
1.915IlePhe: 1.915 ± 0.208
4.276IleGly: 4.276 ± 0.307
1.358IleHis: 1.358 ± 0.188
3.452IleIle: 3.452 ± 0.33
5.233IleLys: 5.233 ± 0.324
4.298IleLeu: 4.298 ± 0.384
1.715IleMet: 1.715 ± 0.214
3.83IleAsn: 3.83 ± 0.303
2.739IlePro: 2.739 ± 0.268
2.071IleGln: 2.071 ± 0.151
2.806IleArg: 2.806 ± 0.263
4.743IleSer: 4.743 ± 0.342
4.298IleThr: 4.298 ± 0.357
4.61IleVal: 4.61 ± 0.316
0.423IleTrp: 0.423 ± 0.094
2.227IleTyr: 2.227 ± 0.239
0.0IleXaa: 0.0 ± 0.0
Lys
5.167LysAla: 5.167 ± 0.348
0.267LysCys: 0.267 ± 0.089
5.612LysAsp: 5.612 ± 0.388
7.572LysGlu: 7.572 ± 0.518
2.695LysPhe: 2.695 ± 0.236
4.833LysGly: 4.833 ± 0.406
1.314LysHis: 1.314 ± 0.235
4.432LysIle: 4.432 ± 0.311
5.879LysLys: 5.879 ± 0.539
6.48LysLeu: 6.48 ± 0.381
2.316LysMet: 2.316 ± 0.227
3.474LysAsn: 3.474 ± 0.28
2.672LysPro: 2.672 ± 0.249
3.652LysGln: 3.652 ± 0.275
3.474LysArg: 3.474 ± 0.295
3.875LysSer: 3.875 ± 0.294
4.164LysThr: 4.164 ± 0.279
5.79LysVal: 5.79 ± 0.369
0.98LysTrp: 0.98 ± 0.132
3.162LysTyr: 3.162 ± 0.311
0.0LysXaa: 0.0 ± 0.0
Leu
5.322LeuAla: 5.322 ± 0.273
0.557LeuCys: 0.557 ± 0.119
5.634LeuAsp: 5.634 ± 0.415
6.592LeuGlu: 6.592 ± 0.435
3.051LeuPhe: 3.051 ± 0.276
5.3LeuGly: 5.3 ± 0.327
1.292LeuHis: 1.292 ± 0.162
4.543LeuIle: 4.543 ± 0.414
6.191LeuLys: 6.191 ± 0.342
5.857LeuLeu: 5.857 ± 0.455
2.116LeuMet: 2.116 ± 0.229
4.009LeuAsn: 4.009 ± 0.342
2.784LeuPro: 2.784 ± 0.279
3.496LeuGln: 3.496 ± 0.256
4.009LeuArg: 4.009 ± 0.32
4.432LeuSer: 4.432 ± 0.358
5.3LeuThr: 5.3 ± 0.39
5.434LeuVal: 5.434 ± 0.337
0.824LeuTrp: 0.824 ± 0.147
3.207LeuTyr: 3.207 ± 0.239
0.0LeuXaa: 0.0 ± 0.0
Met
1.67MetAla: 1.67 ± 0.174
0.134MetCys: 0.134 ± 0.061
1.826MetAsp: 1.826 ± 0.189
1.893MetGlu: 1.893 ± 0.194
1.247MetPhe: 1.247 ± 0.143
1.269MetGly: 1.269 ± 0.179
0.379MetHis: 0.379 ± 0.089
1.648MetIle: 1.648 ± 0.178
2.583MetLys: 2.583 ± 0.217
2.16MetLeu: 2.16 ± 0.18
0.869MetMet: 0.869 ± 0.139
1.937MetAsn: 1.937 ± 0.225
0.601MetPro: 0.601 ± 0.139
0.935MetGln: 0.935 ± 0.126
1.358MetArg: 1.358 ± 0.155
1.782MetSer: 1.782 ± 0.209
2.049MetThr: 2.049 ± 0.19
1.559MetVal: 1.559 ± 0.171
0.267MetTrp: 0.267 ± 0.087
0.891MetTyr: 0.891 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.118AsnAla: 3.118 ± 0.301
0.401AsnCys: 0.401 ± 0.088
2.695AsnAsp: 2.695 ± 0.197
3.162AsnGlu: 3.162 ± 0.308
2.294AsnPhe: 2.294 ± 0.256
4.476AsnGly: 4.476 ± 0.417
1.002AsnHis: 1.002 ± 0.173
3.741AsnIle: 3.741 ± 0.316
4.053AsnLys: 4.053 ± 0.309
4.254AsnLeu: 4.254 ± 0.313
1.759AsnMet: 1.759 ± 0.195
2.94AsnAsn: 2.94 ± 0.296
2.65AsnPro: 2.65 ± 0.255
1.804AsnGln: 1.804 ± 0.206
2.695AsnArg: 2.695 ± 0.246
2.984AsnSer: 2.984 ± 0.284
3.741AsnThr: 3.741 ± 0.372
3.942AsnVal: 3.942 ± 0.325
0.579AsnTrp: 0.579 ± 0.112
1.96AsnTyr: 1.96 ± 0.186
0.0AsnXaa: 0.0 ± 0.0
Pro
1.804ProAla: 1.804 ± 0.236
0.312ProCys: 0.312 ± 0.077
2.116ProAsp: 2.116 ± 0.23
2.182ProGlu: 2.182 ± 0.233
1.18ProPhe: 1.18 ± 0.181
1.314ProGly: 1.314 ± 0.168
0.468ProHis: 0.468 ± 0.083
2.027ProIle: 2.027 ± 0.202
2.761ProLys: 2.761 ± 0.318
2.516ProLeu: 2.516 ± 0.265
0.98ProMet: 0.98 ± 0.174
2.071ProAsn: 2.071 ± 0.211
0.779ProPro: 0.779 ± 0.111
1.425ProGln: 1.425 ± 0.237
1.158ProArg: 1.158 ± 0.132
2.182ProSer: 2.182 ± 0.27
2.583ProThr: 2.583 ± 0.347
2.583ProVal: 2.583 ± 0.319
0.223ProTrp: 0.223 ± 0.077
1.381ProTyr: 1.381 ± 0.173
0.0ProXaa: 0.0 ± 0.0
Gln
2.895GlnAla: 2.895 ± 0.292
0.334GlnCys: 0.334 ± 0.102
2.116GlnAsp: 2.116 ± 0.226
3.274GlnGlu: 3.274 ± 0.252
1.247GlnPhe: 1.247 ± 0.179
2.695GlnGly: 2.695 ± 0.236
0.958GlnHis: 0.958 ± 0.175
2.16GlnIle: 2.16 ± 0.208
2.583GlnLys: 2.583 ± 0.239
3.385GlnLeu: 3.385 ± 0.263
0.869GlnMet: 0.869 ± 0.126
1.737GlnAsn: 1.737 ± 0.198
1.67GlnPro: 1.67 ± 0.336
2.049GlnGln: 2.049 ± 0.227
1.336GlnArg: 1.336 ± 0.154
2.45GlnSer: 2.45 ± 0.237
2.182GlnThr: 2.182 ± 0.247
2.65GlnVal: 2.65 ± 0.236
0.512GlnTrp: 0.512 ± 0.098
1.247GlnTyr: 1.247 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
2.984ArgAla: 2.984 ± 0.279
0.312ArgCys: 0.312 ± 0.097
2.338ArgAsp: 2.338 ± 0.239
3.942ArgGlu: 3.942 ± 0.319
1.893ArgPhe: 1.893 ± 0.164
2.851ArgGly: 2.851 ± 0.291
0.601ArgHis: 0.601 ± 0.111
2.962ArgIle: 2.962 ± 0.243
3.185ArgLys: 3.185 ± 0.326
3.541ArgLeu: 3.541 ± 0.294
1.514ArgMet: 1.514 ± 0.182
2.071ArgAsn: 2.071 ± 0.203
1.425ArgPro: 1.425 ± 0.199
1.559ArgGln: 1.559 ± 0.159
2.227ArgArg: 2.227 ± 0.222
2.16ArgSer: 2.16 ± 0.227
2.16ArgThr: 2.16 ± 0.203
3.496ArgVal: 3.496 ± 0.38
0.557ArgTrp: 0.557 ± 0.097
1.982ArgTyr: 1.982 ± 0.221
0.0ArgXaa: 0.0 ± 0.0
Ser
3.474SerAla: 3.474 ± 0.351
0.334SerCys: 0.334 ± 0.1
3.519SerAsp: 3.519 ± 0.232
3.474SerGlu: 3.474 ± 0.314
2.895SerPhe: 2.895 ± 0.238
4.365SerGly: 4.365 ± 0.393
0.913SerHis: 0.913 ± 0.145
4.009SerIle: 4.009 ± 0.296
4.32SerLys: 4.32 ± 0.292
5.1SerLeu: 5.1 ± 0.309
1.336SerMet: 1.336 ± 0.182
3.185SerAsn: 3.185 ± 0.284
1.737SerPro: 1.737 ± 0.205
1.915SerGln: 1.915 ± 0.214
2.472SerArg: 2.472 ± 0.25
4.031SerSer: 4.031 ± 0.511
3.786SerThr: 3.786 ± 0.395
3.875SerVal: 3.875 ± 0.34
0.735SerTrp: 0.735 ± 0.156
2.695SerTyr: 2.695 ± 0.235
0.0SerXaa: 0.0 ± 0.0
Thr
3.919ThrAla: 3.919 ± 0.427
0.401ThrCys: 0.401 ± 0.115
4.164ThrAsp: 4.164 ± 0.272
4.031ThrGlu: 4.031 ± 0.316
2.561ThrPhe: 2.561 ± 0.266
4.988ThrGly: 4.988 ± 0.367
1.203ThrHis: 1.203 ± 0.147
4.766ThrIle: 4.766 ± 0.324
4.075ThrLys: 4.075 ± 0.3
5.1ThrLeu: 5.1 ± 0.307
1.381ThrMet: 1.381 ± 0.179
3.563ThrAsn: 3.563 ± 0.303
2.917ThrPro: 2.917 ± 0.293
2.227ThrGln: 2.227 ± 0.236
2.628ThrArg: 2.628 ± 0.244
3.073ThrSer: 3.073 ± 0.27
3.652ThrThr: 3.652 ± 0.317
5.612ThrVal: 5.612 ± 0.388
0.779ThrTrp: 0.779 ± 0.136
3.118ThrTyr: 3.118 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
5.055ValAla: 5.055 ± 0.354
0.579ValCys: 0.579 ± 0.113
4.498ValAsp: 4.498 ± 0.252
5.746ValGlu: 5.746 ± 0.5
2.739ValPhe: 2.739 ± 0.236
3.964ValGly: 3.964 ± 0.293
1.158ValHis: 1.158 ± 0.141
4.454ValIle: 4.454 ± 0.382
6.035ValLys: 6.035 ± 0.372
5.701ValLeu: 5.701 ± 0.391
1.559ValMet: 1.559 ± 0.194
3.919ValAsn: 3.919 ± 0.32
2.739ValPro: 2.739 ± 0.289
2.984ValGln: 2.984 ± 0.253
3.051ValArg: 3.051 ± 0.267
3.897ValSer: 3.897 ± 0.289
5.055ValThr: 5.055 ± 0.428
5.1ValVal: 5.1 ± 0.373
0.891ValTrp: 0.891 ± 0.144
2.984ValTyr: 2.984 ± 0.262
0.0ValXaa: 0.0 ± 0.0
Trp
1.024TrpAla: 1.024 ± 0.203
0.2TrpCys: 0.2 ± 0.055
1.069TrpAsp: 1.069 ± 0.153
0.869TrpGlu: 0.869 ± 0.137
0.512TrpPhe: 0.512 ± 0.123
0.69TrpGly: 0.69 ± 0.147
0.178TrpHis: 0.178 ± 0.064
0.735TrpIle: 0.735 ± 0.12
0.958TrpLys: 0.958 ± 0.16
0.869TrpLeu: 0.869 ± 0.136
0.2TrpMet: 0.2 ± 0.062
0.735TrpAsn: 0.735 ± 0.17
0.0TrpPro: 0.0 ± 0.0
0.445TrpGln: 0.445 ± 0.092
0.49TrpArg: 0.49 ± 0.103
0.601TrpSer: 0.601 ± 0.105
0.624TrpThr: 0.624 ± 0.105
0.891TrpVal: 0.891 ± 0.136
0.2TrpTrp: 0.2 ± 0.063
0.534TrpTyr: 0.534 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.828TyrAla: 2.828 ± 0.257
0.423TyrCys: 0.423 ± 0.107
3.207TyrAsp: 3.207 ± 0.265
2.984TyrGlu: 2.984 ± 0.288
1.537TyrPhe: 1.537 ± 0.188
2.695TyrGly: 2.695 ± 0.256
0.824TyrHis: 0.824 ± 0.133
2.94TyrIle: 2.94 ± 0.243
3.964TyrLys: 3.964 ± 0.303
3.474TyrLeu: 3.474 ± 0.252
1.069TyrMet: 1.069 ± 0.144
2.784TyrAsn: 2.784 ± 0.254
1.069TyrPro: 1.069 ± 0.162
1.692TyrGln: 1.692 ± 0.197
1.626TyrArg: 1.626 ± 0.205
2.405TyrSer: 2.405 ± 0.22
3.363TyrThr: 3.363 ± 0.248
2.784TyrVal: 2.784 ± 0.29
0.468TyrTrp: 0.468 ± 0.103
1.893TyrTyr: 1.893 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (44905 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski