Amino acid dipepetide frequency for Lactobacillus virus Lb338-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.074AlaAla: 4.074 ± 0.928
0.306AlaCys: 0.306 ± 0.083
3.885AlaAsp: 3.885 ± 0.33
3.391AlaGlu: 3.391 ± 0.314
2.402AlaPhe: 2.402 ± 0.214
3.461AlaGly: 3.461 ± 0.346
1.036AlaHis: 1.036 ± 0.135
4.356AlaIle: 4.356 ± 0.349
5.863AlaLys: 5.863 ± 0.409
4.921AlaLeu: 4.921 ± 0.357
1.648AlaMet: 1.648 ± 0.223
4.45AlaAsn: 4.45 ± 0.509
1.813AlaPro: 1.813 ± 0.274
2.496AlaGln: 2.496 ± 0.536
2.119AlaArg: 2.119 ± 0.218
5.157AlaSer: 5.157 ± 0.835
4.662AlaThr: 4.662 ± 0.675
3.744AlaVal: 3.744 ± 0.374
0.871AlaTrp: 0.871 ± 0.156
3.226AlaTyr: 3.226 ± 0.305
0.0AlaXaa: 0.0 ± 0.0
Cys
0.33CysAla: 0.33 ± 0.084
0.047CysCys: 0.047 ± 0.039
0.518CysAsp: 0.518 ± 0.107
0.306CysGlu: 0.306 ± 0.084
0.283CysPhe: 0.283 ± 0.071
0.636CysGly: 0.636 ± 0.155
0.259CysHis: 0.259 ± 0.077
0.306CysIle: 0.306 ± 0.084
0.753CysLys: 0.753 ± 0.158
0.494CysLeu: 0.494 ± 0.127
0.212CysMet: 0.212 ± 0.069
0.283CysAsn: 0.283 ± 0.084
0.542CysPro: 0.542 ± 0.169
0.165CysGln: 0.165 ± 0.065
0.141CysArg: 0.141 ± 0.049
0.306CysSer: 0.306 ± 0.096
0.33CysThr: 0.33 ± 0.092
0.4CysVal: 0.4 ± 0.09
0.165CysTrp: 0.165 ± 0.063
0.683CysTyr: 0.683 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
4.686AspAla: 4.686 ± 0.323
0.33AspCys: 0.33 ± 0.098
4.568AspAsp: 4.568 ± 0.421
3.956AspGlu: 3.956 ± 0.35
3.085AspPhe: 3.085 ± 0.251
4.403AspGly: 4.403 ± 0.46
1.177AspHis: 1.177 ± 0.175
5.086AspIle: 5.086 ± 0.377
6.593AspLys: 6.593 ± 0.442
5.792AspLeu: 5.792 ± 0.401
1.766AspMet: 1.766 ± 0.22
3.838AspAsn: 3.838 ± 0.336
2.331AspPro: 2.331 ± 0.242
1.837AspGln: 1.837 ± 0.204
1.978AspArg: 1.978 ± 0.243
5.133AspSer: 5.133 ± 0.434
4.521AspThr: 4.521 ± 0.313
3.815AspVal: 3.815 ± 0.298
1.083AspTrp: 1.083 ± 0.156
3.838AspTyr: 3.838 ± 0.324
0.0AspXaa: 0.0 ± 0.0
Glu
3.32GluAla: 3.32 ± 0.332
0.518GluCys: 0.518 ± 0.124
4.921GluAsp: 4.921 ± 0.389
4.285GluGlu: 4.285 ± 0.444
1.813GluPhe: 1.813 ± 0.212
2.567GluGly: 2.567 ± 0.273
1.531GluHis: 1.531 ± 0.194
3.556GluIle: 3.556 ± 0.331
4.992GluLys: 4.992 ± 0.523
4.851GluLeu: 4.851 ± 0.347
0.942GluMet: 0.942 ± 0.135
3.061GluAsn: 3.061 ± 0.285
1.531GluPro: 1.531 ± 0.197
2.755GluGln: 2.755 ± 0.282
1.954GluArg: 1.954 ± 0.197
4.144GluSer: 4.144 ± 0.379
3.438GluThr: 3.438 ± 0.329
3.579GluVal: 3.579 ± 0.321
0.659GluTrp: 0.659 ± 0.11
3.038GluTyr: 3.038 ± 0.277
0.0GluXaa: 0.0 ± 0.0
Phe
1.79PheAla: 1.79 ± 0.211
0.306PheCys: 0.306 ± 0.093
2.213PheAsp: 2.213 ± 0.26
1.978PheGlu: 1.978 ± 0.212
1.201PhePhe: 1.201 ± 0.15
2.213PheGly: 2.213 ± 0.26
0.471PheHis: 0.471 ± 0.096
2.213PheIle: 2.213 ± 0.215
3.014PheLys: 3.014 ± 0.265
2.614PheLeu: 2.614 ± 0.237
1.013PheMet: 1.013 ± 0.15
2.237PheAsn: 2.237 ± 0.247
1.295PhePro: 1.295 ± 0.19
0.965PheGln: 0.965 ± 0.167
1.342PheArg: 1.342 ± 0.178
3.673PheSer: 3.673 ± 0.311
2.614PheThr: 2.614 ± 0.27
2.637PheVal: 2.637 ± 0.293
0.377PheTrp: 0.377 ± 0.106
1.46PheTyr: 1.46 ± 0.2
0.0PheXaa: 0.0 ± 0.0
Gly
2.826GlyAla: 2.826 ± 0.287
0.471GlyCys: 0.471 ± 0.108
3.979GlyAsp: 3.979 ± 0.295
2.59GlyGlu: 2.59 ± 0.242
2.284GlyPhe: 2.284 ± 0.215
3.838GlyGly: 3.838 ± 0.527
1.319GlyHis: 1.319 ± 0.187
4.756GlyIle: 4.756 ± 0.332
5.628GlyLys: 5.628 ± 0.461
4.804GlyLeu: 4.804 ± 0.408
1.742GlyMet: 1.742 ± 0.236
3.979GlyAsn: 3.979 ± 0.379
0.73GlyPro: 0.73 ± 0.141
2.049GlyGln: 2.049 ± 0.314
2.284GlyArg: 2.284 ± 0.223
4.945GlySer: 4.945 ± 0.456
4.285GlyThr: 4.285 ± 0.463
4.144GlyVal: 4.144 ± 0.295
0.871GlyTrp: 0.871 ± 0.158
3.603GlyTyr: 3.603 ± 0.341
0.0GlyXaa: 0.0 ± 0.0
His
1.083HisAla: 1.083 ± 0.16
0.283HisCys: 0.283 ± 0.075
1.248HisAsp: 1.248 ± 0.165
0.848HisGlu: 0.848 ± 0.155
0.683HisPhe: 0.683 ± 0.138
1.013HisGly: 1.013 ± 0.185
0.353HisHis: 0.353 ± 0.087
1.154HisIle: 1.154 ± 0.181
1.413HisLys: 1.413 ± 0.209
1.295HisLeu: 1.295 ± 0.196
0.283HisMet: 0.283 ± 0.1
1.342HisAsn: 1.342 ± 0.188
0.589HisPro: 0.589 ± 0.121
0.753HisGln: 0.753 ± 0.13
0.589HisArg: 0.589 ± 0.122
1.248HisSer: 1.248 ± 0.169
1.06HisThr: 1.06 ± 0.136
1.083HisVal: 1.083 ± 0.194
0.165HisTrp: 0.165 ± 0.059
1.06HisTyr: 1.06 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
3.862IleAla: 3.862 ± 0.345
0.706IleCys: 0.706 ± 0.141
4.992IleAsp: 4.992 ± 0.391
3.249IleGlu: 3.249 ± 0.317
1.813IlePhe: 1.813 ± 0.173
3.838IleGly: 3.838 ± 0.279
0.965IleHis: 0.965 ± 0.137
3.391IleIle: 3.391 ± 0.326
6.711IleLys: 6.711 ± 0.472
4.427IleLeu: 4.427 ± 0.385
1.695IleMet: 1.695 ± 0.215
4.121IleAsn: 4.121 ± 0.277
2.519IlePro: 2.519 ± 0.27
2.284IleGln: 2.284 ± 0.265
2.614IleArg: 2.614 ± 0.253
4.898IleSer: 4.898 ± 0.335
3.65IleThr: 3.65 ± 0.327
3.862IleVal: 3.862 ± 0.34
0.589IleTrp: 0.589 ± 0.15
2.943IleTyr: 2.943 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
5.18LysAla: 5.18 ± 0.359
0.683LysCys: 0.683 ± 0.144
6.499LysAsp: 6.499 ± 0.465
6.617LysGlu: 6.617 ± 0.553
3.132LysPhe: 3.132 ± 0.275
4.804LysGly: 4.804 ± 0.31
1.954LysHis: 1.954 ± 0.277
4.686LysIle: 4.686 ± 0.378
6.475LysLys: 6.475 ± 0.532
7.841LysLeu: 7.841 ± 0.532
2.308LysMet: 2.308 ± 0.27
4.615LysAsn: 4.615 ± 0.403
2.472LysPro: 2.472 ± 0.247
4.191LysGln: 4.191 ± 0.348
3.603LysArg: 3.603 ± 0.398
6.051LysSer: 6.051 ± 0.491
3.956LysThr: 3.956 ± 0.302
6.028LysVal: 6.028 ± 0.39
0.871LysTrp: 0.871 ± 0.188
3.862LysTyr: 3.862 ± 0.38
0.0LysXaa: 0.0 ± 0.0
Leu
6.24LeuAla: 6.24 ± 0.436
0.612LeuCys: 0.612 ± 0.126
6.428LeuAsp: 6.428 ± 0.364
5.11LeuGlu: 5.11 ± 0.388
2.92LeuPhe: 2.92 ± 0.276
5.392LeuGly: 5.392 ± 0.395
1.13LeuHis: 1.13 ± 0.182
4.262LeuIle: 4.262 ± 0.39
6.664LeuLys: 6.664 ± 0.391
6.522LeuLeu: 6.522 ± 0.423
1.954LeuMet: 1.954 ± 0.229
4.968LeuAsn: 4.968 ± 0.316
2.943LeuPro: 2.943 ± 0.3
3.014LeuGln: 3.014 ± 0.295
2.896LeuArg: 2.896 ± 0.256
6.405LeuSer: 6.405 ± 0.433
5.015LeuThr: 5.015 ± 0.316
5.345LeuVal: 5.345 ± 0.39
0.683LeuTrp: 0.683 ± 0.133
3.579LeuTyr: 3.579 ± 0.349
0.0LeuXaa: 0.0 ± 0.0
Met
1.531MetAla: 1.531 ± 0.214
0.141MetCys: 0.141 ± 0.056
1.413MetAsp: 1.413 ± 0.206
1.601MetGlu: 1.601 ± 0.242
0.848MetPhe: 0.848 ± 0.153
1.695MetGly: 1.695 ± 0.237
0.353MetHis: 0.353 ± 0.083
1.46MetIle: 1.46 ± 0.207
2.049MetLys: 2.049 ± 0.224
2.708MetLeu: 2.708 ± 0.251
0.447MetMet: 0.447 ± 0.103
1.319MetAsn: 1.319 ± 0.185
0.824MetPro: 0.824 ± 0.134
1.013MetGln: 1.013 ± 0.162
0.824MetArg: 0.824 ± 0.142
1.837MetSer: 1.837 ± 0.227
1.413MetThr: 1.413 ± 0.175
1.319MetVal: 1.319 ± 0.151
0.283MetTrp: 0.283 ± 0.076
1.201MetTyr: 1.201 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
3.626AsnAla: 3.626 ± 0.728
0.565AsnCys: 0.565 ± 0.115
3.697AsnAsp: 3.697 ± 0.309
3.179AsnGlu: 3.179 ± 0.282
2.284AsnPhe: 2.284 ± 0.244
3.956AsnGly: 3.956 ± 0.4
1.177AsnHis: 1.177 ± 0.139
3.508AsnIle: 3.508 ± 0.29
5.604AsnLys: 5.604 ± 0.3
4.615AsnLeu: 4.615 ± 0.372
1.436AsnMet: 1.436 ± 0.192
3.956AsnAsn: 3.956 ± 0.409
2.449AsnPro: 2.449 ± 0.251
2.449AsnGln: 2.449 ± 0.303
2.001AsnArg: 2.001 ± 0.23
4.497AsnSer: 4.497 ± 0.374
3.202AsnThr: 3.202 ± 0.316
3.32AsnVal: 3.32 ± 0.264
0.565AsnTrp: 0.565 ± 0.125
3.697AsnTyr: 3.697 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
1.695ProAla: 1.695 ± 0.225
0.235ProCys: 0.235 ± 0.077
2.778ProAsp: 2.778 ± 0.225
2.237ProGlu: 2.237 ± 0.244
1.107ProPhe: 1.107 ± 0.195
1.013ProGly: 1.013 ± 0.157
0.518ProHis: 0.518 ± 0.119
2.826ProIle: 2.826 ± 0.295
2.425ProLys: 2.425 ± 0.302
2.284ProLeu: 2.284 ± 0.258
0.777ProMet: 0.777 ± 0.124
1.813ProAsn: 1.813 ± 0.196
0.494ProPro: 0.494 ± 0.127
0.942ProGln: 0.942 ± 0.21
1.083ProArg: 1.083 ± 0.162
2.708ProSer: 2.708 ± 0.331
2.213ProThr: 2.213 ± 0.236
2.519ProVal: 2.519 ± 0.281
0.4ProTrp: 0.4 ± 0.102
2.049ProTyr: 2.049 ± 0.249
0.0ProXaa: 0.0 ± 0.0
Gln
3.626GlnAla: 3.626 ± 0.475
0.094GlnCys: 0.094 ± 0.06
2.378GlnAsp: 2.378 ± 0.283
2.355GlnGlu: 2.355 ± 0.29
0.918GlnPhe: 0.918 ± 0.131
2.519GlnGly: 2.519 ± 0.221
0.447GlnHis: 0.447 ± 0.099
2.072GlnIle: 2.072 ± 0.215
2.849GlnLys: 2.849 ± 0.269
3.485GlnLeu: 3.485 ± 0.261
0.942GlnMet: 0.942 ± 0.177
1.648GlnAsn: 1.648 ± 0.173
1.177GlnPro: 1.177 ± 0.271
2.143GlnGln: 2.143 ± 0.391
1.272GlnArg: 1.272 ± 0.211
2.778GlnSer: 2.778 ± 0.353
1.625GlnThr: 1.625 ± 0.301
3.438GlnVal: 3.438 ± 0.281
0.377GlnTrp: 0.377 ± 0.1
1.719GlnTyr: 1.719 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
2.26ArgAla: 2.26 ± 0.271
0.306ArgCys: 0.306 ± 0.119
1.884ArgAsp: 1.884 ± 0.231
2.355ArgGlu: 2.355 ± 0.305
1.483ArgPhe: 1.483 ± 0.206
2.119ArgGly: 2.119 ± 0.246
0.659ArgHis: 0.659 ± 0.135
2.096ArgIle: 2.096 ± 0.226
2.967ArgLys: 2.967 ± 0.328
3.061ArgLeu: 3.061 ± 0.258
0.824ArgMet: 0.824 ± 0.15
1.837ArgAsn: 1.837 ± 0.244
1.389ArgPro: 1.389 ± 0.162
1.578ArgGln: 1.578 ± 0.216
1.672ArgArg: 1.672 ± 0.21
2.001ArgSer: 2.001 ± 0.233
1.766ArgThr: 1.766 ± 0.188
2.402ArgVal: 2.402 ± 0.231
0.377ArgTrp: 0.377 ± 0.087
2.166ArgTyr: 2.166 ± 0.246
0.0ArgXaa: 0.0 ± 0.0
Ser
4.615SerAla: 4.615 ± 0.854
0.447SerCys: 0.447 ± 0.111
4.686SerAsp: 4.686 ± 0.396
4.026SerGlu: 4.026 ± 0.317
2.684SerPhe: 2.684 ± 0.257
5.722SerGly: 5.722 ± 0.519
1.436SerHis: 1.436 ± 0.208
4.756SerIle: 4.756 ± 0.426
7.323SerLys: 7.323 ± 0.594
6.216SerLeu: 6.216 ± 0.337
1.766SerMet: 1.766 ± 0.176
4.992SerAsn: 4.992 ± 0.402
2.237SerPro: 2.237 ± 0.218
2.637SerGln: 2.637 ± 0.282
2.684SerArg: 2.684 ± 0.248
6.617SerSer: 6.617 ± 0.629
4.968SerThr: 4.968 ± 0.567
4.356SerVal: 4.356 ± 0.369
1.06SerTrp: 1.06 ± 0.177
4.026SerTyr: 4.026 ± 0.257
0.0SerXaa: 0.0 ± 0.0
Thr
3.932ThrAla: 3.932 ± 0.849
0.33ThrCys: 0.33 ± 0.105
4.45ThrAsp: 4.45 ± 0.391
2.59ThrGlu: 2.59 ± 0.239
2.402ThrPhe: 2.402 ± 0.25
4.45ThrGly: 4.45 ± 0.568
0.895ThrHis: 0.895 ± 0.164
4.78ThrIle: 4.78 ± 0.312
4.427ThrLys: 4.427 ± 0.317
5.345ThrLeu: 5.345 ± 0.416
1.319ThrMet: 1.319 ± 0.157
3.579ThrAsn: 3.579 ± 0.325
2.378ThrPro: 2.378 ± 0.284
2.049ThrGln: 2.049 ± 0.347
1.601ThrArg: 1.601 ± 0.207
4.074ThrSer: 4.074 ± 0.399
3.838ThrThr: 3.838 ± 0.458
4.238ThrVal: 4.238 ± 0.314
0.824ThrTrp: 0.824 ± 0.15
3.603ThrTyr: 3.603 ± 0.299
0.0ThrXaa: 0.0 ± 0.0
Val
4.78ValAla: 4.78 ± 0.371
0.306ValCys: 0.306 ± 0.087
4.474ValAsp: 4.474 ± 0.359
3.367ValGlu: 3.367 ± 0.279
2.143ValPhe: 2.143 ± 0.251
3.556ValGly: 3.556 ± 0.305
0.918ValHis: 0.918 ± 0.185
3.932ValIle: 3.932 ± 0.306
5.063ValLys: 5.063 ± 0.33
5.204ValLeu: 5.204 ± 0.413
1.46ValMet: 1.46 ± 0.18
4.074ValAsn: 4.074 ± 0.344
2.355ValPro: 2.355 ± 0.282
1.954ValGln: 1.954 ± 0.228
2.237ValArg: 2.237 ± 0.24
5.84ValSer: 5.84 ± 0.352
4.474ValThr: 4.474 ± 0.354
4.144ValVal: 4.144 ± 0.319
0.753ValTrp: 0.753 ± 0.138
3.391ValTyr: 3.391 ± 0.304
0.0ValXaa: 0.0 ± 0.0
Trp
0.73TrpAla: 0.73 ± 0.133
0.141TrpCys: 0.141 ± 0.051
1.013TrpAsp: 1.013 ± 0.154
0.918TrpGlu: 0.918 ± 0.142
0.471TrpPhe: 0.471 ± 0.109
0.565TrpGly: 0.565 ± 0.114
0.024TrpHis: 0.024 ± 0.026
0.659TrpIle: 0.659 ± 0.12
0.753TrpLys: 0.753 ± 0.14
1.389TrpLeu: 1.389 ± 0.246
0.283TrpMet: 0.283 ± 0.087
0.518TrpAsn: 0.518 ± 0.121
0.141TrpPro: 0.141 ± 0.064
0.471TrpGln: 0.471 ± 0.112
0.4TrpArg: 0.4 ± 0.113
0.871TrpSer: 0.871 ± 0.126
0.659TrpThr: 0.659 ± 0.122
0.871TrpVal: 0.871 ± 0.16
0.235TrpTrp: 0.235 ± 0.094
0.753TrpTyr: 0.753 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.697TyrAla: 3.697 ± 0.319
0.33TyrCys: 0.33 ± 0.093
3.791TyrAsp: 3.791 ± 0.278
2.496TyrGlu: 2.496 ± 0.29
1.672TyrPhe: 1.672 ± 0.194
3.367TyrGly: 3.367 ± 0.301
0.918TyrHis: 0.918 ± 0.133
3.273TyrIle: 3.273 ± 0.284
4.285TyrLys: 4.285 ± 0.351
4.215TyrLeu: 4.215 ± 0.408
1.436TyrMet: 1.436 ± 0.189
3.155TyrAsn: 3.155 ± 0.285
1.907TyrPro: 1.907 ± 0.231
2.119TyrGln: 2.119 ± 0.234
1.907TyrArg: 1.907 ± 0.234
4.074TyrSer: 4.074 ± 0.283
3.414TyrThr: 3.414 ± 0.365
3.179TyrVal: 3.179 ± 0.271
0.659TyrTrp: 0.659 ± 0.132
2.567TyrTyr: 2.567 ± 0.286
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 199 proteins (42470 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski