Amino acid dipepetide frequency for Lactococcus phage phiL47

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.082AlaAla: 0.082 ± 0.049
0.22AlaCys: 0.22 ± 0.086
2.883AlaAsp: 2.883 ± 0.3
3.267AlaGlu: 3.267 ± 0.287
2.032AlaPhe: 2.032 ± 0.231
3.597AlaGly: 3.597 ± 0.541
0.604AlaHis: 0.604 ± 0.124
3.734AlaIle: 3.734 ± 0.388
4.942AlaLys: 4.942 ± 0.556
4.722AlaLeu: 4.722 ± 0.526
1.483AlaMet: 1.483 ± 0.234
2.965AlaAsn: 2.965 ± 0.311
1.29AlaPro: 1.29 ± 0.28
2.416AlaGln: 2.416 ± 0.489
1.51AlaArg: 1.51 ± 0.176
3.405AlaSer: 3.405 ± 0.373
3.267AlaThr: 3.267 ± 0.357
3.432AlaVal: 3.432 ± 0.359
0.412AlaTrp: 0.412 ± 0.094
2.581AlaTyr: 2.581 ± 0.284
0.0AlaXaa: 0.0 ± 0.0
Cys
0.137CysAla: 0.137 ± 0.066
0.0CysCys: 0.0 ± 0.0
0.412CysAsp: 0.412 ± 0.121
0.494CysGlu: 0.494 ± 0.151
0.247CysPhe: 0.247 ± 0.094
0.494CysGly: 0.494 ± 0.149
0.0CysHis: 0.0 ± 0.0
0.412CysIle: 0.412 ± 0.105
0.631CysLys: 0.631 ± 0.144
0.384CysLeu: 0.384 ± 0.105
0.11CysMet: 0.11 ± 0.057
0.384CysAsn: 0.384 ± 0.105
0.412CysPro: 0.412 ± 0.178
0.165CysGln: 0.165 ± 0.059
0.329CysArg: 0.329 ± 0.09
0.659CysSer: 0.659 ± 0.156
0.22CysThr: 0.22 ± 0.088
0.302CysVal: 0.302 ± 0.1
0.055CysTrp: 0.055 ± 0.038
0.302CysTyr: 0.302 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
2.636AspAla: 2.636 ± 0.338
0.247AspCys: 0.247 ± 0.093
5.189AspAsp: 5.189 ± 0.483
6.123AspGlu: 6.123 ± 0.438
4.146AspPhe: 4.146 ± 0.3
4.668AspGly: 4.668 ± 0.359
0.467AspHis: 0.467 ± 0.124
5.903AspIle: 5.903 ± 0.39
6.754AspLys: 6.754 ± 0.511
5.958AspLeu: 5.958 ± 0.522
1.84AspMet: 1.84 ± 0.218
4.585AspAsn: 4.585 ± 0.31
0.934AspPro: 0.934 ± 0.154
1.043AspGln: 1.043 ± 0.202
1.84AspArg: 1.84 ± 0.226
4.915AspSer: 4.915 ± 0.345
3.816AspThr: 3.816 ± 0.322
3.926AspVal: 3.926 ± 0.322
0.796AspTrp: 0.796 ± 0.148
3.24AspTyr: 3.24 ± 0.307
0.0AspXaa: 0.0 ± 0.0
Glu
4.036GluAla: 4.036 ± 0.412
0.439GluCys: 0.439 ± 0.105
5.546GluAsp: 5.546 ± 0.564
7.221GluGlu: 7.221 ± 0.597
3.35GluPhe: 3.35 ± 0.376
3.157GluGly: 3.157 ± 0.301
1.236GluHis: 1.236 ± 0.233
7.331GluIle: 7.331 ± 0.429
7.056GluLys: 7.056 ± 0.541
7.88GluLeu: 7.88 ± 0.413
2.526GluMet: 2.526 ± 0.232
5.134GluAsn: 5.134 ± 0.39
1.208GluPro: 1.208 ± 0.217
2.498GluGln: 2.498 ± 0.26
2.718GluArg: 2.718 ± 0.253
5.574GluSer: 5.574 ± 0.361
4.146GluThr: 4.146 ± 0.347
5.024GluVal: 5.024 ± 0.363
0.961GluTrp: 0.961 ± 0.187
4.311GluTyr: 4.311 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
1.977PheAla: 1.977 ± 0.246
0.302PheCys: 0.302 ± 0.115
3.926PheAsp: 3.926 ± 0.375
4.036PheGlu: 4.036 ± 0.384
1.318PhePhe: 1.318 ± 0.225
2.636PheGly: 2.636 ± 0.242
0.522PheHis: 0.522 ± 0.135
3.295PheIle: 3.295 ± 0.368
4.091PheLys: 4.091 ± 0.307
3.212PheLeu: 3.212 ± 0.367
1.373PheMet: 1.373 ± 0.215
3.02PheAsn: 3.02 ± 0.264
0.796PhePro: 0.796 ± 0.157
1.208PheGln: 1.208 ± 0.189
1.647PheArg: 1.647 ± 0.261
2.91PheSer: 2.91 ± 0.319
2.993PheThr: 2.993 ± 0.302
2.746PheVal: 2.746 ± 0.311
0.439PheTrp: 0.439 ± 0.119
2.334PheTyr: 2.334 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
2.801GlyAla: 2.801 ± 0.385
0.412GlyCys: 0.412 ± 0.107
3.322GlyAsp: 3.322 ± 0.312
4.558GlyGlu: 4.558 ± 0.375
3.075GlyPhe: 3.075 ± 0.298
3.542GlyGly: 3.542 ± 0.513
0.824GlyHis: 0.824 ± 0.141
4.777GlyIle: 4.777 ± 0.424
4.53GlyLys: 4.53 ± 0.479
4.613GlyLeu: 4.613 ± 0.368
1.812GlyMet: 1.812 ± 0.213
4.256GlyAsn: 4.256 ± 0.359
0.0GlyPro: 0.0 ± 0.0
1.345GlyGln: 1.345 ± 0.203
2.004GlyArg: 2.004 ± 0.229
3.652GlySer: 3.652 ± 0.356
4.613GlyThr: 4.613 ± 0.46
3.844GlyVal: 3.844 ± 0.295
0.824GlyTrp: 0.824 ± 0.204
2.883GlyTyr: 2.883 ± 0.269
0.0GlyXaa: 0.0 ± 0.0
His
0.631HisAla: 0.631 ± 0.126
0.055HisCys: 0.055 ± 0.041
1.016HisAsp: 1.016 ± 0.187
0.988HisGlu: 0.988 ± 0.173
0.659HisPhe: 0.659 ± 0.152
1.126HisGly: 1.126 ± 0.219
0.192HisHis: 0.192 ± 0.079
1.153HisIle: 1.153 ± 0.165
0.988HisLys: 0.988 ± 0.177
0.906HisLeu: 0.906 ± 0.143
0.302HisMet: 0.302 ± 0.101
0.988HisAsn: 0.988 ± 0.206
0.357HisPro: 0.357 ± 0.091
0.412HisGln: 0.412 ± 0.099
0.467HisArg: 0.467 ± 0.115
0.796HisSer: 0.796 ± 0.122
0.851HisThr: 0.851 ± 0.151
0.631HisVal: 0.631 ± 0.135
0.137HisTrp: 0.137 ± 0.064
0.741HisTyr: 0.741 ± 0.164
0.0HisXaa: 0.0 ± 0.0
Ile
4.063IleAla: 4.063 ± 0.301
0.467IleCys: 0.467 ± 0.113
5.711IleAsp: 5.711 ± 0.407
6.425IleGlu: 6.425 ± 0.46
3.295IlePhe: 3.295 ± 0.365
3.569IleGly: 3.569 ± 0.402
0.988IleHis: 0.988 ± 0.169
5.464IleIle: 5.464 ± 0.528
7.688IleLys: 7.688 ± 0.455
5.793IleLeu: 5.793 ± 0.466
1.702IleMet: 1.702 ± 0.197
5.436IleAsn: 5.436 ± 0.418
2.581IlePro: 2.581 ± 0.312
2.471IleGln: 2.471 ± 0.385
2.306IleArg: 2.306 ± 0.244
5.189IleSer: 5.189 ± 0.357
4.777IleThr: 4.777 ± 0.438
5.272IleVal: 5.272 ± 0.469
0.879IleTrp: 0.879 ± 0.16
3.13IleTyr: 3.13 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
4.832LysAla: 4.832 ± 0.453
0.522LysCys: 0.522 ± 0.169
6.644LysAsp: 6.644 ± 0.34
9.17LysGlu: 9.17 ± 0.515
3.597LysPhe: 3.597 ± 0.366
4.668LysGly: 4.668 ± 0.322
1.538LysHis: 1.538 ± 0.237
6.672LysIle: 6.672 ± 0.351
7.523LysLys: 7.523 ± 0.511
8.072LysLeu: 8.072 ± 0.543
3.048LysMet: 3.048 ± 0.317
5.628LysAsn: 5.628 ± 0.362
2.114LysPro: 2.114 ± 0.268
3.02LysGln: 3.02 ± 0.318
3.981LysArg: 3.981 ± 0.34
4.53LysSer: 4.53 ± 0.43
5.326LysThr: 5.326 ± 0.463
4.777LysVal: 4.777 ± 0.315
0.879LysTrp: 0.879 ± 0.157
3.734LysTyr: 3.734 ± 0.341
0.0LysXaa: 0.0 ± 0.0
Leu
4.42LeuAla: 4.42 ± 0.353
0.631LeuCys: 0.631 ± 0.158
5.793LeuAsp: 5.793 ± 0.41
6.617LeuGlu: 6.617 ± 0.504
3.789LeuPhe: 3.789 ± 0.478
5.024LeuGly: 5.024 ± 0.48
0.988LeuHis: 0.988 ± 0.184
5.766LeuIle: 5.766 ± 0.491
7.523LeuLys: 7.523 ± 0.52
6.205LeuLeu: 6.205 ± 0.461
1.949LeuMet: 1.949 ± 0.227
5.546LeuAsn: 5.546 ± 0.455
2.169LeuPro: 2.169 ± 0.225
2.691LeuGln: 2.691 ± 0.348
2.746LeuArg: 2.746 ± 0.28
5.903LeuSer: 5.903 ± 0.415
5.574LeuThr: 5.574 ± 0.369
4.997LeuVal: 4.997 ± 0.409
0.686LeuTrp: 0.686 ± 0.124
3.35LeuTyr: 3.35 ± 0.365
0.0LeuXaa: 0.0 ± 0.0
Met
1.73MetAla: 1.73 ± 0.211
0.192MetCys: 0.192 ± 0.07
1.702MetAsp: 1.702 ± 0.255
2.114MetGlu: 2.114 ± 0.255
1.263MetPhe: 1.263 ± 0.212
1.181MetGly: 1.181 ± 0.196
0.11MetHis: 0.11 ± 0.062
1.84MetIle: 1.84 ± 0.235
2.636MetLys: 2.636 ± 0.276
1.949MetLeu: 1.949 ± 0.211
0.686MetMet: 0.686 ± 0.122
2.004MetAsn: 2.004 ± 0.204
0.659MetPro: 0.659 ± 0.14
1.126MetGln: 1.126 ± 0.358
0.769MetArg: 0.769 ± 0.185
2.114MetSer: 2.114 ± 0.243
1.867MetThr: 1.867 ± 0.254
1.428MetVal: 1.428 ± 0.216
0.165MetTrp: 0.165 ± 0.068
1.29MetTyr: 1.29 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
3.35AsnAla: 3.35 ± 0.419
0.357AsnCys: 0.357 ± 0.107
3.487AsnAsp: 3.487 ± 0.307
5.189AsnGlu: 5.189 ± 0.427
2.91AsnPhe: 2.91 ± 0.345
4.448AsnGly: 4.448 ± 0.322
0.906AsnHis: 0.906 ± 0.182
5.656AsnIle: 5.656 ± 0.419
6.04AsnLys: 6.04 ± 0.399
5.601AsnLeu: 5.601 ± 0.477
1.428AsnMet: 1.428 ± 0.169
3.707AsnAsn: 3.707 ± 0.394
1.757AsnPro: 1.757 ± 0.23
2.718AsnGln: 2.718 ± 0.413
2.087AsnArg: 2.087 ± 0.207
4.063AsnSer: 4.063 ± 0.324
3.35AsnThr: 3.35 ± 0.305
3.844AsnVal: 3.844 ± 0.33
0.741AsnTrp: 0.741 ± 0.134
2.91AsnTyr: 2.91 ± 0.305
0.0AsnXaa: 0.0 ± 0.0
Pro
1.126ProAla: 1.126 ± 0.203
0.082ProCys: 0.082 ± 0.046
1.565ProAsp: 1.565 ± 0.219
1.647ProGlu: 1.647 ± 0.215
1.153ProPhe: 1.153 ± 0.179
0.137ProGly: 0.137 ± 0.061
0.467ProHis: 0.467 ± 0.09
1.675ProIle: 1.675 ± 0.201
2.169ProLys: 2.169 ± 0.306
2.059ProLeu: 2.059 ± 0.228
0.659ProMet: 0.659 ± 0.116
1.785ProAsn: 1.785 ± 0.221
0.384ProPro: 0.384 ± 0.085
0.659ProGln: 0.659 ± 0.124
0.549ProArg: 0.549 ± 0.144
1.73ProSer: 1.73 ± 0.214
1.483ProThr: 1.483 ± 0.27
1.84ProVal: 1.84 ± 0.277
0.137ProTrp: 0.137 ± 0.069
0.961ProTyr: 0.961 ± 0.154
0.0ProXaa: 0.0 ± 0.0
Gln
1.949GlnAla: 1.949 ± 0.452
0.137GlnCys: 0.137 ± 0.062
1.702GlnAsp: 1.702 ± 0.182
2.361GlnGlu: 2.361 ± 0.233
1.181GlnPhe: 1.181 ± 0.197
1.812GlnGly: 1.812 ± 0.192
0.714GlnHis: 0.714 ± 0.167
2.581GlnIle: 2.581 ± 0.264
2.993GlnLys: 2.993 ± 0.428
3.322GlnLeu: 3.322 ± 0.421
1.043GlnMet: 1.043 ± 0.268
2.059GlnAsn: 2.059 ± 0.261
0.631GlnPro: 0.631 ± 0.124
1.29GlnGln: 1.29 ± 0.48
1.592GlnArg: 1.592 ± 0.197
1.949GlnSer: 1.949 ± 0.308
2.224GlnThr: 2.224 ± 0.407
1.345GlnVal: 1.345 ± 0.204
0.439GlnTrp: 0.439 ± 0.131
1.647GlnTyr: 1.647 ± 0.23
0.0GlnXaa: 0.0 ± 0.0
Arg
1.373ArgAla: 1.373 ± 0.145
0.247ArgCys: 0.247 ± 0.154
2.828ArgAsp: 2.828 ± 0.244
2.855ArgGlu: 2.855 ± 0.331
1.483ArgPhe: 1.483 ± 0.241
2.444ArgGly: 2.444 ± 0.293
0.494ArgHis: 0.494 ± 0.128
2.773ArgIle: 2.773 ± 0.25
3.24ArgLys: 3.24 ± 0.34
2.471ArgLeu: 2.471 ± 0.245
0.796ArgMet: 0.796 ± 0.136
2.361ArgAsn: 2.361 ± 0.224
0.631ArgPro: 0.631 ± 0.123
1.098ArgGln: 1.098 ± 0.209
1.071ArgArg: 1.071 ± 0.169
1.757ArgSer: 1.757 ± 0.242
1.812ArgThr: 1.812 ± 0.216
2.608ArgVal: 2.608 ± 0.282
0.494ArgTrp: 0.494 ± 0.124
1.922ArgTyr: 1.922 ± 0.188
0.0ArgXaa: 0.0 ± 0.0
Ser
3.542SerAla: 3.542 ± 0.317
0.357SerCys: 0.357 ± 0.112
4.558SerAsp: 4.558 ± 0.457
5.024SerGlu: 5.024 ± 0.387
3.322SerPhe: 3.322 ± 0.329
4.613SerGly: 4.613 ± 0.379
0.577SerHis: 0.577 ± 0.134
5.024SerIle: 5.024 ± 0.397
5.793SerLys: 5.793 ± 0.394
4.887SerLeu: 4.887 ± 0.483
1.785SerMet: 1.785 ± 0.208
3.844SerAsn: 3.844 ± 0.365
1.483SerPro: 1.483 ± 0.164
2.471SerGln: 2.471 ± 0.244
2.636SerArg: 2.636 ± 0.222
5.189SerSer: 5.189 ± 0.784
3.734SerThr: 3.734 ± 0.284
3.844SerVal: 3.844 ± 0.378
0.741SerTrp: 0.741 ± 0.167
2.828SerTyr: 2.828 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
3.624ThrAla: 3.624 ± 0.512
0.439ThrCys: 0.439 ± 0.148
3.899ThrAsp: 3.899 ± 0.309
3.981ThrGlu: 3.981 ± 0.387
2.691ThrPhe: 2.691 ± 0.258
3.157ThrGly: 3.157 ± 0.464
1.098ThrHis: 1.098 ± 0.192
4.777ThrIle: 4.777 ± 0.477
5.766ThrLys: 5.766 ± 0.365
5.162ThrLeu: 5.162 ± 0.348
1.428ThrMet: 1.428 ± 0.223
3.597ThrAsn: 3.597 ± 0.38
2.032ThrPro: 2.032 ± 0.22
2.142ThrGln: 2.142 ± 0.251
2.004ThrArg: 2.004 ± 0.239
4.091ThrSer: 4.091 ± 0.42
3.459ThrThr: 3.459 ± 0.437
4.997ThrVal: 4.997 ± 0.404
0.906ThrTrp: 0.906 ± 0.187
3.02ThrTyr: 3.02 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
3.569ValAla: 3.569 ± 0.35
0.522ValCys: 0.522 ± 0.142
4.805ValAsp: 4.805 ± 0.361
4.86ValGlu: 4.86 ± 0.425
3.048ValPhe: 3.048 ± 0.28
3.734ValGly: 3.734 ± 0.364
0.796ValHis: 0.796 ± 0.142
4.338ValIle: 4.338 ± 0.392
5.381ValLys: 5.381 ± 0.366
4.42ValLeu: 4.42 ± 0.413
1.318ValMet: 1.318 ± 0.21
3.432ValAsn: 3.432 ± 0.308
1.4ValPro: 1.4 ± 0.212
2.196ValGln: 2.196 ± 0.241
2.087ValArg: 2.087 ± 0.227
4.009ValSer: 4.009 ± 0.335
4.997ValThr: 4.997 ± 0.369
4.118ValVal: 4.118 ± 0.412
0.741ValTrp: 0.741 ± 0.176
2.444ValTyr: 2.444 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
0.522TrpAla: 0.522 ± 0.127
0.082TrpCys: 0.082 ± 0.048
0.741TrpAsp: 0.741 ± 0.134
0.796TrpGlu: 0.796 ± 0.187
0.659TrpPhe: 0.659 ± 0.171
0.604TrpGly: 0.604 ± 0.132
0.165TrpHis: 0.165 ± 0.065
0.714TrpIle: 0.714 ± 0.185
0.824TrpLys: 0.824 ± 0.143
0.906TrpLeu: 0.906 ± 0.177
0.522TrpMet: 0.522 ± 0.132
1.016TrpAsn: 1.016 ± 0.163
0.0TrpPro: 0.0 ± 0.0
0.384TrpGln: 0.384 ± 0.103
0.439TrpArg: 0.439 ± 0.139
0.769TrpSer: 0.769 ± 0.168
0.686TrpThr: 0.686 ± 0.144
0.522TrpVal: 0.522 ± 0.145
0.137TrpTrp: 0.137 ± 0.062
0.439TrpTyr: 0.439 ± 0.129
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.444TyrAla: 2.444 ± 0.23
0.439TyrCys: 0.439 ± 0.122
3.295TyrAsp: 3.295 ± 0.309
3.624TyrGlu: 3.624 ± 0.31
1.565TyrPhe: 1.565 ± 0.233
3.075TyrGly: 3.075 ± 0.26
0.714TyrHis: 0.714 ± 0.137
3.322TyrIle: 3.322 ± 0.332
3.707TyrLys: 3.707 ± 0.317
3.844TyrLeu: 3.844 ± 0.398
0.988TyrMet: 0.988 ± 0.159
2.801TyrAsn: 2.801 ± 0.298
1.428TyrPro: 1.428 ± 0.242
1.647TyrGln: 1.647 ± 0.2
1.977TyrArg: 1.977 ± 0.269
3.048TyrSer: 3.048 ± 0.363
3.075TyrThr: 3.075 ± 0.359
2.718TyrVal: 2.718 ± 0.25
0.357TyrTrp: 0.357 ± 0.097
1.812TyrTyr: 1.812 ± 0.181
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 189 proteins (36423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski