Amino acid dipeptide frequency for Clostridium phage phiCD211

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
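The calculation described above can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: it assumes overlapping residue pairs counted within each protein, one-letter sequence input, and a uniform expectation of 1/400 as the over/under threshold. The names `dipeptide_permille`, `classify`, and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as 'X' (Xaa).
        clean = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Pairs are counted within a protein only, never across protein boundaries.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

# Toy input (hypothetical sequences, not taken from the phiCD211 proteome).
toy = ["MKKLE", "KKE"]
freqs = dipeptide_permille(toy)
```

With this threshold, a table value such as GluGlu (8.377‰) would be classified as "over" and CysCys (0.316‰) as "under". Whether the page colours cells by exactly this rule is an assumption.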

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
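As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and expresses it as a fold change relative to a uniform expectation of 2.5‰ (1/400). The function names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0

def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation 1000/n."""
    return value / (1000.0 / n_dipeptides)

# GluGlu is listed at 8.377 per mille in the table.
glu_glu_fraction = permille_to_fraction(8.377)
glu_glu_fold = fold_over_uniform(8.377)
```

Here `glu_glu_fraction` is about 0.008377, and `glu_glu_fold` is about 3.35, meaning GluGlu occurs roughly 3.35 times as often as it would under a uniform distribution of dipeptides.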

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.054 ± 0.214
AlaCys: 0.5 ± 0.123
AlaAsp: 1.976 ± 0.273
AlaGlu: 2.74 ± 0.494
AlaPhe: 1.58 ± 0.178
AlaGly: 1.949 ± 0.284
AlaHis: 0.448 ± 0.102
AlaIle: 4.109 ± 0.392
AlaLys: 4.583 ± 0.528
AlaLeu: 3.319 ± 0.392
AlaMet: 0.685 ± 0.178
AlaAsn: 2.502 ± 0.332
AlaPro: 0.711 ± 0.152
AlaGln: 0.922 ± 0.164
AlaArg: 1.291 ± 0.179
AlaSer: 2.423 ± 0.287
AlaThr: 2.239 ± 0.291
AlaVal: 1.422 ± 0.227
AlaTrp: 0.395 ± 0.098
AlaTyr: 1.818 ± 0.228
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.316 ± 0.085
CysCys: 0.316 ± 0.093
CysAsp: 1.106 ± 0.187
CysGlu: 1.001 ± 0.194
CysPhe: 0.58 ± 0.121
CysGly: 1.317 ± 0.265
CysHis: 0.263 ± 0.077
CysIle: 1.343 ± 0.214
CysLys: 1.554 ± 0.27
CysLeu: 1.001 ± 0.175
CysMet: 0.527 ± 0.107
CysAsn: 1.238 ± 0.213
CysPro: 0.316 ± 0.086
CysGln: 0.237 ± 0.08
CysArg: 0.632 ± 0.145
CysSer: 1.054 ± 0.168
CysThr: 0.448 ± 0.112
CysVal: 0.738 ± 0.142
CysTrp: 0.184 ± 0.076
CysTyr: 0.922 ± 0.195
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.713 ± 0.407
AspCys: 0.896 ± 0.169
AspAsp: 3.451 ± 0.324
AspGlu: 5.532 ± 0.507
AspPhe: 2.397 ± 0.236
AspGly: 3.293 ± 0.349
AspHis: 0.421 ± 0.112
AspIle: 8.192 ± 0.491
AspLys: 7.27 ± 0.687
AspLeu: 5.505 ± 0.373
AspMet: 1.739 ± 0.19
AspAsn: 4.952 ± 0.571
AspPro: 0.553 ± 0.12
AspGln: 0.342 ± 0.088
AspArg: 2.845 ± 0.237
AspSer: 3.53 ± 0.289
AspThr: 3.74 ± 0.31
AspVal: 4.057 ± 0.289
AspTrp: 0.421 ± 0.115
AspTyr: 3.688 ± 0.378
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.608 ± 0.384
GluCys: 1.08 ± 0.177
GluAsp: 5.189 ± 0.406
GluGlu: 8.377 ± 0.688
GluPhe: 3.003 ± 0.317
GluGly: 3.161 ± 0.32
GluHis: 0.948 ± 0.135
GluIle: 9.114 ± 0.555
GluLys: 9.167 ± 0.732
GluLeu: 9.746 ± 0.501
GluMet: 1.949 ± 0.244
GluAsn: 7.06 ± 0.358
GluPro: 0.738 ± 0.159
GluGln: 2.028 ± 0.278
GluArg: 3.029 ± 0.318
GluSer: 4.057 ± 0.384
GluThr: 2.845 ± 0.299
GluVal: 4.873 ± 0.394
GluTrp: 0.975 ± 0.162
GluTyr: 4.847 ± 0.459
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.185 ± 0.22
PheCys: 0.632 ± 0.12
PheAsp: 2.555 ± 0.277
PheGlu: 3.398 ± 0.37
PhePhe: 1.291 ± 0.212
PheGly: 1.791 ± 0.299
PheHis: 0.316 ± 0.095
PheIle: 3.635 ± 0.337
PheLys: 4.373 ± 0.396
PheLeu: 3.582 ± 0.394
PheMet: 0.869 ± 0.153
PheAsn: 3.582 ± 0.369
PhePro: 0.738 ± 0.165
PheGln: 1.185 ± 0.196
PheArg: 1.238 ± 0.231
PheSer: 2.476 ± 0.238
PheThr: 2.055 ± 0.284
PheVal: 1.844 ± 0.219
PheTrp: 0.342 ± 0.096
PheTyr: 1.712 ± 0.241
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.897 ± 0.253
GlyCys: 0.79 ± 0.181
GlyAsp: 3.056 ± 0.314
GlyGlu: 3.477 ± 0.289
GlyPhe: 2.239 ± 0.259
GlyGly: 3.029 ± 0.748
GlyHis: 0.711 ± 0.148
GlyIle: 3.767 ± 0.35
GlyLys: 5.031 ± 0.4
GlyLeu: 4.136 ± 0.441
GlyMet: 1.238 ± 0.211
GlyAsn: 3.293 ± 0.413
GlyPro: 0.0 ± 0.0
GlyGln: 1.317 ± 0.225
GlyArg: 1.818 ± 0.217
GlySer: 2.792 ± 0.325
GlyThr: 2.476 ± 0.293
GlyVal: 2.45 ± 0.244
GlyTrp: 0.316 ± 0.1
GlyTyr: 3.372 ± 0.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.369 ± 0.116
HisCys: 0.395 ± 0.119
HisAsp: 0.869 ± 0.152
HisGlu: 0.527 ± 0.106
HisPhe: 0.659 ± 0.142
HisGly: 0.659 ± 0.188
HisHis: 0.158 ± 0.072
HisIle: 1.291 ± 0.171
HisLys: 1.264 ± 0.194
HisLeu: 0.948 ± 0.16
HisMet: 0.369 ± 0.099
HisAsn: 1.027 ± 0.181
HisPro: 0.184 ± 0.063
HisGln: 0.342 ± 0.101
HisArg: 0.421 ± 0.125
HisSer: 0.764 ± 0.144
HisThr: 0.606 ± 0.128
HisVal: 0.606 ± 0.137
HisTrp: 0.105 ± 0.059
HisTyr: 0.685 ± 0.127
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.162 ± 0.396
IleCys: 1.554 ± 0.222
IleAsp: 7.455 ± 0.517
IleGlu: 9.22 ± 0.574
IlePhe: 3.503 ± 0.356
IleGly: 3.74 ± 0.395
IleHis: 1.212 ± 0.197
IleIle: 7.902 ± 0.545
IleLys: 11.933 ± 0.631
IleLeu: 7.718 ± 0.431
IleMet: 1.791 ± 0.233
IleAsn: 7.981 ± 0.452
IlePro: 2.608 ± 0.288
IleGln: 2.977 ± 0.319
IleArg: 3.451 ± 0.347
IleSer: 7.244 ± 0.46
IleThr: 4.188 ± 0.337
IleVal: 4.399 ± 0.363
IleTrp: 0.632 ± 0.135
IleTyr: 4.136 ± 0.334
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.083 ± 0.441
LysCys: 1.58 ± 0.227
LysAsp: 7.349 ± 0.553
LysGlu: 12.301 ± 0.778
LysPhe: 4.109 ± 0.305
LysGly: 4.03 ± 0.338
LysHis: 1.238 ± 0.194
LysIle: 10.774 ± 0.492
LysLys: 10.405 ± 0.657
LysLeu: 9.404 ± 0.614
LysMet: 2.318 ± 0.26
LysAsn: 8.35 ± 0.43
LysPro: 1.923 ± 0.252
LysGln: 3.582 ± 0.368
LysArg: 3.899 ± 0.461
LysSer: 6.401 ± 0.404
LysThr: 5.242 ± 0.405
LysVal: 5.584 ± 0.399
LysTrp: 0.817 ± 0.143
LysTyr: 6.98 ± 0.54
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.424 ± 0.324
LeuCys: 1.238 ± 0.204
LeuAsp: 7.007 ± 0.587
LeuGlu: 8.719 ± 0.422
LeuPhe: 2.66 ± 0.231
LeuGly: 4.741 ± 0.525
LeuHis: 1.08 ± 0.221
LeuIle: 7.876 ± 0.559
LeuLys: 10.8 ± 0.683
LeuLeu: 6.48 ± 0.446
LeuMet: 1.949 ± 0.203
LeuAsn: 7.692 ± 0.475
LeuPro: 1.791 ± 0.205
LeuGln: 2.555 ± 0.255
LeuArg: 2.66 ± 0.321
LeuSer: 5.98 ± 0.369
LeuThr: 4.636 ± 0.319
LeuVal: 2.95 ± 0.237
LeuTrp: 0.553 ± 0.14
LeuTyr: 3.398 ± 0.356
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.843 ± 0.18
MetCys: 0.316 ± 0.09
MetAsp: 1.343 ± 0.192
MetGlu: 1.712 ± 0.256
MetPhe: 0.79 ± 0.141
MetGly: 0.896 ± 0.129
MetHis: 0.237 ± 0.083
MetIle: 2.134 ± 0.262
MetLys: 2.898 ± 0.315
MetLeu: 1.949 ± 0.252
MetMet: 0.711 ± 0.165
MetAsn: 1.818 ± 0.22
MetPro: 0.448 ± 0.106
MetGln: 0.685 ± 0.116
MetArg: 0.922 ± 0.141
MetSer: 1.791 ± 0.202
MetThr: 1.001 ± 0.164
MetVal: 0.764 ± 0.151
MetTrp: 0.053 ± 0.036
MetTyr: 1.133 ± 0.164
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.687 ± 0.311
AsnCys: 1.212 ± 0.198
AsnAsp: 4.557 ± 0.419
AsnGlu: 6.138 ± 0.41
AsnPhe: 2.423 ± 0.258
AsnGly: 3.82 ± 0.374
AsnHis: 1.027 ± 0.165
AsnIle: 8.324 ± 0.602
AsnLys: 9.088 ± 0.561
AsnLeu: 6.98 ± 0.419
AsnMet: 1.712 ± 0.224
AsnAsn: 5.953 ± 0.571
AsnPro: 2.397 ± 0.224
AsnGln: 2.239 ± 0.222
AsnArg: 3.214 ± 0.278
AsnSer: 5.242 ± 0.415
AsnThr: 4.267 ± 0.445
AsnVal: 3.609 ± 0.337
AsnTrp: 0.553 ± 0.133
AsnTyr: 4.241 ± 0.294
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.474 ± 0.124
ProCys: 0.395 ± 0.089
ProAsp: 1.106 ± 0.227
ProGlu: 1.054 ± 0.217
ProPhe: 1.212 ± 0.174
ProGly: 0.527 ± 0.141
ProHis: 0.5 ± 0.121
ProIle: 2.028 ± 0.219
ProLys: 2.186 ± 0.254
ProLeu: 1.185 ± 0.169
ProMet: 0.211 ± 0.079
ProAsn: 1.37 ± 0.191
ProPro: 0.553 ± 0.125
ProGln: 0.5 ± 0.117
ProArg: 0.553 ± 0.114
ProSer: 1.422 ± 0.206
ProThr: 1.159 ± 0.218
ProVal: 1.212 ± 0.18
ProTrp: 0.026 ± 0.028
ProTyr: 0.948 ± 0.171
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.449 ± 0.269
GlnCys: 0.211 ± 0.07
GlnAsp: 1.818 ± 0.248
GlnGlu: 2.502 ± 0.237
GlnPhe: 1.185 ± 0.149
GlnGly: 1.133 ± 0.181
GlnHis: 0.316 ± 0.101
GlnIle: 2.766 ± 0.217
GlnLys: 2.792 ± 0.304
GlnLeu: 2.845 ± 0.305
GlnMet: 0.79 ± 0.169
GlnAsn: 1.791 ± 0.237
GlnPro: 0.5 ± 0.09
GlnGln: 1.212 ± 0.184
GlnArg: 1.054 ± 0.135
GlnSer: 1.185 ± 0.189
GlnThr: 1.264 ± 0.206
GlnVal: 1.475 ± 0.25
GlnTrp: 0.132 ± 0.071
GlnTyr: 0.843 ± 0.158
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.633 ± 0.198
ArgCys: 0.738 ± 0.156
ArgAsp: 2.107 ± 0.256
ArgGlu: 2.766 ± 0.309
ArgPhe: 1.686 ± 0.209
ArgGly: 1.765 ± 0.25
ArgHis: 0.421 ± 0.11
ArgIle: 3.714 ± 0.313
ArgLys: 3.582 ± 0.295
ArgLeu: 3.266 ± 0.338
ArgMet: 1.027 ± 0.177
ArgAsn: 2.687 ± 0.298
ArgPro: 0.606 ± 0.124
ArgGln: 1.106 ± 0.155
ArgArg: 1.001 ± 0.218
ArgSer: 1.87 ± 0.216
ArgThr: 1.449 ± 0.226
ArgVal: 1.844 ± 0.228
ArgTrp: 0.395 ± 0.111
ArgTyr: 1.87 ± 0.225
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.739 ± 0.289
SerCys: 0.659 ± 0.133
SerAsp: 4.373 ± 0.345
SerGlu: 4.083 ± 0.358
SerPhe: 2.634 ± 0.244
SerGly: 3.451 ± 0.39
SerHis: 0.738 ± 0.169
SerIle: 5.927 ± 0.446
SerLys: 7.06 ± 0.404
SerLeu: 5.98 ± 0.438
SerMet: 1.58 ± 0.231
SerAsn: 5.426 ± 0.55
SerPro: 0.948 ± 0.154
SerGln: 1.712 ± 0.223
SerArg: 2.502 ± 0.249
SerSer: 4.452 ± 0.424
SerThr: 3.556 ± 0.416
SerVal: 2.74 ± 0.318
SerTrp: 0.474 ± 0.111
SerTyr: 3.161 ± 0.28
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.213 ± 0.346
ThrCys: 0.79 ± 0.148
ThrAsp: 3.24 ± 0.274
ThrGlu: 2.95 ± 0.304
ThrPhe: 1.897 ± 0.337
ThrGly: 2.924 ± 0.311
ThrHis: 0.632 ± 0.141
ThrIle: 5.005 ± 0.458
ThrLys: 5.137 ± 0.436
ThrLeu: 4.741 ± 0.335
ThrMet: 0.711 ± 0.124
ThrAsn: 3.556 ± 0.357
ThrPro: 1.343 ± 0.244
ThrGln: 1.37 ± 0.185
ThrArg: 1.739 ± 0.192
ThrSer: 3.503 ± 0.412
ThrThr: 2.529 ± 0.317
ThrVal: 2.502 ± 0.305
ThrTrp: 0.342 ± 0.098
ThrTyr: 2.265 ± 0.256
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.002 ± 0.226
ValCys: 0.738 ± 0.132
ValAsp: 3.214 ± 0.292
ValGlu: 4.03 ± 0.383
ValPhe: 2.502 ± 0.27
ValGly: 2.423 ± 0.262
ValHis: 0.58 ± 0.117
ValIle: 3.899 ± 0.28
ValLys: 5.031 ± 0.423
ValLeu: 3.978 ± 0.349
ValMet: 0.843 ± 0.145
ValAsn: 4.241 ± 0.391
ValPro: 1.212 ± 0.208
ValGln: 1.66 ± 0.24
ValArg: 1.475 ± 0.17
ValSer: 2.74 ± 0.31
ValThr: 2.581 ± 0.322
ValVal: 2.581 ± 0.264
ValTrp: 0.474 ± 0.125
ValTyr: 2.292 ± 0.247
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.29 ± 0.089
TrpCys: 0.263 ± 0.082
TrpAsp: 0.606 ± 0.133
TrpGlu: 0.421 ± 0.112
TrpPhe: 0.237 ± 0.081
TrpGly: 0.5 ± 0.111
TrpHis: 0.079 ± 0.048
TrpIle: 0.58 ± 0.123
TrpLys: 0.685 ± 0.157
TrpLeu: 0.764 ± 0.134
TrpMet: 0.132 ± 0.058
TrpAsn: 0.817 ± 0.153
TrpPro: 0.0 ± 0.0
TrpGln: 0.237 ± 0.077
TrpArg: 0.29 ± 0.089
TrpSer: 0.5 ± 0.149
TrpThr: 0.184 ± 0.071
TrpVal: 0.421 ± 0.114
TrpTrp: 0.053 ± 0.035
TrpTyr: 0.474 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.528 ± 0.18
TyrCys: 0.79 ± 0.18
TyrAsp: 3.161 ± 0.285
TyrGlu: 3.846 ± 0.388
TyrPhe: 2.213 ± 0.255
TyrGly: 2.055 ± 0.328
TyrHis: 0.922 ± 0.168
TyrIle: 5.426 ± 0.515
TyrLys: 5.532 ± 0.501
TyrLeu: 4.689 ± 0.494
TyrMet: 1.264 ± 0.183
TyrAsn: 4.346 ± 0.317
TyrPro: 1.054 ± 0.185
TyrGln: 1.054 ± 0.138
TyrArg: 1.501 ± 0.221
TyrSer: 3.767 ± 0.36
TyrThr: 2.924 ± 0.345
TyrVal: 2.423 ± 0.351
TyrTrp: 0.29 ± 0.092
TyrTyr: 2.344 ± 0.33
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 190 proteins (37964 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
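A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the sample standard deviation of the 100 estimates is taken as the error. The choice of standard deviation as the error measure, the function names, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)

# Toy proteome (hypothetical one-letter sequences).
proteins = ["MKKLE", "KKE", "AAKK", "GG"]
kk_error = bootstrap_error(proteins, "KK")
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps each protein's internal composition intact, so the error reflects variation between proteins.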

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski