Amino acid dipepetide frequency for Lactobacillus phage 3-521

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.291AlaAla: 3.291 ± 0.446
0.216AlaCys: 0.216 ± 0.077
3.892AlaAsp: 3.892 ± 0.282
2.883AlaGlu: 2.883 ± 0.27
1.778AlaPhe: 1.778 ± 0.182
3.555AlaGly: 3.555 ± 0.503
0.817AlaHis: 0.817 ± 0.131
3.82AlaIle: 3.82 ± 0.266
4.444AlaLys: 4.444 ± 0.377
4.444AlaLeu: 4.444 ± 0.413
1.345AlaMet: 1.345 ± 0.178
3.868AlaAsn: 3.868 ± 0.342
1.754AlaPro: 1.754 ± 0.232
2.282AlaGln: 2.282 ± 0.308
1.345AlaArg: 1.345 ± 0.211
5.405AlaSer: 5.405 ± 0.51
4.012AlaThr: 4.012 ± 0.453
3.219AlaVal: 3.219 ± 0.287
0.673AlaTrp: 0.673 ± 0.125
3.267AlaTyr: 3.267 ± 0.257
0.0AlaXaa: 0.0 ± 0.0
Cys
0.288CysAla: 0.288 ± 0.089
0.048CysCys: 0.048 ± 0.033
0.456CysAsp: 0.456 ± 0.117
0.264CysGlu: 0.264 ± 0.082
0.168CysPhe: 0.168 ± 0.06
0.745CysGly: 0.745 ± 0.178
0.144CysHis: 0.144 ± 0.06
0.384CysIle: 0.384 ± 0.093
0.432CysLys: 0.432 ± 0.11
0.408CysLeu: 0.408 ± 0.097
0.072CysMet: 0.072 ± 0.044
0.36CysAsn: 0.36 ± 0.098
0.384CysPro: 0.384 ± 0.141
0.192CysGln: 0.192 ± 0.077
0.288CysArg: 0.288 ± 0.088
0.456CysSer: 0.456 ± 0.11
0.649CysThr: 0.649 ± 0.132
0.336CysVal: 0.336 ± 0.098
0.072CysTrp: 0.072 ± 0.043
0.312CysTyr: 0.312 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
4.396AspAla: 4.396 ± 0.363
0.456AspCys: 0.456 ± 0.099
4.108AspAsp: 4.108 ± 0.282
4.588AspGlu: 4.588 ± 0.367
2.426AspPhe: 2.426 ± 0.216
4.396AspGly: 4.396 ± 0.32
0.793AspHis: 0.793 ± 0.157
4.132AspIle: 4.132 ± 0.292
5.429AspLys: 5.429 ± 0.362
6.126AspLeu: 6.126 ± 0.423
1.898AspMet: 1.898 ± 0.226
4.613AspAsn: 4.613 ± 0.463
1.946AspPro: 1.946 ± 0.235
1.369AspGln: 1.369 ± 0.202
1.682AspArg: 1.682 ± 0.261
6.438AspSer: 6.438 ± 0.454
7.279AspThr: 7.279 ± 0.428
4.18AspVal: 4.18 ± 0.315
0.841AspTrp: 0.841 ± 0.132
3.724AspTyr: 3.724 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
3.243GluAla: 3.243 ± 0.271
0.312GluCys: 0.312 ± 0.104
4.276GluAsp: 4.276 ± 0.385
4.036GluGlu: 4.036 ± 0.424
1.682GluPhe: 1.682 ± 0.22
3.7GluGly: 3.7 ± 0.284
1.273GluHis: 1.273 ± 0.201
3.123GluIle: 3.123 ± 0.276
3.988GluLys: 3.988 ± 0.417
6.102GluLeu: 6.102 ± 0.445
1.465GluMet: 1.465 ± 0.217
2.643GluAsn: 2.643 ± 0.269
1.345GluPro: 1.345 ± 0.217
2.498GluGln: 2.498 ± 0.28
1.802GluArg: 1.802 ± 0.262
3.916GluSer: 3.916 ± 0.368
2.522GluThr: 2.522 ± 0.22
4.372GluVal: 4.372 ± 0.352
0.504GluTrp: 0.504 ± 0.121
3.435GluTyr: 3.435 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
1.417PheAla: 1.417 ± 0.198
0.216PheCys: 0.216 ± 0.083
2.643PheAsp: 2.643 ± 0.254
1.321PheGlu: 1.321 ± 0.187
0.697PhePhe: 0.697 ± 0.168
1.874PheGly: 1.874 ± 0.263
0.408PheHis: 0.408 ± 0.094
1.802PheIle: 1.802 ± 0.237
2.546PheLys: 2.546 ± 0.235
2.498PheLeu: 2.498 ± 0.273
0.793PheMet: 0.793 ± 0.147
2.739PheAsn: 2.739 ± 0.293
0.889PhePro: 0.889 ± 0.132
0.841PheGln: 0.841 ± 0.138
0.889PheArg: 0.889 ± 0.142
3.243PheSer: 3.243 ± 0.304
2.859PheThr: 2.859 ± 0.297
2.33PheVal: 2.33 ± 0.292
0.336PheTrp: 0.336 ± 0.081
1.754PheTyr: 1.754 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
3.387GlyAla: 3.387 ± 0.428
0.336GlyCys: 0.336 ± 0.1
4.012GlyAsp: 4.012 ± 0.312
2.811GlyGlu: 2.811 ± 0.248
1.73GlyPhe: 1.73 ± 0.232
2.907GlyGly: 2.907 ± 0.488
1.057GlyHis: 1.057 ± 0.204
3.868GlyIle: 3.868 ± 0.369
5.405GlyLys: 5.405 ± 0.468
4.372GlyLeu: 4.372 ± 0.35
1.513GlyMet: 1.513 ± 0.197
4.661GlyAsn: 4.661 ± 0.37
0.553GlyPro: 0.553 ± 0.125
2.234GlyGln: 2.234 ± 0.21
1.634GlyArg: 1.634 ± 0.261
4.949GlySer: 4.949 ± 0.499
7.039GlyThr: 7.039 ± 0.681
4.324GlyVal: 4.324 ± 0.351
0.745GlyTrp: 0.745 ± 0.129
4.444GlyTyr: 4.444 ± 0.359
0.0GlyXaa: 0.0 ± 0.0
His
0.504HisAla: 0.504 ± 0.144
0.168HisCys: 0.168 ± 0.076
1.177HisAsp: 1.177 ± 0.183
0.961HisGlu: 0.961 ± 0.167
0.865HisPhe: 0.865 ± 0.159
1.057HisGly: 1.057 ± 0.138
0.601HisHis: 0.601 ± 0.157
1.105HisIle: 1.105 ± 0.166
1.129HisLys: 1.129 ± 0.181
1.538HisLeu: 1.538 ± 0.191
0.312HisMet: 0.312 ± 0.079
1.057HisAsn: 1.057 ± 0.131
0.625HisPro: 0.625 ± 0.116
0.769HisGln: 0.769 ± 0.134
0.577HisArg: 0.577 ± 0.144
1.369HisSer: 1.369 ± 0.161
0.937HisThr: 0.937 ± 0.157
1.465HisVal: 1.465 ± 0.191
0.264HisTrp: 0.264 ± 0.077
1.153HisTyr: 1.153 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
3.676IleAla: 3.676 ± 0.316
0.408IleCys: 0.408 ± 0.128
4.781IleAsp: 4.781 ± 0.342
3.027IleGlu: 3.027 ± 0.281
1.922IlePhe: 1.922 ± 0.226
3.387IleGly: 3.387 ± 0.233
0.937IleHis: 0.937 ± 0.159
3.219IleIle: 3.219 ± 0.323
4.516IleLys: 4.516 ± 0.399
3.652IleLeu: 3.652 ± 0.393
1.129IleMet: 1.129 ± 0.179
4.588IleAsn: 4.588 ± 0.359
2.258IlePro: 2.258 ± 0.334
1.754IleGln: 1.754 ± 0.22
1.73IleArg: 1.73 ± 0.199
5.742IleSer: 5.742 ± 0.445
3.844IleThr: 3.844 ± 0.388
3.796IleVal: 3.796 ± 0.332
0.36IleTrp: 0.36 ± 0.093
2.234IleTyr: 2.234 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
4.324LysAla: 4.324 ± 0.442
0.48LysCys: 0.48 ± 0.111
5.694LysAsp: 5.694 ± 0.41
5.838LysGlu: 5.838 ± 0.519
1.874LysPhe: 1.874 ± 0.278
4.901LysGly: 4.901 ± 0.441
1.321LysHis: 1.321 ± 0.187
3.315LysIle: 3.315 ± 0.317
5.045LysLys: 5.045 ± 0.488
5.718LysLeu: 5.718 ± 0.307
2.426LysMet: 2.426 ± 0.24
3.7LysAsn: 3.7 ± 0.296
1.73LysPro: 1.73 ± 0.273
3.315LysGln: 3.315 ± 0.342
2.282LysArg: 2.282 ± 0.289
5.285LysSer: 5.285 ± 0.4
3.555LysThr: 3.555 ± 0.289
6.991LysVal: 6.991 ± 0.409
0.577LysTrp: 0.577 ± 0.114
4.54LysTyr: 4.54 ± 0.464
0.0LysXaa: 0.0 ± 0.0
Leu
4.588LeuAla: 4.588 ± 0.326
0.456LeuCys: 0.456 ± 0.101
7.087LeuAsp: 7.087 ± 0.481
4.829LeuGlu: 4.829 ± 0.365
2.955LeuPhe: 2.955 ± 0.244
5.165LeuGly: 5.165 ± 0.378
1.105LeuHis: 1.105 ± 0.177
4.036LeuIle: 4.036 ± 0.362
5.766LeuLys: 5.766 ± 0.483
7.159LeuLeu: 7.159 ± 0.522
1.97LeuMet: 1.97 ± 0.227
5.309LeuAsn: 5.309 ± 0.299
3.892LeuPro: 3.892 ± 0.374
3.171LeuGln: 3.171 ± 0.332
2.45LeuArg: 2.45 ± 0.302
8.168LeuSer: 8.168 ± 0.451
5.646LeuThr: 5.646 ± 0.366
5.934LeuVal: 5.934 ± 0.484
0.793LeuTrp: 0.793 ± 0.214
3.796LeuTyr: 3.796 ± 0.302
0.0LeuXaa: 0.0 ± 0.0
Met
1.634MetAla: 1.634 ± 0.214
0.144MetCys: 0.144 ± 0.068
1.634MetAsp: 1.634 ± 0.16
1.538MetGlu: 1.538 ± 0.184
0.553MetPhe: 0.553 ± 0.123
1.225MetGly: 1.225 ± 0.17
0.288MetHis: 0.288 ± 0.073
1.345MetIle: 1.345 ± 0.221
1.61MetLys: 1.61 ± 0.224
1.634MetLeu: 1.634 ± 0.241
0.601MetMet: 0.601 ± 0.12
1.465MetAsn: 1.465 ± 0.181
0.697MetPro: 0.697 ± 0.141
1.225MetGln: 1.225 ± 0.186
0.697MetArg: 0.697 ± 0.127
2.018MetSer: 2.018 ± 0.212
1.297MetThr: 1.297 ± 0.191
1.297MetVal: 1.297 ± 0.189
0.216MetTrp: 0.216 ± 0.062
1.465MetTyr: 1.465 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
3.507AsnAla: 3.507 ± 0.384
0.288AsnCys: 0.288 ± 0.087
3.724AsnAsp: 3.724 ± 0.363
3.147AsnGlu: 3.147 ± 0.289
2.234AsnPhe: 2.234 ± 0.238
4.084AsnGly: 4.084 ± 0.323
1.393AsnHis: 1.393 ± 0.184
3.652AsnIle: 3.652 ± 0.335
5.021AsnLys: 5.021 ± 0.392
4.901AsnLeu: 4.901 ± 0.331
1.465AsnMet: 1.465 ± 0.183
4.228AsnAsn: 4.228 ± 0.288
3.075AsnPro: 3.075 ± 0.341
2.787AsnGln: 2.787 ± 0.198
2.571AsnArg: 2.571 ± 0.23
5.597AsnSer: 5.597 ± 0.404
3.82AsnThr: 3.82 ± 0.342
3.868AsnVal: 3.868 ± 0.402
0.601AsnTrp: 0.601 ± 0.115
3.844AsnTyr: 3.844 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
1.345ProAla: 1.345 ± 0.193
0.144ProCys: 0.144 ± 0.061
2.691ProAsp: 2.691 ± 0.323
2.522ProGlu: 2.522 ± 0.28
1.105ProPhe: 1.105 ± 0.234
1.586ProGly: 1.586 ± 0.17
0.36ProHis: 0.36 ± 0.094
1.97ProIle: 1.97 ± 0.231
2.739ProLys: 2.739 ± 0.299
2.739ProLeu: 2.739 ± 0.211
0.48ProMet: 0.48 ± 0.107
2.114ProAsn: 2.114 ± 0.197
0.504ProPro: 0.504 ± 0.091
0.841ProGln: 0.841 ± 0.149
0.961ProArg: 0.961 ± 0.158
3.051ProSer: 3.051 ± 0.319
2.066ProThr: 2.066 ± 0.308
2.835ProVal: 2.835 ± 0.256
0.336ProTrp: 0.336 ± 0.083
2.186ProTyr: 2.186 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
2.763GlnAla: 2.763 ± 0.326
0.12GlnCys: 0.12 ± 0.055
2.474GlnAsp: 2.474 ± 0.235
2.931GlnGlu: 2.931 ± 0.278
1.009GlnPhe: 1.009 ± 0.149
3.435GlnGly: 3.435 ± 0.273
0.721GlnHis: 0.721 ± 0.119
1.634GlnIle: 1.634 ± 0.229
2.33GlnLys: 2.33 ± 0.228
3.892GlnLeu: 3.892 ± 0.302
0.865GlnMet: 0.865 ± 0.17
1.658GlnAsn: 1.658 ± 0.2
0.889GlnPro: 0.889 ± 0.146
1.73GlnGln: 1.73 ± 0.2
1.321GlnArg: 1.321 ± 0.198
2.474GlnSer: 2.474 ± 0.218
1.73GlnThr: 1.73 ± 0.207
3.844GlnVal: 3.844 ± 0.316
0.36GlnTrp: 0.36 ± 0.101
2.114GlnTyr: 2.114 ± 0.246
0.0GlnXaa: 0.0 ± 0.0
Arg
1.634ArgAla: 1.634 ± 0.211
0.312ArgCys: 0.312 ± 0.096
1.898ArgAsp: 1.898 ± 0.272
1.898ArgGlu: 1.898 ± 0.25
1.081ArgPhe: 1.081 ± 0.154
1.85ArgGly: 1.85 ± 0.268
0.889ArgHis: 0.889 ± 0.168
2.138ArgIle: 2.138 ± 0.237
2.595ArgLys: 2.595 ± 0.298
2.643ArgLeu: 2.643 ± 0.28
0.625ArgMet: 0.625 ± 0.159
1.538ArgAsn: 1.538 ± 0.205
0.865ArgPro: 0.865 ± 0.156
1.129ArgGln: 1.129 ± 0.224
1.321ArgArg: 1.321 ± 0.187
1.946ArgSer: 1.946 ± 0.216
1.802ArgThr: 1.802 ± 0.241
2.426ArgVal: 2.426 ± 0.183
0.192ArgTrp: 0.192 ± 0.076
2.162ArgTyr: 2.162 ± 0.262
0.0ArgXaa: 0.0 ± 0.0
Ser
4.516SerAla: 4.516 ± 0.447
0.529SerCys: 0.529 ± 0.106
6.558SerAsp: 6.558 ± 0.396
4.42SerGlu: 4.42 ± 0.353
2.787SerPhe: 2.787 ± 0.335
6.414SerGly: 6.414 ± 0.554
1.249SerHis: 1.249 ± 0.18
5.525SerIle: 5.525 ± 0.461
6.823SerLys: 6.823 ± 0.438
7.471SerLeu: 7.471 ± 0.572
1.586SerMet: 1.586 ± 0.156
5.838SerAsn: 5.838 ± 0.433
2.619SerPro: 2.619 ± 0.228
3.195SerGln: 3.195 ± 0.339
2.33SerArg: 2.33 ± 0.289
9.369SerSer: 9.369 ± 0.855
6.847SerThr: 6.847 ± 0.591
5.165SerVal: 5.165 ± 0.424
1.081SerTrp: 1.081 ± 0.234
4.132SerTyr: 4.132 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
4.372ThrAla: 4.372 ± 0.466
0.456ThrCys: 0.456 ± 0.099
5.429ThrAsp: 5.429 ± 0.453
3.147ThrGlu: 3.147 ± 0.313
2.859ThrPhe: 2.859 ± 0.298
4.54ThrGly: 4.54 ± 0.438
1.297ThrHis: 1.297 ± 0.205
5.237ThrIle: 5.237 ± 0.492
4.324ThrLys: 4.324 ± 0.358
6.006ThrLeu: 6.006 ± 0.414
1.369ThrMet: 1.369 ± 0.203
4.156ThrAsn: 4.156 ± 0.309
3.027ThrPro: 3.027 ± 0.374
1.97ThrGln: 1.97 ± 0.206
2.258ThrArg: 2.258 ± 0.274
6.751ThrSer: 6.751 ± 0.629
5.453ThrThr: 5.453 ± 0.546
5.189ThrVal: 5.189 ± 0.404
0.913ThrTrp: 0.913 ± 0.163
3.483ThrTyr: 3.483 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
4.06ValAla: 4.06 ± 0.396
0.456ValCys: 0.456 ± 0.132
4.18ValAsp: 4.18 ± 0.338
3.219ValGlu: 3.219 ± 0.315
2.066ValPhe: 2.066 ± 0.277
3.507ValGly: 3.507 ± 0.318
1.465ValHis: 1.465 ± 0.181
3.387ValIle: 3.387 ± 0.263
4.372ValLys: 4.372 ± 0.336
6.486ValLeu: 6.486 ± 0.468
1.562ValMet: 1.562 ± 0.172
4.396ValAsn: 4.396 ± 0.371
3.483ValPro: 3.483 ± 0.306
3.459ValGln: 3.459 ± 0.293
2.09ValArg: 2.09 ± 0.196
7.087ValSer: 7.087 ± 0.412
6.246ValThr: 6.246 ± 0.449
4.54ValVal: 4.54 ± 0.372
0.504ValTrp: 0.504 ± 0.109
3.363ValTyr: 3.363 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.408TrpAla: 0.408 ± 0.13
0.168TrpCys: 0.168 ± 0.071
0.625TrpAsp: 0.625 ± 0.109
0.745TrpGlu: 0.745 ± 0.111
0.48TrpPhe: 0.48 ± 0.132
0.721TrpGly: 0.721 ± 0.135
0.12TrpHis: 0.12 ± 0.056
0.432TrpIle: 0.432 ± 0.097
0.456TrpLys: 0.456 ± 0.119
1.081TrpLeu: 1.081 ± 0.162
0.048TrpMet: 0.048 ± 0.034
0.937TrpAsn: 0.937 ± 0.223
0.048TrpPro: 0.048 ± 0.03
0.336TrpGln: 0.336 ± 0.094
0.529TrpArg: 0.529 ± 0.098
0.745TrpSer: 0.745 ± 0.147
0.577TrpThr: 0.577 ± 0.164
0.913TrpVal: 0.913 ± 0.196
0.12TrpTrp: 0.12 ± 0.051
0.745TrpTyr: 0.745 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.955TyrAla: 2.955 ± 0.3
0.745TyrCys: 0.745 ± 0.133
3.483TyrAsp: 3.483 ± 0.309
2.114TyrGlu: 2.114 ± 0.224
1.826TyrPhe: 1.826 ± 0.202
2.931TyrGly: 2.931 ± 0.332
1.393TyrHis: 1.393 ± 0.166
2.835TyrIle: 2.835 ± 0.272
3.94TyrLys: 3.94 ± 0.387
5.429TyrLeu: 5.429 ± 0.461
1.033TyrMet: 1.033 ± 0.158
3.94TyrAsn: 3.94 ± 0.382
1.994TyrPro: 1.994 ± 0.205
3.195TyrGln: 3.195 ± 0.269
2.186TyrArg: 2.186 ± 0.261
4.444TyrSer: 4.444 ± 0.436
4.06TyrThr: 4.06 ± 0.355
2.907TyrVal: 2.907 ± 0.254
0.745TyrTrp: 0.745 ± 0.122
3.099TyrTyr: 3.099 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 155 proteins (41627 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski