Amino acid dipeptide frequency for Rhizobium phage vB_RleM_P10VF

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
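The calculation described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet, the mapping of every non-standard character to X (Xaa), and the choice to normalise by the total number of overlapping dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"           # X stands for Xaa (assumed convention)
UNIFORM_EXPECTATION = 1000.0 / 400     # 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted within each protein only, never
    across protein boundaries (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Compare a frequency with the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"      # shown in red in the table
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"     # shown in blue in the table
    return "expected"


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MKKLA", "AKKL"])
print(len(freqs), round(freqs["KK"], 3), classify(freqs["KK"]))
```

Applied to values from the table, `classify(4.878)` (AlaAla) gives "over" and `classify(0.553)` (AlaCys) gives "under".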

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell shows frequency ± error, in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.878 ± 0.39 | 0.553 ± 0.125 | 4.201 ± 0.275 | 4.201 ± 0.275 | 2.644 ± 0.242 | 4.037 ± 0.381 | 1.045 ± 0.131 | 4.878 ± 0.36 | 4.57 ± 0.386 | 5.411 ± 0.345 | 1.804 ± 0.235 | 2.787 ± 0.237 | 2.152 ± 0.231 | 1.885 ± 0.198 | 3.238 ± 0.258 | 4.386 ± 0.318 | 3.771 ± 0.293 | 4.14 ± 0.295 | 0.902 ± 0.141 | 2.09 ± 0.174 | 0.0 ± 0.0
Cys | 0.574 ± 0.106 | 0.143 ± 0.044 | 0.574 ± 0.115 | 0.635 ± 0.116 | 0.492 ± 0.123 | 0.553 ± 0.097 | 0.287 ± 0.066 | 0.389 ± 0.097 | 0.758 ± 0.13 | 0.697 ± 0.127 | 0.143 ± 0.055 | 0.266 ± 0.072 | 0.328 ± 0.078 | 0.225 ± 0.059 | 0.635 ± 0.105 | 0.389 ± 0.075 | 0.328 ± 0.097 | 0.738 ± 0.138 | 0.041 ± 0.029 | 0.41 ± 0.095 | 0.0 ± 0.0
Asp | 4.263 ± 0.291 | 0.574 ± 0.11 | 5.124 ± 0.386 | 5.226 ± 0.312 | 4.058 ± 0.274 | 4.734 ± 0.297 | 1.517 ± 0.199 | 4.816 ± 0.305 | 4.57 ± 0.36 | 6.005 ± 0.378 | 1.844 ± 0.214 | 2.951 ± 0.228 | 3.197 ± 0.243 | 2.172 ± 0.236 | 3.197 ± 0.257 | 4.078 ± 0.293 | 3.32 ± 0.263 | 4.55 ± 0.359 | 1.189 ± 0.169 | 3.074 ± 0.285 | 0.0 ± 0.0
Glu | 4.447 ± 0.359 | 0.594 ± 0.107 | 4.201 ± 0.362 | 5.492 ± 0.438 | 3.976 ± 0.281 | 3.013 ± 0.262 | 1.025 ± 0.151 | 6.148 ± 0.354 | 5.042 ± 0.387 | 6.169 ± 0.392 | 1.988 ± 0.219 | 3.587 ± 0.265 | 1.783 ± 0.169 | 2.213 ± 0.214 | 3.587 ± 0.325 | 3.402 ± 0.258 | 4.222 ± 0.288 | 4.673 ± 0.283 | 1.004 ± 0.141 | 2.685 ± 0.248 | 0.0 ± 0.0
Phe | 3.32 ± 0.281 | 0.512 ± 0.102 | 4.611 ± 0.32 | 4.427 ± 0.304 | 2.398 ± 0.289 | 3.218 ± 0.248 | 0.922 ± 0.128 | 2.541 ± 0.218 | 3.177 ± 0.244 | 3.853 ± 0.272 | 1.353 ± 0.169 | 2.48 ± 0.24 | 1.394 ± 0.165 | 1.373 ± 0.155 | 2.295 ± 0.248 | 3.361 ± 0.26 | 3.197 ± 0.251 | 4.017 ± 0.317 | 0.697 ± 0.128 | 1.619 ± 0.176 | 0.0 ± 0.0
Gly | 3.709 ± 0.403 | 0.512 ± 0.108 | 4.693 ± 0.362 | 3.894 ± 0.264 | 2.91 ± 0.232 | 4.591 ± 0.725 | 1.066 ± 0.147 | 3.894 ± 0.285 | 4.14 ± 0.329 | 3.75 ± 0.298 | 1.722 ± 0.215 | 3.627 ± 0.349 | 1.865 ± 0.226 | 1.804 ± 0.206 | 3.136 ± 0.251 | 5.144 ± 0.424 | 4.488 ± 0.452 | 3.771 ± 0.258 | 0.758 ± 0.122 | 2.582 ± 0.218 | 0.0 ± 0.0
His | 1.271 ± 0.145 | 0.184 ± 0.06 | 1.004 ± 0.14 | 1.332 ± 0.206 | 1.066 ± 0.134 | 1.107 ± 0.161 | 0.512 ± 0.107 | 1.168 ± 0.15 | 0.902 ± 0.154 | 1.455 ± 0.211 | 0.492 ± 0.115 | 0.902 ± 0.13 | 0.922 ± 0.134 | 0.717 ± 0.139 | 1.086 ± 0.163 | 0.902 ± 0.143 | 1.004 ± 0.128 | 1.66 ± 0.218 | 0.205 ± 0.064 | 0.861 ± 0.143 | 0.0 ± 0.0
Ile | 5.021 ± 0.306 | 0.533 ± 0.107 | 5.738 ± 0.311 | 5.144 ± 0.332 | 3.033 ± 0.236 | 4.55 ± 0.423 | 1.291 ± 0.151 | 3.648 ± 0.27 | 4.017 ± 0.278 | 4.427 ± 0.269 | 1.332 ± 0.163 | 4.078 ± 0.328 | 2.521 ± 0.219 | 2.193 ± 0.221 | 3.75 ± 0.265 | 4.878 ± 0.338 | 3.791 ± 0.274 | 5.144 ± 0.376 | 0.779 ± 0.118 | 2.623 ± 0.245 | 0.0 ± 0.0
Lys | 4.099 ± 0.312 | 0.553 ± 0.111 | 4.386 ± 0.324 | 5.021 ± 0.375 | 3.853 ± 0.294 | 2.869 ± 0.252 | 1.271 ± 0.2 | 6.025 ± 0.386 | 5.759 ± 0.451 | 5.574 ± 0.411 | 2.049 ± 0.238 | 3.73 ± 0.275 | 1.824 ± 0.213 | 2.193 ± 0.21 | 3.32 ± 0.323 | 3.894 ± 0.273 | 4.857 ± 0.313 | 4.119 ± 0.312 | 0.902 ± 0.136 | 2.623 ± 0.221 | 0.0 ± 0.0
Leu | 5.247 ± 0.328 | 0.758 ± 0.133 | 5.411 ± 0.342 | 5.267 ± 0.387 | 3.259 ± 0.258 | 4.693 ± 0.322 | 1.414 ± 0.185 | 4.406 ± 0.315 | 5.779 ± 0.359 | 5.185 ± 0.27 | 1.66 ± 0.21 | 4.201 ± 0.303 | 3.013 ± 0.258 | 2.644 ± 0.218 | 4.119 ± 0.305 | 5.656 ± 0.336 | 4.96 ± 0.308 | 4.673 ± 0.326 | 0.799 ± 0.144 | 2.623 ± 0.267 | 0.0 ± 0.0
Met | 1.701 ± 0.195 | 0.205 ± 0.059 | 1.107 ± 0.164 | 1.517 ± 0.192 | 1.209 ± 0.163 | 0.902 ± 0.132 | 0.512 ± 0.093 | 1.537 ± 0.185 | 2.357 ± 0.25 | 1.517 ± 0.19 | 0.676 ± 0.121 | 1.435 ± 0.162 | 0.758 ± 0.117 | 1.004 ± 0.154 | 1.537 ± 0.191 | 2.521 ± 0.213 | 2.685 ± 0.236 | 1.025 ± 0.149 | 0.164 ± 0.058 | 0.779 ± 0.117 | 0.0 ± 0.0
Asn | 3.709 ± 0.364 | 0.328 ± 0.08 | 3.689 ± 0.272 | 3.791 ± 0.324 | 2.705 ± 0.184 | 4.263 ± 0.263 | 0.82 ± 0.117 | 3.033 ± 0.275 | 2.767 ± 0.231 | 3.894 ± 0.357 | 1.312 ± 0.152 | 2.439 ± 0.217 | 2.5 ± 0.222 | 1.414 ± 0.17 | 2.603 ± 0.218 | 3.75 ± 0.284 | 2.541 ± 0.183 | 3.709 ± 0.267 | 0.533 ± 0.11 | 2.172 ± 0.239 | 0.0 ± 0.0
Pro | 1.619 ± 0.183 | 0.184 ± 0.062 | 2.172 ± 0.248 | 2.992 ± 0.263 | 1.66 ± 0.173 | 2.746 ± 0.24 | 0.84 ± 0.127 | 2.049 ± 0.198 | 1.783 ± 0.208 | 2.521 ± 0.219 | 0.779 ± 0.131 | 1.844 ± 0.206 | 0.82 ± 0.122 | 1.004 ± 0.149 | 1.558 ± 0.187 | 2.91 ± 0.239 | 2.152 ± 0.2 | 3.197 ± 0.241 | 0.369 ± 0.101 | 1.599 ± 0.192 | 0.0 ± 0.0
Gln | 1.783 ± 0.209 | 0.287 ± 0.079 | 2.09 ± 0.186 | 2.111 ± 0.21 | 1.496 ± 0.18 | 1.373 ± 0.167 | 0.574 ± 0.095 | 2.5 ± 0.218 | 2.5 ± 0.257 | 2.746 ± 0.273 | 0.799 ± 0.139 | 2.152 ± 0.183 | 0.594 ± 0.102 | 1.25 ± 0.136 | 1.394 ± 0.162 | 1.701 ± 0.19 | 2.172 ± 0.207 | 1.844 ± 0.155 | 0.471 ± 0.123 | 1.25 ± 0.171 | 0.0 ± 0.0
Arg | 3.238 ± 0.24 | 0.369 ± 0.076 | 3.505 ± 0.262 | 2.664 ± 0.234 | 2.931 ± 0.286 | 3.156 ± 0.281 | 1.066 ± 0.16 | 3.75 ± 0.293 | 3.464 ± 0.322 | 3.873 ± 0.248 | 1.25 ± 0.172 | 2.767 ± 0.204 | 1.763 ± 0.201 | 1.742 ± 0.186 | 2.726 ± 0.257 | 2.746 ± 0.226 | 2.89 ± 0.225 | 4.201 ± 0.309 | 0.533 ± 0.1 | 2.5 ± 0.197 | 0.0 ± 0.0
Ser | 4.017 ± 0.356 | 0.635 ± 0.123 | 4.673 ± 0.318 | 4.181 ± 0.34 | 3.648 ± 0.264 | 5.083 ± 0.483 | 1.127 ± 0.144 | 4.55 ± 0.408 | 4.632 ± 0.324 | 5.021 ± 0.3 | 1.844 ± 0.194 | 3.607 ± 0.276 | 2.008 ± 0.187 | 1.681 ± 0.181 | 3.279 ± 0.269 | 5.001 ± 0.426 | 3.832 ± 0.375 | 4.468 ± 0.339 | 0.82 ± 0.127 | 2.808 ± 0.239 | 0.0 ± 0.0
Thr | 3.587 ± 0.348 | 0.451 ± 0.09 | 4.037 ± 0.315 | 3.443 ± 0.231 | 3.218 ± 0.278 | 4.591 ± 0.437 | 1.045 ± 0.141 | 4.632 ± 0.333 | 4.406 ± 0.288 | 4.878 ± 0.349 | 1.23 ± 0.131 | 3.033 ± 0.287 | 2.623 ± 0.264 | 1.537 ± 0.152 | 3.156 ± 0.259 | 4.55 ± 0.481 | 3.546 ± 0.442 | 4.447 ± 0.351 | 0.676 ± 0.103 | 2.439 ± 0.205 | 0.0 ± 0.0
Val | 4.263 ± 0.369 | 0.533 ± 0.09 | 5.472 ± 0.341 | 4.939 ± 0.428 | 3.423 ± 0.249 | 3.791 ± 0.337 | 1.312 ± 0.168 | 4.796 ± 0.366 | 5.144 ± 0.319 | 4.919 ± 0.319 | 1.455 ± 0.197 | 3.218 ± 0.316 | 2.828 ± 0.245 | 2.234 ± 0.222 | 3.627 ± 0.289 | 4.365 ± 0.301 | 4.55 ± 0.406 | 4.632 ± 0.321 | 0.84 ± 0.125 | 2.582 ± 0.226 | 0.0 ± 0.0
Trp | 0.43 ± 0.097 | 0.266 ± 0.071 | 0.656 ± 0.115 | 0.594 ± 0.117 | 0.799 ± 0.144 | 0.287 ± 0.07 | 0.369 ± 0.081 | 1.148 ± 0.149 | 1.025 ± 0.136 | 0.984 ± 0.135 | 0.43 ± 0.102 | 0.902 ± 0.137 | 0.369 ± 0.099 | 0.369 ± 0.1 | 0.574 ± 0.118 | 0.861 ± 0.12 | 0.922 ± 0.145 | 0.799 ± 0.129 | 0.266 ± 0.08 | 0.389 ± 0.092 | 0.0 ± 0.0
Tyr | 2.398 ± 0.209 | 0.43 ± 0.082 | 3.115 ± 0.259 | 2.377 ± 0.208 | 1.988 ± 0.197 | 2.5 ± 0.236 | 0.738 ± 0.125 | 2.582 ± 0.2 | 2.131 ± 0.201 | 2.89 ± 0.266 | 0.963 ± 0.16 | 1.967 ± 0.164 | 1.537 ± 0.18 | 1.435 ± 0.179 | 2.316 ± 0.212 | 2.418 ± 0.216 | 2.295 ± 0.276 | 3.136 ± 0.284 | 0.492 ± 0.097 | 1.701 ± 0.18 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 257 proteins (48795 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
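A protein-level bootstrap of this kind could look like the sketch below. The source states only that the error comes from bootstrapping (x100) at the protein level. Everything else here is an assumption for illustration: the function names, resampling whole proteins with replacement, and reporting the sample standard deviation of the resampled estimates as the error.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency on that resampled proteome.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real phage proteins.
mean, err = bootstrap_error(["MKKLA", "AKKL", "GGGG"], "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the spread reflects variation between proteins.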

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski