Amino acid dipeptide frequency for Microbacterium phage PauloDiaboli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
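As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name, the toy sequences, and the choice to normalise by the total number of counted pairs are assumptions made for this example; the database may normalise differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return the per mille frequency of each of the 400 standard dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Hypothetical toy input, not taken from the PauloDiaboli proteome.
toy_proteins = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy_proteins)
print(f"AA: {freqs['AA']:.1f} per mille")
```

In the toy input there are five pairs in total (MA, AA, AL, AA, AG), so AA accounts for two of five.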

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
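The uniform expectation and the per mille convention can be combined in a short sketch. It converts the 0.25% expectation to per mille (2.5‰) and labels a few values copied from the table as over- or underrepresented. The helper names are hypothetical, and using the flat 1/400 expectation as the threshold is an assumption based on the description above, not a confirmed detail of how the page colours its cells.

```python
# A few values copied from the table on this page, in per mille.
observed_per_mille = {
    "AlaAla": 11.041,
    "AlaLeu": 9.245,
    "LysGlu": 2.19,
    "TrpPro": 0.561,
    "CysCys": 0.112,
}

N_DIPEPTIDES = 20 * 20                       # 400 standard dipeptides
expected_per_mille = 1000.0 / N_DIPEPTIDES   # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=expected_per_mille):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


for name, value in observed_per_mille.items():
    print(f"{name}: {value} per mille "
          f"(fraction {per_mille_to_fraction(value):.6f}) -> {classify(value)}")
```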

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.041 ± 0.757
AlaCys: 0.599 ± 0.116
AlaAsp: 6.213 ± 0.358
AlaGlu: 7.561 ± 0.6
AlaPhe: 3.238 ± 0.283
AlaGly: 6.943 ± 0.385
AlaHis: 1.909 ± 0.231
AlaIle: 4.379 ± 0.316
AlaLys: 4.622 ± 0.365
AlaLeu: 9.245 ± 0.418
AlaMet: 2.77 ± 0.274
AlaAsn: 3.181 ± 0.282
AlaPro: 5.034 ± 0.563
AlaGln: 3.032 ± 0.273
AlaArg: 5.876 ± 0.397
AlaSer: 5.502 ± 0.442
AlaThr: 5.895 ± 0.485
AlaVal: 7.13 ± 0.349
AlaTrp: 1.609 ± 0.185
AlaTyr: 2.62 ± 0.254
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.561 ± 0.119
CysCys: 0.112 ± 0.046
CysAsp: 0.393 ± 0.092
CysGlu: 0.561 ± 0.104
CysPhe: 0.262 ± 0.074
CysGly: 1.104 ± 0.22
CysHis: 0.131 ± 0.051
CysIle: 0.337 ± 0.081
CysLys: 0.243 ± 0.073
CysLeu: 0.337 ± 0.094
CysMet: 0.225 ± 0.061
CysAsn: 0.225 ± 0.064
CysPro: 0.505 ± 0.122
CysGln: 0.112 ± 0.048
CysArg: 0.524 ± 0.13
CysSer: 0.468 ± 0.108
CysThr: 0.561 ± 0.117
CysVal: 0.524 ± 0.104
CysTrp: 0.112 ± 0.049
CysTyr: 0.337 ± 0.08
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.831 ± 0.404
AspCys: 0.524 ± 0.123
AspAsp: 4.117 ± 0.36
AspGlu: 5.109 ± 0.33
AspPhe: 2.545 ± 0.232
AspGly: 6.063 ± 0.37
AspHis: 1.535 ± 0.171
AspIle: 3.481 ± 0.242
AspLys: 2.246 ± 0.226
AspLeu: 5.072 ± 0.317
AspMet: 1.74 ± 0.179
AspAsn: 2.433 ± 0.206
AspPro: 4.267 ± 0.334
AspGln: 2.021 ± 0.191
AspArg: 4.005 ± 0.296
AspSer: 3.256 ± 0.279
AspThr: 3.724 ± 0.259
AspVal: 3.855 ± 0.27
AspTrp: 1.609 ± 0.165
AspTyr: 1.815 ± 0.177
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.13 ± 0.544
GluCys: 0.543 ± 0.122
GluAsp: 5.053 ± 0.36
GluGlu: 5.371 ± 0.507
GluPhe: 2.489 ± 0.216
GluGly: 4.697 ± 0.308
GluHis: 1.16 ± 0.153
GluIle: 2.919 ± 0.237
GluLys: 2.938 ± 0.239
GluLeu: 6.288 ± 0.377
GluMet: 1.404 ± 0.165
GluAsn: 2.133 ± 0.197
GluPro: 3.144 ± 0.263
GluGln: 2.545 ± 0.261
GluArg: 4.548 ± 0.341
GluSer: 3.406 ± 0.259
GluThr: 3.855 ± 0.329
GluVal: 4.772 ± 0.363
GluTrp: 1.684 ± 0.182
GluTyr: 2.601 ± 0.225
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.05 ± 0.279
PheCys: 0.187 ± 0.058
PheAsp: 2.059 ± 0.209
PheGlu: 2.863 ± 0.311
PhePhe: 1.647 ± 0.219
PheGly: 2.919 ± 0.244
PheHis: 0.524 ± 0.105
PheIle: 1.684 ± 0.193
PheLys: 1.647 ± 0.157
PheLeu: 2.994 ± 0.252
PheMet: 0.898 ± 0.108
PheAsn: 1.591 ± 0.206
PhePro: 1.684 ± 0.153
PheGln: 0.954 ± 0.108
PheArg: 2.002 ± 0.209
PheSer: 2.021 ± 0.21
PheThr: 2.077 ± 0.222
PheVal: 2.62 ± 0.249
PheTrp: 0.73 ± 0.119
PheTyr: 1.198 ± 0.16
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.662 ± 0.424
GlyCys: 0.524 ± 0.115
GlyAsp: 5.24 ± 0.307
GlyGlu: 4.922 ± 0.285
GlyPhe: 3.107 ± 0.241
GlyGly: 6.176 ± 0.41
GlyHis: 1.422 ± 0.214
GlyIle: 3.967 ± 0.276
GlyLys: 4.098 ± 0.275
GlyLeu: 5.801 ± 0.396
GlyMet: 2.414 ± 0.207
GlyAsn: 2.901 ± 0.194
GlyPro: 2.676 ± 0.263
GlyGln: 2.302 ± 0.22
GlyArg: 4.248 ± 0.288
GlySer: 4.679 ± 0.34
GlyThr: 5.745 ± 0.504
GlyVal: 5.97 ± 0.396
GlyTrp: 2.04 ± 0.231
GlyTyr: 3.069 ± 0.249
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.909 ± 0.204
HisCys: 0.094 ± 0.043
HisAsp: 1.123 ± 0.16
HisGlu: 1.123 ± 0.165
HisPhe: 0.786 ± 0.134
HisGly: 1.871 ± 0.201
HisHis: 0.543 ± 0.122
HisIle: 1.104 ± 0.154
HisLys: 0.749 ± 0.135
HisLeu: 1.254 ± 0.17
HisMet: 0.412 ± 0.092
HisAsn: 0.786 ± 0.137
HisPro: 1.404 ± 0.165
HisGln: 0.618 ± 0.116
HisArg: 1.273 ± 0.151
HisSer: 0.973 ± 0.138
HisThr: 0.805 ± 0.121
HisVal: 1.46 ± 0.187
HisTrp: 0.356 ± 0.085
HisTyr: 0.88 ± 0.126
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.39 ± 0.341
IleCys: 0.468 ± 0.091
IleAsp: 3.631 ± 0.275
IleGlu: 3.724 ± 0.282
IlePhe: 1.572 ± 0.222
IleGly: 3.518 ± 0.273
IleHis: 0.954 ± 0.123
IleIle: 2.526 ± 0.26
IleLys: 1.853 ± 0.202
IleLeu: 3.5 ± 0.285
IleMet: 1.198 ± 0.148
IleAsn: 1.984 ± 0.161
IlePro: 2.657 ± 0.214
IleGln: 1.553 ± 0.16
IleArg: 3.406 ± 0.239
IleSer: 2.283 ± 0.208
IleThr: 3.743 ± 0.293
IleVal: 3.836 ± 0.29
IleTrp: 0.861 ± 0.106
IleTyr: 1.46 ± 0.141
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.641 ± 0.277
LysCys: 0.337 ± 0.088
LysAsp: 3.125 ± 0.235
LysGlu: 2.19 ± 0.21
LysPhe: 1.666 ± 0.156
LysGly: 2.695 ± 0.236
LysHis: 0.88 ± 0.136
LysIle: 2.246 ± 0.218
LysLys: 2.264 ± 0.211
LysLeu: 3.219 ± 0.267
LysMet: 1.179 ± 0.165
LysAsn: 1.46 ± 0.179
LysPro: 2.021 ± 0.184
LysGln: 1.329 ± 0.147
LysArg: 3.181 ± 0.278
LysSer: 2.47 ± 0.221
LysThr: 3.219 ± 0.269
LysVal: 3.406 ± 0.244
LysTrp: 0.767 ± 0.121
LysTyr: 1.46 ± 0.181
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.729 ± 0.413
LeuCys: 0.524 ± 0.113
LeuAsp: 5.839 ± 0.398
LeuGlu: 4.941 ± 0.321
LeuPhe: 2.115 ± 0.245
LeuGly: 5.221 ± 0.357
LeuHis: 1.89 ± 0.192
LeuIle: 4.042 ± 0.283
LeuLys: 2.957 ± 0.265
LeuLeu: 5.483 ± 0.434
LeuMet: 1.46 ± 0.143
LeuAsn: 3.256 ± 0.32
LeuPro: 3.911 ± 0.301
LeuGln: 2.62 ± 0.3
LeuArg: 5.521 ± 0.395
LeuSer: 5.09 ± 0.322
LeuThr: 5.689 ± 0.44
LeuVal: 5.483 ± 0.311
LeuTrp: 1.31 ± 0.175
LeuTyr: 2.264 ± 0.224
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.246 ± 0.181
MetCys: 0.075 ± 0.037
MetAsp: 1.535 ± 0.128
MetGlu: 1.254 ± 0.151
MetPhe: 0.674 ± 0.121
MetGly: 1.946 ± 0.219
MetHis: 0.487 ± 0.111
MetIle: 1.31 ± 0.143
MetLys: 1.067 ± 0.14
MetLeu: 1.666 ± 0.175
MetMet: 0.543 ± 0.107
MetAsn: 0.805 ± 0.11
MetPro: 1.385 ± 0.172
MetGln: 0.749 ± 0.091
MetArg: 1.478 ± 0.175
MetSer: 2.526 ± 0.216
MetThr: 2.377 ± 0.258
MetVal: 1.385 ± 0.166
MetTrp: 0.412 ± 0.09
MetTyr: 0.674 ± 0.122
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.612 ± 0.297
AsnCys: 0.131 ± 0.049
AsnAsp: 1.815 ± 0.151
AsnGlu: 2.021 ± 0.205
AsnPhe: 1.273 ± 0.154
AsnGly: 4.51 ± 0.367
AsnHis: 0.749 ± 0.128
AsnIle: 2.246 ± 0.246
AsnLys: 1.46 ± 0.177
AsnLeu: 2.826 ± 0.233
AsnMet: 0.674 ± 0.097
AsnAsn: 1.142 ± 0.173
AsnPro: 2.208 ± 0.221
AsnGln: 0.992 ± 0.154
AsnArg: 2.489 ± 0.208
AsnSer: 2.059 ± 0.197
AsnThr: 2.133 ± 0.265
AsnVal: 2.676 ± 0.298
AsnTrp: 0.88 ± 0.129
AsnTyr: 1.198 ± 0.146
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.267 ± 0.438
ProCys: 0.299 ± 0.076
ProAsp: 3.425 ± 0.301
ProGlu: 4.791 ± 0.432
ProPhe: 1.591 ± 0.145
ProGly: 3.911 ± 0.27
ProHis: 0.936 ± 0.146
ProIle: 2.657 ± 0.243
ProLys: 2.152 ± 0.172
ProLeu: 3.331 ± 0.259
ProMet: 0.973 ± 0.124
ProAsn: 1.984 ± 0.271
ProPro: 2.096 ± 0.228
ProGln: 1.834 ± 0.324
ProArg: 2.526 ± 0.275
ProSer: 2.976 ± 0.253
ProThr: 2.957 ± 0.242
ProVal: 4.211 ± 0.287
ProTrp: 1.123 ± 0.153
ProTyr: 1.834 ± 0.167
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.331 ± 0.29
GlnCys: 0.281 ± 0.075
GlnAsp: 1.684 ± 0.22
GlnGlu: 1.984 ± 0.211
GlnPhe: 1.179 ± 0.144
GlnGly: 2.264 ± 0.251
GlnHis: 0.599 ± 0.103
GlnIle: 1.759 ± 0.17
GlnLys: 1.703 ± 0.186
GlnLeu: 2.433 ± 0.199
GlnMet: 0.786 ± 0.093
GlnAsn: 1.46 ± 0.153
GlnPro: 1.722 ± 0.177
GlnGln: 1.235 ± 0.215
GlnArg: 1.815 ± 0.178
GlnSer: 1.366 ± 0.162
GlnThr: 2.227 ± 0.243
GlnVal: 2.19 ± 0.201
GlnTrp: 0.561 ± 0.102
GlnTyr: 1.329 ± 0.164
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.045 ± 0.377
ArgCys: 0.674 ± 0.127
ArgAsp: 4.286 ± 0.401
ArgGlu: 4.716 ± 0.332
ArgPhe: 2.47 ± 0.215
ArgGly: 4.173 ± 0.267
ArgHis: 1.067 ± 0.135
ArgIle: 3.331 ± 0.229
ArgLys: 3.181 ± 0.248
ArgLeu: 4.847 ± 0.307
ArgMet: 1.628 ± 0.166
ArgAsn: 2.377 ± 0.201
ArgPro: 2.751 ± 0.26
ArgGln: 2.002 ± 0.196
ArgArg: 4.342 ± 0.357
ArgSer: 3.2 ± 0.245
ArgThr: 3.107 ± 0.238
ArgVal: 4.66 ± 0.28
ArgTrp: 1.16 ± 0.147
ArgTyr: 1.965 ± 0.191
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.026 ± 0.421
SerCys: 0.393 ± 0.111
SerAsp: 4.042 ± 0.302
SerGlu: 3.35 ± 0.277
SerPhe: 2.04 ± 0.199
SerGly: 5.165 ± 0.395
SerHis: 0.861 ± 0.102
SerIle: 2.863 ± 0.236
SerLys: 2.433 ± 0.217
SerLeu: 4.061 ± 0.267
SerMet: 1.666 ± 0.177
SerAsn: 1.89 ± 0.22
SerPro: 2.601 ± 0.227
SerGln: 1.797 ± 0.161
SerArg: 3.425 ± 0.252
SerSer: 3.069 ± 0.325
SerThr: 3.631 ± 0.331
SerVal: 3.893 ± 0.278
SerTrp: 1.216 ± 0.167
SerTyr: 2.002 ± 0.21
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.438 ± 0.483
ThrCys: 0.468 ± 0.093
ThrAsp: 3.949 ± 0.295
ThrGlu: 3.893 ± 0.279
ThrPhe: 2.639 ± 0.298
ThrGly: 5.371 ± 0.428
ThrHis: 1.104 ± 0.139
ThrIle: 3.406 ± 0.305
ThrLys: 2.863 ± 0.272
ThrLeu: 5.184 ± 0.308
ThrMet: 1.591 ± 0.196
ThrAsn: 2.377 ± 0.24
ThrPro: 3.893 ± 0.285
ThrGln: 1.815 ± 0.21
ThrArg: 3.593 ± 0.286
ThrSer: 3.387 ± 0.302
ThrThr: 4.753 ± 0.506
ThrVal: 5.558 ± 0.381
ThrTrp: 1.516 ± 0.151
ThrTyr: 2.246 ± 0.225
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.606 ± 0.52
ValCys: 0.88 ± 0.182
ValAsp: 4.604 ± 0.269
ValGlu: 5.184 ± 0.362
ValPhe: 2.208 ± 0.209
ValGly: 4.997 ± 0.373
ValHis: 1.572 ± 0.181
ValIle: 3.462 ± 0.243
ValLys: 3.088 ± 0.215
ValLeu: 5.783 ± 0.36
ValMet: 1.666 ± 0.209
ValAsn: 2.788 ± 0.214
ValPro: 3.818 ± 0.227
ValGln: 2.695 ± 0.238
ValArg: 3.949 ± 0.282
ValSer: 4.417 ± 0.312
ValThr: 5.727 ± 0.502
ValVal: 5.596 ± 0.418
ValTrp: 1.46 ± 0.179
ValTyr: 2.489 ± 0.23
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.871 ± 0.207
TrpCys: 0.243 ± 0.066
TrpAsp: 1.871 ± 0.199
TrpGlu: 1.104 ± 0.161
TrpPhe: 0.636 ± 0.114
TrpGly: 1.778 ± 0.183
TrpHis: 0.374 ± 0.1
TrpIle: 1.123 ± 0.17
TrpLys: 1.011 ± 0.128
TrpLeu: 1.572 ± 0.164
TrpMet: 0.543 ± 0.105
TrpAsn: 0.842 ± 0.128
TrpPro: 0.561 ± 0.108
TrpGln: 0.636 ± 0.1
TrpArg: 1.329 ± 0.173
TrpSer: 1.123 ± 0.172
TrpThr: 1.31 ± 0.15
TrpVal: 1.722 ± 0.179
TrpTrp: 0.636 ± 0.108
TrpTyr: 0.58 ± 0.109
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.013 ± 0.235
TyrCys: 0.281 ± 0.073
TyrAsp: 2.246 ± 0.205
TyrGlu: 2.021 ± 0.212
TyrPhe: 1.216 ± 0.152
TyrGly: 2.657 ± 0.235
TyrHis: 0.823 ± 0.125
TyrIle: 1.273 ± 0.15
TyrLys: 1.179 ± 0.162
TyrLeu: 2.452 ± 0.217
TyrMet: 0.805 ± 0.116
TyrAsn: 1.478 ± 0.178
TyrPro: 1.535 ± 0.158
TyrGln: 1.067 ± 0.147
TyrArg: 2.452 ± 0.201
TyrSer: 2.152 ± 0.231
TyrThr: 2.508 ± 0.266
TyrVal: 2.04 ± 0.181
TyrTrp: 0.823 ± 0.132
TyrTyr: 1.198 ± 0.147
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 311 proteins (53436 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
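One common way to obtain such an error is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the sample standard deviation over the replicates as the error, the function names, the fixed seed, and the toy sequences are all assumptions for this example; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the PauloDiaboli data.
toy = ["MAAL", "AAG", "MKKL", "GAVA"]
value = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries.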

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski