Amino acid dipeptide frequency for Dickeya phage vB_DsoM_AD1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
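As an illustration, dipeptide frequencies can be computed by sliding a window of two residues over each protein sequence. The following is a minimal sketch, not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, overlapping windows, and normalization by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping windows of length 2, normalized by the total
    number of dipeptides counted. Pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

For the toy input there are five dipeptides in total (MK, KK, KA, AK, KK), so `freqs["KK"]` is 2 out of 5, or 400 ‰.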

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
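The comparison against the uniform expectation can be sketched as follows. The page does not state the exact rule used for coloring, so the strict comparison against 2.5 ‰ and the name `classify` are assumptions; the three example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"

# Values (in per mille) copied from the table.
examples = {"AlaAla": 8.269, "CysCys": 0.159, "ArgAsn": 2.5}
labels = {name: classify(value) for name, value in examples.items()}
```

Note that the uniform baseline ignores the fact that single amino acids themselves differ in abundance; a stricter baseline would multiply the two single-residue frequencies.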

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
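For example, the unit conversion for a single table entry looks like this (a small sketch; the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala_fraction = per_mille_to_fraction(8.269)  # AlaAla entry from the table
ala_ala_percent = per_mille_to_percent(8.269)
```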

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.269 ± 0.431
AlaCys: 0.768 ± 0.093
AlaAsp: 5.793 ± 0.295
AlaGlu: 5.525 ± 0.322
AlaPhe: 3.146 ± 0.179
AlaGly: 5.025 ± 0.328
AlaHis: 1.537 ± 0.143
AlaIle: 4.159 ± 0.249
AlaLys: 6.049 ± 0.481
AlaLeu: 7.476 ± 0.373
AlaMet: 2.768 ± 0.194
AlaAsn: 4.11 ± 0.253
AlaPro: 3.403 ± 0.243
AlaGln: 3.354 ± 0.246
AlaArg: 4.049 ± 0.234
AlaSer: 5.049 ± 0.259
AlaThr: 4.732 ± 0.331
AlaVal: 5.988 ± 0.321
AlaTrp: 0.805 ± 0.101
AlaTyr: 2.878 ± 0.207
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.707 ± 0.105
CysCys: 0.159 ± 0.047
CysAsp: 0.878 ± 0.109
CysGlu: 0.732 ± 0.104
CysPhe: 0.463 ± 0.079
CysGly: 0.976 ± 0.112
CysHis: 0.293 ± 0.062
CysIle: 0.72 ± 0.097
CysLys: 0.573 ± 0.081
CysLeu: 0.89 ± 0.117
CysMet: 0.329 ± 0.073
CysAsn: 0.402 ± 0.07
CysPro: 0.659 ± 0.08
CysGln: 0.366 ± 0.072
CysArg: 0.72 ± 0.107
CysSer: 0.829 ± 0.106
CysThr: 0.549 ± 0.089
CysVal: 0.842 ± 0.11
CysTrp: 0.122 ± 0.039
CysTyr: 0.427 ± 0.074
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.427 ± 0.254
AspCys: 0.854 ± 0.114
AspAsp: 5.854 ± 0.664
AspGlu: 5.598 ± 0.723
AspPhe: 2.732 ± 0.185
AspGly: 4.744 ± 0.334
AspHis: 1.244 ± 0.117
AspIle: 3.622 ± 0.2
AspLys: 3.268 ± 0.221
AspLeu: 6.232 ± 0.307
AspMet: 2.061 ± 0.138
AspAsn: 2.732 ± 0.186
AspPro: 3.037 ± 0.189
AspGln: 2.073 ± 0.162
AspArg: 2.878 ± 0.208
AspSer: 4.244 ± 0.243
AspThr: 3.842 ± 0.223
AspVal: 4.842 ± 0.259
AspTrp: 0.976 ± 0.12
AspTyr: 2.512 ± 0.173
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.012 ± 0.292
GluCys: 0.732 ± 0.095
GluAsp: 5.208 ± 0.723
GluGlu: 4.964 ± 0.582
GluPhe: 2.207 ± 0.141
GluGly: 3.647 ± 0.207
GluHis: 1.793 ± 0.156
GluIle: 3.573 ± 0.212
GluLys: 3.488 ± 0.273
GluLeu: 5.464 ± 0.315
GluMet: 2.073 ± 0.173
GluAsn: 2.427 ± 0.162
GluPro: 2.293 ± 0.184
GluGln: 2.903 ± 0.215
GluArg: 3.146 ± 0.219
GluSer: 3.451 ± 0.263
GluThr: 2.732 ± 0.194
GluVal: 4.0 ± 0.229
GluTrp: 0.854 ± 0.11
GluTyr: 1.866 ± 0.184
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.695 ± 0.171
PheCys: 0.573 ± 0.095
PheAsp: 3.817 ± 0.229
PheGlu: 2.72 ± 0.188
PhePhe: 1.134 ± 0.128
PheGly: 2.829 ± 0.171
PheHis: 0.634 ± 0.101
PheIle: 2.012 ± 0.147
PheLys: 2.183 ± 0.189
PheLeu: 2.512 ± 0.199
PheMet: 1.012 ± 0.115
PheAsn: 1.939 ± 0.136
PhePro: 1.293 ± 0.122
PheGln: 0.976 ± 0.096
PheArg: 1.976 ± 0.148
PheSer: 2.525 ± 0.188
PheThr: 2.646 ± 0.16
PheVal: 2.793 ± 0.189
PheTrp: 0.366 ± 0.07
PheTyr: 0.988 ± 0.116
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.89 ± 0.249
GlyCys: 0.598 ± 0.101
GlyAsp: 4.012 ± 0.226
GlyGlu: 3.793 ± 0.276
GlyPhe: 2.659 ± 0.186
GlyGly: 4.647 ± 0.305
GlyHis: 1.232 ± 0.13
GlyIle: 3.171 ± 0.193
GlyLys: 4.659 ± 0.276
GlyLeu: 5.208 ± 0.22
GlyMet: 1.89 ± 0.151
GlyAsn: 3.049 ± 0.226
GlyPro: 2.464 ± 0.337
GlyGln: 2.549 ± 0.174
GlyArg: 3.525 ± 0.282
GlySer: 4.281 ± 0.24
GlyThr: 4.647 ± 0.315
GlyVal: 4.671 ± 0.231
GlyTrp: 0.915 ± 0.112
GlyTyr: 2.659 ± 0.149
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.537 ± 0.138
HisCys: 0.281 ± 0.063
HisAsp: 1.232 ± 0.124
HisGlu: 1.378 ± 0.138
HisPhe: 0.854 ± 0.102
HisGly: 1.403 ± 0.128
HisHis: 0.354 ± 0.067
HisIle: 1.378 ± 0.136
HisLys: 1.049 ± 0.119
HisLeu: 1.732 ± 0.134
HisMet: 0.768 ± 0.102
HisAsn: 1.012 ± 0.109
HisPro: 0.988 ± 0.098
HisGln: 0.61 ± 0.087
HisArg: 1.244 ± 0.148
HisSer: 1.134 ± 0.112
HisThr: 1.159 ± 0.121
HisVal: 1.415 ± 0.126
HisTrp: 0.281 ± 0.055
HisTyr: 0.756 ± 0.092
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.927 ± 0.21
IleCys: 0.793 ± 0.1
IleAsp: 4.39 ± 0.232
IleGlu: 4.037 ± 0.234
IlePhe: 1.683 ± 0.137
IleGly: 3.086 ± 0.202
IleHis: 1.354 ± 0.136
IleIle: 2.585 ± 0.183
IleLys: 3.329 ± 0.21
IleLeu: 3.415 ± 0.202
IleMet: 1.293 ± 0.12
IleAsn: 2.671 ± 0.21
IlePro: 2.439 ± 0.17
IleGln: 2.085 ± 0.172
IleArg: 3.171 ± 0.209
IleSer: 3.342 ± 0.213
IleThr: 3.464 ± 0.217
IleVal: 4.0 ± 0.262
IleTrp: 0.451 ± 0.077
IleTyr: 1.744 ± 0.156
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.293 ± 0.538
LysCys: 0.549 ± 0.089
LysAsp: 3.293 ± 0.213
LysGlu: 3.549 ± 0.244
LysPhe: 2.024 ± 0.167
LysGly: 3.671 ± 0.259
LysHis: 1.439 ± 0.154
LysIle: 2.842 ± 0.207
LysLys: 5.732 ± 0.689
LysLeu: 5.695 ± 0.332
LysMet: 1.829 ± 0.165
LysAsn: 2.293 ± 0.193
LysPro: 2.915 ± 0.198
LysGln: 2.464 ± 0.186
LysArg: 3.72 ± 0.248
LysSer: 3.768 ± 0.279
LysThr: 3.72 ± 0.25
LysVal: 3.964 ± 0.241
LysTrp: 0.744 ± 0.08
LysTyr: 2.293 ± 0.169
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.854 ± 0.313
LeuCys: 0.89 ± 0.112
LeuAsp: 5.781 ± 0.331
LeuGlu: 4.464 ± 0.293
LeuPhe: 2.744 ± 0.202
LeuGly: 4.976 ± 0.256
LeuHis: 1.695 ± 0.162
LeuIle: 4.22 ± 0.207
LeuLys: 5.22 ± 0.353
LeuLeu: 5.952 ± 0.295
LeuMet: 2.354 ± 0.172
LeuAsn: 4.439 ± 0.234
LeuPro: 4.195 ± 0.239
LeuGln: 2.756 ± 0.175
LeuArg: 5.269 ± 0.234
LeuSer: 5.83 ± 0.235
LeuThr: 5.61 ± 0.278
LeuVal: 5.488 ± 0.264
LeuTrp: 0.744 ± 0.098
LeuTyr: 2.427 ± 0.17
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.488 ± 0.156
MetCys: 0.317 ± 0.049
MetAsp: 1.427 ± 0.133
MetGlu: 1.403 ± 0.131
MetPhe: 1.256 ± 0.112
MetGly: 1.403 ± 0.113
MetHis: 0.537 ± 0.078
MetIle: 1.695 ± 0.141
MetLys: 1.854 ± 0.157
MetLeu: 2.732 ± 0.187
MetMet: 0.768 ± 0.102
MetAsn: 1.598 ± 0.112
MetPro: 1.585 ± 0.152
MetGln: 1.232 ± 0.118
MetArg: 1.866 ± 0.181
MetSer: 1.854 ± 0.145
MetThr: 1.646 ± 0.13
MetVal: 1.256 ± 0.129
MetTrp: 0.366 ± 0.082
MetTyr: 1.024 ± 0.125
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.171 ± 0.281
AsnCys: 0.634 ± 0.096
AsnAsp: 2.671 ± 0.166
AsnGlu: 2.354 ± 0.144
AsnPhe: 1.829 ± 0.168
AsnGly: 3.878 ± 0.274
AsnHis: 0.878 ± 0.095
AsnIle: 2.768 ± 0.246
AsnLys: 2.781 ± 0.163
AsnLeu: 3.439 ± 0.221
AsnMet: 1.427 ± 0.131
AsnAsn: 2.0 ± 0.167
AsnPro: 2.159 ± 0.15
AsnGln: 1.695 ± 0.171
AsnArg: 2.146 ± 0.159
AsnSer: 2.744 ± 0.181
AsnThr: 2.915 ± 0.22
AsnVal: 3.512 ± 0.22
AsnTrp: 0.476 ± 0.069
AsnTyr: 1.439 ± 0.149
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.5 ± 0.243
ProCys: 0.415 ± 0.078
ProAsp: 3.378 ± 0.197
ProGlu: 3.025 ± 0.178
ProPhe: 1.695 ± 0.137
ProGly: 2.598 ± 0.27
ProHis: 0.854 ± 0.119
ProIle: 2.488 ± 0.166
ProLys: 3.098 ± 0.224
ProLeu: 3.061 ± 0.21
ProMet: 0.951 ± 0.113
ProAsn: 1.915 ± 0.128
ProPro: 1.329 ± 0.14
ProGln: 1.671 ± 0.153
ProArg: 2.0 ± 0.159
ProSer: 2.659 ± 0.189
ProThr: 3.317 ± 0.261
ProVal: 3.659 ± 0.231
ProTrp: 0.329 ± 0.058
ProTyr: 1.378 ± 0.129
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.244 ± 0.174
GlnCys: 0.341 ± 0.076
GlnAsp: 1.915 ± 0.181
GlnGlu: 1.866 ± 0.161
GlnPhe: 1.707 ± 0.152
GlnGly: 2.329 ± 0.16
GlnHis: 0.805 ± 0.103
GlnIle: 2.488 ± 0.171
GlnLys: 2.0 ± 0.149
GlnLeu: 3.305 ± 0.187
GlnMet: 1.329 ± 0.148
GlnAsn: 1.329 ± 0.112
GlnPro: 1.854 ± 0.171
GlnGln: 1.744 ± 0.17
GlnArg: 2.183 ± 0.197
GlnSer: 2.39 ± 0.159
GlnThr: 2.427 ± 0.186
GlnVal: 2.22 ± 0.171
GlnTrp: 0.622 ± 0.094
GlnTyr: 1.488 ± 0.111
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.951 ± 0.244
ArgCys: 0.61 ± 0.074
ArgAsp: 3.354 ± 0.23
ArgGlu: 2.903 ± 0.212
ArgPhe: 2.268 ± 0.161
ArgGly: 3.366 ± 0.249
ArgHis: 1.268 ± 0.142
ArgIle: 3.488 ± 0.198
ArgLys: 3.695 ± 0.26
ArgLeu: 4.634 ± 0.273
ArgMet: 1.61 ± 0.123
ArgAsn: 2.5 ± 0.148
ArgPro: 2.11 ± 0.135
ArgGln: 1.988 ± 0.164
ArgArg: 2.964 ± 0.22
ArgSer: 2.829 ± 0.203
ArgThr: 3.134 ± 0.162
ArgVal: 4.073 ± 0.24
ArgTrp: 0.488 ± 0.079
ArgTyr: 2.232 ± 0.154
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.695 ± 0.334
SerCys: 0.756 ± 0.105
SerAsp: 4.39 ± 0.27
SerGlu: 3.415 ± 0.248
SerPhe: 2.146 ± 0.165
SerGly: 4.72 ± 0.253
SerHis: 1.098 ± 0.129
SerIle: 3.476 ± 0.221
SerLys: 3.854 ± 0.276
SerLeu: 5.305 ± 0.282
SerMet: 1.549 ± 0.146
SerAsn: 3.086 ± 0.235
SerPro: 2.451 ± 0.162
SerGln: 2.22 ± 0.171
SerArg: 2.89 ± 0.168
SerSer: 4.329 ± 0.268
SerThr: 3.464 ± 0.229
SerVal: 4.976 ± 0.27
SerTrp: 0.683 ± 0.11
SerTyr: 2.0 ± 0.171
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.769 ± 0.394
ThrCys: 0.744 ± 0.096
ThrAsp: 3.561 ± 0.191
ThrGlu: 3.073 ± 0.221
ThrPhe: 2.585 ± 0.202
ThrGly: 4.805 ± 0.367
ThrHis: 1.293 ± 0.132
ThrIle: 3.72 ± 0.232
ThrLys: 2.915 ± 0.201
ThrLeu: 5.342 ± 0.234
ThrMet: 1.342 ± 0.13
ThrAsn: 2.646 ± 0.176
ThrPro: 2.988 ± 0.183
ThrGln: 2.512 ± 0.203
ThrArg: 2.646 ± 0.199
ThrSer: 3.244 ± 0.189
ThrThr: 3.756 ± 0.293
ThrVal: 5.342 ± 0.34
ThrTrp: 0.744 ± 0.09
ThrTyr: 2.268 ± 0.189
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.586 ± 0.318
ValCys: 0.866 ± 0.092
ValAsp: 4.659 ± 0.265
ValGlu: 4.756 ± 0.289
ValPhe: 2.683 ± 0.17
ValGly: 4.232 ± 0.209
ValHis: 1.159 ± 0.1
ValIle: 3.61 ± 0.199
ValLys: 4.671 ± 0.242
ValLeu: 5.744 ± 0.26
ValMet: 1.781 ± 0.153
ValAsn: 3.366 ± 0.209
ValPro: 3.427 ± 0.197
ValGln: 2.464 ± 0.178
ValArg: 4.183 ± 0.247
ValSer: 5.159 ± 0.254
ValThr: 4.525 ± 0.399
ValVal: 5.549 ± 0.285
ValTrp: 0.951 ± 0.117
ValTyr: 2.549 ± 0.169
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.939 ± 0.123
TrpCys: 0.22 ± 0.052
TrpAsp: 0.585 ± 0.1
TrpGlu: 0.573 ± 0.09
TrpPhe: 0.305 ± 0.058
TrpGly: 0.72 ± 0.098
TrpHis: 0.305 ± 0.06
TrpIle: 0.671 ± 0.091
TrpLys: 0.549 ± 0.087
TrpLeu: 1.061 ± 0.105
TrpMet: 0.317 ± 0.058
TrpAsn: 0.549 ± 0.074
TrpPro: 0.573 ± 0.087
TrpGln: 0.476 ± 0.072
TrpArg: 0.707 ± 0.087
TrpSer: 0.817 ± 0.092
TrpThr: 0.817 ± 0.113
TrpVal: 0.72 ± 0.086
TrpTrp: 0.146 ± 0.039
TrpTyr: 0.427 ± 0.07
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.927 ± 0.191
TyrCys: 0.537 ± 0.089
TyrAsp: 2.573 ± 0.178
TyrGlu: 1.707 ± 0.146
TyrPhe: 1.5 ± 0.14
TyrGly: 2.464 ± 0.175
TyrHis: 0.793 ± 0.097
TyrIle: 1.573 ± 0.133
TyrLys: 1.829 ± 0.152
TyrLeu: 2.72 ± 0.2
TyrMet: 0.842 ± 0.094
TyrAsn: 1.854 ± 0.173
TyrPro: 1.159 ± 0.118
TyrGln: 1.451 ± 0.131
TyrArg: 2.183 ± 0.19
TyrSer: 2.024 ± 0.161
TyrThr: 2.195 ± 0.221
TyrVal: 2.61 ± 0.213
TyrTrp: 0.415 ± 0.067
TyrTyr: 1.073 ± 0.121
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 330 proteins (81997 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
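A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled proteome, and the spread of the replicates is reported. This is a minimal sketch under stated assumptions, not the Proteome-pI code: the function names are hypothetical, and reporting the sample standard deviation of 100 replicates is an assumption, since the page only says that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences).
toy = ["MKKA", "AKK", "GAKKL"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling at the protein level rather than the residue level keeps each protein's internal composition intact, so proteins with unusual composition contribute to the estimated spread.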

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski