Amino acid dipepetide frequency for Dickeya phage vB_DsoM_JA29

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.26AlaAla: 6.26 ± 0.385
0.717AlaCys: 0.717 ± 0.104
4.739AlaAsp: 4.739 ± 0.269
5.217AlaGlu: 5.217 ± 0.308
3.256AlaPhe: 3.256 ± 0.19
4.299AlaGly: 4.299 ± 0.266
1.508AlaHis: 1.508 ± 0.139
4.852AlaIle: 4.852 ± 0.253
6.072AlaLys: 6.072 ± 0.386
6.436AlaLeu: 6.436 ± 0.31
1.923AlaMet: 1.923 ± 0.152
3.909AlaAsn: 3.909 ± 0.276
2.904AlaPro: 2.904 ± 0.22
2.879AlaGln: 2.879 ± 0.185
4.249AlaArg: 4.249 ± 0.225
4.45AlaSer: 4.45 ± 0.236
4.362AlaThr: 4.362 ± 0.302
4.613AlaVal: 4.613 ± 0.255
0.578AlaTrp: 0.578 ± 0.089
2.502AlaTyr: 2.502 ± 0.201
0.0AlaXaa: 0.0 ± 0.0
Cys
0.717CysAla: 0.717 ± 0.111
0.138CysCys: 0.138 ± 0.037
0.704CysAsp: 0.704 ± 0.097
0.867CysGlu: 0.867 ± 0.11
0.515CysPhe: 0.515 ± 0.073
0.855CysGly: 0.855 ± 0.067
0.239CysHis: 0.239 ± 0.053
0.641CysIle: 0.641 ± 0.092
0.704CysLys: 0.704 ± 0.098
0.955CysLeu: 0.955 ± 0.102
0.264CysMet: 0.264 ± 0.058
0.365CysAsn: 0.365 ± 0.06
0.742CysPro: 0.742 ± 0.105
0.251CysGln: 0.251 ± 0.056
0.603CysArg: 0.603 ± 0.104
0.729CysSer: 0.729 ± 0.108
0.515CysThr: 0.515 ± 0.099
0.704CysVal: 0.704 ± 0.094
0.075CysTrp: 0.075 ± 0.029
0.377CysTyr: 0.377 ± 0.066
0.0CysXaa: 0.0 ± 0.0
Asp
4.676AspAla: 4.676 ± 0.269
0.578AspCys: 0.578 ± 0.088
5.317AspAsp: 5.317 ± 0.655
5.468AspGlu: 5.468 ± 0.708
3.193AspPhe: 3.193 ± 0.192
4.5AspGly: 4.5 ± 0.301
0.968AspHis: 0.968 ± 0.111
3.985AspIle: 3.985 ± 0.263
3.595AspLys: 3.595 ± 0.25
5.896AspLeu: 5.896 ± 0.299
1.471AspMet: 1.471 ± 0.169
2.439AspAsn: 2.439 ± 0.155
3.356AspPro: 3.356 ± 0.266
2.124AspGln: 2.124 ± 0.148
3.042AspArg: 3.042 ± 0.199
4.84AspSer: 4.84 ± 0.305
2.967AspThr: 2.967 ± 0.208
4.199AspVal: 4.199 ± 0.225
0.93AspTrp: 0.93 ± 0.092
2.376AspTyr: 2.376 ± 0.172
0.0AspXaa: 0.0 ± 0.0
Glu
4.802GluAla: 4.802 ± 0.25
0.578GluCys: 0.578 ± 0.09
5.669GluAsp: 5.669 ± 0.749
5.958GluGlu: 5.958 ± 0.682
3.143GluPhe: 3.143 ± 0.193
3.168GluGly: 3.168 ± 0.221
1.332GluHis: 1.332 ± 0.134
4.689GluIle: 4.689 ± 0.278
4.651GluLys: 4.651 ± 0.318
5.795GluLeu: 5.795 ± 0.321
2.024GluMet: 2.024 ± 0.173
3.457GluAsn: 3.457 ± 0.204
1.823GluPro: 1.823 ± 0.151
2.451GluGln: 2.451 ± 0.18
3.105GluArg: 3.105 ± 0.251
4.375GluSer: 4.375 ± 0.313
3.545GluThr: 3.545 ± 0.228
4.349GluVal: 4.349 ± 0.222
0.83GluTrp: 0.83 ± 0.101
2.388GluTyr: 2.388 ± 0.17
0.0GluXaa: 0.0 ± 0.0
Phe
3.231PheAla: 3.231 ± 0.23
0.666PheCys: 0.666 ± 0.093
3.733PheAsp: 3.733 ± 0.264
3.143PheGlu: 3.143 ± 0.213
1.496PhePhe: 1.496 ± 0.156
3.105PheGly: 3.105 ± 0.205
0.767PheHis: 0.767 ± 0.102
2.652PheIle: 2.652 ± 0.18
2.275PheLys: 2.275 ± 0.161
3.105PheLeu: 3.105 ± 0.196
0.955PheMet: 0.955 ± 0.109
2.338PheAsn: 2.338 ± 0.171
1.81PhePro: 1.81 ± 0.147
1.156PheGln: 1.156 ± 0.107
2.275PheArg: 2.275 ± 0.153
2.979PheSer: 2.979 ± 0.224
2.602PheThr: 2.602 ± 0.19
3.117PheVal: 3.117 ± 0.204
0.402PheTrp: 0.402 ± 0.069
1.471PheTyr: 1.471 ± 0.161
0.0PheXaa: 0.0 ± 0.0
Gly
4.437GlyAla: 4.437 ± 0.252
0.528GlyCys: 0.528 ± 0.062
3.645GlyAsp: 3.645 ± 0.218
3.834GlyGlu: 3.834 ± 0.299
2.715GlyPhe: 2.715 ± 0.182
3.733GlyGly: 3.733 ± 0.349
0.742GlyHis: 0.742 ± 0.085
3.733GlyIle: 3.733 ± 0.277
5.267GlyLys: 5.267 ± 0.274
4.199GlyLeu: 4.199 ± 0.243
1.697GlyMet: 1.697 ± 0.14
3.004GlyAsn: 3.004 ± 0.253
1.156GlyPro: 1.156 ± 0.235
1.697GlyGln: 1.697 ± 0.155
2.879GlyArg: 2.879 ± 0.197
4.676GlySer: 4.676 ± 0.3
4.5GlyThr: 4.5 ± 0.274
4.035GlyVal: 4.035 ± 0.23
0.805GlyTrp: 0.805 ± 0.107
2.489GlyTyr: 2.489 ± 0.168
0.0GlyXaa: 0.0 ± 0.0
His
1.345HisAla: 1.345 ± 0.117
0.339HisCys: 0.339 ± 0.071
0.918HisAsp: 0.918 ± 0.114
1.332HisGlu: 1.332 ± 0.121
0.918HisPhe: 0.918 ± 0.117
1.307HisGly: 1.307 ± 0.107
0.339HisHis: 0.339 ± 0.07
1.081HisIle: 1.081 ± 0.113
1.345HisLys: 1.345 ± 0.108
1.345HisLeu: 1.345 ± 0.16
0.415HisMet: 0.415 ± 0.077
0.805HisAsn: 0.805 ± 0.105
0.729HisPro: 0.729 ± 0.089
0.327HisGln: 0.327 ± 0.078
1.144HisArg: 1.144 ± 0.117
1.081HisSer: 1.081 ± 0.11
0.993HisThr: 0.993 ± 0.112
1.383HisVal: 1.383 ± 0.112
0.214HisTrp: 0.214 ± 0.048
0.893HisTyr: 0.893 ± 0.107
0.0HisXaa: 0.0 ± 0.0
Ile
4.815IleAla: 4.815 ± 0.262
0.855IleCys: 0.855 ± 0.105
4.023IleAsp: 4.023 ± 0.24
4.4IleGlu: 4.4 ± 0.251
2.187IlePhe: 2.187 ± 0.183
3.746IleGly: 3.746 ± 0.25
1.32IleHis: 1.32 ± 0.129
2.766IleIle: 2.766 ± 0.178
3.746IleLys: 3.746 ± 0.244
4.437IleLeu: 4.437 ± 0.25
1.345IleMet: 1.345 ± 0.137
2.967IleAsn: 2.967 ± 0.266
3.243IlePro: 3.243 ± 0.209
2.313IleGln: 2.313 ± 0.149
3.671IleArg: 3.671 ± 0.227
4.852IleSer: 4.852 ± 0.242
3.168IleThr: 3.168 ± 0.197
4.676IleVal: 4.676 ± 0.202
0.541IleTrp: 0.541 ± 0.083
1.898IleTyr: 1.898 ± 0.163
0.0IleXaa: 0.0 ± 0.0
Lys
5.242LysAla: 5.242 ± 0.349
0.528LysCys: 0.528 ± 0.085
3.507LysAsp: 3.507 ± 0.229
4.312LysGlu: 4.312 ± 0.254
2.891LysPhe: 2.891 ± 0.197
3.62LysGly: 3.62 ± 0.275
1.257LysHis: 1.257 ± 0.134
4.035LysIle: 4.035 ± 0.209
6.826LysLys: 6.826 ± 0.582
5.971LysLeu: 5.971 ± 0.349
1.735LysMet: 1.735 ± 0.143
3.319LysAsn: 3.319 ± 0.225
2.854LysPro: 2.854 ± 0.236
2.715LysGln: 2.715 ± 0.163
3.633LysArg: 3.633 ± 0.263
4.362LysSer: 4.362 ± 0.295
4.463LysThr: 4.463 ± 0.243
4.437LysVal: 4.437 ± 0.223
0.578LysTrp: 0.578 ± 0.08
2.476LysTyr: 2.476 ± 0.18
0.0LysXaa: 0.0 ± 0.0
Leu
6.424LeuAla: 6.424 ± 0.296
0.918LeuCys: 0.918 ± 0.109
4.991LeuAsp: 4.991 ± 0.216
5.556LeuGlu: 5.556 ± 0.283
3.495LeuPhe: 3.495 ± 0.21
4.085LeuGly: 4.085 ± 0.244
1.785LeuHis: 1.785 ± 0.14
4.576LeuIle: 4.576 ± 0.215
5.267LeuLys: 5.267 ± 0.313
6.512LeuLeu: 6.512 ± 0.305
2.25LeuMet: 2.25 ± 0.165
4.903LeuAsn: 4.903 ± 0.27
4.261LeuPro: 4.261 ± 0.216
2.414LeuGln: 2.414 ± 0.168
4.915LeuArg: 4.915 ± 0.253
6.801LeuSer: 6.801 ± 0.313
4.676LeuThr: 4.676 ± 0.234
5.028LeuVal: 5.028 ± 0.245
0.792LeuTrp: 0.792 ± 0.102
2.539LeuTyr: 2.539 ± 0.216
0.0LeuXaa: 0.0 ± 0.0
Met
1.911MetAla: 1.911 ± 0.161
0.415MetCys: 0.415 ± 0.083
1.471MetAsp: 1.471 ± 0.131
1.785MetGlu: 1.785 ± 0.151
1.207MetPhe: 1.207 ± 0.115
0.993MetGly: 0.993 ± 0.106
0.339MetHis: 0.339 ± 0.061
1.395MetIle: 1.395 ± 0.126
1.848MetLys: 1.848 ± 0.155
2.036MetLeu: 2.036 ± 0.154
0.616MetMet: 0.616 ± 0.096
1.006MetAsn: 1.006 ± 0.124
1.307MetPro: 1.307 ± 0.126
1.207MetGln: 1.207 ± 0.118
1.672MetArg: 1.672 ± 0.153
1.999MetSer: 1.999 ± 0.155
1.395MetThr: 1.395 ± 0.131
1.282MetVal: 1.282 ± 0.134
0.226MetTrp: 0.226 ± 0.059
0.805MetTyr: 0.805 ± 0.098
0.0MetXaa: 0.0 ± 0.0
Asn
4.186AsnAla: 4.186 ± 0.21
0.566AsnCys: 0.566 ± 0.089
2.64AsnAsp: 2.64 ± 0.16
2.753AsnGlu: 2.753 ± 0.168
2.414AsnPhe: 2.414 ± 0.179
3.495AsnGly: 3.495 ± 0.212
0.591AsnHis: 0.591 ± 0.07
3.268AsnIle: 3.268 ± 0.213
3.042AsnLys: 3.042 ± 0.219
3.997AsnLeu: 3.997 ± 0.205
1.232AsnMet: 1.232 ± 0.122
2.099AsnAsn: 2.099 ± 0.174
2.388AsnPro: 2.388 ± 0.175
1.747AsnGln: 1.747 ± 0.173
2.124AsnArg: 2.124 ± 0.16
3.419AsnSer: 3.419 ± 0.227
2.602AsnThr: 2.602 ± 0.18
3.645AsnVal: 3.645 ± 0.231
0.603AsnTrp: 0.603 ± 0.099
1.609AsnTyr: 1.609 ± 0.136
0.0AsnXaa: 0.0 ± 0.0
Pro
2.854ProAla: 2.854 ± 0.195
0.503ProCys: 0.503 ± 0.076
3.155ProAsp: 3.155 ± 0.192
3.482ProGlu: 3.482 ± 0.217
1.961ProPhe: 1.961 ± 0.134
1.936ProGly: 1.936 ± 0.176
0.654ProHis: 0.654 ± 0.101
2.577ProIle: 2.577 ± 0.188
3.645ProLys: 3.645 ± 0.267
3.268ProLeu: 3.268 ± 0.23
0.993ProMet: 0.993 ± 0.113
1.999ProAsn: 1.999 ± 0.175
1.27ProPro: 1.27 ± 0.152
1.244ProGln: 1.244 ± 0.141
1.785ProArg: 1.785 ± 0.168
2.816ProSer: 2.816 ± 0.173
2.866ProThr: 2.866 ± 0.209
3.268ProVal: 3.268 ± 0.198
0.302ProTrp: 0.302 ± 0.068
1.282ProTyr: 1.282 ± 0.137
0.0ProXaa: 0.0 ± 0.0
Gln
2.376GlnAla: 2.376 ± 0.181
0.277GlnCys: 0.277 ± 0.053
1.886GlnAsp: 1.886 ± 0.153
2.162GlnGlu: 2.162 ± 0.152
1.408GlnPhe: 1.408 ± 0.12
1.521GlnGly: 1.521 ± 0.122
0.754GlnHis: 0.754 ± 0.095
2.828GlnIle: 2.828 ± 0.189
2.376GlnLys: 2.376 ± 0.162
3.13GlnLeu: 3.13 ± 0.186
0.83GlnMet: 0.83 ± 0.092
1.948GlnAsn: 1.948 ± 0.186
1.207GlnPro: 1.207 ± 0.129
1.219GlnGln: 1.219 ± 0.137
1.76GlnArg: 1.76 ± 0.148
2.25GlnSer: 2.25 ± 0.166
2.3GlnThr: 2.3 ± 0.127
2.074GlnVal: 2.074 ± 0.185
0.453GlnTrp: 0.453 ± 0.085
1.546GlnTyr: 1.546 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
3.759ArgAla: 3.759 ± 0.231
0.616ArgCys: 0.616 ± 0.087
3.243ArgAsp: 3.243 ± 0.207
2.942ArgGlu: 2.942 ± 0.21
2.225ArgPhe: 2.225 ± 0.182
3.055ArgGly: 3.055 ± 0.238
0.968ArgHis: 0.968 ± 0.112
3.884ArgIle: 3.884 ± 0.229
3.847ArgLys: 3.847 ± 0.274
4.701ArgLeu: 4.701 ± 0.233
1.659ArgMet: 1.659 ± 0.128
2.791ArgAsn: 2.791 ± 0.194
1.71ArgPro: 1.71 ± 0.151
1.873ArgGln: 1.873 ± 0.167
3.067ArgArg: 3.067 ± 0.215
3.557ArgSer: 3.557 ± 0.245
2.791ArgThr: 2.791 ± 0.17
3.733ArgVal: 3.733 ± 0.217
0.591ArgTrp: 0.591 ± 0.081
1.898ArgTyr: 1.898 ± 0.128
0.0ArgXaa: 0.0 ± 0.0
Ser
6.034SerAla: 6.034 ± 0.278
0.666SerCys: 0.666 ± 0.099
5.028SerAsp: 5.028 ± 0.264
4.789SerGlu: 4.789 ± 0.367
3.067SerPhe: 3.067 ± 0.212
5.028SerGly: 5.028 ± 0.228
1.307SerHis: 1.307 ± 0.127
4.098SerIle: 4.098 ± 0.219
4.098SerLys: 4.098 ± 0.255
5.808SerLeu: 5.808 ± 0.278
1.71SerMet: 1.71 ± 0.14
3.18SerAsn: 3.18 ± 0.213
3.017SerPro: 3.017 ± 0.194
2.212SerGln: 2.212 ± 0.141
3.155SerArg: 3.155 ± 0.232
4.563SerSer: 4.563 ± 0.321
4.161SerThr: 4.161 ± 0.261
5.393SerVal: 5.393 ± 0.26
0.616SerTrp: 0.616 ± 0.094
2.388SerTyr: 2.388 ± 0.195
0.0SerXaa: 0.0 ± 0.0
Thr
4.563ThrAla: 4.563 ± 0.301
0.553ThrCys: 0.553 ± 0.081
3.469ThrAsp: 3.469 ± 0.239
3.205ThrGlu: 3.205 ± 0.183
2.552ThrPhe: 2.552 ± 0.187
4.312ThrGly: 4.312 ± 0.323
1.194ThrHis: 1.194 ± 0.121
3.834ThrIle: 3.834 ± 0.268
3.545ThrLys: 3.545 ± 0.224
5.418ThrLeu: 5.418 ± 0.291
1.081ThrMet: 1.081 ± 0.122
2.615ThrAsn: 2.615 ± 0.172
2.967ThrPro: 2.967 ± 0.213
2.062ThrGln: 2.062 ± 0.201
2.451ThrArg: 2.451 ± 0.187
3.733ThrSer: 3.733 ± 0.24
3.205ThrThr: 3.205 ± 0.223
4.903ThrVal: 4.903 ± 0.265
0.679ThrTrp: 0.679 ± 0.107
1.948ThrTyr: 1.948 ± 0.183
0.0ThrXaa: 0.0 ± 0.0
Val
4.789ValAla: 4.789 ± 0.279
0.779ValCys: 0.779 ± 0.094
4.739ValAsp: 4.739 ± 0.254
4.551ValGlu: 4.551 ± 0.234
2.615ValPhe: 2.615 ± 0.181
3.935ValGly: 3.935 ± 0.213
1.106ValHis: 1.106 ± 0.118
3.658ValIle: 3.658 ± 0.24
4.023ValLys: 4.023 ± 0.27
5.657ValLeu: 5.657 ± 0.293
1.395ValMet: 1.395 ± 0.141
3.155ValAsn: 3.155 ± 0.196
3.394ValPro: 3.394 ± 0.188
2.527ValGln: 2.527 ± 0.205
4.488ValArg: 4.488 ± 0.232
5.493ValSer: 5.493 ± 0.26
4.4ValThr: 4.4 ± 0.275
5.481ValVal: 5.481 ± 0.303
0.918ValTrp: 0.918 ± 0.127
2.426ValTyr: 2.426 ± 0.155
0.0ValXaa: 0.0 ± 0.0
Trp
0.679TrpAla: 0.679 ± 0.092
0.151TrpCys: 0.151 ± 0.04
0.503TrpAsp: 0.503 ± 0.079
0.616TrpGlu: 0.616 ± 0.08
0.415TrpPhe: 0.415 ± 0.094
0.578TrpGly: 0.578 ± 0.084
0.365TrpHis: 0.365 ± 0.074
0.654TrpIle: 0.654 ± 0.09
0.654TrpLys: 0.654 ± 0.089
0.88TrpLeu: 0.88 ± 0.116
0.302TrpMet: 0.302 ± 0.067
0.553TrpAsn: 0.553 ± 0.084
0.415TrpPro: 0.415 ± 0.07
0.541TrpGln: 0.541 ± 0.075
0.742TrpArg: 0.742 ± 0.094
0.591TrpSer: 0.591 ± 0.092
0.779TrpThr: 0.779 ± 0.107
0.754TrpVal: 0.754 ± 0.107
0.063TrpTrp: 0.063 ± 0.033
0.339TrpTyr: 0.339 ± 0.053
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.64TyrAla: 2.64 ± 0.206
0.591TyrCys: 0.591 ± 0.09
2.539TyrAsp: 2.539 ± 0.207
1.76TyrGlu: 1.76 ± 0.172
1.609TyrPhe: 1.609 ± 0.144
2.502TyrGly: 2.502 ± 0.179
0.717TyrHis: 0.717 ± 0.089
1.76TyrIle: 1.76 ± 0.151
1.936TyrLys: 1.936 ± 0.145
2.64TyrLeu: 2.64 ± 0.173
1.006TyrMet: 1.006 ± 0.097
1.571TyrAsn: 1.571 ± 0.154
1.307TyrPro: 1.307 ± 0.119
1.345TyrGln: 1.345 ± 0.137
2.15TyrArg: 2.15 ± 0.171
2.753TyrSer: 2.753 ± 0.214
1.999TyrThr: 1.999 ± 0.184
2.451TyrVal: 2.451 ± 0.169
0.377TyrTrp: 0.377 ± 0.056
1.27TyrTyr: 1.27 ± 0.121
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 318 proteins (79552 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski