Amino acid dipepetide frequency for Pseudomonas phage Noxifer

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.723AlaAla: 8.723 ± 0.541
0.69AlaCys: 0.69 ± 0.095
5.57AlaAsp: 5.57 ± 0.238
5.294AlaGlu: 5.294 ± 0.309
3.027AlaPhe: 3.027 ± 0.197
5.72AlaGly: 5.72 ± 0.354
1.496AlaHis: 1.496 ± 0.145
4.12AlaIle: 4.12 ± 0.231
4.5AlaLys: 4.5 ± 0.259
7.515AlaLeu: 7.515 ± 0.311
3.188AlaMet: 3.188 ± 0.161
3.671AlaAsn: 3.671 ± 0.194
3.418AlaPro: 3.418 ± 0.245
3.337AlaGln: 3.337 ± 0.201
4.051AlaArg: 4.051 ± 0.218
4.477AlaSer: 4.477 ± 0.275
5.11AlaThr: 5.11 ± 0.244
6.065AlaVal: 6.065 ± 0.263
1.082AlaTrp: 1.082 ± 0.119
3.268AlaTyr: 3.268 ± 0.208
0.0AlaXaa: 0.0 ± 0.0
Cys
0.621CysAla: 0.621 ± 0.094
0.092CysCys: 0.092 ± 0.034
0.541CysAsp: 0.541 ± 0.09
0.656CysGlu: 0.656 ± 0.081
0.46CysPhe: 0.46 ± 0.081
0.529CysGly: 0.529 ± 0.084
0.23CysHis: 0.23 ± 0.064
0.38CysIle: 0.38 ± 0.072
0.426CysLys: 0.426 ± 0.07
0.84CysLeu: 0.84 ± 0.093
0.311CysMet: 0.311 ± 0.068
0.414CysAsn: 0.414 ± 0.07
0.368CysPro: 0.368 ± 0.069
0.242CysGln: 0.242 ± 0.055
0.714CysArg: 0.714 ± 0.099
0.46CysSer: 0.46 ± 0.085
0.495CysThr: 0.495 ± 0.063
0.829CysVal: 0.829 ± 0.112
0.081CysTrp: 0.081 ± 0.028
0.414CysTyr: 0.414 ± 0.076
0.0CysXaa: 0.0 ± 0.0
Asp
5.271AspAla: 5.271 ± 0.244
0.587AspCys: 0.587 ± 0.087
3.89AspAsp: 3.89 ± 0.228
3.993AspGlu: 3.993 ± 0.202
2.474AspPhe: 2.474 ± 0.201
4.718AspGly: 4.718 ± 0.302
1.358AspHis: 1.358 ± 0.147
3.706AspIle: 3.706 ± 0.21
3.291AspLys: 3.291 ± 0.301
5.766AspLeu: 5.766 ± 0.242
1.772AspMet: 1.772 ± 0.158
2.612AspAsn: 2.612 ± 0.184
3.429AspPro: 3.429 ± 0.183
2.187AspGln: 2.187 ± 0.155
3.395AspArg: 3.395 ± 0.196
3.084AspSer: 3.084 ± 0.181
3.66AspThr: 3.66 ± 0.236
4.845AspVal: 4.845 ± 0.22
0.886AspTrp: 0.886 ± 0.1
2.635AspTyr: 2.635 ± 0.166
0.0AspXaa: 0.0 ± 0.0
Glu
5.259GluAla: 5.259 ± 0.251
0.587GluCys: 0.587 ± 0.096
3.913GluAsp: 3.913 ± 0.23
4.373GluGlu: 4.373 ± 0.288
2.889GluPhe: 2.889 ± 0.183
3.901GluGly: 3.901 ± 0.223
1.795GluHis: 1.795 ± 0.176
3.602GluIle: 3.602 ± 0.203
2.9GluLys: 2.9 ± 0.229
6.548GluLeu: 6.548 ± 0.281
1.669GluMet: 1.669 ± 0.15
2.233GluAsn: 2.233 ± 0.156
3.015GluPro: 3.015 ± 0.244
2.866GluGln: 2.866 ± 0.188
3.521GluArg: 3.521 ± 0.226
2.969GluSer: 2.969 ± 0.196
3.268GluThr: 3.268 ± 0.201
4.626GluVal: 4.626 ± 0.239
1.024GluTrp: 1.024 ± 0.111
2.221GluTyr: 2.221 ± 0.145
0.0GluXaa: 0.0 ± 0.0
Phe
2.831PheAla: 2.831 ± 0.171
0.334PheCys: 0.334 ± 0.067
2.727PheAsp: 2.727 ± 0.166
2.497PheGlu: 2.497 ± 0.174
1.416PhePhe: 1.416 ± 0.143
2.647PheGly: 2.647 ± 0.149
0.99PheHis: 0.99 ± 0.104
2.302PheIle: 2.302 ± 0.135
2.382PheLys: 2.382 ± 0.153
2.67PheLeu: 2.67 ± 0.157
1.277PheMet: 1.277 ± 0.122
2.198PheAsn: 2.198 ± 0.157
1.519PhePro: 1.519 ± 0.128
1.312PheGln: 1.312 ± 0.134
2.106PheArg: 2.106 ± 0.159
2.325PheSer: 2.325 ± 0.181
2.601PheThr: 2.601 ± 0.177
2.463PheVal: 2.463 ± 0.164
0.564PheTrp: 0.564 ± 0.095
1.462PheTyr: 1.462 ± 0.117
0.0PheXaa: 0.0 ± 0.0
Gly
4.879GlyAla: 4.879 ± 0.377
0.679GlyCys: 0.679 ± 0.101
4.925GlyAsp: 4.925 ± 0.451
4.431GlyGlu: 4.431 ± 0.234
2.889GlyPhe: 2.889 ± 0.201
5.317GlyGly: 5.317 ± 0.458
1.162GlyHis: 1.162 ± 0.124
3.775GlyIle: 3.775 ± 0.205
4.005GlyLys: 4.005 ± 0.274
5.478GlyLeu: 5.478 ± 0.241
1.818GlyMet: 1.818 ± 0.121
3.326GlyAsn: 3.326 ± 0.221
1.899GlyPro: 1.899 ± 0.205
2.75GlyGln: 2.75 ± 0.187
3.372GlyArg: 3.372 ± 0.222
3.66GlySer: 3.66 ± 0.237
4.373GlyThr: 4.373 ± 0.306
5.317GlyVal: 5.317 ± 0.26
1.128GlyTrp: 1.128 ± 0.136
2.75GlyTyr: 2.75 ± 0.205
0.0GlyXaa: 0.0 ± 0.0
His
1.623HisAla: 1.623 ± 0.15
0.357HisCys: 0.357 ± 0.07
1.243HisAsp: 1.243 ± 0.112
1.369HisGlu: 1.369 ± 0.144
0.783HisPhe: 0.783 ± 0.084
1.289HisGly: 1.289 ± 0.12
0.564HisHis: 0.564 ± 0.094
1.289HisIle: 1.289 ± 0.12
0.978HisLys: 0.978 ± 0.113
1.864HisLeu: 1.864 ± 0.156
0.725HisMet: 0.725 ± 0.09
1.036HisAsn: 1.036 ± 0.113
1.174HisPro: 1.174 ± 0.118
0.771HisGln: 0.771 ± 0.103
1.646HisArg: 1.646 ± 0.139
1.197HisSer: 1.197 ± 0.123
1.323HisThr: 1.323 ± 0.119
1.761HisVal: 1.761 ± 0.147
0.46HisTrp: 0.46 ± 0.074
1.266HisTyr: 1.266 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.753IleAla: 4.753 ± 0.232
0.495IleCys: 0.495 ± 0.078
4.005IleAsp: 4.005 ± 0.195
4.028IleGlu: 4.028 ± 0.22
1.346IlePhe: 1.346 ± 0.137
3.579IleGly: 3.579 ± 0.192
1.174IleHis: 1.174 ± 0.134
2.417IleIle: 2.417 ± 0.182
3.211IleLys: 3.211 ± 0.199
4.12IleLeu: 4.12 ± 0.222
1.3IleMet: 1.3 ± 0.149
3.119IleAsn: 3.119 ± 0.175
2.843IlePro: 2.843 ± 0.193
2.279IleGln: 2.279 ± 0.184
3.28IleArg: 3.28 ± 0.206
3.015IleSer: 3.015 ± 0.159
3.855IleThr: 3.855 ± 0.236
3.579IleVal: 3.579 ± 0.229
0.552IleTrp: 0.552 ± 0.09
1.784IleTyr: 1.784 ± 0.155
0.0IleXaa: 0.0 ± 0.0
Lys
4.649LysAla: 4.649 ± 0.316
0.345LysCys: 0.345 ± 0.071
3.096LysAsp: 3.096 ± 0.219
3.706LysGlu: 3.706 ± 0.224
1.956LysPhe: 1.956 ± 0.161
3.314LysGly: 3.314 ± 0.369
1.3LysHis: 1.3 ± 0.135
2.52LysIle: 2.52 ± 0.172
2.75LysLys: 2.75 ± 0.263
4.638LysLeu: 4.638 ± 0.265
1.485LysMet: 1.485 ± 0.134
1.784LysAsn: 1.784 ± 0.151
2.635LysPro: 2.635 ± 0.18
2.129LysGln: 2.129 ± 0.143
3.027LysArg: 3.027 ± 0.203
2.532LysSer: 2.532 ± 0.188
3.119LysThr: 3.119 ± 0.214
3.545LysVal: 3.545 ± 0.186
0.886LysTrp: 0.886 ± 0.121
1.738LysTyr: 1.738 ± 0.126
0.0LysXaa: 0.0 ± 0.0
Leu
7.262LeuAla: 7.262 ± 0.295
0.714LeuCys: 0.714 ± 0.09
5.754LeuAsp: 5.754 ± 0.298
5.42LeuGlu: 5.42 ± 0.229
3.291LeuPhe: 3.291 ± 0.195
6.03LeuGly: 6.03 ± 0.311
1.634LeuHis: 1.634 ± 0.119
4.787LeuIle: 4.787 ± 0.244
4.799LeuLys: 4.799 ± 0.261
7.549LeuLeu: 7.549 ± 0.325
2.302LeuMet: 2.302 ± 0.162
4.373LeuAsn: 4.373 ± 0.208
4.339LeuPro: 4.339 ± 0.196
3.015LeuGln: 3.015 ± 0.188
5.282LeuArg: 5.282 ± 0.26
5.351LeuSer: 5.351 ± 0.27
6.145LeuThr: 6.145 ± 0.267
6.099LeuVal: 6.099 ± 0.275
0.932LeuTrp: 0.932 ± 0.101
2.912LeuTyr: 2.912 ± 0.185
0.0LeuXaa: 0.0 ± 0.0
Met
2.739MetAla: 2.739 ± 0.166
0.207MetCys: 0.207 ± 0.041
1.577MetAsp: 1.577 ± 0.123
1.795MetGlu: 1.795 ± 0.148
1.139MetPhe: 1.139 ± 0.092
1.979MetGly: 1.979 ± 0.148
0.737MetHis: 0.737 ± 0.113
1.485MetIle: 1.485 ± 0.126
1.174MetLys: 1.174 ± 0.108
2.451MetLeu: 2.451 ± 0.172
0.714MetMet: 0.714 ± 0.086
1.266MetAsn: 1.266 ± 0.132
1.312MetPro: 1.312 ± 0.116
1.381MetGln: 1.381 ± 0.133
1.749MetArg: 1.749 ± 0.15
2.302MetSer: 2.302 ± 0.157
1.945MetThr: 1.945 ± 0.137
2.094MetVal: 2.094 ± 0.151
0.242MetTrp: 0.242 ± 0.051
1.116MetTyr: 1.116 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.798AsnAla: 3.798 ± 0.247
0.357AsnCys: 0.357 ± 0.067
2.762AsnAsp: 2.762 ± 0.186
2.773AsnGlu: 2.773 ± 0.184
1.623AsnPhe: 1.623 ± 0.153
3.498AsnGly: 3.498 ± 0.254
1.082AsnHis: 1.082 ± 0.125
2.935AsnIle: 2.935 ± 0.199
1.922AsnLys: 1.922 ± 0.155
3.913AsnLeu: 3.913 ± 0.175
1.346AsnMet: 1.346 ± 0.128
2.325AsnAsn: 2.325 ± 0.159
2.716AsnPro: 2.716 ± 0.195
1.68AsnGln: 1.68 ± 0.123
2.566AsnArg: 2.566 ± 0.177
1.968AsnSer: 1.968 ± 0.137
2.866AsnThr: 2.866 ± 0.189
3.234AsnVal: 3.234 ± 0.208
0.667AsnTrp: 0.667 ± 0.09
1.634AsnTyr: 1.634 ± 0.165
0.0AsnXaa: 0.0 ± 0.0
Pro
4.511ProAla: 4.511 ± 0.307
0.299ProCys: 0.299 ± 0.057
3.05ProAsp: 3.05 ± 0.189
3.303ProGlu: 3.303 ± 0.236
1.91ProPhe: 1.91 ± 0.15
3.015ProGly: 3.015 ± 0.183
1.22ProHis: 1.22 ± 0.138
2.267ProIle: 2.267 ± 0.183
2.727ProLys: 2.727 ± 0.193
3.786ProLeu: 3.786 ± 0.219
1.392ProMet: 1.392 ± 0.138
2.141ProAsn: 2.141 ± 0.156
1.922ProPro: 1.922 ± 0.186
1.565ProGln: 1.565 ± 0.124
2.002ProArg: 2.002 ± 0.165
2.187ProSer: 2.187 ± 0.166
3.418ProThr: 3.418 ± 0.192
3.936ProVal: 3.936 ± 0.206
0.38ProTrp: 0.38 ± 0.066
1.623ProTyr: 1.623 ± 0.127
0.0ProXaa: 0.0 ± 0.0
Gln
3.637GlnAla: 3.637 ± 0.21
0.437GlnCys: 0.437 ± 0.089
1.369GlnAsp: 1.369 ± 0.128
2.129GlnGlu: 2.129 ± 0.136
1.887GlnPhe: 1.887 ± 0.145
2.233GlnGly: 2.233 ± 0.177
1.036GlnHis: 1.036 ± 0.115
2.198GlnIle: 2.198 ± 0.157
1.657GlnLys: 1.657 ± 0.135
3.832GlnLeu: 3.832 ± 0.207
1.151GlnMet: 1.151 ± 0.103
1.404GlnAsn: 1.404 ± 0.114
1.807GlnPro: 1.807 ± 0.148
1.795GlnGln: 1.795 ± 0.192
2.117GlnArg: 2.117 ± 0.12
1.749GlnSer: 1.749 ± 0.162
2.348GlnThr: 2.348 ± 0.178
2.889GlnVal: 2.889 ± 0.192
0.518GlnTrp: 0.518 ± 0.089
1.657GlnTyr: 1.657 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
3.74ArgAla: 3.74 ± 0.222
0.587ArgCys: 0.587 ± 0.091
3.591ArgAsp: 3.591 ± 0.211
3.337ArgGlu: 3.337 ± 0.237
2.555ArgPhe: 2.555 ± 0.182
3.475ArgGly: 3.475 ± 0.216
1.462ArgHis: 1.462 ± 0.144
3.176ArgIle: 3.176 ± 0.178
2.935ArgLys: 2.935 ± 0.158
5.282ArgLeu: 5.282 ± 0.246
1.795ArgMet: 1.795 ± 0.147
2.532ArgAsn: 2.532 ± 0.199
2.002ArgPro: 2.002 ± 0.149
1.991ArgGln: 1.991 ± 0.156
3.13ArgArg: 3.13 ± 0.206
3.073ArgSer: 3.073 ± 0.207
2.831ArgThr: 2.831 ± 0.174
3.867ArgVal: 3.867 ± 0.202
0.978ArgTrp: 0.978 ± 0.094
2.221ArgTyr: 2.221 ± 0.164
0.0ArgXaa: 0.0 ± 0.0
Ser
4.35SerAla: 4.35 ± 0.242
0.414SerCys: 0.414 ± 0.073
3.452SerAsp: 3.452 ± 0.202
2.981SerGlu: 2.981 ± 0.183
2.221SerPhe: 2.221 ± 0.185
3.763SerGly: 3.763 ± 0.209
1.07SerHis: 1.07 ± 0.123
3.245SerIle: 3.245 ± 0.22
2.578SerLys: 2.578 ± 0.184
4.972SerLeu: 4.972 ± 0.244
1.991SerMet: 1.991 ± 0.142
2.67SerAsn: 2.67 ± 0.182
2.244SerPro: 2.244 ± 0.154
1.749SerGln: 1.749 ± 0.143
2.543SerArg: 2.543 ± 0.172
2.992SerSer: 2.992 ± 0.196
3.326SerThr: 3.326 ± 0.202
4.212SerVal: 4.212 ± 0.214
0.794SerTrp: 0.794 ± 0.107
2.279SerTyr: 2.279 ± 0.17
0.0SerXaa: 0.0 ± 0.0
Thr
5.328ThrAla: 5.328 ± 0.332
0.518ThrCys: 0.518 ± 0.077
3.786ThrAsp: 3.786 ± 0.215
3.556ThrGlu: 3.556 ± 0.219
2.302ThrPhe: 2.302 ± 0.168
4.81ThrGly: 4.81 ± 0.343
1.496ThrHis: 1.496 ± 0.158
3.614ThrIle: 3.614 ± 0.192
2.681ThrLys: 2.681 ± 0.145
5.512ThrLeu: 5.512 ± 0.271
1.416ThrMet: 1.416 ± 0.126
2.658ThrAsn: 2.658 ± 0.207
3.982ThrPro: 3.982 ± 0.217
2.635ThrGln: 2.635 ± 0.169
3.004ThrArg: 3.004 ± 0.201
3.073ThrSer: 3.073 ± 0.218
4.154ThrThr: 4.154 ± 0.267
4.626ThrVal: 4.626 ± 0.241
0.852ThrTrp: 0.852 ± 0.101
2.578ThrTyr: 2.578 ± 0.164
0.0ThrXaa: 0.0 ± 0.0
Val
6.318ValAla: 6.318 ± 0.266
0.644ValCys: 0.644 ± 0.084
4.822ValAsp: 4.822 ± 0.218
4.776ValGlu: 4.776 ± 0.274
2.394ValPhe: 2.394 ± 0.156
4.845ValGly: 4.845 ± 0.265
1.577ValHis: 1.577 ± 0.147
3.97ValIle: 3.97 ± 0.203
3.936ValLys: 3.936 ± 0.216
6.111ValLeu: 6.111 ± 0.247
2.256ValMet: 2.256 ± 0.17
3.521ValAsn: 3.521 ± 0.223
3.637ValPro: 3.637 ± 0.236
2.428ValGln: 2.428 ± 0.179
3.844ValArg: 3.844 ± 0.186
4.557ValSer: 4.557 ± 0.248
4.373ValThr: 4.373 ± 0.265
5.8ValVal: 5.8 ± 0.27
0.944ValTrp: 0.944 ± 0.103
2.808ValTyr: 2.808 ± 0.206
0.0ValXaa: 0.0 ± 0.0
Trp
1.162TrpAla: 1.162 ± 0.124
0.184TrpCys: 0.184 ± 0.05
0.84TrpAsp: 0.84 ± 0.089
0.679TrpGlu: 0.679 ± 0.076
0.656TrpPhe: 0.656 ± 0.082
0.829TrpGly: 0.829 ± 0.088
0.311TrpHis: 0.311 ± 0.062
0.783TrpIle: 0.783 ± 0.105
0.483TrpLys: 0.483 ± 0.059
1.6TrpLeu: 1.6 ± 0.118
0.38TrpMet: 0.38 ± 0.068
0.587TrpAsn: 0.587 ± 0.086
0.529TrpPro: 0.529 ± 0.077
0.483TrpGln: 0.483 ± 0.077
0.875TrpArg: 0.875 ± 0.103
0.69TrpSer: 0.69 ± 0.086
0.771TrpThr: 0.771 ± 0.091
1.082TrpVal: 1.082 ± 0.113
0.219TrpTrp: 0.219 ± 0.047
0.587TrpTyr: 0.587 ± 0.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.866TyrAla: 2.866 ± 0.175
0.575TyrCys: 0.575 ± 0.087
2.647TyrAsp: 2.647 ± 0.214
2.083TyrGlu: 2.083 ± 0.138
1.404TyrPhe: 1.404 ± 0.123
2.451TyrGly: 2.451 ± 0.186
1.024TyrHis: 1.024 ± 0.105
2.164TyrIle: 2.164 ± 0.177
1.818TyrLys: 1.818 ± 0.151
3.545TyrLeu: 3.545 ± 0.216
1.105TyrMet: 1.105 ± 0.123
1.795TyrAsn: 1.795 ± 0.152
1.864TyrPro: 1.864 ± 0.142
1.254TyrGln: 1.254 ± 0.113
2.336TyrArg: 2.336 ± 0.164
2.187TyrSer: 2.187 ± 0.158
2.601TyrThr: 2.601 ± 0.195
2.693TyrVal: 2.693 ± 0.195
0.541TyrTrp: 0.541 ± 0.074
1.887TyrTyr: 1.887 ± 0.13
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 334 proteins (86896 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski