Amino acid dipepetide frequency for Pseudomonas phage PA1C

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.0AlaAla: 6.0 ± 0.448
0.566AlaCys: 0.566 ± 0.081
3.748AlaAsp: 3.748 ± 0.215
4.666AlaGlu: 4.666 ± 0.31
2.52AlaPhe: 2.52 ± 0.164
4.687AlaGly: 4.687 ± 0.315
1.281AlaHis: 1.281 ± 0.108
5.018AlaIle: 5.018 ± 0.235
3.95AlaLys: 3.95 ± 0.265
6.235AlaLeu: 6.235 ± 0.249
2.36AlaMet: 2.36 ± 0.175
3.534AlaAsn: 3.534 ± 0.209
2.349AlaPro: 2.349 ± 0.185
2.424AlaGln: 2.424 ± 0.22
3.096AlaArg: 3.096 ± 0.195
4.175AlaSer: 4.175 ± 0.214
4.516AlaThr: 4.516 ± 0.241
5.168AlaVal: 5.168 ± 0.272
0.876AlaTrp: 0.876 ± 0.099
2.958AlaTyr: 2.958 ± 0.184
0.0AlaXaa: 0.0 ± 0.0
Cys
0.427CysAla: 0.427 ± 0.063
0.203CysCys: 0.203 ± 0.042
0.545CysAsp: 0.545 ± 0.072
0.651CysGlu: 0.651 ± 0.081
0.331CysPhe: 0.331 ± 0.062
0.63CysGly: 0.63 ± 0.093
0.192CysHis: 0.192 ± 0.05
0.715CysIle: 0.715 ± 0.095
0.641CysLys: 0.641 ± 0.084
0.641CysLeu: 0.641 ± 0.079
0.235CysMet: 0.235 ± 0.054
0.416CysAsn: 0.416 ± 0.062
0.352CysPro: 0.352 ± 0.07
0.374CysGln: 0.374 ± 0.061
0.523CysArg: 0.523 ± 0.075
0.448CysSer: 0.448 ± 0.086
0.555CysThr: 0.555 ± 0.075
0.705CysVal: 0.705 ± 0.1
0.182CysTrp: 0.182 ± 0.043
0.352CysTyr: 0.352 ± 0.063
0.0CysXaa: 0.0 ± 0.0
Asp
4.015AspAla: 4.015 ± 0.229
0.523AspCys: 0.523 ± 0.067
4.26AspAsp: 4.26 ± 0.22
4.559AspGlu: 4.559 ± 0.202
2.648AspPhe: 2.648 ± 0.155
4.516AspGly: 4.516 ± 0.266
1.142AspHis: 1.142 ± 0.1
4.292AspIle: 4.292 ± 0.211
3.822AspLys: 3.822 ± 0.263
5.904AspLeu: 5.904 ± 0.225
1.527AspMet: 1.527 ± 0.132
3.214AspAsn: 3.214 ± 0.19
3.363AspPro: 3.363 ± 0.203
2.296AspGln: 2.296 ± 0.17
3.032AspArg: 3.032 ± 0.155
3.075AspSer: 3.075 ± 0.18
3.694AspThr: 3.694 ± 0.204
4.879AspVal: 4.879 ± 0.238
0.79AspTrp: 0.79 ± 0.085
2.584AspTyr: 2.584 ± 0.181
0.0AspXaa: 0.0 ± 0.0
Glu
5.306GluAla: 5.306 ± 0.3
0.48GluCys: 0.48 ± 0.075
3.95GluAsp: 3.95 ± 0.263
4.879GluGlu: 4.879 ± 0.465
2.979GluPhe: 2.979 ± 0.201
3.662GluGly: 3.662 ± 0.183
1.644GluHis: 1.644 ± 0.13
4.506GluIle: 4.506 ± 0.254
3.577GluLys: 3.577 ± 0.198
7.154GluLeu: 7.154 ± 0.272
2.135GluMet: 2.135 ± 0.172
3.171GluAsn: 3.171 ± 0.192
2.125GluPro: 2.125 ± 0.193
3.086GluGln: 3.086 ± 0.186
3.427GluArg: 3.427 ± 0.194
3.406GluSer: 3.406 ± 0.247
4.132GluThr: 4.132 ± 0.343
5.05GluVal: 5.05 ± 0.235
1.142GluTrp: 1.142 ± 0.114
2.627GluTyr: 2.627 ± 0.15
0.0GluXaa: 0.0 ± 0.0
Phe
2.317PheAla: 2.317 ± 0.14
0.363PheCys: 0.363 ± 0.063
3.032PheAsp: 3.032 ± 0.148
2.637PheGlu: 2.637 ± 0.174
1.505PhePhe: 1.505 ± 0.132
2.861PheGly: 2.861 ± 0.179
0.94PheHis: 0.94 ± 0.093
2.733PheIle: 2.733 ± 0.172
2.744PheLys: 2.744 ± 0.204
2.744PheLeu: 2.744 ± 0.167
1.078PheMet: 1.078 ± 0.104
2.648PheAsn: 2.648 ± 0.197
1.537PhePro: 1.537 ± 0.119
1.377PheGln: 1.377 ± 0.143
2.135PheArg: 2.135 ± 0.137
2.669PheSer: 2.669 ± 0.18
2.552PheThr: 2.552 ± 0.15
3.086PheVal: 3.086 ± 0.179
0.395PheTrp: 0.395 ± 0.064
1.879PheTyr: 1.879 ± 0.14
0.0PheXaa: 0.0 ± 0.0
Gly
3.908GlyAla: 3.908 ± 0.292
0.566GlyCys: 0.566 ± 0.085
4.698GlyAsp: 4.698 ± 0.432
4.346GlyGlu: 4.346 ± 0.247
2.616GlyPhe: 2.616 ± 0.174
4.655GlyGly: 4.655 ± 0.3
1.164GlyHis: 1.164 ± 0.106
4.207GlyIle: 4.207 ± 0.21
4.207GlyLys: 4.207 ± 0.214
4.815GlyLeu: 4.815 ± 0.235
2.007GlyMet: 2.007 ± 0.154
3.566GlyAsn: 3.566 ± 0.191
1.858GlyPro: 1.858 ± 0.175
2.253GlyGln: 2.253 ± 0.161
3.363GlyArg: 3.363 ± 0.24
3.833GlySer: 3.833 ± 0.189
4.228GlyThr: 4.228 ± 0.268
4.431GlyVal: 4.431 ± 0.208
1.142GlyTrp: 1.142 ± 0.114
2.787GlyTyr: 2.787 ± 0.202
0.0GlyXaa: 0.0 ± 0.0
His
1.185HisAla: 1.185 ± 0.116
0.246HisCys: 0.246 ± 0.051
1.239HisAsp: 1.239 ± 0.109
1.303HisGlu: 1.303 ± 0.115
0.961HisPhe: 0.961 ± 0.112
1.57HisGly: 1.57 ± 0.137
0.502HisHis: 0.502 ± 0.074
1.537HisIle: 1.537 ± 0.134
1.014HisLys: 1.014 ± 0.108
1.623HisLeu: 1.623 ± 0.122
0.502HisMet: 0.502 ± 0.073
0.908HisAsn: 0.908 ± 0.1
1.174HisPro: 1.174 ± 0.114
0.63HisGln: 0.63 ± 0.082
1.431HisArg: 1.431 ± 0.138
0.897HisSer: 0.897 ± 0.095
1.1HisThr: 1.1 ± 0.104
1.431HisVal: 1.431 ± 0.132
0.352HisTrp: 0.352 ± 0.067
0.961HisTyr: 0.961 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
5.007IleAla: 5.007 ± 0.238
0.534IleCys: 0.534 ± 0.081
4.943IleAsp: 4.943 ± 0.212
4.89IleGlu: 4.89 ± 0.252
2.082IlePhe: 2.082 ± 0.14
3.833IleGly: 3.833 ± 0.211
1.388IleHis: 1.388 ± 0.123
3.673IleIle: 3.673 ± 0.229
3.555IleLys: 3.555 ± 0.212
4.719IleLeu: 4.719 ± 0.2
1.388IleMet: 1.388 ± 0.114
3.555IleAsn: 3.555 ± 0.217
3.491IlePro: 3.491 ± 0.203
2.349IleGln: 2.349 ± 0.152
3.801IleArg: 3.801 ± 0.244
4.132IleSer: 4.132 ± 0.207
4.153IleThr: 4.153 ± 0.219
4.271IleVal: 4.271 ± 0.222
0.779IleTrp: 0.779 ± 0.119
2.264IleTyr: 2.264 ± 0.185
0.0IleXaa: 0.0 ± 0.0
Lys
5.082LysAla: 5.082 ± 0.279
0.406LysCys: 0.406 ± 0.067
3.395LysAsp: 3.395 ± 0.197
4.719LysGlu: 4.719 ± 0.255
2.584LysPhe: 2.584 ± 0.156
4.036LysGly: 4.036 ± 0.429
1.42LysHis: 1.42 ± 0.124
3.203LysIle: 3.203 ± 0.157
3.363LysLys: 3.363 ± 0.245
5.296LysLeu: 5.296 ± 0.253
1.965LysMet: 1.965 ± 0.142
2.498LysAsn: 2.498 ± 0.147
2.178LysPro: 2.178 ± 0.148
1.804LysGln: 1.804 ± 0.119
2.893LysArg: 2.893 ± 0.183
2.893LysSer: 2.893 ± 0.187
3.139LysThr: 3.139 ± 0.178
4.463LysVal: 4.463 ± 0.208
0.897LysTrp: 0.897 ± 0.094
2.296LysTyr: 2.296 ± 0.139
0.0LysXaa: 0.0 ± 0.0
Leu
5.894LeuAla: 5.894 ± 0.27
0.908LeuCys: 0.908 ± 0.091
5.894LeuAsp: 5.894 ± 0.262
5.958LeuGlu: 5.958 ± 0.251
3.406LeuPhe: 3.406 ± 0.217
5.093LeuGly: 5.093 ± 0.237
1.836LeuHis: 1.836 ± 0.142
5.061LeuIle: 5.061 ± 0.241
4.922LeuLys: 4.922 ± 0.269
6.097LeuLeu: 6.097 ± 0.302
2.135LeuMet: 2.135 ± 0.15
4.41LeuAsn: 4.41 ± 0.212
4.1LeuPro: 4.1 ± 0.187
2.776LeuGln: 2.776 ± 0.165
4.933LeuArg: 4.933 ± 0.226
5.36LeuSer: 5.36 ± 0.249
5.435LeuThr: 5.435 ± 0.233
5.68LeuVal: 5.68 ± 0.224
0.982LeuTrp: 0.982 ± 0.104
3.118LeuTyr: 3.118 ± 0.191
0.0LeuXaa: 0.0 ± 0.0
Met
2.424MetAla: 2.424 ± 0.157
0.256MetCys: 0.256 ± 0.057
1.655MetAsp: 1.655 ± 0.143
1.698MetGlu: 1.698 ± 0.149
1.409MetPhe: 1.409 ± 0.117
1.762MetGly: 1.762 ± 0.171
0.577MetHis: 0.577 ± 0.079
1.655MetIle: 1.655 ± 0.129
1.655MetLys: 1.655 ± 0.11
2.103MetLeu: 2.103 ± 0.132
0.843MetMet: 0.843 ± 0.14
1.441MetAsn: 1.441 ± 0.121
1.132MetPro: 1.132 ± 0.109
0.929MetGln: 0.929 ± 0.107
1.473MetArg: 1.473 ± 0.133
2.445MetSer: 2.445 ± 0.146
1.751MetThr: 1.751 ± 0.142
2.082MetVal: 2.082 ± 0.147
0.235MetTrp: 0.235 ± 0.053
1.121MetTyr: 1.121 ± 0.126
0.0MetXaa: 0.0 ± 0.0
Asn
3.662AsnAla: 3.662 ± 0.25
0.48AsnCys: 0.48 ± 0.07
2.755AsnAsp: 2.755 ± 0.158
3.16AsnGlu: 3.16 ± 0.178
1.847AsnPhe: 1.847 ± 0.142
4.175AsnGly: 4.175 ± 0.224
1.014AsnHis: 1.014 ± 0.105
3.321AsnIle: 3.321 ± 0.198
2.958AsnLys: 2.958 ± 0.17
4.057AsnLeu: 4.057 ± 0.188
1.463AsnMet: 1.463 ± 0.125
3.406AsnAsn: 3.406 ± 0.233
3.043AsnPro: 3.043 ± 0.202
1.73AsnGln: 1.73 ± 0.138
2.701AsnArg: 2.701 ± 0.169
3.182AsnSer: 3.182 ± 0.183
3.15AsnThr: 3.15 ± 0.205
3.555AsnVal: 3.555 ± 0.199
0.843AsnTrp: 0.843 ± 0.086
1.997AsnTyr: 1.997 ± 0.144
0.0AsnXaa: 0.0 ± 0.0
Pro
2.904ProAla: 2.904 ± 0.21
0.384ProCys: 0.384 ± 0.066
2.744ProAsp: 2.744 ± 0.178
3.331ProGlu: 3.331 ± 0.219
1.879ProPhe: 1.879 ± 0.145
2.562ProGly: 2.562 ± 0.155
0.673ProHis: 0.673 ± 0.082
2.562ProIle: 2.562 ± 0.153
2.274ProLys: 2.274 ± 0.16
3.214ProLeu: 3.214 ± 0.199
1.089ProMet: 1.089 ± 0.09
2.071ProAsn: 2.071 ± 0.148
1.409ProPro: 1.409 ± 0.119
1.409ProGln: 1.409 ± 0.154
1.9ProArg: 1.9 ± 0.125
2.349ProSer: 2.349 ± 0.155
2.915ProThr: 2.915 ± 0.18
3.353ProVal: 3.353 ± 0.208
0.459ProTrp: 0.459 ± 0.067
1.484ProTyr: 1.484 ± 0.124
0.0ProXaa: 0.0 ± 0.0
Gln
2.669GlnAla: 2.669 ± 0.225
0.384GlnCys: 0.384 ± 0.07
1.666GlnAsp: 1.666 ± 0.137
2.402GlnGlu: 2.402 ± 0.191
1.751GlnPhe: 1.751 ± 0.143
2.189GlnGly: 2.189 ± 0.167
0.897GlnHis: 0.897 ± 0.108
2.093GlnIle: 2.093 ± 0.14
1.58GlnLys: 1.58 ± 0.132
3.865GlnLeu: 3.865 ± 0.193
1.11GlnMet: 1.11 ± 0.112
1.943GlnAsn: 1.943 ± 0.157
1.196GlnPro: 1.196 ± 0.125
1.644GlnGln: 1.644 ± 0.194
2.125GlnArg: 2.125 ± 0.154
1.655GlnSer: 1.655 ± 0.16
1.858GlnThr: 1.858 ± 0.16
2.264GlnVal: 2.264 ± 0.158
0.534GlnTrp: 0.534 ± 0.07
1.527GlnTyr: 1.527 ± 0.124
0.0GlnXaa: 0.0 ± 0.0
Arg
3.385ArgAla: 3.385 ± 0.204
0.438ArgCys: 0.438 ± 0.075
3.566ArgAsp: 3.566 ± 0.214
3.15ArgGlu: 3.15 ± 0.223
2.424ArgPhe: 2.424 ± 0.178
2.829ArgGly: 2.829 ± 0.165
0.908ArgHis: 0.908 ± 0.108
3.47ArgIle: 3.47 ± 0.188
3.342ArgLys: 3.342 ± 0.212
5.007ArgLeu: 5.007 ± 0.229
1.484ArgMet: 1.484 ± 0.139
2.829ArgAsn: 2.829 ± 0.194
1.858ArgPro: 1.858 ± 0.145
1.975ArgGln: 1.975 ± 0.145
3.054ArgArg: 3.054 ± 0.177
3.011ArgSer: 3.011 ± 0.193
2.755ArgThr: 2.755 ± 0.175
4.004ArgVal: 4.004 ± 0.205
0.897ArgTrp: 0.897 ± 0.09
2.253ArgTyr: 2.253 ± 0.167
0.0ArgXaa: 0.0 ± 0.0
Ser
3.78SerAla: 3.78 ± 0.261
0.555SerCys: 0.555 ± 0.094
3.705SerAsp: 3.705 ± 0.184
3.278SerGlu: 3.278 ± 0.216
2.627SerPhe: 2.627 ± 0.195
3.812SerGly: 3.812 ± 0.198
1.11SerHis: 1.11 ± 0.126
4.281SerIle: 4.281 ± 0.234
3.737SerLys: 3.737 ± 0.203
4.869SerLeu: 4.869 ± 0.26
1.836SerMet: 1.836 ± 0.137
3.15SerAsn: 3.15 ± 0.203
2.199SerPro: 2.199 ± 0.14
1.815SerGln: 1.815 ± 0.167
3.011SerArg: 3.011 ± 0.19
3.438SerSer: 3.438 ± 0.185
3.748SerThr: 3.748 ± 0.184
4.057SerVal: 4.057 ± 0.2
0.758SerTrp: 0.758 ± 0.106
2.114SerTyr: 2.114 ± 0.136
0.0SerXaa: 0.0 ± 0.0
Thr
4.111ThrAla: 4.111 ± 0.221
0.534ThrCys: 0.534 ± 0.069
3.844ThrAsp: 3.844 ± 0.186
3.95ThrGlu: 3.95 ± 0.23
2.701ThrPhe: 2.701 ± 0.181
4.463ThrGly: 4.463 ± 0.251
1.046ThrHis: 1.046 ± 0.107
3.993ThrIle: 3.993 ± 0.21
3.118ThrLys: 3.118 ± 0.186
5.232ThrLeu: 5.232 ± 0.226
1.537ThrMet: 1.537 ± 0.125
2.99ThrAsn: 2.99 ± 0.202
2.936ThrPro: 2.936 ± 0.166
2.274ThrGln: 2.274 ± 0.174
3.075ThrArg: 3.075 ± 0.167
3.523ThrSer: 3.523 ± 0.194
3.833ThrThr: 3.833 ± 0.257
4.698ThrVal: 4.698 ± 0.223
0.993ThrTrp: 0.993 ± 0.108
2.381ThrTyr: 2.381 ± 0.157
0.0ThrXaa: 0.0 ± 0.0
Val
4.676ValAla: 4.676 ± 0.237
0.651ValCys: 0.651 ± 0.086
5.189ValAsp: 5.189 ± 0.275
5.381ValGlu: 5.381 ± 0.318
2.851ValPhe: 2.851 ± 0.176
4.303ValGly: 4.303 ± 0.222
1.441ValHis: 1.441 ± 0.141
4.965ValIle: 4.965 ± 0.229
4.794ValLys: 4.794 ± 0.266
5.392ValLeu: 5.392 ± 0.234
2.221ValMet: 2.221 ± 0.155
3.844ValAsn: 3.844 ± 0.208
2.808ValPro: 2.808 ± 0.184
2.349ValGln: 2.349 ± 0.16
3.577ValArg: 3.577 ± 0.193
4.356ValSer: 4.356 ± 0.207
4.623ValThr: 4.623 ± 0.252
4.858ValVal: 4.858 ± 0.279
0.758ValTrp: 0.758 ± 0.083
2.659ValTyr: 2.659 ± 0.179
0.0ValXaa: 0.0 ± 0.0
Trp
0.865TrpAla: 0.865 ± 0.082
0.149TrpCys: 0.149 ± 0.036
0.726TrpAsp: 0.726 ± 0.091
0.929TrpGlu: 0.929 ± 0.097
0.641TrpPhe: 0.641 ± 0.086
0.683TrpGly: 0.683 ± 0.086
0.32TrpHis: 0.32 ± 0.06
0.94TrpIle: 0.94 ± 0.099
0.918TrpLys: 0.918 ± 0.117
1.345TrpLeu: 1.345 ± 0.123
0.427TrpMet: 0.427 ± 0.066
0.694TrpAsn: 0.694 ± 0.081
0.374TrpPro: 0.374 ± 0.064
0.416TrpGln: 0.416 ± 0.061
0.705TrpArg: 0.705 ± 0.089
0.769TrpSer: 0.769 ± 0.085
0.897TrpThr: 0.897 ± 0.11
1.142TrpVal: 1.142 ± 0.109
0.16TrpTrp: 0.16 ± 0.042
0.577TrpTyr: 0.577 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.509TyrAla: 2.509 ± 0.166
0.491TyrCys: 0.491 ± 0.08
2.584TyrAsp: 2.584 ± 0.187
2.552TyrGlu: 2.552 ± 0.175
1.473TyrPhe: 1.473 ± 0.144
2.328TyrGly: 2.328 ± 0.164
1.068TyrHis: 1.068 ± 0.111
2.584TyrIle: 2.584 ± 0.187
2.36TyrLys: 2.36 ± 0.16
3.684TyrLeu: 3.684 ± 0.222
1.292TyrMet: 1.292 ± 0.124
2.221TyrAsn: 2.221 ± 0.162
1.484TyrPro: 1.484 ± 0.105
1.441TyrGln: 1.441 ± 0.124
2.381TyrArg: 2.381 ± 0.167
2.264TyrSer: 2.264 ± 0.131
2.199TyrThr: 2.199 ± 0.133
2.52TyrVal: 2.52 ± 0.172
0.502TyrTrp: 0.502 ± 0.083
1.879TyrTyr: 1.879 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 401 proteins (93661 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski