Amino acid dipeptide frequency for Sinorhizobium phage phiN3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
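As a rough illustration of what such a calculation can look like, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the constant `AMINO_ACIDS`, and the toy sequences are assumptions made for this example. The exact counting and normalisation used by the database are not stated on this page, so dividing by the total number of counted dipeptides is also an assumption.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 21 x 21 = 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteome = ["MAAK", "MAKA"]
freqs = dipeptide_frequencies(toy_proteome)
print(len(freqs), round(freqs["MA"], 1))
```

In this toy input six dipeptides are counted in total, two of which are "MA", so the reported frequencies sum to 1000‰ across the 441 entries.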

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
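The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and comparing against exactly 1/400 is an assumption based on the description above; the page does not state the precise thresholds behind its colouring. The three observed values are copied from the table (AlaAla, CysCys, ProThr).

```python
STANDARD_PAIRS = 20 * 20                      # 400 dipeptides of standard residues
UNIFORM_PER_MILLE = 1000.0 / STANDARD_PAIRS   # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Mean values in per mille, taken from the table on this page.
observed = {"AlaAla": 6.677, "CysCys": 0.141, "ProThr": 2.571}

for name, value in observed.items():
    ratio = value / UNIFORM_PER_MILLE  # observed / expected
    print(name, per_mille_to_fraction(value), round(ratio, 2), classify(value))
```

A stricter rule could also take the reported bootstrap error into account, for example by calling a dipeptide over- or underrepresented only when the expectation lies outside the error interval.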

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.677 ± 0.387 | 0.799 ± 0.101 | 4.216 ± 0.245 | 5.251 ± 0.347 | 3.119 ± 0.191 | 4.075 ± 0.313 | 1.301 ± 0.141 | 4.843 ± 0.308 | 5.094 ± 0.336 | 6.818 ± 0.404 | 2.32 ± 0.195 | 3.731 ± 0.333 | 2.398 ± 0.213 | 2.21 ± 0.171 | 4.216 ± 0.271 | 4.091 ± 0.345 | 4.64 ± 0.335 | 4.671 ± 0.311 | 1.035 ± 0.14 | 2.884 ± 0.212 | 0.0 ± 0.0 |
| Cys | 0.893 ± 0.121 | 0.141 ± 0.059 | 0.878 ± 0.126 | 0.658 ± 0.105 | 0.439 ± 0.099 | 0.752 ± 0.113 | 0.282 ± 0.062 | 0.423 ± 0.089 | 0.392 ± 0.086 | 0.658 ± 0.1 | 0.219 ± 0.06 | 0.361 ± 0.08 | 0.376 ± 0.079 | 0.408 ± 0.083 | 0.486 ± 0.082 | 0.674 ± 0.096 | 0.423 ± 0.086 | 0.658 ± 0.112 | 0.157 ± 0.047 | 0.408 ± 0.08 | 0.0 ± 0.0 |
| Asp | 4.702 ± 0.268 | 0.69 ± 0.128 | 4.248 ± 0.301 | 4.687 ± 0.258 | 3.652 ± 0.266 | 5.047 ± 0.293 | 1.364 ± 0.134 | 3.919 ± 0.256 | 3.464 ± 0.289 | 4.843 ± 0.275 | 2.069 ± 0.19 | 2.806 ± 0.25 | 2.868 ± 0.185 | 1.677 ± 0.139 | 3.527 ± 0.182 | 3.072 ± 0.225 | 3.684 ± 0.258 | 4.953 ± 0.279 | 1.16 ± 0.135 | 3.041 ± 0.226 | 0.0 ± 0.0 |
| Glu | 5.549 ± 0.354 | 0.737 ± 0.116 | 4.31 ± 0.296 | 5.878 ± 0.339 | 3.48 ± 0.237 | 4.358 ± 0.256 | 1.395 ± 0.147 | 5.251 ± 0.282 | 4.185 ± 0.292 | 6.144 ± 0.318 | 2.257 ± 0.18 | 3.589 ± 0.223 | 1.959 ± 0.195 | 2.383 ± 0.205 | 4.436 ± 0.314 | 2.994 ± 0.253 | 4.812 ± 0.279 | 4.169 ± 0.259 | 1.301 ± 0.158 | 3.292 ± 0.25 | 0.0 ± 0.0 |
| Phe | 3.292 ± 0.232 | 0.737 ± 0.089 | 4.091 ± 0.28 | 3.919 ± 0.226 | 1.944 ± 0.19 | 3.37 ± 0.26 | 0.831 ± 0.115 | 2.524 ± 0.188 | 3.276 ± 0.228 | 3.652 ± 0.214 | 1.207 ± 0.148 | 2.194 ± 0.199 | 1.646 ± 0.182 | 1.63 ± 0.158 | 1.928 ± 0.198 | 2.696 ± 0.24 | 2.837 ± 0.214 | 3.386 ± 0.227 | 0.439 ± 0.095 | 1.818 ± 0.151 | 0.0 ± 0.0 |
| Gly | 4.389 ± 0.299 | 0.768 ± 0.132 | 4.31 ± 0.312 | 3.919 ± 0.242 | 3.276 ± 0.205 | 4.122 ± 0.257 | 1.05 ± 0.119 | 3.934 ± 0.201 | 4.624 ± 0.295 | 4.013 ± 0.239 | 2.147 ± 0.153 | 3.354 ± 0.23 | 1.411 ± 0.158 | 1.176 ± 0.124 | 3.527 ± 0.231 | 4.248 ± 0.27 | 4.232 ± 0.387 | 4.185 ± 0.284 | 1.364 ± 0.15 | 2.884 ± 0.236 | 0.0 ± 0.0 |
| His | 1.207 ± 0.122 | 0.298 ± 0.064 | 1.426 ± 0.177 | 1.52 ± 0.18 | 0.987 ± 0.114 | 1.458 ± 0.151 | 0.705 ± 0.115 | 1.395 ± 0.136 | 1.238 ± 0.146 | 1.395 ± 0.155 | 0.47 ± 0.089 | 0.925 ± 0.113 | 0.987 ± 0.118 | 0.47 ± 0.092 | 0.987 ± 0.129 | 0.925 ± 0.106 | 1.082 ± 0.206 | 1.379 ± 0.116 | 0.298 ± 0.089 | 1.066 ± 0.109 | 0.0 ± 0.0 |
| Ile | 5.831 ± 0.344 | 0.643 ± 0.093 | 5.204 ± 0.287 | 5.988 ± 0.299 | 2.194 ± 0.206 | 3.778 ± 0.275 | 1.27 ± 0.131 | 3.997 ± 0.288 | 3.919 ± 0.237 | 4.608 ± 0.275 | 1.426 ± 0.142 | 3.292 ± 0.228 | 2.351 ± 0.186 | 2.147 ± 0.179 | 3.213 ± 0.248 | 3.715 ± 0.256 | 3.872 ± 0.286 | 4.248 ± 0.264 | 0.643 ± 0.11 | 2.069 ± 0.168 | 0.0 ± 0.0 |
| Lys | 5.032 ± 0.36 | 0.376 ± 0.081 | 3.84 ± 0.258 | 4.718 ± 0.284 | 3.229 ± 0.218 | 3.292 ± 0.239 | 1.442 ± 0.168 | 4.906 ± 0.319 | 4.593 ± 0.32 | 5.126 ± 0.325 | 2.461 ± 0.211 | 3.166 ± 0.197 | 2.226 ± 0.2 | 2.241 ± 0.212 | 3.433 ± 0.264 | 3.825 ± 0.255 | 4.232 ± 0.255 | 4.232 ± 0.267 | 0.893 ± 0.131 | 2.696 ± 0.215 | 0.0 ± 0.0 |
| Leu | 5.533 ± 0.301 | 0.705 ± 0.12 | 4.969 ± 0.245 | 5.063 ± 0.3 | 3.213 ± 0.212 | 3.746 ± 0.285 | 1.599 ± 0.168 | 4.937 ± 0.298 | 5.972 ± 0.319 | 5.267 ± 0.331 | 2.132 ± 0.179 | 4.06 ± 0.248 | 3.354 ± 0.249 | 2.179 ± 0.202 | 4.06 ± 0.224 | 4.765 ± 0.294 | 4.702 ± 0.271 | 5.126 ± 0.263 | 0.925 ± 0.111 | 2.915 ± 0.234 | 0.0 ± 0.0 |
| Met | 1.975 ± 0.198 | 0.235 ± 0.059 | 1.27 ± 0.131 | 1.756 ± 0.174 | 1.426 ± 0.157 | 1.113 ± 0.142 | 0.298 ± 0.069 | 2.085 ± 0.179 | 2.383 ± 0.198 | 2.069 ± 0.202 | 0.799 ± 0.115 | 1.991 ± 0.193 | 1.066 ± 0.154 | 0.392 ± 0.071 | 1.52 ± 0.157 | 2.383 ± 0.193 | 2.774 ± 0.206 | 1.552 ± 0.173 | 0.313 ± 0.056 | 1.019 ± 0.112 | 0.0 ± 0.0 |
| Asn | 3.558 ± 0.232 | 0.423 ± 0.073 | 3.041 ± 0.215 | 3.276 ± 0.243 | 2.1 ± 0.181 | 3.825 ± 0.264 | 0.925 ± 0.136 | 3.307 ± 0.262 | 2.461 ± 0.192 | 3.903 ± 0.267 | 1.223 ± 0.139 | 2.555 ± 0.259 | 2.665 ± 0.224 | 1.364 ± 0.15 | 2.743 ± 0.233 | 3.119 ± 0.285 | 2.931 ± 0.386 | 2.931 ± 0.242 | 0.658 ± 0.107 | 2.132 ± 0.192 | 0.0 ± 0.0 |
| Pro | 2.743 ± 0.215 | 0.361 ± 0.076 | 2.931 ± 0.207 | 3.26 ± 0.268 | 1.818 ± 0.147 | 2.053 ± 0.177 | 0.987 ± 0.144 | 2.006 ± 0.173 | 2.226 ± 0.164 | 2.273 ± 0.19 | 0.784 ± 0.13 | 1.458 ± 0.15 | 1.191 ± 0.146 | 0.956 ± 0.121 | 1.301 ± 0.132 | 2.288 ± 0.185 | 2.571 ± 0.194 | 3.057 ± 0.202 | 0.643 ± 0.091 | 1.536 ± 0.157 | 0.0 ± 0.0 |
| Gln | 2.053 ± 0.196 | 0.219 ± 0.064 | 1.583 ± 0.147 | 1.677 ± 0.153 | 1.693 ± 0.156 | 1.85 ± 0.173 | 0.486 ± 0.076 | 1.944 ± 0.143 | 1.944 ± 0.192 | 2.414 ± 0.186 | 0.893 ± 0.139 | 1.379 ± 0.144 | 0.94 ± 0.111 | 0.94 ± 0.135 | 1.756 ± 0.164 | 1.677 ± 0.158 | 1.63 ± 0.164 | 1.803 ± 0.161 | 0.502 ± 0.082 | 1.129 ± 0.168 | 0.0 ± 0.0 |
| Arg | 3.511 ± 0.258 | 0.329 ± 0.066 | 3.307 ± 0.237 | 4.169 ± 0.287 | 2.649 ± 0.183 | 3.245 ± 0.217 | 1.317 ± 0.163 | 3.292 ± 0.238 | 4.828 ± 0.396 | 3.793 ± 0.223 | 1.693 ± 0.165 | 2.79 ± 0.218 | 1.661 ± 0.166 | 1.614 ± 0.177 | 3.198 ± 0.237 | 2.618 ± 0.207 | 2.618 ± 0.194 | 3.135 ± 0.225 | 1.082 ± 0.132 | 2.367 ± 0.196 | 0.0 ± 0.0 |
| Ser | 4.169 ± 0.264 | 0.549 ± 0.08 | 4.138 ± 0.258 | 3.95 ± 0.261 | 2.947 ± 0.261 | 4.577 ± 0.394 | 1.082 ± 0.13 | 3.542 ± 0.197 | 4.091 ± 0.277 | 4.671 ± 0.229 | 1.536 ± 0.146 | 2.837 ± 0.259 | 1.897 ± 0.182 | 1.756 ± 0.148 | 2.931 ± 0.196 | 4.122 ± 0.398 | 3.511 ± 0.274 | 3.825 ± 0.278 | 0.658 ± 0.107 | 2.524 ± 0.212 | 0.0 ± 0.0 |
| Thr | 4.279 ± 0.329 | 0.455 ± 0.082 | 3.511 ± 0.253 | 3.934 ± 0.278 | 3.417 ± 0.259 | 4.734 ± 0.382 | 1.301 ± 0.19 | 4.499 ± 0.306 | 3.778 ± 0.238 | 4.452 ± 0.291 | 1.442 ± 0.136 | 2.759 ± 0.251 | 2.962 ± 0.209 | 1.458 ± 0.163 | 3.088 ± 0.2 | 4.201 ± 0.435 | 4.765 ± 0.583 | 4.718 ± 0.368 | 0.784 ± 0.117 | 2.649 ± 0.219 | 0.0 ± 0.0 |
| Val | 5.329 ± 0.306 | 0.674 ± 0.12 | 4.64 ± 0.254 | 5.094 ± 0.251 | 2.806 ± 0.205 | 3.825 ± 0.278 | 1.129 ± 0.136 | 3.872 ± 0.273 | 4.248 ± 0.251 | 5.173 ± 0.306 | 2.116 ± 0.184 | 2.962 ± 0.221 | 2.602 ± 0.199 | 1.818 ± 0.163 | 3.307 ± 0.218 | 4.467 ± 0.313 | 4.546 ± 0.308 | 5.11 ± 0.375 | 0.94 ± 0.135 | 2.571 ± 0.186 | 0.0 ± 0.0 |
| Trp | 0.893 ± 0.128 | 0.11 ± 0.045 | 0.768 ± 0.111 | 1.05 ± 0.146 | 0.909 ± 0.118 | 0.705 ± 0.093 | 0.439 ± 0.093 | 0.909 ± 0.122 | 0.893 ± 0.136 | 1.082 ± 0.125 | 0.376 ± 0.073 | 0.799 ± 0.093 | 0.125 ± 0.058 | 0.611 ± 0.088 | 0.956 ± 0.112 | 0.987 ± 0.113 | 1.191 ± 0.148 | 0.862 ± 0.111 | 0.345 ± 0.074 | 0.799 ± 0.136 | 0.0 ± 0.0 |
| Tyr | 2.774 ± 0.187 | 0.439 ± 0.08 | 2.79 ± 0.194 | 2.712 ± 0.211 | 2.116 ± 0.181 | 3.009 ± 0.207 | 1.019 ± 0.118 | 2.821 ± 0.186 | 2.367 ± 0.253 | 2.868 ± 0.232 | 0.831 ± 0.1 | 2.132 ± 0.171 | 1.756 ± 0.172 | 1.082 ± 0.13 | 2.602 ± 0.186 | 2.445 ± 0.186 | 2.132 ± 0.168 | 3.229 ± 0.252 | 0.674 ± 0.097 | 1.756 ± 0.169 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 402 proteins (63799 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
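A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The page only states that bootstrapping was done 100 times at the protein level; the use of the standard deviation as the error measure, the function names `dipeptide_per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences are all assumptions made for this example.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteome = ["MAAKL", "MKAAL", "MLLKA", "MAKAA"]
mean_value = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean_value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps the pairs inside each protein intact, so the estimate reflects variation between proteins.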

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski