Amino acid dipeptide frequency for Acinetobacter phage Henu6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
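A minimal sketch of how such a dipeptide frequency could be computed is shown here. It is an illustration, not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, and the normalisation by the total number of dipeptides are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted, which the source page does not specify.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; windows never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: MK, KL, LA from the first sequence and AK, KL from the second.
freqs = dipeptide_frequencies(["MKLA", "AKL"])
print(freqs["KL"])  # 400.0 (2 of 5 dipeptides)
```

Counting within each protein separately avoids creating artificial pairs between the last residue of one protein and the first residue of the next.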

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
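The uniform baseline and the unit conversion can be sketched as follows. The helper names are hypothetical, and comparing each value against the plain uniform expectation (ignoring the error estimate) is an assumption; the page does not state the exact rule behind the red and blue marking or whether it uses 400 or 441 dipeptides.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a value as over- or underrepresented (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

print(expected_per_mille())   # 2.5
print(classify(5.219))        # over  (AlaAla in the table)
print(classify(0.175))        # under (CysCys in the table)
```

With 21 symbols (Xaa included), `expected_per_mille(21)` gives roughly 2.27‰ instead of 2.5‰, so a few values close to the baseline would change their label depending on which alphabet is used.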

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.219 ± 0.377
AlaCys: 0.74 ± 0.132
AlaAsp: 4.129 ± 0.334
AlaGlu: 4.674 ± 0.351
AlaPhe: 2.668 ± 0.177
AlaGly: 4.694 ± 0.469
AlaHis: 1.091 ± 0.168
AlaIle: 5.434 ± 0.318
AlaLys: 5.589 ± 0.337
AlaLeu: 5.745 ± 0.371
AlaMet: 1.578 ± 0.156
AlaAsn: 3.798 ± 0.312
AlaPro: 2.318 ± 0.213
AlaGln: 2.882 ± 0.251
AlaArg: 2.707 ± 0.241
AlaSer: 4.187 ± 0.339
AlaThr: 4.616 ± 0.571
AlaVal: 4.557 ± 0.335
AlaTrp: 1.11 ± 0.142
AlaTyr: 2.493 ± 0.198
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.74 ± 0.109
CysCys: 0.175 ± 0.067
CysAsp: 0.818 ± 0.139
CysGlu: 0.818 ± 0.138
CysPhe: 0.506 ± 0.095
CysGly: 0.643 ± 0.118
CysHis: 0.292 ± 0.079
CysIle: 0.76 ± 0.125
CysLys: 0.701 ± 0.137
CysLeu: 0.643 ± 0.125
CysMet: 0.214 ± 0.066
CysAsn: 0.545 ± 0.097
CysPro: 0.565 ± 0.115
CysGln: 0.253 ± 0.067
CysArg: 0.467 ± 0.096
CysSer: 0.76 ± 0.115
CysThr: 0.779 ± 0.133
CysVal: 0.662 ± 0.105
CysTrp: 0.117 ± 0.044
CysTyr: 0.253 ± 0.066
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.655 ± 0.317
AspCys: 0.623 ± 0.106
AspAsp: 3.564 ± 0.293
AspGlu: 4.285 ± 0.34
AspPhe: 2.902 ± 0.238
AspGly: 4.148 ± 0.282
AspHis: 1.032 ± 0.156
AspIle: 4.44 ± 0.285
AspLys: 3.992 ± 0.251
AspLeu: 5.18 ± 0.279
AspMet: 1.655 ± 0.185
AspAsn: 3.506 ± 0.244
AspPro: 2.201 ± 0.27
AspGln: 1.694 ± 0.201
AspArg: 2.279 ± 0.227
AspSer: 3.467 ± 0.25
AspThr: 3.7 ± 0.291
AspVal: 3.934 ± 0.321
AspTrp: 1.188 ± 0.165
AspTyr: 3.233 ± 0.328
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.635 ± 0.376
GluCys: 0.935 ± 0.147
GluAsp: 3.583 ± 0.299
GluGlu: 3.739 ± 0.302
GluPhe: 2.843 ± 0.253
GluGly: 3.077 ± 0.25
GluHis: 1.422 ± 0.186
GluIle: 5.356 ± 0.371
GluLys: 4.012 ± 0.264
GluLeu: 6.485 ± 0.392
GluMet: 2.006 ± 0.183
GluAsn: 3.486 ± 0.291
GluPro: 1.967 ± 0.183
GluGln: 2.123 ± 0.168
GluArg: 2.551 ± 0.246
GluSer: 3.856 ± 0.265
GluThr: 3.895 ± 0.27
GluVal: 4.343 ± 0.279
GluTrp: 0.623 ± 0.111
GluTyr: 3.33 ± 0.323
GluXaa: 0.019 ± 0.019
Phe
PheAla: 3.097 ± 0.207
PheCys: 0.37 ± 0.079
PheAsp: 3.642 ± 0.271
PheGlu: 3.233 ± 0.304
PhePhe: 1.324 ± 0.175
PheGly: 2.902 ± 0.237
PheHis: 0.467 ± 0.093
PheIle: 3.136 ± 0.242
PheLys: 3.837 ± 0.252
PheLeu: 2.902 ± 0.222
PheMet: 0.974 ± 0.153
PheAsn: 2.999 ± 0.242
PhePro: 1.071 ± 0.135
PheGln: 1.11 ± 0.13
PheArg: 1.519 ± 0.185
PheSer: 2.571 ± 0.208
PheThr: 2.688 ± 0.281
PheVal: 3.058 ± 0.229
PheTrp: 0.487 ± 0.094
PheTyr: 1.422 ± 0.183
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.148 ± 0.416
GlyCys: 0.545 ± 0.091
GlyAsp: 3.739 ± 0.345
GlyGlu: 3.506 ± 0.235
GlyPhe: 2.357 ± 0.241
GlyGly: 3.077 ± 0.379
GlyHis: 0.818 ± 0.141
GlyIle: 4.655 ± 0.338
GlyLys: 3.876 ± 0.314
GlyLeu: 4.713 ± 0.294
GlyMet: 1.5 ± 0.161
GlyAsn: 2.96 ± 0.347
GlyPro: 1.694 ± 0.2
GlyGln: 2.025 ± 0.194
GlyArg: 2.395 ± 0.238
GlySer: 3.798 ± 0.313
GlyThr: 4.655 ± 0.5
GlyVal: 3.856 ± 0.257
GlyTrp: 1.052 ± 0.149
GlyTyr: 2.707 ± 0.256
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.071 ± 0.144
HisCys: 0.234 ± 0.066
HisAsp: 0.896 ± 0.144
HisGlu: 1.091 ± 0.131
HisPhe: 0.779 ± 0.13
HisGly: 1.169 ± 0.162
HisHis: 0.409 ± 0.074
HisIle: 1.772 ± 0.209
HisLys: 1.597 ± 0.204
HisLeu: 1.461 ± 0.165
HisMet: 0.487 ± 0.094
HisAsn: 1.169 ± 0.175
HisPro: 1.071 ± 0.183
HisGln: 0.487 ± 0.088
HisArg: 0.857 ± 0.135
HisSer: 0.954 ± 0.12
HisThr: 1.207 ± 0.153
HisVal: 1.305 ± 0.172
HisTrp: 0.253 ± 0.075
HisTyr: 0.876 ± 0.127
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.453 ± 0.315
IleCys: 1.052 ± 0.158
IleAsp: 4.927 ± 0.323
IleGlu: 5.648 ± 0.365
IlePhe: 2.765 ± 0.219
IleGly: 3.778 ± 0.298
IleHis: 1.227 ± 0.144
IleIle: 5.064 ± 0.317
IleLys: 6.661 ± 0.413
IleLeu: 5.278 ± 0.361
IleMet: 1.831 ± 0.184
IleAsn: 4.246 ± 0.3
IlePro: 2.493 ± 0.178
IleGln: 3.272 ± 0.271
IleArg: 3.252 ± 0.271
IleSer: 4.771 ± 0.401
IleThr: 4.674 ± 0.304
IleVal: 4.226 ± 0.24
IleTrp: 0.643 ± 0.116
IleTyr: 2.493 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.512 ± 0.333
LysCys: 0.76 ± 0.146
LysAsp: 4.382 ± 0.306
LysGlu: 4.479 ± 0.318
LysPhe: 3.467 ± 0.243
LysGly: 4.109 ± 0.294
LysHis: 1.87 ± 0.211
LysIle: 5.064 ± 0.352
LysLys: 4.148 ± 0.33
LysLeu: 6.797 ± 0.401
LysMet: 2.298 ± 0.218
LysAsn: 4.226 ± 0.242
LysPro: 2.727 ± 0.217
LysGln: 2.376 ± 0.201
LysArg: 3.252 ± 0.208
LysSer: 4.694 ± 0.333
LysThr: 4.596 ± 0.278
LysVal: 4.849 ± 0.287
LysTrp: 0.993 ± 0.115
LysTyr: 3.233 ± 0.242
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.609 ± 0.314
LeuCys: 0.993 ± 0.154
LeuAsp: 5.219 ± 0.327
LeuGlu: 5.122 ± 0.343
LeuPhe: 3.233 ± 0.216
LeuGly: 4.012 ± 0.336
LeuHis: 1.636 ± 0.167
LeuIle: 5.278 ± 0.371
LeuLys: 6.894 ± 0.45
LeuLeu: 5.434 ± 0.312
LeuMet: 2.473 ± 0.213
LeuAsn: 5.414 ± 0.375
LeuPro: 3.116 ± 0.264
LeuGln: 2.688 ± 0.236
LeuArg: 3.525 ± 0.248
LeuSer: 5.589 ± 0.339
LeuThr: 5.141 ± 0.38
LeuVal: 4.888 ± 0.321
LeuTrp: 0.837 ± 0.127
LeuTyr: 3.077 ± 0.282
LeuXaa: 0.019 ± 0.017
Met
MetAla: 1.87 ± 0.161
MetCys: 0.312 ± 0.091
MetAsp: 1.655 ± 0.16
MetGlu: 1.539 ± 0.185
MetPhe: 1.266 ± 0.167
MetGly: 1.071 ± 0.144
MetHis: 0.623 ± 0.121
MetIle: 1.733 ± 0.172
MetLys: 2.727 ± 0.216
MetLeu: 1.948 ± 0.202
MetMet: 0.779 ± 0.147
MetAsn: 1.539 ± 0.177
MetPro: 0.993 ± 0.13
MetGln: 1.13 ± 0.147
MetArg: 1.227 ± 0.147
MetSer: 2.162 ± 0.212
MetThr: 1.792 ± 0.183
MetVal: 1.324 ± 0.185
MetTrp: 0.292 ± 0.066
MetTyr: 1.032 ± 0.137
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.479 ± 0.301
AsnCys: 0.526 ± 0.112
AsnAsp: 3.369 ± 0.289
AsnGlu: 3.447 ± 0.277
AsnPhe: 2.61 ± 0.209
AsnGly: 4.051 ± 0.342
AsnHis: 1.11 ± 0.149
AsnIle: 5.161 ± 0.363
AsnLys: 3.837 ± 0.313
AsnLeu: 5.239 ± 0.316
AsnMet: 1.558 ± 0.176
AsnAsn: 3.291 ± 0.248
AsnPro: 2.571 ± 0.26
AsnGln: 2.103 ± 0.228
AsnArg: 2.454 ± 0.218
AsnSer: 3.428 ± 0.258
AsnThr: 3.7 ± 0.374
AsnVal: 3.545 ± 0.291
AsnTrp: 0.837 ± 0.11
AsnTyr: 1.986 ± 0.204
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.727 ± 0.33
ProCys: 0.214 ± 0.063
ProAsp: 2.415 ± 0.223
ProGlu: 3.077 ± 0.276
ProPhe: 1.655 ± 0.18
ProGly: 2.532 ± 0.205
ProHis: 0.798 ± 0.119
ProIle: 2.24 ± 0.197
ProLys: 2.863 ± 0.228
ProLeu: 2.649 ± 0.247
ProMet: 0.448 ± 0.091
ProAsn: 2.279 ± 0.22
ProPro: 1.052 ± 0.147
ProGln: 1.11 ± 0.136
ProArg: 1.227 ± 0.153
ProSer: 2.298 ± 0.238
ProThr: 2.493 ± 0.303
ProVal: 2.357 ± 0.21
ProTrp: 0.448 ± 0.091
ProTyr: 1.402 ± 0.163
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.843 ± 0.268
GlnCys: 0.156 ± 0.063
GlnAsp: 1.655 ± 0.206
GlnGlu: 2.181 ± 0.211
GlnPhe: 1.655 ± 0.147
GlnGly: 1.811 ± 0.174
GlnHis: 0.701 ± 0.131
GlnIle: 2.551 ± 0.226
GlnLys: 2.142 ± 0.198
GlnLeu: 3.447 ± 0.269
GlnMet: 1.363 ± 0.188
GlnAsn: 1.928 ± 0.217
GlnPro: 1.149 ± 0.162
GlnGln: 1.383 ± 0.17
GlnArg: 1.597 ± 0.192
GlnSer: 2.22 ± 0.21
GlnThr: 2.045 ± 0.207
GlnVal: 2.434 ± 0.262
GlnTrp: 0.565 ± 0.113
GlnTyr: 1.714 ± 0.189
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.668 ± 0.227
ArgCys: 0.467 ± 0.099
ArgAsp: 2.59 ± 0.268
ArgGlu: 2.785 ± 0.247
ArgPhe: 2.084 ± 0.226
ArgGly: 2.318 ± 0.223
ArgHis: 0.837 ± 0.107
ArgIle: 3.077 ± 0.213
ArgLys: 3.428 ± 0.287
ArgLeu: 3.291 ± 0.282
ArgMet: 1.032 ± 0.125
ArgAsn: 2.376 ± 0.233
ArgPro: 1.227 ± 0.163
ArgGln: 1.733 ± 0.214
ArgArg: 1.811 ± 0.22
ArgSer: 2.532 ± 0.205
ArgThr: 2.337 ± 0.208
ArgVal: 2.727 ± 0.248
ArgTrp: 0.701 ± 0.132
ArgTyr: 1.675 ± 0.192
ArgXaa: 0.019 ± 0.022
Ser
SerAla: 3.72 ± 0.251
SerCys: 0.487 ± 0.096
SerAsp: 4.09 ± 0.326
SerGlu: 3.155 ± 0.257
SerPhe: 2.902 ± 0.209
SerGly: 3.953 ± 0.427
SerHis: 1.363 ± 0.17
SerIle: 4.927 ± 0.316
SerLys: 4.635 ± 0.328
SerLeu: 5.219 ± 0.355
SerMet: 1.811 ± 0.161
SerAsn: 4.226 ± 0.314
SerPro: 2.123 ± 0.225
SerGln: 1.967 ± 0.174
SerArg: 2.707 ± 0.241
SerSer: 3.681 ± 0.32
SerThr: 4.285 ± 0.384
SerVal: 3.817 ± 0.294
SerTrp: 0.818 ± 0.118
SerTyr: 2.629 ± 0.212
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.265 ± 0.466
ThrCys: 0.721 ± 0.113
ThrAsp: 3.369 ± 0.302
ThrGlu: 4.168 ± 0.291
ThrPhe: 3.077 ± 0.266
ThrGly: 4.168 ± 0.463
ThrHis: 1.169 ± 0.167
ThrIle: 4.752 ± 0.346
ThrLys: 3.817 ± 0.211
ThrLeu: 5.122 ± 0.346
ThrMet: 1.461 ± 0.172
ThrAsn: 3.603 ± 0.273
ThrPro: 3.077 ± 0.278
ThrGln: 2.61 ± 0.256
ThrArg: 2.454 ± 0.245
ThrSer: 3.837 ± 0.315
ThrThr: 3.798 ± 0.389
ThrVal: 4.733 ± 0.345
ThrTrp: 0.818 ± 0.124
ThrTyr: 2.629 ± 0.228
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.129 ± 0.327
ValCys: 0.818 ± 0.139
ValAsp: 3.915 ± 0.279
ValGlu: 4.226 ± 0.293
ValPhe: 2.532 ± 0.21
ValGly: 3.408 ± 0.32
ValHis: 1.285 ± 0.164
ValIle: 4.421 ± 0.248
ValLys: 5.434 ± 0.377
ValLeu: 4.752 ± 0.307
ValMet: 1.792 ± 0.198
ValAsn: 4.109 ± 0.343
ValPro: 2.61 ± 0.235
ValGln: 2.551 ± 0.234
ValArg: 2.843 ± 0.267
ValSer: 4.324 ± 0.302
ValThr: 3.856 ± 0.325
ValVal: 4.148 ± 0.304
ValTrp: 0.74 ± 0.123
ValTyr: 2.863 ± 0.266
ValXaa: 0.019 ± 0.02
Trp
TrpAla: 0.74 ± 0.113
TrpCys: 0.156 ± 0.055
TrpAsp: 0.954 ± 0.151
TrpGlu: 0.798 ± 0.124
TrpPhe: 0.467 ± 0.08
TrpGly: 0.506 ± 0.106
TrpHis: 0.331 ± 0.084
TrpIle: 0.876 ± 0.135
TrpLys: 0.896 ± 0.164
TrpLeu: 0.915 ± 0.136
TrpMet: 0.565 ± 0.102
TrpAsn: 0.662 ± 0.122
TrpPro: 0.545 ± 0.094
TrpGln: 0.448 ± 0.105
TrpArg: 0.623 ± 0.115
TrpSer: 1.071 ± 0.141
TrpThr: 0.954 ± 0.131
TrpVal: 0.935 ± 0.12
TrpTrp: 0.292 ± 0.071
TrpTyr: 0.662 ± 0.091
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.629 ± 0.231
TyrCys: 0.428 ± 0.101
TyrAsp: 2.765 ± 0.312
TyrGlu: 2.318 ± 0.248
TyrPhe: 1.694 ± 0.181
TyrGly: 2.668 ± 0.233
TyrHis: 0.662 ± 0.113
TyrIle: 3.038 ± 0.253
TyrLys: 2.746 ± 0.205
TyrLeu: 2.98 ± 0.243
TyrMet: 1.169 ± 0.139
TyrAsn: 2.941 ± 0.285
TyrPro: 1.714 ± 0.171
TyrGln: 1.597 ± 0.157
TyrArg: 1.967 ± 0.212
TyrSer: 2.298 ± 0.244
TyrThr: 2.434 ± 0.239
TyrVal: 3.077 ± 0.264
TyrTrp: 0.584 ± 0.115
TyrTyr: 1.305 ± 0.191
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.019 ± 0.017
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.019 ± 0.022
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.019 ± 0.02
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.019 ± 0.019
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 234 proteins (51348 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
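A protein-level bootstrap of this kind could look roughly like the sketch below. Only the resampling unit (whole proteins) and the number of resamples (100) come from the note; reporting the standard deviation of the resampled estimates as the error, and all function names, are assumptions made for illustration.

```python
import random
from collections import Counter
from statistics import stdev

def dipeptide_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the standard deviation of the resampled
    estimates; the source does not say which statistic is reported.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return stdev(estimates)

toy_proteome = ["MKLAAK", "AKLLM", "MAAKA", "KLMKL"]
error = bootstrap_error(toy_proteome, "KL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so the estimate reflects variation between proteins.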

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski