Amino acid dipeptide frequency for Elephant endotheliotropic herpesvirus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
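
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name, the one-letter input format, and the choice to count overlapping pairs and normalise by the total number of pairs are assumptions made for the example; this page does not state how Proteome-pI computes its values.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned on its own, so no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code (hypothetical sequences, not from this proteome).
toy = ["MALLS", "LLK"]
freqs = dipeptide_per_mille(toy)
```

With this toy input there are six pairs in total and "LL" occurs twice, so its frequency is one third of all pairs.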

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
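
The arithmetic behind the uniform expectation, and a simple red/blue marking against it, can be sketched as follows. The helper names are hypothetical, and comparing each value directly with the uniform expectation (ignoring the reported error) is an assumption; the page does not say which threshold the colouring actually uses.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2

def mark(value_per_mille, expected):
    """Colour label following the red/blue convention described above."""
    if value_per_mille > expected:
        return "red"   # overrepresented
    if value_per_mille < expected:
        return "blue"  # underrepresented
    return "none"

expected = expected_per_mille()  # 2.5 per mille for 400 dipeptides

# Three values copied from the table on this page (per mille).
table_values = {"LeuLeu": 11.173, "AlaPhe": 2.529, "TrpTrp": 0.124}
marks = {pair: mark(v, expected) for pair, v in table_values.items()}

# Per mille to fraction: multiply by 10^-3.
fraction_leuleu = table_values["LeuLeu"] * 1e-3
```

Calling expected_per_mille(21) gives the corresponding expectation when Xaa is counted as a 21st residue type (1000/441, roughly 2.27‰).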

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.234 ± 0.31
AlaCys: 1.555 ± 0.194
AlaAsp: 2.239 ± 0.188
AlaGlu: 1.803 ± 0.19
AlaPhe: 2.529 ± 0.277
AlaGly: 2.259 ± 0.228
AlaHis: 1.223 ± 0.165
AlaIle: 2.487 ± 0.242
AlaLys: 1.658 ± 0.198
AlaLeu: 4.705 ± 0.368
AlaMet: 1.368 ± 0.178
AlaAsn: 2.28 ± 0.204
AlaPro: 2.529 ± 0.305
AlaGln: 1.596 ± 0.182
AlaArg: 2.135 ± 0.233
AlaSer: 4.332 ± 0.403
AlaThr: 3.586 ± 0.279
AlaVal: 3.69 ± 0.306
AlaTrp: 0.497 ± 0.105
AlaTyr: 1.783 ± 0.161
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.264 ± 0.182
CysCys: 0.643 ± 0.138
CysAsp: 1.555 ± 0.21
CysGlu: 0.912 ± 0.149
CysPhe: 1.41 ± 0.196
CysGly: 1.119 ± 0.165
CysHis: 0.643 ± 0.118
CysIle: 1.99 ± 0.199
CysLys: 1.14 ± 0.179
CysLeu: 2.55 ± 0.284
CysMet: 0.974 ± 0.145
CysAsn: 1.202 ± 0.168
CysPro: 0.746 ± 0.116
CysGln: 0.746 ± 0.133
CysArg: 1.202 ± 0.172
CysSer: 2.156 ± 0.258
CysThr: 2.094 ± 0.214
CysVal: 2.259 ± 0.244
CysTrp: 0.166 ± 0.058
CysTyr: 1.306 ± 0.143
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.363 ± 0.238
AspCys: 0.891 ± 0.154
AspAsp: 3.918 ± 0.347
AspGlu: 3.856 ± 0.311
AspPhe: 2.508 ± 0.195
AspGly: 2.798 ± 0.249
AspHis: 1.327 ± 0.169
AspIle: 3.545 ± 0.251
AspLys: 2.487 ± 0.259
AspLeu: 4.954 ± 0.361
AspMet: 1.803 ± 0.194
AspAsn: 2.674 ± 0.268
AspPro: 2.425 ± 0.251
AspGln: 1.223 ± 0.154
AspArg: 2.674 ± 0.3
AspSer: 3.793 ± 0.394
AspThr: 3.669 ± 0.244
AspVal: 4.021 ± 0.332
AspTrp: 0.477 ± 0.105
AspTyr: 1.762 ± 0.177
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.405 ± 0.231
GluCys: 1.016 ± 0.17
GluAsp: 3.254 ± 0.291
GluGlu: 3.648 ± 0.315
GluPhe: 2.052 ± 0.242
GluGly: 2.031 ± 0.312
GluHis: 1.244 ± 0.18
GluIle: 3.4 ± 0.33
GluLys: 2.715 ± 0.266
GluLeu: 4.166 ± 0.321
GluMet: 0.995 ± 0.143
GluAsn: 3.503 ± 0.263
GluPro: 1.803 ± 0.174
GluGln: 1.617 ± 0.184
GluArg: 2.446 ± 0.252
GluSer: 3.835 ± 0.356
GluThr: 4.249 ± 0.316
GluVal: 2.612 ± 0.226
GluTrp: 0.373 ± 0.08
GluTyr: 2.177 ± 0.206
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.845 ± 0.22
PheCys: 1.555 ± 0.203
PheAsp: 2.467 ± 0.245
PheGlu: 1.886 ± 0.22
PhePhe: 3.047 ± 0.299
PheGly: 2.135 ± 0.247
PheHis: 1.202 ± 0.145
PheIle: 4.042 ± 0.348
PheLys: 2.467 ± 0.224
PheLeu: 5.887 ± 0.433
PheMet: 1.513 ± 0.192
PheAsn: 2.736 ± 0.227
PhePro: 1.907 ± 0.196
PheGln: 1.327 ± 0.166
PheArg: 1.969 ± 0.225
PheSer: 4.021 ± 0.339
PheThr: 3.317 ± 0.328
PheVal: 3.275 ± 0.288
PheTrp: 0.435 ± 0.083
PheTyr: 2.28 ± 0.284
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.57 ± 0.248
GlyCys: 0.974 ± 0.152
GlyAsp: 2.57 ± 0.292
GlyGlu: 2.031 ± 0.267
GlyPhe: 1.99 ± 0.197
GlyGly: 3.337 ± 0.495
GlyHis: 1.223 ± 0.176
GlyIle: 3.151 ± 0.229
GlyLys: 1.907 ± 0.193
GlyLeu: 4.249 ± 0.293
GlyMet: 1.244 ± 0.166
GlyAsn: 2.57 ± 0.216
GlyPro: 2.031 ± 0.227
GlyGln: 1.7 ± 0.207
GlyArg: 2.425 ± 0.288
GlySer: 4.021 ± 0.414
GlyThr: 2.84 ± 0.245
GlyVal: 2.985 ± 0.261
GlyTrp: 0.332 ± 0.086
GlyTyr: 1.472 ± 0.249
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.43 ± 0.198
HisCys: 0.56 ± 0.142
HisAsp: 1.306 ± 0.149
HisGlu: 1.306 ± 0.166
HisPhe: 1.036 ± 0.153
HisGly: 1.534 ± 0.197
HisHis: 1.14 ± 0.172
HisIle: 1.907 ± 0.186
HisLys: 1.43 ± 0.195
HisLeu: 2.363 ± 0.198
HisMet: 0.767 ± 0.152
HisAsn: 1.202 ± 0.14
HisPro: 1.099 ± 0.124
HisGln: 0.85 ± 0.135
HisArg: 1.534 ± 0.184
HisSer: 1.617 ± 0.23
HisThr: 1.741 ± 0.174
HisVal: 2.467 ± 0.261
HisTrp: 0.249 ± 0.072
HisTyr: 0.974 ± 0.136
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.757 ± 0.218
IleCys: 1.783 ± 0.243
IleAsp: 3.4 ± 0.25
IleGlu: 2.736 ± 0.251
IlePhe: 3.773 ± 0.317
IleGly: 2.239 ± 0.27
IleHis: 1.803 ± 0.206
IleIle: 4.332 ± 0.328
IleLys: 2.819 ± 0.271
IleLeu: 7.317 ± 0.406
IleMet: 1.264 ± 0.142
IleAsn: 3.835 ± 0.298
IlePro: 3.006 ± 0.276
IleGln: 2.259 ± 0.241
IleArg: 2.881 ± 0.209
IleSer: 5.514 ± 0.388
IleThr: 4.726 ± 0.359
IleVal: 4.519 ± 0.34
IleTrp: 0.767 ± 0.124
IleTyr: 3.503 ± 0.366
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.741 ± 0.175
LysCys: 1.285 ± 0.194
LysAsp: 2.757 ± 0.238
LysGlu: 2.467 ± 0.212
LysPhe: 1.99 ± 0.21
LysGly: 1.472 ± 0.175
LysHis: 2.114 ± 0.219
LysIle: 3.482 ± 0.294
LysLys: 3.793 ± 0.307
LysLeu: 4.477 ± 0.319
LysMet: 1.036 ± 0.129
LysAsn: 3.337 ± 0.268
LysPro: 2.425 ± 0.31
LysGln: 1.928 ± 0.228
LysArg: 3.337 ± 0.256
LysSer: 3.814 ± 0.315
LysThr: 3.731 ± 0.326
LysVal: 2.259 ± 0.22
LysTrp: 0.58 ± 0.115
LysTyr: 2.446 ± 0.222
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.98 ± 0.306
LeuCys: 3.317 ± 0.32
LeuAsp: 4.374 ± 0.357
LeuGlu: 5.472 ± 0.359
LeuPhe: 5.618 ± 0.446
LeuGly: 3.814 ± 0.223
LeuHis: 2.425 ± 0.228
LeuIle: 5.7 ± 0.462
LeuLys: 5.659 ± 0.33
LeuLeu: 11.173 ± 0.709
LeuMet: 2.031 ± 0.213
LeuAsn: 5.514 ± 0.418
LeuPro: 3.731 ± 0.349
LeuGln: 3.192 ± 0.293
LeuArg: 4.726 ± 0.401
LeuSer: 8.395 ± 0.422
LeuThr: 5.928 ± 0.392
LeuVal: 5.783 ± 0.399
LeuTrp: 1.078 ± 0.126
LeuTyr: 5.037 ± 0.368
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.451 ± 0.141
MetCys: 0.871 ± 0.134
MetAsp: 1.347 ± 0.163
MetGlu: 1.057 ± 0.154
MetPhe: 1.949 ± 0.242
MetGly: 1.078 ± 0.141
MetHis: 0.601 ± 0.136
MetIle: 1.638 ± 0.174
MetLys: 1.223 ± 0.179
MetLeu: 2.612 ± 0.289
MetMet: 0.85 ± 0.117
MetAsn: 1.036 ± 0.173
MetPro: 0.891 ± 0.106
MetGln: 0.788 ± 0.131
MetArg: 0.995 ± 0.171
MetSer: 2.301 ± 0.184
MetThr: 1.783 ± 0.175
MetVal: 1.389 ± 0.18
MetTrp: 0.249 ± 0.065
MetTyr: 1.555 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.192 ± 0.31
AsnCys: 1.036 ± 0.139
AsnAsp: 3.089 ± 0.224
AsnGlu: 2.446 ± 0.247
AsnPhe: 2.715 ± 0.218
AsnGly: 2.259 ± 0.218
AsnHis: 1.119 ± 0.147
AsnIle: 4.602 ± 0.358
AsnLys: 2.757 ± 0.199
AsnLeu: 4.768 ± 0.264
AsnMet: 1.72 ± 0.193
AsnAsn: 3.773 ± 0.418
AsnPro: 1.949 ± 0.185
AsnGln: 1.658 ± 0.19
AsnArg: 2.591 ± 0.275
AsnSer: 3.856 ± 0.278
AsnThr: 5.327 ± 0.36
AsnVal: 4.457 ± 0.318
AsnTrp: 0.518 ± 0.091
AsnTyr: 2.011 ± 0.237
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.866 ± 0.225
ProCys: 1.036 ± 0.153
ProAsp: 2.135 ± 0.208
ProGlu: 2.156 ± 0.223
ProPhe: 2.011 ± 0.195
ProGly: 2.031 ± 0.244
ProHis: 1.264 ± 0.178
ProIle: 2.943 ± 0.226
ProLys: 1.949 ± 0.213
ProLeu: 4.395 ± 0.308
ProMet: 1.036 ± 0.161
ProAsn: 2.177 ± 0.215
ProPro: 3.545 ± 0.475
ProGln: 1.41 ± 0.214
ProArg: 1.99 ± 0.213
ProSer: 4.395 ± 0.485
ProThr: 2.943 ± 0.416
ProVal: 3.835 ± 0.393
ProTrp: 0.415 ± 0.098
ProTyr: 1.886 ± 0.167
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.7 ± 0.239
GlnCys: 0.808 ± 0.122
GlnAsp: 1.368 ± 0.241
GlnGlu: 1.72 ± 0.208
GlnPhe: 1.14 ± 0.168
GlnGly: 1.472 ± 0.187
GlnHis: 1.036 ± 0.152
GlnIle: 1.72 ± 0.225
GlnLys: 2.011 ± 0.18
GlnLeu: 2.881 ± 0.223
GlnMet: 0.912 ± 0.137
GlnAsn: 1.803 ± 0.162
GlnPro: 1.762 ± 0.309
GlnGln: 1.72 ± 0.367
GlnArg: 1.866 ± 0.277
GlnSer: 2.653 ± 0.311
GlnThr: 2.591 ± 0.324
GlnVal: 1.783 ± 0.218
GlnTrp: 0.249 ± 0.073
GlnTyr: 1.347 ± 0.185
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.301 ± 0.268
ArgCys: 0.995 ± 0.137
ArgAsp: 3.172 ± 0.245
ArgGlu: 2.591 ± 0.238
ArgPhe: 2.031 ± 0.247
ArgGly: 2.591 ± 0.28
ArgHis: 1.7 ± 0.224
ArgIle: 2.218 ± 0.18
ArgLys: 3.047 ± 0.342
ArgLeu: 4.395 ± 0.326
ArgMet: 1.43 ± 0.175
ArgAsn: 2.861 ± 0.249
ArgPro: 2.653 ± 0.301
ArgGln: 1.845 ± 0.205
ArgArg: 3.648 ± 0.382
ArgSer: 3.669 ± 0.379
ArgThr: 3.317 ± 0.256
ArgVal: 2.985 ± 0.258
ArgTrp: 0.56 ± 0.118
ArgTyr: 2.28 ± 0.196
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.332 ± 0.395
SerCys: 2.094 ± 0.233
SerAsp: 4.954 ± 0.413
SerGlu: 4.001 ± 0.318
SerPhe: 3.565 ± 0.262
SerGly: 4.83 ± 0.45
SerHis: 1.783 ± 0.19
SerIle: 5.472 ± 0.337
SerLys: 4.021 ± 0.29
SerLeu: 7.421 ± 0.463
SerMet: 1.824 ± 0.226
SerAsn: 4.312 ± 0.296
SerPro: 3.669 ± 0.279
SerGln: 2.55 ± 0.387
SerArg: 4.581 ± 0.401
SerSer: 10.136 ± 1.136
SerThr: 6.509 ± 0.72
SerVal: 5.928 ± 0.43
SerTrp: 0.601 ± 0.129
SerTyr: 2.798 ± 0.285
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.358 ± 0.254
ThrCys: 2.073 ± 0.225
ThrAsp: 3.379 ± 0.232
ThrGlu: 3.773 ± 0.27
ThrPhe: 3.13 ± 0.299
ThrGly: 3.503 ± 0.248
ThrHis: 1.824 ± 0.191
ThrIle: 4.374 ± 0.337
ThrLys: 3.254 ± 0.234
ThrLeu: 6.799 ± 0.403
ThrMet: 1.41 ± 0.188
ThrAsn: 4.001 ± 0.352
ThrPro: 4.208 ± 0.375
ThrGln: 2.384 ± 0.319
ThrArg: 3.296 ± 0.273
ThrSer: 6.675 ± 0.611
ThrThr: 6.986 ± 1.544
ThrVal: 6.094 ± 0.354
ThrTrp: 0.933 ± 0.151
ThrTyr: 2.612 ± 0.286
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.586 ± 0.286
ValCys: 2.135 ± 0.254
ValAsp: 3.213 ± 0.239
ValGlu: 3.068 ± 0.248
ValPhe: 4.249 ± 0.281
ValGly: 2.674 ± 0.212
ValHis: 1.803 ± 0.217
ValIle: 4.146 ± 0.313
ValLys: 3.275 ± 0.266
ValLeu: 6.447 ± 0.411
ValMet: 1.762 ± 0.206
ValAsn: 3.938 ± 0.279
ValPro: 3.358 ± 0.284
ValGln: 2.177 ± 0.232
ValArg: 3.172 ± 0.235
ValSer: 6.509 ± 0.387
ValThr: 5.555 ± 0.383
ValVal: 4.084 ± 0.307
ValTrp: 0.394 ± 0.107
ValTyr: 3.006 ± 0.262
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.249 ± 0.081
TrpCys: 0.311 ± 0.073
TrpAsp: 0.269 ± 0.07
TrpGlu: 0.477 ± 0.108
TrpPhe: 0.643 ± 0.133
TrpGly: 0.415 ± 0.086
TrpHis: 0.249 ± 0.06
TrpIle: 0.601 ± 0.12
TrpLys: 0.56 ± 0.108
TrpLeu: 1.099 ± 0.173
TrpMet: 0.207 ± 0.066
TrpAsn: 0.705 ± 0.127
TrpPro: 0.415 ± 0.099
TrpGln: 0.332 ± 0.086
TrpArg: 0.415 ± 0.088
TrpSer: 0.767 ± 0.114
TrpThr: 0.601 ± 0.092
TrpVal: 0.601 ± 0.122
TrpTrp: 0.124 ± 0.048
TrpTyr: 0.352 ± 0.086
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.866 ± 0.171
TyrCys: 1.14 ± 0.155
TyrAsp: 2.405 ± 0.263
TyrGlu: 2.135 ± 0.246
TyrPhe: 1.928 ± 0.196
TyrGly: 2.031 ± 0.18
TyrHis: 0.788 ± 0.129
TyrIle: 3.482 ± 0.307
TyrLys: 2.301 ± 0.213
TyrLeu: 4.312 ± 0.335
TyrMet: 1.492 ± 0.213
TyrAsn: 2.384 ± 0.281
TyrPro: 1.264 ± 0.172
TyrGln: 1.244 ± 0.166
TyrArg: 2.405 ± 0.261
TyrSer: 2.923 ± 0.286
TyrThr: 2.57 ± 0.21
TyrVal: 3.524 ± 0.31
TyrTrp: 0.394 ± 0.105
TyrTyr: 1.886 ± 0.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 115 proteins (48243 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
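
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the "error" are all assumptions for illustration; the note above only says that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not taken from this page.
toy_proteome = ["MALLSLLK", "MKTAYIAK", "MLLLLGA", "MSSTTVLL"]
err = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.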

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski