Amino acid dipepetide frequency for Soft-shelled turtle iridovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.561AlaAla: 10.561 ± 0.766
1.766AlaCys: 1.766 ± 0.216
4.55AlaAsp: 4.55 ± 0.444
5.739AlaGlu: 5.739 ± 0.417
2.683AlaPhe: 2.683 ± 0.323
7.403AlaGly: 7.403 ± 0.751
1.766AlaHis: 1.766 ± 0.227
2.615AlaIle: 2.615 ± 0.267
4.788AlaLys: 4.788 ± 0.426
7.403AlaLeu: 7.403 ± 0.637
3.294AlaMet: 3.294 ± 0.348
1.732AlaAsn: 1.732 ± 0.243
4.584AlaPro: 4.584 ± 0.479
2.615AlaGln: 2.615 ± 0.364
4.788AlaArg: 4.788 ± 0.553
6.656AlaSer: 6.656 ± 0.612
4.414AlaThr: 4.414 ± 0.398
8.727AlaVal: 8.727 ± 0.577
1.46AlaTrp: 1.46 ± 0.244
2.615AlaTyr: 2.615 ± 0.236
0.0AlaXaa: 0.0 ± 0.0
Cys
1.8CysAla: 1.8 ± 0.32
0.713CysCys: 0.713 ± 0.189
1.29CysAsp: 1.29 ± 0.201
0.985CysGlu: 0.985 ± 0.179
0.509CysPhe: 0.509 ± 0.134
1.528CysGly: 1.528 ± 0.252
0.441CysHis: 0.441 ± 0.147
0.747CysIle: 0.747 ± 0.165
1.324CysLys: 1.324 ± 0.214
1.596CysLeu: 1.596 ± 0.309
0.747CysMet: 0.747 ± 0.182
0.577CysAsn: 0.577 ± 0.139
1.562CysPro: 1.562 ± 0.283
0.577CysGln: 0.577 ± 0.127
1.358CysArg: 1.358 ± 0.256
1.698CysSer: 1.698 ± 0.261
0.883CysThr: 0.883 ± 0.168
1.63CysVal: 1.63 ± 0.225
0.543CysTrp: 0.543 ± 0.128
0.543CysTyr: 0.543 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
5.603AspAla: 5.603 ± 0.483
1.392AspCys: 1.392 ± 0.202
3.464AspAsp: 3.464 ± 0.366
3.158AspGlu: 3.158 ± 0.369
1.97AspPhe: 1.97 ± 0.466
5.026AspGly: 5.026 ± 0.524
0.917AspHis: 0.917 ± 0.158
2.513AspIle: 2.513 ± 0.326
2.479AspLys: 2.479 ± 0.282
5.501AspLeu: 5.501 ± 0.505
2.275AspMet: 2.275 ± 0.248
1.868AspAsn: 1.868 ± 0.291
4.313AspPro: 4.313 ± 0.412
1.426AspGln: 1.426 ± 0.246
3.667AspArg: 3.667 ± 0.372
4.584AspSer: 4.584 ± 0.432
2.683AspThr: 2.683 ± 0.341
4.652AspVal: 4.652 ± 0.319
0.951AspTrp: 0.951 ± 0.195
2.241AspTyr: 2.241 ± 0.297
0.0AspXaa: 0.0 ± 0.0
Glu
6.214GluAla: 6.214 ± 0.417
1.494GluCys: 1.494 ± 0.22
3.871GluAsp: 3.871 ± 0.399
4.041GluGlu: 4.041 ± 0.501
1.834GluPhe: 1.834 ± 0.234
3.871GluGly: 3.871 ± 0.401
0.781GluHis: 0.781 ± 0.156
1.766GluIle: 1.766 ± 0.205
2.988GluLys: 2.988 ± 0.287
3.565GluLeu: 3.565 ± 0.341
2.275GluMet: 2.275 ± 0.244
1.121GluAsn: 1.121 ± 0.221
2.92GluPro: 2.92 ± 0.32
1.766GluGln: 1.766 ± 0.336
3.633GluArg: 3.633 ± 0.393
3.769GluSer: 3.769 ± 0.387
3.769GluThr: 3.769 ± 0.307
3.735GluVal: 3.735 ± 0.418
1.29GluTrp: 1.29 ± 0.201
1.868GluTyr: 1.868 ± 0.27
0.0GluXaa: 0.0 ± 0.0
Phe
3.056PheAla: 3.056 ± 0.383
0.713PheCys: 0.713 ± 0.178
1.426PheAsp: 1.426 ± 0.228
1.902PheGlu: 1.902 ± 0.221
1.222PhePhe: 1.222 ± 0.229
2.615PheGly: 2.615 ± 0.292
0.645PheHis: 0.645 ± 0.146
1.053PheIle: 1.053 ± 0.201
1.29PheLys: 1.29 ± 0.209
3.565PheLeu: 3.565 ± 0.499
1.053PheMet: 1.053 ± 0.185
1.256PheAsn: 1.256 ± 0.183
2.003PhePro: 2.003 ± 0.241
0.679PheGln: 0.679 ± 0.147
2.309PheArg: 2.309 ± 0.344
2.615PheSer: 2.615 ± 0.252
1.902PheThr: 1.902 ± 0.231
2.818PheVal: 2.818 ± 0.325
0.272PheTrp: 0.272 ± 0.105
1.053PheTyr: 1.053 ± 0.199
0.0PheXaa: 0.0 ± 0.0
Gly
5.705GlyAla: 5.705 ± 0.49
1.732GlyCys: 1.732 ± 0.31
5.161GlyAsp: 5.161 ± 0.758
3.565GlyGlu: 3.565 ± 0.304
2.751GlyPhe: 2.751 ± 0.317
5.603GlyGly: 5.603 ± 0.59
1.562GlyHis: 1.562 ± 0.231
2.309GlyIle: 2.309 ± 0.307
4.346GlyLys: 4.346 ± 0.403
5.637GlyLeu: 5.637 ± 0.516
2.105GlyMet: 2.105 ± 0.279
1.324GlyAsn: 1.324 ± 0.245
4.618GlyPro: 4.618 ± 0.712
1.528GlyGln: 1.528 ± 0.228
5.263GlyArg: 5.263 ± 0.544
5.909GlySer: 5.909 ± 0.45
4.72GlyThr: 4.72 ± 0.434
5.433GlyVal: 5.433 ± 0.428
1.426GlyTrp: 1.426 ± 0.222
2.547GlyTyr: 2.547 ± 0.282
0.0GlyXaa: 0.0 ± 0.0
His
1.732HisAla: 1.732 ± 0.226
0.34HisCys: 0.34 ± 0.108
1.019HisAsp: 1.019 ± 0.188
0.577HisGlu: 0.577 ± 0.166
0.441HisPhe: 0.441 ± 0.131
1.528HisGly: 1.528 ± 0.209
0.509HisHis: 0.509 ± 0.169
0.747HisIle: 0.747 ± 0.149
0.815HisLys: 0.815 ± 0.177
1.902HisLeu: 1.902 ± 0.256
0.679HisMet: 0.679 ± 0.127
0.645HisAsn: 0.645 ± 0.164
1.155HisPro: 1.155 ± 0.206
0.577HisGln: 0.577 ± 0.132
1.29HisArg: 1.29 ± 0.278
1.222HisSer: 1.222 ± 0.272
1.256HisThr: 1.256 ± 0.201
1.834HisVal: 1.834 ± 0.224
0.204HisTrp: 0.204 ± 0.076
0.747HisTyr: 0.747 ± 0.177
0.0HisXaa: 0.0 ± 0.0
Ile
2.411IleAla: 2.411 ± 0.293
0.645IleCys: 0.645 ± 0.146
1.902IleAsp: 1.902 ± 0.293
1.596IleGlu: 1.596 ± 0.214
1.087IlePhe: 1.087 ± 0.19
1.562IleGly: 1.562 ± 0.222
0.917IleHis: 0.917 ± 0.216
1.019IleIle: 1.019 ± 0.184
2.479IleLys: 2.479 ± 0.254
3.294IleLeu: 3.294 ± 0.294
1.188IleMet: 1.188 ± 0.192
0.883IleAsn: 0.883 ± 0.195
2.207IlePro: 2.207 ± 0.247
0.781IleGln: 0.781 ± 0.194
2.649IleArg: 2.649 ± 0.297
2.581IleSer: 2.581 ± 0.291
1.698IleThr: 1.698 ± 0.298
2.92IleVal: 2.92 ± 0.339
0.17IleTrp: 0.17 ± 0.09
0.849IleTyr: 0.849 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
4.754LysAla: 4.754 ± 0.561
0.917LysCys: 0.917 ± 0.185
2.717LysAsp: 2.717 ± 0.33
3.022LysGlu: 3.022 ± 0.34
1.392LysPhe: 1.392 ± 0.232
4.38LysGly: 4.38 ± 0.399
0.747LysHis: 0.747 ± 0.143
2.343LysIle: 2.343 ± 0.268
3.871LysLys: 3.871 ± 0.621
4.177LysLeu: 4.177 ± 0.404
1.97LysMet: 1.97 ± 0.255
1.834LysAsn: 1.834 ± 0.271
3.362LysPro: 3.362 ± 0.504
1.46LysGln: 1.46 ± 0.288
5.399LysArg: 5.399 ± 0.948
4.482LysSer: 4.482 ± 0.986
4.007LysThr: 4.007 ± 0.341
3.599LysVal: 3.599 ± 0.344
0.611LysTrp: 0.611 ± 0.147
1.97LysTyr: 1.97 ± 0.203
0.0LysXaa: 0.0 ± 0.0
Leu
6.52LeuAla: 6.52 ± 0.478
2.003LeuCys: 2.003 ± 0.361
5.331LeuAsp: 5.331 ± 0.432
5.195LeuGlu: 5.195 ± 0.512
3.056LeuPhe: 3.056 ± 0.306
5.433LeuGly: 5.433 ± 0.531
1.63LeuHis: 1.63 ± 0.251
2.581LeuIle: 2.581 ± 0.321
5.06LeuLys: 5.06 ± 0.349
6.69LeuLeu: 6.69 ± 0.608
2.411LeuMet: 2.411 ± 0.235
2.581LeuAsn: 2.581 ± 0.278
4.38LeuPro: 4.38 ± 0.517
1.63LeuGln: 1.63 ± 0.229
6.112LeuArg: 6.112 ± 0.592
6.554LeuSer: 6.554 ± 0.58
5.161LeuThr: 5.161 ± 0.432
6.078LeuVal: 6.078 ± 0.428
0.985LeuTrp: 0.985 ± 0.175
2.037LeuTyr: 2.037 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
3.396MetAla: 3.396 ± 0.353
0.747MetCys: 0.747 ± 0.192
2.207MetAsp: 2.207 ± 0.239
2.139MetGlu: 2.139 ± 0.279
1.29MetPhe: 1.29 ± 0.206
2.717MetGly: 2.717 ± 0.285
0.747MetHis: 0.747 ± 0.186
0.645MetIle: 0.645 ± 0.16
0.917MetLys: 0.917 ± 0.144
2.037MetLeu: 2.037 ± 0.265
0.883MetMet: 0.883 ± 0.198
0.407MetAsn: 0.407 ± 0.116
1.562MetPro: 1.562 ± 0.254
0.815MetGln: 0.815 ± 0.185
2.105MetArg: 2.105 ± 0.26
3.192MetSer: 3.192 ± 0.37
2.207MetThr: 2.207 ± 0.257
2.377MetVal: 2.377 ± 0.302
0.577MetTrp: 0.577 ± 0.163
0.781MetTyr: 0.781 ± 0.213
0.0MetXaa: 0.0 ± 0.0
Asn
2.173AsnAla: 2.173 ± 0.238
0.509AsnCys: 0.509 ± 0.139
0.951AsnAsp: 0.951 ± 0.193
0.951AsnGlu: 0.951 ± 0.149
0.679AsnPhe: 0.679 ± 0.131
1.834AsnGly: 1.834 ± 0.219
0.34AsnHis: 0.34 ± 0.121
1.256AsnIle: 1.256 ± 0.276
0.985AsnLys: 0.985 ± 0.177
2.784AsnLeu: 2.784 ± 0.352
0.985AsnMet: 0.985 ± 0.194
0.781AsnAsn: 0.781 ± 0.226
2.445AsnPro: 2.445 ± 0.412
0.747AsnGln: 0.747 ± 0.141
1.528AsnArg: 1.528 ± 0.219
1.63AsnSer: 1.63 ± 0.21
1.358AsnThr: 1.358 ± 0.252
2.717AsnVal: 2.717 ± 0.345
0.475AsnTrp: 0.475 ± 0.108
0.951AsnTyr: 0.951 ± 0.151
0.0AsnXaa: 0.0 ± 0.0
Pro
6.418ProAla: 6.418 ± 0.651
0.985ProCys: 0.985 ± 0.215
4.041ProAsp: 4.041 ± 0.364
4.584ProGlu: 4.584 ± 0.468
2.037ProPhe: 2.037 ± 0.228
4.041ProGly: 4.041 ± 0.366
1.528ProHis: 1.528 ± 0.198
1.936ProIle: 1.936 ± 0.257
3.633ProLys: 3.633 ± 0.62
3.599ProLeu: 3.599 ± 0.38
1.392ProMet: 1.392 ± 0.217
1.494ProAsn: 1.494 ± 0.256
4.279ProPro: 4.279 ± 0.547
1.63ProGln: 1.63 ± 0.24
3.667ProArg: 3.667 ± 0.536
5.229ProSer: 5.229 ± 0.533
3.056ProThr: 3.056 ± 0.394
6.69ProVal: 6.69 ± 0.749
0.951ProTrp: 0.951 ± 0.215
1.596ProTyr: 1.596 ± 0.255
0.0ProXaa: 0.0 ± 0.0
Gln
2.275GlnAla: 2.275 ± 0.235
0.577GlnCys: 0.577 ± 0.144
1.766GlnAsp: 1.766 ± 0.31
1.8GlnGlu: 1.8 ± 0.3
0.679GlnPhe: 0.679 ± 0.146
1.528GlnGly: 1.528 ± 0.234
0.679GlnHis: 0.679 ± 0.284
0.985GlnIle: 0.985 ± 0.166
1.392GlnLys: 1.392 ± 0.236
1.698GlnLeu: 1.698 ± 0.298
0.747GlnMet: 0.747 ± 0.182
0.645GlnAsn: 0.645 ± 0.136
1.358GlnPro: 1.358 ± 0.306
1.766GlnGln: 1.766 ± 0.76
1.63GlnArg: 1.63 ± 0.229
1.902GlnSer: 1.902 ± 0.257
1.936GlnThr: 1.936 ± 0.308
1.902GlnVal: 1.902 ± 0.287
0.374GlnTrp: 0.374 ± 0.114
0.713GlnTyr: 0.713 ± 0.118
0.0GlnXaa: 0.0 ± 0.0
Arg
5.161ArgAla: 5.161 ± 0.445
1.087ArgCys: 1.087 ± 0.197
4.245ArgAsp: 4.245 ± 0.446
4.143ArgGlu: 4.143 ± 0.366
2.139ArgPhe: 2.139 ± 0.255
5.297ArgGly: 5.297 ± 0.411
1.426ArgHis: 1.426 ± 0.248
2.139ArgIle: 2.139 ± 0.306
4.924ArgLys: 4.924 ± 0.937
5.807ArgLeu: 5.807 ± 0.414
2.071ArgMet: 2.071 ± 0.257
2.071ArgAsn: 2.071 ± 0.385
4.55ArgPro: 4.55 ± 0.5
1.766ArgGln: 1.766 ± 0.273
5.433ArgArg: 5.433 ± 0.503
3.973ArgSer: 3.973 ± 0.512
3.803ArgThr: 3.803 ± 0.409
4.958ArgVal: 4.958 ± 0.429
0.951ArgTrp: 0.951 ± 0.216
1.936ArgTyr: 1.936 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
6.825SerAla: 6.825 ± 0.585
1.528SerCys: 1.528 ± 0.23
5.026SerAsp: 5.026 ± 0.357
4.007SerGlu: 4.007 ± 0.357
2.751SerPhe: 2.751 ± 0.27
5.569SerGly: 5.569 ± 0.483
1.528SerHis: 1.528 ± 0.262
2.241SerIle: 2.241 ± 0.232
3.633SerLys: 3.633 ± 0.435
6.723SerLeu: 6.723 ± 0.676
2.071SerMet: 2.071 ± 0.278
1.834SerAsn: 1.834 ± 0.25
6.01SerPro: 6.01 ± 1.283
1.8SerGln: 1.8 ± 0.289
4.482SerArg: 4.482 ± 0.527
6.112SerSer: 6.112 ± 0.632
3.328SerThr: 3.328 ± 0.355
6.723SerVal: 6.723 ± 0.533
1.426SerTrp: 1.426 ± 0.191
1.732SerTyr: 1.732 ± 0.235
0.0SerXaa: 0.0 ± 0.0
Thr
5.875ThrAla: 5.875 ± 0.489
1.053ThrCys: 1.053 ± 0.19
3.599ThrAsp: 3.599 ± 0.329
2.581ThrGlu: 2.581 ± 0.3
2.411ThrPhe: 2.411 ± 0.235
5.365ThrGly: 5.365 ± 0.459
0.577ThrHis: 0.577 ± 0.098
2.037ThrIle: 2.037 ± 0.199
2.784ThrLys: 2.784 ± 0.331
4.958ThrLeu: 4.958 ± 0.431
1.834ThrMet: 1.834 ± 0.228
1.019ThrAsn: 1.019 ± 0.166
4.109ThrPro: 4.109 ± 0.439
1.46ThrGln: 1.46 ± 0.24
3.43ThrArg: 3.43 ± 0.357
3.362ThrSer: 3.362 ± 0.434
2.173ThrThr: 2.173 ± 0.361
6.18ThrVal: 6.18 ± 0.423
0.577ThrTrp: 0.577 ± 0.191
1.324ThrTyr: 1.324 ± 0.225
0.0ThrXaa: 0.0 ± 0.0
Val
6.282ValAla: 6.282 ± 0.495
1.732ValCys: 1.732 ± 0.289
4.822ValAsp: 4.822 ± 0.399
4.177ValGlu: 4.177 ± 0.465
3.09ValPhe: 3.09 ± 0.299
4.516ValGly: 4.516 ± 0.499
1.834ValHis: 1.834 ± 0.269
2.309ValIle: 2.309 ± 0.269
6.723ValLys: 6.723 ± 0.854
7.029ValLeu: 7.029 ± 0.589
2.649ValMet: 2.649 ± 0.318
2.615ValAsn: 2.615 ± 0.301
4.686ValPro: 4.686 ± 0.573
2.207ValGln: 2.207 ± 0.224
6.588ValArg: 6.588 ± 0.623
6.418ValSer: 6.418 ± 0.571
5.128ValThr: 5.128 ± 0.479
6.588ValVal: 6.588 ± 0.519
1.29ValTrp: 1.29 ± 0.198
2.513ValTyr: 2.513 ± 0.279
0.0ValXaa: 0.0 ± 0.0
Trp
0.951TrpAla: 0.951 ± 0.24
0.441TrpCys: 0.441 ± 0.146
1.053TrpAsp: 1.053 ± 0.173
0.917TrpGlu: 0.917 ± 0.204
0.543TrpPhe: 0.543 ± 0.128
0.951TrpGly: 0.951 ± 0.198
0.272TrpHis: 0.272 ± 0.102
0.407TrpIle: 0.407 ± 0.131
0.883TrpLys: 0.883 ± 0.144
1.46TrpLeu: 1.46 ± 0.181
0.374TrpMet: 0.374 ± 0.096
0.611TrpAsn: 0.611 ± 0.16
0.747TrpPro: 0.747 ± 0.155
0.272TrpGln: 0.272 ± 0.088
0.951TrpArg: 0.951 ± 0.167
0.985TrpSer: 0.985 ± 0.217
1.562TrpThr: 1.562 ± 0.244
0.951TrpVal: 0.951 ± 0.18
0.204TrpTrp: 0.204 ± 0.085
0.475TrpTyr: 0.475 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.445TyrAla: 2.445 ± 0.286
0.679TyrCys: 0.679 ± 0.152
2.241TyrAsp: 2.241 ± 0.281
1.494TyrGlu: 1.494 ± 0.23
0.883TyrPhe: 0.883 ± 0.177
2.479TyrGly: 2.479 ± 0.259
0.374TyrHis: 0.374 ± 0.109
1.188TyrIle: 1.188 ± 0.215
1.698TyrLys: 1.698 ± 0.217
2.003TyrLeu: 2.003 ± 0.273
0.713TyrMet: 0.713 ± 0.151
0.781TyrAsn: 0.781 ± 0.19
1.936TyrPro: 1.936 ± 0.23
0.849TyrGln: 0.849 ± 0.161
1.596TyrArg: 1.596 ± 0.212
2.479TyrSer: 2.479 ± 0.279
1.664TyrThr: 1.664 ± 0.262
2.717TyrVal: 2.717 ± 0.301
0.272TyrTrp: 0.272 ± 0.1
0.747TyrTyr: 0.747 ± 0.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 105 proteins (29450 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski