Amino acid dipepetide frequency for Grouper iridovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.471AlaAla: 8.471 ± 0.695
1.592AlaCys: 1.592 ± 0.253
4.975AlaAsp: 4.975 ± 1.269
6.737AlaGlu: 6.737 ± 1.518
2.587AlaPhe: 2.587 ± 0.272
4.747AlaGly: 4.747 ± 0.548
1.62AlaHis: 1.62 ± 0.216
3.468AlaIle: 3.468 ± 0.304
4.804AlaLys: 4.804 ± 0.464
6.851AlaLeu: 6.851 ± 0.503
1.762AlaMet: 1.762 ± 0.231
3.241AlaAsn: 3.241 ± 0.414
4.378AlaPro: 4.378 ± 0.659
3.411AlaGln: 3.411 ± 0.376
4.15AlaArg: 4.15 ± 0.44
5.799AlaSer: 5.799 ± 1.485
4.662AlaThr: 4.662 ± 1.064
6.993AlaVal: 6.993 ± 0.481
1.165AlaTrp: 1.165 ± 0.163
2.672AlaTyr: 2.672 ± 0.247
0.0AlaXaa: 0.0 ± 0.0
Cys
2.189CysAla: 2.189 ± 0.314
0.853CysCys: 0.853 ± 0.179
1.45CysAsp: 1.45 ± 0.231
1.45CysGlu: 1.45 ± 0.232
0.739CysPhe: 0.739 ± 0.149
1.848CysGly: 1.848 ± 0.299
0.597CysHis: 0.597 ± 0.133
0.768CysIle: 0.768 ± 0.166
1.905CysLys: 1.905 ± 0.273
1.563CysLeu: 1.563 ± 0.257
0.682CysMet: 0.682 ± 0.131
0.853CysAsn: 0.853 ± 0.158
1.563CysPro: 1.563 ± 0.276
0.512CysGln: 0.512 ± 0.126
1.08CysArg: 1.08 ± 0.164
1.194CysSer: 1.194 ± 0.236
0.881CysThr: 0.881 ± 0.187
1.706CysVal: 1.706 ± 0.25
0.426CysTrp: 0.426 ± 0.123
0.711CysTyr: 0.711 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
4.946AspAla: 4.946 ± 0.498
1.364AspCys: 1.364 ± 0.246
3.07AspAsp: 3.07 ± 0.379
3.923AspGlu: 3.923 ± 0.379
2.558AspPhe: 2.558 ± 0.283
3.127AspGly: 3.127 ± 0.264
0.768AspHis: 0.768 ± 0.161
2.786AspIle: 2.786 ± 0.246
3.241AspLys: 3.241 ± 0.376
4.861AspLeu: 4.861 ± 0.468
1.819AspMet: 1.819 ± 0.174
1.791AspAsn: 1.791 ± 0.225
3.07AspPro: 3.07 ± 0.357
2.445AspGln: 2.445 ± 0.975
2.843AspArg: 2.843 ± 0.258
3.354AspSer: 3.354 ± 0.339
2.899AspThr: 2.899 ± 0.283
3.383AspVal: 3.383 ± 0.398
0.682AspTrp: 0.682 ± 0.187
2.104AspTyr: 2.104 ± 0.269
0.0AspXaa: 0.0 ± 0.0
Glu
5.941GluAla: 5.941 ± 2.516
1.336GluCys: 1.336 ± 0.308
3.468GluAsp: 3.468 ± 0.308
5.088GluGlu: 5.088 ± 1.335
1.99GluPhe: 1.99 ± 0.266
2.956GluGly: 2.956 ± 0.315
1.194GluHis: 1.194 ± 0.19
3.496GluIle: 3.496 ± 0.389
4.321GluLys: 4.321 ± 0.392
4.975GluLeu: 4.975 ± 0.432
2.018GluMet: 2.018 ± 0.251
3.013GluAsn: 3.013 ± 0.291
3.013GluPro: 3.013 ± 0.355
2.018GluGln: 2.018 ± 0.262
4.69GluArg: 4.69 ± 0.728
3.155GluSer: 3.155 ± 0.312
5.202GluThr: 5.202 ± 0.385
2.615GluVal: 2.615 ± 0.326
0.938GluTrp: 0.938 ± 0.162
2.189GluTyr: 2.189 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
2.7PheAla: 2.7 ± 0.28
0.597PheCys: 0.597 ± 0.116
1.819PheAsp: 1.819 ± 0.24
1.507PheGlu: 1.507 ± 0.224
1.336PhePhe: 1.336 ± 0.197
2.445PheGly: 2.445 ± 0.306
0.597PheHis: 0.597 ± 0.143
1.592PheIle: 1.592 ± 0.23
2.672PheLys: 2.672 ± 0.324
2.843PheLeu: 2.843 ± 0.337
1.364PheMet: 1.364 ± 0.154
1.421PheAsn: 1.421 ± 0.207
1.649PhePro: 1.649 ± 0.215
0.625PheGln: 0.625 ± 0.125
1.706PheArg: 1.706 ± 0.226
3.127PheSer: 3.127 ± 0.324
2.445PheThr: 2.445 ± 0.323
2.473PheVal: 2.473 ± 0.241
0.398PheTrp: 0.398 ± 0.092
1.279PheTyr: 1.279 ± 0.221
0.0PheXaa: 0.0 ± 0.0
Gly
4.321GlyAla: 4.321 ± 0.525
1.023GlyCys: 1.023 ± 0.17
3.951GlyAsp: 3.951 ± 0.437
3.752GlyGlu: 3.752 ± 0.442
2.331GlyPhe: 2.331 ± 0.244
4.093GlyGly: 4.093 ± 0.415
1.08GlyHis: 1.08 ± 0.158
3.098GlyIle: 3.098 ± 0.299
3.838GlyLys: 3.838 ± 0.35
4.434GlyLeu: 4.434 ± 0.466
1.45GlyMet: 1.45 ± 0.213
2.132GlyAsn: 2.132 ± 0.282
6.908GlyPro: 6.908 ± 1.784
1.819GlyGln: 1.819 ± 0.267
3.44GlyArg: 3.44 ± 0.365
3.866GlySer: 3.866 ± 0.388
3.44GlyThr: 3.44 ± 0.366
3.923GlyVal: 3.923 ± 0.374
0.938GlyTrp: 0.938 ± 0.181
1.961GlyTyr: 1.961 ± 0.234
0.0GlyXaa: 0.0 ± 0.0
His
1.62HisAla: 1.62 ± 0.233
0.455HisCys: 0.455 ± 0.132
0.966HisAsp: 0.966 ± 0.169
1.052HisGlu: 1.052 ± 0.19
0.569HisPhe: 0.569 ± 0.132
1.336HisGly: 1.336 ± 0.216
0.682HisHis: 0.682 ± 0.133
1.109HisIle: 1.109 ± 0.175
1.052HisLys: 1.052 ± 0.155
1.99HisLeu: 1.99 ± 0.274
0.37HisMet: 0.37 ± 0.107
0.654HisAsn: 0.654 ± 0.154
1.251HisPro: 1.251 ± 0.197
0.796HisGln: 0.796 ± 0.154
0.995HisArg: 0.995 ± 0.174
1.194HisSer: 1.194 ± 0.197
1.251HisThr: 1.251 ± 0.206
1.592HisVal: 1.592 ± 0.21
0.455HisTrp: 0.455 ± 0.101
0.597HisTyr: 0.597 ± 0.102
0.0HisXaa: 0.0 ± 0.0
Ile
3.297IleAla: 3.297 ± 0.308
0.796IleCys: 0.796 ± 0.161
2.757IleAsp: 2.757 ± 0.334
2.445IleGlu: 2.445 ± 0.306
2.047IlePhe: 2.047 ± 0.247
2.132IleGly: 2.132 ± 0.253
1.109IleHis: 1.109 ± 0.217
1.905IleIle: 1.905 ± 0.294
3.44IleLys: 3.44 ± 0.485
3.894IleLeu: 3.894 ± 0.369
1.478IleMet: 1.478 ± 0.174
2.274IleAsn: 2.274 ± 0.254
3.042IlePro: 3.042 ± 0.306
1.279IleGln: 1.279 ± 0.189
2.587IleArg: 2.587 ± 0.334
3.07IleSer: 3.07 ± 0.27
3.07IleThr: 3.07 ± 0.311
4.122IleVal: 4.122 ± 0.395
0.512IleTrp: 0.512 ± 0.145
1.308IleTyr: 1.308 ± 0.199
0.0IleXaa: 0.0 ± 0.0
Lys
6.623LysAla: 6.623 ± 2.709
1.762LysCys: 1.762 ± 0.289
3.297LysAsp: 3.297 ± 0.319
3.496LysGlu: 3.496 ± 0.39
1.933LysPhe: 1.933 ± 0.216
3.326LysGly: 3.326 ± 0.463
1.421LysHis: 1.421 ± 0.208
4.15LysIle: 4.15 ± 0.365
3.44LysLys: 3.44 ± 0.424
6.112LysLeu: 6.112 ± 0.607
2.388LysMet: 2.388 ± 0.256
3.155LysAsn: 3.155 ± 0.345
2.928LysPro: 2.928 ± 0.364
2.047LysGln: 2.047 ± 0.344
4.235LysArg: 4.235 ± 0.496
4.491LysSer: 4.491 ± 0.525
4.946LysThr: 4.946 ± 0.367
2.615LysVal: 2.615 ± 0.315
1.023LysTrp: 1.023 ± 0.164
2.445LysTyr: 2.445 ± 0.278
0.0LysXaa: 0.0 ± 0.0
Leu
6.424LeuAla: 6.424 ± 0.465
1.876LeuCys: 1.876 ± 0.365
4.577LeuAsp: 4.577 ± 0.366
4.889LeuGlu: 4.889 ± 0.506
2.928LeuPhe: 2.928 ± 0.325
4.406LeuGly: 4.406 ± 0.443
1.649LeuHis: 1.649 ± 0.19
3.894LeuIle: 3.894 ± 0.337
6.282LeuLys: 6.282 ± 0.552
7.334LeuLeu: 7.334 ± 0.67
2.104LeuMet: 2.104 ± 0.213
3.98LeuAsn: 3.98 ± 0.317
3.894LeuPro: 3.894 ± 0.394
2.615LeuGln: 2.615 ± 0.286
4.605LeuArg: 4.605 ± 0.441
5.401LeuSer: 5.401 ± 0.499
6.453LeuThr: 6.453 ± 0.483
5.6LeuVal: 5.6 ± 0.476
1.08LeuTrp: 1.08 ± 0.19
2.246LeuTyr: 2.246 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
2.7MetAla: 2.7 ± 0.301
0.938MetCys: 0.938 ± 0.18
1.649MetAsp: 1.649 ± 0.228
1.905MetGlu: 1.905 ± 0.258
0.711MetPhe: 0.711 ± 0.137
1.762MetGly: 1.762 ± 0.284
0.682MetHis: 0.682 ± 0.121
1.023MetIle: 1.023 ± 0.151
1.45MetLys: 1.45 ± 0.209
2.303MetLeu: 2.303 ± 0.24
0.796MetMet: 0.796 ± 0.17
0.824MetAsn: 0.824 ± 0.144
1.052MetPro: 1.052 ± 0.163
0.768MetGln: 0.768 ± 0.136
1.421MetArg: 1.421 ± 0.193
2.018MetSer: 2.018 ± 0.234
2.445MetThr: 2.445 ± 0.288
1.762MetVal: 1.762 ± 0.22
0.398MetTrp: 0.398 ± 0.119
1.109MetTyr: 1.109 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
2.729AsnAla: 2.729 ± 0.281
1.194AsnCys: 1.194 ± 0.229
1.99AsnAsp: 1.99 ± 0.232
1.961AsnGlu: 1.961 ± 0.269
1.677AsnPhe: 1.677 ± 0.177
2.729AsnGly: 2.729 ± 0.331
0.796AsnHis: 0.796 ± 0.151
2.132AsnIle: 2.132 ± 0.287
2.047AsnLys: 2.047 ± 0.252
3.639AsnLeu: 3.639 ± 0.395
1.052AsnMet: 1.052 ± 0.204
1.933AsnAsn: 1.933 ± 0.249
2.786AsnPro: 2.786 ± 0.446
0.711AsnGln: 0.711 ± 0.146
2.303AsnArg: 2.303 ± 0.268
2.473AsnSer: 2.473 ± 0.331
1.819AsnThr: 1.819 ± 0.325
3.61AsnVal: 3.61 ± 0.31
0.483AsnTrp: 0.483 ± 0.113
1.563AsnTyr: 1.563 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
5.117ProAla: 5.117 ± 0.481
1.023ProCys: 1.023 ± 0.193
2.757ProAsp: 2.757 ± 0.297
4.207ProGlu: 4.207 ± 0.545
1.734ProPhe: 1.734 ± 0.285
5.003ProGly: 5.003 ± 1.075
1.251ProHis: 1.251 ± 0.203
2.786ProIle: 2.786 ± 0.351
3.639ProLys: 3.639 ± 0.406
3.695ProLeu: 3.695 ± 0.352
1.165ProMet: 1.165 ± 0.217
1.99ProAsn: 1.99 ± 0.235
4.832ProPro: 4.832 ± 0.761
1.336ProGln: 1.336 ± 0.252
3.354ProArg: 3.354 ± 0.498
4.633ProSer: 4.633 ± 0.763
4.036ProThr: 4.036 ± 0.542
4.747ProVal: 4.747 ± 0.471
0.54ProTrp: 0.54 ± 0.138
1.563ProTyr: 1.563 ± 0.247
0.0ProXaa: 0.0 ± 0.0
Gln
1.99GlnAla: 1.99 ± 0.266
0.654GlnCys: 0.654 ± 0.154
1.62GlnAsp: 1.62 ± 0.249
2.246GlnGlu: 2.246 ± 0.319
0.91GlnPhe: 0.91 ± 0.172
2.075GlnGly: 2.075 ± 0.425
0.796GlnHis: 0.796 ± 0.166
1.421GlnIle: 1.421 ± 0.182
2.615GlnLys: 2.615 ± 1.003
2.985GlnLeu: 2.985 ± 0.291
1.023GlnMet: 1.023 ± 0.165
1.563GlnAsn: 1.563 ± 0.254
1.592GlnPro: 1.592 ± 0.286
1.194GlnGln: 1.194 ± 0.184
1.677GlnArg: 1.677 ± 0.227
1.677GlnSer: 1.677 ± 0.198
2.985GlnThr: 2.985 ± 0.29
1.336GlnVal: 1.336 ± 0.168
0.569GlnTrp: 0.569 ± 0.134
0.995GlnTyr: 0.995 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
4.605ArgAla: 4.605 ± 0.562
0.938ArgCys: 0.938 ± 0.145
3.468ArgAsp: 3.468 ± 0.548
3.326ArgGlu: 3.326 ± 0.392
2.16ArgPhe: 2.16 ± 0.275
4.349ArgGly: 4.349 ± 0.575
1.08ArgHis: 1.08 ± 0.157
2.388ArgIle: 2.388 ± 0.24
3.724ArgLys: 3.724 ± 0.531
5.06ArgLeu: 5.06 ± 0.388
1.791ArgMet: 1.791 ± 0.265
2.018ArgAsn: 2.018 ± 0.317
3.61ArgPro: 3.61 ± 0.519
2.388ArgGln: 2.388 ± 0.288
4.378ArgArg: 4.378 ± 0.575
2.956ArgSer: 2.956 ± 0.366
2.871ArgThr: 2.871 ± 0.41
3.468ArgVal: 3.468 ± 0.387
0.597ArgTrp: 0.597 ± 0.145
1.905ArgTyr: 1.905 ± 0.234
0.0ArgXaa: 0.0 ± 0.0
Ser
5.515SerAla: 5.515 ± 0.345
1.364SerCys: 1.364 ± 0.261
3.61SerAsp: 3.61 ± 0.316
3.894SerGlu: 3.894 ± 0.314
2.16SerPhe: 2.16 ± 0.232
4.093SerGly: 4.093 ± 0.331
1.421SerHis: 1.421 ± 0.242
2.501SerIle: 2.501 ± 0.301
4.463SerLys: 4.463 ± 1.276
5.458SerLeu: 5.458 ± 0.502
1.563SerMet: 1.563 ± 0.167
2.132SerAsn: 2.132 ± 0.23
4.577SerPro: 4.577 ± 0.862
2.189SerGln: 2.189 ± 0.346
3.354SerArg: 3.354 ± 0.38
5.628SerSer: 5.628 ± 1.254
3.894SerThr: 3.894 ± 0.643
5.827SerVal: 5.827 ± 0.514
0.796SerTrp: 0.796 ± 0.16
1.791SerTyr: 1.791 ± 0.236
0.0SerXaa: 0.0 ± 0.0
Thr
5.799ThrAla: 5.799 ± 0.462
1.677ThrCys: 1.677 ± 0.266
3.724ThrAsp: 3.724 ± 0.341
4.434ThrGlu: 4.434 ± 1.121
2.473ThrPhe: 2.473 ± 0.236
5.202ThrGly: 5.202 ± 0.581
1.251ThrHis: 1.251 ± 0.176
2.871ThrIle: 2.871 ± 0.319
3.553ThrLys: 3.553 ± 0.396
4.52ThrLeu: 4.52 ± 0.414
1.677ThrMet: 1.677 ± 0.227
1.99ThrAsn: 1.99 ± 0.246
3.838ThrPro: 3.838 ± 0.398
2.047ThrGln: 2.047 ± 0.273
3.639ThrArg: 3.639 ± 0.306
3.809ThrSer: 3.809 ± 0.46
2.644ThrThr: 2.644 ± 0.282
5.515ThrVal: 5.515 ± 0.679
0.796ThrTrp: 0.796 ± 0.159
2.473ThrTyr: 2.473 ± 0.253
0.0ThrXaa: 0.0 ± 0.0
Val
5.543ValAla: 5.543 ± 0.658
2.075ValCys: 2.075 ± 0.282
3.241ValAsp: 3.241 ± 0.361
4.321ValGlu: 4.321 ± 0.352
2.473ValPhe: 2.473 ± 0.342
3.695ValGly: 3.695 ± 0.353
1.364ValHis: 1.364 ± 0.207
3.297ValIle: 3.297 ± 0.328
5.77ValLys: 5.77 ± 0.432
5.77ValLeu: 5.77 ± 0.441
1.905ValMet: 1.905 ± 0.201
2.899ValAsn: 2.899 ± 0.267
3.07ValPro: 3.07 ± 0.318
2.246ValGln: 2.246 ± 0.247
4.036ValArg: 4.036 ± 0.402
4.946ValSer: 4.946 ± 0.434
4.804ValThr: 4.804 ± 0.454
5.628ValVal: 5.628 ± 0.425
0.995ValTrp: 0.995 ± 0.163
2.672ValTyr: 2.672 ± 0.298
0.0ValXaa: 0.0 ± 0.0
Trp
1.08TrpAla: 1.08 ± 0.219
0.483TrpCys: 0.483 ± 0.139
0.654TrpAsp: 0.654 ± 0.141
0.91TrpGlu: 0.91 ± 0.209
0.512TrpPhe: 0.512 ± 0.119
0.54TrpGly: 0.54 ± 0.142
0.227TrpHis: 0.227 ± 0.083
0.54TrpIle: 0.54 ± 0.114
0.938TrpLys: 0.938 ± 0.165
1.62TrpLeu: 1.62 ± 0.23
0.227TrpMet: 0.227 ± 0.086
0.483TrpAsn: 0.483 ± 0.125
0.739TrpPro: 0.739 ± 0.128
0.569TrpGln: 0.569 ± 0.125
0.597TrpArg: 0.597 ± 0.125
0.853TrpSer: 0.853 ± 0.161
0.853TrpThr: 0.853 ± 0.155
0.768TrpVal: 0.768 ± 0.171
0.199TrpTrp: 0.199 ± 0.093
0.54TrpTyr: 0.54 ± 0.143
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.473TyrAla: 2.473 ± 0.32
0.966TyrCys: 0.966 ± 0.149
1.99TyrAsp: 1.99 ± 0.323
2.075TyrGlu: 2.075 ± 0.297
0.768TyrPhe: 0.768 ± 0.132
1.933TyrGly: 1.933 ± 0.227
0.398TyrHis: 0.398 ± 0.116
1.336TyrIle: 1.336 ± 0.166
2.644TyrLys: 2.644 ± 0.267
2.331TyrLeu: 2.331 ± 0.313
1.052TyrMet: 1.052 ± 0.173
1.279TyrAsn: 1.279 ± 0.211
1.62TyrPro: 1.62 ± 0.199
0.91TyrGln: 0.91 ± 0.128
2.018TyrArg: 2.018 ± 0.273
2.587TyrSer: 2.587 ± 0.26
2.331TyrThr: 2.331 ± 0.31
3.07TyrVal: 3.07 ± 0.298
0.341TyrTrp: 0.341 ± 0.094
1.165TyrTyr: 1.165 ± 0.164
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (35180 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski