Amino acid dipeptide frequency for Halovirus HSTV-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
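The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, dipeptides are taken as overlapping pairs within each protein, the input sequences are assumed to contain only the 20 standard one-letter codes, and the over/under threshold is assumed to be the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: pairs are overlapping windows of length 2 and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input: pairs are MA, AA, AD, AA, AG, so AA is 2 of 5 pairs.
freqs = dipeptide_per_mille(["MAAD", "AAG"])
print(freqs["AA"])      # 400.0
print(classify(8.126))  # over  (the AlaAla value from the table)
print(classify(0.438))  # under (the AlaCys value from the table)
```

Comparing against a single uniform threshold is the simplest reading of the text; a comparison against the product of the single amino acid frequencies would be a different, stricter notion of "expected".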

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
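As a small worked example of the unit conversion, the snippet below turns one table value (AlaAla, 8.126‰) into a fraction and a percentage, and relates it to the uniform expectation of 0.25% for 400 dipeptides. The helper name `per_mille_to_fraction` is made up for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

ala_ala = 8.126                            # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.008126
percent = fraction * 100                   # about 0.8126 percent
fold = ala_ala / UNIFORM_EXPECTATION       # about 3.25 times the uniform expectation

print(fraction, percent, fold)
```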

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.126 ± 0.841
AlaCys: 0.438 ± 0.127
AlaAsp: 6.423 ± 0.65
AlaGlu: 10.364 ± 0.767
AlaPhe: 2.628 ± 0.343
AlaGly: 5.304 ± 0.666
AlaHis: 1.411 ± 0.258
AlaIle: 4.233 ± 0.481
AlaLys: 3.066 ± 0.415
AlaLeu: 6.034 ± 0.518
AlaMet: 2.53 ± 0.332
AlaAsn: 2.92 ± 0.448
AlaPro: 2.968 ± 0.449
AlaGln: 2.725 ± 0.374
AlaArg: 4.525 ± 0.413
AlaSer: 4.817 ± 0.51
AlaThr: 4.233 ± 0.397
AlaVal: 6.18 ± 0.537
AlaTrp: 1.168 ± 0.254
AlaTyr: 2.482 ± 0.38
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.438 ± 0.143
CysCys: 0.195 ± 0.093
CysAsp: 0.487 ± 0.2
CysGlu: 0.633 ± 0.183
CysPhe: 0.195 ± 0.102
CysGly: 0.876 ± 0.189
CysHis: 0.195 ± 0.101
CysIle: 0.341 ± 0.138
CysLys: 0.049 ± 0.049
CysLeu: 0.487 ± 0.17
CysMet: 0.195 ± 0.082
CysAsn: 0.195 ± 0.101
CysPro: 0.779 ± 0.243
CysGln: 0.146 ± 0.083
CysArg: 0.292 ± 0.112
CysSer: 0.243 ± 0.123
CysThr: 0.195 ± 0.093
CysVal: 0.341 ± 0.124
CysTrp: 0.049 ± 0.049
CysTyr: 0.292 ± 0.114
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.272 ± 0.654
AspCys: 0.243 ± 0.131
AspAsp: 9.245 ± 0.923
AspGlu: 9.051 ± 0.733
AspPhe: 4.185 ± 0.512
AspGly: 9.197 ± 0.814
AspHis: 1.508 ± 0.275
AspIle: 4.039 ± 0.436
AspLys: 1.995 ± 0.311
AspLeu: 7.153 ± 0.525
AspMet: 1.314 ± 0.234
AspAsn: 3.358 ± 0.386
AspPro: 4.233 ± 0.46
AspGln: 1.557 ± 0.267
AspArg: 4.671 ± 0.467
AspSer: 5.304 ± 0.528
AspThr: 4.087 ± 0.451
AspVal: 5.207 ± 0.565
AspTrp: 1.654 ± 0.29
AspTyr: 3.747 ± 0.394
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.44 ± 0.928
GluCys: 0.633 ± 0.193
GluAsp: 8.905 ± 0.68
GluGlu: 7.056 ± 0.589
GluPhe: 3.844 ± 0.459
GluGly: 7.202 ± 0.572
GluHis: 1.654 ± 0.283
GluIle: 4.185 ± 0.464
GluLys: 3.114 ± 0.359
GluLeu: 7.591 ± 0.466
GluMet: 3.601 ± 0.469
GluAsn: 3.844 ± 0.459
GluPro: 4.331 ± 0.458
GluGln: 3.893 ± 0.416
GluArg: 5.401 ± 0.568
GluSer: 4.039 ± 0.472
GluThr: 5.596 ± 0.59
GluVal: 8.369 ± 0.665
GluTrp: 1.508 ± 0.262
GluTyr: 2.92 ± 0.41
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.968 ± 0.367
PheCys: 0.389 ± 0.112
PheAsp: 3.163 ± 0.363
PheGlu: 3.941 ± 0.476
PhePhe: 1.168 ± 0.239
PheGly: 3.212 ± 0.396
PheHis: 1.508 ± 0.29
PheIle: 1.508 ± 0.289
PheLys: 1.849 ± 0.401
PheLeu: 1.8 ± 0.302
PheMet: 0.681 ± 0.201
PheAsn: 1.411 ± 0.236
PhePro: 1.703 ± 0.315
PheGln: 1.216 ± 0.199
PheArg: 2.336 ± 0.328
PheSer: 1.703 ± 0.332
PheThr: 1.752 ± 0.351
PheVal: 1.898 ± 0.267
PheTrp: 0.146 ± 0.074
PheTyr: 1.314 ± 0.265
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.985 ± 0.796
GlyCys: 0.341 ± 0.131
GlyAsp: 7.104 ± 0.801
GlyGlu: 7.98 ± 0.658
GlyPhe: 3.358 ± 0.356
GlyGly: 9.294 ± 1.975
GlyHis: 1.898 ± 0.384
GlyIle: 3.406 ± 0.465
GlyLys: 2.871 ± 0.389
GlyLeu: 5.158 ± 0.487
GlyMet: 2.044 ± 0.268
GlyAsn: 3.358 ± 0.539
GlyPro: 2.238 ± 0.393
GlyGln: 2.482 ± 0.311
GlyArg: 3.747 ± 0.385
GlySer: 4.379 ± 0.485
GlyThr: 3.795 ± 0.39
GlyVal: 5.304 ± 0.567
GlyTrp: 1.216 ± 0.296
GlyTyr: 3.017 ± 0.344
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.654 ± 0.309
HisCys: 0.146 ± 0.091
HisAsp: 1.606 ± 0.295
HisGlu: 2.336 ± 0.364
HisPhe: 0.584 ± 0.193
HisGly: 1.946 ± 0.346
HisHis: 0.535 ± 0.171
HisIle: 1.216 ± 0.256
HisLys: 0.633 ± 0.249
HisLeu: 1.654 ± 0.285
HisMet: 0.243 ± 0.102
HisAsn: 0.925 ± 0.19
HisPro: 1.752 ± 0.26
HisGln: 0.487 ± 0.142
HisArg: 1.654 ± 0.303
HisSer: 1.265 ± 0.254
HisThr: 1.022 ± 0.205
HisVal: 1.898 ± 0.347
HisTrp: 0.243 ± 0.096
HisTyr: 0.827 ± 0.159
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.309 ± 0.409
IleCys: 0.292 ± 0.131
IleAsp: 4.72 ± 0.397
IleGlu: 5.596 ± 0.52
IlePhe: 1.071 ± 0.217
IleGly: 2.822 ± 0.411
IleHis: 0.779 ± 0.175
IleIle: 1.752 ± 0.275
IleLys: 1.46 ± 0.299
IleLeu: 3.309 ± 0.346
IleMet: 0.827 ± 0.186
IleAsn: 1.946 ± 0.367
IlePro: 2.092 ± 0.362
IleGln: 1.46 ± 0.267
IleArg: 3.114 ± 0.45
IleSer: 2.92 ± 0.403
IleThr: 3.163 ± 0.423
IleVal: 2.384 ± 0.322
IleTrp: 0.195 ± 0.09
IleTyr: 1.022 ± 0.186
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.92 ± 0.441
LysCys: 0.292 ± 0.158
LysAsp: 2.774 ± 0.39
LysGlu: 3.114 ± 0.435
LysPhe: 1.362 ± 0.277
LysGly: 1.557 ± 0.282
LysHis: 1.022 ± 0.165
LysIle: 1.119 ± 0.268
LysLys: 1.508 ± 0.426
LysLeu: 2.044 ± 0.296
LysMet: 1.071 ± 0.214
LysAsn: 1.508 ± 0.281
LysPro: 1.752 ± 0.296
LysGln: 1.362 ± 0.277
LysArg: 2.822 ± 0.444
LysSer: 2.968 ± 0.471
LysThr: 2.384 ± 0.313
LysVal: 2.725 ± 0.378
LysTrp: 0.341 ± 0.125
LysTyr: 1.119 ± 0.232
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.158 ± 0.386
LeuCys: 0.779 ± 0.174
LeuAsp: 6.569 ± 0.534
LeuGlu: 5.839 ± 0.521
LeuPhe: 1.995 ± 0.394
LeuGly: 4.574 ± 0.516
LeuHis: 1.995 ± 0.341
LeuIle: 3.503 ± 0.563
LeuLys: 2.92 ± 0.432
LeuLeu: 5.255 ± 0.565
LeuMet: 1.557 ± 0.241
LeuAsn: 3.066 ± 0.352
LeuPro: 3.212 ± 0.36
LeuGln: 1.995 ± 0.314
LeuArg: 5.207 ± 0.558
LeuSer: 4.136 ± 0.377
LeuThr: 4.623 ± 0.486
LeuVal: 4.915 ± 0.519
LeuTrp: 1.071 ± 0.212
LeuTyr: 1.703 ± 0.279
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.898 ± 0.393
MetCys: 0.0 ± 0.0
MetAsp: 1.995 ± 0.325
MetGlu: 1.995 ± 0.256
MetPhe: 0.73 ± 0.175
MetGly: 1.508 ± 0.238
MetHis: 0.487 ± 0.175
MetIle: 0.389 ± 0.131
MetLys: 1.168 ± 0.242
MetLeu: 1.849 ± 0.261
MetMet: 0.487 ± 0.143
MetAsn: 1.46 ± 0.265
MetPro: 1.46 ± 0.215
MetGln: 0.341 ± 0.116
MetArg: 1.071 ± 0.248
MetSer: 2.238 ± 0.337
MetThr: 1.995 ± 0.307
MetVal: 2.19 ± 0.259
MetTrp: 0.584 ± 0.139
MetTyr: 0.535 ± 0.127
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.358 ± 0.354
AsnCys: 0.243 ± 0.099
AsnAsp: 3.893 ± 0.403
AsnGlu: 2.92 ± 0.401
AsnPhe: 1.703 ± 0.255
AsnGly: 3.503 ± 0.671
AsnHis: 1.022 ± 0.198
AsnIle: 1.849 ± 0.271
AsnLys: 1.216 ± 0.243
AsnLeu: 1.995 ± 0.325
AsnMet: 1.071 ± 0.254
AsnAsn: 1.946 ± 0.563
AsnPro: 3.114 ± 0.362
AsnGln: 0.973 ± 0.259
AsnArg: 2.384 ± 0.375
AsnSer: 2.044 ± 0.303
AsnThr: 2.141 ± 0.381
AsnVal: 2.238 ± 0.371
AsnTrp: 0.389 ± 0.123
AsnTyr: 1.606 ± 0.261
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.066 ± 0.372
ProCys: 0.195 ± 0.097
ProAsp: 6.131 ± 0.527
ProGlu: 5.401 ± 0.465
ProPhe: 1.703 ± 0.317
ProGly: 3.309 ± 0.409
ProHis: 1.362 ± 0.286
ProIle: 2.238 ± 0.25
ProLys: 1.557 ± 0.284
ProLeu: 2.287 ± 0.357
ProMet: 1.703 ± 0.259
ProAsn: 1.849 ± 0.301
ProPro: 2.53 ± 0.461
ProGln: 1.411 ± 0.237
ProArg: 2.822 ± 0.37
ProSer: 2.19 ± 0.371
ProThr: 2.968 ± 0.473
ProVal: 3.406 ± 0.441
ProTrp: 0.487 ± 0.169
ProTyr: 0.925 ± 0.226
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.482 ± 0.351
GlnCys: 0.195 ± 0.121
GlnAsp: 1.606 ± 0.281
GlnGlu: 2.725 ± 0.336
GlnPhe: 1.411 ± 0.273
GlnGly: 2.044 ± 0.281
GlnHis: 0.681 ± 0.191
GlnIle: 1.606 ± 0.269
GlnLys: 1.508 ± 0.249
GlnLeu: 1.557 ± 0.245
GlnMet: 0.827 ± 0.216
GlnAsn: 1.071 ± 0.257
GlnPro: 1.46 ± 0.275
GlnGln: 1.46 ± 0.273
GlnArg: 2.044 ± 0.324
GlnSer: 2.141 ± 0.327
GlnThr: 1.703 ± 0.31
GlnVal: 2.579 ± 0.395
GlnTrp: 0.535 ± 0.162
GlnTyr: 0.973 ± 0.193
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.574 ± 0.454
ArgCys: 0.389 ± 0.132
ArgAsp: 4.477 ± 0.412
ArgGlu: 5.547 ± 0.541
ArgPhe: 2.336 ± 0.304
ArgGly: 3.406 ± 0.475
ArgHis: 1.752 ± 0.312
ArgIle: 2.725 ± 0.328
ArgLys: 2.433 ± 0.352
ArgLeu: 5.255 ± 0.506
ArgMet: 1.362 ± 0.257
ArgAsn: 2.433 ± 0.309
ArgPro: 2.336 ± 0.341
ArgGln: 2.092 ± 0.341
ArgArg: 5.109 ± 0.717
ArgSer: 3.941 ± 0.46
ArgThr: 3.698 ± 0.397
ArgVal: 3.99 ± 0.379
ArgTrp: 0.925 ± 0.223
ArgTyr: 2.482 ± 0.365
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.623 ± 0.473
SerCys: 0.487 ± 0.183
SerAsp: 4.233 ± 0.424
SerGlu: 6.082 ± 0.583
SerPhe: 1.752 ± 0.288
SerGly: 5.499 ± 0.64
SerHis: 1.119 ± 0.246
SerIle: 2.92 ± 0.367
SerLys: 2.433 ± 0.386
SerLeu: 4.379 ± 0.461
SerMet: 1.265 ± 0.236
SerAsn: 1.411 ± 0.337
SerPro: 3.26 ± 0.398
SerGln: 1.898 ± 0.327
SerArg: 3.698 ± 0.408
SerSer: 3.99 ± 0.474
SerThr: 2.774 ± 0.32
SerVal: 4.136 ± 0.475
SerTrp: 1.265 ± 0.22
SerTyr: 1.362 ± 0.228
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.158 ± 0.577
ThrCys: 0.341 ± 0.116
ThrAsp: 4.72 ± 0.581
ThrGlu: 5.644 ± 0.56
ThrPhe: 1.995 ± 0.311
ThrGly: 5.109 ± 0.433
ThrHis: 1.265 ± 0.222
ThrIle: 2.725 ± 0.336
ThrLys: 2.19 ± 0.375
ThrLeu: 4.379 ± 0.433
ThrMet: 1.508 ± 0.269
ThrAsn: 1.8 ± 0.311
ThrPro: 3.114 ± 0.419
ThrGln: 1.557 ± 0.262
ThrArg: 3.114 ± 0.418
ThrSer: 2.53 ± 0.439
ThrThr: 3.455 ± 0.428
ThrVal: 4.282 ± 0.423
ThrTrp: 0.73 ± 0.164
ThrTyr: 1.752 ± 0.295
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.228 ± 0.539
ValCys: 0.584 ± 0.158
ValAsp: 6.374 ± 0.565
ValGlu: 6.958 ± 0.588
ValPhe: 2.092 ± 0.334
ValGly: 5.158 ± 0.48
ValHis: 1.46 ± 0.274
ValIle: 2.822 ± 0.29
ValLys: 2.141 ± 0.362
ValLeu: 4.769 ± 0.509
ValMet: 1.314 ± 0.272
ValAsn: 3.114 ± 0.501
ValPro: 3.163 ± 0.398
ValGln: 2.336 ± 0.326
ValArg: 4.379 ± 0.465
ValSer: 4.185 ± 0.375
ValThr: 5.012 ± 0.672
ValVal: 6.082 ± 0.551
ValTrp: 0.438 ± 0.146
ValTyr: 2.384 ± 0.3
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.827 ± 0.18
TrpCys: 0.146 ± 0.071
TrpAsp: 1.411 ± 0.278
TrpGlu: 0.973 ± 0.256
TrpPhe: 0.73 ± 0.179
TrpGly: 0.876 ± 0.181
TrpHis: 0.243 ± 0.114
TrpIle: 0.292 ± 0.109
TrpLys: 0.827 ± 0.19
TrpLeu: 1.168 ± 0.284
TrpMet: 0.438 ± 0.133
TrpAsn: 0.73 ± 0.174
TrpPro: 0.341 ± 0.125
TrpGln: 0.243 ± 0.105
TrpArg: 0.876 ± 0.186
TrpSer: 1.022 ± 0.201
TrpThr: 0.925 ± 0.241
TrpVal: 1.022 ± 0.204
TrpTrp: 0.195 ± 0.087
TrpTyr: 0.438 ± 0.16
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.092 ± 0.329
TyrCys: 0.292 ± 0.102
TyrAsp: 4.185 ± 0.469
TyrGlu: 2.968 ± 0.374
TyrPhe: 0.827 ± 0.197
TyrGly: 2.579 ± 0.37
TyrHis: 0.73 ± 0.192
TyrIle: 1.362 ± 0.244
TyrLys: 0.827 ± 0.189
TyrLeu: 1.946 ± 0.298
TyrMet: 0.292 ± 0.124
TyrAsn: 1.265 ± 0.255
TyrPro: 1.703 ± 0.361
TyrGln: 0.973 ± 0.201
TyrArg: 1.995 ± 0.33
TyrSer: 2.53 ± 0.452
TyrThr: 1.849 ± 0.276
TyrVal: 1.898 ± 0.305
TyrTrp: 0.584 ± 0.196
TyrTyr: 1.362 ± 0.263
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (20552 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
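A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration and not the database's actual procedure: the function names, the use of overlapping pairs, the fixed random seed, and the choice of the population standard deviation as the reported error are all assumptions made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Spread of one dipeptide's per-mille frequency over protein resamples.

    Whole proteins are drawn with replacement, so the resampling unit is
    the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy proteome of three short hypothetical sequences.
proteins = ["MAAD", "GGSA", "AAAG"]
print(bootstrap_error(proteins, "AA"))
```

Resampling whole proteins keeps the dipeptides of one protein together, which is why the error can be large for dipeptides concentrated in a few proteins.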

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski