Amino acid dipeptide frequency for Halovirus HCTV-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
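As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is only a minimal sketch: the function name, the toy sequences, the one-letter residue codes and the normalization by the total number of counted pairs are assumptions made here, and the database may count or normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences in one-letter code, not taken from the HCTV-1 proteome.
toy = ["MADDE", "AADE"]
freqs = dipeptide_frequencies(toy)
```

Here the pair AD occurs twice among seven counted pairs, so its frequency is 2/7, or roughly 286 per mille.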

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
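The expectation and the red/blue marking can be sketched as follows. This assumes that each table value is compared directly with the uniform expectation of 1/400; the function names are hypothetical and the page may apply a different reference or tolerance.

```python
def expected_uniform_permille(alphabet_size=20):
    """Frequency (per mille) of each dipeptide if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


expected = expected_uniform_permille()  # 2.5 per mille, i.e. 0.25%

# Three values copied from the table on this page.
examples = {"AspGlu": 13.089, "CysCys": 0.092, "AlaPro": 2.502}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
```

With 21 symbols (including Xaa) the same function gives 1000/441, about 2.27 per mille.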

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
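The unit conversion is simple arithmetic; a small sketch, with helper names chosen here purely for illustration:

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.156                            # AlaAla value from the table, in per mille
fraction = permille_to_fraction(ala_ala)   # about 0.005156
percent = permille_to_percent(ala_ala)     # about 0.5156
```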

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are listed as dipeptide: frequency ± error, grouped by first residue.
Ala
AlaAla: 5.156 ± 0.623
AlaCys: 0.885 ± 0.181
AlaAsp: 6.743 ± 0.471
AlaGlu: 6.895 ± 0.474
AlaPhe: 2.349 ± 0.242
AlaGly: 5.888 ± 0.7
AlaHis: 1.77 ± 0.241
AlaIle: 3.021 ± 0.315
AlaLys: 2.898 ± 0.361
AlaLeu: 5.278 ± 0.436
AlaMet: 1.678 ± 0.254
AlaAsn: 2.136 ± 0.303
AlaPro: 2.502 ± 0.298
AlaGln: 2.776 ± 0.31
AlaArg: 4.515 ± 0.36
AlaSer: 4.363 ± 0.469
AlaThr: 3.295 ± 0.423
AlaVal: 4.21 ± 0.442
AlaTrp: 0.976 ± 0.205
AlaTyr: 2.685 ± 0.28
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.519 ± 0.124
CysCys: 0.092 ± 0.051
CysAsp: 1.007 ± 0.178
CysGlu: 0.885 ± 0.168
CysPhe: 0.275 ± 0.09
CysGly: 1.587 ± 0.291
CysHis: 0.122 ± 0.06
CysIle: 0.183 ± 0.067
CysLys: 0.427 ± 0.125
CysLeu: 0.488 ± 0.122
CysMet: 0.244 ± 0.085
CysAsn: 0.397 ± 0.097
CysPro: 0.976 ± 0.215
CysGln: 0.275 ± 0.078
CysArg: 0.763 ± 0.174
CysSer: 0.366 ± 0.113
CysThr: 0.214 ± 0.082
CysVal: 0.549 ± 0.121
CysTrp: 0.153 ± 0.057
CysTyr: 0.092 ± 0.045
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.292 ± 0.512
AspCys: 0.854 ± 0.189
AspAsp: 11.929 ± 0.988
AspGlu: 13.089 ± 0.926
AspPhe: 3.234 ± 0.264
AspGly: 7.75 ± 0.556
AspHis: 1.312 ± 0.256
AspIle: 3.814 ± 0.403
AspLys: 2.776 ± 0.299
AspLeu: 8.207 ± 0.61
AspMet: 2.441 ± 0.245
AspAsn: 3.936 ± 0.388
AspPro: 4.21 ± 0.369
AspGln: 2.166 ± 0.234
AspArg: 4.638 ± 0.396
AspSer: 4.729 ± 0.374
AspThr: 5.766 ± 0.375
AspVal: 7.2 ± 0.45
AspTrp: 1.709 ± 0.253
AspTyr: 2.929 ± 0.303
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.292 ± 0.533
GluCys: 1.251 ± 0.265
GluAsp: 10.282 ± 0.694
GluGlu: 10.465 ± 0.891
GluPhe: 3.356 ± 0.297
GluGly: 7.811 ± 0.585
GluHis: 2.075 ± 0.245
GluIle: 4.485 ± 0.363
GluLys: 4.515 ± 0.48
GluLeu: 6.01 ± 0.414
GluMet: 3.631 ± 0.407
GluAsn: 3.692 ± 0.383
GluPro: 3.295 ± 0.342
GluGln: 4.149 ± 0.375
GluArg: 6.895 ± 0.487
GluSer: 5.919 ± 0.443
GluThr: 6.102 ± 0.463
GluVal: 9.336 ± 0.637
GluTrp: 2.349 ± 0.314
GluTyr: 3.814 ± 0.295
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.709 ± 0.228
PheCys: 0.305 ± 0.113
PheAsp: 3.997 ± 0.319
PheGlu: 3.448 ± 0.329
PhePhe: 1.098 ± 0.202
PheGly: 2.349 ± 0.248
PheHis: 0.671 ± 0.134
PheIle: 1.922 ± 0.312
PheLys: 0.854 ± 0.148
PheLeu: 2.166 ± 0.288
PheMet: 0.641 ± 0.131
PheAsn: 1.342 ± 0.182
PhePro: 1.22 ± 0.199
PheGln: 0.976 ± 0.18
PheArg: 1.587 ± 0.215
PheSer: 2.349 ± 0.297
PheThr: 2.044 ± 0.276
PheVal: 2.837 ± 0.271
PheTrp: 1.037 ± 0.224
PheTyr: 0.976 ± 0.213
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.065 ± 0.628
GlyCys: 0.58 ± 0.137
GlyAsp: 8.329 ± 0.659
GlyGlu: 8.116 ± 0.66
GlyPhe: 2.715 ± 0.327
GlyGly: 6.804 ± 0.831
GlyHis: 1.587 ± 0.209
GlyIle: 3.326 ± 0.321
GlyLys: 3.021 ± 0.244
GlyLeu: 3.844 ± 0.419
GlyMet: 1.983 ± 0.21
GlyAsn: 2.929 ± 0.238
GlyPro: 1.831 ± 0.268
GlyGln: 2.593 ± 0.275
GlyArg: 4.149 ± 0.295
GlySer: 4.454 ± 0.485
GlyThr: 4.119 ± 0.348
GlyVal: 5.827 ± 0.391
GlyTrp: 1.464 ± 0.2
GlyTyr: 2.624 ± 0.257
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.831 ± 0.243
HisCys: 0.214 ± 0.078
HisAsp: 1.098 ± 0.224
HisGlu: 1.922 ± 0.228
HisPhe: 0.763 ± 0.15
HisGly: 1.678 ± 0.262
HisHis: 0.488 ± 0.13
HisIle: 0.763 ± 0.154
HisLys: 0.488 ± 0.11
HisLeu: 1.709 ± 0.225
HisMet: 0.275 ± 0.078
HisAsn: 0.915 ± 0.151
HisPro: 0.976 ± 0.15
HisGln: 0.61 ± 0.14
HisArg: 1.281 ± 0.206
HisSer: 1.159 ± 0.158
HisThr: 0.885 ± 0.172
HisVal: 1.617 ± 0.285
HisTrp: 0.336 ± 0.098
HisTyr: 0.61 ± 0.173
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.227 ± 0.245
IleCys: 0.397 ± 0.098
IleAsp: 4.332 ± 0.306
IleGlu: 4.668 ± 0.402
IlePhe: 1.129 ± 0.201
IleGly: 3.112 ± 0.366
IleHis: 0.885 ± 0.181
IleIle: 1.556 ± 0.266
IleLys: 1.159 ± 0.204
IleLeu: 2.776 ± 0.337
IleMet: 0.671 ± 0.153
IleAsn: 1.526 ± 0.235
IlePro: 2.288 ± 0.223
IleGln: 1.495 ± 0.204
IleArg: 2.868 ± 0.34
IleSer: 2.624 ± 0.277
IleThr: 2.41 ± 0.282
IleVal: 2.959 ± 0.313
IleTrp: 0.58 ± 0.131
IleTyr: 0.946 ± 0.162
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.143 ± 0.38
LysCys: 0.244 ± 0.086
LysAsp: 2.38 ± 0.328
LysGlu: 3.844 ± 0.363
LysPhe: 1.068 ± 0.174
LysGly: 2.319 ± 0.266
LysHis: 0.824 ± 0.165
LysIle: 1.495 ± 0.229
LysLys: 1.434 ± 0.243
LysLeu: 2.715 ± 0.281
LysMet: 1.129 ± 0.245
LysAsn: 1.739 ± 0.239
LysPro: 1.312 ± 0.203
LysGln: 1.434 ± 0.209
LysArg: 2.807 ± 0.324
LysSer: 1.953 ± 0.254
LysThr: 2.471 ± 0.293
LysVal: 3.082 ± 0.306
LysTrp: 0.549 ± 0.155
LysTyr: 1.648 ± 0.233
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.522 ± 0.348
LeuCys: 0.549 ± 0.122
LeuAsp: 7.658 ± 0.426
LeuGlu: 7.444 ± 0.506
LeuPhe: 1.8 ± 0.216
LeuGly: 4.515 ± 0.432
LeuHis: 1.159 ± 0.211
LeuIle: 2.166 ± 0.231
LeuLys: 2.746 ± 0.292
LeuLeu: 4.363 ± 0.397
LeuMet: 1.831 ± 0.259
LeuAsn: 2.105 ± 0.234
LeuPro: 2.776 ± 0.295
LeuGln: 2.227 ± 0.318
LeuArg: 4.21 ± 0.335
LeuSer: 4.76 ± 0.408
LeuThr: 4.18 ± 0.409
LeuVal: 4.821 ± 0.451
LeuTrp: 1.037 ± 0.242
LeuTyr: 1.648 ± 0.248
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.831 ± 0.246
MetCys: 0.153 ± 0.065
MetAsp: 1.526 ± 0.225
MetGlu: 1.892 ± 0.242
MetPhe: 0.641 ± 0.146
MetGly: 1.8 ± 0.229
MetHis: 0.366 ± 0.114
MetIle: 0.61 ± 0.132
MetLys: 1.403 ± 0.211
MetLeu: 1.709 ± 0.231
MetMet: 0.488 ± 0.124
MetAsn: 1.129 ± 0.158
MetPro: 1.739 ± 0.257
MetGln: 0.671 ± 0.146
MetArg: 1.342 ± 0.233
MetSer: 2.563 ± 0.274
MetThr: 2.441 ± 0.273
MetVal: 1.464 ± 0.277
MetTrp: 0.275 ± 0.077
MetTyr: 0.702 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.99 ± 0.306
AsnCys: 0.488 ± 0.106
AsnAsp: 4.058 ± 0.259
AsnGlu: 3.783 ± 0.339
AsnPhe: 1.037 ± 0.185
AsnGly: 2.685 ± 0.333
AsnHis: 0.732 ± 0.187
AsnIle: 1.648 ± 0.231
AsnLys: 1.068 ± 0.16
AsnLeu: 2.349 ± 0.277
AsnMet: 0.58 ± 0.143
AsnAsn: 1.037 ± 0.171
AsnPro: 2.593 ± 0.305
AsnGln: 0.915 ± 0.175
AsnArg: 2.227 ± 0.265
AsnSer: 2.136 ± 0.242
AsnThr: 2.014 ± 0.261
AsnVal: 2.959 ± 0.342
AsnTrp: 0.732 ± 0.196
AsnTyr: 1.129 ± 0.16
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.051 ± 0.343
ProCys: 0.336 ± 0.097
ProAsp: 5.4 ± 0.432
ProGlu: 5.431 ± 0.416
ProPhe: 1.495 ± 0.229
ProGly: 3.021 ± 0.315
ProHis: 0.885 ± 0.166
ProIle: 1.434 ± 0.199
ProLys: 1.19 ± 0.156
ProLeu: 1.709 ± 0.187
ProMet: 0.763 ± 0.158
ProAsn: 1.464 ± 0.194
ProPro: 1.434 ± 0.215
ProGln: 1.281 ± 0.19
ProArg: 2.075 ± 0.245
ProSer: 2.959 ± 0.284
ProThr: 2.258 ± 0.293
ProVal: 2.258 ± 0.253
ProTrp: 0.61 ± 0.133
ProTyr: 1.037 ± 0.184
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.654 ± 0.258
GlnCys: 0.275 ± 0.095
GlnAsp: 2.898 ± 0.319
GlnGlu: 3.265 ± 0.282
GlnPhe: 0.976 ± 0.161
GlnGly: 2.136 ± 0.242
GlnHis: 0.641 ± 0.116
GlnIle: 1.617 ± 0.204
GlnLys: 1.556 ± 0.23
GlnLeu: 2.441 ± 0.297
GlnMet: 1.037 ± 0.215
GlnAsn: 1.373 ± 0.194
GlnPro: 1.19 ± 0.183
GlnGln: 1.342 ± 0.254
GlnArg: 1.678 ± 0.236
GlnSer: 2.105 ± 0.265
GlnThr: 1.831 ± 0.24
GlnVal: 2.715 ± 0.31
GlnTrp: 0.488 ± 0.105
GlnTyr: 0.976 ± 0.172
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.905 ± 0.355
ArgCys: 0.58 ± 0.138
ArgAsp: 4.363 ± 0.379
ArgGlu: 5.797 ± 0.347
ArgPhe: 2.349 ± 0.251
ArgGly: 3.875 ± 0.379
ArgHis: 1.739 ± 0.251
ArgIle: 2.593 ± 0.241
ArgLys: 2.898 ± 0.309
ArgLeu: 4.79 ± 0.46
ArgMet: 2.197 ± 0.342
ArgAsn: 1.983 ± 0.219
ArgPro: 1.8 ± 0.257
ArgGln: 1.983 ± 0.251
ArgArg: 4.515 ± 0.419
ArgSer: 3.631 ± 0.309
ArgThr: 2.99 ± 0.327
ArgVal: 4.363 ± 0.404
ArgTrp: 1.007 ± 0.17
ArgTyr: 2.471 ± 0.294
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.997 ± 0.454
SerCys: 0.61 ± 0.162
SerAsp: 7.078 ± 0.475
SerGlu: 5.919 ± 0.44
SerPhe: 2.563 ± 0.321
SerGly: 5.339 ± 0.452
SerHis: 0.885 ± 0.192
SerIle: 2.654 ± 0.34
SerLys: 2.319 ± 0.266
SerLeu: 4.729 ± 0.375
SerMet: 1.434 ± 0.183
SerAsn: 2.105 ± 0.254
SerPro: 2.41 ± 0.264
SerGln: 2.105 ± 0.218
SerArg: 3.356 ± 0.335
SerSer: 4.149 ± 0.433
SerThr: 3.966 ± 0.352
SerVal: 4.271 ± 0.415
SerTrp: 0.885 ± 0.168
SerTyr: 1.861 ± 0.224
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.302 ± 0.335
ThrCys: 0.366 ± 0.112
ThrAsp: 4.668 ± 0.356
ThrGlu: 5.339 ± 0.389
ThrPhe: 2.929 ± 0.365
ThrGly: 4.363 ± 0.347
ThrHis: 1.007 ± 0.17
ThrIle: 2.898 ± 0.317
ThrLys: 2.075 ± 0.255
ThrLeu: 4.393 ± 0.382
ThrMet: 1.129 ± 0.145
ThrAsn: 2.044 ± 0.253
ThrPro: 2.654 ± 0.225
ThrGln: 2.044 ± 0.247
ThrArg: 3.509 ± 0.325
ThrSer: 3.448 ± 0.338
ThrThr: 3.814 ± 0.544
ThrVal: 4.515 ± 0.374
ThrTrp: 0.946 ± 0.203
ThrTyr: 1.739 ± 0.211
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.973 ± 0.47
ValCys: 0.793 ± 0.15
ValAsp: 6.377 ± 0.439
ValGlu: 8.756 ± 0.62
ValPhe: 2.075 ± 0.217
ValGly: 4.546 ± 0.333
ValHis: 1.617 ± 0.275
ValIle: 2.715 ± 0.332
ValLys: 3.112 ± 0.299
ValLeu: 4.485 ± 0.466
ValMet: 1.495 ± 0.19
ValAsn: 3.234 ± 0.317
ValPro: 3.387 ± 0.327
ValGln: 2.532 ± 0.325
ValArg: 4.638 ± 0.404
ValSer: 5.949 ± 0.432
ValThr: 4.638 ± 0.45
ValVal: 5.919 ± 0.463
ValTrp: 0.854 ± 0.151
ValTyr: 2.471 ± 0.271
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.702 ± 0.163
TrpCys: 0.244 ± 0.099
TrpAsp: 2.166 ± 0.289
TrpGlu: 2.197 ± 0.25
TrpPhe: 0.824 ± 0.156
TrpGly: 1.129 ± 0.324
TrpHis: 0.305 ± 0.096
TrpIle: 0.275 ± 0.111
TrpLys: 0.458 ± 0.123
TrpLeu: 1.037 ± 0.193
TrpMet: 0.488 ± 0.139
TrpAsn: 0.793 ± 0.15
TrpPro: 0.305 ± 0.125
TrpGln: 0.488 ± 0.125
TrpArg: 1.098 ± 0.192
TrpSer: 1.129 ± 0.208
TrpThr: 1.159 ± 0.2
TrpVal: 1.434 ± 0.206
TrpTrp: 0.366 ± 0.1
TrpTyr: 0.275 ± 0.094
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.861 ± 0.26
TyrCys: 0.397 ± 0.12
TyrAsp: 3.692 ± 0.298
TyrGlu: 3.57 ± 0.331
TyrPhe: 0.854 ± 0.179
TyrGly: 2.654 ± 0.286
TyrHis: 0.549 ± 0.139
TyrIle: 1.434 ± 0.235
TyrLys: 1.19 ± 0.185
TyrLeu: 2.319 ± 0.278
TyrMet: 0.58 ± 0.135
TyrAsn: 1.281 ± 0.173
TyrPro: 1.19 ± 0.179
TyrGln: 1.129 ± 0.187
TyrArg: 1.678 ± 0.245
TyrSer: 1.892 ± 0.244
TyrThr: 1.587 ± 0.184
TyrVal: 2.227 ± 0.311
TyrTrp: 0.519 ± 0.127
TyrTyr: 1.556 ± 0.206
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 160 proteins (32777 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
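A protein-level bootstrap of this kind can be sketched as below. The page states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement at the original proteome size, and reporting the standard deviation of the resampled frequencies as the error, are assumptions of this sketch, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the HCTV-1 proteome.
toy = ["MADDE", "AADE", "MDDEE", "GADDA"]
error = bootstrap_error(toy, "DD")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.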

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski