Amino acid dipepetide frequency for Kallithea virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.148AlaAla: 2.148 ± 0.285
0.743AlaCys: 0.743 ± 0.128
1.989AlaAsp: 1.989 ± 0.217
1.538AlaGlu: 1.538 ± 0.228
1.724AlaPhe: 1.724 ± 0.24
1.008AlaGly: 1.008 ± 0.184
0.796AlaHis: 0.796 ± 0.162
4.588AlaIle: 4.588 ± 0.406
3.077AlaLys: 3.077 ± 0.339
3.634AlaLeu: 3.634 ± 0.349
1.14AlaMet: 1.14 ± 0.214
3.793AlaAsn: 3.793 ± 0.295
1.618AlaPro: 1.618 ± 0.215
1.565AlaGln: 1.565 ± 0.199
1.724AlaArg: 1.724 ± 0.27
3.209AlaSer: 3.209 ± 0.23
3.289AlaThr: 3.289 ± 0.345
1.989AlaVal: 1.989 ± 0.232
0.239AlaTrp: 0.239 ± 0.072
1.804AlaTyr: 1.804 ± 0.212
0.0AlaXaa: 0.0 ± 0.0
Cys
0.902CysAla: 0.902 ± 0.179
0.292CysCys: 0.292 ± 0.098
1.247CysAsp: 1.247 ± 0.224
0.849CysGlu: 0.849 ± 0.122
0.769CysPhe: 0.769 ± 0.139
0.743CysGly: 0.743 ± 0.183
0.345CysHis: 0.345 ± 0.099
1.644CysIle: 1.644 ± 0.189
1.353CysLys: 1.353 ± 0.187
1.565CysLeu: 1.565 ± 0.201
0.371CysMet: 0.371 ± 0.097
1.432CysAsn: 1.432 ± 0.196
0.584CysPro: 0.584 ± 0.13
0.769CysGln: 0.769 ± 0.124
1.247CysArg: 1.247 ± 0.2
1.618CysSer: 1.618 ± 0.253
0.849CysThr: 0.849 ± 0.142
0.981CysVal: 0.981 ± 0.192
0.08CysTrp: 0.08 ± 0.045
0.769CysTyr: 0.769 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
2.467AspAla: 2.467 ± 0.278
1.3AspCys: 1.3 ± 0.176
6.763AspAsp: 6.763 ± 0.919
4.111AspGlu: 4.111 ± 0.501
2.971AspPhe: 2.971 ± 0.395
3.501AspGly: 3.501 ± 0.366
1.114AspHis: 1.114 ± 0.251
5.835AspIle: 5.835 ± 0.424
2.732AspLys: 2.732 ± 0.254
4.403AspLeu: 4.403 ± 0.403
1.114AspMet: 1.114 ± 0.18
4.96AspAsn: 4.96 ± 0.514
2.069AspPro: 2.069 ± 0.248
1.751AspGln: 1.751 ± 0.211
2.546AspArg: 2.546 ± 0.244
4.482AspSer: 4.482 ± 0.338
3.13AspThr: 3.13 ± 0.319
3.925AspVal: 3.925 ± 0.351
0.371AspTrp: 0.371 ± 0.098
2.785AspTyr: 2.785 ± 0.276
0.0AspXaa: 0.0 ± 0.0
Glu
1.91GluAla: 1.91 ± 0.201
0.981GluCys: 0.981 ± 0.145
2.652GluAsp: 2.652 ± 0.32
2.042GluGlu: 2.042 ± 0.278
2.281GluPhe: 2.281 ± 0.226
1.326GluGly: 1.326 ± 0.177
1.087GluHis: 1.087 ± 0.15
3.713GluIle: 3.713 ± 0.296
2.467GluLys: 2.467 ± 0.296
4.032GluLeu: 4.032 ± 0.332
1.247GluMet: 1.247 ± 0.211
3.819GluAsn: 3.819 ± 0.347
2.387GluPro: 2.387 ± 0.316
2.122GluGln: 2.122 ± 0.212
2.016GluArg: 2.016 ± 0.192
4.907GluSer: 4.907 ± 0.424
4.111GluThr: 4.111 ± 0.367
1.777GluVal: 1.777 ± 0.199
0.477GluTrp: 0.477 ± 0.103
2.997GluTyr: 2.997 ± 0.22
0.0GluXaa: 0.0 ± 0.0
Phe
1.963PheAla: 1.963 ± 0.227
0.822PheCys: 0.822 ± 0.189
3.475PheAsp: 3.475 ± 0.386
2.44PheGlu: 2.44 ± 0.264
1.167PhePhe: 1.167 ± 0.201
1.857PheGly: 1.857 ± 0.22
0.928PheHis: 0.928 ± 0.166
3.315PheIle: 3.315 ± 0.282
2.971PheLys: 2.971 ± 0.341
2.732PheLeu: 2.732 ± 0.317
1.326PheMet: 1.326 ± 0.238
3.368PheAsn: 3.368 ± 0.316
1.432PhePro: 1.432 ± 0.179
1.273PheGln: 1.273 ± 0.175
1.512PheArg: 1.512 ± 0.2
2.785PheSer: 2.785 ± 0.265
2.334PheThr: 2.334 ± 0.257
2.785PheVal: 2.785 ± 0.273
0.239PheTrp: 0.239 ± 0.07
1.697PheTyr: 1.697 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
1.406GlyAla: 1.406 ± 0.182
0.743GlyCys: 0.743 ± 0.166
2.52GlyAsp: 2.52 ± 0.278
1.724GlyGlu: 1.724 ± 0.266
1.512GlyPhe: 1.512 ± 0.193
1.459GlyGly: 1.459 ± 0.177
0.69GlyHis: 0.69 ± 0.154
3.209GlyIle: 3.209 ± 0.267
2.069GlyLys: 2.069 ± 0.2
2.811GlyLeu: 2.811 ± 0.335
0.637GlyMet: 0.637 ± 0.141
2.652GlyAsn: 2.652 ± 0.273
1.167GlyPro: 1.167 ± 0.196
0.981GlyGln: 0.981 ± 0.167
1.565GlyArg: 1.565 ± 0.222
2.944GlySer: 2.944 ± 0.277
1.751GlyThr: 1.751 ± 0.246
2.016GlyVal: 2.016 ± 0.248
0.318GlyTrp: 0.318 ± 0.095
2.334GlyTyr: 2.334 ± 0.322
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.129
0.371HisCys: 0.371 ± 0.109
0.796HisAsp: 0.796 ± 0.106
0.981HisGlu: 0.981 ± 0.171
0.955HisPhe: 0.955 ± 0.147
1.14HisGly: 1.14 ± 0.172
0.875HisHis: 0.875 ± 0.181
1.618HisIle: 1.618 ± 0.203
1.406HisLys: 1.406 ± 0.199
2.148HisLeu: 2.148 ± 0.204
0.477HisMet: 0.477 ± 0.099
1.565HisAsn: 1.565 ± 0.217
0.849HisPro: 0.849 ± 0.206
1.406HisGln: 1.406 ± 0.397
1.034HisArg: 1.034 ± 0.147
1.459HisSer: 1.459 ± 0.186
1.353HisThr: 1.353 ± 0.188
1.459HisVal: 1.459 ± 0.204
0.133HisTrp: 0.133 ± 0.054
1.008HisTyr: 1.008 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.138IleAla: 4.138 ± 0.311
1.644IleCys: 1.644 ± 0.229
5.782IleAsp: 5.782 ± 0.391
4.96IleGlu: 4.96 ± 0.403
3.077IlePhe: 3.077 ± 0.312
3.024IleGly: 3.024 ± 0.32
1.697IleHis: 1.697 ± 0.222
7.771IleIle: 7.771 ± 0.531
5.543IleLys: 5.543 ± 0.416
7.347IleLeu: 7.347 ± 0.504
2.387IleMet: 2.387 ± 0.26
7.029IleAsn: 7.029 ± 0.431
4.35IlePro: 4.35 ± 0.329
2.918IleGln: 2.918 ± 0.276
3.713IleArg: 3.713 ± 0.356
6.312IleSer: 6.312 ± 0.468
5.702IleThr: 5.702 ± 0.468
4.774IleVal: 4.774 ± 0.431
0.822IleTrp: 0.822 ± 0.167
4.164IleTyr: 4.164 ± 0.346
0.0IleXaa: 0.0 ± 0.0
Lys
1.751LysAla: 1.751 ± 0.214
1.485LysCys: 1.485 ± 0.22
1.883LysAsp: 1.883 ± 0.256
2.122LysGlu: 2.122 ± 0.258
3.607LysPhe: 3.607 ± 0.377
1.14LysGly: 1.14 ± 0.209
2.095LysHis: 2.095 ± 0.17
6.339LysIle: 6.339 ± 0.552
3.687LysLys: 3.687 ± 0.384
5.411LysLeu: 5.411 ± 0.416
1.751LysMet: 1.751 ± 0.224
4.854LysAsn: 4.854 ± 0.333
2.785LysPro: 2.785 ± 0.47
2.148LysGln: 2.148 ± 0.257
3.952LysArg: 3.952 ± 0.411
5.862LysSer: 5.862 ± 0.382
5.066LysThr: 5.066 ± 0.489
2.705LysVal: 2.705 ± 0.346
0.212LysTrp: 0.212 ± 0.086
4.111LysTyr: 4.111 ± 0.356
0.0LysXaa: 0.0 ± 0.0
Leu
3.766LeuAla: 3.766 ± 0.295
1.432LeuCys: 1.432 ± 0.193
5.517LeuAsp: 5.517 ± 0.499
3.554LeuGlu: 3.554 ± 0.258
3.05LeuPhe: 3.05 ± 0.224
2.679LeuGly: 2.679 ± 0.31
2.016LeuHis: 2.016 ± 0.214
5.888LeuIle: 5.888 ± 0.441
5.225LeuLys: 5.225 ± 0.41
7.373LeuLeu: 7.373 ± 0.467
2.228LeuMet: 2.228 ± 0.293
6.896LeuAsn: 6.896 ± 0.506
4.748LeuPro: 4.748 ± 0.421
3.634LeuGln: 3.634 ± 0.359
3.368LeuArg: 3.368 ± 0.304
5.49LeuSer: 5.49 ± 0.516
4.403LeuThr: 4.403 ± 0.369
4.668LeuVal: 4.668 ± 0.384
0.477LeuTrp: 0.477 ± 0.12
3.899LeuTyr: 3.899 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
1.353MetAla: 1.353 ± 0.229
0.61MetCys: 0.61 ± 0.11
1.936MetAsp: 1.936 ± 0.218
1.167MetGlu: 1.167 ± 0.157
1.273MetPhe: 1.273 ± 0.202
0.902MetGly: 0.902 ± 0.139
0.504MetHis: 0.504 ± 0.131
1.697MetIle: 1.697 ± 0.215
1.326MetLys: 1.326 ± 0.185
2.334MetLeu: 2.334 ± 0.316
0.61MetMet: 0.61 ± 0.108
1.751MetAsn: 1.751 ± 0.2
1.91MetPro: 1.91 ± 0.255
0.902MetGln: 0.902 ± 0.16
1.087MetArg: 1.087 ± 0.173
2.387MetSer: 2.387 ± 0.256
1.591MetThr: 1.591 ± 0.201
1.326MetVal: 1.326 ± 0.212
0.053MetTrp: 0.053 ± 0.041
1.353MetTyr: 1.353 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
4.217AsnAla: 4.217 ± 0.356
1.22AsnCys: 1.22 ± 0.194
6.366AsnAsp: 6.366 ± 0.505
4.509AsnGlu: 4.509 ± 0.361
3.13AsnPhe: 3.13 ± 0.282
3.766AsnGly: 3.766 ± 0.301
1.697AsnHis: 1.697 ± 0.228
8.859AsnIle: 8.859 ± 0.543
4.642AsnLys: 4.642 ± 0.34
6.604AsnLeu: 6.604 ± 0.431
1.936AsnMet: 1.936 ± 0.233
10.264AsnAsn: 10.264 ± 0.915
2.652AsnPro: 2.652 ± 0.306
2.864AsnGln: 2.864 ± 0.26
3.342AsnArg: 3.342 ± 0.235
6.206AsnSer: 6.206 ± 0.439
4.907AsnThr: 4.907 ± 0.414
5.835AsnVal: 5.835 ± 0.384
0.371AsnTrp: 0.371 ± 0.103
3.103AsnTyr: 3.103 ± 0.276
0.0AsnXaa: 0.0 ± 0.0
Pro
1.565ProAla: 1.565 ± 0.238
0.424ProCys: 0.424 ± 0.116
1.989ProAsp: 1.989 ± 0.211
2.599ProGlu: 2.599 ± 0.279
1.406ProPhe: 1.406 ± 0.209
0.902ProGly: 0.902 ± 0.162
0.902ProHis: 0.902 ± 0.143
4.244ProIle: 4.244 ± 0.399
3.634ProLys: 3.634 ± 0.446
3.103ProLeu: 3.103 ± 0.252
0.875ProMet: 0.875 ± 0.128
3.581ProAsn: 3.581 ± 0.284
2.44ProPro: 2.44 ± 0.373
2.546ProGln: 2.546 ± 0.373
1.618ProArg: 1.618 ± 0.202
4.403ProSer: 4.403 ± 0.434
4.535ProThr: 4.535 ± 0.485
1.697ProVal: 1.697 ± 0.206
0.159ProTrp: 0.159 ± 0.061
1.485ProTyr: 1.485 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
1.167GlnAla: 1.167 ± 0.196
0.902GlnCys: 0.902 ± 0.196
1.591GlnAsp: 1.591 ± 0.189
1.432GlnGlu: 1.432 ± 0.193
1.751GlnPhe: 1.751 ± 0.219
0.69GlnGly: 0.69 ± 0.13
1.485GlnHis: 1.485 ± 0.293
2.732GlnIle: 2.732 ± 0.283
2.148GlnLys: 2.148 ± 0.246
3.448GlnLeu: 3.448 ± 0.255
1.432GlnMet: 1.432 ± 0.177
3.368GlnAsn: 3.368 ± 0.285
2.228GlnPro: 2.228 ± 0.28
7.029GlnGln: 7.029 ± 1.643
1.936GlnArg: 1.936 ± 0.197
3.501GlnSer: 3.501 ± 0.36
2.944GlnThr: 2.944 ± 0.239
1.247GlnVal: 1.247 ± 0.18
0.186GlnTrp: 0.186 ± 0.068
2.175GlnTyr: 2.175 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
1.459ArgAla: 1.459 ± 0.229
0.769ArgCys: 0.769 ± 0.131
1.989ArgAsp: 1.989 ± 0.251
1.804ArgGlu: 1.804 ± 0.202
1.857ArgPhe: 1.857 ± 0.218
1.247ArgGly: 1.247 ± 0.173
0.981ArgHis: 0.981 ± 0.13
3.766ArgIle: 3.766 ± 0.327
2.891ArgLys: 2.891 ± 0.331
3.687ArgLeu: 3.687 ± 0.328
1.326ArgMet: 1.326 ± 0.173
3.687ArgAsn: 3.687 ± 0.253
2.281ArgPro: 2.281 ± 0.269
2.573ArgGln: 2.573 ± 0.269
2.573ArgArg: 2.573 ± 0.357
3.952ArgSer: 3.952 ± 0.55
2.361ArgThr: 2.361 ± 0.252
2.148ArgVal: 2.148 ± 0.274
0.292ArgTrp: 0.292 ± 0.102
2.175ArgTyr: 2.175 ± 0.296
0.0ArgXaa: 0.0 ± 0.0
Ser
2.971SerAla: 2.971 ± 0.314
1.751SerCys: 1.751 ± 0.271
4.748SerAsp: 4.748 ± 0.367
4.005SerGlu: 4.005 ± 0.384
3.024SerPhe: 3.024 ± 0.378
2.679SerGly: 2.679 ± 0.289
1.326SerHis: 1.326 ± 0.205
6.472SerIle: 6.472 ± 0.387
6.18SerLys: 6.18 ± 0.409
6.1SerLeu: 6.1 ± 0.338
2.334SerMet: 2.334 ± 0.27
8.169SerAsn: 8.169 ± 0.643
2.997SerPro: 2.997 ± 0.293
3.236SerGln: 3.236 ± 0.358
3.74SerArg: 3.74 ± 0.426
9.018SerSer: 9.018 ± 1.157
6.286SerThr: 6.286 ± 0.532
4.085SerVal: 4.085 ± 0.361
0.398SerTrp: 0.398 ± 0.101
3.209SerTyr: 3.209 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
2.679ThrAla: 2.679 ± 0.283
1.061ThrCys: 1.061 ± 0.157
2.997ThrAsp: 2.997 ± 0.269
2.864ThrGlu: 2.864 ± 0.258
2.811ThrPhe: 2.811 ± 0.269
2.361ThrGly: 2.361 ± 0.239
1.406ThrHis: 1.406 ± 0.182
7.108ThrIle: 7.108 ± 0.507
4.482ThrLys: 4.482 ± 0.478
5.305ThrLeu: 5.305 ± 0.4
2.308ThrMet: 2.308 ± 0.203
6.047ThrAsn: 6.047 ± 0.518
3.713ThrPro: 3.713 ± 0.445
1.936ThrGln: 1.936 ± 0.231
2.281ThrArg: 2.281 ± 0.247
5.888ThrSer: 5.888 ± 0.595
8.275ThrThr: 8.275 ± 1.24
3.793ThrVal: 3.793 ± 0.316
0.239ThrTrp: 0.239 ± 0.107
2.254ThrTyr: 2.254 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
3.183ValAla: 3.183 ± 0.331
0.981ValCys: 0.981 ± 0.194
4.138ValAsp: 4.138 ± 0.298
3.103ValGlu: 3.103 ± 0.442
1.751ValPhe: 1.751 ± 0.205
2.254ValGly: 2.254 ± 0.275
0.928ValHis: 0.928 ± 0.154
4.191ValIle: 4.191 ± 0.307
3.236ValLys: 3.236 ± 0.318
4.376ValLeu: 4.376 ± 0.349
1.3ValMet: 1.3 ± 0.197
3.899ValAsn: 3.899 ± 0.388
1.83ValPro: 1.83 ± 0.186
1.591ValGln: 1.591 ± 0.209
2.175ValArg: 2.175 ± 0.22
4.403ValSer: 4.403 ± 0.333
3.315ValThr: 3.315 ± 0.295
3.819ValVal: 3.819 ± 0.367
0.239ValTrp: 0.239 ± 0.067
2.679ValTyr: 2.679 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.212TrpAla: 0.212 ± 0.074
0.133TrpCys: 0.133 ± 0.06
0.451TrpAsp: 0.451 ± 0.11
0.186TrpGlu: 0.186 ± 0.065
0.504TrpPhe: 0.504 ± 0.129
0.159TrpGly: 0.159 ± 0.067
0.08TrpHis: 0.08 ± 0.041
0.345TrpIle: 0.345 ± 0.098
0.292TrpLys: 0.292 ± 0.093
0.451TrpLeu: 0.451 ± 0.123
0.053TrpMet: 0.053 ± 0.033
0.451TrpAsn: 0.451 ± 0.089
0.424TrpPro: 0.424 ± 0.123
0.239TrpGln: 0.239 ± 0.081
0.292TrpArg: 0.292 ± 0.105
0.451TrpSer: 0.451 ± 0.112
0.265TrpThr: 0.265 ± 0.086
0.212TrpVal: 0.212 ± 0.077
0.106TrpTrp: 0.106 ± 0.053
0.345TrpTyr: 0.345 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.565TyrAla: 1.565 ± 0.21
0.743TyrCys: 0.743 ± 0.14
3.501TyrAsp: 3.501 ± 0.444
2.361TyrGlu: 2.361 ± 0.223
1.804TyrPhe: 1.804 ± 0.301
1.671TyrGly: 1.671 ± 0.231
0.875TyrHis: 0.875 ± 0.129
3.846TyrIle: 3.846 ± 0.324
3.607TyrLys: 3.607 ± 0.29
3.448TyrLeu: 3.448 ± 0.331
1.194TyrMet: 1.194 ± 0.203
4.907TyrAsn: 4.907 ± 0.417
1.644TyrPro: 1.644 ± 0.222
1.91TyrGln: 1.91 ± 0.293
2.095TyrArg: 2.095 ± 0.225
3.448TyrSer: 3.448 ± 0.344
3.342TyrThr: 3.342 ± 0.329
2.148TyrVal: 2.148 ± 0.269
0.239TyrTrp: 0.239 ± 0.078
2.308TyrTyr: 2.308 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (37704 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski