Amino acid dipepetide frequency for Tomelloso virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.345AlaAla: 3.345 ± 0.342
1.147AlaCys: 1.147 ± 0.188
3.09AlaAsp: 3.09 ± 0.295
2.612AlaGlu: 2.612 ± 0.277
2.453AlaPhe: 2.453 ± 0.267
1.561AlaGly: 1.561 ± 0.241
0.924AlaHis: 0.924 ± 0.2
4.778AlaIle: 4.778 ± 0.308
4.428AlaLys: 4.428 ± 0.436
4.842AlaLeu: 4.842 ± 0.345
1.179AlaMet: 1.179 ± 0.188
3.472AlaAsn: 3.472 ± 0.429
2.453AlaPro: 2.453 ± 0.29
2.007AlaGln: 2.007 ± 0.284
1.943AlaArg: 1.943 ± 0.242
4.587AlaSer: 4.587 ± 0.51
3.09AlaThr: 3.09 ± 0.376
3.186AlaVal: 3.186 ± 0.319
0.446AlaTrp: 0.446 ± 0.116
1.72AlaTyr: 1.72 ± 0.205
0.0AlaXaa: 0.0 ± 0.0
Cys
1.242CysAla: 1.242 ± 0.225
0.446CysCys: 0.446 ± 0.156
1.561CysAsp: 1.561 ± 0.195
1.179CysGlu: 1.179 ± 0.198
0.765CysPhe: 0.765 ± 0.194
1.274CysGly: 1.274 ± 0.204
0.51CysHis: 0.51 ± 0.139
1.656CysIle: 1.656 ± 0.216
1.402CysLys: 1.402 ± 0.208
1.625CysLeu: 1.625 ± 0.28
0.382CysMet: 0.382 ± 0.093
1.402CysAsn: 1.402 ± 0.206
1.019CysPro: 1.019 ± 0.218
0.796CysGln: 0.796 ± 0.165
1.147CysArg: 1.147 ± 0.226
1.688CysSer: 1.688 ± 0.272
1.433CysThr: 1.433 ± 0.191
1.115CysVal: 1.115 ± 0.206
0.127CysTrp: 0.127 ± 0.06
0.796CysTyr: 0.796 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
2.867AspAla: 2.867 ± 0.251
1.465AspCys: 1.465 ± 0.194
6.562AspAsp: 6.562 ± 0.998
4.396AspGlu: 4.396 ± 0.487
3.472AspPhe: 3.472 ± 0.31
3.472AspGly: 3.472 ± 0.326
0.956AspHis: 0.956 ± 0.197
4.842AspIle: 4.842 ± 0.369
2.325AspLys: 2.325 ± 0.297
6.371AspLeu: 6.371 ± 0.523
1.019AspMet: 1.019 ± 0.167
3.122AspAsn: 3.122 ± 0.323
2.325AspPro: 2.325 ± 0.242
1.911AspGln: 1.911 ± 0.339
2.676AspArg: 2.676 ± 0.33
4.969AspSer: 4.969 ± 0.457
3.409AspThr: 3.409 ± 0.386
3.663AspVal: 3.663 ± 0.326
0.605AspTrp: 0.605 ± 0.143
2.994AspTyr: 2.994 ± 0.326
0.0AspXaa: 0.0 ± 0.0
Glu
3.186GluAla: 3.186 ± 0.472
1.051GluCys: 1.051 ± 0.218
2.421GluAsp: 2.421 ± 0.255
3.472GluGlu: 3.472 ± 0.363
2.74GluPhe: 2.74 ± 0.306
1.529GluGly: 1.529 ± 0.225
1.147GluHis: 1.147 ± 0.208
4.173GluIle: 4.173 ± 0.433
3.823GluLys: 3.823 ± 0.314
6.307GluLeu: 6.307 ± 0.332
1.943GluMet: 1.943 ± 0.281
4.3GluAsn: 4.3 ± 0.332
2.102GluPro: 2.102 ± 0.305
1.752GluGln: 1.752 ± 0.229
2.262GluArg: 2.262 ± 0.322
4.683GluSer: 4.683 ± 0.456
3.058GluThr: 3.058 ± 0.299
1.816GluVal: 1.816 ± 0.233
0.446GluTrp: 0.446 ± 0.107
3.409GluTyr: 3.409 ± 0.29
0.0GluXaa: 0.0 ± 0.0
Phe
2.899PheAla: 2.899 ± 0.351
0.86PheCys: 0.86 ± 0.179
3.631PheAsp: 3.631 ± 0.335
2.453PheGlu: 2.453 ± 0.3
1.274PhePhe: 1.274 ± 0.204
2.389PheGly: 2.389 ± 0.327
0.605PheHis: 0.605 ± 0.121
3.6PheIle: 3.6 ± 0.343
3.568PheLys: 3.568 ± 0.386
3.44PheLeu: 3.44 ± 0.356
1.497PheMet: 1.497 ± 0.208
2.58PheAsn: 2.58 ± 0.264
1.433PhePro: 1.433 ± 0.207
1.593PheGln: 1.593 ± 0.273
1.338PheArg: 1.338 ± 0.231
3.249PheSer: 3.249 ± 0.359
2.771PheThr: 2.771 ± 0.306
2.517PheVal: 2.517 ± 0.302
0.255PheTrp: 0.255 ± 0.078
2.102PheTyr: 2.102 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
2.071GlyAla: 2.071 ± 0.232
0.956GlyCys: 0.956 ± 0.146
2.071GlyAsp: 2.071 ± 0.286
2.389GlyGlu: 2.389 ± 0.264
1.752GlyPhe: 1.752 ± 0.21
1.625GlyGly: 1.625 ± 0.196
0.669GlyHis: 0.669 ± 0.142
3.727GlyIle: 3.727 ± 0.322
2.58GlyLys: 2.58 ± 0.281
3.154GlyLeu: 3.154 ± 0.402
0.796GlyMet: 0.796 ± 0.154
2.485GlyAsn: 2.485 ± 0.261
0.701GlyPro: 0.701 ± 0.16
1.497GlyGln: 1.497 ± 0.232
1.274GlyArg: 1.274 ± 0.191
2.421GlySer: 2.421 ± 0.333
1.879GlyThr: 1.879 ± 0.273
2.421GlyVal: 2.421 ± 0.256
0.191GlyTrp: 0.191 ± 0.098
2.262GlyTyr: 2.262 ± 0.327
0.0GlyXaa: 0.0 ± 0.0
His
0.828HisAla: 0.828 ± 0.138
0.542HisCys: 0.542 ± 0.116
1.115HisAsp: 1.115 ± 0.198
0.924HisGlu: 0.924 ± 0.178
1.115HisPhe: 1.115 ± 0.174
0.86HisGly: 0.86 ± 0.148
0.382HisHis: 0.382 ± 0.131
1.21HisIle: 1.21 ± 0.205
1.147HisLys: 1.147 ± 0.182
2.039HisLeu: 2.039 ± 0.25
0.319HisMet: 0.319 ± 0.095
1.37HisAsn: 1.37 ± 0.184
0.701HisPro: 0.701 ± 0.146
0.35HisGln: 0.35 ± 0.1
0.669HisArg: 0.669 ± 0.138
1.402HisSer: 1.402 ± 0.215
1.242HisThr: 1.242 ± 0.19
1.593HisVal: 1.593 ± 0.226
0.096HisTrp: 0.096 ± 0.052
1.083HisTyr: 1.083 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
4.683IleAla: 4.683 ± 0.362
1.784IleCys: 1.784 ± 0.293
6.084IleAsp: 6.084 ± 0.404
5.543IleGlu: 5.543 ± 0.509
3.058IlePhe: 3.058 ± 0.316
2.994IleGly: 2.994 ± 0.264
1.625IleHis: 1.625 ± 0.254
6.021IleIle: 6.021 ± 0.392
5.288IleLys: 5.288 ± 0.606
7.454IleLeu: 7.454 ± 0.524
1.879IleMet: 1.879 ± 0.234
4.778IleAsn: 4.778 ± 0.515
3.249IlePro: 3.249 ± 0.262
2.294IleGln: 2.294 ± 0.261
3.217IleArg: 3.217 ± 0.326
5.702IleSer: 5.702 ± 0.393
4.46IleThr: 4.46 ± 0.387
5.925IleVal: 5.925 ± 0.463
0.765IleTrp: 0.765 ± 0.154
3.6IleTyr: 3.6 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
2.644LysAla: 2.644 ± 0.302
1.593LysCys: 1.593 ± 0.19
2.007LysAsp: 2.007 ± 0.258
2.453LysGlu: 2.453 ± 0.226
2.867LysPhe: 2.867 ± 0.293
1.752LysGly: 1.752 ± 0.269
1.848LysHis: 1.848 ± 0.275
6.307LysIle: 6.307 ± 0.48
4.141LysLys: 4.141 ± 0.353
6.307LysLeu: 6.307 ± 0.484
2.325LysMet: 2.325 ± 0.282
5.224LysAsn: 5.224 ± 0.33
2.644LysPro: 2.644 ± 0.272
1.975LysGln: 1.975 ± 0.262
3.377LysArg: 3.377 ± 0.399
5.192LysSer: 5.192 ± 0.408
5.352LysThr: 5.352 ± 0.397
3.44LysVal: 3.44 ± 0.313
0.414LysTrp: 0.414 ± 0.127
4.332LysTyr: 4.332 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
4.619LeuAla: 4.619 ± 0.428
1.816LeuCys: 1.816 ± 0.224
6.371LeuAsp: 6.371 ± 0.491
4.396LeuGlu: 4.396 ± 0.384
3.95LeuPhe: 3.95 ± 0.359
2.931LeuGly: 2.931 ± 0.278
1.911LeuHis: 1.911 ± 0.209
6.339LeuIle: 6.339 ± 0.465
6.881LeuLys: 6.881 ± 0.411
8.251LeuLeu: 8.251 ± 0.433
2.835LeuMet: 2.835 ± 0.302
6.658LeuAsn: 6.658 ± 0.496
4.683LeuPro: 4.683 ± 0.404
3.631LeuGln: 3.631 ± 0.363
3.854LeuArg: 3.854 ± 0.358
6.435LeuSer: 6.435 ± 0.476
4.683LeuThr: 4.683 ± 0.35
4.651LeuVal: 4.651 ± 0.476
0.796LeuTrp: 0.796 ± 0.147
4.332LeuTyr: 4.332 ± 0.432
0.0LeuXaa: 0.0 ± 0.0
Met
1.943MetAla: 1.943 ± 0.278
0.542MetCys: 0.542 ± 0.131
1.688MetAsp: 1.688 ± 0.244
1.402MetGlu: 1.402 ± 0.212
1.274MetPhe: 1.274 ± 0.248
0.573MetGly: 0.573 ± 0.127
0.701MetHis: 0.701 ± 0.133
1.625MetIle: 1.625 ± 0.238
1.625MetLys: 1.625 ± 0.198
2.198MetLeu: 2.198 ± 0.226
0.605MetMet: 0.605 ± 0.137
1.051MetAsn: 1.051 ± 0.185
1.338MetPro: 1.338 ± 0.222
0.86MetGln: 0.86 ± 0.13
1.274MetArg: 1.274 ± 0.149
2.708MetSer: 2.708 ± 0.244
1.529MetThr: 1.529 ± 0.21
1.529MetVal: 1.529 ± 0.236
0.191MetTrp: 0.191 ± 0.083
1.656MetTyr: 1.656 ± 0.213
0.0MetXaa: 0.0 ± 0.0
Asn
4.109AsnAla: 4.109 ± 0.335
1.37AsnCys: 1.37 ± 0.208
5.001AsnAsp: 5.001 ± 0.393
4.109AsnGlu: 4.109 ± 0.326
2.963AsnPhe: 2.963 ± 0.339
2.644AsnGly: 2.644 ± 0.275
1.147AsnHis: 1.147 ± 0.238
6.084AsnIle: 6.084 ± 0.449
3.44AsnLys: 3.44 ± 0.321
5.224AsnLeu: 5.224 ± 0.385
1.848AsnMet: 1.848 ± 0.243
4.587AsnAsn: 4.587 ± 0.533
2.007AsnPro: 2.007 ± 0.222
2.58AsnGln: 2.58 ± 0.297
2.994AsnArg: 2.994 ± 0.289
5.129AsnSer: 5.129 ± 0.385
3.695AsnThr: 3.695 ± 0.364
4.651AsnVal: 4.651 ± 0.39
0.223AsnTrp: 0.223 ± 0.075
3.186AsnTyr: 3.186 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
1.561ProAla: 1.561 ± 0.254
0.669ProCys: 0.669 ± 0.158
1.656ProAsp: 1.656 ± 0.272
2.294ProGlu: 2.294 ± 0.253
1.561ProPhe: 1.561 ± 0.209
1.179ProGly: 1.179 ± 0.174
0.892ProHis: 0.892 ± 0.191
3.472ProIle: 3.472 ± 0.324
3.568ProLys: 3.568 ± 0.309
3.377ProLeu: 3.377 ± 0.302
1.083ProMet: 1.083 ± 0.203
3.568ProAsn: 3.568 ± 0.398
2.294ProPro: 2.294 ± 0.46
1.975ProGln: 1.975 ± 0.231
1.306ProArg: 1.306 ± 0.184
3.536ProSer: 3.536 ± 0.35
4.205ProThr: 4.205 ± 0.386
2.294ProVal: 2.294 ± 0.254
0.191ProTrp: 0.191 ± 0.076
1.688ProTyr: 1.688 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
1.816GlnAla: 1.816 ± 0.254
0.86GlnCys: 0.86 ± 0.167
1.338GlnAsp: 1.338 ± 0.204
1.147GlnGlu: 1.147 ± 0.175
1.911GlnPhe: 1.911 ± 0.241
0.446GlnGly: 0.446 ± 0.139
1.019GlnHis: 1.019 ± 0.227
2.198GlnIle: 2.198 ± 0.249
2.357GlnLys: 2.357 ± 0.238
3.759GlnLeu: 3.759 ± 0.381
1.21GlnMet: 1.21 ± 0.186
2.708GlnAsn: 2.708 ± 0.285
1.848GlnPro: 1.848 ± 0.252
1.943GlnGln: 1.943 ± 0.386
1.402GlnArg: 1.402 ± 0.191
2.612GlnSer: 2.612 ± 0.329
2.23GlnThr: 2.23 ± 0.358
1.593GlnVal: 1.593 ± 0.227
0.478GlnTrp: 0.478 ± 0.132
2.166GlnTyr: 2.166 ± 0.222
0.0GlnXaa: 0.0 ± 0.0
Arg
2.134ArgAla: 2.134 ± 0.232
0.828ArgCys: 0.828 ± 0.166
1.975ArgAsp: 1.975 ± 0.244
1.752ArgGlu: 1.752 ± 0.239
1.816ArgPhe: 1.816 ± 0.204
1.019ArgGly: 1.019 ± 0.173
0.796ArgHis: 0.796 ± 0.136
3.154ArgIle: 3.154 ± 0.351
3.026ArgLys: 3.026 ± 0.251
4.428ArgLeu: 4.428 ± 0.422
0.86ArgMet: 0.86 ± 0.157
2.548ArgAsn: 2.548 ± 0.36
1.911ArgPro: 1.911 ± 0.175
1.975ArgGln: 1.975 ± 0.278
2.931ArgArg: 2.931 ± 0.451
3.95ArgSer: 3.95 ± 0.539
1.943ArgThr: 1.943 ± 0.274
2.548ArgVal: 2.548 ± 0.232
0.255ArgTrp: 0.255 ± 0.104
1.688ArgTyr: 1.688 ± 0.26
0.0ArgXaa: 0.0 ± 0.0
Ser
4.205SerAla: 4.205 ± 0.398
2.071SerCys: 2.071 ± 0.27
5.447SerAsp: 5.447 ± 0.46
4.842SerGlu: 4.842 ± 0.562
2.963SerPhe: 2.963 ± 0.305
2.867SerGly: 2.867 ± 0.286
1.274SerHis: 1.274 ± 0.183
6.817SerIle: 6.817 ± 0.481
5.161SerLys: 5.161 ± 0.458
6.658SerLeu: 6.658 ± 0.57
2.198SerMet: 2.198 ± 0.285
5.352SerAsn: 5.352 ± 0.476
2.835SerPro: 2.835 ± 0.364
2.134SerGln: 2.134 ± 0.279
3.377SerArg: 3.377 ± 0.436
8.792SerSer: 8.792 ± 2.46
5.989SerThr: 5.989 ± 0.441
4.109SerVal: 4.109 ± 0.321
0.446SerTrp: 0.446 ± 0.122
3.6SerTyr: 3.6 ± 0.334
0.0SerXaa: 0.0 ± 0.0
Thr
3.249ThrAla: 3.249 ± 0.403
1.338ThrCys: 1.338 ± 0.216
3.44ThrAsp: 3.44 ± 0.378
3.6ThrGlu: 3.6 ± 0.455
3.122ThrPhe: 3.122 ± 0.274
2.644ThrGly: 2.644 ± 0.303
0.892ThrHis: 0.892 ± 0.184
5.734ThrIle: 5.734 ± 0.435
3.727ThrLys: 3.727 ± 0.387
5.925ThrLeu: 5.925 ± 0.459
1.593ThrMet: 1.593 ± 0.193
3.791ThrAsn: 3.791 ± 0.347
3.472ThrPro: 3.472 ± 0.328
2.007ThrGln: 2.007 ± 0.255
2.23ThrArg: 2.23 ± 0.289
4.778ThrSer: 4.778 ± 0.431
5.702ThrThr: 5.702 ± 0.655
3.472ThrVal: 3.472 ± 0.275
0.382ThrTrp: 0.382 ± 0.099
2.74ThrTyr: 2.74 ± 0.283
0.0ThrXaa: 0.0 ± 0.0
Val
3.281ValAla: 3.281 ± 0.318
1.147ValCys: 1.147 ± 0.2
4.396ValAsp: 4.396 ± 0.417
3.6ValGlu: 3.6 ± 0.362
2.325ValPhe: 2.325 ± 0.34
2.612ValGly: 2.612 ± 0.32
0.988ValHis: 0.988 ± 0.173
4.46ValIle: 4.46 ± 0.32
3.886ValLys: 3.886 ± 0.397
4.428ValLeu: 4.428 ± 0.313
1.402ValMet: 1.402 ± 0.219
3.409ValAsn: 3.409 ± 0.359
2.74ValPro: 2.74 ± 0.305
1.816ValGln: 1.816 ± 0.221
1.975ValArg: 1.975 ± 0.252
4.523ValSer: 4.523 ± 0.372
3.217ValThr: 3.217 ± 0.386
3.6ValVal: 3.6 ± 0.36
0.414ValTrp: 0.414 ± 0.121
2.58ValTyr: 2.58 ± 0.348
0.0ValXaa: 0.0 ± 0.0
Trp
0.382TrpAla: 0.382 ± 0.105
0.191TrpCys: 0.191 ± 0.084
0.223TrpAsp: 0.223 ± 0.085
0.542TrpGlu: 0.542 ± 0.124
0.542TrpPhe: 0.542 ± 0.131
0.191TrpGly: 0.191 ± 0.083
0.191TrpHis: 0.191 ± 0.077
0.319TrpIle: 0.319 ± 0.101
0.319TrpLys: 0.319 ± 0.102
0.573TrpLeu: 0.573 ± 0.139
0.127TrpMet: 0.127 ± 0.066
0.542TrpAsn: 0.542 ± 0.118
0.478TrpPro: 0.478 ± 0.144
0.446TrpGln: 0.446 ± 0.125
0.414TrpArg: 0.414 ± 0.116
0.733TrpSer: 0.733 ± 0.176
0.319TrpThr: 0.319 ± 0.089
0.223TrpVal: 0.223 ± 0.081
0.064TrpTrp: 0.064 ± 0.045
0.319TrpTyr: 0.319 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.198TyrAla: 2.198 ± 0.285
0.924TyrCys: 0.924 ± 0.153
3.345TyrAsp: 3.345 ± 0.31
2.517TyrGlu: 2.517 ± 0.277
2.134TyrPhe: 2.134 ± 0.263
2.357TyrGly: 2.357 ± 0.256
0.51TyrHis: 0.51 ± 0.124
3.886TyrIle: 3.886 ± 0.372
3.504TyrLys: 3.504 ± 0.392
3.663TyrLeu: 3.663 ± 0.387
1.179TyrMet: 1.179 ± 0.174
4.046TyrAsn: 4.046 ± 0.328
2.134TyrPro: 2.134 ± 0.247
1.497TyrGln: 1.497 ± 0.226
1.879TyrArg: 1.879 ± 0.22
4.046TyrSer: 4.046 ± 0.365
3.631TyrThr: 3.631 ± 0.351
2.453TyrVal: 2.453 ± 0.263
0.35TyrTrp: 0.35 ± 0.123
2.134TyrTyr: 2.134 ± 0.301
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (31393 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski