Amino acid dipeptide frequency for Squirrelpox virus

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each one would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
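The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, the input sequences are toy strings rather than Squirrelpox virus proteins, and normalizing by the total number of overlapping within-protein pairs is an assumption.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        freqs[a + b] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"    # would be marked red in the table
    if freq_per_mille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


# Toy input (assumed, for illustration only).
toy = ["AAC", "ACB"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
print(freqs["WW"], classify(freqs["WW"]))
```

The one-letter codes are used here only because protein sequences are normally stored that way; the table itself uses three-letter codes.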

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
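As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a fold difference relative to the uniform 0.25% baseline. The helper names are hypothetical; the two input values are the AlaAla and TrpTrp entries from the table.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille, i.e. 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


for name, value in [("AlaAla", 13.186), ("TrpTrp", 0.021)]:
    print(name, per_mille_to_fraction(value), fold_vs_uniform(value))
```

A ratio above 1 corresponds to an overrepresented dipeptide and a ratio below 1 to an underrepresented one.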

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second residue. Each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 13.186 ± 1.333 | 1.78 ± 0.237 | 5.489 ± 0.406 | 7.226 ± 1.459 | 2.787 ± 0.256 | 5.467 ± 0.473 | 1.479 ± 0.164 | 3.345 ± 0.275 | 3.431 ± 0.813 | 8.684 ± 0.635 | 2.23 ± 0.243 | 2.723 ± 0.238 | 4.61 ± 0.558 | 2.337 ± 0.366 | 8.555 ± 0.565 | 6.089 ± 0.375 | 4.781 ± 0.425 | 7.633 ± 0.554 | 0.536 ± 0.147 | 2.358 ± 0.231 | 0.0 ± 0.0
Cys | 1.801 ± 0.212 | 0.6 ± 0.127 | 1.115 ± 0.166 | 1.651 ± 0.205 | 0.622 ± 0.117 | 1.437 ± 0.173 | 0.472 ± 0.11 | 0.622 ± 0.116 | 0.472 ± 0.099 | 2.08 ± 0.23 | 0.429 ± 0.101 | 0.665 ± 0.123 | 0.557 ± 0.121 | 0.6 ± 0.121 | 1.651 ± 0.204 | 1.244 ± 0.166 | 0.943 ± 0.166 | 2.401 ± 0.268 | 0.214 ± 0.06 | 0.772 ± 0.129 | 0.0 ± 0.0
Asp | 5.939 ± 0.428 | 0.879 ± 0.124 | 4.503 ± 0.353 | 5.274 ± 0.386 | 2.53 ± 0.303 | 5.124 ± 0.468 | 1.008 ± 0.166 | 2.937 ± 0.298 | 2.294 ± 0.25 | 5.532 ± 0.505 | 1.78 ± 0.173 | 1.587 ± 0.187 | 3.195 ± 0.263 | 1.222 ± 0.129 | 3.859 ± 0.373 | 3.195 ± 0.275 | 3.023 ± 0.276 | 5.403 ± 0.346 | 0.45 ± 0.098 | 1.672 ± 0.213 | 0.0 ± 0.0
Glu | 6.668 ± 1.042 | 1.201 ± 0.208 | 5.081 ± 0.511 | 6.647 ± 1.308 | 2.809 ± 0.249 | 3.302 ± 0.298 | 1.437 ± 0.192 | 2.916 ± 0.246 | 4.245 ± 1.382 | 5.896 ± 0.802 | 1.501 ± 0.174 | 1.822 ± 0.203 | 3.259 ± 0.67 | 1.801 ± 0.442 | 5.617 ± 0.566 | 4.524 ± 0.382 | 4.138 ± 0.434 | 3.945 ± 0.287 | 0.536 ± 0.118 | 2.101 ± 0.186 | 0.0 ± 0.0
Phe | 2.937 ± 0.286 | 0.922 ± 0.144 | 2.551 ± 0.276 | 2.358 ± 0.266 | 2.101 ± 0.217 | 2.251 ± 0.271 | 0.858 ± 0.159 | 1.522 ± 0.216 | 1.158 ± 0.165 | 3.752 ± 0.345 | 1.329 ± 0.163 | 1.501 ± 0.208 | 1.458 ± 0.189 | 0.836 ± 0.132 | 2.852 ± 0.295 | 3.366 ± 0.323 | 1.973 ± 0.23 | 4.503 ± 0.398 | 0.407 ± 0.1 | 1.394 ± 0.207 | 0.0 ± 0.0
Gly | 5.103 ± 0.362 | 1.051 ± 0.145 | 4.374 ± 0.429 | 4.009 ± 0.318 | 2.23 ± 0.273 | 4.974 ± 0.885 | 1.072 ± 0.175 | 2.251 ± 0.276 | 2.466 ± 0.32 | 4.696 ± 0.37 | 1.158 ± 0.16 | 1.93 ± 0.246 | 2.594 ± 0.431 | 1.63 ± 0.403 | 4.953 ± 0.461 | 4.717 ± 0.678 | 3.13 ± 0.375 | 4.331 ± 0.321 | 0.343 ± 0.084 | 1.994 ± 0.206 | 0.0 ± 0.0
His | 1.801 ± 0.193 | 0.579 ± 0.142 | 1.051 ± 0.154 | 1.029 ± 0.144 | 0.943 ± 0.154 | 1.286 ± 0.163 | 0.708 ± 0.124 | 1.158 ± 0.176 | 0.729 ± 0.146 | 2.23 ± 0.196 | 0.708 ± 0.139 | 0.729 ± 0.123 | 1.008 ± 0.166 | 0.515 ± 0.125 | 1.437 ± 0.193 | 1.286 ± 0.174 | 1.115 ± 0.177 | 2.123 ± 0.241 | 0.257 ± 0.071 | 0.557 ± 0.109 | 0.0 ± 0.0
Ile | 3.173 ± 0.267 | 0.836 ± 0.167 | 2.337 ± 0.226 | 2.208 ± 0.25 | 1.93 ± 0.219 | 1.78 ± 0.219 | 1.072 ± 0.139 | 2.23 ± 0.331 | 1.694 ± 0.211 | 3.988 ± 0.392 | 1.201 ± 0.179 | 2.166 ± 0.296 | 1.458 ± 0.214 | 1.115 ± 0.143 | 2.723 ± 0.252 | 3.409 ± 0.357 | 2.123 ± 0.254 | 4.16 ± 0.342 | 0.407 ± 0.095 | 1.479 ± 0.182 | 0.0 ± 0.0
Lys | 3.109 ± 1.035 | 0.729 ± 0.134 | 2.101 ± 0.212 | 2.294 ± 0.65 | 1.329 ± 0.264 | 1.694 ± 0.325 | 1.008 ± 0.156 | 2.187 ± 0.253 | 2.83 ± 0.499 | 3.774 ± 0.286 | 1.415 ± 0.165 | 1.822 ± 0.191 | 2.316 ± 0.529 | 0.879 ± 0.238 | 3.045 ± 0.315 | 2.916 ± 0.347 | 2.251 ± 0.552 | 2.594 ± 0.335 | 0.3 ± 0.092 | 1.544 ± 0.201 | 0.0 ± 0.0
Leu | 8.512 ± 0.523 | 2.251 ± 0.195 | 5.682 ± 0.357 | 7.44 ± 1.23 | 3.859 ± 0.361 | 4.781 ± 0.336 | 1.737 ± 0.199 | 3.688 ± 0.317 | 3.473 ± 0.333 | 8.705 ± 0.651 | 2.273 ± 0.259 | 2.852 ± 0.308 | 4.052 ± 0.384 | 1.822 ± 0.253 | 8.383 ± 0.493 | 6.754 ± 0.369 | 5.403 ± 0.325 | 7.397 ± 0.518 | 0.45 ± 0.104 | 2.787 ± 0.292 | 0.0 ± 0.0
Met | 2.637 ± 0.272 | 0.407 ± 0.084 | 1.887 ± 0.18 | 1.801 ± 0.178 | 1.029 ± 0.14 | 1.351 ± 0.159 | 0.472 ± 0.102 | 1.244 ± 0.211 | 0.622 ± 0.118 | 2.466 ± 0.248 | 0.643 ± 0.11 | 1.093 ± 0.18 | 1.008 ± 0.138 | 0.75 ± 0.104 | 2.466 ± 0.278 | 1.865 ± 0.172 | 1.179 ± 0.162 | 1.758 ± 0.175 | 0.193 ± 0.066 | 0.686 ± 0.12 | 0.0 ± 0.0
Asn | 2.787 ± 0.228 | 0.686 ± 0.133 | 1.844 ± 0.236 | 1.865 ± 0.206 | 1.286 ± 0.177 | 2.058 ± 0.276 | 0.75 ± 0.126 | 2.166 ± 0.329 | 1.651 ± 0.213 | 2.937 ± 0.277 | 0.922 ± 0.114 | 1.758 ± 0.206 | 1.737 ± 0.196 | 1.158 ± 0.263 | 2.294 ± 0.225 | 2.015 ± 0.29 | 1.951 ± 0.196 | 3.152 ± 0.321 | 0.172 ± 0.053 | 1.136 ± 0.157 | 0.0 ± 0.0
Pro | 5.66 ± 0.508 | 0.858 ± 0.133 | 2.744 ± 0.252 | 4.245 ± 0.721 | 1.265 ± 0.148 | 3.452 ± 0.437 | 0.879 ± 0.145 | 1.115 ± 0.15 | 1.951 ± 0.586 | 3.259 ± 0.282 | 0.965 ± 0.155 | 1.351 ± 0.166 | 3.752 ± 0.505 | 1.587 ± 0.297 | 4.352 ± 0.456 | 2.873 ± 0.323 | 2.23 ± 0.268 | 3.559 ± 0.33 | 0.429 ± 0.085 | 1.415 ± 0.168 | 0.0 ± 0.0
Gln | 2.251 ± 0.513 | 0.343 ± 0.102 | 1.587 ± 0.27 | 1.415 ± 0.242 | 0.579 ± 0.11 | 1.265 ± 0.245 | 0.708 ± 0.143 | 0.879 ± 0.134 | 1.201 ± 0.305 | 2.294 ± 0.204 | 0.6 ± 0.115 | 0.686 ± 0.113 | 1.737 ± 0.304 | 1.265 ± 0.284 | 2.358 ± 0.251 | 1.844 ± 0.242 | 1.394 ± 0.249 | 1.672 ± 0.289 | 0.214 ± 0.057 | 0.836 ± 0.128 | 0.0 ± 0.0
Arg | 8.255 ± 0.758 | 1.844 ± 0.229 | 5.017 ± 0.388 | 5.017 ± 0.457 | 3.838 ± 0.334 | 5.274 ± 0.448 | 2.08 ± 0.232 | 3.388 ± 0.287 | 2.916 ± 0.276 | 8.083 ± 0.43 | 1.78 ± 0.177 | 2.787 ± 0.268 | 3.752 ± 0.349 | 2.058 ± 0.201 | 7.89 ± 0.636 | 4.395 ± 0.381 | 3.838 ± 0.3 | 7.033 ± 0.408 | 0.579 ± 0.12 | 2.616 ± 0.269 | 0.0 ± 0.0
Ser | 6.368 ± 0.346 | 1.415 ± 0.191 | 4.074 ± 0.35 | 4.545 ± 0.434 | 2.873 ± 0.263 | 4.524 ± 0.79 | 1.437 ± 0.186 | 2.702 ± 0.269 | 2.702 ± 0.299 | 5.896 ± 0.392 | 1.801 ± 0.239 | 2.144 ± 0.237 | 3.431 ± 0.385 | 1.458 ± 0.19 | 4.889 ± 0.336 | 5.639 ± 0.74 | 4.074 ± 0.329 | 5.961 ± 0.39 | 0.429 ± 0.088 | 2.101 ± 0.303 | 0.0 ± 0.0
Thr | 4.052 ± 0.346 | 1.008 ± 0.117 | 3.28 ± 0.238 | 3.602 ± 0.442 | 1.78 ± 0.179 | 3.045 ± 0.288 | 1.329 ± 0.178 | 2.08 ± 0.277 | 1.951 ± 0.254 | 5.274 ± 0.277 | 1.565 ± 0.171 | 2.08 ± 0.198 | 2.616 ± 0.313 | 1.544 ± 0.259 | 4.738 ± 0.373 | 3.409 ± 0.335 | 3.323 ± 0.318 | 5.467 ± 0.448 | 0.515 ± 0.13 | 1.458 ± 0.196 | 0.0 ± 0.0
Val | 7.59 ± 0.551 | 2.037 ± 0.268 | 4.781 ± 0.349 | 4.974 ± 0.369 | 4.009 ± 0.345 | 3.838 ± 0.298 | 1.715 ± 0.167 | 3.238 ± 0.345 | 2.98 ± 0.29 | 8.684 ± 0.516 | 1.994 ± 0.188 | 3.066 ± 0.321 | 3.967 ± 0.355 | 1.887 ± 0.298 | 7.59 ± 0.463 | 6.411 ± 0.515 | 4.846 ± 0.376 | 6.818 ± 0.501 | 0.493 ± 0.104 | 2.723 ± 0.292 | 0.0 ± 0.0
Trp | 0.364 ± 0.095 | 0.129 ± 0.058 | 0.3 ± 0.083 | 0.429 ± 0.091 | 0.429 ± 0.098 | 0.236 ± 0.078 | 0.172 ± 0.057 | 0.3 ± 0.072 | 0.343 ± 0.093 | 0.943 ± 0.147 | 0.3 ± 0.072 | 0.3 ± 0.085 | 0.343 ± 0.107 | 0.15 ± 0.059 | 0.557 ± 0.111 | 0.472 ± 0.103 | 0.622 ± 0.124 | 0.493 ± 0.113 | 0.021 ± 0.024 | 0.279 ± 0.076 | 0.0 ± 0.0
Tyr | 2.53 ± 0.277 | 0.793 ± 0.147 | 1.608 ± 0.187 | 1.415 ± 0.212 | 1.801 ± 0.205 | 1.887 ± 0.2 | 0.858 ± 0.13 | 1.522 ± 0.189 | 1.029 ± 0.176 | 3.13 ± 0.329 | 0.943 ± 0.143 | 1.265 ± 0.205 | 1.093 ± 0.141 | 0.515 ± 0.106 | 2.23 ± 0.211 | 2.037 ± 0.209 | 1.78 ± 0.189 | 3.238 ± 0.249 | 0.236 ± 0.079 | 1.158 ± 0.153 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 141 proteins (46641 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
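Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the replicates is reported as the error. This is an illustration under assumptions, not the database's code: the function names are hypothetical, and taking the sample standard deviation of the replicates as the error is an assumption, since the note does not say which statistic was used.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(replicates)


# Toy one-letter sequences (assumed, for illustration only).
toy = ["AAAC", "ACAC", "CCAA", "AAAA"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps each sequence intact, so a single protein with an unusual composition can noticeably widen the estimated error. A dipeptide that never occurs gives identical replicates and therefore an error of zero, as in the Xaa rows and columns.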

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski