Amino acid dipeptide frequency for Sheeppox virus (strain Turkey/TU-V02127) (SPPV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
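The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to build the table: the function names are hypothetical, overlapping adjacent pairs are counted within each protein (never across protein boundaries), any non-standard letter is mapped to X (Xaa), and frequencies are normalised by the total number of pairs. The database may count or normalise differently.

```python
from collections import Counter

STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")
# Uniform expectation from the text: 1 / 400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs, counted separately per protein.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD_AA else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (made-up sequences, not the SPPV proteome).
toy_proteome = ["MKKIIN", "MIKSLD"]
freqs = dipeptide_per_mille(toy_proteome)

# Two values taken from the table: IleIle and TrpTrp.
print(representation(11.646), representation(0.044))
```

With the table's own values, IleIle (11.646‰) lies above the 2.5‰ uniform expectation and TrpTrp (0.044‰) lies below it.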

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
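As a small worked example of the unit conversion, the following Python sketch turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (multiply by 10^-1)."""
    return value_per_mille * 1e-1


lys_lys = 12.126  # LysLys value from the table, in per mille
fraction = per_mille_to_fraction(lys_lys)
percent = per_mille_to_percent(lys_lys)
print(f"LysLys: {lys_lys} per mille = {fraction:.6f} = {percent:.4f}%")
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰.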

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.047 ± 0.184
AlaCys: 0.371 ± 0.073
AlaAsp: 1.243 ± 0.203
AlaGlu: 0.981 ± 0.199
AlaPhe: 1.352 ± 0.172
AlaGly: 0.676 ± 0.169
AlaHis: 0.393 ± 0.118
AlaIle: 2.879 ± 0.251
AlaLys: 1.81 ± 0.203
AlaLeu: 2.704 ± 0.246
AlaMet: 0.589 ± 0.112
AlaAsn: 1.418 ± 0.18
AlaPro: 0.48 ± 0.123
AlaGln: 0.24 ± 0.076
AlaArg: 0.851 ± 0.184
AlaSer: 2.028 ± 0.243
AlaThr: 1.374 ± 0.163
AlaVal: 1.33 ± 0.163
AlaTrp: 0.153 ± 0.067
AlaTyr: 1.221 ± 0.161
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.371 ± 0.09
CysCys: 0.589 ± 0.151
CysAsp: 1.396 ± 0.186
CysGlu: 0.872 ± 0.129
CysPhe: 0.981 ± 0.132
CysGly: 0.872 ± 0.141
CysHis: 0.262 ± 0.079
CysIle: 1.941 ± 0.21
CysLys: 1.505 ± 0.186
CysLeu: 1.701 ± 0.198
CysMet: 0.349 ± 0.091
CysAsn: 1.876 ± 0.216
CysPro: 0.545 ± 0.109
CysGln: 0.305 ± 0.078
CysArg: 0.523 ± 0.1
CysSer: 1.701 ± 0.212
CysThr: 0.851 ± 0.155
CysVal: 1.243 ± 0.155
CysTrp: 0.153 ± 0.057
CysTyr: 1.439 ± 0.189
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.461 ± 0.165
AspCys: 0.807 ± 0.109
AspAsp: 5.278 ± 0.34
AspGlu: 4.231 ± 0.29
AspPhe: 3.882 ± 0.25
AspGly: 2.115 ± 0.231
AspHis: 0.654 ± 0.105
AspIle: 8.113 ± 0.478
AspLys: 4.885 ± 0.308
AspLeu: 4.558 ± 0.355
AspMet: 1.265 ± 0.166
AspAsn: 5.234 ± 0.391
AspPro: 1.09 ± 0.147
AspGln: 0.872 ± 0.111
AspArg: 1.156 ± 0.161
AspSer: 3.511 ± 0.272
AspThr: 2.595 ± 0.248
AspVal: 3.577 ± 0.274
AspTrp: 0.393 ± 0.1
AspTyr: 3.162 ± 0.279
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.243 ± 0.16
GluCys: 1.047 ± 0.136
GluAsp: 2.421 ± 0.246
GluGlu: 3.577 ± 0.336
GluPhe: 2.639 ± 0.241
GluGly: 1.287 ± 0.177
GluHis: 0.742 ± 0.115
GluIle: 6.085 ± 0.386
GluLys: 6.608 ± 0.349
GluLeu: 5.081 ± 0.351
GluMet: 1.265 ± 0.172
GluAsn: 5.278 ± 0.276
GluPro: 1.592 ± 0.228
GluGln: 0.981 ± 0.152
GluArg: 1.745 ± 0.251
GluSer: 4.209 ± 0.31
GluThr: 3.271 ± 0.297
GluVal: 2.508 ± 0.226
GluTrp: 0.414 ± 0.092
GluTyr: 3.38 ± 0.28
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.156 ± 0.174
PheCys: 1.156 ± 0.17
PheAsp: 3.424 ± 0.265
PheGlu: 2.639 ± 0.194
PhePhe: 3.555 ± 0.309
PheGly: 2.072 ± 0.213
PheHis: 1.025 ± 0.134
PheIle: 6.564 ± 0.414
PheLys: 5.539 ± 0.333
PheLeu: 5.976 ± 0.54
PheMet: 1.156 ± 0.166
PheAsn: 5.212 ± 0.447
PhePro: 2.028 ± 0.222
PheGln: 0.96 ± 0.154
PheArg: 1.178 ± 0.146
PheSer: 5.191 ± 0.31
PheThr: 3.053 ± 0.268
PheVal: 3.228 ± 0.265
PheTrp: 0.567 ± 0.125
PheTyr: 2.901 ± 0.246
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.894 ± 0.147
GlyCys: 0.632 ± 0.122
GlyAsp: 1.963 ± 0.201
GlyGlu: 1.592 ± 0.2
GlyPhe: 2.137 ± 0.238
GlyGly: 1.897 ± 0.261
GlyHis: 0.327 ± 0.09
GlyIle: 4.122 ± 0.347
GlyLys: 3.686 ± 0.279
GlyLeu: 2.443 ± 0.22
GlyMet: 0.545 ± 0.124
GlyAsn: 2.835 ± 0.297
GlyPro: 0.545 ± 0.103
GlyGln: 0.545 ± 0.107
GlyArg: 1.221 ± 0.171
GlySer: 2.421 ± 0.244
GlyThr: 1.657 ± 0.203
GlyVal: 2.203 ± 0.225
GlyTrp: 0.262 ± 0.079
GlyTyr: 1.963 ± 0.195
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.218 ± 0.059
HisCys: 0.567 ± 0.143
HisAsp: 0.676 ± 0.104
HisGlu: 0.676 ± 0.115
HisPhe: 0.785 ± 0.17
HisGly: 0.589 ± 0.126
HisHis: 0.262 ± 0.079
HisIle: 1.919 ± 0.212
HisLys: 1.309 ± 0.157
HisLeu: 1.636 ± 0.19
HisMet: 0.632 ± 0.133
HisAsn: 1.199 ± 0.166
HisPro: 0.523 ± 0.116
HisGln: 0.327 ± 0.063
HisArg: 0.523 ± 0.119
HisSer: 1.112 ± 0.149
HisThr: 0.742 ± 0.156
HisVal: 1.09 ± 0.159
HisTrp: 0.196 ± 0.067
HisTyr: 0.676 ± 0.128
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.421 ± 0.267
IleCys: 1.963 ± 0.218
IleAsp: 6.586 ± 0.368
IleGlu: 6.128 ± 0.439
IlePhe: 6.325 ± 0.39
IleGly: 3.337 ± 0.288
IleHis: 1.701 ± 0.226
IleIle: 11.646 ± 0.682
IleLys: 11.951 ± 0.754
IleLeu: 10.25 ± 0.618
IleMet: 2.181 ± 0.236
IleAsn: 10.425 ± 0.49
IlePro: 3.424 ± 0.281
IleGln: 1.592 ± 0.157
IleArg: 3.337 ± 0.278
IleSer: 10.119 ± 0.443
IleThr: 5.888 ± 0.401
IleVal: 5.67 ± 0.31
IleTrp: 0.72 ± 0.141
IleTyr: 5.518 ± 0.468
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.897 ± 0.23
LysCys: 1.614 ± 0.213
LysAsp: 5.016 ± 0.365
LysGlu: 5.91 ± 0.401
LysPhe: 4.863 ± 0.323
LysGly: 3.031 ± 0.289
LysHis: 1.657 ± 0.185
LysIle: 11.624 ± 0.627
LysLys: 12.126 ± 0.65
LysLeu: 8.047 ± 0.437
LysMet: 2.312 ± 0.22
LysAsn: 10.01 ± 0.47
LysPro: 2.203 ± 0.211
LysGln: 2.181 ± 0.218
LysArg: 3.511 ± 0.266
LysSer: 7.328 ± 0.393
LysThr: 5.103 ± 0.306
LysVal: 4.82 ± 0.227
LysTrp: 0.698 ± 0.151
LysTyr: 6.303 ± 0.426
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.115 ± 0.212
LeuCys: 1.745 ± 0.194
LeuAsp: 4.623 ± 0.322
LeuGlu: 5.125 ± 0.398
LeuPhe: 5.823 ± 0.447
LeuGly: 3.075 ± 0.252
LeuHis: 1.439 ± 0.199
LeuIle: 8.44 ± 0.486
LeuLys: 9.094 ± 0.476
LeuLeu: 8.614 ± 0.582
LeuMet: 2.246 ± 0.208
LeuAsn: 6.761 ± 0.368
LeuPro: 2.813 ± 0.258
LeuGln: 1.81 ± 0.171
LeuArg: 2.508 ± 0.23
LeuSer: 9.072 ± 0.413
LeuThr: 4.951 ± 0.373
LeuVal: 4.885 ± 0.34
LeuTrp: 0.371 ± 0.085
LeuTyr: 4.602 ± 0.34
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.872 ± 0.145
MetCys: 0.371 ± 0.087
MetAsp: 1.679 ± 0.174
MetGlu: 1.352 ± 0.193
MetPhe: 1.461 ± 0.16
MetGly: 0.829 ± 0.106
MetHis: 0.327 ± 0.096
MetIle: 1.897 ± 0.219
MetLys: 1.592 ± 0.191
MetLeu: 2.159 ± 0.224
MetMet: 0.502 ± 0.116
MetAsn: 1.548 ± 0.163
MetPro: 0.523 ± 0.097
MetGln: 0.305 ± 0.083
MetArg: 0.829 ± 0.161
MetSer: 1.897 ± 0.182
MetThr: 1.003 ± 0.139
MetVal: 1.047 ± 0.145
MetTrp: 0.109 ± 0.05
MetTyr: 1.309 ± 0.165
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.657 ± 0.171
AsnCys: 1.418 ± 0.173
AsnAsp: 6.019 ± 0.397
AsnGlu: 5.452 ± 0.345
AsnPhe: 4.449 ± 0.302
AsnGly: 3.511 ± 0.297
AsnHis: 1.527 ± 0.164
AsnIle: 11.232 ± 0.67
AsnLys: 8.811 ± 0.51
AsnLeu: 6.128 ± 0.384
AsnMet: 1.897 ± 0.199
AsnAsn: 10.032 ± 0.654
AsnPro: 2.399 ± 0.234
AsnGln: 1.657 ± 0.18
AsnArg: 1.963 ± 0.197
AsnSer: 5.561 ± 0.319
AsnThr: 4.427 ± 0.368
AsnVal: 5.343 ± 0.328
AsnTrp: 0.327 ± 0.087
AsnTyr: 4.994 ± 0.325
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.567 ± 0.097
ProCys: 0.458 ± 0.109
ProAsp: 1.265 ± 0.172
ProGlu: 1.636 ± 0.167
ProPhe: 1.876 ± 0.195
ProGly: 0.785 ± 0.134
ProHis: 0.48 ± 0.105
ProIle: 3.184 ± 0.285
ProLys: 2.704 ± 0.228
ProLeu: 2.835 ± 0.259
ProMet: 0.545 ± 0.113
ProAsn: 2.181 ± 0.186
ProPro: 0.829 ± 0.121
ProGln: 0.589 ± 0.103
ProArg: 0.872 ± 0.148
ProSer: 2.268 ± 0.2
ProThr: 1.505 ± 0.184
ProVal: 1.374 ± 0.143
ProTrp: 0.174 ± 0.067
ProTyr: 1.112 ± 0.152
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.305 ± 0.107
GlnCys: 0.349 ± 0.089
GlnAsp: 0.894 ± 0.156
GlnGlu: 0.96 ± 0.154
GlnPhe: 0.589 ± 0.108
GlnGly: 0.414 ± 0.094
GlnHis: 0.371 ± 0.082
GlnIle: 1.505 ± 0.206
GlnLys: 2.137 ± 0.243
GlnLeu: 1.767 ± 0.217
GlnMet: 0.48 ± 0.097
GlnAsn: 1.309 ± 0.14
GlnPro: 0.502 ± 0.114
GlnGln: 0.654 ± 0.136
GlnArg: 0.742 ± 0.142
GlnSer: 1.701 ± 0.206
GlnThr: 1.287 ± 0.141
GlnVal: 1.003 ± 0.138
GlnTrp: 0.218 ± 0.068
GlnTyr: 1.221 ± 0.143
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.742 ± 0.112
ArgCys: 0.632 ± 0.111
ArgAsp: 1.461 ± 0.175
ArgGlu: 1.374 ± 0.172
ArgPhe: 1.897 ± 0.216
ArgGly: 1.287 ± 0.175
ArgHis: 0.589 ± 0.119
ArgIle: 2.639 ± 0.288
ArgLys: 2.922 ± 0.285
ArgLeu: 2.29 ± 0.191
ArgMet: 0.611 ± 0.116
ArgAsn: 2.05 ± 0.264
ArgPro: 0.632 ± 0.13
ArgGln: 0.763 ± 0.135
ArgArg: 1.069 ± 0.167
ArgSer: 2.421 ± 0.251
ArgThr: 1.461 ± 0.216
ArgVal: 1.723 ± 0.205
ArgTrp: 0.24 ± 0.057
ArgTyr: 1.788 ± 0.212
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.876 ± 0.202
SerCys: 1.985 ± 0.201
SerAsp: 5.3 ± 0.452
SerGlu: 4.144 ± 0.391
SerPhe: 5.147 ± 0.311
SerGly: 2.573 ± 0.234
SerHis: 1.33 ± 0.196
SerIle: 8.658 ± 0.431
SerLys: 8.026 ± 0.413
SerLeu: 7.917 ± 0.36
SerMet: 1.636 ± 0.178
SerAsn: 7.001 ± 0.357
SerPro: 1.963 ± 0.236
SerGln: 1.919 ± 0.195
SerArg: 2.203 ± 0.202
SerSer: 7.001 ± 0.391
SerThr: 3.62 ± 0.35
SerVal: 4.514 ± 0.305
SerTrp: 0.393 ± 0.08
SerTyr: 4.275 ± 0.288
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.418 ± 0.214
ThrCys: 1.156 ± 0.185
ThrAsp: 2.508 ± 0.261
ThrGlu: 2.857 ± 0.223
ThrPhe: 3.25 ± 0.283
ThrGly: 1.243 ± 0.166
ThrHis: 1.003 ± 0.131
ThrIle: 5.736 ± 0.421
ThrLys: 4.994 ± 0.292
ThrLeu: 5.081 ± 0.284
ThrMet: 1.09 ± 0.147
ThrAsn: 3.773 ± 0.252
ThrPro: 1.788 ± 0.202
ThrGln: 0.676 ± 0.105
ThrArg: 1.309 ± 0.19
ThrSer: 4.602 ± 0.362
ThrThr: 3.577 ± 0.3
ThrVal: 2.748 ± 0.256
ThrTrp: 0.262 ± 0.064
ThrTyr: 2.704 ± 0.25
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.548 ± 0.208
ValCys: 1.047 ± 0.179
ValAsp: 3.293 ± 0.261
ValGlu: 2.77 ± 0.243
ValPhe: 3.642 ± 0.322
ValGly: 1.636 ± 0.178
ValHis: 0.698 ± 0.11
ValIle: 5.365 ± 0.353
ValLys: 4.929 ± 0.322
ValLeu: 5.539 ± 0.289
ValMet: 1.134 ± 0.148
ValAsn: 5.365 ± 0.382
ValPro: 1.483 ± 0.173
ValGln: 0.981 ± 0.165
ValArg: 1.548 ± 0.164
ValSer: 4.558 ± 0.378
ValThr: 2.966 ± 0.291
ValVal: 2.792 ± 0.25
ValTrp: 0.305 ± 0.086
ValTyr: 3.031 ± 0.233
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.131 ± 0.054
TrpCys: 0.131 ± 0.056
TrpAsp: 0.305 ± 0.089
TrpGlu: 0.414 ± 0.093
TrpPhe: 0.458 ± 0.104
TrpGly: 0.262 ± 0.078
TrpHis: 0.0 ± 0.0
TrpIle: 0.698 ± 0.123
TrpLys: 0.654 ± 0.112
TrpLeu: 0.567 ± 0.105
TrpMet: 0.218 ± 0.068
TrpAsn: 0.502 ± 0.096
TrpPro: 0.262 ± 0.071
TrpGln: 0.065 ± 0.041
TrpArg: 0.24 ± 0.073
TrpSer: 0.523 ± 0.128
TrpThr: 0.196 ± 0.064
TrpVal: 0.262 ± 0.081
TrpTrp: 0.044 ± 0.029
TrpTyr: 0.349 ± 0.079
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.199 ± 0.166
TyrCys: 1.439 ± 0.2
TyrAsp: 3.489 ± 0.237
TyrGlu: 2.682 ± 0.212
TyrPhe: 3.468 ± 0.264
TyrGly: 2.399 ± 0.258
TyrHis: 0.872 ± 0.144
TyrIle: 6.935 ± 0.377
TyrLys: 4.972 ± 0.33
TyrLeu: 4.82 ± 0.37
TyrMet: 0.938 ± 0.136
TyrAsn: 4.798 ± 0.338
TyrPro: 1.614 ± 0.167
TyrGln: 0.981 ± 0.158
TyrArg: 1.287 ± 0.176
TyrSer: 4.253 ± 0.252
TyrThr: 2.246 ± 0.204
TyrVal: 3.271 ± 0.243
TyrTrp: 0.284 ± 0.079
TyrTyr: 2.726 ± 0.22
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 144 proteins (45854 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
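A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation across resamples. The error definition and the normalisation by the number of adjacent pairs are both assumptions.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille, counting overlapping adjacent pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, not the SPPV proteome).
toy_proteome = ["MKKLIN", "MIIKSD", "MNNIKL"]
print(pair_per_mille(toy_proteome, "IK"), bootstrap_error(toy_proteome, "IK"))
```

Under this scheme a dipeptide that never occurs gets a frequency of 0.0 with an error of 0.0 in every resample, which is consistent with the Xaa entries in the table.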

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski