Amino acid dipepetide frequency for Orf virus (ORFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.979AlaAla: 15.979 ± 0.916
2.592AlaCys: 2.592 ± 0.219
5.839AlaAsp: 5.839 ± 0.364
6.096AlaGlu: 6.096 ± 0.361
3.133AlaPhe: 3.133 ± 0.271
6.124AlaGly: 6.124 ± 0.484
3.361AlaHis: 3.361 ± 0.315
2.692AlaIle: 2.692 ± 0.219
3.005AlaLys: 3.005 ± 0.204
10.069AlaLeu: 10.069 ± 0.542
2.706AlaMet: 2.706 ± 0.212
2.378AlaAsn: 2.378 ± 0.182
6.779AlaPro: 6.779 ± 0.419
3.247AlaGln: 3.247 ± 0.23
13.444AlaArg: 13.444 ± 0.603
9.243AlaSer: 9.243 ± 0.568
4.785AlaThr: 4.785 ± 0.256
8.844AlaVal: 8.844 ± 0.468
0.527AlaTrp: 0.527 ± 0.092
1.88AlaTyr: 1.88 ± 0.162
0.028AlaXaa: 0.028 ± 0.021
Cys
2.905CysAla: 2.905 ± 0.231
0.755CysCys: 0.755 ± 0.117
1.068CysAsp: 1.068 ± 0.131
1.253CysGlu: 1.253 ± 0.13
0.826CysPhe: 0.826 ± 0.113
1.951CysGly: 1.951 ± 0.151
0.256CysHis: 0.256 ± 0.062
0.555CysIle: 0.555 ± 0.107
0.598CysLys: 0.598 ± 0.084
2.165CysLeu: 2.165 ± 0.19
0.641CysMet: 0.641 ± 0.1
0.442CysAsn: 0.442 ± 0.077
1.139CysPro: 1.139 ± 0.224
0.37CysGln: 0.37 ± 0.065
2.321CysArg: 2.321 ± 0.182
2.193CysSer: 2.193 ± 0.186
1.538CysThr: 1.538 ± 0.15
2.008CysVal: 2.008 ± 0.17
0.242CysTrp: 0.242 ± 0.059
0.47CysTyr: 0.47 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
6.238AspAla: 6.238 ± 0.347
0.812AspCys: 0.812 ± 0.106
2.279AspAsp: 2.279 ± 0.205
2.991AspGlu: 2.991 ± 0.222
2.108AspPhe: 2.108 ± 0.151
3.489AspGly: 3.489 ± 0.215
1.068AspHis: 1.068 ± 0.125
1.965AspIle: 1.965 ± 0.176
1.196AspLys: 1.196 ± 0.142
4.315AspLeu: 4.315 ± 0.25
1.567AspMet: 1.567 ± 0.145
1.495AspAsn: 1.495 ± 0.161
2.378AspPro: 2.378 ± 0.208
0.783AspGln: 0.783 ± 0.116
3.461AspArg: 3.461 ± 0.227
3.276AspSer: 3.276 ± 0.229
2.037AspThr: 2.037 ± 0.194
4.344AspVal: 4.344 ± 0.256
0.342AspTrp: 0.342 ± 0.074
1.054AspTyr: 1.054 ± 0.13
0.0AspXaa: 0.0 ± 0.0
Glu
4.714GluAla: 4.714 ± 0.295
0.883GluCys: 0.883 ± 0.137
2.934GluAsp: 2.934 ± 0.191
3.546GluGlu: 3.546 ± 0.291
2.079GluPhe: 2.079 ± 0.208
2.934GluGly: 2.934 ± 0.233
2.079GluHis: 2.079 ± 0.218
2.022GluIle: 2.022 ± 0.218
2.264GluLys: 2.264 ± 0.216
4.472GluLeu: 4.472 ± 0.325
1.225GluMet: 1.225 ± 0.147
1.495GluAsn: 1.495 ± 0.153
2.863GluPro: 2.863 ± 0.241
1.51GluGln: 1.51 ± 0.15
5.127GluArg: 5.127 ± 0.328
3.56GluSer: 3.56 ± 0.233
3.204GluThr: 3.204 ± 0.244
4.372GluVal: 4.372 ± 0.345
0.456GluTrp: 0.456 ± 0.067
1.495GluTyr: 1.495 ± 0.132
0.0GluXaa: 0.0 ± 0.0
Phe
3.689PheAla: 3.689 ± 0.215
1.04PheCys: 1.04 ± 0.14
1.723PheAsp: 1.723 ± 0.158
1.866PheGlu: 1.866 ± 0.166
1.994PhePhe: 1.994 ± 0.193
1.98PheGly: 1.98 ± 0.171
0.855PheHis: 0.855 ± 0.132
1.182PheIle: 1.182 ± 0.139
1.296PheLys: 1.296 ± 0.157
3.56PheLeu: 3.56 ± 0.268
1.196PheMet: 1.196 ± 0.151
1.325PheAsn: 1.325 ± 0.136
1.652PhePro: 1.652 ± 0.162
0.669PheGln: 0.669 ± 0.092
2.621PheArg: 2.621 ± 0.216
3.646PheSer: 3.646 ± 0.249
2.051PheThr: 2.051 ± 0.199
3.447PheVal: 3.447 ± 0.271
0.399PheTrp: 0.399 ± 0.089
0.84PheTyr: 0.84 ± 0.11
0.0PheXaa: 0.0 ± 0.0
Gly
7.662GlyAla: 7.662 ± 0.554
1.396GlyCys: 1.396 ± 0.15
3.176GlyAsp: 3.176 ± 0.234
3.034GlyGlu: 3.034 ± 0.194
1.937GlyPhe: 1.937 ± 0.15
5.084GlyGly: 5.084 ± 0.348
2.421GlyHis: 2.421 ± 0.344
1.823GlyIle: 1.823 ± 0.182
1.88GlyLys: 1.88 ± 0.171
4.671GlyLeu: 4.671 ± 0.29
1.225GlyMet: 1.225 ± 0.141
1.638GlyAsn: 1.638 ± 0.164
1.994GlyPro: 1.994 ± 0.19
1.353GlyGln: 1.353 ± 0.159
7.292GlyArg: 7.292 ± 0.483
4.258GlySer: 4.258 ± 0.282
2.791GlyThr: 2.791 ± 0.251
5.497GlyVal: 5.497 ± 0.297
0.641GlyTrp: 0.641 ± 0.096
1.31GlyTyr: 1.31 ± 0.136
0.043GlyXaa: 0.043 ± 0.025
His
3.703HisAla: 3.703 ± 0.383
0.57HisCys: 0.57 ± 0.1
1.424HisAsp: 1.424 ± 0.144
1.681HisGlu: 1.681 ± 0.279
0.755HisPhe: 0.755 ± 0.092
2.507HisGly: 2.507 ± 0.277
1.182HisHis: 1.182 ± 0.19
0.855HisIle: 0.855 ± 0.116
0.684HisLys: 0.684 ± 0.111
2.635HisLeu: 2.635 ± 0.24
0.655HisMet: 0.655 ± 0.08
0.498HisAsn: 0.498 ± 0.083
1.225HisPro: 1.225 ± 0.145
0.783HisGln: 0.783 ± 0.147
2.649HisArg: 2.649 ± 0.275
1.453HisSer: 1.453 ± 0.154
1.325HisThr: 1.325 ± 0.143
3.019HisVal: 3.019 ± 0.339
0.228HisTrp: 0.228 ± 0.061
0.484HisTyr: 0.484 ± 0.071
0.014HisXaa: 0.014 ± 0.014
Ile
2.834IleAla: 2.834 ± 0.19
0.968IleCys: 0.968 ± 0.126
1.424IleAsp: 1.424 ± 0.147
1.851IleGlu: 1.851 ± 0.16
1.809IlePhe: 1.809 ± 0.146
1.154IleGly: 1.154 ± 0.132
0.612IleHis: 0.612 ± 0.104
1.082IleIle: 1.082 ± 0.126
1.253IleLys: 1.253 ± 0.19
2.364IleLeu: 2.364 ± 0.242
0.869IleMet: 0.869 ± 0.138
1.396IleAsn: 1.396 ± 0.16
1.51IlePro: 1.51 ± 0.178
0.712IleGln: 0.712 ± 0.127
2.378IleArg: 2.378 ± 0.209
3.005IleSer: 3.005 ± 0.268
1.438IleThr: 1.438 ± 0.15
2.706IleVal: 2.706 ± 0.234
0.199IleTrp: 0.199 ± 0.048
0.997IleTyr: 0.997 ± 0.125
0.0IleXaa: 0.0 ± 0.0
Lys
1.823LysAla: 1.823 ± 0.18
0.527LysCys: 0.527 ± 0.091
1.481LysAsp: 1.481 ± 0.141
1.438LysGlu: 1.438 ± 0.167
1.325LysPhe: 1.325 ± 0.155
1.495LysGly: 1.495 ± 0.136
0.897LysHis: 0.897 ± 0.114
1.937LysIle: 1.937 ± 0.258
1.951LysLys: 1.951 ± 0.176
3.105LysLeu: 3.105 ± 0.244
1.011LysMet: 1.011 ± 0.114
1.282LysAsn: 1.282 ± 0.157
1.41LysPro: 1.41 ± 0.179
0.911LysGln: 0.911 ± 0.125
2.763LysArg: 2.763 ± 0.204
2.977LysSer: 2.977 ± 0.237
2.307LysThr: 2.307 ± 0.184
1.994LysVal: 1.994 ± 0.191
0.171LysTrp: 0.171 ± 0.061
1.239LysTyr: 1.239 ± 0.108
0.014LysXaa: 0.014 ± 0.013
Leu
9.528LeuAla: 9.528 ± 0.468
2.293LeuCys: 2.293 ± 0.197
4.443LeuAsp: 4.443 ± 0.335
4.899LeuGlu: 4.899 ± 0.286
3.603LeuPhe: 3.603 ± 0.25
4.785LeuGly: 4.785 ± 0.305
2.507LeuHis: 2.507 ± 0.295
2.663LeuIle: 2.663 ± 0.205
2.834LeuLys: 2.834 ± 0.227
9.072LeuLeu: 9.072 ± 0.383
2.621LeuMet: 2.621 ± 0.204
2.407LeuAsn: 2.407 ± 0.221
4.358LeuPro: 4.358 ± 0.302
2.065LeuGln: 2.065 ± 0.198
9.599LeuArg: 9.599 ± 0.523
6.694LeuSer: 6.694 ± 0.32
4.102LeuThr: 4.102 ± 0.311
6.779LeuVal: 6.779 ± 0.395
0.57LeuTrp: 0.57 ± 0.088
2.179LeuTyr: 2.179 ± 0.172
0.028LeuXaa: 0.028 ± 0.018
Met
2.621MetAla: 2.621 ± 0.211
0.598MetCys: 0.598 ± 0.093
1.41MetAsp: 1.41 ± 0.149
1.467MetGlu: 1.467 ± 0.171
1.182MetPhe: 1.182 ± 0.131
1.097MetGly: 1.097 ± 0.119
0.498MetHis: 0.498 ± 0.087
0.926MetIle: 0.926 ± 0.112
0.968MetLys: 0.968 ± 0.132
2.364MetLeu: 2.364 ± 0.206
0.555MetMet: 0.555 ± 0.095
0.641MetAsn: 0.641 ± 0.104
1.467MetPro: 1.467 ± 0.172
0.555MetGln: 0.555 ± 0.084
2.962MetArg: 2.962 ± 0.227
3.176MetSer: 3.176 ± 0.243
1.51MetThr: 1.51 ± 0.146
1.695MetVal: 1.695 ± 0.157
0.271MetTrp: 0.271 ± 0.066
0.911MetTyr: 0.911 ± 0.107
0.014MetXaa: 0.014 ± 0.014
Asn
2.72AsnAla: 2.72 ± 0.222
0.527AsnCys: 0.527 ± 0.085
1.025AsnAsp: 1.025 ± 0.133
1.139AsnGlu: 1.139 ± 0.132
1.524AsnPhe: 1.524 ± 0.173
1.552AsnGly: 1.552 ± 0.159
0.57AsnHis: 0.57 ± 0.089
1.353AsnIle: 1.353 ± 0.146
0.869AsnLys: 0.869 ± 0.107
2.151AsnLeu: 2.151 ± 0.199
1.011AsnMet: 1.011 ± 0.139
1.31AsnAsn: 1.31 ± 0.167
1.268AsnPro: 1.268 ± 0.145
0.555AsnGln: 0.555 ± 0.108
1.695AsnArg: 1.695 ± 0.158
2.364AsnSer: 2.364 ± 0.202
1.923AsnThr: 1.923 ± 0.183
2.521AsnVal: 2.521 ± 0.208
0.128AsnTrp: 0.128 ± 0.042
0.983AsnTyr: 0.983 ± 0.123
0.0AsnXaa: 0.0 ± 0.0
Pro
6.352ProAla: 6.352 ± 0.532
1.353ProCys: 1.353 ± 0.144
2.635ProAsp: 2.635 ± 0.169
3.133ProGlu: 3.133 ± 0.253
1.239ProPhe: 1.239 ± 0.137
3.617ProGly: 3.617 ± 0.273
1.424ProHis: 1.424 ± 0.158
1.054ProIle: 1.054 ± 0.122
1.268ProLys: 1.268 ± 0.161
4.159ProLeu: 4.159 ± 0.285
1.182ProMet: 1.182 ± 0.126
1.082ProAsn: 1.082 ± 0.124
4.045ProPro: 4.045 ± 0.335
1.624ProGln: 1.624 ± 0.167
5.953ProArg: 5.953 ± 0.405
4.743ProSer: 4.743 ± 0.329
2.492ProThr: 2.492 ± 0.215
3.603ProVal: 3.603 ± 0.281
0.47ProTrp: 0.47 ± 0.096
0.883ProTyr: 0.883 ± 0.121
0.028ProXaa: 0.028 ± 0.017
Gln
1.752GlnAla: 1.752 ± 0.161
0.484GlnCys: 0.484 ± 0.086
0.983GlnAsp: 0.983 ± 0.097
1.438GlnGlu: 1.438 ± 0.173
0.769GlnPhe: 0.769 ± 0.108
0.911GlnGly: 0.911 ± 0.12
1.139GlnHis: 1.139 ± 0.153
0.655GlnIle: 0.655 ± 0.097
0.897GlnLys: 0.897 ± 0.123
2.008GlnLeu: 2.008 ± 0.187
0.541GlnMet: 0.541 ± 0.077
0.826GlnAsn: 0.826 ± 0.103
1.851GlnPro: 1.851 ± 0.196
1.054GlnGln: 1.054 ± 0.155
3.546GlnArg: 3.546 ± 0.321
1.681GlnSer: 1.681 ± 0.161
1.538GlnThr: 1.538 ± 0.183
1.552GlnVal: 1.552 ± 0.164
0.185GlnTrp: 0.185 ± 0.059
0.584GlnTyr: 0.584 ± 0.085
0.0GlnXaa: 0.0 ± 0.0
Arg
13.373ArgAla: 13.373 ± 0.822
2.264ArgCys: 2.264 ± 0.248
4.059ArgAsp: 4.059 ± 0.242
5.398ArgGlu: 5.398 ± 0.331
3.048ArgPhe: 3.048 ± 0.197
7.762ArgGly: 7.762 ± 0.582
3.546ArgHis: 3.546 ± 0.235
2.122ArgIle: 2.122 ± 0.175
3.19ArgLys: 3.19 ± 0.219
8.374ArgLeu: 8.374 ± 0.345
2.592ArgMet: 2.592 ± 0.211
2.236ArgAsn: 2.236 ± 0.191
4.857ArgPro: 4.857 ± 0.294
2.478ArgGln: 2.478 ± 0.222
11.721ArgArg: 11.721 ± 0.642
8.189ArgSer: 8.189 ± 0.537
4.942ArgThr: 4.942 ± 0.332
9.215ArgVal: 9.215 ± 0.466
0.798ArgTrp: 0.798 ± 0.118
2.136ArgTyr: 2.136 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
10.325SerAla: 10.325 ± 0.663
1.88SerCys: 1.88 ± 0.197
2.948SerAsp: 2.948 ± 0.222
3.845SerGlu: 3.845 ± 0.218
2.45SerPhe: 2.45 ± 0.2
5.497SerGly: 5.497 ± 0.387
1.154SerHis: 1.154 ± 0.119
2.592SerIle: 2.592 ± 0.246
2.834SerLys: 2.834 ± 0.236
6.181SerLeu: 6.181 ± 0.28
2.734SerMet: 2.734 ± 0.21
1.78SerAsn: 1.78 ± 0.174
4.401SerPro: 4.401 ± 0.31
1.681SerGln: 1.681 ± 0.162
8.317SerArg: 8.317 ± 0.498
10.397SerSer: 10.397 ± 0.892
6.922SerThr: 6.922 ± 0.534
7.064SerVal: 7.064 ± 0.326
0.812SerTrp: 0.812 ± 0.115
1.381SerTyr: 1.381 ± 0.122
0.014SerXaa: 0.014 ± 0.014
Thr
5.626ThrAla: 5.626 ± 0.285
1.111ThrCys: 1.111 ± 0.118
2.236ThrAsp: 2.236 ± 0.186
2.72ThrGlu: 2.72 ± 0.215
1.908ThrPhe: 1.908 ± 0.175
2.777ThrGly: 2.777 ± 0.23
1.111ThrHis: 1.111 ± 0.138
1.538ThrIle: 1.538 ± 0.151
1.851ThrLys: 1.851 ± 0.153
5.027ThrLeu: 5.027 ± 0.258
1.495ThrMet: 1.495 ± 0.154
1.624ThrAsn: 1.624 ± 0.162
3.674ThrPro: 3.674 ± 0.304
1.31ThrGln: 1.31 ± 0.126
5.483ThrArg: 5.483 ± 0.35
5.597ThrSer: 5.597 ± 0.421
3.703ThrThr: 3.703 ± 0.35
3.76ThrVal: 3.76 ± 0.241
0.498ThrTrp: 0.498 ± 0.093
1.353ThrTyr: 1.353 ± 0.132
0.014ThrXaa: 0.014 ± 0.013
Val
8.004ValAla: 8.004 ± 0.383
2.706ValCys: 2.706 ± 0.197
4.828ValAsp: 4.828 ± 0.238
4.244ValGlu: 4.244 ± 0.339
3.546ValPhe: 3.546 ± 0.243
4.401ValGly: 4.401 ± 0.289
3.062ValHis: 3.062 ± 0.324
2.122ValIle: 2.122 ± 0.194
2.122ValLys: 2.122 ± 0.193
8.431ValLeu: 8.431 ± 0.428
1.88ValMet: 1.88 ± 0.161
2.179ValAsn: 2.179 ± 0.209
4.03ValPro: 4.03 ± 0.262
2.122ValGln: 2.122 ± 0.26
8.417ValArg: 8.417 ± 0.439
6.209ValSer: 6.209 ± 0.384
3.831ValThr: 3.831 ± 0.248
7.107ValVal: 7.107 ± 0.5
0.527ValTrp: 0.527 ± 0.1
2.208ValTyr: 2.208 ± 0.2
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.083
0.271TrpCys: 0.271 ± 0.065
0.199TrpAsp: 0.199 ± 0.054
0.214TrpGlu: 0.214 ± 0.056
0.413TrpPhe: 0.413 ± 0.072
0.385TrpGly: 0.385 ± 0.088
0.157TrpHis: 0.157 ± 0.046
0.356TrpIle: 0.356 ± 0.078
0.399TrpLys: 0.399 ± 0.079
0.741TrpLeu: 0.741 ± 0.105
0.399TrpMet: 0.399 ± 0.076
0.157TrpAsn: 0.157 ± 0.047
0.427TrpPro: 0.427 ± 0.081
0.185TrpGln: 0.185 ± 0.048
0.897TrpArg: 0.897 ± 0.111
0.726TrpSer: 0.726 ± 0.099
0.598TrpThr: 0.598 ± 0.092
0.313TrpVal: 0.313 ± 0.063
0.185TrpTrp: 0.185 ± 0.071
0.256TrpTyr: 0.256 ± 0.067
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.336TyrAla: 2.336 ± 0.207
0.655TyrCys: 0.655 ± 0.087
1.082TyrAsp: 1.082 ± 0.125
0.983TyrGlu: 0.983 ± 0.118
1.381TyrPhe: 1.381 ± 0.149
1.595TyrGly: 1.595 ± 0.166
0.427TyrHis: 0.427 ± 0.091
1.04TyrIle: 1.04 ± 0.135
0.698TyrLys: 0.698 ± 0.122
2.165TyrLeu: 2.165 ± 0.168
0.769TyrMet: 0.769 ± 0.091
0.997TyrAsn: 0.997 ± 0.127
0.997TyrPro: 0.997 ± 0.109
0.427TyrGln: 0.427 ± 0.083
1.638TyrArg: 1.638 ± 0.155
1.809TyrSer: 1.809 ± 0.172
1.367TyrThr: 1.367 ± 0.153
2.122TyrVal: 2.122 ± 0.179
0.171TyrTrp: 0.171 ± 0.05
0.584TyrTyr: 0.584 ± 0.093
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.028XaaAla: 0.028 ± 0.019
0.014XaaCys: 0.014 ± 0.014
0.014XaaAsp: 0.014 ± 0.014
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.028XaaGly: 0.028 ± 0.021
0.0XaaHis: 0.0 ± 0.0
0.028XaaIle: 0.028 ± 0.017
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.014XaaPro: 0.014 ± 0.012
0.0XaaGln: 0.0 ± 0.0
0.014XaaArg: 0.014 ± 0.015
0.014XaaSer: 0.014 ± 0.014
0.014XaaThr: 0.014 ± 0.013
0.014XaaVal: 0.014 ± 0.013
0.0XaaTrp: 0.0 ± 0.0
0.014XaaTyr: 0.014 ± 0.014
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 259 proteins (70216 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski