Amino acid dipepetide frequency for Shearwaterpox virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.378AlaAla: 1.378 ± 0.146
0.956AlaCys: 0.956 ± 0.104
2.632AlaAsp: 2.632 ± 0.228
1.521AlaGlu: 1.521 ± 0.151
0.997AlaPhe: 0.997 ± 0.094
1.388AlaGly: 1.388 ± 0.141
0.421AlaHis: 0.421 ± 0.061
3.691AlaIle: 3.691 ± 0.225
2.591AlaLys: 2.591 ± 0.174
2.971AlaLeu: 2.971 ± 0.188
0.792AlaMet: 0.792 ± 0.101
2.344AlaAsn: 2.344 ± 0.174
0.658AlaPro: 0.658 ± 0.083
0.473AlaGln: 0.473 ± 0.084
1.172AlaArg: 1.172 ± 0.124
2.673AlaSer: 2.673 ± 0.176
1.768AlaThr: 1.768 ± 0.154
3.464AlaVal: 3.464 ± 0.266
0.113AlaTrp: 0.113 ± 0.035
1.563AlaTyr: 1.563 ± 0.147
0.0AlaXaa: 0.0 ± 0.0
Cys
0.576CysAla: 0.576 ± 0.084
0.637CysCys: 0.637 ± 0.113
1.203CysAsp: 1.203 ± 0.128
1.244CysGlu: 1.244 ± 0.112
0.977CysPhe: 0.977 ± 0.096
1.028CysGly: 1.028 ± 0.113
0.308CysHis: 0.308 ± 0.054
2.57CysIle: 2.57 ± 0.163
2.231CysLys: 2.231 ± 0.164
1.573CysLeu: 1.573 ± 0.146
0.679CysMet: 0.679 ± 0.093
2.066CysAsn: 2.066 ± 0.166
0.73CysPro: 0.73 ± 0.089
0.288CysGln: 0.288 ± 0.056
0.812CysArg: 0.812 ± 0.087
1.563CysSer: 1.563 ± 0.116
1.007CysThr: 1.007 ± 0.104
1.275CysVal: 1.275 ± 0.115
0.236CysTrp: 0.236 ± 0.056
1.655CysTyr: 1.655 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
1.953AspAla: 1.953 ± 0.132
1.028AspCys: 1.028 ± 0.1
4.092AspAsp: 4.092 ± 0.239
3.382AspGlu: 3.382 ± 0.217
2.724AspPhe: 2.724 ± 0.162
2.005AspGly: 2.005 ± 0.147
0.822AspHis: 0.822 ± 0.084
8.687AspIle: 8.687 ± 0.347
5.5AspLys: 5.5 ± 0.226
4.832AspLeu: 4.832 ± 0.246
1.593AspMet: 1.593 ± 0.125
5.346AspAsn: 5.346 ± 0.211
1.871AspPro: 1.871 ± 0.15
0.874AspGln: 0.874 ± 0.102
2.179AspArg: 2.179 ± 0.157
4.236AspSer: 4.236 ± 0.246
3.978AspThr: 3.978 ± 0.196
3.793AspVal: 3.793 ± 0.191
0.432AspTrp: 0.432 ± 0.063
3.845AspTyr: 3.845 ± 0.21
0.0AspXaa: 0.0 ± 0.0
Glu
1.686GluAla: 1.686 ± 0.143
1.213GluCys: 1.213 ± 0.109
3.927GluAsp: 3.927 ± 0.203
4.03GluGlu: 4.03 ± 0.238
2.241GluPhe: 2.241 ± 0.156
1.871GluGly: 1.871 ± 0.145
1.059GluHis: 1.059 ± 0.126
5.757GluIle: 5.757 ± 0.279
4.873GluLys: 4.873 ± 0.231
6.446GluLeu: 6.446 ± 0.314
1.254GluMet: 1.254 ± 0.115
3.968GluAsn: 3.968 ± 0.26
1.121GluPro: 1.121 ± 0.129
1.408GluGln: 1.408 ± 0.122
2.21GluArg: 2.21 ± 0.199
3.721GluSer: 3.721 ± 0.19
2.95GluThr: 2.95 ± 0.177
2.94GluVal: 2.94 ± 0.183
0.442GluTrp: 0.442 ± 0.07
3.578GluTyr: 3.578 ± 0.185
0.0GluXaa: 0.0 ± 0.0
Phe
0.997PheAla: 0.997 ± 0.101
0.864PheCys: 0.864 ± 0.099
2.128PheAsp: 2.128 ± 0.148
2.056PheGlu: 2.056 ± 0.16
1.871PhePhe: 1.871 ± 0.172
1.439PheGly: 1.439 ± 0.106
0.792PheHis: 0.792 ± 0.098
4.832PheIle: 4.832 ± 0.227
3.66PheLys: 3.66 ± 0.203
3.824PheLeu: 3.824 ± 0.244
1.131PheMet: 1.131 ± 0.11
3.485PheAsn: 3.485 ± 0.222
1.378PhePro: 1.378 ± 0.116
0.504PheGln: 0.504 ± 0.067
1.367PheArg: 1.367 ± 0.101
3.547PheSer: 3.547 ± 0.186
2.673PheThr: 2.673 ± 0.142
2.251PheVal: 2.251 ± 0.166
0.267PheTrp: 0.267 ± 0.048
2.293PheTyr: 2.293 ± 0.14
0.0PheXaa: 0.0 ± 0.0
Gly
2.817GlyAla: 2.817 ± 0.266
0.802GlyCys: 0.802 ± 0.089
2.128GlyAsp: 2.128 ± 0.153
1.799GlyGlu: 1.799 ± 0.121
1.614GlyPhe: 1.614 ± 0.127
1.686GlyGly: 1.686 ± 0.276
0.658GlyHis: 0.658 ± 0.095
3.68GlyIle: 3.68 ± 0.174
3.475GlyLys: 3.475 ± 0.204
2.56GlyLeu: 2.56 ± 0.155
0.679GlyMet: 0.679 ± 0.084
2.992GlyAsn: 2.992 ± 0.193
0.555GlyPro: 0.555 ± 0.069
0.617GlyGln: 0.617 ± 0.083
1.563GlyArg: 1.563 ± 0.154
2.642GlySer: 2.642 ± 0.223
1.717GlyThr: 1.717 ± 0.129
2.097GlyVal: 2.097 ± 0.184
0.267GlyTrp: 0.267 ± 0.067
2.837GlyTyr: 2.837 ± 0.168
0.0GlyXaa: 0.0 ± 0.0
His
0.689HisAla: 0.689 ± 0.084
0.401HisCys: 0.401 ± 0.057
1.09HisAsp: 1.09 ± 0.107
0.977HisGlu: 0.977 ± 0.108
0.74HisPhe: 0.74 ± 0.075
1.007HisGly: 1.007 ± 0.101
0.36HisHis: 0.36 ± 0.05
2.046HisIle: 2.046 ± 0.157
1.367HisLys: 1.367 ± 0.132
1.82HisLeu: 1.82 ± 0.142
0.514HisMet: 0.514 ± 0.068
1.593HisAsn: 1.593 ± 0.133
0.596HisPro: 0.596 ± 0.07
0.339HisGln: 0.339 ± 0.068
0.74HisArg: 0.74 ± 0.084
1.285HisSer: 1.285 ± 0.117
0.946HisThr: 0.946 ± 0.089
1.275HisVal: 1.275 ± 0.114
0.164HisTrp: 0.164 ± 0.036
1.45HisTyr: 1.45 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
3.526IleAla: 3.526 ± 0.187
2.179IleCys: 2.179 ± 0.18
7.474IleAsp: 7.474 ± 0.305
5.767IleGlu: 5.767 ± 0.233
3.978IlePhe: 3.978 ± 0.246
2.817IleGly: 2.817 ± 0.144
1.922IleHis: 1.922 ± 0.12
10.517IleIle: 10.517 ± 0.434
9.705IleLys: 9.705 ± 0.336
9.612IleLeu: 9.612 ± 0.348
2.57IleMet: 2.57 ± 0.155
9.643IleAsn: 9.643 ± 0.417
3.207IlePro: 3.207 ± 0.172
1.964IleGln: 1.964 ± 0.145
4.03IleArg: 4.03 ± 0.199
8.594IleSer: 8.594 ± 0.307
6.137IleThr: 6.137 ± 0.234
5.181IleVal: 5.181 ± 0.25
0.411IleTrp: 0.411 ± 0.064
5.171IleTyr: 5.171 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
2.621LysAla: 2.621 ± 0.17
2.149LysCys: 2.149 ± 0.16
6.076LysAsp: 6.076 ± 0.266
6.405LysGlu: 6.405 ± 0.313
3.331LysPhe: 3.331 ± 0.18
2.971LysGly: 2.971 ± 0.185
2.118LysHis: 2.118 ± 0.129
8.409LysIle: 8.409 ± 0.33
7.916LysLys: 7.916 ± 0.302
7.392LysLeu: 7.392 ± 0.306
2.087LysMet: 2.087 ± 0.127
6.405LysAsn: 6.405 ± 0.273
2.077LysPro: 2.077 ± 0.159
2.159LysGln: 2.159 ± 0.167
3.608LysArg: 3.608 ± 0.185
5.921LysSer: 5.921 ± 0.256
4.112LysThr: 4.112 ± 0.192
4.328LysVal: 4.328 ± 0.206
0.617LysTrp: 0.617 ± 0.084
6.281LysTyr: 6.281 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
2.714LeuAla: 2.714 ± 0.182
2.087LeuCys: 2.087 ± 0.166
5.973LeuAsp: 5.973 ± 0.253
5.767LeuGlu: 5.767 ± 0.251
3.732LeuPhe: 3.732 ± 0.204
3.094LeuGly: 3.094 ± 0.188
3.074LeuHis: 3.074 ± 0.29
8.584LeuIle: 8.584 ± 0.332
7.649LeuLys: 7.649 ± 0.314
11.792LeuLeu: 11.792 ± 0.62
2.221LeuMet: 2.221 ± 0.149
5.078LeuAsn: 5.078 ± 0.274
2.724LeuPro: 2.724 ± 0.165
2.046LeuGln: 2.046 ± 0.134
3.393LeuArg: 3.393 ± 0.176
7.464LeuSer: 7.464 ± 0.298
4.379LeuThr: 4.379 ± 0.242
5.027LeuVal: 5.027 ± 0.22
0.319LeuTrp: 0.319 ± 0.057
5.243LeuTyr: 5.243 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
0.894MetAla: 0.894 ± 0.094
0.535MetCys: 0.535 ± 0.076
1.645MetAsp: 1.645 ± 0.146
1.778MetGlu: 1.778 ± 0.138
1.254MetPhe: 1.254 ± 0.112
0.894MetGly: 0.894 ± 0.085
0.432MetHis: 0.432 ± 0.065
2.19MetIle: 2.19 ± 0.145
1.943MetLys: 1.943 ± 0.108
2.406MetLeu: 2.406 ± 0.156
0.617MetMet: 0.617 ± 0.084
1.542MetAsn: 1.542 ± 0.126
0.679MetPro: 0.679 ± 0.081
0.452MetGln: 0.452 ± 0.056
0.802MetArg: 0.802 ± 0.094
2.087MetSer: 2.087 ± 0.145
0.997MetThr: 0.997 ± 0.1
1.367MetVal: 1.367 ± 0.109
0.134MetTrp: 0.134 ± 0.037
1.676MetTyr: 1.676 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
2.601AsnAla: 2.601 ± 0.185
1.46AsnCys: 1.46 ± 0.128
4.338AsnAsp: 4.338 ± 0.186
3.763AsnGlu: 3.763 ± 0.208
3.207AsnPhe: 3.207 ± 0.202
3.279AsnGly: 3.279 ± 0.191
1.182AsnHis: 1.182 ± 0.114
9.982AsnIle: 9.982 ± 0.394
7.299AsnLys: 7.299 ± 0.327
5.264AsnLeu: 5.264 ± 0.26
1.974AsnMet: 1.974 ± 0.141
8.245AsnAsn: 8.245 ± 0.38
1.871AsnPro: 1.871 ± 0.134
1.038AsnGln: 1.038 ± 0.113
3.136AsnArg: 3.136 ± 0.162
5.747AsnSer: 5.747 ± 0.266
5.366AsnThr: 5.366 ± 0.255
4.585AsnVal: 4.585 ± 0.23
0.473AsnTrp: 0.473 ± 0.064
4.184AsnTyr: 4.184 ± 0.22
0.0AsnXaa: 0.0 ± 0.0
Pro
0.699ProAla: 0.699 ± 0.086
0.576ProCys: 0.576 ± 0.087
1.645ProAsp: 1.645 ± 0.145
1.707ProGlu: 1.707 ± 0.128
1.388ProPhe: 1.388 ± 0.123
1.018ProGly: 1.018 ± 0.131
0.545ProHis: 0.545 ± 0.079
2.837ProIle: 2.837 ± 0.166
1.964ProLys: 1.964 ± 0.167
3.65ProLeu: 3.65 ± 0.271
0.545ProMet: 0.545 ± 0.067
2.159ProAsn: 2.159 ± 0.145
0.956ProPro: 0.956 ± 0.097
0.463ProGln: 0.463 ± 0.076
1.038ProArg: 1.038 ± 0.116
2.097ProSer: 2.097 ± 0.159
1.234ProThr: 1.234 ± 0.126
1.676ProVal: 1.676 ± 0.131
0.236ProTrp: 0.236 ± 0.042
1.696ProTyr: 1.696 ± 0.138
0.0ProXaa: 0.0 ± 0.0
Gln
0.74GlnAla: 0.74 ± 0.093
0.391GlnCys: 0.391 ± 0.053
1.295GlnAsp: 1.295 ± 0.107
1.172GlnGlu: 1.172 ± 0.105
0.668GlnPhe: 0.668 ± 0.093
0.74GlnGly: 0.74 ± 0.08
0.493GlnHis: 0.493 ± 0.065
1.326GlnIle: 1.326 ± 0.116
1.583GlnLys: 1.583 ± 0.129
2.107GlnLeu: 2.107 ± 0.136
0.463GlnMet: 0.463 ± 0.068
1.234GlnAsn: 1.234 ± 0.1
0.442GlnPro: 0.442 ± 0.095
0.658GlnGln: 0.658 ± 0.091
0.812GlnArg: 0.812 ± 0.085
1.398GlnSer: 1.398 ± 0.133
0.72GlnThr: 0.72 ± 0.087
0.73GlnVal: 0.73 ± 0.086
0.072GlnTrp: 0.072 ± 0.026
1.254GlnTyr: 1.254 ± 0.113
0.0GlnXaa: 0.0 ± 0.0
Arg
1.131ArgAla: 1.131 ± 0.102
0.915ArgCys: 0.915 ± 0.095
2.478ArgAsp: 2.478 ± 0.166
2.293ArgGlu: 2.293 ± 0.161
1.758ArgPhe: 1.758 ± 0.125
1.45ArgGly: 1.45 ± 0.16
0.956ArgHis: 0.956 ± 0.119
3.269ArgIle: 3.269 ± 0.17
3.156ArgLys: 3.156 ± 0.205
3.423ArgLeu: 3.423 ± 0.161
0.987ArgMet: 0.987 ± 0.094
3.136ArgAsn: 3.136 ± 0.185
0.761ArgPro: 0.761 ± 0.102
0.946ArgGln: 0.946 ± 0.109
2.19ArgArg: 2.19 ± 0.199
2.776ArgSer: 2.776 ± 0.201
2.046ArgThr: 2.046 ± 0.152
1.871ArgVal: 1.871 ± 0.143
0.38ArgTrp: 0.38 ± 0.065
2.786ArgTyr: 2.786 ± 0.169
0.0ArgXaa: 0.0 ± 0.0
Ser
2.19SerAla: 2.19 ± 0.154
1.84SerCys: 1.84 ± 0.135
4.421SerAsp: 4.421 ± 0.232
3.495SerGlu: 3.495 ± 0.204
3.413SerPhe: 3.413 ± 0.196
3.362SerGly: 3.362 ± 0.257
1.172SerHis: 1.172 ± 0.108
8.512SerIle: 8.512 ± 0.343
6.672SerLys: 6.672 ± 0.249
6.96SerLeu: 6.96 ± 0.292
1.809SerMet: 1.809 ± 0.132
5.86SerAsn: 5.86 ± 0.265
2.529SerPro: 2.529 ± 0.165
1.306SerGln: 1.306 ± 0.134
2.776SerArg: 2.776 ± 0.185
6.806SerSer: 6.806 ± 0.459
4.05SerThr: 4.05 ± 0.235
4.585SerVal: 4.585 ± 0.248
0.442SerTrp: 0.442 ± 0.066
4.647SerTyr: 4.647 ± 0.233
0.0SerXaa: 0.0 ± 0.0
Thr
2.056ThrAla: 2.056 ± 0.16
1.676ThrCys: 1.676 ± 0.155
3.31ThrAsp: 3.31 ± 0.191
3.156ThrGlu: 3.156 ± 0.181
2.138ThrPhe: 2.138 ± 0.148
2.138ThrGly: 2.138 ± 0.153
0.864ThrHis: 0.864 ± 0.097
4.955ThrIle: 4.955 ± 0.216
4.194ThrLys: 4.194 ± 0.208
4.77ThrLeu: 4.77 ± 0.205
1.131ThrMet: 1.131 ± 0.121
3.321ThrAsn: 3.321 ± 0.185
2.961ThrPro: 2.961 ± 0.218
0.915ThrGln: 0.915 ± 0.102
1.778ThrArg: 1.778 ± 0.137
4.585ThrSer: 4.585 ± 0.275
2.848ThrThr: 2.848 ± 0.218
3.835ThrVal: 3.835 ± 0.232
0.535ThrTrp: 0.535 ± 0.086
2.817ThrTyr: 2.817 ± 0.139
0.0ThrXaa: 0.0 ± 0.0
Val
1.881ValAla: 1.881 ± 0.144
1.429ValCys: 1.429 ± 0.137
3.279ValAsp: 3.279 ± 0.159
2.93ValGlu: 2.93 ± 0.173
2.591ValPhe: 2.591 ± 0.16
1.645ValGly: 1.645 ± 0.175
0.966ValHis: 0.966 ± 0.085
5.716ValIle: 5.716 ± 0.298
5.726ValLys: 5.726 ± 0.264
5.243ValLeu: 5.243 ± 0.218
1.388ValMet: 1.388 ± 0.118
4.945ValAsn: 4.945 ± 0.257
1.429ValPro: 1.429 ± 0.106
0.812ValGln: 0.812 ± 0.087
2.467ValArg: 2.467 ± 0.155
4.873ValSer: 4.873 ± 0.24
3.372ValThr: 3.372 ± 0.201
2.889ValVal: 2.889 ± 0.175
0.37ValTrp: 0.37 ± 0.062
2.786ValTyr: 2.786 ± 0.179
0.0ValXaa: 0.0 ± 0.0
Trp
0.216TrpAla: 0.216 ± 0.064
0.144TrpCys: 0.144 ± 0.036
0.257TrpAsp: 0.257 ± 0.053
0.35TrpGlu: 0.35 ± 0.058
0.288TrpPhe: 0.288 ± 0.05
0.123TrpGly: 0.123 ± 0.04
0.103TrpHis: 0.103 ± 0.033
0.915TrpIle: 0.915 ± 0.109
0.627TrpLys: 0.627 ± 0.075
0.709TrpLeu: 0.709 ± 0.091
0.267TrpMet: 0.267 ± 0.056
0.411TrpAsn: 0.411 ± 0.063
0.175TrpPro: 0.175 ± 0.05
0.123TrpGln: 0.123 ± 0.035
0.247TrpArg: 0.247 ± 0.051
0.391TrpSer: 0.391 ± 0.056
0.473TrpThr: 0.473 ± 0.065
0.339TrpVal: 0.339 ± 0.052
0.103TrpTrp: 0.103 ± 0.054
0.288TrpTyr: 0.288 ± 0.056
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.2TyrAla: 2.2 ± 0.188
1.378TyrCys: 1.378 ± 0.135
3.783TyrAsp: 3.783 ± 0.198
3.105TyrGlu: 3.105 ± 0.174
2.447TyrPhe: 2.447 ± 0.158
3.228TyrGly: 3.228 ± 0.178
1.151TyrHis: 1.151 ± 0.112
5.932TyrIle: 5.932 ± 0.242
4.852TyrLys: 4.852 ± 0.213
5.007TyrLeu: 5.007 ± 0.208
1.614TyrMet: 1.614 ± 0.135
5.222TyrAsn: 5.222 ± 0.225
1.408TyrPro: 1.408 ± 0.108
0.915TyrGln: 0.915 ± 0.089
2.385TyrArg: 2.385 ± 0.15
4.441TyrSer: 4.441 ± 0.2
3.218TyrThr: 3.218 ± 0.195
3.228TyrVal: 3.228 ± 0.203
0.535TyrTrp: 0.535 ± 0.066
3.608TyrTyr: 3.608 ± 0.213
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 305 proteins (97274 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski