Amino acid dipeptide frequency for Salmon gill poxvirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
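As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa); the function name `dipeptide_permille` is hypothetical and this is not necessarily how the database computes its values. Results are in per mille (parts per thousand), the unit used for the table.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {pair: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and every non-standard residue is replaced by 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Two toy sequences; 'U' is non-standard and becomes 'X'.
    print(dipeptide_permille(["AAC", "ACU"]))
```

Under this counting scheme, a proteome of 206 proteins and 75061 amino acids yields 75061 - 206 = 74855 pairs, so a single occurrence corresponds to about 0.013‰. That agrees with the smallest non-zero values in the table, which is a consistency check rather than a confirmation of the original method.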

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
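The uniform baseline and the unit conversion can be written out as a short Python sketch. The 2.5‰ threshold follows directly from 1/400; whether the table colours its cells against exactly this threshold is an assumption, and the names `permille_to_fraction` and `classify` are illustrative only.

```python
N_STANDARD = 20  # standard amino acids (21 if Xaa is included)

# Uniform expectation: every ordered pair of residues equally likely.
EXPECTED_FRACTION = 1 / N_STANDARD**2      # 0.0025, i.e. 0.25%
EXPECTED_PERMILLE = 1000 / N_STANDARD**2   # 2.5 per mille


def permille_to_fraction(value):
    """Convert a table value given in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_permille, baseline=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform baseline (assumed threshold)."""
    if value_permille > baseline:
        return "over"    # shown in red on the original page
    if value_permille < baseline:
        return "under"   # shown in blue on the original page
    return "expected"


if __name__ == "__main__":
    # LysLys (8.034) and TrpTrp (0.173) are taken from the table.
    for name, value in [("LysLys", 8.034), ("TrpTrp", 0.173)]:
        print(name, classify(value), permille_to_fraction(value))
```

With this baseline, LysLys at 8.034‰ is more than three times the uniform expectation, while TrpTrp at 0.173‰ is far below it.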

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.306 ± 0.203
AlaCys: 0.639 ± 0.082
AlaAsp: 2.052 ± 0.249
AlaGlu: 1.426 ± 0.15
AlaPhe: 1.505 ± 0.165
AlaGly: 1.772 ± 0.177
AlaHis: 0.733 ± 0.114
AlaIle: 1.772 ± 0.166
AlaLys: 1.958 ± 0.194
AlaLeu: 2.478 ± 0.181
AlaMet: 0.933 ± 0.113
AlaAsn: 1.825 ± 0.262
AlaPro: 1.052 ± 0.158
AlaGln: 0.879 ± 0.104
AlaArg: 1.106 ± 0.129
AlaSer: 1.918 ± 0.18
AlaThr: 1.665 ± 0.17
AlaVal: 2.185 ± 0.169
AlaTrp: 0.48 ± 0.085
AlaTyr: 1.266 ± 0.137
AlaXaa: 0.013 ± 0.011
Cys
CysAla: 0.546 ± 0.077
CysCys: 0.44 ± 0.078
CysAsp: 1.106 ± 0.131
CysGlu: 0.893 ± 0.121
CysPhe: 0.853 ± 0.138
CysGly: 1.026 ± 0.115
CysHis: 0.44 ± 0.086
CysIle: 1.226 ± 0.151
CysLys: 1.652 ± 0.135
CysLeu: 1.719 ± 0.172
CysMet: 0.693 ± 0.089
CysAsn: 1.172 ± 0.141
CysPro: 1.359 ± 0.137
CysGln: 0.653 ± 0.09
CysArg: 0.6 ± 0.091
CysSer: 1.199 ± 0.131
CysThr: 1.306 ± 0.166
CysVal: 1.572 ± 0.171
CysTrp: 0.28 ± 0.052
CysTyr: 0.48 ± 0.081
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.599 ± 0.129
AspCys: 1.013 ± 0.133
AspAsp: 4.197 ± 0.305
AspGlu: 3.384 ± 0.251
AspPhe: 3.864 ± 0.233
AspGly: 2.305 ± 0.194
AspHis: 2.105 ± 0.193
AspIle: 5.529 ± 0.289
AspLys: 4.077 ± 0.187
AspLeu: 5.809 ± 0.319
AspMet: 2.465 ± 0.18
AspAsn: 3.957 ± 0.261
AspPro: 3.171 ± 0.335
AspGln: 2.398 ± 0.196
AspArg: 2.265 ± 0.167
AspSer: 4.903 ± 0.316
AspThr: 4.263 ± 0.208
AspVal: 4.01 ± 0.217
AspTrp: 0.866 ± 0.102
AspTyr: 3.211 ± 0.338
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.532 ± 0.162
GluCys: 0.893 ± 0.129
GluAsp: 2.904 ± 0.264
GluGlu: 3.477 ± 0.384
GluPhe: 3.237 ± 0.324
GluGly: 1.519 ± 0.167
GluHis: 1.785 ± 0.167
GluIle: 4.743 ± 0.257
GluLys: 4.33 ± 0.222
GluLeu: 5.382 ± 0.319
GluMet: 1.545 ± 0.163
GluAsn: 3.997 ± 0.269
GluPro: 1.732 ± 0.16
GluGln: 1.825 ± 0.152
GluArg: 2.132 ± 0.433
GluSer: 4.716 ± 0.328
GluThr: 4.37 ± 0.276
GluVal: 3.171 ± 0.378
GluTrp: 0.839 ± 0.09
GluTyr: 2.918 ± 0.155
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.199 ± 0.149
PheCys: 1.039 ± 0.111
PheAsp: 3.357 ± 0.246
PheGlu: 2.625 ± 0.196
PhePhe: 2.345 ± 0.187
PheGly: 2.625 ± 0.171
PheHis: 1.013 ± 0.126
PheIle: 4.13 ± 0.342
PheLys: 3.784 ± 0.254
PheLeu: 4.303 ± 0.331
PheMet: 1.439 ± 0.149
PheAsn: 3.144 ± 0.202
PhePro: 1.985 ± 0.273
PheGln: 0.986 ± 0.126
PheArg: 1.612 ± 0.145
PheSer: 3.904 ± 0.228
PheThr: 3.89 ± 0.33
PheVal: 3.157 ± 0.196
PheTrp: 0.573 ± 0.09
PheTyr: 2.145 ± 0.148
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.732 ± 0.186
GlyCys: 0.799 ± 0.116
GlyAsp: 2.238 ± 0.157
GlyGlu: 1.985 ± 0.198
GlyPhe: 2.385 ± 0.177
GlyGly: 2.505 ± 0.318
GlyHis: 1.132 ± 0.123
GlyIle: 3.584 ± 0.228
GlyLys: 4.157 ± 0.249
GlyLeu: 4.023 ± 0.353
GlyMet: 1.585 ± 0.185
GlyAsn: 2.984 ± 0.371
GlyPro: 1.679 ± 0.173
GlyGln: 1.599 ± 0.168
GlyArg: 1.679 ± 0.143
GlySer: 3.57 ± 0.328
GlyThr: 3.091 ± 0.216
GlyVal: 2.811 ± 0.209
GlyTrp: 0.493 ± 0.092
GlyTyr: 1.812 ± 0.151
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.799 ± 0.107
HisCys: 0.573 ± 0.102
HisAsp: 2.105 ± 0.184
HisGlu: 1.279 ± 0.135
HisPhe: 0.933 ± 0.106
HisGly: 1.159 ± 0.112
HisHis: 0.733 ± 0.092
HisIle: 1.572 ± 0.138
HisLys: 1.812 ± 0.168
HisLeu: 1.665 ± 0.149
HisMet: 0.853 ± 0.104
HisAsn: 1.399 ± 0.126
HisPro: 1.092 ± 0.135
HisGln: 0.906 ± 0.25
HisArg: 1.013 ± 0.128
HisSer: 1.426 ± 0.153
HisThr: 1.705 ± 0.223
HisVal: 2.132 ± 0.183
HisTrp: 0.4 ± 0.067
HisTyr: 0.613 ± 0.085
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.759 ± 0.166
IleCys: 1.386 ± 0.164
IleAsp: 5.103 ± 0.221
IleGlu: 4.303 ± 0.353
IlePhe: 3.637 ± 0.243
IleGly: 3.104 ± 0.197
IleHis: 1.625 ± 0.153
IleIle: 5.835 ± 0.361
IleLys: 6.968 ± 0.366
IleLeu: 7.088 ± 0.377
IleMet: 2.705 ± 0.193
IleAsn: 5.409 ± 0.336
IlePro: 3.664 ± 0.221
IleGln: 2.252 ± 0.179
IleArg: 2.918 ± 0.221
IleSer: 6.182 ± 0.307
IleThr: 5.023 ± 0.211
IleVal: 4.063 ± 0.268
IleTrp: 1.039 ± 0.128
IleTyr: 2.944 ± 0.202
IleXaa: 0.013 ± 0.013
Lys
LysAla: 2.238 ± 0.18
LysCys: 1.426 ± 0.131
LysAsp: 3.277 ± 0.207
LysGlu: 4.237 ± 0.228
LysPhe: 3.757 ± 0.207
LysGly: 2.451 ± 0.193
LysHis: 2.225 ± 0.2
LysIle: 7.661 ± 0.389
LysLys: 8.034 ± 0.47
LysLeu: 6.901 ± 0.334
LysMet: 2.358 ± 0.191
LysAsn: 6.728 ± 0.36
LysPro: 2.771 ± 0.341
LysGln: 2.478 ± 0.172
LysArg: 2.545 ± 0.2
LysSer: 6.155 ± 0.314
LysThr: 6.608 ± 0.336
LysVal: 3.73 ± 0.245
LysTrp: 0.799 ± 0.101
LysTyr: 3.544 ± 0.226
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.718 ± 0.2
LeuCys: 1.479 ± 0.154
LeuAsp: 5.156 ± 0.268
LeuGlu: 5.356 ± 0.347
LeuPhe: 4.476 ± 0.264
LeuGly: 4.01 ± 0.262
LeuHis: 1.998 ± 0.249
LeuIle: 6.155 ± 0.386
LeuLys: 6.581 ± 0.345
LeuLeu: 7.128 ± 0.355
LeuMet: 2.212 ± 0.158
LeuAsn: 5.143 ± 0.287
LeuPro: 3.677 ± 0.243
LeuGln: 2.571 ± 0.205
LeuArg: 2.838 ± 0.219
LeuSer: 7.208 ± 0.355
LeuThr: 6.062 ± 0.291
LeuVal: 5.342 ± 0.332
LeuTrp: 0.959 ± 0.106
LeuTyr: 3.917 ± 0.291
LeuXaa: 0.013 ± 0.016
Met
MetAla: 1.279 ± 0.114
MetCys: 0.573 ± 0.091
MetAsp: 1.865 ± 0.186
MetGlu: 1.958 ± 0.164
MetPhe: 1.825 ± 0.143
MetGly: 1.199 ± 0.15
MetHis: 0.266 ± 0.054
MetIle: 2.784 ± 0.193
MetLys: 2.318 ± 0.172
MetLeu: 2.411 ± 0.174
MetMet: 0.706 ± 0.109
MetAsn: 1.652 ± 0.162
MetPro: 0.493 ± 0.073
MetGln: 0.36 ± 0.068
MetArg: 0.586 ± 0.091
MetSer: 2.971 ± 0.21
MetThr: 3.078 ± 0.228
MetVal: 1.865 ± 0.166
MetTrp: 0.346 ± 0.066
MetTyr: 1.719 ± 0.136
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.878 ± 0.278
AsnCys: 1.092 ± 0.195
AsnAsp: 3.57 ± 0.207
AsnGlu: 2.958 ± 0.21
AsnPhe: 3.011 ± 0.222
AsnGly: 2.758 ± 0.212
AsnHis: 1.612 ± 0.149
AsnIle: 6.342 ± 0.412
AsnLys: 5.622 ± 0.374
AsnLeu: 5.036 ± 0.332
AsnMet: 2.331 ± 0.178
AsnAsn: 4.849 ± 0.343
AsnPro: 3.038 ± 0.322
AsnGln: 1.998 ± 0.192
AsnArg: 1.958 ± 0.147
AsnSer: 4.197 ± 0.292
AsnThr: 4.49 ± 0.441
AsnVal: 4.57 ± 0.257
AsnTrp: 0.933 ± 0.109
AsnTyr: 2.744 ± 0.22
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.306 ± 0.161
ProCys: 0.613 ± 0.102
ProAsp: 4.077 ± 0.246
ProGlu: 3.251 ± 0.262
ProPhe: 1.585 ± 0.139
ProGly: 2.931 ± 0.337
ProHis: 0.799 ± 0.11
ProIle: 2.371 ± 0.168
ProLys: 2.345 ± 0.19
ProLeu: 3.104 ± 0.201
ProMet: 1.132 ± 0.11
ProAsn: 2.531 ± 0.354
ProPro: 2.158 ± 0.342
ProGln: 0.639 ± 0.094
ProArg: 1.479 ± 0.186
ProSer: 2.358 ± 0.189
ProThr: 2.678 ± 0.21
ProVal: 3.331 ± 0.308
ProTrp: 0.679 ± 0.152
ProTyr: 1.719 ± 0.266
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.959 ± 0.136
GlnCys: 0.533 ± 0.078
GlnAsp: 1.785 ± 0.15
GlnGlu: 2.411 ± 0.359
GlnPhe: 1.212 ± 0.139
GlnGly: 1.572 ± 0.332
GlnHis: 0.773 ± 0.118
GlnIle: 2.252 ± 0.166
GlnLys: 2.172 ± 0.185
GlnLeu: 2.718 ± 0.231
GlnMet: 0.879 ± 0.127
GlnAsn: 1.625 ± 0.136
GlnPro: 0.826 ± 0.109
GlnGln: 1.146 ± 0.2
GlnArg: 0.959 ± 0.136
GlnSer: 2.092 ± 0.174
GlnThr: 2.238 ± 0.17
GlnVal: 1.865 ± 0.16
GlnTrp: 0.32 ± 0.058
GlnTyr: 1.119 ± 0.139
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.906 ± 0.14
ArgCys: 0.759 ± 0.124
ArgAsp: 2.225 ± 0.206
ArgGlu: 2.038 ± 0.177
ArgPhe: 1.692 ± 0.127
ArgGly: 1.532 ± 0.175
ArgHis: 1.079 ± 0.111
ArgIle: 2.331 ± 0.156
ArgLys: 2.984 ± 0.247
ArgLeu: 3.078 ± 0.222
ArgMet: 0.746 ± 0.099
ArgAsn: 2.305 ± 0.205
ArgPro: 1.745 ± 0.315
ArgGln: 1.146 ± 0.143
ArgArg: 1.532 ± 0.168
ArgSer: 2.691 ± 0.207
ArgThr: 2.278 ± 0.187
ArgVal: 2.451 ± 0.168
ArgTrp: 0.32 ± 0.061
ArgTyr: 1.306 ± 0.146
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.985 ± 0.207
SerCys: 1.665 ± 0.157
SerAsp: 6.981 ± 0.353
SerGlu: 4.836 ± 0.352
SerPhe: 2.998 ± 0.211
SerGly: 4.063 ± 0.233
SerHis: 1.585 ± 0.139
SerIle: 4.943 ± 0.27
SerLys: 5.516 ± 0.313
SerLeu: 6.168 ± 0.347
SerMet: 2.278 ± 0.202
SerAsn: 4.143 ± 0.224
SerPro: 2.811 ± 0.233
SerGln: 2.198 ± 0.174
SerArg: 3.118 ± 0.274
SerSer: 5.782 ± 0.398
SerThr: 5.076 ± 0.323
SerVal: 5.782 ± 0.336
SerTrp: 1.132 ± 0.132
SerTyr: 2.625 ± 0.203
SerXaa: 0.013 ± 0.013
Thr
ThrAla: 1.985 ± 0.175
ThrCys: 1.785 ± 0.157
ThrAsp: 5.596 ± 0.381
ThrGlu: 4.969 ± 0.359
ThrPhe: 3.411 ± 0.241
ThrGly: 4.41 ± 0.304
ThrHis: 1.878 ± 0.287
ThrIle: 4.73 ± 0.207
ThrLys: 5.436 ± 0.256
ThrLeu: 6.222 ± 0.346
ThrMet: 2.025 ± 0.131
ThrAsn: 3.504 ± 0.242
ThrPro: 2.585 ± 0.186
ThrGln: 1.892 ± 0.223
ThrArg: 2.531 ± 0.196
ThrSer: 5.209 ± 0.354
ThrThr: 5.742 ± 0.474
ThrVal: 4.37 ± 0.249
ThrTrp: 0.933 ± 0.109
ThrTyr: 2.585 ± 0.162
ThrXaa: 0.013 ± 0.011
Val
ValAla: 1.799 ± 0.175
ValCys: 1.585 ± 0.141
ValAsp: 3.837 ± 0.261
ValGlu: 2.971 ± 0.203
ValPhe: 3.797 ± 0.33
ValGly: 2.705 ± 0.248
ValHis: 1.399 ± 0.145
ValIle: 4.849 ± 0.281
ValLys: 5.529 ± 0.255
ValLeu: 5.489 ± 0.253
ValMet: 1.812 ± 0.152
ValAsn: 4.41 ± 0.271
ValPro: 3.157 ± 0.324
ValGln: 2.025 ± 0.165
ValArg: 2.318 ± 0.214
ValSer: 4.943 ± 0.293
ValThr: 4.183 ± 0.224
ValVal: 4.77 ± 0.342
ValTrp: 0.773 ± 0.105
ValTyr: 2.665 ± 0.212
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.506 ± 0.087
TrpCys: 0.253 ± 0.056
TrpAsp: 0.906 ± 0.103
TrpGlu: 0.626 ± 0.097
TrpPhe: 0.679 ± 0.1
TrpGly: 0.333 ± 0.071
TrpHis: 0.187 ± 0.044
TrpIle: 0.866 ± 0.116
TrpLys: 1.572 ± 0.157
TrpLeu: 0.906 ± 0.136
TrpMet: 0.32 ± 0.063
TrpAsn: 0.986 ± 0.097
TrpPro: 0.386 ± 0.068
TrpGln: 0.213 ± 0.05
TrpArg: 0.506 ± 0.108
TrpSer: 0.826 ± 0.113
TrpThr: 1.465 ± 0.169
TrpVal: 0.813 ± 0.119
TrpTrp: 0.173 ± 0.042
TrpTyr: 0.453 ± 0.084
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.933 ± 0.128
TyrCys: 0.666 ± 0.09
TyrAsp: 3.051 ± 0.186
TyrGlu: 2.065 ± 0.163
TyrPhe: 2.052 ± 0.172
TyrGly: 1.865 ± 0.149
TyrHis: 0.799 ± 0.107
TyrIle: 3.344 ± 0.327
TyrLys: 3.051 ± 0.179
TyrLeu: 3.371 ± 0.238
TyrMet: 1.226 ± 0.148
TyrAsn: 3.211 ± 0.274
TyrPro: 1.705 ± 0.168
TyrGln: 1.292 ± 0.286
TyrArg: 1.452 ± 0.142
TyrSer: 3.437 ± 0.221
TyrThr: 2.598 ± 0.214
TyrVal: 3.038 ± 0.203
TyrTrp: 0.639 ± 0.105
TyrTyr: 1.892 ± 0.151
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.027 ± 0.017
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.013 ± 0.013
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.013 ± 0.016
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.013 ± 0.011
XaaXaa: 0.0 ± 0.0
Statistics based on 206 proteins (75061 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
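A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the database's actual code: it assumes whole proteins are resampled with replacement, that the statistic is recomputed on each resampled set, and that the reported error is the sample standard deviation across replicates. The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, counting overlapping
    adjacent pairs within each protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency over n_rounds resampled proteomes.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLA", "AKKKT", "MSTLL", "GKKAV"]
    print(pair_permille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.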

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski