Amino acid dipepetide frequency for Moosepox virus GoldyGopher14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.377AlaAla: 1.377 ± 0.206
0.452AlaCys: 0.452 ± 0.096
1.652AlaAsp: 1.652 ± 0.173
1.259AlaGlu: 1.259 ± 0.127
1.18AlaPhe: 1.18 ± 0.169
1.023AlaGly: 1.023 ± 0.229
0.511AlaHis: 0.511 ± 0.092
3.205AlaIle: 3.205 ± 0.237
2.065AlaLys: 2.065 ± 0.175
2.792AlaLeu: 2.792 ± 0.228
0.649AlaMet: 0.649 ± 0.122
2.045AlaAsn: 2.045 ± 0.194
0.747AlaPro: 0.747 ± 0.14
0.511AlaGln: 0.511 ± 0.095
1.16AlaArg: 1.16 ± 0.16
2.635AlaSer: 2.635 ± 0.299
1.731AlaThr: 1.731 ± 0.17
1.593AlaVal: 1.593 ± 0.166
0.216AlaTrp: 0.216 ± 0.076
1.495AlaTyr: 1.495 ± 0.155
0.0AlaXaa: 0.0 ± 0.0
Cys
0.629CysAla: 0.629 ± 0.128
0.57CysCys: 0.57 ± 0.117
1.259CysAsp: 1.259 ± 0.164
0.983CysGlu: 0.983 ± 0.131
0.983CysPhe: 0.983 ± 0.139
1.023CysGly: 1.023 ± 0.111
0.236CysHis: 0.236 ± 0.073
2.458CysIle: 2.458 ± 0.218
1.514CysLys: 1.514 ± 0.181
1.514CysLeu: 1.514 ± 0.177
0.492CysMet: 0.492 ± 0.102
2.104CysAsn: 2.104 ± 0.207
0.57CysPro: 0.57 ± 0.101
0.374CysGln: 0.374 ± 0.094
0.452CysArg: 0.452 ± 0.085
1.652CysSer: 1.652 ± 0.179
1.042CysThr: 1.042 ± 0.15
1.18CysVal: 1.18 ± 0.16
0.157CysTrp: 0.157 ± 0.065
1.18CysTyr: 1.18 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
1.672AspAla: 1.672 ± 0.172
0.905AspCys: 0.905 ± 0.152
4.484AspAsp: 4.484 ± 0.303
4.897AspGlu: 4.897 ± 0.331
3.127AspPhe: 3.127 ± 0.258
2.438AspGly: 2.438 ± 0.227
0.826AspHis: 0.826 ± 0.132
8.594AspIle: 8.594 ± 0.493
5.31AspLys: 5.31 ± 0.403
4.444AspLeu: 4.444 ± 0.313
1.829AspMet: 1.829 ± 0.191
4.818AspAsn: 4.818 ± 0.32
1.357AspPro: 1.357 ± 0.162
1.101AspGln: 1.101 ± 0.147
1.672AspArg: 1.672 ± 0.202
3.481AspSer: 3.481 ± 0.245
3.068AspThr: 3.068 ± 0.294
4.012AspVal: 4.012 ± 0.261
0.433AspTrp: 0.433 ± 0.09
3.225AspTyr: 3.225 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
1.357GluAla: 1.357 ± 0.157
1.18GluCys: 1.18 ± 0.165
4.208GluAsp: 4.208 ± 0.336
4.051GluGlu: 4.051 ± 0.373
2.615GluPhe: 2.615 ± 0.23
1.18GluGly: 1.18 ± 0.126
0.924GluHis: 0.924 ± 0.135
5.742GluIle: 5.742 ± 0.325
5.683GluLys: 5.683 ± 0.389
5.034GluLeu: 5.034 ± 0.341
1.416GluMet: 1.416 ± 0.175
5.133GluAsn: 5.133 ± 0.282
1.436GluPro: 1.436 ± 0.169
1.18GluGln: 1.18 ± 0.172
1.652GluArg: 1.652 ± 0.201
4.208GluSer: 4.208 ± 0.267
3.441GluThr: 3.441 ± 0.235
2.34GluVal: 2.34 ± 0.245
0.275GluTrp: 0.275 ± 0.072
3.304GluTyr: 3.304 ± 0.247
0.0GluXaa: 0.0 ± 0.0
Phe
1.121PheAla: 1.121 ± 0.151
1.082PheCys: 1.082 ± 0.139
3.638PheAsp: 3.638 ± 0.326
2.438PheGlu: 2.438 ± 0.262
2.655PhePhe: 2.655 ± 0.216
2.222PheGly: 2.222 ± 0.241
0.865PheHis: 0.865 ± 0.148
6.273PheIle: 6.273 ± 0.401
4.11PheLys: 4.11 ± 0.256
5.015PheLeu: 5.015 ± 0.363
1.534PheMet: 1.534 ± 0.171
4.444PheAsn: 4.444 ± 0.352
1.731PhePro: 1.731 ± 0.151
0.826PheGln: 0.826 ± 0.118
1.554PheArg: 1.554 ± 0.158
4.838PheSer: 4.838 ± 0.357
2.714PheThr: 2.714 ± 0.254
2.812PheVal: 2.812 ± 0.229
0.472PheTrp: 0.472 ± 0.11
2.773PheTyr: 2.773 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
1.436GlyAla: 1.436 ± 0.189
0.629GlyCys: 0.629 ± 0.108
2.203GlyAsp: 2.203 ± 0.209
1.986GlyGlu: 1.986 ± 0.214
2.399GlyPhe: 2.399 ± 0.277
1.868GlyGly: 1.868 ± 0.251
0.511GlyHis: 0.511 ± 0.1
4.346GlyIle: 4.346 ± 0.309
3.441GlyLys: 3.441 ± 0.26
2.537GlyLeu: 2.537 ± 0.189
0.728GlyMet: 0.728 ± 0.133
3.363GlyAsn: 3.363 ± 0.307
0.728GlyPro: 0.728 ± 0.101
0.531GlyGln: 0.531 ± 0.094
1.337GlyArg: 1.337 ± 0.161
2.38GlySer: 2.38 ± 0.217
1.691GlyThr: 1.691 ± 0.22
1.79GlyVal: 1.79 ± 0.199
0.256GlyTrp: 0.256 ± 0.077
2.124GlyTyr: 2.124 ± 0.211
0.0GlyXaa: 0.0 ± 0.0
His
0.472HisAla: 0.472 ± 0.087
0.59HisCys: 0.59 ± 0.125
0.885HisAsp: 0.885 ± 0.132
0.964HisGlu: 0.964 ± 0.133
0.747HisPhe: 0.747 ± 0.135
0.806HisGly: 0.806 ± 0.128
0.315HisHis: 0.315 ± 0.084
2.497HisIle: 2.497 ± 0.203
1.377HisLys: 1.377 ± 0.175
1.514HisLeu: 1.514 ± 0.179
0.551HisMet: 0.551 ± 0.118
1.534HisAsn: 1.534 ± 0.17
0.551HisPro: 0.551 ± 0.113
0.433HisGln: 0.433 ± 0.08
0.649HisArg: 0.649 ± 0.101
0.983HisSer: 0.983 ± 0.127
0.767HisThr: 0.767 ± 0.128
1.003HisVal: 1.003 ± 0.141
0.157HisTrp: 0.157 ± 0.054
0.806HisTyr: 0.806 ± 0.107
0.0HisXaa: 0.0 ± 0.0
Ile
2.93IleAla: 2.93 ± 0.229
2.301IleCys: 2.301 ± 0.227
7.06IleAsp: 7.06 ± 0.33
5.939IleGlu: 5.939 ± 0.39
5.939IlePhe: 5.939 ± 0.431
3.54IleGly: 3.54 ± 0.285
2.222IleHis: 2.222 ± 0.196
11.839IleIle: 11.839 ± 0.653
10.619IleLys: 10.619 ± 0.578
11.013IleLeu: 11.013 ± 0.555
2.124IleMet: 2.124 ± 0.198
10.659IleAsn: 10.659 ± 0.473
3.579IlePro: 3.579 ± 0.29
1.888IleGln: 1.888 ± 0.164
3.284IleArg: 3.284 ± 0.279
9.184IleSer: 9.184 ± 0.404
6.254IleThr: 6.254 ± 0.362
5.703IleVal: 5.703 ± 0.278
0.551IleTrp: 0.551 ± 0.108
5.624IleTyr: 5.624 ± 0.338
0.0IleXaa: 0.0 ± 0.0
Lys
2.144LysAla: 2.144 ± 0.237
1.495LysCys: 1.495 ± 0.188
5.074LysAsp: 5.074 ± 0.337
4.916LysGlu: 4.916 ± 0.255
4.051LysPhe: 4.051 ± 0.337
2.773LysGly: 2.773 ± 0.231
1.632LysHis: 1.632 ± 0.185
10.088LysIle: 10.088 ± 0.6
9.302LysLys: 9.302 ± 0.467
8.043LysLeu: 8.043 ± 0.382
2.438LysMet: 2.438 ± 0.25
8.259LysAsn: 8.259 ± 0.417
2.399LysPro: 2.399 ± 0.25
2.203LysGln: 2.203 ± 0.209
3.52LysArg: 3.52 ± 0.26
6.116LysSer: 6.116 ± 0.309
5.211LysThr: 5.211 ± 0.307
4.031LysVal: 4.031 ± 0.304
0.669LysTrp: 0.669 ± 0.139
5.998LysTyr: 5.998 ± 0.426
0.0LysXaa: 0.0 ± 0.0
Leu
2.812LeuAla: 2.812 ± 0.243
1.672LeuCys: 1.672 ± 0.155
5.231LeuAsp: 5.231 ± 0.39
5.664LeuGlu: 5.664 ± 0.374
5.074LeuPhe: 5.074 ± 0.315
3.068LeuGly: 3.068 ± 0.266
1.613LeuHis: 1.613 ± 0.209
9.439LeuIle: 9.439 ± 0.441
7.335LeuLys: 7.335 ± 0.428
9.793LeuLeu: 9.793 ± 0.558
2.144LeuMet: 2.144 ± 0.228
6.627LeuAsn: 6.627 ± 0.403
2.733LeuPro: 2.733 ± 0.224
2.065LeuGln: 2.065 ± 0.176
2.753LeuArg: 2.753 ± 0.294
8.2LeuSer: 8.2 ± 0.37
5.231LeuThr: 5.231 ± 0.328
4.169LeuVal: 4.169 ± 0.293
0.393LeuTrp: 0.393 ± 0.082
4.582LeuTyr: 4.582 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
1.101MetAla: 1.101 ± 0.15
0.354MetCys: 0.354 ± 0.082
1.652MetAsp: 1.652 ± 0.17
1.554MetGlu: 1.554 ± 0.177
1.652MetPhe: 1.652 ± 0.194
0.924MetGly: 0.924 ± 0.116
0.315MetHis: 0.315 ± 0.081
2.321MetIle: 2.321 ± 0.205
2.104MetLys: 2.104 ± 0.233
2.183MetLeu: 2.183 ± 0.214
0.806MetMet: 0.806 ± 0.13
1.849MetAsn: 1.849 ± 0.171
0.806MetPro: 0.806 ± 0.137
0.492MetGln: 0.492 ± 0.097
0.964MetArg: 0.964 ± 0.149
2.124MetSer: 2.124 ± 0.209
1.023MetThr: 1.023 ± 0.123
1.18MetVal: 1.18 ± 0.14
0.157MetTrp: 0.157 ± 0.055
1.495MetTyr: 1.495 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
1.711AsnAla: 1.711 ± 0.184
1.239AsnCys: 1.239 ± 0.18
5.349AsnAsp: 5.349 ± 0.328
4.385AsnGlu: 4.385 ± 0.366
3.854AsnPhe: 3.854 ± 0.297
3.343AsnGly: 3.343 ± 0.248
1.593AsnHis: 1.593 ± 0.151
10.226AsnIle: 10.226 ± 0.601
7.866AsnLys: 7.866 ± 0.413
6.214AsnLeu: 6.214 ± 0.38
2.438AsnMet: 2.438 ± 0.223
8.436AsnAsn: 8.436 ± 0.62
2.596AsnPro: 2.596 ± 0.227
1.849AsnGln: 1.849 ± 0.179
2.517AsnArg: 2.517 ± 0.217
5.723AsnSer: 5.723 ± 0.328
4.208AsnThr: 4.208 ± 0.333
4.7AsnVal: 4.7 ± 0.355
0.413AsnTrp: 0.413 ± 0.077
4.149AsnTyr: 4.149 ± 0.286
0.0AsnXaa: 0.0 ± 0.0
Pro
0.688ProAla: 0.688 ± 0.132
0.61ProCys: 0.61 ± 0.118
1.79ProAsp: 1.79 ± 0.187
1.888ProGlu: 1.888 ± 0.182
1.947ProPhe: 1.947 ± 0.199
1.082ProGly: 1.082 ± 0.156
0.374ProHis: 0.374 ± 0.101
3.048ProIle: 3.048 ± 0.23
2.576ProLys: 2.576 ± 0.211
2.733ProLeu: 2.733 ± 0.243
0.806ProMet: 0.806 ± 0.129
2.104ProAsn: 2.104 ± 0.155
1.141ProPro: 1.141 ± 0.219
0.57ProGln: 0.57 ± 0.105
1.023ProArg: 1.023 ± 0.183
2.321ProSer: 2.321 ± 0.235
1.652ProThr: 1.652 ± 0.197
1.455ProVal: 1.455 ± 0.181
0.216ProTrp: 0.216 ± 0.06
1.691ProTyr: 1.691 ± 0.151
0.0ProXaa: 0.0 ± 0.0
Gln
0.492GlnAla: 0.492 ± 0.114
0.374GlnCys: 0.374 ± 0.07
1.023GlnAsp: 1.023 ± 0.127
1.121GlnGlu: 1.121 ± 0.15
0.983GlnPhe: 0.983 ± 0.124
0.57GlnGly: 0.57 ± 0.091
0.433GlnHis: 0.433 ± 0.082
2.163GlnIle: 2.163 ± 0.218
2.045GlnLys: 2.045 ± 0.199
2.144GlnLeu: 2.144 ± 0.236
0.511GlnMet: 0.511 ± 0.116
1.239GlnAsn: 1.239 ± 0.155
0.57GlnPro: 0.57 ± 0.098
0.629GlnGln: 0.629 ± 0.119
0.747GlnArg: 0.747 ± 0.148
1.495GlnSer: 1.495 ± 0.162
1.416GlnThr: 1.416 ± 0.158
0.806GlnVal: 0.806 ± 0.135
0.256GlnTrp: 0.256 ± 0.069
1.18GlnTyr: 1.18 ± 0.138
0.0GlnXaa: 0.0 ± 0.0
Arg
0.806ArgAla: 0.806 ± 0.129
0.964ArgCys: 0.964 ± 0.156
1.613ArgAsp: 1.613 ± 0.178
1.554ArgGlu: 1.554 ± 0.173
2.085ArgPhe: 2.085 ± 0.237
1.239ArgGly: 1.239 ± 0.164
0.806ArgHis: 0.806 ± 0.144
3.146ArgIle: 3.146 ± 0.231
2.969ArgLys: 2.969 ± 0.273
2.812ArgLeu: 2.812 ± 0.217
0.747ArgMet: 0.747 ± 0.112
2.281ArgAsn: 2.281 ± 0.286
0.944ArgPro: 0.944 ± 0.151
0.767ArgGln: 0.767 ± 0.118
1.357ArgArg: 1.357 ± 0.165
2.537ArgSer: 2.537 ± 0.247
1.475ArgThr: 1.475 ± 0.168
1.691ArgVal: 1.691 ± 0.174
0.256ArgTrp: 0.256 ± 0.073
1.967ArgTyr: 1.967 ± 0.199
0.0ArgXaa: 0.0 ± 0.0
Ser
2.38SerAla: 2.38 ± 0.232
1.731SerCys: 1.731 ± 0.202
4.7SerAsp: 4.7 ± 0.33
4.346SerGlu: 4.346 ± 0.323
3.953SerPhe: 3.953 ± 0.209
2.891SerGly: 2.891 ± 0.225
1.278SerHis: 1.278 ± 0.186
8.712SerIle: 8.712 ± 0.441
7.374SerLys: 7.374 ± 0.353
7.905SerLeu: 7.905 ± 0.408
1.947SerMet: 1.947 ± 0.199
5.408SerAsn: 5.408 ± 0.308
2.144SerPro: 2.144 ± 0.266
1.77SerGln: 1.77 ± 0.172
2.399SerArg: 2.399 ± 0.242
6.077SerSer: 6.077 ± 0.549
3.894SerThr: 3.894 ± 0.309
3.776SerVal: 3.776 ± 0.293
0.57SerTrp: 0.57 ± 0.083
3.874SerTyr: 3.874 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
1.652ThrAla: 1.652 ± 0.175
1.495ThrCys: 1.495 ± 0.186
3.068ThrAsp: 3.068 ± 0.217
2.871ThrGlu: 2.871 ± 0.229
3.304ThrPhe: 3.304 ± 0.236
1.986ThrGly: 1.986 ± 0.212
1.16ThrHis: 1.16 ± 0.125
6.175ThrIle: 6.175 ± 0.392
4.877ThrLys: 4.877 ± 0.271
4.838ThrLeu: 4.838 ± 0.342
1.455ThrMet: 1.455 ± 0.171
3.481ThrAsn: 3.481 ± 0.294
1.868ThrPro: 1.868 ± 0.204
0.885ThrGln: 0.885 ± 0.15
1.514ThrArg: 1.514 ± 0.173
4.012ThrSer: 4.012 ± 0.285
3.382ThrThr: 3.382 ± 0.296
3.186ThrVal: 3.186 ± 0.267
0.531ThrTrp: 0.531 ± 0.093
2.615ThrTyr: 2.615 ± 0.266
0.0ThrXaa: 0.0 ± 0.0
Val
1.947ValAla: 1.947 ± 0.19
1.318ValCys: 1.318 ± 0.218
3.166ValAsp: 3.166 ± 0.224
2.596ValGlu: 2.596 ± 0.191
3.028ValPhe: 3.028 ± 0.245
1.534ValGly: 1.534 ± 0.178
1.023ValHis: 1.023 ± 0.143
4.739ValIle: 4.739 ± 0.317
4.72ValLys: 4.72 ± 0.304
4.7ValLeu: 4.7 ± 0.297
0.905ValMet: 0.905 ± 0.136
4.012ValAsn: 4.012 ± 0.257
1.593ValPro: 1.593 ± 0.172
1.042ValGln: 1.042 ± 0.112
1.593ValArg: 1.593 ± 0.197
4.366ValSer: 4.366 ± 0.32
2.969ValThr: 2.969 ± 0.256
2.124ValVal: 2.124 ± 0.205
0.236ValTrp: 0.236 ± 0.069
3.166ValTyr: 3.166 ± 0.3
0.0ValXaa: 0.0 ± 0.0
Trp
0.236TrpAla: 0.236 ± 0.073
0.098TrpCys: 0.098 ± 0.044
0.216TrpAsp: 0.216 ± 0.057
0.374TrpGlu: 0.374 ± 0.084
0.511TrpPhe: 0.511 ± 0.109
0.236TrpGly: 0.236 ± 0.075
0.02TrpHis: 0.02 ± 0.023
0.747TrpIle: 0.747 ± 0.119
0.649TrpLys: 0.649 ± 0.115
0.492TrpLeu: 0.492 ± 0.119
0.315TrpMet: 0.315 ± 0.072
0.472TrpAsn: 0.472 ± 0.093
0.177TrpPro: 0.177 ± 0.051
0.079TrpGln: 0.079 ± 0.036
0.374TrpArg: 0.374 ± 0.096
0.688TrpSer: 0.688 ± 0.117
0.256TrpThr: 0.256 ± 0.075
0.374TrpVal: 0.374 ± 0.087
0.02TrpTrp: 0.02 ± 0.021
0.275TrpTyr: 0.275 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.337TyrAla: 1.337 ± 0.183
1.318TyrCys: 1.318 ± 0.15
3.422TyrAsp: 3.422 ± 0.281
2.576TyrGlu: 2.576 ± 0.183
3.028TyrPhe: 3.028 ± 0.223
2.537TyrGly: 2.537 ± 0.201
0.944TyrHis: 0.944 ± 0.119
6.49TyrIle: 6.49 ± 0.383
4.425TyrLys: 4.425 ± 0.286
4.877TyrLeu: 4.877 ± 0.364
1.2TyrMet: 1.2 ± 0.142
4.543TyrAsn: 4.543 ± 0.298
2.045TyrPro: 2.045 ± 0.193
1.023TyrGln: 1.023 ± 0.161
1.495TyrArg: 1.495 ± 0.175
4.11TyrSer: 4.11 ± 0.285
2.93TyrThr: 2.93 ± 0.208
2.851TyrVal: 2.851 ± 0.255
0.374TyrTrp: 0.374 ± 0.096
2.891TyrTyr: 2.891 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 165 proteins (50852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski