Amino acid dipepetide frequency for Saimiriine betaherpesvirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.429AlaAla: 4.429 ± 0.585
1.457AlaCys: 1.457 ± 0.167
2.555AlaAsp: 2.555 ± 0.221
3.009AlaGlu: 3.009 ± 0.285
2.479AlaPhe: 2.479 ± 0.217
2.687AlaGly: 2.687 ± 0.37
1.173AlaHis: 1.173 ± 0.17
3.426AlaIle: 3.426 ± 0.245
1.987AlaLys: 1.987 ± 0.228
6.226AlaLeu: 6.226 ± 0.381
1.59AlaMet: 1.59 ± 0.212
1.911AlaAsn: 1.911 ± 0.189
2.593AlaPro: 2.593 ± 0.281
1.609AlaGln: 1.609 ± 0.153
3.255AlaArg: 3.255 ± 0.381
4.107AlaSer: 4.107 ± 0.272
3.785AlaThr: 3.785 ± 0.276
4.712AlaVal: 4.712 ± 0.341
0.776AlaTrp: 0.776 ± 0.133
1.665AlaTyr: 1.665 ± 0.186
0.0AlaXaa: 0.0 ± 0.0
Cys
1.476CysAla: 1.476 ± 0.158
0.738CysCys: 0.738 ± 0.132
1.344CysAsp: 1.344 ± 0.176
1.419CysGlu: 1.419 ± 0.166
1.306CysPhe: 1.306 ± 0.171
1.173CysGly: 1.173 ± 0.169
0.795CysHis: 0.795 ± 0.128
1.363CysIle: 1.363 ± 0.169
0.927CysLys: 0.927 ± 0.15
3.161CysLeu: 3.161 ± 0.275
0.549CysMet: 0.549 ± 0.093
1.173CysAsn: 1.173 ± 0.125
0.984CysPro: 0.984 ± 0.146
1.249CysGln: 1.249 ± 0.155
2.101CysArg: 2.101 ± 0.266
1.817CysSer: 1.817 ± 0.224
1.495CysThr: 1.495 ± 0.173
2.233CysVal: 2.233 ± 0.216
0.397CysTrp: 0.397 ± 0.084
1.022CysTyr: 1.022 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
3.217AspAla: 3.217 ± 0.269
1.136AspCys: 1.136 ± 0.168
3.388AspAsp: 3.388 ± 0.337
3.766AspGlu: 3.766 ± 0.266
2.176AspPhe: 2.176 ± 0.213
2.214AspGly: 2.214 ± 0.226
1.4AspHis: 1.4 ± 0.17
2.896AspIle: 2.896 ± 0.243
1.647AspLys: 1.647 ± 0.19
5.943AspLeu: 5.943 ± 0.348
1.079AspMet: 1.079 ± 0.133
1.647AspAsn: 1.647 ± 0.184
2.517AspPro: 2.517 ± 0.252
1.552AspGln: 1.552 ± 0.155
2.839AspArg: 2.839 ± 0.247
3.35AspSer: 3.35 ± 0.28
3.179AspThr: 3.179 ± 0.274
4.145AspVal: 4.145 ± 0.283
0.662AspTrp: 0.662 ± 0.115
1.798AspTyr: 1.798 ± 0.205
0.0AspXaa: 0.0 ± 0.0
Glu
3.312GluAla: 3.312 ± 0.26
1.325GluCys: 1.325 ± 0.178
3.936GluAsp: 3.936 ± 0.29
4.315GluGlu: 4.315 ± 0.372
1.836GluPhe: 1.836 ± 0.174
1.779GluGly: 1.779 ± 0.205
1.741GluHis: 1.741 ± 0.2
2.877GluIle: 2.877 ± 0.214
2.801GluLys: 2.801 ± 0.29
5.167GluLeu: 5.167 ± 0.345
1.419GluMet: 1.419 ± 0.161
3.047GluAsn: 3.047 ± 0.21
2.025GluPro: 2.025 ± 0.169
2.025GluGln: 2.025 ± 0.19
3.179GluArg: 3.179 ± 0.277
4.05GluSer: 4.05 ± 0.264
4.296GluThr: 4.296 ± 0.299
3.426GluVal: 3.426 ± 0.326
0.643GluTrp: 0.643 ± 0.111
1.949GluTyr: 1.949 ± 0.186
0.0GluXaa: 0.0 ± 0.0
Phe
1.855PheAla: 1.855 ± 0.165
1.419PheCys: 1.419 ± 0.16
1.968PheAsp: 1.968 ± 0.192
2.309PheGlu: 2.309 ± 0.192
2.574PhePhe: 2.574 ± 0.247
2.536PheGly: 2.536 ± 0.241
1.022PheHis: 1.022 ± 0.155
2.915PheIle: 2.915 ± 0.245
2.139PheLys: 2.139 ± 0.168
5.318PheLeu: 5.318 ± 0.295
1.098PheMet: 1.098 ± 0.14
2.176PheAsn: 2.176 ± 0.205
1.855PhePro: 1.855 ± 0.18
1.798PheGln: 1.798 ± 0.142
2.422PheArg: 2.422 ± 0.239
3.179PheSer: 3.179 ± 0.252
2.896PheThr: 2.896 ± 0.248
3.52PheVal: 3.52 ± 0.301
0.606PheTrp: 0.606 ± 0.09
1.741PheTyr: 1.741 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
2.479GlyAla: 2.479 ± 0.381
1.211GlyCys: 1.211 ± 0.172
2.328GlyAsp: 2.328 ± 0.177
2.725GlyGlu: 2.725 ± 0.234
1.93GlyPhe: 1.93 ± 0.22
2.422GlyGly: 2.422 ± 0.443
0.965GlyHis: 0.965 ± 0.144
2.65GlyIle: 2.65 ± 0.241
1.911GlyLys: 1.911 ± 0.246
4.977GlyLeu: 4.977 ± 0.321
1.098GlyMet: 1.098 ± 0.174
1.741GlyAsn: 1.741 ± 0.19
1.874GlyPro: 1.874 ± 0.201
1.647GlyGln: 1.647 ± 0.174
2.896GlyArg: 2.896 ± 0.285
3.407GlySer: 3.407 ± 0.269
3.293GlyThr: 3.293 ± 0.241
3.653GlyVal: 3.653 ± 0.333
0.454GlyTrp: 0.454 ± 0.079
1.741GlyTyr: 1.741 ± 0.168
0.0GlyXaa: 0.0 ± 0.0
His
1.628HisAla: 1.628 ± 0.212
0.473HisCys: 0.473 ± 0.092
1.817HisAsp: 1.817 ± 0.24
1.117HisGlu: 1.117 ± 0.13
1.287HisPhe: 1.287 ± 0.182
1.552HisGly: 1.552 ± 0.161
0.984HisHis: 0.984 ± 0.171
1.533HisIle: 1.533 ± 0.178
0.738HisLys: 0.738 ± 0.121
3.217HisLeu: 3.217 ± 0.254
0.435HisMet: 0.435 ± 0.081
1.154HisAsn: 1.154 ± 0.15
1.268HisPro: 1.268 ± 0.157
1.325HisGln: 1.325 ± 0.16
2.157HisArg: 2.157 ± 0.197
1.476HisSer: 1.476 ± 0.186
1.76HisThr: 1.76 ± 0.233
2.271HisVal: 2.271 ± 0.22
0.208HisTrp: 0.208 ± 0.062
0.814HisTyr: 0.814 ± 0.124
0.0HisXaa: 0.0 ± 0.0
Ile
2.99IleAla: 2.99 ± 0.257
1.703IleCys: 1.703 ± 0.195
2.687IleAsp: 2.687 ± 0.253
2.366IleGlu: 2.366 ± 0.231
3.066IlePhe: 3.066 ± 0.273
2.593IleGly: 2.593 ± 0.244
1.4IleHis: 1.4 ± 0.202
3.842IleIle: 3.842 ± 0.342
2.46IleLys: 2.46 ± 0.233
5.772IleLeu: 5.772 ± 0.444
1.495IleMet: 1.495 ± 0.175
2.195IleAsn: 2.195 ± 0.199
2.933IlePro: 2.933 ± 0.282
2.593IleGln: 2.593 ± 0.216
3.463IleArg: 3.463 ± 0.254
4.656IleSer: 4.656 ± 0.35
4.504IleThr: 4.504 ± 0.386
4.012IleVal: 4.012 ± 0.245
0.871IleTrp: 0.871 ± 0.13
2.782IleTyr: 2.782 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
2.101LysAla: 2.101 ± 0.192
1.022LysCys: 1.022 ± 0.134
2.366LysAsp: 2.366 ± 0.239
2.366LysGlu: 2.366 ± 0.175
1.287LysPhe: 1.287 ± 0.158
1.419LysGly: 1.419 ± 0.181
1.571LysHis: 1.571 ± 0.198
3.009LysIle: 3.009 ± 0.272
2.706LysLys: 2.706 ± 0.272
4.031LysLeu: 4.031 ± 0.251
1.041LysMet: 1.041 ± 0.124
2.498LysAsn: 2.498 ± 0.265
1.987LysPro: 1.987 ± 0.205
2.006LysGln: 2.006 ± 0.243
3.066LysArg: 3.066 ± 0.276
3.161LysSer: 3.161 ± 0.26
3.577LysThr: 3.577 ± 0.278
2.366LysVal: 2.366 ± 0.228
0.587LysTrp: 0.587 ± 0.118
1.665LysTyr: 1.665 ± 0.195
0.0LysXaa: 0.0 ± 0.0
Leu
5.299LeuAla: 5.299 ± 0.313
3.634LeuCys: 3.634 ± 0.291
4.958LeuAsp: 4.958 ± 0.289
5.034LeuGlu: 5.034 ± 0.337
4.977LeuPhe: 4.977 ± 0.354
4.958LeuGly: 4.958 ± 0.351
3.179LeuHis: 3.179 ± 0.233
6.378LeuIle: 6.378 ± 0.419
5.583LeuLys: 5.583 ± 0.413
11.28LeuLeu: 11.28 ± 0.537
2.839LeuMet: 2.839 ± 0.289
4.466LeuAsn: 4.466 ± 0.315
4.94LeuPro: 4.94 ± 0.315
3.709LeuGln: 3.709 ± 0.375
6.586LeuArg: 6.586 ± 0.415
8.876LeuSer: 8.876 ± 0.427
6.832LeuThr: 6.832 ± 0.438
5.697LeuVal: 5.697 ± 0.344
1.249LeuTrp: 1.249 ± 0.197
4.523LeuTyr: 4.523 ± 0.307
0.0LeuXaa: 0.0 ± 0.0
Met
1.836MetAla: 1.836 ± 0.225
0.568MetCys: 0.568 ± 0.097
1.211MetAsp: 1.211 ± 0.162
1.665MetGlu: 1.665 ± 0.189
1.154MetPhe: 1.154 ± 0.155
0.852MetGly: 0.852 ± 0.156
0.549MetHis: 0.549 ± 0.099
1.363MetIle: 1.363 ± 0.148
1.211MetLys: 1.211 ± 0.149
2.479MetLeu: 2.479 ± 0.213
0.871MetMet: 0.871 ± 0.141
0.908MetAsn: 0.908 ± 0.136
0.776MetPro: 0.776 ± 0.101
0.681MetGln: 0.681 ± 0.122
1.249MetArg: 1.249 ± 0.144
2.404MetSer: 2.404 ± 0.194
1.779MetThr: 1.779 ± 0.183
1.4MetVal: 1.4 ± 0.187
0.492MetTrp: 0.492 ± 0.089
1.211MetTyr: 1.211 ± 0.138
0.0MetXaa: 0.0 ± 0.0
Asn
2.309AsnAla: 2.309 ± 0.191
0.757AsnCys: 0.757 ± 0.115
1.817AsnAsp: 1.817 ± 0.181
2.404AsnGlu: 2.404 ± 0.207
1.893AsnPhe: 1.893 ± 0.223
2.063AsnGly: 2.063 ± 0.19
1.382AsnHis: 1.382 ± 0.146
2.744AsnIle: 2.744 ± 0.266
1.855AsnLys: 1.855 ± 0.199
4.712AsnLeu: 4.712 ± 0.341
0.908AsnMet: 0.908 ± 0.138
1.911AsnAsn: 1.911 ± 0.188
1.798AsnPro: 1.798 ± 0.182
1.268AsnGln: 1.268 ± 0.188
2.422AsnArg: 2.422 ± 0.209
3.312AsnSer: 3.312 ± 0.35
3.463AsnThr: 3.463 ± 0.419
4.296AsnVal: 4.296 ± 0.374
0.53AsnTrp: 0.53 ± 0.091
1.419AsnTyr: 1.419 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
2.498ProAla: 2.498 ± 0.269
1.154ProCys: 1.154 ± 0.147
2.252ProAsp: 2.252 ± 0.209
3.009ProGlu: 3.009 ± 0.232
1.911ProPhe: 1.911 ± 0.212
2.271ProGly: 2.271 ± 0.247
1.098ProHis: 1.098 ± 0.113
2.309ProIle: 2.309 ± 0.211
1.987ProLys: 1.987 ± 0.197
4.296ProLeu: 4.296 ± 0.28
1.079ProMet: 1.079 ± 0.176
1.76ProAsn: 1.76 ± 0.244
3.047ProPro: 3.047 ± 0.365
1.249ProGln: 1.249 ± 0.184
3.388ProArg: 3.388 ± 0.332
3.918ProSer: 3.918 ± 0.262
3.028ProThr: 3.028 ± 0.337
3.35ProVal: 3.35 ± 0.253
0.7ProTrp: 0.7 ± 0.115
1.647ProTyr: 1.647 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
1.893GlnAla: 1.893 ± 0.223
1.041GlnCys: 1.041 ± 0.133
1.798GlnAsp: 1.798 ± 0.176
2.063GlnGlu: 2.063 ± 0.22
1.419GlnPhe: 1.419 ± 0.171
1.211GlnGly: 1.211 ± 0.147
1.079GlnHis: 1.079 ± 0.159
2.157GlnIle: 2.157 ± 0.251
1.987GlnLys: 1.987 ± 0.218
3.558GlnLeu: 3.558 ± 0.368
0.814GlnMet: 0.814 ± 0.131
1.987GlnAsn: 1.987 ± 0.18
1.363GlnPro: 1.363 ± 0.179
1.382GlnGln: 1.382 ± 0.176
2.593GlnArg: 2.593 ± 0.199
2.404GlnSer: 2.404 ± 0.214
2.801GlnThr: 2.801 ± 0.234
1.949GlnVal: 1.949 ± 0.177
0.473GlnTrp: 0.473 ± 0.097
1.192GlnTyr: 1.192 ± 0.166
0.0GlnXaa: 0.0 ± 0.0
Arg
2.422ArgAla: 2.422 ± 0.321
1.684ArgCys: 1.684 ± 0.204
3.482ArgAsp: 3.482 ± 0.312
2.839ArgGlu: 2.839 ± 0.223
3.123ArgPhe: 3.123 ± 0.332
2.441ArgGly: 2.441 ± 0.234
2.29ArgHis: 2.29 ± 0.183
3.388ArgIle: 3.388 ± 0.256
2.593ArgLys: 2.593 ± 0.23
6.813ArgLeu: 6.813 ± 0.377
1.457ArgMet: 1.457 ± 0.163
2.782ArgAsn: 2.782 ± 0.23
3.028ArgPro: 3.028 ± 0.332
2.479ArgGln: 2.479 ± 0.22
6.094ArgArg: 6.094 ± 0.493
4.675ArgSer: 4.675 ± 0.317
3.444ArgThr: 3.444 ± 0.248
4.05ArgVal: 4.05 ± 0.259
0.795ArgTrp: 0.795 ± 0.149
3.312ArgTyr: 3.312 ± 0.303
0.0ArgXaa: 0.0 ± 0.0
Ser
4.731SerAla: 4.731 ± 0.309
1.911SerCys: 1.911 ± 0.211
3.936SerAsp: 3.936 ± 0.266
4.618SerGlu: 4.618 ± 0.302
3.634SerPhe: 3.634 ± 0.266
4.258SerGly: 4.258 ± 0.296
1.741SerHis: 1.741 ± 0.168
3.766SerIle: 3.766 ± 0.275
3.312SerLys: 3.312 ± 0.262
7.835SerLeu: 7.835 ± 0.358
1.571SerMet: 1.571 ± 0.16
3.236SerAsn: 3.236 ± 0.315
3.52SerPro: 3.52 ± 0.319
2.593SerGln: 2.593 ± 0.274
4.731SerArg: 4.731 ± 0.34
8.403SerSer: 8.403 ± 0.631
6.037SerThr: 6.037 ± 0.638
5.678SerVal: 5.678 ± 0.378
1.117SerTrp: 1.117 ± 0.152
2.725SerTyr: 2.725 ± 0.194
0.0SerXaa: 0.0 ± 0.0
Thr
4.069ThrAla: 4.069 ± 0.321
2.063ThrCys: 2.063 ± 0.244
2.971ThrAsp: 2.971 ± 0.273
3.974ThrGlu: 3.974 ± 0.328
3.426ThrPhe: 3.426 ± 0.343
3.066ThrGly: 3.066 ± 0.269
1.76ThrHis: 1.76 ± 0.219
4.031ThrIle: 4.031 ± 0.33
2.877ThrLys: 2.877 ± 0.193
6.775ThrLeu: 6.775 ± 0.438
1.893ThrMet: 1.893 ± 0.207
2.782ThrAsn: 2.782 ± 0.394
3.861ThrPro: 3.861 ± 0.414
2.252ThrGln: 2.252 ± 0.259
3.596ThrArg: 3.596 ± 0.266
6.472ThrSer: 6.472 ± 0.545
6.113ThrThr: 6.113 ± 0.863
6.132ThrVal: 6.132 ± 0.379
1.003ThrTrp: 1.003 ± 0.182
2.536ThrTyr: 2.536 ± 0.266
0.0ThrXaa: 0.0 ± 0.0
Val
4.239ValAla: 4.239 ± 0.263
2.176ValCys: 2.176 ± 0.217
3.198ValAsp: 3.198 ± 0.301
3.539ValGlu: 3.539 ± 0.297
3.482ValPhe: 3.482 ± 0.283
3.293ValGly: 3.293 ± 0.32
1.495ValHis: 1.495 ± 0.191
4.429ValIle: 4.429 ± 0.347
3.009ValLys: 3.009 ± 0.278
7.343ValLeu: 7.343 ± 0.413
2.025ValMet: 2.025 ± 0.209
3.388ValAsn: 3.388 ± 0.26
3.142ValPro: 3.142 ± 0.266
2.063ValGln: 2.063 ± 0.18
3.766ValArg: 3.766 ± 0.294
6.34ValSer: 6.34 ± 0.402
5.64ValThr: 5.64 ± 0.354
4.485ValVal: 4.485 ± 0.375
1.136ValTrp: 1.136 ± 0.145
3.104ValTyr: 3.104 ± 0.243
0.0ValXaa: 0.0 ± 0.0
Trp
0.643TrpAla: 0.643 ± 0.124
0.379TrpCys: 0.379 ± 0.078
0.587TrpAsp: 0.587 ± 0.114
0.643TrpGlu: 0.643 ± 0.112
0.587TrpPhe: 0.587 ± 0.117
0.511TrpGly: 0.511 ± 0.084
0.303TrpHis: 0.303 ± 0.071
1.041TrpIle: 1.041 ± 0.174
0.625TrpLys: 0.625 ± 0.092
1.476TrpLeu: 1.476 ± 0.165
0.379TrpMet: 0.379 ± 0.088
0.549TrpAsn: 0.549 ± 0.111
0.889TrpPro: 0.889 ± 0.12
0.492TrpGln: 0.492 ± 0.089
0.833TrpArg: 0.833 ± 0.138
0.984TrpSer: 0.984 ± 0.145
1.041TrpThr: 1.041 ± 0.159
0.587TrpVal: 0.587 ± 0.118
0.151TrpTrp: 0.151 ± 0.061
0.568TrpTyr: 0.568 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.006TyrAla: 2.006 ± 0.174
0.871TyrCys: 0.871 ± 0.148
1.949TyrAsp: 1.949 ± 0.213
1.968TyrGlu: 1.968 ± 0.198
1.798TyrPhe: 1.798 ± 0.174
2.101TyrGly: 2.101 ± 0.231
1.287TyrHis: 1.287 ± 0.171
2.271TyrIle: 2.271 ± 0.199
1.476TyrLys: 1.476 ± 0.15
4.618TyrLeu: 4.618 ± 0.278
1.003TyrMet: 1.003 ± 0.138
1.949TyrAsn: 1.949 ± 0.212
1.552TyrPro: 1.552 ± 0.169
1.192TyrGln: 1.192 ± 0.114
2.574TyrArg: 2.574 ± 0.226
2.366TyrSer: 2.366 ± 0.218
2.687TyrThr: 2.687 ± 0.297
3.331TyrVal: 3.331 ± 0.238
0.416TyrTrp: 0.416 ± 0.098
1.76TyrTyr: 1.76 ± 0.153
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 140 proteins (52840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski