Amino acid dipeptide frequency for Ranid herpesvirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
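As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the code used by Proteome-pI. The function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), the overlapping windows that never span two proteins, and the normalisation by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Ranid herpesvirus 2 sequences.
freqs = dipeptide_per_mille(["MALLA", "AAB"])
```

With this toy input there are six pairs in total (MA, AL, LL, LA, AA, AX), so each of them gets the same share of 1000 per mille.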

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
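The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the assumption that the red/blue marking uses the plain uniform expectation as its threshold (with no tolerance band) is mine, since the page does not spell out the exact rule.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed rule, no tolerance)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 20 * 20 = 400 pairs
# Three values taken from the table on this page, in per mille.
examples = {"LeuLeu": 11.089, "AlaAla": 6.762, "TrpTrp": 0.171}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
```

Passing `alphabet_size=21` gives the expectation for the 441-pair case that includes Xaa.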

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
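For readers who want the values in other units, a small hedged helper shows the conversion. The function names are made up for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


leu_leu = 11.089  # LeuLeu value from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)  # about 0.011089
percent = per_mille_to_percent(leu_leu)    # about 1.1089
```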

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.762 ± 0.353
AlaCys: 1.877 ± 0.178
AlaAsp: 4.296 ± 0.321
AlaGlu: 5.164 ± 0.321
AlaPhe: 3.21 ± 0.236
AlaGly: 3.396 ± 0.285
AlaHis: 1.83 ± 0.18
AlaIle: 3.024 ± 0.233
AlaLys: 3.769 ± 0.331
AlaLeu: 8.22 ± 0.324
AlaMet: 2.032 ± 0.168
AlaAsn: 2.342 ± 0.218
AlaPro: 3.939 ± 0.42
AlaGln: 3.009 ± 0.245
AlaArg: 3.877 ± 0.262
AlaSer: 4.529 ± 0.314
AlaThr: 3.505 ± 0.234
AlaVal: 5.351 ± 0.322
AlaTrp: 0.775 ± 0.098
AlaTyr: 2.776 ± 0.219
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.249 ± 0.184
CysCys: 0.76 ± 0.132
CysAsp: 1.458 ± 0.176
CysGlu: 1.303 ± 0.151
CysPhe: 1.256 ± 0.132
CysGly: 1.473 ± 0.138
CysHis: 0.62 ± 0.076
CysIle: 1.272 ± 0.155
CysLys: 1.365 ± 0.152
CysLeu: 2.869 ± 0.232
CysMet: 0.931 ± 0.115
CysAsn: 1.086 ± 0.144
CysPro: 1.473 ± 0.156
CysGln: 0.667 ± 0.098
CysArg: 1.442 ± 0.167
CysSer: 1.846 ± 0.171
CysThr: 1.908 ± 0.209
CysVal: 2.032 ± 0.177
CysTrp: 0.248 ± 0.066
CysTyr: 1.396 ± 0.197
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.063 ± 0.256
AspCys: 1.349 ± 0.155
AspAsp: 2.637 ± 0.223
AspGlu: 3.939 ± 0.298
AspPhe: 2.28 ± 0.212
AspGly: 2.683 ± 0.26
AspHis: 0.962 ± 0.105
AspIle: 2.528 ± 0.205
AspLys: 2.047 ± 0.177
AspLeu: 5.087 ± 0.228
AspMet: 1.939 ± 0.19
AspAsn: 1.132 ± 0.138
AspPro: 1.877 ± 0.191
AspGln: 1.349 ± 0.177
AspArg: 2.9 ± 0.195
AspSer: 2.838 ± 0.272
AspThr: 3.257 ± 0.229
AspVal: 4.032 ± 0.24
AspTrp: 0.868 ± 0.119
AspTyr: 2.171 ± 0.219
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.808 ± 0.377
GluCys: 1.628 ± 0.148
GluAsp: 3.924 ± 0.235
GluGlu: 6.932 ± 0.643
GluPhe: 2.792 ± 0.267
GluGly: 3.009 ± 0.219
GluHis: 2.094 ± 0.159
GluIle: 2.776 ± 0.192
GluLys: 3.427 ± 0.272
GluLeu: 6.405 ± 0.27
GluMet: 2.249 ± 0.185
GluAsn: 2.823 ± 0.223
GluPro: 2.714 ± 0.184
GluGln: 2.28 ± 0.15
GluArg: 3.769 ± 0.295
GluSer: 3.49 ± 0.242
GluThr: 3.412 ± 0.248
GluVal: 4.684 ± 0.301
GluTrp: 0.853 ± 0.121
GluTyr: 2.574 ± 0.211
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.668 ± 0.227
PheCys: 1.256 ± 0.143
PheAsp: 1.939 ± 0.179
PheGlu: 2.109 ± 0.184
PhePhe: 1.97 ± 0.174
PheGly: 1.799 ± 0.181
PheHis: 0.962 ± 0.112
PheIle: 1.659 ± 0.141
PheLys: 2.109 ± 0.186
PheLeu: 4.063 ± 0.269
PheMet: 1.163 ± 0.143
PheAsn: 1.737 ± 0.148
PhePro: 1.939 ± 0.168
PheGln: 1.411 ± 0.135
PheArg: 1.892 ± 0.179
PheSer: 2.668 ± 0.158
PheThr: 2.419 ± 0.203
PheVal: 2.947 ± 0.202
PheTrp: 0.496 ± 0.086
PheTyr: 1.954 ± 0.17
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.823 ± 0.234
GlyCys: 1.07 ± 0.135
GlyAsp: 2.28 ± 0.221
GlyGlu: 2.947 ± 0.235
GlyPhe: 1.675 ± 0.174
GlyGly: 2.249 ± 0.188
GlyHis: 1.287 ± 0.155
GlyIle: 1.799 ± 0.193
GlyLys: 2.559 ± 0.21
GlyLeu: 3.629 ± 0.247
GlyMet: 1.396 ± 0.141
GlyAsn: 1.535 ± 0.185
GlyPro: 1.551 ± 0.144
GlyGln: 1.504 ± 0.177
GlyArg: 2.885 ± 0.291
GlySer: 2.683 ± 0.201
GlyThr: 2.947 ± 0.206
GlyVal: 3.567 ± 0.278
GlyTrp: 0.388 ± 0.071
GlyTyr: 1.768 ± 0.189
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.063 ± 0.179
HisCys: 0.993 ± 0.131
HisAsp: 1.039 ± 0.106
HisGlu: 1.737 ± 0.166
HisPhe: 1.194 ± 0.142
HisGly: 1.272 ± 0.151
HisHis: 1.008 ± 0.166
HisIle: 1.504 ± 0.155
HisLys: 1.52 ± 0.184
HisLeu: 2.978 ± 0.217
HisMet: 1.086 ± 0.158
HisAsn: 1.194 ± 0.142
HisPro: 1.551 ± 0.2
HisGln: 0.993 ± 0.127
HisArg: 1.473 ± 0.136
HisSer: 1.877 ± 0.148
HisThr: 2.388 ± 0.223
HisVal: 2.171 ± 0.191
HisTrp: 0.279 ± 0.061
HisTyr: 1.83 ± 0.189
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.776 ± 0.24
IleCys: 1.287 ± 0.15
IleAsp: 2.187 ± 0.165
IleGlu: 2.295 ± 0.191
IlePhe: 1.985 ± 0.168
IleGly: 1.303 ± 0.156
IleHis: 1.256 ± 0.134
IleIle: 1.582 ± 0.167
IleLys: 2.761 ± 0.209
IleLeu: 4.172 ± 0.246
IleMet: 1.442 ± 0.147
IleAsn: 1.97 ± 0.208
IlePro: 2.605 ± 0.185
IleGln: 1.132 ± 0.134
IleArg: 1.877 ± 0.164
IleSer: 3.04 ± 0.213
IleThr: 3.102 ± 0.209
IleVal: 2.202 ± 0.175
IleTrp: 0.326 ± 0.067
IleTyr: 2.202 ± 0.184
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.521 ± 0.223
LysCys: 1.349 ± 0.154
LysAsp: 2.854 ± 0.222
LysGlu: 4.094 ± 0.333
LysPhe: 1.473 ± 0.173
LysGly: 2.326 ± 0.197
LysHis: 2.001 ± 0.16
LysIle: 2.807 ± 0.247
LysLys: 3.939 ± 0.372
LysLeu: 5.692 ± 0.25
LysMet: 1.582 ± 0.182
LysAsn: 2.916 ± 0.275
LysPro: 2.776 ± 0.274
LysGln: 2.109 ± 0.196
LysArg: 3.427 ± 0.24
LysSer: 3.179 ± 0.213
LysThr: 3.148 ± 0.233
LysVal: 3.334 ± 0.198
LysTrp: 0.357 ± 0.083
LysTyr: 2.497 ± 0.219
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.692 ± 0.267
LeuCys: 3.427 ± 0.26
LeuAsp: 4.575 ± 0.274
LeuGlu: 6.204 ± 0.335
LeuPhe: 3.691 ± 0.269
LeuGly: 3.707 ± 0.262
LeuHis: 3.319 ± 0.211
LeuIle: 3.474 ± 0.225
LeuLys: 6.235 ± 0.334
LeuLeu: 11.089 ± 0.536
LeuMet: 2.916 ± 0.228
LeuAsn: 4.56 ± 0.301
LeuPro: 6.157 ± 0.436
LeuGln: 4.405 ± 0.261
LeuArg: 6.297 ± 0.323
LeuSer: 6.932 ± 0.322
LeuThr: 6.622 ± 0.373
LeuVal: 5.506 ± 0.336
LeuTrp: 1.365 ± 0.191
LeuTyr: 5.211 ± 0.337
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.745 ± 0.223
MetCys: 1.039 ± 0.146
MetAsp: 1.38 ± 0.16
MetGlu: 2.357 ± 0.213
MetPhe: 0.962 ± 0.118
MetGly: 1.365 ± 0.15
MetHis: 0.931 ± 0.138
MetIle: 0.729 ± 0.096
MetLys: 1.489 ± 0.149
MetLeu: 3.133 ± 0.258
MetMet: 0.574 ± 0.107
MetAsn: 1.132 ± 0.136
MetPro: 1.939 ± 0.197
MetGln: 1.256 ± 0.161
MetArg: 2.032 ± 0.171
MetSer: 1.83 ± 0.181
MetThr: 1.52 ± 0.153
MetVal: 1.737 ± 0.192
MetTrp: 0.543 ± 0.087
MetTyr: 1.163 ± 0.138
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.303 ± 0.209
AsnCys: 0.915 ± 0.125
AsnAsp: 1.97 ± 0.156
AsnGlu: 2.032 ± 0.178
AsnPhe: 1.908 ± 0.201
AsnGly: 1.706 ± 0.184
AsnHis: 1.272 ± 0.143
AsnIle: 2.032 ± 0.207
AsnLys: 2.171 ± 0.186
AsnLeu: 3.924 ± 0.27
AsnMet: 1.163 ± 0.17
AsnAsn: 1.582 ± 0.19
AsnPro: 2.078 ± 0.194
AsnGln: 1.272 ± 0.164
AsnArg: 1.985 ± 0.184
AsnSer: 2.559 ± 0.179
AsnThr: 2.807 ± 0.236
AsnVal: 3.536 ± 0.205
AsnTrp: 0.248 ± 0.077
AsnTyr: 1.985 ± 0.149
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.342 ± 0.403
ProCys: 1.396 ± 0.165
ProAsp: 2.869 ± 0.253
ProGlu: 3.412 ± 0.279
ProPhe: 1.877 ± 0.176
ProGly: 2.528 ± 0.182
ProHis: 1.566 ± 0.174
ProIle: 1.892 ± 0.228
ProLys: 2.993 ± 0.25
ProLeu: 5.428 ± 0.32
ProMet: 1.256 ± 0.173
ProAsn: 2.311 ± 0.2
ProPro: 5.878 ± 0.999
ProGln: 2.047 ± 0.176
ProArg: 2.699 ± 0.201
ProSer: 4.001 ± 0.387
ProThr: 3.831 ± 0.388
ProVal: 3.49 ± 0.244
ProTrp: 0.45 ± 0.092
ProTyr: 1.985 ± 0.16
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.668 ± 0.17
GlnCys: 0.884 ± 0.118
GlnAsp: 1.582 ± 0.155
GlnGlu: 3.009 ± 0.239
GlnPhe: 0.993 ± 0.127
GlnGly: 1.21 ± 0.146
GlnHis: 1.52 ± 0.137
GlnIle: 1.985 ± 0.165
GlnLys: 2.063 ± 0.211
GlnLeu: 3.505 ± 0.229
GlnMet: 1.055 ± 0.122
GlnAsn: 1.939 ± 0.17
GlnPro: 2.156 ± 0.261
GlnGln: 2.078 ± 0.239
GlnArg: 2.419 ± 0.195
GlnSer: 2.202 ± 0.169
GlnThr: 2.202 ± 0.247
GlnVal: 2.047 ± 0.163
GlnTrp: 0.279 ± 0.08
GlnTyr: 1.458 ± 0.148
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.815 ± 0.24
ArgCys: 1.427 ± 0.135
ArgAsp: 2.73 ± 0.213
ArgGlu: 3.536 ± 0.262
ArgPhe: 2.218 ± 0.197
ArgGly: 2.342 ± 0.197
ArgHis: 2.202 ± 0.172
ArgIle: 2.125 ± 0.219
ArgLys: 3.443 ± 0.231
ArgLeu: 6.017 ± 0.338
ArgMet: 1.753 ± 0.166
ArgAsn: 2.078 ± 0.171
ArgPro: 2.761 ± 0.251
ArgGln: 2.109 ± 0.199
ArgArg: 4.017 ± 0.269
ArgSer: 3.614 ± 0.293
ArgThr: 3.412 ± 0.24
ArgVal: 3.738 ± 0.246
ArgTrp: 0.419 ± 0.096
ArgTyr: 2.668 ± 0.232
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.211 ± 0.325
SerCys: 1.877 ± 0.216
SerAsp: 3.164 ± 0.245
SerGlu: 3.629 ± 0.287
SerPhe: 2.823 ± 0.234
SerGly: 2.373 ± 0.187
SerHis: 1.753 ± 0.177
SerIle: 2.59 ± 0.228
SerLys: 3.148 ± 0.255
SerLeu: 6.886 ± 0.357
SerMet: 1.721 ± 0.18
SerAsn: 2.357 ± 0.218
SerPro: 3.753 ± 0.336
SerGln: 1.877 ± 0.193
SerArg: 3.179 ± 0.223
SerSer: 4.823 ± 0.522
SerThr: 4.063 ± 0.331
SerVal: 5.118 ± 0.285
SerTrp: 0.589 ± 0.091
SerTyr: 2.745 ± 0.204
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.87 ± 0.262
ThrCys: 1.659 ± 0.191
ThrAsp: 3.071 ± 0.255
ThrGlu: 3.769 ± 0.227
ThrPhe: 2.388 ± 0.173
ThrGly: 2.419 ± 0.193
ThrHis: 2.016 ± 0.195
ThrIle: 2.497 ± 0.167
ThrLys: 3.117 ± 0.228
ThrLeu: 6.715 ± 0.342
ThrMet: 1.551 ± 0.161
ThrAsn: 2.109 ± 0.247
ThrPro: 4.42 ± 0.367
ThrGln: 2.761 ± 0.199
ThrArg: 3.086 ± 0.223
ThrSer: 3.769 ± 0.346
ThrThr: 3.877 ± 0.499
ThrVal: 4.622 ± 0.294
ThrTrp: 0.605 ± 0.097
ThrTyr: 3.071 ± 0.261
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.04 ± 0.284
ValCys: 2.032 ± 0.172
ValAsp: 3.241 ± 0.198
ValGlu: 5.242 ± 0.243
ValPhe: 2.373 ± 0.2
ValGly: 2.605 ± 0.201
ValHis: 1.83 ± 0.172
ValIle: 2.559 ± 0.213
ValLys: 3.691 ± 0.265
ValLeu: 6.669 ± 0.334
ValMet: 2.171 ± 0.197
ValAsn: 2.792 ± 0.201
ValPro: 4.389 ± 0.275
ValGln: 2.869 ± 0.191
ValArg: 3.955 ± 0.237
ValSer: 4.079 ± 0.281
ValThr: 4.296 ± 0.268
ValVal: 4.978 ± 0.334
ValTrp: 0.791 ± 0.096
ValTyr: 3.536 ± 0.354
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.698 ± 0.108
TrpCys: 0.264 ± 0.064
TrpAsp: 0.279 ± 0.058
TrpGlu: 0.372 ± 0.089
TrpPhe: 0.326 ± 0.073
TrpGly: 0.543 ± 0.095
TrpHis: 0.419 ± 0.088
TrpIle: 0.434 ± 0.087
TrpLys: 0.744 ± 0.131
TrpLeu: 0.962 ± 0.13
TrpMet: 0.264 ± 0.064
TrpAsn: 0.512 ± 0.084
TrpPro: 0.636 ± 0.105
TrpGln: 0.62 ± 0.094
TrpArg: 0.698 ± 0.149
TrpSer: 0.853 ± 0.137
TrpThr: 0.574 ± 0.09
TrpVal: 0.605 ± 0.088
TrpTrp: 0.171 ± 0.052
TrpTyr: 0.605 ± 0.08
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.567 ± 0.236
TyrCys: 1.163 ± 0.13
TyrAsp: 2.466 ± 0.196
TyrGlu: 2.559 ± 0.212
TyrPhe: 1.768 ± 0.186
TyrGly: 2.233 ± 0.218
TyrHis: 1.303 ± 0.161
TyrIle: 2.233 ± 0.207
TyrLys: 2.792 ± 0.226
TyrLeu: 3.955 ± 0.264
TyrMet: 1.597 ± 0.154
TyrAsn: 2.342 ± 0.174
TyrPro: 1.659 ± 0.169
TyrGln: 1.504 ± 0.147
TyrArg: 2.512 ± 0.195
TyrSer: 2.869 ± 0.186
TyrThr: 3.102 ± 0.264
TyrVal: 3.443 ± 0.231
TyrTrp: 0.527 ± 0.089
TyrTyr: 2.357 ± 0.214
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 147 proteins (64480 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
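To make the note above more concrete, here is a minimal Python sketch of a protein-level bootstrap. It is not the Proteome-pI implementation. The function names, the fixed seed, the use of one-letter codes, and the choice of the sample standard deviation as the reported error are assumptions for this example; the page only states that bootstrapping was done 100 times at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency across bootstrap resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing keys
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the sample standard deviation of the estimates.
    return stdev(estimates)


# Toy sequences, not real Ranid herpesvirus 2 proteins.
error_ll = bootstrap_error(["MALLA", "AALLK", "GGGG"], "LL")
```

Resampling whole proteins rather than individual residues keeps the pairs inside each protein together, which is what "at the protein level" suggests.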

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski