Amino acid dipeptide frequency for Macropodid alphaherpesvirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
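As an illustration of what such a calculation could look like, here is a minimal sketch in Python. It assumes overlapping windows of two residues within each protein, one-letter amino acid codes, and normalization by the total number of dipeptides; the function name `dipeptide_frequencies` is hypothetical, and the exact procedure used to build the table on this page is not stated here.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: dipeptides are counted with overlapping windows inside each
    protein and are never formed across two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral proteins): MA, AA, AL from the first
# sequence and AA, AL from the second, five dipeptides in total.
freqs = dipeptide_frequencies(["MAAL", "AAL"])
print(freqs["AA"])  # 2 of 5 dipeptides -> 400.0
```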

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
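The comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes the threshold is exactly the uniform value for the 20 standard amino acids, and the names `EXPECTED_PERMILLE` and `classify` are hypothetical. The example values are taken from the table (AlaAla, LeuLeu, TrpTrp), which is given in per mille.

```python
N_STANDARD = 20

# Uniform expectation: 1 out of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"


for name, value in [("AlaAla", 7.984), ("LeuLeu", 10.921), ("TrpTrp", 0.107)]:
    print(name, value, classify(value))
```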

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.984AlaAla: 7.984 ± 0.612
1.869AlaCys: 1.869 ± 0.232
3.712AlaAsp: 3.712 ± 0.365
3.792AlaGlu: 3.792 ± 0.339
2.991AlaPhe: 2.991 ± 0.284
4.352AlaGly: 4.352 ± 0.405
2.51AlaHis: 2.51 ± 0.267
4.406AlaIle: 4.406 ± 0.281
2.43AlaLys: 2.43 ± 0.296
9.025AlaLeu: 9.025 ± 0.568
1.655AlaMet: 1.655 ± 0.249
2.483AlaAsn: 2.483 ± 0.222
6.755AlaPro: 6.755 ± 0.748
3.258AlaGln: 3.258 ± 0.329
5.127AlaArg: 5.127 ± 0.39
5.18AlaSer: 5.18 ± 0.437
5.901AlaThr: 5.901 ± 0.425
5.314AlaVal: 5.314 ± 0.385
1.121AlaTrp: 1.121 ± 0.137
2.563AlaTyr: 2.563 ± 0.296
0.0AlaXaa: 0.0 ± 0.0
Cys
1.121CysAla: 1.121 ± 0.233
0.294CysCys: 0.294 ± 0.093
0.881CysAsp: 0.881 ± 0.186
0.988CysGlu: 0.988 ± 0.194
0.801CysPhe: 0.801 ± 0.155
1.202CysGly: 1.202 ± 0.199
0.454CysHis: 0.454 ± 0.082
0.908CysIle: 0.908 ± 0.162
0.641CysLys: 0.641 ± 0.126
2.35CysLeu: 2.35 ± 0.256
0.294CysMet: 0.294 ± 0.078
0.668CysAsn: 0.668 ± 0.139
1.388CysPro: 1.388 ± 0.19
0.427CysGln: 0.427 ± 0.126
1.015CysArg: 1.015 ± 0.186
1.041CysSer: 1.041 ± 0.191
1.175CysThr: 1.175 ± 0.174
1.522CysVal: 1.522 ± 0.231
0.24CysTrp: 0.24 ± 0.083
0.427CysTyr: 0.427 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
3.952AspAla: 3.952 ± 0.345
0.774AspCys: 0.774 ± 0.138
2.884AspAsp: 2.884 ± 0.241
2.964AspGlu: 2.964 ± 0.332
1.682AspPhe: 1.682 ± 0.214
3.311AspGly: 3.311 ± 0.322
1.255AspHis: 1.255 ± 0.187
3.712AspIle: 3.712 ± 0.319
1.148AspLys: 1.148 ± 0.157
5.153AspLeu: 5.153 ± 0.43
1.202AspMet: 1.202 ± 0.13
1.762AspAsn: 1.762 ± 0.227
4.272AspPro: 4.272 ± 0.322
1.575AspGln: 1.575 ± 0.224
3.551AspArg: 3.551 ± 0.323
3.685AspSer: 3.685 ± 0.327
3.498AspThr: 3.498 ± 0.277
3.685AspVal: 3.685 ± 0.294
0.534AspTrp: 0.534 ± 0.123
1.335AspTyr: 1.335 ± 0.151
0.0AspXaa: 0.0 ± 0.0
Glu
4.673GluAla: 4.673 ± 0.359
1.015GluCys: 1.015 ± 0.182
3.578GluAsp: 3.578 ± 0.405
3.391GluGlu: 3.391 ± 0.402
2.029GluPhe: 2.029 ± 0.226
2.991GluGly: 2.991 ± 0.318
1.495GluHis: 1.495 ± 0.179
2.243GluIle: 2.243 ± 0.246
1.202GluLys: 1.202 ± 0.136
6.008GluLeu: 6.008 ± 0.396
0.935GluMet: 0.935 ± 0.143
1.709GluAsn: 1.709 ± 0.188
2.804GluPro: 2.804 ± 0.349
1.709GluGln: 1.709 ± 0.215
2.216GluArg: 2.216 ± 0.227
2.83GluSer: 2.83 ± 0.347
3.338GluThr: 3.338 ± 0.318
3.044GluVal: 3.044 ± 0.258
0.561GluTrp: 0.561 ± 0.106
1.602GluTyr: 1.602 ± 0.182
0.0GluXaa: 0.0 ± 0.0
Phe
2.617PheAla: 2.617 ± 0.238
0.694PheCys: 0.694 ± 0.148
2.43PheAsp: 2.43 ± 0.245
2.163PheGlu: 2.163 ± 0.222
1.949PhePhe: 1.949 ± 0.272
2.403PheGly: 2.403 ± 0.23
1.068PheHis: 1.068 ± 0.188
2.243PheIle: 2.243 ± 0.295
1.949PheLys: 1.949 ± 0.257
4.219PheLeu: 4.219 ± 0.421
0.694PheMet: 0.694 ± 0.128
1.175PheAsn: 1.175 ± 0.199
2.724PhePro: 2.724 ± 0.255
0.988PheGln: 0.988 ± 0.179
1.949PheArg: 1.949 ± 0.277
3.071PheSer: 3.071 ± 0.303
2.296PheThr: 2.296 ± 0.203
2.724PheVal: 2.724 ± 0.274
0.481PheTrp: 0.481 ± 0.098
1.335PheTyr: 1.335 ± 0.184
0.0PheXaa: 0.0 ± 0.0
Gly
5.501GlyAla: 5.501 ± 0.385
1.041GlyCys: 1.041 ± 0.146
3.525GlyAsp: 3.525 ± 0.308
3.338GlyGlu: 3.338 ± 0.324
2.163GlyPhe: 2.163 ± 0.242
4.646GlyGly: 4.646 ± 0.457
1.041GlyHis: 1.041 ± 0.201
2.19GlyIle: 2.19 ± 0.259
1.789GlyLys: 1.789 ± 0.223
5.42GlyLeu: 5.42 ± 0.369
1.068GlyMet: 1.068 ± 0.213
1.522GlyAsn: 1.522 ± 0.217
4.432GlyPro: 4.432 ± 0.396
1.949GlyGln: 1.949 ± 0.254
4.005GlyArg: 4.005 ± 0.425
4.299GlySer: 4.299 ± 0.365
3.578GlyThr: 3.578 ± 0.311
4.139GlyVal: 4.139 ± 0.297
0.801GlyTrp: 0.801 ± 0.138
1.709GlyTyr: 1.709 ± 0.248
0.0GlyXaa: 0.0 ± 0.0
His
2.083HisAla: 2.083 ± 0.182
0.374HisCys: 0.374 ± 0.119
0.961HisAsp: 0.961 ± 0.127
1.068HisGlu: 1.068 ± 0.189
0.881HisPhe: 0.881 ± 0.184
1.442HisGly: 1.442 ± 0.163
1.041HisHis: 1.041 ± 0.199
1.896HisIle: 1.896 ± 0.24
1.041HisLys: 1.041 ± 0.149
2.857HisLeu: 2.857 ± 0.261
0.614HisMet: 0.614 ± 0.133
1.041HisAsn: 1.041 ± 0.141
2.563HisPro: 2.563 ± 0.242
1.148HisGln: 1.148 ± 0.207
2.029HisArg: 2.029 ± 0.24
1.415HisSer: 1.415 ± 0.185
2.51HisThr: 2.51 ± 0.279
1.789HisVal: 1.789 ± 0.166
0.08HisTrp: 0.08 ± 0.047
0.828HisTyr: 0.828 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
4.085IleAla: 4.085 ± 0.357
1.041IleCys: 1.041 ± 0.19
2.457IleAsp: 2.457 ± 0.259
2.243IleGlu: 2.243 ± 0.263
2.483IlePhe: 2.483 ± 0.268
2.91IleGly: 2.91 ± 0.286
1.602IleHis: 1.602 ± 0.258
2.964IleIle: 2.964 ± 0.286
1.655IleLys: 1.655 ± 0.227
4.913IleLeu: 4.913 ± 0.38
0.694IleMet: 0.694 ± 0.133
2.056IleAsn: 2.056 ± 0.236
3.391IlePro: 3.391 ± 0.346
2.163IleGln: 2.163 ± 0.246
3.177IleArg: 3.177 ± 0.268
4.539IleSer: 4.539 ± 0.338
4.619IleThr: 4.619 ± 0.374
3.284IleVal: 3.284 ± 0.368
0.561IleTrp: 0.561 ± 0.133
1.976IleTyr: 1.976 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
2.136LysAla: 2.136 ± 0.274
0.481LysCys: 0.481 ± 0.104
1.575LysAsp: 1.575 ± 0.218
1.442LysGlu: 1.442 ± 0.199
1.255LysPhe: 1.255 ± 0.174
1.175LysGly: 1.175 ± 0.175
1.068LysHis: 1.068 ± 0.173
1.949LysIle: 1.949 ± 0.226
1.736LysLys: 1.736 ± 0.245
3.124LysLeu: 3.124 ± 0.381
0.694LysMet: 0.694 ± 0.111
0.961LysAsn: 0.961 ± 0.161
2.35LysPro: 2.35 ± 0.274
1.709LysGln: 1.709 ± 0.244
2.857LysArg: 2.857 ± 0.302
1.682LysSer: 1.682 ± 0.169
2.537LysThr: 2.537 ± 0.277
1.415LysVal: 1.415 ± 0.259
0.427LysTrp: 0.427 ± 0.114
1.388LysTyr: 1.388 ± 0.193
0.0LysXaa: 0.0 ± 0.0
Leu
8.411LeuAla: 8.411 ± 0.423
2.083LeuCys: 2.083 ± 0.295
4.78LeuAsp: 4.78 ± 0.306
4.913LeuGlu: 4.913 ± 0.318
4.539LeuPhe: 4.539 ± 0.365
6.302LeuGly: 6.302 ± 0.421
2.216LeuHis: 2.216 ± 0.255
4.886LeuIle: 4.886 ± 0.478
3.979LeuLys: 3.979 ± 0.317
10.921LeuLeu: 10.921 ± 0.579
1.789LeuMet: 1.789 ± 0.239
3.792LeuAsn: 3.792 ± 0.341
6.088LeuPro: 6.088 ± 0.46
4.886LeuGln: 4.886 ± 0.312
7.076LeuArg: 7.076 ± 0.437
7.503LeuSer: 7.503 ± 0.458
6.702LeuThr: 6.702 ± 0.504
6.195LeuVal: 6.195 ± 0.426
1.362LeuTrp: 1.362 ± 0.185
3.151LeuTyr: 3.151 ± 0.263
0.0LeuXaa: 0.0 ± 0.0
Met
2.27MetAla: 2.27 ± 0.287
0.294MetCys: 0.294 ± 0.081
1.415MetAsp: 1.415 ± 0.164
0.854MetGlu: 0.854 ± 0.12
1.121MetPhe: 1.121 ± 0.176
1.228MetGly: 1.228 ± 0.155
0.347MetHis: 0.347 ± 0.11
0.828MetIle: 0.828 ± 0.168
0.427MetLys: 0.427 ± 0.117
1.575MetLeu: 1.575 ± 0.226
0.427MetMet: 0.427 ± 0.127
0.507MetAsn: 0.507 ± 0.133
0.908MetPro: 0.908 ± 0.154
0.721MetGln: 0.721 ± 0.132
1.202MetArg: 1.202 ± 0.16
1.282MetSer: 1.282 ± 0.172
1.068MetThr: 1.068 ± 0.174
1.415MetVal: 1.415 ± 0.202
0.187MetTrp: 0.187 ± 0.076
0.561MetTyr: 0.561 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
2.136AsnAla: 2.136 ± 0.197
0.614AsnCys: 0.614 ± 0.134
1.549AsnAsp: 1.549 ± 0.21
1.736AsnGlu: 1.736 ± 0.175
1.495AsnPhe: 1.495 ± 0.224
1.495AsnGly: 1.495 ± 0.206
0.614AsnHis: 0.614 ± 0.114
2.643AsnIle: 2.643 ± 0.29
1.041AsnLys: 1.041 ± 0.184
3.311AsnLeu: 3.311 ± 0.296
0.614AsnMet: 0.614 ± 0.106
1.869AsnAsn: 1.869 ± 0.208
3.097AsnPro: 3.097 ± 0.281
1.602AsnGln: 1.602 ± 0.194
1.869AsnArg: 1.869 ± 0.216
2.563AsnSer: 2.563 ± 0.293
3.258AsnThr: 3.258 ± 0.285
1.709AsnVal: 1.709 ± 0.214
0.374AsnTrp: 0.374 ± 0.082
1.228AsnTyr: 1.228 ± 0.179
0.0AsnXaa: 0.0 ± 0.0
Pro
5.714ProAla: 5.714 ± 0.757
1.095ProCys: 1.095 ± 0.16
4.085ProAsp: 4.085 ± 0.373
3.845ProGlu: 3.845 ± 0.395
2.136ProPhe: 2.136 ± 0.25
4.192ProGly: 4.192 ± 0.472
2.75ProHis: 2.75 ± 0.262
3.872ProIle: 3.872 ± 0.369
2.43ProLys: 2.43 ± 0.317
6.515ProLeu: 6.515 ± 0.515
1.362ProMet: 1.362 ± 0.221
2.964ProAsn: 2.964 ± 0.292
9.372ProPro: 9.372 ± 0.847
3.391ProGln: 3.391 ± 0.28
5.634ProArg: 5.634 ± 0.609
6.302ProSer: 6.302 ± 0.532
6.622ProThr: 6.622 ± 0.356
3.952ProVal: 3.952 ± 0.329
0.694ProTrp: 0.694 ± 0.168
1.549ProTyr: 1.549 ± 0.239
0.0ProXaa: 0.0 ± 0.0
Gln
2.83GlnAla: 2.83 ± 0.303
0.614GlnCys: 0.614 ± 0.11
1.896GlnAsp: 1.896 ± 0.242
1.575GlnGlu: 1.575 ± 0.271
1.522GlnPhe: 1.522 ± 0.227
1.549GlnGly: 1.549 ± 0.22
1.148GlnHis: 1.148 ± 0.157
2.403GlnIle: 2.403 ± 0.197
1.015GlnLys: 1.015 ± 0.192
4.619GlnLeu: 4.619 ± 0.371
0.854GlnMet: 0.854 ± 0.131
1.629GlnAsn: 1.629 ± 0.197
3.177GlnPro: 3.177 ± 0.272
2.109GlnGln: 2.109 ± 0.229
2.724GlnArg: 2.724 ± 0.214
2.457GlnSer: 2.457 ± 0.207
3.818GlnThr: 3.818 ± 0.31
1.495GlnVal: 1.495 ± 0.212
0.32GlnTrp: 0.32 ± 0.109
1.255GlnTyr: 1.255 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
6.515ArgAla: 6.515 ± 0.545
1.202ArgCys: 1.202 ± 0.207
3.044ArgAsp: 3.044 ± 0.288
3.204ArgGlu: 3.204 ± 0.268
2.35ArgPhe: 2.35 ± 0.242
4.566ArgGly: 4.566 ± 0.398
1.736ArgHis: 1.736 ± 0.198
2.376ArgIle: 2.376 ± 0.231
1.736ArgLys: 1.736 ± 0.194
6.382ArgLeu: 6.382 ± 0.429
1.308ArgMet: 1.308 ± 0.214
1.682ArgAsn: 1.682 ± 0.261
5.581ArgPro: 5.581 ± 0.46
2.27ArgGln: 2.27 ± 0.256
6.649ArgArg: 6.649 ± 0.608
4.566ArgSer: 4.566 ± 0.384
4.833ArgThr: 4.833 ± 0.393
4.726ArgVal: 4.726 ± 0.396
0.694ArgTrp: 0.694 ± 0.144
1.816ArgTyr: 1.816 ± 0.209
0.0ArgXaa: 0.0 ± 0.0
Ser
5.447SerAla: 5.447 ± 0.303
1.041SerCys: 1.041 ± 0.128
3.311SerAsp: 3.311 ± 0.429
4.139SerGlu: 4.139 ± 0.364
2.376SerPhe: 2.376 ± 0.281
4.032SerGly: 4.032 ± 0.324
1.896SerHis: 1.896 ± 0.212
3.364SerIle: 3.364 ± 0.294
1.976SerLys: 1.976 ± 0.203
7.53SerLeu: 7.53 ± 0.366
1.202SerMet: 1.202 ± 0.163
2.323SerAsn: 2.323 ± 0.248
6.328SerPro: 6.328 ± 0.677
2.697SerGln: 2.697 ± 0.27
4.593SerArg: 4.593 ± 0.346
7.263SerSer: 7.263 ± 0.771
5.234SerThr: 5.234 ± 0.373
4.032SerVal: 4.032 ± 0.378
0.961SerTrp: 0.961 ± 0.138
1.736SerTyr: 1.736 ± 0.192
0.0SerXaa: 0.0 ± 0.0
Thr
6.355ThrAla: 6.355 ± 0.405
1.148ThrCys: 1.148 ± 0.199
3.818ThrAsp: 3.818 ± 0.346
3.017ThrGlu: 3.017 ± 0.287
2.804ThrPhe: 2.804 ± 0.263
3.818ThrGly: 3.818 ± 0.335
2.643ThrHis: 2.643 ± 0.218
4.139ThrIle: 4.139 ± 0.359
2.617ThrLys: 2.617 ± 0.223
6.488ThrLeu: 6.488 ± 0.358
1.335ThrMet: 1.335 ± 0.174
3.017ThrAsn: 3.017 ± 0.262
6.382ThrPro: 6.382 ± 0.609
3.418ThrGln: 3.418 ± 0.309
4.379ThrArg: 4.379 ± 0.266
4.699ThrSer: 4.699 ± 0.344
5.821ThrThr: 5.821 ± 0.428
4.192ThrVal: 4.192 ± 0.328
0.721ThrTrp: 0.721 ± 0.137
2.457ThrTyr: 2.457 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
5.474ValAla: 5.474 ± 0.404
1.469ValCys: 1.469 ± 0.188
3.525ValAsp: 3.525 ± 0.214
2.67ValGlu: 2.67 ± 0.217
2.83ValPhe: 2.83 ± 0.362
3.979ValGly: 3.979 ± 0.341
1.602ValHis: 1.602 ± 0.214
3.124ValIle: 3.124 ± 0.33
2.003ValLys: 2.003 ± 0.225
6.675ValLeu: 6.675 ± 0.394
0.988ValMet: 0.988 ± 0.136
1.976ValAsn: 1.976 ± 0.204
4.539ValPro: 4.539 ± 0.388
1.682ValGln: 1.682 ± 0.219
4.192ValArg: 4.192 ± 0.395
4.432ValSer: 4.432 ± 0.291
3.498ValThr: 3.498 ± 0.296
4.619ValVal: 4.619 ± 0.394
0.828ValTrp: 0.828 ± 0.155
2.136ValTyr: 2.136 ± 0.211
0.0ValXaa: 0.0 ± 0.0
Trp
1.015TrpAla: 1.015 ± 0.133
0.134TrpCys: 0.134 ± 0.064
0.935TrpAsp: 0.935 ± 0.157
0.587TrpGlu: 0.587 ± 0.145
0.534TrpPhe: 0.534 ± 0.152
0.694TrpGly: 0.694 ± 0.146
0.32TrpHis: 0.32 ± 0.103
0.561TrpIle: 0.561 ± 0.113
0.32TrpLys: 0.32 ± 0.09
1.308TrpLeu: 1.308 ± 0.17
0.16TrpMet: 0.16 ± 0.059
0.347TrpAsn: 0.347 ± 0.091
0.587TrpPro: 0.587 ± 0.128
0.267TrpGln: 0.267 ± 0.089
0.881TrpArg: 0.881 ± 0.159
0.668TrpSer: 0.668 ± 0.147
0.828TrpThr: 0.828 ± 0.14
0.908TrpVal: 0.908 ± 0.154
0.107TrpTrp: 0.107 ± 0.054
0.267TrpTyr: 0.267 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.109TyrAla: 2.109 ± 0.277
0.561TyrCys: 0.561 ± 0.121
1.575TyrAsp: 1.575 ± 0.202
1.442TyrGlu: 1.442 ± 0.243
1.228TyrPhe: 1.228 ± 0.215
1.869TyrGly: 1.869 ± 0.244
0.854TyrHis: 0.854 ± 0.15
2.109TyrIle: 2.109 ± 0.239
0.828TyrLys: 0.828 ± 0.153
3.177TyrLeu: 3.177 ± 0.31
0.721TyrMet: 0.721 ± 0.128
1.335TyrAsn: 1.335 ± 0.2
1.736TyrPro: 1.736 ± 0.261
1.041TyrGln: 1.041 ± 0.214
2.109TyrArg: 2.109 ± 0.261
2.003TyrSer: 2.003 ± 0.193
2.136TyrThr: 2.136 ± 0.229
2.109TyrVal: 2.109 ± 0.252
0.374TyrTrp: 0.374 ± 0.092
1.175TyrTyr: 1.175 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (37452 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
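A protein-level bootstrap of this kind could look roughly like the sketch below. It is written under several assumptions that the note does not confirm: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of the 100 replicates, and the reported error is the standard deviation over replicates. The function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code (not real viral proteins).
toy = ["MAAL", "AALK", "MKKA", "LLAA"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.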

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski