Amino acid dipeptide frequency for Papio ursinus cytomegalovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
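As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It assumes overlapping two-residue windows over one-letter protein sequences, with counts normalised by the total number of dipeptides and reported in per mille. The page does not state the exact procedure used for this table, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return overlapping dipeptide frequencies in per mille (‰).

    Assumption: each protein is a string of one-letter amino acid codes,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In this toy input there are six dipeptides in total, two of which are MK, so MK accounts for one third of all pairs.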

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
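The expected value and the red/blue marking can be sketched in Python as follows. This is only an illustration: it assumes the threshold is the uniform expectation over the 20 standard amino acids and ignores the error estimate, whereas the page does not say exactly how the colouring is decided. The names `classify` and `EXPECTED_PER_MILLE` are hypothetical; the three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked in red
    if freq_per_mille < expected:
        return "under"     # would be marked in blue
    return "expected"

# Values copied from the table (in per mille).
examples = {"LeuLeu": 11.315, "AlaAla": 8.043, "TrpTrp": 0.191}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

Under this assumption LeuLeu and AlaAla are overrepresented, while TrpTrp is underrepresented.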

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 8.043‰ corresponds to a fraction of 0.008043.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.043 ± 0.702
AlaCys: 1.936 ± 0.312
AlaAsp: 3.544 ± 0.344
AlaGlu: 3.544 ± 0.301
AlaPhe: 2.836 ± 0.296
AlaGly: 3.354 ± 0.341
AlaHis: 1.609 ± 0.201
AlaIle: 3.572 ± 0.326
AlaLys: 2.208 ± 0.322
AlaLeu: 7.852 ± 0.446
AlaMet: 1.936 ± 0.276
AlaAsn: 2.154 ± 0.261
AlaPro: 3.654 ± 0.445
AlaGln: 2.672 ± 0.353
AlaArg: 4.035 ± 0.382
AlaSer: 5.835 ± 0.47
AlaThr: 4.771 ± 0.485
AlaVal: 6.026 ± 0.456
AlaTrp: 1.063 ± 0.183
AlaTyr: 2.072 ± 0.234
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.636 ± 0.231
CysCys: 0.791 ± 0.148
CysAsp: 1.527 ± 0.252
CysGlu: 1.336 ± 0.191
CysPhe: 1.036 ± 0.16
CysGly: 1.391 ± 0.225
CysHis: 0.927 ± 0.146
CysIle: 1.254 ± 0.179
CysLys: 0.791 ± 0.167
CysLeu: 3.026 ± 0.302
CysMet: 0.654 ± 0.165
CysAsn: 1.118 ± 0.241
CysPro: 1.091 ± 0.183
CysGln: 0.927 ± 0.098
CysArg: 1.609 ± 0.201
CysSer: 1.609 ± 0.233
CysThr: 1.527 ± 0.237
CysVal: 2.318 ± 0.276
CysTrp: 0.327 ± 0.096
CysTyr: 0.872 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.408 ± 0.35
AspCys: 0.982 ± 0.167
AspAsp: 3.844 ± 0.54
AspGlu: 4.608 ± 0.465
AspPhe: 2.263 ± 0.222
AspGly: 2.263 ± 0.255
AspHis: 1.69 ± 0.174
AspIle: 2.672 ± 0.273
AspLys: 1.309 ± 0.199
AspLeu: 5.944 ± 0.403
AspMet: 1.418 ± 0.181
AspAsn: 1.609 ± 0.195
AspPro: 2.563 ± 0.299
AspGln: 1.445 ± 0.195
AspArg: 3.163 ± 0.295
AspSer: 3.681 ± 0.424
AspThr: 2.672 ± 0.301
AspVal: 3.299 ± 0.338
AspTrp: 0.6 ± 0.142
AspTyr: 1.854 ± 0.214
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.117 ± 0.414
GluCys: 1.2 ± 0.176
GluAsp: 3.463 ± 0.409
GluGlu: 3.681 ± 0.321
GluPhe: 1.936 ± 0.247
GluGly: 1.936 ± 0.211
GluHis: 1.909 ± 0.257
GluIle: 2.399 ± 0.213
GluLys: 2.072 ± 0.31
GluLeu: 5.426 ± 0.425
GluMet: 1.227 ± 0.157
GluAsn: 2.563 ± 0.291
GluPro: 2.454 ± 0.236
GluGln: 2.045 ± 0.296
GluArg: 3.981 ± 0.417
GluSer: 3.135 ± 0.253
GluThr: 3.626 ± 0.237
GluVal: 3.872 ± 0.368
GluTrp: 0.545 ± 0.116
GluTyr: 1.554 ± 0.201
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.481 ± 0.261
PheCys: 1.363 ± 0.208
PheAsp: 1.963 ± 0.3
PheGlu: 1.881 ± 0.241
PhePhe: 2.372 ± 0.25
PheGly: 2.454 ± 0.219
PheHis: 1.145 ± 0.165
PheIle: 1.99 ± 0.191
PheLys: 1.5 ± 0.179
PheLeu: 5.126 ± 0.363
PheMet: 1.145 ± 0.183
PheAsn: 1.745 ± 0.221
PhePro: 2.29 ± 0.218
PheGln: 1.718 ± 0.222
PheArg: 2.808 ± 0.309
PheSer: 2.508 ± 0.277
PheThr: 2.672 ± 0.284
PheVal: 3.272 ± 0.294
PheTrp: 0.791 ± 0.182
PheTyr: 1.909 ± 0.26
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.054 ± 0.337
GlyCys: 1.118 ± 0.195
GlyAsp: 2.563 ± 0.243
GlyGlu: 2.89 ± 0.289
GlyPhe: 1.718 ± 0.214
GlyGly: 3.299 ± 0.384
GlyHis: 1.254 ± 0.169
GlyIle: 2.236 ± 0.235
GlyLys: 1.527 ± 0.143
GlyLeu: 5.044 ± 0.346
GlyMet: 0.791 ± 0.137
GlyAsn: 1.909 ± 0.228
GlyPro: 2.154 ± 0.282
GlyGln: 1.609 ± 0.243
GlyArg: 3.435 ± 0.39
GlySer: 3.599 ± 0.235
GlyThr: 2.454 ± 0.273
GlyVal: 3.654 ± 0.326
GlyTrp: 0.573 ± 0.103
GlyTyr: 1.936 ± 0.234
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.454 ± 0.278
HisCys: 0.736 ± 0.164
HisAsp: 1.636 ± 0.221
HisGlu: 1.472 ± 0.173
HisPhe: 0.9 ± 0.155
HisGly: 1.718 ± 0.196
HisHis: 1.718 ± 0.268
HisIle: 1.5 ± 0.203
HisLys: 0.982 ± 0.156
HisLeu: 3.354 ± 0.368
HisMet: 0.573 ± 0.136
HisAsn: 1.254 ± 0.215
HisPro: 1.663 ± 0.188
HisGln: 1.336 ± 0.241
HisArg: 2.099 ± 0.253
HisSer: 1.527 ± 0.229
HisThr: 1.909 ± 0.267
HisVal: 2.454 ± 0.227
HisTrp: 0.3 ± 0.082
HisTyr: 1.063 ± 0.169
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.945 ± 0.356
IleCys: 1.063 ± 0.207
IleAsp: 2.018 ± 0.222
IleGlu: 1.554 ± 0.257
IlePhe: 2.508 ± 0.249
IleGly: 1.909 ± 0.273
IleHis: 1.281 ± 0.208
IleIle: 2.945 ± 0.41
IleLys: 2.127 ± 0.278
IleLeu: 4.362 ± 0.425
IleMet: 1.472 ± 0.228
IleAsn: 1.799 ± 0.232
IlePro: 3.026 ± 0.268
IleGln: 2.099 ± 0.264
IleArg: 2.808 ± 0.247
IleSer: 3.245 ± 0.282
IleThr: 3.817 ± 0.408
IleVal: 3.654 ± 0.35
IleTrp: 0.6 ± 0.161
IleTyr: 2.508 ± 0.275
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.972 ± 0.301
LysCys: 0.845 ± 0.148
LysAsp: 1.827 ± 0.266
LysGlu: 1.799 ± 0.271
LysPhe: 1.091 ± 0.176
LysGly: 1.091 ± 0.184
LysHis: 1.5 ± 0.173
LysIle: 2.263 ± 0.238
LysLys: 2.345 ± 0.365
LysLeu: 3.463 ± 0.333
LysMet: 0.927 ± 0.151
LysAsn: 2.045 ± 0.224
LysPro: 1.636 ± 0.552
LysGln: 1.745 ± 0.258
LysArg: 3.135 ± 0.262
LysSer: 2.945 ± 0.32
LysThr: 2.481 ± 0.383
LysVal: 1.854 ± 0.218
LysTrp: 0.436 ± 0.104
LysTyr: 1.172 ± 0.186
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.544 ± 0.417
LeuCys: 3.654 ± 0.314
LeuAsp: 4.281 ± 0.415
LeuGlu: 4.88 ± 0.397
LeuPhe: 5.289 ± 0.37
LeuGly: 4.717 ± 0.399
LeuHis: 2.863 ± 0.306
LeuIle: 5.262 ± 0.528
LeuLys: 4.308 ± 0.422
LeuLeu: 11.315 ± 0.706
LeuMet: 2.727 ± 0.233
LeuAsn: 3.953 ± 0.407
LeuPro: 5.535 ± 0.386
LeuGln: 3.735 ± 0.459
LeuArg: 7.716 ± 0.637
LeuSer: 7.798 ± 0.515
LeuThr: 7.089 ± 0.455
LeuVal: 6.489 ± 0.412
LeuTrp: 1.227 ± 0.164
LeuTyr: 3.953 ± 0.369
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.318 ± 0.288
MetCys: 0.654 ± 0.145
MetAsp: 1.118 ± 0.176
MetGlu: 1.391 ± 0.228
MetPhe: 1.363 ± 0.243
MetGly: 0.845 ± 0.138
MetHis: 0.627 ± 0.137
MetIle: 1.091 ± 0.162
MetLys: 0.954 ± 0.128
MetLeu: 2.972 ± 0.344
MetMet: 0.736 ± 0.155
MetAsn: 1.118 ± 0.184
MetPro: 0.927 ± 0.194
MetGln: 0.9 ± 0.15
MetArg: 1.2 ± 0.161
MetSer: 2.045 ± 0.227
MetThr: 1.745 ± 0.215
MetVal: 1.5 ± 0.19
MetTrp: 0.518 ± 0.118
MetTyr: 0.845 ± 0.16
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.026 ± 0.265
AsnCys: 0.682 ± 0.17
AsnAsp: 1.772 ± 0.215
AsnGlu: 1.772 ± 0.203
AsnPhe: 1.663 ± 0.187
AsnGly: 1.472 ± 0.191
AsnHis: 0.982 ± 0.123
AsnIle: 2.345 ± 0.298
AsnLys: 1.609 ± 0.216
AsnLeu: 4.172 ± 0.384
AsnMet: 0.954 ± 0.191
AsnAsn: 1.799 ± 0.208
AsnPro: 1.827 ± 0.266
AsnGln: 1.636 ± 0.263
AsnArg: 2.208 ± 0.266
AsnSer: 3.108 ± 0.303
AsnThr: 2.972 ± 0.402
AsnVal: 3.354 ± 0.306
AsnTrp: 0.354 ± 0.099
AsnTyr: 1.445 ± 0.194
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.144 ± 0.495
ProCys: 1.472 ± 0.209
ProAsp: 3.054 ± 0.291
ProGlu: 2.481 ± 0.245
ProPhe: 2.263 ± 0.267
ProGly: 2.427 ± 0.334
ProHis: 1.663 ± 0.219
ProIle: 2.099 ± 0.228
ProLys: 1.745 ± 0.241
ProLeu: 4.581 ± 0.378
ProMet: 1.118 ± 0.168
ProAsn: 1.554 ± 0.177
ProPro: 5.535 ± 0.972
ProGln: 1.772 ± 0.262
ProArg: 3.817 ± 0.392
ProSer: 4.744 ± 0.413
ProThr: 3.49 ± 0.342
ProVal: 4.226 ± 0.345
ProTrp: 0.682 ± 0.171
ProTyr: 1.799 ± 0.224
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.29 ± 0.321
GlnCys: 0.845 ± 0.146
GlnAsp: 1.636 ± 0.24
GlnGlu: 2.099 ± 0.251
GlnPhe: 1.391 ± 0.215
GlnGly: 1.281 ± 0.186
GlnHis: 1.118 ± 0.194
GlnIle: 1.5 ± 0.228
GlnLys: 1.909 ± 0.198
GlnLeu: 4.526 ± 0.452
GlnMet: 1.145 ± 0.191
GlnAsn: 1.636 ± 0.237
GlnPro: 2.318 ± 0.28
GlnGln: 2.836 ± 0.484
GlnArg: 2.645 ± 0.283
GlnSer: 2.345 ± 0.228
GlnThr: 2.808 ± 0.376
GlnVal: 2.727 ± 0.302
GlnTrp: 0.491 ± 0.106
GlnTyr: 1.336 ± 0.231
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.172 ± 0.362
ArgCys: 1.581 ± 0.208
ArgAsp: 4.09 ± 0.41
ArgGlu: 3.844 ± 0.345
ArgPhe: 2.508 ± 0.289
ArgGly: 3.572 ± 0.339
ArgHis: 2.617 ± 0.258
ArgIle: 2.454 ± 0.226
ArgLys: 2.236 ± 0.23
ArgLeu: 7.28 ± 0.496
ArgMet: 1.554 ± 0.218
ArgAsn: 2.318 ± 0.277
ArgPro: 3.626 ± 0.274
ArgGln: 3.326 ± 0.325
ArgArg: 5.944 ± 0.595
ArgSer: 3.953 ± 0.356
ArgThr: 3.354 ± 0.337
ArgVal: 4.581 ± 0.418
ArgTrp: 0.927 ± 0.216
ArgTyr: 2.863 ± 0.257
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.107 ± 0.503
SerCys: 1.363 ± 0.21
SerAsp: 3.981 ± 0.323
SerGlu: 4.035 ± 0.273
SerPhe: 2.836 ± 0.279
SerGly: 4.253 ± 0.379
SerHis: 2.208 ± 0.232
SerIle: 2.972 ± 0.286
SerLys: 2.754 ± 0.374
SerLeu: 6.871 ± 0.424
SerMet: 1.772 ± 0.226
SerAsn: 3.054 ± 0.321
SerPro: 4.935 ± 0.383
SerGln: 2.645 ± 0.259
SerArg: 5.044 ± 0.375
SerSer: 7.798 ± 0.707
SerThr: 4.526 ± 0.674
SerVal: 4.88 ± 0.421
SerTrp: 0.818 ± 0.166
SerTyr: 2.508 ± 0.278
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.017 ± 0.404
ThrCys: 1.718 ± 0.212
ThrAsp: 3.217 ± 0.391
ThrGlu: 3.026 ± 0.279
ThrPhe: 3.19 ± 0.287
ThrGly: 3.026 ± 0.263
ThrHis: 1.718 ± 0.226
ThrIle: 3.272 ± 0.313
ThrLys: 2.481 ± 0.239
ThrLeu: 5.944 ± 0.558
ThrMet: 1.445 ± 0.179
ThrAsn: 2.399 ± 0.347
ThrPro: 3.599 ± 0.329
ThrGln: 2.29 ± 0.239
ThrArg: 3.354 ± 0.312
ThrSer: 6.162 ± 0.725
ThrThr: 5.889 ± 0.832
ThrVal: 5.398 ± 0.353
ThrTrp: 0.845 ± 0.162
ThrTyr: 1.854 ± 0.253
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.126 ± 0.41
ValCys: 2.181 ± 0.276
ValAsp: 3.108 ± 0.39
ValGlu: 3.681 ± 0.306
ValPhe: 3.953 ± 0.46
ValGly: 2.972 ± 0.314
ValHis: 2.208 ± 0.255
ValIle: 3.626 ± 0.332
ValLys: 2.754 ± 0.354
ValLeu: 6.707 ± 0.439
ValMet: 2.018 ± 0.239
ValAsn: 3.054 ± 0.394
ValPro: 3.735 ± 0.276
ValGln: 2.399 ± 0.306
ValArg: 4.253 ± 0.373
ValSer: 6.489 ± 0.397
ValThr: 4.962 ± 0.409
ValVal: 5.317 ± 0.399
ValTrp: 1.063 ± 0.187
ValTyr: 3.108 ± 0.348
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.464 ± 0.093
TrpCys: 0.518 ± 0.12
TrpAsp: 0.627 ± 0.131
TrpGlu: 0.709 ± 0.188
TrpPhe: 0.464 ± 0.11
TrpGly: 0.573 ± 0.154
TrpHis: 0.382 ± 0.107
TrpIle: 0.627 ± 0.128
TrpLys: 0.491 ± 0.127
TrpLeu: 1.663 ± 0.226
TrpMet: 0.436 ± 0.108
TrpAsn: 0.464 ± 0.102
TrpPro: 0.682 ± 0.12
TrpGln: 0.409 ± 0.101
TrpArg: 1.063 ± 0.23
TrpSer: 0.872 ± 0.136
TrpThr: 0.845 ± 0.148
TrpVal: 0.736 ± 0.138
TrpTrp: 0.191 ± 0.075
TrpTyr: 0.573 ± 0.122
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.399 ± 0.24
TyrCys: 1.063 ± 0.141
TyrAsp: 2.018 ± 0.261
TyrGlu: 2.454 ± 0.23
TyrPhe: 1.554 ± 0.198
TyrGly: 2.263 ± 0.215
TyrHis: 1.227 ± 0.164
TyrIle: 1.663 ± 0.215
TyrLys: 1.472 ± 0.199
TyrLeu: 3.708 ± 0.284
TyrMet: 0.791 ± 0.155
TyrAsn: 1.609 ± 0.225
TyrPro: 1.418 ± 0.207
TyrGln: 1.336 ± 0.208
TyrArg: 2.481 ± 0.268
TyrSer: 1.99 ± 0.239
TyrThr: 2.236 ± 0.284
TyrVal: 3.108 ± 0.294
TyrTrp: 0.436 ± 0.109
TyrTyr: 1.554 ± 0.152
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (36678 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
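A protein-level bootstrap of this kind can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the error is taken as the standard deviation of those estimates. The function names, the fixed seed and the toy sequences are hypothetical, and the page does not say whether the ± value is a standard deviation.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues within a
    protein are never shuffled or split.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteome = ["MKKLA", "MKA", "LLLL"]
mean, err = bootstrap_error(toy_proteome, "MK")
print(f"MK: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.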

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski