Amino acid dipepetide frequency for Retroperitoneal fibromatosis-associated herpesvirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.574AlaAla: 8.574 ± 0.719
2.99AlaCys: 2.99 ± 0.351
4.128AlaAsp: 4.128 ± 0.399
3.44AlaGlu: 3.44 ± 0.358
2.911AlaPhe: 2.911 ± 0.285
5.002AlaGly: 5.002 ± 0.343
1.773AlaHis: 1.773 ± 0.236
3.705AlaIle: 3.705 ± 0.327
2.302AlaLys: 2.302 ± 0.263
9.236AlaLeu: 9.236 ± 0.684
1.932AlaMet: 1.932 ± 0.191
2.514AlaAsn: 2.514 ± 0.314
6.351AlaPro: 6.351 ± 0.493
2.885AlaGln: 2.885 ± 0.288
4.949AlaArg: 4.949 ± 0.409
7.066AlaSer: 7.066 ± 0.443
5.822AlaThr: 5.822 ± 0.436
5.637AlaVal: 5.637 ± 0.425
1.217AlaTrp: 1.217 ± 0.196
2.435AlaTyr: 2.435 ± 0.334
0.0AlaXaa: 0.0 ± 0.0
Cys
1.958CysAla: 1.958 ± 0.317
0.529CysCys: 0.529 ± 0.117
1.217CysAsp: 1.217 ± 0.198
1.059CysGlu: 1.059 ± 0.215
1.059CysPhe: 1.059 ± 0.172
1.456CysGly: 1.456 ± 0.209
0.873CysHis: 0.873 ± 0.162
1.35CysIle: 1.35 ± 0.194
0.794CysLys: 0.794 ± 0.147
2.99CysLeu: 2.99 ± 0.393
0.423CysMet: 0.423 ± 0.101
0.662CysAsn: 0.662 ± 0.117
1.429CysPro: 1.429 ± 0.2
1.323CysGln: 1.323 ± 0.157
1.747CysArg: 1.747 ± 0.214
1.297CysSer: 1.297 ± 0.162
1.244CysThr: 1.244 ± 0.193
2.038CysVal: 2.038 ± 0.25
0.291CysTrp: 0.291 ± 0.085
0.741CysTyr: 0.741 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
4.737AspAla: 4.737 ± 0.409
0.847AspCys: 0.847 ± 0.153
2.699AspAsp: 2.699 ± 0.377
2.408AspGlu: 2.408 ± 0.217
1.773AspPhe: 1.773 ± 0.256
2.699AspGly: 2.699 ± 0.256
0.979AspHis: 0.979 ± 0.156
2.276AspIle: 2.276 ± 0.212
1.085AspLys: 1.085 ± 0.163
4.472AspLeu: 4.472 ± 0.336
1.032AspMet: 1.032 ± 0.188
1.403AspAsn: 1.403 ± 0.164
3.52AspPro: 3.52 ± 0.39
1.323AspGln: 1.323 ± 0.183
3.043AspArg: 3.043 ± 0.237
3.123AspSer: 3.123 ± 0.332
3.308AspThr: 3.308 ± 0.443
3.599AspVal: 3.599 ± 0.384
0.767AspTrp: 0.767 ± 0.141
1.138AspTyr: 1.138 ± 0.148
0.0AspXaa: 0.0 ± 0.0
Glu
5.161GluAla: 5.161 ± 0.469
0.926GluCys: 0.926 ± 0.2
2.488GluAsp: 2.488 ± 0.26
5.478GluGlu: 5.478 ± 2.736
1.747GluPhe: 1.747 ± 0.25
3.096GluGly: 3.096 ± 0.348
1.35GluHis: 1.35 ± 0.172
2.885GluIle: 2.885 ± 0.357
1.985GluLys: 1.985 ± 0.203
4.922GluLeu: 4.922 ± 0.373
1.032GluMet: 1.032 ± 0.213
1.8GluAsn: 1.8 ± 0.232
7.463GluPro: 7.463 ± 4.438
1.985GluGln: 1.985 ± 0.211
3.017GluArg: 3.017 ± 0.282
2.885GluSer: 2.885 ± 0.346
4.658GluThr: 4.658 ± 0.381
3.308GluVal: 3.308 ± 0.407
0.423GluTrp: 0.423 ± 0.104
1.482GluTyr: 1.482 ± 0.238
0.0GluXaa: 0.0 ± 0.0
Phe
2.858PheAla: 2.858 ± 0.287
1.032PheCys: 1.032 ± 0.17
1.905PheAsp: 1.905 ± 0.204
1.535PheGlu: 1.535 ± 0.238
2.355PhePhe: 2.355 ± 0.239
2.144PheGly: 2.144 ± 0.268
1.164PheHis: 1.164 ± 0.192
2.011PheIle: 2.011 ± 0.243
1.535PheLys: 1.535 ± 0.301
5.372PheLeu: 5.372 ± 0.421
0.582PheMet: 0.582 ± 0.134
1.561PheAsn: 1.561 ± 0.221
2.329PhePro: 2.329 ± 0.253
1.561PheGln: 1.561 ± 0.191
2.038PheArg: 2.038 ± 0.218
2.699PheSer: 2.699 ± 0.361
1.905PheThr: 1.905 ± 0.24
3.149PheVal: 3.149 ± 0.303
0.423PheTrp: 0.423 ± 0.114
1.72PheTyr: 1.72 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
4.393GlyAla: 4.393 ± 0.372
1.376GlyCys: 1.376 ± 0.215
3.493GlyAsp: 3.493 ± 0.4
3.176GlyGlu: 3.176 ± 0.329
2.567GlyPhe: 2.567 ± 0.258
3.837GlyGly: 3.837 ± 0.388
1.8GlyHis: 1.8 ± 0.292
2.911GlyIle: 2.911 ± 0.226
1.958GlyLys: 1.958 ± 0.258
6.034GlyLeu: 6.034 ± 0.462
1.164GlyMet: 1.164 ± 0.193
2.064GlyAsn: 2.064 ± 0.21
3.97GlyPro: 3.97 ± 0.367
2.488GlyGln: 2.488 ± 0.225
4.578GlyArg: 4.578 ± 0.337
4.446GlySer: 4.446 ± 0.428
4.393GlyThr: 4.393 ± 0.3
3.731GlyVal: 3.731 ± 0.338
0.635GlyTrp: 0.635 ± 0.134
1.508GlyTyr: 1.508 ± 0.23
0.0GlyXaa: 0.0 ± 0.0
His
1.985HisAla: 1.985 ± 0.252
0.529HisCys: 0.529 ± 0.108
1.164HisAsp: 1.164 ± 0.156
1.191HisGlu: 1.191 ± 0.167
0.953HisPhe: 0.953 ± 0.169
1.985HisGly: 1.985 ± 0.183
1.244HisHis: 1.244 ± 0.243
1.164HisIle: 1.164 ± 0.228
0.979HisLys: 0.979 ± 0.168
2.938HisLeu: 2.938 ± 0.298
0.82HisMet: 0.82 ± 0.137
0.847HisAsn: 0.847 ± 0.174
2.011HisPro: 2.011 ± 0.218
1.006HisGln: 1.006 ± 0.242
1.932HisArg: 1.932 ± 0.208
1.667HisSer: 1.667 ± 0.207
1.614HisThr: 1.614 ± 0.33
2.355HisVal: 2.355 ± 0.281
0.318HisTrp: 0.318 ± 0.107
0.741HisTyr: 0.741 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
2.911IleAla: 2.911 ± 0.234
1.429IleCys: 1.429 ± 0.2
2.144IleAsp: 2.144 ± 0.287
1.8IleGlu: 1.8 ± 0.307
2.752IlePhe: 2.752 ± 0.257
2.011IleGly: 2.011 ± 0.208
0.979IleHis: 0.979 ± 0.215
1.8IleIle: 1.8 ± 0.225
1.879IleLys: 1.879 ± 0.299
4.684IleLeu: 4.684 ± 0.402
1.032IleMet: 1.032 ± 0.19
1.508IleAsn: 1.508 ± 0.201
2.752IlePro: 2.752 ± 0.331
1.826IleGln: 1.826 ± 0.24
2.938IleArg: 2.938 ± 0.307
3.202IleSer: 3.202 ± 0.275
2.911IleThr: 2.911 ± 0.344
2.964IleVal: 2.964 ± 0.297
0.45IleTrp: 0.45 ± 0.106
2.276IleTyr: 2.276 ± 0.24
0.0IleXaa: 0.0 ± 0.0
Lys
2.355LysAla: 2.355 ± 0.286
0.953LysCys: 0.953 ± 0.164
1.297LysAsp: 1.297 ± 0.219
1.932LysGlu: 1.932 ± 0.215
1.111LysPhe: 1.111 ± 0.162
1.614LysGly: 1.614 ± 0.233
1.456LysHis: 1.456 ± 0.22
2.038LysIle: 2.038 ± 0.311
1.614LysLys: 1.614 ± 0.253
3.705LysLeu: 3.705 ± 0.298
0.609LysMet: 0.609 ± 0.141
1.773LysAsn: 1.773 ± 0.235
1.8LysPro: 1.8 ± 0.234
1.429LysGln: 1.429 ± 0.234
2.408LysArg: 2.408 ± 0.269
1.852LysSer: 1.852 ± 0.252
2.752LysThr: 2.752 ± 0.272
1.535LysVal: 1.535 ± 0.185
0.185LysTrp: 0.185 ± 0.085
1.059LysTyr: 1.059 ± 0.184
0.0LysXaa: 0.0 ± 0.0
Leu
9.13LeuAla: 9.13 ± 0.544
2.858LeuCys: 2.858 ± 0.363
4.499LeuAsp: 4.499 ± 0.356
5.69LeuGlu: 5.69 ± 0.487
4.314LeuPhe: 4.314 ± 0.472
6.881LeuGly: 6.881 ± 0.507
3.334LeuHis: 3.334 ± 0.299
3.546LeuIle: 3.546 ± 0.304
3.864LeuLys: 3.864 ± 0.379
10.797LeuLeu: 10.797 ± 0.658
1.773LeuMet: 1.773 ± 0.228
3.44LeuAsn: 3.44 ± 0.298
6.642LeuPro: 6.642 ± 0.458
4.128LeuGln: 4.128 ± 0.297
6.007LeuArg: 6.007 ± 0.385
7.675LeuSer: 7.675 ± 0.532
6.96LeuThr: 6.96 ± 0.54
6.854LeuVal: 6.854 ± 0.53
1.032LeuTrp: 1.032 ± 0.196
3.096LeuTyr: 3.096 ± 0.297
0.0LeuXaa: 0.0 ± 0.0
Met
2.064MetAla: 2.064 ± 0.24
0.476MetCys: 0.476 ± 0.119
1.217MetAsp: 1.217 ± 0.174
1.35MetGlu: 1.35 ± 0.224
1.217MetPhe: 1.217 ± 0.21
1.27MetGly: 1.27 ± 0.225
0.503MetHis: 0.503 ± 0.108
0.662MetIle: 0.662 ± 0.131
0.397MetLys: 0.397 ± 0.099
2.223MetLeu: 2.223 ± 0.24
0.476MetMet: 0.476 ± 0.104
0.45MetAsn: 0.45 ± 0.127
0.953MetPro: 0.953 ± 0.171
0.635MetGln: 0.635 ± 0.141
1.138MetArg: 1.138 ± 0.152
1.244MetSer: 1.244 ± 0.178
1.111MetThr: 1.111 ± 0.175
1.27MetVal: 1.27 ± 0.172
0.423MetTrp: 0.423 ± 0.104
0.715MetTyr: 0.715 ± 0.131
0.0MetXaa: 0.0 ± 0.0
Asn
2.779AsnAla: 2.779 ± 0.241
0.609AsnCys: 0.609 ± 0.129
1.006AsnAsp: 1.006 ± 0.155
1.35AsnGlu: 1.35 ± 0.238
1.429AsnPhe: 1.429 ± 0.264
1.72AsnGly: 1.72 ± 0.202
0.688AsnHis: 0.688 ± 0.145
1.905AsnIle: 1.905 ± 0.299
1.244AsnLys: 1.244 ± 0.209
3.758AsnLeu: 3.758 ± 0.362
0.476AsnMet: 0.476 ± 0.123
1.006AsnAsn: 1.006 ± 0.22
2.276AsnPro: 2.276 ± 0.213
1.059AsnGln: 1.059 ± 0.16
1.747AsnArg: 1.747 ± 0.199
2.302AsnSer: 2.302 ± 0.333
2.382AsnThr: 2.382 ± 0.282
2.964AsnVal: 2.964 ± 0.28
0.344AsnTrp: 0.344 ± 0.084
1.032AsnTyr: 1.032 ± 0.135
0.0AsnXaa: 0.0 ± 0.0
Pro
5.637ProAla: 5.637 ± 0.47
1.614ProCys: 1.614 ± 0.241
3.07ProAsp: 3.07 ± 0.35
8.574ProGlu: 8.574 ± 4.285
2.038ProPhe: 2.038 ± 0.267
5.161ProGly: 5.161 ± 0.38
1.667ProHis: 1.667 ± 0.204
1.985ProIle: 1.985 ± 0.239
2.17ProLys: 2.17 ± 0.305
6.034ProLeu: 6.034 ± 0.405
1.217ProMet: 1.217 ± 0.208
1.667ProAsn: 1.667 ± 0.22
7.066ProPro: 7.066 ± 1.018
2.382ProGln: 2.382 ± 0.224
3.784ProArg: 3.784 ± 0.384
5.24ProSer: 5.24 ± 0.491
5.478ProThr: 5.478 ± 0.61
6.06ProVal: 6.06 ± 0.455
0.847ProTrp: 0.847 ± 0.166
1.297ProTyr: 1.297 ± 0.163
0.0ProXaa: 0.0 ± 0.0
Gln
3.679GlnAla: 3.679 ± 0.401
0.9GlnCys: 0.9 ± 0.175
1.72GlnAsp: 1.72 ± 0.216
2.144GlnGlu: 2.144 ± 0.215
1.376GlnPhe: 1.376 ± 0.205
2.17GlnGly: 2.17 ± 0.231
1.111GlnHis: 1.111 ± 0.213
1.667GlnIle: 1.667 ± 0.205
1.561GlnLys: 1.561 ± 0.228
3.573GlnLeu: 3.573 ± 0.349
0.635GlnMet: 0.635 ± 0.134
1.614GlnAsn: 1.614 ± 0.209
2.514GlnPro: 2.514 ± 0.316
3.017GlnGln: 3.017 ± 1.759
2.302GlnArg: 2.302 ± 0.262
2.673GlnSer: 2.673 ± 0.269
2.805GlnThr: 2.805 ± 0.343
1.747GlnVal: 1.747 ± 0.171
0.476GlnTrp: 0.476 ± 0.1
0.979GlnTyr: 0.979 ± 0.168
0.0GlnXaa: 0.0 ± 0.0
Arg
5.319ArgAla: 5.319 ± 0.448
1.164ArgCys: 1.164 ± 0.148
3.07ArgAsp: 3.07 ± 0.345
4.128ArgGlu: 4.128 ± 0.372
2.091ArgPhe: 2.091 ± 0.184
4.843ArgGly: 4.843 ± 0.472
1.852ArgHis: 1.852 ± 0.194
3.096ArgIle: 3.096 ± 0.339
2.091ArgLys: 2.091 ± 0.25
6.378ArgLeu: 6.378 ± 0.443
1.244ArgMet: 1.244 ± 0.187
1.535ArgAsn: 1.535 ± 0.204
3.599ArgPro: 3.599 ± 0.419
2.541ArgGln: 2.541 ± 0.245
5.875ArgArg: 5.875 ± 0.491
3.917ArgSer: 3.917 ± 0.389
3.229ArgThr: 3.229 ± 0.32
3.97ArgVal: 3.97 ± 0.372
0.82ArgTrp: 0.82 ± 0.146
1.482ArgTyr: 1.482 ± 0.189
0.0ArgXaa: 0.0 ± 0.0
Ser
6.801SerAla: 6.801 ± 0.474
1.508SerCys: 1.508 ± 0.186
3.149SerAsp: 3.149 ± 0.289
3.837SerGlu: 3.837 ± 0.408
2.329SerPhe: 2.329 ± 0.236
4.711SerGly: 4.711 ± 0.489
2.011SerHis: 2.011 ± 0.343
2.805SerIle: 2.805 ± 0.335
2.276SerLys: 2.276 ± 0.279
7.542SerLeu: 7.542 ± 0.386
1.879SerMet: 1.879 ± 0.239
2.435SerAsn: 2.435 ± 0.306
5.716SerPro: 5.716 ± 0.627
2.329SerGln: 2.329 ± 0.327
4.155SerArg: 4.155 ± 0.384
5.902SerSer: 5.902 ± 0.753
4.658SerThr: 4.658 ± 0.584
4.499SerVal: 4.499 ± 0.434
1.006SerTrp: 1.006 ± 0.191
2.091SerTyr: 2.091 ± 0.19
0.0SerXaa: 0.0 ± 0.0
Thr
5.557ThrAla: 5.557 ± 0.392
1.376ThrCys: 1.376 ± 0.227
2.964ThrAsp: 2.964 ± 0.307
3.467ThrGlu: 3.467 ± 0.373
2.435ThrPhe: 2.435 ± 0.291
4.023ThrGly: 4.023 ± 0.32
1.932ThrHis: 1.932 ± 0.276
2.699ThrIle: 2.699 ± 0.34
1.72ThrLys: 1.72 ± 0.295
6.537ThrLeu: 6.537 ± 0.458
1.27ThrMet: 1.27 ± 0.145
2.038ThrAsn: 2.038 ± 0.323
5.24ThrPro: 5.24 ± 0.54
2.805ThrGln: 2.805 ± 0.473
4.155ThrArg: 4.155 ± 0.494
5.902ThrSer: 5.902 ± 0.667
5.875ThrThr: 5.875 ± 1.136
5.346ThrVal: 5.346 ± 0.39
1.27ThrTrp: 1.27 ± 0.162
2.223ThrTyr: 2.223 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
5.875ValAla: 5.875 ± 0.452
2.144ValCys: 2.144 ± 0.212
3.123ValAsp: 3.123 ± 0.287
3.282ValGlu: 3.282 ± 0.278
3.282ValPhe: 3.282 ± 0.356
4.049ValGly: 4.049 ± 0.305
1.561ValHis: 1.561 ± 0.219
3.361ValIle: 3.361 ± 0.263
2.461ValLys: 2.461 ± 0.263
6.484ValLeu: 6.484 ± 0.463
1.217ValMet: 1.217 ± 0.174
2.011ValAsn: 2.011 ± 0.25
5.266ValPro: 5.266 ± 0.417
2.064ValGln: 2.064 ± 0.224
4.023ValArg: 4.023 ± 0.399
5.61ValSer: 5.61 ± 0.5
5.002ValThr: 5.002 ± 0.344
5.028ValVal: 5.028 ± 0.46
0.741ValTrp: 0.741 ± 0.182
2.752ValTyr: 2.752 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
0.873TrpAla: 0.873 ± 0.18
0.344TrpCys: 0.344 ± 0.095
0.529TrpAsp: 0.529 ± 0.102
0.688TrpGlu: 0.688 ± 0.126
0.556TrpPhe: 0.556 ± 0.125
0.37TrpGly: 0.37 ± 0.075
0.397TrpHis: 0.397 ± 0.111
0.556TrpIle: 0.556 ± 0.118
0.476TrpLys: 0.476 ± 0.124
1.376TrpLeu: 1.376 ± 0.193
0.265TrpMet: 0.265 ± 0.084
0.503TrpAsn: 0.503 ± 0.13
0.767TrpPro: 0.767 ± 0.132
0.741TrpGln: 0.741 ± 0.119
0.794TrpArg: 0.794 ± 0.177
0.82TrpSer: 0.82 ± 0.167
1.006TrpThr: 1.006 ± 0.142
0.794TrpVal: 0.794 ± 0.136
0.132TrpTrp: 0.132 ± 0.071
0.397TrpTyr: 0.397 ± 0.085
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.223TyrAla: 2.223 ± 0.289
0.873TyrCys: 0.873 ± 0.163
1.35TyrAsp: 1.35 ± 0.202
1.561TyrGlu: 1.561 ± 0.195
1.588TyrPhe: 1.588 ± 0.211
1.482TyrGly: 1.482 ± 0.186
0.767TyrHis: 0.767 ± 0.13
1.826TyrIle: 1.826 ± 0.235
1.085TyrLys: 1.085 ± 0.192
3.493TyrLeu: 3.493 ± 0.319
0.688TyrMet: 0.688 ± 0.119
1.244TyrAsn: 1.244 ± 0.209
1.323TyrPro: 1.323 ± 0.173
1.138TyrGln: 1.138 ± 0.148
1.614TyrArg: 1.614 ± 0.257
2.117TyrSer: 2.117 ± 0.261
1.72TyrThr: 1.72 ± 0.199
2.488TyrVal: 2.488 ± 0.288
0.582TyrTrp: 0.582 ± 0.137
1.111TyrTyr: 1.111 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (37788 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski