Amino acid dipeptide frequency for Fruit bat alphaherpesvirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
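As an illustration of the idea, the following minimal sketch counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. It assumes one-letter amino acid codes (the table on this page uses three-letter codes), that pairs never span two proteins, and that counts are normalized by the total number of dipeptides; the function name `dipeptide_permille` is hypothetical, and the exact normalization used by the database may differ.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein separately,
        # so no pair is formed across a protein boundary.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): MA, AA, AL + AA, AL = 5 dipeptides.
freqs = dipeptide_permille(["MAAL", "AAL"])
```

For the toy input, `freqs["AA"]` is two occurrences out of five dipeptides, expressed in per mille.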

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
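The expectation described above can be worked out in a few lines. This is only a sketch: the function name `classify` is hypothetical, and treating any value strictly above or below the uniform expectation as over- or underrepresented is an assumption about how the colouring is applied. The two example values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids, Xaa not counted

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Return the colour a dipeptide would get under a simple threshold rule."""
    if observed_permille > expected:
        return "red"       # more frequent than expected
    if observed_permille < expected:
        return "blue"      # less frequent than expected
    return "unmarked"      # exactly at the expectation


ala_ala = classify(14.704)  # AlaAla value from the table
trp_trp = classify(0.083)   # TrpTrp value from the table
```

With the table values, AlaAla (14.704‰) falls above the 2.5‰ expectation and TrpTrp (0.083‰) falls below it.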

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.704 ± 1.304
AlaCys: 2.349 ± 0.234
AlaAsp: 5.086 ± 0.376
AlaGlu: 4.754 ± 0.642
AlaPhe: 4.063 ± 0.367
AlaGly: 6.274 ± 0.424
AlaHis: 3.04 ± 0.326
AlaIle: 3.98 ± 0.293
AlaLys: 2.294 ± 0.278
AlaLeu: 12.769 ± 0.897
AlaMet: 1.714 ± 0.198
AlaAsn: 2.874 ± 0.317
AlaPro: 10.061 ± 0.911
AlaGln: 4.091 ± 0.388
AlaArg: 7.877 ± 0.766
AlaSer: 9.397 ± 0.408
AlaThr: 6.744 ± 0.497
AlaVal: 7.158 ± 0.44
AlaTrp: 1.41 ± 0.203
AlaTyr: 2.543 ± 0.253
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.769 ± 0.237
CysCys: 0.47 ± 0.137
CysAsp: 1.078 ± 0.168
CysGlu: 1.023 ± 0.16
CysPhe: 0.774 ± 0.143
CysGly: 1.05 ± 0.153
CysHis: 0.58 ± 0.132
CysIle: 0.663 ± 0.137
CysLys: 0.387 ± 0.077
CysLeu: 2.073 ± 0.23
CysMet: 0.359 ± 0.102
CysAsn: 0.525 ± 0.104
CysPro: 1.52 ± 0.242
CysGln: 0.58 ± 0.131
CysArg: 1.437 ± 0.237
CysSer: 1.188 ± 0.179
CysThr: 0.857 ± 0.144
CysVal: 1.631 ± 0.232
CysTrp: 0.138 ± 0.063
CysTyr: 0.415 ± 0.093
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.555 ± 0.462
AspCys: 0.774 ± 0.129
AspAsp: 2.819 ± 0.257
AspGlu: 3.013 ± 0.283
AspPhe: 1.741 ± 0.271
AspGly: 2.598 ± 0.271
AspHis: 0.857 ± 0.167
AspIle: 2.239 ± 0.299
AspLys: 0.691 ± 0.139
AspLeu: 4.782 ± 0.429
AspMet: 0.719 ± 0.129
AspAsn: 1.161 ± 0.173
AspPro: 3.289 ± 0.375
AspGln: 1.271 ± 0.243
AspArg: 2.792 ± 0.232
AspSer: 3.621 ± 0.316
AspThr: 2.847 ± 0.282
AspVal: 3.952 ± 0.293
AspTrp: 0.94 ± 0.172
AspTyr: 1.133 ± 0.165
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.048 ± 0.484
GluCys: 0.884 ± 0.177
GluAsp: 3.234 ± 0.27
GluGlu: 4.035 ± 0.402
GluPhe: 2.045 ± 0.268
GluGly: 2.653 ± 0.297
GluHis: 1.52 ± 0.166
GluIle: 2.018 ± 0.269
GluLys: 1.078 ± 0.139
GluLeu: 5.555 ± 0.555
GluMet: 1.216 ± 0.186
GluAsn: 1.078 ± 0.169
GluPro: 3.013 ± 0.413
GluGln: 2.018 ± 0.182
GluArg: 3.759 ± 0.336
GluSer: 4.173 ± 0.53
GluThr: 3.372 ± 0.395
GluVal: 3.234 ± 0.306
GluTrp: 0.525 ± 0.14
GluTyr: 1.382 ± 0.176
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.206 ± 0.292
PheCys: 0.912 ± 0.169
PheAsp: 2.211 ± 0.269
PheGlu: 2.322 ± 0.255
PhePhe: 2.266 ± 0.312
PheGly: 2.543 ± 0.305
PheHis: 1.05 ± 0.144
PheIle: 1.741 ± 0.234
PheLys: 0.967 ± 0.177
PheLeu: 4.395 ± 0.48
PheMet: 0.884 ± 0.151
PheAsn: 1.244 ± 0.21
PhePro: 2.349 ± 0.241
PheGln: 1.271 ± 0.191
PheArg: 2.128 ± 0.273
PheSer: 3.151 ± 0.288
PheThr: 1.99 ± 0.253
PheVal: 3.123 ± 0.338
PheTrp: 0.691 ± 0.141
PheTyr: 1.133 ± 0.2
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.048 ± 0.495
GlyCys: 0.94 ± 0.175
GlyAsp: 3.123 ± 0.299
GlyGlu: 3.98 ± 0.266
GlyPhe: 2.045 ± 0.272
GlyGly: 4.588 ± 0.338
GlyHis: 1.216 ± 0.143
GlyIle: 1.741 ± 0.241
GlyLys: 1.382 ± 0.198
GlyLeu: 6.302 ± 0.55
GlyMet: 0.995 ± 0.151
GlyAsn: 1.354 ± 0.172
GlyPro: 4.284 ± 0.387
GlyGln: 2.349 ± 0.226
GlyArg: 4.726 ± 0.471
GlySer: 3.759 ± 0.31
GlyThr: 2.57 ± 0.278
GlyVal: 3.869 ± 0.302
GlyTrp: 0.746 ± 0.139
GlyTyr: 1.437 ± 0.205
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.985 ± 0.251
HisCys: 0.304 ± 0.097
HisAsp: 0.857 ± 0.135
HisGlu: 1.106 ± 0.145
HisPhe: 1.05 ± 0.22
HisGly: 1.327 ± 0.221
HisHis: 0.857 ± 0.174
HisIle: 0.995 ± 0.178
HisLys: 0.802 ± 0.17
HisLeu: 2.902 ± 0.288
HisMet: 0.525 ± 0.107
HisAsn: 0.857 ± 0.159
HisPro: 2.322 ± 0.288
HisGln: 1.078 ± 0.14
HisArg: 2.266 ± 0.205
HisSer: 1.548 ± 0.203
HisThr: 2.073 ± 0.223
HisVal: 1.52 ± 0.215
HisTrp: 0.249 ± 0.065
HisTyr: 0.912 ± 0.185
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.51 ± 0.283
IleCys: 0.774 ± 0.167
IleAsp: 1.797 ± 0.199
IleGlu: 1.99 ± 0.2
IlePhe: 1.354 ± 0.25
IleGly: 1.492 ± 0.21
IleHis: 1.244 ± 0.182
IleIle: 1.548 ± 0.213
IleLys: 1.437 ± 0.257
IleLeu: 3.897 ± 0.374
IleMet: 0.608 ± 0.127
IleAsn: 1.492 ± 0.189
IlePro: 2.211 ± 0.255
IleGln: 1.382 ± 0.22
IleArg: 1.879 ± 0.275
IleSer: 2.874 ± 0.342
IleThr: 3.178 ± 0.324
IleVal: 2.045 ± 0.272
IleTrp: 0.332 ± 0.092
IleTyr: 0.912 ± 0.174
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.99 ± 0.229
LysCys: 0.442 ± 0.114
LysAsp: 1.05 ± 0.192
LysGlu: 0.663 ± 0.15
LysPhe: 1.023 ± 0.159
LysGly: 0.94 ± 0.133
LysHis: 0.802 ± 0.18
LysIle: 1.133 ± 0.168
LysLys: 0.912 ± 0.215
LysLeu: 2.377 ± 0.278
LysMet: 0.608 ± 0.142
LysAsn: 0.58 ± 0.129
LysPro: 1.852 ± 0.306
LysGln: 1.078 ± 0.125
LysArg: 2.543 ± 0.234
LysSer: 1.575 ± 0.176
LysThr: 1.714 ± 0.18
LysVal: 1.271 ± 0.173
LysTrp: 0.221 ± 0.074
LysTyr: 0.829 ± 0.165
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.774 ± 0.893
LeuCys: 2.405 ± 0.295
LeuAsp: 4.782 ± 0.423
LeuGlu: 5.749 ± 0.523
LeuPhe: 4.809 ± 0.385
LeuGly: 6.91 ± 0.492
LeuHis: 2.515 ± 0.344
LeuIle: 4.229 ± 0.397
LeuLys: 2.349 ± 0.256
LeuLeu: 11.111 ± 0.589
LeuMet: 2.073 ± 0.287
LeuAsn: 2.322 ± 0.378
LeuPro: 7.076 ± 0.454
LeuGln: 4.091 ± 0.336
LeuArg: 8.264 ± 0.485
LeuSer: 8.264 ± 0.417
LeuThr: 6.136 ± 0.407
LeuVal: 6.606 ± 0.41
LeuTrp: 1.52 ± 0.272
LeuTyr: 2.626 ± 0.255
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.487 ± 0.285
MetCys: 0.276 ± 0.071
MetAsp: 1.05 ± 0.16
MetGlu: 0.663 ± 0.126
MetPhe: 0.774 ± 0.146
MetGly: 1.41 ± 0.188
MetHis: 0.442 ± 0.11
MetIle: 0.387 ± 0.107
MetLys: 0.387 ± 0.138
MetLeu: 1.714 ± 0.262
MetMet: 0.497 ± 0.119
MetAsn: 0.415 ± 0.102
MetPro: 1.023 ± 0.159
MetGln: 0.663 ± 0.157
MetArg: 1.41 ± 0.179
MetSer: 1.244 ± 0.223
MetThr: 0.663 ± 0.129
MetVal: 1.161 ± 0.164
MetTrp: 0.304 ± 0.106
MetTyr: 0.58 ± 0.116
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.792 ± 0.308
AsnCys: 0.47 ± 0.113
AsnAsp: 0.857 ± 0.17
AsnGlu: 1.05 ± 0.19
AsnPhe: 1.52 ± 0.223
AsnGly: 1.216 ± 0.227
AsnHis: 0.746 ± 0.126
AsnIle: 1.05 ± 0.19
AsnLys: 0.415 ± 0.112
AsnLeu: 2.764 ± 0.336
AsnMet: 0.442 ± 0.11
AsnAsn: 1.188 ± 0.205
AsnPro: 2.405 ± 0.313
AsnGln: 1.299 ± 0.217
AsnArg: 1.244 ± 0.195
AsnSer: 2.101 ± 0.318
AsnThr: 2.432 ± 0.279
AsnVal: 1.575 ± 0.211
AsnTrp: 0.193 ± 0.073
AsnTyr: 0.829 ± 0.152
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.872 ± 0.966
ProCys: 1.188 ± 0.171
ProAsp: 3.455 ± 0.296
ProGlu: 4.367 ± 0.408
ProPhe: 2.156 ± 0.208
ProGly: 4.201 ± 0.47
ProHis: 2.266 ± 0.256
ProIle: 2.432 ± 0.303
ProLys: 2.322 ± 0.262
ProLeu: 7.601 ± 0.395
ProMet: 1.05 ± 0.192
ProAsn: 2.128 ± 0.198
ProPro: 10.751 ± 1.014
ProGln: 3.068 ± 0.297
ProArg: 5.196 ± 0.594
ProSer: 7.048 ± 0.495
ProThr: 5.694 ± 0.523
ProVal: 4.367 ± 0.496
ProTrp: 0.663 ± 0.122
ProTyr: 1.354 ± 0.174
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.45 ± 0.353
GlnCys: 0.94 ± 0.164
GlnAsp: 1.797 ± 0.248
GlnGlu: 1.741 ± 0.245
GlnPhe: 1.658 ± 0.183
GlnGly: 1.714 ± 0.241
GlnHis: 0.995 ± 0.178
GlnIle: 1.492 ± 0.227
GlnLys: 0.857 ± 0.199
GlnLeu: 4.726 ± 0.425
GlnMet: 0.829 ± 0.116
GlnAsn: 0.802 ± 0.155
GlnPro: 3.096 ± 0.28
GlnGln: 1.548 ± 0.156
GlnArg: 3.317 ± 0.237
GlnSer: 2.266 ± 0.251
GlnThr: 2.681 ± 0.247
GlnVal: 2.018 ± 0.243
GlnTrp: 0.304 ± 0.066
GlnTyr: 1.161 ± 0.159
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.596 ± 0.658
ArgCys: 1.188 ± 0.189
ArgAsp: 2.847 ± 0.313
ArgGlu: 4.146 ± 0.377
ArgPhe: 2.957 ± 0.342
ArgGly: 5.058 ± 0.357
ArgHis: 2.045 ± 0.252
ArgIle: 1.741 ± 0.217
ArgLys: 1.437 ± 0.201
ArgLeu: 7.711 ± 0.545
ArgMet: 0.967 ± 0.167
ArgAsn: 1.797 ± 0.214
ArgPro: 5.417 ± 0.475
ArgGln: 2.985 ± 0.274
ArgArg: 7.462 ± 0.719
ArgSer: 5.113 ± 0.502
ArgThr: 3.317 ± 0.304
ArgVal: 5.086 ± 0.458
ArgTrp: 0.857 ± 0.169
ArgTyr: 1.852 ± 0.237
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.043 ± 0.518
SerCys: 1.133 ± 0.171
SerAsp: 3.455 ± 0.29
SerGlu: 5.168 ± 0.581
SerPhe: 2.736 ± 0.274
SerGly: 4.809 ± 0.342
SerHis: 2.211 ± 0.243
SerIle: 2.239 ± 0.319
SerLys: 2.073 ± 0.252
SerLeu: 8.043 ± 0.572
SerMet: 1.299 ± 0.179
SerAsn: 1.548 ± 0.324
SerPro: 6.661 ± 0.671
SerGln: 3.178 ± 0.286
SerArg: 5.003 ± 0.483
SerSer: 7.545 ± 0.673
SerThr: 5.555 ± 0.485
SerVal: 4.643 ± 0.37
SerTrp: 0.829 ± 0.113
SerTyr: 1.106 ± 0.16
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.02 ± 0.444
ThrCys: 0.967 ± 0.167
ThrAsp: 2.377 ± 0.308
ThrGlu: 3.261 ± 0.297
ThrPhe: 1.962 ± 0.253
ThrGly: 3.317 ± 0.305
ThrHis: 1.852 ± 0.199
ThrIle: 2.626 ± 0.243
ThrLys: 1.465 ± 0.156
ThrLeu: 6.136 ± 0.404
ThrMet: 0.691 ± 0.137
ThrAsn: 2.128 ± 0.263
ThrPro: 6.578 ± 0.632
ThrGln: 2.598 ± 0.274
ThrArg: 4.477 ± 0.31
ThrSer: 5.39 ± 0.526
ThrThr: 5.39 ± 0.549
ThrVal: 3.013 ± 0.281
ThrTrp: 0.719 ± 0.123
ThrTyr: 1.658 ± 0.241
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.462 ± 0.543
ValCys: 1.382 ± 0.179
ValAsp: 2.957 ± 0.295
ValGlu: 3.151 ± 0.329
ValPhe: 3.123 ± 0.328
ValGly: 3.51 ± 0.31
ValHis: 1.299 ± 0.185
ValIle: 2.266 ± 0.243
ValLys: 1.52 ± 0.246
ValLeu: 6.882 ± 0.436
ValMet: 1.299 ± 0.164
ValAsn: 2.018 ± 0.26
ValPro: 4.201 ± 0.367
ValGln: 2.266 ± 0.192
ValArg: 4.367 ± 0.404
ValSer: 4.533 ± 0.327
ValThr: 3.759 ± 0.339
ValVal: 4.975 ± 0.375
ValTrp: 0.94 ± 0.175
ValTyr: 2.294 ± 0.285
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.271 ± 0.179
TrpCys: 0.193 ± 0.067
TrpAsp: 0.608 ± 0.105
TrpGlu: 0.663 ± 0.155
TrpPhe: 0.47 ± 0.126
TrpGly: 1.299 ± 0.216
TrpHis: 0.387 ± 0.121
TrpIle: 0.359 ± 0.091
TrpLys: 0.276 ± 0.08
TrpLeu: 1.327 ± 0.211
TrpMet: 0.304 ± 0.101
TrpAsn: 0.304 ± 0.076
TrpPro: 0.58 ± 0.136
TrpGln: 0.47 ± 0.1
TrpArg: 1.078 ± 0.178
TrpSer: 0.553 ± 0.135
TrpThr: 0.884 ± 0.166
TrpVal: 0.719 ± 0.137
TrpTrp: 0.083 ± 0.057
TrpTyr: 0.221 ± 0.095
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.626 ± 0.287
TyrCys: 0.525 ± 0.135
TyrAsp: 1.354 ± 0.197
TyrGlu: 0.995 ± 0.185
TyrPhe: 1.05 ± 0.204
TyrGly: 1.548 ± 0.215
TyrHis: 0.608 ± 0.111
TyrIle: 1.078 ± 0.211
TyrLys: 0.553 ± 0.134
TyrLeu: 2.543 ± 0.241
TyrMet: 0.442 ± 0.133
TyrAsn: 0.774 ± 0.134
TyrPro: 1.492 ± 0.179
TyrGln: 1.106 ± 0.155
TyrArg: 1.354 ± 0.164
TyrSer: 1.935 ± 0.276
TyrThr: 1.769 ± 0.258
TyrVal: 2.239 ± 0.219
TyrTrp: 0.387 ± 0.092
TyrTyr: 1.106 ± 0.152
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (36182 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
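A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative assumption about the procedure, not the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of the resampled values is taken as the error. The names `pair_permille` and `bootstrap_error`, the fixed seed, and the toy sequences (one-letter codes) are all hypothetical.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


toy_proteome = ["MAAL", "AALK", "GGAA"]  # toy data, not the real proteome
error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are included.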

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski