Amino acid dipepetide frequency for Human herpesvirus 7 (strain JI) (HHV-7) (Human T lymphotropic virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.553AlaAla: 2.553 ± 0.366
1.026AlaCys: 1.026 ± 0.141
1.98AlaAsp: 1.98 ± 0.187
2.743AlaGlu: 2.743 ± 0.23
2.6AlaPhe: 2.6 ± 0.28
1.551AlaGly: 1.551 ± 0.295
1.217AlaHis: 1.217 ± 0.173
3.054AlaIle: 3.054 ± 0.342
2.767AlaLys: 2.767 ± 0.274
4.843AlaLeu: 4.843 ± 0.374
0.93AlaMet: 0.93 ± 0.138
1.932AlaAsn: 1.932 ± 0.211
1.646AlaPro: 1.646 ± 0.258
1.503AlaGln: 1.503 ± 0.172
1.908AlaArg: 1.908 ± 0.305
3.459AlaSer: 3.459 ± 0.238
2.505AlaThr: 2.505 ± 0.279
2.958AlaVal: 2.958 ± 0.366
0.382AlaTrp: 0.382 ± 0.099
1.431AlaTyr: 1.431 ± 0.195
0.0AlaXaa: 0.0 ± 0.0
Cys
1.002CysAla: 1.002 ± 0.192
0.525CysCys: 0.525 ± 0.111
1.169CysAsp: 1.169 ± 0.187
1.455CysGlu: 1.455 ± 0.199
1.121CysPhe: 1.121 ± 0.191
1.097CysGly: 1.097 ± 0.176
0.787CysHis: 0.787 ± 0.147
1.527CysIle: 1.527 ± 0.226
1.956CysLys: 1.956 ± 0.326
3.149CysLeu: 3.149 ± 0.361
0.668CysMet: 0.668 ± 0.124
1.407CysAsn: 1.407 ± 0.217
0.811CysPro: 0.811 ± 0.147
0.787CysGln: 0.787 ± 0.135
1.145CysArg: 1.145 ± 0.2
2.29CysSer: 2.29 ± 0.254
1.36CysThr: 1.36 ± 0.194
1.479CysVal: 1.479 ± 0.223
0.095CysTrp: 0.095 ± 0.044
0.954CysTyr: 0.954 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
1.861AspAla: 1.861 ± 0.215
1.121AspCys: 1.121 ± 0.154
2.314AspAsp: 2.314 ± 0.306
2.743AspGlu: 2.743 ± 0.236
4.032AspPhe: 4.032 ± 0.304
1.407AspGly: 1.407 ± 0.206
0.787AspHis: 0.787 ± 0.14
3.817AspIle: 3.817 ± 0.243
2.529AspLys: 2.529 ± 0.29
5.773AspLeu: 5.773 ± 0.393
1.002AspMet: 1.002 ± 0.187
2.6AspAsn: 2.6 ± 0.236
1.98AspPro: 1.98 ± 0.194
1.431AspGln: 1.431 ± 0.196
1.503AspArg: 1.503 ± 0.227
3.912AspSer: 3.912 ± 0.349
2.72AspThr: 2.72 ± 0.307
3.483AspVal: 3.483 ± 0.309
0.286AspTrp: 0.286 ± 0.068
1.694AspTyr: 1.694 ± 0.206
0.0AspXaa: 0.0 ± 0.0
Glu
2.171GluAla: 2.171 ± 0.247
1.24GluCys: 1.24 ± 0.19
2.672GluAsp: 2.672 ± 0.295
3.65GluGlu: 3.65 ± 0.382
2.863GluPhe: 2.863 ± 0.255
1.622GluGly: 1.622 ± 0.195
1.05GluHis: 1.05 ± 0.17
5.153GluIle: 5.153 ± 0.389
5.415GluLys: 5.415 ± 0.432
5.558GluLeu: 5.558 ± 0.352
1.646GluMet: 1.646 ± 0.201
5.201GluAsn: 5.201 ± 0.4
1.574GluPro: 1.574 ± 0.228
2.672GluGln: 2.672 ± 0.268
2.457GluArg: 2.457 ± 0.326
3.912GluSer: 3.912 ± 0.362
4.437GluThr: 4.437 ± 0.343
2.123GluVal: 2.123 ± 0.219
0.358GluTrp: 0.358 ± 0.086
2.219GluTyr: 2.219 ± 0.218
0.0GluXaa: 0.0 ± 0.0
Phe
2.362PheAla: 2.362 ± 0.202
1.551PheCys: 1.551 ± 0.206
2.553PheAsp: 2.553 ± 0.231
3.125PheGlu: 3.125 ± 0.297
4.628PhePhe: 4.628 ± 0.397
2.362PheGly: 2.362 ± 0.34
1.407PheHis: 1.407 ± 0.19
4.485PheIle: 4.485 ± 0.368
4.175PheLys: 4.175 ± 0.366
7.658PheLeu: 7.658 ± 0.533
1.551PheMet: 1.551 ± 0.315
3.626PheAsn: 3.626 ± 0.272
3.149PhePro: 3.149 ± 0.287
1.741PheGln: 1.741 ± 0.213
2.386PheArg: 2.386 ± 0.36
6.107PheSer: 6.107 ± 0.524
3.101PheThr: 3.101 ± 0.242
3.316PheVal: 3.316 ± 0.288
0.501PheTrp: 0.501 ± 0.119
2.553PheTyr: 2.553 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
1.622GlyAla: 1.622 ± 0.264
0.692GlyCys: 0.692 ± 0.165
1.407GlyAsp: 1.407 ± 0.177
1.861GlyGlu: 1.861 ± 0.191
2.624GlyPhe: 2.624 ± 0.347
1.026GlyGly: 1.026 ± 0.166
0.787GlyHis: 0.787 ± 0.162
2.553GlyIle: 2.553 ± 0.246
3.22GlyLys: 3.22 ± 0.249
3.841GlyLeu: 3.841 ± 0.387
0.716GlyMet: 0.716 ± 0.147
2.791GlyAsn: 2.791 ± 0.275
0.859GlyPro: 0.859 ± 0.161
1.026GlyGln: 1.026 ± 0.126
1.384GlyArg: 1.384 ± 0.262
2.791GlySer: 2.791 ± 0.349
2.386GlyThr: 2.386 ± 0.329
1.813GlyVal: 1.813 ± 0.25
0.429GlyTrp: 0.429 ± 0.106
1.336GlyTyr: 1.336 ± 0.186
0.0GlyXaa: 0.0 ± 0.0
His
1.145HisAla: 1.145 ± 0.161
0.716HisCys: 0.716 ± 0.133
1.145HisAsp: 1.145 ± 0.181
1.264HisGlu: 1.264 ± 0.183
1.837HisPhe: 1.837 ± 0.264
1.121HisGly: 1.121 ± 0.159
0.811HisHis: 0.811 ± 0.129
1.718HisIle: 1.718 ± 0.215
1.145HisLys: 1.145 ± 0.171
2.743HisLeu: 2.743 ± 0.268
0.525HisMet: 0.525 ± 0.115
1.67HisAsn: 1.67 ± 0.318
1.24HisPro: 1.24 ± 0.321
0.811HisGln: 0.811 ± 0.14
1.527HisArg: 1.527 ± 0.227
1.908HisSer: 1.908 ± 0.198
1.312HisThr: 1.312 ± 0.185
1.622HisVal: 1.622 ± 0.25
0.143HisTrp: 0.143 ± 0.05
1.026HisTyr: 1.026 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
2.934IleAla: 2.934 ± 0.235
2.028IleCys: 2.028 ± 0.27
3.197IleAsp: 3.197 ± 0.271
4.366IleGlu: 4.366 ± 0.414
4.628IlePhe: 4.628 ± 0.433
2.553IleGly: 2.553 ± 0.328
1.861IleHis: 1.861 ± 0.196
5.678IleIle: 5.678 ± 0.464
5.01IleLys: 5.01 ± 0.35
8.994IleLeu: 8.994 ± 0.523
1.217IleMet: 1.217 ± 0.186
4.652IleAsn: 4.652 ± 0.391
3.531IlePro: 3.531 ± 0.326
3.387IleGln: 3.387 ± 0.329
2.505IleArg: 2.505 ± 0.228
6.513IleSer: 6.513 ± 0.374
4.342IleThr: 4.342 ± 0.337
3.817IleVal: 3.817 ± 0.389
0.644IleTrp: 0.644 ± 0.128
3.674IleTyr: 3.674 ± 0.242
0.0IleXaa: 0.0 ± 0.0
Lys
2.266LysAla: 2.266 ± 0.268
1.431LysCys: 1.431 ± 0.218
3.292LysAsp: 3.292 ± 0.284
4.437LysGlu: 4.437 ± 0.327
3.435LysPhe: 3.435 ± 0.27
1.765LysGly: 1.765 ± 0.214
1.932LysHis: 1.932 ± 0.273
6.059LysIle: 6.059 ± 0.38
6.703LysLys: 6.703 ± 0.54
6.942LysLeu: 6.942 ± 0.405
1.741LysMet: 1.741 ± 0.216
6.083LysAsn: 6.083 ± 0.45
2.362LysPro: 2.362 ± 0.354
3.626LysGln: 3.626 ± 0.36
3.411LysArg: 3.411 ± 0.25
4.366LysSer: 4.366 ± 0.388
6.298LysThr: 6.298 ± 0.45
3.268LysVal: 3.268 ± 0.271
0.406LysTrp: 0.406 ± 0.106
2.433LysTyr: 2.433 ± 0.234
0.0LysXaa: 0.0 ± 0.0
Leu
5.01LeuAla: 5.01 ± 0.404
3.507LeuCys: 3.507 ± 0.402
4.509LeuAsp: 4.509 ± 0.3
5.582LeuGlu: 5.582 ± 0.482
6.584LeuPhe: 6.584 ± 0.499
3.841LeuGly: 3.841 ± 0.373
2.958LeuHis: 2.958 ± 0.32
7.3LeuIle: 7.3 ± 0.521
7.944LeuLys: 7.944 ± 0.45
11.761LeuLeu: 11.761 ± 0.703
1.956LeuMet: 1.956 ± 0.239
6.656LeuAsn: 6.656 ± 0.434
4.819LeuPro: 4.819 ± 0.477
4.652LeuGln: 4.652 ± 0.318
4.366LeuArg: 4.366 ± 0.444
9.447LeuSer: 9.447 ± 0.425
6.942LeuThr: 6.942 ± 0.508
4.437LeuVal: 4.437 ± 0.351
0.74LeuTrp: 0.74 ± 0.141
4.246LeuTyr: 4.246 ± 0.28
0.0LeuXaa: 0.0 ± 0.0
Met
0.907MetAla: 0.907 ± 0.128
0.525MetCys: 0.525 ± 0.109
1.193MetAsp: 1.193 ± 0.171
1.407MetGlu: 1.407 ± 0.196
1.312MetPhe: 1.312 ± 0.153
1.026MetGly: 1.026 ± 0.225
0.596MetHis: 0.596 ± 0.123
1.431MetIle: 1.431 ± 0.191
1.24MetLys: 1.24 ± 0.172
2.6MetLeu: 2.6 ± 0.243
0.382MetMet: 0.382 ± 0.094
1.264MetAsn: 1.264 ± 0.196
0.811MetPro: 0.811 ± 0.147
1.002MetGln: 1.002 ± 0.175
0.644MetArg: 0.644 ± 0.149
2.075MetSer: 2.075 ± 0.224
1.503MetThr: 1.503 ± 0.198
0.716MetVal: 0.716 ± 0.142
0.382MetTrp: 0.382 ± 0.098
0.954MetTyr: 0.954 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
2.743AsnAla: 2.743 ± 0.306
1.431AsnCys: 1.431 ± 0.164
2.958AsnAsp: 2.958 ± 0.246
3.626AsnGlu: 3.626 ± 0.328
4.246AsnPhe: 4.246 ± 0.322
2.505AsnGly: 2.505 ± 0.229
1.384AsnHis: 1.384 ± 0.166
5.558AsnIle: 5.558 ± 0.461
4.199AsnLys: 4.199 ± 0.366
6.918AsnLeu: 6.918 ± 0.401
1.765AsnMet: 1.765 ± 0.231
3.459AsnAsn: 3.459 ± 0.338
3.006AsnPro: 3.006 ± 0.481
1.932AsnGln: 1.932 ± 0.229
2.457AsnArg: 2.457 ± 0.236
5.892AsnSer: 5.892 ± 0.495
3.602AsnThr: 3.602 ± 0.294
4.151AsnVal: 4.151 ± 0.35
0.358AsnTrp: 0.358 ± 0.089
2.457AsnTyr: 2.457 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
1.598ProAla: 1.598 ± 0.191
1.073ProCys: 1.073 ± 0.178
1.908ProAsp: 1.908 ± 0.236
2.362ProGlu: 2.362 ± 0.239
2.648ProPhe: 2.648 ± 0.378
1.193ProGly: 1.193 ± 0.237
1.24ProHis: 1.24 ± 0.313
3.22ProIle: 3.22 ± 0.275
2.338ProLys: 2.338 ± 0.239
4.032ProLeu: 4.032 ± 0.41
0.883ProMet: 0.883 ± 0.152
2.791ProAsn: 2.791 ± 0.431
1.551ProPro: 1.551 ± 0.232
1.527ProGln: 1.527 ± 0.253
1.574ProArg: 1.574 ± 0.209
3.054ProSer: 3.054 ± 0.351
2.457ProThr: 2.457 ± 0.245
2.672ProVal: 2.672 ± 0.346
0.429ProTrp: 0.429 ± 0.102
1.145ProTyr: 1.145 ± 0.189
0.0ProXaa: 0.0 ± 0.0
Gln
1.479GlnAla: 1.479 ± 0.243
0.954GlnCys: 0.954 ± 0.158
1.98GlnAsp: 1.98 ± 0.198
2.743GlnGlu: 2.743 ± 0.433
1.813GlnPhe: 1.813 ± 0.233
0.954GlnGly: 0.954 ± 0.149
0.907GlnHis: 0.907 ± 0.159
3.435GlnIle: 3.435 ± 0.295
3.244GlnLys: 3.244 ± 0.311
3.435GlnLeu: 3.435 ± 0.296
0.835GlnMet: 0.835 ± 0.177
3.268GlnAsn: 3.268 ± 0.316
1.169GlnPro: 1.169 ± 0.253
1.67GlnGln: 1.67 ± 0.202
1.813GlnArg: 1.813 ± 0.224
3.054GlnSer: 3.054 ± 0.312
2.672GlnThr: 2.672 ± 0.247
1.455GlnVal: 1.455 ± 0.181
0.119GlnTrp: 0.119 ± 0.057
1.312GlnTyr: 1.312 ± 0.16
0.0GlnXaa: 0.0 ± 0.0
Arg
1.741ArgAla: 1.741 ± 0.223
1.073ArgCys: 1.073 ± 0.158
2.075ArgAsp: 2.075 ± 0.272
2.099ArgGlu: 2.099 ± 0.252
2.576ArgPhe: 2.576 ± 0.27
2.266ArgGly: 2.266 ± 0.4
1.551ArgHis: 1.551 ± 0.187
2.576ArgIle: 2.576 ± 0.304
3.054ArgLys: 3.054 ± 0.264
3.865ArgLeu: 3.865 ± 0.324
0.835ArgMet: 0.835 ± 0.136
2.767ArgAsn: 2.767 ± 0.266
1.622ArgPro: 1.622 ± 0.361
2.052ArgGln: 2.052 ± 0.336
2.028ArgArg: 2.028 ± 0.293
2.553ArgSer: 2.553 ± 0.384
1.98ArgThr: 1.98 ± 0.265
2.052ArgVal: 2.052 ± 0.328
0.429ArgTrp: 0.429 ± 0.091
1.503ArgTyr: 1.503 ± 0.164
0.0ArgXaa: 0.0 ± 0.0
Ser
3.793SerAla: 3.793 ± 0.365
1.837SerCys: 1.837 ± 0.227
4.27SerAsp: 4.27 ± 0.555
5.057SerGlu: 5.057 ± 0.473
4.914SerPhe: 4.914 ± 0.327
2.576SerGly: 2.576 ± 0.238
2.195SerHis: 2.195 ± 0.351
6.513SerIle: 6.513 ± 0.346
6.107SerLys: 6.107 ± 0.405
8.087SerLeu: 8.087 ± 0.472
1.837SerMet: 1.837 ± 0.204
4.413SerAsn: 4.413 ± 0.357
3.077SerPro: 3.077 ± 0.281
2.553SerGln: 2.553 ± 0.272
3.34SerArg: 3.34 ± 0.332
6.703SerSer: 6.703 ± 0.803
4.867SerThr: 4.867 ± 0.491
4.795SerVal: 4.795 ± 0.377
0.859SerTrp: 0.859 ± 0.166
2.576SerTyr: 2.576 ± 0.212
0.0SerXaa: 0.0 ± 0.0
Thr
3.34ThrAla: 3.34 ± 0.351
1.527ThrCys: 1.527 ± 0.2
3.507ThrAsp: 3.507 ± 0.41
4.151ThrGlu: 4.151 ± 0.365
4.246ThrPhe: 4.246 ± 0.413
2.553ThrGly: 2.553 ± 0.329
1.622ThrHis: 1.622 ± 0.188
4.127ThrIle: 4.127 ± 0.356
3.912ThrLys: 3.912 ± 0.332
6.107ThrLeu: 6.107 ± 0.443
1.431ThrMet: 1.431 ± 0.181
3.888ThrAsn: 3.888 ± 0.356
2.743ThrPro: 2.743 ± 0.303
2.433ThrGln: 2.433 ± 0.31
1.98ThrArg: 1.98 ± 0.342
4.222ThrSer: 4.222 ± 0.318
4.079ThrThr: 4.079 ± 0.413
4.485ThrVal: 4.485 ± 0.39
0.453ThrTrp: 0.453 ± 0.115
2.529ThrTyr: 2.529 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
2.576ValAla: 2.576 ± 0.315
1.455ValCys: 1.455 ± 0.204
2.696ValAsp: 2.696 ± 0.276
2.982ValGlu: 2.982 ± 0.254
3.387ValPhe: 3.387 ± 0.313
1.837ValGly: 1.837 ± 0.26
1.312ValHis: 1.312 ± 0.171
3.602ValIle: 3.602 ± 0.351
3.65ValLys: 3.65 ± 0.401
6.202ValLeu: 6.202 ± 0.385
1.169ValMet: 1.169 ± 0.175
3.292ValAsn: 3.292 ± 0.287
1.908ValPro: 1.908 ± 0.23
1.813ValGln: 1.813 ± 0.215
2.099ValArg: 2.099 ± 0.311
4.509ValSer: 4.509 ± 0.332
4.079ValThr: 4.079 ± 0.347
2.6ValVal: 2.6 ± 0.244
0.453ValTrp: 0.453 ± 0.1
2.386ValTyr: 2.386 ± 0.25
0.0ValXaa: 0.0 ± 0.0
Trp
0.215TrpAla: 0.215 ± 0.073
0.191TrpCys: 0.191 ± 0.06
0.215TrpAsp: 0.215 ± 0.063
0.334TrpGlu: 0.334 ± 0.095
0.62TrpPhe: 0.62 ± 0.127
0.239TrpGly: 0.239 ± 0.083
0.167TrpHis: 0.167 ± 0.059
0.62TrpIle: 0.62 ± 0.112
0.811TrpLys: 0.811 ± 0.143
0.835TrpLeu: 0.835 ± 0.141
0.262TrpMet: 0.262 ± 0.082
0.358TrpAsn: 0.358 ± 0.079
0.62TrpPro: 0.62 ± 0.151
0.334TrpGln: 0.334 ± 0.085
0.31TrpArg: 0.31 ± 0.091
0.596TrpSer: 0.596 ± 0.135
0.596TrpThr: 0.596 ± 0.127
0.334TrpVal: 0.334 ± 0.093
0.048TrpTrp: 0.048 ± 0.035
0.31TrpTyr: 0.31 ± 0.072
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.67TyrAla: 1.67 ± 0.241
0.883TyrCys: 0.883 ± 0.137
1.956TyrAsp: 1.956 ± 0.193
2.004TyrGlu: 2.004 ± 0.228
2.219TyrPhe: 2.219 ± 0.207
1.67TyrGly: 1.67 ± 0.173
0.859TyrHis: 0.859 ± 0.137
3.101TyrIle: 3.101 ± 0.319
2.839TyrLys: 2.839 ± 0.273
3.912TyrLeu: 3.912 ± 0.353
0.596TyrMet: 0.596 ± 0.135
2.553TyrAsn: 2.553 ± 0.304
1.217TyrPro: 1.217 ± 0.157
1.36TyrGln: 1.36 ± 0.209
1.861TyrArg: 1.861 ± 0.186
3.006TyrSer: 3.006 ± 0.268
2.099TyrThr: 2.099 ± 0.223
2.433TyrVal: 2.433 ± 0.228
0.501TyrTrp: 0.501 ± 0.108
1.479TyrTyr: 1.479 ± 0.217
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (41920 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski