Amino acid dipeptide frequency for Human herpesvirus 8 type P (isolate GK18) (HHV-8) (Kaposi's sarcoma-associated herpesvirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.
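The arithmetic behind the expected value can be checked with a short Python sketch. The function names and the simple "above or below the uniform expectation" rule are assumptions made for illustration; the page does not state the exact threshold used for the red and blue marking.

```python
def uniform_expectation(n_residues=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_permille, expected_permille):
    """Hypothetical rule: compare an observed value with the expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "expected"


expected_400 = uniform_expectation(20)  # 400 dipeptides
expected_441 = uniform_expectation(21)  # 441 dipeptides when Xaa is included

# Two values taken from the table: LeuLeu (10.764) and TrpTrp (0.103).
leu_leu = classify(10.764, expected_400)
trp_trp = classify(0.103, expected_400)
print(expected_400, round(expected_441, 3), leu_leu, trp_trp)
```

Under this assumed rule, LeuLeu falls on the overrepresented side and TrpTrp on the underrepresented side.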

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
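As a rough illustration of how such values could be obtained, the sketch below counts overlapping dipeptides within each protein and reports them in per mille. It assumes one-letter sequence codes and that dipeptides are never counted across protein boundaries; the page does not describe the exact counting procedure, and the function names and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Count overlapping dipeptides per protein; return per-mille frequencies."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def permille_to_fraction(value):
    """Convert a per-mille value into a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Toy input, not taken from the HHV-8 proteome.
toy = ["MALLA", "LLK"]
freqs = dipeptide_permille(toy)
print(freqs)
print(permille_to_fraction(7.124))  # AlaAla value from the table as a fraction
```

The toy input contains six dipeptides in total, two of which are LL, so LL accounts for one third of them.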

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.124AlaAla: 7.124 ± 0.604
2.323AlaCys: 2.323 ± 0.233
3.02AlaAsp: 3.02 ± 0.328
3.252AlaGlu: 3.252 ± 0.361
2.968AlaPhe: 2.968 ± 0.253
4.491AlaGly: 4.491 ± 0.393
1.575AlaHis: 1.575 ± 0.235
3.485AlaIle: 3.485 ± 0.352
2.452AlaLys: 2.452 ± 0.309
8.079AlaLeu: 8.079 ± 0.576
1.729AlaMet: 1.729 ± 0.21
2.349AlaAsn: 2.349 ± 0.309
5.421AlaPro: 5.421 ± 0.605
2.943AlaGln: 2.943 ± 0.358
4.285AlaArg: 4.285 ± 0.342
6.531AlaSer: 6.531 ± 0.398
5.266AlaThr: 5.266 ± 0.484
5.601AlaVal: 5.601 ± 0.442
0.671AlaTrp: 0.671 ± 0.121
2.349AlaTyr: 2.349 ± 0.268
0.0AlaXaa: 0.0 ± 0.0
Cys
1.781CysAla: 1.781 ± 0.249
0.671CysCys: 0.671 ± 0.138
1.136CysAsp: 1.136 ± 0.191
1.316CysGlu: 1.316 ± 0.219
1.213CysPhe: 1.213 ± 0.197
1.549CysGly: 1.549 ± 0.235
0.619CysHis: 0.619 ± 0.123
1.162CysIle: 1.162 ± 0.192
0.619CysLys: 0.619 ± 0.137
3.201CysLeu: 3.201 ± 0.415
0.439CysMet: 0.439 ± 0.108
0.852CysAsn: 0.852 ± 0.186
1.523CysPro: 1.523 ± 0.183
1.342CysGln: 1.342 ± 0.187
1.858CysArg: 1.858 ± 0.229
1.884CysSer: 1.884 ± 0.253
1.471CysThr: 1.471 ± 0.152
1.704CysVal: 1.704 ± 0.227
0.258CysTrp: 0.258 ± 0.078
0.878CysTyr: 0.878 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
3.846AspAla: 3.846 ± 0.405
1.11AspCys: 1.11 ± 0.208
2.891AspAsp: 2.891 ± 0.529
3.51AspGlu: 3.51 ± 0.96
1.833AspPhe: 1.833 ± 0.211
2.684AspGly: 2.684 ± 0.273
0.981AspHis: 0.981 ± 0.16
2.684AspIle: 2.684 ± 0.224
1.575AspLys: 1.575 ± 0.18
4.156AspLeu: 4.156 ± 0.349
1.342AspMet: 1.342 ± 0.203
1.523AspAsn: 1.523 ± 0.19
3.252AspPro: 3.252 ± 0.308
1.291AspGln: 1.291 ± 0.18
2.452AspArg: 2.452 ± 0.263
2.994AspSer: 2.994 ± 0.316
3.252AspThr: 3.252 ± 0.24
3.407AspVal: 3.407 ± 0.356
0.774AspTrp: 0.774 ± 0.17
1.291AspTyr: 1.291 ± 0.192
0.0AspXaa: 0.0 ± 0.0
Glu
4.362GluAla: 4.362 ± 0.425
1.265GluCys: 1.265 ± 0.216
3.407GluAsp: 3.407 ± 0.561
4.827GluGlu: 4.827 ± 1.474
1.497GluPhe: 1.497 ± 0.206
3.046GluGly: 3.046 ± 0.327
1.471GluHis: 1.471 ± 0.167
2.891GluIle: 2.891 ± 0.198
2.065GluLys: 2.065 ± 0.247
5.292GluLeu: 5.292 ± 0.466
0.903GluMet: 0.903 ± 0.141
1.884GluAsn: 1.884 ± 0.191
3.588GluPro: 3.588 ± 0.737
3.975GluGln: 3.975 ± 1.814
3.201GluArg: 3.201 ± 0.33
3.072GluSer: 3.072 ± 0.305
3.691GluThr: 3.691 ± 0.35
3.278GluVal: 3.278 ± 0.316
0.619GluTrp: 0.619 ± 0.12
1.342GluTyr: 1.342 ± 0.174
0.0GluXaa: 0.0 ± 0.0
Phe
2.581PheAla: 2.581 ± 0.225
1.136PheCys: 1.136 ± 0.207
1.729PheAsp: 1.729 ± 0.237
1.781PheGlu: 1.781 ± 0.227
2.323PhePhe: 2.323 ± 0.263
2.426PheGly: 2.426 ± 0.273
1.007PheHis: 1.007 ± 0.156
2.555PheIle: 2.555 ± 0.235
1.858PheLys: 1.858 ± 0.229
4.982PheLeu: 4.982 ± 0.47
0.878PheMet: 0.878 ± 0.159
1.497PheAsn: 1.497 ± 0.19
2.091PhePro: 2.091 ± 0.271
1.91PheGln: 1.91 ± 0.216
1.962PheArg: 1.962 ± 0.187
3.33PheSer: 3.33 ± 0.33
2.091PheThr: 2.091 ± 0.224
3.278PheVal: 3.278 ± 0.358
0.516PheTrp: 0.516 ± 0.117
1.678PheTyr: 1.678 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
4.595GlyAla: 4.595 ± 0.497
1.058GlyCys: 1.058 ± 0.167
3.433GlyAsp: 3.433 ± 0.384
3.433GlyGlu: 3.433 ± 0.293
2.659GlyPhe: 2.659 ± 0.309
4.13GlyGly: 4.13 ± 0.323
1.471GlyHis: 1.471 ± 0.17
2.814GlyIle: 2.814 ± 0.324
2.194GlyLys: 2.194 ± 0.305
7.305GlyLeu: 7.305 ± 0.459
1.11GlyMet: 1.11 ± 0.152
2.142GlyAsn: 2.142 ± 0.213
3.665GlyPro: 3.665 ± 0.36
2.917GlyGln: 2.917 ± 0.297
4.053GlyArg: 4.053 ± 0.348
4.362GlySer: 4.362 ± 0.405
3.614GlyThr: 3.614 ± 0.29
4.182GlyVal: 4.182 ± 0.336
0.749GlyTrp: 0.749 ± 0.154
1.729GlyTyr: 1.729 ± 0.279
0.0GlyXaa: 0.0 ± 0.0
His
1.858HisAla: 1.858 ± 0.263
0.723HisCys: 0.723 ± 0.154
1.058HisAsp: 1.058 ± 0.162
1.239HisGlu: 1.239 ± 0.173
1.239HisPhe: 1.239 ± 0.184
1.523HisGly: 1.523 ± 0.185
0.903HisHis: 0.903 ± 0.146
1.523HisIle: 1.523 ± 0.21
1.032HisLys: 1.032 ± 0.194
2.917HisLeu: 2.917 ± 0.267
0.568HisMet: 0.568 ± 0.148
0.826HisAsn: 0.826 ± 0.128
2.297HisPro: 2.297 ± 0.242
1.136HisGln: 1.136 ± 0.186
1.858HisArg: 1.858 ± 0.223
1.833HisSer: 1.833 ± 0.246
1.445HisThr: 1.445 ± 0.202
2.375HisVal: 2.375 ± 0.225
0.258HisTrp: 0.258 ± 0.098
0.852HisTyr: 0.852 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
2.426IleAla: 2.426 ± 0.231
1.575IleCys: 1.575 ± 0.222
2.271IleAsp: 2.271 ± 0.218
1.523IleGlu: 1.523 ± 0.196
2.814IlePhe: 2.814 ± 0.252
1.704IleGly: 1.704 ± 0.217
0.852IleHis: 0.852 ± 0.142
2.401IleIle: 2.401 ± 0.248
1.988IleLys: 1.988 ± 0.303
4.749IleLeu: 4.749 ± 0.396
0.981IleMet: 0.981 ± 0.153
1.729IleAsn: 1.729 ± 0.273
3.536IlePro: 3.536 ± 0.398
2.168IleGln: 2.168 ± 0.214
2.452IleArg: 2.452 ± 0.321
4.104IleSer: 4.104 ± 0.37
3.097IleThr: 3.097 ± 0.295
2.788IleVal: 2.788 ± 0.284
0.336IleTrp: 0.336 ± 0.087
1.858IleTyr: 1.858 ± 0.22
0.0IleXaa: 0.0 ± 0.0
Lys
2.607LysAla: 2.607 ± 0.304
0.749LysCys: 0.749 ± 0.138
2.013LysAsp: 2.013 ± 0.197
1.962LysGlu: 1.962 ± 0.174
1.316LysPhe: 1.316 ± 0.163
2.142LysGly: 2.142 ± 0.244
1.187LysHis: 1.187 ± 0.201
2.168LysIle: 2.168 ± 0.278
2.091LysLys: 2.091 ± 0.248
4.182LysLeu: 4.182 ± 0.366
0.852LysMet: 0.852 ± 0.137
1.626LysAsn: 1.626 ± 0.245
2.065LysPro: 2.065 ± 0.257
1.704LysGln: 1.704 ± 0.242
2.53LysArg: 2.53 ± 0.244
2.246LysSer: 2.246 ± 0.232
2.839LysThr: 2.839 ± 0.321
1.833LysVal: 1.833 ± 0.247
0.387LysTrp: 0.387 ± 0.108
1.11LysTyr: 1.11 ± 0.187
0.0LysXaa: 0.0 ± 0.0
Leu
7.795LeuAla: 7.795 ± 0.53
3.097LeuCys: 3.097 ± 0.348
3.949LeuAsp: 3.949 ± 0.372
6.298LeuGlu: 6.298 ± 0.555
4.904LeuPhe: 4.904 ± 0.484
6.84LeuGly: 6.84 ± 0.538
3.149LeuHis: 3.149 ± 0.293
3.407LeuIle: 3.407 ± 0.343
3.743LeuLys: 3.743 ± 0.355
10.764LeuLeu: 10.764 ± 0.767
2.013LeuMet: 2.013 ± 0.22
3.097LeuAsn: 3.097 ± 0.366
7.382LeuPro: 7.382 ± 0.461
4.466LeuGln: 4.466 ± 0.43
6.401LeuArg: 6.401 ± 0.339
8.002LeuSer: 8.002 ± 0.551
7.124LeuThr: 7.124 ± 0.437
7.047LeuVal: 7.047 ± 0.522
1.291LeuTrp: 1.291 ± 0.198
2.968LeuTyr: 2.968 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
2.168MetAla: 2.168 ± 0.234
0.671MetCys: 0.671 ± 0.136
1.239MetAsp: 1.239 ± 0.181
1.11MetGlu: 1.11 ± 0.153
1.007MetPhe: 1.007 ± 0.153
1.497MetGly: 1.497 ± 0.24
0.542MetHis: 0.542 ± 0.122
0.749MetIle: 0.749 ± 0.159
0.955MetLys: 0.955 ± 0.197
2.194MetLeu: 2.194 ± 0.289
0.336MetMet: 0.336 ± 0.116
0.439MetAsn: 0.439 ± 0.126
1.084MetPro: 1.084 ± 0.171
0.723MetGln: 0.723 ± 0.147
0.981MetArg: 0.981 ± 0.139
1.42MetSer: 1.42 ± 0.211
1.213MetThr: 1.213 ± 0.209
1.11MetVal: 1.11 ± 0.173
0.258MetTrp: 0.258 ± 0.074
0.8MetTyr: 0.8 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
2.039AsnAla: 2.039 ± 0.278
0.878AsnCys: 0.878 ± 0.149
0.903AsnAsp: 0.903 ± 0.153
1.678AsnGlu: 1.678 ± 0.257
1.523AsnPhe: 1.523 ± 0.208
1.807AsnGly: 1.807 ± 0.263
0.723AsnHis: 0.723 ± 0.136
2.168AsnIle: 2.168 ± 0.297
1.704AsnLys: 1.704 ± 0.226
3.588AsnLeu: 3.588 ± 0.316
0.903AsnMet: 0.903 ± 0.175
1.549AsnAsn: 1.549 ± 0.169
2.142AsnPro: 2.142 ± 0.22
0.903AsnGln: 0.903 ± 0.16
1.445AsnArg: 1.445 ± 0.194
2.168AsnSer: 2.168 ± 0.299
2.323AsnThr: 2.323 ± 0.295
2.684AsnVal: 2.684 ± 0.289
0.258AsnTrp: 0.258 ± 0.079
0.826AsnTyr: 0.826 ± 0.142
0.0AsnXaa: 0.0 ± 0.0
Pro
5.653ProAla: 5.653 ± 0.574
1.523ProCys: 1.523 ± 0.238
3.072ProAsp: 3.072 ± 0.258
3.536ProGlu: 3.536 ± 0.417
2.117ProPhe: 2.117 ± 0.229
4.956ProGly: 4.956 ± 0.43
2.168ProHis: 2.168 ± 0.304
2.246ProIle: 2.246 ± 0.295
2.323ProLys: 2.323 ± 0.279
6.453ProLeu: 6.453 ± 0.445
1.213ProMet: 1.213 ± 0.186
2.065ProAsn: 2.065 ± 0.228
6.711ProPro: 6.711 ± 0.784
3.252ProGln: 3.252 ± 0.803
3.898ProArg: 3.898 ± 0.353
5.446ProSer: 5.446 ± 0.435
5.214ProThr: 5.214 ± 0.366
5.292ProVal: 5.292 ± 0.38
0.981ProTrp: 0.981 ± 0.202
1.549ProTyr: 1.549 ± 0.171
0.0ProXaa: 0.0 ± 0.0
Gln
3.356GlnAla: 3.356 ± 0.41
0.878GlnCys: 0.878 ± 0.173
2.065GlnAsp: 2.065 ± 0.451
4.801GlnGlu: 4.801 ± 1.822
1.6GlnPhe: 1.6 ± 0.202
2.607GlnGly: 2.607 ± 0.331
0.8GlnHis: 0.8 ± 0.15
1.549GlnIle: 1.549 ± 0.235
1.988GlnLys: 1.988 ± 0.303
3.794GlnLeu: 3.794 ± 0.369
0.8GlnMet: 0.8 ± 0.159
1.42GlnAsn: 1.42 ± 0.177
2.401GlnPro: 2.401 ± 0.313
4.207GlnGln: 4.207 ± 2.236
2.478GlnArg: 2.478 ± 0.358
3.51GlnSer: 3.51 ± 0.264
2.839GlnThr: 2.839 ± 0.287
2.091GlnVal: 2.091 ± 0.222
0.465GlnTrp: 0.465 ± 0.099
0.878GlnTyr: 0.878 ± 0.18
0.0GlnXaa: 0.0 ± 0.0
Arg
4.491ArgAla: 4.491 ± 0.4
1.007ArgCys: 1.007 ± 0.171
3.046ArgAsp: 3.046 ± 0.248
3.743ArgGlu: 3.743 ± 0.358
1.858ArgPhe: 1.858 ± 0.215
4.595ArgGly: 4.595 ± 0.391
2.065ArgHis: 2.065 ± 0.302
2.323ArgIle: 2.323 ± 0.276
2.478ArgLys: 2.478 ± 0.285
5.963ArgLeu: 5.963 ± 0.412
1.394ArgMet: 1.394 ± 0.204
1.42ArgAsn: 1.42 ± 0.171
3.769ArgPro: 3.769 ± 0.358
2.53ArgGln: 2.53 ± 0.231
4.749ArgArg: 4.749 ± 0.45
3.381ArgSer: 3.381 ± 0.249
3.175ArgThr: 3.175 ± 0.289
4.259ArgVal: 4.259 ± 0.389
0.697ArgTrp: 0.697 ± 0.13
1.549ArgTyr: 1.549 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
5.085SerAla: 5.085 ± 0.344
1.91SerCys: 1.91 ± 0.209
3.097SerAsp: 3.097 ± 0.293
3.252SerGlu: 3.252 ± 0.282
2.865SerPhe: 2.865 ± 0.26
5.24SerGly: 5.24 ± 0.465
2.555SerHis: 2.555 ± 0.264
3.33SerIle: 3.33 ± 0.294
2.736SerLys: 2.736 ± 0.283
7.408SerLeu: 7.408 ± 0.505
1.729SerMet: 1.729 ± 0.238
2.246SerAsn: 2.246 ± 0.246
6.401SerPro: 6.401 ± 0.611
3.149SerGln: 3.149 ± 0.248
4.001SerArg: 4.001 ± 0.338
7.124SerSer: 7.124 ± 0.682
5.498SerThr: 5.498 ± 0.366
5.292SerVal: 5.292 ± 0.407
1.162SerTrp: 1.162 ± 0.161
1.833SerTyr: 1.833 ± 0.224
0.0SerXaa: 0.0 ± 0.0
Thr
5.24ThrAla: 5.24 ± 0.415
1.42ThrCys: 1.42 ± 0.215
3.252ThrAsp: 3.252 ± 0.313
2.994ThrGlu: 2.994 ± 0.223
2.607ThrPhe: 2.607 ± 0.302
4.13ThrGly: 4.13 ± 0.403
2.168ThrHis: 2.168 ± 0.245
2.323ThrIle: 2.323 ± 0.277
2.039ThrLys: 2.039 ± 0.302
7.15ThrLeu: 7.15 ± 0.399
1.032ThrMet: 1.032 ± 0.168
1.575ThrAsn: 1.575 ± 0.25
5.679ThrPro: 5.679 ± 0.457
2.349ThrGln: 2.349 ± 0.242
3.381ThrArg: 3.381 ± 0.348
5.601ThrSer: 5.601 ± 0.389
4.466ThrThr: 4.466 ± 0.361
5.188ThrVal: 5.188 ± 0.47
1.136ThrTrp: 1.136 ± 0.212
2.091ThrTyr: 2.091 ± 0.252
0.0ThrXaa: 0.0 ± 0.0
Val
5.575ValAla: 5.575 ± 0.421
2.22ValCys: 2.22 ± 0.281
3.149ValAsp: 3.149 ± 0.271
3.665ValGlu: 3.665 ± 0.303
3.588ValPhe: 3.588 ± 0.372
4.104ValGly: 4.104 ± 0.355
1.936ValHis: 1.936 ± 0.215
3.304ValIle: 3.304 ± 0.278
2.297ValLys: 2.297 ± 0.251
6.685ValLeu: 6.685 ± 0.456
1.342ValMet: 1.342 ± 0.211
2.22ValAsn: 2.22 ± 0.398
4.543ValPro: 4.543 ± 0.344
2.013ValGln: 2.013 ± 0.216
3.691ValArg: 3.691 ± 0.344
6.092ValSer: 6.092 ± 0.405
4.517ValThr: 4.517 ± 0.33
5.111ValVal: 5.111 ± 0.415
0.852ValTrp: 0.852 ± 0.183
2.71ValTyr: 2.71 ± 0.27
0.0ValXaa: 0.0 ± 0.0
Trp
1.032TrpAla: 1.032 ± 0.16
0.258TrpCys: 0.258 ± 0.09
0.723TrpAsp: 0.723 ± 0.128
0.594TrpGlu: 0.594 ± 0.111
0.387TrpPhe: 0.387 ± 0.096
0.516TrpGly: 0.516 ± 0.105
0.516TrpHis: 0.516 ± 0.104
0.774TrpIle: 0.774 ± 0.162
0.31TrpLys: 0.31 ± 0.081
1.445TrpLeu: 1.445 ± 0.178
0.206TrpMet: 0.206 ± 0.089
0.387TrpAsn: 0.387 ± 0.131
0.697TrpPro: 0.697 ± 0.117
0.465TrpGln: 0.465 ± 0.107
0.8TrpArg: 0.8 ± 0.162
0.8TrpSer: 0.8 ± 0.136
0.878TrpThr: 0.878 ± 0.158
0.8TrpVal: 0.8 ± 0.142
0.103TrpTrp: 0.103 ± 0.065
0.542TrpTyr: 0.542 ± 0.109
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.297TyrAla: 2.297 ± 0.207
1.007TyrCys: 1.007 ± 0.178
1.394TyrAsp: 1.394 ± 0.205
1.162TyrGlu: 1.162 ± 0.197
1.291TyrPhe: 1.291 ± 0.235
1.626TyrGly: 1.626 ± 0.204
0.774TyrHis: 0.774 ± 0.135
1.497TyrIle: 1.497 ± 0.196
1.162TyrLys: 1.162 ± 0.165
3.433TyrLeu: 3.433 ± 0.334
0.697TyrMet: 0.697 ± 0.135
1.368TyrAsn: 1.368 ± 0.201
1.42TyrPro: 1.42 ± 0.143
1.11TyrGln: 1.11 ± 0.172
2.013TyrArg: 2.013 ± 0.229
2.013TyrSer: 2.013 ± 0.274
1.807TyrThr: 1.807 ± 0.255
2.297TyrVal: 2.297 ± 0.242
0.465TyrTrp: 0.465 ± 0.123
0.878TyrTyr: 0.878 ± 0.13
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (38742 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
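A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions; the function names, the use of the standard deviation, the fixed seed, and the toy sequences are not taken from the page.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in the sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Resample whole proteins with replacement; return the std of the estimates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not taken from the HHV-8 proteome.
toy = ["MALLA", "LLK", "GAVLL", "KKAA"]
err = bootstrap_error(toy, "LL")
print(pair_permille(toy, "LL"), err)
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, which is one plausible reason why a few repeat-rich proteins can produce large errors for individual dipeptides.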

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski