Amino acid dipepetide frequency for Human herpesvirus 8 (HHV-8) (Kaposi s sarcoma-associated herpesvirus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.114AlaAla: 7.114 ± 0.649
2.286AlaCys: 2.286 ± 0.219
3.005AlaAsp: 3.005 ± 0.317
3.262AlaGlu: 3.262 ± 0.35
3.005AlaPhe: 3.005 ± 0.268
4.443AlaGly: 4.443 ± 0.377
1.592AlaHis: 1.592 ± 0.211
3.467AlaIle: 3.467 ± 0.374
2.491AlaLys: 2.491 ± 0.307
8.013AlaLeu: 8.013 ± 0.641
1.849AlaMet: 1.849 ± 0.203
2.363AlaAsn: 2.363 ± 0.225
5.394AlaPro: 5.394 ± 0.591
2.902AlaGln: 2.902 ± 0.326
4.264AlaArg: 4.264 ± 0.317
6.549AlaSer: 6.549 ± 0.429
5.291AlaThr: 5.291 ± 0.456
5.573AlaVal: 5.573 ± 0.42
0.668AlaTrp: 0.668 ± 0.1
2.363AlaTyr: 2.363 ± 0.277
0.0AlaXaa: 0.0 ± 0.0
Cys
1.772CysAla: 1.772 ± 0.21
0.668CysCys: 0.668 ± 0.133
1.156CysAsp: 1.156 ± 0.179
1.336CysGlu: 1.336 ± 0.205
1.207CysPhe: 1.207 ± 0.175
1.541CysGly: 1.541 ± 0.183
0.616CysHis: 0.616 ± 0.103
1.156CysIle: 1.156 ± 0.174
0.616CysLys: 0.616 ± 0.129
3.185CysLeu: 3.185 ± 0.349
0.437CysMet: 0.437 ± 0.121
0.848CysAsn: 0.848 ± 0.193
1.515CysPro: 1.515 ± 0.185
1.31CysGln: 1.31 ± 0.17
1.849CysArg: 1.849 ± 0.196
1.875CysSer: 1.875 ± 0.203
1.464CysThr: 1.464 ± 0.163
1.695CysVal: 1.695 ± 0.183
0.283CysTrp: 0.283 ± 0.081
0.848CysTyr: 0.848 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
3.878AspAla: 3.878 ± 0.385
1.104AspCys: 1.104 ± 0.197
2.877AspAsp: 2.877 ± 0.557
3.262AspGlu: 3.262 ± 0.821
1.849AspPhe: 1.849 ± 0.205
2.748AspGly: 2.748 ± 0.242
0.976AspHis: 0.976 ± 0.139
2.697AspIle: 2.697 ± 0.216
1.541AspLys: 1.541 ± 0.176
4.084AspLeu: 4.084 ± 0.326
1.361AspMet: 1.361 ± 0.183
1.541AspAsn: 1.541 ± 0.194
3.21AspPro: 3.21 ± 0.315
1.31AspGln: 1.31 ± 0.155
2.44AspArg: 2.44 ± 0.261
3.056AspSer: 3.056 ± 0.366
3.236AspThr: 3.236 ± 0.269
3.39AspVal: 3.39 ± 0.365
0.771AspTrp: 0.771 ± 0.138
1.31AspTyr: 1.31 ± 0.204
0.0AspXaa: 0.0 ± 0.0
Glu
4.366GluAla: 4.366 ± 0.437
1.233GluCys: 1.233 ± 0.224
3.39GluAsp: 3.39 ± 0.667
5.06GluGlu: 5.06 ± 1.855
1.438GluPhe: 1.438 ± 0.194
3.056GluGly: 3.056 ± 0.334
1.438GluHis: 1.438 ± 0.156
2.877GluIle: 2.877 ± 0.229
2.055GluLys: 2.055 ± 0.26
5.522GluLeu: 5.522 ± 0.709
0.925GluMet: 0.925 ± 0.149
1.875GluAsn: 1.875 ± 0.183
3.442GluPro: 3.442 ± 0.784
4.058GluGln: 4.058 ± 2.152
3.21GluArg: 3.21 ± 0.237
3.108GluSer: 3.108 ± 0.276
3.698GluThr: 3.698 ± 0.335
3.313GluVal: 3.313 ± 0.304
0.642GluTrp: 0.642 ± 0.115
1.361GluTyr: 1.361 ± 0.163
0.0GluXaa: 0.0 ± 0.0
Phe
2.594PheAla: 2.594 ± 0.24
1.104PheCys: 1.104 ± 0.157
1.747PheAsp: 1.747 ± 0.227
1.798PheGlu: 1.798 ± 0.237
2.26PhePhe: 2.26 ± 0.243
2.414PheGly: 2.414 ± 0.232
1.002PheHis: 1.002 ± 0.159
2.543PheIle: 2.543 ± 0.249
1.798PheLys: 1.798 ± 0.217
5.008PheLeu: 5.008 ± 0.402
0.848PheMet: 0.848 ± 0.135
1.49PheAsn: 1.49 ± 0.162
2.132PhePro: 2.132 ± 0.243
1.901PheGln: 1.901 ± 0.176
1.952PheArg: 1.952 ± 0.179
3.365PheSer: 3.365 ± 0.35
2.08PheThr: 2.08 ± 0.229
3.236PheVal: 3.236 ± 0.326
0.514PheTrp: 0.514 ± 0.132
1.669PheTyr: 1.669 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
4.546GlyAla: 4.546 ± 0.44
1.053GlyCys: 1.053 ± 0.184
3.442GlyAsp: 3.442 ± 0.312
3.39GlyGlu: 3.39 ± 0.282
2.671GlyPhe: 2.671 ± 0.275
4.161GlyGly: 4.161 ± 0.382
1.49GlyHis: 1.49 ± 0.189
2.8GlyIle: 2.8 ± 0.311
2.209GlyLys: 2.209 ± 0.293
7.243GlyLeu: 7.243 ± 0.423
1.104GlyMet: 1.104 ± 0.144
2.132GlyAsn: 2.132 ± 0.194
3.673GlyPro: 3.673 ± 0.339
2.851GlyGln: 2.851 ± 0.255
4.186GlyArg: 4.186 ± 0.291
4.392GlySer: 4.392 ± 0.366
3.596GlyThr: 3.596 ± 0.263
4.212GlyVal: 4.212 ± 0.345
0.745GlyTrp: 0.745 ± 0.145
1.721GlyTyr: 1.721 ± 0.265
0.0GlyXaa: 0.0 ± 0.0
His
1.875HisAla: 1.875 ± 0.243
0.719HisCys: 0.719 ± 0.126
1.104HisAsp: 1.104 ± 0.183
1.207HisGlu: 1.207 ± 0.143
1.259HisPhe: 1.259 ± 0.171
1.515HisGly: 1.515 ± 0.186
0.899HisHis: 0.899 ± 0.135
1.567HisIle: 1.567 ± 0.208
1.027HisLys: 1.027 ± 0.203
2.902HisLeu: 2.902 ± 0.255
0.565HisMet: 0.565 ± 0.152
0.796HisAsn: 0.796 ± 0.127
2.312HisPro: 2.312 ± 0.264
1.13HisGln: 1.13 ± 0.161
1.926HisArg: 1.926 ± 0.201
1.875HisSer: 1.875 ± 0.242
1.464HisThr: 1.464 ± 0.188
2.363HisVal: 2.363 ± 0.232
0.257HisTrp: 0.257 ± 0.086
0.848HisTyr: 0.848 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
2.414IleAla: 2.414 ± 0.204
1.541IleCys: 1.541 ± 0.208
2.26IleAsp: 2.26 ± 0.203
1.515IleGlu: 1.515 ± 0.178
2.825IlePhe: 2.825 ± 0.297
1.721IleGly: 1.721 ± 0.227
0.873IleHis: 0.873 ± 0.136
2.389IleIle: 2.389 ± 0.241
1.978IleLys: 1.978 ± 0.297
4.674IleLeu: 4.674 ± 0.356
0.976IleMet: 0.976 ± 0.152
1.721IleAsn: 1.721 ± 0.269
3.519IlePro: 3.519 ± 0.376
2.234IleGln: 2.234 ± 0.224
2.44IleArg: 2.44 ± 0.27
4.135IleSer: 4.135 ± 0.34
3.082IleThr: 3.082 ± 0.292
2.825IleVal: 2.825 ± 0.302
0.334IleTrp: 0.334 ± 0.086
1.849IleTyr: 1.849 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
2.594LysAla: 2.594 ± 0.296
0.745LysCys: 0.745 ± 0.152
1.978LysAsp: 1.978 ± 0.221
1.926LysGlu: 1.926 ± 0.179
1.31LysPhe: 1.31 ± 0.159
2.132LysGly: 2.132 ± 0.242
1.156LysHis: 1.156 ± 0.174
2.157LysIle: 2.157 ± 0.28
2.08LysLys: 2.08 ± 0.233
4.161LysLeu: 4.161 ± 0.299
0.822LysMet: 0.822 ± 0.134
1.618LysAsn: 1.618 ± 0.228
2.106LysPro: 2.106 ± 0.318
1.721LysGln: 1.721 ± 0.216
2.517LysArg: 2.517 ± 0.248
2.26LysSer: 2.26 ± 0.246
2.851LysThr: 2.851 ± 0.278
1.824LysVal: 1.824 ± 0.224
0.385LysTrp: 0.385 ± 0.11
1.104LysTyr: 1.104 ± 0.177
0.0LysXaa: 0.0 ± 0.0
Leu
7.782LeuAla: 7.782 ± 0.506
3.159LeuCys: 3.159 ± 0.346
3.981LeuAsp: 3.981 ± 0.356
6.524LeuGlu: 6.524 ± 0.741
4.88LeuPhe: 4.88 ± 0.45
6.832LeuGly: 6.832 ± 0.458
3.185LeuHis: 3.185 ± 0.288
3.442LeuIle: 3.442 ± 0.372
3.673LeuLys: 3.673 ± 0.348
10.736LeuLeu: 10.736 ± 0.837
2.003LeuMet: 2.003 ± 0.25
3.056LeuAsn: 3.056 ± 0.346
7.371LeuPro: 7.371 ± 0.403
4.418LeuGln: 4.418 ± 0.417
6.37LeuArg: 6.37 ± 0.372
8.039LeuSer: 8.039 ± 0.516
7.089LeuThr: 7.089 ± 0.416
7.063LeuVal: 7.063 ± 0.516
1.284LeuTrp: 1.284 ± 0.196
2.902LeuTyr: 2.902 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
2.157MetAla: 2.157 ± 0.222
0.693MetCys: 0.693 ± 0.133
1.259MetAsp: 1.259 ± 0.191
1.104MetGlu: 1.104 ± 0.168
1.002MetPhe: 1.002 ± 0.139
1.464MetGly: 1.464 ± 0.195
0.539MetHis: 0.539 ± 0.124
0.745MetIle: 0.745 ± 0.155
0.976MetLys: 0.976 ± 0.174
2.234MetLeu: 2.234 ± 0.293
0.334MetMet: 0.334 ± 0.122
0.437MetAsn: 0.437 ± 0.114
1.104MetPro: 1.104 ± 0.146
0.719MetGln: 0.719 ± 0.139
0.95MetArg: 0.95 ± 0.13
1.49MetSer: 1.49 ± 0.19
1.207MetThr: 1.207 ± 0.201
1.104MetVal: 1.104 ± 0.183
0.257MetTrp: 0.257 ± 0.072
0.822MetTyr: 0.822 ± 0.128
0.0MetXaa: 0.0 ± 0.0
Asn
2.029AsnAla: 2.029 ± 0.246
0.873AsnCys: 0.873 ± 0.152
0.899AsnAsp: 0.899 ± 0.147
1.721AsnGlu: 1.721 ± 0.246
1.49AsnPhe: 1.49 ± 0.22
1.747AsnGly: 1.747 ± 0.267
0.745AsnHis: 0.745 ± 0.14
2.132AsnIle: 2.132 ± 0.275
1.695AsnLys: 1.695 ± 0.229
3.596AsnLeu: 3.596 ± 0.31
0.899AsnMet: 0.899 ± 0.167
1.567AsnAsn: 1.567 ± 0.159
2.157AsnPro: 2.157 ± 0.213
0.873AsnGln: 0.873 ± 0.171
1.49AsnArg: 1.49 ± 0.191
2.183AsnSer: 2.183 ± 0.309
2.337AsnThr: 2.337 ± 0.279
2.671AsnVal: 2.671 ± 0.269
0.257AsnTrp: 0.257 ± 0.081
0.822AsnTyr: 0.822 ± 0.148
0.0AsnXaa: 0.0 ± 0.0
Pro
5.702ProAla: 5.702 ± 0.478
1.515ProCys: 1.515 ± 0.236
3.031ProAsp: 3.031 ± 0.246
3.467ProGlu: 3.467 ± 0.39
2.08ProPhe: 2.08 ± 0.213
4.931ProGly: 4.931 ± 0.416
2.157ProHis: 2.157 ± 0.311
2.234ProIle: 2.234 ± 0.284
2.363ProLys: 2.363 ± 0.256
6.447ProLeu: 6.447 ± 0.463
1.233ProMet: 1.233 ± 0.2
2.029ProAsn: 2.029 ± 0.258
6.755ProPro: 6.755 ± 0.788
3.159ProGln: 3.159 ± 0.796
3.981ProArg: 3.981 ± 0.332
5.548ProSer: 5.548 ± 0.459
5.24ProThr: 5.24 ± 0.339
5.317ProVal: 5.317 ± 0.339
0.976ProTrp: 0.976 ± 0.159
1.515ProTyr: 1.515 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
3.339GlnAla: 3.339 ± 0.45
0.873GlnCys: 0.873 ± 0.15
1.798GlnAsp: 1.798 ± 0.299
5.162GlnGlu: 5.162 ± 2.532
1.567GlnPhe: 1.567 ± 0.194
2.568GlnGly: 2.568 ± 0.324
0.822GlnHis: 0.822 ± 0.125
1.567GlnIle: 1.567 ± 0.194
1.978GlnLys: 1.978 ± 0.256
3.827GlnLeu: 3.827 ± 0.373
0.796GlnMet: 0.796 ± 0.132
1.361GlnAsn: 1.361 ± 0.174
2.414GlnPro: 2.414 ± 0.323
3.698GlnGln: 3.698 ± 2.018
2.312GlnArg: 2.312 ± 0.295
3.544GlnSer: 3.544 ± 0.283
2.851GlnThr: 2.851 ± 0.263
2.106GlnVal: 2.106 ± 0.197
0.437GlnTrp: 0.437 ± 0.087
0.873GlnTyr: 0.873 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
4.469ArgAla: 4.469 ± 0.388
1.027ArgCys: 1.027 ± 0.149
3.031ArgAsp: 3.031 ± 0.232
3.596ArgGlu: 3.596 ± 0.32
1.901ArgPhe: 1.901 ± 0.214
4.674ArgGly: 4.674 ± 0.365
2.132ArgHis: 2.132 ± 0.277
2.286ArgIle: 2.286 ± 0.272
2.491ArgLys: 2.491 ± 0.241
5.959ArgLeu: 5.959 ± 0.372
1.413ArgMet: 1.413 ± 0.216
1.438ArgAsn: 1.438 ± 0.17
3.853ArgPro: 3.853 ± 0.356
2.517ArgGln: 2.517 ± 0.257
4.752ArgArg: 4.752 ± 0.434
3.467ArgSer: 3.467 ± 0.264
3.159ArgThr: 3.159 ± 0.297
4.238ArgVal: 4.238 ± 0.344
0.668ArgTrp: 0.668 ± 0.121
1.515ArgTyr: 1.515 ± 0.201
0.0ArgXaa: 0.0 ± 0.0
Ser
5.214SerAla: 5.214 ± 0.455
1.901SerCys: 1.901 ± 0.204
3.185SerAsp: 3.185 ± 0.323
3.262SerGlu: 3.262 ± 0.268
2.851SerPhe: 2.851 ± 0.275
5.317SerGly: 5.317 ± 0.553
2.594SerHis: 2.594 ± 0.236
3.365SerIle: 3.365 ± 0.306
2.774SerLys: 2.774 ± 0.34
7.448SerLeu: 7.448 ± 0.486
1.747SerMet: 1.747 ± 0.225
2.234SerAsn: 2.234 ± 0.258
6.421SerPro: 6.421 ± 0.685
3.185SerGln: 3.185 ± 0.287
4.058SerArg: 4.058 ± 0.313
7.217SerSer: 7.217 ± 0.675
5.522SerThr: 5.522 ± 0.327
5.317SerVal: 5.317 ± 0.439
1.181SerTrp: 1.181 ± 0.185
1.849SerTyr: 1.849 ± 0.214
0.0SerXaa: 0.0 ± 0.0
Thr
5.162ThrAla: 5.162 ± 0.46
1.413ThrCys: 1.413 ± 0.247
3.262ThrAsp: 3.262 ± 0.287
3.031ThrGlu: 3.031 ± 0.32
2.568ThrPhe: 2.568 ± 0.251
4.135ThrGly: 4.135 ± 0.393
2.183ThrHis: 2.183 ± 0.226
2.337ThrIle: 2.337 ± 0.275
2.029ThrLys: 2.029 ± 0.28
7.14ThrLeu: 7.14 ± 0.439
1.027ThrMet: 1.027 ± 0.161
1.695ThrAsn: 1.695 ± 0.262
5.65ThrPro: 5.65 ± 0.455
2.363ThrGln: 2.363 ± 0.24
3.365ThrArg: 3.365 ± 0.298
5.573ThrSer: 5.573 ± 0.385
4.469ThrThr: 4.469 ± 0.409
5.188ThrVal: 5.188 ± 0.443
1.13ThrTrp: 1.13 ± 0.186
2.08ThrTyr: 2.08 ± 0.233
0.0ThrXaa: 0.0 ± 0.0
Val
5.573ValAla: 5.573 ± 0.439
2.209ValCys: 2.209 ± 0.223
3.108ValAsp: 3.108 ± 0.28
3.673ValGlu: 3.673 ± 0.274
3.596ValPhe: 3.596 ± 0.312
4.084ValGly: 4.084 ± 0.39
1.926ValHis: 1.926 ± 0.233
3.288ValIle: 3.288 ± 0.315
2.234ValLys: 2.234 ± 0.26
6.755ValLeu: 6.755 ± 0.508
1.336ValMet: 1.336 ± 0.178
2.209ValAsn: 2.209 ± 0.359
4.546ValPro: 4.546 ± 0.342
2.003ValGln: 2.003 ± 0.189
3.724ValArg: 3.724 ± 0.335
6.138ValSer: 6.138 ± 0.446
4.52ValThr: 4.52 ± 0.309
5.137ValVal: 5.137 ± 0.413
0.848ValTrp: 0.848 ± 0.168
2.722ValTyr: 2.722 ± 0.326
0.0ValXaa: 0.0 ± 0.0
Trp
1.027TrpAla: 1.027 ± 0.161
0.257TrpCys: 0.257 ± 0.072
0.719TrpAsp: 0.719 ± 0.13
0.591TrpGlu: 0.591 ± 0.138
0.385TrpPhe: 0.385 ± 0.097
0.514TrpGly: 0.514 ± 0.102
0.514TrpHis: 0.514 ± 0.101
0.771TrpIle: 0.771 ± 0.164
0.308TrpLys: 0.308 ± 0.068
1.438TrpLeu: 1.438 ± 0.169
0.205TrpMet: 0.205 ± 0.069
0.385TrpAsn: 0.385 ± 0.11
0.693TrpPro: 0.693 ± 0.125
0.462TrpGln: 0.462 ± 0.111
0.796TrpArg: 0.796 ± 0.133
0.822TrpSer: 0.822 ± 0.137
0.873TrpThr: 0.873 ± 0.122
0.796TrpVal: 0.796 ± 0.142
0.103TrpTrp: 0.103 ± 0.06
0.539TrpTyr: 0.539 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.286TyrAla: 2.286 ± 0.248
1.002TyrCys: 1.002 ± 0.159
1.413TyrAsp: 1.413 ± 0.2
1.181TyrGlu: 1.181 ± 0.155
1.336TyrPhe: 1.336 ± 0.204
1.618TyrGly: 1.618 ± 0.196
0.771TyrHis: 0.771 ± 0.131
1.49TyrIle: 1.49 ± 0.217
1.13TyrLys: 1.13 ± 0.167
3.416TyrLeu: 3.416 ± 0.354
0.693TyrMet: 0.693 ± 0.143
1.387TyrAsn: 1.387 ± 0.219
1.413TyrPro: 1.413 ± 0.156
1.079TyrGln: 1.079 ± 0.172
2.003TyrArg: 2.003 ± 0.224
2.003TyrSer: 2.003 ± 0.243
1.772TyrThr: 1.772 ± 0.244
2.26TyrVal: 2.26 ± 0.244
0.462TyrTrp: 0.462 ± 0.098
0.873TyrTyr: 0.873 ± 0.127
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (38936 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski