Amino acid dipepetide frequency for Rhinolophus gammaherpesvirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.149AlaAla: 3.149 ± 0.404
1.157AlaCys: 1.157 ± 0.223
1.803AlaAsp: 1.803 ± 0.242
2.368AlaGlu: 2.368 ± 0.274
2.557AlaPhe: 2.557 ± 0.256
2.234AlaGly: 2.234 ± 0.227
1.238AlaHis: 1.238 ± 0.18
2.853AlaIle: 2.853 ± 0.268
2.476AlaLys: 2.476 ± 0.265
4.844AlaLeu: 4.844 ± 0.407
1.265AlaMet: 1.265 ± 0.188
1.938AlaAsn: 1.938 ± 0.219
3.283AlaPro: 3.283 ± 0.565
1.749AlaGln: 1.749 ± 0.223
1.776AlaArg: 1.776 ± 0.234
4.871AlaSer: 4.871 ± 0.448
3.822AlaThr: 3.822 ± 0.339
3.552AlaVal: 3.552 ± 0.255
0.484AlaTrp: 0.484 ± 0.108
1.13AlaTyr: 1.13 ± 0.145
0.0AlaXaa: 0.0 ± 0.0
Cys
1.05CysAla: 1.05 ± 0.178
0.646CysCys: 0.646 ± 0.137
1.184CysAsp: 1.184 ± 0.177
1.453CysGlu: 1.453 ± 0.23
1.184CysPhe: 1.184 ± 0.188
1.265CysGly: 1.265 ± 0.188
0.754CysHis: 0.754 ± 0.163
1.265CysIle: 1.265 ± 0.142
1.13CysLys: 1.13 ± 0.197
3.472CysLeu: 3.472 ± 0.359
0.646CysMet: 0.646 ± 0.123
1.05CysAsn: 1.05 ± 0.143
1.292CysPro: 1.292 ± 0.178
1.184CysGln: 1.184 ± 0.202
1.103CysArg: 1.103 ± 0.166
1.803CysSer: 1.803 ± 0.223
1.157CysThr: 1.157 ± 0.169
1.696CysVal: 1.696 ± 0.302
0.296CysTrp: 0.296 ± 0.082
0.834CysTyr: 0.834 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
2.315AspAla: 2.315 ± 0.206
1.184AspCys: 1.184 ± 0.184
3.122AspAsp: 3.122 ± 0.429
3.014AspGlu: 3.014 ± 0.389
2.853AspPhe: 2.853 ± 0.251
2.341AspGly: 2.341 ± 0.237
0.969AspHis: 0.969 ± 0.154
3.337AspIle: 3.337 ± 0.354
1.48AspLys: 1.48 ± 0.251
4.898AspLeu: 4.898 ± 0.382
1.238AspMet: 1.238 ± 0.201
2.207AspAsn: 2.207 ± 0.251
3.149AspPro: 3.149 ± 0.273
1.373AspGln: 1.373 ± 0.187
1.911AspArg: 1.911 ± 0.269
4.306AspSer: 4.306 ± 0.389
3.203AspThr: 3.203 ± 0.262
3.499AspVal: 3.499 ± 0.354
0.646AspTrp: 0.646 ± 0.117
1.319AspTyr: 1.319 ± 0.146
0.0AspXaa: 0.0 ± 0.0
Glu
3.445GluAla: 3.445 ± 0.437
1.292GluCys: 1.292 ± 0.224
3.552GluAsp: 3.552 ± 0.408
5.759GluGlu: 5.759 ± 2.187
2.099GluPhe: 2.099 ± 0.217
2.072GluGly: 2.072 ± 0.305
1.48GluHis: 1.48 ± 0.19
3.364GluIle: 3.364 ± 0.319
2.611GluLys: 2.611 ± 0.242
4.441GluLeu: 4.441 ± 0.326
1.157GluMet: 1.157 ± 0.169
2.718GluAsn: 2.718 ± 0.307
3.122GluPro: 3.122 ± 1.464
1.669GluGln: 1.669 ± 0.204
1.938GluArg: 1.938 ± 0.215
4.306GluSer: 4.306 ± 0.319
4.118GluThr: 4.118 ± 0.3
3.445GluVal: 3.445 ± 0.31
0.404GluTrp: 0.404 ± 0.101
1.642GluTyr: 1.642 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
1.669PheAla: 1.669 ± 0.186
1.238PheCys: 1.238 ± 0.24
1.857PheAsp: 1.857 ± 0.239
2.261PheGlu: 2.261 ± 0.249
2.96PhePhe: 2.96 ± 0.336
2.422PheGly: 2.422 ± 0.28
1.373PheHis: 1.373 ± 0.177
3.391PheIle: 3.391 ± 0.301
3.391PheLys: 3.391 ± 0.28
5.921PheLeu: 5.921 ± 0.503
1.265PheMet: 1.265 ± 0.151
2.691PheAsn: 2.691 ± 0.285
2.18PhePro: 2.18 ± 0.238
2.53PheGln: 2.53 ± 0.278
2.126PheArg: 2.126 ± 0.238
4.118PheSer: 4.118 ± 0.354
2.422PheThr: 2.422 ± 0.27
3.445PheVal: 3.445 ± 0.273
0.404PheTrp: 0.404 ± 0.119
2.315PheTyr: 2.315 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
2.234GlyAla: 2.234 ± 0.244
1.184GlyCys: 1.184 ± 0.185
2.799GlyAsp: 2.799 ± 0.309
3.014GlyGlu: 3.014 ± 0.375
2.449GlyPhe: 2.449 ± 0.274
2.611GlyGly: 2.611 ± 0.245
1.373GlyHis: 1.373 ± 0.235
2.664GlyIle: 2.664 ± 0.227
2.18GlyLys: 2.18 ± 0.251
4.898GlyLeu: 4.898 ± 0.342
1.292GlyMet: 1.292 ± 0.156
2.315GlyAsn: 2.315 ± 0.227
2.261GlyPro: 2.261 ± 0.257
2.018GlyGln: 2.018 ± 0.309
2.207GlyArg: 2.207 ± 0.275
3.714GlySer: 3.714 ± 0.365
2.691GlyThr: 2.691 ± 0.236
3.714GlyVal: 3.714 ± 0.394
0.296GlyTrp: 0.296 ± 0.062
1.507GlyTyr: 1.507 ± 0.227
0.0GlyXaa: 0.0 ± 0.0
His
1.426HisAla: 1.426 ± 0.18
0.592HisCys: 0.592 ± 0.112
0.969HisAsp: 0.969 ± 0.172
1.077HisGlu: 1.077 ± 0.147
1.319HisPhe: 1.319 ± 0.221
1.211HisGly: 1.211 ± 0.211
0.834HisHis: 0.834 ± 0.185
1.884HisIle: 1.884 ± 0.178
1.722HisLys: 1.722 ± 0.183
2.826HisLeu: 2.826 ± 0.301
0.7HisMet: 0.7 ± 0.12
1.399HisAsn: 1.399 ± 0.212
2.126HisPro: 2.126 ± 0.306
1.373HisGln: 1.373 ± 0.215
1.157HisArg: 1.157 ± 0.178
2.503HisSer: 2.503 ± 0.249
1.884HisThr: 1.884 ± 0.27
1.669HisVal: 1.669 ± 0.228
0.377HisTrp: 0.377 ± 0.088
0.727HisTyr: 0.727 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
1.749IleAla: 1.749 ± 0.236
1.965IleCys: 1.965 ± 0.265
2.907IleAsp: 2.907 ± 0.211
2.368IleGlu: 2.368 ± 0.29
3.364IlePhe: 3.364 ± 0.296
2.045IleGly: 2.045 ± 0.216
1.642IleHis: 1.642 ± 0.213
3.364IleIle: 3.364 ± 0.339
3.203IleLys: 3.203 ± 0.348
6.244IleLeu: 6.244 ± 0.388
1.292IleMet: 1.292 ± 0.177
2.987IleAsn: 2.987 ± 0.31
3.552IlePro: 3.552 ± 0.28
3.095IleGln: 3.095 ± 0.288
2.126IleArg: 2.126 ± 0.309
5.598IleSer: 5.598 ± 0.411
3.472IleThr: 3.472 ± 0.34
2.907IleVal: 2.907 ± 0.314
0.7IleTrp: 0.7 ± 0.176
2.745IleTyr: 2.745 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
2.476LysAla: 2.476 ± 0.246
1.05LysCys: 1.05 ± 0.147
2.664LysAsp: 2.664 ± 0.286
2.718LysGlu: 2.718 ± 0.225
2.099LysPhe: 2.099 ± 0.261
2.315LysGly: 2.315 ± 0.202
1.776LysHis: 1.776 ± 0.218
3.526LysIle: 3.526 ± 0.288
4.306LysLys: 4.306 ± 0.402
5.625LysLeu: 5.625 ± 0.474
1.911LysMet: 1.911 ± 0.262
3.176LysAsn: 3.176 ± 0.27
3.014LysPro: 3.014 ± 0.388
2.449LysGln: 2.449 ± 0.265
3.23LysArg: 3.23 ± 0.324
4.387LysSer: 4.387 ± 0.328
3.579LysThr: 3.579 ± 0.383
3.149LysVal: 3.149 ± 0.249
0.565LysTrp: 0.565 ± 0.125
1.669LysTyr: 1.669 ± 0.213
0.0LysXaa: 0.0 ± 0.0
Leu
5.113LeuAla: 5.113 ± 0.41
2.718LeuCys: 2.718 ± 0.298
4.521LeuAsp: 4.521 ± 0.323
5.652LeuGlu: 5.652 ± 0.442
4.817LeuPhe: 4.817 ± 0.404
4.225LeuGly: 4.225 ± 0.378
2.987LeuHis: 2.987 ± 0.329
4.414LeuIle: 4.414 ± 0.322
6.674LeuLys: 6.674 ± 0.452
9.608LeuLeu: 9.608 ± 0.644
2.557LeuMet: 2.557 ± 0.254
5.652LeuAsn: 5.652 ± 0.386
5.921LeuPro: 5.921 ± 0.466
3.875LeuGln: 3.875 ± 0.268
3.822LeuArg: 3.822 ± 0.37
10.254LeuSer: 10.254 ± 0.612
7.213LeuThr: 7.213 ± 0.638
6.486LeuVal: 6.486 ± 0.433
1.211LeuTrp: 1.211 ± 0.207
3.445LeuTyr: 3.445 ± 0.322
0.0LeuXaa: 0.0 ± 0.0
Met
1.965MetAla: 1.965 ± 0.242
0.861MetCys: 0.861 ± 0.188
0.996MetAsp: 0.996 ± 0.16
1.534MetGlu: 1.534 ± 0.199
1.615MetPhe: 1.615 ± 0.242
1.453MetGly: 1.453 ± 0.208
0.673MetHis: 0.673 ± 0.133
1.453MetIle: 1.453 ± 0.199
0.942MetLys: 0.942 ± 0.178
2.207MetLeu: 2.207 ± 0.248
0.78MetMet: 0.78 ± 0.146
0.807MetAsn: 0.807 ± 0.145
0.861MetPro: 0.861 ± 0.163
0.7MetGln: 0.7 ± 0.137
1.05MetArg: 1.05 ± 0.193
2.207MetSer: 2.207 ± 0.273
1.426MetThr: 1.426 ± 0.21
1.615MetVal: 1.615 ± 0.286
0.377MetTrp: 0.377 ± 0.096
1.157MetTyr: 1.157 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
1.561AsnAla: 1.561 ± 0.217
1.05AsnCys: 1.05 ± 0.202
1.319AsnAsp: 1.319 ± 0.171
1.399AsnGlu: 1.399 ± 0.185
2.422AsnPhe: 2.422 ± 0.233
2.099AsnGly: 2.099 ± 0.276
1.023AsnHis: 1.023 ± 0.153
3.795AsnIle: 3.795 ± 0.36
2.745AsnLys: 2.745 ± 0.312
5.033AsnLeu: 5.033 ± 0.303
1.13AsnMet: 1.13 ± 0.206
2.288AsnAsn: 2.288 ± 0.283
3.472AsnPro: 3.472 ± 0.392
1.938AsnGln: 1.938 ± 0.24
1.965AsnArg: 1.965 ± 0.243
5.113AsnSer: 5.113 ± 0.406
3.23AsnThr: 3.23 ± 0.272
3.445AsnVal: 3.445 ± 0.287
0.431AsnTrp: 0.431 ± 0.103
1.722AsnTyr: 1.722 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
2.826ProAla: 2.826 ± 0.541
1.373ProCys: 1.373 ± 0.181
2.88ProAsp: 2.88 ± 0.29
3.687ProGlu: 3.687 ± 1.23
2.288ProPhe: 2.288 ± 0.205
3.579ProGly: 3.579 ± 0.518
1.426ProHis: 1.426 ± 0.182
3.256ProIle: 3.256 ± 0.328
3.606ProLys: 3.606 ± 0.368
6.863ProLeu: 6.863 ± 0.831
0.969ProMet: 0.969 ± 0.168
2.18ProAsn: 2.18 ± 0.328
4.468ProPro: 4.468 ± 0.765
2.772ProGln: 2.772 ± 0.385
2.18ProArg: 2.18 ± 0.27
5.302ProSer: 5.302 ± 0.443
3.983ProThr: 3.983 ± 0.506
4.333ProVal: 4.333 ± 0.277
0.727ProTrp: 0.727 ± 0.125
1.292ProTyr: 1.292 ± 0.189
0.0ProXaa: 0.0 ± 0.0
Gln
2.53GlnAla: 2.53 ± 0.275
0.915GlnCys: 0.915 ± 0.172
1.965GlnAsp: 1.965 ± 0.241
2.288GlnGlu: 2.288 ± 0.291
1.992GlnPhe: 1.992 ± 0.26
2.18GlnGly: 2.18 ± 0.229
1.023GlnHis: 1.023 ± 0.148
2.422GlnIle: 2.422 ± 0.228
2.557GlnLys: 2.557 ± 0.302
3.849GlnLeu: 3.849 ± 0.348
1.103GlnMet: 1.103 ± 0.146
1.992GlnAsn: 1.992 ± 0.232
2.368GlnPro: 2.368 ± 0.336
1.696GlnGln: 1.696 ± 0.27
1.696GlnArg: 1.696 ± 0.183
3.499GlnSer: 3.499 ± 0.595
3.687GlnThr: 3.687 ± 0.466
3.014GlnVal: 3.014 ± 0.625
0.565GlnTrp: 0.565 ± 0.121
1.184GlnTyr: 1.184 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
2.745ArgAla: 2.745 ± 0.265
1.184ArgCys: 1.184 ± 0.186
3.149ArgAsp: 3.149 ± 0.304
2.907ArgGlu: 2.907 ± 0.272
1.911ArgPhe: 1.911 ± 0.226
2.422ArgGly: 2.422 ± 0.32
1.426ArgHis: 1.426 ± 0.199
1.722ArgIle: 1.722 ± 0.209
2.422ArgLys: 2.422 ± 0.274
3.66ArgLeu: 3.66 ± 0.288
0.888ArgMet: 0.888 ± 0.139
1.426ArgAsn: 1.426 ± 0.249
2.341ArgPro: 2.341 ± 0.241
1.588ArgGln: 1.588 ± 0.222
2.315ArgArg: 2.315 ± 0.262
2.745ArgSer: 2.745 ± 0.243
2.207ArgThr: 2.207 ± 0.257
2.96ArgVal: 2.96 ± 0.345
0.458ArgTrp: 0.458 ± 0.108
1.319ArgTyr: 1.319 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
3.633SerAla: 3.633 ± 0.357
1.83SerCys: 1.83 ± 0.202
4.387SerAsp: 4.387 ± 0.353
4.683SerGlu: 4.683 ± 0.417
4.441SerPhe: 4.441 ± 0.314
4.925SerGly: 4.925 ± 0.417
3.418SerHis: 3.418 ± 0.335
4.468SerIle: 4.468 ± 0.419
5.759SerLys: 5.759 ± 0.387
10.442SerLeu: 10.442 ± 0.645
2.18SerMet: 2.18 ± 0.194
3.014SerAsn: 3.014 ± 0.279
5.436SerPro: 5.436 ± 0.526
4.898SerGln: 4.898 ± 0.684
3.687SerArg: 3.687 ± 0.342
9.258SerSer: 9.258 ± 0.719
5.571SerThr: 5.571 ± 0.636
6.647SerVal: 6.647 ± 0.454
0.592SerTrp: 0.592 ± 0.115
2.368SerTyr: 2.368 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
3.337ThrAla: 3.337 ± 0.461
1.696ThrCys: 1.696 ± 0.204
3.552ThrAsp: 3.552 ± 0.318
3.391ThrGlu: 3.391 ± 0.343
3.095ThrPhe: 3.095 ± 0.325
2.772ThrGly: 2.772 ± 0.254
1.722ThrHis: 1.722 ± 0.247
3.552ThrIle: 3.552 ± 0.275
2.664ThrLys: 2.664 ± 0.249
6.486ThrLeu: 6.486 ± 0.47
1.13ThrMet: 1.13 ± 0.187
3.203ThrAsn: 3.203 ± 0.332
4.737ThrPro: 4.737 ± 0.48
3.552ThrGln: 3.552 ± 0.597
3.256ThrArg: 3.256 ± 0.298
5.975ThrSer: 5.975 ± 0.555
5.356ThrThr: 5.356 ± 0.881
4.279ThrVal: 4.279 ± 0.353
0.727ThrTrp: 0.727 ± 0.159
2.072ThrTyr: 2.072 ± 0.178
0.0ThrXaa: 0.0 ± 0.0
Val
3.391ValAla: 3.391 ± 0.334
1.669ValCys: 1.669 ± 0.217
2.88ValAsp: 2.88 ± 0.336
3.23ValGlu: 3.23 ± 0.284
4.279ValPhe: 4.279 ± 0.332
3.256ValGly: 3.256 ± 0.379
1.399ValHis: 1.399 ± 0.201
3.579ValIle: 3.579 ± 0.299
3.552ValLys: 3.552 ± 0.351
5.786ValLeu: 5.786 ± 0.422
1.992ValMet: 1.992 ± 0.245
3.66ValAsn: 3.66 ± 0.352
4.306ValPro: 4.306 ± 0.612
2.476ValGln: 2.476 ± 0.238
2.53ValArg: 2.53 ± 0.282
7.536ValSer: 7.536 ± 0.616
4.414ValThr: 4.414 ± 0.296
4.602ValVal: 4.602 ± 0.359
0.404ValTrp: 0.404 ± 0.126
2.745ValTyr: 2.745 ± 0.288
0.0ValXaa: 0.0 ± 0.0
Trp
0.538TrpAla: 0.538 ± 0.096
0.296TrpCys: 0.296 ± 0.089
0.458TrpAsp: 0.458 ± 0.135
0.377TrpGlu: 0.377 ± 0.112
0.484TrpPhe: 0.484 ± 0.112
0.484TrpGly: 0.484 ± 0.11
0.323TrpHis: 0.323 ± 0.086
0.646TrpIle: 0.646 ± 0.133
0.484TrpLys: 0.484 ± 0.118
1.077TrpLeu: 1.077 ± 0.183
0.215TrpMet: 0.215 ± 0.076
0.484TrpAsn: 0.484 ± 0.094
0.673TrpPro: 0.673 ± 0.109
0.458TrpGln: 0.458 ± 0.122
0.35TrpArg: 0.35 ± 0.105
0.78TrpSer: 0.78 ± 0.152
0.861TrpThr: 0.861 ± 0.136
0.834TrpVal: 0.834 ± 0.142
0.054TrpTrp: 0.054 ± 0.04
0.215TrpTyr: 0.215 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.399TyrAla: 1.399 ± 0.213
0.565TyrCys: 0.565 ± 0.14
1.399TyrAsp: 1.399 ± 0.207
1.453TyrGlu: 1.453 ± 0.174
1.749TyrPhe: 1.749 ± 0.287
1.507TyrGly: 1.507 ± 0.194
0.915TyrHis: 0.915 ± 0.153
2.368TyrIle: 2.368 ± 0.304
1.776TyrLys: 1.776 ± 0.236
2.987TyrLeu: 2.987 ± 0.31
0.942TyrMet: 0.942 ± 0.163
1.992TyrAsn: 1.992 ± 0.306
1.426TyrPro: 1.426 ± 0.196
1.13TyrGln: 1.13 ± 0.173
1.399TyrArg: 1.399 ± 0.206
3.364TyrSer: 3.364 ± 0.291
2.261TyrThr: 2.261 ± 0.308
2.395TyrVal: 2.395 ± 0.336
0.377TyrTrp: 0.377 ± 0.106
1.103TyrTyr: 1.103 ± 0.176
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (37158 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski