Amino acid dipepetide frequency for Berne virus (BEV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.155AlaAla: 3.155 ± 1.2
0.954AlaCys: 0.954 ± 0.201
2.421AlaAsp: 2.421 ± 0.292
1.101AlaGlu: 1.101 ± 0.256
2.715AlaPhe: 2.715 ± 0.124
1.394AlaGly: 1.394 ± 0.268
0.734AlaHis: 0.734 ± 0.2
3.375AlaIle: 3.375 ± 0.83
2.495AlaLys: 2.495 ± 0.477
4.109AlaLeu: 4.109 ± 0.94
0.734AlaMet: 0.734 ± 0.466
2.788AlaAsn: 2.788 ± 0.35
2.568AlaPro: 2.568 ± 0.301
1.908AlaGln: 1.908 ± 0.593
2.935AlaArg: 2.935 ± 0.204
2.348AlaSer: 2.348 ± 0.242
3.302AlaThr: 3.302 ± 1.103
3.889AlaVal: 3.889 ± 0.316
0.88AlaTrp: 0.88 ± 0.14
2.568AlaTyr: 2.568 ± 0.354
0.0AlaXaa: 0.0 ± 0.0
Cys
1.101CysAla: 1.101 ± 0.203
0.734CysCys: 0.734 ± 0.194
2.641CysAsp: 2.641 ± 0.478
1.614CysGlu: 1.614 ± 0.42
1.394CysPhe: 1.394 ± 0.434
1.688CysGly: 1.688 ± 0.333
0.44CysHis: 0.44 ± 0.295
1.247CysIle: 1.247 ± 0.615
1.247CysLys: 1.247 ± 0.154
3.302CysLeu: 3.302 ± 0.697
0.44CysMet: 0.44 ± 0.146
1.467CysAsn: 1.467 ± 0.136
1.614CysPro: 1.614 ± 0.211
1.467CysGln: 1.467 ± 0.221
1.027CysArg: 1.027 ± 0.121
1.467CysSer: 1.467 ± 0.306
2.128CysThr: 2.128 ± 0.235
2.715CysVal: 2.715 ± 0.268
0.44CysTrp: 0.44 ± 0.122
1.981CysTyr: 1.981 ± 0.399
0.0CysXaa: 0.0 ± 0.0
Asp
1.981AspAla: 1.981 ± 0.381
1.688AspCys: 1.688 ± 0.252
2.862AspAsp: 2.862 ± 0.278
2.788AspGlu: 2.788 ± 0.603
4.036AspPhe: 4.036 ± 0.644
2.348AspGly: 2.348 ± 0.178
0.88AspHis: 0.88 ± 0.183
3.375AspIle: 3.375 ± 0.504
2.201AspLys: 2.201 ± 0.271
5.136AspLeu: 5.136 ± 0.751
1.101AspMet: 1.101 ± 0.203
3.082AspAsn: 3.082 ± 0.504
1.688AspPro: 1.688 ± 0.157
3.889AspGln: 3.889 ± 0.581
0.807AspArg: 0.807 ± 0.117
2.568AspSer: 2.568 ± 0.312
2.201AspThr: 2.201 ± 0.358
4.476AspVal: 4.476 ± 0.274
0.293AspTrp: 0.293 ± 0.063
2.788AspTyr: 2.788 ± 0.422
0.0AspXaa: 0.0 ± 0.0
Glu
1.761GluAla: 1.761 ± 0.154
1.467GluCys: 1.467 ± 0.294
3.302GluAsp: 3.302 ± 0.488
1.688GluGlu: 1.688 ± 0.436
3.449GluPhe: 3.449 ± 0.802
2.128GluGly: 2.128 ± 0.35
1.321GluHis: 1.321 ± 0.256
2.054GluIle: 2.054 ± 0.445
3.375GluLys: 3.375 ± 0.769
3.155GluLeu: 3.155 ± 0.378
1.027GluMet: 1.027 ± 0.155
2.201GluAsn: 2.201 ± 0.329
2.054GluPro: 2.054 ± 0.276
4.256GluGln: 4.256 ± 0.519
1.321GluArg: 1.321 ± 0.171
3.449GluSer: 3.449 ± 0.377
1.394GluThr: 1.394 ± 0.307
3.889GluVal: 3.889 ± 0.595
0.88GluTrp: 0.88 ± 0.142
2.201GluTyr: 2.201 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
2.275PheAla: 2.275 ± 0.444
2.054PheCys: 2.054 ± 0.325
4.036PheAsp: 4.036 ± 0.392
3.962PheGlu: 3.962 ± 0.494
2.641PhePhe: 2.641 ± 0.311
3.742PheGly: 3.742 ± 0.442
0.66PheHis: 0.66 ± 0.125
2.641PheIle: 2.641 ± 0.367
6.677PheLys: 6.677 ± 1.027
4.989PheLeu: 4.989 ± 0.466
1.467PheMet: 1.467 ± 0.294
3.669PheAsn: 3.669 ± 0.227
1.027PhePro: 1.027 ± 0.294
3.082PheGln: 3.082 ± 0.199
1.614PheArg: 1.614 ± 0.458
5.209PheSer: 5.209 ± 0.829
3.522PheThr: 3.522 ± 0.427
6.75PheVal: 6.75 ± 0.77
1.027PheTrp: 1.027 ± 0.152
4.036PheTyr: 4.036 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
1.981GlyAla: 1.981 ± 0.347
1.027GlyCys: 1.027 ± 0.276
2.568GlyAsp: 2.568 ± 0.612
2.935GlyGlu: 2.935 ± 0.588
3.595GlyPhe: 3.595 ± 0.331
2.054GlyGly: 2.054 ± 0.268
1.541GlyHis: 1.541 ± 0.401
2.715GlyIle: 2.715 ± 0.233
3.008GlyLys: 3.008 ± 0.427
3.962GlyLeu: 3.962 ± 0.356
0.807GlyMet: 0.807 ± 0.223
1.614GlyAsn: 1.614 ± 0.593
1.541GlyPro: 1.541 ± 0.124
2.715GlyGln: 2.715 ± 0.366
1.467GlyArg: 1.467 ± 0.173
3.155GlySer: 3.155 ± 0.855
2.201GlyThr: 2.201 ± 0.312
4.256GlyVal: 4.256 ± 0.483
0.514GlyTrp: 0.514 ± 0.27
2.495GlyTyr: 2.495 ± 0.193
0.0GlyXaa: 0.0 ± 0.0
His
0.807HisAla: 0.807 ± 0.248
0.293HisCys: 0.293 ± 0.161
0.66HisAsp: 0.66 ± 0.194
0.66HisGlu: 0.66 ± 0.171
1.541HisPhe: 1.541 ± 0.189
1.321HisGly: 1.321 ± 0.17
0.66HisHis: 0.66 ± 0.114
1.101HisIle: 1.101 ± 0.324
0.807HisLys: 0.807 ± 0.223
2.788HisLeu: 2.788 ± 0.394
0.587HisMet: 0.587 ± 0.157
1.541HisAsn: 1.541 ± 0.212
0.954HisPro: 0.954 ± 0.435
0.807HisGln: 0.807 ± 0.148
0.954HisArg: 0.954 ± 0.148
1.688HisSer: 1.688 ± 0.216
0.88HisThr: 0.88 ± 0.196
1.981HisVal: 1.981 ± 0.256
0.514HisTrp: 0.514 ± 0.089
2.128HisTyr: 2.128 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
1.908IleAla: 1.908 ± 1.302
1.688IleCys: 1.688 ± 0.245
0.66IleAsp: 0.66 ± 0.153
2.421IleGlu: 2.421 ± 0.228
3.228IlePhe: 3.228 ± 0.315
2.348IleGly: 2.348 ± 0.544
1.174IleHis: 1.174 ± 0.203
3.302IleIle: 3.302 ± 0.616
2.862IleLys: 2.862 ± 0.424
5.136IleLeu: 5.136 ± 0.475
1.321IleMet: 1.321 ± 0.354
1.688IleAsn: 1.688 ± 0.386
2.054IlePro: 2.054 ± 0.281
2.128IleGln: 2.128 ± 0.236
1.247IleArg: 1.247 ± 0.22
3.889IleSer: 3.889 ± 1.362
4.696IleThr: 4.696 ± 0.718
5.356IleVal: 5.356 ± 0.187
0.88IleTrp: 0.88 ± 0.263
1.761IleTyr: 1.761 ± 0.189
0.0IleXaa: 0.0 ± 0.0
Lys
2.568LysAla: 2.568 ± 0.466
2.054LysCys: 2.054 ± 0.451
2.788LysAsp: 2.788 ± 0.559
2.128LysGlu: 2.128 ± 0.257
2.641LysPhe: 2.641 ± 0.46
2.275LysGly: 2.275 ± 0.482
1.834LysHis: 1.834 ± 0.369
2.348LysIle: 2.348 ± 0.417
2.641LysLys: 2.641 ± 0.418
6.383LysLeu: 6.383 ± 0.471
1.321LysMet: 1.321 ± 0.342
3.008LysAsn: 3.008 ± 0.48
3.155LysPro: 3.155 ± 0.612
2.568LysGln: 2.568 ± 0.466
2.201LysArg: 2.201 ± 0.349
3.742LysSer: 3.742 ± 0.533
3.082LysThr: 3.082 ± 0.205
5.43LysVal: 5.43 ± 0.697
0.734LysTrp: 0.734 ± 0.203
2.201LysTyr: 2.201 ± 0.413
0.0LysXaa: 0.0 ± 0.0
Leu
5.209LeuAla: 5.209 ± 0.475
2.788LeuCys: 2.788 ± 0.312
6.31LeuAsp: 6.31 ± 0.654
3.522LeuGlu: 3.522 ± 0.323
6.237LeuPhe: 6.237 ± 0.435
4.476LeuGly: 4.476 ± 0.439
2.275LeuHis: 2.275 ± 0.301
4.036LeuIle: 4.036 ± 0.244
5.796LeuLys: 5.796 ± 1.136
6.75LeuLeu: 6.75 ± 0.751
2.348LeuMet: 2.348 ± 0.386
3.595LeuAsn: 3.595 ± 0.204
6.163LeuPro: 6.163 ± 0.967
5.209LeuGln: 5.209 ± 0.776
2.275LeuArg: 2.275 ± 0.136
10.639LeuSer: 10.639 ± 1.24
6.53LeuThr: 6.53 ± 0.745
7.191LeuVal: 7.191 ± 0.685
1.247LeuTrp: 1.247 ± 0.26
3.669LeuTyr: 3.669 ± 0.488
0.0LeuXaa: 0.0 ± 0.0
Met
0.88MetAla: 0.88 ± 0.33
0.807MetCys: 0.807 ± 0.21
0.367MetAsp: 0.367 ± 0.163
0.807MetGlu: 0.807 ± 0.162
1.834MetPhe: 1.834 ± 0.773
1.247MetGly: 1.247 ± 0.324
0.22MetHis: 0.22 ± 0.083
1.027MetIle: 1.027 ± 0.121
0.734MetLys: 0.734 ± 0.192
2.641MetLeu: 2.641 ± 0.332
0.954MetMet: 0.954 ± 0.269
0.954MetAsn: 0.954 ± 0.728
1.467MetPro: 1.467 ± 0.244
0.734MetGln: 0.734 ± 0.834
1.247MetArg: 1.247 ± 0.192
2.054MetSer: 2.054 ± 0.236
1.688MetThr: 1.688 ± 0.342
1.174MetVal: 1.174 ± 0.203
0.66MetTrp: 0.66 ± 0.25
0.88MetTyr: 0.88 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
1.834AsnAla: 1.834 ± 0.369
1.834AsnCys: 1.834 ± 0.272
2.495AsnAsp: 2.495 ± 0.381
2.715AsnGlu: 2.715 ± 0.461
3.889AsnPhe: 3.889 ± 0.39
2.935AsnGly: 2.935 ± 0.696
0.88AsnHis: 0.88 ± 0.15
3.228AsnIle: 3.228 ± 0.396
1.834AsnLys: 1.834 ± 0.36
3.815AsnLeu: 3.815 ± 0.524
0.88AsnMet: 0.88 ± 0.406
1.761AsnAsn: 1.761 ± 1.023
1.981AsnPro: 1.981 ± 0.326
2.201AsnGln: 2.201 ± 1.086
1.614AsnArg: 1.614 ± 1.629
2.935AsnSer: 2.935 ± 0.56
3.008AsnThr: 3.008 ± 0.428
5.356AsnVal: 5.356 ± 0.575
0.734AsnTrp: 0.734 ± 0.11
2.495AsnTyr: 2.495 ± 0.727
0.0AsnXaa: 0.0 ± 0.0
Pro
1.761ProAla: 1.761 ± 0.172
1.027ProCys: 1.027 ± 0.178
1.981ProAsp: 1.981 ± 0.169
1.981ProGlu: 1.981 ± 0.252
3.815ProPhe: 3.815 ± 0.518
1.321ProGly: 1.321 ± 0.325
1.027ProHis: 1.027 ± 0.121
2.641ProIle: 2.641 ± 0.339
1.761ProLys: 1.761 ± 0.297
5.65ProLeu: 5.65 ± 0.478
1.247ProMet: 1.247 ± 0.653
2.348ProAsn: 2.348 ± 0.503
2.275ProPro: 2.275 ± 0.255
1.981ProGln: 1.981 ± 0.296
1.761ProArg: 1.761 ± 0.351
4.182ProSer: 4.182 ± 1.554
3.302ProThr: 3.302 ± 0.202
4.476ProVal: 4.476 ± 0.563
0.44ProTrp: 0.44 ± 0.129
1.174ProTyr: 1.174 ± 0.218
0.0ProXaa: 0.0 ± 0.0
Gln
3.595GlnAla: 3.595 ± 0.445
2.054GlnCys: 2.054 ± 0.393
2.935GlnAsp: 2.935 ± 0.343
2.568GlnGlu: 2.568 ± 0.476
3.302GlnPhe: 3.302 ± 0.479
2.275GlnGly: 2.275 ± 0.131
0.734GlnHis: 0.734 ± 0.119
3.082GlnIle: 3.082 ± 0.786
2.421GlnLys: 2.421 ± 0.523
5.796GlnLeu: 5.796 ± 0.285
0.807GlnMet: 0.807 ± 0.157
2.421GlnAsn: 2.421 ± 1.604
3.082GlnPro: 3.082 ± 0.303
4.916GlnGln: 4.916 ± 0.427
1.467GlnArg: 1.467 ± 0.78
3.449GlnSer: 3.449 ± 0.49
2.201GlnThr: 2.201 ± 0.331
4.182GlnVal: 4.182 ± 0.702
0.587GlnTrp: 0.587 ± 0.148
1.981GlnTyr: 1.981 ± 0.437
0.0GlnXaa: 0.0 ± 0.0
Arg
1.981ArgAla: 1.981 ± 0.121
0.954ArgCys: 0.954 ± 0.25
1.101ArgAsp: 1.101 ± 0.203
1.247ArgGlu: 1.247 ± 0.223
1.688ArgPhe: 1.688 ± 0.22
1.688ArgGly: 1.688 ± 0.585
0.587ArgHis: 0.587 ± 0.072
1.027ArgIle: 1.027 ± 0.302
1.688ArgLys: 1.688 ± 0.247
3.889ArgLeu: 3.889 ± 0.509
0.514ArgMet: 0.514 ± 0.089
1.027ArgAsn: 1.027 ± 0.715
2.128ArgPro: 2.128 ± 0.129
1.467ArgGln: 1.467 ± 1.534
2.128ArgArg: 2.128 ± 0.905
2.275ArgSer: 2.275 ± 0.243
1.614ArgThr: 1.614 ± 0.337
3.155ArgVal: 3.155 ± 0.174
0.22ArgTrp: 0.22 ± 0.065
2.054ArgTyr: 2.054 ± 0.24
0.0ArgXaa: 0.0 ± 0.0
Ser
3.522SerAla: 3.522 ± 0.386
2.348SerCys: 2.348 ± 0.301
4.402SerAsp: 4.402 ± 0.419
3.228SerGlu: 3.228 ± 0.098
3.889SerPhe: 3.889 ± 0.267
3.962SerGly: 3.962 ± 1.163
1.981SerHis: 1.981 ± 0.406
3.889SerIle: 3.889 ± 0.736
3.889SerLys: 3.889 ± 0.772
7.117SerLeu: 7.117 ± 1.336
1.761SerMet: 1.761 ± 0.724
4.622SerAsn: 4.622 ± 0.755
3.449SerPro: 3.449 ± 0.24
3.008SerGln: 3.008 ± 0.501
2.641SerArg: 2.641 ± 0.194
6.53SerSer: 6.53 ± 0.767
4.402SerThr: 4.402 ± 0.951
6.53SerVal: 6.53 ± 0.273
1.394SerTrp: 1.394 ± 0.549
3.155SerTyr: 3.155 ± 0.592
0.0SerXaa: 0.0 ± 0.0
Thr
3.155ThrAla: 3.155 ± 1.107
1.394ThrCys: 1.394 ± 0.202
1.908ThrAsp: 1.908 ± 0.272
2.348ThrGlu: 2.348 ± 0.546
3.815ThrPhe: 3.815 ± 0.361
3.228ThrGly: 3.228 ± 0.246
2.054ThrHis: 2.054 ± 0.279
2.862ThrIle: 2.862 ± 0.681
2.641ThrLys: 2.641 ± 0.439
6.604ThrLeu: 6.604 ± 1.01
1.467ThrMet: 1.467 ± 0.164
1.688ThrAsn: 1.688 ± 0.232
3.008ThrPro: 3.008 ± 0.723
3.302ThrGln: 3.302 ± 0.516
1.467ThrArg: 1.467 ± 0.164
5.209ThrSer: 5.209 ± 0.466
4.696ThrThr: 4.696 ± 0.507
4.109ThrVal: 4.109 ± 0.383
1.174ThrTrp: 1.174 ± 0.218
2.715ThrTyr: 2.715 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
4.696ValAla: 4.696 ± 0.76
2.862ValCys: 2.862 ± 0.39
4.036ValAsp: 4.036 ± 0.419
6.237ValGlu: 6.237 ± 1.334
6.237ValPhe: 6.237 ± 0.941
3.962ValGly: 3.962 ± 0.475
1.834ValHis: 1.834 ± 0.319
2.788ValIle: 2.788 ± 0.652
5.87ValLys: 5.87 ± 0.628
9.465ValLeu: 9.465 ± 0.709
2.201ValMet: 2.201 ± 0.323
5.283ValAsn: 5.283 ± 0.458
3.595ValPro: 3.595 ± 0.163
4.109ValGln: 4.109 ± 0.368
2.348ValArg: 2.348 ± 0.178
6.09ValSer: 6.09 ± 0.452
4.549ValThr: 4.549 ± 0.975
8.952ValVal: 8.952 ± 0.905
0.954ValTrp: 0.954 ± 0.299
4.622ValTyr: 4.622 ± 0.566
0.0ValXaa: 0.0 ± 0.0
Trp
0.293TrpAla: 0.293 ± 0.204
0.734TrpCys: 0.734 ± 0.203
0.44TrpAsp: 0.44 ± 0.152
0.147TrpGlu: 0.147 ± 0.041
2.128TrpPhe: 2.128 ± 0.264
0.073TrpGly: 0.073 ± 0.107
0.293TrpHis: 0.293 ± 0.108
0.22TrpIle: 0.22 ± 0.321
0.66TrpLys: 0.66 ± 0.171
2.054TrpLeu: 2.054 ± 0.261
0.22TrpMet: 0.22 ± 0.065
0.734TrpAsn: 0.734 ± 0.11
0.734TrpPro: 0.734 ± 0.185
0.66TrpGln: 0.66 ± 0.171
0.293TrpArg: 0.293 ± 0.161
1.174TrpSer: 1.174 ± 0.21
1.174TrpThr: 1.174 ± 0.196
0.954TrpVal: 0.954 ± 0.155
0.44TrpTrp: 0.44 ± 0.179
1.101TrpTyr: 1.101 ± 0.287
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.908TyrAla: 1.908 ± 0.263
1.467TyrCys: 1.467 ± 0.252
2.275TyrAsp: 2.275 ± 0.511
2.275TyrGlu: 2.275 ± 0.35
3.008TyrPhe: 3.008 ± 0.298
1.688TyrGly: 1.688 ± 0.174
1.541TyrHis: 1.541 ± 0.135
2.788TyrIle: 2.788 ± 0.228
2.495TyrLys: 2.495 ± 0.44
3.522TyrLeu: 3.522 ± 0.374
1.101TyrMet: 1.101 ± 0.261
2.935TyrAsn: 2.935 ± 0.224
1.614TyrPro: 1.614 ± 0.918
3.449TyrGln: 3.449 ± 0.428
1.614TyrArg: 1.614 ± 0.371
3.669TyrSer: 3.669 ± 0.323
2.275TyrThr: 2.275 ± 0.854
5.87TyrVal: 5.87 ± 0.405
0.514TyrTrp: 0.514 ± 0.55
2.862TyrTyr: 2.862 ± 0.453
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (13630 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski