Amino acid dipeptide frequency for Bat SARS-like coronavirus YNLF_31C

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
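The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name dipeptide_per_mille is hypothetical, pairs are counted as overlapping windows that never span two proteins, and frequencies are normalized by the number of counted pairs. The database may count or normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption (not taken from the source): overlapping pairs are counted
    within each protein and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["ACAC", "AA"])
print(freqs["AC"], freqs["CA"], freqs["AA"])
```

With real input, the list would hold one sequence string per protein of the proteome.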

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
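As a small worked example of the unit conversion, the sketch below turns per-mille values into fractions and compares them with the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The helper names are hypothetical, and treating 2.5‰ as the threshold for over- and underrepresentation is an assumption based on the description above.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %, uniform over 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LeuLeu (10.451) and TrpTrp (0.071).
for name, value in [("LeuLeu", 10.451), ("TrpTrp", 0.071)]:
    print(name, per_mille_to_fraction(value), classify(value))
```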

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.612 ± 0.433
AlaCys: 2.133 ± 0.52
AlaAsp: 2.773 ± 0.668
AlaGlu: 2.346 ± 0.652
AlaPhe: 2.773 ± 0.397
AlaGly: 4.692 ± 0.607
AlaHis: 0.924 ± 0.342
AlaIle: 3.626 ± 0.374
AlaLys: 3.839 ± 1.017
AlaLeu: 7.465 ± 0.789
AlaMet: 2.56 ± 0.294
AlaAsn: 3.484 ± 0.608
AlaPro: 2.702 ± 0.58
AlaGln: 1.991 ± 0.337
AlaArg: 3.342 ± 0.356
AlaSer: 4.977 ± 1.484
AlaThr: 4.764 ± 0.69
AlaVal: 4.764 ± 0.881
AlaTrp: 1.422 ± 0.217
AlaTyr: 3.768 ± 0.438
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.488 ± 0.428
CysCys: 1.706 ± 0.378
CysAsp: 2.133 ± 0.475
CysGlu: 1.209 ± 0.373
CysPhe: 1.28 ± 0.328
CysGly: 2.631 ± 0.57
CysHis: 0.569 ± 0.125
CysIle: 1.849 ± 0.439
CysLys: 0.853 ± 0.181
CysLeu: 2.702 ± 0.732
CysMet: 0.569 ± 0.22
CysAsn: 1.28 ± 0.395
CysPro: 0.924 ± 0.23
CysGln: 0.569 ± 0.181
CysArg: 1.138 ± 0.238
CysSer: 1.92 ± 0.507
CysThr: 2.417 ± 0.501
CysVal: 3.057 ± 0.569
CysTrp: 0.427 ± 0.606
CysTyr: 1.706 ± 0.375
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.266 ± 0.85
AspCys: 1.351 ± 0.378
AspAsp: 2.631 ± 0.413
AspGlu: 2.275 ± 0.364
AspPhe: 2.702 ± 0.491
AspGly: 3.555 ± 0.556
AspHis: 0.711 ± 0.251
AspIle: 3.199 ± 0.413
AspLys: 2.56 ± 0.47
AspLeu: 4.408 ± 0.387
AspMet: 1.28 ± 0.296
AspAsn: 2.986 ± 0.435
AspPro: 1.28 ± 0.618
AspGln: 1.564 ± 0.403
AspArg: 1.351 ± 0.284
AspSer: 2.915 ± 0.406
AspThr: 3.768 ± 0.976
AspVal: 4.053 ± 0.872
AspTrp: 0.569 ± 0.192
AspTyr: 3.413 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.986 ± 0.565
GluCys: 1.706 ± 0.324
GluAsp: 2.346 ± 0.467
GluGlu: 5.119 ± 0.977
GluPhe: 2.062 ± 0.614
GluGly: 2.773 ± 0.467
GluHis: 1.564 ± 0.325
GluIle: 2.275 ± 0.464
GluLys: 2.133 ± 0.477
GluLeu: 4.621 ± 0.707
GluMet: 1.138 ± 0.355
GluAsn: 2.275 ± 0.34
GluPro: 2.062 ± 0.374
GluGln: 1.777 ± 0.327
GluArg: 1.351 ± 0.303
GluSer: 2.631 ± 0.325
GluThr: 3.199 ± 0.741
GluVal: 4.408 ± 0.442
GluTrp: 0.427 ± 0.14
GluTyr: 1.991 ± 0.585
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.844 ± 0.928
PheCys: 1.991 ± 0.39
PheAsp: 2.986 ± 0.729
PheGlu: 1.209 ± 0.274
PhePhe: 1.991 ± 0.296
PheGly: 2.986 ± 0.836
PheHis: 0.711 ± 0.499
PheIle: 2.488 ± 0.274
PheLys: 2.773 ± 0.408
PheLeu: 5.617 ± 1.091
PheMet: 0.924 ± 0.29
PheAsn: 3.057 ± 1.498
PhePro: 1.92 ± 0.311
PheGln: 0.995 ± 0.52
PheArg: 1.706 ± 0.423
PheSer: 2.986 ± 0.414
PheThr: 3.91 ± 0.317
PheVal: 3.484 ± 0.708
PheTrp: 0.355 ± 0.133
PheTyr: 2.844 ± 0.329
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.195 ± 0.76
GlyCys: 1.635 ± 0.36
GlyAsp: 3.768 ± 0.538
GlyGlu: 1.991 ± 0.545
GlyPhe: 3.342 ± 0.358
GlyGly: 4.337 ± 0.794
GlyHis: 1.28 ± 0.399
GlyIle: 3.413 ± 0.712
GlyLys: 2.773 ± 0.315
GlyLeu: 3.839 ± 0.523
GlyMet: 1.138 ± 0.385
GlyAsn: 2.844 ± 0.287
GlyPro: 2.346 ± 0.7
GlyGln: 2.275 ± 0.334
GlyArg: 1.777 ± 0.266
GlySer: 3.982 ± 0.389
GlyThr: 5.403 ± 1.413
GlyVal: 7.11 ± 0.657
GlyTrp: 0.284 ± 0.254
GlyTyr: 2.773 ± 0.525
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.351 ± 0.234
HisCys: 0.711 ± 0.25
HisAsp: 0.711 ± 0.314
HisGlu: 1.138 ± 0.213
HisPhe: 1.209 ± 0.42
HisGly: 1.422 ± 0.34
HisHis: 0.569 ± 0.2
HisIle: 0.924 ± 0.285
HisLys: 0.782 ± 0.228
HisLeu: 2.062 ± 0.281
HisMet: 0.498 ± 0.242
HisAsn: 0.924 ± 0.214
HisPro: 0.569 ± 0.241
HisGln: 0.355 ± 0.125
HisArg: 0.284 ± 0.178
HisSer: 1.564 ± 0.247
HisThr: 1.991 ± 0.276
HisVal: 1.706 ± 0.359
HisTrp: 0.355 ± 0.166
HisTyr: 0.711 ± 0.25
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.697 ± 1.105
IleCys: 1.28 ± 0.391
IleAsp: 2.844 ± 0.422
IleGlu: 1.777 ± 0.377
IlePhe: 1.635 ± 0.293
IleGly: 2.915 ± 0.864
IleHis: 0.569 ± 0.222
IleIle: 2.773 ± 0.672
IleLys: 3.768 ± 0.731
IleLeu: 4.195 ± 0.67
IleMet: 1.564 ± 0.422
IleAsn: 2.844 ± 0.318
IlePro: 1.706 ± 0.302
IleGln: 2.631 ± 0.535
IleArg: 2.204 ± 0.557
IleSer: 3.413 ± 0.583
IleThr: 4.053 ± 0.478
IleVal: 4.835 ± 0.715
IleTrp: 0.498 ± 0.348
IleTyr: 1.138 ± 0.678
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.773 ± 0.666
LysCys: 1.92 ± 0.448
LysAsp: 2.631 ± 0.797
LysGlu: 2.915 ± 0.407
LysPhe: 2.702 ± 0.615
LysGly: 4.977 ± 0.641
LysHis: 1.706 ± 0.424
LysIle: 2.56 ± 0.384
LysLys: 2.986 ± 1.366
LysLeu: 6.47 ± 0.578
LysMet: 1.493 ± 0.252
LysAsn: 2.062 ± 0.565
LysPro: 3.413 ± 0.395
LysGln: 1.351 ± 0.764
LysArg: 2.488 ± 0.245
LysSer: 4.55 ± 0.765
LysThr: 3.697 ± 0.413
LysVal: 3.271 ± 0.391
LysTrp: 0.711 ± 0.166
LysTyr: 1.991 ± 0.37
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.754 ± 0.887
LeuCys: 2.844 ± 0.374
LeuAsp: 4.977 ± 0.614
LeuGlu: 4.906 ± 0.959
LeuPhe: 3.199 ± 1.013
LeuGly: 5.119 ± 0.474
LeuHis: 1.635 ± 0.271
LeuIle: 3.839 ± 1.22
LeuLys: 6.897 ± 1.301
LeuLeu: 10.451 ± 1.931
LeuMet: 2.773 ± 0.565
LeuAsn: 6.541 ± 0.648
LeuPro: 4.408 ± 0.7
LeuGln: 4.479 ± 0.427
LeuArg: 5.048 ± 0.55
LeuSer: 7.75 ± 0.982
LeuThr: 5.261 ± 0.482
LeuVal: 5.83 ± 1.164
LeuTrp: 1.066 ± 0.391
LeuTyr: 3.342 ± 0.476
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.706 ± 0.594
MetCys: 0.924 ± 0.224
MetAsp: 1.564 ± 0.444
MetGlu: 0.782 ± 0.593
MetPhe: 0.995 ± 0.203
MetGly: 0.782 ± 0.276
MetHis: 0.427 ± 0.154
MetIle: 0.569 ± 0.32
MetLys: 1.066 ± 0.341
MetLeu: 2.915 ± 0.698
MetMet: 0.711 ± 0.265
MetAsn: 0.995 ± 0.292
MetPro: 1.351 ± 0.363
MetGln: 1.209 ± 0.324
MetArg: 0.711 ± 0.307
MetSer: 2.417 ± 0.394
MetThr: 1.351 ± 0.476
MetVal: 1.564 ± 0.505
MetTrp: 0.711 ± 0.378
MetTyr: 1.422 ± 0.312
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.982 ± 0.53
AsnCys: 1.635 ± 0.404
AsnAsp: 1.706 ± 0.265
AsnGlu: 2.133 ± 0.442
AsnPhe: 1.849 ± 1.02
AsnGly: 4.55 ± 0.508
AsnHis: 1.28 ± 0.341
AsnIle: 2.631 ± 0.518
AsnLys: 2.702 ± 0.367
AsnLeu: 5.119 ± 0.667
AsnMet: 1.209 ± 0.429
AsnAsn: 3.128 ± 0.44
AsnPro: 1.777 ± 0.429
AsnGln: 1.422 ± 0.874
AsnArg: 1.706 ± 0.512
AsnSer: 3.768 ± 0.777
AsnThr: 3.626 ± 0.775
AsnVal: 4.266 ± 0.493
AsnTrp: 0.284 ± 0.19
AsnTyr: 2.488 ± 0.427
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.631 ± 0.309
ProCys: 1.209 ± 0.271
ProAsp: 1.635 ± 0.394
ProGlu: 1.706 ± 0.265
ProPhe: 1.849 ± 0.602
ProGly: 1.92 ± 0.368
ProHis: 0.711 ± 0.161
ProIle: 2.915 ± 0.404
ProLys: 3.413 ± 0.581
ProLeu: 4.692 ± 0.842
ProMet: 0.64 ± 0.205
ProAsn: 2.133 ± 0.333
ProPro: 1.493 ± 0.235
ProGln: 1.209 ± 1.177
ProArg: 1.351 ± 0.242
ProSer: 2.488 ± 0.393
ProThr: 2.56 ± 0.54
ProVal: 3.555 ± 0.712
ProTrp: 0.284 ± 0.1
ProTyr: 1.066 ± 0.279
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.199 ± 0.317
GlnCys: 0.995 ± 0.262
GlnAsp: 1.564 ± 0.565
GlnGlu: 2.062 ± 0.786
GlnPhe: 1.635 ± 0.591
GlnGly: 2.062 ± 1.179
GlnHis: 0.569 ± 0.233
GlnIle: 1.92 ± 1.186
GlnLys: 1.28 ± 0.48
GlnLeu: 3.626 ± 0.606
GlnMet: 0.995 ± 0.191
GlnAsn: 1.564 ± 0.753
GlnPro: 2.062 ± 0.317
GlnGln: 1.991 ± 0.501
GlnArg: 1.777 ± 0.47
GlnSer: 2.133 ± 0.523
GlnThr: 2.417 ± 0.362
GlnVal: 2.844 ± 0.505
GlnTrp: 0.782 ± 0.223
GlnTyr: 1.066 ± 0.425
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.555 ± 0.619
ArgCys: 1.066 ± 0.372
ArgAsp: 2.133 ± 0.587
ArgGlu: 2.631 ± 0.702
ArgPhe: 1.635 ± 0.543
ArgGly: 2.346 ± 1.141
ArgHis: 1.066 ± 0.327
ArgIle: 1.564 ± 0.861
ArgLys: 1.92 ± 0.579
ArgLeu: 2.986 ± 0.462
ArgMet: 0.427 ± 0.183
ArgAsn: 1.493 ± 0.597
ArgPro: 1.209 ± 0.382
ArgGln: 1.92 ± 0.796
ArgArg: 1.066 ± 0.712
ArgSer: 2.844 ± 0.584
ArgThr: 1.777 ± 0.688
ArgVal: 3.484 ± 0.531
ArgTrp: 0.569 ± 0.531
ArgTyr: 1.351 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.901 ± 0.604
SerCys: 1.564 ± 0.4
SerAsp: 3.768 ± 0.804
SerGlu: 3.768 ± 0.552
SerPhe: 4.408 ± 1.144
SerGly: 3.768 ± 1.175
SerHis: 1.635 ± 0.404
SerIle: 2.915 ± 0.365
SerLys: 3.484 ± 0.496
SerLeu: 5.901 ± 0.575
SerMet: 1.28 ± 0.188
SerAsn: 2.986 ± 0.543
SerPro: 2.133 ± 0.863
SerGln: 2.204 ± 0.282
SerArg: 2.204 ± 1.851
SerSer: 4.408 ± 0.959
SerThr: 5.261 ± 0.675
SerVal: 5.901 ± 0.701
SerTrp: 1.066 ± 0.268
SerTyr: 3.413 ± 0.639
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.413 ± 1.114
ThrCys: 2.915 ± 0.745
ThrAsp: 3.057 ± 0.873
ThrGlu: 3.697 ± 0.413
ThrPhe: 4.55 ± 0.497
ThrGly: 3.91 ± 0.309
ThrHis: 1.635 ± 0.39
ThrIle: 4.621 ± 0.814
ThrLys: 3.626 ± 0.322
ThrLeu: 6.47 ± 0.464
ThrMet: 1.635 ± 0.391
ThrAsn: 2.702 ± 0.221
ThrPro: 3.128 ± 0.394
ThrGln: 3.626 ± 1.458
ThrArg: 2.56 ± 0.582
ThrSer: 5.261 ± 0.96
ThrThr: 6.683 ± 0.997
ThrVal: 5.617 ± 0.242
ThrTrp: 0.498 ± 0.164
ThrTyr: 2.631 ± 0.434
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.688 ± 0.458
ValCys: 2.062 ± 0.333
ValAsp: 4.906 ± 1.088
ValGlu: 4.906 ± 1.23
ValPhe: 3.91 ± 0.682
ValGly: 3.91 ± 0.637
ValHis: 0.995 ± 0.511
ValIle: 3.982 ± 0.739
ValLys: 5.332 ± 0.869
ValLeu: 8.603 ± 1.099
ValMet: 1.849 ± 0.297
ValAsn: 3.839 ± 0.729
ValPro: 3.057 ± 0.34
ValGln: 3.199 ± 0.569
ValArg: 2.346 ± 0.304
ValSer: 4.479 ± 0.522
ValThr: 6.754 ± 0.983
ValVal: 7.323 ± 1.048
ValTrp: 0.355 ± 0.106
ValTyr: 4.266 ± 0.741
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.711 ± 0.25
TrpCys: 0.213 ± 0.077
TrpAsp: 0.355 ± 0.231
TrpGlu: 0.498 ± 0.114
TrpPhe: 1.066 ± 0.201
TrpGly: 0.142 ± 0.093
TrpHis: 0.427 ± 0.405
TrpIle: 0.498 ± 0.189
TrpLys: 0.569 ± 0.189
TrpLeu: 1.493 ± 0.601
TrpMet: 0.142 ± 0.054
TrpAsn: 1.351 ± 0.339
TrpPro: 0.355 ± 0.363
TrpGln: 0.284 ± 0.17
TrpArg: 0.213 ± 0.133
TrpSer: 0.853 ± 0.248
TrpThr: 0.427 ± 0.137
TrpVal: 0.853 ± 0.311
TrpTrp: 0.071 ± 0.046
TrpTyr: 0.498 ± 0.223
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.204 ± 0.346
TyrCys: 1.635 ± 0.456
TyrAsp: 2.346 ± 0.546
TyrGlu: 1.777 ± 0.343
TyrPhe: 2.986 ± 0.889
TyrGly: 1.706 ± 0.218
TyrHis: 0.853 ± 0.356
TyrIle: 1.777 ± 0.284
TyrLys: 3.91 ± 0.518
TyrLeu: 3.484 ± 0.986
TyrMet: 1.28 ± 0.49
TyrAsn: 2.773 ± 0.424
TyrPro: 1.564 ± 0.483
TyrGln: 1.493 ± 0.744
TyrArg: 2.417 ± 0.324
TyrSer: 2.631 ± 0.876
TyrThr: 2.915 ± 0.69
TyrVal: 3.839 ± 0.592
TyrTrp: 0.284 ± 0.115
TyrTyr: 2.346 ± 0.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (14066 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
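A protein-level bootstrap of this kind could look roughly like the sketch below. The source only states that 100 bootstrap rounds were run at the protein level; the function names, the use of the standard deviation of the resampled estimates as the error, and the normalization by the number of counted pairs are all assumptions made for illustration.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    per-mille frequency across the bootstrap rounds.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real proteome data.
toy = ["AAAA", "CCCC", "ACAC"]
print(pair_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are included.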

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski