Amino acid dipeptide frequency for Rousettus bat coronavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
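The calculation described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `representation` are hypothetical, and counting overlapping residue pairs within each protein is an assumption. Any residue outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) stands for non-standard or unknown residues

# 2.5 per mille: the frequency of each dipeptide if all 400 were equally likely
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything that is not a standard residue to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs (0,1), (1,2), ...; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

For example, `representation(12.081)` labels the ValLeu value from the table as over-represented, and `representation(0.102)` labels TrpTrp as under-represented.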

For more information, see the sequence space article on Wikipedia.

Dipeptide (first residue, then second residue): frequency ± error, in ‰
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.513 ± 1.773
AlaCys: 2.944 ± 0.904
AlaAsp: 3.655 ± 0.806
AlaGlu: 2.234 ± 0.408
AlaPhe: 3.147 ± 0.604
AlaGly: 3.858 ± 0.8
AlaHis: 1.421 ± 0.25
AlaIle: 5.584 ± 0.887
AlaLys: 3.858 ± 1.119
AlaLeu: 7.107 ± 1.2
AlaMet: 2.437 ± 0.55
AlaAsn: 4.264 ± 0.713
AlaPro: 3.147 ± 0.682
AlaGln: 2.335 ± 0.472
AlaArg: 3.756 ± 0.717
AlaSer: 5.076 ± 0.665
AlaThr: 4.264 ± 0.616
AlaVal: 5.888 ± 0.684
AlaTrp: 0.812 ± 0.438
AlaTyr: 3.249 ± 0.622
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.538 ± 0.693
CysCys: 1.117 ± 0.428
CysAsp: 1.726 ± 0.649
CysGlu: 1.015 ± 0.314
CysPhe: 1.421 ± 0.402
CysGly: 1.827 ± 0.425
CysHis: 0.609 ± 0.33
CysIle: 2.03 ± 0.971
CysLys: 1.726 ± 0.458
CysLeu: 2.234 ± 0.67
CysMet: 0.711 ± 0.244
CysAsn: 1.218 ± 0.846
CysPro: 1.117 ± 0.435
CysGln: 1.015 ± 0.609
CysArg: 1.218 ± 0.638
CysSer: 2.335 ± 0.703
CysThr: 2.538 ± 0.696
CysVal: 3.249 ± 0.899
CysTrp: 0.406 ± 0.129
CysTyr: 2.335 ± 0.758
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.365 ± 1.187
AspCys: 1.624 ± 0.445
AspAsp: 2.944 ± 0.9
AspGlu: 2.234 ± 0.429
AspPhe: 2.234 ± 0.713
AspGly: 4.365 ± 1.129
AspHis: 0.406 ± 0.214
AspIle: 2.437 ± 0.81
AspLys: 2.234 ± 0.601
AspLeu: 4.467 ± 0.6
AspMet: 0.711 ± 0.374
AspAsn: 1.726 ± 0.802
AspPro: 1.929 ± 0.628
AspGln: 1.523 ± 0.351
AspArg: 1.218 ± 0.404
AspSer: 3.046 ± 0.828
AspThr: 3.858 ± 0.496
AspVal: 5.381 ± 0.757
AspTrp: 1.015 ± 0.398
AspTyr: 2.741 ± 1.049
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.553 ± 0.836
GluCys: 0.914 ± 0.336
GluAsp: 2.03 ± 0.432
GluGlu: 2.538 ± 0.704
GluPhe: 1.929 ± 0.394
GluGly: 3.147 ± 0.983
GluHis: 1.015 ± 0.266
GluIle: 1.523 ± 0.433
GluLys: 1.726 ± 0.406
GluLeu: 4.569 ± 0.876
GluMet: 1.015 ± 0.355
GluAsn: 2.538 ± 0.65
GluPro: 1.827 ± 0.459
GluGln: 1.827 ± 0.821
GluArg: 1.523 ± 0.295
GluSer: 2.843 ± 0.672
GluThr: 2.437 ± 0.384
GluVal: 3.858 ± 1.388
GluTrp: 0.102 ± 0.053
GluTyr: 1.32 ± 0.479
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.538 ± 1.087
PheCys: 1.117 ± 0.252
PheAsp: 3.046 ± 0.438
PheGlu: 1.827 ± 0.359
PhePhe: 1.218 ± 0.347
PheGly: 3.35 ± 0.586
PheHis: 1.015 ± 0.433
PheIle: 1.929 ± 1.326
PheLys: 2.843 ± 0.613
PheLeu: 3.046 ± 0.637
PheMet: 1.015 ± 0.332
PheAsn: 3.147 ± 1.485
PhePro: 1.32 ± 0.474
PheGln: 1.015 ± 0.609
PheArg: 1.32 ± 0.369
PheSer: 2.843 ± 0.532
PheThr: 3.858 ± 1.105
PheVal: 4.569 ± 1.082
PheTrp: 0.406 ± 0.214
PheTyr: 3.046 ± 0.409
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.975 ± 0.787
GlyCys: 1.523 ± 0.401
GlyAsp: 3.655 ± 0.901
GlyGlu: 1.421 ± 0.317
GlyPhe: 4.264 ± 0.915
GlyGly: 4.061 ± 0.944
GlyHis: 0.812 ± 0.287
GlyIle: 3.046 ± 0.277
GlyLys: 2.741 ± 0.758
GlyLeu: 4.772 ± 0.946
GlyMet: 1.218 ± 0.59
GlyAsn: 3.147 ± 0.762
GlyPro: 2.132 ± 0.605
GlyGln: 1.726 ± 0.307
GlyArg: 1.929 ± 1.143
GlySer: 3.655 ± 0.796
GlyThr: 5.178 ± 0.825
GlyVal: 8.528 ± 1.501
GlyTrp: 1.015 ± 0.535
GlyTyr: 2.437 ± 0.49
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.523 ± 0.47
HisCys: 0.203 ± 0.253
HisAsp: 0.812 ± 0.428
HisGlu: 0.508 ± 0.495
HisPhe: 1.32 ± 0.446
HisGly: 1.32 ± 0.326
HisHis: 0.508 ± 0.26
HisIle: 1.015 ± 0.387
HisLys: 1.218 ± 0.469
HisLeu: 1.827 ± 0.444
HisMet: 0.203 ± 0.107
HisAsn: 1.015 ± 0.393
HisPro: 0.406 ± 0.214
HisGln: 0.812 ± 0.306
HisArg: 0.406 ± 0.602
HisSer: 1.015 ± 0.261
HisThr: 1.827 ± 0.49
HisVal: 2.741 ± 1.027
HisTrp: 0.102 ± 0.296
HisTyr: 1.015 ± 0.318
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.046 ± 1.02
IleCys: 0.914 ± 0.282
IleAsp: 2.234 ± 0.701
IleGlu: 1.117 ± 0.28
IlePhe: 1.218 ± 0.526
IleGly: 2.741 ± 0.605
IleHis: 0.711 ± 0.876
IleIle: 1.218 ± 0.294
IleLys: 1.827 ± 0.695
IleLeu: 4.365 ± 1.498
IleMet: 0.812 ± 0.326
IleAsn: 1.929 ± 0.374
IlePro: 2.741 ± 0.562
IleGln: 1.015 ± 0.314
IleArg: 1.929 ± 0.411
IleSer: 4.569 ± 1.331
IleThr: 3.249 ± 0.859
IleVal: 4.162 ± 1.362
IleTrp: 0.203 ± 0.237
IleTyr: 1.32 ± 0.521
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.046 ± 0.47
LysCys: 1.218 ± 0.391
LysAsp: 1.929 ± 0.628
LysGlu: 2.741 ± 0.801
LysPhe: 2.64 ± 0.769
LysGly: 3.046 ± 0.711
LysHis: 1.32 ± 0.567
LysIle: 1.218 ± 0.49
LysLys: 2.335 ± 1.189
LysLeu: 5.076 ± 0.667
LysMet: 0.914 ± 0.196
LysAsn: 1.117 ± 0.536
LysPro: 4.162 ± 1.033
LysGln: 2.03 ± 0.517
LysArg: 2.741 ± 0.849
LysSer: 2.843 ± 0.69
LysThr: 2.335 ± 0.669
LysVal: 4.772 ± 0.887
LysTrp: 0.914 ± 0.549
LysTyr: 2.437 ± 0.501
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.208 ± 2.435
LeuCys: 3.655 ± 0.53
LeuAsp: 4.061 ± 1.093
LeuGlu: 3.858 ± 0.515
LeuPhe: 3.35 ± 1.027
LeuGly: 4.467 ± 0.644
LeuHis: 2.741 ± 0.619
LeuIle: 3.35 ± 1.434
LeuLys: 5.685 ± 1.371
LeuLeu: 10.558 ± 2.847
LeuMet: 1.624 ± 0.397
LeuAsn: 4.061 ± 0.715
LeuPro: 4.67 ± 0.922
LeuGln: 3.655 ± 0.885
LeuArg: 4.569 ± 1.385
LeuSer: 7.614 ± 0.847
LeuThr: 5.787 ± 0.45
LeuVal: 9.239 ± 2.214
LeuTrp: 1.624 ± 0.897
LeuTyr: 4.67 ± 0.729
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.132 ± 0.585
MetCys: 1.218 ± 0.379
MetAsp: 0.812 ± 0.347
MetGlu: 1.015 ± 0.294
MetPhe: 1.117 ± 0.283
MetGly: 1.421 ± 0.755
MetHis: 0.609 ± 0.321
MetIle: 0.914 ± 0.336
MetLys: 0.203 ± 0.392
MetLeu: 2.437 ± 0.773
MetMet: 0.305 ± 0.16
MetAsn: 0.711 ± 0.226
MetPro: 1.117 ± 0.487
MetGln: 1.015 ± 0.234
MetArg: 0.812 ± 0.36
MetSer: 1.726 ± 0.409
MetThr: 1.117 ± 0.438
MetVal: 1.827 ± 0.468
MetTrp: 0.711 ± 0.598
MetTyr: 1.015 ± 0.424
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.858 ± 0.582
AsnCys: 2.132 ± 0.614
AsnAsp: 2.03 ± 0.843
AsnGlu: 1.726 ± 0.5
AsnPhe: 2.03 ± 1.337
AsnGly: 4.162 ± 0.681
AsnHis: 0.711 ± 0.395
AsnIle: 1.218 ± 0.359
AsnLys: 2.741 ± 0.624
AsnLeu: 4.264 ± 0.712
AsnMet: 1.523 ± 0.42
AsnAsn: 2.234 ± 0.682
AsnPro: 2.03 ± 0.721
AsnGln: 1.015 ± 0.861
AsnArg: 2.132 ± 0.47
AsnSer: 3.249 ± 0.629
AsnThr: 2.944 ± 1.252
AsnVal: 4.569 ± 0.493
AsnTrp: 0.711 ± 0.374
AsnTyr: 2.538 ± 0.571
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.553 ± 0.676
ProCys: 1.015 ± 0.387
ProAsp: 2.538 ± 0.418
ProGlu: 2.234 ± 0.802
ProPhe: 1.827 ± 0.255
ProGly: 3.147 ± 0.626
ProHis: 0.812 ± 0.269
ProIle: 2.234 ± 0.497
ProLys: 2.944 ± 0.948
ProLeu: 3.959 ± 0.598
ProMet: 1.117 ± 0.341
ProAsn: 2.234 ± 0.985
ProPro: 2.132 ± 0.482
ProGln: 1.827 ± 0.778
ProArg: 1.929 ± 0.85
ProSer: 2.132 ± 0.509
ProThr: 2.843 ± 0.948
ProVal: 4.365 ± 0.386
ProTrp: 0.812 ± 0.276
ProTyr: 2.335 ± 0.773
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.538 ± 0.634
GlnCys: 0.508 ± 0.802
GlnAsp: 1.929 ± 0.578
GlnGlu: 2.335 ± 0.556
GlnPhe: 1.726 ± 0.354
GlnGly: 1.624 ± 0.651
GlnHis: 0.812 ± 0.867
GlnIle: 1.218 ± 0.475
GlnLys: 1.523 ± 0.67
GlnLeu: 4.569 ± 0.76
GlnMet: 0.609 ± 0.165
GlnAsn: 1.117 ± 0.459
GlnPro: 1.726 ± 0.826
GlnGln: 1.015 ± 0.257
GlnArg: 1.624 ± 0.377
GlnSer: 2.335 ± 0.64
GlnThr: 2.132 ± 0.494
GlnVal: 2.944 ± 0.737
GlnTrp: 0.711 ± 0.182
GlnTyr: 1.421 ± 0.356
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.655 ± 0.628
ArgCys: 2.03 ± 0.588
ArgAsp: 1.421 ± 0.348
ArgGlu: 2.741 ± 0.936
ArgPhe: 2.538 ± 0.486
ArgGly: 2.132 ± 0.688
ArgHis: 1.218 ± 0.62
ArgIle: 1.421 ± 0.668
ArgLys: 1.523 ± 0.502
ArgLeu: 3.756 ± 0.67
ArgMet: 1.117 ± 0.438
ArgAsn: 2.335 ± 1.54
ArgPro: 1.218 ± 0.714
ArgGln: 1.726 ± 0.581
ArgArg: 1.726 ± 0.653
ArgSer: 1.421 ± 0.365
ArgThr: 3.147 ± 0.914
ArgVal: 3.553 ± 0.95
ArgTrp: 0.406 ± 0.37
ArgTyr: 1.929 ± 0.303
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.569 ± 0.506
SerCys: 2.64 ± 0.671
SerAsp: 3.655 ± 0.521
SerGlu: 3.756 ± 0.672
SerPhe: 2.944 ± 1.302
SerGly: 3.452 ± 0.528
SerHis: 1.218 ± 0.412
SerIle: 3.046 ± 0.68
SerLys: 2.843 ± 0.808
SerLeu: 6.396 ± 0.908
SerMet: 1.523 ± 0.23
SerAsn: 2.843 ± 1.313
SerPro: 2.234 ± 0.615
SerGln: 2.843 ± 0.48
SerArg: 3.046 ± 1.48
SerSer: 5.279 ± 0.697
SerThr: 4.467 ± 0.744
SerVal: 7.614 ± 1.211
SerTrp: 0.812 ± 0.311
SerTyr: 2.437 ± 0.475
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.178 ± 0.491
ThrCys: 2.437 ± 0.64
ThrAsp: 2.234 ± 0.451
ThrGlu: 2.335 ± 0.435
ThrPhe: 3.249 ± 0.816
ThrGly: 4.365 ± 0.537
ThrHis: 1.421 ± 0.383
ThrIle: 2.335 ± 1.097
ThrLys: 3.249 ± 0.954
ThrLeu: 5.99 ± 1.158
ThrMet: 1.726 ± 0.494
ThrAsn: 3.249 ± 0.782
ThrPro: 4.975 ± 1.343
ThrGln: 2.538 ± 0.769
ThrArg: 2.538 ± 1.732
ThrSer: 4.264 ± 0.814
ThrThr: 5.279 ± 1.722
ThrVal: 6.294 ± 1.077
ThrTrp: 0.914 ± 0.376
ThrTyr: 2.843 ± 0.897
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.193 ± 1.597
ValCys: 3.553 ± 0.841
ValAsp: 5.178 ± 0.993
ValGlu: 4.162 ± 0.929
ValPhe: 3.452 ± 0.752
ValGly: 5.888 ± 0.865
ValHis: 1.32 ± 0.479
ValIle: 3.553 ± 0.739
ValLys: 5.482 ± 1.478
ValLeu: 12.081 ± 1.548
ValMet: 2.335 ± 0.651
ValAsn: 5.178 ± 0.949
ValPro: 4.365 ± 0.982
ValGln: 3.756 ± 0.699
ValArg: 4.061 ± 0.786
ValSer: 7.614 ± 1.683
ValThr: 6.497 ± 0.524
ValVal: 9.746 ± 1.58
ValTrp: 0.914 ± 0.336
ValTyr: 3.858 ± 0.746
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.711 ± 0.374
TrpCys: 0.406 ± 0.247
TrpAsp: 0.914 ± 0.481
TrpGlu: 0.406 ± 0.214
TrpPhe: 1.015 ± 0.404
TrpGly: 0.711 ± 0.784
TrpHis: 0.0 ± 0.0
TrpIle: 0.406 ± 0.214
TrpLys: 0.406 ± 0.214
TrpLeu: 1.726 ± 1.387
TrpMet: 0.203 ± 0.411
TrpAsn: 0.609 ± 0.269
TrpPro: 0.508 ± 0.395
TrpGln: 0.508 ± 0.354
TrpArg: 0.508 ± 0.267
TrpSer: 0.812 ± 0.501
TrpThr: 0.812 ± 0.287
TrpVal: 1.523 ± 0.25
TrpTrp: 0.102 ± 0.053
TrpTyr: 1.015 ± 0.28
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.858 ± 0.751
TyrCys: 1.218 ± 0.543
TyrAsp: 3.655 ± 0.643
TyrGlu: 2.03 ± 0.342
TyrPhe: 1.827 ± 0.495
TyrGly: 3.147 ± 0.7
TyrHis: 1.117 ± 0.505
TyrIle: 1.624 ± 0.479
TyrLys: 1.726 ± 0.634
TyrLeu: 3.249 ± 1.034
TyrMet: 1.015 ± 0.363
TyrAsn: 3.147 ± 1.038
TyrPro: 2.234 ± 0.434
TyrGln: 1.218 ± 0.333
TyrArg: 2.03 ± 0.507
TyrSer: 2.944 ± 0.813
TyrThr: 2.843 ± 0.589
TyrVal: 4.467 ± 1.233
TyrTrp: 0.609 ± 0.44
TyrTyr: 2.335 ± 0.913
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (9851 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
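Bootstrapping at the protein level can be sketched as follows. This is an illustrative sketch under stated assumptions, not the database's actual code: the function name `bootstrap_error` is hypothetical, whole proteins are resampled with replacement 100 times, overlapping residue pairs are counted, and the error is taken to be the standard deviation of the resampled frequencies.

```python
import random
import statistics


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency across bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        # Assumption: overlapping pairs within each protein, never across proteins.
        pairs = [a + b for seq in sample for a, b in zip(seq, seq[1:])]
        freq = 1000.0 * pairs.count(dipeptide) / len(pairs) if pairs else 0.0
        estimates.append(freq)
    # Assumption: the reported error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)
```

With only 10 proteins in the proteome, each resample draws 10 proteins with replacement, so a dipeptide concentrated in one long protein would be expected to show a comparatively large error.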

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski