Amino acid dipeptide frequency for Bat coronavirus Cp/Yunnan2011

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
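The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names `dipeptide_permille` and `representation` are hypothetical, and the sketch assumes that overlapping pairs are counted within each protein and that the total number of pairs is used as the denominator, which may differ from how the values in the table were normalized.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumptions of this sketch: overlapping pairs are counted, pairs never
    span two proteins, and the total number of pairs is the denominator.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(freq_permille, n_amino_acids=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 20)."""
    expected = 1000.0 / n_amino_acids ** 2
    if freq_permille > expected:
        return "over"   # shown in red in the table
    if freq_permille < expected:
        return "under"  # shown in blue in the table
    return "expected"
```

With the values from the table, `representation(6.274)` (AlaAla) gives "over" and `representation(1.082)` (AlaHis) gives "under". Passing `n_amino_acids=21` uses the 441-dipeptide expectation instead.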

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
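As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Per mille to fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


# AlaAla is listed as 6.274 per mille in the table.
print(f"{permille_to_fraction(6.274):.6f}")  # prints 0.006274
print(f"{permille_to_percent(6.274):.4f}")   # prints 0.6274
```

The same conversion shows that the uniform expectation of 0.25% corresponds to 2.5‰ on the scale used in the table.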

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.274 ± 0.68
AlaCys: 2.055 ± 0.354
AlaAsp: 3.245 ± 0.54
AlaGlu: 2.488 ± 0.327
AlaPhe: 2.488 ± 0.387
AlaGly: 4.219 ± 0.467
AlaHis: 1.082 ± 0.342
AlaIle: 4.543 ± 0.51
AlaLys: 3.57 ± 0.853
AlaLeu: 7.248 ± 1.016
AlaMet: 2.488 ± 0.372
AlaAsn: 3.894 ± 0.612
AlaPro: 2.596 ± 0.309
AlaGln: 2.704 ± 0.526
AlaArg: 2.813 ± 0.46
AlaSer: 5.409 ± 1.354
AlaThr: 4.868 ± 0.301
AlaVal: 4.868 ± 1.049
AlaTrp: 1.406 ± 0.249
AlaTyr: 4.003 ± 0.48
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.164 ± 0.482
CysCys: 1.623 ± 0.276
CysAsp: 2.38 ± 0.64
CysGlu: 1.082 ± 0.444
CysPhe: 1.19 ± 0.402
CysGly: 2.38 ± 0.5
CysHis: 0.541 ± 0.105
CysIle: 1.839 ± 0.388
CysLys: 0.974 ± 0.238
CysLeu: 2.704 ± 0.581
CysMet: 0.541 ± 0.235
CysAsn: 1.406 ± 0.42
CysPro: 0.757 ± 0.122
CysGln: 0.541 ± 0.23
CysArg: 1.082 ± 0.369
CysSer: 2.164 ± 0.518
CysThr: 2.164 ± 0.527
CysVal: 2.596 ± 0.525
CysTrp: 0.541 ± 0.535
CysTyr: 1.406 ± 0.387
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.219 ± 0.787
AspCys: 1.19 ± 0.23
AspAsp: 2.272 ± 0.363
AspGlu: 2.704 ± 0.208
AspPhe: 2.921 ± 0.872
AspGly: 4.111 ± 0.651
AspHis: 0.757 ± 0.241
AspIle: 3.029 ± 0.56
AspLys: 2.813 ± 0.767
AspLeu: 4.868 ± 0.501
AspMet: 1.19 ± 0.416
AspAsn: 3.137 ± 0.562
AspPro: 1.623 ± 0.4
AspGln: 1.406 ± 0.257
AspArg: 1.839 ± 0.759
AspSer: 3.245 ± 0.391
AspThr: 3.678 ± 0.924
AspVal: 4.111 ± 0.49
AspTrp: 0.757 ± 0.336
AspTyr: 3.57 ± 0.558
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.029 ± 0.664
GluCys: 1.839 ± 0.292
GluAsp: 2.272 ± 0.559
GluGlu: 3.894 ± 1.325
GluPhe: 1.947 ± 0.515
GluGly: 2.813 ± 0.379
GluHis: 1.514 ± 0.368
GluIle: 2.596 ± 0.438
GluLys: 1.947 ± 0.384
GluLeu: 4.435 ± 0.453
GluMet: 0.865 ± 0.174
GluAsn: 2.38 ± 0.599
GluPro: 1.839 ± 0.428
GluGln: 1.839 ± 0.472
GluArg: 1.406 ± 0.289
GluSer: 2.813 ± 0.527
GluThr: 3.029 ± 0.483
GluVal: 3.462 ± 0.638
GluTrp: 0.216 ± 0.135
GluTyr: 1.839 ± 0.333
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.596 ± 0.5
PheCys: 1.947 ± 0.451
PheAsp: 2.921 ± 0.766
PheGlu: 1.514 ± 0.238
PhePhe: 2.164 ± 0.328
PheGly: 3.137 ± 0.849
PheHis: 0.649 ± 0.325
PheIle: 2.704 ± 0.246
PheLys: 3.462 ± 0.849
PheLeu: 4.652 ± 1.215
PheMet: 0.974 ± 0.238
PheAsn: 3.462 ± 1.296
PhePro: 2.164 ± 0.34
PheGln: 1.082 ± 0.618
PheArg: 1.19 ± 0.224
PheSer: 3.786 ± 0.962
PheThr: 3.786 ± 0.254
PheVal: 3.678 ± 0.905
PheTrp: 0.325 ± 0.16
PheTyr: 2.596 ± 0.54
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.327 ± 1.019
GlyCys: 1.731 ± 0.288
GlyAsp: 4.111 ± 0.392
GlyGlu: 2.055 ± 0.292
GlyPhe: 3.462 ± 0.376
GlyGly: 4.111 ± 0.9
GlyHis: 1.406 ± 0.353
GlyIle: 4.003 ± 0.737
GlyLys: 2.921 ± 0.589
GlyLeu: 3.786 ± 0.442
GlyMet: 1.082 ± 0.355
GlyAsn: 2.813 ± 0.431
GlyPro: 2.38 ± 0.578
GlyGln: 2.164 ± 0.423
GlyArg: 1.839 ± 0.492
GlySer: 4.111 ± 0.473
GlyThr: 5.084 ± 0.836
GlyVal: 7.032 ± 0.767
GlyTrp: 0.541 ± 0.369
GlyTyr: 2.596 ± 0.459
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.406 ± 0.393
HisCys: 0.541 ± 0.171
HisAsp: 0.974 ± 0.479
HisGlu: 1.082 ± 0.378
HisPhe: 1.514 ± 0.678
HisGly: 1.406 ± 0.295
HisHis: 0.541 ± 0.171
HisIle: 1.19 ± 0.312
HisLys: 0.649 ± 0.174
HisLeu: 2.055 ± 0.185
HisMet: 0.541 ± 0.348
HisAsn: 0.865 ± 0.228
HisPro: 0.541 ± 0.23
HisGln: 0.433 ± 0.075
HisArg: 0.216 ± 0.193
HisSer: 1.947 ± 0.16
HisThr: 1.731 ± 0.288
HisVal: 1.406 ± 0.237
HisTrp: 0.325 ± 0.201
HisTyr: 0.649 ± 0.092
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.003 ± 1.332
IleCys: 1.19 ± 0.375
IleAsp: 3.029 ± 0.217
IleGlu: 1.731 ± 0.554
IlePhe: 1.623 ± 0.336
IleGly: 4.003 ± 1.261
IleHis: 0.216 ± 0.193
IleIle: 3.029 ± 0.452
IleLys: 3.678 ± 0.474
IleLeu: 4.219 ± 0.279
IleMet: 1.623 ± 0.356
IleAsn: 2.921 ± 0.437
IlePro: 1.947 ± 0.58
IleGln: 1.839 ± 0.389
IleArg: 2.38 ± 0.84
IleSer: 3.462 ± 0.578
IleThr: 4.327 ± 0.293
IleVal: 4.435 ± 0.402
IleTrp: 0.325 ± 0.106
IleTyr: 1.298 ± 0.66
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.813 ± 0.692
LysCys: 2.055 ± 0.323
LysAsp: 2.921 ± 0.864
LysGlu: 3.137 ± 0.382
LysPhe: 2.596 ± 0.427
LysGly: 4.976 ± 0.508
LysHis: 1.839 ± 0.532
LysIle: 2.813 ± 0.608
LysLys: 2.813 ± 1.228
LysLeu: 6.274 ± 0.483
LysMet: 1.406 ± 0.27
LysAsn: 2.272 ± 0.58
LysPro: 3.354 ± 0.279
LysGln: 1.731 ± 0.735
LysArg: 2.488 ± 0.144
LysSer: 3.57 ± 0.5
LysThr: 3.57 ± 0.381
LysVal: 2.813 ± 0.25
LysTrp: 0.757 ± 0.156
LysTyr: 2.164 ± 0.614
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.383 ± 0.789
LeuCys: 2.704 ± 0.485
LeuAsp: 4.652 ± 1.131
LeuGlu: 4.003 ± 0.854
LeuPhe: 3.462 ± 0.492
LeuGly: 5.084 ± 0.335
LeuHis: 1.839 ± 0.4
LeuIle: 3.245 ± 0.931
LeuLys: 6.815 ± 0.85
LeuLeu: 10.169 ± 1.707
LeuMet: 2.596 ± 0.417
LeuAsn: 6.058 ± 1.235
LeuPro: 4.976 ± 0.719
LeuGln: 4.652 ± 0.411
LeuArg: 4.435 ± 0.944
LeuSer: 7.032 ± 0.717
LeuThr: 5.409 ± 0.304
LeuVal: 5.733 ± 0.946
LeuTrp: 1.19 ± 0.382
LeuTyr: 3.462 ± 0.749
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.947 ± 0.591
MetCys: 0.757 ± 0.263
MetAsp: 1.623 ± 0.3
MetGlu: 0.865 ± 0.321
MetPhe: 0.974 ± 0.156
MetGly: 0.757 ± 0.242
MetHis: 0.433 ± 0.151
MetIle: 0.757 ± 0.563
MetLys: 0.974 ± 0.167
MetLeu: 2.488 ± 0.67
MetMet: 0.757 ± 0.304
MetAsn: 0.757 ± 0.242
MetPro: 1.298 ± 0.272
MetGln: 1.082 ± 0.297
MetArg: 0.649 ± 0.174
MetSer: 2.488 ± 0.468
MetThr: 1.406 ± 0.205
MetVal: 1.082 ± 0.368
MetTrp: 0.649 ± 0.402
MetTyr: 1.514 ± 0.254
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.543 ± 0.93
AsnCys: 1.623 ± 0.237
AsnAsp: 1.623 ± 0.289
AsnGlu: 1.731 ± 0.367
AsnPhe: 2.488 ± 1.549
AsnGly: 3.894 ± 0.88
AsnHis: 1.298 ± 0.428
AsnIle: 2.488 ± 0.491
AsnLys: 2.704 ± 0.348
AsnLeu: 4.976 ± 0.881
AsnMet: 1.298 ± 0.444
AsnAsn: 3.137 ± 0.475
AsnPro: 1.947 ± 0.405
AsnGln: 1.731 ± 0.743
AsnArg: 2.055 ± 0.544
AsnSer: 4.003 ± 0.818
AsnThr: 3.029 ± 0.585
AsnVal: 4.868 ± 0.73
AsnTrp: 0.433 ± 0.158
AsnTyr: 2.704 ± 0.889
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.137 ± 0.556
ProCys: 0.865 ± 0.174
ProAsp: 1.514 ± 0.267
ProGlu: 1.623 ± 0.395
ProPhe: 1.839 ± 0.316
ProGly: 2.164 ± 0.347
ProHis: 0.649 ± 0.325
ProIle: 2.488 ± 0.144
ProLys: 3.354 ± 0.559
ProLeu: 4.219 ± 0.352
ProMet: 0.541 ± 0.105
ProAsn: 2.055 ± 0.316
ProPro: 1.623 ± 0.296
ProGln: 1.731 ± 1.135
ProArg: 1.623 ± 0.592
ProSer: 2.38 ± 0.596
ProThr: 3.57 ± 0.334
ProVal: 3.354 ± 0.969
ProTrp: 0.325 ± 0.097
ProTyr: 1.19 ± 0.263
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.137 ± 0.432
GlnCys: 0.974 ± 0.352
GlnAsp: 1.839 ± 0.498
GlnGlu: 1.839 ± 0.609
GlnPhe: 2.055 ± 0.58
GlnGly: 2.488 ± 0.918
GlnHis: 0.757 ± 0.429
GlnIle: 2.272 ± 1.185
GlnLys: 1.839 ± 0.353
GlnLeu: 4.003 ± 0.502
GlnMet: 1.082 ± 0.165
GlnAsn: 1.514 ± 0.317
GlnPro: 1.947 ± 0.479
GlnGln: 1.839 ± 0.674
GlnArg: 1.406 ± 0.523
GlnSer: 1.839 ± 0.448
GlnThr: 2.38 ± 0.357
GlnVal: 2.704 ± 0.468
GlnTrp: 0.649 ± 0.167
GlnTyr: 1.406 ± 0.563
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.462 ± 0.451
ArgCys: 1.19 ± 0.353
ArgAsp: 1.947 ± 0.389
ArgGlu: 2.488 ± 0.531
ArgPhe: 1.731 ± 0.47
ArgGly: 2.704 ± 1.268
ArgHis: 0.865 ± 0.274
ArgIle: 1.947 ± 0.592
ArgLys: 1.839 ± 0.438
ArgLeu: 3.137 ± 0.565
ArgMet: 0.541 ± 0.264
ArgAsn: 2.055 ± 0.796
ArgPro: 1.082 ± 0.378
ArgGln: 2.055 ± 0.637
ArgArg: 1.19 ± 0.661
ArgSer: 2.596 ± 0.865
ArgThr: 1.731 ± 0.329
ArgVal: 3.678 ± 0.455
ArgTrp: 0.433 ± 0.254
ArgTyr: 1.298 ± 0.324
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.95 ± 0.796
SerCys: 1.406 ± 0.379
SerAsp: 4.219 ± 0.826
SerGlu: 4.003 ± 0.878
SerPhe: 4.435 ± 1.26
SerGly: 4.003 ± 1.272
SerHis: 1.406 ± 0.634
SerIle: 2.38 ± 0.391
SerLys: 3.462 ± 0.408
SerLeu: 5.95 ± 0.466
SerMet: 1.406 ± 0.24
SerAsn: 2.921 ± 0.652
SerPro: 2.055 ± 0.569
SerGln: 2.488 ± 0.597
SerArg: 3.029 ± 1.45
SerSer: 4.435 ± 0.943
SerThr: 4.76 ± 0.694
SerVal: 6.383 ± 0.701
SerTrp: 1.082 ± 0.146
SerTyr: 3.137 ± 0.378
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.111 ± 1.004
ThrCys: 2.488 ± 0.787
ThrAsp: 3.245 ± 0.747
ThrGlu: 3.57 ± 0.444
ThrPhe: 4.111 ± 1.104
ThrGly: 4.003 ± 0.289
ThrHis: 1.298 ± 0.217
ThrIle: 3.57 ± 0.8
ThrLys: 3.462 ± 0.329
ThrLeu: 6.491 ± 0.429
ThrMet: 1.731 ± 0.492
ThrAsn: 2.921 ± 0.242
ThrPro: 2.596 ± 0.493
ThrGln: 3.678 ± 0.919
ThrArg: 2.921 ± 0.721
ThrSer: 5.409 ± 0.879
ThrThr: 5.517 ± 1.223
ThrVal: 5.625 ± 0.387
ThrTrp: 0.541 ± 0.287
ThrTyr: 2.488 ± 0.199
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.625 ± 0.796
ValCys: 2.272 ± 0.428
ValAsp: 5.301 ± 1.114
ValGlu: 4.003 ± 0.745
ValPhe: 3.786 ± 0.352
ValGly: 3.245 ± 0.561
ValHis: 1.406 ± 0.358
ValIle: 4.219 ± 0.405
ValLys: 4.652 ± 0.685
ValLeu: 7.572 ± 0.912
ValMet: 1.514 ± 0.271
ValAsn: 4.003 ± 1.428
ValPro: 3.354 ± 0.246
ValGln: 3.137 ± 0.843
ValArg: 2.704 ± 0.298
ValSer: 4.868 ± 0.572
ValThr: 6.491 ± 0.93
ValVal: 6.383 ± 0.881
ValTrp: 0.433 ± 0.216
ValTyr: 4.435 ± 0.728
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.649 ± 0.212
TrpCys: 0.216 ± 0.076
TrpAsp: 0.541 ± 0.482
TrpGlu: 0.541 ± 0.106
TrpPhe: 1.082 ± 0.477
TrpGly: 0.216 ± 0.193
TrpHis: 0.433 ± 0.313
TrpIle: 0.541 ± 0.22
TrpLys: 0.541 ± 0.162
TrpLeu: 1.514 ± 0.568
TrpMet: 0.0 ± 0.0
TrpAsn: 1.298 ± 0.241
TrpPro: 0.433 ± 0.349
TrpGln: 0.325 ± 0.156
TrpArg: 0.325 ± 0.209
TrpSer: 0.757 ± 0.275
TrpThr: 0.541 ± 0.235
TrpVal: 0.865 ± 0.378
TrpTrp: 0.108 ± 0.096
TrpTyr: 0.433 ± 0.26
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.488 ± 0.8
TyrCys: 1.406 ± 0.547
TyrAsp: 2.813 ± 1.114
TyrGlu: 1.623 ± 0.316
TyrPhe: 2.921 ± 0.607
TyrGly: 1.839 ± 0.207
TyrHis: 1.082 ± 0.282
TyrIle: 1.623 ± 0.307
TyrLys: 3.894 ± 0.442
TyrLeu: 3.137 ± 0.919
TyrMet: 0.974 ± 0.606
TyrAsn: 2.704 ± 0.436
TyrPro: 1.731 ± 0.622
TyrGln: 1.514 ± 0.519
TyrArg: 2.38 ± 0.282
TyrSer: 2.596 ± 0.538
TyrThr: 2.704 ± 0.383
TyrVal: 4.435 ± 0.479
TyrTrp: 0.325 ± 0.16
TyrTyr: 2.704 ± 0.316
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (9245 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
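One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. The sketch below follows that reading. It is an assumption, not the database's actual procedure: the function names are hypothetical, the sample standard deviation is used as the error, and frequencies are normalized by the number of overlapping pairs.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement, so whole
    proteins are resampled rather than individual residues (assumed design).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

With only 7 proteins, as in this proteome, each replicate is drawn from a small pool, so a single long protein that is included or left out can shift the estimate noticeably.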

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski