Amino acid dipeptide frequency for Ferret coronavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.
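As an illustration, dipeptide frequencies can be computed by counting overlapping pairs of adjacent residues. The sketch below is a minimal example, not the database's actual pipeline: the function name dipeptide_frequencies is hypothetical, sequences use one-letter codes (the table uses three-letter codes), and it is an assumption that pairs are counted only within each protein and normalised by the total number of pairs.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each
    protein only (no pair spans two proteins), and the counts are
    normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real coronavirus sequences).
freqs = dipeptide_frequencies(["MKVLA", "MKV"])
```

Here "MKVLA" contributes the pairs MK, KV, VL and LA, and "MKV" contributes MK and KV, so MK accounts for 2 of the 6 pairs.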

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
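The uniform expectation and the over/under-representation labels can be sketched as follows. The function classify and its labels are hypothetical, and a plain comparison against the expected value is an assumption; the exact rule used for the red and blue colouring is not stated here.

```python
# 20 standard amino acids give 20 * 20 = 400 ordered dipeptides.
N_AMINO_ACIDS = 20
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS ** 2  # 2.5 per mille = 0.25%


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"   # shown in red on the original page
    if value_per_mille < expected:
        return "under"  # shown in blue on the original page
    return "as expected"


# Two values taken from the table.
valval = classify(12.577)  # ValVal
histrp = classify(0.075)   # HisTrp
```

With these inputs, ValVal (12.577‰) lies above the 2.5‰ expectation and HisTrp (0.075‰) lies below it.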

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.498 ± 0.619
AlaCys: 2.184 ± 0.644
AlaAsp: 2.41 ± 0.31
AlaGlu: 1.958 ± 0.364
AlaPhe: 3.69 ± 0.362
AlaGly: 3.54 ± 0.341
AlaHis: 1.28 ± 0.22
AlaIle: 4.142 ± 0.639
AlaLys: 4.218 ± 0.844
AlaLeu: 5.799 ± 0.792
AlaMet: 1.582 ± 0.572
AlaAsn: 4.669 ± 0.533
AlaPro: 2.184 ± 0.576
AlaGln: 2.109 ± 0.796
AlaArg: 2.109 ± 0.421
AlaSer: 4.519 ± 0.394
AlaThr: 3.615 ± 0.452
AlaVal: 5.498 ± 0.874
AlaTrp: 0.527 ± 0.094
AlaTyr: 3.088 ± 0.383
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.582 ± 0.463
CysCys: 1.205 ± 0.355
CysAsp: 1.732 ± 0.423
CysGlu: 0.979 ± 0.234
CysPhe: 2.033 ± 0.3
CysGly: 2.711 ± 0.492
CysHis: 0.377 ± 0.325
CysIle: 1.582 ± 0.375
CysLys: 1.808 ± 0.318
CysLeu: 2.787 ± 0.413
CysMet: 0.678 ± 0.243
CysAsn: 1.13 ± 0.48
CysPro: 1.13 ± 0.208
CysGln: 0.452 ± 0.19
CysArg: 1.356 ± 0.269
CysSer: 2.184 ± 0.284
CysThr: 2.711 ± 0.307
CysVal: 2.937 ± 0.511
CysTrp: 0.828 ± 0.299
CysTyr: 2.561 ± 0.617
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.615 ± 0.893
AspCys: 1.657 ± 0.281
AspAsp: 3.69 ± 0.779
AspGlu: 2.109 ± 0.388
AspPhe: 3.013 ± 0.521
AspGly: 4.745 ± 0.61
AspHis: 1.13 ± 0.352
AspIle: 2.636 ± 0.303
AspLys: 2.636 ± 0.342
AspLeu: 4.142 ± 0.76
AspMet: 2.033 ± 0.471
AspAsn: 3.314 ± 0.503
AspPro: 1.431 ± 0.314
AspGln: 0.979 ± 0.255
AspArg: 1.356 ± 0.427
AspSer: 3.916 ± 0.447
AspThr: 2.184 ± 0.231
AspVal: 6.853 ± 1.119
AspTrp: 0.753 ± 0.203
AspTyr: 3.163 ± 0.332
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.184 ± 0.198
GluCys: 0.979 ± 0.234
GluAsp: 2.485 ± 0.416
GluGlu: 2.109 ± 0.48
GluPhe: 2.259 ± 0.586
GluGly: 3.615 ± 0.581
GluHis: 0.527 ± 0.245
GluIle: 1.808 ± 0.715
GluLys: 1.431 ± 0.29
GluLeu: 3.766 ± 0.337
GluMet: 0.301 ± 0.073
GluAsn: 1.732 ± 0.29
GluPro: 1.506 ± 0.183
GluGln: 2.485 ± 0.41
GluArg: 1.657 ± 0.21
GluSer: 2.862 ± 0.259
GluThr: 2.184 ± 0.554
GluVal: 3.615 ± 0.457
GluTrp: 0.377 ± 0.082
GluTyr: 1.732 ± 0.284
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.314 ± 0.487
PheCys: 1.657 ± 0.213
PheAsp: 4.443 ± 0.515
PheGlu: 2.787 ± 0.354
PhePhe: 2.636 ± 0.685
PheGly: 4.82 ± 0.494
PheHis: 0.151 ± 0.105
PheIle: 2.259 ± 0.34
PheLys: 5.272 ± 0.714
PheLeu: 3.389 ± 0.638
PheMet: 1.13 ± 0.289
PheAsn: 3.615 ± 1.126
PhePro: 0.678 ± 0.585
PheGln: 0.452 ± 0.172
PheArg: 0.979 ± 0.303
PheSer: 3.314 ± 0.81
PheThr: 2.41 ± 0.339
PheVal: 6.853 ± 1.026
PheTrp: 0.904 ± 0.492
PheTyr: 3.464 ± 0.423
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.916 ± 0.595
GlyCys: 3.464 ± 0.364
GlyAsp: 5.272 ± 0.773
GlyGlu: 2.335 ± 0.245
GlyPhe: 4.443 ± 0.52
GlyGly: 4.895 ± 0.546
GlyHis: 0.904 ± 0.217
GlyIle: 2.184 ± 0.371
GlyLys: 4.669 ± 0.882
GlyLeu: 5.498 ± 0.52
GlyMet: 1.657 ± 0.583
GlyAsn: 4.519 ± 0.59
GlyPro: 1.732 ± 0.262
GlyGln: 0.603 ± 0.412
GlyArg: 1.28 ± 0.411
GlySer: 4.368 ± 0.61
GlyThr: 3.615 ± 0.605
GlyVal: 7.381 ± 0.505
GlyTrp: 0.226 ± 0.182
GlyTyr: 3.615 ± 0.67
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.13 ± 0.216
HisCys: 0.678 ± 0.172
HisAsp: 0.979 ± 0.17
HisGlu: 0.979 ± 0.22
HisPhe: 0.979 ± 0.51
HisGly: 0.678 ± 0.175
HisHis: 0.377 ± 0.082
HisIle: 0.828 ± 0.319
HisLys: 1.732 ± 0.608
HisLeu: 1.808 ± 0.331
HisMet: 0.226 ± 0.166
HisAsn: 1.732 ± 0.239
HisPro: 0.828 ± 0.207
HisGln: 0.527 ± 0.342
HisArg: 0.301 ± 0.114
HisSer: 1.054 ± 0.234
HisThr: 1.28 ± 0.243
HisVal: 1.205 ± 0.428
HisTrp: 0.075 ± 0.122
HisTyr: 1.205 ± 0.481
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.238 ± 0.842
IleCys: 0.904 ± 0.155
IleAsp: 2.862 ± 0.813
IleGlu: 1.808 ± 0.536
IlePhe: 1.506 ± 0.546
IleGly: 2.636 ± 0.307
IleHis: 0.377 ± 0.158
IleIle: 2.259 ± 0.71
IleLys: 4.067 ± 0.967
IleLeu: 2.937 ± 0.328
IleMet: 1.883 ± 0.531
IleAsn: 2.711 ± 0.888
IlePro: 1.883 ± 0.409
IleGln: 1.205 ± 0.248
IleArg: 1.356 ± 0.321
IleSer: 2.937 ± 0.468
IleThr: 2.561 ± 0.456
IleVal: 6.853 ± 0.676
IleTrp: 0.678 ± 0.199
IleTyr: 1.883 ± 0.62
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.594 ± 1.196
LysCys: 1.808 ± 0.54
LysAsp: 2.561 ± 0.691
LysGlu: 2.561 ± 0.249
LysPhe: 3.464 ± 0.973
LysGly: 3.163 ± 0.745
LysHis: 3.013 ± 0.493
LysIle: 1.958 ± 0.573
LysLys: 1.356 ± 0.283
LysLeu: 6.1 ± 1.021
LysMet: 1.657 ± 0.269
LysAsn: 2.335 ± 0.189
LysPro: 4.142 ± 1.197
LysGln: 2.109 ± 0.55
LysArg: 1.506 ± 0.393
LysSer: 4.443 ± 0.766
LysThr: 3.615 ± 0.367
LysVal: 4.895 ± 0.889
LysTrp: 0.603 ± 0.354
LysTyr: 2.711 ± 0.252
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.594 ± 1.272
LeuCys: 3.088 ± 0.393
LeuAsp: 4.293 ± 0.639
LeuGlu: 5.272 ± 0.632
LeuPhe: 3.916 ± 0.812
LeuGly: 5.347 ± 0.559
LeuHis: 2.033 ± 0.239
LeuIle: 3.841 ± 0.594
LeuLys: 5.874 ± 0.973
LeuLeu: 9.113 ± 0.742
LeuMet: 2.033 ± 0.455
LeuAsn: 4.443 ± 0.693
LeuPro: 4.067 ± 1.371
LeuGln: 4.067 ± 0.475
LeuArg: 2.41 ± 0.475
LeuSer: 8.51 ± 0.84
LeuThr: 5.272 ± 0.873
LeuVal: 6.552 ± 0.671
LeuTrp: 1.13 ± 0.313
LeuTyr: 3.766 ± 0.518
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.205 ± 0.186
MetCys: 1.28 ± 0.481
MetAsp: 0.904 ± 0.182
MetGlu: 0.753 ± 0.164
MetPhe: 1.506 ± 0.505
MetGly: 1.054 ± 0.155
MetHis: 0.678 ± 0.243
MetIle: 0.678 ± 0.356
MetLys: 0.904 ± 0.259
MetLeu: 4.443 ± 0.733
MetMet: 0.603 ± 0.234
MetAsn: 0.603 ± 0.233
MetPro: 0.828 ± 0.237
MetGln: 0.979 ± 0.355
MetArg: 1.13 ± 0.353
MetSer: 1.356 ± 0.331
MetThr: 1.883 ± 0.554
MetVal: 2.109 ± 0.653
MetTrp: 0.301 ± 0.342
MetTyr: 1.506 ± 0.367
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.293 ± 0.495
AsnCys: 2.109 ± 0.391
AsnAsp: 2.259 ± 0.204
AsnGlu: 2.109 ± 0.169
AsnPhe: 2.711 ± 0.402
AsnGly: 6.251 ± 0.711
AsnHis: 0.904 ± 0.418
AsnIle: 2.636 ± 0.385
AsnLys: 2.561 ± 0.727
AsnLeu: 5.799 ± 0.964
AsnMet: 1.506 ± 0.197
AsnAsn: 3.992 ± 0.554
AsnPro: 1.205 ± 0.436
AsnGln: 1.28 ± 0.722
AsnArg: 2.033 ± 0.513
AsnSer: 4.218 ± 1.328
AsnThr: 3.013 ± 0.312
AsnVal: 7.004 ± 0.577
AsnTrp: 0.678 ± 0.423
AsnTyr: 2.335 ± 0.26
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.657 ± 0.288
ProCys: 0.527 ± 0.183
ProAsp: 2.109 ± 0.259
ProGlu: 1.431 ± 0.515
ProPhe: 2.109 ± 0.392
ProGly: 2.109 ± 0.389
ProHis: 0.452 ± 0.14
ProIle: 2.033 ± 0.241
ProLys: 1.808 ± 0.365
ProLeu: 2.787 ± 0.478
ProMet: 0.904 ± 0.165
ProAsn: 1.808 ± 0.187
ProPro: 0.904 ± 0.231
ProGln: 0.678 ± 0.464
ProArg: 1.356 ± 0.448
ProSer: 3.54 ± 0.478
ProThr: 2.711 ± 0.491
ProVal: 3.916 ± 0.279
ProTrp: 0.678 ± 0.145
ProTyr: 1.054 ± 0.236
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.259 ± 0.612
GlnCys: 0.904 ± 0.155
GlnAsp: 0.828 ± 0.211
GlnGlu: 0.452 ± 0.174
GlnPhe: 0.753 ± 0.193
GlnGly: 1.582 ± 0.354
GlnHis: 1.054 ± 0.369
GlnIle: 0.678 ± 0.485
GlnLys: 1.808 ± 0.268
GlnLeu: 3.088 ± 0.199
GlnMet: 0.678 ± 0.221
GlnAsn: 1.808 ± 0.327
GlnPro: 1.356 ± 0.277
GlnGln: 0.678 ± 0.735
GlnArg: 1.205 ± 0.422
GlnSer: 2.787 ± 0.338
GlnThr: 2.259 ± 1.106
GlnVal: 1.356 ± 0.167
GlnTrp: 0.452 ± 0.161
GlnTyr: 1.356 ± 0.405
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.711 ± 0.623
ArgCys: 1.28 ± 0.252
ArgAsp: 2.033 ± 0.351
ArgGlu: 1.28 ± 0.334
ArgPhe: 2.561 ± 0.643
ArgGly: 2.109 ± 0.452
ArgHis: 0.226 ± 0.166
ArgIle: 0.979 ± 0.183
ArgLys: 1.657 ± 0.626
ArgLeu: 3.238 ± 0.634
ArgMet: 0.678 ± 0.242
ArgAsn: 1.732 ± 0.342
ArgPro: 0.753 ± 0.601
ArgGln: 0.979 ± 0.176
ArgArg: 1.883 ± 0.76
ArgSer: 2.862 ± 2.071
ArgThr: 1.13 ± 0.39
ArgVal: 3.013 ± 0.473
ArgTrp: 0.452 ± 0.203
ArgTyr: 1.13 ± 0.367
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.669 ± 0.715
SerCys: 1.958 ± 0.426
SerAsp: 4.067 ± 0.32
SerGlu: 1.732 ± 0.392
SerPhe: 2.937 ± 0.738
SerGly: 4.745 ± 0.793
SerHis: 1.506 ± 0.287
SerIle: 4.443 ± 0.962
SerLys: 5.272 ± 0.737
SerLeu: 5.799 ± 0.774
SerMet: 1.808 ± 0.282
SerAsn: 4.142 ± 0.601
SerPro: 1.13 ± 0.261
SerGln: 2.259 ± 0.495
SerArg: 2.41 ± 1.777
SerSer: 5.046 ± 1.11
SerThr: 4.594 ± 0.596
SerVal: 8.661 ± 0.791
SerTrp: 1.205 ± 0.368
SerTyr: 3.766 ± 0.159
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.013 ± 0.417
ThrCys: 1.431 ± 0.297
ThrAsp: 2.561 ± 0.349
ThrGlu: 1.808 ± 0.355
ThrPhe: 3.314 ± 0.555
ThrGly: 4.067 ± 0.839
ThrHis: 1.205 ± 0.208
ThrIle: 4.293 ± 0.707
ThrLys: 2.636 ± 0.379
ThrLeu: 5.423 ± 1.108
ThrMet: 1.582 ± 0.373
ThrAsn: 3.238 ± 0.407
ThrPro: 3.013 ± 0.611
ThrGln: 1.958 ± 0.455
ThrArg: 1.808 ± 0.204
ThrSer: 4.293 ± 0.871
ThrThr: 4.142 ± 0.719
ThrVal: 6.025 ± 0.84
ThrTrp: 0.377 ± 0.169
ThrTyr: 2.561 ± 0.375
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.703 ± 0.534
ValCys: 3.916 ± 0.53
ValAsp: 5.197 ± 1.016
ValGlu: 4.82 ± 0.838
ValPhe: 6.251 ± 1.08
ValGly: 4.971 ± 0.694
ValHis: 1.808 ± 0.413
ValIle: 4.82 ± 0.716
ValLys: 5.95 ± 0.929
ValLeu: 9.339 ± 0.533
ValMet: 1.808 ± 0.436
ValAsn: 7.607 ± 0.47
ValPro: 3.841 ± 0.501
ValGln: 2.259 ± 0.55
ValArg: 3.615 ± 0.297
ValSer: 6.552 ± 0.599
ValThr: 5.95 ± 0.569
ValVal: 12.577 ± 1.373
ValTrp: 0.678 ± 0.131
ValTyr: 3.992 ± 0.485
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.452 ± 0.077
TrpCys: 0.151 ± 0.089
TrpAsp: 0.753 ± 0.276
TrpGlu: 0.603 ± 0.19
TrpPhe: 1.431 ± 0.371
TrpGly: 0.377 ± 0.155
TrpHis: 0.151 ± 0.176
TrpIle: 0.452 ± 0.213
TrpLys: 0.226 ± 0.131
TrpLeu: 1.431 ± 0.237
TrpMet: 0.075 ± 0.045
TrpAsn: 0.452 ± 0.127
TrpPro: 0.301 ± 0.193
TrpGln: 0.075 ± 0.168
TrpArg: 0.527 ± 0.185
TrpSer: 1.506 ± 0.62
TrpThr: 0.678 ± 0.334
TrpVal: 0.828 ± 0.153
TrpTrp: 0.377 ± 0.277
TrpTyr: 1.205 ± 0.23
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.992 ± 0.874
TyrCys: 1.582 ± 0.212
TyrAsp: 3.992 ± 1.063
TyrGlu: 1.732 ± 0.279
TyrPhe: 3.013 ± 0.289
TyrGly: 3.163 ± 0.332
TyrHis: 0.452 ± 0.216
TyrIle: 2.033 ± 0.472
TyrLys: 2.711 ± 0.419
TyrLeu: 2.937 ± 0.64
TyrMet: 1.808 ± 0.276
TyrAsn: 3.314 ± 0.592
TyrPro: 1.657 ± 0.3
TyrGln: 1.205 ± 0.312
TyrArg: 2.41 ± 0.43
TyrSer: 1.883 ± 0.348
TyrThr: 2.937 ± 0.279
TyrVal: 4.519 ± 0.762
TyrTrp: 0.753 ± 0.078
TyrTyr: 3.314 ± 0.368
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (13279 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
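A protein-level bootstrap can be sketched roughly as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the standard deviation across replicates. The exact resampling scheme and error statistic behind the ± values are not specified here, and the names pair_counts and bootstrap_error are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumption: proteins, not residues, are the resampling unit, and
    the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (not real coronavirus sequences).
err = bootstrap_error(["MKVLAVV", "MKVVA", "AVVLK"], "VV")
```

With only 9 proteins in the proteome, each replicate can differ noticeably from the original set, which is consistent with some entries having errors comparable to their values.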

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski