Amino acid dipepetide frequency for Coronavirus AcCoV-JC34

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.463AlaAla: 5.463 ± 1.154
1.895AlaCys: 1.895 ± 0.527
3.456AlaAsp: 3.456 ± 0.59
1.672AlaGlu: 1.672 ± 0.446
3.567AlaPhe: 3.567 ± 0.338
3.679AlaGly: 3.679 ± 0.285
1.003AlaHis: 1.003 ± 0.336
4.125AlaIle: 4.125 ± 0.873
3.679AlaLys: 3.679 ± 1.04
6.132AlaLeu: 6.132 ± 0.576
1.895AlaMet: 1.895 ± 0.428
4.459AlaAsn: 4.459 ± 0.822
2.564AlaPro: 2.564 ± 0.762
2.453AlaGln: 2.453 ± 0.573
3.01AlaArg: 3.01 ± 0.58
5.017AlaSer: 5.017 ± 1.294
4.013AlaThr: 4.013 ± 0.43
6.689AlaVal: 6.689 ± 0.974
0.223AlaTrp: 0.223 ± 0.156
2.564AlaTyr: 2.564 ± 0.265
0.0AlaXaa: 0.0 ± 0.0
Cys
1.895CysAla: 1.895 ± 0.648
2.23CysCys: 2.23 ± 0.566
2.787CysAsp: 2.787 ± 0.43
0.557CysGlu: 0.557 ± 0.433
1.561CysPhe: 1.561 ± 0.417
2.341CysGly: 2.341 ± 0.364
0.334CysHis: 0.334 ± 0.143
2.118CysIle: 2.118 ± 0.27
2.453CysLys: 2.453 ± 0.595
2.787CysLeu: 2.787 ± 0.386
0.223CysMet: 0.223 ± 0.151
2.453CysAsn: 2.453 ± 0.522
1.115CysPro: 1.115 ± 0.243
0.557CysGln: 0.557 ± 0.206
0.78CysArg: 0.78 ± 0.288
3.122CysSer: 3.122 ± 0.462
3.122CysThr: 3.122 ± 0.544
4.013CysVal: 4.013 ± 0.539
0.892CysTrp: 0.892 ± 0.207
2.453CysTyr: 2.453 ± 0.514
0.0CysXaa: 0.0 ± 0.0
Asp
4.459AspAla: 4.459 ± 0.551
2.341AspCys: 2.341 ± 0.295
3.122AspAsp: 3.122 ± 0.423
1.561AspGlu: 1.561 ± 0.308
3.233AspPhe: 3.233 ± 0.878
6.243AspGly: 6.243 ± 0.473
0.892AspHis: 0.892 ± 0.142
2.564AspIle: 2.564 ± 0.934
1.561AspLys: 1.561 ± 0.332
4.682AspLeu: 4.682 ± 1.012
0.78AspMet: 0.78 ± 0.143
2.787AspAsn: 2.787 ± 0.54
1.115AspPro: 1.115 ± 0.293
1.672AspGln: 1.672 ± 0.444
1.784AspArg: 1.784 ± 0.475
2.676AspSer: 2.676 ± 0.404
3.01AspThr: 3.01 ± 0.536
5.574AspVal: 5.574 ± 0.908
0.446AspTrp: 0.446 ± 0.3
4.125AspTyr: 4.125 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
2.23GluAla: 2.23 ± 0.474
1.226GluCys: 1.226 ± 0.478
2.007GluAsp: 2.007 ± 0.534
1.449GluGlu: 1.449 ± 0.205
2.341GluPhe: 2.341 ± 0.303
2.899GluGly: 2.899 ± 0.837
1.561GluHis: 1.561 ± 0.377
2.23GluIle: 2.23 ± 0.489
1.449GluLys: 1.449 ± 0.412
3.567GluLeu: 3.567 ± 0.469
0.334GluMet: 0.334 ± 0.273
1.561GluAsn: 1.561 ± 0.289
1.895GluPro: 1.895 ± 0.626
1.784GluGln: 1.784 ± 0.213
2.007GluArg: 2.007 ± 0.296
1.449GluSer: 1.449 ± 0.325
1.226GluThr: 1.226 ± 0.399
3.679GluVal: 3.679 ± 0.753
0.78GluTrp: 0.78 ± 0.288
1.449GluTyr: 1.449 ± 0.474
0.0GluXaa: 0.0 ± 0.0
Phe
2.899PheAla: 2.899 ± 0.43
2.118PheCys: 2.118 ± 0.251
5.128PheAsp: 5.128 ± 0.36
2.899PheGlu: 2.899 ± 0.45
3.233PhePhe: 3.233 ± 0.69
3.679PheGly: 3.679 ± 0.396
0.669PheHis: 0.669 ± 0.231
2.564PheIle: 2.564 ± 0.62
4.348PheLys: 4.348 ± 0.979
3.79PheLeu: 3.79 ± 0.536
1.338PheMet: 1.338 ± 0.388
5.017PheAsn: 5.017 ± 0.719
1.115PhePro: 1.115 ± 0.35
1.226PheGln: 1.226 ± 0.675
1.338PheArg: 1.338 ± 0.228
6.02PheSer: 6.02 ± 1.086
3.122PheThr: 3.122 ± 0.414
6.355PheVal: 6.355 ± 0.679
1.338PheTrp: 1.338 ± 0.35
3.344PheTyr: 3.344 ± 0.613
0.0PheXaa: 0.0 ± 0.0
Gly
4.794GlyAla: 4.794 ± 0.999
2.899GlyCys: 2.899 ± 0.646
3.79GlyAsp: 3.79 ± 0.689
1.561GlyGlu: 1.561 ± 0.408
4.905GlyPhe: 4.905 ± 0.449
4.125GlyGly: 4.125 ± 0.426
1.115GlyHis: 1.115 ± 0.51
2.564GlyIle: 2.564 ± 0.703
3.01GlyLys: 3.01 ± 0.736
5.909GlyLeu: 5.909 ± 0.698
1.003GlyMet: 1.003 ± 0.374
2.676GlyAsn: 2.676 ± 0.564
1.784GlyPro: 1.784 ± 0.501
1.449GlyGln: 1.449 ± 0.394
2.23GlyArg: 2.23 ± 0.418
5.128GlySer: 5.128 ± 0.752
4.236GlyThr: 4.236 ± 0.919
7.023GlyVal: 7.023 ± 0.835
0.557GlyTrp: 0.557 ± 0.245
3.902GlyTyr: 3.902 ± 0.413
0.0GlyXaa: 0.0 ± 0.0
His
1.561HisAla: 1.561 ± 0.474
0.669HisCys: 0.669 ± 0.095
0.78HisAsp: 0.78 ± 0.104
0.334HisGlu: 0.334 ± 0.274
1.449HisPhe: 1.449 ± 0.334
1.784HisGly: 1.784 ± 0.544
0.446HisHis: 0.446 ± 0.23
0.78HisIle: 0.78 ± 0.234
1.003HisLys: 1.003 ± 0.336
2.23HisLeu: 2.23 ± 0.416
0.223HisMet: 0.223 ± 0.154
0.892HisAsn: 0.892 ± 0.207
0.669HisPro: 0.669 ± 0.138
0.334HisGln: 0.334 ± 0.112
1.003HisArg: 1.003 ± 0.455
0.78HisSer: 0.78 ± 0.234
1.226HisThr: 1.226 ± 0.496
2.676HisVal: 2.676 ± 0.373
0.334HisTrp: 0.334 ± 0.15
0.78HisTyr: 0.78 ± 0.116
0.0HisXaa: 0.0 ± 0.0
Ile
2.564IleAla: 2.564 ± 0.702
1.449IleCys: 1.449 ± 0.271
3.01IleAsp: 3.01 ± 0.593
2.23IleGlu: 2.23 ± 0.763
2.118IlePhe: 2.118 ± 0.66
2.118IleGly: 2.118 ± 0.647
0.669IleHis: 0.669 ± 0.095
1.895IleIle: 1.895 ± 0.474
2.787IleLys: 2.787 ± 0.692
5.574IleLeu: 5.574 ± 0.789
1.226IleMet: 1.226 ± 0.455
3.122IleAsn: 3.122 ± 0.516
1.561IlePro: 1.561 ± 0.279
1.115IleGln: 1.115 ± 0.51
1.672IleArg: 1.672 ± 0.385
2.564IleSer: 2.564 ± 0.289
4.348IleThr: 4.348 ± 0.848
5.128IleVal: 5.128 ± 0.644
0.446IleTrp: 0.446 ± 0.252
1.449IleTyr: 1.449 ± 0.342
0.0IleXaa: 0.0 ± 0.0
Lys
3.567LysAla: 3.567 ± 0.788
2.118LysCys: 2.118 ± 0.698
2.453LysAsp: 2.453 ± 0.639
2.787LysGlu: 2.787 ± 0.562
3.902LysPhe: 3.902 ± 0.504
2.676LysGly: 2.676 ± 0.636
1.338LysHis: 1.338 ± 0.414
2.564LysIle: 2.564 ± 0.466
2.341LysLys: 2.341 ± 0.749
6.02LysLeu: 6.02 ± 1.334
0.892LysMet: 0.892 ± 0.19
2.007LysAsn: 2.007 ± 0.351
2.899LysPro: 2.899 ± 0.633
1.338LysGln: 1.338 ± 0.189
1.895LysArg: 1.895 ± 0.597
2.899LysSer: 2.899 ± 0.259
3.01LysThr: 3.01 ± 0.541
3.01LysVal: 3.01 ± 0.49
0.223LysTrp: 0.223 ± 0.124
3.122LysTyr: 3.122 ± 0.349
0.0LysXaa: 0.0 ± 0.0
Leu
5.574LeuAla: 5.574 ± 0.766
3.456LeuCys: 3.456 ± 0.587
2.787LeuAsp: 2.787 ± 0.531
3.679LeuGlu: 3.679 ± 0.576
5.463LeuPhe: 5.463 ± 1.019
5.128LeuGly: 5.128 ± 0.328
2.341LeuHis: 2.341 ± 0.371
3.567LeuIle: 3.567 ± 0.927
4.348LeuLys: 4.348 ± 0.371
7.915LeuLeu: 7.915 ± 1.587
2.007LeuMet: 2.007 ± 0.318
4.013LeuAsn: 4.013 ± 0.555
3.567LeuPro: 3.567 ± 0.652
3.679LeuGln: 3.679 ± 0.386
3.902LeuArg: 3.902 ± 0.402
7.804LeuSer: 7.804 ± 0.819
5.017LeuThr: 5.017 ± 0.667
7.023LeuVal: 7.023 ± 0.721
1.672LeuTrp: 1.672 ± 0.305
5.463LeuTyr: 5.463 ± 0.813
0.0LeuXaa: 0.0 ± 0.0
Met
1.338MetAla: 1.338 ± 0.269
0.446MetCys: 0.446 ± 0.23
1.003MetAsp: 1.003 ± 0.186
0.557MetGlu: 0.557 ± 0.129
1.338MetPhe: 1.338 ± 0.308
1.003MetGly: 1.003 ± 0.491
0.446MetHis: 0.446 ± 0.138
1.003MetIle: 1.003 ± 0.482
0.334MetLys: 0.334 ± 0.226
2.676MetLeu: 2.676 ± 0.415
0.334MetMet: 0.334 ± 0.274
1.115MetAsn: 1.115 ± 0.284
1.003MetPro: 1.003 ± 0.321
0.78MetGln: 0.78 ± 0.441
0.557MetArg: 0.557 ± 0.177
1.226MetSer: 1.226 ± 0.33
1.003MetThr: 1.003 ± 0.38
2.118MetVal: 2.118 ± 0.443
0.223MetTrp: 0.223 ± 0.154
1.226MetTyr: 1.226 ± 0.326
0.0MetXaa: 0.0 ± 0.0
Asn
4.348AsnAla: 4.348 ± 0.66
2.899AsnCys: 2.899 ± 0.559
2.453AsnAsp: 2.453 ± 0.361
2.118AsnGlu: 2.118 ± 0.359
4.682AsnPhe: 4.682 ± 0.472
5.686AsnGly: 5.686 ± 1.07
1.226AsnHis: 1.226 ± 0.47
2.787AsnIle: 2.787 ± 0.751
3.344AsnLys: 3.344 ± 0.602
4.571AsnLeu: 4.571 ± 0.655
1.003AsnMet: 1.003 ± 0.278
4.013AsnAsn: 4.013 ± 1.039
2.118AsnPro: 2.118 ± 0.446
1.003AsnGln: 1.003 ± 0.413
1.895AsnArg: 1.895 ± 0.638
4.682AsnSer: 4.682 ± 0.573
2.23AsnThr: 2.23 ± 0.231
7.135AsnVal: 7.135 ± 1.025
1.003AsnTrp: 1.003 ± 0.224
2.007AsnTyr: 2.007 ± 0.394
0.0AsnXaa: 0.0 ± 0.0
Pro
2.453ProAla: 2.453 ± 0.616
1.338ProCys: 1.338 ± 0.189
1.784ProAsp: 1.784 ± 0.761
2.118ProGlu: 2.118 ± 0.661
1.003ProPhe: 1.003 ± 0.246
2.676ProGly: 2.676 ± 0.214
1.003ProHis: 1.003 ± 0.271
1.561ProIle: 1.561 ± 0.261
1.561ProLys: 1.561 ± 0.147
3.679ProLeu: 3.679 ± 0.746
0.223ProMet: 0.223 ± 0.151
2.341ProAsn: 2.341 ± 0.772
1.672ProPro: 1.672 ± 0.576
1.226ProGln: 1.226 ± 0.508
1.115ProArg: 1.115 ± 0.169
2.453ProSer: 2.453 ± 0.568
2.564ProThr: 2.564 ± 0.749
2.564ProVal: 2.564 ± 0.44
0.557ProTrp: 0.557 ± 0.255
0.78ProTyr: 0.78 ± 0.578
0.0ProXaa: 0.0 ± 0.0
Gln
1.784GlnAla: 1.784 ± 0.285
1.115GlnCys: 1.115 ± 0.35
1.561GlnAsp: 1.561 ± 0.354
1.226GlnGlu: 1.226 ± 0.273
1.226GlnPhe: 1.226 ± 0.189
1.672GlnGly: 1.672 ± 0.286
0.334GlnHis: 0.334 ± 0.291
1.672GlnIle: 1.672 ± 0.394
0.669GlnLys: 0.669 ± 0.41
4.236GlnLeu: 4.236 ± 0.642
1.115GlnMet: 1.115 ± 0.266
1.561GlnAsn: 1.561 ± 0.272
1.672GlnPro: 1.672 ± 0.452
1.338GlnGln: 1.338 ± 0.463
1.338GlnArg: 1.338 ± 0.294
2.118GlnSer: 2.118 ± 0.506
1.449GlnThr: 1.449 ± 0.464
1.784GlnVal: 1.784 ± 0.511
0.223GlnTrp: 0.223 ± 0.183
1.561GlnTyr: 1.561 ± 0.47
0.0GlnXaa: 0.0 ± 0.0
Arg
3.233ArgAla: 3.233 ± 0.667
1.003ArgCys: 1.003 ± 0.36
1.115ArgAsp: 1.115 ± 0.35
0.557ArgGlu: 0.557 ± 0.17
3.233ArgPhe: 3.233 ± 0.484
2.453ArgGly: 2.453 ± 0.884
1.115ArgHis: 1.115 ± 0.301
2.23ArgIle: 2.23 ± 0.349
2.23ArgLys: 2.23 ± 0.482
2.564ArgLeu: 2.564 ± 0.503
0.446ArgMet: 0.446 ± 0.086
2.787ArgAsn: 2.787 ± 1.001
1.449ArgPro: 1.449 ± 0.485
0.78ArgGln: 0.78 ± 0.104
1.115ArgArg: 1.115 ± 0.212
2.007ArgSer: 2.007 ± 0.427
1.449ArgThr: 1.449 ± 0.605
2.453ArgVal: 2.453 ± 0.583
0.446ArgTrp: 0.446 ± 0.178
2.118ArgTyr: 2.118 ± 0.25
0.0ArgXaa: 0.0 ± 0.0
Ser
4.571SerAla: 4.571 ± 0.53
2.23SerCys: 2.23 ± 0.875
3.902SerAsp: 3.902 ± 0.409
2.899SerGlu: 2.899 ± 0.618
4.348SerPhe: 4.348 ± 1.013
4.125SerGly: 4.125 ± 0.854
1.449SerHis: 1.449 ± 0.169
2.899SerIle: 2.899 ± 0.599
4.682SerLys: 4.682 ± 1.026
4.459SerLeu: 4.459 ± 0.276
2.118SerMet: 2.118 ± 0.377
5.351SerAsn: 5.351 ± 0.358
1.115SerPro: 1.115 ± 0.171
2.007SerGln: 2.007 ± 0.602
2.118SerArg: 2.118 ± 0.734
4.794SerSer: 4.794 ± 1.219
4.013SerThr: 4.013 ± 0.468
8.473SerVal: 8.473 ± 0.966
1.338SerTrp: 1.338 ± 0.419
2.787SerTyr: 2.787 ± 0.3
0.0SerXaa: 0.0 ± 0.0
Thr
4.348ThrAla: 4.348 ± 0.857
1.226ThrCys: 1.226 ± 0.32
2.899ThrAsp: 2.899 ± 0.253
2.23ThrGlu: 2.23 ± 0.573
3.567ThrPhe: 3.567 ± 0.652
4.348ThrGly: 4.348 ± 0.82
0.446ThrHis: 0.446 ± 0.206
3.122ThrIle: 3.122 ± 0.39
3.233ThrLys: 3.233 ± 0.711
4.794ThrLeu: 4.794 ± 0.563
1.449ThrMet: 1.449 ± 0.945
4.571ThrAsn: 4.571 ± 0.674
2.564ThrPro: 2.564 ± 0.408
2.564ThrGln: 2.564 ± 0.65
2.118ThrArg: 2.118 ± 0.302
3.679ThrSer: 3.679 ± 0.964
4.459ThrThr: 4.459 ± 0.562
5.017ThrVal: 5.017 ± 0.433
0.446ThrTrp: 0.446 ± 0.322
2.564ThrTyr: 2.564 ± 0.684
0.0ThrXaa: 0.0 ± 0.0
Val
6.132ValAla: 6.132 ± 0.866
4.459ValCys: 4.459 ± 0.558
5.351ValAsp: 5.351 ± 0.772
4.013ValGlu: 4.013 ± 0.308
6.355ValPhe: 6.355 ± 0.616
4.682ValGly: 4.682 ± 1.071
1.561ValHis: 1.561 ± 0.35
3.79ValIle: 3.79 ± 0.581
5.686ValLys: 5.686 ± 0.618
8.361ValLeu: 8.361 ± 0.862
2.007ValMet: 2.007 ± 0.534
6.243ValAsn: 6.243 ± 1.014
3.01ValPro: 3.01 ± 0.93
2.676ValGln: 2.676 ± 0.828
3.122ValArg: 3.122 ± 0.507
7.692ValSer: 7.692 ± 0.824
6.355ValThr: 6.355 ± 0.767
9.253ValVal: 9.253 ± 1.842
1.338ValTrp: 1.338 ± 0.414
4.682ValTyr: 4.682 ± 0.62
0.0ValXaa: 0.0 ± 0.0
Trp
0.669TrpAla: 0.669 ± 0.236
0.669TrpCys: 0.669 ± 0.193
1.449TrpAsp: 1.449 ± 0.378
0.446TrpGlu: 0.446 ± 0.26
1.003TrpPhe: 1.003 ± 0.421
0.223TrpGly: 0.223 ± 0.183
0.669TrpHis: 0.669 ± 0.583
0.223TrpIle: 0.223 ± 0.229
0.446TrpLys: 0.446 ± 0.149
1.226TrpLeu: 1.226 ± 0.27
0.334TrpMet: 0.334 ± 0.219
1.003TrpAsn: 1.003 ± 0.26
0.334TrpPro: 0.334 ± 0.21
0.223TrpGln: 0.223 ± 0.069
0.334TrpArg: 0.334 ± 0.143
1.449TrpSer: 1.449 ± 0.391
0.446TrpThr: 0.446 ± 0.178
1.003TrpVal: 1.003 ± 0.114
0.446TrpTrp: 0.446 ± 0.322
0.892TrpTyr: 0.892 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.679TyrAla: 3.679 ± 0.9
2.007TyrCys: 2.007 ± 0.691
3.79TyrAsp: 3.79 ± 0.907
2.23TyrGlu: 2.23 ± 0.638
3.01TyrPhe: 3.01 ± 0.619
3.122TyrGly: 3.122 ± 0.232
1.338TyrHis: 1.338 ± 0.258
2.787TyrIle: 2.787 ± 0.505
1.895TyrLys: 1.895 ± 0.263
3.233TyrLeu: 3.233 ± 0.617
0.669TyrMet: 0.669 ± 0.095
3.233TyrAsn: 3.233 ± 0.58
1.449TyrPro: 1.449 ± 0.695
1.449TyrGln: 1.449 ± 0.502
1.449TyrArg: 1.449 ± 0.463
2.23TyrSer: 2.23 ± 0.609
3.344TyrThr: 3.344 ± 0.443
5.797TyrVal: 5.797 ± 0.569
0.557TyrTrp: 0.557 ± 0.167
3.344TyrTyr: 3.344 ± 0.405
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (8971 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski