Amino acid dipepetide frequency for Canada goose coronavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.265AlaAla: 5.265 ± 0.957
1.984AlaCys: 1.984 ± 0.613
3.281AlaAsp: 3.281 ± 0.744
2.06AlaGlu: 2.06 ± 0.175
3.128AlaPhe: 3.128 ± 0.947
5.188AlaGly: 5.188 ± 1.0
1.679AlaHis: 1.679 ± 0.415
4.883AlaIle: 4.883 ± 0.587
3.357AlaLys: 3.357 ± 0.666
6.791AlaLeu: 6.791 ± 1.145
1.602AlaMet: 1.602 ± 0.194
4.654AlaAsn: 4.654 ± 0.89
1.679AlaPro: 1.679 ± 0.492
1.908AlaGln: 1.908 ± 0.92
2.823AlaArg: 2.823 ± 0.788
4.273AlaSer: 4.273 ± 0.523
3.739AlaThr: 3.739 ± 0.482
5.799AlaVal: 5.799 ± 0.58
1.602AlaTrp: 1.602 ± 0.41
3.357AlaTyr: 3.357 ± 0.512
0.0AlaXaa: 0.0 ± 0.0
Cys
1.45CysAla: 1.45 ± 0.45
1.373CysCys: 1.373 ± 0.474
1.526CysAsp: 1.526 ± 0.387
1.755CysGlu: 1.755 ± 0.525
2.213CysPhe: 2.213 ± 0.496
2.747CysGly: 2.747 ± 0.696
0.916CysHis: 0.916 ± 0.248
0.992CysIle: 0.992 ± 0.336
2.518CysLys: 2.518 ± 0.544
2.289CysLeu: 2.289 ± 0.557
0.305CysMet: 0.305 ± 0.099
1.984CysAsn: 1.984 ± 0.305
0.992CysPro: 0.992 ± 0.325
0.916CysGln: 0.916 ± 0.203
1.602CysArg: 1.602 ± 0.305
1.908CysSer: 1.908 ± 0.333
1.755CysThr: 1.755 ± 1.061
4.044CysVal: 4.044 ± 0.624
0.763CysTrp: 0.763 ± 0.187
2.442CysTyr: 2.442 ± 0.662
0.0CysXaa: 0.0 ± 0.0
Asp
3.891AspAla: 3.891 ± 0.76
1.755AspCys: 1.755 ± 0.547
4.197AspAsp: 4.197 ± 0.447
2.213AspGlu: 2.213 ± 0.481
3.586AspPhe: 3.586 ± 0.424
3.662AspGly: 3.662 ± 0.355
0.229AspHis: 0.229 ± 0.173
3.434AspIle: 3.434 ± 0.811
2.594AspLys: 2.594 ± 0.356
4.807AspLeu: 4.807 ± 0.396
0.839AspMet: 0.839 ± 0.181
3.357AspAsn: 3.357 ± 0.327
1.984AspPro: 1.984 ± 0.566
1.526AspGln: 1.526 ± 0.524
1.526AspArg: 1.526 ± 0.586
4.197AspSer: 4.197 ± 0.538
2.594AspThr: 2.594 ± 0.45
5.57AspVal: 5.57 ± 0.777
0.61AspTrp: 0.61 ± 0.155
2.899AspTyr: 2.899 ± 0.608
0.0AspXaa: 0.0 ± 0.0
Glu
2.518GluAla: 2.518 ± 0.502
0.916GluCys: 0.916 ± 0.264
1.908GluAsp: 1.908 ± 0.461
2.976GluGlu: 2.976 ± 0.552
2.289GluPhe: 2.289 ± 0.577
1.602GluGly: 1.602 ± 0.462
0.382GluHis: 0.382 ± 0.541
1.221GluIle: 1.221 ± 0.225
1.45GluLys: 1.45 ± 0.667
3.51GluLeu: 3.51 ± 0.593
0.687GluMet: 0.687 ± 0.192
2.213GluAsn: 2.213 ± 0.44
2.136GluPro: 2.136 ± 0.299
2.289GluGln: 2.289 ± 0.387
1.45GluArg: 1.45 ± 0.34
2.518GluSer: 2.518 ± 0.451
1.602GluThr: 1.602 ± 0.26
3.51GluVal: 3.51 ± 0.526
0.763GluTrp: 0.763 ± 0.328
1.221GluTyr: 1.221 ± 0.177
0.0GluXaa: 0.0 ± 0.0
Phe
2.823PheAla: 2.823 ± 0.459
2.442PheCys: 2.442 ± 0.474
4.807PheAsp: 4.807 ± 0.716
2.518PheGlu: 2.518 ± 0.354
2.823PhePhe: 2.823 ± 0.408
4.12PheGly: 4.12 ± 0.617
0.382PheHis: 0.382 ± 0.213
3.205PheIle: 3.205 ± 0.77
4.273PheLys: 4.273 ± 0.666
4.578PheLeu: 4.578 ± 0.941
1.221PheMet: 1.221 ± 0.268
4.044PheAsn: 4.044 ± 0.767
0.992PhePro: 0.992 ± 0.343
0.992PheGln: 0.992 ± 0.207
1.373PheArg: 1.373 ± 0.316
4.273PheSer: 4.273 ± 0.499
2.518PheThr: 2.518 ± 0.611
7.859PheVal: 7.859 ± 1.225
0.992PheTrp: 0.992 ± 0.514
3.128PheTyr: 3.128 ± 0.556
0.0PheXaa: 0.0 ± 0.0
Gly
3.739GlyAla: 3.739 ± 0.532
2.213GlyCys: 2.213 ± 0.752
4.273GlyAsp: 4.273 ± 0.54
1.908GlyGlu: 1.908 ± 0.258
3.815GlyPhe: 3.815 ± 0.517
4.12GlyGly: 4.12 ± 0.73
1.45GlyHis: 1.45 ± 0.343
3.891GlyIle: 3.891 ± 0.423
2.976GlyLys: 2.976 ± 0.518
3.586GlyLeu: 3.586 ± 0.525
0.992GlyMet: 0.992 ± 0.266
2.518GlyAsn: 2.518 ± 0.223
2.136GlyPro: 2.136 ± 0.489
0.229GlyGln: 0.229 ± 0.308
2.06GlyArg: 2.06 ± 0.902
5.799GlySer: 5.799 ± 0.612
3.281GlyThr: 3.281 ± 0.318
8.393GlyVal: 8.393 ± 1.225
0.992GlyTrp: 0.992 ± 0.443
2.213GlyTyr: 2.213 ± 0.312
0.0GlyXaa: 0.0 ± 0.0
His
1.755HisAla: 1.755 ± 0.488
0.458HisCys: 0.458 ± 0.224
1.297HisAsp: 1.297 ± 0.216
0.687HisGlu: 0.687 ± 0.232
0.992HisPhe: 0.992 ± 0.335
1.526HisGly: 1.526 ± 0.253
0.305HisHis: 0.305 ± 0.116
0.763HisIle: 0.763 ± 0.266
0.61HisLys: 0.61 ± 0.159
2.136HisLeu: 2.136 ± 0.409
0.229HisMet: 0.229 ± 0.107
1.526HisAsn: 1.526 ± 0.297
0.458HisPro: 0.458 ± 0.24
0.534HisGln: 0.534 ± 0.217
0.229HisArg: 0.229 ± 0.222
0.992HisSer: 0.992 ± 0.366
0.763HisThr: 0.763 ± 0.27
1.831HisVal: 1.831 ± 0.361
0.0HisTrp: 0.0 ± 0.0
0.305HisTyr: 0.305 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
2.823IleAla: 2.823 ± 0.548
0.916IleCys: 0.916 ± 0.473
2.518IleAsp: 2.518 ± 0.491
1.679IleGlu: 1.679 ± 0.412
2.365IlePhe: 2.365 ± 0.49
2.976IleGly: 2.976 ± 0.672
0.382IleHis: 0.382 ± 0.31
2.136IleIle: 2.136 ± 1.176
2.136IleLys: 2.136 ± 0.386
5.341IleLeu: 5.341 ± 0.467
0.839IleMet: 0.839 ± 0.344
2.823IleAsn: 2.823 ± 0.276
2.213IlePro: 2.213 ± 0.515
1.755IleGln: 1.755 ± 1.195
1.755IleArg: 1.755 ± 0.428
3.128IleSer: 3.128 ± 0.491
2.594IleThr: 2.594 ± 0.646
5.951IleVal: 5.951 ± 1.183
0.839IleTrp: 0.839 ± 0.174
2.213IleTyr: 2.213 ± 0.409
0.0IleXaa: 0.0 ± 0.0
Lys
4.883LysAla: 4.883 ± 0.745
2.289LysCys: 2.289 ± 0.385
2.06LysAsp: 2.06 ± 0.343
1.908LysGlu: 1.908 ± 0.332
3.968LysPhe: 3.968 ± 0.714
2.671LysGly: 2.671 ± 0.443
1.679LysHis: 1.679 ± 0.43
1.984LysIle: 1.984 ± 0.378
2.899LysLys: 2.899 ± 0.444
6.409LysLeu: 6.409 ± 1.092
0.687LysMet: 0.687 ± 0.289
2.213LysAsn: 2.213 ± 0.319
3.052LysPro: 3.052 ± 1.028
2.671LysGln: 2.671 ± 0.42
1.831LysArg: 1.831 ± 0.502
2.289LysSer: 2.289 ± 0.858
2.747LysThr: 2.747 ± 0.66
4.044LysVal: 4.044 ± 0.646
0.458LysTrp: 0.458 ± 0.218
2.518LysTyr: 2.518 ± 0.583
0.0LysXaa: 0.0 ± 0.0
Leu
7.096LeuAla: 7.096 ± 0.894
3.205LeuCys: 3.205 ± 0.618
2.976LeuAsp: 2.976 ± 0.54
3.128LeuGlu: 3.128 ± 0.559
5.723LeuPhe: 5.723 ± 0.991
4.12LeuGly: 4.12 ± 0.503
1.831LeuHis: 1.831 ± 0.37
3.586LeuIle: 3.586 ± 0.8
6.333LeuLys: 6.333 ± 1.12
8.164LeuLeu: 8.164 ± 1.194
1.068LeuMet: 1.068 ± 0.379
6.18LeuAsn: 6.18 ± 0.925
2.899LeuPro: 2.899 ± 0.531
4.578LeuGln: 4.578 ± 0.385
3.205LeuArg: 3.205 ± 0.796
6.638LeuSer: 6.638 ± 0.695
5.417LeuThr: 5.417 ± 0.983
8.012LeuVal: 8.012 ± 0.943
0.839LeuTrp: 0.839 ± 0.479
6.638LeuTyr: 6.638 ± 0.822
0.0LeuXaa: 0.0 ± 0.0
Met
2.365MetAla: 2.365 ± 0.382
0.382MetCys: 0.382 ± 0.372
0.916MetAsp: 0.916 ± 0.222
0.534MetGlu: 0.534 ± 0.153
1.145MetPhe: 1.145 ± 0.337
0.382MetGly: 0.382 ± 0.15
0.534MetHis: 0.534 ± 0.188
0.534MetIle: 0.534 ± 0.341
0.305MetLys: 0.305 ± 0.099
2.06MetLeu: 2.06 ± 0.322
0.534MetMet: 0.534 ± 0.188
0.992MetAsn: 0.992 ± 0.381
1.145MetPro: 1.145 ± 0.341
1.145MetGln: 1.145 ± 0.414
0.458MetArg: 0.458 ± 0.217
1.145MetSer: 1.145 ± 0.521
1.297MetThr: 1.297 ± 0.221
1.984MetVal: 1.984 ± 0.332
0.458MetTrp: 0.458 ± 0.175
0.839MetTyr: 0.839 ± 0.683
0.0MetXaa: 0.0 ± 0.0
Asn
3.891AsnAla: 3.891 ± 0.813
1.984AsnCys: 1.984 ± 0.463
2.518AsnAsp: 2.518 ± 0.525
1.755AsnGlu: 1.755 ± 0.335
3.739AsnPhe: 3.739 ± 0.64
5.494AsnGly: 5.494 ± 0.661
0.61AsnHis: 0.61 ± 0.132
3.128AsnIle: 3.128 ± 0.483
3.815AsnLys: 3.815 ± 0.398
4.807AsnLeu: 4.807 ± 0.685
1.373AsnMet: 1.373 ± 0.278
2.671AsnAsn: 2.671 ± 0.769
1.984AsnPro: 1.984 ± 1.01
1.45AsnGln: 1.45 ± 0.554
1.526AsnArg: 1.526 ± 0.595
2.518AsnSer: 2.518 ± 0.717
2.823AsnThr: 2.823 ± 0.459
6.409AsnVal: 6.409 ± 1.364
0.458AsnTrp: 0.458 ± 0.181
2.671AsnTyr: 2.671 ± 0.471
0.0AsnXaa: 0.0 ± 0.0
Pro
1.373ProAla: 1.373 ± 0.354
1.068ProCys: 1.068 ± 0.267
2.594ProAsp: 2.594 ± 0.615
1.373ProGlu: 1.373 ± 1.04
2.365ProPhe: 2.365 ± 0.498
1.908ProGly: 1.908 ± 0.55
0.763ProHis: 0.763 ± 0.331
1.45ProIle: 1.45 ± 0.181
2.365ProLys: 2.365 ± 0.569
3.128ProLeu: 3.128 ± 0.8
0.687ProMet: 0.687 ± 0.217
1.755ProAsn: 1.755 ± 0.417
1.755ProPro: 1.755 ± 0.286
1.831ProGln: 1.831 ± 0.387
0.916ProArg: 0.916 ± 0.194
3.052ProSer: 3.052 ± 0.562
2.976ProThr: 2.976 ± 0.546
3.586ProVal: 3.586 ± 0.331
0.305ProTrp: 0.305 ± 0.099
1.755ProTyr: 1.755 ± 0.491
0.0ProXaa: 0.0 ± 0.0
Gln
3.205GlnAla: 3.205 ± 0.627
1.068GlnCys: 1.068 ± 0.404
2.442GlnAsp: 2.442 ± 0.769
1.145GlnGlu: 1.145 ± 0.298
1.602GlnPhe: 1.602 ± 0.387
1.908GlnGly: 1.908 ± 0.553
1.068GlnHis: 1.068 ± 0.215
0.916GlnIle: 0.916 ± 0.623
0.992GlnLys: 0.992 ± 0.267
3.815GlnLeu: 3.815 ± 0.597
0.61GlnMet: 0.61 ± 0.397
1.45GlnAsn: 1.45 ± 0.616
1.373GlnPro: 1.373 ± 0.319
2.365GlnGln: 2.365 ± 1.131
1.068GlnArg: 1.068 ± 0.331
2.518GlnSer: 2.518 ± 0.915
2.213GlnThr: 2.213 ± 0.62
2.518GlnVal: 2.518 ± 0.453
0.534GlnTrp: 0.534 ± 0.225
1.602GlnTyr: 1.602 ± 0.751
0.0GlnXaa: 0.0 ± 0.0
Arg
2.899ArgAla: 2.899 ± 0.618
1.145ArgCys: 1.145 ± 0.273
1.908ArgAsp: 1.908 ± 0.31
1.602ArgGlu: 1.602 ± 0.637
1.908ArgPhe: 1.908 ± 0.466
2.289ArgGly: 2.289 ± 0.469
0.534ArgHis: 0.534 ± 0.289
1.373ArgIle: 1.373 ± 0.254
1.831ArgLys: 1.831 ± 0.406
2.289ArgLeu: 2.289 ± 0.756
0.687ArgMet: 0.687 ± 0.226
2.594ArgAsn: 2.594 ± 0.562
0.992ArgPro: 0.992 ± 0.288
1.221ArgGln: 1.221 ± 0.313
2.518ArgArg: 2.518 ± 0.676
1.908ArgSer: 1.908 ± 0.414
1.831ArgThr: 1.831 ± 0.686
3.205ArgVal: 3.205 ± 0.734
0.229ArgTrp: 0.229 ± 0.14
1.679ArgTyr: 1.679 ± 0.355
0.0ArgXaa: 0.0 ± 0.0
Ser
4.197SerAla: 4.197 ± 0.47
2.671SerCys: 2.671 ± 0.332
2.823SerAsp: 2.823 ± 0.49
2.365SerGlu: 2.365 ± 0.7
6.18SerPhe: 6.18 ± 0.681
4.731SerGly: 4.731 ± 0.875
1.221SerHis: 1.221 ± 0.318
3.662SerIle: 3.662 ± 0.974
3.281SerLys: 3.281 ± 0.506
6.257SerLeu: 6.257 ± 0.694
1.679SerMet: 1.679 ± 0.299
3.205SerAsn: 3.205 ± 0.503
1.831SerPro: 1.831 ± 0.226
1.602SerGln: 1.602 ± 0.342
2.747SerArg: 2.747 ± 1.523
5.112SerSer: 5.112 ± 1.058
3.128SerThr: 3.128 ± 0.619
7.783SerVal: 7.783 ± 1.056
0.687SerTrp: 0.687 ± 0.292
2.823SerTyr: 2.823 ± 0.579
0.0SerXaa: 0.0 ± 0.0
Thr
3.434ThrAla: 3.434 ± 0.882
1.373ThrCys: 1.373 ± 0.27
2.518ThrAsp: 2.518 ± 0.663
1.45ThrGlu: 1.45 ± 0.359
3.586ThrPhe: 3.586 ± 0.976
3.586ThrGly: 3.586 ± 0.653
1.221ThrHis: 1.221 ± 0.261
2.518ThrIle: 2.518 ± 0.905
2.289ThrLys: 2.289 ± 1.12
5.799ThrLeu: 5.799 ± 0.607
1.526ThrMet: 1.526 ± 0.394
2.289ThrAsn: 2.289 ± 0.483
3.052ThrPro: 3.052 ± 0.638
1.602ThrGln: 1.602 ± 0.675
2.136ThrArg: 2.136 ± 0.823
5.951ThrSer: 5.951 ± 0.501
3.434ThrThr: 3.434 ± 0.475
5.646ThrVal: 5.646 ± 0.759
0.687ThrTrp: 0.687 ± 0.257
2.289ThrTyr: 2.289 ± 0.865
0.0ThrXaa: 0.0 ± 0.0
Val
7.172ValAla: 7.172 ± 1.503
3.51ValCys: 3.51 ± 0.373
7.096ValAsp: 7.096 ± 1.395
3.968ValGlu: 3.968 ± 0.676
4.96ValPhe: 4.96 ± 0.78
4.12ValGly: 4.12 ± 0.595
1.526ValHis: 1.526 ± 0.202
4.807ValIle: 4.807 ± 0.862
5.265ValLys: 5.265 ± 0.99
9.767ValLeu: 9.767 ± 1.029
2.442ValMet: 2.442 ± 0.443
5.494ValAsn: 5.494 ± 0.83
4.502ValPro: 4.502 ± 0.765
3.586ValGln: 3.586 ± 0.559
2.976ValArg: 2.976 ± 0.694
6.18ValSer: 6.18 ± 1.173
6.562ValThr: 6.562 ± 0.725
13.887ValVal: 13.887 ± 2.245
1.373ValTrp: 1.373 ± 0.361
5.951ValTyr: 5.951 ± 0.699
0.0ValXaa: 0.0 ± 0.0
Trp
0.534TrpAla: 0.534 ± 0.232
0.534TrpCys: 0.534 ± 0.246
0.992TrpAsp: 0.992 ± 0.287
0.229TrpGlu: 0.229 ± 0.152
0.992TrpPhe: 0.992 ± 0.278
0.458TrpGly: 0.458 ± 0.252
0.382TrpHis: 0.382 ± 0.23
0.687TrpIle: 0.687 ± 0.19
0.458TrpLys: 0.458 ± 0.164
1.984TrpLeu: 1.984 ± 0.441
0.076TrpMet: 0.076 ± 0.222
0.763TrpAsn: 0.763 ± 0.207
0.382TrpPro: 0.382 ± 0.575
0.458TrpGln: 0.458 ± 0.183
0.687TrpArg: 0.687 ± 0.192
0.763TrpSer: 0.763 ± 0.275
1.221TrpThr: 1.221 ± 0.311
0.763TrpVal: 0.763 ± 0.368
0.61TrpTrp: 0.61 ± 0.147
0.839TrpTyr: 0.839 ± 0.329
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.739TyrAla: 3.739 ± 0.384
3.205TyrCys: 3.205 ± 0.546
2.823TyrAsp: 2.823 ± 0.531
1.908TyrGlu: 1.908 ± 0.486
1.984TyrPhe: 1.984 ± 0.226
2.976TyrGly: 2.976 ± 0.84
0.382TyrHis: 0.382 ± 0.253
2.213TyrIle: 2.213 ± 0.806
3.51TyrLys: 3.51 ± 0.477
4.12TyrLeu: 4.12 ± 0.783
0.992TyrMet: 0.992 ± 0.302
2.899TyrAsn: 2.899 ± 0.489
1.45TyrPro: 1.45 ± 0.318
1.755TyrGln: 1.755 ± 0.536
1.755TyrArg: 1.755 ± 0.425
2.823TyrSer: 2.823 ± 0.582
3.891TyrThr: 3.891 ± 0.486
4.349TyrVal: 4.349 ± 0.309
0.61TyrTrp: 0.61 ± 0.307
2.671TyrTyr: 2.671 ± 0.76
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (13107 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski