Amino acid dipepetide frequency for Wuhan arthropod virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.026AlaAla: 1.026 ± 0.399
1.539AlaCys: 1.539 ± 0.78
2.565AlaAsp: 2.565 ± 0.773
2.565AlaGlu: 2.565 ± 0.815
2.565AlaPhe: 2.565 ± 1.772
2.308AlaGly: 2.308 ± 1.087
0.513AlaHis: 0.513 ± 0.309
5.899AlaIle: 5.899 ± 2.818
4.36AlaLys: 4.36 ± 1.596
4.617AlaLeu: 4.617 ± 1.212
1.282AlaMet: 1.282 ± 0.626
2.821AlaAsn: 2.821 ± 0.684
1.795AlaPro: 1.795 ± 1.039
2.308AlaGln: 2.308 ± 1.009
1.539AlaArg: 1.539 ± 0.682
3.078AlaSer: 3.078 ± 0.734
3.847AlaThr: 3.847 ± 1.957
4.617AlaVal: 4.617 ± 1.249
0.0AlaTrp: 0.0 ± 0.0
1.282AlaTyr: 1.282 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.769CysAla: 0.769 ± 0.293
0.513CysCys: 0.513 ± 0.351
1.795CysAsp: 1.795 ± 0.642
1.026CysGlu: 1.026 ± 0.506
2.052CysPhe: 2.052 ± 0.707
0.769CysGly: 0.769 ± 0.402
0.256CysHis: 0.256 ± 0.377
1.539CysIle: 1.539 ± 0.724
1.539CysLys: 1.539 ± 0.383
2.052CysLeu: 2.052 ± 0.576
0.0CysMet: 0.0 ± 0.0
0.769CysAsn: 0.769 ± 0.507
1.026CysPro: 1.026 ± 0.582
0.513CysGln: 0.513 ± 0.244
1.026CysArg: 1.026 ± 0.566
1.282CysSer: 1.282 ± 0.772
0.769CysThr: 0.769 ± 0.39
1.282CysVal: 1.282 ± 0.501
0.0CysTrp: 0.0 ± 0.0
0.513CysTyr: 0.513 ± 0.378
0.0CysXaa: 0.0 ± 0.0
Asp
2.565AspAla: 2.565 ± 0.806
1.539AspCys: 1.539 ± 0.482
2.821AspAsp: 2.821 ± 0.739
3.334AspGlu: 3.334 ± 1.276
4.36AspPhe: 4.36 ± 1.447
2.821AspGly: 2.821 ± 0.916
0.769AspHis: 0.769 ± 0.463
5.899AspIle: 5.899 ± 0.739
3.591AspLys: 3.591 ± 1.6
6.412AspLeu: 6.412 ± 1.498
2.308AspMet: 2.308 ± 0.965
3.591AspAsn: 3.591 ± 0.55
1.795AspPro: 1.795 ± 1.039
1.795AspGln: 1.795 ± 0.89
1.282AspArg: 1.282 ± 0.591
5.13AspSer: 5.13 ± 1.031
2.821AspThr: 2.821 ± 1.226
4.104AspVal: 4.104 ± 0.862
1.282AspTrp: 1.282 ± 0.567
3.847AspTyr: 3.847 ± 1.292
0.0AspXaa: 0.0 ± 0.0
Glu
3.847GluAla: 3.847 ± 0.726
0.513GluCys: 0.513 ± 0.418
3.591GluAsp: 3.591 ± 0.885
2.821GluGlu: 2.821 ± 1.435
2.308GluPhe: 2.308 ± 0.911
2.052GluGly: 2.052 ± 0.49
1.282GluHis: 1.282 ± 0.584
4.617GluIle: 4.617 ± 0.839
4.104GluLys: 4.104 ± 0.881
4.617GluLeu: 4.617 ± 1.2
1.282GluMet: 1.282 ± 0.531
3.078GluAsn: 3.078 ± 1.131
1.539GluPro: 1.539 ± 0.673
1.282GluGln: 1.282 ± 0.579
1.026GluArg: 1.026 ± 0.482
3.334GluSer: 3.334 ± 1.997
2.821GluThr: 2.821 ± 0.943
6.155GluVal: 6.155 ± 0.955
0.513GluTrp: 0.513 ± 0.244
1.795GluTyr: 1.795 ± 0.572
0.0GluXaa: 0.0 ± 0.0
Phe
3.078PheAla: 3.078 ± 0.808
1.026PheCys: 1.026 ± 0.354
3.334PheAsp: 3.334 ± 0.93
1.282PheGlu: 1.282 ± 0.528
2.308PhePhe: 2.308 ± 0.566
3.078PheGly: 3.078 ± 1.099
1.026PheHis: 1.026 ± 0.553
4.873PheIle: 4.873 ± 1.347
4.104PheLys: 4.104 ± 0.995
2.565PheLeu: 2.565 ± 0.537
1.539PheMet: 1.539 ± 0.458
4.873PheAsn: 4.873 ± 0.565
1.026PhePro: 1.026 ± 0.473
1.539PheGln: 1.539 ± 0.51
2.052PheArg: 2.052 ± 0.637
3.591PheSer: 3.591 ± 1.174
2.052PheThr: 2.052 ± 0.5
4.104PheVal: 4.104 ± 0.7
0.0PheTrp: 0.0 ± 0.0
2.565PheTyr: 2.565 ± 0.629
0.0PheXaa: 0.0 ± 0.0
Gly
1.795GlyAla: 1.795 ± 0.817
0.513GlyCys: 0.513 ± 0.244
2.821GlyAsp: 2.821 ± 0.86
3.334GlyGlu: 3.334 ± 0.799
2.308GlyPhe: 2.308 ± 0.678
3.078GlyGly: 3.078 ± 2.162
0.769GlyHis: 0.769 ± 0.539
4.104GlyIle: 4.104 ± 1.273
3.334GlyLys: 3.334 ± 1.081
4.617GlyLeu: 4.617 ± 1.887
0.256GlyMet: 0.256 ± 0.341
2.565GlyAsn: 2.565 ± 0.936
1.539GlyPro: 1.539 ± 0.488
1.539GlyGln: 1.539 ± 0.573
2.052GlyArg: 2.052 ± 0.484
3.078GlySer: 3.078 ± 1.159
2.565GlyThr: 2.565 ± 1.164
1.795GlyVal: 1.795 ± 0.705
0.513GlyTrp: 0.513 ± 0.378
3.591GlyTyr: 3.591 ± 1.112
0.0GlyXaa: 0.0 ± 0.0
His
0.769HisAla: 0.769 ± 0.665
0.0HisCys: 0.0 ± 0.0
1.282HisAsp: 1.282 ± 1.057
0.256HisGlu: 0.256 ± 0.445
1.282HisPhe: 1.282 ± 0.669
1.795HisGly: 1.795 ± 0.999
0.513HisHis: 0.513 ± 0.244
2.821HisIle: 2.821 ± 0.915
1.282HisLys: 1.282 ± 0.475
1.539HisLeu: 1.539 ± 0.93
0.0HisMet: 0.0 ± 0.0
1.026HisAsn: 1.026 ± 0.478
0.513HisPro: 0.513 ± 0.244
0.0HisGln: 0.0 ± 0.0
0.769HisArg: 0.769 ± 0.402
2.052HisSer: 2.052 ± 0.647
1.026HisThr: 1.026 ± 0.399
2.308HisVal: 2.308 ± 0.849
0.256HisTrp: 0.256 ± 0.377
1.026HisTyr: 1.026 ± 0.386
0.0HisXaa: 0.0 ± 0.0
Ile
4.36IleAla: 4.36 ± 0.685
1.539IleCys: 1.539 ± 0.829
4.36IleAsp: 4.36 ± 1.918
5.13IleGlu: 5.13 ± 1.132
2.821IlePhe: 2.821 ± 0.944
2.821IleGly: 2.821 ± 1.127
1.282IleHis: 1.282 ± 0.412
5.13IleIle: 5.13 ± 1.7
6.668IleLys: 6.668 ± 0.73
12.567IleLeu: 12.567 ± 4.566
2.308IleMet: 2.308 ± 0.576
8.977IleAsn: 8.977 ± 1.769
3.847IlePro: 3.847 ± 1.355
3.078IleGln: 3.078 ± 0.843
1.795IleArg: 1.795 ± 0.493
5.386IleSer: 5.386 ± 2.321
6.925IleThr: 6.925 ± 1.529
5.642IleVal: 5.642 ± 1.182
0.256IleTrp: 0.256 ± 0.154
2.308IleTyr: 2.308 ± 0.78
0.0IleXaa: 0.0 ± 0.0
Lys
4.104LysAla: 4.104 ± 1.897
1.795LysCys: 1.795 ± 1.237
4.617LysAsp: 4.617 ± 1.939
3.847LysGlu: 3.847 ± 1.276
6.412LysPhe: 6.412 ± 1.642
4.36LysGly: 4.36 ± 0.694
1.026LysHis: 1.026 ± 0.482
7.694LysIle: 7.694 ± 1.585
3.334LysLys: 3.334 ± 0.97
6.925LysLeu: 6.925 ± 1.27
2.052LysMet: 2.052 ± 1.297
5.642LysAsn: 5.642 ± 1.131
3.078LysPro: 3.078 ± 0.968
2.052LysGln: 2.052 ± 0.497
2.052LysArg: 2.052 ± 0.799
5.13LysSer: 5.13 ± 1.03
3.078LysThr: 3.078 ± 1.47
3.591LysVal: 3.591 ± 2.161
0.0LysTrp: 0.0 ± 0.0
2.565LysTyr: 2.565 ± 0.761
0.0LysXaa: 0.0 ± 0.0
Leu
4.617LeuAla: 4.617 ± 1.095
0.769LeuCys: 0.769 ± 0.402
9.746LeuAsp: 9.746 ± 1.356
3.591LeuGlu: 3.591 ± 1.438
3.078LeuPhe: 3.078 ± 0.569
2.821LeuGly: 2.821 ± 1.365
2.308LeuHis: 2.308 ± 0.674
8.464LeuIle: 8.464 ± 1.455
7.694LeuLys: 7.694 ± 1.825
9.49LeuLeu: 9.49 ± 3.145
1.539LeuMet: 1.539 ± 0.708
7.181LeuAsn: 7.181 ± 1.97
3.078LeuPro: 3.078 ± 0.812
2.565LeuGln: 2.565 ± 0.91
3.334LeuArg: 3.334 ± 0.7
7.438LeuSer: 7.438 ± 1.947
5.386LeuThr: 5.386 ± 1.361
4.36LeuVal: 4.36 ± 0.844
0.513LeuTrp: 0.513 ± 0.351
4.617LeuTyr: 4.617 ± 0.975
0.0LeuXaa: 0.0 ± 0.0
Met
0.513MetAla: 0.513 ± 0.309
0.769MetCys: 0.769 ± 0.463
1.539MetAsp: 1.539 ± 0.926
1.539MetGlu: 1.539 ± 0.701
2.308MetPhe: 2.308 ± 1.221
0.256MetGly: 0.256 ± 0.341
0.513MetHis: 0.513 ± 0.31
1.026MetIle: 1.026 ± 0.514
1.026MetLys: 1.026 ± 0.446
2.052MetLeu: 2.052 ± 0.519
0.513MetMet: 0.513 ± 0.31
1.795MetAsn: 1.795 ± 0.815
1.282MetPro: 1.282 ± 0.637
1.026MetGln: 1.026 ± 0.617
1.026MetArg: 1.026 ± 0.932
2.821MetSer: 2.821 ± 0.987
1.026MetThr: 1.026 ± 0.617
1.539MetVal: 1.539 ± 0.427
0.0MetTrp: 0.0 ± 0.0
1.026MetTyr: 1.026 ± 0.52
0.0MetXaa: 0.0 ± 0.0
Asn
2.821AsnAla: 2.821 ± 0.869
3.078AsnCys: 3.078 ± 0.957
2.308AsnAsp: 2.308 ± 0.616
2.565AsnGlu: 2.565 ± 0.832
2.052AsnPhe: 2.052 ± 0.399
3.078AsnGly: 3.078 ± 0.953
1.026AsnHis: 1.026 ± 0.428
3.847AsnIle: 3.847 ± 1.57
3.334AsnLys: 3.334 ± 1.342
7.951AsnLeu: 7.951 ± 1.906
1.795AsnMet: 1.795 ± 0.437
3.334AsnAsn: 3.334 ± 0.474
2.308AsnPro: 2.308 ± 0.822
3.334AsnGln: 3.334 ± 1.018
2.052AsnArg: 2.052 ± 0.756
6.412AsnSer: 6.412 ± 2.513
6.155AsnThr: 6.155 ± 2.697
4.617AsnVal: 4.617 ± 1.427
0.769AsnTrp: 0.769 ± 0.352
5.13AsnTyr: 5.13 ± 1.836
0.0AsnXaa: 0.0 ± 0.0
Pro
2.821ProAla: 2.821 ± 1.211
0.256ProCys: 0.256 ± 0.445
2.308ProAsp: 2.308 ± 0.854
3.591ProGlu: 3.591 ± 0.985
1.282ProPhe: 1.282 ± 0.474
1.026ProGly: 1.026 ± 0.428
1.282ProHis: 1.282 ± 1.422
3.591ProIle: 3.591 ± 1.788
1.026ProLys: 1.026 ± 0.617
2.052ProLeu: 2.052 ± 0.732
1.282ProMet: 1.282 ± 0.514
0.513ProAsn: 0.513 ± 0.244
1.539ProPro: 1.539 ± 1.142
1.539ProGln: 1.539 ± 0.701
1.282ProArg: 1.282 ± 0.584
2.821ProSer: 2.821 ± 0.608
1.795ProThr: 1.795 ± 0.817
1.795ProVal: 1.795 ± 1.238
0.0ProTrp: 0.0 ± 0.0
1.539ProTyr: 1.539 ± 1.153
0.0ProXaa: 0.0 ± 0.0
Gln
1.282GlnAla: 1.282 ± 0.431
0.256GlnCys: 0.256 ± 0.341
3.334GlnAsp: 3.334 ± 0.811
3.078GlnGlu: 3.078 ± 1.171
1.795GlnPhe: 1.795 ± 0.746
1.539GlnGly: 1.539 ± 0.926
0.513GlnHis: 0.513 ± 0.309
2.308GlnIle: 2.308 ± 0.553
2.821GlnLys: 2.821 ± 1.071
2.821GlnLeu: 2.821 ± 0.739
1.795GlnMet: 1.795 ± 1.08
1.539GlnAsn: 1.539 ± 0.427
0.769GlnPro: 0.769 ± 0.376
0.513GlnGln: 0.513 ± 0.62
0.256GlnArg: 0.256 ± 0.154
1.282GlnSer: 1.282 ± 0.567
2.565GlnThr: 2.565 ± 1.528
3.847GlnVal: 3.847 ± 0.636
0.0GlnTrp: 0.0 ± 0.0
0.769GlnTyr: 0.769 ± 0.507
0.0GlnXaa: 0.0 ± 0.0
Arg
1.282ArgAla: 1.282 ± 0.567
0.769ArgCys: 0.769 ± 0.39
1.026ArgAsp: 1.026 ± 0.4
1.539ArgGlu: 1.539 ± 0.709
1.282ArgPhe: 1.282 ± 0.519
1.795ArgGly: 1.795 ± 0.707
0.769ArgHis: 0.769 ± 0.465
4.617ArgIle: 4.617 ± 1.073
3.591ArgLys: 3.591 ± 0.782
2.308ArgLeu: 2.308 ± 0.616
0.769ArgMet: 0.769 ± 0.634
1.539ArgAsn: 1.539 ± 0.709
0.513ArgPro: 0.513 ± 0.309
1.282ArgGln: 1.282 ± 0.772
1.795ArgArg: 1.795 ± 0.817
1.282ArgSer: 1.282 ± 0.626
2.565ArgThr: 2.565 ± 0.529
1.026ArgVal: 1.026 ± 0.386
0.256ArgTrp: 0.256 ± 0.154
2.308ArgTyr: 2.308 ± 0.62
0.0ArgXaa: 0.0 ± 0.0
Ser
3.847SerAla: 3.847 ± 1.241
1.282SerCys: 1.282 ± 0.625
3.078SerAsp: 3.078 ± 0.786
3.847SerGlu: 3.847 ± 1.566
3.078SerPhe: 3.078 ± 0.973
3.847SerGly: 3.847 ± 0.581
2.308SerHis: 2.308 ± 0.725
6.155SerIle: 6.155 ± 1.59
6.668SerLys: 6.668 ± 1.307
6.668SerLeu: 6.668 ± 2.034
1.026SerMet: 1.026 ± 0.364
3.847SerAsn: 3.847 ± 0.816
2.052SerPro: 2.052 ± 0.988
2.565SerGln: 2.565 ± 0.56
1.795SerArg: 1.795 ± 0.437
5.13SerSer: 5.13 ± 1.52
6.412SerThr: 6.412 ± 2.79
5.13SerVal: 5.13 ± 1.948
1.026SerTrp: 1.026 ± 0.629
3.334SerTyr: 3.334 ± 1.365
0.0SerXaa: 0.0 ± 0.0
Thr
3.591ThrAla: 3.591 ± 1.703
0.513ThrCys: 0.513 ± 0.244
4.617ThrAsp: 4.617 ± 0.974
3.334ThrGlu: 3.334 ± 1.027
4.36ThrPhe: 4.36 ± 1.066
3.591ThrGly: 3.591 ± 1.392
1.282ThrHis: 1.282 ± 1.043
4.617ThrIle: 4.617 ± 1.689
5.642ThrLys: 5.642 ± 1.115
4.873ThrLeu: 4.873 ± 1.251
1.282ThrMet: 1.282 ± 0.437
5.642ThrAsn: 5.642 ± 2.684
1.539ThrPro: 1.539 ± 0.586
2.308ThrGln: 2.308 ± 1.144
1.795ThrArg: 1.795 ± 0.553
5.642ThrSer: 5.642 ± 1.529
4.104ThrThr: 4.104 ± 1.927
4.873ThrVal: 4.873 ± 1.614
0.0ThrTrp: 0.0 ± 0.0
2.052ThrTyr: 2.052 ± 0.435
0.0ThrXaa: 0.0 ± 0.0
Val
5.642ValAla: 5.642 ± 1.528
0.769ValCys: 0.769 ± 0.39
4.104ValAsp: 4.104 ± 1.604
4.104ValGlu: 4.104 ± 1.298
2.565ValPhe: 2.565 ± 0.63
2.821ValGly: 2.821 ± 0.858
2.308ValHis: 2.308 ± 0.809
5.386ValIle: 5.386 ± 1.394
5.899ValLys: 5.899 ± 1.266
4.873ValLeu: 4.873 ± 0.891
1.026ValMet: 1.026 ± 0.63
6.412ValAsn: 6.412 ± 1.109
2.308ValPro: 2.308 ± 0.702
1.539ValGln: 1.539 ± 0.704
2.308ValArg: 2.308 ± 0.849
4.873ValSer: 4.873 ± 1.213
6.155ValThr: 6.155 ± 2.073
2.821ValVal: 2.821 ± 0.439
0.0ValTrp: 0.0 ± 0.0
1.282ValTyr: 1.282 ± 0.584
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.256TrpCys: 0.256 ± 0.154
0.513TrpAsp: 0.513 ± 0.741
0.256TrpGlu: 0.256 ± 0.154
0.513TrpPhe: 0.513 ± 0.309
0.513TrpGly: 0.513 ± 0.753
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.513TrpLys: 0.513 ± 0.309
0.769TrpLeu: 0.769 ± 0.463
0.0TrpMet: 0.0 ± 0.0
0.256TrpAsn: 0.256 ± 0.284
0.513TrpPro: 0.513 ± 0.309
0.0TrpGln: 0.0 ± 0.0
0.513TrpArg: 0.513 ± 0.31
0.513TrpSer: 0.513 ± 0.31
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.256TrpTyr: 0.256 ± 0.415
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.795TyrAla: 1.795 ± 0.529
1.795TyrCys: 1.795 ± 0.634
2.308TyrAsp: 2.308 ± 0.698
1.539TyrGlu: 1.539 ± 0.505
1.539TyrPhe: 1.539 ± 0.843
1.795TyrGly: 1.795 ± 0.817
0.769TyrHis: 0.769 ± 0.522
4.617TyrIle: 4.617 ± 1.224
2.821TyrLys: 2.821 ± 0.928
2.821TyrLeu: 2.821 ± 1.506
1.282TyrMet: 1.282 ± 0.614
2.565TyrAsn: 2.565 ± 0.694
1.282TyrPro: 1.282 ± 0.474
2.308TyrGln: 2.308 ± 0.454
2.565TyrArg: 2.565 ± 1.147
2.821TyrSer: 2.821 ± 1.127
3.591TyrThr: 3.591 ± 1.043
3.591TyrVal: 3.591 ± 0.885
0.0TyrTrp: 0.0 ± 0.0
2.052TyrTyr: 2.052 ± 0.732
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (3900 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski