Amino acid dipepetide frequency for Wuhan spider virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.87AlaAla: 5.87 ± 4.164
0.267AlaCys: 0.267 ± 0.739
1.601AlaAsp: 1.601 ± 0.664
4.803AlaGlu: 4.803 ± 0.327
2.668AlaPhe: 2.668 ± 0.97
4.002AlaGly: 4.002 ± 1.983
1.334AlaHis: 1.334 ± 0.715
2.401AlaIle: 2.401 ± 0.49
3.735AlaLys: 3.735 ± 2.123
6.937AlaLeu: 6.937 ± 1.786
0.534AlaMet: 0.534 ± 0.374
2.668AlaAsn: 2.668 ± 1.293
2.401AlaPro: 2.401 ± 1.098
4.269AlaGln: 4.269 ± 1.109
3.469AlaArg: 3.469 ± 0.586
5.603AlaSer: 5.603 ± 0.888
3.735AlaThr: 3.735 ± 1.605
3.202AlaVal: 3.202 ± 1.437
1.601AlaTrp: 1.601 ± 0.705
3.202AlaTyr: 3.202 ± 1.032
0.0AlaXaa: 0.0 ± 0.0
Cys
1.868CysAla: 1.868 ± 0.634
0.0CysCys: 0.0 ± 0.0
1.868CysAsp: 1.868 ± 1.255
1.067CysGlu: 1.067 ± 0.628
0.534CysPhe: 0.534 ± 0.286
1.601CysGly: 1.601 ± 0.858
0.267CysHis: 0.267 ± 0.143
0.8CysIle: 0.8 ± 0.429
0.534CysLys: 0.534 ± 0.286
1.601CysLeu: 1.601 ± 0.517
0.267CysMet: 0.267 ± 0.143
1.067CysAsn: 1.067 ± 0.572
0.8CysPro: 0.8 ± 0.636
1.334CysGln: 1.334 ± 0.652
0.8CysArg: 0.8 ± 0.636
2.401CysSer: 2.401 ± 1.098
0.8CysThr: 0.8 ± 0.429
0.267CysVal: 0.267 ± 0.143
0.267CysTrp: 0.267 ± 0.143
0.534CysTyr: 0.534 ± 0.674
0.0CysXaa: 0.0 ± 0.0
Asp
4.002AspAla: 4.002 ± 1.711
1.334AspCys: 1.334 ± 0.652
4.002AspAsp: 4.002 ± 3.047
1.334AspGlu: 1.334 ± 0.715
3.735AspPhe: 3.735 ± 1.12
3.202AspGly: 3.202 ± 1.66
1.334AspHis: 1.334 ± 0.506
5.336AspIle: 5.336 ± 1.037
2.134AspLys: 2.134 ± 0.759
4.269AspLeu: 4.269 ± 0.735
1.334AspMet: 1.334 ± 0.715
2.935AspAsn: 2.935 ± 1.586
3.735AspPro: 3.735 ± 0.692
3.202AspGln: 3.202 ± 0.886
1.868AspArg: 1.868 ± 0.808
3.202AspSer: 3.202 ± 1.004
2.401AspThr: 2.401 ± 0.89
2.668AspVal: 2.668 ± 0.426
1.334AspTrp: 1.334 ± 0.715
1.868AspTyr: 1.868 ± 0.781
0.0AspXaa: 0.0 ± 0.0
Glu
3.202GluAla: 3.202 ± 1.41
1.067GluCys: 1.067 ± 0.628
3.202GluAsp: 3.202 ± 0.835
5.336GluGlu: 5.336 ± 2.748
4.002GluPhe: 4.002 ± 1.005
2.935GluGly: 2.935 ± 1.091
1.067GluHis: 1.067 ± 0.349
3.202GluIle: 3.202 ± 1.41
4.269GluLys: 4.269 ± 1.36
4.002GluLeu: 4.002 ± 1.392
1.067GluMet: 1.067 ± 0.628
2.134GluAsn: 2.134 ± 0.697
2.935GluPro: 2.935 ± 1.091
3.735GluGln: 3.735 ± 0.978
2.668GluArg: 2.668 ± 1.093
4.269GluSer: 4.269 ± 1.519
3.469GluThr: 3.469 ± 0.618
5.069GluVal: 5.069 ± 1.436
1.067GluTrp: 1.067 ± 0.597
2.134GluTyr: 2.134 ± 1.144
0.0GluXaa: 0.0 ± 0.0
Phe
3.469PheAla: 3.469 ± 1.569
0.534PheCys: 0.534 ± 0.286
1.868PheAsp: 1.868 ± 2.092
2.401PheGlu: 2.401 ± 0.652
0.534PhePhe: 0.534 ± 0.286
1.868PheGly: 1.868 ± 0.785
2.401PheHis: 2.401 ± 1.098
1.868PheIle: 1.868 ± 0.634
4.803PheLys: 4.803 ± 1.025
4.269PheLeu: 4.269 ± 1.519
1.334PheMet: 1.334 ± 1.737
4.002PheAsn: 4.002 ± 1.426
3.469PhePro: 3.469 ± 0.879
1.601PheGln: 1.601 ± 0.664
1.601PheArg: 1.601 ± 0.664
2.668PheSer: 2.668 ± 0.426
2.134PheThr: 2.134 ± 1.609
2.668PheVal: 2.668 ± 0.834
1.067PheTrp: 1.067 ± 0.572
1.601PheTyr: 1.601 ± 1.218
0.0PheXaa: 0.0 ± 0.0
Gly
2.935GlyAla: 2.935 ± 2.468
1.067GlyCys: 1.067 ± 0.572
2.134GlyAsp: 2.134 ± 0.447
2.668GlyGlu: 2.668 ± 1.522
2.935GlyPhe: 2.935 ± 0.919
2.668GlyGly: 2.668 ± 1.511
0.8GlyHis: 0.8 ± 0.429
3.202GlyIle: 3.202 ± 1.395
5.87GlyLys: 5.87 ± 1.886
2.668GlyLeu: 2.668 ± 1.298
1.868GlyMet: 1.868 ± 0.666
2.935GlyAsn: 2.935 ± 1.01
2.935GlyPro: 2.935 ± 1.67
2.134GlyGln: 2.134 ± 1.12
1.868GlyArg: 1.868 ± 0.808
3.469GlySer: 3.469 ± 0.807
3.469GlyThr: 3.469 ± 1.686
5.069GlyVal: 5.069 ± 2.088
0.8GlyTrp: 0.8 ± 0.715
2.134GlyTyr: 2.134 ± 1.12
0.0GlyXaa: 0.0 ± 0.0
His
1.334HisAla: 1.334 ± 0.417
0.8HisCys: 0.8 ± 0.429
1.601HisAsp: 1.601 ± 0.443
2.134HisGlu: 2.134 ± 0.527
2.134HisPhe: 2.134 ± 1.256
0.267HisGly: 0.267 ± 0.458
0.267HisHis: 0.267 ± 0.143
1.334HisIle: 1.334 ± 0.417
1.067HisLys: 1.067 ± 0.628
1.601HisLeu: 1.601 ± 0.858
0.534HisMet: 0.534 ± 0.286
1.334HisAsn: 1.334 ± 0.417
1.067HisPro: 1.067 ± 0.597
0.8HisGln: 0.8 ± 0.332
0.267HisArg: 0.267 ± 0.143
1.601HisSer: 1.601 ± 1.218
0.8HisThr: 0.8 ± 0.429
0.8HisVal: 0.8 ± 0.429
0.534HisTrp: 0.534 ± 0.374
1.868HisTyr: 1.868 ± 1.001
0.0HisXaa: 0.0 ± 0.0
Ile
4.002IleAla: 4.002 ± 1.356
1.601IleCys: 1.601 ± 0.705
1.868IleAsp: 1.868 ± 0.781
2.401IleGlu: 2.401 ± 0.457
2.134IlePhe: 2.134 ± 0.447
4.536IleGly: 4.536 ± 1.064
0.8IleHis: 0.8 ± 0.332
3.469IleIle: 3.469 ± 1.573
1.601IleLys: 1.601 ± 0.858
3.469IleLeu: 3.469 ± 1.465
1.868IleMet: 1.868 ± 1.001
3.735IleAsn: 3.735 ± 1.065
3.469IlePro: 3.469 ± 1.829
1.601IleGln: 1.601 ± 0.719
4.002IleArg: 4.002 ± 0.341
6.403IleSer: 6.403 ± 1.174
3.735IleThr: 3.735 ± 0.9
4.269IleVal: 4.269 ± 2.289
0.0IleTrp: 0.0 ± 0.0
2.935IleTyr: 2.935 ± 0.479
0.0IleXaa: 0.0 ± 0.0
Lys
4.269LysAla: 4.269 ± 1.581
1.067LysCys: 1.067 ± 0.349
5.069LysAsp: 5.069 ± 1.35
4.269LysGlu: 4.269 ± 2.289
3.469LysPhe: 3.469 ± 0.618
3.469LysGly: 3.469 ± 0.586
0.8LysHis: 0.8 ± 0.636
5.336LysIle: 5.336 ± 1.62
5.336LysLys: 5.336 ± 1.211
5.87LysLeu: 5.87 ± 2.019
1.601LysMet: 1.601 ± 0.705
2.935LysAsn: 2.935 ± 0.843
3.469LysPro: 3.469 ± 0.879
2.134LysGln: 2.134 ± 0.874
2.134LysArg: 2.134 ± 0.447
5.069LysSer: 5.069 ± 0.823
3.469LysThr: 3.469 ± 0.182
4.269LysVal: 4.269 ± 0.796
0.8LysTrp: 0.8 ± 0.332
2.401LysTyr: 2.401 ± 1.288
0.0LysXaa: 0.0 ± 0.0
Leu
7.737LeuAla: 7.737 ± 1.279
1.601LeuCys: 1.601 ± 0.517
4.269LeuAsp: 4.269 ± 1.395
5.069LeuGlu: 5.069 ± 1.436
1.868LeuPhe: 1.868 ± 2.103
2.935LeuGly: 2.935 ± 1.732
1.334LeuHis: 1.334 ± 0.506
3.735LeuIle: 3.735 ± 0.721
6.137LeuLys: 6.137 ± 0.93
6.137LeuLeu: 6.137 ± 2.333
0.8LeuMet: 0.8 ± 0.706
5.336LeuAsn: 5.336 ± 1.137
4.536LeuPro: 4.536 ± 2.108
2.935LeuGln: 2.935 ± 0.843
3.735LeuArg: 3.735 ± 1.577
5.336LeuSer: 5.336 ± 1.137
3.735LeuThr: 3.735 ± 1.12
5.336LeuVal: 5.336 ± 2.049
1.601LeuTrp: 1.601 ± 0.517
2.935LeuTyr: 2.935 ± 1.091
0.0LeuXaa: 0.0 ± 0.0
Met
1.334MetAla: 1.334 ± 0.562
0.267MetCys: 0.267 ± 0.143
1.067MetAsp: 1.067 ± 0.628
1.334MetGlu: 1.334 ± 0.854
1.601MetPhe: 1.601 ± 0.517
0.8MetGly: 0.8 ± 0.636
1.334MetHis: 1.334 ± 0.715
0.8MetIle: 0.8 ± 0.429
0.267MetLys: 0.267 ± 0.143
2.134MetLeu: 2.134 ± 0.527
0.267MetMet: 0.267 ± 0.143
1.334MetAsn: 1.334 ± 0.417
1.067MetPro: 1.067 ± 0.627
0.8MetGln: 0.8 ± 0.706
0.534MetArg: 0.534 ± 0.286
2.401MetSer: 2.401 ± 0.647
1.334MetThr: 1.334 ± 0.417
1.334MetVal: 1.334 ± 0.692
0.0MetTrp: 0.0 ± 0.0
1.067MetTyr: 1.067 ± 0.597
0.0MetXaa: 0.0 ± 0.0
Asn
2.401AsnAla: 2.401 ± 1.311
1.334AsnCys: 1.334 ± 0.652
3.469AsnAsp: 3.469 ± 0.572
3.735AsnGlu: 3.735 ± 1.267
2.401AsnPhe: 2.401 ± 1.952
2.134AsnGly: 2.134 ± 1.212
0.534AsnHis: 0.534 ± 0.286
4.269AsnIle: 4.269 ± 1.519
4.269AsnLys: 4.269 ± 0.991
4.803AsnLeu: 4.803 ± 1.903
1.601AsnMet: 1.601 ± 0.774
0.8AsnAsn: 0.8 ± 0.429
3.202AsnPro: 3.202 ± 1.3
2.134AsnGln: 2.134 ± 1.494
2.401AsnArg: 2.401 ± 1.156
2.935AsnSer: 2.935 ± 0.379
5.069AsnThr: 5.069 ± 2.273
3.202AsnVal: 3.202 ± 0.665
0.267AsnTrp: 0.267 ± 0.143
1.868AsnTyr: 1.868 ± 0.808
0.0AsnXaa: 0.0 ± 0.0
Pro
2.668ProAla: 2.668 ± 1.708
1.868ProCys: 1.868 ± 0.622
3.202ProAsp: 3.202 ± 1.034
3.735ProGlu: 3.735 ± 1.922
1.334ProPhe: 1.334 ± 1.303
3.735ProGly: 3.735 ± 2.033
0.534ProHis: 0.534 ± 0.286
2.668ProIle: 2.668 ± 1.866
4.002ProLys: 4.002 ± 0.72
2.668ProLeu: 2.668 ± 0.426
1.868ProMet: 1.868 ± 1.825
3.469ProAsn: 3.469 ± 2.085
4.269ProPro: 4.269 ± 2.626
4.002ProGln: 4.002 ± 1.342
1.868ProArg: 1.868 ± 1.331
5.069ProSer: 5.069 ± 2.137
3.202ProThr: 3.202 ± 0.703
4.002ProVal: 4.002 ± 0.837
0.534ProTrp: 0.534 ± 0.374
1.601ProTyr: 1.601 ± 0.751
0.0ProXaa: 0.0 ± 0.0
Gln
2.668GlnAla: 2.668 ± 0.9
0.8GlnCys: 0.8 ± 0.636
2.401GlnAsp: 2.401 ± 1.331
2.401GlnGlu: 2.401 ± 1.272
2.401GlnPhe: 2.401 ± 0.49
1.868GlnGly: 1.868 ± 1.133
1.868GlnHis: 1.868 ± 0.622
4.269GlnIle: 4.269 ± 1.921
3.735GlnLys: 3.735 ± 1.282
4.002GlnLeu: 4.002 ± 0.81
0.534GlnMet: 0.534 ± 0.374
1.868GlnAsn: 1.868 ± 0.622
1.334GlnPro: 1.334 ± 0.923
0.8GlnGln: 0.8 ± 0.715
1.601GlnArg: 1.601 ± 0.827
3.202GlnSer: 3.202 ± 1.267
2.134GlnThr: 2.134 ± 0.573
3.735GlnVal: 3.735 ± 1.956
1.067GlnTrp: 1.067 ± 1.28
2.134GlnTyr: 2.134 ± 0.911
0.0GlnXaa: 0.0 ± 0.0
Arg
1.868ArgAla: 1.868 ± 1.902
0.267ArgCys: 0.267 ± 0.739
4.002ArgAsp: 4.002 ± 0.857
2.668ArgGlu: 2.668 ± 1.025
2.134ArgPhe: 2.134 ± 1.638
3.469ArgGly: 3.469 ± 1.172
0.267ArgHis: 0.267 ± 0.143
2.935ArgIle: 2.935 ± 0.676
2.935ArgLys: 2.935 ± 1.016
2.668ArgLeu: 2.668 ± 0.992
0.267ArgMet: 0.267 ± 0.143
2.668ArgAsn: 2.668 ± 2.31
2.935ArgPro: 2.935 ± 1.174
0.8ArgGln: 0.8 ± 0.706
3.735ArgArg: 3.735 ± 1.245
2.134ArgSer: 2.134 ± 1.12
4.002ArgThr: 4.002 ± 0.981
2.935ArgVal: 2.935 ± 0.676
0.534ArgTrp: 0.534 ± 0.286
1.067ArgTyr: 1.067 ± 0.572
0.0ArgXaa: 0.0 ± 0.0
Ser
5.069SerAla: 5.069 ± 0.843
1.067SerCys: 1.067 ± 0.628
3.735SerAsp: 3.735 ± 0.197
3.202SerGlu: 3.202 ± 0.534
4.002SerPhe: 4.002 ± 1.601
4.002SerGly: 4.002 ± 1.356
2.134SerHis: 2.134 ± 0.447
2.134SerIle: 2.134 ± 1.019
4.803SerLys: 4.803 ± 1.025
6.937SerLeu: 6.937 ± 1.822
1.334SerMet: 1.334 ± 0.777
4.002SerAsn: 4.002 ± 2.255
3.469SerPro: 3.469 ± 1.111
4.002SerGln: 4.002 ± 1.279
2.668SerArg: 2.668 ± 2.005
7.471SerSer: 7.471 ± 4.011
4.002SerThr: 4.002 ± 0.934
6.67SerVal: 6.67 ± 1.68
2.134SerTrp: 2.134 ± 1.522
2.134SerTyr: 2.134 ± 0.911
0.0SerXaa: 0.0 ± 0.0
Thr
1.601ThrAla: 1.601 ± 0.664
1.067ThrCys: 1.067 ± 0.572
2.401ThrAsp: 2.401 ± 0.755
4.269ThrGlu: 4.269 ± 1.413
3.469ThrPhe: 3.469 ± 0.881
4.002ThrGly: 4.002 ± 1.426
2.401ThrHis: 2.401 ± 0.755
2.935ThrIle: 2.935 ± 0.978
3.202ThrLys: 3.202 ± 1.127
4.269ThrLeu: 4.269 ± 1.223
1.067ThrMet: 1.067 ± 0.747
1.067ThrAsn: 1.067 ± 0.627
5.069ThrPro: 5.069 ± 2.137
3.202ThrGln: 3.202 ± 1.439
1.868ThrArg: 1.868 ± 0.644
3.735ThrSer: 3.735 ± 1.245
2.935ThrThr: 2.935 ± 1.01
4.803ThrVal: 4.803 ± 1.781
0.0ThrTrp: 0.0 ± 0.0
2.668ThrTyr: 2.668 ± 1.093
0.0ThrXaa: 0.0 ± 0.0
Val
4.536ValAla: 4.536 ± 1.444
2.134ValCys: 2.134 ± 1.144
5.069ValAsp: 5.069 ± 1.585
4.269ValGlu: 4.269 ± 1.862
2.668ValPhe: 2.668 ± 0.992
4.002ValGly: 4.002 ± 1.426
0.8ValHis: 0.8 ± 0.636
2.935ValIle: 2.935 ± 0.94
4.002ValLys: 4.002 ± 1.392
5.069ValLeu: 5.069 ± 1.218
1.601ValMet: 1.601 ± 0.705
4.269ValAsn: 4.269 ± 1.147
3.469ValPro: 3.469 ± 1.169
2.134ValGln: 2.134 ± 1.095
2.401ValArg: 2.401 ± 1.834
5.069ValSer: 5.069 ± 1.915
2.935ValThr: 2.935 ± 0.929
4.803ValVal: 4.803 ± 0.914
1.334ValTrp: 1.334 ± 0.417
4.803ValTyr: 4.803 ± 1.466
0.0ValXaa: 0.0 ± 0.0
Trp
0.8TrpAla: 0.8 ± 0.332
0.267TrpCys: 0.267 ± 0.143
0.534TrpAsp: 0.534 ± 0.286
1.868TrpGlu: 1.868 ± 1.001
1.334TrpPhe: 1.334 ± 0.854
0.534TrpGly: 0.534 ± 0.374
1.067TrpHis: 1.067 ± 0.572
1.334TrpIle: 1.334 ± 0.652
0.267TrpLys: 0.267 ± 0.143
0.8TrpLeu: 0.8 ± 0.332
0.267TrpMet: 0.267 ± 0.458
1.334TrpAsn: 1.334 ± 0.692
0.8TrpPro: 0.8 ± 0.706
1.334TrpGln: 1.334 ± 1.204
0.8TrpArg: 0.8 ± 0.706
0.8TrpSer: 0.8 ± 0.332
0.534TrpThr: 0.534 ± 0.286
0.8TrpVal: 0.8 ± 0.824
0.534TrpTrp: 0.534 ± 0.286
0.534TrpTyr: 0.534 ± 0.286
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.668TyrAla: 2.668 ± 1.093
0.267TyrCys: 0.267 ± 0.143
1.868TyrAsp: 1.868 ± 1.255
1.868TyrGlu: 1.868 ± 0.781
0.8TyrPhe: 0.8 ± 0.429
1.601TyrGly: 1.601 ± 0.517
0.8TyrHis: 0.8 ± 0.429
2.935TyrIle: 2.935 ± 1.574
4.269TyrLys: 4.269 ± 2.492
2.935TyrLeu: 2.935 ± 1.379
0.8TyrMet: 0.8 ± 0.429
3.202TyrAsn: 3.202 ± 0.73
1.868TyrPro: 1.868 ± 0.781
1.868TyrGln: 1.868 ± 0.532
4.002TyrArg: 4.002 ± 0.72
2.134TyrSer: 2.134 ± 0.686
2.134TyrThr: 2.134 ± 0.911
2.401TyrVal: 2.401 ± 0.512
1.067TyrTrp: 1.067 ± 0.606
2.134TyrTyr: 2.134 ± 0.786
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3749 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski