Amino acid dipepetide frequency for Wuhan spider virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.209AlaAla: 4.209 ± 1.653
0.842AlaCys: 0.842 ± 0.285
5.331AlaAsp: 5.331 ± 0.536
2.525AlaGlu: 2.525 ± 0.941
1.403AlaPhe: 1.403 ± 0.557
3.648AlaGly: 3.648 ± 2.691
1.964AlaHis: 1.964 ± 0.439
2.806AlaIle: 2.806 ± 1.492
4.209AlaLys: 4.209 ± 0.71
6.173AlaLeu: 6.173 ± 1.838
1.684AlaMet: 1.684 ± 0.607
2.525AlaAsn: 2.525 ± 0.477
1.403AlaPro: 1.403 ± 0.833
1.684AlaGln: 1.684 ± 0.709
1.403AlaArg: 1.403 ± 1.743
5.051AlaSer: 5.051 ± 1.715
3.648AlaThr: 3.648 ± 1.328
1.964AlaVal: 1.964 ± 0.59
0.561AlaTrp: 0.561 ± 0.321
3.086AlaTyr: 3.086 ± 2.179
0.0AlaXaa: 0.0 ± 0.0
Cys
1.122CysAla: 1.122 ± 0.322
0.281CysCys: 0.281 ± 0.149
0.561CysAsp: 0.561 ± 0.298
1.403CysGlu: 1.403 ± 0.746
0.561CysPhe: 0.561 ± 0.321
0.561CysGly: 0.561 ± 0.298
0.281CysHis: 0.281 ± 0.149
1.684CysIle: 1.684 ± 1.143
1.403CysLys: 1.403 ± 0.746
2.525CysLeu: 2.525 ± 1.091
0.842CysMet: 0.842 ± 0.447
0.561CysAsn: 0.561 ± 0.298
0.0CysPro: 0.0 ± 0.0
0.561CysGln: 0.561 ± 0.634
1.122CysArg: 1.122 ± 0.597
2.525CysSer: 2.525 ± 0.941
0.561CysThr: 0.561 ± 0.298
0.281CysVal: 0.281 ± 0.149
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.245AspAla: 2.245 ± 1.07
1.684AspCys: 1.684 ± 0.895
3.928AspAsp: 3.928 ± 1.325
2.806AspGlu: 2.806 ± 0.607
3.086AspPhe: 3.086 ± 1.103
2.525AspGly: 2.525 ± 0.726
1.684AspHis: 1.684 ± 0.895
3.086AspIle: 3.086 ± 1.135
5.051AspLys: 5.051 ± 1.852
5.331AspLeu: 5.331 ± 0.761
1.122AspMet: 1.122 ± 0.515
3.086AspAsn: 3.086 ± 0.377
4.489AspPro: 4.489 ± 1.193
0.561AspGln: 0.561 ± 0.321
1.684AspArg: 1.684 ± 0.895
3.086AspSer: 3.086 ± 1.152
1.964AspThr: 1.964 ± 0.59
5.612AspVal: 5.612 ± 0.908
0.842AspTrp: 0.842 ± 0.572
3.648AspTyr: 3.648 ± 1.192
0.0AspXaa: 0.0 ± 0.0
Glu
1.403GluAla: 1.403 ± 0.414
1.122GluCys: 1.122 ± 0.597
4.489GluAsp: 4.489 ± 0.91
5.331GluGlu: 5.331 ± 3.385
4.209GluPhe: 4.209 ± 1.241
1.684GluGly: 1.684 ± 1.06
1.122GluHis: 1.122 ± 0.597
5.612GluIle: 5.612 ± 1.483
3.086GluLys: 3.086 ± 0.719
3.086GluLeu: 3.086 ± 0.455
1.684GluMet: 1.684 ± 0.472
3.648GluAsn: 3.648 ± 0.655
3.928GluPro: 3.928 ± 0.879
1.684GluGln: 1.684 ± 0.532
3.648GluArg: 3.648 ± 1.458
3.367GluSer: 3.367 ± 1.374
3.086GluThr: 3.086 ± 0.941
5.051GluVal: 5.051 ± 0.902
0.0GluTrp: 0.0 ± 0.0
1.684GluTyr: 1.684 ± 0.895
0.0GluXaa: 0.0 ± 0.0
Phe
4.209PheAla: 4.209 ± 0.755
0.281PheCys: 0.281 ± 0.149
1.403PheAsp: 1.403 ± 0.557
2.806PheGlu: 2.806 ± 1.084
3.086PhePhe: 3.086 ± 0.727
1.684PheGly: 1.684 ± 1.341
1.684PheHis: 1.684 ± 0.57
1.122PheIle: 1.122 ± 0.642
3.928PheLys: 3.928 ± 0.879
3.367PheLeu: 3.367 ± 0.859
1.122PheMet: 1.122 ± 0.322
3.928PheAsn: 3.928 ± 0.113
3.086PhePro: 3.086 ± 1.203
2.245PheGln: 2.245 ± 0.954
3.086PheArg: 3.086 ± 0.455
4.77PheSer: 4.77 ± 1.275
3.086PheThr: 3.086 ± 1.631
3.928PheVal: 3.928 ± 1.137
0.561PheTrp: 0.561 ± 0.823
2.806PheTyr: 2.806 ± 1.562
0.0PheXaa: 0.0 ± 0.0
Gly
1.684GlyAla: 1.684 ± 2.781
0.281GlyCys: 0.281 ± 0.149
3.086GlyAsp: 3.086 ± 1.009
2.525GlyGlu: 2.525 ± 0.726
2.525GlyPhe: 2.525 ± 0.492
2.806GlyGly: 2.806 ± 2.323
0.561GlyHis: 0.561 ± 0.298
5.892GlyIle: 5.892 ± 0.52
3.648GlyLys: 3.648 ± 0.997
4.77GlyLeu: 4.77 ± 1.373
1.122GlyMet: 1.122 ± 0.322
2.525GlyAsn: 2.525 ± 2.971
2.806GlyPro: 2.806 ± 1.023
1.122GlyGln: 1.122 ± 0.322
0.842GlyArg: 0.842 ± 1.192
3.648GlySer: 3.648 ± 1.933
2.525GlyThr: 2.525 ± 0.855
3.086GlyVal: 3.086 ± 0.689
0.561GlyTrp: 0.561 ± 0.298
1.684GlyTyr: 1.684 ± 0.895
0.0GlyXaa: 0.0 ± 0.0
His
2.245HisAla: 2.245 ± 0.8
0.281HisCys: 0.281 ± 0.149
0.281HisAsp: 0.281 ± 0.149
1.964HisGlu: 1.964 ± 0.59
1.403HisPhe: 1.403 ± 0.746
0.842HisGly: 0.842 ± 0.447
0.561HisHis: 0.561 ± 0.298
0.842HisIle: 0.842 ± 0.502
1.964HisLys: 1.964 ± 1.106
2.806HisLeu: 2.806 ± 0.607
1.403HisMet: 1.403 ± 0.414
0.561HisAsn: 0.561 ± 0.298
0.281HisPro: 0.281 ± 0.149
0.842HisGln: 0.842 ± 0.447
0.842HisArg: 0.842 ± 0.447
0.561HisSer: 0.561 ± 0.298
1.964HisThr: 1.964 ± 0.662
1.403HisVal: 1.403 ± 0.746
0.281HisTrp: 0.281 ± 0.149
1.964HisTyr: 1.964 ± 1.044
0.0HisXaa: 0.0 ± 0.0
Ile
2.806IleAla: 2.806 ± 0.827
0.281IleCys: 0.281 ± 0.722
5.331IleAsp: 5.331 ± 1.018
5.051IleGlu: 5.051 ± 1.671
4.489IlePhe: 4.489 ± 0.8
3.367IleGly: 3.367 ± 1.79
0.842IleHis: 0.842 ± 0.447
4.209IleIle: 4.209 ± 2.237
4.209IleLys: 4.209 ± 0.74
5.051IleLeu: 5.051 ± 1.881
2.525IleMet: 2.525 ± 0.796
3.086IleAsn: 3.086 ± 0.455
2.525IlePro: 2.525 ± 0.448
3.086IleGln: 3.086 ± 1.009
4.209IleArg: 4.209 ± 1.241
4.209IleSer: 4.209 ± 0.361
1.122IleThr: 1.122 ± 0.886
3.648IleVal: 3.648 ± 1.192
0.561IleTrp: 0.561 ± 0.298
3.367IleTyr: 3.367 ± 1.14
0.0IleXaa: 0.0 ± 0.0
Lys
3.086LysAla: 3.086 ± 0.719
0.561LysCys: 0.561 ± 0.298
3.367LysAsp: 3.367 ± 0.571
4.77LysGlu: 4.77 ± 2.109
4.77LysPhe: 4.77 ± 3.192
1.403LysGly: 1.403 ± 0.746
1.403LysHis: 1.403 ± 0.746
5.331LysIle: 5.331 ± 0.93
5.612LysLys: 5.612 ± 1.55
6.453LysLeu: 6.453 ± 2.26
0.842LysMet: 0.842 ± 0.447
2.806LysAsn: 2.806 ± 0.827
4.489LysPro: 4.489 ± 1.004
3.367LysGln: 3.367 ± 1.653
2.806LysArg: 2.806 ± 0.607
6.173LysSer: 6.173 ± 1.16
3.648LysThr: 3.648 ± 0.89
4.209LysVal: 4.209 ± 1.336
0.281LysTrp: 0.281 ± 0.149
2.245LysTyr: 2.245 ± 1.372
0.0LysXaa: 0.0 ± 0.0
Leu
7.295LeuAla: 7.295 ± 0.7
1.964LeuCys: 1.964 ± 1.044
7.576LeuAsp: 7.576 ± 1.153
4.489LeuGlu: 4.489 ± 1.004
2.245LeuPhe: 2.245 ± 0.8
4.489LeuGly: 4.489 ± 3.512
1.403LeuHis: 1.403 ± 0.546
5.051LeuIle: 5.051 ± 2.257
5.331LeuLys: 5.331 ± 1.185
5.892LeuLeu: 5.892 ± 1.317
1.403LeuMet: 1.403 ± 0.414
4.77LeuAsn: 4.77 ± 1.265
4.77LeuPro: 4.77 ± 3.093
4.77LeuGln: 4.77 ± 0.89
2.525LeuArg: 2.525 ± 1.2
5.331LeuSer: 5.331 ± 0.405
4.77LeuThr: 4.77 ± 1.265
5.051LeuVal: 5.051 ± 1.302
0.561LeuTrp: 0.561 ± 0.298
4.209LeuTyr: 4.209 ± 1.279
0.0LeuXaa: 0.0 ± 0.0
Met
3.367MetAla: 3.367 ± 1.887
0.281MetCys: 0.281 ± 0.149
1.122MetAsp: 1.122 ± 0.551
2.806MetGlu: 2.806 ± 1.675
0.842MetPhe: 0.842 ± 0.447
0.842MetGly: 0.842 ± 0.447
1.122MetHis: 1.122 ± 0.597
1.403MetIle: 1.403 ± 0.588
1.403MetLys: 1.403 ± 0.746
1.684MetLeu: 1.684 ± 0.709
0.281MetMet: 0.281 ± 0.149
0.561MetAsn: 0.561 ± 0.298
1.684MetPro: 1.684 ± 0.895
1.122MetGln: 1.122 ± 0.544
1.403MetArg: 1.403 ± 0.588
1.684MetSer: 1.684 ± 0.532
0.561MetThr: 0.561 ± 0.321
1.122MetVal: 1.122 ± 0.544
0.0MetTrp: 0.0 ± 0.0
0.561MetTyr: 0.561 ± 0.298
0.0MetXaa: 0.0 ± 0.0
Asn
2.806AsnAla: 2.806 ± 0.724
1.684AsnCys: 1.684 ± 0.472
2.525AsnAsp: 2.525 ± 0.492
4.489AsnGlu: 4.489 ± 1.004
3.928AsnPhe: 3.928 ± 2.339
2.245AsnGly: 2.245 ± 2.21
1.684AsnHis: 1.684 ± 0.895
2.525AsnIle: 2.525 ± 0.941
4.209AsnLys: 4.209 ± 0.361
5.612AsnLeu: 5.612 ± 1.04
1.122AsnMet: 1.122 ± 0.581
2.245AsnAsn: 2.245 ± 2.937
3.367AsnPro: 3.367 ± 0.84
1.684AsnGln: 1.684 ± 1.508
3.086AsnArg: 3.086 ± 0.82
6.734AsnSer: 6.734 ± 1.565
2.245AsnThr: 2.245 ± 0.898
3.367AsnVal: 3.367 ± 0.859
0.842AsnTrp: 0.842 ± 0.447
1.403AsnTyr: 1.403 ± 0.414
0.0AsnXaa: 0.0 ± 0.0
Pro
3.086ProAla: 3.086 ± 0.455
0.561ProCys: 0.561 ± 0.298
3.367ProAsp: 3.367 ± 1.063
1.964ProGlu: 1.964 ± 1.044
2.245ProPhe: 2.245 ± 1.762
5.051ProGly: 5.051 ± 1.881
0.561ProHis: 0.561 ± 0.298
4.209ProIle: 4.209 ± 1.241
3.367ProLys: 3.367 ± 1.952
3.648ProLeu: 3.648 ± 0.92
1.403ProMet: 1.403 ± 1.743
3.367ProAsn: 3.367 ± 0.762
4.77ProPro: 4.77 ± 1.197
1.684ProGln: 1.684 ± 0.963
2.245ProArg: 2.245 ± 2.19
3.648ProSer: 3.648 ± 0.977
3.367ProThr: 3.367 ± 1.009
3.928ProVal: 3.928 ± 1.541
1.122ProTrp: 1.122 ± 0.642
2.525ProTyr: 2.525 ± 0.845
0.0ProXaa: 0.0 ± 0.0
Gln
2.245GlnAla: 2.245 ± 0.954
0.281GlnCys: 0.281 ± 0.149
1.122GlnAsp: 1.122 ± 0.886
1.684GlnGlu: 1.684 ± 0.709
1.122GlnPhe: 1.122 ± 0.963
1.403GlnGly: 1.403 ± 0.414
0.842GlnHis: 0.842 ± 0.447
1.122GlnIle: 1.122 ± 0.322
1.403GlnLys: 1.403 ± 0.746
3.648GlnLeu: 3.648 ± 0.688
1.122GlnMet: 1.122 ± 0.597
2.245GlnAsn: 2.245 ± 2.19
1.684GlnPro: 1.684 ± 0.963
0.561GlnGln: 0.561 ± 1.108
0.842GlnArg: 0.842 ± 0.447
4.77GlnSer: 4.77 ± 1.442
3.928GlnThr: 3.928 ± 3.483
2.245GlnVal: 2.245 ± 0.455
0.0GlnTrp: 0.0 ± 0.0
2.806GlnTyr: 2.806 ± 1.48
0.0GlnXaa: 0.0 ± 0.0
Arg
2.245ArgAla: 2.245 ± 2.036
0.842ArgCys: 0.842 ± 0.447
1.403ArgAsp: 1.403 ± 0.414
1.684ArgGlu: 1.684 ± 0.799
1.964ArgPhe: 1.964 ± 1.044
1.684ArgGly: 1.684 ± 0.381
1.403ArgHis: 1.403 ± 0.746
3.086ArgIle: 3.086 ± 0.82
4.209ArgLys: 4.209 ± 1.089
2.245ArgLeu: 2.245 ± 0.8
0.842ArgMet: 0.842 ± 0.425
1.403ArgAsn: 1.403 ± 0.414
1.684ArgPro: 1.684 ± 0.381
1.403ArgGln: 1.403 ± 0.45
3.367ArgArg: 3.367 ± 0.842
3.367ArgSer: 3.367 ± 1.556
3.648ArgThr: 3.648 ± 0.899
1.964ArgVal: 1.964 ± 0.686
0.0ArgTrp: 0.0 ± 0.0
1.964ArgTyr: 1.964 ± 0.684
0.0ArgXaa: 0.0 ± 0.0
Ser
5.331SerAla: 5.331 ± 1.887
1.403SerCys: 1.403 ± 0.746
3.086SerAsp: 3.086 ± 0.941
3.367SerGlu: 3.367 ± 0.407
5.612SerPhe: 5.612 ± 3.263
5.612SerGly: 5.612 ± 1.633
1.403SerHis: 1.403 ± 0.521
6.453SerIle: 6.453 ± 1.218
7.576SerLys: 7.576 ± 1.729
6.453SerLeu: 6.453 ± 0.763
1.403SerMet: 1.403 ± 0.521
8.418SerAsn: 8.418 ± 2.466
3.648SerPro: 3.648 ± 0.973
2.245SerGln: 2.245 ± 1.846
2.245SerArg: 2.245 ± 0.581
7.295SerSer: 7.295 ± 4.514
4.77SerThr: 4.77 ± 0.659
4.77SerVal: 4.77 ± 2.38
0.842SerTrp: 0.842 ± 0.285
3.367SerTyr: 3.367 ± 0.859
0.0SerXaa: 0.0 ± 0.0
Thr
2.525ThrAla: 2.525 ± 0.748
0.561ThrCys: 0.561 ± 0.634
2.525ThrAsp: 2.525 ± 0.515
1.684ThrGlu: 1.684 ± 0.532
3.648ThrPhe: 3.648 ± 0.688
2.806ThrGly: 2.806 ± 0.758
1.964ThrHis: 1.964 ± 0.662
3.928ThrIle: 3.928 ± 0.699
1.684ThrLys: 1.684 ± 0.532
4.77ThrLeu: 4.77 ± 0.904
1.684ThrMet: 1.684 ± 0.532
2.525ThrAsn: 2.525 ± 2.237
4.77ThrPro: 4.77 ± 2.796
2.525ThrGln: 2.525 ± 0.995
2.525ThrArg: 2.525 ± 0.918
5.051ThrSer: 5.051 ± 3.337
3.648ThrThr: 3.648 ± 1.15
4.77ThrVal: 4.77 ± 1.74
0.842ThrTrp: 0.842 ± 0.447
2.245ThrTyr: 2.245 ± 0.455
0.0ThrXaa: 0.0 ± 0.0
Val
2.245ValAla: 2.245 ± 0.581
1.684ValCys: 1.684 ± 1.143
4.209ValAsp: 4.209 ± 1.241
3.367ValGlu: 3.367 ± 1.597
2.806ValPhe: 2.806 ± 0.827
1.964ValGly: 1.964 ± 0.365
1.122ValHis: 1.122 ± 0.544
3.648ValIle: 3.648 ± 0.655
3.367ValLys: 3.367 ± 1.14
6.173ValLeu: 6.173 ± 1.259
1.122ValMet: 1.122 ± 0.544
5.892ValAsn: 5.892 ± 1.201
3.928ValPro: 3.928 ± 0.763
2.806ValGln: 2.806 ± 1.155
1.122ValArg: 1.122 ± 0.597
7.015ValSer: 7.015 ± 1.64
5.051ValThr: 5.051 ± 1.881
5.051ValVal: 5.051 ± 1.8
0.842ValTrp: 0.842 ± 0.285
0.842ValTyr: 0.842 ± 0.285
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.281TrpCys: 0.281 ± 0.149
0.842TrpAsp: 0.842 ± 0.285
1.403TrpGlu: 1.403 ± 0.588
0.281TrpPhe: 0.281 ± 0.149
0.842TrpGly: 0.842 ± 0.285
0.561TrpHis: 0.561 ± 0.298
0.281TrpIle: 0.281 ± 0.149
0.0TrpLys: 0.0 ± 0.0
1.122TrpLeu: 1.122 ± 0.597
0.0TrpMet: 0.0 ± 0.0
0.842TrpAsn: 0.842 ± 0.447
0.561TrpPro: 0.561 ± 0.321
0.281TrpGln: 0.281 ± 0.149
0.0TrpArg: 0.0 ± 0.0
0.842TrpSer: 0.842 ± 0.285
0.561TrpThr: 0.561 ± 0.888
0.561TrpVal: 0.561 ± 0.298
0.281TrpTrp: 0.281 ± 0.149
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.964TyrAla: 1.964 ± 0.439
1.684TyrCys: 1.684 ± 0.532
1.964TyrAsp: 1.964 ± 1.044
2.806TyrGlu: 2.806 ± 0.607
1.964TyrPhe: 1.964 ± 1.069
2.245TyrGly: 2.245 ± 0.658
1.122TyrHis: 1.122 ± 0.322
1.684TyrIle: 1.684 ± 0.895
1.964TyrLys: 1.964 ± 0.59
3.086TyrLeu: 3.086 ± 0.455
0.842TyrMet: 0.842 ± 0.285
3.086TyrAsn: 3.086 ± 0.455
2.245TyrPro: 2.245 ± 0.596
0.842TyrGln: 0.842 ± 0.447
1.403TyrArg: 1.403 ± 0.746
6.173TyrSer: 6.173 ± 3.422
2.525TyrThr: 2.525 ± 0.855
2.245TyrVal: 2.245 ± 0.596
0.561TyrTrp: 0.561 ± 0.298
2.525TyrTyr: 2.525 ± 0.995
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski