Amino acid dipeptide frequency for Wuhan spider virus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
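As a minimal sketch of how such frequencies can be computed, the following Python function counts overlapping residue pairs and reports them in per mille. The function name, the one-letter sequence input and the choice not to count pairs across protein boundaries are assumptions made for illustration; the database's own procedure is not described on this page.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: sequences use one-letter codes and pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein (overlapping pairs).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not the proteome described here).
toy = ["AAKL", "AKA"]
freqs = dipeptide_permille(toy)
```

For the toy input there are five pairs in total (AA, AK, KL, AK, KA), so AK accounts for two fifths of them.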

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
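The expected value and the red/blue marking can be illustrated with a short sketch. It assumes the baseline is simply 1/400 (or 1/441 with Xaa) and that any value above or below it counts as over- or underrepresented; the exact rule used for the colouring is not stated here, and the function names are hypothetical.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation per dipeptide, in per mille (20 types -> 400 pairs)."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected=expected_permille()):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # shown in red in the table
    if observed_permille < expected:
        return "under"  # shown in blue in the table
    return "expected"


# Two values taken from the table: AlaAla and CysAla.
ala_ala = classify(20.779)
cys_ala = classify(0.519)
```

With 20 residue types the baseline is 2.5‰, so AlaAla (20.779‰) falls well above it and CysAla (0.519‰) below it.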

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
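The unit conversion is simple arithmetic, shown here as a small sketch with hypothetical helper names, using the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(20.779)  # about 0.020779
ala_ala_percent = permille_to_percent(20.779)    # about 2.0779 %
```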

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.779 ± 5.284
AlaCys: 2.597 ± 0.851
AlaAsp: 3.117 ± 0.58
AlaGlu: 5.195 ± 1.606
AlaPhe: 3.117 ± 1.524
AlaGly: 9.87 ± 3.511
AlaHis: 2.597 ± 1.7
AlaIle: 2.078 ± 0.397
AlaLys: 9.87 ± 3.655
AlaLeu: 8.831 ± 1.821
AlaMet: 2.597 ± 1.373
AlaAsn: 2.597 ± 2.129
AlaPro: 7.273 ± 3.454
AlaGln: 4.156 ± 0.195
AlaArg: 3.636 ± 0.93
AlaSer: 11.948 ± 2.661
AlaThr: 9.87 ± 1.694
AlaVal: 4.675 ± 1.267
AlaTrp: 2.597 ± 1.7
AlaTyr: 1.558 ± 0.896
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.519 ± 0.464
CysCys: 1.039 ± 0.416
CysAsp: 2.078 ± 0.832
CysGlu: 1.558 ± 1.175
CysPhe: 1.558 ± 0.997
CysGly: 1.558 ± 0.804
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.039 ± 0.49
CysLeu: 1.039 ± 0.676
CysMet: 0.0 ± 0.0
CysAsn: 1.558 ± 0.626
CysPro: 0.0 ± 0.0
CysGln: 1.039 ± 0.571
CysArg: 2.597 ± 1.194
CysSer: 1.558 ± 0.626
CysThr: 0.0 ± 0.0
CysVal: 1.558 ± 0.535
CysTrp: 1.039 ± 0.929
CysTyr: 1.039 ± 0.929
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.156 ± 1.022
AspCys: 0.519 ± 0.528
AspAsp: 4.156 ± 2.037
AspGlu: 2.078 ± 1.17
AspPhe: 2.597 ± 0.542
AspGly: 4.156 ± 1.942
AspHis: 1.039 ± 0.929
AspIle: 0.519 ± 0.361
AspLys: 2.597 ± 1.382
AspLeu: 5.714 ± 0.521
AspMet: 0.0 ± 0.0
AspAsn: 2.597 ± 0.865
AspPro: 4.675 ± 2.237
AspGln: 1.039 ± 0.722
AspArg: 2.078 ± 1.099
AspSer: 1.558 ± 0.804
AspThr: 1.558 ± 0.953
AspVal: 2.078 ± 0.933
AspTrp: 2.597 ± 0.701
AspTyr: 1.039 ± 0.416
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.753 ± 1.771
GluCys: 1.039 ± 0.929
GluAsp: 2.078 ± 0.991
GluGlu: 6.753 ± 1.939
GluPhe: 1.558 ± 0.896
GluGly: 4.156 ± 1.216
GluHis: 2.597 ± 0.865
GluIle: 4.156 ± 1.32
GluLys: 4.675 ± 1.688
GluLeu: 3.636 ± 1.092
GluMet: 1.558 ± 0.706
GluAsn: 0.519 ± 0.361
GluPro: 5.195 ± 1.276
GluGln: 5.195 ± 1.52
GluArg: 4.675 ± 2.175
GluSer: 3.117 ± 1.069
GluThr: 3.636 ± 1.015
GluVal: 4.156 ± 0.62
GluTrp: 0.0 ± 0.0
GluTyr: 1.039 ± 0.416
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.714 ± 1.486
PheCys: 0.519 ± 0.464
PheAsp: 2.078 ± 0.647
PheGlu: 1.558 ± 0.706
PhePhe: 1.558 ± 1.393
PheGly: 2.597 ± 0.277
PheHis: 1.039 ± 0.49
PheIle: 0.519 ± 0.361
PheLys: 0.519 ± 0.464
PheLeu: 3.117 ± 1.147
PheMet: 1.558 ± 0.626
PheAsn: 1.039 ± 1.057
PhePro: 0.519 ± 0.464
PheGln: 0.0 ± 0.0
PheArg: 0.519 ± 0.361
PheSer: 2.597 ± 0.277
PheThr: 2.597 ± 0.999
PheVal: 2.597 ± 0.701
PheTrp: 0.519 ± 0.584
PheTyr: 1.558 ± 0.896
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.714 ± 1.86
GlyCys: 0.519 ± 0.361
GlyAsp: 2.078 ± 1.246
GlyGlu: 4.675 ± 1.267
GlyPhe: 2.597 ± 0.999
GlyGly: 6.234 ± 0.662
GlyHis: 1.039 ± 0.742
GlyIle: 1.558 ± 1.083
GlyLys: 6.234 ± 1.695
GlyLeu: 5.714 ± 3.056
GlyMet: 2.078 ± 1.052
GlyAsn: 3.117 ± 0.58
GlyPro: 4.675 ± 0.955
GlyGln: 2.078 ± 1.461
GlyArg: 3.636 ± 0.448
GlySer: 3.636 ± 1.161
GlyThr: 4.156 ± 1.337
GlyVal: 4.675 ± 1.064
GlyTrp: 3.117 ± 0.868
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.558 ± 1.083
HisCys: 0.519 ± 0.464
HisAsp: 1.039 ± 0.571
HisGlu: 0.519 ± 0.584
HisPhe: 0.0 ± 0.0
HisGly: 2.078 ± 0.813
HisHis: 0.519 ± 0.361
HisIle: 0.519 ± 0.584
HisLys: 1.039 ± 0.722
HisLeu: 1.039 ± 0.571
HisMet: 0.519 ± 0.464
HisAsn: 0.519 ± 0.464
HisPro: 2.078 ± 0.991
HisGln: 2.078 ± 0.783
HisArg: 1.039 ± 0.676
HisSer: 2.597 ± 1.194
HisThr: 0.519 ± 0.361
HisVal: 1.039 ± 0.571
HisTrp: 0.0 ± 0.0
HisTyr: 1.039 ± 0.49
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.117 ± 1.0
IleCys: 1.039 ± 0.49
IleAsp: 0.519 ± 0.361
IleGlu: 3.117 ± 0.331
IlePhe: 0.519 ± 0.464
IleGly: 2.078 ± 0.397
IleHis: 0.0 ± 0.0
IleIle: 1.039 ± 0.722
IleLys: 0.519 ± 0.361
IleLeu: 2.078 ± 1.048
IleMet: 1.039 ± 0.416
IleAsn: 0.0 ± 0.0
IlePro: 1.039 ± 0.49
IleGln: 1.039 ± 0.416
IleArg: 1.039 ± 0.742
IleSer: 3.117 ± 1.18
IleThr: 4.156 ± 1.32
IleVal: 1.558 ± 0.535
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.87 ± 3.973
LysCys: 0.519 ± 0.464
LysAsp: 2.078 ± 0.612
LysGlu: 3.636 ± 1.102
LysPhe: 2.597 ± 0.909
LysGly: 4.675 ± 2.201
LysHis: 0.519 ± 0.361
LysIle: 0.0 ± 0.0
LysLys: 7.792 ± 3.688
LysLeu: 7.792 ± 1.135
LysMet: 1.558 ± 0.804
LysAsn: 1.039 ± 0.742
LysPro: 5.195 ± 3.036
LysGln: 4.675 ± 2.077
LysArg: 3.636 ± 0.91
LysSer: 4.156 ± 0.62
LysThr: 2.078 ± 0.933
LysVal: 4.156 ± 1.337
LysTrp: 1.558 ± 0.337
LysTyr: 1.039 ± 0.49
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.312 ± 1.263
LeuCys: 0.519 ± 0.464
LeuAsp: 4.675 ± 0.401
LeuGlu: 7.792 ± 1.364
LeuPhe: 4.156 ± 1.565
LeuGly: 6.234 ± 1.153
LeuHis: 0.0 ± 0.0
LeuIle: 1.558 ± 0.706
LeuLys: 5.195 ± 0.293
LeuLeu: 8.312 ± 2.429
LeuMet: 2.078 ± 0.905
LeuAsn: 3.636 ± 0.93
LeuPro: 3.117 ± 0.995
LeuGln: 2.078 ± 0.402
LeuArg: 6.234 ± 1.941
LeuSer: 6.753 ± 1.645
LeuThr: 1.039 ± 0.571
LeuVal: 3.636 ± 0.906
LeuTrp: 1.558 ± 1.752
LeuTyr: 1.039 ± 0.49
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.039 ± 0.571
MetCys: 2.078 ± 1.04
MetAsp: 0.0 ± 0.0
MetGlu: 1.039 ± 0.416
MetPhe: 1.558 ± 0.706
MetGly: 1.558 ± 0.953
MetHis: 1.558 ± 1.175
MetIle: 0.519 ± 0.528
MetLys: 0.0 ± 0.0
MetLeu: 3.117 ± 0.331
MetMet: 0.519 ± 0.528
MetAsn: 0.519 ± 0.528
MetPro: 2.078 ± 0.402
MetGln: 1.558 ± 0.706
MetArg: 2.078 ± 0.397
MetSer: 1.039 ± 0.416
MetThr: 1.039 ± 0.416
MetVal: 2.078 ± 1.246
MetTrp: 0.0 ± 0.0
MetTyr: 0.519 ± 0.464
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.519 ± 0.528
AsnCys: 1.039 ± 0.416
AsnAsp: 1.558 ± 0.337
AsnGlu: 3.117 ± 1.042
AsnPhe: 1.039 ± 0.416
AsnGly: 1.039 ± 0.742
AsnHis: 1.039 ± 0.742
AsnIle: 1.039 ± 0.416
AsnLys: 2.078 ± 0.397
AsnLeu: 1.558 ± 0.997
AsnMet: 1.558 ± 0.997
AsnAsn: 1.039 ± 0.49
AsnPro: 2.078 ± 0.979
AsnGln: 1.039 ± 0.416
AsnArg: 0.0 ± 0.0
AsnSer: 1.558 ± 0.706
AsnThr: 2.078 ± 1.491
AsnVal: 2.597 ± 1.144
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.558 ± 0.626
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 11.948 ± 3.062
ProCys: 0.519 ± 0.584
ProAsp: 5.714 ± 1.304
ProGlu: 4.675 ± 1.438
ProPhe: 0.519 ± 0.528
ProGly: 2.597 ± 1.127
ProHis: 2.078 ± 1.444
ProIle: 3.117 ± 0.674
ProLys: 5.195 ± 2.168
ProLeu: 4.675 ± 1.709
ProMet: 0.519 ± 0.464
ProAsn: 1.039 ± 0.49
ProPro: 7.273 ± 1.779
ProGln: 2.597 ± 1.463
ProArg: 5.195 ± 1.893
ProSer: 3.117 ± 1.18
ProThr: 3.636 ± 1.087
ProVal: 3.636 ± 1.397
ProTrp: 1.039 ± 0.722
ProTyr: 1.039 ± 0.571
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.636 ± 1.258
GlnCys: 0.0 ± 0.0
GlnAsp: 3.117 ± 1.251
GlnGlu: 3.636 ± 1.521
GlnPhe: 1.558 ± 0.337
GlnGly: 2.597 ± 0.603
GlnHis: 1.039 ± 0.416
GlnIle: 1.558 ± 0.706
GlnLys: 2.078 ± 0.832
GlnLeu: 1.039 ± 0.929
GlnMet: 2.597 ± 1.173
GlnAsn: 0.519 ± 0.528
GlnPro: 1.558 ± 0.337
GlnGln: 1.558 ± 0.626
GlnArg: 3.636 ± 2.337
GlnSer: 2.078 ± 0.933
GlnThr: 4.675 ± 2.081
GlnVal: 0.519 ± 0.361
GlnTrp: 1.039 ± 0.742
GlnTyr: 1.039 ± 0.571
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.195 ± 1.893
ArgCys: 3.117 ± 1.609
ArgAsp: 2.597 ± 0.603
ArgGlu: 3.117 ± 1.251
ArgPhe: 3.117 ± 0.995
ArgGly: 4.156 ± 1.619
ArgHis: 0.519 ± 0.464
ArgIle: 2.597 ± 0.277
ArgLys: 3.636 ± 1.237
ArgLeu: 4.675 ± 1.647
ArgMet: 2.597 ± 0.58
ArgAsn: 2.597 ± 1.075
ArgPro: 3.117 ± 1.042
ArgGln: 3.117 ± 0.79
ArgArg: 5.195 ± 1.731
ArgSer: 2.597 ± 0.94
ArgThr: 3.636 ± 1.102
ArgVal: 4.156 ± 0.985
ArgTrp: 2.597 ± 1.373
ArgTyr: 2.078 ± 1.048
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.87 ± 1.778
SerCys: 1.039 ± 0.571
SerAsp: 3.636 ± 0.851
SerGlu: 2.597 ± 0.277
SerPhe: 2.597 ± 0.686
SerGly: 5.714 ± 1.115
SerHis: 0.519 ± 0.361
SerIle: 0.519 ± 0.584
SerLys: 5.714 ± 1.025
SerLeu: 5.195 ± 0.668
SerMet: 1.039 ± 0.571
SerAsn: 1.039 ± 0.929
SerPro: 4.156 ± 1.52
SerGln: 1.558 ± 0.679
SerArg: 5.195 ± 2.324
SerSer: 6.234 ± 1.155
SerThr: 4.156 ± 0.62
SerVal: 4.675 ± 2.077
SerTrp: 1.558 ± 0.337
SerTyr: 0.519 ± 0.361
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.831 ± 1.676
ThrCys: 0.0 ± 0.0
ThrAsp: 1.039 ± 0.722
ThrGlu: 4.675 ± 1.702
ThrPhe: 1.039 ± 0.585
ThrGly: 2.078 ± 0.979
ThrHis: 1.558 ± 0.679
ThrIle: 1.558 ± 0.896
ThrLys: 2.078 ± 1.061
ThrLeu: 2.597 ± 0.603
ThrMet: 0.0 ± 0.0
ThrAsn: 1.558 ± 0.953
ThrPro: 7.792 ± 1.718
ThrGln: 1.558 ± 0.618
ThrArg: 4.675 ± 1.435
ThrSer: 5.195 ± 0.703
ThrThr: 3.117 ± 1.813
ThrVal: 3.117 ± 1.712
ThrTrp: 1.039 ± 0.585
ThrTyr: 3.117 ± 0.674
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.312 ± 1.798
ValCys: 3.117 ± 0.926
ValAsp: 3.636 ± 0.91
ValGlu: 5.714 ± 1.779
ValPhe: 0.519 ± 0.528
ValGly: 3.117 ± 0.506
ValHis: 1.558 ± 1.149
ValIle: 1.039 ± 0.416
ValLys: 3.636 ± 1.547
ValLeu: 3.117 ± 0.926
ValMet: 1.039 ± 0.416
ValAsn: 1.039 ± 0.416
ValPro: 5.714 ± 1.505
ValGln: 0.519 ± 0.464
ValArg: 4.675 ± 1.3
ValSer: 2.078 ± 1.461
ValThr: 2.597 ± 0.701
ValVal: 5.195 ± 1.011
ValTrp: 1.558 ± 0.896
ValTyr: 1.039 ± 1.057
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.519 ± 0.464
TrpCys: 0.0 ± 0.0
TrpAsp: 1.558 ± 0.337
TrpGlu: 1.039 ± 0.929
TrpPhe: 0.519 ± 0.584
TrpGly: 0.519 ± 0.464
TrpHis: 0.519 ± 0.464
TrpIle: 1.039 ± 1.168
TrpLys: 2.078 ± 1.04
TrpLeu: 3.117 ± 1.147
TrpMet: 0.519 ± 0.361
TrpAsn: 0.519 ± 0.584
TrpPro: 2.078 ± 0.402
TrpGln: 1.558 ± 0.535
TrpArg: 3.117 ± 1.0
TrpSer: 0.519 ± 0.584
TrpThr: 1.558 ± 0.953
TrpVal: 2.078 ± 0.979
TrpTrp: 0.519 ± 0.528
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.597 ± 0.865
TyrCys: 1.039 ± 0.929
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 1.039 ± 0.49
TyrHis: 0.519 ± 0.464
TyrIle: 2.078 ± 0.402
TyrLys: 2.597 ± 1.7
TyrLeu: 1.558 ± 0.679
TyrMet: 0.0 ± 0.0
TyrAsn: 1.039 ± 0.416
TyrPro: 0.519 ± 0.528
TyrGln: 1.039 ± 1.057
TyrArg: 1.558 ± 0.997
TyrSer: 2.078 ± 0.612
TyrThr: 0.519 ± 0.528
TyrVal: 1.039 ± 0.571
TyrTrp: 1.039 ± 0.571
TyrTyr: 0.519 ± 0.464
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1926 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
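A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. Reading "×100" as 100 resamples, using the standard deviation as the error measure, and all function names are assumptions made for illustration, not a description of the database's actual code.

```python
import random
from collections import Counter
from statistics import pstdev


def dipeptide_permille(sequences):
    """Per mille frequency of each overlapping residue pair (one-letter codes)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread of a dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return pstdev(estimates)


# Toy proteome (hypothetical sequences, not the four proteins of this virus).
toy = ["AAKL", "AKA", "KLLA", "AAAA"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # dipeptide absent from every protein
```

A dipeptide that never occurs gets a frequency of 0 in every resample, so its error is 0 as well, which is consistent with the "0.0 ± 0.0" entries in the table.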

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski