Amino acid dipepetide frequency for Wuhan pillworm virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.805AlaAla: 1.805 ± 2.143
0.722AlaCys: 0.722 ± 0.209
4.152AlaAsp: 4.152 ± 1.043
2.708AlaGlu: 2.708 ± 0.713
1.444AlaPhe: 1.444 ± 0.495
2.166AlaGly: 2.166 ± 1.124
0.722AlaHis: 0.722 ± 0.209
3.249AlaIle: 3.249 ± 0.338
2.527AlaLys: 2.527 ± 0.369
4.513AlaLeu: 4.513 ± 1.737
1.264AlaMet: 1.264 ± 0.527
2.166AlaAsn: 2.166 ± 1.189
1.083AlaPro: 1.083 ± 0.403
0.542AlaGln: 0.542 ± 0.313
2.527AlaArg: 2.527 ± 0.773
5.596AlaSer: 5.596 ± 1.14
2.166AlaThr: 2.166 ± 0.204
2.888AlaVal: 2.888 ± 0.9
0.542AlaTrp: 0.542 ± 0.313
1.083AlaTyr: 1.083 ± 0.562
0.0AlaXaa: 0.0 ± 0.0
Cys
0.542CysAla: 0.542 ± 0.352
0.361CysCys: 0.361 ± 0.395
2.166CysAsp: 2.166 ± 1.707
1.264CysGlu: 1.264 ± 0.598
1.444CysPhe: 1.444 ± 0.283
1.444CysGly: 1.444 ± 0.628
0.361CysHis: 0.361 ± 0.157
2.166CysIle: 2.166 ± 0.796
2.888CysLys: 2.888 ± 1.178
2.708CysLeu: 2.708 ± 0.482
1.264CysMet: 1.264 ± 0.402
1.264CysAsn: 1.264 ± 0.458
0.361CysPro: 0.361 ± 0.157
1.625CysGln: 1.625 ± 0.658
0.903CysArg: 0.903 ± 0.311
1.444CysSer: 1.444 ± 0.595
1.264CysThr: 1.264 ± 0.199
1.083CysVal: 1.083 ± 0.488
0.361CysTrp: 0.361 ± 0.413
0.903CysTyr: 0.903 ± 0.505
0.0CysXaa: 0.0 ± 0.0
Asp
2.347AspAla: 2.347 ± 0.29
1.444AspCys: 1.444 ± 0.215
6.137AspAsp: 6.137 ± 0.898
5.957AspGlu: 5.957 ± 0.995
2.347AspPhe: 2.347 ± 0.034
1.986AspGly: 1.986 ± 0.464
1.444AspHis: 1.444 ± 0.353
5.235AspIle: 5.235 ± 0.553
4.513AspLys: 4.513 ± 0.786
5.957AspLeu: 5.957 ± 0.828
1.264AspMet: 1.264 ± 0.409
4.152AspAsn: 4.152 ± 0.915
2.166AspPro: 2.166 ± 0.724
2.347AspGln: 2.347 ± 0.594
3.61AspArg: 3.61 ± 0.721
5.596AspSer: 5.596 ± 0.54
1.805AspThr: 1.805 ± 0.988
4.874AspVal: 4.874 ± 1.26
1.625AspTrp: 1.625 ± 0.422
1.625AspTyr: 1.625 ± 0.522
0.0AspXaa: 0.0 ± 0.0
Glu
1.444GluAla: 1.444 ± 0.518
1.444GluCys: 1.444 ± 0.769
4.332GluAsp: 4.332 ± 0.562
3.43GluGlu: 3.43 ± 0.985
2.888GluPhe: 2.888 ± 0.553
1.986GluGly: 1.986 ± 0.5
0.542GluHis: 0.542 ± 0.242
3.43GluIle: 3.43 ± 0.633
3.971GluLys: 3.971 ± 0.744
8.123GluLeu: 8.123 ± 0.745
2.888GluMet: 2.888 ± 0.724
3.971GluAsn: 3.971 ± 1.425
1.986GluPro: 1.986 ± 0.631
1.625GluGln: 1.625 ± 0.625
2.708GluArg: 2.708 ± 0.345
7.762GluSer: 7.762 ± 1.348
2.527GluThr: 2.527 ± 0.215
4.152GluVal: 4.152 ± 1.043
0.361GluTrp: 0.361 ± 0.157
1.805GluTyr: 1.805 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
1.444PheAla: 1.444 ± 0.643
1.083PheCys: 1.083 ± 0.476
1.805PheAsp: 1.805 ± 0.433
0.903PheGlu: 0.903 ± 0.311
0.542PhePhe: 0.542 ± 0.307
1.444PheGly: 1.444 ± 0.353
0.722PheHis: 0.722 ± 0.471
2.527PheIle: 2.527 ± 1.182
3.61PheLys: 3.61 ± 0.619
3.971PheLeu: 3.971 ± 0.681
1.083PheMet: 1.083 ± 0.45
2.888PheAsn: 2.888 ± 0.661
0.722PhePro: 0.722 ± 0.337
0.722PheGln: 0.722 ± 0.268
2.527PheArg: 2.527 ± 0.719
3.249PheSer: 3.249 ± 1.082
1.444PheThr: 1.444 ± 0.496
1.625PheVal: 1.625 ± 0.288
0.542PheTrp: 0.542 ± 0.307
1.625PheTyr: 1.625 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
0.903GlyAla: 0.903 ± 0.327
1.444GlyCys: 1.444 ± 0.87
2.708GlyAsp: 2.708 ± 0.225
3.61GlyGlu: 3.61 ± 1.226
2.708GlyPhe: 2.708 ± 0.38
3.61GlyGly: 3.61 ± 1.188
0.903GlyHis: 0.903 ± 0.603
3.249GlyIle: 3.249 ± 1.009
2.527GlyLys: 2.527 ± 1.65
4.152GlyLeu: 4.152 ± 0.834
1.264GlyMet: 1.264 ± 0.63
2.708GlyAsn: 2.708 ± 0.657
0.722GlyPro: 0.722 ± 0.268
0.903GlyGln: 0.903 ± 0.576
2.708GlyArg: 2.708 ± 0.482
3.791GlySer: 3.791 ± 0.794
2.708GlyThr: 2.708 ± 0.469
2.888GlyVal: 2.888 ± 1.256
0.722GlyTrp: 0.722 ± 0.227
2.347GlyTyr: 2.347 ± 1.417
0.0GlyXaa: 0.0 ± 0.0
His
0.542HisAla: 0.542 ± 0.242
0.542HisCys: 0.542 ± 0.166
1.083HisAsp: 1.083 ± 0.226
0.542HisGlu: 0.542 ± 0.307
1.264HisPhe: 1.264 ± 0.527
0.542HisGly: 0.542 ± 0.307
0.181HisHis: 0.181 ± 0.207
1.625HisIle: 1.625 ± 0.256
1.264HisLys: 1.264 ± 0.199
2.347HisLeu: 2.347 ± 0.661
0.722HisMet: 0.722 ± 0.227
0.722HisAsn: 0.722 ± 0.514
0.181HisPro: 0.181 ± 0.308
0.542HisGln: 0.542 ± 0.166
0.903HisArg: 0.903 ± 0.307
1.625HisSer: 1.625 ± 0.658
0.903HisThr: 0.903 ± 0.311
0.542HisVal: 0.542 ± 0.352
0.181HisTrp: 0.181 ± 0.102
0.361HisTyr: 0.361 ± 0.205
0.0HisXaa: 0.0 ± 0.0
Ile
3.971IleAla: 3.971 ± 0.268
1.444IleCys: 1.444 ± 0.33
3.971IleAsp: 3.971 ± 0.662
5.054IleGlu: 5.054 ± 0.692
2.347IlePhe: 2.347 ± 0.575
3.61IleGly: 3.61 ± 0.994
1.264IleHis: 1.264 ± 0.488
5.957IleIle: 5.957 ± 1.188
7.762IleLys: 7.762 ± 1.708
7.762IleLeu: 7.762 ± 1.127
3.61IleMet: 3.61 ± 1.094
3.43IleAsn: 3.43 ± 1.613
2.166IlePro: 2.166 ± 0.557
2.166IleGln: 2.166 ± 0.739
3.61IleArg: 3.61 ± 0.736
8.123IleSer: 8.123 ± 0.647
3.971IleThr: 3.971 ± 0.431
2.527IleVal: 2.527 ± 0.215
0.722IleTrp: 0.722 ± 0.556
1.264IleTyr: 1.264 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
4.152LysAla: 4.152 ± 0.795
2.347LysCys: 2.347 ± 0.564
4.152LysAsp: 4.152 ± 0.613
3.249LysGlu: 3.249 ± 0.697
2.166LysPhe: 2.166 ± 0.488
4.332LysGly: 4.332 ± 0.714
1.264LysHis: 1.264 ± 0.488
5.415LysIle: 5.415 ± 1.485
4.332LysLys: 4.332 ± 1.088
10.108LysLeu: 10.108 ± 3.11
2.708LysMet: 2.708 ± 0.567
1.986LysAsn: 1.986 ± 0.936
2.166LysPro: 2.166 ± 0.357
2.708LysGln: 2.708 ± 0.514
4.152LysArg: 4.152 ± 0.474
6.498LysSer: 6.498 ± 0.69
3.069LysThr: 3.069 ± 0.334
4.513LysVal: 4.513 ± 2.327
0.903LysTrp: 0.903 ± 0.512
1.805LysTyr: 1.805 ± 0.641
0.0LysXaa: 0.0 ± 0.0
Leu
4.152LeuAla: 4.152 ± 1.191
2.347LeuCys: 2.347 ± 0.285
4.874LeuAsp: 4.874 ± 0.495
4.874LeuGlu: 4.874 ± 1.41
1.805LeuPhe: 1.805 ± 0.518
5.054LeuGly: 5.054 ± 0.643
0.903LeuHis: 0.903 ± 0.327
8.845LeuIle: 8.845 ± 0.768
8.845LeuLys: 8.845 ± 1.708
8.484LeuLeu: 8.484 ± 1.508
3.43LeuMet: 3.43 ± 0.55
9.206LeuAsn: 9.206 ± 0.63
2.708LeuPro: 2.708 ± 0.831
1.805LeuGln: 1.805 ± 0.481
6.137LeuArg: 6.137 ± 1.14
11.191LeuSer: 11.191 ± 1.074
7.04LeuThr: 7.04 ± 0.497
6.318LeuVal: 6.318 ± 1.119
1.444LeuTrp: 1.444 ± 0.283
3.971LeuTyr: 3.971 ± 0.34
0.0LeuXaa: 0.0 ± 0.0
Met
2.347MetAla: 2.347 ± 0.942
0.903MetCys: 0.903 ± 0.65
1.986MetAsp: 1.986 ± 0.775
1.805MetGlu: 1.805 ± 0.621
0.903MetPhe: 0.903 ± 0.311
0.903MetGly: 0.903 ± 0.307
0.542MetHis: 0.542 ± 0.166
2.527MetIle: 2.527 ± 0.471
2.347MetLys: 2.347 ± 0.823
3.069MetLeu: 3.069 ± 0.992
1.083MetMet: 1.083 ± 0.402
2.166MetAsn: 2.166 ± 0.09
0.361MetPro: 0.361 ± 0.33
0.903MetGln: 0.903 ± 0.214
1.805MetArg: 1.805 ± 0.271
3.61MetSer: 3.61 ± 0.438
1.625MetThr: 1.625 ± 0.612
1.986MetVal: 1.986 ± 0.396
0.361MetTrp: 0.361 ± 0.33
1.444MetTyr: 1.444 ± 0.383
0.0MetXaa: 0.0 ± 0.0
Asn
1.264AsnAla: 1.264 ± 0.385
1.805AsnCys: 1.805 ± 0.663
3.61AsnAsp: 3.61 ± 0.371
5.235AsnGlu: 5.235 ± 0.647
1.444AsnPhe: 1.444 ± 0.727
2.527AsnGly: 2.527 ± 1.237
1.083AsnHis: 1.083 ± 0.471
4.332AsnIle: 4.332 ± 0.557
4.513AsnLys: 4.513 ± 0.462
5.415AsnLeu: 5.415 ± 0.851
1.083AsnMet: 1.083 ± 0.191
3.069AsnAsn: 3.069 ± 0.481
0.903AsnPro: 0.903 ± 0.489
2.527AsnGln: 2.527 ± 0.529
1.805AsnArg: 1.805 ± 0.798
5.235AsnSer: 5.235 ± 0.789
2.888AsnThr: 2.888 ± 0.365
4.152AsnVal: 4.152 ± 0.556
1.083AsnTrp: 1.083 ± 0.332
4.332AsnTyr: 4.332 ± 0.743
0.0AsnXaa: 0.0 ± 0.0
Pro
0.903ProAla: 0.903 ± 0.169
0.181ProCys: 0.181 ± 0.207
2.166ProAsp: 2.166 ± 0.712
1.625ProGlu: 1.625 ± 0.297
0.722ProPhe: 0.722 ± 0.41
1.805ProGly: 1.805 ± 0.56
0.181ProHis: 0.181 ± 0.102
1.264ProIle: 1.264 ± 0.66
2.166ProLys: 2.166 ± 0.499
1.444ProLeu: 1.444 ± 0.33
0.542ProMet: 0.542 ± 0.313
1.444ProAsn: 1.444 ± 0.579
1.264ProPro: 1.264 ± 0.5
0.361ProGln: 0.361 ± 0.157
1.264ProArg: 1.264 ± 0.708
3.249ProSer: 3.249 ± 0.867
2.347ProThr: 2.347 ± 1.084
1.083ProVal: 1.083 ± 0.484
0.361ProTrp: 0.361 ± 0.205
0.903ProTyr: 0.903 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
1.083GlnAla: 1.083 ± 0.683
1.083GlnCys: 1.083 ± 0.705
1.083GlnAsp: 1.083 ± 0.45
1.264GlnGlu: 1.264 ± 0.531
1.625GlnPhe: 1.625 ± 0.874
0.903GlnGly: 0.903 ± 0.609
0.361GlnHis: 0.361 ± 0.33
1.805GlnIle: 1.805 ± 0.327
1.625GlnLys: 1.625 ± 0.341
3.249GlnLeu: 3.249 ± 0.328
1.264GlnMet: 1.264 ± 0.5
1.986GlnAsn: 1.986 ± 0.716
0.361GlnPro: 0.361 ± 0.257
0.542GlnGln: 0.542 ± 0.307
1.264GlnArg: 1.264 ± 0.385
2.708GlnSer: 2.708 ± 0.482
1.805GlnThr: 1.805 ± 0.531
1.083GlnVal: 1.083 ± 0.251
0.0GlnTrp: 0.0 ± 0.0
0.722GlnTyr: 0.722 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
3.791ArgAla: 3.791 ± 0.483
1.805ArgCys: 1.805 ± 0.337
2.708ArgAsp: 2.708 ± 0.634
3.069ArgGlu: 3.069 ± 0.981
2.527ArgPhe: 2.527 ± 0.274
1.986ArgGly: 1.986 ± 0.899
1.083ArgHis: 1.083 ± 0.403
2.527ArgIle: 2.527 ± 0.682
3.069ArgLys: 3.069 ± 0.981
6.318ArgLeu: 6.318 ± 0.86
1.986ArgMet: 1.986 ± 0.747
3.971ArgAsn: 3.971 ± 0.283
0.722ArgPro: 0.722 ± 0.209
0.903ArgGln: 0.903 ± 0.512
2.708ArgArg: 2.708 ± 0.583
5.957ArgSer: 5.957 ± 1.305
2.888ArgThr: 2.888 ± 0.756
4.513ArgVal: 4.513 ± 0.961
0.542ArgTrp: 0.542 ± 0.313
1.444ArgTyr: 1.444 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
5.235SerAla: 5.235 ± 1.044
2.166SerCys: 2.166 ± 1.164
9.928SerAsp: 9.928 ± 1.983
6.318SerGlu: 6.318 ± 1.127
3.971SerPhe: 3.971 ± 1.189
3.971SerGly: 3.971 ± 0.854
1.986SerHis: 1.986 ± 0.615
7.762SerIle: 7.762 ± 0.668
6.318SerLys: 6.318 ± 0.638
10.108SerLeu: 10.108 ± 0.382
1.625SerMet: 1.625 ± 0.341
3.971SerAsn: 3.971 ± 0.581
2.888SerPro: 2.888 ± 0.675
1.625SerGln: 1.625 ± 0.422
5.235SerArg: 5.235 ± 1.562
11.011SerSer: 11.011 ± 1.527
4.693SerThr: 4.693 ± 1.144
7.762SerVal: 7.762 ± 0.379
1.264SerTrp: 1.264 ± 0.5
3.791SerTyr: 3.791 ± 0.742
0.0SerXaa: 0.0 ± 0.0
Thr
3.249ThrAla: 3.249 ± 0.325
2.166ThrCys: 2.166 ± 1.124
3.61ThrAsp: 3.61 ± 1.28
2.888ThrGlu: 2.888 ± 0.551
1.264ThrPhe: 1.264 ± 0.486
3.61ThrGly: 3.61 ± 1.852
0.903ThrHis: 0.903 ± 0.338
4.513ThrIle: 4.513 ± 0.955
3.069ThrLys: 3.069 ± 0.429
3.971ThrLeu: 3.971 ± 1.25
1.986ThrMet: 1.986 ± 0.409
2.708ThrAsn: 2.708 ± 0.432
0.722ThrPro: 0.722 ± 0.248
1.264ThrGln: 1.264 ± 0.445
2.888ThrArg: 2.888 ± 0.881
6.498ThrSer: 6.498 ± 1.845
2.527ThrThr: 2.527 ± 0.573
3.069ThrVal: 3.069 ± 0.855
0.181ThrTrp: 0.181 ± 0.102
0.903ThrTyr: 0.903 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
3.61ValAla: 3.61 ± 1.047
1.264ValCys: 1.264 ± 0.732
2.888ValAsp: 2.888 ± 0.6
4.513ValGlu: 4.513 ± 1.248
0.722ValPhe: 0.722 ± 0.41
2.527ValGly: 2.527 ± 0.75
1.805ValHis: 1.805 ± 0.291
5.415ValIle: 5.415 ± 0.62
3.249ValLys: 3.249 ± 0.747
6.498ValLeu: 6.498 ± 0.976
2.166ValMet: 2.166 ± 0.89
4.332ValAsn: 4.332 ± 1.148
1.625ValPro: 1.625 ± 0.525
0.722ValGln: 0.722 ± 0.337
4.693ValArg: 4.693 ± 0.585
3.791ValSer: 3.791 ± 1.072
4.693ValThr: 4.693 ± 1.336
3.069ValVal: 3.069 ± 0.444
0.361ValTrp: 0.361 ± 0.395
1.444ValTyr: 1.444 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
0.542TrpAla: 0.542 ± 0.166
0.542TrpCys: 0.542 ± 0.307
0.542TrpAsp: 0.542 ± 0.166
0.181TrpGlu: 0.181 ± 0.207
0.903TrpPhe: 0.903 ± 0.512
0.181TrpGly: 0.181 ± 0.102
0.0TrpHis: 0.0 ± 0.0
0.361TrpIle: 0.361 ± 0.33
0.722TrpLys: 0.722 ± 0.227
2.347TrpLeu: 2.347 ± 0.719
0.181TrpMet: 0.181 ± 0.207
0.361TrpAsn: 0.361 ± 0.413
0.722TrpPro: 0.722 ± 0.471
0.722TrpGln: 0.722 ± 0.227
1.083TrpArg: 1.083 ± 0.519
1.083TrpSer: 1.083 ± 0.332
0.542TrpThr: 0.542 ± 0.313
0.361TrpVal: 0.361 ± 0.205
0.181TrpTrp: 0.181 ± 0.207
0.361TrpTyr: 0.361 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.542TyrAla: 0.542 ± 0.242
0.722TyrCys: 0.722 ± 0.423
2.527TyrAsp: 2.527 ± 0.757
2.888TyrGlu: 2.888 ± 1.144
1.444TyrPhe: 1.444 ± 0.386
1.986TyrGly: 1.986 ± 0.971
0.361TyrHis: 0.361 ± 0.205
2.527TyrIle: 2.527 ± 0.719
2.888TyrLys: 2.888 ± 0.899
2.888TyrLeu: 2.888 ± 0.574
1.083TyrMet: 1.083 ± 0.436
1.805TyrAsn: 1.805 ± 0.675
1.444TyrPro: 1.444 ± 0.353
1.264TyrGln: 1.264 ± 0.199
1.986TyrArg: 1.986 ± 0.409
3.61TyrSer: 3.61 ± 1.145
1.083TyrThr: 1.083 ± 0.471
0.903TyrVal: 0.903 ± 0.169
0.181TyrTrp: 0.181 ± 0.102
1.083TyrTyr: 1.083 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (5541 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski