Amino acid dipepetide frequency for Wenzhou hepe-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.316AlaAla: 3.316 ± 0.987
0.0AlaCys: 0.0 ± 0.0
3.015AlaAsp: 3.015 ± 1.432
3.316AlaGlu: 3.316 ± 1.601
3.316AlaPhe: 3.316 ± 1.569
2.412AlaGly: 2.412 ± 0.703
0.904AlaHis: 0.904 ± 0.352
5.427AlaIle: 5.427 ± 1.901
5.125AlaLys: 5.125 ± 1.62
3.919AlaLeu: 3.919 ± 1.078
1.507AlaMet: 1.507 ± 0.717
2.713AlaAsn: 2.713 ± 0.902
2.11AlaPro: 2.11 ± 0.826
0.904AlaGln: 0.904 ± 1.172
3.015AlaArg: 3.015 ± 1.745
4.824AlaSer: 4.824 ± 1.435
4.824AlaThr: 4.824 ± 1.742
5.427AlaVal: 5.427 ± 2.11
0.0AlaTrp: 0.0 ± 0.0
3.618AlaTyr: 3.618 ± 1.063
0.0AlaXaa: 0.0 ± 0.0
Cys
0.904CysAla: 0.904 ± 0.352
0.301CysCys: 0.301 ± 0.174
0.603CysAsp: 0.603 ± 0.349
1.206CysGlu: 1.206 ± 0.45
0.0CysPhe: 0.0 ± 0.0
1.206CysGly: 1.206 ± 0.45
0.301CysHis: 0.301 ± 0.832
1.206CysIle: 1.206 ± 0.698
0.603CysLys: 0.603 ± 0.602
1.206CysLeu: 1.206 ± 0.45
0.301CysMet: 0.301 ± 0.174
0.301CysAsn: 0.301 ± 0.447
0.301CysPro: 0.301 ± 0.385
0.0CysGln: 0.0 ± 0.0
0.904CysArg: 0.904 ± 0.523
0.603CysSer: 0.603 ± 0.325
0.301CysThr: 0.301 ± 0.174
1.507CysVal: 1.507 ± 0.596
0.0CysTrp: 0.0 ± 0.0
0.603CysTyr: 0.603 ± 0.349
0.0CysXaa: 0.0 ± 0.0
Asp
2.412AspAla: 2.412 ± 1.003
0.603AspCys: 0.603 ± 0.77
5.125AspAsp: 5.125 ± 1.62
6.03AspGlu: 6.03 ± 3.991
5.125AspPhe: 5.125 ± 1.784
2.412AspGly: 2.412 ± 1.062
1.507AspHis: 1.507 ± 0.656
6.03AspIle: 6.03 ± 1.815
4.221AspLys: 4.221 ± 0.98
2.713AspLeu: 2.713 ± 1.615
2.11AspMet: 2.11 ± 0.671
3.618AspAsn: 3.618 ± 1.125
4.221AspPro: 4.221 ± 1.969
1.507AspGln: 1.507 ± 0.46
4.221AspArg: 4.221 ± 1.05
5.427AspSer: 5.427 ± 1.537
4.522AspThr: 4.522 ± 1.374
3.919AspVal: 3.919 ± 1.79
0.603AspTrp: 0.603 ± 0.349
1.809AspTyr: 1.809 ± 1.08
0.0AspXaa: 0.0 ± 0.0
Glu
3.618GluAla: 3.618 ± 1.278
0.603GluCys: 0.603 ± 0.325
5.125GluAsp: 5.125 ± 2.482
7.235GluGlu: 7.235 ± 4.786
1.809GluPhe: 1.809 ± 0.86
5.125GluGly: 5.125 ± 1.197
1.809GluHis: 1.809 ± 1.047
4.221GluIle: 4.221 ± 0.681
2.713GluLys: 2.713 ± 0.867
4.522GluLeu: 4.522 ± 1.475
1.206GluMet: 1.206 ± 0.45
2.11GluAsn: 2.11 ± 0.923
2.713GluPro: 2.713 ± 0.464
3.316GluGln: 3.316 ± 1.059
1.206GluArg: 1.206 ± 0.698
4.522GluSer: 4.522 ± 1.677
5.427GluThr: 5.427 ± 3.718
3.618GluVal: 3.618 ± 1.642
0.904GluTrp: 0.904 ± 0.691
2.11GluTyr: 2.11 ± 1.136
0.0GluXaa: 0.0 ± 0.0
Phe
3.316PheAla: 3.316 ± 1.32
1.507PheCys: 1.507 ± 0.872
5.125PheAsp: 5.125 ± 1.372
3.919PheGlu: 3.919 ± 1.352
2.11PhePhe: 2.11 ± 0.941
4.824PheGly: 4.824 ± 0.899
1.206PheHis: 1.206 ± 0.657
3.015PheIle: 3.015 ± 1.095
3.316PheLys: 3.316 ± 2.286
2.412PheLeu: 2.412 ± 1.268
1.809PheMet: 1.809 ± 0.787
2.11PheAsn: 2.11 ± 0.994
1.507PhePro: 1.507 ± 1.095
2.713PheGln: 2.713 ± 3.469
2.412PheArg: 2.412 ± 0.441
3.316PheSer: 3.316 ± 1.086
1.809PheThr: 1.809 ± 0.704
3.618PheVal: 3.618 ± 0.838
0.603PheTrp: 0.603 ± 0.325
2.412PheTyr: 2.412 ± 0.714
0.0PheXaa: 0.0 ± 0.0
Gly
1.507GlyAla: 1.507 ± 0.717
1.206GlyCys: 1.206 ± 0.698
5.427GlyAsp: 5.427 ± 1.943
2.713GlyGlu: 2.713 ± 0.793
2.11GlyPhe: 2.11 ± 0.897
2.713GlyGly: 2.713 ± 1.761
2.713GlyHis: 2.713 ± 0.464
3.316GlyIle: 3.316 ± 1.083
2.713GlyLys: 2.713 ± 0.948
3.919GlyLeu: 3.919 ± 1.061
1.809GlyMet: 1.809 ± 1.537
2.713GlyAsn: 2.713 ± 1.229
1.507GlyPro: 1.507 ± 0.46
2.713GlyGln: 2.713 ± 0.867
3.919GlyArg: 3.919 ± 1.578
2.11GlySer: 2.11 ± 1.21
3.618GlyThr: 3.618 ± 1.345
3.919GlyVal: 3.919 ± 1.079
0.0GlyTrp: 0.0 ± 0.0
2.412GlyTyr: 2.412 ± 1.102
0.0GlyXaa: 0.0 ± 0.0
His
1.809HisAla: 1.809 ± 0.821
0.301HisCys: 0.301 ± 0.174
2.11HisAsp: 2.11 ± 1.312
2.11HisGlu: 2.11 ± 1.359
1.206HisPhe: 1.206 ± 0.698
1.507HisGly: 1.507 ± 0.643
0.301HisHis: 0.301 ± 0.385
0.301HisIle: 0.301 ± 0.174
2.11HisLys: 2.11 ± 0.466
1.206HisLeu: 1.206 ± 0.706
0.301HisMet: 0.301 ± 0.174
1.507HisAsn: 1.507 ± 1.457
0.603HisPro: 0.603 ± 0.349
0.0HisGln: 0.0 ± 0.0
0.603HisArg: 0.603 ± 0.325
2.412HisSer: 2.412 ± 0.911
2.713HisThr: 2.713 ± 1.181
2.412HisVal: 2.412 ± 0.559
0.301HisTrp: 0.301 ± 0.174
1.507HisTyr: 1.507 ± 1.058
0.0HisXaa: 0.0 ± 0.0
Ile
4.221IleAla: 4.221 ± 2.084
0.301IleCys: 0.301 ± 0.385
6.331IleAsp: 6.331 ± 2.14
3.316IleGlu: 3.316 ± 1.555
3.015IlePhe: 3.015 ± 2.287
4.522IleGly: 4.522 ± 0.927
1.206IleHis: 1.206 ± 1.081
3.618IleIle: 3.618 ± 1.299
4.824IleLys: 4.824 ± 1.344
6.934IleLeu: 6.934 ± 4.441
2.11IleMet: 2.11 ± 0.671
3.316IleAsn: 3.316 ± 0.828
3.919IlePro: 3.919 ± 1.236
2.713IleGln: 2.713 ± 1.03
1.507IleArg: 1.507 ± 0.643
2.713IleSer: 2.713 ± 2.306
4.221IleThr: 4.221 ± 1.501
3.316IleVal: 3.316 ± 2.95
0.0IleTrp: 0.0 ± 0.0
3.919IleTyr: 3.919 ± 1.331
0.0IleXaa: 0.0 ± 0.0
Lys
6.934LysAla: 6.934 ± 2.951
0.603LysCys: 0.603 ± 0.349
3.015LysAsp: 3.015 ± 2.017
5.125LysGlu: 5.125 ± 1.806
3.316LysPhe: 3.316 ± 1.083
4.221LysGly: 4.221 ± 1.155
0.904LysHis: 0.904 ± 0.523
3.919LysIle: 3.919 ± 0.956
3.618LysLys: 3.618 ± 1.978
7.235LysLeu: 7.235 ± 1.452
1.206LysMet: 1.206 ± 1.41
2.713LysAsn: 2.713 ± 0.916
3.316LysPro: 3.316 ± 0.652
1.809LysGln: 1.809 ± 1.047
2.412LysArg: 2.412 ± 1.045
3.919LysSer: 3.919 ± 2.077
3.618LysThr: 3.618 ± 1.61
2.713LysVal: 2.713 ± 3.296
0.904LysTrp: 0.904 ± 0.43
2.412LysTyr: 2.412 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
3.618LeuAla: 3.618 ± 0.838
0.603LeuCys: 0.603 ± 0.349
5.125LeuAsp: 5.125 ± 0.789
2.11LeuGlu: 2.11 ± 0.671
3.618LeuPhe: 3.618 ± 2.882
3.618LeuGly: 3.618 ± 0.942
2.11LeuHis: 2.11 ± 2.059
3.618LeuIle: 3.618 ± 2.218
5.728LeuLys: 5.728 ± 1.005
4.522LeuLeu: 4.522 ± 0.774
2.412LeuMet: 2.412 ± 0.846
3.618LeuAsn: 3.618 ± 1.299
4.221LeuPro: 4.221 ± 2.17
4.221LeuGln: 4.221 ± 1.531
4.221LeuArg: 4.221 ± 1.491
4.522LeuSer: 4.522 ± 2.147
3.919LeuThr: 3.919 ± 1.688
3.618LeuVal: 3.618 ± 1.247
0.301LeuTrp: 0.301 ± 0.174
3.316LeuTyr: 3.316 ± 2.533
0.0LeuXaa: 0.0 ± 0.0
Met
2.11MetAla: 2.11 ± 0.804
0.0MetCys: 0.0 ± 0.0
1.809MetAsp: 1.809 ± 0.549
1.809MetGlu: 1.809 ± 1.146
0.603MetPhe: 0.603 ± 0.325
1.206MetGly: 1.206 ± 0.698
1.507MetHis: 1.507 ± 0.585
0.904MetIle: 0.904 ± 0.352
0.301MetLys: 0.301 ± 0.174
2.11MetLeu: 2.11 ± 0.466
1.206MetMet: 1.206 ± 0.45
1.809MetAsn: 1.809 ± 0.821
0.904MetPro: 0.904 ± 0.691
0.603MetGln: 0.603 ± 0.349
1.809MetArg: 1.809 ± 0.549
3.015MetSer: 3.015 ± 0.9
0.904MetThr: 0.904 ± 0.523
3.316MetVal: 3.316 ± 1.116
0.301MetTrp: 0.301 ± 0.174
3.316MetTyr: 3.316 ± 2.77
0.0MetXaa: 0.0 ± 0.0
Asn
3.015AsnAla: 3.015 ± 0.9
0.603AsnCys: 0.603 ± 0.349
2.713AsnAsp: 2.713 ± 0.793
1.507AsnGlu: 1.507 ± 0.585
4.221AsnPhe: 4.221 ± 1.172
0.301AsnGly: 0.301 ± 0.385
1.507AsnHis: 1.507 ± 0.585
6.03AsnIle: 6.03 ± 2.581
2.11AsnLys: 2.11 ± 0.645
4.221AsnLeu: 4.221 ± 2.598
0.904AsnMet: 0.904 ± 0.523
1.507AsnAsn: 1.507 ± 1.058
2.11AsnPro: 2.11 ± 0.961
2.713AsnGln: 2.713 ± 1.122
3.015AsnArg: 3.015 ± 1.399
2.412AsnSer: 2.412 ± 1.037
3.015AsnThr: 3.015 ± 1.432
5.427AsnVal: 5.427 ± 1.445
0.904AsnTrp: 0.904 ± 0.523
1.507AsnTyr: 1.507 ± 0.46
0.0AsnXaa: 0.0 ± 0.0
Pro
1.507ProAla: 1.507 ± 0.823
0.301ProCys: 0.301 ± 0.174
2.11ProAsp: 2.11 ± 1.332
3.015ProGlu: 3.015 ± 0.853
2.11ProPhe: 2.11 ± 0.935
2.11ProGly: 2.11 ± 0.79
0.0ProHis: 0.0 ± 0.0
3.015ProIle: 3.015 ± 2.265
3.919ProLys: 3.919 ± 3.403
3.316ProLeu: 3.316 ± 1.086
0.904ProMet: 0.904 ± 0.523
3.919ProAsn: 3.919 ± 1.205
1.206ProPro: 1.206 ± 0.706
1.507ProGln: 1.507 ± 0.717
3.919ProArg: 3.919 ± 1.148
2.11ProSer: 2.11 ± 1.276
2.412ProThr: 2.412 ± 0.829
3.618ProVal: 3.618 ± 2.118
0.301ProTrp: 0.301 ± 0.385
2.412ProTyr: 2.412 ± 0.999
0.0ProXaa: 0.0 ± 0.0
Gln
3.316GlnAla: 3.316 ± 1.21
0.0GlnCys: 0.0 ± 0.0
3.015GlnAsp: 3.015 ± 1.095
1.206GlnGlu: 1.206 ± 2.406
3.618GlnPhe: 3.618 ± 0.971
1.809GlnGly: 1.809 ± 0.737
1.206GlnHis: 1.206 ± 0.45
3.015GlnIle: 3.015 ± 1.171
3.919GlnLys: 3.919 ± 1.858
1.809GlnLeu: 1.809 ± 0.549
0.603GlnMet: 0.603 ± 0.325
1.809GlnAsn: 1.809 ± 2.297
1.507GlnPro: 1.507 ± 1.094
2.412GlnGln: 2.412 ± 1.201
0.603GlnArg: 0.603 ± 0.402
2.11GlnSer: 2.11 ± 1.129
1.507GlnThr: 1.507 ± 0.46
3.015GlnVal: 3.015 ± 0.424
0.301GlnTrp: 0.301 ± 0.447
1.507GlnTyr: 1.507 ± 1.716
0.0GlnXaa: 0.0 ± 0.0
Arg
2.11ArgAla: 2.11 ± 0.645
1.507ArgCys: 1.507 ± 1.451
2.412ArgAsp: 2.412 ± 0.829
2.713ArgGlu: 2.713 ± 1.181
2.713ArgPhe: 2.713 ± 0.777
2.713ArgGly: 2.713 ± 1.181
0.603ArgHis: 0.603 ± 0.402
2.713ArgIle: 2.713 ± 0.929
2.713ArgLys: 2.713 ± 0.481
3.015ArgLeu: 3.015 ± 1.0
0.603ArgMet: 0.603 ± 0.753
3.316ArgAsn: 3.316 ± 1.919
1.809ArgPro: 1.809 ± 0.737
1.507ArgGln: 1.507 ± 0.717
2.412ArgArg: 2.412 ± 0.781
2.412ArgSer: 2.412 ± 0.829
3.618ArgThr: 3.618 ± 1.74
4.522ArgVal: 4.522 ± 1.086
0.0ArgTrp: 0.0 ± 0.0
2.412ArgTyr: 2.412 ± 1.396
0.0ArgXaa: 0.0 ± 0.0
Ser
4.221SerAla: 4.221 ± 1.501
1.206SerCys: 1.206 ± 0.65
3.919SerAsp: 3.919 ± 1.087
4.824SerGlu: 4.824 ± 2.489
4.221SerPhe: 4.221 ± 1.729
3.618SerGly: 3.618 ± 1.361
1.206SerHis: 1.206 ± 0.519
3.015SerIle: 3.015 ± 2.092
4.824SerLys: 4.824 ± 1.515
4.522SerLeu: 4.522 ± 6.06
2.713SerMet: 2.713 ± 0.652
2.412SerAsn: 2.412 ± 0.714
3.618SerPro: 3.618 ± 1.092
2.11SerGln: 2.11 ± 1.917
2.11SerArg: 2.11 ± 1.21
6.03SerSer: 6.03 ± 3.764
3.316SerThr: 3.316 ± 1.793
6.331SerVal: 6.331 ± 1.348
0.301SerTrp: 0.301 ± 0.174
2.713SerTyr: 2.713 ± 1.266
0.0SerXaa: 0.0 ± 0.0
Thr
2.412ThrAla: 2.412 ± 1.102
1.206ThrCys: 1.206 ± 0.45
3.919ThrAsp: 3.919 ± 1.387
2.11ThrGlu: 2.11 ± 0.685
2.412ThrPhe: 2.412 ± 1.611
3.316ThrGly: 3.316 ± 1.378
1.809ThrHis: 1.809 ± 1.388
6.03ThrIle: 6.03 ± 2.225
2.412ThrLys: 2.412 ± 0.856
5.427ThrLeu: 5.427 ± 1.587
3.618ThrMet: 3.618 ± 0.54
2.713ThrAsn: 2.713 ± 1.181
2.713ThrPro: 2.713 ± 1.144
3.015ThrGln: 3.015 ± 1.086
1.809ThrArg: 1.809 ± 0.873
5.728ThrSer: 5.728 ± 2.758
4.522ThrThr: 4.522 ± 1.485
6.331ThrVal: 6.331 ± 1.655
0.301ThrTrp: 0.301 ± 0.385
1.507ThrTyr: 1.507 ± 0.717
0.0ThrXaa: 0.0 ± 0.0
Val
7.235ValAla: 7.235 ± 2.082
0.904ValCys: 0.904 ± 0.501
3.316ValAsp: 3.316 ± 0.878
7.235ValGlu: 7.235 ± 0.936
4.221ValPhe: 4.221 ± 1.222
3.316ValGly: 3.316 ± 1.083
3.015ValHis: 3.015 ± 0.72
3.618ValIle: 3.618 ± 2.955
4.824ValLys: 4.824 ± 1.473
3.015ValLeu: 3.015 ± 1.309
2.713ValMet: 2.713 ± 1.032
3.618ValAsn: 3.618 ± 1.556
3.618ValPro: 3.618 ± 0.843
2.713ValGln: 2.713 ± 0.635
3.316ValArg: 3.316 ± 1.086
4.522ValSer: 4.522 ± 1.417
3.618ValThr: 3.618 ± 1.388
5.427ValVal: 5.427 ± 1.447
0.0ValTrp: 0.0 ± 0.0
4.522ValTyr: 4.522 ± 1.185
0.0ValXaa: 0.0 ± 0.0
Trp
0.603TrpAla: 0.603 ± 0.349
0.301TrpCys: 0.301 ± 0.447
0.301TrpAsp: 0.301 ± 0.174
0.603TrpGlu: 0.603 ± 0.325
0.301TrpPhe: 0.301 ± 0.385
0.301TrpGly: 0.301 ± 0.174
0.0TrpHis: 0.0 ± 0.0
0.301TrpIle: 0.301 ± 0.174
0.603TrpLys: 0.603 ± 0.349
0.603TrpLeu: 0.603 ± 0.325
0.0TrpMet: 0.0 ± 0.0
0.301TrpAsn: 0.301 ± 0.447
0.0TrpPro: 0.0 ± 0.0
0.301TrpGln: 0.301 ± 0.385
0.301TrpArg: 0.301 ± 0.174
0.301TrpSer: 0.301 ± 0.385
0.301TrpThr: 0.301 ± 0.174
0.603TrpVal: 0.603 ± 0.349
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.507TyrAla: 1.507 ± 1.058
0.301TyrCys: 0.301 ± 0.174
3.618TyrAsp: 3.618 ± 1.063
1.809TyrGlu: 1.809 ± 1.047
3.015TyrPhe: 3.015 ± 2.264
1.809TyrGly: 1.809 ± 0.787
1.507TyrHis: 1.507 ± 0.596
2.713TyrIle: 2.713 ± 2.136
3.316TyrLys: 3.316 ± 1.211
2.713TyrLeu: 2.713 ± 0.929
1.206TyrMet: 1.206 ± 0.65
3.618TyrAsn: 3.618 ± 1.74
1.809TyrPro: 1.809 ± 0.532
1.507TyrGln: 1.507 ± 1.457
2.11TyrArg: 2.11 ± 0.937
4.221TyrSer: 4.221 ± 2.492
5.125TyrThr: 5.125 ± 0.889
2.11TyrVal: 2.11 ± 1.884
0.0TyrTrp: 0.0 ± 0.0
3.015TyrTyr: 3.015 ± 1.431
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3318 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski