Amino acid dipepetide frequency for Hubei virga-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.074AlaAla: 1.074 ± 0.528
0.716AlaCys: 0.716 ± 0.644
2.147AlaAsp: 2.147 ± 1.056
1.432AlaGlu: 1.432 ± 1.288
0.0AlaPhe: 0.0 ± 0.0
1.432AlaGly: 1.432 ± 1.524
1.074AlaHis: 1.074 ± 0.531
3.221AlaIle: 3.221 ± 0.891
2.147AlaLys: 2.147 ± 1.056
2.147AlaLeu: 2.147 ± 1.062
0.358AlaMet: 0.358 ± 0.176
4.295AlaAsn: 4.295 ± 0.983
1.432AlaPro: 1.432 ± 0.704
0.716AlaGln: 0.716 ± 1.697
1.074AlaArg: 1.074 ± 0.528
2.505AlaSer: 2.505 ± 0.608
2.147AlaThr: 2.147 ± 0.505
2.505AlaVal: 2.505 ± 0.977
0.0AlaTrp: 0.0 ± 0.0
3.579AlaTyr: 3.579 ± 1.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.716CysAla: 0.716 ± 0.352
0.0CysCys: 0.0 ± 0.0
0.716CysAsp: 0.716 ± 0.352
1.074CysGlu: 1.074 ± 0.528
0.358CysPhe: 0.358 ± 0.781
0.358CysGly: 0.358 ± 0.176
0.716CysHis: 0.716 ± 0.352
2.147CysIle: 2.147 ± 1.062
2.147CysLys: 2.147 ± 1.056
1.074CysLeu: 1.074 ± 0.531
0.0CysMet: 0.0 ± 0.0
3.221CysAsn: 3.221 ± 1.584
0.358CysPro: 0.358 ± 0.781
1.074CysGln: 1.074 ± 0.528
0.358CysArg: 0.358 ± 0.176
2.505CysSer: 2.505 ± 1.232
2.863CysThr: 2.863 ± 1.408
2.505CysVal: 2.505 ± 0.608
0.0CysTrp: 0.0 ± 0.0
1.432CysTyr: 1.432 ± 0.459
0.0CysXaa: 0.0 ± 0.0
Asp
2.147AspAla: 2.147 ± 1.44
1.074AspCys: 1.074 ± 0.528
2.147AspAsp: 2.147 ± 1.056
4.295AspGlu: 4.295 ± 2.111
3.221AspPhe: 3.221 ± 0.886
2.505AspGly: 2.505 ± 0.608
0.358AspHis: 0.358 ± 0.176
6.8AspIle: 6.8 ± 2.226
4.653AspLys: 4.653 ± 0.96
6.084AspLeu: 6.084 ± 1.801
2.863AspMet: 2.863 ± 0.926
2.505AspAsn: 2.505 ± 0.608
2.147AspPro: 2.147 ± 1.056
0.0AspGln: 0.0 ± 0.0
1.074AspArg: 1.074 ± 0.531
4.295AspSer: 4.295 ± 1.602
3.937AspThr: 3.937 ± 0.939
4.653AspVal: 4.653 ± 4.504
0.0AspTrp: 0.0 ± 0.0
2.863AspTyr: 2.863 ± 1.198
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.074GluCys: 1.074 ± 0.528
0.716GluAsp: 0.716 ± 0.352
3.221GluGlu: 3.221 ± 1.584
4.295GluPhe: 4.295 ± 1.377
1.074GluGly: 1.074 ± 1.603
2.863GluHis: 2.863 ± 1.198
5.369GluIle: 5.369 ± 1.342
3.221GluLys: 3.221 ± 1.584
3.937GluLeu: 3.937 ± 0.939
1.432GluMet: 1.432 ± 0.704
3.579GluAsn: 3.579 ± 1.042
1.074GluPro: 1.074 ± 0.528
0.716GluGln: 0.716 ± 0.352
1.432GluArg: 1.432 ± 0.704
3.579GluSer: 3.579 ± 1.76
2.147GluThr: 2.147 ± 1.056
3.221GluVal: 3.221 ± 0.891
0.0GluTrp: 0.0 ± 0.0
3.579GluTyr: 3.579 ± 0.898
0.0GluXaa: 0.0 ± 0.0
Phe
1.432PheAla: 1.432 ± 1.72
2.147PheCys: 2.147 ± 1.062
3.221PheAsp: 3.221 ± 1.592
2.863PheGlu: 2.863 ± 0.918
2.505PhePhe: 2.505 ± 0.977
2.505PheGly: 2.505 ± 0.608
0.716PheHis: 0.716 ± 0.644
3.579PheIle: 3.579 ± 2.334
2.863PheLys: 2.863 ± 1.396
3.579PheLeu: 3.579 ± 2.334
0.716PheMet: 0.716 ± 0.352
4.653PheAsn: 4.653 ± 2.619
1.432PhePro: 1.432 ± 0.704
2.147PheGln: 2.147 ± 0.505
2.147PheArg: 2.147 ± 1.062
5.727PheSer: 5.727 ± 1.837
2.505PheThr: 2.505 ± 0.977
5.727PheVal: 5.727 ± 3.955
0.716PheTrp: 0.716 ± 0.352
4.295PheTyr: 4.295 ± 2.879
0.0PheXaa: 0.0 ± 0.0
Gly
1.79GlyAla: 1.79 ± 1.576
0.358GlyCys: 0.358 ± 0.176
3.221GlyAsp: 3.221 ± 0.886
0.358GlyGlu: 0.358 ± 0.781
1.074GlyPhe: 1.074 ± 1.42
0.358GlyGly: 0.358 ± 0.176
0.716GlyHis: 0.716 ± 0.352
1.79GlyIle: 1.79 ± 1.462
3.937GlyLys: 3.937 ± 1.936
2.863GlyLeu: 2.863 ± 3.114
0.716GlyMet: 0.716 ± 0.644
2.863GlyAsn: 2.863 ± 1.198
0.0GlyPro: 0.0 ± 0.0
0.0GlyGln: 0.0 ± 0.0
0.716GlyArg: 0.716 ± 0.352
2.505GlySer: 2.505 ± 0.608
1.074GlyThr: 1.074 ± 0.531
1.432GlyVal: 1.432 ± 0.704
0.0GlyTrp: 0.0 ± 0.0
1.79GlyTyr: 1.79 ± 1.167
0.0GlyXaa: 0.0 ± 0.0
His
1.074HisAla: 1.074 ± 1.42
1.79HisCys: 1.79 ± 0.449
1.074HisAsp: 1.074 ± 0.531
2.147HisGlu: 2.147 ± 1.062
1.79HisPhe: 1.79 ± 0.88
1.074HisGly: 1.074 ± 0.531
1.79HisHis: 1.79 ± 0.449
2.147HisIle: 2.147 ± 1.062
2.863HisLys: 2.863 ± 1.396
2.147HisLeu: 2.147 ± 1.932
1.432HisMet: 1.432 ± 0.704
1.432HisAsn: 1.432 ± 0.704
0.358HisPro: 0.358 ± 0.176
0.716HisGln: 0.716 ± 0.644
1.79HisArg: 1.79 ± 0.449
3.579HisSer: 3.579 ± 0.898
2.505HisThr: 2.505 ± 0.608
1.79HisVal: 1.79 ± 3.298
0.716HisTrp: 0.716 ± 0.644
2.147HisTyr: 2.147 ± 0.505
0.0HisXaa: 0.0 ± 0.0
Ile
5.011IleAla: 5.011 ± 1.216
2.863IleCys: 2.863 ± 1.408
5.727IleAsp: 5.727 ± 0.96
2.147IleGlu: 2.147 ± 1.056
3.579IlePhe: 3.579 ± 3.219
1.432IleGly: 1.432 ± 0.704
2.505IleHis: 2.505 ± 1.396
4.653IleIle: 4.653 ± 2.619
6.8IleLys: 6.8 ± 2.565
8.232IleLeu: 8.232 ± 3.502
2.147IleMet: 2.147 ± 1.418
6.442IleAsn: 6.442 ± 1.514
4.653IlePro: 4.653 ± 1.536
3.579IleGln: 3.579 ± 1.504
2.863IleArg: 2.863 ± 0.739
3.937IleSer: 3.937 ± 0.939
5.011IleThr: 5.011 ± 1.002
5.727IleVal: 5.727 ± 0.523
0.0IleTrp: 0.0 ± 0.0
5.011IleTyr: 5.011 ± 3.617
0.0IleXaa: 0.0 ± 0.0
Lys
0.716LysAla: 0.716 ± 0.644
2.863LysCys: 2.863 ± 1.408
2.863LysAsp: 2.863 ± 0.739
3.579LysGlu: 3.579 ± 1.76
6.8LysPhe: 6.8 ± 2.344
1.79LysGly: 1.79 ± 0.88
3.221LysHis: 3.221 ± 1.462
6.442LysIle: 6.442 ± 3.167
2.505LysLys: 2.505 ± 1.396
8.232LysLeu: 8.232 ± 2.414
1.074LysMet: 1.074 ± 0.528
5.727LysAsn: 5.727 ± 1.165
3.221LysPro: 3.221 ± 1.099
1.79LysGln: 1.79 ± 1.462
2.863LysArg: 2.863 ± 1.408
2.863LysSer: 2.863 ± 1.396
3.221LysThr: 3.221 ± 0.886
4.653LysVal: 4.653 ± 2.287
0.0LysTrp: 0.0 ± 0.0
4.653LysTyr: 4.653 ± 0.837
0.0LysXaa: 0.0 ± 0.0
Leu
1.074LeuAla: 1.074 ± 0.528
1.432LeuCys: 1.432 ± 0.704
6.8LeuAsp: 6.8 ± 0.635
4.295LeuGlu: 4.295 ± 1.368
6.8LeuPhe: 6.8 ± 6.432
2.505LeuGly: 2.505 ± 0.608
2.505LeuHis: 2.505 ± 1.809
6.442LeuIle: 6.442 ± 1.475
5.369LeuLys: 5.369 ± 0.595
7.874LeuLeu: 7.874 ± 3.678
1.074LeuMet: 1.074 ± 0.531
6.442LeuAsn: 6.442 ± 2.312
3.221LeuPro: 3.221 ± 0.891
3.221LeuGln: 3.221 ± 1.418
4.295LeuArg: 4.295 ± 0.983
10.021LeuSer: 10.021 ± 3.454
8.232LeuThr: 8.232 ± 3.455
6.442LeuVal: 6.442 ± 1.782
0.716LeuTrp: 0.716 ± 1.697
5.727LeuTyr: 5.727 ± 0.96
0.0LeuXaa: 0.0 ± 0.0
Met
2.147MetAla: 2.147 ± 1.056
0.716MetCys: 0.716 ± 0.352
2.147MetAsp: 2.147 ± 1.44
0.716MetGlu: 0.716 ± 0.352
1.074MetPhe: 1.074 ± 0.528
0.358MetGly: 0.358 ± 0.781
0.716MetHis: 0.716 ± 0.352
1.79MetIle: 1.79 ± 0.88
0.0MetLys: 0.0 ± 0.0
2.505MetLeu: 2.505 ± 1.797
0.716MetMet: 0.716 ± 0.352
1.074MetAsn: 1.074 ± 0.528
1.074MetPro: 1.074 ± 1.603
1.432MetGln: 1.432 ± 0.704
0.358MetArg: 0.358 ± 0.176
2.505MetSer: 2.505 ± 0.608
0.358MetThr: 0.358 ± 0.176
1.79MetVal: 1.79 ± 0.88
0.0MetTrp: 0.0 ± 0.0
1.79MetTyr: 1.79 ± 0.88
0.0MetXaa: 0.0 ± 0.0
Asn
2.505AsnAla: 2.505 ± 0.608
1.79AsnCys: 1.79 ± 0.449
3.579AsnAsp: 3.579 ± 1.021
3.579AsnGlu: 3.579 ± 1.021
5.369AsnPhe: 5.369 ± 1.99
3.221AsnGly: 3.221 ± 1.099
2.147AsnHis: 2.147 ± 1.932
7.158AsnIle: 7.158 ± 3.519
3.937AsnLys: 3.937 ± 1.936
7.516AsnLeu: 7.516 ± 2.912
1.074AsnMet: 1.074 ± 0.528
4.653AsnAsn: 4.653 ± 2.287
1.432AsnPro: 1.432 ± 1.524
2.505AsnGln: 2.505 ± 1.232
0.716AsnArg: 0.716 ± 0.352
8.948AsnSer: 8.948 ± 0.628
4.653AsnThr: 4.653 ± 2.287
5.011AsnVal: 5.011 ± 0.705
0.358AsnTrp: 0.358 ± 0.176
4.653AsnTyr: 4.653 ± 2.287
0.0AsnXaa: 0.0 ± 0.0
Pro
1.432ProAla: 1.432 ± 0.704
0.0ProCys: 0.0 ± 0.0
1.074ProAsp: 1.074 ± 0.528
2.505ProGlu: 2.505 ± 1.232
1.432ProPhe: 1.432 ± 0.704
1.432ProGly: 1.432 ± 0.459
1.432ProHis: 1.432 ± 0.459
3.221ProIle: 3.221 ± 0.886
2.505ProLys: 2.505 ± 1.396
2.147ProLeu: 2.147 ± 1.056
0.358ProMet: 0.358 ± 0.176
2.863ProAsn: 2.863 ± 0.739
0.716ProPro: 0.716 ± 0.352
1.074ProGln: 1.074 ± 0.528
0.0ProArg: 0.0 ± 0.0
2.863ProSer: 2.863 ± 0.739
2.147ProThr: 2.147 ± 1.056
2.505ProVal: 2.505 ± 1.809
0.716ProTrp: 0.716 ± 2.023
2.863ProTyr: 2.863 ± 3.114
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.358GlnCys: 0.358 ± 0.176
1.432GlnAsp: 1.432 ± 0.459
1.79GlnGlu: 1.79 ± 0.88
1.79GlnPhe: 1.79 ± 0.449
0.716GlnGly: 0.716 ± 0.352
1.79GlnHis: 1.79 ± 0.449
3.579GlnIle: 3.579 ± 2.015
3.221GlnLys: 3.221 ± 1.418
2.147GlnLeu: 2.147 ± 1.056
0.358GlnMet: 0.358 ± 0.176
1.79GlnAsn: 1.79 ± 0.88
1.074GlnPro: 1.074 ± 0.531
1.074GlnGln: 1.074 ± 0.528
2.147GlnArg: 2.147 ± 0.505
1.79GlnSer: 1.79 ± 0.449
1.79GlnThr: 1.79 ± 1.462
1.074GlnVal: 1.074 ± 0.528
0.716GlnTrp: 0.716 ± 1.697
1.074GlnTyr: 1.074 ± 0.531
0.0GlnXaa: 0.0 ± 0.0
Arg
1.79ArgAla: 1.79 ± 0.449
1.074ArgCys: 1.074 ± 0.528
0.716ArgAsp: 0.716 ± 0.352
1.074ArgGlu: 1.074 ± 0.528
1.432ArgPhe: 1.432 ± 1.72
0.0ArgGly: 0.0 ± 0.0
1.432ArgHis: 1.432 ± 0.704
2.863ArgIle: 2.863 ± 1.396
2.863ArgLys: 2.863 ± 1.408
4.653ArgLeu: 4.653 ± 2.033
1.074ArgMet: 1.074 ± 0.528
2.863ArgAsn: 2.863 ± 1.408
0.716ArgPro: 0.716 ± 1.697
1.074ArgGln: 1.074 ± 0.531
0.358ArgArg: 0.358 ± 0.176
1.432ArgSer: 1.432 ± 0.704
1.432ArgThr: 1.432 ± 0.704
2.505ArgVal: 2.505 ± 1.312
0.0ArgTrp: 0.0 ± 0.0
2.147ArgTyr: 2.147 ± 1.932
0.0ArgXaa: 0.0 ± 0.0
Ser
4.295SerAla: 4.295 ± 1.377
1.432SerCys: 1.432 ± 0.459
5.369SerAsp: 5.369 ± 1.072
4.295SerGlu: 4.295 ± 1.009
2.147SerPhe: 2.147 ± 1.932
1.79SerGly: 1.79 ± 0.88
2.505SerHis: 2.505 ± 0.608
7.874SerIle: 7.874 ± 0.114
5.011SerLys: 5.011 ± 2.463
10.737SerLeu: 10.737 ± 2.465
1.432SerMet: 1.432 ± 1.115
6.442SerAsn: 6.442 ± 1.514
3.221SerPro: 3.221 ± 0.886
2.147SerGln: 2.147 ± 0.505
1.432SerArg: 1.432 ± 1.524
6.084SerSer: 6.084 ± 1.646
4.295SerThr: 4.295 ± 2.509
5.369SerVal: 5.369 ± 1.072
0.358SerTrp: 0.358 ± 0.176
5.369SerTyr: 5.369 ± 2.639
0.0SerXaa: 0.0 ± 0.0
Thr
1.79ThrAla: 1.79 ± 0.88
1.79ThrCys: 1.79 ± 0.88
4.653ThrAsp: 4.653 ± 3.161
1.432ThrGlu: 1.432 ± 0.704
3.937ThrPhe: 3.937 ± 1.431
1.074ThrGly: 1.074 ± 0.531
1.79ThrHis: 1.79 ± 1.167
4.295ThrIle: 4.295 ± 1.377
3.579ThrLys: 3.579 ± 1.021
6.442ThrLeu: 6.442 ± 4.069
1.432ThrMet: 1.432 ± 0.704
5.727ThrAsn: 5.727 ± 1.383
2.863ThrPro: 2.863 ± 0.739
1.074ThrGln: 1.074 ± 0.531
1.79ThrArg: 1.79 ± 1.576
2.863ThrSer: 2.863 ± 0.739
5.369ThrThr: 5.369 ± 1.342
3.579ThrVal: 3.579 ± 1.46
0.0ThrTrp: 0.0 ± 0.0
5.011ThrTyr: 5.011 ± 1.705
0.0ThrXaa: 0.0 ± 0.0
Val
2.147ValAla: 2.147 ± 1.062
1.432ValCys: 1.432 ± 0.704
6.8ValAsp: 6.8 ± 1.533
3.579ValGlu: 3.579 ± 1.298
2.863ValPhe: 2.863 ± 0.739
2.505ValGly: 2.505 ± 1.809
3.221ValHis: 3.221 ± 0.886
3.937ValIle: 3.937 ± 0.939
7.874ValLys: 7.874 ± 4.132
5.011ValLeu: 5.011 ± 1.216
2.147ValMet: 2.147 ± 1.418
4.653ValAsn: 4.653 ± 2.287
2.147ValPro: 2.147 ± 1.056
3.221ValGln: 3.221 ± 1.462
2.147ValArg: 2.147 ± 0.505
4.653ValSer: 4.653 ± 0.96
3.937ValThr: 3.937 ± 4.646
4.295ValVal: 4.295 ± 1.368
0.358ValTrp: 0.358 ± 0.176
2.505ValTyr: 2.505 ± 1.797
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.074TrpPhe: 1.074 ± 0.528
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.358TrpIle: 0.358 ± 0.176
0.716TrpLys: 0.716 ± 0.644
1.432TrpLeu: 1.432 ± 1.72
0.358TrpMet: 0.358 ± 0.176
0.358TrpAsn: 0.358 ± 1.804
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.358TrpSer: 0.358 ± 1.804
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.358TrpTyr: 0.358 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.863TyrAla: 2.863 ± 1.408
0.716TyrCys: 0.716 ± 0.644
3.221TyrAsp: 3.221 ± 1.584
2.147TyrGlu: 2.147 ± 1.056
3.937TyrPhe: 3.937 ± 1.138
1.432TyrGly: 1.432 ± 3.617
2.863TyrHis: 2.863 ± 0.918
4.653TyrIle: 4.653 ± 1.487
2.863TyrLys: 2.863 ± 0.739
5.727TyrLeu: 5.727 ± 3.257
2.147TyrMet: 2.147 ± 0.505
3.221TyrAsn: 3.221 ± 0.886
2.147TyrPro: 2.147 ± 1.062
2.147TyrGln: 2.147 ± 1.056
3.937TyrArg: 3.937 ± 0.97
8.948TyrSer: 8.948 ± 3.383
2.863TyrThr: 2.863 ± 1.628
4.653TyrVal: 4.653 ± 1.536
0.358TyrTrp: 0.358 ± 0.176
5.011TyrTyr: 5.011 ± 2.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2795 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski