Amino acid dipeptide frequency for Hubei chuvirus-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
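
As an illustration of how such per mille values can be obtained, the following is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, and the counting convention (overlapping pairs counted within each protein and pooled over the proteome) are assumptions made for this example, and the exact convention used for the table is not stated here.

```python
from collections import Counter

# Standard one-letter amino acid codes; anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected value if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption of this sketch: pairs are counted with overlapping windows
    inside each sequence and never across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MALLSA", "LLKV"])
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PERMILLE else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented vs. uniform)")
```

Multiplying a value by 10^-3 turns it back into a fraction, so 2.5‰ corresponds to 0.0025.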

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.734 ± 2.2
AlaCys: 1.215 ± 0.36
AlaAsp: 2.43 ± 1.528
AlaGlu: 3.645 ± 1.239
AlaPhe: 2.734 ± 1.084
AlaGly: 2.126 ± 0.343
AlaHis: 0.608 ± 0.331
AlaIle: 2.734 ± 2.16
AlaLys: 1.823 ± 0.97
AlaLeu: 7.898 ± 1.318
AlaMet: 2.734 ± 1.433
AlaAsn: 2.126 ± 0.343
AlaPro: 2.734 ± 1.775
AlaGln: 0.608 ± 0.331
AlaArg: 5.468 ± 2.136
AlaSer: 6.987 ± 0.237
AlaThr: 3.949 ± 1.352
AlaVal: 3.645 ± 1.94
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.949 ± 0.42
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.304 ± 0.165
CysCys: 0.608 ± 0.569
CysAsp: 0.911 ± 0.695
CysGlu: 0.911 ± 0.695
CysPhe: 0.608 ± 0.331
CysGly: 0.911 ± 0.999
CysHis: 0.608 ± 0.331
CysIle: 0.608 ± 0.331
CysLys: 1.519 ± 0.577
CysLeu: 1.215 ± 0.502
CysMet: 0.608 ± 0.308
CysAsn: 1.215 ± 0.662
CysPro: 1.215 ± 0.662
CysGln: 0.608 ± 0.748
CysArg: 0.911 ± 0.592
CysSer: 0.608 ± 0.8
CysThr: 0.911 ± 0.496
CysVal: 1.215 ± 0.662
CysTrp: 0.0 ± 0.0
CysTyr: 2.43 ± 0.861
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.557 ± 1.044
AspCys: 1.519 ± 1.72
AspAsp: 3.341 ± 0.506
AspGlu: 3.341 ± 0.8
AspPhe: 0.911 ± 0.51
AspGly: 2.126 ± 0.859
AspHis: 0.911 ± 0.496
AspIle: 4.253 ± 1.285
AspLys: 2.734 ± 0.488
AspLeu: 6.379 ± 0.715
AspMet: 1.823 ± 0.97
AspAsn: 1.823 ± 0.97
AspPro: 2.734 ± 0.729
AspGln: 1.519 ± 0.479
AspArg: 2.126 ± 0.238
AspSer: 2.43 ± 1.692
AspThr: 4.86 ± 1.006
AspVal: 2.734 ± 0.576
AspTrp: 0.911 ± 0.496
AspTyr: 2.126 ± 0.634
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.038 ± 0.919
GluCys: 0.911 ± 0.592
GluAsp: 3.341 ± 1.276
GluGlu: 6.379 ± 1.855
GluPhe: 1.823 ± 1.02
GluGly: 5.164 ± 1.728
GluHis: 2.43 ± 0.925
GluIle: 2.43 ± 0.925
GluLys: 3.341 ± 1.148
GluLeu: 6.683 ± 2.188
GluMet: 1.215 ± 0.502
GluAsn: 1.215 ± 0.502
GluPro: 2.734 ± 0.075
GluGln: 2.43 ± 0.347
GluArg: 3.949 ± 1.153
GluSer: 6.075 ± 0.779
GluThr: 4.253 ± 1.307
GluVal: 5.164 ± 1.621
GluTrp: 1.519 ± 0.827
GluTyr: 1.823 ± 0.632
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.823 ± 0.97
PheCys: 0.911 ± 0.496
PheAsp: 3.341 ± 0.989
PheGlu: 1.519 ± 0.479
PhePhe: 0.911 ± 0.292
PheGly: 2.126 ± 0.343
PheHis: 0.608 ± 0.331
PheIle: 1.823 ± 0.993
PheLys: 1.215 ± 0.662
PheLeu: 3.341 ± 1.094
PheMet: 0.608 ± 0.331
PheAsn: 1.519 ± 0.479
PhePro: 0.911 ± 0.51
PheGln: 1.823 ± 1.75
PheArg: 3.645 ± 1.263
PheSer: 4.253 ± 0.264
PheThr: 1.823 ± 0.619
PheVal: 1.519 ± 1.09
PheTrp: 0.0 ± 0.0
PheTyr: 1.823 ± 0.619
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.038 ± 1.244
GlyCys: 0.911 ± 0.292
GlyAsp: 2.734 ± 1.433
GlyGlu: 3.038 ± 1.621
GlyPhe: 2.734 ± 0.075
GlyGly: 3.645 ± 2.719
GlyHis: 1.519 ± 0.479
GlyIle: 2.734 ± 1.524
GlyLys: 3.341 ± 1.471
GlyLeu: 6.379 ± 2.062
GlyMet: 1.823 ± 0.925
GlyAsn: 1.215 ± 0.36
GlyPro: 1.823 ± 0.219
GlyGln: 2.126 ± 0.998
GlyArg: 2.43 ± 0.875
GlySer: 4.86 ± 2.856
GlyThr: 2.734 ± 1.183
GlyVal: 4.253 ± 0.477
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.557 ± 0.992
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.519 ± 1.068
HisCys: 0.0 ± 0.0
HisAsp: 0.911 ± 0.51
HisGlu: 2.734 ± 0.831
HisPhe: 2.43 ± 0.925
HisGly: 0.608 ± 0.308
HisHis: 0.608 ± 0.331
HisIle: 2.734 ± 1.084
HisLys: 1.519 ± 0.479
HisLeu: 2.734 ± 0.875
HisMet: 0.608 ± 0.331
HisAsn: 1.519 ± 0.827
HisPro: 1.823 ± 0.619
HisGln: 0.608 ± 0.331
HisArg: 1.215 ± 0.617
HisSer: 3.038 ± 1.154
HisThr: 2.43 ± 1.324
HisVal: 2.734 ± 0.488
HisTrp: 0.0 ± 0.0
HisTyr: 1.519 ± 0.827
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.86 ± 1.006
IleCys: 0.304 ± 0.165
IleAsp: 3.949 ± 1.207
IleGlu: 3.949 ± 1.387
IlePhe: 1.215 ± 0.36
IleGly: 3.645 ± 0.438
IleHis: 2.734 ± 0.875
IleIle: 4.86 ± 1.566
IleLys: 3.949 ± 0.427
IleLeu: 3.949 ± 0.867
IleMet: 1.823 ± 0.583
IleAsn: 3.341 ± 1.276
IlePro: 2.43 ± 1.324
IleGln: 2.126 ± 1.303
IleArg: 2.43 ± 0.347
IleSer: 4.86 ± 1.27
IleThr: 6.379 ± 1.536
IleVal: 4.253 ± 0.971
IleTrp: 0.608 ± 0.8
IleTyr: 2.126 ± 0.859
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.341 ± 0.336
LysCys: 1.215 ± 0.617
LysAsp: 2.734 ± 1.084
LysGlu: 3.949 ± 1.207
LysPhe: 3.038 ± 0.178
LysGly: 2.43 ± 0.875
LysHis: 0.608 ± 0.331
LysIle: 4.557 ± 0.149
LysLys: 1.823 ± 0.619
LysLeu: 2.43 ± 0.72
LysMet: 0.911 ± 0.496
LysAsn: 1.823 ± 0.619
LysPro: 0.911 ± 0.496
LysGln: 0.608 ± 0.748
LysArg: 4.253 ± 0.652
LysSer: 3.645 ± 1.568
LysThr: 3.038 ± 1.162
LysVal: 6.683 ± 0.485
LysTrp: 0.911 ± 0.51
LysTyr: 2.734 ± 0.831
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.075 ± 0.448
LeuCys: 2.126 ± 0.769
LeuAsp: 6.683 ± 2.109
LeuGlu: 5.772 ± 1.652
LeuPhe: 2.126 ± 1.633
LeuGly: 6.379 ± 0.511
LeuHis: 3.038 ± 0.957
LeuIle: 5.772 ± 1.597
LeuLys: 3.645 ± 1.568
LeuLeu: 10.936 ± 2.974
LeuMet: 2.43 ± 0.261
LeuAsn: 1.823 ± 0.219
LeuPro: 3.645 ± 0.96
LeuGln: 2.734 ± 1.036
LeuArg: 7.898 ± 1.332
LeuSer: 8.809 ± 1.859
LeuThr: 4.86 ± 0.808
LeuVal: 4.86 ± 0.173
LeuTrp: 1.215 ± 1.138
LeuTyr: 4.253 ± 0.524
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.734 ± 1.775
MetCys: 0.0 ± 0.0
MetAsp: 1.519 ± 1.489
MetGlu: 2.126 ± 0.769
MetPhe: 0.304 ± 0.664
MetGly: 1.519 ± 0.479
MetHis: 0.304 ± 0.4
MetIle: 1.823 ± 0.219
MetLys: 2.126 ± 0.769
MetLeu: 2.734 ± 0.488
MetMet: 0.911 ± 0.695
MetAsn: 0.911 ± 0.695
MetPro: 0.911 ± 0.51
MetGln: 0.0 ± 0.0
MetArg: 2.43 ± 0.347
MetSer: 0.911 ± 0.496
MetThr: 1.215 ± 0.662
MetVal: 2.43 ± 1.669
MetTrp: 0.608 ± 0.569
MetTyr: 1.215 ± 0.441
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.038 ± 0.898
AsnCys: 0.608 ± 0.331
AsnAsp: 0.911 ± 0.292
AsnGlu: 1.215 ± 0.441
AsnPhe: 1.215 ± 0.502
AsnGly: 0.911 ± 0.51
AsnHis: 2.734 ± 1.489
AsnIle: 2.43 ± 0.185
AsnLys: 1.215 ± 0.662
AsnLeu: 3.341 ± 1.496
AsnMet: 1.519 ± 0.306
AsnAsn: 1.215 ± 1.22
AsnPro: 1.823 ± 1.75
AsnGln: 1.215 ± 0.36
AsnArg: 2.126 ± 0.238
AsnSer: 2.734 ± 1.103
AsnThr: 1.519 ± 0.306
AsnVal: 2.43 ± 0.712
AsnTrp: 0.304 ± 0.165
AsnTyr: 1.215 ± 0.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.734 ± 1.036
ProCys: 0.608 ± 0.331
ProAsp: 1.823 ± 1.183
ProGlu: 2.734 ± 0.741
ProPhe: 2.43 ± 0.347
ProGly: 2.126 ± 1.03
ProHis: 1.215 ± 0.36
ProIle: 2.126 ± 1.03
ProLys: 2.734 ± 0.875
ProLeu: 5.468 ± 0.977
ProMet: 0.304 ± 0.23
ProAsn: 1.823 ± 0.97
ProPro: 2.126 ± 0.769
ProGln: 1.519 ± 0.306
ProArg: 1.215 ± 0.662
ProSer: 3.949 ± 1.387
ProThr: 2.734 ± 0.831
ProVal: 1.823 ± 0.583
ProTrp: 1.519 ± 1.068
ProTyr: 2.734 ± 0.831
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.734 ± 2.192
GlnCys: 0.608 ± 0.8
GlnAsp: 0.911 ± 0.292
GlnGlu: 1.823 ± 0.993
GlnPhe: 0.608 ± 0.331
GlnGly: 2.126 ± 1.158
GlnHis: 1.823 ± 0.632
GlnIle: 1.215 ± 0.36
GlnLys: 2.43 ± 0.712
GlnLeu: 2.734 ± 1.775
GlnMet: 1.519 ± 0.59
GlnAsn: 0.911 ± 0.592
GlnPro: 1.823 ± 0.632
GlnGln: 1.215 ± 0.441
GlnArg: 1.823 ± 0.219
GlnSer: 0.608 ± 0.331
GlnThr: 0.911 ± 0.292
GlnVal: 1.215 ± 0.502
GlnTrp: 0.911 ± 0.496
GlnTyr: 0.911 ± 0.496
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.645 ± 2.041
ArgCys: 1.215 ± 0.617
ArgAsp: 2.43 ± 0.347
ArgGlu: 3.341 ± 1.405
ArgPhe: 2.43 ± 1.324
ArgGly: 4.253 ± 2.061
ArgHis: 2.126 ± 0.343
ArgIle: 5.468 ± 0.657
ArgLys: 3.645 ± 1.08
ArgLeu: 5.164 ± 1.223
ArgMet: 1.215 ± 0.441
ArgAsn: 1.823 ± 0.632
ArgPro: 2.43 ± 0.861
ArgGln: 1.519 ± 0.577
ArgArg: 3.341 ± 1.286
ArgSer: 6.075 ± 1.191
ArgThr: 1.215 ± 0.834
ArgVal: 4.86 ± 0.701
ArgTrp: 0.304 ± 0.664
ArgTyr: 4.86 ± 0.797
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.86 ± 1.372
SerCys: 1.823 ± 0.583
SerAsp: 5.772 ± 1.354
SerGlu: 5.468 ± 0.882
SerPhe: 4.557 ± 0.997
SerGly: 3.949 ± 1.352
SerHis: 3.038 ± 0.957
SerIle: 6.379 ± 0.439
SerLys: 5.468 ± 2.55
SerLeu: 4.253 ± 0.524
SerMet: 1.823 ± 0.506
SerAsn: 2.43 ± 0.875
SerPro: 3.341 ± 1.489
SerGln: 2.734 ± 1.084
SerArg: 5.164 ± 1.813
SerSer: 7.594 ± 2.265
SerThr: 3.645 ± 1.012
SerVal: 5.772 ± 1.26
SerTrp: 0.911 ± 0.292
SerTyr: 2.734 ± 0.875
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.645 ± 1.509
ThrCys: 0.608 ± 0.569
ThrAsp: 2.734 ± 0.075
ThrGlu: 5.772 ± 1.35
ThrPhe: 0.911 ± 0.496
ThrGly: 2.126 ± 0.878
ThrHis: 1.519 ± 0.479
ThrIle: 5.468 ± 1.257
ThrLys: 2.126 ± 1.158
ThrLeu: 5.468 ± 1.678
ThrMet: 1.215 ± 0.502
ThrAsn: 1.215 ± 0.441
ThrPro: 5.164 ± 0.229
ThrGln: 0.911 ± 0.496
ThrArg: 4.253 ± 1.065
ThrSer: 3.038 ± 0.178
ThrThr: 7.29 ± 2.066
ThrVal: 3.038 ± 0.459
ThrTrp: 1.823 ± 0.583
ThrTyr: 3.341 ± 1.175
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.43 ± 1.528
ValCys: 1.823 ± 0.993
ValAsp: 2.43 ± 0.783
ValGlu: 4.253 ± 1.065
ValPhe: 2.734 ± 0.075
ValGly: 5.164 ± 1.708
ValHis: 3.645 ± 0.365
ValIle: 3.341 ± 0.989
ValLys: 3.038 ± 0.178
ValLeu: 7.594 ± 2.229
ValMet: 1.215 ± 0.662
ValAsn: 3.038 ± 0.898
ValPro: 3.038 ± 0.957
ValGln: 3.038 ± 0.71
ValArg: 2.734 ± 0.729
ValSer: 5.772 ± 2.063
ValThr: 3.645 ± 3.11
ValVal: 3.645 ± 1.509
ValTrp: 0.911 ± 0.592
ValTyr: 3.949 ± 0.427
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.911 ± 0.592
TrpCys: 0.0 ± 0.0
TrpAsp: 1.519 ± 0.306
TrpGlu: 0.304 ± 0.165
TrpPhe: 0.608 ± 0.331
TrpGly: 0.911 ± 0.51
TrpHis: 0.0 ± 0.0
TrpIle: 0.304 ± 0.165
TrpLys: 0.608 ± 0.331
TrpLeu: 1.215 ± 1.138
TrpMet: 0.304 ± 0.4
TrpAsn: 1.215 ± 0.441
TrpPro: 0.304 ± 0.165
TrpGln: 0.608 ± 0.331
TrpArg: 0.608 ± 0.331
TrpSer: 0.911 ± 0.496
TrpThr: 0.608 ± 0.308
TrpVal: 0.608 ± 0.331
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.215 ± 1.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.126 ± 1.158
TyrCys: 0.911 ± 0.292
TyrAsp: 2.734 ± 0.075
TyrGlu: 3.645 ± 0.76
TyrPhe: 1.215 ± 0.617
TyrGly: 3.645 ± 1.323
TyrHis: 1.215 ± 0.502
TyrIle: 2.734 ± 1.015
TyrLys: 3.341 ± 0.336
TyrLeu: 5.164 ± 2.386
TyrMet: 1.823 ± 0.925
TyrAsn: 1.519 ± 0.67
TyrPro: 2.126 ± 1.03
TyrGln: 0.911 ± 0.592
TyrArg: 3.341 ± 0.8
TyrSer: 4.557 ± 1.436
TyrThr: 3.645 ± 1.453
TyrVal: 4.557 ± 0.657
TyrTrp: 0.304 ± 0.165
TyrTyr: 2.126 ± 0.745
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3293 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski