Amino acid dipepetide frequency for Hepatitis C virus genotype 1a (isolate H77) (HCV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.034AlaAla: 11.034 ± 2.083
2.207AlaCys: 2.207 ± 0.949
2.837AlaAsp: 2.837 ± 1.221
4.414AlaGlu: 4.414 ± 1.899
1.892AlaPhe: 1.892 ± 0.814
9.458AlaGly: 9.458 ± 6.587
2.837AlaHis: 2.837 ± 1.443
4.414AlaIle: 4.414 ± 0.765
3.468AlaLys: 3.468 ± 1.492
10.088AlaLeu: 10.088 ± 3.652
0.946AlaMet: 0.946 ± 3.661
1.576AlaAsn: 1.576 ± 0.678
6.305AlaPro: 6.305 ± 2.615
2.207AlaGln: 2.207 ± 0.949
6.936AlaArg: 6.936 ± 0.32
5.99AlaSer: 5.99 ± 2.751
4.729AlaThr: 4.729 ± 2.035
6.62AlaVal: 6.62 ± 0.184
2.207AlaTrp: 2.207 ± 0.949
3.468AlaTyr: 3.468 ± 1.492
0.0AlaXaa: 0.0 ± 0.0
Cys
1.576CysAla: 1.576 ± 1.986
0.946CysCys: 0.946 ± 2.257
1.261CysAsp: 1.261 ± 0.543
1.892CysGlu: 1.892 ± 0.814
1.261CysPhe: 1.261 ± 0.543
4.729CysGly: 4.729 ± 2.035
1.261CysHis: 1.261 ± 0.543
0.631CysIle: 0.631 ± 0.271
0.631CysLys: 0.631 ± 0.271
2.207CysLeu: 2.207 ± 1.715
0.946CysMet: 0.946 ± 0.407
1.261CysAsn: 1.261 ± 0.543
2.207CysPro: 2.207 ± 0.949
1.261CysGln: 1.261 ± 0.543
1.261CysArg: 1.261 ± 2.121
3.468CysSer: 3.468 ± 1.492
2.522CysThr: 2.522 ± 1.085
2.522CysVal: 2.522 ± 1.085
0.946CysTrp: 0.946 ± 0.407
0.631CysTyr: 0.631 ± 0.271
0.0CysXaa: 0.0 ± 0.0
Asp
4.098AspAla: 4.098 ± 1.763
1.576AspCys: 1.576 ± 0.678
0.315AspAsp: 0.315 ± 0.136
1.576AspGlu: 1.576 ± 0.678
1.892AspPhe: 1.892 ± 0.814
2.522AspGly: 2.522 ± 1.579
0.315AspHis: 0.315 ± 0.136
2.207AspIle: 2.207 ± 0.949
0.315AspLys: 0.315 ± 0.136
5.359AspLeu: 5.359 ± 2.306
0.315AspMet: 0.315 ± 0.136
0.946AspAsn: 0.946 ± 0.407
2.837AspPro: 2.837 ± 1.221
1.261AspGln: 1.261 ± 0.543
1.576AspArg: 1.576 ± 0.678
3.153AspSer: 3.153 ± 1.356
2.207AspThr: 2.207 ± 0.949
4.098AspVal: 4.098 ± 0.901
0.946AspTrp: 0.946 ± 2.257
0.946AspTyr: 0.946 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
3.468GluAla: 3.468 ± 1.172
1.261GluCys: 1.261 ± 0.543
2.207GluAsp: 2.207 ± 0.949
2.522GluGlu: 2.522 ± 1.085
1.261GluPhe: 1.261 ± 2.121
2.207GluGly: 2.207 ± 0.949
0.315GluHis: 0.315 ± 0.136
2.207GluIle: 2.207 ± 0.949
0.631GluLys: 0.631 ± 0.271
2.207GluLeu: 2.207 ± 0.949
0.631GluMet: 0.631 ± 0.271
1.576GluAsn: 1.576 ± 0.678
2.207GluPro: 2.207 ± 0.949
1.261GluGln: 1.261 ± 0.543
2.207GluArg: 2.207 ± 0.949
2.207GluSer: 2.207 ± 0.949
2.522GluThr: 2.522 ± 1.085
5.044GluVal: 5.044 ± 0.494
0.0GluTrp: 0.0 ± 0.0
1.261GluTyr: 1.261 ± 0.543
0.0GluXaa: 0.0 ± 0.0
Phe
3.153PheAla: 3.153 ± 1.356
1.261PheCys: 1.261 ± 0.543
2.207PheAsp: 2.207 ± 0.949
0.0PheGlu: 0.0 ± 0.0
1.261PhePhe: 1.261 ± 0.543
1.576PheGly: 1.576 ± 0.678
0.315PheHis: 0.315 ± 0.136
0.946PheIle: 0.946 ± 0.407
1.261PheLys: 1.261 ± 0.543
2.837PheLeu: 2.837 ± 1.221
0.0PheMet: 0.0 ± 0.0
0.946PheAsn: 0.946 ± 0.407
0.946PhePro: 0.946 ± 0.407
0.631PheGln: 0.631 ± 0.271
0.946PheArg: 0.946 ± 0.407
2.207PheSer: 2.207 ± 0.949
4.098PheThr: 4.098 ± 0.901
1.576PheVal: 1.576 ± 0.678
1.261PheTrp: 1.261 ± 2.121
0.631PheTyr: 0.631 ± 0.271
0.0PheXaa: 0.0 ± 0.0
Gly
9.458GlyAla: 9.458 ± 1.259
3.468GlyCys: 3.468 ± 1.492
3.783GlyAsp: 3.783 ± 1.628
3.153GlyGlu: 3.153 ± 1.356
2.207GlyPhe: 2.207 ± 1.715
6.62GlyGly: 6.62 ± 0.184
1.576GlyHis: 1.576 ± 0.678
3.153GlyIle: 3.153 ± 1.356
3.468GlyLys: 3.468 ± 1.492
8.197GlyLeu: 8.197 ± 1.801
1.576GlyMet: 1.576 ± 0.678
3.783GlyAsn: 3.783 ± 1.628
5.675GlyPro: 5.675 ± 8.215
1.576GlyGln: 1.576 ± 0.678
5.359GlyArg: 5.359 ± 8.35
6.62GlySer: 6.62 ± 0.184
2.837GlyThr: 2.837 ± 4.107
7.881GlyVal: 7.881 ± 0.727
2.522GlyTrp: 2.522 ± 1.085
2.522GlyTyr: 2.522 ± 1.085
0.0GlyXaa: 0.0 ± 0.0
His
1.892HisAla: 1.892 ± 0.814
0.315HisCys: 0.315 ± 0.136
0.631HisAsp: 0.631 ± 0.271
0.315HisGlu: 0.315 ± 0.136
0.631HisPhe: 0.631 ± 0.271
2.207HisGly: 2.207 ± 0.949
0.631HisHis: 0.631 ± 0.271
1.892HisIle: 1.892 ± 0.814
0.315HisLys: 0.315 ± 0.136
1.892HisLeu: 1.892 ± 0.814
0.315HisMet: 0.315 ± 0.136
0.631HisAsn: 0.631 ± 0.271
0.946HisPro: 0.946 ± 0.407
0.631HisGln: 0.631 ± 0.271
1.576HisArg: 1.576 ± 1.986
1.576HisSer: 1.576 ± 0.678
0.946HisThr: 0.946 ± 0.407
1.892HisVal: 1.892 ± 1.85
0.631HisTrp: 0.631 ± 0.271
1.892HisTyr: 1.892 ± 0.814
0.0HisXaa: 0.0 ± 0.0
Ile
2.207IleAla: 2.207 ± 0.949
1.576IleCys: 1.576 ± 0.678
1.576IleAsp: 1.576 ± 0.678
1.576IleGlu: 1.576 ± 0.678
1.261IlePhe: 1.261 ± 0.543
0.631IleGly: 0.631 ± 0.271
0.315IleHis: 0.315 ± 0.136
2.522IleIle: 2.522 ± 1.085
1.892IleLys: 1.892 ± 0.814
4.098IleLeu: 4.098 ± 1.763
1.892IleMet: 1.892 ± 0.814
3.153IleAsn: 3.153 ± 1.356
3.783IlePro: 3.783 ± 1.036
1.261IleGln: 1.261 ± 0.543
1.261IleArg: 1.261 ± 0.543
2.207IleSer: 2.207 ± 0.949
5.044IleThr: 5.044 ± 2.17
2.522IleVal: 2.522 ± 1.085
0.631IleTrp: 0.631 ± 2.393
1.892IleTyr: 1.892 ± 0.814
0.0IleXaa: 0.0 ± 0.0
Lys
3.783LysAla: 3.783 ± 1.628
0.946LysCys: 0.946 ± 0.407
0.631LysAsp: 0.631 ± 0.271
0.315LysGlu: 0.315 ± 0.136
1.576LysPhe: 1.576 ± 0.678
2.207LysGly: 2.207 ± 0.949
0.315LysHis: 0.315 ± 0.136
0.631LysIle: 0.631 ± 0.271
2.522LysLys: 2.522 ± 1.579
3.783LysLeu: 3.783 ± 1.628
0.315LysMet: 0.315 ± 0.136
1.261LysAsn: 1.261 ± 0.543
2.837LysPro: 2.837 ± 4.107
0.946LysGln: 0.946 ± 0.407
1.261LysArg: 1.261 ± 0.543
2.207LysSer: 2.207 ± 0.949
1.576LysThr: 1.576 ± 1.986
3.468LysVal: 3.468 ± 1.492
0.315LysTrp: 0.315 ± 0.136
1.261LysTyr: 1.261 ± 0.543
0.0LysXaa: 0.0 ± 0.0
Leu
10.719LeuAla: 10.719 ± 1.948
1.576LeuCys: 1.576 ± 0.678
4.098LeuAsp: 4.098 ± 0.901
4.414LeuGlu: 4.414 ± 3.429
2.207LeuPhe: 2.207 ± 0.949
5.99LeuGly: 5.99 ± 2.751
3.153LeuHis: 3.153 ± 1.356
3.468LeuIle: 3.468 ± 1.492
2.837LeuLys: 2.837 ± 1.221
15.132LeuLeu: 15.132 ± 3.847
1.892LeuMet: 1.892 ± 0.814
2.207LeuAsn: 2.207 ± 0.949
7.881LeuPro: 7.881 ± 4.601
1.576LeuGln: 1.576 ± 0.678
5.044LeuArg: 5.044 ± 0.494
7.566LeuSer: 7.566 ± 2.073
7.566LeuThr: 7.566 ± 3.255
9.142LeuVal: 9.142 ± 1.269
1.892LeuTrp: 1.892 ± 0.814
3.153LeuTyr: 3.153 ± 1.356
0.0LeuXaa: 0.0 ± 0.0
Met
2.837MetAla: 2.837 ± 4.107
0.0MetCys: 0.0 ± 0.0
0.315MetAsp: 0.315 ± 0.136
0.631MetGlu: 0.631 ± 0.271
0.631MetPhe: 0.631 ± 0.271
1.261MetGly: 1.261 ± 0.543
0.315MetHis: 0.315 ± 0.136
0.631MetIle: 0.631 ± 0.271
0.0MetLys: 0.0 ± 0.0
1.261MetLeu: 1.261 ± 0.543
1.261MetMet: 1.261 ± 0.543
1.261MetAsn: 1.261 ± 0.543
0.631MetPro: 0.631 ± 0.271
0.0MetGln: 0.0 ± 0.0
1.261MetArg: 1.261 ± 2.121
1.892MetSer: 1.892 ± 1.85
1.576MetThr: 1.576 ± 0.678
0.946MetVal: 0.946 ± 0.407
1.261MetTrp: 1.261 ± 0.543
0.315MetTyr: 0.315 ± 0.136
0.0MetXaa: 0.0 ± 0.0
Asn
1.892AsnAla: 1.892 ± 0.814
1.261AsnCys: 1.261 ± 0.543
0.631AsnAsp: 0.631 ± 0.271
1.261AsnGlu: 1.261 ± 0.543
0.631AsnPhe: 0.631 ± 0.271
1.892AsnGly: 1.892 ± 0.814
0.946AsnHis: 0.946 ± 0.407
2.207AsnIle: 2.207 ± 0.949
0.315AsnLys: 0.315 ± 0.136
3.153AsnLeu: 3.153 ± 1.308
0.631AsnMet: 0.631 ± 0.271
0.631AsnAsn: 0.631 ± 0.271
1.576AsnPro: 1.576 ± 1.986
0.315AsnGln: 0.315 ± 0.136
1.261AsnArg: 1.261 ± 0.543
3.153AsnSer: 3.153 ± 1.356
2.837AsnThr: 2.837 ± 1.221
1.261AsnVal: 1.261 ± 2.121
1.892AsnTrp: 1.892 ± 0.814
0.946AsnTyr: 0.946 ± 0.407
0.0AsnXaa: 0.0 ± 0.0
Pro
5.675ProAla: 5.675 ± 2.441
3.783ProCys: 3.783 ± 1.628
2.522ProAsp: 2.522 ± 1.085
2.522ProGlu: 2.522 ± 1.085
0.631ProPhe: 0.631 ± 0.271
7.566ProGly: 7.566 ± 7.401
0.315ProHis: 0.315 ± 0.136
3.153ProIle: 3.153 ± 1.356
1.261ProLys: 1.261 ± 2.121
8.197ProLeu: 8.197 ± 1.801
0.315ProMet: 0.315 ± 0.136
1.892ProAsn: 1.892 ± 1.85
7.251ProPro: 7.251 ± 3.12
3.783ProGln: 3.783 ± 3.7
5.044ProArg: 5.044 ± 3.158
4.098ProSer: 4.098 ± 3.565
5.675ProThr: 5.675 ± 2.887
4.729ProVal: 4.729 ± 0.629
0.631ProTrp: 0.631 ± 2.393
1.892ProTyr: 1.892 ± 0.814
0.0ProXaa: 0.0 ± 0.0
Gln
2.837GlnAla: 2.837 ± 1.221
0.315GlnCys: 0.315 ± 0.136
2.207GlnAsp: 2.207 ± 0.949
1.261GlnGlu: 1.261 ± 0.543
0.315GlnPhe: 0.315 ± 0.136
1.261GlnGly: 1.261 ± 0.543
0.631GlnHis: 0.631 ± 0.271
0.946GlnIle: 0.946 ± 0.407
0.946GlnLys: 0.946 ± 0.407
3.468GlnLeu: 3.468 ± 1.492
0.631GlnMet: 0.631 ± 0.271
0.946GlnAsn: 0.946 ± 0.407
1.576GlnPro: 1.576 ± 0.678
0.315GlnGln: 0.315 ± 0.136
2.522GlnArg: 2.522 ± 1.579
0.631GlnSer: 0.631 ± 0.271
2.837GlnThr: 2.837 ± 1.443
1.576GlnVal: 1.576 ± 0.678
0.946GlnTrp: 0.946 ± 0.407
1.261GlnTyr: 1.261 ± 0.543
0.0GlnXaa: 0.0 ± 0.0
Arg
5.675ArgAla: 5.675 ± 8.215
2.522ArgCys: 2.522 ± 1.085
3.153ArgAsp: 3.153 ± 1.308
1.261ArgGlu: 1.261 ± 0.543
0.946ArgPhe: 0.946 ± 0.407
6.62ArgGly: 6.62 ± 0.184
2.837ArgHis: 2.837 ± 1.443
1.576ArgIle: 1.576 ± 0.678
3.783ArgLys: 3.783 ± 1.036
6.62ArgLeu: 6.62 ± 0.184
1.576ArgMet: 1.576 ± 1.919
1.261ArgAsn: 1.261 ± 2.121
3.153ArgPro: 3.153 ± 1.356
0.631ArgGln: 0.631 ± 0.271
6.305ArgArg: 6.305 ± 0.049
3.468ArgSer: 3.468 ± 6.5
3.783ArgThr: 3.783 ± 1.036
5.675ArgVal: 5.675 ± 2.887
0.631ArgTrp: 0.631 ± 0.271
0.946ArgTyr: 0.946 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
4.414SerAla: 4.414 ± 0.765
2.522SerCys: 2.522 ± 1.579
1.892SerAsp: 1.892 ± 0.814
1.576SerGlu: 1.576 ± 0.678
2.207SerPhe: 2.207 ± 0.949
7.881SerGly: 7.881 ± 1.937
1.261SerHis: 1.261 ± 0.543
2.207SerIle: 2.207 ± 1.715
2.207SerLys: 2.207 ± 0.949
5.359SerLeu: 5.359 ± 3.022
1.892SerMet: 1.892 ± 1.85
0.631SerAsn: 0.631 ± 0.271
7.251SerPro: 7.251 ± 4.872
1.892SerGln: 1.892 ± 0.814
4.729SerArg: 4.729 ± 0.629
7.251SerSer: 7.251 ± 7.536
6.62SerThr: 6.62 ± 0.184
5.359SerVal: 5.359 ± 2.306
3.468SerTrp: 3.468 ± 1.172
2.207SerTyr: 2.207 ± 0.949
0.0SerXaa: 0.0 ± 0.0
Thr
5.99ThrAla: 5.99 ± 0.087
3.153ThrCys: 3.153 ± 1.308
3.153ThrAsp: 3.153 ± 1.356
2.837ThrGlu: 2.837 ± 1.221
1.576ThrPhe: 1.576 ± 0.678
7.251ThrGly: 7.251 ± 3.12
1.892ThrHis: 1.892 ± 0.814
1.892ThrIle: 1.892 ± 0.814
2.837ThrLys: 2.837 ± 1.221
4.729ThrLeu: 4.729 ± 0.629
0.946ThrMet: 0.946 ± 0.407
2.522ThrAsn: 2.522 ± 1.579
6.62ThrPro: 6.62 ± 2.48
2.207ThrGln: 2.207 ± 0.949
4.729ThrArg: 4.729 ± 2.035
4.414ThrSer: 4.414 ± 3.429
6.936ThrThr: 6.936 ± 2.984
4.098ThrVal: 4.098 ± 0.901
1.892ThrTrp: 1.892 ± 0.814
1.892ThrTyr: 1.892 ± 1.85
0.0ThrXaa: 0.0 ± 0.0
Val
8.197ValAla: 8.197 ± 7.13
3.468ValCys: 3.468 ± 1.172
2.837ValAsp: 2.837 ± 1.443
3.783ValGlu: 3.783 ± 1.036
3.468ValPhe: 3.468 ± 1.492
8.827ValGly: 8.827 ± 1.53
0.631ValHis: 0.631 ± 0.271
4.098ValIle: 4.098 ± 1.763
1.261ValLys: 1.261 ± 0.543
7.881ValLeu: 7.881 ± 3.391
0.315ValMet: 0.315 ± 0.136
0.946ValAsn: 0.946 ± 0.407
4.098ValPro: 4.098 ± 1.763
3.153ValGln: 3.153 ± 1.356
5.359ValArg: 5.359 ± 3.022
5.99ValSer: 5.99 ± 0.087
5.044ValThr: 5.044 ± 0.494
6.936ValVal: 6.936 ± 2.984
1.261ValTrp: 1.261 ± 0.543
2.522ValTyr: 2.522 ± 1.085
0.0ValXaa: 0.0 ± 0.0
Trp
2.837TrpAla: 2.837 ± 1.221
0.315TrpCys: 0.315 ± 0.136
0.631TrpAsp: 0.631 ± 0.271
0.946TrpGlu: 0.946 ± 0.407
1.261TrpPhe: 1.261 ± 0.543
1.892TrpGly: 1.892 ± 1.85
0.946TrpHis: 0.946 ± 0.407
0.946TrpIle: 0.946 ± 0.407
1.261TrpLys: 1.261 ± 2.121
1.892TrpLeu: 1.892 ± 0.814
0.946TrpMet: 0.946 ± 0.407
0.631TrpAsn: 0.631 ± 0.271
0.946TrpPro: 0.946 ± 0.407
0.631TrpGln: 0.631 ± 0.271
1.892TrpArg: 1.892 ± 1.85
1.261TrpSer: 1.261 ± 0.543
1.261TrpThr: 1.261 ± 0.543
2.207TrpVal: 2.207 ± 4.379
0.315TrpTrp: 0.315 ± 0.136
0.631TrpTyr: 0.631 ± 0.271
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.892TyrAla: 1.892 ± 0.814
0.946TyrCys: 0.946 ± 0.407
1.576TyrAsp: 1.576 ± 0.678
0.946TyrGlu: 0.946 ± 0.407
0.946TyrPhe: 0.946 ± 0.407
2.837TyrGly: 2.837 ± 1.221
0.946TyrHis: 0.946 ± 0.407
1.576TyrIle: 1.576 ± 0.678
0.946TyrLys: 0.946 ± 0.407
3.153TyrLeu: 3.153 ± 1.356
0.631TyrMet: 0.631 ± 0.271
0.631TyrAsn: 0.631 ± 0.271
2.207TyrPro: 2.207 ± 0.949
1.892TyrGln: 1.892 ± 0.814
2.207TyrArg: 2.207 ± 1.715
3.468TyrSer: 3.468 ± 1.492
0.946TyrThr: 0.946 ± 0.407
2.522TyrVal: 2.522 ± 1.085
0.0TyrTrp: 0.0 ± 0.0
0.946TyrTyr: 0.946 ± 0.407
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3173 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski