Amino acid dipeptide frequency for Cryptosporidium parvum virus 1 (strain KSU-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
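As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues in each protein and normalises by the total number of pairs. It assumes one-letter amino acid codes and that pairs are not counted across protein boundaries; the function names and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each sequence."""
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields (residue i, residue i+1) for every position
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_frequencies(sequences):
    """Return each observed dipeptide's share of all dipeptides (as a fraction)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
freqs = dipeptide_frequencies(["MKLLA", "ALLK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value * 1000, 3), "per mille")
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 400- or 441-entry table would fill those in with zero.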

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
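The uniform baseline and the over/under marking can be sketched as follows. Comparing each value against a single uniform threshold is an assumption made for illustration; the page does not state the exact rule behind the colouring, and the function names are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def representation(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


print(expected_permille())      # 2.5, i.e. 0.25%
print(representation(15.439))   # AspLeu value from the table: over
print(representation(1.188))    # AlaCys value from the table: under
```

With 21 symbols (Xaa included) the same baseline would be 1000/441, roughly 2.27‰.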

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
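For instance, the unit conversion could look like the snippet below. The helper names are hypothetical; the sample value is the AspLeu entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


asp_leu = 15.439  # AspLeu frequency in per mille
print(permille_to_fraction(asp_leu))
print(permille_to_percent(asp_leu))
```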

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± bootstrap error), grouped by first residue. Second residues follow the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 1.188 ± 0.724
AlaAsp: 2.375 ± 2.38
AlaGlu: 2.375 ± 0.466
AlaPhe: 4.751 ± 0.982
AlaGly: 1.188 ± 0.724
AlaHis: 1.188 ± 0.724
AlaIle: 1.188 ± 0.724
AlaLys: 0.0 ± 0.0
AlaLeu: 3.563 ± 0.258
AlaMet: 0.0 ± 0.0
AlaAsn: 0.0 ± 0.0
AlaPro: 2.375 ± 0.466
AlaGln: 2.375 ± 1.448
AlaArg: 3.563 ± 2.172
AlaSer: 1.188 ± 0.724
AlaThr: 2.375 ± 0.466
AlaVal: 4.751 ± 2.846
AlaTrp: 1.188 ± 1.19
AlaTyr: 3.563 ± 2.172
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.188 ± 1.19
CysCys: 0.0 ± 0.0
CysAsp: 1.188 ± 1.19
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 3.563 ± 0.258
CysArg: 3.563 ± 0.258
CysSer: 1.188 ± 1.19
CysThr: 1.188 ± 0.724
CysVal: 0.0 ± 0.0
CysTrp: 1.188 ± 0.724
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.375 ± 1.448
AspCys: 0.0 ± 0.0
AspAsp: 3.563 ± 0.258
AspGlu: 3.563 ± 1.656
AspPhe: 4.751 ± 0.982
AspGly: 3.563 ± 0.258
AspHis: 0.0 ± 0.0
AspIle: 3.563 ± 2.172
AspLys: 4.751 ± 2.896
AspLeu: 15.439 ± 3.986
AspMet: 1.188 ± 1.19
AspAsn: 1.188 ± 1.19
AspPro: 0.0 ± 0.0
AspGln: 1.188 ± 0.724
AspArg: 2.375 ± 0.466
AspSer: 1.188 ± 0.724
AspThr: 1.188 ± 1.19
AspVal: 2.375 ± 1.448
AspTrp: 1.188 ± 1.19
AspTyr: 3.563 ± 0.258
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.751 ± 0.932
GluCys: 2.375 ± 2.38
GluAsp: 2.375 ± 1.448
GluGlu: 1.188 ± 0.724
GluPhe: 3.563 ± 2.172
GluGly: 0.0 ± 0.0
GluHis: 2.375 ± 2.38
GluIle: 2.375 ± 0.466
GluLys: 9.501 ± 0.05
GluLeu: 5.938 ± 0.208
GluMet: 2.375 ± 0.466
GluAsn: 4.751 ± 2.896
GluPro: 0.0 ± 0.0
GluGln: 1.188 ± 1.19
GluArg: 3.563 ± 0.258
GluSer: 2.375 ± 0.466
GluThr: 4.751 ± 0.932
GluVal: 2.375 ± 0.466
GluTrp: 0.0 ± 0.0
GluTyr: 1.188 ± 0.724
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.188 ± 0.724
PheCys: 0.0 ± 0.0
PheAsp: 4.751 ± 0.982
PheGlu: 7.126 ± 0.516
PhePhe: 1.188 ± 0.724
PheGly: 3.563 ± 0.258
PheHis: 3.563 ± 2.172
PheIle: 3.563 ± 2.172
PheLys: 5.938 ± 3.62
PheLeu: 2.375 ± 1.448
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 2.375 ± 0.466
PheGln: 0.0 ± 0.0
PheArg: 4.751 ± 0.982
PheSer: 3.563 ± 2.172
PheThr: 1.188 ± 0.724
PheVal: 3.563 ± 1.656
PheTrp: 1.188 ± 0.724
PheTyr: 4.751 ± 0.932
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.375 ± 1.448
GlyCys: 1.188 ± 0.724
GlyAsp: 2.375 ± 1.448
GlyGlu: 1.188 ± 0.724
GlyPhe: 1.188 ± 1.19
GlyGly: 1.188 ± 0.724
GlyHis: 1.188 ± 1.19
GlyIle: 3.563 ± 0.258
GlyLys: 4.751 ± 0.982
GlyLeu: 5.938 ± 1.706
GlyMet: 2.375 ± 0.922
GlyAsn: 0.0 ± 0.0
GlyPro: 0.0 ± 0.0
GlyGln: 2.375 ± 0.466
GlyArg: 0.0 ± 0.0
GlySer: 2.375 ± 1.448
GlyThr: 2.375 ± 1.448
GlyVal: 2.375 ± 1.448
GlyTrp: 2.375 ± 0.466
GlyTyr: 1.188 ± 0.724
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.188 ± 0.724
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.188 ± 0.724
HisHis: 1.188 ± 1.19
HisIle: 1.188 ± 1.19
HisLys: 1.188 ± 0.724
HisLeu: 4.751 ± 4.76
HisMet: 0.0 ± 0.707
HisAsn: 2.375 ± 0.466
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.188 ± 1.19
HisSer: 3.563 ± 3.57
HisThr: 0.0 ± 0.0
HisVal: 1.188 ± 0.724
HisTrp: 0.0 ± 0.0
HisTyr: 4.751 ± 2.896
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.751 ± 0.982
IleCys: 0.0 ± 0.0
IleAsp: 4.751 ± 2.896
IleGlu: 3.563 ± 1.656
IlePhe: 2.375 ± 1.448
IleGly: 4.751 ± 2.896
IleHis: 0.0 ± 0.0
IleIle: 5.938 ± 0.208
IleLys: 1.188 ± 0.724
IleLeu: 9.501 ± 1.864
IleMet: 2.375 ± 0.466
IleAsn: 4.751 ± 2.846
IlePro: 4.751 ± 0.982
IleGln: 3.563 ± 1.656
IleArg: 1.188 ± 1.19
IleSer: 2.375 ± 0.466
IleThr: 2.375 ± 2.38
IleVal: 5.938 ± 0.208
IleTrp: 1.188 ± 0.724
IleTyr: 5.938 ± 1.706
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 1.188 ± 1.19
LysGlu: 3.563 ± 2.172
LysPhe: 7.126 ± 2.43
LysGly: 1.188 ± 0.724
LysHis: 1.188 ± 1.19
LysIle: 8.314 ± 0.674
LysLys: 5.938 ± 2.122
LysLeu: 4.751 ± 0.982
LysMet: 1.188 ± 0.724
LysAsn: 4.751 ± 2.846
LysPro: 4.751 ± 0.982
LysGln: 1.188 ± 0.724
LysArg: 4.751 ± 0.982
LysSer: 4.751 ± 0.982
LysThr: 1.188 ± 0.724
LysVal: 5.938 ± 2.122
LysTrp: 2.375 ± 1.448
LysTyr: 7.126 ± 4.344
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.188 ± 0.724
LeuCys: 1.188 ± 0.724
LeuAsp: 13.064 ± 2.222
LeuGlu: 1.188 ± 0.724
LeuPhe: 3.563 ± 2.172
LeuGly: 10.689 ± 3.054
LeuHis: 0.0 ± 0.0
LeuIle: 8.314 ± 2.588
LeuLys: 7.126 ± 0.516
LeuLeu: 10.689 ± 3.054
LeuMet: 0.0 ± 0.0
LeuAsn: 8.314 ± 3.154
LeuPro: 5.938 ± 1.706
LeuGln: 3.563 ± 0.258
LeuArg: 8.314 ± 3.154
LeuSer: 11.876 ± 4.244
LeuThr: 7.126 ± 1.398
LeuVal: 3.563 ± 0.258
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.188 ± 0.724
MetCys: 0.0 ± 0.0
MetAsp: 1.188 ± 1.19
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.188 ± 1.19
MetIle: 3.563 ± 1.656
MetLys: 2.375 ± 1.448
MetLeu: 1.188 ± 0.724
MetMet: 1.188 ± 0.724
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 2.375 ± 0.466
MetSer: 1.188 ± 0.724
MetThr: 0.0 ± 0.0
MetVal: 3.563 ± 3.57
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.188 ± 1.19
AsnCys: 1.188 ± 1.19
AsnAsp: 1.188 ± 0.724
AsnGlu: 2.375 ± 1.448
AsnPhe: 3.563 ± 2.172
AsnGly: 0.0 ± 0.0
AsnHis: 2.375 ± 2.38
AsnIle: 4.751 ± 0.982
AsnLys: 3.563 ± 0.258
AsnLeu: 4.751 ± 0.932
AsnMet: 1.188 ± 0.724
AsnAsn: 1.188 ± 1.19
AsnPro: 1.188 ± 1.19
AsnGln: 1.188 ± 0.724
AsnArg: 7.126 ± 1.398
AsnSer: 0.0 ± 0.0
AsnThr: 2.375 ± 0.466
AsnVal: 3.563 ± 0.258
AsnTrp: 1.188 ± 0.724
AsnTyr: 1.188 ± 1.19
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 1.188 ± 0.724
ProAsp: 2.375 ± 2.38
ProGlu: 4.751 ± 0.982
ProPhe: 2.375 ± 1.448
ProGly: 1.188 ± 0.724
ProHis: 1.188 ± 1.19
ProIle: 3.563 ± 1.656
ProLys: 4.751 ± 0.982
ProLeu: 1.188 ± 1.19
ProMet: 1.188 ± 1.19
ProAsn: 3.563 ± 0.258
ProPro: 3.563 ± 0.258
ProGln: 0.0 ± 0.0
ProArg: 3.563 ± 0.258
ProSer: 2.375 ± 1.448
ProThr: 7.126 ± 3.312
ProVal: 1.188 ± 1.19
ProTrp: 0.0 ± 0.0
ProTyr: 2.375 ± 0.466
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.188 ± 1.19
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 4.751 ± 0.982
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 3.563 ± 2.172
GlnLys: 2.375 ± 2.38
GlnLeu: 5.938 ± 3.62
GlnMet: 1.188 ± 0.724
GlnAsn: 1.188 ± 1.19
GlnPro: 1.188 ± 0.724
GlnGln: 1.188 ± 0.724
GlnArg: 1.188 ± 0.724
GlnSer: 3.563 ± 1.656
GlnThr: 2.375 ± 2.38
GlnVal: 2.375 ± 0.466
GlnTrp: 1.188 ± 0.724
GlnTyr: 2.375 ± 1.448
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.188 ± 0.724
ArgCys: 1.188 ± 0.724
ArgAsp: 3.563 ± 0.258
ArgGlu: 1.188 ± 1.19
ArgPhe: 7.126 ± 0.516
ArgGly: 2.375 ± 1.448
ArgHis: 2.375 ± 1.448
ArgIle: 5.938 ± 0.208
ArgLys: 3.563 ± 1.656
ArgLeu: 8.314 ± 3.154
ArgMet: 0.0 ± 0.0
ArgAsn: 1.188 ± 0.724
ArgPro: 4.751 ± 2.846
ArgGln: 1.188 ± 0.724
ArgArg: 1.188 ± 1.19
ArgSer: 8.314 ± 1.24
ArgThr: 5.938 ± 1.706
ArgVal: 3.563 ± 1.656
ArgTrp: 1.188 ± 1.19
ArgTyr: 3.563 ± 0.258
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.751 ± 0.982
SerCys: 1.188 ± 0.724
SerAsp: 1.188 ± 1.19
SerGlu: 5.938 ± 2.122
SerPhe: 5.938 ± 0.208
SerGly: 3.563 ± 0.258
SerHis: 3.563 ± 0.258
SerIle: 7.126 ± 1.398
SerLys: 5.938 ± 1.706
SerLeu: 7.126 ± 2.43
SerMet: 1.188 ± 0.724
SerAsn: 3.563 ± 1.656
SerPro: 1.188 ± 1.19
SerGln: 3.563 ± 1.656
SerArg: 3.563 ± 0.258
SerSer: 4.751 ± 0.932
SerThr: 3.563 ± 0.258
SerVal: 2.375 ± 0.466
SerTrp: 0.0 ± 0.0
SerTyr: 2.375 ± 0.466
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.938 ± 2.122
ThrCys: 1.188 ± 1.19
ThrAsp: 2.375 ± 2.38
ThrGlu: 4.751 ± 0.932
ThrPhe: 3.563 ± 1.656
ThrGly: 2.375 ± 1.448
ThrHis: 1.188 ± 0.724
ThrIle: 1.188 ± 0.724
ThrLys: 3.563 ± 0.258
ThrLeu: 2.375 ± 0.466
ThrMet: 2.375 ± 2.38
ThrAsn: 1.188 ± 0.724
ThrPro: 3.563 ± 1.656
ThrGln: 1.188 ± 0.724
ThrArg: 5.938 ± 0.208
ThrSer: 8.314 ± 2.588
ThrThr: 2.375 ± 0.466
ThrVal: 1.188 ± 0.724
ThrTrp: 1.188 ± 0.724
ThrTyr: 1.188 ± 1.19
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.188 ± 1.19
ValCys: 1.188 ± 1.19
ValAsp: 2.375 ± 2.38
ValGlu: 1.188 ± 1.19
ValPhe: 0.0 ± 0.0
ValGly: 0.0 ± 0.0
ValHis: 2.375 ± 2.38
ValIle: 0.0 ± 0.0
ValLys: 3.563 ± 0.258
ValLeu: 3.563 ± 1.656
ValMet: 0.0 ± 0.0
ValAsn: 3.563 ± 2.172
ValPro: 7.126 ± 1.398
ValGln: 4.751 ± 0.932
ValArg: 5.938 ± 1.706
ValSer: 5.938 ± 1.706
ValThr: 3.563 ± 1.656
ValVal: 0.0 ± 0.0
ValTrp: 1.188 ± 0.724
ValTyr: 4.751 ± 2.846
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.375 ± 1.448
TrpCys: 0.0 ± 0.0
TrpAsp: 2.375 ± 1.448
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.188 ± 0.724
TrpLys: 0.0 ± 0.0
TrpLeu: 2.375 ± 1.448
TrpMet: 0.0 ± 0.0
TrpAsn: 1.188 ± 1.19
TrpPro: 0.0 ± 0.0
TrpGln: 1.188 ± 0.724
TrpArg: 2.375 ± 0.466
TrpSer: 1.188 ± 0.724
TrpThr: 2.375 ± 0.466
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.188 ± 1.19
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.188 ± 0.724
TyrCys: 0.0 ± 0.0
TyrAsp: 4.751 ± 2.896
TyrGlu: 8.314 ± 0.674
TyrPhe: 3.563 ± 2.172
TyrGly: 3.563 ± 2.172
TyrHis: 0.0 ± 0.0
TyrIle: 2.375 ± 0.466
TyrLys: 1.188 ± 1.19
TyrLeu: 5.938 ± 3.62
TyrMet: 0.0 ± 0.0
TyrAsn: 2.375 ± 0.466
TyrPro: 4.751 ± 0.932
TyrGln: 2.375 ± 1.448
TyrArg: 1.188 ± 1.19
TyrSer: 2.375 ± 0.466
TyrThr: 3.563 ± 1.656
TyrVal: 2.375 ± 0.466
TyrTrp: 1.188 ± 0.724
TyrTyr: 1.188 ± 1.19
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (843 amino acids)
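The entries listed above follow a fixed "FirstSecond: value ± error" pattern with three-letter residue codes, so they can be read into a lookup table with a short parser. The sketch below is one possible approach; the function name is hypothetical, and the regular expression deliberately ignores anything that precedes the residue pair on a line as well as lines that do not match (such as the row labels).

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


# A few lines in the format used on this page.
sample = [
    "Asp",
    "15.439AspLeu: 15.439 ± 3.986",
    "AlaCys: 1.188 ± 0.724",
    "0.0AspPro: 0.0 ± 0.0",
]
table = parse_entries(sample)
most_frequent = max(table, key=lambda pair: table[pair][0])
print(most_frequent, table[most_frequent])
```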

Note: The error was estimated by bootstrapping (×100) at the protein level.
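A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a generic illustration under stated assumptions (one-letter codes, 100 resamples, population standard deviation as the error measure, hypothetical function names and toy sequences); the exact procedure used by Proteome-pI is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_frequency_permille(sequences, pair):
    """Frequency of one dipeptide among all dipeptides, in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_permille(resample, pair))
    return statistics.pstdev(estimates)


proteins = ["MKLLA", "ALLK"]  # toy input, not the proteome from this page
print(pair_frequency_permille(proteins, "LL"))
print(bootstrap_error(proteins, "LL"))
```

With only two proteins, as in this proteome, each resample is one of very few possible combinations, so the resulting error estimates are necessarily coarse.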

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski