Amino acid dipeptide frequency for Gentian Kobu-sho-associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
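The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name `dipeptide_per_mille`, the one-letter input alphabet, the use of overlapping residue pairs, and the mapping of every non-standard letter to X (Xaa) are all assumptions. Overlapping pairs do appear consistent with the table, since the smallest non-zero value (0.135) matches one occurrence among the 7390 pairs of a 7391-residue protein.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

def dipeptide_per_mille(sequence):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    # Assumption: any letter outside the 20 standard ones is counted as X (Xaa).
    seq = "".join(c if c in STANDARD else "X" for c in sequence.upper())
    pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(pairs.values())
    return {pair: 1000.0 * n / total for pair, n in pairs.items()}

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

# Toy sequence (hypothetical, not taken from the virus proteome).
freqs = dipeptide_per_mille("MKKLAAAKKU")
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_EXPECTATION else "under"
    print(pair, round(value, 1), label)
```

The uniform baseline of 2.5‰ assumes that all amino acids are equally common; a stricter baseline would multiply the two single amino acid frequencies of each pair.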

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
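As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a fraction, a percentage, and an approximate count. The helper name `per_mille_to_fraction` is hypothetical, and the count assumes that the 7391 amino acids reported in the statistics line of this page form 7390 overlapping pairs.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala = 6.901                       # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100              # to compare with the 0.25% uniform expectation

# Assumption: one protein of 7391 residues gives 7390 overlapping pairs.
approx_count = round(fraction * 7390)

print(round(fraction, 6), round(percent, 4), approx_count)
```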

For more information see sequence space article on Wikipedia.

     Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.901  1.894  2.571  5.413  4.195  4.736  1.218  5.277  6.089  6.36   2.3    2.842  3.924  2.3    3.112  5.548  3.789  5.413  1.218  1.083  0.135
Cys  0.677  0.135  1.353  0.677  0.135  1.488  0.541  1.624  1.353  1.488  0.0    0.947  1.353  0.677  0.947  1.218  0.812  1.624  0.135  0.947  0.0
Asp  2.706  1.218  2.03   3.654  2.165  4.06   0.947  2.03   2.842  5.413  1.488  2.571  2.3    2.03   1.759  3.383  1.624  3.112  1.083  2.03   0.0
Glu  4.871  1.083  3.112  8.119  4.195  6.225  1.759  5.142  5.413  4.736  2.571  2.436  2.3    3.518  4.601  4.33   2.977  4.195  1.218  1.759  0.0
Phe  2.977  0.947  3.112  3.789  2.436  4.06   0.947  2.842  2.842  2.3    1.218  1.624  0.947  0.812  2.165  2.436  2.436  2.842  1.488  1.488  0.135
Gly  5.413  1.353  3.383  4.601  3.248  3.789  1.759  3.654  7.578  7.037  1.894  2.3    1.894  3.112  4.33   4.601  4.195  4.195  1.083  1.218  0.135
His  1.759  0.812  0.541  0.406  1.894  2.3    0.135  0.947  1.218  1.488  0.406  0.677  0.677  1.083  0.947  1.353  1.218  1.488  0.541  0.812  0.0
Ile  4.33   0.677  3.924  4.06   1.488  4.06   1.083  2.706  2.977  3.112  1.624  2.3    1.894  2.165  3.383  4.195  3.518  5.142  0.541  1.353  0.0
Lys  5.819  0.812  2.977  5.413  3.518  5.007  1.488  4.601  6.766  5.548  1.218  3.112  2.436  2.706  3.789  3.789  5.819  5.413  0.541  2.03   0.0
Leu  5.819  1.624  3.112  6.495  2.571  5.954  1.894  2.842  5.548  6.36   2.03   4.195  3.518  2.436  5.548  5.277  4.195  5.413  1.624  2.706  0.0
Met  2.3    0.135  1.083  1.894  0.406  1.488  0.271  1.759  1.624  1.353  1.083  1.624  0.677  0.812  2.165  1.759  2.03   2.706  0.406  0.947  0.0
Asn  2.842  0.677  2.3    2.571  1.759  3.248  0.947  3.654  2.977  5.277  0.541  2.436  1.624  1.083  1.488  2.571  1.894  3.518  0.541  1.759  0.0
Pro  3.248  0.135  1.624  3.654  1.353  2.436  0.677  2.571  2.03   2.842  0.677  1.488  0.541  1.488  2.165  3.383  1.759  2.436  0.541  1.353  0.0
Gln  2.977  0.947  1.624  3.789  2.571  2.03   1.488  0.541  2.165  2.706  0.677  1.488  0.541  1.218  2.571  2.165  1.624  1.624  0.812  1.353  0.0
Arg  4.601  1.218  3.518  3.112  2.977  4.33   0.812  3.112  4.195  4.33   1.218  2.3    1.894  2.03   2.977  3.383  2.977  4.736  0.947  0.947  0.0
Ser  3.789  0.812  3.383  4.195  2.571  5.683  1.083  2.842  4.465  6.225  1.353  2.706  2.03   1.894  4.06   3.654  4.871  5.683  2.03   1.488  0.0
Thr  5.277  1.218  2.165  3.383  2.03   3.112  1.083  3.248  5.277  2.977  1.488  3.518  3.248  1.353  3.924  3.789  4.465  3.789  0.406  1.759  0.0
Val  5.954  1.759  3.654  6.36   2.165  3.518  0.947  4.06   3.924  5.413  2.03   2.977  3.518  2.977  3.924  5.548  4.601  6.901  0.541  2.977  0.0
Trp  1.488  0.135  0.812  1.759  1.083  1.083  0.271  0.541  1.218  1.218  1.218  0.135  0.406  0.406  1.083  0.677  1.083  1.218  0.135  0.812  0.0
Tyr  2.571  0.541  2.165  1.218  1.083  2.165  1.353  0.541  2.165  2.977  1.353  2.03   0.677  0.947  0.677  1.624  1.624  2.3    0.947  1.353  0.0
Xaa  0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.135  0.0    0.0    0.0    0.0    0.0    0.0    0.135  0.0    0.0    0.0    0.135  0.0

Rows give the first residue of the dipeptide and columns the second (for example, row Ala, column Cys is AlaCys). Values are in ‰; the error reported for every entry is ± 0.0.
Statistics based on 1 protein (7391 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
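A minimal sketch of what protein-level bootstrapping could look like is given below. It is an assumption-laden illustration and not the database's implementation: the function names, the use of the population standard deviation as the error measure, and the fixed random seed are all hypothetical choices. Whole proteins are resampled with replacement 100 times and the dipeptide frequency is recomputed for each resample.

```python
import random
import statistics
from collections import Counter

def pair_counts(sequence):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the per mille frequency of `pair` across bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    counts = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(counts, k=len(counts))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is summarised as a population standard deviation.
    return statistics.pstdev(estimates)

# Toy inputs (hypothetical sequences, not taken from the virus proteome).
single = bootstrap_error(["MKKLAAAKK"], "KK")
several = bootstrap_error(["MKKL", "AAAA", "KKKK"], "KK")
print(single, several)
```

Under this scheme a proteome with a single protein yields the same resample every time, which would explain why every error in the table is ± 0.0.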

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski