Amino acid dipepetide frequency for Shahe heteroptera virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.248AlaAla: 6.248 ± 0.0
1.041AlaCys: 1.041 ± 0.0
4.512AlaAsp: 4.512 ± 0.0
2.777AlaGlu: 2.777 ± 0.0
2.083AlaPhe: 2.083 ± 0.0
4.512AlaGly: 4.512 ± 0.0
2.083AlaHis: 2.083 ± 0.0
5.207AlaIle: 5.207 ± 0.0
5.207AlaLys: 5.207 ± 0.0
6.595AlaLeu: 6.595 ± 0.0
2.083AlaMet: 2.083 ± 0.0
3.471AlaAsn: 3.471 ± 0.0
3.818AlaPro: 3.818 ± 0.0
3.124AlaGln: 3.124 ± 0.0
4.165AlaArg: 4.165 ± 0.0
4.512AlaSer: 4.512 ± 0.0
2.43AlaThr: 2.43 ± 0.0
4.512AlaVal: 4.512 ± 0.0
1.041AlaTrp: 1.041 ± 0.0
2.777AlaTyr: 2.777 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.694CysAla: 0.694 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.041CysAsp: 1.041 ± 0.0
1.041CysGlu: 1.041 ± 0.0
0.694CysPhe: 0.694 ± 0.0
1.388CysGly: 1.388 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.041CysIle: 1.041 ± 0.0
1.388CysLys: 1.388 ± 0.0
2.083CysLeu: 2.083 ± 0.0
0.347CysMet: 0.347 ± 0.0
1.388CysAsn: 1.388 ± 0.0
0.347CysPro: 0.347 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.694CysArg: 0.694 ± 0.0
1.388CysSer: 1.388 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.347CysVal: 0.347 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.694CysTyr: 0.694 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.859AspAla: 4.859 ± 0.0
1.388AspCys: 1.388 ± 0.0
3.818AspAsp: 3.818 ± 0.0
1.736AspGlu: 1.736 ± 0.0
2.777AspPhe: 2.777 ± 0.0
3.124AspGly: 3.124 ± 0.0
1.041AspHis: 1.041 ± 0.0
2.43AspIle: 2.43 ± 0.0
4.859AspLys: 4.859 ± 0.0
5.207AspLeu: 5.207 ± 0.0
1.041AspMet: 1.041 ± 0.0
1.388AspAsn: 1.388 ± 0.0
3.818AspPro: 3.818 ± 0.0
2.43AspGln: 2.43 ± 0.0
1.736AspArg: 1.736 ± 0.0
6.942AspSer: 6.942 ± 0.0
3.124AspThr: 3.124 ± 0.0
4.512AspVal: 4.512 ± 0.0
0.347AspTrp: 0.347 ± 0.0
2.777AspTyr: 2.777 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.554GluAla: 5.554 ± 0.0
1.041GluCys: 1.041 ± 0.0
2.43GluAsp: 2.43 ± 0.0
2.43GluGlu: 2.43 ± 0.0
2.43GluPhe: 2.43 ± 0.0
5.554GluGly: 5.554 ± 0.0
0.694GluHis: 0.694 ± 0.0
2.43GluIle: 2.43 ± 0.0
2.777GluLys: 2.777 ± 0.0
5.207GluLeu: 5.207 ± 0.0
1.041GluMet: 1.041 ± 0.0
1.736GluAsn: 1.736 ± 0.0
2.777GluPro: 2.777 ± 0.0
0.694GluGln: 0.694 ± 0.0
3.124GluArg: 3.124 ± 0.0
2.083GluSer: 2.083 ± 0.0
1.388GluThr: 1.388 ± 0.0
4.165GluVal: 4.165 ± 0.0
0.694GluTrp: 0.694 ± 0.0
1.041GluTyr: 1.041 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.471PheAla: 3.471 ± 0.0
0.694PheCys: 0.694 ± 0.0
3.818PheAsp: 3.818 ± 0.0
2.777PheGlu: 2.777 ± 0.0
1.041PhePhe: 1.041 ± 0.0
3.124PheGly: 3.124 ± 0.0
1.041PheHis: 1.041 ± 0.0
2.083PheIle: 2.083 ± 0.0
1.388PheLys: 1.388 ± 0.0
3.124PheLeu: 3.124 ± 0.0
1.041PheMet: 1.041 ± 0.0
3.471PheAsn: 3.471 ± 0.0
2.777PhePro: 2.777 ± 0.0
1.041PheGln: 1.041 ± 0.0
3.124PheArg: 3.124 ± 0.0
3.471PheSer: 3.471 ± 0.0
2.777PheThr: 2.777 ± 0.0
4.859PheVal: 4.859 ± 0.0
0.694PheTrp: 0.694 ± 0.0
2.43PheTyr: 2.43 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.471GlyAla: 3.471 ± 0.0
1.041GlyCys: 1.041 ± 0.0
4.512GlyAsp: 4.512 ± 0.0
5.901GlyGlu: 5.901 ± 0.0
4.512GlyPhe: 4.512 ± 0.0
3.818GlyGly: 3.818 ± 0.0
1.041GlyHis: 1.041 ± 0.0
4.859GlyIle: 4.859 ± 0.0
4.165GlyLys: 4.165 ± 0.0
4.165GlyLeu: 4.165 ± 0.0
1.736GlyMet: 1.736 ± 0.0
1.388GlyAsn: 1.388 ± 0.0
1.388GlyPro: 1.388 ± 0.0
2.083GlyGln: 2.083 ± 0.0
1.388GlyArg: 1.388 ± 0.0
3.818GlySer: 3.818 ± 0.0
3.124GlyThr: 3.124 ± 0.0
7.636GlyVal: 7.636 ± 0.0
0.694GlyTrp: 0.694 ± 0.0
2.777GlyTyr: 2.777 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.041HisAla: 1.041 ± 0.0
0.694HisCys: 0.694 ± 0.0
0.694HisAsp: 0.694 ± 0.0
0.347HisGlu: 0.347 ± 0.0
0.694HisPhe: 0.694 ± 0.0
2.083HisGly: 2.083 ± 0.0
0.347HisHis: 0.347 ± 0.0
1.041HisIle: 1.041 ± 0.0
1.041HisLys: 1.041 ± 0.0
1.041HisLeu: 1.041 ± 0.0
0.347HisMet: 0.347 ± 0.0
0.347HisAsn: 0.347 ± 0.0
1.736HisPro: 1.736 ± 0.0
0.347HisGln: 0.347 ± 0.0
0.347HisArg: 0.347 ± 0.0
1.388HisSer: 1.388 ± 0.0
0.694HisThr: 0.694 ± 0.0
2.777HisVal: 2.777 ± 0.0
0.694HisTrp: 0.694 ± 0.0
0.694HisTyr: 0.694 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.554IleAla: 5.554 ± 0.0
1.041IleCys: 1.041 ± 0.0
2.43IleAsp: 2.43 ± 0.0
1.388IleGlu: 1.388 ± 0.0
2.43IlePhe: 2.43 ± 0.0
4.165IleGly: 4.165 ± 0.0
0.694IleHis: 0.694 ± 0.0
1.388IleIle: 1.388 ± 0.0
4.165IleLys: 4.165 ± 0.0
5.207IleLeu: 5.207 ± 0.0
1.736IleMet: 1.736 ± 0.0
2.083IleAsn: 2.083 ± 0.0
3.471IlePro: 3.471 ± 0.0
3.124IleGln: 3.124 ± 0.0
4.165IleArg: 4.165 ± 0.0
2.777IleSer: 2.777 ± 0.0
3.124IleThr: 3.124 ± 0.0
3.818IleVal: 3.818 ± 0.0
1.388IleTrp: 1.388 ± 0.0
2.43IleTyr: 2.43 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.777LysAla: 2.777 ± 0.0
1.388LysCys: 1.388 ± 0.0
4.165LysAsp: 4.165 ± 0.0
3.124LysGlu: 3.124 ± 0.0
3.471LysPhe: 3.471 ± 0.0
3.124LysGly: 3.124 ± 0.0
1.388LysHis: 1.388 ± 0.0
1.388LysIle: 1.388 ± 0.0
4.165LysLys: 4.165 ± 0.0
5.901LysLeu: 5.901 ± 0.0
1.388LysMet: 1.388 ± 0.0
3.818LysAsn: 3.818 ± 0.0
2.777LysPro: 2.777 ± 0.0
2.083LysGln: 2.083 ± 0.0
3.818LysArg: 3.818 ± 0.0
5.554LysSer: 5.554 ± 0.0
2.777LysThr: 2.777 ± 0.0
4.165LysVal: 4.165 ± 0.0
1.736LysTrp: 1.736 ± 0.0
2.083LysTyr: 2.083 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.942LeuAla: 6.942 ± 0.0
1.388LeuCys: 1.388 ± 0.0
6.595LeuAsp: 6.595 ± 0.0
5.207LeuGlu: 5.207 ± 0.0
2.777LeuPhe: 2.777 ± 0.0
4.859LeuGly: 4.859 ± 0.0
2.083LeuHis: 2.083 ± 0.0
5.207LeuIle: 5.207 ± 0.0
5.901LeuLys: 5.901 ± 0.0
6.248LeuLeu: 6.248 ± 0.0
1.736LeuMet: 1.736 ± 0.0
5.207LeuAsn: 5.207 ± 0.0
5.554LeuPro: 5.554 ± 0.0
2.777LeuGln: 2.777 ± 0.0
5.207LeuArg: 5.207 ± 0.0
2.43LeuSer: 2.43 ± 0.0
4.512LeuThr: 4.512 ± 0.0
6.248LeuVal: 6.248 ± 0.0
0.694LeuTrp: 0.694 ± 0.0
1.736LeuTyr: 1.736 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
4.165MetAla: 4.165 ± 0.0
1.041MetCys: 1.041 ± 0.0
1.736MetAsp: 1.736 ± 0.0
0.694MetGlu: 0.694 ± 0.0
0.694MetPhe: 0.694 ± 0.0
0.694MetGly: 0.694 ± 0.0
0.347MetHis: 0.347 ± 0.0
1.041MetIle: 1.041 ± 0.0
1.041MetLys: 1.041 ± 0.0
2.083MetLeu: 2.083 ± 0.0
0.694MetMet: 0.694 ± 0.0
0.347MetAsn: 0.347 ± 0.0
2.777MetPro: 2.777 ± 0.0
0.694MetGln: 0.694 ± 0.0
2.777MetArg: 2.777 ± 0.0
1.041MetSer: 1.041 ± 0.0
0.694MetThr: 0.694 ± 0.0
0.694MetVal: 0.694 ± 0.0
0.347MetTrp: 0.347 ± 0.0
1.041MetTyr: 1.041 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.818AsnAla: 3.818 ± 0.0
0.347AsnCys: 0.347 ± 0.0
2.083AsnAsp: 2.083 ± 0.0
2.43AsnGlu: 2.43 ± 0.0
3.124AsnPhe: 3.124 ± 0.0
2.43AsnGly: 2.43 ± 0.0
0.347AsnHis: 0.347 ± 0.0
3.818AsnIle: 3.818 ± 0.0
1.736AsnLys: 1.736 ± 0.0
2.777AsnLeu: 2.777 ± 0.0
1.041AsnMet: 1.041 ± 0.0
2.43AsnAsn: 2.43 ± 0.0
3.124AsnPro: 3.124 ± 0.0
0.347AsnGln: 0.347 ± 0.0
3.124AsnArg: 3.124 ± 0.0
1.388AsnSer: 1.388 ± 0.0
3.124AsnThr: 3.124 ± 0.0
4.512AsnVal: 4.512 ± 0.0
0.347AsnTrp: 0.347 ± 0.0
1.388AsnTyr: 1.388 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.777ProAla: 2.777 ± 0.0
0.347ProCys: 0.347 ± 0.0
2.43ProAsp: 2.43 ± 0.0
3.124ProGlu: 3.124 ± 0.0
3.124ProPhe: 3.124 ± 0.0
2.43ProGly: 2.43 ± 0.0
1.388ProHis: 1.388 ± 0.0
5.207ProIle: 5.207 ± 0.0
3.124ProLys: 3.124 ± 0.0
3.124ProLeu: 3.124 ± 0.0
1.388ProMet: 1.388 ± 0.0
2.083ProAsn: 2.083 ± 0.0
5.554ProPro: 5.554 ± 0.0
1.388ProGln: 1.388 ± 0.0
4.165ProArg: 4.165 ± 0.0
4.859ProSer: 4.859 ± 0.0
3.471ProThr: 3.471 ± 0.0
5.554ProVal: 5.554 ± 0.0
1.041ProTrp: 1.041 ± 0.0
2.083ProTyr: 2.083 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.43GlnAla: 2.43 ± 0.0
0.694GlnCys: 0.694 ± 0.0
2.083GlnAsp: 2.083 ± 0.0
1.041GlnGlu: 1.041 ± 0.0
2.083GlnPhe: 2.083 ± 0.0
3.818GlnGly: 3.818 ± 0.0
1.041GlnHis: 1.041 ± 0.0
3.124GlnIle: 3.124 ± 0.0
1.388GlnLys: 1.388 ± 0.0
2.43GlnLeu: 2.43 ± 0.0
1.041GlnMet: 1.041 ± 0.0
0.694GlnAsn: 0.694 ± 0.0
2.777GlnPro: 2.777 ± 0.0
0.694GlnGln: 0.694 ± 0.0
1.041GlnArg: 1.041 ± 0.0
0.347GlnSer: 0.347 ± 0.0
1.736GlnThr: 1.736 ± 0.0
2.777GlnVal: 2.777 ± 0.0
0.694GlnTrp: 0.694 ± 0.0
0.347GlnTyr: 0.347 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.512ArgAla: 4.512 ± 0.0
0.347ArgCys: 0.347 ± 0.0
3.818ArgAsp: 3.818 ± 0.0
3.471ArgGlu: 3.471 ± 0.0
2.083ArgPhe: 2.083 ± 0.0
3.471ArgGly: 3.471 ± 0.0
1.041ArgHis: 1.041 ± 0.0
3.124ArgIle: 3.124 ± 0.0
4.859ArgLys: 4.859 ± 0.0
4.859ArgLeu: 4.859 ± 0.0
0.694ArgMet: 0.694 ± 0.0
2.083ArgAsn: 2.083 ± 0.0
1.041ArgPro: 1.041 ± 0.0
2.43ArgGln: 2.43 ± 0.0
4.165ArgArg: 4.165 ± 0.0
2.777ArgSer: 2.777 ± 0.0
4.165ArgThr: 4.165 ± 0.0
3.818ArgVal: 3.818 ± 0.0
1.736ArgTrp: 1.736 ± 0.0
3.124ArgTyr: 3.124 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.859SerAla: 4.859 ± 0.0
0.347SerCys: 0.347 ± 0.0
4.165SerAsp: 4.165 ± 0.0
3.471SerGlu: 3.471 ± 0.0
2.083SerPhe: 2.083 ± 0.0
3.124SerGly: 3.124 ± 0.0
0.347SerHis: 0.347 ± 0.0
2.777SerIle: 2.777 ± 0.0
4.165SerLys: 4.165 ± 0.0
4.859SerLeu: 4.859 ± 0.0
2.43SerMet: 2.43 ± 0.0
2.777SerAsn: 2.777 ± 0.0
4.859SerPro: 4.859 ± 0.0
2.43SerGln: 2.43 ± 0.0
2.083SerArg: 2.083 ± 0.0
5.554SerSer: 5.554 ± 0.0
5.554SerThr: 5.554 ± 0.0
5.554SerVal: 5.554 ± 0.0
1.388SerTrp: 1.388 ± 0.0
2.43SerTyr: 2.43 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.124ThrAla: 3.124 ± 0.0
0.347ThrCys: 0.347 ± 0.0
2.43ThrAsp: 2.43 ± 0.0
1.041ThrGlu: 1.041 ± 0.0
3.471ThrPhe: 3.471 ± 0.0
4.165ThrGly: 4.165 ± 0.0
1.041ThrHis: 1.041 ± 0.0
3.124ThrIle: 3.124 ± 0.0
2.777ThrLys: 2.777 ± 0.0
4.859ThrLeu: 4.859 ± 0.0
1.041ThrMet: 1.041 ± 0.0
2.43ThrAsn: 2.43 ± 0.0
3.124ThrPro: 3.124 ± 0.0
1.736ThrGln: 1.736 ± 0.0
2.43ThrArg: 2.43 ± 0.0
5.901ThrSer: 5.901 ± 0.0
5.207ThrThr: 5.207 ± 0.0
4.165ThrVal: 4.165 ± 0.0
1.736ThrTrp: 1.736 ± 0.0
1.736ThrTyr: 1.736 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.777ValAla: 2.777 ± 0.0
0.347ValCys: 0.347 ± 0.0
5.207ValAsp: 5.207 ± 0.0
4.512ValGlu: 4.512 ± 0.0
5.207ValPhe: 5.207 ± 0.0
4.512ValGly: 4.512 ± 0.0
1.388ValHis: 1.388 ± 0.0
3.471ValIle: 3.471 ± 0.0
4.512ValLys: 4.512 ± 0.0
6.595ValLeu: 6.595 ± 0.0
2.777ValMet: 2.777 ± 0.0
3.471ValAsn: 3.471 ± 0.0
4.859ValPro: 4.859 ± 0.0
0.694ValGln: 0.694 ± 0.0
4.165ValArg: 4.165 ± 0.0
6.942ValSer: 6.942 ± 0.0
4.859ValThr: 4.859 ± 0.0
6.248ValVal: 6.248 ± 0.0
0.347ValTrp: 0.347 ± 0.0
4.859ValTyr: 4.859 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.694TrpAla: 0.694 ± 0.0
0.347TrpCys: 0.347 ± 0.0
0.694TrpAsp: 0.694 ± 0.0
0.694TrpGlu: 0.694 ± 0.0
0.694TrpPhe: 0.694 ± 0.0
1.041TrpGly: 1.041 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.694TrpIle: 0.694 ± 0.0
1.041TrpLys: 1.041 ± 0.0
2.083TrpLeu: 2.083 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.041TrpAsn: 1.041 ± 0.0
0.694TrpPro: 0.694 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.43TrpArg: 2.43 ± 0.0
0.347TrpSer: 0.347 ± 0.0
0.694TrpThr: 0.694 ± 0.0
0.694TrpVal: 0.694 ± 0.0
0.347TrpTrp: 0.347 ± 0.0
2.083TrpTyr: 2.083 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.43TyrAla: 2.43 ± 0.0
0.694TyrCys: 0.694 ± 0.0
0.694TyrAsp: 0.694 ± 0.0
2.083TyrGlu: 2.083 ± 0.0
2.777TyrPhe: 2.777 ± 0.0
2.083TyrGly: 2.083 ± 0.0
1.041TyrHis: 1.041 ± 0.0
3.124TyrIle: 3.124 ± 0.0
1.388TyrLys: 1.388 ± 0.0
4.859TyrLeu: 4.859 ± 0.0
0.694TyrMet: 0.694 ± 0.0
2.083TyrAsn: 2.083 ± 0.0
1.388TyrPro: 1.388 ± 0.0
4.165TyrGln: 4.165 ± 0.0
3.471TyrArg: 3.471 ± 0.0
1.736TyrSer: 1.736 ± 0.0
2.777TyrThr: 2.777 ± 0.0
0.694TyrVal: 0.694 ± 0.0
0.347TyrTrp: 0.347 ± 0.0
2.777TyrTyr: 2.777 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2882 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski