Amino acid dipepetide frequency for Diaphorina citri flavi-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.358AlaAla: 5.358 ± 0.0
1.451AlaCys: 1.451 ± 0.0
3.125AlaAsp: 3.125 ± 0.0
4.576AlaGlu: 4.576 ± 0.0
2.344AlaPhe: 2.344 ± 0.0
5.804AlaGly: 5.804 ± 0.0
1.005AlaHis: 1.005 ± 0.0
3.683AlaIle: 3.683 ± 0.0
5.135AlaLys: 5.135 ± 0.0
7.702AlaLeu: 7.702 ± 0.0
2.679AlaMet: 2.679 ± 0.0
2.902AlaAsn: 2.902 ± 0.0
3.014AlaPro: 3.014 ± 0.0
1.674AlaGln: 1.674 ± 0.0
5.358AlaArg: 5.358 ± 0.0
6.251AlaSer: 6.251 ± 0.0
3.795AlaThr: 3.795 ± 0.0
4.465AlaVal: 4.465 ± 0.0
1.339AlaTrp: 1.339 ± 0.0
2.79AlaTyr: 2.79 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.67CysAla: 0.67 ± 0.0
0.446CysCys: 0.446 ± 0.0
1.116CysAsp: 1.116 ± 0.0
0.781CysGlu: 0.781 ± 0.0
0.781CysPhe: 0.781 ± 0.0
1.898CysGly: 1.898 ± 0.0
0.781CysHis: 0.781 ± 0.0
1.228CysIle: 1.228 ± 0.0
0.893CysLys: 0.893 ± 0.0
1.786CysLeu: 1.786 ± 0.0
0.893CysMet: 0.893 ± 0.0
0.223CysAsn: 0.223 ± 0.0
1.005CysPro: 1.005 ± 0.0
0.558CysGln: 0.558 ± 0.0
1.228CysArg: 1.228 ± 0.0
1.563CysSer: 1.563 ± 0.0
1.005CysThr: 1.005 ± 0.0
1.563CysVal: 1.563 ± 0.0
0.223CysTrp: 0.223 ± 0.0
0.781CysTyr: 0.781 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.349AspAla: 3.349 ± 0.0
1.339AspCys: 1.339 ± 0.0
3.125AspAsp: 3.125 ± 0.0
4.8AspGlu: 4.8 ± 0.0
2.121AspPhe: 2.121 ± 0.0
4.13AspGly: 4.13 ± 0.0
2.009AspHis: 2.009 ± 0.0
4.688AspIle: 4.688 ± 0.0
5.246AspLys: 5.246 ± 0.0
4.576AspLeu: 4.576 ± 0.0
1.563AspMet: 1.563 ± 0.0
2.79AspAsn: 2.79 ± 0.0
2.232AspPro: 2.232 ± 0.0
1.228AspGln: 1.228 ± 0.0
2.902AspArg: 2.902 ± 0.0
2.79AspSer: 2.79 ± 0.0
2.456AspThr: 2.456 ± 0.0
4.688AspVal: 4.688 ± 0.0
1.451AspTrp: 1.451 ± 0.0
2.679AspTyr: 2.679 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.023GluAla: 5.023 ± 0.0
1.786GluCys: 1.786 ± 0.0
4.353GluAsp: 4.353 ± 0.0
4.465GluGlu: 4.465 ± 0.0
2.232GluPhe: 2.232 ± 0.0
2.344GluGly: 2.344 ± 0.0
1.451GluHis: 1.451 ± 0.0
4.688GluIle: 4.688 ± 0.0
5.693GluLys: 5.693 ± 0.0
5.023GluLeu: 5.023 ± 0.0
1.005GluMet: 1.005 ± 0.0
3.349GluAsn: 3.349 ± 0.0
1.116GluPro: 1.116 ± 0.0
1.563GluGln: 1.563 ± 0.0
3.572GluArg: 3.572 ± 0.0
3.237GluSer: 3.237 ± 0.0
2.902GluThr: 2.902 ± 0.0
3.572GluVal: 3.572 ± 0.0
1.786GluTrp: 1.786 ± 0.0
2.567GluTyr: 2.567 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.679PheAla: 2.679 ± 0.0
0.558PheCys: 0.558 ± 0.0
2.232PheAsp: 2.232 ± 0.0
1.898PheGlu: 1.898 ± 0.0
0.781PhePhe: 0.781 ± 0.0
2.456PheGly: 2.456 ± 0.0
0.781PheHis: 0.781 ± 0.0
1.786PheIle: 1.786 ± 0.0
1.563PheLys: 1.563 ± 0.0
2.567PheLeu: 2.567 ± 0.0
0.446PheMet: 0.446 ± 0.0
1.228PheAsn: 1.228 ± 0.0
0.893PhePro: 0.893 ± 0.0
0.893PheGln: 0.893 ± 0.0
1.116PheArg: 1.116 ± 0.0
2.344PheSer: 2.344 ± 0.0
1.563PheThr: 1.563 ± 0.0
2.121PheVal: 2.121 ± 0.0
0.335PheTrp: 0.335 ± 0.0
1.674PheTyr: 1.674 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.683GlyAla: 3.683 ± 0.0
1.339GlyCys: 1.339 ± 0.0
4.8GlyAsp: 4.8 ± 0.0
3.349GlyGlu: 3.349 ± 0.0
2.344GlyPhe: 2.344 ± 0.0
3.46GlyGly: 3.46 ± 0.0
2.344GlyHis: 2.344 ± 0.0
4.8GlyIle: 4.8 ± 0.0
5.916GlyLys: 5.916 ± 0.0
4.018GlyLeu: 4.018 ± 0.0
1.563GlyMet: 1.563 ± 0.0
3.572GlyAsn: 3.572 ± 0.0
2.344GlyPro: 2.344 ± 0.0
0.781GlyGln: 0.781 ± 0.0
3.349GlyArg: 3.349 ± 0.0
5.469GlySer: 5.469 ± 0.0
3.125GlyThr: 3.125 ± 0.0
4.018GlyVal: 4.018 ± 0.0
1.228GlyTrp: 1.228 ± 0.0
2.456GlyTyr: 2.456 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.674HisAla: 1.674 ± 0.0
0.446HisCys: 0.446 ± 0.0
0.67HisAsp: 0.67 ± 0.0
1.674HisGlu: 1.674 ± 0.0
0.67HisPhe: 0.67 ± 0.0
1.563HisGly: 1.563 ± 0.0
0.781HisHis: 0.781 ± 0.0
2.009HisIle: 2.009 ± 0.0
1.339HisLys: 1.339 ± 0.0
2.121HisLeu: 2.121 ± 0.0
1.005HisMet: 1.005 ± 0.0
1.563HisAsn: 1.563 ± 0.0
1.005HisPro: 1.005 ± 0.0
0.446HisGln: 0.446 ± 0.0
0.893HisArg: 0.893 ± 0.0
2.232HisSer: 2.232 ± 0.0
1.339HisThr: 1.339 ± 0.0
1.898HisVal: 1.898 ± 0.0
0.558HisTrp: 0.558 ± 0.0
0.781HisTyr: 0.781 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.469IleAla: 5.469 ± 0.0
1.228IleCys: 1.228 ± 0.0
4.911IleAsp: 4.911 ± 0.0
4.688IleGlu: 4.688 ± 0.0
2.232IlePhe: 2.232 ± 0.0
4.8IleGly: 4.8 ± 0.0
1.116IleHis: 1.116 ± 0.0
2.679IleIle: 2.679 ± 0.0
3.572IleLys: 3.572 ± 0.0
4.242IleLeu: 4.242 ± 0.0
1.898IleMet: 1.898 ± 0.0
2.232IleAsn: 2.232 ± 0.0
2.679IlePro: 2.679 ± 0.0
1.898IleGln: 1.898 ± 0.0
2.902IleArg: 2.902 ± 0.0
3.795IleSer: 3.795 ± 0.0
3.572IleThr: 3.572 ± 0.0
3.795IleVal: 3.795 ± 0.0
1.005IleTrp: 1.005 ± 0.0
2.344IleTyr: 2.344 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
7.813LysAla: 7.813 ± 0.0
1.228LysCys: 1.228 ± 0.0
4.242LysAsp: 4.242 ± 0.0
4.576LysGlu: 4.576 ± 0.0
1.674LysPhe: 1.674 ± 0.0
3.237LysGly: 3.237 ± 0.0
1.116LysHis: 1.116 ± 0.0
6.139LysIle: 6.139 ± 0.0
6.027LysLys: 6.027 ± 0.0
4.465LysLeu: 4.465 ± 0.0
2.121LysMet: 2.121 ± 0.0
2.79LysAsn: 2.79 ± 0.0
3.014LysPro: 3.014 ± 0.0
2.009LysGln: 2.009 ± 0.0
5.469LysArg: 5.469 ± 0.0
3.014LysSer: 3.014 ± 0.0
5.581LysThr: 5.581 ± 0.0
4.8LysVal: 4.8 ± 0.0
1.451LysTrp: 1.451 ± 0.0
2.344LysTyr: 2.344 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.474LeuAla: 6.474 ± 0.0
1.116LeuCys: 1.116 ± 0.0
6.474LeuAsp: 6.474 ± 0.0
4.911LeuGlu: 4.911 ± 0.0
2.79LeuPhe: 2.79 ± 0.0
5.358LeuGly: 5.358 ± 0.0
1.563LeuHis: 1.563 ± 0.0
3.349LeuIle: 3.349 ± 0.0
5.246LeuLys: 5.246 ± 0.0
5.246LeuLeu: 5.246 ± 0.0
1.339LeuMet: 1.339 ± 0.0
3.907LeuAsn: 3.907 ± 0.0
4.353LeuPro: 4.353 ± 0.0
1.898LeuGln: 1.898 ± 0.0
4.13LeuArg: 4.13 ± 0.0
6.362LeuSer: 6.362 ± 0.0
4.353LeuThr: 4.353 ± 0.0
5.023LeuVal: 5.023 ± 0.0
0.893LeuTrp: 0.893 ± 0.0
1.451LeuTyr: 1.451 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.344MetAla: 2.344 ± 0.0
0.446MetCys: 0.446 ± 0.0
1.786MetAsp: 1.786 ± 0.0
0.893MetGlu: 0.893 ± 0.0
0.893MetPhe: 0.893 ± 0.0
1.339MetGly: 1.339 ± 0.0
0.558MetHis: 0.558 ± 0.0
1.339MetIle: 1.339 ± 0.0
1.898MetLys: 1.898 ± 0.0
3.125MetLeu: 3.125 ± 0.0
0.893MetMet: 0.893 ± 0.0
1.005MetAsn: 1.005 ± 0.0
1.563MetPro: 1.563 ± 0.0
0.893MetGln: 0.893 ± 0.0
1.451MetArg: 1.451 ± 0.0
1.786MetSer: 1.786 ± 0.0
2.344MetThr: 2.344 ± 0.0
2.344MetVal: 2.344 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.451MetTyr: 1.451 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.349AsnAla: 3.349 ± 0.0
0.446AsnCys: 0.446 ± 0.0
2.232AsnAsp: 2.232 ± 0.0
2.009AsnGlu: 2.009 ± 0.0
0.781AsnPhe: 0.781 ± 0.0
2.902AsnGly: 2.902 ± 0.0
1.228AsnHis: 1.228 ± 0.0
2.121AsnIle: 2.121 ± 0.0
3.349AsnLys: 3.349 ± 0.0
3.349AsnLeu: 3.349 ± 0.0
1.563AsnMet: 1.563 ± 0.0
2.009AsnAsn: 2.009 ± 0.0
2.009AsnPro: 2.009 ± 0.0
1.674AsnGln: 1.674 ± 0.0
2.009AsnArg: 2.009 ± 0.0
3.014AsnSer: 3.014 ± 0.0
3.014AsnThr: 3.014 ± 0.0
3.572AsnVal: 3.572 ± 0.0
0.558AsnTrp: 0.558 ± 0.0
2.009AsnTyr: 2.009 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.014ProAla: 3.014 ± 0.0
0.223ProCys: 0.223 ± 0.0
2.456ProAsp: 2.456 ± 0.0
2.79ProGlu: 2.79 ± 0.0
0.0ProPhe: 0.0 ± 0.0
2.009ProGly: 2.009 ± 0.0
1.339ProHis: 1.339 ± 0.0
2.79ProIle: 2.79 ± 0.0
3.795ProLys: 3.795 ± 0.0
2.567ProLeu: 2.567 ± 0.0
1.563ProMet: 1.563 ± 0.0
1.786ProAsn: 1.786 ± 0.0
2.79ProPro: 2.79 ± 0.0
0.67ProGln: 0.67 ± 0.0
2.121ProArg: 2.121 ± 0.0
2.567ProSer: 2.567 ± 0.0
3.349ProThr: 3.349 ± 0.0
4.13ProVal: 4.13 ± 0.0
0.223ProTrp: 0.223 ± 0.0
1.116ProTyr: 1.116 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.572GlnAla: 3.572 ± 0.0
0.446GlnCys: 0.446 ± 0.0
1.228GlnAsp: 1.228 ± 0.0
1.228GlnGlu: 1.228 ± 0.0
0.893GlnPhe: 0.893 ± 0.0
1.451GlnGly: 1.451 ± 0.0
0.67GlnHis: 0.67 ± 0.0
1.339GlnIle: 1.339 ± 0.0
1.898GlnLys: 1.898 ± 0.0
2.121GlnLeu: 2.121 ± 0.0
0.446GlnMet: 0.446 ± 0.0
1.563GlnAsn: 1.563 ± 0.0
0.781GlnPro: 0.781 ± 0.0
0.781GlnGln: 0.781 ± 0.0
1.898GlnArg: 1.898 ± 0.0
2.456GlnSer: 2.456 ± 0.0
1.451GlnThr: 1.451 ± 0.0
1.005GlnVal: 1.005 ± 0.0
0.781GlnTrp: 0.781 ± 0.0
1.786GlnTyr: 1.786 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.456ArgAla: 2.456 ± 0.0
0.893ArgCys: 0.893 ± 0.0
2.79ArgAsp: 2.79 ± 0.0
3.46ArgGlu: 3.46 ± 0.0
2.344ArgPhe: 2.344 ± 0.0
3.683ArgGly: 3.683 ± 0.0
0.893ArgHis: 0.893 ± 0.0
3.683ArgIle: 3.683 ± 0.0
3.572ArgLys: 3.572 ± 0.0
4.465ArgLeu: 4.465 ± 0.0
2.567ArgMet: 2.567 ± 0.0
2.121ArgAsn: 2.121 ± 0.0
1.898ArgPro: 1.898 ± 0.0
2.567ArgGln: 2.567 ± 0.0
3.683ArgArg: 3.683 ± 0.0
3.795ArgSer: 3.795 ± 0.0
3.125ArgThr: 3.125 ± 0.0
4.465ArgVal: 4.465 ± 0.0
0.558ArgTrp: 0.558 ± 0.0
2.121ArgTyr: 2.121 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.8SerAla: 4.8 ± 0.0
1.563SerCys: 1.563 ± 0.0
3.125SerAsp: 3.125 ± 0.0
5.023SerGlu: 5.023 ± 0.0
2.344SerPhe: 2.344 ± 0.0
5.469SerGly: 5.469 ± 0.0
2.344SerHis: 2.344 ± 0.0
3.795SerIle: 3.795 ± 0.0
5.023SerLys: 5.023 ± 0.0
5.469SerLeu: 5.469 ± 0.0
2.121SerMet: 2.121 ± 0.0
3.014SerAsn: 3.014 ± 0.0
2.344SerPro: 2.344 ± 0.0
2.679SerGln: 2.679 ± 0.0
3.683SerArg: 3.683 ± 0.0
7.813SerSer: 7.813 ± 0.0
4.8SerThr: 4.8 ± 0.0
5.693SerVal: 5.693 ± 0.0
1.228SerTrp: 1.228 ± 0.0
1.116SerTyr: 1.116 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.688ThrAla: 4.688 ± 0.0
0.893ThrCys: 0.893 ± 0.0
4.13ThrAsp: 4.13 ± 0.0
3.46ThrGlu: 3.46 ± 0.0
1.786ThrPhe: 1.786 ± 0.0
3.683ThrGly: 3.683 ± 0.0
1.563ThrHis: 1.563 ± 0.0
4.353ThrIle: 4.353 ± 0.0
3.683ThrLys: 3.683 ± 0.0
4.13ThrLeu: 4.13 ± 0.0
1.005ThrMet: 1.005 ± 0.0
1.898ThrAsn: 1.898 ± 0.0
3.349ThrPro: 3.349 ± 0.0
1.786ThrGln: 1.786 ± 0.0
3.125ThrArg: 3.125 ± 0.0
5.358ThrSer: 5.358 ± 0.0
5.358ThrThr: 5.358 ± 0.0
4.353ThrVal: 4.353 ± 0.0
0.446ThrTrp: 0.446 ± 0.0
2.679ThrTyr: 2.679 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.023ValAla: 5.023 ± 0.0
2.121ValCys: 2.121 ± 0.0
4.465ValAsp: 4.465 ± 0.0
4.242ValGlu: 4.242 ± 0.0
1.674ValPhe: 1.674 ± 0.0
4.911ValGly: 4.911 ± 0.0
1.786ValHis: 1.786 ± 0.0
3.572ValIle: 3.572 ± 0.0
5.135ValLys: 5.135 ± 0.0
4.13ValLeu: 4.13 ± 0.0
1.898ValMet: 1.898 ± 0.0
2.902ValAsn: 2.902 ± 0.0
3.683ValPro: 3.683 ± 0.0
2.456ValGln: 2.456 ± 0.0
3.014ValArg: 3.014 ± 0.0
5.916ValSer: 5.916 ± 0.0
5.246ValThr: 5.246 ± 0.0
5.135ValVal: 5.135 ± 0.0
1.116ValTrp: 1.116 ± 0.0
2.344ValTyr: 2.344 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.67TrpAla: 0.67 ± 0.0
0.558TrpCys: 0.558 ± 0.0
0.893TrpAsp: 0.893 ± 0.0
1.005TrpGlu: 1.005 ± 0.0
0.446TrpPhe: 0.446 ± 0.0
0.781TrpGly: 0.781 ± 0.0
0.335TrpHis: 0.335 ± 0.0
1.228TrpIle: 1.228 ± 0.0
1.228TrpLys: 1.228 ± 0.0
1.898TrpLeu: 1.898 ± 0.0
0.335TrpMet: 0.335 ± 0.0
1.005TrpAsn: 1.005 ± 0.0
0.223TrpPro: 0.223 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.116TrpArg: 1.116 ± 0.0
0.893TrpSer: 0.893 ± 0.0
1.116TrpThr: 1.116 ± 0.0
1.339TrpVal: 1.339 ± 0.0
0.223TrpTrp: 0.223 ± 0.0
1.116TrpTyr: 1.116 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.344TyrAla: 2.344 ± 0.0
0.893TyrCys: 0.893 ± 0.0
2.009TyrAsp: 2.009 ± 0.0
1.898TyrGlu: 1.898 ± 0.0
0.781TyrPhe: 0.781 ± 0.0
2.679TyrGly: 2.679 ± 0.0
1.005TyrHis: 1.005 ± 0.0
1.786TyrIle: 1.786 ± 0.0
2.902TyrLys: 2.902 ± 0.0
3.349TyrLeu: 3.349 ± 0.0
1.339TyrMet: 1.339 ± 0.0
1.339TyrAsn: 1.339 ± 0.0
1.005TyrPro: 1.005 ± 0.0
1.339TyrGln: 1.339 ± 0.0
2.121TyrArg: 2.121 ± 0.0
2.79TyrSer: 2.79 ± 0.0
2.232TyrThr: 2.232 ± 0.0
2.79TyrVal: 2.79 ± 0.0
1.005TyrTrp: 1.005 ± 0.0
1.563TyrTyr: 1.563 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (8960 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski