Amino acid dipepetide frequency for Diabrotica virgifera virgifera virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.366AlaAla: 4.366 ± 0.0
1.746AlaCys: 1.746 ± 0.0
3.492AlaAsp: 3.492 ± 0.0
2.619AlaGlu: 2.619 ± 0.0
4.657AlaPhe: 4.657 ± 0.0
2.619AlaGly: 2.619 ± 0.0
2.037AlaHis: 2.037 ± 0.0
5.53AlaIle: 5.53 ± 0.0
4.657AlaLys: 4.657 ± 0.0
6.694AlaLeu: 6.694 ± 0.0
3.201AlaMet: 3.201 ± 0.0
3.492AlaAsn: 3.492 ± 0.0
3.492AlaPro: 3.492 ± 0.0
1.746AlaGln: 1.746 ± 0.0
1.746AlaArg: 1.746 ± 0.0
4.948AlaSer: 4.948 ± 0.0
4.075AlaThr: 4.075 ± 0.0
3.783AlaVal: 3.783 ± 0.0
0.291AlaTrp: 0.291 ± 0.0
2.037AlaTyr: 2.037 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.291CysAla: 0.291 ± 0.0
0.582CysCys: 0.582 ± 0.0
0.873CysAsp: 0.873 ± 0.0
0.873CysGlu: 0.873 ± 0.0
0.582CysPhe: 0.582 ± 0.0
2.037CysGly: 2.037 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.873CysIle: 0.873 ± 0.0
0.873CysLys: 0.873 ± 0.0
1.455CysLeu: 1.455 ± 0.0
0.291CysMet: 0.291 ± 0.0
1.164CysAsn: 1.164 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.582CysArg: 0.582 ± 0.0
1.164CysSer: 1.164 ± 0.0
0.291CysThr: 0.291 ± 0.0
1.746CysVal: 1.746 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.582CysTyr: 0.582 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.492AspAla: 3.492 ± 0.0
0.582AspCys: 0.582 ± 0.0
3.201AspAsp: 3.201 ± 0.0
4.366AspGlu: 4.366 ± 0.0
2.91AspPhe: 2.91 ± 0.0
2.619AspGly: 2.619 ± 0.0
0.873AspHis: 0.873 ± 0.0
3.783AspIle: 3.783 ± 0.0
5.53AspLys: 5.53 ± 0.0
5.239AspLeu: 5.239 ± 0.0
2.328AspMet: 2.328 ± 0.0
2.619AspAsn: 2.619 ± 0.0
2.619AspPro: 2.619 ± 0.0
2.328AspGln: 2.328 ± 0.0
1.455AspArg: 1.455 ± 0.0
2.328AspSer: 2.328 ± 0.0
3.201AspThr: 3.201 ± 0.0
1.746AspVal: 1.746 ± 0.0
0.582AspTrp: 0.582 ± 0.0
2.328AspTyr: 2.328 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.783GluAla: 3.783 ± 0.0
0.873GluCys: 0.873 ± 0.0
2.037GluAsp: 2.037 ± 0.0
4.075GluGlu: 4.075 ± 0.0
3.201GluPhe: 3.201 ± 0.0
2.91GluGly: 2.91 ± 0.0
1.164GluHis: 1.164 ± 0.0
5.239GluIle: 5.239 ± 0.0
4.366GluLys: 4.366 ± 0.0
5.821GluLeu: 5.821 ± 0.0
2.037GluMet: 2.037 ± 0.0
3.201GluAsn: 3.201 ± 0.0
3.201GluPro: 3.201 ± 0.0
2.619GluGln: 2.619 ± 0.0
3.201GluArg: 3.201 ± 0.0
3.783GluSer: 3.783 ± 0.0
3.492GluThr: 3.492 ± 0.0
2.328GluVal: 2.328 ± 0.0
0.582GluTrp: 0.582 ± 0.0
2.037GluTyr: 2.037 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.366PheAla: 4.366 ± 0.0
0.291PheCys: 0.291 ± 0.0
2.619PheAsp: 2.619 ± 0.0
2.619PheGlu: 2.619 ± 0.0
0.873PhePhe: 0.873 ± 0.0
2.619PheGly: 2.619 ± 0.0
1.164PheHis: 1.164 ± 0.0
2.328PheIle: 2.328 ± 0.0
4.366PheLys: 4.366 ± 0.0
2.91PheLeu: 2.91 ± 0.0
0.582PheMet: 0.582 ± 0.0
3.783PheAsn: 3.783 ± 0.0
1.164PhePro: 1.164 ± 0.0
3.783PheGln: 3.783 ± 0.0
3.783PheArg: 3.783 ± 0.0
4.657PheSer: 4.657 ± 0.0
2.91PheThr: 2.91 ± 0.0
4.075PheVal: 4.075 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.037PheTyr: 2.037 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.783GlyAla: 3.783 ± 0.0
1.164GlyCys: 1.164 ± 0.0
4.366GlyAsp: 4.366 ± 0.0
3.201GlyGlu: 3.201 ± 0.0
2.037GlyPhe: 2.037 ± 0.0
2.619GlyGly: 2.619 ± 0.0
1.455GlyHis: 1.455 ± 0.0
3.201GlyIle: 3.201 ± 0.0
4.075GlyLys: 4.075 ± 0.0
4.075GlyLeu: 4.075 ± 0.0
1.746GlyMet: 1.746 ± 0.0
3.492GlyAsn: 3.492 ± 0.0
1.164GlyPro: 1.164 ± 0.0
2.037GlyGln: 2.037 ± 0.0
1.746GlyArg: 1.746 ± 0.0
3.492GlySer: 3.492 ± 0.0
4.657GlyThr: 4.657 ± 0.0
3.783GlyVal: 3.783 ± 0.0
0.582GlyTrp: 0.582 ± 0.0
1.455GlyTyr: 1.455 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.455HisAla: 1.455 ± 0.0
0.582HisCys: 0.582 ± 0.0
1.455HisAsp: 1.455 ± 0.0
0.873HisGlu: 0.873 ± 0.0
1.455HisPhe: 1.455 ± 0.0
1.746HisGly: 1.746 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.746HisIle: 1.746 ± 0.0
1.746HisLys: 1.746 ± 0.0
0.582HisLeu: 0.582 ± 0.0
0.291HisMet: 0.291 ± 0.0
1.746HisAsn: 1.746 ± 0.0
1.455HisPro: 1.455 ± 0.0
0.873HisGln: 0.873 ± 0.0
0.873HisArg: 0.873 ± 0.0
0.582HisSer: 0.582 ± 0.0
0.0HisThr: 0.0 ± 0.0
2.037HisVal: 2.037 ± 0.0
0.291HisTrp: 0.291 ± 0.0
1.455HisTyr: 1.455 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.112IleAla: 6.112 ± 0.0
0.582IleCys: 0.582 ± 0.0
3.783IleAsp: 3.783 ± 0.0
2.619IleGlu: 2.619 ± 0.0
3.201IlePhe: 3.201 ± 0.0
2.619IleGly: 2.619 ± 0.0
1.164IleHis: 1.164 ± 0.0
4.075IleIle: 4.075 ± 0.0
3.783IleLys: 3.783 ± 0.0
5.53IleLeu: 5.53 ± 0.0
1.746IleMet: 1.746 ± 0.0
3.492IleAsn: 3.492 ± 0.0
1.746IlePro: 1.746 ± 0.0
1.455IleGln: 1.455 ± 0.0
2.328IleArg: 2.328 ± 0.0
4.075IleSer: 4.075 ± 0.0
6.403IleThr: 6.403 ± 0.0
4.657IleVal: 4.657 ± 0.0
0.582IleTrp: 0.582 ± 0.0
3.492IleTyr: 3.492 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.783LysAla: 3.783 ± 0.0
0.873LysCys: 0.873 ± 0.0
3.492LysAsp: 3.492 ± 0.0
5.239LysGlu: 5.239 ± 0.0
3.783LysPhe: 3.783 ± 0.0
2.619LysGly: 2.619 ± 0.0
3.201LysHis: 3.201 ± 0.0
6.112LysIle: 6.112 ± 0.0
7.858LysLys: 7.858 ± 0.0
6.694LysLeu: 6.694 ± 0.0
2.037LysMet: 2.037 ± 0.0
4.075LysAsn: 4.075 ± 0.0
4.075LysPro: 4.075 ± 0.0
3.783LysGln: 3.783 ± 0.0
1.746LysArg: 1.746 ± 0.0
4.948LysSer: 4.948 ± 0.0
3.783LysThr: 3.783 ± 0.0
4.075LysVal: 4.075 ± 0.0
0.873LysTrp: 0.873 ± 0.0
2.91LysTyr: 2.91 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.657LeuAla: 4.657 ± 0.0
0.582LeuCys: 0.582 ± 0.0
5.821LeuAsp: 5.821 ± 0.0
6.112LeuGlu: 6.112 ± 0.0
4.075LeuPhe: 4.075 ± 0.0
4.366LeuGly: 4.366 ± 0.0
1.455LeuHis: 1.455 ± 0.0
4.366LeuIle: 4.366 ± 0.0
5.239LeuLys: 5.239 ± 0.0
4.366LeuLeu: 4.366 ± 0.0
2.037LeuMet: 2.037 ± 0.0
3.201LeuAsn: 3.201 ± 0.0
6.112LeuPro: 6.112 ± 0.0
4.657LeuGln: 4.657 ± 0.0
2.619LeuArg: 2.619 ± 0.0
4.366LeuSer: 4.366 ± 0.0
5.821LeuThr: 5.821 ± 0.0
5.239LeuVal: 5.239 ± 0.0
0.873LeuTrp: 0.873 ± 0.0
2.328LeuTyr: 2.328 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.455MetAla: 1.455 ± 0.0
0.582MetCys: 0.582 ± 0.0
2.328MetAsp: 2.328 ± 0.0
3.201MetGlu: 3.201 ± 0.0
1.164MetPhe: 1.164 ± 0.0
1.746MetGly: 1.746 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.455MetIle: 1.455 ± 0.0
1.746MetLys: 1.746 ± 0.0
1.455MetLeu: 1.455 ± 0.0
0.291MetMet: 0.291 ± 0.0
0.873MetAsn: 0.873 ± 0.0
2.328MetPro: 2.328 ± 0.0
2.037MetGln: 2.037 ± 0.0
0.291MetArg: 0.291 ± 0.0
0.582MetSer: 0.582 ± 0.0
1.746MetThr: 1.746 ± 0.0
2.037MetVal: 2.037 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.492AsnAla: 3.492 ± 0.0
0.873AsnCys: 0.873 ± 0.0
1.455AsnAsp: 1.455 ± 0.0
3.783AsnGlu: 3.783 ± 0.0
3.783AsnPhe: 3.783 ± 0.0
2.619AsnGly: 2.619 ± 0.0
0.873AsnHis: 0.873 ± 0.0
2.328AsnIle: 2.328 ± 0.0
3.492AsnLys: 3.492 ± 0.0
4.366AsnLeu: 4.366 ± 0.0
0.873AsnMet: 0.873 ± 0.0
2.91AsnAsn: 2.91 ± 0.0
4.075AsnPro: 4.075 ± 0.0
1.164AsnGln: 1.164 ± 0.0
0.873AsnArg: 0.873 ± 0.0
4.948AsnSer: 4.948 ± 0.0
3.201AsnThr: 3.201 ± 0.0
2.328AsnVal: 2.328 ± 0.0
1.164AsnTrp: 1.164 ± 0.0
1.455AsnTyr: 1.455 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.492ProAla: 3.492 ± 0.0
0.0ProCys: 0.0 ± 0.0
1.746ProAsp: 1.746 ± 0.0
2.328ProGlu: 2.328 ± 0.0
4.075ProPhe: 4.075 ± 0.0
1.455ProGly: 1.455 ± 0.0
0.873ProHis: 0.873 ± 0.0
2.037ProIle: 2.037 ± 0.0
4.948ProLys: 4.948 ± 0.0
2.91ProLeu: 2.91 ± 0.0
0.873ProMet: 0.873 ± 0.0
1.746ProAsn: 1.746 ± 0.0
1.164ProPro: 1.164 ± 0.0
3.201ProGln: 3.201 ± 0.0
3.492ProArg: 3.492 ± 0.0
4.657ProSer: 4.657 ± 0.0
4.075ProThr: 4.075 ± 0.0
2.037ProVal: 2.037 ± 0.0
0.291ProTrp: 0.291 ± 0.0
1.455ProTyr: 1.455 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.619GlnAla: 2.619 ± 0.0
0.582GlnCys: 0.582 ± 0.0
2.037GlnAsp: 2.037 ± 0.0
4.366GlnGlu: 4.366 ± 0.0
1.164GlnPhe: 1.164 ± 0.0
0.873GlnGly: 0.873 ± 0.0
1.164GlnHis: 1.164 ± 0.0
3.492GlnIle: 3.492 ± 0.0
4.075GlnLys: 4.075 ± 0.0
4.657GlnLeu: 4.657 ± 0.0
0.291GlnMet: 0.291 ± 0.0
1.164GlnAsn: 1.164 ± 0.0
0.291GlnPro: 0.291 ± 0.0
2.328GlnGln: 2.328 ± 0.0
4.075GlnArg: 4.075 ± 0.0
4.075GlnSer: 4.075 ± 0.0
3.201GlnThr: 3.201 ± 0.0
2.619GlnVal: 2.619 ± 0.0
0.582GlnTrp: 0.582 ± 0.0
1.164GlnTyr: 1.164 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.783ArgAla: 3.783 ± 0.0
0.291ArgCys: 0.291 ± 0.0
1.746ArgAsp: 1.746 ± 0.0
1.455ArgGlu: 1.455 ± 0.0
2.619ArgPhe: 2.619 ± 0.0
2.328ArgGly: 2.328 ± 0.0
0.291ArgHis: 0.291 ± 0.0
2.619ArgIle: 2.619 ± 0.0
3.201ArgLys: 3.201 ± 0.0
2.328ArgLeu: 2.328 ± 0.0
1.746ArgMet: 1.746 ± 0.0
2.037ArgAsn: 2.037 ± 0.0
0.582ArgPro: 0.582 ± 0.0
1.746ArgGln: 1.746 ± 0.0
3.783ArgArg: 3.783 ± 0.0
1.455ArgSer: 1.455 ± 0.0
4.366ArgThr: 4.366 ± 0.0
3.201ArgVal: 3.201 ± 0.0
0.582ArgTrp: 0.582 ± 0.0
2.619ArgTyr: 2.619 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.492SerAla: 3.492 ± 0.0
2.328SerCys: 2.328 ± 0.0
3.492SerAsp: 3.492 ± 0.0
3.783SerGlu: 3.783 ± 0.0
3.492SerPhe: 3.492 ± 0.0
4.366SerGly: 4.366 ± 0.0
1.746SerHis: 1.746 ± 0.0
3.201SerIle: 3.201 ± 0.0
4.366SerLys: 4.366 ± 0.0
6.112SerLeu: 6.112 ± 0.0
0.873SerMet: 0.873 ± 0.0
2.328SerAsn: 2.328 ± 0.0
2.619SerPro: 2.619 ± 0.0
2.91SerGln: 2.91 ± 0.0
2.037SerArg: 2.037 ± 0.0
5.53SerSer: 5.53 ± 0.0
4.657SerThr: 4.657 ± 0.0
6.985SerVal: 6.985 ± 0.0
0.0SerTrp: 0.0 ± 0.0
1.164SerTyr: 1.164 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.657ThrAla: 4.657 ± 0.0
0.873ThrCys: 0.873 ± 0.0
4.366ThrAsp: 4.366 ± 0.0
3.492ThrGlu: 3.492 ± 0.0
2.91ThrPhe: 2.91 ± 0.0
4.366ThrGly: 4.366 ± 0.0
1.746ThrHis: 1.746 ± 0.0
3.492ThrIle: 3.492 ± 0.0
4.075ThrLys: 4.075 ± 0.0
5.821ThrLeu: 5.821 ± 0.0
0.291ThrMet: 0.291 ± 0.0
2.91ThrAsn: 2.91 ± 0.0
6.403ThrPro: 6.403 ± 0.0
2.619ThrGln: 2.619 ± 0.0
2.328ThrArg: 2.328 ± 0.0
4.657ThrSer: 4.657 ± 0.0
7.276ThrThr: 7.276 ± 0.0
4.657ThrVal: 4.657 ± 0.0
2.037ThrTrp: 2.037 ± 0.0
2.037ThrTyr: 2.037 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.657ValAla: 4.657 ± 0.0
0.291ValCys: 0.291 ± 0.0
2.91ValAsp: 2.91 ± 0.0
2.619ValGlu: 2.619 ± 0.0
2.619ValPhe: 2.619 ± 0.0
5.53ValGly: 5.53 ± 0.0
0.873ValHis: 0.873 ± 0.0
3.783ValIle: 3.783 ± 0.0
3.492ValLys: 3.492 ± 0.0
3.783ValLeu: 3.783 ± 0.0
2.91ValMet: 2.91 ± 0.0
2.91ValAsn: 2.91 ± 0.0
2.328ValPro: 2.328 ± 0.0
3.783ValGln: 3.783 ± 0.0
4.657ValArg: 4.657 ± 0.0
3.492ValSer: 3.492 ± 0.0
5.239ValThr: 5.239 ± 0.0
5.53ValVal: 5.53 ± 0.0
0.582ValTrp: 0.582 ± 0.0
4.366ValTyr: 4.366 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.873TrpAla: 0.873 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.291TrpAsp: 0.291 ± 0.0
0.582TrpGlu: 0.582 ± 0.0
0.582TrpPhe: 0.582 ± 0.0
0.582TrpGly: 0.582 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.164TrpIle: 1.164 ± 0.0
1.164TrpLys: 1.164 ± 0.0
1.164TrpLeu: 1.164 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.873TrpAsn: 0.873 ± 0.0
0.582TrpPro: 0.582 ± 0.0
1.164TrpGln: 1.164 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.291TrpThr: 0.291 ± 0.0
0.873TrpVal: 0.873 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.291TrpTyr: 0.291 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.619TyrAla: 2.619 ± 0.0
0.582TyrCys: 0.582 ± 0.0
2.619TyrAsp: 2.619 ± 0.0
1.746TyrGlu: 1.746 ± 0.0
1.746TyrPhe: 1.746 ± 0.0
3.492TyrGly: 3.492 ± 0.0
1.164TyrHis: 1.164 ± 0.0
2.619TyrIle: 2.619 ± 0.0
2.91TyrLys: 2.91 ± 0.0
2.328TyrLeu: 2.328 ± 0.0
0.582TyrMet: 0.582 ± 0.0
2.037TyrAsn: 2.037 ± 0.0
1.455TyrPro: 1.455 ± 0.0
0.582TyrGln: 0.582 ± 0.0
1.164TyrArg: 1.164 ± 0.0
1.746TyrSer: 1.746 ± 0.0
2.328TyrThr: 2.328 ± 0.0
2.91TyrVal: 2.91 ± 0.0
0.582TyrTrp: 0.582 ± 0.0
2.037TyrTyr: 2.037 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3437 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski