Amino acid dipepetide frequency for Kelp fly virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.076AlaAla: 4.076 ± 0.0
0.291AlaCys: 0.291 ± 0.0
2.911AlaAsp: 2.911 ± 0.0
3.493AlaGlu: 3.493 ± 0.0
1.747AlaPhe: 1.747 ± 0.0
2.62AlaGly: 2.62 ± 0.0
0.0AlaHis: 0.0 ± 0.0
4.367AlaIle: 4.367 ± 0.0
4.949AlaLys: 4.949 ± 0.0
5.822AlaLeu: 5.822 ± 0.0
1.456AlaMet: 1.456 ± 0.0
2.329AlaAsn: 2.329 ± 0.0
2.62AlaPro: 2.62 ± 0.0
2.911AlaGln: 2.911 ± 0.0
1.747AlaArg: 1.747 ± 0.0
4.949AlaSer: 4.949 ± 0.0
3.493AlaThr: 3.493 ± 0.0
2.62AlaVal: 2.62 ± 0.0
0.873AlaTrp: 0.873 ± 0.0
2.62AlaTyr: 2.62 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.582CysAla: 0.582 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.456CysAsp: 1.456 ± 0.0
1.747CysGlu: 1.747 ± 0.0
0.873CysPhe: 0.873 ± 0.0
0.582CysGly: 0.582 ± 0.0
0.582CysHis: 0.582 ± 0.0
1.164CysIle: 1.164 ± 0.0
1.456CysLys: 1.456 ± 0.0
1.747CysLeu: 1.747 ± 0.0
0.291CysMet: 0.291 ± 0.0
0.291CysAsn: 0.291 ± 0.0
1.164CysPro: 1.164 ± 0.0
1.456CysGln: 1.456 ± 0.0
1.164CysArg: 1.164 ± 0.0
0.873CysSer: 0.873 ± 0.0
0.582CysThr: 0.582 ± 0.0
1.164CysVal: 1.164 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.873CysTyr: 0.873 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.747AspAla: 1.747 ± 0.0
0.873AspCys: 0.873 ± 0.0
2.62AspAsp: 2.62 ± 0.0
4.076AspGlu: 4.076 ± 0.0
3.493AspPhe: 3.493 ± 0.0
2.329AspGly: 2.329 ± 0.0
1.164AspHis: 1.164 ± 0.0
4.658AspIle: 4.658 ± 0.0
3.202AspLys: 3.202 ± 0.0
6.114AspLeu: 6.114 ± 0.0
1.456AspMet: 1.456 ± 0.0
2.329AspAsn: 2.329 ± 0.0
2.911AspPro: 2.911 ± 0.0
1.164AspGln: 1.164 ± 0.0
1.164AspArg: 1.164 ± 0.0
3.785AspSer: 3.785 ± 0.0
1.747AspThr: 1.747 ± 0.0
1.747AspVal: 1.747 ± 0.0
1.456AspTrp: 1.456 ± 0.0
0.873AspTyr: 0.873 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.911GluAla: 2.911 ± 0.0
0.873GluCys: 0.873 ± 0.0
2.911GluAsp: 2.911 ± 0.0
4.076GluGlu: 4.076 ± 0.0
1.747GluPhe: 1.747 ± 0.0
2.038GluGly: 2.038 ± 0.0
1.456GluHis: 1.456 ± 0.0
4.949GluIle: 4.949 ± 0.0
4.367GluLys: 4.367 ± 0.0
3.785GluLeu: 3.785 ± 0.0
0.582GluMet: 0.582 ± 0.0
4.949GluAsn: 4.949 ± 0.0
3.202GluPro: 3.202 ± 0.0
2.038GluGln: 2.038 ± 0.0
1.456GluArg: 1.456 ± 0.0
3.493GluSer: 3.493 ± 0.0
4.658GluThr: 4.658 ± 0.0
5.822GluVal: 5.822 ± 0.0
0.582GluTrp: 0.582 ± 0.0
2.911GluTyr: 2.911 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.911PheAla: 2.911 ± 0.0
0.582PheCys: 0.582 ± 0.0
2.329PheAsp: 2.329 ± 0.0
0.873PheGlu: 0.873 ± 0.0
1.456PhePhe: 1.456 ± 0.0
3.493PheGly: 3.493 ± 0.0
1.747PheHis: 1.747 ± 0.0
3.202PheIle: 3.202 ± 0.0
3.785PheLys: 3.785 ± 0.0
4.076PheLeu: 4.076 ± 0.0
2.911PheMet: 2.911 ± 0.0
3.202PheAsn: 3.202 ± 0.0
0.873PhePro: 0.873 ± 0.0
1.164PheGln: 1.164 ± 0.0
2.62PheArg: 2.62 ± 0.0
4.658PheSer: 4.658 ± 0.0
2.911PheThr: 2.911 ± 0.0
4.949PheVal: 4.949 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.911PheTyr: 2.911 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.038GlyAla: 2.038 ± 0.0
0.873GlyCys: 0.873 ± 0.0
2.038GlyAsp: 2.038 ± 0.0
2.911GlyGlu: 2.911 ± 0.0
2.62GlyPhe: 2.62 ± 0.0
1.747GlyGly: 1.747 ± 0.0
0.291GlyHis: 0.291 ± 0.0
3.785GlyIle: 3.785 ± 0.0
5.531GlyLys: 5.531 ± 0.0
6.987GlyLeu: 6.987 ± 0.0
1.747GlyMet: 1.747 ± 0.0
3.202GlyAsn: 3.202 ± 0.0
1.747GlyPro: 1.747 ± 0.0
1.747GlyGln: 1.747 ± 0.0
1.164GlyArg: 1.164 ± 0.0
6.405GlySer: 6.405 ± 0.0
5.24GlyThr: 5.24 ± 0.0
3.202GlyVal: 3.202 ± 0.0
0.873GlyTrp: 0.873 ± 0.0
1.164GlyTyr: 1.164 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.582HisAla: 0.582 ± 0.0
0.873HisCys: 0.873 ± 0.0
0.582HisAsp: 0.582 ± 0.0
0.873HisGlu: 0.873 ± 0.0
1.747HisPhe: 1.747 ± 0.0
0.582HisGly: 0.582 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.164HisIle: 1.164 ± 0.0
1.164HisLys: 1.164 ± 0.0
2.038HisLeu: 2.038 ± 0.0
0.873HisMet: 0.873 ± 0.0
0.291HisAsn: 0.291 ± 0.0
0.873HisPro: 0.873 ± 0.0
0.873HisGln: 0.873 ± 0.0
1.164HisArg: 1.164 ± 0.0
1.164HisSer: 1.164 ± 0.0
0.873HisThr: 0.873 ± 0.0
0.582HisVal: 0.582 ± 0.0
0.291HisTrp: 0.291 ± 0.0
1.164HisTyr: 1.164 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.658IleAla: 4.658 ± 0.0
2.038IleCys: 2.038 ± 0.0
3.493IleAsp: 3.493 ± 0.0
4.367IleGlu: 4.367 ± 0.0
3.202IlePhe: 3.202 ± 0.0
4.367IleGly: 4.367 ± 0.0
0.873IleHis: 0.873 ± 0.0
3.785IleIle: 3.785 ± 0.0
3.493IleLys: 3.493 ± 0.0
6.405IleLeu: 6.405 ± 0.0
1.747IleMet: 1.747 ± 0.0
3.202IleAsn: 3.202 ± 0.0
3.493IlePro: 3.493 ± 0.0
2.329IleGln: 2.329 ± 0.0
1.164IleArg: 1.164 ± 0.0
7.278IleSer: 7.278 ± 0.0
3.493IleThr: 3.493 ± 0.0
3.202IleVal: 3.202 ± 0.0
1.164IleTrp: 1.164 ± 0.0
5.531IleTyr: 5.531 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.456LysAla: 1.456 ± 0.0
1.456LysCys: 1.456 ± 0.0
3.493LysAsp: 3.493 ± 0.0
3.785LysGlu: 3.785 ± 0.0
5.24LysPhe: 5.24 ± 0.0
3.785LysGly: 3.785 ± 0.0
2.329LysHis: 2.329 ± 0.0
6.114LysIle: 6.114 ± 0.0
6.696LysLys: 6.696 ± 0.0
7.569LysLeu: 7.569 ± 0.0
2.62LysMet: 2.62 ± 0.0
4.658LysAsn: 4.658 ± 0.0
1.747LysPro: 1.747 ± 0.0
4.367LysGln: 4.367 ± 0.0
2.329LysArg: 2.329 ± 0.0
2.62LysSer: 2.62 ± 0.0
4.367LysThr: 4.367 ± 0.0
3.202LysVal: 3.202 ± 0.0
1.747LysTrp: 1.747 ± 0.0
4.367LysTyr: 4.367 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.987LeuAla: 6.987 ± 0.0
2.329LeuCys: 2.329 ± 0.0
4.076LeuAsp: 4.076 ± 0.0
6.696LeuGlu: 6.696 ± 0.0
4.367LeuPhe: 4.367 ± 0.0
3.202LeuGly: 3.202 ± 0.0
1.164LeuHis: 1.164 ± 0.0
5.24LeuIle: 5.24 ± 0.0
6.696LeuLys: 6.696 ± 0.0
6.696LeuLeu: 6.696 ± 0.0
1.164LeuMet: 1.164 ± 0.0
6.987LeuAsn: 6.987 ± 0.0
3.202LeuPro: 3.202 ± 0.0
4.076LeuGln: 4.076 ± 0.0
2.62LeuArg: 2.62 ± 0.0
9.316LeuSer: 9.316 ± 0.0
4.076LeuThr: 4.076 ± 0.0
5.531LeuVal: 5.531 ± 0.0
1.456LeuTrp: 1.456 ± 0.0
4.658LeuTyr: 4.658 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.038MetAla: 2.038 ± 0.0
0.582MetCys: 0.582 ± 0.0
2.911MetAsp: 2.911 ± 0.0
0.873MetGlu: 0.873 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.329MetGly: 2.329 ± 0.0
0.291MetHis: 0.291 ± 0.0
1.456MetIle: 1.456 ± 0.0
1.747MetLys: 1.747 ± 0.0
3.202MetLeu: 3.202 ± 0.0
0.291MetMet: 0.291 ± 0.0
1.164MetAsn: 1.164 ± 0.0
0.873MetPro: 0.873 ± 0.0
1.164MetGln: 1.164 ± 0.0
0.291MetArg: 0.291 ± 0.0
2.62MetSer: 2.62 ± 0.0
1.456MetThr: 1.456 ± 0.0
1.456MetVal: 1.456 ± 0.0
0.582MetTrp: 0.582 ± 0.0
0.873MetTyr: 0.873 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.747AsnAla: 1.747 ± 0.0
1.456AsnCys: 1.456 ± 0.0
2.911AsnAsp: 2.911 ± 0.0
3.202AsnGlu: 3.202 ± 0.0
3.493AsnPhe: 3.493 ± 0.0
3.202AsnGly: 3.202 ± 0.0
1.164AsnHis: 1.164 ± 0.0
4.076AsnIle: 4.076 ± 0.0
2.62AsnLys: 2.62 ± 0.0
5.531AsnLeu: 5.531 ± 0.0
1.456AsnMet: 1.456 ± 0.0
2.329AsnAsn: 2.329 ± 0.0
2.62AsnPro: 2.62 ± 0.0
0.582AsnGln: 0.582 ± 0.0
2.62AsnArg: 2.62 ± 0.0
2.038AsnSer: 2.038 ± 0.0
3.785AsnThr: 3.785 ± 0.0
4.367AsnVal: 4.367 ± 0.0
0.582AsnTrp: 0.582 ± 0.0
3.202AsnTyr: 3.202 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.785ProAla: 3.785 ± 0.0
0.291ProCys: 0.291 ± 0.0
1.747ProAsp: 1.747 ± 0.0
1.747ProGlu: 1.747 ± 0.0
1.456ProPhe: 1.456 ± 0.0
2.911ProGly: 2.911 ± 0.0
0.873ProHis: 0.873 ± 0.0
4.076ProIle: 4.076 ± 0.0
1.747ProLys: 1.747 ± 0.0
4.949ProLeu: 4.949 ± 0.0
0.873ProMet: 0.873 ± 0.0
1.747ProAsn: 1.747 ± 0.0
1.164ProPro: 1.164 ± 0.0
2.329ProGln: 2.329 ± 0.0
2.038ProArg: 2.038 ± 0.0
4.949ProSer: 4.949 ± 0.0
3.202ProThr: 3.202 ± 0.0
2.329ProVal: 2.329 ± 0.0
0.291ProTrp: 0.291 ± 0.0
0.873ProTyr: 0.873 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.038GlnAla: 2.038 ± 0.0
0.873GlnCys: 0.873 ± 0.0
1.164GlnAsp: 1.164 ± 0.0
2.911GlnGlu: 2.911 ± 0.0
1.747GlnPhe: 1.747 ± 0.0
1.747GlnGly: 1.747 ± 0.0
0.582GlnHis: 0.582 ± 0.0
2.329GlnIle: 2.329 ± 0.0
1.747GlnLys: 1.747 ± 0.0
3.785GlnLeu: 3.785 ± 0.0
2.038GlnMet: 2.038 ± 0.0
1.164GlnAsn: 1.164 ± 0.0
1.747GlnPro: 1.747 ± 0.0
2.62GlnGln: 2.62 ± 0.0
0.873GlnArg: 0.873 ± 0.0
4.949GlnSer: 4.949 ± 0.0
2.038GlnThr: 2.038 ± 0.0
4.076GlnVal: 4.076 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
0.873GlnTyr: 0.873 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
0.291ArgAla: 0.291 ± 0.0
0.291ArgCys: 0.291 ± 0.0
1.747ArgAsp: 1.747 ± 0.0
2.038ArgGlu: 2.038 ± 0.0
2.911ArgPhe: 2.911 ± 0.0
1.747ArgGly: 1.747 ± 0.0
0.291ArgHis: 0.291 ± 0.0
3.493ArgIle: 3.493 ± 0.0
2.911ArgLys: 2.911 ± 0.0
2.911ArgLeu: 2.911 ± 0.0
2.038ArgMet: 2.038 ± 0.0
1.747ArgAsn: 1.747 ± 0.0
0.582ArgPro: 0.582 ± 0.0
1.747ArgGln: 1.747 ± 0.0
2.911ArgArg: 2.911 ± 0.0
1.456ArgSer: 1.456 ± 0.0
2.038ArgThr: 2.038 ± 0.0
2.329ArgVal: 2.329 ± 0.0
0.291ArgTrp: 0.291 ± 0.0
1.747ArgTyr: 1.747 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
5.531SerAla: 5.531 ± 0.0
1.747SerCys: 1.747 ± 0.0
4.949SerAsp: 4.949 ± 0.0
3.785SerGlu: 3.785 ± 0.0
3.785SerPhe: 3.785 ± 0.0
5.822SerGly: 5.822 ± 0.0
1.747SerHis: 1.747 ± 0.0
5.24SerIle: 5.24 ± 0.0
4.367SerLys: 4.367 ± 0.0
7.86SerLeu: 7.86 ± 0.0
2.038SerMet: 2.038 ± 0.0
4.949SerAsn: 4.949 ± 0.0
3.202SerPro: 3.202 ± 0.0
2.911SerGln: 2.911 ± 0.0
4.367SerArg: 4.367 ± 0.0
5.24SerSer: 5.24 ± 0.0
5.24SerThr: 5.24 ± 0.0
5.24SerVal: 5.24 ± 0.0
1.456SerTrp: 1.456 ± 0.0
3.785SerTyr: 3.785 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.658ThrAla: 4.658 ± 0.0
0.291ThrCys: 0.291 ± 0.0
2.62ThrAsp: 2.62 ± 0.0
3.202ThrGlu: 3.202 ± 0.0
3.785ThrPhe: 3.785 ± 0.0
5.24ThrGly: 5.24 ± 0.0
1.456ThrHis: 1.456 ± 0.0
1.456ThrIle: 1.456 ± 0.0
4.658ThrLys: 4.658 ± 0.0
4.076ThrLeu: 4.076 ± 0.0
0.873ThrMet: 0.873 ± 0.0
2.62ThrAsn: 2.62 ± 0.0
4.949ThrPro: 4.949 ± 0.0
1.456ThrGln: 1.456 ± 0.0
2.038ThrArg: 2.038 ± 0.0
6.114ThrSer: 6.114 ± 0.0
4.658ThrThr: 4.658 ± 0.0
3.493ThrVal: 3.493 ± 0.0
0.873ThrTrp: 0.873 ± 0.0
2.62ThrTyr: 2.62 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.658ValAla: 4.658 ± 0.0
0.582ValCys: 0.582 ± 0.0
2.911ValAsp: 2.911 ± 0.0
3.493ValGlu: 3.493 ± 0.0
3.493ValPhe: 3.493 ± 0.0
2.62ValGly: 2.62 ± 0.0
1.164ValHis: 1.164 ± 0.0
2.911ValIle: 2.911 ± 0.0
6.405ValLys: 6.405 ± 0.0
3.785ValLeu: 3.785 ± 0.0
1.164ValMet: 1.164 ± 0.0
3.202ValAsn: 3.202 ± 0.0
3.202ValPro: 3.202 ± 0.0
2.911ValGln: 2.911 ± 0.0
0.582ValArg: 0.582 ± 0.0
6.987ValSer: 6.987 ± 0.0
4.658ValThr: 4.658 ± 0.0
3.493ValVal: 3.493 ± 0.0
0.873ValTrp: 0.873 ± 0.0
2.038ValTyr: 2.038 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.873TrpAla: 0.873 ± 0.0
0.291TrpCys: 0.291 ± 0.0
0.582TrpAsp: 0.582 ± 0.0
1.456TrpGlu: 1.456 ± 0.0
0.873TrpPhe: 0.873 ± 0.0
0.582TrpGly: 0.582 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.873TrpIle: 0.873 ± 0.0
1.164TrpLys: 1.164 ± 0.0
0.873TrpLeu: 0.873 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.456TrpAsn: 1.456 ± 0.0
1.164TrpPro: 1.164 ± 0.0
0.291TrpGln: 0.291 ± 0.0
0.873TrpArg: 0.873 ± 0.0
1.164TrpSer: 1.164 ± 0.0
0.582TrpThr: 0.582 ± 0.0
0.582TrpVal: 0.582 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.329TyrAla: 2.329 ± 0.0
1.456TyrCys: 1.456 ± 0.0
1.456TyrAsp: 1.456 ± 0.0
3.202TyrGlu: 3.202 ± 0.0
2.62TyrPhe: 2.62 ± 0.0
4.367TyrGly: 4.367 ± 0.0
0.582TyrHis: 0.582 ± 0.0
4.949TyrIle: 4.949 ± 0.0
5.24TyrLys: 5.24 ± 0.0
2.329TyrLeu: 2.329 ± 0.0
0.291TyrMet: 0.291 ± 0.0
1.164TyrAsn: 1.164 ± 0.0
2.038TyrPro: 2.038 ± 0.0
1.164TyrGln: 1.164 ± 0.0
2.329TyrArg: 2.329 ± 0.0
3.202TyrSer: 3.202 ± 0.0
2.038TyrThr: 2.038 ± 0.0
2.038TyrVal: 2.038 ± 0.0
0.291TyrTrp: 0.291 ± 0.0
1.164TyrTyr: 1.164 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski