Amino acid dipeptide frequency for Shahe isopoda virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
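As a rough illustration of what a dipeptide frequency is, the sketch below counts every overlapping pair of adjacent residues in a sequence and reports each pair's share in per mille. The function name `dipeptide_permille`, the one-letter residue codes, and the toy sequence are assumptions made for this example only; the page does not describe the database's own implementation.

```python
from collections import Counter


def dipeptide_permille(sequence: str) -> dict[str, float]:
    """Return the frequency of each overlapping residue pair, in per mille."""
    # A sequence of n residues contains n - 1 overlapping dipeptides.
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    total = len(pairs)
    if total == 0:
        return {}
    counts = Counter(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (hypothetical): the pairs are AA, AG, GA, AA, AG.
freqs = dipeptide_permille("AAGAAG")
```

Pairs that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.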

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
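The arithmetic behind the expected value can be checked with a short sketch. It computes the uniform expectation for 20 and for 21 residue types and labels a table value as over- or underrepresented. Using exactly this uniform value as the colouring threshold is an assumption; the names `uniform_20`, `uniform_21` and `classify` are hypothetical.

```python
# Uniform expectation in per mille: 1000 / (number of possible dipeptides).
uniform_20 = 1000.0 / 20**2  # 400 dipeptides -> 2.5 per mille (0.25%)
uniform_21 = 1000.0 / 21**2  # 441 dipeptides, with Xaa -> about 2.268 per mille


def classify(value_permille: float, expected: float = uniform_20) -> str:
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla (9.633) and CysHis (0.301).
labels = {"AlaAla": classify(9.633), "CysHis": classify(0.301)}
```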

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Every value has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  9.633  2.408   3.01  2.107  1.505  6.321  1.204  2.709   3.01  4.214  2.408   3.01  4.214  4.515  6.321   6.02  6.321  4.816  1.204  2.709    0.0
Cys  0.903    0.0  1.204  0.602  0.903  1.505  0.301  0.903    0.0  0.602    0.0  0.602  0.903  0.602  0.903  1.204  0.602  0.602    0.0  0.301    0.0
Asp  4.515  0.301  3.612  3.311  3.311  2.709  2.107  4.816  2.709  5.719  2.709  2.107  2.709  1.204   3.01  5.117  3.311  4.214  0.602  2.709    0.0
Glu  3.311  1.204  1.806   3.01  3.612  1.204  1.806  2.709  1.505  3.913  1.204  3.311  2.107  1.505  1.806   3.01  3.913  2.408  0.602   3.01    0.0
Phe  4.515  0.301  3.612  1.505  3.612  1.204  0.602  2.408  2.709  2.107  0.301  2.107  2.709  0.903  1.806  2.709  2.107  4.214  0.602  1.806    0.0
Gly  5.719  0.602  2.107  2.408  1.505  2.709  1.806  3.913  2.408  4.214  0.903  3.913  4.214  2.107   3.01  3.612  3.612   6.02  0.301  1.204    0.0
His  1.806  0.602  0.602  1.204  1.204    0.0   3.01  1.806  0.602  3.311  0.903  0.602  1.204  0.602  1.505  0.602  1.806  1.505  0.903  0.903    0.0
Ile  4.515  0.602  4.816   3.01  0.602  4.214  0.903  2.107  5.117  5.418  2.107  4.214   3.01  1.806  2.709  4.515   3.01  5.117  0.602  2.709    0.0
Lys  4.515  0.301  3.913  2.408  1.505  2.107  1.204  2.709  3.311  3.612  1.204  2.709  4.214  0.301  3.311  5.418  6.623  3.612  1.204  1.204    0.0
Leu  4.515  0.301  4.214  3.612  1.505  3.612  1.204   6.02   6.02   6.02  0.903  4.515  4.214  3.311  3.612   6.02  6.321  6.623  0.301  2.408    0.0
Met  0.602  0.301  2.107  1.505  1.204  1.505  0.903  1.806  1.505  1.505  0.903  1.505  0.903  1.204  1.204  2.107  1.204  0.602  0.602  1.204    0.0
Asn  4.214    0.0  3.311  2.408   3.01  3.913  0.903  2.709  3.612   3.01  2.107  2.709  3.612  2.709  3.311  3.612  1.806  3.913  1.505  1.806    0.0
Pro  3.311  0.301  2.709  1.806  2.107  3.311  1.505  4.214  2.408  6.321  0.903  2.107  4.816  0.602  2.107  5.719  5.117  3.311    0.0  1.806    0.0
Gln  3.311  0.301  2.709  1.505  0.903  1.505  0.903  1.505  1.505  2.408  0.602  2.408  1.505  1.204  0.903  3.612  3.612  1.806    0.0  1.505    0.0
Arg  3.311  0.903  3.311  2.709   3.01   3.01  1.204  2.709  3.612  3.913  0.602  2.709  2.408  2.709  3.612   3.01  3.913  4.214  0.903  1.806    0.0
Ser   6.02  0.903   6.02  4.515  3.913  4.515  1.204  5.418  2.709   6.02  2.107  4.515  3.612   3.01   3.01   8.73  4.816  4.214  1.204  3.913    0.0
Thr  3.913  0.602  4.816  3.913  4.214  5.719  1.204  7.225  4.214  3.612  1.204  4.214  3.612  2.408  4.816   6.02  3.612   3.01    0.0  1.806    0.0
Val  5.719  1.806  3.612  3.612   3.01  3.311  1.806  3.612  5.117  4.214  1.204  4.515  2.709  1.204  4.214  5.719  3.913  5.117  0.903  2.709    0.0
Trp  0.602    0.0  0.602    0.0  0.301  1.204  0.301  0.602  1.204  2.107  0.602  0.301  0.301    0.0  1.204  0.602  0.903  1.204    0.0    0.0    0.0
Tyr  2.709  0.903  2.709  1.806  0.903  2.709  0.903  1.204  2.107   3.01  1.204  2.107  1.505  1.806  1.204   3.01  3.612  1.505  0.602  0.903    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (3323 amino acids)
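The sample size explains the granularity of the table. Assuming the dipeptides are counted as overlapping pairs within the single 3323-residue protein, there are 3322 dipeptides, so each occurrence contributes about 0.301‰ and every table value is a whole multiple of that step. The sketch below recovers the implied raw counts under that assumption; the counts themselves are inferred here, not reported by the source, and the names are hypothetical.

```python
N_RESIDUES = 3323                # from the statistics line above
N_DIPEPTIDES = N_RESIDUES - 1    # overlapping pairs in one protein (assumption)

# Contribution of a single dipeptide occurrence, in per mille.
STEP = 1000.0 / N_DIPEPTIDES


def implied_count(value_permille: float) -> int:
    """Infer the raw occurrence count behind a rounded per mille value."""
    return round(value_permille * N_DIPEPTIDES / 1000.0)


# Values taken from the table: AlaAla, SerSer and CysHis.
counts = {
    "AlaAla": implied_count(9.633),
    "SerSer": implied_count(8.73),
    "CysHis": implied_count(0.301),
}
```

With so few observations per dipeptide, small differences between table entries correspond to differences of only one or two occurrences.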

Note: The error has been estimated by bootstrapping (×100) at the protein level.
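A protein-level bootstrap resamples whole proteins with replacement and recomputes the statistic for each resample. The sketch below is one plausible reading of that note, not the database's actual procedure; the function names, the fixed seed, and the use of the population standard deviation as the error are assumptions. It also shows why every error in the table is ± 0.0: with a single protein, every resample is identical.

```python
import random
from statistics import pstdev


def count_pair(sequence: str, pair: str) -> int:
    """Count overlapping occurrences of a two-residue pair in a sequence."""
    return sum(1 for i in range(len(sequence) - 1) if sequence[i:i + 2] == pair)


def bootstrap_error(proteins: list[str], pair: str,
                    n_rounds: int = 100, seed: int = 0) -> float:
    """Spread (in per mille) of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        hits = sum(count_pair(p, pair) for p in sample)
        total = sum(max(len(p) - 1, 0) for p in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return pstdev(estimates)


# Toy one-protein proteome (hypothetical sequence): no variation is possible.
single_protein_error = bootstrap_error(["AAGAAG"], "AA")
# Toy two-protein proteome: resamples differ, so the error is non-zero.
two_protein_error = bootstrap_error(["AAAA", "GGGG"], "AA")
```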

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski