Amino acid dipeptide frequency for Rasavirus sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
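
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: dipeptides are counted as overlapping windows of two residues, every character outside the 20 standard one-letter codes is mapped to X (Xaa), and the function name `dipeptide_permille` and the toy sequence are hypothetical, not part of Proteome-pI.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(sequence: str) -> dict[str, float]:
    """Return overlapping dipeptide frequencies in per mille (‰)."""
    # Assumption: anything outside the 20 standard residues becomes X (Xaa).
    seq = "".join(c if c in STANDARD else "X" for c in sequence.upper())
    # Count every overlapping window of two residues.
    counts = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


freqs = dipeptide_permille("MSSLAAV")  # toy sequence, not the real proteome
expected = 1000.0 / 400  # 2.5‰ if all 400 standard dipeptides were equally likely
over = sorted(k for k, v in freqs.items() if v > expected)
```

Here `over` lists the dipeptides that exceed the uniform expectation, which would correspond to the red cells of the table; those below it would correspond to the blue cells.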

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
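
As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the value 10.402 is the SerSer entry of the table.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5‰: expectation for 400 equally likely dipeptides


def permille_to_fraction(value_permille: float) -> float:
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille: float) -> float:
    """Ratio of an observed value to the uniform expectation (above 1 means overrepresented)."""
    return value_permille / UNIFORM_PERMILLE


ser_ser_fraction = permille_to_fraction(10.402)  # SerSer, about 0.0104
ser_ser_fold = fold_over_uniform(10.402)         # roughly four times the uniform expectation
```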

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second (values in ‰):

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala   5.38    0.0  3.945  2.511  3.587  5.739  1.076  5.022  2.511  6.815  0.717  2.511  4.663  4.304  3.587  4.304  3.945  7.174  1.435  2.511    0.0
Cys  1.435  0.359  0.359  0.359  1.076  1.435    0.0  1.076  0.359  1.435  0.717  1.435  2.152  0.359  1.435  1.076  1.435  1.793  0.717  1.076    0.0
Asp  2.152  0.717  5.022  5.022  2.152  4.304  0.717  1.435  2.511  5.022  0.717  2.152  4.304  0.717  1.793  4.663  2.511  4.304  1.435  2.152    0.0
Glu  5.739  1.435  1.076  3.587  3.228  2.511  1.076  2.152  2.511  4.663  2.152  2.152  1.435  2.152  2.152  4.304  1.076  4.304  1.435  2.869    0.0
Phe  3.587  0.717  5.022  3.587  2.869  2.869  1.076  1.076  3.228  2.869  1.435  1.076  2.152  1.076  2.869  5.739  2.511   5.38  0.359  0.717    0.0
Gly  3.945  1.435  3.228  2.511  2.869  4.304  1.793  3.228  2.869  6.815  2.152  3.228  2.152  2.869  3.945  5.739  3.228  5.739  1.076  2.511    0.0
His  1.435  1.793  0.359  1.435  2.152  0.717    0.0  1.435  0.359  1.793  0.717  1.076  1.435    0.0  1.076  1.435  0.717  1.435    0.0  0.359    0.0
Ile  3.228  1.076  3.587  2.869  1.793  4.304    0.0  1.435  2.511  2.152  0.359  3.228  6.815  1.435  2.511  1.435  1.793  2.152  0.717  1.076    0.0
Lys  4.304  0.717  2.511  1.435  2.152  2.152  1.435  2.869  2.511  3.228  1.076  1.076  2.152  2.152  0.359  3.228  3.228  3.945  0.359  2.152    0.0
Leu  6.456  1.076  2.511  2.869  2.511  4.663  3.228  5.022  4.663   8.25  1.435  4.304  4.304  2.152  4.304  7.532  5.022  6.098  0.717   5.38    0.0
Met  1.076  0.359  0.359  1.793  2.869  1.793  0.717  1.435  0.359  1.076  0.717  1.793  1.076  0.717  1.793  2.869  1.435  3.228  0.359  1.076    0.0
Asn  1.793  0.359  1.793  2.511  1.793  2.152  1.435  2.152  1.076  3.228  0.717  3.587  1.793  1.435  2.511  5.739  3.945  7.174  0.717  1.793    0.0
Pro  4.663  0.717  1.793  3.945  2.511  4.663  2.511  0.359  1.435  4.304  2.511  1.793  2.869  0.717  3.228  3.587  2.869  6.815  0.717  3.587    0.0
Gln  1.793    0.0  0.359  1.435  2.152  3.587  0.717  2.152  1.076  1.435  0.359  1.435  1.435  1.435  1.793  2.152  1.076  1.793  0.359  3.228    0.0
Arg  5.022  1.076  1.793  3.587  2.152  2.511  0.359  0.717  1.793  3.945  1.793  1.076  2.511  1.793  3.228   5.38  3.945  3.945  1.435  2.869    0.0
Ser  6.098  0.717  5.022  3.228  4.304  3.945  0.359  5.739  4.304  6.456  2.511  3.945  3.228  2.152  3.228 10.402   5.38  5.739  1.076  4.304    0.0
Thr  3.587  1.435  2.511  3.945  2.511  3.945  0.717  2.869  2.869  6.815  2.152  3.228  2.869  0.359  3.587  2.152  3.587  5.022  0.717  2.152    0.0
Val  6.815  4.304  6.815  3.945  4.663  5.739  1.435  2.869  3.945  6.098  3.228  3.945  6.456  1.076  3.945  6.098  5.739  4.304  1.076  3.228    0.0
Trp  0.359  0.717  0.717  0.359  1.435  0.359  0.717    0.0  1.076  1.435  0.717  2.152    0.0  1.076    0.0  0.717  1.076  1.793    0.0  1.076    0.0
Tyr  2.869  1.076  5.022  1.076  1.435  3.945  0.359  1.435  1.076  4.663  0.359  2.869  1.793  1.793  3.587  3.587  2.511  3.587  1.076  1.076    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

The estimated error is ± 0.0 for every dipeptide in the table.

Statistics based on 1 protein (2789 amino acids)
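
As a rough sanity check, the per mille values can be turned back into approximate raw counts. The sketch assumes that dipeptides are counted as overlapping windows, so a single chain of 2789 residues would contain 2788 of them; this denominator is an inference from the numbers above, not something the page states, and `approx_count` is a hypothetical helper.

```python
N_RESIDUES = 2789              # from the statistics line above
N_DIPEPTIDES = N_RESIDUES - 1  # assumption: overlapping windows in one chain


def approx_count(value_permille: float, n_dipeptides: int = N_DIPEPTIDES) -> int:
    """Approximate number of occurrences behind a per mille value."""
    return round(value_permille * 1e-3 * n_dipeptides)


# Three values taken from the table.
counts = {
    name: approx_count(value)
    for name, value in {"SerSer": 10.402, "LeuLeu": 8.25, "CysCys": 0.359}.items()
}
```

Under that assumption the three values correspond to 29, 23 and 1 occurrences, which fits the observation that the smallest non-zero value in the table is 0.359‰.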

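Note: The error has been estimated by bootstrapping (x100) at the protein level. With only one protein in this proteome, every resample is identical, which is consistent with all errors being 0.0.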
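
The note above does not describe the exact procedure, so the following is only a minimal sketch of what protein-level bootstrapping could look like: proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed on each resample, and the spread of those estimates is taken as the error. The function names, the use of the population standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(protein: str) -> Counter:
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(protein[i:i + 2] for i in range(len(protein) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread (in ‰) of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for protein in sample:
            counts.update(dipeptide_counts(protein))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy data, not the real proteome.
single = bootstrap_error(["MSSLAAVSS"], "SS")
several = bootstrap_error(["MSSL", "AAVK", "SSSS"], "SS")
```

With a single protein every resample contains the same sequence, so `single` is exactly zero, whereas a proteome with several proteins generally gives a non-zero spread.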

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski