Amino acid dipeptide frequency for Beihai picorna-like virus 123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
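
As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a sequence and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_permille`, the toy sequence, and the use of one-letter codes are assumptions made for the example, and the source does not state how the database itself counts dipeptides.

```python
from collections import Counter


def dipeptide_permille(seq: str) -> dict[str, float]:
    """Return the frequency of each overlapping dipeptide in seq, in per mille."""
    # Overlapping windows: a sequence of n residues yields n - 1 dipeptides.
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = len(pairs)
    if total == 0:
        return {}
    counts = Counter(pairs)
    return {dp: 1000 * n / total for dp, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000 / 400

# Toy sequence (hypothetical, not taken from the proteome).
freqs = dipeptide_permille("MALLAKLLA")
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE)
```

With a sequence this short every observed dipeptide exceeds the uniform expectation, so the comparison only becomes meaningful for full-length proteins.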

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. The estimated error is ± 0.0 for every entry.

     Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  5.478  2.435  3.652  3.652  3.348  3.348  0.913  3.043  4.26   6.695  3.652  2.13   2.435  1.826  2.739  2.739  2.739  4.869  0.609  2.13   0.0
Cys  1.217  0.913  0.609  2.13   1.826  2.13   0.304  0.0    0.304  2.435  1.217  0.609  0.913  0.304  0.304  1.826  1.522  1.826  0.0    0.304  0.0
Asp  2.435  1.826  5.478  5.478  6.086  5.173  1.217  5.173  3.652  4.869  2.739  2.435  3.652  2.13   3.043  3.348  3.652  4.869  1.217  2.435  0.0
Glu  3.652  1.522  3.956  3.652  2.435  3.043  2.435  3.956  4.565  8.217  1.826  2.739  2.739  2.435  3.652  3.652  3.652  3.956  0.913  3.652  0.0
Phe  3.956  1.217  3.652  3.652  3.043  2.13   0.913  2.435  1.826  4.26   1.522  0.913  3.043  2.739  2.13   5.173  2.739  3.348  0.0    2.13   0.0
Gly  2.13   1.522  6.695  3.043  2.13   1.217  0.609  2.435  3.652  3.652  2.739  3.348  1.826  1.826  1.217  3.956  2.13   6.999  0.913  2.13   0.0
His  1.522  0.609  1.522  0.609  1.217  3.043  0.913  0.913  0.304  1.522  0.609  0.609  0.913  0.0    0.609  1.826  0.609  2.435  0.0    0.913  0.0
Ile  3.348  0.0    3.043  3.043  1.826  2.13   0.913  1.826  4.565  4.869  1.522  2.435  3.043  0.913  3.348  3.043  3.043  4.26   0.304  1.217  0.0
Lys  3.652  1.826  3.956  6.695  3.652  2.435  0.609  2.435  4.869  6.695  1.217  2.435  1.217  0.913  3.043  3.956  2.13   6.086  1.522  3.652  0.0
Leu  3.652  2.739  6.695  4.869  3.956  5.782  2.435  5.173  6.695  8.217  2.739  3.348  2.435  3.043  7.912  4.869  6.086  6.391  1.522  2.13   0.0
Met  1.522  1.217  3.043  2.739  0.609  2.13   0.913  0.913  2.13   2.739  1.217  0.913  0.913  1.217  0.913  3.956  1.522  1.826  0.609  1.217  0.0
Asn  3.348  0.609  1.522  4.869  1.217  1.522  1.217  1.826  1.522  3.348  1.826  2.739  1.826  1.522  1.826  2.739  3.043  2.435  0.913  0.304  0.0
Pro  2.435  0.304  3.348  2.13   2.739  2.435  1.522  3.043  1.217  3.956  1.217  2.13   2.739  0.304  1.826  2.435  2.13   3.956  0.0    3.043  0.0
Gln  2.13   0.913  2.13   1.522  2.13   2.13   0.304  1.826  1.826  3.348  1.217  1.217  1.522  0.0    0.304  4.26   0.913  3.348  0.609  1.217  0.0
Arg  4.26   0.304  3.348  4.565  3.348  1.826  1.217  3.043  3.348  2.435  1.522  4.26   1.217  1.217  2.739  3.043  1.217  3.956  0.304  1.217  0.0
Ser  4.26   0.609  3.652  4.565  2.739  4.26   1.522  2.13   3.956  6.999  0.913  2.435  3.348  4.26   1.826  5.782  4.565  5.478  0.913  3.956  0.0
Thr  3.043  0.304  3.348  3.956  1.217  2.739  0.913  3.956  5.173  3.348  0.304  1.522  2.739  3.348  3.956  2.13   3.043  4.565  0.609  0.913  0.0
Val  6.086  1.522  6.695  3.348  4.565  3.652  1.217  2.739  4.869  5.478  2.739  2.13   4.565  4.26   4.565  6.086  3.348  6.695  1.522  5.173  0.0
Trp  1.217  0.0    0.0    0.609  1.217  0.304  0.304  0.609  0.913  1.522  0.304  0.304  0.609  0.304  0.609  1.826  0.609  1.217  0.304  1.217  0.0
Tyr  3.348  0.304  4.565  1.522  1.522  2.739  0.304  1.217  3.348  6.391  0.913  1.522  1.217  0.304  1.826  1.522  2.435  2.739  1.217  0.913  0.0
Xaa  0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (3287 amino acids)
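
To make the per mille values more tangible, the short sketch below converts them back to approximate dipeptide counts. It assumes that dipeptides are counted as overlapping windows over the single 3287-residue protein, giving 3286 dipeptides; the source does not state the counting method, and the helper name `permille_to_count` is hypothetical. Under that assumption a single occurrence corresponds to about 0.304‰, which agrees with the smallest non-zero value in the table.

```python
N_RESIDUES = 3287               # from the statistics line above
N_DIPEPTIDES = N_RESIDUES - 1   # assumption: overlapping windows in one protein


def permille_to_count(value_permille: float) -> int:
    """Convert a per mille frequency to the nearest whole number of occurrences."""
    fraction = value_permille * 1e-3   # per mille -> fraction
    return round(fraction * N_DIPEPTIDES)


# Frequency contributed by a single occurrence, rounded as in the table.
one_occurrence = round(1000 / N_DIPEPTIDES, 3)

ala_ala = permille_to_count(5.478)   # AlaAla
leu_leu = permille_to_count(8.217)   # LeuLeu
```

Because the counts are small integers, the table values are all multiples of roughly 0.304‰.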

Note: The error has been estimated by bootstrapping (×100) at the protein level.
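
The sketch below shows one way a protein-level bootstrap could work: proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the spread of the estimates is taken as the error. This is an illustration under assumptions, not the database's actual procedure, which the source does not describe; the function names and toy sequences are hypothetical. It also suggests why every error is ± 0.0 here: with a single protein, each resample is identical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq: str) -> Counter:
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resampled proteomes."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(dipeptide_counts(seq))
        total = sum(counts.values())
        estimates.append(1000 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(estimates)


# One protein: every resample is the same, so the estimated error is zero.
single = bootstrap_error(["MALLAKLLA"], "LL")

# Several (toy) proteins: resamples differ, so the error is non-zero.
several = bootstrap_error(["MALLAKLLA", "GGGGGG", "ACDEFGH"], "LL")
```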

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski