Amino acid dipepetide frequency for Lagenaria siceraria endornavirus-Hubei

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.608AlaAla: 1.608 ± 0.0
0.0AlaCys: 0.0 ± 0.0
2.613AlaAsp: 2.613 ± 0.0
3.215AlaGlu: 3.215 ± 0.0
0.603AlaPhe: 0.603 ± 0.0
2.412AlaGly: 2.412 ± 0.0
0.603AlaHis: 0.603 ± 0.0
3.215AlaIle: 3.215 ± 0.0
3.818AlaLys: 3.818 ± 0.0
4.421AlaLeu: 4.421 ± 0.0
1.407AlaMet: 1.407 ± 0.0
1.809AlaAsn: 1.809 ± 0.0
0.804AlaPro: 0.804 ± 0.0
0.804AlaGln: 0.804 ± 0.0
2.01AlaArg: 2.01 ± 0.0
2.814AlaSer: 2.814 ± 0.0
2.814AlaThr: 2.814 ± 0.0
3.215AlaVal: 3.215 ± 0.0
0.603AlaTrp: 0.603 ± 0.0
1.005AlaTyr: 1.005 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.603CysAla: 0.603 ± 0.0
0.804CysCys: 0.804 ± 0.0
0.603CysAsp: 0.603 ± 0.0
0.804CysGlu: 0.804 ± 0.0
0.804CysPhe: 0.804 ± 0.0
1.407CysGly: 1.407 ± 0.0
0.402CysHis: 0.402 ± 0.0
1.005CysIle: 1.005 ± 0.0
1.206CysLys: 1.206 ± 0.0
0.804CysLeu: 0.804 ± 0.0
1.206CysMet: 1.206 ± 0.0
1.206CysAsn: 1.206 ± 0.0
0.603CysPro: 0.603 ± 0.0
0.603CysGln: 0.603 ± 0.0
0.201CysArg: 0.201 ± 0.0
0.804CysSer: 0.804 ± 0.0
0.603CysThr: 0.603 ± 0.0
1.206CysVal: 1.206 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.603CysTyr: 0.603 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.215AspAla: 3.215 ± 0.0
0.603AspCys: 0.603 ± 0.0
5.024AspAsp: 5.024 ± 0.0
3.416AspGlu: 3.416 ± 0.0
2.613AspPhe: 2.613 ± 0.0
2.412AspGly: 2.412 ± 0.0
0.804AspHis: 0.804 ± 0.0
4.823AspIle: 4.823 ± 0.0
4.622AspLys: 4.622 ± 0.0
5.225AspLeu: 5.225 ± 0.0
2.412AspMet: 2.412 ± 0.0
4.421AspAsn: 4.421 ± 0.0
1.407AspPro: 1.407 ± 0.0
1.005AspGln: 1.005 ± 0.0
2.613AspArg: 2.613 ± 0.0
2.814AspSer: 2.814 ± 0.0
4.019AspThr: 4.019 ± 0.0
4.22AspVal: 4.22 ± 0.0
1.206AspTrp: 1.206 ± 0.0
2.613AspTyr: 2.613 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.613GluAla: 2.613 ± 0.0
0.804GluCys: 0.804 ± 0.0
3.215GluAsp: 3.215 ± 0.0
4.421GluGlu: 4.421 ± 0.0
2.211GluPhe: 2.211 ± 0.0
2.412GluGly: 2.412 ± 0.0
2.412GluHis: 2.412 ± 0.0
4.823GluIle: 4.823 ± 0.0
4.421GluLys: 4.421 ± 0.0
9.043GluLeu: 9.043 ± 0.0
1.407GluMet: 1.407 ± 0.0
4.22GluAsn: 4.22 ± 0.0
2.412GluPro: 2.412 ± 0.0
3.416GluGln: 3.416 ± 0.0
3.416GluArg: 3.416 ± 0.0
2.814GluSer: 2.814 ± 0.0
4.421GluThr: 4.421 ± 0.0
3.014GluVal: 3.014 ± 0.0
0.603GluTrp: 0.603 ± 0.0
2.613GluTyr: 2.613 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.608PheAla: 1.608 ± 0.0
0.603PheCys: 0.603 ± 0.0
2.211PheAsp: 2.211 ± 0.0
3.014PheGlu: 3.014 ± 0.0
0.603PhePhe: 0.603 ± 0.0
2.211PheGly: 2.211 ± 0.0
0.804PheHis: 0.804 ± 0.0
2.211PheIle: 2.211 ± 0.0
2.211PheLys: 2.211 ± 0.0
2.613PheLeu: 2.613 ± 0.0
0.804PheMet: 0.804 ± 0.0
2.814PheAsn: 2.814 ± 0.0
0.402PhePro: 0.402 ± 0.0
0.603PheGln: 0.603 ± 0.0
2.01PheArg: 2.01 ± 0.0
3.215PheSer: 3.215 ± 0.0
0.804PheThr: 0.804 ± 0.0
2.613PheVal: 2.613 ± 0.0
0.402PheTrp: 0.402 ± 0.0
1.809PheTyr: 1.809 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.809GlyAla: 1.809 ± 0.0
0.603GlyCys: 0.603 ± 0.0
3.416GlyAsp: 3.416 ± 0.0
2.211GlyGlu: 2.211 ± 0.0
1.005GlyPhe: 1.005 ± 0.0
2.211GlyGly: 2.211 ± 0.0
1.206GlyHis: 1.206 ± 0.0
3.416GlyIle: 3.416 ± 0.0
4.421GlyLys: 4.421 ± 0.0
4.22GlyLeu: 4.22 ± 0.0
1.809GlyMet: 1.809 ± 0.0
2.814GlyAsn: 2.814 ± 0.0
1.809GlyPro: 1.809 ± 0.0
2.613GlyGln: 2.613 ± 0.0
1.608GlyArg: 1.608 ± 0.0
3.215GlySer: 3.215 ± 0.0
2.613GlyThr: 2.613 ± 0.0
3.617GlyVal: 3.617 ± 0.0
1.608GlyTrp: 1.608 ± 0.0
2.412GlyTyr: 2.412 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.005HisAla: 1.005 ± 0.0
0.603HisCys: 0.603 ± 0.0
0.804HisAsp: 0.804 ± 0.0
2.211HisGlu: 2.211 ± 0.0
1.608HisPhe: 1.608 ± 0.0
1.407HisGly: 1.407 ± 0.0
1.407HisHis: 1.407 ± 0.0
1.407HisIle: 1.407 ± 0.0
1.407HisLys: 1.407 ± 0.0
3.014HisLeu: 3.014 ± 0.0
0.402HisMet: 0.402 ± 0.0
3.215HisAsn: 3.215 ± 0.0
1.005HisPro: 1.005 ± 0.0
0.603HisGln: 0.603 ± 0.0
1.206HisArg: 1.206 ± 0.0
1.809HisSer: 1.809 ± 0.0
1.005HisThr: 1.005 ± 0.0
1.608HisVal: 1.608 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.608HisTyr: 1.608 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.215IleAla: 3.215 ± 0.0
1.407IleCys: 1.407 ± 0.0
5.426IleAsp: 5.426 ± 0.0
4.22IleGlu: 4.22 ± 0.0
1.206IlePhe: 1.206 ± 0.0
3.416IleGly: 3.416 ± 0.0
1.206IleHis: 1.206 ± 0.0
9.043IleIle: 9.043 ± 0.0
8.842IleLys: 8.842 ± 0.0
7.838IleLeu: 7.838 ± 0.0
2.814IleMet: 2.814 ± 0.0
5.225IleAsn: 5.225 ± 0.0
2.814IlePro: 2.814 ± 0.0
2.412IleGln: 2.412 ± 0.0
3.416IleArg: 3.416 ± 0.0
3.215IleSer: 3.215 ± 0.0
8.441IleThr: 8.441 ± 0.0
5.225IleVal: 5.225 ± 0.0
0.603IleTrp: 0.603 ± 0.0
3.014IleTyr: 3.014 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.814LysAla: 2.814 ± 0.0
0.603LysCys: 0.603 ± 0.0
4.421LysAsp: 4.421 ± 0.0
4.823LysGlu: 4.823 ± 0.0
2.01LysPhe: 2.01 ± 0.0
2.211LysGly: 2.211 ± 0.0
2.211LysHis: 2.211 ± 0.0
6.833LysIle: 6.833 ± 0.0
2.412LysLys: 2.412 ± 0.0
7.436LysLeu: 7.436 ± 0.0
2.613LysMet: 2.613 ± 0.0
3.014LysAsn: 3.014 ± 0.0
5.225LysPro: 5.225 ± 0.0
3.416LysGln: 3.416 ± 0.0
2.814LysArg: 2.814 ± 0.0
5.225LysSer: 5.225 ± 0.0
6.23LysThr: 6.23 ± 0.0
4.622LysVal: 4.622 ± 0.0
1.608LysTrp: 1.608 ± 0.0
3.416LysTyr: 3.416 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.416LeuAla: 3.416 ± 0.0
1.407LeuCys: 1.407 ± 0.0
6.029LeuAsp: 6.029 ± 0.0
6.632LeuGlu: 6.632 ± 0.0
3.617LeuPhe: 3.617 ± 0.0
4.22LeuGly: 4.22 ± 0.0
2.412LeuHis: 2.412 ± 0.0
5.828LeuIle: 5.828 ± 0.0
6.833LeuLys: 6.833 ± 0.0
6.029LeuLeu: 6.029 ± 0.0
3.215LeuMet: 3.215 ± 0.0
7.637LeuAsn: 7.637 ± 0.0
5.426LeuPro: 5.426 ± 0.0
5.024LeuGln: 5.024 ± 0.0
4.22LeuArg: 4.22 ± 0.0
4.421LeuSer: 4.421 ± 0.0
7.838LeuThr: 7.838 ± 0.0
5.426LeuVal: 5.426 ± 0.0
1.407LeuTrp: 1.407 ± 0.0
2.613LeuTyr: 2.613 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.608MetAla: 1.608 ± 0.0
0.603MetCys: 0.603 ± 0.0
1.608MetAsp: 1.608 ± 0.0
1.005MetGlu: 1.005 ± 0.0
0.603MetPhe: 0.603 ± 0.0
1.407MetGly: 1.407 ± 0.0
0.804MetHis: 0.804 ± 0.0
2.412MetIle: 2.412 ± 0.0
1.608MetLys: 1.608 ± 0.0
3.014MetLeu: 3.014 ± 0.0
0.804MetMet: 0.804 ± 0.0
0.603MetAsn: 0.603 ± 0.0
0.201MetPro: 0.201 ± 0.0
1.608MetGln: 1.608 ± 0.0
2.412MetArg: 2.412 ± 0.0
1.608MetSer: 1.608 ± 0.0
3.617MetThr: 3.617 ± 0.0
3.818MetVal: 3.818 ± 0.0
0.402MetTrp: 0.402 ± 0.0
1.407MetTyr: 1.407 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.0
1.809AsnCys: 1.809 ± 0.0
2.613AsnAsp: 2.613 ± 0.0
4.421AsnGlu: 4.421 ± 0.0
2.613AsnPhe: 2.613 ± 0.0
2.01AsnGly: 2.01 ± 0.0
2.211AsnHis: 2.211 ± 0.0
6.833AsnIle: 6.833 ± 0.0
4.622AsnLys: 4.622 ± 0.0
6.23AsnLeu: 6.23 ± 0.0
0.603AsnMet: 0.603 ± 0.0
4.421AsnAsn: 4.421 ± 0.0
2.211AsnPro: 2.211 ± 0.0
2.01AsnGln: 2.01 ± 0.0
3.416AsnArg: 3.416 ± 0.0
2.613AsnSer: 2.613 ± 0.0
2.814AsnThr: 2.814 ± 0.0
5.225AsnVal: 5.225 ± 0.0
1.206AsnTrp: 1.206 ± 0.0
4.622AsnTyr: 4.622 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.804ProAla: 0.804 ± 0.0
0.201ProCys: 0.201 ± 0.0
2.814ProAsp: 2.814 ± 0.0
3.014ProGlu: 3.014 ± 0.0
1.005ProPhe: 1.005 ± 0.0
3.014ProGly: 3.014 ± 0.0
0.804ProHis: 0.804 ± 0.0
3.416ProIle: 3.416 ± 0.0
2.814ProLys: 2.814 ± 0.0
3.617ProLeu: 3.617 ± 0.0
0.603ProMet: 0.603 ± 0.0
2.211ProAsn: 2.211 ± 0.0
1.005ProPro: 1.005 ± 0.0
1.005ProGln: 1.005 ± 0.0
1.005ProArg: 1.005 ± 0.0
2.01ProSer: 2.01 ± 0.0
2.613ProThr: 2.613 ± 0.0
2.814ProVal: 2.814 ± 0.0
1.005ProTrp: 1.005 ± 0.0
1.809ProTyr: 1.809 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.804GlnAla: 0.804 ± 0.0
0.402GlnCys: 0.402 ± 0.0
1.005GlnAsp: 1.005 ± 0.0
2.01GlnGlu: 2.01 ± 0.0
2.412GlnPhe: 2.412 ± 0.0
2.412GlnGly: 2.412 ± 0.0
0.603GlnHis: 0.603 ± 0.0
4.22GlnIle: 4.22 ± 0.0
1.206GlnLys: 1.206 ± 0.0
2.814GlnLeu: 2.814 ± 0.0
0.603GlnMet: 0.603 ± 0.0
1.407GlnAsn: 1.407 ± 0.0
1.407GlnPro: 1.407 ± 0.0
1.608GlnGln: 1.608 ± 0.0
1.809GlnArg: 1.809 ± 0.0
2.211GlnSer: 2.211 ± 0.0
2.613GlnThr: 2.613 ± 0.0
3.818GlnVal: 3.818 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.005GlnTyr: 1.005 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.211ArgAla: 2.211 ± 0.0
0.804ArgCys: 0.804 ± 0.0
2.613ArgAsp: 2.613 ± 0.0
2.613ArgGlu: 2.613 ± 0.0
1.809ArgPhe: 1.809 ± 0.0
1.809ArgGly: 1.809 ± 0.0
1.005ArgHis: 1.005 ± 0.0
2.613ArgIle: 2.613 ± 0.0
3.215ArgLys: 3.215 ± 0.0
5.627ArgLeu: 5.627 ± 0.0
0.804ArgMet: 0.804 ± 0.0
1.809ArgAsn: 1.809 ± 0.0
1.407ArgPro: 1.407 ± 0.0
2.01ArgGln: 2.01 ± 0.0
1.005ArgArg: 1.005 ± 0.0
3.014ArgSer: 3.014 ± 0.0
2.613ArgThr: 2.613 ± 0.0
3.215ArgVal: 3.215 ± 0.0
0.603ArgTrp: 0.603 ± 0.0
1.005ArgTyr: 1.005 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.814SerAla: 2.814 ± 0.0
0.603SerCys: 0.603 ± 0.0
2.613SerAsp: 2.613 ± 0.0
3.215SerGlu: 3.215 ± 0.0
1.005SerPhe: 1.005 ± 0.0
2.613SerGly: 2.613 ± 0.0
2.211SerHis: 2.211 ± 0.0
4.019SerIle: 4.019 ± 0.0
4.421SerLys: 4.421 ± 0.0
3.617SerLeu: 3.617 ± 0.0
3.617SerMet: 3.617 ± 0.0
5.225SerAsn: 5.225 ± 0.0
1.005SerPro: 1.005 ± 0.0
1.005SerGln: 1.005 ± 0.0
2.211SerArg: 2.211 ± 0.0
2.412SerSer: 2.412 ± 0.0
4.22SerThr: 4.22 ± 0.0
3.215SerVal: 3.215 ± 0.0
1.608SerTrp: 1.608 ± 0.0
2.211SerTyr: 2.211 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.211ThrAla: 2.211 ± 0.0
0.603ThrCys: 0.603 ± 0.0
4.823ThrAsp: 4.823 ± 0.0
5.225ThrGlu: 5.225 ± 0.0
2.613ThrPhe: 2.613 ± 0.0
4.019ThrGly: 4.019 ± 0.0
1.608ThrHis: 1.608 ± 0.0
6.632ThrIle: 6.632 ± 0.0
6.632ThrLys: 6.632 ± 0.0
6.23ThrLeu: 6.23 ± 0.0
1.608ThrMet: 1.608 ± 0.0
5.426ThrAsn: 5.426 ± 0.0
3.617ThrPro: 3.617 ± 0.0
1.407ThrGln: 1.407 ± 0.0
1.608ThrArg: 1.608 ± 0.0
3.617ThrSer: 3.617 ± 0.0
4.823ThrThr: 4.823 ± 0.0
3.215ThrVal: 3.215 ± 0.0
0.804ThrTrp: 0.804 ± 0.0
3.014ThrTyr: 3.014 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.412ValAla: 2.412 ± 0.0
2.211ValCys: 2.211 ± 0.0
3.416ValAsp: 3.416 ± 0.0
4.823ValGlu: 4.823 ± 0.0
1.206ValPhe: 1.206 ± 0.0
4.019ValGly: 4.019 ± 0.0
1.809ValHis: 1.809 ± 0.0
5.225ValIle: 5.225 ± 0.0
7.436ValLys: 7.436 ± 0.0
6.632ValLeu: 6.632 ± 0.0
2.412ValMet: 2.412 ± 0.0
4.421ValAsn: 4.421 ± 0.0
2.613ValPro: 2.613 ± 0.0
1.407ValGln: 1.407 ± 0.0
2.412ValArg: 2.412 ± 0.0
3.014ValSer: 3.014 ± 0.0
5.828ValThr: 5.828 ± 0.0
3.818ValVal: 3.818 ± 0.0
1.407ValTrp: 1.407 ± 0.0
2.01ValTyr: 2.01 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.402TrpAla: 0.402 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.407TrpAsp: 1.407 ± 0.0
0.603TrpGlu: 0.603 ± 0.0
1.206TrpPhe: 1.206 ± 0.0
0.402TrpGly: 0.402 ± 0.0
0.603TrpHis: 0.603 ± 0.0
2.01TrpIle: 2.01 ± 0.0
0.402TrpLys: 0.402 ± 0.0
2.01TrpLeu: 2.01 ± 0.0
0.603TrpMet: 0.603 ± 0.0
0.402TrpAsn: 0.402 ± 0.0
0.201TrpPro: 0.201 ± 0.0
0.603TrpGln: 0.603 ± 0.0
0.804TrpArg: 0.804 ± 0.0
0.603TrpSer: 0.603 ± 0.0
0.201TrpThr: 0.201 ± 0.0
2.01TrpVal: 2.01 ± 0.0
0.201TrpTrp: 0.201 ± 0.0
1.407TrpTyr: 1.407 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.407TyrAla: 1.407 ± 0.0
0.804TyrCys: 0.804 ± 0.0
2.814TyrAsp: 2.814 ± 0.0
2.814TyrGlu: 2.814 ± 0.0
2.814TyrPhe: 2.814 ± 0.0
2.613TyrGly: 2.613 ± 0.0
2.211TyrHis: 2.211 ± 0.0
3.014TyrIle: 3.014 ± 0.0
1.608TyrLys: 1.608 ± 0.0
3.215TyrLeu: 3.215 ± 0.0
1.206TyrMet: 1.206 ± 0.0
3.416TyrAsn: 3.416 ± 0.0
2.01TyrPro: 2.01 ± 0.0
1.206TyrGln: 1.206 ± 0.0
1.608TyrArg: 1.608 ± 0.0
2.412TyrSer: 2.412 ± 0.0
1.608TyrThr: 1.608 ± 0.0
2.613TyrVal: 2.613 ± 0.0
0.804TyrTrp: 0.804 ± 0.0
1.407TyrTyr: 1.407 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (4977 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski