Amino acid dipepetide frequency for Changping earthworm virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.938AlaAla: 4.938 ± 0.0
0.705AlaCys: 0.705 ± 0.0
4.233AlaAsp: 4.233 ± 0.0
4.233AlaGlu: 4.233 ± 0.0
2.822AlaPhe: 2.822 ± 0.0
4.586AlaGly: 4.586 ± 0.0
2.822AlaHis: 2.822 ± 0.0
4.586AlaIle: 4.586 ± 0.0
4.586AlaLys: 4.586 ± 0.0
4.938AlaLeu: 4.938 ± 0.0
1.058AlaMet: 1.058 ± 0.0
3.527AlaAsn: 3.527 ± 0.0
3.88AlaPro: 3.88 ± 0.0
2.116AlaGln: 2.116 ± 0.0
3.175AlaArg: 3.175 ± 0.0
4.233AlaSer: 4.233 ± 0.0
4.233AlaThr: 4.233 ± 0.0
6.349AlaVal: 6.349 ± 0.0
0.353AlaTrp: 0.353 ± 0.0
1.411AlaTyr: 1.411 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.411CysAla: 1.411 ± 0.0
0.705CysCys: 0.705 ± 0.0
1.411CysAsp: 1.411 ± 0.0
2.116CysGlu: 2.116 ± 0.0
0.353CysPhe: 0.353 ± 0.0
2.469CysGly: 2.469 ± 0.0
0.705CysHis: 0.705 ± 0.0
0.353CysIle: 0.353 ± 0.0
0.705CysLys: 0.705 ± 0.0
1.411CysLeu: 1.411 ± 0.0
0.705CysMet: 0.705 ± 0.0
0.353CysAsn: 0.353 ± 0.0
0.705CysPro: 0.705 ± 0.0
0.705CysGln: 0.705 ± 0.0
0.705CysArg: 0.705 ± 0.0
1.058CysSer: 1.058 ± 0.0
2.116CysThr: 2.116 ± 0.0
2.116CysVal: 2.116 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.353CysTyr: 0.353 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.822AspAla: 2.822 ± 0.0
2.116AspCys: 2.116 ± 0.0
5.996AspAsp: 5.996 ± 0.0
4.233AspGlu: 4.233 ± 0.0
2.822AspPhe: 2.822 ± 0.0
3.88AspGly: 3.88 ± 0.0
1.411AspHis: 1.411 ± 0.0
2.116AspIle: 2.116 ± 0.0
2.822AspLys: 2.822 ± 0.0
7.407AspLeu: 7.407 ± 0.0
0.705AspMet: 0.705 ± 0.0
1.058AspAsn: 1.058 ± 0.0
1.411AspPro: 1.411 ± 0.0
1.764AspGln: 1.764 ± 0.0
2.116AspArg: 2.116 ± 0.0
5.644AspSer: 5.644 ± 0.0
1.764AspThr: 1.764 ± 0.0
3.88AspVal: 3.88 ± 0.0
0.353AspTrp: 0.353 ± 0.0
3.175AspTyr: 3.175 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.233GluAla: 4.233 ± 0.0
1.764GluCys: 1.764 ± 0.0
2.469GluAsp: 2.469 ± 0.0
3.175GluGlu: 3.175 ± 0.0
4.233GluPhe: 4.233 ± 0.0
1.411GluGly: 1.411 ± 0.0
2.469GluHis: 2.469 ± 0.0
5.291GluIle: 5.291 ± 0.0
5.996GluLys: 5.996 ± 0.0
5.291GluLeu: 5.291 ± 0.0
1.764GluMet: 1.764 ± 0.0
1.411GluAsn: 1.411 ± 0.0
2.469GluPro: 2.469 ± 0.0
1.764GluGln: 1.764 ± 0.0
1.764GluArg: 1.764 ± 0.0
3.175GluSer: 3.175 ± 0.0
3.527GluThr: 3.527 ± 0.0
4.233GluVal: 4.233 ± 0.0
1.411GluTrp: 1.411 ± 0.0
3.527GluTyr: 3.527 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.822PheAla: 2.822 ± 0.0
0.353PheCys: 0.353 ± 0.0
3.175PheAsp: 3.175 ± 0.0
2.469PheGlu: 2.469 ± 0.0
3.88PhePhe: 3.88 ± 0.0
2.469PheGly: 2.469 ± 0.0
1.764PheHis: 1.764 ± 0.0
1.411PheIle: 1.411 ± 0.0
2.116PheLys: 2.116 ± 0.0
7.055PheLeu: 7.055 ± 0.0
0.353PheMet: 0.353 ± 0.0
3.175PheAsn: 3.175 ± 0.0
3.175PhePro: 3.175 ± 0.0
1.411PheGln: 1.411 ± 0.0
2.822PheArg: 2.822 ± 0.0
3.175PheSer: 3.175 ± 0.0
4.586PheThr: 4.586 ± 0.0
4.233PheVal: 4.233 ± 0.0
0.705PheTrp: 0.705 ± 0.0
2.469PheTyr: 2.469 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.88GlyAla: 3.88 ± 0.0
1.411GlyCys: 1.411 ± 0.0
4.938GlyAsp: 4.938 ± 0.0
1.764GlyGlu: 1.764 ± 0.0
3.88GlyPhe: 3.88 ± 0.0
5.996GlyGly: 5.996 ± 0.0
1.058GlyHis: 1.058 ± 0.0
5.644GlyIle: 5.644 ± 0.0
5.996GlyLys: 5.996 ± 0.0
4.233GlyLeu: 4.233 ± 0.0
0.353GlyMet: 0.353 ± 0.0
3.175GlyAsn: 3.175 ± 0.0
1.411GlyPro: 1.411 ± 0.0
0.705GlyGln: 0.705 ± 0.0
4.586GlyArg: 4.586 ± 0.0
3.88GlySer: 3.88 ± 0.0
2.469GlyThr: 2.469 ± 0.0
3.175GlyVal: 3.175 ± 0.0
0.705GlyTrp: 0.705 ± 0.0
1.764GlyTyr: 1.764 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.411HisAla: 1.411 ± 0.0
0.705HisCys: 0.705 ± 0.0
1.058HisAsp: 1.058 ± 0.0
1.411HisGlu: 1.411 ± 0.0
1.764HisPhe: 1.764 ± 0.0
2.469HisGly: 2.469 ± 0.0
0.705HisHis: 0.705 ± 0.0
1.411HisIle: 1.411 ± 0.0
2.116HisLys: 2.116 ± 0.0
3.175HisLeu: 3.175 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.764HisAsn: 1.764 ± 0.0
1.411HisPro: 1.411 ± 0.0
1.764HisGln: 1.764 ± 0.0
1.058HisArg: 1.058 ± 0.0
2.116HisSer: 2.116 ± 0.0
1.764HisThr: 1.764 ± 0.0
2.116HisVal: 2.116 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.469HisTyr: 2.469 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.822IleAla: 2.822 ± 0.0
1.058IleCys: 1.058 ± 0.0
2.116IleAsp: 2.116 ± 0.0
3.88IleGlu: 3.88 ± 0.0
3.175IlePhe: 3.175 ± 0.0
4.233IleGly: 4.233 ± 0.0
2.469IleHis: 2.469 ± 0.0
1.411IleIle: 1.411 ± 0.0
2.469IleLys: 2.469 ± 0.0
3.527IleLeu: 3.527 ± 0.0
1.058IleMet: 1.058 ± 0.0
1.764IleAsn: 1.764 ± 0.0
3.527IlePro: 3.527 ± 0.0
2.116IleGln: 2.116 ± 0.0
4.586IleArg: 4.586 ± 0.0
2.469IleSer: 2.469 ± 0.0
3.527IleThr: 3.527 ± 0.0
3.88IleVal: 3.88 ± 0.0
1.058IleTrp: 1.058 ± 0.0
3.527IleTyr: 3.527 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.88LysAla: 3.88 ± 0.0
0.353LysCys: 0.353 ± 0.0
3.175LysAsp: 3.175 ± 0.0
3.88LysGlu: 3.88 ± 0.0
4.586LysPhe: 4.586 ± 0.0
2.469LysGly: 2.469 ± 0.0
1.411LysHis: 1.411 ± 0.0
2.469LysIle: 2.469 ± 0.0
4.586LysLys: 4.586 ± 0.0
3.175LysLeu: 3.175 ± 0.0
0.705LysMet: 0.705 ± 0.0
2.469LysAsn: 2.469 ± 0.0
3.175LysPro: 3.175 ± 0.0
2.822LysGln: 2.822 ± 0.0
2.822LysArg: 2.822 ± 0.0
3.527LysSer: 3.527 ± 0.0
7.76LysThr: 7.76 ± 0.0
4.938LysVal: 4.938 ± 0.0
0.353LysTrp: 0.353 ± 0.0
2.822LysTyr: 2.822 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.055LeuAla: 7.055 ± 0.0
1.058LeuCys: 1.058 ± 0.0
7.055LeuAsp: 7.055 ± 0.0
6.349LeuGlu: 6.349 ± 0.0
2.822LeuPhe: 2.822 ± 0.0
3.527LeuGly: 3.527 ± 0.0
2.116LeuHis: 2.116 ± 0.0
2.116LeuIle: 2.116 ± 0.0
5.291LeuLys: 5.291 ± 0.0
6.702LeuLeu: 6.702 ± 0.0
1.411LeuMet: 1.411 ± 0.0
5.291LeuAsn: 5.291 ± 0.0
3.527LeuPro: 3.527 ± 0.0
3.88LeuGln: 3.88 ± 0.0
4.938LeuArg: 4.938 ± 0.0
3.527LeuSer: 3.527 ± 0.0
4.586LeuThr: 4.586 ± 0.0
5.291LeuVal: 5.291 ± 0.0
1.411LeuTrp: 1.411 ± 0.0
3.175LeuTyr: 3.175 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.705MetAla: 0.705 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.411MetAsp: 1.411 ± 0.0
0.705MetGlu: 0.705 ± 0.0
0.353MetPhe: 0.353 ± 0.0
0.705MetGly: 0.705 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.058MetIle: 1.058 ± 0.0
2.822MetLys: 2.822 ± 0.0
1.058MetLeu: 1.058 ± 0.0
0.353MetMet: 0.353 ± 0.0
0.705MetAsn: 0.705 ± 0.0
2.116MetPro: 2.116 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.411MetArg: 1.411 ± 0.0
1.058MetSer: 1.058 ± 0.0
1.764MetThr: 1.764 ± 0.0
0.705MetVal: 0.705 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.705MetTyr: 0.705 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.175AsnAla: 3.175 ± 0.0
0.0AsnCys: 0.0 ± 0.0
2.822AsnAsp: 2.822 ± 0.0
3.527AsnGlu: 3.527 ± 0.0
4.233AsnPhe: 4.233 ± 0.0
2.469AsnGly: 2.469 ± 0.0
1.411AsnHis: 1.411 ± 0.0
3.527AsnIle: 3.527 ± 0.0
1.764AsnLys: 1.764 ± 0.0
2.469AsnLeu: 2.469 ± 0.0
1.411AsnMet: 1.411 ± 0.0
2.116AsnAsn: 2.116 ± 0.0
1.411AsnPro: 1.411 ± 0.0
1.764AsnGln: 1.764 ± 0.0
1.411AsnArg: 1.411 ± 0.0
4.586AsnSer: 4.586 ± 0.0
1.764AsnThr: 1.764 ± 0.0
1.411AsnVal: 1.411 ± 0.0
0.353AsnTrp: 0.353 ± 0.0
1.411AsnTyr: 1.411 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.527ProAla: 3.527 ± 0.0
1.058ProCys: 1.058 ± 0.0
2.469ProAsp: 2.469 ± 0.0
2.469ProGlu: 2.469 ± 0.0
3.88ProPhe: 3.88 ± 0.0
2.822ProGly: 2.822 ± 0.0
1.058ProHis: 1.058 ± 0.0
4.586ProIle: 4.586 ± 0.0
4.233ProLys: 4.233 ± 0.0
2.469ProLeu: 2.469 ± 0.0
0.0ProMet: 0.0 ± 0.0
1.764ProAsn: 1.764 ± 0.0
1.411ProPro: 1.411 ± 0.0
2.469ProGln: 2.469 ± 0.0
1.058ProArg: 1.058 ± 0.0
3.527ProSer: 3.527 ± 0.0
2.822ProThr: 2.822 ± 0.0
4.938ProVal: 4.938 ± 0.0
0.705ProTrp: 0.705 ± 0.0
2.822ProTyr: 2.822 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.175GlnAla: 3.175 ± 0.0
0.705GlnCys: 0.705 ± 0.0
0.705GlnAsp: 0.705 ± 0.0
1.411GlnGlu: 1.411 ± 0.0
1.058GlnPhe: 1.058 ± 0.0
3.88GlnGly: 3.88 ± 0.0
1.058GlnHis: 1.058 ± 0.0
4.586GlnIle: 4.586 ± 0.0
1.411GlnLys: 1.411 ± 0.0
1.411GlnLeu: 1.411 ± 0.0
0.353GlnMet: 0.353 ± 0.0
1.764GlnAsn: 1.764 ± 0.0
3.175GlnPro: 3.175 ± 0.0
1.764GlnGln: 1.764 ± 0.0
2.822GlnArg: 2.822 ± 0.0
1.764GlnSer: 1.764 ± 0.0
2.469GlnThr: 2.469 ± 0.0
2.822GlnVal: 2.822 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.058GlnTyr: 1.058 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.058ArgAla: 1.058 ± 0.0
1.058ArgCys: 1.058 ± 0.0
1.411ArgAsp: 1.411 ± 0.0
3.175ArgGlu: 3.175 ± 0.0
2.469ArgPhe: 2.469 ± 0.0
1.764ArgGly: 1.764 ± 0.0
1.764ArgHis: 1.764 ± 0.0
3.175ArgIle: 3.175 ± 0.0
2.822ArgLys: 2.822 ± 0.0
3.527ArgLeu: 3.527 ± 0.0
1.411ArgMet: 1.411 ± 0.0
2.116ArgAsn: 2.116 ± 0.0
3.88ArgPro: 3.88 ± 0.0
2.822ArgGln: 2.822 ± 0.0
2.116ArgArg: 2.116 ± 0.0
1.764ArgSer: 1.764 ± 0.0
3.527ArgThr: 3.527 ± 0.0
4.586ArgVal: 4.586 ± 0.0
0.353ArgTrp: 0.353 ± 0.0
2.116ArgTyr: 2.116 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.586SerAla: 4.586 ± 0.0
2.116SerCys: 2.116 ± 0.0
4.586SerAsp: 4.586 ± 0.0
3.527SerGlu: 3.527 ± 0.0
2.116SerPhe: 2.116 ± 0.0
3.527SerGly: 3.527 ± 0.0
2.116SerHis: 2.116 ± 0.0
2.822SerIle: 2.822 ± 0.0
2.469SerLys: 2.469 ± 0.0
7.055SerLeu: 7.055 ± 0.0
0.705SerMet: 0.705 ± 0.0
1.411SerAsn: 1.411 ± 0.0
3.88SerPro: 3.88 ± 0.0
2.469SerGln: 2.469 ± 0.0
1.058SerArg: 1.058 ± 0.0
4.586SerSer: 4.586 ± 0.0
3.88SerThr: 3.88 ± 0.0
4.938SerVal: 4.938 ± 0.0
0.353SerTrp: 0.353 ± 0.0
2.116SerTyr: 2.116 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.291ThrAla: 5.291 ± 0.0
2.116ThrCys: 2.116 ± 0.0
3.175ThrAsp: 3.175 ± 0.0
4.586ThrGlu: 4.586 ± 0.0
4.233ThrPhe: 4.233 ± 0.0
4.586ThrGly: 4.586 ± 0.0
2.469ThrHis: 2.469 ± 0.0
4.586ThrIle: 4.586 ± 0.0
3.527ThrLys: 3.527 ± 0.0
5.996ThrLeu: 5.996 ± 0.0
1.411ThrMet: 1.411 ± 0.0
3.88ThrAsn: 3.88 ± 0.0
2.469ThrPro: 2.469 ± 0.0
3.175ThrGln: 3.175 ± 0.0
1.764ThrArg: 1.764 ± 0.0
1.411ThrSer: 1.411 ± 0.0
7.76ThrThr: 7.76 ± 0.0
3.175ThrVal: 3.175 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
3.175ThrTyr: 3.175 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.76ValAla: 7.76 ± 0.0
1.058ValCys: 1.058 ± 0.0
2.822ValAsp: 2.822 ± 0.0
4.586ValGlu: 4.586 ± 0.0
1.764ValPhe: 1.764 ± 0.0
4.586ValGly: 4.586 ± 0.0
1.411ValHis: 1.411 ± 0.0
2.116ValIle: 2.116 ± 0.0
2.822ValLys: 2.822 ± 0.0
4.938ValLeu: 4.938 ± 0.0
2.116ValMet: 2.116 ± 0.0
3.88ValAsn: 3.88 ± 0.0
4.938ValPro: 4.938 ± 0.0
2.469ValGln: 2.469 ± 0.0
4.233ValArg: 4.233 ± 0.0
4.586ValSer: 4.586 ± 0.0
6.349ValThr: 6.349 ± 0.0
4.233ValVal: 4.233 ± 0.0
1.411ValTrp: 1.411 ± 0.0
3.527ValTyr: 3.527 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.764TrpAla: 1.764 ± 0.0
0.353TrpCys: 0.353 ± 0.0
0.705TrpAsp: 0.705 ± 0.0
1.058TrpGlu: 1.058 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.353TrpGly: 0.353 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.705TrpLys: 0.705 ± 0.0
0.705TrpLeu: 0.705 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.353TrpPro: 0.353 ± 0.0
0.705TrpGln: 0.705 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.058TrpSer: 1.058 ± 0.0
0.353TrpThr: 0.353 ± 0.0
1.058TrpVal: 1.058 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.705TrpTyr: 0.705 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.469TyrAla: 2.469 ± 0.0
1.764TyrCys: 1.764 ± 0.0
1.764TyrAsp: 1.764 ± 0.0
2.822TyrGlu: 2.822 ± 0.0
3.175TyrPhe: 3.175 ± 0.0
2.822TyrGly: 2.822 ± 0.0
2.469TyrHis: 2.469 ± 0.0
1.411TyrIle: 1.411 ± 0.0
2.116TyrLys: 2.116 ± 0.0
4.586TyrLeu: 4.586 ± 0.0
1.411TyrMet: 1.411 ± 0.0
1.411TyrAsn: 1.411 ± 0.0
1.764TyrPro: 1.764 ± 0.0
0.705TyrGln: 0.705 ± 0.0
2.469TyrArg: 2.469 ± 0.0
3.175TyrSer: 3.175 ± 0.0
2.116TyrThr: 2.116 ± 0.0
3.527TyrVal: 3.527 ± 0.0
0.353TyrTrp: 0.353 ± 0.0
1.411TyrTyr: 1.411 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2836 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski