Amino acid dipeptide frequency for Changjiang tombus-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
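The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # the 20 standard amino acids
ALPHABET = STANDARD + "X"              # X stands for Xaa (non-standard or unknown)
EXPECTED_PER_MILLE = 1000.0 / 400      # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: overlapping pairs are counted inside each protein and
    normalized by the total number of pairs in the whole set.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one protein with two or more residues")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not the proteome described here).
toy_proteins = ["MKVLAAGLL", "AAU"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation would be shown in red, those below in blue.
over = sorted(d for d, f in freqs.items() if f > EXPECTED_PER_MILLE)
print(len(freqs), over)
```

With a real proteome, the same dictionary can be laid out as a 21 × 21 grid with the first residue as the row and the second residue as the column, as in the table below.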

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
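As a quick illustration of the unit conversion, the following sketch turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform expectation as the reference level is an assumption taken from the description above.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides (2.5 per mille)


def per_mille_to_fraction(value):
    """Per mille -> fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


def fold_over_uniform(value):
    """How many times more (or less) frequent than the uniform expectation."""
    return value / EXPECTED_PER_MILLE


ala_ala = 13.095  # AlaAla value from the table, in per mille
print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")
print(f"percent:  {per_mille_to_percent(ala_ala):.4f}")
print(f"fold:     {fold_over_uniform(ala_ala):.3f}")
```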

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
13.095AlaAla: 13.095 ± 5.931
2.381AlaCys: 2.381 ± 0.786
4.762AlaAsp: 4.762 ± 0.818
0.0AlaGlu: 0.0 ± 0.0
3.571AlaPhe: 3.571 ± 1.633
5.952AlaGly: 5.952 ± 3.102
0.0AlaHis: 0.0 ± 0.0
3.571AlaIle: 3.571 ± 0.44
2.381AlaLys: 2.381 ± 1.703
11.905AlaLeu: 11.905 ± 1.239
1.19AlaMet: 1.19 ± 0.905
4.762AlaAsn: 4.762 ± 0.756
4.762AlaPro: 4.762 ± 2.327
1.19AlaGln: 1.19 ± 0.905
2.381AlaArg: 2.381 ± 1.125
8.333AlaSer: 8.333 ± 3.483
11.905AlaThr: 11.905 ± 4.374
2.381AlaVal: 2.381 ± 2.45
2.381AlaTrp: 2.381 ± 1.164
3.571AlaTyr: 3.571 ± 1.366
0.0AlaXaa: 0.0 ± 0.0
Cys
1.19CysAla: 1.19 ± 1.225
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.19CysPhe: 1.19 ± 1.225
3.571CysGly: 3.571 ± 1.633
0.0CysHis: 0.0 ± 0.0
1.19CysIle: 1.19 ± 0.905
1.19CysLys: 1.19 ± 0.905
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
2.381CysAsn: 2.381 ± 1.703
1.19CysPro: 1.19 ± 0.851
1.19CysGln: 1.19 ± 0.905
1.19CysArg: 1.19 ± 0.905
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.381CysVal: 2.381 ± 0.786
1.19CysTrp: 1.19 ± 1.225
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.571AspAla: 3.571 ± 0.44
1.19AspCys: 1.19 ± 0.905
3.571AspAsp: 3.571 ± 2.171
4.762AspGlu: 4.762 ± 2.25
3.571AspPhe: 3.571 ± 1.631
3.571AspGly: 3.571 ± 1.465
2.381AspHis: 2.381 ± 1.703
0.0AspIle: 0.0 ± 0.0
3.571AspLys: 3.571 ± 1.633
0.0AspLeu: 0.0 ± 0.0
4.762AspMet: 4.762 ± 2.25
0.0AspAsn: 0.0 ± 0.0
5.952AspPro: 5.952 ± 2.17
3.571AspGln: 3.571 ± 1.465
0.0AspArg: 0.0 ± 0.0
1.19AspSer: 1.19 ± 0.905
1.19AspThr: 1.19 ± 0.905
2.381AspVal: 2.381 ± 0.786
3.571AspTrp: 3.571 ± 1.633
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.762GluAla: 4.762 ± 2.136
0.0GluCys: 0.0 ± 0.0
2.381GluAsp: 2.381 ± 1.125
3.571GluGlu: 3.571 ± 2.715
4.762GluPhe: 4.762 ± 3.62
3.571GluGly: 3.571 ± 1.633
2.381GluHis: 2.381 ± 1.81
3.571GluIle: 3.571 ± 3.676
0.0GluLys: 0.0 ± 0.0
5.952GluLeu: 5.952 ± 0.348
2.381GluMet: 2.381 ± 1.125
1.19GluAsn: 1.19 ± 0.905
0.0GluPro: 0.0 ± 0.0
1.19GluGln: 1.19 ± 0.905
3.571GluArg: 3.571 ± 1.633
1.19GluSer: 1.19 ± 1.225
1.19GluThr: 1.19 ± 0.905
3.571GluVal: 3.571 ± 1.633
2.381GluTrp: 2.381 ± 0.786
1.19GluTyr: 1.19 ± 1.225
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
2.381PheCys: 2.381 ± 0.786
2.381PheAsp: 2.381 ± 1.81
3.571PheGlu: 3.571 ± 1.633
0.0PhePhe: 0.0 ± 0.0
2.381PheGly: 2.381 ± 1.81
0.0PheHis: 0.0 ± 0.0
2.381PheIle: 2.381 ± 1.81
3.571PheLys: 3.571 ± 1.465
3.571PheLeu: 3.571 ± 1.465
0.0PheMet: 0.0 ± 0.0
2.381PheAsn: 2.381 ± 0.786
3.571PhePro: 3.571 ± 1.366
3.571PheGln: 3.571 ± 1.631
5.952PheArg: 5.952 ± 0.348
1.19PheSer: 1.19 ± 1.225
1.19PheThr: 1.19 ± 0.851
3.571PheVal: 3.571 ± 0.44
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.381GlyAla: 2.381 ± 1.703
2.381GlyCys: 2.381 ± 1.125
4.762GlyAsp: 4.762 ± 2.389
2.381GlyGlu: 2.381 ± 1.703
3.571GlyPhe: 3.571 ± 1.366
1.19GlyGly: 1.19 ± 0.851
2.381GlyHis: 2.381 ± 1.125
3.571GlyIle: 3.571 ± 1.366
2.381GlyLys: 2.381 ± 1.703
8.333GlyLeu: 8.333 ± 0.953
2.381GlyMet: 2.381 ± 0.786
2.381GlyAsn: 2.381 ± 0.786
3.571GlyPro: 3.571 ± 1.366
0.0GlyGln: 0.0 ± 0.0
3.571GlyArg: 3.571 ± 2.234
5.952GlySer: 5.952 ± 1.549
1.19GlyThr: 1.19 ± 0.851
5.952GlyVal: 5.952 ± 3.102
1.19GlyTrp: 1.19 ± 0.905
4.762GlyTyr: 4.762 ± 2.136
0.0GlyXaa: 0.0 ± 0.0
His
2.381HisAla: 2.381 ± 1.81
0.0HisCys: 0.0 ± 0.0
1.19HisAsp: 1.19 ± 0.905
0.0HisGlu: 0.0 ± 0.0
1.19HisPhe: 1.19 ± 1.225
1.19HisGly: 1.19 ± 0.905
1.19HisHis: 1.19 ± 0.905
0.0HisIle: 0.0 ± 0.0
2.381HisLys: 2.381 ± 1.125
4.762HisLeu: 4.762 ± 0.818
1.19HisMet: 1.19 ± 1.016
2.381HisAsn: 2.381 ± 1.703
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
2.381HisArg: 2.381 ± 1.81
2.381HisSer: 2.381 ± 1.164
1.19HisThr: 1.19 ± 0.851
2.381HisVal: 2.381 ± 1.125
1.19HisTrp: 1.19 ± 0.851
1.19HisTyr: 1.19 ± 1.225
0.0HisXaa: 0.0 ± 0.0
Ile
5.952IleAla: 5.952 ± 1.51
1.19IleCys: 1.19 ± 0.905
2.381IleAsp: 2.381 ± 1.81
2.381IleGlu: 2.381 ± 1.81
1.19IlePhe: 1.19 ± 0.905
3.571IleGly: 3.571 ± 1.631
1.19IleHis: 1.19 ± 0.905
4.762IleIle: 4.762 ± 2.25
2.381IleLys: 2.381 ± 1.81
5.952IleLeu: 5.952 ± 6.126
0.0IleMet: 0.0 ± 0.0
1.19IleAsn: 1.19 ± 0.905
1.19IlePro: 1.19 ± 0.851
0.0IleGln: 0.0 ± 0.0
1.19IleArg: 1.19 ± 0.905
1.19IleSer: 1.19 ± 1.225
3.571IleThr: 3.571 ± 1.465
3.571IleVal: 3.571 ± 2.234
0.0IleTrp: 0.0 ± 0.0
1.19IleTyr: 1.19 ± 0.905
0.0IleXaa: 0.0 ± 0.0
Lys
3.571LysAla: 3.571 ± 1.633
0.0LysCys: 0.0 ± 0.0
1.19LysAsp: 1.19 ± 0.905
2.381LysGlu: 2.381 ± 1.125
0.0LysPhe: 0.0 ± 0.0
3.571LysGly: 3.571 ± 2.554
2.381LysHis: 2.381 ± 1.125
0.0LysIle: 0.0 ± 0.0
1.19LysLys: 1.19 ± 0.905
4.762LysLeu: 4.762 ± 1.665
0.0LysMet: 0.0 ± 0.0
1.19LysAsn: 1.19 ± 0.851
1.19LysPro: 1.19 ± 0.905
2.381LysGln: 2.381 ± 1.703
2.381LysArg: 2.381 ± 1.125
1.19LysSer: 1.19 ± 0.851
2.381LysThr: 2.381 ± 0.786
2.381LysVal: 2.381 ± 2.45
2.381LysTrp: 2.381 ± 0.786
3.571LysTyr: 3.571 ± 0.44
0.0LysXaa: 0.0 ± 0.0
Leu
8.333LeuAla: 8.333 ± 1.837
1.19LeuCys: 1.19 ± 0.851
4.762LeuAsp: 4.762 ± 0.818
9.524LeuGlu: 9.524 ± 3.633
1.19LeuPhe: 1.19 ± 0.905
5.952LeuGly: 5.952 ± 2.059
3.571LeuHis: 3.571 ± 0.44
1.19LeuIle: 1.19 ± 1.225
4.762LeuLys: 4.762 ± 3.409
13.095LeuLeu: 13.095 ± 1.721
4.762LeuMet: 4.762 ± 0.722
3.571LeuAsn: 3.571 ± 1.465
9.524LeuPro: 9.524 ± 1.874
0.0LeuGln: 0.0 ± 0.0
2.381LeuArg: 2.381 ± 1.164
3.571LeuSer: 3.571 ± 1.631
5.952LeuThr: 5.952 ± 1.549
5.952LeuVal: 5.952 ± 2.89
0.0LeuTrp: 0.0 ± 0.0
5.952LeuTyr: 5.952 ± 2.655
0.0LeuXaa: 0.0 ± 0.0
Met
4.762MetAla: 4.762 ± 0.756
1.19MetCys: 1.19 ± 0.905
1.19MetAsp: 1.19 ± 1.225
2.381MetGlu: 2.381 ± 1.125
1.19MetPhe: 1.19 ± 0.905
2.381MetGly: 2.381 ± 2.45
1.19MetHis: 1.19 ± 0.905
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.19MetLeu: 1.19 ± 0.905
1.19MetMet: 1.19 ± 0.905
1.19MetAsn: 1.19 ± 0.905
2.381MetPro: 2.381 ± 1.703
1.19MetGln: 1.19 ± 0.905
0.0MetArg: 0.0 ± 0.0
1.19MetSer: 1.19 ± 0.905
2.381MetThr: 2.381 ± 1.164
3.571MetVal: 3.571 ± 0.44
0.0MetTrp: 0.0 ± 0.0
2.381MetTyr: 2.381 ± 0.786
0.0MetXaa: 0.0 ± 0.0
Asn
2.381AsnAla: 2.381 ± 1.703
0.0AsnCys: 0.0 ± 0.0
1.19AsnAsp: 1.19 ± 0.905
2.381AsnGlu: 2.381 ± 1.125
0.0AsnPhe: 0.0 ± 0.0
2.381AsnGly: 2.381 ± 1.703
0.0AsnHis: 0.0 ± 0.0
4.762AsnIle: 4.762 ± 1.571
1.19AsnLys: 1.19 ± 0.851
4.762AsnLeu: 4.762 ± 2.136
3.571AsnMet: 3.571 ± 1.465
2.381AsnAsn: 2.381 ± 1.703
2.381AsnPro: 2.381 ± 0.786
1.19AsnGln: 1.19 ± 0.851
1.19AsnArg: 1.19 ± 0.851
1.19AsnSer: 1.19 ± 0.851
4.762AsnThr: 4.762 ± 2.136
3.571AsnVal: 3.571 ± 1.465
0.0AsnTrp: 0.0 ± 0.0
1.19AsnTyr: 1.19 ± 0.905
0.0AsnXaa: 0.0 ± 0.0
Pro
4.762ProAla: 4.762 ± 2.327
3.571ProCys: 3.571 ± 0.44
3.571ProAsp: 3.571 ± 0.44
0.0ProGlu: 0.0 ± 0.0
2.381ProPhe: 2.381 ± 0.786
1.19ProGly: 1.19 ± 0.851
2.381ProHis: 2.381 ± 0.786
2.381ProIle: 2.381 ± 2.45
1.19ProLys: 1.19 ± 0.851
2.381ProLeu: 2.381 ± 1.164
1.19ProMet: 1.19 ± 0.905
0.0ProAsn: 0.0 ± 0.0
0.0ProPro: 0.0 ± 0.0
2.381ProGln: 2.381 ± 0.786
2.381ProArg: 2.381 ± 1.703
5.952ProSer: 5.952 ± 0.348
0.0ProThr: 0.0 ± 0.0
10.714ProVal: 10.714 ± 3.68
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.571GlnAla: 3.571 ± 1.366
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.19GlnGlu: 1.19 ± 0.851
0.0GlnPhe: 0.0 ± 0.0
2.381GlnGly: 2.381 ± 1.164
1.19GlnHis: 1.19 ± 0.905
1.19GlnIle: 1.19 ± 0.905
0.0GlnLys: 0.0 ± 0.0
3.571GlnLeu: 3.571 ± 1.366
0.0GlnMet: 0.0 ± 0.0
4.762GlnAsn: 4.762 ± 2.136
1.19GlnPro: 1.19 ± 0.905
1.19GlnGln: 1.19 ± 0.851
4.762GlnArg: 4.762 ± 1.665
1.19GlnSer: 1.19 ± 0.905
1.19GlnThr: 1.19 ± 0.905
3.571GlnVal: 3.571 ± 1.465
0.0GlnTrp: 0.0 ± 0.0
3.571GlnTyr: 3.571 ± 1.465
0.0GlnXaa: 0.0 ± 0.0
Arg
9.524ArgAla: 9.524 ± 3.633
1.19ArgCys: 1.19 ± 1.225
3.571ArgAsp: 3.571 ± 1.465
3.571ArgGlu: 3.571 ± 1.465
5.952ArgPhe: 5.952 ± 3.181
1.19ArgGly: 1.19 ± 0.851
2.381ArgHis: 2.381 ± 2.45
2.381ArgIle: 2.381 ± 2.45
1.19ArgLys: 1.19 ± 1.225
0.0ArgLeu: 0.0 ± 0.0
1.19ArgMet: 1.19 ± 1.225
2.381ArgAsn: 2.381 ± 0.786
1.19ArgPro: 1.19 ± 0.905
4.762ArgGln: 4.762 ± 0.818
8.333ArgArg: 8.333 ± 5.633
3.571ArgSer: 3.571 ± 0.44
4.762ArgThr: 4.762 ± 2.329
8.333ArgVal: 8.333 ± 2.033
0.0ArgTrp: 0.0 ± 0.0
1.19ArgTyr: 1.19 ± 0.851
0.0ArgXaa: 0.0 ± 0.0
Ser
8.333SerAla: 8.333 ± 3.226
1.19SerCys: 1.19 ± 1.225
1.19SerAsp: 1.19 ± 0.851
1.19SerGlu: 1.19 ± 0.905
2.381SerPhe: 2.381 ± 1.703
3.571SerGly: 3.571 ± 1.366
3.571SerHis: 3.571 ± 0.44
1.19SerIle: 1.19 ± 0.905
2.381SerLys: 2.381 ± 1.164
4.762SerLeu: 4.762 ± 2.25
0.0SerMet: 0.0 ± 0.0
1.19SerAsn: 1.19 ± 0.851
1.19SerPro: 1.19 ± 1.225
1.19SerGln: 1.19 ± 0.905
9.524SerArg: 9.524 ± 3.207
1.19SerSer: 1.19 ± 1.225
5.952SerThr: 5.952 ± 2.952
7.143SerVal: 7.143 ± 0.881
0.0SerTrp: 0.0 ± 0.0
2.381SerTyr: 2.381 ± 1.164
0.0SerXaa: 0.0 ± 0.0
Thr
3.571ThrAla: 3.571 ± 0.44
0.0ThrCys: 0.0 ± 0.0
1.19ThrAsp: 1.19 ± 0.851
2.381ThrGlu: 2.381 ± 1.81
2.381ThrPhe: 2.381 ± 1.164
4.762ThrGly: 4.762 ± 2.136
1.19ThrHis: 1.19 ± 0.851
3.571ThrIle: 3.571 ± 1.465
4.762ThrLys: 4.762 ± 0.756
4.762ThrLeu: 4.762 ± 3.406
2.381ThrMet: 2.381 ± 0.795
2.381ThrAsn: 2.381 ± 1.703
2.381ThrPro: 2.381 ± 1.703
3.571ThrGln: 3.571 ± 1.631
0.0ThrArg: 0.0 ± 0.0
9.524ThrSer: 9.524 ± 1.646
4.762ThrThr: 4.762 ± 2.136
4.762ThrVal: 4.762 ± 2.327
1.19ThrTrp: 1.19 ± 1.225
3.571ThrTyr: 3.571 ± 1.366
0.0ThrXaa: 0.0 ± 0.0
Val
3.571ValAla: 3.571 ± 1.366
0.0ValCys: 0.0 ± 0.0
7.143ValAsp: 7.143 ± 1.011
4.762ValGlu: 4.762 ± 3.62
5.952ValPhe: 5.952 ± 2.17
9.524ValGly: 9.524 ± 1.513
1.19ValHis: 1.19 ± 0.851
3.571ValIle: 3.571 ± 0.44
2.381ValLys: 2.381 ± 1.125
9.524ValLeu: 9.524 ± 6.566
1.19ValMet: 1.19 ± 0.851
0.0ValAsn: 0.0 ± 0.0
2.381ValPro: 2.381 ± 1.125
4.762ValGln: 4.762 ± 1.571
10.714ValArg: 10.714 ± 4.555
5.952ValSer: 5.952 ± 1.481
4.762ValThr: 4.762 ± 2.327
4.762ValVal: 4.762 ± 0.756
1.19ValTrp: 1.19 ± 1.225
1.19ValTyr: 1.19 ± 0.905
0.0ValXaa: 0.0 ± 0.0
Trp
1.19TrpAla: 1.19 ± 0.905
0.0TrpCys: 0.0 ± 0.0
1.19TrpAsp: 1.19 ± 1.225
0.0TrpGlu: 0.0 ± 0.0
1.19TrpPhe: 1.19 ± 0.851
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
2.381TrpIle: 2.381 ± 1.125
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
2.381TrpMet: 2.381 ± 1.125
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
3.571TrpArg: 3.571 ± 1.465
1.19TrpSer: 1.19 ± 0.905
3.571TrpThr: 3.571 ± 1.631
1.19TrpVal: 1.19 ± 1.225
1.19TrpTrp: 1.19 ± 0.851
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.571TyrAla: 3.571 ± 1.366
0.0TyrCys: 0.0 ± 0.0
1.19TyrAsp: 1.19 ± 0.851
2.381TyrGlu: 2.381 ± 1.164
0.0TyrPhe: 0.0 ± 0.0
2.381TyrGly: 2.381 ± 0.786
1.19TyrHis: 1.19 ± 1.225
2.381TyrIle: 2.381 ± 1.81
1.19TyrLys: 1.19 ± 0.905
5.952TyrLeu: 5.952 ± 2.059
0.0TyrMet: 0.0 ± 0.0
4.762TyrAsn: 4.762 ± 1.571
2.381TyrPro: 2.381 ± 1.164
1.19TyrGln: 1.19 ± 0.905
1.19TyrArg: 1.19 ± 0.851
2.381TyrSer: 2.381 ± 1.125
1.19TyrThr: 1.19 ± 0.905
2.381TyrVal: 2.381 ± 2.45
1.19TyrTrp: 1.19 ± 0.905
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (841 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
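A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not described here, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the standard deviation of the resampled values is reported as the error. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the proteome described here).
toy_proteins = ["MKVLAAGLL", "AAGTT", "MLLKR"]
print(dipeptide_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

A dipeptide that never occurs yields 0.0 in every resample, which is consistent with the "0.0 ± 0.0" entries in the table. With only a handful of proteins, the resamples differ little in composition, so the resulting error estimate is necessarily coarse.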

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski