Amino acid dipepetide frequency for Yellow fever virus (strain 17D vaccine) (YFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.038AlaAla: 7.038 ± 0.0
1.466AlaCys: 1.466 ± 0.0
2.346AlaAsp: 2.346 ± 0.0
4.106AlaGlu: 4.106 ± 0.0
2.639AlaPhe: 2.639 ± 0.0
4.106AlaGly: 4.106 ± 0.0
2.346AlaHis: 2.346 ± 0.0
2.933AlaIle: 2.933 ± 0.0
3.226AlaLys: 3.226 ± 0.0
7.625AlaLeu: 7.625 ± 0.0
4.399AlaMet: 4.399 ± 0.0
2.053AlaAsn: 2.053 ± 0.0
2.346AlaPro: 2.346 ± 0.0
2.933AlaGln: 2.933 ± 0.0
2.639AlaArg: 2.639 ± 0.0
5.572AlaSer: 5.572 ± 0.0
2.639AlaThr: 2.639 ± 0.0
7.331AlaVal: 7.331 ± 0.0
1.173AlaTrp: 1.173 ± 0.0
2.639AlaTyr: 2.639 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.053CysAla: 2.053 ± 0.0
0.293CysCys: 0.293 ± 0.0
2.639CysAsp: 2.639 ± 0.0
0.587CysGlu: 0.587 ± 0.0
0.293CysPhe: 0.293 ± 0.0
2.639CysGly: 2.639 ± 0.0
0.293CysHis: 0.293 ± 0.0
0.587CysIle: 0.587 ± 0.0
0.293CysLys: 0.293 ± 0.0
0.293CysLeu: 0.293 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.466CysPro: 1.466 ± 0.0
0.293CysGln: 0.293 ± 0.0
2.346CysArg: 2.346 ± 0.0
0.293CysSer: 0.293 ± 0.0
1.173CysThr: 1.173 ± 0.0
1.466CysVal: 1.466 ± 0.0
0.88CysTrp: 0.88 ± 0.0
0.88CysTyr: 0.88 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.466AspAla: 1.466 ± 0.0
1.466AspCys: 1.466 ± 0.0
2.639AspAsp: 2.639 ± 0.0
2.933AspGlu: 2.933 ± 0.0
1.76AspPhe: 1.76 ± 0.0
4.399AspGly: 4.399 ± 0.0
0.293AspHis: 0.293 ± 0.0
3.519AspIle: 3.519 ± 0.0
3.519AspLys: 3.519 ± 0.0
4.106AspLeu: 4.106 ± 0.0
0.587AspMet: 0.587 ± 0.0
2.346AspAsn: 2.346 ± 0.0
1.76AspPro: 1.76 ± 0.0
1.76AspGln: 1.76 ± 0.0
2.933AspArg: 2.933 ± 0.0
2.639AspSer: 2.639 ± 0.0
3.812AspThr: 3.812 ± 0.0
2.639AspVal: 2.639 ± 0.0
0.88AspTrp: 0.88 ± 0.0
0.88AspTyr: 0.88 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.572GluAla: 5.572 ± 0.0
1.466GluCys: 1.466 ± 0.0
2.346GluAsp: 2.346 ± 0.0
6.452GluGlu: 6.452 ± 0.0
2.639GluPhe: 2.639 ± 0.0
5.279GluGly: 5.279 ± 0.0
1.173GluHis: 1.173 ± 0.0
2.933GluIle: 2.933 ± 0.0
3.226GluLys: 3.226 ± 0.0
3.812GluLeu: 3.812 ± 0.0
3.812GluMet: 3.812 ± 0.0
2.933GluAsn: 2.933 ± 0.0
2.053GluPro: 2.053 ± 0.0
1.76GluGln: 1.76 ± 0.0
3.519GluArg: 3.519 ± 0.0
1.76GluSer: 1.76 ± 0.0
1.466GluThr: 1.466 ± 0.0
6.745GluVal: 6.745 ± 0.0
1.173GluTrp: 1.173 ± 0.0
1.466GluTyr: 1.466 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.88PheAla: 0.88 ± 0.0
0.88PheCys: 0.88 ± 0.0
0.293PheAsp: 0.293 ± 0.0
2.639PheGlu: 2.639 ± 0.0
2.346PhePhe: 2.346 ± 0.0
4.399PheGly: 4.399 ± 0.0
3.226PheHis: 3.226 ± 0.0
2.639PheIle: 2.639 ± 0.0
1.76PheLys: 1.76 ± 0.0
4.399PheLeu: 4.399 ± 0.0
1.173PheMet: 1.173 ± 0.0
0.293PheAsn: 0.293 ± 0.0
0.293PhePro: 0.293 ± 0.0
1.173PheGln: 1.173 ± 0.0
0.88PheArg: 0.88 ± 0.0
2.346PheSer: 2.346 ± 0.0
1.76PheThr: 1.76 ± 0.0
1.466PheVal: 1.466 ± 0.0
0.587PheTrp: 0.587 ± 0.0
0.587PheTyr: 0.587 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.158GlyAla: 6.158 ± 0.0
2.053GlyCys: 2.053 ± 0.0
2.933GlyAsp: 2.933 ± 0.0
4.985GlyGlu: 4.985 ± 0.0
1.76GlyPhe: 1.76 ± 0.0
7.625GlyGly: 7.625 ± 0.0
1.173GlyHis: 1.173 ± 0.0
4.985GlyIle: 4.985 ± 0.0
8.211GlyLys: 8.211 ± 0.0
7.331GlyLeu: 7.331 ± 0.0
1.76GlyMet: 1.76 ± 0.0
3.519GlyAsn: 3.519 ± 0.0
3.226GlyPro: 3.226 ± 0.0
1.466GlyGln: 1.466 ± 0.0
6.745GlyArg: 6.745 ± 0.0
6.745GlySer: 6.745 ± 0.0
4.985GlyThr: 4.985 ± 0.0
8.211GlyVal: 8.211 ± 0.0
2.933GlyTrp: 2.933 ± 0.0
1.466GlyTyr: 1.466 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.173HisAla: 1.173 ± 0.0
0.293HisCys: 0.293 ± 0.0
0.587HisAsp: 0.587 ± 0.0
2.933HisGlu: 2.933 ± 0.0
0.88HisPhe: 0.88 ± 0.0
3.226HisGly: 3.226 ± 0.0
1.76HisHis: 1.76 ± 0.0
0.88HisIle: 0.88 ± 0.0
0.88HisLys: 0.88 ± 0.0
2.639HisLeu: 2.639 ± 0.0
0.293HisMet: 0.293 ± 0.0
0.293HisAsn: 0.293 ± 0.0
0.88HisPro: 0.88 ± 0.0
0.88HisGln: 0.88 ± 0.0
0.88HisArg: 0.88 ± 0.0
0.587HisSer: 0.587 ± 0.0
1.466HisThr: 1.466 ± 0.0
2.053HisVal: 2.053 ± 0.0
0.88HisTrp: 0.88 ± 0.0
0.587HisTyr: 0.587 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.812IleAla: 3.812 ± 0.0
0.293IleCys: 0.293 ± 0.0
2.639IleAsp: 2.639 ± 0.0
2.933IleGlu: 2.933 ± 0.0
2.346IlePhe: 2.346 ± 0.0
5.865IleGly: 5.865 ± 0.0
2.346IleHis: 2.346 ± 0.0
2.346IleIle: 2.346 ± 0.0
2.933IleLys: 2.933 ± 0.0
5.865IleLeu: 5.865 ± 0.0
1.466IleMet: 1.466 ± 0.0
1.76IleAsn: 1.76 ± 0.0
3.519IlePro: 3.519 ± 0.0
0.587IleGln: 0.587 ± 0.0
2.933IleArg: 2.933 ± 0.0
3.226IleSer: 3.226 ± 0.0
2.346IleThr: 2.346 ± 0.0
3.226IleVal: 3.226 ± 0.0
0.587IleTrp: 0.587 ± 0.0
0.293IleTyr: 0.293 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.933LysAla: 2.933 ± 0.0
2.053LysCys: 2.053 ± 0.0
1.76LysAsp: 1.76 ± 0.0
3.812LysGlu: 3.812 ± 0.0
1.466LysPhe: 1.466 ± 0.0
4.692LysGly: 4.692 ± 0.0
0.0LysHis: 0.0 ± 0.0
2.639LysIle: 2.639 ± 0.0
3.812LysLys: 3.812 ± 0.0
5.279LysLeu: 5.279 ± 0.0
1.173LysMet: 1.173 ± 0.0
2.639LysAsn: 2.639 ± 0.0
2.346LysPro: 2.346 ± 0.0
1.466LysGln: 1.466 ± 0.0
4.399LysArg: 4.399 ± 0.0
1.76LysSer: 1.76 ± 0.0
5.865LysThr: 5.865 ± 0.0
6.158LysVal: 6.158 ± 0.0
1.173LysTrp: 1.173 ± 0.0
1.173LysTyr: 1.173 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
9.677LeuAla: 9.677 ± 0.0
2.053LeuCys: 2.053 ± 0.0
4.399LeuAsp: 4.399 ± 0.0
6.452LeuGlu: 6.452 ± 0.0
2.933LeuPhe: 2.933 ± 0.0
7.331LeuGly: 7.331 ± 0.0
2.639LeuHis: 2.639 ± 0.0
4.985LeuIle: 4.985 ± 0.0
4.106LeuLys: 4.106 ± 0.0
7.331LeuLeu: 7.331 ± 0.0
2.639LeuMet: 2.639 ± 0.0
3.519LeuAsn: 3.519 ± 0.0
2.639LeuPro: 2.639 ± 0.0
2.053LeuGln: 2.053 ± 0.0
2.933LeuArg: 2.933 ± 0.0
6.745LeuSer: 6.745 ± 0.0
7.331LeuThr: 7.331 ± 0.0
8.211LeuVal: 8.211 ± 0.0
1.76LeuTrp: 1.76 ± 0.0
1.76LeuTyr: 1.76 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.226MetAla: 3.226 ± 0.0
0.587MetCys: 0.587 ± 0.0
2.053MetAsp: 2.053 ± 0.0
2.053MetGlu: 2.053 ± 0.0
1.466MetPhe: 1.466 ± 0.0
2.346MetGly: 2.346 ± 0.0
0.293MetHis: 0.293 ± 0.0
1.173MetIle: 1.173 ± 0.0
1.466MetLys: 1.466 ± 0.0
4.692MetLeu: 4.692 ± 0.0
1.466MetMet: 1.466 ± 0.0
0.88MetAsn: 0.88 ± 0.0
1.76MetPro: 1.76 ± 0.0
1.173MetGln: 1.173 ± 0.0
2.053MetArg: 2.053 ± 0.0
2.346MetSer: 2.346 ± 0.0
4.106MetThr: 4.106 ± 0.0
2.346MetVal: 2.346 ± 0.0
1.173MetTrp: 1.173 ± 0.0
1.173MetTyr: 1.173 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.346AsnAla: 2.346 ± 0.0
1.173AsnCys: 1.173 ± 0.0
2.053AsnAsp: 2.053 ± 0.0
2.053AsnGlu: 2.053 ± 0.0
1.173AsnPhe: 1.173 ± 0.0
4.399AsnGly: 4.399 ± 0.0
0.88AsnHis: 0.88 ± 0.0
1.173AsnIle: 1.173 ± 0.0
1.173AsnLys: 1.173 ± 0.0
2.639AsnLeu: 2.639 ± 0.0
1.76AsnMet: 1.76 ± 0.0
1.173AsnAsn: 1.173 ± 0.0
3.226AsnPro: 3.226 ± 0.0
0.88AsnGln: 0.88 ± 0.0
2.639AsnArg: 2.639 ± 0.0
2.053AsnSer: 2.053 ± 0.0
1.76AsnThr: 1.76 ± 0.0
2.639AsnVal: 2.639 ± 0.0
0.88AsnTrp: 0.88 ± 0.0
0.293AsnTyr: 0.293 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.76ProAla: 1.76 ± 0.0
0.587ProCys: 0.587 ± 0.0
2.053ProAsp: 2.053 ± 0.0
1.76ProGlu: 1.76 ± 0.0
3.226ProPhe: 3.226 ± 0.0
4.399ProGly: 4.399 ± 0.0
1.173ProHis: 1.173 ± 0.0
1.466ProIle: 1.466 ± 0.0
1.466ProLys: 1.466 ± 0.0
3.226ProLeu: 3.226 ± 0.0
1.173ProMet: 1.173 ± 0.0
0.587ProAsn: 0.587 ± 0.0
1.466ProPro: 1.466 ± 0.0
0.587ProGln: 0.587 ± 0.0
2.053ProArg: 2.053 ± 0.0
2.933ProSer: 2.933 ± 0.0
4.106ProThr: 4.106 ± 0.0
3.226ProVal: 3.226 ± 0.0
1.76ProTrp: 1.76 ± 0.0
0.88ProTyr: 0.88 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.587GlnAla: 0.587 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.639GlnAsp: 2.639 ± 0.0
2.639GlnGlu: 2.639 ± 0.0
0.587GlnPhe: 0.587 ± 0.0
2.933GlnGly: 2.933 ± 0.0
0.293GlnHis: 0.293 ± 0.0
0.88GlnIle: 0.88 ± 0.0
1.466GlnLys: 1.466 ± 0.0
1.76GlnLeu: 1.76 ± 0.0
0.293GlnMet: 0.293 ± 0.0
0.293GlnAsn: 0.293 ± 0.0
0.587GlnPro: 0.587 ± 0.0
1.173GlnGln: 1.173 ± 0.0
2.933GlnArg: 2.933 ± 0.0
1.76GlnSer: 1.76 ± 0.0
2.346GlnThr: 2.346 ± 0.0
2.346GlnVal: 2.346 ± 0.0
0.88GlnTrp: 0.88 ± 0.0
0.88GlnTyr: 0.88 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.226ArgAla: 3.226 ± 0.0
0.293ArgCys: 0.293 ± 0.0
3.226ArgAsp: 3.226 ± 0.0
3.812ArgGlu: 3.812 ± 0.0
0.88ArgPhe: 0.88 ± 0.0
6.158ArgGly: 6.158 ± 0.0
0.587ArgHis: 0.587 ± 0.0
2.933ArgIle: 2.933 ± 0.0
5.279ArgLys: 5.279 ± 0.0
3.812ArgLeu: 3.812 ± 0.0
2.053ArgMet: 2.053 ± 0.0
2.933ArgAsn: 2.933 ± 0.0
2.639ArgPro: 2.639 ± 0.0
2.053ArgGln: 2.053 ± 0.0
5.572ArgArg: 5.572 ± 0.0
4.399ArgSer: 4.399 ± 0.0
3.226ArgThr: 3.226 ± 0.0
5.865ArgVal: 5.865 ± 0.0
1.76ArgTrp: 1.76 ± 0.0
0.88ArgTyr: 0.88 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.158SerAla: 6.158 ± 0.0
0.587SerCys: 0.587 ± 0.0
1.466SerAsp: 1.466 ± 0.0
4.106SerGlu: 4.106 ± 0.0
1.76SerPhe: 1.76 ± 0.0
5.865SerGly: 5.865 ± 0.0
2.346SerHis: 2.346 ± 0.0
3.519SerIle: 3.519 ± 0.0
1.466SerLys: 1.466 ± 0.0
6.158SerLeu: 6.158 ± 0.0
3.226SerMet: 3.226 ± 0.0
1.76SerAsn: 1.76 ± 0.0
3.226SerPro: 3.226 ± 0.0
0.88SerGln: 0.88 ± 0.0
4.985SerArg: 4.985 ± 0.0
3.812SerSer: 3.812 ± 0.0
1.76SerThr: 1.76 ± 0.0
4.985SerVal: 4.985 ± 0.0
2.639SerTrp: 2.639 ± 0.0
2.346SerTyr: 2.346 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.399ThrAla: 4.399 ± 0.0
0.587ThrCys: 0.587 ± 0.0
3.519ThrAsp: 3.519 ± 0.0
1.76ThrGlu: 1.76 ± 0.0
1.76ThrPhe: 1.76 ± 0.0
3.812ThrGly: 3.812 ± 0.0
2.053ThrHis: 2.053 ± 0.0
3.519ThrIle: 3.519 ± 0.0
2.933ThrLys: 2.933 ± 0.0
5.865ThrLeu: 5.865 ± 0.0
2.639ThrMet: 2.639 ± 0.0
2.346ThrAsn: 2.346 ± 0.0
1.76ThrPro: 1.76 ± 0.0
1.173ThrGln: 1.173 ± 0.0
4.399ThrArg: 4.399 ± 0.0
4.985ThrSer: 4.985 ± 0.0
2.053ThrThr: 2.053 ± 0.0
4.985ThrVal: 4.985 ± 0.0
1.76ThrTrp: 1.76 ± 0.0
1.76ThrTyr: 1.76 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.158ValAla: 6.158 ± 0.0
0.587ValCys: 0.587 ± 0.0
4.399ValAsp: 4.399 ± 0.0
4.106ValGlu: 4.106 ± 0.0
2.639ValPhe: 2.639 ± 0.0
5.572ValGly: 5.572 ± 0.0
0.293ValHis: 0.293 ± 0.0
6.452ValIle: 6.452 ± 0.0
5.572ValLys: 5.572 ± 0.0
8.798ValLeu: 8.798 ± 0.0
3.812ValMet: 3.812 ± 0.0
3.519ValAsn: 3.519 ± 0.0
2.639ValPro: 2.639 ± 0.0
3.519ValGln: 3.519 ± 0.0
4.985ValArg: 4.985 ± 0.0
6.158ValSer: 6.158 ± 0.0
4.985ValThr: 4.985 ± 0.0
6.745ValVal: 6.745 ± 0.0
1.76ValTrp: 1.76 ± 0.0
1.173ValTyr: 1.173 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.466TrpAla: 1.466 ± 0.0
1.76TrpCys: 1.76 ± 0.0
1.76TrpAsp: 1.76 ± 0.0
1.173TrpGlu: 1.173 ± 0.0
1.173TrpPhe: 1.173 ± 0.0
0.587TrpGly: 0.587 ± 0.0
0.88TrpHis: 0.88 ± 0.0
1.466TrpIle: 1.466 ± 0.0
2.053TrpLys: 2.053 ± 0.0
2.639TrpLeu: 2.639 ± 0.0
1.76TrpMet: 1.76 ± 0.0
2.346TrpAsn: 2.346 ± 0.0
0.587TrpPro: 0.587 ± 0.0
0.88TrpGln: 0.88 ± 0.0
0.88TrpArg: 0.88 ± 0.0
1.466TrpSer: 1.466 ± 0.0
0.293TrpThr: 0.293 ± 0.0
1.173TrpVal: 1.173 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.587TrpTyr: 0.587 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.466TyrAla: 1.466 ± 0.0
0.293TyrCys: 0.293 ± 0.0
0.587TyrAsp: 0.587 ± 0.0
0.293TyrGlu: 0.293 ± 0.0
0.293TyrPhe: 0.293 ± 0.0
2.639TyrGly: 2.639 ± 0.0
0.0TyrHis: 0.0 ± 0.0
0.587TyrIle: 0.587 ± 0.0
1.76TyrLys: 1.76 ± 0.0
3.226TyrLeu: 3.226 ± 0.0
2.053TyrMet: 2.053 ± 0.0
1.173TyrAsn: 1.173 ± 0.0
1.466TyrPro: 1.466 ± 0.0
0.587TyrGln: 0.587 ± 0.0
0.88TyrArg: 0.88 ± 0.0
1.466TyrSer: 1.466 ± 0.0
0.587TyrThr: 0.587 ± 0.0
2.053TyrVal: 2.053 ± 0.0
0.293TyrTrp: 0.293 ± 0.0
1.466TyrTyr: 1.466 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3411 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski