Amino acid dipepetide frequency for Burke-Gilman virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.225AlaAla: 6.225 ± 0.0
2.179AlaCys: 2.179 ± 0.0
3.735AlaAsp: 3.735 ± 0.0
4.046AlaGlu: 4.046 ± 0.0
3.735AlaPhe: 3.735 ± 0.0
4.669AlaGly: 4.669 ± 0.0
0.622AlaHis: 0.622 ± 0.0
4.046AlaIle: 4.046 ± 0.0
5.291AlaLys: 5.291 ± 0.0
3.112AlaLeu: 3.112 ± 0.0
1.867AlaMet: 1.867 ± 0.0
4.669AlaAsn: 4.669 ± 0.0
1.245AlaPro: 1.245 ± 0.0
4.98AlaGln: 4.98 ± 0.0
4.669AlaArg: 4.669 ± 0.0
6.225AlaSer: 6.225 ± 0.0
2.179AlaThr: 2.179 ± 0.0
6.536AlaVal: 6.536 ± 0.0
0.622AlaTrp: 0.622 ± 0.0
2.179AlaTyr: 2.179 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.934CysAla: 0.934 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.49CysAsp: 2.49 ± 0.0
0.622CysGlu: 0.622 ± 0.0
1.245CysPhe: 1.245 ± 0.0
2.179CysGly: 2.179 ± 0.0
0.311CysHis: 0.311 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.179CysLys: 2.179 ± 0.0
1.245CysLeu: 1.245 ± 0.0
0.622CysMet: 0.622 ± 0.0
0.311CysAsn: 0.311 ± 0.0
0.934CysPro: 0.934 ± 0.0
0.622CysGln: 0.622 ± 0.0
0.311CysArg: 0.311 ± 0.0
1.867CysSer: 1.867 ± 0.0
0.934CysThr: 0.934 ± 0.0
2.179CysVal: 2.179 ± 0.0
0.311CysTrp: 0.311 ± 0.0
0.934CysTyr: 0.934 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.98AspAla: 4.98 ± 0.0
1.556AspCys: 1.556 ± 0.0
4.98AspAsp: 4.98 ± 0.0
3.735AspGlu: 3.735 ± 0.0
2.801AspPhe: 2.801 ± 0.0
3.424AspGly: 3.424 ± 0.0
2.49AspHis: 2.49 ± 0.0
2.801AspIle: 2.801 ± 0.0
3.424AspLys: 3.424 ± 0.0
3.424AspLeu: 3.424 ± 0.0
0.934AspMet: 0.934 ± 0.0
2.49AspAsn: 2.49 ± 0.0
4.669AspPro: 4.669 ± 0.0
1.867AspGln: 1.867 ± 0.0
4.357AspArg: 4.357 ± 0.0
3.112AspSer: 3.112 ± 0.0
2.801AspThr: 2.801 ± 0.0
4.98AspVal: 4.98 ± 0.0
1.556AspTrp: 1.556 ± 0.0
1.245AspTyr: 1.245 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.357GluAla: 4.357 ± 0.0
0.622GluCys: 0.622 ± 0.0
2.49GluAsp: 2.49 ± 0.0
3.735GluGlu: 3.735 ± 0.0
2.801GluPhe: 2.801 ± 0.0
2.801GluGly: 2.801 ± 0.0
1.867GluHis: 1.867 ± 0.0
3.112GluIle: 3.112 ± 0.0
4.046GluLys: 4.046 ± 0.0
5.291GluLeu: 5.291 ± 0.0
2.49GluMet: 2.49 ± 0.0
1.867GluAsn: 1.867 ± 0.0
1.245GluPro: 1.245 ± 0.0
0.934GluGln: 0.934 ± 0.0
0.934GluArg: 0.934 ± 0.0
4.357GluSer: 4.357 ± 0.0
2.49GluThr: 2.49 ± 0.0
4.357GluVal: 4.357 ± 0.0
1.556GluTrp: 1.556 ± 0.0
0.934GluTyr: 0.934 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.112PheAla: 3.112 ± 0.0
1.245PheCys: 1.245 ± 0.0
4.669PheAsp: 4.669 ± 0.0
1.867PheGlu: 1.867 ± 0.0
1.867PhePhe: 1.867 ± 0.0
0.934PheGly: 0.934 ± 0.0
0.622PheHis: 0.622 ± 0.0
2.801PheIle: 2.801 ± 0.0
3.112PheLys: 3.112 ± 0.0
3.735PheLeu: 3.735 ± 0.0
1.556PheMet: 1.556 ± 0.0
3.735PheAsn: 3.735 ± 0.0
2.49PhePro: 2.49 ± 0.0
1.556PheGln: 1.556 ± 0.0
2.179PheArg: 2.179 ± 0.0
4.357PheSer: 4.357 ± 0.0
3.735PheThr: 3.735 ± 0.0
4.98PheVal: 4.98 ± 0.0
1.556PheTrp: 1.556 ± 0.0
0.622PheTyr: 0.622 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.49GlyAla: 2.49 ± 0.0
0.934GlyCys: 0.934 ± 0.0
3.735GlyAsp: 3.735 ± 0.0
2.49GlyGlu: 2.49 ± 0.0
2.179GlyPhe: 2.179 ± 0.0
3.112GlyGly: 3.112 ± 0.0
0.311GlyHis: 0.311 ± 0.0
6.847GlyIle: 6.847 ± 0.0
2.801GlyLys: 2.801 ± 0.0
5.291GlyLeu: 5.291 ± 0.0
2.179GlyMet: 2.179 ± 0.0
5.291GlyAsn: 5.291 ± 0.0
1.867GlyPro: 1.867 ± 0.0
2.179GlyGln: 2.179 ± 0.0
3.735GlyArg: 3.735 ± 0.0
3.424GlySer: 3.424 ± 0.0
1.556GlyThr: 1.556 ± 0.0
4.669GlyVal: 4.669 ± 0.0
0.311GlyTrp: 0.311 ± 0.0
3.112GlyTyr: 3.112 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.311HisAla: 0.311 ± 0.0
0.934HisCys: 0.934 ± 0.0
0.934HisAsp: 0.934 ± 0.0
1.245HisGlu: 1.245 ± 0.0
2.179HisPhe: 2.179 ± 0.0
1.556HisGly: 1.556 ± 0.0
1.245HisHis: 1.245 ± 0.0
0.934HisIle: 0.934 ± 0.0
0.934HisLys: 0.934 ± 0.0
2.179HisLeu: 2.179 ± 0.0
0.622HisMet: 0.622 ± 0.0
1.245HisAsn: 1.245 ± 0.0
1.245HisPro: 1.245 ± 0.0
0.622HisGln: 0.622 ± 0.0
0.934HisArg: 0.934 ± 0.0
1.556HisSer: 1.556 ± 0.0
1.867HisThr: 1.867 ± 0.0
2.49HisVal: 2.49 ± 0.0
0.622HisTrp: 0.622 ± 0.0
0.934HisTyr: 0.934 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.291IleAla: 5.291 ± 0.0
2.49IleCys: 2.49 ± 0.0
3.735IleAsp: 3.735 ± 0.0
5.291IleGlu: 5.291 ± 0.0
0.934IlePhe: 0.934 ± 0.0
3.424IleGly: 3.424 ± 0.0
2.179IleHis: 2.179 ± 0.0
2.179IleIle: 2.179 ± 0.0
3.424IleLys: 3.424 ± 0.0
3.735IleLeu: 3.735 ± 0.0
1.245IleMet: 1.245 ± 0.0
3.424IleAsn: 3.424 ± 0.0
2.179IlePro: 2.179 ± 0.0
1.867IleGln: 1.867 ± 0.0
2.801IleArg: 2.801 ± 0.0
2.49IleSer: 2.49 ± 0.0
2.179IleThr: 2.179 ± 0.0
3.424IleVal: 3.424 ± 0.0
1.245IleTrp: 1.245 ± 0.0
1.245IleTyr: 1.245 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.801LysAla: 2.801 ± 0.0
1.245LysCys: 1.245 ± 0.0
3.112LysAsp: 3.112 ± 0.0
3.112LysGlu: 3.112 ± 0.0
2.801LysPhe: 2.801 ± 0.0
2.179LysGly: 2.179 ± 0.0
1.556LysHis: 1.556 ± 0.0
4.046LysIle: 4.046 ± 0.0
1.556LysLys: 1.556 ± 0.0
3.735LysLeu: 3.735 ± 0.0
1.867LysMet: 1.867 ± 0.0
1.867LysAsn: 1.867 ± 0.0
2.49LysPro: 2.49 ± 0.0
3.735LysGln: 3.735 ± 0.0
0.934LysArg: 0.934 ± 0.0
4.357LysSer: 4.357 ± 0.0
2.179LysThr: 2.179 ± 0.0
4.98LysVal: 4.98 ± 0.0
0.934LysTrp: 0.934 ± 0.0
1.556LysTyr: 1.556 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.781LeuAla: 7.781 ± 0.0
0.934LeuCys: 0.934 ± 0.0
4.046LeuAsp: 4.046 ± 0.0
3.735LeuGlu: 3.735 ± 0.0
3.424LeuPhe: 3.424 ± 0.0
3.424LeuGly: 3.424 ± 0.0
1.867LeuHis: 1.867 ± 0.0
1.556LeuIle: 1.556 ± 0.0
5.291LeuLys: 5.291 ± 0.0
4.669LeuLeu: 4.669 ± 0.0
1.867LeuMet: 1.867 ± 0.0
4.98LeuAsn: 4.98 ± 0.0
4.357LeuPro: 4.357 ± 0.0
1.867LeuGln: 1.867 ± 0.0
5.602LeuArg: 5.602 ± 0.0
6.847LeuSer: 6.847 ± 0.0
4.669LeuThr: 4.669 ± 0.0
2.801LeuVal: 2.801 ± 0.0
0.311LeuTrp: 0.311 ± 0.0
2.801LeuTyr: 2.801 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.867MetAla: 1.867 ± 0.0
0.622MetCys: 0.622 ± 0.0
1.867MetAsp: 1.867 ± 0.0
2.179MetGlu: 2.179 ± 0.0
2.179MetPhe: 2.179 ± 0.0
1.556MetGly: 1.556 ± 0.0
1.556MetHis: 1.556 ± 0.0
1.556MetIle: 1.556 ± 0.0
0.934MetLys: 0.934 ± 0.0
1.245MetLeu: 1.245 ± 0.0
0.934MetMet: 0.934 ± 0.0
2.179MetAsn: 2.179 ± 0.0
1.867MetPro: 1.867 ± 0.0
0.934MetGln: 0.934 ± 0.0
0.934MetArg: 0.934 ± 0.0
2.179MetSer: 2.179 ± 0.0
1.556MetThr: 1.556 ± 0.0
2.179MetVal: 2.179 ± 0.0
0.622MetTrp: 0.622 ± 0.0
0.934MetTyr: 0.934 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.357AsnAla: 4.357 ± 0.0
1.867AsnCys: 1.867 ± 0.0
4.357AsnAsp: 4.357 ± 0.0
2.179AsnGlu: 2.179 ± 0.0
3.112AsnPhe: 3.112 ± 0.0
2.801AsnGly: 2.801 ± 0.0
0.934AsnHis: 0.934 ± 0.0
2.801AsnIle: 2.801 ± 0.0
1.556AsnLys: 1.556 ± 0.0
4.669AsnLeu: 4.669 ± 0.0
1.556AsnMet: 1.556 ± 0.0
3.112AsnAsn: 3.112 ± 0.0
4.357AsnPro: 4.357 ± 0.0
1.556AsnGln: 1.556 ± 0.0
2.801AsnArg: 2.801 ± 0.0
3.735AsnSer: 3.735 ± 0.0
3.424AsnThr: 3.424 ± 0.0
4.98AsnVal: 4.98 ± 0.0
2.179AsnTrp: 2.179 ± 0.0
1.556AsnTyr: 1.556 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.801ProAla: 2.801 ± 0.0
0.311ProCys: 0.311 ± 0.0
3.735ProAsp: 3.735 ± 0.0
1.867ProGlu: 1.867 ± 0.0
3.112ProPhe: 3.112 ± 0.0
2.801ProGly: 2.801 ± 0.0
0.311ProHis: 0.311 ± 0.0
2.49ProIle: 2.49 ± 0.0
1.556ProLys: 1.556 ± 0.0
3.735ProLeu: 3.735 ± 0.0
1.245ProMet: 1.245 ± 0.0
2.179ProAsn: 2.179 ± 0.0
1.867ProPro: 1.867 ± 0.0
2.49ProGln: 2.49 ± 0.0
2.49ProArg: 2.49 ± 0.0
4.98ProSer: 4.98 ± 0.0
4.046ProThr: 4.046 ± 0.0
4.357ProVal: 4.357 ± 0.0
1.245ProTrp: 1.245 ± 0.0
1.867ProTyr: 1.867 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.735GlnAla: 3.735 ± 0.0
0.622GlnCys: 0.622 ± 0.0
0.934GlnAsp: 0.934 ± 0.0
2.801GlnGlu: 2.801 ± 0.0
2.179GlnPhe: 2.179 ± 0.0
2.49GlnGly: 2.49 ± 0.0
1.867GlnHis: 1.867 ± 0.0
2.801GlnIle: 2.801 ± 0.0
0.622GlnLys: 0.622 ± 0.0
3.112GlnLeu: 3.112 ± 0.0
1.556GlnMet: 1.556 ± 0.0
3.112GlnAsn: 3.112 ± 0.0
1.245GlnPro: 1.245 ± 0.0
1.556GlnGln: 1.556 ± 0.0
1.556GlnArg: 1.556 ± 0.0
3.424GlnSer: 3.424 ± 0.0
1.245GlnThr: 1.245 ± 0.0
1.556GlnVal: 1.556 ± 0.0
0.622GlnTrp: 0.622 ± 0.0
0.934GlnTyr: 0.934 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.98ArgAla: 4.98 ± 0.0
0.934ArgCys: 0.934 ± 0.0
2.49ArgAsp: 2.49 ± 0.0
2.801ArgGlu: 2.801 ± 0.0
2.49ArgPhe: 2.49 ± 0.0
1.556ArgGly: 1.556 ± 0.0
1.245ArgHis: 1.245 ± 0.0
3.112ArgIle: 3.112 ± 0.0
3.112ArgLys: 3.112 ± 0.0
5.602ArgLeu: 5.602 ± 0.0
0.934ArgMet: 0.934 ± 0.0
4.357ArgAsn: 4.357 ± 0.0
2.801ArgPro: 2.801 ± 0.0
1.867ArgGln: 1.867 ± 0.0
3.735ArgArg: 3.735 ± 0.0
4.046ArgSer: 4.046 ± 0.0
2.801ArgThr: 2.801 ± 0.0
3.424ArgVal: 3.424 ± 0.0
0.311ArgTrp: 0.311 ± 0.0
0.934ArgTyr: 0.934 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.669SerAla: 4.669 ± 0.0
0.622SerCys: 0.622 ± 0.0
7.158SerAsp: 7.158 ± 0.0
1.867SerGlu: 1.867 ± 0.0
3.735SerPhe: 3.735 ± 0.0
5.602SerGly: 5.602 ± 0.0
2.801SerHis: 2.801 ± 0.0
3.424SerIle: 3.424 ± 0.0
2.179SerLys: 2.179 ± 0.0
6.536SerLeu: 6.536 ± 0.0
1.867SerMet: 1.867 ± 0.0
4.046SerAsn: 4.046 ± 0.0
3.735SerPro: 3.735 ± 0.0
2.801SerGln: 2.801 ± 0.0
4.357SerArg: 4.357 ± 0.0
4.357SerSer: 4.357 ± 0.0
4.357SerThr: 4.357 ± 0.0
5.291SerVal: 5.291 ± 0.0
1.556SerTrp: 1.556 ± 0.0
2.801SerTyr: 2.801 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.424ThrAla: 3.424 ± 0.0
0.622ThrCys: 0.622 ± 0.0
2.49ThrAsp: 2.49 ± 0.0
1.867ThrGlu: 1.867 ± 0.0
3.735ThrPhe: 3.735 ± 0.0
4.357ThrGly: 4.357 ± 0.0
1.245ThrHis: 1.245 ± 0.0
4.357ThrIle: 4.357 ± 0.0
0.934ThrLys: 0.934 ± 0.0
3.424ThrLeu: 3.424 ± 0.0
0.934ThrMet: 0.934 ± 0.0
1.245ThrAsn: 1.245 ± 0.0
3.424ThrPro: 3.424 ± 0.0
1.556ThrGln: 1.556 ± 0.0
4.669ThrArg: 4.669 ± 0.0
3.112ThrSer: 3.112 ± 0.0
3.735ThrThr: 3.735 ± 0.0
3.424ThrVal: 3.424 ± 0.0
1.556ThrTrp: 1.556 ± 0.0
2.801ThrTyr: 2.801 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.913ValAla: 5.913 ± 0.0
1.556ValCys: 1.556 ± 0.0
2.49ValAsp: 2.49 ± 0.0
4.98ValGlu: 4.98 ± 0.0
4.669ValPhe: 4.669 ± 0.0
5.291ValGly: 5.291 ± 0.0
0.622ValHis: 0.622 ± 0.0
2.801ValIle: 2.801 ± 0.0
4.98ValLys: 4.98 ± 0.0
4.669ValLeu: 4.669 ± 0.0
2.179ValMet: 2.179 ± 0.0
5.291ValAsn: 5.291 ± 0.0
6.225ValPro: 6.225 ± 0.0
3.112ValGln: 3.112 ± 0.0
2.801ValArg: 2.801 ± 0.0
5.602ValSer: 5.602 ± 0.0
3.735ValThr: 3.735 ± 0.0
4.357ValVal: 4.357 ± 0.0
1.867ValTrp: 1.867 ± 0.0
1.867ValTyr: 1.867 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.622TrpAla: 0.622 ± 0.0
0.622TrpCys: 0.622 ± 0.0
0.934TrpAsp: 0.934 ± 0.0
0.622TrpGlu: 0.622 ± 0.0
0.622TrpPhe: 0.622 ± 0.0
1.245TrpGly: 1.245 ± 0.0
0.622TrpHis: 0.622 ± 0.0
0.934TrpIle: 0.934 ± 0.0
1.867TrpLys: 1.867 ± 0.0
1.867TrpLeu: 1.867 ± 0.0
1.245TrpMet: 1.245 ± 0.0
0.934TrpAsn: 0.934 ± 0.0
0.311TrpPro: 0.311 ± 0.0
0.934TrpGln: 0.934 ± 0.0
1.245TrpArg: 1.245 ± 0.0
1.556TrpSer: 1.556 ± 0.0
0.934TrpThr: 0.934 ± 0.0
1.556TrpVal: 1.556 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.622TrpTyr: 0.622 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.179TyrAla: 2.179 ± 0.0
0.311TyrCys: 0.311 ± 0.0
1.245TyrAsp: 1.245 ± 0.0
1.245TyrGlu: 1.245 ± 0.0
1.245TyrPhe: 1.245 ± 0.0
3.112TyrGly: 3.112 ± 0.0
0.311TyrHis: 0.311 ± 0.0
2.179TyrIle: 2.179 ± 0.0
1.245TyrLys: 1.245 ± 0.0
1.556TyrLeu: 1.556 ± 0.0
1.867TyrMet: 1.867 ± 0.0
1.556TyrAsn: 1.556 ± 0.0
0.934TyrPro: 0.934 ± 0.0
0.934TyrGln: 0.934 ± 0.0
2.49TyrArg: 2.49 ± 0.0
2.179TyrSer: 2.179 ± 0.0
2.49TyrThr: 2.49 ± 0.0
2.49TyrVal: 2.49 ± 0.0
0.311TyrTrp: 0.311 ± 0.0
0.934TyrTyr: 0.934 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3214 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski