Amino acid dipepetide frequency for Bradson virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.714AlaAla: 3.714 ± 0.0
1.35AlaCys: 1.35 ± 0.0
2.701AlaAsp: 2.701 ± 0.0
3.714AlaGlu: 3.714 ± 0.0
3.714AlaPhe: 3.714 ± 0.0
3.714AlaGly: 3.714 ± 0.0
1.013AlaHis: 1.013 ± 0.0
5.064AlaIle: 5.064 ± 0.0
4.051AlaLys: 4.051 ± 0.0
4.727AlaLeu: 4.727 ± 0.0
0.675AlaMet: 0.675 ± 0.0
2.701AlaAsn: 2.701 ± 0.0
2.026AlaPro: 2.026 ± 0.0
2.026AlaGln: 2.026 ± 0.0
1.688AlaArg: 1.688 ± 0.0
4.389AlaSer: 4.389 ± 0.0
2.701AlaThr: 2.701 ± 0.0
3.376AlaVal: 3.376 ± 0.0
0.338AlaTrp: 0.338 ± 0.0
1.013AlaTyr: 1.013 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.35CysAla: 1.35 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.026CysAsp: 2.026 ± 0.0
1.688CysGlu: 1.688 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.013CysGly: 1.013 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.688CysIle: 1.688 ± 0.0
1.013CysLys: 1.013 ± 0.0
1.688CysLeu: 1.688 ± 0.0
1.688CysMet: 1.688 ± 0.0
1.35CysAsn: 1.35 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.338CysGln: 0.338 ± 0.0
0.675CysArg: 0.675 ± 0.0
1.013CysSer: 1.013 ± 0.0
0.338CysThr: 0.338 ± 0.0
2.026CysVal: 2.026 ± 0.0
0.338CysTrp: 0.338 ± 0.0
0.675CysTyr: 0.675 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.038AspAla: 3.038 ± 0.0
0.675AspCys: 0.675 ± 0.0
3.038AspAsp: 3.038 ± 0.0
6.077AspGlu: 6.077 ± 0.0
4.389AspPhe: 4.389 ± 0.0
3.376AspGly: 3.376 ± 0.0
1.013AspHis: 1.013 ± 0.0
4.051AspIle: 4.051 ± 0.0
4.727AspLys: 4.727 ± 0.0
7.09AspLeu: 7.09 ± 0.0
0.338AspMet: 0.338 ± 0.0
3.714AspAsn: 3.714 ± 0.0
4.051AspPro: 4.051 ± 0.0
1.013AspGln: 1.013 ± 0.0
1.35AspArg: 1.35 ± 0.0
4.051AspSer: 4.051 ± 0.0
1.688AspThr: 1.688 ± 0.0
2.363AspVal: 2.363 ± 0.0
1.35AspTrp: 1.35 ± 0.0
1.688AspTyr: 1.688 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.389GluAla: 4.389 ± 0.0
0.338GluCys: 0.338 ± 0.0
3.376GluAsp: 3.376 ± 0.0
4.389GluGlu: 4.389 ± 0.0
2.363GluPhe: 2.363 ± 0.0
4.727GluGly: 4.727 ± 0.0
0.675GluHis: 0.675 ± 0.0
4.727GluIle: 4.727 ± 0.0
3.714GluLys: 3.714 ± 0.0
7.09GluLeu: 7.09 ± 0.0
1.35GluMet: 1.35 ± 0.0
1.013GluAsn: 1.013 ± 0.0
1.013GluPro: 1.013 ± 0.0
4.051GluGln: 4.051 ± 0.0
1.688GluArg: 1.688 ± 0.0
2.701GluSer: 2.701 ± 0.0
2.701GluThr: 2.701 ± 0.0
5.402GluVal: 5.402 ± 0.0
1.013GluTrp: 1.013 ± 0.0
1.35GluTyr: 1.35 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.688PheAla: 1.688 ± 0.0
0.675PheCys: 0.675 ± 0.0
5.402PheAsp: 5.402 ± 0.0
2.363PheGlu: 2.363 ± 0.0
2.701PhePhe: 2.701 ± 0.0
4.051PheGly: 4.051 ± 0.0
1.35PheHis: 1.35 ± 0.0
2.363PheIle: 2.363 ± 0.0
5.402PheLys: 5.402 ± 0.0
4.389PheLeu: 4.389 ± 0.0
0.675PheMet: 0.675 ± 0.0
3.038PheAsn: 3.038 ± 0.0
2.026PhePro: 2.026 ± 0.0
1.688PheGln: 1.688 ± 0.0
2.701PheArg: 2.701 ± 0.0
3.714PheSer: 3.714 ± 0.0
1.35PheThr: 1.35 ± 0.0
2.701PheVal: 2.701 ± 0.0
0.338PheTrp: 0.338 ± 0.0
0.675PheTyr: 0.675 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.013GlyAla: 1.013 ± 0.0
1.35GlyCys: 1.35 ± 0.0
4.389GlyAsp: 4.389 ± 0.0
2.363GlyGlu: 2.363 ± 0.0
2.701GlyPhe: 2.701 ± 0.0
1.35GlyGly: 1.35 ± 0.0
1.35GlyHis: 1.35 ± 0.0
5.064GlyIle: 5.064 ± 0.0
3.714GlyLys: 3.714 ± 0.0
4.727GlyLeu: 4.727 ± 0.0
1.688GlyMet: 1.688 ± 0.0
3.376GlyAsn: 3.376 ± 0.0
2.026GlyPro: 2.026 ± 0.0
1.688GlyGln: 1.688 ± 0.0
1.013GlyArg: 1.013 ± 0.0
3.714GlySer: 3.714 ± 0.0
4.727GlyThr: 4.727 ± 0.0
4.051GlyVal: 4.051 ± 0.0
0.675GlyTrp: 0.675 ± 0.0
4.389GlyTyr: 4.389 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.35HisAla: 1.35 ± 0.0
1.013HisCys: 1.013 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.013HisGlu: 1.013 ± 0.0
1.688HisPhe: 1.688 ± 0.0
2.026HisGly: 2.026 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.35HisIle: 1.35 ± 0.0
0.338HisLys: 0.338 ± 0.0
1.013HisLeu: 1.013 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.013HisAsn: 1.013 ± 0.0
0.338HisPro: 0.338 ± 0.0
0.338HisGln: 0.338 ± 0.0
0.338HisArg: 0.338 ± 0.0
1.688HisSer: 1.688 ± 0.0
1.35HisThr: 1.35 ± 0.0
1.688HisVal: 1.688 ± 0.0
0.675HisTrp: 0.675 ± 0.0
1.013HisTyr: 1.013 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.038IleAla: 3.038 ± 0.0
2.026IleCys: 2.026 ± 0.0
3.376IleAsp: 3.376 ± 0.0
4.051IleGlu: 4.051 ± 0.0
3.714IlePhe: 3.714 ± 0.0
3.376IleGly: 3.376 ± 0.0
0.675IleHis: 0.675 ± 0.0
7.427IleIle: 7.427 ± 0.0
6.752IleLys: 6.752 ± 0.0
4.389IleLeu: 4.389 ± 0.0
3.714IleMet: 3.714 ± 0.0
6.077IleAsn: 6.077 ± 0.0
4.051IlePro: 4.051 ± 0.0
1.35IleGln: 1.35 ± 0.0
3.714IleArg: 3.714 ± 0.0
4.389IleSer: 4.389 ± 0.0
3.038IleThr: 3.038 ± 0.0
5.739IleVal: 5.739 ± 0.0
0.338IleTrp: 0.338 ± 0.0
3.376IleTyr: 3.376 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.701LysAla: 2.701 ± 0.0
1.35LysCys: 1.35 ± 0.0
3.376LysAsp: 3.376 ± 0.0
4.389LysGlu: 4.389 ± 0.0
3.376LysPhe: 3.376 ± 0.0
1.35LysGly: 1.35 ± 0.0
1.688LysHis: 1.688 ± 0.0
4.727LysIle: 4.727 ± 0.0
5.402LysLys: 5.402 ± 0.0
5.402LysLeu: 5.402 ± 0.0
0.675LysMet: 0.675 ± 0.0
6.752LysAsn: 6.752 ± 0.0
4.727LysPro: 4.727 ± 0.0
2.363LysGln: 2.363 ± 0.0
1.688LysArg: 1.688 ± 0.0
4.727LysSer: 4.727 ± 0.0
6.077LysThr: 6.077 ± 0.0
4.051LysVal: 4.051 ± 0.0
1.35LysTrp: 1.35 ± 0.0
3.376LysTyr: 3.376 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.727LeuAla: 4.727 ± 0.0
1.688LeuCys: 1.688 ± 0.0
6.752LeuAsp: 6.752 ± 0.0
6.415LeuGlu: 6.415 ± 0.0
2.363LeuPhe: 2.363 ± 0.0
3.376LeuGly: 3.376 ± 0.0
1.013LeuHis: 1.013 ± 0.0
3.714LeuIle: 3.714 ± 0.0
7.09LeuLys: 7.09 ± 0.0
5.402LeuLeu: 5.402 ± 0.0
2.026LeuMet: 2.026 ± 0.0
5.739LeuAsn: 5.739 ± 0.0
2.026LeuPro: 2.026 ± 0.0
3.038LeuGln: 3.038 ± 0.0
1.688LeuArg: 1.688 ± 0.0
6.077LeuSer: 6.077 ± 0.0
8.44LeuThr: 8.44 ± 0.0
7.09LeuVal: 7.09 ± 0.0
0.338LeuTrp: 0.338 ± 0.0
1.35LeuTyr: 1.35 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.026MetAla: 2.026 ± 0.0
0.338MetCys: 0.338 ± 0.0
3.038MetAsp: 3.038 ± 0.0
0.338MetGlu: 0.338 ± 0.0
1.013MetPhe: 1.013 ± 0.0
0.675MetGly: 0.675 ± 0.0
1.35MetHis: 1.35 ± 0.0
0.675MetIle: 0.675 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.35MetLeu: 1.35 ± 0.0
0.338MetMet: 0.338 ± 0.0
1.013MetAsn: 1.013 ± 0.0
3.038MetPro: 3.038 ± 0.0
1.35MetGln: 1.35 ± 0.0
2.363MetArg: 2.363 ± 0.0
1.688MetSer: 1.688 ± 0.0
1.35MetThr: 1.35 ± 0.0
0.675MetVal: 0.675 ± 0.0
0.675MetTrp: 0.675 ± 0.0
1.013MetTyr: 1.013 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.051AsnAla: 4.051 ± 0.0
1.688AsnCys: 1.688 ± 0.0
3.376AsnAsp: 3.376 ± 0.0
4.051AsnGlu: 4.051 ± 0.0
4.051AsnPhe: 4.051 ± 0.0
3.038AsnGly: 3.038 ± 0.0
2.026AsnHis: 2.026 ± 0.0
4.389AsnIle: 4.389 ± 0.0
2.701AsnLys: 2.701 ± 0.0
5.402AsnLeu: 5.402 ± 0.0
1.688AsnMet: 1.688 ± 0.0
6.415AsnAsn: 6.415 ± 0.0
2.363AsnPro: 2.363 ± 0.0
5.064AsnGln: 5.064 ± 0.0
2.026AsnArg: 2.026 ± 0.0
2.701AsnSer: 2.701 ± 0.0
4.389AsnThr: 4.389 ± 0.0
3.038AsnVal: 3.038 ± 0.0
1.013AsnTrp: 1.013 ± 0.0
2.026AsnTyr: 2.026 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.701ProAla: 2.701 ± 0.0
0.338ProCys: 0.338 ± 0.0
1.35ProAsp: 1.35 ± 0.0
2.363ProGlu: 2.363 ± 0.0
2.363ProPhe: 2.363 ± 0.0
2.363ProGly: 2.363 ± 0.0
1.013ProHis: 1.013 ± 0.0
2.363ProIle: 2.363 ± 0.0
4.727ProLys: 4.727 ± 0.0
4.051ProLeu: 4.051 ± 0.0
1.013ProMet: 1.013 ± 0.0
1.013ProAsn: 1.013 ± 0.0
1.013ProPro: 1.013 ± 0.0
1.35ProGln: 1.35 ± 0.0
1.688ProArg: 1.688 ± 0.0
4.051ProSer: 4.051 ± 0.0
2.701ProThr: 2.701 ± 0.0
2.701ProVal: 2.701 ± 0.0
0.675ProTrp: 0.675 ± 0.0
2.363ProTyr: 2.363 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.35GlnAla: 1.35 ± 0.0
1.35GlnCys: 1.35 ± 0.0
1.688GlnAsp: 1.688 ± 0.0
0.675GlnGlu: 0.675 ± 0.0
0.675GlnPhe: 0.675 ± 0.0
3.038GlnGly: 3.038 ± 0.0
0.0GlnHis: 0.0 ± 0.0
4.051GlnIle: 4.051 ± 0.0
1.35GlnLys: 1.35 ± 0.0
3.714GlnLeu: 3.714 ± 0.0
1.013GlnMet: 1.013 ± 0.0
2.026GlnAsn: 2.026 ± 0.0
1.688GlnPro: 1.688 ± 0.0
1.35GlnGln: 1.35 ± 0.0
1.688GlnArg: 1.688 ± 0.0
2.363GlnSer: 2.363 ± 0.0
3.714GlnThr: 3.714 ± 0.0
2.363GlnVal: 2.363 ± 0.0
0.338GlnTrp: 0.338 ± 0.0
3.038GlnTyr: 3.038 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.026ArgAla: 2.026 ± 0.0
1.013ArgCys: 1.013 ± 0.0
1.688ArgAsp: 1.688 ± 0.0
1.013ArgGlu: 1.013 ± 0.0
2.363ArgPhe: 2.363 ± 0.0
3.376ArgGly: 3.376 ± 0.0
1.688ArgHis: 1.688 ± 0.0
2.701ArgIle: 2.701 ± 0.0
2.026ArgLys: 2.026 ± 0.0
1.688ArgLeu: 1.688 ± 0.0
0.675ArgMet: 0.675 ± 0.0
4.727ArgAsn: 4.727 ± 0.0
1.35ArgPro: 1.35 ± 0.0
1.013ArgGln: 1.013 ± 0.0
2.701ArgArg: 2.701 ± 0.0
1.688ArgSer: 1.688 ± 0.0
2.363ArgThr: 2.363 ± 0.0
3.038ArgVal: 3.038 ± 0.0
0.0ArgTrp: 0.0 ± 0.0
2.026ArgTyr: 2.026 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.389SerAla: 4.389 ± 0.0
0.338SerCys: 0.338 ± 0.0
4.051SerAsp: 4.051 ± 0.0
4.389SerGlu: 4.389 ± 0.0
4.389SerPhe: 4.389 ± 0.0
4.727SerGly: 4.727 ± 0.0
1.013SerHis: 1.013 ± 0.0
5.739SerIle: 5.739 ± 0.0
4.389SerLys: 4.389 ± 0.0
4.051SerLeu: 4.051 ± 0.0
2.363SerMet: 2.363 ± 0.0
2.701SerAsn: 2.701 ± 0.0
2.026SerPro: 2.026 ± 0.0
3.038SerGln: 3.038 ± 0.0
2.026SerArg: 2.026 ± 0.0
6.415SerSer: 6.415 ± 0.0
7.09SerThr: 7.09 ± 0.0
5.402SerVal: 5.402 ± 0.0
0.338SerTrp: 0.338 ± 0.0
2.701SerTyr: 2.701 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.714ThrAla: 3.714 ± 0.0
1.35ThrCys: 1.35 ± 0.0
3.038ThrAsp: 3.038 ± 0.0
2.363ThrGlu: 2.363 ± 0.0
2.026ThrPhe: 2.026 ± 0.0
4.051ThrGly: 4.051 ± 0.0
1.013ThrHis: 1.013 ± 0.0
7.765ThrIle: 7.765 ± 0.0
2.701ThrLys: 2.701 ± 0.0
6.077ThrLeu: 6.077 ± 0.0
1.35ThrMet: 1.35 ± 0.0
4.051ThrAsn: 4.051 ± 0.0
2.701ThrPro: 2.701 ± 0.0
2.363ThrGln: 2.363 ± 0.0
3.714ThrArg: 3.714 ± 0.0
5.402ThrSer: 5.402 ± 0.0
5.402ThrThr: 5.402 ± 0.0
2.701ThrVal: 2.701 ± 0.0
0.675ThrTrp: 0.675 ± 0.0
3.714ThrTyr: 3.714 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.376ValAla: 3.376 ± 0.0
1.35ValCys: 1.35 ± 0.0
3.714ValAsp: 3.714 ± 0.0
4.389ValGlu: 4.389 ± 0.0
2.701ValPhe: 2.701 ± 0.0
3.038ValGly: 3.038 ± 0.0
1.013ValHis: 1.013 ± 0.0
2.363ValIle: 2.363 ± 0.0
6.415ValLys: 6.415 ± 0.0
4.727ValLeu: 4.727 ± 0.0
1.013ValMet: 1.013 ± 0.0
6.077ValAsn: 6.077 ± 0.0
3.714ValPro: 3.714 ± 0.0
2.363ValGln: 2.363 ± 0.0
3.038ValArg: 3.038 ± 0.0
5.739ValSer: 5.739 ± 0.0
3.714ValThr: 3.714 ± 0.0
4.389ValVal: 4.389 ± 0.0
1.013ValTrp: 1.013 ± 0.0
1.35ValTyr: 1.35 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.675TrpAla: 0.675 ± 0.0
0.338TrpCys: 0.338 ± 0.0
0.675TrpAsp: 0.675 ± 0.0
0.675TrpGlu: 0.675 ± 0.0
1.013TrpPhe: 1.013 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.35TrpIle: 1.35 ± 0.0
1.013TrpLys: 1.013 ± 0.0
0.338TrpLeu: 0.338 ± 0.0
0.675TrpMet: 0.675 ± 0.0
0.675TrpAsn: 0.675 ± 0.0
0.338TrpPro: 0.338 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.026TrpArg: 2.026 ± 0.0
1.013TrpSer: 1.013 ± 0.0
1.013TrpThr: 1.013 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.338TrpTrp: 0.338 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.376TyrAla: 3.376 ± 0.0
0.338TyrCys: 0.338 ± 0.0
2.026TyrAsp: 2.026 ± 0.0
1.688TyrGlu: 1.688 ± 0.0
1.688TyrPhe: 1.688 ± 0.0
2.701TyrGly: 2.701 ± 0.0
0.338TyrHis: 0.338 ± 0.0
3.038TyrIle: 3.038 ± 0.0
2.026TyrLys: 2.026 ± 0.0
3.038TyrLeu: 3.038 ± 0.0
1.013TyrMet: 1.013 ± 0.0
2.026TyrAsn: 2.026 ± 0.0
1.688TyrPro: 1.688 ± 0.0
1.688TyrGln: 1.688 ± 0.0
1.35TyrArg: 1.35 ± 0.0
4.051TyrSer: 4.051 ± 0.0
2.026TyrThr: 2.026 ± 0.0
2.701TyrVal: 2.701 ± 0.0
0.338TyrTrp: 0.338 ± 0.0
0.675TyrTyr: 0.675 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2963 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski