Amino acid dipeptide frequency for Avian encephalomyelitis virus (strain Calnek vaccine) (AEV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
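The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the overlapping-window counting, the mapping of non-standard letters to "X" (Xaa), and the toy sequence are all assumptions.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform baseline from the text: 1/400 of all dipeptides, expressed in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5


def dipeptide_permille(sequence):
    """Return {dipeptide: frequency in per mille} for one protein sequence.

    Assumption: dipeptides are counted as overlapping windows, and any
    non-standard letter is mapped to 'X' (Xaa).
    """
    seq = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    return {dp: 1000.0 * n / len(pairs) for dp, n in counts.items()}


def classify(value, expected=UNIFORM_EXPECTATION):
    """Label a per-mille value relative to the uniform baseline."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Hypothetical toy sequence, not taken from the AEV proteome.
freqs = dipeptide_permille("MAAGAVGA")
```

With this baseline, a table value such as 9.376‰ would count as overrepresented and 2.344‰ as underrepresented.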

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
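As a small worked example of this unit conversion, the helpers below turn a per-mille value into a fraction or a percentage. The function names are made up for illustration.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def permille_to_percent(value):
    """Convert a per-mille value to a percentage."""
    return value / 10.0


# The uniform baseline of 2.5 per mille corresponds to 0.25 percent.
baseline_fraction = permille_to_fraction(2.5)
baseline_percent = permille_to_percent(2.5)
```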

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (rows: first residue; columns: second residue). Every value carries an estimated error of ± 0.0.

     Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  2.344  0.938  0.938  1.875  0.469  6.095  1.406  4.688  4.688  2.813  2.813  1.406  1.875  5.157  3.282  4.219  3.751  6.095  0.469  2.344  0.0
Cys  0.938  0.0    0.469  1.406  0.938  0.938  1.406  0.938  1.406  1.406  0.469  1.406  2.813  0.469  0.938  1.406  2.344  0.938  0.0    1.406  0.0
Asp  2.813  0.469  4.219  5.626  2.344  2.344  1.406  6.095  3.282  7.032  0.469  0.469  2.344  3.751  1.875  2.344  2.813  3.751  0.938  0.469  0.0
Glu  2.344  0.938  4.219  6.095  2.813  5.157  0.938  2.813  4.219  6.564  1.406  1.406  2.344  1.406  2.813  2.813  2.344  4.688  1.406  1.406  0.0
Phe  1.875  2.813  1.406  1.406  0.938  1.406  0.938  1.406  1.406  2.813  1.875  1.406  1.875  2.344  2.813  5.157  3.282  3.282  0.938  1.406  0.0
Gly  3.751  0.938  3.751  4.219  2.344  4.219  0.469  5.157  6.564  7.032  1.406  1.406  2.344  2.344  3.282  3.282  5.157  6.564  0.469  1.406  0.0
His  1.406  0.469  0.469  0.938  1.875  3.751  0.938  0.938  1.875  0.0    1.406  0.0    1.875  0.469  0.0    2.344  1.406  3.282  0.469  0.469  0.0
Ile  4.688  1.875  3.751  5.626  0.938  0.938  0.938  2.344  4.688  3.282  2.344  4.219  2.344  2.344  2.344  5.157  3.282  5.626  1.875  1.875  0.0
Lys  3.751  0.938  1.875  6.095  1.406  0.938  0.469  4.688  2.813  4.219  4.688  2.813  1.875  0.938  0.938  3.751  7.97   4.219  2.344  3.282  0.0
Leu  5.626  2.813  7.501  5.626  3.282  8.439  2.813  3.282  3.282  3.751  1.875  3.282  0.938  1.875  4.688  7.501  3.751  7.97   1.406  4.219  0.0
Met  2.344  0.938  2.344  1.875  1.875  1.406  0.0    1.875  2.344  3.751  0.938  1.406  1.406  1.406  1.406  1.406  1.406  1.875  0.469  0.0    0.0
Asn  3.282  0.938  0.469  1.875  0.469  1.875  0.938  1.875  0.938  4.688  0.938  0.938  1.406  0.469  1.875  3.282  2.813  4.219  0.469  1.875  0.0
Pro  0.938  0.938  0.938  1.875  2.813  4.219  0.0    2.344  1.875  5.157  0.0    0.938  0.0    1.875  4.219  0.938  3.751  2.813  0.0    2.813  0.0
Gln  0.938  1.406  2.813  0.938  1.406  2.813  0.938  3.282  1.406  3.751  0.469  0.469  0.469  0.938  0.938  4.688  1.406  3.282  0.938  1.406  0.0
Arg  5.157  0.469  1.406  2.813  3.282  3.282  0.938  3.751  2.813  5.157  1.406  1.406  0.469  0.469  1.406  1.406  2.344  3.751  0.469  1.406  0.0
Ser  3.751  1.406  4.219  2.813  4.688  3.282  1.875  6.564  4.219  4.219  2.813  2.813  5.157  1.406  2.813  8.908  6.564  8.908  0.469  1.406  0.0
Thr  3.282  0.938  1.875  2.344  2.813  3.282  1.875  0.938  4.219  5.626  2.344  5.157  4.219  3.282  0.938  7.501  5.626  7.501  0.938  4.219  0.0
Val  2.813  0.938  7.97   4.219  4.688  9.376  2.813  4.219  4.219  7.97   1.875  2.344  3.751  2.813  4.688  7.032  5.626  6.564  0.469  4.219  0.0
Trp  0.938  0.469  0.938  0.0    1.406  0.469  0.469  0.469  1.875  1.875  0.469  0.469  0.938  0.469  0.469  1.406  1.406  0.0    0.0    0.938  0.0
Tyr  4.688  1.406  3.282  0.469  0.469  1.875  2.813  2.813  1.875  2.813  0.0    1.875  0.0    0.469  1.875  3.751  1.875  3.282  0.938  0.938  0.0
Xaa  0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0
Statistics based on 1 protein (2134 amino acids)
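Because the statistics come from a single protein, the per-mille values can be mapped back to approximate raw counts. The sketch below assumes overlapping dipeptide windows, so a protein of 2134 residues yields 2133 dipeptides and one occurrence corresponds to roughly 0.469‰. That counting scheme is an inference from the numbers, not something the page states, and the function name is hypothetical.

```python
N_RESIDUES = 2134                 # from the statistics line above
N_DIPEPTIDES = N_RESIDUES - 1     # assumption: overlapping windows in one protein


def permille_to_count(value, n_dipeptides=N_DIPEPTIDES):
    """Estimate the raw occurrence count behind a rounded per-mille value."""
    return round(value / 1000.0 * n_dipeptides)


# A few values copied from the table above.
examples = {value: permille_to_count(value) for value in (0.469, 2.344, 7.97, 9.376)}
```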

Note: The error has been estimated by bootstrapping (×100) at the protein level.
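One possible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The exact Proteome-pI procedure is not described here, so the function names, the use of the standard deviation, and the toy sequences are assumptions. Under this reading, a proteome with a single protein always yields the same resample, which would be consistent with the ± 0.0 errors reported above.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of protein sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy inputs, not real AEV sequences.
single_protein_error = bootstrap_error(["MAAGAVGA"], "GA")
two_protein_error = bootstrap_error(["AAAA", "GGGG"], "AA")
```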

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski