Amino acid dipepetide frequency for Chicken associated smacovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.442AlaAla: 3.442 ± 2.125
1.721AlaCys: 1.721 ± 1.335
0.0AlaAsp: 0.0 ± 0.0
0.0AlaGlu: 0.0 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
5.164AlaGly: 5.164 ± 0.79
1.721AlaHis: 1.721 ± 1.335
3.442AlaIle: 3.442 ± 0.272
1.721AlaLys: 1.721 ± 1.335
5.164AlaLeu: 5.164 ± 4.004
5.164AlaMet: 5.164 ± 0.79
0.0AlaAsn: 0.0 ± 0.0
3.442AlaPro: 3.442 ± 0.272
3.442AlaGln: 3.442 ± 2.125
1.721AlaArg: 1.721 ± 1.335
3.442AlaSer: 3.442 ± 2.125
1.721AlaThr: 1.721 ± 1.062
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
6.885AlaTyr: 6.885 ± 0.545
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.721CysLys: 1.721 ± 1.335
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.721CysGln: 1.721 ± 1.335
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.721CysTyr: 1.721 ± 1.335
0.0CysXaa: 0.0 ± 0.0
Asp
1.721AspAla: 1.721 ± 1.062
0.0AspCys: 0.0 ± 0.0
3.442AspAsp: 3.442 ± 0.272
0.0AspGlu: 0.0 ± 0.0
0.0AspPhe: 0.0 ± 0.0
5.164AspGly: 5.164 ± 0.79
0.0AspHis: 0.0 ± 0.0
5.164AspIle: 5.164 ± 1.607
0.0AspLys: 0.0 ± 0.0
3.442AspLeu: 3.442 ± 0.272
0.0AspMet: 0.0 ± 0.0
3.442AspAsn: 3.442 ± 0.272
8.606AspPro: 8.606 ± 0.518
1.721AspGln: 1.721 ± 1.062
8.606AspArg: 8.606 ± 0.518
5.164AspSer: 5.164 ± 3.187
5.164AspThr: 5.164 ± 1.607
1.721AspVal: 1.721 ± 1.335
0.0AspTrp: 0.0 ± 0.0
3.442AspTyr: 3.442 ± 0.272
0.0AspXaa: 0.0 ± 0.0
Glu
10.327GluAla: 10.327 ± 3.214
1.721GluCys: 1.721 ± 1.335
1.721GluAsp: 1.721 ± 1.335
1.721GluGlu: 1.721 ± 1.335
0.0GluPhe: 0.0 ± 0.0
6.885GluGly: 6.885 ± 2.942
1.721GluHis: 1.721 ± 1.335
5.164GluIle: 5.164 ± 1.607
0.0GluLys: 0.0 ± 0.0
3.442GluLeu: 3.442 ± 2.125
0.0GluMet: 0.0 ± 0.804
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
6.885GluGln: 6.885 ± 0.545
6.885GluArg: 6.885 ± 0.545
1.721GluSer: 1.721 ± 1.062
3.442GluThr: 3.442 ± 0.272
1.721GluVal: 1.721 ± 1.062
0.0GluTrp: 0.0 ± 0.0
1.721GluTyr: 1.721 ± 1.062
0.0GluXaa: 0.0 ± 0.0
Phe
1.721PheAla: 1.721 ± 1.062
0.0PheCys: 0.0 ± 0.0
0.0PheAsp: 0.0 ± 0.0
1.721PheGlu: 1.721 ± 1.062
3.442PhePhe: 3.442 ± 0.272
3.442PheGly: 3.442 ± 2.67
0.0PheHis: 0.0 ± 0.0
1.721PheIle: 1.721 ± 1.335
3.442PheLys: 3.442 ± 2.125
1.721PheLeu: 1.721 ± 1.062
0.0PheMet: 0.0 ± 0.0
5.164PheAsn: 5.164 ± 3.187
3.442PhePro: 3.442 ± 2.125
1.721PheGln: 1.721 ± 1.062
1.721PheArg: 1.721 ± 1.062
0.0PheSer: 0.0 ± 0.0
0.0PheThr: 0.0 ± 0.0
1.721PheVal: 1.721 ± 1.062
1.721PheTrp: 1.721 ± 1.062
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
0.0GlyCys: 0.0 ± 0.0
5.164GlyAsp: 5.164 ± 0.79
12.048GlyGlu: 12.048 ± 2.643
3.442GlyPhe: 3.442 ± 0.272
5.164GlyGly: 5.164 ± 1.607
1.721GlyHis: 1.721 ± 1.335
6.885GlyIle: 6.885 ± 4.25
6.885GlyLys: 6.885 ± 5.339
6.885GlyLeu: 6.885 ± 0.545
0.0GlyMet: 0.0 ± 0.0
3.442GlyAsn: 3.442 ± 2.125
0.0GlyPro: 0.0 ± 0.0
3.442GlyGln: 3.442 ± 2.125
5.164GlyArg: 5.164 ± 3.187
1.721GlySer: 1.721 ± 1.062
1.721GlyThr: 1.721 ± 1.335
5.164GlyVal: 5.164 ± 1.607
3.442GlyTrp: 3.442 ± 2.125
5.164GlyTyr: 5.164 ± 0.79
0.0GlyXaa: 0.0 ± 0.0
His
1.721HisAla: 1.721 ± 1.062
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.721HisPhe: 1.721 ± 1.062
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.721HisLys: 1.721 ± 1.335
1.721HisLeu: 1.721 ± 1.335
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.721HisSer: 1.721 ± 1.335
0.0HisThr: 0.0 ± 0.0
1.721HisVal: 1.721 ± 1.335
1.721HisTrp: 1.721 ± 1.335
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.442IleAla: 3.442 ± 0.272
0.0IleCys: 0.0 ± 0.0
1.721IleAsp: 1.721 ± 1.335
3.442IleGlu: 3.442 ± 0.272
3.442IlePhe: 3.442 ± 2.125
3.442IleGly: 3.442 ± 2.125
0.0IleHis: 0.0 ± 0.0
10.327IleIle: 10.327 ± 5.611
1.721IleLys: 1.721 ± 1.335
3.442IleLeu: 3.442 ± 0.272
1.721IleMet: 1.721 ± 1.335
5.164IleAsn: 5.164 ± 1.607
5.164IlePro: 5.164 ± 4.004
1.721IleGln: 1.721 ± 1.335
6.885IleArg: 6.885 ± 2.942
3.442IleSer: 3.442 ± 2.125
3.442IleThr: 3.442 ± 2.125
1.721IleVal: 1.721 ± 1.335
3.442IleTrp: 3.442 ± 0.272
1.721IleTyr: 1.721 ± 1.062
0.0IleXaa: 0.0 ± 0.0
Lys
5.164LysAla: 5.164 ± 1.607
0.0LysCys: 0.0 ± 0.0
6.885LysAsp: 6.885 ± 0.545
6.885LysGlu: 6.885 ± 2.942
3.442LysPhe: 3.442 ± 2.125
0.0LysGly: 0.0 ± 0.0
0.0LysHis: 0.0 ± 0.0
3.442LysIle: 3.442 ± 2.67
1.721LysLys: 1.721 ± 1.335
5.164LysLeu: 5.164 ± 0.79
0.0LysMet: 0.0 ± 0.0
3.442LysAsn: 3.442 ± 2.67
1.721LysPro: 1.721 ± 1.335
3.442LysGln: 3.442 ± 2.67
3.442LysArg: 3.442 ± 2.67
1.721LysSer: 1.721 ± 1.335
6.885LysThr: 6.885 ± 1.853
5.164LysVal: 5.164 ± 1.607
1.721LysTrp: 1.721 ± 1.335
5.164LysTyr: 5.164 ± 1.607
0.0LysXaa: 0.0 ± 0.0
Leu
1.721LeuAla: 1.721 ± 1.062
0.0LeuCys: 0.0 ± 0.0
3.442LeuAsp: 3.442 ± 2.125
3.442LeuGlu: 3.442 ± 0.272
1.721LeuPhe: 1.721 ± 1.062
8.606LeuGly: 8.606 ± 0.518
0.0LeuHis: 0.0 ± 0.0
3.442LeuIle: 3.442 ± 0.272
10.327LeuLys: 10.327 ± 5.611
5.164LeuLeu: 5.164 ± 0.79
1.721LeuMet: 1.721 ± 1.062
1.721LeuAsn: 1.721 ± 1.062
6.885LeuPro: 6.885 ± 4.25
3.442LeuGln: 3.442 ± 2.125
1.721LeuArg: 1.721 ± 1.335
5.164LeuSer: 5.164 ± 1.607
3.442LeuThr: 3.442 ± 0.272
6.885LeuVal: 6.885 ± 2.942
1.721LeuTrp: 1.721 ± 1.335
3.442LeuTyr: 3.442 ± 0.272
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
3.442MetAsp: 3.442 ± 0.272
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.721MetLys: 1.721 ± 1.335
5.164MetLeu: 5.164 ± 1.607
0.0MetMet: 0.0 ± 0.0
5.164MetAsn: 5.164 ± 0.79
0.0MetPro: 0.0 ± 0.0
1.721MetGln: 1.721 ± 1.335
1.721MetArg: 1.721 ± 1.062
1.721MetSer: 1.721 ± 1.062
1.721MetThr: 1.721 ± 1.335
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
3.442AsnAsp: 3.442 ± 0.272
8.606AsnGlu: 8.606 ± 4.277
3.442AsnPhe: 3.442 ± 0.272
3.442AsnGly: 3.442 ± 2.125
0.0AsnHis: 0.0 ± 0.0
5.164AsnIle: 5.164 ± 4.004
3.442AsnLys: 3.442 ± 2.67
5.164AsnLeu: 5.164 ± 0.79
1.721AsnMet: 1.721 ± 1.062
0.0AsnAsn: 0.0 ± 0.0
3.442AsnPro: 3.442 ± 2.125
0.0AsnGln: 0.0 ± 0.0
1.721AsnArg: 1.721 ± 1.062
3.442AsnSer: 3.442 ± 2.125
3.442AsnThr: 3.442 ± 0.272
6.885AsnVal: 6.885 ± 4.25
0.0AsnTrp: 0.0 ± 0.0
1.721AsnTyr: 1.721 ± 1.062
0.0AsnXaa: 0.0 ± 0.0
Pro
1.721ProAla: 1.721 ± 1.062
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.442ProGlu: 3.442 ± 0.272
1.721ProPhe: 1.721 ± 1.062
8.606ProGly: 8.606 ± 5.312
0.0ProHis: 0.0 ± 0.0
1.721ProIle: 1.721 ± 1.062
3.442ProLys: 3.442 ± 0.272
3.442ProLeu: 3.442 ± 0.272
1.721ProMet: 1.721 ± 1.062
1.721ProAsn: 1.721 ± 1.335
3.442ProPro: 3.442 ± 0.272
3.442ProGln: 3.442 ± 2.125
8.606ProArg: 8.606 ± 4.277
0.0ProSer: 0.0 ± 0.0
5.164ProThr: 5.164 ± 0.79
5.164ProVal: 5.164 ± 0.79
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.164GlnAla: 5.164 ± 0.79
0.0GlnCys: 0.0 ± 0.0
3.442GlnAsp: 3.442 ± 2.125
0.0GlnGlu: 0.0 ± 0.0
1.721GlnPhe: 1.721 ± 1.062
5.164GlnGly: 5.164 ± 1.607
0.0GlnHis: 0.0 ± 0.0
3.442GlnIle: 3.442 ± 0.272
1.721GlnLys: 1.721 ± 1.335
5.164GlnLeu: 5.164 ± 3.187
0.0GlnMet: 0.0 ± 0.0
3.442GlnAsn: 3.442 ± 0.272
0.0GlnPro: 0.0 ± 0.0
1.721GlnGln: 1.721 ± 1.062
0.0GlnArg: 0.0 ± 0.0
3.442GlnSer: 3.442 ± 0.272
6.885GlnThr: 6.885 ± 1.853
5.164GlnVal: 5.164 ± 3.187
1.721GlnTrp: 1.721 ± 1.335
1.721GlnTyr: 1.721 ± 1.062
0.0GlnXaa: 0.0 ± 0.0
Arg
1.721ArgAla: 1.721 ± 1.335
0.0ArgCys: 0.0 ± 0.0
1.721ArgAsp: 1.721 ± 1.062
3.442ArgGlu: 3.442 ± 2.125
5.164ArgPhe: 5.164 ± 0.79
3.442ArgGly: 3.442 ± 2.67
1.721ArgHis: 1.721 ± 1.062
1.721ArgIle: 1.721 ± 1.062
5.164ArgLys: 5.164 ± 1.607
3.442ArgLeu: 3.442 ± 0.272
1.721ArgMet: 1.721 ± 1.335
1.721ArgAsn: 1.721 ± 1.335
0.0ArgPro: 0.0 ± 0.0
3.442ArgGln: 3.442 ± 2.125
1.721ArgArg: 1.721 ± 1.062
3.442ArgSer: 3.442 ± 0.272
1.721ArgThr: 1.721 ± 1.335
8.606ArgVal: 8.606 ± 0.518
3.442ArgTrp: 3.442 ± 0.272
3.442ArgTyr: 3.442 ± 0.272
0.0ArgXaa: 0.0 ± 0.0
Ser
5.164SerAla: 5.164 ± 4.004
0.0SerCys: 0.0 ± 0.0
6.885SerAsp: 6.885 ± 1.853
3.442SerGlu: 3.442 ± 0.272
0.0SerPhe: 0.0 ± 0.0
3.442SerGly: 3.442 ± 2.125
0.0SerHis: 0.0 ± 0.0
0.0SerIle: 0.0 ± 0.0
3.442SerLys: 3.442 ± 2.125
1.721SerLeu: 1.721 ± 1.062
3.442SerMet: 3.442 ± 2.091
1.721SerAsn: 1.721 ± 1.335
0.0SerPro: 0.0 ± 0.0
3.442SerGln: 3.442 ± 2.125
0.0SerArg: 0.0 ± 0.0
3.442SerSer: 3.442 ± 2.125
5.164SerThr: 5.164 ± 3.187
3.442SerVal: 3.442 ± 2.125
3.442SerTrp: 3.442 ± 2.67
3.442SerTyr: 3.442 ± 2.125
0.0SerXaa: 0.0 ± 0.0
Thr
1.721ThrAla: 1.721 ± 1.335
0.0ThrCys: 0.0 ± 0.0
1.721ThrAsp: 1.721 ± 1.062
1.721ThrGlu: 1.721 ± 1.062
0.0ThrPhe: 0.0 ± 0.0
3.442ThrGly: 3.442 ± 2.125
0.0ThrHis: 0.0 ± 0.0
6.885ThrIle: 6.885 ± 0.545
0.0ThrLys: 0.0 ± 0.0
6.885ThrLeu: 6.885 ± 0.545
0.0ThrMet: 0.0 ± 0.0
8.606ThrAsn: 8.606 ± 0.518
5.164ThrPro: 5.164 ± 0.79
3.442ThrGln: 3.442 ± 2.125
1.721ThrArg: 1.721 ± 1.062
5.164ThrSer: 5.164 ± 1.607
0.0ThrThr: 0.0 ± 0.0
5.164ThrVal: 5.164 ± 3.187
1.721ThrTrp: 1.721 ± 1.335
6.885ThrTyr: 6.885 ± 0.545
0.0ThrXaa: 0.0 ± 0.0
Val
3.442ValAla: 3.442 ± 2.125
0.0ValCys: 0.0 ± 0.0
3.442ValAsp: 3.442 ± 2.125
1.721ValGlu: 1.721 ± 1.335
0.0ValPhe: 0.0 ± 0.0
6.885ValGly: 6.885 ± 0.545
5.164ValHis: 5.164 ± 1.607
1.721ValIle: 1.721 ± 1.062
8.606ValLys: 8.606 ± 0.518
5.164ValLeu: 5.164 ± 1.607
1.721ValMet: 1.721 ± 1.335
1.721ValAsn: 1.721 ± 1.062
8.606ValPro: 8.606 ± 2.915
0.0ValGln: 0.0 ± 0.0
1.721ValArg: 1.721 ± 1.062
3.442ValSer: 3.442 ± 0.272
5.164ValThr: 5.164 ± 3.187
1.721ValVal: 1.721 ± 1.062
1.721ValTrp: 1.721 ± 1.335
5.164ValTyr: 5.164 ± 1.607
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.721TrpCys: 1.721 ± 1.335
3.442TrpAsp: 3.442 ± 2.67
1.721TrpGlu: 1.721 ± 1.335
0.0TrpPhe: 0.0 ± 0.0
5.164TrpGly: 5.164 ± 3.187
0.0TrpHis: 0.0 ± 0.0
1.721TrpIle: 1.721 ± 1.335
1.721TrpLys: 1.721 ± 1.335
1.721TrpLeu: 1.721 ± 1.062
0.0TrpMet: 0.0 ± 0.0
5.164TrpAsn: 5.164 ± 0.79
0.0TrpPro: 0.0 ± 0.0
1.721TrpGln: 1.721 ± 1.335
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.721TrpThr: 1.721 ± 1.335
1.721TrpVal: 1.721 ± 1.335
0.0TrpTrp: 0.0 ± 0.0
3.442TrpTyr: 3.442 ± 0.272
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
6.885TyrAsp: 6.885 ± 2.942
3.442TyrGlu: 3.442 ± 2.67
3.442TyrPhe: 3.442 ± 2.125
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
3.442TyrIle: 3.442 ± 0.272
6.885TyrLys: 6.885 ± 1.853
0.0TyrLeu: 0.0 ± 0.0
1.721TyrMet: 1.721 ± 1.335
3.442TyrAsn: 3.442 ± 2.125
3.442TyrPro: 3.442 ± 0.272
3.442TyrGln: 3.442 ± 0.272
3.442TyrArg: 3.442 ± 2.125
3.442TyrSer: 3.442 ± 0.272
3.442TyrThr: 3.442 ± 0.272
3.442TyrVal: 3.442 ± 0.272
5.164TyrTrp: 5.164 ± 0.79
5.164TyrTyr: 5.164 ± 3.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski