Amino acid dipepetide frequency for Sheep faeces associated smacovirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.902AlaAla: 4.902 ± 2.929
0.0AlaCys: 0.0 ± 0.0
8.17AlaAsp: 8.17 ± 2.579
3.268AlaGlu: 3.268 ± 2.652
4.902AlaPhe: 4.902 ± 0.626
3.268AlaGly: 3.268 ± 1.953
3.268AlaHis: 3.268 ± 0.35
6.536AlaIle: 6.536 ± 0.7
1.634AlaLys: 1.634 ± 0.976
8.17AlaLeu: 8.17 ± 2.026
0.0AlaMet: 0.0 ± 0.0
3.268AlaAsn: 3.268 ± 0.35
1.634AlaPro: 1.634 ± 0.976
0.0AlaGln: 0.0 ± 0.0
1.634AlaArg: 1.634 ± 1.326
8.17AlaSer: 8.17 ± 4.881
3.268AlaThr: 3.268 ± 1.953
8.17AlaVal: 8.17 ± 4.881
0.0AlaTrp: 0.0 ± 0.0
3.268AlaTyr: 3.268 ± 0.35
0.0AlaXaa: 0.0 ± 0.0
Cys
1.634CysAla: 1.634 ± 0.976
0.0CysCys: 0.0 ± 0.0
1.634CysAsp: 1.634 ± 0.976
3.268CysGlu: 3.268 ± 2.652
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.634CysArg: 1.634 ± 1.326
4.902CysSer: 4.902 ± 3.978
3.268CysThr: 3.268 ± 0.35
0.0CysVal: 0.0 ± 0.0
1.634CysTrp: 1.634 ± 1.326
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.634AspAla: 1.634 ± 0.976
0.0AspCys: 0.0 ± 0.0
3.268AspAsp: 3.268 ± 0.35
1.634AspGlu: 1.634 ± 0.976
1.634AspPhe: 1.634 ± 1.326
6.536AspGly: 6.536 ± 0.7
1.634AspHis: 1.634 ± 1.326
6.536AspIle: 6.536 ± 1.603
0.0AspLys: 0.0 ± 0.0
4.902AspLeu: 4.902 ± 0.626
1.634AspMet: 1.634 ± 0.976
4.902AspAsn: 4.902 ± 1.676
8.17AspPro: 8.17 ± 2.026
1.634AspGln: 1.634 ± 0.976
1.634AspArg: 1.634 ± 1.326
3.268AspSer: 3.268 ± 1.953
1.634AspThr: 1.634 ± 1.326
8.17AspVal: 8.17 ± 2.026
1.634AspTrp: 1.634 ± 1.326
1.634AspTyr: 1.634 ± 1.326
0.0AspXaa: 0.0 ± 0.0
Glu
3.268GluAla: 3.268 ± 0.35
1.634GluCys: 1.634 ± 0.976
0.0GluAsp: 0.0 ± 0.0
1.634GluGlu: 1.634 ± 1.326
1.634GluPhe: 1.634 ± 0.976
1.634GluGly: 1.634 ± 1.326
3.268GluHis: 3.268 ± 0.35
1.634GluIle: 1.634 ± 1.326
9.804GluLys: 9.804 ± 3.352
3.268GluLeu: 3.268 ± 0.35
3.268GluMet: 3.268 ± 0.35
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
1.634GluGln: 1.634 ± 1.326
4.902GluArg: 4.902 ± 3.978
0.0GluSer: 0.0 ± 0.0
6.536GluThr: 6.536 ± 0.7
0.0GluVal: 0.0 ± 0.0
1.634GluTrp: 1.634 ± 1.326
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.634PheAla: 1.634 ± 1.326
0.0PheCys: 0.0 ± 0.0
3.268PheAsp: 3.268 ± 2.652
3.268PheGlu: 3.268 ± 2.652
6.536PhePhe: 6.536 ± 1.603
3.268PheGly: 3.268 ± 1.953
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
4.902PheLys: 4.902 ± 2.929
1.634PheLeu: 1.634 ± 0.976
0.0PheMet: 0.0 ± 0.0
3.268PheAsn: 3.268 ± 1.953
0.0PhePro: 0.0 ± 0.0
3.268PheGln: 3.268 ± 0.35
3.268PheArg: 3.268 ± 0.35
3.268PheSer: 3.268 ± 1.953
1.634PheThr: 1.634 ± 1.326
3.268PheVal: 3.268 ± 0.35
1.634PheTrp: 1.634 ± 1.326
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.902GlyAla: 4.902 ± 0.626
0.0GlyCys: 0.0 ± 0.0
0.0GlyAsp: 0.0 ± 0.0
1.634GlyGlu: 1.634 ± 1.326
1.634GlyPhe: 1.634 ± 0.976
6.536GlyGly: 6.536 ± 1.603
0.0GlyHis: 0.0 ± 0.0
1.634GlyIle: 1.634 ± 1.326
4.902GlyLys: 4.902 ± 3.978
8.17GlyLeu: 8.17 ± 0.277
1.634GlyMet: 1.634 ± 0.976
3.268GlyAsn: 3.268 ± 1.953
1.634GlyPro: 1.634 ± 1.326
0.0GlyGln: 0.0 ± 0.0
8.17GlyArg: 8.17 ± 2.026
3.268GlySer: 3.268 ± 1.953
13.072GlyThr: 13.072 ± 3.205
4.902GlyVal: 4.902 ± 2.929
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
3.268HisAsp: 3.268 ± 0.35
0.0HisGlu: 0.0 ± 0.0
3.268HisPhe: 3.268 ± 2.652
1.634HisGly: 1.634 ± 1.326
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
1.634HisMet: 1.634 ± 1.326
0.0HisAsn: 0.0 ± 0.0
1.634HisPro: 1.634 ± 0.976
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
4.902HisVal: 4.902 ± 0.626
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
8.17IleAla: 8.17 ± 2.579
3.268IleCys: 3.268 ± 2.652
3.268IleAsp: 3.268 ± 2.652
3.268IleGlu: 3.268 ± 2.652
1.634IlePhe: 1.634 ± 1.326
4.902IleGly: 4.902 ± 2.929
0.0IleHis: 0.0 ± 0.0
6.536IleIle: 6.536 ± 3.002
0.0IleLys: 0.0 ± 0.0
0.0IleLeu: 0.0 ± 0.0
3.268IleMet: 3.268 ± 0.35
1.634IleAsn: 1.634 ± 0.976
4.902IlePro: 4.902 ± 1.676
1.634IleGln: 1.634 ± 0.976
6.536IleArg: 6.536 ± 3.002
6.536IleSer: 6.536 ± 1.603
1.634IleThr: 1.634 ± 0.976
3.268IleVal: 3.268 ± 0.35
0.0IleTrp: 0.0 ± 0.0
1.634IleTyr: 1.634 ± 0.976
0.0IleXaa: 0.0 ± 0.0
Lys
6.536LysAla: 6.536 ± 1.603
0.0LysCys: 0.0 ± 0.0
3.268LysAsp: 3.268 ± 2.652
1.634LysGlu: 1.634 ± 1.326
1.634LysPhe: 1.634 ± 0.976
3.268LysGly: 3.268 ± 0.35
0.0LysHis: 0.0 ± 0.0
3.268LysIle: 3.268 ± 0.35
6.536LysLys: 6.536 ± 3.002
4.902LysLeu: 4.902 ± 3.978
1.634LysMet: 1.634 ± 1.577
1.634LysAsn: 1.634 ± 1.326
3.268LysPro: 3.268 ± 2.652
3.268LysGln: 3.268 ± 2.652
3.268LysArg: 3.268 ± 0.35
3.268LysSer: 3.268 ± 0.35
1.634LysThr: 1.634 ± 1.326
1.634LysVal: 1.634 ± 1.326
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
1.634LeuAla: 1.634 ± 0.976
0.0LeuCys: 0.0 ± 0.0
9.804LeuAsp: 9.804 ± 3.352
3.268LeuGlu: 3.268 ± 1.953
1.634LeuPhe: 1.634 ± 0.976
3.268LeuGly: 3.268 ± 0.35
1.634LeuHis: 1.634 ± 1.326
3.268LeuIle: 3.268 ± 1.953
3.268LeuLys: 3.268 ± 2.652
6.536LeuLeu: 6.536 ± 1.603
1.634LeuMet: 1.634 ± 0.976
6.536LeuAsn: 6.536 ± 3.905
1.634LeuPro: 1.634 ± 0.976
1.634LeuGln: 1.634 ± 0.976
1.634LeuArg: 1.634 ± 0.976
11.438LeuSer: 11.438 ± 2.376
1.634LeuThr: 1.634 ± 1.326
8.17LeuVal: 8.17 ± 2.026
1.634LeuTrp: 1.634 ± 1.326
3.268LeuTyr: 3.268 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.268MetGlu: 3.268 ± 0.35
1.634MetPhe: 1.634 ± 0.976
1.634MetGly: 1.634 ± 0.976
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.268MetLys: 3.268 ± 2.652
1.634MetLeu: 1.634 ± 0.976
1.634MetMet: 1.634 ± 1.326
1.634MetAsn: 1.634 ± 0.976
3.268MetPro: 3.268 ± 1.953
3.268MetGln: 3.268 ± 0.35
1.634MetArg: 1.634 ± 0.976
0.0MetSer: 0.0 ± 0.0
4.902MetThr: 4.902 ± 0.626
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.902AsnAla: 4.902 ± 2.929
0.0AsnCys: 0.0 ± 0.0
1.634AsnAsp: 1.634 ± 0.976
0.0AsnGlu: 0.0 ± 0.0
3.268AsnPhe: 3.268 ± 0.35
6.536AsnGly: 6.536 ± 1.603
0.0AsnHis: 0.0 ± 0.0
4.902AsnIle: 4.902 ± 0.626
3.268AsnLys: 3.268 ± 0.35
3.268AsnLeu: 3.268 ± 0.35
3.268AsnMet: 3.268 ± 1.953
4.902AsnAsn: 4.902 ± 0.626
4.902AsnPro: 4.902 ± 0.626
0.0AsnGln: 0.0 ± 0.0
1.634AsnArg: 1.634 ± 0.976
3.268AsnSer: 3.268 ± 1.953
0.0AsnThr: 0.0 ± 0.0
4.902AsnVal: 4.902 ± 2.929
1.634AsnTrp: 1.634 ± 1.326
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.268ProAla: 3.268 ± 1.953
0.0ProCys: 0.0 ± 0.0
1.634ProAsp: 1.634 ± 1.326
0.0ProGlu: 0.0 ± 0.0
0.0ProPhe: 0.0 ± 0.0
6.536ProGly: 6.536 ± 1.603
0.0ProHis: 0.0 ± 0.0
1.634ProIle: 1.634 ± 0.976
0.0ProLys: 0.0 ± 0.0
8.17ProLeu: 8.17 ± 0.277
1.634ProMet: 1.634 ± 0.976
1.634ProAsn: 1.634 ± 0.976
3.268ProPro: 3.268 ± 1.953
1.634ProGln: 1.634 ± 0.976
6.536ProArg: 6.536 ± 5.305
3.268ProSer: 3.268 ± 1.953
3.268ProThr: 3.268 ± 0.35
4.902ProVal: 4.902 ± 0.626
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.634GlnAla: 1.634 ± 1.326
1.634GlnCys: 1.634 ± 1.326
4.902GlnAsp: 4.902 ± 0.626
1.634GlnGlu: 1.634 ± 0.976
0.0GlnPhe: 0.0 ± 0.0
1.634GlnGly: 1.634 ± 0.976
0.0GlnHis: 0.0 ± 0.0
4.902GlnIle: 4.902 ± 1.676
1.634GlnLys: 1.634 ± 1.326
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
3.268GlnGln: 3.268 ± 1.953
1.634GlnArg: 1.634 ± 0.976
3.268GlnSer: 3.268 ± 1.953
3.268GlnThr: 3.268 ± 1.953
1.634GlnVal: 1.634 ± 1.326
1.634GlnTrp: 1.634 ± 1.326
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.634ArgAla: 1.634 ± 1.326
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
1.634ArgGlu: 1.634 ± 1.326
4.902ArgPhe: 4.902 ± 1.676
6.536ArgGly: 6.536 ± 5.305
0.0ArgHis: 0.0 ± 0.0
6.536ArgIle: 6.536 ± 3.002
3.268ArgLys: 3.268 ± 2.652
3.268ArgLeu: 3.268 ± 0.35
1.634ArgMet: 1.634 ± 0.976
3.268ArgAsn: 3.268 ± 0.35
1.634ArgPro: 1.634 ± 0.976
1.634ArgGln: 1.634 ± 1.326
3.268ArgArg: 3.268 ± 0.35
0.0ArgSer: 0.0 ± 0.0
4.902ArgThr: 4.902 ± 0.626
1.634ArgVal: 1.634 ± 0.976
3.268ArgTrp: 3.268 ± 0.35
8.17ArgTyr: 8.17 ± 2.026
0.0ArgXaa: 0.0 ± 0.0
Ser
9.804SerAla: 9.804 ± 1.253
6.536SerCys: 6.536 ± 0.7
9.804SerAsp: 9.804 ± 1.253
3.268SerGlu: 3.268 ± 1.953
1.634SerPhe: 1.634 ± 0.976
1.634SerGly: 1.634 ± 1.326
3.268SerHis: 3.268 ± 2.652
6.536SerIle: 6.536 ± 3.905
3.268SerLys: 3.268 ± 0.35
3.268SerLeu: 3.268 ± 1.953
0.0SerMet: 0.0 ± 0.0
4.902SerAsn: 4.902 ± 2.929
0.0SerPro: 0.0 ± 0.0
1.634SerGln: 1.634 ± 0.976
0.0SerArg: 0.0 ± 0.0
4.902SerSer: 4.902 ± 2.929
4.902SerThr: 4.902 ± 2.929
6.536SerVal: 6.536 ± 1.603
1.634SerTrp: 1.634 ± 0.976
3.268SerTyr: 3.268 ± 0.35
0.0SerXaa: 0.0 ± 0.0
Thr
8.17ThrAla: 8.17 ± 0.277
0.0ThrCys: 0.0 ± 0.0
3.268ThrAsp: 3.268 ± 0.35
1.634ThrGlu: 1.634 ± 1.326
0.0ThrPhe: 0.0 ± 0.0
1.634ThrGly: 1.634 ± 0.976
0.0ThrHis: 0.0 ± 0.0
3.268ThrIle: 3.268 ± 2.652
1.634ThrLys: 1.634 ± 1.326
9.804ThrLeu: 9.804 ± 3.555
3.268ThrMet: 3.268 ± 2.074
1.634ThrAsn: 1.634 ± 1.326
4.902ThrPro: 4.902 ± 2.929
3.268ThrGln: 3.268 ± 1.953
1.634ThrArg: 1.634 ± 1.326
4.902ThrSer: 4.902 ± 0.626
8.17ThrThr: 8.17 ± 0.277
8.17ThrVal: 8.17 ± 4.881
0.0ThrTrp: 0.0 ± 0.0
4.902ThrTyr: 4.902 ± 2.929
0.0ThrXaa: 0.0 ± 0.0
Val
4.902ValAla: 4.902 ± 0.626
1.634ValCys: 1.634 ± 1.326
3.268ValAsp: 3.268 ± 0.35
6.536ValGlu: 6.536 ± 1.603
4.902ValPhe: 4.902 ± 1.676
3.268ValGly: 3.268 ± 0.35
1.634ValHis: 1.634 ± 0.976
1.634ValIle: 1.634 ± 0.976
3.268ValLys: 3.268 ± 0.35
6.536ValLeu: 6.536 ± 3.002
0.0ValMet: 0.0 ± 0.0
4.902ValAsn: 4.902 ± 2.929
4.902ValPro: 4.902 ± 0.626
1.634ValGln: 1.634 ± 0.976
6.536ValArg: 6.536 ± 1.603
9.804ValSer: 9.804 ± 5.858
4.902ValThr: 4.902 ± 2.929
6.536ValVal: 6.536 ± 0.7
0.0ValTrp: 0.0 ± 0.0
1.634ValTyr: 1.634 ± 0.976
0.0ValXaa: 0.0 ± 0.0
Trp
1.634TrpAla: 1.634 ± 1.326
3.268TrpCys: 3.268 ± 2.652
0.0TrpAsp: 0.0 ± 0.0
1.634TrpGlu: 1.634 ± 1.326
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
4.902TrpAsn: 4.902 ± 0.626
0.0TrpPro: 0.0 ± 0.0
3.268TrpGln: 3.268 ± 2.652
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.634TrpThr: 1.634 ± 1.326
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.268TyrAla: 3.268 ± 1.953
0.0TyrCys: 0.0 ± 0.0
1.634TyrAsp: 1.634 ± 0.976
3.268TyrGlu: 3.268 ± 2.652
3.268TyrPhe: 3.268 ± 1.953
1.634TyrGly: 1.634 ± 0.976
1.634TyrHis: 1.634 ± 0.976
3.268TyrIle: 3.268 ± 2.652
1.634TyrLys: 1.634 ± 0.976
0.0TyrLeu: 0.0 ± 0.0
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.634TyrPro: 1.634 ± 0.976
0.0TyrGln: 0.0 ± 0.0
1.634TyrArg: 1.634 ± 1.326
3.268TyrSer: 3.268 ± 2.652
0.0TyrThr: 0.0 ± 0.0
1.634TyrVal: 1.634 ± 0.976
0.0TyrTrp: 0.0 ± 0.0
1.634TyrTyr: 1.634 ± 0.976
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (613 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski