Amino acid dipepetide frequency for Porcine associated porprismacovirus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.295AlaAla: 3.295 ± 2.019
3.295AlaCys: 3.295 ± 0.508
1.647AlaAsp: 1.647 ± 1.01
0.0AlaGlu: 0.0 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
6.59AlaGly: 6.59 ± 4.039
1.647AlaHis: 1.647 ± 1.518
1.647AlaIle: 1.647 ± 1.518
4.942AlaLys: 4.942 ± 0.502
9.885AlaLeu: 9.885 ± 4.051
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
0.0AlaPro: 0.0 ± 0.0
1.647AlaGln: 1.647 ± 1.01
0.0AlaArg: 0.0 ± 0.0
4.942AlaSer: 4.942 ± 0.502
1.647AlaThr: 1.647 ± 1.01
4.942AlaVal: 4.942 ± 3.029
0.0AlaTrp: 0.0 ± 0.0
3.295AlaTyr: 3.295 ± 3.035
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
3.295CysAsp: 3.295 ± 2.019
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.647CysGly: 1.647 ± 1.518
1.647CysHis: 1.647 ± 1.01
1.647CysIle: 1.647 ± 1.518
1.647CysLys: 1.647 ± 1.518
1.647CysLeu: 1.647 ± 1.01
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.647CysArg: 1.647 ± 1.518
1.647CysSer: 1.647 ± 1.01
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.647CysTyr: 1.647 ± 1.518
0.0CysXaa: 0.0 ± 0.0
Asp
4.942AspAla: 4.942 ± 0.502
1.647AspCys: 1.647 ± 1.518
0.0AspAsp: 0.0 ± 0.0
1.647AspGlu: 1.647 ± 1.518
1.647AspPhe: 1.647 ± 1.01
8.237AspGly: 8.237 ± 0.006
1.647AspHis: 1.647 ± 1.518
1.647AspIle: 1.647 ± 1.518
1.647AspLys: 1.647 ± 1.518
4.942AspLeu: 4.942 ± 3.029
1.647AspMet: 1.647 ± 1.01
0.0AspAsn: 0.0 ± 0.0
4.942AspPro: 4.942 ± 3.029
3.295AspGln: 3.295 ± 2.019
6.59AspArg: 6.59 ± 3.543
4.942AspSer: 4.942 ± 0.502
3.295AspThr: 3.295 ± 3.035
1.647AspVal: 1.647 ± 1.518
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.942GluAla: 4.942 ± 0.502
1.647GluCys: 1.647 ± 1.518
1.647GluAsp: 1.647 ± 1.518
3.295GluGlu: 3.295 ± 0.508
3.295GluPhe: 3.295 ± 0.508
4.942GluGly: 4.942 ± 2.026
1.647GluHis: 1.647 ± 1.01
3.295GluIle: 3.295 ± 2.019
0.0GluLys: 0.0 ± 0.0
4.942GluLeu: 4.942 ± 3.029
1.647GluMet: 1.647 ± 1.01
1.647GluAsn: 1.647 ± 1.01
1.647GluPro: 1.647 ± 1.01
1.647GluGln: 1.647 ± 1.518
3.295GluArg: 3.295 ± 3.035
3.295GluSer: 3.295 ± 0.508
6.59GluThr: 6.59 ± 1.016
1.647GluVal: 1.647 ± 1.518
1.647GluTrp: 1.647 ± 1.518
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.647PheAsp: 1.647 ± 1.01
4.942PheGlu: 4.942 ± 0.502
3.295PhePhe: 3.295 ± 0.508
3.295PheGly: 3.295 ± 2.019
1.647PheHis: 1.647 ± 1.01
1.647PheIle: 1.647 ± 1.01
1.647PheLys: 1.647 ± 1.01
0.0PheLeu: 0.0 ± 0.0
1.647PheMet: 1.647 ± 0.942
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
6.59PheArg: 6.59 ± 1.511
3.295PheSer: 3.295 ± 2.019
3.295PheThr: 3.295 ± 0.508
1.647PheVal: 1.647 ± 1.01
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.59GlyAla: 6.59 ± 1.511
3.295GlyCys: 3.295 ± 2.019
1.647GlyAsp: 1.647 ± 1.01
4.942GlyGlu: 4.942 ± 0.502
1.647GlyPhe: 1.647 ± 1.01
3.295GlyGly: 3.295 ± 2.019
0.0GlyHis: 0.0 ± 0.0
4.942GlyIle: 4.942 ± 3.029
6.59GlyLys: 6.59 ± 3.543
8.237GlyLeu: 8.237 ± 0.006
1.647GlyMet: 1.647 ± 1.01
3.295GlyAsn: 3.295 ± 0.508
0.0GlyPro: 0.0 ± 0.0
1.647GlyGln: 1.647 ± 1.518
0.0GlyArg: 0.0 ± 0.0
6.59GlySer: 6.59 ± 4.039
6.59GlyThr: 6.59 ± 4.039
6.59GlyVal: 6.59 ± 1.016
1.647GlyTrp: 1.647 ± 1.518
1.647GlyTyr: 1.647 ± 1.518
0.0GlyXaa: 0.0 ± 0.0
His
1.647HisAla: 1.647 ± 1.518
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.647HisPhe: 1.647 ± 1.01
3.295HisGly: 3.295 ± 2.019
0.0HisHis: 0.0 ± 0.0
3.295HisIle: 3.295 ± 3.035
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.647HisPro: 1.647 ± 1.01
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
4.942HisSer: 4.942 ± 0.502
1.647HisThr: 1.647 ± 1.01
1.647HisVal: 1.647 ± 1.01
1.647HisTrp: 1.647 ± 1.518
1.647HisTyr: 1.647 ± 1.01
0.0HisXaa: 0.0 ± 0.0
Ile
1.647IleAla: 1.647 ± 1.518
0.0IleCys: 0.0 ± 0.0
4.942IleAsp: 4.942 ± 4.553
9.885IleGlu: 9.885 ± 4.051
1.647IlePhe: 1.647 ± 1.01
1.647IleGly: 1.647 ± 1.518
1.647IleHis: 1.647 ± 1.01
6.59IleIle: 6.59 ± 3.543
1.647IleLys: 1.647 ± 1.518
6.59IleLeu: 6.59 ± 4.039
1.647IleMet: 1.647 ± 1.518
0.0IleAsn: 0.0 ± 0.0
8.237IlePro: 8.237 ± 2.534
3.295IleGln: 3.295 ± 0.508
4.942IleArg: 4.942 ± 4.553
0.0IleSer: 0.0 ± 0.0
1.647IleThr: 1.647 ± 1.01
0.0IleVal: 0.0 ± 0.0
0.0IleTrp: 0.0 ± 0.0
3.295IleTyr: 3.295 ± 0.508
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
0.0LysCys: 0.0 ± 0.0
4.942LysAsp: 4.942 ± 2.026
3.295LysGlu: 3.295 ± 3.035
1.647LysPhe: 1.647 ± 1.01
4.942LysGly: 4.942 ± 0.502
3.295LysHis: 3.295 ± 3.035
0.0LysIle: 0.0 ± 0.0
1.647LysLys: 1.647 ± 1.518
4.942LysLeu: 4.942 ± 4.553
0.0LysMet: 0.0 ± 0.0
1.647LysAsn: 1.647 ± 1.518
1.647LysPro: 1.647 ± 1.01
0.0LysGln: 0.0 ± 0.0
1.647LysArg: 1.647 ± 1.518
1.647LysSer: 1.647 ± 1.518
3.295LysThr: 3.295 ± 2.019
4.942LysVal: 4.942 ± 0.502
3.295LysTrp: 3.295 ± 3.035
4.942LysTyr: 4.942 ± 3.029
0.0LysXaa: 0.0 ± 0.0
Leu
3.295LeuAla: 3.295 ± 0.508
0.0LeuCys: 0.0 ± 0.0
3.295LeuAsp: 3.295 ± 0.508
0.0LeuGlu: 0.0 ± 0.0
3.295LeuPhe: 3.295 ± 2.019
4.942LeuGly: 4.942 ± 3.029
1.647LeuHis: 1.647 ± 1.01
4.942LeuIle: 4.942 ± 2.026
1.647LeuLys: 1.647 ± 1.01
3.295LeuLeu: 3.295 ± 2.019
1.647LeuMet: 1.647 ± 1.518
4.942LeuAsn: 4.942 ± 3.029
6.59LeuPro: 6.59 ± 4.039
8.237LeuGln: 8.237 ± 0.006
3.295LeuArg: 3.295 ± 0.508
6.59LeuSer: 6.59 ± 1.511
6.59LeuThr: 6.59 ± 4.039
1.647LeuVal: 1.647 ± 1.518
1.647LeuTrp: 1.647 ± 1.518
3.295LeuTyr: 3.295 ± 0.508
0.0LeuXaa: 0.0 ± 0.0
Met
1.647MetAla: 1.647 ± 1.01
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.647MetGly: 1.647 ± 1.01
0.0MetHis: 0.0 ± 0.0
1.647MetIle: 1.647 ± 1.518
0.0MetLys: 0.0 ± 0.0
3.295MetLeu: 3.295 ± 2.019
3.295MetMet: 3.295 ± 3.035
1.647MetAsn: 1.647 ± 1.518
1.647MetPro: 1.647 ± 1.518
4.942MetGln: 4.942 ± 4.553
0.0MetArg: 0.0 ± 0.0
1.647MetSer: 1.647 ± 1.01
3.295MetThr: 3.295 ± 3.035
4.942MetVal: 4.942 ± 3.029
0.0MetTrp: 0.0 ± 0.0
3.295MetTyr: 3.295 ± 0.508
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
4.942AsnAsp: 4.942 ± 4.553
1.647AsnGlu: 1.647 ± 1.01
1.647AsnPhe: 1.647 ± 1.01
4.942AsnGly: 4.942 ± 0.502
0.0AsnHis: 0.0 ± 0.0
1.647AsnIle: 1.647 ± 1.518
0.0AsnLys: 0.0 ± 0.0
1.647AsnLeu: 1.647 ± 1.518
3.295AsnMet: 3.295 ± 2.019
0.0AsnAsn: 0.0 ± 0.0
3.295AsnPro: 3.295 ± 2.019
1.647AsnGln: 1.647 ± 1.518
1.647AsnArg: 1.647 ± 1.01
0.0AsnSer: 0.0 ± 0.0
3.295AsnThr: 3.295 ± 0.508
3.295AsnVal: 3.295 ± 0.508
0.0AsnTrp: 0.0 ± 0.0
1.647AsnTyr: 1.647 ± 1.01
0.0AsnXaa: 0.0 ± 0.0
Pro
3.295ProAla: 3.295 ± 2.019
0.0ProCys: 0.0 ± 0.0
3.295ProAsp: 3.295 ± 2.019
1.647ProGlu: 1.647 ± 1.01
0.0ProPhe: 0.0 ± 0.0
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
3.295ProIle: 3.295 ± 2.019
1.647ProLys: 1.647 ± 1.518
6.59ProLeu: 6.59 ± 4.039
1.647ProMet: 1.647 ± 1.01
1.647ProAsn: 1.647 ± 1.518
3.295ProPro: 3.295 ± 0.508
1.647ProGln: 1.647 ± 1.518
4.942ProArg: 4.942 ± 2.026
8.237ProSer: 8.237 ± 5.049
8.237ProThr: 8.237 ± 0.006
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
4.942ProTyr: 4.942 ± 0.502
0.0ProXaa: 0.0 ± 0.0
Gln
4.942GlnAla: 4.942 ± 2.026
0.0GlnCys: 0.0 ± 0.0
3.295GlnAsp: 3.295 ± 0.508
3.295GlnGlu: 3.295 ± 2.019
1.647GlnPhe: 1.647 ± 1.01
1.647GlnGly: 1.647 ± 1.01
0.0GlnHis: 0.0 ± 0.0
1.647GlnIle: 1.647 ± 1.518
1.647GlnLys: 1.647 ± 1.518
4.942GlnLeu: 4.942 ± 0.502
0.0GlnMet: 0.0 ± 0.0
4.942GlnAsn: 4.942 ± 0.502
1.647GlnPro: 1.647 ± 1.01
3.295GlnGln: 3.295 ± 0.508
1.647GlnArg: 1.647 ± 1.518
1.647GlnSer: 1.647 ± 1.518
1.647GlnThr: 1.647 ± 1.01
4.942GlnVal: 4.942 ± 3.029
1.647GlnTrp: 1.647 ± 1.518
1.647GlnTyr: 1.647 ± 1.518
0.0GlnXaa: 0.0 ± 0.0
Arg
1.647ArgAla: 1.647 ± 1.518
0.0ArgCys: 0.0 ± 0.0
1.647ArgAsp: 1.647 ± 1.01
3.295ArgGlu: 3.295 ± 0.508
1.647ArgPhe: 1.647 ± 1.01
1.647ArgGly: 1.647 ± 1.518
1.647ArgHis: 1.647 ± 1.01
3.295ArgIle: 3.295 ± 3.035
6.59ArgLys: 6.59 ± 1.511
3.295ArgLeu: 3.295 ± 2.019
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
6.59ArgPro: 6.59 ± 3.543
0.0ArgGln: 0.0 ± 0.0
0.0ArgArg: 0.0 ± 0.0
1.647ArgSer: 1.647 ± 1.518
4.942ArgThr: 4.942 ± 4.553
1.647ArgVal: 1.647 ± 1.01
1.647ArgTrp: 1.647 ± 1.518
6.59ArgTyr: 6.59 ± 3.543
0.0ArgXaa: 0.0 ± 0.0
Ser
4.942SerAla: 4.942 ± 0.502
1.647SerCys: 1.647 ± 1.518
6.59SerAsp: 6.59 ± 1.511
4.942SerGlu: 4.942 ± 0.502
4.942SerPhe: 4.942 ± 0.502
9.885SerGly: 9.885 ± 6.058
0.0SerHis: 0.0 ± 0.0
3.295SerIle: 3.295 ± 0.508
4.942SerLys: 4.942 ± 2.026
3.295SerLeu: 3.295 ± 2.019
3.295SerMet: 3.295 ± 0.508
4.942SerAsn: 4.942 ± 2.026
3.295SerPro: 3.295 ± 2.019
0.0SerGln: 0.0 ± 0.0
0.0SerArg: 0.0 ± 0.0
6.59SerSer: 6.59 ± 1.511
1.647SerThr: 1.647 ± 1.01
9.885SerVal: 9.885 ± 6.058
4.942SerTrp: 4.942 ± 2.026
4.942SerTyr: 4.942 ± 3.029
0.0SerXaa: 0.0 ± 0.0
Thr
3.295ThrAla: 3.295 ± 0.508
3.295ThrCys: 3.295 ± 2.019
0.0ThrAsp: 0.0 ± 0.0
3.295ThrGlu: 3.295 ± 0.508
1.647ThrPhe: 1.647 ± 1.518
4.942ThrGly: 4.942 ± 2.026
1.647ThrHis: 1.647 ± 1.01
8.237ThrIle: 8.237 ± 0.006
1.647ThrLys: 1.647 ± 1.01
0.0ThrLeu: 0.0 ± 0.0
1.647ThrMet: 1.647 ± 1.518
4.942ThrAsn: 4.942 ± 0.502
3.295ThrPro: 3.295 ± 0.508
6.59ThrGln: 6.59 ± 4.039
6.59ThrArg: 6.59 ± 1.511
6.59ThrSer: 6.59 ± 1.511
4.942ThrThr: 4.942 ± 3.029
8.237ThrVal: 8.237 ± 0.006
1.647ThrTrp: 1.647 ± 1.01
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
6.59ValAsp: 6.59 ± 1.511
3.295ValGlu: 3.295 ± 2.019
0.0ValPhe: 0.0 ± 0.0
1.647ValGly: 1.647 ± 1.518
1.647ValHis: 1.647 ± 1.01
0.0ValIle: 0.0 ± 0.0
3.295ValLys: 3.295 ± 3.035
1.647ValLeu: 1.647 ± 1.518
3.295ValMet: 3.295 ± 3.035
3.295ValAsn: 3.295 ± 0.508
4.942ValPro: 4.942 ± 3.029
3.295ValGln: 3.295 ± 2.019
1.647ValArg: 1.647 ± 1.01
13.18ValSer: 13.18 ± 5.55
6.59ValThr: 6.59 ± 1.511
4.942ValVal: 4.942 ± 2.026
1.647ValTrp: 1.647 ± 1.518
8.237ValTyr: 8.237 ± 5.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.647TrpCys: 1.647 ± 1.518
0.0TrpAsp: 0.0 ± 0.0
1.647TrpGlu: 1.647 ± 1.518
1.647TrpPhe: 1.647 ± 1.518
0.0TrpGly: 0.0 ± 0.0
1.647TrpHis: 1.647 ± 1.01
3.295TrpIle: 3.295 ± 3.035
1.647TrpLys: 1.647 ± 1.518
1.647TrpLeu: 1.647 ± 1.01
1.647TrpMet: 1.647 ± 1.518
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.647TrpGln: 1.647 ± 1.518
1.647TrpArg: 1.647 ± 1.518
1.647TrpSer: 1.647 ± 1.518
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.647TrpTyr: 1.647 ± 1.518
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.295TyrAla: 3.295 ± 2.019
0.0TyrCys: 0.0 ± 0.0
4.942TyrAsp: 4.942 ± 2.026
1.647TyrGlu: 1.647 ± 1.518
3.295TyrPhe: 3.295 ± 2.019
1.647TyrGly: 1.647 ± 1.518
1.647TyrHis: 1.647 ± 1.518
3.295TyrIle: 3.295 ± 3.035
4.942TyrLys: 4.942 ± 0.502
0.0TyrLeu: 0.0 ± 0.0
3.295TyrMet: 3.295 ± 0.564
1.647TyrAsn: 1.647 ± 1.01
1.647TyrPro: 1.647 ± 1.01
3.295TyrGln: 3.295 ± 2.019
1.647TyrArg: 1.647 ± 1.01
4.942TyrSer: 4.942 ± 2.026
3.295TyrThr: 3.295 ± 2.019
6.59TyrVal: 6.59 ± 1.016
0.0TyrTrp: 0.0 ± 0.0
6.59TyrTyr: 6.59 ± 4.039
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (608 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski