Amino acid dipepetide frequency for Porcine stool-associated circular virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.616AlaAla: 1.616 ± 0.991
0.0AlaCys: 0.0 ± 0.0
3.231AlaAsp: 3.231 ± 0.364
1.616AlaGlu: 1.616 ± 0.991
3.231AlaPhe: 3.231 ± 1.983
1.616AlaGly: 1.616 ± 0.991
3.231AlaHis: 3.231 ± 0.364
4.847AlaIle: 4.847 ± 1.719
3.231AlaLys: 3.231 ± 0.364
3.231AlaLeu: 3.231 ± 0.364
1.616AlaMet: 1.616 ± 1.355
4.847AlaAsn: 4.847 ± 0.627
1.616AlaPro: 1.616 ± 0.991
3.231AlaGln: 3.231 ± 0.364
1.616AlaArg: 1.616 ± 0.991
6.462AlaSer: 6.462 ± 0.728
4.847AlaThr: 4.847 ± 2.974
9.693AlaVal: 9.693 ± 1.255
3.231AlaTrp: 3.231 ± 2.711
3.231AlaTyr: 3.231 ± 1.983
0.0AlaXaa: 0.0 ± 0.0
Cys
1.616CysAla: 1.616 ± 1.355
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.616CysHis: 1.616 ± 0.991
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.616CysAsn: 1.616 ± 1.355
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.616CysArg: 1.616 ± 1.355
1.616CysSer: 1.616 ± 0.991
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.847AspAla: 4.847 ± 0.627
0.0AspCys: 0.0 ± 0.0
4.847AspAsp: 4.847 ± 2.974
1.616AspGlu: 1.616 ± 0.991
0.0AspPhe: 0.0 ± 0.0
6.462AspGly: 6.462 ± 5.422
3.231AspHis: 3.231 ± 0.364
3.231AspIle: 3.231 ± 0.364
1.616AspLys: 1.616 ± 1.355
3.231AspLeu: 3.231 ± 0.364
3.231AspMet: 3.231 ± 1.983
0.0AspAsn: 0.0 ± 0.0
8.078AspPro: 8.078 ± 0.263
0.0AspGln: 0.0 ± 0.0
4.847AspArg: 4.847 ± 1.719
1.616AspSer: 1.616 ± 0.991
8.078AspThr: 8.078 ± 0.263
8.078AspVal: 8.078 ± 2.083
0.0AspTrp: 0.0 ± 0.0
3.231AspTyr: 3.231 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
3.231GluAla: 3.231 ± 0.364
0.0GluCys: 0.0 ± 0.0
1.616GluAsp: 1.616 ± 1.355
3.231GluGlu: 3.231 ± 2.711
0.0GluPhe: 0.0 ± 0.0
1.616GluGly: 1.616 ± 1.355
1.616GluHis: 1.616 ± 1.355
0.0GluIle: 0.0 ± 0.0
1.616GluLys: 1.616 ± 1.355
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
1.616GluAsn: 1.616 ± 0.991
3.231GluPro: 3.231 ± 1.983
3.231GluGln: 3.231 ± 1.983
6.462GluArg: 6.462 ± 3.075
1.616GluSer: 1.616 ± 0.991
4.847GluThr: 4.847 ± 1.719
6.462GluVal: 6.462 ± 1.619
3.231GluTrp: 3.231 ± 2.711
3.231GluTyr: 3.231 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
6.462PheAla: 6.462 ± 3.966
0.0PheCys: 0.0 ± 0.0
1.616PheAsp: 1.616 ± 0.991
0.0PheGlu: 0.0 ± 0.0
1.616PhePhe: 1.616 ± 1.355
1.616PheGly: 1.616 ± 1.355
1.616PheHis: 1.616 ± 0.991
0.0PheIle: 0.0 ± 0.0
1.616PheLys: 1.616 ± 0.991
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
1.616PheGln: 1.616 ± 0.991
6.462PheArg: 6.462 ± 3.966
6.462PheSer: 6.462 ± 1.619
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
1.616PheTrp: 1.616 ± 0.991
1.616PheTyr: 1.616 ± 0.991
0.0PheXaa: 0.0 ± 0.0
Gly
4.847GlyAla: 4.847 ± 0.627
3.231GlyCys: 3.231 ± 1.983
3.231GlyAsp: 3.231 ± 0.364
6.462GlyGlu: 6.462 ± 0.728
6.462GlyPhe: 6.462 ± 3.966
3.231GlyGly: 3.231 ± 1.983
3.231GlyHis: 3.231 ± 2.711
6.462GlyIle: 6.462 ± 1.619
1.616GlyLys: 1.616 ± 1.355
8.078GlyLeu: 8.078 ± 0.263
1.616GlyMet: 1.616 ± 0.991
8.078GlyAsn: 8.078 ± 2.61
3.231GlyPro: 3.231 ± 0.364
3.231GlyGln: 3.231 ± 0.364
0.0GlyArg: 0.0 ± 0.0
1.616GlySer: 1.616 ± 0.991
3.231GlyThr: 3.231 ± 1.983
3.231GlyVal: 3.231 ± 0.364
3.231GlyTrp: 3.231 ± 0.364
1.616GlyTyr: 1.616 ± 1.355
0.0GlyXaa: 0.0 ± 0.0
His
1.616HisAla: 1.616 ± 0.991
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
6.462HisGly: 6.462 ± 1.619
0.0HisHis: 0.0 ± 0.0
3.231HisIle: 3.231 ± 2.711
1.616HisLys: 1.616 ± 1.355
3.231HisLeu: 3.231 ± 2.711
0.0HisMet: 0.0 ± 0.0
1.616HisAsn: 1.616 ± 0.991
1.616HisPro: 1.616 ± 0.991
1.616HisGln: 1.616 ± 0.991
0.0HisArg: 0.0 ± 0.0
1.616HisSer: 1.616 ± 1.355
3.231HisThr: 3.231 ± 1.983
0.0HisVal: 0.0 ± 0.0
1.616HisTrp: 1.616 ± 1.355
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.616IleCys: 1.616 ± 1.355
6.462IleAsp: 6.462 ± 0.728
3.231IleGlu: 3.231 ± 2.711
0.0IlePhe: 0.0 ± 0.0
9.693IleGly: 9.693 ± 3.602
1.616IleHis: 1.616 ± 0.991
4.847IleIle: 4.847 ± 4.066
3.231IleLys: 3.231 ± 2.711
4.847IleLeu: 4.847 ± 0.627
1.616IleMet: 1.616 ± 0.991
8.078IleAsn: 8.078 ± 2.61
3.231IlePro: 3.231 ± 0.364
1.616IleGln: 1.616 ± 1.355
3.231IleArg: 3.231 ± 2.711
3.231IleSer: 3.231 ± 0.364
0.0IleThr: 0.0 ± 0.0
1.616IleVal: 1.616 ± 0.991
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.847LysAla: 4.847 ± 1.719
1.616LysCys: 1.616 ± 1.355
3.231LysAsp: 3.231 ± 2.711
3.231LysGlu: 3.231 ± 2.711
1.616LysPhe: 1.616 ± 0.991
6.462LysGly: 6.462 ± 0.728
0.0LysHis: 0.0 ± 0.0
1.616LysIle: 1.616 ± 1.355
0.0LysLys: 0.0 ± 0.0
4.847LysLeu: 4.847 ± 4.066
1.616LysMet: 1.616 ± 0.991
0.0LysAsn: 0.0 ± 0.0
0.0LysPro: 0.0 ± 0.0
4.847LysGln: 4.847 ± 2.974
0.0LysArg: 0.0 ± 0.0
4.847LysSer: 4.847 ± 1.719
8.078LysThr: 8.078 ± 2.083
3.231LysVal: 3.231 ± 1.983
3.231LysTrp: 3.231 ± 2.711
1.616LysTyr: 1.616 ± 0.991
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
0.0LeuCys: 0.0 ± 0.0
4.847LeuAsp: 4.847 ± 0.627
1.616LeuGlu: 1.616 ± 1.355
1.616LeuPhe: 1.616 ± 1.355
4.847LeuGly: 4.847 ± 0.627
1.616LeuHis: 1.616 ± 0.991
3.231LeuIle: 3.231 ± 0.364
1.616LeuLys: 1.616 ± 1.355
3.231LeuLeu: 3.231 ± 0.364
0.0LeuMet: 0.0 ± 0.0
1.616LeuAsn: 1.616 ± 0.991
4.847LeuPro: 4.847 ± 2.974
6.462LeuGln: 6.462 ± 1.619
8.078LeuArg: 8.078 ± 2.083
3.231LeuSer: 3.231 ± 0.364
6.462LeuThr: 6.462 ± 3.075
4.847LeuVal: 4.847 ± 0.627
0.0LeuTrp: 0.0 ± 0.0
6.462LeuTyr: 6.462 ± 0.728
0.0LeuXaa: 0.0 ± 0.0
Met
1.616MetAla: 1.616 ± 0.991
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.616MetGlu: 1.616 ± 1.355
1.616MetPhe: 1.616 ± 0.991
3.231MetGly: 3.231 ± 1.983
0.0MetHis: 0.0 ± 0.0
3.231MetIle: 3.231 ± 1.983
0.0MetLys: 0.0 ± 0.0
4.847MetLeu: 4.847 ± 0.627
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.616MetPro: 1.616 ± 1.355
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
3.231MetThr: 3.231 ± 0.364
1.616MetVal: 1.616 ± 1.355
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.847AsnAla: 4.847 ± 2.974
0.0AsnCys: 0.0 ± 0.0
4.847AsnAsp: 4.847 ± 1.719
0.0AsnGlu: 0.0 ± 0.0
6.462AsnPhe: 6.462 ± 3.966
3.231AsnGly: 3.231 ± 1.983
3.231AsnHis: 3.231 ± 0.364
1.616AsnIle: 1.616 ± 0.991
3.231AsnLys: 3.231 ± 2.711
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
4.847AsnAsn: 4.847 ± 0.627
1.616AsnPro: 1.616 ± 0.991
1.616AsnGln: 1.616 ± 0.991
1.616AsnArg: 1.616 ± 0.991
0.0AsnSer: 0.0 ± 0.0
6.462AsnThr: 6.462 ± 0.728
3.231AsnVal: 3.231 ± 1.983
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.462ProAla: 6.462 ± 3.966
0.0ProCys: 0.0 ± 0.0
4.847ProAsp: 4.847 ± 0.627
3.231ProGlu: 3.231 ± 0.364
1.616ProPhe: 1.616 ± 0.991
1.616ProGly: 1.616 ± 0.991
0.0ProHis: 0.0 ± 0.0
1.616ProIle: 1.616 ± 0.991
4.847ProLys: 4.847 ± 1.719
6.462ProLeu: 6.462 ± 3.966
1.616ProMet: 1.616 ± 1.355
3.231ProAsn: 3.231 ± 1.983
1.616ProPro: 1.616 ± 0.991
1.616ProGln: 1.616 ± 0.991
6.462ProArg: 6.462 ± 3.075
0.0ProSer: 0.0 ± 0.0
3.231ProThr: 3.231 ± 1.983
3.231ProVal: 3.231 ± 1.983
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.231GlnAla: 3.231 ± 2.711
0.0GlnCys: 0.0 ± 0.0
8.078GlnAsp: 8.078 ± 2.083
1.616GlnGlu: 1.616 ± 0.991
3.231GlnPhe: 3.231 ± 1.983
1.616GlnGly: 1.616 ± 0.991
0.0GlnHis: 0.0 ± 0.0
6.462GlnIle: 6.462 ± 1.619
4.847GlnLys: 4.847 ± 0.627
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.757
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
4.847GlnSer: 4.847 ± 0.627
0.0GlnThr: 0.0 ± 0.0
8.078GlnVal: 8.078 ± 0.263
1.616GlnTrp: 1.616 ± 0.991
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.231ArgAla: 3.231 ± 2.711
0.0ArgCys: 0.0 ± 0.0
1.616ArgAsp: 1.616 ± 0.991
0.0ArgGlu: 0.0 ± 0.0
1.616ArgPhe: 1.616 ± 1.355
3.231ArgGly: 3.231 ± 0.364
0.0ArgHis: 0.0 ± 0.0
3.231ArgIle: 3.231 ± 2.711
8.078ArgLys: 8.078 ± 0.263
6.462ArgLeu: 6.462 ± 0.728
0.0ArgMet: 0.0 ± 0.0
1.616ArgAsn: 1.616 ± 1.355
6.462ArgPro: 6.462 ± 1.619
3.231ArgGln: 3.231 ± 2.711
0.0ArgArg: 0.0 ± 0.0
4.847ArgSer: 4.847 ± 0.627
3.231ArgThr: 3.231 ± 0.364
0.0ArgVal: 0.0 ± 0.0
3.231ArgTrp: 3.231 ± 2.711
1.616ArgTyr: 1.616 ± 1.355
0.0ArgXaa: 0.0 ± 0.0
Ser
4.847SerAla: 4.847 ± 0.627
0.0SerCys: 0.0 ± 0.0
6.462SerAsp: 6.462 ± 0.728
8.078SerGlu: 8.078 ± 0.263
0.0SerPhe: 0.0 ± 0.0
9.693SerGly: 9.693 ± 3.602
1.616SerHis: 1.616 ± 0.991
1.616SerIle: 1.616 ± 0.991
4.847SerLys: 4.847 ± 1.719
6.462SerLeu: 6.462 ± 1.619
1.616SerMet: 1.616 ± 0.991
0.0SerAsn: 0.0 ± 0.0
3.231SerPro: 3.231 ± 1.983
3.231SerGln: 3.231 ± 0.364
0.0SerArg: 0.0 ± 0.0
6.462SerSer: 6.462 ± 0.728
3.231SerThr: 3.231 ± 0.364
3.231SerVal: 3.231 ± 1.983
1.616SerTrp: 1.616 ± 1.355
1.616SerTyr: 1.616 ± 0.991
0.0SerXaa: 0.0 ± 0.0
Thr
3.231ThrAla: 3.231 ± 0.364
1.616ThrCys: 1.616 ± 1.355
1.616ThrAsp: 1.616 ± 0.991
4.847ThrGlu: 4.847 ± 2.974
0.0ThrPhe: 0.0 ± 0.0
3.231ThrGly: 3.231 ± 2.711
0.0ThrHis: 0.0 ± 0.0
3.231ThrIle: 3.231 ± 1.983
1.616ThrLys: 1.616 ± 1.355
1.616ThrLeu: 1.616 ± 0.991
3.231ThrMet: 3.231 ± 2.116
6.462ThrAsn: 6.462 ± 1.619
4.847ThrPro: 4.847 ± 0.627
1.616ThrGln: 1.616 ± 0.991
0.0ThrArg: 0.0 ± 0.0
9.693ThrSer: 9.693 ± 3.602
0.0ThrThr: 0.0 ± 0.0
8.078ThrVal: 8.078 ± 2.083
8.078ThrTrp: 8.078 ± 4.43
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.078ValAla: 8.078 ± 2.083
0.0ValCys: 0.0 ± 0.0
4.847ValAsp: 4.847 ± 2.974
1.616ValGlu: 1.616 ± 0.991
1.616ValPhe: 1.616 ± 0.991
6.462ValGly: 6.462 ± 3.966
4.847ValHis: 4.847 ± 1.719
4.847ValIle: 4.847 ± 0.627
6.462ValLys: 6.462 ± 0.728
3.231ValLeu: 3.231 ± 0.364
1.616ValMet: 1.616 ± 0.991
3.231ValAsn: 3.231 ± 0.364
6.462ValPro: 6.462 ± 1.619
3.231ValGln: 3.231 ± 2.711
3.231ValArg: 3.231 ± 0.364
6.462ValSer: 6.462 ± 3.966
1.616ValThr: 1.616 ± 1.355
16.155ValVal: 16.155 ± 0.527
4.847ValTrp: 4.847 ± 1.719
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.616TrpAla: 1.616 ± 0.991
0.0TrpCys: 0.0 ± 0.0
1.616TrpAsp: 1.616 ± 1.355
1.616TrpGlu: 1.616 ± 1.355
1.616TrpPhe: 1.616 ± 0.991
1.616TrpGly: 1.616 ± 1.355
0.0TrpHis: 0.0 ± 0.0
3.231TrpIle: 3.231 ± 2.711
3.231TrpLys: 3.231 ± 1.983
3.231TrpLeu: 3.231 ± 2.711
1.616TrpMet: 1.616 ± 1.355
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
3.231TrpGln: 3.231 ± 0.364
6.462TrpArg: 6.462 ± 5.422
0.0TrpSer: 0.0 ± 0.0
1.616TrpThr: 1.616 ± 1.355
3.231TrpVal: 3.231 ± 2.711
0.0TrpTrp: 0.0 ± 0.0
1.616TrpTyr: 1.616 ± 1.355
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
3.231TyrAsp: 3.231 ± 2.711
3.231TyrGlu: 3.231 ± 2.711
0.0TyrPhe: 0.0 ± 0.0
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
1.616TyrIle: 1.616 ± 1.355
1.616TyrLys: 1.616 ± 0.991
1.616TyrLeu: 1.616 ± 0.991
1.616TyrMet: 1.616 ± 0.991
0.0TyrAsn: 0.0 ± 0.0
0.0TyrPro: 0.0 ± 0.0
1.616TyrGln: 1.616 ± 0.991
1.616TyrArg: 1.616 ± 0.991
3.231TyrSer: 3.231 ± 0.364
1.616TyrThr: 1.616 ± 0.991
4.847TyrVal: 4.847 ± 0.627
0.0TyrTrp: 0.0 ± 0.0
1.616TyrTyr: 1.616 ± 0.991
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (620 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski