Amino acid dipeptide frequency for Penicillium aurantiogriseum partitivirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
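As an illustration, dipeptide frequencies can be computed by sliding a window of length two along each protein sequence. The sketch below is a minimal example and not the Proteome-pI implementation: the function name, the one-letter input alphabet, the mapping of non-standard residues to X (Xaa) and the normalisation by the total number of dipeptides are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Hypothetical helper: pairs never span two proteins, and any residue
    outside the 20 standard ones is counted as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2 within a single protein.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not taken from this proteome.
toy = ["MKTAAS", "AASB"]
freqs = dipeptide_frequencies(toy)
```

For the toy input there are eight dipeptides in total, so a pair seen twice (for example AA) has a frequency of 250 per mille.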

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
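The uniform expectation of 0.25% corresponds to 2.5‰, which makes it easy to compare against the tabulated values. The sketch below classifies a few values from the table against that baseline. The function name and the strict greater-than / less-than thresholds are assumptions; the exact rule used for the colouring is not stated here.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000 / N_STANDARD**2  # 2.5


def classify(freq_per_mille, expected=expected_per_mille):
    """Label a per-mille frequency relative to the uniform expectation.

    Hypothetical rule: anything above the baseline is "over",
    anything below is "under".
    """
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"SerAla": 13.333, "AlaAla": 7.179, "CysCys": 1.026, "AlaXaa": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}

# Converting a per-mille value back to a plain fraction.
ser_ala_fraction = examples["SerAla"] * 1e-3
```

Under this rule SerAla and AlaAla come out as overrepresented, while CysCys and AlaXaa come out as underrepresented.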

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue follows the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.179 ± 5.476
AlaCys: 1.026 ± 0.782
AlaAsp: 2.051 ± 1.564
AlaGlu: 6.154 ± 0.437
AlaPhe: 5.128 ± 3.183
AlaGly: 8.205 ± 0.582
AlaHis: 2.051 ± 1.564
AlaIle: 3.077 ± 2.347
AlaLys: 9.231 ± 2.892
AlaLeu: 2.051 ± 1.564
AlaMet: 3.077 ± 0.491
AlaAsn: 1.026 ± 0.782
AlaPro: 8.205 ± 2.001
AlaGln: 3.077 ± 0.928
AlaArg: 3.077 ± 0.928
AlaSer: 9.231 ± 5.621
AlaThr: 5.128 ± 2.492
AlaVal: 2.051 ± 1.564
AlaTrp: 1.026 ± 0.782
AlaTyr: 2.051 ± 0.146
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 1.026 ± 0.637
CysAsp: 0.0 ± 0.0
CysGlu: 3.077 ± 1.91
CysPhe: 1.026 ± 0.782
CysGly: 1.026 ± 0.637
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.051 ± 0.146
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.026 ± 0.637
CysGln: 1.026 ± 0.637
CysArg: 1.026 ± 0.637
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.026 ± 0.782
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.077 ± 0.928
AspCys: 1.026 ± 0.637
AspAsp: 7.179 ± 2.638
AspGlu: 0.0 ± 0.0
AspPhe: 6.154 ± 0.437
AspGly: 5.128 ± 1.765
AspHis: 0.0 ± 0.0
AspIle: 3.077 ± 1.91
AspLys: 2.051 ± 0.146
AspLeu: 3.077 ± 0.928
AspMet: 0.0 ± 0.0
AspAsn: 2.051 ± 1.564
AspPro: 4.103 ± 2.547
AspGln: 3.077 ± 0.491
AspArg: 0.0 ± 0.0
AspSer: 6.154 ± 0.437
AspThr: 4.103 ± 0.291
AspVal: 2.051 ± 1.273
AspTrp: 3.077 ± 0.928
AspTyr: 2.051 ± 0.146
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.051 ± 0.146
GluCys: 1.026 ± 0.637
GluAsp: 2.051 ± 1.273
GluGlu: 4.103 ± 1.128
GluPhe: 6.154 ± 1.855
GluGly: 4.103 ± 2.547
GluHis: 0.0 ± 0.0
GluIle: 5.128 ± 0.346
GluLys: 5.128 ± 0.346
GluLeu: 0.0 ± 0.0
GluMet: 2.051 ± 0.146
GluAsn: 0.0 ± 0.0
GluPro: 5.128 ± 0.346
GluGln: 2.051 ± 0.146
GluArg: 3.077 ± 0.491
GluSer: 3.077 ± 0.491
GluThr: 3.077 ± 0.491
GluVal: 5.128 ± 0.346
GluTrp: 3.077 ± 1.91
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.128 ± 2.492
PheCys: 1.026 ± 0.637
PheAsp: 4.103 ± 0.291
PheGlu: 1.026 ± 0.782
PhePhe: 2.051 ± 0.146
PheGly: 3.077 ± 0.928
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 3.077 ± 1.91
PheLeu: 4.103 ± 1.71
PheMet: 2.051 ± 1.273
PheAsn: 2.051 ± 0.146
PhePro: 4.103 ± 1.71
PheGln: 3.077 ± 0.491
PheArg: 3.077 ± 1.91
PheSer: 8.205 ± 0.582
PheThr: 7.179 ± 1.219
PheVal: 3.077 ± 0.928
PheTrp: 1.026 ± 0.637
PheTyr: 2.051 ± 1.273
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.051 ± 0.146
GlyCys: 0.0 ± 0.0
GlyAsp: 5.128 ± 1.073
GlyGlu: 5.128 ± 0.346
GlyPhe: 3.077 ± 0.491
GlyGly: 4.103 ± 1.128
GlyHis: 2.051 ± 0.146
GlyIle: 0.0 ± 0.0
GlyLys: 6.154 ± 0.437
GlyLeu: 5.128 ± 0.346
GlyMet: 5.128 ± 1.493
GlyAsn: 0.0 ± 0.0
GlyPro: 0.0 ± 0.0
GlyGln: 3.077 ± 1.91
GlyArg: 3.077 ± 0.491
GlySer: 7.179 ± 0.2
GlyThr: 4.103 ± 0.291
GlyVal: 1.026 ± 0.637
GlyTrp: 2.051 ± 1.273
GlyTyr: 4.103 ± 0.291
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.026 ± 0.782
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 2.051 ± 1.273
HisPhe: 2.051 ± 0.146
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 3.077 ± 0.491
HisMet: 1.026 ± 0.637
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 2.051 ± 1.273
HisSer: 0.0 ± 0.0
HisThr: 3.077 ± 0.928
HisVal: 3.077 ± 2.347
HisTrp: 0.0 ± 0.0
HisTyr: 1.026 ± 0.637
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.077 ± 0.491
IleCys: 0.0 ± 0.0
IleAsp: 3.077 ± 1.91
IleGlu: 3.077 ± 0.491
IlePhe: 1.026 ± 0.637
IleGly: 2.051 ± 0.146
IleHis: 1.026 ± 0.782
IleIle: 2.051 ± 1.564
IleLys: 1.026 ± 0.782
IleLeu: 4.103 ± 1.128
IleMet: 1.026 ± 0.782
IleAsn: 0.0 ± 0.0
IlePro: 1.026 ± 0.782
IleGln: 0.0 ± 0.0
IleArg: 1.026 ± 0.637
IleSer: 2.051 ± 0.146
IleThr: 0.0 ± 0.0
IleVal: 1.026 ± 0.782
IleTrp: 0.0 ± 0.0
IleTyr: 6.154 ± 3.82
IleXaa: 0.0 ± 0.0

Lys
LysAla: 8.205 ± 0.837
LysCys: 1.026 ± 0.637
LysAsp: 3.077 ± 0.491
LysGlu: 2.051 ± 0.146
LysPhe: 3.077 ± 0.491
LysGly: 2.051 ± 1.273
LysHis: 0.0 ± 0.0
LysIle: 1.026 ± 0.782
LysLys: 2.051 ± 1.273
LysLeu: 4.103 ± 2.547
LysMet: 1.026 ± 0.637
LysAsn: 1.026 ± 0.782
LysPro: 6.154 ± 1.855
LysGln: 3.077 ± 0.491
LysArg: 5.128 ± 0.346
LysSer: 5.128 ± 0.346
LysThr: 3.077 ± 0.491
LysVal: 2.051 ± 1.273
LysTrp: 1.026 ± 0.637
LysTyr: 1.026 ± 0.637
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.231 ± 1.364
LeuCys: 0.0 ± 0.0
LeuAsp: 5.128 ± 0.346
LeuGlu: 4.103 ± 0.291
LeuPhe: 5.128 ± 1.073
LeuGly: 5.128 ± 1.765
LeuHis: 1.026 ± 0.637
LeuIle: 4.103 ± 2.547
LeuLys: 4.103 ± 0.291
LeuLeu: 6.154 ± 1.855
LeuMet: 2.051 ± 1.273
LeuAsn: 2.051 ± 0.146
LeuPro: 4.103 ± 0.291
LeuGln: 0.0 ± 0.0
LeuArg: 8.205 ± 2.256
LeuSer: 5.128 ± 3.911
LeuThr: 3.077 ± 0.928
LeuVal: 5.128 ± 0.346
LeuTrp: 1.026 ± 0.637
LeuTyr: 2.051 ± 1.273
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.051 ± 0.146
MetCys: 0.0 ± 0.0
MetAsp: 1.026 ± 0.637
MetGlu: 1.026 ± 0.637
MetPhe: 1.026 ± 0.637
MetGly: 2.051 ± 0.146
MetHis: 0.0 ± 0.0
MetIle: 2.051 ± 0.146
MetLys: 1.026 ± 0.637
MetLeu: 3.077 ± 1.91
MetMet: 0.0 ± 0.0
MetAsn: 2.051 ± 0.146
MetPro: 3.077 ± 0.928
MetGln: 0.0 ± 0.0
MetArg: 2.051 ± 1.273
MetSer: 3.077 ± 0.928
MetThr: 1.026 ± 0.782
MetVal: 1.026 ± 0.637
MetTrp: 1.026 ± 0.637
MetTyr: 3.077 ± 0.491
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.077 ± 2.347
AsnCys: 0.0 ± 0.0
AsnAsp: 2.051 ± 0.146
AsnGlu: 1.026 ± 0.782
AsnPhe: 0.0 ± 0.0
AsnGly: 1.026 ± 0.637
AsnHis: 0.0 ± 0.0
AsnIle: 1.026 ± 0.637
AsnLys: 0.0 ± 0.0
AsnLeu: 2.051 ± 1.273
AsnMet: 1.026 ± 0.782
AsnAsn: 0.0 ± 0.0
AsnPro: 2.051 ± 0.146
AsnGln: 1.026 ± 0.782
AsnArg: 2.051 ± 1.564
AsnSer: 2.051 ± 1.273
AsnThr: 2.051 ± 1.273
AsnVal: 2.051 ± 0.146
AsnTrp: 1.026 ± 0.782
AsnTyr: 2.051 ± 0.146
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 8.205 ± 3.42
ProCys: 3.077 ± 1.91
ProAsp: 1.026 ± 0.637
ProGlu: 5.128 ± 1.765
ProPhe: 0.0 ± 0.0
ProGly: 3.077 ± 0.928
ProHis: 1.026 ± 0.782
ProIle: 1.026 ± 0.637
ProLys: 3.077 ± 0.928
ProLeu: 3.077 ± 0.491
ProMet: 2.051 ± 0.146
ProAsn: 3.077 ± 0.491
ProPro: 2.051 ± 0.146
ProGln: 2.051 ± 1.564
ProArg: 3.077 ± 0.491
ProSer: 8.205 ± 4.839
ProThr: 6.154 ± 3.82
ProVal: 4.103 ± 1.71
ProTrp: 1.026 ± 0.637
ProTyr: 1.026 ± 0.637
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.051 ± 0.146
GlnCys: 0.0 ± 0.0
GlnAsp: 2.051 ± 1.273
GlnGlu: 2.051 ± 1.273
GlnPhe: 4.103 ± 0.291
GlnGly: 3.077 ± 0.928
GlnHis: 1.026 ± 0.637
GlnIle: 1.026 ± 0.782
GlnLys: 4.103 ± 2.547
GlnLeu: 5.128 ± 3.911
GlnMet: 3.077 ± 0.928
GlnAsn: 1.026 ± 0.637
GlnPro: 2.051 ± 1.564
GlnGln: 2.051 ± 0.146
GlnArg: 2.051 ± 0.146
GlnSer: 2.051 ± 0.146
GlnThr: 1.026 ± 0.782
GlnVal: 0.0 ± 0.0
GlnTrp: 1.026 ± 0.782
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.077 ± 2.347
ArgCys: 1.026 ± 0.782
ArgAsp: 7.179 ± 1.619
ArgGlu: 5.128 ± 3.183
ArgPhe: 5.128 ± 2.492
ArgGly: 3.077 ± 1.91
ArgHis: 1.026 ± 0.782
ArgIle: 1.026 ± 0.782
ArgLys: 1.026 ± 0.637
ArgLeu: 5.128 ± 0.346
ArgMet: 2.051 ± 0.958
ArgAsn: 2.051 ± 1.273
ArgPro: 3.077 ± 1.91
ArgGln: 3.077 ± 0.928
ArgArg: 5.128 ± 0.346
ArgSer: 6.154 ± 0.982
ArgThr: 1.026 ± 0.782
ArgVal: 3.077 ± 0.491
ArgTrp: 2.051 ± 1.273
ArgTyr: 2.051 ± 1.273
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 13.333 ± 1.655
SerCys: 0.0 ± 0.0
SerAsp: 2.051 ± 1.564
SerGlu: 4.103 ± 0.291
SerPhe: 3.077 ± 0.491
SerGly: 6.154 ± 1.855
SerHis: 2.051 ± 1.273
SerIle: 3.077 ± 0.491
SerLys: 5.128 ± 1.073
SerLeu: 6.154 ± 0.982
SerMet: 0.0 ± 0.0
SerAsn: 3.077 ± 0.928
SerPro: 2.051 ± 1.273
SerGln: 7.179 ± 4.057
SerArg: 4.103 ± 1.71
SerSer: 7.179 ± 0.2
SerThr: 9.231 ± 1.364
SerVal: 5.128 ± 1.073
SerTrp: 2.051 ± 0.146
SerTyr: 1.026 ± 0.637
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.077 ± 0.928
ThrCys: 1.026 ± 0.782
ThrAsp: 4.103 ± 0.291
ThrGlu: 5.128 ± 2.492
ThrPhe: 6.154 ± 1.855
ThrGly: 4.103 ± 1.71
ThrHis: 2.051 ± 0.146
ThrIle: 0.0 ± 0.0
ThrLys: 4.103 ± 2.547
ThrLeu: 4.103 ± 1.128
ThrMet: 0.0 ± 0.0
ThrAsn: 1.026 ± 0.782
ThrPro: 4.103 ± 0.291
ThrGln: 3.077 ± 1.91
ThrArg: 6.154 ± 0.437
ThrSer: 3.077 ± 0.491
ThrThr: 6.154 ± 1.855
ThrVal: 5.128 ± 2.492
ThrTrp: 1.026 ± 0.637
ThrTyr: 3.077 ± 0.491
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.051 ± 1.564
ValCys: 0.0 ± 0.0
ValAsp: 2.051 ± 0.146
ValGlu: 1.026 ± 0.637
ValPhe: 2.051 ± 1.564
ValGly: 1.026 ± 0.637
ValHis: 3.077 ± 0.491
ValIle: 3.077 ± 1.91
ValLys: 2.051 ± 1.564
ValLeu: 5.128 ± 0.346
ValMet: 2.051 ± 0.146
ValAsn: 4.103 ± 0.291
ValPro: 5.128 ± 1.073
ValGln: 2.051 ± 1.564
ValArg: 2.051 ± 0.146
ValSer: 4.103 ± 0.291
ValThr: 3.077 ± 2.347
ValVal: 5.128 ± 2.492
ValTrp: 3.077 ± 1.91
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.026 ± 0.782
TrpCys: 1.026 ± 0.637
TrpAsp: 4.103 ± 0.291
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.051 ± 1.273
TrpGly: 0.0 ± 0.0
TrpHis: 1.026 ± 0.637
TrpIle: 1.026 ± 0.637
TrpLys: 0.0 ± 0.0
TrpLeu: 6.154 ± 0.982
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.077 ± 0.491
TrpSer: 1.026 ± 0.637
TrpThr: 3.077 ± 0.491
TrpVal: 1.026 ± 0.637
TrpTrp: 1.026 ± 0.637
TrpTyr: 1.026 ± 0.637
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 5.128 ± 0.346
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.026 ± 0.637
TyrPhe: 1.026 ± 0.637
TyrGly: 4.103 ± 1.128
TyrHis: 1.026 ± 0.637
TyrIle: 1.026 ± 0.637
TyrLys: 2.051 ± 1.273
TyrLeu: 3.077 ± 0.928
TyrMet: 1.026 ± 0.637
TyrAsn: 1.026 ± 0.637
TyrPro: 4.103 ± 1.128
TyrGln: 0.0 ± 0.0
TyrArg: 4.103 ± 1.128
TyrSer: 3.077 ± 1.91
TyrThr: 1.026 ± 0.637
TyrVal: 1.026 ± 0.637
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (976 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
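To make the idea of a protein-level bootstrap concrete, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide in each replicate, and reports the spread of those estimates. It is only an illustration under stated assumptions: the function names are hypothetical, the error is taken to be the population standard deviation over replicates, and the input is assumed to be one-letter sequences. The exact procedure behind the tabulated errors is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread (in per mille) of one dipeptide frequency over bootstrap replicates.

    Hypothetical helper: whole proteins are the resampling unit, and the
    error is the standard deviation of the replicate estimates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy sequences, not taken from this proteome.
toy = ["MKTAASAA", "GGSAKTLL"]
err_aa = bootstrap_error(toy, "AA")
```

With only two proteins, as in this proteome, each replicate can contain only one of three distinct compositions, so an error estimated this way is coarse.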

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski