Amino acid dipepetide frequency for Molossus molossus circovirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.311AlaAla: 9.311 ± 6.63
0.0AlaCys: 0.0 ± 0.0
1.862AlaAsp: 1.862 ± 1.326
3.724AlaGlu: 3.724 ± 2.652
0.0AlaPhe: 0.0 ± 0.0
14.898AlaGly: 14.898 ± 7.975
0.0AlaHis: 0.0 ± 0.0
5.587AlaIle: 5.587 ± 3.919
5.587AlaLys: 5.587 ± 1.346
3.724AlaLeu: 3.724 ± 0.02
0.0AlaMet: 0.0 ± 0.0
3.724AlaAsn: 3.724 ± 2.652
0.0AlaPro: 0.0 ± 0.0
5.587AlaGln: 5.587 ± 1.287
1.862AlaArg: 1.862 ± 1.326
3.724AlaSer: 3.724 ± 2.652
1.862AlaThr: 1.862 ± 1.326
1.862AlaVal: 1.862 ± 1.326
0.0AlaTrp: 0.0 ± 0.0
1.862AlaTyr: 1.862 ± 1.326
0.0AlaXaa: 0.0 ± 0.0
Cys
1.862CysAla: 1.862 ± 1.326
0.0CysCys: 0.0 ± 0.0
1.862CysAsp: 1.862 ± 1.306
0.0CysGlu: 0.0 ± 0.0
1.862CysPhe: 1.862 ± 1.306
1.862CysGly: 1.862 ± 1.306
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.862CysLys: 1.862 ± 1.306
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.862CysAsn: 1.862 ± 1.326
1.862CysPro: 1.862 ± 1.306
0.0CysGln: 0.0 ± 0.0
1.862CysArg: 1.862 ± 1.306
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
1.862CysTrp: 1.862 ± 1.306
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.862AspAla: 1.862 ± 1.306
3.724AspCys: 3.724 ± 0.02
5.587AspAsp: 5.587 ± 3.919
1.862AspGlu: 1.862 ± 1.306
1.862AspPhe: 1.862 ± 1.326
7.449AspGly: 7.449 ± 0.039
1.862AspHis: 1.862 ± 1.326
1.862AspIle: 1.862 ± 1.306
1.862AspLys: 1.862 ± 1.306
5.587AspLeu: 5.587 ± 3.919
1.862AspMet: 1.862 ± 1.326
1.862AspAsn: 1.862 ± 1.306
0.0AspPro: 0.0 ± 0.0
0.0AspGln: 0.0 ± 0.0
3.724AspArg: 3.724 ± 2.613
1.862AspSer: 1.862 ± 1.326
3.724AspThr: 3.724 ± 2.613
0.0AspVal: 0.0 ± 0.0
0.0AspTrp: 0.0 ± 0.0
1.862AspTyr: 1.862 ± 1.306
0.0AspXaa: 0.0 ± 0.0
Glu
5.587GluAla: 5.587 ± 1.287
0.0GluCys: 0.0 ± 0.0
1.862GluAsp: 1.862 ± 1.306
1.862GluGlu: 1.862 ± 1.306
3.724GluPhe: 3.724 ± 0.02
5.587GluGly: 5.587 ± 1.287
3.724GluHis: 3.724 ± 2.613
5.587GluIle: 5.587 ± 3.919
3.724GluLys: 3.724 ± 2.613
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
3.724GluAsn: 3.724 ± 2.652
3.724GluPro: 3.724 ± 2.613
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
0.0GluSer: 0.0 ± 0.0
3.724GluThr: 3.724 ± 0.02
3.724GluVal: 3.724 ± 0.02
1.862GluTrp: 1.862 ± 1.306
1.862GluTyr: 1.862 ± 1.306
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
7.449PheAsp: 7.449 ± 2.593
3.724PheGlu: 3.724 ± 0.02
0.0PhePhe: 0.0 ± 0.0
7.449PheGly: 7.449 ± 2.671
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
9.311PheLys: 9.311 ± 1.365
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
1.862PheAsn: 1.862 ± 1.306
0.0PhePro: 0.0 ± 0.0
1.862PheGln: 1.862 ± 1.326
1.862PheArg: 1.862 ± 1.326
3.724PheSer: 3.724 ± 2.652
7.449PheThr: 7.449 ± 0.039
3.724PheVal: 3.724 ± 2.613
0.0PheTrp: 0.0 ± 0.0
1.862PheTyr: 1.862 ± 1.326
0.0PheXaa: 0.0 ± 0.0
Gly
1.862GlyAla: 1.862 ± 1.326
1.862GlyCys: 1.862 ± 1.306
0.0GlyAsp: 0.0 ± 0.0
0.0GlyGlu: 0.0 ± 0.0
3.724GlyPhe: 3.724 ± 2.652
3.724GlyGly: 3.724 ± 0.02
1.862GlyHis: 1.862 ± 1.306
11.173GlyIle: 11.173 ± 7.956
1.862GlyLys: 1.862 ± 1.306
1.862GlyLeu: 1.862 ± 1.326
3.724GlyMet: 3.724 ± 2.652
1.862GlyAsn: 1.862 ± 1.326
5.587GlyPro: 5.587 ± 1.287
3.724GlyGln: 3.724 ± 2.613
3.724GlyArg: 3.724 ± 0.02
5.587GlySer: 5.587 ± 1.287
11.173GlyThr: 11.173 ± 7.956
3.724GlyVal: 3.724 ± 2.652
0.0GlyTrp: 0.0 ± 0.0
7.449GlyTyr: 7.449 ± 5.225
0.0GlyXaa: 0.0 ± 0.0
His
1.862HisAla: 1.862 ± 1.306
0.0HisCys: 0.0 ± 0.0
1.862HisAsp: 1.862 ± 1.326
0.0HisGlu: 0.0 ± 0.0
5.587HisPhe: 5.587 ± 1.287
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.862HisIle: 1.862 ± 1.306
3.724HisLys: 3.724 ± 2.613
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.862HisGln: 1.862 ± 1.306
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
1.862HisThr: 1.862 ± 1.306
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.862HisTyr: 1.862 ± 1.326
0.0HisXaa: 0.0 ± 0.0
Ile
3.724IleAla: 3.724 ± 0.02
0.0IleCys: 0.0 ± 0.0
5.587IleAsp: 5.587 ± 1.287
1.862IleGlu: 1.862 ± 1.306
3.724IlePhe: 3.724 ± 0.02
1.862IleGly: 1.862 ± 1.326
0.0IleHis: 0.0 ± 0.0
5.587IleIle: 5.587 ± 1.287
3.724IleLys: 3.724 ± 0.02
9.311IleLeu: 9.311 ± 1.267
1.862IleMet: 1.862 ± 1.326
9.311IleAsn: 9.311 ± 1.267
5.587IlePro: 5.587 ± 1.346
1.862IleGln: 1.862 ± 1.306
5.587IleArg: 5.587 ± 1.346
7.449IleSer: 7.449 ± 2.671
3.724IleThr: 3.724 ± 0.02
0.0IleVal: 0.0 ± 0.0
0.0IleTrp: 0.0 ± 0.0
5.587IleTyr: 5.587 ± 1.287
0.0IleXaa: 0.0 ± 0.0
Lys
1.862LysAla: 1.862 ± 1.326
1.862LysCys: 1.862 ± 1.306
1.862LysAsp: 1.862 ± 1.306
3.724LysGlu: 3.724 ± 2.613
1.862LysPhe: 1.862 ± 1.326
5.587LysGly: 5.587 ± 1.287
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
9.311LysLys: 9.311 ± 3.899
3.724LysLeu: 3.724 ± 0.02
1.862LysMet: 1.862 ± 1.306
5.587LysAsn: 5.587 ± 3.919
1.862LysPro: 1.862 ± 1.326
3.724LysGln: 3.724 ± 2.613
11.173LysArg: 11.173 ± 2.691
3.724LysSer: 3.724 ± 2.613
9.311LysThr: 9.311 ± 3.899
1.862LysVal: 1.862 ± 1.326
1.862LysTrp: 1.862 ± 1.306
3.724LysTyr: 3.724 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
3.724LeuAla: 3.724 ± 2.652
1.862LeuCys: 1.862 ± 1.306
1.862LeuAsp: 1.862 ± 1.306
5.587LeuGlu: 5.587 ± 1.287
3.724LeuPhe: 3.724 ± 0.02
0.0LeuGly: 0.0 ± 0.0
1.862LeuHis: 1.862 ± 1.306
3.724LeuIle: 3.724 ± 2.613
5.587LeuLys: 5.587 ± 1.287
5.587LeuLeu: 5.587 ± 1.287
0.0LeuMet: 0.0 ± 0.0
1.862LeuAsn: 1.862 ± 1.326
1.862LeuPro: 1.862 ± 1.326
3.724LeuGln: 3.724 ± 2.652
3.724LeuArg: 3.724 ± 0.02
3.724LeuSer: 3.724 ± 0.02
3.724LeuThr: 3.724 ± 0.02
0.0LeuVal: 0.0 ± 0.0
1.862LeuTrp: 1.862 ± 1.306
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.862MetGlu: 1.862 ± 1.326
1.862MetPhe: 1.862 ± 1.326
0.0MetGly: 0.0 ± 0.0
1.862MetHis: 1.862 ± 1.326
5.587MetIle: 5.587 ± 1.287
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
1.862MetMet: 1.862 ± 0.927
0.0MetAsn: 0.0 ± 0.0
1.862MetPro: 1.862 ± 1.326
0.0MetGln: 0.0 ± 0.0
1.862MetArg: 1.862 ± 1.326
1.862MetSer: 1.862 ± 1.306
3.724MetThr: 3.724 ± 2.652
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.724AsnAla: 3.724 ± 2.652
0.0AsnCys: 0.0 ± 0.0
5.587AsnAsp: 5.587 ± 3.919
5.587AsnGlu: 5.587 ± 1.287
1.862AsnPhe: 1.862 ± 1.306
3.724AsnGly: 3.724 ± 2.652
0.0AsnHis: 0.0 ± 0.0
5.587AsnIle: 5.587 ± 3.978
3.724AsnLys: 3.724 ± 0.02
1.862AsnLeu: 1.862 ± 1.326
1.862AsnMet: 1.862 ± 1.306
7.449AsnAsn: 7.449 ± 2.671
1.862AsnPro: 1.862 ± 1.306
1.862AsnGln: 1.862 ± 1.326
0.0AsnArg: 0.0 ± 0.0
3.724AsnSer: 3.724 ± 2.613
3.724AsnThr: 3.724 ± 2.652
1.862AsnVal: 1.862 ± 1.326
1.862AsnTrp: 1.862 ± 1.306
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.724ProAla: 3.724 ± 0.02
0.0ProCys: 0.0 ± 0.0
3.724ProAsp: 3.724 ± 0.02
1.862ProGlu: 1.862 ± 1.306
3.724ProPhe: 3.724 ± 2.652
1.862ProGly: 1.862 ± 1.326
1.862ProHis: 1.862 ± 1.326
3.724ProIle: 3.724 ± 0.02
1.862ProLys: 1.862 ± 1.326
1.862ProLeu: 1.862 ± 1.306
1.862ProMet: 1.862 ± 1.306
1.862ProAsn: 1.862 ± 1.326
3.724ProPro: 3.724 ± 0.02
0.0ProGln: 0.0 ± 0.0
3.724ProArg: 3.724 ± 2.652
0.0ProSer: 0.0 ± 0.0
3.724ProThr: 3.724 ± 2.613
1.862ProVal: 1.862 ± 1.326
1.862ProTrp: 1.862 ± 1.306
3.724ProTyr: 3.724 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
5.587GlnAla: 5.587 ± 1.346
0.0GlnCys: 0.0 ± 0.0
1.862GlnAsp: 1.862 ± 1.326
3.724GlnGlu: 3.724 ± 2.613
1.862GlnPhe: 1.862 ± 1.326
3.724GlnGly: 3.724 ± 2.613
1.862GlnHis: 1.862 ± 1.306
5.587GlnIle: 5.587 ± 1.287
1.862GlnLys: 1.862 ± 1.306
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
1.862GlnAsn: 1.862 ± 1.326
1.862GlnPro: 1.862 ± 1.306
5.587GlnGln: 5.587 ± 1.287
1.862GlnArg: 1.862 ± 1.306
0.0GlnSer: 0.0 ± 0.0
1.862GlnThr: 1.862 ± 1.326
0.0GlnVal: 0.0 ± 0.0
1.862GlnTrp: 1.862 ± 1.326
1.862GlnTyr: 1.862 ± 1.326
0.0GlnXaa: 0.0 ± 0.0
Arg
1.862ArgAla: 1.862 ± 1.326
3.724ArgCys: 3.724 ± 0.02
0.0ArgAsp: 0.0 ± 0.0
5.587ArgGlu: 5.587 ± 3.919
1.862ArgPhe: 1.862 ± 1.326
1.862ArgGly: 1.862 ± 1.326
0.0ArgHis: 0.0 ± 0.0
1.862ArgIle: 1.862 ± 1.306
5.587ArgLys: 5.587 ± 1.346
3.724ArgLeu: 3.724 ± 0.02
1.862ArgMet: 1.862 ± 2.096
3.724ArgAsn: 3.724 ± 0.02
3.724ArgPro: 3.724 ± 2.652
0.0ArgGln: 0.0 ± 0.0
13.035ArgArg: 13.035 ± 4.017
0.0ArgSer: 0.0 ± 0.0
5.587ArgThr: 5.587 ± 1.346
1.862ArgVal: 1.862 ± 1.326
0.0ArgTrp: 0.0 ± 0.0
5.587ArgTyr: 5.587 ± 1.287
0.0ArgXaa: 0.0 ± 0.0
Ser
5.587SerAla: 5.587 ± 3.978
0.0SerCys: 0.0 ± 0.0
0.0SerAsp: 0.0 ± 0.0
1.862SerGlu: 1.862 ± 1.326
7.449SerPhe: 7.449 ± 5.225
0.0SerGly: 0.0 ± 0.0
1.862SerHis: 1.862 ± 1.306
0.0SerIle: 0.0 ± 0.0
5.587SerLys: 5.587 ± 3.919
7.449SerLeu: 7.449 ± 0.039
0.0SerMet: 0.0 ± 0.0
1.862SerAsn: 1.862 ± 1.306
5.587SerPro: 5.587 ± 1.346
1.862SerGln: 1.862 ± 1.306
1.862SerArg: 1.862 ± 1.306
0.0SerSer: 0.0 ± 0.0
0.0SerThr: 0.0 ± 0.0
3.724SerVal: 3.724 ± 2.652
1.862SerTrp: 1.862 ± 1.306
3.724SerTyr: 3.724 ± 2.652
0.0SerXaa: 0.0 ± 0.0
Thr
7.449ThrAla: 7.449 ± 5.304
0.0ThrCys: 0.0 ± 0.0
1.862ThrAsp: 1.862 ± 1.306
0.0ThrGlu: 0.0 ± 0.0
0.0ThrPhe: 0.0 ± 0.0
7.449ThrGly: 7.449 ± 2.671
1.862ThrHis: 1.862 ± 1.306
5.587ThrIle: 5.587 ± 3.978
3.724ThrLys: 3.724 ± 2.613
5.587ThrLeu: 5.587 ± 1.346
1.862ThrMet: 1.862 ± 1.326
1.862ThrAsn: 1.862 ± 1.326
0.0ThrPro: 0.0 ± 0.0
0.0ThrGln: 0.0 ± 0.0
3.724ThrArg: 3.724 ± 0.02
9.311ThrSer: 9.311 ± 3.899
3.724ThrThr: 3.724 ± 2.652
7.449ThrVal: 7.449 ± 0.039
1.862ThrTrp: 1.862 ± 1.306
5.587ThrTyr: 5.587 ± 1.346
0.0ThrXaa: 0.0 ± 0.0
Val
3.724ValAla: 3.724 ± 0.02
0.0ValCys: 0.0 ± 0.0
1.862ValAsp: 1.862 ± 1.326
1.862ValGlu: 1.862 ± 1.306
1.862ValPhe: 1.862 ± 1.326
3.724ValGly: 3.724 ± 2.652
0.0ValHis: 0.0 ± 0.0
9.311ValIle: 9.311 ± 3.997
0.0ValLys: 0.0 ± 0.0
0.0ValLeu: 0.0 ± 0.0
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
3.724ValPro: 3.724 ± 2.652
1.862ValGln: 1.862 ± 1.326
0.0ValArg: 0.0 ± 0.0
1.862ValSer: 1.862 ± 1.326
0.0ValThr: 0.0 ± 0.0
5.587ValVal: 5.587 ± 1.346
1.862ValTrp: 1.862 ± 1.306
5.587ValTyr: 5.587 ± 3.919
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.862TrpCys: 1.862 ± 1.306
1.862TrpAsp: 1.862 ± 1.306
3.724TrpGlu: 3.724 ± 2.613
1.862TrpPhe: 1.862 ± 1.326
3.724TrpGly: 3.724 ± 2.613
0.0TrpHis: 0.0 ± 0.0
1.862TrpIle: 1.862 ± 1.306
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.862TrpAsn: 1.862 ± 1.306
1.862TrpPro: 1.862 ± 1.326
3.724TrpGln: 3.724 ± 0.02
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
1.862TrpTrp: 1.862 ± 1.306
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.862TyrAla: 1.862 ± 1.326
1.862TyrCys: 1.862 ± 1.306
1.862TyrAsp: 1.862 ± 1.306
1.862TyrGlu: 1.862 ± 1.306
1.862TyrPhe: 1.862 ± 1.306
0.0TyrGly: 0.0 ± 0.0
1.862TyrHis: 1.862 ± 1.306
1.862TyrIle: 1.862 ± 1.306
5.587TyrLys: 5.587 ± 1.287
3.724TyrLeu: 3.724 ± 0.02
1.862TyrMet: 1.862 ± 1.326
3.724TyrAsn: 3.724 ± 0.02
1.862TyrPro: 1.862 ± 1.306
5.587TyrGln: 5.587 ± 1.346
3.724TyrArg: 3.724 ± 2.652
3.724TyrSer: 3.724 ± 2.613
0.0TyrThr: 0.0 ± 0.0
5.587TyrVal: 5.587 ± 1.287
3.724TyrTrp: 3.724 ± 2.652
1.862TyrTyr: 1.862 ± 1.326
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (538 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski