Amino acid dipepetide frequency for CRESS virus sp. ctbTJ1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.988AlaAla: 5.988 ± 4.474
1.996AlaCys: 1.996 ± 1.163
0.0AlaAsp: 0.0 ± 0.0
1.996AlaGlu: 1.996 ± 1.491
1.996AlaPhe: 1.996 ± 1.163
7.984AlaGly: 7.984 ± 1.997
0.0AlaHis: 0.0 ± 0.0
1.996AlaIle: 1.996 ± 1.163
5.988AlaLys: 5.988 ± 0.834
5.988AlaLeu: 5.988 ± 0.834
0.0AlaMet: 0.0 ± 0.0
3.992AlaAsn: 3.992 ± 2.983
5.988AlaPro: 5.988 ± 1.82
0.0AlaGln: 0.0 ± 0.0
7.984AlaArg: 7.984 ± 4.651
5.988AlaSer: 5.988 ± 0.834
7.984AlaThr: 7.984 ± 5.965
5.988AlaVal: 5.988 ± 1.82
1.996AlaTrp: 1.996 ± 1.163
5.988AlaTyr: 5.988 ± 1.82
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.996CysGlu: 1.996 ± 1.163
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
3.992CysMet: 3.992 ± 2.326
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.996CysArg: 1.996 ± 1.491
1.996CysSer: 1.996 ± 1.163
0.0CysThr: 0.0 ± 0.0
7.984CysVal: 7.984 ± 4.651
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.996AspAla: 1.996 ± 1.491
0.0AspCys: 0.0 ± 0.0
5.988AspAsp: 5.988 ± 0.834
5.988AspGlu: 5.988 ± 0.834
0.0AspPhe: 0.0 ± 0.0
1.996AspGly: 1.996 ± 1.491
0.0AspHis: 0.0 ± 0.0
1.996AspIle: 1.996 ± 1.163
3.992AspLys: 3.992 ± 2.326
3.992AspLeu: 3.992 ± 2.326
1.996AspMet: 1.996 ± 1.163
3.992AspAsn: 3.992 ± 0.328
3.992AspPro: 3.992 ± 0.328
1.996AspGln: 1.996 ± 1.491
1.996AspArg: 1.996 ± 1.491
1.996AspSer: 1.996 ± 1.491
3.992AspThr: 3.992 ± 0.328
0.0AspVal: 0.0 ± 0.0
1.996AspTrp: 1.996 ± 1.491
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.988GluAla: 5.988 ± 0.834
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
5.988GluGlu: 5.988 ± 3.489
1.996GluPhe: 1.996 ± 1.163
3.992GluGly: 3.992 ± 2.326
0.0GluHis: 0.0 ± 0.0
1.996GluIle: 1.996 ± 1.163
3.992GluLys: 3.992 ± 2.326
1.996GluLeu: 1.996 ± 1.163
0.0GluMet: 0.0 ± 0.0
3.992GluAsn: 3.992 ± 0.328
1.996GluPro: 1.996 ± 1.163
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
0.0GluSer: 0.0 ± 0.0
3.992GluThr: 3.992 ± 2.326
7.984GluVal: 7.984 ± 4.651
3.992GluTrp: 3.992 ± 2.326
3.992GluTyr: 3.992 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.996PheAsp: 1.996 ± 1.163
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
3.992PheHis: 3.992 ± 0.328
0.0PheIle: 0.0 ± 0.0
7.984PheLys: 7.984 ± 0.657
3.992PheLeu: 3.992 ± 2.983
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
3.992PheGln: 3.992 ± 2.326
0.0PheArg: 0.0 ± 0.0
5.988PheSer: 5.988 ± 3.489
3.992PheThr: 3.992 ± 0.328
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
9.98GlyAla: 9.98 ± 2.148
1.996GlyCys: 1.996 ± 1.163
3.992GlyAsp: 3.992 ± 0.328
0.0GlyGlu: 0.0 ± 0.0
0.0GlyPhe: 0.0 ± 0.0
7.984GlyGly: 7.984 ± 0.657
0.0GlyHis: 0.0 ± 0.0
1.996GlyIle: 1.996 ± 1.163
3.992GlyLys: 3.992 ± 2.326
5.988GlyLeu: 5.988 ± 0.834
0.0GlyMet: 0.0 ± 0.0
5.988GlyAsn: 5.988 ± 1.82
5.988GlyPro: 5.988 ± 0.834
5.988GlyGln: 5.988 ± 0.834
1.996GlyArg: 1.996 ± 1.163
3.992GlySer: 3.992 ± 0.328
5.988GlyThr: 5.988 ± 1.82
3.992GlyVal: 3.992 ± 0.328
1.996GlyTrp: 1.996 ± 1.163
3.992GlyTyr: 3.992 ± 2.326
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.992HisGly: 3.992 ± 2.326
1.996HisHis: 1.996 ± 1.491
5.988HisIle: 5.988 ± 4.474
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.996HisArg: 1.996 ± 1.163
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.996HisVal: 1.996 ± 1.491
0.0HisTrp: 0.0 ± 0.0
3.992HisTyr: 3.992 ± 0.328
0.0HisXaa: 0.0 ± 0.0
Ile
3.992IleAla: 3.992 ± 0.328
0.0IleCys: 0.0 ± 0.0
3.992IleAsp: 3.992 ± 2.326
0.0IleGlu: 0.0 ± 0.0
5.988IlePhe: 5.988 ± 1.82
0.0IleGly: 0.0 ± 0.0
1.996IleHis: 1.996 ± 1.491
0.0IleIle: 0.0 ± 0.0
3.992IleLys: 3.992 ± 0.328
3.992IleLeu: 3.992 ± 2.983
3.992IleMet: 3.992 ± 2.983
0.0IleAsn: 0.0 ± 0.0
0.0IlePro: 0.0 ± 0.0
1.996IleGln: 1.996 ± 1.491
1.996IleArg: 1.996 ± 1.163
3.992IleSer: 3.992 ± 0.328
3.992IleThr: 3.992 ± 2.326
1.996IleVal: 1.996 ± 1.491
1.996IleTrp: 1.996 ± 1.163
3.992IleTyr: 3.992 ± 0.328
0.0IleXaa: 0.0 ± 0.0
Lys
7.984LysAla: 7.984 ± 1.997
1.996LysCys: 1.996 ± 1.163
0.0LysAsp: 0.0 ± 0.0
3.992LysGlu: 3.992 ± 2.326
1.996LysPhe: 1.996 ± 1.491
3.992LysGly: 3.992 ± 0.328
1.996LysHis: 1.996 ± 1.491
3.992LysIle: 3.992 ± 2.326
5.988LysLys: 5.988 ± 4.474
3.992LysLeu: 3.992 ± 2.326
1.996LysMet: 1.996 ± 1.491
0.0LysAsn: 0.0 ± 0.0
5.988LysPro: 5.988 ± 3.489
0.0LysGln: 0.0 ± 0.0
3.992LysArg: 3.992 ± 2.983
0.0LysSer: 0.0 ± 0.0
5.988LysThr: 5.988 ± 3.489
7.984LysVal: 7.984 ± 1.997
1.996LysTrp: 1.996 ± 1.163
5.988LysTyr: 5.988 ± 4.474
0.0LysXaa: 0.0 ± 0.0
Leu
5.988LeuAla: 5.988 ± 1.82
0.0LeuCys: 0.0 ± 0.0
9.98LeuAsp: 9.98 ± 7.457
3.992LeuGlu: 3.992 ± 2.326
3.992LeuPhe: 3.992 ± 2.326
7.984LeuGly: 7.984 ± 0.657
1.996LeuHis: 1.996 ± 1.163
5.988LeuIle: 5.988 ± 1.82
1.996LeuLys: 1.996 ± 1.163
1.996LeuLeu: 1.996 ± 1.491
0.0LeuMet: 0.0 ± 0.0
5.988LeuAsn: 5.988 ± 4.474
3.992LeuPro: 3.992 ± 2.326
1.996LeuGln: 1.996 ± 1.491
7.984LeuArg: 7.984 ± 1.997
1.996LeuSer: 1.996 ± 1.163
0.0LeuThr: 0.0 ± 0.0
0.0LeuVal: 0.0 ± 0.0
1.996LeuTrp: 1.996 ± 1.163
1.996LeuTyr: 1.996 ± 1.491
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.996MetAsp: 1.996 ± 1.163
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
1.996MetHis: 1.996 ± 1.491
3.992MetIle: 3.992 ± 0.328
1.996MetLys: 1.996 ± 1.163
1.996MetLeu: 1.996 ± 1.491
0.0MetMet: 0.0 ± 0.0
1.996MetAsn: 1.996 ± 1.491
1.996MetPro: 1.996 ± 1.491
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
3.992MetThr: 3.992 ± 2.326
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.996AsnCys: 1.996 ± 1.491
0.0AsnAsp: 0.0 ± 0.0
0.0AsnGlu: 0.0 ± 0.0
3.992AsnPhe: 3.992 ± 2.326
3.992AsnGly: 3.992 ± 2.983
1.996AsnHis: 1.996 ± 1.491
1.996AsnIle: 1.996 ± 1.491
3.992AsnLys: 3.992 ± 2.326
1.996AsnLeu: 1.996 ± 1.491
0.0AsnMet: 0.0 ± 1.059
5.988AsnAsn: 5.988 ± 1.82
5.988AsnPro: 5.988 ± 4.474
0.0AsnGln: 0.0 ± 0.0
3.992AsnArg: 3.992 ± 2.983
3.992AsnSer: 3.992 ± 2.983
1.996AsnThr: 1.996 ± 1.491
1.996AsnVal: 1.996 ± 1.163
0.0AsnTrp: 0.0 ± 0.0
3.992AsnTyr: 3.992 ± 0.328
0.0AsnXaa: 0.0 ± 0.0
Pro
5.988ProAla: 5.988 ± 3.489
1.996ProCys: 1.996 ± 1.163
0.0ProAsp: 0.0 ± 0.0
5.988ProGlu: 5.988 ± 0.834
0.0ProPhe: 0.0 ± 0.0
5.988ProGly: 5.988 ± 0.834
0.0ProHis: 0.0 ± 0.0
1.996ProIle: 1.996 ± 1.491
3.992ProLys: 3.992 ± 0.328
3.992ProLeu: 3.992 ± 2.983
1.996ProMet: 1.996 ± 0.951
0.0ProAsn: 0.0 ± 0.0
1.996ProPro: 1.996 ± 1.491
1.996ProGln: 1.996 ± 1.163
7.984ProArg: 7.984 ± 1.997
0.0ProSer: 0.0 ± 0.0
5.988ProThr: 5.988 ± 4.474
1.996ProVal: 1.996 ± 1.163
1.996ProTrp: 1.996 ± 1.163
1.996ProTyr: 1.996 ± 1.163
0.0ProXaa: 0.0 ± 0.0
Gln
3.992GlnAla: 3.992 ± 0.328
0.0GlnCys: 0.0 ± 0.0
1.996GlnAsp: 1.996 ± 1.491
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
3.992GlnGly: 3.992 ± 0.328
0.0GlnHis: 0.0 ± 0.0
1.996GlnIle: 1.996 ± 1.491
1.996GlnLys: 1.996 ± 1.163
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
1.996GlnGln: 1.996 ± 1.163
1.996GlnArg: 1.996 ± 1.163
3.992GlnSer: 3.992 ± 0.328
1.996GlnThr: 1.996 ± 1.163
1.996GlnVal: 1.996 ± 1.163
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
9.98ArgAla: 9.98 ± 0.506
0.0ArgCys: 0.0 ± 0.0
1.996ArgAsp: 1.996 ± 1.163
5.988ArgGlu: 5.988 ± 3.489
9.98ArgPhe: 9.98 ± 2.148
1.996ArgGly: 1.996 ± 1.163
1.996ArgHis: 1.996 ± 1.163
1.996ArgIle: 1.996 ± 1.491
1.996ArgLys: 1.996 ± 1.491
3.992ArgLeu: 3.992 ± 0.328
0.0ArgMet: 0.0 ± 0.0
5.988ArgAsn: 5.988 ± 1.82
1.996ArgPro: 1.996 ± 1.163
0.0ArgGln: 0.0 ± 0.0
7.984ArgArg: 7.984 ± 3.311
1.996ArgSer: 1.996 ± 1.491
3.992ArgThr: 3.992 ± 0.328
1.996ArgVal: 1.996 ± 1.163
0.0ArgTrp: 0.0 ± 0.0
5.988ArgTyr: 5.988 ± 0.834
0.0ArgXaa: 0.0 ± 0.0
Ser
3.992SerAla: 3.992 ± 2.983
1.996SerCys: 1.996 ± 1.163
1.996SerAsp: 1.996 ± 1.491
3.992SerGlu: 3.992 ± 2.326
0.0SerPhe: 0.0 ± 0.0
5.988SerGly: 5.988 ± 0.834
0.0SerHis: 0.0 ± 0.0
1.996SerIle: 1.996 ± 1.491
3.992SerLys: 3.992 ± 0.328
5.988SerLeu: 5.988 ± 0.834
3.992SerMet: 3.992 ± 0.328
1.996SerAsn: 1.996 ± 1.163
0.0SerPro: 0.0 ± 0.0
0.0SerGln: 0.0 ± 0.0
5.988SerArg: 5.988 ± 0.834
7.984SerSer: 7.984 ± 3.311
1.996SerThr: 1.996 ± 1.491
3.992SerVal: 3.992 ± 2.983
0.0SerTrp: 0.0 ± 0.0
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
9.98ThrAla: 9.98 ± 2.148
0.0ThrCys: 0.0 ± 0.0
5.988ThrAsp: 5.988 ± 0.834
1.996ThrGlu: 1.996 ± 1.163
0.0ThrPhe: 0.0 ± 0.0
5.988ThrGly: 5.988 ± 0.834
1.996ThrHis: 1.996 ± 1.163
3.992ThrIle: 3.992 ± 2.326
1.996ThrLys: 1.996 ± 1.491
1.996ThrLeu: 1.996 ± 1.491
0.0ThrMet: 0.0 ± 0.0
3.992ThrAsn: 3.992 ± 2.983
0.0ThrPro: 0.0 ± 0.0
1.996ThrGln: 1.996 ± 1.163
3.992ThrArg: 3.992 ± 2.326
9.98ThrSer: 9.98 ± 2.148
0.0ThrThr: 0.0 ± 0.0
5.988ThrVal: 5.988 ± 4.474
3.992ThrTrp: 3.992 ± 2.326
3.992ThrTyr: 3.992 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
1.996ValAla: 1.996 ± 1.163
0.0ValCys: 0.0 ± 0.0
5.988ValAsp: 5.988 ± 0.834
3.992ValGlu: 3.992 ± 2.326
1.996ValPhe: 1.996 ± 1.163
1.996ValGly: 1.996 ± 1.163
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
5.988ValLys: 5.988 ± 0.834
7.984ValLeu: 7.984 ± 0.657
0.0ValMet: 0.0 ± 0.0
1.996ValAsn: 1.996 ± 1.491
7.984ValPro: 7.984 ± 1.997
1.996ValGln: 1.996 ± 1.491
0.0ValArg: 0.0 ± 0.0
1.996ValSer: 1.996 ± 1.491
11.976ValThr: 11.976 ± 1.669
5.988ValVal: 5.988 ± 0.834
3.992ValTrp: 3.992 ± 0.328
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.996TrpAla: 1.996 ± 1.163
0.0TrpCys: 0.0 ± 0.0
1.996TrpAsp: 1.996 ± 1.163
1.996TrpGlu: 1.996 ± 1.163
0.0TrpPhe: 0.0 ± 0.0
3.992TrpGly: 3.992 ± 0.328
0.0TrpHis: 0.0 ± 0.0
1.996TrpIle: 1.996 ± 1.163
0.0TrpLys: 0.0 ± 0.0
3.992TrpLeu: 3.992 ± 2.326
0.0TrpMet: 0.0 ± 0.0
1.996TrpAsn: 1.996 ± 1.491
3.992TrpPro: 3.992 ± 2.326
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.996TrpVal: 1.996 ± 1.163
1.996TrpTrp: 1.996 ± 1.163
1.996TrpTyr: 1.996 ± 1.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
5.988TyrCys: 5.988 ± 3.489
0.0TyrAsp: 0.0 ± 0.0
3.992TyrGlu: 3.992 ± 2.326
0.0TyrPhe: 0.0 ± 0.0
3.992TyrGly: 3.992 ± 0.328
0.0TyrHis: 0.0 ± 0.0
3.992TyrIle: 3.992 ± 2.983
5.988TyrLys: 5.988 ± 1.82
5.988TyrLeu: 5.988 ± 1.82
0.0TyrMet: 0.0 ± 0.0
1.996TyrAsn: 1.996 ± 1.163
3.992TyrPro: 3.992 ± 0.328
1.996TyrGln: 1.996 ± 1.163
7.984TyrArg: 7.984 ± 5.965
0.0TyrSer: 0.0 ± 0.0
0.0TyrThr: 0.0 ± 0.0
1.996TyrVal: 1.996 ± 1.163
0.0TyrTrp: 0.0 ± 0.0
1.996TyrTyr: 1.996 ± 1.491
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski