Amino acid dipeptide frequency for Hermit crab associated circular virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
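The calculation described above can be sketched in Python. This is only an illustration, not the Proteome-pI code: the function names `dipeptide_permille` and `classify` are hypothetical, and the sketch assumes that dipeptides are overlapping adjacent pairs counted within each protein (never across protein boundaries) and that the expected value is the uniform 1/400.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the table's column order; Xaa stands for any
# non-standard or unknown residue.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
RESIDUES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption of this sketch: pairs are counted inside each protein only,
    so no dipeptide spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(RESIDUES, repeat=2)
    }


def classify(freq_permille, n_pairs=400):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / n_pairs
    if freq_permille > expected:
        return "over"    # would be marked in red
    if freq_permille < expected:
        return "under"   # would be marked in blue
    return "expected"
```

For example, `dipeptide_permille(["AAKK", "KKA"])` counts five dipeptides in total, two of which are LysLys, so LysLys gets 400‰ and is classified as "over".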

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each block below is headed by the first residue; entries are dipeptide: frequency ± error (‰).
Ala
AlaAla: 5.102 ± 3.865
AlaCys: 1.701 ± 1.208
AlaAsp: 0.0 ± 0.0
AlaGlu: 3.401 ± 2.416
AlaPhe: 0.0 ± 0.0
AlaGly: 6.803 ± 0.161
AlaHis: 5.102 ± 1.127
AlaIle: 5.102 ± 1.127
AlaLys: 3.401 ± 2.416
AlaLeu: 1.701 ± 1.288
AlaMet: 0.0 ± 0.865
AlaAsn: 5.102 ± 3.865
AlaPro: 3.401 ± 2.416
AlaGln: 0.0 ± 0.0
AlaArg: 0.0 ± 0.0
AlaSer: 3.401 ± 0.081
AlaThr: 0.0 ± 0.0
AlaVal: 6.803 ± 2.658
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.401 ± 2.577
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 3.401 ± 2.416
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.701 ± 1.208
CysLys: 1.701 ± 1.208
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 3.401 ± 0.081
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.102 ± 1.369
AspCys: 0.0 ± 0.0
AspAsp: 3.401 ± 2.416
AspGlu: 5.102 ± 3.623
AspPhe: 1.701 ± 1.208
AspGly: 1.701 ± 1.208
AspHis: 1.701 ± 1.208
AspIle: 0.0 ± 0.0
AspLys: 0.0 ± 0.0
AspLeu: 8.503 ± 1.046
AspMet: 0.0 ± 0.0
AspAsn: 1.701 ± 1.288
AspPro: 5.102 ± 1.369
AspGln: 0.0 ± 0.0
AspArg: 6.803 ± 4.831
AspSer: 1.701 ± 1.208
AspThr: 1.701 ± 1.288
AspVal: 5.102 ± 3.623
AspTrp: 0.0 ± 0.0
AspTyr: 5.102 ± 3.623
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 1.701 ± 1.208
GluGlu: 5.102 ± 1.127
GluPhe: 5.102 ± 1.127
GluGly: 3.401 ± 2.577
GluHis: 1.701 ± 1.288
GluIle: 0.0 ± 0.0
GluLys: 5.102 ± 1.127
GluLeu: 6.803 ± 2.335
GluMet: 0.0 ± 0.0
GluAsn: 5.102 ± 1.369
GluPro: 3.401 ± 0.081
GluGln: 6.803 ± 0.161
GluArg: 5.102 ± 3.623
GluSer: 1.701 ± 1.208
GluThr: 0.0 ± 0.0
GluVal: 1.701 ± 1.208
GluTrp: 1.701 ± 1.208
GluTyr: 1.701 ± 1.208
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.701 ± 1.288
PheCys: 1.701 ± 1.208
PheAsp: 5.102 ± 3.623
PheGlu: 0.0 ± 0.0
PhePhe: 5.102 ± 1.127
PheGly: 1.701 ± 1.208
PheHis: 1.701 ± 1.288
PheIle: 3.401 ± 0.081
PheLys: 5.102 ± 3.865
PheLeu: 3.401 ± 0.081
PheMet: 1.701 ± 1.208
PheAsn: 0.0 ± 0.0
PhePro: 3.401 ± 0.081
PheGln: 0.0 ± 0.0
PheArg: 1.701 ± 1.208
PheSer: 6.803 ± 2.335
PheThr: 3.401 ± 0.081
PheVal: 3.401 ± 0.081
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.401 ± 2.577
GlyCys: 0.0 ± 0.0
GlyAsp: 3.401 ± 0.081
GlyGlu: 1.701 ± 1.208
GlyPhe: 1.701 ± 1.288
GlyGly: 1.701 ± 1.208
GlyHis: 0.0 ± 0.0
GlyIle: 1.701 ± 1.288
GlyLys: 5.102 ± 1.127
GlyLeu: 1.701 ± 1.208
GlyMet: 1.701 ± 1.208
GlyAsn: 6.803 ± 2.335
GlyPro: 0.0 ± 0.0
GlyGln: 3.401 ± 0.081
GlyArg: 5.102 ± 1.127
GlySer: 1.701 ± 1.288
GlyThr: 6.803 ± 2.658
GlyVal: 5.102 ± 3.865
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.401 ± 0.081
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.701 ± 1.208
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.701 ± 1.288
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 1.701 ± 1.208
HisIle: 5.102 ± 1.127
HisLys: 0.0 ± 0.0
HisLeu: 1.701 ± 1.208
HisMet: 1.701 ± 1.288
HisAsn: 0.0 ± 0.0
HisPro: 1.701 ± 1.208
HisGln: 0.0 ± 0.0
HisArg: 3.401 ± 0.081
HisSer: 0.0 ± 0.0
HisThr: 1.701 ± 1.288
HisVal: 1.701 ± 1.208
HisTrp: 1.701 ± 1.208
HisTyr: 1.701 ± 1.288
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.701 ± 1.288
IleCys: 0.0 ± 0.0
IleAsp: 3.401 ± 0.081
IleGlu: 0.0 ± 0.0
IlePhe: 1.701 ± 1.208
IleGly: 1.701 ± 1.208
IleHis: 0.0 ± 0.0
IleIle: 1.701 ± 1.288
IleLys: 3.401 ± 0.081
IleLeu: 1.701 ± 1.208
IleMet: 1.701 ± 1.288
IleAsn: 5.102 ± 3.865
IlePro: 3.401 ± 0.081
IleGln: 0.0 ± 0.0
IleArg: 5.102 ± 1.127
IleSer: 5.102 ± 1.369
IleThr: 0.0 ± 0.0
IleVal: 1.701 ± 1.208
IleTrp: 0.0 ± 0.0
IleTyr: 1.701 ± 1.288
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 3.401 ± 0.081
LysGlu: 6.803 ± 2.335
LysPhe: 1.701 ± 1.288
LysGly: 0.0 ± 0.0
LysHis: 0.0 ± 0.0
LysIle: 6.803 ± 2.658
LysLys: 11.905 ± 3.462
LysLeu: 1.701 ± 1.208
LysMet: 1.701 ± 1.288
LysAsn: 5.102 ± 1.369
LysPro: 0.0 ± 0.0
LysGln: 5.102 ± 3.623
LysArg: 5.102 ± 1.127
LysSer: 5.102 ± 1.127
LysThr: 3.401 ± 2.577
LysVal: 3.401 ± 0.081
LysTrp: 1.701 ± 1.288
LysTyr: 6.803 ± 2.658
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.401 ± 2.416
LeuCys: 1.701 ± 1.208
LeuAsp: 1.701 ± 1.208
LeuGlu: 3.401 ± 2.416
LeuPhe: 1.701 ± 1.208
LeuGly: 1.701 ± 1.288
LeuHis: 1.701 ± 1.208
LeuIle: 3.401 ± 0.081
LeuLys: 6.803 ± 2.335
LeuLeu: 0.0 ± 0.0
LeuMet: 1.701 ± 1.208
LeuAsn: 3.401 ± 2.416
LeuPro: 1.701 ± 1.208
LeuGln: 1.701 ± 1.208
LeuArg: 3.401 ± 0.081
LeuSer: 5.102 ± 3.623
LeuThr: 5.102 ± 1.369
LeuVal: 8.503 ± 1.45
LeuTrp: 1.701 ± 1.208
LeuTyr: 6.803 ± 5.154
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.701 ± 1.288
MetGlu: 1.701 ± 1.208
MetPhe: 3.401 ± 0.081
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.401 ± 0.081
MetLeu: 1.701 ± 1.208
MetMet: 0.0 ± 0.0
MetAsn: 1.701 ± 1.288
MetPro: 3.401 ± 0.081
MetGln: 3.401 ± 0.081
MetArg: 1.701 ± 1.288
MetSer: 1.701 ± 1.288
MetThr: 3.401 ± 0.081
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 5.102 ± 1.369
AsnGlu: 3.401 ± 0.081
AsnPhe: 5.102 ± 1.369
AsnGly: 3.401 ± 0.081
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 3.401 ± 2.577
AsnMet: 0.0 ± 0.0
AsnAsn: 6.803 ± 2.335
AsnPro: 6.803 ± 5.154
AsnGln: 0.0 ± 0.0
AsnArg: 6.803 ± 2.658
AsnSer: 3.401 ± 0.081
AsnThr: 6.803 ± 0.161
AsnVal: 3.401 ± 2.577
AsnTrp: 0.0 ± 0.0
AsnTyr: 5.102 ± 1.369
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.503 ± 1.046
ProCys: 0.0 ± 0.0
ProAsp: 3.401 ± 2.416
ProGlu: 5.102 ± 3.865
ProPhe: 0.0 ± 0.0
ProGly: 3.401 ± 2.577
ProHis: 6.803 ± 2.335
ProIle: 0.0 ± 0.0
ProLys: 6.803 ± 0.161
ProLeu: 1.701 ± 1.288
ProMet: 0.0 ± 0.0
ProAsn: 3.401 ± 0.081
ProPro: 3.401 ± 0.081
ProGln: 0.0 ± 0.0
ProArg: 8.503 ± 1.046
ProSer: 5.102 ± 1.369
ProThr: 8.503 ± 1.046
ProVal: 3.401 ± 2.577
ProTrp: 1.701 ± 1.208
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.701 ± 1.208
GlnCys: 0.0 ± 0.0
GlnAsp: 1.701 ± 1.208
GlnGlu: 5.102 ± 1.369
GlnPhe: 0.0 ± 0.0
GlnGly: 5.102 ± 3.623
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 3.401 ± 2.416
GlnMet: 5.102 ± 1.369
GlnAsn: 1.701 ± 1.288
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.701 ± 1.208
GlnSer: 1.701 ± 1.288
GlnThr: 1.701 ± 1.288
GlnVal: 3.401 ± 0.081
GlnTrp: 1.701 ± 1.208
GlnTyr: 3.401 ± 0.081
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.401 ± 0.081
ArgCys: 0.0 ± 0.0
ArgAsp: 6.803 ± 4.831
ArgGlu: 5.102 ± 1.127
ArgPhe: 3.401 ± 2.416
ArgGly: 8.503 ± 1.45
ArgHis: 0.0 ± 0.0
ArgIle: 1.701 ± 1.288
ArgLys: 3.401 ± 2.577
ArgLeu: 6.803 ± 0.161
ArgMet: 1.701 ± 1.208
ArgAsn: 1.701 ± 1.288
ArgPro: 8.503 ± 1.046
ArgGln: 5.102 ± 1.127
ArgArg: 5.102 ± 3.865
ArgSer: 3.401 ± 2.416
ArgThr: 5.102 ± 1.127
ArgVal: 3.401 ± 0.081
ArgTrp: 1.701 ± 1.288
ArgTyr: 3.401 ± 2.416
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.204 ± 0.242
SerCys: 0.0 ± 0.0
SerAsp: 1.701 ± 1.208
SerGlu: 1.701 ± 1.288
SerPhe: 1.701 ± 1.208
SerGly: 1.701 ± 1.208
SerHis: 0.0 ± 0.0
SerIle: 3.401 ± 0.081
SerLys: 5.102 ± 3.865
SerLeu: 1.701 ± 1.208
SerMet: 1.701 ± 1.288
SerAsn: 0.0 ± 0.0
SerPro: 8.503 ± 1.046
SerGln: 1.701 ± 1.288
SerArg: 5.102 ± 1.127
SerSer: 1.701 ± 1.288
SerThr: 5.102 ± 1.369
SerVal: 0.0 ± 0.0
SerTrp: 1.701 ± 1.208
SerTyr: 1.701 ± 1.208
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.401 ± 0.081
ThrCys: 0.0 ± 0.0
ThrAsp: 5.102 ± 1.127
ThrGlu: 3.401 ± 0.081
ThrPhe: 1.701 ± 1.288
ThrGly: 5.102 ± 3.865
ThrHis: 3.401 ± 0.081
ThrIle: 3.401 ± 2.577
ThrLys: 5.102 ± 1.127
ThrLeu: 8.503 ± 1.046
ThrMet: 1.701 ± 1.288
ThrAsn: 3.401 ± 2.577
ThrPro: 8.503 ± 1.046
ThrGln: 1.701 ± 1.288
ThrArg: 3.401 ± 0.081
ThrSer: 0.0 ± 0.0
ThrThr: 3.401 ± 2.577
ThrVal: 0.0 ± 0.0
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.701 ± 1.288
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.401 ± 0.081
ValCys: 0.0 ± 0.0
ValAsp: 3.401 ± 2.416
ValGlu: 0.0 ± 0.0
ValPhe: 3.401 ± 2.416
ValGly: 1.701 ± 1.288
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 3.401 ± 2.577
ValLeu: 5.102 ± 1.127
ValMet: 3.401 ± 1.901
ValAsn: 3.401 ± 2.577
ValPro: 5.102 ± 1.369
ValGln: 6.803 ± 0.161
ValArg: 5.102 ± 1.369
ValSer: 3.401 ± 2.577
ValThr: 3.401 ± 0.081
ValVal: 8.503 ± 1.046
ValTrp: 1.701 ± 1.288
ValTyr: 1.701 ± 1.288
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.701 ± 1.288
TrpCys: 1.701 ± 1.208
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.701 ± 1.208
TrpPhe: 3.401 ± 0.081
TrpGly: 1.701 ± 1.208
TrpHis: 0.0 ± 0.0
TrpIle: 1.701 ± 1.208
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.701 ± 1.288
TrpPro: 0.0 ± 0.0
TrpGln: 1.701 ± 1.208
TrpArg: 0.0 ± 0.0
TrpSer: 1.701 ± 1.288
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 1.701 ± 1.288
TrpTyr: 1.701 ± 1.288
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.701 ± 1.208
TyrCys: 3.401 ± 0.081
TyrAsp: 3.401 ± 0.081
TyrGlu: 1.701 ± 1.208
TyrPhe: 5.102 ± 3.865
TyrGly: 5.102 ± 1.369
TyrHis: 1.701 ± 1.288
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 5.102 ± 3.623
TyrMet: 1.701 ± 1.288
TyrAsn: 0.0 ± 0.0
TyrPro: 3.401 ± 2.577
TyrGln: 0.0 ± 0.0
TyrArg: 5.102 ± 1.369
TyrSer: 1.701 ± 1.288
TyrThr: 3.401 ± 0.081
TyrVal: 3.401 ± 0.081
TyrTrp: 3.401 ± 2.577
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (589 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
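A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the names `pair_permille` and `bootstrap_error` are hypothetical, sequences use one-letter codes, and the reported error is assumed to be the standard deviation of the resampled frequencies (the page does not say which statistic is used).

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, given in one-letter codes, e.g. "KK"."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement.
    Assumption: the error is the population standard deviation of the
    resampled frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, dipeptide))
    return statistics.pstdev(estimates)
```

Under this scheme, a dipeptide that is absent from every protein has frequency 0.0 in every resample, so its estimated error is also 0.0. With only 2 proteins, each resample can take just a few distinct compositions, so the error estimates are coarse.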

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski