Amino acid dipeptide frequency for Xinzhou partiti-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
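As an illustration, a dipeptide frequency could be computed along the following lines. This is only a sketch, not the procedure used by the database: the function name `dipeptide_fractions` is hypothetical, sequences are assumed to be given in one-letter code, and dipeptides are assumed to be counted as overlapping pairs that never span two different proteins.

```python
from collections import Counter


def dipeptide_fractions(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Assumption: each sequence is a string of one-letter amino acid codes,
    and pairs are counted within a protein only, never across proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip pairs every residue with its successor: "ACD" -> "AC", "CD"
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not taken from the proteome described on this page.
    print(dipeptide_fractions(["AAC", "AC"]))
```

A protein of length n contributes n - 1 dipeptides under this counting scheme, so very short sequences add little or nothing to the totals.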

There are 400 possibilities (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
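The expected value and the over/under comparison can be sketched as follows. The function names are hypothetical, and the comparison assumes the uniform expectation described above (every dipeptide equally likely); the exact threshold the page uses for colouring is not stated here.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"   # marked in red on the page
    if observed_permille < expected:
        return "under"  # marked in blue on the page
    return "expected"


if __name__ == "__main__":
    print(expected_uniform_permille())  # 20 amino acids: 400 dipeptides
    print(classify(6.519))              # AlaAla value from the table
    print(classify(1.304))              # AlaGlu value from the table
```

Passing `alphabet_size=21` gives the expectation for the 441-dipeptide case that includes Xaa.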

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
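A small worked conversion, using the AlaAla value from the table below (the helper names are hypothetical):

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


if __name__ == "__main__":
    ala_ala = 6.519  # AlaAla frequency in per mille
    print(f"{permille_to_fraction(ala_ala):.6f}")  # 0.006519
    print(f"{permille_to_percent(ala_ala):.4f}")   # 0.6519
```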

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.519AlaAla: 6.519 ± 1.78
2.608AlaCys: 2.608 ± 1.552
2.608AlaAsp: 2.608 ± 0.335
1.304AlaGlu: 1.304 ± 1.111
6.519AlaPhe: 6.519 ± 0.107
2.608AlaGly: 2.608 ± 0.335
1.304AlaHis: 1.304 ± 1.111
7.823AlaIle: 7.823 ± 2.77
2.608AlaLys: 2.608 ± 2.221
6.519AlaLeu: 6.519 ± 0.107
0.0AlaMet: 0.0 ± 0.0
5.215AlaAsn: 5.215 ± 2.556
2.608AlaPro: 2.608 ± 0.335
2.608AlaGln: 2.608 ± 0.335
5.215AlaArg: 5.215 ± 1.218
7.823AlaSer: 7.823 ± 4.777
6.519AlaThr: 6.519 ± 3.666
3.911AlaVal: 3.911 ± 0.442
0.0AlaTrp: 0.0 ± 0.0
2.608AlaTyr: 2.608 ± 1.552
0.0AlaXaa: 0.0 ± 0.0
Cys
1.304CysAla: 1.304 ± 1.111
0.0CysCys: 0.0 ± 0.0
1.304CysAsp: 1.304 ± 0.776
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
3.911CysHis: 3.911 ± 0.442
1.304CysIle: 1.304 ± 0.776
0.0CysLys: 0.0 ± 0.0
1.304CysLeu: 1.304 ± 0.776
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.304CysPro: 1.304 ± 1.111
1.304CysGln: 1.304 ± 0.776
1.304CysArg: 1.304 ± 0.776
1.304CysSer: 1.304 ± 1.111
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.911AspAla: 3.911 ± 1.445
0.0AspCys: 0.0 ± 0.0
3.911AspAsp: 3.911 ± 0.442
1.304AspGlu: 1.304 ± 0.776
3.911AspPhe: 3.911 ± 0.442
3.911AspGly: 3.911 ± 1.445
0.0AspHis: 0.0 ± 0.0
3.911AspIle: 3.911 ± 0.442
0.0AspLys: 0.0 ± 0.0
2.608AspLeu: 2.608 ± 0.335
1.304AspMet: 1.304 ± 0.776
3.911AspAsn: 3.911 ± 0.442
2.608AspPro: 2.608 ± 0.335
1.304AspGln: 1.304 ± 0.776
2.608AspArg: 2.608 ± 2.221
9.126AspSer: 9.126 ± 0.228
1.304AspThr: 1.304 ± 1.111
0.0AspVal: 0.0 ± 0.0
3.911AspTrp: 3.911 ± 1.445
1.304AspTyr: 1.304 ± 0.776
0.0AspXaa: 0.0 ± 0.0
Glu
2.608GluAla: 2.608 ± 1.552
0.0GluCys: 0.0 ± 0.0
1.304GluAsp: 1.304 ± 1.111
3.911GluGlu: 3.911 ± 1.445
2.608GluPhe: 2.608 ± 1.552
2.608GluGly: 2.608 ± 0.335
1.304GluHis: 1.304 ± 0.776
1.304GluIle: 1.304 ± 0.776
0.0GluLys: 0.0 ± 0.0
6.519GluLeu: 6.519 ± 0.107
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
1.304GluPro: 1.304 ± 1.111
1.304GluGln: 1.304 ± 0.776
0.0GluArg: 0.0 ± 0.0
7.823GluSer: 7.823 ± 2.77
1.304GluThr: 1.304 ± 0.776
1.304GluVal: 1.304 ± 1.111
1.304GluTrp: 1.304 ± 1.111
2.608GluTyr: 2.608 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
2.608PheAla: 2.608 ± 0.335
1.304PheCys: 1.304 ± 0.776
2.608PheAsp: 2.608 ± 1.552
2.608PheGlu: 2.608 ± 1.552
3.911PhePhe: 3.911 ± 2.328
2.608PheGly: 2.608 ± 0.335
2.608PheHis: 2.608 ± 0.335
3.911PheIle: 3.911 ± 1.445
2.608PheLys: 2.608 ± 1.552
3.911PheLeu: 3.911 ± 0.442
1.304PheMet: 1.304 ± 0.776
0.0PheAsn: 0.0 ± 0.0
2.608PhePro: 2.608 ± 0.335
0.0PheGln: 0.0 ± 0.0
2.608PheArg: 2.608 ± 1.552
2.608PheSer: 2.608 ± 2.221
5.215PheThr: 5.215 ± 3.104
3.911PheVal: 3.911 ± 2.328
0.0PheTrp: 0.0 ± 0.0
2.608PheTyr: 2.608 ± 1.552
0.0PheXaa: 0.0 ± 0.0
Gly
1.304GlyAla: 1.304 ± 0.776
0.0GlyCys: 0.0 ± 0.0
5.215GlyAsp: 5.215 ± 2.556
0.0GlyGlu: 0.0 ± 0.0
2.608GlyPhe: 2.608 ± 0.335
0.0GlyGly: 0.0 ± 0.0
3.911GlyHis: 3.911 ± 2.328
2.608GlyIle: 2.608 ± 0.335
2.608GlyLys: 2.608 ± 1.552
2.608GlyLeu: 2.608 ± 0.335
3.911GlyMet: 3.911 ± 1.01
2.608GlyAsn: 2.608 ± 0.335
1.304GlyPro: 1.304 ± 1.111
2.608GlyGln: 2.608 ± 2.221
1.304GlyArg: 1.304 ± 1.111
6.519GlySer: 6.519 ± 3.88
7.823GlyThr: 7.823 ± 2.89
5.215GlyVal: 5.215 ± 4.443
1.304GlyTrp: 1.304 ± 1.111
2.608GlyTyr: 2.608 ± 1.552
0.0GlyXaa: 0.0 ± 0.0
His
2.608HisAla: 2.608 ± 0.335
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
2.608HisGly: 2.608 ± 0.335
2.608HisHis: 2.608 ± 1.552
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
2.608HisLeu: 2.608 ± 1.552
0.0HisMet: 0.0 ± 0.0
3.911HisAsn: 3.911 ± 2.328
2.608HisPro: 2.608 ± 0.335
0.0HisGln: 0.0 ± 0.0
1.304HisArg: 1.304 ± 0.776
1.304HisSer: 1.304 ± 1.111
7.823HisThr: 7.823 ± 0.883
1.304HisVal: 1.304 ± 0.776
0.0HisTrp: 0.0 ± 0.0
1.304HisTyr: 1.304 ± 0.776
0.0HisXaa: 0.0 ± 0.0
Ile
5.215IleAla: 5.215 ± 2.556
1.304IleCys: 1.304 ± 0.776
9.126IleAsp: 9.126 ± 0.228
2.608IleGlu: 2.608 ± 1.552
2.608IlePhe: 2.608 ± 1.552
6.519IleGly: 6.519 ± 3.666
2.608IleHis: 2.608 ± 0.335
1.304IleIle: 1.304 ± 0.776
3.911IleLys: 3.911 ± 2.328
6.519IleLeu: 6.519 ± 1.78
1.304IleMet: 1.304 ± 0.776
3.911IleAsn: 3.911 ± 1.445
1.304IlePro: 1.304 ± 0.776
1.304IleGln: 1.304 ± 1.111
1.304IleArg: 1.304 ± 0.776
2.608IleSer: 2.608 ± 1.552
0.0IleThr: 0.0 ± 0.0
9.126IleVal: 9.126 ± 1.659
0.0IleTrp: 0.0 ± 0.0
2.608IleTyr: 2.608 ± 0.335
0.0IleXaa: 0.0 ± 0.0
Lys
1.304LysAla: 1.304 ± 0.776
0.0LysCys: 0.0 ± 0.0
1.304LysAsp: 1.304 ± 0.776
2.608LysGlu: 2.608 ± 1.552
0.0LysPhe: 0.0 ± 0.0
0.0LysGly: 0.0 ± 0.0
2.608LysHis: 2.608 ± 1.552
1.304LysIle: 1.304 ± 0.776
0.0LysLys: 0.0 ± 0.0
5.215LysLeu: 5.215 ± 0.669
1.304LysMet: 1.304 ± 0.776
0.0LysAsn: 0.0 ± 0.0
1.304LysPro: 1.304 ± 1.111
2.608LysGln: 2.608 ± 2.221
3.911LysArg: 3.911 ± 0.442
2.608LysSer: 2.608 ± 0.335
2.608LysThr: 2.608 ± 1.552
1.304LysVal: 1.304 ± 1.111
0.0LysTrp: 0.0 ± 0.0
1.304LysTyr: 1.304 ± 0.776
0.0LysXaa: 0.0 ± 0.0
Leu
10.43LeuAla: 10.43 ± 0.549
1.304LeuCys: 1.304 ± 1.111
3.911LeuAsp: 3.911 ± 1.445
7.823LeuGlu: 7.823 ± 2.77
2.608LeuPhe: 2.608 ± 1.552
3.911LeuGly: 3.911 ± 0.442
1.304LeuHis: 1.304 ± 0.776
9.126LeuIle: 9.126 ± 1.659
2.608LeuLys: 2.608 ± 0.335
7.823LeuLeu: 7.823 ± 1.004
2.608LeuMet: 2.608 ± 1.552
5.215LeuAsn: 5.215 ± 0.669
5.215LeuPro: 5.215 ± 1.218
2.608LeuGln: 2.608 ± 0.335
6.519LeuArg: 6.519 ± 1.994
2.608LeuSer: 2.608 ± 1.552
5.215LeuThr: 5.215 ± 3.104
7.823LeuVal: 7.823 ± 4.777
0.0LeuTrp: 0.0 ± 0.0
9.126LeuTyr: 9.126 ± 0.228
0.0LeuXaa: 0.0 ± 0.0
Met
1.304MetAla: 1.304 ± 1.111
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.304MetPhe: 1.304 ± 0.776
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
2.608MetLeu: 2.608 ± 1.552
0.0MetMet: 0.0 ± 0.0
1.304MetAsn: 1.304 ± 0.776
2.608MetPro: 2.608 ± 0.335
1.304MetGln: 1.304 ± 0.776
1.304MetArg: 1.304 ± 1.111
3.911MetSer: 3.911 ± 0.442
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
2.608MetTyr: 2.608 ± 1.552
0.0MetXaa: 0.0 ± 0.0
Asn
2.608AsnAla: 2.608 ± 1.552
0.0AsnCys: 0.0 ± 0.0
0.0AsnAsp: 0.0 ± 0.0
0.0AsnGlu: 0.0 ± 0.0
5.215AsnPhe: 5.215 ± 1.218
1.304AsnGly: 1.304 ± 1.111
1.304AsnHis: 1.304 ± 0.776
1.304AsnIle: 1.304 ± 1.111
2.608AsnLys: 2.608 ± 0.335
9.126AsnLeu: 9.126 ± 0.228
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
2.608AsnPro: 2.608 ± 0.335
1.304AsnGln: 1.304 ± 0.776
2.608AsnArg: 2.608 ± 0.335
3.911AsnSer: 3.911 ± 0.442
1.304AsnThr: 1.304 ± 1.111
2.608AsnVal: 2.608 ± 0.335
0.0AsnTrp: 0.0 ± 0.0
2.608AsnTyr: 2.608 ± 0.335
0.0AsnXaa: 0.0 ± 0.0
Pro
5.215ProAla: 5.215 ± 4.443
2.608ProCys: 2.608 ± 0.335
3.911ProAsp: 3.911 ± 1.445
3.911ProGlu: 3.911 ± 1.445
1.304ProPhe: 1.304 ± 0.776
2.608ProGly: 2.608 ± 2.221
0.0ProHis: 0.0 ± 0.0
3.911ProIle: 3.911 ± 2.328
2.608ProLys: 2.608 ± 0.335
2.608ProLeu: 2.608 ± 0.335
0.0ProMet: 0.0 ± 0.0
2.608ProAsn: 2.608 ± 1.552
3.911ProPro: 3.911 ± 1.445
3.911ProGln: 3.911 ± 1.445
3.911ProArg: 3.911 ± 1.445
5.215ProSer: 5.215 ± 1.218
2.608ProThr: 2.608 ± 0.335
3.911ProVal: 3.911 ± 1.445
0.0ProTrp: 0.0 ± 0.0
3.911ProTyr: 3.911 ± 2.328
0.0ProXaa: 0.0 ± 0.0
Gln
2.608GlnAla: 2.608 ± 1.552
0.0GlnCys: 0.0 ± 0.0
1.304GlnAsp: 1.304 ± 1.111
1.304GlnGlu: 1.304 ± 0.776
1.304GlnPhe: 1.304 ± 1.111
1.304GlnGly: 1.304 ± 1.111
1.304GlnHis: 1.304 ± 0.776
3.911GlnIle: 3.911 ± 1.445
0.0GlnLys: 0.0 ± 0.0
9.126GlnLeu: 9.126 ± 3.546
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
3.911GlnPro: 3.911 ± 0.442
1.304GlnGln: 1.304 ± 1.111
1.304GlnArg: 1.304 ± 1.111
3.911GlnSer: 3.911 ± 3.332
1.304GlnThr: 1.304 ± 0.776
1.304GlnVal: 1.304 ± 1.111
0.0GlnTrp: 0.0 ± 0.0
2.608GlnTyr: 2.608 ± 0.335
0.0GlnXaa: 0.0 ± 0.0
Arg
6.519ArgAla: 6.519 ± 0.107
0.0ArgCys: 0.0 ± 0.0
3.911ArgAsp: 3.911 ± 2.328
0.0ArgGlu: 0.0 ± 0.0
1.304ArgPhe: 1.304 ± 0.776
6.519ArgGly: 6.519 ± 0.107
1.304ArgHis: 1.304 ± 0.776
3.911ArgIle: 3.911 ± 3.332
1.304ArgLys: 1.304 ± 0.776
6.519ArgLeu: 6.519 ± 3.88
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
6.519ArgPro: 6.519 ± 1.78
3.911ArgGln: 3.911 ± 1.445
5.215ArgArg: 5.215 ± 2.556
2.608ArgSer: 2.608 ± 0.335
2.608ArgThr: 2.608 ± 0.335
2.608ArgVal: 2.608 ± 0.335
1.304ArgTrp: 1.304 ± 0.776
1.304ArgTyr: 1.304 ± 0.776
0.0ArgXaa: 0.0 ± 0.0
Ser
6.519SerAla: 6.519 ± 0.107
0.0SerCys: 0.0 ± 0.0
2.608SerAsp: 2.608 ± 2.221
3.911SerGlu: 3.911 ± 1.445
6.519SerPhe: 6.519 ± 1.994
5.215SerGly: 5.215 ± 0.669
1.304SerHis: 1.304 ± 0.776
5.215SerIle: 5.215 ± 1.218
5.215SerLys: 5.215 ± 1.218
5.215SerLeu: 5.215 ± 1.218
1.304SerMet: 1.304 ± 1.259
5.215SerAsn: 5.215 ± 2.556
1.304SerPro: 1.304 ± 0.776
2.608SerGln: 2.608 ± 1.552
7.823SerArg: 7.823 ± 0.883
7.823SerSer: 7.823 ± 4.657
2.608SerThr: 2.608 ± 0.335
7.823SerVal: 7.823 ± 2.89
0.0SerTrp: 0.0 ± 0.0
5.215SerTyr: 5.215 ± 1.218
0.0SerXaa: 0.0 ± 0.0
Thr
5.215ThrAla: 5.215 ± 0.669
0.0ThrCys: 0.0 ± 0.0
1.304ThrAsp: 1.304 ± 1.111
5.215ThrGlu: 5.215 ± 0.669
2.608ThrPhe: 2.608 ± 0.335
3.911ThrGly: 3.911 ± 1.445
2.608ThrHis: 2.608 ± 2.221
5.215ThrIle: 5.215 ± 1.218
0.0ThrLys: 0.0 ± 0.0
5.215ThrLeu: 5.215 ± 1.218
0.0ThrMet: 0.0 ± 0.0
1.304ThrAsn: 1.304 ± 1.111
9.126ThrPro: 9.126 ± 0.228
3.911ThrGln: 3.911 ± 0.442
2.608ThrArg: 2.608 ± 1.552
1.304ThrSer: 1.304 ± 0.776
0.0ThrThr: 0.0 ± 0.0
5.215ThrVal: 5.215 ± 1.218
1.304ThrTrp: 1.304 ± 1.111
3.911ThrTyr: 3.911 ± 1.445
0.0ThrXaa: 0.0 ± 0.0
Val
1.304ValAla: 1.304 ± 0.776
2.608ValCys: 2.608 ± 2.221
2.608ValAsp: 2.608 ± 0.335
2.608ValGlu: 2.608 ± 0.335
2.608ValPhe: 2.608 ± 0.335
3.911ValGly: 3.911 ± 0.442
0.0ValHis: 0.0 ± 0.0
5.215ValIle: 5.215 ± 4.443
1.304ValLys: 1.304 ± 1.111
6.519ValLeu: 6.519 ± 1.78
3.911ValMet: 3.911 ± 1.445
5.215ValAsn: 5.215 ± 3.104
3.911ValPro: 3.911 ± 1.445
3.911ValGln: 3.911 ± 0.442
6.519ValArg: 6.519 ± 0.107
5.215ValSer: 5.215 ± 1.218
3.911ValThr: 3.911 ± 1.445
5.215ValVal: 5.215 ± 1.218
2.608ValTrp: 2.608 ± 0.335
1.304ValTyr: 1.304 ± 1.111
0.0ValXaa: 0.0 ± 0.0
Trp
1.304TrpAla: 1.304 ± 1.111
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.304TrpPhe: 1.304 ± 0.776
2.608TrpGly: 2.608 ± 1.552
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.608TrpLeu: 2.608 ± 2.221
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
2.608TrpThr: 2.608 ± 2.221
1.304TrpVal: 1.304 ± 1.111
1.304TrpTrp: 1.304 ± 1.111
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.519TyrAla: 6.519 ± 3.666
2.608TyrCys: 2.608 ± 1.552
3.911TyrAsp: 3.911 ± 2.328
0.0TyrGlu: 0.0 ± 0.0
1.304TyrPhe: 1.304 ± 0.776
3.911TyrGly: 3.911 ± 2.328
0.0TyrHis: 0.0 ± 0.0
3.911TyrIle: 3.911 ± 0.442
3.911TyrLys: 3.911 ± 0.442
2.608TyrLeu: 2.608 ± 1.552
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
2.608TyrPro: 2.608 ± 0.335
0.0TyrGln: 0.0 ± 0.0
0.0TyrArg: 0.0 ± 0.0
6.519TyrSer: 6.519 ± 1.994
5.215TyrThr: 5.215 ± 0.669
6.519TyrVal: 6.519 ± 1.994
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (768 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
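One way such a protein-level bootstrap could work is sketched below. This is an assumption-laden illustration, not the database's actual code: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the spread is reported as a population standard deviation.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Per mille frequency of one dipeptide (one-letter codes assumed)."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the two proteins of this proteome.
    toy = ["AAAAC", "ACCCC"]
    print(bootstrap_error(toy, "AA"))
```

With only a few proteins, each replicate can draw only a handful of distinct combinations, so the error estimate is coarse. A dipeptide that is absent from every protein gets a frequency and an error of exactly zero in this sketch.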

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski