Amino acid dipeptide frequency for Wenzhou tombus-like virus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
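The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_permille`, the mapping of any non-standard letter to `X`, and the choice to count only dipeptides within a protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is
# treated as Xaa ("X") here, which is an assumption of this sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences, alphabet=STANDARD):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs within one protein only.
        for a, b in zip(seq, seq[1:]):
            a = a if a in alphabet else "X"
            b = b if b in alphabet else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Frequency each dipeptide would have if all 400 were equally likely:
# 1/400 = 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / len(STANDARD) ** 2

freqs = dipeptide_permille(["AAC", "AC"])  # toy input, not real data
```

In the toy input the dipeptides are AA, AC and AC, so AC accounts for two thirds of all pairs.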

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
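As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 1/400. The helper names and the use of exactly 1/400 as the threshold are assumptions for illustration; the page does not state which threshold drives the red and blue marking.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, n_dipeptides=400):
    """Classify a dipeptide against the uniform expectation (assumed threshold)."""
    expected = 1.0 / n_dipeptides  # 0.0025, i.e. 0.25%
    fraction = permille_to_fraction(value_permille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"


# Values taken from the table on this page.
leu_leu = representation(14.006)  # LeuLeu
ala_cys = representation(1.401)   # AlaCys
```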

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.202AlaAla: 4.202 ± 2.237
1.401AlaCys: 1.401 ± 0.746
2.801AlaAsp: 2.801 ± 0.686
4.202AlaGlu: 4.202 ± 2.237
2.801AlaPhe: 2.801 ± 1.491
1.401AlaGly: 1.401 ± 0.746
1.401AlaHis: 1.401 ± 0.746
5.602AlaIle: 5.602 ± 2.982
1.401AlaLys: 1.401 ± 0.746
7.003AlaLeu: 7.003 ± 0.627
4.202AlaMet: 4.202 ± 1.49
2.801AlaAsn: 2.801 ± 0.686
1.401AlaPro: 1.401 ± 0.746
1.401AlaGln: 1.401 ± 0.746
9.804AlaArg: 9.804 ± 1.313
5.602AlaSer: 5.602 ± 2.982
2.801AlaThr: 2.801 ± 2.863
5.602AlaVal: 5.602 ± 0.805
1.401AlaTrp: 1.401 ± 1.432
1.401AlaTyr: 1.401 ± 1.432
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.401CysAsp: 1.401 ± 0.746
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.401CysGly: 1.401 ± 0.746
1.401CysHis: 1.401 ± 0.746
4.202CysIle: 4.202 ± 2.237
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.401CysAsn: 1.401 ± 0.746
1.401CysPro: 1.401 ± 1.432
2.801CysGln: 2.801 ± 0.686
0.0CysArg: 0.0 ± 0.0
1.401CysSer: 1.401 ± 0.746
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.401CysTyr: 1.401 ± 0.746
0.0CysXaa: 0.0 ± 0.0
Asp
8.403AspAla: 8.403 ± 0.119
1.401AspCys: 1.401 ± 0.746
4.202AspAsp: 4.202 ± 0.059
5.602AspGlu: 5.602 ± 3.549
1.401AspPhe: 1.401 ± 0.746
4.202AspGly: 4.202 ± 2.237
0.0AspHis: 0.0 ± 0.0
1.401AspIle: 1.401 ± 0.746
5.602AspLys: 5.602 ± 0.805
7.003AspLeu: 7.003 ± 0.627
0.0AspMet: 0.0 ± 0.0
4.202AspAsn: 4.202 ± 4.295
2.801AspPro: 2.801 ± 0.686
1.401AspGln: 1.401 ± 1.432
4.202AspArg: 4.202 ± 2.237
1.401AspSer: 1.401 ± 1.432
0.0AspThr: 0.0 ± 0.0
7.003AspVal: 7.003 ± 1.551
0.0AspTrp: 0.0 ± 0.0
2.801AspTyr: 2.801 ± 0.686
0.0AspXaa: 0.0 ± 0.0
Glu
1.401GluAla: 1.401 ± 1.432
0.0GluCys: 0.0 ± 0.0
4.202GluAsp: 4.202 ± 2.237
1.401GluGlu: 1.401 ± 0.746
8.403GluPhe: 8.403 ± 2.296
0.0GluGly: 0.0 ± 0.0
1.401GluHis: 1.401 ± 0.746
0.0GluIle: 0.0 ± 0.0
7.003GluLys: 7.003 ± 3.728
2.801GluLeu: 2.801 ± 0.686
2.801GluMet: 2.801 ± 2.863
0.0GluAsn: 0.0 ± 0.0
1.401GluPro: 1.401 ± 1.432
5.602GluGln: 5.602 ± 2.982
2.801GluArg: 2.801 ± 0.686
4.202GluSer: 4.202 ± 0.059
5.602GluThr: 5.602 ± 1.372
5.602GluVal: 5.602 ± 1.372
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.801PheAla: 2.801 ± 1.491
4.202PheCys: 4.202 ± 2.237
2.801PheAsp: 2.801 ± 1.491
4.202PheGlu: 4.202 ± 2.237
2.801PhePhe: 2.801 ± 1.491
1.401PheGly: 1.401 ± 0.746
0.0PheHis: 0.0 ± 0.0
1.401PheIle: 1.401 ± 1.432
0.0PheLys: 0.0 ± 0.0
7.003PheLeu: 7.003 ± 0.627
0.0PheMet: 0.0 ± 0.0
1.401PheAsn: 1.401 ± 0.746
1.401PhePro: 1.401 ± 1.432
1.401PheGln: 1.401 ± 0.746
7.003PheArg: 7.003 ± 0.627
4.202PheSer: 4.202 ± 2.118
4.202PheThr: 4.202 ± 2.118
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.801PheTyr: 2.801 ± 0.686
0.0PheXaa: 0.0 ± 0.0
Gly
2.801GlyAla: 2.801 ± 0.686
0.0GlyCys: 0.0 ± 0.0
5.602GlyAsp: 5.602 ± 2.982
1.401GlyGlu: 1.401 ± 0.746
1.401GlyPhe: 1.401 ± 0.746
1.401GlyGly: 1.401 ± 0.746
1.401GlyHis: 1.401 ± 0.746
7.003GlyIle: 7.003 ± 1.551
1.401GlyLys: 1.401 ± 0.746
5.602GlyLeu: 5.602 ± 2.982
4.202GlyMet: 4.202 ± 2.118
2.801GlyAsn: 2.801 ± 0.686
1.401GlyPro: 1.401 ± 0.746
0.0GlyGln: 0.0 ± 0.0
1.401GlyArg: 1.401 ± 0.746
1.401GlySer: 1.401 ± 0.746
1.401GlyThr: 1.401 ± 1.432
1.401GlyVal: 1.401 ± 1.432
4.202GlyTrp: 4.202 ± 0.059
5.602GlyTyr: 5.602 ± 1.372
0.0GlyXaa: 0.0 ± 0.0
His
1.401HisAla: 1.401 ± 0.746
0.0HisCys: 0.0 ± 0.0
1.401HisAsp: 1.401 ± 1.432
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.401HisGly: 1.401 ± 1.432
0.0HisHis: 0.0 ± 0.0
1.401HisIle: 1.401 ± 0.746
1.401HisLys: 1.401 ± 0.746
4.202HisLeu: 4.202 ± 2.237
2.801HisMet: 2.801 ± 0.686
1.401HisAsn: 1.401 ± 0.746
5.602HisPro: 5.602 ± 2.982
1.401HisGln: 1.401 ± 0.746
0.0HisArg: 0.0 ± 0.0
4.202HisSer: 4.202 ± 0.059
1.401HisThr: 1.401 ± 1.432
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.801IleAla: 2.801 ± 1.491
0.0IleCys: 0.0 ± 0.0
9.804IleAsp: 9.804 ± 1.313
4.202IleGlu: 4.202 ± 2.118
1.401IlePhe: 1.401 ± 1.432
1.401IleGly: 1.401 ± 0.746
1.401IleHis: 1.401 ± 0.746
2.801IleIle: 2.801 ± 1.491
5.602IleLys: 5.602 ± 3.549
2.801IleLeu: 2.801 ± 1.491
1.401IleMet: 1.401 ± 0.746
4.202IleAsn: 4.202 ± 2.237
5.602IlePro: 5.602 ± 2.982
1.401IleGln: 1.401 ± 0.746
2.801IleArg: 2.801 ± 1.491
8.403IleSer: 8.403 ± 2.058
2.801IleThr: 2.801 ± 0.686
1.401IleVal: 1.401 ± 1.432
0.0IleTrp: 0.0 ± 0.0
1.401IleTyr: 1.401 ± 0.746
0.0IleXaa: 0.0 ± 0.0
Lys
7.003LysAla: 7.003 ± 1.551
0.0LysCys: 0.0 ± 0.0
2.801LysAsp: 2.801 ± 0.686
5.602LysGlu: 5.602 ± 1.372
2.801LysPhe: 2.801 ± 1.491
4.202LysGly: 4.202 ± 2.237
5.602LysHis: 5.602 ± 0.805
1.401LysIle: 1.401 ± 0.746
0.0LysLys: 0.0 ± 0.0
4.202LysLeu: 4.202 ± 0.059
1.401LysMet: 1.401 ± 1.432
1.401LysAsn: 1.401 ± 0.746
1.401LysPro: 1.401 ± 0.746
0.0LysGln: 0.0 ± 0.0
7.003LysArg: 7.003 ± 3.728
1.401LysSer: 1.401 ± 1.432
1.401LysThr: 1.401 ± 1.432
2.801LysVal: 2.801 ± 1.491
1.401LysTrp: 1.401 ± 0.746
4.202LysTyr: 4.202 ± 0.059
0.0LysXaa: 0.0 ± 0.0
Leu
8.403LeuAla: 8.403 ± 0.119
4.202LeuCys: 4.202 ± 2.118
7.003LeuAsp: 7.003 ± 1.551
7.003LeuGlu: 7.003 ± 3.728
2.801LeuPhe: 2.801 ± 0.686
4.202LeuGly: 4.202 ± 2.118
1.401LeuHis: 1.401 ± 0.746
4.202LeuIle: 4.202 ± 4.295
8.403LeuLys: 8.403 ± 0.119
14.006LeuLeu: 14.006 ± 5.608
4.202LeuMet: 4.202 ± 2.237
2.801LeuAsn: 2.801 ± 2.863
0.0LeuPro: 0.0 ± 0.0
1.401LeuGln: 1.401 ± 0.746
1.401LeuArg: 1.401 ± 1.432
4.202LeuSer: 4.202 ± 2.118
4.202LeuThr: 4.202 ± 2.237
7.003LeuVal: 7.003 ± 3.728
2.801LeuTrp: 2.801 ± 0.686
5.602LeuTyr: 5.602 ± 3.549
0.0LeuXaa: 0.0 ± 0.0
Met
1.401MetAla: 1.401 ± 0.746
0.0MetCys: 0.0 ± 0.0
1.401MetAsp: 1.401 ± 0.746
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.401MetGly: 1.401 ± 1.432
1.401MetHis: 1.401 ± 1.432
4.202MetIle: 4.202 ± 0.059
2.801MetLys: 2.801 ± 1.491
1.401MetLeu: 1.401 ± 1.432
0.0MetMet: 0.0 ± 0.628
1.401MetAsn: 1.401 ± 1.432
1.401MetPro: 1.401 ± 1.432
2.801MetGln: 2.801 ± 0.686
1.401MetArg: 1.401 ± 1.432
5.602MetSer: 5.602 ± 0.805
1.401MetThr: 1.401 ± 0.746
2.801MetVal: 2.801 ± 1.491
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.401AsnAla: 1.401 ± 1.432
2.801AsnCys: 2.801 ± 1.491
1.401AsnAsp: 1.401 ± 1.432
2.801AsnGlu: 2.801 ± 0.686
2.801AsnPhe: 2.801 ± 0.686
4.202AsnGly: 4.202 ± 0.059
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
1.401AsnLys: 1.401 ± 0.746
1.401AsnLeu: 1.401 ± 1.432
0.0AsnMet: 0.0 ± 0.0
1.401AsnAsn: 1.401 ± 0.746
4.202AsnPro: 4.202 ± 2.237
1.401AsnGln: 1.401 ± 0.746
4.202AsnArg: 4.202 ± 4.295
0.0AsnSer: 0.0 ± 0.0
2.801AsnThr: 2.801 ± 1.491
4.202AsnVal: 4.202 ± 2.118
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.401ProAla: 1.401 ± 0.746
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
1.401ProGlu: 1.401 ± 1.432
1.401ProPhe: 1.401 ± 1.432
4.202ProGly: 4.202 ± 2.118
1.401ProHis: 1.401 ± 0.746
4.202ProIle: 4.202 ± 2.118
1.401ProLys: 1.401 ± 0.746
4.202ProLeu: 4.202 ± 0.059
1.401ProMet: 1.401 ± 0.746
4.202ProAsn: 4.202 ± 0.059
1.401ProPro: 1.401 ± 1.432
1.401ProGln: 1.401 ± 0.746
9.804ProArg: 9.804 ± 0.864
5.602ProSer: 5.602 ± 0.805
0.0ProThr: 0.0 ± 0.0
8.403ProVal: 8.403 ± 2.296
1.401ProTrp: 1.401 ± 1.432
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.401GlnAla: 1.401 ± 0.746
0.0GlnCys: 0.0 ± 0.0
2.801GlnAsp: 2.801 ± 1.491
1.401GlnGlu: 1.401 ± 0.746
5.602GlnPhe: 5.602 ± 1.372
1.401GlnGly: 1.401 ± 0.746
2.801GlnHis: 2.801 ± 1.491
2.801GlnIle: 2.801 ± 0.686
2.801GlnLys: 2.801 ± 0.686
1.401GlnLeu: 1.401 ± 0.746
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.801GlnPro: 2.801 ± 1.491
2.801GlnGln: 2.801 ± 0.686
1.401GlnArg: 1.401 ± 0.746
2.801GlnSer: 2.801 ± 1.491
1.401GlnThr: 1.401 ± 1.432
0.0GlnVal: 0.0 ± 0.0
1.401GlnTrp: 1.401 ± 1.432
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
9.804ArgAla: 9.804 ± 3.49
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
4.202ArgGlu: 4.202 ± 0.059
4.202ArgPhe: 4.202 ± 2.237
8.403ArgGly: 8.403 ± 0.119
1.401ArgHis: 1.401 ± 1.432
2.801ArgIle: 2.801 ± 1.491
5.602ArgLys: 5.602 ± 0.805
5.602ArgLeu: 5.602 ± 1.372
2.801ArgMet: 2.801 ± 1.491
2.801ArgAsn: 2.801 ± 0.686
5.602ArgPro: 5.602 ± 1.372
2.801ArgGln: 2.801 ± 1.491
7.003ArgArg: 7.003 ± 2.804
4.202ArgSer: 4.202 ± 0.059
2.801ArgThr: 2.801 ± 0.686
1.401ArgVal: 1.401 ± 0.746
1.401ArgTrp: 1.401 ± 0.746
7.003ArgTyr: 7.003 ± 1.551
0.0ArgXaa: 0.0 ± 0.0
Ser
4.202SerAla: 4.202 ± 2.237
1.401SerCys: 1.401 ± 0.746
5.602SerAsp: 5.602 ± 1.372
5.602SerGlu: 5.602 ± 0.805
1.401SerPhe: 1.401 ± 1.432
7.003SerGly: 7.003 ± 1.551
2.801SerHis: 2.801 ± 1.491
8.403SerIle: 8.403 ± 0.119
4.202SerLys: 4.202 ± 2.237
5.602SerLeu: 5.602 ± 1.372
0.0SerMet: 0.0 ± 0.0
1.401SerAsn: 1.401 ± 1.432
5.602SerPro: 5.602 ± 1.372
0.0SerGln: 0.0 ± 0.0
5.602SerArg: 5.602 ± 1.372
2.801SerSer: 2.801 ± 0.686
1.401SerThr: 1.401 ± 1.432
2.801SerVal: 2.801 ± 0.686
0.0SerTrp: 0.0 ± 0.0
2.801SerTyr: 2.801 ± 0.686
0.0SerXaa: 0.0 ± 0.0
Thr
1.401ThrAla: 1.401 ± 1.432
0.0ThrCys: 0.0 ± 0.0
4.202ThrAsp: 4.202 ± 4.295
0.0ThrGlu: 0.0 ± 0.0
2.801ThrPhe: 2.801 ± 0.686
1.401ThrGly: 1.401 ± 0.746
2.801ThrHis: 2.801 ± 0.686
1.401ThrIle: 1.401 ± 1.432
1.401ThrLys: 1.401 ± 1.432
4.202ThrLeu: 4.202 ± 2.118
1.401ThrMet: 1.401 ± 1.432
0.0ThrAsn: 0.0 ± 0.0
4.202ThrPro: 4.202 ± 0.059
1.401ThrGln: 1.401 ± 1.432
5.602ThrArg: 5.602 ± 2.982
1.401ThrSer: 1.401 ± 0.746
0.0ThrThr: 0.0 ± 0.0
0.0ThrVal: 0.0 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
4.202ThrTyr: 4.202 ± 0.059
0.0ThrXaa: 0.0 ± 0.0
Val
4.202ValAla: 4.202 ± 2.237
0.0ValCys: 0.0 ± 0.0
4.202ValAsp: 4.202 ± 2.118
2.801ValGlu: 2.801 ± 1.491
4.202ValPhe: 4.202 ± 0.059
1.401ValGly: 1.401 ± 0.746
0.0ValHis: 0.0 ± 0.0
4.202ValIle: 4.202 ± 0.059
1.401ValLys: 1.401 ± 0.746
4.202ValLeu: 4.202 ± 2.118
4.202ValMet: 4.202 ± 2.237
0.0ValAsn: 0.0 ± 0.0
2.801ValPro: 2.801 ± 0.686
2.801ValGln: 2.801 ± 0.686
5.602ValArg: 5.602 ± 0.805
7.003ValSer: 7.003 ± 1.551
0.0ValThr: 0.0 ± 0.0
2.801ValVal: 2.801 ± 2.863
1.401ValTrp: 1.401 ± 0.746
1.401ValTyr: 1.401 ± 0.746
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.401TrpPhe: 1.401 ± 1.432
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
4.202TrpIle: 4.202 ± 0.059
0.0TrpLys: 0.0 ± 0.0
4.202TrpLeu: 4.202 ± 2.118
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.401TrpGln: 1.401 ± 1.432
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.401TrpThr: 1.401 ± 0.746
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.801TrpTyr: 2.801 ± 1.491
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.202TyrAla: 4.202 ± 0.059
1.401TyrCys: 1.401 ± 0.746
1.401TyrAsp: 1.401 ± 1.432
2.801TyrGlu: 2.801 ± 0.686
1.401TyrPhe: 1.401 ± 0.746
1.401TyrGly: 1.401 ± 0.746
1.401TyrHis: 1.401 ± 1.432
0.0TyrIle: 0.0 ± 0.0
4.202TyrLys: 4.202 ± 0.059
8.403TyrLeu: 8.403 ± 2.296
0.0TyrMet: 0.0 ± 0.0
2.801TyrAsn: 2.801 ± 1.491
2.801TyrPro: 2.801 ± 2.863
1.401TyrGln: 1.401 ± 0.746
2.801TyrArg: 2.801 ± 0.686
2.801TyrSer: 2.801 ± 2.863
2.801TyrThr: 2.801 ± 0.686
1.401TyrVal: 1.401 ± 0.746
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (715 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
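A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is reported. The function names, the fixed seed, and the use of the population standard deviation as the error measure are assumptions of this sketch; the page does not specify which spread statistic is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate the per-dipeptide error by resampling whole proteins."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(dipeptide_permille(resampled))
    pairs = set().union(*samples)
    # A dipeptide absent from a resample contributes a frequency of 0.
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }


errors = bootstrap_error(["ACAC", "ACAC"])  # toy input: identical proteins
```

With only two proteins, as in this proteome, each resample is one of a handful of combinations, so the estimated errors take few distinct values.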

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski