Amino acid dipepetide frequency for Impatiens necrotic spot virus (INSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.416AlaAla: 1.416 ± 0.926
1.416AlaCys: 1.416 ± 0.926
3.541AlaAsp: 3.541 ± 3.126
2.125AlaGlu: 2.125 ± 0.579
0.708AlaPhe: 0.708 ± 1.273
0.708AlaGly: 0.708 ± 0.347
1.416AlaHis: 1.416 ± 0.694
2.833AlaIle: 2.833 ± 0.232
2.125AlaLys: 2.125 ± 0.579
4.958AlaLeu: 4.958 ± 0.81
0.708AlaMet: 0.708 ± 1.273
2.833AlaAsn: 2.833 ± 1.853
0.708AlaPro: 0.708 ± 0.347
2.125AlaGln: 2.125 ± 1.042
2.833AlaArg: 2.833 ± 1.853
4.958AlaSer: 4.958 ± 0.81
2.833AlaThr: 2.833 ± 0.232
3.541AlaVal: 3.541 ± 0.115
0.0AlaTrp: 0.0 ± 0.0
1.416AlaTyr: 1.416 ± 0.694
0.0AlaXaa: 0.0 ± 0.0
Cys
1.416CysAla: 1.416 ± 0.694
0.0CysCys: 0.0 ± 0.0
2.125CysAsp: 2.125 ± 2.2
2.125CysGlu: 2.125 ± 1.042
2.833CysPhe: 2.833 ± 0.232
2.833CysGly: 2.833 ± 1.389
0.0CysHis: 0.0 ± 0.0
4.958CysIle: 4.958 ± 2.43
3.541CysLys: 3.541 ± 0.115
2.833CysLeu: 2.833 ± 1.389
0.708CysMet: 0.708 ± 1.273
0.708CysAsn: 0.708 ± 0.347
2.125CysPro: 2.125 ± 0.579
0.0CysGln: 0.0 ± 0.0
2.833CysArg: 2.833 ± 1.389
4.958CysSer: 4.958 ± 2.43
1.416CysThr: 1.416 ± 0.694
0.708CysVal: 0.708 ± 1.273
0.0CysTrp: 0.0 ± 0.0
0.708CysTyr: 0.708 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
1.416AspAla: 1.416 ± 0.694
3.541AspCys: 3.541 ± 1.736
0.0AspAsp: 0.0 ± 0.0
0.708AspGlu: 0.708 ± 0.347
3.541AspPhe: 3.541 ± 1.736
2.833AspGly: 2.833 ± 0.232
2.125AspHis: 2.125 ± 0.579
3.541AspIle: 3.541 ± 0.115
4.958AspLys: 4.958 ± 0.811
4.958AspLeu: 4.958 ± 2.432
1.416AspMet: 1.416 ± 0.694
2.833AspAsn: 2.833 ± 1.853
2.833AspPro: 2.833 ± 3.473
4.958AspGln: 4.958 ± 0.811
1.416AspArg: 1.416 ± 0.694
4.958AspSer: 4.958 ± 2.432
7.79AspThr: 7.79 ± 2.199
1.416AspVal: 1.416 ± 0.926
0.0AspTrp: 0.0 ± 0.0
1.416AspTyr: 1.416 ± 0.694
0.0AspXaa: 0.0 ± 0.0
Glu
2.125GluAla: 2.125 ± 0.579
2.125GluCys: 2.125 ± 1.042
0.708GluAsp: 0.708 ± 0.347
7.082GluGlu: 7.082 ± 1.39
0.708GluPhe: 0.708 ± 1.273
3.541GluGly: 3.541 ± 1.736
0.708GluHis: 0.708 ± 0.347
2.833GluIle: 2.833 ± 1.853
5.666GluLys: 5.666 ± 0.464
2.833GluLeu: 2.833 ± 1.853
0.708GluMet: 0.708 ± 0.347
4.958GluAsn: 4.958 ± 0.811
1.416GluPro: 1.416 ± 0.694
2.125GluGln: 2.125 ± 1.042
0.708GluArg: 0.708 ± 0.347
7.082GluSer: 7.082 ± 1.39
2.125GluThr: 2.125 ± 1.042
3.541GluVal: 3.541 ± 0.115
0.708GluTrp: 0.708 ± 0.347
1.416GluTyr: 1.416 ± 0.694
0.0GluXaa: 0.0 ± 0.0
Phe
1.416PheAla: 1.416 ± 0.926
3.541PheCys: 3.541 ± 0.115
2.125PheAsp: 2.125 ± 0.579
0.708PheGlu: 0.708 ± 0.347
2.125PhePhe: 2.125 ± 0.579
2.125PheGly: 2.125 ± 1.042
1.416PheHis: 1.416 ± 0.694
2.125PheIle: 2.125 ± 0.579
2.125PheLys: 2.125 ± 0.579
3.541PheLeu: 3.541 ± 1.736
0.708PheMet: 0.708 ± 0.347
1.416PheAsn: 1.416 ± 0.694
2.125PhePro: 2.125 ± 1.042
1.416PheGln: 1.416 ± 0.694
1.416PheArg: 1.416 ± 0.694
6.374PheSer: 6.374 ± 3.125
2.125PheThr: 2.125 ± 1.042
4.958PheVal: 4.958 ± 0.811
0.708PheTrp: 0.708 ± 0.347
3.541PheTyr: 3.541 ± 0.115
0.0PheXaa: 0.0 ± 0.0
Gly
2.833GlyAla: 2.833 ± 0.232
2.833GlyCys: 2.833 ± 1.389
5.666GlyAsp: 5.666 ± 1.157
0.0GlyGlu: 0.0 ± 0.0
2.833GlyPhe: 2.833 ± 1.389
1.416GlyGly: 1.416 ± 0.694
0.708GlyHis: 0.708 ± 0.347
1.416GlyIle: 1.416 ± 0.694
6.374GlyLys: 6.374 ± 0.117
4.958GlyLeu: 4.958 ± 2.43
0.0GlyMet: 0.0 ± 0.0
3.541GlyAsn: 3.541 ± 1.505
1.416GlyPro: 1.416 ± 0.694
1.416GlyGln: 1.416 ± 0.926
0.708GlyArg: 0.708 ± 0.347
3.541GlySer: 3.541 ± 0.115
3.541GlyThr: 3.541 ± 1.505
1.416GlyVal: 1.416 ± 0.694
0.0GlyTrp: 0.0 ± 0.0
4.958GlyTyr: 4.958 ± 2.43
0.0GlyXaa: 0.0 ± 0.0
His
0.708HisAla: 0.708 ± 0.347
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.416HisGlu: 1.416 ± 0.694
1.416HisPhe: 1.416 ± 0.694
0.708HisGly: 0.708 ± 0.347
0.708HisHis: 0.708 ± 1.273
0.708HisIle: 0.708 ± 1.273
0.708HisLys: 0.708 ± 0.347
1.416HisLeu: 1.416 ± 0.926
0.0HisMet: 0.0 ± 0.0
2.833HisAsn: 2.833 ± 1.853
2.833HisPro: 2.833 ± 1.853
0.708HisGln: 0.708 ± 0.347
0.0HisArg: 0.0 ± 0.0
1.416HisSer: 1.416 ± 0.694
0.708HisThr: 0.708 ± 0.347
1.416HisVal: 1.416 ± 0.694
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.249IleAla: 4.249 ± 0.463
2.125IleCys: 2.125 ± 2.2
3.541IleAsp: 3.541 ± 0.115
4.958IleGlu: 4.958 ± 0.811
3.541IlePhe: 3.541 ± 1.736
3.541IleGly: 3.541 ± 1.736
1.416IleHis: 1.416 ± 0.926
7.082IleIle: 7.082 ± 1.39
6.374IleLys: 6.374 ± 0.117
4.958IleLeu: 4.958 ± 0.81
3.541IleMet: 3.541 ± 0.926
2.833IleAsn: 2.833 ± 0.232
4.249IlePro: 4.249 ± 2.779
1.416IleGln: 1.416 ± 0.694
2.125IleArg: 2.125 ± 2.2
8.499IleSer: 8.499 ± 2.316
7.082IleThr: 7.082 ± 3.472
6.374IleVal: 6.374 ± 1.737
0.708IleTrp: 0.708 ± 1.273
2.833IleTyr: 2.833 ± 1.389
0.0IleXaa: 0.0 ± 0.0
Lys
2.125LysAla: 2.125 ± 0.579
3.541LysCys: 3.541 ± 1.736
2.833LysAsp: 2.833 ± 1.853
3.541LysGlu: 3.541 ± 1.505
2.125LysPhe: 2.125 ± 1.042
4.958LysGly: 4.958 ± 2.432
0.708LysHis: 0.708 ± 0.347
7.79LysIle: 7.79 ± 2.663
4.958LysLys: 4.958 ± 0.81
4.249LysLeu: 4.249 ± 1.158
1.416LysMet: 1.416 ± 0.694
3.541LysAsn: 3.541 ± 0.115
1.416LysPro: 1.416 ± 0.694
4.249LysGln: 4.249 ± 2.779
2.833LysArg: 2.833 ± 1.389
10.623LysSer: 10.623 ± 2.895
7.082LysThr: 7.082 ± 3.472
2.833LysVal: 2.833 ± 0.232
0.0LysTrp: 0.0 ± 0.0
4.249LysTyr: 4.249 ± 2.083
0.0LysXaa: 0.0 ± 0.0
Leu
4.958LeuAla: 4.958 ± 0.811
4.249LeuCys: 4.249 ± 0.463
5.666LeuAsp: 5.666 ± 0.464
2.833LeuGlu: 2.833 ± 0.232
2.833LeuPhe: 2.833 ± 0.232
3.541LeuGly: 3.541 ± 0.115
1.416LeuHis: 1.416 ± 0.926
8.499LeuIle: 8.499 ± 0.925
4.249LeuLys: 4.249 ± 0.463
4.958LeuLeu: 4.958 ± 0.81
0.708LeuMet: 0.708 ± 1.273
4.249LeuAsn: 4.249 ± 0.463
2.125LeuPro: 2.125 ± 1.042
2.125LeuGln: 2.125 ± 0.579
2.833LeuArg: 2.833 ± 0.232
9.207LeuSer: 9.207 ± 0.348
4.958LeuThr: 4.958 ± 2.43
4.249LeuVal: 4.249 ± 2.083
0.708LeuTrp: 0.708 ± 0.347
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.416MetAla: 1.416 ± 0.694
0.708MetCys: 0.708 ± 0.347
3.541MetAsp: 3.541 ± 0.115
0.708MetGlu: 0.708 ± 0.347
1.416MetPhe: 1.416 ± 0.694
0.708MetGly: 0.708 ± 0.347
0.0MetHis: 0.0 ± 0.0
4.249MetIle: 4.249 ± 2.779
1.416MetLys: 1.416 ± 0.694
0.708MetLeu: 0.708 ± 0.347
0.708MetMet: 0.708 ± 1.273
1.416MetAsn: 1.416 ± 2.547
0.708MetPro: 0.708 ± 0.347
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.416MetSer: 1.416 ± 0.694
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.416MetTyr: 1.416 ± 0.926
0.0MetXaa: 0.0 ± 0.0
Asn
3.541AsnAla: 3.541 ± 0.115
1.416AsnCys: 1.416 ± 0.926
4.249AsnAsp: 4.249 ± 0.463
3.541AsnGlu: 3.541 ± 0.115
2.125AsnPhe: 2.125 ± 1.042
3.541AsnGly: 3.541 ± 0.115
1.416AsnHis: 1.416 ± 2.547
2.833AsnIle: 2.833 ± 1.389
7.79AsnLys: 7.79 ± 1.043
3.541AsnLeu: 3.541 ± 0.115
0.708AsnMet: 0.708 ± 0.347
3.541AsnAsn: 3.541 ± 3.126
2.833AsnPro: 2.833 ± 0.232
0.708AsnGln: 0.708 ± 0.347
1.416AsnArg: 1.416 ± 0.926
4.958AsnSer: 4.958 ± 0.811
2.125AsnThr: 2.125 ± 2.2
2.833AsnVal: 2.833 ± 0.232
1.416AsnTrp: 1.416 ± 0.926
2.125AsnTyr: 2.125 ± 1.042
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.708ProCys: 0.708 ± 0.347
1.416ProAsp: 1.416 ± 0.926
4.958ProGlu: 4.958 ± 2.43
0.708ProPhe: 0.708 ± 0.347
0.708ProGly: 0.708 ± 0.347
0.0ProHis: 0.0 ± 0.0
7.79ProIle: 7.79 ± 0.578
2.125ProLys: 2.125 ± 2.2
2.833ProLeu: 2.833 ± 1.389
1.416ProMet: 1.416 ± 0.694
1.416ProAsn: 1.416 ± 0.926
0.708ProPro: 0.708 ± 0.347
1.416ProGln: 1.416 ± 0.694
2.833ProArg: 2.833 ± 1.853
2.833ProSer: 2.833 ± 0.232
4.249ProThr: 4.249 ± 1.158
1.416ProVal: 1.416 ± 0.694
0.0ProTrp: 0.0 ± 0.0
0.708ProTyr: 0.708 ± 0.347
0.0ProXaa: 0.0 ± 0.0
Gln
1.416GlnAla: 1.416 ± 0.926
2.125GlnCys: 2.125 ± 1.042
0.708GlnAsp: 0.708 ± 0.347
0.708GlnGlu: 0.708 ± 0.347
0.708GlnPhe: 0.708 ± 0.347
2.833GlnGly: 2.833 ± 1.853
0.708GlnHis: 0.708 ± 0.347
2.833GlnIle: 2.833 ± 1.853
1.416GlnLys: 1.416 ± 0.694
2.833GlnLeu: 2.833 ± 1.853
0.708GlnMet: 0.708 ± 0.347
2.125GlnAsn: 2.125 ± 1.042
0.708GlnPro: 0.708 ± 0.347
0.708GlnGln: 0.708 ± 0.347
0.708GlnArg: 0.708 ± 1.273
3.541GlnSer: 3.541 ± 1.736
2.125GlnThr: 2.125 ± 1.042
1.416GlnVal: 1.416 ± 0.926
0.0GlnTrp: 0.0 ± 0.0
2.833GlnTyr: 2.833 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
2.833ArgAla: 2.833 ± 5.094
0.708ArgCys: 0.708 ± 0.347
1.416ArgAsp: 1.416 ± 0.694
1.416ArgGlu: 1.416 ± 0.926
2.125ArgPhe: 2.125 ± 1.042
1.416ArgGly: 1.416 ± 0.694
0.0ArgHis: 0.0 ± 0.0
2.833ArgIle: 2.833 ± 0.232
2.833ArgLys: 2.833 ± 0.232
2.833ArgLeu: 2.833 ± 1.389
0.708ArgMet: 0.708 ± 1.273
2.125ArgAsn: 2.125 ± 1.042
2.125ArgPro: 2.125 ± 1.042
0.708ArgGln: 0.708 ± 1.273
2.125ArgArg: 2.125 ± 0.579
2.125ArgSer: 2.125 ± 0.579
1.416ArgThr: 1.416 ± 0.926
2.125ArgVal: 2.125 ± 0.579
0.0ArgTrp: 0.0 ± 0.0
2.125ArgTyr: 2.125 ± 1.042
0.0ArgXaa: 0.0 ± 0.0
Ser
3.541SerAla: 3.541 ± 3.126
3.541SerCys: 3.541 ± 0.115
5.666SerAsp: 5.666 ± 1.157
8.499SerGlu: 8.499 ± 2.316
7.79SerPhe: 7.79 ± 1.043
7.082SerGly: 7.082 ± 3.472
2.833SerHis: 2.833 ± 1.389
7.79SerIle: 7.79 ± 0.578
7.082SerLys: 7.082 ± 0.231
9.915SerLeu: 9.915 ± 1.62
0.708SerMet: 0.708 ± 0.347
2.833SerAsn: 2.833 ± 1.389
1.416SerPro: 1.416 ± 0.926
2.125SerGln: 2.125 ± 0.579
2.833SerArg: 2.833 ± 3.473
11.331SerSer: 11.331 ± 4.169
5.666SerThr: 5.666 ± 2.778
8.499SerVal: 8.499 ± 0.696
2.833SerTrp: 2.833 ± 1.853
4.249SerTyr: 4.249 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
3.541ThrAla: 3.541 ± 1.736
2.125ThrCys: 2.125 ± 1.042
2.833ThrAsp: 2.833 ± 0.232
3.541ThrGlu: 3.541 ± 1.736
2.833ThrPhe: 2.833 ± 1.389
3.541ThrGly: 3.541 ± 0.115
1.416ThrHis: 1.416 ± 0.694
5.666ThrIle: 5.666 ± 1.157
4.958ThrLys: 4.958 ± 0.811
4.958ThrLeu: 4.958 ± 0.81
2.125ThrMet: 2.125 ± 1.042
4.958ThrAsn: 4.958 ± 0.81
2.125ThrPro: 2.125 ± 1.042
0.708ThrGln: 0.708 ± 0.347
2.125ThrArg: 2.125 ± 1.042
4.958ThrSer: 4.958 ± 0.811
3.541ThrThr: 3.541 ± 1.505
6.374ThrVal: 6.374 ± 1.504
0.708ThrTrp: 0.708 ± 1.273
2.833ThrTyr: 2.833 ± 0.232
0.0ThrXaa: 0.0 ± 0.0
Val
2.125ValAla: 2.125 ± 0.579
1.416ValCys: 1.416 ± 0.694
2.833ValAsp: 2.833 ± 0.232
3.541ValGlu: 3.541 ± 0.115
2.125ValPhe: 2.125 ± 0.579
1.416ValGly: 1.416 ± 0.926
0.708ValHis: 0.708 ± 1.273
2.833ValIle: 2.833 ± 0.232
3.541ValLys: 3.541 ± 1.736
4.249ValLeu: 4.249 ± 1.158
0.708ValMet: 0.708 ± 1.273
4.249ValAsn: 4.249 ± 1.158
4.958ValPro: 4.958 ± 2.43
1.416ValGln: 1.416 ± 0.926
2.833ValArg: 2.833 ± 1.389
6.374ValSer: 6.374 ± 1.504
4.958ValThr: 4.958 ± 0.811
4.249ValVal: 4.249 ± 0.463
1.416ValTrp: 1.416 ± 0.694
4.249ValTyr: 4.249 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
0.708TrpAla: 0.708 ± 0.347
0.708TrpCys: 0.708 ± 0.347
2.125TrpAsp: 2.125 ± 0.579
0.0TrpGlu: 0.0 ± 0.0
1.416TrpPhe: 1.416 ± 0.694
0.708TrpGly: 0.708 ± 0.347
0.0TrpHis: 0.0 ± 0.0
0.708TrpIle: 0.708 ± 1.273
0.708TrpLys: 0.708 ± 1.273
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
2.125TrpSer: 2.125 ± 0.579
0.708TrpThr: 0.708 ± 1.273
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.416TyrAla: 1.416 ± 0.694
0.0TyrCys: 0.0 ± 0.0
5.666TyrAsp: 5.666 ± 1.157
0.708TyrGlu: 0.708 ± 1.273
2.125TyrPhe: 2.125 ± 1.042
1.416TyrGly: 1.416 ± 0.694
0.0TyrHis: 0.0 ± 0.0
2.833TyrIle: 2.833 ± 1.389
2.125TyrLys: 2.125 ± 0.579
2.833TyrLeu: 2.833 ± 0.232
2.125TyrMet: 2.125 ± 0.579
4.249TyrAsn: 4.249 ± 2.083
1.416TyrPro: 1.416 ± 0.694
2.833TyrGln: 2.833 ± 1.389
1.416TyrArg: 1.416 ± 0.694
4.958TyrSer: 4.958 ± 0.81
1.416TyrThr: 1.416 ± 0.694
2.833TyrVal: 2.833 ± 1.389
0.708TyrTrp: 0.708 ± 0.347
2.125TyrTyr: 2.125 ± 1.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1413 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski