Amino acid dipepetide frequency for Apple stem grooving virus (strain P-209) (ASGV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.475AlaAla: 2.475 ± 1.233
2.063AlaCys: 2.063 ± 0.532
2.063AlaAsp: 2.063 ± 1.028
2.888AlaGlu: 2.888 ± 1.439
4.125AlaPhe: 4.125 ± 2.055
3.3AlaGly: 3.3 ± 1.644
0.825AlaHis: 0.825 ± 0.411
5.776AlaIle: 5.776 ± 3.36
3.3AlaLys: 3.3 ± 1.474
1.65AlaLeu: 1.65 ± 2.297
0.413AlaMet: 0.413 ± 1.354
2.888AlaAsn: 2.888 ± 0.121
1.238AlaPro: 1.238 ± 2.502
1.65AlaGln: 1.65 ± 0.822
4.538AlaArg: 4.538 ± 0.858
5.776AlaSer: 5.776 ± 1.801
4.538AlaThr: 4.538 ± 0.702
3.3AlaVal: 3.3 ± 1.644
0.825AlaTrp: 0.825 ± 0.411
0.825AlaTyr: 0.825 ± 1.148
0.0AlaXaa: 0.0 ± 0.0
Cys
0.413CysAla: 0.413 ± 0.206
0.825CysCys: 0.825 ± 1.148
0.0CysAsp: 0.0 ± 0.0
1.238CysGlu: 1.238 ± 0.943
2.063CysPhe: 2.063 ± 1.028
0.825CysGly: 0.825 ± 0.411
0.413CysHis: 0.413 ± 0.206
0.825CysIle: 0.825 ± 1.148
1.65CysLys: 1.65 ± 0.822
5.363CysLeu: 5.363 ± 0.447
0.0CysMet: 0.0 ± 0.0
0.825CysAsn: 0.825 ± 0.411
0.413CysPro: 0.413 ± 1.354
1.238CysGln: 1.238 ± 0.617
0.413CysArg: 0.413 ± 0.206
1.65CysSer: 1.65 ± 0.737
2.063CysThr: 2.063 ± 1.028
1.65CysVal: 1.65 ± 0.737
0.0CysTrp: 0.0 ± 0.0
0.825CysTyr: 0.825 ± 1.148
0.0CysXaa: 0.0 ± 0.0
Asp
1.65AspAla: 1.65 ± 3.856
2.063AspCys: 2.063 ± 2.091
1.65AspAsp: 1.65 ± 0.737
3.713AspGlu: 3.713 ± 1.85
3.713AspPhe: 3.713 ± 1.269
4.125AspGly: 4.125 ± 2.623
0.825AspHis: 0.825 ± 0.411
2.888AspIle: 2.888 ± 0.121
5.363AspLys: 5.363 ± 1.113
6.188AspLeu: 6.188 ± 1.524
1.238AspMet: 1.238 ± 0.617
0.413AspAsn: 0.413 ± 0.206
3.3AspPro: 3.3 ± 0.085
2.063AspGln: 2.063 ± 1.028
2.475AspArg: 2.475 ± 0.326
3.713AspSer: 3.713 ± 2.828
2.063AspThr: 2.063 ± 2.091
2.888AspVal: 2.888 ± 0.121
0.413AspTrp: 0.413 ± 0.206
2.063AspTyr: 2.063 ± 1.028
0.0AspXaa: 0.0 ± 0.0
Glu
3.713GluAla: 3.713 ± 1.269
1.65GluCys: 1.65 ± 0.822
4.95GluAsp: 4.95 ± 0.907
5.363GluGlu: 5.363 ± 0.447
5.363GluPhe: 5.363 ± 1.113
6.601GluGly: 6.601 ± 0.17
0.825GluHis: 0.825 ± 1.148
4.538GluIle: 4.538 ± 0.858
6.188GluLys: 6.188 ± 0.036
7.426GluLeu: 7.426 ± 0.581
1.65GluMet: 1.65 ± 0.822
2.888GluAsn: 2.888 ± 1.439
2.475GluPro: 2.475 ± 1.233
0.825GluGln: 0.825 ± 1.148
3.3GluArg: 3.3 ± 0.085
6.601GluSer: 6.601 ± 1.39
2.475GluThr: 2.475 ± 0.326
4.95GluVal: 4.95 ± 0.907
0.413GluTrp: 0.413 ± 0.206
0.825GluTyr: 0.825 ± 0.411
0.0GluXaa: 0.0 ± 0.0
Phe
5.776PheAla: 5.776 ± 1.318
2.063PheCys: 2.063 ± 1.028
7.013PheAsp: 7.013 ± 1.935
6.601PheGlu: 6.601 ± 0.17
2.888PhePhe: 2.888 ± 0.121
4.538PheGly: 4.538 ± 2.417
0.825PheHis: 0.825 ± 0.411
3.713PheIle: 3.713 ± 1.85
4.538PheLys: 4.538 ± 2.417
7.013PheLeu: 7.013 ± 1.184
1.238PheMet: 1.238 ± 0.617
1.238PheAsn: 1.238 ± 0.943
3.3PhePro: 3.3 ± 1.644
2.063PheGln: 2.063 ± 1.028
4.538PheArg: 4.538 ± 5.536
5.776PheSer: 5.776 ± 2.878
2.475PheThr: 2.475 ± 1.233
2.475PheVal: 2.475 ± 1.233
1.238PheTrp: 1.238 ± 0.617
0.825PheTyr: 0.825 ± 0.411
0.0PheXaa: 0.0 ± 0.0
Gly
1.65GlyAla: 1.65 ± 2.297
1.65GlyCys: 1.65 ± 0.737
3.713GlyAsp: 3.713 ± 2.828
4.95GlyGlu: 4.95 ± 2.466
3.713GlyPhe: 3.713 ± 0.29
1.238GlyGly: 1.238 ± 0.617
0.825GlyHis: 0.825 ± 1.148
2.475GlyIle: 2.475 ± 0.326
5.776GlyLys: 5.776 ± 0.241
7.013GlyLeu: 7.013 ± 0.375
2.063GlyMet: 2.063 ± 0.649
2.063GlyAsn: 2.063 ± 1.028
1.238GlyPro: 1.238 ± 0.943
2.888GlyGln: 2.888 ± 1.439
3.3GlyArg: 3.3 ± 3.034
4.95GlySer: 4.95 ± 0.907
1.65GlyThr: 1.65 ± 0.737
4.125GlyVal: 4.125 ± 1.063
0.825GlyTrp: 0.825 ± 1.148
1.238GlyTyr: 1.238 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
1.238HisAla: 1.238 ± 0.617
0.825HisCys: 0.825 ± 1.148
0.825HisAsp: 0.825 ± 0.411
1.238HisGlu: 1.238 ± 0.617
1.65HisPhe: 1.65 ± 0.822
1.238HisGly: 1.238 ± 0.617
0.825HisHis: 0.825 ± 0.411
1.65HisIle: 1.65 ± 2.297
0.825HisLys: 0.825 ± 0.411
1.65HisLeu: 1.65 ± 0.822
0.0HisMet: 0.0 ± 0.0
0.825HisAsn: 0.825 ± 0.411
0.413HisPro: 0.413 ± 0.206
0.825HisGln: 0.825 ± 0.411
1.65HisArg: 1.65 ± 0.737
1.65HisSer: 1.65 ± 0.822
1.238HisThr: 1.238 ± 0.617
1.238HisVal: 1.238 ± 0.617
0.0HisTrp: 0.0 ± 0.0
1.65HisTyr: 1.65 ± 0.737
0.0HisXaa: 0.0 ± 0.0
Ile
2.475IleAla: 2.475 ± 1.233
1.238IleCys: 1.238 ± 0.943
5.776IleAsp: 5.776 ± 0.241
5.776IleGlu: 5.776 ± 0.241
2.063IlePhe: 2.063 ± 1.028
2.888IleGly: 2.888 ± 0.121
0.413IleHis: 0.413 ± 0.206
1.65IleIle: 1.65 ± 0.822
5.363IleLys: 5.363 ± 2.006
4.125IleLeu: 4.125 ± 1.063
0.825IleMet: 0.825 ± 0.411
2.063IleAsn: 2.063 ± 1.028
1.238IlePro: 1.238 ± 0.617
2.063IleGln: 2.063 ± 1.028
2.888IleArg: 2.888 ± 0.121
5.363IleSer: 5.363 ± 3.566
1.238IleThr: 1.238 ± 0.617
2.888IleVal: 2.888 ± 1.68
1.238IleTrp: 1.238 ± 0.617
0.413IleTyr: 0.413 ± 0.206
0.0IleXaa: 0.0 ± 0.0
Lys
3.3LysAla: 3.3 ± 0.085
0.413LysCys: 0.413 ± 0.206
1.65LysAsp: 1.65 ± 0.737
7.426LysGlu: 7.426 ± 2.14
3.3LysPhe: 3.3 ± 1.474
7.013LysGly: 7.013 ± 1.935
1.238LysHis: 1.238 ± 0.617
4.125LysIle: 4.125 ± 0.496
2.063LysLys: 2.063 ± 0.532
7.013LysLeu: 7.013 ± 1.935
3.713LysMet: 3.713 ± 0.29
3.3LysAsn: 3.3 ± 0.085
1.238LysPro: 1.238 ± 0.943
1.65LysGln: 1.65 ± 0.822
8.251LysArg: 8.251 ± 5.246
7.838LysSer: 7.838 ± 2.346
5.363LysThr: 5.363 ± 1.113
4.95LysVal: 4.95 ± 0.652
1.238LysTrp: 1.238 ± 0.617
2.888LysTyr: 2.888 ± 1.68
0.0LysXaa: 0.0 ± 0.0
Leu
6.188LeuAla: 6.188 ± 1.524
0.825LeuCys: 0.825 ± 0.411
7.013LeuAsp: 7.013 ± 2.743
7.013LeuGlu: 7.013 ± 0.375
9.076LeuPhe: 9.076 ± 1.716
4.95LeuGly: 4.95 ± 2.466
2.475LeuHis: 2.475 ± 0.326
3.713LeuIle: 3.713 ± 1.85
9.488LeuLys: 9.488 ± 0.049
11.139LeuLeu: 11.139 ± 3.99
1.65LeuMet: 1.65 ± 0.822
5.363LeuAsn: 5.363 ± 1.113
2.475LeuPro: 2.475 ± 0.326
4.125LeuGln: 4.125 ± 2.055
4.125LeuArg: 4.125 ± 0.496
9.076LeuSer: 9.076 ± 0.156
5.776LeuThr: 5.776 ± 2.878
3.713LeuVal: 3.713 ± 1.269
0.825LeuTrp: 0.825 ± 0.411
2.888LeuTyr: 2.888 ± 0.121
0.0LeuXaa: 0.0 ± 0.0
Met
3.713MetAla: 3.713 ± 1.269
0.825MetCys: 0.825 ± 0.411
0.0MetAsp: 0.0 ± 0.0
3.3MetGlu: 3.3 ± 1.474
1.65MetPhe: 1.65 ± 0.822
0.0MetGly: 0.0 ± 0.0
0.413MetHis: 0.413 ± 0.206
1.238MetIle: 1.238 ± 0.617
0.825MetLys: 0.825 ± 0.411
2.475MetLeu: 2.475 ± 0.326
0.825MetMet: 0.825 ± 0.411
1.238MetAsn: 1.238 ± 0.617
1.65MetPro: 1.65 ± 0.822
0.413MetGln: 0.413 ± 0.206
2.063MetArg: 2.063 ± 1.028
1.238MetSer: 1.238 ± 0.617
1.238MetThr: 1.238 ± 0.617
0.825MetVal: 0.825 ± 0.411
0.413MetTrp: 0.413 ± 0.206
0.825MetTyr: 0.825 ± 0.411
0.0MetXaa: 0.0 ± 0.0
Asn
2.475AsnAla: 2.475 ± 1.233
1.238AsnCys: 1.238 ± 0.943
1.65AsnAsp: 1.65 ± 0.822
2.888AsnGlu: 2.888 ± 1.68
3.3AsnPhe: 3.3 ± 0.085
1.238AsnGly: 1.238 ± 0.617
2.063AsnHis: 2.063 ± 1.028
2.063AsnIle: 2.063 ± 1.028
1.65AsnLys: 1.65 ± 0.737
4.125AsnLeu: 4.125 ± 0.496
2.475AsnMet: 2.475 ± 1.233
0.0AsnAsn: 0.0 ± 0.0
1.65AsnPro: 1.65 ± 0.822
1.238AsnGln: 1.238 ± 0.617
3.3AsnArg: 3.3 ± 0.085
4.125AsnSer: 4.125 ± 1.063
0.825AsnThr: 0.825 ± 0.411
2.063AsnVal: 2.063 ± 3.65
0.825AsnTrp: 0.825 ± 0.411
0.825AsnTyr: 0.825 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
1.238ProAla: 1.238 ± 0.617
0.0ProCys: 0.0 ± 0.0
1.65ProAsp: 1.65 ± 0.737
3.713ProGlu: 3.713 ± 0.29
2.888ProPhe: 2.888 ± 1.439
1.65ProGly: 1.65 ± 2.297
1.238ProHis: 1.238 ± 0.617
1.65ProIle: 1.65 ± 0.822
3.713ProLys: 3.713 ± 0.29
2.888ProLeu: 2.888 ± 0.121
1.238ProMet: 1.238 ± 0.15
1.65ProAsn: 1.65 ± 3.856
2.063ProPro: 2.063 ± 0.532
1.65ProGln: 1.65 ± 0.737
2.063ProArg: 2.063 ± 1.028
2.888ProSer: 2.888 ± 1.439
1.238ProThr: 1.238 ± 0.617
2.063ProVal: 2.063 ± 1.028
0.413ProTrp: 0.413 ± 0.206
0.413ProTyr: 0.413 ± 0.206
0.0ProXaa: 0.0 ± 0.0
Gln
1.65GlnAla: 1.65 ± 0.737
0.825GlnCys: 0.825 ± 0.411
2.063GlnAsp: 2.063 ± 0.532
1.65GlnGlu: 1.65 ± 0.737
1.65GlnPhe: 1.65 ± 0.822
2.475GlnGly: 2.475 ± 1.233
0.413GlnHis: 0.413 ± 0.206
0.413GlnIle: 0.413 ± 0.206
1.238GlnLys: 1.238 ± 0.617
3.3GlnLeu: 3.3 ± 1.644
0.825GlnMet: 0.825 ± 0.411
1.65GlnAsn: 1.65 ± 0.822
1.238GlnPro: 1.238 ± 0.617
0.413GlnGln: 0.413 ± 0.206
0.413GlnArg: 0.413 ± 0.206
3.3GlnSer: 3.3 ± 1.644
3.3GlnThr: 3.3 ± 1.644
1.238GlnVal: 1.238 ± 0.617
1.65GlnTrp: 1.65 ± 0.822
1.238GlnTyr: 1.238 ± 0.943
0.0GlnXaa: 0.0 ± 0.0
Arg
2.888ArgAla: 2.888 ± 0.121
1.238ArgCys: 1.238 ± 0.943
2.888ArgAsp: 2.888 ± 1.439
2.475ArgGlu: 2.475 ± 1.233
7.013ArgPhe: 7.013 ± 5.862
3.3ArgGly: 3.3 ± 6.153
0.825ArgHis: 0.825 ± 0.411
2.888ArgIle: 2.888 ± 0.121
6.601ArgLys: 6.601 ± 1.729
6.188ArgLeu: 6.188 ± 1.524
2.475ArgMet: 2.475 ± 1.233
1.65ArgAsn: 1.65 ± 0.822
1.238ArgPro: 1.238 ± 2.502
2.063ArgGln: 2.063 ± 1.028
5.776ArgArg: 5.776 ± 1.801
4.538ArgSer: 4.538 ± 7.095
0.825ArgThr: 0.825 ± 0.411
4.538ArgVal: 4.538 ± 2.417
0.825ArgTrp: 0.825 ± 0.411
0.825ArgTyr: 0.825 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
4.125SerAla: 4.125 ± 4.182
1.65SerCys: 1.65 ± 0.822
4.125SerAsp: 4.125 ± 2.623
5.776SerGlu: 5.776 ± 0.241
6.188SerPhe: 6.188 ± 3.083
4.538SerGly: 4.538 ± 2.417
2.475SerHis: 2.475 ± 0.326
3.3SerIle: 3.3 ± 3.034
8.251SerLys: 8.251 ± 4.111
8.251SerLeu: 8.251 ± 2.551
2.475SerMet: 2.475 ± 0.326
5.363SerAsn: 5.363 ± 0.447
3.713SerPro: 3.713 ± 1.269
2.475SerGln: 2.475 ± 0.326
4.538SerArg: 4.538 ± 2.417
8.663SerSer: 8.663 ± 1.921
4.125SerThr: 4.125 ± 1.063
3.713SerVal: 3.713 ± 1.85
0.825SerTrp: 0.825 ± 0.411
0.825SerTyr: 0.825 ± 1.148
0.0SerXaa: 0.0 ± 0.0
Thr
2.888ThrAla: 2.888 ± 1.439
1.65ThrCys: 1.65 ± 0.822
2.475ThrAsp: 2.475 ± 1.886
2.888ThrGlu: 2.888 ± 1.68
3.3ThrPhe: 3.3 ± 0.085
2.475ThrGly: 2.475 ± 0.326
1.238ThrHis: 1.238 ± 0.617
2.475ThrIle: 2.475 ± 1.233
4.125ThrLys: 4.125 ± 0.496
5.776ThrLeu: 5.776 ± 2.878
0.413ThrMet: 0.413 ± 0.206
1.65ThrAsn: 1.65 ± 0.822
2.475ThrPro: 2.475 ± 1.233
1.65ThrGln: 1.65 ± 0.822
2.888ThrArg: 2.888 ± 1.439
3.3ThrSer: 3.3 ± 0.085
2.475ThrThr: 2.475 ± 0.326
2.475ThrVal: 2.475 ± 1.233
0.413ThrTrp: 0.413 ± 0.206
1.65ThrTyr: 1.65 ± 0.822
0.0ThrXaa: 0.0 ± 0.0
Val
4.538ValAla: 4.538 ± 0.702
0.413ValCys: 0.413 ± 0.206
2.888ValAsp: 2.888 ± 0.121
2.063ValGlu: 2.063 ± 0.532
4.538ValPhe: 4.538 ± 2.261
2.888ValGly: 2.888 ± 0.121
2.888ValHis: 2.888 ± 0.121
3.713ValIle: 3.713 ± 1.269
2.888ValLys: 2.888 ± 0.121
4.538ValLeu: 4.538 ± 0.858
0.413ValMet: 0.413 ± 0.206
4.125ValAsn: 4.125 ± 2.623
2.475ValPro: 2.475 ± 1.233
1.238ValGln: 1.238 ± 0.943
1.65ValArg: 1.65 ± 0.737
3.3ValSer: 3.3 ± 0.085
2.888ValThr: 2.888 ± 1.439
1.65ValVal: 1.65 ± 0.822
0.413ValTrp: 0.413 ± 0.206
2.063ValTyr: 2.063 ± 0.532
0.0ValXaa: 0.0 ± 0.0
Trp
2.063TrpAla: 2.063 ± 0.532
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.413TrpGlu: 0.413 ± 0.206
0.825TrpPhe: 0.825 ± 0.411
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.65TrpIle: 1.65 ± 0.822
2.063TrpLys: 2.063 ± 1.028
0.825TrpLeu: 0.825 ± 0.411
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.65TrpPro: 1.65 ± 0.822
0.413TrpGln: 0.413 ± 0.206
0.825TrpArg: 0.825 ± 0.411
0.413TrpSer: 0.413 ± 0.206
0.413TrpThr: 0.413 ± 0.206
0.825TrpVal: 0.825 ± 0.411
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.238TyrCys: 1.238 ± 0.617
0.413TyrAsp: 0.413 ± 1.354
1.65TyrGlu: 1.65 ± 0.737
0.825TyrPhe: 0.825 ± 1.148
2.063TyrGly: 2.063 ± 0.532
0.413TyrHis: 0.413 ± 0.206
1.238TyrIle: 1.238 ± 0.617
2.063TyrLys: 2.063 ± 2.091
4.95TyrLeu: 4.95 ± 0.652
0.825TyrMet: 0.825 ± 0.411
0.825TyrAsn: 0.825 ± 0.411
1.238TyrPro: 1.238 ± 0.617
0.0TyrGln: 0.0 ± 0.0
2.063TyrArg: 2.063 ± 0.532
1.238TyrSer: 1.238 ± 0.617
2.475TyrThr: 2.475 ± 1.233
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.825TyrTyr: 0.825 ± 0.411
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2425 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski