Amino acid dipeptide frequency for Streptomyces sp. MZ03-48

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
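As a rough illustration of the calculation, the sketch below counts overlapping pairs of adjacent residues within each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are assumptions made for this example (one-letter codes are used for brevity), and this is not necessarily the exact procedure used to build the table.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs.

    Pairs are counted within each protein only, so no pair spans
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: seq[0:2], seq[1:3], ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter amino acid codes).
freqs = dipeptide_frequencies(["AAGA", "GAL"])
for pair, fraction in sorted(freqs.items()):
    print(pair, f"{1000 * fraction:.1f} per mille")
```

Multiplying each fraction by 1000 gives values in the same unit (‰) as the table.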

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
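The arithmetic behind the 0.25% figure, and a simple over/under-representation check, can be sketched as follows. This assumes the comparison threshold is the uniform expectation over the 20 standard amino acids; the exact colouring rule is not specified beyond "more than expected", and the function names are hypothetical. The two sample values are taken from the table.

```python
def uniform_expectation(alphabet_size=20):
    """Expected frequency of each dipeptide if all were equally likely."""
    return 1.0 / (alphabet_size ** 2)

def classify(per_mille, alphabet_size=20):
    """Label a per-mille value relative to the uniform expectation."""
    expected = uniform_expectation(alphabet_size) * 1000  # in per mille
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"

# 20 standard amino acids -> 400 dipeptides; with Xaa -> 441.
print(f"{uniform_expectation(20) * 100:.3f} %")
print(f"{uniform_expectation(21) * 100:.3f} %")

print("AlaAla", classify(23.282))  # value from the table
print("CysCys", classify(0.107))   # value from the table
```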

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
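For example, the unit conversion can be written as below; the helper names are illustrative only, and the sample value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10

ala_ala = 23.282  # AlaAla frequency in per mille, from the table
print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")
print(f"percent:  {per_mille_to_percent(ala_ala):.4f}")
```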

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.282 ± 0.213
AlaCys: 1.118 ± 0.031
AlaAsp: 8.647 ± 0.084
AlaGlu: 8.693 ± 0.089
AlaPhe: 3.555 ± 0.051
AlaGly: 13.091 ± 0.114
AlaHis: 3.288 ± 0.047
AlaIle: 3.126 ± 0.05
AlaLys: 2.957 ± 0.053
AlaLeu: 15.269 ± 0.125
AlaMet: 2.504 ± 0.041
AlaAsn: 1.819 ± 0.033
AlaPro: 8.206 ± 0.089
AlaGln: 3.769 ± 0.052
AlaArg: 11.036 ± 0.094
AlaSer: 5.649 ± 0.061
AlaThr: 7.583 ± 0.068
AlaVal: 12.315 ± 0.122
AlaTrp: 1.804 ± 0.034
AlaTyr: 2.748 ± 0.043
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.273 ± 0.031
CysCys: 0.107 ± 0.008
CysAsp: 0.481 ± 0.019
CysGlu: 0.404 ± 0.015
CysPhe: 0.233 ± 0.011
CysGly: 1.026 ± 0.027
CysHis: 0.209 ± 0.01
CysIle: 0.185 ± 0.01
CysLys: 0.106 ± 0.009
CysLeu: 0.735 ± 0.021
CysMet: 0.122 ± 0.009
CysAsn: 0.127 ± 0.009
CysPro: 0.496 ± 0.018
CysGln: 0.177 ± 0.01
CysArg: 0.681 ± 0.022
CysSer: 0.395 ± 0.015
CysThr: 0.57 ± 0.02
CysVal: 0.671 ± 0.022
CysTrp: 0.118 ± 0.008
CysTyr: 0.152 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.652 ± 0.087
AspCys: 0.424 ± 0.019
AspAsp: 3.611 ± 0.048
AspGlu: 3.726 ± 0.048
AspPhe: 1.523 ± 0.031
AspGly: 6.894 ± 0.079
AspHis: 1.558 ± 0.032
AspIle: 1.851 ± 0.034
AspLys: 1.137 ± 0.033
AspLeu: 6.08 ± 0.067
AspMet: 0.766 ± 0.021
AspAsn: 0.899 ± 0.027
AspPro: 4.372 ± 0.055
AspGln: 1.388 ± 0.034
AspArg: 5.031 ± 0.059
AspSer: 2.249 ± 0.04
AspThr: 3.239 ± 0.049
AspVal: 4.329 ± 0.046
AspTrp: 0.912 ± 0.022
AspTyr: 0.945 ± 0.024
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.384 ± 0.088
GluCys: 0.369 ± 0.014
GluAsp: 2.668 ± 0.045
GluGlu: 3.665 ± 0.073
GluPhe: 1.432 ± 0.028
GluGly: 4.295 ± 0.05
GluHis: 1.399 ± 0.034
GluIle: 2.259 ± 0.039
GluLys: 1.496 ± 0.038
GluLeu: 6.562 ± 0.084
GluMet: 0.941 ± 0.025
GluAsn: 1.003 ± 0.028
GluPro: 3.239 ± 0.041
GluGln: 2.168 ± 0.053
GluArg: 5.582 ± 0.074
GluSer: 2.397 ± 0.041
GluThr: 2.83 ± 0.039
GluVal: 4.479 ± 0.057
GluTrp: 0.729 ± 0.02
GluTyr: 0.992 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.756 ± 0.052
PheCys: 0.289 ± 0.012
PheAsp: 1.853 ± 0.032
PheGlu: 1.266 ± 0.03
PhePhe: 0.875 ± 0.025
PheGly: 3.044 ± 0.043
PheHis: 0.673 ± 0.021
PheIle: 0.701 ± 0.019
PheLys: 0.483 ± 0.019
PheLeu: 2.627 ± 0.037
PheMet: 0.367 ± 0.016
PheAsn: 0.509 ± 0.017
PhePro: 1.396 ± 0.027
PheGln: 0.642 ± 0.018
PheArg: 1.882 ± 0.034
PheSer: 1.39 ± 0.026
PheThr: 2.019 ± 0.04
PheVal: 2.077 ± 0.03
PheTrp: 0.393 ± 0.014
PheTyr: 0.507 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.284 ± 0.091
GlyCys: 0.935 ± 0.027
GlyAsp: 5.324 ± 0.06
GlyGlu: 5.161 ± 0.06
GlyPhe: 2.826 ± 0.051
GlyGly: 9.449 ± 0.113
GlyHis: 2.44 ± 0.04
GlyIle: 3.436 ± 0.049
GlyLys: 2.6 ± 0.049
GlyLeu: 9.399 ± 0.084
GlyMet: 2.09 ± 0.036
GlyAsn: 1.76 ± 0.04
GlyPro: 5.745 ± 0.066
GlyGln: 2.612 ± 0.048
GlyArg: 7.94 ± 0.08
GlySer: 5.161 ± 0.054
GlyThr: 6.462 ± 0.073
GlyVal: 7.488 ± 0.071
GlyTrp: 1.622 ± 0.031
GlyTyr: 2.174 ± 0.038
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.931 ± 0.044
HisCys: 0.217 ± 0.011
HisAsp: 1.446 ± 0.029
HisGlu: 1.152 ± 0.027
HisPhe: 0.707 ± 0.021
HisGly: 2.671 ± 0.044
HisHis: 0.826 ± 0.024
HisIle: 0.688 ± 0.019
HisLys: 0.402 ± 0.015
HisLeu: 2.681 ± 0.042
HisMet: 0.344 ± 0.014
HisAsn: 0.358 ± 0.013
HisPro: 1.98 ± 0.031
HisGln: 0.672 ± 0.022
HisArg: 2.367 ± 0.042
HisSer: 1.022 ± 0.028
HisThr: 1.527 ± 0.033
HisVal: 1.599 ± 0.034
HisTrp: 0.367 ± 0.015
HisTyr: 0.449 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.605 ± 0.056
IleCys: 0.32 ± 0.014
IleAsp: 2.216 ± 0.037
IleGlu: 1.917 ± 0.036
IlePhe: 0.684 ± 0.023
IleGly: 3.653 ± 0.057
IleHis: 0.62 ± 0.019
IleIle: 0.846 ± 0.026
IleLys: 0.776 ± 0.023
IleLeu: 2.247 ± 0.042
IleMet: 0.416 ± 0.015
IleAsn: 0.682 ± 0.02
IlePro: 1.767 ± 0.038
IleGln: 0.678 ± 0.022
IleArg: 2.258 ± 0.037
IleSer: 1.562 ± 0.031
IleThr: 2.015 ± 0.037
IleVal: 2.621 ± 0.043
IleTrp: 0.348 ± 0.014
IleTyr: 0.474 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.864 ± 0.05
LysCys: 0.111 ± 0.008
LysAsp: 1.421 ± 0.036
LysGlu: 1.306 ± 0.035
LysPhe: 0.466 ± 0.017
LysGly: 1.982 ± 0.046
LysHis: 0.416 ± 0.013
LysIle: 0.936 ± 0.024
LysLys: 0.998 ± 0.039
LysLeu: 2.043 ± 0.038
LysMet: 0.395 ± 0.016
LysAsn: 0.542 ± 0.018
LysPro: 1.329 ± 0.035
LysGln: 0.696 ± 0.026
LysArg: 1.463 ± 0.033
LysSer: 1.153 ± 0.031
LysThr: 1.301 ± 0.034
LysVal: 1.888 ± 0.041
LysTrp: 0.238 ± 0.011
LysTyr: 0.437 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.261 ± 0.125
LeuCys: 0.867 ± 0.027
LeuAsp: 6.662 ± 0.07
LeuGlu: 4.514 ± 0.069
LeuPhe: 2.6 ± 0.04
LeuGly: 9.285 ± 0.082
LeuHis: 2.523 ± 0.045
LeuIle: 3.332 ± 0.048
LeuLys: 2.036 ± 0.045
LeuLeu: 11.317 ± 0.12
LeuMet: 1.595 ± 0.033
LeuAsn: 1.62 ± 0.033
LeuPro: 6.748 ± 0.067
LeuGln: 2.17 ± 0.036
LeuArg: 8.995 ± 0.082
LeuSer: 5.202 ± 0.06
LeuThr: 7.309 ± 0.068
LeuVal: 8.342 ± 0.087
LeuTrp: 1.214 ± 0.027
LeuTyr: 1.761 ± 0.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.313 ± 0.039
MetCys: 0.137 ± 0.009
MetAsp: 0.914 ± 0.021
MetGlu: 0.735 ± 0.019
MetPhe: 0.419 ± 0.017
MetGly: 1.329 ± 0.033
MetHis: 0.364 ± 0.014
MetIle: 0.636 ± 0.021
MetLys: 0.416 ± 0.015
MetLeu: 1.68 ± 0.032
MetMet: 0.299 ± 0.013
MetAsn: 0.416 ± 0.015
MetPro: 1.09 ± 0.026
MetGln: 0.427 ± 0.017
MetArg: 1.415 ± 0.028
MetSer: 1.287 ± 0.025
MetThr: 1.505 ± 0.032
MetVal: 1.294 ± 0.03
MetTrp: 0.202 ± 0.011
MetTyr: 0.305 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.151 ± 0.041
AsnCys: 0.162 ± 0.01
AsnAsp: 0.941 ± 0.022
AsnGlu: 0.79 ± 0.023
AsnPhe: 0.446 ± 0.017
AsnGly: 1.82 ± 0.037
AsnHis: 0.412 ± 0.015
AsnIle: 0.613 ± 0.02
AsnLys: 0.423 ± 0.015
AsnLeu: 1.56 ± 0.033
AsnMet: 0.286 ± 0.012
AsnAsn: 0.401 ± 0.015
AsnPro: 1.265 ± 0.024
AsnGln: 0.451 ± 0.017
AsnArg: 1.212 ± 0.025
AsnSer: 0.877 ± 0.027
AsnThr: 1.079 ± 0.028
AsnVal: 1.292 ± 0.032
AsnTrp: 0.262 ± 0.012
AsnTyr: 0.358 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.802 ± 0.075
ProCys: 0.404 ± 0.015
ProAsp: 4.662 ± 0.051
ProGlu: 4.476 ± 0.053
ProPhe: 1.529 ± 0.029
ProGly: 7.452 ± 0.084
ProHis: 1.591 ± 0.036
ProIle: 1.162 ± 0.029
ProLys: 1.202 ± 0.036
ProLeu: 5.587 ± 0.063
ProMet: 0.975 ± 0.023
ProAsn: 0.832 ± 0.025
ProPro: 3.782 ± 0.074
ProGln: 2.019 ± 0.059
ProArg: 4.25 ± 0.051
ProSer: 3.136 ± 0.05
ProThr: 3.112 ± 0.048
ProVal: 5.578 ± 0.055
ProTrp: 0.884 ± 0.022
ProTyr: 1.441 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.563 ± 0.054
GlnCys: 0.167 ± 0.009
GlnAsp: 1.399 ± 0.033
GlnGlu: 1.607 ± 0.043
GlnPhe: 0.657 ± 0.018
GlnGly: 2.254 ± 0.039
GlnHis: 0.674 ± 0.027
GlnIle: 1.113 ± 0.027
GlnLys: 0.613 ± 0.021
GlnLeu: 3.186 ± 0.05
GlnMet: 0.498 ± 0.017
GlnAsn: 0.468 ± 0.018
GlnPro: 1.67 ± 0.045
GlnGln: 1.302 ± 0.053
GlnArg: 2.396 ± 0.043
GlnSer: 1.184 ± 0.034
GlnThr: 1.257 ± 0.032
GlnVal: 2.218 ± 0.042
GlnTrp: 0.435 ± 0.017
GlnTyr: 0.516 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.546 ± 0.107
ArgCys: 0.656 ± 0.022
ArgAsp: 4.372 ± 0.053
ArgGlu: 4.754 ± 0.061
ArgPhe: 2.263 ± 0.039
ArgGly: 6.185 ± 0.07
ArgHis: 2.25 ± 0.039
ArgIle: 3.271 ± 0.039
ArgLys: 1.729 ± 0.035
ArgLeu: 8.828 ± 0.098
ArgMet: 1.777 ± 0.033
ArgAsn: 1.402 ± 0.031
ArgPro: 5.369 ± 0.059
ArgGln: 2.301 ± 0.04
ArgArg: 8.276 ± 0.088
ArgSer: 3.98 ± 0.048
ArgThr: 5.666 ± 0.065
ArgVal: 5.536 ± 0.054
ArgTrp: 1.376 ± 0.028
ArgTyr: 1.878 ± 0.029
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.33 ± 0.06
SerCys: 0.43 ± 0.017
SerAsp: 2.505 ± 0.042
SerGlu: 2.242 ± 0.042
SerPhe: 1.496 ± 0.029
SerGly: 6.091 ± 0.074
SerHis: 1.055 ± 0.027
SerIle: 1.221 ± 0.028
SerLys: 0.987 ± 0.029
SerLeu: 4.582 ± 0.06
SerMet: 0.992 ± 0.026
SerAsn: 0.785 ± 0.023
SerPro: 3.013 ± 0.041
SerGln: 1.202 ± 0.028
SerArg: 3.531 ± 0.048
SerSer: 2.527 ± 0.052
SerThr: 2.716 ± 0.05
SerVal: 3.996 ± 0.049
SerTrp: 0.832 ± 0.025
SerTyr: 1.119 ± 0.026
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.358 ± 0.078
ThrCys: 0.432 ± 0.017
ThrAsp: 3.706 ± 0.045
ThrGlu: 3.224 ± 0.051
ThrPhe: 1.676 ± 0.034
ThrGly: 6.878 ± 0.068
ThrHis: 1.294 ± 0.03
ThrIle: 1.498 ± 0.028
ThrLys: 1.197 ± 0.031
ThrLeu: 5.936 ± 0.058
ThrMet: 0.959 ± 0.025
ThrAsn: 0.924 ± 0.025
ThrPro: 4.254 ± 0.061
ThrGln: 1.296 ± 0.038
ThrArg: 3.819 ± 0.05
ThrSer: 2.974 ± 0.047
ThrThr: 3.832 ± 0.057
ThrVal: 6.326 ± 0.074
ThrTrp: 0.814 ± 0.026
ThrTyr: 1.284 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.77 ± 0.094
ValCys: 0.796 ± 0.022
ValAsp: 4.618 ± 0.054
ValGlu: 4.432 ± 0.057
ValPhe: 2.25 ± 0.037
ValGly: 6.306 ± 0.072
ValHis: 2.033 ± 0.033
ValIle: 2.885 ± 0.045
ValLys: 1.696 ± 0.037
ValLeu: 9.214 ± 0.092
ValMet: 1.357 ± 0.027
ValAsn: 1.588 ± 0.033
ValPro: 5.398 ± 0.057
ValGln: 2.037 ± 0.03
ValArg: 7.166 ± 0.068
ValSer: 3.876 ± 0.054
ValThr: 5.653 ± 0.067
ValVal: 7.467 ± 0.081
ValTrp: 1.02 ± 0.023
ValTyr: 1.456 ± 0.028
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.562 ± 0.034
TrpCys: 0.166 ± 0.009
TrpAsp: 0.75 ± 0.021
TrpGlu: 0.731 ± 0.023
TrpPhe: 0.494 ± 0.018
TrpGly: 1.0 ± 0.024
TrpHis: 0.36 ± 0.012
TrpIle: 0.518 ± 0.016
TrpLys: 0.36 ± 0.014
TrpLeu: 1.716 ± 0.033
TrpMet: 0.274 ± 0.01
TrpAsn: 0.322 ± 0.013
TrpPro: 0.764 ± 0.021
TrpGln: 0.617 ± 0.018
TrpArg: 1.298 ± 0.026
TrpSer: 0.814 ± 0.023
TrpThr: 0.907 ± 0.023
TrpVal: 0.911 ± 0.023
TrpTrp: 0.306 ± 0.014
TrpTyr: 0.315 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.771 ± 0.042
TyrCys: 0.162 ± 0.008
TyrAsp: 1.39 ± 0.032
TyrGlu: 1.095 ± 0.024
TyrPhe: 0.621 ± 0.019
TyrGly: 2.201 ± 0.04
TyrHis: 0.422 ± 0.016
TyrIle: 0.426 ± 0.017
TyrLys: 0.372 ± 0.016
TyrLeu: 2.105 ± 0.032
TyrMet: 0.224 ± 0.012
TyrAsn: 0.344 ± 0.015
TyrPro: 0.993 ± 0.028
TyrGln: 0.552 ± 0.019
TyrArg: 1.815 ± 0.027
TyrSer: 0.815 ± 0.024
TyrThr: 1.102 ± 0.028
TyrVal: 1.591 ± 0.035
TyrTrp: 0.309 ± 0.014
TyrTyr: 0.394 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5822 proteins (1838379 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
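A minimal sketch of a protein-level bootstrap is shown below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation across the resampled estimates; neither detail is confirmed by this page, and the function names and toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences).
toy = ["AAGA", "GAL", "LLAA", "GGAL"]
value = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the adjacent-pair structure within a protein is preserved in every resample.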

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski