Amino acid dipepetide frequency for Legionella septentrionalis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.587AlaAla: 8.587 ± 0.148
1.327AlaCys: 1.327 ± 0.043
3.795AlaAsp: 3.795 ± 0.058
5.513AlaGlu: 5.513 ± 0.104
3.477AlaPhe: 3.477 ± 0.072
5.921AlaGly: 5.921 ± 0.121
2.073AlaHis: 2.073 ± 0.054
6.289AlaIle: 6.289 ± 0.12
5.084AlaLys: 5.084 ± 0.089
10.001AlaLeu: 10.001 ± 0.134
2.28AlaMet: 2.28 ± 0.067
3.661AlaAsn: 3.661 ± 0.075
2.818AlaPro: 2.818 ± 0.069
4.013AlaGln: 4.013 ± 0.079
4.22AlaArg: 4.22 ± 0.076
4.972AlaSer: 4.972 ± 0.08
4.089AlaThr: 4.089 ± 0.089
5.435AlaVal: 5.435 ± 0.097
0.997AlaTrp: 0.997 ± 0.034
2.854AlaTyr: 2.854 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
0.897CysAla: 0.897 ± 0.035
0.204CysCys: 0.204 ± 0.018
0.577CysAsp: 0.577 ± 0.029
0.686CysGlu: 0.686 ± 0.032
0.618CysPhe: 0.618 ± 0.029
0.927CysGly: 0.927 ± 0.04
0.349CysHis: 0.349 ± 0.024
0.819CysIle: 0.819 ± 0.034
0.682CysLys: 0.682 ± 0.03
1.331CysLeu: 1.331 ± 0.049
0.266CysMet: 0.266 ± 0.019
0.486CysAsn: 0.486 ± 0.027
0.485CysPro: 0.485 ± 0.026
0.521CysGln: 0.521 ± 0.023
0.561CysArg: 0.561 ± 0.028
0.805CysSer: 0.805 ± 0.04
0.606CysThr: 0.606 ± 0.029
0.681CysVal: 0.681 ± 0.028
0.152CysTrp: 0.152 ± 0.013
0.454CysTyr: 0.454 ± 0.025
0.0CysXaa: 0.0 ± 0.0
Asp
4.178AspAla: 4.178 ± 0.071
0.566AspCys: 0.566 ± 0.028
2.217AspAsp: 2.217 ± 0.065
3.576AspGlu: 3.576 ± 0.077
2.405AspPhe: 2.405 ± 0.054
2.527AspGly: 2.527 ± 0.06
0.922AspHis: 0.922 ± 0.038
3.425AspIle: 3.425 ± 0.078
3.163AspLys: 3.163 ± 0.078
4.824AspLeu: 4.824 ± 0.086
1.073AspMet: 1.073 ± 0.035
1.948AspAsn: 1.948 ± 0.054
1.823AspPro: 1.823 ± 0.052
1.155AspGln: 1.155 ± 0.04
1.721AspArg: 1.721 ± 0.048
2.392AspSer: 2.392 ± 0.06
2.149AspThr: 2.149 ± 0.077
3.13AspVal: 3.13 ± 0.07
0.644AspTrp: 0.644 ± 0.028
1.967AspTyr: 1.967 ± 0.056
0.0AspXaa: 0.0 ± 0.0
Glu
5.068GluAla: 5.068 ± 0.095
0.588GluCys: 0.588 ± 0.032
2.677GluAsp: 2.677 ± 0.072
4.575GluGlu: 4.575 ± 0.112
2.515GluPhe: 2.515 ± 0.062
3.116GluGly: 3.116 ± 0.072
1.657GluHis: 1.657 ± 0.053
4.665GluIle: 4.665 ± 0.089
4.787GluLys: 4.787 ± 0.1
6.838GluLeu: 6.838 ± 0.1
1.51GluMet: 1.51 ± 0.044
2.929GluAsn: 2.929 ± 0.075
1.947GluPro: 1.947 ± 0.057
3.793GluGln: 3.793 ± 0.088
3.116GluArg: 3.116 ± 0.072
2.963GluSer: 2.963 ± 0.066
3.047GluThr: 3.047 ± 0.068
3.553GluVal: 3.553 ± 0.088
0.621GluTrp: 0.621 ± 0.035
1.785GluTyr: 1.785 ± 0.051
0.0GluXaa: 0.0 ± 0.0
Phe
3.92PheAla: 3.92 ± 0.075
0.645PheCys: 0.645 ± 0.028
2.221PheAsp: 2.221 ± 0.053
2.019PheGlu: 2.019 ± 0.057
2.543PhePhe: 2.543 ± 0.074
2.553PheGly: 2.553 ± 0.07
0.99PheHis: 0.99 ± 0.035
3.507PheIle: 3.507 ± 0.072
2.461PheLys: 2.461 ± 0.069
4.92PheLeu: 4.92 ± 0.109
1.073PheMet: 1.073 ± 0.042
2.293PheAsn: 2.293 ± 0.067
1.892PhePro: 1.892 ± 0.047
1.416PheGln: 1.416 ± 0.042
1.6PheArg: 1.6 ± 0.042
3.355PheSer: 3.355 ± 0.076
2.505PheThr: 2.505 ± 0.057
2.422PheVal: 2.422 ± 0.064
0.586PheTrp: 0.586 ± 0.031
1.683PheTyr: 1.683 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
4.553GlyAla: 4.553 ± 0.096
0.873GlyCys: 0.873 ± 0.042
2.821GlyAsp: 2.821 ± 0.066
3.68GlyGlu: 3.68 ± 0.07
3.202GlyPhe: 3.202 ± 0.09
4.038GlyGly: 4.038 ± 0.092
1.49GlyHis: 1.49 ± 0.042
4.992GlyIle: 4.992 ± 0.094
4.138GlyLys: 4.138 ± 0.08
6.587GlyLeu: 6.587 ± 0.104
1.817GlyMet: 1.817 ± 0.048
2.465GlyAsn: 2.465 ± 0.063
1.591GlyPro: 1.591 ± 0.046
2.469GlyGln: 2.469 ± 0.056
2.778GlyArg: 2.778 ± 0.084
3.409GlySer: 3.409 ± 0.083
2.933GlyThr: 2.933 ± 0.068
4.188GlyVal: 4.188 ± 0.093
0.823GlyTrp: 0.823 ± 0.036
2.276GlyTyr: 2.276 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.522HisAla: 2.522 ± 0.066
0.382HisCys: 0.382 ± 0.027
1.206HisAsp: 1.206 ± 0.047
1.56HisGlu: 1.56 ± 0.047
1.25HisPhe: 1.25 ± 0.046
1.757HisGly: 1.757 ± 0.056
0.907HisHis: 0.907 ± 0.048
1.632HisIle: 1.632 ± 0.05
1.162HisLys: 1.162 ± 0.041
2.705HisLeu: 2.705 ± 0.073
0.573HisMet: 0.573 ± 0.028
0.91HisAsn: 0.91 ± 0.036
1.38HisPro: 1.38 ± 0.044
1.178HisGln: 1.178 ± 0.045
1.21HisArg: 1.21 ± 0.042
1.462HisSer: 1.462 ± 0.043
1.251HisThr: 1.251 ± 0.038
1.574HisVal: 1.574 ± 0.039
0.408HisTrp: 0.408 ± 0.021
1.129HisTyr: 1.129 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.643IleAla: 6.643 ± 0.112
0.861IleCys: 0.861 ± 0.034
3.699IleAsp: 3.699 ± 0.066
4.455IleGlu: 4.455 ± 0.08
2.817IlePhe: 2.817 ± 0.072
4.441IleGly: 4.441 ± 0.086
1.819IleHis: 1.819 ± 0.056
4.878IleIle: 4.878 ± 0.096
4.501IleLys: 4.501 ± 0.089
7.251IleLeu: 7.251 ± 0.122
1.342IleMet: 1.342 ± 0.045
3.547IleAsn: 3.547 ± 0.076
3.39IlePro: 3.39 ± 0.067
2.691IleGln: 2.691 ± 0.062
3.156IleArg: 3.156 ± 0.062
4.545IleSer: 4.545 ± 0.082
3.896IleThr: 3.896 ± 0.082
3.938IleVal: 3.938 ± 0.086
0.586IleTrp: 0.586 ± 0.031
2.136IleTyr: 2.136 ± 0.055
0.0IleXaa: 0.0 ± 0.0
Lys
4.69LysAla: 4.69 ± 0.086
0.404LysCys: 0.404 ± 0.022
2.811LysAsp: 2.811 ± 0.067
4.437LysGlu: 4.437 ± 0.096
1.767LysPhe: 1.767 ± 0.046
2.994LysGly: 2.994 ± 0.069
1.507LysHis: 1.507 ± 0.045
4.561LysIle: 4.561 ± 0.086
4.727LysLys: 4.727 ± 0.095
6.104LysLeu: 6.104 ± 0.108
1.44LysMet: 1.44 ± 0.047
3.44LysAsn: 3.44 ± 0.077
2.946LysPro: 2.946 ± 0.072
3.456LysGln: 3.456 ± 0.073
3.05LysArg: 3.05 ± 0.076
3.303LysSer: 3.303 ± 0.072
3.544LysThr: 3.544 ± 0.076
2.922LysVal: 2.922 ± 0.077
0.558LysTrp: 0.558 ± 0.027
1.516LysTyr: 1.516 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
10.591LeuAla: 10.591 ± 0.154
1.395LeuCys: 1.395 ± 0.043
4.899LeuAsp: 4.899 ± 0.083
5.725LeuGlu: 5.725 ± 0.111
5.007LeuPhe: 5.007 ± 0.112
6.855LeuGly: 6.855 ± 0.107
3.032LeuHis: 3.032 ± 0.07
7.397LeuIle: 7.397 ± 0.122
6.315LeuLys: 6.315 ± 0.099
12.895LeuLeu: 12.895 ± 0.204
2.381LeuMet: 2.381 ± 0.063
5.199LeuAsn: 5.199 ± 0.096
5.333LeuPro: 5.333 ± 0.096
5.833LeuGln: 5.833 ± 0.112
4.982LeuArg: 4.982 ± 0.091
7.633LeuSer: 7.633 ± 0.118
6.101LeuThr: 6.101 ± 0.112
5.966LeuVal: 5.966 ± 0.09
1.043LeuTrp: 1.043 ± 0.043
3.127LeuTyr: 3.127 ± 0.071
0.0LeuXaa: 0.0 ± 0.0
Met
2.113MetAla: 2.113 ± 0.062
0.184MetCys: 0.184 ± 0.016
1.142MetAsp: 1.142 ± 0.036
1.268MetGlu: 1.268 ± 0.049
0.709MetPhe: 0.709 ± 0.03
1.55MetGly: 1.55 ± 0.05
0.678MetHis: 0.678 ± 0.032
1.414MetIle: 1.414 ± 0.042
1.502MetLys: 1.502 ± 0.046
2.541MetLeu: 2.541 ± 0.055
0.66MetMet: 0.66 ± 0.032
1.082MetAsn: 1.082 ± 0.034
1.147MetPro: 1.147 ± 0.039
1.558MetGln: 1.558 ± 0.051
1.286MetArg: 1.286 ± 0.036
1.444MetSer: 1.444 ± 0.046
1.222MetThr: 1.222 ± 0.04
1.558MetVal: 1.558 ± 0.055
0.207MetTrp: 0.207 ± 0.016
0.522MetTyr: 0.522 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
3.564AsnAla: 3.564 ± 0.077
0.513AsnCys: 0.513 ± 0.028
1.987AsnAsp: 1.987 ± 0.059
2.794AsnGlu: 2.794 ± 0.069
1.956AsnPhe: 1.956 ± 0.052
2.338AsnGly: 2.338 ± 0.069
1.272AsnHis: 1.272 ± 0.041
3.06AsnIle: 3.06 ± 0.074
2.871AsnLys: 2.871 ± 0.066
4.884AsnLeu: 4.884 ± 0.093
0.953AsnMet: 0.953 ± 0.038
2.201AsnAsn: 2.201 ± 0.062
2.561AsnPro: 2.561 ± 0.058
2.241AsnGln: 2.241 ± 0.072
2.023AsnArg: 2.023 ± 0.046
2.474AsnSer: 2.474 ± 0.064
2.297AsnThr: 2.297 ± 0.062
2.288AsnVal: 2.288 ± 0.065
0.596AsnTrp: 0.596 ± 0.032
1.771AsnTyr: 1.771 ± 0.056
0.0AsnXaa: 0.0 ± 0.0
Pro
3.82ProAla: 3.82 ± 0.071
0.468ProCys: 0.468 ± 0.023
2.029ProAsp: 2.029 ± 0.057
3.231ProGlu: 3.231 ± 0.07
1.944ProPhe: 1.944 ± 0.048
2.671ProGly: 2.671 ± 0.067
1.095ProHis: 1.095 ± 0.039
2.597ProIle: 2.597 ± 0.06
2.138ProLys: 2.138 ± 0.052
4.631ProLeu: 4.631 ± 0.076
1.046ProMet: 1.046 ± 0.038
1.757ProAsn: 1.757 ± 0.052
1.607ProPro: 1.607 ± 0.049
1.883ProGln: 1.883 ± 0.053
1.571ProArg: 1.571 ± 0.05
2.34ProSer: 2.34 ± 0.058
1.896ProThr: 1.896 ± 0.052
3.327ProVal: 3.327 ± 0.078
0.522ProTrp: 0.522 ± 0.026
1.452ProTyr: 1.452 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
4.649GlnAla: 4.649 ± 0.081
0.516GlnCys: 0.516 ± 0.027
2.004GlnAsp: 2.004 ± 0.06
3.298GlnGlu: 3.298 ± 0.084
2.11GlnPhe: 2.11 ± 0.048
2.877GlnGly: 2.877 ± 0.065
1.387GlnHis: 1.387 ± 0.047
3.124GlnIle: 3.124 ± 0.068
2.846GlnLys: 2.846 ± 0.076
5.154GlnLeu: 5.154 ± 0.097
1.065GlnMet: 1.065 ± 0.034
1.976GlnAsn: 1.976 ± 0.057
1.636GlnPro: 1.636 ± 0.045
3.363GlnGln: 3.363 ± 0.091
2.229GlnArg: 2.229 ± 0.062
2.444GlnSer: 2.444 ± 0.058
2.527GlnThr: 2.527 ± 0.067
2.723GlnVal: 2.723 ± 0.066
0.592GlnTrp: 0.592 ± 0.032
1.59GlnTyr: 1.59 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
3.584ArgAla: 3.584 ± 0.082
0.472ArgCys: 0.472 ± 0.024
2.12ArgAsp: 2.12 ± 0.053
3.124ArgGlu: 3.124 ± 0.071
2.309ArgPhe: 2.309 ± 0.054
2.629ArgGly: 2.629 ± 0.063
1.292ArgHis: 1.292 ± 0.041
3.34ArgIle: 3.34 ± 0.06
2.641ArgLys: 2.641 ± 0.06
5.321ArgLeu: 5.321 ± 0.088
1.175ArgMet: 1.175 ± 0.043
1.988ArgAsn: 1.988 ± 0.054
1.56ArgPro: 1.56 ± 0.049
2.47ArgGln: 2.47 ± 0.066
2.352ArgArg: 2.352 ± 0.065
2.289ArgSer: 2.289 ± 0.052
2.061ArgThr: 2.061 ± 0.05
2.785ArgVal: 2.785 ± 0.068
0.592ArgTrp: 0.592 ± 0.032
1.805ArgTyr: 1.805 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
4.721SerAla: 4.721 ± 0.09
0.699SerCys: 0.699 ± 0.031
2.404SerAsp: 2.404 ± 0.051
3.236SerGlu: 3.236 ± 0.066
2.807SerPhe: 2.807 ± 0.068
4.1SerGly: 4.1 ± 0.084
1.522SerHis: 1.522 ± 0.046
4.162SerIle: 4.162 ± 0.081
3.155SerLys: 3.155 ± 0.074
7.309SerLeu: 7.309 ± 0.118
1.582SerMet: 1.582 ± 0.044
2.376SerAsn: 2.376 ± 0.06
2.703SerPro: 2.703 ± 0.061
2.637SerGln: 2.637 ± 0.069
2.61SerArg: 2.61 ± 0.061
3.886SerSer: 3.886 ± 0.083
2.801SerThr: 2.801 ± 0.067
3.383SerVal: 3.383 ± 0.07
0.767SerTrp: 0.767 ± 0.032
2.015SerTyr: 2.015 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
4.623ThrAla: 4.623 ± 0.088
0.6ThrCys: 0.6 ± 0.028
2.097ThrAsp: 2.097 ± 0.054
2.667ThrGlu: 2.667 ± 0.066
2.061ThrPhe: 2.061 ± 0.057
3.565ThrGly: 3.565 ± 0.082
1.4ThrHis: 1.4 ± 0.038
3.639ThrIle: 3.639 ± 0.098
2.276ThrLys: 2.276 ± 0.063
6.19ThrLeu: 6.19 ± 0.101
1.086ThrMet: 1.086 ± 0.04
1.883ThrAsn: 1.883 ± 0.058
2.811ThrPro: 2.811 ± 0.067
2.314ThrGln: 2.314 ± 0.059
2.401ThrArg: 2.401 ± 0.054
2.955ThrSer: 2.955 ± 0.065
2.623ThrThr: 2.623 ± 0.078
3.375ThrVal: 3.375 ± 0.078
0.564ThrTrp: 0.564 ± 0.027
1.571ThrTyr: 1.571 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
5.158ValAla: 5.158 ± 0.102
0.798ValCys: 0.798 ± 0.037
3.118ValAsp: 3.118 ± 0.072
3.431ValGlu: 3.431 ± 0.086
2.823ValPhe: 2.823 ± 0.064
3.489ValGly: 3.489 ± 0.076
1.415ValHis: 1.415 ± 0.044
4.497ValIle: 4.497 ± 0.104
3.557ValLys: 3.557 ± 0.078
6.55ValLeu: 6.55 ± 0.109
1.559ValMet: 1.559 ± 0.048
2.806ValAsn: 2.806 ± 0.065
2.394ValPro: 2.394 ± 0.061
2.326ValGln: 2.326 ± 0.059
2.603ValArg: 2.603 ± 0.064
3.693ValSer: 3.693 ± 0.084
3.134ValThr: 3.134 ± 0.078
3.96ValVal: 3.96 ± 0.091
0.602ValTrp: 0.602 ± 0.031
1.889ValTyr: 1.889 ± 0.044
0.0ValXaa: 0.0 ± 0.0
Trp
0.77TrpAla: 0.77 ± 0.033
0.128TrpCys: 0.128 ± 0.013
0.474TrpAsp: 0.474 ± 0.027
0.585TrpGlu: 0.585 ± 0.028
0.553TrpPhe: 0.553 ± 0.026
0.701TrpGly: 0.701 ± 0.033
0.373TrpHis: 0.373 ± 0.024
0.738TrpIle: 0.738 ± 0.034
0.492TrpLys: 0.492 ± 0.026
1.704TrpLeu: 1.704 ± 0.06
0.282TrpMet: 0.282 ± 0.02
0.43TrpAsn: 0.43 ± 0.026
0.449TrpPro: 0.449 ± 0.022
0.815TrpGln: 0.815 ± 0.033
0.661TrpArg: 0.661 ± 0.031
0.604TrpSer: 0.604 ± 0.028
0.44TrpThr: 0.44 ± 0.025
0.726TrpVal: 0.726 ± 0.034
0.203TrpTrp: 0.203 ± 0.017
0.392TrpTyr: 0.392 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.755TyrAla: 2.755 ± 0.061
0.488TyrCys: 0.488 ± 0.023
1.51TyrAsp: 1.51 ± 0.045
1.889TyrGlu: 1.889 ± 0.052
1.729TyrPhe: 1.729 ± 0.052
2.072TyrGly: 2.072 ± 0.056
0.975TyrHis: 0.975 ± 0.042
1.853TyrIle: 1.853 ± 0.048
1.677TyrLys: 1.677 ± 0.051
3.97TyrLeu: 3.97 ± 0.082
0.622TyrMet: 0.622 ± 0.031
1.267TyrAsn: 1.267 ± 0.046
1.554TyrPro: 1.554 ± 0.043
1.975TyrGln: 1.975 ± 0.055
1.78TyrArg: 1.78 ± 0.054
1.907TyrSer: 1.907 ± 0.058
1.58TyrThr: 1.58 ± 0.051
1.871TyrVal: 1.871 ± 0.053
0.458TyrTrp: 0.458 ± 0.024
1.274TyrTyr: 1.274 ± 0.049
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2276 proteins (750551 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski