Amino acid dipeptide frequency for Lacunisphaera limnophila

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
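As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes, and the choice not to count pairs across protein boundaries are assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs within one
    protein; pairs spanning two different proteins are not counted.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields the pairs at positions (0,1), (1,2), ...
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences in one-letter code: pairs are MA, AA, AL and AA, AG.
example = dipeptide_frequencies(["MAAL", "AAG"])
```

Here `example["AA"]` is two out of five pairs. The table on this page uses three-letter codes, so `AA` would correspond to `AlaAla`.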

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
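The arithmetic behind the 0.25% figure, and the over/under comparison it implies, can be sketched as follows. The function names are hypothetical, and using the uniform expectation as the threshold is an assumption based on the paragraph above, not a confirmed description of how the table is colored.

```python
def uniform_expectation(alphabet_size=20):
    """Expected fraction per dipeptide if all pairs were equally likely."""
    return 1.0 / (alphabet_size ** 2)


def representation(observed_fraction, alphabet_size=20):
    """Compare an observed fraction with the uniform expectation."""
    expected = uniform_expectation(alphabet_size)
    if observed_fraction > expected:
        return "over"    # would be marked red under this assumption
    if observed_fraction < expected:
        return "under"   # would be marked blue under this assumption
    return "as expected"


# 20 amino acids give 1/400 (0.25%); including Xaa gives 1/441.
expected_20 = uniform_expectation(20)
expected_21 = uniform_expectation(21)

# Values from the table, converted from per mille to fractions.
ala_ala = representation(17.659e-3)
cys_cys = representation(0.111e-3)
```

With these inputs, AlaAla falls above the uniform expectation and CysCys below it.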

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
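A minimal sketch of that conversion, assuming table entries contain the `AlaCys: 1.066 ± 0.029` pattern seen in the table on this page. The regular expression and the helper name `parse_cell` are illustrative only.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
# search() is used so that text before the pattern is ignored.
CELL = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_cell(line):
    """Return (first, second, fraction, error_fraction), or None if no match."""
    match = CELL.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) / 1000.0, float(error) / 1000.0


entry = parse_cell("AlaCys: 1.066 ± 0.029")
```

Row labels such as `Ala` do not match the pattern, so `parse_cell` returns `None` for them.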

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰). Entries are grouped by first residue; within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.659 ± 0.196
AlaCys: 1.066 ± 0.029
AlaAsp: 6.408 ± 0.085
AlaGlu: 7.059 ± 0.101
AlaPhe: 4.094 ± 0.062
AlaGly: 11.35 ± 0.132
AlaHis: 2.334 ± 0.048
AlaIle: 4.794 ± 0.078
AlaLys: 4.218 ± 0.074
AlaLeu: 13.239 ± 0.131
AlaMet: 2.487 ± 0.045
AlaAsn: 2.93 ± 0.056
AlaPro: 6.268 ± 0.091
AlaGln: 4.295 ± 0.079
AlaArg: 8.448 ± 0.118
AlaSer: 5.517 ± 0.075
AlaThr: 6.988 ± 0.079
AlaVal: 8.577 ± 0.096
AlaTrp: 2.036 ± 0.057
AlaTyr: 2.633 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.984 ± 0.03
CysCys: 0.111 ± 0.01
CysAsp: 0.468 ± 0.019
CysGlu: 0.403 ± 0.016
CysPhe: 0.357 ± 0.018
CysGly: 0.854 ± 0.029
CysHis: 0.293 ± 0.021
CysIle: 0.358 ± 0.018
CysLys: 0.179 ± 0.016
CysLeu: 0.974 ± 0.026
CysMet: 0.123 ± 0.009
CysAsn: 0.202 ± 0.014
CysPro: 0.483 ± 0.023
CysGln: 0.249 ± 0.016
CysArg: 0.644 ± 0.024
CysSer: 0.435 ± 0.019
CysThr: 0.48 ± 0.02
CysVal: 0.642 ± 0.022
CysTrp: 0.149 ± 0.011
CysTyr: 0.227 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.454 ± 0.077
AspCys: 0.435 ± 0.019
AspAsp: 2.412 ± 0.091
AspGlu: 2.94 ± 0.051
AspPhe: 2.525 ± 0.062
AspGly: 4.588 ± 0.094
AspHis: 1.17 ± 0.036
AspIle: 2.142 ± 0.055
AspLys: 1.517 ± 0.043
AspLeu: 5.747 ± 0.081
AspMet: 0.759 ± 0.024
AspAsn: 1.27 ± 0.053
AspPro: 3.406 ± 0.054
AspGln: 1.75 ± 0.038
AspArg: 3.715 ± 0.063
AspSer: 2.15 ± 0.044
AspThr: 2.577 ± 0.103
AspVal: 3.276 ± 0.052
AspTrp: 1.024 ± 0.035
AspTyr: 1.691 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.386 ± 0.104
GluCys: 0.369 ± 0.015
GluAsp: 2.115 ± 0.041
GluGlu: 2.782 ± 0.064
GluPhe: 2.207 ± 0.047
GluGly: 3.66 ± 0.053
GluHis: 1.222 ± 0.034
GluIle: 3.091 ± 0.055
GluLys: 2.493 ± 0.053
GluLeu: 5.953 ± 0.083
GluMet: 1.171 ± 0.031
GluAsn: 1.608 ± 0.035
GluPro: 2.654 ± 0.046
GluGln: 2.306 ± 0.047
GluArg: 3.809 ± 0.061
GluSer: 2.598 ± 0.053
GluThr: 3.049 ± 0.058
GluVal: 3.902 ± 0.058
GluTrp: 0.791 ± 0.027
GluTyr: 1.149 ± 0.028
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.591 ± 0.058
PheCys: 0.409 ± 0.018
PheAsp: 2.351 ± 0.046
PheGlu: 2.01 ± 0.043
PhePhe: 1.739 ± 0.049
PheGly: 3.416 ± 0.059
PheHis: 0.859 ± 0.028
PheIle: 1.719 ± 0.035
PheLys: 1.277 ± 0.038
PheLeu: 3.961 ± 0.073
PheMet: 0.725 ± 0.025
PheAsn: 1.312 ± 0.033
PhePro: 1.818 ± 0.039
PheGln: 1.21 ± 0.032
PheArg: 2.521 ± 0.058
PheSer: 2.363 ± 0.046
PheThr: 2.76 ± 0.063
PheVal: 2.767 ± 0.047
PheTrp: 0.611 ± 0.023
PheTyr: 1.059 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.728 ± 0.129
GlyCys: 0.889 ± 0.031
GlyAsp: 4.136 ± 0.177
GlyGlu: 4.284 ± 0.064
GlyPhe: 3.546 ± 0.056
GlyGly: 7.215 ± 0.156
GlyHis: 1.873 ± 0.039
GlyIle: 3.851 ± 0.07
GlyLys: 3.026 ± 0.058
GlyLeu: 9.643 ± 0.108
GlyMet: 1.769 ± 0.043
GlyAsn: 2.219 ± 0.072
GlyPro: 3.553 ± 0.07
GlyGln: 2.926 ± 0.051
GlyArg: 6.223 ± 0.102
GlySer: 4.397 ± 0.089
GlyThr: 5.04 ± 0.13
GlyVal: 6.073 ± 0.074
GlyTrp: 1.762 ± 0.052
GlyTyr: 2.414 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.545 ± 0.047
HisCys: 0.233 ± 0.016
HisAsp: 1.235 ± 0.034
HisGlu: 1.194 ± 0.037
HisPhe: 0.998 ± 0.027
HisGly: 2.101 ± 0.048
HisHis: 0.678 ± 0.027
HisIle: 0.742 ± 0.025
HisLys: 0.445 ± 0.021
HisLeu: 2.484 ± 0.048
HisMet: 0.315 ± 0.016
HisAsn: 0.562 ± 0.022
HisPro: 1.564 ± 0.042
HisGln: 0.74 ± 0.028
HisArg: 1.654 ± 0.041
HisSer: 0.966 ± 0.029
HisThr: 1.173 ± 0.032
HisVal: 1.409 ± 0.027
HisTrp: 0.442 ± 0.02
HisTyr: 0.691 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.432 ± 0.065
IleCys: 0.416 ± 0.02
IleAsp: 2.512 ± 0.043
IleGlu: 2.742 ± 0.055
IlePhe: 1.606 ± 0.041
IleGly: 3.886 ± 0.072
IleHis: 0.924 ± 0.031
IleIle: 2.204 ± 0.057
IleLys: 1.51 ± 0.034
IleLeu: 4.149 ± 0.063
IleMet: 0.762 ± 0.027
IleAsn: 1.469 ± 0.047
IlePro: 2.328 ± 0.046
IleGln: 1.258 ± 0.033
IleArg: 2.897 ± 0.053
IleSer: 2.379 ± 0.05
IleThr: 2.958 ± 0.077
IleVal: 3.303 ± 0.064
IleTrp: 0.548 ± 0.022
IleTyr: 1.11 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.7 ± 0.074
LysCys: 0.217 ± 0.014
LysAsp: 1.67 ± 0.048
LysGlu: 1.764 ± 0.045
LysPhe: 1.35 ± 0.035
LysGly: 2.2 ± 0.046
LysHis: 0.793 ± 0.028
LysIle: 1.826 ± 0.044
LysLys: 1.756 ± 0.048
LysLeu: 3.802 ± 0.056
LysMet: 0.837 ± 0.027
LysAsn: 1.036 ± 0.028
LysPro: 2.197 ± 0.048
LysGln: 1.214 ± 0.032
LysArg: 2.048 ± 0.045
LysSer: 1.865 ± 0.053
LysThr: 2.022 ± 0.046
LysVal: 2.442 ± 0.053
LysTrp: 0.484 ± 0.021
LysTyr: 0.801 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.105 ± 0.128
LeuCys: 1.036 ± 0.03
LeuAsp: 5.339 ± 0.073
LeuGlu: 5.315 ± 0.076
LeuPhe: 3.983 ± 0.076
LeuGly: 9.203 ± 0.086
LeuHis: 2.336 ± 0.053
LeuIle: 4.584 ± 0.071
LeuLys: 3.842 ± 0.073
LeuLeu: 12.146 ± 0.149
LeuMet: 1.992 ± 0.043
LeuAsn: 3.054 ± 0.062
LeuPro: 6.696 ± 0.097
LeuGln: 3.566 ± 0.062
LeuArg: 7.876 ± 0.098
LeuSer: 5.526 ± 0.076
LeuThr: 6.819 ± 0.101
LeuVal: 8.147 ± 0.096
LeuTrp: 1.512 ± 0.05
LeuTyr: 2.281 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.1 ± 0.041
MetCys: 0.125 ± 0.009
MetAsp: 0.924 ± 0.027
MetGlu: 0.931 ± 0.027
MetPhe: 0.586 ± 0.026
MetGly: 1.417 ± 0.037
MetHis: 0.415 ± 0.019
MetIle: 0.92 ± 0.027
MetLys: 1.078 ± 0.032
MetLeu: 2.026 ± 0.044
MetMet: 0.413 ± 0.017
MetAsn: 0.773 ± 0.023
MetPro: 1.342 ± 0.036
MetGln: 0.689 ± 0.024
MetArg: 1.273 ± 0.035
MetSer: 1.252 ± 0.032
MetThr: 1.193 ± 0.03
MetVal: 1.207 ± 0.037
MetTrp: 0.17 ± 0.011
MetTyr: 0.24 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.875 ± 0.055
AsnCys: 0.221 ± 0.014
AsnAsp: 1.364 ± 0.05
AsnGlu: 1.331 ± 0.036
AsnPhe: 1.163 ± 0.032
AsnGly: 2.208 ± 0.058
AsnHis: 0.673 ± 0.02
AsnIle: 1.173 ± 0.035
AsnLys: 0.743 ± 0.031
AsnLeu: 3.188 ± 0.067
AsnMet: 0.422 ± 0.018
AsnAsn: 0.888 ± 0.04
AsnPro: 2.174 ± 0.043
AsnGln: 1.015 ± 0.033
AsnArg: 1.961 ± 0.044
AsnSer: 1.425 ± 0.048
AsnThr: 1.6 ± 0.06
AsnVal: 1.811 ± 0.062
AsnTrp: 0.48 ± 0.023
AsnTyr: 0.88 ± 0.032
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.103 ± 0.138
ProCys: 0.329 ± 0.017
ProAsp: 3.508 ± 0.065
ProGlu: 3.67 ± 0.062
ProPhe: 2.023 ± 0.046
ProGly: 5.297 ± 0.076
ProHis: 1.177 ± 0.036
ProIle: 1.804 ± 0.039
ProLys: 1.705 ± 0.043
ProLeu: 5.447 ± 0.096
ProMet: 1.055 ± 0.033
ProAsn: 1.286 ± 0.03
ProPro: 3.247 ± 0.069
ProGln: 1.582 ± 0.043
ProArg: 3.442 ± 0.064
ProSer: 2.575 ± 0.048
ProThr: 3.056 ± 0.056
ProVal: 4.888 ± 0.078
ProTrp: 0.946 ± 0.03
ProTyr: 1.157 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.268 ± 0.074
GlnCys: 0.223 ± 0.013
GlnAsp: 1.36 ± 0.035
GlnGlu: 1.688 ± 0.039
GlnPhe: 1.245 ± 0.03
GlnGly: 2.467 ± 0.054
GlnHis: 0.799 ± 0.028
GlnIle: 1.693 ± 0.038
GlnLys: 1.271 ± 0.034
GlnLeu: 3.962 ± 0.067
GlnMet: 0.693 ± 0.024
GlnAsn: 0.92 ± 0.028
GlnPro: 2.275 ± 0.054
GlnGln: 1.546 ± 0.047
GlnArg: 2.666 ± 0.047
GlnSer: 1.655 ± 0.036
GlnThr: 1.914 ± 0.039
GlnVal: 2.579 ± 0.039
GlnTrp: 0.489 ± 0.021
GlnTyr: 0.637 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.623 ± 0.102
ArgCys: 0.519 ± 0.026
ArgAsp: 3.643 ± 0.056
ArgGlu: 4.264 ± 0.072
ArgPhe: 2.778 ± 0.057
ArgGly: 4.653 ± 0.063
ArgHis: 1.749 ± 0.038
ArgIle: 3.413 ± 0.054
ArgLys: 2.15 ± 0.047
ArgLeu: 8.452 ± 0.113
ArgMet: 1.438 ± 0.034
ArgAsn: 1.803 ± 0.042
ArgPro: 3.985 ± 0.072
ArgGln: 2.723 ± 0.056
ArgArg: 5.382 ± 0.088
ArgSer: 3.288 ± 0.053
ArgThr: 3.942 ± 0.065
ArgVal: 4.951 ± 0.075
ArgTrp: 1.309 ± 0.038
ArgTyr: 1.883 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.903 ± 0.083
SerCys: 0.41 ± 0.02
SerAsp: 2.386 ± 0.043
SerGlu: 2.277 ± 0.045
SerPhe: 2.071 ± 0.049
SerGly: 4.762 ± 0.099
SerHis: 1.067 ± 0.033
SerIle: 2.193 ± 0.053
SerLys: 1.42 ± 0.037
SerLeu: 5.842 ± 0.068
SerMet: 0.967 ± 0.033
SerAsn: 1.294 ± 0.038
SerPro: 3.199 ± 0.061
SerGln: 1.522 ± 0.041
SerArg: 3.339 ± 0.054
SerSer: 2.814 ± 0.062
SerThr: 2.874 ± 0.076
SerVal: 3.591 ± 0.058
SerTrp: 0.754 ± 0.026
SerTyr: 1.363 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.138 ± 0.101
ThrCys: 0.394 ± 0.017
ThrAsp: 2.937 ± 0.063
ThrGlu: 2.86 ± 0.055
ThrPhe: 2.328 ± 0.062
ThrGly: 5.927 ± 0.136
ThrHis: 1.232 ± 0.035
ThrIle: 2.595 ± 0.087
ThrLys: 1.884 ± 0.044
ThrLeu: 6.748 ± 0.117
ThrMet: 0.959 ± 0.03
ThrAsn: 1.518 ± 0.051
ThrPro: 4.091 ± 0.067
ThrGln: 1.773 ± 0.041
ThrArg: 3.562 ± 0.059
ThrSer: 2.816 ± 0.056
ThrThr: 3.456 ± 0.089
ThrVal: 4.617 ± 0.081
ThrTrp: 0.924 ± 0.03
ThrTyr: 1.495 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.564 ± 0.095
ValCys: 0.79 ± 0.03
ValAsp: 3.562 ± 0.057
ValGlu: 3.874 ± 0.066
ValPhe: 2.926 ± 0.048
ValGly: 5.198 ± 0.082
ValHis: 1.505 ± 0.036
ValIle: 3.705 ± 0.057
ValLys: 2.405 ± 0.049
ValLeu: 7.894 ± 0.089
ValMet: 1.498 ± 0.042
ValAsn: 2.209 ± 0.048
ValPro: 3.914 ± 0.063
ValGln: 2.257 ± 0.046
ValArg: 5.086 ± 0.07
ValSer: 3.882 ± 0.058
ValThr: 4.931 ± 0.072
ValVal: 5.811 ± 0.083
ValTrp: 1.089 ± 0.032
ValTyr: 1.63 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.502 ± 0.038
TrpCys: 0.187 ± 0.012
TrpAsp: 0.773 ± 0.027
TrpGlu: 0.696 ± 0.026
TrpPhe: 0.716 ± 0.027
TrpGly: 1.07 ± 0.033
TrpHis: 0.45 ± 0.02
TrpIle: 0.742 ± 0.027
TrpLys: 0.505 ± 0.02
TrpLeu: 2.135 ± 0.062
TrpMet: 0.342 ± 0.018
TrpAsn: 0.512 ± 0.022
TrpPro: 0.833 ± 0.033
TrpGln: 0.777 ± 0.027
TrpArg: 1.387 ± 0.044
TrpSer: 0.939 ± 0.029
TrpThr: 0.98 ± 0.03
TrpVal: 1.063 ± 0.036
TrpTrp: 0.366 ± 0.02
TrpTyr: 0.337 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.741 ± 0.059
TyrCys: 0.236 ± 0.014
TyrAsp: 1.447 ± 0.038
TyrGlu: 1.232 ± 0.028
TyrPhe: 1.236 ± 0.032
TyrGly: 2.093 ± 0.039
TyrHis: 0.625 ± 0.023
TyrIle: 0.845 ± 0.029
TyrLys: 0.597 ± 0.023
TyrLeu: 2.569 ± 0.044
TyrMet: 0.343 ± 0.016
TyrAsn: 0.733 ± 0.031
TyrPro: 1.269 ± 0.041
TyrGln: 0.951 ± 0.025
TyrArg: 2.013 ± 0.047
TyrSer: 1.205 ± 0.036
TyrThr: 1.444 ± 0.033
TyrVal: 1.677 ± 0.036
TyrTrp: 0.422 ± 0.017
TyrTyr: 0.839 ± 0.03
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3510 proteins (1259279 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
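One way such an estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed on each resample, and the spread of those values is reported. This is an illustration under assumptions, not the database's actual procedure. The function names are hypothetical, one-letter codes are used, and taking the sample standard deviation as "the error" is a guess, since the note does not say which statistic is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins, not individual residues, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


proteins = ["MAAL", "AAG", "MKVAA", "GLLK"]  # toy data, not from the proteome
error_aa = bootstrap_error(proteins, "AA")
```

Resampling at the protein level keeps each protein's residues together, so the estimate reflects variation between proteins rather than between individual positions.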

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski