Amino acid dipepetide frequency for Leucobacter massiliensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.984AlaAla: 22.984 ± 0.333
0.795AlaCys: 0.795 ± 0.034
7.291AlaAsp: 7.291 ± 0.108
11.321AlaGlu: 11.321 ± 0.184
3.856AlaPhe: 3.856 ± 0.069
14.002AlaGly: 14.002 ± 0.174
2.505AlaHis: 2.505 ± 0.07
5.573AlaIle: 5.573 ± 0.089
2.603AlaLys: 2.603 ± 0.066
14.886AlaLeu: 14.886 ± 0.198
2.55AlaMet: 2.55 ± 0.053
2.159AlaAsn: 2.159 ± 0.048
7.197AlaPro: 7.197 ± 0.12
4.278AlaGln: 4.278 ± 0.083
10.11AlaArg: 10.11 ± 0.154
7.298AlaSer: 7.298 ± 0.118
6.24AlaThr: 6.24 ± 0.089
11.212AlaVal: 11.212 ± 0.136
1.73AlaTrp: 1.73 ± 0.054
2.311AlaTyr: 2.311 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.766CysAla: 0.766 ± 0.03
0.06CysCys: 0.06 ± 0.009
0.326CysAsp: 0.326 ± 0.02
0.364CysGlu: 0.364 ± 0.021
0.173CysPhe: 0.173 ± 0.014
0.638CysGly: 0.638 ± 0.028
0.108CysHis: 0.108 ± 0.011
0.183CysIle: 0.183 ± 0.016
0.06CysLys: 0.06 ± 0.009
0.445CysLeu: 0.445 ± 0.022
0.076CysMet: 0.076 ± 0.01
0.11CysAsn: 0.11 ± 0.011
0.271CysPro: 0.271 ± 0.019
0.109CysGln: 0.109 ± 0.01
0.376CysArg: 0.376 ± 0.02
0.348CysSer: 0.348 ± 0.022
0.371CysThr: 0.371 ± 0.021
0.428CysVal: 0.428 ± 0.021
0.078CysTrp: 0.078 ± 0.009
0.111CysTyr: 0.111 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
8.132AspAla: 8.132 ± 0.117
0.242AspCys: 0.242 ± 0.018
3.006AspAsp: 3.006 ± 0.07
4.099AspGlu: 4.099 ± 0.083
1.679AspPhe: 1.679 ± 0.042
5.64AspGly: 5.64 ± 0.103
1.056AspHis: 1.056 ± 0.035
1.81AspIle: 1.81 ± 0.054
0.754AspLys: 0.754 ± 0.038
4.701AspLeu: 4.701 ± 0.081
0.646AspMet: 0.646 ± 0.024
0.708AspAsn: 0.708 ± 0.031
4.677AspPro: 4.677 ± 0.084
1.238AspGln: 1.238 ± 0.039
4.508AspArg: 4.508 ± 0.082
2.694AspSer: 2.694 ± 0.299
2.727AspThr: 2.727 ± 0.066
4.052AspVal: 4.052 ± 0.076
0.821AspTrp: 0.821 ± 0.032
1.209AspTyr: 1.209 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
9.017GluAla: 9.017 ± 0.149
0.28GluCys: 0.28 ± 0.018
2.978GluAsp: 2.978 ± 0.068
4.095GluGlu: 4.095 ± 0.092
1.91GluPhe: 1.91 ± 0.042
4.835GluGly: 4.835 ± 0.074
1.71GluHis: 1.71 ± 0.048
2.784GluIle: 2.784 ± 0.071
1.234GluLys: 1.234 ± 0.041
8.639GluLeu: 8.639 ± 0.125
1.044GluMet: 1.044 ± 0.034
1.225GluAsn: 1.225 ± 0.04
3.622GluPro: 3.622 ± 0.079
2.892GluGln: 2.892 ± 0.065
6.982GluArg: 6.982 ± 0.147
2.95GluSer: 2.95 ± 0.071
3.251GluThr: 3.251 ± 0.068
5.108GluVal: 5.108 ± 0.077
0.913GluTrp: 0.913 ± 0.034
1.154GluTyr: 1.154 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
4.63PheAla: 4.63 ± 0.08
0.189PheCys: 0.189 ± 0.016
2.104PheAsp: 2.104 ± 0.054
1.955PheGlu: 1.955 ± 0.053
1.143PhePhe: 1.143 ± 0.055
3.394PheGly: 3.394 ± 0.07
0.508PheHis: 0.508 ± 0.027
1.13PheIle: 1.13 ± 0.042
0.44PheLys: 0.44 ± 0.024
2.662PheLeu: 2.662 ± 0.053
0.46PheMet: 0.46 ± 0.024
0.559PheAsn: 0.559 ± 0.025
1.399PhePro: 1.399 ± 0.034
0.753PheGln: 0.753 ± 0.028
1.749PheArg: 1.749 ± 0.041
1.811PheSer: 1.811 ± 0.052
2.166PheThr: 2.166 ± 0.063
2.643PheVal: 2.643 ± 0.056
0.441PheTrp: 0.441 ± 0.023
0.562PheTyr: 0.562 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
12.609GlyAla: 12.609 ± 0.166
0.587GlyCys: 0.587 ± 0.026
4.866GlyAsp: 4.866 ± 0.09
6.409GlyGlu: 6.409 ± 0.095
3.284GlyPhe: 3.284 ± 0.057
8.773GlyGly: 8.773 ± 0.141
1.677GlyHis: 1.677 ± 0.043
4.707GlyIle: 4.707 ± 0.087
2.145GlyLys: 2.145 ± 0.057
9.175GlyLeu: 9.175 ± 0.131
1.887GlyMet: 1.887 ± 0.046
1.627GlyAsn: 1.627 ± 0.051
4.277GlyPro: 4.277 ± 0.074
2.39GlyGln: 2.39 ± 0.05
6.948GlyArg: 6.948 ± 0.11
6.11GlySer: 6.11 ± 0.111
5.763GlyThr: 5.763 ± 0.266
8.124GlyVal: 8.124 ± 0.103
1.503GlyTrp: 1.503 ± 0.04
2.131GlyTyr: 2.131 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
2.466HisAla: 2.466 ± 0.057
0.132HisCys: 0.132 ± 0.013
1.106HisAsp: 1.106 ± 0.037
1.312HisGlu: 1.312 ± 0.041
0.573HisPhe: 0.573 ± 0.023
1.946HisGly: 1.946 ± 0.048
0.519HisHis: 0.519 ± 0.026
0.64HisIle: 0.64 ± 0.029
0.276HisLys: 0.276 ± 0.018
1.926HisLeu: 1.926 ± 0.047
0.275HisMet: 0.275 ± 0.016
0.29HisAsn: 0.29 ± 0.017
1.56HisPro: 1.56 ± 0.044
0.518HisGln: 0.518 ± 0.024
1.695HisArg: 1.695 ± 0.05
0.892HisSer: 0.892 ± 0.036
0.99HisThr: 0.99 ± 0.033
1.342HisVal: 1.342 ± 0.042
0.289HisTrp: 0.289 ± 0.017
0.42HisTyr: 0.42 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.96IleAla: 6.96 ± 0.105
0.287IleCys: 0.287 ± 0.017
2.954IleAsp: 2.954 ± 0.058
3.029IleGlu: 3.029 ± 0.065
1.123IlePhe: 1.123 ± 0.047
4.322IleGly: 4.322 ± 0.08
0.632IleHis: 0.632 ± 0.028
1.543IleIle: 1.543 ± 0.052
0.682IleLys: 0.682 ± 0.032
3.527IleLeu: 3.527 ± 0.073
0.622IleMet: 0.622 ± 0.026
0.815IleAsn: 0.815 ± 0.03
2.444IlePro: 2.444 ± 0.064
0.874IleGln: 0.874 ± 0.031
2.712IleArg: 2.712 ± 0.053
2.154IleSer: 2.154 ± 0.052
2.631IleThr: 2.631 ± 0.069
4.074IleVal: 4.074 ± 0.071
0.455IleTrp: 0.455 ± 0.023
0.679IleTyr: 0.679 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
1.906LysAla: 1.906 ± 0.057
0.068LysCys: 0.068 ± 0.009
0.953LysAsp: 0.953 ± 0.04
0.853LysGlu: 0.853 ± 0.036
0.493LysPhe: 0.493 ± 0.027
1.337LysGly: 1.337 ± 0.052
0.421LysHis: 0.421 ± 0.024
0.891LysIle: 0.891 ± 0.038
0.725LysLys: 0.725 ± 0.037
1.878LysLeu: 1.878 ± 0.049
0.361LysMet: 0.361 ± 0.022
0.514LysAsn: 0.514 ± 0.025
1.15LysPro: 1.15 ± 0.04
0.721LysGln: 0.721 ± 0.031
1.639LysArg: 1.639 ± 0.042
1.044LysSer: 1.044 ± 0.039
1.153LysThr: 1.153 ± 0.043
1.329LysVal: 1.329 ± 0.054
0.248LysTrp: 0.248 ± 0.018
0.408LysTyr: 0.408 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
15.84LeuAla: 15.84 ± 0.2
0.567LeuCys: 0.567 ± 0.024
6.388LeuAsp: 6.388 ± 0.099
5.255LeuGlu: 5.255 ± 0.097
2.853LeuPhe: 2.853 ± 0.066
10.247LeuGly: 10.247 ± 0.135
1.769LeuHis: 1.769 ± 0.043
4.403LeuIle: 4.403 ± 0.08
1.579LeuLys: 1.579 ± 0.047
11.209LeuLeu: 11.209 ± 0.209
1.588LeuMet: 1.588 ± 0.045
1.73LeuAsn: 1.73 ± 0.042
5.764LeuPro: 5.764 ± 0.084
2.481LeuGln: 2.481 ± 0.053
8.552LeuArg: 8.552 ± 0.129
6.067LeuSer: 6.067 ± 0.097
5.774LeuThr: 5.774 ± 0.075
8.657LeuVal: 8.657 ± 0.113
1.189LeuTrp: 1.189 ± 0.043
1.561LeuTyr: 1.561 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
1.884MetAla: 1.884 ± 0.045
0.096MetCys: 0.096 ± 0.011
0.743MetAsp: 0.743 ± 0.028
0.684MetGlu: 0.684 ± 0.029
0.448MetPhe: 0.448 ± 0.026
1.32MetGly: 1.32 ± 0.04
0.349MetHis: 0.349 ± 0.024
0.863MetIle: 0.863 ± 0.033
0.346MetLys: 0.346 ± 0.022
1.958MetLeu: 1.958 ± 0.052
0.285MetMet: 0.285 ± 0.017
0.483MetAsn: 0.483 ± 0.025
1.032MetPro: 1.032 ± 0.034
0.549MetGln: 0.549 ± 0.024
1.499MetArg: 1.499 ± 0.044
1.496MetSer: 1.496 ± 0.042
1.325MetThr: 1.325 ± 0.036
1.202MetVal: 1.202 ± 0.031
0.17MetTrp: 0.17 ± 0.016
0.278MetTyr: 0.278 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.358AsnAla: 2.358 ± 0.051
0.111AsnCys: 0.111 ± 0.012
0.937AsnAsp: 0.937 ± 0.037
1.011AsnGlu: 1.011 ± 0.037
0.565AsnPhe: 0.565 ± 0.027
1.782AsnGly: 1.782 ± 0.058
0.327AsnHis: 0.327 ± 0.02
0.758AsnIle: 0.758 ± 0.031
0.317AsnLys: 0.317 ± 0.021
1.659AsnLeu: 1.659 ± 0.039
0.322AsnMet: 0.322 ± 0.021
0.404AsnAsn: 0.404 ± 0.023
1.416AsnPro: 1.416 ± 0.043
0.509AsnGln: 0.509 ± 0.024
1.271AsnArg: 1.271 ± 0.039
0.895AsnSer: 0.895 ± 0.037
1.111AsnThr: 1.111 ± 0.046
1.546AsnVal: 1.546 ± 0.046
0.307AsnTrp: 0.307 ± 0.021
0.441AsnTyr: 0.441 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
8.195ProAla: 8.195 ± 0.141
0.219ProCys: 0.219 ± 0.016
3.126ProAsp: 3.126 ± 0.066
5.247ProGlu: 5.247 ± 0.088
1.64ProPhe: 1.64 ± 0.045
6.393ProGly: 6.393 ± 0.112
1.109ProHis: 1.109 ± 0.04
1.977ProIle: 1.977 ± 0.05
0.995ProLys: 0.995 ± 0.039
5.283ProLeu: 5.283 ± 0.082
0.852ProMet: 0.852 ± 0.031
1.002ProAsn: 1.002 ± 0.034
2.522ProPro: 2.522 ± 0.068
1.619ProGln: 1.619 ± 0.046
3.91ProArg: 3.91 ± 0.073
3.111ProSer: 3.111 ± 0.058
2.353ProThr: 2.353 ± 0.054
4.857ProVal: 4.857 ± 0.08
0.778ProTrp: 0.778 ± 0.03
0.981ProTyr: 0.981 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
3.646GlnAla: 3.646 ± 0.069
0.126GlnCys: 0.126 ± 0.011
1.194GlnAsp: 1.194 ± 0.039
1.467GlnGlu: 1.467 ± 0.048
0.786GlnPhe: 0.786 ± 0.033
2.159GlnGly: 2.159 ± 0.051
0.726GlnHis: 0.726 ± 0.027
1.489GlnIle: 1.489 ± 0.04
0.583GlnLys: 0.583 ± 0.027
3.627GlnLeu: 3.627 ± 0.075
0.485GlnMet: 0.485 ± 0.021
0.61GlnAsn: 0.61 ± 0.025
1.6GlnPro: 1.6 ± 0.044
1.414GlnGln: 1.414 ± 0.044
2.902GlnArg: 2.902 ± 0.067
1.429GlnSer: 1.429 ± 0.042
1.328GlnThr: 1.328 ± 0.037
1.943GlnVal: 1.943 ± 0.051
0.421GlnTrp: 0.421 ± 0.022
0.563GlnTyr: 0.563 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
10.272ArgAla: 10.272 ± 0.14
0.375ArgCys: 0.375 ± 0.022
4.263ArgAsp: 4.263 ± 0.075
5.94ArgGlu: 5.94 ± 0.112
2.639ArgPhe: 2.639 ± 0.061
6.561ArgGly: 6.561 ± 0.114
1.583ArgHis: 1.583 ± 0.048
3.977ArgIle: 3.977 ± 0.067
1.414ArgLys: 1.414 ± 0.043
8.177ArgLeu: 8.177 ± 0.126
1.675ArgMet: 1.675 ± 0.046
1.238ArgAsn: 1.238 ± 0.038
4.02ArgPro: 4.02 ± 0.072
2.057ArgGln: 2.057 ± 0.046
7.181ArgArg: 7.181 ± 0.118
4.386ArgSer: 4.386 ± 0.073
4.053ArgThr: 4.053 ± 0.061
6.344ArgVal: 6.344 ± 0.088
1.143ArgTrp: 1.143 ± 0.039
1.638ArgTyr: 1.638 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
7.303SerAla: 7.303 ± 0.114
0.331SerCys: 0.331 ± 0.021
2.909SerAsp: 2.909 ± 0.074
3.498SerGlu: 3.498 ± 0.078
1.91SerPhe: 1.91 ± 0.05
6.911SerGly: 6.911 ± 0.272
0.944SerHis: 0.944 ± 0.037
2.206SerIle: 2.206 ± 0.05
1.002SerLys: 1.002 ± 0.041
5.237SerLeu: 5.237 ± 0.075
1.077SerMet: 1.077 ± 0.033
1.026SerAsn: 1.026 ± 0.036
3.214SerPro: 3.214 ± 0.065
1.329SerGln: 1.329 ± 0.045
4.091SerArg: 4.091 ± 0.072
3.226SerSer: 3.226 ± 0.068
2.921SerThr: 2.921 ± 0.056
4.367SerVal: 4.367 ± 0.071
0.996SerTrp: 0.996 ± 0.041
1.229SerTyr: 1.229 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
7.34ThrAla: 7.34 ± 0.118
0.19ThrCys: 0.19 ± 0.015
2.779ThrAsp: 2.779 ± 0.292
3.16ThrGlu: 3.16 ± 0.066
1.577ThrPhe: 1.577 ± 0.05
5.743ThrGly: 5.743 ± 0.11
1.05ThrHis: 1.05 ± 0.033
2.414ThrIle: 2.414 ± 0.057
0.944ThrLys: 0.944 ± 0.035
5.534ThrLeu: 5.534 ± 0.093
0.896ThrMet: 0.896 ± 0.033
1.051ThrAsn: 1.051 ± 0.047
3.672ThrPro: 3.672 ± 0.07
1.398ThrGln: 1.398 ± 0.044
3.738ThrArg: 3.738 ± 0.057
2.882ThrSer: 2.882 ± 0.058
2.831ThrThr: 2.831 ± 0.066
5.615ThrVal: 5.615 ± 0.104
0.634ThrTrp: 0.634 ± 0.026
0.769ThrTyr: 0.769 ± 0.035
0.0ThrXaa: 0.0 ± 0.0
Val
10.608ValAla: 10.608 ± 0.113
0.516ValCys: 0.516 ± 0.027
4.612ValAsp: 4.612 ± 0.071
5.047ValGlu: 5.047 ± 0.087
2.861ValPhe: 2.861 ± 0.068
6.195ValGly: 6.195 ± 0.095
1.542ValHis: 1.542 ± 0.04
3.967ValIle: 3.967 ± 0.075
1.438ValLys: 1.438 ± 0.049
9.332ValLeu: 9.332 ± 0.131
1.373ValMet: 1.373 ± 0.039
1.736ValAsn: 1.736 ± 0.05
4.686ValPro: 4.686 ± 0.079
2.334ValGln: 2.334 ± 0.056
6.135ValArg: 6.135 ± 0.078
5.129ValSer: 5.129 ± 0.091
5.374ValThr: 5.374 ± 0.105
7.437ValVal: 7.437 ± 0.118
1.034ValTrp: 1.034 ± 0.039
1.474ValTyr: 1.474 ± 0.042
0.0ValXaa: 0.0 ± 0.0
Trp
1.565TrpAla: 1.565 ± 0.047
0.102TrpCys: 0.102 ± 0.012
0.724TrpAsp: 0.724 ± 0.028
0.715TrpGlu: 0.715 ± 0.026
0.524TrpPhe: 0.524 ± 0.026
1.057TrpGly: 1.057 ± 0.041
0.311TrpHis: 0.311 ± 0.019
0.645TrpIle: 0.645 ± 0.03
0.257TrpLys: 0.257 ± 0.017
1.569TrpLeu: 1.569 ± 0.051
0.292TrpMet: 0.292 ± 0.019
0.386TrpAsn: 0.386 ± 0.021
0.632TrpPro: 0.632 ± 0.025
0.498TrpGln: 0.498 ± 0.025
1.22TrpArg: 1.22 ± 0.047
0.832TrpSer: 0.832 ± 0.032
0.746TrpThr: 0.746 ± 0.03
1.061TrpVal: 1.061 ± 0.035
0.31TrpTrp: 0.31 ± 0.022
0.269TrpTyr: 0.269 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.421TyrAla: 2.421 ± 0.048
0.128TyrCys: 0.128 ± 0.012
1.139TyrAsp: 1.139 ± 0.04
1.164TyrGlu: 1.164 ± 0.04
0.662TyrPhe: 0.662 ± 0.025
1.736TyrGly: 1.736 ± 0.046
0.32TyrHis: 0.32 ± 0.019
0.556TyrIle: 0.556 ± 0.024
0.297TyrLys: 0.297 ± 0.019
2.049TyrLeu: 2.049 ± 0.053
0.241TyrMet: 0.241 ± 0.015
0.375TyrAsn: 0.375 ± 0.018
1.016TyrPro: 1.016 ± 0.036
0.538TyrGln: 0.538 ± 0.027
1.719TyrArg: 1.719 ± 0.05
0.974TyrSer: 0.974 ± 0.034
1.109TyrThr: 1.109 ± 0.046
1.491TyrVal: 1.491 ± 0.04
0.254TyrTrp: 0.254 ± 0.015
0.391TyrTyr: 0.391 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2663 proteins (899312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski