Amino acid dipeptide frequency for Candidatus Macondimonas diazotrophica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
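The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein and that "expected" means the uniform 1/400 baseline mentioned in the text.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

# Uniform baseline for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap within a protein and never span two proteins.
    Any letter outside the 20 standard residues is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or underrepresented against the baseline."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

Under these assumptions, a table value such as AlaAla (14.914‰) would be labelled "over" and CysCys (0.131‰) "under".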

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
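As a small illustration of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and chosen only for this sketch.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

For example, the AlaAla entry of 14.914‰ corresponds to roughly 1.49%, which is about six times the uniform baseline of 0.25%.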

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.914 ± 0.164
AlaCys: 1.257 ± 0.039
AlaAsp: 6.268 ± 0.085
AlaGlu: 7.104 ± 0.121
AlaPhe: 3.717 ± 0.074
AlaGly: 9.704 ± 0.119
AlaHis: 2.809 ± 0.062
AlaIle: 5.357 ± 0.087
AlaLys: 2.652 ± 0.065
AlaLeu: 14.388 ± 0.176
AlaMet: 2.914 ± 0.061
AlaAsn: 2.492 ± 0.065
AlaPro: 5.356 ± 0.099
AlaGln: 4.957 ± 0.096
AlaArg: 9.17 ± 0.124
AlaSer: 5.409 ± 0.085
AlaThr: 5.316 ± 0.085
AlaVal: 8.705 ± 0.106
AlaTrp: 1.754 ± 0.055
AlaTyr: 2.548 ± 0.056
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.144 ± 0.034
CysCys: 0.131 ± 0.012
CysAsp: 0.498 ± 0.025
CysGlu: 0.545 ± 0.026
CysPhe: 0.349 ± 0.02
CysGly: 1.04 ± 0.04
CysHis: 0.327 ± 0.024
CysIle: 0.435 ± 0.022
CysLys: 0.201 ± 0.015
CysLeu: 0.97 ± 0.035
CysMet: 0.185 ± 0.015
CysAsn: 0.236 ± 0.017
CysPro: 0.695 ± 0.032
CysGln: 0.299 ± 0.019
CysArg: 0.828 ± 0.029
CysSer: 0.536 ± 0.032
CysThr: 0.482 ± 0.027
CysVal: 0.675 ± 0.028
CysTrp: 0.149 ± 0.016
CysTyr: 0.271 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.541 ± 0.101
AspCys: 0.588 ± 0.029
AspAsp: 2.847 ± 0.064
AspGlu: 3.287 ± 0.063
AspPhe: 2.097 ± 0.051
AspGly: 4.772 ± 0.089
AspHis: 1.366 ± 0.039
AspIle: 2.588 ± 0.06
AspLys: 1.354 ± 0.04
AspLeu: 6.612 ± 0.111
AspMet: 1.222 ± 0.042
AspAsn: 1.207 ± 0.042
AspPro: 3.856 ± 0.067
AspGln: 2.12 ± 0.048
AspArg: 4.447 ± 0.083
AspSer: 2.438 ± 0.062
AspThr: 2.597 ± 0.062
AspVal: 3.75 ± 0.068
AspTrp: 1.078 ± 0.038
AspTyr: 1.674 ± 0.049
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.065 ± 0.098
GluCys: 0.49 ± 0.027
GluAsp: 2.752 ± 0.06
GluGlu: 2.983 ± 0.072
GluPhe: 1.765 ± 0.047
GluGly: 3.95 ± 0.072
GluHis: 1.371 ± 0.043
GluIle: 3.164 ± 0.062
GluLys: 1.69 ± 0.05
GluLeu: 5.907 ± 0.094
GluMet: 1.44 ± 0.044
GluAsn: 1.452 ± 0.046
GluPro: 2.709 ± 0.053
GluGln: 2.76 ± 0.064
GluArg: 5.26 ± 0.086
GluSer: 3.057 ± 0.067
GluThr: 3.304 ± 0.066
GluVal: 4.23 ± 0.086
GluTrp: 0.813 ± 0.031
GluTyr: 1.121 ± 0.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.489 ± 0.071
PheCys: 0.389 ± 0.022
PheAsp: 2.392 ± 0.053
PheGlu: 2.055 ± 0.047
PhePhe: 1.303 ± 0.044
PheGly: 3.23 ± 0.068
PheHis: 0.835 ± 0.031
PheIle: 1.568 ± 0.043
PheLys: 0.937 ± 0.034
PheLeu: 3.198 ± 0.067
PheMet: 0.807 ± 0.034
PheAsn: 1.088 ± 0.037
PhePro: 1.587 ± 0.048
PheGln: 1.205 ± 0.038
PheArg: 2.145 ± 0.048
PheSer: 2.138 ± 0.055
PheThr: 1.825 ± 0.055
PheVal: 2.417 ± 0.064
PheTrp: 0.533 ± 0.026
PheTyr: 0.945 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.545 ± 0.111
GlyCys: 0.988 ± 0.037
GlyAsp: 4.261 ± 0.079
GlyGlu: 4.748 ± 0.079
GlyPhe: 3.223 ± 0.068
GlyGly: 6.531 ± 0.114
GlyHis: 2.117 ± 0.056
GlyIle: 4.402 ± 0.084
GlyLys: 2.601 ± 0.061
GlyLeu: 9.062 ± 0.119
GlyMet: 2.366 ± 0.057
GlyAsn: 1.973 ± 0.058
GlyPro: 3.37 ± 0.07
GlyGln: 3.223 ± 0.063
GlyArg: 6.292 ± 0.1
GlySer: 4.311 ± 0.081
GlyThr: 4.194 ± 0.08
GlyVal: 6.291 ± 0.09
GlyTrp: 1.469 ± 0.04
GlyTyr: 2.407 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.721 ± 0.062
HisCys: 0.306 ± 0.021
HisAsp: 1.306 ± 0.04
HisGlu: 1.193 ± 0.04
HisPhe: 0.953 ± 0.036
HisGly: 2.203 ± 0.045
HisHis: 0.839 ± 0.042
HisIle: 1.027 ± 0.035
HisLys: 0.559 ± 0.029
HisLeu: 2.786 ± 0.059
HisMet: 0.517 ± 0.023
HisAsn: 0.555 ± 0.027
HisPro: 1.839 ± 0.053
HisGln: 0.886 ± 0.032
HisArg: 1.994 ± 0.05
HisSer: 1.048 ± 0.037
HisThr: 1.102 ± 0.034
HisVal: 1.532 ± 0.041
HisTrp: 0.475 ± 0.027
HisTyr: 0.774 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.804 ± 0.098
IleCys: 0.467 ± 0.023
IleAsp: 3.623 ± 0.059
IleGlu: 3.298 ± 0.074
IlePhe: 1.316 ± 0.043
IleGly: 4.36 ± 0.082
IleHis: 1.243 ± 0.036
IleIle: 1.876 ± 0.055
IleLys: 1.426 ± 0.048
IleLeu: 4.6 ± 0.079
IleMet: 0.871 ± 0.031
IleAsn: 1.392 ± 0.044
IlePro: 2.689 ± 0.059
IleGln: 1.708 ± 0.051
IleArg: 3.654 ± 0.071
IleSer: 2.358 ± 0.053
IleThr: 2.662 ± 0.066
IleVal: 3.285 ± 0.064
IleTrp: 0.592 ± 0.028
IleTyr: 1.14 ± 0.042
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.016 ± 0.074
LysCys: 0.164 ± 0.016
LysAsp: 1.453 ± 0.046
LysGlu: 1.444 ± 0.055
LysPhe: 0.726 ± 0.03
LysGly: 2.065 ± 0.057
LysHis: 0.54 ± 0.027
LysIle: 1.241 ± 0.044
LysLys: 0.984 ± 0.049
LysLeu: 2.573 ± 0.066
LysMet: 0.573 ± 0.026
LysAsn: 0.819 ± 0.043
LysPro: 1.567 ± 0.045
LysGln: 0.974 ± 0.035
LysArg: 1.991 ± 0.051
LysSer: 1.478 ± 0.044
LysThr: 1.643 ± 0.05
LysVal: 1.86 ± 0.055
LysTrp: 0.282 ± 0.018
LysTyr: 0.62 ± 0.031
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.26 ± 0.177
LeuCys: 1.075 ± 0.034
LeuAsp: 6.679 ± 0.101
LeuGlu: 6.235 ± 0.106
LeuPhe: 3.591 ± 0.08
LeuGly: 8.998 ± 0.108
LeuHis: 2.472 ± 0.054
LeuIle: 5.37 ± 0.087
LeuLys: 3.061 ± 0.076
LeuLeu: 11.27 ± 0.158
LeuMet: 2.615 ± 0.062
LeuAsn: 2.712 ± 0.061
LeuPro: 6.379 ± 0.099
LeuGln: 3.773 ± 0.074
LeuArg: 8.361 ± 0.115
LeuSer: 6.183 ± 0.093
LeuThr: 5.901 ± 0.098
LeuVal: 7.259 ± 0.115
LeuTrp: 1.437 ± 0.045
LeuTyr: 2.33 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.084 ± 0.059
MetCys: 0.176 ± 0.013
MetAsp: 1.341 ± 0.044
MetGlu: 1.148 ± 0.042
MetPhe: 0.627 ± 0.03
MetGly: 1.977 ± 0.052
MetHis: 0.524 ± 0.025
MetIle: 1.075 ± 0.041
MetLys: 0.746 ± 0.033
MetLeu: 2.323 ± 0.057
MetMet: 0.573 ± 0.03
MetAsn: 0.811 ± 0.029
MetPro: 1.429 ± 0.04
MetGln: 0.924 ± 0.035
MetArg: 1.752 ± 0.044
MetSer: 1.545 ± 0.04
MetThr: 1.604 ± 0.051
MetVal: 1.643 ± 0.048
MetTrp: 0.208 ± 0.015
MetTyr: 0.358 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.853 ± 0.064
AsnCys: 0.26 ± 0.018
AsnAsp: 1.234 ± 0.04
AsnGlu: 1.171 ± 0.038
AsnPhe: 0.865 ± 0.035
AsnGly: 2.084 ± 0.064
AsnHis: 0.626 ± 0.027
AsnIle: 1.2 ± 0.051
AsnLys: 0.67 ± 0.034
AsnLeu: 2.711 ± 0.056
AsnMet: 0.579 ± 0.031
AsnAsn: 0.699 ± 0.032
AsnPro: 1.853 ± 0.048
AsnGln: 0.941 ± 0.037
AsnArg: 1.97 ± 0.048
AsnSer: 1.112 ± 0.041
AsnThr: 1.294 ± 0.04
AsnVal: 1.79 ± 0.044
AsnTrp: 0.376 ± 0.02
AsnTyr: 0.658 ± 0.032
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.538 ± 0.109
ProCys: 0.447 ± 0.025
ProAsp: 3.949 ± 0.078
ProGlu: 4.01 ± 0.07
ProPhe: 1.789 ± 0.056
ProGly: 4.826 ± 0.089
ProHis: 1.305 ± 0.04
ProIle: 2.487 ± 0.059
ProLys: 1.242 ± 0.045
ProLeu: 5.347 ± 0.09
ProMet: 1.331 ± 0.04
ProAsn: 1.241 ± 0.041
ProPro: 2.927 ± 0.074
ProGln: 1.816 ± 0.055
ProArg: 3.23 ± 0.071
ProSer: 2.588 ± 0.059
ProThr: 2.646 ± 0.056
ProVal: 4.621 ± 0.071
ProTrp: 0.898 ± 0.034
ProTyr: 1.283 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.251 ± 0.083
GlnCys: 0.34 ± 0.02
GlnAsp: 1.834 ± 0.052
GlnGlu: 1.857 ± 0.051
GlnPhe: 1.138 ± 0.04
GlnGly: 3.019 ± 0.069
GlnHis: 0.874 ± 0.035
GlnIle: 2.059 ± 0.053
GlnLys: 0.872 ± 0.036
GlnLeu: 3.861 ± 0.078
GlnMet: 0.916 ± 0.034
GlnAsn: 0.906 ± 0.033
GlnPro: 2.003 ± 0.053
GlnGln: 1.655 ± 0.053
GlnArg: 3.408 ± 0.073
GlnSer: 1.951 ± 0.045
GlnThr: 2.032 ± 0.047
GlnVal: 3.003 ± 0.065
GlnTrp: 0.635 ± 0.025
GlnTyr: 0.805 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.924 ± 0.127
ArgCys: 0.744 ± 0.039
ArgAsp: 4.025 ± 0.076
ArgGlu: 4.743 ± 0.085
ArgPhe: 3.096 ± 0.053
ArgGly: 4.902 ± 0.077
ArgHis: 2.219 ± 0.059
ArgIle: 4.374 ± 0.07
ArgLys: 1.868 ± 0.059
ArgLeu: 9.573 ± 0.137
ArgMet: 2.034 ± 0.05
ArgAsn: 1.956 ± 0.053
ArgPro: 3.93 ± 0.076
ArgGln: 3.442 ± 0.072
ArgArg: 6.664 ± 0.117
ArgSer: 3.642 ± 0.071
ArgThr: 3.492 ± 0.072
ArgVal: 5.333 ± 0.083
ArgTrp: 1.426 ± 0.039
ArgTyr: 2.372 ± 0.053
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.755 ± 0.096
SerCys: 0.485 ± 0.024
SerAsp: 2.791 ± 0.069
SerGlu: 2.727 ± 0.057
SerPhe: 1.829 ± 0.053
SerGly: 5.18 ± 0.078
SerHis: 1.122 ± 0.035
SerIle: 2.333 ± 0.059
SerLys: 1.265 ± 0.052
SerLeu: 5.552 ± 0.09
SerMet: 1.268 ± 0.04
SerAsn: 1.293 ± 0.045
SerPro: 2.887 ± 0.059
SerGln: 1.755 ± 0.047
SerArg: 3.908 ± 0.075
SerSer: 2.768 ± 0.078
SerThr: 2.493 ± 0.06
SerVal: 3.596 ± 0.068
SerTrp: 0.754 ± 0.033
SerTyr: 1.146 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.733 ± 0.088
ThrCys: 0.429 ± 0.025
ThrAsp: 2.836 ± 0.066
ThrGlu: 2.67 ± 0.062
ThrPhe: 1.619 ± 0.043
ThrGly: 4.925 ± 0.073
ThrHis: 1.336 ± 0.039
ThrIle: 2.252 ± 0.057
ThrLys: 0.995 ± 0.035
ThrLeu: 6.434 ± 0.093
ThrMet: 0.909 ± 0.036
ThrAsn: 1.119 ± 0.042
ThrPro: 3.482 ± 0.068
ThrGln: 1.745 ± 0.041
ThrArg: 3.659 ± 0.077
ThrSer: 2.307 ± 0.059
ThrThr: 2.592 ± 0.07
ThrVal: 4.226 ± 0.08
ThrTrp: 0.672 ± 0.028
ThrTyr: 1.232 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.105 ± 0.112
ValCys: 0.801 ± 0.028
ValAsp: 4.257 ± 0.07
ValGlu: 4.174 ± 0.087
ValPhe: 2.595 ± 0.061
ValGly: 5.412 ± 0.08
ValHis: 1.654 ± 0.049
ValIle: 3.853 ± 0.071
ValLys: 1.909 ± 0.055
ValLeu: 8.057 ± 0.103
ValMet: 1.918 ± 0.053
ValAsn: 1.963 ± 0.054
ValPro: 3.819 ± 0.071
ValGln: 2.458 ± 0.057
ValArg: 5.389 ± 0.087
ValSer: 4.076 ± 0.071
ValThr: 3.99 ± 0.067
ValVal: 5.9 ± 0.103
ValTrp: 0.94 ± 0.033
ValTyr: 1.655 ± 0.055
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.394 ± 0.044
TrpCys: 0.165 ± 0.012
TrpAsp: 0.757 ± 0.024
TrpGlu: 0.693 ± 0.029
TrpPhe: 0.508 ± 0.024
TrpGly: 1.053 ± 0.037
TrpHis: 0.432 ± 0.025
TrpIle: 0.8 ± 0.031
TrpLys: 0.382 ± 0.023
TrpLeu: 1.908 ± 0.053
TrpMet: 0.404 ± 0.023
TrpAsn: 0.423 ± 0.023
TrpPro: 0.819 ± 0.03
TrpGln: 0.744 ± 0.032
TrpArg: 1.353 ± 0.05
TrpSer: 0.785 ± 0.031
TrpThr: 0.787 ± 0.033
TrpVal: 1.137 ± 0.043
TrpTrp: 0.271 ± 0.02
TrpTyr: 0.37 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.656 ± 0.07
TyrCys: 0.295 ± 0.019
TyrAsp: 1.337 ± 0.042
TyrGlu: 1.2 ± 0.043
TyrPhe: 0.979 ± 0.032
TyrGly: 2.142 ± 0.057
TyrHis: 0.605 ± 0.031
TyrIle: 0.903 ± 0.038
TyrLys: 0.601 ± 0.03
TyrLeu: 2.829 ± 0.053
TyrMet: 0.426 ± 0.026
TyrAsn: 0.608 ± 0.03
TyrPro: 1.324 ± 0.038
TyrGln: 1.035 ± 0.032
TyrArg: 2.284 ± 0.061
TyrSer: 1.177 ± 0.042
TyrThr: 1.214 ± 0.045
TyrVal: 1.671 ± 0.037
TyrTrp: 0.422 ± 0.024
TyrTyr: 0.669 ± 0.033
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2722 proteins (837426 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
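A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, proteins are resampled with replacement, and the reported error is taken to be the standard deviation across resamples (the source does not say which spread statistic is used).

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide's per mille frequency.

    Whole proteins are resampled with replacement, so every resample
    contains as many proteins as the original set.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.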

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski