Amino acid dipeptide frequency for Hartmannibacter diazotrophicus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
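As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping pairs of adjacent residues and reports them in per mille. It uses one-letter codes and a toy input rather than the three-letter codes and the proteome shown on this page; the function name `dipeptide_frequencies` and the choice to count pairs only within a single protein are assumptions of the example, not a description of the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of ordered adjacent residue pairs.

    Pairs are counted within each sequence only, so no pair spans
    two different proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping window of width 2: "MKV" yields "MK" and "KV".
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome.
toy = ["AAAC", "AC"]
freqs = dipeptide_frequencies(toy)
print(freqs)
```

Because the pair is ordered, "AC" and "CA" are counted separately, which is why the table lists both AlaCys and CysAla.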

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
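The uniform expectation described above can be worked through in a few lines. This is only a sketch: the names `expected_permille` and `classify` are made up for the example, the three observed values are copied from the table, and the comparison ignores the reported error margins, so it may not match the exact colouring rule used on the page.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 of all dipeptides, expressed in per mille (2.5, i.e. 0.25%).
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed values (per mille) taken from the table on this page.
examples = {"AlaAla": 16.614, "CysCys": 0.104, "AspThr": 2.499}
for pair, value in examples.items():
    ratio = value / expected_permille
    print(f"{pair}: {value} -> {classify(value)} (x{ratio:.2f} of expected)")
```

AlaAla lies well above the 2.5‰ baseline and CysCys well below it, while AspThr sits almost exactly on it.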

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
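For illustration, the unit conversion amounts to the following. The helper names are hypothetical, and the AlaAla value is copied from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 16.614  # AlaAla from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value in percent
```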

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.614 ± 0.155
AlaCys: 1.086 ± 0.028
AlaAsp: 7.207 ± 0.08
AlaGlu: 7.712 ± 0.079
AlaPhe: 4.788 ± 0.061
AlaGly: 11.022 ± 0.087
AlaHis: 2.208 ± 0.039
AlaIle: 6.797 ± 0.071
AlaLys: 4.013 ± 0.065
AlaLeu: 12.878 ± 0.112
AlaMet: 3.701 ± 0.046
AlaAsn: 2.869 ± 0.04
AlaPro: 5.196 ± 0.071
AlaGln: 2.998 ± 0.045
AlaArg: 8.495 ± 0.08
AlaSer: 6.974 ± 0.074
AlaThr: 6.129 ± 0.071
AlaVal: 8.978 ± 0.09
AlaTrp: 1.324 ± 0.031
AlaTyr: 2.551 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.894 ± 0.024
CysCys: 0.104 ± 0.008
CysAsp: 0.52 ± 0.02
CysGlu: 0.464 ± 0.019
CysPhe: 0.317 ± 0.015
CysGly: 0.921 ± 0.026
CysHis: 0.26 ± 0.014
CysIle: 0.361 ± 0.016
CysLys: 0.202 ± 0.012
CysLeu: 0.874 ± 0.022
CysMet: 0.182 ± 0.01
CysAsn: 0.209 ± 0.013
CysPro: 0.468 ± 0.018
CysGln: 0.237 ± 0.012
CysArg: 0.654 ± 0.022
CysSer: 0.483 ± 0.016
CysThr: 0.393 ± 0.015
CysVal: 0.602 ± 0.018
CysTrp: 0.116 ± 0.01
CysTyr: 0.204 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.889 ± 0.072
AspCys: 0.546 ± 0.018
AspAsp: 3.536 ± 0.053
AspGlu: 3.799 ± 0.05
AspPhe: 2.355 ± 0.042
AspGly: 5.466 ± 0.076
AspHis: 1.318 ± 0.032
AspIle: 3.305 ± 0.05
AspLys: 1.845 ± 0.044
AspLeu: 6.599 ± 0.075
AspMet: 1.453 ± 0.03
AspAsn: 1.325 ± 0.03
AspPro: 3.564 ± 0.053
AspGln: 1.629 ± 0.033
AspArg: 4.42 ± 0.063
AspSer: 2.129 ± 0.04
AspThr: 2.499 ± 0.041
AspVal: 4.503 ± 0.052
AspTrp: 0.976 ± 0.025
AspTyr: 1.475 ± 0.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.232 ± 0.088
GluCys: 0.367 ± 0.016
GluAsp: 3.083 ± 0.051
GluGlu: 3.508 ± 0.065
GluPhe: 1.746 ± 0.032
GluGly: 4.459 ± 0.053
GluHis: 1.143 ± 0.028
GluIle: 3.694 ± 0.05
GluLys: 2.417 ± 0.043
GluLeu: 4.932 ± 0.067
GluMet: 1.653 ± 0.033
GluAsn: 1.545 ± 0.036
GluPro: 2.867 ± 0.055
GluGln: 1.872 ± 0.03
GluArg: 4.779 ± 0.064
GluSer: 2.386 ± 0.04
GluThr: 4.065 ± 0.056
GluVal: 4.083 ± 0.059
GluTrp: 0.658 ± 0.02
GluTyr: 0.864 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.72 ± 0.062
PheCys: 0.385 ± 0.016
PheAsp: 2.819 ± 0.046
PheGlu: 2.176 ± 0.036
PhePhe: 1.51 ± 0.038
PheGly: 3.839 ± 0.053
PheHis: 0.781 ± 0.023
PheIle: 1.719 ± 0.035
PheLys: 1.058 ± 0.026
PheLeu: 3.626 ± 0.058
PheMet: 0.874 ± 0.025
PheAsn: 1.018 ± 0.025
PhePro: 1.58 ± 0.034
PheGln: 1.016 ± 0.025
PheArg: 2.286 ± 0.038
PheSer: 2.385 ± 0.042
PheThr: 1.818 ± 0.031
PheVal: 3.068 ± 0.046
PheTrp: 0.569 ± 0.024
PheTyr: 0.877 ± 0.024
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.089 ± 0.088
GlyCys: 0.857 ± 0.025
GlyAsp: 4.606 ± 0.057
GlyGlu: 5.125 ± 0.065
GlyPhe: 3.655 ± 0.049
GlyGly: 7.167 ± 0.086
GlyHis: 1.995 ± 0.041
GlyIle: 4.827 ± 0.061
GlyLys: 3.392 ± 0.055
GlyLeu: 9.39 ± 0.095
GlyMet: 2.336 ± 0.037
GlyAsn: 2.206 ± 0.043
GlyPro: 3.634 ± 0.047
GlyGln: 2.681 ± 0.042
GlyArg: 6.11 ± 0.074
GlySer: 4.786 ± 0.053
GlyThr: 4.627 ± 0.055
GlyVal: 5.98 ± 0.063
GlyTrp: 1.313 ± 0.03
GlyTyr: 2.263 ± 0.037
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.266 ± 0.043
HisCys: 0.219 ± 0.011
HisAsp: 1.311 ± 0.029
HisGlu: 1.045 ± 0.029
HisPhe: 0.887 ± 0.023
HisGly: 1.895 ± 0.043
HisHis: 0.58 ± 0.027
HisIle: 0.879 ± 0.029
HisLys: 0.462 ± 0.017
HisLeu: 2.115 ± 0.031
HisMet: 0.497 ± 0.017
HisAsn: 0.439 ± 0.015
HisPro: 1.366 ± 0.03
HisGln: 0.563 ± 0.018
HisArg: 1.371 ± 0.032
HisSer: 0.929 ± 0.023
HisThr: 0.794 ± 0.021
HisVal: 1.629 ± 0.032
HisTrp: 0.315 ± 0.015
HisTyr: 0.551 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.733 ± 0.084
IleCys: 0.518 ± 0.019
IleAsp: 3.881 ± 0.047
IleGlu: 3.78 ± 0.057
IlePhe: 1.864 ± 0.038
IleGly: 5.333 ± 0.059
IleHis: 0.942 ± 0.024
IleIle: 2.165 ± 0.045
IleLys: 1.448 ± 0.034
IleLeu: 4.847 ± 0.066
IleMet: 1.012 ± 0.023
IleAsn: 1.377 ± 0.031
IlePro: 2.377 ± 0.036
IleGln: 1.163 ± 0.031
IleArg: 3.365 ± 0.052
IleSer: 2.994 ± 0.048
IleThr: 2.548 ± 0.048
IleVal: 4.61 ± 0.061
IleTrp: 0.576 ± 0.021
IleTyr: 1.105 ± 0.028
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.715 ± 0.054
LysCys: 0.165 ± 0.011
LysAsp: 1.92 ± 0.036
LysGlu: 1.672 ± 0.035
LysPhe: 0.921 ± 0.028
LysGly: 2.791 ± 0.045
LysHis: 0.516 ± 0.018
LysIle: 1.795 ± 0.04
LysLys: 1.226 ± 0.035
LysLeu: 3.119 ± 0.048
LysMet: 0.796 ± 0.023
LysAsn: 0.789 ± 0.024
LysPro: 2.024 ± 0.039
LysGln: 0.831 ± 0.025
LysArg: 2.213 ± 0.041
LysSer: 1.984 ± 0.033
LysThr: 2.187 ± 0.04
LysVal: 2.653 ± 0.052
LysTrp: 0.36 ± 0.015
LysTyr: 0.584 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.988 ± 0.127
LeuCys: 0.927 ± 0.026
LeuAsp: 6.204 ± 0.066
LeuGlu: 5.219 ± 0.057
LeuPhe: 3.706 ± 0.055
LeuGly: 8.432 ± 0.085
LeuHis: 1.748 ± 0.033
LeuIle: 5.024 ± 0.068
LeuLys: 3.877 ± 0.051
LeuLeu: 9.48 ± 0.111
LeuMet: 2.515 ± 0.041
LeuAsn: 2.422 ± 0.04
LeuPro: 5.614 ± 0.07
LeuGln: 2.563 ± 0.042
LeuArg: 6.121 ± 0.06
LeuSer: 6.586 ± 0.076
LeuThr: 5.577 ± 0.056
LeuVal: 7.895 ± 0.081
LeuTrp: 1.094 ± 0.026
LeuTyr: 1.985 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.374 ± 0.045
MetCys: 0.138 ± 0.009
MetAsp: 1.257 ± 0.029
MetGlu: 1.233 ± 0.028
MetPhe: 0.674 ± 0.021
MetGly: 1.773 ± 0.037
MetHis: 0.443 ± 0.017
MetIle: 1.471 ± 0.031
MetLys: 1.042 ± 0.027
MetLeu: 2.507 ± 0.039
MetMet: 0.712 ± 0.022
MetAsn: 0.786 ± 0.023
MetPro: 1.606 ± 0.035
MetGln: 0.782 ± 0.021
MetArg: 1.86 ± 0.031
MetSer: 1.777 ± 0.033
MetThr: 2.18 ± 0.036
MetVal: 1.817 ± 0.04
MetTrp: 0.203 ± 0.012
MetTyr: 0.258 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.077 ± 0.047
AsnCys: 0.228 ± 0.013
AsnAsp: 1.482 ± 0.034
AsnGlu: 1.32 ± 0.028
AsnPhe: 0.932 ± 0.026
AsnGly: 2.359 ± 0.039
AsnHis: 0.503 ± 0.017
AsnIle: 1.337 ± 0.03
AsnLys: 0.632 ± 0.023
AsnLeu: 2.427 ± 0.037
AsnMet: 0.584 ± 0.022
AsnAsn: 0.621 ± 0.023
AsnPro: 1.772 ± 0.037
AsnGln: 0.642 ± 0.02
AsnArg: 1.705 ± 0.036
AsnSer: 1.232 ± 0.029
AsnThr: 1.133 ± 0.027
AsnVal: 1.95 ± 0.035
AsnTrp: 0.399 ± 0.016
AsnTyr: 0.603 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.091 ± 0.073
ProCys: 0.304 ± 0.014
ProAsp: 3.855 ± 0.054
ProGlu: 3.786 ± 0.056
ProPhe: 2.03 ± 0.037
ProGly: 4.35 ± 0.056
ProHis: 1.067 ± 0.024
ProIle: 2.459 ± 0.04
ProLys: 1.792 ± 0.039
ProLeu: 4.697 ± 0.061
ProMet: 1.297 ± 0.029
ProAsn: 1.29 ± 0.03
ProPro: 2.471 ± 0.049
ProGln: 1.416 ± 0.032
ProArg: 2.746 ± 0.039
ProSer: 2.842 ± 0.044
ProThr: 2.531 ± 0.045
ProVal: 4.36 ± 0.053
ProTrp: 0.612 ± 0.02
ProTyr: 1.224 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.479 ± 0.054
GlnCys: 0.193 ± 0.013
GlnAsp: 1.362 ± 0.032
GlnGlu: 1.434 ± 0.031
GlnPhe: 1.033 ± 0.024
GlnGly: 2.027 ± 0.036
GlnHis: 0.541 ± 0.019
GlnIle: 1.748 ± 0.034
GlnLys: 1.021 ± 0.029
GlnLeu: 2.587 ± 0.044
GlnMet: 0.842 ± 0.023
GlnAsn: 0.808 ± 0.024
GlnPro: 1.528 ± 0.036
GlnGln: 1.034 ± 0.03
GlnArg: 2.126 ± 0.04
GlnSer: 1.748 ± 0.038
GlnThr: 1.67 ± 0.031
GlnVal: 2.004 ± 0.036
GlnTrp: 0.302 ± 0.014
GlnTyr: 0.521 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.028 ± 0.075
ArgCys: 0.502 ± 0.016
ArgAsp: 3.756 ± 0.051
ArgGlu: 3.997 ± 0.061
ArgPhe: 2.897 ± 0.042
ArgGly: 4.571 ± 0.059
ArgHis: 1.777 ± 0.037
ArgIle: 4.147 ± 0.05
ArgLys: 2.251 ± 0.037
ArgLeu: 8.141 ± 0.075
ArgMet: 1.882 ± 0.03
ArgAsn: 1.729 ± 0.031
ArgPro: 3.523 ± 0.048
ArgGln: 2.641 ± 0.043
ArgArg: 5.64 ± 0.08
ArgSer: 3.747 ± 0.052
ArgThr: 3.397 ± 0.048
ArgVal: 4.409 ± 0.059
ArgTrp: 0.874 ± 0.026
ArgTyr: 1.549 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.368 ± 0.072
SerCys: 0.42 ± 0.016
SerAsp: 3.232 ± 0.045
SerGlu: 3.048 ± 0.045
SerPhe: 2.292 ± 0.04
SerGly: 5.758 ± 0.063
SerHis: 1.086 ± 0.024
SerIle: 3.025 ± 0.044
SerLys: 1.589 ± 0.031
SerLeu: 5.787 ± 0.075
SerMet: 1.45 ± 0.029
SerAsn: 1.339 ± 0.031
SerPro: 2.922 ± 0.043
SerGln: 1.603 ± 0.035
SerArg: 3.845 ± 0.047
SerSer: 3.23 ± 0.061
SerThr: 2.685 ± 0.049
SerVal: 4.195 ± 0.053
SerTrp: 0.697 ± 0.023
SerTyr: 1.232 ± 0.027
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.19 ± 0.076
ThrCys: 0.442 ± 0.018
ThrAsp: 2.962 ± 0.044
ThrGlu: 2.675 ± 0.044
ThrPhe: 2.088 ± 0.035
ThrGly: 5.038 ± 0.069
ThrHis: 1.022 ± 0.025
ThrIle: 3.161 ± 0.042
ThrLys: 1.513 ± 0.031
ThrLeu: 5.75 ± 0.065
ThrMet: 1.334 ± 0.032
ThrAsn: 1.293 ± 0.03
ThrPro: 3.139 ± 0.05
ThrGln: 1.267 ± 0.026
ThrArg: 3.241 ± 0.049
ThrSer: 3.154 ± 0.049
ThrThr: 2.96 ± 0.047
ThrVal: 4.541 ± 0.052
ThrTrp: 0.651 ± 0.023
ThrTyr: 1.242 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.34 ± 0.096
ValCys: 0.661 ± 0.017
ValAsp: 4.562 ± 0.063
ValGlu: 4.646 ± 0.061
ValPhe: 3.003 ± 0.05
ValGly: 5.935 ± 0.067
ValHis: 1.414 ± 0.033
ValIle: 4.18 ± 0.052
ValLys: 2.398 ± 0.046
ValLeu: 7.472 ± 0.085
ValMet: 1.986 ± 0.031
ValAsn: 1.925 ± 0.036
ValPro: 3.822 ± 0.06
ValGln: 1.897 ± 0.036
ValArg: 4.838 ± 0.061
ValSer: 4.493 ± 0.057
ValThr: 4.717 ± 0.056
ValVal: 6.21 ± 0.073
ValTrp: 0.887 ± 0.028
ValTyr: 1.557 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.131 ± 0.028
TrpCys: 0.146 ± 0.009
TrpAsp: 0.581 ± 0.02
TrpGlu: 0.514 ± 0.019
TrpPhe: 0.517 ± 0.015
TrpGly: 0.819 ± 0.027
TrpHis: 0.315 ± 0.013
TrpIle: 0.681 ± 0.023
TrpLys: 0.464 ± 0.016
TrpLeu: 1.501 ± 0.031
TrpMet: 0.342 ± 0.016
TrpAsn: 0.412 ± 0.017
TrpPro: 0.674 ± 0.022
TrpGln: 0.523 ± 0.018
TrpArg: 1.015 ± 0.027
TrpSer: 0.81 ± 0.025
TrpThr: 0.79 ± 0.022
TrpVal: 0.734 ± 0.022
TrpTrp: 0.224 ± 0.012
TrpTyr: 0.265 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.321 ± 0.035
TyrCys: 0.258 ± 0.011
TyrAsp: 1.491 ± 0.036
TyrGlu: 1.201 ± 0.032
TyrPhe: 0.91 ± 0.024
TyrGly: 2.085 ± 0.04
TyrHis: 0.457 ± 0.019
TyrIle: 0.862 ± 0.025
TyrLys: 0.618 ± 0.023
TyrLeu: 2.247 ± 0.038
TyrMet: 0.444 ± 0.017
TyrAsn: 0.536 ± 0.02
TyrPro: 1.071 ± 0.028
TyrGln: 0.649 ± 0.022
TyrArg: 1.631 ± 0.035
TyrSer: 1.137 ± 0.028
TyrThr: 0.96 ± 0.025
TyrVal: 1.723 ± 0.036
TyrTrp: 0.31 ± 0.014
TyrTyr: 0.566 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4966 proteins (1563237 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
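A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are all assumptions of this example; the page does not state which statistic is reported after the ± sign.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Whole proteins are drawn with replacement, so residues from the
    same protein always stay together in a resample.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy one-letter sequences, not taken from the proteome.
toy = ["AAAC", "ACAA", "CCAC", "AACA", "CAAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the estimate is made at the protein level.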

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski