Amino acid dipeptide frequency for Johnsonella ignava ATCC 51276

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
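The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the function name `classify` is hypothetical, and the strict greater-than/less-than rule is an assumption, since the page does not state how the red and blue thresholds are actually chosen. The sample values are taken from the table on this page.

```python
# Uniform baseline: each of the 20 x 20 standard dipeptides equally likely.
STANDARD_AMINO_ACIDS = 20
expected_per_mille = 1000 / STANDARD_AMINO_ACIDS**2  # 0.25 % expressed in per mille


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
sample = {"LysLys": 8.791, "AlaAla": 6.407, "CysCys": 0.217, "TrpTrp": 0.089}
labels = {pair: classify(value) for pair, value in sample.items()}

for pair, label in labels.items():
    print(f"{pair}: {sample[pair]} per mille -> {label}")
```

With this baseline, LysLys and AlaAla come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.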

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
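As a rough sketch of how such per mille values could be obtained, the Python snippet below counts overlapping residue pairs in a set of sequences. It makes several assumptions that the page does not confirm: one-letter residue codes, overlapping windows, and pooling of counts over all proteins. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows: residues i and i+1 form one dipeptide.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): "AAKL" gives AA, AK, KL; "KLA" gives KL, LA.
toy = ["AAKL", "KLA"]
freqs = dipeptide_per_mille(toy)

# Converting a tabulated per mille value back to a fraction:
ala_ala_fraction = 6.407 * 1e-3  # AlaAla from the table
```

By construction the per mille values of all observed dipeptides sum to 1000.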

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.407 ± 0.13
AlaCys: 0.999 ± 0.038
AlaAsp: 4.48 ± 0.083
AlaGlu: 4.017 ± 0.085
AlaPhe: 2.975 ± 0.07
AlaGly: 5.693 ± 0.12
AlaHis: 0.987 ± 0.038
AlaIle: 4.493 ± 0.093
AlaLys: 4.875 ± 0.102
AlaLeu: 6.301 ± 0.107
AlaMet: 1.798 ± 0.057
AlaAsn: 2.386 ± 0.055
AlaPro: 1.578 ± 0.057
AlaGln: 2.127 ± 0.056
AlaArg: 2.541 ± 0.069
AlaSer: 3.968 ± 0.091
AlaThr: 2.419 ± 0.063
AlaVal: 5.88 ± 0.117
AlaTrp: 0.491 ± 0.027
AlaTyr: 2.719 ± 0.058
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.843 ± 0.037
CysCys: 0.217 ± 0.016
CysAsp: 0.711 ± 0.028
CysGlu: 0.736 ± 0.032
CysPhe: 0.558 ± 0.032
CysGly: 1.115 ± 0.038
CysHis: 0.248 ± 0.015
CysIle: 1.379 ± 0.052
CysLys: 0.829 ± 0.035
CysLeu: 0.947 ± 0.036
CysMet: 0.386 ± 0.026
CysAsn: 0.565 ± 0.025
CysPro: 0.472 ± 0.034
CysGln: 0.204 ± 0.02
CysArg: 0.588 ± 0.03
CysSer: 0.955 ± 0.04
CysThr: 0.654 ± 0.032
CysVal: 0.717 ± 0.03
CysTrp: 0.061 ± 0.011
CysTyr: 0.446 ± 0.024
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.522 ± 0.078
AspCys: 0.659 ± 0.031
AspAsp: 3.481 ± 0.081
AspGlu: 5.234 ± 0.1
AspPhe: 3.138 ± 0.065
AspGly: 4.143 ± 0.097
AspHis: 0.642 ± 0.032
AspIle: 6.906 ± 0.121
AspLys: 5.343 ± 0.1
AspLeu: 4.35 ± 0.077
AspMet: 2.011 ± 0.053
AspAsn: 3.27 ± 0.084
AspPro: 1.455 ± 0.057
AspGln: 0.741 ± 0.036
AspArg: 2.419 ± 0.069
AspSer: 3.617 ± 0.074
AspThr: 3.148 ± 0.067
AspVal: 3.296 ± 0.071
AspTrp: 0.45 ± 0.03
AspTyr: 2.875 ± 0.073
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.815 ± 0.091
GluCys: 0.69 ± 0.03
GluAsp: 4.171 ± 0.095
GluGlu: 5.51 ± 0.137
GluPhe: 2.757 ± 0.059
GluGly: 3.968 ± 0.078
GluHis: 1.265 ± 0.052
GluIle: 6.051 ± 0.105
GluLys: 7.289 ± 0.136
GluLeu: 6.602 ± 0.118
GluMet: 1.77 ± 0.052
GluAsn: 5.22 ± 0.095
GluPro: 1.703 ± 0.054
GluGln: 2.004 ± 0.056
GluArg: 2.976 ± 0.075
GluSer: 3.756 ± 0.068
GluThr: 3.05 ± 0.066
GluVal: 4.09 ± 0.082
GluTrp: 0.528 ± 0.026
GluTyr: 3.563 ± 0.082
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.865 ± 0.072
PheCys: 0.611 ± 0.028
PheAsp: 2.727 ± 0.065
PheGlu: 3.004 ± 0.063
PhePhe: 1.997 ± 0.065
PheGly: 2.682 ± 0.057
PheHis: 0.513 ± 0.03
PheIle: 4.298 ± 0.096
PheLys: 3.441 ± 0.068
PheLeu: 3.711 ± 0.083
PheMet: 1.321 ± 0.043
PheAsn: 2.408 ± 0.06
PhePro: 1.16 ± 0.037
PheGln: 0.875 ± 0.034
PheArg: 1.451 ± 0.052
PheSer: 3.232 ± 0.068
PheThr: 2.187 ± 0.059
PheVal: 2.495 ± 0.06
PheTrp: 0.353 ± 0.021
PheTyr: 1.914 ± 0.053
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.284 ± 0.098
GlyCys: 1.0 ± 0.045
GlyAsp: 3.274 ± 0.068
GlyGlu: 4.354 ± 0.092
GlyPhe: 3.256 ± 0.076
GlyGly: 4.827 ± 0.126
GlyHis: 1.06 ± 0.038
GlyIle: 6.712 ± 0.101
GlyLys: 5.68 ± 0.091
GlyLeu: 5.261 ± 0.101
GlyMet: 2.139 ± 0.064
GlyAsn: 3.504 ± 0.095
GlyPro: 1.073 ± 0.046
GlyGln: 1.572 ± 0.043
GlyArg: 2.924 ± 0.069
GlySer: 4.642 ± 0.101
GlyThr: 3.431 ± 0.075
GlyVal: 4.209 ± 0.094
GlyTrp: 0.659 ± 0.04
GlyTyr: 3.18 ± 0.068
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.894 ± 0.034
HisCys: 0.211 ± 0.017
HisAsp: 0.915 ± 0.037
HisGlu: 1.053 ± 0.052
HisPhe: 0.721 ± 0.036
HisGly: 1.048 ± 0.037
HisHis: 0.298 ± 0.026
HisIle: 1.594 ± 0.041
HisLys: 1.114 ± 0.035
HisLeu: 1.123 ± 0.037
HisMet: 0.432 ± 0.024
HisAsn: 0.825 ± 0.03
HisPro: 0.627 ± 0.028
HisGln: 0.347 ± 0.023
HisArg: 0.706 ± 0.031
HisSer: 1.019 ± 0.043
HisThr: 0.797 ± 0.03
HisVal: 0.846 ± 0.037
HisTrp: 0.15 ± 0.015
HisTyr: 0.692 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.587 ± 0.118
IleCys: 1.273 ± 0.045
IleAsp: 5.375 ± 0.088
IleGlu: 6.041 ± 0.095
IlePhe: 3.774 ± 0.089
IleGly: 5.37 ± 0.116
IleHis: 1.281 ± 0.043
IleIle: 7.604 ± 0.128
IleLys: 7.879 ± 0.12
IleLeu: 7.433 ± 0.119
IleMet: 2.298 ± 0.064
IleAsn: 4.742 ± 0.086
IlePro: 3.115 ± 0.066
IleGln: 2.008 ± 0.049
IleArg: 3.379 ± 0.069
IleSer: 6.532 ± 0.108
IleThr: 4.156 ± 0.07
IleVal: 4.959 ± 0.097
IleTrp: 0.599 ± 0.028
IleTyr: 3.704 ± 0.08
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.569 ± 0.097
LysCys: 0.801 ± 0.036
LysAsp: 5.551 ± 0.095
LysGlu: 7.66 ± 0.142
LysPhe: 2.594 ± 0.06
LysGly: 4.8 ± 0.082
LysHis: 1.321 ± 0.045
LysIle: 6.669 ± 0.103
LysLys: 8.791 ± 0.128
LysLeu: 7.005 ± 0.109
LysMet: 2.105 ± 0.056
LysAsn: 5.813 ± 0.108
LysPro: 2.314 ± 0.062
LysGln: 2.391 ± 0.068
LysArg: 3.776 ± 0.073
LysSer: 5.254 ± 0.096
LysThr: 4.66 ± 0.088
LysVal: 4.463 ± 0.09
LysTrp: 0.554 ± 0.03
LysTyr: 3.747 ± 0.078
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 5.52 ± 0.097
LeuCys: 1.273 ± 0.043
LeuAsp: 4.928 ± 0.098
LeuGlu: 5.91 ± 0.111
LeuPhe: 3.765 ± 0.087
LeuGly: 5.363 ± 0.092
LeuHis: 1.297 ± 0.044
LeuIle: 6.921 ± 0.126
LeuLys: 8.134 ± 0.115
LeuLeu: 7.39 ± 0.127
LeuMet: 2.329 ± 0.057
LeuAsn: 4.77 ± 0.086
LeuPro: 3.012 ± 0.067
LeuGln: 1.968 ± 0.052
LeuArg: 3.39 ± 0.079
LeuSer: 6.866 ± 0.099
LeuThr: 4.103 ± 0.079
LeuVal: 4.475 ± 0.086
LeuTrp: 0.646 ± 0.032
LeuTyr: 3.659 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.037 ± 0.06
MetCys: 0.334 ± 0.021
MetAsp: 1.696 ± 0.044
MetGlu: 1.999 ± 0.053
MetPhe: 1.012 ± 0.037
MetGly: 1.7 ± 0.056
MetHis: 0.458 ± 0.027
MetIle: 2.172 ± 0.063
MetLys: 2.412 ± 0.06
MetLeu: 2.806 ± 0.07
MetMet: 0.71 ± 0.037
MetAsn: 1.438 ± 0.052
MetPro: 1.114 ± 0.034
MetGln: 0.873 ± 0.032
MetArg: 1.093 ± 0.036
MetSer: 1.784 ± 0.046
MetThr: 1.24 ± 0.043
MetVal: 1.59 ± 0.047
MetTrp: 0.18 ± 0.014
MetTyr: 1.024 ± 0.042
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.392 ± 0.082
AsnCys: 0.582 ± 0.031
AsnAsp: 3.009 ± 0.064
AsnGlu: 3.745 ± 0.09
AsnPhe: 2.166 ± 0.056
AsnGly: 3.187 ± 0.075
AsnHis: 0.818 ± 0.037
AsnIle: 5.683 ± 0.098
AsnLys: 4.616 ± 0.091
AsnLeu: 4.346 ± 0.088
AsnMet: 1.554 ± 0.056
AsnAsn: 2.975 ± 0.074
AsnPro: 2.16 ± 0.067
AsnGln: 1.278 ± 0.049
AsnArg: 2.135 ± 0.06
AsnSer: 3.471 ± 0.076
AsnThr: 3.155 ± 0.074
AsnVal: 2.967 ± 0.069
AsnTrp: 0.424 ± 0.024
AsnTyr: 2.224 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.022 ± 0.052
ProCys: 0.337 ± 0.022
ProAsp: 2.392 ± 0.071
ProGlu: 2.642 ± 0.077
ProPhe: 1.366 ± 0.048
ProGly: 1.983 ± 0.062
ProHis: 0.56 ± 0.028
ProIle: 1.869 ± 0.056
ProLys: 2.041 ± 0.054
ProLeu: 2.404 ± 0.054
ProMet: 0.692 ± 0.031
ProAsn: 1.175 ± 0.046
ProPro: 0.755 ± 0.035
ProGln: 0.961 ± 0.042
ProArg: 0.83 ± 0.04
ProSer: 1.739 ± 0.046
ProThr: 1.239 ± 0.048
ProVal: 2.709 ± 0.059
ProTrp: 0.22 ± 0.018
ProTyr: 1.446 ± 0.048
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.789 ± 0.055
GlnCys: 0.207 ± 0.016
GlnAsp: 1.284 ± 0.038
GlnGlu: 1.911 ± 0.055
GlnPhe: 0.89 ± 0.037
GlnGly: 1.703 ± 0.056
GlnHis: 0.353 ± 0.021
GlnIle: 2.03 ± 0.055
GlnLys: 2.388 ± 0.051
GlnLeu: 2.179 ± 0.059
GlnMet: 0.737 ± 0.035
GlnAsn: 1.59 ± 0.051
GlnPro: 0.682 ± 0.037
GlnGln: 0.728 ± 0.036
GlnArg: 1.152 ± 0.039
GlnSer: 1.713 ± 0.048
GlnThr: 1.313 ± 0.047
GlnVal: 1.367 ± 0.041
GlnTrp: 0.245 ± 0.019
GlnTyr: 1.015 ± 0.039
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.411 ± 0.061
ArgCys: 0.465 ± 0.024
ArgAsp: 2.216 ± 0.06
ArgGlu: 2.957 ± 0.081
ArgPhe: 1.768 ± 0.049
ArgGly: 2.366 ± 0.06
ArgHis: 0.772 ± 0.032
ArgIle: 3.64 ± 0.076
ArgLys: 3.264 ± 0.08
ArgLeu: 4.155 ± 0.084
ArgMet: 1.253 ± 0.039
ArgAsn: 2.077 ± 0.065
ArgPro: 1.086 ± 0.039
ArgGln: 1.393 ± 0.047
ArgArg: 1.8 ± 0.051
ArgSer: 2.03 ± 0.053
ArgThr: 1.81 ± 0.062
ArgVal: 2.365 ± 0.065
ArgTrp: 0.309 ± 0.019
ArgTyr: 2.044 ± 0.058
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.433 ± 0.081
SerCys: 0.801 ± 0.031
SerAsp: 4.382 ± 0.092
SerGlu: 4.614 ± 0.09
SerPhe: 2.993 ± 0.063
SerGly: 5.94 ± 0.112
SerHis: 1.008 ± 0.034
SerIle: 5.376 ± 0.091
SerLys: 5.025 ± 0.101
SerLeu: 5.425 ± 0.106
SerMet: 1.776 ± 0.053
SerAsn: 2.887 ± 0.067
SerPro: 1.777 ± 0.047
SerGln: 1.764 ± 0.058
SerArg: 2.711 ± 0.058
SerSer: 4.593 ± 0.112
SerThr: 2.948 ± 0.07
SerVal: 4.095 ± 0.073
SerTrp: 0.528 ± 0.027
SerTyr: 2.859 ± 0.075
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.947 ± 0.087
ThrCys: 0.512 ± 0.029
ThrAsp: 3.274 ± 0.075
ThrGlu: 3.01 ± 0.074
ThrPhe: 1.927 ± 0.059
ThrGly: 4.239 ± 0.094
ThrHis: 0.788 ± 0.032
ThrIle: 3.375 ± 0.085
ThrLys: 3.297 ± 0.069
ThrLeu: 4.122 ± 0.073
ThrMet: 1.121 ± 0.039
ThrAsn: 1.959 ± 0.055
ThrPro: 1.915 ± 0.057
ThrGln: 1.325 ± 0.045
ThrArg: 1.602 ± 0.049
ThrSer: 3.093 ± 0.076
ThrThr: 2.268 ± 0.065
ThrVal: 3.9 ± 0.085
ThrTrp: 0.373 ± 0.023
ThrTyr: 1.871 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.603 ± 0.093
ValCys: 0.972 ± 0.04
ValAsp: 3.439 ± 0.072
ValGlu: 3.747 ± 0.08
ValPhe: 3.188 ± 0.072
ValGly: 3.623 ± 0.094
ValHis: 0.914 ± 0.036
ValIle: 5.403 ± 0.106
ValLys: 4.972 ± 0.083
ValLeu: 6.128 ± 0.105
ValMet: 1.749 ± 0.056
ValAsn: 3.197 ± 0.057
ValPro: 1.975 ± 0.056
ValGln: 1.475 ± 0.051
ValArg: 2.406 ± 0.066
ValSer: 4.491 ± 0.084
ValThr: 2.556 ± 0.068
ValVal: 3.813 ± 0.092
ValTrp: 0.477 ± 0.025
ValTyr: 2.839 ± 0.066
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.46 ± 0.025
TrpCys: 0.114 ± 0.014
TrpAsp: 0.471 ± 0.026
TrpGlu: 0.484 ± 0.026
TrpPhe: 0.343 ± 0.02
TrpGly: 0.471 ± 0.025
TrpHis: 0.15 ± 0.015
TrpIle: 0.684 ± 0.034
TrpLys: 0.67 ± 0.031
TrpLeu: 0.749 ± 0.035
TrpMet: 0.207 ± 0.015
TrpAsn: 0.431 ± 0.026
TrpPro: 0.114 ± 0.013
TrpGln: 0.326 ± 0.022
TrpArg: 0.325 ± 0.022
TrpSer: 0.362 ± 0.025
TrpThr: 0.292 ± 0.021
TrpVal: 0.416 ± 0.026
TrpTrp: 0.089 ± 0.012
TrpTyr: 0.515 ± 0.044
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.732 ± 0.064
TyrCys: 0.573 ± 0.026
TyrAsp: 2.923 ± 0.09
TyrGlu: 3.151 ± 0.076
TyrPhe: 2.057 ± 0.063
TyrGly: 2.821 ± 0.06
TyrHis: 0.682 ± 0.029
TyrIle: 4.249 ± 0.101
TyrLys: 3.778 ± 0.087
TyrLeu: 3.28 ± 0.065
TyrMet: 1.281 ± 0.048
TyrAsn: 2.656 ± 0.07
TyrPro: 1.293 ± 0.043
TyrGln: 0.916 ± 0.037
TyrArg: 1.979 ± 0.055
TyrSer: 2.822 ± 0.068
TyrThr: 2.491 ± 0.061
TyrVal: 2.314 ± 0.056
TyrTrp: 0.337 ± 0.021
TyrTyr: 2.16 ± 0.074
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2284 proteins (754037 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
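Bootstrapping at the protein level could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Several details are assumptions rather than the database's documented procedure: the use of the sample standard deviation as the error, one-letter residue codes, overlapping dipeptide windows, and the hypothetical names `pair_counts` and `bootstrap_error`.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency
    over `replicates` resamples of whole proteins (assumed error measure)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy sequences, not real proteome data.
toy = ["AAKL", "KLA", "AAAA", "KLKL"]
error_kl = bootstrap_error(toy, "KL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.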

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski