Amino acid dipepetide frequency for Methanospirillum hungatei JF-1 (strain ATCC 27890 / DSM 864 / NBRC 100397 / JF-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.959AlaAla: 5.959 ± 0.106
1.061AlaCys: 1.061 ± 0.038
4.43AlaAsp: 4.43 ± 0.069
4.79AlaGlu: 4.79 ± 0.09
2.687AlaPhe: 2.687 ± 0.056
6.433AlaGly: 6.433 ± 0.109
1.51AlaHis: 1.51 ± 0.043
5.777AlaIle: 5.777 ± 0.099
2.904AlaLys: 2.904 ± 0.073
7.132AlaLeu: 7.132 ± 0.105
2.051AlaMet: 2.051 ± 0.046
1.846AlaAsn: 1.846 ± 0.047
2.476AlaPro: 2.476 ± 0.061
2.006AlaGln: 2.006 ± 0.048
3.686AlaArg: 3.686 ± 0.065
4.063AlaSer: 4.063 ± 0.073
3.353AlaThr: 3.353 ± 0.071
5.218AlaVal: 5.218 ± 0.09
0.773AlaTrp: 0.773 ± 0.032
2.185AlaTyr: 2.185 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.925CysAla: 0.925 ± 0.034
0.28CysCys: 0.28 ± 0.02
0.743CysAsp: 0.743 ± 0.026
0.702CysGlu: 0.702 ± 0.028
0.451CysPhe: 0.451 ± 0.021
1.428CysGly: 1.428 ± 0.057
0.365CysHis: 0.365 ± 0.017
1.114CysIle: 1.114 ± 0.037
0.532CysLys: 0.532 ± 0.023
1.056CysLeu: 1.056 ± 0.033
0.377CysMet: 0.377 ± 0.019
0.481CysAsn: 0.481 ± 0.019
0.834CysPro: 0.834 ± 0.033
0.41CysGln: 0.41 ± 0.02
0.757CysArg: 0.757 ± 0.026
1.0CysSer: 1.0 ± 0.037
0.917CysThr: 0.917 ± 0.028
0.707CysVal: 0.707 ± 0.029
0.148CysTrp: 0.148 ± 0.012
0.435CysTyr: 0.435 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
4.242AspAla: 4.242 ± 0.085
0.626AspCys: 0.626 ± 0.029
3.087AspAsp: 3.087 ± 0.062
4.411AspGlu: 4.411 ± 0.074
2.118AspPhe: 2.118 ± 0.053
4.115AspGly: 4.115 ± 0.075
1.23AspHis: 1.23 ± 0.036
5.506AspIle: 5.506 ± 0.078
2.454AspLys: 2.454 ± 0.055
6.235AspLeu: 6.235 ± 0.087
1.424AspMet: 1.424 ± 0.037
1.908AspAsn: 1.908 ± 0.039
3.337AspPro: 3.337 ± 0.066
1.851AspGln: 1.851 ± 0.05
2.969AspArg: 2.969 ± 0.057
2.994AspSer: 2.994 ± 0.055
3.203AspThr: 3.203 ± 0.058
3.642AspVal: 3.642 ± 0.055
0.613AspTrp: 0.613 ± 0.027
1.844AspTyr: 1.844 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
4.426GluAla: 4.426 ± 0.085
0.657GluCys: 0.657 ± 0.026
3.402GluAsp: 3.402 ± 0.061
5.832GluGlu: 5.832 ± 0.133
2.355GluPhe: 2.355 ± 0.045
4.249GluGly: 4.249 ± 0.07
1.471GluHis: 1.471 ± 0.041
6.195GluIle: 6.195 ± 0.096
5.041GluLys: 5.041 ± 0.082
5.587GluLeu: 5.587 ± 0.102
2.133GluMet: 2.133 ± 0.051
2.633GluAsn: 2.633 ± 0.055
2.585GluPro: 2.585 ± 0.132
2.495GluGln: 2.495 ± 0.046
3.712GluArg: 3.712 ± 0.074
3.739GluSer: 3.739 ± 0.072
3.382GluThr: 3.382 ± 0.064
4.376GluVal: 4.376 ± 0.104
0.686GluTrp: 0.686 ± 0.028
2.594GluTyr: 2.594 ± 0.057
0.0GluXaa: 0.0 ± 0.0
Phe
2.511PheAla: 2.511 ± 0.058
0.663PheCys: 0.663 ± 0.03
2.278PheAsp: 2.278 ± 0.048
2.383PheGlu: 2.383 ± 0.055
2.001PhePhe: 2.001 ± 0.051
3.091PheGly: 3.091 ± 0.077
0.934PheHis: 0.934 ± 0.034
3.375PheIle: 3.375 ± 0.069
1.325PheLys: 1.325 ± 0.036
4.029PheLeu: 4.029 ± 0.082
0.845PheMet: 0.845 ± 0.034
1.21PheAsn: 1.21 ± 0.04
1.72PhePro: 1.72 ± 0.042
1.237PheGln: 1.237 ± 0.032
2.037PheArg: 2.037 ± 0.041
3.643PheSer: 3.643 ± 0.063
2.472PheThr: 2.472 ± 0.059
2.321PheVal: 2.321 ± 0.05
0.458PheTrp: 0.458 ± 0.024
1.28PheTyr: 1.28 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
4.636GlyAla: 4.636 ± 0.095
1.205GlyCys: 1.205 ± 0.038
4.23GlyAsp: 4.23 ± 0.08
4.643GlyGlu: 4.643 ± 0.082
3.15GlyPhe: 3.15 ± 0.071
4.902GlyGly: 4.902 ± 0.099
1.319GlyHis: 1.319 ± 0.038
6.928GlyIle: 6.928 ± 0.094
4.538GlyLys: 4.538 ± 0.065
6.058GlyLeu: 6.058 ± 0.086
2.365GlyMet: 2.365 ± 0.059
2.642GlyAsn: 2.642 ± 0.066
2.387GlyPro: 2.387 ± 0.063
2.036GlyGln: 2.036 ± 0.051
3.626GlyArg: 3.626 ± 0.072
4.879GlySer: 4.879 ± 0.072
4.891GlyThr: 4.891 ± 0.086
4.928GlyVal: 4.928 ± 0.078
0.932GlyTrp: 0.932 ± 0.033
3.223GlyTyr: 3.223 ± 0.07
0.0GlyXaa: 0.0 ± 0.0
His
1.682HisAla: 1.682 ± 0.049
0.321HisCys: 0.321 ± 0.018
1.417HisAsp: 1.417 ± 0.038
1.555HisGlu: 1.555 ± 0.039
0.728HisPhe: 0.728 ± 0.027
1.509HisGly: 1.509 ± 0.041
0.64HisHis: 0.64 ± 0.033
1.807HisIle: 1.807 ± 0.045
0.78HisLys: 0.78 ± 0.029
2.301HisLeu: 2.301 ± 0.055
0.463HisMet: 0.463 ± 0.02
0.737HisAsn: 0.737 ± 0.03
1.472HisPro: 1.472 ± 0.04
0.869HisGln: 0.869 ± 0.032
1.048HisArg: 1.048 ± 0.034
1.227HisSer: 1.227 ± 0.034
1.267HisThr: 1.267 ± 0.041
1.331HisVal: 1.331 ± 0.035
0.186HisTrp: 0.186 ± 0.013
0.682HisTyr: 0.682 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.267IleAla: 6.267 ± 0.094
1.289IleCys: 1.289 ± 0.035
4.403IleAsp: 4.403 ± 0.071
5.166IleGlu: 5.166 ± 0.079
3.245IlePhe: 3.245 ± 0.069
6.061IleGly: 6.061 ± 0.1
2.054IleHis: 2.054 ± 0.042
7.257IleIle: 7.257 ± 0.11
3.181IleLys: 3.181 ± 0.073
8.444IleLeu: 8.444 ± 0.105
1.871IleMet: 1.871 ± 0.043
2.519IleAsn: 2.519 ± 0.062
5.034IlePro: 5.034 ± 0.07
2.98IleGln: 2.98 ± 0.059
5.528IleArg: 5.528 ± 0.079
6.568IleSer: 6.568 ± 0.089
5.648IleThr: 5.648 ± 0.079
4.652IleVal: 4.652 ± 0.074
0.856IleTrp: 0.856 ± 0.032
2.192IleTyr: 2.192 ± 0.051
0.0IleXaa: 0.0 ± 0.0
Lys
3.449LysAla: 3.449 ± 0.069
0.455LysCys: 0.455 ± 0.021
2.732LysAsp: 2.732 ± 0.061
4.205LysGlu: 4.205 ± 0.077
1.515LysPhe: 1.515 ± 0.043
3.76LysGly: 3.76 ± 0.072
0.751LysHis: 0.751 ± 0.026
4.205LysIle: 4.205 ± 0.086
4.038LysLys: 4.038 ± 0.08
3.03LysLeu: 3.03 ± 0.048
1.289LysMet: 1.289 ± 0.039
2.683LysAsn: 2.683 ± 0.048
2.275LysPro: 2.275 ± 0.053
1.536LysGln: 1.536 ± 0.045
2.712LysArg: 2.712 ± 0.055
2.986LysSer: 2.986 ± 0.058
3.35LysThr: 3.35 ± 0.062
2.903LysVal: 2.903 ± 0.058
0.466LysTrp: 0.466 ± 0.022
1.565LysTyr: 1.565 ± 0.043
0.0LysXaa: 0.0 ± 0.0
Leu
6.699LeuAla: 6.699 ± 0.103
1.327LeuCys: 1.327 ± 0.04
4.978LeuAsp: 4.978 ± 0.088
5.406LeuGlu: 5.406 ± 0.084
4.359LeuPhe: 4.359 ± 0.089
5.539LeuGly: 5.539 ± 0.095
2.109LeuHis: 2.109 ± 0.051
8.087LeuIle: 8.087 ± 0.109
4.831LeuLys: 4.831 ± 0.082
8.951LeuLeu: 8.951 ± 0.137
2.426LeuMet: 2.426 ± 0.052
3.529LeuAsn: 3.529 ± 0.062
4.107LeuPro: 4.107 ± 0.066
2.779LeuGln: 2.779 ± 0.05
3.952LeuArg: 3.952 ± 0.066
7.567LeuSer: 7.567 ± 0.087
5.546LeuThr: 5.546 ± 0.079
5.836LeuVal: 5.836 ± 0.077
0.816LeuTrp: 0.816 ± 0.03
2.949LeuTyr: 2.949 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
1.989MetAla: 1.989 ± 0.049
0.251MetCys: 0.251 ± 0.017
1.438MetAsp: 1.438 ± 0.039
1.672MetGlu: 1.672 ± 0.041
0.757MetPhe: 0.757 ± 0.03
1.799MetGly: 1.799 ± 0.052
0.593MetHis: 0.593 ± 0.026
2.234MetIle: 2.234 ± 0.049
1.813MetLys: 1.813 ± 0.038
2.062MetLeu: 2.062 ± 0.055
0.794MetMet: 0.794 ± 0.033
1.453MetAsn: 1.453 ± 0.041
1.057MetPro: 1.057 ± 0.041
0.986MetGln: 0.986 ± 0.031
1.313MetArg: 1.313 ± 0.036
1.526MetSer: 1.526 ± 0.038
1.749MetThr: 1.749 ± 0.046
1.958MetVal: 1.958 ± 0.054
0.18MetTrp: 0.18 ± 0.014
0.746MetTyr: 0.746 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
2.534AsnAla: 2.534 ± 0.055
0.412AsnCys: 0.412 ± 0.023
1.57AsnAsp: 1.57 ± 0.046
2.245AsnGlu: 2.245 ± 0.057
1.217AsnPhe: 1.217 ± 0.041
2.652AsnGly: 2.652 ± 0.057
0.768AsnHis: 0.768 ± 0.032
2.878AsnIle: 2.878 ± 0.06
1.522AsnLys: 1.522 ± 0.044
3.705AsnLeu: 3.705 ± 0.068
0.811AsnMet: 0.811 ± 0.027
1.441AsnAsn: 1.441 ± 0.044
2.696AsnPro: 2.696 ± 0.07
1.504AsnGln: 1.504 ± 0.041
2.138AsnArg: 2.138 ± 0.045
2.089AsnSer: 2.089 ± 0.065
2.14AsnThr: 2.14 ± 0.058
1.957AsnVal: 1.957 ± 0.048
0.375AsnTrp: 0.375 ± 0.022
1.134AsnTyr: 1.134 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
3.319ProAla: 3.319 ± 0.057
0.601ProCys: 0.601 ± 0.025
4.361ProAsp: 4.361 ± 0.071
4.055ProGlu: 4.055 ± 0.165
1.983ProPhe: 1.983 ± 0.05
3.909ProGly: 3.909 ± 0.076
0.99ProHis: 0.99 ± 0.038
2.872ProIle: 2.872 ± 0.061
1.798ProLys: 1.798 ± 0.046
3.867ProLeu: 3.867 ± 0.064
0.919ProMet: 0.919 ± 0.034
1.252ProAsn: 1.252 ± 0.036
1.853ProPro: 1.853 ± 0.047
1.065ProGln: 1.065 ± 0.034
1.687ProArg: 1.687 ± 0.045
2.881ProSer: 2.881 ± 0.062
2.318ProThr: 2.318 ± 0.146
4.971ProVal: 4.971 ± 0.09
0.457ProTrp: 0.457 ± 0.019
1.705ProTyr: 1.705 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.455GlnAla: 2.455 ± 0.048
0.338GlnCys: 0.338 ± 0.019
1.843GlnAsp: 1.843 ± 0.048
2.453GlnGlu: 2.453 ± 0.054
1.163GlnPhe: 1.163 ± 0.04
2.082GlnGly: 2.082 ± 0.055
0.519GlnHis: 0.519 ± 0.024
2.946GlnIle: 2.946 ± 0.057
2.362GlnLys: 2.362 ± 0.056
2.089GlnLeu: 2.089 ± 0.052
1.012GlnMet: 1.012 ± 0.031
1.487GlnAsn: 1.487 ± 0.044
1.003GlnPro: 1.003 ± 0.034
0.956GlnGln: 0.956 ± 0.04
1.371GlnArg: 1.371 ± 0.039
1.939GlnSer: 1.939 ± 0.049
1.702GlnThr: 1.702 ± 0.042
2.509GlnVal: 2.509 ± 0.054
0.311GlnTrp: 0.311 ± 0.019
1.306GlnTyr: 1.306 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
3.233ArgAla: 3.233 ± 0.069
0.727ArgCys: 0.727 ± 0.029
3.242ArgAsp: 3.242 ± 0.055
4.122ArgGlu: 4.122 ± 0.076
2.458ArgPhe: 2.458 ± 0.057
2.947ArgGly: 2.947 ± 0.057
1.048ArgHis: 1.048 ± 0.031
4.683ArgIle: 4.683 ± 0.077
3.093ArgLys: 3.093 ± 0.059
4.425ArgLeu: 4.425 ± 0.083
1.696ArgMet: 1.696 ± 0.047
2.069ArgAsn: 2.069 ± 0.042
1.853ArgPro: 1.853 ± 0.044
1.614ArgGln: 1.614 ± 0.04
2.592ArgArg: 2.592 ± 0.056
3.123ArgSer: 3.123 ± 0.055
2.767ArgThr: 2.767 ± 0.05
3.187ArgVal: 3.187 ± 0.061
0.57ArgTrp: 0.57 ± 0.027
2.341ArgTyr: 2.341 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
4.411SerAla: 4.411 ± 0.077
0.965SerCys: 0.965 ± 0.035
4.326SerAsp: 4.326 ± 0.07
4.308SerGlu: 4.308 ± 0.07
2.929SerPhe: 2.929 ± 0.06
6.411SerGly: 6.411 ± 0.096
1.658SerHis: 1.658 ± 0.044
4.966SerIle: 4.966 ± 0.069
2.471SerLys: 2.471 ± 0.053
6.461SerLeu: 6.461 ± 0.081
1.75SerMet: 1.75 ± 0.038
1.842SerAsn: 1.842 ± 0.059
3.24SerPro: 3.24 ± 0.063
2.035SerGln: 2.035 ± 0.048
3.453SerArg: 3.453 ± 0.068
4.804SerSer: 4.804 ± 0.086
3.093SerThr: 3.093 ± 0.069
4.566SerVal: 4.566 ± 0.071
0.91SerTrp: 0.91 ± 0.037
2.273SerTyr: 2.273 ± 0.052
0.0SerXaa: 0.0 ± 0.0
Thr
4.079ThrAla: 4.079 ± 0.069
0.816ThrCys: 0.816 ± 0.031
3.345ThrAsp: 3.345 ± 0.072
3.271ThrGlu: 3.271 ± 0.072
2.115ThrPhe: 2.115 ± 0.048
5.882ThrGly: 5.882 ± 0.096
1.287ThrHis: 1.287 ± 0.041
5.022ThrIle: 5.022 ± 0.075
2.252ThrLys: 2.252 ± 0.05
5.408ThrLeu: 5.408 ± 0.076
1.32ThrMet: 1.32 ± 0.042
1.813ThrAsn: 1.813 ± 0.056
3.563ThrPro: 3.563 ± 0.2
1.457ThrGln: 1.457 ± 0.041
2.992ThrArg: 2.992 ± 0.057
3.756ThrSer: 3.756 ± 0.08
3.046ThrThr: 3.046 ± 0.078
4.042ThrVal: 4.042 ± 0.107
0.625ThrTrp: 0.625 ± 0.027
1.931ThrTyr: 1.931 ± 0.058
0.0ThrXaa: 0.0 ± 0.0
Val
4.301ValAla: 4.301 ± 0.075
1.011ValCys: 1.011 ± 0.037
3.222ValAsp: 3.222 ± 0.059
3.436ValGlu: 3.436 ± 0.067
2.834ValPhe: 2.834 ± 0.055
3.626ValGly: 3.626 ± 0.07
1.616ValHis: 1.616 ± 0.041
5.949ValIle: 5.949 ± 0.082
3.11ValLys: 3.11 ± 0.059
6.414ValLeu: 6.414 ± 0.094
1.871ValMet: 1.871 ± 0.047
2.309ValAsn: 2.309 ± 0.059
3.346ValPro: 3.346 ± 0.068
2.159ValGln: 2.159 ± 0.051
3.863ValArg: 3.863 ± 0.066
5.06ValSer: 5.06 ± 0.077
4.596ValThr: 4.596 ± 0.177
4.251ValVal: 4.251 ± 0.073
0.684ValTrp: 0.684 ± 0.029
2.187ValTyr: 2.187 ± 0.051
0.0ValXaa: 0.0 ± 0.0
Trp
0.592TrpAla: 0.592 ± 0.028
0.142TrpCys: 0.142 ± 0.013
0.727TrpAsp: 0.727 ± 0.031
0.683TrpGlu: 0.683 ± 0.026
0.547TrpPhe: 0.547 ± 0.025
0.578TrpGly: 0.578 ± 0.027
0.215TrpHis: 0.215 ± 0.015
0.955TrpIle: 0.955 ± 0.031
0.666TrpLys: 0.666 ± 0.024
0.859TrpLeu: 0.859 ± 0.028
0.306TrpMet: 0.306 ± 0.02
0.659TrpAsn: 0.659 ± 0.03
0.283TrpPro: 0.283 ± 0.017
0.407TrpGln: 0.407 ± 0.021
0.486TrpArg: 0.486 ± 0.024
0.637TrpSer: 0.637 ± 0.028
0.535TrpThr: 0.535 ± 0.027
0.639TrpVal: 0.639 ± 0.026
0.158TrpTrp: 0.158 ± 0.013
0.523TrpTyr: 0.523 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.627TyrAla: 2.627 ± 0.052
0.515TyrCys: 0.515 ± 0.021
2.037TyrAsp: 2.037 ± 0.05
2.21TyrGlu: 2.21 ± 0.049
1.2TyrPhe: 1.2 ± 0.035
2.545TyrGly: 2.545 ± 0.05
0.945TyrHis: 0.945 ± 0.032
2.459TyrIle: 2.459 ± 0.053
1.193TyrLys: 1.193 ± 0.038
3.683TyrLeu: 3.683 ± 0.069
0.645TyrMet: 0.645 ± 0.026
1.275TyrAsn: 1.275 ± 0.045
1.743TyrPro: 1.743 ± 0.046
1.467TyrGln: 1.467 ± 0.037
1.868TyrArg: 1.868 ± 0.049
2.39TyrSer: 2.39 ± 0.056
2.05TyrThr: 2.05 ± 0.055
1.851TyrVal: 1.851 ± 0.039
0.393TyrTrp: 0.393 ± 0.022
1.29TyrTyr: 1.29 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3087 proteins (996237 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski