Amino acid dipepetide frequency for Nitrospira defluvii

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.293AlaAla: 11.293 ± 0.139
1.17AlaCys: 1.17 ± 0.031
5.056AlaAsp: 5.056 ± 0.072
6.095AlaGlu: 6.095 ± 0.076
3.31AlaPhe: 3.31 ± 0.054
8.572AlaGly: 8.572 ± 0.095
2.196AlaHis: 2.196 ± 0.045
5.068AlaIle: 5.068 ± 0.069
4.164AlaLys: 4.164 ± 0.065
11.128AlaLeu: 11.128 ± 0.12
2.784AlaMet: 2.784 ± 0.051
2.352AlaAsn: 2.352 ± 0.056
4.478AlaPro: 4.478 ± 0.073
4.278AlaGln: 4.278 ± 0.066
6.607AlaArg: 6.607 ± 0.082
5.428AlaSer: 5.428 ± 0.067
5.49AlaThr: 5.49 ± 0.071
7.931AlaVal: 7.931 ± 0.09
1.356AlaTrp: 1.356 ± 0.036
2.465AlaTyr: 2.465 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.029
0.166CysCys: 0.166 ± 0.012
0.465CysAsp: 0.465 ± 0.021
0.542CysGlu: 0.542 ± 0.022
0.351CysPhe: 0.351 ± 0.019
0.958CysGly: 0.958 ± 0.026
0.357CysHis: 0.357 ± 0.021
0.375CysIle: 0.375 ± 0.015
0.298CysLys: 0.298 ± 0.013
1.182CysLeu: 1.182 ± 0.032
0.223CysMet: 0.223 ± 0.013
0.226CysAsn: 0.226 ± 0.014
0.605CysPro: 0.605 ± 0.024
0.375CysGln: 0.375 ± 0.018
0.816CysArg: 0.816 ± 0.025
0.632CysSer: 0.632 ± 0.02
0.511CysThr: 0.511 ± 0.023
0.703CysVal: 0.703 ± 0.024
0.146CysTrp: 0.146 ± 0.01
0.279CysTyr: 0.279 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.967AspAla: 4.967 ± 0.066
0.543AspCys: 0.543 ± 0.022
2.416AspAsp: 2.416 ± 0.055
3.176AspGlu: 3.176 ± 0.046
1.774AspPhe: 1.774 ± 0.036
4.053AspGly: 4.053 ± 0.062
1.365AspHis: 1.365 ± 0.031
2.565AspIle: 2.565 ± 0.046
1.549AspLys: 1.549 ± 0.039
5.946AspLeu: 5.946 ± 0.079
1.005AspMet: 1.005 ± 0.03
1.112AspAsn: 1.112 ± 0.031
3.11AspPro: 3.11 ± 0.05
2.191AspGln: 2.191 ± 0.041
4.299AspArg: 4.299 ± 0.06
2.525AspSer: 2.525 ± 0.046
2.454AspThr: 2.454 ± 0.045
3.98AspVal: 3.98 ± 0.067
0.774AspTrp: 0.774 ± 0.029
1.383AspTyr: 1.383 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
6.493GluAla: 6.493 ± 0.076
0.479GluCys: 0.479 ± 0.019
2.59GluAsp: 2.59 ± 0.046
3.797GluGlu: 3.797 ± 0.08
2.064GluPhe: 2.064 ± 0.042
4.211GluGly: 4.211 ± 0.057
1.452GluHis: 1.452 ± 0.033
3.067GluIle: 3.067 ± 0.05
2.419GluLys: 2.419 ± 0.056
5.997GluLeu: 5.997 ± 0.077
1.39GluMet: 1.39 ± 0.031
1.251GluAsn: 1.251 ± 0.03
2.655GluPro: 2.655 ± 0.05
3.403GluGln: 3.403 ± 0.062
5.17GluArg: 5.17 ± 0.077
3.432GluSer: 3.432 ± 0.056
3.181GluThr: 3.181 ± 0.052
4.317GluVal: 4.317 ± 0.068
0.893GluTrp: 0.893 ± 0.03
1.357GluTyr: 1.357 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.26PheAla: 3.26 ± 0.048
0.45PheCys: 0.45 ± 0.022
2.127PheAsp: 2.127 ± 0.04
2.045PheGlu: 2.045 ± 0.043
1.416PhePhe: 1.416 ± 0.038
3.106PheGly: 3.106 ± 0.059
0.938PheHis: 0.938 ± 0.028
1.501PheIle: 1.501 ± 0.035
1.131PheLys: 1.131 ± 0.033
3.746PheLeu: 3.746 ± 0.067
0.784PheMet: 0.784 ± 0.029
1.064PheAsn: 1.064 ± 0.027
1.728PhePro: 1.728 ± 0.035
1.303PheGln: 1.303 ± 0.033
2.339PheArg: 2.339 ± 0.043
2.356PheSer: 2.356 ± 0.042
2.113PheThr: 2.113 ± 0.037
2.709PheVal: 2.709 ± 0.05
0.528PheTrp: 0.528 ± 0.021
0.973PheTyr: 0.973 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
6.916GlyAla: 6.916 ± 0.096
0.984GlyCys: 0.984 ± 0.028
3.688GlyAsp: 3.688 ± 0.07
4.443GlyGlu: 4.443 ± 0.058
3.106GlyPhe: 3.106 ± 0.055
6.33GlyGly: 6.33 ± 0.107
1.975GlyHis: 1.975 ± 0.043
4.259GlyIle: 4.259 ± 0.06
3.47GlyLys: 3.47 ± 0.054
8.538GlyLeu: 8.538 ± 0.089
2.17GlyMet: 2.17 ± 0.039
2.168GlyAsn: 2.168 ± 0.068
3.11GlyPro: 3.11 ± 0.052
3.229GlyGln: 3.229 ± 0.055
5.543GlyArg: 5.543 ± 0.071
4.693GlySer: 4.693 ± 0.072
4.998GlyThr: 4.998 ± 0.083
5.853GlyVal: 5.853 ± 0.075
1.244GlyTrp: 1.244 ± 0.03
2.318GlyTyr: 2.318 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
2.319HisAla: 2.319 ± 0.048
0.31HisCys: 0.31 ± 0.017
1.241HisAsp: 1.241 ± 0.032
1.384HisGlu: 1.384 ± 0.03
0.865HisPhe: 0.865 ± 0.028
1.874HisGly: 1.874 ± 0.039
0.799HisHis: 0.799 ± 0.026
1.14HisIle: 1.14 ± 0.027
0.672HisLys: 0.672 ± 0.021
2.569HisLeu: 2.569 ± 0.048
0.492HisMet: 0.492 ± 0.018
0.562HisAsn: 0.562 ± 0.023
1.663HisPro: 1.663 ± 0.041
0.938HisGln: 0.938 ± 0.026
1.921HisArg: 1.921 ± 0.04
1.239HisSer: 1.239 ± 0.028
1.145HisThr: 1.145 ± 0.032
1.724HisVal: 1.724 ± 0.039
0.336HisTrp: 0.336 ± 0.016
0.661HisTyr: 0.661 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.117IleAla: 5.117 ± 0.059
0.467IleCys: 0.467 ± 0.022
2.869IleAsp: 2.869 ± 0.048
3.458IleGlu: 3.458 ± 0.064
1.45IlePhe: 1.45 ± 0.035
4.451IleGly: 4.451 ± 0.07
1.22IleHis: 1.22 ± 0.032
1.994IleIle: 1.994 ± 0.045
1.862IleLys: 1.862 ± 0.046
4.813IleLeu: 4.813 ± 0.069
0.864IleMet: 0.864 ± 0.025
1.31IleAsn: 1.31 ± 0.038
2.858IlePro: 2.858 ± 0.046
1.949IleGln: 1.949 ± 0.036
3.513IleArg: 3.513 ± 0.058
2.617IleSer: 2.617 ± 0.046
2.892IleThr: 2.892 ± 0.05
4.111IleVal: 4.111 ± 0.059
0.505IleTrp: 0.505 ± 0.02
1.096IleTyr: 1.096 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
3.988LysAla: 3.988 ± 0.066
0.245LysCys: 0.245 ± 0.015
2.177LysAsp: 2.177 ± 0.047
2.573LysGlu: 2.573 ± 0.052
1.019LysPhe: 1.019 ± 0.034
2.976LysGly: 2.976 ± 0.06
0.789LysHis: 0.789 ± 0.025
1.904LysIle: 1.904 ± 0.051
1.92LysLys: 1.92 ± 0.052
3.39LysLeu: 3.39 ± 0.053
0.903LysMet: 0.903 ± 0.027
1.064LysAsn: 1.064 ± 0.026
2.113LysPro: 2.113 ± 0.043
1.784LysGln: 1.784 ± 0.041
2.632LysArg: 2.632 ± 0.052
2.096LysSer: 2.096 ± 0.041
2.353LysThr: 2.353 ± 0.046
2.77LysVal: 2.77 ± 0.058
0.372LysTrp: 0.372 ± 0.02
0.795LysTyr: 0.795 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
11.416LeuAla: 11.416 ± 0.136
1.215LeuCys: 1.215 ± 0.031
5.694LeuAsp: 5.694 ± 0.067
5.826LeuGlu: 5.826 ± 0.07
3.923LeuPhe: 3.923 ± 0.073
7.713LeuGly: 7.713 ± 0.085
2.391LeuHis: 2.391 ± 0.05
5.009LeuIle: 5.009 ± 0.078
4.187LeuLys: 4.187 ± 0.068
11.536LeuLeu: 11.536 ± 0.126
2.295LeuMet: 2.295 ± 0.046
2.929LeuAsn: 2.929 ± 0.047
5.572LeuPro: 5.572 ± 0.065
4.003LeuGln: 4.003 ± 0.063
7.47LeuArg: 7.47 ± 0.089
6.973LeuSer: 6.973 ± 0.079
6.698LeuThr: 6.698 ± 0.084
7.806LeuVal: 7.806 ± 0.086
1.266LeuTrp: 1.266 ± 0.036
2.584LeuTyr: 2.584 ± 0.052
0.0LeuXaa: 0.0 ± 0.0
Met
2.647MetAla: 2.647 ± 0.046
0.177MetCys: 0.177 ± 0.012
1.143MetAsp: 1.143 ± 0.032
1.3MetGlu: 1.3 ± 0.034
0.693MetPhe: 0.693 ± 0.022
1.791MetGly: 1.791 ± 0.045
0.466MetHis: 0.466 ± 0.019
1.175MetIle: 1.175 ± 0.033
1.214MetLys: 1.214 ± 0.033
2.249MetLeu: 2.249 ± 0.042
0.615MetMet: 0.615 ± 0.024
0.815MetAsn: 0.815 ± 0.024
1.373MetPro: 1.373 ± 0.031
0.896MetGln: 0.896 ± 0.028
1.577MetArg: 1.577 ± 0.033
1.4MetSer: 1.4 ± 0.033
1.699MetThr: 1.699 ± 0.04
1.842MetVal: 1.842 ± 0.041
0.23MetTrp: 0.23 ± 0.014
0.404MetTyr: 0.404 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.381AsnAla: 2.381 ± 0.044
0.248AsnCys: 0.248 ± 0.012
1.247AsnAsp: 1.247 ± 0.04
1.38AsnGlu: 1.38 ± 0.032
0.918AsnPhe: 0.918 ± 0.027
2.157AsnGly: 2.157 ± 0.046
0.647AsnHis: 0.647 ± 0.021
1.272AsnIle: 1.272 ± 0.033
0.835AsnLys: 0.835 ± 0.028
2.86AsnLeu: 2.86 ± 0.053
0.552AsnMet: 0.552 ± 0.019
0.756AsnAsn: 0.756 ± 0.028
1.899AsnPro: 1.899 ± 0.044
1.106AsnGln: 1.106 ± 0.033
1.977AsnArg: 1.977 ± 0.04
1.324AsnSer: 1.324 ± 0.038
1.292AsnThr: 1.292 ± 0.034
2.026AsnVal: 2.026 ± 0.047
0.356AsnTrp: 0.356 ± 0.017
0.681AsnTyr: 0.681 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
5.487ProAla: 5.487 ± 0.083
0.463ProCys: 0.463 ± 0.019
3.265ProAsp: 3.265 ± 0.057
3.406ProGlu: 3.406 ± 0.054
2.005ProPhe: 2.005 ± 0.037
4.146ProGly: 4.146 ± 0.054
1.185ProHis: 1.185 ± 0.03
2.457ProIle: 2.457 ± 0.049
1.833ProLys: 1.833 ± 0.043
5.168ProLeu: 5.168 ± 0.068
1.237ProMet: 1.237 ± 0.034
1.381ProAsn: 1.381 ± 0.034
2.875ProPro: 2.875 ± 0.084
1.922ProGln: 1.922 ± 0.045
2.775ProArg: 2.775 ± 0.051
3.317ProSer: 3.317 ± 0.056
2.977ProThr: 2.977 ± 0.054
4.352ProVal: 4.352 ± 0.056
0.687ProTrp: 0.687 ± 0.021
1.238ProTyr: 1.238 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
5.078GlnAla: 5.078 ± 0.075
0.324GlnCys: 0.324 ± 0.016
2.073GlnAsp: 2.073 ± 0.042
2.612GlnGlu: 2.612 ± 0.052
1.447GlnPhe: 1.447 ± 0.036
3.095GlnGly: 3.095 ± 0.052
0.954GlnHis: 0.954 ± 0.027
1.953GlnIle: 1.953 ± 0.037
1.358GlnLys: 1.358 ± 0.031
4.017GlnLeu: 4.017 ± 0.065
0.83GlnMet: 0.83 ± 0.024
0.905GlnAsn: 0.905 ± 0.025
2.205GlnPro: 2.205 ± 0.044
2.111GlnGln: 2.111 ± 0.043
3.05GlnArg: 3.05 ± 0.05
2.331GlnSer: 2.331 ± 0.046
2.216GlnThr: 2.216 ± 0.041
3.294GlnVal: 3.294 ± 0.05
0.584GlnTrp: 0.584 ± 0.022
0.993GlnTyr: 0.993 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
6.518ArgAla: 6.518 ± 0.086
0.702ArgCys: 0.702 ± 0.023
3.58ArgAsp: 3.58 ± 0.054
4.615ArgGlu: 4.615 ± 0.071
2.816ArgPhe: 2.816 ± 0.043
4.445ArgGly: 4.445 ± 0.064
1.889ArgHis: 1.889 ± 0.039
3.895ArgIle: 3.895 ± 0.053
2.566ArgLys: 2.566 ± 0.05
8.21ArgLeu: 8.21 ± 0.09
1.973ArgMet: 1.973 ± 0.047
1.812ArgAsn: 1.812 ± 0.041
3.438ArgPro: 3.438 ± 0.052
3.136ArgGln: 3.136 ± 0.049
5.408ArgArg: 5.408 ± 0.08
3.815ArgSer: 3.815 ± 0.056
3.927ArgThr: 3.927 ± 0.059
5.287ArgVal: 5.287 ± 0.067
1.109ArgTrp: 1.109 ± 0.03
2.126ArgTyr: 2.126 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
5.591SerAla: 5.591 ± 0.067
0.552SerCys: 0.552 ± 0.021
2.766SerAsp: 2.766 ± 0.048
3.006SerGlu: 3.006 ± 0.056
2.2SerPhe: 2.2 ± 0.045
5.255SerGly: 5.255 ± 0.077
1.373SerHis: 1.373 ± 0.034
2.83SerIle: 2.83 ± 0.054
1.979SerLys: 1.979 ± 0.042
6.505SerLeu: 6.505 ± 0.08
1.482SerMet: 1.482 ± 0.034
1.441SerAsn: 1.441 ± 0.038
3.329SerPro: 3.329 ± 0.058
2.156SerGln: 2.156 ± 0.046
4.089SerArg: 4.089 ± 0.069
3.706SerSer: 3.706 ± 0.074
3.212SerThr: 3.212 ± 0.052
4.187SerVal: 4.187 ± 0.054
0.799SerTrp: 0.799 ± 0.025
1.44SerTyr: 1.44 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
5.617ThrAla: 5.617 ± 0.078
0.538ThrCys: 0.538 ± 0.022
2.809ThrAsp: 2.809 ± 0.055
3.009ThrGlu: 3.009 ± 0.054
2.151ThrPhe: 2.151 ± 0.043
5.051ThrGly: 5.051 ± 0.076
1.326ThrHis: 1.326 ± 0.033
3.387ThrIle: 3.387 ± 0.053
2.079ThrLys: 2.079 ± 0.042
6.189ThrLeu: 6.189 ± 0.08
1.345ThrMet: 1.345 ± 0.026
1.515ThrAsn: 1.515 ± 0.037
3.408ThrPro: 3.408 ± 0.059
2.11ThrGln: 2.11 ± 0.037
3.347ThrArg: 3.347 ± 0.052
3.126ThrSer: 3.126 ± 0.049
3.467ThrThr: 3.467 ± 0.069
5.028ThrVal: 5.028 ± 0.076
0.674ThrTrp: 0.674 ± 0.025
1.405ThrTyr: 1.405 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
7.707ValAla: 7.707 ± 0.085
0.755ValCys: 0.755 ± 0.025
3.843ValAsp: 3.843 ± 0.057
4.752ValGlu: 4.752 ± 0.071
2.581ValPhe: 2.581 ± 0.047
5.853ValGly: 5.853 ± 0.078
1.549ValHis: 1.549 ± 0.035
3.879ValIle: 3.879 ± 0.057
2.993ValLys: 2.993 ± 0.063
8.112ValLeu: 8.112 ± 0.073
1.909ValMet: 1.909 ± 0.037
2.122ValAsn: 2.122 ± 0.047
3.929ValPro: 3.929 ± 0.057
2.856ValGln: 2.856 ± 0.055
5.489ValArg: 5.489 ± 0.081
4.678ValSer: 4.678 ± 0.075
5.063ValThr: 5.063 ± 0.069
6.407ValVal: 6.407 ± 0.097
0.868ValTrp: 0.868 ± 0.028
1.682ValTyr: 1.682 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
1.109TrpAla: 1.109 ± 0.032
0.153TrpCys: 0.153 ± 0.012
0.652TrpAsp: 0.652 ± 0.023
0.723TrpGlu: 0.723 ± 0.026
0.539TrpPhe: 0.539 ± 0.021
0.892TrpGly: 0.892 ± 0.028
0.329TrpHis: 0.329 ± 0.016
0.694TrpIle: 0.694 ± 0.023
0.567TrpLys: 0.567 ± 0.021
1.556TrpLeu: 1.556 ± 0.033
0.404TrpMet: 0.404 ± 0.017
0.479TrpAsn: 0.479 ± 0.018
0.597TrpPro: 0.597 ± 0.022
0.603TrpGln: 0.603 ± 0.024
0.951TrpArg: 0.951 ± 0.034
0.844TrpSer: 0.844 ± 0.029
0.742TrpThr: 0.742 ± 0.027
0.898TrpVal: 0.898 ± 0.026
0.252TrpTrp: 0.252 ± 0.014
0.389TrpTyr: 0.389 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.286TyrAla: 2.286 ± 0.042
0.299TyrCys: 0.299 ± 0.015
1.487TyrAsp: 1.487 ± 0.04
1.475TyrGlu: 1.475 ± 0.036
0.987TyrPhe: 0.987 ± 0.031
2.107TyrGly: 2.107 ± 0.041
0.654TyrHis: 0.654 ± 0.025
0.991TyrIle: 0.991 ± 0.029
0.766TyrLys: 0.766 ± 0.026
2.794TyrLeu: 2.794 ± 0.046
0.449TyrMet: 0.449 ± 0.019
0.651TyrAsn: 0.651 ± 0.026
1.231TyrPro: 1.231 ± 0.031
1.082TyrGln: 1.082 ± 0.029
2.296TyrArg: 2.296 ± 0.044
1.366TyrSer: 1.366 ± 0.03
1.149TyrThr: 1.149 ± 0.032
1.803TyrVal: 1.803 ± 0.038
0.396TyrTrp: 0.396 ± 0.018
0.79TyrTyr: 0.79 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4263 proteins (1285082 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski