Amino acid dipeptide frequency for Haliscomenobacter hydrossis (strain ATCC 27775 / DSM 1100 / LMG 10767 / O)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
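As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_per_mille`, the one-letter input format, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for this example, and the page does not state exactly how Proteome-pI performs the count.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption for this sketch: pairs overlap (positions 1-2, 2-3, ...)
    and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy_freqs = dipeptide_per_mille(["MKLLA", "MKA"])
```

With the toy input there are six dipeptides in total (MK, KL, LL, LA, MK, KA), so MK accounts for two of the six.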

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
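The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `fold_vs_uniform` are hypothetical and exist only for this example; the two input values are the LeuLeu and CysTrp entries from the table.

```python
# Uniform expectation if all 400 dipeptides were equally likely: 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per-mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


leu_leu = 10.467  # LeuLeu, per mille
cys_trp = 0.14    # CysTrp, per mille

leu_leu_fraction = per_mille_to_fraction(leu_leu)  # 0.010467, i.e. about 1.05%
leu_leu_fold = fold_vs_uniform(leu_leu)            # above 1: overrepresented
cys_trp_fold = fold_vs_uniform(cys_trp)            # below 1: underrepresented
```

Under this uniform baseline LeuLeu occurs roughly four times more often than expected, while CysTrp occurs far less often than expected.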

For more information, see the sequence space article on Wikipedia.

Residue order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.783 ± 0.079
AlaCys: 0.826 ± 0.023
AlaAsp: 4.142 ± 0.052
AlaGlu: 5.022 ± 0.056
AlaPhe: 3.66 ± 0.045
AlaGly: 5.75 ± 0.061
AlaHis: 1.359 ± 0.028
AlaIle: 5.053 ± 0.044
AlaLys: 4.418 ± 0.051
AlaLeu: 8.307 ± 0.077
AlaMet: 1.753 ± 0.03
AlaAsn: 3.659 ± 0.047
AlaPro: 3.074 ± 0.04
AlaGln: 3.948 ± 0.043
AlaArg: 3.143 ± 0.039
AlaSer: 4.486 ± 0.05
AlaThr: 4.285 ± 0.057
AlaVal: 4.741 ± 0.053
AlaTrp: 1.101 ± 0.02
AlaTyr: 2.848 ± 0.036
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.721 ± 0.02
CysCys: 0.17 ± 0.009
CysAsp: 0.506 ± 0.024
CysGlu: 0.467 ± 0.016
CysPhe: 0.515 ± 0.015
CysGly: 0.743 ± 0.023
CysHis: 0.23 ± 0.011
CysIle: 0.62 ± 0.023
CysLys: 0.468 ± 0.016
CysLeu: 0.922 ± 0.025
CysMet: 0.185 ± 0.008
CysAsn: 0.43 ± 0.015
CysPro: 0.432 ± 0.018
CysGln: 0.406 ± 0.014
CysArg: 0.42 ± 0.014
CysSer: 0.655 ± 0.025
CysThr: 0.637 ± 0.022
CysVal: 0.527 ± 0.02
CysTrp: 0.14 ± 0.008
CysTyr: 0.359 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.181 ± 0.058
AspCys: 0.541 ± 0.024
AspAsp: 2.491 ± 0.052
AspGlu: 3.184 ± 0.045
AspPhe: 3.104 ± 0.037
AspGly: 4.119 ± 0.137
AspHis: 0.973 ± 0.019
AspIle: 3.139 ± 0.041
AspLys: 2.978 ± 0.042
AspLeu: 5.703 ± 0.054
AspMet: 0.986 ± 0.021
AspAsn: 2.437 ± 0.067
AspPro: 2.648 ± 0.061
AspGln: 2.225 ± 0.031
AspArg: 2.407 ± 0.033
AspSer: 2.676 ± 0.05
AspThr: 2.681 ± 0.067
AspVal: 3.13 ± 0.034
AspTrp: 0.955 ± 0.019
AspTyr: 2.307 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.805 ± 0.052
GluCys: 0.407 ± 0.017
GluAsp: 2.906 ± 0.048
GluGlu: 3.878 ± 0.054
GluPhe: 2.416 ± 0.036
GluGly: 3.741 ± 0.037
GluHis: 1.177 ± 0.028
GluIle: 3.996 ± 0.041
GluLys: 4.528 ± 0.055
GluLeu: 6.153 ± 0.064
GluMet: 1.551 ± 0.027
GluAsn: 2.965 ± 0.035
GluPro: 1.881 ± 0.031
GluGln: 2.763 ± 0.036
GluArg: 2.934 ± 0.038
GluSer: 2.844 ± 0.039
GluThr: 2.908 ± 0.038
GluVal: 4.067 ± 0.049
GluTrp: 0.87 ± 0.02
GluTyr: 2.12 ± 0.029
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.546 ± 0.038
PheCys: 0.544 ± 0.016
PheAsp: 2.844 ± 0.033
PheGlu: 2.81 ± 0.033
PhePhe: 2.709 ± 0.041
PheGly: 3.462 ± 0.038
PheHis: 0.895 ± 0.018
PheIle: 2.867 ± 0.038
PheLys: 2.576 ± 0.037
PheLeu: 4.75 ± 0.057
PheMet: 1.002 ± 0.023
PheAsn: 2.56 ± 0.039
PhePro: 1.989 ± 0.032
PheGln: 1.896 ± 0.027
PheArg: 2.204 ± 0.033
PheSer: 3.745 ± 0.045
PheThr: 2.916 ± 0.035
PheVal: 2.974 ± 0.037
PheTrp: 0.727 ± 0.017
PheTyr: 1.92 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.969 ± 0.059
GlyCys: 0.775 ± 0.028
GlyAsp: 3.473 ± 0.055
GlyGlu: 3.71 ± 0.046
GlyPhe: 3.588 ± 0.039
GlyGly: 5.44 ± 0.085
GlyHis: 1.277 ± 0.025
GlyIle: 5.03 ± 0.067
GlyLys: 5.035 ± 0.054
GlyLeu: 6.936 ± 0.059
GlyMet: 1.732 ± 0.031
GlyAsn: 3.609 ± 0.042
GlyPro: 1.927 ± 0.037
GlyGln: 3.109 ± 0.039
GlyArg: 3.084 ± 0.042
GlySer: 4.318 ± 0.053
GlyThr: 4.231 ± 0.065
GlyVal: 4.706 ± 0.062
GlyTrp: 1.016 ± 0.02
GlyTyr: 2.772 ± 0.036
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.26 ± 0.024
HisCys: 0.219 ± 0.01
HisAsp: 0.871 ± 0.019
HisGlu: 1.023 ± 0.021
HisPhe: 1.29 ± 0.025
HisGly: 1.174 ± 0.023
HisHis: 0.603 ± 0.017
HisIle: 1.147 ± 0.026
HisLys: 0.894 ± 0.02
HisLeu: 2.229 ± 0.033
HisMet: 0.303 ± 0.011
HisAsn: 0.857 ± 0.019
HisPro: 1.241 ± 0.023
HisGln: 0.923 ± 0.021
HisArg: 0.917 ± 0.02
HisSer: 1.06 ± 0.023
HisThr: 1.04 ± 0.02
HisVal: 0.932 ± 0.021
HisTrp: 0.357 ± 0.012
HisTyr: 0.927 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.207 ± 0.05
IleCys: 0.693 ± 0.025
IleAsp: 3.752 ± 0.041
IleGlu: 4.056 ± 0.047
IlePhe: 2.771 ± 0.037
IleGly: 4.491 ± 0.046
IleHis: 1.265 ± 0.025
IleIle: 3.612 ± 0.047
IleLys: 3.676 ± 0.04
IleLeu: 5.94 ± 0.058
IleMet: 1.092 ± 0.019
IleAsn: 3.313 ± 0.039
IlePro: 3.158 ± 0.038
IleGln: 2.725 ± 0.038
IleArg: 3.111 ± 0.04
IleSer: 4.129 ± 0.044
IleThr: 3.55 ± 0.04
IleVal: 3.923 ± 0.04
IleTrp: 0.847 ± 0.019
IleTyr: 2.155 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.71 ± 0.052
LysCys: 0.328 ± 0.012
LysAsp: 3.294 ± 0.039
LysGlu: 3.701 ± 0.047
LysPhe: 2.104 ± 0.036
LysGly: 3.97 ± 0.045
LysHis: 1.044 ± 0.022
LysIle: 4.259 ± 0.052
LysLys: 4.125 ± 0.053
LysLeu: 5.405 ± 0.049
LysMet: 1.7 ± 0.03
LysAsn: 3.226 ± 0.043
LysPro: 2.503 ± 0.036
LysGln: 2.221 ± 0.034
LysArg: 2.493 ± 0.037
LysSer: 3.227 ± 0.038
LysThr: 3.664 ± 0.041
LysVal: 4.018 ± 0.043
LysTrp: 0.756 ± 0.02
LysTyr: 2.14 ± 0.03
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.066 ± 0.076
LeuCys: 0.999 ± 0.026
LeuAsp: 5.447 ± 0.06
LeuGlu: 6.135 ± 0.06
LeuPhe: 4.72 ± 0.052
LeuGly: 6.783 ± 0.064
LeuHis: 2.016 ± 0.034
LeuIle: 6.047 ± 0.066
LeuLys: 6.051 ± 0.062
LeuLeu: 10.467 ± 0.105
LeuMet: 2.166 ± 0.031
LeuAsn: 5.221 ± 0.053
LeuPro: 4.772 ± 0.05
LeuGln: 4.438 ± 0.052
LeuArg: 5.114 ± 0.05
LeuSer: 7.414 ± 0.068
LeuThr: 5.078 ± 0.047
LeuVal: 5.9 ± 0.058
LeuTrp: 1.204 ± 0.027
LeuTyr: 3.373 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.826 ± 0.031
MetCys: 0.146 ± 0.008
MetAsp: 1.166 ± 0.021
MetGlu: 1.36 ± 0.024
MetPhe: 0.71 ± 0.017
MetGly: 1.638 ± 0.03
MetHis: 0.425 ± 0.014
MetIle: 1.251 ± 0.026
MetLys: 1.539 ± 0.026
MetLeu: 2.137 ± 0.035
MetMet: 0.555 ± 0.015
MetAsn: 1.042 ± 0.021
MetPro: 1.091 ± 0.021
MetGln: 0.945 ± 0.02
MetArg: 1.129 ± 0.021
MetSer: 1.36 ± 0.023
MetThr: 1.068 ± 0.019
MetVal: 1.458 ± 0.023
MetTrp: 0.222 ± 0.009
MetTyr: 0.666 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.861 ± 0.045
AsnCys: 0.523 ± 0.021
AsnAsp: 2.45 ± 0.063
AsnGlu: 2.564 ± 0.03
AsnPhe: 2.546 ± 0.034
AsnGly: 3.93 ± 0.062
AsnHis: 0.94 ± 0.021
AsnIle: 3.119 ± 0.042
AsnLys: 2.341 ± 0.03
AsnLeu: 5.288 ± 0.049
AsnMet: 0.938 ± 0.022
AsnAsn: 2.689 ± 0.048
AsnPro: 3.107 ± 0.038
AsnGln: 2.226 ± 0.034
AsnArg: 2.431 ± 0.034
AsnSer: 2.814 ± 0.034
AsnThr: 3.021 ± 0.04
AsnVal: 2.875 ± 0.044
AsnTrp: 0.854 ± 0.017
AsnTyr: 2.105 ± 0.036
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.698 ± 0.044
ProCys: 0.338 ± 0.015
ProAsp: 2.748 ± 0.053
ProGlu: 3.121 ± 0.036
ProPhe: 2.152 ± 0.03
ProGly: 3.12 ± 0.037
ProHis: 0.817 ± 0.02
ProIle: 2.676 ± 0.035
ProLys: 2.327 ± 0.038
ProLeu: 3.971 ± 0.038
ProMet: 0.909 ± 0.019
ProAsn: 2.545 ± 0.033
ProPro: 1.519 ± 0.031
ProGln: 1.864 ± 0.028
ProArg: 1.59 ± 0.024
ProSer: 2.662 ± 0.038
ProThr: 2.421 ± 0.042
ProVal: 2.861 ± 0.038
ProTrp: 0.52 ± 0.016
ProTyr: 1.536 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.506 ± 0.041
GlnCys: 0.275 ± 0.011
GlnAsp: 2.057 ± 0.031
GlnGlu: 2.537 ± 0.041
GlnPhe: 1.927 ± 0.03
GlnGly: 2.812 ± 0.033
GlnHis: 0.993 ± 0.022
GlnIle: 2.75 ± 0.037
GlnLys: 2.779 ± 0.037
GlnLeu: 4.454 ± 0.049
GlnMet: 1.019 ± 0.022
GlnAsn: 2.209 ± 0.032
GlnPro: 1.714 ± 0.027
GlnGln: 2.278 ± 0.032
GlnArg: 2.246 ± 0.033
GlnSer: 2.441 ± 0.034
GlnThr: 2.398 ± 0.036
GlnVal: 2.906 ± 0.036
GlnTrp: 0.599 ± 0.016
GlnTyr: 1.64 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.076 ± 0.039
ArgCys: 0.371 ± 0.012
ArgAsp: 2.424 ± 0.036
ArgGlu: 2.749 ± 0.034
ArgPhe: 2.532 ± 0.032
ArgGly: 2.744 ± 0.035
ArgHis: 0.903 ± 0.02
ArgIle: 3.315 ± 0.04
ArgLys: 2.819 ± 0.035
ArgLeu: 4.686 ± 0.049
ArgMet: 1.259 ± 0.023
ArgAsn: 2.392 ± 0.034
ArgPro: 1.813 ± 0.033
ArgGln: 2.032 ± 0.031
ArgArg: 2.315 ± 0.036
ArgSer: 2.78 ± 0.035
ArgThr: 2.318 ± 0.035
ArgVal: 2.864 ± 0.04
ArgTrp: 0.778 ± 0.019
ArgTyr: 2.04 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.881 ± 0.042
SerCys: 0.663 ± 0.018
SerAsp: 2.972 ± 0.05
SerGlu: 2.986 ± 0.041
SerPhe: 3.291 ± 0.037
SerGly: 4.797 ± 0.059
SerHis: 0.968 ± 0.021
SerIle: 4.51 ± 0.045
SerLys: 3.376 ± 0.042
SerLeu: 6.035 ± 0.058
SerMet: 1.266 ± 0.026
SerAsn: 3.226 ± 0.043
SerPro: 2.843 ± 0.04
SerGln: 1.989 ± 0.029
SerArg: 2.585 ± 0.032
SerSer: 4.049 ± 0.046
SerThr: 3.907 ± 0.053
SerVal: 3.74 ± 0.052
SerTrp: 0.955 ± 0.022
SerTyr: 2.159 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.562 ± 0.06
ThrCys: 0.525 ± 0.018
ThrAsp: 3.031 ± 0.093
ThrGlu: 2.898 ± 0.031
ThrPhe: 2.811 ± 0.037
ThrGly: 4.387 ± 0.056
ThrHis: 1.037 ± 0.021
ThrIle: 3.63 ± 0.044
ThrLys: 2.571 ± 0.038
ThrLeu: 6.049 ± 0.053
ThrMet: 0.94 ± 0.02
ThrAsn: 2.509 ± 0.036
ThrPro: 2.985 ± 0.046
ThrGln: 2.322 ± 0.035
ThrArg: 2.314 ± 0.03
ThrSer: 3.193 ± 0.039
ThrThr: 3.433 ± 0.066
ThrVal: 3.679 ± 0.066
ThrTrp: 0.81 ± 0.02
ThrTyr: 2.267 ± 0.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.872 ± 0.049
ValCys: 0.66 ± 0.019
ValAsp: 3.395 ± 0.047
ValGlu: 3.935 ± 0.051
ValPhe: 3.297 ± 0.039
ValGly: 4.039 ± 0.039
ValHis: 1.132 ± 0.022
ValIle: 3.761 ± 0.038
ValLys: 3.748 ± 0.046
ValLeu: 6.433 ± 0.063
ValMet: 1.379 ± 0.024
ValAsn: 3.128 ± 0.045
ValPro: 2.585 ± 0.031
ValGln: 2.597 ± 0.032
ValArg: 2.754 ± 0.034
ValSer: 4.088 ± 0.053
ValThr: 3.345 ± 0.067
ValVal: 4.415 ± 0.06
ValTrp: 0.835 ± 0.018
ValTyr: 2.357 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.986 ± 0.023
TrpCys: 0.153 ± 0.009
TrpAsp: 0.781 ± 0.021
TrpGlu: 0.853 ± 0.018
TrpPhe: 0.614 ± 0.019
TrpGly: 1.019 ± 0.02
TrpHis: 0.318 ± 0.011
TrpIle: 0.837 ± 0.022
TrpLys: 0.893 ± 0.02
TrpLeu: 1.473 ± 0.027
TrpMet: 0.407 ± 0.013
TrpAsn: 0.76 ± 0.019
TrpPro: 0.393 ± 0.014
TrpGln: 0.71 ± 0.018
TrpArg: 0.75 ± 0.018
TrpSer: 0.955 ± 0.027
TrpThr: 0.761 ± 0.02
TrpVal: 0.94 ± 0.02
TrpTrp: 0.255 ± 0.009
TrpTyr: 0.515 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.844 ± 0.034
TyrCys: 0.399 ± 0.014
TyrAsp: 2.114 ± 0.034
TyrGlu: 1.923 ± 0.032
TyrPhe: 2.179 ± 0.026
TyrGly: 2.575 ± 0.037
TyrHis: 0.893 ± 0.018
TyrIle: 1.846 ± 0.029
TyrLys: 1.745 ± 0.027
TyrLeu: 4.077 ± 0.044
TyrMet: 0.593 ± 0.017
TyrAsn: 1.844 ± 0.031
TyrPro: 1.719 ± 0.027
TyrGln: 1.926 ± 0.032
TyrArg: 2.271 ± 0.032
TyrSer: 2.299 ± 0.036
TyrThr: 2.29 ± 0.038
TyrVal: 2.1 ± 0.03
TyrTrp: 0.572 ± 0.017
TyrTyr: 1.624 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6704 proteins (2562877 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
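A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is an illustration under stated assumptions, not the database's actual code. In particular, the function names are hypothetical, and reporting the standard deviation of the replicates as the "±" value is an assumption, since the page does not say which spread measure is used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each round draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread across the replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
mean_ll, err_ll = bootstrap_error(["MKLLA", "MKA", "LLLL"], "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.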

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski