Amino acid dipepetide frequency for Sediminispirochaeta smaragdinae (strain DSM 11293 / JCM 15392 / SEBR 4228) (Spirochaeta smaragdinae)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.264AlaAla: 8.264 ± 0.104
0.878AlaCys: 0.878 ± 0.028
4.239AlaAsp: 4.239 ± 0.056
5.695AlaGlu: 5.695 ± 0.076
3.681AlaPhe: 3.681 ± 0.057
6.861AlaGly: 6.861 ± 0.087
1.298AlaHis: 1.298 ± 0.031
5.466AlaIle: 5.466 ± 0.074
3.89AlaLys: 3.89 ± 0.062
9.288AlaLeu: 9.288 ± 0.097
2.435AlaMet: 2.435 ± 0.041
2.119AlaAsn: 2.119 ± 0.036
2.607AlaPro: 2.607 ± 0.05
2.235AlaGln: 2.235 ± 0.046
3.96AlaArg: 3.96 ± 0.054
5.616AlaSer: 5.616 ± 0.083
3.453AlaThr: 3.453 ± 0.053
6.423AlaVal: 6.423 ± 0.068
0.807AlaTrp: 0.807 ± 0.026
2.559AlaTyr: 2.559 ± 0.043
0.0AlaXaa: 0.0 ± 0.0
Cys
0.747CysAla: 0.747 ± 0.025
0.149CysCys: 0.149 ± 0.01
0.631CysAsp: 0.631 ± 0.022
0.557CysGlu: 0.557 ± 0.022
0.501CysPhe: 0.501 ± 0.02
1.069CysGly: 1.069 ± 0.035
0.222CysHis: 0.222 ± 0.013
0.699CysIle: 0.699 ± 0.02
0.443CysLys: 0.443 ± 0.019
0.844CysLeu: 0.844 ± 0.022
0.246CysMet: 0.246 ± 0.013
0.348CysAsn: 0.348 ± 0.015
0.487CysPro: 0.487 ± 0.023
0.222CysGln: 0.222 ± 0.012
0.735CysArg: 0.735 ± 0.024
0.816CysSer: 0.816 ± 0.024
0.533CysThr: 0.533 ± 0.018
0.566CysVal: 0.566 ± 0.022
0.101CysTrp: 0.101 ± 0.009
0.344CysTyr: 0.344 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.448AspAla: 4.448 ± 0.067
0.47AspCys: 0.47 ± 0.019
3.052AspAsp: 3.052 ± 0.056
4.132AspGlu: 4.132 ± 0.06
2.646AspPhe: 2.646 ± 0.047
4.18AspGly: 4.18 ± 0.078
1.109AspHis: 1.109 ± 0.03
3.975AspIle: 3.975 ± 0.055
2.33AspLys: 2.33 ± 0.044
5.434AspLeu: 5.434 ± 0.067
1.245AspMet: 1.245 ± 0.029
1.676AspAsn: 1.676 ± 0.037
2.745AspPro: 2.745 ± 0.037
1.63AspGln: 1.63 ± 0.036
3.483AspArg: 3.483 ± 0.059
3.198AspSer: 3.198 ± 0.052
2.795AspThr: 2.795 ± 0.043
3.269AspVal: 3.269 ± 0.048
0.601AspTrp: 0.601 ± 0.021
2.101AspTyr: 2.101 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
6.556GluAla: 6.556 ± 0.087
0.493GluCys: 0.493 ± 0.021
3.316GluAsp: 3.316 ± 0.053
6.322GluGlu: 6.322 ± 0.117
2.068GluPhe: 2.068 ± 0.04
4.97GluGly: 4.97 ± 0.07
1.494GluHis: 1.494 ± 0.032
5.155GluIle: 5.155 ± 0.07
5.118GluLys: 5.118 ± 0.076
7.093GluLeu: 7.093 ± 0.095
1.924GluMet: 1.924 ± 0.036
2.679GluAsn: 2.679 ± 0.045
2.185GluPro: 2.185 ± 0.049
2.478GluGln: 2.478 ± 0.043
4.466GluArg: 4.466 ± 0.066
3.91GluSer: 3.91 ± 0.063
3.556GluThr: 3.556 ± 0.055
4.19GluVal: 4.19 ± 0.057
0.581GluTrp: 0.581 ± 0.021
2.084GluTyr: 2.084 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.417PheAla: 3.417 ± 0.055
0.564PheCys: 0.564 ± 0.02
2.776PheAsp: 2.776 ± 0.045
2.472PheGlu: 2.472 ± 0.042
2.86PhePhe: 2.86 ± 0.056
3.441PheGly: 3.441 ± 0.054
1.007PheHis: 1.007 ± 0.025
3.006PheIle: 3.006 ± 0.053
1.432PheLys: 1.432 ± 0.033
5.295PheLeu: 5.295 ± 0.067
0.967PheMet: 0.967 ± 0.025
1.309PheAsn: 1.309 ± 0.03
2.163PhePro: 2.163 ± 0.041
1.27PheGln: 1.27 ± 0.034
2.598PheArg: 2.598 ± 0.046
4.311PheSer: 4.311 ± 0.057
2.456PheThr: 2.456 ± 0.049
2.735PheVal: 2.735 ± 0.047
0.524PheTrp: 0.524 ± 0.021
1.456PheTyr: 1.456 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
5.802GlyAla: 5.802 ± 0.071
0.938GlyCys: 0.938 ± 0.029
3.767GlyAsp: 3.767 ± 0.062
5.033GlyGlu: 5.033 ± 0.068
3.69GlyPhe: 3.69 ± 0.055
5.853GlyGly: 5.853 ± 0.089
1.338GlyHis: 1.338 ± 0.03
6.405GlyIle: 6.405 ± 0.077
4.976GlyLys: 4.976 ± 0.065
6.939GlyLeu: 6.939 ± 0.077
2.247GlyMet: 2.247 ± 0.045
2.819GlyAsn: 2.819 ± 0.05
2.142GlyPro: 2.142 ± 0.044
1.943GlyGln: 1.943 ± 0.04
4.323GlyArg: 4.323 ± 0.057
5.148GlySer: 5.148 ± 0.087
4.373GlyThr: 4.373 ± 0.083
5.068GlyVal: 5.068 ± 0.059
0.883GlyTrp: 0.883 ± 0.026
2.877GlyTyr: 2.877 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.441HisAla: 1.441 ± 0.033
0.27HisCys: 0.27 ± 0.014
1.026HisAsp: 1.026 ± 0.028
1.152HisGlu: 1.152 ± 0.032
0.966HisPhe: 0.966 ± 0.027
1.42HisGly: 1.42 ± 0.032
0.554HisHis: 0.554 ± 0.02
1.395HisIle: 1.395 ± 0.032
0.732HisLys: 0.732 ± 0.024
1.967HisLeu: 1.967 ± 0.035
0.472HisMet: 0.472 ± 0.017
0.653HisAsn: 0.653 ± 0.025
1.202HisPro: 1.202 ± 0.027
0.591HisGln: 0.591 ± 0.022
1.158HisArg: 1.158 ± 0.028
1.148HisSer: 1.148 ± 0.026
0.934HisThr: 0.934 ± 0.028
1.128HisVal: 1.128 ± 0.031
0.214HisTrp: 0.214 ± 0.01
0.76HisTyr: 0.76 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.394IleAla: 6.394 ± 0.074
0.751IleCys: 0.751 ± 0.025
4.638IleAsp: 4.638 ± 0.059
4.989IleGlu: 4.989 ± 0.069
2.998IlePhe: 2.998 ± 0.044
5.47IleGly: 5.47 ± 0.066
1.392IleHis: 1.392 ± 0.03
5.044IleIle: 5.044 ± 0.075
3.029IleLys: 3.029 ± 0.058
7.007IleLeu: 7.007 ± 0.094
1.602IleMet: 1.602 ± 0.034
2.505IleAsn: 2.505 ± 0.045
3.646IlePro: 3.646 ± 0.055
1.801IleGln: 1.801 ± 0.039
3.951IleArg: 3.951 ± 0.057
5.079IleSer: 5.079 ± 0.064
3.845IleThr: 3.845 ± 0.056
4.831IleVal: 4.831 ± 0.062
0.631IleTrp: 0.631 ± 0.02
1.934IleTyr: 1.934 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
4.597LysAla: 4.597 ± 0.065
0.28LysCys: 0.28 ± 0.015
2.608LysAsp: 2.608 ± 0.049
4.868LysGlu: 4.868 ± 0.081
1.239LysPhe: 1.239 ± 0.031
4.063LysGly: 4.063 ± 0.055
0.92LysHis: 0.92 ± 0.023
3.677LysIle: 3.677 ± 0.057
3.963LysLys: 3.963 ± 0.065
4.364LysLeu: 4.364 ± 0.063
1.341LysMet: 1.341 ± 0.035
2.251LysAsn: 2.251 ± 0.043
1.839LysPro: 1.839 ± 0.035
1.901LysGln: 1.901 ± 0.04
3.677LysArg: 3.677 ± 0.056
2.996LysSer: 2.996 ± 0.044
2.752LysThr: 2.752 ± 0.049
3.268LysVal: 3.268 ± 0.048
0.411LysTrp: 0.411 ± 0.02
1.285LysTyr: 1.285 ± 0.036
0.0LysXaa: 0.0 ± 0.0
Leu
8.34LeuAla: 8.34 ± 0.089
1.185LeuCys: 1.185 ± 0.031
5.343LeuAsp: 5.343 ± 0.064
6.572LeuGlu: 6.572 ± 0.097
5.885LeuPhe: 5.885 ± 0.087
7.0LeuGly: 7.0 ± 0.075
1.93LeuHis: 1.93 ± 0.046
6.79LeuIle: 6.79 ± 0.087
5.151LeuLys: 5.151 ± 0.071
11.646LeuLeu: 11.646 ± 0.132
2.34LeuMet: 2.34 ± 0.044
3.153LeuAsn: 3.153 ± 0.049
4.848LeuPro: 4.848 ± 0.074
2.932LeuGln: 2.932 ± 0.05
5.6LeuArg: 5.6 ± 0.085
8.518LeuSer: 8.518 ± 0.084
4.965LeuThr: 4.965 ± 0.07
6.468LeuVal: 6.468 ± 0.082
0.966LeuTrp: 0.966 ± 0.031
3.237LeuTyr: 3.237 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.18MetAla: 2.18 ± 0.038
0.184MetCys: 0.184 ± 0.011
1.392MetAsp: 1.392 ± 0.034
1.861MetGlu: 1.861 ± 0.039
0.846MetPhe: 0.846 ± 0.025
1.885MetGly: 1.885 ± 0.038
0.404MetHis: 0.404 ± 0.018
1.974MetIle: 1.974 ± 0.037
2.017MetLys: 2.017 ± 0.038
2.46MetLeu: 2.46 ± 0.041
0.74MetMet: 0.74 ± 0.027
1.222MetAsn: 1.222 ± 0.032
1.006MetPro: 1.006 ± 0.024
0.772MetGln: 0.772 ± 0.026
1.357MetArg: 1.357 ± 0.029
1.463MetSer: 1.463 ± 0.037
1.313MetThr: 1.313 ± 0.034
1.941MetVal: 1.941 ± 0.041
0.18MetTrp: 0.18 ± 0.012
0.553MetTyr: 0.553 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
2.465AsnAla: 2.465 ± 0.049
0.35AsnCys: 0.35 ± 0.016
1.894AsnAsp: 1.894 ± 0.036
2.15AsnGlu: 2.15 ± 0.048
1.394AsnPhe: 1.394 ± 0.034
2.743AsnGly: 2.743 ± 0.049
0.706AsnHis: 0.706 ± 0.024
2.669AsnIle: 2.669 ± 0.046
1.585AsnLys: 1.585 ± 0.036
3.399AsnLeu: 3.399 ± 0.05
0.894AsnMet: 0.894 ± 0.026
1.334AsnAsn: 1.334 ± 0.035
1.864AsnPro: 1.864 ± 0.031
1.133AsnGln: 1.133 ± 0.027
2.129AsnArg: 2.129 ± 0.04
2.082AsnSer: 2.082 ± 0.041
1.601AsnThr: 1.601 ± 0.035
1.963AsnVal: 1.963 ± 0.04
0.399AsnTrp: 0.399 ± 0.017
1.267AsnTyr: 1.267 ± 0.033
0.0AsnXaa: 0.0 ± 0.0
Pro
3.043ProAla: 3.043 ± 0.052
0.346ProCys: 0.346 ± 0.017
2.7ProAsp: 2.7 ± 0.051
3.771ProGlu: 3.771 ± 0.063
2.179ProPhe: 2.179 ± 0.04
3.226ProGly: 3.226 ± 0.048
0.839ProHis: 0.839 ± 0.025
2.542ProIle: 2.542 ± 0.049
1.839ProLys: 1.839 ± 0.037
4.144ProLeu: 4.144 ± 0.063
0.971ProMet: 0.971 ± 0.025
1.23ProAsn: 1.23 ± 0.028
1.543ProPro: 1.543 ± 0.04
1.228ProGln: 1.228 ± 0.027
1.7ProArg: 1.7 ± 0.032
2.727ProSer: 2.727 ± 0.044
1.706ProThr: 1.706 ± 0.036
3.317ProVal: 3.317 ± 0.048
0.459ProTrp: 0.459 ± 0.019
1.405ProTyr: 1.405 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
2.512GlnAla: 2.512 ± 0.045
0.257GlnCys: 0.257 ± 0.012
1.357GlnAsp: 1.357 ± 0.029
2.558GlnGlu: 2.558 ± 0.048
1.193GlnPhe: 1.193 ± 0.031
2.03GlnGly: 2.03 ± 0.035
0.516GlnHis: 0.516 ± 0.02
2.109GlnIle: 2.109 ± 0.043
1.953GlnLys: 1.953 ± 0.039
2.888GlnLeu: 2.888 ± 0.05
0.869GlnMet: 0.869 ± 0.026
1.142GlnAsn: 1.142 ± 0.027
0.895GlnPro: 0.895 ± 0.027
1.262GlnGln: 1.262 ± 0.032
1.82GlnArg: 1.82 ± 0.039
1.708GlnSer: 1.708 ± 0.039
1.395GlnThr: 1.395 ± 0.03
1.86GlnVal: 1.86 ± 0.041
0.387GlnTrp: 0.387 ± 0.018
0.942GlnTyr: 0.942 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
3.629ArgAla: 3.629 ± 0.043
0.629ArgCys: 0.629 ± 0.024
2.897ArgAsp: 2.897 ± 0.049
4.396ArgGlu: 4.396 ± 0.068
3.048ArgPhe: 3.048 ± 0.053
3.596ArgGly: 3.596 ± 0.048
1.043ArgHis: 1.043 ± 0.028
4.353ArgIle: 4.353 ± 0.063
3.676ArgLys: 3.676 ± 0.057
5.901ArgLeu: 5.901 ± 0.073
1.587ArgMet: 1.587 ± 0.034
2.306ArgAsn: 2.306 ± 0.046
1.92ArgPro: 1.92 ± 0.04
2.001ArgGln: 2.001 ± 0.043
3.677ArgArg: 3.677 ± 0.069
3.752ArgSer: 3.752 ± 0.062
2.511ArgThr: 2.511 ± 0.042
3.261ArgVal: 3.261 ± 0.053
0.691ArgTrp: 0.691 ± 0.022
2.309ArgTyr: 2.309 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
5.202SerAla: 5.202 ± 0.079
0.816SerCys: 0.816 ± 0.025
3.869SerAsp: 3.869 ± 0.05
4.089SerGlu: 4.089 ± 0.068
3.771SerPhe: 3.771 ± 0.054
6.32SerGly: 6.32 ± 0.083
1.23SerHis: 1.23 ± 0.029
4.81SerIle: 4.81 ± 0.053
2.607SerLys: 2.607 ± 0.046
7.467SerLeu: 7.467 ± 0.085
1.758SerMet: 1.758 ± 0.031
1.92SerAsn: 1.92 ± 0.042
2.718SerPro: 2.718 ± 0.05
1.849SerGln: 1.849 ± 0.042
3.744SerArg: 3.744 ± 0.061
4.981SerSer: 4.981 ± 0.09
3.115SerThr: 3.115 ± 0.045
4.72SerVal: 4.72 ± 0.072
0.872SerTrp: 0.872 ± 0.027
2.272SerTyr: 2.272 ± 0.042
0.0SerXaa: 0.0 ± 0.0
Thr
4.216ThrAla: 4.216 ± 0.059
0.467ThrCys: 0.467 ± 0.019
2.498ThrAsp: 2.498 ± 0.043
3.005ThrGlu: 3.005 ± 0.049
2.2ThrPhe: 2.2 ± 0.039
4.555ThrGly: 4.555 ± 0.061
0.887ThrHis: 0.887 ± 0.027
4.239ThrIle: 4.239 ± 0.057
2.158ThrLys: 2.158 ± 0.034
5.351ThrLeu: 5.351 ± 0.064
1.35ThrMet: 1.35 ± 0.03
1.549ThrAsn: 1.549 ± 0.033
2.183ThrPro: 2.183 ± 0.042
1.204ThrGln: 1.204 ± 0.032
2.149ThrArg: 2.149 ± 0.041
3.1ThrSer: 3.1 ± 0.054
2.551ThrThr: 2.551 ± 0.048
3.859ThrVal: 3.859 ± 0.061
0.539ThrTrp: 0.539 ± 0.022
1.505ThrTyr: 1.505 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
5.379ValAla: 5.379 ± 0.072
0.762ValCys: 0.762 ± 0.025
3.927ValAsp: 3.927 ± 0.058
4.388ValGlu: 4.388 ± 0.06
2.929ValPhe: 2.929 ± 0.046
4.672ValGly: 4.672 ± 0.063
1.263ValHis: 1.263 ± 0.033
4.557ValIle: 4.557 ± 0.064
3.285ValLys: 3.285 ± 0.06
6.778ValLeu: 6.778 ± 0.07
1.766ValMet: 1.766 ± 0.037
2.094ValAsn: 2.094 ± 0.04
3.001ValPro: 3.001 ± 0.056
1.688ValGln: 1.688 ± 0.031
3.642ValArg: 3.642 ± 0.059
4.899ValSer: 4.899 ± 0.063
3.496ValThr: 3.496 ± 0.056
5.047ValVal: 5.047 ± 0.064
0.641ValTrp: 0.641 ± 0.027
1.966ValTyr: 1.966 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.729TrpAla: 0.729 ± 0.023
0.099TrpCys: 0.099 ± 0.008
0.631TrpAsp: 0.631 ± 0.028
0.758TrpGlu: 0.758 ± 0.026
0.513TrpPhe: 0.513 ± 0.021
0.719TrpGly: 0.719 ± 0.023
0.201TrpHis: 0.201 ± 0.014
0.694TrpIle: 0.694 ± 0.026
0.727TrpLys: 0.727 ± 0.023
0.987TrpLeu: 0.987 ± 0.027
0.311TrpMet: 0.311 ± 0.016
0.557TrpAsn: 0.557 ± 0.02
0.362TrpPro: 0.362 ± 0.015
0.393TrpGln: 0.393 ± 0.017
0.525TrpArg: 0.525 ± 0.023
0.647TrpSer: 0.647 ± 0.024
0.466TrpThr: 0.466 ± 0.018
0.564TrpVal: 0.564 ± 0.021
0.132TrpTrp: 0.132 ± 0.01
0.371TrpTyr: 0.371 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.414TyrAla: 2.414 ± 0.045
0.369TyrCys: 0.369 ± 0.016
1.958TyrAsp: 1.958 ± 0.037
1.872TyrGlu: 1.872 ± 0.039
1.451TyrPhe: 1.451 ± 0.034
2.492TyrGly: 2.492 ± 0.044
0.833TyrHis: 0.833 ± 0.028
1.979TyrIle: 1.979 ± 0.038
1.39TyrLys: 1.39 ± 0.038
3.616TyrLeu: 3.616 ± 0.059
0.684TyrMet: 0.684 ± 0.022
1.155TyrAsn: 1.155 ± 0.031
1.558TyrPro: 1.558 ± 0.035
1.119TyrGln: 1.119 ± 0.029
2.49TyrArg: 2.49 ± 0.043
2.032TyrSer: 2.032 ± 0.04
1.746TyrThr: 1.746 ± 0.038
1.702TyrVal: 1.702 ± 0.036
0.366TyrTrp: 0.366 ± 0.018
1.279TyrTyr: 1.279 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4211 proteins (1432067 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski