Amino acid dipepetide frequency for Owenweeksia hongkongensis (strain DSM 17368 / CIP 108786 / JCM 12287 / NRRL B-23963 / UST20020801)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.007AlaAla: 5.007 ± 0.087
0.642AlaCys: 0.642 ± 0.025
3.884AlaAsp: 3.884 ± 0.07
4.489AlaGlu: 4.489 ± 0.075
3.394AlaPhe: 3.394 ± 0.052
5.161AlaGly: 5.161 ± 0.091
1.201AlaHis: 1.201 ± 0.037
4.941AlaIle: 4.941 ± 0.076
4.371AlaLys: 4.371 ± 0.075
6.713AlaLeu: 6.713 ± 0.078
1.747AlaMet: 1.747 ± 0.039
3.677AlaAsn: 3.677 ± 0.069
2.387AlaPro: 2.387 ± 0.057
2.72AlaGln: 2.72 ± 0.048
2.396AlaArg: 2.396 ± 0.048
4.631AlaSer: 4.631 ± 0.079
3.897AlaThr: 3.897 ± 0.083
4.287AlaVal: 4.287 ± 0.068
0.789AlaTrp: 0.789 ± 0.03
2.63AlaTyr: 2.63 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
0.561CysAla: 0.561 ± 0.023
0.106CysCys: 0.106 ± 0.01
0.549CysAsp: 0.549 ± 0.031
0.486CysGlu: 0.486 ± 0.025
0.406CysPhe: 0.406 ± 0.018
0.764CysGly: 0.764 ± 0.036
0.186CysHis: 0.186 ± 0.014
0.482CysIle: 0.482 ± 0.022
0.445CysLys: 0.445 ± 0.022
0.679CysLeu: 0.679 ± 0.025
0.148CysMet: 0.148 ± 0.011
0.433CysAsn: 0.433 ± 0.021
0.378CysPro: 0.378 ± 0.02
0.247CysGln: 0.247 ± 0.014
0.265CysArg: 0.265 ± 0.014
0.582CysSer: 0.582 ± 0.025
0.514CysThr: 0.514 ± 0.024
0.474CysVal: 0.474 ± 0.019
0.136CysTrp: 0.136 ± 0.015
0.298CysTyr: 0.298 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.696AspAla: 3.696 ± 0.072
0.463AspCys: 0.463 ± 0.02
2.981AspAsp: 2.981 ± 0.06
3.858AspGlu: 3.858 ± 0.069
3.533AspPhe: 3.533 ± 0.054
4.071AspGly: 4.071 ± 0.074
0.965AspHis: 0.965 ± 0.027
4.033AspIle: 4.033 ± 0.059
3.586AspLys: 3.586 ± 0.066
5.382AspLeu: 5.382 ± 0.07
1.295AspMet: 1.295 ± 0.034
2.749AspAsn: 2.749 ± 0.055
2.103AspPro: 2.103 ± 0.043
1.735AspGln: 1.735 ± 0.041
2.033AspArg: 2.033 ± 0.037
4.092AspSer: 4.092 ± 0.103
3.325AspThr: 3.325 ± 0.094
3.594AspVal: 3.594 ± 0.06
0.796AspTrp: 0.796 ± 0.028
2.678AspTyr: 2.678 ± 0.054
0.0AspXaa: 0.0 ± 0.0
Glu
4.772GluAla: 4.772 ± 0.08
0.365GluCys: 0.365 ± 0.017
3.654GluAsp: 3.654 ± 0.071
5.135GluGlu: 5.135 ± 0.098
2.711GluPhe: 2.711 ± 0.047
4.097GluGly: 4.097 ± 0.062
1.186GluHis: 1.186 ± 0.034
5.036GluIle: 5.036 ± 0.069
5.062GluLys: 5.062 ± 0.096
6.248GluLeu: 6.248 ± 0.099
1.877GluMet: 1.877 ± 0.05
3.976GluAsn: 3.976 ± 0.061
1.78GluPro: 1.78 ± 0.041
2.15GluGln: 2.15 ± 0.051
2.584GluArg: 2.584 ± 0.059
3.537GluSer: 3.537 ± 0.059
3.236GluThr: 3.236 ± 0.053
4.865GluVal: 4.865 ± 0.076
0.723GluTrp: 0.723 ± 0.026
2.508GluTyr: 2.508 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
3.301PheAla: 3.301 ± 0.055
0.46PheCys: 0.46 ± 0.022
3.177PheAsp: 3.177 ± 0.05
3.338PheGlu: 3.338 ± 0.063
2.411PhePhe: 2.411 ± 0.06
3.809PheGly: 3.809 ± 0.07
0.843PheHis: 0.843 ± 0.025
3.371PheIle: 3.371 ± 0.058
2.933PheLys: 2.933 ± 0.056
4.292PheLeu: 4.292 ± 0.076
1.159PheMet: 1.159 ± 0.032
2.63PheAsn: 2.63 ± 0.053
1.777PhePro: 1.777 ± 0.037
1.57PheGln: 1.57 ± 0.04
1.761PheArg: 1.761 ± 0.041
4.005PheSer: 4.005 ± 0.062
3.356PheThr: 3.356 ± 0.075
2.923PheVal: 2.923 ± 0.056
0.6PheTrp: 0.6 ± 0.024
2.146PheTyr: 2.146 ± 0.049
0.0PheXaa: 0.0 ± 0.0
Gly
4.77GlyAla: 4.77 ± 0.089
0.721GlyCys: 0.721 ± 0.053
3.877GlyAsp: 3.877 ± 0.066
4.04GlyGlu: 4.04 ± 0.054
3.728GlyPhe: 3.728 ± 0.064
5.302GlyGly: 5.302 ± 0.113
1.219GlyHis: 1.219 ± 0.033
5.048GlyIle: 5.048 ± 0.071
4.85GlyLys: 4.85 ± 0.079
6.137GlyLeu: 6.137 ± 0.075
1.905GlyMet: 1.905 ± 0.042
3.945GlyAsn: 3.945 ± 0.099
1.807GlyPro: 1.807 ± 0.057
2.385GlyGln: 2.385 ± 0.051
2.418GlyArg: 2.418 ± 0.053
4.826GlySer: 4.826 ± 0.094
4.426GlyThr: 4.426 ± 0.124
4.899GlyVal: 4.899 ± 0.08
0.851GlyTrp: 0.851 ± 0.026
3.008GlyTyr: 3.008 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
1.001HisAla: 1.001 ± 0.028
0.191HisCys: 0.191 ± 0.014
0.925HisAsp: 0.925 ± 0.026
1.025HisGlu: 1.025 ± 0.029
1.193HisPhe: 1.193 ± 0.036
1.166HisGly: 1.166 ± 0.029
0.522HisHis: 0.522 ± 0.023
1.324HisIle: 1.324 ± 0.032
1.054HisLys: 1.054 ± 0.028
1.928HisLeu: 1.928 ± 0.046
0.403HisMet: 0.403 ± 0.019
0.932HisAsn: 0.932 ± 0.028
0.935HisPro: 0.935 ± 0.028
0.725HisGln: 0.725 ± 0.026
0.749HisArg: 0.749 ± 0.026
1.22HisSer: 1.22 ± 0.034
1.037HisThr: 1.037 ± 0.028
1.005HisVal: 1.005 ± 0.028
0.279HisTrp: 0.279 ± 0.016
0.821HisTyr: 0.821 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.015IleAla: 5.015 ± 0.073
0.62IleCys: 0.62 ± 0.02
4.467IleAsp: 4.467 ± 0.066
4.715IleGlu: 4.715 ± 0.066
3.105IlePhe: 3.105 ± 0.056
4.809IleGly: 4.809 ± 0.066
1.286IleHis: 1.286 ± 0.032
4.685IleIle: 4.685 ± 0.083
4.317IleLys: 4.317 ± 0.067
6.31IleLeu: 6.31 ± 0.105
1.403IleMet: 1.403 ± 0.036
3.959IleAsn: 3.959 ± 0.058
2.935IlePro: 2.935 ± 0.048
2.284IleGln: 2.284 ± 0.043
2.589IleArg: 2.589 ± 0.048
5.519IleSer: 5.519 ± 0.072
4.311IleThr: 4.311 ± 0.071
4.076IleVal: 4.076 ± 0.059
0.751IleTrp: 0.751 ± 0.026
2.581IleTyr: 2.581 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
4.801LysAla: 4.801 ± 0.083
0.375LysCys: 0.375 ± 0.019
3.843LysAsp: 3.843 ± 0.074
5.113LysGlu: 5.113 ± 0.098
2.487LysPhe: 2.487 ± 0.05
4.142LysGly: 4.142 ± 0.067
1.281LysHis: 1.281 ± 0.037
4.339LysIle: 4.339 ± 0.08
5.241LysLys: 5.241 ± 0.098
6.017LysLeu: 6.017 ± 0.093
1.946LysMet: 1.946 ± 0.045
3.66LysAsn: 3.66 ± 0.069
2.427LysPro: 2.427 ± 0.06
2.344LysGln: 2.344 ± 0.047
2.59LysArg: 2.59 ± 0.052
4.075LysSer: 4.075 ± 0.07
3.588LysThr: 3.588 ± 0.065
4.496LysVal: 4.496 ± 0.076
0.759LysTrp: 0.759 ± 0.026
2.593LysTyr: 2.593 ± 0.054
0.0LysXaa: 0.0 ± 0.0
Leu
6.619LeuAla: 6.619 ± 0.097
0.689LeuCys: 0.689 ± 0.029
5.224LeuAsp: 5.224 ± 0.083
6.051LeuGlu: 6.051 ± 0.109
4.44LeuPhe: 4.44 ± 0.079
5.965LeuGly: 5.965 ± 0.075
1.654LeuHis: 1.654 ± 0.036
6.272LeuIle: 6.272 ± 0.091
6.769LeuLys: 6.769 ± 0.108
8.446LeuLeu: 8.446 ± 0.134
2.259LeuMet: 2.259 ± 0.05
5.27LeuAsn: 5.27 ± 0.072
3.727LeuPro: 3.727 ± 0.059
3.424LeuGln: 3.424 ± 0.057
3.68LeuArg: 3.68 ± 0.059
7.092LeuSer: 7.092 ± 0.093
5.081LeuThr: 5.081 ± 0.069
5.547LeuVal: 5.547 ± 0.076
0.912LeuTrp: 0.912 ± 0.033
3.234LeuTyr: 3.234 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
1.981MetAla: 1.981 ± 0.043
0.148MetCys: 0.148 ± 0.011
1.515MetAsp: 1.515 ± 0.038
1.641MetGlu: 1.641 ± 0.039
0.848MetPhe: 0.848 ± 0.027
1.722MetGly: 1.722 ± 0.04
0.432MetHis: 0.432 ± 0.02
1.547MetIle: 1.547 ± 0.035
2.095MetLys: 2.095 ± 0.044
2.169MetLeu: 2.169 ± 0.05
0.689MetMet: 0.689 ± 0.025
1.303MetAsn: 1.303 ± 0.031
1.008MetPro: 1.008 ± 0.026
0.894MetGln: 0.894 ± 0.029
1.036MetArg: 1.036 ± 0.031
1.478MetSer: 1.478 ± 0.039
1.208MetThr: 1.208 ± 0.035
1.613MetVal: 1.613 ± 0.038
0.187MetTrp: 0.187 ± 0.012
0.755MetTyr: 0.755 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
3.614AsnAla: 3.614 ± 0.063
0.471AsnCys: 0.471 ± 0.024
3.081AsnAsp: 3.081 ± 0.056
3.15AsnGlu: 3.15 ± 0.063
2.899AsnPhe: 2.899 ± 0.051
4.291AsnGly: 4.291 ± 0.097
1.042AsnHis: 1.042 ± 0.031
3.876AsnIle: 3.876 ± 0.066
3.013AsnLys: 3.013 ± 0.062
5.182AsnLeu: 5.182 ± 0.072
1.208AsnMet: 1.208 ± 0.032
2.979AsnAsn: 2.979 ± 0.075
2.902AsnPro: 2.902 ± 0.055
2.0AsnGln: 2.0 ± 0.04
2.083AsnArg: 2.083 ± 0.041
3.765AsnSer: 3.765 ± 0.079
3.413AsnThr: 3.413 ± 0.075
3.369AsnVal: 3.369 ± 0.058
0.819AsnTrp: 0.819 ± 0.03
2.449AsnTyr: 2.449 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
2.603ProAla: 2.603 ± 0.052
0.309ProCys: 0.309 ± 0.015
2.311ProAsp: 2.311 ± 0.044
2.98ProGlu: 2.98 ± 0.057
1.976ProPhe: 1.976 ± 0.047
2.448ProGly: 2.448 ± 0.058
0.722ProHis: 0.722 ± 0.022
2.535ProIle: 2.535 ± 0.05
2.379ProLys: 2.379 ± 0.05
3.073ProLeu: 3.073 ± 0.052
0.831ProMet: 0.831 ± 0.027
2.317ProAsn: 2.317 ± 0.047
1.041ProPro: 1.041 ± 0.037
1.354ProGln: 1.354 ± 0.032
1.057ProArg: 1.057 ± 0.029
2.578ProSer: 2.578 ± 0.056
2.165ProThr: 2.165 ± 0.044
2.578ProVal: 2.578 ± 0.056
0.458ProTrp: 0.458 ± 0.022
1.533ProTyr: 1.533 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
2.421GlnAla: 2.421 ± 0.052
0.191GlnCys: 0.191 ± 0.014
1.649GlnAsp: 1.649 ± 0.04
2.208GlnGlu: 2.208 ± 0.042
1.643GlnPhe: 1.643 ± 0.035
2.102GlnGly: 2.102 ± 0.049
0.652GlnHis: 0.652 ± 0.025
2.362GlnIle: 2.362 ± 0.043
2.616GlnLys: 2.616 ± 0.064
3.391GlnLeu: 3.391 ± 0.061
0.969GlnMet: 0.969 ± 0.031
2.155GlnAsn: 2.155 ± 0.047
1.265GlnPro: 1.265 ± 0.035
1.516GlnGln: 1.516 ± 0.041
1.445GlnArg: 1.445 ± 0.04
2.334GlnSer: 2.334 ± 0.05
1.932GlnThr: 1.932 ± 0.037
2.278GlnVal: 2.278 ± 0.05
0.457GlnTrp: 0.457 ± 0.022
1.334GlnTyr: 1.334 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
2.296ArgAla: 2.296 ± 0.042
0.215ArgCys: 0.215 ± 0.014
1.995ArgAsp: 1.995 ± 0.045
2.542ArgGlu: 2.542 ± 0.056
2.084ArgPhe: 2.084 ± 0.043
2.243ArgGly: 2.243 ± 0.04
0.69ArgHis: 0.69 ± 0.024
2.848ArgIle: 2.848 ± 0.051
2.828ArgLys: 2.828 ± 0.062
3.46ArgLeu: 3.46 ± 0.062
1.114ArgMet: 1.114 ± 0.034
2.113ArgAsn: 2.113 ± 0.046
1.277ArgPro: 1.277 ± 0.036
1.326ArgGln: 1.326 ± 0.034
1.36ArgArg: 1.36 ± 0.034
2.13ArgSer: 2.13 ± 0.042
1.878ArgThr: 1.878 ± 0.042
2.442ArgVal: 2.442 ± 0.045
0.476ArgTrp: 0.476 ± 0.022
1.604ArgTyr: 1.604 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
4.642SerAla: 4.642 ± 0.08
0.714SerCys: 0.714 ± 0.032
3.531SerAsp: 3.531 ± 0.065
3.941SerGlu: 3.941 ± 0.069
3.947SerPhe: 3.947 ± 0.057
5.645SerGly: 5.645 ± 0.104
1.251SerHis: 1.251 ± 0.033
5.189SerIle: 5.189 ± 0.08
4.208SerLys: 4.208 ± 0.079
6.604SerLeu: 6.604 ± 0.092
1.516SerMet: 1.516 ± 0.037
3.687SerAsn: 3.687 ± 0.08
2.699SerPro: 2.699 ± 0.059
2.363SerGln: 2.363 ± 0.043
2.455SerArg: 2.455 ± 0.056
5.15SerSer: 5.15 ± 0.098
4.199SerThr: 4.199 ± 0.094
4.38SerVal: 4.38 ± 0.074
0.859SerTrp: 0.859 ± 0.026
2.855SerTyr: 2.855 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
4.154ThrAla: 4.154 ± 0.091
0.439ThrCys: 0.439 ± 0.022
3.283ThrAsp: 3.283 ± 0.074
3.44ThrGlu: 3.44 ± 0.062
2.948ThrPhe: 2.948 ± 0.054
4.631ThrGly: 4.631 ± 0.108
1.031ThrHis: 1.031 ± 0.025
4.2ThrIle: 4.2 ± 0.085
3.023ThrLys: 3.023 ± 0.048
5.528ThrLeu: 5.528 ± 0.071
1.097ThrMet: 1.097 ± 0.031
3.065ThrAsn: 3.065 ± 0.072
2.579ThrPro: 2.579 ± 0.059
1.811ThrGln: 1.811 ± 0.038
1.799ThrArg: 1.799 ± 0.041
4.144ThrSer: 4.144 ± 0.088
3.725ThrThr: 3.725 ± 0.095
3.942ThrVal: 3.942 ± 0.101
0.647ThrTrp: 0.647 ± 0.027
2.472ThrTyr: 2.472 ± 0.06
0.0ThrXaa: 0.0 ± 0.0
Val
4.497ValAla: 4.497 ± 0.068
0.575ValCys: 0.575 ± 0.024
3.827ValAsp: 3.827 ± 0.06
4.288ValGlu: 4.288 ± 0.065
3.223ValPhe: 3.223 ± 0.061
4.254ValGly: 4.254 ± 0.067
1.118ValHis: 1.118 ± 0.032
4.467ValIle: 4.467 ± 0.066
4.102ValLys: 4.102 ± 0.063
5.808ValLeu: 5.808 ± 0.08
1.562ValMet: 1.562 ± 0.044
3.719ValAsn: 3.719 ± 0.064
2.267ValPro: 2.267 ± 0.043
2.033ValGln: 2.033 ± 0.043
2.304ValArg: 2.304 ± 0.047
4.939ValSer: 4.939 ± 0.069
3.557ValThr: 3.557 ± 0.085
4.61ValVal: 4.61 ± 0.068
0.715ValTrp: 0.715 ± 0.027
2.595ValTyr: 2.595 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.759TrpAla: 0.759 ± 0.029
0.117TrpCys: 0.117 ± 0.01
0.752TrpAsp: 0.752 ± 0.027
0.774TrpGlu: 0.774 ± 0.025
0.552TrpPhe: 0.552 ± 0.023
0.811TrpGly: 0.811 ± 0.025
0.239TrpHis: 0.239 ± 0.015
0.709TrpIle: 0.709 ± 0.031
0.842TrpLys: 0.842 ± 0.026
1.157TrpLeu: 1.157 ± 0.031
0.352TrpMet: 0.352 ± 0.019
0.705TrpAsn: 0.705 ± 0.026
0.337TrpPro: 0.337 ± 0.019
0.483TrpGln: 0.483 ± 0.022
0.46TrpArg: 0.46 ± 0.019
0.761TrpSer: 0.761 ± 0.028
0.67TrpThr: 0.67 ± 0.025
0.819TrpVal: 0.819 ± 0.03
0.181TrpTrp: 0.181 ± 0.013
0.48TrpTyr: 0.48 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.455TyrAla: 2.455 ± 0.046
0.327TyrCys: 0.327 ± 0.017
2.443TyrAsp: 2.443 ± 0.05
2.288TyrGlu: 2.288 ± 0.044
2.332TyrPhe: 2.332 ± 0.045
2.73TyrGly: 2.73 ± 0.048
0.914TyrHis: 0.914 ± 0.031
2.475TyrIle: 2.475 ± 0.05
2.384TyrLys: 2.384 ± 0.048
3.83TyrLeu: 3.83 ± 0.062
0.789TyrMet: 0.789 ± 0.028
2.332TyrAsn: 2.332 ± 0.047
1.617TyrPro: 1.617 ± 0.04
1.556TyrGln: 1.556 ± 0.035
1.857TyrArg: 1.857 ± 0.04
2.975TyrSer: 2.975 ± 0.056
2.441TyrThr: 2.441 ± 0.058
2.263TyrVal: 2.263 ± 0.055
0.567TyrTrp: 0.567 ± 0.027
2.006TyrTyr: 2.006 ± 0.052
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3471 proteins (1212283 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski