Amino acid dipepetide frequency for Flavobacterium subsaxonicum WB 4.1-42 = DSM 21790

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.77AlaAla: 6.77 ± 0.108
0.662AlaCys: 0.662 ± 0.028
4.61AlaAsp: 4.61 ± 0.062
4.666AlaGlu: 4.666 ± 0.07
3.685AlaPhe: 3.685 ± 0.062
5.521AlaGly: 5.521 ± 0.082
1.189AlaHis: 1.189 ± 0.038
5.642AlaIle: 5.642 ± 0.093
4.958AlaLys: 4.958 ± 0.081
7.486AlaLeu: 7.486 ± 0.088
1.733AlaMet: 1.733 ± 0.043
4.136AlaAsn: 4.136 ± 0.073
2.527AlaPro: 2.527 ± 0.057
3.299AlaGln: 3.299 ± 0.05
2.273AlaArg: 2.273 ± 0.044
4.424AlaSer: 4.424 ± 0.071
5.381AlaThr: 5.381 ± 0.115
5.312AlaVal: 5.312 ± 0.075
0.774AlaTrp: 0.774 ± 0.025
2.983AlaTyr: 2.983 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.522CysAla: 0.522 ± 0.022
0.098CysCys: 0.098 ± 0.008
0.439CysAsp: 0.439 ± 0.026
0.417CysGlu: 0.417 ± 0.023
0.383CysPhe: 0.383 ± 0.018
0.686CysGly: 0.686 ± 0.031
0.167CysHis: 0.167 ± 0.013
0.61CysIle: 0.61 ± 0.025
0.404CysLys: 0.404 ± 0.015
0.711CysLeu: 0.711 ± 0.022
0.16CysMet: 0.16 ± 0.012
0.433CysAsn: 0.433 ± 0.022
0.307CysPro: 0.307 ± 0.019
0.222CysGln: 0.222 ± 0.014
0.255CysArg: 0.255 ± 0.012
0.611CysSer: 0.611 ± 0.025
0.581CysThr: 0.581 ± 0.035
0.439CysVal: 0.439 ± 0.02
0.058CysTrp: 0.058 ± 0.007
0.328CysTyr: 0.328 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
4.68AspAla: 4.68 ± 0.065
0.409AspCys: 0.409 ± 0.021
2.979AspAsp: 2.979 ± 0.06
3.387AspGlu: 3.387 ± 0.056
3.328AspPhe: 3.328 ± 0.054
4.01AspGly: 4.01 ± 0.07
0.84AspHis: 0.84 ± 0.026
4.195AspIle: 4.195 ± 0.054
4.116AspLys: 4.116 ± 0.065
4.705AspLeu: 4.705 ± 0.062
1.177AspMet: 1.177 ± 0.032
3.303AspAsn: 3.303 ± 0.057
1.719AspPro: 1.719 ± 0.043
1.309AspGln: 1.309 ± 0.034
1.849AspArg: 1.849 ± 0.037
2.884AspSer: 2.884 ± 0.051
3.189AspThr: 3.189 ± 0.05
3.858AspVal: 3.858 ± 0.054
0.705AspTrp: 0.705 ± 0.028
2.82AspTyr: 2.82 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
4.78GluAla: 4.78 ± 0.07
0.344GluCys: 0.344 ± 0.02
3.049GluAsp: 3.049 ± 0.051
3.841GluGlu: 3.841 ± 0.068
2.634GluPhe: 2.634 ± 0.05
3.766GluGly: 3.766 ± 0.062
1.094GluHis: 1.094 ± 0.033
4.335GluIle: 4.335 ± 0.069
4.699GluLys: 4.699 ± 0.072
5.461GluLeu: 5.461 ± 0.073
1.48GluMet: 1.48 ± 0.039
3.506GluAsn: 3.506 ± 0.058
1.724GluPro: 1.724 ± 0.037
2.231GluGln: 2.231 ± 0.05
2.32GluArg: 2.32 ± 0.048
2.885GluSer: 2.885 ± 0.057
3.211GluThr: 3.211 ± 0.048
4.036GluVal: 4.036 ± 0.057
0.641GluTrp: 0.641 ± 0.024
2.367GluTyr: 2.367 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
3.615PheAla: 3.615 ± 0.054
0.498PheCys: 0.498 ± 0.02
3.13PheAsp: 3.13 ± 0.052
3.064PheGlu: 3.064 ± 0.048
2.493PhePhe: 2.493 ± 0.054
3.506PheGly: 3.506 ± 0.062
0.7PheHis: 0.7 ± 0.023
3.62PheIle: 3.62 ± 0.063
3.421PheLys: 3.421 ± 0.058
4.126PheLeu: 4.126 ± 0.064
1.122PheMet: 1.122 ± 0.033
3.071PheAsn: 3.071 ± 0.063
1.55PhePro: 1.55 ± 0.033
1.169PheGln: 1.169 ± 0.033
1.582PheArg: 1.582 ± 0.038
3.552PheSer: 3.552 ± 0.057
3.807PheThr: 3.807 ± 0.063
3.005PheVal: 3.005 ± 0.058
0.525PheTrp: 0.525 ± 0.021
2.232PheTyr: 2.232 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
4.815GlyAla: 4.815 ± 0.076
0.724GlyCys: 0.724 ± 0.034
3.319GlyAsp: 3.319 ± 0.051
3.399GlyGlu: 3.399 ± 0.055
3.67GlyPhe: 3.67 ± 0.06
4.844GlyGly: 4.844 ± 0.083
1.22GlyHis: 1.22 ± 0.031
5.143GlyIle: 5.143 ± 0.06
4.903GlyLys: 4.903 ± 0.069
6.075GlyLeu: 6.075 ± 0.084
1.66GlyMet: 1.66 ± 0.042
3.884GlyAsn: 3.884 ± 0.076
1.525GlyPro: 1.525 ± 0.039
2.273GlyGln: 2.273 ± 0.051
2.252GlyArg: 2.252 ± 0.045
4.33GlySer: 4.33 ± 0.064
5.078GlyThr: 5.078 ± 0.112
4.366GlyVal: 4.366 ± 0.072
0.862GlyTrp: 0.862 ± 0.029
3.143GlyTyr: 3.143 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
1.13HisAla: 1.13 ± 0.037
0.146HisCys: 0.146 ± 0.011
0.946HisAsp: 0.946 ± 0.024
0.926HisGlu: 0.926 ± 0.029
1.09HisPhe: 1.09 ± 0.034
1.086HisGly: 1.086 ± 0.034
0.463HisHis: 0.463 ± 0.021
1.321HisIle: 1.321 ± 0.037
1.1HisLys: 1.1 ± 0.029
1.66HisLeu: 1.66 ± 0.038
0.327HisMet: 0.327 ± 0.016
1.068HisAsn: 1.068 ± 0.029
0.845HisPro: 0.845 ± 0.027
0.605HisGln: 0.605 ± 0.023
0.599HisArg: 0.599 ± 0.021
1.014HisSer: 1.014 ± 0.03
1.009HisThr: 1.009 ± 0.027
0.935HisVal: 0.935 ± 0.029
0.199HisTrp: 0.199 ± 0.013
0.916HisTyr: 0.916 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.131IleAla: 6.131 ± 0.081
0.557IleCys: 0.557 ± 0.018
4.373IleAsp: 4.373 ± 0.068
4.498IleGlu: 4.498 ± 0.068
3.103IlePhe: 3.103 ± 0.053
4.567IleGly: 4.567 ± 0.06
1.108IleHis: 1.108 ± 0.029
5.42IleIle: 5.42 ± 0.077
5.202IleLys: 5.202 ± 0.068
5.961IleLeu: 5.961 ± 0.089
1.4IleMet: 1.4 ± 0.037
4.264IleAsn: 4.264 ± 0.064
2.948IlePro: 2.948 ± 0.054
2.255IleGln: 2.255 ± 0.042
2.343IleArg: 2.343 ± 0.045
4.58IleSer: 4.58 ± 0.07
5.389IleThr: 5.389 ± 0.073
4.611IleVal: 4.611 ± 0.072
0.605IleTrp: 0.605 ± 0.026
2.752IleTyr: 2.752 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
5.262LysAla: 5.262 ± 0.083
0.314LysCys: 0.314 ± 0.017
4.32LysAsp: 4.32 ± 0.068
4.93LysGlu: 4.93 ± 0.067
2.75LysPhe: 2.75 ± 0.053
4.356LysGly: 4.356 ± 0.062
1.21LysHis: 1.21 ± 0.036
5.31LysIle: 5.31 ± 0.074
5.993LysLys: 5.993 ± 0.086
6.043LysLeu: 6.043 ± 0.086
1.921LysMet: 1.921 ± 0.039
4.456LysAsn: 4.456 ± 0.056
2.56LysPro: 2.56 ± 0.051
2.738LysGln: 2.738 ± 0.047
2.382LysArg: 2.382 ± 0.05
3.732LysSer: 3.732 ± 0.062
4.283LysThr: 4.283 ± 0.058
4.38LysVal: 4.38 ± 0.062
0.692LysTrp: 0.692 ± 0.024
2.868LysTyr: 2.868 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
6.754LeuAla: 6.754 ± 0.084
0.785LeuCys: 0.785 ± 0.025
4.81LeuAsp: 4.81 ± 0.074
5.283LeuGlu: 5.283 ± 0.078
4.626LeuPhe: 4.626 ± 0.078
5.764LeuGly: 5.764 ± 0.079
1.678LeuHis: 1.678 ± 0.038
5.834LeuIle: 5.834 ± 0.087
6.744LeuLys: 6.744 ± 0.085
9.399LeuLeu: 9.399 ± 0.117
2.024LeuMet: 2.024 ± 0.045
5.13LeuAsn: 5.13 ± 0.07
3.964LeuPro: 3.964 ± 0.06
3.633LeuGln: 3.633 ± 0.059
3.109LeuArg: 3.109 ± 0.061
6.221LeuSer: 6.221 ± 0.074
5.78LeuThr: 5.78 ± 0.077
5.491LeuVal: 5.491 ± 0.069
0.865LeuTrp: 0.865 ± 0.032
3.531LeuTyr: 3.531 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
1.944MetAla: 1.944 ± 0.038
0.132MetCys: 0.132 ± 0.01
1.2MetAsp: 1.2 ± 0.032
1.408MetGlu: 1.408 ± 0.035
0.901MetPhe: 0.901 ± 0.029
1.487MetGly: 1.487 ± 0.035
0.479MetHis: 0.479 ± 0.022
1.307MetIle: 1.307 ± 0.031
2.022MetLys: 2.022 ± 0.046
2.082MetLeu: 2.082 ± 0.042
0.555MetMet: 0.555 ± 0.022
1.034MetAsn: 1.034 ± 0.03
1.017MetPro: 1.017 ± 0.027
0.95MetGln: 0.95 ± 0.026
0.875MetArg: 0.875 ± 0.029
1.251MetSer: 1.251 ± 0.03
1.062MetThr: 1.062 ± 0.029
1.472MetVal: 1.472 ± 0.031
0.2MetTrp: 0.2 ± 0.013
0.785MetTyr: 0.785 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
4.364AsnAla: 4.364 ± 0.074
0.447AsnCys: 0.447 ± 0.022
3.035AsnAsp: 3.035 ± 0.052
3.031AsnGlu: 3.031 ± 0.043
2.924AsnPhe: 2.924 ± 0.052
4.417AsnGly: 4.417 ± 0.103
0.958AsnHis: 0.958 ± 0.025
4.306AsnIle: 4.306 ± 0.064
3.763AsnLys: 3.763 ± 0.063
5.033AsnLeu: 5.033 ± 0.073
1.197AsnMet: 1.197 ± 0.029
3.88AsnAsn: 3.88 ± 0.072
2.827AsnPro: 2.827 ± 0.051
1.889AsnGln: 1.889 ± 0.042
2.071AsnArg: 2.071 ± 0.039
3.435AsnSer: 3.435 ± 0.064
3.813AsnThr: 3.813 ± 0.074
3.718AsnVal: 3.718 ± 0.06
0.713AsnTrp: 0.713 ± 0.029
2.902AsnTyr: 2.902 ± 0.06
0.0AsnXaa: 0.0 ± 0.0
Pro
3.079ProAla: 3.079 ± 0.06
0.206ProCys: 0.206 ± 0.014
2.376ProAsp: 2.376 ± 0.044
2.722ProGlu: 2.722 ± 0.057
1.879ProPhe: 1.879 ± 0.041
2.461ProGly: 2.461 ± 0.053
0.635ProHis: 0.635 ± 0.025
2.344ProIle: 2.344 ± 0.044
2.282ProLys: 2.282 ± 0.047
3.226ProLeu: 3.226 ± 0.05
0.742ProMet: 0.742 ± 0.025
2.17ProAsn: 2.17 ± 0.055
0.961ProPro: 0.961 ± 0.037
1.531ProGln: 1.531 ± 0.042
0.914ProArg: 0.914 ± 0.027
1.959ProSer: 1.959 ± 0.044
2.2ProThr: 2.2 ± 0.057
3.019ProVal: 3.019 ± 0.05
0.354ProTrp: 0.354 ± 0.017
1.585ProTyr: 1.585 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
2.595GlnAla: 2.595 ± 0.053
0.214GlnCys: 0.214 ± 0.012
1.787GlnAsp: 1.787 ± 0.04
2.132GlnGlu: 2.132 ± 0.041
1.63GlnPhe: 1.63 ± 0.036
2.009GlnGly: 2.009 ± 0.043
0.697GlnHis: 0.697 ± 0.026
2.338GlnIle: 2.338 ± 0.048
2.715GlnLys: 2.715 ± 0.044
3.56GlnLeu: 3.56 ± 0.064
0.846GlnMet: 0.846 ± 0.026
2.21GlnAsn: 2.21 ± 0.049
1.422GlnPro: 1.422 ± 0.037
1.887GlnGln: 1.887 ± 0.041
1.266GlnArg: 1.266 ± 0.03
2.016GlnSer: 2.016 ± 0.042
2.165GlnThr: 2.165 ± 0.054
2.199GlnVal: 2.199 ± 0.044
0.431GlnTrp: 0.431 ± 0.02
1.619GlnTyr: 1.619 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
2.171ArgAla: 2.171 ± 0.049
0.204ArgCys: 0.204 ± 0.012
1.794ArgAsp: 1.794 ± 0.037
2.167ArgGlu: 2.167 ± 0.048
1.818ArgPhe: 1.818 ± 0.035
1.81ArgGly: 1.81 ± 0.044
0.655ArgHis: 0.655 ± 0.024
2.675ArgIle: 2.675 ± 0.046
2.53ArgLys: 2.53 ± 0.049
3.254ArgLeu: 3.254 ± 0.054
0.911ArgMet: 0.911 ± 0.027
1.954ArgAsn: 1.954 ± 0.034
1.13ArgPro: 1.13 ± 0.034
1.289ArgGln: 1.289 ± 0.031
1.322ArgArg: 1.322 ± 0.034
1.781ArgSer: 1.781 ± 0.04
1.754ArgThr: 1.754 ± 0.041
2.211ArgVal: 2.211 ± 0.047
0.377ArgTrp: 0.377 ± 0.018
1.549ArgTyr: 1.549 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
4.611SerAla: 4.611 ± 0.067
0.608SerCys: 0.608 ± 0.021
3.027SerAsp: 3.027 ± 0.044
2.987SerGlu: 2.987 ± 0.048
3.403SerPhe: 3.403 ± 0.055
4.711SerGly: 4.711 ± 0.067
1.133SerHis: 1.133 ± 0.029
4.381SerIle: 4.381 ± 0.07
3.755SerLys: 3.755 ± 0.056
5.655SerLeu: 5.655 ± 0.073
1.177SerMet: 1.177 ± 0.031
3.268SerAsn: 3.268 ± 0.06
2.254SerPro: 2.254 ± 0.046
2.061SerGln: 2.061 ± 0.049
2.067SerArg: 2.067 ± 0.043
3.479SerSer: 3.479 ± 0.06
3.562SerThr: 3.562 ± 0.07
4.104SerVal: 4.104 ± 0.059
0.628SerTrp: 0.628 ± 0.024
2.686SerTyr: 2.686 ± 0.045
0.0SerXaa: 0.0 ± 0.0
Thr
5.617ThrAla: 5.617 ± 0.119
0.416ThrCys: 0.416 ± 0.021
3.899ThrAsp: 3.899 ± 0.056
3.432ThrGlu: 3.432 ± 0.061
3.179ThrPhe: 3.179 ± 0.05
5.253ThrGly: 5.253 ± 0.101
1.055ThrHis: 1.055 ± 0.031
4.921ThrIle: 4.921 ± 0.074
3.585ThrLys: 3.585 ± 0.052
5.93ThrLeu: 5.93 ± 0.086
1.077ThrMet: 1.077 ± 0.029
3.462ThrAsn: 3.462 ± 0.078
2.94ThrPro: 2.94 ± 0.058
2.251ThrGln: 2.251 ± 0.049
1.766ThrArg: 1.766 ± 0.041
3.713ThrSer: 3.713 ± 0.066
4.842ThrThr: 4.842 ± 0.127
5.027ThrVal: 5.027 ± 0.096
0.646ThrTrp: 0.646 ± 0.026
2.748ThrTyr: 2.748 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
5.489ValAla: 5.489 ± 0.074
0.595ValCys: 0.595 ± 0.023
3.323ValAsp: 3.323 ± 0.05
3.369ValGlu: 3.369 ± 0.055
3.306ValPhe: 3.306 ± 0.051
3.87ValGly: 3.87 ± 0.065
1.044ValHis: 1.044 ± 0.029
4.836ValIle: 4.836 ± 0.071
4.588ValLys: 4.588 ± 0.067
6.185ValLeu: 6.185 ± 0.079
1.493ValMet: 1.493 ± 0.036
3.91ValAsn: 3.91 ± 0.063
2.507ValPro: 2.507 ± 0.047
2.108ValGln: 2.108 ± 0.043
2.101ValArg: 2.101 ± 0.048
4.291ValSer: 4.291 ± 0.07
4.959ValThr: 4.959 ± 0.091
4.776ValVal: 4.776 ± 0.069
0.628ValTrp: 0.628 ± 0.024
2.639ValTyr: 2.639 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.757TrpAla: 0.757 ± 0.029
0.098TrpCys: 0.098 ± 0.009
0.647TrpAsp: 0.647 ± 0.022
0.613TrpGlu: 0.613 ± 0.025
0.538TrpPhe: 0.538 ± 0.021
0.697TrpGly: 0.697 ± 0.027
0.254TrpHis: 0.254 ± 0.014
0.678TrpIle: 0.678 ± 0.023
0.728TrpLys: 0.728 ± 0.023
0.973TrpLeu: 0.973 ± 0.029
0.307TrpMet: 0.307 ± 0.017
0.649TrpAsn: 0.649 ± 0.022
0.298TrpPro: 0.298 ± 0.015
0.508TrpGln: 0.508 ± 0.019
0.374TrpArg: 0.374 ± 0.019
0.593TrpSer: 0.593 ± 0.024
0.557TrpThr: 0.557 ± 0.023
0.665TrpVal: 0.665 ± 0.024
0.155TrpTrp: 0.155 ± 0.011
0.465TrpTyr: 0.465 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.946TyrAla: 2.946 ± 0.05
0.376TyrCys: 0.376 ± 0.016
2.398TyrAsp: 2.398 ± 0.046
2.13TyrGlu: 2.13 ± 0.045
2.45TyrPhe: 2.45 ± 0.049
2.666TyrGly: 2.666 ± 0.045
0.81TyrHis: 0.81 ± 0.026
2.849TyrIle: 2.849 ± 0.054
2.991TyrLys: 2.991 ± 0.049
3.898TyrLeu: 3.898 ± 0.064
0.853TyrMet: 0.853 ± 0.027
2.934TyrAsn: 2.934 ± 0.056
1.619TyrPro: 1.619 ± 0.041
1.527TyrGln: 1.527 ± 0.039
1.634TyrArg: 1.634 ± 0.035
2.806TyrSer: 2.806 ± 0.049
3.138TyrThr: 3.138 ± 0.066
2.393TyrVal: 2.393 ± 0.048
0.498TyrTrp: 0.498 ± 0.022
2.196TyrTyr: 2.196 ± 0.057
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3785 proteins (1272052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski