Amino acid dipepetide frequency for Pedobacter sp. BS3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.415AlaAla: 6.415 ± 0.093
0.7AlaCys: 0.7 ± 0.02
4.61AlaAsp: 4.61 ± 0.053
4.591AlaGlu: 4.591 ± 0.067
3.327AlaPhe: 3.327 ± 0.046
6.286AlaGly: 6.286 ± 0.095
1.264AlaHis: 1.264 ± 0.03
5.483AlaIle: 5.483 ± 0.069
4.506AlaLys: 4.506 ± 0.062
6.999AlaLeu: 6.999 ± 0.083
1.663AlaMet: 1.663 ± 0.038
3.914AlaAsn: 3.914 ± 0.07
2.333AlaPro: 2.333 ± 0.044
2.954AlaGln: 2.954 ± 0.043
2.875AlaArg: 2.875 ± 0.044
4.684AlaSer: 4.684 ± 0.069
4.23AlaThr: 4.23 ± 0.072
4.949AlaVal: 4.949 ± 0.067
0.867AlaTrp: 0.867 ± 0.027
3.163AlaTyr: 3.163 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.532CysAla: 0.532 ± 0.02
0.135CysCys: 0.135 ± 0.012
0.415CysAsp: 0.415 ± 0.017
0.342CysGlu: 0.342 ± 0.018
0.422CysPhe: 0.422 ± 0.016
0.621CysGly: 0.621 ± 0.024
0.198CysHis: 0.198 ± 0.014
0.656CysIle: 0.656 ± 0.022
0.511CysLys: 0.511 ± 0.02
0.767CysLeu: 0.767 ± 0.023
0.216CysMet: 0.216 ± 0.013
0.389CysAsn: 0.389 ± 0.017
0.333CysPro: 0.333 ± 0.016
0.256CysGln: 0.256 ± 0.013
0.38CysArg: 0.38 ± 0.017
0.578CysSer: 0.578 ± 0.018
0.473CysThr: 0.473 ± 0.02
0.468CysVal: 0.468 ± 0.017
0.1CysTrp: 0.1 ± 0.009
0.374CysTyr: 0.374 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.072AspAla: 4.072 ± 0.056
0.387AspCys: 0.387 ± 0.017
2.668AspAsp: 2.668 ± 0.059
3.431AspGlu: 3.431 ± 0.06
3.105AspPhe: 3.105 ± 0.055
3.987AspGly: 3.987 ± 0.07
0.894AspHis: 0.894 ± 0.027
4.406AspIle: 4.406 ± 0.056
4.048AspLys: 4.048 ± 0.064
4.681AspLeu: 4.681 ± 0.058
1.228AspMet: 1.228 ± 0.03
3.112AspAsn: 3.112 ± 0.066
2.054AspPro: 2.054 ± 0.044
1.502AspGln: 1.502 ± 0.033
2.065AspArg: 2.065 ± 0.04
2.754AspSer: 2.754 ± 0.049
2.716AspThr: 2.716 ± 0.057
3.441AspVal: 3.441 ± 0.052
0.797AspTrp: 0.797 ± 0.027
2.63AspTyr: 2.63 ± 0.061
0.0AspXaa: 0.0 ± 0.0
Glu
4.296GluAla: 4.296 ± 0.063
0.348GluCys: 0.348 ± 0.017
2.65GluAsp: 2.65 ± 0.051
3.304GluGlu: 3.304 ± 0.073
2.276GluPhe: 2.276 ± 0.041
3.058GluGly: 3.058 ± 0.052
1.209GluHis: 1.209 ± 0.03
4.08GluIle: 4.08 ± 0.061
4.403GluLys: 4.403 ± 0.068
5.528GluLeu: 5.528 ± 0.079
1.273GluMet: 1.273 ± 0.029
3.211GluAsn: 3.211 ± 0.055
1.845GluPro: 1.845 ± 0.038
2.662GluGln: 2.662 ± 0.059
2.54GluArg: 2.54 ± 0.048
2.68GluSer: 2.68 ± 0.053
3.093GluThr: 3.093 ± 0.044
3.607GluVal: 3.607 ± 0.052
0.602GluTrp: 0.602 ± 0.021
2.086GluTyr: 2.086 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.181PheAla: 3.181 ± 0.042
0.471PheCys: 0.471 ± 0.022
2.773PheAsp: 2.773 ± 0.044
2.523PheGlu: 2.523 ± 0.04
2.199PhePhe: 2.199 ± 0.041
3.109PheGly: 3.109 ± 0.052
0.744PheHis: 0.744 ± 0.026
3.452PheIle: 3.452 ± 0.055
2.972PheLys: 2.972 ± 0.054
3.968PheLeu: 3.968 ± 0.063
1.022PheMet: 1.022 ± 0.028
3.028PheAsn: 3.028 ± 0.047
1.738PhePro: 1.738 ± 0.038
1.297PheGln: 1.297 ± 0.033
1.959PheArg: 1.959 ± 0.04
3.493PheSer: 3.493 ± 0.055
3.231PheThr: 3.231 ± 0.05
2.589PheVal: 2.589 ± 0.05
0.604PheTrp: 0.604 ± 0.022
2.129PheTyr: 2.129 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
4.578GlyAla: 4.578 ± 0.077
0.657GlyCys: 0.657 ± 0.025
3.454GlyAsp: 3.454 ± 0.056
3.269GlyGlu: 3.269 ± 0.051
3.47GlyPhe: 3.47 ± 0.045
4.949GlyGly: 4.949 ± 0.091
1.192GlyHis: 1.192 ± 0.035
5.475GlyIle: 5.475 ± 0.074
5.438GlyLys: 5.438 ± 0.072
6.122GlyLeu: 6.122 ± 0.081
1.66GlyMet: 1.66 ± 0.03
4.167GlyAsn: 4.167 ± 0.069
1.616GlyPro: 1.616 ± 0.035
2.327GlyGln: 2.327 ± 0.048
2.781GlyArg: 2.781 ± 0.039
4.595GlySer: 4.595 ± 0.072
4.624GlyThr: 4.624 ± 0.103
4.319GlyVal: 4.319 ± 0.064
0.971GlyTrp: 0.971 ± 0.025
3.441GlyTyr: 3.441 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
1.224HisAla: 1.224 ± 0.034
0.206HisCys: 0.206 ± 0.011
0.873HisAsp: 0.873 ± 0.026
0.981HisGlu: 0.981 ± 0.025
1.091HisPhe: 1.091 ± 0.028
1.173HisGly: 1.173 ± 0.033
0.555HisHis: 0.555 ± 0.02
1.619HisIle: 1.619 ± 0.039
1.024HisLys: 1.024 ± 0.026
1.757HisLeu: 1.757 ± 0.042
0.285HisMet: 0.285 ± 0.014
0.997HisAsn: 0.997 ± 0.03
1.041HisPro: 1.041 ± 0.032
0.767HisGln: 0.767 ± 0.027
0.861HisArg: 0.861 ± 0.022
1.047HisSer: 1.047 ± 0.028
1.145HisThr: 1.145 ± 0.029
1.05HisVal: 1.05 ± 0.032
0.267HisTrp: 0.267 ± 0.014
0.914HisTyr: 0.914 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.925IleAla: 5.925 ± 0.07
0.649IleCys: 0.649 ± 0.024
4.143IleAsp: 4.143 ± 0.048
3.962IleGlu: 3.962 ± 0.061
2.773IlePhe: 2.773 ± 0.051
4.623IleGly: 4.623 ± 0.068
1.382IleHis: 1.382 ± 0.033
5.235IleIle: 5.235 ± 0.07
4.591IleLys: 4.591 ± 0.059
5.978IleLeu: 5.978 ± 0.077
1.321IleMet: 1.321 ± 0.027
4.108IleAsn: 4.108 ± 0.059
3.12IlePro: 3.12 ± 0.054
2.329IleGln: 2.329 ± 0.038
3.521IleArg: 3.521 ± 0.055
5.177IleSer: 5.177 ± 0.071
4.856IleThr: 4.856 ± 0.081
4.132IleVal: 4.132 ± 0.064
0.734IleTrp: 0.734 ± 0.027
2.726IleTyr: 2.726 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
5.223LysAla: 5.223 ± 0.065
0.312LysCys: 0.312 ± 0.013
3.829LysAsp: 3.829 ± 0.063
4.133LysGlu: 4.133 ± 0.066
2.47LysPhe: 2.47 ± 0.046
4.187LysGly: 4.187 ± 0.056
1.254LysHis: 1.254 ± 0.029
4.73LysIle: 4.73 ± 0.063
5.075LysLys: 5.075 ± 0.071
5.986LysLeu: 5.986 ± 0.079
1.558LysMet: 1.558 ± 0.034
4.115LysAsn: 4.115 ± 0.057
3.023LysPro: 3.023 ± 0.057
3.001LysGln: 3.001 ± 0.053
2.967LysArg: 2.967 ± 0.046
3.713LysSer: 3.713 ± 0.053
4.176LysThr: 4.176 ± 0.056
4.043LysVal: 4.043 ± 0.054
0.804LysTrp: 0.804 ± 0.025
2.787LysTyr: 2.787 ± 0.041
0.0LysXaa: 0.0 ± 0.0
Leu
6.784LeuAla: 6.784 ± 0.077
0.821LeuCys: 0.821 ± 0.025
4.624LeuAsp: 4.624 ± 0.067
4.723LeuGlu: 4.723 ± 0.068
4.452LeuPhe: 4.452 ± 0.07
5.476LeuGly: 5.476 ± 0.077
1.771LeuHis: 1.771 ± 0.035
6.119LeuIle: 6.119 ± 0.088
6.938LeuLys: 6.938 ± 0.082
9.222LeuLeu: 9.222 ± 0.113
1.974LeuMet: 1.974 ± 0.041
5.531LeuAsn: 5.531 ± 0.071
4.13LeuPro: 4.13 ± 0.055
3.768LeuGln: 3.768 ± 0.063
3.797LeuArg: 3.797 ± 0.063
6.817LeuSer: 6.817 ± 0.074
5.711LeuThr: 5.711 ± 0.071
5.502LeuVal: 5.502 ± 0.074
1.022LeuTrp: 1.022 ± 0.029
3.584LeuTyr: 3.584 ± 0.054
0.0LeuXaa: 0.0 ± 0.0
Met
1.807MetAla: 1.807 ± 0.039
0.149MetCys: 0.149 ± 0.009
1.082MetAsp: 1.082 ± 0.03
1.15MetGlu: 1.15 ± 0.03
0.75MetPhe: 0.75 ± 0.024
1.4MetGly: 1.4 ± 0.035
0.42MetHis: 0.42 ± 0.018
1.281MetIle: 1.281 ± 0.027
1.817MetLys: 1.817 ± 0.037
2.107MetLeu: 2.107 ± 0.04
0.542MetMet: 0.542 ± 0.019
1.165MetAsn: 1.165 ± 0.026
1.059MetPro: 1.059 ± 0.028
0.965MetGln: 0.965 ± 0.027
0.958MetArg: 0.958 ± 0.029
1.239MetSer: 1.239 ± 0.031
0.978MetThr: 0.978 ± 0.027
1.3MetVal: 1.3 ± 0.032
0.195MetTrp: 0.195 ± 0.012
0.729MetTyr: 0.729 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
4.143AsnAla: 4.143 ± 0.067
0.435AsnCys: 0.435 ± 0.021
2.798AsnAsp: 2.798 ± 0.049
2.936AsnGlu: 2.936 ± 0.05
2.501AsnPhe: 2.501 ± 0.043
4.313AsnGly: 4.313 ± 0.077
1.011AsnHis: 1.011 ± 0.026
4.156AsnIle: 4.156 ± 0.06
3.549AsnLys: 3.549 ± 0.054
5.051AsnLeu: 5.051 ± 0.069
1.17AsnMet: 1.17 ± 0.028
3.536AsnAsn: 3.536 ± 0.07
2.969AsnPro: 2.969 ± 0.045
2.106AsnGln: 2.106 ± 0.039
2.669AsnArg: 2.669 ± 0.052
3.389AsnSer: 3.389 ± 0.06
3.606AsnThr: 3.606 ± 0.067
3.223AsnVal: 3.223 ± 0.051
0.83AsnTrp: 0.83 ± 0.026
2.753AsnTyr: 2.753 ± 0.051
0.0AsnXaa: 0.0 ± 0.0
Pro
3.518ProAla: 3.518 ± 0.057
0.241ProCys: 0.241 ± 0.014
2.944ProAsp: 2.944 ± 0.048
3.003ProGlu: 3.003 ± 0.054
1.837ProPhe: 1.837 ± 0.038
3.2ProGly: 3.2 ± 0.054
0.731ProHis: 0.731 ± 0.028
1.969ProIle: 1.969 ± 0.045
2.039ProLys: 2.039 ± 0.039
3.492ProLeu: 3.492 ± 0.049
0.639ProMet: 0.639 ± 0.023
1.883ProAsn: 1.883 ± 0.039
1.056ProPro: 1.056 ± 0.031
1.6ProGln: 1.6 ± 0.035
1.229ProArg: 1.229 ± 0.032
2.209ProSer: 2.209 ± 0.043
1.877ProThr: 1.877 ± 0.043
3.847ProVal: 3.847 ± 0.059
0.45ProTrp: 0.45 ± 0.021
1.682ProTyr: 1.682 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
3.039GlnAla: 3.039 ± 0.048
0.212GlnCys: 0.212 ± 0.012
1.689GlnAsp: 1.689 ± 0.035
1.954GlnGlu: 1.954 ± 0.047
1.66GlnPhe: 1.66 ± 0.037
2.179GlnGly: 2.179 ± 0.041
0.937GlnHis: 0.937 ± 0.03
2.321GlnIle: 2.321 ± 0.047
2.598GlnLys: 2.598 ± 0.046
4.086GlnLeu: 4.086 ± 0.056
0.78GlnMet: 0.78 ± 0.024
2.145GlnAsn: 2.145 ± 0.038
1.64GlnPro: 1.64 ± 0.031
2.309GlnGln: 2.309 ± 0.057
1.638GlnArg: 1.638 ± 0.034
2.162GlnSer: 2.162 ± 0.04
2.276GlnThr: 2.276 ± 0.044
2.612GlnVal: 2.612 ± 0.047
0.507GlnTrp: 0.507 ± 0.022
1.693GlnTyr: 1.693 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
2.538ArgAla: 2.538 ± 0.046
0.292ArgCys: 0.292 ± 0.015
2.099ArgAsp: 2.099 ± 0.04
2.433ArgGlu: 2.433 ± 0.046
2.204ArgPhe: 2.204 ± 0.04
2.257ArgGly: 2.257 ± 0.04
0.821ArgHis: 0.821 ± 0.023
3.358ArgIle: 3.358 ± 0.048
3.09ArgLys: 3.09 ± 0.053
4.284ArgLeu: 4.284 ± 0.065
1.087ArgMet: 1.087 ± 0.028
2.437ArgAsn: 2.437 ± 0.043
1.428ArgPro: 1.428 ± 0.031
1.857ArgGln: 1.857 ± 0.047
1.76ArgArg: 1.76 ± 0.038
2.45ArgSer: 2.45 ± 0.039
2.263ArgThr: 2.263 ± 0.044
2.61ArgVal: 2.61 ± 0.045
0.642ArgTrp: 0.642 ± 0.021
2.199ArgTyr: 2.199 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
5.001SerAla: 5.001 ± 0.068
0.57SerCys: 0.57 ± 0.019
3.37SerAsp: 3.37 ± 0.045
3.015SerGlu: 3.015 ± 0.047
3.248SerPhe: 3.248 ± 0.056
5.485SerGly: 5.485 ± 0.079
1.09SerHis: 1.09 ± 0.029
4.187SerIle: 4.187 ± 0.05
3.652SerLys: 3.652 ± 0.055
5.953SerLeu: 5.953 ± 0.067
1.176SerMet: 1.176 ± 0.029
3.181SerAsn: 3.181 ± 0.049
2.543SerPro: 2.543 ± 0.046
2.05SerGln: 2.05 ± 0.041
2.747SerArg: 2.747 ± 0.045
4.194SerSer: 4.194 ± 0.073
3.63SerThr: 3.63 ± 0.06
4.256SerVal: 4.256 ± 0.057
0.854SerTrp: 0.854 ± 0.027
2.893SerTyr: 2.893 ± 0.059
0.0SerXaa: 0.0 ± 0.0
Thr
4.98ThrAla: 4.98 ± 0.085
0.437ThrCys: 0.437 ± 0.02
3.735ThrAsp: 3.735 ± 0.059
3.13ThrGlu: 3.13 ± 0.046
2.872ThrPhe: 2.872 ± 0.046
5.382ThrGly: 5.382 ± 0.079
1.059ThrHis: 1.059 ± 0.03
4.393ThrIle: 4.393 ± 0.086
2.875ThrLys: 2.875 ± 0.049
5.556ThrLeu: 5.556 ± 0.083
0.995ThrMet: 0.995 ± 0.027
2.903ThrAsn: 2.903 ± 0.05
2.701ThrPro: 2.701 ± 0.047
1.922ThrGln: 1.922 ± 0.039
2.127ThrArg: 2.127 ± 0.039
3.717ThrSer: 3.717 ± 0.065
3.628ThrThr: 3.628 ± 0.073
4.381ThrVal: 4.381 ± 0.069
0.79ThrTrp: 0.79 ± 0.028
2.704ThrTyr: 2.704 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
4.638ValAla: 4.638 ± 0.061
0.633ValCys: 0.633 ± 0.022
3.227ValAsp: 3.227 ± 0.053
3.143ValGlu: 3.143 ± 0.053
3.015ValPhe: 3.015 ± 0.051
3.544ValGly: 3.544 ± 0.065
1.085ValHis: 1.085 ± 0.031
4.692ValIle: 4.692 ± 0.067
4.642ValLys: 4.642 ± 0.054
5.96ValLeu: 5.96 ± 0.081
1.336ValMet: 1.336 ± 0.033
3.854ValAsn: 3.854 ± 0.054
2.48ValPro: 2.48 ± 0.044
2.222ValGln: 2.222 ± 0.035
2.497ValArg: 2.497 ± 0.041
4.621ValSer: 4.621 ± 0.071
4.194ValThr: 4.194 ± 0.088
4.037ValVal: 4.037 ± 0.065
0.772ValTrp: 0.772 ± 0.026
2.749ValTyr: 2.749 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.803TrpAla: 0.803 ± 0.023
0.14TrpCys: 0.14 ± 0.01
0.708TrpAsp: 0.708 ± 0.026
0.655TrpGlu: 0.655 ± 0.022
0.608TrpPhe: 0.608 ± 0.021
0.889TrpGly: 0.889 ± 0.026
0.323TrpHis: 0.323 ± 0.017
0.821TrpIle: 0.821 ± 0.025
0.876TrpLys: 0.876 ± 0.027
1.211TrpLeu: 1.211 ± 0.032
0.349TrpMet: 0.349 ± 0.019
0.798TrpAsn: 0.798 ± 0.027
0.397TrpPro: 0.397 ± 0.018
0.637TrpGln: 0.637 ± 0.02
0.54TrpArg: 0.54 ± 0.019
0.707TrpSer: 0.707 ± 0.024
0.695TrpThr: 0.695 ± 0.028
0.706TrpVal: 0.706 ± 0.021
0.221TrpTrp: 0.221 ± 0.013
0.48TrpTyr: 0.48 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.115TyrAla: 3.115 ± 0.051
0.372TyrCys: 0.372 ± 0.017
2.284TyrAsp: 2.284 ± 0.043
2.072TyrGlu: 2.072 ± 0.04
2.201TyrPhe: 2.201 ± 0.05
2.988TyrGly: 2.988 ± 0.046
0.936TyrHis: 0.936 ± 0.028
2.849TyrIle: 2.849 ± 0.047
2.743TyrLys: 2.743 ± 0.042
4.057TyrLeu: 4.057 ± 0.065
0.818TyrMet: 0.818 ± 0.024
2.672TyrAsn: 2.672 ± 0.051
1.938TyrPro: 1.938 ± 0.037
1.908TyrGln: 1.908 ± 0.04
2.199TyrArg: 2.199 ± 0.043
2.872TyrSer: 2.872 ± 0.049
2.886TyrThr: 2.886 ± 0.046
2.269TyrVal: 2.269 ± 0.047
0.534TyrTrp: 0.534 ± 0.019
2.16TyrTyr: 2.16 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3935 proteins (1391832 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski