Amino acid dipeptide frequency for Belliella baltica (strain DSM 15883 / CIP 108006 / LMG 21964 / BA134)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
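As a minimal sketch of how such frequencies can be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the one-letter sequence encoding, and the toy sequences are assumptions made for illustration; they are not taken from the Proteome-pI pipeline, whose exact counting rules are not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for one-letter protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Pairs are counted within a protein only, never across two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the B. baltica proteome.
toy_proteome = ["MKLLA", "MALK"]
freqs = dipeptide_frequencies(toy_proteome)
```

With this counting scheme a protein of length n contributes n - 1 dipeptides, so the frequencies over all observed pairs sum to 1000 per mille.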

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
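The expectation above can be checked with a short calculation. The sketch below derives the uniform expectation in per mille (the unit used in the table) and labels a few values copied from the table as over- or underrepresented. The `classify` helper and the simple greater-than/less-than rule are assumptions for illustration; the page does not state the exact threshold used for the red and blue marking.

```python
# Uniform expectation over the 20 standard amino acids, in per mille.
expected_permille = 1000.0 / 20**2      # 2.5 per mille, i.e. 0.25 %
# The same expectation if Xaa is counted as a 21st residue.
expected_with_xaa = 1000.0 / 21**2      # about 2.268 per mille


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page (per mille).
examples = {"LeuLeu": 9.199, "AlaAla": 4.533, "TrpCys": 0.075}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this rule LeuLeu and AlaAla are labelled as overrepresented and TrpCys as underrepresented.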

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.533 ± 0.083
AlaCys: 0.554 ± 0.025
AlaAsp: 3.345 ± 0.053
AlaGlu: 4.317 ± 0.067
AlaPhe: 3.444 ± 0.052
AlaGly: 4.487 ± 0.079
AlaHis: 1.073 ± 0.033
AlaIle: 5.173 ± 0.077
AlaLys: 4.498 ± 0.076
AlaLeu: 6.234 ± 0.095
AlaMet: 1.658 ± 0.039
AlaAsn: 3.066 ± 0.065
AlaPro: 1.859 ± 0.043
AlaGln: 2.404 ± 0.052
AlaArg: 2.245 ± 0.048
AlaSer: 4.074 ± 0.061
AlaThr: 3.119 ± 0.063
AlaVal: 4.06 ± 0.065
AlaTrp: 0.72 ± 0.026
AlaTyr: 2.473 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.408 ± 0.024
CysCys: 0.117 ± 0.012
CysAsp: 0.377 ± 0.018
CysGlu: 0.45 ± 0.02
CysPhe: 0.375 ± 0.019
CysGly: 0.581 ± 0.025
CysHis: 0.181 ± 0.013
CysIle: 0.473 ± 0.021
CysLys: 0.417 ± 0.017
CysLeu: 0.6 ± 0.022
CysMet: 0.152 ± 0.013
CysAsn: 0.318 ± 0.017
CysPro: 0.338 ± 0.018
CysGln: 0.246 ± 0.014
CysArg: 0.232 ± 0.014
CysSer: 0.492 ± 0.023
CysThr: 0.381 ± 0.02
CysVal: 0.38 ± 0.018
CysTrp: 0.063 ± 0.006
CysTyr: 0.201 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.221 ± 0.058
AspCys: 0.374 ± 0.018
AspAsp: 2.557 ± 0.052
AspGlu: 3.839 ± 0.055
AspPhe: 3.824 ± 0.064
AspGly: 3.684 ± 0.066
AspHis: 1.069 ± 0.031
AspIle: 4.214 ± 0.071
AspLys: 3.815 ± 0.063
AspLeu: 5.883 ± 0.076
AspMet: 1.253 ± 0.037
AspAsn: 2.546 ± 0.054
AspPro: 2.18 ± 0.054
AspGln: 2.267 ± 0.044
AspArg: 2.227 ± 0.046
AspSer: 3.043 ± 0.061
AspThr: 2.201 ± 0.047
AspVal: 3.123 ± 0.057
AspTrp: 0.799 ± 0.022
AspTyr: 2.488 ± 0.048
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.601 ± 0.07
GluCys: 0.284 ± 0.016
GluAsp: 3.808 ± 0.057
GluGlu: 5.984 ± 0.092
GluPhe: 3.47 ± 0.061
GluGly: 4.441 ± 0.055
GluHis: 1.132 ± 0.033
GluIle: 6.245 ± 0.074
GluLys: 6.581 ± 0.107
GluLeu: 6.808 ± 0.094
GluMet: 1.894 ± 0.039
GluAsn: 4.846 ± 0.073
GluPro: 1.738 ± 0.035
GluGln: 2.364 ± 0.047
GluArg: 2.89 ± 0.062
GluSer: 3.884 ± 0.066
GluThr: 3.174 ± 0.051
GluVal: 4.73 ± 0.062
GluTrp: 0.693 ± 0.028
GluTyr: 2.3 ± 0.047
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.168 ± 0.067
PheCys: 0.397 ± 0.022
PheAsp: 3.513 ± 0.053
PheGlu: 3.987 ± 0.06
PhePhe: 3.041 ± 0.063
PheGly: 3.857 ± 0.062
PheHis: 0.942 ± 0.029
PheIle: 3.835 ± 0.066
PheLys: 3.442 ± 0.062
PheLeu: 5.334 ± 0.081
PheMet: 1.236 ± 0.034
PheAsn: 2.941 ± 0.06
PhePro: 1.93 ± 0.039
PheGln: 1.916 ± 0.042
PheArg: 2.024 ± 0.045
PheSer: 4.185 ± 0.061
PheThr: 2.856 ± 0.048
PheVal: 3.161 ± 0.048
PheTrp: 0.647 ± 0.026
PheTyr: 2.174 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.202 ± 0.067
GlyCys: 0.499 ± 0.022
GlyAsp: 3.355 ± 0.063
GlyGlu: 4.242 ± 0.07
GlyPhe: 3.866 ± 0.068
GlyGly: 4.718 ± 0.101
GlyHis: 1.205 ± 0.032
GlyIle: 5.479 ± 0.071
GlyLys: 5.228 ± 0.071
GlyLeu: 6.351 ± 0.089
GlyMet: 1.88 ± 0.049
GlyAsn: 3.518 ± 0.071
GlyPro: 1.558 ± 0.043
GlyGln: 2.202 ± 0.051
GlyArg: 2.606 ± 0.06
GlySer: 4.133 ± 0.072
GlyThr: 3.464 ± 0.062
GlyVal: 4.477 ± 0.066
GlyTrp: 0.766 ± 0.026
GlyTyr: 2.612 ± 0.052
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.111 ± 0.032
HisCys: 0.171 ± 0.012
HisAsp: 0.864 ± 0.031
HisGlu: 1.002 ± 0.033
HisPhe: 1.173 ± 0.033
HisGly: 1.14 ± 0.036
HisHis: 0.519 ± 0.024
HisIle: 1.325 ± 0.039
HisLys: 1.014 ± 0.027
HisLeu: 1.912 ± 0.048
HisMet: 0.353 ± 0.017
HisAsn: 0.736 ± 0.026
HisPro: 1.008 ± 0.028
HisGln: 0.875 ± 0.027
HisArg: 0.702 ± 0.024
HisSer: 1.041 ± 0.031
HisThr: 0.848 ± 0.03
HisVal: 0.989 ± 0.027
HisTrp: 0.198 ± 0.013
HisTyr: 0.808 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.262 ± 0.088
IleCys: 0.626 ± 0.024
IleAsp: 4.586 ± 0.061
IleGlu: 5.473 ± 0.084
IlePhe: 4.031 ± 0.073
IleGly: 5.103 ± 0.081
IleHis: 1.48 ± 0.032
IleIle: 5.845 ± 0.097
IleLys: 5.702 ± 0.085
IleLeu: 7.6 ± 0.102
IleMet: 1.501 ± 0.036
IleAsn: 4.284 ± 0.057
IlePro: 3.476 ± 0.058
IleGln: 3.028 ± 0.057
IleArg: 3.002 ± 0.05
IleSer: 5.84 ± 0.089
IleThr: 3.973 ± 0.063
IleVal: 4.18 ± 0.068
IleTrp: 0.799 ± 0.026
IleTyr: 2.886 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.736 ± 0.066
LysCys: 0.347 ± 0.021
LysAsp: 4.163 ± 0.069
LysGlu: 6.213 ± 0.106
LysPhe: 3.098 ± 0.058
LysGly: 4.525 ± 0.072
LysHis: 1.238 ± 0.03
LysIle: 6.154 ± 0.095
LysLys: 6.067 ± 0.103
LysLeu: 6.54 ± 0.091
LysMet: 2.025 ± 0.045
LysAsn: 4.42 ± 0.066
LysPro: 2.401 ± 0.049
LysGln: 2.247 ± 0.048
LysArg: 2.846 ± 0.058
LysSer: 5.155 ± 0.084
LysThr: 3.731 ± 0.069
LysVal: 4.809 ± 0.078
LysTrp: 0.765 ± 0.028
LysTyr: 2.79 ± 0.051
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.379 ± 0.083
LeuCys: 0.597 ± 0.024
LeuAsp: 5.518 ± 0.073
LeuGlu: 7.23 ± 0.081
LeuPhe: 5.062 ± 0.078
LeuGly: 6.362 ± 0.086
LeuHis: 1.594 ± 0.037
LeuIle: 7.683 ± 0.105
LeuLys: 7.418 ± 0.111
LeuLeu: 9.199 ± 0.118
LeuMet: 2.297 ± 0.052
LeuAsn: 5.268 ± 0.067
LeuPro: 3.762 ± 0.054
LeuGln: 3.32 ± 0.058
LeuArg: 3.796 ± 0.065
LeuSer: 6.89 ± 0.082
LeuThr: 4.993 ± 0.082
LeuVal: 5.723 ± 0.077
LeuTrp: 0.934 ± 0.033
LeuTyr: 3.037 ± 0.054
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.791 ± 0.038
MetCys: 0.114 ± 0.011
MetAsp: 1.347 ± 0.031
MetGlu: 1.719 ± 0.042
MetPhe: 0.858 ± 0.027
MetGly: 1.66 ± 0.045
MetHis: 0.422 ± 0.017
MetIle: 1.936 ± 0.042
MetLys: 2.369 ± 0.046
MetLeu: 2.154 ± 0.052
MetMet: 0.716 ± 0.028
MetAsn: 1.369 ± 0.03
MetPro: 0.978 ± 0.029
MetGln: 0.816 ± 0.027
MetArg: 0.931 ± 0.028
MetSer: 1.341 ± 0.04
MetThr: 1.219 ± 0.029
MetVal: 1.479 ± 0.039
MetTrp: 0.185 ± 0.011
MetTyr: 0.647 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.18 ± 0.061
AsnCys: 0.358 ± 0.02
AsnAsp: 2.714 ± 0.056
AsnGlu: 3.432 ± 0.065
AsnPhe: 3.066 ± 0.058
AsnGly: 3.457 ± 0.078
AsnHis: 1.037 ± 0.033
AsnIle: 4.157 ± 0.062
AsnLys: 3.616 ± 0.069
AsnLeu: 5.708 ± 0.076
AsnMet: 1.201 ± 0.035
AsnAsn: 2.794 ± 0.063
AsnPro: 2.817 ± 0.06
AsnGln: 2.542 ± 0.045
AsnArg: 2.321 ± 0.058
AsnSer: 3.648 ± 0.07
AsnThr: 2.72 ± 0.074
AsnVal: 2.985 ± 0.065
AsnTrp: 0.775 ± 0.029
AsnTyr: 2.289 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.085 ± 0.047
ProCys: 0.2 ± 0.014
ProAsp: 2.309 ± 0.048
ProGlu: 3.149 ± 0.058
ProPhe: 2.015 ± 0.045
ProGly: 2.241 ± 0.057
ProHis: 0.642 ± 0.026
ProIle: 2.932 ± 0.049
ProLys: 2.459 ± 0.054
ProLeu: 3.231 ± 0.05
ProMet: 0.847 ± 0.025
ProAsn: 2.126 ± 0.052
ProPro: 0.855 ± 0.026
ProGln: 1.244 ± 0.033
ProArg: 1.257 ± 0.041
ProSer: 2.336 ± 0.053
ProThr: 1.798 ± 0.043
ProVal: 2.36 ± 0.044
ProTrp: 0.393 ± 0.02
ProTyr: 1.458 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.375 ± 0.046
GlnCys: 0.148 ± 0.011
GlnAsp: 1.784 ± 0.039
GlnGlu: 2.833 ± 0.051
GlnPhe: 1.806 ± 0.036
GlnGly: 2.125 ± 0.048
GlnHis: 0.532 ± 0.022
GlnIle: 2.93 ± 0.053
GlnLys: 2.889 ± 0.06
GlnLeu: 3.502 ± 0.049
GlnMet: 0.94 ± 0.028
GlnAsn: 2.162 ± 0.047
GlnPro: 1.13 ± 0.034
GlnGln: 1.379 ± 0.042
GlnArg: 1.488 ± 0.034
GlnSer: 2.224 ± 0.053
GlnThr: 1.835 ± 0.04
GlnVal: 2.405 ± 0.047
GlnTrp: 0.35 ± 0.017
GlnTyr: 1.369 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.348 ± 0.049
ArgCys: 0.17 ± 0.011
ArgAsp: 2.087 ± 0.052
ArgGlu: 2.82 ± 0.059
ArgPhe: 2.099 ± 0.046
ArgGly: 2.292 ± 0.052
ArgHis: 0.594 ± 0.024
ArgIle: 3.16 ± 0.057
ArgLys: 3.187 ± 0.057
ArgLeu: 3.779 ± 0.061
ArgMet: 1.121 ± 0.03
ArgAsn: 2.331 ± 0.054
ArgPro: 1.375 ± 0.036
ArgGln: 1.359 ± 0.036
ArgArg: 1.59 ± 0.044
ArgSer: 2.224 ± 0.046
ArgThr: 1.932 ± 0.041
ArgVal: 2.495 ± 0.052
ArgTrp: 0.46 ± 0.023
ArgTyr: 1.568 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.731 ± 0.061
SerCys: 0.615 ± 0.021
SerAsp: 3.357 ± 0.055
SerGlu: 4.527 ± 0.069
SerPhe: 3.815 ± 0.059
SerGly: 4.834 ± 0.077
SerHis: 1.109 ± 0.032
SerIle: 5.2 ± 0.071
SerLys: 4.932 ± 0.081
SerLeu: 6.503 ± 0.081
SerMet: 1.48 ± 0.032
SerAsn: 3.671 ± 0.068
SerPro: 2.414 ± 0.056
SerGln: 2.308 ± 0.044
SerArg: 2.599 ± 0.053
SerSer: 4.478 ± 0.062
SerThr: 3.148 ± 0.055
SerVal: 3.602 ± 0.052
SerTrp: 0.785 ± 0.027
SerTyr: 2.532 ± 0.048
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.237 ± 0.056
ThrCys: 0.338 ± 0.018
ThrAsp: 2.724 ± 0.049
ThrGlu: 3.183 ± 0.056
ThrPhe: 2.885 ± 0.057
ThrGly: 3.717 ± 0.068
ThrHis: 0.914 ± 0.03
ThrIle: 3.849 ± 0.056
ThrLys: 3.135 ± 0.055
ThrLeu: 4.933 ± 0.087
ThrMet: 0.958 ± 0.033
ThrAsn: 2.476 ± 0.062
ThrPro: 2.163 ± 0.046
ThrGln: 1.745 ± 0.04
ThrArg: 1.706 ± 0.045
ThrSer: 3.227 ± 0.06
ThrThr: 2.537 ± 0.06
ThrVal: 3.188 ± 0.09
ThrTrp: 0.586 ± 0.023
ThrTyr: 2.003 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.003 ± 0.074
ValCys: 0.497 ± 0.018
ValAsp: 3.475 ± 0.078
ValGlu: 4.228 ± 0.062
ValPhe: 3.5 ± 0.057
ValGly: 3.926 ± 0.067
ValHis: 1.098 ± 0.032
ValIle: 4.715 ± 0.072
ValLys: 4.305 ± 0.061
ValLeu: 5.77 ± 0.077
ValMet: 1.447 ± 0.035
ValAsn: 3.38 ± 0.062
ValPro: 2.179 ± 0.043
ValGln: 1.815 ± 0.04
ValArg: 2.335 ± 0.048
ValSer: 4.199 ± 0.061
ValThr: 3.041 ± 0.092
ValVal: 3.768 ± 0.068
ValTrp: 0.629 ± 0.023
ValTyr: 2.182 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.668 ± 0.025
TrpCys: 0.075 ± 0.008
TrpAsp: 0.666 ± 0.027
TrpGlu: 0.868 ± 0.025
TrpPhe: 0.575 ± 0.024
TrpGly: 0.73 ± 0.026
TrpHis: 0.201 ± 0.013
TrpIle: 0.867 ± 0.029
TrpLys: 0.824 ± 0.028
TrpLeu: 1.015 ± 0.033
TrpMet: 0.368 ± 0.018
TrpAsn: 0.597 ± 0.027
TrpPro: 0.284 ± 0.016
TrpGln: 0.404 ± 0.019
TrpArg: 0.475 ± 0.023
TrpSer: 0.667 ± 0.029
TrpThr: 0.581 ± 0.027
TrpVal: 0.742 ± 0.027
TrpTrp: 0.171 ± 0.015
TrpTyr: 0.419 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.294 ± 0.048
TyrCys: 0.298 ± 0.017
TyrAsp: 2.055 ± 0.046
TyrGlu: 2.374 ± 0.049
TyrPhe: 2.585 ± 0.047
TyrGly: 2.477 ± 0.041
TyrHis: 0.804 ± 0.026
TyrIle: 2.505 ± 0.048
TyrLys: 2.373 ± 0.046
TyrLeu: 4.009 ± 0.061
TyrMet: 0.723 ± 0.027
TyrAsn: 1.97 ± 0.044
TyrPro: 1.494 ± 0.038
TyrGln: 1.724 ± 0.041
TyrArg: 1.719 ± 0.04
TyrSer: 2.563 ± 0.057
TyrThr: 1.954 ± 0.077
TyrVal: 1.836 ± 0.041
TyrTrp: 0.477 ± 0.022
TyrTyr: 1.584 ± 0.039
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3609 proteins (1180055 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
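A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are all assumptions for illustration; the page does not specify how the reported ± values were derived from the replicates.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single one-letter sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Resampling is done over whole proteins, not over single residues.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumed error measure: sample standard deviation across replicates.
    return stdev(estimates)


# Hypothetical toy sequences, not taken from the B. baltica proteome.
toy_proteome = ["MKLLA", "MALK", "LLLL", "MKKA"]
error_ll = bootstrap_error(toy_proteome, "LL")
```

Resampling whole proteins keeps the residues of each protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.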

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski