Amino acid dipepetide frequency for Desulfobotulus alkaliphilus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.407AlaAla: 9.407 ± 0.118
1.334AlaCys: 1.334 ± 0.04
4.596AlaAsp: 4.596 ± 0.069
6.175AlaGlu: 6.175 ± 0.08
4.373AlaPhe: 4.373 ± 0.073
7.962AlaGly: 7.962 ± 0.097
1.755AlaHis: 1.755 ± 0.049
5.076AlaIle: 5.076 ± 0.071
3.458AlaLys: 3.458 ± 0.058
10.204AlaLeu: 10.204 ± 0.119
3.51AlaMet: 3.51 ± 0.053
1.956AlaAsn: 1.956 ± 0.047
3.157AlaPro: 3.157 ± 0.061
2.196AlaGln: 2.196 ± 0.043
5.432AlaArg: 5.432 ± 0.08
5.678AlaSer: 5.678 ± 0.079
3.569AlaThr: 3.569 ± 0.07
5.971AlaVal: 5.971 ± 0.087
0.937AlaTrp: 0.937 ± 0.031
2.173AlaTyr: 2.173 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.842CysAla: 0.842 ± 0.032
0.235CysCys: 0.235 ± 0.014
0.581CysAsp: 0.581 ± 0.023
0.587CysGlu: 0.587 ± 0.021
0.566CysPhe: 0.566 ± 0.024
1.251CysGly: 1.251 ± 0.038
0.411CysHis: 0.411 ± 0.025
0.774CysIle: 0.774 ± 0.028
0.427CysLys: 0.427 ± 0.019
1.299CysLeu: 1.299 ± 0.035
0.361CysMet: 0.361 ± 0.019
0.32CysAsn: 0.32 ± 0.018
0.734CysPro: 0.734 ± 0.031
0.32CysGln: 0.32 ± 0.017
0.932CysArg: 0.932 ± 0.033
0.72CysSer: 0.72 ± 0.032
0.589CysThr: 0.589 ± 0.022
0.728CysVal: 0.728 ± 0.029
0.135CysTrp: 0.135 ± 0.013
0.267CysTyr: 0.267 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.215AspAla: 5.215 ± 0.072
0.56AspCys: 0.56 ± 0.023
2.209AspAsp: 2.209 ± 0.052
3.196AspGlu: 3.196 ± 0.059
2.933AspPhe: 2.933 ± 0.054
3.866AspGly: 3.866 ± 0.082
1.159AspHis: 1.159 ± 0.026
3.968AspIle: 3.968 ± 0.066
2.349AspLys: 2.349 ± 0.051
6.058AspLeu: 6.058 ± 0.087
1.804AspMet: 1.804 ± 0.04
1.405AspAsn: 1.405 ± 0.035
2.788AspPro: 2.788 ± 0.054
1.551AspGln: 1.551 ± 0.037
3.503AspArg: 3.503 ± 0.06
2.699AspSer: 2.699 ± 0.048
3.017AspThr: 3.017 ± 0.053
3.267AspVal: 3.267 ± 0.066
0.723AspTrp: 0.723 ± 0.028
1.706AspTyr: 1.706 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
6.945GluAla: 6.945 ± 0.094
0.45GluCys: 0.45 ± 0.022
4.126GluAsp: 4.126 ± 0.064
5.755GluGlu: 5.755 ± 0.097
1.901GluPhe: 1.901 ± 0.04
5.296GluGly: 5.296 ± 0.072
1.094GluHis: 1.094 ± 0.029
4.88GluIle: 4.88 ± 0.076
6.799GluLys: 6.799 ± 0.091
5.796GluLeu: 5.796 ± 0.069
2.151GluMet: 2.151 ± 0.045
3.381GluAsn: 3.381 ± 0.064
2.13GluPro: 2.13 ± 0.051
1.993GluGln: 1.993 ± 0.052
4.218GluArg: 4.218 ± 0.063
3.838GluSer: 3.838 ± 0.061
3.868GluThr: 3.868 ± 0.059
4.199GluVal: 4.199 ± 0.067
0.641GluTrp: 0.641 ± 0.026
1.46GluTyr: 1.46 ± 0.037
0.0GluXaa: 0.0 ± 0.0
Phe
3.183PheAla: 3.183 ± 0.06
0.713PheCys: 0.713 ± 0.026
2.456PheAsp: 2.456 ± 0.043
2.504PheGlu: 2.504 ± 0.047
2.817PhePhe: 2.817 ± 0.064
3.026PheGly: 3.026 ± 0.048
1.139PheHis: 1.139 ± 0.032
2.631PheIle: 2.631 ± 0.052
1.665PheLys: 1.665 ± 0.043
4.83PheLeu: 4.83 ± 0.072
1.362PheMet: 1.362 ± 0.036
1.286PheAsn: 1.286 ± 0.032
2.058PhePro: 2.058 ± 0.043
1.486PheGln: 1.486 ± 0.039
2.792PheArg: 2.792 ± 0.05
3.778PheSer: 3.778 ± 0.069
2.504PheThr: 2.504 ± 0.051
2.651PheVal: 2.651 ± 0.052
0.588PheTrp: 0.588 ± 0.023
1.399PheTyr: 1.399 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.537GlyAla: 5.537 ± 0.082
1.094GlyCys: 1.094 ± 0.036
3.783GlyAsp: 3.783 ± 0.068
5.225GlyGlu: 5.225 ± 0.077
3.745GlyPhe: 3.745 ± 0.057
5.711GlyGly: 5.711 ± 0.097
1.732GlyHis: 1.732 ± 0.047
5.431GlyIle: 5.431 ± 0.079
4.593GlyLys: 4.593 ± 0.068
8.34GlyLeu: 8.34 ± 0.104
2.796GlyMet: 2.796 ± 0.051
2.432GlyAsn: 2.432 ± 0.053
2.561GlyPro: 2.561 ± 0.054
2.386GlyGln: 2.386 ± 0.057
4.988GlyArg: 4.988 ± 0.081
4.69GlySer: 4.69 ± 0.083
3.667GlyThr: 3.667 ± 0.067
4.907GlyVal: 4.907 ± 0.077
0.953GlyTrp: 0.953 ± 0.031
2.32GlyTyr: 2.32 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
1.877HisAla: 1.877 ± 0.044
0.324HisCys: 0.324 ± 0.018
0.979HisAsp: 0.979 ± 0.028
1.345HisGlu: 1.345 ± 0.03
1.105HisPhe: 1.105 ± 0.032
1.99HisGly: 1.99 ± 0.046
0.581HisHis: 0.581 ± 0.023
1.414HisIle: 1.414 ± 0.035
0.912HisLys: 0.912 ± 0.028
2.488HisLeu: 2.488 ± 0.057
0.701HisMet: 0.701 ± 0.028
0.595HisAsn: 0.595 ± 0.022
1.547HisPro: 1.547 ± 0.037
0.661HisGln: 0.661 ± 0.027
1.501HisArg: 1.501 ± 0.039
1.14HisSer: 1.14 ± 0.03
1.219HisThr: 1.219 ± 0.035
1.282HisVal: 1.282 ± 0.038
0.246HisTrp: 0.246 ± 0.016
0.709HisTyr: 0.709 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.034IleAla: 5.034 ± 0.088
0.842IleCys: 0.842 ± 0.026
2.634IleAsp: 2.634 ± 0.051
3.557IleGlu: 3.557 ± 0.064
2.904IlePhe: 2.904 ± 0.054
3.906IleGly: 3.906 ± 0.079
1.757IleHis: 1.757 ± 0.036
3.203IleIle: 3.203 ± 0.067
2.518IleLys: 2.518 ± 0.053
7.353IleLeu: 7.353 ± 0.098
1.449IleMet: 1.449 ± 0.039
1.832IleAsn: 1.832 ± 0.043
3.548IlePro: 3.548 ± 0.063
2.232IleGln: 2.232 ± 0.043
5.315IleArg: 5.315 ± 0.069
4.039IleSer: 4.039 ± 0.055
3.121IleThr: 3.121 ± 0.061
3.299IleVal: 3.299 ± 0.054
0.626IleTrp: 0.626 ± 0.024
1.52IleTyr: 1.52 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
5.011LysAla: 5.011 ± 0.08
0.376LysCys: 0.376 ± 0.019
3.392LysAsp: 3.392 ± 0.051
4.358LysGlu: 4.358 ± 0.077
1.194LysPhe: 1.194 ± 0.036
4.152LysGly: 4.152 ± 0.061
0.797LysHis: 0.797 ± 0.027
3.437LysIle: 3.437 ± 0.061
4.625LysLys: 4.625 ± 0.085
3.697LysLeu: 3.697 ± 0.061
1.409LysMet: 1.409 ± 0.036
2.745LysAsn: 2.745 ± 0.056
2.25LysPro: 2.25 ± 0.05
1.4LysGln: 1.4 ± 0.037
2.737LysArg: 2.737 ± 0.047
2.731LysSer: 2.731 ± 0.052
3.119LysThr: 3.119 ± 0.052
2.738LysVal: 2.738 ± 0.061
0.427LysTrp: 0.427 ± 0.021
1.108LysTyr: 1.108 ± 0.036
0.0LysXaa: 0.0 ± 0.0
Leu
9.788LeuAla: 9.788 ± 0.103
1.468LeuCys: 1.468 ± 0.04
5.704LeuAsp: 5.704 ± 0.076
7.489LeuGlu: 7.489 ± 0.096
5.0LeuPhe: 5.0 ± 0.083
7.087LeuGly: 7.087 ± 0.099
2.402LeuHis: 2.402 ± 0.047
5.598LeuIle: 5.598 ± 0.085
5.763LeuLys: 5.763 ± 0.082
10.737LeuLeu: 10.737 ± 0.132
3.145LeuMet: 3.145 ± 0.059
3.171LeuAsn: 3.171 ± 0.05
5.525LeuPro: 5.525 ± 0.08
3.377LeuGln: 3.377 ± 0.065
6.239LeuArg: 6.239 ± 0.098
7.491LeuSer: 7.491 ± 0.092
4.865LeuThr: 4.865 ± 0.064
6.723LeuVal: 6.723 ± 0.081
1.099LeuTrp: 1.099 ± 0.037
2.62LeuTyr: 2.62 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
3.765MetAla: 3.765 ± 0.063
0.173MetCys: 0.173 ± 0.013
2.49MetAsp: 2.49 ± 0.048
2.987MetGlu: 2.987 ± 0.052
0.727MetPhe: 0.727 ± 0.029
2.927MetGly: 2.927 ± 0.057
0.589MetHis: 0.589 ± 0.023
1.518MetIle: 1.518 ± 0.041
1.776MetLys: 1.776 ± 0.039
2.707MetLeu: 2.707 ± 0.05
0.795MetMet: 0.795 ± 0.03
1.113MetAsn: 1.113 ± 0.034
1.45MetPro: 1.45 ± 0.032
1.219MetGln: 1.219 ± 0.034
1.553MetArg: 1.553 ± 0.036
1.334MetSer: 1.334 ± 0.037
1.554MetThr: 1.554 ± 0.032
2.165MetVal: 2.165 ± 0.05
0.157MetTrp: 0.157 ± 0.013
0.462MetTyr: 0.462 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.776AsnAla: 2.776 ± 0.054
0.354AsnCys: 0.354 ± 0.019
1.347AsnAsp: 1.347 ± 0.035
1.678AsnGlu: 1.678 ± 0.043
1.325AsnPhe: 1.325 ± 0.036
2.135AsnGly: 2.135 ± 0.045
0.822AsnHis: 0.822 ± 0.028
2.236AsnIle: 2.236 ± 0.045
1.269AsnLys: 1.269 ± 0.035
3.56AsnLeu: 3.56 ± 0.061
0.902AsnMet: 0.902 ± 0.027
1.024AsnAsn: 1.024 ± 0.033
2.251AsnPro: 2.251 ± 0.048
1.126AsnGln: 1.126 ± 0.031
2.29AsnArg: 2.29 ± 0.048
1.492AsnSer: 1.492 ± 0.037
1.761AsnThr: 1.761 ± 0.049
1.687AsnVal: 1.687 ± 0.045
0.376AsnTrp: 0.376 ± 0.02
0.878AsnTyr: 0.878 ± 0.032
0.0AsnXaa: 0.0 ± 0.0
Pro
4.05ProAla: 4.05 ± 0.076
0.522ProCys: 0.522 ± 0.024
3.55ProAsp: 3.55 ± 0.058
5.34ProGlu: 5.34 ± 0.071
2.246ProPhe: 2.246 ± 0.047
3.83ProGly: 3.83 ± 0.071
1.07ProHis: 1.07 ± 0.033
1.792ProIle: 1.792 ± 0.042
1.613ProLys: 1.613 ± 0.046
4.864ProLeu: 4.864 ± 0.078
1.496ProMet: 1.496 ± 0.035
0.907ProAsn: 0.907 ± 0.029
2.16ProPro: 2.16 ± 0.05
1.209ProGln: 1.209 ± 0.034
1.974ProArg: 1.974 ± 0.041
2.561ProSer: 2.561 ± 0.048
1.4ProThr: 1.4 ± 0.04
4.107ProVal: 4.107 ± 0.068
0.645ProTrp: 0.645 ± 0.023
1.125ProTyr: 1.125 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
3.19GlnAla: 3.19 ± 0.061
0.261GlnCys: 0.261 ± 0.015
1.763GlnAsp: 1.763 ± 0.041
2.326GlnGlu: 2.326 ± 0.04
0.862GlnPhe: 0.862 ± 0.027
2.347GlnGly: 2.347 ± 0.042
0.649GlnHis: 0.649 ± 0.026
1.936GlnIle: 1.936 ± 0.047
2.209GlnLys: 2.209 ± 0.05
2.532GlnLeu: 2.532 ± 0.045
0.981GlnMet: 0.981 ± 0.033
1.265GlnAsn: 1.265 ± 0.032
1.378GlnPro: 1.378 ± 0.037
1.283GlnGln: 1.283 ± 0.037
2.095GlnArg: 2.095 ± 0.052
1.772GlnSer: 1.772 ± 0.04
1.701GlnThr: 1.701 ± 0.046
2.03GlnVal: 2.03 ± 0.044
0.409GlnTrp: 0.409 ± 0.019
0.683GlnTyr: 0.683 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
4.426ArgAla: 4.426 ± 0.077
0.7ArgCys: 0.7 ± 0.031
3.177ArgAsp: 3.177 ± 0.057
4.904ArgGlu: 4.904 ± 0.071
3.001ArgPhe: 3.001 ± 0.049
3.488ArgGly: 3.488 ± 0.057
1.594ArgHis: 1.594 ± 0.041
5.095ArgIle: 5.095 ± 0.07
3.655ArgLys: 3.655 ± 0.066
6.671ArgLeu: 6.671 ± 0.079
2.495ArgMet: 2.495 ± 0.042
2.137ArgAsn: 2.137 ± 0.044
2.562ArgPro: 2.562 ± 0.053
2.493ArgGln: 2.493 ± 0.055
3.842ArgArg: 3.842 ± 0.066
3.642ArgSer: 3.642 ± 0.055
2.735ArgThr: 2.735 ± 0.045
4.009ArgVal: 4.009 ± 0.065
0.742ArgTrp: 0.742 ± 0.031
1.879ArgTyr: 1.879 ± 0.041
0.001ArgXaa: 0.001 ± 0.001
Ser
4.836SerAla: 4.836 ± 0.066
0.73SerCys: 0.73 ± 0.029
3.16SerAsp: 3.16 ± 0.055
3.93SerGlu: 3.93 ± 0.067
2.986SerPhe: 2.986 ± 0.052
5.843SerGly: 5.843 ± 0.092
1.513SerHis: 1.513 ± 0.037
3.433SerIle: 3.433 ± 0.059
1.962SerLys: 1.962 ± 0.042
7.227SerLeu: 7.227 ± 0.084
1.967SerMet: 1.967 ± 0.043
1.291SerAsn: 1.291 ± 0.034
3.128SerPro: 3.128 ± 0.064
1.806SerGln: 1.806 ± 0.042
4.045SerArg: 4.045 ± 0.063
3.674SerSer: 3.674 ± 0.072
2.531SerThr: 2.531 ± 0.054
3.689SerVal: 3.689 ± 0.058
0.773SerTrp: 0.773 ± 0.029
1.514SerTyr: 1.514 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
4.64ThrAla: 4.64 ± 0.076
0.605ThrCys: 0.605 ± 0.024
2.623ThrAsp: 2.623 ± 0.044
3.143ThrGlu: 3.143 ± 0.056
2.041ThrPhe: 2.041 ± 0.044
5.334ThrGly: 5.334 ± 0.089
1.119ThrHis: 1.119 ± 0.029
2.684ThrIle: 2.684 ± 0.053
1.6ThrLys: 1.6 ± 0.037
5.955ThrLeu: 5.955 ± 0.074
1.224ThrMet: 1.224 ± 0.032
1.087ThrAsn: 1.087 ± 0.036
2.778ThrPro: 2.778 ± 0.052
1.345ThrGln: 1.345 ± 0.04
3.019ThrArg: 3.019 ± 0.05
2.494ThrSer: 2.494 ± 0.058
2.107ThrThr: 2.107 ± 0.052
3.028ThrVal: 3.028 ± 0.065
0.456ThrTrp: 0.456 ± 0.022
1.088ThrTyr: 1.088 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
5.387ValAla: 5.387 ± 0.092
0.915ValCys: 0.915 ± 0.03
3.4ValAsp: 3.4 ± 0.056
3.984ValGlu: 3.984 ± 0.069
3.369ValPhe: 3.369 ± 0.057
3.773ValGly: 3.773 ± 0.081
1.494ValHis: 1.494 ± 0.038
3.54ValIle: 3.54 ± 0.052
2.898ValLys: 2.898 ± 0.055
7.103ValLeu: 7.103 ± 0.083
1.904ValMet: 1.904 ± 0.042
2.106ValAsn: 2.106 ± 0.044
2.627ValPro: 2.627 ± 0.047
2.101ValGln: 2.101 ± 0.042
4.172ValArg: 4.172 ± 0.067
4.316ValSer: 4.316 ± 0.065
3.159ValThr: 3.159 ± 0.072
4.17ValVal: 4.17 ± 0.076
0.712ValTrp: 0.712 ± 0.028
1.603ValTyr: 1.603 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
0.719TrpAla: 0.719 ± 0.029
0.122TrpCys: 0.122 ± 0.011
0.615TrpAsp: 0.615 ± 0.026
0.86TrpGlu: 0.86 ± 0.029
0.509TrpPhe: 0.509 ± 0.022
0.731TrpGly: 0.731 ± 0.026
0.267TrpHis: 0.267 ± 0.017
0.698TrpIle: 0.698 ± 0.03
0.613TrpLys: 0.613 ± 0.027
1.224TrpLeu: 1.224 ± 0.038
0.413TrpMet: 0.413 ± 0.017
0.405TrpAsn: 0.405 ± 0.019
0.497TrpPro: 0.497 ± 0.023
0.582TrpGln: 0.582 ± 0.02
0.607TrpArg: 0.607 ± 0.024
0.579TrpSer: 0.579 ± 0.025
0.509TrpThr: 0.509 ± 0.023
0.684TrpVal: 0.684 ± 0.024
0.131TrpTrp: 0.131 ± 0.013
0.322TrpTyr: 0.322 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.293TyrAla: 2.293 ± 0.051
0.3TyrCys: 0.3 ± 0.016
1.39TyrAsp: 1.39 ± 0.039
1.579TyrGlu: 1.579 ± 0.041
1.262TyrPhe: 1.262 ± 0.034
2.238TyrGly: 2.238 ± 0.047
0.671TyrHis: 0.671 ± 0.028
1.347TyrIle: 1.347 ± 0.032
0.999TyrLys: 0.999 ± 0.034
2.649TyrLeu: 2.649 ± 0.052
0.598TyrMet: 0.598 ± 0.023
0.831TyrAsn: 0.831 ± 0.029
1.334TyrPro: 1.334 ± 0.032
0.905TyrGln: 0.905 ± 0.028
1.993TyrArg: 1.993 ± 0.044
1.367TyrSer: 1.367 ± 0.034
1.372TyrThr: 1.372 ± 0.046
1.4TyrVal: 1.4 ± 0.039
0.308TyrTrp: 0.308 ± 0.018
0.848TyrTyr: 0.848 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.003XaaXaa: 0.003 ± 0.002
Statistics based on 3369 proteins (1138264 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski