Amino acid dipeptide frequency for Desulfovibrio salexigens (strain ATCC 14822 / DSM 2638 / NCIMB 8403 / VKM B-1763) (Maridesulfovibrio salexigens)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
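A minimal sketch of how such frequencies could be computed, assuming the protein sequences are available as one-letter strings. The function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from the Proteome-pI pipeline. The sketch counts overlapping adjacent pairs within each protein (never across protein boundaries) and maps any letter outside the 20 standard ones to X (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Count adjacent pairs within this protein only.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences): "B" is non-standard and becomes X.
freqs = dipeptide_per_mille(["MALLA", "MAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1))
```

The frequencies returned this way sum to 1000‰ over all observed dipeptides.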

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
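The uniform expectation can be checked with a short sketch. The exact rule used for colouring the table is not stated on this page, so the simple comparison against 1/400 below is an assumption, and the helper name `classify` is hypothetical. The four example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1 / 400 of all dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"LeuLeu": 9.451, "AlaAla: ".strip(": "): 8.109, "TrpCys": 0.131, "CysTrp": 0.137}
for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE  # observed / expected
    print(name, value, classify(value), round(ratio, 2))
```

A ratio above 1 corresponds to an overrepresented dipeptide and a ratio below 1 to an underrepresented one under this assumption.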

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.109 ± 0.112
AlaCys: 1.089 ± 0.031
AlaAsp: 4.776 ± 0.069
AlaGlu: 6.418 ± 0.081
AlaPhe: 3.164 ± 0.067
AlaGly: 6.915 ± 0.084
AlaHis: 1.379 ± 0.028
AlaIle: 5.033 ± 0.072
AlaLys: 4.817 ± 0.063
AlaLeu: 8.308 ± 0.09
AlaMet: 2.602 ± 0.045
AlaAsn: 2.506 ± 0.043
AlaPro: 2.766 ± 0.055
AlaGln: 2.788 ± 0.051
AlaArg: 3.93 ± 0.059
AlaSer: 4.484 ± 0.062
AlaThr: 3.863 ± 0.062
AlaVal: 6.454 ± 0.083
AlaTrp: 0.793 ± 0.03
AlaTyr: 2.204 ± 0.044
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.085 ± 0.034
CysCys: 0.263 ± 0.014
CysAsp: 0.647 ± 0.022
CysGlu: 0.762 ± 0.026
CysPhe: 0.536 ± 0.017
CysGly: 1.349 ± 0.041
CysHis: 0.369 ± 0.025
CysIle: 0.802 ± 0.025
CysLys: 0.654 ± 0.022
CysLeu: 1.217 ± 0.033
CysMet: 0.386 ± 0.018
CysAsn: 0.474 ± 0.019
CysPro: 0.815 ± 0.028
CysGln: 0.329 ± 0.018
CysArg: 0.698 ± 0.022
CysSer: 0.993 ± 0.031
CysThr: 0.65 ± 0.023
CysVal: 0.835 ± 0.026
CysTrp: 0.137 ± 0.011
CysTyr: 0.354 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.93 ± 0.066
AspCys: 0.698 ± 0.026
AspAsp: 3.042 ± 0.066
AspGlu: 4.129 ± 0.06
AspPhe: 2.921 ± 0.059
AspGly: 4.106 ± 0.07
AspHis: 1.043 ± 0.03
AspIle: 4.004 ± 0.065
AspLys: 3.541 ± 0.058
AspLeu: 5.733 ± 0.078
AspMet: 1.734 ± 0.037
AspAsn: 2.249 ± 0.048
AspPro: 2.55 ± 0.045
AspGln: 1.681 ± 0.038
AspArg: 2.746 ± 0.048
AspSer: 3.508 ± 0.065
AspThr: 2.45 ± 0.05
AspVal: 3.725 ± 0.064
AspTrp: 0.717 ± 0.022
AspTyr: 1.956 ± 0.035
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.783 ± 0.084
GluCys: 0.764 ± 0.026
GluAsp: 3.872 ± 0.062
GluGlu: 5.435 ± 0.084
GluPhe: 2.797 ± 0.048
GluGly: 4.471 ± 0.074
GluHis: 1.354 ± 0.032
GluIle: 5.082 ± 0.067
GluLys: 5.24 ± 0.077
GluLeu: 7.338 ± 0.077
GluMet: 2.179 ± 0.047
GluAsn: 3.101 ± 0.046
GluPro: 2.198 ± 0.06
GluGln: 2.838 ± 0.055
GluArg: 3.519 ± 0.071
GluSer: 3.957 ± 0.06
GluThr: 3.316 ± 0.053
GluVal: 4.824 ± 0.073
GluTrp: 0.676 ± 0.024
GluTyr: 2.103 ± 0.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.378 ± 0.05
PheCys: 0.689 ± 0.025
PheAsp: 2.537 ± 0.051
PheGlu: 2.69 ± 0.052
PhePhe: 2.198 ± 0.051
PheGly: 3.228 ± 0.055
PheHis: 0.774 ± 0.027
PheIle: 2.912 ± 0.052
PheLys: 2.72 ± 0.046
PheLeu: 4.114 ± 0.066
PheMet: 1.381 ± 0.034
PheAsn: 1.882 ± 0.042
PhePro: 1.765 ± 0.036
PheGln: 1.104 ± 0.031
PheArg: 1.904 ± 0.045
PheSer: 3.333 ± 0.054
PheThr: 2.368 ± 0.045
PheVal: 2.656 ± 0.051
PheTrp: 0.59 ± 0.024
PheTyr: 1.45 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.663 ± 0.088
GlyCys: 1.318 ± 0.04
GlyAsp: 3.697 ± 0.059
GlyGlu: 4.813 ± 0.065
GlyPhe: 3.6 ± 0.068
GlyGly: 5.523 ± 0.086
GlyHis: 1.527 ± 0.038
GlyIle: 5.22 ± 0.081
GlyLys: 5.279 ± 0.067
GlyLeu: 7.591 ± 0.094
GlyMet: 2.58 ± 0.054
GlyAsn: 2.796 ± 0.059
GlyPro: 2.344 ± 0.038
GlyGln: 2.474 ± 0.047
GlyArg: 3.689 ± 0.051
GlySer: 4.386 ± 0.07
GlyThr: 3.808 ± 0.055
GlyVal: 5.424 ± 0.067
GlyTrp: 0.897 ± 0.03
GlyTyr: 2.553 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.331 ± 0.032
HisCys: 0.278 ± 0.014
HisAsp: 1.052 ± 0.031
HisGlu: 1.286 ± 0.032
HisPhe: 0.878 ± 0.025
HisGly: 1.455 ± 0.033
HisHis: 0.463 ± 0.023
HisIle: 1.218 ± 0.031
HisLys: 1.051 ± 0.028
HisLeu: 1.751 ± 0.04
HisMet: 0.55 ± 0.021
HisAsn: 0.762 ± 0.029
HisPro: 1.044 ± 0.029
HisGln: 0.538 ± 0.019
HisArg: 0.876 ± 0.027
HisSer: 1.195 ± 0.028
HisThr: 0.866 ± 0.027
HisVal: 1.14 ± 0.032
HisTrp: 0.218 ± 0.013
HisTyr: 0.626 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.451 ± 0.074
IleCys: 0.951 ± 0.028
IleAsp: 3.663 ± 0.065
IleGlu: 4.517 ± 0.059
IlePhe: 2.973 ± 0.058
IleGly: 4.637 ± 0.075
IleHis: 1.172 ± 0.029
IleIle: 4.266 ± 0.08
IleLys: 4.022 ± 0.066
IleLeu: 6.373 ± 0.096
IleMet: 1.808 ± 0.037
IleAsn: 2.895 ± 0.049
IlePro: 3.188 ± 0.048
IleGln: 1.767 ± 0.039
IleArg: 3.104 ± 0.051
IleSer: 4.932 ± 0.067
IleThr: 3.405 ± 0.065
IleVal: 4.392 ± 0.068
IleTrp: 0.626 ± 0.023
IleTyr: 1.835 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.317 ± 0.072
LysCys: 0.664 ± 0.025
LysAsp: 3.753 ± 0.065
LysGlu: 4.53 ± 0.064
LysPhe: 2.316 ± 0.044
LysGly: 4.438 ± 0.061
LysHis: 1.118 ± 0.029
LysIle: 4.188 ± 0.059
LysLys: 4.593 ± 0.076
LysLeu: 5.736 ± 0.067
LysMet: 1.991 ± 0.047
LysAsn: 2.706 ± 0.049
LysPro: 2.485 ± 0.051
LysGln: 2.002 ± 0.041
LysArg: 3.004 ± 0.055
LysSer: 3.723 ± 0.058
LysThr: 3.139 ± 0.053
LysVal: 4.484 ± 0.058
LysTrp: 0.668 ± 0.024
LysTyr: 1.858 ± 0.041
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.758 ± 0.094
LeuCys: 1.356 ± 0.034
LeuAsp: 5.7 ± 0.073
LeuGlu: 6.811 ± 0.085
LeuPhe: 4.236 ± 0.075
LeuGly: 7.439 ± 0.081
LeuHis: 1.753 ± 0.039
LeuIle: 5.901 ± 0.074
LeuLys: 6.412 ± 0.07
LeuLeu: 9.451 ± 0.102
LeuMet: 2.54 ± 0.048
LeuAsn: 3.911 ± 0.063
LeuPro: 4.515 ± 0.067
LeuGln: 2.791 ± 0.046
LeuArg: 4.772 ± 0.064
LeuSer: 6.751 ± 0.084
LeuThr: 5.153 ± 0.064
LeuVal: 6.428 ± 0.084
LeuTrp: 0.932 ± 0.029
LeuTyr: 2.573 ± 0.054
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.688 ± 0.042
MetCys: 0.285 ± 0.017
MetAsp: 1.878 ± 0.039
MetGlu: 1.967 ± 0.042
MetPhe: 1.051 ± 0.031
MetGly: 2.269 ± 0.045
MetHis: 0.577 ± 0.019
MetIle: 1.743 ± 0.039
MetLys: 1.84 ± 0.036
MetLeu: 2.934 ± 0.056
MetMet: 0.714 ± 0.029
MetAsn: 1.327 ± 0.034
MetPro: 1.29 ± 0.031
MetGln: 1.02 ± 0.032
MetArg: 1.438 ± 0.035
MetSer: 2.074 ± 0.039
MetThr: 1.664 ± 0.034
MetVal: 2.107 ± 0.041
MetTrp: 0.21 ± 0.013
MetTyr: 0.596 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.788 ± 0.052
AsnCys: 0.558 ± 0.019
AsnAsp: 2.07 ± 0.053
AsnGlu: 2.371 ± 0.044
AsnPhe: 1.803 ± 0.04
AsnGly: 2.886 ± 0.045
AsnHis: 0.679 ± 0.026
AsnIle: 3.07 ± 0.051
AsnLys: 2.355 ± 0.04
AsnLeu: 3.874 ± 0.06
AsnMet: 1.213 ± 0.031
AsnAsn: 1.49 ± 0.039
AsnPro: 2.236 ± 0.044
AsnGln: 1.091 ± 0.031
AsnArg: 1.883 ± 0.045
AsnSer: 2.605 ± 0.051
AsnThr: 1.876 ± 0.041
AsnVal: 2.504 ± 0.043
AsnTrp: 0.489 ± 0.023
AsnTyr: 1.235 ± 0.031
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.479 ± 0.063
ProCys: 0.509 ± 0.022
ProAsp: 2.882 ± 0.048
ProGlu: 4.275 ± 0.073
ProPhe: 1.872 ± 0.037
ProGly: 3.226 ± 0.052
ProHis: 0.799 ± 0.028
ProIle: 2.195 ± 0.043
ProLys: 2.165 ± 0.046
ProLeu: 3.877 ± 0.06
ProMet: 1.028 ± 0.034
ProAsn: 1.3 ± 0.029
ProPro: 1.447 ± 0.035
ProGln: 1.455 ± 0.033
ProArg: 1.577 ± 0.041
ProSer: 2.288 ± 0.048
ProThr: 1.808 ± 0.04
ProVal: 3.599 ± 0.052
ProTrp: 0.49 ± 0.019
ProTyr: 1.323 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.921 ± 0.049
GlnCys: 0.345 ± 0.016
GlnAsp: 1.712 ± 0.034
GlnGlu: 2.231 ± 0.05
GlnPhe: 1.078 ± 0.033
GlnGly: 2.328 ± 0.047
GlnHis: 0.57 ± 0.023
GlnIle: 1.989 ± 0.041
GlnLys: 2.012 ± 0.041
GlnLeu: 2.936 ± 0.049
GlnMet: 0.91 ± 0.028
GlnAsn: 1.361 ± 0.033
GlnPro: 1.168 ± 0.036
GlnGln: 1.363 ± 0.045
GlnArg: 1.55 ± 0.035
GlnSer: 1.931 ± 0.045
GlnThr: 1.604 ± 0.04
GlnVal: 2.136 ± 0.045
GlnTrp: 0.324 ± 0.019
GlnTyr: 0.819 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.417 ± 0.061
ArgCys: 0.562 ± 0.022
ArgAsp: 2.652 ± 0.046
ArgGlu: 3.837 ± 0.062
ArgPhe: 2.109 ± 0.045
ArgGly: 2.8 ± 0.047
ArgHis: 0.914 ± 0.029
ArgIle: 3.692 ± 0.06
ArgLys: 3.67 ± 0.051
ArgLeu: 4.601 ± 0.066
ArgMet: 1.6 ± 0.041
ArgAsn: 2.17 ± 0.04
ArgPro: 1.798 ± 0.04
ArgGln: 1.581 ± 0.038
ArgArg: 2.455 ± 0.054
ArgSer: 2.86 ± 0.047
ArgThr: 2.39 ± 0.047
ArgVal: 3.068 ± 0.05
ArgTrp: 0.47 ± 0.02
ArgTyr: 1.458 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.052 ± 0.074
SerCys: 0.869 ± 0.026
SerAsp: 3.353 ± 0.061
SerGlu: 4.058 ± 0.061
SerPhe: 3.066 ± 0.05
SerGly: 6.031 ± 0.083
SerHis: 1.133 ± 0.032
SerIle: 4.352 ± 0.069
SerLys: 3.61 ± 0.049
SerLeu: 6.303 ± 0.071
SerMet: 2.016 ± 0.036
SerAsn: 2.177 ± 0.041
SerPro: 2.671 ± 0.046
SerGln: 1.773 ± 0.039
SerArg: 3.179 ± 0.057
SerSer: 4.31 ± 0.077
SerThr: 3.055 ± 0.053
SerVal: 4.348 ± 0.065
SerTrp: 0.733 ± 0.023
SerTyr: 1.866 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.46 ± 0.074
ThrCys: 0.589 ± 0.025
ThrAsp: 2.734 ± 0.052
ThrGlu: 3.157 ± 0.061
ThrPhe: 2.068 ± 0.042
ThrGly: 4.469 ± 0.072
ThrHis: 0.933 ± 0.027
ThrIle: 3.379 ± 0.054
ThrLys: 2.384 ± 0.05
ThrLeu: 4.935 ± 0.067
ThrMet: 1.338 ± 0.03
ThrAsn: 1.645 ± 0.04
ThrPro: 2.697 ± 0.055
ThrGln: 1.282 ± 0.03
ThrArg: 2.175 ± 0.043
ThrSer: 3.075 ± 0.051
ThrThr: 2.451 ± 0.056
ThrVal: 3.762 ± 0.062
ThrTrp: 0.433 ± 0.021
ThrTyr: 1.415 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.798 ± 0.079
ValCys: 0.992 ± 0.029
ValAsp: 4.062 ± 0.06
ValGlu: 4.952 ± 0.075
ValPhe: 3.037 ± 0.059
ValGly: 4.668 ± 0.074
ValHis: 1.234 ± 0.031
ValIle: 4.499 ± 0.063
ValLys: 3.908 ± 0.066
ValLeu: 7.066 ± 0.088
ValMet: 1.979 ± 0.041
ValAsn: 2.724 ± 0.047
ValPro: 2.859 ± 0.047
ValGln: 2.141 ± 0.043
ValArg: 3.566 ± 0.058
ValSer: 4.802 ± 0.064
ValThr: 3.47 ± 0.06
ValVal: 5.22 ± 0.078
ValTrp: 0.674 ± 0.024
ValTyr: 1.874 ± 0.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.812 ± 0.028
TrpCys: 0.131 ± 0.009
TrpAsp: 0.638 ± 0.022
TrpGlu: 0.663 ± 0.023
TrpPhe: 0.483 ± 0.02
TrpGly: 0.702 ± 0.025
TrpHis: 0.22 ± 0.015
TrpIle: 0.68 ± 0.023
TrpLys: 0.739 ± 0.026
TrpLeu: 1.074 ± 0.035
TrpMet: 0.369 ± 0.018
TrpAsn: 0.489 ± 0.023
TrpPro: 0.425 ± 0.019
TrpGln: 0.426 ± 0.021
TrpArg: 0.508 ± 0.019
TrpSer: 0.625 ± 0.025
TrpThr: 0.53 ± 0.024
TrpVal: 0.632 ± 0.022
TrpTrp: 0.177 ± 0.012
TrpTyr: 0.296 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.179 ± 0.043
TyrCys: 0.443 ± 0.019
TyrAsp: 1.741 ± 0.041
TyrGlu: 1.944 ± 0.044
TyrPhe: 1.489 ± 0.041
TyrGly: 2.243 ± 0.043
TyrHis: 0.57 ± 0.019
TyrIle: 1.754 ± 0.043
TyrLys: 1.699 ± 0.039
TyrLeu: 2.971 ± 0.046
TyrMet: 0.755 ± 0.025
TyrAsn: 1.108 ± 0.035
TyrPro: 1.334 ± 0.031
TyrGln: 0.842 ± 0.026
TyrArg: 1.532 ± 0.037
TyrSer: 2.125 ± 0.043
TyrThr: 1.516 ± 0.043
TyrVal: 1.785 ± 0.037
TyrTrp: 0.364 ± 0.019
TyrTyr: 1.031 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
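For readers who want to process the entries above programmatically, the following is a small parsing sketch. It assumes each entry contains the pattern "FirstSecond: value ± error" with three-letter residue codes; the names `ENTRY` and `parse_entry` are hypothetical. The pattern is searched anywhere in the line, so a leading copy of the value in front of the residue pair is tolerated.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value_per_mille, error) or None for other lines."""
    match = ENTRY.search(line)
    if match is None:
        return None  # e.g. a row label such as "Ala"
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


print(parse_entry("AlaAla: 8.109 ± 0.112"))
print(parse_entry("Ala"))
```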
Statistics based on 3807 proteins (1264146 amino acids)
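As a rough illustration of what the per mille values mean in absolute terms, the sketch below converts a frequency into an approximate dipeptide count. It assumes that a protein of length n contributes n - 1 overlapping dipeptides, so the total number of dipeptides is taken as amino acids minus proteins; this is an assumption, not something stated by the database, and the function name `approx_count` is hypothetical.

```python
N_PROTEINS = 3807
N_AMINO_ACIDS = 1264146

# Assumption: each protein of length n yields n - 1 overlapping dipeptides.
TOTAL_DIPEPTIDES = N_AMINO_ACIDS - N_PROTEINS


def approx_count(per_mille, total=TOTAL_DIPEPTIDES):
    """Convert a per mille frequency into an approximate number of occurrences."""
    fraction = per_mille * 1e-3  # per mille -> fraction
    return fraction * total


# LeuLeu is listed at 9.451 per mille in the table.
print(TOTAL_DIPEPTIDES, round(approx_count(9.451)))
```

Because the tabulated values are rounded to three decimals, counts recovered this way are only approximate.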

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
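A minimal sketch of a protein-level bootstrap, assuming that whole proteins are resampled with replacement and that the reported error is the standard deviation of the replicate estimates. The page does not state which spread measure is used, so that choice, the function names and the toy sequences are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the per mille frequency of `pair` over replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if pair is absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MALLA", "LLLL", "MAKK", "ALLA"]
print(bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between dipeptides of the same protein inside each replicate.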

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski