Amino acid dipepetide frequency for Desulfovibrio senegalensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.185AlaAla: 10.185 ± 0.13
1.443AlaCys: 1.443 ± 0.044
5.504AlaAsp: 5.504 ± 0.084
6.232AlaGlu: 6.232 ± 0.1
3.561AlaPhe: 3.561 ± 0.064
8.017AlaGly: 8.017 ± 0.104
2.067AlaHis: 2.067 ± 0.047
4.425AlaIle: 4.425 ± 0.073
3.986AlaLys: 3.986 ± 0.074
10.157AlaLeu: 10.157 ± 0.106
3.43AlaMet: 3.43 ± 0.066
2.629AlaAsn: 2.629 ± 0.058
3.695AlaPro: 3.695 ± 0.069
3.344AlaGln: 3.344 ± 0.063
6.307AlaArg: 6.307 ± 0.106
4.881AlaSer: 4.881 ± 0.077
4.335AlaThr: 4.335 ± 0.062
7.661AlaVal: 7.661 ± 0.086
1.098AlaTrp: 1.098 ± 0.032
2.332AlaTyr: 2.332 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
1.289CysAla: 1.289 ± 0.037
0.26CysCys: 0.26 ± 0.016
0.65CysAsp: 0.65 ± 0.025
0.726CysGlu: 0.726 ± 0.027
0.528CysPhe: 0.528 ± 0.023
1.396CysGly: 1.396 ± 0.05
0.446CysHis: 0.446 ± 0.032
0.696CysIle: 0.696 ± 0.024
0.466CysLys: 0.466 ± 0.02
1.34CysLeu: 1.34 ± 0.033
0.405CysMet: 0.405 ± 0.02
0.419CysAsn: 0.419 ± 0.018
0.92CysPro: 0.92 ± 0.032
0.311CysGln: 0.311 ± 0.018
0.982CysArg: 0.982 ± 0.032
0.926CysSer: 0.926 ± 0.034
0.738CysThr: 0.738 ± 0.025
1.071CysVal: 1.071 ± 0.04
0.158CysTrp: 0.158 ± 0.012
0.329CysTyr: 0.329 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
5.148AspAla: 5.148 ± 0.066
0.828AspCys: 0.828 ± 0.029
3.273AspAsp: 3.273 ± 0.064
4.124AspGlu: 4.124 ± 0.07
2.693AspPhe: 2.693 ± 0.05
4.36AspGly: 4.36 ± 0.071
1.286AspHis: 1.286 ± 0.037
3.622AspIle: 3.622 ± 0.062
2.692AspLys: 2.692 ± 0.059
5.484AspLeu: 5.484 ± 0.074
2.205AspMet: 2.205 ± 0.053
1.828AspAsn: 1.828 ± 0.047
2.957AspPro: 2.957 ± 0.053
1.659AspGln: 1.659 ± 0.041
3.503AspArg: 3.503 ± 0.054
2.993AspSer: 2.993 ± 0.056
2.851AspThr: 2.851 ± 0.052
4.277AspVal: 4.277 ± 0.079
0.793AspTrp: 0.793 ± 0.03
1.704AspTyr: 1.704 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
6.249GluAla: 6.249 ± 0.094
0.672GluCys: 0.672 ± 0.024
3.515GluAsp: 3.515 ± 0.066
4.261GluGlu: 4.261 ± 0.088
2.36GluPhe: 2.36 ± 0.057
3.872GluGly: 3.872 ± 0.077
1.58GluHis: 1.58 ± 0.039
3.691GluIle: 3.691 ± 0.073
3.741GluLys: 3.741 ± 0.07
6.709GluLeu: 6.709 ± 0.09
2.001GluMet: 2.001 ± 0.052
2.577GluAsn: 2.577 ± 0.049
2.543GluPro: 2.543 ± 0.056
3.165GluGln: 3.165 ± 0.065
4.574GluArg: 4.574 ± 0.081
3.513GluSer: 3.513 ± 0.055
3.35GluThr: 3.35 ± 0.065
4.174GluVal: 4.174 ± 0.076
0.64GluTrp: 0.64 ± 0.029
1.844GluTyr: 1.844 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.585PheAla: 3.585 ± 0.074
0.713PheCys: 0.713 ± 0.026
2.621PheAsp: 2.621 ± 0.051
2.481PheGlu: 2.481 ± 0.049
1.909PhePhe: 1.909 ± 0.051
3.318PheGly: 3.318 ± 0.057
0.863PheHis: 0.863 ± 0.033
2.04PheIle: 2.04 ± 0.046
1.646PheLys: 1.646 ± 0.039
4.097PheLeu: 4.097 ± 0.067
1.419PheMet: 1.419 ± 0.041
1.341PheAsn: 1.341 ± 0.033
1.671PhePro: 1.671 ± 0.039
0.995PheGln: 0.995 ± 0.035
2.338PheArg: 2.338 ± 0.05
2.816PheSer: 2.816 ± 0.058
2.127PheThr: 2.127 ± 0.049
2.981PheVal: 2.981 ± 0.06
0.585PheTrp: 0.585 ± 0.024
1.128PheTyr: 1.128 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
6.646GlyAla: 6.646 ± 0.094
1.329GlyCys: 1.329 ± 0.043
4.056GlyAsp: 4.056 ± 0.061
4.688GlyGlu: 4.688 ± 0.066
3.642GlyPhe: 3.642 ± 0.06
5.726GlyGly: 5.726 ± 0.102
1.88GlyHis: 1.88 ± 0.038
4.553GlyIle: 4.553 ± 0.073
4.227GlyLys: 4.227 ± 0.072
8.376GlyLeu: 8.376 ± 0.1
3.026GlyMet: 3.026 ± 0.062
2.494GlyAsn: 2.494 ± 0.051
2.733GlyPro: 2.733 ± 0.051
2.845GlyGln: 2.845 ± 0.06
4.913GlyArg: 4.913 ± 0.078
4.333GlySer: 4.333 ± 0.069
4.345GlyThr: 4.345 ± 0.087
6.287GlyVal: 6.287 ± 0.082
1.037GlyTrp: 1.037 ± 0.032
2.325GlyTyr: 2.325 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
2.119HisAla: 2.119 ± 0.048
0.383HisCys: 0.383 ± 0.022
1.417HisAsp: 1.417 ± 0.041
1.494HisGlu: 1.494 ± 0.034
0.972HisPhe: 0.972 ± 0.031
1.97HisGly: 1.97 ± 0.051
0.586HisHis: 0.586 ± 0.026
1.099HisIle: 1.099 ± 0.034
0.871HisLys: 0.871 ± 0.033
2.037HisLeu: 2.037 ± 0.043
0.697HisMet: 0.697 ± 0.025
0.643HisAsn: 0.643 ± 0.025
1.265HisPro: 1.265 ± 0.037
0.648HisGln: 0.648 ± 0.024
1.194HisArg: 1.194 ± 0.036
1.1HisSer: 1.1 ± 0.037
1.091HisThr: 1.091 ± 0.034
1.556HisVal: 1.556 ± 0.035
0.307HisTrp: 0.307 ± 0.018
0.645HisTyr: 0.645 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
4.889IleAla: 4.889 ± 0.096
0.764IleCys: 0.764 ± 0.028
2.738IleAsp: 2.738 ± 0.054
3.118IleGlu: 3.118 ± 0.067
2.078IlePhe: 2.078 ± 0.05
3.883IleGly: 3.883 ± 0.073
1.114IleHis: 1.114 ± 0.033
2.947IleIle: 2.947 ± 0.063
2.304IleLys: 2.304 ± 0.054
5.099IleLeu: 5.099 ± 0.077
1.667IleMet: 1.667 ± 0.045
1.832IleAsn: 1.832 ± 0.043
2.815IlePro: 2.815 ± 0.056
1.462IleGln: 1.462 ± 0.04
3.543IleArg: 3.543 ± 0.069
3.21IleSer: 3.21 ± 0.061
2.729IleThr: 2.729 ± 0.056
3.869IleVal: 3.869 ± 0.068
0.565IleTrp: 0.565 ± 0.025
1.311IleTyr: 1.311 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
4.689LysAla: 4.689 ± 0.082
0.465LysCys: 0.465 ± 0.021
2.749LysAsp: 2.749 ± 0.056
2.828LysGlu: 2.828 ± 0.063
1.309LysPhe: 1.309 ± 0.038
3.585LysGly: 3.585 ± 0.066
0.942LysHis: 0.942 ± 0.033
2.452LysIle: 2.452 ± 0.052
3.115LysLys: 3.115 ± 0.065
4.039LysLeu: 4.039 ± 0.069
1.306LysMet: 1.306 ± 0.036
1.781LysAsn: 1.781 ± 0.047
2.179LysPro: 2.179 ± 0.056
1.704LysGln: 1.704 ± 0.047
3.068LysArg: 3.068 ± 0.068
2.388LysSer: 2.388 ± 0.054
2.708LysThr: 2.708 ± 0.055
3.079LysVal: 3.079 ± 0.066
0.446LysTrp: 0.446 ± 0.021
1.251LysTyr: 1.251 ± 0.036
0.0LysXaa: 0.0 ± 0.0
Leu
10.646LeuAla: 10.646 ± 0.112
1.52LeuCys: 1.52 ± 0.039
6.283LeuAsp: 6.283 ± 0.095
6.863LeuGlu: 6.863 ± 0.091
4.282LeuPhe: 4.282 ± 0.084
8.287LeuGly: 8.287 ± 0.11
2.116LeuHis: 2.116 ± 0.043
4.166LeuIle: 4.166 ± 0.07
4.582LeuLys: 4.582 ± 0.073
10.05LeuLeu: 10.05 ± 0.123
2.654LeuMet: 2.654 ± 0.057
3.382LeuAsn: 3.382 ± 0.055
5.01LeuPro: 5.01 ± 0.075
3.096LeuGln: 3.096 ± 0.056
6.124LeuArg: 6.124 ± 0.087
6.06LeuSer: 6.06 ± 0.1
4.989LeuThr: 4.989 ± 0.075
7.388LeuVal: 7.388 ± 0.095
1.059LeuTrp: 1.059 ± 0.042
2.279LeuTyr: 2.279 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
3.497MetAla: 3.497 ± 0.05
0.287MetCys: 0.287 ± 0.016
2.2MetAsp: 2.2 ± 0.05
2.069MetGlu: 2.069 ± 0.045
1.045MetPhe: 1.045 ± 0.035
2.657MetGly: 2.657 ± 0.056
0.705MetHis: 0.705 ± 0.028
1.408MetIle: 1.408 ± 0.038
1.475MetLys: 1.475 ± 0.044
3.068MetLeu: 3.068 ± 0.057
0.642MetMet: 0.642 ± 0.031
1.275MetAsn: 1.275 ± 0.038
1.567MetPro: 1.567 ± 0.043
1.214MetGln: 1.214 ± 0.038
1.93MetArg: 1.93 ± 0.046
1.843MetSer: 1.843 ± 0.048
1.836MetThr: 1.836 ± 0.04
2.296MetVal: 2.296 ± 0.052
0.233MetTrp: 0.233 ± 0.015
0.554MetTyr: 0.554 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.025AsnAla: 3.025 ± 0.056
0.466AsnCys: 0.466 ± 0.022
1.795AsnAsp: 1.795 ± 0.04
1.895AsnGlu: 1.895 ± 0.047
1.205AsnPhe: 1.205 ± 0.035
2.603AsnGly: 2.603 ± 0.057
0.66AsnHis: 0.66 ± 0.028
1.992AsnIle: 1.992 ± 0.044
1.463AsnLys: 1.463 ± 0.038
3.187AsnLeu: 3.187 ± 0.056
1.088AsnMet: 1.088 ± 0.036
1.101AsnAsn: 1.101 ± 0.037
2.041AsnPro: 2.041 ± 0.045
0.927AsnGln: 0.927 ± 0.032
2.173AsnArg: 2.173 ± 0.056
1.639AsnSer: 1.639 ± 0.043
1.756AsnThr: 1.756 ± 0.049
2.463AsnVal: 2.463 ± 0.048
0.435AsnTrp: 0.435 ± 0.019
0.869AsnTyr: 0.869 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
4.311ProAla: 4.311 ± 0.09
0.578ProCys: 0.578 ± 0.026
3.48ProAsp: 3.48 ± 0.064
4.261ProGlu: 4.261 ± 0.076
1.828ProPhe: 1.828 ± 0.043
4.264ProGly: 4.264 ± 0.072
1.033ProHis: 1.033 ± 0.032
1.7ProIle: 1.7 ± 0.041
1.929ProLys: 1.929 ± 0.048
4.375ProLeu: 4.375 ± 0.066
1.273ProMet: 1.273 ± 0.034
1.23ProAsn: 1.23 ± 0.035
1.716ProPro: 1.716 ± 0.046
1.575ProGln: 1.575 ± 0.039
2.18ProArg: 2.18 ± 0.052
2.276ProSer: 2.276 ± 0.042
1.869ProThr: 1.869 ± 0.055
4.128ProVal: 4.128 ± 0.07
0.628ProTrp: 0.628 ± 0.026
1.205ProTyr: 1.205 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
3.746GlnAla: 3.746 ± 0.066
0.475GlnCys: 0.475 ± 0.026
1.94GlnAsp: 1.94 ± 0.044
2.199GlnGlu: 2.199 ± 0.047
1.12GlnPhe: 1.12 ± 0.036
2.881GlnGly: 2.881 ± 0.052
0.665GlnHis: 0.665 ± 0.026
1.564GlnIle: 1.564 ± 0.042
1.626GlnLys: 1.626 ± 0.042
2.96GlnLeu: 2.96 ± 0.055
0.99GlnMet: 0.99 ± 0.029
1.155GlnAsn: 1.155 ± 0.041
1.335GlnPro: 1.335 ± 0.033
1.443GlnGln: 1.443 ± 0.042
2.164GlnArg: 2.164 ± 0.051
1.858GlnSer: 1.858 ± 0.048
1.793GlnThr: 1.793 ± 0.047
2.305GlnVal: 2.305 ± 0.05
0.474GlnTrp: 0.474 ± 0.02
0.859GlnTyr: 0.859 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
5.281ArgAla: 5.281 ± 0.083
0.815ArgCys: 0.815 ± 0.029
3.563ArgAsp: 3.563 ± 0.067
4.815ArgGlu: 4.815 ± 0.083
2.922ArgPhe: 2.922 ± 0.058
3.823ArgGly: 3.823 ± 0.073
1.516ArgHis: 1.516 ± 0.04
3.88ArgIle: 3.88 ± 0.062
3.335ArgLys: 3.335 ± 0.067
6.466ArgLeu: 6.466 ± 0.093
2.139ArgMet: 2.139 ± 0.045
2.188ArgAsn: 2.188 ± 0.049
2.708ArgPro: 2.708 ± 0.058
2.396ArgGln: 2.396 ± 0.058
3.893ArgArg: 3.893 ± 0.081
3.264ArgSer: 3.264 ± 0.057
3.3ArgThr: 3.3 ± 0.058
4.269ArgVal: 4.269 ± 0.063
0.647ArgTrp: 0.647 ± 0.027
1.703ArgTyr: 1.703 ± 0.044
0.0ArgXaa: 0.0 ± 0.0
Ser
4.68SerAla: 4.68 ± 0.069
0.766SerCys: 0.766 ± 0.029
2.862SerAsp: 2.862 ± 0.051
3.261SerGlu: 3.261 ± 0.056
2.383SerPhe: 2.383 ± 0.054
5.505SerGly: 5.505 ± 0.076
1.172SerHis: 1.172 ± 0.036
3.135SerIle: 3.135 ± 0.063
2.187SerLys: 2.187 ± 0.051
5.958SerLeu: 5.958 ± 0.089
2.017SerMet: 2.017 ± 0.041
1.569SerAsn: 1.569 ± 0.046
2.559SerPro: 2.559 ± 0.052
1.628SerGln: 1.628 ± 0.042
3.557SerArg: 3.557 ± 0.066
3.172SerSer: 3.172 ± 0.067
2.637SerThr: 2.637 ± 0.053
4.323SerVal: 4.323 ± 0.063
0.7SerTrp: 0.7 ± 0.028
1.421SerTyr: 1.421 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.779ThrAla: 4.779 ± 0.069
0.659ThrCys: 0.659 ± 0.026
2.839ThrAsp: 2.839 ± 0.057
2.732ThrGlu: 2.732 ± 0.047
1.92ThrPhe: 1.92 ± 0.047
4.828ThrGly: 4.828 ± 0.068
1.045ThrHis: 1.045 ± 0.036
2.884ThrIle: 2.884 ± 0.069
1.856ThrLys: 1.856 ± 0.05
5.494ThrLeu: 5.494 ± 0.075
1.667ThrMet: 1.667 ± 0.042
1.486ThrAsn: 1.486 ± 0.043
2.869ThrPro: 2.869 ± 0.064
1.411ThrGln: 1.411 ± 0.037
2.873ThrArg: 2.873 ± 0.051
2.517ThrSer: 2.517 ± 0.061
2.704ThrThr: 2.704 ± 0.069
4.362ThrVal: 4.362 ± 0.063
0.546ThrTrp: 0.546 ± 0.025
1.343ThrTyr: 1.343 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
7.207ValAla: 7.207 ± 0.101
1.108ValCys: 1.108 ± 0.036
4.602ValAsp: 4.602 ± 0.065
4.698ValGlu: 4.698 ± 0.076
3.301ValPhe: 3.301 ± 0.062
5.42ValGly: 5.42 ± 0.083
1.587ValHis: 1.587 ± 0.043
3.966ValIle: 3.966 ± 0.071
2.843ValLys: 2.843 ± 0.06
8.006ValLeu: 8.006 ± 0.098
2.111ValMet: 2.111 ± 0.049
2.545ValAsn: 2.545 ± 0.046
3.314ValPro: 3.314 ± 0.065
2.459ValGln: 2.459 ± 0.052
5.166ValArg: 5.166 ± 0.092
4.555ValSer: 4.555 ± 0.077
3.695ValThr: 3.695 ± 0.061
6.19ValVal: 6.19 ± 0.098
0.777ValTrp: 0.777 ± 0.029
1.757ValTyr: 1.757 ± 0.045
0.0ValXaa: 0.0 ± 0.0
Trp
1.012TrpAla: 1.012 ± 0.034
0.141TrpCys: 0.141 ± 0.011
0.668TrpAsp: 0.668 ± 0.024
0.697TrpGlu: 0.697 ± 0.028
0.492TrpPhe: 0.492 ± 0.022
0.887TrpGly: 0.887 ± 0.034
0.257TrpHis: 0.257 ± 0.014
0.564TrpIle: 0.564 ± 0.025
0.653TrpLys: 0.653 ± 0.024
1.322TrpLeu: 1.322 ± 0.041
0.309TrpMet: 0.309 ± 0.017
0.451TrpAsn: 0.451 ± 0.021
0.613TrpPro: 0.613 ± 0.027
0.511TrpGln: 0.511 ± 0.023
0.717TrpArg: 0.717 ± 0.028
0.528TrpSer: 0.528 ± 0.024
0.656TrpThr: 0.656 ± 0.028
0.766TrpVal: 0.766 ± 0.031
0.196TrpTrp: 0.196 ± 0.016
0.272TrpTyr: 0.272 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.309TyrAla: 2.309 ± 0.048
0.382TyrCys: 0.382 ± 0.021
1.527TyrAsp: 1.527 ± 0.039
1.587TyrGlu: 1.587 ± 0.039
1.123TyrPhe: 1.123 ± 0.034
2.178TyrGly: 2.178 ± 0.046
0.576TyrHis: 0.576 ± 0.02
1.151TyrIle: 1.151 ± 0.037
1.085TyrLys: 1.085 ± 0.034
2.612TyrLeu: 2.612 ± 0.049
0.755TyrMet: 0.755 ± 0.025
0.859TyrAsn: 0.859 ± 0.03
1.328TyrPro: 1.328 ± 0.039
0.761TyrGln: 0.761 ± 0.027
1.727TyrArg: 1.727 ± 0.04
1.619TyrSer: 1.619 ± 0.039
1.302TyrThr: 1.302 ± 0.036
1.866TyrVal: 1.866 ± 0.052
0.386TyrTrp: 0.386 ± 0.022
0.899TyrTyr: 0.899 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3062 proteins (1003229 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski