Amino acid dipeptide frequency for Desulfotomaculum copahuensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
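As a minimal sketch of how such frequencies could be obtained, the Python snippet below counts overlapping adjacent residue pairs within each protein and expresses them in per mille. The exact counting procedure used for this table is not described here, so the function names (`dipeptide_permille`, `classify`), the folding of non-standard letters into `X`, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: pairs never span two proteins, and any non-standard
        # letter is folded into 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"   # corresponds to red in the table
    if freq_permille < UNIFORM_PERMILLE:
        return "under"  # corresponds to blue in the table
    return "expected"


toy = ["MAAGL", "MAL"]  # hypothetical toy sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
```

With this convention, a value such as AlaAla (14.664‰) would be labelled as overrepresented and CysCys (0.264‰) as underrepresented relative to the 2.5‰ uniform expectation.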

All values are given in per mille (‰); to obtain plain fractions, multiply them by 10⁻³.
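The unit conversion can be illustrated with a short Python sketch. The helper name `permille_to_fraction` is hypothetical; the AlaAla value is the one listed in the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala = 14.664                          # AlaAla entry from the table, in per mille
fraction = permille_to_fraction(ala_ala)  # about 0.014664
percent = fraction * 100                  # about 1.4664 %
```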

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry gives the dipeptide (first residue, then second residue), followed by its frequency ± error in ‰.
Ala
AlaAla: 14.664 ± 0.188
AlaCys: 1.427 ± 0.039
AlaAsp: 4.56 ± 0.071
AlaGlu: 5.805 ± 0.083
AlaPhe: 3.196 ± 0.06
AlaGly: 13.566 ± 0.167
AlaHis: 1.603 ± 0.037
AlaIle: 4.492 ± 0.078
AlaLys: 3.154 ± 0.059
AlaLeu: 11.191 ± 0.114
AlaMet: 2.633 ± 0.057
AlaAsn: 2.363 ± 0.061
AlaPro: 4.041 ± 0.066
AlaGln: 2.993 ± 0.063
AlaArg: 7.787 ± 0.097
AlaSer: 3.963 ± 0.063
AlaThr: 3.842 ± 0.074
AlaVal: 9.994 ± 0.098
AlaTrp: 1.015 ± 0.031
AlaTyr: 2.403 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.199 ± 0.038
CysCys: 0.264 ± 0.019
CysAsp: 0.54 ± 0.022
CysGlu: 0.472 ± 0.021
CysPhe: 0.464 ± 0.022
CysGly: 1.473 ± 0.042
CysHis: 0.279 ± 0.018
CysIle: 0.568 ± 0.027
CysLys: 0.345 ± 0.021
CysLeu: 1.313 ± 0.036
CysMet: 0.251 ± 0.015
CysAsn: 0.36 ± 0.022
CysPro: 0.908 ± 0.033
CysGln: 0.387 ± 0.019
CysArg: 1.425 ± 0.044
CysSer: 0.67 ± 0.031
CysThr: 0.628 ± 0.025
CysVal: 0.693 ± 0.031
CysTrp: 0.15 ± 0.012
CysTyr: 0.389 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.118 ± 0.067
AspCys: 0.632 ± 0.023
AspAsp: 2.01 ± 0.048
AspGlu: 3.015 ± 0.06
AspPhe: 1.936 ± 0.038
AspGly: 4.254 ± 0.08
AspHis: 0.78 ± 0.027
AspIle: 2.952 ± 0.057
AspLys: 1.91 ± 0.05
AspLeu: 5.115 ± 0.076
AspMet: 1.269 ± 0.041
AspAsn: 1.385 ± 0.04
AspPro: 2.696 ± 0.057
AspGln: 1.508 ± 0.041
AspArg: 3.44 ± 0.064
AspSer: 1.818 ± 0.041
AspThr: 2.053 ± 0.047
AspVal: 3.597 ± 0.06
AspTrp: 0.612 ± 0.028
AspTyr: 1.686 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.957 ± 0.085
GluCys: 0.545 ± 0.024
GluAsp: 2.478 ± 0.058
GluGlu: 4.679 ± 0.088
GluPhe: 1.849 ± 0.049
GluGly: 3.754 ± 0.066
GluHis: 1.229 ± 0.038
GluIle: 4.479 ± 0.064
GluLys: 4.355 ± 0.071
GluLeu: 6.704 ± 0.093
GluMet: 2.086 ± 0.05
GluAsn: 2.394 ± 0.05
GluPro: 2.381 ± 0.047
GluGln: 2.775 ± 0.06
GluArg: 4.184 ± 0.08
GluSer: 2.316 ± 0.053
GluThr: 2.983 ± 0.058
GluVal: 4.452 ± 0.075
GluTrp: 0.546 ± 0.025
GluTyr: 1.653 ± 0.046
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.254 ± 0.068
PheCys: 0.54 ± 0.024
PheAsp: 1.843 ± 0.048
PheGlu: 1.652 ± 0.045
PhePhe: 1.657 ± 0.05
PheGly: 2.932 ± 0.057
PheHis: 0.765 ± 0.032
PheIle: 2.212 ± 0.054
PheLys: 1.586 ± 0.043
PheLeu: 3.783 ± 0.064
PheMet: 0.942 ± 0.029
PheAsn: 1.451 ± 0.04
PhePro: 1.782 ± 0.041
PheGln: 1.162 ± 0.034
PheArg: 1.911 ± 0.044
PheSer: 2.166 ± 0.048
PheThr: 2.284 ± 0.047
PheVal: 2.39 ± 0.049
PheTrp: 0.465 ± 0.027
PheTyr: 1.299 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.206 ± 0.129
GlyCys: 1.352 ± 0.035
GlyAsp: 3.744 ± 0.062
GlyGlu: 5.572 ± 0.078
GlyPhe: 3.191 ± 0.063
GlyGly: 8.037 ± 0.144
GlyHis: 1.725 ± 0.043
GlyIle: 5.138 ± 0.073
GlyLys: 4.161 ± 0.061
GlyLeu: 9.579 ± 0.118
GlyMet: 2.54 ± 0.049
GlyAsn: 2.64 ± 0.057
GlyPro: 3.244 ± 0.06
GlyGln: 3.142 ± 0.062
GlyArg: 7.25 ± 0.093
GlySer: 4.079 ± 0.073
GlyThr: 4.516 ± 0.079
GlyVal: 6.958 ± 0.093
GlyTrp: 1.08 ± 0.038
GlyTyr: 2.818 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.494 ± 0.036
HisCys: 0.343 ± 0.019
HisAsp: 0.849 ± 0.032
HisGlu: 0.857 ± 0.031
HisPhe: 0.857 ± 0.032
HisGly: 1.668 ± 0.047
HisHis: 0.529 ± 0.026
HisIle: 0.996 ± 0.031
HisLys: 0.603 ± 0.023
HisLeu: 2.172 ± 0.051
HisMet: 0.488 ± 0.022
HisAsn: 0.624 ± 0.023
HisPro: 1.379 ± 0.042
HisGln: 0.61 ± 0.024
HisArg: 1.529 ± 0.039
HisSer: 0.884 ± 0.027
HisThr: 0.943 ± 0.031
HisVal: 1.179 ± 0.035
HisTrp: 0.257 ± 0.017
HisTyr: 0.682 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.189 ± 0.087
IleCys: 0.782 ± 0.032
IleAsp: 2.748 ± 0.056
IleGlu: 2.845 ± 0.064
IlePhe: 2.191 ± 0.05
IleGly: 4.063 ± 0.08
IleHis: 1.073 ± 0.037
IleIle: 3.634 ± 0.075
IleLys: 2.941 ± 0.064
IleLeu: 5.335 ± 0.087
IleMet: 1.527 ± 0.04
IleAsn: 2.482 ± 0.055
IlePro: 3.012 ± 0.055
IleGln: 1.599 ± 0.04
IleArg: 3.312 ± 0.067
IleSer: 3.257 ± 0.058
IleThr: 3.387 ± 0.063
IleVal: 3.562 ± 0.065
IleTrp: 0.568 ± 0.024
IleTyr: 1.685 ± 0.049
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.63 ± 0.065
LysCys: 0.432 ± 0.022
LysAsp: 2.105 ± 0.052
LysGlu: 3.462 ± 0.072
LysPhe: 1.27 ± 0.039
LysGly: 2.913 ± 0.057
LysHis: 0.714 ± 0.024
LysIle: 2.758 ± 0.053
LysLys: 2.718 ± 0.056
LysLeu: 3.949 ± 0.066
LysMet: 1.371 ± 0.035
LysAsn: 1.801 ± 0.043
LysPro: 2.016 ± 0.053
LysGln: 1.594 ± 0.045
LysArg: 2.325 ± 0.056
LysSer: 2.003 ± 0.038
LysThr: 2.451 ± 0.046
LysVal: 3.203 ± 0.063
LysTrp: 0.413 ± 0.021
LysTyr: 1.356 ± 0.035
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.213 ± 0.136
LeuCys: 1.195 ± 0.034
LeuAsp: 5.319 ± 0.073
LeuGlu: 6.405 ± 0.093
LeuPhe: 3.915 ± 0.072
LeuGly: 8.132 ± 0.111
LeuHis: 2.036 ± 0.045
LeuIle: 5.596 ± 0.095
LeuLys: 4.766 ± 0.076
LeuLeu: 11.464 ± 0.158
LeuMet: 2.282 ± 0.05
LeuAsn: 3.484 ± 0.066
LeuPro: 5.859 ± 0.082
LeuGln: 3.679 ± 0.058
LeuArg: 5.786 ± 0.081
LeuSer: 5.706 ± 0.091
LeuThr: 5.909 ± 0.078
LeuVal: 8.276 ± 0.116
LeuTrp: 0.974 ± 0.034
LeuTyr: 2.689 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.811 ± 0.055
MetCys: 0.262 ± 0.016
MetAsp: 1.332 ± 0.038
MetGlu: 1.72 ± 0.044
MetPhe: 0.804 ± 0.028
MetGly: 1.897 ± 0.049
MetHis: 0.518 ± 0.022
MetIle: 1.429 ± 0.041
MetLys: 1.202 ± 0.036
MetLeu: 2.814 ± 0.057
MetMet: 0.567 ± 0.025
MetAsn: 0.873 ± 0.03
MetPro: 1.449 ± 0.039
MetGln: 0.978 ± 0.032
MetArg: 1.5 ± 0.037
MetSer: 1.366 ± 0.041
MetThr: 1.287 ± 0.04
MetVal: 1.966 ± 0.046
MetTrp: 0.169 ± 0.014
MetTyr: 0.564 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.689 ± 0.054
AsnCys: 0.475 ± 0.023
AsnAsp: 1.247 ± 0.038
AsnGlu: 1.535 ± 0.043
AsnPhe: 1.224 ± 0.034
AsnGly: 2.744 ± 0.065
AsnHis: 0.625 ± 0.031
AsnIle: 2.159 ± 0.048
AsnLys: 1.347 ± 0.044
AsnLeu: 3.499 ± 0.062
AsnMet: 0.863 ± 0.033
AsnAsn: 1.167 ± 0.039
AsnPro: 2.111 ± 0.05
AsnGln: 1.108 ± 0.044
AsnArg: 2.348 ± 0.05
AsnSer: 1.494 ± 0.049
AsnThr: 1.582 ± 0.044
AsnVal: 2.239 ± 0.053
AsnTrp: 0.435 ± 0.021
AsnTyr: 1.066 ± 0.039
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.866 ± 0.115
ProCys: 0.493 ± 0.022
ProAsp: 2.974 ± 0.054
ProGlu: 3.974 ± 0.07
ProPhe: 1.743 ± 0.043
ProGly: 6.084 ± 0.088
ProHis: 0.872 ± 0.033
ProIle: 1.333 ± 0.04
ProLys: 1.228 ± 0.042
ProLeu: 4.788 ± 0.077
ProMet: 0.812 ± 0.029
ProAsn: 1.023 ± 0.035
ProPro: 2.718 ± 0.067
ProGln: 1.497 ± 0.042
ProArg: 2.867 ± 0.048
ProSer: 1.928 ± 0.046
ProThr: 1.552 ± 0.045
ProVal: 5.729 ± 0.086
ProTrp: 0.54 ± 0.022
ProTyr: 1.315 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.906 ± 0.075
GlnCys: 0.367 ± 0.023
GlnAsp: 1.466 ± 0.04
GlnGlu: 2.32 ± 0.059
GlnPhe: 1.106 ± 0.035
GlnGly: 2.363 ± 0.055
GlnHis: 0.561 ± 0.021
GlnIle: 1.91 ± 0.041
GlnLys: 1.841 ± 0.045
GlnLeu: 3.437 ± 0.061
GlnMet: 1.017 ± 0.032
GlnAsn: 1.146 ± 0.041
GlnPro: 1.655 ± 0.049
GlnGln: 1.561 ± 0.048
GlnArg: 1.933 ± 0.048
GlnSer: 1.518 ± 0.042
GlnThr: 1.698 ± 0.04
GlnVal: 3.185 ± 0.068
GlnTrp: 0.375 ± 0.024
GlnTyr: 0.968 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.189 ± 0.096
ArgCys: 0.84 ± 0.033
ArgAsp: 3.325 ± 0.061
ArgGlu: 5.898 ± 0.088
ArgPhe: 2.314 ± 0.049
ArgGly: 4.819 ± 0.081
ArgHis: 1.488 ± 0.041
ArgIle: 3.42 ± 0.054
ArgLys: 2.523 ± 0.05
ArgLeu: 7.939 ± 0.106
ArgMet: 1.665 ± 0.042
ArgAsn: 1.759 ± 0.039
ArgPro: 3.597 ± 0.063
ArgGln: 3.141 ± 0.058
ArgArg: 5.868 ± 0.111
ArgSer: 2.551 ± 0.051
ArgThr: 2.917 ± 0.049
ArgVal: 5.387 ± 0.076
ArgTrp: 0.84 ± 0.03
ArgTyr: 2.029 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.233 ± 0.076
SerCys: 0.658 ± 0.026
SerAsp: 1.867 ± 0.053
SerGlu: 2.08 ± 0.048
SerPhe: 1.982 ± 0.043
SerGly: 5.295 ± 0.075
SerHis: 0.824 ± 0.031
SerIle: 2.555 ± 0.052
SerLys: 1.514 ± 0.043
SerLeu: 5.236 ± 0.082
SerMet: 1.261 ± 0.034
SerAsn: 1.296 ± 0.037
SerPro: 2.665 ± 0.053
SerGln: 1.239 ± 0.039
SerArg: 3.632 ± 0.066
SerSer: 2.358 ± 0.064
SerThr: 2.315 ± 0.056
SerVal: 3.194 ± 0.061
SerTrp: 0.581 ± 0.028
SerTyr: 1.373 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.859 ± 0.095
ThrCys: 0.628 ± 0.025
ThrAsp: 2.1 ± 0.044
ThrGlu: 2.419 ± 0.054
ThrPhe: 1.68 ± 0.04
ThrGly: 6.713 ± 0.091
ThrHis: 0.855 ± 0.036
ThrIle: 2.65 ± 0.049
ThrLys: 1.315 ± 0.04
ThrLeu: 4.799 ± 0.078
ThrMet: 1.102 ± 0.034
ThrAsn: 1.327 ± 0.043
ThrPro: 2.714 ± 0.047
ThrGln: 1.075 ± 0.031
ThrArg: 2.969 ± 0.053
ThrSer: 2.115 ± 0.046
ThrThr: 2.464 ± 0.065
ThrVal: 4.754 ± 0.076
ThrTrp: 0.517 ± 0.026
ThrTyr: 1.193 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.91 ± 0.108
ValCys: 1.005 ± 0.035
ValAsp: 4.346 ± 0.07
ValGlu: 5.111 ± 0.08
ValPhe: 3.045 ± 0.06
ValGly: 4.87 ± 0.081
ValHis: 1.494 ± 0.042
ValIle: 4.828 ± 0.069
ValLys: 3.565 ± 0.059
ValLeu: 8.838 ± 0.101
ValMet: 1.972 ± 0.042
ValAsn: 2.936 ± 0.057
ValPro: 4.118 ± 0.07
ValGln: 2.579 ± 0.054
ValArg: 5.066 ± 0.082
ValSer: 4.074 ± 0.067
ValThr: 4.498 ± 0.07
ValVal: 6.611 ± 0.098
ValTrp: 0.729 ± 0.03
ValTyr: 2.293 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.863 ± 0.032
TrpCys: 0.118 ± 0.01
TrpAsp: 0.484 ± 0.022
TrpGlu: 0.696 ± 0.029
TrpPhe: 0.396 ± 0.02
TrpGly: 0.829 ± 0.033
TrpHis: 0.256 ± 0.015
TrpIle: 0.509 ± 0.022
TrpLys: 0.384 ± 0.021
TrpLeu: 1.409 ± 0.048
TrpMet: 0.268 ± 0.018
TrpAsn: 0.39 ± 0.023
TrpPro: 0.574 ± 0.023
TrpGln: 0.577 ± 0.023
TrpArg: 0.881 ± 0.033
TrpSer: 0.539 ± 0.031
TrpThr: 0.467 ± 0.024
TrpVal: 0.648 ± 0.022
TrpTrp: 0.178 ± 0.013
TrpTyr: 0.32 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.438 ± 0.051
TyrCys: 0.421 ± 0.019
TyrAsp: 1.449 ± 0.042
TyrGlu: 1.281 ± 0.036
TyrPhe: 1.255 ± 0.036
TyrGly: 2.566 ± 0.06
TyrHis: 0.719 ± 0.025
TyrIle: 1.575 ± 0.041
TyrLys: 1.025 ± 0.031
TyrLeu: 3.321 ± 0.054
TyrMet: 0.593 ± 0.026
TyrAsn: 1.054 ± 0.038
TyrPro: 1.419 ± 0.042
TyrGln: 1.132 ± 0.033
TyrArg: 2.495 ± 0.05
TyrSer: 1.352 ± 0.042
TyrThr: 1.475 ± 0.041
TyrVal: 1.872 ± 0.047
TyrTrp: 0.342 ± 0.02
TyrTyr: 1.078 ± 0.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3325 proteins (993877 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
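A protein-level bootstrap can be sketched roughly as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The source states only that 100 bootstrap replicates were drawn at the protein level; reporting the standard deviation, the fixed seed, and the names `pair_counts` and `bootstrap_error` are assumptions for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of a dipeptide frequency (per mille) over bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse in every replicate
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


toy = ["MAAGLAA", "MKAAL", "MGLV", "MAAAK"]  # hypothetical sequences, one-letter codes
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.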

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski