Amino acid dipeptide frequency for Parapedobacter composti

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all 20 amino acids equally common, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
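
As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping residue pairs within each protein and compares the result with the uniform expectation of 2.5‰. The function names, the mapping of non-standard letters to X (Xaa), and the choice not to count pairs across protein boundaries are assumptions made for this example; the page does not state how Proteome-pI performs the count.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over 400 dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(permille):
    """Label a value relative to the uniform 2.5 per mille expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy input, not real proteome data.
    freqs = dipeptide_permille(["MKLLA", "MALLK"])
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

Multiplying any returned value by 10⁻³ gives the plain fraction, so the frequencies of all observed dipeptides sum to 1000‰.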

For more information, see the sequence space article on Wikipedia.

Residue order (first residue per block, second residue per line): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.174 ± 0.102
AlaCys: 0.763 ± 0.024
AlaAsp: 5.303 ± 0.061
AlaGlu: 5.137 ± 0.069
AlaPhe: 3.885 ± 0.054
AlaGly: 6.675 ± 0.079
AlaHis: 1.615 ± 0.038
AlaIle: 5.393 ± 0.073
AlaLys: 3.942 ± 0.062
AlaLeu: 8.162 ± 0.088
AlaMet: 2.026 ± 0.04
AlaAsn: 3.674 ± 0.057
AlaPro: 2.901 ± 0.047
AlaGln: 3.162 ± 0.047
AlaArg: 3.938 ± 0.063
AlaSer: 4.764 ± 0.055
AlaThr: 4.431 ± 0.078
AlaVal: 6.252 ± 0.074
AlaTrp: 1.124 ± 0.031
AlaTyr: 3.4 ± 0.055
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.543 ± 0.021
CysCys: 0.149 ± 0.011
CysAsp: 0.357 ± 0.016
CysGlu: 0.371 ± 0.022
CysPhe: 0.369 ± 0.016
CysGly: 0.599 ± 0.023
CysHis: 0.209 ± 0.013
CysIle: 0.497 ± 0.02
CysLys: 0.319 ± 0.016
CysLeu: 0.749 ± 0.023
CysMet: 0.209 ± 0.015
CysAsn: 0.331 ± 0.015
CysPro: 0.3 ± 0.016
CysGln: 0.21 ± 0.012
CysArg: 0.452 ± 0.017
CysSer: 0.479 ± 0.02
CysThr: 0.426 ± 0.02
CysVal: 0.452 ± 0.017
CysTrp: 0.099 ± 0.008
CysTyr: 0.328 ± 0.015
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.563 ± 0.066
AspCys: 0.338 ± 0.018
AspAsp: 2.954 ± 0.05
AspGlu: 3.707 ± 0.058
AspPhe: 2.95 ± 0.045
AspGly: 4.808 ± 0.071
AspHis: 1.08 ± 0.03
AspIle: 3.894 ± 0.05
AspLys: 2.746 ± 0.046
AspLeu: 4.662 ± 0.058
AspMet: 1.317 ± 0.031
AspAsn: 2.526 ± 0.044
AspPro: 2.351 ± 0.043
AspGln: 1.592 ± 0.036
AspArg: 3.143 ± 0.055
AspSer: 2.916 ± 0.055
AspThr: 2.899 ± 0.051
AspVal: 3.809 ± 0.054
AspTrp: 0.89 ± 0.028
AspTyr: 2.624 ± 0.045
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.257 ± 0.071
GluCys: 0.335 ± 0.019
GluAsp: 2.681 ± 0.05
GluGlu: 3.751 ± 0.056
GluPhe: 2.076 ± 0.042
GluGly: 3.894 ± 0.053
GluHis: 1.388 ± 0.03
GluIle: 3.744 ± 0.055
GluLys: 3.488 ± 0.063
GluLeu: 5.94 ± 0.068
GluMet: 1.341 ± 0.033
GluAsn: 2.863 ± 0.053
GluPro: 2.049 ± 0.038
GluGln: 2.78 ± 0.054
GluArg: 3.856 ± 0.058
GluSer: 2.767 ± 0.047
GluThr: 3.254 ± 0.045
GluVal: 4.105 ± 0.053
GluTrp: 0.768 ± 0.021
GluTyr: 2.09 ± 0.039
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.513 ± 0.055
PheCys: 0.411 ± 0.018
PheAsp: 2.889 ± 0.041
PheGlu: 2.592 ± 0.045
PhePhe: 2.362 ± 0.048
PheGly: 3.67 ± 0.057
PheHis: 0.964 ± 0.029
PheIle: 2.747 ± 0.049
PheLys: 2.012 ± 0.042
PheLeu: 3.833 ± 0.064
PheMet: 1.081 ± 0.025
PheAsn: 2.498 ± 0.048
PhePro: 1.775 ± 0.041
PheGln: 1.4 ± 0.029
PheArg: 2.569 ± 0.038
PheSer: 3.344 ± 0.055
PheThr: 2.685 ± 0.043
PheVal: 2.868 ± 0.045
PheTrp: 0.589 ± 0.021
PheTyr: 1.903 ± 0.041
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.46 ± 0.064
GlyCys: 0.628 ± 0.022
GlyAsp: 3.905 ± 0.061
GlyGlu: 3.997 ± 0.059
GlyPhe: 3.612 ± 0.061
GlyGly: 5.859 ± 0.085
GlyHis: 1.513 ± 0.033
GlyIle: 5.379 ± 0.066
GlyLys: 4.369 ± 0.06
GlyLeu: 6.886 ± 0.089
GlyMet: 1.978 ± 0.04
GlyAsn: 3.826 ± 0.063
GlyPro: 1.956 ± 0.039
GlyGln: 2.785 ± 0.045
GlyArg: 4.169 ± 0.059
GlySer: 4.438 ± 0.065
GlyThr: 4.687 ± 0.072
GlyVal: 5.154 ± 0.073
GlyTrp: 1.191 ± 0.031
GlyTyr: 3.558 ± 0.06
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.734 ± 0.041
HisCys: 0.203 ± 0.013
HisAsp: 1.106 ± 0.027
HisGlu: 1.157 ± 0.028
HisPhe: 1.14 ± 0.032
HisGly: 1.687 ± 0.038
HisHis: 0.647 ± 0.022
HisIle: 1.509 ± 0.033
HisLys: 0.8 ± 0.023
HisLeu: 1.977 ± 0.049
HisMet: 0.416 ± 0.016
HisAsn: 0.933 ± 0.03
HisPro: 1.226 ± 0.029
HisGln: 0.88 ± 0.025
HisArg: 1.398 ± 0.032
HisSer: 1.063 ± 0.024
HisThr: 1.305 ± 0.034
HisVal: 1.398 ± 0.032
HisTrp: 0.349 ± 0.016
HisTyr: 1.073 ± 0.03
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.827 ± 0.077
IleCys: 0.547 ± 0.02
IleAsp: 3.944 ± 0.05
IleGlu: 3.476 ± 0.051
IlePhe: 2.331 ± 0.047
IleGly: 4.803 ± 0.068
IleHis: 1.396 ± 0.032
IleIle: 3.726 ± 0.053
IleLys: 3.101 ± 0.049
IleLeu: 4.888 ± 0.071
IleMet: 1.167 ± 0.03
IleAsn: 3.168 ± 0.053
IlePro: 3.14 ± 0.048
IleGln: 2.196 ± 0.046
IleArg: 3.876 ± 0.054
IleSer: 3.963 ± 0.059
IleThr: 3.879 ± 0.056
IleVal: 3.932 ± 0.054
IleTrp: 0.698 ± 0.024
IleTyr: 2.272 ± 0.045
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.316 ± 0.068
LysCys: 0.218 ± 0.011
LysAsp: 2.667 ± 0.046
LysGlu: 3.232 ± 0.058
LysPhe: 1.63 ± 0.033
LysGly: 3.551 ± 0.054
LysHis: 1.225 ± 0.033
LysIle: 2.959 ± 0.048
LysLys: 3.013 ± 0.054
LysLeu: 4.525 ± 0.06
LysMet: 1.168 ± 0.028
LysAsn: 2.284 ± 0.047
LysPro: 2.322 ± 0.043
LysGln: 2.209 ± 0.039
LysArg: 3.042 ± 0.046
LysSer: 2.631 ± 0.043
LysThr: 2.953 ± 0.049
LysVal: 3.174 ± 0.052
LysTrp: 0.608 ± 0.02
LysTyr: 1.924 ± 0.041
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.271 ± 0.082
LeuCys: 0.793 ± 0.025
LeuAsp: 5.203 ± 0.059
LeuGlu: 5.148 ± 0.062
LeuPhe: 4.471 ± 0.061
LeuGly: 6.238 ± 0.086
LeuHis: 2.03 ± 0.045
LeuIle: 5.32 ± 0.079
LeuLys: 5.018 ± 0.065
LeuLeu: 9.651 ± 0.112
LeuMet: 2.209 ± 0.042
LeuAsn: 4.494 ± 0.059
LeuPro: 4.376 ± 0.062
LeuGln: 3.58 ± 0.056
LeuArg: 5.178 ± 0.062
LeuSer: 6.252 ± 0.074
LeuThr: 5.36 ± 0.061
LeuVal: 6.022 ± 0.073
LeuTrp: 1.106 ± 0.026
LeuTyr: 3.463 ± 0.056
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.197 ± 0.041
MetCys: 0.129 ± 0.011
MetAsp: 1.348 ± 0.031
MetGlu: 1.572 ± 0.035
MetPhe: 0.741 ± 0.026
MetGly: 1.723 ± 0.039
MetHis: 0.448 ± 0.016
MetIle: 1.109 ± 0.029
MetLys: 1.548 ± 0.036
MetLeu: 2.118 ± 0.048
MetMet: 0.602 ± 0.021
MetAsn: 1.068 ± 0.029
MetPro: 1.113 ± 0.027
MetGln: 0.968 ± 0.026
MetArg: 1.303 ± 0.032
MetSer: 1.137 ± 0.032
MetThr: 1.166 ± 0.03
MetVal: 1.594 ± 0.044
MetTrp: 0.213 ± 0.012
MetTyr: 0.677 ± 0.022
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.798 ± 0.054
AsnCys: 0.26 ± 0.015
AsnAsp: 2.383 ± 0.045
AsnGlu: 2.623 ± 0.047
AsnPhe: 2.051 ± 0.042
AsnGly: 3.939 ± 0.061
AsnHis: 1.045 ± 0.03
AsnIle: 3.126 ± 0.048
AsnLys: 2.136 ± 0.044
AsnLeu: 4.081 ± 0.06
AsnMet: 1.03 ± 0.025
AsnAsn: 2.375 ± 0.047
AsnPro: 2.711 ± 0.048
AsnGln: 1.728 ± 0.037
AsnArg: 3.049 ± 0.055
AsnSer: 2.474 ± 0.05
AsnThr: 2.863 ± 0.042
AsnVal: 3.014 ± 0.054
AsnTrp: 0.682 ± 0.024
AsnTyr: 2.078 ± 0.044
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.638 ± 0.057
ProCys: 0.235 ± 0.012
ProAsp: 3.095 ± 0.05
ProGlu: 3.29 ± 0.051
ProPhe: 1.956 ± 0.037
ProGly: 3.297 ± 0.06
ProHis: 0.942 ± 0.026
ProIle: 2.408 ± 0.044
ProLys: 1.76 ± 0.035
ProLeu: 3.803 ± 0.05
ProMet: 0.916 ± 0.025
ProAsn: 1.912 ± 0.036
ProPro: 1.431 ± 0.043
ProGln: 1.618 ± 0.038
ProArg: 1.774 ± 0.038
ProSer: 2.353 ± 0.04
ProThr: 2.077 ± 0.038
ProVal: 3.396 ± 0.052
ProTrp: 0.515 ± 0.021
ProTyr: 1.779 ± 0.038
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.225 ± 0.05
GlnCys: 0.188 ± 0.012
GlnAsp: 1.621 ± 0.036
GlnGlu: 2.223 ± 0.044
GlnPhe: 1.573 ± 0.032
GlnGly: 2.498 ± 0.036
GlnHis: 1.099 ± 0.028
GlnIle: 1.929 ± 0.041
GlnLys: 1.621 ± 0.039
GlnLeu: 4.356 ± 0.069
GlnMet: 0.743 ± 0.022
GlnAsn: 1.448 ± 0.035
GlnPro: 1.758 ± 0.033
GlnGln: 2.27 ± 0.044
GlnArg: 2.58 ± 0.039
GlnSer: 2.047 ± 0.036
GlnThr: 2.04 ± 0.036
GlnVal: 2.572 ± 0.042
GlnTrp: 0.563 ± 0.021
GlnTyr: 1.586 ± 0.034
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.901 ± 0.064
ArgCys: 0.344 ± 0.018
ArgAsp: 2.796 ± 0.047
ArgGlu: 3.445 ± 0.053
ArgPhe: 3.069 ± 0.052
ArgGly: 3.383 ± 0.058
ArgHis: 1.286 ± 0.03
ArgIle: 3.961 ± 0.057
ArgLys: 3.163 ± 0.048
ArgLeu: 5.715 ± 0.063
ArgMet: 1.527 ± 0.039
ArgAsn: 2.829 ± 0.053
ArgPro: 2.182 ± 0.047
ArgGln: 2.383 ± 0.04
ArgArg: 2.994 ± 0.048
ArgSer: 2.851 ± 0.051
ArgThr: 2.894 ± 0.042
ArgVal: 3.61 ± 0.045
ArgTrp: 0.962 ± 0.027
ArgTyr: 2.91 ± 0.052
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.906 ± 0.066
SerCys: 0.543 ± 0.023
SerAsp: 2.986 ± 0.048
SerGlu: 2.906 ± 0.046
SerPhe: 2.861 ± 0.049
SerGly: 5.051 ± 0.069
SerHis: 1.225 ± 0.033
SerIle: 3.735 ± 0.054
SerLys: 2.468 ± 0.046
SerLeu: 5.566 ± 0.07
SerMet: 1.227 ± 0.035
SerAsn: 2.623 ± 0.05
SerPro: 2.55 ± 0.041
SerGln: 1.816 ± 0.036
SerArg: 3.239 ± 0.052
SerSer: 3.401 ± 0.056
SerThr: 3.207 ± 0.047
SerVal: 4.145 ± 0.059
SerTrp: 0.748 ± 0.024
SerTyr: 2.4 ± 0.047
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.317 ± 0.066
ThrCys: 0.353 ± 0.014
ThrAsp: 3.428 ± 0.048
ThrGlu: 3.032 ± 0.043
ThrPhe: 2.597 ± 0.042
ThrGly: 4.883 ± 0.064
ThrHis: 1.176 ± 0.028
ThrIle: 3.587 ± 0.05
ThrLys: 2.269 ± 0.04
ThrLeu: 5.363 ± 0.066
ThrMet: 1.068 ± 0.027
ThrAsn: 2.398 ± 0.044
ThrPro: 2.766 ± 0.048
ThrGln: 1.809 ± 0.034
ThrArg: 2.49 ± 0.044
ThrSer: 3.005 ± 0.045
ThrThr: 3.086 ± 0.062
ThrVal: 4.578 ± 0.091
ThrTrp: 0.755 ± 0.028
ThrTyr: 2.47 ± 0.051
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.787 ± 0.082
ValCys: 0.577 ± 0.022
ValAsp: 3.943 ± 0.053
ValGlu: 3.718 ± 0.057
ValPhe: 3.308 ± 0.053
ValGly: 4.558 ± 0.056
ValHis: 1.279 ± 0.035
ValIle: 4.267 ± 0.06
ValLys: 3.423 ± 0.055
ValLeu: 6.564 ± 0.073
ValMet: 1.512 ± 0.035
ValAsn: 3.394 ± 0.051
ValPro: 2.958 ± 0.047
ValGln: 2.245 ± 0.039
ValArg: 3.552 ± 0.052
ValSer: 4.624 ± 0.06
ValThr: 4.034 ± 0.083
ValVal: 5.16 ± 0.076
ValTrp: 0.838 ± 0.025
ValTyr: 2.805 ± 0.041
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.952 ± 0.026
TrpCys: 0.122 ± 0.009
TrpAsp: 0.76 ± 0.026
TrpGlu: 0.812 ± 0.023
TrpPhe: 0.599 ± 0.023
TrpGly: 1.043 ± 0.032
TrpHis: 0.352 ± 0.017
TrpIle: 0.784 ± 0.028
TrpLys: 0.737 ± 0.023
TrpLeu: 1.433 ± 0.034
TrpMet: 0.4 ± 0.017
TrpAsn: 0.685 ± 0.021
TrpPro: 0.448 ± 0.018
TrpGln: 0.613 ± 0.026
TrpArg: 0.765 ± 0.026
TrpSer: 0.734 ± 0.025
TrpThr: 0.654 ± 0.024
TrpVal: 0.845 ± 0.024
TrpTrp: 0.254 ± 0.013
TrpTyr: 0.534 ± 0.023
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.34 ± 0.05
TyrCys: 0.315 ± 0.015
TyrAsp: 2.398 ± 0.045
TyrGlu: 2.237 ± 0.044
TyrPhe: 2.123 ± 0.043
TyrGly: 3.187 ± 0.05
TyrHis: 1.082 ± 0.029
TyrIle: 2.258 ± 0.037
TyrLys: 1.717 ± 0.035
TyrLeu: 3.91 ± 0.056
TyrMet: 0.837 ± 0.024
TyrAsn: 2.118 ± 0.045
TyrPro: 1.884 ± 0.036
TyrGln: 1.696 ± 0.04
TyrArg: 2.838 ± 0.049
TyrSer: 2.36 ± 0.047
TyrThr: 2.53 ± 0.043
TyrVal: 2.467 ± 0.045
TyrTrp: 0.574 ± 0.022
TyrTyr: 1.879 ± 0.045
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3985 proteins (1421768 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
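
A minimal sketch of what a protein-level bootstrap could look like is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. Treating the reported error as the standard deviation of the replicates, as well as the function names and the fixed seed, are assumptions made for this example and are not confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs inside a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) in per mille over bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        if total == 0:
            continue  # replicate without any residue pairs
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total)
    if len(estimates) < 2:
        raise ValueError("not enough replicates with residue pairs")
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy input, not real proteome data.
    toy = ["MKLLA", "MALLK", "LLLLG", "GAVAG"]
    mean, err = bootstrap_permille(toy, "LL")
    print(f"LeuLeu: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, which is one plausible reading of "at the protein level".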

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski