Amino acid dipeptide frequency for Erythrobacter longus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
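As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein, reports their frequencies in per mille, and labels a frequency relative to the uniform expectation of 1/400. The function names, the toy sequences, the mapping of non-standard letters to X, and the choice to count pairs only within a protein are assumptions made for this example, not a description of how Proteome-pI builds its table.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAALAG", "AAGL"]
freqs = dipeptide_per_mille(toy)
```

With this labelling, a value such as 14.372‰ would count as overrepresented and 0.089‰ as underrepresented; the thresholds actually used for the red and blue marking on the page are not stated in the text.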

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
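The unit conversion can be checked with a short Python sketch. It takes the AlaAla value from the table, converts it to a fraction, and compares it with the uniform expectation of 1/400; the helper name is hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(14.372)  # AlaAla entry from the table
uniform = 1 / 400                        # uniform expectation as a fraction
ratio = ala_ala / uniform                # enrichment relative to the uniform case
```

Here `ala_ala` is about 0.014372 (1.4372%), which is several times the uniform expectation of 0.0025.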

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.372 ± 0.17
AlaCys: 0.994 ± 0.032
AlaAsp: 6.733 ± 0.097
AlaGlu: 7.42 ± 0.112
AlaPhe: 4.528 ± 0.064
AlaGly: 10.218 ± 0.121
AlaHis: 2.056 ± 0.046
AlaIle: 6.584 ± 0.089
AlaLys: 4.916 ± 0.09
AlaLeu: 12.407 ± 0.15
AlaMet: 3.558 ± 0.062
AlaAsn: 3.653 ± 0.069
AlaPro: 5.4 ± 0.083
AlaGln: 4.856 ± 0.073
AlaArg: 7.901 ± 0.101
AlaSer: 7.367 ± 0.105
AlaThr: 5.729 ± 0.087
AlaVal: 7.539 ± 0.087
AlaTrp: 1.271 ± 0.041
AlaTyr: 2.364 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.997 ± 0.032
CysCys: 0.089 ± 0.009
CysAsp: 0.612 ± 0.027
CysGlu: 0.591 ± 0.027
CysPhe: 0.3 ± 0.017
CysGly: 0.832 ± 0.031
CysHis: 0.199 ± 0.022
CysIle: 0.349 ± 0.019
CysLys: 0.227 ± 0.016
CysLeu: 0.68 ± 0.025
CysMet: 0.153 ± 0.013
CysAsn: 0.251 ± 0.016
CysPro: 0.394 ± 0.022
CysGln: 0.197 ± 0.014
CysArg: 0.454 ± 0.021
CysSer: 0.486 ± 0.023
CysThr: 0.385 ± 0.021
CysVal: 0.595 ± 0.026
CysTrp: 0.106 ± 0.009
CysTyr: 0.171 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.355 ± 0.112
AspCys: 0.53 ± 0.026
AspAsp: 3.771 ± 0.112
AspGlu: 4.355 ± 0.083
AspPhe: 2.605 ± 0.06
AspGly: 5.875 ± 0.156
AspHis: 1.235 ± 0.039
AspIle: 3.185 ± 0.072
AspLys: 1.952 ± 0.049
AspLeu: 5.944 ± 0.09
AspMet: 1.39 ± 0.035
AspAsn: 1.801 ± 0.046
AspPro: 3.518 ± 0.057
AspGln: 1.99 ± 0.048
AspArg: 4.028 ± 0.065
AspSer: 2.318 ± 0.056
AspThr: 3.181 ± 0.089
AspVal: 4.175 ± 0.082
AspTrp: 1.125 ± 0.038
AspTyr: 1.602 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.585 ± 0.098
GluCys: 0.428 ± 0.019
GluAsp: 3.638 ± 0.072
GluGlu: 4.518 ± 0.09
GluPhe: 2.183 ± 0.048
GluGly: 5.436 ± 0.066
GluHis: 1.186 ± 0.041
GluIle: 3.603 ± 0.061
GluLys: 2.519 ± 0.06
GluLeu: 6.022 ± 0.091
GluMet: 1.766 ± 0.047
GluAsn: 1.997 ± 0.042
GluPro: 2.889 ± 0.06
GluGln: 2.433 ± 0.057
GluArg: 4.965 ± 0.091
GluSer: 2.63 ± 0.047
GluThr: 3.733 ± 0.067
GluVal: 4.228 ± 0.071
GluTrp: 0.932 ± 0.03
GluTyr: 1.259 ± 0.038
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.183 ± 0.078
PheCys: 0.32 ± 0.019
PheAsp: 3.036 ± 0.062
PheGlu: 2.707 ± 0.059
PhePhe: 1.533 ± 0.045
PheGly: 3.939 ± 0.07
PheHis: 0.618 ± 0.028
PheIle: 1.686 ± 0.039
PheLys: 1.068 ± 0.034
PheLeu: 3.281 ± 0.058
PheMet: 0.831 ± 0.03
PheAsn: 1.223 ± 0.039
PhePro: 1.54 ± 0.039
PheGln: 1.08 ± 0.03
PheArg: 2.004 ± 0.044
PheSer: 2.334 ± 0.051
PheThr: 2.205 ± 0.059
PheVal: 2.869 ± 0.052
PheTrp: 0.575 ± 0.026
PheTyr: 0.995 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.456 ± 0.105
GlyCys: 0.82 ± 0.033
GlyAsp: 5.438 ± 0.14
GlyGlu: 6.046 ± 0.086
GlyPhe: 3.906 ± 0.071
GlyGly: 8.11 ± 0.209
GlyHis: 1.583 ± 0.042
GlyIle: 4.446 ± 0.069
GlyLys: 3.48 ± 0.068
GlyLeu: 8.365 ± 0.104
GlyMet: 2.242 ± 0.05
GlyAsn: 2.843 ± 0.103
GlyPro: 3.39 ± 0.065
GlyGln: 3.022 ± 0.059
GlyArg: 5.168 ± 0.079
GlySer: 5.326 ± 0.132
GlyThr: 4.924 ± 0.119
GlyVal: 6.018 ± 0.083
GlyTrp: 1.407 ± 0.042
GlyTyr: 2.167 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.941 ± 0.046
HisCys: 0.252 ± 0.015
HisAsp: 1.099 ± 0.035
HisGlu: 1.135 ± 0.035
HisPhe: 0.846 ± 0.03
HisGly: 1.663 ± 0.048
HisHis: 0.46 ± 0.024
HisIle: 0.906 ± 0.03
HisLys: 0.569 ± 0.028
HisLeu: 1.665 ± 0.046
HisMet: 0.443 ± 0.02
HisAsn: 0.487 ± 0.021
HisPro: 1.083 ± 0.033
HisGln: 0.504 ± 0.022
HisArg: 1.162 ± 0.038
HisSer: 1.049 ± 0.033
HisThr: 0.799 ± 0.027
HisVal: 1.208 ± 0.037
HisTrp: 0.289 ± 0.016
HisTyr: 0.53 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.717 ± 0.104
IleCys: 0.467 ± 0.023
IleAsp: 3.928 ± 0.069
IleGlu: 4.204 ± 0.066
IlePhe: 1.74 ± 0.041
IleGly: 5.146 ± 0.078
IleHis: 0.857 ± 0.03
IleIle: 2.541 ± 0.058
IleLys: 1.547 ± 0.042
IleLeu: 3.999 ± 0.06
IleMet: 0.986 ± 0.036
IleAsn: 1.602 ± 0.047
IlePro: 2.297 ± 0.049
IleGln: 1.307 ± 0.04
IleArg: 2.802 ± 0.052
IleSer: 3.074 ± 0.052
IleThr: 3.046 ± 0.079
IleVal: 3.839 ± 0.062
IleTrp: 0.664 ± 0.028
IleTyr: 1.103 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.567 ± 0.09
LysCys: 0.189 ± 0.013
LysAsp: 1.89 ± 0.044
LysGlu: 1.935 ± 0.052
LysPhe: 0.992 ± 0.033
LysGly: 2.941 ± 0.057
LysHis: 0.707 ± 0.027
LysIle: 1.634 ± 0.047
LysLys: 1.474 ± 0.056
LysLeu: 3.587 ± 0.073
LysMet: 0.835 ± 0.032
LysAsn: 0.861 ± 0.026
LysPro: 2.058 ± 0.051
LysGln: 1.081 ± 0.036
LysArg: 2.566 ± 0.066
LysSer: 2.024 ± 0.055
LysThr: 1.89 ± 0.038
LysVal: 2.378 ± 0.055
LysTrp: 0.493 ± 0.02
LysTyr: 0.668 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.653 ± 0.138
LeuCys: 0.737 ± 0.026
LeuAsp: 5.882 ± 0.076
LeuGlu: 5.97 ± 0.083
LeuPhe: 3.644 ± 0.072
LeuGly: 8.225 ± 0.11
LeuHis: 1.548 ± 0.046
LeuIle: 5.043 ± 0.071
LeuLys: 3.267 ± 0.073
LeuLeu: 8.544 ± 0.136
LeuMet: 2.106 ± 0.05
LeuAsn: 2.638 ± 0.054
LeuPro: 4.965 ± 0.088
LeuGln: 2.652 ± 0.055
LeuArg: 5.865 ± 0.095
LeuSer: 6.435 ± 0.077
LeuThr: 5.545 ± 0.088
LeuVal: 6.667 ± 0.093
LeuTrp: 1.118 ± 0.037
LeuTyr: 1.885 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.958 ± 0.059
MetCys: 0.159 ± 0.013
MetAsp: 1.18 ± 0.04
MetGlu: 1.277 ± 0.035
MetPhe: 0.672 ± 0.026
MetGly: 1.952 ± 0.048
MetHis: 0.47 ± 0.018
MetIle: 1.377 ± 0.039
MetLys: 0.981 ± 0.033
MetLeu: 2.509 ± 0.055
MetMet: 0.731 ± 0.026
MetAsn: 0.796 ± 0.025
MetPro: 1.328 ± 0.035
MetGln: 0.838 ± 0.029
MetArg: 1.727 ± 0.045
MetSer: 1.576 ± 0.043
MetThr: 1.683 ± 0.04
MetVal: 1.726 ± 0.046
MetTrp: 0.212 ± 0.014
MetTyr: 0.286 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.686 ± 0.08
AsnCys: 0.281 ± 0.016
AsnAsp: 1.885 ± 0.092
AsnGlu: 1.639 ± 0.038
AsnPhe: 1.217 ± 0.038
AsnGly: 2.769 ± 0.083
AsnHis: 0.544 ± 0.023
AsnIle: 1.547 ± 0.041
AsnLys: 0.797 ± 0.03
AsnLeu: 2.698 ± 0.054
AsnMet: 0.656 ± 0.027
AsnAsn: 0.915 ± 0.043
AsnPro: 2.052 ± 0.045
AsnGln: 0.936 ± 0.035
AsnArg: 1.951 ± 0.039
AsnSer: 1.772 ± 0.051
AsnThr: 1.559 ± 0.048
AsnVal: 2.071 ± 0.051
AsnTrp: 0.495 ± 0.022
AsnTyr: 0.661 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.332 ± 0.091
ProCys: 0.314 ± 0.016
ProAsp: 3.625 ± 0.069
ProGlu: 3.913 ± 0.08
ProPhe: 1.965 ± 0.048
ProGly: 4.024 ± 0.065
ProHis: 0.89 ± 0.033
ProIle: 2.501 ± 0.049
ProLys: 1.741 ± 0.045
ProLeu: 4.362 ± 0.063
ProMet: 1.131 ± 0.032
ProAsn: 1.454 ± 0.037
ProPro: 2.193 ± 0.079
ProGln: 1.801 ± 0.044
ProArg: 2.487 ± 0.054
ProSer: 3.026 ± 0.057
ProThr: 2.416 ± 0.05
ProVal: 3.803 ± 0.06
ProTrp: 0.569 ± 0.023
ProTyr: 1.042 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.828 ± 0.065
GlnCys: 0.274 ± 0.017
GlnAsp: 1.79 ± 0.043
GlnGlu: 1.785 ± 0.044
GlnPhe: 1.361 ± 0.032
GlnGly: 2.753 ± 0.058
GlnHis: 0.581 ± 0.024
GlnIle: 2.007 ± 0.048
GlnLys: 1.057 ± 0.033
GlnLeu: 3.24 ± 0.055
GlnMet: 0.974 ± 0.031
GlnAsn: 1.034 ± 0.033
GlnPro: 1.576 ± 0.043
GlnGln: 1.186 ± 0.042
GlnArg: 2.268 ± 0.048
GlnSer: 2.298 ± 0.057
GlnThr: 1.945 ± 0.046
GlnVal: 2.379 ± 0.052
GlnTrp: 0.482 ± 0.02
GlnTyr: 0.731 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.935 ± 0.098
ArgCys: 0.408 ± 0.02
ArgAsp: 3.911 ± 0.073
ArgGlu: 4.494 ± 0.082
ArgPhe: 2.893 ± 0.064
ArgGly: 4.572 ± 0.074
ArgHis: 1.203 ± 0.035
ArgIle: 3.743 ± 0.059
ArgLys: 2.321 ± 0.057
ArgLeu: 6.556 ± 0.105
ArgMet: 1.689 ± 0.048
ArgAsn: 1.847 ± 0.038
ArgPro: 2.811 ± 0.056
ArgGln: 2.155 ± 0.044
ArgArg: 4.208 ± 0.086
ArgSer: 3.692 ± 0.067
ArgThr: 3.077 ± 0.055
ArgVal: 4.27 ± 0.077
ArgTrp: 0.938 ± 0.03
ArgTyr: 1.58 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.7 ± 0.103
SerCys: 0.448 ± 0.023
SerAsp: 4.047 ± 0.096
SerGlu: 3.625 ± 0.056
SerPhe: 2.409 ± 0.053
SerGly: 6.044 ± 0.108
SerHis: 0.998 ± 0.034
SerIle: 2.981 ± 0.071
SerLys: 1.882 ± 0.051
SerLeu: 5.598 ± 0.079
SerMet: 1.321 ± 0.037
SerAsn: 1.833 ± 0.048
SerPro: 2.816 ± 0.058
SerGln: 2.111 ± 0.045
SerArg: 3.511 ± 0.062
SerSer: 3.573 ± 0.079
SerThr: 2.798 ± 0.059
SerVal: 4.06 ± 0.072
SerTrp: 0.797 ± 0.027
SerTyr: 1.346 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.993 ± 0.1
ThrCys: 0.408 ± 0.022
ThrAsp: 3.0 ± 0.065
ThrGlu: 2.563 ± 0.055
ThrPhe: 2.102 ± 0.06
ThrGly: 5.306 ± 0.099
ThrHis: 0.984 ± 0.031
ThrIle: 3.083 ± 0.086
ThrLys: 1.709 ± 0.047
ThrLeu: 5.58 ± 0.093
ThrMet: 1.151 ± 0.034
ThrAsn: 1.679 ± 0.048
ThrPro: 3.146 ± 0.061
ThrGln: 1.919 ± 0.052
ThrArg: 3.324 ± 0.061
ThrSer: 3.304 ± 0.071
ThrThr: 2.604 ± 0.06
ThrVal: 4.011 ± 0.077
ThrTrp: 0.599 ± 0.028
ThrTyr: 1.23 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.865 ± 0.09
ValCys: 0.593 ± 0.026
ValAsp: 4.263 ± 0.069
ValGlu: 4.713 ± 0.067
ValPhe: 2.628 ± 0.046
ValGly: 5.361 ± 0.082
ValHis: 1.197 ± 0.034
ValIle: 4.004 ± 0.065
ValLys: 2.162 ± 0.06
ValLeu: 6.593 ± 0.085
ValMet: 1.734 ± 0.048
ValAsn: 2.145 ± 0.069
ValPro: 3.577 ± 0.059
ValGln: 2.11 ± 0.051
ValArg: 4.24 ± 0.07
ValSer: 4.362 ± 0.075
ValThr: 4.407 ± 0.082
ValVal: 4.9 ± 0.083
ValTrp: 0.839 ± 0.027
ValTyr: 1.318 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.249 ± 0.038
TrpCys: 0.117 ± 0.01
TrpAsp: 0.71 ± 0.029
TrpGlu: 0.758 ± 0.025
TrpPhe: 0.575 ± 0.025
TrpGly: 0.96 ± 0.033
TrpHis: 0.317 ± 0.021
TrpIle: 0.677 ± 0.027
TrpLys: 0.483 ± 0.023
TrpLeu: 1.693 ± 0.048
TrpMet: 0.36 ± 0.019
TrpAsn: 0.426 ± 0.021
TrpPro: 0.587 ± 0.025
TrpGln: 0.626 ± 0.024
TrpArg: 1.078 ± 0.034
TrpSer: 0.871 ± 0.033
TrpThr: 0.657 ± 0.023
TrpVal: 0.862 ± 0.026
TrpTrp: 0.227 ± 0.015
TrpTyr: 0.28 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.493 ± 0.049
TyrCys: 0.238 ± 0.015
TyrAsp: 1.497 ± 0.043
TyrGlu: 1.299 ± 0.037
TyrPhe: 0.928 ± 0.027
TyrGly: 2.031 ± 0.05
TyrHis: 0.456 ± 0.022
TyrIle: 0.972 ± 0.028
TyrLys: 0.619 ± 0.025
TyrLeu: 2.067 ± 0.048
TyrMet: 0.382 ± 0.017
TyrAsn: 0.648 ± 0.026
TyrPro: 0.966 ± 0.033
TyrGln: 0.747 ± 0.026
TyrArg: 1.594 ± 0.047
TyrSer: 1.379 ± 0.033
TyrThr: 1.114 ± 0.031
TyrVal: 1.413 ± 0.037
TyrTrp: 0.376 ± 0.021
TyrTyr: 0.543 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3219 proteins (1072962 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
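To make the note concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed each time, and the spread of those estimates is reported. Using the standard deviation as the error measure, the function names, and the fixed seed are assumptions of this sketch; the text does not specify how Proteome-pI summarizes the resampled values.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency (per mille).

    Each round resamples whole proteins with replacement and recomputes the
    frequency of `pair`; the standard deviation over all rounds is returned
    (an assumed error measure, see the lead-in).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring dipeptides within a protein are preserved in every replicate.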

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski