Amino acid dipepetide frequency for Massilia armeniaca

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.989AlaAla: 19.989 ± 0.2
1.275AlaCys: 1.275 ± 0.028
6.947AlaAsp: 6.947 ± 0.065
6.463AlaGlu: 6.463 ± 0.087
3.891AlaPhe: 3.891 ± 0.05
11.566AlaGly: 11.566 ± 0.115
2.567AlaHis: 2.567 ± 0.039
5.536AlaIle: 5.536 ± 0.061
3.844AlaLys: 3.844 ± 0.07
15.143AlaLeu: 15.143 ± 0.163
3.176AlaMet: 3.176 ± 0.04
3.316AlaAsn: 3.316 ± 0.052
6.899AlaPro: 6.899 ± 0.09
5.83AlaGln: 5.83 ± 0.069
9.171AlaArg: 9.171 ± 0.107
6.6AlaSer: 6.6 ± 0.073
6.768AlaThr: 6.768 ± 0.08
9.005AlaVal: 9.005 ± 0.085
1.85AlaTrp: 1.85 ± 0.038
2.772AlaTyr: 2.772 ± 0.041
0.0AlaXaa: 0.0 ± 0.0
Cys
1.251CysAla: 1.251 ± 0.027
0.129CysCys: 0.129 ± 0.009
0.52CysAsp: 0.52 ± 0.017
0.416CysGlu: 0.416 ± 0.016
0.267CysPhe: 0.267 ± 0.012
0.937CysGly: 0.937 ± 0.026
0.262CysHis: 0.262 ± 0.014
0.366CysIle: 0.366 ± 0.015
0.208CysLys: 0.208 ± 0.012
0.795CysLeu: 0.795 ± 0.023
0.192CysMet: 0.192 ± 0.01
0.238CysAsn: 0.238 ± 0.013
0.44CysPro: 0.44 ± 0.018
0.255CysGln: 0.255 ± 0.011
0.615CysArg: 0.615 ± 0.021
0.447CysSer: 0.447 ± 0.018
0.486CysThr: 0.486 ± 0.018
0.645CysVal: 0.645 ± 0.022
0.14CysTrp: 0.14 ± 0.009
0.23CysTyr: 0.23 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.115AspAla: 7.115 ± 0.073
0.479AspCys: 0.479 ± 0.018
3.045AspAsp: 3.045 ± 0.045
3.218AspGlu: 3.218 ± 0.051
2.076AspPhe: 2.076 ± 0.034
5.373AspGly: 5.373 ± 0.071
1.023AspHis: 1.023 ± 0.027
2.6AspIle: 2.6 ± 0.042
1.988AspLys: 1.988 ± 0.04
5.069AspLeu: 5.069 ± 0.053
1.215AspMet: 1.215 ± 0.027
1.582AspAsn: 1.582 ± 0.033
2.961AspPro: 2.961 ± 0.045
1.747AspGln: 1.747 ± 0.034
3.1AspArg: 3.1 ± 0.049
2.553AspSer: 2.553 ± 0.036
2.875AspThr: 2.875 ± 0.053
4.217AspVal: 4.217 ± 0.046
0.945AspTrp: 0.945 ± 0.024
1.642AspTyr: 1.642 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
6.212GluAla: 6.212 ± 0.065
0.365GluCys: 0.365 ± 0.014
2.117GluAsp: 2.117 ± 0.034
2.702GluGlu: 2.702 ± 0.049
1.732GluPhe: 1.732 ± 0.03
3.389GluGly: 3.389 ± 0.043
1.358GluHis: 1.358 ± 0.03
2.433GluIle: 2.433 ± 0.047
1.857GluLys: 1.857 ± 0.039
5.945GluLeu: 5.945 ± 0.076
1.206GluMet: 1.206 ± 0.028
1.347GluAsn: 1.347 ± 0.027
2.314GluPro: 2.314 ± 0.04
2.809GluGln: 2.809 ± 0.044
4.637GluArg: 4.637 ± 0.06
2.127GluSer: 2.127 ± 0.037
2.639GluThr: 2.639 ± 0.04
3.491GluVal: 3.491 ± 0.053
0.719GluTrp: 0.719 ± 0.02
1.201GluTyr: 1.201 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
4.26PheAla: 4.26 ± 0.053
0.357PheCys: 0.357 ± 0.015
2.58PheAsp: 2.58 ± 0.042
1.809PheGlu: 1.809 ± 0.036
1.228PhePhe: 1.228 ± 0.034
3.27PheGly: 3.27 ± 0.048
0.748PheHis: 0.748 ± 0.02
1.468PheIle: 1.468 ± 0.032
1.088PheLys: 1.088 ± 0.028
2.898PheLeu: 2.898 ± 0.045
0.745PheMet: 0.745 ± 0.021
1.197PheAsn: 1.197 ± 0.025
1.38PhePro: 1.38 ± 0.028
1.052PheGln: 1.052 ± 0.028
1.89PheArg: 1.89 ± 0.029
1.974PheSer: 1.974 ± 0.036
2.053PheThr: 2.053 ± 0.036
2.645PheVal: 2.645 ± 0.042
0.465PheTrp: 0.465 ± 0.018
0.994PheTyr: 0.994 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
9.905GlyAla: 9.905 ± 0.099
0.824GlyCys: 0.824 ± 0.024
4.183GlyAsp: 4.183 ± 0.062
4.062GlyGlu: 4.062 ± 0.052
3.121GlyPhe: 3.121 ± 0.043
7.08GlyGly: 7.08 ± 0.093
1.92GlyHis: 1.92 ± 0.042
4.165GlyIle: 4.165 ± 0.053
3.625GlyLys: 3.625 ± 0.056
7.993GlyLeu: 7.993 ± 0.083
2.318GlyMet: 2.318 ± 0.035
2.666GlyAsn: 2.666 ± 0.061
3.002GlyPro: 3.002 ± 0.048
3.238GlyGln: 3.238 ± 0.047
5.514GlyArg: 5.514 ± 0.059
4.706GlySer: 4.706 ± 0.063
5.144GlyThr: 5.144 ± 0.076
6.155GlyVal: 6.155 ± 0.066
1.43GlyTrp: 1.43 ± 0.032
2.478GlyTyr: 2.478 ± 0.039
0.0GlyXaa: 0.0 ± 0.0
His
2.855HisAla: 2.855 ± 0.049
0.261HisCys: 0.261 ± 0.012
1.314HisAsp: 1.314 ± 0.026
1.159HisGlu: 1.159 ± 0.029
0.872HisPhe: 0.872 ± 0.023
2.277HisGly: 2.277 ± 0.04
0.635HisHis: 0.635 ± 0.024
0.905HisIle: 0.905 ± 0.024
0.609HisLys: 0.609 ± 0.019
2.13HisLeu: 2.13 ± 0.039
0.5HisMet: 0.5 ± 0.018
0.552HisAsn: 0.552 ± 0.019
1.319HisPro: 1.319 ± 0.029
0.732HisGln: 0.732 ± 0.018
1.403HisArg: 1.403 ± 0.028
1.002HisSer: 1.002 ± 0.023
1.049HisThr: 1.049 ± 0.022
1.661HisVal: 1.661 ± 0.035
0.375HisTrp: 0.375 ± 0.013
0.675HisTyr: 0.675 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.235IleAla: 6.235 ± 0.07
0.367IleCys: 0.367 ± 0.013
3.243IleAsp: 3.243 ± 0.044
2.86IleGlu: 2.86 ± 0.042
1.212IlePhe: 1.212 ± 0.027
4.288IleGly: 4.288 ± 0.054
0.874IleHis: 0.874 ± 0.023
1.514IleIle: 1.514 ± 0.038
1.429IleLys: 1.429 ± 0.03
3.44IleLeu: 3.44 ± 0.053
0.775IleMet: 0.775 ± 0.022
1.399IleAsn: 1.399 ± 0.031
1.944IlePro: 1.944 ± 0.031
1.042IleGln: 1.042 ± 0.025
2.647IleArg: 2.647 ± 0.041
2.247IleSer: 2.247 ± 0.037
2.38IleThr: 2.38 ± 0.04
3.996IleVal: 3.996 ± 0.055
0.442IleTrp: 0.442 ± 0.018
0.957IleTyr: 0.957 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
3.959LysAla: 3.959 ± 0.064
0.143LysCys: 0.143 ± 0.009
1.67LysAsp: 1.67 ± 0.036
1.626LysGlu: 1.626 ± 0.033
1.029LysPhe: 1.029 ± 0.023
2.365LysGly: 2.365 ± 0.04
0.686LysHis: 0.686 ± 0.024
1.481LysIle: 1.481 ± 0.036
1.432LysLys: 1.432 ± 0.04
3.692LysLeu: 3.692 ± 0.05
0.869LysMet: 0.869 ± 0.027
1.115LysAsn: 1.115 ± 0.034
2.043LysPro: 2.043 ± 0.038
1.316LysGln: 1.316 ± 0.029
2.02LysArg: 2.02 ± 0.037
1.681LysSer: 1.681 ± 0.035
1.861LysThr: 1.861 ± 0.043
2.569LysVal: 2.569 ± 0.047
0.371LysTrp: 0.371 ± 0.016
0.784LysTyr: 0.784 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
15.941LeuAla: 15.941 ± 0.153
1.037LeuCys: 1.037 ± 0.027
5.929LeuAsp: 5.929 ± 0.06
5.007LeuGlu: 5.007 ± 0.071
3.386LeuPhe: 3.386 ± 0.047
7.985LeuGly: 7.985 ± 0.073
2.36LeuHis: 2.36 ± 0.043
3.872LeuIle: 3.872 ± 0.057
3.389LeuLys: 3.389 ± 0.043
11.462LeuLeu: 11.462 ± 0.132
2.164LeuMet: 2.164 ± 0.039
2.786LeuAsn: 2.786 ± 0.04
6.147LeuPro: 6.147 ± 0.068
3.972LeuGln: 3.972 ± 0.054
7.921LeuArg: 7.921 ± 0.089
5.417LeuSer: 5.417 ± 0.065
5.703LeuThr: 5.703 ± 0.079
7.5LeuVal: 7.5 ± 0.067
1.191LeuTrp: 1.191 ± 0.032
2.326LeuTyr: 2.326 ± 0.036
0.0LeuXaa: 0.0 ± 0.0
Met
2.839MetAla: 2.839 ± 0.044
0.154MetCys: 0.154 ± 0.01
1.115MetAsp: 1.115 ± 0.025
1.043MetGlu: 1.043 ± 0.027
0.671MetPhe: 0.671 ± 0.021
1.615MetGly: 1.615 ± 0.031
0.522MetHis: 0.522 ± 0.017
0.928MetIle: 0.928 ± 0.022
0.987MetLys: 0.987 ± 0.027
2.595MetLeu: 2.595 ± 0.044
0.608MetMet: 0.608 ± 0.02
0.855MetAsn: 0.855 ± 0.02
1.361MetPro: 1.361 ± 0.028
1.088MetGln: 1.088 ± 0.026
1.619MetArg: 1.619 ± 0.032
1.376MetSer: 1.376 ± 0.03
1.613MetThr: 1.613 ± 0.03
1.551MetVal: 1.551 ± 0.03
0.196MetTrp: 0.196 ± 0.01
0.442MetTyr: 0.442 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.541AsnAla: 3.541 ± 0.05
0.267AsnCys: 0.267 ± 0.013
1.581AsnAsp: 1.581 ± 0.047
1.282AsnGlu: 1.282 ± 0.028
1.0AsnPhe: 1.0 ± 0.025
2.784AsnGly: 2.784 ± 0.048
0.568AsnHis: 0.568 ± 0.017
1.326AsnIle: 1.326 ± 0.032
0.903AsnLys: 0.903 ± 0.027
2.874AsnLeu: 2.874 ± 0.045
0.644AsnMet: 0.644 ± 0.02
0.967AsnAsn: 0.967 ± 0.03
1.809AsnPro: 1.809 ± 0.034
0.969AsnGln: 0.969 ± 0.027
1.814AsnArg: 1.814 ± 0.035
1.339AsnSer: 1.339 ± 0.03
1.582AsnThr: 1.582 ± 0.032
2.394AsnVal: 2.394 ± 0.044
0.414AsnTrp: 0.414 ± 0.017
0.885AsnTyr: 0.885 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
7.95ProAla: 7.95 ± 0.091
0.39ProCys: 0.39 ± 0.015
3.318ProAsp: 3.318 ± 0.047
3.056ProGlu: 3.056 ± 0.042
1.758ProPhe: 1.758 ± 0.033
4.658ProGly: 4.658 ± 0.059
1.136ProHis: 1.136 ± 0.027
1.796ProIle: 1.796 ± 0.032
1.425ProLys: 1.425 ± 0.033
5.228ProLeu: 5.228 ± 0.062
1.067ProMet: 1.067 ± 0.025
1.39ProAsn: 1.39 ± 0.027
2.703ProPro: 2.703 ± 0.051
2.017ProGln: 2.017 ± 0.036
2.892ProArg: 2.892 ± 0.05
2.398ProSer: 2.398 ± 0.035
2.498ProThr: 2.498 ± 0.042
4.284ProVal: 4.284 ± 0.051
0.665ProTrp: 0.665 ± 0.022
1.225ProTyr: 1.225 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
5.636GlnAla: 5.636 ± 0.07
0.292GlnCys: 0.292 ± 0.013
1.692GlnAsp: 1.692 ± 0.034
1.706GlnGlu: 1.706 ± 0.036
1.366GlnPhe: 1.366 ± 0.028
3.023GlnGly: 3.023 ± 0.045
0.985GlnHis: 0.985 ± 0.023
1.612GlnIle: 1.612 ± 0.035
1.164GlnLys: 1.164 ± 0.029
4.511GlnLeu: 4.511 ± 0.058
0.973GlnMet: 0.973 ± 0.023
0.977GlnAsn: 0.977 ± 0.024
2.275GlnPro: 2.275 ± 0.043
2.085GlnGln: 2.085 ± 0.038
3.277GlnArg: 3.277 ± 0.047
1.761GlnSer: 1.761 ± 0.036
1.763GlnThr: 1.763 ± 0.035
3.001GlnVal: 3.001 ± 0.05
0.572GlnTrp: 0.572 ± 0.017
1.007GlnTyr: 1.007 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
8.137ArgAla: 8.137 ± 0.079
0.586ArgCys: 0.586 ± 0.018
3.813ArgAsp: 3.813 ± 0.06
3.751ArgGlu: 3.751 ± 0.049
2.649ArgPhe: 2.649 ± 0.036
4.59ArgGly: 4.59 ± 0.054
1.994ArgHis: 1.994 ± 0.038
3.464ArgIle: 3.464 ± 0.048
2.107ArgLys: 2.107 ± 0.039
7.654ArgLeu: 7.654 ± 0.08
1.863ArgMet: 1.863 ± 0.028
1.932ArgAsn: 1.932 ± 0.032
3.225ArgPro: 3.225 ± 0.051
3.205ArgGln: 3.205 ± 0.047
5.227ArgArg: 5.227 ± 0.065
3.097ArgSer: 3.097 ± 0.046
3.515ArgThr: 3.515 ± 0.053
4.547ArgVal: 4.547 ± 0.048
1.12ArgTrp: 1.12 ± 0.031
2.138ArgTyr: 2.138 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
6.37SerAla: 6.37 ± 0.081
0.428SerCys: 0.428 ± 0.016
2.594SerAsp: 2.594 ± 0.037
2.26SerGlu: 2.26 ± 0.038
1.919SerPhe: 1.919 ± 0.035
4.982SerGly: 4.982 ± 0.07
1.136SerHis: 1.136 ± 0.026
2.361SerIle: 2.361 ± 0.04
1.547SerLys: 1.547 ± 0.028
5.091SerLeu: 5.091 ± 0.057
1.214SerMet: 1.214 ± 0.027
1.541SerAsn: 1.541 ± 0.031
2.443SerPro: 2.443 ± 0.041
1.718SerGln: 1.718 ± 0.031
3.129SerArg: 3.129 ± 0.046
2.893SerSer: 2.893 ± 0.051
2.91SerThr: 2.91 ± 0.065
3.727SerVal: 3.727 ± 0.053
0.745SerTrp: 0.745 ± 0.022
1.471SerTyr: 1.471 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
6.663ThrAla: 6.663 ± 0.086
0.42ThrCys: 0.42 ± 0.016
2.832ThrAsp: 2.832 ± 0.043
2.367ThrGlu: 2.367 ± 0.035
1.927ThrPhe: 1.927 ± 0.041
4.893ThrGly: 4.893 ± 0.06
1.047ThrHis: 1.047 ± 0.028
2.671ThrIle: 2.671 ± 0.045
1.405ThrLys: 1.405 ± 0.033
6.393ThrLeu: 6.393 ± 0.062
1.217ThrMet: 1.217 ± 0.031
1.44ThrAsn: 1.44 ± 0.036
3.559ThrPro: 3.559 ± 0.056
1.931ThrGln: 1.931 ± 0.045
3.265ThrArg: 3.265 ± 0.045
2.879ThrSer: 2.879 ± 0.076
3.162ThrThr: 3.162 ± 0.07
4.828ThrVal: 4.828 ± 0.058
0.775ThrTrp: 0.775 ± 0.024
1.372ThrTyr: 1.372 ± 0.029
0.0ThrXaa: 0.0 ± 0.0
Val
9.778ValAla: 9.778 ± 0.083
0.643ValCys: 0.643 ± 0.02
4.042ValAsp: 4.042 ± 0.051
4.002ValGlu: 4.002 ± 0.056
2.434ValPhe: 2.434 ± 0.041
5.334ValGly: 5.334 ± 0.063
1.496ValHis: 1.496 ± 0.025
3.319ValIle: 3.319 ± 0.046
2.479ValLys: 2.479 ± 0.043
8.061ValLeu: 8.061 ± 0.087
1.625ValMet: 1.625 ± 0.035
2.381ValAsn: 2.381 ± 0.038
4.117ValPro: 4.117 ± 0.05
2.786ValGln: 2.786 ± 0.038
5.085ValArg: 5.085 ± 0.063
3.888ValSer: 3.888 ± 0.049
4.847ValThr: 4.847 ± 0.057
5.721ValVal: 5.721 ± 0.058
0.841ValTrp: 0.841 ± 0.024
1.7ValTyr: 1.7 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.146TrpAla: 1.146 ± 0.025
0.154TrpCys: 0.154 ± 0.009
0.682TrpAsp: 0.682 ± 0.023
0.567TrpGlu: 0.567 ± 0.017
0.538TrpPhe: 0.538 ± 0.019
0.861TrpGly: 0.861 ± 0.025
0.428TrpHis: 0.428 ± 0.015
0.595TrpIle: 0.595 ± 0.019
0.438TrpLys: 0.438 ± 0.015
1.93TrpLeu: 1.93 ± 0.036
0.351TrpMet: 0.351 ± 0.014
0.491TrpAsn: 0.491 ± 0.019
0.64TrpPro: 0.64 ± 0.019
0.846TrpGln: 0.846 ± 0.025
1.288TrpArg: 1.288 ± 0.03
0.749TrpSer: 0.749 ± 0.022
0.753TrpThr: 0.753 ± 0.026
0.816TrpVal: 0.816 ± 0.024
0.263TrpTrp: 0.263 ± 0.014
0.364TrpTyr: 0.364 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.829TyrAla: 2.829 ± 0.038
0.267TyrCys: 0.267 ± 0.013
1.607TyrAsp: 1.607 ± 0.038
1.243TyrGlu: 1.243 ± 0.025
1.024TyrPhe: 1.024 ± 0.028
2.26TyrGly: 2.26 ± 0.044
0.538TyrHis: 0.538 ± 0.018
0.879TyrIle: 0.879 ± 0.022
0.786TyrLys: 0.786 ± 0.022
2.59TyrLeu: 2.59 ± 0.034
0.457TyrMet: 0.457 ± 0.017
0.771TyrAsn: 0.771 ± 0.025
1.273TyrPro: 1.273 ± 0.031
1.04TyrGln: 1.04 ± 0.025
2.099TyrArg: 2.099 ± 0.036
1.344TyrSer: 1.344 ± 0.029
1.447TyrThr: 1.447 ± 0.034
1.825TyrVal: 1.825 ± 0.034
0.384TyrTrp: 0.384 ± 0.016
0.789TyrTyr: 0.789 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5145 proteins (1777161 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski