Amino acid dipepetide frequency for Erwinia iniecta

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.904AlaAla: 10.904 ± 0.132
0.987AlaCys: 0.987 ± 0.031
5.216AlaAsp: 5.216 ± 0.061
6.209AlaGlu: 6.209 ± 0.071
3.543AlaPhe: 3.543 ± 0.062
7.741AlaGly: 7.741 ± 0.082
1.709AlaHis: 1.709 ± 0.035
5.998AlaIle: 5.998 ± 0.079
3.568AlaLys: 3.568 ± 0.074
12.19AlaLeu: 12.19 ± 0.118
2.974AlaMet: 2.974 ± 0.051
3.014AlaAsn: 3.014 ± 0.053
3.534AlaPro: 3.534 ± 0.054
4.625AlaGln: 4.625 ± 0.069
5.654AlaArg: 5.654 ± 0.066
5.962AlaSer: 5.962 ± 0.074
4.775AlaThr: 4.775 ± 0.074
6.825AlaVal: 6.825 ± 0.086
1.511AlaTrp: 1.511 ± 0.032
1.983AlaTyr: 1.983 ± 0.044
0.001AlaXaa: 0.001 ± 0.001
Cys
0.864CysAla: 0.864 ± 0.027
0.166CysCys: 0.166 ± 0.012
0.54CysAsp: 0.54 ± 0.019
0.511CysGlu: 0.511 ± 0.019
0.454CysPhe: 0.454 ± 0.02
0.969CysGly: 0.969 ± 0.031
0.33CysHis: 0.33 ± 0.017
0.522CysIle: 0.522 ± 0.022
0.267CysLys: 0.267 ± 0.014
1.014CysLeu: 1.014 ± 0.028
0.206CysMet: 0.206 ± 0.013
0.296CysAsn: 0.296 ± 0.016
0.422CysPro: 0.422 ± 0.017
0.502CysGln: 0.502 ± 0.024
0.577CysArg: 0.577 ± 0.023
0.623CysSer: 0.623 ± 0.023
0.408CysThr: 0.408 ± 0.017
0.652CysVal: 0.652 ± 0.023
0.187CysTrp: 0.187 ± 0.012
0.322CysTyr: 0.322 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.287AspAla: 5.287 ± 0.071
0.496AspCys: 0.496 ± 0.022
3.03AspAsp: 3.03 ± 0.063
3.359AspGlu: 3.359 ± 0.059
2.208AspPhe: 2.208 ± 0.046
3.776AspGly: 3.776 ± 0.054
1.067AspHis: 1.067 ± 0.033
3.454AspIle: 3.454 ± 0.049
2.412AspLys: 2.412 ± 0.048
4.648AspLeu: 4.648 ± 0.066
1.286AspMet: 1.286 ± 0.034
2.408AspAsn: 2.408 ± 0.048
2.158AspPro: 2.158 ± 0.042
1.911AspGln: 1.911 ± 0.045
2.933AspArg: 2.933 ± 0.054
3.192AspSer: 3.192 ± 0.084
2.302AspThr: 2.302 ± 0.038
3.495AspVal: 3.495 ± 0.058
0.846AspTrp: 0.846 ± 0.025
1.955AspTyr: 1.955 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
5.133GluAla: 5.133 ± 0.072
0.378GluCys: 0.378 ± 0.016
2.207GluAsp: 2.207 ± 0.043
3.094GluGlu: 3.094 ± 0.058
1.83GluPhe: 1.83 ± 0.039
3.347GluGly: 3.347 ± 0.053
1.255GluHis: 1.255 ± 0.033
3.335GluIle: 3.335 ± 0.046
3.065GluLys: 3.065 ± 0.057
5.859GluLeu: 5.859 ± 0.08
1.743GluMet: 1.743 ± 0.039
2.366GluAsn: 2.366 ± 0.043
2.022GluPro: 2.022 ± 0.041
3.694GluGln: 3.694 ± 0.067
3.468GluArg: 3.468 ± 0.055
3.015GluSer: 3.015 ± 0.04
2.713GluThr: 2.713 ± 0.044
3.786GluVal: 3.786 ± 0.061
0.753GluTrp: 0.753 ± 0.021
1.405GluTyr: 1.405 ± 0.036
0.0GluXaa: 0.0 ± 0.0
Phe
3.835PheAla: 3.835 ± 0.059
0.53PheCys: 0.53 ± 0.019
2.365PheAsp: 2.365 ± 0.044
1.662PheGlu: 1.662 ± 0.034
1.699PhePhe: 1.699 ± 0.042
3.131PheGly: 3.131 ± 0.062
0.818PheHis: 0.818 ± 0.025
2.642PheIle: 2.642 ± 0.051
1.232PheLys: 1.232 ± 0.031
3.33PheLeu: 3.33 ± 0.055
0.956PheMet: 0.956 ± 0.026
1.786PheAsn: 1.786 ± 0.036
1.592PhePro: 1.592 ± 0.037
1.227PheGln: 1.227 ± 0.031
1.954PheArg: 1.954 ± 0.042
3.429PheSer: 3.429 ± 0.043
2.321PheThr: 2.321 ± 0.042
2.401PheVal: 2.401 ± 0.049
0.651PheTrp: 0.651 ± 0.025
1.204PheTyr: 1.204 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
6.091GlyAla: 6.091 ± 0.078
0.947GlyCys: 0.947 ± 0.031
3.655GlyAsp: 3.655 ± 0.056
4.529GlyGlu: 4.529 ± 0.067
3.288GlyPhe: 3.288 ± 0.047
5.13GlyGly: 5.13 ± 0.084
1.546GlyHis: 1.546 ± 0.036
4.933GlyIle: 4.933 ± 0.072
3.906GlyLys: 3.906 ± 0.058
7.452GlyLeu: 7.452 ± 0.089
2.334GlyMet: 2.334 ± 0.046
2.75GlyAsn: 2.75 ± 0.053
1.954GlyPro: 1.954 ± 0.036
2.973GlyGln: 2.973 ± 0.055
3.755GlyArg: 3.755 ± 0.055
4.431GlySer: 4.431 ± 0.057
3.42GlyThr: 3.42 ± 0.054
5.576GlyVal: 5.576 ± 0.073
1.363GlyTrp: 1.363 ± 0.033
2.626GlyTyr: 2.626 ± 0.042
0.002GlyXaa: 0.002 ± 0.001
His
1.95HisAla: 1.95 ± 0.041
0.322HisCys: 0.322 ± 0.017
1.132HisAsp: 1.132 ± 0.03
1.004HisGlu: 1.004 ± 0.028
1.074HisPhe: 1.074 ± 0.029
1.709HisGly: 1.709 ± 0.041
0.799HisHis: 0.799 ± 0.031
1.214HisIle: 1.214 ± 0.031
0.703HisLys: 0.703 ± 0.025
2.282HisLeu: 2.282 ± 0.041
0.489HisMet: 0.489 ± 0.02
0.832HisAsn: 0.832 ± 0.023
1.312HisPro: 1.312 ± 0.031
1.516HisGln: 1.516 ± 0.035
1.245HisArg: 1.245 ± 0.033
1.383HisSer: 1.383 ± 0.031
1.002HisThr: 1.002 ± 0.026
1.099HisVal: 1.099 ± 0.029
0.462HisTrp: 0.462 ± 0.021
0.926HisTyr: 0.926 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
6.622IleAla: 6.622 ± 0.079
0.622IleCys: 0.622 ± 0.021
3.741IleAsp: 3.741 ± 0.055
3.36IleGlu: 3.36 ± 0.047
2.057IlePhe: 2.057 ± 0.043
4.687IleGly: 4.687 ± 0.067
1.147IleHis: 1.147 ± 0.033
3.413IleIle: 3.413 ± 0.06
2.398IleLys: 2.398 ± 0.047
4.948IleLeu: 4.948 ± 0.07
1.256IleMet: 1.256 ± 0.033
2.736IleAsn: 2.736 ± 0.05
2.618IlePro: 2.618 ± 0.042
1.783IleGln: 1.783 ± 0.041
2.838IleArg: 2.838 ± 0.045
4.015IleSer: 4.015 ± 0.061
3.512IleThr: 3.512 ± 0.047
3.721IleVal: 3.721 ± 0.06
0.706IleTrp: 0.706 ± 0.027
1.513IleTyr: 1.513 ± 0.038
0.001IleXaa: 0.001 ± 0.001
Lys
4.006LysAla: 4.006 ± 0.068
0.225LysCys: 0.225 ± 0.014
1.957LysAsp: 1.957 ± 0.046
2.15LysGlu: 2.15 ± 0.05
1.118LysPhe: 1.118 ± 0.031
2.687LysGly: 2.687 ± 0.057
0.841LysHis: 0.841 ± 0.027
2.321LysIle: 2.321 ± 0.049
2.114LysLys: 2.114 ± 0.047
4.282LysLeu: 4.282 ± 0.063
1.157LysMet: 1.157 ± 0.034
1.707LysAsn: 1.707 ± 0.042
2.119LysPro: 2.119 ± 0.043
2.194LysGln: 2.194 ± 0.044
2.486LysArg: 2.486 ± 0.041
2.395LysSer: 2.395 ± 0.051
2.275LysThr: 2.275 ± 0.045
2.868LysVal: 2.868 ± 0.052
0.406LysTrp: 0.406 ± 0.019
1.009LysTyr: 1.009 ± 0.028
0.0LysXaa: 0.0 ± 0.0
Leu
11.846LeuAla: 11.846 ± 0.105
1.188LeuCys: 1.188 ± 0.031
5.572LeuAsp: 5.572 ± 0.075
5.349LeuGlu: 5.349 ± 0.072
4.363LeuPhe: 4.363 ± 0.067
7.178LeuGly: 7.178 ± 0.09
2.349LeuHis: 2.349 ± 0.046
6.155LeuIle: 6.155 ± 0.083
4.582LeuLys: 4.582 ± 0.062
12.941LeuLeu: 12.941 ± 0.147
3.184LeuMet: 3.184 ± 0.05
4.351LeuAsn: 4.351 ± 0.061
5.904LeuPro: 5.904 ± 0.083
4.842LeuGln: 4.842 ± 0.072
6.323LeuArg: 6.323 ± 0.078
8.011LeuSer: 8.011 ± 0.092
6.497LeuThr: 6.497 ± 0.077
7.109LeuVal: 7.109 ± 0.08
1.427LeuTrp: 1.427 ± 0.041
2.557LeuTyr: 2.557 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
2.84MetAla: 2.84 ± 0.047
0.17MetCys: 0.17 ± 0.01
1.136MetAsp: 1.136 ± 0.029
1.148MetGlu: 1.148 ± 0.032
0.846MetPhe: 0.846 ± 0.026
1.805MetGly: 1.805 ± 0.039
0.517MetHis: 0.517 ± 0.022
1.519MetIle: 1.519 ± 0.036
1.323MetLys: 1.323 ± 0.03
3.381MetLeu: 3.381 ± 0.051
0.919MetMet: 0.919 ± 0.026
1.093MetAsn: 1.093 ± 0.026
1.373MetPro: 1.373 ± 0.034
1.303MetGln: 1.303 ± 0.035
1.492MetArg: 1.492 ± 0.034
1.858MetSer: 1.858 ± 0.042
1.705MetThr: 1.705 ± 0.035
1.957MetVal: 1.957 ± 0.039
0.232MetTrp: 0.232 ± 0.014
0.454MetTyr: 0.454 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.508AsnAla: 3.508 ± 0.052
0.327AsnCys: 0.327 ± 0.015
1.976AsnAsp: 1.976 ± 0.041
1.761AsnGlu: 1.761 ± 0.035
1.408AsnPhe: 1.408 ± 0.036
2.975AsnGly: 2.975 ± 0.053
0.838AsnHis: 0.838 ± 0.025
2.369AsnIle: 2.369 ± 0.054
1.565AsnLys: 1.565 ± 0.04
3.689AsnLeu: 3.689 ± 0.051
0.881AsnMet: 0.881 ± 0.025
1.677AsnAsn: 1.677 ± 0.047
2.107AsnPro: 2.107 ± 0.04
1.914AsnGln: 1.914 ± 0.042
2.061AsnArg: 2.061 ± 0.045
2.273AsnSer: 2.273 ± 0.045
1.858AsnThr: 1.858 ± 0.04
2.369AsnVal: 2.369 ± 0.042
0.627AsnTrp: 0.627 ± 0.022
1.205AsnTyr: 1.205 ± 0.032
0.0AsnXaa: 0.0 ± 0.0
Pro
4.774ProAla: 4.774 ± 0.067
0.335ProCys: 0.335 ± 0.016
2.744ProAsp: 2.744 ± 0.044
3.256ProGlu: 3.256 ± 0.051
1.821ProPhe: 1.821 ± 0.033
3.43ProGly: 3.43 ± 0.052
1.057ProHis: 1.057 ± 0.029
1.817ProIle: 1.817 ± 0.036
1.429ProLys: 1.429 ± 0.036
5.426ProLeu: 5.426 ± 0.065
1.109ProMet: 1.109 ± 0.028
1.193ProAsn: 1.193 ± 0.027
1.705ProPro: 1.705 ± 0.041
2.641ProGln: 2.641 ± 0.05
1.979ProArg: 1.979 ± 0.042
2.169ProSer: 2.169 ± 0.037
1.994ProThr: 1.994 ± 0.038
3.817ProVal: 3.817 ± 0.061
0.746ProTrp: 0.746 ± 0.025
1.146ProTyr: 1.146 ± 0.027
0.002ProXaa: 0.002 ± 0.001
Gln
4.988GlnAla: 4.988 ± 0.077
0.375GlnCys: 0.375 ± 0.018
1.98GlnAsp: 1.98 ± 0.041
1.99GlnGlu: 1.99 ± 0.041
1.63GlnPhe: 1.63 ± 0.033
3.266GlnGly: 3.266 ± 0.057
1.585GlnHis: 1.585 ± 0.036
2.423GlnIle: 2.423 ± 0.046
1.718GlnLys: 1.718 ± 0.037
6.099GlnLeu: 6.099 ± 0.091
1.267GlnMet: 1.267 ± 0.028
1.488GlnAsn: 1.488 ± 0.035
2.824GlnPro: 2.824 ± 0.061
4.875GlnGln: 4.875 ± 0.11
3.659GlnArg: 3.659 ± 0.069
2.669GlnSer: 2.669 ± 0.051
2.203GlnThr: 2.203 ± 0.042
3.197GlnVal: 3.197 ± 0.056
0.778GlnTrp: 0.778 ± 0.032
1.227GlnTyr: 1.227 ± 0.031
0.002GlnXaa: 0.002 ± 0.001
Arg
4.752ArgAla: 4.752 ± 0.06
0.559ArgCys: 0.559 ± 0.023
3.145ArgAsp: 3.145 ± 0.056
3.593ArgGlu: 3.593 ± 0.061
2.678ArgPhe: 2.678 ± 0.045
3.4ArgGly: 3.4 ± 0.052
1.596ArgHis: 1.596 ± 0.037
3.342ArgIle: 3.342 ± 0.053
2.209ArgLys: 2.209 ± 0.037
6.619ArgLeu: 6.619 ± 0.088
1.53ArgMet: 1.53 ± 0.038
2.079ArgAsn: 2.079 ± 0.04
2.242ArgPro: 2.242 ± 0.047
3.484ArgGln: 3.484 ± 0.063
3.487ArgArg: 3.487 ± 0.061
2.967ArgSer: 2.967 ± 0.042
2.39ArgThr: 2.39 ± 0.032
3.826ArgVal: 3.826 ± 0.064
0.996ArgTrp: 0.996 ± 0.028
2.173ArgTyr: 2.173 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
6.208SerAla: 6.208 ± 0.065
0.6SerCys: 0.6 ± 0.022
3.468SerAsp: 3.468 ± 0.081
3.403SerGlu: 3.403 ± 0.06
2.498SerPhe: 2.498 ± 0.046
5.667SerGly: 5.667 ± 0.064
1.475SerHis: 1.475 ± 0.036
2.989SerIle: 2.989 ± 0.055
2.107SerLys: 2.107 ± 0.048
7.308SerLeu: 7.308 ± 0.074
1.547SerMet: 1.547 ± 0.035
1.968SerAsn: 1.968 ± 0.046
2.721SerPro: 2.721 ± 0.043
2.865SerGln: 2.865 ± 0.05
3.585SerArg: 3.585 ± 0.054
3.901SerSer: 3.901 ± 0.068
2.943SerThr: 2.943 ± 0.051
4.358SerVal: 4.358 ± 0.059
1.201SerTrp: 1.201 ± 0.03
1.742SerTyr: 1.742 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.915ThrAla: 4.915 ± 0.073
0.403ThrCys: 0.403 ± 0.016
2.576ThrAsp: 2.576 ± 0.046
2.598ThrGlu: 2.598 ± 0.043
1.915ThrPhe: 1.915 ± 0.035
4.103ThrGly: 4.103 ± 0.058
1.139ThrHis: 1.139 ± 0.031
2.629ThrIle: 2.629 ± 0.047
1.376ThrLys: 1.376 ± 0.037
7.463ThrLeu: 7.463 ± 0.092
1.052ThrMet: 1.052 ± 0.032
1.471ThrAsn: 1.471 ± 0.038
3.054ThrPro: 3.054 ± 0.046
2.208ThrGln: 2.208 ± 0.043
3.052ThrArg: 3.052 ± 0.044
2.888ThrSer: 2.888 ± 0.05
2.716ThrThr: 2.716 ± 0.06
3.724ThrVal: 3.724 ± 0.055
0.648ThrTrp: 0.648 ± 0.025
1.026ThrTyr: 1.026 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
7.046ValAla: 7.046 ± 0.082
0.681ValCys: 0.681 ± 0.024
3.613ValAsp: 3.613 ± 0.052
3.84ValGlu: 3.84 ± 0.059
2.487ValPhe: 2.487 ± 0.042
4.637ValGly: 4.637 ± 0.069
1.154ValHis: 1.154 ± 0.034
4.534ValIle: 4.534 ± 0.065
2.975ValLys: 2.975 ± 0.048
7.179ValLeu: 7.179 ± 0.07
2.187ValMet: 2.187 ± 0.052
2.702ValAsn: 2.702 ± 0.045
2.852ValPro: 2.852 ± 0.048
2.402ValGln: 2.402 ± 0.046
3.557ValArg: 3.557 ± 0.058
4.748ValSer: 4.748 ± 0.063
4.104ValThr: 4.104 ± 0.06
5.268ValVal: 5.268 ± 0.084
0.919ValTrp: 0.919 ± 0.027
1.578ValTyr: 1.578 ± 0.035
0.002ValXaa: 0.002 ± 0.001
Trp
0.979TrpAla: 0.979 ± 0.025
0.179TrpCys: 0.179 ± 0.012
0.649TrpAsp: 0.649 ± 0.024
0.561TrpGlu: 0.561 ± 0.021
0.707TrpPhe: 0.707 ± 0.028
0.881TrpGly: 0.881 ± 0.025
0.46TrpHis: 0.46 ± 0.018
0.763TrpIle: 0.763 ± 0.025
0.499TrpLys: 0.499 ± 0.02
2.458TrpLeu: 2.458 ± 0.055
0.428TrpMet: 0.428 ± 0.015
0.488TrpAsn: 0.488 ± 0.02
0.673TrpPro: 0.673 ± 0.022
1.406TrpGln: 1.406 ± 0.039
1.086TrpArg: 1.086 ± 0.031
0.866TrpSer: 0.866 ± 0.026
0.537TrpThr: 0.537 ± 0.022
0.869TrpVal: 0.869 ± 0.024
0.247TrpTrp: 0.247 ± 0.014
0.417TrpTyr: 0.417 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.372TyrAla: 2.372 ± 0.044
0.339TyrCys: 0.339 ± 0.016
1.521TyrAsp: 1.521 ± 0.039
1.092TyrGlu: 1.092 ± 0.029
1.14TyrPhe: 1.14 ± 0.031
2.138TyrGly: 2.138 ± 0.038
0.776TyrHis: 0.776 ± 0.022
1.32TyrIle: 1.32 ± 0.035
0.869TyrLys: 0.869 ± 0.028
3.161TyrLeu: 3.161 ± 0.052
0.567TyrMet: 0.567 ± 0.021
0.981TyrAsn: 0.981 ± 0.032
1.406TyrPro: 1.406 ± 0.034
1.876TyrGln: 1.876 ± 0.037
1.876TyrArg: 1.876 ± 0.036
1.758TyrSer: 1.758 ± 0.044
1.279TyrThr: 1.279 ± 0.035
1.547TyrVal: 1.547 ± 0.035
0.449TyrTrp: 0.449 ± 0.022
0.906TyrTyr: 0.906 ± 0.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.002XaaIle: 0.002 ± 0.001
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.002XaaMet: 0.002 ± 0.001
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.001
0.001XaaArg: 0.001 ± 0.001
0.0XaaSer: 0.0 ± 0.0
0.001XaaThr: 0.001 ± 0.001
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.001
0.062XaaXaa: 0.062 ± 0.02
Statistics based on 4105 proteins (1321614 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski