Amino acid dipeptide frequency for Xenorhabdus nematophila (strain ATCC 19061 / DSM 3370 / CCUG 14189 / LMG 1036 / NCIMB 9965 / AN6)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
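As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for this example only; the sketch is not taken from the Proteome-pI code.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted within
    each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MKLLA", "AALL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

For a real proteome the same function could be applied to sequences read from a FASTA file; the table on this page uses three-letter codes instead of the one-letter codes used here.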

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
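The uniform baseline described above can be written down in a few lines. The sketch below assumes that "expected" means the uniform value (1000 / 400 = 2.5‰, or 1000 / 441 ≈ 2.27‰ when Xaa is included) and simply compares an observed value against it. The helper names are hypothetical, and the exact rule used for colouring the table is not stated on this page.

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 possible dipeptides

# Three values copied from the table on this page (per mille).
examples = {"LeuLeu": 11.186, "AlaAla": 7.209, "TrpTrp": 0.181}
labels = {name: classify(value, expected_20) for name, value in examples.items()}
print(expected_20, round(expected_21, 3), labels)
```

A baseline built from the observed single amino acid frequencies (the product of the two residue frequencies) would be a stricter comparison; that variant is not shown here.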

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
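As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table (7.209‰) into a fraction and into an approximate number of occurrences. The number of dipeptides is an assumption: it is taken as the number of amino acids minus the number of proteins (totals quoted at the bottom of this page), which holds only if dipeptides are counted as overlapping adjacent pairs within each protein.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Totals quoted on this page.
n_proteins = 4439
n_residues = 1253860

# Assumption: each protein of length L contributes L - 1 overlapping pairs.
n_pairs = n_residues - n_proteins

ala_ala_fraction = per_mille_to_fraction(7.209)
approx_count = ala_ala_fraction * n_pairs
print(ala_ala_fraction, round(approx_count))
```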

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and entries the second residue, in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry reads: dipeptide: frequency (‰) ± error
Ala
AlaAla: 7.209 ± 0.105
AlaCys: 1.024 ± 0.031
AlaAsp: 4.284 ± 0.082
AlaGlu: 5.46 ± 0.093
AlaPhe: 3.161 ± 0.053
AlaGly: 6.073 ± 0.087
AlaHis: 1.697 ± 0.04
AlaIle: 5.777 ± 0.078
AlaLys: 4.044 ± 0.077
AlaLeu: 9.466 ± 0.112
AlaMet: 2.271 ± 0.04
AlaAsn: 3.134 ± 0.058
AlaPro: 2.745 ± 0.048
AlaGln: 3.68 ± 0.061
AlaArg: 4.132 ± 0.078
AlaSer: 4.645 ± 0.071
AlaThr: 3.975 ± 0.062
AlaVal: 5.389 ± 0.065
AlaTrp: 1.006 ± 0.03
AlaTyr: 2.389 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.827 ± 0.028
CysCys: 0.227 ± 0.014
CysAsp: 0.597 ± 0.024
CysGlu: 0.607 ± 0.024
CysPhe: 0.455 ± 0.02
CysGly: 1.003 ± 0.033
CysHis: 0.428 ± 0.019
CysIle: 0.665 ± 0.023
CysLys: 0.375 ± 0.019
CysLeu: 1.129 ± 0.032
CysMet: 0.243 ± 0.015
CysAsn: 0.368 ± 0.02
CysPro: 0.523 ± 0.022
CysGln: 0.641 ± 0.025
CysArg: 0.632 ± 0.024
CysSer: 0.794 ± 0.026
CysThr: 0.522 ± 0.022
CysVal: 0.718 ± 0.025
CysTrp: 0.187 ± 0.011
CysTyr: 0.425 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.105 ± 0.066
AspCys: 0.55 ± 0.024
AspAsp: 2.72 ± 0.049
AspGlu: 3.536 ± 0.061
AspPhe: 2.282 ± 0.041
AspGly: 3.417 ± 0.067
AspHis: 1.073 ± 0.029
AspIle: 4.221 ± 0.058
AspLys: 3.049 ± 0.05
AspLeu: 4.736 ± 0.068
AspMet: 1.266 ± 0.034
AspAsn: 2.619 ± 0.05
AspPro: 2.11 ± 0.051
AspGln: 1.64 ± 0.036
AspArg: 2.432 ± 0.055
AspSer: 3.078 ± 0.056
AspThr: 2.618 ± 0.048
AspVal: 3.262 ± 0.063
AspTrp: 0.866 ± 0.027
AspTyr: 1.939 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.583 ± 0.076
GluCys: 0.605 ± 0.023
GluAsp: 2.457 ± 0.049
GluGlu: 3.52 ± 0.074
GluPhe: 2.023 ± 0.047
GluGly: 3.329 ± 0.051
GluHis: 1.618 ± 0.038
GluIle: 4.2 ± 0.06
GluLys: 4.172 ± 0.07
GluLeu: 6.653 ± 0.092
GluMet: 1.701 ± 0.035
GluAsn: 3.012 ± 0.055
GluPro: 2.04 ± 0.047
GluGln: 3.869 ± 0.071
GluArg: 3.72 ± 0.065
GluSer: 3.295 ± 0.051
GluThr: 3.256 ± 0.055
GluVal: 3.282 ± 0.063
GluTrp: 0.848 ± 0.031
GluTyr: 1.87 ± 0.045
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.0 ± 0.05
PheCys: 0.546 ± 0.021
PheAsp: 2.389 ± 0.052
PheGlu: 2.03 ± 0.041
PhePhe: 1.745 ± 0.043
PheGly: 2.857 ± 0.058
PheHis: 0.914 ± 0.032
PheIle: 2.917 ± 0.056
PheLys: 1.719 ± 0.043
PheLeu: 3.37 ± 0.062
PheMet: 1.073 ± 0.026
PheAsn: 2.008 ± 0.041
PhePro: 1.556 ± 0.04
PheGln: 1.263 ± 0.03
PheArg: 1.873 ± 0.038
PheSer: 3.329 ± 0.058
PheThr: 2.263 ± 0.044
PheVal: 2.432 ± 0.048
PheTrp: 0.552 ± 0.021
PheTyr: 1.431 ± 0.037
PheXaa: 0.001 ± 0.001
Gly
GlyAla: 4.937 ± 0.076
GlyCys: 0.92 ± 0.026
GlyAsp: 3.354 ± 0.075
GlyGlu: 4.08 ± 0.069
GlyPhe: 2.95 ± 0.056
GlyGly: 4.686 ± 0.068
GlyHis: 1.669 ± 0.036
GlyIle: 5.276 ± 0.069
GlyLys: 4.26 ± 0.07
GlyLeu: 6.589 ± 0.078
GlyMet: 2.073 ± 0.044
GlyAsn: 2.919 ± 0.081
GlyPro: 1.534 ± 0.035
GlyGln: 2.841 ± 0.063
GlyArg: 3.581 ± 0.069
GlySer: 3.894 ± 0.061
GlyThr: 3.376 ± 0.058
GlyVal: 4.591 ± 0.069
GlyTrp: 1.069 ± 0.027
GlyTyr: 2.742 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.694 ± 0.037
HisCys: 0.395 ± 0.018
HisAsp: 1.272 ± 0.038
HisGlu: 1.231 ± 0.031
HisPhe: 1.177 ± 0.029
HisGly: 1.697 ± 0.04
HisHis: 0.983 ± 0.039
HisIle: 1.615 ± 0.036
HisLys: 1.045 ± 0.033
HisLeu: 2.593 ± 0.062
HisMet: 0.542 ± 0.022
HisAsn: 0.94 ± 0.029
HisPro: 1.29 ± 0.03
HisGln: 1.416 ± 0.035
HisArg: 1.364 ± 0.035
HisSer: 1.537 ± 0.038
HisThr: 1.176 ± 0.032
HisVal: 1.324 ± 0.035
HisTrp: 0.439 ± 0.018
HisTyr: 1.113 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.226 ± 0.077
IleCys: 0.841 ± 0.026
IleAsp: 3.961 ± 0.057
IleGlu: 4.288 ± 0.069
IlePhe: 2.457 ± 0.054
IleGly: 4.749 ± 0.072
IleHis: 1.574 ± 0.035
IleIle: 4.298 ± 0.075
IleLys: 3.464 ± 0.06
IleLeu: 5.974 ± 0.081
IleMet: 1.452 ± 0.036
IleAsn: 3.46 ± 0.074
IlePro: 3.112 ± 0.051
IleGln: 2.483 ± 0.059
IleArg: 3.554 ± 0.049
IleSer: 4.917 ± 0.072
IleThr: 4.03 ± 0.061
IleVal: 3.886 ± 0.059
IleTrp: 0.808 ± 0.026
IleTyr: 1.977 ± 0.04
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.292 ± 0.07
LysCys: 0.329 ± 0.015
LysAsp: 2.354 ± 0.047
LysGlu: 3.185 ± 0.058
LysPhe: 1.42 ± 0.035
LysGly: 3.355 ± 0.068
LysHis: 1.212 ± 0.036
LysIle: 3.592 ± 0.055
LysLys: 3.225 ± 0.065
LysLeu: 5.235 ± 0.083
LysMet: 1.405 ± 0.037
LysAsn: 2.828 ± 0.064
LysPro: 2.301 ± 0.04
LysGln: 2.746 ± 0.045
LysArg: 2.931 ± 0.053
LysSer: 3.059 ± 0.056
LysThr: 3.077 ± 0.051
LysVal: 3.22 ± 0.06
LysTrp: 0.589 ± 0.024
LysTyr: 1.515 ± 0.039
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.488 ± 0.139
LeuCys: 1.229 ± 0.034
LeuAsp: 5.369 ± 0.08
LeuGlu: 5.73 ± 0.084
LeuPhe: 4.277 ± 0.066
LeuGly: 6.429 ± 0.079
LeuHis: 2.334 ± 0.046
LeuIle: 6.502 ± 0.08
LeuLys: 5.426 ± 0.085
LeuLeu: 11.186 ± 0.151
LeuMet: 2.676 ± 0.057
LeuAsn: 5.018 ± 0.074
LeuPro: 5.505 ± 0.083
LeuGln: 4.174 ± 0.073
LeuArg: 5.468 ± 0.074
LeuSer: 8.364 ± 0.089
LeuThr: 6.238 ± 0.1
LeuVal: 6.143 ± 0.084
LeuTrp: 1.257 ± 0.037
LeuTyr: 2.941 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.4 ± 0.051
MetCys: 0.219 ± 0.014
MetAsp: 1.141 ± 0.035
MetGlu: 1.344 ± 0.032
MetPhe: 0.813 ± 0.026
MetGly: 1.685 ± 0.04
MetHis: 0.416 ± 0.018
MetIle: 1.595 ± 0.038
MetLys: 1.492 ± 0.036
MetLeu: 2.851 ± 0.049
MetMet: 0.807 ± 0.029
MetAsn: 1.178 ± 0.03
MetPro: 1.272 ± 0.033
MetGln: 1.089 ± 0.031
MetArg: 1.274 ± 0.035
MetSer: 1.8 ± 0.042
MetThr: 1.71 ± 0.042
MetVal: 1.657 ± 0.041
MetTrp: 0.241 ± 0.014
MetTyr: 0.557 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.456 ± 0.053
AsnCys: 0.396 ± 0.018
AsnAsp: 2.262 ± 0.053
AsnGlu: 2.502 ± 0.045
AsnPhe: 1.533 ± 0.04
AsnGly: 3.156 ± 0.074
AsnHis: 1.151 ± 0.038
AsnIle: 3.319 ± 0.071
AsnLys: 2.504 ± 0.059
AsnLeu: 3.964 ± 0.07
AsnMet: 1.085 ± 0.031
AsnAsn: 2.256 ± 0.057
AsnPro: 2.231 ± 0.039
AsnGln: 2.176 ± 0.048
AsnArg: 2.383 ± 0.045
AsnSer: 2.598 ± 0.052
AsnThr: 2.48 ± 0.046
AsnVal: 2.525 ± 0.057
AsnTrp: 0.589 ± 0.024
AsnTyr: 1.511 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.697 ± 0.068
ProCys: 0.393 ± 0.02
ProAsp: 2.917 ± 0.056
ProGlu: 3.609 ± 0.059
ProPhe: 1.781 ± 0.034
ProGly: 2.436 ± 0.05
ProHis: 1.034 ± 0.03
ProIle: 2.46 ± 0.045
ProLys: 1.898 ± 0.037
ProLeu: 4.66 ± 0.09
ProMet: 0.963 ± 0.028
ProAsn: 1.631 ± 0.037
ProPro: 1.534 ± 0.039
ProGln: 1.877 ± 0.04
ProArg: 1.767 ± 0.041
ProSer: 2.318 ± 0.043
ProThr: 2.15 ± 0.043
ProVal: 3.589 ± 0.065
ProTrp: 0.574 ± 0.025
ProTyr: 1.331 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.128 ± 0.068
GlnCys: 0.517 ± 0.024
GlnAsp: 1.983 ± 0.043
GlnGlu: 2.509 ± 0.055
GlnPhe: 1.711 ± 0.037
GlnGly: 2.995 ± 0.048
GlnHis: 1.31 ± 0.033
GlnIle: 2.815 ± 0.053
GlnLys: 2.454 ± 0.047
GlnLeu: 5.288 ± 0.1
GlnMet: 1.073 ± 0.028
GlnAsn: 1.792 ± 0.04
GlnPro: 2.125 ± 0.062
GlnGln: 3.468 ± 0.091
GlnArg: 2.822 ± 0.058
GlnSer: 2.759 ± 0.05
GlnThr: 2.295 ± 0.055
GlnVal: 2.889 ± 0.057
GlnTrp: 0.776 ± 0.028
GlnTyr: 1.516 ± 0.039
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.618 ± 0.053
ArgCys: 0.593 ± 0.026
ArgAsp: 2.633 ± 0.048
ArgGlu: 3.496 ± 0.061
ArgPhe: 2.408 ± 0.046
ArgGly: 3.085 ± 0.055
ArgHis: 1.668 ± 0.046
ArgIle: 3.7 ± 0.059
ArgLys: 2.741 ± 0.051
ArgLeu: 6.167 ± 0.093
ArgMet: 1.343 ± 0.032
ArgAsn: 2.255 ± 0.046
ArgPro: 2.082 ± 0.05
ArgGln: 3.087 ± 0.058
ArgArg: 3.146 ± 0.064
ArgSer: 2.812 ± 0.048
ArgThr: 2.474 ± 0.048
ArgVal: 3.213 ± 0.052
ArgTrp: 0.93 ± 0.028
ArgTyr: 2.087 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.135 ± 0.074
SerCys: 0.687 ± 0.027
SerAsp: 3.389 ± 0.052
SerGlu: 3.731 ± 0.068
SerPhe: 2.513 ± 0.05
SerGly: 5.142 ± 0.07
SerHis: 1.707 ± 0.042
SerIle: 4.02 ± 0.063
SerLys: 2.706 ± 0.084
SerLeu: 7.121 ± 0.086
SerMet: 1.546 ± 0.042
SerAsn: 2.354 ± 0.05
SerPro: 2.919 ± 0.053
SerGln: 3.04 ± 0.051
SerArg: 3.494 ± 0.06
SerSer: 4.018 ± 0.075
SerThr: 3.078 ± 0.053
SerVal: 4.311 ± 0.054
SerTrp: 0.888 ± 0.03
SerTyr: 1.953 ± 0.044
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.387 ± 0.072
ThrCys: 0.518 ± 0.02
ThrAsp: 2.838 ± 0.057
ThrGlu: 3.284 ± 0.058
ThrPhe: 1.978 ± 0.038
ThrGly: 4.19 ± 0.069
ThrHis: 1.365 ± 0.037
ThrIle: 3.208 ± 0.055
ThrLys: 2.122 ± 0.049
ThrLeu: 6.757 ± 0.08
ThrMet: 1.069 ± 0.029
ThrAsn: 1.786 ± 0.044
ThrPro: 2.971 ± 0.046
ThrGln: 2.436 ± 0.053
ThrArg: 2.789 ± 0.056
ThrSer: 2.994 ± 0.055
ThrThr: 2.85 ± 0.062
ThrVal: 3.725 ± 0.053
ThrTrp: 0.609 ± 0.024
ThrTyr: 1.534 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.204 ± 0.069
ValCys: 0.721 ± 0.027
ValAsp: 3.305 ± 0.057
ValGlu: 3.697 ± 0.061
ValPhe: 2.567 ± 0.049
ValGly: 4.072 ± 0.067
ValHis: 1.255 ± 0.025
ValIle: 4.737 ± 0.066
ValLys: 3.248 ± 0.063
ValLeu: 6.333 ± 0.081
ValMet: 1.892 ± 0.042
ValAsn: 2.763 ± 0.061
ValPro: 2.625 ± 0.059
ValGln: 2.226 ± 0.048
ValArg: 3.154 ± 0.056
ValSer: 4.518 ± 0.06
ValThr: 3.796 ± 0.07
ValVal: 4.282 ± 0.074
ValTrp: 0.791 ± 0.026
ValTyr: 1.79 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.837 ± 0.027
TrpCys: 0.186 ± 0.013
TrpAsp: 0.62 ± 0.024
TrpGlu: 0.685 ± 0.029
TrpPhe: 0.598 ± 0.021
TrpGly: 0.771 ± 0.027
TrpHis: 0.444 ± 0.018
TrpIle: 0.75 ± 0.029
TrpLys: 0.625 ± 0.024
TrpLeu: 2.042 ± 0.04
TrpMet: 0.352 ± 0.016
TrpAsn: 0.519 ± 0.022
TrpPro: 0.561 ± 0.022
TrpGln: 1.12 ± 0.035
TrpArg: 0.892 ± 0.031
TrpSer: 0.863 ± 0.029
TrpThr: 0.477 ± 0.021
TrpVal: 0.824 ± 0.025
TrpTrp: 0.181 ± 0.011
TrpTyr: 0.362 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.338 ± 0.048
TyrCys: 0.471 ± 0.019
TyrAsp: 1.672 ± 0.048
TyrGlu: 1.53 ± 0.039
TyrPhe: 1.42 ± 0.037
TyrGly: 2.212 ± 0.049
TyrHis: 0.987 ± 0.029
TyrIle: 1.842 ± 0.041
TyrLys: 1.3 ± 0.032
TyrLeu: 3.637 ± 0.058
TyrMet: 0.709 ± 0.025
TyrAsn: 1.235 ± 0.037
TyrPro: 1.586 ± 0.041
TyrGln: 1.954 ± 0.049
TyrArg: 2.131 ± 0.042
TyrSer: 2.161 ± 0.047
TyrThr: 1.57 ± 0.04
TyrVal: 1.716 ± 0.04
TyrTrp: 0.51 ± 0.021
TyrTyr: 1.066 ± 0.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.001 ± 0.001
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.019 ± 0.012
Statistics based on 4439 proteins (1253860 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
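A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. This is a minimal sketch under stated assumptions, not the Proteome-pI implementation. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions; the page states only that 100 bootstrap replicates were drawn at the protein level.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
proteins = ["MKLLA", "AALL", "LLLL", "MKT"]
mean_ll, err_ll = bootstrap_per_mille(proteins, "LL")
print(round(mean_ll, 1), "±", round(err_ll, 1))
```

Resampling whole proteins keeps the pairs within a protein together, so the estimated error reflects variation between proteins rather than between individual positions.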

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski