Amino acid dipeptide frequency for Paenibacillus piri

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
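A minimal sketch of how such frequencies could be counted is shown here. It is not the pipeline used to build this page: the function name `dipeptide_frequencies`, the use of one-letter codes, counting overlapping pairs only within a protein, and mapping every non-standard letter to X (Xaa) are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumptions (not confirmed by the source): dipeptides are overlapping
    adjacent pairs, pairs never span two proteins, and any non-standard
    letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Paenibacillus piri sequences.
freqs = dipeptide_frequencies(["MAAL", "AALB"])
```

In the toy input, the letter B is not a standard amino acid, so the last pair of the second sequence is counted as LX.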

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
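The comparison against the uniform expectation can be sketched as follows. The baseline of 1/400 follows from the text; the function name `classify` and the simple greater-than/less-than rule are assumptions, since the page does not state the exact threshold or colour scale it uses.

```python
N_STANDARD = 20
# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / N_STANDARD**2


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # corresponds to red on the page
    if freq_per_mille < expected:
        return "under"     # corresponds to blue on the page
    return "expected"


# Frequencies (per mille) copied from the table on this page.
examples = {"LeuLeu": 11.154, "AlaAla": 10.215, "CysCys": 0.124, "TrpTrp": 0.189}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With Xaa included, the same calculation would use 21 * 21 = 441 possible dipeptides, which lowers the baseline slightly.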

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
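As a small worked example of the unit conversion, the snippet below turns one table value into a fraction and a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 10.215  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```

Here `fraction` is about 0.0102 and `percent` about 1.02, roughly four times the 0.25% expected for a uniformly random dipeptide.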

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid. Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.215 ± 0.121
AlaCys: 0.802 ± 0.021
AlaAsp: 4.55 ± 0.047
AlaGlu: 5.525 ± 0.062
AlaPhe: 3.385 ± 0.044
AlaGly: 7.242 ± 0.075
AlaHis: 1.434 ± 0.028
AlaIle: 5.293 ± 0.059
AlaLys: 4.408 ± 0.05
AlaLeu: 8.421 ± 0.078
AlaMet: 2.416 ± 0.032
AlaAsn: 2.839 ± 0.037
AlaPro: 2.972 ± 0.048
AlaGln: 2.893 ± 0.041
AlaArg: 3.655 ± 0.05
AlaSer: 5.106 ± 0.056
AlaThr: 3.502 ± 0.136
AlaVal: 6.939 ± 0.062
AlaTrp: 0.97 ± 0.021
AlaTyr: 2.777 ± 0.039
AlaXaa: 0.001 ± 0.001
Cys
CysAla: 0.598 ± 0.016
CysCys: 0.124 ± 0.008
CysAsp: 0.424 ± 0.013
CysGlu: 0.442 ± 0.016
CysPhe: 0.336 ± 0.013
CysGly: 0.839 ± 0.02
CysHis: 0.219 ± 0.013
CysIle: 0.551 ± 0.019
CysLys: 0.327 ± 0.012
CysLeu: 0.825 ± 0.02
CysMet: 0.222 ± 0.01
CysAsn: 0.271 ± 0.011
CysPro: 0.381 ± 0.017
CysGln: 0.225 ± 0.01
CysArg: 0.533 ± 0.017
CysSer: 0.673 ± 0.017
CysThr: 0.443 ± 0.015
CysVal: 0.48 ± 0.018
CysTrp: 0.113 ± 0.007
CysTyr: 0.295 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.155 ± 0.052
AspCys: 0.418 ± 0.015
AspAsp: 2.467 ± 0.037
AspGlu: 3.582 ± 0.049
AspPhe: 2.074 ± 0.033
AspGly: 4.151 ± 0.051
AspHis: 1.131 ± 0.027
AspIle: 3.43 ± 0.049
AspLys: 2.807 ± 0.041
AspLeu: 4.627 ± 0.051
AspMet: 1.427 ± 0.025
AspAsn: 1.76 ± 0.034
AspPro: 2.48 ± 0.039
AspGln: 1.796 ± 0.028
AspArg: 2.887 ± 0.038
AspSer: 2.768 ± 0.035
AspThr: 2.446 ± 0.035
AspVal: 3.437 ± 0.041
AspTrp: 0.88 ± 0.021
AspTyr: 2.08 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.081 ± 0.062
GluCys: 0.398 ± 0.015
GluAsp: 2.627 ± 0.037
GluGlu: 4.415 ± 0.058
GluPhe: 2.102 ± 0.034
GluGly: 3.892 ± 0.043
GluHis: 1.541 ± 0.028
GluIle: 3.937 ± 0.044
GluLys: 3.649 ± 0.049
GluLeu: 7.151 ± 0.075
GluMet: 1.82 ± 0.031
GluAsn: 2.168 ± 0.029
GluPro: 2.467 ± 0.035
GluGln: 3.736 ± 0.056
GluArg: 3.973 ± 0.058
GluSer: 3.333 ± 0.043
GluThr: 3.289 ± 0.034
GluVal: 3.854 ± 0.052
GluTrp: 0.912 ± 0.023
GluTyr: 1.84 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.401 ± 0.038
PheCys: 0.374 ± 0.014
PheAsp: 2.358 ± 0.034
PheGlu: 2.347 ± 0.036
PhePhe: 1.939 ± 0.035
PheGly: 3.202 ± 0.043
PheHis: 0.951 ± 0.019
PheIle: 3.113 ± 0.049
PheLys: 2.121 ± 0.036
PheLeu: 3.849 ± 0.062
PheMet: 1.188 ± 0.026
PheAsn: 1.654 ± 0.028
PhePro: 1.696 ± 0.03
PheGln: 1.55 ± 0.029
PheArg: 2.055 ± 0.031
PheSer: 2.811 ± 0.041
PheThr: 2.492 ± 0.035
PheVal: 2.959 ± 0.043
PheTrp: 0.536 ± 0.018
PheTyr: 1.532 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.859 ± 0.167
GlyCys: 0.87 ± 0.02
GlyAsp: 3.461 ± 0.049
GlyGlu: 4.201 ± 0.044
GlyPhe: 3.249 ± 0.043
GlyGly: 5.802 ± 0.078
GlyHis: 1.592 ± 0.029
GlyIle: 5.7 ± 0.061
GlyLys: 4.651 ± 0.056
GlyLeu: 7.334 ± 0.06
GlyMet: 2.364 ± 0.036
GlyAsn: 2.618 ± 0.052
GlyPro: 2.113 ± 0.032
GlyGln: 2.804 ± 0.039
GlyArg: 3.626 ± 0.05
GlySer: 5.005 ± 0.057
GlyThr: 4.384 ± 0.059
GlyVal: 5.076 ± 0.052
GlyTrp: 1.184 ± 0.024
GlyTyr: 2.916 ± 0.036
GlyXaa: 0.0 ± 0.001
His
HisAla: 1.68 ± 0.034
HisCys: 0.218 ± 0.01
HisAsp: 1.075 ± 0.025
HisGlu: 1.257 ± 0.024
HisPhe: 1.086 ± 0.022
HisGly: 1.555 ± 0.029
HisHis: 0.615 ± 0.016
HisIle: 1.529 ± 0.024
HisLys: 0.897 ± 0.018
HisLeu: 2.058 ± 0.035
HisMet: 0.616 ± 0.016
HisAsn: 0.695 ± 0.017
HisPro: 1.248 ± 0.026
HisGln: 0.793 ± 0.017
HisArg: 1.133 ± 0.024
HisSer: 1.194 ± 0.029
HisThr: 1.042 ± 0.022
HisVal: 1.441 ± 0.028
HisTrp: 0.356 ± 0.012
HisTyr: 0.908 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.004 ± 0.055
IleCys: 0.649 ± 0.016
IleAsp: 3.738 ± 0.047
IleGlu: 4.22 ± 0.047
IlePhe: 2.462 ± 0.04
IleGly: 5.513 ± 0.061
IleHis: 1.51 ± 0.025
IleIle: 4.278 ± 0.066
IleLys: 2.974 ± 0.042
IleLeu: 5.489 ± 0.066
IleMet: 1.722 ± 0.03
IleAsn: 2.457 ± 0.036
IlePro: 3.265 ± 0.042
IleGln: 2.478 ± 0.039
IleArg: 4.053 ± 0.048
IleSer: 4.296 ± 0.049
IleThr: 3.528 ± 0.041
IleVal: 5.282 ± 0.062
IleTrp: 0.771 ± 0.02
IleTyr: 2.108 ± 0.034
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.352 ± 0.055
LysCys: 0.233 ± 0.01
LysAsp: 2.915 ± 0.045
LysGlu: 4.091 ± 0.049
LysPhe: 1.647 ± 0.032
LysGly: 3.571 ± 0.04
LysHis: 1.171 ± 0.025
LysIle: 3.093 ± 0.044
LysLys: 3.373 ± 0.058
LysLeu: 5.582 ± 0.062
LysMet: 1.653 ± 0.03
LysAsn: 2.123 ± 0.037
LysPro: 2.525 ± 0.035
LysGln: 2.744 ± 0.041
LysArg: 2.874 ± 0.034
LysSer: 3.046 ± 0.037
LysThr: 2.83 ± 0.036
LysVal: 3.561 ± 0.04
LysTrp: 0.733 ± 0.02
LysTyr: 1.819 ± 0.03
LysXaa: 0.0 ± 0.001
Leu
LeuAla: 8.232 ± 0.074
LeuCys: 0.879 ± 0.023
LeuAsp: 5.054 ± 0.053
LeuGlu: 6.045 ± 0.057
LeuPhe: 4.66 ± 0.067
LeuGly: 6.539 ± 0.068
LeuHis: 2.172 ± 0.032
LeuIle: 6.551 ± 0.065
LeuLys: 5.461 ± 0.06
LeuLeu: 11.154 ± 0.112
LeuMet: 2.6 ± 0.033
LeuAsn: 3.944 ± 0.047
LeuPro: 4.737 ± 0.049
LeuGln: 4.455 ± 0.054
LeuArg: 4.991 ± 0.055
LeuSer: 6.879 ± 0.056
LeuThr: 5.545 ± 0.051
LeuVal: 6.166 ± 0.056
LeuTrp: 1.101 ± 0.025
LeuTyr: 3.23 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.325 ± 0.033
MetCys: 0.162 ± 0.009
MetAsp: 1.559 ± 0.026
MetGlu: 1.891 ± 0.031
MetPhe: 1.124 ± 0.026
MetGly: 1.727 ± 0.027
MetHis: 0.488 ± 0.013
MetIle: 2.007 ± 0.032
MetLys: 2.047 ± 0.033
MetLeu: 3.012 ± 0.038
MetMet: 0.946 ± 0.023
MetAsn: 1.473 ± 0.027
MetPro: 1.211 ± 0.024
MetGln: 1.024 ± 0.02
MetArg: 1.4 ± 0.025
MetSer: 1.771 ± 0.029
MetThr: 1.635 ± 0.024
MetVal: 1.807 ± 0.03
MetTrp: 0.27 ± 0.012
MetTyr: 0.833 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.869 ± 0.04
AsnCys: 0.277 ± 0.012
AsnAsp: 1.949 ± 0.034
AsnGlu: 2.492 ± 0.034
AsnPhe: 1.376 ± 0.025
AsnGly: 3.43 ± 0.059
AsnHis: 0.813 ± 0.02
AsnIle: 2.374 ± 0.034
AsnLys: 2.113 ± 0.034
AsnLeu: 3.192 ± 0.036
AsnMet: 1.053 ± 0.023
AsnAsn: 1.559 ± 0.032
AsnPro: 2.056 ± 0.03
AsnGln: 1.474 ± 0.027
AsnArg: 2.234 ± 0.03
AsnSer: 2.088 ± 0.037
AsnThr: 1.953 ± 0.035
AsnVal: 2.624 ± 0.04
AsnTrp: 0.522 ± 0.016
AsnTyr: 1.391 ± 0.025
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 3.849 ± 0.055
ProCys: 0.252 ± 0.01
ProAsp: 2.804 ± 0.041
ProGlu: 3.238 ± 0.05
ProPhe: 2.096 ± 0.03
ProGly: 3.296 ± 0.046
ProHis: 0.995 ± 0.022
ProIle: 2.538 ± 0.041
ProLys: 1.927 ± 0.03
ProLeu: 4.21 ± 0.049
ProMet: 1.1 ± 0.023
ProAsn: 1.649 ± 0.033
ProPro: 1.425 ± 0.03
ProGln: 1.577 ± 0.027
ProArg: 1.491 ± 0.029
ProSer: 2.498 ± 0.04
ProThr: 1.875 ± 0.031
ProVal: 3.529 ± 0.042
ProTrp: 0.555 ± 0.015
ProTyr: 1.569 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.666 ± 0.05
GlnCys: 0.264 ± 0.012
GlnAsp: 1.651 ± 0.027
GlnGlu: 2.422 ± 0.044
GlnPhe: 1.708 ± 0.031
GlnGly: 2.614 ± 0.036
GlnHis: 0.823 ± 0.022
GlnIle: 2.584 ± 0.033
GlnLys: 2.037 ± 0.035
GlnLeu: 4.59 ± 0.061
GlnMet: 1.209 ± 0.028
GlnAsn: 1.358 ± 0.023
GlnPro: 1.797 ± 0.035
GlnGln: 2.007 ± 0.039
GlnArg: 1.946 ± 0.033
GlnSer: 2.504 ± 0.035
GlnThr: 2.111 ± 0.032
GlnVal: 2.596 ± 0.037
GlnTrp: 0.586 ± 0.017
GlnTyr: 1.337 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.403 ± 0.043
ArgCys: 0.433 ± 0.015
ArgAsp: 2.366 ± 0.035
ArgGlu: 3.502 ± 0.05
ArgPhe: 2.385 ± 0.034
ArgGly: 3.033 ± 0.042
ArgHis: 1.198 ± 0.024
ArgIle: 3.828 ± 0.043
ArgLys: 3.047 ± 0.041
ArgLeu: 5.688 ± 0.063
ArgMet: 1.777 ± 0.032
ArgAsn: 2.078 ± 0.029
ArgPro: 1.852 ± 0.031
ArgGln: 2.402 ± 0.039
ArgArg: 2.808 ± 0.044
ArgSer: 3.273 ± 0.04
ArgThr: 2.708 ± 0.038
ArgVal: 2.997 ± 0.037
ArgTrp: 0.766 ± 0.021
ArgTyr: 1.886 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.957 ± 0.053
SerCys: 0.49 ± 0.015
SerAsp: 3.01 ± 0.041
SerGlu: 3.461 ± 0.043
SerPhe: 3.115 ± 0.041
SerGly: 5.639 ± 0.064
SerHis: 1.202 ± 0.022
SerIle: 4.263 ± 0.053
SerLys: 3.147 ± 0.044
SerLeu: 6.298 ± 0.06
SerMet: 1.861 ± 0.028
SerAsn: 2.252 ± 0.037
SerPro: 2.5 ± 0.036
SerGln: 1.981 ± 0.026
SerArg: 3.213 ± 0.041
SerSer: 4.035 ± 0.053
SerThr: 2.941 ± 0.039
SerVal: 4.607 ± 0.051
SerTrp: 0.838 ± 0.019
SerTyr: 2.27 ± 0.037
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.843 ± 0.053
ThrCys: 0.365 ± 0.013
ThrAsp: 2.842 ± 0.038
ThrGlu: 2.936 ± 0.039
ThrPhe: 2.258 ± 0.038
ThrGly: 4.668 ± 0.157
ThrHis: 0.953 ± 0.017
ThrIle: 3.669 ± 0.05
ThrLys: 2.419 ± 0.032
ThrLeu: 5.18 ± 0.047
ThrMet: 1.35 ± 0.025
ThrAsn: 1.906 ± 0.033
ThrPro: 2.466 ± 0.038
ThrGln: 1.44 ± 0.025
ThrArg: 2.231 ± 0.033
ThrSer: 3.006 ± 0.041
ThrThr: 2.6 ± 0.047
ThrVal: 4.614 ± 0.066
ThrTrp: 0.637 ± 0.02
ThrTyr: 1.851 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.203 ± 0.05
ValCys: 0.709 ± 0.024
ValAsp: 3.424 ± 0.046
ValGlu: 4.153 ± 0.043
ValPhe: 2.92 ± 0.045
ValGly: 4.551 ± 0.048
ValHis: 1.515 ± 0.025
ValIle: 4.91 ± 0.06
ValLys: 3.979 ± 0.053
ValLeu: 6.959 ± 0.051
ValMet: 1.995 ± 0.027
ValAsn: 2.826 ± 0.037
ValPro: 3.243 ± 0.045
ValGln: 2.757 ± 0.038
ValArg: 3.572 ± 0.037
ValSer: 4.768 ± 0.048
ValThr: 4.321 ± 0.066
ValVal: 4.88 ± 0.056
ValTrp: 0.892 ± 0.019
ValTyr: 2.499 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.867 ± 0.02
TrpCys: 0.097 ± 0.006
TrpAsp: 0.712 ± 0.02
TrpGlu: 0.78 ± 0.02
TrpPhe: 0.568 ± 0.015
TrpGly: 0.882 ± 0.021
TrpHis: 0.324 ± 0.012
TrpIle: 0.941 ± 0.021
TrpLys: 0.831 ± 0.019
TrpLeu: 1.467 ± 0.029
TrpMet: 0.464 ± 0.015
TrpAsn: 0.742 ± 0.02
TrpPro: 0.4 ± 0.014
TrpGln: 0.504 ± 0.016
TrpArg: 0.636 ± 0.016
TrpSer: 0.87 ± 0.02
TrpThr: 0.705 ± 0.02
TrpVal: 0.888 ± 0.019
TrpTrp: 0.189 ± 0.011
TrpTyr: 0.43 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.787 ± 0.037
TyrCys: 0.31 ± 0.012
TyrAsp: 1.816 ± 0.034
TyrGlu: 2.185 ± 0.029
TyrPhe: 1.638 ± 0.03
TyrGly: 2.662 ± 0.037
TyrHis: 0.749 ± 0.018
TyrIle: 2.21 ± 0.034
TyrLys: 1.722 ± 0.029
TyrLeu: 3.306 ± 0.043
TyrMet: 0.927 ± 0.023
TyrAsn: 1.468 ± 0.026
TyrPro: 1.639 ± 0.029
TyrGln: 1.172 ± 0.024
TyrArg: 2.118 ± 0.031
TyrSer: 2.155 ± 0.031
TyrThr: 1.884 ± 0.032
TyrVal: 2.337 ± 0.035
TyrTrp: 0.486 ± 0.015
TyrTyr: 1.348 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6802 proteins (2221892 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
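One way a protein-level bootstrap could be carried out is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the standard deviation as the reported error are assumptions; the source only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's per-mille frequency over protein resamples.

    Assumption: the error is the standard deviation of the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, same sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy sequences, not real Paenibacillus piri proteins.
toy = ["MAALAA", "MKKLLA", "MAAAAG", "MLLKAA"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps each protein's internal composition intact, so the estimate reflects variation between proteins.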

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski