Amino acid dipeptide frequency for Ascoidea rubescens DSM 1968

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
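As an illustration, a dipeptide frequency of this kind could be computed along the following lines. This is only a minimal sketch, not the procedure used by the database: it assumes one-letter sequence codes, counts overlapping pairs within each protein (never across protein boundaries), and maps every non-standard character to X (Xaa). The function name dipeptide_permille and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not real Ascoidea rubescens sequences.
toy = ["MKNNS", "NNLU"]
freqs = dipeptide_permille(toy)
```

Here the toy input yields seven dipeptides in total, two of which are NN, and the U in the second sequence is counted as X.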

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
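The arithmetic behind this baseline, and the over/under comparison it implies, can be sketched as follows. The cutoff used here is an assumption: it takes the uniform expectation of 2.5‰ as the threshold, while the exact colouring rule of the page is not stated. The function name representation is hypothetical; the three observed values are taken from the table.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

n_pairs = N_STANDARD ** 2              # 400 ordered pairs
n_pairs_xaa = N_WITH_XAA ** 2          # 441 ordered pairs including Xaa
expected_permille = 1000.0 / n_pairs   # 2.5 per mille, i.e. 0.25 %


def representation(observed_permille, expected=expected_permille):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
observed = {"AsnAsn": 18.394, "AlaAla": 2.589, "TrpTrp": 0.115}
labels = {pair: representation(value) for pair, value in observed.items()}
```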

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
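A short sketch of this conversion, using the AsnAsn value from the table. The step from a fraction to an approximate count rests on an assumption that is not confirmed by the page: that the denominator is the number of overlapping dipeptide positions, i.e. the number of amino acids minus the number of proteins, both taken from the statistics line of this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


N_PROTEINS = 6735       # from the statistics line of this page
N_RESIDUES = 3141677    # from the statistics line of this page

# Assumption: every protein of length L contributes L - 1 overlapping dipeptides.
n_dipeptides = N_RESIDUES - N_PROTEINS

asn_asn_fraction = permille_to_fraction(18.394)
# Rough estimate only; the database's exact denominator is not stated.
approx_asn_asn_count = round(asn_asn_fraction * n_dipeptides)
```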

For more information, see the sequence space article on Wikipedia.

Dipeptide (first residue, second residue): frequency ± error, in ‰
Ala
AlaAla: 2.589 ± 0.045
AlaCys: 0.505 ± 0.015
AlaAsp: 1.983 ± 0.031
AlaGlu: 2.118 ± 0.031
AlaPhe: 1.85 ± 0.029
AlaGly: 2.062 ± 0.039
AlaHis: 0.728 ± 0.016
AlaIle: 3.27 ± 0.043
AlaLys: 2.899 ± 0.035
AlaLeu: 3.955 ± 0.046
AlaMet: 0.677 ± 0.017
AlaAsn: 2.923 ± 0.033
AlaPro: 1.59 ± 0.035
AlaGln: 1.421 ± 0.026
AlaArg: 1.539 ± 0.026
AlaSer: 3.718 ± 0.05
AlaThr: 2.212 ± 0.032
AlaVal: 2.111 ± 0.03
AlaTrp: 0.329 ± 0.01
AlaTyr: 1.28 ± 0.025
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.487 ± 0.014
CysCys: 0.283 ± 0.011
CysAsp: 0.601 ± 0.014
CysGlu: 0.543 ± 0.013
CysPhe: 0.779 ± 0.017
CysGly: 0.648 ± 0.016
CysHis: 0.244 ± 0.01
CysIle: 0.815 ± 0.017
CysLys: 0.675 ± 0.019
CysLeu: 1.267 ± 0.021
CysMet: 0.158 ± 0.007
CysAsn: 0.679 ± 0.017
CysPro: 0.399 ± 0.012
CysGln: 0.356 ± 0.012
CysArg: 0.365 ± 0.011
CysSer: 1.004 ± 0.021
CysThr: 0.431 ± 0.014
CysVal: 0.527 ± 0.014
CysTrp: 0.131 ± 0.006
CysTyr: 0.475 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.038 ± 0.031
AspCys: 0.605 ± 0.014
AspAsp: 5.045 ± 0.07
AspGlu: 4.167 ± 0.049
AspPhe: 3.194 ± 0.032
AspGly: 2.252 ± 0.031
AspHis: 1.062 ± 0.021
AspIle: 4.823 ± 0.045
AspLys: 4.065 ± 0.046
AspLeu: 6.1 ± 0.048
AspMet: 0.732 ± 0.016
AspAsn: 5.873 ± 0.074
AspPro: 2.196 ± 0.03
AspGln: 2.028 ± 0.026
AspArg: 1.605 ± 0.027
AspSer: 5.458 ± 0.065
AspThr: 2.424 ± 0.028
AspVal: 2.425 ± 0.031
AspTrp: 0.49 ± 0.014
AspTyr: 2.604 ± 0.034
AspXaa: 0.001 ± 0.001
Glu
GluAla: 2.185 ± 0.033
GluCys: 0.472 ± 0.015
GluAsp: 3.35 ± 0.047
GluGlu: 4.261 ± 0.058
GluPhe: 2.741 ± 0.03
GluGly: 1.731 ± 0.028
GluHis: 0.803 ± 0.014
GluIle: 5.266 ± 0.057
GluLys: 5.679 ± 0.063
GluLeu: 5.413 ± 0.054
GluMet: 1.036 ± 0.018
GluAsn: 6.056 ± 0.072
GluPro: 1.365 ± 0.025
GluGln: 1.619 ± 0.026
GluArg: 2.105 ± 0.029
GluSer: 4.242 ± 0.042
GluThr: 2.703 ± 0.033
GluVal: 2.367 ± 0.033
GluTrp: 0.444 ± 0.012
GluTyr: 2.012 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.002 ± 0.032
PheCys: 0.59 ± 0.015
PheAsp: 3.231 ± 0.039
PheGlu: 2.897 ± 0.033
PhePhe: 2.772 ± 0.039
PheGly: 2.339 ± 0.031
PheHis: 0.976 ± 0.016
PheIle: 3.777 ± 0.045
PheLys: 4.183 ± 0.039
PheLeu: 5.007 ± 0.055
PheMet: 0.702 ± 0.015
PheAsn: 4.984 ± 0.049
PhePro: 1.798 ± 0.025
PheGln: 2.043 ± 0.029
PheArg: 1.447 ± 0.023
PheSer: 4.613 ± 0.045
PheThr: 2.256 ± 0.029
PheVal: 2.209 ± 0.029
PheTrp: 0.464 ± 0.014
PheTyr: 1.899 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.895 ± 0.033
GlyCys: 0.585 ± 0.015
GlyAsp: 2.136 ± 0.036
GlyGlu: 2.013 ± 0.029
GlyPhe: 2.305 ± 0.035
GlyGly: 2.409 ± 0.042
GlyHis: 0.752 ± 0.021
GlyIle: 3.207 ± 0.045
GlyLys: 2.95 ± 0.035
GlyLeu: 3.915 ± 0.04
GlyMet: 0.658 ± 0.016
GlyAsn: 3.058 ± 0.055
GlyPro: 1.185 ± 0.021
GlyGln: 1.05 ± 0.022
GlyArg: 1.441 ± 0.031
GlySer: 3.568 ± 0.038
GlyThr: 1.926 ± 0.028
GlyVal: 2.174 ± 0.032
GlyTrp: 0.489 ± 0.015
GlyTyr: 1.734 ± 0.026
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.605 ± 0.014
HisCys: 0.258 ± 0.011
HisAsp: 1.005 ± 0.02
HisGlu: 0.882 ± 0.019
HisPhe: 0.975 ± 0.019
HisGly: 0.754 ± 0.015
HisHis: 0.618 ± 0.02
HisIle: 1.363 ± 0.02
HisLys: 1.246 ± 0.021
HisLeu: 2.058 ± 0.028
HisMet: 0.253 ± 0.009
HisAsn: 1.825 ± 0.035
HisPro: 0.924 ± 0.019
HisGln: 0.868 ± 0.018
HisArg: 0.722 ± 0.017
HisSer: 1.937 ± 0.03
HisThr: 0.829 ± 0.016
HisVal: 0.682 ± 0.014
HisTrp: 0.159 ± 0.007
HisTyr: 0.8 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.186 ± 0.038
IleCys: 0.949 ± 0.018
IleAsp: 5.532 ± 0.044
IleGlu: 4.959 ± 0.043
IlePhe: 3.775 ± 0.044
IleGly: 3.246 ± 0.039
IleHis: 1.535 ± 0.024
IleIle: 6.303 ± 0.056
IleLys: 6.815 ± 0.055
IleLeu: 7.722 ± 0.069
IleMet: 1.125 ± 0.019
IleAsn: 8.812 ± 0.081
IlePro: 3.383 ± 0.034
IleGln: 3.055 ± 0.039
IleArg: 2.537 ± 0.025
IleSer: 7.8 ± 0.066
IleThr: 3.948 ± 0.04
IleVal: 3.464 ± 0.037
IleTrp: 0.655 ± 0.017
IleTyr: 2.825 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.903 ± 0.043
LysCys: 0.709 ± 0.016
LysAsp: 4.456 ± 0.051
LysGlu: 5.146 ± 0.058
LysPhe: 4.007 ± 0.043
LysGly: 2.398 ± 0.031
LysHis: 1.333 ± 0.021
LysIle: 7.55 ± 0.066
LysLys: 8.288 ± 0.078
LysLeu: 7.991 ± 0.066
LysMet: 1.238 ± 0.022
LysAsn: 8.434 ± 0.071
LysPro: 2.775 ± 0.045
LysGln: 2.686 ± 0.031
LysArg: 3.408 ± 0.037
LysSer: 6.918 ± 0.061
LysThr: 4.282 ± 0.044
LysVal: 3.357 ± 0.039
LysTrp: 0.601 ± 0.017
LysTyr: 2.941 ± 0.035
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.155 ± 0.052
LeuCys: 1.014 ± 0.02
LeuAsp: 5.412 ± 0.046
LeuGlu: 5.028 ± 0.052
LeuPhe: 4.81 ± 0.052
LeuGly: 3.572 ± 0.038
LeuHis: 1.64 ± 0.023
LeuIle: 8.378 ± 0.083
LeuLys: 8.988 ± 0.072
LeuLeu: 9.698 ± 0.084
LeuMet: 1.557 ± 0.024
LeuAsn: 10.131 ± 0.088
LeuPro: 4.081 ± 0.041
LeuGln: 3.414 ± 0.04
LeuArg: 3.435 ± 0.038
LeuSer: 9.97 ± 0.059
LeuThr: 4.891 ± 0.045
LeuVal: 4.418 ± 0.04
LeuTrp: 0.692 ± 0.017
LeuTyr: 3.156 ± 0.034
LeuXaa: 0.001 ± 0.0
Met
MetAla: 0.891 ± 0.018
MetCys: 0.169 ± 0.007
MetAsp: 0.873 ± 0.018
MetGlu: 0.858 ± 0.016
MetPhe: 0.663 ± 0.016
MetGly: 0.786 ± 0.018
MetHis: 0.196 ± 0.007
MetIle: 1.168 ± 0.019
MetLys: 1.179 ± 0.021
MetLeu: 1.232 ± 0.019
MetMet: 0.31 ± 0.01
MetAsn: 1.39 ± 0.036
MetPro: 0.547 ± 0.014
MetGln: 0.397 ± 0.014
MetArg: 0.498 ± 0.013
MetSer: 1.374 ± 0.021
MetThr: 0.721 ± 0.016
MetVal: 0.847 ± 0.018
MetTrp: 0.107 ± 0.006
MetTyr: 0.394 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.879 ± 0.034
AsnCys: 0.932 ± 0.022
AsnAsp: 7.12 ± 0.077
AsnGlu: 5.633 ± 0.072
AsnPhe: 4.719 ± 0.047
AsnGly: 3.573 ± 0.054
AsnHis: 2.412 ± 0.039
AsnIle: 7.649 ± 0.076
AsnLys: 7.415 ± 0.071
AsnLeu: 9.865 ± 0.096
AsnMet: 1.142 ± 0.036
AsnAsn: 18.394 ± 0.337
AsnPro: 3.571 ± 0.041
AsnGln: 4.722 ± 0.08
AsnArg: 2.516 ± 0.031
AsnSer: 11.027 ± 0.111
AsnThr: 5.029 ± 0.069
AsnVal: 3.212 ± 0.034
AsnTrp: 0.665 ± 0.016
AsnTyr: 4.204 ± 0.054
AsnXaa: 0.005 ± 0.001
Pro
ProAla: 1.587 ± 0.033
ProCys: 0.273 ± 0.011
ProAsp: 1.948 ± 0.026
ProGlu: 2.062 ± 0.029
ProPhe: 1.866 ± 0.025
ProGly: 1.391 ± 0.026
ProHis: 0.653 ± 0.016
ProIle: 3.353 ± 0.036
ProLys: 2.814 ± 0.038
ProLeu: 3.55 ± 0.038
ProMet: 0.514 ± 0.014
ProAsn: 3.491 ± 0.046
ProPro: 1.984 ± 0.052
ProGln: 1.509 ± 0.034
ProArg: 1.182 ± 0.02
ProSer: 3.772 ± 0.043
ProThr: 2.28 ± 0.034
ProVal: 1.883 ± 0.029
ProTrp: 0.276 ± 0.009
ProTyr: 1.315 ± 0.022
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 1.338 ± 0.027
GlnCys: 0.311 ± 0.01
GlnAsp: 1.621 ± 0.024
GlnGlu: 1.826 ± 0.027
GlnPhe: 1.87 ± 0.025
GlnGly: 0.987 ± 0.019
GlnHis: 0.653 ± 0.016
GlnIle: 3.214 ± 0.036
GlnLys: 3.095 ± 0.04
GlnLeu: 3.806 ± 0.044
GlnMet: 0.63 ± 0.017
GlnAsn: 4.376 ± 0.07
GlnPro: 1.445 ± 0.032
GlnGln: 2.224 ± 0.089
GlnArg: 1.406 ± 0.026
GlnSer: 3.225 ± 0.048
GlnThr: 1.773 ± 0.026
GlnVal: 1.54 ± 0.026
GlnTrp: 0.263 ± 0.009
GlnTyr: 1.232 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.489 ± 0.028
ArgCys: 0.415 ± 0.013
ArgAsp: 1.691 ± 0.025
ArgGlu: 1.881 ± 0.03
ArgPhe: 1.787 ± 0.023
ArgGly: 1.412 ± 0.03
ArgHis: 0.647 ± 0.014
ArgIle: 2.661 ± 0.03
ArgLys: 3.286 ± 0.037
ArgLeu: 3.357 ± 0.038
ArgMet: 0.568 ± 0.014
ArgAsn: 2.839 ± 0.032
ArgPro: 1.157 ± 0.02
ArgGln: 1.129 ± 0.021
ArgArg: 1.874 ± 0.033
ArgSer: 2.895 ± 0.034
ArgThr: 1.572 ± 0.022
ArgVal: 1.555 ± 0.023
ArgTrp: 0.308 ± 0.01
ArgTyr: 1.313 ± 0.02
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.553 ± 0.045
SerCys: 0.952 ± 0.023
SerAsp: 5.025 ± 0.057
SerGlu: 4.144 ± 0.043
SerPhe: 4.902 ± 0.044
SerGly: 3.45 ± 0.041
SerHis: 1.798 ± 0.025
SerIle: 8.284 ± 0.062
SerLys: 7.618 ± 0.057
SerLeu: 9.447 ± 0.067
SerMet: 1.279 ± 0.02
SerAsn: 10.979 ± 0.111
SerPro: 3.6 ± 0.055
SerGln: 3.356 ± 0.05
SerArg: 3.109 ± 0.037
SerSer: 11.85 ± 0.123
SerThr: 5.683 ± 0.061
SerVal: 3.775 ± 0.037
SerTrp: 0.672 ± 0.016
SerTyr: 3.153 ± 0.037
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 2.29 ± 0.031
ThrCys: 0.529 ± 0.014
ThrAsp: 2.716 ± 0.038
ThrGlu: 2.391 ± 0.03
ThrPhe: 2.382 ± 0.03
ThrGly: 2.29 ± 0.033
ThrHis: 0.942 ± 0.02
ThrIle: 4.116 ± 0.041
ThrLys: 3.677 ± 0.041
ThrLeu: 4.637 ± 0.04
ThrMet: 0.684 ± 0.015
ThrAsn: 5.097 ± 0.062
ThrPro: 2.295 ± 0.037
ThrGln: 1.707 ± 0.029
ThrArg: 1.653 ± 0.025
ThrSer: 5.098 ± 0.056
ThrThr: 3.424 ± 0.053
ThrVal: 2.33 ± 0.029
ThrTrp: 0.38 ± 0.011
ThrTyr: 1.634 ± 0.025
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.103 ± 0.032
ValCys: 0.587 ± 0.015
ValAsp: 2.659 ± 0.031
ValGlu: 2.686 ± 0.039
ValPhe: 2.299 ± 0.03
ValGly: 2.145 ± 0.03
ValHis: 0.804 ± 0.018
ValIle: 3.225 ± 0.034
ValLys: 3.16 ± 0.036
ValLeu: 4.438 ± 0.045
ValMet: 0.675 ± 0.014
ValAsn: 3.118 ± 0.036
ValPro: 1.832 ± 0.026
ValGln: 1.456 ± 0.023
ValArg: 1.479 ± 0.025
ValSer: 4.033 ± 0.042
ValThr: 2.015 ± 0.03
ValVal: 2.36 ± 0.037
ValTrp: 0.393 ± 0.013
ValTyr: 1.617 ± 0.025
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.383 ± 0.012
TrpCys: 0.164 ± 0.009
TrpAsp: 0.496 ± 0.015
TrpGlu: 0.444 ± 0.012
TrpPhe: 0.463 ± 0.013
TrpGly: 0.4 ± 0.012
TrpHis: 0.134 ± 0.007
TrpIle: 0.61 ± 0.015
TrpLys: 0.653 ± 0.015
TrpLeu: 0.835 ± 0.021
TrpMet: 0.149 ± 0.006
TrpAsn: 0.585 ± 0.016
TrpPro: 0.226 ± 0.008
TrpGln: 0.221 ± 0.009
TrpArg: 0.364 ± 0.011
TrpSer: 0.681 ± 0.016
TrpThr: 0.348 ± 0.011
TrpVal: 0.427 ± 0.011
TrpTrp: 0.115 ± 0.007
TrpTyr: 0.3 ± 0.01
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.2 ± 0.021
TyrCys: 0.565 ± 0.013
TyrAsp: 2.283 ± 0.033
TyrGlu: 1.911 ± 0.024
TyrPhe: 2.031 ± 0.03
TyrGly: 1.603 ± 0.026
TyrHis: 0.833 ± 0.018
TyrIle: 2.594 ± 0.028
TyrLys: 2.734 ± 0.034
TyrLeu: 4.149 ± 0.042
TyrMet: 0.515 ± 0.013
TyrAsn: 3.636 ± 0.042
TyrPro: 1.352 ± 0.021
TyrGln: 1.573 ± 0.026
TyrArg: 1.152 ± 0.019
TyrSer: 3.447 ± 0.037
TyrThr: 1.522 ± 0.024
TyrVal: 1.422 ± 0.023
TyrTrp: 0.365 ± 0.01
TyrTyr: 1.892 ± 0.031
TyrXaa: 0.001 ± 0.001
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.002 ± 0.001
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.0 ± 0.0
XaaMet: 0.001 ± 0.0
XaaAsn: 0.004 ± 0.001
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.013 ± 0.007
Statistics based on 6735 proteins (3141677 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
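Protein-level bootstrapping of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values serves as the error. Several details are assumptions rather than facts from this page: that the reported error is a standard deviation, that "×100" means 100 resamples, and that pairs are counted as overlapping within each protein. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only.
toy = ["MKNNSLL", "NNLKNN", "ASDFNN", "MLLKS"]
point = pair_permille(toy, "NN")
error = bootstrap_error(toy, "NN")
```

Resampling proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins.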

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski