Amino acid dipeptide frequency for Aureitalea marina

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
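The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build this page: it assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries), that sequences use one-letter codes, and that pairs containing a non-standard letter count towards the total but are not reported. The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation: 1000 per mille spread evenly over 400 dipeptides.
EXPECTED_UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein only (assumption, see lead-in).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD, repeat=2)}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "AALL"]
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM_PERMILLE)
print("over-represented in the toy data:", over)
```

With real data, each value would be compared against the 2.5‰ uniform expectation in the same way the red and blue marking in the table does.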

All values are given in per mille (‰); to obtain a fraction, multiply the value by 10^-3.
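As a small worked example of the unit conversion, the snippet below converts the AlaAla value from the table (5.072‰) into a fraction and a percentage, and compares it with the 2.5‰ expected under a uniform distribution over 400 dipeptides. The helper names are hypothetical and chosen only for illustration.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE


ala_ala = 5.072  # AlaAla value from the table, in per mille
print(f"fraction: {permille_to_fraction(ala_ala):.6f}")
print(f"percent:  {permille_to_fraction(ala_ala) * 100:.4f}")
print(f"fold over uniform expectation: {fold_vs_uniform(ala_ala):.2f}")
```

A ratio above 1 corresponds to an over-represented dipeptide, and a ratio below 1 to an underrepresented one.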

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.072AlaAla: 5.072 ± 0.098
0.601AlaCys: 0.601 ± 0.031
4.065AlaAsp: 4.065 ± 0.092
4.342AlaGlu: 4.342 ± 0.072
3.183AlaPhe: 3.183 ± 0.06
5.193AlaGly: 5.193 ± 0.1
1.221AlaHis: 1.221 ± 0.038
5.064AlaIle: 5.064 ± 0.084
3.628AlaLys: 3.628 ± 0.078
6.749AlaLeu: 6.749 ± 0.081
1.969AlaMet: 1.969 ± 0.051
3.178AlaAsn: 3.178 ± 0.076
2.077AlaPro: 2.077 ± 0.064
3.028AlaGln: 3.028 ± 0.072
3.077AlaArg: 3.077 ± 0.071
4.474AlaSer: 4.474 ± 0.087
3.605AlaThr: 3.605 ± 0.098
4.821AlaVal: 4.821 ± 0.082
0.731AlaTrp: 0.731 ± 0.034
2.626AlaTyr: 2.626 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.427CysAla: 0.427 ± 0.023
0.103CysCys: 0.103 ± 0.012
0.444CysAsp: 0.444 ± 0.019
0.451CysGlu: 0.451 ± 0.028
0.303CysPhe: 0.303 ± 0.018
0.658CysGly: 0.658 ± 0.03
0.179CysHis: 0.179 ± 0.018
0.462CysIle: 0.462 ± 0.022
0.322CysLys: 0.322 ± 0.018
0.699CysLeu: 0.699 ± 0.026
0.141CysMet: 0.141 ± 0.011
0.323CysAsn: 0.323 ± 0.02
0.335CysPro: 0.335 ± 0.022
0.303CysGln: 0.303 ± 0.028
0.253CysArg: 0.253 ± 0.017
0.543CysSer: 0.543 ± 0.033
0.397CysThr: 0.397 ± 0.028
0.432CysVal: 0.432 ± 0.025
0.083CysTrp: 0.083 ± 0.011
0.256CysTyr: 0.256 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.868AspAla: 3.868 ± 0.093
0.403AspCys: 0.403 ± 0.023
3.462AspAsp: 3.462 ± 0.157
3.772AspGlu: 3.772 ± 0.078
3.408AspPhe: 3.408 ± 0.067
4.678AspGly: 4.678 ± 0.116
1.329AspHis: 1.329 ± 0.046
3.977AspIle: 3.977 ± 0.079
3.281AspLys: 3.281 ± 0.067
6.233AspLeu: 6.233 ± 0.091
1.393AspMet: 1.393 ± 0.043
2.959AspAsn: 2.959 ± 0.099
2.826AspPro: 2.826 ± 0.09
3.13AspGln: 3.13 ± 0.067
3.147AspArg: 3.147 ± 0.066
3.564AspSer: 3.564 ± 0.072
2.693AspThr: 2.693 ± 0.066
3.667AspVal: 3.667 ± 0.07
0.898AspTrp: 0.898 ± 0.031
2.827AspTyr: 2.827 ± 0.062
0.0AspXaa: 0.0 ± 0.0
Glu
4.799GluAla: 4.799 ± 0.083
0.287GluCys: 0.287 ± 0.021
4.086GluAsp: 4.086 ± 0.073
5.44GluGlu: 5.44 ± 0.102
3.039GluPhe: 3.039 ± 0.064
4.065GluGly: 4.065 ± 0.062
1.175GluHis: 1.175 ± 0.042
5.004GluIle: 5.004 ± 0.082
4.056GluLys: 4.056 ± 0.09
6.757GluLeu: 6.757 ± 0.1
1.788GluMet: 1.788 ± 0.049
3.459GluAsn: 3.459 ± 0.063
1.899GluPro: 1.899 ± 0.055
2.825GluGln: 2.825 ± 0.06
3.142GluArg: 3.142 ± 0.05
3.552GluSer: 3.552 ± 0.076
3.302GluThr: 3.302 ± 0.067
4.931GluVal: 4.931 ± 0.096
0.624GluTrp: 0.624 ± 0.029
2.239GluTyr: 2.239 ± 0.057
0.0GluXaa: 0.0 ± 0.0
Phe
2.978PheAla: 2.978 ± 0.054
0.356PheCys: 0.356 ± 0.022
3.527PheAsp: 3.527 ± 0.063
3.318PheGlu: 3.318 ± 0.068
2.481PhePhe: 2.481 ± 0.068
3.801PheGly: 3.801 ± 0.067
0.829PheHis: 0.829 ± 0.034
3.029PheIle: 3.029 ± 0.061
2.548PheLys: 2.548 ± 0.064
4.465PheLeu: 4.465 ± 0.086
1.122PheMet: 1.122 ± 0.039
2.532PheAsn: 2.532 ± 0.051
1.759PhePro: 1.759 ± 0.045
1.814PheGln: 1.814 ± 0.048
2.173PheArg: 2.173 ± 0.056
3.488PheSer: 3.488 ± 0.073
2.789PheThr: 2.789 ± 0.068
2.958PheVal: 2.958 ± 0.073
0.639PheTrp: 0.639 ± 0.029
2.083PheTyr: 2.083 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
4.698GlyAla: 4.698 ± 0.083
0.643GlyCys: 0.643 ± 0.029
4.162GlyAsp: 4.162 ± 0.093
4.124GlyGlu: 4.124 ± 0.075
3.741GlyPhe: 3.741 ± 0.068
5.414GlyGly: 5.414 ± 0.108
1.294GlyHis: 1.294 ± 0.043
5.6GlyIle: 5.6 ± 0.095
4.275GlyLys: 4.275 ± 0.087
6.905GlyLeu: 6.905 ± 0.102
2.021GlyMet: 2.021 ± 0.061
3.533GlyAsn: 3.533 ± 0.085
2.007GlyPro: 2.007 ± 0.059
2.889GlyGln: 2.889 ± 0.066
3.037GlyArg: 3.037 ± 0.062
4.805GlySer: 4.805 ± 0.111
4.187GlyThr: 4.187 ± 0.097
4.853GlyVal: 4.853 ± 0.083
0.991GlyTrp: 0.991 ± 0.038
2.935GlyTyr: 2.935 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
1.03HisAla: 1.03 ± 0.035
0.183HisCys: 0.183 ± 0.016
0.797HisAsp: 0.797 ± 0.028
0.889HisGlu: 0.889 ± 0.034
1.079HisPhe: 1.079 ± 0.038
1.27HisGly: 1.27 ± 0.039
0.547HisHis: 0.547 ± 0.028
1.256HisIle: 1.256 ± 0.044
0.923HisLys: 0.923 ± 0.034
1.951HisLeu: 1.951 ± 0.051
0.378HisMet: 0.378 ± 0.022
0.775HisAsn: 0.775 ± 0.03
1.036HisPro: 1.036 ± 0.036
0.955HisGln: 0.955 ± 0.034
0.898HisArg: 0.898 ± 0.036
1.025HisSer: 1.025 ± 0.036
0.869HisThr: 0.869 ± 0.029
1.003HisVal: 1.003 ± 0.039
0.282HisTrp: 0.282 ± 0.018
0.907HisTyr: 0.907 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
5.286IleAla: 5.286 ± 0.079
0.602IleCys: 0.602 ± 0.042
4.739IleAsp: 4.739 ± 0.071
4.627IleGlu: 4.627 ± 0.082
3.09IlePhe: 3.09 ± 0.068
5.202IleGly: 5.202 ± 0.094
1.354IleHis: 1.354 ± 0.04
4.278IleIle: 4.278 ± 0.08
3.596IleLys: 3.596 ± 0.065
6.237IleLeu: 6.237 ± 0.097
1.367IleMet: 1.367 ± 0.036
3.539IleAsn: 3.539 ± 0.069
3.203IlePro: 3.203 ± 0.061
2.737IleGln: 2.737 ± 0.056
3.378IleArg: 3.378 ± 0.069
4.996IleSer: 4.996 ± 0.081
3.881IleThr: 3.881 ± 0.075
4.368IleVal: 4.368 ± 0.082
0.763IleTrp: 0.763 ± 0.035
2.609IleTyr: 2.609 ± 0.061
0.0IleXaa: 0.0 ± 0.0
Lys
4.117LysAla: 4.117 ± 0.083
0.252LysCys: 0.252 ± 0.017
3.398LysAsp: 3.398 ± 0.073
4.562LysGlu: 4.562 ± 0.093
2.013LysPhe: 2.013 ± 0.054
3.797LysGly: 3.797 ± 0.079
0.918LysHis: 0.918 ± 0.036
3.818LysIle: 3.818 ± 0.075
4.188LysLys: 4.188 ± 0.097
5.097LysLeu: 5.097 ± 0.099
1.537LysMet: 1.537 ± 0.048
2.731LysAsn: 2.731 ± 0.066
2.087LysPro: 2.087 ± 0.06
2.324LysGln: 2.324 ± 0.059
3.012LysArg: 3.012 ± 0.074
3.186LysSer: 3.186 ± 0.068
3.111LysThr: 3.111 ± 0.063
3.552LysVal: 3.552 ± 0.068
0.709LysTrp: 0.709 ± 0.028
1.994LysTyr: 1.994 ± 0.047
0.0LysXaa: 0.0 ± 0.0
Leu
6.914LeuAla: 6.914 ± 0.1
0.655LeuCys: 0.655 ± 0.028
6.071LeuAsp: 6.071 ± 0.1
6.51LeuGlu: 6.51 ± 0.095
4.751LeuPhe: 4.751 ± 0.091
6.769LeuGly: 6.769 ± 0.1
1.501LeuHis: 1.501 ± 0.046
7.354LeuIle: 7.354 ± 0.119
5.615LeuLys: 5.615 ± 0.103
9.243LeuLeu: 9.243 ± 0.134
2.42LeuMet: 2.42 ± 0.056
4.941LeuAsn: 4.941 ± 0.075
3.802LeuPro: 3.802 ± 0.062
3.506LeuGln: 3.506 ± 0.066
4.052LeuArg: 4.052 ± 0.072
6.714LeuSer: 6.714 ± 0.085
5.146LeuThr: 5.146 ± 0.08
6.305LeuVal: 6.305 ± 0.087
0.993LeuTrp: 0.993 ± 0.034
3.084LeuTyr: 3.084 ± 0.07
0.0LeuXaa: 0.0 ± 0.0
Met
2.017MetAla: 2.017 ± 0.055
0.136MetCys: 0.136 ± 0.01
1.543MetAsp: 1.543 ± 0.043
1.779MetGlu: 1.779 ± 0.048
0.809MetPhe: 0.809 ± 0.035
1.726MetGly: 1.726 ± 0.044
0.419MetHis: 0.419 ± 0.022
1.776MetIle: 1.776 ± 0.047
1.883MetLys: 1.883 ± 0.049
2.162MetLeu: 2.162 ± 0.053
0.618MetMet: 0.618 ± 0.032
1.361MetAsn: 1.361 ± 0.043
0.901MetPro: 0.901 ± 0.037
0.962MetGln: 0.962 ± 0.033
1.256MetArg: 1.256 ± 0.043
1.542MetSer: 1.542 ± 0.047
1.375MetThr: 1.375 ± 0.04
1.624MetVal: 1.624 ± 0.047
0.172MetTrp: 0.172 ± 0.014
0.704MetTyr: 0.704 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.182AsnAla: 3.182 ± 0.063
0.398AsnCys: 0.398 ± 0.033
2.795AsnAsp: 2.795 ± 0.074
2.889AsnGlu: 2.889 ± 0.058
2.463AsnPhe: 2.463 ± 0.056
3.766AsnGly: 3.766 ± 0.111
0.853AsnHis: 0.853 ± 0.033
3.102AsnIle: 3.102 ± 0.067
2.718AsnLys: 2.718 ± 0.063
4.504AsnLeu: 4.504 ± 0.074
1.257AsnMet: 1.257 ± 0.035
2.521AsnAsn: 2.521 ± 0.06
2.735AsnPro: 2.735 ± 0.057
2.144AsnGln: 2.144 ± 0.061
2.526AsnArg: 2.526 ± 0.055
3.277AsnSer: 3.277 ± 0.07
2.815AsnThr: 2.815 ± 0.082
2.847AsnVal: 2.847 ± 0.095
0.89AsnTrp: 0.89 ± 0.033
2.265AsnTyr: 2.265 ± 0.052
0.0AsnXaa: 0.0 ± 0.0
Pro
2.556ProAla: 2.556 ± 0.063
0.206ProCys: 0.206 ± 0.015
2.828ProAsp: 2.828 ± 0.055
3.307ProGlu: 3.307 ± 0.07
1.915ProPhe: 1.915 ± 0.043
2.724ProGly: 2.724 ± 0.066
0.724ProHis: 0.724 ± 0.029
2.625ProIle: 2.625 ± 0.068
2.034ProLys: 2.034 ± 0.055
3.402ProLeu: 3.402 ± 0.065
0.896ProMet: 0.896 ± 0.029
2.054ProAsn: 2.054 ± 0.058
1.159ProPro: 1.159 ± 0.044
1.409ProGln: 1.409 ± 0.042
1.293ProArg: 1.293 ± 0.045
2.371ProSer: 2.371 ± 0.057
1.975ProThr: 1.975 ± 0.064
2.825ProVal: 2.825 ± 0.053
0.497ProTrp: 0.497 ± 0.023
1.447ProTyr: 1.447 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
3.049GlnAla: 3.049 ± 0.066
0.206GlnCys: 0.206 ± 0.017
2.416GlnAsp: 2.416 ± 0.056
2.941GlnGlu: 2.941 ± 0.065
2.006GlnPhe: 2.006 ± 0.049
2.606GlnGly: 2.606 ± 0.057
0.66GlnHis: 0.66 ± 0.027
2.778GlnIle: 2.778 ± 0.052
2.128GlnLys: 2.128 ± 0.05
4.612GlnLeu: 4.612 ± 0.084
1.166GlnMet: 1.166 ± 0.038
1.943GlnAsn: 1.943 ± 0.052
1.442GlnPro: 1.442 ± 0.045
1.957GlnGln: 1.957 ± 0.06
1.832GlnArg: 1.832 ± 0.047
2.268GlnSer: 2.268 ± 0.049
1.995GlnThr: 1.995 ± 0.046
2.851GlnVal: 2.851 ± 0.055
0.538GlnTrp: 0.538 ± 0.024
1.495GlnTyr: 1.495 ± 0.044
0.0GlnXaa: 0.0 ± 0.0
Arg
2.783ArgAla: 2.783 ± 0.052
0.235ArgCys: 0.235 ± 0.016
2.491ArgAsp: 2.491 ± 0.057
3.043ArgGlu: 3.043 ± 0.068
2.473ArgPhe: 2.473 ± 0.053
2.579ArgGly: 2.579 ± 0.052
0.793ArgHis: 0.793 ± 0.032
3.476ArgIle: 3.476 ± 0.068
3.013ArgLys: 3.013 ± 0.062
4.431ArgLeu: 4.431 ± 0.079
1.333ArgMet: 1.333 ± 0.044
2.379ArgAsn: 2.379 ± 0.058
1.681ArgPro: 1.681 ± 0.045
1.89ArgGln: 1.89 ± 0.046
2.112ArgArg: 2.112 ± 0.057
3.094ArgSer: 3.094 ± 0.066
2.367ArgThr: 2.367 ± 0.05
2.911ArgVal: 2.911 ± 0.058
0.599ArgTrp: 0.599 ± 0.026
2.06ArgTyr: 2.06 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
4.088SerAla: 4.088 ± 0.074
0.617SerCys: 0.617 ± 0.032
3.944SerAsp: 3.944 ± 0.093
4.016SerGlu: 4.016 ± 0.079
3.422SerPhe: 3.422 ± 0.057
5.521SerGly: 5.521 ± 0.103
1.019SerHis: 1.019 ± 0.033
4.351SerIle: 4.351 ± 0.079
3.459SerLys: 3.459 ± 0.069
6.274SerLeu: 6.274 ± 0.103
1.618SerMet: 1.618 ± 0.04
3.092SerAsn: 3.092 ± 0.08
2.48SerPro: 2.48 ± 0.061
2.345SerGln: 2.345 ± 0.054
2.848SerArg: 2.848 ± 0.063
4.266SerSer: 4.266 ± 0.082
3.386SerThr: 3.386 ± 0.074
4.1SerVal: 4.1 ± 0.069
0.886SerTrp: 0.886 ± 0.032
2.556SerTyr: 2.556 ± 0.065
0.0SerXaa: 0.0 ± 0.0
Thr
4.162ThrAla: 4.162 ± 0.109
0.309ThrCys: 0.309 ± 0.028
3.482ThrAsp: 3.482 ± 0.096
3.3ThrGlu: 3.3 ± 0.071
2.566ThrPhe: 2.566 ± 0.062
4.217ThrGly: 4.217 ± 0.105
0.994ThrHis: 0.994 ± 0.034
3.831ThrIle: 3.831 ± 0.078
2.439ThrLys: 2.439 ± 0.056
5.133ThrLeu: 5.133 ± 0.088
1.005ThrMet: 1.005 ± 0.036
2.6ThrAsn: 2.6 ± 0.089
2.382ThrPro: 2.382 ± 0.065
1.995ThrGln: 1.995 ± 0.056
2.111ThrArg: 2.111 ± 0.05
3.271ThrSer: 3.271 ± 0.07
3.167ThrThr: 3.167 ± 0.077
3.909ThrVal: 3.909 ± 0.134
0.618ThrTrp: 0.618 ± 0.037
2.162ThrTyr: 2.162 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
4.487ValAla: 4.487 ± 0.076
0.535ValCys: 0.535 ± 0.031
4.153ValAsp: 4.153 ± 0.086
4.157ValGlu: 4.157 ± 0.074
3.19ValPhe: 3.19 ± 0.065
4.498ValGly: 4.498 ± 0.081
1.173ValHis: 1.173 ± 0.041
4.987ValIle: 4.987 ± 0.079
3.486ValLys: 3.486 ± 0.07
6.33ValLeu: 6.33 ± 0.093
1.585ValMet: 1.585 ± 0.042
3.288ValAsn: 3.288 ± 0.058
2.414ValPro: 2.414 ± 0.055
2.344ValGln: 2.344 ± 0.05
2.862ValArg: 2.862 ± 0.067
4.542ValSer: 4.542 ± 0.083
3.712ValThr: 3.712 ± 0.183
4.899ValVal: 4.899 ± 0.093
0.708ValTrp: 0.708 ± 0.026
2.531ValTyr: 2.531 ± 0.058
0.0ValXaa: 0.0 ± 0.0
Trp
0.769TrpAla: 0.769 ± 0.032
0.087TrpCys: 0.087 ± 0.01
0.745TrpAsp: 0.745 ± 0.033
0.825TrpGlu: 0.825 ± 0.034
0.575TrpPhe: 0.575 ± 0.029
0.801TrpGly: 0.801 ± 0.035
0.259TrpHis: 0.259 ± 0.017
0.871TrpIle: 0.871 ± 0.04
0.78TrpLys: 0.78 ± 0.032
1.213TrpLeu: 1.213 ± 0.037
0.389TrpMet: 0.389 ± 0.025
0.734TrpAsn: 0.734 ± 0.03
0.332TrpPro: 0.332 ± 0.02
0.454TrpGln: 0.454 ± 0.023
0.54TrpArg: 0.54 ± 0.029
0.812TrpSer: 0.812 ± 0.029
0.675TrpThr: 0.675 ± 0.031
0.784TrpVal: 0.784 ± 0.031
0.175TrpTrp: 0.175 ± 0.014
0.497TrpTyr: 0.497 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.424TyrAla: 2.424 ± 0.055
0.298TyrCys: 0.298 ± 0.019
2.365TyrAsp: 2.365 ± 0.048
2.176TyrGlu: 2.176 ± 0.053
2.184TyrPhe: 2.184 ± 0.057
2.83TyrGly: 2.83 ± 0.067
0.814TyrHis: 0.814 ± 0.029
2.215TyrIle: 2.215 ± 0.052
2.033TyrLys: 2.033 ± 0.05
3.949TyrLeu: 3.949 ± 0.075
0.762TyrMet: 0.762 ± 0.031
2.004TyrAsn: 2.004 ± 0.047
1.689TyrPro: 1.689 ± 0.048
1.88TyrGln: 1.88 ± 0.053
2.193TyrArg: 2.193 ± 0.048
2.496TyrSer: 2.496 ± 0.054
2.205TyrThr: 2.205 ± 0.072
2.24TyrVal: 2.24 ± 0.048
0.521TyrTrp: 0.521 ± 0.028
1.739TyrTyr: 1.739 ± 0.052
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2645 proteins (881571 amino acids)
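The totals above give a rough sense of the absolute counts behind the per mille values. The arithmetic below is a sketch under one assumption that this page does not state: that dipeptides are counted as overlapping adjacent pairs within each protein, so a protein of length n contributes n - 1 dipeptides.

```python
n_proteins = 2645
n_amino_acids = 881571

# Each protein of length n contributes n - 1 overlapping pairs (assumption).
n_dipeptides = n_amino_acids - n_proteins
print("dipeptides in the proteome:", n_dipeptides)

# Approximate number of AlaAla occurrences implied by 5.072 per mille.
ala_ala_permille = 5.072
approx_ala_ala = ala_ala_permille * 1e-3 * n_dipeptides
print("approximate AlaAla count:", round(approx_ala_ala))
```

Because the table values are rounded to three decimals, counts recovered this way are only approximate.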

Note: The error has been estimated by bootstrapping (x100) at the protein level.
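A protein-level bootstrap can be sketched as follows. This is a minimal illustration, not the procedure used for this page: it assumes that whole proteins are resampled with replacement, that 100 replicates are drawn, and that the reported error is the standard deviation of the replicate estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_stats(seq, dipeptide):
    """Return (occurrences of dipeptide, number of overlapping pairs) for one protein."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return pairs.count(dipeptide), len(pairs)


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_stats(seq, dipeptide) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total > 0:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKLLA", "AALL", "GLLK", "MAAA"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects variation between proteins.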

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski