Amino acid dipeptide frequency for Terriglobus albidus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
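
As a rough illustration of how such frequencies can be counted, here is a minimal Python sketch. It assumes overlapping windows of two residues, pooled over all proteins, and one-letter amino acid codes (the table uses three-letter codes); the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: overlapping windows of length 2 are counted, and a pair
    never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteins): pairs are MA, AA, AL and AA, AK.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
print(freqs)
```

In this toy input "AA" accounts for two of the five counted pairs, so its frequency is 400‰.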

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
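
The arithmetic above can be sketched in a few lines of Python. The uniform expectation of 1/400 equals 2.5‰, which is the value a table entry would be compared against. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the plain threshold comparison is an assumption: the page does not state the exact rule behind the red and blue marking.

```python
N_AMINO_ACIDS = 20

# Uniform expectation: every one of the 20 * 20 dipeptides equally likely.
expected_fraction = 1 / N_AMINO_ACIDS**2          # 0.0025, i.e. 0.25 %
expected_per_mille = 1000 * expected_fraction     # 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a table value given in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=expected_per_mille):
    """Assumed rule: compare a table value with the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table (in per mille).
examples = {"AlaAla": 13.2, "CysCys": 0.15}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

With this rule AlaAla (13.2‰) comes out as overrepresented and CysCys (0.15‰) as underrepresented.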

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰, value ± error); rows give the first residue, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 13.2 ± 0.117 | 0.912 ± 0.022 | 4.896 ± 0.058 | 6.005 ± 0.084 | 3.837 ± 0.044 | 8.782 ± 0.074 | 2.05 ± 0.034 | 5.694 ± 0.065 | 4.151 ± 0.063 | 10.451 ± 0.093 | 2.72 ± 0.043 | 3.49 ± 0.049 | 4.919 ± 0.061 | 4.384 ± 0.056 | 5.862 ± 0.058 | 6.62 ± 0.073 | 6.541 ± 0.081 | 7.754 ± 0.073 | 1.34 ± 0.03 | 2.653 ± 0.041 | 0.0 ± 0.0
Cys | 0.814 ± 0.02 | 0.15 ± 0.009 | 0.43 ± 0.014 | 0.417 ± 0.016 | 0.343 ± 0.015 | 0.875 ± 0.026 | 0.242 ± 0.015 | 0.46 ± 0.016 | 0.228 ± 0.011 | 0.807 ± 0.021 | 0.2 ± 0.01 | 0.264 ± 0.011 | 0.397 ± 0.015 | 0.212 ± 0.013 | 0.464 ± 0.017 | 0.62 ± 0.023 | 0.539 ± 0.022 | 0.58 ± 0.017 | 0.112 ± 0.008 | 0.225 ± 0.012 | 0.0 ± 0.0
Asp | 5.362 ± 0.06 | 0.386 ± 0.013 | 2.401 ± 0.041 | 2.823 ± 0.044 | 2.071 ± 0.037 | 4.438 ± 0.066 | 1.192 ± 0.024 | 2.383 ± 0.037 | 1.744 ± 0.032 | 5.2 ± 0.06 | 0.921 ± 0.021 | 1.457 ± 0.032 | 3.049 ± 0.045 | 1.717 ± 0.029 | 3.228 ± 0.045 | 2.677 ± 0.047 | 2.621 ± 0.038 | 3.605 ± 0.047 | 0.837 ± 0.021 | 1.516 ± 0.031 | 0.0 ± 0.0
Glu | 5.677 ± 0.073 | 0.347 ± 0.015 | 2.439 ± 0.042 | 3.34 ± 0.058 | 2.014 ± 0.035 | 3.624 ± 0.051 | 1.33 ± 0.03 | 3.199 ± 0.044 | 2.536 ± 0.042 | 5.409 ± 0.073 | 1.484 ± 0.029 | 1.679 ± 0.035 | 2.255 ± 0.045 | 2.507 ± 0.047 | 3.885 ± 0.055 | 2.905 ± 0.037 | 3.074 ± 0.043 | 3.659 ± 0.055 | 0.739 ± 0.023 | 1.406 ± 0.033 | 0.0 ± 0.0
Phe | 4.187 ± 0.052 | 0.373 ± 0.016 | 2.327 ± 0.038 | 1.928 ± 0.034 | 1.714 ± 0.034 | 3.61 ± 0.06 | 1.054 ± 0.023 | 1.475 ± 0.031 | 1.029 ± 0.023 | 3.83 ± 0.056 | 0.674 ± 0.02 | 1.482 ± 0.037 | 1.909 ± 0.03 | 1.368 ± 0.026 | 2.398 ± 0.035 | 2.848 ± 0.043 | 2.552 ± 0.042 | 2.63 ± 0.043 | 0.571 ± 0.018 | 1.199 ± 0.025 | 0.0 ± 0.0
Gly | 7.442 ± 0.069 | 0.796 ± 0.021 | 3.883 ± 0.056 | 3.961 ± 0.049 | 3.348 ± 0.048 | 6.585 ± 0.081 | 1.778 ± 0.03 | 4.581 ± 0.058 | 3.789 ± 0.051 | 7.372 ± 0.071 | 2.046 ± 0.037 | 3.053 ± 0.055 | 3.11 ± 0.043 | 2.953 ± 0.047 | 4.629 ± 0.056 | 5.462 ± 0.089 | 5.442 ± 0.1 | 5.92 ± 0.063 | 1.355 ± 0.031 | 2.646 ± 0.045 | 0.0 ± 0.0
His | 2.211 ± 0.036 | 0.217 ± 0.012 | 1.167 ± 0.022 | 1.17 ± 0.026 | 0.974 ± 0.024 | 2.008 ± 0.036 | 0.636 ± 0.021 | 1.117 ± 0.024 | 0.61 ± 0.018 | 2.367 ± 0.044 | 0.493 ± 0.014 | 0.693 ± 0.018 | 1.432 ± 0.03 | 0.777 ± 0.023 | 1.467 ± 0.033 | 1.248 ± 0.027 | 1.267 ± 0.025 | 1.558 ± 0.031 | 0.383 ± 0.016 | 0.692 ± 0.021 | 0.0 ± 0.0
Ile | 6.303 ± 0.066 | 0.48 ± 0.014 | 3.119 ± 0.045 | 2.929 ± 0.045 | 1.973 ± 0.036 | 4.2 ± 0.041 | 1.142 ± 0.026 | 1.81 ± 0.035 | 1.515 ± 0.029 | 4.411 ± 0.056 | 0.768 ± 0.021 | 1.673 ± 0.037 | 2.862 ± 0.043 | 1.696 ± 0.033 | 3.104 ± 0.042 | 3.357 ± 0.046 | 3.349 ± 0.043 | 3.783 ± 0.052 | 0.595 ± 0.018 | 1.391 ± 0.03 | 0.0 ± 0.0
Lys | 3.885 ± 0.057 | 0.185 ± 0.01 | 2.06 ± 0.039 | 2.096 ± 0.037 | 1.165 ± 0.025 | 2.522 ± 0.039 | 0.85 ± 0.021 | 1.881 ± 0.034 | 1.811 ± 0.048 | 3.893 ± 0.048 | 0.945 ± 0.022 | 1.291 ± 0.029 | 2.339 ± 0.047 | 1.709 ± 0.035 | 2.279 ± 0.035 | 2.105 ± 0.036 | 2.44 ± 0.041 | 2.687 ± 0.048 | 0.448 ± 0.016 | 0.986 ± 0.027 | 0.0 ± 0.0
Leu | 10.806 ± 0.103 | 0.939 ± 0.026 | 4.963 ± 0.055 | 5.197 ± 0.067 | 3.597 ± 0.053 | 7.248 ± 0.078 | 2.351 ± 0.043 | 4.478 ± 0.064 | 3.909 ± 0.049 | 10.569 ± 0.121 | 2.047 ± 0.036 | 3.377 ± 0.048 | 5.807 ± 0.063 | 3.902 ± 0.052 | 6.815 ± 0.077 | 6.688 ± 0.067 | 6.201 ± 0.07 | 6.393 ± 0.074 | 1.354 ± 0.035 | 2.511 ± 0.036 | 0.0 ± 0.0
Met | 2.457 ± 0.035 | 0.164 ± 0.01 | 0.991 ± 0.025 | 1.201 ± 0.027 | 0.645 ± 0.018 | 1.568 ± 0.035 | 0.571 ± 0.015 | 1.032 ± 0.027 | 1.1 ± 0.025 | 2.3 ± 0.037 | 0.597 ± 0.019 | 0.837 ± 0.022 | 1.362 ± 0.029 | 1.081 ± 0.026 | 1.593 ± 0.031 | 1.487 ± 0.03 | 1.502 ± 0.029 | 1.533 ± 0.032 | 0.218 ± 0.01 | 0.468 ± 0.016 | 0.0 ± 0.0
Asn | 3.597 ± 0.053 | 0.283 ± 0.014 | 1.682 ± 0.03 | 1.483 ± 0.028 | 1.541 ± 0.032 | 3.398 ± 0.062 | 0.751 ± 0.02 | 1.698 ± 0.036 | 0.953 ± 0.022 | 3.428 ± 0.044 | 0.656 ± 0.018 | 1.387 ± 0.037 | 2.336 ± 0.039 | 1.373 ± 0.033 | 1.981 ± 0.037 | 2.123 ± 0.044 | 2.132 ± 0.04 | 2.515 ± 0.04 | 0.52 ± 0.019 | 1.27 ± 0.036 | 0.0 ± 0.0
Pro | 5.967 ± 0.069 | 0.337 ± 0.013 | 2.86 ± 0.045 | 3.587 ± 0.048 | 1.982 ± 0.037 | 4.528 ± 0.05 | 1.112 ± 0.024 | 2.42 ± 0.036 | 1.948 ± 0.041 | 4.569 ± 0.054 | 1.162 ± 0.026 | 1.915 ± 0.038 | 2.45 ± 0.049 | 2.211 ± 0.04 | 2.543 ± 0.036 | 3.426 ± 0.044 | 3.023 ± 0.052 | 4.246 ± 0.056 | 0.696 ± 0.02 | 1.499 ± 0.033 | 0.0 ± 0.0
Gln | 4.113 ± 0.059 | 0.239 ± 0.013 | 1.625 ± 0.026 | 1.931 ± 0.037 | 1.483 ± 0.029 | 2.619 ± 0.037 | 0.927 ± 0.023 | 2.222 ± 0.035 | 1.578 ± 0.029 | 3.772 ± 0.051 | 1.119 ± 0.026 | 1.422 ± 0.034 | 2.174 ± 0.034 | 2.458 ± 0.052 | 2.699 ± 0.036 | 2.307 ± 0.038 | 2.599 ± 0.043 | 2.838 ± 0.044 | 0.601 ± 0.02 | 1.131 ± 0.027 | 0.0 ± 0.0
Arg | 5.513 ± 0.054 | 0.498 ± 0.018 | 2.961 ± 0.04 | 3.748 ± 0.053 | 2.663 ± 0.045 | 3.976 ± 0.05 | 1.381 ± 0.029 | 3.63 ± 0.051 | 2.494 ± 0.043 | 6.226 ± 0.066 | 1.735 ± 0.035 | 2.287 ± 0.038 | 2.86 ± 0.046 | 2.581 ± 0.037 | 4.316 ± 0.062 | 3.929 ± 0.054 | 3.536 ± 0.04 | 4.23 ± 0.058 | 1.021 ± 0.026 | 1.988 ± 0.03 | 0.0 ± 0.0
Ser | 6.538 ± 0.068 | 0.547 ± 0.015 | 2.909 ± 0.041 | 2.955 ± 0.048 | 2.718 ± 0.043 | 6.148 ± 0.104 | 1.356 ± 0.025 | 3.45 ± 0.042 | 2.139 ± 0.035 | 6.338 ± 0.068 | 1.434 ± 0.031 | 2.294 ± 0.05 | 3.403 ± 0.044 | 2.327 ± 0.04 | 3.719 ± 0.048 | 4.79 ± 0.076 | 4.0 ± 0.066 | 4.539 ± 0.057 | 0.9 ± 0.022 | 1.954 ± 0.038 | 0.0 ± 0.0
Thr | 6.724 ± 0.088 | 0.501 ± 0.019 | 2.873 ± 0.044 | 2.815 ± 0.043 | 2.534 ± 0.051 | 5.641 ± 0.073 | 1.218 ± 0.025 | 3.401 ± 0.051 | 1.866 ± 0.035 | 6.411 ± 0.07 | 1.215 ± 0.028 | 2.103 ± 0.045 | 4.013 ± 0.06 | 2.233 ± 0.038 | 3.169 ± 0.043 | 4.127 ± 0.084 | 3.858 ± 0.071 | 5.1 ± 0.08 | 0.876 ± 0.025 | 1.756 ± 0.046 | 0.0 ± 0.0
Val | 7.544 ± 0.076 | 0.651 ± 0.017 | 3.755 ± 0.047 | 3.88 ± 0.058 | 2.693 ± 0.043 | 4.923 ± 0.065 | 1.577 ± 0.033 | 3.57 ± 0.041 | 2.526 ± 0.04 | 7.492 ± 0.078 | 1.571 ± 0.031 | 2.656 ± 0.043 | 3.766 ± 0.042 | 2.562 ± 0.04 | 4.51 ± 0.06 | 4.958 ± 0.06 | 4.982 ± 0.078 | 5.636 ± 0.067 | 0.936 ± 0.026 | 1.852 ± 0.033 | 0.0 ± 0.0
Trp | 1.131 ± 0.026 | 0.136 ± 0.008 | 0.658 ± 0.019 | 0.618 ± 0.021 | 0.578 ± 0.019 | 0.976 ± 0.026 | 0.382 ± 0.016 | 0.795 ± 0.021 | 0.743 ± 0.022 | 1.495 ± 0.037 | 0.433 ± 0.017 | 0.688 ± 0.023 | 0.586 ± 0.02 | 0.693 ± 0.019 | 0.898 ± 0.021 | 0.943 ± 0.024 | 0.87 ± 0.022 | 0.908 ± 0.022 | 0.241 ± 0.011 | 0.395 ± 0.015 | 0.0 ± 0.0
Tyr | 2.787 ± 0.045 | 0.237 ± 0.012 | 1.63 ± 0.036 | 1.424 ± 0.031 | 1.285 ± 0.029 | 2.482 ± 0.042 | 0.589 ± 0.019 | 1.164 ± 0.027 | 0.906 ± 0.022 | 2.815 ± 0.043 | 0.487 ± 0.017 | 1.059 ± 0.034 | 1.453 ± 0.031 | 1.111 ± 0.023 | 2.012 ± 0.039 | 1.84 ± 0.038 | 1.908 ± 0.038 | 1.924 ± 0.035 | 0.425 ± 0.016 | 0.868 ± 0.028 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 4970 proteins (1854419 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
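
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation over resamples. The function name `bootstrap_error`, the fixed seed, and the toy sequences are hypothetical; the page does not say which statistic Proteome-pI reports as the error.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumptions: proteins (not residues) are resampled with replacement,
    and the error is the standard deviation across the resamples.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so each resample is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in sequences
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy input with one-letter codes (not real proteins).
err = bootstrap_error(["MAAL", "AAK", "MKKA"], "AA")
print(err)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.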

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski