Amino acid dipepetide frequency for Heliobacterium modesticaldum (strain ATCC 51547 / Ice1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.56AlaAla: 11.56 ± 0.198
1.186AlaCys: 1.186 ± 0.041
5.151AlaAsp: 5.151 ± 0.084
6.881AlaGlu: 6.881 ± 0.101
3.739AlaPhe: 3.739 ± 0.083
8.485AlaGly: 8.485 ± 0.128
1.614AlaHis: 1.614 ± 0.052
5.889AlaIle: 5.889 ± 0.094
4.6AlaLys: 4.6 ± 0.085
10.711AlaLeu: 10.711 ± 0.151
2.75AlaMet: 2.75 ± 0.063
2.558AlaAsn: 2.558 ± 0.068
4.025AlaPro: 4.025 ± 0.075
3.415AlaGln: 3.415 ± 0.065
6.075AlaArg: 6.075 ± 0.086
4.786AlaSer: 4.786 ± 0.08
4.495AlaThr: 4.495 ± 0.127
8.039AlaVal: 8.039 ± 0.121
0.99AlaTrp: 0.99 ± 0.038
2.627AlaTyr: 2.627 ± 0.062
0.0AlaXaa: 0.0 ± 0.0
Cys
0.997CysAla: 0.997 ± 0.035
0.224CysCys: 0.224 ± 0.016
0.603CysAsp: 0.603 ± 0.026
0.542CysGlu: 0.542 ± 0.029
0.401CysPhe: 0.401 ± 0.022
1.175CysGly: 1.175 ± 0.052
0.345CysHis: 0.345 ± 0.02
0.581CysIle: 0.581 ± 0.028
0.395CysLys: 0.395 ± 0.025
1.06CysLeu: 1.06 ± 0.035
0.25CysMet: 0.25 ± 0.016
0.321CysAsn: 0.321 ± 0.02
0.826CysPro: 0.826 ± 0.041
0.408CysGln: 0.408 ± 0.02
0.953CysArg: 0.953 ± 0.04
0.611CysSer: 0.611 ± 0.028
0.446CysThr: 0.446 ± 0.024
0.641CysVal: 0.641 ± 0.028
0.121CysTrp: 0.121 ± 0.011
0.339CysTyr: 0.339 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
4.588AspAla: 4.588 ± 0.078
0.671AspCys: 0.671 ± 0.031
2.717AspAsp: 2.717 ± 0.065
3.783AspGlu: 3.783 ± 0.081
2.106AspPhe: 2.106 ± 0.054
4.343AspGly: 4.343 ± 0.084
1.011AspHis: 1.011 ± 0.04
3.253AspIle: 3.253 ± 0.066
2.603AspLys: 2.603 ± 0.06
5.372AspLeu: 5.372 ± 0.097
1.353AspMet: 1.353 ± 0.041
1.464AspAsn: 1.464 ± 0.048
2.607AspPro: 2.607 ± 0.059
1.505AspGln: 1.505 ± 0.037
3.675AspArg: 3.675 ± 0.073
2.299AspSer: 2.299 ± 0.052
2.276AspThr: 2.276 ± 0.065
3.866AspVal: 3.866 ± 0.072
0.689AspTrp: 0.689 ± 0.027
1.647AspTyr: 1.647 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
7.64GluAla: 7.64 ± 0.111
0.544GluCys: 0.544 ± 0.027
2.825GluAsp: 2.825 ± 0.061
5.66GluGlu: 5.66 ± 0.113
1.791GluPhe: 1.791 ± 0.05
5.132GluGly: 5.132 ± 0.095
1.121GluHis: 1.121 ± 0.038
4.25GluIle: 4.25 ± 0.078
4.75GluLys: 4.75 ± 0.087
6.457GluLeu: 6.457 ± 0.119
2.095GluMet: 2.095 ± 0.053
2.195GluAsn: 2.195 ± 0.054
2.406GluPro: 2.406 ± 0.065
3.076GluGln: 3.076 ± 0.074
5.163GluArg: 5.163 ± 0.083
2.977GluSer: 2.977 ± 0.064
3.393GluThr: 3.393 ± 0.063
4.642GluVal: 4.642 ± 0.086
0.714GluTrp: 0.714 ± 0.03
1.715GluTyr: 1.715 ± 0.047
0.0GluXaa: 0.0 ± 0.0
Phe
3.349PheAla: 3.349 ± 0.062
0.526PheCys: 0.526 ± 0.026
2.095PheAsp: 2.095 ± 0.05
1.997PheGlu: 1.997 ± 0.05
1.926PhePhe: 1.926 ± 0.059
3.144PheGly: 3.144 ± 0.068
0.806PheHis: 0.806 ± 0.032
2.207PheIle: 2.207 ± 0.049
1.302PheLys: 1.302 ± 0.047
3.764PheLeu: 3.764 ± 0.08
0.808PheMet: 0.808 ± 0.031
1.154PheAsn: 1.154 ± 0.038
1.695PhePro: 1.695 ± 0.05
1.295PheGln: 1.295 ± 0.04
2.357PheArg: 2.357 ± 0.048
2.401PheSer: 2.401 ± 0.063
2.062PheThr: 2.062 ± 0.056
2.529PheVal: 2.529 ± 0.056
0.483PheTrp: 0.483 ± 0.027
1.177PheTyr: 1.177 ± 0.042
0.0PheXaa: 0.0 ± 0.0
Gly
7.318GlyAla: 7.318 ± 0.117
1.111GlyCys: 1.111 ± 0.039
4.143GlyAsp: 4.143 ± 0.085
5.226GlyGlu: 5.226 ± 0.076
3.298GlyPhe: 3.298 ± 0.06
6.457GlyGly: 6.457 ± 0.121
1.614GlyHis: 1.614 ± 0.048
5.095GlyIle: 5.095 ± 0.083
4.701GlyLys: 4.701 ± 0.071
8.12GlyLeu: 8.12 ± 0.114
2.425GlyMet: 2.425 ± 0.052
2.465GlyAsn: 2.465 ± 0.068
2.635GlyPro: 2.635 ± 0.065
3.031GlyGln: 3.031 ± 0.073
5.348GlyArg: 5.348 ± 0.086
4.103GlySer: 4.103 ± 0.082
4.216GlyThr: 4.216 ± 0.105
5.89GlyVal: 5.89 ± 0.102
0.951GlyTrp: 0.951 ± 0.031
2.456GlyTyr: 2.456 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
1.469HisAla: 1.469 ± 0.042
0.325HisCys: 0.325 ± 0.022
0.933HisAsp: 0.933 ± 0.036
0.919HisGlu: 0.919 ± 0.031
0.802HisPhe: 0.802 ± 0.03
1.544HisGly: 1.544 ± 0.05
0.534HisHis: 0.534 ± 0.027
1.161HisIle: 1.161 ± 0.037
0.694HisLys: 0.694 ± 0.03
2.073HisLeu: 2.073 ± 0.059
0.428HisMet: 0.428 ± 0.021
0.584HisAsn: 0.584 ± 0.026
1.362HisPro: 1.362 ± 0.043
0.718HisGln: 0.718 ± 0.029
1.507HisArg: 1.507 ± 0.047
1.039HisSer: 1.039 ± 0.042
0.901HisThr: 0.901 ± 0.033
1.211HisVal: 1.211 ± 0.04
0.269HisTrp: 0.269 ± 0.016
0.628HisTyr: 0.628 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.21IleAla: 6.21 ± 0.091
0.686IleCys: 0.686 ± 0.03
3.831IleAsp: 3.831 ± 0.071
4.039IleGlu: 4.039 ± 0.076
2.054IlePhe: 2.054 ± 0.052
4.893IleGly: 4.893 ± 0.097
1.27IleHis: 1.27 ± 0.039
3.565IleIle: 3.565 ± 0.08
2.475IleLys: 2.475 ± 0.069
5.517IleLeu: 5.517 ± 0.104
1.243IleMet: 1.243 ± 0.043
1.932IleAsn: 1.932 ± 0.056
3.103IlePro: 3.103 ± 0.064
1.992IleGln: 1.992 ± 0.052
3.899IleArg: 3.899 ± 0.074
3.211IleSer: 3.211 ± 0.078
3.31IleThr: 3.31 ± 0.078
4.58IleVal: 4.58 ± 0.073
0.492IleTrp: 0.492 ± 0.023
1.581IleTyr: 1.581 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
5.389LysAla: 5.389 ± 0.097
0.337LysCys: 0.337 ± 0.025
2.581LysAsp: 2.581 ± 0.062
4.156LysGlu: 4.156 ± 0.083
1.099LysPhe: 1.099 ± 0.038
4.091LysGly: 4.091 ± 0.08
0.725LysHis: 0.725 ± 0.03
2.693LysIle: 2.693 ± 0.073
3.179LysLys: 3.179 ± 0.08
3.915LysLeu: 3.915 ± 0.077
1.247LysMet: 1.247 ± 0.043
1.724LysAsn: 1.724 ± 0.055
2.162LysPro: 2.162 ± 0.051
1.736LysGln: 1.736 ± 0.05
3.092LysArg: 3.092 ± 0.063
2.399LysSer: 2.399 ± 0.058
2.801LysThr: 2.801 ± 0.059
3.664LysVal: 3.664 ± 0.08
0.469LysTrp: 0.469 ± 0.022
1.158LysTyr: 1.158 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
10.752LeuAla: 10.752 ± 0.158
1.107LeuCys: 1.107 ± 0.038
5.057LeuAsp: 5.057 ± 0.081
6.511LeuGlu: 6.511 ± 0.11
4.093LeuPhe: 4.093 ± 0.093
7.211LeuGly: 7.211 ± 0.121
1.919LeuHis: 1.919 ± 0.045
5.653LeuIle: 5.653 ± 0.099
4.63LeuLys: 4.63 ± 0.073
10.638LeuLeu: 10.638 ± 0.185
2.227LeuMet: 2.227 ± 0.056
2.86LeuAsn: 2.86 ± 0.055
5.333LeuPro: 5.333 ± 0.088
3.716LeuGln: 3.716 ± 0.067
6.83LeuArg: 6.83 ± 0.114
6.47LeuSer: 6.47 ± 0.112
5.718LeuThr: 5.718 ± 0.095
6.615LeuVal: 6.615 ± 0.1
1.018LeuTrp: 1.018 ± 0.038
2.568LeuTyr: 2.568 ± 0.058
0.0LeuXaa: 0.0 ± 0.0
Met
2.935MetAla: 2.935 ± 0.06
0.162MetCys: 0.162 ± 0.014
1.421MetAsp: 1.421 ± 0.04
1.933MetGlu: 1.933 ± 0.046
0.653MetPhe: 0.653 ± 0.029
2.038MetGly: 2.038 ± 0.052
0.408MetHis: 0.408 ± 0.026
1.502MetIle: 1.502 ± 0.049
1.45MetLys: 1.45 ± 0.039
2.238MetLeu: 2.238 ± 0.055
0.663MetMet: 0.663 ± 0.03
0.97MetAsn: 0.97 ± 0.034
1.085MetPro: 1.085 ± 0.033
0.89MetGln: 0.89 ± 0.036
1.519MetArg: 1.519 ± 0.043
1.354MetSer: 1.354 ± 0.04
1.651MetThr: 1.651 ± 0.046
1.9MetVal: 1.9 ± 0.055
0.154MetTrp: 0.154 ± 0.014
0.465MetTyr: 0.465 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.634AsnAla: 2.634 ± 0.066
0.354AsnCys: 0.354 ± 0.021
1.46AsnAsp: 1.46 ± 0.047
1.647AsnGlu: 1.647 ± 0.049
1.01AsnPhe: 1.01 ± 0.035
2.426AsnGly: 2.426 ± 0.065
0.63AsnHis: 0.63 ± 0.028
1.975AsnIle: 1.975 ± 0.058
1.39AsnLys: 1.39 ± 0.045
3.244AsnLeu: 3.244 ± 0.059
0.717AsnMet: 0.717 ± 0.029
0.931AsnAsn: 0.931 ± 0.041
1.95AsnPro: 1.95 ± 0.056
1.106AsnGln: 1.106 ± 0.037
2.358AsnArg: 2.358 ± 0.05
1.458AsnSer: 1.458 ± 0.05
1.444AsnThr: 1.444 ± 0.046
2.099AsnVal: 2.099 ± 0.05
0.372AsnTrp: 0.372 ± 0.022
0.889AsnTyr: 0.889 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
4.703ProAla: 4.703 ± 0.094
0.449ProCys: 0.449 ± 0.025
2.637ProAsp: 2.637 ± 0.057
3.919ProGlu: 3.919 ± 0.073
1.889ProPhe: 1.889 ± 0.048
4.015ProGly: 4.015 ± 0.097
0.867ProHis: 0.867 ± 0.034
2.455ProIle: 2.455 ± 0.049
2.038ProLys: 2.038 ± 0.056
4.566ProLeu: 4.566 ± 0.081
1.079ProMet: 1.079 ± 0.038
1.207ProAsn: 1.207 ± 0.044
2.305ProPro: 2.305 ± 0.133
1.636ProGln: 1.636 ± 0.041
2.261ProArg: 2.261 ± 0.052
2.547ProSer: 2.547 ± 0.067
2.161ProThr: 2.161 ± 0.064
4.265ProVal: 4.265 ± 0.078
0.657ProTrp: 0.657 ± 0.031
1.301ProTyr: 1.301 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
3.998GlnAla: 3.998 ± 0.077
0.309GlnCys: 0.309 ± 0.022
1.464GlnAsp: 1.464 ± 0.043
2.602GlnGlu: 2.602 ± 0.06
1.136GlnPhe: 1.136 ± 0.043
2.867GlnGly: 2.867 ± 0.061
0.648GlnHis: 0.648 ± 0.03
2.21GlnIle: 2.21 ± 0.049
2.078GlnLys: 2.078 ± 0.059
3.314GlnLeu: 3.314 ± 0.071
1.113GlnMet: 1.113 ± 0.034
1.005GlnAsn: 1.005 ± 0.036
1.609GlnPro: 1.609 ± 0.047
1.538GlnGln: 1.538 ± 0.058
2.577GlnArg: 2.577 ± 0.076
1.846GlnSer: 1.846 ± 0.06
1.81GlnThr: 1.81 ± 0.05
3.106GlnVal: 3.106 ± 0.063
0.481GlnTrp: 0.481 ± 0.028
0.848GlnTyr: 0.848 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
4.909ArgAla: 4.909 ± 0.089
0.843ArgCys: 0.843 ± 0.035
3.171ArgAsp: 3.171 ± 0.072
5.197ArgGlu: 5.197 ± 0.091
2.887ArgPhe: 2.887 ± 0.063
4.241ArgGly: 4.241 ± 0.082
1.468ArgHis: 1.468 ± 0.041
4.229ArgIle: 4.229 ± 0.079
3.12ArgLys: 3.12 ± 0.059
7.902ArgLeu: 7.902 ± 0.119
1.92ArgMet: 1.92 ± 0.056
1.853ArgAsn: 1.853 ± 0.053
2.912ArgPro: 2.912 ± 0.069
3.361ArgGln: 3.361 ± 0.081
5.477ArgArg: 5.477 ± 0.114
3.228ArgSer: 3.228 ± 0.065
2.805ArgThr: 2.805 ± 0.075
4.277ArgVal: 4.277 ± 0.071
1.072ArgTrp: 1.072 ± 0.04
2.1ArgTyr: 2.1 ± 0.054
0.0ArgXaa: 0.0 ± 0.0
Ser
5.009SerAla: 5.009 ± 0.091
0.533SerCys: 0.533 ± 0.024
2.603SerAsp: 2.603 ± 0.059
3.092SerGlu: 3.092 ± 0.062
2.153SerPhe: 2.153 ± 0.051
4.839SerGly: 4.839 ± 0.1
1.037SerHis: 1.037 ± 0.04
2.941SerIle: 2.941 ± 0.066
2.127SerLys: 2.127 ± 0.053
5.716SerLeu: 5.716 ± 0.09
1.239SerMet: 1.239 ± 0.035
1.574SerAsn: 1.574 ± 0.05
2.9SerPro: 2.9 ± 0.066
1.812SerGln: 1.812 ± 0.055
3.668SerArg: 3.668 ± 0.07
2.9SerSer: 2.9 ± 0.071
2.481SerThr: 2.481 ± 0.059
3.852SerVal: 3.852 ± 0.07
0.595SerTrp: 0.595 ± 0.025
1.358SerTyr: 1.358 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
5.533ThrAla: 5.533 ± 0.152
0.547ThrCys: 0.547 ± 0.027
2.589ThrAsp: 2.589 ± 0.061
2.973ThrGlu: 2.973 ± 0.063
1.875ThrPhe: 1.875 ± 0.047
5.091ThrGly: 5.091 ± 0.098
0.9ThrHis: 0.9 ± 0.034
3.165ThrIle: 3.165 ± 0.073
1.952ThrLys: 1.952 ± 0.052
5.162ThrLeu: 5.162 ± 0.099
1.246ThrMet: 1.246 ± 0.04
1.412ThrAsn: 1.412 ± 0.042
2.826ThrPro: 2.826 ± 0.067
1.471ThrGln: 1.471 ± 0.046
2.487ThrArg: 2.487 ± 0.063
2.536ThrSer: 2.536 ± 0.065
2.738ThrThr: 2.738 ± 0.1
4.861ThrVal: 4.861 ± 0.128
0.499ThrTrp: 0.499 ± 0.026
1.408ThrTyr: 1.408 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
7.412ValAla: 7.412 ± 0.106
0.804ValCys: 0.804 ± 0.029
4.389ValAsp: 4.389 ± 0.085
5.196ValGlu: 5.196 ± 0.083
2.669ValPhe: 2.669 ± 0.059
5.277ValGly: 5.277 ± 0.093
1.329ValHis: 1.329 ± 0.039
5.011ValIle: 5.011 ± 0.074
3.5ValLys: 3.5 ± 0.082
7.06ValLeu: 7.06 ± 0.097
1.731ValMet: 1.731 ± 0.051
2.556ValAsn: 2.556 ± 0.067
3.402ValPro: 3.402 ± 0.066
2.331ValGln: 2.331 ± 0.048
4.58ValArg: 4.58 ± 0.075
4.145ValSer: 4.145 ± 0.075
4.56ValThr: 4.56 ± 0.118
5.681ValVal: 5.681 ± 0.099
0.632ValTrp: 0.632 ± 0.031
1.97ValTyr: 1.97 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.828TrpAla: 0.828 ± 0.033
0.107TrpCys: 0.107 ± 0.012
0.585TrpAsp: 0.585 ± 0.03
0.791TrpGlu: 0.791 ± 0.024
0.391TrpPhe: 0.391 ± 0.021
0.811TrpGly: 0.811 ± 0.035
0.223TrpHis: 0.223 ± 0.016
0.601TrpIle: 0.601 ± 0.029
0.556TrpLys: 0.556 ± 0.025
1.248TrpLeu: 1.248 ± 0.048
0.326TrpMet: 0.326 ± 0.023
0.411TrpAsn: 0.411 ± 0.021
0.46TrpPro: 0.46 ± 0.025
0.51TrpGln: 0.51 ± 0.029
0.903TrpArg: 0.903 ± 0.034
0.662TrpSer: 0.662 ± 0.03
0.553TrpThr: 0.553 ± 0.024
0.754TrpVal: 0.754 ± 0.029
0.168TrpTrp: 0.168 ± 0.016
0.33TrpTyr: 0.33 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.343TyrAla: 2.343 ± 0.052
0.415TyrCys: 0.415 ± 0.023
1.572TyrAsp: 1.572 ± 0.051
1.484TyrGlu: 1.484 ± 0.047
1.107TyrPhe: 1.107 ± 0.039
2.524TyrGly: 2.524 ± 0.05
0.631TyrHis: 0.631 ± 0.03
1.449TyrIle: 1.449 ± 0.045
1.077TyrLys: 1.077 ± 0.039
2.849TyrLeu: 2.849 ± 0.061
0.529TyrMet: 0.529 ± 0.026
0.915TyrAsn: 0.915 ± 0.038
1.356TyrPro: 1.356 ± 0.042
1.01TyrGln: 1.01 ± 0.034
2.244TyrArg: 2.244 ± 0.058
1.457TyrSer: 1.457 ± 0.042
1.45TyrThr: 1.45 ± 0.055
1.76TyrVal: 1.76 ± 0.041
0.391TyrTrp: 0.391 ± 0.023
0.919TyrTyr: 0.919 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2924 proteins (852455 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski