Amino acid dipeptide frequency for Desulfuribacillus alkaliarsenatis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
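As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It assumes one-letter sequence codes, overlapping windows of two residues that never span two proteins, and normalisation over all dipeptides in the proteome; the function name dipeptide_per_mille and the toy sequences are hypothetical, and the database's own pipeline may differ.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are pooled as Xaa.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): 'U' is non-standard and maps to 'X'.
freqs = dipeptide_per_mille(["MAAL", "MALU"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The table on this page uses three-letter codes (AlaAla, AlaCys, ...); the one-letter keys here ("AA", "AC", ...) are only a convenience of the sketch.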

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides should be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
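The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the page does not state the exact rule or threshold behind the colouring, so a plain greater-than/less-than comparison with 1/400 is assumed, and the function names are hypothetical. The three sample values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"    # would be marked red
    if observed < expected:
        return "under"   # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille()  # 20 x 20 = 400 dipeptides
table_values = {"LeuLeu": 9.945, "AlaAla": 5.532, "CysCys": 0.119}
labels = {name: classify(v, expected) for name, v in table_values.items()}

for name, value in table_values.items():
    print(f"{name}: {value} per mille, {value / expected:.2f}x expected, {labels[name]}")
```

With Xaa included as a 21st residue, uniform_expectation_per_mille(21) gives 1000/441, roughly 2.27‰, instead of 2.5‰.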

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
First residue (rows) below; each entry is frequency ± error in ‰.

Ala
AlaAla: 5.532 ± 0.106
AlaCys: 0.771 ± 0.031
AlaAsp: 3.806 ± 0.072
AlaGlu: 4.916 ± 0.086
AlaPhe: 2.842 ± 0.062
AlaGly: 5.077 ± 0.095
AlaHis: 1.161 ± 0.036
AlaIle: 6.707 ± 0.088
AlaLys: 4.931 ± 0.089
AlaLeu: 6.939 ± 0.11
AlaMet: 2.019 ± 0.051
AlaAsn: 3.4 ± 0.07
AlaPro: 1.98 ± 0.065
AlaGln: 2.228 ± 0.048
AlaArg: 2.897 ± 0.053
AlaSer: 4.048 ± 0.084
AlaThr: 3.906 ± 0.069
AlaVal: 5.109 ± 0.086
AlaTrp: 0.564 ± 0.029
AlaTyr: 2.358 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.481 ± 0.023
CysCys: 0.119 ± 0.012
CysAsp: 0.464 ± 0.02
CysGlu: 0.48 ± 0.022
CysPhe: 0.314 ± 0.017
CysGly: 0.74 ± 0.033
CysHis: 0.264 ± 0.023
CysIle: 0.694 ± 0.026
CysLys: 0.637 ± 0.029
CysLeu: 0.748 ± 0.03
CysMet: 0.23 ± 0.015
CysAsn: 0.447 ± 0.025
CysPro: 0.447 ± 0.028
CysGln: 0.287 ± 0.019
CysArg: 0.353 ± 0.021
CysSer: 0.568 ± 0.027
CysThr: 0.458 ± 0.023
CysVal: 0.545 ± 0.021
CysTrp: 0.083 ± 0.009
CysTyr: 0.314 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.479 ± 0.069
AspCys: 0.521 ± 0.024
AspAsp: 2.735 ± 0.059
AspGlu: 4.166 ± 0.082
AspPhe: 2.473 ± 0.053
AspGly: 3.4 ± 0.074
AspHis: 0.761 ± 0.028
AspIle: 5.444 ± 0.077
AspLys: 3.66 ± 0.066
AspLeu: 4.769 ± 0.077
AspMet: 1.415 ± 0.046
AspAsn: 2.64 ± 0.051
AspPro: 1.596 ± 0.042
AspGln: 1.587 ± 0.043
AspArg: 2.303 ± 0.054
AspSer: 3.008 ± 0.064
AspThr: 2.694 ± 0.056
AspVal: 3.909 ± 0.061
AspTrp: 0.618 ± 0.032
AspTyr: 2.355 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.298 ± 0.084
GluCys: 0.431 ± 0.021
GluAsp: 3.476 ± 0.064
GluGlu: 5.533 ± 0.098
GluPhe: 2.642 ± 0.053
GluGly: 3.761 ± 0.063
GluHis: 1.608 ± 0.043
GluIle: 6.317 ± 0.083
GluLys: 5.095 ± 0.089
GluLeu: 7.859 ± 0.111
GluMet: 1.935 ± 0.047
GluAsn: 3.357 ± 0.062
GluPro: 2.007 ± 0.058
GluGln: 4.388 ± 0.071
GluArg: 3.499 ± 0.06
GluSer: 3.333 ± 0.067
GluThr: 3.326 ± 0.061
GluVal: 5.107 ± 0.076
GluTrp: 0.601 ± 0.025
GluTyr: 2.577 ± 0.053
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.049 ± 0.055
PheCys: 0.372 ± 0.022
PheAsp: 2.323 ± 0.057
PheGlu: 2.551 ± 0.05
PhePhe: 1.918 ± 0.055
PheGly: 2.686 ± 0.071
PheHis: 0.725 ± 0.032
PheIle: 3.75 ± 0.073
PheLys: 2.191 ± 0.053
PheLeu: 3.785 ± 0.077
PheMet: 1.054 ± 0.034
PheAsn: 2.014 ± 0.047
PhePro: 1.306 ± 0.043
PheGln: 1.456 ± 0.041
PheArg: 1.525 ± 0.041
PheSer: 2.783 ± 0.057
PheThr: 2.352 ± 0.052
PheVal: 2.956 ± 0.062
PheTrp: 0.388 ± 0.023
PheTyr: 1.598 ± 0.051
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.595 ± 0.091
GlyCys: 0.818 ± 0.036
GlyAsp: 3.026 ± 0.063
GlyGlu: 4.005 ± 0.064
GlyPhe: 3.083 ± 0.058
GlyGly: 4.166 ± 0.1
GlyHis: 1.311 ± 0.034
GlyIle: 5.998 ± 0.093
GlyLys: 4.422 ± 0.079
GlyLeu: 6.197 ± 0.089
GlyMet: 1.855 ± 0.051
GlyAsn: 2.758 ± 0.055
GlyPro: 1.505 ± 0.047
GlyGln: 2.41 ± 0.051
GlyArg: 2.655 ± 0.058
GlySer: 3.797 ± 0.069
GlyThr: 3.739 ± 0.076
GlyVal: 5.012 ± 0.075
GlyTrp: 0.613 ± 0.031
GlyTyr: 2.882 ± 0.064
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.184 ± 0.039
HisCys: 0.22 ± 0.017
HisAsp: 0.925 ± 0.03
HisGlu: 1.194 ± 0.041
HisPhe: 0.873 ± 0.031
HisGly: 1.385 ± 0.04
HisHis: 0.482 ± 0.025
HisIle: 1.631 ± 0.041
HisLys: 1.218 ± 0.037
HisLeu: 1.713 ± 0.048
HisMet: 0.474 ± 0.026
HisAsn: 0.97 ± 0.031
HisPro: 0.907 ± 0.034
HisGln: 0.774 ± 0.027
HisArg: 0.894 ± 0.029
HisSer: 1.233 ± 0.037
HisThr: 1.06 ± 0.033
HisVal: 1.254 ± 0.039
HisTrp: 0.223 ± 0.016
HisTyr: 0.791 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.928 ± 0.095
IleCys: 0.758 ± 0.029
IleAsp: 5.573 ± 0.084
IleGlu: 6.906 ± 0.099
IlePhe: 3.286 ± 0.072
IleGly: 5.861 ± 0.086
IleHis: 1.533 ± 0.043
IleIle: 7.721 ± 0.109
IleLys: 5.346 ± 0.087
IleLeu: 7.65 ± 0.089
IleMet: 2.079 ± 0.052
IleAsn: 4.504 ± 0.075
IlePro: 3.407 ± 0.057
IleGln: 2.837 ± 0.057
IleArg: 3.576 ± 0.057
IleSer: 5.445 ± 0.081
IleThr: 5.023 ± 0.063
IleVal: 6.192 ± 0.086
IleTrp: 0.59 ± 0.022
IleTyr: 2.894 ± 0.061
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.507 ± 0.087
LysCys: 0.456 ± 0.023
LysAsp: 3.889 ± 0.068
LysGlu: 5.544 ± 0.085
LysPhe: 1.751 ± 0.042
LysGly: 3.892 ± 0.07
LysHis: 1.547 ± 0.048
LysIle: 5.165 ± 0.075
LysLys: 5.048 ± 0.099
LysLeu: 6.139 ± 0.089
LysMet: 1.75 ± 0.046
LysAsn: 3.367 ± 0.069
LysPro: 2.105 ± 0.055
LysGln: 3.325 ± 0.074
LysArg: 2.933 ± 0.065
LysSer: 3.522 ± 0.064
LysThr: 3.349 ± 0.069
LysVal: 4.479 ± 0.068
LysTrp: 0.545 ± 0.028
LysTyr: 2.432 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.634 ± 0.104
LeuCys: 0.741 ± 0.032
LeuAsp: 5.126 ± 0.084
LeuGlu: 7.066 ± 0.102
LeuPhe: 3.908 ± 0.077
LeuGly: 6.258 ± 0.092
LeuHis: 1.854 ± 0.041
LeuIle: 7.582 ± 0.098
LeuLys: 6.064 ± 0.088
LeuLeu: 9.945 ± 0.143
LeuMet: 2.378 ± 0.056
LeuAsn: 4.374 ± 0.071
LeuPro: 3.764 ± 0.071
LeuGln: 3.87 ± 0.07
LeuArg: 3.935 ± 0.062
LeuSer: 5.959 ± 0.081
LeuThr: 5.304 ± 0.069
LeuVal: 6.373 ± 0.098
LeuTrp: 0.711 ± 0.027
LeuTyr: 3.197 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.926 ± 0.044
MetCys: 0.209 ± 0.016
MetAsp: 1.399 ± 0.039
MetGlu: 1.79 ± 0.043
MetPhe: 1.03 ± 0.035
MetGly: 1.741 ± 0.044
MetHis: 0.435 ± 0.022
MetIle: 2.003 ± 0.049
MetLys: 1.853 ± 0.049
MetLeu: 2.808 ± 0.064
MetMet: 0.701 ± 0.03
MetAsn: 1.283 ± 0.038
MetPro: 1.054 ± 0.037
MetGln: 1.075 ± 0.038
MetArg: 1.169 ± 0.034
MetSer: 1.632 ± 0.041
MetThr: 1.416 ± 0.041
MetVal: 1.81 ± 0.051
MetTrp: 0.156 ± 0.016
MetTyr: 0.871 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.826 ± 0.057
AsnCys: 0.404 ± 0.022
AsnAsp: 2.523 ± 0.053
AsnGlu: 3.335 ± 0.061
AsnPhe: 1.657 ± 0.042
AsnGly: 2.797 ± 0.066
AsnHis: 1.108 ± 0.034
AsnIle: 4.519 ± 0.078
AsnLys: 3.525 ± 0.067
AsnLeu: 4.239 ± 0.069
AsnMet: 1.333 ± 0.038
AsnAsn: 3.002 ± 0.07
AsnPro: 2.16 ± 0.056
AsnGln: 2.402 ± 0.064
AsnArg: 2.274 ± 0.053
AsnSer: 2.723 ± 0.062
AsnThr: 2.476 ± 0.052
AsnVal: 3.232 ± 0.061
AsnTrp: 0.492 ± 0.024
AsnTyr: 1.918 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.285 ± 0.067
ProCys: 0.249 ± 0.017
ProAsp: 1.915 ± 0.049
ProGlu: 2.844 ± 0.065
ProPhe: 1.527 ± 0.041
ProGly: 2.256 ± 0.055
ProHis: 0.708 ± 0.029
ProIle: 3.044 ± 0.059
ProLys: 1.966 ± 0.05
ProLeu: 3.053 ± 0.057
ProMet: 0.871 ± 0.033
ProAsn: 1.629 ± 0.042
ProPro: 0.955 ± 0.039
ProGln: 1.084 ± 0.034
ProArg: 1.112 ± 0.036
ProSer: 1.843 ± 0.051
ProThr: 1.936 ± 0.042
ProVal: 2.71 ± 0.064
ProTrp: 0.379 ± 0.021
ProTyr: 1.364 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.166 ± 0.065
GlnCys: 0.234 ± 0.017
GlnAsp: 1.79 ± 0.044
GlnGlu: 3.23 ± 0.072
GlnPhe: 1.444 ± 0.031
GlnGly: 2.462 ± 0.058
GlnHis: 0.791 ± 0.03
GlnIle: 3.202 ± 0.058
GlnLys: 2.59 ± 0.063
GlnLeu: 4.337 ± 0.075
GlnMet: 1.063 ± 0.038
GlnAsn: 1.543 ± 0.042
GlnPro: 1.263 ± 0.04
GlnGln: 2.359 ± 0.066
GlnArg: 1.793 ± 0.05
GlnSer: 2.068 ± 0.048
GlnThr: 2.006 ± 0.049
GlnVal: 2.747 ± 0.06
GlnTrp: 0.372 ± 0.018
GlnTyr: 1.392 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.769 ± 0.057
ArgCys: 0.312 ± 0.019
ArgAsp: 2.217 ± 0.046
ArgGlu: 3.095 ± 0.063
ArgPhe: 1.86 ± 0.046
ArgGly: 2.511 ± 0.055
ArgHis: 0.82 ± 0.028
ArgIle: 3.725 ± 0.065
ArgLys: 2.955 ± 0.064
ArgLeu: 4.101 ± 0.068
ArgMet: 1.209 ± 0.039
ArgAsn: 2.203 ± 0.052
ArgPro: 1.35 ± 0.039
ArgGln: 1.755 ± 0.042
ArgArg: 1.821 ± 0.049
ArgSer: 2.258 ± 0.049
ArgThr: 2.131 ± 0.047
ArgVal: 2.926 ± 0.066
ArgTrp: 0.419 ± 0.021
ArgTyr: 1.69 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.631 ± 0.06
SerCys: 0.506 ± 0.024
SerAsp: 2.866 ± 0.058
SerGlu: 3.769 ± 0.061
SerPhe: 2.847 ± 0.059
SerGly: 3.938 ± 0.075
SerHis: 1.058 ± 0.031
SerIle: 5.366 ± 0.082
SerLys: 3.767 ± 0.064
SerLeu: 5.783 ± 0.085
SerMet: 1.691 ± 0.042
SerAsn: 3.004 ± 0.067
SerPro: 1.774 ± 0.052
SerGln: 2.067 ± 0.049
SerArg: 2.308 ± 0.048
SerSer: 3.679 ± 0.081
SerThr: 3.184 ± 0.056
SerVal: 3.938 ± 0.058
SerTrp: 0.58 ± 0.028
SerTyr: 2.317 ± 0.047
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.689 ± 0.066
ThrCys: 0.486 ± 0.023
ThrAsp: 2.938 ± 0.055
ThrGlu: 3.646 ± 0.072
ThrPhe: 2.163 ± 0.056
ThrGly: 4.199 ± 0.089
ThrHis: 1.02 ± 0.037
ThrIle: 5.119 ± 0.075
ThrLys: 3.292 ± 0.061
ThrLeu: 4.841 ± 0.075
ThrMet: 1.34 ± 0.037
ThrAsn: 2.61 ± 0.066
ThrPro: 2.19 ± 0.05
ThrGln: 1.613 ± 0.041
ThrArg: 2.045 ± 0.046
ThrSer: 3.097 ± 0.06
ThrThr: 3.031 ± 0.065
ThrVal: 4.013 ± 0.07
ThrTrp: 0.466 ± 0.025
ThrTyr: 1.857 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.569 ± 0.093
ValCys: 0.626 ± 0.026
ValAsp: 3.962 ± 0.081
ValGlu: 4.953 ± 0.09
ValPhe: 3.078 ± 0.054
ValGly: 4.663 ± 0.075
ValHis: 1.239 ± 0.041
ValIle: 6.271 ± 0.084
ValLys: 4.2 ± 0.072
ValLeu: 6.639 ± 0.104
ValMet: 1.813 ± 0.048
ValAsn: 3.346 ± 0.062
ValPro: 2.355 ± 0.049
ValGln: 2.279 ± 0.056
ValArg: 2.876 ± 0.051
ValSer: 4.321 ± 0.081
ValThr: 4.012 ± 0.08
ValVal: 5.521 ± 0.105
ValTrp: 0.563 ± 0.025
ValTyr: 2.451 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.577 ± 0.025
TrpCys: 0.066 ± 0.01
TrpAsp: 0.49 ± 0.026
TrpGlu: 0.634 ± 0.029
TrpPhe: 0.386 ± 0.019
TrpGly: 0.594 ± 0.027
TrpHis: 0.196 ± 0.015
TrpIle: 0.658 ± 0.029
TrpLys: 0.545 ± 0.028
TrpLeu: 0.922 ± 0.036
TrpMet: 0.26 ± 0.02
TrpAsn: 0.484 ± 0.024
TrpPro: 0.225 ± 0.018
TrpGln: 0.443 ± 0.02
TrpArg: 0.371 ± 0.018
TrpSer: 0.516 ± 0.026
TrpThr: 0.435 ± 0.022
TrpVal: 0.567 ± 0.023
TrpTrp: 0.134 ± 0.012
TrpTyr: 0.355 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.068 ± 0.05
TyrCys: 0.372 ± 0.021
TyrAsp: 2.09 ± 0.052
TyrGlu: 2.494 ± 0.059
TyrPhe: 1.702 ± 0.047
TyrGly: 2.457 ± 0.049
TyrHis: 0.783 ± 0.031
TyrIle: 3.193 ± 0.065
TyrLys: 2.437 ± 0.056
TyrLeu: 3.576 ± 0.06
TyrMet: 0.891 ± 0.028
TyrAsn: 2.011 ± 0.056
TyrPro: 1.38 ± 0.039
TyrGln: 1.61 ± 0.046
TyrArg: 1.791 ± 0.049
TyrSer: 2.24 ± 0.052
TyrThr: 1.82 ± 0.051
TyrVal: 2.337 ± 0.057
TyrTrp: 0.359 ± 0.019
TyrTyr: 1.582 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2879 proteins (915468 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
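A protein-level bootstrap of this kind could look roughly like the sketch below. It resamples whole proteins with replacement, recomputes the per-mille frequency of one dipeptide in each replicate, and reports the mean and standard deviation across replicates. Taking the standard deviation as the ± value, the function names, the fixed seed, and the toy sequences are all assumptions made for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
mean, err = bootstrap_error(["MAAL", "MALA", "AAAA", "MKLV"], "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the spread reflects variation between proteins.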

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski