Amino acid dipeptide frequency for Mycobacterium pseudokansasii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
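
As an illustration of the counting described above, the following is a minimal sketch and not the pipeline actually used by Proteome-pI. It assumes that dipeptides are counted as overlapping pairs within each protein, that sequences are given in one-letter code, and that any residue outside the 20 standard ones is mapped to Xaa. The names `dipeptide_per_mille`, `mark` and `UNIFORM_EXPECTATION` are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # 'X' stands for Xaa (non-standard or unknown)

# Assumption: the "expected" value is the uniform one, 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X'.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: residues (0,1), (1,2), (2,3), ...
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def mark(freq_per_mille):
    """Colour a cell as the text describes: red above expectation, blue below."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "red"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "blue"
    return "none"


if __name__ == "__main__":
    toy = ["MAAGLA", "MKAAV"]  # made-up sequences, for illustration only
    freqs = dipeptide_per_mille(toy)
    print(round(freqs["AA"], 3), mark(freqs["AA"]))
```

With the values in the table, `mark(21.866)` would give "red" for AlaAla and `mark(0.13)` would give "blue" for CysCys.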

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
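
To make the unit conversion concrete, here is a small sketch that takes the AlaAla value from the table (21.866‰) and converts it to a fraction and a percentage. The comparison against 1/400 assumes the uniform expectation over the 400 standard dipeptides; the function name `per_mille_to_fraction` is hypothetical.

```python
PER_MILLE = 1e-3  # 1 per mille = 10^-3


def per_mille_to_fraction(value):
    """Convert a table value given in per mille to a plain fraction."""
    return value * PER_MILLE


ala_ala = 21.866  # AlaAla, taken from the table
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
# Assumption: expected fraction under a uniform model is 1/400.
fold_over_uniform = fraction / (1 / 400)

print(f"fraction: {fraction:.6f}")           # fraction: 0.021866
print(f"percent:  {percent:.4f}")            # percent:  2.1866
print(f"fold:     {fold_over_uniform:.2f}")  # fold:     8.75
```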

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
21.866AlaAla: 21.866 ± 0.174
1.076AlaCys: 1.076 ± 0.025
7.941AlaAsp: 7.941 ± 0.067
7.314AlaGlu: 7.314 ± 0.081
3.39AlaPhe: 3.39 ± 0.039
13.347AlaGly: 13.347 ± 0.192
2.688AlaHis: 2.688 ± 0.042
5.18AlaIle: 5.18 ± 0.058
2.831AlaLys: 2.831 ± 0.054
12.619AlaLeu: 12.619 ± 0.112
2.981AlaMet: 2.981 ± 0.036
2.949AlaAsn: 2.949 ± 0.069
6.465AlaPro: 6.465 ± 0.079
4.226AlaGln: 4.226 ± 0.055
8.817AlaArg: 8.817 ± 0.081
6.117AlaSer: 6.117 ± 0.067
7.531AlaThr: 7.531 ± 0.089
11.643AlaVal: 11.643 ± 0.106
1.707AlaTrp: 1.707 ± 0.028
2.341AlaTyr: 2.341 ± 0.033
0.0AlaXaa: 0.0 ± 0.0
Cys
1.153CysAla: 1.153 ± 0.029
0.13CysCys: 0.13 ± 0.01
0.581CysAsp: 0.581 ± 0.021
0.426CysGlu: 0.426 ± 0.016
0.242CysPhe: 0.242 ± 0.012
1.017CysGly: 1.017 ± 0.026
0.215CysHis: 0.215 ± 0.01
0.276CysIle: 0.276 ± 0.01
0.127CysLys: 0.127 ± 0.008
0.688CysLeu: 0.688 ± 0.019
0.151CysMet: 0.151 ± 0.008
0.198CysAsn: 0.198 ± 0.01
0.559CysPro: 0.559 ± 0.019
0.244CysGln: 0.244 ± 0.013
0.699CysArg: 0.699 ± 0.022
0.523CysSer: 0.523 ± 0.018
0.487CysThr: 0.487 ± 0.017
0.696CysVal: 0.696 ± 0.023
0.151CysTrp: 0.151 ± 0.01
0.208CysTyr: 0.208 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
7.121AspAla: 7.121 ± 0.07
0.482AspCys: 0.482 ± 0.018
3.72AspAsp: 3.72 ± 0.055
3.637AspGlu: 3.637 ± 0.051
1.707AspPhe: 1.707 ± 0.032
5.625AspGly: 5.625 ± 0.077
1.43AspHis: 1.43 ± 0.027
2.539AspIle: 2.539 ± 0.034
1.215AspLys: 1.215 ± 0.029
5.543AspLeu: 5.543 ± 0.056
0.951AspMet: 0.951 ± 0.021
1.256AspAsn: 1.256 ± 0.03
4.173AspPro: 4.173 ± 0.055
1.761AspGln: 1.761 ± 0.034
4.386AspArg: 4.386 ± 0.053
2.686AspSer: 2.686 ± 0.037
3.066AspThr: 3.066 ± 0.045
4.959AspVal: 4.959 ± 0.049
0.937AspTrp: 0.937 ± 0.022
1.434AspTyr: 1.434 ± 0.029
0.0AspXaa: 0.0 ± 0.0
Glu
5.62GluAla: 5.62 ± 0.081
0.379GluCys: 0.379 ± 0.015
2.093GluAsp: 2.093 ± 0.038
2.136GluGlu: 2.136 ± 0.041
1.691GluPhe: 1.691 ± 0.028
2.874GluGly: 2.874 ± 0.047
1.474GluHis: 1.474 ± 0.03
2.362GluIle: 2.362 ± 0.041
1.133GluLys: 1.133 ± 0.032
6.265GluLeu: 6.265 ± 0.066
1.02GluMet: 1.02 ± 0.026
0.995GluAsn: 0.995 ± 0.026
2.849GluPro: 2.849 ± 0.041
2.141GluGln: 2.141 ± 0.04
3.989GluArg: 3.989 ± 0.056
2.514GluSer: 2.514 ± 0.039
2.409GluThr: 2.409 ± 0.038
4.0GluVal: 4.0 ± 0.059
0.709GluTrp: 0.709 ± 0.02
1.066GluTyr: 1.066 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
3.922PheAla: 3.922 ± 0.05
0.316PheCys: 0.316 ± 0.014
2.294PheAsp: 2.294 ± 0.042
1.45PheGlu: 1.45 ± 0.028
1.038PhePhe: 1.038 ± 0.033
3.744PheGly: 3.744 ± 0.074
0.665PheHis: 0.665 ± 0.018
1.022PheIle: 1.022 ± 0.026
0.483PheLys: 0.483 ± 0.018
2.37PheLeu: 2.37 ± 0.041
0.49PheMet: 0.49 ± 0.017
0.916PheAsn: 0.916 ± 0.077
1.472PhePro: 1.472 ± 0.031
0.701PheGln: 0.701 ± 0.02
1.707PheArg: 1.707 ± 0.036
1.726PheSer: 1.726 ± 0.029
1.97PheThr: 1.97 ± 0.032
2.512PheVal: 2.512 ± 0.038
0.44PheTrp: 0.44 ± 0.016
0.694PheTyr: 0.694 ± 0.019
0.0PheXaa: 0.0 ± 0.0
Gly
11.053GlyAla: 11.053 ± 0.169
0.913GlyCys: 0.913 ± 0.027
4.978GlyAsp: 4.978 ± 0.068
3.966GlyGlu: 3.966 ± 0.048
3.279GlyPhe: 3.279 ± 0.082
10.299GlyGly: 10.299 ± 0.413
2.191GlyHis: 2.191 ± 0.034
4.282GlyIle: 4.282 ± 0.056
2.274GlyLys: 2.274 ± 0.04
8.915GlyLeu: 8.915 ± 0.09
2.27GlyMet: 2.27 ± 0.033
3.174GlyAsn: 3.174 ± 0.201
4.865GlyPro: 4.865 ± 0.062
3.102GlyGln: 3.102 ± 0.059
6.227GlyArg: 6.227 ± 0.064
5.977GlySer: 5.977 ± 0.107
5.484GlyThr: 5.484 ± 0.09
7.499GlyVal: 7.499 ± 0.086
1.695GlyTrp: 1.695 ± 0.037
2.49GlyTyr: 2.49 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.621HisAla: 2.621 ± 0.037
0.262HisCys: 0.262 ± 0.012
1.423HisAsp: 1.423 ± 0.03
0.985HisGlu: 0.985 ± 0.023
0.667HisPhe: 0.667 ± 0.021
2.204HisGly: 2.204 ± 0.038
0.744HisHis: 0.744 ± 0.025
0.856HisIle: 0.856 ± 0.023
0.378HisLys: 0.378 ± 0.013
2.113HisLeu: 2.113 ± 0.038
0.341HisMet: 0.341 ± 0.013
0.568HisAsn: 0.568 ± 0.021
1.769HisPro: 1.769 ± 0.039
0.742HisGln: 0.742 ± 0.02
2.029HisArg: 2.029 ± 0.035
1.128HisSer: 1.128 ± 0.025
1.285HisThr: 1.285 ± 0.026
1.72HisVal: 1.72 ± 0.03
0.406HisTrp: 0.406 ± 0.016
0.611HisTyr: 0.611 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.18IleAla: 6.18 ± 0.063
0.41IleCys: 0.41 ± 0.013
3.203IleAsp: 3.203 ± 0.04
2.414IleGlu: 2.414 ± 0.044
1.005IlePhe: 1.005 ± 0.023
4.637IleGly: 4.637 ± 0.085
0.815IleHis: 0.815 ± 0.02
1.418IleIle: 1.418 ± 0.031
0.88IleLys: 0.88 ± 0.024
2.846IleLeu: 2.846 ± 0.043
0.649IleMet: 0.649 ± 0.021
1.167IleAsn: 1.167 ± 0.032
2.561IlePro: 2.561 ± 0.051
0.936IleGln: 0.936 ± 0.023
2.669IleArg: 2.669 ± 0.039
2.359IleSer: 2.359 ± 0.04
2.836IleThr: 2.836 ± 0.046
3.493IleVal: 3.493 ± 0.044
0.464IleTrp: 0.464 ± 0.017
0.798IleTyr: 0.798 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
2.554LysAla: 2.554 ± 0.047
0.117LysCys: 0.117 ± 0.009
0.984LysAsp: 0.984 ± 0.028
0.82LysGlu: 0.82 ± 0.025
0.575LysPhe: 0.575 ± 0.02
1.469LysGly: 1.469 ± 0.036
0.53LysHis: 0.53 ± 0.021
0.938LysIle: 0.938 ± 0.023
0.573LysLys: 0.573 ± 0.023
2.131LysLeu: 2.131 ± 0.041
0.429LysMet: 0.429 ± 0.015
0.486LysAsn: 0.486 ± 0.02
1.449LysPro: 1.449 ± 0.031
0.694LysGln: 0.694 ± 0.022
1.54LysArg: 1.54 ± 0.034
1.143LysSer: 1.143 ± 0.024
1.276LysThr: 1.276 ± 0.031
1.805LysVal: 1.805 ± 0.029
0.301LysTrp: 0.301 ± 0.013
0.452LysTyr: 0.452 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
14.071LeuAla: 14.071 ± 0.114
0.847LeuCys: 0.847 ± 0.023
6.133LeuAsp: 6.133 ± 0.067
3.982LeuGlu: 3.982 ± 0.056
2.677LeuPhe: 2.677 ± 0.043
8.735LeuGly: 8.735 ± 0.082
2.082LeuHis: 2.082 ± 0.034
3.955LeuIle: 3.955 ± 0.052
1.693LeuLys: 1.693 ± 0.036
9.717LeuLeu: 9.717 ± 0.106
1.74LeuMet: 1.74 ± 0.029
2.288LeuAsn: 2.288 ± 0.052
6.027LeuPro: 6.027 ± 0.08
2.722LeuGln: 2.722 ± 0.04
7.625LeuArg: 7.625 ± 0.082
5.745LeuSer: 5.745 ± 0.057
6.293LeuThr: 6.293 ± 0.06
8.202LeuVal: 8.202 ± 0.074
1.245LeuTrp: 1.245 ± 0.028
1.706LeuTyr: 1.706 ± 0.031
0.0LeuXaa: 0.0 ± 0.0
Met
2.652MetAla: 2.652 ± 0.038
0.184MetCys: 0.184 ± 0.011
0.846MetAsp: 0.846 ± 0.021
0.667MetGlu: 0.667 ± 0.019
0.662MetPhe: 0.662 ± 0.019
1.506MetGly: 1.506 ± 0.03
0.43MetHis: 0.43 ± 0.018
0.858MetIle: 0.858 ± 0.021
0.422MetLys: 0.422 ± 0.014
2.141MetLeu: 2.141 ± 0.035
0.447MetMet: 0.447 ± 0.017
0.496MetAsn: 0.496 ± 0.022
1.277MetPro: 1.277 ± 0.033
0.572MetGln: 0.572 ± 0.019
1.473MetArg: 1.473 ± 0.029
1.619MetSer: 1.619 ± 0.03
1.715MetThr: 1.715 ± 0.031
1.601MetVal: 1.601 ± 0.034
0.323MetTrp: 0.323 ± 0.014
0.38MetTyr: 0.38 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.987AsnAla: 2.987 ± 0.053
0.204AsnCys: 0.204 ± 0.009
1.216AsnAsp: 1.216 ± 0.024
0.949AsnGlu: 0.949 ± 0.023
0.801AsnPhe: 0.801 ± 0.034
2.719AsnGly: 2.719 ± 0.081
0.522AsnHis: 0.522 ± 0.018
1.12AsnIle: 1.12 ± 0.067
0.449AsnLys: 0.449 ± 0.018
2.246AsnLeu: 2.246 ± 0.053
0.412AsnMet: 0.412 ± 0.016
0.733AsnAsn: 0.733 ± 0.055
1.83AsnPro: 1.83 ± 0.033
0.772AsnGln: 0.772 ± 0.025
1.654AsnArg: 1.654 ± 0.031
1.525AsnSer: 1.525 ± 0.092
1.675AsnThr: 1.675 ± 0.109
1.907AsnVal: 1.907 ± 0.07
0.382AsnTrp: 0.382 ± 0.015
0.565AsnTyr: 0.565 ± 0.021
0.0AsnXaa: 0.0 ± 0.0
Pro
7.82ProAla: 7.82 ± 0.088
0.34ProCys: 0.34 ± 0.013
4.126ProAsp: 4.126 ± 0.051
3.44ProGlu: 3.44 ± 0.041
1.644ProPhe: 1.644 ± 0.033
6.233ProGly: 6.233 ± 0.073
1.287ProHis: 1.287 ± 0.026
2.233ProIle: 2.233 ± 0.043
1.234ProLys: 1.234 ± 0.026
5.096ProLeu: 5.096 ± 0.056
1.209ProMet: 1.209 ± 0.025
1.312ProAsn: 1.312 ± 0.027
4.242ProPro: 4.242 ± 0.087
1.957ProGln: 1.957 ± 0.032
3.614ProArg: 3.614 ± 0.048
3.216ProSer: 3.216 ± 0.045
3.592ProThr: 3.592 ± 0.048
5.085ProVal: 5.085 ± 0.058
0.882ProTrp: 0.882 ± 0.022
1.191ProTyr: 1.191 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.159GlnAla: 4.159 ± 0.056
0.254GlnCys: 0.254 ± 0.012
1.231GlnAsp: 1.231 ± 0.029
1.098GlnGlu: 1.098 ± 0.024
0.969GlnPhe: 0.969 ± 0.023
2.09GlnGly: 2.09 ± 0.045
0.834GlnHis: 0.834 ± 0.018
1.486GlnIle: 1.486 ± 0.031
0.538GlnLys: 0.538 ± 0.021
3.916GlnLeu: 3.916 ± 0.057
0.669GlnMet: 0.669 ± 0.018
0.652GlnAsn: 0.652 ± 0.02
2.072GlnPro: 2.072 ± 0.034
1.461GlnGln: 1.461 ± 0.029
2.935GlnArg: 2.935 ± 0.044
1.531GlnSer: 1.531 ± 0.03
1.677GlnThr: 1.677 ± 0.032
2.546GlnVal: 2.546 ± 0.039
0.552GlnTrp: 0.552 ± 0.018
0.655GlnTyr: 0.655 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
8.29ArgAla: 8.29 ± 0.078
0.733ArgCys: 0.733 ± 0.02
4.05ArgAsp: 4.05 ± 0.052
3.458ArgGlu: 3.458 ± 0.057
2.43ArgPhe: 2.43 ± 0.039
5.149ArgGly: 5.149 ± 0.062
1.92ArgHis: 1.92 ± 0.032
3.373ArgIle: 3.373 ± 0.048
1.539ArgLys: 1.539 ± 0.033
7.87ArgLeu: 7.87 ± 0.09
1.777ArgMet: 1.777 ± 0.033
1.7ArgAsn: 1.7 ± 0.03
4.177ArgPro: 4.177 ± 0.051
2.504ArgGln: 2.504 ± 0.042
6.912ArgArg: 6.912 ± 0.101
4.046ArgSer: 4.046 ± 0.049
3.832ArgThr: 3.832 ± 0.052
5.473ArgVal: 5.473 ± 0.075
1.389ArgTrp: 1.389 ± 0.03
2.009ArgTyr: 2.009 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
7.445SerAla: 7.445 ± 0.067
0.479SerCys: 0.479 ± 0.019
2.952SerAsp: 2.952 ± 0.046
2.337SerGlu: 2.337 ± 0.043
1.625SerPhe: 1.625 ± 0.034
6.385SerGly: 6.385 ± 0.16
1.088SerHis: 1.088 ± 0.024
2.155SerIle: 2.155 ± 0.036
1.092SerLys: 1.092 ± 0.029
4.893SerLeu: 4.893 ± 0.057
1.324SerMet: 1.324 ± 0.029
1.237SerAsn: 1.237 ± 0.031
3.286SerPro: 3.286 ± 0.043
1.616SerGln: 1.616 ± 0.029
3.787SerArg: 3.787 ± 0.052
3.264SerSer: 3.264 ± 0.049
3.371SerThr: 3.371 ± 0.046
4.475SerVal: 4.475 ± 0.048
0.993SerTrp: 0.993 ± 0.025
1.353SerTyr: 1.353 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
7.965ThrAla: 7.965 ± 0.082
0.441ThrCys: 0.441 ± 0.015
3.418ThrAsp: 3.418 ± 0.044
2.797ThrGlu: 2.797 ± 0.044
1.712ThrPhe: 1.712 ± 0.029
6.615ThrGly: 6.615 ± 0.144
1.215ThrHis: 1.215 ± 0.027
2.508ThrIle: 2.508 ± 0.045
1.182ThrLys: 1.182 ± 0.031
5.395ThrLeu: 5.395 ± 0.065
1.241ThrMet: 1.241 ± 0.025
1.415ThrAsn: 1.415 ± 0.042
3.86ThrPro: 3.86 ± 0.056
1.603ThrGln: 1.603 ± 0.033
3.582ThrArg: 3.582 ± 0.053
3.191ThrSer: 3.191 ± 0.054
3.907ThrThr: 3.907 ± 0.064
5.625ThrVal: 5.625 ± 0.066
0.831ThrTrp: 0.831 ± 0.023
1.232ThrTyr: 1.232 ± 0.025
0.0ThrXaa: 0.0 ± 0.0
Val
11.486ValAla: 11.486 ± 0.097
0.808ValCys: 0.808 ± 0.025
5.321ValAsp: 5.321 ± 0.056
4.163ValGlu: 4.163 ± 0.046
2.555ValPhe: 2.555 ± 0.041
7.17ValGly: 7.17 ± 0.107
1.774ValHis: 1.774 ± 0.032
3.889ValIle: 3.889 ± 0.044
1.605ValLys: 1.605 ± 0.032
8.549ValLeu: 8.549 ± 0.076
1.56ValMet: 1.56 ± 0.028
2.097ValAsn: 2.097 ± 0.041
4.622ValPro: 4.622 ± 0.055
2.088ValGln: 2.088 ± 0.039
5.717ValArg: 5.717 ± 0.064
4.758ValSer: 4.758 ± 0.053
5.301ValThr: 5.301 ± 0.056
8.336ValVal: 8.336 ± 0.091
1.06ValTrp: 1.06 ± 0.026
1.505ValTyr: 1.505 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.583TrpAla: 1.583 ± 0.029
0.16TrpCys: 0.16 ± 0.01
0.772TrpAsp: 0.772 ± 0.021
0.674TrpGlu: 0.674 ± 0.021
0.503TrpPhe: 0.503 ± 0.019
1.008TrpGly: 1.008 ± 0.027
0.384TrpHis: 0.384 ± 0.015
0.657TrpIle: 0.657 ± 0.019
0.301TrpLys: 0.301 ± 0.015
1.845TrpLeu: 1.845 ± 0.036
0.347TrpMet: 0.347 ± 0.013
0.458TrpAsn: 0.458 ± 0.016
0.866TrpPro: 0.866 ± 0.023
0.662TrpGln: 0.662 ± 0.021
1.346TrpArg: 1.346 ± 0.031
0.926TrpSer: 0.926 ± 0.023
0.867TrpThr: 0.867 ± 0.024
1.126TrpVal: 1.126 ± 0.022
0.361TrpTrp: 0.361 ± 0.015
0.343TrpTyr: 0.343 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.48TyrAla: 2.48 ± 0.035
0.238TyrCys: 0.238 ± 0.011
1.344TyrAsp: 1.344 ± 0.03
1.005TyrGlu: 1.005 ± 0.025
0.761TyrPhe: 0.761 ± 0.02
2.106TyrGly: 2.106 ± 0.041
0.565TyrHis: 0.565 ± 0.017
0.634TyrIle: 0.634 ± 0.018
0.317TyrLys: 0.317 ± 0.013
2.329TyrLeu: 2.329 ± 0.04
0.261TyrMet: 0.261 ± 0.013
0.573TyrAsn: 0.573 ± 0.037
1.331TyrPro: 1.331 ± 0.029
0.823TyrGln: 0.823 ± 0.023
1.918TyrArg: 1.918 ± 0.034
1.161TyrSer: 1.161 ± 0.026
1.161TyrThr: 1.161 ± 0.023
1.659TyrVal: 1.659 ± 0.032
0.363TyrTrp: 0.363 ± 0.013
0.531TyrTyr: 0.531 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5730 proteins (1855580 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
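
The note above can be sketched as follows. This is only an illustration under stated assumptions, not the code used to produce the table: it assumes that "at the protein level" means resampling whole proteins with replacement, that the reported ± value is the standard deviation over the 100 resamples, and that dipeptides are counted as overlapping pairs. The names `dipeptide_frequency` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over `rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAAGLA", "MKAAV", "MGGSAA", "MLLTV"]  # made-up sequences
    freq = dipeptide_frequency(toy, "AA")
    err = bootstrap_error(toy, "AA")
    print(f"AA: {freq:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.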

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski