Amino acid dipepetide frequency for Mycobacterium sp. THAF192

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.552AlaAla: 20.552 ± 0.168
0.965AlaCys: 0.965 ± 0.024
9.087AlaAsp: 9.087 ± 0.08
8.215AlaGlu: 8.215 ± 0.082
3.445AlaPhe: 3.445 ± 0.049
11.991AlaGly: 11.991 ± 0.101
2.61AlaHis: 2.61 ± 0.041
4.877AlaIle: 4.877 ± 0.057
2.653AlaLys: 2.653 ± 0.046
13.321AlaLeu: 13.321 ± 0.096
2.908AlaMet: 2.908 ± 0.044
2.441AlaAsn: 2.441 ± 0.039
6.597AlaPro: 6.597 ± 0.087
4.142AlaGln: 4.142 ± 0.051
8.646AlaArg: 8.646 ± 0.079
5.948AlaSer: 5.948 ± 0.07
7.527AlaThr: 7.527 ± 0.079
12.283AlaVal: 12.283 ± 0.107
1.679AlaTrp: 1.679 ± 0.035
2.276AlaTyr: 2.276 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
1.083CysAla: 1.083 ± 0.025
0.096CysCys: 0.096 ± 0.007
0.555CysAsp: 0.555 ± 0.018
0.394CysGlu: 0.394 ± 0.017
0.199CysPhe: 0.199 ± 0.01
0.922CysGly: 0.922 ± 0.021
0.173CysHis: 0.173 ± 0.012
0.262CysIle: 0.262 ± 0.012
0.104CysLys: 0.104 ± 0.008
0.654CysLeu: 0.654 ± 0.02
0.127CysMet: 0.127 ± 0.008
0.146CysAsn: 0.146 ± 0.01
0.471CysPro: 0.471 ± 0.018
0.219CysGln: 0.219 ± 0.01
0.599CysArg: 0.599 ± 0.02
0.46CysSer: 0.46 ± 0.016
0.503CysThr: 0.503 ± 0.017
0.661CysVal: 0.661 ± 0.019
0.131CysTrp: 0.131 ± 0.009
0.188CysTyr: 0.188 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
8.242AspAla: 8.242 ± 0.078
0.46AspCys: 0.46 ± 0.017
4.923AspAsp: 4.923 ± 0.063
4.396AspGlu: 4.396 ± 0.063
1.832AspPhe: 1.832 ± 0.034
6.333AspGly: 6.333 ± 0.068
1.447AspHis: 1.447 ± 0.028
2.62AspIle: 2.62 ± 0.041
1.25AspLys: 1.25 ± 0.031
6.041AspLeu: 6.041 ± 0.057
1.094AspMet: 1.094 ± 0.021
1.284AspAsn: 1.284 ± 0.028
4.646AspPro: 4.646 ± 0.055
1.792AspGln: 1.792 ± 0.031
4.739AspArg: 4.739 ± 0.055
3.114AspSer: 3.114 ± 0.045
3.628AspThr: 3.628 ± 0.049
5.584AspVal: 5.584 ± 0.063
1.025AspTrp: 1.025 ± 0.023
1.454AspTyr: 1.454 ± 0.028
0.0AspXaa: 0.0 ± 0.0
Glu
6.306GluAla: 6.306 ± 0.066
0.343GluCys: 0.343 ± 0.015
2.659GluAsp: 2.659 ± 0.04
2.511GluGlu: 2.511 ± 0.041
1.858GluPhe: 1.858 ± 0.031
3.385GluGly: 3.385 ± 0.049
1.584GluHis: 1.584 ± 0.033
2.512GluIle: 2.512 ± 0.042
1.256GluLys: 1.256 ± 0.03
6.553GluLeu: 6.553 ± 0.062
1.075GluMet: 1.075 ± 0.026
1.104GluAsn: 1.104 ± 0.024
3.23GluPro: 3.23 ± 0.047
2.406GluGln: 2.406 ± 0.042
4.5GluArg: 4.5 ± 0.055
2.789GluSer: 2.789 ± 0.036
2.758GluThr: 2.758 ± 0.041
4.523GluVal: 4.523 ± 0.055
0.759GluTrp: 0.759 ± 0.02
1.038GluTyr: 1.038 ± 0.023
0.0GluXaa: 0.0 ± 0.0
Phe
3.977PheAla: 3.977 ± 0.052
0.315PheCys: 0.315 ± 0.015
2.475PheAsp: 2.475 ± 0.033
1.497PheGlu: 1.497 ± 0.032
0.982PhePhe: 0.982 ± 0.028
3.352PheGly: 3.352 ± 0.045
0.623PheHis: 0.623 ± 0.018
1.045PheIle: 1.045 ± 0.029
0.487PheLys: 0.487 ± 0.019
2.6PheLeu: 2.6 ± 0.04
0.49PheMet: 0.49 ± 0.018
0.715PheAsn: 0.715 ± 0.022
1.407PhePro: 1.407 ± 0.032
0.647PheGln: 0.647 ± 0.021
1.668PheArg: 1.668 ± 0.03
1.674PheSer: 1.674 ± 0.034
2.241PheThr: 2.241 ± 0.037
2.618PheVal: 2.618 ± 0.043
0.467PheTrp: 0.467 ± 0.018
0.7PheTyr: 0.7 ± 0.022
0.0PheXaa: 0.0 ± 0.0
Gly
10.381GlyAla: 10.381 ± 0.099
0.796GlyCys: 0.796 ± 0.021
5.367GlyAsp: 5.367 ± 0.075
4.613GlyGlu: 4.613 ± 0.054
3.036GlyPhe: 3.036 ± 0.044
7.749GlyGly: 7.749 ± 0.111
2.015GlyHis: 2.015 ± 0.031
3.881GlyIle: 3.881 ± 0.047
2.061GlyLys: 2.061 ± 0.042
8.49GlyLeu: 8.49 ± 0.072
2.175GlyMet: 2.175 ± 0.037
1.873GlyAsn: 1.873 ± 0.043
4.602GlyPro: 4.602 ± 0.058
2.776GlyGln: 2.776 ± 0.041
6.34GlyArg: 6.34 ± 0.065
5.292GlySer: 5.292 ± 0.061
5.175GlyThr: 5.175 ± 0.055
7.721GlyVal: 7.721 ± 0.07
1.647GlyTrp: 1.647 ± 0.032
2.309GlyTyr: 2.309 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
2.44HisAla: 2.44 ± 0.04
0.216HisCys: 0.216 ± 0.013
1.351HisAsp: 1.351 ± 0.029
1.066HisGlu: 1.066 ± 0.026
0.668HisPhe: 0.668 ± 0.02
2.127HisGly: 2.127 ± 0.037
0.704HisHis: 0.704 ± 0.024
0.799HisIle: 0.799 ± 0.022
0.343HisLys: 0.343 ± 0.013
2.185HisLeu: 2.185 ± 0.037
0.361HisMet: 0.361 ± 0.014
0.458HisAsn: 0.458 ± 0.015
1.609HisPro: 1.609 ± 0.033
0.625HisGln: 0.625 ± 0.021
1.967HisArg: 1.967 ± 0.035
1.095HisSer: 1.095 ± 0.024
1.242HisThr: 1.242 ± 0.026
1.739HisVal: 1.739 ± 0.034
0.371HisTrp: 0.371 ± 0.015
0.522HisTyr: 0.522 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
6.1IleAla: 6.1 ± 0.065
0.372IleCys: 0.372 ± 0.013
3.441IleAsp: 3.441 ± 0.05
2.509IleGlu: 2.509 ± 0.038
0.951IlePhe: 0.951 ± 0.028
4.295IleGly: 4.295 ± 0.054
0.723IleHis: 0.723 ± 0.02
1.363IleIle: 1.363 ± 0.033
0.8IleLys: 0.8 ± 0.022
2.994IleLeu: 2.994 ± 0.042
0.612IleMet: 0.612 ± 0.02
1.058IleAsn: 1.058 ± 0.023
2.198IlePro: 2.198 ± 0.034
0.892IleGln: 0.892 ± 0.023
2.522IleArg: 2.522 ± 0.04
2.174IleSer: 2.174 ± 0.035
2.781IleThr: 2.781 ± 0.046
3.574IleVal: 3.574 ± 0.047
0.49IleTrp: 0.49 ± 0.018
0.764IleTyr: 0.764 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
2.413LysAla: 2.413 ± 0.045
0.103LysCys: 0.103 ± 0.008
1.033LysAsp: 1.033 ± 0.028
0.879LysGlu: 0.879 ± 0.024
0.557LysPhe: 0.557 ± 0.018
1.397LysGly: 1.397 ± 0.028
0.455LysHis: 0.455 ± 0.014
0.907LysIle: 0.907 ± 0.023
0.619LysLys: 0.619 ± 0.026
2.038LysLeu: 2.038 ± 0.032
0.454LysMet: 0.454 ± 0.015
0.492LysAsn: 0.492 ± 0.017
1.287LysPro: 1.287 ± 0.029
0.66LysGln: 0.66 ± 0.018
1.524LysArg: 1.524 ± 0.033
1.19LysSer: 1.19 ± 0.034
1.285LysThr: 1.285 ± 0.028
1.75LysVal: 1.75 ± 0.032
0.29LysTrp: 0.29 ± 0.013
0.415LysTyr: 0.415 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
14.035LeuAla: 14.035 ± 0.11
0.799LeuCys: 0.799 ± 0.021
6.607LeuAsp: 6.607 ± 0.065
4.403LeuGlu: 4.403 ± 0.053
2.64LeuPhe: 2.64 ± 0.043
8.609LeuGly: 8.609 ± 0.076
2.027LeuHis: 2.027 ± 0.038
4.076LeuIle: 4.076 ± 0.048
1.678LeuLys: 1.678 ± 0.033
9.477LeuLeu: 9.477 ± 0.101
1.793LeuMet: 1.793 ± 0.032
1.977LeuAsn: 1.977 ± 0.031
5.63LeuPro: 5.63 ± 0.061
2.583LeuGln: 2.583 ± 0.045
7.544LeuArg: 7.544 ± 0.065
5.667LeuSer: 5.667 ± 0.062
6.519LeuThr: 6.519 ± 0.057
8.43LeuVal: 8.43 ± 0.077
1.229LeuTrp: 1.229 ± 0.027
1.646LeuTyr: 1.646 ± 0.032
0.0LeuXaa: 0.0 ± 0.0
Met
2.711MetAla: 2.711 ± 0.045
0.192MetCys: 0.192 ± 0.01
0.876MetAsp: 0.876 ± 0.021
0.695MetGlu: 0.695 ± 0.018
0.663MetPhe: 0.663 ± 0.018
1.549MetGly: 1.549 ± 0.03
0.402MetHis: 0.402 ± 0.016
0.918MetIle: 0.918 ± 0.026
0.443MetLys: 0.443 ± 0.016
2.069MetLeu: 2.069 ± 0.038
0.443MetMet: 0.443 ± 0.016
0.495MetAsn: 0.495 ± 0.017
1.268MetPro: 1.268 ± 0.027
0.585MetGln: 0.585 ± 0.017
1.572MetArg: 1.572 ± 0.029
1.598MetSer: 1.598 ± 0.027
1.891MetThr: 1.891 ± 0.033
1.668MetVal: 1.668 ± 0.032
0.263MetTrp: 0.263 ± 0.012
0.344MetTyr: 0.344 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.543AsnAla: 2.543 ± 0.038
0.169AsnCys: 0.169 ± 0.009
1.24AsnAsp: 1.24 ± 0.025
0.994AsnGlu: 0.994 ± 0.023
0.631AsnPhe: 0.631 ± 0.02
1.926AsnGly: 1.926 ± 0.036
0.455AsnHis: 0.455 ± 0.016
0.887AsnIle: 0.887 ± 0.023
0.423AsnLys: 0.423 ± 0.016
1.948AsnLeu: 1.948 ± 0.033
0.427AsnMet: 0.427 ± 0.016
0.552AsnAsn: 0.552 ± 0.019
1.723AsnPro: 1.723 ± 0.034
0.641AsnGln: 0.641 ± 0.019
1.578AsnArg: 1.578 ± 0.028
1.082AsnSer: 1.082 ± 0.025
1.305AsnThr: 1.305 ± 0.029
1.741AsnVal: 1.741 ± 0.032
0.378AsnTrp: 0.378 ± 0.013
0.509AsnTyr: 0.509 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
7.762ProAla: 7.762 ± 0.098
0.307ProCys: 0.307 ± 0.013
4.696ProAsp: 4.696 ± 0.054
3.666ProGlu: 3.666 ± 0.048
1.585ProPhe: 1.585 ± 0.027
5.656ProGly: 5.656 ± 0.064
1.233ProHis: 1.233 ± 0.028
1.956ProIle: 1.956 ± 0.034
1.142ProLys: 1.142 ± 0.027
4.796ProLeu: 4.796 ± 0.056
1.247ProMet: 1.247 ± 0.027
1.217ProAsn: 1.217 ± 0.029
3.922ProPro: 3.922 ± 0.096
1.866ProGln: 1.866 ± 0.033
3.505ProArg: 3.505 ± 0.042
3.119ProSer: 3.119 ± 0.049
3.487ProThr: 3.487 ± 0.046
5.217ProVal: 5.217 ± 0.057
0.857ProTrp: 0.857 ± 0.023
1.12ProTyr: 1.12 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
3.683GlnAla: 3.683 ± 0.051
0.236GlnCys: 0.236 ± 0.012
1.293GlnAsp: 1.293 ± 0.029
1.135GlnGlu: 1.135 ± 0.023
0.999GlnPhe: 0.999 ± 0.023
2.012GlnGly: 2.012 ± 0.028
0.736GlnHis: 0.736 ± 0.022
1.527GlnIle: 1.527 ± 0.029
0.611GlnLys: 0.611 ± 0.019
3.391GlnLeu: 3.391 ± 0.045
0.736GlnMet: 0.736 ± 0.02
0.627GlnAsn: 0.627 ± 0.02
1.894GlnPro: 1.894 ± 0.034
1.366GlnGln: 1.366 ± 0.033
2.82GlnArg: 2.82 ± 0.046
1.543GlnSer: 1.543 ± 0.03
1.655GlnThr: 1.655 ± 0.032
2.496GlnVal: 2.496 ± 0.041
0.603GlnTrp: 0.603 ± 0.016
0.645GlnTyr: 0.645 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
8.347ArgAla: 8.347 ± 0.081
0.636ArgCys: 0.636 ± 0.019
4.498ArgAsp: 4.498 ± 0.047
3.803ArgGlu: 3.803 ± 0.05
2.425ArgPhe: 2.425 ± 0.036
5.204ArgGly: 5.204 ± 0.059
1.838ArgHis: 1.838 ± 0.032
3.362ArgIle: 3.362 ± 0.046
1.449ArgLys: 1.449 ± 0.031
7.389ArgLeu: 7.389 ± 0.068
1.832ArgMet: 1.832 ± 0.03
1.601ArgAsn: 1.601 ± 0.031
4.097ArgPro: 4.097 ± 0.051
2.367ArgGln: 2.367 ± 0.042
6.945ArgArg: 6.945 ± 0.089
4.087ArgSer: 4.087 ± 0.05
4.297ArgThr: 4.297 ± 0.048
5.533ArgVal: 5.533 ± 0.054
1.409ArgTrp: 1.409 ± 0.033
1.887ArgTyr: 1.887 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
7.386SerAla: 7.386 ± 0.072
0.396SerCys: 0.396 ± 0.015
3.46SerAsp: 3.46 ± 0.044
2.672SerGlu: 2.672 ± 0.038
1.662SerPhe: 1.662 ± 0.028
5.754SerGly: 5.754 ± 0.06
1.039SerHis: 1.039 ± 0.023
2.094SerIle: 2.094 ± 0.035
1.024SerLys: 1.024 ± 0.022
4.635SerLeu: 4.635 ± 0.052
1.356SerMet: 1.356 ± 0.029
1.078SerAsn: 1.078 ± 0.024
3.191SerPro: 3.191 ± 0.042
1.476SerGln: 1.476 ± 0.029
3.704SerArg: 3.704 ± 0.05
3.207SerSer: 3.207 ± 0.048
3.37SerThr: 3.37 ± 0.045
4.614SerVal: 4.614 ± 0.052
0.895SerTrp: 0.895 ± 0.022
1.202SerTyr: 1.202 ± 0.029
0.0SerXaa: 0.0 ± 0.0
Thr
8.297ThrAla: 8.297 ± 0.084
0.422ThrCys: 0.422 ± 0.014
4.077ThrAsp: 4.077 ± 0.052
3.332ThrGlu: 3.332 ± 0.049
1.906ThrPhe: 1.906 ± 0.038
5.947ThrGly: 5.947 ± 0.062
1.14ThrHis: 1.14 ± 0.028
2.332ThrIle: 2.332 ± 0.043
1.162ThrLys: 1.162 ± 0.027
5.747ThrLeu: 5.747 ± 0.059
1.222ThrMet: 1.222 ± 0.027
1.213ThrAsn: 1.213 ± 0.025
3.837ThrPro: 3.837 ± 0.051
1.516ThrGln: 1.516 ± 0.028
3.786ThrArg: 3.786 ± 0.044
3.282ThrSer: 3.282 ± 0.05
4.042ThrThr: 4.042 ± 0.066
6.265ThrVal: 6.265 ± 0.068
0.873ThrTrp: 0.873 ± 0.023
1.313ThrTyr: 1.313 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
11.959ValAla: 11.959 ± 0.093
0.758ValCys: 0.758 ± 0.021
6.076ValAsp: 6.076 ± 0.061
4.635ValGlu: 4.635 ± 0.047
2.662ValPhe: 2.662 ± 0.044
7.117ValGly: 7.117 ± 0.077
1.872ValHis: 1.872 ± 0.034
3.922ValIle: 3.922 ± 0.046
1.578ValLys: 1.578 ± 0.034
9.006ValLeu: 9.006 ± 0.079
1.674ValMet: 1.674 ± 0.032
1.977ValAsn: 1.977 ± 0.039
4.691ValPro: 4.691 ± 0.051
2.105ValGln: 2.105 ± 0.034
5.888ValArg: 5.888 ± 0.056
4.838ValSer: 4.838 ± 0.054
5.825ValThr: 5.825 ± 0.066
8.743ValVal: 8.743 ± 0.082
1.051ValTrp: 1.051 ± 0.022
1.598ValTyr: 1.598 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.612TrpAla: 1.612 ± 0.03
0.157TrpCys: 0.157 ± 0.011
0.809TrpAsp: 0.809 ± 0.026
0.679TrpGlu: 0.679 ± 0.02
0.523TrpPhe: 0.523 ± 0.02
1.035TrpGly: 1.035 ± 0.024
0.361TrpHis: 0.361 ± 0.014
0.653TrpIle: 0.653 ± 0.02
0.315TrpLys: 0.315 ± 0.014
1.763TrpLeu: 1.763 ± 0.031
0.334TrpMet: 0.334 ± 0.013
0.394TrpAsn: 0.394 ± 0.013
0.826TrpPro: 0.826 ± 0.02
0.638TrpGln: 0.638 ± 0.02
1.289TrpArg: 1.289 ± 0.023
0.933TrpSer: 0.933 ± 0.023
0.94TrpThr: 0.94 ± 0.021
1.181TrpVal: 1.181 ± 0.024
0.363TrpTrp: 0.363 ± 0.014
0.318TrpTyr: 0.318 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.334TyrAla: 2.334 ± 0.037
0.211TyrCys: 0.211 ± 0.011
1.381TyrAsp: 1.381 ± 0.033
1.054TyrGlu: 1.054 ± 0.025
0.754TyrPhe: 0.754 ± 0.02
1.942TyrGly: 1.942 ± 0.039
0.453TyrHis: 0.453 ± 0.016
0.617TyrIle: 0.617 ± 0.018
0.348TyrLys: 0.348 ± 0.014
2.274TyrLeu: 2.274 ± 0.035
0.284TyrMet: 0.284 ± 0.013
0.449TyrAsn: 0.449 ± 0.017
1.23TyrPro: 1.23 ± 0.031
0.709TyrGln: 0.709 ± 0.022
1.866TyrArg: 1.866 ± 0.033
1.122TyrSer: 1.122 ± 0.025
1.241TyrThr: 1.241 ± 0.03
1.634TyrVal: 1.634 ± 0.03
0.346TyrTrp: 0.346 ± 0.015
0.494TyrTyr: 0.494 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5641 proteins (1818666 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski