Amino acid dipepetide frequency for Mycolicibacterium fortuitum (Mycobacterium fortuitum)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.807AlaAla: 19.807 ± 0.159
0.959AlaCys: 0.959 ± 0.026
8.515AlaAsp: 8.515 ± 0.066
7.991AlaGlu: 7.991 ± 0.075
3.492AlaPhe: 3.492 ± 0.046
11.918AlaGly: 11.918 ± 0.086
2.563AlaHis: 2.563 ± 0.034
5.259AlaIle: 5.259 ± 0.059
3.148AlaLys: 3.148 ± 0.046
12.907AlaLeu: 12.907 ± 0.091
2.988AlaMet: 2.988 ± 0.048
2.636AlaAsn: 2.636 ± 0.045
6.449AlaPro: 6.449 ± 0.095
4.108AlaGln: 4.108 ± 0.045
8.106AlaArg: 8.106 ± 0.076
5.869AlaSer: 5.869 ± 0.066
7.117AlaThr: 7.117 ± 0.064
11.765AlaVal: 11.765 ± 0.098
1.588AlaTrp: 1.588 ± 0.032
2.324AlaTyr: 2.324 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
1.047CysAla: 1.047 ± 0.029
0.089CysCys: 0.089 ± 0.007
0.517CysAsp: 0.517 ± 0.016
0.421CysGlu: 0.421 ± 0.015
0.212CysPhe: 0.212 ± 0.01
0.883CysGly: 0.883 ± 0.022
0.181CysHis: 0.181 ± 0.011
0.271CysIle: 0.271 ± 0.013
0.127CysLys: 0.127 ± 0.009
0.654CysLeu: 0.654 ± 0.018
0.132CysMet: 0.132 ± 0.008
0.166CysAsn: 0.166 ± 0.009
0.497CysPro: 0.497 ± 0.02
0.215CysGln: 0.215 ± 0.011
0.548CysArg: 0.548 ± 0.017
0.46CysSer: 0.46 ± 0.016
0.489CysThr: 0.489 ± 0.016
0.636CysVal: 0.636 ± 0.019
0.121CysTrp: 0.121 ± 0.007
0.174CysTyr: 0.174 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
7.588AspAla: 7.588 ± 0.067
0.433AspCys: 0.433 ± 0.016
4.16AspAsp: 4.16 ± 0.056
4.223AspGlu: 4.223 ± 0.058
1.828AspPhe: 1.828 ± 0.033
5.869AspGly: 5.869 ± 0.063
1.448AspHis: 1.448 ± 0.029
2.655AspIle: 2.655 ± 0.039
1.46AspLys: 1.46 ± 0.029
6.007AspLeu: 6.007 ± 0.06
1.066AspMet: 1.066 ± 0.027
1.344AspAsn: 1.344 ± 0.028
4.393AspPro: 4.393 ± 0.048
1.846AspGln: 1.846 ± 0.036
4.498AspArg: 4.498 ± 0.054
2.974AspSer: 2.974 ± 0.041
3.282AspThr: 3.282 ± 0.042
5.229AspVal: 5.229 ± 0.052
1.005AspTrp: 1.005 ± 0.021
1.441AspTyr: 1.441 ± 0.031
0.0AspXaa: 0.0 ± 0.0
Glu
6.305GluAla: 6.305 ± 0.071
0.359GluCys: 0.359 ± 0.014
2.55GluAsp: 2.55 ± 0.042
2.507GluGlu: 2.507 ± 0.047
1.946GluPhe: 1.946 ± 0.031
3.451GluGly: 3.451 ± 0.047
1.571GluHis: 1.571 ± 0.028
2.646GluIle: 2.646 ± 0.036
1.396GluLys: 1.396 ± 0.029
6.71GluLeu: 6.71 ± 0.063
1.07GluMet: 1.07 ± 0.025
1.206GluAsn: 1.206 ± 0.022
3.096GluPro: 3.096 ± 0.052
2.401GluGln: 2.401 ± 0.037
4.238GluArg: 4.238 ± 0.056
2.824GluSer: 2.824 ± 0.043
2.753GluThr: 2.753 ± 0.037
4.408GluVal: 4.408 ± 0.059
0.748GluTrp: 0.748 ± 0.021
1.208GluTyr: 1.208 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
3.967PheAla: 3.967 ± 0.052
0.316PheCys: 0.316 ± 0.012
2.547PheAsp: 2.547 ± 0.045
1.631PheGlu: 1.631 ± 0.029
1.001PhePhe: 1.001 ± 0.028
3.479PheGly: 3.479 ± 0.04
0.641PheHis: 0.641 ± 0.019
1.158PheIle: 1.158 ± 0.023
0.574PheLys: 0.574 ± 0.017
2.525PheLeu: 2.525 ± 0.036
0.55PheMet: 0.55 ± 0.016
0.788PheAsn: 0.788 ± 0.022
1.446PhePro: 1.446 ± 0.029
0.71PheGln: 0.71 ± 0.018
1.646PheArg: 1.646 ± 0.03
1.826PheSer: 1.826 ± 0.032
2.14PheThr: 2.14 ± 0.031
2.591PheVal: 2.591 ± 0.041
0.48PheTrp: 0.48 ± 0.018
0.75PheTyr: 0.75 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
10.034GlyAla: 10.034 ± 0.088
0.773GlyCys: 0.773 ± 0.023
4.881GlyAsp: 4.881 ± 0.051
4.432GlyGlu: 4.432 ± 0.051
3.098GlyPhe: 3.098 ± 0.043
7.56GlyGly: 7.56 ± 0.079
2.0GlyHis: 2.0 ± 0.035
4.218GlyIle: 4.218 ± 0.054
2.406GlyLys: 2.406 ± 0.042
8.71GlyLeu: 8.71 ± 0.069
2.222GlyMet: 2.222 ± 0.039
2.014GlyAsn: 2.014 ± 0.035
4.539GlyPro: 4.539 ± 0.05
2.915GlyGln: 2.915 ± 0.04
5.981GlyArg: 5.981 ± 0.055
5.273GlySer: 5.273 ± 0.053
5.378GlyThr: 5.378 ± 0.056
7.519GlyVal: 7.519 ± 0.061
1.702GlyTrp: 1.702 ± 0.031
2.469GlyTyr: 2.469 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
2.353HisAla: 2.353 ± 0.034
0.215HisCys: 0.215 ± 0.01
1.403HisAsp: 1.403 ± 0.033
1.079HisGlu: 1.079 ± 0.023
0.687HisPhe: 0.687 ± 0.02
2.087HisGly: 2.087 ± 0.032
0.698HisHis: 0.698 ± 0.021
0.84HisIle: 0.84 ± 0.022
0.364HisLys: 0.364 ± 0.015
2.126HisLeu: 2.126 ± 0.039
0.377HisMet: 0.377 ± 0.014
0.484HisAsn: 0.484 ± 0.018
1.658HisPro: 1.658 ± 0.032
0.666HisGln: 0.666 ± 0.019
1.889HisArg: 1.889 ± 0.03
1.135HisSer: 1.135 ± 0.026
1.245HisThr: 1.245 ± 0.028
1.693HisVal: 1.693 ± 0.032
0.391HisTrp: 0.391 ± 0.015
0.537HisTyr: 0.537 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
6.419IleAla: 6.419 ± 0.063
0.404IleCys: 0.404 ± 0.017
3.398IleAsp: 3.398 ± 0.051
2.661IleGlu: 2.661 ± 0.039
1.074IlePhe: 1.074 ± 0.024
4.575IleGly: 4.575 ± 0.055
0.787IleHis: 0.787 ± 0.021
1.547IleIle: 1.547 ± 0.035
0.978IleLys: 0.978 ± 0.02
3.129IleLeu: 3.129 ± 0.039
0.616IleMet: 0.616 ± 0.018
1.145IleAsn: 1.145 ± 0.026
2.433IlePro: 2.433 ± 0.038
0.98IleGln: 0.98 ± 0.024
2.649IleArg: 2.649 ± 0.032
2.431IleSer: 2.431 ± 0.037
2.816IleThr: 2.816 ± 0.041
3.676IleVal: 3.676 ± 0.045
0.516IleTrp: 0.516 ± 0.017
0.866IleTyr: 0.866 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
2.873LysAla: 2.873 ± 0.044
0.133LysCys: 0.133 ± 0.008
1.213LysAsp: 1.213 ± 0.031
1.001LysGlu: 1.001 ± 0.025
0.693LysPhe: 0.693 ± 0.018
1.638LysGly: 1.638 ± 0.036
0.519LysHis: 0.519 ± 0.016
1.107LysIle: 1.107 ± 0.026
0.707LysLys: 0.707 ± 0.028
2.468LysLeu: 2.468 ± 0.039
0.545LysMet: 0.545 ± 0.017
0.561LysAsn: 0.561 ± 0.02
1.583LysPro: 1.583 ± 0.034
0.795LysGln: 0.795 ± 0.023
1.609LysArg: 1.609 ± 0.028
1.31LysSer: 1.31 ± 0.029
1.398LysThr: 1.398 ± 0.028
2.018LysVal: 2.018 ± 0.034
0.342LysTrp: 0.342 ± 0.014
0.511LysTyr: 0.511 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
14.021LeuAla: 14.021 ± 0.087
0.781LeuCys: 0.781 ± 0.021
6.517LeuAsp: 6.517 ± 0.065
4.393LeuGlu: 4.393 ± 0.053
2.71LeuPhe: 2.71 ± 0.037
8.725LeuGly: 8.725 ± 0.069
2.078LeuHis: 2.078 ± 0.036
4.224LeuIle: 4.224 ± 0.05
1.987LeuLys: 1.987 ± 0.037
9.469LeuLeu: 9.469 ± 0.089
1.797LeuMet: 1.797 ± 0.029
2.246LeuAsn: 2.246 ± 0.036
5.776LeuPro: 5.776 ± 0.06
2.673LeuGln: 2.673 ± 0.039
7.203LeuArg: 7.203 ± 0.072
5.792LeuSer: 5.792 ± 0.055
6.458LeuThr: 6.458 ± 0.064
8.344LeuVal: 8.344 ± 0.086
1.228LeuTrp: 1.228 ± 0.027
1.713LeuTyr: 1.713 ± 0.029
0.0LeuXaa: 0.0 ± 0.0
Met
2.71MetAla: 2.71 ± 0.042
0.181MetCys: 0.181 ± 0.01
0.928MetAsp: 0.928 ± 0.023
0.702MetGlu: 0.702 ± 0.019
0.702MetPhe: 0.702 ± 0.021
1.533MetGly: 1.533 ± 0.028
0.413MetHis: 0.413 ± 0.016
0.926MetIle: 0.926 ± 0.021
0.504MetLys: 0.504 ± 0.019
2.163MetLeu: 2.163 ± 0.033
0.459MetMet: 0.459 ± 0.016
0.529MetAsn: 0.529 ± 0.016
1.318MetPro: 1.318 ± 0.025
0.583MetGln: 0.583 ± 0.017
1.548MetArg: 1.548 ± 0.032
1.622MetSer: 1.622 ± 0.031
1.852MetThr: 1.852 ± 0.028
1.768MetVal: 1.768 ± 0.029
0.267MetTrp: 0.267 ± 0.011
0.358MetTyr: 0.358 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.67AsnAla: 2.67 ± 0.035
0.197AsnCys: 0.197 ± 0.011
1.264AsnAsp: 1.264 ± 0.029
1.04AsnGlu: 1.04 ± 0.026
0.71AsnPhe: 0.71 ± 0.02
2.119AsnGly: 2.119 ± 0.037
0.499AsnHis: 0.499 ± 0.017
0.978AsnIle: 0.978 ± 0.025
0.539AsnLys: 0.539 ± 0.019
2.131AsnLeu: 2.131 ± 0.037
0.463AsnMet: 0.463 ± 0.014
0.592AsnAsn: 0.592 ± 0.019
1.844AsnPro: 1.844 ± 0.035
0.681AsnGln: 0.681 ± 0.018
1.634AsnArg: 1.634 ± 0.031
1.252AsnSer: 1.252 ± 0.025
1.403AsnThr: 1.403 ± 0.026
1.818AsnVal: 1.818 ± 0.033
0.409AsnTrp: 0.409 ± 0.014
0.593AsnTyr: 0.593 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
7.422ProAla: 7.422 ± 0.093
0.296ProCys: 0.296 ± 0.012
4.441ProAsp: 4.441 ± 0.043
3.943ProGlu: 3.943 ± 0.048
1.69ProPhe: 1.69 ± 0.028
5.665ProGly: 5.665 ± 0.065
1.197ProHis: 1.197 ± 0.027
2.193ProIle: 2.193 ± 0.033
1.361ProLys: 1.361 ± 0.027
4.779ProLeu: 4.779 ± 0.051
1.261ProMet: 1.261 ± 0.027
1.296ProAsn: 1.296 ± 0.027
3.484ProPro: 3.484 ± 0.074
1.882ProGln: 1.882 ± 0.037
3.281ProArg: 3.281 ± 0.046
3.122ProSer: 3.122 ± 0.043
3.45ProThr: 3.45 ± 0.044
5.186ProVal: 5.186 ± 0.057
0.875ProTrp: 0.875 ± 0.021
1.218ProTyr: 1.218 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
3.786GlnAla: 3.786 ± 0.047
0.221GlnCys: 0.221 ± 0.012
1.267GlnAsp: 1.267 ± 0.025
1.154GlnGlu: 1.154 ± 0.023
1.05GlnPhe: 1.05 ± 0.022
2.03GlnGly: 2.03 ± 0.035
0.755GlnHis: 0.755 ± 0.02
1.625GlnIle: 1.625 ± 0.03
0.699GlnLys: 0.699 ± 0.019
3.653GlnLeu: 3.653 ± 0.045
0.76GlnMet: 0.76 ± 0.02
0.648GlnAsn: 0.648 ± 0.02
1.905GlnPro: 1.905 ± 0.034
1.38GlnGln: 1.38 ± 0.034
2.788GlnArg: 2.788 ± 0.041
1.557GlnSer: 1.557 ± 0.033
1.658GlnThr: 1.658 ± 0.028
2.593GlnVal: 2.593 ± 0.045
0.634GlnTrp: 0.634 ± 0.02
0.673GlnTyr: 0.673 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
7.861ArgAla: 7.861 ± 0.073
0.543ArgCys: 0.543 ± 0.016
4.126ArgAsp: 4.126 ± 0.052
3.786ArgGlu: 3.786 ± 0.046
2.378ArgPhe: 2.378 ± 0.035
4.911ArgGly: 4.911 ± 0.05
1.72ArgHis: 1.72 ± 0.036
3.404ArgIle: 3.404 ± 0.046
1.599ArgLys: 1.599 ± 0.034
7.192ArgLeu: 7.192 ± 0.078
1.759ArgMet: 1.759 ± 0.029
1.696ArgAsn: 1.696 ± 0.033
3.752ArgPro: 3.752 ± 0.047
2.334ArgGln: 2.334 ± 0.035
6.083ArgArg: 6.083 ± 0.069
3.959ArgSer: 3.959 ± 0.049
4.099ArgThr: 4.099 ± 0.048
5.344ArgVal: 5.344 ± 0.059
1.328ArgTrp: 1.328 ± 0.024
1.882ArgTyr: 1.882 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
7.217SerAla: 7.217 ± 0.068
0.405SerCys: 0.405 ± 0.015
3.299SerAsp: 3.299 ± 0.04
2.723SerGlu: 2.723 ± 0.043
1.665SerPhe: 1.665 ± 0.027
5.727SerGly: 5.727 ± 0.054
1.058SerHis: 1.058 ± 0.023
2.187SerIle: 2.187 ± 0.035
1.247SerLys: 1.247 ± 0.03
4.807SerLeu: 4.807 ± 0.05
1.37SerMet: 1.37 ± 0.026
1.153SerAsn: 1.153 ± 0.027
3.082SerPro: 3.082 ± 0.038
1.529SerGln: 1.529 ± 0.028
3.551SerArg: 3.551 ± 0.046
3.237SerSer: 3.237 ± 0.048
3.461SerThr: 3.461 ± 0.043
4.717SerVal: 4.717 ± 0.049
0.942SerTrp: 0.942 ± 0.023
1.289SerTyr: 1.289 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
8.001ThrAla: 8.001 ± 0.079
0.415ThrCys: 0.415 ± 0.014
3.872ThrAsp: 3.872 ± 0.044
3.354ThrGlu: 3.354 ± 0.04
1.878ThrPhe: 1.878 ± 0.033
5.886ThrGly: 5.886 ± 0.055
1.181ThrHis: 1.181 ± 0.023
2.449ThrIle: 2.449 ± 0.04
1.357ThrLys: 1.357 ± 0.029
5.562ThrLeu: 5.562 ± 0.054
1.302ThrMet: 1.302 ± 0.029
1.253ThrAsn: 1.253 ± 0.028
3.849ThrPro: 3.849 ± 0.046
1.563ThrGln: 1.563 ± 0.029
3.561ThrArg: 3.561 ± 0.046
3.124ThrSer: 3.124 ± 0.041
3.979ThrThr: 3.979 ± 0.059
5.991ThrVal: 5.991 ± 0.054
0.854ThrTrp: 0.854 ± 0.02
1.328ThrTyr: 1.328 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
11.429ValAla: 11.429 ± 0.093
0.724ValCys: 0.724 ± 0.02
5.605ValAsp: 5.605 ± 0.065
4.607ValGlu: 4.607 ± 0.058
2.617ValPhe: 2.617 ± 0.039
6.973ValGly: 6.973 ± 0.062
1.793ValHis: 1.793 ± 0.029
4.022ValIle: 4.022 ± 0.047
1.82ValLys: 1.82 ± 0.034
8.988ValLeu: 8.988 ± 0.074
1.638ValMet: 1.638 ± 0.034
2.117ValAsn: 2.117 ± 0.036
4.768ValPro: 4.768 ± 0.057
2.176ValGln: 2.176 ± 0.033
5.756ValArg: 5.756 ± 0.053
4.802ValSer: 4.802 ± 0.046
5.628ValThr: 5.628 ± 0.062
8.347ValVal: 8.347 ± 0.079
1.074ValTrp: 1.074 ± 0.025
1.572ValTyr: 1.572 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.578TrpAla: 1.578 ± 0.03
0.15TrpCys: 0.15 ± 0.009
0.808TrpAsp: 0.808 ± 0.023
0.666TrpGlu: 0.666 ± 0.021
0.547TrpPhe: 0.547 ± 0.017
1.031TrpGly: 1.031 ± 0.027
0.37TrpHis: 0.37 ± 0.014
0.71TrpIle: 0.71 ± 0.021
0.35TrpLys: 0.35 ± 0.015
1.741TrpLeu: 1.741 ± 0.031
0.373TrpMet: 0.373 ± 0.012
0.434TrpAsn: 0.434 ± 0.015
0.875TrpPro: 0.875 ± 0.021
0.675TrpGln: 0.675 ± 0.021
1.233TrpArg: 1.233 ± 0.029
0.947TrpSer: 0.947 ± 0.022
0.934TrpThr: 0.934 ± 0.023
1.133TrpVal: 1.133 ± 0.025
0.351TrpTrp: 0.351 ± 0.016
0.349TrpTyr: 0.349 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.422TyrAla: 2.422 ± 0.043
0.247TyrCys: 0.247 ± 0.013
1.436TyrAsp: 1.436 ± 0.028
1.08TyrGlu: 1.08 ± 0.024
0.788TyrPhe: 0.788 ± 0.02
2.063TyrGly: 2.063 ± 0.031
0.456TyrHis: 0.456 ± 0.016
0.68TyrIle: 0.68 ± 0.017
0.401TyrLys: 0.401 ± 0.016
2.417TyrLeu: 2.417 ± 0.04
0.315TyrMet: 0.315 ± 0.015
0.529TyrAsn: 0.529 ± 0.017
1.296TyrPro: 1.296 ± 0.025
0.726TyrGln: 0.726 ± 0.02
1.953TyrArg: 1.953 ± 0.034
1.149TyrSer: 1.149 ± 0.022
1.218TyrThr: 1.218 ± 0.025
1.679TyrVal: 1.679 ± 0.03
0.401TyrTrp: 0.401 ± 0.017
0.511TyrTyr: 0.511 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6012 proteins (1901592 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski