Amino acid dipeptide frequency for Naasia lichenicola

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
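The counting described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the toy sequences, and the choice to skip pairs containing non-standard residues are all assumptions made for this example. It also assumes that "expected" means the uniform value of 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa is left out of this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for first, second in zip(seq, seq[1:]):
            if first in STANDARD and second in STANDARD:
                counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(STANDARD, repeat=2)
    }


# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def classify(freq_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be shown in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be shown in blue
    return "expected"


toy_proteins = ["MAALAG", "MKAAL"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy_proteins)
print(EXPECTED_PER_MILLE)  # 2.5
print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

On a real proteome the input would be the full list of protein sequences, and the resulting 400 values would sum to 1000‰.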

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
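As a worked example of the unit conversion, the short Python sketch below converts the AlaAla value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name is hypothetical and exists only for this illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = 19.522        # AlaAla value from the table, in per mille
uniform = 1000.0 / 400  # 2.5 per mille, the uniform expectation for 400 dipeptides

fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100
ratio = ala_ala / uniform

print(f"{fraction:.6f}")  # 0.019522
print(f"{percent:.4f}%")  # 1.9522%
print(f"{ratio:.2f}x")    # 7.81x
```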

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.522 ± 0.201
AlaCys: 0.608 ± 0.024
AlaAsp: 8.323 ± 0.09
AlaGlu: 7.499 ± 0.096
AlaPhe: 3.92 ± 0.056
AlaGly: 11.583 ± 0.119
AlaHis: 2.252 ± 0.049
AlaIle: 6.776 ± 0.092
AlaLys: 2.695 ± 0.063
AlaLeu: 13.581 ± 0.14
AlaMet: 2.642 ± 0.047
AlaAsn: 2.333 ± 0.052
AlaPro: 6.304 ± 0.105
AlaGln: 3.705 ± 0.062
AlaArg: 8.487 ± 0.116
AlaSer: 8.161 ± 0.096
AlaThr: 7.315 ± 0.102
AlaVal: 11.085 ± 0.118
AlaTrp: 1.696 ± 0.04
AlaTyr: 2.36 ± 0.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.557 ± 0.026
CysCys: 0.046 ± 0.007
CysAsp: 0.299 ± 0.017
CysGlu: 0.217 ± 0.015
CysPhe: 0.163 ± 0.012
CysGly: 0.497 ± 0.019
CysHis: 0.108 ± 0.011
CysIle: 0.212 ± 0.014
CysLys: 0.055 ± 0.006
CysLeu: 0.392 ± 0.021
CysMet: 0.078 ± 0.009
CysAsn: 0.106 ± 0.009
CysPro: 0.243 ± 0.017
CysGln: 0.095 ± 0.009
CysArg: 0.284 ± 0.018
CysSer: 0.361 ± 0.019
CysThr: 0.265 ± 0.019
CysVal: 0.371 ± 0.02
CysTrp: 0.062 ± 0.007
CysTyr: 0.096 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.239 ± 0.101
AspCys: 0.233 ± 0.014
AspAsp: 4.262 ± 0.074
AspGlu: 4.336 ± 0.073
AspPhe: 1.916 ± 0.044
AspGly: 6.233 ± 0.091
AspHis: 1.196 ± 0.035
AspIle: 2.544 ± 0.054
AspLys: 0.896 ± 0.034
AspLeu: 6.359 ± 0.074
AspMet: 0.788 ± 0.027
AspAsn: 0.986 ± 0.032
AspPro: 4.216 ± 0.066
AspGln: 1.699 ± 0.041
AspArg: 4.723 ± 0.078
AspSer: 3.456 ± 0.063
AspThr: 2.775 ± 0.051
AspVal: 5.235 ± 0.069
AspTrp: 0.978 ± 0.029
AspTyr: 1.387 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.407 ± 0.085
GluCys: 0.184 ± 0.012
GluAsp: 2.69 ± 0.055
GluGlu: 2.736 ± 0.058
GluPhe: 1.729 ± 0.043
GluGly: 3.755 ± 0.063
GluHis: 1.466 ± 0.083
GluIle: 2.759 ± 0.052
GluLys: 1.334 ± 0.039
GluLeu: 6.99 ± 0.093
GluMet: 0.879 ± 0.027
GluAsn: 1.161 ± 0.039
GluPro: 2.876 ± 0.056
GluGln: 2.151 ± 0.048
GluArg: 4.863 ± 0.082
GluSer: 3.365 ± 0.06
GluThr: 2.932 ± 0.055
GluVal: 4.481 ± 0.07
GluTrp: 0.808 ± 0.031
GluTyr: 1.045 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.295 ± 0.063
PheCys: 0.182 ± 0.011
PheAsp: 2.431 ± 0.052
PheGlu: 1.838 ± 0.046
PhePhe: 1.04 ± 0.04
PheGly: 3.649 ± 0.063
PheHis: 0.532 ± 0.021
PheIle: 1.301 ± 0.036
PheLys: 0.491 ± 0.023
PheLeu: 2.939 ± 0.057
PheMet: 0.421 ± 0.02
PheAsn: 0.676 ± 0.026
PhePro: 1.477 ± 0.038
PheGln: 0.782 ± 0.029
PheArg: 1.816 ± 0.043
PheSer: 2.009 ± 0.042
PheThr: 2.014 ± 0.047
PheVal: 2.722 ± 0.048
PheTrp: 0.49 ± 0.021
PheTyr: 0.667 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.523 ± 0.103
GlyCys: 0.489 ± 0.022
GlyAsp: 4.935 ± 0.063
GlyGlu: 4.709 ± 0.067
GlyPhe: 3.326 ± 0.058
GlyGly: 7.794 ± 0.111
GlyHis: 1.727 ± 0.043
GlyIle: 5.172 ± 0.069
GlyLys: 2.052 ± 0.051
GlyLeu: 8.913 ± 0.1
GlyMet: 1.863 ± 0.034
GlyAsn: 1.815 ± 0.046
GlyPro: 4.016 ± 0.061
GlyGln: 2.502 ± 0.053
GlyArg: 6.371 ± 0.085
GlySer: 6.55 ± 0.1
GlyThr: 5.603 ± 0.089
GlyVal: 7.615 ± 0.087
GlyTrp: 1.622 ± 0.039
GlyTyr: 2.364 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.146 ± 0.039
HisCys: 0.104 ± 0.011
HisAsp: 1.214 ± 0.041
HisGlu: 1.139 ± 0.079
HisPhe: 0.586 ± 0.024
HisGly: 1.932 ± 0.038
HisHis: 0.482 ± 0.021
HisIle: 0.761 ± 0.022
HisLys: 0.241 ± 0.015
HisLeu: 2.004 ± 0.05
HisMet: 0.311 ± 0.016
HisAsn: 0.337 ± 0.021
HisPro: 1.421 ± 0.04
HisGln: 0.496 ± 0.022
HisArg: 1.555 ± 0.041
HisSer: 1.14 ± 0.033
HisThr: 0.903 ± 0.029
HisVal: 1.389 ± 0.033
HisTrp: 0.279 ± 0.018
HisTyr: 0.449 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.752 ± 0.09
IleCys: 0.266 ± 0.015
IleAsp: 3.731 ± 0.057
IleGlu: 3.128 ± 0.055
IlePhe: 1.268 ± 0.036
IleGly: 5.241 ± 0.077
IleHis: 0.783 ± 0.027
IleIle: 1.98 ± 0.048
IleLys: 0.886 ± 0.032
IleLeu: 4.069 ± 0.064
IleMet: 0.614 ± 0.025
IleAsn: 0.951 ± 0.033
IlePro: 2.625 ± 0.045
IleGln: 1.131 ± 0.03
IleArg: 3.062 ± 0.057
IleSer: 2.938 ± 0.052
IleThr: 2.978 ± 0.056
IleVal: 4.924 ± 0.073
IleTrp: 0.569 ± 0.023
IleTyr: 0.83 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.526 ± 0.057
LysCys: 0.063 ± 0.008
LysAsp: 1.082 ± 0.034
LysGlu: 0.925 ± 0.033
LysPhe: 0.532 ± 0.024
LysGly: 1.518 ± 0.045
LysHis: 0.434 ± 0.017
LysIle: 0.885 ± 0.029
LysLys: 0.705 ± 0.032
LysLeu: 1.887 ± 0.048
LysMet: 0.346 ± 0.018
LysAsn: 0.517 ± 0.026
LysPro: 1.186 ± 0.036
LysGln: 0.704 ± 0.028
LysArg: 1.58 ± 0.04
LysSer: 1.242 ± 0.03
LysThr: 1.241 ± 0.039
LysVal: 1.699 ± 0.045
LysTrp: 0.241 ± 0.015
LysTyr: 0.431 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.271 ± 0.147
LeuCys: 0.484 ± 0.018
LeuAsp: 6.951 ± 0.092
LeuGlu: 5.411 ± 0.071
LeuPhe: 2.975 ± 0.062
LeuGly: 9.293 ± 0.106
LeuHis: 1.893 ± 0.043
LeuIle: 5.124 ± 0.082
LeuLys: 1.857 ± 0.042
LeuLeu: 10.643 ± 0.151
LeuMet: 1.565 ± 0.037
LeuAsn: 1.888 ± 0.047
LeuPro: 5.597 ± 0.084
LeuGln: 2.705 ± 0.054
LeuArg: 7.225 ± 0.102
LeuSer: 6.26 ± 0.091
LeuThr: 6.127 ± 0.071
LeuVal: 8.955 ± 0.12
LeuTrp: 1.156 ± 0.033
LeuTyr: 1.67 ± 0.038
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.171 ± 0.041
MetCys: 0.08 ± 0.009
MetAsp: 0.865 ± 0.027
MetGlu: 0.655 ± 0.027
MetPhe: 0.503 ± 0.02
MetGly: 1.268 ± 0.032
MetHis: 0.342 ± 0.017
MetIle: 0.87 ± 0.031
MetLys: 0.393 ± 0.021
MetLeu: 1.796 ± 0.042
MetMet: 0.278 ± 0.017
MetAsn: 0.409 ± 0.019
MetPro: 1.155 ± 0.034
MetGln: 0.528 ± 0.019
MetArg: 1.331 ± 0.033
MetSer: 1.51 ± 0.037
MetThr: 1.622 ± 0.042
MetVal: 1.26 ± 0.032
MetTrp: 0.178 ± 0.012
MetTyr: 0.246 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.528 ± 0.047
AsnCys: 0.116 ± 0.011
AsnAsp: 1.157 ± 0.033
AsnGlu: 0.922 ± 0.035
AsnPhe: 0.662 ± 0.028
AsnGly: 2.035 ± 0.051
AsnHis: 0.347 ± 0.019
AsnIle: 0.897 ± 0.035
AsnLys: 0.348 ± 0.02
AsnLeu: 1.941 ± 0.044
AsnMet: 0.319 ± 0.018
AsnAsn: 0.465 ± 0.025
AsnPro: 1.532 ± 0.043
AsnGln: 0.55 ± 0.027
AsnArg: 1.342 ± 0.036
AsnSer: 1.203 ± 0.038
AsnThr: 1.094 ± 0.04
AsnVal: 1.646 ± 0.043
AsnTrp: 0.326 ± 0.016
AsnTyr: 0.486 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.888 ± 0.089
ProCys: 0.177 ± 0.015
ProAsp: 3.943 ± 0.064
ProGlu: 3.496 ± 0.056
ProPhe: 1.756 ± 0.04
ProGly: 4.743 ± 0.065
ProHis: 1.04 ± 0.031
ProIle: 2.695 ± 0.049
ProLys: 1.084 ± 0.038
ProLeu: 4.782 ± 0.075
ProMet: 0.945 ± 0.032
ProAsn: 1.149 ± 0.031
ProPro: 2.406 ± 0.063
ProGln: 1.466 ± 0.04
ProArg: 3.307 ± 0.063
ProSer: 3.688 ± 0.057
ProThr: 3.716 ± 0.059
ProVal: 4.61 ± 0.071
ProTrp: 0.813 ± 0.029
ProTyr: 1.059 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.329 ± 0.058
GlnCys: 0.109 ± 0.012
GlnAsp: 1.256 ± 0.034
GlnGlu: 1.237 ± 0.036
GlnPhe: 0.861 ± 0.029
GlnGly: 2.081 ± 0.048
GlnHis: 0.608 ± 0.024
GlnIle: 1.509 ± 0.039
GlnLys: 0.68 ± 0.026
GlnLeu: 3.452 ± 0.06
GlnMet: 0.521 ± 0.019
GlnAsn: 0.611 ± 0.027
GlnPro: 1.583 ± 0.039
GlnGln: 1.158 ± 0.037
GlnArg: 2.381 ± 0.047
GlnSer: 1.737 ± 0.041
GlnThr: 1.576 ± 0.036
GlnVal: 2.389 ± 0.045
GlnTrp: 0.432 ± 0.019
GlnTyr: 0.593 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.341 ± 0.104
ArgCys: 0.256 ± 0.015
ArgAsp: 4.038 ± 0.071
ArgGlu: 3.953 ± 0.071
ArgPhe: 2.464 ± 0.05
ArgGly: 5.327 ± 0.079
ArgHis: 1.483 ± 0.04
ArgIle: 4.08 ± 0.067
ArgLys: 1.393 ± 0.034
ArgLeu: 7.376 ± 0.098
ArgMet: 1.739 ± 0.038
ArgAsn: 1.349 ± 0.032
ArgPro: 3.689 ± 0.075
ArgGln: 1.995 ± 0.048
ArgArg: 6.381 ± 0.096
ArgSer: 4.928 ± 0.079
ArgThr: 4.128 ± 0.065
ArgVal: 5.57 ± 0.071
ArgTrp: 1.093 ± 0.036
ArgTyr: 1.494 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.566 ± 0.104
SerCys: 0.277 ± 0.02
SerAsp: 3.919 ± 0.063
SerGlu: 3.135 ± 0.056
SerPhe: 2.113 ± 0.05
SerGly: 6.56 ± 0.086
SerHis: 1.101 ± 0.03
SerIle: 3.274 ± 0.06
SerLys: 1.242 ± 0.037
SerLeu: 5.924 ± 0.081
SerMet: 1.297 ± 0.037
SerAsn: 1.35 ± 0.042
SerPro: 3.406 ± 0.061
SerGln: 1.629 ± 0.035
SerArg: 4.253 ± 0.055
SerSer: 4.847 ± 0.094
SerThr: 4.515 ± 0.087
SerVal: 5.318 ± 0.073
SerTrp: 0.945 ± 0.03
SerTyr: 1.338 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.067 ± 0.099
ThrCys: 0.234 ± 0.015
ThrAsp: 3.688 ± 0.069
ThrGlu: 2.993 ± 0.057
ThrPhe: 1.91 ± 0.044
ThrGly: 5.984 ± 0.09
ThrHis: 0.962 ± 0.029
ThrIle: 3.014 ± 0.055
ThrLys: 1.19 ± 0.031
ThrLeu: 5.647 ± 0.074
ThrMet: 0.976 ± 0.032
ThrAsn: 1.208 ± 0.031
ThrPro: 3.786 ± 0.069
ThrGln: 1.391 ± 0.04
ThrArg: 3.654 ± 0.063
ThrSer: 4.018 ± 0.084
ThrThr: 3.925 ± 0.092
ThrVal: 5.703 ± 0.081
ThrTrp: 0.781 ± 0.026
ThrTyr: 1.112 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.961 ± 0.117
ValCys: 0.403 ± 0.02
ValAsp: 5.656 ± 0.076
ValGlu: 4.612 ± 0.085
ValPhe: 2.775 ± 0.043
ValGly: 7.457 ± 0.086
ValHis: 1.53 ± 0.037
ValIle: 4.388 ± 0.064
ValLys: 1.483 ± 0.038
ValLeu: 9.439 ± 0.117
ValMet: 1.333 ± 0.036
ValAsn: 1.741 ± 0.042
ValPro: 4.528 ± 0.058
ValGln: 2.276 ± 0.048
ValArg: 5.73 ± 0.073
ValSer: 5.269 ± 0.069
ValThr: 5.45 ± 0.081
ValVal: 8.267 ± 0.113
ValTrp: 1.019 ± 0.031
ValTyr: 1.523 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.398 ± 0.038
TrpCys: 0.071 ± 0.009
TrpAsp: 0.73 ± 0.024
TrpGlu: 0.571 ± 0.025
TrpPhe: 0.539 ± 0.024
TrpGly: 1.062 ± 0.033
TrpHis: 0.311 ± 0.019
TrpIle: 0.763 ± 0.028
TrpLys: 0.366 ± 0.018
TrpLeu: 1.64 ± 0.043
TrpMet: 0.307 ± 0.016
TrpAsn: 0.419 ± 0.02
TrpPro: 0.696 ± 0.022
TrpGln: 0.547 ± 0.026
TrpArg: 1.123 ± 0.032
TrpSer: 1.013 ± 0.032
TrpThr: 0.938 ± 0.031
TrpVal: 1.021 ± 0.029
TrpTrp: 0.321 ± 0.017
TrpTyr: 0.287 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.358 ± 0.051
TyrCys: 0.124 ± 0.011
TyrAsp: 1.287 ± 0.039
TyrGlu: 1.1 ± 0.032
TyrPhe: 0.735 ± 0.027
TyrGly: 1.952 ± 0.043
TyrHis: 0.294 ± 0.016
TyrIle: 0.72 ± 0.027
TyrLys: 0.351 ± 0.02
TyrLeu: 2.274 ± 0.043
TyrMet: 0.236 ± 0.014
TyrAsn: 0.448 ± 0.019
TyrPro: 1.021 ± 0.032
TyrGln: 0.593 ± 0.026
TyrArg: 1.611 ± 0.043
TyrSer: 1.313 ± 0.035
TyrThr: 1.125 ± 0.035
TyrVal: 1.58 ± 0.04
TyrTrp: 0.313 ± 0.017
TyrTyr: 0.469 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3498 proteins (1124881 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
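A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so the details are assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. All function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (frequency in per mille, bootstrap standard deviation)."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]

    def frequency(sample):
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        return 1000.0 * hits / total if total else 0.0

    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        estimates.append(frequency(sample))
    return frequency(per_protein), statistics.stdev(estimates)


toy = ["MAALAG", "MKAAL", "MGAVLA", "MAAAK"]  # made-up sequences
value, error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.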

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski