Amino acid dipeptide frequency for Halomonas anticariensis (strain DSM 16096 / CECT 5854 / LMG 22089 / FP35)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
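As an illustration of how such frequencies can be obtained, the following minimal sketch counts overlapping residue pairs and reports them in per mille. It assumes one-letter amino acid codes, that pairs never span two proteins, and that frequencies are normalized by the total number of pairs; the function name `dipeptide_permille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not from this proteome).
toy = ["AAL", "LA"]
freqs = dipeptide_permille(toy)

# Baseline if all 400 standard dipeptides were equally likely: 2.5 per mille.
uniform = 1000.0 / 400
```

With real data, `proteins` would be the list of protein sequences of the proteome, and each value in `freqs` could be compared against `uniform`.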

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
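The unit conversion can be sketched as follows, together with a ratio against the uniform 0.25% baseline. The helper names are hypothetical, and using the uniform baseline as the "expected" value is an assumption based on the description above.

```python
# Uniform baseline: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent."""
    return value_permille / 10.0

def enrichment(value_permille, expected_permille=UNIFORM_PERMILLE):
    """Ratio > 1: more frequent than the baseline; ratio < 1: less frequent."""
    return value_permille / expected_permille

# Two values taken from the table for this proteome.
ala_ala = 11.118
cys_cys = 0.132
ala_ala_ratio = enrichment(ala_ala)
cys_cys_ratio = enrichment(cys_cys)
```

Under this assumption, AlaAla (11.118‰) is about 4.4 times the uniform baseline, while CysCys (0.132‰) is roughly 0.05 times the baseline.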

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.118 ± 0.091
AlaCys: 1.149 ± 0.03
AlaAsp: 5.329 ± 0.069
AlaGlu: 7.094 ± 0.074
AlaPhe: 3.886 ± 0.059
AlaGly: 8.327 ± 0.084
AlaHis: 2.284 ± 0.044
AlaIle: 5.619 ± 0.068
AlaLys: 3.012 ± 0.05
AlaLeu: 13.013 ± 0.112
AlaMet: 3.233 ± 0.045
AlaAsn: 2.806 ± 0.05
AlaPro: 4.354 ± 0.051
AlaGln: 3.83 ± 0.053
AlaArg: 7.252 ± 0.074
AlaSer: 6.075 ± 0.066
AlaThr: 5.219 ± 0.052
AlaVal: 7.382 ± 0.083
AlaTrp: 1.778 ± 0.033
AlaTyr: 2.429 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.87 ± 0.026
CysCys: 0.132 ± 0.009
CysAsp: 0.577 ± 0.021
CysGlu: 0.591 ± 0.02
CysPhe: 0.353 ± 0.015
CysGly: 0.916 ± 0.024
CysHis: 0.364 ± 0.015
CysIle: 0.419 ± 0.017
CysLys: 0.204 ± 0.012
CysLeu: 1.124 ± 0.026
CysMet: 0.216 ± 0.012
CysAsn: 0.246 ± 0.012
CysPro: 0.535 ± 0.021
CysGln: 0.4 ± 0.018
CysArg: 0.77 ± 0.023
CysSer: 0.535 ± 0.019
CysThr: 0.404 ± 0.018
CysVal: 0.647 ± 0.023
CysTrp: 0.141 ± 0.01
CysTyr: 0.279 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.024 ± 0.069
AspCys: 0.488 ± 0.021
AspAsp: 3.811 ± 0.058
AspGlu: 4.224 ± 0.056
AspPhe: 2.141 ± 0.032
AspGly: 4.452 ± 0.057
AspHis: 1.368 ± 0.031
AspIle: 3.281 ± 0.05
AspLys: 1.704 ± 0.04
AspLeu: 5.586 ± 0.068
AspMet: 1.355 ± 0.03
AspAsn: 1.665 ± 0.035
AspPro: 3.024 ± 0.051
AspGln: 1.908 ± 0.039
AspArg: 3.598 ± 0.052
AspSer: 2.985 ± 0.05
AspThr: 3.078 ± 0.047
AspVal: 3.927 ± 0.057
AspTrp: 1.026 ± 0.029
AspTyr: 1.744 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.814 ± 0.088
GluCys: 0.549 ± 0.02
GluAsp: 3.135 ± 0.052
GluGlu: 4.255 ± 0.075
GluPhe: 1.898 ± 0.038
GluGly: 4.789 ± 0.056
GluHis: 1.874 ± 0.032
GluIle: 3.339 ± 0.042
GluLys: 1.864 ± 0.04
GluLeu: 7.333 ± 0.079
GluMet: 1.578 ± 0.032
GluAsn: 1.679 ± 0.037
GluPro: 2.827 ± 0.042
GluGln: 3.058 ± 0.052
GluArg: 6.225 ± 0.069
GluSer: 3.399 ± 0.054
GluThr: 3.357 ± 0.054
GluVal: 4.689 ± 0.058
GluTrp: 0.923 ± 0.024
GluTyr: 1.409 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.609 ± 0.051
PheCys: 0.389 ± 0.015
PheAsp: 2.566 ± 0.047
PheGlu: 2.197 ± 0.038
PhePhe: 1.457 ± 0.036
PheGly: 3.158 ± 0.052
PheHis: 0.9 ± 0.024
PheIle: 1.782 ± 0.034
PheLys: 0.997 ± 0.026
PheLeu: 3.428 ± 0.06
PheMet: 0.839 ± 0.025
PheAsn: 1.111 ± 0.028
PhePro: 1.527 ± 0.034
PheGln: 1.237 ± 0.029
PheArg: 2.048 ± 0.034
PheSer: 2.419 ± 0.041
PheThr: 1.986 ± 0.038
PheVal: 2.554 ± 0.049
PheTrp: 0.533 ± 0.021
PheTyr: 1.049 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.01 ± 0.078
GlyCys: 0.922 ± 0.023
GlyAsp: 4.388 ± 0.061
GlyGlu: 5.599 ± 0.071
GlyPhe: 3.284 ± 0.041
GlyGly: 6.329 ± 0.086
GlyHis: 2.129 ± 0.04
GlyIle: 4.683 ± 0.067
GlyLys: 2.772 ± 0.044
GlyLeu: 8.789 ± 0.087
GlyMet: 2.491 ± 0.048
GlyAsn: 2.248 ± 0.044
GlyPro: 2.742 ± 0.043
GlyGln: 3.081 ± 0.05
GlyArg: 5.302 ± 0.074
GlySer: 4.333 ± 0.055
GlyThr: 3.859 ± 0.052
GlyVal: 6.148 ± 0.077
GlyTrp: 1.393 ± 0.035
GlyTyr: 2.479 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.466 ± 0.043
HisCys: 0.309 ± 0.015
HisAsp: 1.724 ± 0.035
HisGlu: 1.541 ± 0.034
HisPhe: 1.091 ± 0.032
HisGly: 2.298 ± 0.036
HisHis: 0.896 ± 0.029
HisIle: 1.063 ± 0.027
HisLys: 0.584 ± 0.024
HisLeu: 2.751 ± 0.046
HisMet: 0.525 ± 0.019
HisAsn: 0.589 ± 0.018
HisPro: 1.598 ± 0.033
HisGln: 0.985 ± 0.026
HisArg: 1.868 ± 0.037
HisSer: 1.251 ± 0.033
HisThr: 1.033 ± 0.028
HisVal: 1.63 ± 0.029
HisTrp: 0.475 ± 0.018
HisTyr: 0.854 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.842 ± 0.066
IleCys: 0.478 ± 0.019
IleAsp: 3.699 ± 0.058
IleGlu: 3.754 ± 0.054
IlePhe: 1.654 ± 0.039
IleGly: 4.491 ± 0.064
IleHis: 1.192 ± 0.028
IleIle: 2.126 ± 0.047
IleLys: 1.456 ± 0.036
IleLeu: 4.454 ± 0.062
IleMet: 1.01 ± 0.025
IleAsn: 1.621 ± 0.035
IlePro: 2.453 ± 0.04
IleGln: 1.525 ± 0.035
IleArg: 3.225 ± 0.051
IleSer: 2.902 ± 0.045
IleThr: 2.76 ± 0.048
IleVal: 3.704 ± 0.052
IleTrp: 0.531 ± 0.02
IleTyr: 1.143 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.173 ± 0.055
LysCys: 0.176 ± 0.011
LysAsp: 1.328 ± 0.034
LysGlu: 1.683 ± 0.035
LysPhe: 0.66 ± 0.024
LysGly: 2.257 ± 0.043
LysHis: 0.681 ± 0.025
LysIle: 1.33 ± 0.035
LysLys: 0.975 ± 0.028
LysLeu: 2.926 ± 0.051
LysMet: 0.638 ± 0.02
LysAsn: 0.739 ± 0.024
LysPro: 1.588 ± 0.038
LysGln: 1.129 ± 0.027
LysArg: 2.508 ± 0.046
LysSer: 1.525 ± 0.028
LysThr: 1.504 ± 0.034
LysVal: 2.225 ± 0.044
LysTrp: 0.324 ± 0.015
LysTyr: 0.577 ± 0.024
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.654 ± 0.111
LeuCys: 1.128 ± 0.031
LeuAsp: 7.11 ± 0.087
LeuGlu: 7.661 ± 0.092
LeuPhe: 3.836 ± 0.066
LeuGly: 9.272 ± 0.098
LeuHis: 2.468 ± 0.044
LeuIle: 5.602 ± 0.071
LeuLys: 3.394 ± 0.052
LeuLeu: 12.002 ± 0.129
LeuMet: 2.833 ± 0.047
LeuAsn: 2.935 ± 0.047
LeuPro: 6.096 ± 0.072
LeuGln: 3.321 ± 0.049
LeuArg: 6.964 ± 0.066
LeuSer: 7.031 ± 0.077
LeuThr: 5.944 ± 0.064
LeuVal: 8.052 ± 0.078
LeuTrp: 1.482 ± 0.04
LeuTyr: 2.527 ± 0.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.11 ± 0.048
MetCys: 0.175 ± 0.01
MetAsp: 1.132 ± 0.025
MetGlu: 1.249 ± 0.027
MetPhe: 0.697 ± 0.023
MetGly: 1.876 ± 0.039
MetHis: 0.545 ± 0.018
MetIle: 1.331 ± 0.03
MetLys: 0.8 ± 0.025
MetLeu: 2.89 ± 0.053
MetMet: 0.675 ± 0.023
MetAsn: 0.818 ± 0.025
MetPro: 1.458 ± 0.028
MetGln: 0.997 ± 0.025
MetArg: 1.656 ± 0.039
MetSer: 1.798 ± 0.036
MetThr: 1.731 ± 0.036
MetVal: 1.829 ± 0.037
MetTrp: 0.212 ± 0.011
MetTyr: 0.356 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.941 ± 0.049
AsnCys: 0.257 ± 0.014
AsnAsp: 1.671 ± 0.039
AsnGlu: 1.582 ± 0.032
AsnPhe: 0.89 ± 0.026
AsnGly: 2.285 ± 0.048
AsnHis: 0.659 ± 0.023
AsnIle: 1.411 ± 0.032
AsnLys: 0.699 ± 0.021
AsnLeu: 2.912 ± 0.049
AsnMet: 0.579 ± 0.022
AsnAsn: 0.788 ± 0.029
AsnPro: 1.709 ± 0.038
AsnGln: 1.003 ± 0.025
AsnArg: 1.85 ± 0.034
AsnSer: 1.261 ± 0.033
AsnThr: 1.336 ± 0.033
AsnVal: 1.927 ± 0.037
AsnTrp: 0.372 ± 0.016
AsnTyr: 0.689 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.571 ± 0.061
ProCys: 0.391 ± 0.02
ProAsp: 3.203 ± 0.05
ProGlu: 3.868 ± 0.058
ProPhe: 1.821 ± 0.036
ProGly: 4.143 ± 0.057
ProHis: 1.239 ± 0.026
ProIle: 2.127 ± 0.039
ProLys: 1.258 ± 0.033
ProLeu: 5.535 ± 0.065
ProMet: 1.273 ± 0.03
ProAsn: 1.272 ± 0.03
ProPro: 2.271 ± 0.046
ProGln: 1.852 ± 0.041
ProArg: 2.945 ± 0.045
ProSer: 2.925 ± 0.047
ProThr: 2.31 ± 0.039
ProVal: 3.686 ± 0.055
ProTrp: 0.89 ± 0.021
ProTyr: 1.259 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.148 ± 0.072
GlnCys: 0.328 ± 0.015
GlnAsp: 1.707 ± 0.029
GlnGlu: 2.3 ± 0.042
GlnPhe: 1.062 ± 0.024
GlnGly: 3.182 ± 0.053
GlnHis: 1.048 ± 0.03
GlnIle: 1.569 ± 0.038
GlnLys: 0.869 ± 0.026
GlnLeu: 4.051 ± 0.056
GlnMet: 0.833 ± 0.022
GlnAsn: 0.801 ± 0.023
GlnPro: 1.936 ± 0.034
GlnGln: 2.066 ± 0.049
GlnArg: 3.504 ± 0.055
GlnSer: 1.859 ± 0.034
GlnThr: 1.691 ± 0.035
GlnVal: 2.901 ± 0.048
GlnTrp: 0.616 ± 0.018
GlnTyr: 0.789 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.011 ± 0.071
ArgCys: 0.666 ± 0.021
ArgAsp: 4.204 ± 0.055
ArgGlu: 5.387 ± 0.069
ArgPhe: 2.943 ± 0.054
ArgGly: 4.431 ± 0.054
ArgHis: 2.383 ± 0.047
ArgIle: 3.563 ± 0.053
ArgLys: 1.881 ± 0.035
ArgLeu: 9.593 ± 0.096
ArgMet: 1.727 ± 0.033
ArgAsn: 1.761 ± 0.039
ArgPro: 2.945 ± 0.046
ArgGln: 3.459 ± 0.057
ArgArg: 5.592 ± 0.084
ArgSer: 3.288 ± 0.054
ArgThr: 2.959 ± 0.039
ArgVal: 4.83 ± 0.056
ArgTrp: 1.16 ± 0.027
ArgTyr: 2.301 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.288 ± 0.062
SerCys: 0.484 ± 0.019
SerAsp: 3.093 ± 0.052
SerGlu: 3.394 ± 0.049
SerPhe: 2.07 ± 0.036
SerGly: 5.114 ± 0.072
SerHis: 1.5 ± 0.033
SerIle: 2.665 ± 0.045
SerLys: 1.385 ± 0.029
SerLeu: 6.803 ± 0.072
SerMet: 1.504 ± 0.033
SerAsn: 1.451 ± 0.039
SerPro: 2.941 ± 0.049
SerGln: 2.261 ± 0.039
SerArg: 4.269 ± 0.05
SerSer: 3.275 ± 0.057
SerThr: 2.72 ± 0.042
SerVal: 3.787 ± 0.053
SerTrp: 0.789 ± 0.023
SerTyr: 1.355 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.061 ± 0.067
ThrCys: 0.479 ± 0.02
ThrAsp: 2.508 ± 0.046
ThrGlu: 2.649 ± 0.045
ThrPhe: 1.9 ± 0.036
ThrGly: 4.228 ± 0.057
ThrHis: 1.246 ± 0.033
ThrIle: 2.307 ± 0.039
ThrLys: 1.097 ± 0.029
ThrLeu: 7.015 ± 0.08
ThrMet: 1.112 ± 0.024
ThrAsn: 1.263 ± 0.031
ThrPro: 3.319 ± 0.047
ThrGln: 1.89 ± 0.037
ThrArg: 3.529 ± 0.048
ThrSer: 2.853 ± 0.047
ThrThr: 2.821 ± 0.052
ThrVal: 3.44 ± 0.048
ThrTrp: 0.744 ± 0.021
ThrTyr: 1.168 ± 0.028
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.932 ± 0.075
ValCys: 0.729 ± 0.023
ValAsp: 4.168 ± 0.054
ValGlu: 4.892 ± 0.067
ValPhe: 2.591 ± 0.046
ValGly: 5.347 ± 0.068
ValHis: 1.562 ± 0.032
ValIle: 4.108 ± 0.057
ValLys: 2.035 ± 0.034
ValLeu: 7.677 ± 0.08
ValMet: 2.018 ± 0.043
ValAsn: 2.007 ± 0.036
ValPro: 3.444 ± 0.048
ValGln: 2.162 ± 0.038
ValArg: 4.403 ± 0.055
ValSer: 4.314 ± 0.061
ValThr: 4.093 ± 0.057
ValVal: 5.907 ± 0.072
ValTrp: 0.924 ± 0.027
ValTyr: 1.569 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.19 ± 0.031
TrpCys: 0.194 ± 0.011
TrpAsp: 0.594 ± 0.023
TrpGlu: 0.739 ± 0.026
TrpPhe: 0.553 ± 0.02
TrpGly: 0.966 ± 0.03
TrpHis: 0.464 ± 0.02
TrpIle: 0.693 ± 0.02
TrpLys: 0.401 ± 0.019
TrpLeu: 2.442 ± 0.046
TrpMet: 0.405 ± 0.016
TrpAsn: 0.413 ± 0.016
TrpPro: 0.748 ± 0.023
TrpGln: 0.961 ± 0.029
TrpArg: 1.239 ± 0.034
TrpSer: 0.789 ± 0.024
TrpThr: 0.632 ± 0.021
TrpVal: 0.968 ± 0.027
TrpTrp: 0.324 ± 0.018
TrpTyr: 0.361 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.356 ± 0.041
TyrCys: 0.297 ± 0.015
TyrAsp: 1.442 ± 0.034
TyrGlu: 1.336 ± 0.028
TyrPhe: 0.996 ± 0.029
TyrGly: 2.118 ± 0.043
TyrHis: 0.731 ± 0.023
TyrIle: 1.001 ± 0.028
TyrLys: 0.561 ± 0.02
TyrLeu: 2.988 ± 0.048
TyrMet: 0.492 ± 0.02
TyrAsn: 0.608 ± 0.019
TyrPro: 1.326 ± 0.033
TyrGln: 1.135 ± 0.025
TyrArg: 2.338 ± 0.047
TyrSer: 1.328 ± 0.031
TyrThr: 1.192 ± 0.029
TyrVal: 1.65 ± 0.033
TyrTrp: 0.439 ± 0.015
TyrTyr: 0.732 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4705 proteins (1492920 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
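A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an assumption about the general procedure. The function names, the fixed seed, and the use of the standard deviation as the error measure are hypothetical and are not confirmed by the source.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):  # overlapping pairs within one protein
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)

# Toy input (hypothetical one-letter sequences, not from this proteome).
toy = ["AALA", "LLAG", "GAAL", "ALLA", "AAAA"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs within each protein together, so the error reflects variation between proteins.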

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski