Amino acid dipepetide frequency for Halomonas arcis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.108AlaAla: 11.108 ± 0.127
1.171AlaCys: 1.171 ± 0.033
5.862AlaAsp: 5.862 ± 0.086
6.709AlaGlu: 6.709 ± 0.08
4.051AlaPhe: 4.051 ± 0.058
7.978AlaGly: 7.978 ± 0.12
2.319AlaHis: 2.319 ± 0.049
5.608AlaIle: 5.608 ± 0.078
3.283AlaLys: 3.283 ± 0.058
13.708AlaLeu: 13.708 ± 0.132
3.359AlaMet: 3.359 ± 0.053
3.105AlaAsn: 3.105 ± 0.057
4.358AlaPro: 4.358 ± 0.074
4.511AlaGln: 4.511 ± 0.064
6.37AlaArg: 6.37 ± 0.069
6.564AlaSer: 6.564 ± 0.092
5.333AlaThr: 5.333 ± 0.068
7.096AlaVal: 7.096 ± 0.084
1.679AlaTrp: 1.679 ± 0.041
2.482AlaTyr: 2.482 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.887CysAla: 0.887 ± 0.027
0.129CysCys: 0.129 ± 0.01
0.594CysAsp: 0.594 ± 0.022
0.492CysGlu: 0.492 ± 0.02
0.36CysPhe: 0.36 ± 0.019
0.835CysGly: 0.835 ± 0.03
0.329CysHis: 0.329 ± 0.017
0.418CysIle: 0.418 ± 0.018
0.216CysLys: 0.216 ± 0.014
1.055CysLeu: 1.055 ± 0.031
0.218CysMet: 0.218 ± 0.015
0.23CysAsn: 0.23 ± 0.014
0.474CysPro: 0.474 ± 0.022
0.443CysGln: 0.443 ± 0.02
0.642CysArg: 0.642 ± 0.021
0.494CysSer: 0.494 ± 0.02
0.389CysThr: 0.389 ± 0.018
0.693CysVal: 0.693 ± 0.024
0.132CysTrp: 0.132 ± 0.011
0.254CysTyr: 0.254 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
6.501AspAla: 6.501 ± 0.089
0.435AspCys: 0.435 ± 0.02
3.941AspAsp: 3.941 ± 0.081
4.118AspGlu: 4.118 ± 0.065
2.071AspPhe: 2.071 ± 0.045
4.301AspGly: 4.301 ± 0.093
1.414AspHis: 1.414 ± 0.037
3.384AspIle: 3.384 ± 0.061
1.72AspLys: 1.72 ± 0.037
5.106AspLeu: 5.106 ± 0.07
1.413AspMet: 1.413 ± 0.036
1.999AspAsn: 1.999 ± 0.054
2.839AspPro: 2.839 ± 0.049
2.373AspGln: 2.373 ± 0.047
3.124AspArg: 3.124 ± 0.051
2.96AspSer: 2.96 ± 0.067
3.18AspThr: 3.18 ± 0.068
4.261AspVal: 4.261 ± 0.08
0.904AspTrp: 0.904 ± 0.029
1.765AspTyr: 1.765 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
7.389GluAla: 7.389 ± 0.094
0.462GluCys: 0.462 ± 0.021
2.991GluAsp: 2.991 ± 0.055
3.847GluGlu: 3.847 ± 0.073
1.696GluPhe: 1.696 ± 0.04
4.46GluGly: 4.46 ± 0.073
1.732GluHis: 1.732 ± 0.043
2.913GluIle: 2.913 ± 0.054
2.077GluLys: 2.077 ± 0.05
6.54GluLeu: 6.54 ± 0.076
1.526GluMet: 1.526 ± 0.041
1.822GluAsn: 1.822 ± 0.043
2.543GluPro: 2.543 ± 0.051
3.67GluGln: 3.67 ± 0.066
5.171GluArg: 5.171 ± 0.076
3.312GluSer: 3.312 ± 0.056
3.298GluThr: 3.298 ± 0.052
4.252GluVal: 4.252 ± 0.063
0.944GluTrp: 0.944 ± 0.029
1.236GluTyr: 1.236 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.599PheAla: 3.599 ± 0.061
0.415PheCys: 0.415 ± 0.019
2.491PheAsp: 2.491 ± 0.049
2.171PheGlu: 2.171 ± 0.045
1.463PhePhe: 1.463 ± 0.04
3.129PheGly: 3.129 ± 0.059
0.786PheHis: 0.786 ± 0.025
2.012PheIle: 2.012 ± 0.046
1.108PheLys: 1.108 ± 0.032
3.212PheLeu: 3.212 ± 0.057
0.941PheMet: 0.941 ± 0.031
1.249PheAsn: 1.249 ± 0.034
1.436PhePro: 1.436 ± 0.034
1.262PheGln: 1.262 ± 0.034
1.833PheArg: 1.833 ± 0.045
2.551PheSer: 2.551 ± 0.049
1.974PheThr: 1.974 ± 0.041
2.438PheVal: 2.438 ± 0.047
0.511PheTrp: 0.511 ± 0.023
1.054PheTyr: 1.054 ± 0.032
0.0PheXaa: 0.0 ± 0.0
Gly
6.823GlyAla: 6.823 ± 0.092
0.892GlyCys: 0.892 ± 0.032
4.469GlyAsp: 4.469 ± 0.144
5.143GlyGlu: 5.143 ± 0.075
3.145GlyPhe: 3.145 ± 0.052
6.09GlyGly: 6.09 ± 0.136
1.948GlyHis: 1.948 ± 0.044
4.611GlyIle: 4.611 ± 0.069
2.776GlyLys: 2.776 ± 0.049
8.547GlyLeu: 8.547 ± 0.104
2.373GlyMet: 2.373 ± 0.041
2.274GlyAsn: 2.274 ± 0.071
2.458GlyPro: 2.458 ± 0.05
3.224GlyGln: 3.224 ± 0.054
4.542GlyArg: 4.542 ± 0.06
4.364GlySer: 4.364 ± 0.086
3.64GlyThr: 3.64 ± 0.067
6.182GlyVal: 6.182 ± 0.079
1.312GlyTrp: 1.312 ± 0.036
2.416GlyTyr: 2.416 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.443HisAla: 2.443 ± 0.047
0.305HisCys: 0.305 ± 0.016
1.478HisAsp: 1.478 ± 0.036
1.334HisGlu: 1.334 ± 0.039
1.088HisPhe: 1.088 ± 0.033
1.969HisGly: 1.969 ± 0.041
0.797HisHis: 0.797 ± 0.024
1.087HisIle: 1.087 ± 0.028
0.621HisLys: 0.621 ± 0.023
2.626HisLeu: 2.626 ± 0.049
0.53HisMet: 0.53 ± 0.022
0.656HisAsn: 0.656 ± 0.021
1.444HisPro: 1.444 ± 0.035
1.267HisGln: 1.267 ± 0.039
1.569HisArg: 1.569 ± 0.041
1.3HisSer: 1.3 ± 0.034
1.177HisThr: 1.177 ± 0.027
1.527HisVal: 1.527 ± 0.034
0.453HisTrp: 0.453 ± 0.021
0.883HisTyr: 0.883 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
6.095IleAla: 6.095 ± 0.075
0.465IleCys: 0.465 ± 0.021
3.659IleAsp: 3.659 ± 0.055
3.736IleGlu: 3.736 ± 0.065
1.61IlePhe: 1.61 ± 0.045
4.5IleGly: 4.5 ± 0.067
1.084IleHis: 1.084 ± 0.03
2.491IleIle: 2.491 ± 0.05
1.612IleLys: 1.612 ± 0.039
3.96IleLeu: 3.96 ± 0.064
1.114IleMet: 1.114 ± 0.033
1.968IleAsn: 1.968 ± 0.041
2.294IlePro: 2.294 ± 0.058
1.679IleGln: 1.679 ± 0.036
2.842IleArg: 2.842 ± 0.046
3.14IleSer: 3.14 ± 0.054
3.013IleThr: 3.013 ± 0.056
3.539IleVal: 3.539 ± 0.063
0.537IleTrp: 0.537 ± 0.023
1.226IleTyr: 1.226 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
3.423LysAla: 3.423 ± 0.062
0.179LysCys: 0.179 ± 0.012
1.473LysAsp: 1.473 ± 0.039
1.665LysGlu: 1.665 ± 0.045
0.688LysPhe: 0.688 ± 0.023
2.344LysGly: 2.344 ± 0.044
0.751LysHis: 0.751 ± 0.027
1.324LysIle: 1.324 ± 0.036
1.189LysLys: 1.189 ± 0.041
3.142LysLeu: 3.142 ± 0.06
0.744LysMet: 0.744 ± 0.026
0.815LysAsn: 0.815 ± 0.028
1.691LysPro: 1.691 ± 0.041
1.56LysGln: 1.56 ± 0.039
2.663LysArg: 2.663 ± 0.06
1.643LysSer: 1.643 ± 0.044
1.734LysThr: 1.734 ± 0.039
2.216LysVal: 2.216 ± 0.044
0.324LysTrp: 0.324 ± 0.017
0.521LysTyr: 0.521 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
13.398LeuAla: 13.398 ± 0.125
1.12LeuCys: 1.12 ± 0.031
6.629LeuAsp: 6.629 ± 0.085
6.764LeuGlu: 6.764 ± 0.085
3.861LeuPhe: 3.861 ± 0.063
8.675LeuGly: 8.675 ± 0.086
2.349LeuHis: 2.349 ± 0.043
5.659LeuIle: 5.659 ± 0.08
3.72LeuLys: 3.72 ± 0.059
12.207LeuLeu: 12.207 ± 0.162
3.008LeuMet: 3.008 ± 0.057
3.646LeuAsn: 3.646 ± 0.049
6.132LeuPro: 6.132 ± 0.074
3.426LeuGln: 3.426 ± 0.067
6.425LeuArg: 6.425 ± 0.075
7.45LeuSer: 7.45 ± 0.086
6.478LeuThr: 6.478 ± 0.072
7.758LeuVal: 7.758 ± 0.092
1.405LeuTrp: 1.405 ± 0.038
2.48LeuTyr: 2.48 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
3.245MetAla: 3.245 ± 0.057
0.174MetCys: 0.174 ± 0.011
1.149MetAsp: 1.149 ± 0.029
1.243MetGlu: 1.243 ± 0.038
0.731MetPhe: 0.731 ± 0.031
2.003MetGly: 2.003 ± 0.041
0.548MetHis: 0.548 ± 0.02
1.311MetIle: 1.311 ± 0.036
0.862MetLys: 0.862 ± 0.027
3.003MetLeu: 3.003 ± 0.054
0.709MetMet: 0.709 ± 0.025
0.862MetAsn: 0.862 ± 0.025
1.472MetPro: 1.472 ± 0.035
1.232MetGln: 1.232 ± 0.033
1.636MetArg: 1.636 ± 0.042
1.89MetSer: 1.89 ± 0.039
1.721MetThr: 1.721 ± 0.041
1.912MetVal: 1.912 ± 0.043
0.234MetTrp: 0.234 ± 0.013
0.382MetTyr: 0.382 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.575AsnAla: 3.575 ± 0.062
0.254AsnCys: 0.254 ± 0.017
1.893AsnAsp: 1.893 ± 0.051
1.768AsnGlu: 1.768 ± 0.038
0.975AsnPhe: 0.975 ± 0.032
2.412AsnGly: 2.412 ± 0.058
0.686AsnHis: 0.686 ± 0.026
1.58AsnIle: 1.58 ± 0.039
0.757AsnLys: 0.757 ± 0.026
2.881AsnLeu: 2.881 ± 0.053
0.666AsnMet: 0.666 ± 0.022
1.014AsnAsn: 1.014 ± 0.029
1.806AsnPro: 1.806 ± 0.037
1.361AsnGln: 1.361 ± 0.034
1.985AsnArg: 1.985 ± 0.036
1.496AsnSer: 1.496 ± 0.039
1.602AsnThr: 1.602 ± 0.042
2.271AsnVal: 2.271 ± 0.047
0.428AsnTrp: 0.428 ± 0.019
0.758AsnTyr: 0.758 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
4.567ProAla: 4.567 ± 0.066
0.343ProCys: 0.343 ± 0.018
3.151ProAsp: 3.151 ± 0.046
3.608ProGlu: 3.608 ± 0.061
1.794ProPhe: 1.794 ± 0.042
3.75ProGly: 3.75 ± 0.05
1.137ProHis: 1.137 ± 0.03
2.213ProIle: 2.213 ± 0.044
1.356ProLys: 1.356 ± 0.039
5.501ProLeu: 5.501 ± 0.075
1.22ProMet: 1.22 ± 0.032
1.506ProAsn: 1.506 ± 0.036
2.138ProPro: 2.138 ± 0.049
1.801ProGln: 1.801 ± 0.038
2.556ProArg: 2.556 ± 0.049
2.969ProSer: 2.969 ± 0.052
2.439ProThr: 2.439 ± 0.045
3.541ProVal: 3.541 ± 0.059
0.813ProTrp: 0.813 ± 0.028
1.106ProTyr: 1.106 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.485GlnAla: 5.485 ± 0.076
0.337GlnCys: 0.337 ± 0.016
1.878GlnAsp: 1.878 ± 0.039
2.309GlnGlu: 2.309 ± 0.046
1.207GlnPhe: 1.207 ± 0.04
3.381GlnGly: 3.381 ± 0.056
1.287GlnHis: 1.287 ± 0.034
1.679GlnIle: 1.679 ± 0.035
1.111GlnLys: 1.111 ± 0.034
5.163GlnLeu: 5.163 ± 0.079
1.012GlnMet: 1.012 ± 0.03
0.997GlnAsn: 0.997 ± 0.027
2.121GlnPro: 2.121 ± 0.047
2.839GlnGln: 2.839 ± 0.064
3.876GlnArg: 3.876 ± 0.074
2.226GlnSer: 2.226 ± 0.044
1.995GlnThr: 1.995 ± 0.043
3.122GlnVal: 3.122 ± 0.052
0.79GlnTrp: 0.79 ± 0.023
0.846GlnTyr: 0.846 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
5.411ArgAla: 5.411 ± 0.07
0.579ArgCys: 0.579 ± 0.023
3.824ArgAsp: 3.824 ± 0.062
4.544ArgGlu: 4.544 ± 0.073
2.755ArgPhe: 2.755 ± 0.049
3.873ArgGly: 3.873 ± 0.056
2.118ArgHis: 2.118 ± 0.044
3.229ArgIle: 3.229 ± 0.054
1.83ArgLys: 1.83 ± 0.045
8.391ArgLeu: 8.391 ± 0.1
1.641ArgMet: 1.641 ± 0.04
1.683ArgAsn: 1.683 ± 0.037
2.462ArgPro: 2.462 ± 0.047
3.421ArgGln: 3.421 ± 0.063
4.546ArgArg: 4.546 ± 0.075
3.13ArgSer: 3.13 ± 0.045
2.595ArgThr: 2.595 ± 0.046
4.333ArgVal: 4.333 ± 0.062
1.139ArgTrp: 1.139 ± 0.033
2.199ArgTyr: 2.199 ± 0.044
0.0ArgXaa: 0.0 ± 0.0
Ser
6.021SerAla: 6.021 ± 0.084
0.442SerCys: 0.442 ± 0.019
3.487SerAsp: 3.487 ± 0.071
3.462SerGlu: 3.462 ± 0.048
2.163SerPhe: 2.163 ± 0.042
5.135SerGly: 5.135 ± 0.085
1.433SerHis: 1.433 ± 0.035
2.85SerIle: 2.85 ± 0.063
1.515SerLys: 1.515 ± 0.035
7.066SerLeu: 7.066 ± 0.08
1.561SerMet: 1.561 ± 0.036
1.605SerAsn: 1.605 ± 0.034
2.955SerPro: 2.955 ± 0.051
2.648SerGln: 2.648 ± 0.045
3.73SerArg: 3.73 ± 0.061
3.573SerSer: 3.573 ± 0.078
2.997SerThr: 2.997 ± 0.052
4.184SerVal: 4.184 ± 0.065
0.784SerTrp: 0.784 ± 0.026
1.361SerTyr: 1.361 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
5.198ThrAla: 5.198 ± 0.062
0.449ThrCys: 0.449 ± 0.022
2.664ThrAsp: 2.664 ± 0.052
2.467ThrGlu: 2.467 ± 0.045
1.901ThrPhe: 1.901 ± 0.04
4.11ThrGly: 4.11 ± 0.069
1.346ThrHis: 1.346 ± 0.03
2.332ThrIle: 2.332 ± 0.054
1.165ThrLys: 1.165 ± 0.035
7.64ThrLeu: 7.64 ± 0.102
1.182ThrMet: 1.182 ± 0.031
1.369ThrAsn: 1.369 ± 0.032
3.616ThrPro: 3.616 ± 0.055
2.329ThrGln: 2.329 ± 0.043
3.148ThrArg: 3.148 ± 0.053
3.164ThrSer: 3.164 ± 0.06
2.871ThrThr: 2.871 ± 0.057
3.549ThrVal: 3.549 ± 0.08
0.724ThrTrp: 0.724 ± 0.026
1.218ThrTyr: 1.218 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
7.842ValAla: 7.842 ± 0.091
0.669ValCys: 0.669 ± 0.021
4.241ValAsp: 4.241 ± 0.081
4.575ValGlu: 4.575 ± 0.065
2.57ValPhe: 2.57 ± 0.047
5.325ValGly: 5.325 ± 0.07
1.367ValHis: 1.367 ± 0.038
4.175ValIle: 4.175 ± 0.067
2.198ValLys: 2.198 ± 0.043
7.522ValLeu: 7.522 ± 0.079
2.176ValMet: 2.176 ± 0.04
2.303ValAsn: 2.303 ± 0.045
3.207ValPro: 3.207 ± 0.057
2.149ValGln: 2.149 ± 0.043
3.905ValArg: 3.905 ± 0.059
4.61ValSer: 4.61 ± 0.07
4.199ValThr: 4.199 ± 0.07
5.805ValVal: 5.805 ± 0.078
0.891ValTrp: 0.891 ± 0.028
1.625ValTyr: 1.625 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
1.15TrpAla: 1.15 ± 0.035
0.191TrpCys: 0.191 ± 0.014
0.588TrpAsp: 0.588 ± 0.025
0.72TrpGlu: 0.72 ± 0.025
0.525TrpPhe: 0.525 ± 0.023
0.919TrpGly: 0.919 ± 0.032
0.433TrpHis: 0.433 ± 0.018
0.675TrpIle: 0.675 ± 0.024
0.386TrpLys: 0.386 ± 0.016
2.413TrpLeu: 2.413 ± 0.054
0.41TrpMet: 0.41 ± 0.021
0.39TrpAsn: 0.39 ± 0.019
0.731TrpPro: 0.731 ± 0.027
1.103TrpGln: 1.103 ± 0.033
1.116TrpArg: 1.116 ± 0.03
0.762TrpSer: 0.762 ± 0.025
0.569TrpThr: 0.569 ± 0.022
1.026TrpVal: 1.026 ± 0.032
0.286TrpTrp: 0.286 ± 0.019
0.325TrpTyr: 0.325 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.494TyrAla: 2.494 ± 0.044
0.272TyrCys: 0.272 ± 0.016
1.344TyrAsp: 1.344 ± 0.036
1.207TyrGlu: 1.207 ± 0.03
0.981TyrPhe: 0.981 ± 0.033
2.039TyrGly: 2.039 ± 0.045
0.693TyrHis: 0.693 ± 0.025
1.013TyrIle: 1.013 ± 0.029
0.588TyrLys: 0.588 ± 0.021
2.907TyrLeu: 2.907 ± 0.052
0.516TyrMet: 0.516 ± 0.021
0.677TyrAsn: 0.677 ± 0.024
1.331TyrPro: 1.331 ± 0.035
1.364TyrGln: 1.364 ± 0.036
2.022TyrArg: 2.022 ± 0.035
1.372TyrSer: 1.372 ± 0.031
1.236TyrThr: 1.236 ± 0.038
1.65TyrVal: 1.65 ± 0.042
0.431TyrTrp: 0.431 ± 0.019
0.73TyrTyr: 0.73 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3843 proteins (1237372 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski