Amino acid dipeptide frequency for Cycloclasticus sp. symbiont of Poecilosclerida sp. N

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
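The comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's own pipeline: it assumes sequences are given as one-letter codes, that dipeptides are counted as overlapping adjacent pairs within each protein, and that any residue outside the 20 standard ones is mapped to X (Xaa). The names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille (0.25%).
EXPECTED = 1000.0 / len(STANDARD) ** 2

def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"   # would be shown in red
    if freq < EXPECTED:
        return "under"  # would be shown in blue
    return "expected"
```

With this threshold, a value such as 7.154‰ is classified as overrepresented and 0.134‰ as underrepresented.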

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
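As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

For instance, 7.154‰ corresponds to a fraction of about 0.007154, and 2.5‰ corresponds to 0.25%.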

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.154 ± 0.146
AlaCys: 1.11 ± 0.044
AlaAsp: 4.796 ± 0.09
AlaGlu: 5.005 ± 0.095
AlaPhe: 3.384 ± 0.1
AlaGly: 6.314 ± 0.124
AlaHis: 1.644 ± 0.056
AlaIle: 5.909 ± 0.111
AlaLys: 5.262 ± 0.124
AlaLeu: 9.17 ± 0.166
AlaMet: 2.238 ± 0.064
AlaAsn: 3.511 ± 0.078
AlaPro: 2.538 ± 0.081
AlaGln: 3.308 ± 0.089
AlaArg: 3.785 ± 0.086
AlaSer: 5.354 ± 0.11
AlaThr: 4.413 ± 0.13
AlaVal: 5.893 ± 0.113
AlaTrp: 0.879 ± 0.046
AlaTyr: 2.322 ± 0.074
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.894 ± 0.045
CysCys: 0.134 ± 0.015
CysAsp: 0.643 ± 0.036
CysGlu: 0.697 ± 0.037
CysPhe: 0.444 ± 0.03
CysGly: 0.992 ± 0.058
CysHis: 0.349 ± 0.026
CysIle: 0.655 ± 0.036
CysLys: 0.544 ± 0.03
CysLeu: 1.105 ± 0.051
CysMet: 0.224 ± 0.021
CysAsn: 0.331 ± 0.028
CysPro: 0.534 ± 0.039
CysGln: 0.409 ± 0.029
CysArg: 0.493 ± 0.028
CysSer: 0.729 ± 0.039
CysThr: 0.501 ± 0.032
CysVal: 0.77 ± 0.038
CysTrp: 0.115 ± 0.015
CysTyr: 0.333 ± 0.025
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.695 ± 0.102
AspCys: 0.598 ± 0.037
AspAsp: 3.287 ± 0.083
AspGlu: 4.187 ± 0.098
AspPhe: 2.577 ± 0.082
AspGly: 3.928 ± 0.109
AspHis: 1.033 ± 0.042
AspIle: 4.533 ± 0.103
AspLys: 3.569 ± 0.083
AspLeu: 5.346 ± 0.103
AspMet: 1.465 ± 0.057
AspAsn: 2.289 ± 0.069
AspPro: 1.929 ± 0.09
AspGln: 1.631 ± 0.052
AspArg: 2.275 ± 0.081
AspSer: 3.228 ± 0.083
AspThr: 2.679 ± 0.077
AspVal: 4.333 ± 0.108
AspTrp: 0.805 ± 0.037
AspTyr: 1.898 ± 0.062
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.073 ± 0.108
GluCys: 0.514 ± 0.033
GluAsp: 2.821 ± 0.078
GluGlu: 3.62 ± 0.092
GluPhe: 2.363 ± 0.075
GluGly: 4.039 ± 0.084
GluHis: 1.469 ± 0.057
GluIle: 4.017 ± 0.096
GluLys: 4.748 ± 0.115
GluLeu: 6.456 ± 0.109
GluMet: 1.629 ± 0.054
GluAsn: 2.901 ± 0.082
GluPro: 1.974 ± 0.061
GluGln: 3.602 ± 0.099
GluArg: 3.269 ± 0.077
GluSer: 3.608 ± 0.079
GluThr: 3.347 ± 0.09
GluVal: 4.134 ± 0.096
GluTrp: 0.705 ± 0.038
GluTyr: 1.767 ± 0.066
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.055 ± 0.077
PheCys: 0.526 ± 0.032
PheAsp: 2.706 ± 0.08
PheGlu: 2.632 ± 0.071
PhePhe: 1.925 ± 0.062
PheGly: 2.858 ± 0.082
PheHis: 0.781 ± 0.039
PheIle: 2.938 ± 0.076
PheLys: 2.398 ± 0.071
PheLeu: 3.684 ± 0.096
PheMet: 0.992 ± 0.044
PheAsn: 2.279 ± 0.108
PhePro: 1.508 ± 0.058
PheGln: 1.151 ± 0.046
PheArg: 1.572 ± 0.055
PheSer: 3.347 ± 0.074
PheThr: 2.038 ± 0.06
PheVal: 2.503 ± 0.061
PheTrp: 0.415 ± 0.031
PheTyr: 1.368 ± 0.062
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.432 ± 0.144
GlyCys: 0.953 ± 0.048
GlyAsp: 3.877 ± 0.107
GlyGlu: 4.543 ± 0.097
GlyPhe: 3.285 ± 0.089
GlyGly: 5.319 ± 0.142
GlyHis: 1.64 ± 0.063
GlyIle: 4.736 ± 0.102
GlyLys: 4.635 ± 0.104
GlyLeu: 7.44 ± 0.144
GlyMet: 2.046 ± 0.067
GlyAsn: 2.562 ± 0.092
GlyPro: 1.584 ± 0.061
GlyGln: 2.624 ± 0.08
GlyArg: 3.244 ± 0.08
GlySer: 4.233 ± 0.11
GlyThr: 3.573 ± 0.111
GlyVal: 5.677 ± 0.118
GlyTrp: 0.906 ± 0.05
GlyTyr: 2.274 ± 0.069
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.755 ± 0.067
HisCys: 0.325 ± 0.024
HisAsp: 1.046 ± 0.046
HisGlu: 1.138 ± 0.05
HisPhe: 1.007 ± 0.047
HisGly: 1.584 ± 0.055
HisHis: 0.612 ± 0.04
HisIle: 1.502 ± 0.048
HisLys: 1.272 ± 0.05
HisLeu: 2.314 ± 0.069
HisMet: 0.544 ± 0.033
HisAsn: 0.844 ± 0.038
HisPro: 1.109 ± 0.053
HisGln: 0.959 ± 0.045
HisArg: 1.013 ± 0.053
HisSer: 1.251 ± 0.052
HisThr: 1.11 ± 0.048
HisVal: 1.397 ± 0.06
HisTrp: 0.3 ± 0.023
HisTyr: 0.791 ± 0.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.041 ± 0.112
IleCys: 0.762 ± 0.043
IleAsp: 4.57 ± 0.11
IleGlu: 4.933 ± 0.102
IlePhe: 2.287 ± 0.078
IleGly: 4.629 ± 0.11
IleHis: 1.381 ± 0.052
IleIle: 4.378 ± 0.114
IleLys: 4.791 ± 0.106
IleLeu: 5.297 ± 0.106
IleMet: 1.303 ± 0.058
IleAsn: 3.339 ± 0.088
IlePro: 2.595 ± 0.072
IleGln: 2.459 ± 0.065
IleArg: 2.971 ± 0.075
IleSer: 4.855 ± 0.089
IleThr: 3.85 ± 0.105
IleVal: 4.3 ± 0.101
IleTrp: 0.579 ± 0.032
IleTyr: 1.785 ± 0.057
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.356 ± 0.099
LysCys: 0.458 ± 0.038
LysAsp: 3.154 ± 0.079
LysGlu: 4.148 ± 0.096
LysPhe: 1.71 ± 0.065
LysGly: 4.101 ± 0.108
LysHis: 1.397 ± 0.056
LysIle: 3.881 ± 0.089
LysLys: 4.816 ± 0.136
LysLeu: 5.934 ± 0.12
LysMet: 1.644 ± 0.057
LysAsn: 3.216 ± 0.096
LysPro: 2.671 ± 0.069
LysGln: 3.45 ± 0.097
LysArg: 3.507 ± 0.093
LysSer: 3.764 ± 0.077
LysThr: 3.809 ± 0.104
LysVal: 4.021 ± 0.09
LysTrp: 0.625 ± 0.033
LysTyr: 1.508 ± 0.059
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.059 ± 0.146
LeuCys: 1.038 ± 0.042
LeuAsp: 6.004 ± 0.103
LeuGlu: 6.318 ± 0.119
LeuPhe: 4.074 ± 0.11
LeuGly: 7.035 ± 0.129
LeuHis: 2.096 ± 0.073
LeuIle: 6.476 ± 0.136
LeuLys: 6.283 ± 0.138
LeuLeu: 10.203 ± 0.213
LeuMet: 2.611 ± 0.074
LeuAsn: 4.855 ± 0.108
LeuPro: 4.321 ± 0.084
LeuGln: 3.515 ± 0.1
LeuArg: 4.463 ± 0.105
LeuSer: 7.884 ± 0.131
LeuThr: 5.714 ± 0.123
LeuVal: 6.392 ± 0.136
LeuTrp: 0.933 ± 0.038
LeuTyr: 2.42 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.404 ± 0.065
MetCys: 0.224 ± 0.02
MetAsp: 1.315 ± 0.059
MetGlu: 1.319 ± 0.052
MetPhe: 1.005 ± 0.08
MetGly: 1.898 ± 0.065
MetHis: 0.481 ± 0.034
MetIle: 1.379 ± 0.055
MetLys: 1.471 ± 0.051
MetLeu: 2.558 ± 0.076
MetMet: 0.664 ± 0.04
MetAsn: 1.165 ± 0.048
MetPro: 1.206 ± 0.049
MetGln: 1.13 ± 0.047
MetArg: 1.27 ± 0.054
MetSer: 2.055 ± 0.07
MetThr: 1.407 ± 0.042
MetVal: 1.695 ± 0.061
MetTrp: 0.175 ± 0.016
MetTyr: 0.522 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.626 ± 0.077
AsnCys: 0.425 ± 0.03
AsnAsp: 2.338 ± 0.067
AsnGlu: 2.657 ± 0.071
AsnPhe: 1.551 ± 0.058
AsnGly: 3.051 ± 0.089
AsnHis: 0.849 ± 0.037
AsnIle: 3.431 ± 0.09
AsnLys: 3.144 ± 0.091
AsnLeu: 3.916 ± 0.103
AsnMet: 1.175 ± 0.068
AsnAsn: 2.201 ± 0.081
AsnPro: 2.094 ± 0.065
AsnGln: 1.601 ± 0.065
AsnArg: 2.051 ± 0.069
AsnSer: 2.581 ± 0.085
AsnThr: 2.5 ± 0.073
AsnVal: 2.821 ± 0.078
AsnTrp: 0.581 ± 0.037
AsnTyr: 1.212 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.082 ± 0.091
ProCys: 0.366 ± 0.028
ProAsp: 2.383 ± 0.066
ProGlu: 2.776 ± 0.084
ProPhe: 1.553 ± 0.063
ProGly: 2.272 ± 0.082
ProHis: 0.886 ± 0.043
ProIle: 2.529 ± 0.073
ProLys: 2.129 ± 0.062
ProLeu: 3.885 ± 0.085
ProMet: 0.962 ± 0.042
ProAsn: 1.652 ± 0.06
ProPro: 1.214 ± 0.046
ProGln: 1.379 ± 0.057
ProArg: 1.403 ± 0.055
ProSer: 2.501 ± 0.065
ProThr: 2.071 ± 0.079
ProVal: 2.951 ± 0.071
ProTrp: 0.399 ± 0.03
ProTyr: 1.2 ± 0.045
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.957 ± 0.096
GlnCys: 0.438 ± 0.03
GlnAsp: 1.707 ± 0.057
GlnGlu: 2.09 ± 0.067
GlnPhe: 1.506 ± 0.053
GlnGly: 2.599 ± 0.066
GlnHis: 1.091 ± 0.047
GlnIle: 2.445 ± 0.07
GlnLys: 2.424 ± 0.072
GlnLeu: 4.483 ± 0.11
GlnMet: 1.007 ± 0.043
GlnAsn: 1.564 ± 0.067
GlnPro: 1.373 ± 0.061
GlnGln: 2.5 ± 0.092
GlnArg: 2.392 ± 0.073
GlnSer: 2.414 ± 0.066
GlnThr: 2.225 ± 0.076
GlnVal: 2.63 ± 0.071
GlnTrp: 0.563 ± 0.039
GlnTyr: 1.13 ± 0.051
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.491 ± 0.083
ArgCys: 0.499 ± 0.035
ArgAsp: 2.503 ± 0.075
ArgGlu: 2.891 ± 0.089
ArgPhe: 2.336 ± 0.075
ArgGly: 3.057 ± 0.077
ArgHis: 1.089 ± 0.044
ArgIle: 3.084 ± 0.083
ArgLys: 2.724 ± 0.084
ArgLeu: 5.182 ± 0.118
ArgMet: 1.251 ± 0.049
ArgAsn: 1.822 ± 0.061
ArgPro: 1.742 ± 0.056
ArgGln: 2.129 ± 0.072
ArgArg: 2.47 ± 0.071
ArgSer: 2.566 ± 0.086
ArgThr: 2.235 ± 0.07
ArgVal: 3.366 ± 0.094
ArgTrp: 0.643 ± 0.038
ArgTyr: 1.644 ± 0.057
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.317 ± 0.103
SerCys: 0.756 ± 0.039
SerAsp: 3.67 ± 0.086
SerGlu: 3.725 ± 0.09
SerPhe: 2.735 ± 0.08
SerGly: 5.147 ± 0.119
SerHis: 1.485 ± 0.049
SerIle: 4.506 ± 0.099
SerLys: 3.928 ± 0.1
SerLeu: 6.928 ± 0.136
SerMet: 1.691 ± 0.061
SerAsn: 2.747 ± 0.083
SerPro: 2.41 ± 0.068
SerGln: 2.375 ± 0.071
SerArg: 3.008 ± 0.086
SerSer: 4.413 ± 0.104
SerThr: 3.458 ± 0.096
SerVal: 4.691 ± 0.099
SerTrp: 0.692 ± 0.038
SerTyr: 1.823 ± 0.069
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.528 ± 0.115
ThrCys: 0.493 ± 0.029
ThrAsp: 3.064 ± 0.089
ThrGlu: 2.72 ± 0.07
ThrPhe: 2.057 ± 0.057
ThrGly: 4.344 ± 0.107
ThrHis: 1.331 ± 0.048
ThrIle: 3.522 ± 0.102
ThrLys: 2.702 ± 0.085
ThrLeu: 6.121 ± 0.128
ThrMet: 1.165 ± 0.047
ThrAsn: 2.053 ± 0.071
ThrPro: 2.694 ± 0.097
ThrGln: 2.207 ± 0.076
ThrArg: 2.459 ± 0.069
ThrSer: 3.23 ± 0.076
ThrThr: 2.883 ± 0.097
ThrVal: 4.003 ± 0.13
ThrTrp: 0.497 ± 0.026
ThrTyr: 1.483 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.841 ± 0.107
ValCys: 0.789 ± 0.046
ValAsp: 4.251 ± 0.109
ValGlu: 4.461 ± 0.099
ValPhe: 3.064 ± 0.085
ValGly: 4.586 ± 0.097
ValHis: 1.284 ± 0.056
ValIle: 4.9 ± 0.1
ValLys: 4.068 ± 0.108
ValLeu: 7.136 ± 0.137
ValMet: 1.862 ± 0.064
ValAsn: 2.944 ± 0.083
ValPro: 2.537 ± 0.076
ValGln: 2.079 ± 0.064
ValArg: 2.985 ± 0.082
ValSer: 4.886 ± 0.097
ValThr: 3.844 ± 0.136
ValVal: 5.003 ± 0.129
ValTrp: 0.746 ± 0.044
ValTyr: 1.847 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.859 ± 0.037
TrpCys: 0.136 ± 0.017
TrpAsp: 0.629 ± 0.045
TrpGlu: 0.602 ± 0.036
TrpPhe: 0.516 ± 0.037
TrpGly: 0.738 ± 0.051
TrpHis: 0.312 ± 0.025
TrpIle: 0.532 ± 0.031
TrpLys: 0.551 ± 0.035
TrpLeu: 1.465 ± 0.065
TrpMet: 0.257 ± 0.023
TrpAsn: 0.409 ± 0.034
TrpPro: 0.388 ± 0.028
TrpGln: 0.639 ± 0.034
TrpArg: 0.579 ± 0.033
TrpSer: 0.74 ± 0.04
TrpThr: 0.448 ± 0.036
TrpVal: 0.801 ± 0.039
TrpTrp: 0.136 ± 0.019
TrpTyr: 0.312 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.373 ± 0.065
TyrCys: 0.349 ± 0.027
TyrAsp: 1.518 ± 0.053
TyrGlu: 1.594 ± 0.056
TyrPhe: 1.296 ± 0.053
TyrGly: 2.094 ± 0.068
TyrHis: 0.662 ± 0.041
TyrIle: 1.751 ± 0.065
TyrLys: 1.712 ± 0.059
TyrLeu: 3.07 ± 0.086
TyrMet: 0.612 ± 0.034
TyrAsn: 1.12 ± 0.046
TyrPro: 1.175 ± 0.054
TyrGln: 1.391 ± 0.054
TyrArg: 1.514 ± 0.065
TyrSer: 1.864 ± 0.063
TyrThr: 1.455 ± 0.057
TyrVal: 1.73 ± 0.056
TyrTrp: 0.357 ± 0.028
TyrTyr: 0.822 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1736 proteins (513302 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
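Protein-level bootstrapping can be sketched as below. This is only an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates (the page does not say which spread measure is used). The names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide's per mille frequency."""
    rng = random.Random(seed)
    # Count once per protein so each replicate only sums cached numbers.
    per_protein = [(c[pair], sum(c.values())) for c in map(pair_counts, proteins)]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.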

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski