Amino acid dipeptide frequency for Alysiella crassa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
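As a rough illustration of the calculation described above, the following Python sketch counts overlapping adjacent residue pairs within each protein and turns the counts into fractions. The page does not describe how Proteome-pI computes these values, so the function name `dipeptide_frequencies`, the use of one-letter codes, the mapping of non-standard residues to `X`, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 inside each sequence; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


# Hypothetical toy input, not taken from the Alysiella crassa proteome.
toy = ["MAAL", "MALK"]
freqs = dipeptide_frequencies(toy)

# Uniform expectation for 400 equally likely dipeptides: 0.0025 (0.25%, or 2.5 per mille).
uniform = 1 / 400
```

With real data the input would be the protein sequences of the proteome, and each resulting fraction could be compared against `uniform`.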

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
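The unit conversion can be sketched in a few lines of Python. The two example values (AlaAla and TrpPro) are copied from the table; the helper names and the simple greater-than or less-than comparison against the uniform expectation are assumptions, since the page does not state the exact rule used for the red and blue marking or whether the error is taken into account.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille: 2.5
EXPECTED_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a value relative to the expectation (assumed simple threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


ala_ala = 12.184  # AlaAla, per mille, from the table
trp_pro = 0.06    # TrpPro, per mille, from the table

ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.012184, i.e. roughly 1.2%
ala_ala_label = representation(ala_ala)
trp_pro_label = representation(trp_pro)
```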

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Values are in ‰ (± error).

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 12.184 ± 0.299 | 1.154 ± 0.045 | 5.578 ± 0.097 | 6.729 ± 0.245 | 3.526 ± 0.079 | 6.117 ± 0.127 | 2.311 ± 0.063 | 5.372 ± 0.09 | 5.777 ± 0.155 | 9.374 ± 0.159 | 2.871 ± 0.072 | 4.242 ± 0.112 | 3.618 ± 0.09 | 5.986 ± 0.268 | 4.05 ± 0.094 | 4.286 ± 0.082 | 4.857 ± 0.138 | 6.646 ± 0.107 | 1.434 ± 0.06 | 2.704 ± 0.067 | 0.0 ± 0.0 |
| Cys | 0.919 ± 0.032 | 0.151 ± 0.016 | 0.504 ± 0.027 | 0.602 ± 0.033 | 0.456 ± 0.026 | 0.98 ± 0.047 | 0.279 ± 0.019 | 0.459 ± 0.023 | 0.379 ± 0.021 | 1.025 ± 0.04 | 0.19 ± 0.018 | 0.362 ± 0.023 | 0.463 ± 0.028 | 0.485 ± 0.025 | 0.464 ± 0.027 | 0.464 ± 0.028 | 0.543 ± 0.029 | 0.683 ± 0.035 | 0.113 ± 0.011 | 0.306 ± 0.021 | 0.0 ± 0.0 |
| Asp | 3.641 ± 0.077 | 0.524 ± 0.026 | 3.244 ± 0.081 | 3.929 ± 0.083 | 2.881 ± 0.066 | 4.084 ± 0.16 | 0.767 ± 0.029 | 3.043 ± 0.068 | 3.58 ± 0.087 | 5.744 ± 0.101 | 1.221 ± 0.037 | 2.71 ± 0.074 | 1.683 ± 0.11 | 0.968 ± 0.039 | 1.644 ± 0.056 | 2.541 ± 0.066 | 2.967 ± 0.098 | 4.128 ± 0.09 | 1.058 ± 0.037 | 2.179 ± 0.055 | 0.0 ± 0.0 |
| Glu | 4.733 ± 0.132 | 0.518 ± 0.029 | 2.211 ± 0.06 | 2.647 ± 0.077 | 2.414 ± 0.059 | 2.438 ± 0.076 | 1.655 ± 0.055 | 4.272 ± 0.09 | 3.967 ± 0.098 | 6.217 ± 0.111 | 1.867 ± 0.049 | 3.745 ± 0.073 | 1.867 ± 0.058 | 3.936 ± 0.103 | 3.37 ± 0.077 | 2.641 ± 0.072 | 3.756 ± 0.089 | 3.304 ± 0.081 | 0.879 ± 0.034 | 1.796 ± 0.054 | 0.0 ± 0.0 |
| Phe | 4.454 ± 0.084 | 0.529 ± 0.026 | 2.711 ± 0.07 | 2.212 ± 0.061 | 1.761 ± 0.057 | 3.221 ± 0.083 | 0.908 ± 0.037 | 2.368 ± 0.067 | 1.947 ± 0.053 | 3.551 ± 0.095 | 0.907 ± 0.04 | 1.879 ± 0.053 | 1.584 ± 0.045 | 1.758 ± 0.052 | 1.837 ± 0.059 | 2.652 ± 0.078 | 2.281 ± 0.089 | 2.964 ± 0.067 | 0.659 ± 0.033 | 1.344 ± 0.045 | 0.0 ± 0.0 |
| Gly | 5.861 ± 0.127 | 0.781 ± 0.034 | 3.555 ± 0.156 | 4.183 ± 0.084 | 3.068 ± 0.072 | 4.982 ± 0.112 | 1.284 ± 0.047 | 3.686 ± 0.076 | 4.924 ± 0.096 | 6.638 ± 0.111 | 2.003 ± 0.053 | 3.121 ± 0.108 | 0.592 ± 0.033 | 2.54 ± 0.063 | 2.929 ± 0.076 | 4.471 ± 0.106 | 3.463 ± 0.122 | 5.663 ± 0.106 | 0.961 ± 0.037 | 2.086 ± 0.053 | 0.0 ± 0.0 |
| His | 2.362 ± 0.061 | 0.274 ± 0.019 | 1.357 ± 0.047 | 1.498 ± 0.048 | 1.151 ± 0.043 | 1.773 ± 0.055 | 0.819 ± 0.034 | 2.004 ± 0.051 | 1.119 ± 0.037 | 2.251 ± 0.062 | 0.464 ± 0.024 | 1.058 ± 0.037 | 1.055 ± 0.041 | 1.099 ± 0.034 | 1.017 ± 0.036 | 1.264 ± 0.044 | 1.451 ± 0.044 | 1.21 ± 0.039 | 0.338 ± 0.023 | 0.928 ± 0.034 | 0.0 ± 0.0 |
| Ile | 6.266 ± 0.1 | 0.63 ± 0.033 | 3.39 ± 0.071 | 3.456 ± 0.075 | 2.353 ± 0.067 | 4.399 ± 0.101 | 1.363 ± 0.039 | 3.412 ± 0.073 | 2.945 ± 0.07 | 5.733 ± 0.107 | 1.42 ± 0.051 | 2.694 ± 0.064 | 2.375 ± 0.059 | 2.818 ± 0.066 | 2.741 ± 0.066 | 3.558 ± 0.08 | 3.547 ± 0.144 | 4.255 ± 0.078 | 0.776 ± 0.037 | 1.761 ± 0.052 | 0.0 ± 0.0 |
| Lys | 4.585 ± 0.109 | 0.331 ± 0.021 | 2.531 ± 0.074 | 2.716 ± 0.065 | 1.742 ± 0.053 | 3.081 ± 0.091 | 1.367 ± 0.047 | 3.993 ± 0.09 | 3.263 ± 0.097 | 5.583 ± 0.098 | 1.9 ± 0.056 | 3.179 ± 0.08 | 2.538 ± 0.065 | 3.495 ± 0.093 | 2.667 ± 0.068 | 3.041 ± 0.086 | 3.7 ± 0.078 | 3.253 ± 0.108 | 0.799 ± 0.032 | 1.512 ± 0.046 | 0.0 ± 0.0 |
| Leu | 11.032 ± 0.173 | 1.01 ± 0.034 | 5.273 ± 0.096 | 4.551 ± 0.093 | 4.106 ± 0.099 | 6.642 ± 0.118 | 2.311 ± 0.057 | 5.836 ± 0.112 | 5.534 ± 0.075 | 9.968 ± 0.195 | 2.497 ± 0.07 | 5.383 ± 0.105 | 5.84 ± 0.104 | 4.503 ± 0.083 | 4.702 ± 0.104 | 6.509 ± 0.109 | 5.143 ± 0.086 | 5.731 ± 0.096 | 1.163 ± 0.056 | 2.447 ± 0.067 | 0.0 ± 0.0 |
| Met | 2.68 ± 0.071 | 0.236 ± 0.016 | 1.098 ± 0.04 | 1.115 ± 0.04 | 0.967 ± 0.035 | 1.86 ± 0.052 | 0.475 ± 0.027 | 1.468 ± 0.054 | 1.696 ± 0.051 | 2.56 ± 0.072 | 0.939 ± 0.042 | 1.38 ± 0.043 | 1.18 ± 0.04 | 1.112 ± 0.036 | 1.31 ± 0.042 | 1.551 ± 0.045 | 1.514 ± 0.047 | 1.675 ± 0.047 | 0.269 ± 0.019 | 0.58 ± 0.031 | 0.0 ± 0.0 |
| Asn | 4.489 ± 0.104 | 0.411 ± 0.024 | 2.35 ± 0.103 | 2.685 ± 0.062 | 1.507 ± 0.043 | 3.727 ± 0.105 | 1.406 ± 0.046 | 3.097 ± 0.096 | 2.598 ± 0.062 | 4.239 ± 0.081 | 1.143 ± 0.039 | 2.35 ± 0.081 | 2.773 ± 0.089 | 2.9 ± 0.071 | 2.145 ± 0.061 | 2.261 ± 0.072 | 2.819 ± 0.086 | 3.06 ± 0.081 | 0.668 ± 0.033 | 1.437 ± 0.041 | 0.0 ± 0.0 |
| Pro | 3.437 ± 0.088 | 0.304 ± 0.02 | 2.515 ± 0.061 | 4.023 ± 0.09 | 1.816 ± 0.049 | 0.732 ± 0.036 | 1.231 ± 0.048 | 2.798 ± 0.083 | 2.599 ± 0.066 | 3.468 ± 0.074 | 1.012 ± 0.04 | 2.525 ± 0.119 | 1.634 ± 0.058 | 2.033 ± 0.053 | 1.542 ± 0.049 | 2.232 ± 0.056 | 2.767 ± 0.168 | 2.912 ± 0.136 | 0.272 ± 0.021 | 1.276 ± 0.045 | 0.0 ± 0.0 |
| Gln | 6.049 ± 0.285 | 0.309 ± 0.021 | 2.281 ± 0.052 | 2.453 ± 0.068 | 2.152 ± 0.057 | 2.866 ± 0.073 | 1.446 ± 0.052 | 3.642 ± 0.08 | 2.797 ± 0.069 | 4.307 ± 0.088 | 1.317 ± 0.039 | 2.871 ± 0.065 | 2.275 ± 0.075 | 2.968 ± 0.095 | 2.432 ± 0.067 | 2.726 ± 0.072 | 3.184 ± 0.087 | 2.777 ± 0.064 | 0.62 ± 0.034 | 1.545 ± 0.041 | 0.0 ± 0.0 |
| Arg | 4.0 ± 0.095 | 0.401 ± 0.025 | 2.614 ± 0.059 | 3.448 ± 0.073 | 2.246 ± 0.067 | 2.86 ± 0.073 | 1.237 ± 0.045 | 2.647 ± 0.069 | 2.34 ± 0.062 | 4.859 ± 0.096 | 1.177 ± 0.038 | 1.835 ± 0.053 | 1.665 ± 0.054 | 2.647 ± 0.076 | 2.326 ± 0.068 | 2.101 ± 0.056 | 2.045 ± 0.057 | 3.447 ± 0.075 | 0.619 ± 0.03 | 1.655 ± 0.048 | 0.0 ± 0.0 |
| Ser | 5.142 ± 0.1 | 0.528 ± 0.029 | 2.911 ± 0.063 | 3.119 ± 0.065 | 2.089 ± 0.064 | 5.09 ± 0.115 | 1.464 ± 0.047 | 2.65 ± 0.066 | 2.635 ± 0.074 | 5.865 ± 0.08 | 1.217 ± 0.044 | 2.436 ± 0.074 | 2.443 ± 0.05 | 2.518 ± 0.067 | 2.573 ± 0.067 | 3.048 ± 0.078 | 2.562 ± 0.07 | 3.973 ± 0.1 | 0.646 ± 0.032 | 1.508 ± 0.052 | 0.0 ± 0.0 |
| Thr | 5.767 ± 0.178 | 0.43 ± 0.025 | 2.758 ± 0.117 | 3.208 ± 0.079 | 2.03 ± 0.059 | 3.955 ± 0.109 | 1.486 ± 0.041 | 3.331 ± 0.125 | 2.071 ± 0.057 | 5.975 ± 0.085 | 1.092 ± 0.041 | 1.939 ± 0.074 | 3.454 ± 0.197 | 2.861 ± 0.083 | 2.448 ± 0.063 | 2.58 ± 0.079 | 2.846 ± 0.129 | 4.293 ± 0.196 | 0.673 ± 0.031 | 1.426 ± 0.096 | 0.0 ± 0.0 |
| Val | 7.081 ± 0.128 | 0.815 ± 0.033 | 3.314 ± 0.104 | 3.643 ± 0.094 | 2.955 ± 0.071 | 4.947 ± 0.093 | 1.371 ± 0.041 | 3.68 ± 0.072 | 3.516 ± 0.081 | 7.496 ± 0.133 | 1.674 ± 0.048 | 2.926 ± 0.095 | 2.57 ± 0.078 | 3.171 ± 0.072 | 3.241 ± 0.069 | 4.463 ± 0.106 | 2.739 ± 0.22 | 4.916 ± 0.111 | 0.99 ± 0.039 | 2.167 ± 0.053 | 0.0 ± 0.0 |
| Trp | 1.307 ± 0.05 | 0.148 ± 0.014 | 0.658 ± 0.03 | 0.539 ± 0.026 | 0.738 ± 0.036 | 0.856 ± 0.039 | 0.503 ± 0.026 | 0.669 ± 0.033 | 0.456 ± 0.024 | 2.064 ± 0.077 | 0.185 ± 0.016 | 0.377 ± 0.025 | 0.06 ± 0.008 | 1.431 ± 0.062 | 0.899 ± 0.037 | 0.537 ± 0.027 | 0.653 ± 0.026 | 0.927 ± 0.04 | 0.214 ± 0.017 | 0.421 ± 0.029 | 0.0 ± 0.0 |
| Tyr | 2.826 ± 0.068 | 0.341 ± 0.02 | 1.683 ± 0.047 | 1.479 ± 0.044 | 1.571 ± 0.046 | 2.182 ± 0.059 | 0.807 ± 0.036 | 1.466 ± 0.044 | 1.129 ± 0.039 | 3.265 ± 0.077 | 0.567 ± 0.027 | 1.055 ± 0.042 | 1.459 ± 0.049 | 1.787 ± 0.058 | 1.831 ± 0.056 | 1.491 ± 0.046 | 1.787 ± 0.09 | 1.859 ± 0.048 | 0.494 ± 0.025 | 0.944 ± 0.044 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2612 proteins (795213 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
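One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resampled set, and report the spread of the replicates. The sketch below follows that reading. It is an assumption, not the documented Proteome-pI procedure: the function names, the fixed seed, the use of the sample standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from Alysiella crassa.
toy = ["MAAL", "MALK", "MKKA", "MAAAA"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within a protein stay together in every replicate.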

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski