Amino acid dipeptide frequency for Candidatus Planktophila limnetica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
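As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa); the source does not state the exact counting procedure, and the function name and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping adjacent pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard residue to X before counting
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from the proteome)
toy_proteins = ["MAALK", "AAGL"]
freqs = dipeptide_per_mille(toy_proteins)

# Expected frequency if all 400 standard dipeptides were equally likely
uniform_per_mille = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
```

On real data the input would be the full list of protein sequences of the proteome, and the resulting values could be compared against `uniform_per_mille`.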

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
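The unit conversion and the comparison with the uniform expectation can be sketched as follows. The four sample values are copied from the table; the helper names are hypothetical, and the simple threshold at 2.5‰ is an assumption about how over- and underrepresentation might be decided (the page may use a different rule for its colouring).

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# A few values taken from the table (per mille)
sample = {"AlaAla": 12.262, "CysCys": 0.057, "AspAsp": 2.545, "TrpTrp": 0.246}

for name, value in sample.items():
    print(name, per_mille_to_fraction(value), classify(value))
```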

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.262 ± 0.242
AlaCys: 0.824 ± 0.058
AlaAsp: 5.534 ± 0.1
AlaGlu: 5.376 ± 0.127
AlaPhe: 3.696 ± 0.115
AlaGly: 8.613 ± 0.153
AlaHis: 2.306 ± 0.093
AlaIle: 7.801 ± 0.157
AlaLys: 5.505 ± 0.143
AlaLeu: 11.839 ± 0.185
AlaMet: 2.975 ± 0.078
AlaAsn: 3.051 ± 0.084
AlaPro: 4.36 ± 0.114
AlaGln: 4.117 ± 0.106
AlaArg: 5.739 ± 0.106
AlaSer: 6.332 ± 0.133
AlaThr: 6.212 ± 0.132
AlaVal: 7.306 ± 0.133
AlaTrp: 1.092 ± 0.061
AlaTyr: 2.246 ± 0.077
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.851 ± 0.046
CysCys: 0.057 ± 0.012
CysAsp: 0.564 ± 0.037
CysGlu: 0.461 ± 0.03
CysPhe: 0.222 ± 0.024
CysGly: 0.848 ± 0.048
CysHis: 0.177 ± 0.022
CysIle: 0.389 ± 0.033
CysLys: 0.284 ± 0.028
CysLeu: 0.53 ± 0.035
CysMet: 0.112 ± 0.015
CysAsn: 0.237 ± 0.024
CysPro: 0.378 ± 0.031
CysGln: 0.203 ± 0.022
CysArg: 0.287 ± 0.024
CysSer: 0.497 ± 0.034
CysThr: 0.492 ± 0.037
CysVal: 0.583 ± 0.031
CysTrp: 0.067 ± 0.013
CysTyr: 0.134 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.992 ± 0.135
AspCys: 0.389 ± 0.034
AspAsp: 2.545 ± 0.076
AspGlu: 3.608 ± 0.126
AspPhe: 2.184 ± 0.072
AspGly: 4.595 ± 0.111
AspHis: 1.058 ± 0.051
AspIle: 3.273 ± 0.09
AspLys: 2.27 ± 0.073
AspLeu: 6.085 ± 0.132
AspMet: 0.886 ± 0.045
AspAsn: 1.302 ± 0.048
AspPro: 2.867 ± 0.092
AspGln: 1.749 ± 0.064
AspArg: 3.07 ± 0.092
AspSer: 3.448 ± 0.101
AspThr: 2.241 ± 0.077
AspVal: 4.788 ± 0.102
AspTrp: 0.664 ± 0.042
AspTyr: 1.276 ± 0.059
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.323 ± 0.13
GluCys: 0.409 ± 0.032
GluAsp: 2.604 ± 0.091
GluGlu: 3.555 ± 0.111
GluPhe: 2.246 ± 0.088
GluGly: 3.742 ± 0.102
GluHis: 1.133 ± 0.059
GluIle: 5.044 ± 0.124
GluLys: 3.62 ± 0.098
GluLeu: 6.403 ± 0.153
GluMet: 1.465 ± 0.061
GluAsn: 2.198 ± 0.067
GluPro: 1.983 ± 0.061
GluGln: 1.751 ± 0.066
GluArg: 3.505 ± 0.103
GluSer: 3.894 ± 0.101
GluThr: 2.762 ± 0.089
GluVal: 4.919 ± 0.124
GluTrp: 0.616 ± 0.036
GluTyr: 1.328 ± 0.056
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.212 ± 0.107
PheCys: 0.282 ± 0.027
PheAsp: 2.368 ± 0.073
PheGlu: 2.193 ± 0.071
PhePhe: 1.52 ± 0.062
PheGly: 3.503 ± 0.105
PheHis: 0.731 ± 0.039
PheIle: 2.456 ± 0.075
PheLys: 1.622 ± 0.069
PheLeu: 3.433 ± 0.099
PheMet: 0.772 ± 0.04
PheAsn: 1.316 ± 0.06
PhePro: 1.496 ± 0.063
PheGln: 0.87 ± 0.053
PheArg: 1.51 ± 0.064
PheSer: 2.49 ± 0.077
PheThr: 2.439 ± 0.089
PheVal: 2.724 ± 0.089
PheTrp: 0.425 ± 0.038
PheTyr: 0.851 ± 0.053
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.494 ± 0.156
GlyCys: 0.667 ± 0.041
GlyAsp: 4.064 ± 0.099
GlyGlu: 4.317 ± 0.093
GlyPhe: 3.412 ± 0.106
GlyGly: 6.635 ± 0.155
GlyHis: 1.634 ± 0.07
GlyIle: 5.648 ± 0.135
GlyLys: 4.363 ± 0.098
GlyLeu: 7.538 ± 0.146
GlyMet: 1.883 ± 0.072
GlyAsn: 2.33 ± 0.076
GlyPro: 2.97 ± 0.091
GlyGln: 2.346 ± 0.075
GlyArg: 4.193 ± 0.101
GlySer: 5.309 ± 0.112
GlyThr: 4.611 ± 0.096
GlyVal: 6.9 ± 0.135
GlyTrp: 1.152 ± 0.049
GlyTyr: 2.136 ± 0.075
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.818 ± 0.078
HisCys: 0.17 ± 0.018
HisAsp: 1.111 ± 0.053
HisGlu: 1.278 ± 0.057
HisPhe: 0.745 ± 0.048
HisGly: 1.763 ± 0.067
HisHis: 0.557 ± 0.042
HisIle: 1.18 ± 0.055
HisLys: 0.724 ± 0.041
HisLeu: 1.931 ± 0.064
HisMet: 0.394 ± 0.031
HisAsn: 0.573 ± 0.038
HisPro: 1.214 ± 0.057
HisGln: 0.616 ± 0.041
HisArg: 1.061 ± 0.061
HisSer: 1.223 ± 0.058
HisThr: 1.068 ± 0.05
HisVal: 1.441 ± 0.056
HisTrp: 0.263 ± 0.025
HisTyr: 0.425 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.824 ± 0.175
IleCys: 0.552 ± 0.033
IleAsp: 4.439 ± 0.098
IleGlu: 4.437 ± 0.111
IlePhe: 2.349 ± 0.088
IleGly: 5.531 ± 0.134
IleHis: 1.166 ± 0.049
IleIle: 3.799 ± 0.113
IleLys: 3.206 ± 0.083
IleLeu: 5.318 ± 0.119
IleMet: 1.018 ± 0.045
IleAsn: 2.437 ± 0.081
IlePro: 2.893 ± 0.085
IleGln: 1.634 ± 0.064
IleArg: 3.185 ± 0.077
IleSer: 5.507 ± 0.129
IleThr: 4.396 ± 0.112
IleVal: 4.826 ± 0.116
IleTrp: 0.698 ± 0.043
IleTyr: 1.534 ± 0.064
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.077 ± 0.116
LysCys: 0.308 ± 0.026
LysAsp: 2.776 ± 0.082
LysGlu: 3.137 ± 0.085
LysPhe: 1.62 ± 0.059
LysGly: 3.285 ± 0.102
LysHis: 0.877 ± 0.047
LysIle: 3.515 ± 0.086
LysLys: 3.636 ± 0.113
LysLeu: 3.899 ± 0.097
LysMet: 1.314 ± 0.055
LysAsn: 1.995 ± 0.073
LysPro: 2.117 ± 0.062
LysGln: 1.273 ± 0.056
LysArg: 2.819 ± 0.092
LysSer: 4.047 ± 0.112
LysThr: 2.886 ± 0.073
LysVal: 3.737 ± 0.093
LysTrp: 0.631 ± 0.039
LysTyr: 1.419 ± 0.056
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.787 ± 0.162
LeuCys: 0.664 ± 0.042
LeuAsp: 5.655 ± 0.133
LeuGlu: 5.557 ± 0.133
LeuPhe: 3.39 ± 0.105
LeuGly: 8.186 ± 0.142
LeuHis: 1.864 ± 0.073
LeuIle: 6.527 ± 0.15
LeuLys: 4.36 ± 0.108
LeuLeu: 9.571 ± 0.211
LeuMet: 2.021 ± 0.07
LeuAsn: 3.159 ± 0.078
LeuPro: 4.554 ± 0.097
LeuGln: 2.499 ± 0.084
LeuArg: 5.426 ± 0.126
LeuSer: 7.115 ± 0.123
LeuThr: 5.646 ± 0.11
LeuVal: 7.815 ± 0.17
LeuTrp: 1.003 ± 0.053
LeuTyr: 1.656 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.475 ± 0.081
MetCys: 0.155 ± 0.021
MetAsp: 1.111 ± 0.053
MetGlu: 0.944 ± 0.05
MetPhe: 0.581 ± 0.038
MetGly: 1.727 ± 0.061
MetHis: 0.44 ± 0.029
MetIle: 1.257 ± 0.05
MetLys: 1.293 ± 0.049
MetLeu: 1.766 ± 0.07
MetMet: 0.437 ± 0.033
MetAsn: 0.822 ± 0.045
MetPro: 1.046 ± 0.053
MetGln: 0.76 ± 0.043
MetArg: 1.527 ± 0.056
MetSer: 2.15 ± 0.067
MetThr: 1.405 ± 0.061
MetVal: 1.512 ± 0.064
MetTrp: 0.256 ± 0.022
MetTyr: 0.473 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.323 ± 0.085
AsnCys: 0.248 ± 0.024
AsnAsp: 1.62 ± 0.063
AsnGlu: 1.782 ± 0.068
AsnPhe: 1.278 ± 0.064
AsnGly: 2.506 ± 0.079
AsnHis: 0.545 ± 0.037
AsnIle: 1.861 ± 0.074
AsnLys: 1.627 ± 0.068
AsnLeu: 3.233 ± 0.101
AsnMet: 0.652 ± 0.043
AsnAsn: 1.118 ± 0.059
AsnPro: 2.119 ± 0.063
AsnGln: 1.001 ± 0.048
AsnArg: 1.751 ± 0.063
AsnSer: 2.184 ± 0.078
AsnThr: 1.864 ± 0.065
AsnVal: 2.33 ± 0.077
AsnTrp: 0.483 ± 0.036
AsnTyr: 0.836 ± 0.044
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.336 ± 0.11
ProCys: 0.275 ± 0.025
ProAsp: 2.42 ± 0.081
ProGlu: 3.209 ± 0.09
ProPhe: 1.474 ± 0.057
ProGly: 3.596 ± 0.093
ProHis: 0.949 ± 0.045
ProIle: 2.714 ± 0.077
ProLys: 2.095 ± 0.076
ProLeu: 3.887 ± 0.092
ProMet: 0.915 ± 0.042
ProAsn: 1.395 ± 0.056
ProPro: 1.278 ± 0.067
ProGln: 1.438 ± 0.053
ProArg: 2.071 ± 0.074
ProSer: 2.889 ± 0.079
ProThr: 2.982 ± 0.09
ProVal: 3.505 ± 0.092
ProTrp: 0.602 ± 0.038
ProTyr: 1.104 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.075 ± 0.084
GlnCys: 0.237 ± 0.023
GlnAsp: 1.319 ± 0.05
GlnGlu: 1.883 ± 0.07
GlnPhe: 0.975 ± 0.051
GlnGly: 2.169 ± 0.075
GlnHis: 0.499 ± 0.037
GlnIle: 2.358 ± 0.073
GlnLys: 1.51 ± 0.055
GlnLeu: 3.03 ± 0.098
GlnMet: 0.731 ± 0.043
GlnAsn: 0.951 ± 0.048
GlnPro: 1.133 ± 0.054
GlnGln: 0.834 ± 0.05
GlnArg: 1.787 ± 0.072
GlnSer: 2.26 ± 0.089
GlnThr: 1.481 ± 0.067
GlnVal: 2.351 ± 0.082
GlnTrp: 0.514 ± 0.038
GlnTyr: 0.786 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.295 ± 0.123
ArgCys: 0.394 ± 0.03
ArgAsp: 2.991 ± 0.091
ArgGlu: 3.586 ± 0.109
ArgPhe: 2.081 ± 0.068
ArgGly: 3.823 ± 0.087
ArgHis: 0.97 ± 0.052
ArgIle: 3.773 ± 0.096
ArgLys: 2.681 ± 0.081
ArgLeu: 4.781 ± 0.132
ArgMet: 1.321 ± 0.057
ArgAsn: 1.897 ± 0.07
ArgPro: 2.091 ± 0.081
ArgGln: 1.453 ± 0.054
ArgArg: 3.07 ± 0.118
ArgSer: 3.409 ± 0.085
ArgThr: 3.23 ± 0.097
ArgVal: 4.241 ± 0.1
ArgTrp: 0.676 ± 0.046
ArgTyr: 1.328 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.452 ± 0.157
SerCys: 0.394 ± 0.032
SerAsp: 3.808 ± 0.096
SerGlu: 4.074 ± 0.107
SerPhe: 2.669 ± 0.073
SerGly: 6.109 ± 0.12
SerHis: 1.41 ± 0.064
SerIle: 4.47 ± 0.105
SerLys: 3.436 ± 0.107
SerLeu: 6.58 ± 0.129
SerMet: 1.658 ± 0.059
SerAsn: 2.098 ± 0.08
SerPro: 2.798 ± 0.087
SerGln: 2.184 ± 0.076
SerArg: 3.596 ± 0.087
SerSer: 4.313 ± 0.117
SerThr: 4.002 ± 0.095
SerVal: 5.316 ± 0.105
SerTrp: 0.927 ± 0.05
SerTyr: 1.551 ± 0.066
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.51 ± 0.134
ThrCys: 0.435 ± 0.037
ThrAsp: 3.022 ± 0.096
ThrGlu: 2.903 ± 0.099
ThrPhe: 2.377 ± 0.076
ThrGly: 4.929 ± 0.101
ThrHis: 1.204 ± 0.055
ThrIle: 3.646 ± 0.093
ThrLys: 2.791 ± 0.09
ThrLeu: 6.062 ± 0.115
ThrMet: 1.164 ± 0.051
ThrAsn: 1.916 ± 0.062
ThrPro: 3.307 ± 0.096
ThrGln: 1.897 ± 0.063
ThrArg: 2.779 ± 0.076
ThrSer: 3.928 ± 0.101
ThrThr: 3.431 ± 0.09
ThrVal: 4.167 ± 0.112
ThrTrp: 0.729 ± 0.047
ThrTyr: 1.465 ± 0.065
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.594 ± 0.153
ValCys: 0.583 ± 0.037
ValAsp: 4.542 ± 0.119
ValGlu: 4.298 ± 0.11
ValPhe: 2.807 ± 0.086
ValGly: 6.148 ± 0.135
ValHis: 1.422 ± 0.057
ValIle: 5.973 ± 0.127
ValLys: 3.596 ± 0.083
ValLeu: 7.894 ± 0.134
ValMet: 1.727 ± 0.058
ValAsn: 2.497 ± 0.084
ValPro: 2.97 ± 0.084
ValGln: 1.971 ± 0.082
ValArg: 3.792 ± 0.103
ValSer: 5.33 ± 0.119
ValThr: 4.738 ± 0.108
ValVal: 6.52 ± 0.154
ValTrp: 0.805 ± 0.046
ValTyr: 1.431 ± 0.065
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.094 ± 0.049
TrpCys: 0.124 ± 0.017
TrpAsp: 0.588 ± 0.037
TrpGlu: 0.492 ± 0.031
TrpPhe: 0.526 ± 0.042
TrpGly: 0.822 ± 0.051
TrpHis: 0.313 ± 0.026
TrpIle: 0.812 ± 0.047
TrpLys: 0.657 ± 0.042
TrpLeu: 1.219 ± 0.055
TrpMet: 0.315 ± 0.03
TrpAsn: 0.514 ± 0.036
TrpPro: 0.533 ± 0.035
TrpGln: 0.504 ± 0.033
TrpArg: 0.7 ± 0.037
TrpSer: 0.886 ± 0.047
TrpThr: 0.614 ± 0.044
TrpVal: 0.884 ± 0.046
TrpTrp: 0.246 ± 0.025
TrpTyr: 0.325 ± 0.033
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.389 ± 0.086
TyrCys: 0.205 ± 0.022
TyrAsp: 1.209 ± 0.061
TyrGlu: 1.41 ± 0.054
TyrPhe: 1.061 ± 0.06
TyrGly: 2.071 ± 0.076
TyrHis: 0.303 ± 0.025
TyrIle: 1.252 ± 0.056
TyrLys: 1.094 ± 0.048
TyrLeu: 2.459 ± 0.072
TyrMet: 0.373 ± 0.031
TyrAsn: 0.557 ± 0.037
TyrPro: 1.03 ± 0.049
TyrGln: 0.781 ± 0.045
TyrArg: 1.223 ± 0.052
TyrSer: 1.632 ± 0.068
TyrThr: 1.159 ± 0.056
TyrVal: 1.775 ± 0.063
TyrTrp: 0.32 ± 0.026
TyrTyr: 0.48 ± 0.039
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1326 proteins (418542 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
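A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The source does not specify which spread measure is used, and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        replicate = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(replicate, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical one-letter sequences, not taken from the proteome)
toy_proteins = ["MAALK", "AAGL", "MKVLAA", "GGSAAL"]
point_estimate = dipeptide_freq(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA", n_rounds=100)
```

Resampling at the protein level rather than at the residue level keeps each protein's sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.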

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski