Amino acid dipeptide frequency for Prevotella maculosa OT 289

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
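As a rough illustration of how such a frequency could be computed, the sketch below counts dipeptides in a list of protein sequences. It is not the code used to build this page: the function name `dipeptide_permille`, the one-letter alphabet, the choice to count overlapping pairs only within each protein, and the mapping of every non-standard letter to "X" (Xaa) are all assumptions made for the example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: overlapping pairs are counted within each protein, never
    across protein boundaries, and any non-standard letter becomes "X".
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

toy = ["MALLK", "MLLA"]  # made-up sequences, for illustration only
freqs = dipeptide_permille(toy)
```

With the two toy sequences there are seven dipeptides in total, two of which are "LL", so `freqs["LL"]` is two sevenths expressed in per mille.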

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
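The expected value and the red/blue marking can be sketched as follows. This assumes the marking is a plain comparison against the uniform expectation (1/400, or 1/441 when Xaa is included); the page does not state the exact rule, and the function names are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"

# Values copied from the table on this page (per mille).
for name, value in [("LeuLeu", 8.88), ("AlaAla", 6.045), ("TrpTrp", 0.246)]:
    print(name, classify(value))
```

Under this assumption LeuLeu and AlaAla lie above the 2.5‰ expectation, while TrpTrp lies well below it.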

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
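For readers who prefer fractions or percentages, the conversion is a single division. The helper names below are hypothetical and the example value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def permille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0

leu_leu = 8.88  # per mille, copied from the table
fraction = permille_to_fraction(leu_leu)
percent = permille_to_percent(leu_leu)
```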

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.045 ± 0.11
AlaCys: 1.17 ± 0.035
AlaAsp: 4.653 ± 0.082
AlaGlu: 4.684 ± 0.086
AlaPhe: 3.565 ± 0.063
AlaGly: 5.253 ± 0.09
AlaHis: 1.579 ± 0.037
AlaIle: 4.711 ± 0.078
AlaLys: 4.542 ± 0.083
AlaLeu: 7.211 ± 0.101
AlaMet: 2.158 ± 0.05
AlaAsn: 3.368 ± 0.064
AlaPro: 2.385 ± 0.045
AlaGln: 2.906 ± 0.058
AlaArg: 3.649 ± 0.069
AlaSer: 4.315 ± 0.077
AlaThr: 4.271 ± 0.082
AlaVal: 4.984 ± 0.082
AlaTrp: 0.97 ± 0.039
AlaTyr: 3.217 ± 0.066
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.902 ± 0.031
CysCys: 0.234 ± 0.018
CysAsp: 0.705 ± 0.03
CysGlu: 0.636 ± 0.025
CysPhe: 0.628 ± 0.025
CysGly: 1.084 ± 0.041
CysHis: 0.354 ± 0.018
CysIle: 0.858 ± 0.031
CysLys: 0.751 ± 0.031
CysLeu: 1.176 ± 0.038
CysMet: 0.367 ± 0.019
CysAsn: 0.665 ± 0.026
CysPro: 0.552 ± 0.027
CysGln: 0.34 ± 0.02
CysArg: 0.737 ± 0.03
CysSer: 0.831 ± 0.03
CysThr: 0.673 ± 0.029
CysVal: 0.781 ± 0.033
CysTrp: 0.154 ± 0.013
CysTyr: 0.599 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.374 ± 0.066
AspCys: 0.761 ± 0.03
AspAsp: 3.1 ± 0.068
AspGlu: 3.803 ± 0.072
AspPhe: 3.112 ± 0.056
AspGly: 4.291 ± 0.077
AspHis: 1.117 ± 0.032
AspIle: 4.017 ± 0.073
AspLys: 3.91 ± 0.072
AspLeu: 4.495 ± 0.086
AspMet: 1.619 ± 0.04
AspAsn: 2.902 ± 0.064
AspPro: 1.838 ± 0.051
AspGln: 1.21 ± 0.035
AspArg: 2.911 ± 0.057
AspSer: 2.897 ± 0.067
AspThr: 2.745 ± 0.058
AspVal: 3.608 ± 0.072
AspTrp: 0.938 ± 0.034
AspTyr: 3.134 ± 0.06
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.101 ± 0.087
GluCys: 0.626 ± 0.029
GluAsp: 3.057 ± 0.067
GluGlu: 4.195 ± 0.083
GluPhe: 1.969 ± 0.049
GluGly: 3.824 ± 0.067
GluHis: 1.338 ± 0.035
GluIle: 3.797 ± 0.069
GluLys: 4.465 ± 0.088
GluLeu: 5.413 ± 0.086
GluMet: 1.855 ± 0.043
GluAsn: 2.957 ± 0.066
GluPro: 1.779 ± 0.047
GluGln: 2.655 ± 0.062
GluArg: 3.491 ± 0.061
GluSer: 2.646 ± 0.051
GluThr: 3.352 ± 0.061
GluVal: 3.809 ± 0.074
GluTrp: 0.72 ± 0.027
GluTyr: 2.302 ± 0.053
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.299 ± 0.064
PheCys: 0.77 ± 0.026
PheAsp: 2.906 ± 0.065
PheGlu: 2.359 ± 0.062
PhePhe: 2.205 ± 0.051
PheGly: 3.373 ± 0.064
PheHis: 1.018 ± 0.037
PheIle: 2.934 ± 0.062
PheLys: 2.592 ± 0.056
PheLeu: 3.712 ± 0.082
PheMet: 1.31 ± 0.039
PheAsn: 2.352 ± 0.054
PhePro: 1.666 ± 0.04
PheGln: 1.214 ± 0.037
PheArg: 2.205 ± 0.051
PheSer: 3.271 ± 0.065
PheThr: 2.753 ± 0.056
PheVal: 2.899 ± 0.055
PheTrp: 0.55 ± 0.027
PheTyr: 1.835 ± 0.043
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.492 ± 0.09
GlyCys: 0.914 ± 0.029
GlyAsp: 3.433 ± 0.065
GlyGlu: 3.831 ± 0.067
GlyPhe: 3.328 ± 0.055
GlyGly: 4.775 ± 0.092
GlyHis: 1.548 ± 0.038
GlyIle: 5.004 ± 0.077
GlyLys: 5.368 ± 0.081
GlyLeu: 6.17 ± 0.096
GlyMet: 2.172 ± 0.054
GlyAsn: 3.628 ± 0.068
GlyPro: 1.47 ± 0.038
GlyGln: 2.263 ± 0.054
GlyArg: 3.659 ± 0.074
GlySer: 4.092 ± 0.08
GlyThr: 4.28 ± 0.08
GlyVal: 4.584 ± 0.072
GlyTrp: 1.037 ± 0.042
GlyTyr: 3.495 ± 0.066
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.511 ± 0.047
HisCys: 0.347 ± 0.022
HisAsp: 1.281 ± 0.044
HisGlu: 1.211 ± 0.039
HisPhe: 1.164 ± 0.034
HisGly: 1.526 ± 0.049
HisHis: 0.638 ± 0.026
HisIle: 1.622 ± 0.043
HisLys: 1.13 ± 0.04
HisLeu: 1.938 ± 0.052
HisMet: 0.404 ± 0.02
HisAsn: 1.016 ± 0.032
HisPro: 1.008 ± 0.033
HisGln: 0.788 ± 0.028
HisArg: 1.282 ± 0.038
HisSer: 1.191 ± 0.037
HisThr: 1.252 ± 0.039
HisVal: 1.434 ± 0.043
HisTrp: 0.303 ± 0.018
HisTyr: 1.092 ± 0.041
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.316 ± 0.081
IleCys: 0.921 ± 0.034
IleAsp: 4.489 ± 0.067
IleGlu: 4.006 ± 0.081
IlePhe: 2.39 ± 0.058
IleGly: 4.595 ± 0.087
IleHis: 1.248 ± 0.039
IleIle: 4.111 ± 0.084
IleLys: 3.878 ± 0.072
IleLeu: 4.926 ± 0.09
IleMet: 1.44 ± 0.041
IleAsn: 3.24 ± 0.066
IlePro: 2.783 ± 0.055
IleGln: 1.839 ± 0.05
IleArg: 3.194 ± 0.06
IleSer: 3.893 ± 0.065
IleThr: 3.718 ± 0.067
IleVal: 4.38 ± 0.067
IleTrp: 0.608 ± 0.025
IleTyr: 2.422 ± 0.05
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.54 ± 0.087
LysCys: 0.567 ± 0.026
LysAsp: 3.869 ± 0.074
LysGlu: 4.711 ± 0.077
LysPhe: 2.098 ± 0.05
LysGly: 4.596 ± 0.069
LysHis: 1.459 ± 0.04
LysIle: 3.659 ± 0.069
LysLys: 4.534 ± 0.078
LysLeu: 5.158 ± 0.086
LysMet: 2.046 ± 0.051
LysAsn: 3.238 ± 0.067
LysPro: 2.362 ± 0.06
LysGln: 2.785 ± 0.059
LysArg: 3.548 ± 0.062
LysSer: 3.234 ± 0.061
LysThr: 3.918 ± 0.072
LysVal: 4.289 ± 0.077
LysTrp: 0.833 ± 0.032
LysTyr: 2.586 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.462 ± 0.091
LeuCys: 1.405 ± 0.039
LeuAsp: 4.663 ± 0.07
LeuGlu: 4.39 ± 0.079
LeuPhe: 4.394 ± 0.076
LeuGly: 5.751 ± 0.083
LeuHis: 2.153 ± 0.054
LeuIle: 4.894 ± 0.072
LeuLys: 6.092 ± 0.084
LeuLeu: 8.88 ± 0.148
LeuMet: 2.641 ± 0.059
LeuAsn: 4.427 ± 0.077
LeuPro: 3.98 ± 0.064
LeuGln: 3.597 ± 0.064
LeuArg: 4.886 ± 0.085
LeuSer: 6.485 ± 0.098
LeuThr: 5.434 ± 0.079
LeuVal: 5.13 ± 0.086
LeuTrp: 1.139 ± 0.035
LeuTyr: 3.576 ± 0.069
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.442 ± 0.049
MetCys: 0.269 ± 0.017
MetAsp: 1.371 ± 0.038
MetGlu: 1.718 ± 0.046
MetPhe: 1.052 ± 0.036
MetGly: 1.966 ± 0.048
MetHis: 0.568 ± 0.028
MetIle: 1.431 ± 0.042
MetLys: 2.479 ± 0.053
MetLeu: 2.709 ± 0.059
MetMet: 0.936 ± 0.034
MetAsn: 1.559 ± 0.04
MetPro: 1.273 ± 0.04
MetGln: 1.21 ± 0.033
MetArg: 1.483 ± 0.035
MetSer: 1.526 ± 0.039
MetThr: 1.737 ± 0.047
MetVal: 1.642 ± 0.047
MetTrp: 0.251 ± 0.016
MetTyr: 0.757 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.706 ± 0.076
AsnCys: 0.528 ± 0.024
AsnAsp: 2.93 ± 0.057
AsnGlu: 2.899 ± 0.058
AsnPhe: 2.083 ± 0.049
AsnGly: 4.024 ± 0.087
AsnHis: 1.105 ± 0.032
AsnIle: 3.515 ± 0.071
AsnLys: 3.107 ± 0.071
AsnLeu: 4.044 ± 0.071
AsnMet: 1.348 ± 0.035
AsnAsn: 2.636 ± 0.059
AsnPro: 2.374 ± 0.043
AsnGln: 1.475 ± 0.039
AsnArg: 2.651 ± 0.059
AsnSer: 2.606 ± 0.072
AsnThr: 2.706 ± 0.053
AsnVal: 3.208 ± 0.059
AsnTrp: 0.645 ± 0.027
AsnTyr: 2.189 ± 0.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.721 ± 0.057
ProCys: 0.389 ± 0.019
ProAsp: 2.438 ± 0.051
ProGlu: 2.827 ± 0.055
ProPhe: 1.903 ± 0.053
ProGly: 2.358 ± 0.056
ProHis: 0.857 ± 0.031
ProIle: 2.27 ± 0.046
ProLys: 2.287 ± 0.052
ProLeu: 3.315 ± 0.067
ProMet: 1.067 ± 0.033
ProAsn: 1.745 ± 0.045
ProPro: 0.796 ± 0.033
ProGln: 1.448 ± 0.037
ProArg: 1.568 ± 0.046
ProSer: 2.247 ± 0.047
ProThr: 2.399 ± 0.054
ProVal: 2.73 ± 0.059
ProTrp: 0.488 ± 0.025
ProTyr: 1.666 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.763 ± 0.06
GlnCys: 0.35 ± 0.021
GlnAsp: 1.521 ± 0.043
GlnGlu: 2.103 ± 0.048
GlnPhe: 1.292 ± 0.037
GlnGly: 2.318 ± 0.055
GlnHis: 0.863 ± 0.029
GlnIle: 2.196 ± 0.046
GlnLys: 2.343 ± 0.06
GlnLeu: 3.601 ± 0.075
GlnMet: 1.056 ± 0.033
GlnAsn: 1.687 ± 0.049
GlnPro: 1.452 ± 0.041
GlnGln: 1.821 ± 0.055
GlnArg: 1.985 ± 0.05
GlnSer: 1.974 ± 0.049
GlnThr: 2.133 ± 0.046
GlnVal: 2.048 ± 0.041
GlnTrp: 0.509 ± 0.025
GlnTyr: 1.458 ± 0.048
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.139 ± 0.063
ArgCys: 0.597 ± 0.025
ArgAsp: 2.471 ± 0.055
ArgGlu: 3.133 ± 0.061
ArgPhe: 2.638 ± 0.058
ArgGly: 2.879 ± 0.052
ArgHis: 1.321 ± 0.04
ArgIle: 3.512 ± 0.058
ArgLys: 3.538 ± 0.061
ArgLeu: 5.495 ± 0.094
ArgMet: 1.718 ± 0.044
ArgAsn: 2.732 ± 0.057
ArgPro: 2.058 ± 0.059
ArgGln: 2.243 ± 0.051
ArgArg: 2.892 ± 0.067
ArgSer: 2.645 ± 0.056
ArgThr: 2.631 ± 0.053
ArgVal: 2.847 ± 0.057
ArgTrp: 0.757 ± 0.03
ArgTyr: 2.569 ± 0.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.156 ± 0.08
SerCys: 0.792 ± 0.03
SerAsp: 3.246 ± 0.065
SerGlu: 3.112 ± 0.054
SerPhe: 3.129 ± 0.066
SerGly: 4.121 ± 0.077
SerHis: 1.344 ± 0.037
SerIle: 3.644 ± 0.056
SerLys: 3.291 ± 0.059
SerLeu: 5.712 ± 0.09
SerMet: 1.537 ± 0.044
SerAsn: 2.714 ± 0.054
SerPro: 2.362 ± 0.047
SerGln: 1.802 ± 0.048
SerArg: 2.913 ± 0.052
SerSer: 3.451 ± 0.081
SerThr: 3.049 ± 0.056
SerVal: 4.144 ± 0.069
SerTrp: 0.756 ± 0.029
SerTyr: 2.774 ± 0.063
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.631 ± 0.085
ThrCys: 0.619 ± 0.025
ThrAsp: 3.602 ± 0.065
ThrGlu: 3.146 ± 0.058
ThrPhe: 2.774 ± 0.058
ThrGly: 4.28 ± 0.073
ThrHis: 1.123 ± 0.033
ThrIle: 3.714 ± 0.062
ThrLys: 3.084 ± 0.061
ThrLeu: 5.653 ± 0.085
ThrMet: 1.436 ± 0.04
ThrAsn: 2.465 ± 0.059
ThrPro: 2.692 ± 0.053
ThrGln: 1.745 ± 0.047
ThrArg: 2.42 ± 0.048
ThrSer: 3.163 ± 0.064
ThrThr: 3.53 ± 0.08
ThrVal: 4.122 ± 0.077
ThrTrp: 0.738 ± 0.027
ThrTyr: 2.464 ± 0.056
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.82 ± 0.082
ValCys: 1.02 ± 0.036
ValAsp: 3.653 ± 0.064
ValGlu: 3.667 ± 0.072
ValPhe: 2.867 ± 0.058
ValGly: 4.442 ± 0.08
ValHis: 1.142 ± 0.034
ValIle: 4.142 ± 0.086
ValLys: 4.262 ± 0.073
ValLeu: 5.653 ± 0.093
ValMet: 1.821 ± 0.046
ValAsn: 3.274 ± 0.063
ValPro: 2.567 ± 0.054
ValGln: 1.858 ± 0.042
ValArg: 3.22 ± 0.064
ValSer: 4.451 ± 0.076
ValThr: 3.621 ± 0.075
ValVal: 4.697 ± 0.078
ValTrp: 0.785 ± 0.033
ValTyr: 2.693 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.903 ± 0.038
TrpCys: 0.175 ± 0.012
TrpAsp: 0.694 ± 0.028
TrpGlu: 0.699 ± 0.028
TrpPhe: 0.568 ± 0.027
TrpGly: 0.931 ± 0.034
TrpHis: 0.301 ± 0.018
TrpIle: 0.747 ± 0.031
TrpLys: 0.894 ± 0.035
TrpLeu: 1.374 ± 0.042
TrpMet: 0.347 ± 0.02
TrpAsn: 0.825 ± 0.036
TrpPro: 0.351 ± 0.022
TrpGln: 0.642 ± 0.027
TrpArg: 0.661 ± 0.031
TrpSer: 0.682 ± 0.031
TrpThr: 0.712 ± 0.032
TrpVal: 0.711 ± 0.027
TrpTrp: 0.246 ± 0.02
TrpTyr: 0.537 ± 0.029
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.311 ± 0.064
TyrCys: 0.574 ± 0.027
TyrAsp: 2.699 ± 0.06
TyrGlu: 2.226 ± 0.053
TyrPhe: 2.062 ± 0.048
TyrGly: 3.156 ± 0.065
TyrHis: 1.062 ± 0.035
TyrIle: 2.619 ± 0.047
TyrLys: 2.48 ± 0.053
TyrLeu: 3.741 ± 0.069
TyrMet: 1.101 ± 0.036
TyrAsn: 2.351 ± 0.065
TyrPro: 1.775 ± 0.042
TyrGln: 1.633 ± 0.039
TyrArg: 2.457 ± 0.054
TyrSer: 2.447 ± 0.053
TyrThr: 2.485 ± 0.059
TyrVal: 2.608 ± 0.051
TyrTrp: 0.577 ± 0.029
TyrTyr: 2.073 ± 0.059
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2568 proteins (911703 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
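One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an assumption about the procedure, not the database's actual code; in particular, using the standard deviation as the error, the fixed seed, and the function names are choices made for the example.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille), overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

toy = ["MALLK", "MLLA", "MKKLLA", "MAAK"]  # made-up sequences
error = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.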

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski