Amino acid dipeptide frequency for Parvimonas micra

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
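The comparison against the uniform expectation can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names, the toy sequences, and the counting convention (overlapping windows that never span two proteins, with no special handling of non-standard residues) are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a window never spans two different proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Hypothetical toy input, only to show the call pattern.
toy_proteins = ["MKKLLE", "MKEI"]
freqs = dipeptide_per_mille(toy_proteins)
for dipeptide, value in sorted(freqs.items()):
    print(dipeptide, round(value, 3), classify(value))
```

With this convention, a table value such as 10.242‰ would be labelled "over" and 0.089‰ "under"; whether the page uses exactly this threshold for its colouring is not stated beyond the 0.25% expectation given above.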

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
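As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input 10.242 is the IleLeu value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_per_mille / 10.0


ile_leu = 10.242  # IleLeu frequency in per mille, taken from the table
print(per_mille_to_fraction(ile_leu))  # fraction of all dipeptides
print(per_mille_to_percent(ile_leu))   # the same value in percent
```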

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency in ‰ ± the estimated error.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 2.132 ± 0.097 | 0.687 ± 0.046 | 2.601 ± 0.084 | 3.028 ± 0.09 | 2.592 ± 0.082 | 3.428 ± 0.097 | 0.627 ± 0.04 | 5.244 ± 0.119 | 4.98 ± 0.129 | 4.982 ± 0.119 | 1.496 ± 0.064 | 2.61 ± 0.081 | 1.136 ± 0.053 | 1.076 ± 0.053 | 1.796 ± 0.071 | 2.914 ± 0.082 | 2.617 ± 0.084 | 3.452 ± 0.089 | 0.28 ± 0.03 | 1.836 ± 0.062 | 0.0 ± 0.0 |
| Cys | 0.614 ± 0.036 | 0.173 ± 0.021 | 0.642 ± 0.035 | 0.72 ± 0.041 | 0.496 ± 0.037 | 0.934 ± 0.061 | 0.209 ± 0.021 | 1.025 ± 0.048 | 0.958 ± 0.057 | 0.851 ± 0.044 | 0.258 ± 0.022 | 0.587 ± 0.036 | 0.385 ± 0.029 | 0.14 ± 0.016 | 0.338 ± 0.025 | 0.74 ± 0.039 | 0.469 ± 0.036 | 0.642 ± 0.041 | 0.056 ± 0.011 | 0.393 ± 0.026 | 0.0 ± 0.0 |
| Asp | 2.428 ± 0.083 | 0.656 ± 0.036 | 3.048 ± 0.092 | 5.742 ± 0.13 | 3.501 ± 0.083 | 3.399 ± 0.101 | 0.498 ± 0.033 | 6.26 ± 0.131 | 5.773 ± 0.118 | 5.102 ± 0.108 | 1.443 ± 0.054 | 3.197 ± 0.103 | 1.069 ± 0.05 | 0.642 ± 0.041 | 1.894 ± 0.062 | 3.13 ± 0.096 | 2.205 ± 0.069 | 3.788 ± 0.1 | 0.36 ± 0.032 | 2.697 ± 0.083 | 0.0 ± 0.0 |
| Glu | 3.403 ± 0.105 | 0.542 ± 0.039 | 4.511 ± 0.118 | 6.594 ± 0.156 | 3.73 ± 0.099 | 3.33 ± 0.094 | 0.914 ± 0.05 | 8.152 ± 0.157 | 9.797 ± 0.15 | 7.065 ± 0.133 | 1.99 ± 0.066 | 6.814 ± 0.146 | 1.225 ± 0.053 | 1.625 ± 0.059 | 2.601 ± 0.094 | 3.664 ± 0.107 | 3.219 ± 0.101 | 4.88 ± 0.12 | 0.405 ± 0.03 | 3.379 ± 0.097 | 0.0 ± 0.0 |
| Phe | 2.552 ± 0.082 | 0.671 ± 0.039 | 3.355 ± 0.087 | 4.015 ± 0.109 | 2.805 ± 0.111 | 3.368 ± 0.084 | 0.545 ± 0.035 | 4.964 ± 0.136 | 4.448 ± 0.109 | 5.306 ± 0.146 | 1.294 ± 0.066 | 2.954 ± 0.089 | 1.336 ± 0.059 | 0.949 ± 0.044 | 1.585 ± 0.056 | 4.277 ± 0.117 | 2.554 ± 0.097 | 3.695 ± 0.088 | 0.342 ± 0.029 | 2.21 ± 0.081 | 0.0 ± 0.0 |
| Gly | 3.497 ± 0.111 | 0.689 ± 0.045 | 2.857 ± 0.088 | 4.133 ± 0.096 | 3.241 ± 0.096 | 3.859 ± 0.128 | 0.934 ± 0.054 | 6.369 ± 0.125 | 6.231 ± 0.154 | 5.237 ± 0.13 | 1.532 ± 0.073 | 3.226 ± 0.086 | 0.949 ± 0.048 | 1.314 ± 0.058 | 2.085 ± 0.087 | 3.577 ± 0.103 | 3.181 ± 0.1 | 4.428 ± 0.104 | 0.382 ± 0.03 | 2.739 ± 0.084 | 0.0 ± 0.0 |
| His | 0.536 ± 0.034 | 0.162 ± 0.021 | 0.556 ± 0.038 | 0.734 ± 0.044 | 0.66 ± 0.037 | 0.794 ± 0.05 | 0.233 ± 0.022 | 1.265 ± 0.055 | 0.936 ± 0.046 | 1.118 ± 0.056 | 0.238 ± 0.027 | 0.88 ± 0.047 | 0.511 ± 0.039 | 0.291 ± 0.024 | 0.558 ± 0.038 | 0.84 ± 0.042 | 0.609 ± 0.036 | 0.625 ± 0.04 | 0.087 ± 0.014 | 0.549 ± 0.032 | 0.0 ± 0.0 |
| Ile | 5.144 ± 0.13 | 1.24 ± 0.052 | 5.782 ± 0.133 | 7.336 ± 0.126 | 5.68 ± 0.15 | 5.735 ± 0.127 | 1.118 ± 0.051 | 8.788 ± 0.171 | 8.719 ± 0.158 | 10.242 ± 0.174 | 2.243 ± 0.07 | 5.371 ± 0.127 | 3.159 ± 0.082 | 1.878 ± 0.057 | 3.09 ± 0.104 | 7.714 ± 0.162 | 4.435 ± 0.108 | 6.5 ± 0.141 | 0.453 ± 0.036 | 3.881 ± 0.107 | 0.0 ± 0.0 |
| Lys | 4.524 ± 0.124 | 0.676 ± 0.045 | 6.336 ± 0.14 | 9.339 ± 0.187 | 4.495 ± 0.108 | 4.782 ± 0.114 | 1.083 ± 0.047 | 9.679 ± 0.159 | 9.861 ± 0.185 | 8.332 ± 0.148 | 2.67 ± 0.077 | 8.296 ± 0.165 | 1.972 ± 0.087 | 2.087 ± 0.077 | 3.217 ± 0.092 | 5.593 ± 0.109 | 4.575 ± 0.106 | 6.247 ± 0.158 | 0.609 ± 0.042 | 4.513 ± 0.126 | 0.0 ± 0.0 |
| Leu | 4.775 ± 0.107 | 1.049 ± 0.057 | 5.415 ± 0.104 | 7.254 ± 0.159 | 4.702 ± 0.128 | 5.624 ± 0.117 | 1.04 ± 0.044 | 8.468 ± 0.166 | 9.677 ± 0.191 | 7.958 ± 0.174 | 2.212 ± 0.063 | 5.9 ± 0.136 | 2.67 ± 0.097 | 1.778 ± 0.068 | 2.974 ± 0.091 | 6.72 ± 0.121 | 4.468 ± 0.105 | 5.669 ± 0.124 | 0.522 ± 0.036 | 3.352 ± 0.096 | 0.0 ± 0.0 |
| Met | 1.438 ± 0.068 | 0.267 ± 0.025 | 1.529 ± 0.067 | 1.892 ± 0.061 | 1.167 ± 0.057 | 1.623 ± 0.074 | 0.238 ± 0.022 | 2.294 ± 0.077 | 2.703 ± 0.087 | 2.172 ± 0.069 | 0.607 ± 0.037 | 1.623 ± 0.064 | 0.702 ± 0.04 | 0.551 ± 0.033 | 0.863 ± 0.045 | 1.507 ± 0.064 | 1.025 ± 0.05 | 1.507 ± 0.063 | 0.144 ± 0.019 | 0.874 ± 0.043 | 0.0 ± 0.0 |
| Asn | 2.759 ± 0.075 | 0.638 ± 0.036 | 2.879 ± 0.076 | 4.542 ± 0.097 | 3.65 ± 0.102 | 3.901 ± 0.132 | 0.725 ± 0.04 | 7.043 ± 0.135 | 6.022 ± 0.148 | 6.376 ± 0.149 | 1.445 ± 0.062 | 3.926 ± 0.112 | 2.087 ± 0.072 | 1.207 ± 0.047 | 2.201 ± 0.078 | 4.393 ± 0.117 | 2.834 ± 0.08 | 3.935 ± 0.085 | 0.445 ± 0.036 | 2.625 ± 0.089 | 0.0 ± 0.0 |
| Pro | 1.298 ± 0.063 | 0.269 ± 0.029 | 1.309 ± 0.057 | 2.047 ± 0.078 | 1.421 ± 0.063 | 1.285 ± 0.064 | 0.402 ± 0.03 | 2.479 ± 0.075 | 2.254 ± 0.077 | 2.083 ± 0.075 | 0.627 ± 0.036 | 1.496 ± 0.06 | 0.491 ± 0.031 | 0.574 ± 0.037 | 0.707 ± 0.04 | 1.549 ± 0.056 | 1.534 ± 0.069 | 1.785 ± 0.077 | 0.189 ± 0.021 | 1.1 ± 0.051 | 0.0 ± 0.0 |
| Gln | 1.1 ± 0.051 | 0.158 ± 0.019 | 0.978 ± 0.038 | 1.472 ± 0.062 | 0.847 ± 0.04 | 1.243 ± 0.053 | 0.225 ± 0.021 | 1.89 ± 0.063 | 2.27 ± 0.078 | 1.681 ± 0.058 | 0.545 ± 0.036 | 1.461 ± 0.057 | 0.385 ± 0.028 | 0.469 ± 0.035 | 0.956 ± 0.049 | 1.174 ± 0.044 | 0.858 ± 0.048 | 1.18 ± 0.052 | 0.133 ± 0.019 | 0.722 ± 0.041 | 0.0 ± 0.0 |
| Arg | 1.776 ± 0.073 | 0.338 ± 0.034 | 1.927 ± 0.073 | 2.974 ± 0.1 | 1.601 ± 0.067 | 1.91 ± 0.067 | 0.462 ± 0.036 | 3.128 ± 0.084 | 3.501 ± 0.096 | 2.979 ± 0.09 | 0.856 ± 0.043 | 2.174 ± 0.078 | 0.789 ± 0.049 | 0.807 ± 0.042 | 1.345 ± 0.058 | 1.503 ± 0.059 | 1.481 ± 0.065 | 2.234 ± 0.069 | 0.187 ± 0.019 | 1.467 ± 0.058 | 0.0 ± 0.0 |
| Ser | 3.172 ± 0.087 | 0.64 ± 0.04 | 3.49 ± 0.102 | 4.346 ± 0.111 | 3.739 ± 0.11 | 4.639 ± 0.115 | 0.84 ± 0.046 | 6.571 ± 0.119 | 6.389 ± 0.132 | 5.767 ± 0.127 | 1.636 ± 0.062 | 3.75 ± 0.098 | 1.443 ± 0.054 | 1.352 ± 0.06 | 2.021 ± 0.064 | 4.066 ± 0.117 | 2.932 ± 0.083 | 3.857 ± 0.088 | 0.382 ± 0.031 | 2.734 ± 0.083 | 0.0 ± 0.0 |
| Thr | 2.625 ± 0.092 | 0.462 ± 0.033 | 2.614 ± 0.071 | 3.177 ± 0.081 | 2.632 ± 0.088 | 3.357 ± 0.095 | 0.716 ± 0.04 | 4.462 ± 0.104 | 4.039 ± 0.103 | 4.404 ± 0.103 | 1.076 ± 0.053 | 2.521 ± 0.084 | 1.554 ± 0.055 | 0.854 ± 0.046 | 1.421 ± 0.058 | 2.797 ± 0.087 | 2.592 ± 0.104 | 3.552 ± 0.099 | 0.305 ± 0.034 | 1.663 ± 0.062 | 0.0 ± 0.0 |
| Val | 3.577 ± 0.101 | 0.796 ± 0.044 | 3.986 ± 0.095 | 5.129 ± 0.134 | 3.39 ± 0.099 | 4.288 ± 0.108 | 0.76 ± 0.039 | 5.907 ± 0.122 | 6.067 ± 0.148 | 6.298 ± 0.122 | 1.412 ± 0.058 | 3.806 ± 0.085 | 1.801 ± 0.068 | 1.178 ± 0.052 | 2.021 ± 0.074 | 4.357 ± 0.099 | 3.083 ± 0.102 | 4.795 ± 0.119 | 0.34 ± 0.027 | 2.436 ± 0.074 | 0.0 ± 0.0 |
| Trp | 0.285 ± 0.027 | 0.051 ± 0.012 | 0.347 ± 0.029 | 0.418 ± 0.033 | 0.313 ± 0.032 | 0.385 ± 0.029 | 0.104 ± 0.018 | 0.616 ± 0.039 | 0.494 ± 0.038 | 0.467 ± 0.035 | 0.167 ± 0.022 | 0.42 ± 0.032 | 0.109 ± 0.014 | 0.136 ± 0.016 | 0.229 ± 0.022 | 0.28 ± 0.026 | 0.302 ± 0.034 | 0.327 ± 0.028 | 0.089 ± 0.022 | 0.4 ± 0.049 | 0.0 ± 0.0 |
| Tyr | 1.878 ± 0.063 | 0.465 ± 0.032 | 2.67 ± 0.077 | 2.948 ± 0.082 | 2.563 ± 0.1 | 2.577 ± 0.076 | 0.509 ± 0.03 | 3.908 ± 0.098 | 3.768 ± 0.101 | 3.81 ± 0.105 | 0.98 ± 0.04 | 2.621 ± 0.076 | 1.127 ± 0.051 | 0.836 ± 0.045 | 1.538 ± 0.058 | 2.972 ± 0.087 | 1.847 ± 0.066 | 2.328 ± 0.075 | 0.227 ± 0.021 | 1.725 ± 0.074 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 1418 proteins (449836 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
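A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is summarised as the sample standard deviation. Whether the reported ± values are standard deviations is not stated on this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Resampling unit: whole proteins, drawn with replacement.
    Assumption: the error is the sample standard deviation over replicates.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, only to show the call pattern.
toy_proteins = ["MKKLLE", "MKEI", "AKKA", "MLLKK"]
mean_pm, err_pm = bootstrap_error(toy_proteins, "KK")
print(f"KK: {mean_pm:.3f} ± {err_pm:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dipeptides of one protein together, so correlations within a protein are reflected in the error estimate.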

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski