Amino acid dipeptide frequency for Altererythrobacter insulae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
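The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual code: it assumes one-letter sequences, counts overlapping adjacent pairs, maps every non-standard letter to `X` (Xaa), and compares against the uniform expectation of 2.5‰. The names `dipeptide_per_mille`, `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
        seq = "".join(ch if ch in STANDARD else "X" for ch in seq.upper())
        # Overlapping pairs: a protein of length n contributes n - 1 dipeptides.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))  # 400.0 over
```

With the values from the table, `classify(14.273)` (AlaAla) gives "over" and `classify(0.09)` (CysCys) gives "under", which corresponds to the red and blue marking described above.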

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
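As a small worked example of the unit conversion, the following Python snippet converts the AlaAla value from the table into a fraction and a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


ala_ala = 14.273  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f}")   # 0.014273
print(f"{percent:.4f} %")  # 1.4273 %
```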

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell shows frequency ± error in ‰.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 14.273 ± 0.179 | 1.055 ± 0.036 | 6.858 ± 0.105 | 7.81 ± 0.115 | 4.206 ± 0.075 | 9.627 ± 0.127 | 2.07 ± 0.059 | 6.97 ± 0.107 | 4.722 ± 0.097 | 11.984 ± 0.15 | 3.722 ± 0.075 | 3.582 ± 0.073 | 4.901 ± 0.08 | 4.335 ± 0.076 | 7.185 ± 0.103 | 6.668 ± 0.099 | 5.679 ± 0.094 | 7.391 ± 0.117 | 1.373 ± 0.046 | 2.365 ± 0.052 | 0.0 ± 0.0 |
| Cys | 0.954 ± 0.037 | 0.09 ± 0.013 | 0.611 ± 0.031 | 0.53 ± 0.026 | 0.29 ± 0.021 | 0.877 ± 0.036 | 0.204 ± 0.015 | 0.384 ± 0.02 | 0.265 ± 0.017 | 0.747 ± 0.031 | 0.155 ± 0.016 | 0.27 ± 0.018 | 0.384 ± 0.022 | 0.23 ± 0.016 | 0.439 ± 0.024 | 0.497 ± 0.019 | 0.402 ± 0.023 | 0.607 ± 0.025 | 0.111 ± 0.013 | 0.178 ± 0.014 | 0.0 ± 0.0 |
| Asp | 7.404 ± 0.111 | 0.526 ± 0.027 | 3.862 ± 0.091 | 4.486 ± 0.072 | 2.421 ± 0.053 | 5.81 ± 0.161 | 1.251 ± 0.042 | 3.293 ± 0.051 | 2.164 ± 0.051 | 5.705 ± 0.094 | 1.582 ± 0.049 | 1.889 ± 0.059 | 3.519 ± 0.067 | 2.068 ± 0.058 | 4.115 ± 0.069 | 2.686 ± 0.083 | 3.011 ± 0.084 | 4.182 ± 0.077 | 1.203 ± 0.037 | 1.613 ± 0.048 | 0.0 ± 0.0 |
| Glu | 7.805 ± 0.119 | 0.406 ± 0.021 | 3.731 ± 0.078 | 4.373 ± 0.086 | 2.269 ± 0.051 | 4.96 ± 0.077 | 1.339 ± 0.041 | 3.707 ± 0.07 | 2.701 ± 0.064 | 6.454 ± 0.113 | 1.89 ± 0.044 | 2.193 ± 0.051 | 2.897 ± 0.063 | 2.884 ± 0.062 | 4.911 ± 0.085 | 2.931 ± 0.065 | 3.637 ± 0.07 | 4.245 ± 0.07 | 0.999 ± 0.035 | 1.407 ± 0.044 | 0.0 ± 0.0 |
| Phe | 4.922 ± 0.075 | 0.365 ± 0.021 | 2.92 ± 0.067 | 2.575 ± 0.049 | 1.489 ± 0.049 | 3.771 ± 0.073 | 0.697 ± 0.027 | 1.818 ± 0.053 | 1.123 ± 0.04 | 3.177 ± 0.076 | 0.858 ± 0.034 | 1.163 ± 0.038 | 1.495 ± 0.048 | 0.992 ± 0.033 | 1.91 ± 0.046 | 2.367 ± 0.055 | 2.13 ± 0.054 | 2.764 ± 0.055 | 0.571 ± 0.027 | 1.012 ± 0.035 | 0.0 ± 0.0 |
| Gly | 8.735 ± 0.127 | 0.795 ± 0.029 | 5.096 ± 0.092 | 5.74 ± 0.09 | 3.742 ± 0.065 | 7.416 ± 0.133 | 1.695 ± 0.044 | 4.517 ± 0.079 | 3.615 ± 0.074 | 8.125 ± 0.113 | 2.382 ± 0.051 | 2.509 ± 0.057 | 3.216 ± 0.059 | 2.978 ± 0.056 | 5.118 ± 0.079 | 4.995 ± 0.083 | 4.477 ± 0.091 | 5.795 ± 0.094 | 1.419 ± 0.051 | 2.259 ± 0.054 | 0.0 ± 0.0 |
| His | 2.049 ± 0.05 | 0.244 ± 0.018 | 1.193 ± 0.04 | 1.103 ± 0.037 | 0.887 ± 0.037 | 1.751 ± 0.048 | 0.539 ± 0.026 | 0.964 ± 0.032 | 0.605 ± 0.026 | 1.768 ± 0.052 | 0.478 ± 0.024 | 0.591 ± 0.028 | 1.188 ± 0.04 | 0.557 ± 0.026 | 1.221 ± 0.045 | 1.094 ± 0.039 | 0.811 ± 0.033 | 1.285 ± 0.043 | 0.345 ± 0.022 | 0.601 ± 0.026 | 0.0 ± 0.0 |
| Ile | 7.681 ± 0.109 | 0.545 ± 0.024 | 4.005 ± 0.073 | 4.282 ± 0.073 | 1.843 ± 0.044 | 5.271 ± 0.084 | 0.916 ± 0.031 | 2.677 ± 0.062 | 1.775 ± 0.044 | 4.198 ± 0.08 | 1.114 ± 0.04 | 1.757 ± 0.05 | 2.403 ± 0.058 | 1.287 ± 0.038 | 2.952 ± 0.055 | 3.242 ± 0.069 | 2.98 ± 0.069 | 4.015 ± 0.077 | 0.704 ± 0.031 | 1.167 ± 0.038 | 0.0 ± 0.0 |
| Lys | 4.183 ± 0.083 | 0.208 ± 0.015 | 1.958 ± 0.051 | 1.941 ± 0.054 | 1.137 ± 0.037 | 2.735 ± 0.064 | 0.838 ± 0.031 | 1.939 ± 0.051 | 1.726 ± 0.062 | 3.955 ± 0.07 | 1.0 ± 0.036 | 1.04 ± 0.038 | 1.989 ± 0.048 | 1.292 ± 0.045 | 2.7 ± 0.061 | 2.076 ± 0.049 | 1.93 ± 0.051 | 2.577 ± 0.063 | 0.472 ± 0.024 | 0.748 ± 0.03 | 0.0 ± 0.0 |
| Leu | 12.352 ± 0.126 | 0.77 ± 0.032 | 6.022 ± 0.098 | 5.884 ± 0.083 | 3.517 ± 0.089 | 7.927 ± 0.107 | 1.638 ± 0.048 | 5.172 ± 0.086 | 3.464 ± 0.072 | 8.83 ± 0.146 | 2.215 ± 0.058 | 2.708 ± 0.062 | 5.052 ± 0.073 | 2.65 ± 0.062 | 5.8 ± 0.109 | 6.385 ± 0.101 | 5.456 ± 0.093 | 6.917 ± 0.102 | 1.141 ± 0.043 | 1.91 ± 0.045 | 0.0 ± 0.0 |
| Met | 3.268 ± 0.061 | 0.17 ± 0.013 | 1.304 ± 0.037 | 1.302 ± 0.042 | 0.776 ± 0.036 | 2.035 ± 0.052 | 0.537 ± 0.026 | 1.483 ± 0.04 | 1.094 ± 0.029 | 2.658 ± 0.053 | 0.737 ± 0.028 | 0.859 ± 0.029 | 1.511 ± 0.036 | 0.901 ± 0.028 | 1.859 ± 0.048 | 1.743 ± 0.044 | 1.655 ± 0.045 | 1.736 ± 0.048 | 0.254 ± 0.019 | 0.349 ± 0.019 | 0.0 ± 0.0 |
| Asn | 3.462 ± 0.061 | 0.279 ± 0.018 | 1.823 ± 0.054 | 1.762 ± 0.051 | 1.226 ± 0.041 | 2.735 ± 0.07 | 0.522 ± 0.026 | 1.614 ± 0.044 | 0.968 ± 0.035 | 2.834 ± 0.059 | 0.753 ± 0.029 | 0.913 ± 0.038 | 1.9 ± 0.048 | 0.96 ± 0.034 | 2.013 ± 0.053 | 1.784 ± 0.048 | 1.497 ± 0.051 | 2.185 ± 0.057 | 0.501 ± 0.023 | 0.808 ± 0.028 | 0.0 ± 0.0 |
| Pro | 5.064 ± 0.074 | 0.281 ± 0.022 | 3.837 ± 0.069 | 3.903 ± 0.065 | 1.907 ± 0.051 | 3.873 ± 0.059 | 0.95 ± 0.039 | 2.539 ± 0.056 | 1.721 ± 0.054 | 4.369 ± 0.079 | 1.153 ± 0.039 | 1.435 ± 0.045 | 2.131 ± 0.069 | 1.776 ± 0.05 | 2.471 ± 0.064 | 2.72 ± 0.061 | 2.474 ± 0.056 | 3.733 ± 0.062 | 0.609 ± 0.026 | 1.075 ± 0.036 | 0.0 ± 0.0 |
| Gln | 3.952 ± 0.08 | 0.264 ± 0.019 | 1.753 ± 0.044 | 1.832 ± 0.053 | 1.309 ± 0.041 | 2.396 ± 0.046 | 0.684 ± 0.031 | 2.059 ± 0.05 | 1.077 ± 0.036 | 3.471 ± 0.064 | 1.004 ± 0.03 | 0.947 ± 0.035 | 1.662 ± 0.042 | 1.32 ± 0.045 | 2.233 ± 0.048 | 2.163 ± 0.052 | 1.819 ± 0.041 | 2.439 ± 0.057 | 0.483 ± 0.022 | 0.764 ± 0.029 | 0.0 ± 0.0 |
| Arg | 6.753 ± 0.113 | 0.466 ± 0.023 | 3.991 ± 0.077 | 4.403 ± 0.076 | 2.74 ± 0.061 | 4.404 ± 0.077 | 1.265 ± 0.041 | 3.68 ± 0.065 | 2.409 ± 0.053 | 6.423 ± 0.115 | 1.782 ± 0.048 | 1.95 ± 0.047 | 2.692 ± 0.055 | 2.272 ± 0.057 | 4.075 ± 0.085 | 3.521 ± 0.065 | 2.841 ± 0.057 | 4.168 ± 0.071 | 0.975 ± 0.037 | 1.647 ± 0.044 | 0.0 ± 0.0 |
| Ser | 6.416 ± 0.1 | 0.425 ± 0.023 | 3.925 ± 0.072 | 3.904 ± 0.063 | 2.322 ± 0.058 | 5.732 ± 0.085 | 1.034 ± 0.039 | 2.996 ± 0.056 | 1.932 ± 0.045 | 5.467 ± 0.084 | 1.501 ± 0.045 | 1.808 ± 0.049 | 2.692 ± 0.053 | 1.989 ± 0.047 | 3.438 ± 0.059 | 3.374 ± 0.064 | 2.874 ± 0.06 | 3.893 ± 0.066 | 0.809 ± 0.029 | 1.419 ± 0.036 | 0.0 ± 0.0 |
| Thr | 5.824 ± 0.099 | 0.403 ± 0.023 | 3.187 ± 0.089 | 2.979 ± 0.064 | 1.914 ± 0.056 | 4.938 ± 0.09 | 1.004 ± 0.037 | 3.104 ± 0.085 | 1.687 ± 0.045 | 5.298 ± 0.094 | 1.295 ± 0.036 | 1.554 ± 0.046 | 3.131 ± 0.077 | 1.63 ± 0.039 | 3.096 ± 0.061 | 3.03 ± 0.069 | 2.638 ± 0.063 | 3.894 ± 0.077 | 0.604 ± 0.028 | 1.268 ± 0.043 | 0.0 ± 0.0 |
| Val | 8.004 ± 0.102 | 0.571 ± 0.026 | 4.342 ± 0.081 | 4.786 ± 0.084 | 2.511 ± 0.059 | 5.297 ± 0.089 | 1.235 ± 0.037 | 4.14 ± 0.077 | 2.237 ± 0.059 | 6.351 ± 0.092 | 1.765 ± 0.047 | 2.183 ± 0.059 | 3.596 ± 0.068 | 2.056 ± 0.051 | 4.152 ± 0.071 | 4.408 ± 0.078 | 4.283 ± 0.102 | 5.032 ± 0.098 | 0.865 ± 0.034 | 1.336 ± 0.035 | 0.0 ± 0.0 |
| Trp | 1.225 ± 0.041 | 0.116 ± 0.012 | 0.781 ± 0.031 | 0.733 ± 0.027 | 0.606 ± 0.032 | 0.96 ± 0.04 | 0.347 ± 0.022 | 0.75 ± 0.028 | 0.467 ± 0.023 | 1.759 ± 0.049 | 0.385 ± 0.02 | 0.473 ± 0.023 | 0.663 ± 0.027 | 0.65 ± 0.028 | 1.11 ± 0.034 | 0.89 ± 0.032 | 0.735 ± 0.028 | 0.849 ± 0.031 | 0.285 ± 0.019 | 0.283 ± 0.017 | 0.0 ± 0.0 |
| Tyr | 2.45 ± 0.061 | 0.245 ± 0.015 | 1.593 ± 0.043 | 1.41 ± 0.041 | 1.007 ± 0.036 | 2.109 ± 0.049 | 0.512 ± 0.025 | 1.007 ± 0.034 | 0.691 ± 0.031 | 2.236 ± 0.058 | 0.458 ± 0.022 | 0.718 ± 0.029 | 1.002 ± 0.032 | 0.802 ± 0.031 | 1.659 ± 0.049 | 1.376 ± 0.043 | 1.148 ± 0.048 | 1.443 ± 0.04 | 0.343 ± 0.022 | 0.645 ± 0.031 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2967 proteins (877807 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
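One plausible reading of this note, sketched in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure used by the database is not stated here, so the function name `bootstrap_dipeptide_error`, the use of the standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count once per protein so each replicate only sums cached counts.
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    if len(estimates) < 2:
        raise ValueError("not enough usable bootstrap replicates")
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
proteins = ["MAAL", "AAG", "MKV", "GAAA"]
mean_aa, sd_aa = bootstrap_dipeptide_error(proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {sd_aa:.3f} per mille")
```

Resampling at the protein level keeps each protein's residues together, so the error reflects variation between proteins rather than between individual positions.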

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski