Amino acid dipeptide frequency for Umboniibacter marinipuniceus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
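The calculation described above can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes dipeptides are counted as overlapping adjacent pairs within each protein, uses one-letter codes instead of the three-letter codes in the table, and takes the uniform 2.5‰ value as the "expected" baseline. The function names `dipeptide_per_mille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: dipeptides are overlapping adjacent pairs, counted
    within each protein (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"    # shown in red in the table
    if freq_per_mille < expected:
        return "under"   # shown in blue in the table
    return "as expected"


# Toy input, not real proteome data.
proteins = ["MAALLA", "ALAX"]
freqs = dipeptide_per_mille(proteins)
```

With real data, `proteins` would be the list of protein sequences of the proteome, and `classify(freqs["AA"])` would reproduce the red/blue marking under the assumptions stated above.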

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
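As a worked example of the unit conversion, the short sketch below converts the AlaAla value from the table (10.519‰) into a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_per_mille = 10.519       # AlaAla value taken from the table
uniform_expectation = 1 / 400    # 0.25% if all 400 dipeptides were equally likely

fraction = per_mille_to_fraction(ala_ala_per_mille)
enrichment = fraction / uniform_expectation  # how many times above the uniform value

print(f"fraction = {fraction:.6f}, enrichment = {enrichment:.2f}x")
```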

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.519 ± 0.143
AlaCys: 0.955 ± 0.037
AlaAsp: 5.622 ± 0.081
AlaGlu: 7.078 ± 0.121
AlaPhe: 3.575 ± 0.077
AlaGly: 7.182 ± 0.122
AlaHis: 1.909 ± 0.048
AlaIle: 6.16 ± 0.084
AlaLys: 4.65 ± 0.085
AlaLeu: 10.882 ± 0.129
AlaMet: 2.878 ± 0.059
AlaAsn: 4.068 ± 0.075
AlaPro: 3.495 ± 0.08
AlaGln: 4.302 ± 0.078
AlaArg: 4.709 ± 0.079
AlaSer: 6.364 ± 0.096
AlaThr: 5.383 ± 0.078
AlaVal: 7.07 ± 0.106
AlaTrp: 1.178 ± 0.037
AlaTyr: 2.552 ± 0.058
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.884 ± 0.035
CysCys: 0.139 ± 0.015
CysAsp: 0.583 ± 0.027
CysGlu: 0.673 ± 0.031
CysPhe: 0.439 ± 0.02
CysGly: 0.852 ± 0.038
CysHis: 0.302 ± 0.021
CysIle: 0.54 ± 0.027
CysLys: 0.291 ± 0.022
CysLeu: 0.925 ± 0.037
CysMet: 0.169 ± 0.017
CysAsn: 0.323 ± 0.021
CysPro: 0.433 ± 0.026
CysGln: 0.369 ± 0.022
CysArg: 0.466 ± 0.024
CysSer: 0.704 ± 0.029
CysThr: 0.453 ± 0.027
CysVal: 0.646 ± 0.029
CysTrp: 0.144 ± 0.013
CysTyr: 0.348 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.265 ± 0.086
AspCys: 0.54 ± 0.025
AspAsp: 3.43 ± 0.08
AspGlu: 4.388 ± 0.081
AspPhe: 2.511 ± 0.064
AspGly: 4.408 ± 0.099
AspHis: 1.285 ± 0.043
AspIle: 3.767 ± 0.072
AspLys: 1.958 ± 0.054
AspLeu: 5.454 ± 0.08
AspMet: 1.302 ± 0.038
AspAsn: 2.047 ± 0.053
AspPro: 2.361 ± 0.059
AspGln: 2.474 ± 0.054
AspArg: 3.019 ± 0.065
AspSer: 3.583 ± 0.084
AspThr: 2.776 ± 0.062
AspVal: 4.238 ± 0.08
AspTrp: 0.933 ± 0.035
AspTyr: 2.055 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.005 ± 0.114
GluCys: 0.488 ± 0.029
GluAsp: 3.157 ± 0.061
GluGlu: 3.633 ± 0.1
GluPhe: 2.597 ± 0.06
GluGly: 4.032 ± 0.078
GluHis: 1.415 ± 0.042
GluIle: 3.665 ± 0.071
GluLys: 2.366 ± 0.062
GluLeu: 7.721 ± 0.111
GluMet: 1.6 ± 0.049
GluAsn: 2.163 ± 0.056
GluPro: 2.185 ± 0.055
GluGln: 3.471 ± 0.074
GluArg: 4.131 ± 0.077
GluSer: 3.951 ± 0.088
GluThr: 3.087 ± 0.061
GluVal: 5.068 ± 0.097
GluTrp: 0.862 ± 0.035
GluTyr: 1.767 ± 0.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.791 ± 0.078
PheCys: 0.489 ± 0.022
PheAsp: 2.978 ± 0.063
PheGlu: 2.641 ± 0.058
PhePhe: 1.612 ± 0.048
PheGly: 3.281 ± 0.073
PheHis: 0.816 ± 0.03
PheIle: 2.307 ± 0.061
PheLys: 1.493 ± 0.045
PheLeu: 3.294 ± 0.07
PheMet: 0.927 ± 0.037
PheAsn: 1.867 ± 0.048
PhePro: 1.396 ± 0.045
PheGln: 1.277 ± 0.039
PheArg: 1.78 ± 0.046
PheSer: 3.285 ± 0.057
PheThr: 2.044 ± 0.053
PheVal: 2.838 ± 0.062
PheTrp: 0.494 ± 0.025
PheTyr: 1.257 ± 0.045
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.474 ± 0.109
GlyCys: 0.909 ± 0.036
GlyAsp: 4.28 ± 0.078
GlyGlu: 4.837 ± 0.077
GlyPhe: 3.396 ± 0.077
GlyGly: 5.504 ± 0.11
GlyHis: 1.598 ± 0.048
GlyIle: 4.333 ± 0.075
GlyLys: 3.03 ± 0.064
GlyLeu: 7.286 ± 0.098
GlyMet: 1.889 ± 0.056
GlyAsn: 2.512 ± 0.07
GlyPro: 2.061 ± 0.051
GlyGln: 2.698 ± 0.062
GlyArg: 3.653 ± 0.083
GlySer: 4.685 ± 0.081
GlyThr: 3.716 ± 0.091
GlyVal: 5.999 ± 0.096
GlyTrp: 1.076 ± 0.037
GlyTyr: 2.571 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.713 ± 0.045
HisCys: 0.34 ± 0.02
HisAsp: 1.047 ± 0.038
HisGlu: 1.189 ± 0.043
HisPhe: 1.075 ± 0.036
HisGly: 1.596 ± 0.044
HisHis: 0.688 ± 0.04
HisIle: 1.169 ± 0.042
HisLys: 0.626 ± 0.029
HisLeu: 2.162 ± 0.056
HisMet: 0.404 ± 0.023
HisAsn: 0.764 ± 0.03
HisPro: 1.167 ± 0.041
HisGln: 1.065 ± 0.036
HisArg: 1.434 ± 0.038
HisSer: 1.434 ± 0.042
HisThr: 0.97 ± 0.036
HisVal: 1.262 ± 0.042
HisTrp: 0.369 ± 0.022
HisTyr: 0.811 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.598 ± 0.096
IleCys: 0.583 ± 0.029
IleAsp: 4.32 ± 0.078
IleGlu: 4.43 ± 0.083
IlePhe: 1.955 ± 0.057
IleGly: 4.447 ± 0.074
IleHis: 1.152 ± 0.037
IleIle: 3.234 ± 0.071
IleLys: 2.29 ± 0.063
IleLeu: 4.618 ± 0.09
IleMet: 1.087 ± 0.038
IleAsn: 2.633 ± 0.053
IlePro: 2.448 ± 0.057
IleGln: 2.095 ± 0.053
IleArg: 3.078 ± 0.062
IleSer: 4.123 ± 0.066
IleThr: 3.104 ± 0.073
IleVal: 3.991 ± 0.071
IleTrp: 0.596 ± 0.026
IleTyr: 1.438 ± 0.045
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.068 ± 0.077
LysCys: 0.249 ± 0.018
LysAsp: 1.925 ± 0.056
LysGlu: 1.94 ± 0.05
LysPhe: 1.219 ± 0.04
LysGly: 2.399 ± 0.06
LysHis: 0.899 ± 0.035
LysIle: 2.007 ± 0.05
LysLys: 1.738 ± 0.058
LysLeu: 4.468 ± 0.081
LysMet: 0.979 ± 0.041
LysAsn: 1.184 ± 0.039
LysPro: 1.883 ± 0.054
LysGln: 1.869 ± 0.048
LysArg: 2.655 ± 0.06
LysSer: 2.447 ± 0.063
LysThr: 2.022 ± 0.051
LysVal: 2.854 ± 0.068
LysTrp: 0.449 ± 0.021
LysTyr: 0.928 ± 0.032
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.96 ± 0.135
LeuCys: 0.991 ± 0.032
LeuAsp: 5.905 ± 0.09
LeuGlu: 6.046 ± 0.11
LeuPhe: 3.797 ± 0.076
LeuGly: 7.173 ± 0.107
LeuHis: 1.868 ± 0.051
LeuIle: 6.026 ± 0.107
LeuLys: 4.218 ± 0.079
LeuLeu: 10.282 ± 0.158
LeuMet: 2.393 ± 0.058
LeuAsn: 4.317 ± 0.07
LeuPro: 4.509 ± 0.077
LeuGln: 3.668 ± 0.079
LeuArg: 5.396 ± 0.093
LeuSer: 8.376 ± 0.121
LeuThr: 5.499 ± 0.079
LeuVal: 7.499 ± 0.107
LeuTrp: 1.131 ± 0.038
LeuTyr: 2.299 ± 0.057
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.636 ± 0.062
MetCys: 0.155 ± 0.012
MetAsp: 1.147 ± 0.035
MetGlu: 1.162 ± 0.038
MetPhe: 0.698 ± 0.034
MetGly: 1.794 ± 0.055
MetHis: 0.418 ± 0.024
MetIle: 1.418 ± 0.043
MetLys: 1.164 ± 0.04
MetLeu: 2.317 ± 0.058
MetMet: 0.682 ± 0.032
MetAsn: 0.988 ± 0.034
MetPro: 1.1 ± 0.039
MetGln: 0.934 ± 0.032
MetArg: 1.19 ± 0.039
MetSer: 2.004 ± 0.05
MetThr: 1.549 ± 0.047
MetVal: 1.648 ± 0.047
MetTrp: 0.216 ± 0.016
MetTyr: 0.473 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.634 ± 0.06
AsnCys: 0.391 ± 0.025
AsnAsp: 2.279 ± 0.053
AsnGlu: 2.432 ± 0.059
AsnPhe: 1.593 ± 0.044
AsnGly: 2.96 ± 0.079
AsnHis: 0.84 ± 0.03
AsnIle: 2.158 ± 0.056
AsnLys: 1.289 ± 0.036
AsnLeu: 3.377 ± 0.076
AsnMet: 0.789 ± 0.033
AsnAsn: 1.512 ± 0.046
AsnPro: 2.049 ± 0.055
AsnGln: 1.582 ± 0.039
AsnArg: 2.142 ± 0.054
AsnSer: 2.459 ± 0.061
AsnThr: 2.193 ± 0.053
AsnVal: 2.427 ± 0.06
AsnTrp: 0.66 ± 0.026
AsnTyr: 1.333 ± 0.049
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.751 ± 0.071
ProCys: 0.307 ± 0.018
ProAsp: 2.313 ± 0.062
ProGlu: 3.213 ± 0.067
ProPhe: 1.6 ± 0.048
ProGly: 2.734 ± 0.058
ProHis: 0.84 ± 0.035
ProIle: 2.392 ± 0.053
ProLys: 1.644 ± 0.05
ProLeu: 4.024 ± 0.076
ProMet: 0.986 ± 0.034
ProAsn: 1.797 ± 0.045
ProPro: 1.312 ± 0.05
ProGln: 1.496 ± 0.042
ProArg: 1.684 ± 0.049
ProSer: 3.029 ± 0.061
ProThr: 2.294 ± 0.054
ProVal: 3.141 ± 0.068
ProTrp: 0.575 ± 0.029
ProTyr: 1.098 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.158 ± 0.077
GlnCys: 0.342 ± 0.023
GlnAsp: 1.632 ± 0.043
GlnGlu: 1.843 ± 0.052
GlnPhe: 1.711 ± 0.044
GlnGly: 2.688 ± 0.051
GlnHis: 1.057 ± 0.039
GlnIle: 2.201 ± 0.052
GlnLys: 1.262 ± 0.043
GlnLeu: 5.759 ± 0.104
GlnMet: 0.943 ± 0.031
GlnAsn: 1.154 ± 0.037
GlnPro: 1.787 ± 0.052
GlnGln: 2.586 ± 0.072
GlnArg: 3.197 ± 0.064
GlnSer: 2.838 ± 0.063
GlnThr: 1.963 ± 0.055
GlnVal: 2.899 ± 0.064
GlnTrp: 0.644 ± 0.031
GlnTyr: 1.128 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.889 ± 0.082
ArgCys: 0.544 ± 0.028
ArgAsp: 3.085 ± 0.064
ArgGlu: 3.697 ± 0.078
ArgPhe: 2.586 ± 0.056
ArgGly: 3.547 ± 0.075
ArgHis: 1.272 ± 0.038
ArgIle: 3.287 ± 0.061
ArgLys: 1.981 ± 0.053
ArgLeu: 5.79 ± 0.089
ArgMet: 1.287 ± 0.041
ArgAsn: 1.905 ± 0.045
ArgPro: 1.886 ± 0.051
ArgGln: 2.348 ± 0.057
ArgArg: 3.136 ± 0.078
ArgSer: 3.357 ± 0.069
ArgThr: 2.34 ± 0.052
ArgVal: 3.993 ± 0.073
ArgTrp: 0.903 ± 0.034
ArgTyr: 2.0 ± 0.056
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.944 ± 0.094
SerCys: 0.67 ± 0.035
SerAsp: 3.993 ± 0.091
SerGlu: 4.53 ± 0.088
SerPhe: 3.004 ± 0.068
SerGly: 5.514 ± 0.108
SerHis: 1.449 ± 0.043
SerIle: 3.697 ± 0.069
SerLys: 2.44 ± 0.055
SerLeu: 7.29 ± 0.097
SerMet: 1.618 ± 0.043
SerAsn: 2.468 ± 0.063
SerPro: 2.816 ± 0.059
SerGln: 2.743 ± 0.062
SerArg: 3.323 ± 0.064
SerSer: 4.892 ± 0.089
SerThr: 3.483 ± 0.068
SerVal: 5.087 ± 0.086
SerTrp: 0.978 ± 0.035
SerTyr: 2.21 ± 0.061
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.164 ± 0.09
ThrCys: 0.376 ± 0.025
ThrAsp: 2.862 ± 0.08
ThrGlu: 3.15 ± 0.066
ThrPhe: 1.805 ± 0.048
ThrGly: 4.035 ± 0.081
ThrHis: 1.116 ± 0.038
ThrIle: 3.154 ± 0.065
ThrLys: 1.835 ± 0.054
ThrLeu: 5.851 ± 0.087
ThrMet: 1.07 ± 0.035
ThrAsn: 1.945 ± 0.054
ThrPro: 2.803 ± 0.058
ThrGln: 2.183 ± 0.053
ThrArg: 2.485 ± 0.054
ThrSer: 3.398 ± 0.074
ThrThr: 3.121 ± 0.07
ThrVal: 3.781 ± 0.079
ThrTrp: 0.642 ± 0.032
ThrTyr: 1.275 ± 0.05
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.825 ± 0.104
ValCys: 0.741 ± 0.031
ValAsp: 4.811 ± 0.078
ValGlu: 5.068 ± 0.101
ValPhe: 2.775 ± 0.064
ValGly: 5.259 ± 0.093
ValHis: 1.299 ± 0.042
ValIle: 4.765 ± 0.077
ValLys: 2.719 ± 0.063
ValLeu: 6.858 ± 0.1
ValMet: 1.867 ± 0.053
ValAsn: 2.99 ± 0.056
ValPro: 2.766 ± 0.06
ValGln: 2.134 ± 0.059
ValArg: 3.382 ± 0.072
ValSer: 5.302 ± 0.091
ValThr: 4.005 ± 0.082
ValVal: 6.122 ± 0.095
ValTrp: 0.825 ± 0.031
ValTyr: 1.799 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.973 ± 0.039
TrpCys: 0.167 ± 0.014
TrpAsp: 0.69 ± 0.03
TrpGlu: 0.669 ± 0.035
TrpPhe: 0.612 ± 0.03
TrpGly: 0.881 ± 0.038
TrpHis: 0.338 ± 0.018
TrpIle: 0.704 ± 0.035
TrpLys: 0.345 ± 0.02
TrpLeu: 1.919 ± 0.055
TrpMet: 0.328 ± 0.021
TrpAsn: 0.463 ± 0.025
TrpPro: 0.524 ± 0.026
TrpGln: 0.842 ± 0.036
TrpArg: 0.91 ± 0.036
TrpSer: 0.866 ± 0.031
TrpThr: 0.577 ± 0.029
TrpVal: 0.937 ± 0.036
TrpTrp: 0.235 ± 0.018
TrpTyr: 0.394 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.281 ± 0.06
TyrCys: 0.31 ± 0.017
TyrAsp: 1.736 ± 0.06
TyrGlu: 1.746 ± 0.048
TyrPhe: 1.306 ± 0.039
TyrGly: 2.224 ± 0.056
TyrHis: 0.723 ± 0.036
TyrIle: 1.234 ± 0.042
TyrLys: 0.847 ± 0.033
TyrLeu: 3.076 ± 0.063
TyrMet: 0.491 ± 0.028
TyrAsn: 0.995 ± 0.035
TyrPro: 1.25 ± 0.045
TyrGln: 1.657 ± 0.046
TyrArg: 2.062 ± 0.056
TyrSer: 2.05 ± 0.057
TyrThr: 1.473 ± 0.04
TyrVal: 1.812 ± 0.048
TyrTrp: 0.464 ± 0.028
TyrTyr: 0.953 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2413 proteins (804001 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
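One way such an error estimate could be obtained is sketched below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the reported error is taken to be the standard deviation of those estimates. The source states only that bootstrapping (x100) at the protein level was used, so the choice of standard deviation, the function names and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_counts(seq):
    """Count overlapping adjacent pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (in per mille) of a dipeptide frequency over
    bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)
    # Count once per protein; resampling then reuses these counters.
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input, not real proteome data.
toy_proteins = ["MAALLA", "ALAAK", "MKKLAA", "AAAA"]
error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.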

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski