Amino acid dipeptide frequency for Prevotella sp. ne3005

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
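As a rough illustration of what such a frequency is, the sketch below counts overlapping residue pairs within each protein and expresses them in per mille. It is a minimal sketch, not the code used to produce this page: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes, and the choice to count overlapping pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are counted as overlapping windows of length 2,
    and a pair never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not taken from this proteome.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

In the toy input there are six pairs in total, two of which are MK, so MK accounts for one third of all dipeptides.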

There are 400 possible dipeptides (441 if we treat Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
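The arithmetic behind the 0.25% baseline and the per mille unit can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the simple greater-than or less-than comparison against the uniform baseline is an assumption; the exact rule used for the red and blue marking is not stated here. The two example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table: LeuLeu and CysCys.
examples = {"LeuLeu": 8.419, "CysCys": 0.243}
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```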

For more information, see the sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.897 ± 0.107 | 1.041 ± 0.035 | 4.537 ± 0.064 | 4.955 ± 0.079 | 3.154 ± 0.055 | 4.999 ± 0.081 | 1.345 ± 0.031 | 5.017 ± 0.074 | 4.552 ± 0.069 | 6.77 ± 0.098 | 2.257 ± 0.047 | 3.094 ± 0.059 | 2.411 ± 0.053 | 3.207 ± 0.057 | 3.11 ± 0.059 | 4.079 ± 0.072 | 3.847 ± 0.066 | 4.851 ± 0.07 | 0.932 ± 0.031 | 2.885 ± 0.058 | 0.0 ± 0.0 |
| Cys | 0.867 ± 0.032 | 0.243 ± 0.015 | 0.764 ± 0.031 | 0.811 ± 0.03 | 0.585 ± 0.023 | 1.155 ± 0.038 | 0.398 ± 0.019 | 0.931 ± 0.028 | 0.706 ± 0.031 | 1.136 ± 0.03 | 0.365 ± 0.016 | 0.578 ± 0.023 | 0.542 ± 0.025 | 0.496 ± 0.024 | 0.656 ± 0.026 | 0.794 ± 0.026 | 0.661 ± 0.027 | 0.894 ± 0.026 | 0.2 ± 0.014 | 0.59 ± 0.026 | 0.0 ± 0.0 |
| Asp | 4.298 ± 0.062 | 0.75 ± 0.025 | 3.459 ± 0.072 | 4.329 ± 0.082 | 3.103 ± 0.054 | 4.798 ± 0.083 | 1.175 ± 0.034 | 4.23 ± 0.068 | 3.722 ± 0.07 | 4.788 ± 0.07 | 1.669 ± 0.041 | 2.793 ± 0.058 | 2.09 ± 0.051 | 1.711 ± 0.037 | 2.575 ± 0.051 | 3.181 ± 0.058 | 2.726 ± 0.055 | 3.952 ± 0.055 | 0.945 ± 0.026 | 3.193 ± 0.062 | 0.0 ± 0.0 |
| Glu | 5.232 ± 0.083 | 0.701 ± 0.027 | 3.63 ± 0.054 | 4.716 ± 0.096 | 2.391 ± 0.044 | 4.391 ± 0.071 | 1.4 ± 0.041 | 4.181 ± 0.076 | 4.68 ± 0.093 | 5.673 ± 0.078 | 2.234 ± 0.041 | 3.114 ± 0.053 | 2.033 ± 0.049 | 2.999 ± 0.077 | 3.386 ± 0.062 | 3.011 ± 0.044 | 3.249 ± 0.054 | 4.204 ± 0.067 | 0.877 ± 0.029 | 2.455 ± 0.052 | 0.0 ± 0.0 |
| Phe | 2.932 ± 0.054 | 0.802 ± 0.028 | 2.926 ± 0.051 | 2.507 ± 0.044 | 2.124 ± 0.052 | 3.321 ± 0.061 | 0.921 ± 0.027 | 2.809 ± 0.049 | 2.325 ± 0.048 | 3.709 ± 0.067 | 1.18 ± 0.03 | 2.238 ± 0.045 | 1.646 ± 0.036 | 1.309 ± 0.032 | 2.12 ± 0.049 | 3.125 ± 0.057 | 2.553 ± 0.047 | 2.95 ± 0.062 | 0.636 ± 0.029 | 1.893 ± 0.042 | 0.0 ± 0.0 |
| Gly | 4.545 ± 0.073 | 1.039 ± 0.036 | 3.948 ± 0.07 | 4.19 ± 0.068 | 3.218 ± 0.064 | 4.976 ± 0.089 | 1.582 ± 0.044 | 5.084 ± 0.075 | 4.899 ± 0.08 | 6.001 ± 0.089 | 2.213 ± 0.05 | 3.34 ± 0.067 | 1.47 ± 0.038 | 2.484 ± 0.048 | 3.277 ± 0.061 | 4.059 ± 0.071 | 4.042 ± 0.063 | 4.794 ± 0.062 | 1.073 ± 0.033 | 3.276 ± 0.065 | 0.0 ± 0.0 |
| His | 1.27 ± 0.031 | 0.365 ± 0.02 | 1.207 ± 0.035 | 1.216 ± 0.037 | 1.13 ± 0.03 | 1.467 ± 0.041 | 0.646 ± 0.027 | 1.436 ± 0.038 | 1.012 ± 0.033 | 2.036 ± 0.045 | 0.442 ± 0.02 | 0.973 ± 0.032 | 1.16 ± 0.033 | 0.863 ± 0.027 | 1.046 ± 0.035 | 1.1 ± 0.033 | 1.047 ± 0.029 | 1.237 ± 0.037 | 0.281 ± 0.016 | 1.061 ± 0.033 | 0.0 ± 0.0 |
| Ile | 5.139 ± 0.079 | 1.032 ± 0.034 | 4.495 ± 0.069 | 4.283 ± 0.073 | 2.601 ± 0.051 | 4.746 ± 0.077 | 1.328 ± 0.035 | 4.502 ± 0.078 | 3.773 ± 0.071 | 5.496 ± 0.079 | 1.591 ± 0.039 | 3.232 ± 0.058 | 2.948 ± 0.049 | 2.202 ± 0.041 | 3.335 ± 0.064 | 4.198 ± 0.066 | 3.736 ± 0.071 | 4.453 ± 0.071 | 0.761 ± 0.029 | 2.549 ± 0.049 | 0.0 ± 0.0 |
| Lys | 5.112 ± 0.084 | 0.581 ± 0.025 | 4.023 ± 0.073 | 4.888 ± 0.077 | 2.044 ± 0.046 | 4.389 ± 0.076 | 1.216 ± 0.037 | 3.748 ± 0.062 | 4.702 ± 0.1 | 4.952 ± 0.068 | 2.352 ± 0.044 | 3.134 ± 0.055 | 2.282 ± 0.043 | 2.603 ± 0.075 | 3.052 ± 0.056 | 3.226 ± 0.056 | 3.636 ± 0.062 | 4.123 ± 0.066 | 0.817 ± 0.028 | 2.773 ± 0.057 | 0.0 ± 0.0 |
| Leu | 6.435 ± 0.095 | 1.322 ± 0.036 | 4.929 ± 0.074 | 4.918 ± 0.073 | 4.116 ± 0.074 | 5.692 ± 0.076 | 1.833 ± 0.041 | 5.293 ± 0.079 | 5.937 ± 0.085 | 8.419 ± 0.136 | 2.949 ± 0.053 | 4.293 ± 0.071 | 3.882 ± 0.06 | 3.378 ± 0.065 | 4.278 ± 0.077 | 6.024 ± 0.088 | 5.509 ± 0.075 | 5.231 ± 0.083 | 1.162 ± 0.034 | 3.515 ± 0.058 | 0.0 ± 0.0 |
| Met | 2.657 ± 0.048 | 0.304 ± 0.017 | 1.562 ± 0.038 | 1.86 ± 0.041 | 1.13 ± 0.032 | 1.989 ± 0.043 | 0.533 ± 0.018 | 1.756 ± 0.039 | 2.656 ± 0.05 | 2.771 ± 0.055 | 1.202 ± 0.039 | 1.568 ± 0.035 | 1.309 ± 0.037 | 1.219 ± 0.038 | 1.45 ± 0.034 | 1.691 ± 0.044 | 1.91 ± 0.04 | 1.821 ± 0.039 | 0.302 ± 0.017 | 0.865 ± 0.035 | 0.0 ± 0.0 |
| Asn | 3.403 ± 0.059 | 0.569 ± 0.024 | 2.829 ± 0.064 | 2.845 ± 0.05 | 2.01 ± 0.041 | 3.798 ± 0.078 | 1.059 ± 0.035 | 3.348 ± 0.057 | 2.733 ± 0.051 | 3.975 ± 0.06 | 1.306 ± 0.035 | 2.336 ± 0.06 | 2.421 ± 0.045 | 1.656 ± 0.041 | 2.206 ± 0.049 | 2.465 ± 0.057 | 2.503 ± 0.062 | 3.096 ± 0.06 | 0.642 ± 0.025 | 2.182 ± 0.055 | 0.0 ± 0.0 |
| Pro | 2.722 ± 0.055 | 0.42 ± 0.02 | 2.578 ± 0.047 | 3.388 ± 0.063 | 1.77 ± 0.043 | 2.229 ± 0.046 | 0.766 ± 0.028 | 2.263 ± 0.045 | 2.205 ± 0.044 | 3.337 ± 0.056 | 1.109 ± 0.032 | 1.702 ± 0.04 | 0.771 ± 0.029 | 1.627 ± 0.044 | 1.386 ± 0.039 | 2.122 ± 0.042 | 2.02 ± 0.044 | 2.769 ± 0.05 | 0.509 ± 0.025 | 1.782 ± 0.049 | 0.0 ± 0.0 |
| Gln | 2.736 ± 0.05 | 0.373 ± 0.018 | 1.794 ± 0.044 | 2.56 ± 0.088 | 1.58 ± 0.041 | 2.374 ± 0.049 | 0.863 ± 0.031 | 2.383 ± 0.051 | 2.67 ± 0.062 | 3.834 ± 0.064 | 1.448 ± 0.033 | 1.75 ± 0.042 | 1.49 ± 0.035 | 2.221 ± 0.107 | 1.99 ± 0.062 | 1.98 ± 0.045 | 2.145 ± 0.041 | 2.22 ± 0.042 | 0.603 ± 0.025 | 1.648 ± 0.034 | 0.0 ± 0.0 |
| Arg | 2.796 ± 0.054 | 0.538 ± 0.023 | 2.495 ± 0.05 | 3.156 ± 0.059 | 2.198 ± 0.049 | 2.568 ± 0.047 | 1.189 ± 0.034 | 3.442 ± 0.062 | 3.056 ± 0.056 | 4.816 ± 0.079 | 1.612 ± 0.045 | 2.198 ± 0.054 | 1.679 ± 0.043 | 2.429 ± 0.052 | 2.526 ± 0.052 | 2.35 ± 0.055 | 2.355 ± 0.044 | 2.65 ± 0.05 | 0.792 ± 0.027 | 2.257 ± 0.05 | 0.0 ± 0.0 |
| Ser | 4.097 ± 0.072 | 0.815 ± 0.025 | 3.471 ± 0.058 | 3.417 ± 0.056 | 2.85 ± 0.054 | 4.16 ± 0.061 | 1.252 ± 0.034 | 3.906 ± 0.062 | 3.362 ± 0.065 | 5.503 ± 0.089 | 1.607 ± 0.04 | 2.432 ± 0.056 | 2.179 ± 0.043 | 2.119 ± 0.04 | 2.558 ± 0.05 | 3.614 ± 0.072 | 3.032 ± 0.053 | 3.938 ± 0.069 | 0.86 ± 0.031 | 2.551 ± 0.062 | 0.0 ± 0.0 |
| Thr | 4.104 ± 0.064 | 0.623 ± 0.022 | 3.394 ± 0.061 | 3.11 ± 0.058 | 2.597 ± 0.05 | 4.038 ± 0.065 | 1.03 ± 0.031 | 4.167 ± 0.068 | 2.964 ± 0.053 | 5.233 ± 0.07 | 1.503 ± 0.035 | 2.397 ± 0.056 | 2.621 ± 0.05 | 1.774 ± 0.043 | 2.176 ± 0.042 | 3.275 ± 0.059 | 3.138 ± 0.063 | 3.73 ± 0.058 | 0.767 ± 0.03 | 2.41 ± 0.055 | 0.001 ± 0.001 |
| Val | 4.801 ± 0.068 | 1.073 ± 0.035 | 3.802 ± 0.061 | 4.133 ± 0.065 | 2.815 ± 0.055 | 4.241 ± 0.067 | 1.122 ± 0.035 | 4.392 ± 0.065 | 4.433 ± 0.069 | 5.552 ± 0.07 | 1.957 ± 0.042 | 3.201 ± 0.056 | 2.426 ± 0.051 | 1.905 ± 0.041 | 3.009 ± 0.054 | 4.182 ± 0.064 | 3.827 ± 0.068 | 4.717 ± 0.082 | 0.818 ± 0.028 | 2.568 ± 0.046 | 0.0 ± 0.0 |
| Trp | 0.894 ± 0.027 | 0.184 ± 0.012 | 0.748 ± 0.025 | 0.77 ± 0.028 | 0.601 ± 0.024 | 1.061 ± 0.031 | 0.327 ± 0.016 | 0.82 ± 0.031 | 0.841 ± 0.027 | 1.441 ± 0.039 | 0.465 ± 0.024 | 0.755 ± 0.029 | 0.395 ± 0.02 | 0.681 ± 0.024 | 0.719 ± 0.024 | 0.782 ± 0.031 | 0.733 ± 0.027 | 0.779 ± 0.029 | 0.25 ± 0.014 | 0.582 ± 0.022 | 0.0 ± 0.0 |
| Tyr | 3.001 ± 0.058 | 0.596 ± 0.026 | 2.898 ± 0.056 | 2.502 ± 0.051 | 2.011 ± 0.048 | 3.118 ± 0.06 | 1.008 ± 0.031 | 2.692 ± 0.053 | 2.425 ± 0.045 | 3.673 ± 0.068 | 1.093 ± 0.032 | 2.256 ± 0.046 | 1.72 ± 0.042 | 1.781 ± 0.042 | 2.247 ± 0.047 | 2.464 ± 0.049 | 2.383 ± 0.044 | 2.563 ± 0.055 | 0.602 ± 0.025 | 2.156 ± 0.063 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.003 ± 0.003 |
Statistics based on 3038 proteins (1096786 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
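One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only a sketch under assumptions, not the procedure actually used for this page: the function names, the use of the standard deviation as the error measure, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not individual residues, are the resampling unit,
    and the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy_proteome = ["MKKLA", "MKA", "AAAA"]
print(bootstrap_error(toy_proteome, "MK"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than between individual positions.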

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski