Amino acid dipepetide frequency for Leifsonia sp. Root227

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.429AlaAla: 20.429 ± 0.203
0.632AlaCys: 0.632 ± 0.026
8.215AlaAsp: 8.215 ± 0.099
7.798AlaGlu: 7.798 ± 0.106
4.257AlaPhe: 4.257 ± 0.06
12.096AlaGly: 12.096 ± 0.12
2.338AlaHis: 2.338 ± 0.049
6.49AlaIle: 6.49 ± 0.077
3.175AlaLys: 3.175 ± 0.065
13.469AlaLeu: 13.469 ± 0.136
2.646AlaMet: 2.646 ± 0.047
2.712AlaAsn: 2.712 ± 0.053
6.184AlaPro: 6.184 ± 0.098
3.803AlaGln: 3.803 ± 0.062
8.202AlaArg: 8.202 ± 0.1
8.006AlaSer: 8.006 ± 0.099
7.886AlaThr: 7.886 ± 0.087
12.154AlaVal: 12.154 ± 0.116
1.822AlaTrp: 1.822 ± 0.046
2.465AlaTyr: 2.465 ± 0.05
0.001AlaXaa: 0.001 ± 0.001
Cys
0.618CysAla: 0.618 ± 0.025
0.041CysCys: 0.041 ± 0.006
0.296CysAsp: 0.296 ± 0.014
0.211CysGlu: 0.211 ± 0.014
0.16CysPhe: 0.16 ± 0.011
0.507CysGly: 0.507 ± 0.021
0.102CysHis: 0.102 ± 0.01
0.203CysIle: 0.203 ± 0.013
0.068CysLys: 0.068 ± 0.008
0.404CysLeu: 0.404 ± 0.019
0.071CysMet: 0.071 ± 0.008
0.095CysAsn: 0.095 ± 0.009
0.243CysPro: 0.243 ± 0.018
0.116CysGln: 0.116 ± 0.01
0.245CysArg: 0.245 ± 0.013
0.357CysSer: 0.357 ± 0.019
0.307CysThr: 0.307 ± 0.019
0.425CysVal: 0.425 ± 0.021
0.068CysTrp: 0.068 ± 0.008
0.087CysTyr: 0.087 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
9.438AspAla: 9.438 ± 0.105
0.211AspCys: 0.211 ± 0.014
4.174AspAsp: 4.174 ± 0.077
3.855AspGlu: 3.855 ± 0.071
1.746AspPhe: 1.746 ± 0.039
6.357AspGly: 6.357 ± 0.087
1.141AspHis: 1.141 ± 0.035
2.449AspIle: 2.449 ± 0.044
1.169AspLys: 1.169 ± 0.042
6.056AspLeu: 6.056 ± 0.082
0.683AspMet: 0.683 ± 0.025
1.103AspAsn: 1.103 ± 0.035
3.761AspPro: 3.761 ± 0.055
1.737AspGln: 1.737 ± 0.04
4.238AspArg: 4.238 ± 0.073
2.649AspSer: 2.649 ± 0.044
3.111AspThr: 3.111 ± 0.055
5.113AspVal: 5.113 ± 0.073
1.0AspTrp: 1.0 ± 0.034
1.285AspTyr: 1.285 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
6.254GluAla: 6.254 ± 0.086
0.202GluCys: 0.202 ± 0.012
2.323GluAsp: 2.323 ± 0.043
2.713GluGlu: 2.713 ± 0.064
1.68GluPhe: 1.68 ± 0.034
3.12GluGly: 3.12 ± 0.056
1.412GluHis: 1.412 ± 0.036
2.404GluIle: 2.404 ± 0.057
1.468GluLys: 1.468 ± 0.042
6.622GluLeu: 6.622 ± 0.088
0.768GluMet: 0.768 ± 0.027
1.229GluAsn: 1.229 ± 0.033
2.842GluPro: 2.842 ± 0.063
2.125GluGln: 2.125 ± 0.049
4.95GluArg: 4.95 ± 0.087
2.772GluSer: 2.772 ± 0.049
3.033GluThr: 3.033 ± 0.057
4.036GluVal: 4.036 ± 0.065
0.779GluTrp: 0.779 ± 0.028
1.089GluTyr: 1.089 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
4.513PheAla: 4.513 ± 0.067
0.165PheCys: 0.165 ± 0.013
2.432PheAsp: 2.432 ± 0.053
1.718PheGlu: 1.718 ± 0.042
1.205PhePhe: 1.205 ± 0.035
3.587PheGly: 3.587 ± 0.063
0.556PheHis: 0.556 ± 0.021
1.323PheIle: 1.323 ± 0.036
0.532PheLys: 0.532 ± 0.022
3.052PheLeu: 3.052 ± 0.057
0.448PheMet: 0.448 ± 0.019
0.765PheAsn: 0.765 ± 0.027
1.51PhePro: 1.51 ± 0.038
0.869PheGln: 0.869 ± 0.026
1.857PheArg: 1.857 ± 0.035
2.011PheSer: 2.011 ± 0.042
2.203PheThr: 2.203 ± 0.047
3.094PheVal: 3.094 ± 0.056
0.564PheTrp: 0.564 ± 0.022
0.717PheTyr: 0.717 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
10.884GlyAla: 10.884 ± 0.109
0.542GlyCys: 0.542 ± 0.022
5.036GlyAsp: 5.036 ± 0.069
4.437GlyGlu: 4.437 ± 0.068
3.362GlyPhe: 3.362 ± 0.049
7.77GlyGly: 7.77 ± 0.126
1.776GlyHis: 1.776 ± 0.038
4.768GlyIle: 4.768 ± 0.067
2.429GlyLys: 2.429 ± 0.054
8.761GlyLeu: 8.761 ± 0.095
1.976GlyMet: 1.976 ± 0.045
1.808GlyAsn: 1.808 ± 0.05
3.635GlyPro: 3.635 ± 0.049
2.581GlyGln: 2.581 ± 0.048
5.96GlyArg: 5.96 ± 0.083
5.792GlySer: 5.792 ± 0.075
5.679GlyThr: 5.679 ± 0.095
8.019GlyVal: 8.019 ± 0.098
1.688GlyTrp: 1.688 ± 0.038
2.299GlyTyr: 2.299 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
2.183HisAla: 2.183 ± 0.047
0.097HisCys: 0.097 ± 0.009
1.29HisAsp: 1.29 ± 0.037
1.071HisGlu: 1.071 ± 0.034
0.556HisPhe: 0.556 ± 0.023
1.843HisGly: 1.843 ± 0.037
0.466HisHis: 0.466 ± 0.02
0.789HisIle: 0.789 ± 0.024
0.314HisLys: 0.314 ± 0.016
1.944HisLeu: 1.944 ± 0.046
0.273HisMet: 0.273 ± 0.016
0.366HisAsn: 0.366 ± 0.018
1.551HisPro: 1.551 ± 0.032
0.514HisGln: 0.514 ± 0.02
1.367HisArg: 1.367 ± 0.037
0.989HisSer: 0.989 ± 0.028
1.003HisThr: 1.003 ± 0.03
1.502HisVal: 1.502 ± 0.038
0.3HisTrp: 0.3 ± 0.014
0.45HisTyr: 0.45 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
7.394IleAla: 7.394 ± 0.089
0.224IleCys: 0.224 ± 0.014
3.423IleAsp: 3.423 ± 0.056
2.605IleGlu: 2.605 ± 0.053
1.266IlePhe: 1.266 ± 0.032
4.968IleGly: 4.968 ± 0.086
0.803IleHis: 0.803 ± 0.024
1.872IleIle: 1.872 ± 0.051
0.902IleLys: 0.902 ± 0.031
4.003IleLeu: 4.003 ± 0.062
0.634IleMet: 0.634 ± 0.025
0.981IleAsn: 0.981 ± 0.034
2.586IlePro: 2.586 ± 0.044
1.159IleGln: 1.159 ± 0.031
2.578IleArg: 2.578 ± 0.057
2.366IleSer: 2.366 ± 0.042
2.802IleThr: 2.802 ± 0.055
5.391IleVal: 5.391 ± 0.066
0.53IleTrp: 0.53 ± 0.02
0.827IleTyr: 0.827 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
2.843LysAla: 2.843 ± 0.063
0.063LysCys: 0.063 ± 0.007
1.36LysAsp: 1.36 ± 0.039
1.054LysGlu: 1.054 ± 0.035
0.563LysPhe: 0.563 ± 0.022
1.667LysGly: 1.667 ± 0.044
0.5LysHis: 0.5 ± 0.022
0.964LysIle: 0.964 ± 0.033
0.944LysLys: 0.944 ± 0.04
2.282LysLeu: 2.282 ± 0.049
0.436LysMet: 0.436 ± 0.019
0.693LysAsn: 0.693 ± 0.025
1.417LysPro: 1.417 ± 0.043
0.864LysGln: 0.864 ± 0.029
1.743LysArg: 1.743 ± 0.04
1.308LysSer: 1.308 ± 0.039
1.486LysThr: 1.486 ± 0.042
1.77LysVal: 1.77 ± 0.043
0.298LysTrp: 0.298 ± 0.016
0.533LysTyr: 0.533 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
14.378LeuAla: 14.378 ± 0.124
0.552LeuCys: 0.552 ± 0.021
6.708LeuAsp: 6.708 ± 0.088
4.471LeuGlu: 4.471 ± 0.069
3.153LeuPhe: 3.153 ± 0.055
9.193LeuGly: 9.193 ± 0.093
1.852LeuHis: 1.852 ± 0.04
4.7LeuIle: 4.7 ± 0.066
1.988LeuLys: 1.988 ± 0.048
10.259LeuLeu: 10.259 ± 0.131
1.549LeuMet: 1.549 ± 0.035
2.03LeuAsn: 2.03 ± 0.039
5.585LeuPro: 5.585 ± 0.076
2.484LeuGln: 2.484 ± 0.044
6.966LeuArg: 6.966 ± 0.095
6.169LeuSer: 6.169 ± 0.08
6.494LeuThr: 6.494 ± 0.072
9.284LeuVal: 9.284 ± 0.105
1.293LeuTrp: 1.293 ± 0.035
1.8LeuTyr: 1.8 ± 0.04
0.0LeuXaa: 0.0 ± 0.0
Met
1.96MetAla: 1.96 ± 0.042
0.084MetCys: 0.084 ± 0.009
0.802MetAsp: 0.802 ± 0.026
0.583MetGlu: 0.583 ± 0.024
0.545MetPhe: 0.545 ± 0.02
1.187MetGly: 1.187 ± 0.033
0.362MetHis: 0.362 ± 0.015
0.83MetIle: 0.83 ± 0.03
0.432MetLys: 0.432 ± 0.02
1.99MetLeu: 1.99 ± 0.04
0.288MetMet: 0.288 ± 0.017
0.496MetAsn: 0.496 ± 0.02
1.0MetPro: 1.0 ± 0.031
0.519MetGln: 0.519 ± 0.021
1.396MetArg: 1.396 ± 0.032
1.31MetSer: 1.31 ± 0.03
1.565MetThr: 1.565 ± 0.031
1.313MetVal: 1.313 ± 0.036
0.185MetTrp: 0.185 ± 0.011
0.268MetTyr: 0.268 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
2.828AsnAla: 2.828 ± 0.054
0.127AsnCys: 0.127 ± 0.01
1.279AsnAsp: 1.279 ± 0.033
1.0AsnGlu: 1.0 ± 0.031
0.717AsnPhe: 0.717 ± 0.03
2.274AsnGly: 2.274 ± 0.052
0.38AsnHis: 0.38 ± 0.021
0.907AsnIle: 0.907 ± 0.028
0.535AsnLys: 0.535 ± 0.022
2.038AsnLeu: 2.038 ± 0.047
0.318AsnMet: 0.318 ± 0.018
0.57AsnAsn: 0.57 ± 0.026
1.617AsnPro: 1.617 ± 0.04
0.691AsnGln: 0.691 ± 0.03
1.328AsnArg: 1.328 ± 0.037
1.117AsnSer: 1.117 ± 0.035
1.348AsnThr: 1.348 ± 0.04
1.844AsnVal: 1.844 ± 0.041
0.356AsnTrp: 0.356 ± 0.018
0.561AsnTyr: 0.561 ± 0.025
0.0AsnXaa: 0.0 ± 0.0
Pro
6.954ProAla: 6.954 ± 0.094
0.171ProCys: 0.171 ± 0.013
3.747ProAsp: 3.747 ± 0.054
3.369ProGlu: 3.369 ± 0.063
1.872ProPhe: 1.872 ± 0.037
4.846ProGly: 4.846 ± 0.081
0.995ProHis: 0.995 ± 0.028
2.307ProIle: 2.307 ± 0.047
1.158ProLys: 1.158 ± 0.032
4.825ProLeu: 4.825 ± 0.066
0.837ProMet: 0.837 ± 0.028
1.186ProAsn: 1.186 ± 0.036
2.162ProPro: 2.162 ± 0.06
1.499ProGln: 1.499 ± 0.039
2.892ProArg: 2.892 ± 0.059
3.472ProSer: 3.472 ± 0.064
3.49ProThr: 3.49 ± 0.054
4.877ProVal: 4.877 ± 0.078
0.815ProTrp: 0.815 ± 0.026
1.109ProTyr: 1.109 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.633GlnAla: 3.633 ± 0.061
0.081GlnCys: 0.081 ± 0.008
1.369GlnAsp: 1.369 ± 0.033
1.278GlnGlu: 1.278 ± 0.036
0.931GlnPhe: 0.931 ± 0.031
1.974GlnGly: 1.974 ± 0.041
0.617GlnHis: 0.617 ± 0.025
1.375GlnIle: 1.375 ± 0.033
0.856GlnLys: 0.856 ± 0.027
3.434GlnLeu: 3.434 ± 0.055
0.43GlnMet: 0.43 ± 0.019
0.841GlnAsn: 0.841 ± 0.025
1.669GlnPro: 1.669 ± 0.045
1.367GlnGln: 1.367 ± 0.039
2.274GlnArg: 2.274 ± 0.051
1.61GlnSer: 1.61 ± 0.045
1.72GlnThr: 1.72 ± 0.046
2.433GlnVal: 2.433 ± 0.045
0.46GlnTrp: 0.46 ± 0.019
0.689GlnTyr: 0.689 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
8.157ArgAla: 8.157 ± 0.095
0.256ArgCys: 0.256 ± 0.015
4.095ArgAsp: 4.095 ± 0.073
3.814ArgGlu: 3.814 ± 0.068
2.389ArgPhe: 2.389 ± 0.046
4.94ArgGly: 4.94 ± 0.069
1.338ArgHis: 1.338 ± 0.038
3.756ArgIle: 3.756 ± 0.062
1.466ArgLys: 1.466 ± 0.039
6.741ArgLeu: 6.741 ± 0.096
1.694ArgMet: 1.694 ± 0.043
1.374ArgAsn: 1.374 ± 0.034
3.32ArgPro: 3.32 ± 0.06
1.972ArgGln: 1.972 ± 0.043
5.569ArgArg: 5.569 ± 0.088
4.089ArgSer: 4.089 ± 0.068
3.98ArgThr: 3.98 ± 0.056
5.603ArgVal: 5.603 ± 0.075
1.078ArgTrp: 1.078 ± 0.034
1.522ArgTyr: 1.522 ± 0.036
0.001ArgXaa: 0.001 ± 0.001
Ser
7.539SerAla: 7.539 ± 0.1
0.224SerCys: 0.224 ± 0.014
3.297SerAsp: 3.297 ± 0.061
2.658SerGlu: 2.658 ± 0.051
2.071SerPhe: 2.071 ± 0.043
6.413SerGly: 6.413 ± 0.081
0.976SerHis: 0.976 ± 0.033
3.024SerIle: 3.024 ± 0.054
1.291SerLys: 1.291 ± 0.04
5.488SerLeu: 5.488 ± 0.072
1.114SerMet: 1.114 ± 0.032
1.247SerAsn: 1.247 ± 0.033
3.072SerPro: 3.072 ± 0.055
1.58SerGln: 1.58 ± 0.038
3.81SerArg: 3.81 ± 0.064
3.855SerSer: 3.855 ± 0.066
4.071SerThr: 4.071 ± 0.064
5.031SerVal: 5.031 ± 0.064
0.991SerTrp: 0.991 ± 0.029
1.297SerTyr: 1.297 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
8.482ThrAla: 8.482 ± 0.1
0.246ThrCys: 0.246 ± 0.014
3.57ThrAsp: 3.57 ± 0.062
2.989ThrGlu: 2.989 ± 0.055
2.09ThrPhe: 2.09 ± 0.046
5.954ThrGly: 5.954 ± 0.083
1.068ThrHis: 1.068 ± 0.027
3.351ThrIle: 3.351 ± 0.051
1.482ThrLys: 1.482 ± 0.041
5.895ThrLeu: 5.895 ± 0.073
0.974ThrMet: 0.974 ± 0.028
1.3ThrAsn: 1.3 ± 0.04
3.976ThrPro: 3.976 ± 0.066
1.591ThrGln: 1.591 ± 0.041
3.501ThrArg: 3.501 ± 0.063
3.597ThrSer: 3.597 ± 0.062
4.141ThrThr: 4.141 ± 0.083
6.336ThrVal: 6.336 ± 0.083
0.889ThrTrp: 0.889 ± 0.028
1.225ThrTyr: 1.225 ± 0.039
0.0ThrXaa: 0.0 ± 0.0
Val
12.205ValAla: 12.205 ± 0.134
0.499ValCys: 0.499 ± 0.023
5.801ValAsp: 5.801 ± 0.07
4.487ValGlu: 4.487 ± 0.079
3.095ValPhe: 3.095 ± 0.057
7.423ValGly: 7.423 ± 0.089
1.564ValHis: 1.564 ± 0.043
4.397ValIle: 4.397 ± 0.061
1.819ValLys: 1.819 ± 0.047
9.398ValLeu: 9.398 ± 0.109
1.372ValMet: 1.372 ± 0.037
2.043ValAsn: 2.043 ± 0.045
4.67ValPro: 4.67 ± 0.067
2.288ValGln: 2.288 ± 0.044
5.521ValArg: 5.521 ± 0.082
5.437ValSer: 5.437 ± 0.072
6.111ValThr: 6.111 ± 0.088
9.15ValVal: 9.15 ± 0.106
1.174ValTrp: 1.174 ± 0.033
1.701ValTyr: 1.701 ± 0.043
0.0ValXaa: 0.0 ± 0.0
Trp
1.564TrpAla: 1.564 ± 0.038
0.095TrpCys: 0.095 ± 0.01
0.742TrpAsp: 0.742 ± 0.026
0.629TrpGlu: 0.629 ± 0.023
0.638TrpPhe: 0.638 ± 0.025
1.086TrpGly: 1.086 ± 0.032
0.332TrpHis: 0.332 ± 0.016
0.691TrpIle: 0.691 ± 0.026
0.376TrpLys: 0.376 ± 0.019
1.876TrpLeu: 1.876 ± 0.045
0.369TrpMet: 0.369 ± 0.019
0.488TrpAsn: 0.488 ± 0.024
0.717TrpPro: 0.717 ± 0.024
0.592TrpGln: 0.592 ± 0.026
1.173TrpArg: 1.173 ± 0.036
0.924TrpSer: 0.924 ± 0.029
0.932TrpThr: 0.932 ± 0.03
1.112TrpVal: 1.112 ± 0.031
0.359TrpTrp: 0.359 ± 0.02
0.292TrpTyr: 0.292 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.523TyrAla: 2.523 ± 0.055
0.113TyrCys: 0.113 ± 0.009
1.32TyrAsp: 1.32 ± 0.037
1.078TyrGlu: 1.078 ± 0.033
0.824TyrPhe: 0.824 ± 0.026
1.995TyrGly: 1.995 ± 0.048
0.292TyrHis: 0.292 ± 0.017
0.715TyrIle: 0.715 ± 0.025
0.381TyrLys: 0.381 ± 0.019
2.37TyrLeu: 2.37 ± 0.05
0.238TyrMet: 0.238 ± 0.014
0.508TyrAsn: 0.508 ± 0.022
1.066TyrPro: 1.066 ± 0.029
0.691TyrGln: 0.691 ± 0.024
1.583TyrArg: 1.583 ± 0.034
1.217TyrSer: 1.217 ± 0.035
1.296TyrThr: 1.296 ± 0.04
1.67TyrVal: 1.67 ± 0.042
0.336TyrTrp: 0.336 ± 0.018
0.503TyrTyr: 0.503 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3674 proteins (1184469 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski