Amino acid dipepetide frequency for Pseudogracilibacillus auburnensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.784AlaAla: 4.784 ± 0.073
0.619AlaCys: 0.619 ± 0.022
3.156AlaAsp: 3.156 ± 0.077
4.269AlaGlu: 4.269 ± 0.076
3.25AlaPhe: 3.25 ± 0.058
4.758AlaGly: 4.758 ± 0.075
1.212AlaHis: 1.212 ± 0.032
6.336AlaIle: 6.336 ± 0.07
4.373AlaLys: 4.373 ± 0.07
6.618AlaLeu: 6.618 ± 0.074
1.883AlaMet: 1.883 ± 0.038
3.0AlaAsn: 3.0 ± 0.047
1.787AlaPro: 1.787 ± 0.037
1.999AlaGln: 1.999 ± 0.045
2.327AlaArg: 2.327 ± 0.045
3.792AlaSer: 3.792 ± 0.063
3.959AlaThr: 3.959 ± 0.125
4.617AlaVal: 4.617 ± 0.065
0.541AlaTrp: 0.541 ± 0.019
2.276AlaTyr: 2.276 ± 0.044
0.0AlaXaa: 0.0 ± 0.0
Cys
0.446CysAla: 0.446 ± 0.017
0.089CysCys: 0.089 ± 0.009
0.342CysAsp: 0.342 ± 0.017
0.42CysGlu: 0.42 ± 0.02
0.305CysPhe: 0.305 ± 0.017
0.618CysGly: 0.618 ± 0.026
0.234CysHis: 0.234 ± 0.016
0.546CysIle: 0.546 ± 0.024
0.388CysLys: 0.388 ± 0.018
0.646CysLeu: 0.646 ± 0.024
0.176CysMet: 0.176 ± 0.012
0.296CysAsn: 0.296 ± 0.016
0.304CysPro: 0.304 ± 0.016
0.21CysGln: 0.21 ± 0.012
0.246CysArg: 0.246 ± 0.014
0.481CysSer: 0.481 ± 0.018
0.451CysThr: 0.451 ± 0.022
0.407CysVal: 0.407 ± 0.02
0.058CysTrp: 0.058 ± 0.006
0.237CysTyr: 0.237 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.183AspAla: 3.183 ± 0.078
0.299AspCys: 0.299 ± 0.018
2.711AspAsp: 2.711 ± 0.058
4.772AspGlu: 4.772 ± 0.078
2.314AspPhe: 2.314 ± 0.041
3.411AspGly: 3.411 ± 0.064
1.337AspHis: 1.337 ± 0.038
4.449AspIle: 4.449 ± 0.063
3.312AspLys: 3.312 ± 0.054
4.787AspLeu: 4.787 ± 0.06
1.343AspMet: 1.343 ± 0.032
1.935AspAsn: 1.935 ± 0.044
2.047AspPro: 2.047 ± 0.056
1.985AspGln: 1.985 ± 0.039
2.114AspArg: 2.114 ± 0.048
2.52AspSer: 2.52 ± 0.052
2.449AspThr: 2.449 ± 0.051
3.846AspVal: 3.846 ± 0.054
0.581AspTrp: 0.581 ± 0.025
2.079AspTyr: 2.079 ± 0.044
0.001AspXaa: 0.001 ± 0.001
Glu
5.042GluAla: 5.042 ± 0.073
0.337GluCys: 0.337 ± 0.018
3.909GluAsp: 3.909 ± 0.071
7.182GluGlu: 7.182 ± 0.097
2.441GluPhe: 2.441 ± 0.045
4.046GluGly: 4.046 ± 0.057
1.575GluHis: 1.575 ± 0.035
6.036GluIle: 6.036 ± 0.073
7.195GluLys: 7.195 ± 0.084
6.934GluLeu: 6.934 ± 0.089
2.492GluMet: 2.492 ± 0.044
4.128GluAsn: 4.128 ± 0.066
1.962GluPro: 1.962 ± 0.059
3.547GluGln: 3.547 ± 0.064
3.248GluArg: 3.248 ± 0.059
3.496GluSer: 3.496 ± 0.059
4.005GluThr: 4.005 ± 0.062
5.218GluVal: 5.218 ± 0.075
0.744GluTrp: 0.744 ± 0.026
2.167GluTyr: 2.167 ± 0.048
0.0GluXaa: 0.0 ± 0.0
Phe
2.988PheAla: 2.988 ± 0.049
0.339PheCys: 0.339 ± 0.017
2.459PheAsp: 2.459 ± 0.045
2.777PheGlu: 2.777 ± 0.049
2.527PhePhe: 2.527 ± 0.061
3.316PheGly: 3.316 ± 0.06
1.11PheHis: 1.11 ± 0.03
4.96PheIle: 4.96 ± 0.088
2.398PheLys: 2.398 ± 0.042
4.854PheLeu: 4.854 ± 0.079
1.281PheMet: 1.281 ± 0.032
1.994PheAsn: 1.994 ± 0.037
1.664PhePro: 1.664 ± 0.04
1.655PheGln: 1.655 ± 0.035
1.438PheArg: 1.438 ± 0.037
3.267PheSer: 3.267 ± 0.046
2.946PheThr: 2.946 ± 0.05
3.139PheVal: 3.139 ± 0.059
0.498PheTrp: 0.498 ± 0.02
1.875PheTyr: 1.875 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.774GlyAla: 4.774 ± 0.128
0.582GlyCys: 0.582 ± 0.026
2.987GlyAsp: 2.987 ± 0.064
4.556GlyGlu: 4.556 ± 0.081
3.348GlyPhe: 3.348 ± 0.058
4.485GlyGly: 4.485 ± 0.066
1.345GlyHis: 1.345 ± 0.037
6.144GlyIle: 6.144 ± 0.079
4.874GlyLys: 4.874 ± 0.071
6.178GlyLeu: 6.178 ± 0.071
2.103GlyMet: 2.103 ± 0.042
2.843GlyAsn: 2.843 ± 0.046
1.629GlyPro: 1.629 ± 0.044
1.841GlyGln: 1.841 ± 0.041
2.303GlyArg: 2.303 ± 0.049
3.747GlySer: 3.747 ± 0.057
3.974GlyThr: 3.974 ± 0.058
4.81GlyVal: 4.81 ± 0.064
0.71GlyTrp: 0.71 ± 0.025
2.629GlyTyr: 2.629 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.403HisAla: 1.403 ± 0.035
0.205HisCys: 0.205 ± 0.014
1.133HisAsp: 1.133 ± 0.029
1.49HisGlu: 1.49 ± 0.036
1.238HisPhe: 1.238 ± 0.032
1.384HisGly: 1.384 ± 0.034
0.808HisHis: 0.808 ± 0.031
1.888HisIle: 1.888 ± 0.038
1.082HisLys: 1.082 ± 0.027
2.302HisLeu: 2.302 ± 0.047
0.594HisMet: 0.594 ± 0.022
0.783HisAsn: 0.783 ± 0.027
1.124HisPro: 1.124 ± 0.028
0.854HisGln: 0.854 ± 0.029
0.862HisArg: 0.862 ± 0.028
1.212HisSer: 1.212 ± 0.03
1.195HisThr: 1.195 ± 0.031
1.582HisVal: 1.582 ± 0.041
0.23HisTrp: 0.23 ± 0.013
0.928HisTyr: 0.928 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.565IleAla: 6.565 ± 0.072
0.73IleCys: 0.73 ± 0.027
5.024IleAsp: 5.024 ± 0.068
6.112IleGlu: 6.112 ± 0.08
4.046IlePhe: 4.046 ± 0.075
6.609IleGly: 6.609 ± 0.087
2.06IleHis: 2.06 ± 0.05
7.959IleIle: 7.959 ± 0.117
4.996IleLys: 4.996 ± 0.065
7.719IleLeu: 7.719 ± 0.098
2.096IleMet: 2.096 ± 0.043
3.948IleAsn: 3.948 ± 0.058
3.728IlePro: 3.728 ± 0.058
3.018IleGln: 3.018 ± 0.056
3.035IleArg: 3.035 ± 0.048
5.409IleSer: 5.409 ± 0.064
5.126IleThr: 5.126 ± 0.075
6.461IleVal: 6.461 ± 0.085
0.69IleTrp: 0.69 ± 0.027
2.821IleTyr: 2.821 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
4.066LysAla: 4.066 ± 0.073
0.31LysCys: 0.31 ± 0.017
3.684LysAsp: 3.684 ± 0.053
7.17LysGlu: 7.17 ± 0.081
1.904LysPhe: 1.904 ± 0.034
4.154LysGly: 4.154 ± 0.069
1.39LysHis: 1.39 ± 0.036
5.121LysIle: 5.121 ± 0.067
5.837LysLys: 5.837 ± 0.075
5.893LysLeu: 5.893 ± 0.074
2.371LysMet: 2.371 ± 0.041
3.827LysAsn: 3.827 ± 0.05
1.901LysPro: 1.901 ± 0.045
3.243LysGln: 3.243 ± 0.063
3.301LysArg: 3.301 ± 0.05
3.588LysSer: 3.588 ± 0.046
3.717LysThr: 3.717 ± 0.061
4.631LysVal: 4.631 ± 0.07
0.755LysTrp: 0.755 ± 0.026
2.259LysTyr: 2.259 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
6.774LeuAla: 6.774 ± 0.085
0.619LeuCys: 0.619 ± 0.026
4.564LeuAsp: 4.564 ± 0.067
6.34LeuGlu: 6.34 ± 0.079
5.418LeuPhe: 5.418 ± 0.09
6.131LeuGly: 6.131 ± 0.081
2.315LeuHis: 2.315 ± 0.045
7.981LeuIle: 7.981 ± 0.092
6.151LeuLys: 6.151 ± 0.076
10.213LeuLeu: 10.213 ± 0.14
2.517LeuMet: 2.517 ± 0.053
4.13LeuAsn: 4.13 ± 0.061
3.781LeuPro: 3.781 ± 0.06
3.783LeuGln: 3.783 ± 0.066
3.233LeuArg: 3.233 ± 0.052
6.396LeuSer: 6.396 ± 0.076
5.906LeuThr: 5.906 ± 0.073
5.917LeuVal: 5.917 ± 0.076
0.74LeuTrp: 0.74 ± 0.026
3.175LeuTyr: 3.175 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
1.81MetAla: 1.81 ± 0.037
0.137MetCys: 0.137 ± 0.011
1.528MetAsp: 1.528 ± 0.038
2.244MetGlu: 2.244 ± 0.038
1.206MetPhe: 1.206 ± 0.034
1.676MetGly: 1.676 ± 0.042
0.496MetHis: 0.496 ± 0.022
2.661MetIle: 2.661 ± 0.048
2.595MetLys: 2.595 ± 0.043
2.56MetLeu: 2.56 ± 0.049
0.964MetMet: 0.964 ± 0.028
1.755MetAsn: 1.755 ± 0.041
0.897MetPro: 0.897 ± 0.026
0.907MetGln: 0.907 ± 0.025
1.068MetArg: 1.068 ± 0.027
1.665MetSer: 1.665 ± 0.031
1.72MetThr: 1.72 ± 0.035
1.718MetVal: 1.718 ± 0.034
0.204MetTrp: 0.204 ± 0.013
0.861MetTyr: 0.861 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
2.585AsnAla: 2.585 ± 0.049
0.317AsnCys: 0.317 ± 0.015
2.757AsnAsp: 2.757 ± 0.052
4.276AsnGlu: 4.276 ± 0.07
1.736AsnPhe: 1.736 ± 0.04
3.133AsnGly: 3.133 ± 0.05
1.168AsnHis: 1.168 ± 0.033
4.017AsnIle: 4.017 ± 0.064
3.425AsnLys: 3.425 ± 0.06
3.853AsnLeu: 3.853 ± 0.06
1.37AsnMet: 1.37 ± 0.032
2.454AsnAsn: 2.454 ± 0.053
2.004AsnPro: 2.004 ± 0.041
1.867AsnGln: 1.867 ± 0.045
2.027AsnArg: 2.027 ± 0.043
2.219AsnSer: 2.219 ± 0.048
2.221AsnThr: 2.221 ± 0.044
3.335AsnVal: 3.335 ± 0.049
0.536AsnTrp: 0.536 ± 0.022
1.65AsnTyr: 1.65 ± 0.04
0.0AsnXaa: 0.0 ± 0.0
Pro
2.004ProAla: 2.004 ± 0.047
0.186ProCys: 0.186 ± 0.013
1.748ProAsp: 1.748 ± 0.042
2.626ProGlu: 2.626 ± 0.056
1.988ProPhe: 1.988 ± 0.04
2.045ProGly: 2.045 ± 0.062
0.798ProHis: 0.798 ± 0.027
3.244ProIle: 3.244 ± 0.053
2.205ProLys: 2.205 ± 0.054
3.348ProLeu: 3.348 ± 0.046
0.79ProMet: 0.79 ± 0.026
1.856ProAsn: 1.856 ± 0.041
0.957ProPro: 0.957 ± 0.031
0.912ProGln: 0.912 ± 0.026
0.973ProArg: 0.973 ± 0.028
2.081ProSer: 2.081 ± 0.039
2.145ProThr: 2.145 ± 0.042
2.572ProVal: 2.572 ± 0.042
0.367ProTrp: 0.367 ± 0.017
1.39ProTyr: 1.39 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
2.495GlnAla: 2.495 ± 0.048
0.188GlnCys: 0.188 ± 0.012
1.582GlnAsp: 1.582 ± 0.041
2.578GlnGlu: 2.578 ± 0.05
1.737GlnPhe: 1.737 ± 0.035
1.879GlnGly: 1.879 ± 0.034
0.774GlnHis: 0.774 ± 0.026
3.008GlnIle: 3.008 ± 0.053
2.59GlnLys: 2.59 ± 0.052
4.08GlnLeu: 4.08 ± 0.063
1.155GlnMet: 1.155 ± 0.029
1.531GlnAsn: 1.531 ± 0.033
1.044GlnPro: 1.044 ± 0.029
1.679GlnGln: 1.679 ± 0.04
1.283GlnArg: 1.283 ± 0.032
2.062GlnSer: 2.062 ± 0.042
1.99GlnThr: 1.99 ± 0.044
2.319GlnVal: 2.319 ± 0.043
0.339GlnTrp: 0.339 ± 0.018
1.276GlnTyr: 1.276 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
2.262ArgAla: 2.262 ± 0.047
0.285ArgCys: 0.285 ± 0.018
1.821ArgAsp: 1.821 ± 0.037
2.922ArgGlu: 2.922 ± 0.049
1.795ArgPhe: 1.795 ± 0.04
2.094ArgGly: 2.094 ± 0.048
0.782ArgHis: 0.782 ± 0.024
3.029ArgIle: 3.029 ± 0.049
3.205ArgLys: 3.205 ± 0.049
3.645ArgLeu: 3.645 ± 0.063
1.233ArgMet: 1.233 ± 0.034
1.872ArgAsn: 1.872 ± 0.039
1.188ArgPro: 1.188 ± 0.031
1.313ArgGln: 1.313 ± 0.032
1.672ArgArg: 1.672 ± 0.044
2.007ArgSer: 2.007 ± 0.046
2.047ArgThr: 2.047 ± 0.046
2.4ArgVal: 2.4 ± 0.048
0.339ArgTrp: 0.339 ± 0.016
1.37ArgTyr: 1.37 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
3.339SerAla: 3.339 ± 0.052
0.371SerCys: 0.371 ± 0.019
2.776SerAsp: 2.776 ± 0.052
3.778SerGlu: 3.778 ± 0.05
3.423SerPhe: 3.423 ± 0.06
4.143SerGly: 4.143 ± 0.065
1.223SerHis: 1.223 ± 0.032
5.524SerIle: 5.524 ± 0.077
3.595SerLys: 3.595 ± 0.059
5.978SerLeu: 5.978 ± 0.084
1.692SerMet: 1.692 ± 0.038
2.696SerAsn: 2.696 ± 0.045
1.932SerPro: 1.932 ± 0.038
1.567SerGln: 1.567 ± 0.036
2.098SerArg: 2.098 ± 0.042
3.802SerSer: 3.802 ± 0.066
3.141SerThr: 3.141 ± 0.055
4.008SerVal: 4.008 ± 0.059
0.605SerTrp: 0.605 ± 0.023
2.204SerTyr: 2.204 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
3.697ThrAla: 3.697 ± 0.053
0.384ThrCys: 0.384 ± 0.018
2.968ThrAsp: 2.968 ± 0.061
3.829ThrGlu: 3.829 ± 0.06
3.106ThrPhe: 3.106 ± 0.053
4.193ThrGly: 4.193 ± 0.129
1.037ThrHis: 1.037 ± 0.027
5.549ThrIle: 5.549 ± 0.081
3.675ThrLys: 3.675 ± 0.056
5.422ThrLeu: 5.422 ± 0.071
1.436ThrMet: 1.436 ± 0.032
2.866ThrAsn: 2.866 ± 0.047
2.147ThrPro: 2.147 ± 0.044
1.26ThrGln: 1.26 ± 0.03
1.761ThrArg: 1.761 ± 0.035
3.312ThrSer: 3.312 ± 0.046
3.129ThrThr: 3.129 ± 0.054
4.263ThrVal: 4.263 ± 0.059
0.526ThrTrp: 0.526 ± 0.023
2.119ThrTyr: 2.119 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
4.697ValAla: 4.697 ± 0.074
0.544ValCys: 0.544 ± 0.02
3.785ValAsp: 3.785 ± 0.066
4.948ValGlu: 4.948 ± 0.075
3.25ValPhe: 3.25 ± 0.055
4.698ValGly: 4.698 ± 0.064
1.518ValHis: 1.518 ± 0.035
6.072ValIle: 6.072 ± 0.07
4.492ValLys: 4.492 ± 0.058
6.597ValLeu: 6.597 ± 0.076
1.948ValMet: 1.948 ± 0.044
3.107ValAsn: 3.107 ± 0.056
2.513ValPro: 2.513 ± 0.039
2.338ValGln: 2.338 ± 0.046
2.398ValArg: 2.398 ± 0.045
4.326ValSer: 4.326 ± 0.064
4.221ValThr: 4.221 ± 0.066
4.881ValVal: 4.881 ± 0.063
0.611ValTrp: 0.611 ± 0.023
2.221ValTyr: 2.221 ± 0.044
0.0ValXaa: 0.0 ± 0.0
Trp
0.544TrpAla: 0.544 ± 0.021
0.051TrpCys: 0.051 ± 0.005
0.498TrpAsp: 0.498 ± 0.021
0.672TrpGlu: 0.672 ± 0.025
0.49TrpPhe: 0.49 ± 0.021
0.651TrpGly: 0.651 ± 0.024
0.201TrpHis: 0.201 ± 0.015
0.885TrpIle: 0.885 ± 0.026
0.718TrpLys: 0.718 ± 0.027
1.06TrpLeu: 1.06 ± 0.036
0.324TrpMet: 0.324 ± 0.016
0.529TrpAsn: 0.529 ± 0.021
0.227TrpPro: 0.227 ± 0.014
0.346TrpGln: 0.346 ± 0.019
0.355TrpArg: 0.355 ± 0.016
0.501TrpSer: 0.501 ± 0.021
0.475TrpThr: 0.475 ± 0.02
0.589TrpVal: 0.589 ± 0.02
0.138TrpTrp: 0.138 ± 0.011
0.334TrpTyr: 0.334 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.099TyrAla: 2.099 ± 0.043
0.309TyrCys: 0.309 ± 0.016
2.045TyrAsp: 2.045 ± 0.044
2.743TyrGlu: 2.743 ± 0.052
1.964TyrPhe: 1.964 ± 0.051
2.436TyrGly: 2.436 ± 0.047
0.914TyrHis: 0.914 ± 0.032
2.746TyrIle: 2.746 ± 0.049
2.015TyrLys: 2.015 ± 0.04
3.398TyrLeu: 3.398 ± 0.059
0.904TyrMet: 0.904 ± 0.025
1.399TyrAsn: 1.399 ± 0.037
1.399TyrPro: 1.399 ± 0.039
1.266TyrGln: 1.266 ± 0.032
1.538TyrArg: 1.538 ± 0.037
2.01TyrSer: 2.01 ± 0.046
1.863TyrThr: 1.863 ± 0.038
2.448TyrVal: 2.448 ± 0.047
0.376TyrTrp: 0.376 ± 0.017
1.45TyrTyr: 1.45 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.002XaaXaa: 0.002 ± 0.002
Statistics based on 4444 proteins (1279137 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski