Amino acid dipeptide frequency for Nocardioides gansuensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
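As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function names are hypothetical, and the treatment of non-standard residues (mapped to X for Xaa) and of protein boundaries (pairs never span two proteins) are assumptions.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(seq, seq[1:]) yields every pair of neighbouring residues.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: MAAL gives MA, AA, AL; AAG gives AA, AG (5 pairs in total).
freqs = dipeptide_per_mille(["MAAL", "AAG"])
```

With this toy input, AA accounts for 2 of the 5 pairs, so its frequency is 400 per mille; on a real proteome the values would be on the scale shown in the table.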

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
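The arithmetic behind the uniform expectation and the per mille scale can be sketched as follows. The three observed values are copied from the table below; the function names are hypothetical, and using a plain greater-than or less-than comparison against the uniform baseline is an assumption about how the red and blue marking is decided.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def uniform_expectation_per_mille(n_residues=N_STANDARD):
    """Frequency of each dipeptide if all n*n pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if value_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if value_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille()  # 2.5 per mille, i.e. 0.25 %

# Values in per mille, taken from the table.
observed = {"AlaAla": 19.296, "CysCys": 0.103, "AspThr": 2.946}

labels = {pair: classify(v, expected) for pair, v in observed.items()}
# Multiply by 10^-3 to turn per mille into plain fractions.
fractions = {pair: v * 1e-3 for pair, v in observed.items()}
```

Under these assumptions AlaAla and AspThr come out as overrepresented and CysCys as underrepresented; with Xaa included, the baseline drops to 1000/441, roughly 2.27‰.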

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
19.296AlaAla: 19.296 ± 0.178
1.072AlaCys: 1.072 ± 0.03
8.192AlaAsp: 8.192 ± 0.082
8.404AlaGlu: 8.404 ± 0.097
3.595AlaPhe: 3.595 ± 0.051
12.27AlaGly: 12.27 ± 0.105
2.704AlaHis: 2.704 ± 0.047
4.294AlaIle: 4.294 ± 0.061
2.414AlaLys: 2.414 ± 0.057
13.432AlaLeu: 13.432 ± 0.129
2.799AlaMet: 2.799 ± 0.04
1.99AlaAsn: 1.99 ± 0.045
6.078AlaPro: 6.078 ± 0.08
3.473AlaGln: 3.473 ± 0.05
9.859AlaArg: 9.859 ± 0.108
6.534AlaSer: 6.534 ± 0.071
7.494AlaThr: 7.494 ± 0.097
11.342AlaVal: 11.342 ± 0.117
2.061AlaTrp: 2.061 ± 0.039
2.519AlaTyr: 2.519 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
0.917CysAla: 0.917 ± 0.027
0.103CysCys: 0.103 ± 0.008
0.528CysAsp: 0.528 ± 0.017
0.43CysGlu: 0.43 ± 0.019
0.241CysPhe: 0.241 ± 0.014
0.9CysGly: 0.9 ± 0.029
0.205CysHis: 0.205 ± 0.012
0.221CysIle: 0.221 ± 0.013
0.111CysLys: 0.111 ± 0.009
0.696CysLeu: 0.696 ± 0.023
0.124CysMet: 0.124 ± 0.01
0.126CysAsn: 0.126 ± 0.011
0.45CysPro: 0.45 ± 0.018
0.185CysGln: 0.185 ± 0.012
0.622CysArg: 0.622 ± 0.026
0.489CysSer: 0.489 ± 0.02
0.506CysThr: 0.506 ± 0.026
0.637CysVal: 0.637 ± 0.02
0.133CysTrp: 0.133 ± 0.014
0.16CysTyr: 0.16 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.642AspAla: 7.642 ± 0.083
0.412AspCys: 0.412 ± 0.016
4.353AspAsp: 4.353 ± 0.061
4.478AspGlu: 4.478 ± 0.062
1.763AspPhe: 1.763 ± 0.044
6.301AspGly: 6.301 ± 0.095
1.571AspHis: 1.571 ± 0.036
1.951AspIle: 1.951 ± 0.044
1.132AspLys: 1.132 ± 0.034
7.098AspLeu: 7.098 ± 0.078
0.894AspMet: 0.894 ± 0.024
1.067AspAsn: 1.067 ± 0.034
4.655AspPro: 4.655 ± 0.064
1.929AspGln: 1.929 ± 0.035
4.861AspArg: 4.861 ± 0.071
2.587AspSer: 2.587 ± 0.043
2.946AspThr: 2.946 ± 0.051
6.118AspVal: 6.118 ± 0.067
1.034AspTrp: 1.034 ± 0.029
1.238AspTyr: 1.238 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
7.762GluAla: 7.762 ± 0.099
0.414GluCys: 0.414 ± 0.022
3.202GluAsp: 3.202 ± 0.05
3.763GluGlu: 3.763 ± 0.065
1.586GluPhe: 1.586 ± 0.044
4.791GluGly: 4.791 ± 0.063
1.76GluHis: 1.76 ± 0.039
2.497GluIle: 2.497 ± 0.049
1.303GluLys: 1.303 ± 0.032
6.902GluLeu: 6.902 ± 0.08
1.08GluMet: 1.08 ± 0.028
0.83GluAsn: 0.83 ± 0.025
3.307GluPro: 3.307 ± 0.055
2.369GluGln: 2.369 ± 0.044
5.182GluArg: 5.182 ± 0.082
2.954GluSer: 2.954 ± 0.042
3.048GluThr: 3.048 ± 0.049
5.682GluVal: 5.682 ± 0.066
0.918GluTrp: 0.918 ± 0.025
0.927GluTyr: 0.927 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.632PheAla: 3.632 ± 0.055
0.282PheCys: 0.282 ± 0.014
2.115PheAsp: 2.115 ± 0.04
1.734PheGlu: 1.734 ± 0.039
0.913PhePhe: 0.913 ± 0.03
3.07PheGly: 3.07 ± 0.052
0.645PheHis: 0.645 ± 0.021
0.809PheIle: 0.809 ± 0.025
0.465PheLys: 0.465 ± 0.018
2.695PheLeu: 2.695 ± 0.052
0.426PheMet: 0.426 ± 0.017
0.585PheAsn: 0.585 ± 0.024
1.347PhePro: 1.347 ± 0.031
0.657PheGln: 0.657 ± 0.025
1.849PheArg: 1.849 ± 0.038
1.46PheSer: 1.46 ± 0.037
1.91PheThr: 1.91 ± 0.044
2.649PheVal: 2.649 ± 0.045
0.435PheTrp: 0.435 ± 0.019
0.618PheTyr: 0.618 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
10.197GlyAla: 10.197 ± 0.096
0.846GlyCys: 0.846 ± 0.025
5.593GlyAsp: 5.593 ± 0.07
5.38GlyGlu: 5.38 ± 0.064
3.021GlyPhe: 3.021 ± 0.044
7.937GlyGly: 7.937 ± 0.099
2.266GlyHis: 2.266 ± 0.047
3.747GlyIle: 3.747 ± 0.059
2.146GlyLys: 2.146 ± 0.048
9.544GlyLeu: 9.544 ± 0.101
2.058GlyMet: 2.058 ± 0.04
1.757GlyAsn: 1.757 ± 0.049
4.436GlyPro: 4.436 ± 0.063
2.703GlyGln: 2.703 ± 0.045
7.303GlyArg: 7.303 ± 0.078
5.432GlySer: 5.432 ± 0.062
5.49GlyThr: 5.49 ± 0.079
8.05GlyVal: 8.05 ± 0.081
1.748GlyTrp: 1.748 ± 0.038
2.078GlyTyr: 2.078 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
2.715HisAla: 2.715 ± 0.054
0.202HisCys: 0.202 ± 0.012
1.571HisAsp: 1.571 ± 0.04
1.439HisGlu: 1.439 ± 0.041
0.615HisPhe: 0.615 ± 0.023
2.387HisGly: 2.387 ± 0.043
0.747HisHis: 0.747 ± 0.027
0.613HisIle: 0.613 ± 0.022
0.355HisLys: 0.355 ± 0.015
2.492HisLeu: 2.492 ± 0.052
0.36HisMet: 0.36 ± 0.016
0.362HisAsn: 0.362 ± 0.017
1.704HisPro: 1.704 ± 0.039
0.657HisGln: 0.657 ± 0.023
1.935HisArg: 1.935 ± 0.043
0.945HisSer: 0.945 ± 0.027
1.221HisThr: 1.221 ± 0.032
2.178HisVal: 2.178 ± 0.038
0.357HisTrp: 0.357 ± 0.016
0.488HisTyr: 0.488 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
5.016IleAla: 5.016 ± 0.063
0.299IleCys: 0.299 ± 0.016
2.652IleAsp: 2.652 ± 0.052
2.431IleGlu: 2.431 ± 0.043
0.83IlePhe: 0.83 ± 0.026
3.549IleGly: 3.549 ± 0.056
0.698IleHis: 0.698 ± 0.024
1.011IleIle: 1.011 ± 0.032
0.736IleLys: 0.736 ± 0.023
2.619IleLeu: 2.619 ± 0.048
0.462IleMet: 0.462 ± 0.02
0.748IleAsn: 0.748 ± 0.024
1.786IlePro: 1.786 ± 0.033
0.798IleGln: 0.798 ± 0.023
2.317IleArg: 2.317 ± 0.044
1.881IleSer: 1.881 ± 0.041
2.239IleThr: 2.239 ± 0.042
3.107IleVal: 3.107 ± 0.05
0.403IleTrp: 0.403 ± 0.017
0.62IleTyr: 0.62 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
2.593LysAla: 2.593 ± 0.054
0.107LysCys: 0.107 ± 0.009
1.12LysAsp: 1.12 ± 0.036
1.084LysGlu: 1.084 ± 0.033
0.467LysPhe: 0.467 ± 0.019
1.586LysGly: 1.586 ± 0.041
0.436LysHis: 0.436 ± 0.015
0.805LysIle: 0.805 ± 0.026
0.65LysLys: 0.65 ± 0.03
1.673LysLeu: 1.673 ± 0.039
0.356LysMet: 0.356 ± 0.015
0.402LysAsn: 0.402 ± 0.02
1.183LysPro: 1.183 ± 0.028
0.656LysGln: 0.656 ± 0.022
1.342LysArg: 1.342 ± 0.035
1.088LysSer: 1.088 ± 0.029
1.159LysThr: 1.159 ± 0.033
1.975LysVal: 1.975 ± 0.045
0.218LysTrp: 0.218 ± 0.013
0.421LysTyr: 0.421 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
15.267LeuAla: 15.267 ± 0.142
0.72LeuCys: 0.72 ± 0.026
7.295LeuAsp: 7.295 ± 0.074
5.845LeuGlu: 5.845 ± 0.078
2.458LeuPhe: 2.458 ± 0.053
9.717LeuGly: 9.717 ± 0.09
2.206LeuHis: 2.206 ± 0.043
2.994LeuIle: 2.994 ± 0.058
1.709LeuLys: 1.709 ± 0.044
10.692LeuLeu: 10.692 ± 0.131
1.738LeuMet: 1.738 ± 0.036
1.61LeuAsn: 1.61 ± 0.037
5.763LeuPro: 5.763 ± 0.072
2.424LeuGln: 2.424 ± 0.039
7.77LeuArg: 7.77 ± 0.086
5.23LeuSer: 5.23 ± 0.056
6.161LeuThr: 6.161 ± 0.072
10.455LeuVal: 10.455 ± 0.094
1.204LeuTrp: 1.204 ± 0.034
1.509LeuTyr: 1.509 ± 0.036
0.0LeuXaa: 0.0 ± 0.0
Met
2.363MetAla: 2.363 ± 0.04
0.157MetCys: 0.157 ± 0.011
0.948MetAsp: 0.948 ± 0.025
0.815MetGlu: 0.815 ± 0.028
0.523MetPhe: 0.523 ± 0.02
1.452MetGly: 1.452 ± 0.03
0.382MetHis: 0.382 ± 0.018
0.654MetIle: 0.654 ± 0.02
0.463MetLys: 0.463 ± 0.018
1.951MetLeu: 1.951 ± 0.036
0.343MetMet: 0.343 ± 0.016
0.366MetAsn: 0.366 ± 0.017
1.152MetPro: 1.152 ± 0.03
0.49MetGln: 0.49 ± 0.019
1.529MetArg: 1.529 ± 0.033
1.443MetSer: 1.443 ± 0.032
1.685MetThr: 1.685 ± 0.033
1.609MetVal: 1.609 ± 0.032
0.221MetTrp: 0.221 ± 0.012
0.274MetTyr: 0.274 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.159AsnAla: 2.159 ± 0.045
0.143AsnCys: 0.143 ± 0.011
1.059AsnAsp: 1.059 ± 0.03
0.905AsnGlu: 0.905 ± 0.023
0.486AsnPhe: 0.486 ± 0.018
1.651AsnGly: 1.651 ± 0.042
0.388AsnHis: 0.388 ± 0.017
0.686AsnIle: 0.686 ± 0.025
0.364AsnLys: 0.364 ± 0.017
1.691AsnLeu: 1.691 ± 0.038
0.283AsnMet: 0.283 ± 0.014
0.441AsnAsn: 0.441 ± 0.022
1.366AsnPro: 1.366 ± 0.036
0.513AsnGln: 0.513 ± 0.02
1.172AsnArg: 1.172 ± 0.03
0.807AsnSer: 0.807 ± 0.027
1.029AsnThr: 1.029 ± 0.034
1.428AsnVal: 1.428 ± 0.033
0.277AsnTrp: 0.277 ± 0.014
0.386AsnTyr: 0.386 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
7.394ProAla: 7.394 ± 0.093
0.329ProCys: 0.329 ± 0.018
4.334ProAsp: 4.334 ± 0.058
4.129ProGlu: 4.129 ± 0.057
1.552ProPhe: 1.552 ± 0.03
5.801ProGly: 5.801 ± 0.066
1.305ProHis: 1.305 ± 0.03
1.627ProIle: 1.627 ± 0.033
0.956ProLys: 0.956 ± 0.032
4.875ProLeu: 4.875 ± 0.066
1.088ProMet: 1.088 ± 0.026
0.817ProAsn: 0.817 ± 0.03
3.034ProPro: 3.034 ± 0.061
1.43ProGln: 1.43 ± 0.031
3.844ProArg: 3.844 ± 0.059
3.104ProSer: 3.104 ± 0.045
3.547ProThr: 3.547 ± 0.061
5.22ProVal: 5.22 ± 0.063
0.974ProTrp: 0.974 ± 0.029
1.08ProTyr: 1.08 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
3.747GlnAla: 3.747 ± 0.055
0.181GlnCys: 0.181 ± 0.012
1.283GlnAsp: 1.283 ± 0.031
1.435GlnGlu: 1.435 ± 0.035
0.702GlnPhe: 0.702 ± 0.023
2.167GlnGly: 2.167 ± 0.042
0.709GlnHis: 0.709 ± 0.022
1.062GlnIle: 1.062 ± 0.032
0.543GlnLys: 0.543 ± 0.02
3.182GlnLeu: 3.182 ± 0.051
0.522GlnMet: 0.522 ± 0.022
0.412GlnAsn: 0.412 ± 0.017
1.717GlnPro: 1.717 ± 0.042
1.158GlnGln: 1.158 ± 0.036
2.467GlnArg: 2.467 ± 0.047
1.285GlnSer: 1.285 ± 0.034
1.344GlnThr: 1.344 ± 0.03
2.982GlnVal: 2.982 ± 0.048
0.442GlnTrp: 0.442 ± 0.016
0.502GlnTyr: 0.502 ± 0.02
0.0GlnXaa: 0.0 ± 0.0
Arg
8.966ArgAla: 8.966 ± 0.078
0.583ArgCys: 0.583 ± 0.022
4.618ArgAsp: 4.618 ± 0.064
4.723ArgGlu: 4.723 ± 0.071
2.273ArgPhe: 2.273 ± 0.035
5.83ArgGly: 5.83 ± 0.073
2.051ArgHis: 2.051 ± 0.042
3.077ArgIle: 3.077 ± 0.047
1.554ArgLys: 1.554 ± 0.041
8.551ArgLeu: 8.551 ± 0.096
1.751ArgMet: 1.751 ± 0.033
1.314ArgAsn: 1.314 ± 0.032
4.426ArgPro: 4.426 ± 0.067
2.224ArgGln: 2.224 ± 0.042
7.298ArgArg: 7.298 ± 0.089
4.112ArgSer: 4.112 ± 0.064
4.569ArgThr: 4.569 ± 0.057
6.397ArgVal: 6.397 ± 0.082
1.386ArgTrp: 1.386 ± 0.036
1.545ArgTyr: 1.545 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
6.322SerAla: 6.322 ± 0.069
0.413SerCys: 0.413 ± 0.019
2.975SerAsp: 2.975 ± 0.051
2.575SerGlu: 2.575 ± 0.045
1.703SerPhe: 1.703 ± 0.037
5.605SerGly: 5.605 ± 0.071
1.125SerHis: 1.125 ± 0.028
1.81SerIle: 1.81 ± 0.036
0.96SerLys: 0.96 ± 0.028
5.254SerLeu: 5.254 ± 0.07
1.285SerMet: 1.285 ± 0.029
0.92SerAsn: 0.92 ± 0.03
3.212SerPro: 3.212 ± 0.057
1.35SerGln: 1.35 ± 0.033
4.048SerArg: 4.048 ± 0.052
3.136SerSer: 3.136 ± 0.055
3.37SerThr: 3.37 ± 0.053
4.451SerVal: 4.451 ± 0.058
0.998SerTrp: 0.998 ± 0.034
1.259SerTyr: 1.259 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
7.343ThrAla: 7.343 ± 0.082
0.487ThrCys: 0.487 ± 0.024
3.502ThrAsp: 3.502 ± 0.065
3.092ThrGlu: 3.092 ± 0.042
1.904ThrPhe: 1.904 ± 0.052
6.061ThrGly: 6.061 ± 0.075
1.265ThrHis: 1.265 ± 0.028
2.166ThrIle: 2.166 ± 0.047
1.088ThrLys: 1.088 ± 0.03
5.711ThrLeu: 5.711 ± 0.066
1.089ThrMet: 1.089 ± 0.03
1.069ThrAsn: 1.069 ± 0.029
3.956ThrPro: 3.956 ± 0.067
1.487ThrGln: 1.487 ± 0.033
4.145ThrArg: 4.145 ± 0.057
3.529ThrSer: 3.529 ± 0.061
4.099ThrThr: 4.099 ± 0.08
5.557ThrVal: 5.557 ± 0.079
1.042ThrTrp: 1.042 ± 0.028
1.351ThrTyr: 1.351 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
12.312ValAla: 12.312 ± 0.11
0.711ValCys: 0.711 ± 0.024
6.182ValAsp: 6.182 ± 0.067
5.821ValGlu: 5.821 ± 0.075
2.465ValPhe: 2.465 ± 0.046
7.584ValGly: 7.584 ± 0.085
2.107ValHis: 2.107 ± 0.039
3.141ValIle: 3.141 ± 0.05
1.704ValLys: 1.704 ± 0.041
9.838ValLeu: 9.838 ± 0.102
1.614ValMet: 1.614 ± 0.032
1.717ValAsn: 1.717 ± 0.035
5.13ValPro: 5.13 ± 0.057
2.217ValGln: 2.217 ± 0.041
6.894ValArg: 6.894 ± 0.087
4.801ValSer: 4.801 ± 0.067
6.08ValThr: 6.08 ± 0.078
10.371ValVal: 10.371 ± 0.112
1.123ValTrp: 1.123 ± 0.028
1.482ValTyr: 1.482 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.675TrpAla: 1.675 ± 0.032
0.15TrpCys: 0.15 ± 0.01
0.879TrpAsp: 0.879 ± 0.024
0.746TrpGlu: 0.746 ± 0.025
0.564TrpPhe: 0.564 ± 0.02
1.144TrpGly: 1.144 ± 0.03
0.416TrpHis: 0.416 ± 0.019
0.567TrpIle: 0.567 ± 0.023
0.302TrpLys: 0.302 ± 0.015
1.916TrpLeu: 1.916 ± 0.048
0.318TrpMet: 0.318 ± 0.016
0.326TrpAsn: 0.326 ± 0.017
0.767TrpPro: 0.767 ± 0.024
0.596TrpGln: 0.596 ± 0.023
1.33TrpArg: 1.33 ± 0.033
0.999TrpSer: 0.999 ± 0.031
1.01TrpThr: 1.01 ± 0.033
1.298TrpVal: 1.298 ± 0.035
0.36TrpTrp: 0.36 ± 0.015
0.286TrpTyr: 0.286 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.503TyrAla: 2.503 ± 0.043
0.174TyrCys: 0.174 ± 0.009
1.628TyrAsp: 1.628 ± 0.038
1.049TyrGlu: 1.049 ± 0.029
0.634TyrPhe: 0.634 ± 0.023
1.925TyrGly: 1.925 ± 0.04
0.355TyrHis: 0.355 ± 0.018
0.474TyrIle: 0.474 ± 0.019
0.365TyrLys: 0.365 ± 0.016
1.954TyrLeu: 1.954 ± 0.04
0.233TyrMet: 0.233 ± 0.015
0.377TyrAsn: 0.377 ± 0.017
0.967TyrPro: 0.967 ± 0.026
0.526TyrGln: 0.526 ± 0.019
1.486TyrArg: 1.486 ± 0.036
0.958TyrSer: 0.958 ± 0.023
1.035TyrThr: 1.035 ± 0.028
1.786TyrVal: 1.786 ± 0.034
0.316TyrTrp: 0.316 ± 0.016
0.439TyrTyr: 0.439 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4337 proteins (1410258 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
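A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is only an illustration under stated assumptions; the function name is hypothetical, and reporting the standard deviation of the replicates as the ± value is an assumption, since the page does not say which spread measure is used.

```python
import random
import statistics


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide's frequency.

    Proteins, not individual residues, are the resampling unit.
    n_replicates must be at least 2 for the standard deviation.
    """
    rng = random.Random(seed)
    # Pre-compute, per protein: (occurrences of pair, number of dipeptides).
    per_protein = [
        (sum(1 for a, b in zip(s, s[1:]) if a + b == pair), max(len(s) - 1, 0))
        for s in sequences
    ]
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome in one-letter code; real input would be thousands of proteins.
toy = ["MAAL", "AAG", "MKV", "GAAAA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual positions.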

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski