Amino acid dipeptide frequency for Tissierella praeacuta DSM 18095

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
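As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It is not the code used to produce this page. It assumes that dipeptides are counted as overlapping pairs within each protein, that pairs never span two proteins, and that frequencies are normalised by the total number of pairs. The function name and the toy sequences are hypothetical, and one-letter residue codes are used for brevity where the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each sequence only and the result is
    normalised by the total number of pairs in the whole set of sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Sliding window of length 2; a sequence of length L yields L - 1 pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
```

In the toy input, "MK" occurs in both sequences, so it accounts for two of the six pairs.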

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
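The expected value and the red/blue marking can be sketched as follows. This assumes that the threshold is simply the uniform expectation (1 divided by the number of possible dipeptides); the page does not state the exact rule, and the function names are hypothetical.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked in red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked in blue
    return "expected"


expected_20 = expected_uniform_per_mille(20)   # 400 possible dipeptides
expected_21 = expected_uniform_per_mille(21)   # 441 if Xaa is included

# Two values taken from the table on this page.
ile_ile = classify(10.961, expected_20)
trp_trp = classify(0.075, expected_20)
```

With the 20-letter alphabet the expectation is 2.5‰, so IleIle (10.961‰) counts as overrepresented and TrpTrp (0.075‰) as underrepresented.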

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
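A short worked conversion may help when reading the table. The sketch below converts the AlaAla value to a fraction and a percentage, then estimates an absolute count. The count relies on an assumption: that the total number of dipeptides equals the number of amino acids minus the number of proteins (both figures are taken from the statistics line on this page), which holds only if pairs are counted as overlapping within each protein.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


ala_ala = 3.522                       # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100

n_proteins = 3095                     # from the statistics line on this page
n_residues = 911760                   # from the statistics line on this page
# Assumption: a protein of length L contributes L - 1 overlapping pairs.
n_dipeptides = n_residues - n_proteins
approx_count = fraction * n_dipeptides
```

Under that assumption, AlaAla would correspond to roughly 3200 occurrences in this proteome.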

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.522 ± 0.082
AlaCys: 0.514 ± 0.028
AlaAsp: 2.425 ± 0.051
AlaGlu: 3.427 ± 0.071
AlaPhe: 2.285 ± 0.052
AlaGly: 3.719 ± 0.076
AlaHis: 0.756 ± 0.029
AlaIle: 5.86 ± 0.095
AlaLys: 4.44 ± 0.084
AlaLeu: 5.541 ± 0.092
AlaMet: 1.743 ± 0.047
AlaAsn: 2.729 ± 0.054
AlaPro: 1.363 ± 0.04
AlaGln: 1.277 ± 0.041
AlaArg: 2.016 ± 0.047
AlaSer: 3.002 ± 0.054
AlaThr: 2.755 ± 0.061
AlaVal: 3.627 ± 0.081
AlaTrp: 0.325 ± 0.018
AlaTyr: 1.998 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.41 ± 0.024
CysCys: 0.103 ± 0.011
CysAsp: 0.499 ± 0.025
CysGlu: 0.536 ± 0.027
CysPhe: 0.356 ± 0.019
CysGly: 0.842 ± 0.031
CysHis: 0.197 ± 0.015
CysIle: 0.937 ± 0.033
CysLys: 0.684 ± 0.03
CysLeu: 0.647 ± 0.026
CysMet: 0.213 ± 0.014
CysAsn: 0.588 ± 0.026
CysPro: 0.455 ± 0.026
CysGln: 0.207 ± 0.015
CysArg: 0.332 ± 0.022
CysSer: 0.611 ± 0.025
CysThr: 0.439 ± 0.021
CysVal: 0.455 ± 0.027
CysTrp: 0.054 ± 0.008
CysTyr: 0.34 ± 0.022
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.45 ± 0.055
AspCys: 0.461 ± 0.025
AspAsp: 2.671 ± 0.057
AspGlu: 4.865 ± 0.089
AspPhe: 2.811 ± 0.053
AspGly: 3.434 ± 0.069
AspHis: 0.567 ± 0.026
AspIle: 7.12 ± 0.09
AspLys: 5.292 ± 0.088
AspLeu: 4.898 ± 0.082
AspMet: 1.682 ± 0.04
AspAsn: 3.487 ± 0.069
AspPro: 1.418 ± 0.045
AspGln: 0.677 ± 0.026
AspArg: 2.056 ± 0.049
AspSer: 2.942 ± 0.063
AspThr: 2.509 ± 0.055
AspVal: 3.141 ± 0.064
AspTrp: 0.434 ± 0.025
AspTyr: 2.779 ± 0.061
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.956 ± 0.076
GluCys: 0.469 ± 0.023
GluAsp: 4.559 ± 0.084
GluGlu: 7.942 ± 0.135
GluPhe: 2.985 ± 0.065
GluGly: 4.537 ± 0.077
GluHis: 0.995 ± 0.035
GluIle: 8.423 ± 0.119
GluLys: 7.816 ± 0.09
GluLeu: 7.496 ± 0.101
GluMet: 2.189 ± 0.05
GluAsn: 5.223 ± 0.097
GluPro: 1.739 ± 0.045
GluGln: 1.791 ± 0.048
GluArg: 2.957 ± 0.07
GluSer: 3.552 ± 0.063
GluThr: 3.147 ± 0.068
GluVal: 4.844 ± 0.083
GluTrp: 0.496 ± 0.026
GluTyr: 3.217 ± 0.055
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.208 ± 0.05
PheCys: 0.381 ± 0.022
PheAsp: 2.296 ± 0.054
PheGlu: 2.761 ± 0.061
PhePhe: 2.017 ± 0.051
PheGly: 2.943 ± 0.069
PheHis: 0.642 ± 0.023
PheIle: 4.775 ± 0.093
PheLys: 3.401 ± 0.064
PheLeu: 3.96 ± 0.08
PheMet: 1.13 ± 0.035
PheAsn: 2.806 ± 0.054
PhePro: 1.346 ± 0.04
PheGln: 1.107 ± 0.035
PheArg: 1.364 ± 0.039
PheSer: 3.019 ± 0.07
PheThr: 2.395 ± 0.05
PheVal: 2.483 ± 0.053
PheTrp: 0.288 ± 0.017
PheTyr: 1.8 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.892 ± 0.089
GlyCys: 0.811 ± 0.033
GlyAsp: 3.27 ± 0.068
GlyGlu: 4.655 ± 0.071
GlyPhe: 3.219 ± 0.059
GlyGly: 4.551 ± 0.082
GlyHis: 1.058 ± 0.033
GlyIle: 7.488 ± 0.092
GlyLys: 5.857 ± 0.093
GlyLeu: 6.061 ± 0.096
GlyMet: 1.867 ± 0.043
GlyAsn: 3.481 ± 0.068
GlyPro: 1.217 ± 0.037
GlyGln: 1.545 ± 0.042
GlyArg: 2.523 ± 0.06
GlySer: 3.701 ± 0.063
GlyThr: 3.597 ± 0.057
GlyVal: 4.518 ± 0.071
GlyTrp: 0.461 ± 0.021
GlyTyr: 2.984 ± 0.056
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.68 ± 0.024
HisCys: 0.179 ± 0.013
HisAsp: 0.7 ± 0.026
HisGlu: 0.908 ± 0.035
HisPhe: 0.608 ± 0.028
HisGly: 1.02 ± 0.037
HisHis: 0.286 ± 0.02
HisIle: 1.492 ± 0.041
HisLys: 1.038 ± 0.033
HisLeu: 1.216 ± 0.034
HisMet: 0.355 ± 0.017
HisAsn: 0.795 ± 0.028
HisPro: 0.655 ± 0.029
HisGln: 0.347 ± 0.019
HisArg: 0.577 ± 0.027
HisSer: 0.9 ± 0.034
HisThr: 0.685 ± 0.028
HisVal: 0.717 ± 0.031
HisTrp: 0.118 ± 0.011
HisTyr: 0.589 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.893 ± 0.087
IleCys: 0.991 ± 0.037
IleAsp: 6.665 ± 0.098
IleGlu: 8.112 ± 0.107
IlePhe: 4.61 ± 0.088
IleGly: 6.932 ± 0.111
IleHis: 1.359 ± 0.039
IleIle: 10.961 ± 0.151
IleLys: 8.898 ± 0.107
IleLeu: 10.18 ± 0.137
IleMet: 2.617 ± 0.06
IleAsn: 6.326 ± 0.103
IlePro: 3.571 ± 0.077
IleGln: 2.433 ± 0.051
IleArg: 3.58 ± 0.074
IleSer: 7.443 ± 0.1
IleThr: 5.029 ± 0.078
IleVal: 6.577 ± 0.091
IleTrp: 0.637 ± 0.029
IleTyr: 3.879 ± 0.085
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.351 ± 0.081
LysCys: 0.576 ± 0.03
LysAsp: 6.119 ± 0.098
LysGlu: 9.248 ± 0.112
LysPhe: 2.932 ± 0.061
LysGly: 5.44 ± 0.087
LysHis: 1.099 ± 0.037
LysIle: 8.59 ± 0.104
LysLys: 7.287 ± 0.112
LysLeu: 7.403 ± 0.085
LysMet: 2.309 ± 0.049
LysAsn: 5.392 ± 0.084
LysPro: 2.044 ± 0.053
LysGln: 1.992 ± 0.05
LysArg: 3.129 ± 0.061
LysSer: 4.732 ± 0.073
LysThr: 4.013 ± 0.071
LysVal: 5.383 ± 0.09
LysTrp: 0.604 ± 0.029
LysTyr: 3.848 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.223 ± 0.078
LeuCys: 0.809 ± 0.028
LeuAsp: 5.526 ± 0.084
LeuGlu: 7.075 ± 0.098
LeuPhe: 3.922 ± 0.071
LeuGly: 6.501 ± 0.108
LeuHis: 1.09 ± 0.032
LeuIle: 8.794 ± 0.117
LeuLys: 7.946 ± 0.108
LeuLeu: 8.275 ± 0.121
LeuMet: 2.334 ± 0.05
LeuAsn: 5.89 ± 0.092
LeuPro: 3.049 ± 0.053
LeuGln: 2.107 ± 0.053
LeuArg: 3.251 ± 0.065
LeuSer: 6.893 ± 0.094
LeuThr: 4.454 ± 0.072
LeuVal: 5.475 ± 0.083
LeuTrp: 0.556 ± 0.027
LeuTyr: 3.329 ± 0.065
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.854 ± 0.052
MetCys: 0.173 ± 0.013
MetAsp: 1.781 ± 0.039
MetGlu: 2.532 ± 0.05
MetPhe: 0.972 ± 0.037
MetGly: 1.988 ± 0.048
MetHis: 0.287 ± 0.02
MetIle: 2.321 ± 0.05
MetLys: 2.606 ± 0.053
MetLeu: 2.287 ± 0.051
MetMet: 0.725 ± 0.031
MetAsn: 1.618 ± 0.042
MetPro: 0.782 ± 0.032
MetGln: 0.512 ± 0.024
MetArg: 0.926 ± 0.032
MetSer: 1.572 ± 0.039
MetThr: 1.346 ± 0.038
MetVal: 1.884 ± 0.046
MetTrp: 0.155 ± 0.012
MetTyr: 0.783 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.602 ± 0.059
AsnCys: 0.598 ± 0.029
AsnAsp: 2.619 ± 0.053
AsnGlu: 4.008 ± 0.076
AsnPhe: 2.393 ± 0.051
AsnGly: 3.566 ± 0.071
AsnHis: 0.952 ± 0.033
AsnIle: 7.834 ± 0.111
AsnLys: 5.789 ± 0.095
AsnLeu: 5.613 ± 0.084
AsnMet: 1.75 ± 0.046
AsnAsn: 3.936 ± 0.078
AsnPro: 2.285 ± 0.055
AsnGln: 1.577 ± 0.042
AsnArg: 2.283 ± 0.049
AsnSer: 3.607 ± 0.061
AsnThr: 2.98 ± 0.06
AsnVal: 3.114 ± 0.057
AsnTrp: 0.446 ± 0.025
AsnTyr: 2.509 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.473 ± 0.046
ProCys: 0.274 ± 0.019
ProAsp: 1.499 ± 0.041
ProGlu: 2.196 ± 0.048
ProPhe: 1.419 ± 0.038
ProGly: 1.753 ± 0.055
ProHis: 0.532 ± 0.026
ProIle: 3.185 ± 0.061
ProLys: 2.269 ± 0.054
ProLeu: 2.568 ± 0.058
ProMet: 0.806 ± 0.03
ProAsn: 1.727 ± 0.042
ProPro: 0.779 ± 0.035
ProGln: 0.816 ± 0.029
ProArg: 0.979 ± 0.037
ProSer: 1.844 ± 0.048
ProThr: 1.573 ± 0.046
ProVal: 2.085 ± 0.06
ProTrp: 0.235 ± 0.017
ProTyr: 1.339 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.319 ± 0.041
GlnCys: 0.213 ± 0.017
GlnAsp: 1.206 ± 0.037
GlnGlu: 1.892 ± 0.045
GlnPhe: 0.885 ± 0.03
GlnGly: 1.621 ± 0.043
GlnHis: 0.314 ± 0.018
GlnIle: 2.298 ± 0.05
GlnLys: 1.867 ± 0.048
GlnLeu: 2.084 ± 0.047
GlnMet: 0.73 ± 0.028
GlnAsn: 1.39 ± 0.039
GlnPro: 0.601 ± 0.025
GlnGln: 0.664 ± 0.029
GlnArg: 1.102 ± 0.039
GlnSer: 1.324 ± 0.039
GlnThr: 1.044 ± 0.037
GlnVal: 1.481 ± 0.044
GlnTrp: 0.17 ± 0.016
GlnTyr: 0.928 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.973 ± 0.046
ArgCys: 0.328 ± 0.021
ArgAsp: 2.081 ± 0.047
ArgGlu: 3.3 ± 0.074
ArgPhe: 1.659 ± 0.043
ArgGly: 2.3 ± 0.055
ArgHis: 0.472 ± 0.024
ArgIle: 3.669 ± 0.061
ArgLys: 3.277 ± 0.052
ArgLeu: 3.433 ± 0.066
ArgMet: 1.005 ± 0.032
ArgAsn: 2.212 ± 0.048
ArgPro: 0.957 ± 0.032
ArgGln: 0.994 ± 0.035
ArgArg: 1.582 ± 0.048
ArgSer: 1.48 ± 0.042
ArgThr: 1.746 ± 0.044
ArgVal: 2.305 ± 0.045
ArgTrp: 0.251 ± 0.014
ArgTyr: 1.601 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.843 ± 0.062
SerCys: 0.541 ± 0.029
SerAsp: 2.833 ± 0.055
SerGlu: 3.597 ± 0.066
SerPhe: 3.049 ± 0.063
SerGly: 4.2 ± 0.076
SerHis: 0.95 ± 0.034
SerIle: 6.884 ± 0.1
SerLys: 5.148 ± 0.079
SerLeu: 6.042 ± 0.092
SerMet: 1.657 ± 0.046
SerAsn: 3.536 ± 0.079
SerPro: 1.791 ± 0.045
SerGln: 1.682 ± 0.046
SerArg: 2.267 ± 0.052
SerSer: 3.875 ± 0.072
SerThr: 3.056 ± 0.061
SerVal: 3.232 ± 0.065
SerTrp: 0.471 ± 0.028
SerTyr: 2.555 ± 0.054
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.849 ± 0.067
ThrCys: 0.398 ± 0.022
ThrAsp: 2.405 ± 0.053
ThrGlu: 3.188 ± 0.062
ThrPhe: 1.997 ± 0.055
ThrGly: 3.896 ± 0.071
ThrHis: 0.783 ± 0.027
ThrIle: 5.488 ± 0.081
ThrLys: 3.763 ± 0.066
ThrLeu: 4.514 ± 0.071
ThrMet: 1.263 ± 0.042
ThrAsn: 2.694 ± 0.059
ThrPro: 1.845 ± 0.052
ThrGln: 1.088 ± 0.033
ThrArg: 1.621 ± 0.034
ThrSer: 2.852 ± 0.065
ThrThr: 2.761 ± 0.067
ThrVal: 3.29 ± 0.063
ThrTrp: 0.339 ± 0.023
ThrTyr: 1.735 ± 0.051
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.636 ± 0.07
ValCys: 0.581 ± 0.027
ValAsp: 3.864 ± 0.062
ValGlu: 4.759 ± 0.084
ValPhe: 2.792 ± 0.057
ValGly: 4.309 ± 0.079
ValHis: 0.793 ± 0.029
ValIle: 5.694 ± 0.085
ValLys: 4.957 ± 0.085
ValLeu: 5.911 ± 0.093
ValMet: 1.496 ± 0.037
ValAsn: 3.366 ± 0.059
ValPro: 1.96 ± 0.05
ValGln: 1.255 ± 0.039
ValArg: 2.195 ± 0.047
ValSer: 3.899 ± 0.079
ValThr: 2.93 ± 0.056
ValVal: 4.262 ± 0.083
ValTrp: 0.363 ± 0.02
ValTyr: 2.252 ± 0.055
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.395 ± 0.019
TrpCys: 0.072 ± 0.01
TrpAsp: 0.378 ± 0.022
TrpGlu: 0.471 ± 0.024
TrpPhe: 0.291 ± 0.018
TrpGly: 0.519 ± 0.026
TrpHis: 0.112 ± 0.012
TrpIle: 0.651 ± 0.032
TrpLys: 0.569 ± 0.023
TrpLeu: 0.594 ± 0.027
TrpMet: 0.217 ± 0.016
TrpAsn: 0.431 ± 0.024
TrpPro: 0.156 ± 0.015
TrpGln: 0.177 ± 0.014
TrpArg: 0.271 ± 0.017
TrpSer: 0.383 ± 0.02
TrpThr: 0.363 ± 0.022
TrpVal: 0.396 ± 0.023
TrpTrp: 0.075 ± 0.01
TrpTyr: 0.27 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.794 ± 0.048
TyrCys: 0.434 ± 0.019
TyrAsp: 2.3 ± 0.05
TyrGlu: 2.863 ± 0.062
TyrPhe: 1.918 ± 0.049
TyrGly: 2.685 ± 0.058
TyrHis: 0.622 ± 0.027
TyrIle: 4.228 ± 0.072
TyrLys: 3.586 ± 0.067
TyrLeu: 3.774 ± 0.069
TyrMet: 1.013 ± 0.031
TyrAsn: 2.836 ± 0.07
TyrPro: 1.337 ± 0.037
TyrGln: 0.886 ± 0.036
TyrArg: 1.613 ± 0.04
TyrSer: 2.581 ± 0.058
TyrThr: 1.949 ± 0.049
TyrVal: 2.003 ± 0.048
TyrTrp: 0.313 ± 0.018
TyrTyr: 1.86 ± 0.053
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3095 proteins (911760 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
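A protein-level bootstrap of this kind can be sketched as below. This is an illustration, not the database's own code. It assumes that whole proteins are resampled with replacement, that "×100" means 100 resamples, and that the reported error is the standard deviation of the resampled estimates; none of these details are stated on the page. All function names and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)   # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Hypothetical toy input (one-letter codes).
toy_proteome = ["MKKLA", "MKA", "AAAA", "MKLLK"]
mk_mean, mk_error = bootstrap_dipeptide(toy_proteome, "MK")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.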

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski