Amino acid dipepetide frequency for Rathayibacter toxicus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.803AlaAla: 16.803 ± 0.253
0.784AlaCys: 0.784 ± 0.034
6.781AlaAsp: 6.781 ± 0.135
7.72AlaGlu: 7.72 ± 0.124
3.521AlaPhe: 3.521 ± 0.087
9.945AlaGly: 9.945 ± 0.14
2.533AlaHis: 2.533 ± 0.072
5.699AlaIle: 5.699 ± 0.122
2.8AlaLys: 2.8 ± 0.083
13.805AlaLeu: 13.805 ± 0.183
2.222AlaMet: 2.222 ± 0.062
2.452AlaAsn: 2.452 ± 0.065
5.472AlaPro: 5.472 ± 0.117
4.118AlaGln: 4.118 ± 0.088
8.755AlaArg: 8.755 ± 0.143
7.77AlaSer: 7.77 ± 0.131
7.231AlaThr: 7.231 ± 0.139
11.265AlaVal: 11.265 ± 0.176
1.508AlaTrp: 1.508 ± 0.05
2.263AlaTyr: 2.263 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.886CysAla: 0.886 ± 0.037
0.063CysCys: 0.063 ± 0.011
0.416CysAsp: 0.416 ± 0.029
0.363CysGlu: 0.363 ± 0.025
0.194CysPhe: 0.194 ± 0.016
0.706CysGly: 0.706 ± 0.038
0.126CysHis: 0.126 ± 0.015
0.261CysIle: 0.261 ± 0.023
0.08CysLys: 0.08 ± 0.012
0.578CysLeu: 0.578 ± 0.032
0.102CysMet: 0.102 ± 0.012
0.126CysAsn: 0.126 ± 0.016
0.332CysPro: 0.332 ± 0.025
0.147CysGln: 0.147 ± 0.015
0.411CysArg: 0.411 ± 0.03
0.498CysSer: 0.498 ± 0.029
0.426CysThr: 0.426 ± 0.029
0.602CysVal: 0.602 ± 0.028
0.094CysTrp: 0.094 ± 0.013
0.143CysTyr: 0.143 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
7.061AspAla: 7.061 ± 0.122
0.315AspCys: 0.315 ± 0.025
3.463AspAsp: 3.463 ± 0.081
3.939AspGlu: 3.939 ± 0.085
1.735AspPhe: 1.735 ± 0.06
5.132AspGly: 5.132 ± 0.121
1.159AspHis: 1.159 ± 0.053
2.761AspIle: 2.761 ± 0.071
1.208AspLys: 1.208 ± 0.047
5.745AspLeu: 5.745 ± 0.096
0.678AspMet: 0.678 ± 0.037
1.115AspAsn: 1.115 ± 0.045
3.678AspPro: 3.678 ± 0.105
1.505AspGln: 1.505 ± 0.058
4.111AspArg: 4.111 ± 0.093
3.296AspSer: 3.296 ± 0.077
2.858AspThr: 2.858 ± 0.071
4.985AspVal: 4.985 ± 0.093
0.796AspTrp: 0.796 ± 0.036
1.362AspTyr: 1.362 ± 0.05
0.0AspXaa: 0.0 ± 0.0
Glu
6.439GluAla: 6.439 ± 0.112
0.327GluCys: 0.327 ± 0.024
2.403GluAsp: 2.403 ± 0.071
3.059GluGlu: 3.059 ± 0.082
1.841GluPhe: 1.841 ± 0.056
3.932GluGly: 3.932 ± 0.097
1.517GluHis: 1.517 ± 0.053
3.139GluIle: 3.139 ± 0.077
1.776GluLys: 1.776 ± 0.06
6.642GluLeu: 6.642 ± 0.133
0.985GluMet: 0.985 ± 0.041
1.432GluAsn: 1.432 ± 0.049
2.717GluPro: 2.717 ± 0.078
2.308GluGln: 2.308 ± 0.071
5.553GluArg: 5.553 ± 0.106
3.158GluSer: 3.158 ± 0.072
2.962GluThr: 2.962 ± 0.076
4.617GluVal: 4.617 ± 0.109
0.794GluTrp: 0.794 ± 0.045
1.169GluTyr: 1.169 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
4.036PheAla: 4.036 ± 0.087
0.199PheCys: 0.199 ± 0.02
2.308PheAsp: 2.308 ± 0.065
1.803PheGlu: 1.803 ± 0.054
1.161PhePhe: 1.161 ± 0.042
3.294PheGly: 3.294 ± 0.083
0.586PheHis: 0.586 ± 0.031
1.227PheIle: 1.227 ± 0.055
0.424PheLys: 0.424 ± 0.025
2.899PheLeu: 2.899 ± 0.084
0.402PheMet: 0.402 ± 0.029
0.617PheAsn: 0.617 ± 0.029
1.484PhePro: 1.484 ± 0.057
0.741PheGln: 0.741 ± 0.038
1.975PheArg: 1.975 ± 0.06
2.137PheSer: 2.137 ± 0.066
2.089PheThr: 2.089 ± 0.057
2.821PheVal: 2.821 ± 0.075
0.429PheTrp: 0.429 ± 0.029
0.654PheTyr: 0.654 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
9.818GlyAla: 9.818 ± 0.164
0.671GlyCys: 0.671 ± 0.038
4.361GlyAsp: 4.361 ± 0.079
4.702GlyGlu: 4.702 ± 0.11
3.064GlyPhe: 3.064 ± 0.077
7.207GlyGly: 7.207 ± 0.136
1.924GlyHis: 1.924 ± 0.061
4.777GlyIle: 4.777 ± 0.088
2.28GlyLys: 2.28 ± 0.069
7.997GlyLeu: 7.997 ± 0.114
1.703GlyMet: 1.703 ± 0.053
1.853GlyAsn: 1.853 ± 0.058
3.484GlyPro: 3.484 ± 0.09
2.403GlyGln: 2.403 ± 0.068
6.302GlyArg: 6.302 ± 0.149
5.789GlySer: 5.789 ± 0.094
5.636GlyThr: 5.636 ± 0.112
7.954GlyVal: 7.954 ± 0.127
1.343GlyTrp: 1.343 ± 0.054
2.066GlyTyr: 2.066 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
2.412HisAla: 2.412 ± 0.066
0.145HisCys: 0.145 ± 0.013
1.394HisAsp: 1.394 ± 0.054
1.259HisGlu: 1.259 ± 0.044
0.614HisPhe: 0.614 ± 0.035
1.951HisGly: 1.951 ± 0.065
0.581HisHis: 0.581 ± 0.03
0.859HisIle: 0.859 ± 0.037
0.365HisLys: 0.365 ± 0.026
2.113HisLeu: 2.113 ± 0.068
0.276HisMet: 0.276 ± 0.022
0.568HisAsn: 0.568 ± 0.036
1.369HisPro: 1.369 ± 0.055
0.614HisGln: 0.614 ± 0.035
1.767HisArg: 1.767 ± 0.05
1.348HisSer: 1.348 ± 0.046
1.162HisThr: 1.162 ± 0.053
1.665HisVal: 1.665 ± 0.054
0.233HisTrp: 0.233 ± 0.021
0.552HisTyr: 0.552 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
6.873IleAla: 6.873 ± 0.114
0.337IleCys: 0.337 ± 0.024
3.751IleAsp: 3.751 ± 0.092
2.884IleGlu: 2.884 ± 0.086
1.312IlePhe: 1.312 ± 0.052
4.496IleGly: 4.496 ± 0.103
0.815IleHis: 0.815 ± 0.041
1.972IleIle: 1.972 ± 0.073
0.925IleLys: 0.925 ± 0.046
4.157IleLeu: 4.157 ± 0.092
0.607IleMet: 0.607 ± 0.035
1.079IleAsn: 1.079 ± 0.042
2.531IlePro: 2.531 ± 0.066
1.024IleGln: 1.024 ± 0.048
2.983IleArg: 2.983 ± 0.071
2.834IleSer: 2.834 ± 0.062
3.187IleThr: 3.187 ± 0.073
4.794IleVal: 4.794 ± 0.09
0.475IleTrp: 0.475 ± 0.033
0.847IleTyr: 0.847 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
2.5LysAla: 2.5 ± 0.078
0.089LysCys: 0.089 ± 0.013
1.246LysAsp: 1.246 ± 0.047
1.115LysGlu: 1.115 ± 0.049
0.542LysPhe: 0.542 ± 0.032
1.755LysGly: 1.755 ± 0.065
0.428LysHis: 0.428 ± 0.025
1.138LysIle: 1.138 ± 0.056
0.951LysLys: 0.951 ± 0.046
1.982LysLeu: 1.982 ± 0.061
0.474LysMet: 0.474 ± 0.028
0.792LysAsn: 0.792 ± 0.04
1.142LysPro: 1.142 ± 0.049
0.796LysGln: 0.796 ± 0.037
1.634LysArg: 1.634 ± 0.056
1.352LysSer: 1.352 ± 0.046
1.505LysThr: 1.505 ± 0.057
1.81LysVal: 1.81 ± 0.068
0.251LysTrp: 0.251 ± 0.019
0.554LysTyr: 0.554 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
14.265LeuAla: 14.265 ± 0.205
0.721LeuCys: 0.721 ± 0.034
6.393LeuAsp: 6.393 ± 0.119
5.449LeuGlu: 5.449 ± 0.108
2.918LeuPhe: 2.918 ± 0.072
9.077LeuGly: 9.077 ± 0.156
2.161LeuHis: 2.161 ± 0.071
4.632LeuIle: 4.632 ± 0.093
1.83LeuLys: 1.83 ± 0.054
10.974LeuLeu: 10.974 ± 0.199
1.544LeuMet: 1.544 ± 0.051
2.103LeuAsn: 2.103 ± 0.06
5.558LeuPro: 5.558 ± 0.097
2.447LeuGln: 2.447 ± 0.066
8.026LeuArg: 8.026 ± 0.149
6.679LeuSer: 6.679 ± 0.111
6.425LeuThr: 6.425 ± 0.121
9.563LeuVal: 9.563 ± 0.135
1.229LeuTrp: 1.229 ± 0.05
1.776LeuTyr: 1.776 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
1.842MetAla: 1.842 ± 0.06
0.087MetCys: 0.087 ± 0.013
0.697MetAsp: 0.697 ± 0.036
0.637MetGlu: 0.637 ± 0.035
0.445MetPhe: 0.445 ± 0.031
1.157MetGly: 1.157 ± 0.048
0.307MetHis: 0.307 ± 0.021
0.874MetIle: 0.874 ± 0.044
0.486MetLys: 0.486 ± 0.028
1.815MetLeu: 1.815 ± 0.058
0.331MetMet: 0.331 ± 0.025
0.455MetAsn: 0.455 ± 0.028
1.029MetPro: 1.029 ± 0.043
0.477MetGln: 0.477 ± 0.032
1.266MetArg: 1.266 ± 0.05
1.435MetSer: 1.435 ± 0.048
1.578MetThr: 1.578 ± 0.051
1.346MetVal: 1.346 ± 0.053
0.135MetTrp: 0.135 ± 0.017
0.249MetTyr: 0.249 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
2.522AsnAla: 2.522 ± 0.079
0.169AsnCys: 0.169 ± 0.017
1.392AsnAsp: 1.392 ± 0.046
1.106AsnGlu: 1.106 ± 0.044
0.741AsnPhe: 0.741 ± 0.039
2.147AsnGly: 2.147 ± 0.063
0.411AsnHis: 0.411 ± 0.024
1.026AsnIle: 1.026 ± 0.045
0.516AsnLys: 0.516 ± 0.031
2.089AsnLeu: 2.089 ± 0.063
0.329AsnMet: 0.329 ± 0.022
0.585AsnAsn: 0.585 ± 0.033
1.5AsnPro: 1.5 ± 0.053
0.653AsnGln: 0.653 ± 0.036
1.575AsnArg: 1.575 ± 0.047
1.382AsnSer: 1.382 ± 0.058
1.433AsnThr: 1.433 ± 0.055
1.905AsnVal: 1.905 ± 0.049
0.344AsnTrp: 0.344 ± 0.024
0.566AsnTyr: 0.566 ± 0.03
0.0AsnXaa: 0.0 ± 0.0
Pro
5.595ProAla: 5.595 ± 0.103
0.261ProCys: 0.261 ± 0.022
3.337ProAsp: 3.337 ± 0.077
3.565ProGlu: 3.565 ± 0.089
1.696ProPhe: 1.696 ± 0.05
4.656ProGly: 4.656 ± 0.093
1.283ProHis: 1.283 ± 0.046
2.1ProIle: 2.1 ± 0.063
0.978ProLys: 0.978 ± 0.048
5.137ProLeu: 5.137 ± 0.104
0.775ProMet: 0.775 ± 0.036
1.167ProAsn: 1.167 ± 0.046
2.216ProPro: 2.216 ± 0.068
1.663ProGln: 1.663 ± 0.064
3.449ProArg: 3.449 ± 0.085
3.344ProSer: 3.344 ± 0.075
3.402ProThr: 3.402 ± 0.078
4.731ProVal: 4.731 ± 0.086
0.748ProTrp: 0.748 ± 0.035
1.021ProTyr: 1.021 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
3.301GlnAla: 3.301 ± 0.077
0.174GlnCys: 0.174 ± 0.021
1.178GlnAsp: 1.178 ± 0.043
1.505GlnGlu: 1.505 ± 0.054
0.859GlnPhe: 0.859 ± 0.042
2.089GlnGly: 2.089 ± 0.05
0.704GlnHis: 0.704 ± 0.035
1.568GlnIle: 1.568 ± 0.054
0.854GlnLys: 0.854 ± 0.04
3.589GlnLeu: 3.589 ± 0.082
0.532GlnMet: 0.532 ± 0.03
0.721GlnAsn: 0.721 ± 0.036
1.534GlnPro: 1.534 ± 0.056
1.227GlnGln: 1.227 ± 0.046
2.727GlnArg: 2.727 ± 0.073
1.619GlnSer: 1.619 ± 0.055
1.421GlnThr: 1.421 ± 0.048
2.415GlnVal: 2.415 ± 0.064
0.464GlnTrp: 0.464 ± 0.028
0.671GlnTyr: 0.671 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
8.649ArgAla: 8.649 ± 0.135
0.528ArgCys: 0.528 ± 0.03
4.07ArgAsp: 4.07 ± 0.084
4.637ArgGlu: 4.637 ± 0.1
2.565ArgPhe: 2.565 ± 0.07
5.524ArgGly: 5.524 ± 0.103
1.745ArgHis: 1.745 ± 0.06
3.869ArgIle: 3.869 ± 0.089
1.537ArgLys: 1.537 ± 0.051
7.833ArgLeu: 7.833 ± 0.146
1.553ArgMet: 1.553 ± 0.052
1.571ArgAsn: 1.571 ± 0.054
3.526ArgPro: 3.526 ± 0.082
2.132ArgGln: 2.132 ± 0.062
6.701ArgArg: 6.701 ± 0.147
4.988ArgSer: 4.988 ± 0.104
4.245ArgThr: 4.245 ± 0.086
6.284ArgVal: 6.284 ± 0.119
1.184ArgTrp: 1.184 ± 0.043
1.684ArgTyr: 1.684 ± 0.057
0.0ArgXaa: 0.0 ± 0.0
Ser
7.959SerAla: 7.959 ± 0.124
0.435SerCys: 0.435 ± 0.028
3.617SerAsp: 3.617 ± 0.086
3.172SerGlu: 3.172 ± 0.085
2.069SerPhe: 2.069 ± 0.063
6.422SerGly: 6.422 ± 0.105
1.236SerHis: 1.236 ± 0.051
2.822SerIle: 2.822 ± 0.072
1.232SerLys: 1.232 ± 0.047
6.473SerLeu: 6.473 ± 0.111
1.159SerMet: 1.159 ± 0.049
1.355SerAsn: 1.355 ± 0.047
3.371SerPro: 3.371 ± 0.078
1.704SerGln: 1.704 ± 0.058
4.61SerArg: 4.61 ± 0.092
4.632SerSer: 4.632 ± 0.101
4.09SerThr: 4.09 ± 0.091
5.638SerVal: 5.638 ± 0.097
1.035SerTrp: 1.035 ± 0.04
1.52SerTyr: 1.52 ± 0.051
0.0SerXaa: 0.0 ± 0.0
Thr
7.918ThrAla: 7.918 ± 0.139
0.404ThrCys: 0.404 ± 0.03
3.148ThrAsp: 3.148 ± 0.076
3.231ThrGlu: 3.231 ± 0.067
1.847ThrPhe: 1.847 ± 0.062
5.493ThrGly: 5.493 ± 0.097
1.251ThrHis: 1.251 ± 0.047
3.001ThrIle: 3.001 ± 0.08
1.333ThrLys: 1.333 ± 0.057
6.514ThrLeu: 6.514 ± 0.102
0.985ThrMet: 0.985 ± 0.041
1.428ThrAsn: 1.428 ± 0.049
3.807ThrPro: 3.807 ± 0.078
1.67ThrGln: 1.67 ± 0.059
3.799ThrArg: 3.799 ± 0.078
3.974ThrSer: 3.974 ± 0.086
4.303ThrThr: 4.303 ± 0.133
6.195ThrVal: 6.195 ± 0.121
0.791ThrTrp: 0.791 ± 0.038
1.017ThrTyr: 1.017 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
10.936ValAla: 10.936 ± 0.129
0.578ValCys: 0.578 ± 0.032
4.881ValAsp: 4.881 ± 0.096
4.885ValGlu: 4.885 ± 0.099
2.921ValPhe: 2.921 ± 0.068
7.514ValGly: 7.514 ± 0.118
1.846ValHis: 1.846 ± 0.06
4.765ValIle: 4.765 ± 0.102
1.767ValLys: 1.767 ± 0.063
9.554ValLeu: 9.554 ± 0.149
1.484ValMet: 1.484 ± 0.05
2.032ValAsn: 2.032 ± 0.067
4.641ValPro: 4.641 ± 0.093
2.463ValGln: 2.463 ± 0.06
6.144ValArg: 6.144 ± 0.111
5.917ValSer: 5.917 ± 0.117
6.297ValThr: 6.297 ± 0.122
9.147ValVal: 9.147 ± 0.153
0.929ValTrp: 0.929 ± 0.039
1.501ValTyr: 1.501 ± 0.056
0.0ValXaa: 0.0 ± 0.0
Trp
1.425TrpAla: 1.425 ± 0.046
0.109TrpCys: 0.109 ± 0.016
0.661TrpAsp: 0.661 ± 0.037
0.62TrpGlu: 0.62 ± 0.032
0.516TrpPhe: 0.516 ± 0.03
0.939TrpGly: 0.939 ± 0.042
0.264TrpHis: 0.264 ± 0.02
0.639TrpIle: 0.639 ± 0.037
0.312TrpLys: 0.312 ± 0.024
1.643TrpLeu: 1.643 ± 0.055
0.264TrpMet: 0.264 ± 0.024
0.429TrpAsn: 0.429 ± 0.031
0.627TrpPro: 0.627 ± 0.035
0.467TrpGln: 0.467 ± 0.025
1.27TrpArg: 1.27 ± 0.046
0.987TrpSer: 0.987 ± 0.042
0.746TrpThr: 0.746 ± 0.04
0.893TrpVal: 0.893 ± 0.039
0.297TrpTrp: 0.297 ± 0.024
0.247TrpTyr: 0.247 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.207TyrAla: 2.207 ± 0.059
0.158TyrCys: 0.158 ± 0.016
1.404TyrAsp: 1.404 ± 0.077
1.116TyrGlu: 1.116 ± 0.047
0.728TyrPhe: 0.728 ± 0.037
1.899TyrGly: 1.899 ± 0.055
0.372TyrHis: 0.372 ± 0.027
0.753TyrIle: 0.753 ± 0.042
0.399TyrLys: 0.399 ± 0.028
2.231TyrLeu: 2.231 ± 0.064
0.242TyrMet: 0.242 ± 0.021
0.513TyrAsn: 0.513 ± 0.029
1.108TyrPro: 1.108 ± 0.042
0.593TyrGln: 0.593 ± 0.029
1.769TyrArg: 1.769 ± 0.057
1.409TyrSer: 1.409 ± 0.051
1.111TyrThr: 1.111 ± 0.05
1.573TyrVal: 1.573 ± 0.054
0.278TyrTrp: 0.278 ± 0.023
0.515TyrTyr: 0.515 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1904 proteins (586753 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski