Amino acid dipepetide frequency for Candidatus Izimaplasma sp. HR1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.288AlaAla: 3.288 ± 0.109
0.514AlaCys: 0.514 ± 0.032
2.464AlaAsp: 2.464 ± 0.073
2.782AlaGlu: 2.782 ± 0.088
2.749AlaPhe: 2.749 ± 0.073
3.396AlaGly: 3.396 ± 0.105
0.909AlaHis: 0.909 ± 0.039
5.341AlaIle: 5.341 ± 0.106
4.174AlaLys: 4.174 ± 0.098
5.81AlaLeu: 5.81 ± 0.118
1.406AlaMet: 1.406 ± 0.05
2.628AlaAsn: 2.628 ± 0.069
1.336AlaPro: 1.336 ± 0.044
1.308AlaGln: 1.308 ± 0.045
1.974AlaArg: 1.974 ± 0.059
3.284AlaSer: 3.284 ± 0.083
2.901AlaThr: 2.901 ± 0.083
3.699AlaVal: 3.699 ± 0.094
0.346AlaTrp: 0.346 ± 0.028
2.251AlaTyr: 2.251 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.383CysAla: 0.383 ± 0.029
0.068CysCys: 0.068 ± 0.014
0.521CysAsp: 0.521 ± 0.031
0.525CysGlu: 0.525 ± 0.031
0.278CysPhe: 0.278 ± 0.02
0.719CysGly: 0.719 ± 0.041
0.182CysHis: 0.182 ± 0.019
0.612CysIle: 0.612 ± 0.032
0.568CysLys: 0.568 ± 0.038
0.602CysLeu: 0.602 ± 0.035
0.157CysMet: 0.157 ± 0.016
0.483CysAsn: 0.483 ± 0.028
0.39CysPro: 0.39 ± 0.029
0.175CysGln: 0.175 ± 0.016
0.248CysArg: 0.248 ± 0.023
0.43CysSer: 0.43 ± 0.03
0.469CysThr: 0.469 ± 0.028
0.488CysVal: 0.488 ± 0.035
0.042CysTrp: 0.042 ± 0.009
0.271CysTyr: 0.271 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
2.943AspAla: 2.943 ± 0.087
0.441AspCys: 0.441 ± 0.031
3.389AspAsp: 3.389 ± 0.106
4.808AspGlu: 4.808 ± 0.097
3.391AspPhe: 3.391 ± 0.089
3.55AspGly: 3.55 ± 0.109
0.93AspHis: 0.93 ± 0.037
6.035AspIle: 6.035 ± 0.104
4.467AspLys: 4.467 ± 0.104
6.063AspLeu: 6.063 ± 0.116
1.473AspMet: 1.473 ± 0.045
3.398AspAsn: 3.398 ± 0.092
1.644AspPro: 1.644 ± 0.065
1.459AspGln: 1.459 ± 0.046
1.913AspArg: 1.913 ± 0.06
3.256AspSer: 3.256 ± 0.094
3.204AspThr: 3.204 ± 0.093
4.01AspVal: 4.01 ± 0.091
0.414AspTrp: 0.414 ± 0.028
3.491AspTyr: 3.491 ± 0.091
0.002AspXaa: 0.002 ± 0.002
Glu
3.722GluAla: 3.722 ± 0.098
0.43GluCys: 0.43 ± 0.028
4.4GluAsp: 4.4 ± 0.089
6.009GluGlu: 6.009 ± 0.137
3.524GluPhe: 3.524 ± 0.084
3.858GluGly: 3.858 ± 0.098
1.14GluHis: 1.14 ± 0.047
6.911GluIle: 6.911 ± 0.112
5.369GluLys: 5.369 ± 0.124
7.422GluLeu: 7.422 ± 0.131
2.204GluMet: 2.204 ± 0.06
4.339GluAsn: 4.339 ± 0.096
1.504GluPro: 1.504 ± 0.071
1.84GluGln: 1.84 ± 0.056
2.541GluArg: 2.541 ± 0.077
3.536GluSer: 3.536 ± 0.079
3.898GluThr: 3.898 ± 0.092
5.364GluVal: 5.364 ± 0.108
0.423GluTrp: 0.423 ± 0.028
3.697GluTyr: 3.697 ± 0.084
0.0GluXaa: 0.0 ± 0.0
Phe
2.417PheAla: 2.417 ± 0.077
0.346PheCys: 0.346 ± 0.028
3.606PheAsp: 3.606 ± 0.09
3.461PheGlu: 3.461 ± 0.076
2.448PhePhe: 2.448 ± 0.072
3.326PheGly: 3.326 ± 0.094
0.728PheHis: 0.728 ± 0.039
4.876PheIle: 4.876 ± 0.11
3.475PheLys: 3.475 ± 0.08
4.363PheLeu: 4.363 ± 0.109
1.191PheMet: 1.191 ± 0.045
3.073PheAsn: 3.073 ± 0.073
1.273PhePro: 1.273 ± 0.046
1.027PheGln: 1.027 ± 0.042
1.525PheArg: 1.525 ± 0.046
3.283PheSer: 3.283 ± 0.076
3.081PheThr: 3.081 ± 0.075
3.526PheVal: 3.526 ± 0.091
0.345PheTrp: 0.345 ± 0.029
2.064PheTyr: 2.064 ± 0.072
0.0PheXaa: 0.0 ± 0.0
Gly
3.468GlyAla: 3.468 ± 0.107
0.635GlyCys: 0.635 ± 0.033
3.339GlyAsp: 3.339 ± 0.088
3.832GlyGlu: 3.832 ± 0.095
3.438GlyPhe: 3.438 ± 0.092
3.998GlyGly: 3.998 ± 0.107
1.13GlyHis: 1.13 ± 0.047
6.049GlyIle: 6.049 ± 0.126
4.344GlyLys: 4.344 ± 0.091
5.867GlyLeu: 5.867 ± 0.125
1.737GlyMet: 1.737 ± 0.072
3.155GlyAsn: 3.155 ± 0.074
1.357GlyPro: 1.357 ± 0.055
1.264GlyGln: 1.264 ± 0.05
2.158GlyArg: 2.158 ± 0.062
3.52GlySer: 3.52 ± 0.087
3.701GlyThr: 3.701 ± 0.09
4.856GlyVal: 4.856 ± 0.085
0.481GlyTrp: 0.481 ± 0.034
3.225GlyTyr: 3.225 ± 0.096
0.0GlyXaa: 0.0 ± 0.0
His
0.866HisAla: 0.866 ± 0.044
0.159HisCys: 0.159 ± 0.019
0.944HisAsp: 0.944 ± 0.042
1.109HisGlu: 1.109 ± 0.042
0.824HisPhe: 0.824 ± 0.038
1.097HisGly: 1.097 ± 0.048
0.395HisHis: 0.395 ± 0.029
1.591HisIle: 1.591 ± 0.047
1.245HisLys: 1.245 ± 0.049
1.618HisLeu: 1.618 ± 0.053
0.418HisMet: 0.418 ± 0.026
1.0HisAsn: 1.0 ± 0.045
0.743HisPro: 0.743 ± 0.038
0.493HisGln: 0.493 ± 0.029
0.675HisArg: 0.675 ± 0.035
1.009HisSer: 1.009 ± 0.039
0.925HisThr: 0.925 ± 0.043
1.004HisVal: 1.004 ± 0.039
0.101HisTrp: 0.101 ± 0.012
0.776HisTyr: 0.776 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
5.367IleAla: 5.367 ± 0.104
0.729IleCys: 0.729 ± 0.036
6.175IleAsp: 6.175 ± 0.11
6.798IleGlu: 6.798 ± 0.126
4.222IlePhe: 4.222 ± 0.099
5.934IleGly: 5.934 ± 0.107
1.467IleHis: 1.467 ± 0.053
8.818IleIle: 8.818 ± 0.174
7.223IleLys: 7.223 ± 0.149
8.256IleLeu: 8.256 ± 0.141
2.204IleMet: 2.204 ± 0.064
5.764IleAsn: 5.764 ± 0.122
3.069IlePro: 3.069 ± 0.08
2.071IleGln: 2.071 ± 0.058
2.961IleArg: 2.961 ± 0.084
6.229IleSer: 6.229 ± 0.133
5.734IleThr: 5.734 ± 0.117
6.444IleVal: 6.444 ± 0.113
0.486IleTrp: 0.486 ± 0.029
3.833IleTyr: 3.833 ± 0.088
0.0IleXaa: 0.0 ± 0.0
Lys
4.031LysAla: 4.031 ± 0.105
0.472LysCys: 0.472 ± 0.03
5.049LysAsp: 5.049 ± 0.113
7.228LysGlu: 7.228 ± 0.144
2.767LysPhe: 2.767 ± 0.07
4.043LysGly: 4.043 ± 0.092
1.357LysHis: 1.357 ± 0.053
6.061LysIle: 6.061 ± 0.124
6.812LysLys: 6.812 ± 0.148
6.857LysLeu: 6.857 ± 0.132
2.291LysMet: 2.291 ± 0.065
4.393LysAsn: 4.393 ± 0.11
1.868LysPro: 1.868 ± 0.071
2.457LysGln: 2.457 ± 0.08
3.354LysArg: 3.354 ± 0.099
4.124LysSer: 4.124 ± 0.089
4.202LysThr: 4.202 ± 0.086
5.267LysVal: 5.267 ± 0.101
0.561LysTrp: 0.561 ± 0.034
3.926LysTyr: 3.926 ± 0.098
0.0LysXaa: 0.0 ± 0.0
Leu
5.274LeuAla: 5.274 ± 0.117
0.633LeuCys: 0.633 ± 0.034
5.918LeuAsp: 5.918 ± 0.126
7.174LeuGlu: 7.174 ± 0.133
4.918LeuPhe: 4.918 ± 0.107
6.359LeuGly: 6.359 ± 0.114
1.555LeuHis: 1.555 ± 0.047
8.363LeuIle: 8.363 ± 0.146
6.941LeuLys: 6.941 ± 0.138
9.321LeuLeu: 9.321 ± 0.149
2.287LeuMet: 2.287 ± 0.06
5.477LeuAsn: 5.477 ± 0.126
2.861LeuPro: 2.861 ± 0.081
2.24LeuGln: 2.24 ± 0.056
3.408LeuArg: 3.408 ± 0.083
6.756LeuSer: 6.756 ± 0.125
5.325LeuThr: 5.325 ± 0.124
6.674LeuVal: 6.674 ± 0.122
0.607LeuTrp: 0.607 ± 0.039
4.049LeuTyr: 4.049 ± 0.09
0.0LeuXaa: 0.0 ± 0.0
Met
1.394MetAla: 1.394 ± 0.056
0.187MetCys: 0.187 ± 0.02
1.366MetAsp: 1.366 ± 0.045
1.446MetGlu: 1.446 ± 0.058
1.284MetPhe: 1.284 ± 0.049
1.579MetGly: 1.579 ± 0.054
0.369MetHis: 0.369 ± 0.023
2.391MetIle: 2.391 ± 0.075
2.413MetLys: 2.413 ± 0.074
2.221MetLeu: 2.221 ± 0.064
0.719MetMet: 0.719 ± 0.039
1.56MetAsn: 1.56 ± 0.053
0.841MetPro: 0.841 ± 0.037
0.586MetGln: 0.586 ± 0.034
0.99MetArg: 0.99 ± 0.039
1.675MetSer: 1.675 ± 0.057
1.308MetThr: 1.308 ± 0.046
1.625MetVal: 1.625 ± 0.056
0.147MetTrp: 0.147 ± 0.015
1.028MetTyr: 1.028 ± 0.044
0.002MetXaa: 0.002 ± 0.002
Asn
2.94AsnAla: 2.94 ± 0.073
0.421AsnCys: 0.421 ± 0.032
3.569AsnAsp: 3.569 ± 0.093
4.559AsnGlu: 4.559 ± 0.101
2.483AsnPhe: 2.483 ± 0.06
3.176AsnGly: 3.176 ± 0.077
1.069AsnHis: 1.069 ± 0.04
5.57AsnIle: 5.57 ± 0.127
4.727AsnLys: 4.727 ± 0.114
5.245AsnLeu: 5.245 ± 0.119
1.31AsnMet: 1.31 ± 0.046
3.678AsnAsn: 3.678 ± 0.122
2.135AsnPro: 2.135 ± 0.052
1.744AsnGln: 1.744 ± 0.059
2.039AsnArg: 2.039 ± 0.063
3.246AsnSer: 3.246 ± 0.085
3.097AsnThr: 3.097 ± 0.068
3.786AsnVal: 3.786 ± 0.081
0.404AsnTrp: 0.404 ± 0.027
2.88AsnTyr: 2.88 ± 0.084
0.0AsnXaa: 0.0 ± 0.0
Pro
1.264ProAla: 1.264 ± 0.046
0.22ProCys: 0.22 ± 0.018
1.513ProAsp: 1.513 ± 0.061
2.162ProGlu: 2.162 ± 0.073
1.567ProPhe: 1.567 ± 0.049
1.737ProGly: 1.737 ± 0.062
0.656ProHis: 0.656 ± 0.033
2.58ProIle: 2.58 ± 0.065
2.174ProLys: 2.174 ± 0.072
2.705ProLeu: 2.705 ± 0.068
0.626ProMet: 0.626 ± 0.032
1.689ProAsn: 1.689 ± 0.056
0.542ProPro: 0.542 ± 0.034
0.661ProGln: 0.661 ± 0.036
0.913ProArg: 0.913 ± 0.043
1.754ProSer: 1.754 ± 0.05
1.754ProThr: 1.754 ± 0.063
2.09ProVal: 2.09 ± 0.069
0.203ProTrp: 0.203 ± 0.018
1.364ProTyr: 1.364 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
1.476GlnAla: 1.476 ± 0.05
0.159GlnCys: 0.159 ± 0.016
1.403GlnAsp: 1.403 ± 0.051
1.994GlnGlu: 1.994 ± 0.064
1.221GlnPhe: 1.221 ± 0.045
1.513GlnGly: 1.513 ± 0.056
0.357GlnHis: 0.357 ± 0.023
2.256GlnIle: 2.256 ± 0.071
2.1GlnLys: 2.1 ± 0.064
2.345GlnLeu: 2.345 ± 0.058
0.623GlnMet: 0.623 ± 0.033
1.387GlnAsn: 1.387 ± 0.053
0.575GlnPro: 0.575 ± 0.03
0.631GlnGln: 0.631 ± 0.036
0.988GlnArg: 0.988 ± 0.049
1.41GlnSer: 1.41 ± 0.055
1.319GlnThr: 1.319 ± 0.047
1.861GlnVal: 1.861 ± 0.053
0.157GlnTrp: 0.157 ± 0.018
1.083GlnTyr: 1.083 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
1.714ArgAla: 1.714 ± 0.059
0.264ArgCys: 0.264 ± 0.022
2.106ArgAsp: 2.106 ± 0.064
2.457ArgGlu: 2.457 ± 0.067
1.803ArgPhe: 1.803 ± 0.052
1.938ArgGly: 1.938 ± 0.071
0.647ArgHis: 0.647 ± 0.037
3.391ArgIle: 3.391 ± 0.075
3.155ArgLys: 3.155 ± 0.081
3.389ArgLeu: 3.389 ± 0.084
0.918ArgMet: 0.918 ± 0.039
2.212ArgAsn: 2.212 ± 0.065
0.895ArgPro: 0.895 ± 0.037
1.023ArgGln: 1.023 ± 0.048
1.616ArgArg: 1.616 ± 0.067
1.89ArgSer: 1.89 ± 0.057
2.008ArgThr: 2.008 ± 0.056
2.426ArgVal: 2.426 ± 0.081
0.194ArgTrp: 0.194 ± 0.017
1.64ArgTyr: 1.64 ± 0.058
0.0ArgXaa: 0.0 ± 0.0
Ser
2.844SerAla: 2.844 ± 0.065
0.477SerCys: 0.477 ± 0.032
3.326SerAsp: 3.326 ± 0.099
3.762SerGlu: 3.762 ± 0.07
3.298SerPhe: 3.298 ± 0.089
4.19SerGly: 4.19 ± 0.099
1.067SerHis: 1.067 ± 0.046
5.804SerIle: 5.804 ± 0.109
4.989SerLys: 4.989 ± 0.107
6.025SerLeu: 6.025 ± 0.106
1.42SerMet: 1.42 ± 0.048
3.77SerAsn: 3.77 ± 0.096
1.553SerPro: 1.553 ± 0.059
1.382SerGln: 1.382 ± 0.048
2.289SerArg: 2.289 ± 0.062
4.0SerSer: 4.0 ± 0.112
3.307SerThr: 3.307 ± 0.077
4.087SerVal: 4.087 ± 0.083
0.463SerTrp: 0.463 ± 0.032
2.84SerTyr: 2.84 ± 0.072
0.0SerXaa: 0.0 ± 0.0
Thr
2.775ThrAla: 2.775 ± 0.082
0.434ThrCys: 0.434 ± 0.027
3.195ThrAsp: 3.195 ± 0.103
3.559ThrGlu: 3.559 ± 0.098
2.924ThrPhe: 2.924 ± 0.082
3.652ThrGly: 3.652 ± 0.075
1.051ThrHis: 1.051 ± 0.051
5.565ThrIle: 5.565 ± 0.108
4.236ThrLys: 4.236 ± 0.087
5.6ThrLeu: 5.6 ± 0.122
1.231ThrMet: 1.231 ± 0.05
3.076ThrAsn: 3.076 ± 0.08
1.974ThrPro: 1.974 ± 0.062
1.201ThrGln: 1.201 ± 0.051
1.925ThrArg: 1.925 ± 0.064
3.499ThrSer: 3.499 ± 0.101
3.456ThrThr: 3.456 ± 0.122
4.171ThrVal: 4.171 ± 0.09
0.421ThrTrp: 0.421 ± 0.031
2.637ThrTyr: 2.637 ± 0.09
0.0ThrXaa: 0.0 ± 0.0
Val
3.853ValAla: 3.853 ± 0.086
0.654ValCys: 0.654 ± 0.038
4.356ValAsp: 4.356 ± 0.101
4.832ValGlu: 4.832 ± 0.1
3.534ValPhe: 3.534 ± 0.081
4.327ValGly: 4.327 ± 0.106
1.009ValHis: 1.009 ± 0.042
6.878ValIle: 6.878 ± 0.123
4.918ValLys: 4.918 ± 0.096
7.125ValLeu: 7.125 ± 0.11
1.728ValMet: 1.728 ± 0.061
3.664ValAsn: 3.664 ± 0.089
2.034ValPro: 2.034 ± 0.07
1.48ValGln: 1.48 ± 0.052
2.319ValArg: 2.319 ± 0.065
4.58ValSer: 4.58 ± 0.085
3.97ValThr: 3.97 ± 0.098
5.299ValVal: 5.299 ± 0.126
0.413ValTrp: 0.413 ± 0.026
2.886ValTyr: 2.886 ± 0.072
0.0ValXaa: 0.0 ± 0.0
Trp
0.39TrpAla: 0.39 ± 0.027
0.042TrpCys: 0.042 ± 0.01
0.47TrpAsp: 0.47 ± 0.033
0.406TrpGlu: 0.406 ± 0.027
0.367TrpPhe: 0.367 ± 0.027
0.535TrpGly: 0.535 ± 0.036
0.133TrpHis: 0.133 ± 0.014
0.572TrpIle: 0.572 ± 0.036
0.348TrpLys: 0.348 ± 0.025
0.658TrpLeu: 0.658 ± 0.04
0.159TrpMet: 0.159 ± 0.018
0.416TrpAsn: 0.416 ± 0.028
0.196TrpPro: 0.196 ± 0.02
0.213TrpGln: 0.213 ± 0.02
0.189TrpArg: 0.189 ± 0.018
0.325TrpSer: 0.325 ± 0.026
0.334TrpThr: 0.334 ± 0.031
0.425TrpVal: 0.425 ± 0.033
0.082TrpTrp: 0.082 ± 0.013
0.322TrpTyr: 0.322 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.155TyrAla: 2.155 ± 0.061
0.324TyrCys: 0.324 ± 0.02
3.171TyrAsp: 3.171 ± 0.087
3.23TyrGlu: 3.23 ± 0.084
2.392TyrPhe: 2.392 ± 0.063
2.628TyrGly: 2.628 ± 0.072
0.843TyrHis: 0.843 ± 0.037
4.12TyrIle: 4.12 ± 0.089
3.443TyrLys: 3.443 ± 0.081
4.648TyrLeu: 4.648 ± 0.1
1.028TyrMet: 1.028 ± 0.042
2.915TyrAsn: 2.915 ± 0.079
1.382TyrPro: 1.382 ± 0.052
1.602TyrGln: 1.602 ± 0.058
1.661TyrArg: 1.661 ± 0.057
3.085TyrSer: 3.085 ± 0.078
2.59TyrThr: 2.59 ± 0.076
2.754TyrVal: 2.754 ± 0.063
0.29TyrTrp: 0.29 ± 0.025
2.282TyrTyr: 2.282 ± 0.077
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.002XaaSer: 0.002 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.002XaaVal: 0.002 ± 0.002
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1811 proteins (571813 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski