Amino acid dipeptide frequency for Aurantiacibacter zhengii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
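The counting behind such a table can be sketched in Python. This is only an illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify`, the one-letter input alphabet, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000 / len(STANDARD) ** 2  # 1/400 = 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping adjacent pairs are counted inside each protein.
    """
    counts = Counter()
    for seq in proteins:
        # Map every symbol outside the standard alphabet to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    pairs = ("".join(p) for p in product(STANDARD + "X", repeat=2))
    return {pair: 1000 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

Under this sketch, `classify(15.955)` (the AlaAla value) returns "over" and `classify(0.105)` (the CysCys value) returns "under".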

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
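As a small worked example of this unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value / 1000


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1% = 10 per mille)."""
    return value / 10
```

For instance, the AlaAla value of 15.955‰ corresponds to a fraction of 0.015955, or roughly 1.6%, while the uniform expectation of 2.5‰ corresponds to 0.25%.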

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 15.955 ± 0.177 | 1.116 ± 0.036 | 7.429 ± 0.082 | 8.187 ± 0.1 | 4.242 ± 0.064 | 10.447 ± 0.145 | 2.088 ± 0.054 | 6.606 ± 0.086 | 3.754 ± 0.08 | 12.924 ± 0.138 | 3.849 ± 0.065 | 3.429 ± 0.069 | 5.357 ± 0.082 | 4.518 ± 0.076 | 8.771 ± 0.12 | 6.81 ± 0.144 | 5.963 ± 0.101 | 7.935 ± 0.11 | 1.571 ± 0.041 | 2.471 ± 0.053 | 0.0 ± 0.0
Cys | 0.981 ± 0.032 | 0.105 ± 0.01 | 0.621 ± 0.026 | 0.576 ± 0.023 | 0.304 ± 0.016 | 0.862 ± 0.027 | 0.226 ± 0.015 | 0.36 ± 0.016 | 0.201 ± 0.014 | 0.698 ± 0.026 | 0.158 ± 0.012 | 0.26 ± 0.018 | 0.441 ± 0.022 | 0.231 ± 0.014 | 0.592 ± 0.024 | 0.485 ± 0.022 | 0.391 ± 0.02 | 0.55 ± 0.023 | 0.125 ± 0.01 | 0.165 ± 0.012 | 0.0 ± 0.0
Asp | 7.593 ± 0.105 | 0.545 ± 0.022 | 3.857 ± 0.077 | 4.319 ± 0.065 | 2.423 ± 0.051 | 5.895 ± 0.103 | 1.306 ± 0.042 | 3.191 ± 0.061 | 1.697 ± 0.041 | 5.971 ± 0.087 | 1.651 ± 0.041 | 1.699 ± 0.048 | 3.796 ± 0.075 | 1.789 ± 0.038 | 4.59 ± 0.076 | 2.718 ± 0.066 | 3.143 ± 0.07 | 4.176 ± 0.084 | 1.2 ± 0.033 | 1.734 ± 0.044 | 0.0 ± 0.0
Glu | 8.123 ± 0.105 | 0.404 ± 0.019 | 3.854 ± 0.069 | 4.292 ± 0.082 | 2.068 ± 0.046 | 5.405 ± 0.079 | 1.313 ± 0.032 | 3.25 ± 0.052 | 2.356 ± 0.059 | 5.784 ± 0.084 | 1.72 ± 0.047 | 1.962 ± 0.042 | 3.024 ± 0.056 | 2.518 ± 0.058 | 5.327 ± 0.078 | 2.708 ± 0.048 | 3.592 ± 0.056 | 4.229 ± 0.067 | 1.009 ± 0.03 | 1.28 ± 0.039 | 0.0 ± 0.0
Phe | 4.993 ± 0.065 | 0.351 ± 0.019 | 2.884 ± 0.057 | 2.267 ± 0.047 | 1.361 ± 0.04 | 3.6 ± 0.06 | 0.702 ± 0.022 | 1.625 ± 0.041 | 0.776 ± 0.027 | 3.099 ± 0.063 | 0.799 ± 0.027 | 1.027 ± 0.031 | 1.554 ± 0.041 | 0.943 ± 0.033 | 2.086 ± 0.042 | 2.144 ± 0.047 | 2.156 ± 0.069 | 2.744 ± 0.048 | 0.602 ± 0.026 | 0.98 ± 0.034 | 0.0 ± 0.0
Gly | 9.163 ± 0.131 | 0.821 ± 0.028 | 5.325 ± 0.106 | 6.047 ± 0.079 | 3.686 ± 0.059 | 8.094 ± 0.146 | 1.794 ± 0.047 | 4.335 ± 0.074 | 3.157 ± 0.064 | 8.256 ± 0.105 | 2.507 ± 0.056 | 2.704 ± 0.072 | 3.5 ± 0.06 | 3.074 ± 0.058 | 5.729 ± 0.073 | 5.251 ± 0.1 | 4.91 ± 0.102 | 6.121 ± 0.085 | 1.588 ± 0.037 | 2.276 ± 0.043 | 0.0 ± 0.0
His | 2.211 ± 0.051 | 0.245 ± 0.016 | 1.227 ± 0.036 | 1.17 ± 0.033 | 0.813 ± 0.027 | 1.832 ± 0.041 | 0.529 ± 0.026 | 0.854 ± 0.029 | 0.461 ± 0.023 | 1.825 ± 0.043 | 0.433 ± 0.018 | 0.486 ± 0.024 | 1.166 ± 0.038 | 0.514 ± 0.022 | 1.421 ± 0.04 | 0.973 ± 0.029 | 0.85 ± 0.028 | 1.36 ± 0.033 | 0.336 ± 0.019 | 0.572 ± 0.024 | 0.0 ± 0.0
Ile | 7.382 ± 0.086 | 0.438 ± 0.019 | 3.782 ± 0.068 | 3.733 ± 0.064 | 1.594 ± 0.038 | 5.034 ± 0.075 | 0.8 ± 0.028 | 2.047 ± 0.048 | 1.072 ± 0.036 | 3.667 ± 0.063 | 0.937 ± 0.033 | 1.281 ± 0.032 | 2.301 ± 0.044 | 1.104 ± 0.031 | 3.009 ± 0.069 | 2.724 ± 0.05 | 2.698 ± 0.055 | 3.854 ± 0.059 | 0.587 ± 0.021 | 1.096 ± 0.025 | 0.0 ± 0.0
Lys | 3.564 ± 0.075 | 0.183 ± 0.012 | 1.612 ± 0.046 | 1.401 ± 0.044 | 0.825 ± 0.031 | 2.418 ± 0.05 | 0.59 ± 0.024 | 1.403 ± 0.041 | 1.187 ± 0.045 | 3.133 ± 0.068 | 0.752 ± 0.028 | 0.742 ± 0.029 | 1.753 ± 0.045 | 0.925 ± 0.031 | 2.172 ± 0.056 | 1.538 ± 0.044 | 1.511 ± 0.045 | 2.146 ± 0.052 | 0.382 ± 0.019 | 0.578 ± 0.026 | 0.0 ± 0.0
Leu | 13.759 ± 0.155 | 0.743 ± 0.029 | 6.215 ± 0.089 | 5.83 ± 0.076 | 3.44 ± 0.06 | 8.221 ± 0.103 | 1.708 ± 0.043 | 4.289 ± 0.066 | 2.705 ± 0.058 | 9.247 ± 0.131 | 2.155 ± 0.052 | 2.351 ± 0.055 | 5.409 ± 0.071 | 2.684 ± 0.047 | 6.459 ± 0.092 | 5.928 ± 0.077 | 5.252 ± 0.085 | 7.414 ± 0.096 | 1.187 ± 0.036 | 1.928 ± 0.037 | 0.0 ± 0.0
Met | 3.403 ± 0.061 | 0.159 ± 0.013 | 1.334 ± 0.038 | 1.395 ± 0.037 | 0.772 ± 0.029 | 2.059 ± 0.05 | 0.464 ± 0.023 | 1.253 ± 0.039 | 0.879 ± 0.03 | 2.709 ± 0.053 | 0.675 ± 0.031 | 0.731 ± 0.021 | 1.473 ± 0.039 | 0.873 ± 0.025 | 1.84 ± 0.045 | 1.46 ± 0.037 | 1.634 ± 0.034 | 1.765 ± 0.043 | 0.255 ± 0.015 | 0.296 ± 0.016 | 0.0 ± 0.0
Asn | 3.351 ± 0.075 | 0.272 ± 0.018 | 1.569 ± 0.038 | 1.434 ± 0.036 | 1.018 ± 0.034 | 2.535 ± 0.067 | 0.495 ± 0.023 | 1.336 ± 0.033 | 0.64 ± 0.024 | 2.646 ± 0.053 | 0.652 ± 0.026 | 0.739 ± 0.032 | 1.898 ± 0.045 | 0.781 ± 0.029 | 2.042 ± 0.045 | 1.525 ± 0.044 | 1.43 ± 0.045 | 1.995 ± 0.05 | 0.494 ± 0.022 | 0.749 ± 0.029 | 0.0 ± 0.0
Pro | 6.067 ± 0.092 | 0.344 ± 0.018 | 3.914 ± 0.067 | 3.976 ± 0.069 | 1.881 ± 0.043 | 4.595 ± 0.074 | 1.019 ± 0.035 | 2.283 ± 0.051 | 1.333 ± 0.042 | 4.673 ± 0.062 | 1.16 ± 0.035 | 1.254 ± 0.033 | 2.577 ± 0.075 | 1.849 ± 0.042 | 2.859 ± 0.059 | 2.656 ± 0.049 | 2.329 ± 0.046 | 4.069 ± 0.061 | 0.683 ± 0.028 | 1.107 ± 0.035 | 0.0 ± 0.0
Gln | 4.073 ± 0.075 | 0.254 ± 0.016 | 1.851 ± 0.042 | 1.85 ± 0.04 | 1.227 ± 0.03 | 2.616 ± 0.051 | 0.64 ± 0.024 | 1.705 ± 0.039 | 0.885 ± 0.034 | 3.313 ± 0.065 | 0.926 ± 0.027 | 0.83 ± 0.029 | 1.842 ± 0.042 | 1.354 ± 0.041 | 2.461 ± 0.051 | 1.838 ± 0.045 | 1.637 ± 0.044 | 2.506 ± 0.047 | 0.43 ± 0.02 | 0.729 ± 0.024 | 0.0 ± 0.0
Arg | 7.616 ± 0.109 | 0.474 ± 0.024 | 4.337 ± 0.077 | 4.947 ± 0.079 | 2.919 ± 0.055 | 4.887 ± 0.073 | 1.545 ± 0.044 | 3.759 ± 0.069 | 2.253 ± 0.051 | 7.261 ± 0.096 | 1.903 ± 0.043 | 1.985 ± 0.047 | 3.131 ± 0.058 | 2.631 ± 0.048 | 5.295 ± 0.094 | 3.607 ± 0.072 | 3.289 ± 0.056 | 4.661 ± 0.07 | 1.109 ± 0.03 | 1.749 ± 0.047 | 0.0 ± 0.0
Ser | 6.39 ± 0.123 | 0.471 ± 0.023 | 3.544 ± 0.084 | 3.176 ± 0.053 | 2.183 ± 0.057 | 5.85 ± 0.127 | 0.991 ± 0.032 | 2.631 ± 0.05 | 1.412 ± 0.04 | 5.338 ± 0.089 | 1.337 ± 0.039 | 1.559 ± 0.038 | 2.733 ± 0.05 | 1.869 ± 0.038 | 3.585 ± 0.067 | 3.048 ± 0.076 | 2.767 ± 0.071 | 3.699 ± 0.085 | 0.803 ± 0.029 | 1.342 ± 0.037 | 0.0 ± 0.0
Thr | 6.106 ± 0.123 | 0.409 ± 0.022 | 3.079 ± 0.065 | 2.61 ± 0.053 | 1.976 ± 0.053 | 5.319 ± 0.097 | 0.897 ± 0.029 | 2.812 ± 0.054 | 1.231 ± 0.035 | 5.406 ± 0.09 | 1.293 ± 0.03 | 1.419 ± 0.043 | 3.113 ± 0.056 | 1.663 ± 0.04 | 3.395 ± 0.061 | 2.879 ± 0.073 | 2.691 ± 0.059 | 4.227 ± 0.084 | 0.672 ± 0.025 | 1.322 ± 0.038 | 0.0 ± 0.0
Val | 8.785 ± 0.108 | 0.609 ± 0.024 | 4.438 ± 0.076 | 4.879 ± 0.068 | 2.454 ± 0.06 | 5.593 ± 0.09 | 1.299 ± 0.03 | 3.789 ± 0.065 | 1.857 ± 0.054 | 6.972 ± 0.094 | 1.716 ± 0.04 | 1.997 ± 0.052 | 3.811 ± 0.059 | 2.194 ± 0.04 | 4.576 ± 0.062 | 4.231 ± 0.085 | 4.357 ± 0.088 | 5.266 ± 0.078 | 0.897 ± 0.031 | 1.349 ± 0.036 | 0.0 ± 0.0
Trp | 1.355 ± 0.042 | 0.122 ± 0.011 | 0.786 ± 0.028 | 0.708 ± 0.026 | 0.62 ± 0.024 | 0.971 ± 0.027 | 0.397 ± 0.02 | 0.708 ± 0.026 | 0.468 ± 0.02 | 1.794 ± 0.051 | 0.378 ± 0.02 | 0.485 ± 0.018 | 0.707 ± 0.027 | 0.731 ± 0.031 | 1.265 ± 0.034 | 0.88 ± 0.027 | 0.793 ± 0.033 | 0.844 ± 0.031 | 0.265 ± 0.014 | 0.31 ± 0.017 | 0.0 ± 0.0
Tyr | 2.553 ± 0.053 | 0.268 ± 0.017 | 1.634 ± 0.038 | 1.421 ± 0.036 | 0.885 ± 0.031 | 2.105 ± 0.045 | 0.473 ± 0.022 | 0.906 ± 0.028 | 0.49 ± 0.023 | 2.208 ± 0.046 | 0.428 ± 0.02 | 0.661 ± 0.023 | 1.043 ± 0.034 | 0.721 ± 0.026 | 1.884 ± 0.041 | 1.322 ± 0.04 | 1.126 ± 0.04 | 1.51 ± 0.036 | 0.392 ± 0.019 | 0.636 ± 0.024 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 3511 proteins (1132093 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
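A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The names `pair_count` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_count(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation over n_boot replicates,
    each built by resampling whole proteins with replacement.
    """
    if not proteins:
        raise ValueError("at least one protein sequence is required")
    rng = random.Random(seed)
    # Pre-compute (hits, number of pairs) per protein so replicates are cheap.
    stats = [(pair_count(s, dipeptide), max(len(s) - 1, 0)) for s in proteins]
    replicates = []
    for _ in range(n_boot):
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        replicates.append(1000 * hits / total if total else 0.0)
    return statistics.pstdev(replicates)
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, which is one plausible reading of "at the protein level".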

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski