Amino acid dipeptide frequency for Gimesia maris DSM 8797

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
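
As an illustration of the counting described above, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein and that any residue outside the 20 standard ones is mapped to Xaa (written as X in one-letter code); the page itself does not spell out these details. The function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from the proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs, counted within one protein only,
        # so no pair ever spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }

# Hypothetical toy input for illustration only.
toy = ["MKLLA", "MLLG"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform = 1000.0 / 400
print(len(freqs), freqs["LL"], uniform)
```

In the toy input, LL accounts for 2 of the 7 counted pairs. Real proteomes contain enough pairs for the per mille values to be compared meaningfully with the uniform expectation of 2.5‰.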

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
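
The unit conversion can be sketched in a few lines of Python. The four values below are copied from the table on this page; the helper names `to_fraction` and `representation` are hypothetical, and using the uniform 2.5‰ expectation as the cut-off between over- and underrepresented dipeptides is an assumption, since the exact colouring rule is not stated here.

```python
# Per mille values copied from the table (first residue, then second residue).
table_per_mille = {
    "AlaAla": 8.33,
    "CysCys": 0.257,
    "LeuLeu": 10.63,
    "ProPro": 2.403,
}

# Uniform expectation for 400 equally likely dipeptides:
# 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

def representation(per_mille):
    """Label a value against the uniform expectation (assumed cut-off).

    Values exactly equal to the cut-off are labelled "under" in this sketch.
    """
    return "over" if per_mille > UNIFORM_PER_MILLE else "under"

for name, value in table_per_mille.items():
    print(f"{name}: {value} per mille = {to_fraction(value):.6f} "
          f"({representation(value)})")
```

With this cut-off, AlaAla and LeuLeu fall above the uniform expectation, while CysCys and ProPro fall below it.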

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.33AlaAla: 8.33 ± 0.089
1.017AlaCys: 1.017 ± 0.023
4.97AlaAsp: 4.97 ± 0.06
6.036AlaGlu: 6.036 ± 0.068
3.032AlaPhe: 3.032 ± 0.035
6.993AlaGly: 6.993 ± 0.072
1.54AlaHis: 1.54 ± 0.034
4.863AlaIle: 4.863 ± 0.051
3.392AlaLys: 3.392 ± 0.047
7.563AlaLeu: 7.563 ± 0.082
1.832AlaMet: 1.832 ± 0.031
2.707AlaAsn: 2.707 ± 0.047
3.356AlaPro: 3.356 ± 0.047
2.984AlaGln: 2.984 ± 0.043
4.475AlaArg: 4.475 ± 0.054
4.903AlaSer: 4.903 ± 0.048
4.489AlaThr: 4.489 ± 0.071
5.744AlaVal: 5.744 ± 0.055
1.063AlaTrp: 1.063 ± 0.022
2.066AlaTyr: 2.066 ± 0.032
0.0AlaXaa: 0.0 ± 0.0
Cys
0.736CysAla: 0.736 ± 0.02
0.257CysCys: 0.257 ± 0.013
0.597CysAsp: 0.597 ± 0.018
0.698CysGlu: 0.698 ± 0.023
0.572CysPhe: 0.572 ± 0.018
0.935CysGly: 0.935 ± 0.021
0.428CysHis: 0.428 ± 0.015
0.564CysIle: 0.564 ± 0.017
0.408CysLys: 0.408 ± 0.015
1.372CysLeu: 1.372 ± 0.031
0.225CysMet: 0.225 ± 0.01
0.347CysAsn: 0.347 ± 0.013
0.582CysPro: 0.582 ± 0.017
0.498CysGln: 0.498 ± 0.017
0.652CysArg: 0.652 ± 0.018
0.773CysSer: 0.773 ± 0.02
0.53CysThr: 0.53 ± 0.017
0.721CysVal: 0.721 ± 0.02
0.192CysTrp: 0.192 ± 0.01
0.372CysTyr: 0.372 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.422AspAla: 4.422 ± 0.063
0.607AspCys: 0.607 ± 0.017
3.167AspAsp: 3.167 ± 0.072
3.736AspGlu: 3.736 ± 0.053
2.444AspPhe: 2.444 ± 0.038
4.301AspGly: 4.301 ± 0.1
1.384AspHis: 1.384 ± 0.029
2.779AspIle: 2.779 ± 0.043
2.155AspLys: 2.155 ± 0.038
6.042AspLeu: 6.042 ± 0.068
1.013AspMet: 1.013 ± 0.019
1.729AspAsn: 1.729 ± 0.041
3.241AspPro: 3.241 ± 0.053
3.179AspGln: 3.179 ± 0.05
3.228AspArg: 3.228 ± 0.044
3.644AspSer: 3.644 ± 0.06
2.598AspThr: 2.598 ± 0.071
3.486AspVal: 3.486 ± 0.053
1.04AspTrp: 1.04 ± 0.022
1.824AspTyr: 1.824 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
5.135GluAla: 5.135 ± 0.061
0.539GluCys: 0.539 ± 0.015
2.883GluAsp: 2.883 ± 0.042
4.244GluGlu: 4.244 ± 0.062
2.713GluPhe: 2.713 ± 0.037
3.668GluGly: 3.668 ± 0.049
1.364GluHis: 1.364 ± 0.027
4.302GluIle: 4.302 ± 0.044
3.766GluLys: 3.766 ± 0.054
7.141GluLeu: 7.141 ± 0.08
1.627GluMet: 1.627 ± 0.029
2.699GluAsn: 2.699 ± 0.038
2.737GluPro: 2.737 ± 0.043
3.609GluGln: 3.609 ± 0.056
3.514GluArg: 3.514 ± 0.052
4.246GluSer: 4.246 ± 0.05
4.085GluThr: 4.085 ± 0.055
4.098GluVal: 4.098 ± 0.049
0.92GluTrp: 0.92 ± 0.023
1.87GluTyr: 1.87 ± 0.036
0.0GluXaa: 0.0 ± 0.0
Phe
2.986PheAla: 2.986 ± 0.039
0.615PheCys: 0.615 ± 0.015
2.601PheAsp: 2.601 ± 0.044
2.551PheGlu: 2.551 ± 0.036
1.59PhePhe: 1.59 ± 0.032
2.933PheGly: 2.933 ± 0.04
0.941PheHis: 0.941 ± 0.027
2.072PheIle: 2.072 ± 0.032
1.536PheLys: 1.536 ± 0.03
4.186PheLeu: 4.186 ± 0.063
0.83PheMet: 0.83 ± 0.018
1.566PheAsn: 1.566 ± 0.037
1.877PhePro: 1.877 ± 0.036
1.953PheGln: 1.953 ± 0.034
2.21PheArg: 2.21 ± 0.036
3.041PheSer: 3.041 ± 0.043
2.398PheThr: 2.398 ± 0.062
2.531PheVal: 2.531 ± 0.037
0.653PheTrp: 0.653 ± 0.018
1.242PheTyr: 1.242 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
4.98GlyAla: 4.98 ± 0.061
0.952GlyCys: 0.952 ± 0.026
3.811GlyAsp: 3.811 ± 0.07
4.317GlyGlu: 4.317 ± 0.05
3.096GlyPhe: 3.096 ± 0.043
5.555GlyGly: 5.555 ± 0.102
1.547GlyHis: 1.547 ± 0.03
4.581GlyIle: 4.581 ± 0.054
4.357GlyLys: 4.357 ± 0.064
6.645GlyLeu: 6.645 ± 0.056
1.843GlyMet: 1.843 ± 0.033
2.936GlyAsn: 2.936 ± 0.073
2.756GlyPro: 2.756 ± 0.038
2.814GlyGln: 2.814 ± 0.039
3.77GlyArg: 3.77 ± 0.047
4.693GlySer: 4.693 ± 0.073
4.613GlyThr: 4.613 ± 0.123
4.593GlyVal: 4.593 ± 0.049
1.159GlyTrp: 1.159 ± 0.022
2.297GlyTyr: 2.297 ± 0.035
0.0GlyXaa: 0.0 ± 0.0
His
1.693HisAla: 1.693 ± 0.03
0.346HisCys: 0.346 ± 0.013
1.199HisAsp: 1.199 ± 0.025
1.267HisGlu: 1.267 ± 0.028
1.087HisPhe: 1.087 ± 0.025
1.516HisGly: 1.516 ± 0.034
0.662HisHis: 0.662 ± 0.023
1.088HisIle: 1.088 ± 0.023
0.785HisLys: 0.785 ± 0.02
2.288HisLeu: 2.288 ± 0.041
0.377HisMet: 0.377 ± 0.014
0.782HisAsn: 0.782 ± 0.019
1.47HisPro: 1.47 ± 0.03
1.089HisGln: 1.089 ± 0.025
1.273HisArg: 1.273 ± 0.028
1.461HisSer: 1.461 ± 0.03
1.059HisThr: 1.059 ± 0.024
1.301HisVal: 1.301 ± 0.024
0.48HisTrp: 0.48 ± 0.015
0.731HisTyr: 0.731 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
5.109IleAla: 5.109 ± 0.057
0.781IleCys: 0.781 ± 0.022
3.877IleAsp: 3.877 ± 0.056
3.954IleGlu: 3.954 ± 0.043
2.066IlePhe: 2.066 ± 0.035
3.883IleGly: 3.883 ± 0.051
1.305IleHis: 1.305 ± 0.024
2.958IleIle: 2.958 ± 0.044
2.444IleLys: 2.444 ± 0.039
5.602IleLeu: 5.602 ± 0.062
1.037IleMet: 1.037 ± 0.022
2.192IleAsn: 2.192 ± 0.04
3.109IlePro: 3.109 ± 0.041
2.61IleGln: 2.61 ± 0.036
3.354IleArg: 3.354 ± 0.045
4.366IleSer: 4.366 ± 0.052
3.455IleThr: 3.455 ± 0.075
3.763IleVal: 3.763 ± 0.044
0.811IleTrp: 0.811 ± 0.019
1.587IleTyr: 1.587 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
3.534LysAla: 3.534 ± 0.057
0.323LysCys: 0.323 ± 0.012
2.186LysAsp: 2.186 ± 0.035
2.941LysGlu: 2.941 ± 0.045
1.37LysPhe: 1.37 ± 0.026
2.683LysGly: 2.683 ± 0.047
1.018LysHis: 1.018 ± 0.026
2.73LysIle: 2.73 ± 0.039
2.982LysLys: 2.982 ± 0.049
4.739LysLeu: 4.739 ± 0.067
1.271LysMet: 1.271 ± 0.027
2.093LysAsn: 2.093 ± 0.035
2.807LysPro: 2.807 ± 0.049
2.992LysGln: 2.992 ± 0.05
2.694LysArg: 2.694 ± 0.042
3.171LysSer: 3.171 ± 0.045
3.176LysThr: 3.176 ± 0.041
2.86LysVal: 2.86 ± 0.046
0.592LysTrp: 0.592 ± 0.018
1.401LysTyr: 1.401 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
8.686LeuAla: 8.686 ± 0.082
1.213LeuCys: 1.213 ± 0.027
5.53LeuAsp: 5.53 ± 0.058
6.282LeuGlu: 6.282 ± 0.06
4.154LeuPhe: 4.154 ± 0.06
6.34LeuGly: 6.34 ± 0.065
1.939LeuHis: 1.939 ± 0.034
6.375LeuIle: 6.375 ± 0.07
5.957LeuLys: 5.957 ± 0.083
10.63LeuLeu: 10.63 ± 0.118
2.168LeuMet: 2.168 ± 0.039
4.158LeuAsn: 4.158 ± 0.053
5.283LeuPro: 5.283 ± 0.06
4.398LeuGln: 4.398 ± 0.062
4.826LeuArg: 4.826 ± 0.069
7.163LeuSer: 7.163 ± 0.062
6.122LeuThr: 6.122 ± 0.076
6.422LeuVal: 6.422 ± 0.055
1.284LeuTrp: 1.284 ± 0.028
2.458LeuTyr: 2.458 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
1.753MetAla: 1.753 ± 0.032
0.197MetCys: 0.197 ± 0.009
0.996MetAsp: 0.996 ± 0.023
1.108MetGlu: 1.108 ± 0.026
0.797MetPhe: 0.797 ± 0.019
1.428MetGly: 1.428 ± 0.03
0.444MetHis: 0.444 ± 0.014
1.425MetIle: 1.425 ± 0.029
1.298MetLys: 1.298 ± 0.029
2.322MetLeu: 2.322 ± 0.042
0.541MetMet: 0.541 ± 0.018
1.036MetAsn: 1.036 ± 0.021
1.105MetPro: 1.105 ± 0.023
1.136MetGln: 1.136 ± 0.025
1.172MetArg: 1.172 ± 0.028
1.555MetSer: 1.555 ± 0.027
1.432MetThr: 1.432 ± 0.025
1.359MetVal: 1.359 ± 0.027
0.216MetTrp: 0.216 ± 0.011
0.462MetTyr: 0.462 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.868AsnAla: 2.868 ± 0.041
0.449AsnCys: 0.449 ± 0.015
2.014AsnAsp: 2.014 ± 0.062
2.13AsnGlu: 2.13 ± 0.029
1.344AsnPhe: 1.344 ± 0.028
2.963AsnGly: 2.963 ± 0.077
0.922AsnHis: 0.922 ± 0.02
1.989AsnIle: 1.989 ± 0.037
1.366AsnLys: 1.366 ± 0.03
3.714AsnLeu: 3.714 ± 0.049
0.795AsnMet: 0.795 ± 0.019
1.448AsnAsn: 1.448 ± 0.042
2.387AsnPro: 2.387 ± 0.036
2.319AsnGln: 2.319 ± 0.035
2.363AsnArg: 2.363 ± 0.039
2.708AsnSer: 2.708 ± 0.052
2.042AsnThr: 2.042 ± 0.045
2.298AsnVal: 2.298 ± 0.038
0.657AsnTrp: 0.657 ± 0.018
1.244AsnTyr: 1.244 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
4.924ProAla: 4.924 ± 0.056
0.361ProCys: 0.361 ± 0.013
3.487ProAsp: 3.487 ± 0.048
4.604ProGlu: 4.604 ± 0.049
1.947ProPhe: 1.947 ± 0.029
4.007ProGly: 4.007 ± 0.05
1.129ProHis: 1.129 ± 0.029
2.3ProIle: 2.3 ± 0.029
1.81ProLys: 1.81 ± 0.034
4.65ProLeu: 4.65 ± 0.048
0.918ProMet: 0.918 ± 0.024
1.573ProAsn: 1.573 ± 0.026
2.403ProPro: 2.403 ± 0.041
2.298ProGln: 2.298 ± 0.038
2.396ProArg: 2.396 ± 0.042
2.64ProSer: 2.64 ± 0.039
2.449ProThr: 2.449 ± 0.037
4.182ProVal: 4.182 ± 0.046
0.636ProTrp: 0.636 ± 0.019
1.271ProTyr: 1.271 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
4.122GlnAla: 4.122 ± 0.049
0.397GlnCys: 0.397 ± 0.013
1.976GlnAsp: 1.976 ± 0.033
2.814GlnGlu: 2.814 ± 0.043
1.938GlnPhe: 1.938 ± 0.035
2.88GlnGly: 2.88 ± 0.041
1.047GlnHis: 1.047 ± 0.023
3.103GlnIle: 3.103 ± 0.035
2.743GlnLys: 2.743 ± 0.049
4.9GlnLeu: 4.9 ± 0.059
1.072GlnMet: 1.072 ± 0.023
1.916GlnAsn: 1.916 ± 0.027
2.406GlnPro: 2.406 ± 0.04
2.98GlnGln: 2.98 ± 0.057
2.598GlnArg: 2.598 ± 0.04
3.225GlnSer: 3.225 ± 0.041
2.934GlnThr: 2.934 ± 0.046
2.859GlnVal: 2.859 ± 0.037
0.581GlnTrp: 0.581 ± 0.018
1.177GlnTyr: 1.177 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
3.652ArgAla: 3.652 ± 0.05
0.585ArgCys: 0.585 ± 0.017
3.005ArgAsp: 3.005 ± 0.043
3.79ArgGlu: 3.79 ± 0.061
2.64ArgPhe: 2.64 ± 0.039
3.335ArgGly: 3.335 ± 0.042
1.169ArgHis: 1.169 ± 0.025
3.567ArgIle: 3.567 ± 0.042
2.84ArgLys: 2.84 ± 0.046
5.842ArgLeu: 5.842 ± 0.069
1.378ArgMet: 1.378 ± 0.027
2.139ArgAsn: 2.139 ± 0.03
2.393ArgPro: 2.393 ± 0.041
2.539ArgGln: 2.539 ± 0.044
3.463ArgArg: 3.463 ± 0.053
3.519ArgSer: 3.519 ± 0.045
2.795ArgThr: 2.795 ± 0.036
3.574ArgVal: 3.574 ± 0.052
0.944ArgTrp: 0.944 ± 0.024
1.786ArgTyr: 1.786 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
5.211SerAla: 5.211 ± 0.058
0.696SerCys: 0.696 ± 0.017
3.913SerAsp: 3.913 ± 0.052
4.504SerGlu: 4.504 ± 0.052
2.638SerPhe: 2.638 ± 0.043
5.528SerGly: 5.528 ± 0.076
1.52SerHis: 1.52 ± 0.026
3.788SerIle: 3.788 ± 0.05
2.749SerLys: 2.749 ± 0.04
6.996SerLeu: 6.996 ± 0.063
1.504SerMet: 1.504 ± 0.024
2.423SerAsn: 2.423 ± 0.042
3.399SerPro: 3.399 ± 0.041
3.198SerGln: 3.198 ± 0.038
3.826SerArg: 3.826 ± 0.057
4.586SerSer: 4.586 ± 0.055
3.644SerThr: 3.644 ± 0.06
4.35SerVal: 4.35 ± 0.057
0.896SerTrp: 0.896 ± 0.025
1.711SerTyr: 1.711 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
5.021ThrAla: 5.021 ± 0.085
0.611ThrCys: 0.611 ± 0.019
3.406ThrAsp: 3.406 ± 0.073
3.457ThrGlu: 3.457 ± 0.044
2.315ThrPhe: 2.315 ± 0.055
5.221ThrGly: 5.221 ± 0.089
1.186ThrHis: 1.186 ± 0.022
3.536ThrIle: 3.536 ± 0.094
2.051ThrLys: 2.051 ± 0.034
5.723ThrLeu: 5.723 ± 0.074
1.096ThrMet: 1.096 ± 0.02
1.933ThrAsn: 1.933 ± 0.041
3.372ThrPro: 3.372 ± 0.049
2.165ThrGln: 2.165 ± 0.033
2.925ThrArg: 2.925 ± 0.038
3.668ThrSer: 3.668 ± 0.058
3.269ThrThr: 3.269 ± 0.083
4.281ThrVal: 4.281 ± 0.115
0.783ThrTrp: 0.783 ± 0.021
1.464ThrTyr: 1.464 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
5.235ValAla: 5.235 ± 0.054
0.923ValCys: 0.923 ± 0.026
3.879ValAsp: 3.879 ± 0.065
4.14ValGlu: 4.14 ± 0.051
2.628ValPhe: 2.628 ± 0.036
4.086ValGly: 4.086 ± 0.047
1.283ValHis: 1.283 ± 0.024
4.204ValIle: 4.204 ± 0.049
3.123ValLys: 3.123 ± 0.046
6.43ValLeu: 6.43 ± 0.057
1.408ValMet: 1.408 ± 0.026
2.64ValAsn: 2.64 ± 0.053
3.289ValPro: 3.289 ± 0.05
2.48ValGln: 2.48 ± 0.038
3.459ValArg: 3.459 ± 0.043
4.801ValSer: 4.801 ± 0.063
4.223ValThr: 4.223 ± 0.095
4.646ValVal: 4.646 ± 0.052
0.888ValTrp: 0.888 ± 0.022
1.774ValTyr: 1.774 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
0.851TrpAla: 0.851 ± 0.023
0.194TrpCys: 0.194 ± 0.01
0.814TrpAsp: 0.814 ± 0.025
0.842TrpGlu: 0.842 ± 0.021
0.602TrpPhe: 0.602 ± 0.018
0.976TrpGly: 0.976 ± 0.023
0.352TrpHis: 0.352 ± 0.014
0.869TrpIle: 0.869 ± 0.02
0.929TrpLys: 0.929 ± 0.024
1.512TrpLeu: 1.512 ± 0.031
0.401TrpMet: 0.401 ± 0.013
0.7TrpAsn: 0.7 ± 0.018
0.622TrpPro: 0.622 ± 0.017
0.728TrpGln: 0.728 ± 0.02
0.813TrpArg: 0.813 ± 0.022
1.075TrpSer: 1.075 ± 0.024
0.761TrpThr: 0.761 ± 0.022
0.826TrpVal: 0.826 ± 0.021
0.241TrpTrp: 0.241 ± 0.012
0.428TrpTyr: 0.428 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.107TyrAla: 2.107 ± 0.032
0.394TyrCys: 0.394 ± 0.014
1.71TyrAsp: 1.71 ± 0.046
1.746TyrGlu: 1.746 ± 0.033
1.342TyrPhe: 1.342 ± 0.025
2.084TyrGly: 2.084 ± 0.035
0.774TyrHis: 0.774 ± 0.022
1.175TyrIle: 1.175 ± 0.022
0.876TyrLys: 0.876 ± 0.021
3.089TyrLeu: 3.089 ± 0.041
0.443TyrMet: 0.443 ± 0.015
1.002TyrAsn: 1.002 ± 0.026
1.39TyrPro: 1.39 ± 0.025
1.697TyrGln: 1.697 ± 0.028
1.943TyrArg: 1.943 ± 0.035
1.841TyrSer: 1.841 ± 0.035
1.407TyrThr: 1.407 ± 0.055
1.645TyrVal: 1.645 ± 0.029
0.5TyrTrp: 0.5 ± 0.015
0.996TyrTyr: 0.996 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6463 proteins (2248977 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
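
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is taken to be the standard deviation across replicates. The page does not describe the exact procedure, and the function names, the toy sequences and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins
    with replacement, so the resampling unit is the protein.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKLLA", "MLLG", "MAAGK", "MLLLLK", "MGAVT"]
value = dipeptide_per_mille(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LeuLeu: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so variation between proteins contributes to the estimated error.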

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski