Amino acid dipeptide frequency for Sulfurimonas sp. GYSZ_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
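The calculation described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: it assumes sequences in one-letter code, counts overlapping adjacent pairs within each protein only (never across protein boundaries), and maps any non-standard letter to X (Xaa). The function names `dipeptide_counts`, `dipeptide_per_mille` and `classify` are hypothetical, and the comparison against the uniform expectation of 1/400 simply follows the text above.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (assumption: two made-up sequences, "B" is non-standard).
toy = ["MKKLAA", "AKLB"]
freqs = dipeptide_per_mille(toy)
```

With real data, `proteins` would be the list of protein sequences of the proteome; `classify(3.879)` gives "over" and `classify(0.615)` gives "under", matching the red/blue convention described above under the uniform assumption.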

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
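As a worked instance of this conversion, using the AlaAla value from the table and the uniform expectation of 1/400 mentioned earlier:

```latex
3.879\ \text{‰} = 3.879 \times 10^{-3} = 0.003879 = 0.3879\ \%
\qquad
\frac{1}{400} = 0.0025 = 0.25\ \% = 2.5\ \text{‰}
```

On this scale, a table value above 2.5 ‰ is higher than the uniform expectation and a value below it is lower.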

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue; within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry is given as frequency ± error, in ‰.

Ala
AlaAla: 3.879 ± 0.108
AlaCys: 0.615 ± 0.032
AlaAsp: 3.402 ± 0.068
AlaGlu: 3.346 ± 0.076
AlaPhe: 2.82 ± 0.075
AlaGly: 3.805 ± 0.081
AlaHis: 1.254 ± 0.048
AlaIle: 5.514 ± 0.109
AlaLys: 6.196 ± 0.106
AlaLeu: 6.745 ± 0.12
AlaMet: 1.86 ± 0.056
AlaAsn: 3.305 ± 0.078
AlaPro: 1.62 ± 0.05
AlaGln: 2.125 ± 0.05
AlaArg: 2.091 ± 0.058
AlaSer: 4.454 ± 0.077
AlaThr: 3.41 ± 0.082
AlaVal: 3.959 ± 0.091
AlaTrp: 0.435 ± 0.027
AlaTyr: 2.613 ± 0.062
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.596 ± 0.029
CysCys: 0.067 ± 0.01
CysAsp: 0.653 ± 0.033
CysGlu: 0.693 ± 0.034
CysPhe: 0.309 ± 0.022
CysGly: 0.712 ± 0.033
CysHis: 0.276 ± 0.028
CysIle: 0.561 ± 0.029
CysLys: 0.702 ± 0.034
CysLeu: 0.48 ± 0.026
CysMet: 0.214 ± 0.016
CysAsn: 0.457 ± 0.028
CysPro: 0.379 ± 0.031
CysGln: 0.187 ± 0.016
CysArg: 0.28 ± 0.021
CysSer: 0.654 ± 0.03
CysThr: 0.397 ± 0.023
CysVal: 0.445 ± 0.023
CysTrp: 0.046 ± 0.008
CysTyr: 0.292 ± 0.019
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.839 ± 0.117
AspCys: 0.418 ± 0.023
AspAsp: 3.556 ± 0.081
AspGlu: 5.937 ± 0.156
AspPhe: 3.287 ± 0.061
AspGly: 3.537 ± 0.084
AspHis: 0.579 ± 0.031
AspIle: 6.599 ± 0.11
AspLys: 5.432 ± 0.105
AspLeu: 5.062 ± 0.093
AspMet: 1.831 ± 0.052
AspAsn: 3.111 ± 0.065
AspPro: 1.32 ± 0.049
AspGln: 0.853 ± 0.035
AspArg: 1.822 ± 0.05
AspSer: 3.665 ± 0.079
AspThr: 3.198 ± 0.116
AspVal: 3.886 ± 0.083
AspTrp: 0.409 ± 0.025
AspTyr: 2.479 ± 0.06
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.79 ± 0.103
GluCys: 0.675 ± 0.032
GluAsp: 4.528 ± 0.128
GluGlu: 4.961 ± 0.112
GluPhe: 3.624 ± 0.078
GluGly: 3.38 ± 0.075
GluHis: 1.585 ± 0.054
GluIle: 6.219 ± 0.106
GluLys: 6.389 ± 0.107
GluLeu: 7.831 ± 0.123
GluMet: 1.872 ± 0.051
GluAsn: 4.396 ± 0.083
GluPro: 1.529 ± 0.044
GluGln: 2.343 ± 0.064
GluArg: 2.227 ± 0.064
GluSer: 4.1 ± 0.086
GluThr: 2.714 ± 0.059
GluVal: 4.579 ± 0.096
GluTrp: 0.511 ± 0.027
GluTyr: 3.038 ± 0.078
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.022 ± 0.07
PheCys: 0.466 ± 0.025
PheAsp: 3.116 ± 0.069
PheGlu: 3.59 ± 0.078
PhePhe: 2.678 ± 0.082
PheGly: 3.239 ± 0.073
PheHis: 0.739 ± 0.032
PheIle: 4.286 ± 0.096
PheLys: 3.928 ± 0.069
PheLeu: 4.708 ± 0.095
PheMet: 1.181 ± 0.042
PheAsn: 2.784 ± 0.059
PhePro: 1.071 ± 0.041
PheGln: 0.941 ± 0.037
PheArg: 1.383 ± 0.044
PheSer: 3.874 ± 0.086
PheThr: 2.5 ± 0.052
PheVal: 2.99 ± 0.065
PheTrp: 0.393 ± 0.027
PheTyr: 2.053 ± 0.052
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.051 ± 0.086
GlyCys: 0.705 ± 0.033
GlyAsp: 3.18 ± 0.077
GlyGlu: 3.679 ± 0.066
GlyPhe: 3.329 ± 0.077
GlyGly: 3.815 ± 0.101
GlyHis: 1.137 ± 0.041
GlyIle: 5.077 ± 0.09
GlyLys: 4.166 ± 0.079
GlyLeu: 5.063 ± 0.101
GlyMet: 1.716 ± 0.056
GlyAsn: 2.39 ± 0.057
GlyPro: 0.928 ± 0.038
GlyGln: 1.294 ± 0.05
GlyArg: 1.908 ± 0.054
GlySer: 3.624 ± 0.071
GlyThr: 2.77 ± 0.073
GlyVal: 4.423 ± 0.102
GlyTrp: 0.514 ± 0.024
GlyTyr: 2.719 ± 0.06
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.98 ± 0.04
HisCys: 0.172 ± 0.014
HisAsp: 0.938 ± 0.037
HisGlu: 1.137 ± 0.038
HisPhe: 1.05 ± 0.043
HisGly: 1.071 ± 0.042
HisHis: 0.467 ± 0.027
HisIle: 1.656 ± 0.044
HisLys: 1.557 ± 0.049
HisLeu: 1.752 ± 0.053
HisMet: 0.495 ± 0.028
HisAsn: 1.078 ± 0.035
HisPro: 0.876 ± 0.034
HisGln: 0.61 ± 0.026
HisArg: 0.667 ± 0.033
HisSer: 1.237 ± 0.044
HisThr: 1.023 ± 0.032
HisVal: 0.821 ± 0.036
HisTrp: 0.143 ± 0.013
HisTyr: 0.786 ± 0.034
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.785 ± 0.115
IleCys: 0.698 ± 0.036
IleAsp: 6.138 ± 0.101
IleGlu: 6.522 ± 0.115
IlePhe: 4.19 ± 0.103
IleGly: 5.023 ± 0.108
IleHis: 1.378 ± 0.041
IleIle: 6.704 ± 0.107
IleLys: 7.649 ± 0.127
IleLeu: 7.902 ± 0.11
IleMet: 1.815 ± 0.049
IleAsn: 5.012 ± 0.095
IlePro: 2.621 ± 0.066
IleGln: 2.325 ± 0.058
IleArg: 2.601 ± 0.058
IleSer: 6.765 ± 0.096
IleThr: 4.392 ± 0.064
IleVal: 5.3 ± 0.1
IleTrp: 0.514 ± 0.026
IleTyr: 3.389 ± 0.073
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.108 ± 0.096
LysCys: 0.588 ± 0.034
LysAsp: 6.074 ± 0.096
LysGlu: 7.867 ± 0.114
LysPhe: 3.139 ± 0.076
LysGly: 3.829 ± 0.076
LysHis: 1.747 ± 0.046
LysIle: 7.386 ± 0.116
LysLys: 8.665 ± 0.127
LysLeu: 8.075 ± 0.113
LysMet: 2.394 ± 0.067
LysAsn: 6.405 ± 0.113
LysPro: 2.464 ± 0.065
LysGln: 2.808 ± 0.063
LysArg: 2.975 ± 0.069
LysSer: 5.548 ± 0.105
LysThr: 4.74 ± 0.092
LysVal: 5.054 ± 0.098
LysTrp: 0.549 ± 0.028
LysTyr: 3.889 ± 0.078
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.828 ± 0.107
LeuCys: 0.721 ± 0.031
LeuAsp: 6.047 ± 0.109
LeuGlu: 6.873 ± 0.112
LeuPhe: 4.569 ± 0.102
LeuGly: 5.413 ± 0.094
LeuHis: 1.741 ± 0.045
LeuIle: 7.127 ± 0.114
LeuLys: 8.944 ± 0.141
LeuLeu: 8.784 ± 0.144
LeuMet: 2.23 ± 0.053
LeuAsn: 5.723 ± 0.104
LeuPro: 2.858 ± 0.062
LeuGln: 2.785 ± 0.054
LeuArg: 3.023 ± 0.062
LeuSer: 7.717 ± 0.134
LeuThr: 4.491 ± 0.073
LeuVal: 5.474 ± 0.093
LeuTrp: 0.645 ± 0.031
LeuTyr: 3.687 ± 0.085
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.814 ± 0.056
MetCys: 0.245 ± 0.019
MetAsp: 1.529 ± 0.049
MetGlu: 1.358 ± 0.051
MetPhe: 1.233 ± 0.045
MetGly: 1.726 ± 0.054
MetHis: 0.515 ± 0.026
MetIle: 2.059 ± 0.052
MetLys: 2.239 ± 0.059
MetLeu: 2.583 ± 0.065
MetMet: 0.794 ± 0.037
MetAsn: 1.405 ± 0.045
MetPro: 1.0 ± 0.039
MetGln: 1.175 ± 0.038
MetArg: 0.846 ± 0.031
MetSer: 2.016 ± 0.057
MetThr: 1.097 ± 0.035
MetVal: 1.51 ± 0.045
MetTrp: 0.191 ± 0.017
MetTyr: 0.737 ± 0.031
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.2 ± 0.068
AsnCys: 0.386 ± 0.024
AsnAsp: 3.251 ± 0.062
AsnGlu: 4.144 ± 0.074
AsnPhe: 2.785 ± 0.066
AsnGly: 3.068 ± 0.069
AsnHis: 0.945 ± 0.035
AsnIle: 6.422 ± 0.118
AsnLys: 4.992 ± 0.088
AsnLeu: 5.089 ± 0.078
AsnMet: 1.402 ± 0.05
AsnAsn: 3.29 ± 0.079
AsnPro: 2.076 ± 0.056
AsnGln: 1.464 ± 0.048
AsnArg: 1.831 ± 0.054
AsnSer: 3.994 ± 0.082
AsnThr: 3.025 ± 0.071
AsnVal: 3.092 ± 0.066
AsnTrp: 0.296 ± 0.02
AsnTyr: 2.275 ± 0.068
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.456 ± 0.05
ProCys: 0.217 ± 0.018
ProAsp: 1.551 ± 0.043
ProGlu: 1.948 ± 0.067
ProPhe: 1.596 ± 0.045
ProGly: 1.076 ± 0.045
ProHis: 0.653 ± 0.031
ProIle: 2.427 ± 0.061
ProLys: 2.551 ± 0.064
ProLeu: 2.706 ± 0.063
ProMet: 0.714 ± 0.035
ProAsn: 1.651 ± 0.045
ProPro: 0.708 ± 0.035
ProGln: 0.88 ± 0.039
ProArg: 0.837 ± 0.036
ProSer: 1.966 ± 0.058
ProThr: 1.584 ± 0.048
ProVal: 1.772 ± 0.053
ProTrp: 0.219 ± 0.018
ProTyr: 1.417 ± 0.047
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.749 ± 0.053
GlnCys: 0.175 ± 0.016
GlnAsp: 1.612 ± 0.046
GlnGlu: 2.265 ± 0.067
GlnPhe: 1.021 ± 0.04
GlnGly: 1.301 ± 0.038
GlnHis: 0.475 ± 0.026
GlnIle: 2.345 ± 0.061
GlnLys: 3.177 ± 0.072
GlnLeu: 2.269 ± 0.055
GlnMet: 0.83 ± 0.033
GlnAsn: 2.419 ± 0.063
GlnPro: 0.647 ± 0.03
GlnGln: 0.849 ± 0.045
GlnArg: 1.152 ± 0.039
GlnSer: 1.8 ± 0.05
GlnThr: 1.643 ± 0.058
GlnVal: 1.502 ± 0.051
GlnTrp: 0.209 ± 0.018
GlnTyr: 0.946 ± 0.037
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.009 ± 0.054
ArgCys: 0.318 ± 0.024
ArgAsp: 2.176 ± 0.055
ArgGlu: 2.413 ± 0.064
ArgPhe: 1.734 ± 0.057
ArgGly: 1.847 ± 0.057
ArgHis: 0.71 ± 0.036
ArgIle: 2.62 ± 0.064
ArgLys: 2.291 ± 0.058
ArgLeu: 3.021 ± 0.058
ArgMet: 0.803 ± 0.031
ArgAsn: 1.49 ± 0.046
ArgPro: 0.862 ± 0.038
ArgGln: 0.86 ± 0.03
ArgArg: 1.235 ± 0.046
ArgSer: 1.796 ± 0.053
ArgThr: 1.413 ± 0.043
ArgVal: 2.398 ± 0.066
ArgTrp: 0.265 ± 0.022
ArgTyr: 1.619 ± 0.046
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.173 ± 0.075
SerCys: 0.608 ± 0.027
SerAsp: 3.934 ± 0.078
SerGlu: 4.123 ± 0.081
SerPhe: 3.718 ± 0.074
SerGly: 4.292 ± 0.069
SerHis: 1.261 ± 0.04
SerIle: 6.307 ± 0.097
SerLys: 6.41 ± 0.112
SerLeu: 6.761 ± 0.123
SerMet: 1.892 ± 0.051
SerAsn: 3.521 ± 0.074
SerPro: 1.701 ± 0.041
SerGln: 2.134 ± 0.051
SerArg: 2.098 ± 0.055
SerSer: 5.097 ± 0.105
SerThr: 3.498 ± 0.076
SerVal: 4.298 ± 0.08
SerTrp: 0.51 ± 0.029
SerTyr: 2.935 ± 0.077
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.025 ± 0.073
ThrCys: 0.383 ± 0.025
ThrAsp: 2.745 ± 0.103
ThrGlu: 2.364 ± 0.062
ThrPhe: 2.344 ± 0.053
ThrGly: 2.84 ± 0.065
ThrHis: 1.078 ± 0.039
ThrIle: 4.298 ± 0.075
ThrLys: 4.528 ± 0.073
ThrLeu: 5.54 ± 0.073
ThrMet: 1.177 ± 0.038
ThrAsn: 2.764 ± 0.069
ThrPro: 2.166 ± 0.05
ThrGln: 1.845 ± 0.051
ThrArg: 1.467 ± 0.047
ThrSer: 3.375 ± 0.071
ThrThr: 2.968 ± 0.136
ThrVal: 2.783 ± 0.084
ThrTrp: 0.33 ± 0.023
ThrTyr: 2.173 ± 0.067
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.173 ± 0.085
ValCys: 0.577 ± 0.032
ValAsp: 4.16 ± 0.099
ValGlu: 4.343 ± 0.091
ValPhe: 2.879 ± 0.064
ValGly: 3.764 ± 0.079
ValHis: 1.025 ± 0.032
ValIle: 4.98 ± 0.092
ValLys: 5.229 ± 0.088
ValLeu: 5.785 ± 0.097
ValMet: 1.599 ± 0.055
ValAsn: 3.145 ± 0.07
ValPro: 1.718 ± 0.054
ValGln: 1.531 ± 0.043
ValArg: 1.808 ± 0.055
ValSer: 4.439 ± 0.088
ValThr: 2.89 ± 0.073
ValVal: 4.447 ± 0.103
ValTrp: 0.444 ± 0.026
ValTyr: 2.302 ± 0.054
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.47 ± 0.026
TrpCys: 0.074 ± 0.011
TrpAsp: 0.429 ± 0.026
TrpGlu: 0.4 ± 0.024
TrpPhe: 0.421 ± 0.025
TrpGly: 0.479 ± 0.025
TrpHis: 0.21 ± 0.018
TrpIle: 0.64 ± 0.034
TrpLys: 0.357 ± 0.023
TrpLeu: 0.743 ± 0.032
TrpMet: 0.252 ± 0.018
TrpAsn: 0.348 ± 0.022
TrpPro: 0.139 ± 0.015
TrpGln: 0.242 ± 0.019
TrpArg: 0.268 ± 0.018
TrpSer: 0.424 ± 0.025
TrpThr: 0.277 ± 0.019
TrpVal: 0.436 ± 0.029
TrpTrp: 0.087 ± 0.012
TrpTyr: 0.319 ± 0.018
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.503 ± 0.067
TyrCys: 0.292 ± 0.021
TyrAsp: 2.801 ± 0.067
TyrGlu: 3.328 ± 0.074
TyrPhe: 2.213 ± 0.061
TyrGly: 2.269 ± 0.054
TyrHis: 0.74 ± 0.03
TyrIle: 3.498 ± 0.078
TyrLys: 3.902 ± 0.082
TyrLeu: 3.781 ± 0.088
TyrMet: 0.998 ± 0.037
TyrAsn: 2.335 ± 0.066
TyrPro: 1.246 ± 0.045
TyrGln: 1.136 ± 0.041
TyrArg: 1.36 ± 0.043
TyrSer: 2.727 ± 0.07
TyrThr: 2.161 ± 0.065
TyrVal: 2.072 ± 0.051
TyrTrp: 0.305 ± 0.022
TyrTyr: 1.718 ± 0.052
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2324 proteins (743192 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
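Bootstrapping at the protein level can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The exact estimator used by the database is not stated here, and the helper names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein (hits, total pairs) so replicates are cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (assumption: made-up sequences, not taken from the proteome).
mean_ak, err_ak = bootstrap_error(["AKAK", "KAKA", "AAKK", "KKAA"], "AK")
```

Resampling proteins rather than individual dipeptides keeps the pairs within one protein together, so the error reflects variation between proteins.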

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski