Amino acid dipeptide frequency for Halalkalibacillus sediminis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
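As an illustration of the quantities described above, the following is a minimal Python sketch of how dipeptide frequencies in per mille could be computed and compared with the uniform expectation of 2.5‰. It is not the code used to build this table. The function names `dipeptide_permille` and `classify` are hypothetical, and the sketch assumes that overlapping residue pairs are counted within each protein, pooled over the whole proteome, and that every non-standard one-letter code is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each protein and
    pooled over all proteins; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the expected value."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With these assumptions, `classify(9.099)` (the GluGlu value in the table) gives `"over"` and `classify(0.066)` (CysCys) gives `"under"`. Multiplying a per mille value by 10⁻³ converts it back to a fraction.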

For more information, see the sequence space article on Wikipedia.

Amino acids: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue and given as dipeptide: frequency (‰) ± error.
Ala
AlaAla: 4.477 ± 0.092
AlaCys: 0.53 ± 0.027
AlaAsp: 3.23 ± 0.069
AlaGlu: 4.488 ± 0.09
AlaPhe: 2.999 ± 0.07
AlaGly: 4.721 ± 0.101
AlaHis: 1.26 ± 0.043
AlaIle: 5.489 ± 0.105
AlaLys: 4.036 ± 0.076
AlaLeu: 6.463 ± 0.104
AlaMet: 1.86 ± 0.049
AlaAsn: 2.555 ± 0.058
AlaPro: 1.894 ± 0.055
AlaGln: 2.115 ± 0.052
AlaArg: 2.451 ± 0.063
AlaSer: 3.988 ± 0.084
AlaThr: 3.37 ± 0.068
AlaVal: 4.626 ± 0.093
AlaTrp: 0.564 ± 0.03
AlaTyr: 2.067 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.342 ± 0.019
CysCys: 0.066 ± 0.009
CysAsp: 0.348 ± 0.023
CysGlu: 0.443 ± 0.026
CysPhe: 0.221 ± 0.018
CysGly: 0.606 ± 0.03
CysHis: 0.16 ± 0.016
CysIle: 0.372 ± 0.02
CysLys: 0.328 ± 0.019
CysLeu: 0.501 ± 0.024
CysMet: 0.145 ± 0.013
CysAsn: 0.244 ± 0.016
CysPro: 0.334 ± 0.022
CysGln: 0.187 ± 0.013
CysArg: 0.237 ± 0.017
CysSer: 0.379 ± 0.02
CysThr: 0.307 ± 0.019
CysVal: 0.377 ± 0.022
CysTrp: 0.049 ± 0.008
CysTyr: 0.225 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.325 ± 0.078
AspCys: 0.321 ± 0.019
AspAsp: 3.292 ± 0.074
AspGlu: 6.034 ± 0.097
AspPhe: 2.678 ± 0.051
AspGly: 3.798 ± 0.086
AspHis: 1.433 ± 0.046
AspIle: 4.185 ± 0.073
AspLys: 2.946 ± 0.065
AspLeu: 5.63 ± 0.085
AspMet: 1.536 ± 0.04
AspAsn: 1.978 ± 0.056
AspPro: 2.259 ± 0.064
AspGln: 3.008 ± 0.06
AspArg: 2.708 ± 0.055
AspSer: 3.192 ± 0.065
AspThr: 2.549 ± 0.054
AspVal: 4.201 ± 0.077
AspTrp: 0.663 ± 0.03
AspTyr: 2.435 ± 0.061
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.43 ± 0.102
GluCys: 0.355 ± 0.024
GluAsp: 5.163 ± 0.096
GluGlu: 9.099 ± 0.149
GluPhe: 2.768 ± 0.061
GluGly: 5.047 ± 0.098
GluHis: 1.688 ± 0.044
GluIle: 5.843 ± 0.094
GluLys: 6.714 ± 0.098
GluLeu: 7.43 ± 0.101
GluMet: 2.641 ± 0.049
GluAsn: 4.197 ± 0.076
GluPro: 2.255 ± 0.055
GluGln: 3.714 ± 0.083
GluArg: 3.776 ± 0.069
GluSer: 4.41 ± 0.089
GluThr: 4.012 ± 0.076
GluVal: 6.056 ± 0.108
GluTrp: 0.993 ± 0.034
GluTyr: 2.593 ± 0.06
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.782 ± 0.073
PheCys: 0.289 ± 0.02
PheAsp: 2.737 ± 0.064
PheGlu: 3.41 ± 0.058
PhePhe: 2.369 ± 0.066
PheGly: 3.26 ± 0.082
PheHis: 1.032 ± 0.035
PheIle: 3.902 ± 0.093
PheLys: 2.414 ± 0.045
PheLeu: 4.348 ± 0.1
PheMet: 1.228 ± 0.042
PheAsn: 2.099 ± 0.054
PhePro: 1.602 ± 0.052
PheGln: 1.728 ± 0.039
PheArg: 1.57 ± 0.049
PheSer: 3.127 ± 0.063
PheThr: 2.477 ± 0.052
PheVal: 3.099 ± 0.064
PheTrp: 0.501 ± 0.028
PheTyr: 1.806 ± 0.059
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.633 ± 0.096
GlyCys: 0.51 ± 0.027
GlyAsp: 3.69 ± 0.084
GlyGlu: 5.183 ± 0.108
GlyPhe: 3.334 ± 0.073
GlyGly: 4.697 ± 0.095
GlyHis: 1.41 ± 0.05
GlyIle: 5.392 ± 0.082
GlyLys: 4.5 ± 0.074
GlyLeu: 6.331 ± 0.098
GlyMet: 2.147 ± 0.054
GlyAsn: 2.541 ± 0.072
GlyPro: 1.748 ± 0.049
GlyGln: 2.336 ± 0.049
GlyArg: 2.563 ± 0.074
GlySer: 4.096 ± 0.074
GlyThr: 3.782 ± 0.067
GlyVal: 5.178 ± 0.085
GlyTrp: 0.711 ± 0.033
GlyTyr: 2.762 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.299 ± 0.04
HisCys: 0.155 ± 0.014
HisAsp: 1.111 ± 0.033
HisGlu: 1.622 ± 0.043
HisPhe: 1.075 ± 0.041
HisGly: 1.362 ± 0.045
HisHis: 0.681 ± 0.033
HisIle: 1.549 ± 0.042
HisLys: 1.052 ± 0.04
HisLeu: 2.224 ± 0.056
HisMet: 0.579 ± 0.026
HisAsn: 0.777 ± 0.026
HisPro: 1.082 ± 0.038
HisGln: 0.993 ± 0.037
HisArg: 0.915 ± 0.037
HisSer: 1.404 ± 0.043
HisThr: 0.95 ± 0.033
HisVal: 1.445 ± 0.042
HisTrp: 0.223 ± 0.017
HisTyr: 0.815 ± 0.035
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.608 ± 0.101
IleCys: 0.482 ± 0.027
IleAsp: 4.879 ± 0.075
IleGlu: 6.123 ± 0.106
IlePhe: 3.677 ± 0.095
IleGly: 5.731 ± 0.103
IleHis: 1.721 ± 0.048
IleIle: 5.87 ± 0.108
IleLys: 4.163 ± 0.072
IleLeu: 6.975 ± 0.123
IleMet: 1.833 ± 0.049
IleAsn: 3.245 ± 0.07
IlePro: 3.06 ± 0.064
IleGln: 2.947 ± 0.063
IleArg: 2.97 ± 0.058
IleSer: 4.987 ± 0.086
IleThr: 4.109 ± 0.069
IleVal: 5.397 ± 0.089
IleTrp: 0.641 ± 0.031
IleTyr: 2.581 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.832 ± 0.078
LysCys: 0.315 ± 0.019
LysAsp: 4.058 ± 0.075
LysGlu: 6.399 ± 0.108
LysPhe: 1.981 ± 0.049
LysGly: 4.049 ± 0.075
LysHis: 1.385 ± 0.038
LysIle: 4.285 ± 0.075
LysLys: 5.501 ± 0.108
LysLeu: 5.23 ± 0.086
LysMet: 2.12 ± 0.047
LysAsn: 3.25 ± 0.07
LysPro: 1.936 ± 0.051
LysGln: 2.824 ± 0.065
LysArg: 3.282 ± 0.073
LysSer: 3.605 ± 0.078
LysThr: 2.934 ± 0.064
LysVal: 4.388 ± 0.074
LysTrp: 0.716 ± 0.033
LysTyr: 2.16 ± 0.062
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.428 ± 0.111
LeuCys: 0.468 ± 0.024
LeuAsp: 5.266 ± 0.078
LeuGlu: 6.975 ± 0.106
LeuPhe: 4.705 ± 0.103
LeuGly: 6.264 ± 0.109
LeuHis: 1.799 ± 0.05
LeuIle: 7.426 ± 0.113
LeuLys: 6.114 ± 0.104
LeuLeu: 9.253 ± 0.147
LeuMet: 2.644 ± 0.06
LeuAsn: 4.657 ± 0.074
LeuPro: 3.588 ± 0.069
LeuGln: 3.33 ± 0.07
LeuArg: 3.38 ± 0.067
LeuSer: 6.638 ± 0.093
LeuThr: 5.268 ± 0.083
LeuVal: 6.231 ± 0.099
LeuTrp: 0.751 ± 0.032
LeuTyr: 2.832 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.813 ± 0.051
MetCys: 0.136 ± 0.014
MetAsp: 1.706 ± 0.043
MetGlu: 2.12 ± 0.048
MetPhe: 1.129 ± 0.036
MetGly: 1.784 ± 0.056
MetHis: 0.455 ± 0.024
MetIle: 2.322 ± 0.056
MetLys: 2.499 ± 0.054
MetLeu: 2.437 ± 0.056
MetMet: 0.956 ± 0.034
MetAsn: 1.759 ± 0.051
MetPro: 0.933 ± 0.034
MetGln: 0.981 ± 0.033
MetArg: 1.165 ± 0.039
MetSer: 1.872 ± 0.047
MetThr: 1.675 ± 0.049
MetVal: 1.937 ± 0.045
MetTrp: 0.214 ± 0.018
MetTyr: 0.802 ± 0.033
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.397 ± 0.053
AsnCys: 0.257 ± 0.018
AsnAsp: 2.776 ± 0.058
AsnGlu: 4.269 ± 0.083
AsnPhe: 1.824 ± 0.046
AsnGly: 3.063 ± 0.07
AsnHis: 1.162 ± 0.036
AsnIle: 3.191 ± 0.063
AsnLys: 2.78 ± 0.071
AsnLeu: 3.85 ± 0.071
AsnMet: 1.195 ± 0.04
AsnAsn: 2.08 ± 0.059
AsnPro: 2.028 ± 0.056
AsnGln: 2.425 ± 0.063
AsnArg: 2.186 ± 0.055
AsnSer: 2.33 ± 0.062
AsnThr: 2.026 ± 0.045
AsnVal: 3.104 ± 0.073
AsnTrp: 0.521 ± 0.02
AsnTyr: 1.655 ± 0.048
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 1.827 ± 0.046
ProCys: 0.18 ± 0.015
ProAsp: 2.118 ± 0.051
ProGlu: 3.167 ± 0.071
ProPhe: 1.948 ± 0.052
ProGly: 2.226 ± 0.047
ProHis: 0.818 ± 0.033
ProIle: 2.74 ± 0.07
ProLys: 2.043 ± 0.05
ProLeu: 3.182 ± 0.06
ProMet: 0.861 ± 0.037
ProAsn: 1.621 ± 0.041
ProPro: 1.02 ± 0.042
ProGln: 1.086 ± 0.039
ProArg: 1.102 ± 0.038
ProSer: 2.34 ± 0.058
ProThr: 2.002 ± 0.051
ProVal: 2.731 ± 0.054
ProTrp: 0.363 ± 0.023
ProTyr: 1.404 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.69 ± 0.061
GlnCys: 0.134 ± 0.014
GlnAsp: 1.896 ± 0.049
GlnGlu: 3.347 ± 0.063
GlnPhe: 1.634 ± 0.044
GlnGly: 2.284 ± 0.052
GlnHis: 0.824 ± 0.033
GlnIle: 2.661 ± 0.062
GlnLys: 2.629 ± 0.073
GlnLeu: 4.24 ± 0.092
GlnMet: 1.293 ± 0.041
GlnAsn: 1.784 ± 0.047
GlnPro: 1.389 ± 0.05
GlnGln: 1.998 ± 0.065
GlnArg: 1.524 ± 0.042
GlnSer: 2.665 ± 0.063
GlnThr: 2.044 ± 0.052
GlnVal: 2.643 ± 0.054
GlnTrp: 0.455 ± 0.025
GlnTyr: 1.322 ± 0.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.315 ± 0.053
ArgCys: 0.235 ± 0.018
ArgAsp: 2.186 ± 0.055
ArgGlu: 3.437 ± 0.073
ArgPhe: 1.873 ± 0.046
ArgGly: 2.437 ± 0.058
ArgHis: 0.728 ± 0.034
ArgIle: 2.916 ± 0.056
ArgLys: 3.006 ± 0.059
ArgLeu: 4.012 ± 0.081
ArgMet: 1.338 ± 0.043
ArgAsn: 2.015 ± 0.046
ArgPro: 1.332 ± 0.038
ArgGln: 1.513 ± 0.047
ArgArg: 1.792 ± 0.048
ArgSer: 2.327 ± 0.051
ArgThr: 2.077 ± 0.054
ArgVal: 2.869 ± 0.063
ArgTrp: 0.4 ± 0.024
ArgTyr: 1.5 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.449 ± 0.074
SerCys: 0.323 ± 0.021
SerAsp: 3.501 ± 0.072
SerGlu: 4.956 ± 0.09
SerPhe: 3.295 ± 0.068
SerGly: 4.482 ± 0.083
SerHis: 1.244 ± 0.035
SerIle: 5.29 ± 0.095
SerLys: 3.915 ± 0.074
SerLeu: 5.921 ± 0.09
SerMet: 1.833 ± 0.052
SerAsn: 2.82 ± 0.063
SerPro: 2.058 ± 0.05
SerGln: 2.187 ± 0.053
SerArg: 2.344 ± 0.062
SerSer: 4.268 ± 0.082
SerThr: 3.289 ± 0.068
SerVal: 4.259 ± 0.075
SerTrp: 0.586 ± 0.03
SerTyr: 2.237 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.173 ± 0.069
ThrCys: 0.33 ± 0.021
ThrAsp: 2.908 ± 0.065
ThrGlu: 3.848 ± 0.077
ThrPhe: 2.727 ± 0.063
ThrGly: 3.785 ± 0.069
ThrHis: 1.038 ± 0.036
ThrIle: 4.521 ± 0.078
ThrLys: 2.912 ± 0.068
ThrLeu: 4.871 ± 0.079
ThrMet: 1.278 ± 0.041
ThrAsn: 2.432 ± 0.065
ThrPro: 2.081 ± 0.052
ThrGln: 1.548 ± 0.041
ThrArg: 1.732 ± 0.05
ThrSer: 3.31 ± 0.063
ThrThr: 2.848 ± 0.072
ThrVal: 3.954 ± 0.077
ThrTrp: 0.498 ± 0.026
ThrTyr: 2.046 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.892 ± 0.09
ValCys: 0.503 ± 0.023
ValAsp: 4.441 ± 0.091
ValGlu: 5.645 ± 0.103
ValPhe: 3.023 ± 0.054
ValGly: 4.885 ± 0.081
ValHis: 1.425 ± 0.04
ValIle: 5.764 ± 0.086
ValLys: 4.153 ± 0.08
ValLeu: 6.57 ± 0.108
ValMet: 1.933 ± 0.05
ValAsn: 3.099 ± 0.062
ValPro: 2.531 ± 0.058
ValGln: 2.576 ± 0.061
ValArg: 2.66 ± 0.053
ValSer: 4.532 ± 0.082
ValThr: 3.946 ± 0.073
ValVal: 5.357 ± 0.092
ValTrp: 0.647 ± 0.031
ValTyr: 2.272 ± 0.058
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.531 ± 0.026
TrpCys: 0.042 ± 0.007
TrpAsp: 0.527 ± 0.024
TrpGlu: 0.63 ± 0.03
TrpPhe: 0.584 ± 0.031
TrpGly: 0.624 ± 0.028
TrpHis: 0.185 ± 0.017
TrpIle: 0.829 ± 0.033
TrpLys: 0.707 ± 0.03
TrpLeu: 1.141 ± 0.043
TrpMet: 0.384 ± 0.021
TrpAsn: 0.539 ± 0.025
TrpPro: 0.258 ± 0.018
TrpGln: 0.315 ± 0.02
TrpArg: 0.373 ± 0.02
TrpSer: 0.596 ± 0.027
TrpThr: 0.483 ± 0.026
TrpVal: 0.737 ± 0.033
TrpTrp: 0.138 ± 0.013
TrpTyr: 0.406 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.937 ± 0.057
TyrCys: 0.24 ± 0.019
TyrAsp: 2.336 ± 0.091
TyrGlu: 2.99 ± 0.072
TyrPhe: 1.938 ± 0.051
TyrGly: 2.437 ± 0.057
TyrHis: 0.867 ± 0.034
TyrIle: 2.45 ± 0.057
TyrLys: 1.839 ± 0.051
TyrLeu: 3.409 ± 0.069
TyrMet: 0.891 ± 0.032
TyrAsn: 1.428 ± 0.039
TyrPro: 1.389 ± 0.043
TyrGln: 1.648 ± 0.048
TyrArg: 1.579 ± 0.047
TyrSer: 2.189 ± 0.056
TyrThr: 1.691 ± 0.051
TyrVal: 2.266 ± 0.055
TyrTrp: 0.396 ± 0.024
TyrTyr: 1.427 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2899 proteins (832733 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
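For readers unfamiliar with the procedure, the following is a minimal Python sketch of a protein-level bootstrap for one dipeptide. It is not the database's own implementation. The function name `bootstrap_error` is hypothetical, and the sketch assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled per mille frequencies; the source does not state which error measure is used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `dipeptide` over `n_boot` protein-level resamples.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so resampling is cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residue pairs keeps the pairs of one protein together, so the variation between proteins is reflected in the error estimate.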

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski