Amino acid dipepetide frequency for Moheibacter sediminis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.884AlaAla: 3.884 ± 0.077
0.481AlaCys: 0.481 ± 0.024
3.228AlaAsp: 3.228 ± 0.062
4.296AlaGlu: 4.296 ± 0.072
3.047AlaPhe: 3.047 ± 0.06
4.266AlaGly: 4.266 ± 0.079
1.026AlaHis: 1.026 ± 0.036
4.641AlaIle: 4.641 ± 0.071
4.301AlaLys: 4.301 ± 0.08
5.252AlaLeu: 5.252 ± 0.079
1.446AlaMet: 1.446 ± 0.045
3.178AlaAsn: 3.178 ± 0.057
1.707AlaPro: 1.707 ± 0.044
2.626AlaGln: 2.626 ± 0.062
1.84AlaArg: 1.84 ± 0.05
3.615AlaSer: 3.615 ± 0.066
3.171AlaThr: 3.171 ± 0.071
3.865AlaVal: 3.865 ± 0.088
0.645AlaTrp: 0.645 ± 0.028
2.371AlaTyr: 2.371 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.419CysAla: 0.419 ± 0.022
0.099CysCys: 0.099 ± 0.011
0.368CysAsp: 0.368 ± 0.019
0.46CysGlu: 0.46 ± 0.025
0.353CysPhe: 0.353 ± 0.019
0.633CysGly: 0.633 ± 0.033
0.143CysHis: 0.143 ± 0.014
0.551CysIle: 0.551 ± 0.023
0.45CysLys: 0.45 ± 0.02
0.585CysLeu: 0.585 ± 0.025
0.159CysMet: 0.159 ± 0.013
0.424CysAsn: 0.424 ± 0.021
0.323CysPro: 0.323 ± 0.022
0.194CysGln: 0.194 ± 0.015
0.191CysArg: 0.191 ± 0.015
0.505CysSer: 0.505 ± 0.026
0.412CysThr: 0.412 ± 0.022
0.425CysVal: 0.425 ± 0.022
0.064CysTrp: 0.064 ± 0.008
0.241CysTyr: 0.241 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.44AspAla: 3.44 ± 0.057
0.408AspCys: 0.408 ± 0.023
2.809AspAsp: 2.809 ± 0.064
4.469AspGlu: 4.469 ± 0.078
3.678AspPhe: 3.678 ± 0.066
3.926AspGly: 3.926 ± 0.081
0.661AspHis: 0.661 ± 0.022
3.889AspIle: 3.889 ± 0.082
4.235AspLys: 4.235 ± 0.068
5.306AspLeu: 5.306 ± 0.083
1.158AspMet: 1.158 ± 0.034
3.04AspAsn: 3.04 ± 0.061
1.534AspPro: 1.534 ± 0.048
1.384AspGln: 1.384 ± 0.04
1.714AspArg: 1.714 ± 0.048
3.264AspSer: 3.264 ± 0.068
2.312AspThr: 2.312 ± 0.046
3.334AspVal: 3.334 ± 0.053
0.735AspTrp: 0.735 ± 0.028
2.832AspTyr: 2.832 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
3.966GluAla: 3.966 ± 0.081
0.351GluCys: 0.351 ± 0.02
3.385GluAsp: 3.385 ± 0.064
5.36GluGlu: 5.36 ± 0.096
3.971GluPhe: 3.971 ± 0.075
3.768GluGly: 3.768 ± 0.077
1.089GluHis: 1.089 ± 0.036
6.999GluIle: 6.999 ± 0.097
6.629GluLys: 6.629 ± 0.109
6.865GluLeu: 6.865 ± 0.1
1.778GluMet: 1.778 ± 0.041
5.861GluAsn: 5.861 ± 0.095
1.571GluPro: 1.571 ± 0.042
2.327GluGln: 2.327 ± 0.051
2.471GluArg: 2.471 ± 0.046
3.784GluSer: 3.784 ± 0.063
3.806GluThr: 3.806 ± 0.066
4.097GluVal: 4.097 ± 0.066
0.734GluTrp: 0.734 ± 0.029
2.489GluTyr: 2.489 ± 0.051
0.0GluXaa: 0.0 ± 0.0
Phe
3.045PheAla: 3.045 ± 0.051
0.451PheCys: 0.451 ± 0.021
3.339PheAsp: 3.339 ± 0.064
3.807PheGlu: 3.807 ± 0.07
2.817PhePhe: 2.817 ± 0.068
3.662PheGly: 3.662 ± 0.062
1.021PheHis: 1.021 ± 0.038
4.212PheIle: 4.212 ± 0.081
3.482PheLys: 3.482 ± 0.057
4.827PheLeu: 4.827 ± 0.086
1.169PheMet: 1.169 ± 0.035
3.312PheAsn: 3.312 ± 0.056
1.843PhePro: 1.843 ± 0.045
1.782PheGln: 1.782 ± 0.05
1.765PheArg: 1.765 ± 0.039
4.178PheSer: 4.178 ± 0.072
3.229PheThr: 3.229 ± 0.064
3.206PheVal: 3.206 ± 0.058
0.643PheTrp: 0.643 ± 0.025
2.33PheTyr: 2.33 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
3.904GlyAla: 3.904 ± 0.082
0.562GlyCys: 0.562 ± 0.026
3.235GlyAsp: 3.235 ± 0.08
4.019GlyGlu: 4.019 ± 0.073
3.697GlyPhe: 3.697 ± 0.068
4.796GlyGly: 4.796 ± 0.118
1.022GlyHis: 1.022 ± 0.037
5.8GlyIle: 5.8 ± 0.081
5.312GlyLys: 5.312 ± 0.091
5.532GlyLeu: 5.532 ± 0.077
1.694GlyMet: 1.694 ± 0.046
4.509GlyAsn: 4.509 ± 0.096
1.123GlyPro: 1.123 ± 0.037
2.055GlyGln: 2.055 ± 0.052
1.977GlyArg: 1.977 ± 0.055
4.039GlySer: 4.039 ± 0.072
4.149GlyThr: 4.149 ± 0.099
3.95GlyVal: 3.95 ± 0.08
0.785GlyTrp: 0.785 ± 0.03
2.823GlyTyr: 2.823 ± 0.067
0.0GlyXaa: 0.0 ± 0.0
His
0.928HisAla: 0.928 ± 0.031
0.161HisCys: 0.161 ± 0.013
0.833HisAsp: 0.833 ± 0.032
1.003HisGlu: 1.003 ± 0.032
1.089HisPhe: 1.089 ± 0.033
1.01HisGly: 1.01 ± 0.033
0.43HisHis: 0.43 ± 0.024
1.292HisIle: 1.292 ± 0.038
1.162HisLys: 1.162 ± 0.03
1.728HisLeu: 1.728 ± 0.04
0.274HisMet: 0.274 ± 0.016
0.964HisAsn: 0.964 ± 0.032
0.823HisPro: 0.823 ± 0.03
0.706HisGln: 0.706 ± 0.028
0.649HisArg: 0.649 ± 0.029
1.259HisSer: 1.259 ± 0.038
0.915HisThr: 0.915 ± 0.032
0.737HisVal: 0.737 ± 0.032
0.213HisTrp: 0.213 ± 0.015
0.783HisTyr: 0.783 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.095IleAla: 5.095 ± 0.07
0.661IleCys: 0.661 ± 0.026
4.77IleAsp: 4.77 ± 0.076
5.73IleGlu: 5.73 ± 0.085
4.159IlePhe: 4.159 ± 0.094
5.121IleGly: 5.121 ± 0.076
1.471IleHis: 1.471 ± 0.04
6.506IleIle: 6.506 ± 0.096
5.486IleLys: 5.486 ± 0.074
7.667IleLeu: 7.667 ± 0.112
1.443IleMet: 1.443 ± 0.041
5.009IleAsn: 5.009 ± 0.076
3.4IlePro: 3.4 ± 0.069
3.316IleGln: 3.316 ± 0.07
2.579IleArg: 2.579 ± 0.049
6.354IleSer: 6.354 ± 0.082
4.46IleThr: 4.46 ± 0.077
4.644IleVal: 4.644 ± 0.073
0.767IleTrp: 0.767 ± 0.026
3.404IleTyr: 3.404 ± 0.063
0.0IleXaa: 0.0 ± 0.0
Lys
4.076LysAla: 4.076 ± 0.077
0.306LysCys: 0.306 ± 0.021
4.107LysAsp: 4.107 ± 0.072
5.951LysGlu: 5.951 ± 0.091
3.614LysPhe: 3.614 ± 0.072
4.36LysGly: 4.36 ± 0.08
1.253LysHis: 1.253 ± 0.037
7.08LysIle: 7.08 ± 0.099
6.48LysLys: 6.48 ± 0.104
6.751LysLeu: 6.751 ± 0.092
2.07LysMet: 2.07 ± 0.046
5.514LysAsn: 5.514 ± 0.108
2.367LysPro: 2.367 ± 0.055
2.67LysGln: 2.67 ± 0.058
2.47LysArg: 2.47 ± 0.049
4.974LysSer: 4.974 ± 0.084
4.276LysThr: 4.276 ± 0.082
4.436LysVal: 4.436 ± 0.071
0.762LysTrp: 0.762 ± 0.028
2.991LysTyr: 2.991 ± 0.055
0.0LysXaa: 0.0 ± 0.0
Leu
5.204LeuAla: 5.204 ± 0.086
0.606LeuCys: 0.606 ± 0.029
4.878LeuAsp: 4.878 ± 0.082
6.099LeuGlu: 6.099 ± 0.098
4.779LeuPhe: 4.779 ± 0.07
5.634LeuGly: 5.634 ± 0.081
1.435LeuHis: 1.435 ± 0.044
7.544LeuIle: 7.544 ± 0.11
7.486LeuLys: 7.486 ± 0.114
7.978LeuLeu: 7.978 ± 0.1
2.244LeuMet: 2.244 ± 0.053
6.585LeuAsn: 6.585 ± 0.09
3.31LeuPro: 3.31 ± 0.063
3.222LeuGln: 3.222 ± 0.058
2.968LeuArg: 2.968 ± 0.065
6.61LeuSer: 6.61 ± 0.095
5.146LeuThr: 5.146 ± 0.089
4.97LeuVal: 4.97 ± 0.074
0.776LeuTrp: 0.776 ± 0.029
3.164LeuTyr: 3.164 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
1.395MetAla: 1.395 ± 0.037
0.108MetCys: 0.108 ± 0.01
1.271MetAsp: 1.271 ± 0.037
1.557MetGlu: 1.557 ± 0.043
0.92MetPhe: 0.92 ± 0.036
1.557MetGly: 1.557 ± 0.041
0.36MetHis: 0.36 ± 0.019
1.666MetIle: 1.666 ± 0.049
2.391MetLys: 2.391 ± 0.051
1.855MetLeu: 1.855 ± 0.042
0.64MetMet: 0.64 ± 0.027
1.625MetAsn: 1.625 ± 0.029
0.817MetPro: 0.817 ± 0.032
0.808MetGln: 0.808 ± 0.03
0.933MetArg: 0.933 ± 0.032
1.412MetSer: 1.412 ± 0.034
1.203MetThr: 1.203 ± 0.037
1.362MetVal: 1.362 ± 0.037
0.182MetTrp: 0.182 ± 0.013
0.733MetTyr: 0.733 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
3.753AsnAla: 3.753 ± 0.067
0.49AsnCys: 0.49 ± 0.027
3.274AsnAsp: 3.274 ± 0.064
4.619AsnGlu: 4.619 ± 0.085
3.481AsnPhe: 3.481 ± 0.067
4.494AsnGly: 4.494 ± 0.1
1.254AsnHis: 1.254 ± 0.036
4.814AsnIle: 4.814 ± 0.08
4.133AsnLys: 4.133 ± 0.074
6.444AsnLeu: 6.444 ± 0.09
1.292AsnMet: 1.292 ± 0.035
4.022AsnAsn: 4.022 ± 0.1
3.267AsnPro: 3.267 ± 0.072
2.835AsnGln: 2.835 ± 0.065
1.911AsnArg: 1.911 ± 0.046
4.762AsnSer: 4.762 ± 0.1
3.217AsnThr: 3.217 ± 0.066
3.767AsnVal: 3.767 ± 0.076
0.842AsnTrp: 0.842 ± 0.032
3.203AsnTyr: 3.203 ± 0.061
0.0AsnXaa: 0.0 ± 0.0
Pro
2.06ProAla: 2.06 ± 0.058
0.189ProCys: 0.189 ± 0.016
2.074ProAsp: 2.074 ± 0.049
2.974ProGlu: 2.974 ± 0.051
1.824ProPhe: 1.824 ± 0.044
1.909ProGly: 1.909 ± 0.047
0.569ProHis: 0.569 ± 0.025
2.579ProIle: 2.579 ± 0.05
2.431ProLys: 2.431 ± 0.059
2.682ProLeu: 2.682 ± 0.061
0.761ProMet: 0.761 ± 0.027
2.27ProAsn: 2.27 ± 0.054
0.838ProPro: 0.838 ± 0.029
1.284ProGln: 1.284 ± 0.033
0.811ProArg: 0.811 ± 0.03
2.007ProSer: 2.007 ± 0.047
1.946ProThr: 1.946 ± 0.052
2.377ProVal: 2.377 ± 0.048
0.284ProTrp: 0.284 ± 0.016
1.356ProTyr: 1.356 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
1.95GlnAla: 1.95 ± 0.047
0.16GlnCys: 0.16 ± 0.012
1.732GlnAsp: 1.732 ± 0.039
2.443GlnGlu: 2.443 ± 0.055
1.79GlnPhe: 1.79 ± 0.041
1.873GlnGly: 1.873 ± 0.047
0.613GlnHis: 0.613 ± 0.027
3.334GlnIle: 3.334 ± 0.059
3.261GlnLys: 3.261 ± 0.061
3.256GlnLeu: 3.256 ± 0.06
0.879GlnMet: 0.879 ± 0.031
3.012GlnAsn: 3.012 ± 0.066
1.104GlnPro: 1.104 ± 0.033
1.475GlnGln: 1.475 ± 0.045
1.303GlnArg: 1.303 ± 0.037
2.238GlnSer: 2.238 ± 0.051
2.21GlnThr: 2.21 ± 0.052
1.836GlnVal: 1.836 ± 0.042
0.375GlnTrp: 0.375 ± 0.021
1.391GlnTyr: 1.391 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
1.779ArgAla: 1.779 ± 0.043
0.172ArgCys: 0.172 ± 0.013
1.554ArgAsp: 1.554 ± 0.04
2.243ArgGlu: 2.243 ± 0.064
1.85ArgPhe: 1.85 ± 0.046
1.798ArgGly: 1.798 ± 0.044
0.557ArgHis: 0.557 ± 0.022
2.893ArgIle: 2.893 ± 0.053
2.829ArgLys: 2.829 ± 0.066
2.949ArgLeu: 2.949 ± 0.069
0.912ArgMet: 0.912 ± 0.035
2.298ArgAsn: 2.298 ± 0.056
1.002ArgPro: 1.002 ± 0.033
1.092ArgGln: 1.092 ± 0.036
1.215ArgArg: 1.215 ± 0.033
1.66ArgSer: 1.66 ± 0.042
1.759ArgThr: 1.759 ± 0.042
1.929ArgVal: 1.929 ± 0.05
0.309ArgTrp: 0.309 ± 0.016
1.424ArgTyr: 1.424 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
3.899SerAla: 3.899 ± 0.069
0.593SerCys: 0.593 ± 0.025
3.692SerAsp: 3.692 ± 0.065
4.847SerGlu: 4.847 ± 0.08
3.877SerPhe: 3.877 ± 0.063
5.075SerGly: 5.075 ± 0.08
1.129SerHis: 1.129 ± 0.036
5.357SerIle: 5.357 ± 0.085
4.826SerLys: 4.826 ± 0.085
5.925SerLeu: 5.925 ± 0.094
1.318SerMet: 1.318 ± 0.039
3.915SerAsn: 3.915 ± 0.074
2.198SerPro: 2.198 ± 0.054
2.418SerGln: 2.418 ± 0.051
1.903SerArg: 1.903 ± 0.047
4.393SerSer: 4.393 ± 0.08
3.605SerThr: 3.605 ± 0.067
4.161SerVal: 4.161 ± 0.066
0.664SerTrp: 0.664 ± 0.027
2.667SerTyr: 2.667 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
3.5ThrAla: 3.5 ± 0.073
0.308ThrCys: 0.308 ± 0.019
3.194ThrAsp: 3.194 ± 0.063
3.893ThrGlu: 3.893 ± 0.075
2.922ThrPhe: 2.922 ± 0.059
4.047ThrGly: 4.047 ± 0.094
1.006ThrHis: 1.006 ± 0.033
4.614ThrIle: 4.614 ± 0.068
3.496ThrLys: 3.496 ± 0.06
4.851ThrLeu: 4.851 ± 0.077
0.985ThrMet: 0.985 ± 0.032
3.245ThrAsn: 3.245 ± 0.063
2.268ThrPro: 2.268 ± 0.05
2.051ThrGln: 2.051 ± 0.051
1.61ThrArg: 1.61 ± 0.042
3.541ThrSer: 3.541 ± 0.077
3.195ThrThr: 3.195 ± 0.086
3.477ThrVal: 3.477 ± 0.062
0.561ThrTrp: 0.561 ± 0.028
2.363ThrTyr: 2.363 ± 0.052
0.0ThrXaa: 0.0 ± 0.0
Val
3.605ValAla: 3.605 ± 0.069
0.479ValCys: 0.479 ± 0.023
3.273ValAsp: 3.273 ± 0.066
4.124ValGlu: 4.124 ± 0.063
3.074ValPhe: 3.074 ± 0.061
3.749ValGly: 3.749 ± 0.079
0.945ValHis: 0.945 ± 0.033
4.618ValIle: 4.618 ± 0.07
4.375ValLys: 4.375 ± 0.069
5.355ValLeu: 5.355 ± 0.081
1.438ValMet: 1.438 ± 0.035
3.623ValAsn: 3.623 ± 0.078
2.013ValPro: 2.013 ± 0.047
1.948ValGln: 1.948 ± 0.045
2.078ValArg: 2.078 ± 0.047
4.395ValSer: 4.395 ± 0.063
3.189ValThr: 3.189 ± 0.064
3.819ValVal: 3.819 ± 0.077
0.659ValTrp: 0.659 ± 0.03
2.501ValTyr: 2.501 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.613TrpAla: 0.613 ± 0.028
0.09TrpCys: 0.09 ± 0.01
0.621TrpAsp: 0.621 ± 0.025
0.714TrpGlu: 0.714 ± 0.031
0.569TrpPhe: 0.569 ± 0.023
0.713TrpGly: 0.713 ± 0.035
0.205TrpHis: 0.205 ± 0.015
0.894TrpIle: 0.894 ± 0.035
0.836TrpLys: 0.836 ± 0.034
0.877TrpLeu: 0.877 ± 0.026
0.353TrpMet: 0.353 ± 0.022
0.787TrpAsn: 0.787 ± 0.03
0.194TrpPro: 0.194 ± 0.015
0.406TrpGln: 0.406 ± 0.02
0.336TrpArg: 0.336 ± 0.017
0.636TrpSer: 0.636 ± 0.029
0.575TrpThr: 0.575 ± 0.033
0.649TrpVal: 0.649 ± 0.028
0.143TrpTrp: 0.143 ± 0.013
0.484TrpTyr: 0.484 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.372TyrAla: 2.372 ± 0.047
0.323TyrCys: 0.323 ± 0.021
2.491TyrAsp: 2.491 ± 0.063
2.694TyrGlu: 2.694 ± 0.064
2.609TyrPhe: 2.609 ± 0.059
2.594TyrGly: 2.594 ± 0.058
0.771TyrHis: 0.771 ± 0.031
2.758TyrIle: 2.758 ± 0.053
2.906TyrLys: 2.906 ± 0.06
3.95TyrLeu: 3.95 ± 0.075
0.78TyrMet: 0.78 ± 0.025
2.683TyrAsn: 2.683 ± 0.059
1.526TyrPro: 1.526 ± 0.038
1.635TyrGln: 1.635 ± 0.038
1.5TyrArg: 1.5 ± 0.04
2.836TyrSer: 2.836 ± 0.061
2.355TyrThr: 2.355 ± 0.07
2.219TyrVal: 2.219 ± 0.043
0.55TyrTrp: 0.55 ± 0.025
1.971TyrTyr: 1.971 ± 0.053
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2974 proteins (985835 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski