Amino acid dipeptide frequency for Flavobacteria bacterium BAL38

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
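As a minimal sketch of how such frequencies could be computed (not necessarily the exact procedure used by Proteome-pI), the Python snippet below slides a two-residue window over each protein and reports every ordered pair in per mille. The function name `dipeptide_frequencies`, the one-letter input format, and the choice to count overlapping pairs within (never across) proteins are assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumptions (illustrative, not taken from the source): overlapping adjacent
    pairs are counted, pairs never span two proteins, and any non-standard
    symbol is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(seq, seq[1:]) yields every adjacent, overlapping residue pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two made-up toy sequences; 'U' is non-standard and becomes 'X'.
freqs = dipeptide_frequencies(["MKKLLA", "MKU"])
```

In this toy input the pair `MK` occurs twice among seven pairs in total, and no `AM` pair is produced because pairs are not formed across protein boundaries.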

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
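The uniform baseline and the unit conversion can be checked with a few lines of arithmetic. The sketch below converts three values copied from the table into fractions and compares them with the 1/400 expectation. The `classify` helper and its strict above/below rule are assumptions for illustration; the exact criterion behind the red/blue marking is not stated on this page.

```python
N_STANDARD = 20
uniform_permille = 1000.0 / N_STANDARD**2   # 1/400 = 0.25% = 2.5 per mille
uniform_with_xaa = 1000.0 / 21**2           # 1/441, if Xaa counts as a 21st residue

# Values in per mille, copied from the table for this proteome.
observed = {"LeuLeu": 8.455, "AspSer": 2.932, "CysCys": 0.096}

def classify(value_permille, expected=uniform_permille):
    """Label a dipeptide relative to the uniform expectation (illustrative rule)."""
    if value_permille > expected:
        return "overrepresented"
    if value_permille < expected:
        return "underrepresented"
    return "as expected"

for name, value in observed.items():
    fraction = value * 1e-3                  # per mille -> fraction
    print(f"{name}: {value} per mille = {fraction:.6f} ({classify(value)})")
```

Under this rule LeuLeu and AspSer fall above the 2.5‰ baseline, while CysCys falls well below it.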

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 4.058 ± 0.109
AlaCys: 0.532 ± 0.03
AlaAsp: 3.07 ± 0.074
AlaGlu: 3.767 ± 0.093
AlaPhe: 3.393 ± 0.07
AlaGly: 4.176 ± 0.113
AlaHis: 0.945 ± 0.034
AlaIle: 5.503 ± 0.102
AlaLys: 4.881 ± 0.09
AlaLeu: 5.691 ± 0.089
AlaMet: 1.447 ± 0.049
AlaAsn: 3.706 ± 0.096
AlaPro: 1.839 ± 0.054
AlaGln: 2.43 ± 0.058
AlaArg: 1.734 ± 0.055
AlaSer: 4.144 ± 0.121
AlaThr: 4.275 ± 0.133
AlaVal: 3.921 ± 0.082
AlaTrp: 0.589 ± 0.038
AlaTyr: 2.262 ± 0.053
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.496 ± 0.03
CysCys: 0.096 ± 0.013
CysAsp: 0.431 ± 0.024
CysGlu: 0.498 ± 0.029
CysPhe: 0.421 ± 0.026
CysGly: 0.61 ± 0.032
CysHis: 0.167 ± 0.016
CysIle: 0.592 ± 0.025
CysLys: 0.531 ± 0.026
CysLeu: 0.62 ± 0.03
CysMet: 0.148 ± 0.015
CysAsn: 0.507 ± 0.026
CysPro: 0.313 ± 0.024
CysGln: 0.233 ± 0.017
CysArg: 0.19 ± 0.015
CysSer: 0.665 ± 0.05
CysThr: 0.494 ± 0.043
CysVal: 0.463 ± 0.027
CysTrp: 0.058 ± 0.009
CysTyr: 0.325 ± 0.019
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.514 ± 0.075
AspCys: 0.422 ± 0.024
AspAsp: 2.603 ± 0.061
AspGlu: 3.658 ± 0.081
AspPhe: 3.747 ± 0.069
AspGly: 3.444 ± 0.121
AspHis: 0.635 ± 0.027
AspIle: 4.124 ± 0.071
AspLys: 4.283 ± 0.081
AspLeu: 4.931 ± 0.081
AspMet: 1.051 ± 0.041
AspAsn: 3.159 ± 0.077
AspPro: 1.304 ± 0.05
AspGln: 1.201 ± 0.04
AspArg: 1.525 ± 0.047
AspSer: 2.932 ± 0.064
AspThr: 2.514 ± 0.057
AspVal: 3.538 ± 0.081
AspTrp: 0.716 ± 0.034
AspTyr: 2.694 ± 0.063
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.089 ± 0.085
GluCys: 0.365 ± 0.02
GluAsp: 3.119 ± 0.074
GluGlu: 4.674 ± 0.101
GluPhe: 3.153 ± 0.071
GluGly: 3.459 ± 0.068
GluHis: 1.032 ± 0.036
GluIle: 6.197 ± 0.106
GluLys: 6.379 ± 0.109
GluLeu: 6.079 ± 0.105
GluMet: 1.732 ± 0.046
GluAsn: 5.372 ± 0.09
GluPro: 1.462 ± 0.047
GluGln: 2.205 ± 0.058
GluArg: 2.228 ± 0.063
GluSer: 3.232 ± 0.07
GluThr: 3.687 ± 0.07
GluVal: 4.323 ± 0.091
GluTrp: 0.578 ± 0.026
GluTyr: 2.26 ± 0.06
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.115 ± 0.074
PheCys: 0.508 ± 0.029
PheAsp: 3.335 ± 0.062
PheGlu: 3.766 ± 0.07
PhePhe: 3.11 ± 0.073
PheGly: 3.802 ± 0.061
PheHis: 0.879 ± 0.035
PheIle: 4.269 ± 0.093
PheLys: 3.956 ± 0.083
PheLeu: 4.978 ± 0.107
PheMet: 1.169 ± 0.039
PheAsn: 3.625 ± 0.072
PhePro: 1.815 ± 0.05
PheGln: 1.808 ± 0.048
PheArg: 1.577 ± 0.047
PheSer: 4.468 ± 0.08
PheThr: 3.353 ± 0.076
PheVal: 3.337 ± 0.069
PheTrp: 0.595 ± 0.027
PheTyr: 2.422 ± 0.06
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.103 ± 0.093
GlyCys: 0.647 ± 0.04
GlyAsp: 3.082 ± 0.076
GlyGlu: 3.27 ± 0.064
GlyPhe: 3.811 ± 0.079
GlyGly: 4.382 ± 0.111
GlyHis: 1.047 ± 0.039
GlyIle: 5.451 ± 0.093
GlyLys: 5.066 ± 0.096
GlyLeu: 5.475 ± 0.093
GlyMet: 1.595 ± 0.041
GlyAsn: 3.795 ± 0.079
GlyPro: 1.312 ± 0.05
GlyGln: 1.808 ± 0.055
GlyArg: 1.838 ± 0.06
GlySer: 4.012 ± 0.094
GlyThr: 4.303 ± 0.142
GlyVal: 4.237 ± 0.082
GlyTrp: 0.691 ± 0.034
GlyTyr: 2.68 ± 0.057
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.883 ± 0.035
HisCys: 0.165 ± 0.015
HisAsp: 0.796 ± 0.033
HisGlu: 0.957 ± 0.038
HisPhe: 1.207 ± 0.043
HisGly: 0.917 ± 0.036
HisHis: 0.449 ± 0.025
HisIle: 1.328 ± 0.036
HisLys: 1.077 ± 0.042
HisLeu: 1.596 ± 0.044
HisMet: 0.315 ± 0.018
HisAsn: 0.951 ± 0.03
HisPro: 0.785 ± 0.037
HisGln: 0.671 ± 0.027
HisArg: 0.554 ± 0.025
HisSer: 0.987 ± 0.041
HisThr: 0.801 ± 0.028
HisVal: 0.921 ± 0.036
HisTrp: 0.193 ± 0.014
HisTyr: 0.785 ± 0.032
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.643 ± 0.084
IleCys: 0.665 ± 0.033
IleAsp: 4.862 ± 0.076
IleGlu: 5.907 ± 0.096
IlePhe: 3.919 ± 0.082
IleGly: 5.266 ± 0.095
IleHis: 1.362 ± 0.046
IleIle: 6.713 ± 0.122
IleLys: 6.261 ± 0.112
IleLeu: 7.146 ± 0.118
IleMet: 1.454 ± 0.045
IleAsn: 5.128 ± 0.106
IlePro: 3.2 ± 0.066
IleGln: 2.933 ± 0.064
IleArg: 2.379 ± 0.06
IleSer: 6.167 ± 0.117
IleThr: 5.259 ± 0.104
IleVal: 5.189 ± 0.083
IleTrp: 0.704 ± 0.035
IleTyr: 2.991 ± 0.066
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.952 ± 0.095
LysCys: 0.395 ± 0.023
LysAsp: 4.042 ± 0.086
LysGlu: 6.286 ± 0.101
LysPhe: 3.275 ± 0.068
LysGly: 4.401 ± 0.088
LysHis: 1.239 ± 0.041
LysIle: 7.275 ± 0.114
LysLys: 7.489 ± 0.118
LysLeu: 6.731 ± 0.112
LysMet: 2.387 ± 0.066
LysAsn: 5.909 ± 0.1
LysPro: 2.443 ± 0.057
LysGln: 2.752 ± 0.063
LysArg: 2.566 ± 0.062
LysSer: 4.7 ± 0.081
LysThr: 4.67 ± 0.081
LysVal: 4.964 ± 0.086
LysTrp: 0.748 ± 0.029
LysTyr: 3.224 ± 0.069
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.541 ± 0.087
LeuCys: 0.598 ± 0.024
LeuAsp: 4.777 ± 0.085
LeuGlu: 5.882 ± 0.106
LeuPhe: 5.304 ± 0.104
LeuGly: 5.472 ± 0.094
LeuHis: 1.556 ± 0.048
LeuIle: 6.869 ± 0.112
LeuLys: 7.459 ± 0.115
LeuLeu: 8.455 ± 0.128
LeuMet: 1.91 ± 0.049
LeuAsn: 5.751 ± 0.107
LeuPro: 3.35 ± 0.069
LeuGln: 3.213 ± 0.07
LeuArg: 2.781 ± 0.061
LeuSer: 6.382 ± 0.097
LeuThr: 4.924 ± 0.085
LeuVal: 5.521 ± 0.087
LeuTrp: 0.698 ± 0.033
LeuTyr: 3.116 ± 0.059
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.516 ± 0.045
MetCys: 0.141 ± 0.013
MetAsp: 1.001 ± 0.035
MetGlu: 1.422 ± 0.044
MetPhe: 0.961 ± 0.044
MetGly: 1.398 ± 0.038
MetHis: 0.38 ± 0.025
MetIle: 1.65 ± 0.049
MetLys: 2.322 ± 0.057
MetLeu: 1.888 ± 0.051
MetMet: 0.641 ± 0.035
MetAsn: 1.447 ± 0.043
MetPro: 0.815 ± 0.032
MetGln: 0.9 ± 0.031
MetArg: 0.791 ± 0.033
MetSer: 1.449 ± 0.045
MetThr: 1.178 ± 0.042
MetVal: 1.374 ± 0.047
MetTrp: 0.171 ± 0.014
MetTyr: 0.771 ± 0.03
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.831 ± 0.076
AsnCys: 0.549 ± 0.027
AsnAsp: 3.331 ± 0.071
AsnGlu: 4.475 ± 0.08
AsnPhe: 3.549 ± 0.076
AsnGly: 4.381 ± 0.114
AsnHis: 1.13 ± 0.041
AsnIle: 4.997 ± 0.09
AsnLys: 4.842 ± 0.083
AsnLeu: 5.735 ± 0.096
AsnMet: 1.34 ± 0.041
AsnAsn: 4.474 ± 0.101
AsnPro: 3.051 ± 0.068
AsnGln: 2.738 ± 0.064
AsnArg: 2.052 ± 0.057
AsnSer: 4.401 ± 0.09
AsnThr: 3.783 ± 0.097
AsnVal: 3.77 ± 0.069
AsnTrp: 0.935 ± 0.039
AsnTyr: 3.265 ± 0.07
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.013 ± 0.074
ProCys: 0.198 ± 0.015
ProAsp: 1.631 ± 0.047
ProGlu: 2.552 ± 0.054
ProPhe: 1.952 ± 0.05
ProGly: 1.788 ± 0.048
ProHis: 0.525 ± 0.027
ProIle: 2.85 ± 0.061
ProLys: 2.552 ± 0.063
ProLeu: 2.599 ± 0.057
ProMet: 0.697 ± 0.029
ProAsn: 2.468 ± 0.061
ProPro: 0.666 ± 0.032
ProGln: 1.005 ± 0.038
ProArg: 0.812 ± 0.032
ProSer: 2.234 ± 0.052
ProThr: 2.294 ± 0.105
ProVal: 2.191 ± 0.066
ProTrp: 0.259 ± 0.016
ProTyr: 1.317 ± 0.044
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.884 ± 0.049
GlnCys: 0.203 ± 0.019
GlnAsp: 1.623 ± 0.046
GlnGlu: 2.289 ± 0.053
GlnPhe: 1.91 ± 0.046
GlnGly: 1.687 ± 0.047
GlnHis: 0.575 ± 0.029
GlnIle: 2.906 ± 0.053
GlnLys: 2.983 ± 0.071
GlnLeu: 3.471 ± 0.07
GlnMet: 0.917 ± 0.033
GlnAsn: 2.58 ± 0.06
GlnPro: 1.021 ± 0.036
GlnGln: 1.298 ± 0.046
GlnArg: 0.981 ± 0.034
GlnSer: 1.937 ± 0.045
GlnThr: 1.948 ± 0.051
GlnVal: 1.937 ± 0.05
GlnTrp: 0.316 ± 0.021
GlnTyr: 1.384 ± 0.05
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.814 ± 0.05
ArgCys: 0.173 ± 0.015
ArgAsp: 1.546 ± 0.048
ArgGlu: 1.911 ± 0.056
ArgPhe: 1.782 ± 0.048
ArgGly: 1.733 ± 0.048
ArgHis: 0.504 ± 0.027
ArgIle: 2.625 ± 0.066
ArgLys: 2.577 ± 0.061
ArgLeu: 2.717 ± 0.065
ArgMet: 0.823 ± 0.032
ArgAsn: 2.046 ± 0.053
ArgPro: 0.953 ± 0.04
ArgGln: 0.908 ± 0.033
ArgArg: 1.099 ± 0.044
ArgSer: 1.644 ± 0.051
ArgThr: 1.638 ± 0.046
ArgVal: 1.891 ± 0.047
ArgTrp: 0.302 ± 0.022
ArgTyr: 1.313 ± 0.041
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.758 ± 0.105
SerCys: 0.731 ± 0.042
SerAsp: 3.495 ± 0.063
SerGlu: 4.097 ± 0.072
SerPhe: 4.067 ± 0.084
SerGly: 4.716 ± 0.098
SerHis: 1.065 ± 0.038
SerIle: 5.413 ± 0.092
SerLys: 5.15 ± 0.088
SerLeu: 5.728 ± 0.094
SerMet: 1.272 ± 0.039
SerAsn: 4.523 ± 0.11
SerPro: 1.997 ± 0.062
SerGln: 2.146 ± 0.057
SerArg: 1.859 ± 0.055
SerSer: 4.445 ± 0.116
SerThr: 3.689 ± 0.101
SerVal: 4.1 ± 0.1
SerTrp: 0.661 ± 0.031
SerTyr: 2.795 ± 0.057
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.827 ± 0.111
ThrCys: 0.45 ± 0.039
ThrAsp: 2.988 ± 0.068
ThrGlu: 3.403 ± 0.068
ThrPhe: 3.404 ± 0.08
ThrGly: 4.033 ± 0.107
ThrHis: 0.917 ± 0.033
ThrIle: 5.631 ± 0.119
ThrLys: 4.173 ± 0.079
ThrLeu: 5.181 ± 0.1
ThrMet: 0.983 ± 0.041
ThrAsn: 3.821 ± 0.091
ThrPro: 2.611 ± 0.096
ThrGln: 1.838 ± 0.04
ThrArg: 1.47 ± 0.046
ThrSer: 4.131 ± 0.121
ThrThr: 4.092 ± 0.14
ThrVal: 3.69 ± 0.126
ThrTrp: 0.617 ± 0.031
ThrTyr: 2.445 ± 0.08
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.436 ± 0.097
ValCys: 0.54 ± 0.027
ValAsp: 3.387 ± 0.074
ValGlu: 3.839 ± 0.09
ValPhe: 3.56 ± 0.065
ValGly: 3.856 ± 0.078
ValHis: 0.956 ± 0.034
ValIle: 5.122 ± 0.082
ValLys: 4.542 ± 0.083
ValLeu: 5.887 ± 0.1
ValMet: 1.307 ± 0.037
ValAsn: 3.772 ± 0.072
ValPro: 2.078 ± 0.05
ValGln: 1.826 ± 0.052
ValArg: 1.866 ± 0.047
ValSer: 4.417 ± 0.093
ValThr: 3.92 ± 0.118
ValVal: 4.469 ± 0.099
ValTrp: 0.547 ± 0.025
ValTyr: 2.392 ± 0.062
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.563 ± 0.028
TrpCys: 0.1 ± 0.011
TrpAsp: 0.565 ± 0.031
TrpGlu: 0.628 ± 0.028
TrpPhe: 0.583 ± 0.025
TrpGly: 0.598 ± 0.036
TrpHis: 0.209 ± 0.016
TrpIle: 0.787 ± 0.033
TrpLys: 0.802 ± 0.032
TrpLeu: 0.886 ± 0.033
TrpMet: 0.281 ± 0.018
TrpAsn: 0.836 ± 0.035
TrpPro: 0.191 ± 0.018
TrpGln: 0.385 ± 0.027
TrpArg: 0.339 ± 0.023
TrpSer: 0.589 ± 0.032
TrpThr: 0.532 ± 0.027
TrpVal: 0.559 ± 0.026
TrpTrp: 0.122 ± 0.013
TrpTyr: 0.427 ± 0.028
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.329 ± 0.06
TyrCys: 0.381 ± 0.021
TyrAsp: 2.309 ± 0.062
TyrGlu: 2.345 ± 0.06
TyrPhe: 2.778 ± 0.068
TyrGly: 2.48 ± 0.054
TyrHis: 0.767 ± 0.032
TyrIle: 2.845 ± 0.059
TyrLys: 3.124 ± 0.066
TyrLeu: 3.764 ± 0.062
TyrMet: 0.749 ± 0.031
TyrAsn: 2.76 ± 0.068
TyrPro: 1.397 ± 0.04
TyrGln: 1.553 ± 0.047
TyrArg: 1.358 ± 0.036
TyrSer: 2.773 ± 0.066
TyrThr: 2.352 ± 0.082
TyrVal: 2.323 ± 0.061
TyrTrp: 0.481 ± 0.022
TyrTyr: 1.894 ± 0.052
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2588 proteins (844067 amino acids)
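
These totals allow a rough conversion from per mille back to absolute counts. The sketch below assumes that dipeptides are counted only within proteins (so a protein of length L contributes L - 1 pairs) and that frequencies are normalized by the total number of such pairs. Neither assumption is stated on this page, so the resulting count is only an estimate.

```python
n_proteins = 2588
n_residues = 844067

# A protein of length L contains L - 1 adjacent pairs, so (assuming pairs are
# counted within proteins only) the proteome contains:
n_dipeptides = n_residues - n_proteins

ala_ala_permille = 4.058   # AlaAla value from the table
approx_count = ala_ala_permille * 1e-3 * n_dipeptides
print(n_dipeptides, round(approx_count))
```

Under these assumptions the proteome holds 841479 adjacent pairs, of which roughly 3,400 would be AlaAla.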

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
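
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The helper names, the toy sequences, and the use of the standard deviation as the error are assumptions for illustration; the page does not specify which spread statistic is reported.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std) in per mille."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))  # protein-level resampling
        estimates.append(pair_frequency(sample, pair))
    return mean(estimates), stdev(estimates)

toy_proteome = ["MKKLLA", "MALLKK", "MKLAKL", "MLLLKA"]  # made-up sequences
m, s = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so adjacent pairs inside a protein are never broken up by the resampling.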

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski