Amino acid dipepetide frequency for Bacteroides sp. OF04-15BH

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.008AlaAla: 6.008 ± 0.087
1.141AlaCys: 1.141 ± 0.035
4.376AlaAsp: 4.376 ± 0.072
4.962AlaGlu: 4.962 ± 0.08
3.303AlaPhe: 3.303 ± 0.064
5.063AlaGly: 5.063 ± 0.089
1.363AlaHis: 1.363 ± 0.033
4.179AlaIle: 4.179 ± 0.063
3.643AlaLys: 3.643 ± 0.067
7.494AlaLeu: 7.494 ± 0.097
1.832AlaMet: 1.832 ± 0.038
2.861AlaAsn: 2.861 ± 0.053
2.37AlaPro: 2.37 ± 0.054
3.259AlaGln: 3.259 ± 0.054
3.566AlaArg: 3.566 ± 0.066
4.282AlaSer: 4.282 ± 0.07
3.856AlaThr: 3.856 ± 0.059
5.073AlaVal: 5.073 ± 0.074
0.885AlaTrp: 0.885 ± 0.031
3.037AlaTyr: 3.037 ± 0.063
0.0AlaXaa: 0.0 ± 0.0
Cys
0.954CysAla: 0.954 ± 0.033
0.296CysCys: 0.296 ± 0.016
0.708CysAsp: 0.708 ± 0.03
0.706CysGlu: 0.706 ± 0.027
0.703CysPhe: 0.703 ± 0.026
1.173CysGly: 1.173 ± 0.035
0.333CysHis: 0.333 ± 0.018
0.951CysIle: 0.951 ± 0.034
0.737CysLys: 0.737 ± 0.026
1.231CysLeu: 1.231 ± 0.038
0.368CysMet: 0.368 ± 0.02
0.592CysAsn: 0.592 ± 0.023
0.619CysPro: 0.619 ± 0.03
0.438CysGln: 0.438 ± 0.021
0.79CysArg: 0.79 ± 0.027
0.886CysSer: 0.886 ± 0.027
0.686CysThr: 0.686 ± 0.028
0.846CysVal: 0.846 ± 0.031
0.177CysTrp: 0.177 ± 0.013
0.523CysTyr: 0.523 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.903AspAla: 3.903 ± 0.075
0.724AspCys: 0.724 ± 0.026
2.473AspAsp: 2.473 ± 0.06
3.91AspGlu: 3.91 ± 0.059
3.076AspPhe: 3.076 ± 0.048
4.095AspGly: 4.095 ± 0.081
0.911AspHis: 0.911 ± 0.033
3.581AspIle: 3.581 ± 0.056
3.306AspLys: 3.306 ± 0.066
5.05AspLeu: 5.05 ± 0.066
1.514AspMet: 1.514 ± 0.039
2.345AspAsn: 2.345 ± 0.057
1.868AspPro: 1.868 ± 0.043
1.661AspGln: 1.661 ± 0.041
2.792AspArg: 2.792 ± 0.044
3.156AspSer: 3.156 ± 0.068
2.826AspThr: 2.826 ± 0.059
3.345AspVal: 3.345 ± 0.058
0.822AspTrp: 0.822 ± 0.029
2.743AspTyr: 2.743 ± 0.05
0.0AspXaa: 0.0 ± 0.0
Glu
5.12GluAla: 5.12 ± 0.075
0.742GluCys: 0.742 ± 0.025
3.369GluAsp: 3.369 ± 0.057
5.493GluGlu: 5.493 ± 0.096
2.436GluPhe: 2.436 ± 0.048
4.089GluGly: 4.089 ± 0.056
1.376GluHis: 1.376 ± 0.034
4.469GluIle: 4.469 ± 0.076
5.321GluLys: 5.321 ± 0.077
6.287GluLeu: 6.287 ± 0.086
2.029GluMet: 2.029 ± 0.043
3.65GluAsn: 3.65 ± 0.061
1.826GluPro: 1.826 ± 0.046
3.424GluGln: 3.424 ± 0.073
3.556GluArg: 3.556 ± 0.062
3.264GluSer: 3.264 ± 0.06
3.436GluThr: 3.436 ± 0.056
4.206GluVal: 4.206 ± 0.071
0.849GluTrp: 0.849 ± 0.027
2.724GluTyr: 2.724 ± 0.049
0.0GluXaa: 0.0 ± 0.0
Phe
2.989PheAla: 2.989 ± 0.051
0.837PheCys: 0.837 ± 0.025
2.661PheAsp: 2.661 ± 0.046
2.548PheGlu: 2.548 ± 0.048
2.228PhePhe: 2.228 ± 0.05
3.044PheGly: 3.044 ± 0.063
0.9PheHis: 0.9 ± 0.026
2.859PheIle: 2.859 ± 0.062
2.335PheLys: 2.335 ± 0.047
4.447PheLeu: 4.447 ± 0.075
1.252PheMet: 1.252 ± 0.036
2.066PheAsn: 2.066 ± 0.053
1.721PhePro: 1.721 ± 0.043
1.468PheGln: 1.468 ± 0.032
2.349PheArg: 2.349 ± 0.046
3.229PheSer: 3.229 ± 0.054
2.607PheThr: 2.607 ± 0.054
2.928PheVal: 2.928 ± 0.061
0.634PheTrp: 0.634 ± 0.027
2.037PheTyr: 2.037 ± 0.05
0.0PheXaa: 0.0 ± 0.0
Gly
4.485GlyAla: 4.485 ± 0.083
1.038GlyCys: 1.038 ± 0.039
3.255GlyAsp: 3.255 ± 0.058
3.993GlyGlu: 3.993 ± 0.061
3.145GlyPhe: 3.145 ± 0.057
4.668GlyGly: 4.668 ± 0.089
1.223GlyHis: 1.223 ± 0.03
4.865GlyIle: 4.865 ± 0.073
5.133GlyLys: 5.133 ± 0.084
5.826GlyLeu: 5.826 ± 0.073
2.158GlyMet: 2.158 ± 0.048
3.505GlyAsn: 3.505 ± 0.069
1.38GlyPro: 1.38 ± 0.04
2.253GlyGln: 2.253 ± 0.047
3.069GlyArg: 3.069 ± 0.058
4.078GlySer: 4.078 ± 0.066
3.829GlyThr: 3.829 ± 0.065
4.739GlyVal: 4.739 ± 0.069
1.0GlyTrp: 1.0 ± 0.032
3.12GlyTyr: 3.12 ± 0.06
0.001GlyXaa: 0.001 ± 0.001
His
1.281HisAla: 1.281 ± 0.035
0.332HisCys: 0.332 ± 0.015
0.986HisAsp: 0.986 ± 0.031
1.179HisGlu: 1.179 ± 0.035
1.031HisPhe: 1.031 ± 0.028
1.206HisGly: 1.206 ± 0.035
0.558HisHis: 0.558 ± 0.029
1.39HisIle: 1.39 ± 0.036
1.002HisLys: 1.002 ± 0.035
1.926HisLeu: 1.926 ± 0.043
0.338HisMet: 0.338 ± 0.019
0.81HisAsn: 0.81 ± 0.025
1.073HisPro: 1.073 ± 0.033
0.744HisGln: 0.744 ± 0.027
0.969HisArg: 0.969 ± 0.034
1.071HisSer: 1.071 ± 0.036
1.105HisThr: 1.105 ± 0.027
1.211HisVal: 1.211 ± 0.034
0.257HisTrp: 0.257 ± 0.015
0.971HisTyr: 0.971 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
4.753IleAla: 4.753 ± 0.071
0.968IleCys: 0.968 ± 0.029
3.698IleAsp: 3.698 ± 0.061
4.102IleGlu: 4.102 ± 0.066
2.618IlePhe: 2.618 ± 0.051
4.114IleGly: 4.114 ± 0.07
1.309IleHis: 1.309 ± 0.034
3.929IleIle: 3.929 ± 0.073
3.831IleLys: 3.831 ± 0.074
5.905IleLeu: 5.905 ± 0.094
1.44IleMet: 1.44 ± 0.034
3.163IleAsn: 3.163 ± 0.063
3.0IlePro: 3.0 ± 0.057
2.449IleGln: 2.449 ± 0.05
3.521IleArg: 3.521 ± 0.067
4.122IleSer: 4.122 ± 0.064
3.681IleThr: 3.681 ± 0.06
4.011IleVal: 4.011 ± 0.063
0.646IleTrp: 0.646 ± 0.024
2.61IleTyr: 2.61 ± 0.051
0.0IleXaa: 0.0 ± 0.0
Lys
4.79LysAla: 4.79 ± 0.083
0.574LysCys: 0.574 ± 0.024
3.785LysAsp: 3.785 ± 0.068
5.563LysGlu: 5.563 ± 0.082
1.988LysPhe: 1.988 ± 0.047
4.323LysGly: 4.323 ± 0.068
1.081LysHis: 1.081 ± 0.027
3.977LysIle: 3.977 ± 0.068
5.216LysLys: 5.216 ± 0.091
4.965LysLeu: 4.965 ± 0.073
2.122LysMet: 2.122 ± 0.041
3.507LysAsn: 3.507 ± 0.063
2.127LysPro: 2.127 ± 0.055
2.649LysGln: 2.649 ± 0.054
3.193LysArg: 3.193 ± 0.054
3.365LysSer: 3.365 ± 0.055
3.412LysThr: 3.412 ± 0.061
4.05LysVal: 4.05 ± 0.069
0.709LysTrp: 0.709 ± 0.027
2.719LysTyr: 2.719 ± 0.051
0.0LysXaa: 0.0 ± 0.0
Leu
6.652LeuAla: 6.652 ± 0.091
1.484LeuCys: 1.484 ± 0.037
4.789LeuAsp: 4.789 ± 0.071
5.547LeuGlu: 5.547 ± 0.072
4.654LeuPhe: 4.654 ± 0.084
5.688LeuGly: 5.688 ± 0.083
1.962LeuHis: 1.962 ± 0.039
5.41LeuIle: 5.41 ± 0.087
6.446LeuLys: 6.446 ± 0.084
9.645LeuLeu: 9.645 ± 0.136
2.532LeuMet: 2.532 ± 0.052
4.726LeuAsn: 4.726 ± 0.074
4.243LeuPro: 4.243 ± 0.059
3.959LeuGln: 3.959 ± 0.068
4.521LeuArg: 4.521 ± 0.07
6.729LeuSer: 6.729 ± 0.102
5.25LeuThr: 5.25 ± 0.067
5.397LeuVal: 5.397 ± 0.085
1.172LeuTrp: 1.172 ± 0.037
3.805LeuTyr: 3.805 ± 0.061
0.001LeuXaa: 0.001 ± 0.001
Met
2.165MetAla: 2.165 ± 0.042
0.257MetCys: 0.257 ± 0.016
1.549MetAsp: 1.549 ± 0.037
1.945MetGlu: 1.945 ± 0.041
0.95MetPhe: 0.95 ± 0.033
1.836MetGly: 1.836 ± 0.043
0.494MetHis: 0.494 ± 0.021
1.635MetIle: 1.635 ± 0.039
2.367MetLys: 2.367 ± 0.046
2.59MetLeu: 2.59 ± 0.05
0.842MetMet: 0.842 ± 0.029
1.577MetAsn: 1.577 ± 0.039
1.203MetPro: 1.203 ± 0.031
1.191MetGln: 1.191 ± 0.033
1.387MetArg: 1.387 ± 0.038
1.527MetSer: 1.527 ± 0.037
1.443MetThr: 1.443 ± 0.039
1.613MetVal: 1.613 ± 0.044
0.226MetTrp: 0.226 ± 0.015
0.81MetTyr: 0.81 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
3.33AsnAla: 3.33 ± 0.063
0.526AsnCys: 0.526 ± 0.026
2.381AsnAsp: 2.381 ± 0.05
3.078AsnGlu: 3.078 ± 0.056
2.079AsnPhe: 2.079 ± 0.044
3.783AsnGly: 3.783 ± 0.071
0.931AsnHis: 0.931 ± 0.031
3.355AsnIle: 3.355 ± 0.055
3.017AsnLys: 3.017 ± 0.063
4.292AsnLeu: 4.292 ± 0.078
1.376AsnMet: 1.376 ± 0.038
2.377AsnAsn: 2.377 ± 0.063
2.45AsnPro: 2.45 ± 0.053
1.77AsnGln: 1.77 ± 0.039
2.679AsnArg: 2.679 ± 0.055
2.641AsnSer: 2.641 ± 0.058
2.616AsnThr: 2.616 ± 0.054
2.888AsnVal: 2.888 ± 0.057
0.645AsnTrp: 0.645 ± 0.027
2.15AsnTyr: 2.15 ± 0.051
0.0AsnXaa: 0.0 ± 0.0
Pro
2.919ProAla: 2.919 ± 0.059
0.39ProCys: 0.39 ± 0.023
2.556ProAsp: 2.556 ± 0.05
3.735ProGlu: 3.735 ± 0.055
1.796ProPhe: 1.796 ± 0.042
2.392ProGly: 2.392 ± 0.052
0.737ProHis: 0.737 ± 0.03
2.105ProIle: 2.105 ± 0.044
2.04ProLys: 2.04 ± 0.042
3.35ProLeu: 3.35 ± 0.057
0.968ProMet: 0.968 ± 0.032
1.616ProAsn: 1.616 ± 0.044
0.819ProPro: 0.819 ± 0.034
1.678ProGln: 1.678 ± 0.047
1.258ProArg: 1.258 ± 0.035
2.018ProSer: 2.018 ± 0.045
1.936ProThr: 1.936 ± 0.045
3.079ProVal: 3.079 ± 0.057
0.453ProTrp: 0.453 ± 0.021
1.714ProTyr: 1.714 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
3.04GlnAla: 3.04 ± 0.054
0.36GlnCys: 0.36 ± 0.018
1.94GlnAsp: 1.94 ± 0.04
2.964GlnGlu: 2.964 ± 0.056
1.44GlnPhe: 1.44 ± 0.037
2.435GlnGly: 2.435 ± 0.048
0.719GlnHis: 0.719 ± 0.023
2.792GlnIle: 2.792 ± 0.049
2.975GlnLys: 2.975 ± 0.058
3.448GlnLeu: 3.448 ± 0.069
1.234GlnMet: 1.234 ± 0.035
2.092GlnAsn: 2.092 ± 0.048
1.373GlnPro: 1.373 ± 0.035
1.908GlnGln: 1.908 ± 0.054
1.762GlnArg: 1.762 ± 0.047
2.096GlnSer: 2.096 ± 0.046
2.411GlnThr: 2.411 ± 0.049
2.504GlnVal: 2.504 ± 0.054
0.523GlnTrp: 0.523 ± 0.022
1.584GlnTyr: 1.584 ± 0.04
0.0GlnXaa: 0.0 ± 0.0
Arg
2.903ArgAla: 2.903 ± 0.054
0.548ArgCys: 0.548 ± 0.023
2.232ArgAsp: 2.232 ± 0.044
3.254ArgGlu: 3.254 ± 0.061
2.449ArgPhe: 2.449 ± 0.045
2.462ArgGly: 2.462 ± 0.049
1.053ArgHis: 1.053 ± 0.035
3.833ArgIle: 3.833 ± 0.076
3.866ArgLys: 3.866 ± 0.068
4.792ArgLeu: 4.792 ± 0.067
1.717ArgMet: 1.717 ± 0.047
2.721ArgAsn: 2.721 ± 0.058
1.777ArgPro: 1.777 ± 0.037
2.17ArgGln: 2.17 ± 0.05
2.571ArgArg: 2.571 ± 0.067
2.606ArgSer: 2.606 ± 0.058
2.669ArgThr: 2.669 ± 0.05
2.916ArgVal: 2.916 ± 0.059
0.673ArgTrp: 0.673 ± 0.026
2.381ArgTyr: 2.381 ± 0.049
0.001ArgXaa: 0.001 ± 0.001
Ser
4.401SerAla: 4.401 ± 0.068
0.949SerCys: 0.949 ± 0.032
3.279SerAsp: 3.279 ± 0.061
3.704SerGlu: 3.704 ± 0.068
3.224SerPhe: 3.224 ± 0.055
4.581SerGly: 4.581 ± 0.078
1.004SerHis: 1.004 ± 0.033
3.811SerIle: 3.811 ± 0.067
3.224SerLys: 3.224 ± 0.051
6.227SerLeu: 6.227 ± 0.095
1.533SerMet: 1.533 ± 0.039
2.467SerAsn: 2.467 ± 0.049
2.197SerPro: 2.197 ± 0.045
1.946SerGln: 1.946 ± 0.041
2.784SerArg: 2.784 ± 0.049
3.792SerSer: 3.792 ± 0.076
2.976SerThr: 2.976 ± 0.056
4.226SerVal: 4.226 ± 0.071
0.84SerTrp: 0.84 ± 0.031
2.65SerTyr: 2.65 ± 0.057
0.001SerXaa: 0.001 ± 0.001
Thr
4.26ThrAla: 4.26 ± 0.061
0.636ThrCys: 0.636 ± 0.028
3.549ThrAsp: 3.549 ± 0.055
3.656ThrGlu: 3.656 ± 0.071
2.543ThrPhe: 2.543 ± 0.05
4.07ThrGly: 4.07 ± 0.066
1.032ThrHis: 1.032 ± 0.031
3.341ThrIle: 3.341 ± 0.059
2.439ThrLys: 2.439 ± 0.055
5.647ThrLeu: 5.647 ± 0.075
1.156ThrMet: 1.156 ± 0.037
2.222ThrAsn: 2.222 ± 0.051
2.782ThrPro: 2.782 ± 0.056
1.91ThrGln: 1.91 ± 0.043
2.278ThrArg: 2.278 ± 0.05
3.115ThrSer: 3.115 ± 0.061
3.092ThrThr: 3.092 ± 0.062
4.051ThrVal: 4.051 ± 0.074
0.657ThrTrp: 0.657 ± 0.024
2.301ThrTyr: 2.301 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
4.726ValAla: 4.726 ± 0.07
1.14ValCys: 1.14 ± 0.031
3.458ValAsp: 3.458 ± 0.068
3.979ValGlu: 3.979 ± 0.065
2.962ValPhe: 2.962 ± 0.061
3.902ValGly: 3.902 ± 0.07
1.185ValHis: 1.185 ± 0.034
3.948ValIle: 3.948 ± 0.073
3.863ValLys: 3.863 ± 0.06
6.159ValLeu: 6.159 ± 0.088
1.675ValMet: 1.675 ± 0.039
3.031ValAsn: 3.031 ± 0.053
2.667ValPro: 2.667 ± 0.05
2.289ValGln: 2.289 ± 0.045
3.536ValArg: 3.536 ± 0.056
4.481ValSer: 4.481 ± 0.062
3.76ValThr: 3.76 ± 0.066
4.395ValVal: 4.395 ± 0.069
0.847ValTrp: 0.847 ± 0.029
2.729ValTyr: 2.729 ± 0.052
0.0ValXaa: 0.0 ± 0.0
Trp
0.78TrpAla: 0.78 ± 0.026
0.186TrpCys: 0.186 ± 0.013
0.659TrpAsp: 0.659 ± 0.025
0.748TrpGlu: 0.748 ± 0.028
0.563TrpPhe: 0.563 ± 0.024
0.966TrpGly: 0.966 ± 0.033
0.304TrpHis: 0.304 ± 0.021
0.818TrpIle: 0.818 ± 0.029
0.918TrpLys: 0.918 ± 0.03
1.217TrpLeu: 1.217 ± 0.034
0.444TrpMet: 0.444 ± 0.021
0.876TrpAsn: 0.876 ± 0.031
0.289TrpPro: 0.289 ± 0.016
0.575TrpGln: 0.575 ± 0.024
0.579TrpArg: 0.579 ± 0.02
0.737TrpSer: 0.737 ± 0.03
0.694TrpThr: 0.694 ± 0.027
0.752TrpVal: 0.752 ± 0.026
0.166TrpTrp: 0.166 ± 0.011
0.5TrpTyr: 0.5 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.092TyrAla: 3.092 ± 0.06
0.592TyrCys: 0.592 ± 0.021
2.399TyrAsp: 2.399 ± 0.051
2.602TyrGlu: 2.602 ± 0.046
1.949TyrPhe: 1.949 ± 0.042
2.871TyrGly: 2.871 ± 0.056
0.968TyrHis: 0.968 ± 0.033
2.62TyrIle: 2.62 ± 0.052
2.435TyrLys: 2.435 ± 0.056
4.111TyrLeu: 4.111 ± 0.069
1.071TyrMet: 1.071 ± 0.031
2.161TyrAsn: 2.161 ± 0.056
1.774TyrPro: 1.774 ± 0.039
1.86TyrGln: 1.86 ± 0.047
2.434TyrArg: 2.434 ± 0.053
2.62TyrSer: 2.62 ± 0.05
2.456TyrThr: 2.456 ± 0.048
2.503TyrVal: 2.503 ± 0.058
0.592TyrTrp: 0.592 ± 0.025
2.124TyrTyr: 2.124 ± 0.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.001XaaMet: 0.001 ± 0.001
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.001
0.007XaaXaa: 0.007 ± 0.004
Statistics based on 3052 proteins (1099428 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski