Amino acid dipepetide frequency for Tannerella forsythia (strain ATCC 43037 / JCM 10827 / CCUG 33226 / KCTC 5666 / FDC 338) (Bacteroides forsythus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.681AlaAla: 5.681 ± 0.09
0.968AlaCys: 0.968 ± 0.035
4.206AlaAsp: 4.206 ± 0.07
4.669AlaGlu: 4.669 ± 0.085
3.319AlaPhe: 3.319 ± 0.063
5.464AlaGly: 5.464 ± 0.088
1.325AlaHis: 1.325 ± 0.038
4.393AlaIle: 4.393 ± 0.081
3.638AlaLys: 3.638 ± 0.069
6.845AlaLeu: 6.845 ± 0.095
1.923AlaMet: 1.923 ± 0.05
2.798AlaAsn: 2.798 ± 0.063
2.347AlaPro: 2.347 ± 0.058
2.665AlaGln: 2.665 ± 0.054
3.579AlaArg: 3.579 ± 0.066
4.526AlaSer: 4.526 ± 0.07
3.851AlaThr: 3.851 ± 0.076
4.99AlaVal: 4.99 ± 0.084
0.832AlaTrp: 0.832 ± 0.028
2.958AlaTyr: 2.958 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.703CysAla: 0.703 ± 0.025
0.197CysCys: 0.197 ± 0.014
0.624CysAsp: 0.624 ± 0.026
0.69CysGlu: 0.69 ± 0.032
0.618CysPhe: 0.618 ± 0.028
1.005CysGly: 1.005 ± 0.036
0.308CysHis: 0.308 ± 0.022
0.835CysIle: 0.835 ± 0.03
0.653CysLys: 0.653 ± 0.029
1.124CysLeu: 1.124 ± 0.038
0.346CysMet: 0.346 ± 0.018
0.502CysAsn: 0.502 ± 0.024
0.499CysPro: 0.499 ± 0.028
0.282CysGln: 0.282 ± 0.017
0.814CysArg: 0.814 ± 0.032
0.797CysSer: 0.797 ± 0.033
0.638CysThr: 0.638 ± 0.026
0.731CysVal: 0.731 ± 0.029
0.147CysTrp: 0.147 ± 0.014
0.487CysTyr: 0.487 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.912AspAla: 3.912 ± 0.066
0.612AspCys: 0.612 ± 0.026
2.671AspAsp: 2.671 ± 0.056
3.992AspGlu: 3.992 ± 0.065
3.006AspPhe: 3.006 ± 0.057
4.055AspGly: 4.055 ± 0.078
1.072AspHis: 1.072 ± 0.035
3.891AspIle: 3.891 ± 0.064
3.508AspLys: 3.508 ± 0.068
4.541AspLeu: 4.541 ± 0.067
1.573AspMet: 1.573 ± 0.04
2.221AspAsn: 2.221 ± 0.054
1.991AspPro: 1.991 ± 0.046
1.202AspGln: 1.202 ± 0.041
3.131AspArg: 3.131 ± 0.067
2.833AspSer: 2.833 ± 0.063
2.959AspThr: 2.959 ± 0.064
3.448AspVal: 3.448 ± 0.061
0.769AspTrp: 0.769 ± 0.028
2.797AspTyr: 2.797 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
5.096GluAla: 5.096 ± 0.092
0.584GluCys: 0.584 ± 0.027
2.778GluAsp: 2.778 ± 0.052
4.731GluGlu: 4.731 ± 0.094
2.296GluPhe: 2.296 ± 0.048
3.923GluGly: 3.923 ± 0.065
1.37GluHis: 1.37 ± 0.037
4.433GluIle: 4.433 ± 0.071
5.218GluLys: 5.218 ± 0.091
5.722GluLeu: 5.722 ± 0.087
1.995GluMet: 1.995 ± 0.053
3.254GluAsn: 3.254 ± 0.065
1.819GluPro: 1.819 ± 0.041
2.734GluGln: 2.734 ± 0.062
3.829GluArg: 3.829 ± 0.065
3.081GluSer: 3.081 ± 0.057
3.772GluThr: 3.772 ± 0.069
4.133GluVal: 4.133 ± 0.069
0.8GluTrp: 0.8 ± 0.029
2.594GluTyr: 2.594 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
3.235PheAla: 3.235 ± 0.063
0.669PheCys: 0.669 ± 0.027
2.844PheAsp: 2.844 ± 0.053
2.621PheGlu: 2.621 ± 0.058
2.591PhePhe: 2.591 ± 0.064
3.419PheGly: 3.419 ± 0.055
0.99PheHis: 0.99 ± 0.032
3.026PheIle: 3.026 ± 0.064
2.275PheLys: 2.275 ± 0.052
4.128PheLeu: 4.128 ± 0.081
1.266PheMet: 1.266 ± 0.035
2.168PheAsn: 2.168 ± 0.047
1.835PhePro: 1.835 ± 0.047
1.22PheGln: 1.22 ± 0.028
2.611PheArg: 2.611 ± 0.054
3.443PheSer: 3.443 ± 0.07
2.614PheThr: 2.614 ± 0.056
3.118PheVal: 3.118 ± 0.065
0.524PheTrp: 0.524 ± 0.023
2.062PheTyr: 2.062 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
4.591GlyAla: 4.591 ± 0.084
0.938GlyCys: 0.938 ± 0.034
3.344GlyAsp: 3.344 ± 0.064
4.031GlyGlu: 4.031 ± 0.072
3.209GlyPhe: 3.209 ± 0.054
4.978GlyGly: 4.978 ± 0.102
1.355GlyHis: 1.355 ± 0.04
5.219GlyIle: 5.219 ± 0.088
5.106GlyLys: 5.106 ± 0.085
5.976GlyLeu: 5.976 ± 0.087
2.141GlyMet: 2.141 ± 0.047
3.302GlyAsn: 3.302 ± 0.067
1.21GlyPro: 1.21 ± 0.036
2.239GlyGln: 2.239 ± 0.046
3.654GlyArg: 3.654 ± 0.062
3.961GlySer: 3.961 ± 0.074
4.186GlyThr: 4.186 ± 0.073
4.527GlyVal: 4.527 ± 0.082
1.018GlyTrp: 1.018 ± 0.037
3.388GlyTyr: 3.388 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
1.435HisAla: 1.435 ± 0.041
0.324HisCys: 0.324 ± 0.018
1.038HisAsp: 1.038 ± 0.035
1.132HisGlu: 1.132 ± 0.034
1.129HisPhe: 1.129 ± 0.037
1.331HisGly: 1.331 ± 0.041
0.58HisHis: 0.58 ± 0.025
1.586HisIle: 1.586 ± 0.043
1.0HisLys: 1.0 ± 0.033
2.055HisLeu: 2.055 ± 0.052
0.373HisMet: 0.373 ± 0.02
0.881HisAsn: 0.881 ± 0.028
1.262HisPro: 1.262 ± 0.04
0.609HisGln: 0.609 ± 0.029
1.193HisArg: 1.193 ± 0.035
1.216HisSer: 1.216 ± 0.039
1.326HisThr: 1.326 ± 0.042
1.15HisVal: 1.15 ± 0.036
0.256HisTrp: 0.256 ± 0.016
0.983HisTyr: 0.983 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.027IleAla: 5.027 ± 0.082
0.887IleCys: 0.887 ± 0.032
4.39IleAsp: 4.39 ± 0.073
4.449IleGlu: 4.449 ± 0.075
2.852IlePhe: 2.852 ± 0.061
4.836IleGly: 4.836 ± 0.087
1.474IleHis: 1.474 ± 0.042
4.085IleIle: 4.085 ± 0.085
3.663IleLys: 3.663 ± 0.07
5.668IleLeu: 5.668 ± 0.092
1.318IleMet: 1.318 ± 0.044
3.002IleAsn: 3.002 ± 0.066
3.188IlePro: 3.188 ± 0.06
2.031IleGln: 2.031 ± 0.045
4.559IleArg: 4.559 ± 0.073
4.236IleSer: 4.236 ± 0.071
3.831IleThr: 3.831 ± 0.075
4.433IleVal: 4.433 ± 0.076
0.697IleTrp: 0.697 ± 0.027
2.79IleTyr: 2.79 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
4.551LysAla: 4.551 ± 0.075
0.44LysCys: 0.44 ± 0.023
3.569LysAsp: 3.569 ± 0.065
5.115LysGlu: 5.115 ± 0.085
1.957LysPhe: 1.957 ± 0.048
4.193LysGly: 4.193 ± 0.062
1.272LysHis: 1.272 ± 0.041
4.401LysIle: 4.401 ± 0.072
4.896LysLys: 4.896 ± 0.082
4.711LysLeu: 4.711 ± 0.073
2.038LysMet: 2.038 ± 0.048
3.558LysAsn: 3.558 ± 0.063
2.222LysPro: 2.222 ± 0.051
2.459LysGln: 2.459 ± 0.057
3.407LysArg: 3.407 ± 0.071
3.156LysSer: 3.156 ± 0.058
3.771LysThr: 3.771 ± 0.074
3.784LysVal: 3.784 ± 0.07
0.706LysTrp: 0.706 ± 0.026
2.613LysTyr: 2.613 ± 0.052
0.0LysXaa: 0.0 ± 0.0
Leu
5.942LeuAla: 5.942 ± 0.092
1.327LeuCys: 1.327 ± 0.043
4.353LeuAsp: 4.353 ± 0.069
4.782LeuGlu: 4.782 ± 0.084
4.7LeuPhe: 4.7 ± 0.082
5.202LeuGly: 5.202 ± 0.089
2.046LeuHis: 2.046 ± 0.051
5.813LeuIle: 5.813 ± 0.091
6.042LeuLys: 6.042 ± 0.074
9.091LeuLeu: 9.091 ± 0.156
2.508LeuMet: 2.508 ± 0.054
4.575LeuAsn: 4.575 ± 0.07
4.191LeuPro: 4.191 ± 0.072
3.208LeuGln: 3.208 ± 0.065
5.064LeuArg: 5.064 ± 0.077
7.137LeuSer: 7.137 ± 0.089
5.554LeuThr: 5.554 ± 0.082
4.533LeuVal: 4.533 ± 0.071
1.005LeuTrp: 1.005 ± 0.04
3.861LeuTyr: 3.861 ± 0.07
0.0LeuXaa: 0.0 ± 0.0
Met
2.057MetAla: 2.057 ± 0.049
0.277MetCys: 0.277 ± 0.021
1.348MetAsp: 1.348 ± 0.043
1.799MetGlu: 1.799 ± 0.044
0.992MetPhe: 0.992 ± 0.035
1.75MetGly: 1.75 ± 0.042
0.528MetHis: 0.528 ± 0.02
1.812MetIle: 1.812 ± 0.043
2.386MetLys: 2.386 ± 0.046
2.688MetLeu: 2.688 ± 0.065
0.848MetMet: 0.848 ± 0.034
1.56MetAsn: 1.56 ± 0.038
1.222MetPro: 1.222 ± 0.039
1.062MetGln: 1.062 ± 0.033
1.527MetArg: 1.527 ± 0.041
1.479MetSer: 1.479 ± 0.041
1.594MetThr: 1.594 ± 0.04
1.511MetVal: 1.511 ± 0.042
0.197MetTrp: 0.197 ± 0.014
0.88MetTyr: 0.88 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
3.27AsnAla: 3.27 ± 0.057
0.465AsnCys: 0.465 ± 0.026
2.553AsnAsp: 2.553 ± 0.05
2.994AsnGlu: 2.994 ± 0.054
1.876AsnPhe: 1.876 ± 0.046
3.642AsnGly: 3.642 ± 0.072
0.945AsnHis: 0.945 ± 0.033
3.424AsnIle: 3.424 ± 0.072
2.855AsnLys: 2.855 ± 0.062
3.985AsnLeu: 3.985 ± 0.07
1.237AsnMet: 1.237 ± 0.036
2.394AsnAsn: 2.394 ± 0.065
2.409AsnPro: 2.409 ± 0.059
1.313AsnGln: 1.313 ± 0.039
3.005AsnArg: 3.005 ± 0.061
2.441AsnSer: 2.441 ± 0.057
2.54AsnThr: 2.54 ± 0.058
2.851AsnVal: 2.851 ± 0.061
0.523AsnTrp: 0.523 ± 0.025
2.206AsnTyr: 2.206 ± 0.05
0.0AsnXaa: 0.0 ± 0.0
Pro
3.02ProAla: 3.02 ± 0.067
0.414ProCys: 0.414 ± 0.019
2.741ProAsp: 2.741 ± 0.055
3.263ProGlu: 3.263 ± 0.058
1.924ProPhe: 1.924 ± 0.046
2.433ProGly: 2.433 ± 0.059
0.827ProHis: 0.827 ± 0.028
2.22ProIle: 2.22 ± 0.053
1.993ProLys: 1.993 ± 0.045
3.414ProLeu: 3.414 ± 0.061
1.001ProMet: 1.001 ± 0.033
1.705ProAsn: 1.705 ± 0.043
1.15ProPro: 1.15 ± 0.036
1.439ProGln: 1.439 ± 0.038
1.535ProArg: 1.535 ± 0.042
2.518ProSer: 2.518 ± 0.059
1.926ProThr: 1.926 ± 0.059
3.176ProVal: 3.176 ± 0.053
0.465ProTrp: 0.465 ± 0.023
1.713ProTyr: 1.713 ± 0.046
0.0ProXaa: 0.0 ± 0.0
Gln
2.524GlnAla: 2.524 ± 0.061
0.31GlnCys: 0.31 ± 0.019
1.432GlnAsp: 1.432 ± 0.039
2.127GlnGlu: 2.127 ± 0.055
1.216GlnPhe: 1.216 ± 0.033
2.06GlnGly: 2.06 ± 0.053
0.67GlnHis: 0.67 ± 0.026
2.39GlnIle: 2.39 ± 0.05
2.345GlnLys: 2.345 ± 0.051
3.0GlnLeu: 3.0 ± 0.065
0.979GlnMet: 0.979 ± 0.033
1.537GlnAsn: 1.537 ± 0.043
1.383GlnPro: 1.383 ± 0.036
1.53GlnGln: 1.53 ± 0.051
1.839GlnArg: 1.839 ± 0.043
2.02GlnSer: 2.02 ± 0.05
2.277GlnThr: 2.277 ± 0.046
1.96GlnVal: 1.96 ± 0.049
0.543GlnTrp: 0.543 ± 0.025
1.338GlnTyr: 1.338 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
3.18ArgAla: 3.18 ± 0.066
0.563ArgCys: 0.563 ± 0.025
2.501ArgAsp: 2.501 ± 0.046
3.557ArgGlu: 3.557 ± 0.066
2.839ArgPhe: 2.839 ± 0.059
2.848ArgGly: 2.848 ± 0.056
1.198ArgHis: 1.198 ± 0.034
4.404ArgIle: 4.404 ± 0.068
4.009ArgLys: 4.009 ± 0.074
5.443ArgLeu: 5.443 ± 0.078
1.851ArgMet: 1.851 ± 0.045
2.865ArgAsn: 2.865 ± 0.065
1.997ArgPro: 1.997 ± 0.047
2.13ArgGln: 2.13 ± 0.046
3.378ArgArg: 3.378 ± 0.073
3.12ArgSer: 3.12 ± 0.064
3.191ArgThr: 3.191 ± 0.054
2.984ArgVal: 2.984 ± 0.056
0.802ArgTrp: 0.802 ± 0.03
2.768ArgTyr: 2.768 ± 0.054
0.0ArgXaa: 0.0 ± 0.0
Ser
4.16SerAla: 4.16 ± 0.076
0.751SerCys: 0.751 ± 0.025
3.435SerAsp: 3.435 ± 0.057
3.686SerGlu: 3.686 ± 0.065
3.464SerPhe: 3.464 ± 0.063
4.86SerGly: 4.86 ± 0.089
1.228SerHis: 1.228 ± 0.037
4.004SerIle: 4.004 ± 0.071
3.006SerLys: 3.006 ± 0.057
5.973SerLeu: 5.973 ± 0.09
1.535SerMet: 1.535 ± 0.038
2.459SerAsn: 2.459 ± 0.058
2.528SerPro: 2.528 ± 0.054
1.75SerGln: 1.75 ± 0.037
2.911SerArg: 2.911 ± 0.054
4.062SerSer: 4.062 ± 0.09
3.087SerThr: 3.087 ± 0.058
4.514SerVal: 4.514 ± 0.07
0.758SerTrp: 0.758 ± 0.028
2.901SerTyr: 2.901 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
4.296ThrAla: 4.296 ± 0.07
0.603ThrCys: 0.603 ± 0.026
3.673ThrAsp: 3.673 ± 0.051
3.451ThrGlu: 3.451 ± 0.058
2.899ThrPhe: 2.899 ± 0.069
4.436ThrGly: 4.436 ± 0.069
1.126ThrHis: 1.126 ± 0.035
3.703ThrIle: 3.703 ± 0.076
2.852ThrLys: 2.852 ± 0.056
5.522ThrLeu: 5.522 ± 0.088
1.362ThrMet: 1.362 ± 0.036
2.431ThrAsn: 2.431 ± 0.056
2.766ThrPro: 2.766 ± 0.06
1.793ThrGln: 1.793 ± 0.042
2.508ThrArg: 2.508 ± 0.045
3.331ThrSer: 3.331 ± 0.069
3.136ThrThr: 3.136 ± 0.065
4.593ThrVal: 4.593 ± 0.094
0.66ThrTrp: 0.66 ± 0.025
2.609ThrTyr: 2.609 ± 0.061
0.0ThrXaa: 0.0 ± 0.0
Val
4.473ValAla: 4.473 ± 0.072
0.98ValCys: 0.98 ± 0.036
3.403ValAsp: 3.403 ± 0.063
3.766ValGlu: 3.766 ± 0.07
3.047ValPhe: 3.047 ± 0.059
3.961ValGly: 3.961 ± 0.066
1.215ValHis: 1.215 ± 0.034
4.213ValIle: 4.213 ± 0.077
3.98ValLys: 3.98 ± 0.07
5.794ValLeu: 5.794 ± 0.084
1.69ValMet: 1.69 ± 0.047
2.883ValAsn: 2.883 ± 0.064
2.575ValPro: 2.575 ± 0.057
1.944ValGln: 1.944 ± 0.047
3.654ValArg: 3.654 ± 0.063
4.526ValSer: 4.526 ± 0.082
4.029ValThr: 4.029 ± 0.094
4.249ValVal: 4.249 ± 0.08
0.897ValTrp: 0.897 ± 0.034
2.656ValTyr: 2.656 ± 0.057
0.0ValXaa: 0.0 ± 0.0
Trp
0.73TrpAla: 0.73 ± 0.026
0.166TrpCys: 0.166 ± 0.012
0.673TrpAsp: 0.673 ± 0.027
0.812TrpGlu: 0.812 ± 0.027
0.513TrpPhe: 0.513 ± 0.027
0.895TrpGly: 0.895 ± 0.035
0.277TrpHis: 0.277 ± 0.016
0.891TrpIle: 0.891 ± 0.037
0.926TrpLys: 0.926 ± 0.034
1.191TrpLeu: 1.191 ± 0.038
0.457TrpMet: 0.457 ± 0.021
0.709TrpAsn: 0.709 ± 0.027
0.267TrpPro: 0.267 ± 0.019
0.491TrpGln: 0.491 ± 0.023
0.591TrpArg: 0.591 ± 0.025
0.702TrpSer: 0.702 ± 0.03
0.698TrpThr: 0.698 ± 0.039
0.694TrpVal: 0.694 ± 0.03
0.179TrpTrp: 0.179 ± 0.013
0.479TrpTyr: 0.479 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.094TyrAla: 3.094 ± 0.062
0.525TyrCys: 0.525 ± 0.024
2.61TyrAsp: 2.61 ± 0.068
2.493TyrGlu: 2.493 ± 0.051
2.211TyrPhe: 2.211 ± 0.051
3.083TyrGly: 3.083 ± 0.07
1.053TyrHis: 1.053 ± 0.032
2.671TyrIle: 2.671 ± 0.051
2.466TyrLys: 2.466 ± 0.052
4.061TyrLeu: 4.061 ± 0.075
1.075TyrMet: 1.075 ± 0.036
2.127TyrAsn: 2.127 ± 0.059
2.015TyrPro: 2.015 ± 0.051
1.382TyrGln: 1.382 ± 0.037
2.839TyrArg: 2.839 ± 0.059
2.488TyrSer: 2.488 ± 0.05
2.771TyrThr: 2.771 ± 0.067
2.558TyrVal: 2.558 ± 0.058
0.56TyrTrp: 0.56 ± 0.026
1.999TyrTyr: 1.999 ± 0.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2978 proteins (978559 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski