Amino acid dipepetide frequency for Bacteroides sp. CAG:754

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.473AlaAla: 5.473 ± 0.073
0.982AlaCys: 0.982 ± 0.025
4.103AlaAsp: 4.103 ± 0.05
4.495AlaGlu: 4.495 ± 0.068
3.043AlaPhe: 3.043 ± 0.041
5.191AlaGly: 5.191 ± 0.065
1.157AlaHis: 1.157 ± 0.026
4.428AlaIle: 4.428 ± 0.061
3.826AlaLys: 3.826 ± 0.058
6.24AlaLeu: 6.24 ± 0.071
1.831AlaMet: 1.831 ± 0.037
3.32AlaAsn: 3.32 ± 0.055
2.29AlaPro: 2.29 ± 0.038
2.489AlaGln: 2.489 ± 0.039
2.952AlaArg: 2.952 ± 0.046
4.327AlaSer: 4.327 ± 0.047
3.978AlaThr: 3.978 ± 0.05
4.461AlaVal: 4.461 ± 0.06
0.867AlaTrp: 0.867 ± 0.024
3.04AlaTyr: 3.04 ± 0.047
0.001AlaXaa: 0.001 ± 0.001
Cys
0.699CysAla: 0.699 ± 0.021
0.211CysCys: 0.211 ± 0.011
0.659CysAsp: 0.659 ± 0.023
0.67CysGlu: 0.67 ± 0.02
0.623CysPhe: 0.623 ± 0.021
0.921CysGly: 0.921 ± 0.023
0.263CysHis: 0.263 ± 0.015
0.889CysIle: 0.889 ± 0.023
0.682CysLys: 0.682 ± 0.02
1.05CysLeu: 1.05 ± 0.027
0.327CysMet: 0.327 ± 0.015
0.556CysAsn: 0.556 ± 0.018
0.466CysPro: 0.466 ± 0.015
0.354CysGln: 0.354 ± 0.015
0.605CysArg: 0.605 ± 0.019
0.816CysSer: 0.816 ± 0.021
0.692CysThr: 0.692 ± 0.024
0.704CysVal: 0.704 ± 0.021
0.15CysTrp: 0.15 ± 0.011
0.586CysTyr: 0.586 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
3.906AspAla: 3.906 ± 0.045
0.596AspCys: 0.596 ± 0.019
2.713AspAsp: 2.713 ± 0.05
3.901AspGlu: 3.901 ± 0.049
3.017AspPhe: 3.017 ± 0.048
4.538AspGly: 4.538 ± 0.072
0.709AspHis: 0.709 ± 0.02
4.122AspIle: 4.122 ± 0.051
4.158AspLys: 4.158 ± 0.054
4.538AspLeu: 4.538 ± 0.056
1.582AspMet: 1.582 ± 0.033
3.005AspAsn: 3.005 ± 0.052
1.915AspPro: 1.915 ± 0.037
1.187AspGln: 1.187 ± 0.028
2.301AspArg: 2.301 ± 0.037
3.099AspSer: 3.099 ± 0.05
2.718AspThr: 2.718 ± 0.042
3.472AspVal: 3.472 ± 0.047
0.987AspTrp: 0.987 ± 0.028
3.108AspTyr: 3.108 ± 0.049
0.001AspXaa: 0.001 ± 0.001
Glu
4.554GluAla: 4.554 ± 0.068
0.6GluCys: 0.6 ± 0.018
3.261GluAsp: 3.261 ± 0.054
5.039GluGlu: 5.039 ± 0.073
2.388GluPhe: 2.388 ± 0.044
4.316GluGly: 4.316 ± 0.051
1.173GluHis: 1.173 ± 0.029
4.498GluIle: 4.498 ± 0.052
5.151GluLys: 5.151 ± 0.056
6.033GluLeu: 6.033 ± 0.067
2.019GluMet: 2.019 ± 0.038
3.608GluAsn: 3.608 ± 0.051
1.862GluPro: 1.862 ± 0.036
2.613GluGln: 2.613 ± 0.039
3.1GluArg: 3.1 ± 0.052
3.285GluSer: 3.285 ± 0.045
3.37GluThr: 3.37 ± 0.048
4.234GluVal: 4.234 ± 0.054
0.891GluTrp: 0.891 ± 0.026
2.914GluTyr: 2.914 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
2.983PheAla: 2.983 ± 0.046
0.683PheCys: 0.683 ± 0.02
2.78PheAsp: 2.78 ± 0.038
2.492PheGlu: 2.492 ± 0.039
2.27PhePhe: 2.27 ± 0.045
3.122PheGly: 3.122 ± 0.05
0.87PheHis: 0.87 ± 0.028
3.2PheIle: 3.2 ± 0.054
2.429PheLys: 2.429 ± 0.043
4.003PheLeu: 4.003 ± 0.052
1.237PheMet: 1.237 ± 0.032
2.54PheAsn: 2.54 ± 0.035
1.737PhePro: 1.737 ± 0.03
1.348PheGln: 1.348 ± 0.029
2.188PheArg: 2.188 ± 0.04
3.534PheSer: 3.534 ± 0.052
2.896PheThr: 2.896 ± 0.047
2.788PheVal: 2.788 ± 0.038
0.62PheTrp: 0.62 ± 0.018
2.122PheTyr: 2.122 ± 0.033
0.001PheXaa: 0.001 ± 0.0
Gly
4.271GlyAla: 4.271 ± 0.06
0.876GlyCys: 0.876 ± 0.024
3.79GlyAsp: 3.79 ± 0.049
4.334GlyGlu: 4.334 ± 0.068
3.244GlyPhe: 3.244 ± 0.046
4.818GlyGly: 4.818 ± 0.063
1.184GlyHis: 1.184 ± 0.027
5.36GlyIle: 5.36 ± 0.07
5.408GlyLys: 5.408 ± 0.062
5.775GlyLeu: 5.775 ± 0.064
2.117GlyMet: 2.117 ± 0.039
3.915GlyAsn: 3.915 ± 0.061
1.203GlyPro: 1.203 ± 0.033
2.173GlyGln: 2.173 ± 0.042
2.792GlyArg: 2.792 ± 0.044
4.148GlySer: 4.148 ± 0.06
4.62GlyThr: 4.62 ± 0.068
4.824GlyVal: 4.824 ± 0.066
1.142GlyTrp: 1.142 ± 0.029
3.643GlyTyr: 3.643 ± 0.051
0.0GlyXaa: 0.0 ± 0.0
His
1.07HisAla: 1.07 ± 0.028
0.287HisCys: 0.287 ± 0.013
0.845HisAsp: 0.845 ± 0.02
0.974HisGlu: 0.974 ± 0.024
1.0HisPhe: 1.0 ± 0.027
1.125HisGly: 1.125 ± 0.028
0.434HisHis: 0.434 ± 0.019
1.365HisIle: 1.365 ± 0.032
0.987HisLys: 0.987 ± 0.022
1.664HisLeu: 1.664 ± 0.034
0.345HisMet: 0.345 ± 0.015
0.861HisAsn: 0.861 ± 0.024
1.033HisPro: 1.033 ± 0.026
0.627HisGln: 0.627 ± 0.022
0.836HisArg: 0.836 ± 0.024
1.088HisSer: 1.088 ± 0.022
1.062HisThr: 1.062 ± 0.023
0.937HisVal: 0.937 ± 0.027
0.272HisTrp: 0.272 ± 0.013
0.951HisTyr: 0.951 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
4.926IleAla: 4.926 ± 0.07
0.933IleCys: 0.933 ± 0.022
4.068IleAsp: 4.068 ± 0.049
4.281IleGlu: 4.281 ± 0.051
2.677IlePhe: 2.677 ± 0.044
4.662IleGly: 4.662 ± 0.054
1.314IleHis: 1.314 ± 0.034
4.394IleIle: 4.394 ± 0.068
3.894IleLys: 3.894 ± 0.051
5.878IleLeu: 5.878 ± 0.079
1.43IleMet: 1.43 ± 0.029
3.524IleAsn: 3.524 ± 0.049
3.17IlePro: 3.17 ± 0.043
2.194IleGln: 2.194 ± 0.038
3.389IleArg: 3.389 ± 0.047
4.729IleSer: 4.729 ± 0.063
4.221IleThr: 4.221 ± 0.056
4.133IleVal: 4.133 ± 0.055
0.765IleTrp: 0.765 ± 0.024
3.007IleTyr: 3.007 ± 0.045
0.0IleXaa: 0.0 ± 0.0
Lys
4.722LysAla: 4.722 ± 0.055
0.498LysCys: 0.498 ± 0.017
4.205LysAsp: 4.205 ± 0.05
5.771LysGlu: 5.771 ± 0.074
2.253LysPhe: 2.253 ± 0.042
4.675LysGly: 4.675 ± 0.054
1.157LysHis: 1.157 ± 0.026
4.06LysIle: 4.06 ± 0.052
4.993LysLys: 4.993 ± 0.068
5.328LysLeu: 5.328 ± 0.061
2.067LysMet: 2.067 ± 0.04
3.716LysAsn: 3.716 ± 0.048
2.06LysPro: 2.06 ± 0.037
2.495LysGln: 2.495 ± 0.04
2.983LysArg: 2.983 ± 0.046
3.411LysSer: 3.411 ± 0.049
3.468LysThr: 3.468 ± 0.045
4.39LysVal: 4.39 ± 0.053
0.842LysTrp: 0.842 ± 0.025
3.094LysTyr: 3.094 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
6.006LeuAla: 6.006 ± 0.061
1.229LeuCys: 1.229 ± 0.033
4.472LeuAsp: 4.472 ± 0.054
4.946LeuGlu: 4.946 ± 0.056
4.527LeuPhe: 4.527 ± 0.064
5.358LeuGly: 5.358 ± 0.06
1.623LeuHis: 1.623 ± 0.034
5.38LeuIle: 5.38 ± 0.073
6.402LeuLys: 6.402 ± 0.066
8.867LeuLeu: 8.867 ± 0.106
2.365LeuMet: 2.365 ± 0.04
4.71LeuAsn: 4.71 ± 0.061
4.101LeuPro: 4.101 ± 0.055
3.274LeuGln: 3.274 ± 0.055
4.134LeuArg: 4.134 ± 0.055
6.725LeuSer: 6.725 ± 0.071
5.341LeuThr: 5.341 ± 0.062
4.906LeuVal: 4.906 ± 0.065
1.14LeuTrp: 1.14 ± 0.029
3.746LeuTyr: 3.746 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
1.876MetAla: 1.876 ± 0.032
0.239MetCys: 0.239 ± 0.013
1.48MetAsp: 1.48 ± 0.032
1.773MetGlu: 1.773 ± 0.032
0.971MetPhe: 0.971 ± 0.025
1.828MetGly: 1.828 ± 0.036
0.466MetHis: 0.466 ± 0.019
1.575MetIle: 1.575 ± 0.031
2.706MetLys: 2.706 ± 0.038
2.315MetLeu: 2.315 ± 0.043
0.826MetMet: 0.826 ± 0.024
1.715MetAsn: 1.715 ± 0.036
1.115MetPro: 1.115 ± 0.025
1.03MetGln: 1.03 ± 0.025
1.28MetArg: 1.28 ± 0.029
1.559MetSer: 1.559 ± 0.03
1.466MetThr: 1.466 ± 0.03
1.58MetVal: 1.58 ± 0.031
0.309MetTrp: 0.309 ± 0.013
0.958MetTyr: 0.958 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
3.48AsnAla: 3.48 ± 0.049
0.55AsnCys: 0.55 ± 0.021
2.707AsnAsp: 2.707 ± 0.045
3.22AsnGlu: 3.22 ± 0.047
2.39AsnPhe: 2.39 ± 0.043
4.276AsnGly: 4.276 ± 0.062
0.963AsnHis: 0.963 ± 0.03
3.845AsnIle: 3.845 ± 0.052
3.389AsnLys: 3.389 ± 0.044
4.53AsnLeu: 4.53 ± 0.062
1.449AsnMet: 1.449 ± 0.028
2.94AsnAsn: 2.94 ± 0.052
2.599AsnPro: 2.599 ± 0.043
1.836AsnGln: 1.836 ± 0.037
2.578AsnArg: 2.578 ± 0.041
3.149AsnSer: 3.149 ± 0.05
2.933AsnThr: 2.933 ± 0.038
3.388AsnVal: 3.388 ± 0.051
0.782AsnTrp: 0.782 ± 0.022
2.631AsnTyr: 2.631 ± 0.048
0.0AsnXaa: 0.0 ± 0.0
Pro
2.842ProAla: 2.842 ± 0.044
0.362ProCys: 0.362 ± 0.015
2.616ProAsp: 2.616 ± 0.04
3.361ProGlu: 3.361 ± 0.05
1.819ProPhe: 1.819 ± 0.033
2.516ProGly: 2.516 ± 0.043
0.636ProHis: 0.636 ± 0.021
2.192ProIle: 2.192 ± 0.042
2.009ProLys: 2.009 ± 0.037
3.16ProLeu: 3.16 ± 0.049
0.909ProMet: 0.909 ± 0.024
1.729ProAsn: 1.729 ± 0.03
0.744ProPro: 0.744 ± 0.019
1.427ProGln: 1.427 ± 0.034
1.302ProArg: 1.302 ± 0.033
2.332ProSer: 2.332 ± 0.04
1.991ProThr: 1.991 ± 0.035
3.159ProVal: 3.159 ± 0.049
0.498ProTrp: 0.498 ± 0.019
1.813ProTyr: 1.813 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
2.345GlnAla: 2.345 ± 0.039
0.312GlnCys: 0.312 ± 0.014
1.63GlnAsp: 1.63 ± 0.034
2.386GlnGlu: 2.386 ± 0.044
1.322GlnPhe: 1.322 ± 0.031
2.157GlnGly: 2.157 ± 0.041
0.633GlnHis: 0.633 ± 0.02
2.335GlnIle: 2.335 ± 0.041
2.397GlnLys: 2.397 ± 0.042
3.276GlnLeu: 3.276 ± 0.042
1.015GlnMet: 1.015 ± 0.023
1.841GlnAsn: 1.841 ± 0.035
1.326GlnPro: 1.326 ± 0.033
1.555GlnGln: 1.555 ± 0.036
1.569GlnArg: 1.569 ± 0.034
2.045GlnSer: 2.045 ± 0.038
2.046GlnThr: 2.046 ± 0.037
2.166GlnVal: 2.166 ± 0.041
0.525GlnTrp: 0.525 ± 0.018
1.531GlnTyr: 1.531 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
2.648ArgAla: 2.648 ± 0.041
0.426ArgCys: 0.426 ± 0.017
2.111ArgAsp: 2.111 ± 0.036
2.867ArgGlu: 2.867 ± 0.052
2.354ArgPhe: 2.354 ± 0.039
2.465ArgGly: 2.465 ± 0.041
0.866ArgHis: 0.866 ± 0.026
3.537ArgIle: 3.537 ± 0.053
3.207ArgLys: 3.207 ± 0.045
4.306ArgLeu: 4.306 ± 0.056
1.487ArgMet: 1.487 ± 0.034
2.554ArgAsn: 2.554 ± 0.041
1.602ArgPro: 1.602 ± 0.034
1.716ArgGln: 1.716 ± 0.038
2.08ArgArg: 2.08 ± 0.043
2.336ArgSer: 2.336 ± 0.042
2.57ArgThr: 2.57 ± 0.041
2.553ArgVal: 2.553 ± 0.041
0.732ArgTrp: 0.732 ± 0.021
2.346ArgTyr: 2.346 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
4.197SerAla: 4.197 ± 0.055
0.879SerCys: 0.879 ± 0.026
3.447SerAsp: 3.447 ± 0.05
3.544SerGlu: 3.544 ± 0.047
3.421SerPhe: 3.421 ± 0.046
4.78SerGly: 4.78 ± 0.06
1.144SerHis: 1.144 ± 0.029
4.401SerIle: 4.401 ± 0.051
3.428SerLys: 3.428 ± 0.046
6.027SerLeu: 6.027 ± 0.068
1.547SerMet: 1.547 ± 0.028
3.036SerAsn: 3.036 ± 0.046
2.43SerPro: 2.43 ± 0.042
1.942SerGln: 1.942 ± 0.035
2.593SerArg: 2.593 ± 0.043
4.23SerSer: 4.23 ± 0.063
3.494SerThr: 3.494 ± 0.042
4.296SerVal: 4.296 ± 0.06
0.936SerTrp: 0.936 ± 0.023
3.099SerTyr: 3.099 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.061ThrAla: 4.061 ± 0.052
0.604ThrCys: 0.604 ± 0.02
3.56ThrAsp: 3.56 ± 0.055
3.476ThrGlu: 3.476 ± 0.05
2.869ThrPhe: 2.869 ± 0.05
4.628ThrGly: 4.628 ± 0.053
0.973ThrHis: 0.973 ± 0.024
3.864ThrIle: 3.864 ± 0.053
2.986ThrLys: 2.986 ± 0.043
5.439ThrLeu: 5.439 ± 0.06
1.205ThrMet: 1.205 ± 0.026
2.777ThrAsn: 2.777 ± 0.047
2.895ThrPro: 2.895 ± 0.04
1.831ThrGln: 1.831 ± 0.037
2.223ThrArg: 2.223 ± 0.037
3.586ThrSer: 3.586 ± 0.052
3.311ThrThr: 3.311 ± 0.052
4.14ThrVal: 4.14 ± 0.056
0.762ThrTrp: 0.762 ± 0.024
2.697ThrTyr: 2.697 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
4.399ValAla: 4.399 ± 0.06
0.889ValCys: 0.889 ± 0.027
3.663ValAsp: 3.663 ± 0.052
3.941ValGlu: 3.941 ± 0.058
2.823ValPhe: 2.823 ± 0.041
3.942ValGly: 3.942 ± 0.052
1.03ValHis: 1.03 ± 0.022
4.321ValIle: 4.321 ± 0.06
4.221ValLys: 4.221 ± 0.053
5.463ValLeu: 5.463 ± 0.064
1.631ValMet: 1.631 ± 0.03
3.488ValAsn: 3.488 ± 0.046
2.485ValPro: 2.485 ± 0.041
1.949ValGln: 1.949 ± 0.034
2.936ValArg: 2.936 ± 0.054
4.617ValSer: 4.617 ± 0.05
4.006ValThr: 4.006 ± 0.054
4.307ValVal: 4.307 ± 0.061
0.796ValTrp: 0.796 ± 0.025
2.807ValTyr: 2.807 ± 0.041
0.001ValXaa: 0.001 ± 0.001
Trp
0.875TrpAla: 0.875 ± 0.026
0.197TrpCys: 0.197 ± 0.01
0.757TrpAsp: 0.757 ± 0.02
0.921TrpGlu: 0.921 ± 0.026
0.609TrpPhe: 0.609 ± 0.019
1.169TrpGly: 1.169 ± 0.031
0.26TrpHis: 0.26 ± 0.014
0.894TrpIle: 0.894 ± 0.024
1.052TrpLys: 1.052 ± 0.028
1.165TrpLeu: 1.165 ± 0.031
0.482TrpMet: 0.482 ± 0.016
0.913TrpAsn: 0.913 ± 0.024
0.31TrpPro: 0.31 ± 0.015
0.564TrpGln: 0.564 ± 0.019
0.59TrpArg: 0.59 ± 0.021
0.805TrpSer: 0.805 ± 0.023
0.773TrpThr: 0.773 ± 0.022
0.756TrpVal: 0.756 ± 0.024
0.23TrpTrp: 0.23 ± 0.013
0.6TrpTyr: 0.6 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.161TyrAla: 3.161 ± 0.05
0.57TyrCys: 0.57 ± 0.021
2.706TyrAsp: 2.706 ± 0.047
2.519TyrGlu: 2.519 ± 0.042
2.226TyrPhe: 2.226 ± 0.032
3.153TyrGly: 3.153 ± 0.049
0.909TyrHis: 0.909 ± 0.024
2.93TyrIle: 2.93 ± 0.042
2.864TyrLys: 2.864 ± 0.042
4.195TyrLeu: 4.195 ± 0.05
1.195TyrMet: 1.195 ± 0.028
2.884TyrAsn: 2.884 ± 0.053
2.072TyrPro: 2.072 ± 0.042
1.807TyrGln: 1.807 ± 0.035
2.329TyrArg: 2.329 ± 0.05
3.048TyrSer: 3.048 ± 0.048
2.931TyrThr: 2.931 ± 0.046
2.52TyrVal: 2.52 ± 0.04
0.674TyrTrp: 0.674 ± 0.019
2.436TyrTyr: 2.436 ± 0.047
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.001XaaHis: 0.001 ± 0.001
0.001XaaIle: 0.001 ± 0.001
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.005XaaXaa: 0.005 ± 0.003
Statistics based on 4054 proteins (1597693 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski