Amino acid dipeptide frequency for Bacteroides helcogenes (strain ATCC 35417 / DSM 20613 / JCM 6297 / P 36-108)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
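As a rough illustration of what such a calculation could look like, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. It assumes one-letter amino acid codes and overlapping windows that never span two proteins; the function name `dipeptide_per_mille` is hypothetical, and the actual Proteome-pI pipeline may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so the last residue of one protein is never paired with the first
    residue of the next.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "AAC" gives AA and AC, "CA" gives CA (three pairs in total).
freqs = dipeptide_per_mille(["AAC", "CA"])
```

In this toy input each of the three observed pairs accounts for one third of all dipeptides.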

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented dipeptides are marked in blue in the table.
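The arithmetic behind these numbers can be sketched as follows. The comparison against a flat 2.5‰ baseline is an assumption drawn from the paragraph above; the page may apply a different threshold when colouring cells, and the helper `classify` is hypothetical.

```python
N_STANDARD = 20  # standard amino acids

n_dipeptides = N_STANDARD ** 2        # 400 ordered pairs
n_with_xaa = (N_STANDARD + 1) ** 2    # 441 when Xaa is included
expected_per_mille = 1000.0 / n_dipeptides  # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table on this page.
leu_leu = classify(9.434)  # LeuLeu
cys_cys = classify(0.256)  # CysCys
```

Under this simple rule LeuLeu comes out as overrepresented and CysCys as underrepresented.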

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
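For readers who prefer fractions or percentages, the conversion is a simple scaling. The helper names below are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(6.124)  # AlaAla from the table
ala_ala_percent = per_mille_to_percent(6.124)
```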

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.124 ± 0.096
AlaCys: 1.158 ± 0.031
AlaAsp: 4.548 ± 0.068
AlaGlu: 4.765 ± 0.085
AlaPhe: 3.324 ± 0.052
AlaGly: 5.438 ± 0.078
AlaHis: 1.234 ± 0.033
AlaIle: 4.603 ± 0.065
AlaLys: 4.109 ± 0.07
AlaLeu: 6.977 ± 0.081
AlaMet: 1.975 ± 0.045
AlaAsn: 3.236 ± 0.065
AlaPro: 2.278 ± 0.044
AlaGln: 2.648 ± 0.055
AlaArg: 3.312 ± 0.061
AlaSer: 4.499 ± 0.066
AlaThr: 4.041 ± 0.064
AlaVal: 4.995 ± 0.064
AlaTrp: 0.792 ± 0.027
AlaTyr: 3.089 ± 0.05
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.821 ± 0.029
CysCys: 0.256 ± 0.018
CysAsp: 0.643 ± 0.026
CysGlu: 0.662 ± 0.023
CysPhe: 0.682 ± 0.028
CysGly: 1.139 ± 0.038
CysHis: 0.304 ± 0.016
CysIle: 1.018 ± 0.031
CysLys: 0.74 ± 0.026
CysLeu: 1.257 ± 0.041
CysMet: 0.382 ± 0.018
CysAsn: 0.609 ± 0.022
CysPro: 0.539 ± 0.026
CysGln: 0.352 ± 0.016
CysArg: 0.736 ± 0.025
CysSer: 0.862 ± 0.03
CysThr: 0.78 ± 0.024
CysVal: 0.758 ± 0.028
CysTrp: 0.155 ± 0.011
CysTyr: 0.58 ± 0.022
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.1 ± 0.069
AspCys: 0.653 ± 0.028
AspAsp: 2.493 ± 0.055
AspGlu: 3.844 ± 0.059
AspPhe: 2.948 ± 0.056
AspGly: 4.215 ± 0.084
AspHis: 0.768 ± 0.026
AspIle: 4.101 ± 0.058
AspLys: 3.874 ± 0.063
AspLeu: 4.559 ± 0.066
AspMet: 1.634 ± 0.041
AspAsn: 2.746 ± 0.047
AspPro: 1.612 ± 0.037
AspGln: 1.026 ± 0.035
AspArg: 2.542 ± 0.052
AspSer: 3.123 ± 0.057
AspThr: 2.84 ± 0.046
AspVal: 3.552 ± 0.059
AspTrp: 0.869 ± 0.028
AspTyr: 2.967 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.006 ± 0.085
GluCys: 0.676 ± 0.025
GluAsp: 3.249 ± 0.057
GluGlu: 4.855 ± 0.096
GluPhe: 2.397 ± 0.049
GluGly: 4.064 ± 0.067
GluHis: 1.286 ± 0.038
GluIle: 4.416 ± 0.073
GluLys: 5.033 ± 0.077
GluLeu: 5.981 ± 0.082
GluMet: 2.059 ± 0.039
GluAsn: 3.337 ± 0.055
GluPro: 1.618 ± 0.034
GluGln: 2.531 ± 0.055
GluArg: 3.295 ± 0.059
GluSer: 2.992 ± 0.051
GluThr: 3.289 ± 0.051
GluVal: 4.245 ± 0.069
GluTrp: 0.778 ± 0.028
GluTyr: 2.738 ± 0.044
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.074 ± 0.053
PheCys: 0.746 ± 0.028
PheAsp: 2.645 ± 0.046
PheGlu: 2.349 ± 0.042
PhePhe: 2.335 ± 0.051
PheGly: 3.229 ± 0.057
PheHis: 0.884 ± 0.027
PheIle: 3.193 ± 0.064
PheLys: 2.314 ± 0.043
PheLeu: 4.17 ± 0.061
PheMet: 1.295 ± 0.037
PheAsn: 2.252 ± 0.05
PhePro: 1.726 ± 0.037
PheGln: 1.189 ± 0.037
PheArg: 2.296 ± 0.05
PheSer: 3.545 ± 0.058
PheThr: 2.818 ± 0.054
PheVal: 2.835 ± 0.056
PheTrp: 0.572 ± 0.022
PheTyr: 2.023 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.416 ± 0.084
GlyCys: 0.971 ± 0.033
GlyAsp: 3.596 ± 0.064
GlyGlu: 4.172 ± 0.07
GlyPhe: 3.187 ± 0.051
GlyGly: 5.038 ± 0.081
GlyHis: 1.258 ± 0.04
GlyIle: 5.631 ± 0.078
GlyLys: 5.443 ± 0.072
GlyLeu: 5.774 ± 0.081
GlyMet: 2.341 ± 0.053
GlyAsn: 3.745 ± 0.06
GlyPro: 1.161 ± 0.03
GlyGln: 2.053 ± 0.04
GlyArg: 3.032 ± 0.053
GlySer: 4.12 ± 0.07
GlyThr: 4.355 ± 0.066
GlyVal: 4.859 ± 0.062
GlyTrp: 1.024 ± 0.033
GlyTyr: 3.343 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.267 ± 0.036
HisCys: 0.325 ± 0.016
HisAsp: 0.919 ± 0.03
HisGlu: 1.011 ± 0.031
HisPhe: 1.04 ± 0.033
HisGly: 1.262 ± 0.034
HisHis: 0.502 ± 0.022
HisIle: 1.46 ± 0.034
HisLys: 1.011 ± 0.027
HisLeu: 1.892 ± 0.045
HisMet: 0.332 ± 0.017
HisAsn: 0.915 ± 0.028
HisPro: 1.116 ± 0.031
HisGln: 0.572 ± 0.024
HisArg: 0.985 ± 0.026
HisSer: 1.196 ± 0.032
HisThr: 1.119 ± 0.031
HisVal: 1.048 ± 0.026
HisTrp: 0.272 ± 0.016
HisTyr: 0.953 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.37 ± 0.077
IleCys: 1.033 ± 0.026
IleAsp: 4.053 ± 0.058
IleGlu: 4.198 ± 0.068
IlePhe: 2.798 ± 0.055
IleGly: 4.802 ± 0.07
IleHis: 1.387 ± 0.037
IleIle: 4.498 ± 0.071
IleLys: 3.922 ± 0.063
IleLeu: 6.064 ± 0.091
IleMet: 1.51 ± 0.038
IleAsn: 3.316 ± 0.061
IlePro: 3.145 ± 0.053
IleGln: 2.012 ± 0.044
IleArg: 3.526 ± 0.062
IleSer: 4.657 ± 0.074
IleThr: 4.007 ± 0.071
IleVal: 4.318 ± 0.072
IleTrp: 0.68 ± 0.028
IleTyr: 2.807 ± 0.052
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.871 ± 0.073
LysCys: 0.572 ± 0.024
LysAsp: 3.912 ± 0.059
LysGlu: 5.581 ± 0.081
LysPhe: 2.125 ± 0.039
LysGly: 4.404 ± 0.058
LysHis: 1.262 ± 0.028
LysIle: 3.99 ± 0.06
LysLys: 4.907 ± 0.074
LysLeu: 5.302 ± 0.072
LysMet: 2.092 ± 0.048
LysAsn: 3.474 ± 0.055
LysPro: 2.104 ± 0.043
LysGln: 2.487 ± 0.051
LysArg: 3.139 ± 0.057
LysSer: 3.304 ± 0.063
LysThr: 3.486 ± 0.064
LysVal: 4.106 ± 0.056
LysTrp: 0.772 ± 0.029
LysTyr: 2.946 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.428 ± 0.09
LeuCys: 1.492 ± 0.038
LeuAsp: 4.491 ± 0.068
LeuGlu: 4.966 ± 0.071
LeuPhe: 4.525 ± 0.082
LeuGly: 5.597 ± 0.087
LeuHis: 1.935 ± 0.04
LeuIle: 5.537 ± 0.077
LeuLys: 6.378 ± 0.085
LeuLeu: 9.434 ± 0.121
LeuMet: 2.641 ± 0.05
LeuAsn: 4.849 ± 0.07
LeuPro: 4.171 ± 0.063
LeuGln: 3.451 ± 0.066
LeuArg: 4.382 ± 0.063
LeuSer: 6.786 ± 0.095
LeuThr: 5.406 ± 0.076
LeuVal: 5.022 ± 0.079
LeuTrp: 1.105 ± 0.032
LeuTyr: 3.763 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.077 ± 0.05
MetCys: 0.266 ± 0.016
MetAsp: 1.521 ± 0.034
MetGlu: 1.944 ± 0.044
MetPhe: 0.967 ± 0.031
MetGly: 1.93 ± 0.044
MetHis: 0.523 ± 0.022
MetIle: 1.675 ± 0.035
MetLys: 2.632 ± 0.05
MetLeu: 2.607 ± 0.052
MetMet: 0.844 ± 0.03
MetAsn: 1.658 ± 0.036
MetPro: 1.264 ± 0.038
MetGln: 1.141 ± 0.029
MetArg: 1.392 ± 0.032
MetSer: 1.545 ± 0.038
MetThr: 1.524 ± 0.035
MetVal: 1.558 ± 0.038
MetTrp: 0.229 ± 0.016
MetTyr: 0.875 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.638 ± 0.059
AsnCys: 0.551 ± 0.023
AsnAsp: 2.594 ± 0.052
AsnGlu: 2.938 ± 0.053
AsnPhe: 2.236 ± 0.042
AsnGly: 4.029 ± 0.078
AsnHis: 0.935 ± 0.025
AsnIle: 3.786 ± 0.062
AsnLys: 3.03 ± 0.053
AsnLeu: 4.407 ± 0.072
AsnMet: 1.353 ± 0.032
AsnAsn: 2.57 ± 0.056
AsnPro: 2.447 ± 0.048
AsnGln: 1.485 ± 0.034
AsnArg: 2.554 ± 0.045
AsnSer: 2.939 ± 0.055
AsnThr: 2.729 ± 0.054
AsnVal: 3.199 ± 0.055
AsnTrp: 0.652 ± 0.024
AsnTyr: 2.406 ± 0.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.894 ± 0.052
ProCys: 0.413 ± 0.02
ProAsp: 2.524 ± 0.042
ProGlu: 3.224 ± 0.051
ProPhe: 1.857 ± 0.04
ProGly: 2.458 ± 0.052
ProHis: 0.752 ± 0.028
ProIle: 2.171 ± 0.052
ProLys: 1.85 ± 0.042
ProLeu: 3.339 ± 0.056
ProMet: 0.99 ± 0.027
ProAsn: 1.598 ± 0.042
ProPro: 0.816 ± 0.027
ProGln: 1.421 ± 0.039
ProArg: 1.32 ± 0.033
ProSer: 2.131 ± 0.05
ProThr: 1.917 ± 0.042
ProVal: 2.95 ± 0.059
ProTrp: 0.428 ± 0.02
ProTyr: 1.718 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.553 ± 0.048
GlnCys: 0.282 ± 0.014
GlnAsp: 1.569 ± 0.039
GlnGlu: 2.321 ± 0.051
GlnPhe: 1.243 ± 0.028
GlnGly: 2.051 ± 0.044
GlnHis: 0.651 ± 0.025
GlnIle: 2.291 ± 0.047
GlnLys: 2.32 ± 0.045
GlnLeu: 3.087 ± 0.055
GlnMet: 1.058 ± 0.031
GlnAsn: 1.741 ± 0.04
GlnPro: 1.215 ± 0.034
GlnGln: 1.486 ± 0.041
GlnArg: 1.671 ± 0.039
GlnSer: 1.929 ± 0.037
GlnThr: 1.983 ± 0.048
GlnVal: 2.054 ± 0.039
GlnTrp: 0.441 ± 0.023
GlnTyr: 1.425 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.948 ± 0.049
ArgCys: 0.539 ± 0.023
ArgAsp: 2.149 ± 0.049
ArgGlu: 3.056 ± 0.059
ArgPhe: 2.311 ± 0.041
ArgGly: 2.589 ± 0.047
ArgHis: 1.057 ± 0.028
ArgIle: 3.731 ± 0.065
ArgLys: 3.516 ± 0.058
ArgLeu: 4.767 ± 0.074
ArgMet: 1.661 ± 0.046
ArgAsn: 2.532 ± 0.045
ArgPro: 1.722 ± 0.038
ArgGln: 2.001 ± 0.041
ArgArg: 2.546 ± 0.044
ArgSer: 2.435 ± 0.051
ArgThr: 2.579 ± 0.051
ArgVal: 2.651 ± 0.05
ArgTrp: 0.654 ± 0.024
ArgTyr: 2.3 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.529 ± 0.073
SerCys: 0.903 ± 0.033
SerAsp: 3.359 ± 0.055
SerGlu: 3.405 ± 0.055
SerPhe: 3.255 ± 0.052
SerGly: 4.612 ± 0.072
SerHis: 1.203 ± 0.034
SerIle: 4.4 ± 0.076
SerLys: 3.242 ± 0.059
SerLeu: 6.128 ± 0.08
SerMet: 1.546 ± 0.036
SerAsn: 2.76 ± 0.053
SerPro: 2.305 ± 0.048
SerGln: 1.858 ± 0.04
SerArg: 2.67 ± 0.047
SerSer: 4.006 ± 0.066
SerThr: 3.331 ± 0.058
SerVal: 4.422 ± 0.07
SerTrp: 0.758 ± 0.025
SerTyr: 2.806 ± 0.055
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.544 ± 0.074
ThrCys: 0.616 ± 0.025
ThrAsp: 3.631 ± 0.063
ThrGlu: 3.352 ± 0.056
ThrPhe: 2.723 ± 0.05
ThrGly: 4.583 ± 0.06
ThrHis: 1.001 ± 0.026
ThrIle: 3.639 ± 0.06
ThrLys: 2.733 ± 0.047
ThrLeu: 5.566 ± 0.076
ThrMet: 1.152 ± 0.031
ThrAsn: 2.459 ± 0.049
ThrPro: 2.864 ± 0.05
ThrGln: 1.677 ± 0.038
ThrArg: 2.266 ± 0.045
ThrSer: 3.411 ± 0.054
ThrThr: 3.138 ± 0.056
ThrVal: 4.175 ± 0.065
ThrTrp: 0.621 ± 0.022
ThrTyr: 2.425 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.676 ± 0.077
ValCys: 1.04 ± 0.034
ValAsp: 3.379 ± 0.054
ValGlu: 3.976 ± 0.062
ValPhe: 2.862 ± 0.055
ValGly: 4.087 ± 0.069
ValHis: 1.091 ± 0.031
ValIle: 4.268 ± 0.07
ValLys: 4.16 ± 0.069
ValLeu: 5.8 ± 0.073
ValMet: 1.739 ± 0.041
ValAsn: 3.352 ± 0.052
ValPro: 2.486 ± 0.055
ValGln: 1.971 ± 0.043
ValArg: 3.116 ± 0.051
ValSer: 4.546 ± 0.069
ValThr: 3.85 ± 0.074
ValVal: 4.351 ± 0.073
ValTrp: 0.724 ± 0.025
ValTyr: 2.736 ± 0.051
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.791 ± 0.026
TrpCys: 0.188 ± 0.014
TrpAsp: 0.652 ± 0.025
TrpGlu: 0.806 ± 0.025
TrpPhe: 0.528 ± 0.02
TrpGly: 0.917 ± 0.029
TrpHis: 0.274 ± 0.015
TrpIle: 0.809 ± 0.026
TrpLys: 0.952 ± 0.03
TrpLeu: 1.15 ± 0.032
TrpMet: 0.438 ± 0.021
TrpAsn: 0.795 ± 0.023
TrpPro: 0.259 ± 0.016
TrpGln: 0.501 ± 0.02
TrpArg: 0.561 ± 0.023
TrpSer: 0.66 ± 0.028
TrpThr: 0.668 ± 0.026
TrpVal: 0.647 ± 0.028
TrpTrp: 0.193 ± 0.015
TrpTyr: 0.509 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.017 ± 0.049
TyrCys: 0.596 ± 0.025
TyrAsp: 2.537 ± 0.051
TyrGlu: 2.419 ± 0.046
TyrPhe: 2.144 ± 0.042
TyrGly: 3.077 ± 0.055
TyrHis: 0.893 ± 0.026
TyrIle: 2.884 ± 0.053
TyrLys: 2.671 ± 0.054
TyrLeu: 4.099 ± 0.06
TyrMet: 1.128 ± 0.032
TyrAsn: 2.536 ± 0.056
TyrPro: 1.924 ± 0.045
TyrGln: 1.51 ± 0.037
TyrArg: 2.401 ± 0.045
TyrSer: 2.813 ± 0.056
TyrThr: 2.717 ± 0.057
TyrVal: 2.464 ± 0.055
TyrTrp: 0.577 ± 0.024
TyrTyr: 2.236 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3240 proteins (1182685 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
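One way such a protein-level bootstrap could be carried out is sketched below: whole proteins are resampled with replacement, the frequency of a given dipeptide is recomputed for each resample, and the standard deviation across resamples is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation are assumptions for illustration, not a description of the Proteome-pI code.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the dipeptide frequency over resampled proteomes.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein rather than the single residue.
    """
    rng = random.Random(seed)  # fixed seed (assumed) for reproducibility
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome with one-letter codes, for illustration only.
toy = ["AAAA", "ACAC", "CCCA", "AACC"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins keeps residues from the same sequence together, which is what "at the protein level" suggests.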

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski