Amino acid dipeptide frequency for Bifidobacterium subtile

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
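The uniform baseline described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `classify` is hypothetical, and the comparison against a flat 1/400 baseline follows the text above rather than any confirmed detail of how the table colours are actually assigned.

```python
# Uniform baseline: 20 x 20 = 400 equally likely dipeptides.
N_STANDARD = 20
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform baseline (hypothetical helper)."""
    if freq_per_mille > expected:
        return "over"       # would be marked red
    if freq_per_mille < expected:
        return "under"      # would be marked blue
    return "expected"


# Three values copied from the table below (per mille).
examples = {"AlaAla": 15.441, "CysCys": 0.128, "LeuLeu": 8.707}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With these inputs, AlaAla and LeuLeu fall above the 2.5‰ baseline and CysCys falls below it.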

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
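As a rough sketch of how such per mille values could be obtained, the Python snippet below counts overlapping residue pairs in a list of protein sequences. It assumes one-letter sequence strings, overlapping windows, and pairs that never span two proteins; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the database's own procedure may differ.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences (illustrative)."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["AAGA", "GA"]
freqs = dipeptide_per_mille(toy)

# Converting a per mille value back to a fraction: multiply by 10^-3.
fraction_aa = freqs["AA"] * 1e-3
```

In the toy input there are four dipeptides in total (AA, AG, GA, GA), so GA accounts for half of them.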

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.441 ± 0.223
AlaCys: 1.17 ± 0.037
AlaAsp: 7.327 ± 0.121
AlaGlu: 5.831 ± 0.106
AlaPhe: 3.744 ± 0.064
AlaGly: 9.729 ± 0.155
AlaHis: 2.53 ± 0.059
AlaIle: 5.969 ± 0.094
AlaLys: 4.459 ± 0.088
AlaLeu: 11.556 ± 0.14
AlaMet: 3.296 ± 0.076
AlaAsn: 3.465 ± 0.075
AlaPro: 4.293 ± 0.087
AlaGln: 5.419 ± 0.092
AlaArg: 6.629 ± 0.102
AlaSer: 7.67 ± 0.112
AlaThr: 5.925 ± 0.089
AlaVal: 8.851 ± 0.134
AlaTrp: 1.485 ± 0.051
AlaTyr: 2.692 ± 0.06
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.093 ± 0.044
CysCys: 0.128 ± 0.016
CysAsp: 0.518 ± 0.026
CysGlu: 0.514 ± 0.026
CysPhe: 0.311 ± 0.019
CysGly: 0.963 ± 0.038
CysHis: 0.205 ± 0.017
CysIle: 0.413 ± 0.02
CysLys: 0.21 ± 0.018
CysLeu: 0.751 ± 0.033
CysMet: 0.19 ± 0.015
CysAsn: 0.206 ± 0.015
CysPro: 0.468 ± 0.027
CysGln: 0.229 ± 0.022
CysArg: 0.525 ± 0.026
CysSer: 0.595 ± 0.031
CysThr: 0.47 ± 0.025
CysVal: 0.731 ± 0.031
CysTrp: 0.111 ± 0.011
CysTyr: 0.229 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.989 ± 0.125
AspCys: 0.496 ± 0.027
AspAsp: 4.298 ± 0.086
AspGlu: 4.592 ± 0.092
AspPhe: 2.182 ± 0.053
AspGly: 5.508 ± 0.114
AspHis: 1.272 ± 0.041
AspIle: 3.36 ± 0.069
AspLys: 2.064 ± 0.052
AspLeu: 4.893 ± 0.085
AspMet: 1.615 ± 0.048
AspAsn: 1.64 ± 0.05
AspPro: 3.355 ± 0.083
AspGln: 1.808 ± 0.048
AspArg: 3.322 ± 0.079
AspSer: 3.929 ± 0.087
AspThr: 3.113 ± 0.071
AspVal: 4.608 ± 0.086
AspTrp: 0.926 ± 0.035
AspTyr: 1.772 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.941 ± 0.104
GluCys: 0.431 ± 0.024
GluAsp: 2.877 ± 0.069
GluGlu: 2.959 ± 0.091
GluPhe: 1.759 ± 0.044
GluGly: 3.743 ± 0.076
GluHis: 1.783 ± 0.05
GluIle: 2.663 ± 0.068
GluLys: 1.614 ± 0.055
GluLeu: 5.297 ± 0.106
GluMet: 1.329 ± 0.037
GluAsn: 1.762 ± 0.05
GluPro: 2.641 ± 0.065
GluGln: 2.925 ± 0.079
GluArg: 4.495 ± 0.094
GluSer: 3.62 ± 0.079
GluThr: 2.993 ± 0.069
GluVal: 3.394 ± 0.072
GluTrp: 0.566 ± 0.031
GluTyr: 1.5 ± 0.047
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.234 ± 0.08
PheCys: 0.316 ± 0.02
PheAsp: 2.522 ± 0.054
PheGlu: 1.808 ± 0.056
PhePhe: 1.144 ± 0.046
PheGly: 3.21 ± 0.078
PheHis: 0.676 ± 0.03
PheIle: 1.72 ± 0.053
PheLys: 0.957 ± 0.042
PheLeu: 2.845 ± 0.071
PheMet: 0.745 ± 0.034
PheAsn: 1.176 ± 0.044
PhePro: 1.358 ± 0.045
PheGln: 0.912 ± 0.038
PheArg: 1.59 ± 0.045
PheSer: 2.369 ± 0.06
PheThr: 2.014 ± 0.054
PheVal: 2.574 ± 0.057
PheTrp: 0.456 ± 0.025
PheTyr: 0.797 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.167 ± 0.115
GlyCys: 0.732 ± 0.035
GlyAsp: 4.669 ± 0.085
GlyGlu: 4.675 ± 0.086
GlyPhe: 3.24 ± 0.066
GlyGly: 6.195 ± 0.102
GlyHis: 1.79 ± 0.047
GlyIle: 4.915 ± 0.084
GlyLys: 3.612 ± 0.076
GlyLeu: 7.558 ± 0.119
GlyMet: 2.307 ± 0.052
GlyAsn: 2.733 ± 0.072
GlyPro: 2.732 ± 0.061
GlyGln: 2.815 ± 0.069
GlyArg: 4.969 ± 0.09
GlySer: 6.012 ± 0.098
GlyThr: 4.792 ± 0.097
GlyVal: 6.418 ± 0.086
GlyTrp: 1.152 ± 0.041
GlyTyr: 2.489 ± 0.062
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.73 ± 0.051
HisCys: 0.207 ± 0.015
HisAsp: 1.842 ± 0.054
HisGlu: 1.407 ± 0.047
HisPhe: 0.653 ± 0.027
HisGly: 2.119 ± 0.056
HisHis: 0.626 ± 0.032
HisIle: 1.215 ± 0.043
HisLys: 0.525 ± 0.027
HisLeu: 1.644 ± 0.046
HisMet: 0.608 ± 0.032
HisAsn: 0.632 ± 0.032
HisPro: 1.288 ± 0.047
HisGln: 0.693 ± 0.031
HisArg: 1.571 ± 0.048
HisSer: 1.233 ± 0.043
HisThr: 1.185 ± 0.042
HisVal: 1.801 ± 0.054
HisTrp: 0.294 ± 0.018
HisTyr: 0.68 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.453 ± 0.112
IleCys: 0.539 ± 0.027
IleAsp: 3.802 ± 0.074
IleGlu: 3.214 ± 0.075
IlePhe: 1.525 ± 0.05
IleGly: 4.829 ± 0.085
IleHis: 1.075 ± 0.042
IleIle: 2.837 ± 0.066
IleLys: 1.459 ± 0.051
IleLeu: 3.923 ± 0.077
IleMet: 1.185 ± 0.041
IleAsn: 1.579 ± 0.05
IlePro: 2.658 ± 0.06
IleGln: 1.245 ± 0.04
IleArg: 2.964 ± 0.061
IleSer: 3.323 ± 0.076
IleThr: 2.955 ± 0.066
IleVal: 4.828 ± 0.093
IleTrp: 0.596 ± 0.029
IleTyr: 1.13 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.259 ± 0.084
LysCys: 0.115 ± 0.012
LysAsp: 2.115 ± 0.064
LysGlu: 1.824 ± 0.054
LysPhe: 0.785 ± 0.034
LysGly: 2.552 ± 0.065
LysHis: 0.733 ± 0.027
LysIle: 1.621 ± 0.056
LysLys: 1.397 ± 0.056
LysLeu: 2.917 ± 0.069
LysMet: 0.732 ± 0.03
LysAsn: 1.148 ± 0.045
LysPro: 2.026 ± 0.054
LysGln: 1.426 ± 0.049
LysArg: 2.228 ± 0.057
LysSer: 2.161 ± 0.056
LysThr: 2.35 ± 0.058
LysVal: 2.353 ± 0.064
LysTrp: 0.342 ± 0.022
LysTyr: 0.842 ± 0.037
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.037 ± 0.149
LeuCys: 0.95 ± 0.038
LeuAsp: 6.198 ± 0.103
LeuGlu: 4.272 ± 0.087
LeuPhe: 3.095 ± 0.071
LeuGly: 7.585 ± 0.123
LeuHis: 2.048 ± 0.056
LeuIle: 4.973 ± 0.089
LeuLys: 3.013 ± 0.065
LeuLeu: 8.707 ± 0.136
LeuMet: 2.158 ± 0.057
LeuAsn: 2.805 ± 0.058
LeuPro: 4.513 ± 0.078
LeuGln: 2.846 ± 0.072
LeuArg: 5.986 ± 0.087
LeuSer: 6.353 ± 0.091
LeuThr: 5.107 ± 0.085
LeuVal: 6.634 ± 0.113
LeuTrp: 1.082 ± 0.038
LeuTyr: 1.948 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.838 ± 0.064
MetCys: 0.2 ± 0.019
MetAsp: 1.233 ± 0.042
MetGlu: 0.965 ± 0.035
MetPhe: 0.824 ± 0.035
MetGly: 1.757 ± 0.05
MetHis: 0.622 ± 0.03
MetIle: 1.273 ± 0.039
MetLys: 0.86 ± 0.033
MetLeu: 2.596 ± 0.066
MetMet: 0.6 ± 0.031
MetAsn: 0.996 ± 0.038
MetPro: 1.522 ± 0.046
MetGln: 0.937 ± 0.033
MetArg: 1.841 ± 0.06
MetSer: 1.876 ± 0.047
MetThr: 1.851 ± 0.046
MetVal: 1.694 ± 0.049
MetTrp: 0.247 ± 0.016
MetTyr: 0.455 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.086 ± 0.073
AsnCys: 0.203 ± 0.017
AsnAsp: 2.016 ± 0.057
AsnGlu: 1.657 ± 0.049
AsnPhe: 0.852 ± 0.034
AsnGly: 3.055 ± 0.063
AsnHis: 0.641 ± 0.03
AsnIle: 1.543 ± 0.054
AsnLys: 0.978 ± 0.038
AsnLeu: 2.489 ± 0.058
AsnMet: 0.737 ± 0.033
AsnAsn: 0.964 ± 0.04
AsnPro: 2.044 ± 0.054
AsnGln: 1.06 ± 0.04
AsnArg: 1.816 ± 0.046
AsnSer: 1.671 ± 0.046
AsnThr: 1.726 ± 0.05
AsnVal: 2.207 ± 0.071
AsnTrp: 0.39 ± 0.025
AsnTyr: 0.793 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.177 ± 0.094
ProCys: 0.347 ± 0.021
ProAsp: 3.228 ± 0.068
ProGlu: 3.228 ± 0.069
ProPhe: 1.548 ± 0.049
ProGly: 3.745 ± 0.07
ProHis: 1.08 ± 0.037
ProIle: 2.258 ± 0.061
ProLys: 1.718 ± 0.045
ProLeu: 3.968 ± 0.071
ProMet: 1.058 ± 0.039
ProAsn: 1.384 ± 0.041
ProPro: 1.362 ± 0.052
ProGln: 2.165 ± 0.061
ProArg: 2.411 ± 0.059
ProSer: 3.179 ± 0.069
ProThr: 2.661 ± 0.058
ProVal: 3.756 ± 0.072
ProTrp: 0.64 ± 0.03
ProTyr: 1.345 ± 0.04
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.449 ± 0.088
GlnCys: 0.308 ± 0.019
GlnAsp: 1.846 ± 0.049
GlnGlu: 1.884 ± 0.051
GlnPhe: 1.099 ± 0.04
GlnGly: 2.893 ± 0.06
GlnHis: 0.968 ± 0.037
GlnIle: 1.924 ± 0.045
GlnLys: 0.908 ± 0.035
GlnLeu: 3.74 ± 0.072
GlnMet: 0.93 ± 0.036
GlnAsn: 1.023 ± 0.044
GlnPro: 1.897 ± 0.057
GlnGln: 1.849 ± 0.064
GlnArg: 2.954 ± 0.068
GlnSer: 2.539 ± 0.062
GlnThr: 2.141 ± 0.054
GlnVal: 2.565 ± 0.053
GlnTrp: 0.627 ± 0.025
GlnTyr: 1.078 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.723 ± 0.097
ArgCys: 0.543 ± 0.027
ArgAsp: 3.739 ± 0.075
ArgGlu: 3.883 ± 0.076
ArgPhe: 2.377 ± 0.058
ArgGly: 4.009 ± 0.084
ArgHis: 1.783 ± 0.051
ArgIle: 3.68 ± 0.069
ArgLys: 2.452 ± 0.053
ArgLeu: 5.982 ± 0.092
ArgMet: 1.773 ± 0.051
ArgAsn: 2.082 ± 0.056
ArgPro: 2.68 ± 0.06
ArgGln: 2.627 ± 0.056
ArgArg: 5.331 ± 0.105
ArgSer: 4.011 ± 0.087
ArgThr: 3.515 ± 0.07
ArgVal: 4.206 ± 0.086
ArgTrp: 0.868 ± 0.036
ArgTyr: 1.978 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.698 ± 0.118
SerCys: 0.534 ± 0.029
SerAsp: 4.179 ± 0.076
SerGlu: 3.171 ± 0.07
SerPhe: 2.167 ± 0.061
SerGly: 6.438 ± 0.1
SerHis: 1.53 ± 0.043
SerIle: 3.504 ± 0.073
SerLys: 2.294 ± 0.061
SerLeu: 5.683 ± 0.087
SerMet: 1.691 ± 0.044
SerAsn: 2.058 ± 0.052
SerPro: 2.824 ± 0.06
SerGln: 2.742 ± 0.066
SerArg: 4.017 ± 0.084
SerSer: 4.719 ± 0.098
SerThr: 3.749 ± 0.086
SerVal: 4.876 ± 0.097
SerTrp: 0.9 ± 0.033
SerTyr: 1.714 ± 0.048
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.503 ± 0.099
ThrCys: 0.447 ± 0.021
ThrAsp: 3.103 ± 0.067
ThrGlu: 2.321 ± 0.058
ThrPhe: 1.851 ± 0.057
ThrGly: 5.019 ± 0.098
ThrHis: 1.236 ± 0.044
ThrIle: 3.29 ± 0.068
ThrLys: 1.865 ± 0.061
ThrLeu: 5.573 ± 0.086
ThrMet: 1.456 ± 0.046
ThrAsn: 1.685 ± 0.053
ThrPro: 3.18 ± 0.073
ThrGln: 2.078 ± 0.055
ThrArg: 3.256 ± 0.076
ThrSer: 3.452 ± 0.061
ThrThr: 3.419 ± 0.071
ThrVal: 4.839 ± 0.087
ThrTrp: 0.693 ± 0.03
ThrTyr: 1.349 ± 0.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.49 ± 0.123
ValCys: 0.784 ± 0.035
ValAsp: 4.753 ± 0.084
ValGlu: 4.06 ± 0.087
ValPhe: 2.831 ± 0.062
ValGly: 5.389 ± 0.096
ValHis: 1.521 ± 0.046
ValIle: 4.13 ± 0.083
ValLys: 2.378 ± 0.061
ValLeu: 7.315 ± 0.111
ValMet: 1.893 ± 0.051
ValAsn: 2.404 ± 0.066
ValPro: 3.77 ± 0.083
ValGln: 2.312 ± 0.063
ValArg: 4.624 ± 0.077
ValSer: 5.34 ± 0.091
ValThr: 4.533 ± 0.075
ValVal: 6.155 ± 0.114
ValTrp: 0.894 ± 0.037
ValTyr: 1.621 ± 0.049
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.163 ± 0.045
TrpCys: 0.154 ± 0.016
TrpAsp: 0.692 ± 0.032
TrpGlu: 0.526 ± 0.027
TrpPhe: 0.485 ± 0.028
TrpGly: 0.882 ± 0.033
TrpHis: 0.341 ± 0.019
TrpIle: 0.681 ± 0.03
TrpLys: 0.477 ± 0.028
TrpLeu: 1.498 ± 0.052
TrpMet: 0.371 ± 0.022
TrpAsn: 0.527 ± 0.031
TrpPro: 0.482 ± 0.023
TrpGln: 0.605 ± 0.027
TrpArg: 0.979 ± 0.036
TrpSer: 0.861 ± 0.036
TrpThr: 0.715 ± 0.031
TrpVal: 0.827 ± 0.03
TrpTrp: 0.277 ± 0.019
TrpTyr: 0.364 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.009 ± 0.066
TyrCys: 0.257 ± 0.02
TyrAsp: 1.786 ± 0.055
TyrGlu: 1.5 ± 0.051
TyrPhe: 0.953 ± 0.036
TyrGly: 2.353 ± 0.063
TyrHis: 0.494 ± 0.029
TyrIle: 1.146 ± 0.04
TyrLys: 0.687 ± 0.03
TyrLeu: 2.376 ± 0.06
TyrMet: 0.512 ± 0.026
TyrAsn: 0.657 ± 0.031
TyrPro: 1.163 ± 0.04
TyrGln: 0.926 ± 0.035
TyrArg: 1.736 ± 0.054
TyrSer: 1.577 ± 0.045
TyrThr: 1.402 ± 0.047
TyrVal: 1.881 ± 0.052
TyrTrp: 0.36 ± 0.023
TyrTyr: 0.694 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2259 proteins (771925 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
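A minimal sketch of a protein-level bootstrap is given below, assuming that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies. Both of these are assumptions made for illustration; the function names, the seed, and the toy sequences are hypothetical and are not taken from the Proteome-pI pipeline.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    count = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["AAGA", "GA", "AGAG", "GGAA"]
error_ga = bootstrap_error(toy, "GA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.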

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski