Amino acid dipepetide frequency for Bacteroides plebeius CAG:211

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.966AlaAla: 5.966 ± 0.105
1.158AlaCys: 1.158 ± 0.037
4.318AlaAsp: 4.318 ± 0.068
5.012AlaGlu: 5.012 ± 0.09
3.175AlaPhe: 3.175 ± 0.06
5.371AlaGly: 5.371 ± 0.089
1.363AlaHis: 1.363 ± 0.036
4.233AlaIle: 4.233 ± 0.075
3.906AlaLys: 3.906 ± 0.077
6.907AlaLeu: 6.907 ± 0.105
1.943AlaMet: 1.943 ± 0.046
2.938AlaAsn: 2.938 ± 0.058
2.325AlaPro: 2.325 ± 0.055
2.989AlaGln: 2.989 ± 0.068
3.386AlaArg: 3.386 ± 0.069
4.498AlaSer: 4.498 ± 0.084
3.741AlaThr: 3.741 ± 0.073
4.981AlaVal: 4.981 ± 0.083
0.862AlaTrp: 0.862 ± 0.033
3.037AlaTyr: 3.037 ± 0.053
0.001AlaXaa: 0.001 ± 0.001
Cys
0.883CysAla: 0.883 ± 0.032
0.26CysCys: 0.26 ± 0.018
0.637CysAsp: 0.637 ± 0.025
0.789CysGlu: 0.789 ± 0.028
0.691CysPhe: 0.691 ± 0.029
1.162CysGly: 1.162 ± 0.036
0.34CysHis: 0.34 ± 0.017
0.952CysIle: 0.952 ± 0.038
0.733CysLys: 0.733 ± 0.027
1.268CysLeu: 1.268 ± 0.039
0.413CysMet: 0.413 ± 0.019
0.553CysAsn: 0.553 ± 0.024
0.588CysPro: 0.588 ± 0.027
0.41CysGln: 0.41 ± 0.022
0.694CysArg: 0.694 ± 0.029
0.743CysSer: 0.743 ± 0.03
0.76CysThr: 0.76 ± 0.03
0.818CysVal: 0.818 ± 0.032
0.19CysTrp: 0.19 ± 0.015
0.542CysTyr: 0.542 ± 0.027
0.0CysXaa: 0.0 ± 0.0
Asp
3.906AspAla: 3.906 ± 0.066
0.639AspCys: 0.639 ± 0.025
2.365AspAsp: 2.365 ± 0.052
3.964AspGlu: 3.964 ± 0.075
3.03AspPhe: 3.03 ± 0.053
4.152AspGly: 4.152 ± 0.073
0.793AspHis: 0.793 ± 0.03
3.761AspIle: 3.761 ± 0.07
3.674AspLys: 3.674 ± 0.067
4.674AspLeu: 4.674 ± 0.079
1.775AspMet: 1.775 ± 0.044
2.533AspAsn: 2.533 ± 0.056
1.758AspPro: 1.758 ± 0.042
1.257AspGln: 1.257 ± 0.041
2.364AspArg: 2.364 ± 0.057
2.902AspSer: 2.902 ± 0.063
2.721AspThr: 2.721 ± 0.058
3.49AspVal: 3.49 ± 0.061
0.868AspTrp: 0.868 ± 0.035
2.821AspTyr: 2.821 ± 0.052
0.0AspXaa: 0.0 ± 0.0
Glu
5.316GluAla: 5.316 ± 0.088
0.693GluCys: 0.693 ± 0.032
3.406GluAsp: 3.406 ± 0.068
5.626GluGlu: 5.626 ± 0.104
2.494GluPhe: 2.494 ± 0.045
4.811GluGly: 4.811 ± 0.081
1.277GluHis: 1.277 ± 0.032
4.391GluIle: 4.391 ± 0.075
5.587GluLys: 5.587 ± 0.084
6.294GluLeu: 6.294 ± 0.095
2.192GluMet: 2.192 ± 0.05
3.607GluAsn: 3.607 ± 0.07
1.777GluPro: 1.777 ± 0.043
2.626GluGln: 2.626 ± 0.055
3.218GluArg: 3.218 ± 0.062
3.125GluSer: 3.125 ± 0.067
3.529GluThr: 3.529 ± 0.058
4.802GluVal: 4.802 ± 0.07
0.896GluTrp: 0.896 ± 0.033
2.727GluTyr: 2.727 ± 0.062
0.0GluXaa: 0.0 ± 0.0
Phe
2.929PheAla: 2.929 ± 0.062
0.725PheCys: 0.725 ± 0.026
2.678PheAsp: 2.678 ± 0.054
2.469PheGlu: 2.469 ± 0.049
2.253PhePhe: 2.253 ± 0.058
3.247PheGly: 3.247 ± 0.061
1.015PheHis: 1.015 ± 0.034
2.973PheIle: 2.973 ± 0.066
2.242PheLys: 2.242 ± 0.056
4.148PheLeu: 4.148 ± 0.081
1.308PheMet: 1.308 ± 0.036
2.222PheAsn: 2.222 ± 0.048
1.764PhePro: 1.764 ± 0.04
1.566PheGln: 1.566 ± 0.042
2.265PheArg: 2.265 ± 0.053
3.404PheSer: 3.404 ± 0.06
2.748PheThr: 2.748 ± 0.058
2.876PheVal: 2.876 ± 0.061
0.586PheTrp: 0.586 ± 0.026
1.99PheTyr: 1.99 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
4.564GlyAla: 4.564 ± 0.079
1.009GlyCys: 1.009 ± 0.036
3.447GlyAsp: 3.447 ± 0.069
4.463GlyGlu: 4.463 ± 0.076
3.369GlyPhe: 3.369 ± 0.069
4.878GlyGly: 4.878 ± 0.084
1.359GlyHis: 1.359 ± 0.039
5.242GlyIle: 5.242 ± 0.08
5.536GlyLys: 5.536 ± 0.079
6.127GlyLeu: 6.127 ± 0.1
2.35GlyMet: 2.35 ± 0.052
3.562GlyAsn: 3.562 ± 0.072
1.43GlyPro: 1.43 ± 0.046
2.273GlyGln: 2.273 ± 0.046
2.975GlyArg: 2.975 ± 0.067
3.889GlySer: 3.889 ± 0.072
4.231GlyThr: 4.231 ± 0.075
5.087GlyVal: 5.087 ± 0.082
1.12GlyTrp: 1.12 ± 0.038
3.394GlyTyr: 3.394 ± 0.064
0.001GlyXaa: 0.001 ± 0.001
His
1.287HisAla: 1.287 ± 0.035
0.32HisCys: 0.32 ± 0.02
0.973HisAsp: 0.973 ± 0.03
1.18HisGlu: 1.18 ± 0.036
1.052HisPhe: 1.052 ± 0.035
1.32HisGly: 1.32 ± 0.04
0.599HisHis: 0.599 ± 0.028
1.533HisIle: 1.533 ± 0.04
1.023HisLys: 1.023 ± 0.034
1.901HisLeu: 1.901 ± 0.054
0.366HisMet: 0.366 ± 0.019
0.956HisAsn: 0.956 ± 0.036
1.166HisPro: 1.166 ± 0.041
0.72HisGln: 0.72 ± 0.029
0.944HisArg: 0.944 ± 0.034
1.096HisSer: 1.096 ± 0.037
1.272HisThr: 1.272 ± 0.039
1.231HisVal: 1.231 ± 0.041
0.266HisTrp: 0.266 ± 0.017
0.944HisTyr: 0.944 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
4.706IleAla: 4.706 ± 0.076
1.004IleCys: 1.004 ± 0.035
3.842IleAsp: 3.842 ± 0.079
4.124IleGlu: 4.124 ± 0.077
2.573IlePhe: 2.573 ± 0.063
4.489IleGly: 4.489 ± 0.088
1.427IleHis: 1.427 ± 0.039
3.754IleIle: 3.754 ± 0.082
3.418IleLys: 3.418 ± 0.068
5.667IleLeu: 5.667 ± 0.096
1.409IleMet: 1.409 ± 0.045
2.889IleAsn: 2.889 ± 0.052
3.002IlePro: 3.002 ± 0.054
2.447IleGln: 2.447 ± 0.056
3.339IleArg: 3.339 ± 0.067
4.224IleSer: 4.224 ± 0.067
3.591IleThr: 3.591 ± 0.068
4.161IleVal: 4.161 ± 0.087
0.692IleTrp: 0.692 ± 0.031
2.603IleTyr: 2.603 ± 0.051
0.0IleXaa: 0.0 ± 0.0
Lys
4.856LysAla: 4.856 ± 0.075
0.565LysCys: 0.565 ± 0.025
3.763LysAsp: 3.763 ± 0.065
5.994LysGlu: 5.994 ± 0.099
2.118LysPhe: 2.118 ± 0.052
4.706LysGly: 4.706 ± 0.07
1.174LysHis: 1.174 ± 0.037
3.643LysIle: 3.643 ± 0.064
4.881LysLys: 4.881 ± 0.074
5.325LysLeu: 5.325 ± 0.064
1.996LysMet: 1.996 ± 0.047
3.272LysAsn: 3.272 ± 0.057
2.044LysPro: 2.044 ± 0.052
2.502LysGln: 2.502 ± 0.055
2.932LysArg: 2.932 ± 0.054
3.177LysSer: 3.177 ± 0.063
3.272LysThr: 3.272 ± 0.057
4.537LysVal: 4.537 ± 0.072
0.735LysTrp: 0.735 ± 0.024
2.716LysTyr: 2.716 ± 0.053
0.0LysXaa: 0.0 ± 0.0
Leu
6.705LeuAla: 6.705 ± 0.101
1.502LeuCys: 1.502 ± 0.047
4.616LeuAsp: 4.616 ± 0.073
5.42LeuGlu: 5.42 ± 0.08
4.434LeuPhe: 4.434 ± 0.092
5.85LeuGly: 5.85 ± 0.09
1.987LeuHis: 1.987 ± 0.052
5.259LeuIle: 5.259 ± 0.084
6.525LeuLys: 6.525 ± 0.083
9.015LeuLeu: 9.015 ± 0.161
2.568LeuMet: 2.568 ± 0.061
4.513LeuAsn: 4.513 ± 0.072
4.128LeuPro: 4.128 ± 0.067
3.712LeuGln: 3.712 ± 0.068
4.337LeuArg: 4.337 ± 0.081
6.635LeuSer: 6.635 ± 0.091
5.486LeuThr: 5.486 ± 0.084
5.379LeuVal: 5.379 ± 0.094
1.1LeuTrp: 1.1 ± 0.037
3.727LeuTyr: 3.727 ± 0.074
0.002LeuXaa: 0.002 ± 0.002
Met
2.258MetAla: 2.258 ± 0.047
0.301MetCys: 0.301 ± 0.016
1.551MetAsp: 1.551 ± 0.034
2.034MetGlu: 2.034 ± 0.043
1.076MetPhe: 1.076 ± 0.03
1.992MetGly: 1.992 ± 0.055
0.515MetHis: 0.515 ± 0.024
1.589MetIle: 1.589 ± 0.043
2.609MetLys: 2.609 ± 0.051
2.613MetLeu: 2.613 ± 0.054
0.839MetMet: 0.839 ± 0.031
1.617MetAsn: 1.617 ± 0.038
1.192MetPro: 1.192 ± 0.035
1.137MetGln: 1.137 ± 0.034
1.326MetArg: 1.326 ± 0.037
1.477MetSer: 1.477 ± 0.038
1.527MetThr: 1.527 ± 0.041
1.648MetVal: 1.648 ± 0.046
0.276MetTrp: 0.276 ± 0.017
0.996MetTyr: 0.996 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
3.326AsnAla: 3.326 ± 0.057
0.517AsnCys: 0.517 ± 0.024
2.315AsnAsp: 2.315 ± 0.051
2.913AsnGlu: 2.913 ± 0.062
2.129AsnPhe: 2.129 ± 0.057
3.753AsnGly: 3.753 ± 0.078
0.993AsnHis: 0.993 ± 0.035
3.12AsnIle: 3.12 ± 0.068
2.725AsnLys: 2.725 ± 0.05
4.423AsnLeu: 4.423 ± 0.079
1.332AsnMet: 1.332 ± 0.039
2.32AsnAsn: 2.32 ± 0.071
2.549AsnPro: 2.549 ± 0.052
1.692AsnGln: 1.692 ± 0.051
2.42AsnArg: 2.42 ± 0.052
2.468AsnSer: 2.468 ± 0.049
2.548AsnThr: 2.548 ± 0.056
3.196AsnVal: 3.196 ± 0.063
0.689AsnTrp: 0.689 ± 0.032
2.304AsnTyr: 2.304 ± 0.05
0.0AsnXaa: 0.0 ± 0.0
Pro
2.775ProAla: 2.775 ± 0.057
0.411ProCys: 0.411 ± 0.02
2.569ProAsp: 2.569 ± 0.056
3.741ProGlu: 3.741 ± 0.065
1.846ProPhe: 1.846 ± 0.046
2.489ProGly: 2.489 ± 0.048
0.828ProHis: 0.828 ± 0.03
2.096ProIle: 2.096 ± 0.052
1.875ProLys: 1.875 ± 0.046
3.344ProLeu: 3.344 ± 0.052
1.016ProMet: 1.016 ± 0.036
1.553ProAsn: 1.553 ± 0.044
0.737ProPro: 0.737 ± 0.029
1.581ProGln: 1.581 ± 0.047
1.25ProArg: 1.25 ± 0.041
2.359ProSer: 2.359 ± 0.055
1.833ProThr: 1.833 ± 0.047
3.146ProVal: 3.146 ± 0.064
0.478ProTrp: 0.478 ± 0.026
1.759ProTyr: 1.759 ± 0.041
0.001ProXaa: 0.001 ± 0.001
Gln
2.894GlnAla: 2.894 ± 0.073
0.33GlnCys: 0.33 ± 0.02
1.612GlnAsp: 1.612 ± 0.042
2.618GlnGlu: 2.618 ± 0.058
1.427GlnPhe: 1.427 ± 0.036
2.308GlnGly: 2.308 ± 0.051
0.722GlnHis: 0.722 ± 0.029
2.348GlnIle: 2.348 ± 0.05
2.614GlnLys: 2.614 ± 0.058
3.66GlnLeu: 3.66 ± 0.067
1.179GlnMet: 1.179 ± 0.036
1.849GlnAsn: 1.849 ± 0.048
1.442GlnPro: 1.442 ± 0.037
1.72GlnGln: 1.72 ± 0.051
1.729GlnArg: 1.729 ± 0.042
1.902GlnSer: 1.902 ± 0.042
2.151GlnThr: 2.151 ± 0.048
2.493GlnVal: 2.493 ± 0.051
0.486GlnTrp: 0.486 ± 0.023
1.451GlnTyr: 1.451 ± 0.041
0.0GlnXaa: 0.0 ± 0.0
Arg
2.801ArgAla: 2.801 ± 0.085
0.532ArgCys: 0.532 ± 0.02
2.055ArgAsp: 2.055 ± 0.051
3.121ArgGlu: 3.121 ± 0.062
2.357ArgPhe: 2.357 ± 0.049
2.466ArgGly: 2.466 ± 0.052
0.978ArgHis: 0.978 ± 0.032
3.577ArgIle: 3.577 ± 0.066
3.536ArgLys: 3.536 ± 0.069
4.634ArgLeu: 4.634 ± 0.078
1.696ArgMet: 1.696 ± 0.042
2.387ArgAsn: 2.387 ± 0.054
1.672ArgPro: 1.672 ± 0.046
1.953ArgGln: 1.953 ± 0.05
2.384ArgArg: 2.384 ± 0.056
2.376ArgSer: 2.376 ± 0.047
2.584ArgThr: 2.584 ± 0.048
2.817ArgVal: 2.817 ± 0.059
0.646ArgTrp: 0.646 ± 0.029
2.204ArgTyr: 2.204 ± 0.05
0.0ArgXaa: 0.0 ± 0.0
Ser
4.044SerAla: 4.044 ± 0.067
0.884SerCys: 0.884 ± 0.028
3.167SerAsp: 3.167 ± 0.055
3.486SerGlu: 3.486 ± 0.061
3.155SerPhe: 3.155 ± 0.063
4.611SerGly: 4.611 ± 0.073
1.23SerHis: 1.23 ± 0.038
3.811SerIle: 3.811 ± 0.07
3.076SerLys: 3.076 ± 0.058
6.199SerLeu: 6.199 ± 0.103
1.544SerMet: 1.544 ± 0.039
2.397SerAsn: 2.397 ± 0.057
2.319SerPro: 2.319 ± 0.048
2.054SerGln: 2.054 ± 0.045
2.794SerArg: 2.794 ± 0.051
3.84SerSer: 3.84 ± 0.088
2.984SerThr: 2.984 ± 0.058
4.302SerVal: 4.302 ± 0.073
0.804SerTrp: 0.804 ± 0.031
2.795SerTyr: 2.795 ± 0.062
0.0SerXaa: 0.0 ± 0.0
Thr
3.954ThrAla: 3.954 ± 0.064
0.653ThrCys: 0.653 ± 0.031
3.401ThrAsp: 3.401 ± 0.059
3.622ThrGlu: 3.622 ± 0.064
2.673ThrPhe: 2.673 ± 0.058
4.36ThrGly: 4.36 ± 0.066
1.144ThrHis: 1.144 ± 0.037
3.333ThrIle: 3.333 ± 0.063
2.431ThrLys: 2.431 ± 0.061
5.709ThrLeu: 5.709 ± 0.085
1.236ThrMet: 1.236 ± 0.042
2.251ThrAsn: 2.251 ± 0.06
2.877ThrPro: 2.877 ± 0.053
1.863ThrGln: 1.863 ± 0.043
2.334ThrArg: 2.334 ± 0.054
3.327ThrSer: 3.327 ± 0.061
2.938ThrThr: 2.938 ± 0.068
4.006ThrVal: 4.006 ± 0.07
0.669ThrTrp: 0.669 ± 0.027
2.473ThrTyr: 2.473 ± 0.06
0.0ThrXaa: 0.0 ± 0.0
Val
4.919ValAla: 4.919 ± 0.076
1.156ValCys: 1.156 ± 0.041
3.62ValAsp: 3.62 ± 0.069
4.436ValGlu: 4.436 ± 0.074
2.876ValPhe: 2.876 ± 0.049
4.236ValGly: 4.236 ± 0.076
1.191ValHis: 1.191 ± 0.037
4.389ValIle: 4.389 ± 0.083
4.332ValLys: 4.332 ± 0.069
5.898ValLeu: 5.898 ± 0.092
1.809ValMet: 1.809 ± 0.052
3.277ValAsn: 3.277 ± 0.07
2.69ValPro: 2.69 ± 0.05
2.104ValGln: 2.104 ± 0.048
3.214ValArg: 3.214 ± 0.061
4.757ValSer: 4.757 ± 0.073
3.91ValThr: 3.91 ± 0.069
4.678ValVal: 4.678 ± 0.074
0.829ValTrp: 0.829 ± 0.03
2.756ValTyr: 2.756 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.821TrpAla: 0.821 ± 0.027
0.18TrpCys: 0.18 ± 0.016
0.688TrpAsp: 0.688 ± 0.03
0.784TrpGlu: 0.784 ± 0.032
0.59TrpPhe: 0.59 ± 0.025
1.057TrpGly: 1.057 ± 0.036
0.288TrpHis: 0.288 ± 0.018
0.822TrpIle: 0.822 ± 0.029
1.001TrpLys: 1.001 ± 0.036
1.157TrpLeu: 1.157 ± 0.041
0.477TrpMet: 0.477 ± 0.024
0.774TrpAsn: 0.774 ± 0.032
0.29TrpPro: 0.29 ± 0.018
0.527TrpGln: 0.527 ± 0.024
0.54TrpArg: 0.54 ± 0.023
0.72TrpSer: 0.72 ± 0.029
0.721TrpThr: 0.721 ± 0.026
0.777TrpVal: 0.777 ± 0.032
0.208TrpTrp: 0.208 ± 0.016
0.542TrpTyr: 0.542 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.199TyrAla: 3.199 ± 0.062
0.586TyrCys: 0.586 ± 0.027
2.422TyrAsp: 2.422 ± 0.056
2.6TyrGlu: 2.6 ± 0.055
2.088TyrPhe: 2.088 ± 0.049
3.047TyrGly: 3.047 ± 0.061
0.927TyrHis: 0.927 ± 0.032
2.537TyrIle: 2.537 ± 0.052
2.481TyrLys: 2.481 ± 0.05
3.938TyrLeu: 3.938 ± 0.072
1.123TyrMet: 1.123 ± 0.032
2.264TyrAsn: 2.264 ± 0.057
1.934TyrPro: 1.934 ± 0.055
1.802TyrGln: 1.802 ± 0.049
2.33TyrArg: 2.33 ± 0.054
2.58TyrSer: 2.58 ± 0.063
2.708TyrThr: 2.708 ± 0.068
2.653TyrVal: 2.653 ± 0.058
0.565TyrTrp: 0.565 ± 0.026
2.005TyrTyr: 2.005 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.001XaaPhe: 0.001 ± 0.001
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.001XaaMet: 0.001 ± 0.001
0.001XaaAsn: 0.001 ± 0.001
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.002XaaTyr: 0.002 ± 0.002
0.003XaaXaa: 0.003 ± 0.002
Statistics based on 2643 proteins (961016 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski