Amino acid dipepetide frequency for Capnocytophaga sp. oral taxon 338 str. F0234

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.257AlaAla: 3.257 ± 0.088
0.713AlaCys: 0.713 ± 0.038
2.9AlaAsp: 2.9 ± 0.069
3.619AlaGlu: 3.619 ± 0.081
3.291AlaPhe: 3.291 ± 0.068
3.761AlaGly: 3.761 ± 0.098
1.25AlaHis: 1.25 ± 0.042
5.104AlaIle: 5.104 ± 0.092
4.685AlaLys: 4.685 ± 0.081
6.57AlaLeu: 6.57 ± 0.114
1.479AlaMet: 1.479 ± 0.045
2.972AlaAsn: 2.972 ± 0.078
2.352AlaPro: 2.352 ± 0.065
3.232AlaGln: 3.232 ± 0.072
2.171AlaArg: 2.171 ± 0.06
3.658AlaSer: 3.658 ± 0.082
3.793AlaThr: 3.793 ± 0.1
3.401AlaVal: 3.401 ± 0.068
0.529AlaTrp: 0.529 ± 0.026
2.693AlaTyr: 2.693 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.448CysAla: 0.448 ± 0.03
0.117CysCys: 0.117 ± 0.014
0.376CysAsp: 0.376 ± 0.025
0.534CysGlu: 0.534 ± 0.026
0.489CysPhe: 0.489 ± 0.03
0.604CysGly: 0.604 ± 0.033
0.241CysHis: 0.241 ± 0.022
0.685CysIle: 0.685 ± 0.028
0.617CysLys: 0.617 ± 0.032
0.79CysLeu: 0.79 ± 0.035
0.179CysMet: 0.179 ± 0.014
0.484CysAsn: 0.484 ± 0.024
0.337CysPro: 0.337 ± 0.024
0.358CysGln: 0.358 ± 0.023
0.301CysArg: 0.301 ± 0.023
0.56CysSer: 0.56 ± 0.029
0.493CysThr: 0.493 ± 0.029
0.473CysVal: 0.473 ± 0.027
0.089CysTrp: 0.089 ± 0.01
0.454CysTyr: 0.454 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
2.889AspAla: 2.889 ± 0.073
0.406AspCys: 0.406 ± 0.024
2.268AspAsp: 2.268 ± 0.063
2.953AspGlu: 2.953 ± 0.073
3.462AspPhe: 3.462 ± 0.06
2.742AspGly: 2.742 ± 0.07
0.728AspHis: 0.728 ± 0.032
4.436AspIle: 4.436 ± 0.074
4.519AspLys: 4.519 ± 0.095
4.371AspLeu: 4.371 ± 0.084
1.09AspMet: 1.09 ± 0.037
2.801AspAsn: 2.801 ± 0.069
1.703AspPro: 1.703 ± 0.056
1.153AspGln: 1.153 ± 0.041
1.862AspArg: 1.862 ± 0.05
2.722AspSer: 2.722 ± 0.069
3.064AspThr: 3.064 ± 0.063
2.588AspVal: 2.588 ± 0.071
0.658AspTrp: 0.658 ± 0.029
2.781AspTyr: 2.781 ± 0.066
0.0AspXaa: 0.0 ± 0.0
Glu
4.557GluAla: 4.557 ± 0.098
0.441GluCys: 0.441 ± 0.026
3.287GluAsp: 3.287 ± 0.063
6.262GluGlu: 6.262 ± 0.142
2.545GluPhe: 2.545 ± 0.066
4.323GluGly: 4.323 ± 0.079
1.302GluHis: 1.302 ± 0.042
5.501GluIle: 5.501 ± 0.094
7.261GluLys: 7.261 ± 0.134
6.071GluLeu: 6.071 ± 0.092
1.536GluMet: 1.536 ± 0.046
4.302GluAsn: 4.302 ± 0.078
1.588GluPro: 1.588 ± 0.047
3.048GluGln: 3.048 ± 0.067
3.237GluArg: 3.237 ± 0.079
3.025GluSer: 3.025 ± 0.072
3.23GluThr: 3.23 ± 0.067
4.852GluVal: 4.852 ± 0.109
0.621GluTrp: 0.621 ± 0.03
3.016GluTyr: 3.016 ± 0.074
0.0GluXaa: 0.0 ± 0.0
Phe
3.057PheAla: 3.057 ± 0.074
0.552PheCys: 0.552 ± 0.028
2.748PheAsp: 2.748 ± 0.067
3.025PheGlu: 3.025 ± 0.075
3.298PhePhe: 3.298 ± 0.089
3.065PheGly: 3.065 ± 0.083
0.953PheHis: 0.953 ± 0.032
4.07PheIle: 4.07 ± 0.095
3.146PheLys: 3.146 ± 0.073
5.385PheLeu: 5.385 ± 0.097
1.062PheMet: 1.062 ± 0.036
2.356PheAsn: 2.356 ± 0.053
2.012PhePro: 2.012 ± 0.056
1.63PheGln: 1.63 ± 0.049
1.855PheArg: 1.855 ± 0.055
4.348PheSer: 4.348 ± 0.085
2.898PheThr: 2.898 ± 0.074
3.247PheVal: 3.247 ± 0.079
0.6PheTrp: 0.6 ± 0.033
2.511PheTyr: 2.511 ± 0.06
0.0PheXaa: 0.0 ± 0.0
Gly
3.958GlyAla: 3.958 ± 0.076
0.598GlyCys: 0.598 ± 0.047
3.009GlyAsp: 3.009 ± 0.076
4.111GlyGlu: 4.111 ± 0.083
3.045GlyPhe: 3.045 ± 0.071
4.346GlyGly: 4.346 ± 0.096
1.089GlyHis: 1.089 ± 0.033
5.09GlyIle: 5.09 ± 0.09
5.787GlyLys: 5.787 ± 0.088
4.879GlyLeu: 4.879 ± 0.102
1.5GlyMet: 1.5 ± 0.045
3.453GlyAsn: 3.453 ± 0.081
0.673GlyPro: 0.673 ± 0.036
1.938GlyGln: 1.938 ± 0.056
2.339GlyArg: 2.339 ± 0.059
3.17GlySer: 3.17 ± 0.078
3.717GlyThr: 3.717 ± 0.118
4.264GlyVal: 4.264 ± 0.103
0.62GlyTrp: 0.62 ± 0.029
2.916GlyTyr: 2.916 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
0.921HisAla: 0.921 ± 0.036
0.228HisCys: 0.228 ± 0.017
0.64HisAsp: 0.64 ± 0.03
0.806HisGlu: 0.806 ± 0.033
1.365HisPhe: 1.365 ± 0.046
0.955HisGly: 0.955 ± 0.039
0.464HisHis: 0.464 ± 0.028
1.79HisIle: 1.79 ± 0.052
1.441HisLys: 1.441 ± 0.052
2.209HisLeu: 2.209 ± 0.054
0.195HisMet: 0.195 ± 0.016
0.946HisAsn: 0.946 ± 0.035
0.926HisPro: 0.926 ± 0.037
0.784HisGln: 0.784 ± 0.04
0.804HisArg: 0.804 ± 0.034
1.317HisSer: 1.317 ± 0.043
1.275HisThr: 1.275 ± 0.043
0.664HisVal: 0.664 ± 0.03
0.3HisTrp: 0.3 ± 0.021
1.093HisTyr: 1.093 ± 0.043
0.0HisXaa: 0.0 ± 0.0
Ile
5.666IleAla: 5.666 ± 0.101
0.708IleCys: 0.708 ± 0.03
4.391IleAsp: 4.391 ± 0.083
5.715IleGlu: 5.715 ± 0.1
4.09IlePhe: 4.09 ± 0.1
5.044IleGly: 5.044 ± 0.095
1.604IleHis: 1.604 ± 0.046
6.122IleIle: 6.122 ± 0.111
5.522IleLys: 5.522 ± 0.096
7.381IleLeu: 7.381 ± 0.12
1.325IleMet: 1.325 ± 0.048
3.936IleAsn: 3.936 ± 0.093
3.61IlePro: 3.61 ± 0.069
2.753IleGln: 2.753 ± 0.059
3.24IleArg: 3.24 ± 0.07
5.452IleSer: 5.452 ± 0.094
5.21IleThr: 5.21 ± 0.093
4.529IleVal: 4.529 ± 0.093
0.621IleTrp: 0.621 ± 0.031
3.153IleTyr: 3.153 ± 0.073
0.0IleXaa: 0.0 ± 0.0
Lys
5.132LysAla: 5.132 ± 0.084
0.388LysCys: 0.388 ± 0.02
4.727LysAsp: 4.727 ± 0.087
8.33LysGlu: 8.33 ± 0.121
2.467LysPhe: 2.467 ± 0.067
5.512LysGly: 5.512 ± 0.087
1.359LysHis: 1.359 ± 0.043
6.294LysIle: 6.294 ± 0.103
7.252LysLys: 7.252 ± 0.13
5.813LysLeu: 5.813 ± 0.088
2.077LysMet: 2.077 ± 0.059
5.102LysAsn: 5.102 ± 0.101
2.285LysPro: 2.285 ± 0.051
2.973LysGln: 2.973 ± 0.073
3.421LysArg: 3.421 ± 0.079
3.773LysSer: 3.773 ± 0.083
4.15LysThr: 4.15 ± 0.07
4.973LysVal: 4.973 ± 0.098
0.806LysTrp: 0.806 ± 0.034
3.621LysTyr: 3.621 ± 0.071
0.0LysXaa: 0.0 ± 0.0
Leu
5.825LeuAla: 5.825 ± 0.097
0.902LeuCys: 0.902 ± 0.038
4.222LeuAsp: 4.222 ± 0.078
5.854LeuGlu: 5.854 ± 0.098
5.298LeuPhe: 5.298 ± 0.103
5.378LeuGly: 5.378 ± 0.107
1.872LeuHis: 1.872 ± 0.054
6.762LeuIle: 6.762 ± 0.126
7.311LeuLys: 7.311 ± 0.105
10.133LeuLeu: 10.133 ± 0.161
1.959LeuMet: 1.959 ± 0.054
4.364LeuAsn: 4.364 ± 0.083
4.03LeuPro: 4.03 ± 0.072
4.007LeuGln: 4.007 ± 0.069
3.876LeuArg: 3.876 ± 0.075
7.918LeuSer: 7.918 ± 0.125
5.592LeuThr: 5.592 ± 0.094
4.901LeuVal: 4.901 ± 0.088
1.05LeuTrp: 1.05 ± 0.035
4.128LeuTyr: 4.128 ± 0.096
0.0LeuXaa: 0.0 ± 0.0
Met
1.433MetAla: 1.433 ± 0.054
0.139MetCys: 0.139 ± 0.014
1.118MetAsp: 1.118 ± 0.038
1.473MetGlu: 1.473 ± 0.043
0.765MetPhe: 0.765 ± 0.032
1.473MetGly: 1.473 ± 0.042
0.364MetHis: 0.364 ± 0.022
1.619MetIle: 1.619 ± 0.049
2.189MetLys: 2.189 ± 0.049
2.027MetLeu: 2.027 ± 0.048
0.528MetMet: 0.528 ± 0.029
1.381MetAsn: 1.381 ± 0.043
0.816MetPro: 0.816 ± 0.033
0.886MetGln: 0.886 ± 0.04
0.99MetArg: 0.99 ± 0.034
1.193MetSer: 1.193 ± 0.043
1.019MetThr: 1.019 ± 0.041
1.261MetVal: 1.261 ± 0.045
0.147MetTrp: 0.147 ± 0.014
0.713MetTyr: 0.713 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
3.285AsnAla: 3.285 ± 0.084
0.458AsnCys: 0.458 ± 0.026
2.479AsnAsp: 2.479 ± 0.063
3.245AsnGlu: 3.245 ± 0.07
2.627AsnPhe: 2.627 ± 0.078
3.098AsnGly: 3.098 ± 0.063
1.049AsnHis: 1.049 ± 0.038
4.723AsnIle: 4.723 ± 0.093
4.13AsnLys: 4.13 ± 0.085
4.693AsnLeu: 4.693 ± 0.096
1.117AsnMet: 1.117 ± 0.042
3.133AsnAsn: 3.133 ± 0.086
2.588AsnPro: 2.588 ± 0.059
1.872AsnGln: 1.872 ± 0.049
2.169AsnArg: 2.169 ± 0.058
2.893AsnSer: 2.893 ± 0.06
3.399AsnThr: 3.399 ± 0.079
2.758AsnVal: 2.758 ± 0.072
0.576AsnTrp: 0.576 ± 0.027
2.608AsnTyr: 2.608 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
1.967ProAla: 1.967 ± 0.063
0.31ProCys: 0.31 ± 0.023
1.734ProAsp: 1.734 ± 0.054
2.825ProGlu: 2.825 ± 0.058
2.113ProPhe: 2.113 ± 0.051
1.214ProGly: 1.214 ± 0.059
0.75ProHis: 0.75 ± 0.031
3.181ProIle: 3.181 ± 0.071
2.906ProLys: 2.906 ± 0.069
3.582ProLeu: 3.582 ± 0.076
0.873ProMet: 0.873 ± 0.036
2.147ProAsn: 2.147 ± 0.057
0.99ProPro: 0.99 ± 0.045
1.626ProGln: 1.626 ± 0.054
0.959ProArg: 0.959 ± 0.041
2.384ProSer: 2.384 ± 0.068
2.46ProThr: 2.46 ± 0.068
2.032ProVal: 2.032 ± 0.058
0.313ProTrp: 0.313 ± 0.022
1.854ProTyr: 1.854 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
2.595GlnAla: 2.595 ± 0.073
0.265GlnCys: 0.265 ± 0.02
1.528GlnAsp: 1.528 ± 0.048
3.214GlnGlu: 3.214 ± 0.07
1.536GlnPhe: 1.536 ± 0.047
2.539GlnGly: 2.539 ± 0.062
0.693GlnHis: 0.693 ± 0.029
2.882GlnIle: 2.882 ± 0.072
3.675GlnLys: 3.675 ± 0.079
3.655GlnLeu: 3.655 ± 0.088
1.05GlnMet: 1.05 ± 0.04
1.864GlnAsn: 1.864 ± 0.058
1.209GlnPro: 1.209 ± 0.042
1.916GlnGln: 1.916 ± 0.064
1.739GlnArg: 1.739 ± 0.055
1.982GlnSer: 1.982 ± 0.055
2.161GlnThr: 2.161 ± 0.06
2.52GlnVal: 2.52 ± 0.062
0.638GlnTrp: 0.638 ± 0.033
1.807GlnTyr: 1.807 ± 0.057
0.0GlnXaa: 0.0 ± 0.0
Arg
2.324ArgAla: 2.324 ± 0.065
0.256ArgCys: 0.256 ± 0.018
1.834ArgAsp: 1.834 ± 0.062
3.001ArgGlu: 3.001 ± 0.072
1.96ArgPhe: 1.96 ± 0.052
2.076ArgGly: 2.076 ± 0.055
0.674ArgHis: 0.674 ± 0.028
3.569ArgIle: 3.569 ± 0.073
3.361ArgLys: 3.361 ± 0.078
3.675ArgLeu: 3.675 ± 0.083
1.03ArgMet: 1.03 ± 0.04
2.203ArgAsn: 2.203 ± 0.059
1.273ArgPro: 1.273 ± 0.041
1.358ArgGln: 1.358 ± 0.039
1.622ArgArg: 1.622 ± 0.051
1.97ArgSer: 1.97 ± 0.054
2.341ArgThr: 2.341 ± 0.063
2.251ArgVal: 2.251 ± 0.062
0.426ArgTrp: 0.426 ± 0.024
2.018ArgTyr: 2.018 ± 0.058
0.0ArgXaa: 0.0 ± 0.0
Ser
3.415SerAla: 3.415 ± 0.078
0.613SerCys: 0.613 ± 0.029
3.062SerAsp: 3.062 ± 0.066
3.637SerGlu: 3.637 ± 0.088
3.93SerPhe: 3.93 ± 0.082
3.661SerGly: 3.661 ± 0.081
1.143SerHis: 1.143 ± 0.041
4.653SerIle: 4.653 ± 0.084
4.255SerLys: 4.255 ± 0.083
6.712SerLeu: 6.712 ± 0.111
1.149SerMet: 1.149 ± 0.037
2.765SerAsn: 2.765 ± 0.076
2.375SerPro: 2.375 ± 0.052
2.701SerGln: 2.701 ± 0.06
2.065SerArg: 2.065 ± 0.06
3.851SerSer: 3.851 ± 0.092
3.04SerThr: 3.04 ± 0.073
3.718SerVal: 3.718 ± 0.083
0.748SerTrp: 0.748 ± 0.032
3.056SerTyr: 3.056 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
3.514ThrAla: 3.514 ± 0.108
0.422ThrCys: 0.422 ± 0.027
3.109ThrAsp: 3.109 ± 0.073
3.839ThrGlu: 3.839 ± 0.071
3.282ThrPhe: 3.282 ± 0.067
3.655ThrGly: 3.655 ± 0.105
1.338ThrHis: 1.338 ± 0.037
4.684ThrIle: 4.684 ± 0.115
3.583ThrLys: 3.583 ± 0.08
6.286ThrLeu: 6.286 ± 0.094
0.939ThrMet: 0.939 ± 0.037
2.693ThrAsn: 2.693 ± 0.074
3.241ThrPro: 3.241 ± 0.097
2.561ThrGln: 2.561 ± 0.056
1.68ThrArg: 1.68 ± 0.049
3.425ThrSer: 3.425 ± 0.083
3.573ThrThr: 3.573 ± 0.096
3.017ThrVal: 3.017 ± 0.105
0.446ThrTrp: 0.446 ± 0.023
2.937ThrTyr: 2.937 ± 0.083
0.0ThrXaa: 0.0 ± 0.0
Val
4.019ValAla: 4.019 ± 0.085
0.606ValCys: 0.606 ± 0.026
2.705ValAsp: 2.705 ± 0.06
3.894ValGlu: 3.894 ± 0.074
3.04ValPhe: 3.04 ± 0.066
3.725ValGly: 3.725 ± 0.086
0.959ValHis: 0.959 ± 0.042
4.548ValIle: 4.548 ± 0.082
4.3ValLys: 4.3 ± 0.081
5.548ValLeu: 5.548 ± 0.106
1.27ValMet: 1.27 ± 0.042
2.782ValAsn: 2.782 ± 0.07
2.272ValPro: 2.272 ± 0.07
2.032ValGln: 2.032 ± 0.06
2.44ValArg: 2.44 ± 0.063
3.827ValSer: 3.827 ± 0.084
3.427ValThr: 3.427 ± 0.137
3.681ValVal: 3.681 ± 0.096
0.561ValTrp: 0.561 ± 0.028
2.444ValTyr: 2.444 ± 0.059
0.0ValXaa: 0.0 ± 0.0
Trp
0.618TrpAla: 0.618 ± 0.029
0.087TrpCys: 0.087 ± 0.011
0.573TrpAsp: 0.573 ± 0.027
0.794TrpGlu: 0.794 ± 0.033
0.488TrpPhe: 0.488 ± 0.029
0.729TrpGly: 0.729 ± 0.034
0.228TrpHis: 0.228 ± 0.017
0.84TrpIle: 0.84 ± 0.035
0.83TrpLys: 0.83 ± 0.037
1.077TrpLeu: 1.077 ± 0.04
0.265TrpMet: 0.265 ± 0.019
0.546TrpAsn: 0.546 ± 0.031
0.097TrpPro: 0.097 ± 0.013
0.534TrpGln: 0.534 ± 0.028
0.468TrpArg: 0.468 ± 0.026
0.496TrpSer: 0.496 ± 0.026
0.438TrpThr: 0.438 ± 0.022
0.722TrpVal: 0.722 ± 0.035
0.145TrpTrp: 0.145 ± 0.015
0.501TrpTyr: 0.501 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.549TyrAla: 2.549 ± 0.061
0.42TyrCys: 0.42 ± 0.022
2.488TyrAsp: 2.488 ± 0.076
2.874TyrGlu: 2.874 ± 0.072
2.726TyrPhe: 2.726 ± 0.075
2.603TyrGly: 2.603 ± 0.077
1.059TyrHis: 1.059 ± 0.037
3.521TyrIle: 3.521 ± 0.07
3.683TyrLys: 3.683 ± 0.07
4.525TyrLeu: 4.525 ± 0.082
0.906TyrMet: 0.906 ± 0.032
2.669TyrAsn: 2.669 ± 0.066
1.844TyrPro: 1.844 ± 0.047
2.108TyrGln: 2.108 ± 0.06
1.983TyrArg: 1.983 ± 0.056
2.587TyrSer: 2.587 ± 0.057
2.945TyrThr: 2.945 ± 0.082
2.228TyrVal: 2.228 ± 0.054
0.582TyrTrp: 0.582 ± 0.025
2.421TyrTyr: 2.421 ± 0.072
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2444 proteins (750425 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski