Amino acid dipeptide frequency for Clostridium kluyveri (strain ATCC 8527 / DSM 555 / NCIMB 10680)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
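The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: sequences are given as one-letter strings, dipeptides are counted as overlapping adjacent pairs within each protein, and the "expected" value is the uniform 1/400 baseline mentioned in the text. The function names are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 ordered pairs, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Each sequence is scanned on its own, so no pair spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["MKKLLA", "MKIIE"]
freqs = dipeptide_per_mille(toy)
```

With the uniform baseline, a table value such as 4.556‰ would be classified as overrepresented and 0.702‰ as underrepresented. Whether the page's red and blue marking uses exactly this threshold is an assumption.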

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
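As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the sample value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 4.556  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```

Here `fraction` is about 0.004556 and `percent` is about 0.4556, which can be compared directly with the 0.25% uniform expectation.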

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.556 ± 0.084
AlaCys: 0.702 ± 0.03
AlaAsp: 2.75 ± 0.059
AlaGlu: 3.412 ± 0.067
AlaPhe: 2.478 ± 0.053
AlaGly: 3.755 ± 0.067
AlaHis: 0.829 ± 0.029
AlaIle: 5.107 ± 0.078
AlaLys: 4.151 ± 0.077
AlaLeu: 5.844 ± 0.082
AlaMet: 1.482 ± 0.042
AlaAsn: 2.371 ± 0.051
AlaPro: 1.422 ± 0.036
AlaGln: 1.598 ± 0.042
AlaArg: 2.056 ± 0.042
AlaSer: 3.578 ± 0.064
AlaThr: 2.43 ± 0.088
AlaVal: 4.667 ± 0.083
AlaTrp: 0.354 ± 0.016
AlaTyr: 2.114 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.712 ± 0.027
CysCys: 0.24 ± 0.014
CysAsp: 0.721 ± 0.025
CysGlu: 0.881 ± 0.026
CysPhe: 0.556 ± 0.022
CysGly: 1.158 ± 0.036
CysHis: 0.258 ± 0.016
CysIle: 1.317 ± 0.038
CysLys: 1.042 ± 0.033
CysLeu: 0.926 ± 0.03
CysMet: 0.347 ± 0.016
CysAsn: 0.784 ± 0.03
CysPro: 0.496 ± 0.024
CysGln: 0.268 ± 0.017
CysArg: 0.583 ± 0.023
CysSer: 0.957 ± 0.039
CysThr: 0.627 ± 0.027
CysVal: 0.741 ± 0.026
CysTrp: 0.082 ± 0.008
CysTyr: 0.555 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.634 ± 0.057
AspCys: 0.718 ± 0.027
AspAsp: 2.394 ± 0.053
AspGlu: 4.017 ± 0.071
AspPhe: 2.662 ± 0.048
AspGly: 3.066 ± 0.06
AspHis: 0.525 ± 0.024
AspIle: 6.409 ± 0.084
AspLys: 5.408 ± 0.072
AspLeu: 4.448 ± 0.061
AspMet: 1.669 ± 0.039
AspAsn: 3.288 ± 0.064
AspPro: 1.296 ± 0.038
AspGln: 0.74 ± 0.029
AspArg: 2.002 ± 0.045
AspSer: 3.284 ± 0.061
AspThr: 2.775 ± 0.065
AspVal: 3.428 ± 0.057
AspTrp: 0.387 ± 0.019
AspTyr: 2.341 ± 0.052
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.804 ± 0.058
GluCys: 0.712 ± 0.026
GluAsp: 4.337 ± 0.075
GluGlu: 6.057 ± 0.098
GluPhe: 2.873 ± 0.056
GluGly: 3.889 ± 0.062
GluHis: 1.003 ± 0.031
GluIle: 6.582 ± 0.094
GluLys: 7.267 ± 0.096
GluLeu: 6.417 ± 0.088
GluMet: 1.775 ± 0.035
GluAsn: 5.31 ± 0.079
GluPro: 1.378 ± 0.035
GluGln: 2.16 ± 0.049
GluArg: 2.518 ± 0.053
GluSer: 3.634 ± 0.059
GluThr: 2.692 ± 0.055
GluVal: 4.34 ± 0.068
GluTrp: 0.403 ± 0.019
GluTyr: 2.9 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.114 ± 0.048
PheCys: 0.577 ± 0.024
PheAsp: 2.233 ± 0.044
PheGlu: 2.496 ± 0.049
PhePhe: 1.886 ± 0.051
PheGly: 2.703 ± 0.053
PheHis: 0.673 ± 0.025
PheIle: 4.641 ± 0.078
PheLys: 3.919 ± 0.065
PheLeu: 3.842 ± 0.07
PheMet: 1.197 ± 0.033
PheAsn: 2.845 ± 0.051
PhePro: 1.272 ± 0.037
PheGln: 1.252 ± 0.031
PheArg: 1.323 ± 0.034
PheSer: 3.282 ± 0.06
PheThr: 2.449 ± 0.048
PheVal: 2.437 ± 0.052
PheTrp: 0.321 ± 0.017
PheTyr: 1.861 ± 0.046
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.877 ± 0.103
GlyCys: 1.008 ± 0.034
GlyAsp: 3.238 ± 0.057
GlyGlu: 4.198 ± 0.074
GlyPhe: 2.916 ± 0.048
GlyGly: 4.318 ± 0.085
GlyHis: 1.026 ± 0.036
GlyIle: 7.036 ± 0.091
GlyLys: 5.627 ± 0.078
GlyLeu: 4.97 ± 0.069
GlyMet: 1.755 ± 0.038
GlyAsn: 3.388 ± 0.053
GlyPro: 1.09 ± 0.044
GlyGln: 1.496 ± 0.04
GlyArg: 2.439 ± 0.053
GlySer: 3.857 ± 0.067
GlyThr: 3.576 ± 0.064
GlyVal: 4.273 ± 0.063
GlyTrp: 0.498 ± 0.024
GlyTyr: 2.933 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.7 ± 0.028
HisCys: 0.26 ± 0.015
HisAsp: 0.651 ± 0.022
HisGlu: 0.873 ± 0.03
HisPhe: 0.652 ± 0.026
HisGly: 0.995 ± 0.031
HisHis: 0.327 ± 0.02
HisIle: 1.531 ± 0.04
HisLys: 1.174 ± 0.032
HisLeu: 1.153 ± 0.034
HisMet: 0.429 ± 0.02
HisAsn: 0.845 ± 0.029
HisPro: 0.654 ± 0.027
HisGln: 0.369 ± 0.02
HisArg: 0.654 ± 0.026
HisSer: 0.989 ± 0.035
HisThr: 0.695 ± 0.025
HisVal: 0.833 ± 0.026
HisTrp: 0.124 ± 0.01
HisTyr: 0.592 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.605 ± 0.081
IleCys: 1.43 ± 0.043
IleAsp: 5.491 ± 0.074
IleGlu: 6.758 ± 0.085
IlePhe: 4.412 ± 0.077
IleGly: 6.159 ± 0.086
IleHis: 1.309 ± 0.034
IleIle: 9.66 ± 0.134
IleLys: 8.842 ± 0.109
IleLeu: 9.2 ± 0.112
IleMet: 2.525 ± 0.054
IleAsn: 6.051 ± 0.082
IlePro: 3.605 ± 0.059
IleGln: 2.567 ± 0.048
IleArg: 3.309 ± 0.053
IleSer: 7.369 ± 0.087
IleThr: 5.085 ± 0.072
IleVal: 6.097 ± 0.084
IleTrp: 0.62 ± 0.024
IleTyr: 3.789 ± 0.065
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.621 ± 0.076
LysCys: 0.988 ± 0.031
LysAsp: 5.536 ± 0.079
LysGlu: 7.466 ± 0.096
LysPhe: 3.347 ± 0.054
LysGly: 5.037 ± 0.071
LysHis: 1.179 ± 0.03
LysIle: 8.738 ± 0.087
LysLys: 8.307 ± 0.095
LysLeu: 7.72 ± 0.076
LysMet: 2.444 ± 0.049
LysAsn: 6.901 ± 0.099
LysPro: 2.147 ± 0.048
LysGln: 2.402 ± 0.051
LysArg: 3.176 ± 0.055
LysSer: 5.521 ± 0.074
LysThr: 3.826 ± 0.059
LysVal: 5.743 ± 0.08
LysTrp: 0.697 ± 0.027
LysTyr: 4.234 ± 0.068
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.837 ± 0.062
LeuCys: 1.273 ± 0.035
LeuAsp: 4.756 ± 0.069
LeuGlu: 5.913 ± 0.092
LeuPhe: 3.641 ± 0.067
LeuGly: 5.748 ± 0.085
LeuHis: 1.219 ± 0.034
LeuIle: 7.989 ± 0.108
LeuLys: 8.947 ± 0.087
LeuLeu: 7.483 ± 0.114
LeuMet: 2.279 ± 0.044
LeuAsn: 6.03 ± 0.075
LeuPro: 2.833 ± 0.056
LeuGln: 2.398 ± 0.049
LeuArg: 3.174 ± 0.05
LeuSer: 6.593 ± 0.08
LeuThr: 4.325 ± 0.084
LeuVal: 4.969 ± 0.069
LeuTrp: 0.664 ± 0.027
LeuTyr: 3.294 ± 0.06
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.848 ± 0.048
MetCys: 0.319 ± 0.016
MetAsp: 1.781 ± 0.043
MetGlu: 2.116 ± 0.046
MetPhe: 0.99 ± 0.032
MetGly: 1.89 ± 0.04
MetHis: 0.356 ± 0.017
MetIle: 2.167 ± 0.047
MetLys: 2.458 ± 0.048
MetLeu: 2.336 ± 0.048
MetMet: 0.63 ± 0.028
MetAsn: 1.686 ± 0.038
MetPro: 0.946 ± 0.027
MetGln: 0.638 ± 0.025
MetArg: 0.944 ± 0.031
MetSer: 1.765 ± 0.037
MetThr: 1.23 ± 0.029
MetVal: 1.683 ± 0.044
MetTrp: 0.171 ± 0.012
MetTyr: 0.899 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.889 ± 0.055
AsnCys: 0.843 ± 0.031
AsnAsp: 2.623 ± 0.054
AsnGlu: 3.715 ± 0.065
AsnPhe: 2.719 ± 0.049
AsnGly: 3.414 ± 0.069
AsnHis: 0.843 ± 0.027
AsnIle: 7.413 ± 0.088
AsnLys: 5.926 ± 0.086
AsnLeu: 5.599 ± 0.075
AsnMet: 1.839 ± 0.048
AsnAsn: 4.055 ± 0.08
AsnPro: 2.064 ± 0.044
AsnGln: 1.476 ± 0.044
AsnArg: 2.247 ± 0.052
AsnSer: 4.308 ± 0.078
AsnThr: 3.269 ± 0.072
AsnVal: 3.706 ± 0.056
AsnTrp: 0.485 ± 0.021
AsnTyr: 2.758 ± 0.053
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.46 ± 0.04
ProCys: 0.375 ± 0.02
ProAsp: 1.618 ± 0.037
ProGlu: 2.475 ± 0.048
ProPhe: 1.345 ± 0.039
ProGly: 1.765 ± 0.036
ProHis: 0.508 ± 0.025
ProIle: 2.737 ± 0.048
ProLys: 2.247 ± 0.047
ProLeu: 2.614 ± 0.043
ProMet: 0.721 ± 0.025
ProAsn: 1.362 ± 0.032
ProPro: 0.759 ± 0.032
ProGln: 0.866 ± 0.031
ProArg: 0.872 ± 0.031
ProSer: 1.82 ± 0.039
ProThr: 1.403 ± 0.045
ProVal: 2.239 ± 0.046
ProTrp: 0.246 ± 0.013
ProTyr: 1.416 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.495 ± 0.045
GlnCys: 0.348 ± 0.017
GlnAsp: 1.306 ± 0.031
GlnGlu: 1.711 ± 0.042
GlnPhe: 1.085 ± 0.031
GlnGly: 1.732 ± 0.041
GlnHis: 0.39 ± 0.018
GlnIle: 2.516 ± 0.053
GlnLys: 2.326 ± 0.04
GlnLeu: 2.306 ± 0.054
GlnMet: 0.768 ± 0.028
GlnAsn: 1.703 ± 0.041
GlnPro: 0.64 ± 0.026
GlnGln: 0.96 ± 0.036
GlnArg: 1.017 ± 0.03
GlnSer: 1.639 ± 0.038
GlnThr: 1.192 ± 0.038
GlnVal: 1.663 ± 0.039
GlnTrp: 0.242 ± 0.015
GlnTyr: 1.131 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.996 ± 0.042
ArgCys: 0.448 ± 0.021
ArgAsp: 1.895 ± 0.041
ArgGlu: 3.208 ± 0.059
ArgPhe: 1.483 ± 0.037
ArgGly: 2.154 ± 0.04
ArgHis: 0.56 ± 0.022
ArgIle: 3.412 ± 0.054
ArgLys: 3.336 ± 0.058
ArgLeu: 3.116 ± 0.058
ArgMet: 1.0 ± 0.034
ArgAsn: 2.106 ± 0.046
ArgPro: 0.966 ± 0.028
ArgGln: 1.079 ± 0.031
ArgArg: 1.631 ± 0.047
ArgSer: 1.75 ± 0.041
ArgThr: 1.691 ± 0.036
ArgVal: 2.152 ± 0.046
ArgTrp: 0.319 ± 0.019
ArgTyr: 1.47 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.509 ± 0.065
SerCys: 0.782 ± 0.028
SerAsp: 3.184 ± 0.059
SerGlu: 4.178 ± 0.07
SerPhe: 2.948 ± 0.056
SerGly: 4.773 ± 0.076
SerHis: 1.031 ± 0.032
SerIle: 7.077 ± 0.089
SerLys: 5.617 ± 0.071
SerLeu: 5.748 ± 0.071
SerMet: 1.844 ± 0.04
SerAsn: 3.82 ± 0.073
SerPro: 1.866 ± 0.04
SerGln: 1.941 ± 0.045
SerArg: 2.31 ± 0.045
SerSer: 4.906 ± 0.081
SerThr: 3.393 ± 0.06
SerVal: 4.145 ± 0.067
SerTrp: 0.463 ± 0.02
SerTyr: 2.759 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.312 ± 0.065
ThrCys: 0.565 ± 0.023
ThrAsp: 2.502 ± 0.046
ThrGlu: 2.981 ± 0.055
ThrPhe: 2.118 ± 0.046
ThrGly: 3.862 ± 0.15
ThrHis: 0.775 ± 0.026
ThrIle: 4.712 ± 0.073
ThrLys: 3.473 ± 0.06
ThrLeu: 4.803 ± 0.074
ThrMet: 1.107 ± 0.032
ThrAsn: 2.496 ± 0.056
ThrPro: 1.874 ± 0.046
ThrGln: 1.243 ± 0.038
ThrArg: 1.539 ± 0.033
ThrSer: 3.411 ± 0.063
ThrThr: 2.627 ± 0.065
ThrVal: 3.548 ± 0.066
ThrTrp: 0.384 ± 0.019
ThrTyr: 1.803 ± 0.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.401 ± 0.062
ValCys: 0.942 ± 0.031
ValAsp: 3.678 ± 0.064
ValGlu: 4.404 ± 0.074
ValPhe: 2.859 ± 0.052
ValGly: 3.883 ± 0.073
ValHis: 0.94 ± 0.031
ValIle: 6.035 ± 0.079
ValLys: 5.686 ± 0.082
ValLeu: 5.727 ± 0.072
ValMet: 1.67 ± 0.034
ValAsn: 3.756 ± 0.06
ValPro: 2.032 ± 0.044
ValGln: 1.623 ± 0.035
ValArg: 2.052 ± 0.043
ValSer: 4.468 ± 0.081
ValThr: 3.425 ± 0.066
ValVal: 4.481 ± 0.073
ValTrp: 0.397 ± 0.019
ValTyr: 2.48 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.347 ± 0.018
TrpCys: 0.113 ± 0.011
TrpAsp: 0.407 ± 0.019
TrpGlu: 0.46 ± 0.023
TrpPhe: 0.335 ± 0.016
TrpGly: 0.538 ± 0.023
TrpHis: 0.153 ± 0.012
TrpIle: 0.731 ± 0.027
TrpLys: 0.584 ± 0.024
TrpLeu: 0.613 ± 0.024
TrpMet: 0.209 ± 0.016
TrpAsn: 0.499 ± 0.021
TrpPro: 0.173 ± 0.013
TrpGln: 0.264 ± 0.018
TrpArg: 0.301 ± 0.017
TrpSer: 0.412 ± 0.02
TrpThr: 0.356 ± 0.02
TrpVal: 0.419 ± 0.019
TrpTrp: 0.089 ± 0.009
TrpTyr: 0.255 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.938 ± 0.04
TyrCys: 0.607 ± 0.025
TyrAsp: 2.392 ± 0.045
TyrGlu: 2.733 ± 0.053
TyrPhe: 2.019 ± 0.047
TyrGly: 2.681 ± 0.048
TyrHis: 0.597 ± 0.023
TyrIle: 4.07 ± 0.066
TyrLys: 3.692 ± 0.056
TyrLeu: 3.501 ± 0.061
TyrMet: 1.161 ± 0.033
TyrAsn: 2.897 ± 0.06
TyrPro: 1.332 ± 0.034
TyrGln: 0.828 ± 0.027
TyrArg: 1.624 ± 0.036
TyrSer: 2.751 ± 0.054
TyrThr: 2.073 ± 0.045
TyrVal: 2.374 ± 0.044
TyrTrp: 0.309 ± 0.017
TyrTyr: 1.88 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3828 proteins (1110703 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
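A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_kk, err_kk = bootstrap_error(["MKKLLA", "MKIIE", "AKKA", "MKK"], "KK")
```

Resampling at the protein level keeps the dipeptides of one protein together, which respects the fact that adjacent pairs within a protein are not independent observations.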

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski