Amino acid dipeptide frequency for Pelobacter carbinolicus (strain DSM 2380 / NBRC 103641 / GraBd1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
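A minimal sketch of how such a frequency could be computed is given below. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that counts are normalised by the total number of pairs; the database may use a different convention. The function name and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each
    protein, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MALLA", "ALL"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

For the toy input there are six pairs in total, two of which are "AL", so `toy_freqs["AL"]` is one third of 1000.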

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
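The comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes the threshold is exactly 1000/400 = 2.5‰ with no tolerance band, which may differ from how the colouring on the page is actually derived. The function and variable names are hypothetical; the three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation for one of the 20 x 20 dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2  # 2.5


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values (in per mille) copied from the table.
examples = {"AlaAla": 10.288, "CysCys": 0.289, "LeuLeu": 12.47}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption AlaAla and LeuLeu are labelled overrepresented and CysCys underrepresented.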

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
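As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(10.288)
ala_ala_percent = per_mille_to_percent(10.288)
```

So the AlaAla value of 10.288‰ corresponds to a fraction of about 0.0103, or roughly 1.03% of all dipeptides.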

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.288 ± 0.12
AlaCys: 1.412 ± 0.037
AlaAsp: 5.355 ± 0.077
AlaGlu: 6.449 ± 0.084
AlaPhe: 3.514 ± 0.066
AlaGly: 8.202 ± 0.102
AlaHis: 1.86 ± 0.041
AlaIle: 5.058 ± 0.083
AlaLys: 3.602 ± 0.086
AlaLeu: 10.921 ± 0.109
AlaMet: 2.79 ± 0.056
AlaAsn: 2.424 ± 0.052
AlaPro: 3.59 ± 0.069
AlaGln: 3.308 ± 0.062
AlaArg: 6.213 ± 0.084
AlaSer: 4.871 ± 0.066
AlaThr: 4.479 ± 0.071
AlaVal: 7.142 ± 0.099
AlaTrp: 1.061 ± 0.035
AlaTyr: 2.238 ± 0.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.163 ± 0.032
CysCys: 0.289 ± 0.018
CysAsp: 0.782 ± 0.026
CysGlu: 0.695 ± 0.029
CysPhe: 0.564 ± 0.024
CysGly: 1.564 ± 0.044
CysHis: 0.412 ± 0.021
CysIle: 0.672 ± 0.027
CysLys: 0.473 ± 0.02
CysLeu: 1.496 ± 0.04
CysMet: 0.313 ± 0.019
CysAsn: 0.417 ± 0.017
CysPro: 0.913 ± 0.032
CysGln: 0.513 ± 0.021
CysArg: 1.17 ± 0.033
CysSer: 0.886 ± 0.027
CysThr: 0.562 ± 0.024
CysVal: 0.778 ± 0.028
CysTrp: 0.173 ± 0.011
CysTyr: 0.385 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.947 ± 0.065
AspCys: 0.78 ± 0.029
AspAsp: 2.926 ± 0.061
AspGlu: 3.401 ± 0.051
AspPhe: 2.568 ± 0.048
AspGly: 4.167 ± 0.071
AspHis: 1.185 ± 0.033
AspIle: 3.629 ± 0.057
AspLys: 2.372 ± 0.049
AspLeu: 6.364 ± 0.086
AspMet: 1.513 ± 0.039
AspAsn: 1.727 ± 0.043
AspPro: 2.796 ± 0.055
AspGln: 1.871 ± 0.051
AspArg: 3.82 ± 0.071
AspSer: 2.717 ± 0.052
AspThr: 2.616 ± 0.051
AspVal: 3.614 ± 0.066
AspTrp: 0.739 ± 0.028
AspTyr: 1.75 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.08 ± 0.086
GluCys: 0.567 ± 0.025
GluAsp: 3.274 ± 0.063
GluGlu: 4.639 ± 0.094
GluPhe: 2.078 ± 0.043
GluGly: 4.304 ± 0.074
GluHis: 1.382 ± 0.038
GluIle: 3.977 ± 0.066
GluLys: 3.634 ± 0.058
GluLeu: 6.557 ± 0.08
GluMet: 1.754 ± 0.035
GluAsn: 2.262 ± 0.052
GluPro: 2.337 ± 0.062
GluGln: 3.118 ± 0.052
GluArg: 4.207 ± 0.073
GluSer: 2.988 ± 0.055
GluThr: 3.378 ± 0.066
GluVal: 4.548 ± 0.074
GluTrp: 0.546 ± 0.025
GluTyr: 1.506 ± 0.043
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.469 ± 0.056
PheCys: 0.687 ± 0.024
PheAsp: 2.596 ± 0.051
PheGlu: 2.28 ± 0.045
PhePhe: 1.89 ± 0.042
PheGly: 3.329 ± 0.059
PheHis: 0.894 ± 0.026
PheIle: 2.038 ± 0.046
PheLys: 1.5 ± 0.034
PheLeu: 4.022 ± 0.072
PheMet: 0.971 ± 0.032
PheAsn: 1.342 ± 0.034
PhePro: 1.753 ± 0.042
PheGln: 1.191 ± 0.036
PheArg: 2.41 ± 0.057
PheSer: 2.783 ± 0.054
PheThr: 2.015 ± 0.042
PheVal: 2.649 ± 0.052
PheTrp: 0.497 ± 0.022
PheTyr: 1.171 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.417 ± 0.096
GlyCys: 1.392 ± 0.037
GlyAsp: 4.058 ± 0.067
GlyGlu: 4.689 ± 0.077
GlyPhe: 3.312 ± 0.063
GlyGly: 6.116 ± 0.097
GlyHis: 1.955 ± 0.042
GlyIle: 4.994 ± 0.078
GlyLys: 4.04 ± 0.07
GlyLeu: 8.496 ± 0.094
GlyMet: 2.358 ± 0.048
GlyAsn: 2.323 ± 0.051
GlyPro: 2.542 ± 0.05
GlyGln: 3.036 ± 0.059
GlyArg: 5.319 ± 0.081
GlySer: 4.211 ± 0.077
GlyThr: 4.093 ± 0.069
GlyVal: 5.698 ± 0.084
GlyTrp: 1.03 ± 0.039
GlyTyr: 2.572 ± 0.056
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.923 ± 0.044
HisCys: 0.397 ± 0.02
HisAsp: 1.122 ± 0.034
HisGlu: 1.085 ± 0.033
HisPhe: 1.027 ± 0.03
HisGly: 1.81 ± 0.037
HisHis: 0.623 ± 0.022
HisIle: 1.265 ± 0.034
HisLys: 0.827 ± 0.032
HisLeu: 2.642 ± 0.063
HisMet: 0.497 ± 0.022
HisAsn: 0.687 ± 0.027
HisPro: 1.513 ± 0.038
HisGln: 0.827 ± 0.027
HisArg: 1.67 ± 0.042
HisSer: 1.137 ± 0.032
HisThr: 1.02 ± 0.032
HisVal: 1.461 ± 0.042
HisTrp: 0.289 ± 0.017
HisTyr: 0.687 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.359 ± 0.079
IleCys: 0.866 ± 0.028
IleAsp: 3.721 ± 0.064
IleGlu: 3.758 ± 0.066
IlePhe: 2.129 ± 0.052
IleGly: 4.574 ± 0.078
IleHis: 1.212 ± 0.036
IleIle: 2.924 ± 0.058
IleLys: 2.378 ± 0.053
IleLeu: 5.643 ± 0.087
IleMet: 1.165 ± 0.034
IleAsn: 1.819 ± 0.045
IlePro: 2.881 ± 0.054
IleGln: 1.678 ± 0.037
IleArg: 3.594 ± 0.064
IleSer: 3.249 ± 0.055
IleThr: 2.772 ± 0.056
IleVal: 3.854 ± 0.063
IleTrp: 0.491 ± 0.024
IleTyr: 1.435 ± 0.038
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.042 ± 0.063
LysCys: 0.456 ± 0.021
LysAsp: 2.306 ± 0.059
LysGlu: 2.909 ± 0.071
LysPhe: 1.177 ± 0.032
LysGly: 3.104 ± 0.053
LysHis: 0.886 ± 0.028
LysIle: 2.829 ± 0.05
LysLys: 2.665 ± 0.063
LysLeu: 3.947 ± 0.062
LysMet: 1.112 ± 0.035
LysAsn: 1.645 ± 0.047
LysPro: 2.078 ± 0.055
LysGln: 1.809 ± 0.045
LysArg: 2.703 ± 0.055
LysSer: 2.28 ± 0.051
LysThr: 2.617 ± 0.054
LysVal: 3.279 ± 0.057
LysTrp: 0.377 ± 0.019
LysTyr: 1.063 ± 0.03
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.74 ± 0.123
LeuCys: 1.534 ± 0.037
LeuAsp: 6.092 ± 0.072
LeuGlu: 6.932 ± 0.096
LeuPhe: 4.351 ± 0.081
LeuGly: 8.18 ± 0.085
LeuHis: 2.516 ± 0.057
LeuIle: 5.233 ± 0.075
LeuLys: 4.83 ± 0.071
LeuLeu: 12.47 ± 0.14
LeuMet: 2.468 ± 0.054
LeuAsn: 3.232 ± 0.062
LeuPro: 5.769 ± 0.083
LeuGln: 4.619 ± 0.078
LeuArg: 7.291 ± 0.111
LeuSer: 6.736 ± 0.09
LeuThr: 5.525 ± 0.083
LeuVal: 7.731 ± 0.099
LeuTrp: 1.163 ± 0.041
LeuTyr: 2.515 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.844 ± 0.054
MetCys: 0.23 ± 0.016
MetAsp: 1.361 ± 0.037
MetGlu: 1.565 ± 0.043
MetPhe: 0.742 ± 0.027
MetGly: 1.802 ± 0.044
MetHis: 0.49 ± 0.021
MetIle: 1.286 ± 0.037
MetLys: 1.301 ± 0.034
MetLeu: 2.643 ± 0.049
MetMet: 0.616 ± 0.025
MetAsn: 0.852 ± 0.03
MetPro: 1.279 ± 0.04
MetGln: 1.027 ± 0.031
MetArg: 1.584 ± 0.038
MetSer: 1.416 ± 0.035
MetThr: 1.527 ± 0.047
MetVal: 2.042 ± 0.049
MetTrp: 0.147 ± 0.013
MetTyr: 0.383 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.671 ± 0.052
AsnCys: 0.45 ± 0.022
AsnAsp: 1.59 ± 0.034
AsnGlu: 1.433 ± 0.037
AsnPhe: 1.116 ± 0.036
AsnGly: 2.319 ± 0.051
AsnHis: 0.711 ± 0.026
AsnIle: 2.131 ± 0.052
AsnLys: 1.194 ± 0.038
AsnLeu: 3.797 ± 0.067
AsnMet: 0.76 ± 0.027
AsnAsn: 0.971 ± 0.034
AsnPro: 1.955 ± 0.046
AsnGln: 1.083 ± 0.03
AsnArg: 2.274 ± 0.046
AsnSer: 1.476 ± 0.04
AsnThr: 1.386 ± 0.038
AsnVal: 1.99 ± 0.047
AsnTrp: 0.343 ± 0.02
AsnTyr: 0.87 ± 0.031
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.379 ± 0.075
ProCys: 0.59 ± 0.024
ProAsp: 3.105 ± 0.062
ProGlu: 3.815 ± 0.069
ProPhe: 1.907 ± 0.047
ProGly: 3.982 ± 0.065
ProHis: 1.055 ± 0.033
ProIle: 2.023 ± 0.043
ProLys: 1.575 ± 0.041
ProLeu: 4.997 ± 0.064
ProMet: 1.159 ± 0.033
ProAsn: 1.11 ± 0.039
ProPro: 1.961 ± 0.052
ProGln: 1.766 ± 0.037
ProArg: 2.456 ± 0.051
ProSer: 2.458 ± 0.045
ProThr: 2.038 ± 0.04
ProVal: 3.94 ± 0.076
ProTrp: 0.613 ± 0.024
ProTyr: 1.214 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.263 ± 0.073
GlnCys: 0.395 ± 0.02
GlnAsp: 1.796 ± 0.041
GlnGlu: 2.372 ± 0.051
GlnPhe: 1.124 ± 0.035
GlnGly: 2.908 ± 0.059
GlnHis: 0.848 ± 0.03
GlnIle: 2.042 ± 0.046
GlnLys: 1.912 ± 0.047
GlnLeu: 4.149 ± 0.079
GlnMet: 0.903 ± 0.031
GlnAsn: 1.242 ± 0.039
GlnPro: 1.791 ± 0.046
GlnGln: 2.15 ± 0.052
GlnArg: 2.932 ± 0.059
GlnSer: 1.971 ± 0.037
GlnThr: 1.996 ± 0.048
GlnVal: 3.147 ± 0.053
GlnTrp: 0.445 ± 0.02
GlnTyr: 0.814 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.11 ± 0.06
ArgCys: 1.021 ± 0.032
ArgAsp: 3.523 ± 0.06
ArgGlu: 4.417 ± 0.072
ArgPhe: 3.069 ± 0.054
ArgGly: 4.084 ± 0.069
ArgHis: 1.807 ± 0.044
ArgIle: 4.22 ± 0.068
ArgLys: 3.16 ± 0.057
ArgLeu: 7.819 ± 0.116
ArgMet: 1.796 ± 0.047
ArgAsn: 2.213 ± 0.055
ArgPro: 2.667 ± 0.053
ArgGln: 3.672 ± 0.062
ArgArg: 5.125 ± 0.088
ArgSer: 3.577 ± 0.055
ArgThr: 2.853 ± 0.051
ArgVal: 4.123 ± 0.058
ArgTrp: 0.878 ± 0.03
ArgTyr: 2.142 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.803 ± 0.083
SerCys: 0.816 ± 0.03
SerAsp: 2.918 ± 0.05
SerGlu: 3.225 ± 0.056
SerPhe: 2.357 ± 0.051
SerGly: 5.133 ± 0.083
SerHis: 1.268 ± 0.032
SerIle: 2.898 ± 0.056
SerLys: 2.023 ± 0.046
SerLeu: 6.445 ± 0.084
SerMet: 1.385 ± 0.037
SerAsn: 1.525 ± 0.039
SerPro: 2.514 ± 0.051
SerGln: 1.967 ± 0.045
SerArg: 3.781 ± 0.063
SerSer: 3.162 ± 0.062
SerThr: 2.502 ± 0.051
SerVal: 3.836 ± 0.062
SerTrp: 0.722 ± 0.026
SerTyr: 1.525 ± 0.042
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.861 ± 0.066
ThrCys: 0.704 ± 0.026
ThrAsp: 2.75 ± 0.052
ThrGlu: 2.856 ± 0.054
ThrPhe: 1.965 ± 0.053
ThrGly: 4.637 ± 0.071
ThrHis: 1.008 ± 0.029
ThrIle: 2.721 ± 0.054
ThrLys: 1.518 ± 0.043
ThrLeu: 5.868 ± 0.078
ThrMet: 1.145 ± 0.032
ThrAsn: 1.4 ± 0.034
ThrPro: 2.771 ± 0.057
ThrGln: 1.398 ± 0.039
ThrArg: 2.94 ± 0.047
ThrSer: 2.614 ± 0.052
ThrThr: 2.548 ± 0.057
ThrVal: 4.033 ± 0.065
ThrTrp: 0.493 ± 0.02
ThrTyr: 1.245 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.363 ± 0.106
ValCys: 1.097 ± 0.037
ValAsp: 4.147 ± 0.064
ValGlu: 4.641 ± 0.079
ValPhe: 2.948 ± 0.058
ValGly: 5.306 ± 0.084
ValHis: 1.416 ± 0.037
ValIle: 3.828 ± 0.061
ValLys: 2.955 ± 0.059
ValLeu: 7.989 ± 0.099
ValMet: 1.7 ± 0.045
ValAsn: 2.163 ± 0.043
ValPro: 3.187 ± 0.063
ValGln: 2.324 ± 0.046
ValArg: 4.527 ± 0.066
ValSer: 4.289 ± 0.062
ValThr: 3.968 ± 0.062
ValVal: 5.894 ± 0.091
ValTrp: 0.67 ± 0.03
ValTyr: 1.65 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.794 ± 0.03
TrpCys: 0.145 ± 0.013
TrpAsp: 0.508 ± 0.028
TrpGlu: 0.626 ± 0.023
TrpPhe: 0.468 ± 0.019
TrpGly: 0.865 ± 0.034
TrpHis: 0.303 ± 0.019
TrpIle: 0.549 ± 0.023
TrpLys: 0.448 ± 0.022
TrpLeu: 1.391 ± 0.048
TrpMet: 0.259 ± 0.014
TrpAsn: 0.35 ± 0.018
TrpPro: 0.516 ± 0.025
TrpGln: 0.731 ± 0.03
TrpArg: 0.899 ± 0.033
TrpSer: 0.624 ± 0.022
TrpThr: 0.492 ± 0.021
TrpVal: 0.698 ± 0.029
TrpTrp: 0.16 ± 0.013
TrpTyr: 0.299 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.261 ± 0.052
TyrCys: 0.394 ± 0.021
TyrAsp: 1.569 ± 0.045
TyrGlu: 1.348 ± 0.037
TyrPhe: 1.179 ± 0.033
TyrGly: 2.264 ± 0.053
TyrHis: 0.68 ± 0.025
TyrIle: 1.185 ± 0.033
TyrLys: 0.879 ± 0.031
TyrLeu: 3.273 ± 0.059
TyrMet: 0.474 ± 0.027
TyrAsn: 0.818 ± 0.031
TyrPro: 1.329 ± 0.036
TyrGln: 1.113 ± 0.031
TyrArg: 2.36 ± 0.045
TyrSer: 1.357 ± 0.038
TyrThr: 1.089 ± 0.035
TyrVal: 1.607 ± 0.032
TyrTrp: 0.287 ± 0.017
TyrTyr: 0.79 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3292 proteins (1056071 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
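A protein-level bootstrap of this kind could look like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates; none of these details is stated on the page. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in
    per mille, resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # One replicate: draw as many proteins as in the original set.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy_set = ["MALLA", "ALL", "LLLL", "MAMA"]
boot_mean, boot_err = bootstrap_error(toy_set, "LL")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the spread reflects protein-to-protein variation.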

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski