Amino acid dipeptide frequency for Pedobacter sp. PACM 27299

Apart from single amino acid frequencies, one can also calculate the amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
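As an illustration of what such a calculation can look like, here is a minimal sketch in Python. It is not the code used by the database; it assumes that sequences are given as one-letter strings, that overlapping adjacent pairs are counted within each protein (never across protein boundaries), and that any non-standard letter is mapped to X (Xaa). The function names are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts

def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real proteome data.
toy = ["MKKLA", "MKA"]
for pair, value in sorted(dipeptide_per_mille(toy).items()):
    print(pair, round(value, 1))
```

In the toy input the pair MK occurs twice among six pairs in total, so it receives one third of the total frequency.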

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
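The uniform expectation can be worked out in a few lines. The sketch below is only an illustration: the page does not state the exact rule used for the red and blue marking, so a plain comparison against the uniform expectation is assumed here, and the function names are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(uniform_expectation_per_mille())     # 20 amino acids, 400 dipeptides
print(uniform_expectation_per_mille(21))   # with Xaa, 441 dipeptides
print(classify(7.185))                     # AlaAla value from the table
print(classify(0.113))                     # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (7.185‰) falls above it and CysCys (0.113‰) falls below it.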

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
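A short sketch of the unit conversion, using the AlaAla value from the table as an example. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 7.185  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # about 0.007185
print(per_mille_to_percent(ala_ala))   # about 0.7185
```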

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.185AlaAla: 7.185 ± 0.104
0.612AlaCys: 0.612 ± 0.018
4.265AlaAsp: 4.265 ± 0.057
4.568AlaGlu: 4.568 ± 0.056
3.496AlaPhe: 3.496 ± 0.046
5.695AlaGly: 5.695 ± 0.109
1.239AlaHis: 1.239 ± 0.029
5.617AlaIle: 5.617 ± 0.063
4.857AlaLys: 4.857 ± 0.064
7.232AlaLeu: 7.232 ± 0.073
1.821AlaMet: 1.821 ± 0.035
3.802AlaAsn: 3.802 ± 0.062
2.367AlaPro: 2.367 ± 0.047
2.842AlaGln: 2.842 ± 0.042
2.299AlaArg: 2.299 ± 0.038
4.71AlaSer: 4.71 ± 0.068
4.109AlaThr: 4.109 ± 0.099
4.879AlaVal: 4.879 ± 0.07
0.766AlaTrp: 0.766 ± 0.02
3.047AlaTyr: 3.047 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.483CysAla: 0.483 ± 0.017
0.113CysCys: 0.113 ± 0.007
0.342CysAsp: 0.342 ± 0.013
0.367CysGlu: 0.367 ± 0.03
0.453CysPhe: 0.453 ± 0.018
0.53CysGly: 0.53 ± 0.018
0.173CysHis: 0.173 ± 0.011
0.536CysIle: 0.536 ± 0.02
0.485CysLys: 0.485 ± 0.017
0.732CysLeu: 0.732 ± 0.022
0.169CysMet: 0.169 ± 0.01
0.346CysAsn: 0.346 ± 0.014
0.283CysPro: 0.283 ± 0.014
0.21CysGln: 0.21 ± 0.012
0.27CysArg: 0.27 ± 0.013
0.506CysSer: 0.506 ± 0.015
0.397CysThr: 0.397 ± 0.028
0.401CysVal: 0.401 ± 0.016
0.087CysTrp: 0.087 ± 0.012
0.296CysTyr: 0.296 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.761AspAla: 3.761 ± 0.056
0.339AspCys: 0.339 ± 0.014
2.267AspAsp: 2.267 ± 0.041
2.998AspGlu: 2.998 ± 0.05
3.228AspPhe: 3.228 ± 0.051
3.758AspGly: 3.758 ± 0.059
1.185AspHis: 1.185 ± 0.024
3.539AspIle: 3.539 ± 0.047
3.591AspLys: 3.591 ± 0.045
5.455AspLeu: 5.455 ± 0.054
1.116AspMet: 1.116 ± 0.024
2.333AspAsn: 2.333 ± 0.037
2.268AspPro: 2.268 ± 0.041
2.502AspGln: 2.502 ± 0.039
2.146AspArg: 2.146 ± 0.037
2.561AspSer: 2.561 ± 0.043
2.318AspThr: 2.318 ± 0.036
3.11AspVal: 3.11 ± 0.047
0.792AspTrp: 0.792 ± 0.022
2.505AspTyr: 2.505 ± 0.048
0.0AspXaa: 0.0 ± 0.0
Glu
4.17GluAla: 4.17 ± 0.05
0.302GluCys: 0.302 ± 0.012
2.827GluAsp: 2.827 ± 0.049
4.035GluGlu: 4.035 ± 0.066
2.459GluPhe: 2.459 ± 0.043
3.46GluGly: 3.46 ± 0.05
1.045GluHis: 1.045 ± 0.027
4.523GluIle: 4.523 ± 0.061
4.884GluLys: 4.884 ± 0.068
6.232GluLeu: 6.232 ± 0.084
1.526GluMet: 1.526 ± 0.033
3.506GluAsn: 3.506 ± 0.049
1.552GluPro: 1.552 ± 0.028
2.543GluGln: 2.543 ± 0.042
2.443GluArg: 2.443 ± 0.042
3.063GluSer: 3.063 ± 0.05
2.891GluThr: 2.891 ± 0.04
3.893GluVal: 3.893 ± 0.054
0.612GluTrp: 0.612 ± 0.022
1.951GluTyr: 1.951 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
3.246PheAla: 3.246 ± 0.044
0.494PheCys: 0.494 ± 0.028
2.788PheAsp: 2.788 ± 0.043
2.801PheGlu: 2.801 ± 0.04
2.499PhePhe: 2.499 ± 0.051
3.354PheGly: 3.354 ± 0.047
0.845PheHis: 0.845 ± 0.023
3.468PheIle: 3.468 ± 0.052
3.504PheLys: 3.504 ± 0.049
4.561PheLeu: 4.561 ± 0.065
1.162PheMet: 1.162 ± 0.026
3.099PheAsn: 3.099 ± 0.049
1.774PhePro: 1.774 ± 0.032
1.547PheGln: 1.547 ± 0.029
1.767PheArg: 1.767 ± 0.031
4.098PheSer: 4.098 ± 0.053
2.843PheThr: 2.843 ± 0.048
2.777PheVal: 2.777 ± 0.04
0.559PheTrp: 0.559 ± 0.02
2.122PheTyr: 2.122 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
4.831GlyAla: 4.831 ± 0.09
0.567GlyCys: 0.567 ± 0.035
3.178GlyAsp: 3.178 ± 0.048
3.239GlyGlu: 3.239 ± 0.046
3.661GlyPhe: 3.661 ± 0.046
4.859GlyGly: 4.859 ± 0.102
1.119GlyHis: 1.119 ± 0.025
5.384GlyIle: 5.384 ± 0.059
5.593GlyLys: 5.593 ± 0.061
6.829GlyLeu: 6.829 ± 0.066
1.802GlyMet: 1.802 ± 0.034
3.807GlyAsn: 3.807 ± 0.088
1.589GlyPro: 1.589 ± 0.04
2.326GlyGln: 2.326 ± 0.036
2.44GlyArg: 2.44 ± 0.044
4.513GlySer: 4.513 ± 0.066
4.374GlyThr: 4.374 ± 0.127
4.413GlyVal: 4.413 ± 0.056
0.899GlyTrp: 0.899 ± 0.026
3.174GlyTyr: 3.174 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
1.056HisAla: 1.056 ± 0.027
0.175HisCys: 0.175 ± 0.011
0.806HisAsp: 0.806 ± 0.021
0.973HisGlu: 0.973 ± 0.024
1.081HisPhe: 1.081 ± 0.028
1.042HisGly: 1.042 ± 0.027
0.534HisHis: 0.534 ± 0.019
1.308HisIle: 1.308 ± 0.028
1.059HisLys: 1.059 ± 0.025
2.0HisLeu: 2.0 ± 0.041
0.277HisMet: 0.277 ± 0.013
0.817HisAsn: 0.817 ± 0.02
0.995HisPro: 0.995 ± 0.032
0.932HisGln: 0.932 ± 0.027
0.702HisArg: 0.702 ± 0.02
1.032HisSer: 1.032 ± 0.024
1.013HisThr: 1.013 ± 0.034
0.897HisVal: 0.897 ± 0.028
0.244HisTrp: 0.244 ± 0.012
0.846HisTyr: 0.846 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
5.868IleAla: 5.868 ± 0.062
0.644IleCys: 0.644 ± 0.022
4.038IleAsp: 4.038 ± 0.057
4.098IleGlu: 4.098 ± 0.058
3.15IlePhe: 3.15 ± 0.054
5.024IleGly: 5.024 ± 0.062
1.302IleHis: 1.302 ± 0.028
4.681IleIle: 4.681 ± 0.067
5.059IleLys: 5.059 ± 0.064
6.494IleLeu: 6.494 ± 0.078
1.354IleMet: 1.354 ± 0.032
4.2IleAsn: 4.2 ± 0.058
3.279IlePro: 3.279 ± 0.047
2.5IleGln: 2.5 ± 0.041
2.912IleArg: 2.912 ± 0.045
5.482IleSer: 5.482 ± 0.058
4.338IleThr: 4.338 ± 0.08
3.981IleVal: 3.981 ± 0.059
0.726IleTrp: 0.726 ± 0.021
2.713IleTyr: 2.713 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
5.238LysAla: 5.238 ± 0.066
0.303LysCys: 0.303 ± 0.012
4.352LysAsp: 4.352 ± 0.058
5.022LysGlu: 5.022 ± 0.074
2.646LysPhe: 2.646 ± 0.039
4.728LysGly: 4.728 ± 0.055
1.293LysHis: 1.293 ± 0.029
5.181LysIle: 5.181 ± 0.065
5.557LysLys: 5.557 ± 0.079
6.412LysLeu: 6.412 ± 0.076
1.999LysMet: 1.999 ± 0.036
4.431LysAsn: 4.431 ± 0.054
2.745LysPro: 2.745 ± 0.044
2.753LysGln: 2.753 ± 0.047
2.73LysArg: 2.73 ± 0.041
4.329LysSer: 4.329 ± 0.053
4.418LysThr: 4.418 ± 0.055
4.659LysVal: 4.659 ± 0.055
0.8LysTrp: 0.8 ± 0.022
2.832LysTyr: 2.832 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
6.989LeuAla: 6.989 ± 0.062
0.793LeuCys: 0.793 ± 0.023
4.933LeuAsp: 4.933 ± 0.055
5.37LeuGlu: 5.37 ± 0.069
4.771LeuPhe: 4.771 ± 0.07
5.917LeuGly: 5.917 ± 0.061
1.641LeuHis: 1.641 ± 0.034
6.853LeuIle: 6.853 ± 0.077
8.065LeuLys: 8.065 ± 0.085
9.902LeuLeu: 9.902 ± 0.111
2.432LeuMet: 2.432 ± 0.038
6.193LeuAsn: 6.193 ± 0.067
4.2LeuPro: 4.2 ± 0.054
3.691LeuGln: 3.691 ± 0.055
3.673LeuArg: 3.673 ± 0.056
7.758LeuSer: 7.758 ± 0.068
5.539LeuThr: 5.539 ± 0.071
5.378LeuVal: 5.378 ± 0.057
0.96LeuTrp: 0.96 ± 0.027
3.289LeuTyr: 3.289 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
1.921MetAla: 1.921 ± 0.035
0.118MetCys: 0.118 ± 0.008
1.243MetAsp: 1.243 ± 0.025
1.609MetGlu: 1.609 ± 0.03
0.826MetPhe: 0.826 ± 0.022
1.562MetGly: 1.562 ± 0.03
0.339MetHis: 0.339 ± 0.014
1.665MetIle: 1.665 ± 0.031
2.084MetLys: 2.084 ± 0.038
2.247MetLeu: 2.247 ± 0.042
0.728MetMet: 0.728 ± 0.023
1.452MetAsn: 1.452 ± 0.03
1.04MetPro: 1.04 ± 0.026
0.84MetGln: 0.84 ± 0.026
0.94MetArg: 0.94 ± 0.023
1.393MetSer: 1.393 ± 0.028
1.224MetThr: 1.224 ± 0.026
1.592MetVal: 1.592 ± 0.031
0.171MetTrp: 0.171 ± 0.01
0.676MetTyr: 0.676 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
4.152AsnAla: 4.152 ± 0.071
0.372AsnCys: 0.372 ± 0.016
2.738AsnAsp: 2.738 ± 0.05
2.974AsnGlu: 2.974 ± 0.042
2.743AsnPhe: 2.743 ± 0.043
4.23AsnGly: 4.23 ± 0.068
0.993AsnHis: 0.993 ± 0.023
3.985AsnIle: 3.985 ± 0.052
3.745AsnLys: 3.745 ± 0.051
5.209AsnLeu: 5.209 ± 0.065
1.257AsnMet: 1.257 ± 0.027
3.137AsnAsn: 3.137 ± 0.074
2.822AsnPro: 2.822 ± 0.053
2.099AsnGln: 2.099 ± 0.04
2.201AsnArg: 2.201 ± 0.038
3.54AsnSer: 3.54 ± 0.054
3.392AsnThr: 3.392 ± 0.063
3.314AsnVal: 3.314 ± 0.061
0.744AsnTrp: 0.744 ± 0.022
2.702AsnTyr: 2.702 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
3.298ProAla: 3.298 ± 0.071
0.217ProCys: 0.217 ± 0.013
2.401ProAsp: 2.401 ± 0.036
3.06ProGlu: 3.06 ± 0.047
1.949ProPhe: 1.949 ± 0.038
2.88ProGly: 2.88 ± 0.051
0.603ProHis: 0.603 ± 0.023
2.574ProIle: 2.574 ± 0.038
2.345ProLys: 2.345 ± 0.037
3.463ProLeu: 3.463 ± 0.042
0.849ProMet: 0.849 ± 0.021
1.998ProAsn: 1.998 ± 0.034
0.918ProPro: 0.918 ± 0.028
1.317ProGln: 1.317 ± 0.029
1.064ProArg: 1.064 ± 0.025
2.279ProSer: 2.279 ± 0.04
1.93ProThr: 1.93 ± 0.047
3.088ProVal: 3.088 ± 0.058
0.387ProTrp: 0.387 ± 0.013
1.518ProTyr: 1.518 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
2.676GlnAla: 2.676 ± 0.045
0.164GlnCys: 0.164 ± 0.009
1.731GlnAsp: 1.731 ± 0.031
2.37GlnGlu: 2.37 ± 0.04
1.66GlnPhe: 1.66 ± 0.031
2.135GlnGly: 2.135 ± 0.04
0.818GlnHis: 0.818 ± 0.023
2.555GlnIle: 2.555 ± 0.039
2.721GlnLys: 2.721 ± 0.044
4.275GlnLeu: 4.275 ± 0.058
0.949GlnMet: 0.949 ± 0.024
2.016GlnAsn: 2.016 ± 0.033
1.4GlnPro: 1.4 ± 0.027
2.186GlnGln: 2.186 ± 0.041
1.485GlnArg: 1.485 ± 0.031
2.288GlnSer: 2.288 ± 0.039
1.996GlnThr: 1.996 ± 0.035
2.434GlnVal: 2.434 ± 0.039
0.425GlnTrp: 0.425 ± 0.03
1.498GlnTyr: 1.498 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
2.425ArgAla: 2.425 ± 0.043
0.236ArgCys: 0.236 ± 0.016
1.758ArgAsp: 1.758 ± 0.035
2.098ArgGlu: 2.098 ± 0.04
2.056ArgPhe: 2.056 ± 0.035
2.127ArgGly: 2.127 ± 0.044
0.616ArgHis: 0.616 ± 0.023
3.056ArgIle: 3.056 ± 0.041
2.913ArgLys: 2.913 ± 0.042
3.77ArgLeu: 3.77 ± 0.042
1.079ArgMet: 1.079 ± 0.029
2.217ArgAsn: 2.217 ± 0.041
1.321ArgPro: 1.321 ± 0.03
1.277ArgGln: 1.277 ± 0.029
1.384ArgArg: 1.384 ± 0.032
2.327ArgSer: 2.327 ± 0.045
2.0ArgThr: 2.0 ± 0.033
2.201ArgVal: 2.201 ± 0.042
0.49ArgTrp: 0.49 ± 0.019
1.738ArgTyr: 1.738 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
4.928SerAla: 4.928 ± 0.08
0.526SerCys: 0.526 ± 0.018
3.061SerAsp: 3.061 ± 0.041
3.142SerGlu: 3.142 ± 0.049
3.838SerPhe: 3.838 ± 0.052
5.139SerGly: 5.139 ± 0.077
1.059SerHis: 1.059 ± 0.029
4.95SerIle: 4.95 ± 0.057
4.372SerLys: 4.372 ± 0.054
6.438SerLeu: 6.438 ± 0.064
1.468SerMet: 1.468 ± 0.032
3.441SerAsn: 3.441 ± 0.055
2.574SerPro: 2.574 ± 0.039
1.973SerGln: 1.973 ± 0.034
2.568SerArg: 2.568 ± 0.039
4.592SerSer: 4.592 ± 0.084
3.84SerThr: 3.84 ± 0.058
3.988SerVal: 3.988 ± 0.052
0.853SerTrp: 0.853 ± 0.027
2.843SerTyr: 2.843 ± 0.048
0.0SerXaa: 0.0 ± 0.0
Thr
5.072ThrAla: 5.072 ± 0.107
0.312ThrCys: 0.312 ± 0.014
3.182ThrAsp: 3.182 ± 0.049
3.173ThrGlu: 3.173 ± 0.045
2.777ThrPhe: 2.777 ± 0.043
4.789ThrGly: 4.789 ± 0.111
0.893ThrHis: 0.893 ± 0.025
4.166ThrIle: 4.166 ± 0.078
3.379ThrLys: 3.379 ± 0.034
5.482ThrLeu: 5.482 ± 0.071
1.07ThrMet: 1.07 ± 0.024
2.807ThrAsn: 2.807 ± 0.057
2.466ThrPro: 2.466 ± 0.045
1.807ThrGln: 1.807 ± 0.035
1.78ThrArg: 1.78 ± 0.035
3.432ThrSer: 3.432 ± 0.071
3.262ThrThr: 3.262 ± 0.08
3.834ThrVal: 3.834 ± 0.105
0.607ThrTrp: 0.607 ± 0.02
2.387ThrTyr: 2.387 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
4.483ValAla: 4.483 ± 0.069
0.491ValCys: 0.491 ± 0.02
3.106ValAsp: 3.106 ± 0.038
3.254ValGlu: 3.254 ± 0.054
3.158ValPhe: 3.158 ± 0.049
3.636ValGly: 3.636 ± 0.047
1.002ValHis: 1.002 ± 0.031
4.708ValIle: 4.708 ± 0.047
4.651ValLys: 4.651 ± 0.065
6.343ValLeu: 6.343 ± 0.062
1.519ValMet: 1.519 ± 0.03
3.672ValAsn: 3.672 ± 0.056
2.497ValPro: 2.497 ± 0.042
2.038ValGln: 2.038 ± 0.033
2.061ValArg: 2.061 ± 0.037
4.295ValSer: 4.295 ± 0.069
3.615ValThr: 3.615 ± 0.097
4.052ValVal: 4.052 ± 0.059
0.665ValTrp: 0.665 ± 0.022
2.48ValTyr: 2.48 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
0.779TrpAla: 0.779 ± 0.023
0.105TrpCys: 0.105 ± 0.008
0.595TrpAsp: 0.595 ± 0.02
0.655TrpGlu: 0.655 ± 0.021
0.562TrpPhe: 0.562 ± 0.019
0.787TrpGly: 0.787 ± 0.02
0.199TrpHis: 0.199 ± 0.011
0.725TrpIle: 0.725 ± 0.023
0.865TrpLys: 0.865 ± 0.022
1.157TrpLeu: 1.157 ± 0.028
0.352TrpMet: 0.352 ± 0.015
0.704TrpAsn: 0.704 ± 0.02
0.338TrpPro: 0.338 ± 0.014
0.465TrpGln: 0.465 ± 0.019
0.448TrpArg: 0.448 ± 0.019
0.731TrpSer: 0.731 ± 0.033
0.666TrpThr: 0.666 ± 0.027
0.639TrpVal: 0.639 ± 0.022
0.17TrpTrp: 0.17 ± 0.012
0.508TrpTyr: 0.508 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.844TyrAla: 2.844 ± 0.048
0.296TyrCys: 0.296 ± 0.012
2.161TyrAsp: 2.161 ± 0.04
2.112TyrGlu: 2.112 ± 0.037
2.297TyrPhe: 2.297 ± 0.046
2.942TyrGly: 2.942 ± 0.05
0.888TyrHis: 0.888 ± 0.023
2.37TyrIle: 2.37 ± 0.039
2.701TyrLys: 2.701 ± 0.048
4.115TyrLeu: 4.115 ± 0.059
0.743TyrMet: 0.743 ± 0.02
2.369TyrAsn: 2.369 ± 0.048
1.71TyrPro: 1.71 ± 0.033
1.934TyrGln: 1.934 ± 0.035
1.778TyrArg: 1.778 ± 0.041
2.663TyrSer: 2.663 ± 0.047
2.512TyrThr: 2.512 ± 0.064
2.197TyrVal: 2.197 ± 0.039
0.492TyrTrp: 0.492 ± 0.02
1.815TyrTyr: 1.815 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4966 proteins (1786279 amino acids)
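These totals allow a rough conversion of the per-mille values back into approximate counts. The sketch below assumes that pairs are counted within proteins only, so a protein of length L contributes L - 1 pairs; the page does not confirm this, and the function names are hypothetical.

```python
N_PROTEINS = 4966
N_AMINO_ACIDS = 1786279

def n_dipeptides(n_amino_acids, n_proteins):
    """Total adjacent pairs, assuming each protein of length L gives L - 1 pairs."""
    return n_amino_acids - n_proteins

def approx_count(per_mille, total_pairs):
    """Approximate number of occurrences behind a per-mille frequency."""
    return per_mille * 1e-3 * total_pairs

total_pairs = n_dipeptides(N_AMINO_ACIDS, N_PROTEINS)
print(total_pairs)                              # 1781313
print(round(approx_count(9.902, total_pairs)))  # LeuLeu, the highest value in the table
print(N_AMINO_ACIDS / N_PROTEINS)               # mean protein length
```

Under this assumption LeuLeu (9.902‰) corresponds to somewhat more than 17 thousand occurrences in the proteome.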

Note: The error has been estimated by bootstrapping (×100) at the protein level
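A protein-level bootstrap of this kind can be sketched as follows. This is an assumption-laden illustration, not the database's implementation: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The seed, the use of the standard deviation, and all names are hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input, not real proteome data.
toy = ["MKKLA", "MKA", "AAKKL", "LLLKA"]
mean, err = bootstrap_error(toy, "KK")
print(round(mean, 1), "±", round(err, 1))
```

Resampling whole proteins rather than individual pairs keeps the pairs from one protein together, so variation between proteins is reflected in the error estimate.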

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski