Amino acid dipepetide frequency for Paenibacillus sp. 7516

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.264AlaAla: 8.264 ± 0.105
0.7AlaCys: 0.7 ± 0.018
4.253AlaAsp: 4.253 ± 0.053
5.734AlaGlu: 5.734 ± 0.069
3.231AlaPhe: 3.231 ± 0.044
6.57AlaGly: 6.57 ± 0.07
1.456AlaHis: 1.456 ± 0.033
5.11AlaIle: 5.11 ± 0.063
3.894AlaLys: 3.894 ± 0.054
7.975AlaLeu: 7.975 ± 0.072
2.29AlaMet: 2.29 ± 0.042
2.53AlaAsn: 2.53 ± 0.039
2.613AlaPro: 2.613 ± 0.042
2.727AlaGln: 2.727 ± 0.04
3.41AlaArg: 3.41 ± 0.044
5.217AlaSer: 5.217 ± 0.061
3.569AlaThr: 3.569 ± 0.052
6.332AlaVal: 6.332 ± 0.072
0.935AlaTrp: 0.935 ± 0.023
2.663AlaTyr: 2.663 ± 0.037
0.0AlaXaa: 0.0 ± 0.0
Cys
0.508CysAla: 0.508 ± 0.018
0.098CysCys: 0.098 ± 0.008
0.37CysAsp: 0.37 ± 0.014
0.4CysGlu: 0.4 ± 0.017
0.292CysPhe: 0.292 ± 0.013
0.714CysGly: 0.714 ± 0.019
0.181CysHis: 0.181 ± 0.01
0.508CysIle: 0.508 ± 0.016
0.275CysLys: 0.275 ± 0.012
0.684CysLeu: 0.684 ± 0.02
0.206CysMet: 0.206 ± 0.011
0.275CysAsn: 0.275 ± 0.011
0.321CysPro: 0.321 ± 0.014
0.203CysGln: 0.203 ± 0.012
0.402CysArg: 0.402 ± 0.017
0.559CysSer: 0.559 ± 0.02
0.417CysThr: 0.417 ± 0.017
0.442CysVal: 0.442 ± 0.015
0.098CysTrp: 0.098 ± 0.007
0.259CysTyr: 0.259 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
3.792AspAla: 3.792 ± 0.053
0.352AspCys: 0.352 ± 0.016
2.45AspAsp: 2.45 ± 0.044
3.971AspGlu: 3.971 ± 0.047
2.053AspPhe: 2.053 ± 0.034
3.958AspGly: 3.958 ± 0.056
1.288AspHis: 1.288 ± 0.03
3.691AspIle: 3.691 ± 0.047
2.479AspLys: 2.479 ± 0.044
4.998AspLeu: 4.998 ± 0.052
1.539AspMet: 1.539 ± 0.028
1.81AspAsn: 1.81 ± 0.032
2.429AspPro: 2.429 ± 0.038
2.318AspGln: 2.318 ± 0.036
2.728AspArg: 2.728 ± 0.038
2.841AspSer: 2.841 ± 0.04
2.748AspThr: 2.748 ± 0.041
3.702AspVal: 3.702 ± 0.051
0.855AspTrp: 0.855 ± 0.022
2.114AspTyr: 2.114 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
5.949GluAla: 5.949 ± 0.07
0.397GluCys: 0.397 ± 0.016
3.403GluAsp: 3.403 ± 0.045
5.544GluGlu: 5.544 ± 0.064
2.208GluPhe: 2.208 ± 0.034
4.698GluGly: 4.698 ± 0.054
1.727GluHis: 1.727 ± 0.034
4.199GluIle: 4.199 ± 0.05
3.745GluLys: 3.745 ± 0.048
7.028GluLeu: 7.028 ± 0.07
2.213GluMet: 2.213 ± 0.038
2.601GluAsn: 2.601 ± 0.039
2.328GluPro: 2.328 ± 0.039
4.238GluGln: 4.238 ± 0.066
4.007GluArg: 4.007 ± 0.067
3.63GluSer: 3.63 ± 0.051
3.398GluThr: 3.398 ± 0.04
4.657GluVal: 4.657 ± 0.055
0.992GluTrp: 0.992 ± 0.024
2.083GluTyr: 2.083 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.137PheAla: 3.137 ± 0.047
0.319PheCys: 0.319 ± 0.014
2.175PheAsp: 2.175 ± 0.038
2.342PheGlu: 2.342 ± 0.038
1.664PhePhe: 1.664 ± 0.032
3.056PheGly: 3.056 ± 0.046
0.854PheHis: 0.854 ± 0.022
2.789PheIle: 2.789 ± 0.038
1.874PheLys: 1.874 ± 0.031
3.624PheLeu: 3.624 ± 0.045
1.228PheMet: 1.228 ± 0.029
1.696PheAsn: 1.696 ± 0.036
1.567PhePro: 1.567 ± 0.035
1.458PheGln: 1.458 ± 0.029
2.005PheArg: 2.005 ± 0.035
2.755PheSer: 2.755 ± 0.039
2.465PheThr: 2.465 ± 0.038
2.888PheVal: 2.888 ± 0.047
0.568PheTrp: 0.568 ± 0.019
1.511PheTyr: 1.511 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
5.383GlyAla: 5.383 ± 0.059
0.644GlyCys: 0.644 ± 0.02
3.539GlyAsp: 3.539 ± 0.05
4.706GlyGlu: 4.706 ± 0.056
3.145GlyPhe: 3.145 ± 0.036
5.427GlyGly: 5.427 ± 0.073
1.574GlyHis: 1.574 ± 0.03
5.599GlyIle: 5.599 ± 0.065
4.315GlyLys: 4.315 ± 0.051
7.21GlyLeu: 7.21 ± 0.076
2.51GlyMet: 2.51 ± 0.036
2.745GlyAsn: 2.745 ± 0.045
1.98GlyPro: 1.98 ± 0.039
2.862GlyGln: 2.862 ± 0.042
3.411GlyArg: 3.411 ± 0.048
4.866GlySer: 4.866 ± 0.06
4.61GlyThr: 4.61 ± 0.066
5.177GlyVal: 5.177 ± 0.057
1.136GlyTrp: 1.136 ± 0.026
2.942GlyTyr: 2.942 ± 0.038
0.0GlyXaa: 0.0 ± 0.0
His
1.712HisAla: 1.712 ± 0.029
0.203HisCys: 0.203 ± 0.01
1.092HisAsp: 1.092 ± 0.028
1.47HisGlu: 1.47 ± 0.027
1.076HisPhe: 1.076 ± 0.022
1.518HisGly: 1.518 ± 0.028
0.673HisHis: 0.673 ± 0.024
1.484HisIle: 1.484 ± 0.036
0.833HisLys: 0.833 ± 0.024
2.209HisLeu: 2.209 ± 0.047
0.678HisMet: 0.678 ± 0.02
0.731HisAsn: 0.731 ± 0.022
1.249HisPro: 1.249 ± 0.029
0.944HisGln: 0.944 ± 0.021
1.105HisArg: 1.105 ± 0.026
1.248HisSer: 1.248 ± 0.029
1.164HisThr: 1.164 ± 0.025
1.594HisVal: 1.594 ± 0.031
0.355HisTrp: 0.355 ± 0.015
0.918HisTyr: 0.918 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.695IleAla: 5.695 ± 0.054
0.575IleCys: 0.575 ± 0.019
3.456IleAsp: 3.456 ± 0.05
4.181IleGlu: 4.181 ± 0.055
2.36IlePhe: 2.36 ± 0.042
5.209IleGly: 5.209 ± 0.063
1.651IleHis: 1.651 ± 0.029
4.297IleIle: 4.297 ± 0.066
2.827IleLys: 2.827 ± 0.042
5.93IleLeu: 5.93 ± 0.067
1.755IleMet: 1.755 ± 0.033
2.391IleAsn: 2.391 ± 0.04
3.206IlePro: 3.206 ± 0.041
2.892IleGln: 2.892 ± 0.041
3.809IleArg: 3.809 ± 0.048
4.513IleSer: 4.513 ± 0.053
4.004IleThr: 4.004 ± 0.041
4.944IleVal: 4.944 ± 0.059
0.742IleTrp: 0.742 ± 0.022
2.177IleTyr: 2.177 ± 0.034
0.0IleXaa: 0.0 ± 0.0
Lys
3.839LysAla: 3.839 ± 0.055
0.222LysCys: 0.222 ± 0.011
2.785LysAsp: 2.785 ± 0.046
4.117LysGlu: 4.117 ± 0.053
1.418LysPhe: 1.418 ± 0.031
3.469LysGly: 3.469 ± 0.05
1.098LysHis: 1.098 ± 0.026
2.839LysIle: 2.839 ± 0.041
3.006LysLys: 3.006 ± 0.053
5.031LysLeu: 5.031 ± 0.06
1.624LysMet: 1.624 ± 0.028
1.907LysAsn: 1.907 ± 0.033
2.173LysPro: 2.173 ± 0.032
2.433LysGln: 2.433 ± 0.039
2.721LysArg: 2.721 ± 0.046
2.898LysSer: 2.898 ± 0.041
2.736LysThr: 2.736 ± 0.044
3.5LysVal: 3.5 ± 0.045
0.705LysTrp: 0.705 ± 0.02
1.709LysTyr: 1.709 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
7.873LeuAla: 7.873 ± 0.076
0.761LeuCys: 0.761 ± 0.018
5.224LeuAsp: 5.224 ± 0.062
6.323LeuGlu: 6.323 ± 0.078
4.376LeuPhe: 4.376 ± 0.064
6.747LeuGly: 6.747 ± 0.072
2.224LeuHis: 2.224 ± 0.038
6.599LeuIle: 6.599 ± 0.073
5.001LeuLys: 5.001 ± 0.058
10.766LeuLeu: 10.766 ± 0.107
2.76LeuMet: 2.76 ± 0.037
4.123LeuAsn: 4.123 ± 0.052
4.388LeuPro: 4.388 ± 0.049
4.182LeuGln: 4.182 ± 0.055
4.771LeuArg: 4.771 ± 0.057
6.971LeuSer: 6.971 ± 0.066
5.748LeuThr: 5.748 ± 0.053
6.431LeuVal: 6.431 ± 0.064
1.068LeuTrp: 1.068 ± 0.026
3.209LeuTyr: 3.209 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
2.226MetAla: 2.226 ± 0.039
0.167MetCys: 0.167 ± 0.01
1.686MetAsp: 1.686 ± 0.032
2.067MetGlu: 2.067 ± 0.036
1.024MetPhe: 1.024 ± 0.026
1.893MetGly: 1.893 ± 0.035
0.54MetHis: 0.54 ± 0.018
2.105MetIle: 2.105 ± 0.033
2.066MetLys: 2.066 ± 0.032
3.142MetLeu: 3.142 ± 0.042
0.963MetMet: 0.963 ± 0.025
1.653MetAsn: 1.653 ± 0.033
1.19MetPro: 1.19 ± 0.027
1.21MetGln: 1.21 ± 0.026
1.328MetArg: 1.328 ± 0.025
1.97MetSer: 1.97 ± 0.034
1.812MetThr: 1.812 ± 0.032
1.902MetVal: 1.902 ± 0.029
0.278MetTrp: 0.278 ± 0.014
0.838MetTyr: 0.838 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
2.833AsnAla: 2.833 ± 0.044
0.241AsnCys: 0.241 ± 0.011
1.988AsnAsp: 1.988 ± 0.035
2.767AsnGlu: 2.767 ± 0.044
1.329AsnPhe: 1.329 ± 0.027
3.254AsnGly: 3.254 ± 0.053
0.929AsnHis: 0.929 ± 0.023
2.408AsnIle: 2.408 ± 0.041
1.926AsnLys: 1.926 ± 0.037
3.448AsnLeu: 3.448 ± 0.045
1.153AsnMet: 1.153 ± 0.026
1.665AsnAsn: 1.665 ± 0.04
2.016AsnPro: 2.016 ± 0.034
1.698AsnGln: 1.698 ± 0.033
2.092AsnArg: 2.092 ± 0.033
2.239AsnSer: 2.239 ± 0.04
2.188AsnThr: 2.188 ± 0.032
2.766AsnVal: 2.766 ± 0.047
0.551AsnTrp: 0.551 ± 0.02
1.398AsnTyr: 1.398 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
3.301ProAla: 3.301 ± 0.049
0.231ProCys: 0.231 ± 0.012
2.648ProAsp: 2.648 ± 0.042
3.588ProGlu: 3.588 ± 0.048
1.794ProPhe: 1.794 ± 0.036
3.078ProGly: 3.078 ± 0.052
0.908ProHis: 0.908 ± 0.026
2.321ProIle: 2.321 ± 0.028
1.676ProLys: 1.676 ± 0.033
3.906ProLeu: 3.906 ± 0.051
0.995ProMet: 0.995 ± 0.026
1.513ProAsn: 1.513 ± 0.031
1.18ProPro: 1.18 ± 0.026
1.449ProGln: 1.449 ± 0.03
1.39ProArg: 1.39 ± 0.028
2.635ProSer: 2.635 ± 0.039
1.941ProThr: 1.941 ± 0.036
3.423ProVal: 3.423 ± 0.044
0.548ProTrp: 0.548 ± 0.018
1.571ProTyr: 1.571 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.565GlnAla: 3.565 ± 0.045
0.24GlnCys: 0.24 ± 0.011
2.027GlnAsp: 2.027 ± 0.035
3.021GlnGlu: 3.021 ± 0.046
1.629GlnPhe: 1.629 ± 0.031
2.904GlnGly: 2.904 ± 0.04
0.99GlnHis: 0.99 ± 0.027
2.608GlnIle: 2.608 ± 0.035
1.988GlnLys: 1.988 ± 0.035
4.434GlnLeu: 4.434 ± 0.062
1.353GlnMet: 1.353 ± 0.033
1.523GlnAsn: 1.523 ± 0.03
1.639GlnPro: 1.639 ± 0.029
2.128GlnGln: 2.128 ± 0.042
1.962GlnArg: 1.962 ± 0.03
2.489GlnSer: 2.489 ± 0.04
2.24GlnThr: 2.24 ± 0.034
2.851GlnVal: 2.851 ± 0.04
0.595GlnTrp: 0.595 ± 0.018
1.42GlnTyr: 1.42 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
3.184ArgAla: 3.184 ± 0.047
0.327ArgCys: 0.327 ± 0.014
2.432ArgAsp: 2.432 ± 0.04
3.744ArgGlu: 3.744 ± 0.053
2.136ArgPhe: 2.136 ± 0.033
2.865ArgGly: 2.865 ± 0.034
1.038ArgHis: 1.038 ± 0.025
3.614ArgIle: 3.614 ± 0.046
2.926ArgLys: 2.926 ± 0.044
5.138ArgLeu: 5.138 ± 0.063
1.68ArgMet: 1.68 ± 0.03
2.035ArgAsn: 2.035 ± 0.035
1.645ArgPro: 1.645 ± 0.026
2.098ArgGln: 2.098 ± 0.034
2.603ArgArg: 2.603 ± 0.048
3.24ArgSer: 3.24 ± 0.048
2.757ArgThr: 2.757 ± 0.041
3.196ArgVal: 3.196 ± 0.044
0.738ArgTrp: 0.738 ± 0.021
1.983ArgTyr: 1.983 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
4.819SerAla: 4.819 ± 0.058
0.416SerCys: 0.416 ± 0.017
3.192SerAsp: 3.192 ± 0.037
3.998SerGlu: 3.998 ± 0.05
2.863SerPhe: 2.863 ± 0.043
5.462SerGly: 5.462 ± 0.055
1.313SerHis: 1.313 ± 0.026
4.384SerIle: 4.384 ± 0.056
3.163SerLys: 3.163 ± 0.046
6.392SerLeu: 6.392 ± 0.063
1.991SerMet: 1.991 ± 0.033
2.443SerAsn: 2.443 ± 0.044
2.543SerPro: 2.543 ± 0.042
2.191SerGln: 2.191 ± 0.035
3.091SerArg: 3.091 ± 0.044
4.719SerSer: 4.719 ± 0.061
3.487SerThr: 3.487 ± 0.044
4.604SerVal: 4.604 ± 0.063
0.862SerTrp: 0.862 ± 0.022
2.32SerTyr: 2.32 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
4.768ThrAla: 4.768 ± 0.049
0.349ThrCys: 0.349 ± 0.015
3.024ThrAsp: 3.024 ± 0.043
3.543ThrGlu: 3.543 ± 0.043
2.4ThrPhe: 2.4 ± 0.041
4.781ThrGly: 4.781 ± 0.063
1.119ThrHis: 1.119 ± 0.022
3.742ThrIle: 3.742 ± 0.047
2.312ThrLys: 2.312 ± 0.042
5.625ThrLeu: 5.625 ± 0.058
1.485ThrMet: 1.485 ± 0.027
2.006ThrAsn: 2.006 ± 0.034
2.601ThrPro: 2.601 ± 0.036
1.741ThrGln: 1.741 ± 0.031
2.396ThrArg: 2.396 ± 0.036
3.6ThrSer: 3.6 ± 0.048
3.082ThrThr: 3.082 ± 0.048
4.416ThrVal: 4.416 ± 0.06
0.721ThrTrp: 0.721 ± 0.021
2.011ThrTyr: 2.011 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
5.041ValAla: 5.041 ± 0.061
0.586ValCys: 0.586 ± 0.019
3.693ValAsp: 3.693 ± 0.049
4.424ValGlu: 4.424 ± 0.052
2.861ValPhe: 2.861 ± 0.041
4.535ValGly: 4.535 ± 0.056
1.64ValHis: 1.64 ± 0.031
5.07ValIle: 5.07 ± 0.058
3.659ValLys: 3.659 ± 0.051
7.306ValLeu: 7.306 ± 0.079
2.155ValMet: 2.155 ± 0.033
3.019ValAsn: 3.019 ± 0.046
3.069ValPro: 3.069 ± 0.04
3.003ValGln: 3.003 ± 0.039
3.446ValArg: 3.446 ± 0.045
4.803ValSer: 4.803 ± 0.055
4.56ValThr: 4.56 ± 0.061
5.013ValVal: 5.013 ± 0.058
0.86ValTrp: 0.86 ± 0.023
2.42ValTyr: 2.42 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
0.826TrpAla: 0.826 ± 0.021
0.104TrpCys: 0.104 ± 0.008
0.727TrpAsp: 0.727 ± 0.019
0.796TrpGlu: 0.796 ± 0.02
0.612TrpPhe: 0.612 ± 0.02
0.913TrpGly: 0.913 ± 0.024
0.267TrpHis: 0.267 ± 0.011
1.004TrpIle: 1.004 ± 0.027
0.741TrpLys: 0.741 ± 0.023
1.426TrpLeu: 1.426 ± 0.033
0.489TrpMet: 0.489 ± 0.017
0.776TrpAsn: 0.776 ± 0.019
0.372TrpPro: 0.372 ± 0.016
0.5TrpGln: 0.5 ± 0.019
0.584TrpArg: 0.584 ± 0.019
0.898TrpSer: 0.898 ± 0.022
0.764TrpThr: 0.764 ± 0.023
0.86TrpVal: 0.86 ± 0.024
0.197TrpTrp: 0.197 ± 0.009
0.418TrpTyr: 0.418 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.757TyrAla: 2.757 ± 0.04
0.277TyrCys: 0.277 ± 0.012
1.949TyrAsp: 1.949 ± 0.031
2.309TyrGlu: 2.309 ± 0.036
1.503TyrPhe: 1.503 ± 0.03
2.731TyrGly: 2.731 ± 0.038
0.786TyrHis: 0.786 ± 0.02
2.241TyrIle: 2.241 ± 0.037
1.524TyrLys: 1.524 ± 0.03
3.272TyrLeu: 3.272 ± 0.046
0.991TyrMet: 0.991 ± 0.021
1.446TyrAsn: 1.446 ± 0.027
1.589TyrPro: 1.589 ± 0.034
1.331TyrGln: 1.331 ± 0.029
2.115TyrArg: 2.115 ± 0.032
2.166TyrSer: 2.166 ± 0.038
2.039TyrThr: 2.039 ± 0.037
2.465TyrVal: 2.465 ± 0.034
0.473TyrTrp: 0.473 ± 0.015
1.33TyrTyr: 1.33 ± 0.03
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5798 proteins (1863486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski