Amino acid dipepetide frequency for Virgibacillus sp. SK37

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.37AlaAla: 5.37 ± 0.092
0.57AlaCys: 0.57 ± 0.02
3.4AlaAsp: 3.4 ± 0.062
4.541AlaGlu: 4.541 ± 0.067
3.316AlaPhe: 3.316 ± 0.06
5.255AlaGly: 5.255 ± 0.078
1.256AlaHis: 1.256 ± 0.036
6.079AlaIle: 6.079 ± 0.076
4.636AlaLys: 4.636 ± 0.082
6.767AlaLeu: 6.767 ± 0.085
1.985AlaMet: 1.985 ± 0.042
2.835AlaAsn: 2.835 ± 0.053
1.944AlaPro: 1.944 ± 0.053
2.002AlaGln: 2.002 ± 0.042
2.325AlaArg: 2.325 ± 0.05
3.952AlaSer: 3.952 ± 0.07
3.628AlaThr: 3.628 ± 0.06
4.997AlaVal: 4.997 ± 0.072
0.649AlaTrp: 0.649 ± 0.028
2.416AlaTyr: 2.416 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.38CysAla: 0.38 ± 0.021
0.085CysCys: 0.085 ± 0.011
0.356CysAsp: 0.356 ± 0.02
0.422CysGlu: 0.422 ± 0.021
0.284CysPhe: 0.284 ± 0.019
0.583CysGly: 0.583 ± 0.025
0.169CysHis: 0.169 ± 0.013
0.495CysIle: 0.495 ± 0.022
0.399CysLys: 0.399 ± 0.02
0.573CysLeu: 0.573 ± 0.026
0.196CysMet: 0.196 ± 0.013
0.299CysAsn: 0.299 ± 0.018
0.309CysPro: 0.309 ± 0.018
0.19CysGln: 0.19 ± 0.013
0.241CysArg: 0.241 ± 0.015
0.482CysSer: 0.482 ± 0.019
0.383CysThr: 0.383 ± 0.018
0.378CysVal: 0.378 ± 0.02
0.058CysTrp: 0.058 ± 0.008
0.232CysTyr: 0.232 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.18AspAla: 3.18 ± 0.054
0.356AspCys: 0.356 ± 0.021
2.517AspAsp: 2.517 ± 0.062
4.377AspGlu: 4.377 ± 0.074
2.532AspPhe: 2.532 ± 0.059
3.321AspGly: 3.321 ± 0.074
1.139AspHis: 1.139 ± 0.032
4.441AspIle: 4.441 ± 0.064
3.761AspLys: 3.761 ± 0.071
4.957AspLeu: 4.957 ± 0.066
1.516AspMet: 1.516 ± 0.04
2.176AspAsn: 2.176 ± 0.051
2.019AspPro: 2.019 ± 0.041
2.129AspGln: 2.129 ± 0.047
2.153AspArg: 2.153 ± 0.043
2.615AspSer: 2.615 ± 0.056
2.629AspThr: 2.629 ± 0.053
3.575AspVal: 3.575 ± 0.059
0.641AspTrp: 0.641 ± 0.03
2.284AspTyr: 2.284 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
5.417GluAla: 5.417 ± 0.096
0.318GluCys: 0.318 ± 0.019
4.153GluAsp: 4.153 ± 0.07
7.764GluGlu: 7.764 ± 0.122
2.487GluPhe: 2.487 ± 0.046
4.331GluGly: 4.331 ± 0.079
1.554GluHis: 1.554 ± 0.043
5.841GluIle: 5.841 ± 0.082
7.33GluLys: 7.33 ± 0.103
7.147GluLeu: 7.147 ± 0.091
2.378GluMet: 2.378 ± 0.051
4.163GluAsn: 4.163 ± 0.072
1.919GluPro: 1.919 ± 0.047
3.42GluGln: 3.42 ± 0.064
3.345GluArg: 3.345 ± 0.057
3.49GluSer: 3.49 ± 0.062
3.824GluThr: 3.824 ± 0.06
5.233GluVal: 5.233 ± 0.08
0.863GluTrp: 0.863 ± 0.03
2.347GluTyr: 2.347 ± 0.056
0.0GluXaa: 0.0 ± 0.0
Phe
2.87PheAla: 2.87 ± 0.054
0.312PheCys: 0.312 ± 0.017
2.257PheAsp: 2.257 ± 0.045
2.673PheGlu: 2.673 ± 0.054
2.311PhePhe: 2.311 ± 0.059
3.219PheGly: 3.219 ± 0.07
1.013PheHis: 1.013 ± 0.032
4.306PheIle: 4.306 ± 0.087
2.397PheLys: 2.397 ± 0.045
4.565PheLeu: 4.565 ± 0.087
1.217PheMet: 1.217 ± 0.04
1.908PheAsn: 1.908 ± 0.044
1.621PhePro: 1.621 ± 0.037
1.656PheGln: 1.656 ± 0.046
1.482PheArg: 1.482 ± 0.041
3.301PheSer: 3.301 ± 0.066
2.656PheThr: 2.656 ± 0.05
2.964PheVal: 2.964 ± 0.065
0.468PheTrp: 0.468 ± 0.024
1.77PheTyr: 1.77 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
4.658GlyAla: 4.658 ± 0.085
0.534GlyCys: 0.534 ± 0.023
3.261GlyAsp: 3.261 ± 0.072
4.669GlyGlu: 4.669 ± 0.079
3.342GlyPhe: 3.342 ± 0.055
4.735GlyGly: 4.735 ± 0.085
1.272GlyHis: 1.272 ± 0.041
5.913GlyIle: 5.913 ± 0.08
5.286GlyLys: 5.286 ± 0.067
6.345GlyLeu: 6.345 ± 0.095
2.182GlyMet: 2.182 ± 0.044
2.907GlyAsn: 2.907 ± 0.058
1.613GlyPro: 1.613 ± 0.041
2.017GlyGln: 2.017 ± 0.052
2.446GlyArg: 2.446 ± 0.055
3.837GlySer: 3.837 ± 0.065
3.9GlyThr: 3.9 ± 0.064
4.994GlyVal: 4.994 ± 0.078
0.817GlyTrp: 0.817 ± 0.032
2.846GlyTyr: 2.846 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
1.441HisAla: 1.441 ± 0.039
0.168HisCys: 0.168 ± 0.012
0.985HisAsp: 0.985 ± 0.033
1.284HisGlu: 1.284 ± 0.033
1.079HisPhe: 1.079 ± 0.032
1.419HisGly: 1.419 ± 0.041
0.622HisHis: 0.622 ± 0.026
1.573HisIle: 1.573 ± 0.039
1.076HisLys: 1.076 ± 0.034
2.035HisLeu: 2.035 ± 0.047
0.572HisMet: 0.572 ± 0.023
0.814HisAsn: 0.814 ± 0.031
1.1HisPro: 1.1 ± 0.032
0.828HisGln: 0.828 ± 0.025
0.822HisArg: 0.822 ± 0.028
1.13HisSer: 1.13 ± 0.036
1.147HisThr: 1.147 ± 0.033
1.464HisVal: 1.464 ± 0.039
0.214HisTrp: 0.214 ± 0.015
0.843HisTyr: 0.843 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
6.0IleAla: 6.0 ± 0.094
0.635IleCys: 0.635 ± 0.022
4.451IleAsp: 4.451 ± 0.07
5.377IleGlu: 5.377 ± 0.077
3.401IlePhe: 3.401 ± 0.069
6.138IleGly: 6.138 ± 0.1
1.742IleHis: 1.742 ± 0.045
6.731IleIle: 6.731 ± 0.104
4.945IleLys: 4.945 ± 0.08
7.303IleLeu: 7.303 ± 0.111
1.986IleMet: 1.986 ± 0.039
3.786IleAsn: 3.786 ± 0.065
3.347IlePro: 3.347 ± 0.063
3.033IleGln: 3.033 ± 0.056
3.112IleArg: 3.112 ± 0.051
5.235IleSer: 5.235 ± 0.069
4.779IleThr: 4.779 ± 0.071
5.49IleVal: 5.49 ± 0.083
0.689IleTrp: 0.689 ± 0.026
2.578IleTyr: 2.578 ± 0.052
0.0IleXaa: 0.0 ± 0.0
Lys
4.652LysAla: 4.652 ± 0.075
0.3LysCys: 0.3 ± 0.017
4.219LysAsp: 4.219 ± 0.069
7.854LysGlu: 7.854 ± 0.114
1.89LysPhe: 1.89 ± 0.046
4.413LysGly: 4.413 ± 0.07
1.506LysHis: 1.506 ± 0.041
4.741LysIle: 4.741 ± 0.075
6.291LysLys: 6.291 ± 0.096
6.122LysLeu: 6.122 ± 0.088
2.273LysMet: 2.273 ± 0.041
3.747LysAsn: 3.747 ± 0.057
2.222LysPro: 2.222 ± 0.052
3.71LysGln: 3.71 ± 0.064
3.229LysArg: 3.229 ± 0.053
3.627LysSer: 3.627 ± 0.067
3.627LysThr: 3.627 ± 0.062
4.522LysVal: 4.522 ± 0.069
0.791LysTrp: 0.791 ± 0.026
2.279LysTyr: 2.279 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
6.86LeuAla: 6.86 ± 0.089
0.578LeuCys: 0.578 ± 0.025
4.823LeuAsp: 4.823 ± 0.076
6.599LeuGlu: 6.599 ± 0.078
5.012LeuPhe: 5.012 ± 0.103
6.282LeuGly: 6.282 ± 0.091
1.971LeuHis: 1.971 ± 0.039
7.506LeuIle: 7.506 ± 0.115
6.323LeuLys: 6.323 ± 0.08
10.057LeuLeu: 10.057 ± 0.138
2.408LeuMet: 2.408 ± 0.049
4.234LeuAsn: 4.234 ± 0.067
3.969LeuPro: 3.969 ± 0.062
3.659LeuGln: 3.659 ± 0.065
3.429LeuArg: 3.429 ± 0.068
6.334LeuSer: 6.334 ± 0.083
5.48LeuThr: 5.48 ± 0.079
6.145LeuVal: 6.145 ± 0.07
0.759LeuTrp: 0.759 ± 0.027
3.118LeuTyr: 3.118 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
2.081MetAla: 2.081 ± 0.041
0.143MetCys: 0.143 ± 0.012
1.652MetAsp: 1.652 ± 0.043
2.419MetGlu: 2.419 ± 0.05
1.061MetPhe: 1.061 ± 0.034
1.798MetGly: 1.798 ± 0.046
0.509MetHis: 0.509 ± 0.022
2.017MetIle: 2.017 ± 0.051
2.536MetLys: 2.536 ± 0.051
2.618MetLeu: 2.618 ± 0.055
0.832MetMet: 0.832 ± 0.029
1.623MetAsn: 1.623 ± 0.042
1.02MetPro: 1.02 ± 0.034
1.004MetGln: 1.004 ± 0.029
1.126MetArg: 1.126 ± 0.037
1.615MetSer: 1.615 ± 0.041
1.558MetThr: 1.558 ± 0.041
1.838MetVal: 1.838 ± 0.04
0.212MetTrp: 0.212 ± 0.017
0.793MetTyr: 0.793 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
2.656AsnAla: 2.656 ± 0.053
0.321AsnCys: 0.321 ± 0.019
2.387AsnAsp: 2.387 ± 0.057
3.924AsnGlu: 3.924 ± 0.068
1.777AsnPhe: 1.777 ± 0.047
3.318AsnGly: 3.318 ± 0.057
1.014AsnHis: 1.014 ± 0.033
3.504AsnIle: 3.504 ± 0.058
3.791AsnLys: 3.791 ± 0.061
3.935AsnLeu: 3.935 ± 0.065
1.393AsnMet: 1.393 ± 0.036
2.55AsnAsn: 2.55 ± 0.062
2.081AsnPro: 2.081 ± 0.05
2.071AsnGln: 2.071 ± 0.051
1.983AsnArg: 1.983 ± 0.045
2.418AsnSer: 2.418 ± 0.05
2.466AsnThr: 2.466 ± 0.046
2.985AsnVal: 2.985 ± 0.059
0.565AsnTrp: 0.565 ± 0.026
1.92AsnTyr: 1.92 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
2.23ProAla: 2.23 ± 0.046
0.199ProCys: 0.199 ± 0.014
1.998ProAsp: 1.998 ± 0.053
2.99ProGlu: 2.99 ± 0.056
1.898ProPhe: 1.898 ± 0.042
2.254ProGly: 2.254 ± 0.05
0.778ProHis: 0.778 ± 0.03
2.817ProIle: 2.817 ± 0.052
2.132ProLys: 2.132 ± 0.04
3.283ProLeu: 3.283 ± 0.05
0.859ProMet: 0.859 ± 0.03
1.61ProAsn: 1.61 ± 0.04
0.992ProPro: 0.992 ± 0.036
1.074ProGln: 1.074 ± 0.034
1.061ProArg: 1.061 ± 0.034
2.223ProSer: 2.223 ± 0.047
2.015ProThr: 2.015 ± 0.045
2.837ProVal: 2.837 ± 0.059
0.358ProTrp: 0.358 ± 0.02
1.406ProTyr: 1.406 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
2.787GlnAla: 2.787 ± 0.061
0.174GlnCys: 0.174 ± 0.016
1.776GlnAsp: 1.776 ± 0.043
3.166GlnGlu: 3.166 ± 0.058
1.542GlnPhe: 1.542 ± 0.041
2.204GlnGly: 2.204 ± 0.051
0.81GlnHis: 0.81 ± 0.025
2.703GlnIle: 2.703 ± 0.047
2.689GlnLys: 2.689 ± 0.061
4.295GlnLeu: 4.295 ± 0.079
1.161GlnMet: 1.161 ± 0.037
1.583GlnAsn: 1.583 ± 0.041
1.284GlnPro: 1.284 ± 0.034
1.962GlnGln: 1.962 ± 0.049
1.547GlnArg: 1.547 ± 0.04
2.093GlnSer: 2.093 ± 0.043
2.042GlnThr: 2.042 ± 0.053
2.431GlnVal: 2.431 ± 0.056
0.415GlnTrp: 0.415 ± 0.021
1.147GlnTyr: 1.147 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
2.286ArgAla: 2.286 ± 0.05
0.225ArgCys: 0.225 ± 0.013
1.942ArgAsp: 1.942 ± 0.042
3.12ArgGlu: 3.12 ± 0.065
1.82ArgPhe: 1.82 ± 0.043
2.209ArgGly: 2.209 ± 0.054
0.705ArgHis: 0.705 ± 0.027
2.969ArgIle: 2.969 ± 0.059
3.28ArgLys: 3.28 ± 0.061
3.723ArgLeu: 3.723 ± 0.066
1.262ArgMet: 1.262 ± 0.031
2.014ArgAsn: 2.014 ± 0.045
1.176ArgPro: 1.176 ± 0.035
1.435ArgGln: 1.435 ± 0.04
1.744ArgArg: 1.744 ± 0.048
2.095ArgSer: 2.095 ± 0.045
1.964ArgThr: 1.964 ± 0.05
2.477ArgVal: 2.477 ± 0.052
0.384ArgTrp: 0.384 ± 0.02
1.465ArgTyr: 1.465 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
3.6SerAla: 3.6 ± 0.06
0.356SerCys: 0.356 ± 0.021
2.896SerAsp: 2.896 ± 0.058
3.913SerGlu: 3.913 ± 0.068
3.109SerPhe: 3.109 ± 0.057
4.309SerGly: 4.309 ± 0.069
1.192SerHis: 1.192 ± 0.034
5.262SerIle: 5.262 ± 0.08
3.942SerLys: 3.942 ± 0.07
5.704SerLeu: 5.704 ± 0.076
1.708SerMet: 1.708 ± 0.04
2.786SerAsn: 2.786 ± 0.056
2.1SerPro: 2.1 ± 0.044
1.934SerGln: 1.934 ± 0.046
2.188SerArg: 2.188 ± 0.051
3.845SerSer: 3.845 ± 0.077
3.134SerThr: 3.134 ± 0.049
3.91SerVal: 3.91 ± 0.066
0.568SerTrp: 0.568 ± 0.027
2.362SerTyr: 2.362 ± 0.053
0.0SerXaa: 0.0 ± 0.0
Thr
3.912ThrAla: 3.912 ± 0.067
0.338ThrCys: 0.338 ± 0.016
2.919ThrAsp: 2.919 ± 0.054
3.986ThrGlu: 3.986 ± 0.061
2.636ThrPhe: 2.636 ± 0.061
4.213ThrGly: 4.213 ± 0.067
1.017ThrHis: 1.017 ± 0.031
4.654ThrIle: 4.654 ± 0.076
3.525ThrLys: 3.525 ± 0.066
4.945ThrLeu: 4.945 ± 0.07
1.349ThrMet: 1.349 ± 0.036
2.655ThrAsn: 2.655 ± 0.052
2.197ThrPro: 2.197 ± 0.048
1.495ThrGln: 1.495 ± 0.039
1.714ThrArg: 1.714 ± 0.038
3.424ThrSer: 3.424 ± 0.058
3.081ThrThr: 3.081 ± 0.057
4.087ThrVal: 4.087 ± 0.063
0.544ThrTrp: 0.544 ± 0.025
2.076ThrTyr: 2.076 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
4.805ValAla: 4.805 ± 0.079
0.565ValCys: 0.565 ± 0.021
3.757ValAsp: 3.757 ± 0.079
4.923ValGlu: 4.923 ± 0.086
3.077ValPhe: 3.077 ± 0.063
4.497ValGly: 4.497 ± 0.072
1.315ValHis: 1.315 ± 0.042
5.696ValIle: 5.696 ± 0.085
4.541ValLys: 4.541 ± 0.076
6.482ValLeu: 6.482 ± 0.089
1.882ValMet: 1.882 ± 0.054
3.258ValAsn: 3.258 ± 0.057
2.442ValPro: 2.442 ± 0.051
2.247ValGln: 2.247 ± 0.048
2.409ValArg: 2.409 ± 0.044
4.432ValSer: 4.432 ± 0.067
3.961ValThr: 3.961 ± 0.066
4.74ValVal: 4.74 ± 0.074
0.618ValTrp: 0.618 ± 0.025
2.311ValTyr: 2.311 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.601TrpAla: 0.601 ± 0.027
0.066TrpCys: 0.066 ± 0.008
0.54TrpAsp: 0.54 ± 0.024
0.737TrpGlu: 0.737 ± 0.028
0.5TrpPhe: 0.5 ± 0.025
0.665TrpGly: 0.665 ± 0.026
0.157TrpHis: 0.157 ± 0.012
0.812TrpIle: 0.812 ± 0.033
0.799TrpLys: 0.799 ± 0.028
1.114TrpLeu: 1.114 ± 0.033
0.338TrpMet: 0.338 ± 0.016
0.499TrpAsn: 0.499 ± 0.023
0.246TrpPro: 0.246 ± 0.016
0.394TrpGln: 0.394 ± 0.023
0.378TrpArg: 0.378 ± 0.02
0.587TrpSer: 0.587 ± 0.026
0.522TrpThr: 0.522 ± 0.024
0.684TrpVal: 0.684 ± 0.027
0.148TrpTrp: 0.148 ± 0.013
0.378TrpTyr: 0.378 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.134TyrAla: 2.134 ± 0.044
0.273TyrCys: 0.273 ± 0.018
2.031TyrAsp: 2.031 ± 0.054
2.587TyrGlu: 2.587 ± 0.054
1.894TyrPhe: 1.894 ± 0.044
2.414TyrGly: 2.414 ± 0.051
0.875TyrHis: 0.875 ± 0.031
2.698TyrIle: 2.698 ± 0.059
2.333TyrLys: 2.333 ± 0.045
3.575TyrLeu: 3.575 ± 0.052
0.957TyrMet: 0.957 ± 0.031
1.611TyrAsn: 1.611 ± 0.04
1.415TyrPro: 1.415 ± 0.041
1.552TyrGln: 1.552 ± 0.045
1.56TyrArg: 1.56 ± 0.037
2.083TyrSer: 2.083 ± 0.042
1.969TyrThr: 1.969 ± 0.041
2.207TyrVal: 2.207 ± 0.05
0.401TyrTrp: 0.401 ± 0.021
1.536TyrTyr: 1.536 ± 0.044
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3815 proteins (1046293 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski