Amino acid dipepetide frequency for Fictibacillus macauensis ZFHKF-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.598AlaAla: 6.598 ± 0.104
0.732AlaCys: 0.732 ± 0.024
3.16AlaAsp: 3.16 ± 0.062
4.221AlaGlu: 4.221 ± 0.065
3.684AlaPhe: 3.684 ± 0.06
5.445AlaGly: 5.445 ± 0.074
1.929AlaHis: 1.929 ± 0.047
5.944AlaIle: 5.944 ± 0.089
4.851AlaLys: 4.851 ± 0.077
8.771AlaLeu: 8.771 ± 0.099
2.284AlaMet: 2.284 ± 0.047
2.625AlaAsn: 2.625 ± 0.05
2.501AlaPro: 2.501 ± 0.055
3.128AlaGln: 3.128 ± 0.076
3.152AlaArg: 3.152 ± 0.056
4.972AlaSer: 4.972 ± 0.065
4.436AlaThr: 4.436 ± 0.074
5.846AlaVal: 5.846 ± 0.09
0.677AlaTrp: 0.677 ± 0.025
2.664AlaTyr: 2.664 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
0.547CysAla: 0.547 ± 0.025
0.134CysCys: 0.134 ± 0.011
0.43CysAsp: 0.43 ± 0.019
0.493CysGlu: 0.493 ± 0.022
0.365CysPhe: 0.365 ± 0.02
0.696CysGly: 0.696 ± 0.03
0.23CysHis: 0.23 ± 0.014
0.598CysIle: 0.598 ± 0.025
0.375CysLys: 0.375 ± 0.019
0.793CysLeu: 0.793 ± 0.029
0.219CysMet: 0.219 ± 0.014
0.309CysAsn: 0.309 ± 0.018
0.337CysPro: 0.337 ± 0.018
0.296CysGln: 0.296 ± 0.015
0.32CysArg: 0.32 ± 0.018
0.584CysSer: 0.584 ± 0.026
0.435CysThr: 0.435 ± 0.019
0.512CysVal: 0.512 ± 0.022
0.083CysTrp: 0.083 ± 0.008
0.307CysTyr: 0.307 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.481AspAla: 3.481 ± 0.063
0.404AspCys: 0.404 ± 0.02
2.262AspAsp: 2.262 ± 0.053
3.989AspGlu: 3.989 ± 0.069
2.12AspPhe: 2.12 ± 0.048
3.208AspGly: 3.208 ± 0.069
1.466AspHis: 1.466 ± 0.037
3.043AspIle: 3.043 ± 0.064
2.178AspLys: 2.178 ± 0.059
4.669AspLeu: 4.669 ± 0.067
1.158AspMet: 1.158 ± 0.037
1.291AspAsn: 1.291 ± 0.039
1.823AspPro: 1.823 ± 0.044
2.105AspGln: 2.105 ± 0.04
2.254AspArg: 2.254 ± 0.057
2.317AspSer: 2.317 ± 0.052
2.276AspThr: 2.276 ± 0.049
4.119AspVal: 4.119 ± 0.068
0.582AspTrp: 0.582 ± 0.022
2.038AspTyr: 2.038 ± 0.048
0.0AspXaa: 0.0 ± 0.0
Glu
6.01GluAla: 6.01 ± 0.083
0.415GluCys: 0.415 ± 0.02
3.293GluAsp: 3.293 ± 0.057
6.475GluGlu: 6.475 ± 0.11
1.977GluPhe: 1.977 ± 0.047
4.249GluGly: 4.249 ± 0.079
1.77GluHis: 1.77 ± 0.039
4.227GluIle: 4.227 ± 0.073
5.657GluLys: 5.657 ± 0.089
6.553GluLeu: 6.553 ± 0.099
2.2GluMet: 2.2 ± 0.045
2.797GluAsn: 2.797 ± 0.047
1.939GluPro: 1.939 ± 0.047
3.884GluGln: 3.884 ± 0.071
3.932GluArg: 3.932 ± 0.072
3.056GluSer: 3.056 ± 0.057
3.783GluThr: 3.783 ± 0.07
4.853GluVal: 4.853 ± 0.081
0.798GluTrp: 0.798 ± 0.032
1.817GluTyr: 1.817 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
3.121PheAla: 3.121 ± 0.051
0.385PheCys: 0.385 ± 0.017
1.986PheAsp: 1.986 ± 0.046
2.39PheGlu: 2.39 ± 0.05
2.287PhePhe: 2.287 ± 0.053
3.175PheGly: 3.175 ± 0.057
1.09PheHis: 1.09 ± 0.03
3.397PheIle: 3.397 ± 0.071
2.277PheLys: 2.277 ± 0.048
4.622PheLeu: 4.622 ± 0.1
1.153PheMet: 1.153 ± 0.036
1.569PheAsn: 1.569 ± 0.044
1.612PhePro: 1.612 ± 0.042
1.728PheGln: 1.728 ± 0.04
1.493PheArg: 1.493 ± 0.042
3.309PheSer: 3.309 ± 0.059
2.725PheThr: 2.725 ± 0.045
3.219PheVal: 3.219 ± 0.063
0.449PheTrp: 0.449 ± 0.022
1.7PheTyr: 1.7 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
5.209GlyAla: 5.209 ± 0.08
0.669GlyCys: 0.669 ± 0.028
3.046GlyAsp: 3.046 ± 0.056
4.179GlyGlu: 4.179 ± 0.067
3.223GlyPhe: 3.223 ± 0.051
4.846GlyGly: 4.846 ± 0.089
1.529GlyHis: 1.529 ± 0.037
5.481GlyIle: 5.481 ± 0.086
4.991GlyLys: 4.991 ± 0.075
6.523GlyLeu: 6.523 ± 0.088
2.188GlyMet: 2.188 ± 0.047
2.49GlyAsn: 2.49 ± 0.049
1.753GlyPro: 1.753 ± 0.046
2.219GlyGln: 2.219 ± 0.046
2.713GlyArg: 2.713 ± 0.059
4.079GlySer: 4.079 ± 0.071
4.247GlyThr: 4.247 ± 0.057
5.295GlyVal: 5.295 ± 0.083
0.747GlyTrp: 0.747 ± 0.029
2.814GlyTyr: 2.814 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
1.675HisAla: 1.675 ± 0.042
0.251HisCys: 0.251 ± 0.017
1.174HisAsp: 1.174 ± 0.032
1.835HisGlu: 1.835 ± 0.039
1.249HisPhe: 1.249 ± 0.039
1.619HisGly: 1.619 ± 0.039
0.963HisHis: 0.963 ± 0.034
1.6HisIle: 1.6 ± 0.04
1.233HisLys: 1.233 ± 0.036
2.416HisLeu: 2.416 ± 0.042
0.661HisMet: 0.661 ± 0.024
0.864HisAsn: 0.864 ± 0.027
1.195HisPro: 1.195 ± 0.036
0.949HisGln: 0.949 ± 0.029
1.096HisArg: 1.096 ± 0.035
1.485HisSer: 1.485 ± 0.034
1.343HisThr: 1.343 ± 0.029
1.946HisVal: 1.946 ± 0.041
0.252HisTrp: 0.252 ± 0.014
1.285HisTyr: 1.285 ± 0.04
0.0HisXaa: 0.0 ± 0.0
Ile
6.278IleAla: 6.278 ± 0.09
0.6IleCys: 0.6 ± 0.027
3.617IleAsp: 3.617 ± 0.061
4.837IleGlu: 4.837 ± 0.076
2.634IlePhe: 2.634 ± 0.063
5.673IleGly: 5.673 ± 0.081
1.622IleHis: 1.622 ± 0.045
4.834IleIle: 4.834 ± 0.093
3.678IleLys: 3.678 ± 0.068
6.01IleLeu: 6.01 ± 0.09
1.656IleMet: 1.656 ± 0.045
2.428IleAsn: 2.428 ± 0.048
2.88IlePro: 2.88 ± 0.055
2.523IleGln: 2.523 ± 0.049
2.686IleArg: 2.686 ± 0.058
4.427IleSer: 4.427 ± 0.064
4.417IleThr: 4.417 ± 0.063
5.571IleVal: 5.571 ± 0.079
0.584IleTrp: 0.584 ± 0.024
2.108IleTyr: 2.108 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
5.197LysAla: 5.197 ± 0.078
0.277LysCys: 0.277 ± 0.018
3.471LysAsp: 3.471 ± 0.067
6.671LysGlu: 6.671 ± 0.098
1.586LysPhe: 1.586 ± 0.042
4.643LysGly: 4.643 ± 0.078
1.467LysHis: 1.467 ± 0.04
3.637LysIle: 3.637 ± 0.066
6.572LysLys: 6.572 ± 0.117
5.398LysLeu: 5.398 ± 0.079
2.149LysMet: 2.149 ± 0.044
2.891LysAsn: 2.891 ± 0.054
2.182LysPro: 2.182 ± 0.053
3.373LysGln: 3.373 ± 0.065
3.544LysArg: 3.544 ± 0.054
3.164LysSer: 3.164 ± 0.058
3.507LysThr: 3.507 ± 0.066
4.503LysVal: 4.503 ± 0.072
0.765LysTrp: 0.765 ± 0.029
1.84LysTyr: 1.84 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
8.143LeuAla: 8.143 ± 0.103
0.945LeuCys: 0.945 ± 0.034
4.321LeuAsp: 4.321 ± 0.084
5.839LeuGlu: 5.839 ± 0.086
5.012LeuPhe: 5.012 ± 0.101
6.384LeuGly: 6.384 ± 0.094
2.727LeuHis: 2.727 ± 0.05
6.461LeuIle: 6.461 ± 0.096
6.171LeuLys: 6.171 ± 0.074
11.618LeuLeu: 11.618 ± 0.177
2.56LeuMet: 2.56 ± 0.051
3.447LeuAsn: 3.447 ± 0.059
4.321LeuPro: 4.321 ± 0.061
4.94LeuGln: 4.94 ± 0.075
4.215LeuArg: 4.215 ± 0.07
6.928LeuSer: 6.928 ± 0.09
5.899LeuThr: 5.899 ± 0.076
6.565LeuVal: 6.565 ± 0.081
0.924LeuTrp: 0.924 ± 0.027
3.452LeuTyr: 3.452 ± 0.07
0.0LeuXaa: 0.0 ± 0.0
Met
2.076MetAla: 2.076 ± 0.051
0.168MetCys: 0.168 ± 0.012
1.392MetAsp: 1.392 ± 0.038
1.935MetGlu: 1.935 ± 0.038
0.985MetPhe: 0.985 ± 0.031
1.734MetGly: 1.734 ± 0.043
0.524MetHis: 0.524 ± 0.025
2.143MetIle: 2.143 ± 0.045
2.815MetLys: 2.815 ± 0.053
2.657MetLeu: 2.657 ± 0.053
1.0MetMet: 1.0 ± 0.035
1.685MetAsn: 1.685 ± 0.043
1.048MetPro: 1.048 ± 0.033
0.962MetGln: 0.962 ± 0.032
1.15MetArg: 1.15 ± 0.031
1.718MetSer: 1.718 ± 0.04
1.779MetThr: 1.779 ± 0.041
1.671MetVal: 1.671 ± 0.038
0.181MetTrp: 0.181 ± 0.013
0.801MetTyr: 0.801 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
2.718AsnAla: 2.718 ± 0.053
0.255AsnCys: 0.255 ± 0.015
2.03AsnAsp: 2.03 ± 0.044
3.114AsnGlu: 3.114 ± 0.058
1.333AsnPhe: 1.333 ± 0.032
2.909AsnGly: 2.909 ± 0.051
0.986AsnHis: 0.986 ± 0.032
2.62AsnIle: 2.62 ± 0.055
2.541AsnLys: 2.541 ± 0.048
2.988AsnLeu: 2.988 ± 0.054
1.084AsnMet: 1.084 ± 0.032
1.628AsnAsn: 1.628 ± 0.054
1.757AsnPro: 1.757 ± 0.044
1.529AsnGln: 1.529 ± 0.041
1.738AsnArg: 1.738 ± 0.047
1.78AsnSer: 1.78 ± 0.039
1.946AsnThr: 1.946 ± 0.048
3.144AsnVal: 3.144 ± 0.057
0.441AsnTrp: 0.441 ± 0.02
1.328AsnTyr: 1.328 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
2.61ProAla: 2.61 ± 0.05
0.244ProCys: 0.244 ± 0.017
1.762ProAsp: 1.762 ± 0.043
2.548ProGlu: 2.548 ± 0.049
2.04ProPhe: 2.04 ± 0.047
2.277ProGly: 2.277 ± 0.057
1.059ProHis: 1.059 ± 0.035
2.48ProIle: 2.48 ± 0.049
2.274ProLys: 2.274 ± 0.045
3.926ProLeu: 3.926 ± 0.069
0.849ProMet: 0.849 ± 0.033
1.401ProAsn: 1.401 ± 0.04
1.183ProPro: 1.183 ± 0.047
1.294ProGln: 1.294 ± 0.039
1.202ProArg: 1.202 ± 0.035
2.545ProSer: 2.545 ± 0.054
2.195ProThr: 2.195 ± 0.042
2.763ProVal: 2.763 ± 0.049
0.369ProTrp: 0.369 ± 0.018
1.547ProTyr: 1.547 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
3.601GlnAla: 3.601 ± 0.066
0.273GlnCys: 0.273 ± 0.019
1.71GlnAsp: 1.71 ± 0.041
3.466GlnGlu: 3.466 ± 0.071
1.538GlnPhe: 1.538 ± 0.039
2.418GlnGly: 2.418 ± 0.053
1.206GlnHis: 1.206 ± 0.038
1.96GlnIle: 1.96 ± 0.043
3.042GlnLys: 3.042 ± 0.053
4.658GlnLeu: 4.658 ± 0.077
1.159GlnMet: 1.159 ± 0.036
1.407GlnAsn: 1.407 ± 0.035
1.536GlnPro: 1.536 ± 0.035
2.653GlnGln: 2.653 ± 0.067
2.12GlnArg: 2.12 ± 0.051
2.366GlnSer: 2.366 ± 0.05
2.313GlnThr: 2.313 ± 0.05
2.233GlnVal: 2.233 ± 0.042
0.573GlnTrp: 0.573 ± 0.026
1.309GlnTyr: 1.309 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
3.065ArgAla: 3.065 ± 0.066
0.312ArgCys: 0.312 ± 0.016
2.13ArgAsp: 2.13 ± 0.047
3.166ArgGlu: 3.166 ± 0.062
1.991ArgPhe: 1.991 ± 0.043
2.534ArgGly: 2.534 ± 0.057
1.003ArgHis: 1.003 ± 0.025
2.942ArgIle: 2.942 ± 0.052
3.161ArgLys: 3.161 ± 0.048
4.249ArgLeu: 4.249 ± 0.063
1.396ArgMet: 1.396 ± 0.039
1.905ArgAsn: 1.905 ± 0.046
1.386ArgPro: 1.386 ± 0.038
1.817ArgGln: 1.817 ± 0.041
2.034ArgArg: 2.034 ± 0.049
2.599ArgSer: 2.599 ± 0.053
2.201ArgThr: 2.201 ± 0.047
2.612ArgVal: 2.612 ± 0.053
0.492ArgTrp: 0.492 ± 0.023
1.9ArgTyr: 1.9 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
4.111SerAla: 4.111 ± 0.066
0.46SerCys: 0.46 ± 0.022
2.674SerAsp: 2.674 ± 0.047
3.614SerGlu: 3.614 ± 0.057
3.424SerPhe: 3.424 ± 0.064
4.245SerGly: 4.245 ± 0.068
1.473SerHis: 1.473 ± 0.042
4.645SerIle: 4.645 ± 0.06
3.667SerLys: 3.667 ± 0.059
6.92SerLeu: 6.92 ± 0.091
1.789SerMet: 1.789 ± 0.043
2.191SerAsn: 2.191 ± 0.044
2.226SerPro: 2.226 ± 0.046
2.176SerGln: 2.176 ± 0.043
2.345SerArg: 2.345 ± 0.053
4.302SerSer: 4.302 ± 0.078
3.223SerThr: 3.223 ± 0.056
4.379SerVal: 4.379 ± 0.069
0.692SerTrp: 0.692 ± 0.027
2.346SerTyr: 2.346 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
4.464ThrAla: 4.464 ± 0.071
0.414ThrCys: 0.414 ± 0.022
2.404ThrAsp: 2.404 ± 0.051
3.348ThrGlu: 3.348 ± 0.057
2.844ThrPhe: 2.844 ± 0.048
4.308ThrGly: 4.308 ± 0.151
1.259ThrHis: 1.259 ± 0.039
4.504ThrIle: 4.504 ± 0.067
3.592ThrLys: 3.592 ± 0.054
6.109ThrLeu: 6.109 ± 0.08
1.596ThrMet: 1.596 ± 0.037
2.23ThrAsn: 2.23 ± 0.05
2.528ThrPro: 2.528 ± 0.05
1.61ThrGln: 1.61 ± 0.039
1.931ThrArg: 1.931 ± 0.048
3.544ThrSer: 3.544 ± 0.056
3.388ThrThr: 3.388 ± 0.059
4.587ThrVal: 4.587 ± 0.066
0.564ThrTrp: 0.564 ± 0.022
2.204ThrTyr: 2.204 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
5.779ValAla: 5.779 ± 0.087
0.652ValCys: 0.652 ± 0.024
3.429ValAsp: 3.429 ± 0.059
4.193ValGlu: 4.193 ± 0.074
3.12ValPhe: 3.12 ± 0.066
4.685ValGly: 4.685 ± 0.075
1.594ValHis: 1.594 ± 0.042
5.555ValIle: 5.555 ± 0.077
4.817ValLys: 4.817 ± 0.075
7.375ValLeu: 7.375 ± 0.085
2.108ValMet: 2.108 ± 0.047
2.761ValAsn: 2.761 ± 0.059
2.813ValPro: 2.813 ± 0.056
2.608ValGln: 2.608 ± 0.043
2.816ValArg: 2.816 ± 0.058
4.879ValSer: 4.879 ± 0.068
4.906ValThr: 4.906 ± 0.067
5.177ValVal: 5.177 ± 0.071
0.674ValTrp: 0.674 ± 0.022
2.356ValTyr: 2.356 ± 0.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.585TrpAla: 0.585 ± 0.022
0.094TrpCys: 0.094 ± 0.009
0.514TrpAsp: 0.514 ± 0.024
0.64TrpGlu: 0.64 ± 0.024
0.513TrpPhe: 0.513 ± 0.025
0.661TrpGly: 0.661 ± 0.026
0.269TrpHis: 0.269 ± 0.016
0.774TrpIle: 0.774 ± 0.028
0.708TrpLys: 0.708 ± 0.026
1.302TrpLeu: 1.302 ± 0.036
0.345TrpMet: 0.345 ± 0.018
0.533TrpAsn: 0.533 ± 0.019
0.248TrpPro: 0.248 ± 0.015
0.464TrpGln: 0.464 ± 0.022
0.481TrpArg: 0.481 ± 0.021
0.643TrpSer: 0.643 ± 0.027
0.482TrpThr: 0.482 ± 0.02
0.657TrpVal: 0.657 ± 0.023
0.141TrpTrp: 0.141 ± 0.011
0.368TrpTyr: 0.368 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.353TyrAla: 2.353 ± 0.049
0.381TyrCys: 0.381 ± 0.02
1.978TyrAsp: 1.978 ± 0.053
2.731TyrGlu: 2.731 ± 0.055
1.763TyrPhe: 1.763 ± 0.044
2.532TyrGly: 2.532 ± 0.053
0.925TyrHis: 0.925 ± 0.028
2.215TyrIle: 2.215 ± 0.046
2.238TyrLys: 2.238 ± 0.051
3.315TyrLeu: 3.315 ± 0.056
0.87TyrMet: 0.87 ± 0.028
1.416TyrAsn: 1.416 ± 0.042
1.3TyrPro: 1.3 ± 0.036
1.283TyrGln: 1.283 ± 0.034
1.675TyrArg: 1.675 ± 0.045
2.203TyrSer: 2.203 ± 0.052
1.927TyrThr: 1.927 ± 0.047
2.645TyrVal: 2.645 ± 0.05
0.435TyrTrp: 0.435 ± 0.018
1.53TyrTyr: 1.53 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3850 proteins (1063458 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski