Amino acid dipeptide frequency for Sinocyclocheilus anshuiensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
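As a rough illustration of the counting described above, the following sketch tallies dipeptides in a list of protein sequences and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of overlapping windows, the normalization by the total number of dipeptides, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping windows of length 2 are counted, pairs never
    span two proteins, and counts are normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown letters to X before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
uniform_expectation = 1000.0 / (len(STANDARD) ** 2)

# Hypothetical toy input, not taken from the proteome.
toy = ["MKLLA", "LLSSB"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    flag = "over" if value > uniform_expectation else "under"
    print(f"{pair}: {value:.1f} ({flag})")
```

On real proteomes the comparison against 2.5‰ is what the red and blue marking in the table expresses.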

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
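The unit conversion can be checked with a short worked example. The sketch below converts two values from the table (LeuLeu and TrpTrp) to fractions and compares them with the uniform expectation of 2.5‰; the helper names are hypothetical and chosen only for this illustration.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PERMILLE

leu_leu = 10.139  # LeuLeu value from the table, in per mille
trp_trp = 0.202   # TrpTrp value from the table, in per mille

for name, value in (("LeuLeu", leu_leu), ("TrpTrp", trp_trp)):
    print(name,
          f"fraction={permille_to_fraction(value):.6f}",
          f"percent={permille_to_fraction(value) * 100:.4f}",
          f"fold={fold_vs_uniform(value):.2f}")
```

A fold value above 1 corresponds to an overrepresented dipeptide, and a value below 1 to an underrepresented one.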

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.36 ± 0.016
AlaCys: 1.317 ± 0.005
AlaAsp: 3.104 ± 0.008
AlaGlu: 4.473 ± 0.012
AlaPhe: 2.437 ± 0.006
AlaGly: 3.949 ± 0.012
AlaHis: 1.529 ± 0.005
AlaIle: 2.934 ± 0.008
AlaLys: 3.415 ± 0.011
AlaLeu: 6.542 ± 0.015
AlaMet: 1.587 ± 0.005
AlaAsn: 2.198 ± 0.007
AlaPro: 3.04 ± 0.011
AlaGln: 2.9 ± 0.008
AlaArg: 3.017 ± 0.008
AlaSer: 5.017 ± 0.011
AlaThr: 3.237 ± 0.009
AlaVal: 4.846 ± 0.011
AlaTrp: 0.641 ± 0.003
AlaTyr: 1.584 ± 0.006
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.24 ± 0.006
CysCys: 0.641 ± 0.004
CysAsp: 1.161 ± 0.007
CysGlu: 1.314 ± 0.007
CysPhe: 1.011 ± 0.004
CysGly: 1.482 ± 0.007
CysHis: 0.662 ± 0.004
CysIle: 1.124 ± 0.005
CysLys: 1.266 ± 0.006
CysLeu: 2.305 ± 0.008
CysMet: 0.528 ± 0.003
CysAsn: 0.876 ± 0.005
CysPro: 1.215 ± 0.007
CysGln: 1.034 ± 0.007
CysArg: 1.256 ± 0.006
CysSer: 2.061 ± 0.008
CysThr: 1.227 ± 0.006
CysVal: 1.71 ± 0.008
CysTrp: 0.299 ± 0.002
CysTyr: 0.648 ± 0.004
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.005 ± 0.008
AspCys: 1.152 ± 0.006
AspAsp: 3.069 ± 0.011
AspGlu: 3.847 ± 0.01
AspPhe: 2.24 ± 0.006
AspGly: 3.597 ± 0.012
AspHis: 1.222 ± 0.004
AspIle: 2.969 ± 0.008
AspLys: 2.831 ± 0.008
AspLeu: 5.188 ± 0.01
AspMet: 1.346 ± 0.005
AspAsn: 1.959 ± 0.007
AspPro: 2.841 ± 0.009
AspGln: 1.958 ± 0.006
AspArg: 2.633 ± 0.008
AspSer: 4.319 ± 0.011
AspThr: 2.665 ± 0.007
AspVal: 3.436 ± 0.009
AspTrp: 0.704 ± 0.004
AspTyr: 1.633 ± 0.006
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.52 ± 0.013
GluCys: 1.325 ± 0.007
GluAsp: 4.37 ± 0.011
GluGlu: 7.194 ± 0.026
GluPhe: 2.152 ± 0.006
GluGly: 3.967 ± 0.01
GluHis: 1.542 ± 0.005
GluIle: 3.191 ± 0.008
GluLys: 4.929 ± 0.016
GluLeu: 6.359 ± 0.014
GluMet: 1.872 ± 0.006
GluAsn: 2.943 ± 0.007
GluPro: 2.645 ± 0.008
GluGln: 3.122 ± 0.01
GluArg: 4.224 ± 0.013
GluSer: 4.359 ± 0.011
GluThr: 3.401 ± 0.008
GluVal: 4.332 ± 0.009
GluTrp: 0.717 ± 0.004
GluTyr: 1.761 ± 0.005
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.036 ± 0.006
PheCys: 1.028 ± 0.004
PheAsp: 1.903 ± 0.006
PheGlu: 2.107 ± 0.006
PhePhe: 1.808 ± 0.006
PheGly: 2.298 ± 0.007
PheHis: 1.088 ± 0.004
PheIle: 2.166 ± 0.008
PheLys: 2.013 ± 0.006
PheLeu: 4.102 ± 0.013
PheMet: 0.895 ± 0.004
PheAsn: 1.619 ± 0.005
PhePro: 1.827 ± 0.006
PheGln: 1.727 ± 0.006
PheArg: 1.93 ± 0.006
PheSer: 3.526 ± 0.008
PheThr: 2.396 ± 0.007
PheVal: 2.367 ± 0.007
PheTrp: 0.497 ± 0.003
PheTyr: 1.319 ± 0.005
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.68 ± 0.012
GlyCys: 1.199 ± 0.005
GlyAsp: 3.15 ± 0.008
GlyGlu: 3.876 ± 0.011
GlyPhe: 2.482 ± 0.008
GlyGly: 4.439 ± 0.015
GlyHis: 1.627 ± 0.006
GlyIle: 2.793 ± 0.009
GlyLys: 3.707 ± 0.01
GlyLeu: 5.329 ± 0.014
GlyMet: 1.535 ± 0.007
GlyAsn: 2.444 ± 0.008
GlyPro: 2.93 ± 0.017
GlyGln: 2.662 ± 0.008
GlyArg: 3.346 ± 0.009
GlySer: 5.204 ± 0.013
GlyThr: 3.341 ± 0.011
GlyVal: 3.881 ± 0.009
GlyTrp: 0.759 ± 0.004
GlyTyr: 1.884 ± 0.007
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.363 ± 0.005
HisCys: 0.789 ± 0.004
HisAsp: 1.045 ± 0.005
HisGlu: 1.353 ± 0.006
HisPhe: 1.144 ± 0.004
HisGly: 1.543 ± 0.007
HisHis: 0.934 ± 0.006
HisIle: 1.459 ± 0.005
HisLys: 1.393 ± 0.005
HisLeu: 2.771 ± 0.008
HisMet: 0.709 ± 0.004
HisAsn: 1.079 ± 0.005
HisPro: 1.5 ± 0.006
HisGln: 1.215 ± 0.005
HisArg: 1.56 ± 0.005
HisSer: 2.385 ± 0.007
HisThr: 1.675 ± 0.008
HisVal: 1.536 ± 0.006
HisTrp: 0.347 ± 0.002
HisTyr: 0.911 ± 0.004
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.798 ± 0.008
IleCys: 1.179 ± 0.005
IleAsp: 2.366 ± 0.007
IleGlu: 2.727 ± 0.008
IlePhe: 2.052 ± 0.007
IleGly: 2.449 ± 0.007
IleHis: 1.411 ± 0.005
IleIle: 2.739 ± 0.008
IleLys: 2.888 ± 0.007
IleLeu: 4.699 ± 0.011
IleMet: 1.233 ± 0.005
IleAsn: 2.209 ± 0.007
IlePro: 2.543 ± 0.007
IleGln: 2.378 ± 0.008
IleArg: 2.629 ± 0.008
IleSer: 4.098 ± 0.009
IleThr: 2.993 ± 0.009
IleVal: 2.829 ± 0.008
IleTrp: 0.533 ± 0.004
IleTyr: 1.62 ± 0.006
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.79 ± 0.011
LysCys: 1.15 ± 0.005
LysAsp: 3.426 ± 0.009
LysGlu: 4.806 ± 0.015
LysPhe: 1.809 ± 0.006
LysGly: 3.242 ± 0.008
LysHis: 1.556 ± 0.006
LysIle: 2.88 ± 0.008
LysLys: 4.564 ± 0.016
LysLeu: 5.386 ± 0.012
LysMet: 1.569 ± 0.006
LysAsn: 2.542 ± 0.007
LysPro: 2.866 ± 0.01
LysGln: 2.721 ± 0.007
LysArg: 3.443 ± 0.008
LysSer: 3.962 ± 0.009
LysThr: 3.287 ± 0.008
LysVal: 3.684 ± 0.01
LysTrp: 0.622 ± 0.003
LysTyr: 1.676 ± 0.006
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 6.035 ± 0.013
LeuCys: 2.324 ± 0.007
LeuAsp: 4.998 ± 0.009
LeuGlu: 6.66 ± 0.018
LeuPhe: 3.733 ± 0.011
LeuGly: 5.061 ± 0.012
LeuHis: 2.797 ± 0.008
LeuIle: 4.261 ± 0.01
LeuLys: 6.07 ± 0.011
LeuLeu: 10.139 ± 0.022
LeuMet: 2.278 ± 0.007
LeuAsn: 3.951 ± 0.008
LeuPro: 4.966 ± 0.012
LeuGln: 5.534 ± 0.014
LeuArg: 5.544 ± 0.011
LeuSer: 8.085 ± 0.014
LeuThr: 5.246 ± 0.011
LeuVal: 5.484 ± 0.012
LeuTrp: 1.077 ± 0.006
LeuTyr: 2.84 ± 0.007
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.9 ± 0.007
MetCys: 0.533 ± 0.003
MetAsp: 1.492 ± 0.005
MetGlu: 2.041 ± 0.006
MetPhe: 0.947 ± 0.004
MetGly: 1.473 ± 0.005
MetHis: 0.57 ± 0.003
MetIle: 1.001 ± 0.004
MetLys: 1.586 ± 0.005
MetLeu: 2.259 ± 0.007
MetMet: 0.731 ± 0.003
MetAsn: 1.023 ± 0.004
MetPro: 1.146 ± 0.005
MetGln: 1.089 ± 0.005
MetArg: 1.26 ± 0.005
MetSer: 1.888 ± 0.006
MetThr: 1.312 ± 0.005
MetVal: 1.62 ± 0.005
MetTrp: 0.279 ± 0.002
MetTyr: 0.697 ± 0.003
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.318 ± 0.007
AsnCys: 0.929 ± 0.005
AsnAsp: 1.81 ± 0.007
AsnGlu: 2.342 ± 0.007
AsnPhe: 1.514 ± 0.005
AsnGly: 2.812 ± 0.01
AsnHis: 1.047 ± 0.005
AsnIle: 2.45 ± 0.007
AsnLys: 2.387 ± 0.006
AsnLeu: 3.791 ± 0.009
AsnMet: 1.121 ± 0.005
AsnAsn: 1.828 ± 0.007
AsnPro: 2.287 ± 0.008
AsnGln: 1.774 ± 0.005
AsnArg: 2.039 ± 0.006
AsnSer: 3.277 ± 0.008
AsnThr: 2.353 ± 0.007
AsnVal: 2.463 ± 0.006
AsnTrp: 0.476 ± 0.003
AsnTyr: 1.248 ± 0.005
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.55 ± 0.012
ProCys: 1.006 ± 0.006
ProAsp: 2.831 ± 0.007
ProGlu: 3.673 ± 0.009
ProPhe: 1.838 ± 0.007
ProGly: 3.525 ± 0.018
ProHis: 1.419 ± 0.006
ProIle: 1.979 ± 0.006
ProLys: 2.579 ± 0.01
ProLeu: 4.588 ± 0.011
ProMet: 1.051 ± 0.004
ProAsn: 1.968 ± 0.006
ProPro: 4.476 ± 0.022
ProGln: 2.441 ± 0.01
ProArg: 2.448 ± 0.008
ProSer: 4.924 ± 0.015
ProThr: 2.841 ± 0.01
ProVal: 3.595 ± 0.01
ProTrp: 0.537 ± 0.003
ProTyr: 1.416 ± 0.005
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 3.083 ± 0.01
GlnCys: 1.065 ± 0.007
GlnAsp: 2.356 ± 0.007
GlnGlu: 3.434 ± 0.011
GlnPhe: 1.518 ± 0.006
GlnGly: 2.553 ± 0.008
GlnHis: 1.357 ± 0.005
GlnIle: 2.153 ± 0.007
GlnLys: 2.776 ± 0.009
GlnLeu: 4.49 ± 0.013
GlnMet: 1.237 ± 0.005
GlnAsn: 1.912 ± 0.006
GlnPro: 2.306 ± 0.008
GlnGln: 3.032 ± 0.018
GlnArg: 2.899 ± 0.008
GlnSer: 3.367 ± 0.01
GlnThr: 2.574 ± 0.007
GlnVal: 2.801 ± 0.008
GlnTrp: 0.57 ± 0.003
GlnTyr: 1.355 ± 0.005
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.278 ± 0.009
ArgCys: 1.229 ± 0.007
ArgAsp: 2.926 ± 0.008
ArgGlu: 3.822 ± 0.012
ArgPhe: 2.031 ± 0.005
ArgGly: 3.134 ± 0.01
ArgHis: 1.506 ± 0.006
ArgIle: 2.572 ± 0.007
ArgLys: 3.589 ± 0.008
ArgLeu: 5.231 ± 0.009
ArgMet: 1.349 ± 0.005
ArgAsn: 2.206 ± 0.006
ArgPro: 2.632 ± 0.008
ArgGln: 2.58 ± 0.008
ArgArg: 3.818 ± 0.012
ArgSer: 4.133 ± 0.012
ArgThr: 2.774 ± 0.007
ArgVal: 3.311 ± 0.007
ArgTrp: 0.662 ± 0.003
ArgTyr: 1.598 ± 0.005
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.188 ± 0.011
SerCys: 1.885 ± 0.008
SerAsp: 4.268 ± 0.012
SerGlu: 4.937 ± 0.012
SerPhe: 3.172 ± 0.009
SerGly: 5.298 ± 0.013
SerHis: 2.135 ± 0.007
SerIle: 3.6 ± 0.008
SerLys: 4.134 ± 0.01
SerLeu: 8.035 ± 0.016
SerMet: 1.885 ± 0.006
SerAsn: 3.035 ± 0.008
SerPro: 5.154 ± 0.018
SerGln: 3.676 ± 0.011
SerArg: 4.187 ± 0.011
SerSer: 9.032 ± 0.025
SerThr: 4.608 ± 0.013
SerVal: 5.388 ± 0.011
SerTrp: 0.953 ± 0.004
SerTyr: 2.169 ± 0.007
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.815 ± 0.01
ThrCys: 1.381 ± 0.008
ThrAsp: 2.978 ± 0.009
ThrGlu: 3.697 ± 0.008
ThrPhe: 2.204 ± 0.006
ThrGly: 3.67 ± 0.011
ThrHis: 1.524 ± 0.007
ThrIle: 2.595 ± 0.007
ThrLys: 2.676 ± 0.007
ThrLeu: 5.486 ± 0.011
ThrMet: 1.203 ± 0.004
ThrAsn: 2.004 ± 0.006
ThrPro: 3.382 ± 0.01
ThrGln: 2.388 ± 0.007
ThrArg: 2.435 ± 0.007
ThrSer: 4.503 ± 0.013
ThrThr: 3.03 ± 0.014
ThrVal: 4.13 ± 0.011
ThrTrp: 0.641 ± 0.004
ThrTyr: 1.517 ± 0.006
ThrXaa: 0.001 ± 0.0
Val
ValAla: 3.98 ± 0.009
ValCys: 1.884 ± 0.008
ValAsp: 3.232 ± 0.008
ValGlu: 4.107 ± 0.009
ValPhe: 2.774 ± 0.008
ValGly: 3.385 ± 0.009
ValHis: 1.657 ± 0.006
ValIle: 3.225 ± 0.01
ValLys: 3.79 ± 0.009
ValLeu: 6.375 ± 0.013
ValMet: 1.594 ± 0.006
ValAsn: 2.646 ± 0.008
ValPro: 3.18 ± 0.008
ValGln: 2.846 ± 0.007
ValArg: 3.314 ± 0.009
ValSer: 5.27 ± 0.011
ValThr: 3.862 ± 0.01
ValVal: 4.245 ± 0.011
ValTrp: 0.782 ± 0.004
ValTyr: 1.928 ± 0.006
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.669 ± 0.003
TrpCys: 0.272 ± 0.002
TrpAsp: 0.666 ± 0.004
TrpGlu: 0.757 ± 0.003
TrpPhe: 0.465 ± 0.003
TrpGly: 0.625 ± 0.004
TrpHis: 0.299 ± 0.003
TrpIle: 0.603 ± 0.003
TrpLys: 0.757 ± 0.003
TrpLeu: 1.168 ± 0.006
TrpMet: 0.367 ± 0.003
TrpAsn: 0.534 ± 0.003
TrpPro: 0.437 ± 0.003
TrpGln: 0.497 ± 0.003
TrpArg: 0.699 ± 0.004
TrpSer: 0.933 ± 0.005
TrpThr: 0.693 ± 0.004
TrpVal: 0.702 ± 0.003
TrpTrp: 0.202 ± 0.002
TrpTyr: 0.351 ± 0.002
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.518 ± 0.006
TyrCys: 0.772 ± 0.004
TyrAsp: 1.463 ± 0.006
TyrGlu: 1.76 ± 0.006
TyrPhe: 1.322 ± 0.005
TyrGly: 1.764 ± 0.006
TyrHis: 0.829 ± 0.004
TyrIle: 1.678 ± 0.006
TyrLys: 1.63 ± 0.008
TyrLeu: 2.838 ± 0.008
TyrMet: 0.76 ± 0.003
TyrAsn: 1.286 ± 0.005
TyrPro: 1.309 ± 0.005
TyrGln: 1.267 ± 0.005
TyrArg: 1.703 ± 0.006
TyrSer: 2.399 ± 0.007
TyrThr: 1.741 ± 0.006
TyrVal: 1.716 ± 0.006
TyrTrp: 0.399 ± 0.003
TyrTyr: 1.074 ± 0.006
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.177 ± 0.016
Statistics based on 100275 proteins (62118981 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
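A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken to be the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so the resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome, not taken from the database.
toy = ["MKLLAELLS", "MSSLLPKQ", "MAAGLLLV", "MKKSSTT"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
print(f"LeuLeu: {mean_ll:.1f} ± {err_ll:.1f} per mille")
```

Resampling at the protein level keeps the dipeptides of one protein together, which is why the reported errors reflect variation between proteins rather than between individual positions.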

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski