Amino acid dipepetide frequency for Sinocyclocheilus grahami (Dianchi golden-line fish) (Barbus grahami)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.277AlaAla: 5.277 ± 0.017
1.314AlaCys: 1.314 ± 0.005
3.075AlaAsp: 3.075 ± 0.007
4.385AlaGlu: 4.385 ± 0.012
2.447AlaPhe: 2.447 ± 0.008
3.906AlaGly: 3.906 ± 0.01
1.514AlaHis: 1.514 ± 0.005
2.924AlaIle: 2.924 ± 0.008
3.362AlaLys: 3.362 ± 0.01
6.467AlaLeu: 6.467 ± 0.013
1.569AlaMet: 1.569 ± 0.006
2.186AlaAsn: 2.186 ± 0.007
3.034AlaPro: 3.034 ± 0.011
2.856AlaGln: 2.856 ± 0.01
2.997AlaArg: 2.997 ± 0.008
4.971AlaSer: 4.971 ± 0.012
3.27AlaThr: 3.27 ± 0.009
4.814AlaVal: 4.814 ± 0.012
0.643AlaTrp: 0.643 ± 0.004
1.597AlaTyr: 1.597 ± 0.006
0.001AlaXaa: 0.001 ± 0.0
Cys
1.234CysAla: 1.234 ± 0.006
0.656CysCys: 0.656 ± 0.004
1.158CysAsp: 1.158 ± 0.007
1.307CysGlu: 1.307 ± 0.007
1.03CysPhe: 1.03 ± 0.004
1.462CysGly: 1.462 ± 0.007
0.671CysHis: 0.671 ± 0.004
1.16CysIle: 1.16 ± 0.005
1.255CysLys: 1.255 ± 0.005
2.326CysLeu: 2.326 ± 0.008
0.529CysMet: 0.529 ± 0.003
0.881CysAsn: 0.881 ± 0.005
1.21CysPro: 1.21 ± 0.007
1.033CysGln: 1.033 ± 0.006
1.245CysArg: 1.245 ± 0.005
2.083CysSer: 2.083 ± 0.009
1.243CysThr: 1.243 ± 0.007
1.751CysVal: 1.751 ± 0.008
0.304CysTrp: 0.304 ± 0.002
0.655CysTyr: 0.655 ± 0.004
0.0CysXaa: 0.0 ± 0.0
Asp
2.987AspAla: 2.987 ± 0.008
1.165AspCys: 1.165 ± 0.007
3.032AspAsp: 3.032 ± 0.011
3.792AspGlu: 3.792 ± 0.009
2.225AspPhe: 2.225 ± 0.006
3.597AspGly: 3.597 ± 0.011
1.224AspHis: 1.224 ± 0.005
2.954AspIle: 2.954 ± 0.008
2.816AspLys: 2.816 ± 0.009
5.154AspLeu: 5.154 ± 0.01
1.337AspMet: 1.337 ± 0.004
1.986AspAsn: 1.986 ± 0.007
2.835AspPro: 2.835 ± 0.008
1.949AspGln: 1.949 ± 0.006
2.629AspArg: 2.629 ± 0.008
4.298AspSer: 4.298 ± 0.01
2.665AspThr: 2.665 ± 0.007
3.389AspVal: 3.389 ± 0.009
0.698AspTrp: 0.698 ± 0.004
1.648AspTyr: 1.648 ± 0.006
0.0AspXaa: 0.0 ± 0.0
Glu
4.436GluAla: 4.436 ± 0.013
1.314GluCys: 1.314 ± 0.007
4.318GluAsp: 4.318 ± 0.01
7.016GluGlu: 7.016 ± 0.019
2.146GluPhe: 2.146 ± 0.006
3.927GluGly: 3.927 ± 0.01
1.536GluHis: 1.536 ± 0.005
3.21GluIle: 3.21 ± 0.008
4.79GluLys: 4.79 ± 0.013
6.258GluLeu: 6.258 ± 0.015
1.857GluMet: 1.857 ± 0.006
2.931GluAsn: 2.931 ± 0.007
2.624GluPro: 2.624 ± 0.008
3.104GluGln: 3.104 ± 0.009
4.181GluArg: 4.181 ± 0.011
4.368GluSer: 4.368 ± 0.012
3.39GluThr: 3.39 ± 0.011
4.272GluVal: 4.272 ± 0.01
0.716GluTrp: 0.716 ± 0.004
1.766GluTyr: 1.766 ± 0.006
0.001GluXaa: 0.001 ± 0.0
Phe
2.041PheAla: 2.041 ± 0.007
1.042PheCys: 1.042 ± 0.004
1.915PheAsp: 1.915 ± 0.006
2.091PheGlu: 2.091 ± 0.006
1.859PhePhe: 1.859 ± 0.007
2.288PheGly: 2.288 ± 0.007
1.082PheHis: 1.082 ± 0.004
2.217PheIle: 2.217 ± 0.008
2.021PheLys: 2.021 ± 0.007
4.152PheLeu: 4.152 ± 0.012
0.907PheMet: 0.907 ± 0.004
1.63PheAsn: 1.63 ± 0.005
1.837PhePro: 1.837 ± 0.006
1.748PheGln: 1.748 ± 0.005
1.939PheArg: 1.939 ± 0.006
3.579PheSer: 3.579 ± 0.008
2.416PheThr: 2.416 ± 0.007
2.418PheVal: 2.418 ± 0.007
0.509PheTrp: 0.509 ± 0.003
1.341PheTyr: 1.341 ± 0.004
0.0PheXaa: 0.0 ± 0.0
Gly
3.64GlyAla: 3.64 ± 0.011
1.195GlyCys: 1.195 ± 0.005
3.129GlyAsp: 3.129 ± 0.009
3.851GlyGlu: 3.851 ± 0.011
2.467GlyPhe: 2.467 ± 0.008
4.406GlyGly: 4.406 ± 0.013
1.616GlyHis: 1.616 ± 0.005
2.816GlyIle: 2.816 ± 0.009
3.669GlyLys: 3.669 ± 0.01
5.314GlyLeu: 5.314 ± 0.01
1.539GlyMet: 1.539 ± 0.007
2.458GlyAsn: 2.458 ± 0.008
2.919GlyPro: 2.919 ± 0.016
2.641GlyGln: 2.641 ± 0.009
3.364GlyArg: 3.364 ± 0.009
5.203GlySer: 5.203 ± 0.012
3.344GlyThr: 3.344 ± 0.01
3.864GlyVal: 3.864 ± 0.009
0.751GlyTrp: 0.751 ± 0.004
1.887GlyTyr: 1.887 ± 0.007
0.001GlyXaa: 0.001 ± 0.0
His
1.357HisAla: 1.357 ± 0.006
0.794HisCys: 0.794 ± 0.004
1.055HisAsp: 1.055 ± 0.004
1.354HisGlu: 1.354 ± 0.005
1.156HisPhe: 1.156 ± 0.004
1.538HisGly: 1.538 ± 0.005
0.943HisHis: 0.943 ± 0.006
1.476HisIle: 1.476 ± 0.005
1.377HisLys: 1.377 ± 0.005
2.766HisLeu: 2.766 ± 0.008
0.694HisMet: 0.694 ± 0.004
1.095HisAsn: 1.095 ± 0.004
1.501HisPro: 1.501 ± 0.006
1.21HisGln: 1.21 ± 0.005
1.547HisArg: 1.547 ± 0.005
2.396HisSer: 2.396 ± 0.008
1.714HisThr: 1.714 ± 0.01
1.552HisVal: 1.552 ± 0.005
0.353HisTrp: 0.353 ± 0.002
0.919HisTyr: 0.919 ± 0.004
0.0HisXaa: 0.0 ± 0.0
Ile
2.801IleAla: 2.801 ± 0.007
1.198IleCys: 1.198 ± 0.005
2.367IleAsp: 2.367 ± 0.007
2.722IleGlu: 2.722 ± 0.007
2.112IlePhe: 2.112 ± 0.008
2.469IleGly: 2.469 ± 0.006
1.422IleHis: 1.422 ± 0.006
2.795IleIle: 2.795 ± 0.009
2.884IleLys: 2.884 ± 0.009
4.776IleLeu: 4.776 ± 0.011
1.248IleMet: 1.248 ± 0.005
2.225IleAsn: 2.225 ± 0.007
2.549IlePro: 2.549 ± 0.007
2.396IleGln: 2.396 ± 0.007
2.611IleArg: 2.611 ± 0.006
4.141IleSer: 4.141 ± 0.009
3.066IleThr: 3.066 ± 0.008
2.881IleVal: 2.881 ± 0.008
0.535IleTrp: 0.535 ± 0.003
1.656IleTyr: 1.656 ± 0.006
0.0IleXaa: 0.0 ± 0.0
Lys
3.684LysAla: 3.684 ± 0.011
1.153LysCys: 1.153 ± 0.005
3.405LysAsp: 3.405 ± 0.01
4.708LysGlu: 4.708 ± 0.012
1.796LysPhe: 1.796 ± 0.006
3.22LysGly: 3.22 ± 0.009
1.556LysHis: 1.556 ± 0.006
2.861LysIle: 2.861 ± 0.008
4.45LysLys: 4.45 ± 0.015
5.324LysLeu: 5.324 ± 0.013
1.551LysMet: 1.551 ± 0.005
2.537LysAsn: 2.537 ± 0.007
2.841LysPro: 2.841 ± 0.009
2.674LysGln: 2.674 ± 0.008
3.374LysArg: 3.374 ± 0.008
3.987LysSer: 3.987 ± 0.011
3.274LysThr: 3.274 ± 0.009
3.656LysVal: 3.656 ± 0.011
0.619LysTrp: 0.619 ± 0.003
1.675LysTyr: 1.675 ± 0.006
0.001LysXaa: 0.001 ± 0.0
Leu
5.964LeuAla: 5.964 ± 0.012
2.338LeuCys: 2.338 ± 0.008
4.956LeuAsp: 4.956 ± 0.01
6.595LeuGlu: 6.595 ± 0.017
3.81LeuPhe: 3.81 ± 0.01
5.038LeuGly: 5.038 ± 0.012
2.805LeuHis: 2.805 ± 0.009
4.288LeuIle: 4.288 ± 0.009
6.02LeuLys: 6.02 ± 0.012
10.122LeuLeu: 10.122 ± 0.023
2.279LeuMet: 2.279 ± 0.007
3.972LeuAsn: 3.972 ± 0.009
4.939LeuPro: 4.939 ± 0.012
5.484LeuGln: 5.484 ± 0.014
5.547LeuArg: 5.547 ± 0.012
8.192LeuSer: 8.192 ± 0.016
5.286LeuThr: 5.286 ± 0.011
5.483LeuVal: 5.483 ± 0.013
1.081LeuTrp: 1.081 ± 0.005
2.875LeuTyr: 2.875 ± 0.007
0.001LeuXaa: 0.001 ± 0.0
Met
1.876MetAla: 1.876 ± 0.007
0.546MetCys: 0.546 ± 0.003
1.489MetAsp: 1.489 ± 0.005
2.022MetGlu: 2.022 ± 0.006
0.954MetPhe: 0.954 ± 0.005
1.463MetGly: 1.463 ± 0.005
0.583MetHis: 0.583 ± 0.003
1.014MetIle: 1.014 ± 0.004
1.565MetLys: 1.565 ± 0.005
2.257MetLeu: 2.257 ± 0.006
0.716MetMet: 0.716 ± 0.004
1.027MetAsn: 1.027 ± 0.005
1.139MetPro: 1.139 ± 0.005
1.075MetGln: 1.075 ± 0.004
1.251MetArg: 1.251 ± 0.005
1.893MetSer: 1.893 ± 0.006
1.305MetThr: 1.305 ± 0.005
1.619MetVal: 1.619 ± 0.006
0.285MetTrp: 0.285 ± 0.002
0.715MetTyr: 0.715 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.303AsnAla: 2.303 ± 0.007
0.936AsnCys: 0.936 ± 0.005
1.823AsnAsp: 1.823 ± 0.007
2.327AsnGlu: 2.327 ± 0.006
1.537AsnPhe: 1.537 ± 0.005
2.834AsnGly: 2.834 ± 0.01
1.036AsnHis: 1.036 ± 0.004
2.463AsnIle: 2.463 ± 0.006
2.375AsnLys: 2.375 ± 0.006
3.831AsnLeu: 3.831 ± 0.009
1.129AsnMet: 1.129 ± 0.005
1.857AsnAsn: 1.857 ± 0.007
2.274AsnPro: 2.274 ± 0.006
1.775AsnGln: 1.775 ± 0.006
2.049AsnArg: 2.049 ± 0.006
3.294AsnSer: 3.294 ± 0.008
2.367AsnThr: 2.367 ± 0.006
2.467AsnVal: 2.467 ± 0.006
0.473AsnTrp: 0.473 ± 0.003
1.259AsnTyr: 1.259 ± 0.005
0.001AsnXaa: 0.001 ± 0.0
Pro
3.536ProAla: 3.536 ± 0.01
1.018ProCys: 1.018 ± 0.006
2.793ProAsp: 2.793 ± 0.006
3.645ProGlu: 3.645 ± 0.011
1.85ProPhe: 1.85 ± 0.006
3.528ProGly: 3.528 ± 0.019
1.431ProHis: 1.431 ± 0.006
1.995ProIle: 1.995 ± 0.005
2.539ProLys: 2.539 ± 0.007
4.615ProLeu: 4.615 ± 0.011
1.047ProMet: 1.047 ± 0.005
1.945ProAsn: 1.945 ± 0.006
4.547ProPro: 4.547 ± 0.02
2.441ProGln: 2.441 ± 0.009
2.464ProArg: 2.464 ± 0.008
4.965ProSer: 4.965 ± 0.014
2.847ProThr: 2.847 ± 0.009
3.59ProVal: 3.59 ± 0.01
0.536ProTrp: 0.536 ± 0.003
1.429ProTyr: 1.429 ± 0.006
0.0ProXaa: 0.0 ± 0.0
Gln
3.042GlnAla: 3.042 ± 0.01
1.058GlnCys: 1.058 ± 0.006
2.344GlnAsp: 2.344 ± 0.007
3.371GlnGlu: 3.371 ± 0.01
1.509GlnPhe: 1.509 ± 0.005
2.535GlnGly: 2.535 ± 0.008
1.354GlnHis: 1.354 ± 0.005
2.175GlnIle: 2.175 ± 0.006
2.73GlnLys: 2.73 ± 0.008
4.461GlnLeu: 4.461 ± 0.012
1.235GlnMet: 1.235 ± 0.005
1.918GlnAsn: 1.918 ± 0.006
2.307GlnPro: 2.307 ± 0.01
2.999GlnGln: 2.999 ± 0.021
2.88GlnArg: 2.88 ± 0.009
3.365GlnSer: 3.365 ± 0.01
2.564GlnThr: 2.564 ± 0.008
2.832GlnVal: 2.832 ± 0.007
0.561GlnTrp: 0.561 ± 0.003
1.371GlnTyr: 1.371 ± 0.005
0.0GlnXaa: 0.0 ± 0.0
Arg
3.256ArgAla: 3.256 ± 0.008
1.229ArgCys: 1.229 ± 0.006
2.909ArgAsp: 2.909 ± 0.007
3.784ArgGlu: 3.784 ± 0.012
2.037ArgPhe: 2.037 ± 0.007
3.137ArgGly: 3.137 ± 0.009
1.507ArgHis: 1.507 ± 0.005
2.581ArgIle: 2.581 ± 0.007
3.547ArgLys: 3.547 ± 0.008
5.191ArgLeu: 5.191 ± 0.012
1.344ArgMet: 1.344 ± 0.005
2.21ArgAsn: 2.21 ± 0.005
2.632ArgPro: 2.632 ± 0.007
2.569ArgGln: 2.569 ± 0.008
3.804ArgArg: 3.804 ± 0.011
4.154ArgSer: 4.154 ± 0.012
2.781ArgThr: 2.781 ± 0.008
3.32ArgVal: 3.32 ± 0.009
0.667ArgTrp: 0.667 ± 0.004
1.609ArgTyr: 1.609 ± 0.005
0.0ArgXaa: 0.0 ± 0.0
Ser
5.2SerAla: 5.2 ± 0.013
1.89SerCys: 1.89 ± 0.008
4.261SerAsp: 4.261 ± 0.011
4.903SerGlu: 4.903 ± 0.011
3.221SerPhe: 3.221 ± 0.007
5.297SerGly: 5.297 ± 0.013
2.171SerHis: 2.171 ± 0.006
3.662SerIle: 3.662 ± 0.009
4.134SerLys: 4.134 ± 0.011
8.098SerLeu: 8.098 ± 0.016
1.876SerMet: 1.876 ± 0.006
3.048SerAsn: 3.048 ± 0.008
5.216SerPro: 5.216 ± 0.015
3.675SerGln: 3.675 ± 0.011
4.195SerArg: 4.195 ± 0.01
9.087SerSer: 9.087 ± 0.02
4.665SerThr: 4.665 ± 0.011
5.479SerVal: 5.479 ± 0.01
0.96SerTrp: 0.96 ± 0.004
2.183SerTyr: 2.183 ± 0.006
0.001SerXaa: 0.001 ± 0.0
Thr
3.818ThrAla: 3.818 ± 0.009
1.379ThrCys: 1.379 ± 0.008
2.976ThrAsp: 2.976 ± 0.009
3.706ThrGlu: 3.706 ± 0.009
2.23ThrPhe: 2.23 ± 0.007
3.637ThrGly: 3.637 ± 0.01
1.559ThrHis: 1.559 ± 0.011
2.657ThrIle: 2.657 ± 0.007
2.67ThrLys: 2.67 ± 0.007
5.525ThrLeu: 5.525 ± 0.013
1.211ThrMet: 1.211 ± 0.005
2.011ThrAsn: 2.011 ± 0.006
3.431ThrPro: 3.431 ± 0.01
2.365ThrGln: 2.365 ± 0.007
2.449ThrArg: 2.449 ± 0.006
4.549ThrSer: 4.549 ± 0.015
3.108ThrThr: 3.108 ± 0.018
4.178ThrVal: 4.178 ± 0.009
0.646ThrTrp: 0.646 ± 0.004
1.549ThrTyr: 1.549 ± 0.005
0.0ThrXaa: 0.0 ± 0.0
Val
3.961ValAla: 3.961 ± 0.009
1.907ValCys: 1.907 ± 0.008
3.229ValAsp: 3.229 ± 0.008
4.069ValGlu: 4.069 ± 0.009
2.812ValPhe: 2.812 ± 0.009
3.402ValGly: 3.402 ± 0.009
1.657ValHis: 1.657 ± 0.005
3.267ValIle: 3.267 ± 0.008
3.76ValLys: 3.76 ± 0.009
6.42ValLeu: 6.42 ± 0.012
1.6ValMet: 1.6 ± 0.005
2.658ValAsn: 2.658 ± 0.008
3.175ValPro: 3.175 ± 0.009
2.838ValGln: 2.838 ± 0.007
3.314ValArg: 3.314 ± 0.009
5.317ValSer: 5.317 ± 0.011
3.885ValThr: 3.885 ± 0.009
4.247ValVal: 4.247 ± 0.01
0.791ValTrp: 0.791 ± 0.004
1.954ValTyr: 1.954 ± 0.007
0.001ValXaa: 0.001 ± 0.0
Trp
0.659TrpAla: 0.659 ± 0.004
0.272TrpCys: 0.272 ± 0.002
0.661TrpAsp: 0.661 ± 0.004
0.757TrpGlu: 0.757 ± 0.004
0.485TrpPhe: 0.485 ± 0.003
0.623TrpGly: 0.623 ± 0.004
0.299TrpHis: 0.299 ± 0.002
0.614TrpIle: 0.614 ± 0.004
0.755TrpLys: 0.755 ± 0.004
1.158TrpLeu: 1.158 ± 0.005
0.369TrpMet: 0.369 ± 0.003
0.538TrpAsn: 0.538 ± 0.003
0.442TrpPro: 0.442 ± 0.003
0.504TrpGln: 0.504 ± 0.003
0.696TrpArg: 0.696 ± 0.003
0.943TrpSer: 0.943 ± 0.005
0.697TrpThr: 0.697 ± 0.004
0.704TrpVal: 0.704 ± 0.004
0.206TrpTrp: 0.206 ± 0.002
0.356TrpTyr: 0.356 ± 0.003
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.537TyrAla: 1.537 ± 0.006
0.789TyrCys: 0.789 ± 0.004
1.486TyrAsp: 1.486 ± 0.006
1.755TyrGlu: 1.755 ± 0.006
1.349TyrPhe: 1.349 ± 0.005
1.767TyrGly: 1.767 ± 0.006
0.829TyrHis: 0.829 ± 0.004
1.725TyrIle: 1.725 ± 0.007
1.63TyrLys: 1.63 ± 0.007
2.861TyrLeu: 2.861 ± 0.007
0.756TyrMet: 0.756 ± 0.004
1.297TyrAsn: 1.297 ± 0.005
1.31TyrPro: 1.31 ± 0.005
1.276TyrGln: 1.276 ± 0.005
1.729TyrArg: 1.729 ± 0.005
2.433TyrSer: 2.433 ± 0.007
1.766TyrThr: 1.766 ± 0.006
1.744TyrVal: 1.744 ± 0.005
0.407TyrTrp: 0.407 ± 0.003
1.079TyrTyr: 1.079 ± 0.005
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.221XaaXaa: 0.221 ± 0.028
Statistics based on 102128 proteins (62724889 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski