Amino acid dipeptide frequency for Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
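The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: it assumes one-letter amino acid codes, counts every overlapping adjacent pair within each protein, and the names `dipeptide_permille`, `EXPECTED` and `label` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes), in the table's order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED = 1000.0 / (len(AMINO_ACIDS) ** 2)  # 1000 / 400 = 2.5


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs.

    Assumption: pairs are counted with overlapping windows and never
    span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent residue pair of one protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_permille, expected=EXPECTED):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # shown in red in the table
    if freq_permille < expected:
        return "under"  # shown in blue in the table
    return "expected"


# Toy input: the pairs are MK, KL, LL, LL, LA, so LL makes up 2 of 5.
toy = dipeptide_permille(["MKLL", "LLA"])
print(toy["LL"], label(toy["LL"]))
```

With the values from the table, LeuLeu (9.561‰) would be labelled "over" and TrpTrp (0.179‰) "under" under this uniform baseline.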

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
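As a worked example of the unit conversion, taking the AlaAla entry from the table:

```latex
\[
5.215\,\text{\textperthousand} = 5.215 \times 10^{-3} = 0.005215 = 0.5215\,\%
\]
```

By the same conversion, the uniform expectation of 1/400 corresponds to 2.5‰.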

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.215 ± 0.025
AlaCys: 1.205 ± 0.01
AlaAsp: 2.906 ± 0.013
AlaGlu: 4.424 ± 0.019
AlaPhe: 2.348 ± 0.012
AlaGly: 3.656 ± 0.023
AlaHis: 1.357 ± 0.007
AlaIle: 2.962 ± 0.013
AlaLys: 3.63 ± 0.018
AlaLeu: 5.984 ± 0.022
AlaMet: 1.406 ± 0.008
AlaAsn: 2.316 ± 0.01
AlaPro: 3.182 ± 0.025
AlaGln: 2.791 ± 0.014
AlaArg: 2.806 ± 0.015
AlaSer: 5.038 ± 0.02
AlaThr: 3.32 ± 0.019
AlaVal: 4.309 ± 0.016
AlaTrp: 0.604 ± 0.006
AlaTyr: 1.5 ± 0.01
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.22 ± 0.01
CysCys: 0.65 ± 0.009
CysAsp: 1.108 ± 0.01
CysGlu: 1.268 ± 0.012
CysPhe: 0.94 ± 0.008
CysGly: 1.621 ± 0.015
CysHis: 0.651 ± 0.006
CysIle: 1.209 ± 0.01
CysLys: 1.326 ± 0.009
CysLeu: 2.128 ± 0.022
CysMet: 0.506 ± 0.012
CysAsn: 1.029 ± 0.008
CysPro: 1.307 ± 0.014
CysGln: 1.059 ± 0.009
CysArg: 1.211 ± 0.009
CysSer: 2.146 ± 0.017
CysThr: 1.393 ± 0.012
CysVal: 1.417 ± 0.03
CysTrp: 0.263 ± 0.004
CysTyr: 0.629 ± 0.005
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.691 ± 0.011
AspCys: 1.114 ± 0.008
AspAsp: 2.861 ± 0.02
AspGlu: 3.61 ± 0.016
AspPhe: 2.194 ± 0.011
AspGly: 3.26 ± 0.016
AspHis: 1.156 ± 0.007
AspIle: 3.158 ± 0.02
AspLys: 2.878 ± 0.011
AspLeu: 5.051 ± 0.017
AspMet: 1.219 ± 0.008
AspAsn: 2.127 ± 0.01
AspPro: 2.705 ± 0.014
AspGln: 1.841 ± 0.008
AspArg: 2.384 ± 0.014
AspSer: 4.341 ± 0.017
AspThr: 2.823 ± 0.016
AspVal: 3.132 ± 0.02
AspTrp: 0.656 ± 0.006
AspTyr: 1.597 ± 0.009
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.304 ± 0.019
GluCys: 1.49 ± 0.017
GluAsp: 4.294 ± 0.016
GluGlu: 7.534 ± 0.033
GluPhe: 2.063 ± 0.01
GluGly: 3.878 ± 0.02
GluHis: 1.563 ± 0.011
GluIle: 3.577 ± 0.018
GluLys: 5.593 ± 0.031
GluLeu: 6.136 ± 0.023
GluMet: 1.749 ± 0.01
GluAsn: 3.472 ± 0.015
GluPro: 2.73 ± 0.019
GluGln: 3.212 ± 0.018
GluArg: 3.941 ± 0.026
GluSer: 4.745 ± 0.021
GluThr: 3.767 ± 0.034
GluVal: 3.98 ± 0.014
GluTrp: 0.717 ± 0.006
GluTyr: 1.772 ± 0.01
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.027 ± 0.011
PheCys: 0.969 ± 0.007
PheAsp: 1.722 ± 0.009
PheGlu: 1.898 ± 0.009
PhePhe: 1.577 ± 0.01
PheGly: 2.158 ± 0.011
PheHis: 1.052 ± 0.008
PheIle: 2.066 ± 0.011
PheLys: 1.946 ± 0.013
PheLeu: 3.855 ± 0.017
PheMet: 0.826 ± 0.006
PheAsn: 1.573 ± 0.01
PhePro: 1.953 ± 0.013
PheGln: 1.747 ± 0.008
PheArg: 1.776 ± 0.009
PheSer: 3.569 ± 0.019
PheThr: 2.249 ± 0.012
PheVal: 2.131 ± 0.01
PheTrp: 0.489 ± 0.005
PheTyr: 1.269 ± 0.008
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.548 ± 0.019
GlyCys: 1.246 ± 0.011
GlyAsp: 3.028 ± 0.016
GlyGlu: 3.885 ± 0.021
GlyPhe: 2.43 ± 0.013
GlyGly: 3.994 ± 0.024
GlyHis: 1.533 ± 0.01
GlyIle: 3.085 ± 0.016
GlyLys: 4.026 ± 0.016
GlyLeu: 4.883 ± 0.02
GlyMet: 1.337 ± 0.009
GlyAsn: 2.745 ± 0.015
GlyPro: 2.859 ± 0.036
GlyGln: 2.474 ± 0.015
GlyArg: 3.169 ± 0.016
GlySer: 5.286 ± 0.023
GlyThr: 3.677 ± 0.022
GlyVal: 3.389 ± 0.016
GlyTrp: 0.692 ± 0.009
GlyTyr: 1.885 ± 0.011
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.192 ± 0.008
HisCys: 0.707 ± 0.007
HisAsp: 0.946 ± 0.008
HisGlu: 1.326 ± 0.008
HisPhe: 1.067 ± 0.007
HisGly: 1.455 ± 0.009
HisHis: 0.897 ± 0.011
HisIle: 1.49 ± 0.013
HisLys: 1.482 ± 0.009
HisLeu: 2.734 ± 0.013
HisMet: 0.611 ± 0.006
HisAsn: 1.063 ± 0.007
HisPro: 1.469 ± 0.01
HisGln: 1.244 ± 0.01
HisArg: 1.513 ± 0.011
HisSer: 2.349 ± 0.017
HisThr: 1.795 ± 0.047
HisVal: 1.451 ± 0.01
HisTrp: 0.34 ± 0.004
HisTyr: 0.861 ± 0.006
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.845 ± 0.011
IleCys: 1.263 ± 0.01
IleAsp: 2.396 ± 0.014
IleGlu: 2.913 ± 0.012
IlePhe: 2.036 ± 0.01
IleGly: 2.556 ± 0.013
IleHis: 1.566 ± 0.021
IleIle: 2.869 ± 0.015
IleLys: 3.192 ± 0.015
IleLeu: 4.969 ± 0.02
IleMet: 1.154 ± 0.008
IleAsn: 2.362 ± 0.011
IlePro: 2.964 ± 0.014
IleGln: 2.604 ± 0.015
IleArg: 2.577 ± 0.011
IleSer: 4.503 ± 0.016
IleThr: 3.106 ± 0.016
IleVal: 2.866 ± 0.015
IleTrp: 0.581 ± 0.009
IleTyr: 1.708 ± 0.028
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.982 ± 0.02
LysCys: 1.343 ± 0.012
LysAsp: 3.529 ± 0.016
LysGlu: 5.507 ± 0.028
LysPhe: 1.791 ± 0.013
LysGly: 3.539 ± 0.028
LysHis: 1.595 ± 0.009
LysIle: 3.129 ± 0.013
LysLys: 5.257 ± 0.024
LysLeu: 5.442 ± 0.02
LysMet: 1.627 ± 0.013
LysAsn: 2.814 ± 0.012
LysPro: 3.223 ± 0.021
LysGln: 2.934 ± 0.015
LysArg: 3.511 ± 0.016
LysSer: 4.502 ± 0.021
LysThr: 3.413 ± 0.018
LysVal: 3.794 ± 0.023
LysTrp: 0.676 ± 0.006
LysTyr: 1.788 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.63 ± 0.021
LeuCys: 2.231 ± 0.019
LeuAsp: 4.615 ± 0.016
LeuGlu: 6.561 ± 0.026
LeuPhe: 3.369 ± 0.017
LeuGly: 4.932 ± 0.018
LeuHis: 2.737 ± 0.013
LeuIle: 4.258 ± 0.019
LeuLys: 6.097 ± 0.024
LeuLeu: 9.561 ± 0.033
LeuMet: 1.953 ± 0.009
LeuAsn: 4.048 ± 0.015
LeuPro: 5.252 ± 0.028
LeuGln: 5.529 ± 0.025
LeuArg: 5.062 ± 0.019
LeuSer: 8.131 ± 0.035
LeuThr: 5.054 ± 0.02
LeuVal: 4.975 ± 0.019
LeuTrp: 1.049 ± 0.007
LeuTyr: 2.78 ± 0.016
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.732 ± 0.01
MetCys: 0.478 ± 0.007
MetAsp: 1.359 ± 0.008
MetGlu: 2.012 ± 0.01
MetPhe: 0.815 ± 0.006
MetGly: 1.323 ± 0.011
MetHis: 0.519 ± 0.005
MetIle: 0.962 ± 0.006
MetLys: 1.552 ± 0.008
MetLeu: 1.95 ± 0.01
MetMet: 0.582 ± 0.006
MetAsn: 0.991 ± 0.006
MetPro: 1.126 ± 0.018
MetGln: 1.081 ± 0.009
MetArg: 1.085 ± 0.007
MetSer: 1.734 ± 0.01
MetThr: 1.126 ± 0.007
MetVal: 1.374 ± 0.008
MetTrp: 0.249 ± 0.003
MetTyr: 0.692 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.414 ± 0.01
AsnCys: 0.988 ± 0.008
AsnAsp: 1.94 ± 0.011
AsnGlu: 2.68 ± 0.013
AsnPhe: 1.567 ± 0.008
AsnGly: 2.903 ± 0.015
AsnHis: 1.033 ± 0.007
AsnIle: 2.751 ± 0.013
AsnLys: 2.736 ± 0.012
AsnLeu: 4.113 ± 0.019
AsnMet: 1.134 ± 0.008
AsnAsn: 2.08 ± 0.012
AsnPro: 2.366 ± 0.012
AsnGln: 1.923 ± 0.009
AsnArg: 2.023 ± 0.011
AsnSer: 3.708 ± 0.016
AsnThr: 2.451 ± 0.012
AsnVal: 2.662 ± 0.013
AsnTrp: 0.539 ± 0.004
AsnTyr: 1.361 ± 0.007
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.579 ± 0.023
ProCys: 1.114 ± 0.011
ProAsp: 2.719 ± 0.018
ProGlu: 3.773 ± 0.019
ProPhe: 1.947 ± 0.014
ProGly: 3.773 ± 0.043
ProHis: 1.329 ± 0.011
ProIle: 2.204 ± 0.013
ProLys: 2.893 ± 0.029
ProLeu: 4.69 ± 0.02
ProMet: 1.014 ± 0.007
ProAsn: 2.104 ± 0.014
ProPro: 4.566 ± 0.036
ProGln: 2.471 ± 0.016
ProArg: 2.541 ± 0.015
ProSer: 5.253 ± 0.031
ProThr: 3.171 ± 0.025
ProVal: 3.752 ± 0.02
ProTrp: 0.539 ± 0.005
ProTyr: 1.513 ± 0.011
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.964 ± 0.016
GlnCys: 1.028 ± 0.01
GlnAsp: 2.315 ± 0.01
GlnGlu: 3.652 ± 0.021
GlnPhe: 1.493 ± 0.012
GlnGly: 2.568 ± 0.015
GlnHis: 1.287 ± 0.008
GlnIle: 2.346 ± 0.01
GlnLys: 3.127 ± 0.015
GlnLeu: 4.392 ± 0.02
GlnMet: 1.168 ± 0.007
GlnAsn: 2.211 ± 0.011
GlnPro: 2.338 ± 0.017
GlnGln: 2.987 ± 0.031
GlnArg: 2.8 ± 0.014
GlnSer: 3.524 ± 0.014
GlnThr: 2.54 ± 0.017
GlnVal: 2.607 ± 0.01
GlnTrp: 0.548 ± 0.005
GlnTyr: 1.322 ± 0.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.976 ± 0.014
ArgCys: 1.122 ± 0.01
ArgAsp: 2.683 ± 0.013
ArgGlu: 3.656 ± 0.024
ArgPhe: 1.826 ± 0.009
ArgGly: 2.98 ± 0.019
ArgHis: 1.399 ± 0.011
ArgIle: 2.632 ± 0.012
ArgLys: 3.74 ± 0.016
ArgLeu: 4.752 ± 0.017
ArgMet: 1.186 ± 0.008
ArgAsn: 2.321 ± 0.01
ArgPro: 2.55 ± 0.017
ArgGln: 2.473 ± 0.015
ArgArg: 3.626 ± 0.021
ArgSer: 4.109 ± 0.023
ArgThr: 2.718 ± 0.011
ArgVal: 2.902 ± 0.015
ArgTrp: 0.596 ± 0.006
ArgTyr: 1.506 ± 0.009
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.164 ± 0.018
SerCys: 2.041 ± 0.015
SerAsp: 4.407 ± 0.018
SerGlu: 5.328 ± 0.024
SerPhe: 3.18 ± 0.015
SerGly: 5.22 ± 0.027
SerHis: 2.221 ± 0.016
SerIle: 3.856 ± 0.019
SerLys: 4.589 ± 0.017
SerLeu: 8.219 ± 0.033
SerMet: 1.744 ± 0.009
SerAsn: 3.444 ± 0.016
SerPro: 5.505 ± 0.032
SerGln: 3.819 ± 0.02
SerArg: 4.135 ± 0.021
SerSer: 9.362 ± 0.049
SerThr: 4.914 ± 0.047
SerVal: 5.356 ± 0.028
SerTrp: 0.989 ± 0.008
SerTyr: 2.295 ± 0.011
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.725 ± 0.017
ThrCys: 1.449 ± 0.016
ThrAsp: 2.976 ± 0.019
ThrGlu: 4.168 ± 0.032
ThrPhe: 2.194 ± 0.011
ThrGly: 3.759 ± 0.03
ThrHis: 1.528 ± 0.042
ThrIle: 2.794 ± 0.015
ThrLys: 3.034 ± 0.013
ThrLeu: 5.264 ± 0.02
ThrMet: 1.181 ± 0.008
ThrAsn: 2.172 ± 0.011
ThrPro: 3.482 ± 0.024
ThrGln: 2.362 ± 0.02
ThrArg: 2.339 ± 0.014
ThrSer: 4.979 ± 0.036
ThrThr: 3.578 ± 0.098
ThrVal: 4.155 ± 0.024
ThrTrp: 0.683 ± 0.006
ThrTyr: 1.66 ± 0.02
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.674 ± 0.015
ValCys: 1.649 ± 0.035
ValAsp: 2.909 ± 0.019
ValGlu: 3.807 ± 0.019
ValPhe: 2.341 ± 0.011
ValGly: 3.189 ± 0.017
ValHis: 1.507 ± 0.009
ValIle: 3.214 ± 0.014
ValLys: 3.679 ± 0.017
ValLeu: 5.834 ± 0.019
ValMet: 1.334 ± 0.007
ValAsn: 2.536 ± 0.013
ValPro: 3.558 ± 0.024
ValGln: 2.841 ± 0.012
ValArg: 2.88 ± 0.012
ValSer: 5.204 ± 0.021
ValThr: 3.961 ± 0.018
ValVal: 3.691 ± 0.014
ValTrp: 0.69 ± 0.006
ValTyr: 1.784 ± 0.012
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.675 ± 0.006
TrpCys: 0.242 ± 0.003
TrpAsp: 0.649 ± 0.008
TrpGlu: 0.782 ± 0.006
TrpPhe: 0.415 ± 0.004
TrpGly: 0.671 ± 0.008
TrpHis: 0.288 ± 0.004
TrpIle: 0.632 ± 0.007
TrpLys: 0.827 ± 0.006
TrpLeu: 1.113 ± 0.009
TrpMet: 0.317 ± 0.003
TrpAsn: 0.595 ± 0.005
TrpPro: 0.424 ± 0.005
TrpGln: 0.514 ± 0.005
TrpArg: 0.648 ± 0.005
TrpSer: 0.842 ± 0.007
TrpThr: 0.613 ± 0.006
TrpVal: 0.666 ± 0.008
TrpTrp: 0.179 ± 0.003
TrpTyr: 0.359 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.402 ± 0.008
TyrCys: 0.75 ± 0.006
TyrAsp: 1.435 ± 0.009
TyrGlu: 1.735 ± 0.009
TyrPhe: 1.258 ± 0.008
TyrGly: 1.739 ± 0.011
TyrHis: 0.774 ± 0.008
TyrIle: 1.835 ± 0.028
TyrLys: 1.793 ± 0.017
TyrLeu: 2.753 ± 0.015
TyrMet: 0.694 ± 0.006
TyrAsn: 1.37 ± 0.009
TyrPro: 1.387 ± 0.01
TyrGln: 1.308 ± 0.007
TyrArg: 1.728 ± 0.011
TyrSer: 2.464 ± 0.012
TyrThr: 1.813 ± 0.015
TyrVal: 1.669 ± 0.015
TyrTrp: 0.371 ± 0.005
TyrTyr: 1.068 ± 0.007
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.004 ± 0.002
Statistics based on 46313 proteins (29680869 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
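A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the reported error is the standard deviation across replicates, and that pairs are counted with overlapping windows. The function names and the `seed` parameter are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count every overlapping adjacent residue pair in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(sequences, pair, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) once per protein so each replicate
    # only has to sum numbers.
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy proteome; real input would be the full set of protein sequences.
toy_proteome = ["LLAK", "MKLL", "ALLA", "GGGG"]
m, s = bootstrap_permille(toy_proteome, "LL")
print(f"LL: {m:.3f} ± {s:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.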

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski