Amino acid dipeptide frequency for Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
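As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for this example; the exact counting procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Hypothetical helper: pairs are counted with overlap and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real platyfish proteins), one-letter amino acid codes
toy = ["MKLLA", "LLS"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["LL"], 1))  # 2 of the 6 pairs are LL -> 333.3
```

Because the counts are normalised by the total number of pairs, the resulting values always sum to 1000‰.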

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 amino acids were equally frequent, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
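The arithmetic behind the expected value can be sketched as follows. The example assumes that "expected" means the uniform baseline of 1/400 and that a simple above/below comparison is used for the colouring; the page does not state the exact rule, so treat the `classify` helper as hypothetical. The four observed values are taken from the table.

```python
N_STANDARD = 20                                # standard amino acids
N_DIPEPTIDES = N_STANDARD ** 2                 # 400 ordered pairs
expected_per_mille = 1000 / N_DIPEPTIDES       # 2.5 per mille = 0.25 %


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed frequencies (per mille) copied from the table
observed = {"LeuLeu": 9.978, "SerSer": 10.79, "TrpTrp": 0.19, "AlaAsp": 3.187}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```

Under this baseline LeuLeu and SerSer are roughly four times more frequent than expected, while TrpTrp is more than ten times rarer.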

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency ± error, in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.493 ± 0.035 | 1.244 ± 0.008 | 3.187 ± 0.014 | 4.634 ± 0.02 | 2.35 ± 0.012 | 4.383 ± 0.023 | 1.431 ± 0.009 | 2.606 ± 0.014 | 3.293 ± 0.019 | 6.303 ± 0.025 | 1.505 ± 0.009 | 2.146 ± 0.011 | 3.57 ± 0.02 | 2.828 ± 0.016 | 2.994 ± 0.014 | 5.577 ± 0.026 | 3.401 ± 0.018 | 4.856 ± 0.02 | 0.634 ± 0.006 | 1.468 ± 0.01 | 0.0 ± 0.0 |
| Cys | 1.141 ± 0.009 | 0.667 ± 0.009 | 1.121 ± 0.011 | 1.268 ± 0.01 | 0.938 ± 0.008 | 1.707 ± 0.017 | 0.63 ± 0.007 | 0.969 ± 0.008 | 1.154 ± 0.009 | 2.156 ± 0.013 | 0.468 ± 0.005 | 0.867 ± 0.008 | 1.271 ± 0.012 | 1.024 ± 0.009 | 1.308 ± 0.01 | 2.269 ± 0.014 | 1.132 ± 0.009 | 1.567 ± 0.015 | 0.306 ± 0.004 | 0.613 ± 0.006 | 0.0 ± 0.0 |
| Asp | 2.935 ± 0.013 | 1.154 ± 0.01 | 3.103 ± 0.021 | 3.753 ± 0.017 | 2.137 ± 0.012 | 3.791 ± 0.019 | 1.166 ± 0.007 | 2.608 ± 0.016 | 2.699 ± 0.014 | 4.999 ± 0.019 | 1.247 ± 0.009 | 1.922 ± 0.013 | 2.873 ± 0.013 | 2.006 ± 0.013 | 2.806 ± 0.016 | 4.604 ± 0.023 | 2.467 ± 0.013 | 3.33 ± 0.018 | 0.669 ± 0.007 | 1.513 ± 0.011 | 0.0 ± 0.0 |
| Glu | 4.667 ± 0.023 | 1.2 ± 0.012 | 4.372 ± 0.019 | 7.671 ± 0.044 | 1.933 ± 0.011 | 3.947 ± 0.015 | 1.405 ± 0.01 | 2.875 ± 0.013 | 4.893 ± 0.028 | 6.069 ± 0.025 | 1.712 ± 0.009 | 2.877 ± 0.014 | 2.894 ± 0.025 | 3.066 ± 0.016 | 4.196 ± 0.022 | 4.56 ± 0.029 | 3.549 ± 0.019 | 4.219 ± 0.017 | 0.678 ± 0.006 | 1.539 ± 0.01 | 0.0 ± 0.0 |
| Phe | 1.901 ± 0.012 | 1.023 ± 0.008 | 1.772 ± 0.011 | 1.839 ± 0.011 | 1.65 ± 0.012 | 2.224 ± 0.014 | 1.018 ± 0.008 | 1.974 ± 0.013 | 1.83 ± 0.011 | 3.943 ± 0.026 | 0.827 ± 0.007 | 1.534 ± 0.01 | 1.883 ± 0.011 | 1.631 ± 0.01 | 1.909 ± 0.011 | 3.723 ± 0.022 | 2.33 ± 0.013 | 2.238 ± 0.013 | 0.493 ± 0.005 | 1.238 ± 0.009 | 0.0 ± 0.0 |
| Gly | 3.977 ± 0.022 | 1.252 ± 0.01 | 3.231 ± 0.016 | 4.153 ± 0.02 | 2.498 ± 0.014 | 5.303 ± 0.032 | 1.616 ± 0.012 | 2.568 ± 0.012 | 3.673 ± 0.02 | 5.399 ± 0.023 | 1.412 ± 0.011 | 2.501 ± 0.013 | 3.43 ± 0.044 | 2.717 ± 0.015 | 3.636 ± 0.02 | 5.959 ± 0.026 | 3.386 ± 0.017 | 3.923 ± 0.019 | 0.763 ± 0.007 | 1.785 ± 0.013 | 0.0 ± 0.0 |
| His | 1.27 ± 0.008 | 0.723 ± 0.006 | 0.97 ± 0.007 | 1.17 ± 0.007 | 1.066 ± 0.008 | 1.524 ± 0.01 | 1.065 ± 0.013 | 1.283 ± 0.008 | 1.319 ± 0.009 | 2.698 ± 0.014 | 0.769 ± 0.011 | 1.006 ± 0.008 | 1.614 ± 0.01 | 1.322 ± 0.01 | 1.651 ± 0.01 | 2.435 ± 0.014 | 1.646 ± 0.014 | 1.414 ± 0.01 | 0.331 ± 0.004 | 0.814 ± 0.006 | 0.0 ± 0.0 |
| Ile | 2.395 ± 0.013 | 1.11 ± 0.008 | 2.061 ± 0.012 | 2.353 ± 0.012 | 1.862 ± 0.013 | 2.26 ± 0.013 | 1.36 ± 0.011 | 2.415 ± 0.016 | 2.605 ± 0.013 | 4.268 ± 0.021 | 1.031 ± 0.007 | 1.997 ± 0.013 | 2.453 ± 0.014 | 2.182 ± 0.011 | 2.475 ± 0.011 | 3.831 ± 0.014 | 2.759 ± 0.013 | 2.531 ± 0.013 | 0.486 ± 0.006 | 1.397 ± 0.01 | 0.0 ± 0.0 |
| Lys | 3.717 ± 0.018 | 1.056 ± 0.008 | 3.254 ± 0.014 | 4.755 ± 0.025 | 1.606 ± 0.01 | 3.21 ± 0.023 | 1.436 ± 0.009 | 2.55 ± 0.013 | 4.514 ± 0.026 | 5.022 ± 0.019 | 1.49 ± 0.01 | 2.351 ± 0.014 | 3.117 ± 0.023 | 2.612 ± 0.017 | 3.433 ± 0.015 | 4.07 ± 0.022 | 3.343 ± 0.015 | 3.547 ± 0.022 | 0.576 ± 0.006 | 1.474 ± 0.01 | 0.0 ± 0.0 |
| Leu | 5.859 ± 0.023 | 2.206 ± 0.014 | 4.868 ± 0.019 | 6.329 ± 0.027 | 3.445 ± 0.019 | 5.03 ± 0.021 | 2.709 ± 0.015 | 3.823 ± 0.017 | 5.658 ± 0.027 | 9.978 ± 0.037 | 2.08 ± 0.011 | 3.688 ± 0.014 | 5.349 ± 0.025 | 5.501 ± 0.022 | 5.586 ± 0.021 | 8.397 ± 0.029 | 5.382 ± 0.022 | 5.389 ± 0.021 | 1.048 ± 0.008 | 2.535 ± 0.012 | 0.0 ± 0.0 |
| Met | 1.832 ± 0.011 | 0.474 ± 0.005 | 1.393 ± 0.008 | 1.955 ± 0.011 | 0.872 ± 0.008 | 1.355 ± 0.011 | 0.479 ± 0.005 | 0.916 ± 0.009 | 1.516 ± 0.01 | 2.021 ± 0.012 | 0.742 ± 0.008 | 0.915 ± 0.008 | 1.062 ± 0.011 | 0.966 ± 0.007 | 1.23 ± 0.009 | 1.883 ± 0.01 | 1.278 ± 0.008 | 1.49 ± 0.009 | 0.257 ± 0.004 | 0.626 ± 0.005 | 0.0 ± 0.0 |
| Asn | 2.153 ± 0.012 | 0.919 ± 0.008 | 1.714 ± 0.012 | 2.1 ± 0.011 | 1.451 ± 0.009 | 2.794 ± 0.018 | 1.021 ± 0.007 | 2.196 ± 0.011 | 2.28 ± 0.012 | 3.821 ± 0.019 | 1.065 ± 0.007 | 1.875 ± 0.018 | 2.307 ± 0.014 | 1.851 ± 0.013 | 2.029 ± 0.012 | 3.344 ± 0.015 | 2.245 ± 0.017 | 2.328 ± 0.011 | 0.454 ± 0.005 | 1.14 ± 0.009 | 0.0 ± 0.0 |
| Pro | 4.29 ± 0.028 | 1.063 ± 0.011 | 2.996 ± 0.022 | 3.796 ± 0.019 | 1.924 ± 0.014 | 4.118 ± 0.042 | 1.493 ± 0.012 | 1.871 ± 0.01 | 2.636 ± 0.023 | 4.866 ± 0.02 | 1.007 ± 0.008 | 1.997 ± 0.012 | 5.605 ± 0.045 | 2.685 ± 0.018 | 2.769 ± 0.016 | 5.705 ± 0.034 | 3.202 ± 0.024 | 3.789 ± 0.017 | 0.527 ± 0.006 | 1.384 ± 0.01 | 0.0 ± 0.0 |
| Gln | 3.071 ± 0.016 | 0.93 ± 0.009 | 2.327 ± 0.013 | 3.5 ± 0.018 | 1.361 ± 0.009 | 2.605 ± 0.013 | 1.383 ± 0.01 | 1.99 ± 0.01 | 2.735 ± 0.015 | 4.477 ± 0.021 | 1.142 ± 0.008 | 1.892 ± 0.011 | 2.57 ± 0.019 | 3.369 ± 0.028 | 3.09 ± 0.015 | 3.588 ± 0.02 | 2.63 ± 0.012 | 2.76 ± 0.015 | 0.538 ± 0.006 | 1.199 ± 0.01 | 0.0 ± 0.0 |
| Arg | 3.321 ± 0.015 | 1.258 ± 0.01 | 2.928 ± 0.016 | 3.823 ± 0.019 | 2.043 ± 0.011 | 3.444 ± 0.018 | 1.583 ± 0.011 | 2.446 ± 0.011 | 3.651 ± 0.019 | 5.314 ± 0.021 | 1.279 ± 0.009 | 2.182 ± 0.011 | 3.022 ± 0.017 | 2.629 ± 0.013 | 4.447 ± 0.021 | 4.736 ± 0.022 | 3.018 ± 0.012 | 3.206 ± 0.015 | 0.672 ± 0.007 | 1.519 ± 0.009 | 0.0 ± 0.0 |
| Ser | 5.797 ± 0.023 | 2.223 ± 0.016 | 4.477 ± 0.02 | 5.151 ± 0.033 | 3.336 ± 0.018 | 5.89 ± 0.03 | 2.273 ± 0.013 | 3.36 ± 0.012 | 4.146 ± 0.019 | 8.41 ± 0.024 | 1.794 ± 0.01 | 3.17 ± 0.023 | 6.087 ± 0.036 | 3.95 ± 0.018 | 4.719 ± 0.022 | 10.79 ± 0.059 | 4.844 ± 0.027 | 5.507 ± 0.019 | 1.024 ± 0.009 | 2.195 ± 0.011 | 0.0 ± 0.0 |
| Thr | 4.004 ± 0.022 | 1.44 ± 0.015 | 2.891 ± 0.014 | 3.781 ± 0.023 | 2.162 ± 0.016 | 3.801 ± 0.017 | 1.411 ± 0.009 | 2.411 ± 0.013 | 2.758 ± 0.014 | 5.29 ± 0.018 | 1.191 ± 0.008 | 2.001 ± 0.011 | 3.624 ± 0.021 | 2.355 ± 0.011 | 2.549 ± 0.012 | 4.981 ± 0.03 | 3.483 ± 0.069 | 4.115 ± 0.018 | 0.667 ± 0.007 | 1.417 ± 0.011 | 0.0 ± 0.0 |
| Val | 4.047 ± 0.016 | 1.68 ± 0.011 | 3.119 ± 0.016 | 3.998 ± 0.019 | 2.663 ± 0.015 | 3.534 ± 0.016 | 1.566 ± 0.009 | 2.959 ± 0.015 | 3.594 ± 0.017 | 6.157 ± 0.026 | 1.5 ± 0.009 | 2.479 ± 0.016 | 3.287 ± 0.018 | 2.763 ± 0.013 | 3.255 ± 0.012 | 5.357 ± 0.018 | 3.944 ± 0.016 | 4.317 ± 0.019 | 0.755 ± 0.006 | 1.773 ± 0.011 | 0.0 ± 0.0 |
| Trp | 0.65 ± 0.006 | 0.243 ± 0.003 | 0.635 ± 0.007 | 0.736 ± 0.007 | 0.48 ± 0.005 | 0.626 ± 0.007 | 0.26 ± 0.004 | 0.572 ± 0.006 | 0.711 ± 0.006 | 1.121 ± 0.009 | 0.348 ± 0.005 | 0.504 ± 0.005 | 0.428 ± 0.005 | 0.467 ± 0.004 | 0.748 ± 0.007 | 0.977 ± 0.009 | 0.732 ± 0.007 | 0.669 ± 0.006 | 0.19 ± 0.004 | 0.342 ± 0.005 | 0.0 ± 0.0 |
| Tyr | 1.383 ± 0.008 | 0.71 ± 0.008 | 1.363 ± 0.011 | 1.555 ± 0.01 | 1.202 ± 0.009 | 1.634 ± 0.01 | 0.787 ± 0.006 | 1.438 ± 0.011 | 1.469 ± 0.01 | 2.547 ± 0.011 | 0.654 ± 0.006 | 1.183 ± 0.008 | 1.276 ± 0.009 | 1.232 ± 0.008 | 1.692 ± 0.01 | 2.356 ± 0.012 | 1.562 ± 0.01 | 1.555 ± 0.01 | 0.374 ± 0.005 | 0.965 ± 0.008 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 35279 proteins (21003233 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
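A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is a minimal sketch under assumptions; the function name `bootstrap_error`, the use of the standard deviation as the reported error, the fixed seed, and the toy sequences are all choices made for this example, not details confirmed by the source.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Hypothetical helper: resamples whole proteins with replacement
    n_rounds times and returns the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    # Count pairs once per protein so each replicate only sums counters
    per_protein = [Counter(a + b for a, b in zip(s, s[1:])) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (not real platyfish proteins), one-letter amino acid codes
toy = ["LLLL", "AAAA", "LLAA", "MKLLA"]
error_ll = bootstrap_error(toy, "LL")
print(error_ll)
```

Resampling proteins rather than individual residue pairs keeps the pairs within one protein together, so the estimate reflects variation between proteins.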

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski