Amino acid dipeptide frequency for Xenopus laevis (African clawed frog)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
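As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is not the pipeline used for this page: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues is mapped to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs, never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: two short made-up sequences, the second with a non-standard residue (U).
freqs = dipeptide_per_mille(["MKLLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the sequences would come from the proteome FASTA file rather than from a hard-coded list.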

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
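The arithmetic behind the baseline and the unit can be sketched as follows. The helper names are hypothetical, and the classification rule (a plain comparison against the uniform expectation of 1/400 or 1/441) is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

def classify(per_mille, alphabet_size=20):
    """Label a value relative to the uniform baseline (assumed rule)."""
    baseline = expected_per_mille(alphabet_size)
    if per_mille > baseline:
        return "over"
    if per_mille < baseline:
        return "under"
    return "expected"

# Two values taken from the table: LeuLeu 9.945 and TrpTrp 0.204 (both in per mille).
print(expected_per_mille())   # 2.5
print(classify(9.945))        # over
print(classify(0.204))        # under
print(to_fraction(9.945))     # LeuLeu as a fraction of all dipeptides
```

Passing `alphabet_size=21` gives the baseline for the 441-dipeptide case that includes Xaa, roughly 2.27‰.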

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.903 ± 0.024
AlaCys: 1.23 ± 0.008
AlaAsp: 2.717 ± 0.012
AlaGlu: 4.084 ± 0.017
AlaPhe: 2.374 ± 0.012
AlaGly: 3.51 ± 0.02
AlaHis: 1.369 ± 0.01
AlaIle: 2.989 ± 0.014
AlaLys: 3.375 ± 0.017
AlaLeu: 5.779 ± 0.025
AlaMet: 1.422 ± 0.01
AlaAsn: 2.263 ± 0.013
AlaPro: 2.999 ± 0.019
AlaGln: 2.616 ± 0.014
AlaArg: 2.591 ± 0.013
AlaSer: 4.927 ± 0.02
AlaThr: 3.271 ± 0.018
AlaVal: 4.193 ± 0.017
AlaTrp: 0.61 ± 0.007
AlaTyr: 1.493 ± 0.009
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.249 ± 0.01
CysCys: 0.73 ± 0.008
CysAsp: 1.161 ± 0.012
CysGlu: 1.291 ± 0.013
CysPhe: 0.987 ± 0.007
CysGly: 1.616 ± 0.017
CysHis: 0.689 ± 0.007
CysIle: 1.343 ± 0.012
CysLys: 1.371 ± 0.011
CysLeu: 2.252 ± 0.013
CysMet: 0.534 ± 0.005
CysAsn: 1.124 ± 0.01
CysPro: 1.367 ± 0.015
CysGln: 1.072 ± 0.011
CysArg: 1.229 ± 0.01
CysSer: 2.321 ± 0.016
CysThr: 1.501 ± 0.014
CysVal: 1.452 ± 0.012
CysTrp: 0.299 ± 0.004
CysTyr: 0.725 ± 0.009
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.55 ± 0.013
AspCys: 1.138 ± 0.012
AspAsp: 2.751 ± 0.017
AspGlu: 3.424 ± 0.018
AspPhe: 2.172 ± 0.013
AspGly: 3.098 ± 0.018
AspHis: 1.142 ± 0.009
AspIle: 3.174 ± 0.019
AspLys: 2.808 ± 0.016
AspLeu: 4.848 ± 0.019
AspMet: 1.223 ± 0.009
AspAsn: 2.151 ± 0.013
AspPro: 2.616 ± 0.014
AspGln: 1.823 ± 0.015
AspArg: 2.354 ± 0.013
AspSer: 4.257 ± 0.02
AspThr: 2.747 ± 0.016
AspVal: 2.981 ± 0.016
AspTrp: 0.625 ± 0.006
AspTyr: 1.591 ± 0.011
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.058 ± 0.018
GluCys: 1.445 ± 0.018
GluAsp: 4.161 ± 0.021
GluGlu: 7.161 ± 0.04
GluPhe: 2.066 ± 0.012
GluGly: 3.635 ± 0.018
GluHis: 1.529 ± 0.01
GluIle: 3.527 ± 0.016
GluLys: 5.33 ± 0.028
GluLeu: 5.735 ± 0.028
GluMet: 1.738 ± 0.011
GluAsn: 3.426 ± 0.018
GluPro: 2.576 ± 0.017
GluGln: 3.054 ± 0.018
GluArg: 3.739 ± 0.019
GluSer: 4.655 ± 0.022
GluThr: 3.701 ± 0.025
GluVal: 3.79 ± 0.018
GluTrp: 0.677 ± 0.007
GluTyr: 1.711 ± 0.011
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.002 ± 0.012
PheCys: 1.129 ± 0.009
PheAsp: 1.713 ± 0.012
PheGlu: 1.858 ± 0.012
PhePhe: 1.786 ± 0.015
PheGly: 2.205 ± 0.013
PheHis: 1.109 ± 0.008
PheIle: 2.312 ± 0.013
PheLys: 1.988 ± 0.013
PheLeu: 4.224 ± 0.024
PheMet: 0.905 ± 0.007
PheAsn: 1.634 ± 0.008
PhePro: 2.019 ± 0.012
PheGln: 1.805 ± 0.009
PheArg: 1.824 ± 0.011
PheSer: 3.69 ± 0.017
PheThr: 2.338 ± 0.017
PheVal: 2.222 ± 0.013
PheTrp: 0.506 ± 0.006
PheTyr: 1.37 ± 0.01
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.347 ± 0.02
GlyCys: 1.241 ± 0.011
GlyAsp: 2.919 ± 0.017
GlyGlu: 3.715 ± 0.024
GlyPhe: 2.45 ± 0.014
GlyGly: 3.946 ± 0.026
GlyHis: 1.559 ± 0.01
GlyIle: 3.104 ± 0.016
GlyLys: 3.867 ± 0.018
GlyLeu: 4.869 ± 0.022
GlyMet: 1.347 ± 0.01
GlyAsn: 2.818 ± 0.013
GlyPro: 2.69 ± 0.035
GlyGln: 2.463 ± 0.013
GlyArg: 3.049 ± 0.017
GlySer: 5.24 ± 0.024
GlyThr: 3.533 ± 0.018
GlyVal: 3.279 ± 0.016
GlyTrp: 0.7 ± 0.007
GlyTyr: 1.888 ± 0.013
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.222 ± 0.009
HisCys: 0.761 ± 0.008
HisAsp: 0.938 ± 0.008
HisGlu: 1.301 ± 0.009
HisPhe: 1.179 ± 0.009
HisGly: 1.459 ± 0.011
HisHis: 0.958 ± 0.013
HisIle: 1.504 ± 0.01
HisLys: 1.476 ± 0.011
HisLeu: 2.812 ± 0.014
HisMet: 0.665 ± 0.007
HisAsn: 1.134 ± 0.009
HisPro: 1.519 ± 0.013
HisGln: 1.246 ± 0.011
HisArg: 1.503 ± 0.011
HisSer: 2.491 ± 0.016
HisThr: 1.678 ± 0.019
HisVal: 1.457 ± 0.01
HisTrp: 0.357 ± 0.005
HisTyr: 0.933 ± 0.008
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.842 ± 0.016
IleCys: 1.444 ± 0.011
IleAsp: 2.331 ± 0.015
IleGlu: 2.858 ± 0.015
IlePhe: 2.267 ± 0.014
IleGly: 2.665 ± 0.015
IleHis: 1.562 ± 0.01
IleIle: 3.18 ± 0.017
IleLys: 3.219 ± 0.018
IleLeu: 5.28 ± 0.024
IleMet: 1.238 ± 0.01
IleAsn: 2.438 ± 0.014
IlePro: 3.045 ± 0.015
IleGln: 2.562 ± 0.015
IleArg: 2.556 ± 0.015
IleSer: 4.746 ± 0.019
IleThr: 3.295 ± 0.018
IleVal: 3.059 ± 0.017
IleTrp: 0.599 ± 0.006
IleTyr: 1.817 ± 0.012
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.669 ± 0.016
LysCys: 1.39 ± 0.011
LysAsp: 3.43 ± 0.018
LysGlu: 5.274 ± 0.025
LysPhe: 1.794 ± 0.01
LysGly: 3.426 ± 0.024
LysHis: 1.608 ± 0.012
LysIle: 3.203 ± 0.017
LysLys: 5.209 ± 0.025
LysLeu: 5.278 ± 0.024
LysMet: 1.621 ± 0.011
LysAsn: 2.885 ± 0.014
LysPro: 3.074 ± 0.019
LysGln: 2.895 ± 0.025
LysArg: 3.519 ± 0.02
LysSer: 4.573 ± 0.023
LysThr: 3.474 ± 0.018
LysVal: 3.616 ± 0.017
LysTrp: 0.68 ± 0.006
LysTyr: 1.8 ± 0.012
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.472 ± 0.022
LeuCys: 2.375 ± 0.016
LeuAsp: 4.47 ± 0.019
LeuGlu: 6.214 ± 0.028
LeuPhe: 3.583 ± 0.022
LeuGly: 4.869 ± 0.022
LeuHis: 2.864 ± 0.015
LeuIle: 4.502 ± 0.022
LeuLys: 5.963 ± 0.024
LeuLeu: 9.945 ± 0.043
LeuMet: 2.081 ± 0.013
LeuAsn: 4.094 ± 0.018
LeuPro: 5.221 ± 0.026
LeuGln: 5.387 ± 0.026
LeuArg: 4.897 ± 0.022
LeuSer: 8.262 ± 0.027
LeuThr: 5.147 ± 0.022
LeuVal: 5.142 ± 0.021
LeuTrp: 1.043 ± 0.009
LeuTyr: 2.885 ± 0.017
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.792 ± 0.011
MetCys: 0.531 ± 0.006
MetAsp: 1.374 ± 0.01
MetGlu: 1.994 ± 0.012
MetPhe: 0.888 ± 0.008
MetGly: 1.361 ± 0.013
MetHis: 0.543 ± 0.006
MetIle: 1.026 ± 0.008
MetLys: 1.624 ± 0.011
MetLeu: 2.073 ± 0.012
MetMet: 0.634 ± 0.007
MetAsn: 1.08 ± 0.009
MetPro: 1.13 ± 0.013
MetGln: 1.068 ± 0.008
MetArg: 1.115 ± 0.009
MetSer: 1.898 ± 0.011
MetThr: 1.198 ± 0.009
MetVal: 1.396 ± 0.009
MetTrp: 0.261 ± 0.005
MetTyr: 0.746 ± 0.008
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.35 ± 0.013
AsnCys: 1.095 ± 0.01
AsnAsp: 1.95 ± 0.013
AsnGlu: 2.687 ± 0.016
AsnPhe: 1.643 ± 0.011
AsnGly: 2.928 ± 0.02
AsnHis: 1.108 ± 0.009
AsnIle: 2.841 ± 0.015
AsnLys: 2.807 ± 0.014
AsnLeu: 4.106 ± 0.019
AsnMet: 1.184 ± 0.007
AsnAsn: 2.226 ± 0.014
AsnPro: 2.44 ± 0.014
AsnGln: 1.916 ± 0.012
AsnArg: 2.103 ± 0.011
AsnSer: 3.854 ± 0.018
AsnThr: 2.63 ± 0.016
AsnVal: 2.711 ± 0.015
AsnTrp: 0.515 ± 0.005
AsnTyr: 1.375 ± 0.009
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.417 ± 0.02
ProCys: 1.171 ± 0.011
ProAsp: 2.59 ± 0.016
ProGlu: 3.602 ± 0.018
ProPhe: 2.017 ± 0.014
ProGly: 3.581 ± 0.044
ProHis: 1.352 ± 0.012
ProIle: 2.297 ± 0.014
ProLys: 2.796 ± 0.021
ProLeu: 4.714 ± 0.023
ProMet: 1.09 ± 0.009
ProAsn: 2.162 ± 0.013
ProPro: 4.453 ± 0.04
ProGln: 2.377 ± 0.018
ProArg: 2.423 ± 0.017
ProSer: 5.219 ± 0.029
ProThr: 3.096 ± 0.023
ProVal: 3.658 ± 0.017
ProTrp: 0.536 ± 0.006
ProTyr: 1.527 ± 0.012
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.698 ± 0.015
GlnCys: 1.062 ± 0.01
GlnAsp: 2.25 ± 0.019
GlnGlu: 3.515 ± 0.02
GlnPhe: 1.456 ± 0.009
GlnGly: 2.504 ± 0.017
GlnHis: 1.304 ± 0.01
GlnIle: 2.365 ± 0.012
GlnLys: 3.069 ± 0.02
GlnLeu: 4.306 ± 0.022
GlnMet: 1.19 ± 0.009
GlnAsn: 2.231 ± 0.013
GlnPro: 2.286 ± 0.018
GlnGln: 2.948 ± 0.035
GlnArg: 2.664 ± 0.015
GlnSer: 3.502 ± 0.02
GlnThr: 2.552 ± 0.016
GlnVal: 2.572 ± 0.013
GlnTrp: 0.54 ± 0.005
GlnTyr: 1.326 ± 0.01
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.76 ± 0.014
ArgCys: 1.133 ± 0.012
ArgAsp: 2.552 ± 0.013
ArgGlu: 3.497 ± 0.019
ArgPhe: 1.853 ± 0.011
ArgGly: 2.914 ± 0.02
ArgHis: 1.458 ± 0.01
ArgIle: 2.649 ± 0.013
ArgLys: 3.758 ± 0.019
ArgLeu: 4.522 ± 0.018
ArgMet: 1.21 ± 0.01
ArgAsn: 2.352 ± 0.012
ArgPro: 2.423 ± 0.016
ArgGln: 2.353 ± 0.014
ArgArg: 3.639 ± 0.026
ArgSer: 4.12 ± 0.027
ArgThr: 2.681 ± 0.013
ArgVal: 2.765 ± 0.014
ArgTrp: 0.625 ± 0.007
ArgTyr: 1.54 ± 0.01
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.087 ± 0.022
SerCys: 2.192 ± 0.016
SerAsp: 4.354 ± 0.021
SerGlu: 5.167 ± 0.023
SerPhe: 3.432 ± 0.014
SerGly: 5.174 ± 0.024
SerHis: 2.318 ± 0.013
SerIle: 4.116 ± 0.017
SerLys: 4.543 ± 0.021
SerLeu: 8.375 ± 0.033
SerMet: 1.867 ± 0.012
SerAsn: 3.609 ± 0.017
SerPro: 5.527 ± 0.037
SerGln: 3.805 ± 0.021
SerArg: 4.19 ± 0.024
SerSer: 9.786 ± 0.055
SerThr: 5.083 ± 0.035
SerVal: 5.444 ± 0.023
SerTrp: 1.0 ± 0.009
SerTyr: 2.437 ± 0.014
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.626 ± 0.018
ThrCys: 1.576 ± 0.018
ThrAsp: 2.953 ± 0.015
ThrGlu: 3.976 ± 0.028
ThrPhe: 2.366 ± 0.013
ThrGly: 3.622 ± 0.021
ThrHis: 1.492 ± 0.02
ThrIle: 3.01 ± 0.018
ThrLys: 3.044 ± 0.019
ThrLeu: 5.411 ± 0.021
ThrMet: 1.268 ± 0.01
ThrAsn: 2.289 ± 0.017
ThrPro: 3.424 ± 0.022
ThrGln: 2.354 ± 0.015
ThrArg: 2.356 ± 0.012
ThrSer: 5.179 ± 0.039
ThrThr: 3.544 ± 0.055
ThrVal: 4.112 ± 0.019
ThrTrp: 0.684 ± 0.008
ThrTyr: 1.654 ± 0.011
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.625 ± 0.018
ValCys: 1.595 ± 0.013
ValAsp: 2.792 ± 0.016
ValGlu: 3.649 ± 0.016
ValPhe: 2.494 ± 0.016
ValGly: 3.121 ± 0.014
ValHis: 1.542 ± 0.009
ValIle: 3.283 ± 0.013
ValLys: 3.566 ± 0.02
ValLeu: 5.898 ± 0.025
ValMet: 1.394 ± 0.01
ValAsn: 2.544 ± 0.012
ValPro: 3.383 ± 0.017
ValGln: 2.782 ± 0.012
ValArg: 2.732 ± 0.012
ValSer: 5.279 ± 0.025
ValThr: 3.954 ± 0.02
ValVal: 3.803 ± 0.026
ValTrp: 0.726 ± 0.006
ValTyr: 1.866 ± 0.012
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.633 ± 0.006
TrpCys: 0.249 ± 0.004
TrpAsp: 0.623 ± 0.007
TrpGlu: 0.734 ± 0.007
TrpPhe: 0.445 ± 0.006
TrpGly: 0.63 ± 0.008
TrpHis: 0.296 ± 0.005
TrpIle: 0.704 ± 0.007
TrpLys: 0.825 ± 0.006
TrpLeu: 1.128 ± 0.01
TrpMet: 0.35 ± 0.005
TrpAsn: 0.588 ± 0.007
TrpPro: 0.426 ± 0.005
TrpGln: 0.511 ± 0.005
TrpArg: 0.658 ± 0.007
TrpSer: 0.859 ± 0.009
TrpThr: 0.652 ± 0.007
TrpVal: 0.688 ± 0.007
TrpTrp: 0.204 ± 0.004
TrpTyr: 0.378 ± 0.005
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.412 ± 0.01
TyrCys: 0.827 ± 0.008
TyrAsp: 1.446 ± 0.01
TyrGlu: 1.711 ± 0.011
TyrPhe: 1.387 ± 0.01
TyrGly: 1.764 ± 0.012
TyrHis: 0.793 ± 0.009
TyrIle: 1.871 ± 0.014
TyrLys: 1.777 ± 0.014
TyrLeu: 2.858 ± 0.017
TyrMet: 0.768 ± 0.008
TyrAsn: 1.424 ± 0.009
TyrPro: 1.459 ± 0.014
TyrGln: 1.304 ± 0.009
TyrArg: 1.665 ± 0.012
TyrSer: 2.649 ± 0.015
TyrThr: 1.867 ± 0.014
TyrVal: 1.689 ± 0.011
TyrTrp: 0.393 ± 0.006
TyrTyr: 1.126 ± 0.01
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.032 ± 0.012

Statistics based on 44580 proteins (19963946 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
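One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed for each resample. The sketch below follows that reading with 100 replicates. It is an illustration only: taking the standard deviation of the replicates as the reported error, the fixed seed, and the helper names `pair_frequency` and `bootstrap_error` are all assumptions, and the sequences are made-up toy data in one-letter code.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide, counting overlapping pairs within proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the number of proteins fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome: four short made-up sequences.
proteins = ["MKLLA", "MLLLK", "MKAAG", "MLLGA"]
error = bootstrap_error(proteins, "LL")
print(round(pair_frequency(proteins, "LL"), 3), "±", round(error, 3))
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring positions within a protein are preserved in every replicate.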

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski