Amino acid dipeptide frequency for Dendrobium catenatum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
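As an illustration of what such a frequency means, the following minimal Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is not the Proteome-pI implementation: the function name dipeptide_frequencies, the use of one-letter codes (with X standing for Xaa), and the choice to normalize by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all overlapping residue pairs.

    Assumption: pairs are counted within each protein only (never across two
    proteins) and normalized by the total number of pairs in the data set.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(seq, seq[1:]) yields every overlapping pair of adjacent residues.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MALLS", "MAL"])
```

With the two toy sequences there are six pairs in total, so "MA" and "AL" (two occurrences each) each account for one third of all pairs.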

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
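The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names uniform_expectation and classify are hypothetical, and the simple greater-than/less-than rule is an assumption about how over- and underrepresentation is decided; the page itself may apply a different threshold. The observed values are taken from the table on this page.

```python
def uniform_expectation(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation(20)  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# A few values copied from the table (per mille).
observed = {"LeuLeu": 10.847, "SerSer": 11.501, "TrpCys": 0.227, "AlaAsn": 2.598}
labels = {dipeptide: classify(freq, expected) for dipeptide, freq in observed.items()}
```

Under this rule LeuLeu, SerSer and AlaAsn come out as overrepresented and TrpCys as underrepresented. Counting Xaa as a 21st residue would lower the expectation slightly, to 1000/441 per mille.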

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 7.038‰ corresponds to 0.007038.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Second residues are listed in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
  AlaAla: 7.038 ± 0.046
  AlaCys: 1.291 ± 0.013
  AlaAsp: 3.335 ± 0.019
  AlaGlu: 4.272 ± 0.025
  AlaPhe: 2.861 ± 0.017
  AlaGly: 4.533 ± 0.024
  AlaHis: 1.335 ± 0.012
  AlaIle: 3.964 ± 0.024
  AlaLys: 3.679 ± 0.022
  AlaLeu: 6.812 ± 0.035
  AlaMet: 1.859 ± 0.013
  AlaAsn: 2.598 ± 0.017
  AlaPro: 2.851 ± 0.021
  AlaGln: 2.0 ± 0.016
  AlaArg: 3.713 ± 0.02
  AlaSer: 6.304 ± 0.027
  AlaThr: 3.563 ± 0.02
  AlaVal: 4.82 ± 0.028
  AlaTrp: 0.777 ± 0.009
  AlaTyr: 1.833 ± 0.014
  AlaXaa: 0.0 ± 0.0
Cys
  CysAla: 1.033 ± 0.01
  CysCys: 0.527 ± 0.009
  CysAsp: 0.861 ± 0.01
  CysGlu: 0.875 ± 0.01
  CysPhe: 0.985 ± 0.01
  CysGly: 1.323 ± 0.015
  CysHis: 0.511 ± 0.008
  CysIle: 1.054 ± 0.009
  CysLys: 1.156 ± 0.013
  CysLeu: 1.951 ± 0.015
  CysMet: 0.452 ± 0.006
  CysAsn: 0.893 ± 0.011
  CysPro: 0.979 ± 0.011
  CysGln: 0.622 ± 0.008
  CysArg: 1.182 ± 0.011
  CysSer: 2.028 ± 0.016
  CysThr: 0.823 ± 0.01
  CysVal: 0.989 ± 0.012
  CysTrp: 0.29 ± 0.006
  CysTyr: 0.552 ± 0.009
  CysXaa: 0.0 ± 0.0
Asp
  AspAla: 3.468 ± 0.021
  AspCys: 0.984 ± 0.01
  AspAsp: 3.336 ± 0.025
  AspGlu: 3.742 ± 0.025
  AspPhe: 2.484 ± 0.018
  AspGly: 3.784 ± 0.022
  AspHis: 1.231 ± 0.011
  AspIle: 3.083 ± 0.019
  AspLys: 2.404 ± 0.02
  AspLeu: 5.171 ± 0.022
  AspMet: 1.227 ± 0.012
  AspAsn: 2.011 ± 0.016
  AspPro: 2.522 ± 0.016
  AspGln: 1.665 ± 0.014
  AspArg: 2.534 ± 0.019
  AspSer: 4.103 ± 0.025
  AspThr: 1.908 ± 0.016
  AspVal: 3.455 ± 0.021
  AspTrp: 0.756 ± 0.009
  AspTyr: 1.467 ± 0.012
  AspXaa: 0.0 ± 0.0
Glu
  GluAla: 4.753 ± 0.028
  GluCys: 0.895 ± 0.012
  GluAsp: 3.744 ± 0.022
  GluGlu: 6.156 ± 0.045
  GluPhe: 2.375 ± 0.015
  GluGly: 3.733 ± 0.022
  GluHis: 1.238 ± 0.011
  GluIle: 3.842 ± 0.024
  GluLys: 4.44 ± 0.029
  GluLeu: 5.909 ± 0.032
  GluMet: 1.787 ± 0.017
  GluAsn: 2.87 ± 0.018
  GluPro: 2.032 ± 0.016
  GluGln: 2.073 ± 0.017
  GluArg: 3.484 ± 0.023
  GluSer: 4.237 ± 0.023
  GluThr: 2.731 ± 0.018
  GluVal: 4.075 ± 0.021
  GluTrp: 0.739 ± 0.009
  GluTyr: 1.506 ± 0.012
  GluXaa: 0.0 ± 0.0
Phe
  PheAla: 2.617 ± 0.019
  PheCys: 1.018 ± 0.012
  PheAsp: 2.479 ± 0.016
  PheGlu: 2.316 ± 0.016
  PhePhe: 2.29 ± 0.019
  PheGly: 3.017 ± 0.019
  PheHis: 1.272 ± 0.013
  PheIle: 2.36 ± 0.016
  PheLys: 2.123 ± 0.017
  PheLeu: 4.809 ± 0.028
  PheMet: 1.009 ± 0.01
  PheAsn: 1.872 ± 0.014
  PhePro: 2.217 ± 0.018
  PheGln: 1.558 ± 0.014
  PheArg: 2.206 ± 0.015
  PheSer: 4.499 ± 0.023
  PheThr: 2.007 ± 0.018
  PheVal: 2.744 ± 0.019
  PheTrp: 0.592 ± 0.008
  PheTyr: 1.333 ± 0.013
  PheXaa: 0.0 ± 0.0
Gly
  GlyAla: 3.725 ± 0.025
  GlyCys: 1.333 ± 0.013
  GlyAsp: 3.309 ± 0.02
  GlyGlu: 3.73 ± 0.021
  GlyPhe: 3.299 ± 0.021
  GlyGly: 5.261 ± 0.043
  GlyHis: 1.546 ± 0.014
  GlyIle: 3.62 ± 0.024
  GlyLys: 3.837 ± 0.021
  GlyLeu: 5.873 ± 0.025
  GlyMet: 1.501 ± 0.013
  GlyAsn: 2.883 ± 0.02
  GlyPro: 2.269 ± 0.016
  GlyGln: 1.885 ± 0.014
  GlyArg: 4.117 ± 0.026
  GlySer: 5.71 ± 0.029
  GlyThr: 2.894 ± 0.019
  GlyVal: 3.861 ± 0.023
  GlyTrp: 0.885 ± 0.009
  GlyTyr: 1.969 ± 0.016
  GlyXaa: 0.0 ± 0.0
His
  HisAla: 1.476 ± 0.013
  HisCys: 0.557 ± 0.007
  HisAsp: 1.092 ± 0.01
  HisGlu: 1.246 ± 0.011
  HisPhe: 1.168 ± 0.012
  HisGly: 1.761 ± 0.016
  HisHis: 0.865 ± 0.012
  HisIle: 1.42 ± 0.014
  HisLys: 1.175 ± 0.011
  HisLeu: 2.711 ± 0.016
  HisMet: 0.545 ± 0.007
  HisAsn: 0.991 ± 0.011
  HisPro: 1.456 ± 0.011
  HisGln: 1.023 ± 0.011
  HisArg: 1.525 ± 0.013
  HisSer: 2.134 ± 0.016
  HisThr: 0.96 ± 0.012
  HisVal: 1.463 ± 0.012
  HisTrp: 0.326 ± 0.005
  HisTyr: 0.714 ± 0.008
  HisXaa: 0.0 ± 0.0
Ile
  IleAla: 3.871 ± 0.021
  IleCys: 1.204 ± 0.012
  IleAsp: 2.995 ± 0.019
  IleGlu: 3.274 ± 0.019
  IlePhe: 2.554 ± 0.017
  IleGly: 3.367 ± 0.021
  IleHis: 1.462 ± 0.014
  IleIle: 3.142 ± 0.02
  IleLys: 3.059 ± 0.021
  IleLeu: 5.682 ± 0.023
  IleMet: 1.21 ± 0.011
  IleAsn: 2.401 ± 0.016
  IlePro: 3.079 ± 0.022
  IleGln: 2.081 ± 0.016
  IleArg: 2.926 ± 0.016
  IleSer: 5.615 ± 0.024
  IleThr: 2.722 ± 0.018
  IleVal: 3.374 ± 0.021
  IleTrp: 0.81 ± 0.009
  IleTyr: 1.631 ± 0.014
  IleXaa: 0.0 ± 0.0
Lys
  LysAla: 3.916 ± 0.019
  LysCys: 0.946 ± 0.011
  LysAsp: 2.987 ± 0.023
  LysGlu: 4.398 ± 0.027
  LysPhe: 2.238 ± 0.016
  LysGly: 3.432 ± 0.02
  LysHis: 1.373 ± 0.012
  LysIle: 3.394 ± 0.022
  LysLys: 4.565 ± 0.035
  LysLeu: 5.888 ± 0.028
  LysMet: 1.519 ± 0.014
  LysAsn: 2.654 ± 0.018
  LysPro: 2.6 ± 0.017
  LysGln: 2.16 ± 0.016
  LysArg: 3.364 ± 0.024
  LysSer: 4.449 ± 0.027
  LysThr: 2.63 ± 0.019
  LysVal: 3.575 ± 0.023
  LysTrp: 0.803 ± 0.009
  LysTyr: 1.481 ± 0.015
  LysXaa: 0.0 ± 0.0
Leu
  LeuAla: 6.748 ± 0.03
  LeuCys: 2.025 ± 0.017
  LeuAsp: 5.003 ± 0.024
  LeuGlu: 6.046 ± 0.034
  LeuPhe: 4.177 ± 0.027
  LeuGly: 5.692 ± 0.029
  LeuHis: 2.901 ± 0.021
  LeuIle: 5.117 ± 0.024
  LeuLys: 6.025 ± 0.03
  LeuLeu: 10.847 ± 0.049
  LeuMet: 2.206 ± 0.018
  LeuAsn: 3.993 ± 0.024
  LeuPro: 5.7 ± 0.03
  LeuGln: 4.321 ± 0.025
  LeuArg: 5.945 ± 0.027
  LeuSer: 8.982 ± 0.034
  LeuThr: 4.466 ± 0.028
  LeuVal: 6.175 ± 0.028
  LeuTrp: 1.187 ± 0.012
  LeuTyr: 2.597 ± 0.018
  LeuXaa: 0.0 ± 0.0
Met
  MetAla: 2.199 ± 0.015
  MetCys: 0.313 ± 0.006
  MetAsp: 1.39 ± 0.013
  MetGlu: 2.099 ± 0.017
  MetPhe: 0.783 ± 0.01
  MetGly: 1.478 ± 0.012
  MetHis: 0.558 ± 0.007
  MetIle: 1.258 ± 0.011
  MetLys: 1.667 ± 0.013
  MetLeu: 2.209 ± 0.017
  MetMet: 0.657 ± 0.009
  MetAsn: 1.036 ± 0.011
  MetPro: 1.151 ± 0.011
  MetGln: 0.918 ± 0.01
  MetArg: 1.291 ± 0.013
  MetSer: 1.674 ± 0.012
  MetThr: 1.007 ± 0.01
  MetVal: 1.656 ± 0.015
  MetTrp: 0.258 ± 0.005
  MetTyr: 0.554 ± 0.009
  MetXaa: 0.0 ± 0.0
Asn
  AsnAla: 2.704 ± 0.016
  AsnCys: 0.893 ± 0.01
  AsnAsp: 2.016 ± 0.015
  AsnGlu: 2.375 ± 0.019
  AsnPhe: 2.01 ± 0.015
  AsnGly: 3.0 ± 0.019
  AsnHis: 1.112 ± 0.012
  AsnIle: 2.672 ± 0.02
  AsnLys: 2.317 ± 0.018
  AsnLeu: 4.716 ± 0.034
  AsnMet: 1.038 ± 0.011
  AsnAsn: 2.087 ± 0.02
  AsnPro: 2.369 ± 0.017
  AsnGln: 1.623 ± 0.016
  AsnArg: 2.12 ± 0.012
  AsnSer: 4.03 ± 0.025
  AsnThr: 1.889 ± 0.015
  AsnVal: 2.617 ± 0.018
  AsnTrp: 0.558 ± 0.007
  AsnTyr: 1.277 ± 0.014
  AsnXaa: 0.0 ± 0.0
Pro
  ProAla: 3.4 ± 0.022
  ProCys: 0.799 ± 0.011
  ProAsp: 2.434 ± 0.02
  ProGlu: 2.865 ± 0.019
  ProPhe: 2.185 ± 0.017
  ProGly: 2.511 ± 0.02
  ProHis: 1.133 ± 0.011
  ProIle: 2.661 ± 0.02
  ProLys: 2.547 ± 0.019
  ProLeu: 4.683 ± 0.022
  ProMet: 0.977 ± 0.011
  ProAsn: 2.311 ± 0.018
  ProPro: 3.993 ± 0.044
  ProGln: 1.659 ± 0.015
  ProArg: 2.467 ± 0.019
  ProSer: 5.463 ± 0.027
  ProThr: 2.76 ± 0.021
  ProVal: 3.027 ± 0.019
  ProTrp: 0.628 ± 0.009
  ProTyr: 1.332 ± 0.012
  ProXaa: 0.0 ± 0.0
Gln
  GlnAla: 2.322 ± 0.018
  GlnCys: 0.56 ± 0.008
  GlnAsp: 1.483 ± 0.013
  GlnGlu: 2.192 ± 0.017
  GlnPhe: 1.37 ± 0.011
  GlnGly: 1.899 ± 0.012
  GlnHis: 0.94 ± 0.012
  GlnIle: 2.137 ± 0.016
  GlnLys: 2.183 ± 0.017
  GlnLeu: 3.595 ± 0.021
  GlnMet: 0.912 ± 0.009
  GlnAsn: 1.761 ± 0.016
  GlnPro: 1.771 ± 0.017
  GlnGln: 1.846 ± 0.027
  GlnArg: 2.104 ± 0.016
  GlnSer: 2.75 ± 0.018
  GlnThr: 1.663 ± 0.014
  GlnVal: 2.151 ± 0.014
  GlnTrp: 0.493 ± 0.007
  GlnTyr: 0.925 ± 0.01
  GlnXaa: 0.0 ± 0.0
Arg
  ArgAla: 3.46 ± 0.022
  ArgCys: 1.083 ± 0.013
  ArgAsp: 2.539 ± 0.015
  ArgGlu: 3.4 ± 0.023
  ArgPhe: 2.505 ± 0.018
  ArgGly: 3.319 ± 0.025
  ArgHis: 1.414 ± 0.011
  ArgIle: 3.12 ± 0.018
  ArgLys: 3.819 ± 0.021
  ArgLeu: 5.507 ± 0.023
  ArgMet: 1.464 ± 0.014
  ArgAsn: 2.454 ± 0.017
  ArgPro: 2.702 ± 0.017
  ArgGln: 1.923 ± 0.016
  ArgArg: 4.857 ± 0.031
  ArgSer: 4.902 ± 0.024
  ArgThr: 2.601 ± 0.017
  ArgVal: 3.085 ± 0.022
  ArgTrp: 0.891 ± 0.01
  ArgTyr: 1.483 ± 0.012
  ArgXaa: 0.0 ± 0.0
Ser
  SerAla: 5.787 ± 0.027
  SerCys: 1.789 ± 0.014
  SerAsp: 4.393 ± 0.025
  SerGlu: 4.473 ± 0.027
  SerPhe: 4.366 ± 0.022
  SerGly: 5.674 ± 0.034
  SerHis: 2.162 ± 0.014
  SerIle: 5.059 ± 0.023
  SerLys: 4.826 ± 0.028
  SerLeu: 9.141 ± 0.036
  SerMet: 2.087 ± 0.014
  SerAsn: 4.124 ± 0.024
  SerPro: 4.944 ± 0.031
  SerGln: 2.91 ± 0.019
  SerArg: 4.631 ± 0.024
  SerSer: 11.501 ± 0.053
  SerThr: 4.853 ± 0.027
  SerVal: 5.027 ± 0.025
  SerTrp: 1.246 ± 0.014
  SerTyr: 2.339 ± 0.017
  SerXaa: 0.0 ± 0.0
Thr
  ThrAla: 3.548 ± 0.021
  ThrCys: 0.873 ± 0.009
  ThrAsp: 2.189 ± 0.016
  ThrGlu: 2.643 ± 0.017
  ThrPhe: 2.129 ± 0.017
  ThrGly: 3.073 ± 0.02
  ThrHis: 1.02 ± 0.01
  ThrIle: 2.839 ± 0.021
  ThrLys: 2.584 ± 0.019
  ThrLeu: 4.367 ± 0.022
  ThrMet: 1.114 ± 0.011
  ThrAsn: 2.015 ± 0.015
  ThrPro: 2.483 ± 0.021
  ThrGln: 1.375 ± 0.012
  ThrArg: 2.28 ± 0.016
  ThrSer: 4.492 ± 0.023
  ThrThr: 2.743 ± 0.02
  ThrVal: 3.159 ± 0.022
  ThrTrp: 0.646 ± 0.008
  ThrTyr: 1.34 ± 0.014
  ThrXaa: 0.0 ± 0.0
Val
  ValAla: 4.782 ± 0.025
  ValCys: 1.137 ± 0.011
  ValAsp: 3.668 ± 0.02
  ValGlu: 4.277 ± 0.026
  ValPhe: 2.682 ± 0.017
  ValGly: 3.992 ± 0.022
  ValHis: 1.445 ± 0.012
  ValIle: 3.404 ± 0.02
  ValLys: 3.509 ± 0.022
  ValLeu: 6.162 ± 0.029
  ValMet: 1.512 ± 0.015
  ValAsn: 2.483 ± 0.017
  ValPro: 2.971 ± 0.016
  ValGln: 2.076 ± 0.015
  ValArg: 3.221 ± 0.02
  ValSer: 5.103 ± 0.027
  ValThr: 2.771 ± 0.018
  ValVal: 4.572 ± 0.028
  ValTrp: 0.765 ± 0.009
  ValTyr: 1.71 ± 0.014
  ValXaa: 0.0 ± 0.0
Trp
  TrpAla: 0.796 ± 0.008
  TrpCys: 0.227 ± 0.005
  TrpAsp: 0.674 ± 0.008
  TrpGlu: 0.777 ± 0.01
  TrpPhe: 0.557 ± 0.008
  TrpGly: 0.712 ± 0.01
  TrpHis: 0.348 ± 0.007
  TrpIle: 0.811 ± 0.008
  TrpLys: 0.984 ± 0.01
  TrpLeu: 1.305 ± 0.013
  TrpMet: 0.384 ± 0.007
  TrpAsn: 0.7 ± 0.01
  TrpPro: 0.549 ± 0.008
  TrpGln: 0.429 ± 0.007
  TrpArg: 0.989 ± 0.012
  TrpSer: 1.06 ± 0.009
  TrpThr: 0.682 ± 0.008
  TrpVal: 0.751 ± 0.01
  TrpTrp: 0.248 ± 0.006
  TrpTyr: 0.324 ± 0.006
  TrpXaa: 0.0 ± 0.0
Tyr
  TyrAla: 1.796 ± 0.014
  TyrCys: 0.63 ± 0.009
  TyrAsp: 1.405 ± 0.012
  TyrGlu: 1.465 ± 0.013
  TyrPhe: 1.32 ± 0.012
  TyrGly: 1.945 ± 0.018
  TyrHis: 0.743 ± 0.01
  TyrIle: 1.502 ± 0.012
  TyrLys: 1.473 ± 0.012
  TyrLeu: 2.814 ± 0.017
  TyrMet: 0.695 ± 0.007
  TyrAsn: 1.287 ± 0.013
  TyrPro: 1.197 ± 0.013
  TyrGln: 0.912 ± 0.01
  TyrArg: 1.573 ± 0.013
  TyrSer: 2.297 ± 0.016
  TyrThr: 1.236 ± 0.014
  TyrVal: 1.667 ± 0.014
  TyrTrp: 0.41 ± 0.006
  TyrTyr: 0.906 ± 0.011
  TyrXaa: 0.0 ± 0.0
Xaa
  XaaAla: 0.0 ± 0.0
  XaaCys: 0.0 ± 0.0
  XaaAsp: 0.0 ± 0.0
  XaaGlu: 0.0 ± 0.0
  XaaPhe: 0.0 ± 0.0
  XaaGly: 0.0 ± 0.0
  XaaHis: 0.0 ± 0.0
  XaaIle: 0.0 ± 0.0
  XaaLys: 0.0 ± 0.0
  XaaLeu: 0.0 ± 0.0
  XaaMet: 0.0 ± 0.0
  XaaAsn: 0.0 ± 0.0
  XaaPro: 0.0 ± 0.0
  XaaGln: 0.0 ± 0.0
  XaaArg: 0.0 ± 0.0
  XaaSer: 0.0 ± 0.0
  XaaThr: 0.0 ± 0.0
  XaaVal: 0.0 ± 0.0
  XaaTrp: 0.0 ± 0.0
  XaaTyr: 0.0 ± 0.0
  XaaXaa: 0.016 ± 0.007

Statistics based on 29029 proteins (9774483 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
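A protein-level bootstrap can be sketched roughly as follows. This is only an illustration under stated assumptions, not the procedure used by Proteome-pI: the function name bootstrap_error is hypothetical, whole proteins are resampled with replacement, and the sample standard deviation of the resampled frequencies is assumed to be the reported error (the source does not say which spread measure is used).

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency (per mille) by bootstrapping.

    Whole proteins are the resampling unit, so residues within one protein
    always stay together. Requires n_resamples >= 2.
    """
    rng = random.Random(seed)
    # Precompute (occurrences of `pair`, total number of pairs) for each protein.
    per_protein = []
    for seq in proteins:
        counts = Counter(a + b for a, b in zip(seq, seq[1:]))
        per_protein.append((counts[pair], sum(counts.values())))

    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (hypothetical sequences); the seed only makes the run repeatable.
error = bootstrap_error(["MALLS", "MAL", "LLLL", "GAVLL"], "LL", n_resamples=100)
```

Resampling proteins rather than individual residues keeps the pairs inside each protein intact, which is what "at the protein level" suggests.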

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski