Amino acid dipepetide frequency for Nicotiana tabacum (Common tobacco)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.039AlaAla: 6.039 ± 0.021
1.191AlaCys: 1.191 ± 0.008
3.132AlaAsp: 3.132 ± 0.011
4.344AlaGlu: 4.344 ± 0.016
2.687AlaPhe: 2.687 ± 0.01
3.859AlaGly: 3.859 ± 0.013
1.269AlaHis: 1.269 ± 0.007
3.814AlaIle: 3.814 ± 0.014
4.124AlaLys: 4.124 ± 0.013
6.33AlaLeu: 6.33 ± 0.019
1.69AlaMet: 1.69 ± 0.008
2.707AlaAsn: 2.707 ± 0.012
2.704AlaPro: 2.704 ± 0.013
2.134AlaGln: 2.134 ± 0.009
3.254AlaArg: 3.254 ± 0.012
5.751AlaSer: 5.751 ± 0.017
3.563AlaThr: 3.563 ± 0.011
4.597AlaVal: 4.597 ± 0.017
0.736AlaTrp: 0.736 ± 0.005
1.851AlaTyr: 1.851 ± 0.008
0.002AlaXaa: 0.002 ± 0.0
Cys
0.97CysAla: 0.97 ± 0.006
0.527CysCys: 0.527 ± 0.005
0.93CysAsp: 0.93 ± 0.006
0.915CysGlu: 0.915 ± 0.006
0.87CysPhe: 0.87 ± 0.005
1.347CysGly: 1.347 ± 0.007
0.461CysHis: 0.461 ± 0.004
1.07CysIle: 1.07 ± 0.005
1.172CysLys: 1.172 ± 0.007
1.841CysLeu: 1.841 ± 0.009
0.448CysMet: 0.448 ± 0.003
0.866CysAsn: 0.866 ± 0.006
0.944CysPro: 0.944 ± 0.006
0.636CysGln: 0.636 ± 0.005
1.024CysArg: 1.024 ± 0.006
1.826CysSer: 1.826 ± 0.009
0.902CysThr: 0.902 ± 0.005
1.007CysVal: 1.007 ± 0.006
0.246CysTrp: 0.246 ± 0.003
0.577CysTyr: 0.577 ± 0.004
0.001CysXaa: 0.001 ± 0.0
Asp
3.358AspAla: 3.358 ± 0.012
0.956AspCys: 0.956 ± 0.007
3.551AspAsp: 3.551 ± 0.016
4.125AspGlu: 4.125 ± 0.016
2.404AspPhe: 2.404 ± 0.008
3.496AspGly: 3.496 ± 0.012
1.224AspHis: 1.224 ± 0.007
3.255AspIle: 3.255 ± 0.011
2.909AspLys: 2.909 ± 0.012
5.021AspLeu: 5.021 ± 0.015
1.358AspMet: 1.358 ± 0.007
2.215AspAsn: 2.215 ± 0.009
2.478AspPro: 2.478 ± 0.012
1.804AspGln: 1.804 ± 0.008
2.326AspArg: 2.326 ± 0.01
4.212AspSer: 4.212 ± 0.013
2.268AspThr: 2.268 ± 0.009
3.622AspVal: 3.622 ± 0.012
0.693AspTrp: 0.693 ± 0.005
1.6AspTyr: 1.6 ± 0.007
0.002AspXaa: 0.002 ± 0.0
Glu
4.673GluAla: 4.673 ± 0.013
0.939GluCys: 0.939 ± 0.006
3.992GluAsp: 3.992 ± 0.013
6.561GluGlu: 6.561 ± 0.045
2.418GluPhe: 2.418 ± 0.009
3.584GluGly: 3.584 ± 0.012
1.301GluHis: 1.301 ± 0.006
3.965GluIle: 3.965 ± 0.012
5.216GluLys: 5.216 ± 0.034
6.158GluLeu: 6.158 ± 0.017
1.834GluMet: 1.834 ± 0.01
3.363GluAsn: 3.363 ± 0.012
2.15GluPro: 2.15 ± 0.019
2.288GluGln: 2.288 ± 0.01
3.368GluArg: 3.368 ± 0.013
4.72GluSer: 4.72 ± 0.017
3.088GluThr: 3.088 ± 0.012
4.385GluVal: 4.385 ± 0.011
0.727GluTrp: 0.727 ± 0.005
1.76GluTyr: 1.76 ± 0.008
0.002GluXaa: 0.002 ± 0.0
Phe
2.469PheAla: 2.469 ± 0.01
0.911PheCys: 0.911 ± 0.005
2.384PheAsp: 2.384 ± 0.009
2.324PheGlu: 2.324 ± 0.01
1.928PhePhe: 1.928 ± 0.011
2.977PheGly: 2.977 ± 0.012
1.118PheHis: 1.118 ± 0.006
2.139PheIle: 2.139 ± 0.01
2.202PheLys: 2.202 ± 0.009
4.351PheLeu: 4.351 ± 0.013
0.965PheMet: 0.965 ± 0.006
1.828PheAsn: 1.828 ± 0.008
2.136PhePro: 2.136 ± 0.009
1.654PheGln: 1.654 ± 0.009
1.989PheArg: 1.989 ± 0.008
4.076PheSer: 4.076 ± 0.012
2.035PheThr: 2.035 ± 0.009
2.688PheVal: 2.688 ± 0.01
0.555PheTrp: 0.555 ± 0.004
1.285PheTyr: 1.285 ± 0.007
0.003PheXaa: 0.003 ± 0.0
Gly
3.654GlyAla: 3.654 ± 0.014
1.204GlyCys: 1.204 ± 0.007
3.237GlyAsp: 3.237 ± 0.012
3.624GlyGlu: 3.624 ± 0.012
2.94GlyPhe: 2.94 ± 0.011
4.915GlyGly: 4.915 ± 0.026
1.509GlyHis: 1.509 ± 0.007
3.552GlyIle: 3.552 ± 0.012
4.152GlyLys: 4.152 ± 0.014
5.566GlyLeu: 5.566 ± 0.015
1.448GlyMet: 1.448 ± 0.008
3.112GlyAsn: 3.112 ± 0.012
2.309GlyPro: 2.309 ± 0.01
2.081GlyGln: 2.081 ± 0.01
3.459GlyArg: 3.459 ± 0.014
5.694GlySer: 5.694 ± 0.017
3.16GlyThr: 3.16 ± 0.012
3.956GlyVal: 3.956 ± 0.014
0.836GlyTrp: 0.836 ± 0.006
2.074GlyTyr: 2.074 ± 0.01
0.003GlyXaa: 0.003 ± 0.0
His
1.368HisAla: 1.368 ± 0.008
0.52HisCys: 0.52 ± 0.004
1.122HisAsp: 1.122 ± 0.007
1.308HisGlu: 1.308 ± 0.007
1.112HisPhe: 1.112 ± 0.007
1.572HisGly: 1.572 ± 0.008
0.837HisHis: 0.837 ± 0.007
1.31HisIle: 1.31 ± 0.007
1.23HisLys: 1.23 ± 0.007
2.464HisLeu: 2.464 ± 0.01
0.586HisMet: 0.586 ± 0.005
1.013HisAsn: 1.013 ± 0.006
1.307HisPro: 1.307 ± 0.006
1.028HisGln: 1.028 ± 0.007
1.264HisArg: 1.264 ± 0.007
1.936HisSer: 1.936 ± 0.009
0.987HisThr: 0.987 ± 0.006
1.536HisVal: 1.536 ± 0.007
0.298HisTrp: 0.298 ± 0.003
0.726HisTyr: 0.726 ± 0.005
0.001HisXaa: 0.001 ± 0.0
Ile
3.601IleAla: 3.601 ± 0.012
1.154IleCys: 1.154 ± 0.007
2.989IleAsp: 2.989 ± 0.011
3.376IleGlu: 3.376 ± 0.011
2.427IlePhe: 2.427 ± 0.01
3.458IleGly: 3.458 ± 0.014
1.32IleHis: 1.32 ± 0.007
3.013IleIle: 3.013 ± 0.012
3.179IleLys: 3.179 ± 0.01
5.467IleLeu: 5.467 ± 0.016
1.201IleMet: 1.201 ± 0.006
2.322IleAsn: 2.322 ± 0.009
3.056IlePro: 3.056 ± 0.012
2.1IleGln: 2.1 ± 0.01
2.665IleArg: 2.665 ± 0.01
5.11IleSer: 5.11 ± 0.014
2.722IleThr: 2.722 ± 0.011
3.525IleVal: 3.525 ± 0.011
0.757IleTrp: 0.757 ± 0.005
1.557IleTyr: 1.557 ± 0.008
0.003IleXaa: 0.003 ± 0.0
Lys
4.149LysAla: 4.149 ± 0.016
1.061LysCys: 1.061 ± 0.007
3.448LysAsp: 3.448 ± 0.011
5.06LysGlu: 5.06 ± 0.019
2.407LysPhe: 2.407 ± 0.012
3.668LysGly: 3.668 ± 0.013
1.39LysHis: 1.39 ± 0.006
3.476LysIle: 3.476 ± 0.011
5.206LysLys: 5.206 ± 0.022
6.361LysLeu: 6.361 ± 0.016
1.624LysMet: 1.624 ± 0.007
2.918LysAsn: 2.918 ± 0.012
2.756LysPro: 2.756 ± 0.02
2.465LysGln: 2.465 ± 0.01
3.758LysArg: 3.758 ± 0.012
4.922LysSer: 4.922 ± 0.015
2.974LysThr: 2.974 ± 0.011
4.036LysVal: 4.036 ± 0.012
0.834LysTrp: 0.834 ± 0.006
1.845LysTyr: 1.845 ± 0.009
0.002LysXaa: 0.002 ± 0.0
Leu
6.331LeuAla: 6.331 ± 0.018
1.823LeuCys: 1.823 ± 0.009
5.049LeuAsp: 5.049 ± 0.016
6.425LeuGlu: 6.425 ± 0.02
3.799LeuPhe: 3.799 ± 0.012
5.571LeuGly: 5.571 ± 0.018
2.544LeuHis: 2.544 ± 0.01
4.744LeuIle: 4.744 ± 0.016
6.453LeuLys: 6.453 ± 0.014
9.785LeuLeu: 9.785 ± 0.028
2.2LeuMet: 2.2 ± 0.009
4.107LeuAsn: 4.107 ± 0.013
5.099LeuPro: 5.099 ± 0.016
4.373LeuGln: 4.373 ± 0.013
5.222LeuArg: 5.222 ± 0.015
8.381LeuSer: 8.381 ± 0.024
4.412LeuThr: 4.412 ± 0.013
6.238LeuVal: 6.238 ± 0.016
1.167LeuTrp: 1.167 ± 0.008
2.581LeuTyr: 2.581 ± 0.011
0.004LeuXaa: 0.004 ± 0.0
Met
2.05MetAla: 2.05 ± 0.009
0.344MetCys: 0.344 ± 0.004
1.45MetAsp: 1.45 ± 0.007
1.954MetGlu: 1.954 ± 0.009
0.811MetPhe: 0.811 ± 0.005
1.518MetGly: 1.518 ± 0.008
0.548MetHis: 0.548 ± 0.004
1.222MetIle: 1.222 ± 0.007
1.704MetLys: 1.704 ± 0.008
2.218MetLeu: 2.218 ± 0.009
0.674MetMet: 0.674 ± 0.005
1.08MetAsn: 1.08 ± 0.006
1.091MetPro: 1.091 ± 0.008
1.009MetGln: 1.009 ± 0.007
1.216MetArg: 1.216 ± 0.007
1.801MetSer: 1.801 ± 0.007
1.113MetThr: 1.113 ± 0.007
1.616MetVal: 1.616 ± 0.007
0.271MetTrp: 0.271 ± 0.003
0.595MetTyr: 0.595 ± 0.005
0.001MetXaa: 0.001 ± 0.0
Asn
2.725AsnAla: 2.725 ± 0.011
0.904AsnCys: 0.904 ± 0.006
2.248AsnAsp: 2.248 ± 0.01
2.772AsnGlu: 2.772 ± 0.011
2.109AsnPhe: 2.109 ± 0.009
3.315AsnGly: 3.315 ± 0.012
1.113AsnHis: 1.113 ± 0.006
2.716AsnIle: 2.716 ± 0.011
2.659AsnLys: 2.659 ± 0.009
4.641AsnLeu: 4.641 ± 0.017
1.192AsnMet: 1.192 ± 0.007
2.577AsnAsn: 2.577 ± 0.016
2.309AsnPro: 2.309 ± 0.01
1.838AsnGln: 1.838 ± 0.008
2.059AsnArg: 2.059 ± 0.009
4.147AsnSer: 4.147 ± 0.015
2.041AsnThr: 2.041 ± 0.008
2.89AsnVal: 2.89 ± 0.01
0.582AsnTrp: 0.582 ± 0.005
1.433AsnTyr: 1.433 ± 0.008
0.002AsnXaa: 0.002 ± 0.0
Pro
2.961ProAla: 2.961 ± 0.016
0.788ProCys: 0.788 ± 0.005
2.339ProAsp: 2.339 ± 0.009
3.118ProGlu: 3.118 ± 0.018
2.005ProPhe: 2.005 ± 0.01
2.562ProGly: 2.562 ± 0.01
1.074ProHis: 1.074 ± 0.006
2.442ProIle: 2.442 ± 0.01
2.861ProLys: 2.861 ± 0.013
4.243ProLeu: 4.243 ± 0.014
0.944ProMet: 0.944 ± 0.006
2.341ProAsn: 2.341 ± 0.01
3.655ProPro: 3.655 ± 0.031
1.845ProGln: 1.845 ± 0.01
2.27ProArg: 2.27 ± 0.009
4.929ProSer: 4.929 ± 0.017
2.667ProThr: 2.667 ± 0.011
3.014ProVal: 3.014 ± 0.012
0.579ProTrp: 0.579 ± 0.004
1.356ProTyr: 1.356 ± 0.008
0.003ProXaa: 0.003 ± 0.0
Gln
2.416GlnAla: 2.416 ± 0.011
0.593GlnCys: 0.593 ± 0.005
1.703GlnAsp: 1.703 ± 0.008
2.542GlnGlu: 2.542 ± 0.009
1.453GlnPhe: 1.453 ± 0.006
2.104GlnGly: 2.104 ± 0.01
0.991GlnHis: 0.991 ± 0.006
2.071GlnIle: 2.071 ± 0.009
2.583GlnLys: 2.583 ± 0.011
3.855GlnLeu: 3.855 ± 0.013
1.03GlnMet: 1.03 ± 0.006
1.951GlnAsn: 1.951 ± 0.01
1.735GlnPro: 1.735 ± 0.01
2.259GlnGln: 2.259 ± 0.021
2.15GlnArg: 2.15 ± 0.007
2.945GlnSer: 2.945 ± 0.011
1.736GlnThr: 1.736 ± 0.007
2.414GlnVal: 2.414 ± 0.009
0.467GlnTrp: 0.467 ± 0.004
0.992GlnTyr: 0.992 ± 0.006
0.001GlnXaa: 0.001 ± 0.0
Arg
3.152ArgAla: 3.152 ± 0.011
0.946ArgCys: 0.946 ± 0.006
2.612ArgAsp: 2.612 ± 0.011
3.322ArgGlu: 3.322 ± 0.012
2.14ArgPhe: 2.14 ± 0.01
3.114ArgGly: 3.114 ± 0.012
1.248ArgHis: 1.248 ± 0.007
2.856ArgIle: 2.856 ± 0.011
3.907ArgLys: 3.907 ± 0.013
4.844ArgLeu: 4.844 ± 0.014
1.277ArgMet: 1.277 ± 0.006
2.515ArgAsn: 2.515 ± 0.01
2.302ArgPro: 2.302 ± 0.01
1.854ArgGln: 1.854 ± 0.009
3.798ArgArg: 3.798 ± 0.013
4.28ArgSer: 4.28 ± 0.015
2.422ArgThr: 2.422 ± 0.009
3.176ArgVal: 3.176 ± 0.011
0.726ArgTrp: 0.726 ± 0.005
1.51ArgTyr: 1.51 ± 0.007
0.002ArgXaa: 0.002 ± 0.0
Ser
5.348SerAla: 5.348 ± 0.017
1.756SerCys: 1.756 ± 0.009
4.419SerAsp: 4.419 ± 0.015
4.879SerGlu: 4.879 ± 0.015
3.929SerPhe: 3.929 ± 0.013
5.649SerGly: 5.649 ± 0.019
1.984SerHis: 1.984 ± 0.009
4.721SerIle: 4.721 ± 0.014
5.146SerLys: 5.146 ± 0.017
8.346SerLeu: 8.346 ± 0.021
2.068SerMet: 2.068 ± 0.008
4.182SerAsn: 4.182 ± 0.015
4.46SerPro: 4.46 ± 0.02
3.153SerGln: 3.153 ± 0.014
4.468SerArg: 4.468 ± 0.015
10.829SerSer: 10.829 ± 0.03
4.83SerThr: 4.83 ± 0.014
5.159SerVal: 5.159 ± 0.015
1.162SerTrp: 1.162 ± 0.006
2.387SerTyr: 2.387 ± 0.01
0.004SerXaa: 0.004 ± 0.0
Thr
3.305ThrAla: 3.305 ± 0.013
0.956ThrCys: 0.956 ± 0.006
2.303ThrAsp: 2.303 ± 0.01
2.883ThrGlu: 2.883 ± 0.012
2.096ThrPhe: 2.096 ± 0.008
3.143ThrGly: 3.143 ± 0.01
1.045ThrHis: 1.045 ± 0.006
2.855ThrIle: 2.855 ± 0.011
2.87ThrLys: 2.87 ± 0.012
4.623ThrLeu: 4.623 ± 0.013
1.144ThrMet: 1.144 ± 0.006
2.233ThrAsn: 2.233 ± 0.009
2.607ThrPro: 2.607 ± 0.011
1.619ThrGln: 1.619 ± 0.007
2.414ThrArg: 2.414 ± 0.01
4.75ThrSer: 4.75 ± 0.013
3.06ThrThr: 3.06 ± 0.012
3.233ThrVal: 3.233 ± 0.012
0.643ThrTrp: 0.643 ± 0.005
1.428ThrTyr: 1.428 ± 0.008
0.002ThrXaa: 0.002 ± 0.0
Val
4.625ValAla: 4.625 ± 0.012
1.107ValCys: 1.107 ± 0.007
3.709ValAsp: 3.709 ± 0.009
4.556ValGlu: 4.556 ± 0.019
2.588ValPhe: 2.588 ± 0.01
3.864ValGly: 3.864 ± 0.013
1.503ValHis: 1.503 ± 0.007
3.515ValIle: 3.515 ± 0.012
4.131ValLys: 4.131 ± 0.013
6.131ValLeu: 6.131 ± 0.015
1.501ValMet: 1.501 ± 0.007
2.792ValAsn: 2.792 ± 0.01
3.134ValPro: 3.134 ± 0.012
2.384ValGln: 2.384 ± 0.009
2.983ValArg: 2.983 ± 0.01
5.225ValSer: 5.225 ± 0.015
3.246ValThr: 3.246 ± 0.01
4.759ValVal: 4.759 ± 0.014
0.717ValTrp: 0.717 ± 0.006
1.898ValTyr: 1.898 ± 0.008
0.003ValXaa: 0.003 ± 0.0
Trp
0.757TrpAla: 0.757 ± 0.007
0.236TrpCys: 0.236 ± 0.003
0.698TrpAsp: 0.698 ± 0.005
0.78TrpGlu: 0.78 ± 0.006
0.535TrpPhe: 0.535 ± 0.004
0.697TrpGly: 0.697 ± 0.005
0.3TrpHis: 0.3 ± 0.003
0.729TrpIle: 0.729 ± 0.005
0.972TrpLys: 0.972 ± 0.007
1.174TrpLeu: 1.174 ± 0.006
0.34TrpMet: 0.34 ± 0.003
0.739TrpAsn: 0.739 ± 0.006
0.475TrpPro: 0.475 ± 0.004
0.447TrpGln: 0.447 ± 0.004
0.811TrpArg: 0.811 ± 0.005
0.964TrpSer: 0.964 ± 0.006
0.653TrpThr: 0.653 ± 0.005
0.745TrpVal: 0.745 ± 0.006
0.241TrpTrp: 0.241 ± 0.003
0.355TrpTyr: 0.355 ± 0.004
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.825TyrAla: 1.825 ± 0.008
0.665TyrCys: 0.665 ± 0.004
1.559TyrAsp: 1.559 ± 0.008
1.625TyrGlu: 1.625 ± 0.007
1.355TyrPhe: 1.355 ± 0.007
2.069TyrGly: 2.069 ± 0.011
0.744TyrHis: 0.744 ± 0.005
1.534TyrIle: 1.534 ± 0.007
1.681TyrLys: 1.681 ± 0.01
2.885TyrLeu: 2.885 ± 0.011
0.759TyrMet: 0.759 ± 0.006
1.41TyrAsn: 1.41 ± 0.008
1.286TyrPro: 1.286 ± 0.007
1.017TyrGln: 1.017 ± 0.007
1.505TyrArg: 1.505 ± 0.007
2.377TyrSer: 2.377 ± 0.009
1.33TyrThr: 1.33 ± 0.007
1.773TyrVal: 1.773 ± 0.008
0.41TyrTrp: 0.41 ± 0.004
1.061TyrTyr: 1.061 ± 0.009
0.002TyrXaa: 0.002 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.003XaaGly: 0.003 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.003XaaLys: 0.003 ± 0.0
0.004XaaLeu: 0.004 ± 0.0
0.002XaaMet: 0.002 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.002XaaGln: 0.002 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.004XaaSer: 0.004 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.003XaaVal: 0.003 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.002XaaTyr: 0.002 ± 0.0
0.001XaaXaa: 0.001 ± 0.0
Statistics based on 73607 proteins (29996979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski