Amino acid dipeptide frequency for Nicotiana sylvestris (Wood tobacco) (South American tobacco)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
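As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the mapping of non-standard letters to X (Xaa), and the choice of normalization are assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Nicotiana sylvestris sequences.
freqs = dipeptide_frequencies(["MALLS", "MAB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

With this normalization the frequencies of all observed dipeptides sum to 1000‰.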

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
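The arithmetic behind this baseline can be sketched as follows. The four observed values are copied from the table on this page; the helper `classify` and the plain greater-than/less-than comparison against the uniform expectation are assumptions, since the page does not state the exact rule used for the colouring.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2

# A few values from the table (per mille).
observed = {"LeuLeu": 9.753, "SerSer": 10.876, "TrpTrp": 0.249, "CysMet": 0.453}

def classify(value, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

labels = {name: classify(value) for name, value in observed.items()}

for name, value in observed.items():
    fraction = value * 1e-3  # per mille -> fraction
    print(f"{name}: {value} per mille = {fraction:.6f}, {labels[name]}")
```

A baseline built from the product of the two single amino acid frequencies would be an alternative, but it is not what the text above describes.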

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.918 ± 0.029
AlaCys: 1.184 ± 0.009
AlaAsp: 3.107 ± 0.015
AlaGlu: 4.267 ± 0.019
AlaPhe: 2.674 ± 0.014
AlaGly: 3.806 ± 0.018
AlaHis: 1.286 ± 0.009
AlaIle: 3.783 ± 0.016
AlaLys: 4.115 ± 0.019
AlaLeu: 6.285 ± 0.023
AlaMet: 1.669 ± 0.011
AlaAsn: 2.692 ± 0.014
AlaPro: 2.724 ± 0.018
AlaGln: 2.146 ± 0.012
AlaArg: 3.296 ± 0.015
AlaSer: 5.748 ± 0.023
AlaThr: 3.577 ± 0.018
AlaVal: 4.524 ± 0.018
AlaTrp: 0.745 ± 0.007
AlaTyr: 1.837 ± 0.011
AlaXaa: 0.002 ± 0.0
Cys
CysAla: 0.968 ± 0.007
CysCys: 0.545 ± 0.007
CysAsp: 0.924 ± 0.007
CysGlu: 0.915 ± 0.007
CysPhe: 0.883 ± 0.008
CysGly: 1.358 ± 0.011
CysHis: 0.479 ± 0.005
CysIle: 1.073 ± 0.009
CysLys: 1.182 ± 0.009
CysLeu: 1.85 ± 0.011
CysMet: 0.453 ± 0.005
CysAsn: 0.871 ± 0.007
CysPro: 0.947 ± 0.009
CysGln: 0.648 ± 0.007
CysArg: 1.053 ± 0.008
CysSer: 1.845 ± 0.011
CysThr: 0.905 ± 0.007
CysVal: 1.021 ± 0.008
CysTrp: 0.248 ± 0.004
CysTyr: 0.584 ± 0.006
CysXaa: 0.001 ± 0.0
Asp
AspAla: 3.376 ± 0.015
AspCys: 0.974 ± 0.008
AspAsp: 3.566 ± 0.02
AspGlu: 4.132 ± 0.019
AspPhe: 2.424 ± 0.013
AspGly: 3.472 ± 0.019
AspHis: 1.234 ± 0.009
AspIle: 3.247 ± 0.015
AspLys: 2.908 ± 0.018
AspLeu: 5.017 ± 0.02
AspMet: 1.379 ± 0.009
AspAsn: 2.217 ± 0.014
AspPro: 2.475 ± 0.014
AspGln: 1.803 ± 0.011
AspArg: 2.338 ± 0.016
AspSer: 4.22 ± 0.02
AspThr: 2.249 ± 0.012
AspVal: 3.633 ± 0.018
AspTrp: 0.68 ± 0.006
AspTyr: 1.584 ± 0.01
AspXaa: 0.001 ± 0.0
Glu
GluAla: 4.669 ± 0.021
GluCys: 0.952 ± 0.008
GluAsp: 4.005 ± 0.02
GluGlu: 6.392 ± 0.039
GluPhe: 2.425 ± 0.013
GluGly: 3.589 ± 0.016
GluHis: 1.323 ± 0.01
GluIle: 3.98 ± 0.016
GluLys: 5.125 ± 0.027
GluLeu: 6.161 ± 0.025
GluMet: 1.841 ± 0.011
GluAsn: 3.348 ± 0.017
GluPro: 2.084 ± 0.015
GluGln: 2.31 ± 0.013
GluArg: 3.391 ± 0.019
GluSer: 4.71 ± 0.018
GluThr: 3.08 ± 0.015
GluVal: 4.404 ± 0.018
GluTrp: 0.729 ± 0.007
GluTyr: 1.784 ± 0.011
GluXaa: 0.002 ± 0.0
Phe
PheAla: 2.439 ± 0.013
PheCys: 0.905 ± 0.007
PheAsp: 2.379 ± 0.014
PheGlu: 2.369 ± 0.013
PhePhe: 1.897 ± 0.013
PheGly: 2.945 ± 0.016
PheHis: 1.114 ± 0.009
PheIle: 2.154 ± 0.013
PheLys: 2.218 ± 0.011
PheLeu: 4.322 ± 0.019
PheMet: 0.962 ± 0.008
PheAsn: 1.845 ± 0.011
PhePro: 2.137 ± 0.012
PheGln: 1.656 ± 0.012
PheArg: 2.001 ± 0.011
PheSer: 4.056 ± 0.017
PheThr: 2.013 ± 0.012
PheVal: 2.694 ± 0.013
PheTrp: 0.554 ± 0.007
PheTyr: 1.263 ± 0.009
PheXaa: 0.002 ± 0.0
Gly
GlyAla: 3.627 ± 0.018
GlyCys: 1.199 ± 0.009
GlyAsp: 3.228 ± 0.015
GlyGlu: 3.607 ± 0.015
GlyPhe: 2.897 ± 0.012
GlyGly: 4.9 ± 0.038
GlyHis: 1.532 ± 0.011
GlyIle: 3.513 ± 0.016
GlyLys: 4.117 ± 0.015
GlyLeu: 5.472 ± 0.02
GlyMet: 1.445 ± 0.01
GlyAsn: 3.109 ± 0.016
GlyPro: 2.309 ± 0.013
GlyGln: 2.12 ± 0.013
GlyArg: 3.478 ± 0.014
GlySer: 5.674 ± 0.023
GlyThr: 3.139 ± 0.015
GlyVal: 3.929 ± 0.018
GlyTrp: 0.831 ± 0.007
GlyTyr: 2.039 ± 0.013
GlyXaa: 0.002 ± 0.0
His
HisAla: 1.378 ± 0.01
HisCys: 0.524 ± 0.006
HisAsp: 1.14 ± 0.009
HisGlu: 1.317 ± 0.011
HisPhe: 1.126 ± 0.008
HisGly: 1.57 ± 0.011
HisHis: 0.843 ± 0.009
HisIle: 1.327 ± 0.009
HisLys: 1.254 ± 0.009
HisLeu: 2.486 ± 0.012
HisMet: 0.58 ± 0.006
HisAsn: 1.027 ± 0.008
HisPro: 1.308 ± 0.009
HisGln: 1.037 ± 0.008
HisArg: 1.274 ± 0.011
HisSer: 1.956 ± 0.012
HisThr: 0.991 ± 0.008
HisVal: 1.532 ± 0.01
HisTrp: 0.296 ± 0.004
HisTyr: 0.718 ± 0.007
HisXaa: 0.001 ± 0.0
Ile
IleAla: 3.547 ± 0.017
IleCys: 1.177 ± 0.009
IleAsp: 2.987 ± 0.013
IleGlu: 3.388 ± 0.015
IlePhe: 2.423 ± 0.014
IleGly: 3.437 ± 0.017
IleHis: 1.335 ± 0.009
IleIle: 3.018 ± 0.015
IleLys: 3.181 ± 0.014
IleLeu: 5.463 ± 0.02
IleMet: 1.213 ± 0.008
IleAsn: 2.339 ± 0.012
IlePro: 3.059 ± 0.019
IleGln: 2.135 ± 0.011
IleArg: 2.697 ± 0.012
IleSer: 5.105 ± 0.019
IleThr: 2.672 ± 0.013
IleVal: 3.523 ± 0.015
IleTrp: 0.773 ± 0.007
IleTyr: 1.571 ± 0.01
IleXaa: 0.002 ± 0.0
Lys
LysAla: 4.145 ± 0.02
LysCys: 1.08 ± 0.009
LysAsp: 3.459 ± 0.016
LysGlu: 5.071 ± 0.028
LysPhe: 2.444 ± 0.01
LysGly: 3.669 ± 0.016
LysHis: 1.371 ± 0.009
LysIle: 3.496 ± 0.014
LysLys: 5.218 ± 0.026
LysLeu: 6.387 ± 0.025
LysMet: 1.63 ± 0.011
LysAsn: 2.911 ± 0.015
LysPro: 2.664 ± 0.016
LysGln: 2.463 ± 0.015
LysArg: 3.751 ± 0.019
LysSer: 4.904 ± 0.022
LysThr: 2.966 ± 0.014
LysVal: 4.045 ± 0.016
LysTrp: 0.848 ± 0.008
LysTyr: 1.837 ± 0.012
LysXaa: 0.002 ± 0.0
Leu
LeuAla: 6.327 ± 0.021
LeuCys: 1.835 ± 0.013
LeuAsp: 5.039 ± 0.018
LeuGlu: 6.458 ± 0.028
LeuPhe: 3.756 ± 0.018
LeuGly: 5.523 ± 0.021
LeuHis: 2.543 ± 0.013
LeuIle: 4.725 ± 0.019
LeuLys: 6.495 ± 0.024
LeuLeu: 9.753 ± 0.034
LeuMet: 2.203 ± 0.012
LeuAsn: 4.082 ± 0.016
LeuPro: 5.091 ± 0.024
LeuGln: 4.412 ± 0.019
LeuArg: 5.291 ± 0.018
LeuSer: 8.355 ± 0.031
LeuThr: 4.35 ± 0.018
LeuVal: 6.233 ± 0.024
LeuTrp: 1.189 ± 0.009
LeuTyr: 2.581 ± 0.017
LeuXaa: 0.004 ± 0.001
Met
MetAla: 2.041 ± 0.012
MetCys: 0.351 ± 0.005
MetAsp: 1.471 ± 0.01
MetGlu: 1.973 ± 0.01
MetPhe: 0.807 ± 0.007
MetGly: 1.504 ± 0.011
MetHis: 0.556 ± 0.006
MetIle: 1.215 ± 0.009
MetLys: 1.712 ± 0.009
MetLeu: 2.221 ± 0.013
MetMet: 0.665 ± 0.007
MetAsn: 1.096 ± 0.01
MetPro: 1.096 ± 0.009
MetGln: 1.019 ± 0.01
MetArg: 1.232 ± 0.008
MetSer: 1.801 ± 0.01
MetThr: 1.113 ± 0.009
MetVal: 1.615 ± 0.011
MetTrp: 0.279 ± 0.004
MetTyr: 0.593 ± 0.006
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.708 ± 0.013
AsnCys: 0.911 ± 0.008
AsnAsp: 2.263 ± 0.013
AsnGlu: 2.807 ± 0.015
AsnPhe: 2.095 ± 0.012
AsnGly: 3.273 ± 0.015
AsnHis: 1.108 ± 0.009
AsnIle: 2.745 ± 0.013
AsnLys: 2.678 ± 0.015
AsnLeu: 4.666 ± 0.022
AsnMet: 1.196 ± 0.008
AsnAsn: 2.585 ± 0.023
AsnPro: 2.275 ± 0.012
AsnGln: 1.837 ± 0.012
AsnArg: 2.069 ± 0.012
AsnSer: 4.145 ± 0.022
AsnThr: 2.041 ± 0.011
AsnVal: 2.892 ± 0.014
AsnTrp: 0.59 ± 0.005
AsnTyr: 1.434 ± 0.011
AsnXaa: 0.002 ± 0.0
Pro
ProAla: 2.9 ± 0.016
ProCys: 0.781 ± 0.006
ProAsp: 2.331 ± 0.014
ProGlu: 3.035 ± 0.016
ProPhe: 2.001 ± 0.012
ProGly: 2.552 ± 0.014
ProHis: 1.083 ± 0.008
ProIle: 2.436 ± 0.011
ProLys: 2.852 ± 0.014
ProLeu: 4.223 ± 0.018
ProMet: 0.946 ± 0.008
ProAsn: 2.32 ± 0.014
ProPro: 3.682 ± 0.046
ProGln: 1.859 ± 0.012
ProArg: 2.291 ± 0.015
ProSer: 4.955 ± 0.023
ProThr: 2.654 ± 0.013
ProVal: 2.981 ± 0.016
ProTrp: 0.578 ± 0.006
ProTyr: 1.369 ± 0.011
ProXaa: 0.002 ± 0.0
Gln
GlnAla: 2.415 ± 0.014
GlnCys: 0.614 ± 0.006
GlnAsp: 1.722 ± 0.01
GlnGlu: 2.563 ± 0.014
GlnPhe: 1.463 ± 0.009
GlnGly: 2.139 ± 0.012
GlnHis: 0.99 ± 0.009
GlnIle: 2.086 ± 0.011
GlnLys: 2.577 ± 0.015
GlnLeu: 3.888 ± 0.017
GlnMet: 1.055 ± 0.009
GlnAsn: 1.957 ± 0.012
GlnPro: 1.744 ± 0.011
GlnGln: 2.28 ± 0.027
GlnArg: 2.181 ± 0.012
GlnSer: 2.987 ± 0.016
GlnThr: 1.746 ± 0.011
GlnVal: 2.439 ± 0.013
GlnTrp: 0.475 ± 0.005
GlnTyr: 1.002 ± 0.009
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 3.163 ± 0.015
ArgCys: 0.968 ± 0.008
ArgAsp: 2.619 ± 0.015
ArgGlu: 3.344 ± 0.017
ArgPhe: 2.151 ± 0.012
ArgGly: 3.173 ± 0.015
ArgHis: 1.279 ± 0.009
ArgIle: 2.88 ± 0.014
ArgLys: 3.916 ± 0.019
ArgLeu: 4.892 ± 0.021
ArgMet: 1.299 ± 0.009
ArgAsn: 2.527 ± 0.013
ArgPro: 2.308 ± 0.014
ArgGln: 1.889 ± 0.012
ArgArg: 3.836 ± 0.022
ArgSer: 4.299 ± 0.021
ArgThr: 2.447 ± 0.012
ArgVal: 3.213 ± 0.016
ArgTrp: 0.741 ± 0.007
ArgTyr: 1.509 ± 0.01
ArgXaa: 0.002 ± 0.0
Ser
SerAla: 5.323 ± 0.02
SerCys: 1.776 ± 0.011
SerAsp: 4.391 ± 0.018
SerGlu: 4.84 ± 0.018
SerPhe: 3.916 ± 0.02
SerGly: 5.605 ± 0.026
SerHis: 1.985 ± 0.012
SerIle: 4.72 ± 0.02
SerLys: 5.142 ± 0.018
SerLeu: 8.362 ± 0.028
SerMet: 2.068 ± 0.013
SerAsn: 4.19 ± 0.018
SerPro: 4.446 ± 0.028
SerGln: 3.165 ± 0.016
SerArg: 4.523 ± 0.023
SerSer: 10.876 ± 0.042
SerThr: 4.845 ± 0.017
SerVal: 5.143 ± 0.016
SerTrp: 1.154 ± 0.009
SerTyr: 2.427 ± 0.012
SerXaa: 0.003 ± 0.0
Thr
ThrAla: 3.267 ± 0.015
ThrCys: 0.965 ± 0.007
ThrAsp: 2.299 ± 0.012
ThrGlu: 2.85 ± 0.014
ThrPhe: 2.076 ± 0.012
ThrGly: 3.075 ± 0.015
ThrHis: 1.039 ± 0.008
ThrIle: 2.868 ± 0.016
ThrLys: 2.892 ± 0.016
ThrLeu: 4.617 ± 0.018
ThrMet: 1.16 ± 0.008
ThrAsn: 2.239 ± 0.011
ThrPro: 2.584 ± 0.014
ThrGln: 1.649 ± 0.01
ThrArg: 2.413 ± 0.012
ThrSer: 4.728 ± 0.018
ThrThr: 3.062 ± 0.017
ThrVal: 3.188 ± 0.015
ThrTrp: 0.649 ± 0.006
ThrTyr: 1.433 ± 0.01
ThrXaa: 0.002 ± 0.0
Val
ValAla: 4.603 ± 0.02
ValCys: 1.108 ± 0.008
ValAsp: 3.719 ± 0.015
ValGlu: 4.531 ± 0.021
ValPhe: 2.579 ± 0.014
ValGly: 3.828 ± 0.018
ValHis: 1.521 ± 0.009
ValIle: 3.516 ± 0.014
ValLys: 4.085 ± 0.017
ValLeu: 6.11 ± 0.022
ValMet: 1.498 ± 0.009
ValAsn: 2.795 ± 0.012
ValPro: 3.147 ± 0.017
ValGln: 2.419 ± 0.012
ValArg: 3.006 ± 0.015
ValSer: 5.208 ± 0.019
ValThr: 3.217 ± 0.015
ValVal: 4.756 ± 0.019
ValTrp: 0.728 ± 0.006
ValTyr: 1.901 ± 0.012
ValXaa: 0.002 ± 0.0
Trp
TrpAla: 0.745 ± 0.007
TrpCys: 0.24 ± 0.004
TrpAsp: 0.712 ± 0.007
TrpGlu: 0.798 ± 0.007
TrpPhe: 0.545 ± 0.006
TrpGly: 0.689 ± 0.007
TrpHis: 0.299 ± 0.004
TrpIle: 0.724 ± 0.007
TrpLys: 0.995 ± 0.008
TrpLeu: 1.181 ± 0.009
TrpMet: 0.344 ± 0.005
TrpAsn: 0.751 ± 0.008
TrpPro: 0.481 ± 0.006
TrpGln: 0.45 ± 0.005
TrpArg: 0.818 ± 0.006
TrpSer: 0.966 ± 0.008
TrpThr: 0.654 ± 0.006
TrpVal: 0.749 ± 0.006
TrpTrp: 0.249 ± 0.004
TrpTyr: 0.361 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.823 ± 0.011
TyrCys: 0.664 ± 0.007
TyrAsp: 1.565 ± 0.009
TyrGlu: 1.645 ± 0.01
TyrPhe: 1.342 ± 0.008
TyrGly: 2.057 ± 0.014
TyrHis: 0.764 ± 0.006
TyrIle: 1.54 ± 0.01
TyrLys: 1.696 ± 0.013
TyrLeu: 2.883 ± 0.016
TyrMet: 0.755 ± 0.006
TyrAsn: 1.417 ± 0.011
TyrPro: 1.267 ± 0.01
TyrGln: 1.027 ± 0.008
TyrArg: 1.513 ± 0.01
TyrSer: 2.362 ± 0.013
TyrThr: 1.329 ± 0.011
TyrVal: 1.762 ± 0.01
TyrTrp: 0.417 ± 0.005
TyrTyr: 1.077 ± 0.012
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.002 ± 0.0
XaaCys: 0.001 ± 0.0
XaaAsp: 0.002 ± 0.0
XaaGlu: 0.002 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.003 ± 0.0
XaaHis: 0.001 ± 0.0
XaaIle: 0.002 ± 0.0
XaaLys: 0.002 ± 0.0
XaaLeu: 0.003 ± 0.001
XaaMet: 0.002 ± 0.0
XaaAsn: 0.002 ± 0.0
XaaPro: 0.002 ± 0.0
XaaGln: 0.001 ± 0.0
XaaArg: 0.002 ± 0.0
XaaSer: 0.002 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.002 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.002 ± 0.001
Statistics based on 41327 proteins (17030923 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
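A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The helper names, the toy sequences, and the use of the standard deviation as the reported error are assumptions; the note above states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics

def pair_counts(seq, pair):
    """Return (occurrences of pair, number of overlapping pairs) for one protein."""
    hits = sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)
    return hits, max(len(seq) - 1, 0)

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Mean and standard deviation (per mille) over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p, pair) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy proteome in one-letter code, not real data.
proteins = ["MALLSLL", "MKLLA", "MSSSA", "MALK"]
mean, err = bootstrap_error(proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residue pairs keeps pairs from the same protein together, so the estimate reflects variation between proteins.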

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski