Amino acid dipeptide frequency for Gossypium barbadense (Sea-island cotton) (Egyptian cotton)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
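
As a minimal sketch of the calculation described above, the following Python snippet counts overlapping dipeptides in a set of protein sequences, converts the counts to per mille, and labels each dipeptide as over- or underrepresented relative to the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The function names, the one-letter input encoding, and the toy sequences are illustrative assumptions, not part of Proteome-pI; the database's actual pipeline may differ (for example, in how Xaa is handled).

```python
from collections import Counter

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences in one-letter code).
toy = ["MSSLLA", "MSLA"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

Applied to values from the table, `classify(5.899)` (AlaAla) returns "over" and `classify(0.553)` (CysCys) returns "under".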

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
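
For example, the unit conversion can be written out as follows. This is a small illustrative helper (the function names are assumptions, not part of the database), using the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 5.899  # AlaAla frequency in per mille, taken from the table
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(ala_ala):.4f}")   # same value in percent
```

AlaAla at 5.899‰ thus corresponds to a fraction of 0.005899, or about 0.59% of all dipeptides.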

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.899 ± 0.028
AlaCys: 1.284 ± 0.011
AlaAsp: 2.98 ± 0.014
AlaGlu: 4.027 ± 0.021
AlaPhe: 2.698 ± 0.014
AlaGly: 3.851 ± 0.018
AlaHis: 1.289 ± 0.01
AlaIle: 3.846 ± 0.017
AlaLys: 3.896 ± 0.019
AlaLeu: 6.586 ± 0.025
AlaMet: 1.826 ± 0.01
AlaAsn: 2.624 ± 0.013
AlaPro: 2.735 ± 0.019
AlaGln: 2.041 ± 0.012
AlaArg: 3.274 ± 0.016
AlaSer: 5.799 ± 0.022
AlaThr: 3.604 ± 0.018
AlaVal: 4.615 ± 0.017
AlaTrp: 0.789 ± 0.008
AlaTyr: 1.8 ± 0.01
AlaXaa: 0.002 ± 0.0
Cys
CysAla: 0.986 ± 0.009
CysCys: 0.553 ± 0.007
CysAsp: 0.886 ± 0.009
CysGlu: 0.91 ± 0.007
CysPhe: 0.952 ± 0.008
CysGly: 1.394 ± 0.011
CysHis: 0.493 ± 0.006
CysIle: 1.027 ± 0.008
CysLys: 1.202 ± 0.011
CysLeu: 1.98 ± 0.013
CysMet: 0.492 ± 0.006
CysAsn: 0.921 ± 0.008
CysPro: 0.973 ± 0.01
CysGln: 0.647 ± 0.007
CysArg: 1.074 ± 0.008
CysSer: 1.889 ± 0.014
CysThr: 0.843 ± 0.008
CysVal: 1.008 ± 0.009
CysTrp: 0.293 ± 0.005
CysTyr: 0.575 ± 0.007
CysXaa: 0.001 ± 0.0
Asp
AspAla: 3.352 ± 0.017
AspCys: 0.983 ± 0.009
AspAsp: 3.448 ± 0.02
AspGlu: 3.979 ± 0.019
AspPhe: 2.297 ± 0.014
AspGly: 3.664 ± 0.016
AspHis: 1.2 ± 0.009
AspIle: 3.068 ± 0.014
AspLys: 2.841 ± 0.015
AspLeu: 4.995 ± 0.023
AspMet: 1.335 ± 0.01
AspAsn: 2.191 ± 0.013
AspPro: 2.667 ± 0.016
AspGln: 1.678 ± 0.012
AspArg: 2.287 ± 0.013
AspSer: 4.061 ± 0.019
AspThr: 2.17 ± 0.013
AspVal: 3.558 ± 0.016
AspTrp: 0.705 ± 0.006
AspTyr: 1.472 ± 0.01
AspXaa: 0.002 ± 0.0
Glu
GluAla: 4.673 ± 0.021
GluCys: 0.93 ± 0.008
GluAsp: 3.764 ± 0.017
GluGlu: 5.919 ± 0.031
GluPhe: 2.419 ± 0.013
GluGly: 3.698 ± 0.019
GluHis: 1.231 ± 0.01
GluIle: 3.829 ± 0.017
GluLys: 4.664 ± 0.025
GluLeu: 6.131 ± 0.029
GluMet: 1.805 ± 0.014
GluAsn: 3.189 ± 0.017
GluPro: 2.282 ± 0.015
GluGln: 2.217 ± 0.014
GluArg: 3.354 ± 0.02
GluSer: 4.459 ± 0.021
GluThr: 3.233 ± 0.02
GluVal: 4.169 ± 0.018
GluTrp: 0.797 ± 0.007
GluTyr: 1.603 ± 0.011
GluXaa: 0.003 ± 0.0
Phe
PheAla: 2.473 ± 0.015
PheCys: 0.888 ± 0.008
PheAsp: 2.36 ± 0.013
PheGlu: 2.343 ± 0.014
PhePhe: 1.96 ± 0.015
PheGly: 3.109 ± 0.016
PheHis: 1.152 ± 0.009
PheIle: 2.19 ± 0.013
PheLys: 2.33 ± 0.013
PheLeu: 4.207 ± 0.019
PheMet: 1.028 ± 0.009
PheAsn: 1.946 ± 0.013
PhePro: 2.21 ± 0.014
PheGln: 1.65 ± 0.011
PheArg: 2.027 ± 0.011
PheSer: 4.043 ± 0.019
PheThr: 2.025 ± 0.012
PheVal: 2.623 ± 0.016
PheTrp: 0.568 ± 0.006
PheTyr: 1.306 ± 0.01
PheXaa: 0.002 ± 0.0
Gly
GlyAla: 3.697 ± 0.022
GlyCys: 1.369 ± 0.012
GlyAsp: 3.313 ± 0.017
GlyGlu: 3.582 ± 0.016
GlyPhe: 3.156 ± 0.016
GlyGly: 4.972 ± 0.031
GlyHis: 1.562 ± 0.011
GlyIle: 3.702 ± 0.014
GlyLys: 4.194 ± 0.017
GlyLeu: 5.918 ± 0.022
GlyMet: 1.57 ± 0.013
GlyAsn: 3.317 ± 0.016
GlyPro: 2.469 ± 0.016
GlyGln: 2.117 ± 0.015
GlyArg: 3.619 ± 0.018
GlySer: 6.006 ± 0.026
GlyThr: 3.403 ± 0.017
GlyVal: 4.099 ± 0.018
GlyTrp: 0.919 ± 0.009
GlyTyr: 2.038 ± 0.013
GlyXaa: 0.003 ± 0.0
His
HisAla: 1.39 ± 0.01
HisCys: 0.554 ± 0.007
HisAsp: 1.18 ± 0.009
HisGlu: 1.346 ± 0.011
HisPhe: 1.109 ± 0.009
HisGly: 1.769 ± 0.013
HisHis: 0.939 ± 0.011
HisIle: 1.266 ± 0.01
HisLys: 1.213 ± 0.009
HisLeu: 2.46 ± 0.015
HisMet: 0.561 ± 0.006
HisAsn: 0.986 ± 0.01
HisPro: 1.443 ± 0.011
HisGln: 1.011 ± 0.008
HisArg: 1.406 ± 0.01
HisSer: 1.891 ± 0.013
HisThr: 0.957 ± 0.009
HisVal: 1.559 ± 0.011
HisTrp: 0.315 ± 0.005
HisTyr: 0.71 ± 0.007
HisXaa: 0.001 ± 0.0
Ile
IleAla: 3.612 ± 0.017
IleCys: 1.145 ± 0.009
IleAsp: 3.029 ± 0.015
IleGlu: 3.363 ± 0.016
IlePhe: 2.335 ± 0.014
IleGly: 3.543 ± 0.018
IleHis: 1.381 ± 0.01
IleIle: 2.914 ± 0.013
IleLys: 3.06 ± 0.013
IleLeu: 5.15 ± 0.02
IleMet: 1.205 ± 0.01
IleAsn: 2.365 ± 0.014
IlePro: 3.065 ± 0.019
IleGln: 2.022 ± 0.011
IleArg: 2.663 ± 0.014
IleSer: 4.906 ± 0.018
IleThr: 2.649 ± 0.015
IleVal: 3.519 ± 0.017
IleTrp: 0.728 ± 0.008
IleTyr: 1.475 ± 0.01
IleXaa: 0.003 ± 0.0
Lys
LysAla: 4.179 ± 0.02
LysCys: 1.031 ± 0.009
LysAsp: 3.259 ± 0.018
LysGlu: 4.722 ± 0.022
LysPhe: 2.307 ± 0.013
LysGly: 3.84 ± 0.016
LysHis: 1.384 ± 0.01
LysIle: 3.274 ± 0.013
LysLys: 4.667 ± 0.028
LysLeu: 6.317 ± 0.025
LysMet: 1.54 ± 0.011
LysAsn: 2.802 ± 0.012
LysPro: 2.988 ± 0.02
LysGln: 2.398 ± 0.015
LysArg: 3.538 ± 0.018
LysSer: 4.534 ± 0.019
LysThr: 2.958 ± 0.015
LysVal: 4.111 ± 0.02
LysTrp: 0.818 ± 0.008
LysTyr: 1.618 ± 0.011
LysXaa: 0.003 ± 0.0
Leu
LeuAla: 6.405 ± 0.025
LeuCys: 1.839 ± 0.012
LeuAsp: 5.165 ± 0.023
LeuGlu: 6.397 ± 0.03
LeuPhe: 3.846 ± 0.02
LeuGly: 5.846 ± 0.022
LeuHis: 2.564 ± 0.014
LeuIle: 4.649 ± 0.019
LeuLys: 6.583 ± 0.025
LeuLeu: 9.711 ± 0.036
LeuMet: 2.187 ± 0.013
LeuAsn: 4.29 ± 0.019
LeuPro: 5.141 ± 0.022
LeuGln: 4.365 ± 0.021
LeuArg: 5.453 ± 0.023
LeuSer: 8.675 ± 0.038
LeuThr: 4.654 ± 0.022
LeuVal: 6.267 ± 0.022
LeuTrp: 1.18 ± 0.008
LeuTyr: 2.505 ± 0.013
LeuXaa: 0.004 ± 0.0
Met
MetAla: 2.211 ± 0.014
MetCys: 0.319 ± 0.004
MetAsp: 1.441 ± 0.011
MetGlu: 2.135 ± 0.013
MetPhe: 0.842 ± 0.008
MetGly: 1.658 ± 0.013
MetHis: 0.546 ± 0.006
MetIle: 1.252 ± 0.01
MetLys: 1.694 ± 0.013
MetLeu: 2.308 ± 0.013
MetMet: 0.687 ± 0.007
MetAsn: 1.08 ± 0.009
MetPro: 1.093 ± 0.009
MetGln: 0.925 ± 0.009
MetArg: 1.183 ± 0.009
MetSer: 1.825 ± 0.011
MetThr: 1.076 ± 0.01
MetVal: 1.831 ± 0.011
MetTrp: 0.257 ± 0.004
MetTyr: 0.577 ± 0.006
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.718 ± 0.014
AsnCys: 0.94 ± 0.009
AsnAsp: 2.174 ± 0.013
AsnGlu: 2.796 ± 0.015
AsnPhe: 1.97 ± 0.013
AsnGly: 3.48 ± 0.017
AsnHis: 1.146 ± 0.009
AsnIle: 2.615 ± 0.016
AsnLys: 2.617 ± 0.013
AsnLeu: 4.841 ± 0.029
AsnMet: 1.201 ± 0.009
AsnAsn: 2.506 ± 0.018
AsnPro: 2.452 ± 0.014
AsnGln: 1.899 ± 0.013
AsnArg: 2.101 ± 0.012
AsnSer: 3.93 ± 0.019
AsnThr: 2.152 ± 0.014
AsnVal: 2.88 ± 0.017
AsnTrp: 0.591 ± 0.007
AsnTyr: 1.324 ± 0.01
AsnXaa: 0.002 ± 0.0
Pro
ProAla: 2.806 ± 0.017
ProCys: 0.889 ± 0.009
ProAsp: 2.436 ± 0.014
ProGlu: 3.086 ± 0.014
ProPhe: 2.11 ± 0.012
ProGly: 2.644 ± 0.017
ProHis: 1.147 ± 0.01
ProIle: 2.404 ± 0.015
ProLys: 2.935 ± 0.017
ProLeu: 4.359 ± 0.019
ProMet: 1.078 ± 0.009
ProAsn: 2.481 ± 0.015
ProPro: 3.727 ± 0.044
ProGln: 1.838 ± 0.012
ProArg: 2.491 ± 0.017
ProSer: 5.17 ± 0.023
ProThr: 2.73 ± 0.016
ProVal: 3.003 ± 0.016
ProTrp: 0.671 ± 0.007
ProTyr: 1.372 ± 0.011
ProXaa: 0.002 ± 0.0
Gln
GlnAla: 2.44 ± 0.015
GlnCys: 0.621 ± 0.006
GlnAsp: 1.609 ± 0.011
GlnGlu: 2.294 ± 0.015
GlnPhe: 1.439 ± 0.01
GlnGly: 2.223 ± 0.013
GlnHis: 0.987 ± 0.009
GlnIle: 2.119 ± 0.012
GlnLys: 2.272 ± 0.014
GlnLeu: 3.785 ± 0.018
GlnMet: 1.019 ± 0.009
GlnAsn: 1.858 ± 0.013
GlnPro: 1.759 ± 0.012
GlnGln: 2.032 ± 0.02
GlnArg: 2.057 ± 0.012
GlnSer: 2.839 ± 0.017
GlnThr: 1.76 ± 0.01
GlnVal: 2.347 ± 0.013
GlnTrp: 0.498 ± 0.007
GlnTyr: 0.978 ± 0.008
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 3.124 ± 0.015
ArgCys: 1.017 ± 0.01
ArgAsp: 2.563 ± 0.016
ArgGlu: 3.211 ± 0.017
ArgPhe: 2.268 ± 0.014
ArgGly: 3.179 ± 0.017
ArgHis: 1.328 ± 0.01
ArgIle: 2.87 ± 0.013
ArgLys: 3.794 ± 0.02
ArgLeu: 5.136 ± 0.02
ArgMet: 1.312 ± 0.011
ArgAsn: 2.535 ± 0.012
ArgPro: 2.318 ± 0.015
ArgGln: 1.882 ± 0.011
ArgArg: 3.765 ± 0.022
ArgSer: 4.135 ± 0.018
ArgThr: 2.51 ± 0.013
ArgVal: 3.29 ± 0.016
ArgTrp: 0.74 ± 0.008
ArgTyr: 1.47 ± 0.01
ArgXaa: 0.003 ± 0.0
Ser
SerAla: 4.991 ± 0.023
SerCys: 1.707 ± 0.012
SerAsp: 4.209 ± 0.019
SerGlu: 4.647 ± 0.023
SerPhe: 4.039 ± 0.021
SerGly: 5.744 ± 0.022
SerHis: 1.996 ± 0.013
SerIle: 4.681 ± 0.019
SerLys: 5.139 ± 0.022
SerLeu: 8.726 ± 0.036
SerMet: 2.195 ± 0.011
SerAsn: 4.293 ± 0.019
SerPro: 4.435 ± 0.028
SerGln: 2.96 ± 0.018
SerArg: 4.347 ± 0.02
SerSer: 10.622 ± 0.04
SerThr: 4.728 ± 0.018
SerVal: 5.019 ± 0.022
SerTrp: 1.176 ± 0.01
SerTyr: 2.338 ± 0.015
SerXaa: 0.004 ± 0.001
Thr
ThrAla: 3.404 ± 0.014
ThrCys: 0.97 ± 0.008
ThrAsp: 2.278 ± 0.014
ThrGlu: 2.771 ± 0.014
ThrPhe: 2.11 ± 0.013
ThrGly: 3.284 ± 0.018
ThrHis: 1.155 ± 0.009
ThrIle: 2.854 ± 0.015
ThrLys: 2.756 ± 0.016
ThrLeu: 4.758 ± 0.019
ThrMet: 1.243 ± 0.009
ThrAsn: 2.149 ± 0.013
ThrPro: 2.558 ± 0.016
ThrGln: 1.575 ± 0.01
ThrArg: 2.472 ± 0.015
ThrSer: 4.607 ± 0.022
ThrThr: 3.049 ± 0.015
ThrVal: 3.52 ± 0.016
ThrTrp: 0.669 ± 0.007
ThrTyr: 1.386 ± 0.011
ThrXaa: 0.003 ± 0.0
Val
ValAla: 4.649 ± 0.019
ValCys: 1.132 ± 0.01
ValAsp: 3.636 ± 0.016
ValGlu: 4.437 ± 0.019
ValPhe: 2.684 ± 0.015
ValGly: 4.231 ± 0.019
ValHis: 1.496 ± 0.011
ValIle: 3.397 ± 0.016
ValLys: 3.921 ± 0.016
ValLeu: 6.319 ± 0.021
ValMet: 1.544 ± 0.01
ValAsn: 2.739 ± 0.014
ValPro: 3.269 ± 0.017
ValGln: 2.281 ± 0.012
ValArg: 2.998 ± 0.015
ValSer: 5.41 ± 0.021
ValThr: 3.168 ± 0.016
ValVal: 4.812 ± 0.022
ValTrp: 0.773 ± 0.008
ValTyr: 1.891 ± 0.011
ValXaa: 0.003 ± 0.001
Trp
TrpAla: 0.776 ± 0.008
TrpCys: 0.268 ± 0.004
TrpAsp: 0.714 ± 0.008
TrpGlu: 0.809 ± 0.008
TrpPhe: 0.549 ± 0.006
TrpGly: 0.767 ± 0.008
TrpHis: 0.334 ± 0.005
TrpIle: 0.733 ± 0.008
TrpLys: 0.964 ± 0.009
TrpLeu: 1.242 ± 0.01
TrpMet: 0.351 ± 0.005
TrpAsn: 0.715 ± 0.008
TrpPro: 0.523 ± 0.006
TrpGln: 0.474 ± 0.006
TrpArg: 0.88 ± 0.008
TrpSer: 0.978 ± 0.008
TrpThr: 0.625 ± 0.006
TrpVal: 0.843 ± 0.008
TrpTrp: 0.26 ± 0.004
TrpTyr: 0.341 ± 0.005
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.678 ± 0.011
TyrCys: 0.658 ± 0.008
TyrAsp: 1.506 ± 0.011
TyrGlu: 1.592 ± 0.011
TyrPhe: 1.349 ± 0.011
TyrGly: 2.13 ± 0.012
TyrHis: 0.736 ± 0.007
TyrIle: 1.439 ± 0.009
TyrLys: 1.543 ± 0.013
TyrLeu: 2.794 ± 0.016
TyrMet: 0.758 ± 0.008
TyrAsn: 1.347 ± 0.01
TyrPro: 1.27 ± 0.01
TyrGln: 0.924 ± 0.008
TyrArg: 1.457 ± 0.01
TyrSer: 2.213 ± 0.012
TyrThr: 1.277 ± 0.012
TyrVal: 1.712 ± 0.012
TyrTrp: 0.398 ± 0.005
TyrTyr: 0.954 ± 0.014
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.003 ± 0.0
XaaCys: 0.001 ± 0.0
XaaAsp: 0.002 ± 0.0
XaaGlu: 0.003 ± 0.0
XaaPhe: 0.002 ± 0.0
XaaGly: 0.003 ± 0.0
XaaHis: 0.001 ± 0.0
XaaIle: 0.003 ± 0.0
XaaLys: 0.003 ± 0.0
XaaLeu: 0.004 ± 0.0
XaaMet: 0.002 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.002 ± 0.0
XaaGln: 0.002 ± 0.0
XaaArg: 0.003 ± 0.0
XaaSer: 0.003 ± 0.0
XaaThr: 0.002 ± 0.0
XaaVal: 0.003 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.1 ± 0.036
Statistics based on 40046 proteins (15047206 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
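
A minimal sketch of protein-level bootstrapping follows, assuming the reported error is the standard deviation of the frequency across replicates. The source states only that 100 bootstrap replicates were drawn at the protein level; the estimator, the function names, and the toy data are assumptions. Whole proteins are resampled with replacement, so dipeptides from the same protein stay together.

```python
import random
import statistics


def dipeptide_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MSSLLA", "MSLA", "MAAAK", "MKLLSS", "MSSSA"]
point = dipeptide_frequency(toy, "SS")
error = bootstrap_error(toy, "SS")
print(f"SS: {point:.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein, which is presumably why the error is estimated at the protein level.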

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski