Amino acid dipeptide frequency for Gossypium mustelinum (Cotton)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
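As an illustration of the idea, dipeptide frequencies can be counted roughly as in the sketch below. This is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes, the pooling of every non-standard residue into `X` (Xaa), and the choice to count overlapping pairs within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing (assumption).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


freqs = dipeptide_frequencies(["MKLLA", "MKU"])
print(len(freqs))    # number of possible pairs, including those with X
print(freqs["MK"])   # MK accounts for 2 of the 6 observed pairs
```

The returned dictionary always contains every possible pair, so dipeptides that never occur show up with a frequency of zero rather than being missing.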

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 would have a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
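The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and using the uniform value of 1/400 as the threshold is an assumption based on the description above, not a confirmed detail of how the page chooses its colours.

```python
# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_FRACTION = 1 / 400                     # 0.0025, i.e. 0.25%
EXPECTED_PER_MILLE = EXPECTED_FRACTION * 1000   # the same value in per mille


def per_mille_to_fraction(value):
    """Convert a table value given in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: LeuLeu = 9.912 and TrpTrp = 0.247 (both in per mille).
print(per_mille_to_fraction(9.912))
print(classify(9.912), classify(0.247))
```

Under this assumption LeuLeu falls on the overrepresented side and TrpTrp on the underrepresented side.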

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows: first residue; columns: second residue.

| First | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.101 ± 0.016 | 1.263 ± 0.006 | 3.042 ± 0.009 | 4.135 ± 0.014 | 2.761 ± 0.011 | 3.881 ± 0.013 | 1.215 ± 0.005 | 3.776 ± 0.013 | 3.921 ± 0.012 | 6.502 ± 0.018 | 1.78 ± 0.007 | 2.632 ± 0.009 | 2.749 ± 0.01 | 2.051 ± 0.01 | 3.181 ± 0.01 | 5.966 ± 0.014 | 3.567 ± 0.009 | 4.663 ± 0.012 | 0.746 ± 0.005 | 1.773 ± 0.008 | 0.0 ± 0.0 |
| Cys | 0.966 ± 0.006 | 0.574 ± 0.005 | 0.894 ± 0.005 | 0.921 ± 0.005 | 0.965 ± 0.005 | 1.384 ± 0.007 | 0.484 ± 0.004 | 1.03 ± 0.006 | 1.221 ± 0.007 | 2.009 ± 0.008 | 0.464 ± 0.004 | 0.949 ± 0.006 | 0.962 ± 0.006 | 0.627 ± 0.004 | 1.067 ± 0.005 | 1.933 ± 0.009 | 0.84 ± 0.006 | 1.023 ± 0.006 | 0.262 ± 0.002 | 0.575 ± 0.004 | 0.0 ± 0.0 |
| Asp | 3.351 ± 0.011 | 1.009 ± 0.006 | 3.675 ± 0.015 | 4.001 ± 0.013 | 2.323 ± 0.008 | 3.723 ± 0.011 | 1.212 ± 0.006 | 3.062 ± 0.009 | 2.781 ± 0.009 | 5.047 ± 0.015 | 1.312 ± 0.006 | 2.175 ± 0.008 | 2.581 ± 0.01 | 1.735 ± 0.007 | 2.288 ± 0.009 | 4.213 ± 0.011 | 2.159 ± 0.008 | 3.533 ± 0.01 | 0.703 ± 0.005 | 1.529 ± 0.006 | 0.0 ± 0.0 |
| Glu | 4.686 ± 0.016 | 0.937 ± 0.005 | 3.922 ± 0.015 | 6.138 ± 0.028 | 2.364 ± 0.008 | 3.648 ± 0.01 | 1.247 ± 0.007 | 3.769 ± 0.012 | 4.868 ± 0.019 | 6.016 ± 0.016 | 1.796 ± 0.009 | 3.318 ± 0.012 | 2.177 ± 0.009 | 2.224 ± 0.008 | 3.337 ± 0.011 | 4.61 ± 0.014 | 3.239 ± 0.014 | 4.096 ± 0.013 | 0.728 ± 0.004 | 1.619 ± 0.007 | 0.0 ± 0.0 |
| Phe | 2.443 ± 0.009 | 0.928 ± 0.005 | 2.347 ± 0.009 | 2.306 ± 0.009 | 2.052 ± 0.008 | 3.082 ± 0.013 | 1.153 ± 0.006 | 2.186 ± 0.007 | 2.225 ± 0.008 | 4.351 ± 0.013 | 1.002 ± 0.005 | 1.909 ± 0.008 | 2.199 ± 0.008 | 1.613 ± 0.006 | 2.031 ± 0.009 | 4.209 ± 0.011 | 2.015 ± 0.009 | 2.66 ± 0.008 | 0.586 ± 0.004 | 1.307 ± 0.007 | 0.0 ± 0.0 |
| Gly | 3.708 ± 0.013 | 1.346 ± 0.008 | 3.33 ± 0.009 | 3.586 ± 0.011 | 3.188 ± 0.01 | 5.127 ± 0.028 | 1.509 ± 0.006 | 3.648 ± 0.011 | 4.161 ± 0.013 | 5.855 ± 0.015 | 1.502 ± 0.007 | 3.345 ± 0.011 | 2.406 ± 0.009 | 2.072 ± 0.01 | 3.427 ± 0.012 | 6.161 ± 0.016 | 3.293 ± 0.011 | 4.02 ± 0.012 | 0.878 ± 0.005 | 2.042 ± 0.009 | 0.0 ± 0.0 |
| His | 1.357 ± 0.006 | 0.55 ± 0.004 | 1.131 ± 0.006 | 1.286 ± 0.006 | 1.107 ± 0.005 | 1.723 ± 0.008 | 1.009 ± 0.008 | 1.209 ± 0.006 | 1.186 ± 0.006 | 2.473 ± 0.009 | 0.531 ± 0.004 | 0.982 ± 0.006 | 1.408 ± 0.006 | 1.058 ± 0.005 | 1.365 ± 0.006 | 1.949 ± 0.008 | 0.905 ± 0.005 | 1.513 ± 0.007 | 0.317 ± 0.003 | 0.703 ± 0.004 | 0.0 ± 0.0 |
| Ile | 3.561 ± 0.011 | 1.176 ± 0.006 | 2.921 ± 0.009 | 3.314 ± 0.012 | 2.34 ± 0.009 | 3.43 ± 0.014 | 1.298 ± 0.005 | 2.872 ± 0.008 | 3.071 ± 0.01 | 5.19 ± 0.014 | 1.147 ± 0.005 | 2.321 ± 0.008 | 3.014 ± 0.011 | 2.016 ± 0.008 | 2.607 ± 0.01 | 4.981 ± 0.012 | 2.588 ± 0.009 | 3.426 ± 0.012 | 0.713 ± 0.005 | 1.487 ± 0.008 | 0.0 ± 0.0 |
| Lys | 4.168 ± 0.013 | 1.023 ± 0.007 | 3.333 ± 0.012 | 4.822 ± 0.023 | 2.217 ± 0.008 | 3.783 ± 0.011 | 1.429 ± 0.006 | 3.274 ± 0.01 | 4.826 ± 0.018 | 6.237 ± 0.015 | 1.539 ± 0.007 | 2.762 ± 0.009 | 2.91 ± 0.012 | 2.48 ± 0.008 | 3.661 ± 0.012 | 4.744 ± 0.013 | 2.97 ± 0.009 | 3.938 ± 0.011 | 0.802 ± 0.005 | 1.642 ± 0.007 | 0.0 ± 0.0 |
| Leu | 6.345 ± 0.015 | 1.853 ± 0.007 | 5.059 ± 0.014 | 6.479 ± 0.018 | 3.925 ± 0.012 | 5.685 ± 0.015 | 2.611 ± 0.009 | 4.646 ± 0.014 | 6.497 ± 0.018 | 9.912 ± 0.024 | 2.171 ± 0.007 | 4.221 ± 0.012 | 5.24 ± 0.013 | 4.496 ± 0.014 | 5.351 ± 0.015 | 8.848 ± 0.021 | 4.521 ± 0.013 | 6.244 ± 0.016 | 1.151 ± 0.006 | 2.537 ± 0.009 | 0.0 ± 0.0 |
| Met | 2.151 ± 0.009 | 0.318 ± 0.003 | 1.432 ± 0.006 | 2.071 ± 0.009 | 0.845 ± 0.005 | 1.644 ± 0.007 | 0.547 ± 0.004 | 1.18 ± 0.006 | 1.704 ± 0.007 | 2.241 ± 0.008 | 0.69 ± 0.004 | 1.073 ± 0.006 | 1.075 ± 0.006 | 0.967 ± 0.005 | 1.143 ± 0.006 | 1.782 ± 0.007 | 1.011 ± 0.005 | 1.711 ± 0.007 | 0.245 ± 0.003 | 0.549 ± 0.004 | 0.0 ± 0.0 |
| Asn | 2.731 ± 0.009 | 0.947 ± 0.006 | 2.203 ± 0.007 | 2.727 ± 0.012 | 1.991 ± 0.008 | 3.54 ± 0.012 | 1.164 ± 0.006 | 2.562 ± 0.009 | 2.586 ± 0.009 | 4.901 ± 0.018 | 1.165 ± 0.006 | 2.536 ± 0.012 | 2.458 ± 0.009 | 1.877 ± 0.008 | 2.093 ± 0.008 | 4.035 ± 0.013 | 1.999 ± 0.007 | 2.824 ± 0.01 | 0.578 ± 0.004 | 1.314 ± 0.006 | 0.0 ± 0.0 |
| Pro | 2.93 ± 0.011 | 0.84 ± 0.006 | 2.403 ± 0.009 | 3.089 ± 0.012 | 2.113 ± 0.008 | 2.667 ± 0.01 | 1.098 ± 0.006 | 2.366 ± 0.008 | 2.906 ± 0.013 | 4.434 ± 0.013 | 1.025 ± 0.005 | 2.374 ± 0.009 | 3.855 ± 0.035 | 1.885 ± 0.007 | 2.363 ± 0.01 | 5.298 ± 0.016 | 2.672 ± 0.01 | 3.072 ± 0.01 | 0.598 ± 0.004 | 1.319 ± 0.007 | 0.0 ± 0.0 |
| Gln | 2.429 ± 0.008 | 0.618 ± 0.004 | 1.654 ± 0.007 | 2.378 ± 0.009 | 1.416 ± 0.006 | 2.177 ± 0.008 | 1.009 ± 0.006 | 2.058 ± 0.009 | 2.356 ± 0.01 | 3.798 ± 0.011 | 1.009 ± 0.005 | 1.861 ± 0.008 | 1.826 ± 0.009 | 2.368 ± 0.022 | 2.141 ± 0.008 | 2.999 ± 0.011 | 1.769 ± 0.007 | 2.388 ± 0.009 | 0.478 ± 0.004 | 0.956 ± 0.005 | 0.0 ± 0.0 |
| Arg | 3.037 ± 0.01 | 1.007 ± 0.007 | 2.58 ± 0.009 | 3.212 ± 0.01 | 2.236 ± 0.009 | 3.05 ± 0.011 | 1.262 ± 0.006 | 2.822 ± 0.01 | 3.836 ± 0.012 | 5.008 ± 0.015 | 1.271 ± 0.006 | 2.521 ± 0.009 | 2.263 ± 0.009 | 1.869 ± 0.008 | 3.757 ± 0.015 | 4.289 ± 0.014 | 2.426 ± 0.008 | 3.15 ± 0.011 | 0.721 ± 0.005 | 1.451 ± 0.007 | 0.0 ± 0.0 |
| Ser | 5.287 ± 0.014 | 1.777 ± 0.008 | 4.381 ± 0.011 | 4.831 ± 0.017 | 4.194 ± 0.013 | 5.943 ± 0.018 | 2.005 ± 0.008 | 4.699 ± 0.012 | 5.218 ± 0.015 | 8.957 ± 0.02 | 2.187 ± 0.008 | 4.393 ± 0.014 | 4.653 ± 0.019 | 3.122 ± 0.011 | 4.396 ± 0.013 | 11.417 ± 0.032 | 4.817 ± 0.014 | 5.107 ± 0.012 | 1.129 ± 0.006 | 2.35 ± 0.008 | 0.0 ± 0.0 |
| Thr | 3.433 ± 0.011 | 0.955 ± 0.006 | 2.301 ± 0.009 | 2.773 ± 0.012 | 2.038 ± 0.008 | 3.256 ± 0.01 | 1.008 ± 0.005 | 2.773 ± 0.01 | 2.788 ± 0.01 | 4.651 ± 0.013 | 1.193 ± 0.005 | 2.156 ± 0.008 | 2.58 ± 0.009 | 1.541 ± 0.007 | 2.325 ± 0.008 | 4.729 ± 0.014 | 3.002 ± 0.012 | 3.319 ± 0.009 | 0.627 ± 0.005 | 1.331 ± 0.007 | 0.0 ± 0.0 |
| Val | 4.629 ± 0.013 | 1.132 ± 0.007 | 3.614 ± 0.01 | 4.337 ± 0.015 | 2.671 ± 0.009 | 4.029 ± 0.011 | 1.467 ± 0.007 | 3.375 ± 0.01 | 3.919 ± 0.012 | 6.189 ± 0.016 | 1.496 ± 0.007 | 2.674 ± 0.009 | 3.191 ± 0.011 | 2.273 ± 0.007 | 2.944 ± 0.01 | 5.467 ± 0.013 | 3.141 ± 0.012 | 4.649 ± 0.013 | 0.736 ± 0.005 | 1.895 ± 0.008 | 0.0 ± 0.0 |
| Trp | 0.726 ± 0.004 | 0.245 ± 0.003 | 0.693 ± 0.005 | 0.763 ± 0.004 | 0.547 ± 0.004 | 0.726 ± 0.005 | 0.301 ± 0.003 | 0.701 ± 0.004 | 0.946 ± 0.005 | 1.235 ± 0.006 | 0.342 ± 0.003 | 0.696 ± 0.005 | 0.499 ± 0.004 | 0.455 ± 0.003 | 0.828 ± 0.005 | 0.96 ± 0.006 | 0.609 ± 0.004 | 0.795 ± 0.005 | 0.247 ± 0.003 | 0.336 ± 0.003 | 0.0 ± 0.0 |
| Tyr | 1.666 ± 0.007 | 0.649 ± 0.004 | 1.498 ± 0.007 | 1.569 ± 0.007 | 1.313 ± 0.006 | 2.107 ± 0.009 | 0.733 ± 0.004 | 1.464 ± 0.006 | 1.541 ± 0.008 | 2.788 ± 0.011 | 0.756 ± 0.005 | 1.333 ± 0.007 | 1.263 ± 0.007 | 0.96 ± 0.006 | 1.463 ± 0.007 | 2.273 ± 0.009 | 1.238 ± 0.006 | 1.696 ± 0.008 | 0.405 ± 0.004 | 0.981 ± 0.007 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 91033 proteins (37160271 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
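Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. The sketch below shows one way this could be done for a single dipeptide. It is only an illustration: the function names are hypothetical, sequences are assumed to be one-letter strings, and reporting the standard deviation across the 100 replicates as the error is an assumption, since the page does not state which spread measure is used.

```python
import random
import statistics


def pair_fraction(proteins, pair):
    """Fraction of all overlapping residue pairs that equal `pair`."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the pair fraction across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_fraction(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKLLSSA", "MSSLLK", "MALLKSS", "MKKSLA"]
print(pair_fraction(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling proteins rather than residues keeps the pairs inside each sequence intact, so the spread reflects variation between proteins.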

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski