Amino acid dipepetide frequency for Thraustotheca clavata

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.813AlaAla: 6.813 ± 0.055
1.345AlaCys: 1.345 ± 0.014
3.087AlaAsp: 3.087 ± 0.027
3.78AlaGlu: 3.78 ± 0.03
3.315AlaPhe: 3.315 ± 0.026
3.582AlaGly: 3.582 ± 0.037
1.665AlaHis: 1.665 ± 0.016
5.234AlaIle: 5.234 ± 0.032
4.632AlaLys: 4.632 ± 0.04
8.133AlaLeu: 8.133 ± 0.042
2.492AlaMet: 2.492 ± 0.02
3.307AlaAsn: 3.307 ± 0.031
3.671AlaPro: 3.671 ± 0.034
3.274AlaGln: 3.274 ± 0.026
3.326AlaArg: 3.326 ± 0.024
6.006AlaSer: 6.006 ± 0.031
5.197AlaThr: 5.197 ± 0.034
4.736AlaVal: 4.736 ± 0.03
0.912AlaTrp: 0.912 ± 0.014
2.208AlaTyr: 2.208 ± 0.018
0.0AlaXaa: 0.0 ± 0.0
Cys
1.249CysAla: 1.249 ± 0.016
0.424CysCys: 0.424 ± 0.009
0.937CysAsp: 0.937 ± 0.014
0.853CysGlu: 0.853 ± 0.012
0.872CysPhe: 0.872 ± 0.012
1.263CysGly: 1.263 ± 0.017
0.518CysHis: 0.518 ± 0.01
1.256CysIle: 1.256 ± 0.014
0.894CysLys: 0.894 ± 0.012
1.84CysLeu: 1.84 ± 0.015
0.493CysMet: 0.493 ± 0.01
0.787CysAsn: 0.787 ± 0.013
0.928CysPro: 0.928 ± 0.02
0.748CysGln: 0.748 ± 0.01
0.904CysArg: 0.904 ± 0.014
1.432CysSer: 1.432 ± 0.017
1.323CysThr: 1.323 ± 0.019
1.279CysVal: 1.279 ± 0.015
0.237CysTrp: 0.237 ± 0.007
0.589CysTyr: 0.589 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
4.166AspAla: 4.166 ± 0.028
0.896AspCys: 0.896 ± 0.012
3.463AspAsp: 3.463 ± 0.035
3.985AspGlu: 3.985 ± 0.034
2.103AspPhe: 2.103 ± 0.019
3.109AspGly: 3.109 ± 0.026
1.171AspHis: 1.171 ± 0.015
3.699AspIle: 3.699 ± 0.028
2.628AspLys: 2.628 ± 0.023
4.536AspLeu: 4.536 ± 0.027
1.514AspMet: 1.514 ± 0.016
2.184AspAsn: 2.184 ± 0.02
2.309AspPro: 2.309 ± 0.022
1.761AspGln: 1.761 ± 0.017
2.138AspArg: 2.138 ± 0.02
3.282AspSer: 3.282 ± 0.026
3.056AspThr: 3.056 ± 0.02
3.732AspVal: 3.732 ± 0.022
0.606AspTrp: 0.606 ± 0.011
1.546AspTyr: 1.546 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
4.869GluAla: 4.869 ± 0.04
0.993GluCys: 0.993 ± 0.014
3.342GluAsp: 3.342 ± 0.03
4.728GluGlu: 4.728 ± 0.05
2.19GluPhe: 2.19 ± 0.02
2.534GluGly: 2.534 ± 0.022
1.445GluHis: 1.445 ± 0.015
3.586GluIle: 3.586 ± 0.025
3.904GluLys: 3.904 ± 0.034
5.996GluLeu: 5.996 ± 0.042
1.746GluMet: 1.746 ± 0.018
2.955GluAsn: 2.955 ± 0.026
2.171GluPro: 2.171 ± 0.023
2.366GluGln: 2.366 ± 0.024
3.172GluArg: 3.172 ± 0.031
4.074GluSer: 4.074 ± 0.029
2.859GluThr: 2.859 ± 0.024
3.416GluVal: 3.416 ± 0.027
0.843GluTrp: 0.843 ± 0.013
1.986GluTyr: 1.986 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.021PheAla: 3.021 ± 0.025
0.861PheCys: 0.861 ± 0.012
2.45PheAsp: 2.45 ± 0.022
2.37PheGlu: 2.37 ± 0.02
1.761PhePhe: 1.761 ± 0.021
2.91PheGly: 2.91 ± 0.029
1.12PheHis: 1.12 ± 0.014
2.255PheIle: 2.255 ± 0.021
1.797PheLys: 1.797 ± 0.018
3.977PheLeu: 3.977 ± 0.034
0.958PheMet: 0.958 ± 0.014
1.812PheAsn: 1.812 ± 0.02
1.677PhePro: 1.677 ± 0.016
1.77PheGln: 1.77 ± 0.018
1.839PheArg: 1.839 ± 0.019
2.913PheSer: 2.913 ± 0.022
2.503PheThr: 2.503 ± 0.022
2.997PheVal: 2.997 ± 0.022
0.51PheTrp: 0.51 ± 0.009
1.396PheTyr: 1.396 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
3.857GlyAla: 3.857 ± 0.038
1.152GlyCys: 1.152 ± 0.018
2.75GlyAsp: 2.75 ± 0.023
2.518GlyGlu: 2.518 ± 0.021
2.451GlyPhe: 2.451 ± 0.021
3.56GlyGly: 3.56 ± 0.06
1.627GlyHis: 1.627 ± 0.022
3.432GlyIle: 3.432 ± 0.027
2.848GlyLys: 2.848 ± 0.023
4.809GlyLeu: 4.809 ± 0.032
1.377GlyMet: 1.377 ± 0.018
2.524GlyAsn: 2.524 ± 0.029
1.79GlyPro: 1.79 ± 0.022
1.955GlyGln: 1.955 ± 0.02
2.445GlyArg: 2.445 ± 0.025
3.964GlySer: 3.964 ± 0.053
3.232GlyThr: 3.232 ± 0.04
3.637GlyVal: 3.637 ± 0.03
0.774GlyTrp: 0.774 ± 0.013
2.005GlyTyr: 2.005 ± 0.02
0.0GlyXaa: 0.0 ± 0.0
His
1.896HisAla: 1.896 ± 0.021
0.543HisCys: 0.543 ± 0.01
1.355HisAsp: 1.355 ± 0.016
1.651HisGlu: 1.651 ± 0.018
1.121HisPhe: 1.121 ± 0.015
1.635HisGly: 1.635 ± 0.02
0.893HisHis: 0.893 ± 0.015
1.526HisIle: 1.526 ± 0.017
1.142HisLys: 1.142 ± 0.015
2.695HisLeu: 2.695 ± 0.025
0.543HisMet: 0.543 ± 0.01
1.013HisAsn: 1.013 ± 0.014
1.3HisPro: 1.3 ± 0.014
1.139HisGln: 1.139 ± 0.013
1.56HisArg: 1.56 ± 0.017
1.716HisSer: 1.716 ± 0.019
1.412HisThr: 1.412 ± 0.015
1.86HisVal: 1.86 ± 0.018
0.347HisTrp: 0.347 ± 0.007
0.87HisTyr: 0.87 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
5.366IleAla: 5.366 ± 0.033
1.108IleCys: 1.108 ± 0.012
3.541IleAsp: 3.541 ± 0.025
3.686IleGlu: 3.686 ± 0.028
2.285IlePhe: 2.285 ± 0.022
3.07IleGly: 3.07 ± 0.029
1.522IleHis: 1.522 ± 0.015
3.029IleIle: 3.029 ± 0.024
2.923IleLys: 2.923 ± 0.025
5.601IleLeu: 5.601 ± 0.032
1.326IleMet: 1.326 ± 0.016
2.454IleAsn: 2.454 ± 0.021
2.996IlePro: 2.996 ± 0.025
2.766IleGln: 2.766 ± 0.024
2.666IleArg: 2.666 ± 0.025
4.231IleSer: 4.231 ± 0.036
3.321IleThr: 3.321 ± 0.029
4.624IleVal: 4.624 ± 0.032
0.673IleTrp: 0.673 ± 0.011
1.828IleTyr: 1.828 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
4.433LysAla: 4.433 ± 0.041
1.011LysCys: 1.011 ± 0.015
2.905LysAsp: 2.905 ± 0.024
3.734LysGlu: 3.734 ± 0.034
1.806LysPhe: 1.806 ± 0.017
2.367LysGly: 2.367 ± 0.023
1.486LysHis: 1.486 ± 0.016
2.801LysIle: 2.801 ± 0.023
4.214LysLys: 4.214 ± 0.052
5.328LysLeu: 5.328 ± 0.033
1.422LysMet: 1.422 ± 0.016
2.394LysAsn: 2.394 ± 0.018
2.509LysPro: 2.509 ± 0.027
2.667LysGln: 2.667 ± 0.025
3.24LysArg: 3.24 ± 0.034
4.187LysSer: 4.187 ± 0.03
3.145LysThr: 3.145 ± 0.023
3.227LysVal: 3.227 ± 0.025
0.757LysTrp: 0.757 ± 0.012
1.969LysTyr: 1.969 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.342LeuAla: 7.342 ± 0.036
1.93LeuCys: 1.93 ± 0.021
5.314LeuAsp: 5.314 ± 0.035
6.128LeuGlu: 6.128 ± 0.045
3.972LeuPhe: 3.972 ± 0.037
4.923LeuGly: 4.923 ± 0.032
3.202LeuHis: 3.202 ± 0.029
4.757LeuIle: 4.757 ± 0.032
5.411LeuLys: 5.411 ± 0.031
10.391LeuLeu: 10.391 ± 0.066
2.275LeuMet: 2.275 ± 0.018
4.369LeuAsn: 4.369 ± 0.031
4.828LeuPro: 4.828 ± 0.035
5.091LeuGln: 5.091 ± 0.035
5.274LeuArg: 5.274 ± 0.032
7.744LeuSer: 7.744 ± 0.046
5.614LeuThr: 5.614 ± 0.04
6.418LeuVal: 6.418 ± 0.037
1.239LeuTrp: 1.239 ± 0.015
3.077LeuTyr: 3.077 ± 0.023
0.0LeuXaa: 0.0 ± 0.0
Met
2.141MetAla: 2.141 ± 0.02
0.447MetCys: 0.447 ± 0.009
1.681MetAsp: 1.681 ± 0.016
1.762MetGlu: 1.762 ± 0.02
0.89MetPhe: 0.89 ± 0.013
1.312MetGly: 1.312 ± 0.016
0.727MetHis: 0.727 ± 0.01
1.339MetIle: 1.339 ± 0.016
1.459MetLys: 1.459 ± 0.019
2.56MetLeu: 2.56 ± 0.019
0.72MetMet: 0.72 ± 0.01
1.269MetAsn: 1.269 ± 0.014
1.145MetPro: 1.145 ± 0.012
1.277MetGln: 1.277 ± 0.014
1.147MetArg: 1.147 ± 0.013
1.877MetSer: 1.877 ± 0.017
1.722MetThr: 1.722 ± 0.017
1.559MetVal: 1.559 ± 0.018
0.323MetTrp: 0.323 ± 0.006
0.881MetTyr: 0.881 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
3.725AsnAla: 3.725 ± 0.03
0.813AsnCys: 0.813 ± 0.013
2.604AsnAsp: 2.604 ± 0.023
2.917AsnGlu: 2.917 ± 0.024
1.642AsnPhe: 1.642 ± 0.017
3.063AsnGly: 3.063 ± 0.031
1.1AsnHis: 1.1 ± 0.012
2.678AsnIle: 2.678 ± 0.027
2.153AsnLys: 2.153 ± 0.019
3.882AsnLeu: 3.882 ± 0.027
1.141AsnMet: 1.141 ± 0.015
2.106AsnAsn: 2.106 ± 0.022
2.191AsnPro: 2.191 ± 0.018
1.999AsnGln: 1.999 ± 0.019
1.943AsnArg: 1.943 ± 0.018
3.207AsnSer: 3.207 ± 0.027
2.685AsnThr: 2.685 ± 0.021
3.07AsnVal: 3.07 ± 0.024
0.554AsnTrp: 0.554 ± 0.01
1.424AsnTyr: 1.424 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
3.186ProAla: 3.186 ± 0.03
0.654ProCys: 0.654 ± 0.012
1.923ProAsp: 1.923 ± 0.02
2.739ProGlu: 2.739 ± 0.025
1.848ProPhe: 1.848 ± 0.018
1.998ProGly: 1.998 ± 0.025
1.05ProHis: 1.05 ± 0.014
2.639ProIle: 2.639 ± 0.022
2.74ProLys: 2.74 ± 0.025
4.606ProLeu: 4.606 ± 0.03
1.212ProMet: 1.212 ± 0.014
2.214ProAsn: 2.214 ± 0.019
3.13ProPro: 3.13 ± 0.039
1.899ProGln: 1.899 ± 0.021
2.178ProArg: 2.178 ± 0.019
4.449ProSer: 4.449 ± 0.039
3.791ProThr: 3.791 ± 0.04
3.045ProVal: 3.045 ± 0.027
0.577ProTrp: 0.577 ± 0.009
1.304ProTyr: 1.304 ± 0.014
0.0ProXaa: 0.0 ± 0.0
Gln
3.422GlnAla: 3.422 ± 0.028
0.89GlnCys: 0.89 ± 0.015
1.965GlnAsp: 1.965 ± 0.019
2.689GlnGlu: 2.689 ± 0.025
1.624GlnPhe: 1.624 ± 0.018
1.895GlnGly: 1.895 ± 0.021
1.204GlnHis: 1.204 ± 0.013
2.32GlnIle: 2.32 ± 0.023
2.291GlnLys: 2.291 ± 0.022
4.89GlnLeu: 4.89 ± 0.031
1.095GlnMet: 1.095 ± 0.013
1.851GlnAsn: 1.851 ± 0.018
1.861GlnPro: 1.861 ± 0.018
2.324GlnGln: 2.324 ± 0.03
2.469GlnArg: 2.469 ± 0.023
3.208GlnSer: 3.208 ± 0.024
2.534GlnThr: 2.534 ± 0.023
2.983GlnVal: 2.983 ± 0.023
0.713GlnTrp: 0.713 ± 0.011
1.488GlnTyr: 1.488 ± 0.019
0.0GlnXaa: 0.0 ± 0.0
Arg
3.337ArgAla: 3.337 ± 0.024
0.9ArgCys: 0.9 ± 0.013
2.346ArgAsp: 2.346 ± 0.022
2.861ArgGlu: 2.861 ± 0.027
1.981ArgPhe: 1.981 ± 0.017
2.478ArgGly: 2.478 ± 0.023
1.426ArgHis: 1.426 ± 0.015
2.853ArgIle: 2.853 ± 0.02
3.03ArgLys: 3.03 ± 0.029
4.958ArgLeu: 4.958 ± 0.035
1.279ArgMet: 1.279 ± 0.013
2.128ArgAsn: 2.128 ± 0.018
1.97ArgPro: 1.97 ± 0.021
2.399ArgGln: 2.399 ± 0.022
3.311ArgArg: 3.311 ± 0.032
3.503ArgSer: 3.503 ± 0.025
2.56ArgThr: 2.56 ± 0.022
3.181ArgVal: 3.181 ± 0.023
0.668ArgTrp: 0.668 ± 0.011
1.54ArgTyr: 1.54 ± 0.016
0.0ArgXaa: 0.0 ± 0.0
Ser
5.045SerAla: 5.045 ± 0.04
1.392SerCys: 1.392 ± 0.014
3.524SerAsp: 3.524 ± 0.025
3.56SerGlu: 3.56 ± 0.026
3.503SerPhe: 3.503 ± 0.026
4.017SerGly: 4.017 ± 0.068
1.713SerHis: 1.713 ± 0.017
5.163SerIle: 5.163 ± 0.035
4.183SerLys: 4.183 ± 0.029
7.355SerLeu: 7.355 ± 0.043
2.119SerMet: 2.119 ± 0.02
3.702SerAsn: 3.702 ± 0.025
3.903SerPro: 3.903 ± 0.036
2.759SerGln: 2.759 ± 0.02
3.295SerArg: 3.295 ± 0.024
7.59SerSer: 7.59 ± 0.065
5.662SerThr: 5.662 ± 0.044
4.689SerVal: 4.689 ± 0.031
1.012SerTrp: 1.012 ± 0.013
2.333SerTyr: 2.333 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
4.268ThrAla: 4.268 ± 0.031
1.12ThrCys: 1.12 ± 0.016
2.521ThrAsp: 2.521 ± 0.022
3.001ThrGlu: 3.001 ± 0.025
2.562ThrPhe: 2.562 ± 0.022
2.979ThrGly: 2.979 ± 0.041
1.299ThrHis: 1.299 ± 0.016
4.195ThrIle: 4.195 ± 0.028
3.632ThrLys: 3.632 ± 0.027
6.182ThrLeu: 6.182 ± 0.035
1.726ThrMet: 1.726 ± 0.015
2.759ThrAsn: 2.759 ± 0.024
3.831ThrPro: 3.831 ± 0.038
2.561ThrGln: 2.561 ± 0.026
2.586ThrArg: 2.586 ± 0.021
5.34ThrSer: 5.34 ± 0.047
4.826ThrThr: 4.826 ± 0.043
3.367ThrVal: 3.367 ± 0.028
0.877ThrTrp: 0.877 ± 0.012
1.793ThrTyr: 1.793 ± 0.016
0.0ThrXaa: 0.0 ± 0.0
Val
5.476ValAla: 5.476 ± 0.028
1.334ValCys: 1.334 ± 0.014
3.695ValAsp: 3.695 ± 0.024
3.82ValGlu: 3.82 ± 0.03
2.828ValPhe: 2.828 ± 0.027
3.399ValGly: 3.399 ± 0.031
1.842ValHis: 1.842 ± 0.019
3.536ValIle: 3.536 ± 0.026
3.409ValLys: 3.409 ± 0.024
6.95ValLeu: 6.95 ± 0.041
1.566ValMet: 1.566 ± 0.017
2.875ValAsn: 2.875 ± 0.022
3.196ValPro: 3.196 ± 0.027
3.028ValGln: 3.028 ± 0.024
2.929ValArg: 2.929 ± 0.02
4.476ValSer: 4.476 ± 0.032
3.224ValThr: 3.224 ± 0.029
5.161ValVal: 5.161 ± 0.037
0.876ValTrp: 0.876 ± 0.014
2.17ValTyr: 2.17 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
0.804TrpAla: 0.804 ± 0.01
0.273TrpCys: 0.273 ± 0.007
0.678TrpAsp: 0.678 ± 0.011
0.617TrpGlu: 0.617 ± 0.01
0.6TrpPhe: 0.6 ± 0.01
0.616TrpGly: 0.616 ± 0.013
0.343TrpHis: 0.343 ± 0.007
0.857TrpIle: 0.857 ± 0.012
0.829TrpLys: 0.829 ± 0.012
1.318TrpLeu: 1.318 ± 0.017
0.381TrpMet: 0.381 ± 0.007
0.689TrpAsn: 0.689 ± 0.013
0.453TrpPro: 0.453 ± 0.009
0.588TrpGln: 0.588 ± 0.01
0.693TrpArg: 0.693 ± 0.012
1.048TrpSer: 1.048 ± 0.015
0.906TrpThr: 0.906 ± 0.013
0.735TrpVal: 0.735 ± 0.012
0.187TrpTrp: 0.187 ± 0.005
0.478TrpTyr: 0.478 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.298TyrAla: 2.298 ± 0.02
0.74TyrCys: 0.74 ± 0.011
1.802TyrAsp: 1.802 ± 0.02
1.775TyrGlu: 1.775 ± 0.018
1.548TyrPhe: 1.548 ± 0.02
2.0TyrGly: 2.0 ± 0.021
0.866TyrHis: 0.866 ± 0.012
1.862TyrIle: 1.862 ± 0.017
1.479TyrLys: 1.479 ± 0.016
3.292TyrLeu: 3.292 ± 0.025
0.851TyrMet: 0.851 ± 0.012
1.506TyrAsn: 1.506 ± 0.015
1.323TyrPro: 1.323 ± 0.017
1.344TyrGln: 1.344 ± 0.015
1.594TyrArg: 1.594 ± 0.017
2.271TyrSer: 2.271 ± 0.021
1.852TyrThr: 1.852 ± 0.019
2.074TyrVal: 2.074 ± 0.019
0.407TyrTrp: 0.407 ± 0.008
1.219TyrTyr: 1.219 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13171 proteins (6794705 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski