Amino acid dipepetide frequency for Kwoniella pini CBS 10737

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.181AlaAla: 6.181 ± 0.062
0.752AlaCys: 0.752 ± 0.014
3.118AlaAsp: 3.118 ± 0.031
4.361AlaGlu: 4.361 ± 0.042
2.546AlaPhe: 2.546 ± 0.032
4.797AlaGly: 4.797 ± 0.041
1.377AlaHis: 1.377 ± 0.02
4.122AlaIle: 4.122 ± 0.032
3.981AlaLys: 3.981 ± 0.034
6.565AlaLeu: 6.565 ± 0.047
1.54AlaMet: 1.54 ± 0.021
2.657AlaAsn: 2.657 ± 0.028
3.962AlaPro: 3.962 ± 0.05
2.954AlaGln: 2.954 ± 0.039
3.722AlaArg: 3.722 ± 0.033
6.648AlaSer: 6.648 ± 0.062
4.141AlaThr: 4.141 ± 0.04
3.921AlaVal: 3.921 ± 0.036
0.898AlaTrp: 0.898 ± 0.014
1.847AlaTyr: 1.847 ± 0.026
0.0AlaXaa: 0.0 ± 0.0
Cys
0.621CysAla: 0.621 ± 0.015
0.165CysCys: 0.165 ± 0.007
0.498CysAsp: 0.498 ± 0.012
0.509CysGlu: 0.509 ± 0.013
0.414CysPhe: 0.414 ± 0.01
0.745CysGly: 0.745 ± 0.019
0.243CysHis: 0.243 ± 0.009
0.612CysIle: 0.612 ± 0.013
0.491CysLys: 0.491 ± 0.012
1.037CysLeu: 1.037 ± 0.018
0.21CysMet: 0.21 ± 0.008
0.342CysAsn: 0.342 ± 0.01
0.516CysPro: 0.516 ± 0.016
0.383CysGln: 0.383 ± 0.012
0.479CysArg: 0.479 ± 0.01
0.763CysSer: 0.763 ± 0.014
0.539CysThr: 0.539 ± 0.013
0.554CysVal: 0.554 ± 0.011
0.16CysTrp: 0.16 ± 0.006
0.289CysTyr: 0.289 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
3.285AspAla: 3.285 ± 0.031
0.494AspCys: 0.494 ± 0.012
4.212AspAsp: 4.212 ± 0.048
5.113AspGlu: 5.113 ± 0.048
1.925AspPhe: 1.925 ± 0.02
3.592AspGly: 3.592 ± 0.035
1.226AspHis: 1.226 ± 0.018
3.253AspIle: 3.253 ± 0.029
2.902AspLys: 2.902 ± 0.036
5.229AspLeu: 5.229 ± 0.046
1.124AspMet: 1.124 ± 0.019
2.283AspAsn: 2.283 ± 0.027
3.014AspPro: 3.014 ± 0.027
2.207AspGln: 2.207 ± 0.027
2.723AspArg: 2.723 ± 0.037
4.07AspSer: 4.07 ± 0.03
2.662AspThr: 2.662 ± 0.028
3.24AspVal: 3.24 ± 0.03
0.844AspTrp: 0.844 ± 0.015
1.48AspTyr: 1.48 ± 0.021
0.0AspXaa: 0.0 ± 0.0
Glu
4.357GluAla: 4.357 ± 0.039
0.541GluCys: 0.541 ± 0.012
4.755GluAsp: 4.755 ± 0.043
7.018GluGlu: 7.018 ± 0.069
1.892GluPhe: 1.892 ± 0.024
4.261GluGly: 4.261 ± 0.037
1.183GluHis: 1.183 ± 0.018
4.16GluIle: 4.16 ± 0.042
4.633GluLys: 4.633 ± 0.055
5.257GluLeu: 5.257 ± 0.046
1.491GluMet: 1.491 ± 0.021
3.314GluAsn: 3.314 ± 0.037
2.379GluPro: 2.379 ± 0.028
2.4GluGln: 2.4 ± 0.027
3.814GluArg: 3.814 ± 0.044
4.577GluSer: 4.577 ± 0.037
3.061GluThr: 3.061 ± 0.028
3.484GluVal: 3.484 ± 0.034
0.928GluTrp: 0.928 ± 0.017
1.681GluTyr: 1.681 ± 0.021
0.0GluXaa: 0.0 ± 0.0
Phe
2.536PheAla: 2.536 ± 0.027
0.434PheCys: 0.434 ± 0.011
2.212PheAsp: 2.212 ± 0.023
2.127PheGlu: 2.127 ± 0.024
1.405PhePhe: 1.405 ± 0.022
2.751PheGly: 2.751 ± 0.035
0.773PheHis: 0.773 ± 0.013
1.964PheIle: 1.964 ± 0.026
1.806PheLys: 1.806 ± 0.021
3.105PheLeu: 3.105 ± 0.035
0.67PheMet: 0.67 ± 0.013
1.641PheAsn: 1.641 ± 0.023
1.851PhePro: 1.851 ± 0.021
1.195PheGln: 1.195 ± 0.016
1.609PheArg: 1.609 ± 0.017
3.014PheSer: 3.014 ± 0.028
2.027PheThr: 2.027 ± 0.021
1.96PheVal: 1.96 ± 0.027
0.534PheTrp: 0.534 ± 0.013
0.951PheTyr: 0.951 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
3.994GlyAla: 3.994 ± 0.042
0.701GlyCys: 0.701 ± 0.016
3.247GlyAsp: 3.247 ± 0.031
4.042GlyGlu: 4.042 ± 0.032
2.506GlyPhe: 2.506 ± 0.027
6.127GlyGly: 6.127 ± 0.077
1.457GlyHis: 1.457 ± 0.018
3.975GlyIle: 3.975 ± 0.035
4.358GlyLys: 4.358 ± 0.037
6.116GlyLeu: 6.116 ± 0.048
1.63GlyMet: 1.63 ± 0.027
2.962GlyAsn: 2.962 ± 0.03
3.006GlyPro: 3.006 ± 0.033
2.669GlyGln: 2.669 ± 0.032
3.595GlyArg: 3.595 ± 0.033
5.966GlySer: 5.966 ± 0.047
3.617GlyThr: 3.617 ± 0.037
3.776GlyVal: 3.776 ± 0.037
1.246GlyTrp: 1.246 ± 0.02
2.088GlyTyr: 2.088 ± 0.024
0.0GlyXaa: 0.0 ± 0.0
His
1.381HisAla: 1.381 ± 0.02
0.226HisCys: 0.226 ± 0.007
1.086HisAsp: 1.086 ± 0.019
1.168HisGlu: 1.168 ± 0.018
0.797HisPhe: 0.797 ± 0.014
1.326HisGly: 1.326 ± 0.018
0.846HisHis: 0.846 ± 0.02
1.222HisIle: 1.222 ± 0.02
0.951HisLys: 0.951 ± 0.015
2.326HisLeu: 2.326 ± 0.029
0.382HisMet: 0.382 ± 0.01
0.923HisAsn: 0.923 ± 0.017
1.713HisPro: 1.713 ± 0.024
1.011HisGln: 1.011 ± 0.018
1.264HisArg: 1.264 ± 0.018
2.033HisSer: 2.033 ± 0.026
1.297HisThr: 1.297 ± 0.019
1.138HisVal: 1.138 ± 0.017
0.265HisTrp: 0.265 ± 0.007
0.597HisTyr: 0.597 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.12IleAla: 4.12 ± 0.035
0.707IleCys: 0.707 ± 0.014
3.423IleAsp: 3.423 ± 0.031
3.757IleGlu: 3.757 ± 0.033
2.065IlePhe: 2.065 ± 0.024
3.769IleGly: 3.769 ± 0.038
1.348IleHis: 1.348 ± 0.021
3.661IleIle: 3.661 ± 0.036
3.454IleLys: 3.454 ± 0.037
5.26IleLeu: 5.26 ± 0.043
1.043IleMet: 1.043 ± 0.018
2.893IleAsn: 2.893 ± 0.031
4.16IlePro: 4.16 ± 0.032
2.267IleGln: 2.267 ± 0.023
3.009IleArg: 3.009 ± 0.032
5.461IleSer: 5.461 ± 0.042
3.349IleThr: 3.349 ± 0.027
3.05IleVal: 3.05 ± 0.032
0.827IleTrp: 0.827 ± 0.017
1.523IleTyr: 1.523 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.299LysAla: 4.299 ± 0.04
0.47LysCys: 0.47 ± 0.013
3.348LysAsp: 3.348 ± 0.032
4.632LysGlu: 4.632 ± 0.052
1.66LysPhe: 1.66 ± 0.021
3.959LysGly: 3.959 ± 0.035
1.065LysHis: 1.065 ± 0.018
3.327LysIle: 3.327 ± 0.034
4.57LysLys: 4.57 ± 0.053
4.703LysLeu: 4.703 ± 0.037
1.109LysMet: 1.109 ± 0.016
2.488LysAsn: 2.488 ± 0.029
2.988LysPro: 2.988 ± 0.032
2.079LysGln: 2.079 ± 0.023
3.783LysArg: 3.783 ± 0.033
4.977LysSer: 4.977 ± 0.048
2.991LysThr: 2.991 ± 0.028
3.156LysVal: 3.156 ± 0.028
0.801LysTrp: 0.801 ± 0.016
1.477LysTyr: 1.477 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
6.608LeuAla: 6.608 ± 0.046
0.915LeuCys: 0.915 ± 0.018
4.913LeuAsp: 4.913 ± 0.036
5.351LeuGlu: 5.351 ± 0.049
3.13LeuPhe: 3.13 ± 0.035
5.759LeuGly: 5.759 ± 0.04
1.982LeuHis: 1.982 ± 0.026
5.162LeuIle: 5.162 ± 0.046
5.063LeuLys: 5.063 ± 0.04
8.169LeuLeu: 8.169 ± 0.072
1.612LeuMet: 1.612 ± 0.02
4.373LeuAsn: 4.373 ± 0.036
6.409LeuPro: 6.409 ± 0.046
3.291LeuGln: 3.291 ± 0.034
4.723LeuArg: 4.723 ± 0.041
8.381LeuSer: 8.381 ± 0.054
5.069LeuThr: 5.069 ± 0.04
4.594LeuVal: 4.594 ± 0.043
0.992LeuTrp: 0.992 ± 0.016
2.181LeuTyr: 2.181 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
1.536MetAla: 1.536 ± 0.02
0.18MetCys: 0.18 ± 0.007
1.098MetAsp: 1.098 ± 0.017
1.159MetGlu: 1.159 ± 0.018
0.658MetPhe: 0.658 ± 0.013
1.418MetGly: 1.418 ± 0.025
0.313MetHis: 0.313 ± 0.01
1.155MetIle: 1.155 ± 0.016
1.105MetLys: 1.105 ± 0.02
1.552MetLeu: 1.552 ± 0.02
0.547MetMet: 0.547 ± 0.013
0.906MetAsn: 0.906 ± 0.019
1.12MetPro: 1.12 ± 0.018
0.66MetGln: 0.66 ± 0.015
1.064MetArg: 1.064 ± 0.016
2.145MetSer: 2.145 ± 0.024
1.3MetThr: 1.3 ± 0.018
1.047MetVal: 1.047 ± 0.019
0.217MetTrp: 0.217 ± 0.007
0.458MetTyr: 0.458 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.893AsnAla: 2.893 ± 0.028
0.377AsnCys: 0.377 ± 0.011
2.667AsnAsp: 2.667 ± 0.029
3.092AsnGlu: 3.092 ± 0.033
1.511AsnPhe: 1.511 ± 0.022
3.688AsnGly: 3.688 ± 0.035
1.083AsnHis: 1.083 ± 0.019
2.505AsnIle: 2.505 ± 0.03
2.549AsnLys: 2.549 ± 0.03
4.389AsnLeu: 4.389 ± 0.041
0.776AsnMet: 0.776 ± 0.017
2.895AsnAsn: 2.895 ± 0.05
2.743AsnPro: 2.743 ± 0.028
2.187AsnGln: 2.187 ± 0.028
2.136AsnArg: 2.136 ± 0.029
4.475AsnSer: 4.475 ± 0.043
2.677AsnThr: 2.677 ± 0.03
2.382AsnVal: 2.382 ± 0.025
0.603AsnTrp: 0.603 ± 0.011
1.177AsnTyr: 1.177 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
4.248ProAla: 4.248 ± 0.052
0.39ProCys: 0.39 ± 0.012
2.732ProAsp: 2.732 ± 0.024
3.354ProGlu: 3.354 ± 0.029
2.169ProPhe: 2.169 ± 0.025
3.236ProGly: 3.236 ± 0.036
1.38ProHis: 1.38 ± 0.021
3.86ProIle: 3.86 ± 0.031
3.061ProLys: 3.061 ± 0.03
5.28ProLeu: 5.28 ± 0.045
0.938ProMet: 0.938 ± 0.016
2.841ProAsn: 2.841 ± 0.028
5.945ProPro: 5.945 ± 0.089
2.379ProGln: 2.379 ± 0.039
2.879ProArg: 2.879 ± 0.028
7.838ProSer: 7.838 ± 0.069
4.648ProThr: 4.648 ± 0.045
3.018ProVal: 3.018 ± 0.034
0.581ProTrp: 0.581 ± 0.015
1.544ProTyr: 1.544 ± 0.022
0.0ProXaa: 0.0 ± 0.0
Gln
3.237GlnAla: 3.237 ± 0.037
0.345GlnCys: 0.345 ± 0.011
1.987GlnAsp: 1.987 ± 0.025
2.248GlnGlu: 2.248 ± 0.026
1.165GlnPhe: 1.165 ± 0.018
2.366GlnGly: 2.366 ± 0.03
0.959GlnHis: 0.959 ± 0.017
2.349GlnIle: 2.349 ± 0.028
1.942GlnLys: 1.942 ± 0.024
3.261GlnLeu: 3.261 ± 0.032
0.825GlnMet: 0.825 ± 0.019
2.069GlnAsn: 2.069 ± 0.025
2.615GlnPro: 2.615 ± 0.038
2.653GlnGln: 2.653 ± 0.091
2.163GlnArg: 2.163 ± 0.026
3.723GlnSer: 3.723 ± 0.035
2.272GlnThr: 2.272 ± 0.022
2.071GlnVal: 2.071 ± 0.026
0.487GlnTrp: 0.487 ± 0.012
1.096GlnTyr: 1.096 ± 0.019
0.0GlnXaa: 0.0 ± 0.0
Arg
3.596ArgAla: 3.596 ± 0.033
0.499ArgCys: 0.499 ± 0.012
2.831ArgAsp: 2.831 ± 0.031
3.449ArgGlu: 3.449 ± 0.04
1.779ArgPhe: 1.779 ± 0.022
3.3ArgGly: 3.3 ± 0.033
1.198ArgHis: 1.198 ± 0.019
2.968ArgIle: 2.968 ± 0.026
3.643ArgLys: 3.643 ± 0.035
4.7ArgLeu: 4.7 ± 0.039
1.139ArgMet: 1.139 ± 0.017
2.277ArgAsn: 2.277 ± 0.029
3.18ArgPro: 3.18 ± 0.036
2.263ArgGln: 2.263 ± 0.026
4.308ArgArg: 4.308 ± 0.046
5.022ArgSer: 5.022 ± 0.049
2.845ArgThr: 2.845 ± 0.027
2.617ArgVal: 2.617 ± 0.027
0.733ArgTrp: 0.733 ± 0.014
1.438ArgTyr: 1.438 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.511SerAla: 6.511 ± 0.058
0.708SerCys: 0.708 ± 0.017
4.459SerAsp: 4.459 ± 0.038
4.677SerGlu: 4.677 ± 0.037
3.259SerPhe: 3.259 ± 0.029
5.877SerGly: 5.877 ± 0.048
2.117SerHis: 2.117 ± 0.024
5.711SerIle: 5.711 ± 0.045
5.075SerLys: 5.075 ± 0.044
7.997SerLeu: 7.997 ± 0.049
1.656SerMet: 1.656 ± 0.022
4.984SerAsn: 4.984 ± 0.052
6.665SerPro: 6.665 ± 0.064
3.603SerGln: 3.603 ± 0.036
5.048SerArg: 5.048 ± 0.051
13.42SerSer: 13.42 ± 0.143
7.426SerThr: 7.426 ± 0.061
4.301SerVal: 4.301 ± 0.037
1.023SerTrp: 1.023 ± 0.015
2.24SerTyr: 2.24 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
4.278ThrAla: 4.278 ± 0.041
0.593ThrCys: 0.593 ± 0.013
2.61ThrAsp: 2.61 ± 0.028
2.916ThrGlu: 2.916 ± 0.029
2.232ThrPhe: 2.232 ± 0.027
3.77ThrGly: 3.77 ± 0.039
1.31ThrHis: 1.31 ± 0.02
3.657ThrIle: 3.657 ± 0.032
2.879ThrLys: 2.879 ± 0.028
5.351ThrLeu: 5.351 ± 0.04
1.004ThrMet: 1.004 ± 0.016
2.675ThrAsn: 2.675 ± 0.032
4.916ThrPro: 4.916 ± 0.055
2.057ThrGln: 2.057 ± 0.023
2.749ThrArg: 2.749 ± 0.028
6.947ThrSer: 6.947 ± 0.059
4.354ThrThr: 4.354 ± 0.05
3.017ThrVal: 3.017 ± 0.029
0.693ThrTrp: 0.693 ± 0.015
1.606ThrTyr: 1.606 ± 0.022
0.0ThrXaa: 0.0 ± 0.0
Val
3.726ValAla: 3.726 ± 0.031
0.569ValCys: 0.569 ± 0.012
3.253ValAsp: 3.253 ± 0.029
3.798ValGlu: 3.798 ± 0.046
1.901ValPhe: 1.901 ± 0.028
3.586ValGly: 3.586 ± 0.035
1.109ValHis: 1.109 ± 0.017
3.226ValIle: 3.226 ± 0.031
3.249ValLys: 3.249 ± 0.031
4.642ValLeu: 4.642 ± 0.04
1.093ValMet: 1.093 ± 0.019
2.35ValAsn: 2.35 ± 0.029
3.124ValPro: 3.124 ± 0.036
2.02ValGln: 2.02 ± 0.024
2.62ValArg: 2.62 ± 0.032
4.077ValSer: 4.077 ± 0.034
2.936ValThr: 2.936 ± 0.029
3.283ValVal: 3.283 ± 0.038
0.742ValTrp: 0.742 ± 0.014
1.412ValTyr: 1.412 ± 0.018
0.0ValXaa: 0.0 ± 0.0
Trp
0.881TrpAla: 0.881 ± 0.016
0.172TrpCys: 0.172 ± 0.006
0.818TrpAsp: 0.818 ± 0.015
0.892TrpGlu: 0.892 ± 0.016
0.482TrpPhe: 0.482 ± 0.012
0.873TrpGly: 0.873 ± 0.015
0.263TrpHis: 0.263 ± 0.007
0.802TrpIle: 0.802 ± 0.016
0.886TrpLys: 0.886 ± 0.016
1.175TrpLeu: 1.175 ± 0.017
0.319TrpMet: 0.319 ± 0.009
0.66TrpAsn: 0.66 ± 0.013
0.485TrpPro: 0.485 ± 0.012
0.474TrpGln: 0.474 ± 0.01
0.777TrpArg: 0.777 ± 0.017
1.072TrpSer: 1.072 ± 0.018
0.799TrpThr: 0.799 ± 0.015
0.733TrpVal: 0.733 ± 0.014
0.282TrpTrp: 0.282 ± 0.008
0.39TrpTyr: 0.39 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.801TyrAla: 1.801 ± 0.021
0.332TyrCys: 0.332 ± 0.011
1.61TyrAsp: 1.61 ± 0.022
1.525TyrGlu: 1.525 ± 0.022
1.068TyrPhe: 1.068 ± 0.016
1.883TyrGly: 1.883 ± 0.024
0.718TyrHis: 0.718 ± 0.013
1.458TyrIle: 1.458 ± 0.02
1.286TyrLys: 1.286 ± 0.019
2.563TyrLeu: 2.563 ± 0.026
0.528TyrMet: 0.528 ± 0.014
1.272TyrAsn: 1.272 ± 0.018
1.531TyrPro: 1.531 ± 0.023
1.08TyrGln: 1.08 ± 0.017
1.319TyrArg: 1.319 ± 0.02
2.15TyrSer: 2.15 ± 0.026
1.604TyrThr: 1.604 ± 0.022
1.371TyrVal: 1.371 ± 0.02
0.378TyrTrp: 0.378 ± 0.01
0.794TyrTyr: 0.794 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7827 proteins (4033902 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski