Amino acid dipeptide frequency for Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545) (Yeast) (Pichia stipitis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
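As an illustration, a minimal sketch of such a calculation is given below. It assumes one-letter sequences and counts overlapping residue pairs within each protein; the function name `dipeptide_permille` and the toy sequences are hypothetical, and the exact counting procedure used for this page is not stated here.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping pairs are counted inside each protein,
    and a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKKLA", "MKA"])
for dipeptide, value in sorted(freqs.items()):
    print(f"{dipeptide}: {value:.1f}")
```

The frequencies of all observed dipeptides sum to 1000‰ by construction.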

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
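The expected value and the over/under comparison can be sketched as follows. The four observed values are copied from the table on this page; the `classify` helper is hypothetical, and using the uniform 1/400 expectation as the threshold is an assumption based on the paragraph above.

```python
N_STANDARD = 20

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st residue: 1/441, roughly 2.27 per mille.
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2

# A few observed values (per mille) taken from the table.
observed = {"SerSer": 10.914, "LeuLeu": 8.941, "AlaPro": 2.602, "TrpTrp": 0.171}


def classify(value, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {dp: classify(v) for dp, v in observed.items()}
print(expected_permille, labels)
```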

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
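A short worked conversion, using the SerSer value and the proteome totals reported on this page. The estimate of the total number of dipeptides (amino acids minus proteins) assumes overlapping pairs counted within each protein, which is an assumption rather than something stated here.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ser_ser_permille = 10.914               # SerSer entry from the table
fraction = permille_to_fraction(ser_ser_permille)
percent = fraction * 100                # about 1.09 %

# Totals from the statistics line of this page.
n_proteins = 5797
n_amino_acids = 2852044

# Assumption: a protein of length L contributes L - 1 overlapping dipeptides.
total_pairs = n_amino_acids - n_proteins
approx_ser_ser_count = round(fraction * total_pairs)
print(fraction, percent, total_pairs, approx_ser_ser_count)
```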

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.639 ± 0.07
AlaCys: 0.624 ± 0.015
AlaAsp: 2.975 ± 0.038
AlaGlu: 3.561 ± 0.048
AlaPhe: 2.411 ± 0.032
AlaGly: 3.276 ± 0.044
AlaHis: 1.136 ± 0.023
AlaIle: 4.069 ± 0.047
AlaLys: 4.058 ± 0.045
AlaLeu: 5.28 ± 0.047
AlaMet: 1.107 ± 0.021
AlaAsn: 3.276 ± 0.035
AlaPro: 2.602 ± 0.04
AlaGln: 2.083 ± 0.032
AlaArg: 2.516 ± 0.033
AlaSer: 5.439 ± 0.054
AlaThr: 3.698 ± 0.043
AlaVal: 3.711 ± 0.039
AlaTrp: 0.511 ± 0.014
AlaTyr: 1.829 ± 0.025
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.54 ± 0.015
CysCys: 0.212 ± 0.009
CysAsp: 0.57 ± 0.016
CysGlu: 0.515 ± 0.016
CysPhe: 0.588 ± 0.015
CysGly: 0.769 ± 0.018
CysHis: 0.265 ± 0.009
CysIle: 0.789 ± 0.017
CysLys: 0.599 ± 0.017
CysLeu: 1.093 ± 0.02
CysMet: 0.207 ± 0.008
CysAsn: 0.539 ± 0.014
CysPro: 0.45 ± 0.014
CysGln: 0.33 ± 0.012
CysArg: 0.414 ± 0.012
CysSer: 0.876 ± 0.022
CysThr: 0.521 ± 0.015
CysVal: 0.653 ± 0.017
CysTrp: 0.134 ± 0.007
CysTyr: 0.406 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.026 ± 0.041
AspCys: 0.556 ± 0.014
AspAsp: 4.717 ± 0.065
AspGlu: 5.21 ± 0.056
AspPhe: 2.907 ± 0.036
AspGly: 2.865 ± 0.047
AspHis: 1.161 ± 0.022
AspIle: 4.252 ± 0.045
AspLys: 3.631 ± 0.038
AspLeu: 5.79 ± 0.047
AspMet: 1.033 ± 0.017
AspAsn: 3.07 ± 0.029
AspPro: 2.493 ± 0.035
AspGln: 1.779 ± 0.025
AspArg: 2.088 ± 0.028
AspSer: 5.317 ± 0.06
AspThr: 2.831 ± 0.036
AspVal: 3.57 ± 0.035
AspTrp: 0.61 ± 0.015
AspTyr: 2.366 ± 0.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.774 ± 0.049
GluCys: 0.552 ± 0.014
GluAsp: 4.792 ± 0.046
GluGlu: 6.359 ± 0.091
GluPhe: 2.828 ± 0.031
GluGly: 2.753 ± 0.03
GluHis: 1.203 ± 0.022
GluIle: 4.465 ± 0.043
GluLys: 4.962 ± 0.05
GluLeu: 6.505 ± 0.062
GluMet: 1.219 ± 0.02
GluAsn: 3.856 ± 0.04
GluPro: 2.117 ± 0.035
GluGln: 2.504 ± 0.032
GluArg: 2.7 ± 0.037
GluSer: 5.344 ± 0.049
GluThr: 3.534 ± 0.053
GluVal: 3.992 ± 0.047
GluTrp: 0.633 ± 0.016
GluTyr: 2.31 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.616 ± 0.032
PheCys: 0.526 ± 0.013
PheAsp: 2.848 ± 0.032
PheGlu: 2.863 ± 0.031
PhePhe: 2.148 ± 0.037
PheGly: 2.892 ± 0.046
PheHis: 1.032 ± 0.019
PheIle: 2.924 ± 0.038
PheLys: 3.081 ± 0.034
PheLeu: 4.174 ± 0.048
PheMet: 0.857 ± 0.018
PheAsn: 2.766 ± 0.031
PhePro: 1.826 ± 0.025
PheGln: 1.797 ± 0.028
PheArg: 1.756 ± 0.028
PheSer: 3.832 ± 0.035
PheThr: 2.485 ± 0.036
PheVal: 3.007 ± 0.038
PheTrp: 0.542 ± 0.014
PheTyr: 1.64 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.304 ± 0.05
GlyCys: 0.651 ± 0.017
GlyAsp: 2.855 ± 0.035
GlyGlu: 3.011 ± 0.037
GlyPhe: 2.659 ± 0.036
GlyGly: 3.602 ± 0.056
GlyHis: 1.114 ± 0.021
GlyIle: 3.602 ± 0.046
GlyLys: 3.489 ± 0.037
GlyLeu: 4.868 ± 0.043
GlyMet: 0.917 ± 0.018
GlyAsn: 2.765 ± 0.036
GlyPro: 1.739 ± 0.029
GlyGln: 1.64 ± 0.027
GlyArg: 2.165 ± 0.037
GlySer: 5.087 ± 0.08
GlyThr: 2.999 ± 0.039
GlyVal: 3.336 ± 0.046
GlyTrp: 0.644 ± 0.015
GlyTyr: 2.08 ± 0.03
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.975 ± 0.022
HisCys: 0.253 ± 0.01
HisAsp: 1.195 ± 0.019
HisGlu: 1.276 ± 0.022
HisPhe: 0.998 ± 0.02
HisGly: 1.13 ± 0.022
HisHis: 0.725 ± 0.021
HisIle: 1.419 ± 0.025
HisLys: 1.344 ± 0.025
HisLeu: 2.066 ± 0.031
HisMet: 0.354 ± 0.011
HisAsn: 1.231 ± 0.021
HisPro: 1.131 ± 0.023
HisGln: 0.949 ± 0.023
HisArg: 0.986 ± 0.02
HisSer: 1.92 ± 0.028
HisThr: 1.075 ± 0.02
HisVal: 1.084 ± 0.019
HisTrp: 0.221 ± 0.007
HisTyr: 0.832 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.071 ± 0.042
IleCys: 0.76 ± 0.018
IleAsp: 4.355 ± 0.044
IleGlu: 4.253 ± 0.041
IlePhe: 2.874 ± 0.045
IleGly: 3.486 ± 0.04
IleHis: 1.46 ± 0.024
IleIle: 4.223 ± 0.047
IleLys: 4.291 ± 0.042
IleLeu: 6.078 ± 0.061
IleMet: 1.15 ± 0.021
IleAsn: 3.964 ± 0.039
IlePro: 3.303 ± 0.039
IleGln: 2.442 ± 0.033
IleArg: 2.843 ± 0.037
IleSer: 5.999 ± 0.052
IleThr: 3.715 ± 0.042
IleVal: 4.246 ± 0.046
IleTrp: 0.694 ± 0.016
IleTyr: 2.198 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.47 ± 0.043
LysCys: 0.643 ± 0.015
LysAsp: 3.808 ± 0.044
LysGlu: 4.823 ± 0.046
LysPhe: 3.222 ± 0.039
LysGly: 2.772 ± 0.035
LysHis: 1.479 ± 0.025
LysIle: 4.419 ± 0.045
LysLys: 5.876 ± 0.067
LysLeu: 7.069 ± 0.057
LysMet: 1.219 ± 0.02
LysAsn: 3.833 ± 0.043
LysPro: 2.765 ± 0.033
LysGln: 2.677 ± 0.032
LysArg: 3.341 ± 0.038
LysSer: 5.921 ± 0.057
LysThr: 3.654 ± 0.044
LysVal: 4.069 ± 0.041
LysTrp: 0.721 ± 0.016
LysTyr: 2.77 ± 0.033
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.923 ± 0.051
LeuCys: 1.044 ± 0.021
LeuAsp: 5.363 ± 0.05
LeuGlu: 5.787 ± 0.056
LeuPhe: 4.088 ± 0.044
LeuGly: 4.79 ± 0.05
LeuHis: 2.036 ± 0.032
LeuIle: 6.045 ± 0.059
LeuLys: 6.616 ± 0.062
LeuLeu: 8.941 ± 0.078
LeuMet: 1.722 ± 0.025
LeuAsn: 5.517 ± 0.049
LeuPro: 4.535 ± 0.039
LeuGln: 3.881 ± 0.041
LeuArg: 4.168 ± 0.046
LeuSer: 8.618 ± 0.072
LeuThr: 5.194 ± 0.051
LeuVal: 5.996 ± 0.055
LeuTrp: 0.856 ± 0.019
LeuTyr: 3.108 ± 0.035
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.277 ± 0.023
MetCys: 0.204 ± 0.009
MetAsp: 1.033 ± 0.02
MetGlu: 1.037 ± 0.023
MetPhe: 0.782 ± 0.017
MetGly: 1.005 ± 0.02
MetHis: 0.273 ± 0.009
MetIle: 1.142 ± 0.019
MetLys: 1.276 ± 0.022
MetLeu: 1.54 ± 0.026
MetMet: 0.407 ± 0.013
MetAsn: 1.065 ± 0.023
MetPro: 0.685 ± 0.018
MetGln: 0.521 ± 0.014
MetArg: 0.707 ± 0.014
MetSer: 1.982 ± 0.025
MetThr: 1.042 ± 0.022
MetVal: 1.083 ± 0.021
MetTrp: 0.158 ± 0.007
MetTyr: 0.576 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.934 ± 0.037
AsnCys: 0.604 ± 0.017
AsnAsp: 3.598 ± 0.041
AsnGlu: 3.827 ± 0.043
AsnPhe: 2.681 ± 0.032
AsnGly: 3.263 ± 0.043
AsnHis: 1.231 ± 0.022
AsnIle: 3.733 ± 0.036
AsnLys: 3.662 ± 0.043
AsnLeu: 5.357 ± 0.057
AsnMet: 0.959 ± 0.019
AsnAsn: 3.755 ± 0.053
AsnPro: 2.549 ± 0.031
AsnGln: 2.136 ± 0.037
AsnArg: 2.173 ± 0.033
AsnSer: 5.783 ± 0.066
AsnThr: 3.041 ± 0.044
AsnVal: 3.206 ± 0.041
AsnTrp: 0.675 ± 0.015
AsnTyr: 2.307 ± 0.027
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.555 ± 0.035
ProCys: 0.337 ± 0.012
ProAsp: 2.236 ± 0.03
ProGlu: 3.189 ± 0.043
ProPhe: 1.877 ± 0.027
ProGly: 2.082 ± 0.033
ProHis: 0.905 ± 0.019
ProIle: 2.88 ± 0.036
ProLys: 2.828 ± 0.034
ProLeu: 3.708 ± 0.035
ProMet: 0.695 ± 0.017
ProAsn: 2.386 ± 0.031
ProPro: 2.621 ± 0.08
ProGln: 1.913 ± 0.039
ProArg: 1.713 ± 0.025
ProSer: 4.224 ± 0.056
ProThr: 2.891 ± 0.052
ProVal: 2.991 ± 0.035
ProTrp: 0.413 ± 0.014
ProTyr: 1.421 ± 0.022
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.071 ± 0.031
GlnCys: 0.352 ± 0.011
GlnAsp: 1.93 ± 0.031
GlnGlu: 2.529 ± 0.034
GlnPhe: 1.816 ± 0.031
GlnGly: 1.568 ± 0.026
GlnHis: 0.899 ± 0.02
GlnIle: 2.505 ± 0.034
GlnLys: 2.469 ± 0.033
GlnLeu: 3.979 ± 0.041
GlnMet: 0.753 ± 0.017
GlnAsn: 2.032 ± 0.03
GlnPro: 1.668 ± 0.038
GlnGln: 2.838 ± 0.116
GlnArg: 1.667 ± 0.029
GlnSer: 3.063 ± 0.036
GlnThr: 2.007 ± 0.028
GlnVal: 2.167 ± 0.027
GlnTrp: 0.387 ± 0.012
GlnTyr: 1.398 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.571 ± 0.034
ArgCys: 0.442 ± 0.014
ArgAsp: 2.379 ± 0.029
ArgGlu: 2.645 ± 0.033
ArgPhe: 2.018 ± 0.026
ArgGly: 2.085 ± 0.032
ArgHis: 0.932 ± 0.019
ArgIle: 2.853 ± 0.035
ArgLys: 3.298 ± 0.042
ArgLeu: 4.026 ± 0.04
ArgMet: 0.789 ± 0.017
ArgAsn: 2.428 ± 0.033
ArgPro: 1.618 ± 0.028
ArgGln: 1.658 ± 0.026
ArgArg: 2.585 ± 0.034
ArgSer: 3.602 ± 0.045
ArgThr: 2.223 ± 0.033
ArgVal: 2.384 ± 0.027
ArgTrp: 0.442 ± 0.013
ArgTyr: 1.463 ± 0.023
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.181 ± 0.057
SerCys: 0.775 ± 0.019
SerAsp: 4.931 ± 0.052
SerGlu: 5.315 ± 0.053
SerPhe: 4.073 ± 0.04
SerGly: 4.931 ± 0.062
SerHis: 1.933 ± 0.026
SerIle: 6.365 ± 0.056
SerLys: 6.357 ± 0.055
SerLeu: 8.407 ± 0.059
SerMet: 1.591 ± 0.024
SerAsn: 5.603 ± 0.064
SerPro: 4.031 ± 0.058
SerGln: 3.419 ± 0.042
SerArg: 3.907 ± 0.04
SerSer: 10.914 ± 0.165
SerThr: 6.165 ± 0.074
SerVal: 5.285 ± 0.052
SerTrp: 0.851 ± 0.025
SerTyr: 2.949 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.509 ± 0.036
ThrCys: 0.57 ± 0.017
ThrAsp: 2.793 ± 0.037
ThrGlu: 3.304 ± 0.062
ThrPhe: 2.499 ± 0.03
ThrGly: 3.286 ± 0.052
ThrHis: 1.104 ± 0.02
ThrIle: 3.976 ± 0.051
ThrLys: 3.71 ± 0.04
ThrLeu: 5.038 ± 0.051
ThrMet: 0.907 ± 0.017
ThrAsn: 3.272 ± 0.037
ThrPro: 3.121 ± 0.049
ThrGln: 1.732 ± 0.027
ThrArg: 2.287 ± 0.031
ThrSer: 5.82 ± 0.078
ThrThr: 4.062 ± 0.128
ThrVal: 3.512 ± 0.041
ThrTrp: 0.562 ± 0.017
ThrTyr: 1.804 ± 0.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.993 ± 0.043
ValCys: 0.679 ± 0.015
ValAsp: 3.98 ± 0.04
ValGlu: 4.164 ± 0.042
ValPhe: 2.778 ± 0.039
ValGly: 3.373 ± 0.04
ValHis: 1.236 ± 0.021
ValIle: 3.865 ± 0.044
ValLys: 3.958 ± 0.044
ValLeu: 5.671 ± 0.052
ValMet: 1.076 ± 0.019
ValAsn: 3.302 ± 0.034
ValPro: 2.895 ± 0.038
ValGln: 2.081 ± 0.026
ValArg: 2.425 ± 0.034
ValSer: 5.428 ± 0.051
ValThr: 3.3 ± 0.037
ValVal: 4.339 ± 0.051
ValTrp: 0.623 ± 0.016
ValTyr: 2.1 ± 0.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.558 ± 0.013
TrpCys: 0.191 ± 0.008
TrpAsp: 0.64 ± 0.017
TrpGlu: 0.557 ± 0.014
TrpPhe: 0.567 ± 0.014
TrpGly: 0.557 ± 0.015
TrpHis: 0.212 ± 0.008
TrpIle: 0.673 ± 0.017
TrpLys: 0.804 ± 0.018
TrpLeu: 1.009 ± 0.021
TrpMet: 0.213 ± 0.009
TrpAsn: 0.62 ± 0.015
TrpPro: 0.301 ± 0.01
TrpGln: 0.311 ± 0.011
TrpArg: 0.491 ± 0.012
TrpSer: 0.831 ± 0.02
TrpThr: 0.563 ± 0.019
TrpVal: 0.585 ± 0.015
TrpTrp: 0.171 ± 0.009
TrpTyr: 0.423 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.814 ± 0.027
TyrCys: 0.495 ± 0.015
TyrAsp: 2.275 ± 0.031
TyrGlu: 2.178 ± 0.029
TyrPhe: 1.8 ± 0.026
TyrGly: 2.042 ± 0.032
TyrHis: 0.827 ± 0.019
TyrIle: 2.204 ± 0.027
TyrLys: 2.239 ± 0.027
TyrLeu: 3.629 ± 0.043
TyrMet: 0.619 ± 0.016
TyrAsn: 2.193 ± 0.03
TyrPro: 1.416 ± 0.023
TyrGln: 1.406 ± 0.028
TyrArg: 1.511 ± 0.021
TyrSer: 2.981 ± 0.031
TyrThr: 1.868 ± 0.043
TyrVal: 2.052 ± 0.028
TyrTrp: 0.428 ± 0.014
TyrTyr: 1.534 ± 0.027
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5797 proteins (2852044 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
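A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, and the use of the standard deviation as the error measure are assumptions, since the exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
proteins = ["MKKLASS", "MSSLLK", "MAKSSL", "MLLKKA"]
err = bootstrap_error(proteins, "SS")
print(f"SS: {dipeptide_permille(proteins, 'SS'):.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps the pairs inside each sequence intact, so the estimate reflects variation between proteins.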

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski