Amino acid dipepetide frequency for Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.771AlaAla: 4.771 ± 0.064
0.985AlaCys: 0.985 ± 0.02
2.902AlaAsp: 2.902 ± 0.031
3.712AlaGlu: 3.712 ± 0.047
2.93AlaPhe: 2.93 ± 0.038
3.121AlaGly: 3.121 ± 0.044
1.263AlaHis: 1.263 ± 0.026
4.027AlaIle: 4.027 ± 0.05
3.933AlaLys: 3.933 ± 0.042
6.291AlaLeu: 6.291 ± 0.056
1.419AlaMet: 1.419 ± 0.027
2.994AlaAsn: 2.994 ± 0.04
2.721AlaPro: 2.721 ± 0.049
2.156AlaGln: 2.156 ± 0.036
2.876AlaArg: 2.876 ± 0.033
6.011AlaSer: 6.011 ± 0.064
3.496AlaThr: 3.496 ± 0.045
3.893AlaVal: 3.893 ± 0.046
0.665AlaTrp: 0.665 ± 0.018
2.071AlaTyr: 2.071 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
0.786CysAla: 0.786 ± 0.019
0.31CysCys: 0.31 ± 0.011
0.7CysAsp: 0.7 ± 0.019
0.752CysGlu: 0.752 ± 0.018
0.82CysPhe: 0.82 ± 0.021
0.859CysGly: 0.859 ± 0.023
0.358CysHis: 0.358 ± 0.014
1.095CysIle: 1.095 ± 0.023
0.842CysLys: 0.842 ± 0.022
1.675CysLeu: 1.675 ± 0.029
0.329CysMet: 0.329 ± 0.012
0.658CysAsn: 0.658 ± 0.017
0.642CysPro: 0.642 ± 0.019
0.506CysGln: 0.506 ± 0.016
0.665CysArg: 0.665 ± 0.016
1.25CysSer: 1.25 ± 0.025
0.746CysThr: 0.746 ± 0.02
0.956CysVal: 0.956 ± 0.022
0.194CysTrp: 0.194 ± 0.01
0.536CysTyr: 0.536 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.455AspAla: 3.455 ± 0.037
0.733AspCys: 0.733 ± 0.015
3.4AspAsp: 3.4 ± 0.049
4.353AspGlu: 4.353 ± 0.057
2.707AspPhe: 2.707 ± 0.036
2.7AspGly: 2.7 ± 0.035
1.054AspHis: 1.054 ± 0.022
3.587AspIle: 3.587 ± 0.046
2.778AspLys: 2.778 ± 0.042
5.249AspLeu: 5.249 ± 0.046
1.155AspMet: 1.155 ± 0.023
2.466AspAsn: 2.466 ± 0.036
2.434AspPro: 2.434 ± 0.03
1.647AspGln: 1.647 ± 0.025
2.142AspArg: 2.142 ± 0.032
4.69AspSer: 4.69 ± 0.055
2.767AspThr: 2.767 ± 0.035
3.436AspVal: 3.436 ± 0.036
0.64AspTrp: 0.64 ± 0.016
1.994AspTyr: 1.994 ± 0.028
0.0AspXaa: 0.0 ± 0.0
Glu
4.132GluAla: 4.132 ± 0.047
0.767GluCys: 0.767 ± 0.019
3.84GluAsp: 3.84 ± 0.05
5.87GluGlu: 5.87 ± 0.086
2.652GluPhe: 2.652 ± 0.035
2.724GluGly: 2.724 ± 0.035
1.363GluHis: 1.363 ± 0.024
4.037GluIle: 4.037 ± 0.048
5.315GluLys: 5.315 ± 0.073
6.108GluLeu: 6.108 ± 0.067
1.415GluMet: 1.415 ± 0.03
4.033GluAsn: 4.033 ± 0.046
2.185GluPro: 2.185 ± 0.034
2.582GluGln: 2.582 ± 0.04
3.2GluArg: 3.2 ± 0.045
5.105GluSer: 5.105 ± 0.061
3.503GluThr: 3.503 ± 0.049
3.545GluVal: 3.545 ± 0.045
0.735GluTrp: 0.735 ± 0.02
2.116GluTyr: 2.116 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
2.749PheAla: 2.749 ± 0.04
0.789PheCys: 0.789 ± 0.018
2.635PheAsp: 2.635 ± 0.037
2.898PheGlu: 2.898 ± 0.036
2.335PhePhe: 2.335 ± 0.043
2.665PheGly: 2.665 ± 0.051
1.144PheHis: 1.144 ± 0.021
2.672PheIle: 2.672 ± 0.038
2.209PheLys: 2.209 ± 0.03
4.769PheLeu: 4.769 ± 0.054
0.939PheMet: 0.939 ± 0.022
2.115PheAsn: 2.115 ± 0.03
2.107PhePro: 2.107 ± 0.028
1.901PheGln: 1.901 ± 0.029
2.083PheArg: 2.083 ± 0.03
4.489PheSer: 4.489 ± 0.05
2.537PheThr: 2.537 ± 0.035
2.881PheVal: 2.881 ± 0.034
0.582PheTrp: 0.582 ± 0.016
1.659PheTyr: 1.659 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
2.861GlyAla: 2.861 ± 0.045
0.765GlyCys: 0.765 ± 0.021
2.434GlyAsp: 2.434 ± 0.035
2.545GlyGlu: 2.545 ± 0.04
2.623GlyPhe: 2.623 ± 0.035
2.862GlyGly: 2.862 ± 0.057
1.15GlyHis: 1.15 ± 0.024
3.585GlyIle: 3.585 ± 0.045
3.395GlyLys: 3.395 ± 0.04
4.662GlyLeu: 4.662 ± 0.052
1.13GlyMet: 1.13 ± 0.023
2.54GlyAsn: 2.54 ± 0.04
1.789GlyPro: 1.789 ± 0.029
1.508GlyGln: 1.508 ± 0.023
2.387GlyArg: 2.387 ± 0.038
4.406GlySer: 4.406 ± 0.05
2.955GlyThr: 2.955 ± 0.069
3.126GlyVal: 3.126 ± 0.042
0.658GlyTrp: 0.658 ± 0.018
1.909GlyTyr: 1.909 ± 0.031
0.0GlyXaa: 0.0 ± 0.0
His
1.292HisAla: 1.292 ± 0.027
0.369HisCys: 0.369 ± 0.015
1.109HisAsp: 1.109 ± 0.023
1.342HisGlu: 1.342 ± 0.024
1.068HisPhe: 1.068 ± 0.021
1.198HisGly: 1.198 ± 0.022
0.641HisHis: 0.641 ± 0.024
1.4HisIle: 1.4 ± 0.025
1.25HisLys: 1.25 ± 0.023
2.375HisLeu: 2.375 ± 0.037
0.474HisMet: 0.474 ± 0.014
1.014HisAsn: 1.014 ± 0.02
1.405HisPro: 1.405 ± 0.027
0.819HisGln: 0.819 ± 0.021
1.149HisArg: 1.149 ± 0.021
2.035HisSer: 2.035 ± 0.026
1.137HisThr: 1.137 ± 0.023
1.418HisVal: 1.418 ± 0.023
0.295HisTrp: 0.295 ± 0.01
0.81HisTyr: 0.81 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
3.917IleAla: 3.917 ± 0.049
1.077IleCys: 1.077 ± 0.024
3.621IleAsp: 3.621 ± 0.044
3.853IleGlu: 3.853 ± 0.041
2.799IlePhe: 2.799 ± 0.044
2.993IleGly: 2.993 ± 0.049
1.498IleHis: 1.498 ± 0.024
3.606IleIle: 3.606 ± 0.05
3.368IleLys: 3.368 ± 0.04
6.011IleLeu: 6.011 ± 0.06
1.204IleMet: 1.204 ± 0.022
2.999IleAsn: 2.999 ± 0.039
3.452IlePro: 3.452 ± 0.044
2.424IleGln: 2.424 ± 0.031
3.098IleArg: 3.098 ± 0.04
5.723IleSer: 5.723 ± 0.06
3.368IleThr: 3.368 ± 0.088
3.758IleVal: 3.758 ± 0.045
0.695IleTrp: 0.695 ± 0.02
2.112IleTyr: 2.112 ± 0.032
0.0IleXaa: 0.0 ± 0.0
Lys
3.881LysAla: 3.881 ± 0.04
0.814LysCys: 0.814 ± 0.019
3.503LysAsp: 3.503 ± 0.04
4.605LysGlu: 4.605 ± 0.059
2.517LysPhe: 2.517 ± 0.033
2.763LysGly: 2.763 ± 0.035
1.573LysHis: 1.573 ± 0.028
3.763LysIle: 3.763 ± 0.044
5.432LysLys: 5.432 ± 0.06
6.094LysLeu: 6.094 ± 0.062
1.247LysMet: 1.247 ± 0.021
3.728LysAsn: 3.728 ± 0.045
2.816LysPro: 2.816 ± 0.041
2.641LysGln: 2.641 ± 0.038
3.758LysArg: 3.758 ± 0.051
5.427LysSer: 5.427 ± 0.059
3.36LysThr: 3.36 ± 0.04
3.601LysVal: 3.601 ± 0.044
0.633LysTrp: 0.633 ± 0.018
2.175LysTyr: 2.175 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
6.292LeuAla: 6.292 ± 0.062
1.579LeuCys: 1.579 ± 0.027
5.331LeuAsp: 5.331 ± 0.056
6.551LeuGlu: 6.551 ± 0.068
4.445LeuPhe: 4.445 ± 0.054
4.709LeuGly: 4.709 ± 0.048
2.362LeuHis: 2.362 ± 0.035
5.323LeuIle: 5.323 ± 0.06
6.572LeuLys: 6.572 ± 0.064
10.174LeuLeu: 10.174 ± 0.113
1.913LeuMet: 1.913 ± 0.03
5.2LeuAsn: 5.2 ± 0.076
4.894LeuPro: 4.894 ± 0.054
4.463LeuGln: 4.463 ± 0.051
5.215LeuArg: 5.215 ± 0.049
9.106LeuSer: 9.106 ± 0.08
4.897LeuThr: 4.897 ± 0.049
5.395LeuVal: 5.395 ± 0.057
1.051LeuTrp: 1.051 ± 0.025
3.236LeuTyr: 3.236 ± 0.039
0.0LeuXaa: 0.0 ± 0.0
Met
1.462MetAla: 1.462 ± 0.023
0.264MetCys: 0.264 ± 0.012
1.292MetAsp: 1.292 ± 0.026
1.388MetGlu: 1.388 ± 0.024
0.906MetPhe: 0.906 ± 0.022
1.11MetGly: 1.11 ± 0.025
0.478MetHis: 0.478 ± 0.015
1.108MetIle: 1.108 ± 0.02
1.34MetLys: 1.34 ± 0.026
2.003MetLeu: 2.003 ± 0.033
0.424MetMet: 0.424 ± 0.012
1.143MetAsn: 1.143 ± 0.02
0.929MetPro: 0.929 ± 0.021
0.885MetGln: 0.885 ± 0.022
0.965MetArg: 0.965 ± 0.019
2.016MetSer: 2.016 ± 0.03
1.031MetThr: 1.031 ± 0.023
1.156MetVal: 1.156 ± 0.021
0.153MetTrp: 0.153 ± 0.008
0.619MetTyr: 0.619 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.353AsnAla: 3.353 ± 0.033
0.717AsnCys: 0.717 ± 0.02
2.935AsnAsp: 2.935 ± 0.035
3.565AsnGlu: 3.565 ± 0.045
2.28AsnPhe: 2.28 ± 0.037
2.821AsnGly: 2.821 ± 0.037
1.102AsnHis: 1.102 ± 0.024
3.297AsnIle: 3.297 ± 0.038
2.856AsnLys: 2.856 ± 0.04
4.822AsnLeu: 4.822 ± 0.053
1.087AsnMet: 1.087 ± 0.021
2.797AsnAsn: 2.797 ± 0.038
2.639AsnPro: 2.639 ± 0.037
1.838AsnGln: 1.838 ± 0.032
2.26AsnArg: 2.26 ± 0.029
4.887AsnSer: 4.887 ± 0.074
2.983AsnThr: 2.983 ± 0.051
3.45AsnVal: 3.45 ± 0.037
0.574AsnTrp: 0.574 ± 0.015
1.851AsnTyr: 1.851 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
2.741ProAla: 2.741 ± 0.045
0.538ProCys: 0.538 ± 0.018
2.227ProAsp: 2.227 ± 0.035
3.115ProGlu: 3.115 ± 0.045
2.223ProPhe: 2.223 ± 0.033
2.02ProGly: 2.02 ± 0.034
0.952ProHis: 0.952 ± 0.024
2.943ProIle: 2.943 ± 0.075
2.993ProLys: 2.993 ± 0.041
4.433ProLeu: 4.433 ± 0.044
0.896ProMet: 0.896 ± 0.024
2.51ProAsn: 2.51 ± 0.039
2.683ProPro: 2.683 ± 0.062
1.653ProGln: 1.653 ± 0.04
1.893ProArg: 1.893 ± 0.028
5.34ProSer: 5.34 ± 0.076
2.858ProThr: 2.858 ± 0.047
3.012ProVal: 3.012 ± 0.042
0.506ProTrp: 0.506 ± 0.016
1.594ProTyr: 1.594 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
2.329GlnAla: 2.329 ± 0.033
0.497GlnCys: 0.497 ± 0.016
1.737GlnAsp: 1.737 ± 0.029
2.449GlnGlu: 2.449 ± 0.034
1.589GlnPhe: 1.589 ± 0.027
1.624GlnGly: 1.624 ± 0.025
0.873GlnHis: 0.873 ± 0.018
2.318GlnIle: 2.318 ± 0.035
2.866GlnLys: 2.866 ± 0.033
4.072GlnLeu: 4.072 ± 0.051
0.837GlnMet: 0.837 ± 0.02
2.111GlnAsn: 2.111 ± 0.031
1.753GlnPro: 1.753 ± 0.043
1.976GlnGln: 1.976 ± 0.049
2.1GlnArg: 2.1 ± 0.035
3.245GlnSer: 3.245 ± 0.04
2.034GlnThr: 2.034 ± 0.03
2.095GlnVal: 2.095 ± 0.03
0.416GlnTrp: 0.416 ± 0.012
1.266GlnTyr: 1.266 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
2.812ArgAla: 2.812 ± 0.04
0.702ArgCys: 0.702 ± 0.018
2.372ArgAsp: 2.372 ± 0.034
3.071ArgGlu: 3.071 ± 0.046
2.285ArgPhe: 2.285 ± 0.031
2.19ArgGly: 2.19 ± 0.035
1.115ArgHis: 1.115 ± 0.023
3.108ArgIle: 3.108 ± 0.038
3.796ArgLys: 3.796 ± 0.048
4.966ArgLeu: 4.966 ± 0.052
1.046ArgMet: 1.046 ± 0.022
2.571ArgAsn: 2.571 ± 0.036
2.054ArgPro: 2.054 ± 0.029
1.934ArgGln: 1.934 ± 0.031
3.184ArgArg: 3.184 ± 0.044
3.944ArgSer: 3.944 ± 0.046
2.371ArgThr: 2.371 ± 0.029
2.849ArgVal: 2.849 ± 0.036
0.597ArgTrp: 0.597 ± 0.014
1.705ArgTyr: 1.705 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
5.349SerAla: 5.349 ± 0.058
1.204SerCys: 1.204 ± 0.024
4.512SerAsp: 4.512 ± 0.053
5.325SerGlu: 5.325 ± 0.067
4.521SerPhe: 4.521 ± 0.049
4.358SerGly: 4.358 ± 0.064
2.01SerHis: 2.01 ± 0.032
5.84SerIle: 5.84 ± 0.053
5.998SerLys: 5.998 ± 0.059
9.317SerLeu: 9.317 ± 0.079
1.866SerMet: 1.866 ± 0.03
5.044SerAsn: 5.044 ± 0.064
4.472SerPro: 4.472 ± 0.065
3.457SerGln: 3.457 ± 0.044
4.203SerArg: 4.203 ± 0.052
11.511SerSer: 11.511 ± 0.25
5.934SerThr: 5.934 ± 0.142
5.5SerVal: 5.5 ± 0.063
0.959SerTrp: 0.959 ± 0.021
2.873SerTyr: 2.873 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
3.502ThrAla: 3.502 ± 0.053
0.774ThrCys: 0.774 ± 0.019
2.67ThrAsp: 2.67 ± 0.036
3.164ThrGlu: 3.164 ± 0.038
2.504ThrPhe: 2.504 ± 0.039
2.986ThrGly: 2.986 ± 0.06
1.178ThrHis: 1.178 ± 0.023
3.493ThrIle: 3.493 ± 0.047
3.212ThrLys: 3.212 ± 0.038
5.229ThrLeu: 5.229 ± 0.054
1.037ThrMet: 1.037 ± 0.02
2.807ThrAsn: 2.807 ± 0.041
3.19ThrPro: 3.19 ± 0.078
1.782ThrGln: 1.782 ± 0.03
2.355ThrArg: 2.355 ± 0.032
5.691ThrSer: 5.691 ± 0.131
3.545ThrThr: 3.545 ± 0.114
3.64ThrVal: 3.64 ± 0.076
0.584ThrTrp: 0.584 ± 0.017
1.712ThrTyr: 1.712 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
3.824ValAla: 3.824 ± 0.045
1.014ValCys: 1.014 ± 0.021
3.529ValAsp: 3.529 ± 0.04
3.935ValGlu: 3.935 ± 0.053
2.81ValPhe: 2.81 ± 0.041
3.099ValGly: 3.099 ± 0.046
1.371ValHis: 1.371 ± 0.022
3.534ValIle: 3.534 ± 0.043
3.502ValLys: 3.502 ± 0.043
5.922ValLeu: 5.922 ± 0.058
1.206ValMet: 1.206 ± 0.023
3.004ValAsn: 3.004 ± 0.041
2.982ValPro: 2.982 ± 0.046
2.348ValGln: 2.348 ± 0.029
2.828ValArg: 2.828 ± 0.033
5.479ValSer: 5.479 ± 0.062
3.131ValThr: 3.131 ± 0.055
3.929ValVal: 3.929 ± 0.06
0.676ValTrp: 0.676 ± 0.016
2.226ValTyr: 2.226 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
0.598TrpAla: 0.598 ± 0.014
0.198TrpCys: 0.198 ± 0.009
0.685TrpAsp: 0.685 ± 0.02
0.579TrpGlu: 0.579 ± 0.017
0.47TrpPhe: 0.47 ± 0.015
0.533TrpGly: 0.533 ± 0.015
0.265TrpHis: 0.265 ± 0.012
0.772TrpIle: 0.772 ± 0.019
0.863TrpLys: 0.863 ± 0.021
1.066TrpLeu: 1.066 ± 0.024
0.289TrpMet: 0.289 ± 0.011
0.72TrpAsn: 0.72 ± 0.019
0.379TrpPro: 0.379 ± 0.013
0.411TrpGln: 0.411 ± 0.016
0.615TrpArg: 0.615 ± 0.017
0.949TrpSer: 0.949 ± 0.02
0.622TrpThr: 0.622 ± 0.017
0.617TrpVal: 0.617 ± 0.013
0.164TrpTrp: 0.164 ± 0.009
0.391TrpTyr: 0.391 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.131TyrAla: 2.131 ± 0.031
0.585TyrCys: 0.585 ± 0.014
1.955TyrAsp: 1.955 ± 0.029
2.153TyrGlu: 2.153 ± 0.032
1.673TyrPhe: 1.673 ± 0.033
1.957TyrGly: 1.957 ± 0.035
0.847TyrHis: 0.847 ± 0.018
2.066TyrIle: 2.066 ± 0.027
1.811TyrLys: 1.811 ± 0.027
3.473TyrLeu: 3.473 ± 0.039
0.758TyrMet: 0.758 ± 0.017
1.663TyrAsn: 1.663 ± 0.029
1.604TyrPro: 1.604 ± 0.028
1.255TyrGln: 1.255 ± 0.023
1.697TyrArg: 1.697 ± 0.03
2.949TyrSer: 2.949 ± 0.039
1.781TyrThr: 1.781 ± 0.033
2.09TyrVal: 2.09 ± 0.032
0.41TyrTrp: 0.41 ± 0.014
1.33TyrTyr: 1.33 ± 0.028
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5151 proteins (2391875 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski