Amino acid dipepetide frequency for Aspergillus heteromorphus CBS 117.55

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.932AlaAla: 8.932 ± 0.066
1.141AlaCys: 1.141 ± 0.017
4.31AlaAsp: 4.31 ± 0.028
5.028AlaGlu: 5.028 ± 0.048
3.186AlaPhe: 3.186 ± 0.031
5.998AlaGly: 5.998 ± 0.045
1.893AlaHis: 1.893 ± 0.022
4.158AlaIle: 4.158 ± 0.027
3.531AlaLys: 3.531 ± 0.034
7.901AlaLeu: 7.901 ± 0.041
1.986AlaMet: 1.986 ± 0.021
2.721AlaAsn: 2.721 ± 0.026
4.915AlaPro: 4.915 ± 0.043
3.372AlaGln: 3.372 ± 0.033
5.15AlaArg: 5.15 ± 0.037
7.417AlaSer: 7.417 ± 0.047
5.324AlaThr: 5.324 ± 0.038
5.558AlaVal: 5.558 ± 0.041
1.197AlaTrp: 1.197 ± 0.017
2.202AlaTyr: 2.202 ± 0.023
0.0AlaXaa: 0.0 ± 0.0
Cys
0.967CysAla: 0.967 ± 0.013
0.293CysCys: 0.293 ± 0.01
0.675CysAsp: 0.675 ± 0.012
0.609CysGlu: 0.609 ± 0.011
0.561CysPhe: 0.561 ± 0.011
0.952CysGly: 0.952 ± 0.019
0.368CysHis: 0.368 ± 0.008
0.703CysIle: 0.703 ± 0.013
0.441CysLys: 0.441 ± 0.01
1.409CysLeu: 1.409 ± 0.018
0.289CysMet: 0.289 ± 0.008
0.408CysAsn: 0.408 ± 0.008
0.721CysPro: 0.721 ± 0.014
0.466CysGln: 0.466 ± 0.01
0.854CysArg: 0.854 ± 0.015
0.969CysSer: 0.969 ± 0.015
0.703CysThr: 0.703 ± 0.013
0.847CysVal: 0.847 ± 0.015
0.209CysTrp: 0.209 ± 0.007
0.363CysTyr: 0.363 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.719AspAla: 4.719 ± 0.032
0.619AspCys: 0.619 ± 0.012
4.078AspAsp: 4.078 ± 0.047
4.258AspGlu: 4.258 ± 0.041
2.111AspPhe: 2.111 ± 0.021
4.048AspGly: 4.048 ± 0.032
1.32AspHis: 1.32 ± 0.016
2.943AspIle: 2.943 ± 0.026
2.025AspLys: 2.025 ± 0.026
5.198AspLeu: 5.198 ± 0.041
1.246AspMet: 1.246 ± 0.016
1.751AspAsn: 1.751 ± 0.021
3.533AspPro: 3.533 ± 0.031
1.905AspGln: 1.905 ± 0.022
3.266AspArg: 3.266 ± 0.027
4.094AspSer: 4.094 ± 0.033
2.95AspThr: 2.95 ± 0.024
3.663AspVal: 3.663 ± 0.026
0.898AspTrp: 0.898 ± 0.015
1.621AspTyr: 1.621 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
5.146GluAla: 5.146 ± 0.046
0.614GluCys: 0.614 ± 0.012
4.035GluAsp: 4.035 ± 0.035
5.432GluGlu: 5.432 ± 0.066
1.884GluPhe: 1.884 ± 0.023
3.88GluGly: 3.88 ± 0.031
1.32GluHis: 1.32 ± 0.019
2.923GluIle: 2.923 ± 0.028
3.332GluLys: 3.332 ± 0.036
4.916GluLeu: 4.916 ± 0.038
1.461GluMet: 1.461 ± 0.016
2.083GluAsn: 2.083 ± 0.023
2.818GluPro: 2.818 ± 0.041
2.387GluGln: 2.387 ± 0.028
3.95GluArg: 3.95 ± 0.034
4.272GluSer: 4.272 ± 0.035
3.537GluThr: 3.537 ± 0.029
3.502GluVal: 3.502 ± 0.032
0.886GluTrp: 0.886 ± 0.016
1.713GluTyr: 1.713 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.0PheAla: 3.0 ± 0.023
0.592PheCys: 0.592 ± 0.012
2.244PheAsp: 2.244 ± 0.022
1.99PheGlu: 1.99 ± 0.02
1.703PhePhe: 1.703 ± 0.028
2.807PheGly: 2.807 ± 0.031
0.994PheHis: 0.994 ± 0.016
1.736PheIle: 1.736 ± 0.019
1.285PheLys: 1.285 ± 0.019
3.719PheLeu: 3.719 ± 0.035
0.775PheMet: 0.775 ± 0.012
1.326PheAsn: 1.326 ± 0.017
2.096PhePro: 2.096 ± 0.023
1.419PheGln: 1.419 ± 0.016
2.054PheArg: 2.054 ± 0.023
3.062PheSer: 3.062 ± 0.029
2.162PheThr: 2.162 ± 0.026
2.431PheVal: 2.431 ± 0.026
0.639PheTrp: 0.639 ± 0.011
1.146PheTyr: 1.146 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
5.356GlyAla: 5.356 ± 0.043
0.952GlyCys: 0.952 ± 0.013
3.651GlyAsp: 3.651 ± 0.032
3.768GlyGlu: 3.768 ± 0.034
2.826GlyPhe: 2.826 ± 0.031
5.832GlyGly: 5.832 ± 0.06
1.702GlyHis: 1.702 ± 0.021
3.42GlyIle: 3.42 ± 0.029
3.114GlyLys: 3.114 ± 0.029
6.352GlyLeu: 6.352 ± 0.043
1.627GlyMet: 1.627 ± 0.021
2.343GlyAsn: 2.343 ± 0.023
3.465GlyPro: 3.465 ± 0.036
2.547GlyGln: 2.547 ± 0.028
4.309GlyArg: 4.309 ± 0.034
5.883GlySer: 5.883 ± 0.053
3.975GlyThr: 3.975 ± 0.031
4.611GlyVal: 4.611 ± 0.036
1.245GlyTrp: 1.245 ± 0.017
2.175GlyTyr: 2.175 ± 0.027
0.0GlyXaa: 0.0 ± 0.0
His
1.978HisAla: 1.978 ± 0.022
0.349HisCys: 0.349 ± 0.01
1.348HisAsp: 1.348 ± 0.018
1.33HisGlu: 1.33 ± 0.017
0.932HisPhe: 0.932 ± 0.015
1.761HisGly: 1.761 ± 0.02
1.047HisHis: 1.047 ± 0.021
1.232HisIle: 1.232 ± 0.016
0.772HisLys: 0.772 ± 0.012
2.413HisLeu: 2.413 ± 0.024
0.504HisMet: 0.504 ± 0.01
0.831HisAsn: 0.831 ± 0.015
2.012HisPro: 2.012 ± 0.025
1.018HisGln: 1.018 ± 0.015
1.728HisArg: 1.728 ± 0.019
1.914HisSer: 1.914 ± 0.019
1.373HisThr: 1.373 ± 0.018
1.454HisVal: 1.454 ± 0.018
0.36HisTrp: 0.36 ± 0.009
0.71HisTyr: 0.71 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
4.087IleAla: 4.087 ± 0.033
0.768IleCys: 0.768 ± 0.014
2.715IleAsp: 2.715 ± 0.024
2.622IleGlu: 2.622 ± 0.027
1.971IlePhe: 1.971 ± 0.024
3.078IleGly: 3.078 ± 0.03
1.314IleHis: 1.314 ± 0.017
2.452IleIle: 2.452 ± 0.027
1.859IleLys: 1.859 ± 0.021
4.627IleLeu: 4.627 ± 0.037
0.996IleMet: 0.996 ± 0.015
1.68IleAsn: 1.68 ± 0.02
3.239IlePro: 3.239 ± 0.029
1.875IleGln: 1.875 ± 0.018
2.787IleArg: 2.787 ± 0.023
3.771IleSer: 3.771 ± 0.031
2.804IleThr: 2.804 ± 0.026
3.076IleVal: 3.076 ± 0.028
0.686IleTrp: 0.686 ± 0.014
1.441IleTyr: 1.441 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
3.755LysAla: 3.755 ± 0.033
0.445LysCys: 0.445 ± 0.01
2.398LysAsp: 2.398 ± 0.023
2.978LysGlu: 2.978 ± 0.036
1.253LysPhe: 1.253 ± 0.017
2.689LysGly: 2.689 ± 0.021
1.002LysHis: 1.002 ± 0.015
1.96LysIle: 1.96 ± 0.022
2.847LysLys: 2.847 ± 0.041
3.572LysLeu: 3.572 ± 0.032
0.897LysMet: 0.897 ± 0.012
1.482LysAsn: 1.482 ± 0.019
2.523LysPro: 2.523 ± 0.027
1.667LysGln: 1.667 ± 0.021
3.131LysArg: 3.131 ± 0.031
3.105LysSer: 3.105 ± 0.032
2.505LysThr: 2.505 ± 0.023
2.506LysVal: 2.506 ± 0.024
0.581LysTrp: 0.581 ± 0.011
1.219LysTyr: 1.219 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
8.006LeuAla: 8.006 ± 0.05
1.278LeuCys: 1.278 ± 0.021
5.269LeuAsp: 5.269 ± 0.037
5.368LeuGlu: 5.368 ± 0.041
3.489LeuPhe: 3.489 ± 0.034
6.116LeuGly: 6.116 ± 0.038
2.393LeuHis: 2.393 ± 0.023
4.003LeuIle: 4.003 ± 0.033
3.796LeuLys: 3.796 ± 0.036
8.875LeuLeu: 8.875 ± 0.065
1.786LeuMet: 1.786 ± 0.019
3.069LeuAsn: 3.069 ± 0.026
5.782LeuPro: 5.782 ± 0.044
3.891LeuGln: 3.891 ± 0.037
6.016LeuArg: 6.016 ± 0.044
7.734LeuSer: 7.734 ± 0.045
5.093LeuThr: 5.093 ± 0.039
5.73LeuVal: 5.73 ± 0.041
1.232LeuTrp: 1.232 ± 0.017
2.497LeuTyr: 2.497 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
2.174MetAla: 2.174 ± 0.024
0.254MetCys: 0.254 ± 0.008
1.279MetAsp: 1.279 ± 0.016
1.303MetGlu: 1.303 ± 0.016
0.731MetPhe: 0.731 ± 0.011
1.531MetGly: 1.531 ± 0.02
0.493MetHis: 0.493 ± 0.009
1.0MetIle: 1.0 ± 0.016
0.939MetLys: 0.939 ± 0.014
1.824MetLeu: 1.824 ± 0.022
0.591MetMet: 0.591 ± 0.012
0.773MetAsn: 0.773 ± 0.014
1.243MetPro: 1.243 ± 0.017
0.874MetGln: 0.874 ± 0.014
1.282MetArg: 1.282 ± 0.015
1.857MetSer: 1.857 ± 0.02
1.304MetThr: 1.304 ± 0.014
1.368MetVal: 1.368 ± 0.019
0.27MetTrp: 0.27 ± 0.007
0.526MetTyr: 0.526 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.955AsnAla: 2.955 ± 0.023
0.423AsnCys: 0.423 ± 0.01
1.767AsnAsp: 1.767 ± 0.019
1.843AsnGlu: 1.843 ± 0.022
1.249AsnPhe: 1.249 ± 0.018
2.651AsnGly: 2.651 ± 0.027
0.846AsnHis: 0.846 ± 0.013
1.887AsnIle: 1.887 ± 0.023
1.287AsnLys: 1.287 ± 0.017
3.073AsnLeu: 3.073 ± 0.029
0.796AsnMet: 0.796 ± 0.013
1.389AsnAsn: 1.389 ± 0.02
2.566AsnPro: 2.566 ± 0.026
1.264AsnGln: 1.264 ± 0.02
1.913AsnArg: 1.913 ± 0.02
2.496AsnSer: 2.496 ± 0.021
2.093AsnThr: 2.093 ± 0.023
2.146AsnVal: 2.146 ± 0.021
0.527AsnTrp: 0.527 ± 0.011
1.018AsnTyr: 1.018 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
5.573ProAla: 5.573 ± 0.056
0.608ProCys: 0.608 ± 0.012
3.404ProAsp: 3.404 ± 0.029
3.919ProGlu: 3.919 ± 0.038
2.177ProPhe: 2.177 ± 0.022
4.097ProGly: 4.097 ± 0.034
1.512ProHis: 1.512 ± 0.02
2.569ProIle: 2.569 ± 0.023
2.376ProLys: 2.376 ± 0.025
5.063ProLeu: 5.063 ± 0.037
1.091ProMet: 1.091 ± 0.017
2.107ProAsn: 2.107 ± 0.024
5.493ProPro: 5.493 ± 0.068
2.557ProGln: 2.557 ± 0.033
3.719ProArg: 3.719 ± 0.035
6.582ProSer: 6.582 ± 0.057
4.223ProThr: 4.223 ± 0.034
3.884ProVal: 3.884 ± 0.036
0.802ProTrp: 0.802 ± 0.012
1.571ProTyr: 1.571 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.427GlnAla: 3.427 ± 0.035
0.454GlnCys: 0.454 ± 0.01
2.007GlnAsp: 2.007 ± 0.023
2.36GlnGlu: 2.36 ± 0.029
1.269GlnPhe: 1.269 ± 0.015
2.427GlnGly: 2.427 ± 0.023
1.067GlnHis: 1.067 ± 0.017
1.857GlnIle: 1.857 ± 0.022
1.821GlnLys: 1.821 ± 0.023
3.469GlnLeu: 3.469 ± 0.029
0.877GlnMet: 0.877 ± 0.014
1.414GlnAsn: 1.414 ± 0.016
2.683GlnPro: 2.683 ± 0.036
2.362GlnGln: 2.362 ± 0.044
2.663GlnArg: 2.663 ± 0.025
3.171GlnSer: 3.171 ± 0.029
2.485GlnThr: 2.485 ± 0.024
2.181GlnVal: 2.181 ± 0.024
0.576GlnTrp: 0.576 ± 0.013
1.145GlnTyr: 1.145 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.928ArgAla: 4.928 ± 0.04
0.797ArgCys: 0.797 ± 0.014
3.481ArgAsp: 3.481 ± 0.034
3.879ArgGlu: 3.879 ± 0.037
2.242ArgPhe: 2.242 ± 0.025
3.941ArgGly: 3.941 ± 0.032
1.679ArgHis: 1.679 ± 0.022
2.906ArgIle: 2.906 ± 0.025
3.246ArgLys: 3.246 ± 0.031
5.767ArgLeu: 5.767 ± 0.04
1.404ArgMet: 1.404 ± 0.02
2.153ArgAsn: 2.153 ± 0.021
3.78ArgPro: 3.78 ± 0.034
2.677ArgGln: 2.677 ± 0.022
5.342ArgArg: 5.342 ± 0.047
4.964ArgSer: 4.964 ± 0.045
3.418ArgThr: 3.418 ± 0.03
3.668ArgVal: 3.668 ± 0.031
0.994ArgTrp: 0.994 ± 0.015
1.737ArgTyr: 1.737 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.863SerAla: 6.863 ± 0.047
0.926SerCys: 0.926 ± 0.014
4.299SerAsp: 4.299 ± 0.037
4.057SerGlu: 4.057 ± 0.034
3.102SerPhe: 3.102 ± 0.026
5.708SerGly: 5.708 ± 0.045
2.118SerHis: 2.118 ± 0.024
3.919SerIle: 3.919 ± 0.029
3.29SerLys: 3.29 ± 0.03
7.678SerLeu: 7.678 ± 0.045
1.726SerMet: 1.726 ± 0.022
2.821SerAsn: 2.821 ± 0.022
5.905SerPro: 5.905 ± 0.06
3.276SerGln: 3.276 ± 0.028
5.186SerArg: 5.186 ± 0.036
9.19SerSer: 9.19 ± 0.083
5.728SerThr: 5.728 ± 0.047
4.795SerVal: 4.795 ± 0.038
1.142SerTrp: 1.142 ± 0.017
2.137SerTyr: 2.137 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
5.398ThrAla: 5.398 ± 0.042
0.773ThrCys: 0.773 ± 0.014
2.981ThrAsp: 2.981 ± 0.027
3.094ThrGlu: 3.094 ± 0.033
2.214ThrPhe: 2.214 ± 0.022
4.364ThrGly: 4.364 ± 0.035
1.401ThrHis: 1.401 ± 0.019
3.026ThrIle: 3.026 ± 0.025
2.26ThrLys: 2.26 ± 0.023
5.466ThrLeu: 5.466 ± 0.039
1.191ThrMet: 1.191 ± 0.014
1.992ThrAsn: 1.992 ± 0.022
4.531ThrPro: 4.531 ± 0.042
2.086ThrGln: 2.086 ± 0.023
3.266ThrArg: 3.266 ± 0.028
5.365ThrSer: 5.365 ± 0.042
4.742ThrThr: 4.742 ± 0.056
3.954ThrVal: 3.954 ± 0.035
0.854ThrTrp: 0.854 ± 0.014
1.683ThrTyr: 1.683 ± 0.023
0.0ThrXaa: 0.0 ± 0.0
Val
5.324ValAla: 5.324 ± 0.039
0.894ValCys: 0.894 ± 0.014
3.79ValAsp: 3.79 ± 0.033
3.873ValGlu: 3.873 ± 0.039
2.54ValPhe: 2.54 ± 0.026
4.183ValGly: 4.183 ± 0.042
1.461ValHis: 1.461 ± 0.019
2.969ValIle: 2.969 ± 0.028
2.578ValLys: 2.578 ± 0.03
5.878ValLeu: 5.878 ± 0.037
1.355ValMet: 1.355 ± 0.016
2.139ValAsn: 2.139 ± 0.02
3.738ValPro: 3.738 ± 0.031
2.409ValGln: 2.409 ± 0.024
3.696ValArg: 3.696 ± 0.03
4.891ValSer: 4.891 ± 0.033
3.565ValThr: 3.565 ± 0.033
4.494ValVal: 4.494 ± 0.04
0.894ValTrp: 0.894 ± 0.013
1.846ValTyr: 1.846 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.147TrpAla: 1.147 ± 0.016
0.199TrpCys: 0.199 ± 0.007
0.903TrpAsp: 0.903 ± 0.014
0.851TrpGlu: 0.851 ± 0.014
0.542TrpPhe: 0.542 ± 0.01
0.961TrpGly: 0.961 ± 0.017
0.359TrpHis: 0.359 ± 0.01
0.743TrpIle: 0.743 ± 0.012
0.747TrpLys: 0.747 ± 0.013
1.39TrpLeu: 1.39 ± 0.018
0.394TrpMet: 0.394 ± 0.008
0.619TrpAsn: 0.619 ± 0.011
0.639TrpPro: 0.639 ± 0.011
0.559TrpGln: 0.559 ± 0.011
0.993TrpArg: 0.993 ± 0.016
1.067TrpSer: 1.067 ± 0.015
0.947TrpThr: 0.947 ± 0.016
0.961TrpVal: 0.961 ± 0.013
0.29TrpTrp: 0.29 ± 0.008
0.425TrpTyr: 0.425 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.183TyrAla: 2.183 ± 0.024
0.427TyrCys: 0.427 ± 0.009
1.613TyrAsp: 1.613 ± 0.02
1.531TyrGlu: 1.531 ± 0.018
1.196TyrPhe: 1.196 ± 0.017
2.13TyrGly: 2.13 ± 0.026
0.779TyrHis: 0.779 ± 0.015
1.443TyrIle: 1.443 ± 0.019
0.968TyrLys: 0.968 ± 0.015
2.831TyrLeu: 2.831 ± 0.028
0.624TyrMet: 0.624 ± 0.012
1.087TyrAsn: 1.087 ± 0.015
1.644TyrPro: 1.644 ± 0.02
1.104TyrGln: 1.104 ± 0.014
1.698TyrArg: 1.698 ± 0.021
2.064TyrSer: 2.064 ± 0.024
1.709TyrThr: 1.709 ± 0.019
1.688TyrVal: 1.688 ± 0.018
0.456TyrTrp: 0.456 ± 0.01
0.969TyrTyr: 0.969 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10837 proteins (4977815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski