Amino acid dipeptide frequency for Podiceps cristatus (great crested grebe)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
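The calculation described above can be sketched in a few lines of Python. This is only an illustration and not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 inside
    each protein; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not real grebe proteins.
toy = ["MALLLSE", "MKLLB"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(pair, round(value, 1), label)
```

On such a tiny input every observed pair is far above 2.5‰; the comparison only becomes meaningful on a full proteome.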

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
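As a small worked example of the unit conversion, the sketch below turns two values from the table (LeuLeu at 10.07‰ and TrpTrp at 0.197‰) into fractions and compares them with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is an assumption introduced only for this illustration.

```python
UNIFORM_FRACTION = 1 / 400  # 0.0025, i.e. 0.25% or 2.5 per mille


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


leu_leu = per_mille_to_fraction(10.07)  # LeuLeu value taken from the table
trp_trp = per_mille_to_fraction(0.197)  # TrpTrp value taken from the table

# Ratio to the uniform expectation: above 1 means overrepresented,
# below 1 means underrepresented.
leu_leu_ratio = leu_leu / UNIFORM_FRACTION
trp_trp_ratio = trp_trp / UNIFORM_FRACTION
print(round(leu_leu_ratio, 3), round(trp_trp_ratio, 4))
```

LeuLeu comes out at roughly four times the uniform expectation, while TrpTrp is more than ten times below it.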

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.592 ± 0.072
AlaCys: 1.341 ± 0.024
AlaAsp: 2.986 ± 0.033
AlaGlu: 4.55 ± 0.05
AlaPhe: 2.681 ± 0.029
AlaGly: 3.79 ± 0.043
AlaHis: 1.376 ± 0.023
AlaIle: 3.121 ± 0.032
AlaLys: 3.765 ± 0.042
AlaLeu: 6.435 ± 0.062
AlaMet: 1.474 ± 0.023
AlaAsn: 2.26 ± 0.029
AlaPro: 2.876 ± 0.048
AlaGln: 2.695 ± 0.035
AlaArg: 2.944 ± 0.038
AlaSer: 5.155 ± 0.049
AlaThr: 3.362 ± 0.033
AlaVal: 4.835 ± 0.039
AlaTrp: 0.713 ± 0.018
AlaTyr: 1.649 ± 0.026
AlaXaa: 0.002 ± 0.001
Cys
CysAla: 1.186 ± 0.024
CysCys: 0.661 ± 0.018
CysAsp: 1.054 ± 0.023
CysGlu: 1.29 ± 0.025
CysPhe: 0.951 ± 0.02
CysGly: 1.474 ± 0.033
CysHis: 0.643 ± 0.015
CysIle: 1.175 ± 0.026
CysLys: 1.35 ± 0.022
CysLeu: 2.185 ± 0.03
CysMet: 0.458 ± 0.013
CysAsn: 0.933 ± 0.021
CysPro: 1.229 ± 0.029
CysGln: 1.061 ± 0.024
CysArg: 1.206 ± 0.021
CysSer: 2.016 ± 0.036
CysThr: 1.17 ± 0.023
CysVal: 1.368 ± 0.035
CysTrp: 0.313 ± 0.01
CysTyr: 0.697 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.916 ± 0.03
AspCys: 1.106 ± 0.021
AspAsp: 2.92 ± 0.039
AspGlu: 3.656 ± 0.045
AspPhe: 2.266 ± 0.033
AspGly: 3.274 ± 0.045
AspHis: 1.177 ± 0.019
AspIle: 2.974 ± 0.037
AspLys: 2.811 ± 0.034
AspLeu: 5.013 ± 0.041
AspMet: 1.182 ± 0.022
AspAsn: 1.975 ± 0.023
AspPro: 2.615 ± 0.037
AspGln: 1.834 ± 0.025
AspArg: 2.333 ± 0.034
AspSer: 4.083 ± 0.043
AspThr: 2.569 ± 0.028
AspVal: 3.226 ± 0.039
AspTrp: 0.668 ± 0.015
AspTyr: 1.711 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.69 ± 0.056
GluCys: 1.293 ± 0.033
GluAsp: 4.489 ± 0.043
GluGlu: 7.868 ± 0.103
GluPhe: 2.248 ± 0.026
GluGly: 3.844 ± 0.041
GluHis: 1.557 ± 0.023
GluIle: 3.546 ± 0.036
GluLys: 5.795 ± 0.074
GluLeu: 6.293 ± 0.068
GluMet: 1.821 ± 0.027
GluAsn: 3.459 ± 0.039
GluPro: 2.533 ± 0.036
GluGln: 3.223 ± 0.043
GluArg: 3.893 ± 0.055
GluSer: 4.544 ± 0.052
GluThr: 3.602 ± 0.041
GluVal: 4.417 ± 0.04
GluTrp: 0.73 ± 0.017
GluTyr: 1.851 ± 0.026
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.197 ± 0.03
PheCys: 1.039 ± 0.022
PheAsp: 1.861 ± 0.03
PheGlu: 2.168 ± 0.028
PhePhe: 2.003 ± 0.031
PheGly: 2.332 ± 0.031
PheHis: 1.094 ± 0.017
PheIle: 2.172 ± 0.031
PheLys: 2.235 ± 0.034
PheLeu: 4.343 ± 0.045
PheMet: 0.815 ± 0.019
PheAsn: 1.554 ± 0.026
PhePro: 1.966 ± 0.03
PheGln: 1.847 ± 0.025
PheArg: 1.975 ± 0.029
PheSer: 3.57 ± 0.035
PheThr: 2.358 ± 0.035
PheVal: 2.397 ± 0.028
PheTrp: 0.546 ± 0.015
PheTyr: 1.367 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.428 ± 0.037
GlyCys: 1.204 ± 0.022
GlyAsp: 2.886 ± 0.041
GlyGlu: 3.723 ± 0.05
GlyPhe: 2.56 ± 0.038
GlyGly: 3.774 ± 0.055
GlyHis: 1.503 ± 0.027
GlyIle: 3.152 ± 0.034
GlyLys: 4.045 ± 0.042
GlyLeu: 5.065 ± 0.055
GlyMet: 1.394 ± 0.026
GlyAsn: 2.649 ± 0.028
GlyPro: 2.617 ± 0.066
GlyGln: 2.487 ± 0.035
GlyArg: 3.071 ± 0.04
GlySer: 5.035 ± 0.058
GlyThr: 3.381 ± 0.038
GlyVal: 3.433 ± 0.034
GlyTrp: 0.757 ± 0.02
GlyTyr: 1.862 ± 0.028
GlyXaa: 0.002 ± 0.001
His
HisAla: 1.325 ± 0.022
HisCys: 0.72 ± 0.017
HisAsp: 0.909 ± 0.02
HisGlu: 1.373 ± 0.023
HisPhe: 1.133 ± 0.023
HisGly: 1.459 ± 0.024
HisHis: 0.848 ± 0.021
HisIle: 1.355 ± 0.023
HisLys: 1.427 ± 0.024
HisLeu: 2.776 ± 0.03
HisMet: 0.609 ± 0.016
HisAsn: 1.003 ± 0.019
HisPro: 1.458 ± 0.025
HisGln: 1.184 ± 0.023
HisArg: 1.462 ± 0.023
HisSer: 2.193 ± 0.031
HisThr: 1.339 ± 0.024
HisVal: 1.559 ± 0.028
HisTrp: 0.411 ± 0.011
HisTyr: 0.859 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.079 ± 0.033
IleCys: 1.262 ± 0.024
IleAsp: 2.359 ± 0.031
IleGlu: 2.91 ± 0.038
IlePhe: 2.276 ± 0.035
IleGly: 2.512 ± 0.033
IleHis: 1.407 ± 0.021
IleIle: 2.768 ± 0.032
IleLys: 3.091 ± 0.034
IleLeu: 5.035 ± 0.041
IleMet: 1.107 ± 0.019
IleAsn: 2.238 ± 0.027
IlePro: 2.895 ± 0.031
IleGln: 2.443 ± 0.032
IleArg: 2.567 ± 0.028
IleSer: 4.176 ± 0.04
IleThr: 2.877 ± 0.037
IleVal: 2.962 ± 0.034
IleTrp: 0.598 ± 0.016
IleTyr: 1.677 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.173 ± 0.045
LysCys: 1.246 ± 0.026
LysAsp: 3.496 ± 0.037
LysGlu: 5.808 ± 0.08
LysPhe: 2.025 ± 0.028
LysGly: 3.495 ± 0.052
LysHis: 1.67 ± 0.028
LysIle: 3.264 ± 0.035
LysLys: 5.562 ± 0.07
LysLeu: 5.811 ± 0.057
LysMet: 1.614 ± 0.026
LysAsn: 2.829 ± 0.032
LysPro: 3.168 ± 0.045
LysGln: 3.072 ± 0.042
LysArg: 3.577 ± 0.042
LysSer: 4.428 ± 0.052
LysThr: 3.468 ± 0.036
LysVal: 3.775 ± 0.037
LysTrp: 0.666 ± 0.015
LysTyr: 1.906 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.25 ± 0.061
LeuCys: 2.201 ± 0.034
LeuAsp: 4.906 ± 0.046
LeuGlu: 7.01 ± 0.075
LeuPhe: 3.664 ± 0.039
LeuGly: 5.18 ± 0.048
LeuHis: 2.679 ± 0.034
LeuIle: 4.347 ± 0.037
LeuLys: 6.473 ± 0.055
LeuLeu: 10.07 ± 0.098
LeuMet: 2.071 ± 0.027
LeuAsn: 3.954 ± 0.039
LeuPro: 5.352 ± 0.053
LeuGln: 5.583 ± 0.064
LeuArg: 5.076 ± 0.045
LeuSer: 7.811 ± 0.057
LeuThr: 5.017 ± 0.043
LeuVal: 5.512 ± 0.044
LeuTrp: 1.097 ± 0.021
LeuTyr: 2.794 ± 0.032
LeuXaa: 0.001 ± 0.001
Met
MetAla: 1.63 ± 0.022
MetCys: 0.442 ± 0.013
MetAsp: 1.296 ± 0.024
MetGlu: 1.909 ± 0.031
MetPhe: 0.872 ± 0.019
MetGly: 1.281 ± 0.023
MetHis: 0.536 ± 0.014
MetIle: 1.011 ± 0.021
MetLys: 1.629 ± 0.025
MetLeu: 2.126 ± 0.029
MetMet: 0.62 ± 0.017
MetAsn: 1.01 ± 0.016
MetPro: 1.06 ± 0.022
MetGln: 1.043 ± 0.022
MetArg: 1.091 ± 0.019
MetSer: 1.532 ± 0.025
MetThr: 1.163 ± 0.025
MetVal: 1.478 ± 0.022
MetTrp: 0.259 ± 0.009
MetTyr: 0.699 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.362 ± 0.031
AsnCys: 0.98 ± 0.025
AsnAsp: 1.732 ± 0.025
AsnGlu: 2.602 ± 0.037
AsnPhe: 1.683 ± 0.026
AsnGly: 2.817 ± 0.041
AsnHis: 1.039 ± 0.02
AsnIle: 2.543 ± 0.034
AsnLys: 2.679 ± 0.032
AsnLeu: 4.173 ± 0.045
AsnMet: 1.036 ± 0.021
AsnAsn: 1.897 ± 0.03
AsnPro: 2.317 ± 0.033
AsnGln: 1.756 ± 0.027
AsnArg: 2.088 ± 0.023
AsnSer: 3.498 ± 0.048
AsnThr: 2.279 ± 0.027
AsnVal: 2.533 ± 0.027
AsnTrp: 0.514 ± 0.012
AsnTyr: 1.346 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.633 ± 0.045
ProCys: 1.064 ± 0.025
ProAsp: 2.606 ± 0.03
ProGlu: 3.871 ± 0.046
ProPhe: 1.947 ± 0.028
ProGly: 3.444 ± 0.105
ProHis: 1.22 ± 0.021
ProIle: 2.002 ± 0.028
ProLys: 2.785 ± 0.038
ProLeu: 4.617 ± 0.048
ProMet: 0.959 ± 0.018
ProAsn: 1.957 ± 0.033
ProPro: 4.165 ± 0.084
ProGln: 2.293 ± 0.038
ProArg: 2.54 ± 0.035
ProSer: 4.894 ± 0.061
ProThr: 2.623 ± 0.037
ProVal: 3.666 ± 0.042
ProTrp: 0.549 ± 0.015
ProTyr: 1.484 ± 0.022
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 3.005 ± 0.041
GlnCys: 0.965 ± 0.019
GlnAsp: 2.215 ± 0.029
GlnGlu: 3.696 ± 0.046
GlnPhe: 1.483 ± 0.02
GlnGly: 2.431 ± 0.038
GlnHis: 1.265 ± 0.024
GlnIle: 2.259 ± 0.033
GlnLys: 3.293 ± 0.043
GlnLeu: 4.605 ± 0.051
GlnMet: 1.116 ± 0.021
GlnAsn: 2.056 ± 0.029
GlnPro: 2.275 ± 0.035
GlnGln: 3.009 ± 0.07
GlnArg: 2.635 ± 0.033
GlnSer: 3.162 ± 0.04
GlnThr: 2.357 ± 0.033
GlnVal: 2.735 ± 0.031
GlnTrp: 0.531 ± 0.014
GlnTyr: 1.278 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.019 ± 0.033
ArgCys: 1.098 ± 0.022
ArgAsp: 2.576 ± 0.031
ArgGlu: 3.703 ± 0.048
ArgPhe: 1.969 ± 0.027
ArgGly: 2.796 ± 0.045
ArgHis: 1.421 ± 0.023
ArgIle: 2.61 ± 0.032
ArgLys: 3.974 ± 0.049
ArgLeu: 4.912 ± 0.05
ArgMet: 1.178 ± 0.021
ArgAsn: 2.285 ± 0.029
ArgPro: 2.374 ± 0.039
ArgGln: 2.473 ± 0.032
ArgArg: 3.585 ± 0.051
ArgSer: 3.936 ± 0.061
ArgThr: 2.711 ± 0.028
ArgVal: 2.933 ± 0.03
ArgTrp: 0.62 ± 0.015
ArgTyr: 1.618 ± 0.023
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 5.028 ± 0.043
SerCys: 1.829 ± 0.029
SerAsp: 4.019 ± 0.043
SerGlu: 5.163 ± 0.059
SerPhe: 3.213 ± 0.037
SerGly: 4.967 ± 0.065
SerHis: 2.029 ± 0.028
SerIle: 3.637 ± 0.041
SerLys: 4.615 ± 0.05
SerLeu: 8.0 ± 0.063
SerMet: 1.649 ± 0.028
SerAsn: 3.164 ± 0.033
SerPro: 5.096 ± 0.074
SerGln: 3.605 ± 0.041
SerArg: 4.043 ± 0.048
SerSer: 9.264 ± 0.111
SerThr: 4.547 ± 0.05
SerVal: 5.163 ± 0.044
SerTrp: 0.968 ± 0.018
SerTyr: 2.319 ± 0.032
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 3.738 ± 0.036
ThrCys: 1.343 ± 0.024
ThrAsp: 2.752 ± 0.033
ThrGlu: 3.834 ± 0.042
ThrPhe: 2.257 ± 0.033
ThrGly: 3.449 ± 0.039
ThrHis: 1.209 ± 0.026
ThrIle: 2.617 ± 0.033
ThrLys: 2.977 ± 0.036
ThrLeu: 5.155 ± 0.04
ThrMet: 1.164 ± 0.02
ThrAsn: 2.0 ± 0.025
ThrPro: 3.111 ± 0.042
ThrGln: 2.101 ± 0.031
ThrArg: 2.289 ± 0.027
ThrSer: 4.707 ± 0.053
ThrThr: 2.985 ± 0.042
ThrVal: 4.116 ± 0.039
ThrTrp: 0.653 ± 0.015
ThrTyr: 1.547 ± 0.023
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.08 ± 0.045
ValCys: 1.596 ± 0.032
ValAsp: 3.078 ± 0.033
ValGlu: 3.949 ± 0.035
ValPhe: 2.732 ± 0.034
ValGly: 3.278 ± 0.04
ValHis: 1.617 ± 0.026
ValIle: 3.356 ± 0.033
ValLys: 3.925 ± 0.036
ValLeu: 6.269 ± 0.055
ValMet: 1.409 ± 0.024
ValAsn: 2.622 ± 0.029
ValPro: 3.425 ± 0.04
ValGln: 2.779 ± 0.03
ValArg: 3.025 ± 0.031
ValSer: 5.001 ± 0.048
ValThr: 3.852 ± 0.046
ValVal: 4.376 ± 0.045
ValTrp: 0.722 ± 0.017
ValTyr: 1.875 ± 0.024
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.652 ± 0.017
TrpCys: 0.257 ± 0.011
TrpAsp: 0.704 ± 0.017
TrpGlu: 0.773 ± 0.018
TrpPhe: 0.451 ± 0.014
TrpGly: 0.639 ± 0.021
TrpHis: 0.308 ± 0.01
TrpIle: 0.626 ± 0.014
TrpLys: 0.89 ± 0.018
TrpLeu: 1.192 ± 0.025
TrpMet: 0.314 ± 0.009
TrpAsn: 0.699 ± 0.016
TrpPro: 0.424 ± 0.012
TrpGln: 0.547 ± 0.015
TrpArg: 0.646 ± 0.016
TrpSer: 0.897 ± 0.019
TrpThr: 0.66 ± 0.015
TrpVal: 0.666 ± 0.016
TrpTrp: 0.197 ± 0.008
TrpTyr: 0.371 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.616 ± 0.023
TyrCys: 0.772 ± 0.02
TyrAsp: 1.465 ± 0.023
TyrGlu: 1.842 ± 0.031
TyrPhe: 1.424 ± 0.025
TyrGly: 1.789 ± 0.027
TyrHis: 0.804 ± 0.018
TyrIle: 1.7 ± 0.031
TyrLys: 1.732 ± 0.026
TyrLeu: 2.973 ± 0.042
TyrMet: 0.696 ± 0.015
TyrAsn: 1.33 ± 0.023
TyrPro: 1.364 ± 0.026
TyrGln: 1.336 ± 0.021
TyrArg: 1.744 ± 0.024
TyrSer: 2.417 ± 0.03
TyrThr: 1.684 ± 0.034
TyrVal: 1.817 ± 0.028
TyrTrp: 0.403 ± 0.015
TyrTyr: 1.14 ± 0.021
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.001 ± 0.001
XaaGly: 0.002 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.002 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.001 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.05 ± 0.009
Statistics based on 7899 proteins (3087637 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
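A protein-level bootstrap of this kind might look like the sketch below. It is not the database's actual procedure: the function names, the toy sequences, the fixed random seed, and the use of the standard deviation across replicates as the reported error are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Whole proteins (not individual residues or dipeptides) are resampled
    with replacement, which is what "at the protein level" is taken to mean.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not real grebe proteins.
toy = ["MALLLSE", "MKLLA", "GGGG", "ACDE"]
print(pair_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error estimate reflects variation between proteins rather than treating every dipeptide as independent.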

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski