Amino acid dipeptide frequency for Buceros rhinoceros silvestris

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
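A minimal sketch of how such a frequency could be computed is given here. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that pairs never span two proteins, and that any residue outside the 20 standard ones is mapped to X (Xaa). The function name and the one-letter codes are illustrative choices and are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything non-standard or unknown becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs inside a single protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: two short, made-up sequences (not real proteins).
freqs = dipeptide_per_mille(["MALLA", "ALU"])
```

With this toy input there are six dipeptides in total, two of which are AL, so `freqs["AL"]` is one third of 1000.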

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
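The uniform baseline described above can be sketched as follows. This assumes the comparison is made against 1/400 of all dipeptides, expressed in per mille. Whether the page's red and blue marking uses exactly this threshold is an assumption, and the `classify` helper is hypothetical. The four example values are taken from the table.

```python
N_STANDARD = 20
# 1 / (20 * 20) of all dipeptides, expressed in per mille: 2.5 (i.e. 0.25 %).
UNIFORM_PER_MILLE = 1000.0 / N_STANDARD ** 2

def classify(per_mille, baseline=UNIFORM_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform baseline."""
    if per_mille > baseline:
        return "over"
    if per_mille < baseline:
        return "under"
    return "expected"

# Values in per mille, copied from the table.
examples = {"LeuLeu": 10.068, "SerSer": 9.279, "TrpTrp": 0.196, "MetCys": 0.421}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this baseline LeuLeu and SerSer come out as overrepresented, while TrpTrp and MetCys come out as underrepresented.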

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
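As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are illustrative only. The AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value * 0.1

ala_ala = 5.781  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.005781
percent = per_mille_to_percent(ala_ala)    # about 0.5781
```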

For more information see sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell gives frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.781 ± 0.058 | 1.292 ± 0.02 | 3.051 ± 0.025 | 4.698 ± 0.04 | 2.613 ± 0.024 | 3.942 ± 0.041 | 1.369 ± 0.017 | 3.04 ± 0.03 | 3.744 ± 0.033 | 6.466 ± 0.047 | 1.492 ± 0.021 | 2.253 ± 0.024 | 2.888 ± 0.034 | 2.755 ± 0.03 | 2.928 ± 0.03 | 5.242 ± 0.043 | 3.379 ± 0.033 | 4.917 ± 0.039 | 0.692 ± 0.013 | 1.651 ± 0.02 | 0.0 ± 0.0
Cys | 1.166 ± 0.019 | 0.635 ± 0.014 | 1.072 ± 0.024 | 1.308 ± 0.025 | 0.925 ± 0.015 | 1.436 ± 0.025 | 0.634 ± 0.016 | 1.133 ± 0.019 | 1.327 ± 0.019 | 2.16 ± 0.026 | 0.448 ± 0.01 | 0.912 ± 0.017 | 1.23 ± 0.024 | 1.089 ± 0.018 | 1.214 ± 0.02 | 2.046 ± 0.03 | 1.192 ± 0.019 | 1.357 ± 0.022 | 0.296 ± 0.009 | 0.667 ± 0.012 | 0.0 ± 0.0
Asp | 2.927 ± 0.026 | 1.131 ± 0.023 | 2.919 ± 0.037 | 3.727 ± 0.037 | 2.284 ± 0.024 | 3.285 ± 0.042 | 1.168 ± 0.02 | 2.985 ± 0.026 | 2.813 ± 0.027 | 5.061 ± 0.039 | 1.178 ± 0.015 | 1.976 ± 0.024 | 2.604 ± 0.028 | 1.892 ± 0.021 | 2.397 ± 0.027 | 4.134 ± 0.037 | 2.56 ± 0.027 | 3.282 ± 0.033 | 0.659 ± 0.014 | 1.665 ± 0.021 | 0.0 ± 0.0
Glu | 4.696 ± 0.041 | 1.33 ± 0.03 | 4.556 ± 0.042 | 7.939 ± 0.088 | 2.241 ± 0.026 | 3.909 ± 0.035 | 1.554 ± 0.019 | 3.553 ± 0.037 | 5.836 ± 0.063 | 6.421 ± 0.055 | 1.759 ± 0.022 | 3.557 ± 0.03 | 2.559 ± 0.027 | 3.238 ± 0.036 | 3.936 ± 0.046 | 4.586 ± 0.039 | 3.592 ± 0.034 | 4.403 ± 0.04 | 0.754 ± 0.015 | 1.818 ± 0.021 | 0.0 ± 0.0
Phe | 2.198 ± 0.023 | 1.016 ± 0.018 | 1.838 ± 0.021 | 2.181 ± 0.027 | 1.863 ± 0.023 | 2.288 ± 0.024 | 1.101 ± 0.018 | 2.092 ± 0.024 | 2.086 ± 0.024 | 4.191 ± 0.041 | 0.802 ± 0.012 | 1.499 ± 0.021 | 1.971 ± 0.025 | 1.843 ± 0.021 | 1.948 ± 0.023 | 3.509 ± 0.03 | 2.268 ± 0.023 | 2.402 ± 0.023 | 0.514 ± 0.012 | 1.349 ± 0.02 | 0.0 ± 0.0
Gly | 3.499 ± 0.038 | 1.216 ± 0.021 | 2.914 ± 0.036 | 3.711 ± 0.044 | 2.495 ± 0.034 | 3.755 ± 0.046 | 1.513 ± 0.02 | 3.068 ± 0.031 | 3.957 ± 0.035 | 5.166 ± 0.041 | 1.384 ± 0.024 | 2.607 ± 0.026 | 2.676 ± 0.065 | 2.527 ± 0.028 | 3.153 ± 0.039 | 5.051 ± 0.047 | 3.41 ± 0.035 | 3.479 ± 0.032 | 0.754 ± 0.017 | 1.869 ± 0.031 | 0.0 ± 0.0
His | 1.309 ± 0.018 | 0.708 ± 0.015 | 0.944 ± 0.016 | 1.362 ± 0.02 | 1.098 ± 0.016 | 1.459 ± 0.02 | 0.866 ± 0.015 | 1.347 ± 0.019 | 1.383 ± 0.021 | 2.777 ± 0.027 | 0.571 ± 0.013 | 0.994 ± 0.016 | 1.486 ± 0.02 | 1.205 ± 0.019 | 1.437 ± 0.019 | 2.213 ± 0.028 | 1.284 ± 0.019 | 1.564 ± 0.021 | 0.39 ± 0.008 | 0.863 ± 0.015 | 0.0 ± 0.0
Ile | 2.995 ± 0.028 | 1.226 ± 0.019 | 2.354 ± 0.028 | 2.925 ± 0.029 | 2.141 ± 0.026 | 2.473 ± 0.029 | 1.382 ± 0.019 | 2.726 ± 0.029 | 3.064 ± 0.031 | 4.939 ± 0.044 | 1.047 ± 0.015 | 2.167 ± 0.024 | 2.863 ± 0.031 | 2.437 ± 0.028 | 2.57 ± 0.024 | 4.085 ± 0.032 | 2.861 ± 0.029 | 2.974 ± 0.03 | 0.592 ± 0.012 | 1.604 ± 0.021 | 0.0 ± 0.0
Lys | 4.161 ± 0.039 | 1.23 ± 0.02 | 3.472 ± 0.032 | 5.76 ± 0.056 | 1.938 ± 0.023 | 3.441 ± 0.044 | 1.628 ± 0.021 | 3.192 ± 0.03 | 5.358 ± 0.058 | 5.793 ± 0.042 | 1.568 ± 0.019 | 2.859 ± 0.026 | 3.145 ± 0.031 | 3.015 ± 0.033 | 3.594 ± 0.037 | 4.326 ± 0.035 | 3.484 ± 0.027 | 3.78 ± 0.03 | 0.677 ± 0.012 | 1.911 ± 0.02 | 0.0 ± 0.0
Leu | 6.212 ± 0.049 | 2.192 ± 0.028 | 4.872 ± 0.033 | 7.061 ± 0.063 | 3.571 ± 0.037 | 5.136 ± 0.04 | 2.687 ± 0.031 | 4.238 ± 0.036 | 6.54 ± 0.05 | 10.068 ± 0.089 | 2.035 ± 0.024 | 3.871 ± 0.033 | 5.449 ± 0.046 | 5.707 ± 0.048 | 5.204 ± 0.045 | 7.872 ± 0.055 | 5.022 ± 0.037 | 5.514 ± 0.048 | 1.087 ± 0.017 | 2.813 ± 0.031 | 0.0 ± 0.0
Met | 1.601 ± 0.022 | 0.421 ± 0.01 | 1.284 ± 0.016 | 1.871 ± 0.02 | 0.852 ± 0.016 | 1.268 ± 0.021 | 0.514 ± 0.011 | 0.979 ± 0.015 | 1.609 ± 0.022 | 2.081 ± 0.023 | 0.616 ± 0.015 | 1.002 ± 0.016 | 1.031 ± 0.017 | 1.029 ± 0.019 | 1.056 ± 0.016 | 1.55 ± 0.02 | 1.163 ± 0.017 | 1.461 ± 0.019 | 0.246 ± 0.008 | 0.655 ± 0.012 | 0.0 ± 0.0
Asn | 2.353 ± 0.023 | 1.002 ± 0.019 | 1.751 ± 0.022 | 2.609 ± 0.025 | 1.677 ± 0.022 | 2.791 ± 0.034 | 1.011 ± 0.017 | 2.487 ± 0.024 | 2.605 ± 0.025 | 4.111 ± 0.036 | 1.006 ± 0.017 | 1.853 ± 0.026 | 2.316 ± 0.025 | 1.729 ± 0.021 | 2.07 ± 0.022 | 3.501 ± 0.033 | 2.283 ± 0.031 | 2.527 ± 0.025 | 0.5 ± 0.01 | 1.302 ± 0.02 | 0.0 ± 0.0
Pro | 3.754 ± 0.038 | 1.047 ± 0.02 | 2.619 ± 0.026 | 3.859 ± 0.033 | 1.932 ± 0.024 | 3.619 ± 0.101 | 1.23 ± 0.019 | 1.935 ± 0.022 | 2.771 ± 0.029 | 4.648 ± 0.035 | 0.933 ± 0.016 | 1.903 ± 0.022 | 4.229 ± 0.074 | 2.332 ± 0.031 | 2.517 ± 0.027 | 4.995 ± 0.052 | 2.651 ± 0.03 | 3.814 ± 0.036 | 0.556 ± 0.011 | 1.444 ± 0.021 | 0.0 ± 0.0
Gln | 3.052 ± 0.033 | 0.979 ± 0.018 | 2.258 ± 0.024 | 3.726 ± 0.042 | 1.487 ± 0.02 | 2.478 ± 0.031 | 1.294 ± 0.016 | 2.233 ± 0.023 | 3.228 ± 0.036 | 4.654 ± 0.043 | 1.135 ± 0.018 | 2.095 ± 0.023 | 2.365 ± 0.031 | 3.074 ± 0.055 | 2.641 ± 0.026 | 3.218 ± 0.034 | 2.402 ± 0.028 | 2.783 ± 0.029 | 0.524 ± 0.011 | 1.288 ± 0.018 | 0.0 ± 0.0
Arg | 3.052 ± 0.029 | 1.128 ± 0.022 | 2.632 ± 0.032 | 3.775 ± 0.039 | 1.899 ± 0.024 | 2.818 ± 0.035 | 1.406 ± 0.021 | 2.6 ± 0.028 | 3.984 ± 0.044 | 4.904 ± 0.041 | 1.185 ± 0.017 | 2.286 ± 0.024 | 2.368 ± 0.032 | 2.487 ± 0.027 | 3.668 ± 0.043 | 4.012 ± 0.051 | 2.711 ± 0.027 | 2.954 ± 0.025 | 0.619 ± 0.014 | 1.608 ± 0.017 | 0.0 ± 0.0
Ser | 5.203 ± 0.04 | 1.832 ± 0.028 | 4.089 ± 0.039 | 5.252 ± 0.042 | 3.159 ± 0.027 | 4.987 ± 0.049 | 2.029 ± 0.024 | 3.557 ± 0.03 | 4.545 ± 0.039 | 8.021 ± 0.056 | 1.619 ± 0.023 | 3.081 ± 0.03 | 5.203 ± 0.058 | 3.698 ± 0.036 | 4.094 ± 0.045 | 9.279 ± 0.099 | 4.601 ± 0.04 | 5.193 ± 0.043 | 0.987 ± 0.016 | 2.268 ± 0.028 | 0.0 ± 0.0
Thr | 3.91 ± 0.037 | 1.302 ± 0.026 | 2.758 ± 0.025 | 3.824 ± 0.033 | 2.187 ± 0.027 | 3.428 ± 0.034 | 1.185 ± 0.016 | 2.617 ± 0.029 | 2.918 ± 0.026 | 5.174 ± 0.039 | 1.121 ± 0.017 | 1.989 ± 0.023 | 3.124 ± 0.036 | 2.13 ± 0.025 | 2.303 ± 0.023 | 4.737 ± 0.047 | 3.028 ± 0.035 | 4.169 ± 0.04 | 0.676 ± 0.013 | 1.58 ± 0.022 | 0.0 ± 0.0
Val | 4.08 ± 0.03 | 1.532 ± 0.022 | 3.139 ± 0.029 | 4.059 ± 0.032 | 2.73 ± 0.031 | 3.284 ± 0.038 | 1.572 ± 0.019 | 3.356 ± 0.03 | 3.942 ± 0.031 | 6.358 ± 0.052 | 1.402 ± 0.019 | 2.609 ± 0.031 | 3.468 ± 0.035 | 2.848 ± 0.031 | 2.989 ± 0.024 | 5.053 ± 0.042 | 3.967 ± 0.034 | 4.324 ± 0.037 | 0.727 ± 0.013 | 1.903 ± 0.022 | 0.0 ± 0.0
Trp | 0.663 ± 0.014 | 0.253 ± 0.007 | 0.652 ± 0.013 | 0.765 ± 0.014 | 0.459 ± 0.011 | 0.623 ± 0.016 | 0.305 ± 0.008 | 0.633 ± 0.013 | 0.907 ± 0.016 | 1.172 ± 0.018 | 0.314 ± 0.009 | 0.657 ± 0.015 | 0.433 ± 0.009 | 0.552 ± 0.011 | 0.67 ± 0.012 | 0.897 ± 0.015 | 0.66 ± 0.014 | 0.663 ± 0.011 | 0.196 ± 0.008 | 0.368 ± 0.009 | 0.0 ± 0.0
Tyr | 1.584 ± 0.021 | 0.774 ± 0.015 | 1.471 ± 0.017 | 1.823 ± 0.023 | 1.409 ± 0.022 | 1.782 ± 0.022 | 0.812 ± 0.012 | 1.654 ± 0.022 | 1.717 ± 0.022 | 2.985 ± 0.035 | 0.675 ± 0.013 | 1.314 ± 0.018 | 1.381 ± 0.022 | 1.325 ± 0.018 | 1.706 ± 0.025 | 2.393 ± 0.026 | 1.641 ± 0.02 | 1.782 ± 0.022 | 0.397 ± 0.012 | 1.109 ± 0.019 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001
Statistics based on 10603 proteins (4535817 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
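A protein-level bootstrap of this kind might look like the sketch below. It resamples whole proteins with replacement, recomputes the frequency of one dipeptide in each replicate, and reports the sample standard deviation across replicates. Taking the standard deviation as the error, the fixed seed, the function names, and the toy sequences are all assumptions made for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy, made-up sequences (one-letter codes), not real proteins.
proteins = ["MALLAK", "MLLSSE", "MKELLA", "MSSLAG"]
error = bootstrap_error(proteins, "LL")
```

Resampling at the protein level rather than at the residue level keeps each protein's internal composition intact, so the spread reflects variation between proteins.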

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski