Amino acid dipeptide frequency for Egretta garzetta (Little egret)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
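
To make the comparison with the uniform expectation concrete, here is a minimal Python sketch of how such frequencies could be computed. It is not the pipeline used to build this table: the function names are hypothetical, sequences are given in one-letter code rather than the three-letter code used in the table, and it assumes that overlapping pairs are counted and that pairs never span two proteins.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 0.25%, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: overlapping windows of length 2, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency as over- or underrepresented relative to the expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MALLS", "LLK"]
freqs = dipeptide_per_mille(toy)
```

With values from the table, `classify(5.608)` (AlaAla) gives "over" and `classify(0.638)` (CysCys) gives "under".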

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
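
As a small illustration of the unit conversion, the following Python sketch turns a per mille value into a fraction and a percentage. The helper names are hypothetical and only serve this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# AlaAla from the table: 5.608 per mille.
ala_ala_fraction = per_mille_to_fraction(5.608)
ala_ala_percent = per_mille_to_percent(5.608)

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_percent = per_mille_to_percent(2.5)
```

For AlaAla this gives a fraction of about 0.0056, or about 0.56%, which is a little more than twice the uniform expectation of 0.25%.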

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.608 ± 0.047
AlaCys: 1.352 ± 0.019
AlaAsp: 3.005 ± 0.025
AlaGlu: 4.52 ± 0.039
AlaPhe: 2.718 ± 0.025
AlaGly: 3.828 ± 0.034
AlaHis: 1.357 ± 0.017
AlaIle: 3.113 ± 0.027
AlaLys: 3.737 ± 0.033
AlaLeu: 6.559 ± 0.049
AlaMet: 1.5 ± 0.017
AlaAsn: 2.262 ± 0.023
AlaPro: 3.085 ± 0.032
AlaGln: 2.705 ± 0.032
AlaArg: 3.151 ± 0.026
AlaSer: 5.099 ± 0.038
AlaThr: 3.294 ± 0.028
AlaVal: 4.878 ± 0.037
AlaTrp: 0.714 ± 0.011
AlaTyr: 1.674 ± 0.02
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.167 ± 0.018
CysCys: 0.638 ± 0.013
CysAsp: 1.053 ± 0.018
CysGlu: 1.294 ± 0.022
CysPhe: 0.918 ± 0.013
CysGly: 1.399 ± 0.022
CysHis: 0.619 ± 0.014
CysIle: 1.181 ± 0.018
CysLys: 1.336 ± 0.019
CysLeu: 2.141 ± 0.026
CysMet: 0.447 ± 0.01
CysAsn: 0.911 ± 0.015
CysPro: 1.221 ± 0.025
CysGln: 1.037 ± 0.018
CysArg: 1.18 ± 0.016
CysSer: 1.987 ± 0.027
CysThr: 1.161 ± 0.019
CysVal: 1.395 ± 0.028
CysTrp: 0.284 ± 0.008
CysTyr: 0.67 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.89 ± 0.024
AspCys: 1.092 ± 0.019
AspAsp: 2.851 ± 0.032
AspGlu: 3.665 ± 0.034
AspPhe: 2.26 ± 0.021
AspGly: 3.505 ± 0.029
AspHis: 1.164 ± 0.015
AspIle: 3.156 ± 0.027
AspLys: 2.805 ± 0.026
AspLeu: 5.074 ± 0.031
AspMet: 1.177 ± 0.014
AspAsn: 1.92 ± 0.024
AspPro: 2.607 ± 0.024
AspGln: 1.848 ± 0.018
AspArg: 2.434 ± 0.026
AspSer: 4.029 ± 0.036
AspThr: 2.503 ± 0.024
AspVal: 3.225 ± 0.03
AspTrp: 0.649 ± 0.014
AspTyr: 1.645 ± 0.018
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.0 ± 0.037
GluCys: 1.275 ± 0.028
GluAsp: 4.51 ± 0.03
GluGlu: 7.81 ± 0.073
GluPhe: 2.21 ± 0.023
GluGly: 3.845 ± 0.032
GluHis: 1.56 ± 0.015
GluIle: 3.558 ± 0.032
GluLys: 5.785 ± 0.06
GluLeu: 6.365 ± 0.056
GluMet: 1.774 ± 0.023
GluAsn: 3.45 ± 0.035
GluPro: 2.489 ± 0.025
GluGln: 3.198 ± 0.037
GluArg: 3.828 ± 0.038
GluSer: 4.432 ± 0.036
GluThr: 3.659 ± 0.028
GluVal: 4.55 ± 0.035
GluTrp: 0.719 ± 0.011
GluTyr: 1.858 ± 0.02
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.151 ± 0.022
PheCys: 0.994 ± 0.018
PheAsp: 1.828 ± 0.021
PheGlu: 2.146 ± 0.02
PhePhe: 2.132 ± 0.026
PheGly: 2.293 ± 0.026
PheHis: 1.082 ± 0.015
PheIle: 2.124 ± 0.028
PheLys: 2.581 ± 0.025
PheLeu: 4.231 ± 0.035
PheMet: 0.795 ± 0.013
PheAsn: 1.506 ± 0.018
PhePro: 1.909 ± 0.022
PheGln: 1.837 ± 0.018
PheArg: 2.141 ± 0.02
PheSer: 3.474 ± 0.031
PheThr: 2.437 ± 0.02
PheVal: 2.36 ± 0.024
PheTrp: 0.523 ± 0.011
PheTyr: 1.311 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.682 ± 0.03
GlyCys: 1.258 ± 0.02
GlyAsp: 2.948 ± 0.027
GlyGlu: 3.722 ± 0.039
GlyPhe: 2.71 ± 0.031
GlyGly: 3.814 ± 0.046
GlyHis: 1.485 ± 0.021
GlyIle: 3.056 ± 0.026
GlyLys: 3.93 ± 0.031
GlyLeu: 5.147 ± 0.038
GlyMet: 1.337 ± 0.02
GlyAsn: 2.614 ± 0.022
GlyPro: 2.638 ± 0.052
GlyGln: 2.441 ± 0.024
GlyArg: 3.341 ± 0.029
GlySer: 4.921 ± 0.043
GlyThr: 3.363 ± 0.032
GlyVal: 3.411 ± 0.032
GlyTrp: 0.765 ± 0.015
GlyTyr: 1.846 ± 0.023
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.338 ± 0.017
HisCys: 0.7 ± 0.012
HisAsp: 0.932 ± 0.015
HisGlu: 1.354 ± 0.019
HisPhe: 1.082 ± 0.014
HisGly: 1.49 ± 0.019
HisHis: 0.871 ± 0.018
HisIle: 1.327 ± 0.018
HisLys: 1.415 ± 0.019
HisLeu: 2.812 ± 0.027
HisMet: 0.602 ± 0.011
HisAsn: 0.973 ± 0.014
HisPro: 1.466 ± 0.019
HisGln: 1.156 ± 0.017
HisArg: 1.42 ± 0.018
HisSer: 2.137 ± 0.023
HisThr: 1.295 ± 0.019
HisVal: 1.512 ± 0.016
HisTrp: 0.605 ± 0.011
HisTyr: 0.859 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.066 ± 0.028
IleCys: 1.221 ± 0.019
IleAsp: 2.343 ± 0.021
IleGlu: 2.926 ± 0.03
IlePhe: 2.181 ± 0.025
IleGly: 2.487 ± 0.025
IleHis: 1.382 ± 0.016
IleIle: 2.729 ± 0.027
IleLys: 3.058 ± 0.022
IleLeu: 4.963 ± 0.037
IleMet: 1.121 ± 0.016
IleAsn: 2.191 ± 0.025
IlePro: 2.855 ± 0.026
IleGln: 2.473 ± 0.024
IleArg: 2.782 ± 0.022
IleSer: 4.096 ± 0.03
IleThr: 2.855 ± 0.028
IleVal: 2.92 ± 0.026
IleTrp: 0.582 ± 0.011
IleTyr: 1.629 ± 0.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.381 ± 0.033
LysCys: 1.223 ± 0.019
LysAsp: 3.498 ± 0.029
LysGlu: 5.676 ± 0.056
LysPhe: 2.193 ± 0.022
LysGly: 3.401 ± 0.036
LysHis: 1.84 ± 0.02
LysIle: 3.216 ± 0.028
LysLys: 5.703 ± 0.047
LysLeu: 6.031 ± 0.04
LysMet: 1.569 ± 0.02
LysAsn: 2.79 ± 0.023
LysPro: 3.094 ± 0.031
LysGln: 3.078 ± 0.026
LysArg: 3.545 ± 0.031
LysSer: 4.279 ± 0.038
LysThr: 3.424 ± 0.025
LysVal: 3.767 ± 0.026
LysTrp: 0.68 ± 0.012
LysTyr: 1.889 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.182 ± 0.044
LeuCys: 2.159 ± 0.028
LeuAsp: 5.305 ± 0.035
LeuGlu: 7.261 ± 0.06
LeuPhe: 3.607 ± 0.034
LeuGly: 5.196 ± 0.037
LeuHis: 2.694 ± 0.024
LeuIle: 4.488 ± 0.032
LeuLys: 6.582 ± 0.052
LeuLeu: 10.023 ± 0.071
LeuMet: 2.053 ± 0.021
LeuAsn: 4.041 ± 0.033
LeuPro: 5.489 ± 0.039
LeuGln: 5.567 ± 0.044
LeuArg: 5.146 ± 0.035
LeuSer: 7.956 ± 0.045
LeuThr: 4.906 ± 0.032
LeuVal: 5.398 ± 0.038
LeuTrp: 1.096 ± 0.015
LeuTyr: 2.814 ± 0.027
LeuXaa: 0.001 ± 0.001
Met
MetAla: 1.661 ± 0.018
MetCys: 0.436 ± 0.01
MetAsp: 1.285 ± 0.015
MetGlu: 1.903 ± 0.023
MetPhe: 0.827 ± 0.013
MetGly: 1.258 ± 0.018
MetHis: 0.535 ± 0.01
MetIle: 1.001 ± 0.015
MetLys: 1.629 ± 0.017
MetLeu: 2.041 ± 0.02
MetMet: 0.597 ± 0.012
MetAsn: 1.0 ± 0.015
MetPro: 1.022 ± 0.016
MetGln: 1.009 ± 0.014
MetArg: 1.3 ± 0.016
MetSer: 1.548 ± 0.017
MetThr: 1.134 ± 0.016
MetVal: 1.406 ± 0.018
MetTrp: 0.248 ± 0.007
MetTyr: 0.656 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.362 ± 0.021
AsnCys: 0.955 ± 0.016
AsnAsp: 1.724 ± 0.021
AsnGlu: 2.577 ± 0.026
AsnPhe: 1.616 ± 0.016
AsnGly: 3.021 ± 0.03
AsnHis: 0.994 ± 0.015
AsnIle: 2.453 ± 0.022
AsnLys: 2.595 ± 0.021
AsnLeu: 4.276 ± 0.03
AsnMet: 0.997 ± 0.015
AsnAsn: 1.827 ± 0.021
AsnPro: 2.259 ± 0.026
AsnGln: 1.856 ± 0.02
AsnArg: 2.196 ± 0.021
AsnSer: 3.428 ± 0.031
AsnThr: 2.224 ± 0.021
AsnVal: 2.464 ± 0.023
AsnTrp: 0.501 ± 0.011
AsnTyr: 1.304 ± 0.017
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.614 ± 0.036
ProCys: 1.04 ± 0.019
ProAsp: 2.577 ± 0.022
ProGlu: 3.799 ± 0.033
ProPhe: 1.866 ± 0.02
ProGly: 3.51 ± 0.069
ProHis: 1.245 ± 0.016
ProIle: 1.926 ± 0.02
ProLys: 2.749 ± 0.033
ProLeu: 4.574 ± 0.037
ProMet: 0.937 ± 0.014
ProAsn: 1.878 ± 0.02
ProPro: 4.173 ± 0.056
ProGln: 2.315 ± 0.029
ProArg: 2.699 ± 0.027
ProSer: 5.048 ± 0.047
ProThr: 2.585 ± 0.027
ProVal: 3.626 ± 0.029
ProTrp: 0.563 ± 0.011
ProTyr: 1.431 ± 0.017
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.021 ± 0.031
GlnCys: 0.963 ± 0.019
GlnAsp: 2.197 ± 0.021
GlnGlu: 3.638 ± 0.038
GlnPhe: 1.498 ± 0.019
GlnGly: 2.664 ± 0.026
GlnHis: 1.29 ± 0.018
GlnIle: 2.264 ± 0.023
GlnLys: 3.206 ± 0.029
GlnLeu: 4.527 ± 0.041
GlnMet: 1.098 ± 0.016
GlnAsn: 2.014 ± 0.019
GlnPro: 2.284 ± 0.03
GlnGln: 2.984 ± 0.054
GlnArg: 2.589 ± 0.022
GlnSer: 3.136 ± 0.036
GlnThr: 2.308 ± 0.025
GlnVal: 2.73 ± 0.024
GlnTrp: 0.522 ± 0.011
GlnTyr: 1.335 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.109 ± 0.029
ArgCys: 1.076 ± 0.02
ArgAsp: 2.584 ± 0.026
ArgGlu: 4.016 ± 0.037
ArgPhe: 2.134 ± 0.021
ArgGly: 2.849 ± 0.035
ArgHis: 1.403 ± 0.018
ArgIle: 2.586 ± 0.024
ArgLys: 4.086 ± 0.033
ArgLeu: 5.578 ± 0.041
ArgMet: 1.165 ± 0.014
ArgAsn: 2.238 ± 0.021
ArgPro: 2.333 ± 0.026
ArgGln: 2.433 ± 0.022
ArgArg: 3.628 ± 0.035
ArgSer: 3.901 ± 0.043
ArgThr: 2.671 ± 0.023
ArgVal: 3.11 ± 0.025
ArgTrp: 0.635 ± 0.011
ArgTyr: 1.599 ± 0.018
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 4.948 ± 0.037
SerCys: 1.799 ± 0.023
SerAsp: 3.991 ± 0.034
SerGlu: 5.05 ± 0.04
SerPhe: 3.05 ± 0.026
SerGly: 4.819 ± 0.047
SerHis: 1.978 ± 0.022
SerIle: 3.592 ± 0.029
SerLys: 4.498 ± 0.037
SerLeu: 8.174 ± 0.05
SerMet: 1.637 ± 0.02
SerAsn: 3.274 ± 0.026
SerPro: 5.033 ± 0.056
SerGln: 3.55 ± 0.034
SerArg: 4.098 ± 0.043
SerSer: 9.0 ± 0.081
SerThr: 4.458 ± 0.039
SerVal: 5.012 ± 0.036
SerTrp: 0.963 ± 0.012
SerTyr: 2.249 ± 0.021
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.739 ± 0.027
ThrCys: 1.296 ± 0.023
ThrAsp: 2.683 ± 0.028
ThrGlu: 3.685 ± 0.028
ThrPhe: 2.327 ± 0.021
ThrGly: 3.351 ± 0.032
ThrHis: 1.174 ± 0.019
ThrIle: 2.569 ± 0.027
ThrLys: 2.888 ± 0.024
ThrLeu: 5.061 ± 0.033
ThrMet: 1.329 ± 0.014
ThrAsn: 1.959 ± 0.02
ThrPro: 3.014 ± 0.03
ThrGln: 2.086 ± 0.023
ThrArg: 2.331 ± 0.022
ThrSer: 4.507 ± 0.035
ThrThr: 2.92 ± 0.032
ThrVal: 4.074 ± 0.034
ThrTrp: 0.654 ± 0.012
ThrTyr: 1.548 ± 0.017
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.015 ± 0.026
ValCys: 1.528 ± 0.027
ValAsp: 3.034 ± 0.027
ValGlu: 4.211 ± 0.03
ValPhe: 2.649 ± 0.025
ValGly: 3.251 ± 0.028
ValHis: 1.566 ± 0.017
ValIle: 3.265 ± 0.03
ValLys: 4.085 ± 0.027
ValLeu: 6.249 ± 0.039
ValMet: 1.384 ± 0.017
ValAsn: 2.608 ± 0.024
ValPro: 3.414 ± 0.027
ValGln: 2.811 ± 0.028
ValArg: 2.964 ± 0.027
ValSer: 4.908 ± 0.03
ValThr: 3.756 ± 0.037
ValVal: 4.703 ± 0.035
ValTrp: 0.719 ± 0.014
ValTyr: 1.844 ± 0.022
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.667 ± 0.013
TrpCys: 0.242 ± 0.007
TrpAsp: 0.676 ± 0.014
TrpGlu: 0.774 ± 0.013
TrpPhe: 0.462 ± 0.01
TrpGly: 0.665 ± 0.014
TrpHis: 0.315 ± 0.008
TrpIle: 0.616 ± 0.011
TrpLys: 0.909 ± 0.013
TrpLeu: 1.17 ± 0.017
TrpMet: 0.299 ± 0.007
TrpAsn: 0.867 ± 0.016
TrpPro: 0.426 ± 0.009
TrpGln: 0.549 ± 0.01
TrpArg: 0.636 ± 0.012
TrpSer: 0.873 ± 0.013
TrpThr: 0.641 ± 0.011
TrpVal: 0.657 ± 0.012
TrpTrp: 0.194 ± 0.007
TrpTyr: 0.365 ± 0.008
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.558 ± 0.019
TyrCys: 0.788 ± 0.012
TyrAsp: 1.474 ± 0.019
TyrGlu: 1.845 ± 0.019
TyrPhe: 1.417 ± 0.018
TyrGly: 1.784 ± 0.021
TyrHis: 0.791 ± 0.011
TyrIle: 1.639 ± 0.023
TyrLys: 1.701 ± 0.019
TyrLeu: 2.966 ± 0.029
TyrMet: 0.681 ± 0.013
TyrAsn: 1.304 ± 0.016
TyrPro: 1.345 ± 0.017
TyrGln: 1.331 ± 0.018
TyrArg: 1.726 ± 0.021
TyrSer: 2.416 ± 0.025
TyrThr: 1.601 ± 0.019
TyrVal: 1.768 ± 0.021
TyrTrp: 0.39 ± 0.011
TyrTyr: 1.101 ± 0.016
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.041 ± 0.006
Statistics based on 13489 proteins (5606078 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
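
A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure actually used for the table: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the one-letter sequence input, and the choice of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    totals = [max(len(s) - 1, 0) for s in sequences]
    indices = list(range(len(sequences)))
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(indices, k=len(indices))
        hits = sum(per_protein[i][pair] for i in sample)
        total = sum(totals[i] for i in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    if len(estimates) < 2:
        raise ValueError("not enough usable bootstrap replicates")
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MALLS", "LLK", "AAAA", "KLLM"]
mean_ll, err_ll = bootstrap_dipeptide_error(toy, "LL", n_boot=100, seed=1)
```

Resampling whole proteins keeps the residues of each protein together, so the error reflects variation between proteins rather than between individual positions.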

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski