Amino acid dipeptide frequency for Dendroctonus ponderosae (Mountain pine beetle)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
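As a rough illustration of what such a calculation involves, the sketch below counts adjacent residue pairs in a few made-up sequences and reports each pair's share in per mille. The function name, the toy sequences (in one-letter code rather than the three-letter code used in the table), and the choice to count overlapping pairs within each protein, never across protein boundaries, are assumptions for illustration. It is not a description of how Proteome-pI computes its table.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Pair each residue with its successor; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome.
toy_proteome = ["MALLK", "MKLLA"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

Because every count is divided by the total number of pairs, the frequencies of all observed dipeptides add up to 1000‰.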

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
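The arithmetic in the two paragraphs above can be sketched as follows: the uniform expectation for 20 standard amino acids, the conversion from per mille to a fraction, and a simple over/under comparison. The helper names are hypothetical. Comparing each value directly against 2.5‰ and ignoring the ± error is an assumption about how the colouring might be decided, not something the page states.

```python
N_STANDARD = 20

# Uniform expectation: 1000 / 400 = 2.5 per mille, i.e. 0.25%.
expected_per_mille = 1000 / N_STANDARD**2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table below.
for name, value in [("LeuLeu", 9.188), ("TrpTrp", 0.178), ("AlaPhe", 2.504)]:
    print(name, per_mille_to_fraction(value), classify(value))
```

With this rule LeuLeu counts as overrepresented and TrpTrp as underrepresented, while AlaPhe sits almost exactly at the uniform expectation.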

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.415 ± 0.047
AlaCys: 1.203 ± 0.039
AlaAsp: 3.342 ± 0.028
AlaGlu: 4.162 ± 0.038
AlaPhe: 2.504 ± 0.026
AlaGly: 3.69 ± 0.035
AlaHis: 1.483 ± 0.019
AlaIle: 3.718 ± 0.031
AlaLys: 4.112 ± 0.037
AlaLeu: 6.341 ± 0.044
AlaMet: 1.456 ± 0.021
AlaAsn: 2.9 ± 0.026
AlaPro: 3.125 ± 0.037
AlaGln: 2.798 ± 0.03
AlaArg: 2.991 ± 0.027
AlaSer: 4.866 ± 0.037
AlaThr: 3.627 ± 0.03
AlaVal: 4.324 ± 0.035
AlaTrp: 0.635 ± 0.012
AlaTyr: 1.784 ± 0.023
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.154 ± 0.025
CysCys: 0.485 ± 0.013
CysAsp: 1.159 ± 0.021
CysGlu: 1.19 ± 0.019
CysPhe: 0.82 ± 0.014
CysGly: 1.366 ± 0.052
CysHis: 0.499 ± 0.011
CysIle: 1.139 ± 0.025
CysLys: 1.231 ± 0.024
CysLeu: 1.944 ± 0.039
CysMet: 0.427 ± 0.01
CysAsn: 0.962 ± 0.02
CysPro: 1.047 ± 0.043
CysGln: 0.844 ± 0.026
CysArg: 1.038 ± 0.054
CysSer: 1.646 ± 0.057
CysThr: 1.03 ± 0.02
CysVal: 1.212 ± 0.042
CysTrp: 0.226 ± 0.007
CysTyr: 0.624 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.124 ± 0.026
AspCys: 1.051 ± 0.021
AspAsp: 3.463 ± 0.041
AspGlu: 4.137 ± 0.038
AspPhe: 2.487 ± 0.027
AspGly: 3.108 ± 0.032
AspHis: 1.171 ± 0.015
AspIle: 3.537 ± 0.034
AspLys: 3.277 ± 0.034
AspLeu: 5.273 ± 0.041
AspMet: 1.201 ± 0.014
AspAsn: 2.506 ± 0.025
AspPro: 2.592 ± 0.054
AspGln: 2.013 ± 0.022
AspArg: 2.541 ± 0.026
AspSer: 4.234 ± 0.037
AspThr: 2.54 ± 0.025
AspVal: 3.52 ± 0.03
AspTrp: 0.684 ± 0.013
AspTyr: 1.896 ± 0.02
AspXaa: 0.001 ± 0.0
Glu
GluAla: 4.417 ± 0.046
GluCys: 1.243 ± 0.055
GluAsp: 4.19 ± 0.035
GluGlu: 5.986 ± 0.073
GluPhe: 2.396 ± 0.02
GluGly: 3.15 ± 0.034
GluHis: 1.534 ± 0.017
GluIle: 3.923 ± 0.036
GluLys: 5.163 ± 0.062
GluLeu: 6.097 ± 0.043
GluMet: 1.554 ± 0.02
GluAsn: 3.753 ± 0.033
GluPro: 2.709 ± 0.035
GluGln: 2.803 ± 0.034
GluArg: 3.466 ± 0.039
GluSer: 4.45 ± 0.04
GluThr: 3.521 ± 0.032
GluVal: 4.047 ± 0.035
GluTrp: 0.663 ± 0.014
GluTyr: 1.991 ± 0.02
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.478 ± 0.024
PheCys: 0.889 ± 0.014
PheAsp: 2.269 ± 0.025
PheGlu: 2.487 ± 0.022
PhePhe: 1.729 ± 0.023
PheGly: 2.577 ± 0.029
PheHis: 1.034 ± 0.016
PheIle: 2.275 ± 0.025
PheLys: 2.429 ± 0.023
PheLeu: 3.949 ± 0.031
PheMet: 0.92 ± 0.015
PheAsn: 2.017 ± 0.02
PhePro: 1.774 ± 0.02
PheGln: 1.703 ± 0.017
PheArg: 1.96 ± 0.018
PheSer: 3.231 ± 0.024
PheThr: 2.24 ± 0.021
PheVal: 2.553 ± 0.023
PheTrp: 0.489 ± 0.012
PheTyr: 1.479 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.577 ± 0.041
GlyCys: 1.039 ± 0.018
GlyAsp: 2.99 ± 0.034
GlyGlu: 3.209 ± 0.033
GlyPhe: 2.52 ± 0.027
GlyGly: 4.057 ± 0.065
GlyHis: 1.461 ± 0.024
GlyIle: 3.223 ± 0.032
GlyLys: 3.737 ± 0.029
GlyLeu: 4.986 ± 0.039
GlyMet: 1.193 ± 0.016
GlyAsn: 2.683 ± 0.032
GlyPro: 2.46 ± 0.042
GlyGln: 2.273 ± 0.025
GlyArg: 2.836 ± 0.029
GlySer: 4.56 ± 0.041
GlyThr: 3.067 ± 0.031
GlyVal: 3.468 ± 0.03
GlyTrp: 0.713 ± 0.012
GlyTyr: 2.033 ± 0.029
GlyXaa: 0.003 ± 0.001
His
HisAla: 1.32 ± 0.016
HisCys: 0.574 ± 0.011
HisAsp: 1.051 ± 0.017
HisGlu: 1.318 ± 0.019
HisPhe: 1.092 ± 0.016
HisGly: 1.332 ± 0.021
HisHis: 0.854 ± 0.026
HisIle: 1.505 ± 0.017
HisLys: 1.425 ± 0.02
HisLeu: 2.549 ± 0.025
HisMet: 0.62 ± 0.012
HisAsn: 1.118 ± 0.017
HisPro: 1.356 ± 0.018
HisGln: 1.148 ± 0.019
HisArg: 1.299 ± 0.016
HisSer: 1.952 ± 0.019
HisThr: 1.263 ± 0.016
HisVal: 1.444 ± 0.017
HisTrp: 0.302 ± 0.008
HisTyr: 0.886 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.627 ± 0.027
IleCys: 1.318 ± 0.024
IleAsp: 3.147 ± 0.03
IleGlu: 3.742 ± 0.041
IlePhe: 2.457 ± 0.03
IleGly: 3.095 ± 0.033
IleHis: 1.36 ± 0.017
IleIle: 3.334 ± 0.033
IleLys: 3.719 ± 0.037
IleLeu: 5.445 ± 0.038
IleMet: 1.229 ± 0.015
IleAsn: 2.884 ± 0.03
IlePro: 2.872 ± 0.024
IleGln: 2.444 ± 0.024
IleArg: 2.808 ± 0.026
IleSer: 4.58 ± 0.035
IleThr: 3.17 ± 0.033
IleVal: 3.644 ± 0.028
IleTrp: 0.623 ± 0.012
IleTyr: 1.86 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.993 ± 0.039
LysCys: 1.347 ± 0.028
LysAsp: 3.487 ± 0.036
LysGlu: 4.833 ± 0.057
LysPhe: 2.385 ± 0.023
LysGly: 3.152 ± 0.032
LysHis: 1.661 ± 0.021
LysIle: 3.818 ± 0.032
LysLys: 5.304 ± 0.063
LysLeu: 6.254 ± 0.046
LysMet: 1.558 ± 0.02
LysAsn: 3.3 ± 0.028
LysPro: 3.263 ± 0.039
LysGln: 2.875 ± 0.03
LysArg: 3.684 ± 0.033
LysSer: 4.844 ± 0.05
LysThr: 3.689 ± 0.033
LysVal: 4.006 ± 0.035
LysTrp: 0.738 ± 0.013
LysTyr: 2.225 ± 0.023
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 6.311 ± 0.036
LeuCys: 1.792 ± 0.019
LeuAsp: 5.189 ± 0.035
LeuGlu: 6.471 ± 0.054
LeuPhe: 3.568 ± 0.03
LeuGly: 4.97 ± 0.04
LeuHis: 2.393 ± 0.026
LeuIle: 4.996 ± 0.039
LeuLys: 6.769 ± 0.049
LeuLeu: 9.188 ± 0.07
LeuMet: 2.085 ± 0.022
LeuAsn: 4.677 ± 0.028
LeuPro: 4.744 ± 0.04
LeuGln: 4.698 ± 0.041
LeuArg: 4.941 ± 0.038
LeuSer: 7.101 ± 0.046
LeuThr: 5.13 ± 0.04
LeuVal: 5.605 ± 0.037
LeuTrp: 0.977 ± 0.014
LeuTyr: 2.789 ± 0.029
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.642 ± 0.02
MetCys: 0.446 ± 0.009
MetAsp: 1.308 ± 0.019
MetGlu: 1.637 ± 0.019
MetPhe: 0.967 ± 0.017
MetGly: 1.31 ± 0.019
MetHis: 0.505 ± 0.01
MetIle: 1.081 ± 0.017
MetLys: 1.47 ± 0.018
MetLeu: 2.039 ± 0.021
MetMet: 0.58 ± 0.011
MetAsn: 1.044 ± 0.015
MetPro: 1.041 ± 0.016
MetGln: 0.986 ± 0.013
MetArg: 1.101 ± 0.014
MetSer: 1.762 ± 0.019
MetThr: 1.102 ± 0.016
MetVal: 1.345 ± 0.016
MetTrp: 0.258 ± 0.007
MetTyr: 0.694 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.974 ± 0.04
AsnCys: 1.045 ± 0.018
AsnAsp: 2.381 ± 0.025
AsnGlu: 3.079 ± 0.029
AsnPhe: 2.056 ± 0.024
AsnGly: 3.042 ± 0.037
AsnHis: 1.174 ± 0.026
AsnIle: 3.265 ± 0.03
AsnLys: 2.923 ± 0.026
AsnLeu: 4.667 ± 0.037
AsnMet: 1.207 ± 0.016
AsnAsn: 2.652 ± 0.026
AsnPro: 2.518 ± 0.046
AsnGln: 2.094 ± 0.031
AsnArg: 2.378 ± 0.022
AsnSer: 3.981 ± 0.035
AsnThr: 2.511 ± 0.025
AsnVal: 3.237 ± 0.03
AsnTrp: 0.567 ± 0.011
AsnTyr: 1.704 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.232 ± 0.035
ProCys: 0.845 ± 0.078
ProAsp: 2.744 ± 0.027
ProGlu: 3.61 ± 0.054
ProPhe: 1.836 ± 0.026
ProGly: 2.829 ± 0.057
ProHis: 1.207 ± 0.018
ProIle: 2.725 ± 0.029
ProLys: 3.23 ± 0.032
ProLeu: 4.219 ± 0.034
ProMet: 0.985 ± 0.018
ProAsn: 2.492 ± 0.033
ProPro: 3.844 ± 0.07
ProGln: 2.272 ± 0.029
ProArg: 2.21 ± 0.023
ProSer: 4.052 ± 0.051
ProThr: 2.969 ± 0.031
ProVal: 3.291 ± 0.036
ProTrp: 0.498 ± 0.009
ProTyr: 1.558 ± 0.022
ProXaa: 0.003 ± 0.001
Gln
GlnAla: 2.82 ± 0.028
GlnCys: 0.889 ± 0.03
GlnAsp: 2.016 ± 0.019
GlnGlu: 2.979 ± 0.027
GlnPhe: 1.679 ± 0.021
GlnGly: 2.076 ± 0.028
GlnHis: 1.135 ± 0.019
GlnIle: 2.462 ± 0.021
GlnLys: 2.859 ± 0.028
GlnLeu: 4.329 ± 0.039
GlnMet: 1.057 ± 0.017
GlnAsn: 2.269 ± 0.025
GlnPro: 2.159 ± 0.029
GlnGln: 2.747 ± 0.062
GlnArg: 2.293 ± 0.024
GlnSer: 3.072 ± 0.031
GlnThr: 2.336 ± 0.025
GlnVal: 2.663 ± 0.023
GlnTrp: 0.517 ± 0.011
GlnTyr: 1.401 ± 0.021
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.016 ± 0.024
ArgCys: 0.944 ± 0.022
ArgAsp: 2.616 ± 0.027
ArgGlu: 3.136 ± 0.029
ArgPhe: 2.013 ± 0.022
ArgGly: 2.495 ± 0.03
ArgHis: 1.36 ± 0.017
ArgIle: 2.808 ± 0.025
ArgLys: 3.886 ± 0.035
ArgLeu: 4.653 ± 0.034
ArgMet: 1.134 ± 0.016
ArgAsn: 2.638 ± 0.023
ArgPro: 2.547 ± 0.05
ArgGln: 2.264 ± 0.024
ArgArg: 3.395 ± 0.042
ArgSer: 3.732 ± 0.036
ArgThr: 2.639 ± 0.025
ArgVal: 2.809 ± 0.025
ArgTrp: 0.533 ± 0.01
ArgTyr: 1.568 ± 0.016
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 4.861 ± 0.036
SerCys: 1.507 ± 0.053
SerAsp: 4.396 ± 0.038
SerGlu: 4.937 ± 0.042
SerPhe: 2.967 ± 0.027
SerGly: 4.72 ± 0.04
SerHis: 1.738 ± 0.02
SerIle: 4.22 ± 0.031
SerLys: 4.924 ± 0.042
SerLeu: 7.004 ± 0.049
SerMet: 1.635 ± 0.019
SerAsn: 3.852 ± 0.032
SerPro: 4.268 ± 0.057
SerGln: 3.18 ± 0.031
SerArg: 3.792 ± 0.038
SerSer: 7.704 ± 0.08
SerThr: 4.617 ± 0.041
SerVal: 4.611 ± 0.033
SerTrp: 0.822 ± 0.014
SerTyr: 2.318 ± 0.024
SerXaa: 0.002 ± 0.001
Thr
ThrAla: 3.597 ± 0.034
ThrCys: 1.145 ± 0.029
ThrAsp: 2.814 ± 0.025
ThrGlu: 3.474 ± 0.044
ThrPhe: 2.319 ± 0.022
ThrGly: 3.25 ± 0.031
ThrHis: 1.229 ± 0.017
ThrIle: 3.283 ± 0.029
ThrLys: 3.263 ± 0.03
ThrLeu: 5.178 ± 0.041
ThrMet: 1.113 ± 0.014
ThrAsn: 2.582 ± 0.023
ThrPro: 3.128 ± 0.035
ThrGln: 2.083 ± 0.025
ThrArg: 2.327 ± 0.023
ThrSer: 4.455 ± 0.041
ThrThr: 3.465 ± 0.055
ThrVal: 3.789 ± 0.031
ThrTrp: 0.602 ± 0.012
ThrTyr: 1.686 ± 0.019
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.394 ± 0.035
ValCys: 1.344 ± 0.029
ValAsp: 3.466 ± 0.027
ValGlu: 4.116 ± 0.041
ValPhe: 2.653 ± 0.026
ValGly: 3.255 ± 0.03
ValHis: 1.502 ± 0.02
ValIle: 3.511 ± 0.03
ValLys: 3.977 ± 0.034
ValLeu: 6.066 ± 0.043
ValMet: 1.303 ± 0.016
ValAsn: 2.939 ± 0.031
ValPro: 3.291 ± 0.027
ValGln: 2.644 ± 0.026
ValArg: 2.893 ± 0.028
ValSer: 4.597 ± 0.035
ValThr: 3.552 ± 0.028
ValVal: 4.191 ± 0.036
ValTrp: 0.707 ± 0.013
ValTyr: 1.926 ± 0.021
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.652 ± 0.012
TrpCys: 0.216 ± 0.006
TrpAsp: 0.635 ± 0.012
TrpGlu: 0.633 ± 0.013
TrpPhe: 0.497 ± 0.011
TrpGly: 0.645 ± 0.012
TrpHis: 0.265 ± 0.007
TrpIle: 0.674 ± 0.013
TrpLys: 0.793 ± 0.017
TrpLeu: 1.104 ± 0.017
TrpMet: 0.306 ± 0.007
TrpAsn: 0.599 ± 0.013
TrpPro: 0.45 ± 0.01
TrpGln: 0.466 ± 0.01
TrpArg: 0.619 ± 0.01
TrpSer: 0.798 ± 0.013
TrpThr: 0.64 ± 0.014
TrpVal: 0.61 ± 0.013
TrpTrp: 0.178 ± 0.007
TrpTyr: 0.361 ± 0.009
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.869 ± 0.019
TyrCys: 0.721 ± 0.013
TyrAsp: 1.692 ± 0.018
TyrGlu: 1.921 ± 0.019
TyrPhe: 1.538 ± 0.019
TyrGly: 1.966 ± 0.021
TyrHis: 0.843 ± 0.016
TyrIle: 1.863 ± 0.021
TyrLys: 1.923 ± 0.022
TyrLeu: 3.109 ± 0.032
TyrMet: 0.756 ± 0.013
TyrAsn: 1.589 ± 0.019
TyrPro: 1.46 ± 0.02
TyrGln: 1.423 ± 0.017
TyrArg: 1.616 ± 0.018
TyrSer: 2.453 ± 0.025
TyrThr: 1.671 ± 0.019
TyrVal: 1.959 ± 0.021
TyrTrp: 0.407 ± 0.008
TyrTyr: 1.252 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.003 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.003 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.002 ± 0.001
XaaSer: 0.002 ± 0.001
XaaThr: 0.001 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.17 ± 0.04
Statistics based on 12251 proteins (5412742 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
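One plausible reading of the note above is sketched here: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all assumptions. The page does not say which statistic the ± values represent.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins, not individual residues or pairs.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy_proteome = ["MALLK", "MKLLA", "MLLLL", "MAKAK"]
print(pair_per_mille(toy_proteome, "LL"))
print(bootstrap_error(toy_proteome, "LL"))
```

Resampling at the protein level keeps each protein's internal composition intact, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.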

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski