Amino acid dipeptide frequency for Rosistilla oblonga

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
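As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for illustration; the exact counting procedure used for this page is not stated.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: two very short "proteins" in one-letter code.
toy = ["AAG", "AG"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

By construction the frequencies of all observed dipeptides sum to 1000 per mille.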

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
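The comparison against the random expectation can be sketched as follows. The uniform baseline of 1/400 comes from the text; the function name `classify` and the simple greater-than/less-than rule are assumptions, since the exact threshold used for colouring the table is not given. The three example values are taken from the table.

```python
# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide frequency relative to the expected value (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 13.714, "CysCys": 0.275, "LeuLeu": 9.623}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```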

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.714 ± 0.148
AlaCys: 1.12 ± 0.026
AlaAsp: 6.898 ± 0.073
AlaGlu: 6.669 ± 0.074
AlaPhe: 3.335 ± 0.041
AlaGly: 8.273 ± 0.082
AlaHis: 1.526 ± 0.027
AlaIle: 6.386 ± 0.074
AlaLys: 4.023 ± 0.059
AlaLeu: 8.301 ± 0.072
AlaMet: 2.918 ± 0.046
AlaAsn: 3.177 ± 0.044
AlaPro: 4.191 ± 0.055
AlaGln: 3.594 ± 0.054
AlaArg: 5.347 ± 0.068
AlaSer: 6.842 ± 0.076
AlaThr: 6.39 ± 0.082
AlaVal: 7.067 ± 0.071
AlaTrp: 1.408 ± 0.031
AlaTyr: 2.035 ± 0.036
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.789 ± 0.021
CysCys: 0.275 ± 0.013
CysAsp: 0.814 ± 0.024
CysGlu: 0.674 ± 0.021
CysPhe: 0.47 ± 0.015
CysGly: 1.082 ± 0.031
CysHis: 0.406 ± 0.02
CysIle: 0.478 ± 0.017
CysLys: 0.356 ± 0.012
CysLeu: 1.102 ± 0.023
CysMet: 0.209 ± 0.008
CysAsn: 0.358 ± 0.013
CysPro: 0.563 ± 0.02
CysGln: 0.469 ± 0.018
CysArg: 0.774 ± 0.021
CysSer: 0.692 ± 0.02
CysThr: 0.478 ± 0.017
CysVal: 0.789 ± 0.022
CysTrp: 0.179 ± 0.01
CysTyr: 0.288 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.096 ± 0.072
AspCys: 0.615 ± 0.018
AspAsp: 4.479 ± 0.07
AspGlu: 3.906 ± 0.048
AspPhe: 2.421 ± 0.044
AspGly: 5.733 ± 0.099
AspHis: 1.393 ± 0.028
AspIle: 2.533 ± 0.048
AspLys: 1.849 ± 0.032
AspLeu: 6.259 ± 0.058
AspMet: 0.983 ± 0.025
AspAsn: 1.755 ± 0.039
AspPro: 3.987 ± 0.048
AspGln: 2.947 ± 0.041
AspArg: 4.828 ± 0.057
AspSer: 3.999 ± 0.05
AspThr: 2.704 ± 0.052
AspVal: 4.335 ± 0.061
AspTrp: 1.128 ± 0.027
AspTyr: 1.593 ± 0.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.187 ± 0.086
GluCys: 0.458 ± 0.015
GluAsp: 2.778 ± 0.041
GluGlu: 3.194 ± 0.051
GluPhe: 2.207 ± 0.032
GluGly: 3.608 ± 0.049
GluHis: 1.296 ± 0.027
GluIle: 3.602 ± 0.047
GluLys: 2.323 ± 0.042
GluLeu: 6.601 ± 0.06
GluMet: 1.475 ± 0.029
GluAsn: 1.952 ± 0.032
GluPro: 2.951 ± 0.055
GluGln: 2.908 ± 0.047
GluArg: 3.526 ± 0.053
GluSer: 4.201 ± 0.055
GluThr: 3.684 ± 0.048
GluVal: 4.04 ± 0.046
GluTrp: 0.583 ± 0.02
GluTyr: 1.304 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.918 ± 0.042
PheCys: 0.477 ± 0.016
PheAsp: 2.88 ± 0.046
PheGlu: 2.275 ± 0.038
PhePhe: 1.269 ± 0.029
PheGly: 3.207 ± 0.044
PheHis: 0.793 ± 0.021
PheIle: 1.464 ± 0.029
PheLys: 0.956 ± 0.024
PheLeu: 3.147 ± 0.038
PheMet: 0.653 ± 0.022
PheAsn: 1.114 ± 0.029
PhePro: 1.642 ± 0.034
PheGln: 1.364 ± 0.026
PheArg: 2.26 ± 0.034
PheSer: 2.332 ± 0.032
PheThr: 1.941 ± 0.04
PheVal: 2.795 ± 0.04
PheTrp: 0.524 ± 0.016
PheTyr: 0.905 ± 0.022
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.91 ± 0.073
GlyCys: 1.055 ± 0.03
GlyAsp: 5.086 ± 0.08
GlyGlu: 4.534 ± 0.05
GlyPhe: 3.104 ± 0.039
GlyGly: 6.7 ± 0.095
GlyHis: 1.612 ± 0.033
GlyIle: 4.2 ± 0.053
GlyLys: 3.408 ± 0.051
GlyLeu: 7.047 ± 0.067
GlyMet: 1.919 ± 0.036
GlyAsn: 2.794 ± 0.074
GlyPro: 3.071 ± 0.043
GlyGln: 3.105 ± 0.039
GlyArg: 4.761 ± 0.049
GlySer: 5.339 ± 0.077
GlyThr: 4.579 ± 0.089
GlyVal: 5.129 ± 0.063
GlyTrp: 1.34 ± 0.025
GlyTyr: 2.065 ± 0.037
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.018 ± 0.04
HisCys: 0.349 ± 0.014
HisAsp: 1.356 ± 0.027
HisGlu: 1.135 ± 0.024
HisPhe: 0.894 ± 0.024
HisGly: 1.734 ± 0.036
HisHis: 0.624 ± 0.021
HisIle: 0.746 ± 0.023
HisLys: 0.551 ± 0.015
HisLeu: 2.106 ± 0.034
HisMet: 0.358 ± 0.013
HisAsn: 0.667 ± 0.016
HisPro: 1.447 ± 0.032
HisGln: 0.909 ± 0.024
HisArg: 1.728 ± 0.036
HisSer: 1.217 ± 0.024
HisThr: 0.854 ± 0.018
HisVal: 1.413 ± 0.026
HisTrp: 0.425 ± 0.016
HisTyr: 0.624 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.605 ± 0.066
IleCys: 0.667 ± 0.02
IleAsp: 4.882 ± 0.058
IleGlu: 3.921 ± 0.046
IlePhe: 1.417 ± 0.031
IleGly: 4.396 ± 0.053
IleHis: 0.999 ± 0.021
IleIle: 1.796 ± 0.031
IleLys: 1.394 ± 0.033
IleLeu: 3.966 ± 0.047
IleMet: 0.726 ± 0.022
IleAsn: 1.489 ± 0.036
IlePro: 2.504 ± 0.036
IleGln: 1.983 ± 0.028
IleArg: 3.404 ± 0.041
IleSer: 3.08 ± 0.043
IleThr: 2.587 ± 0.051
IleVal: 4.299 ± 0.056
IleTrp: 0.632 ± 0.019
IleTyr: 1.238 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.062 ± 0.05
LysCys: 0.297 ± 0.012
LysAsp: 1.545 ± 0.031
LysGlu: 1.769 ± 0.037
LysPhe: 1.136 ± 0.025
LysGly: 1.911 ± 0.036
LysHis: 0.843 ± 0.022
LysIle: 1.856 ± 0.036
LysLys: 1.579 ± 0.044
LysLeu: 3.565 ± 0.053
LysMet: 0.916 ± 0.023
LysAsn: 1.088 ± 0.022
LysPro: 2.357 ± 0.047
LysGln: 1.939 ± 0.039
LysArg: 2.391 ± 0.042
LysSer: 2.371 ± 0.039
LysThr: 2.215 ± 0.034
LysVal: 2.173 ± 0.039
LysTrp: 0.448 ± 0.014
LysTyr: 0.84 ± 0.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.988 ± 0.096
LeuCys: 1.087 ± 0.025
LeuAsp: 5.946 ± 0.049
LeuGlu: 5.32 ± 0.057
LeuPhe: 3.315 ± 0.042
LeuGly: 6.89 ± 0.066
LeuHis: 2.001 ± 0.031
LeuIle: 4.869 ± 0.054
LeuLys: 3.394 ± 0.053
LeuLeu: 9.623 ± 0.097
LeuMet: 2.111 ± 0.041
LeuAsn: 2.901 ± 0.044
LeuPro: 5.57 ± 0.058
LeuGln: 4.528 ± 0.053
LeuArg: 6.371 ± 0.071
LeuSer: 6.36 ± 0.055
LeuThr: 5.466 ± 0.064
LeuVal: 6.746 ± 0.071
LeuTrp: 1.136 ± 0.027
LeuTyr: 1.976 ± 0.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.115 ± 0.032
MetCys: 0.194 ± 0.009
MetAsp: 1.001 ± 0.025
MetGlu: 1.097 ± 0.025
MetPhe: 0.863 ± 0.021
MetGly: 1.464 ± 0.034
MetHis: 0.535 ± 0.015
MetIle: 1.293 ± 0.023
MetLys: 0.938 ± 0.023
MetLeu: 2.631 ± 0.047
MetMet: 0.608 ± 0.021
MetAsn: 0.887 ± 0.021
MetPro: 1.401 ± 0.026
MetGln: 1.186 ± 0.025
MetArg: 1.473 ± 0.028
MetSer: 1.477 ± 0.026
MetThr: 1.41 ± 0.025
MetVal: 1.451 ± 0.029
MetTrp: 0.241 ± 0.01
MetTyr: 0.42 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.006 ± 0.044
AsnCys: 0.341 ± 0.014
AsnAsp: 2.217 ± 0.064
AsnGlu: 1.738 ± 0.03
AsnPhe: 1.099 ± 0.027
AsnGly: 2.537 ± 0.064
AsnHis: 0.73 ± 0.022
AsnIle: 1.357 ± 0.028
AsnLys: 0.887 ± 0.02
AsnLeu: 2.986 ± 0.044
AsnMet: 0.576 ± 0.018
AsnAsn: 1.133 ± 0.035
AsnPro: 2.008 ± 0.031
AsnGln: 1.389 ± 0.026
AsnArg: 2.302 ± 0.036
AsnSer: 1.858 ± 0.04
AsnThr: 1.51 ± 0.038
AsnVal: 2.304 ± 0.041
AsnTrp: 0.525 ± 0.017
AsnTyr: 0.865 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.871 ± 0.086
ProCys: 0.382 ± 0.013
ProAsp: 3.442 ± 0.044
ProGlu: 3.806 ± 0.054
ProPhe: 1.722 ± 0.033
ProGly: 3.87 ± 0.056
ProHis: 1.083 ± 0.029
ProIle: 2.827 ± 0.04
ProLys: 1.893 ± 0.036
ProLeu: 4.729 ± 0.048
ProMet: 1.238 ± 0.03
ProAsn: 1.791 ± 0.03
ProPro: 2.851 ± 0.051
ProGln: 2.373 ± 0.039
ProArg: 2.805 ± 0.043
ProSer: 3.416 ± 0.044
ProThr: 3.223 ± 0.046
ProVal: 3.652 ± 0.049
ProTrp: 0.688 ± 0.018
ProTyr: 1.093 ± 0.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.362 ± 0.06
GlnCys: 0.43 ± 0.016
GlnAsp: 1.731 ± 0.034
GlnGlu: 1.907 ± 0.04
GlnPhe: 1.605 ± 0.027
GlnGly: 2.384 ± 0.036
GlnHis: 1.064 ± 0.024
GlnIle: 2.666 ± 0.037
GlnLys: 1.343 ± 0.029
GlnLeu: 4.959 ± 0.059
GlnMet: 1.13 ± 0.023
GlnAsn: 1.201 ± 0.028
GlnPro: 2.803 ± 0.046
GlnGln: 2.842 ± 0.051
GlnArg: 3.705 ± 0.053
GlnSer: 2.883 ± 0.041
GlnThr: 2.775 ± 0.039
GlnVal: 2.856 ± 0.042
GlnTrp: 0.758 ± 0.02
GlnTyr: 1.019 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.809 ± 0.055
ArgCys: 0.829 ± 0.023
ArgAsp: 4.119 ± 0.048
ArgGlu: 3.834 ± 0.056
ArgPhe: 2.844 ± 0.039
ArgGly: 4.349 ± 0.048
ArgHis: 1.402 ± 0.028
ArgIle: 3.704 ± 0.042
ArgLys: 2.274 ± 0.037
ArgLeu: 6.926 ± 0.074
ArgMet: 1.821 ± 0.033
ArgAsn: 1.988 ± 0.033
ArgPro: 3.058 ± 0.042
ArgGln: 3.223 ± 0.052
ArgArg: 5.189 ± 0.074
ArgSer: 4.231 ± 0.052
ArgThr: 3.161 ± 0.043
ArgVal: 4.54 ± 0.051
ArgTrp: 1.239 ± 0.032
ArgTyr: 1.891 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.651 ± 0.059
SerCys: 0.674 ± 0.021
SerAsp: 4.444 ± 0.056
SerGlu: 3.879 ± 0.053
SerPhe: 2.279 ± 0.037
SerGly: 5.778 ± 0.069
SerHis: 1.374 ± 0.026
SerIle: 3.518 ± 0.048
SerLys: 2.11 ± 0.04
SerLeu: 6.532 ± 0.056
SerMet: 1.517 ± 0.028
SerAsn: 2.033 ± 0.037
SerPro: 3.673 ± 0.049
SerGln: 2.818 ± 0.038
SerArg: 3.916 ± 0.045
SerSer: 4.112 ± 0.058
SerThr: 3.48 ± 0.057
SerVal: 4.617 ± 0.051
SerTrp: 0.832 ± 0.021
SerTyr: 1.38 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.895 ± 0.076
ThrCys: 0.547 ± 0.017
ThrAsp: 3.439 ± 0.057
ThrGlu: 2.959 ± 0.037
ThrPhe: 2.043 ± 0.041
ThrGly: 4.554 ± 0.072
ThrHis: 1.119 ± 0.025
ThrIle: 3.356 ± 0.06
ThrLys: 1.704 ± 0.034
ThrLeu: 5.705 ± 0.081
ThrMet: 1.159 ± 0.027
ThrAsn: 1.708 ± 0.041
ThrPro: 3.403 ± 0.05
ThrGln: 2.157 ± 0.037
ThrArg: 3.035 ± 0.05
ThrSer: 3.448 ± 0.06
ThrThr: 3.426 ± 0.069
ThrVal: 4.074 ± 0.068
ThrTrp: 0.753 ± 0.021
ThrTyr: 1.294 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.069 ± 0.075
ValCys: 0.935 ± 0.026
ValAsp: 5.055 ± 0.055
ValGlu: 4.275 ± 0.048
ValPhe: 2.36 ± 0.038
ValGly: 5.438 ± 0.052
ValHis: 1.342 ± 0.025
ValIle: 3.595 ± 0.046
ValLys: 2.028 ± 0.035
ValLeu: 6.469 ± 0.073
ValMet: 1.53 ± 0.03
ValAsn: 2.012 ± 0.041
ValPro: 3.468 ± 0.043
ValGln: 2.7 ± 0.039
ValArg: 4.583 ± 0.053
ValSer: 4.378 ± 0.05
ValThr: 3.942 ± 0.073
ValVal: 5.774 ± 0.064
ValTrp: 0.958 ± 0.02
ValTyr: 1.55 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.025 ± 0.02
TrpCys: 0.186 ± 0.01
TrpAsp: 0.832 ± 0.022
TrpGlu: 0.649 ± 0.019
TrpPhe: 0.573 ± 0.02
TrpGly: 0.929 ± 0.024
TrpHis: 0.393 ± 0.015
TrpIle: 0.914 ± 0.025
TrpLys: 0.608 ± 0.019
TrpLeu: 1.627 ± 0.032
TrpMet: 0.435 ± 0.015
TrpAsn: 0.604 ± 0.017
TrpPro: 0.661 ± 0.018
TrpGln: 0.858 ± 0.025
TrpArg: 0.956 ± 0.026
TrpSer: 0.96 ± 0.021
TrpThr: 0.824 ± 0.024
TrpVal: 0.837 ± 0.019
TrpTrp: 0.255 ± 0.011
TrpTyr: 0.334 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.124 ± 0.031
TyrCys: 0.331 ± 0.012
TyrAsp: 1.597 ± 0.031
TyrGlu: 1.337 ± 0.027
TyrPhe: 0.96 ± 0.023
TyrGly: 1.92 ± 0.037
TyrHis: 0.621 ± 0.018
TyrIle: 0.827 ± 0.018
TyrLys: 0.683 ± 0.019
TyrLeu: 2.315 ± 0.038
TyrMet: 0.4 ± 0.012
TyrAsn: 0.715 ± 0.02
TyrPro: 1.145 ± 0.024
TyrGln: 1.13 ± 0.024
TyrArg: 2.078 ± 0.035
TyrSer: 1.42 ± 0.028
TyrThr: 1.129 ± 0.03
TyrVal: 1.571 ± 0.03
TyrTrp: 0.412 ± 0.014
TyrTyr: 0.721 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5436 proteins (2136412 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
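A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under stated assumptions: the function name `bootstrap_error`, the one-letter toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are not taken from the source.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so each replicate only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["AAG", "AG", "GGA", "AAAA"]
err = bootstrap_error(toy_proteome, "AA")
print(f"AA error: {err:.3f} per mille")
```

Resampling at the protein level keeps the dipeptides of one protein together, so correlations within a protein are reflected in the estimated error.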

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski