Amino acid dipepetide frequency for Lithobates catesbeianus (American bullfrog) (Rana catesbeiana)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.616AlaAla: 4.616 ± 0.035
1.099AlaCys: 1.099 ± 0.015
2.639AlaAsp: 2.639 ± 0.023
4.245AlaGlu: 4.245 ± 0.033
1.992AlaPhe: 1.992 ± 0.02
3.63AlaGly: 3.63 ± 0.039
1.427AlaHis: 1.427 ± 0.016
2.703AlaIle: 2.703 ± 0.023
3.105AlaLys: 3.105 ± 0.021
5.046AlaLeu: 5.046 ± 0.038
1.274AlaMet: 1.274 ± 0.015
1.887AlaAsn: 1.887 ± 0.017
2.83AlaPro: 2.83 ± 0.029
2.76AlaGln: 2.76 ± 0.024
2.351AlaArg: 2.351 ± 0.022
4.69AlaSer: 4.69 ± 0.041
3.089AlaThr: 3.089 ± 0.029
3.773AlaVal: 3.773 ± 0.033
0.516AlaTrp: 0.516 ± 0.009
1.312AlaTyr: 1.312 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
1.388CysAla: 1.388 ± 0.019
0.594CysCys: 0.594 ± 0.012
1.25CysAsp: 1.25 ± 0.027
1.146CysGlu: 1.146 ± 0.018
1.255CysPhe: 1.255 ± 0.018
2.027CysGly: 2.027 ± 0.033
0.721CysHis: 0.721 ± 0.014
1.051CysIle: 1.051 ± 0.016
1.176CysLys: 1.176 ± 0.018
2.017CysLeu: 2.017 ± 0.02
0.486CysMet: 0.486 ± 0.009
1.194CysAsn: 1.194 ± 0.017
2.042CysPro: 2.042 ± 0.03
0.948CysGln: 0.948 ± 0.014
1.255CysArg: 1.255 ± 0.018
2.455CysSer: 2.455 ± 0.027
1.36CysThr: 1.36 ± 0.018
1.425CysVal: 1.425 ± 0.021
0.278CysTrp: 0.278 ± 0.006
0.644CysTyr: 0.644 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
2.267AspAla: 2.267 ± 0.02
1.187AspCys: 1.187 ± 0.02
2.662AspAsp: 2.662 ± 0.028
3.265AspGlu: 3.265 ± 0.027
1.897AspPhe: 1.897 ± 0.019
3.275AspGly: 3.275 ± 0.03
1.291AspHis: 1.291 ± 0.014
3.207AspIle: 3.207 ± 0.027
2.471AspLys: 2.471 ± 0.023
4.912AspLeu: 4.912 ± 0.032
1.197AspMet: 1.197 ± 0.017
1.991AspAsn: 1.991 ± 0.025
3.145AspPro: 3.145 ± 0.043
2.03AspGln: 2.03 ± 0.017
2.653AspArg: 2.653 ± 0.03
4.217AspSer: 4.217 ± 0.034
2.658AspThr: 2.658 ± 0.021
4.584AspVal: 4.584 ± 0.038
0.518AspTrp: 0.518 ± 0.01
1.405AspTyr: 1.405 ± 0.016
0.0AspXaa: 0.0 ± 0.0
Glu
3.929GluAla: 3.929 ± 0.033
1.865GluCys: 1.865 ± 0.031
5.133GluAsp: 5.133 ± 0.037
9.823GluGlu: 9.823 ± 0.079
1.836GluPhe: 1.836 ± 0.019
4.254GluGly: 4.254 ± 0.033
1.472GluHis: 1.472 ± 0.017
4.116GluIle: 4.116 ± 0.032
5.225GluLys: 5.225 ± 0.046
5.085GluLeu: 5.085 ± 0.039
1.819GluMet: 1.819 ± 0.019
3.094GluAsn: 3.094 ± 0.026
2.72GluPro: 2.72 ± 0.023
2.925GluGln: 2.925 ± 0.03
3.942GluArg: 3.942 ± 0.04
4.872GluSer: 4.872 ± 0.04
3.339GluThr: 3.339 ± 0.031
4.291GluVal: 4.291 ± 0.033
0.639GluTrp: 0.639 ± 0.01
1.722GluTyr: 1.722 ± 0.024
0.001GluXaa: 0.001 ± 0.0
Phe
1.639PheAla: 1.639 ± 0.017
1.036PheCys: 1.036 ± 0.013
1.39PheAsp: 1.39 ± 0.015
1.575PheGlu: 1.575 ± 0.016
1.886PhePhe: 1.886 ± 0.023
1.961PheGly: 1.961 ± 0.02
1.171PheHis: 1.171 ± 0.013
2.176PheIle: 2.176 ± 0.021
1.783PheLys: 1.783 ± 0.018
3.835PheLeu: 3.835 ± 0.029
0.825PheMet: 0.825 ± 0.011
1.341PheAsn: 1.341 ± 0.015
1.877PhePro: 1.877 ± 0.02
1.554PheGln: 1.554 ± 0.019
1.642PheArg: 1.642 ± 0.016
3.612PheSer: 3.612 ± 0.029
2.693PheThr: 2.693 ± 0.024
2.029PheVal: 2.029 ± 0.02
0.509PheTrp: 0.509 ± 0.01
1.23PheTyr: 1.23 ± 0.015
0.002PheXaa: 0.002 ± 0.0
Gly
3.217GlyAla: 3.217 ± 0.034
1.372GlyCys: 1.372 ± 0.017
3.466GlyAsp: 3.466 ± 0.026
5.133GlyGlu: 5.133 ± 0.05
2.006GlyPhe: 2.006 ± 0.022
3.975GlyGly: 3.975 ± 0.044
2.043GlyHis: 2.043 ± 0.023
2.751GlyIle: 2.751 ± 0.025
3.836GlyLys: 3.836 ± 0.033
4.269GlyLeu: 4.269 ± 0.031
1.271GlyMet: 1.271 ± 0.015
2.567GlyAsn: 2.567 ± 0.023
3.071GlyPro: 3.071 ± 0.067
2.398GlyGln: 2.398 ± 0.023
3.341GlyArg: 3.341 ± 0.028
5.116GlySer: 5.116 ± 0.039
3.648GlyThr: 3.648 ± 0.039
3.921GlyVal: 3.921 ± 0.086
0.621GlyTrp: 0.621 ± 0.011
1.711GlyTyr: 1.711 ± 0.021
0.0GlyXaa: 0.0 ± 0.0
His
1.191HisAla: 1.191 ± 0.013
0.775HisCys: 0.775 ± 0.015
0.869HisAsp: 0.869 ± 0.012
1.238HisGlu: 1.238 ± 0.016
1.411HisPhe: 1.411 ± 0.013
1.324HisGly: 1.324 ± 0.016
1.453HisHis: 1.453 ± 0.022
1.578HisIle: 1.578 ± 0.016
1.57HisLys: 1.57 ± 0.018
2.842HisLeu: 2.842 ± 0.027
0.719HisMet: 0.719 ± 0.014
0.998HisAsn: 0.998 ± 0.012
1.783HisPro: 1.783 ± 0.018
2.35HisGln: 2.35 ± 0.037
1.574HisArg: 1.574 ± 0.017
2.381HisSer: 2.381 ± 0.025
2.283HisThr: 2.283 ± 0.033
1.516HisVal: 1.516 ± 0.013
0.304HisTrp: 0.304 ± 0.007
0.784HisTyr: 0.784 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
2.568IleAla: 2.568 ± 0.028
1.269IleCys: 1.269 ± 0.017
2.496IleAsp: 2.496 ± 0.02
2.649IleGlu: 2.649 ± 0.025
1.952IlePhe: 1.952 ± 0.018
2.947IleGly: 2.947 ± 0.025
1.679IleHis: 1.679 ± 0.023
2.88IleIle: 2.88 ± 0.022
3.256IleLys: 3.256 ± 0.027
4.953IleLeu: 4.953 ± 0.031
1.781IleMet: 1.781 ± 0.019
2.27IleAsn: 2.27 ± 0.023
3.31IlePro: 3.31 ± 0.025
2.483IleGln: 2.483 ± 0.021
2.681IleArg: 2.681 ± 0.026
4.301IleSer: 4.301 ± 0.036
3.129IleThr: 3.129 ± 0.034
2.997IleVal: 2.997 ± 0.026
0.471IleTrp: 0.471 ± 0.01
1.535IleTyr: 1.535 ± 0.016
0.001IleXaa: 0.001 ± 0.0
Lys
3.325LysAla: 3.325 ± 0.029
1.499LysCys: 1.499 ± 0.02
3.305LysAsp: 3.305 ± 0.028
5.025LysGlu: 5.025 ± 0.047
1.529LysPhe: 1.529 ± 0.016
3.048LysGly: 3.048 ± 0.027
1.535LysHis: 1.535 ± 0.019
3.123LysIle: 3.123 ± 0.025
4.761LysLys: 4.761 ± 0.043
4.486LysLeu: 4.486 ± 0.033
1.722LysMet: 1.722 ± 0.017
2.944LysAsn: 2.944 ± 0.023
2.838LysPro: 2.838 ± 0.029
2.408LysGln: 2.408 ± 0.024
3.671LysArg: 3.671 ± 0.031
4.405LysSer: 4.405 ± 0.038
3.194LysThr: 3.194 ± 0.029
3.47LysVal: 3.47 ± 0.031
0.537LysTrp: 0.537 ± 0.01
2.03LysTyr: 2.03 ± 0.019
0.001LysXaa: 0.001 ± 0.0
Leu
4.771LeuAla: 4.771 ± 0.048
2.397LeuCys: 2.397 ± 0.024
4.106LeuAsp: 4.106 ± 0.061
6.126LeuGlu: 6.126 ± 0.047
3.14LeuPhe: 3.14 ± 0.027
4.618LeuGly: 4.618 ± 0.029
3.123LeuHis: 3.123 ± 0.029
4.419LeuIle: 4.419 ± 0.033
5.51LeuLys: 5.51 ± 0.033
8.585LeuLeu: 8.585 ± 0.058
1.885LeuMet: 1.885 ± 0.02
3.42LeuAsn: 3.42 ± 0.026
4.85LeuPro: 4.85 ± 0.036
5.0LeuGln: 5.0 ± 0.036
4.975LeuArg: 4.975 ± 0.034
7.435LeuSer: 7.435 ± 0.045
5.045LeuThr: 5.045 ± 0.032
4.623LeuVal: 4.623 ± 0.033
1.067LeuTrp: 1.067 ± 0.013
2.952LeuTyr: 2.952 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
1.585MetAla: 1.585 ± 0.017
0.559MetCys: 0.559 ± 0.01
1.5MetAsp: 1.5 ± 0.02
2.28MetGlu: 2.28 ± 0.023
1.138MetPhe: 1.138 ± 0.014
1.45MetGly: 1.45 ± 0.016
0.488MetHis: 0.488 ± 0.008
0.985MetIle: 0.985 ± 0.013
1.612MetLys: 1.612 ± 0.015
1.934MetLeu: 1.934 ± 0.019
0.816MetMet: 0.816 ± 0.014
0.97MetAsn: 0.97 ± 0.014
1.042MetPro: 1.042 ± 0.014
1.115MetGln: 1.115 ± 0.014
1.177MetArg: 1.177 ± 0.013
1.91MetSer: 1.91 ± 0.022
1.321MetThr: 1.321 ± 0.016
1.702MetVal: 1.702 ± 0.02
0.277MetTrp: 0.277 ± 0.006
0.706MetTyr: 0.706 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.047AsnAla: 2.047 ± 0.022
0.931AsnCys: 0.931 ± 0.018
1.91AsnAsp: 1.91 ± 0.018
2.176AsnGlu: 2.176 ± 0.022
1.525AsnPhe: 1.525 ± 0.015
2.511AsnGly: 2.511 ± 0.029
0.969AsnHis: 0.969 ± 0.011
3.143AsnIle: 3.143 ± 0.025
2.423AsnLys: 2.423 ± 0.023
3.802AsnLeu: 3.802 ± 0.026
1.182AsnMet: 1.182 ± 0.013
1.827AsnAsn: 1.827 ± 0.018
2.618AsnPro: 2.618 ± 0.022
1.685AsnGln: 1.685 ± 0.017
2.077AsnArg: 2.077 ± 0.017
3.114AsnSer: 3.114 ± 0.03
2.399AsnThr: 2.399 ± 0.022
2.573AsnVal: 2.573 ± 0.021
0.454AsnTrp: 0.454 ± 0.009
1.193AsnTyr: 1.193 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
3.47ProAla: 3.47 ± 0.031
1.097ProCys: 1.097 ± 0.018
2.51ProAsp: 2.51 ± 0.019
4.042ProGlu: 4.042 ± 0.029
2.078ProPhe: 2.078 ± 0.025
3.893ProGly: 3.893 ± 0.086
1.938ProHis: 1.938 ± 0.022
2.436ProIle: 2.436 ± 0.024
2.557ProLys: 2.557 ± 0.028
4.932ProLeu: 4.932 ± 0.034
1.014ProMet: 1.014 ± 0.014
2.127ProAsn: 2.127 ± 0.019
6.547ProPro: 6.547 ± 0.074
3.006ProGln: 3.006 ± 0.029
2.67ProArg: 2.67 ± 0.025
6.371ProSer: 6.371 ± 0.053
3.193ProThr: 3.193 ± 0.026
3.319ProVal: 3.319 ± 0.034
0.445ProTrp: 0.445 ± 0.008
2.032ProTyr: 2.032 ± 0.025
0.001ProXaa: 0.001 ± 0.0
Gln
2.693GlnAla: 2.693 ± 0.025
1.05GlnCys: 1.05 ± 0.018
2.273GlnAsp: 2.273 ± 0.02
3.935GlnGlu: 3.935 ± 0.032
1.202GlnPhe: 1.202 ± 0.016
2.425GlnGly: 2.425 ± 0.025
1.247GlnHis: 1.247 ± 0.015
2.429GlnIle: 2.429 ± 0.021
3.177GlnLys: 3.177 ± 0.025
3.74GlnLeu: 3.74 ± 0.033
1.356GlnMet: 1.356 ± 0.02
2.036GlnAsn: 2.036 ± 0.021
2.577GlnPro: 2.577 ± 0.031
2.93GlnGln: 2.93 ± 0.039
3.162GlnArg: 3.162 ± 0.027
3.752GlnSer: 3.752 ± 0.031
2.498GlnThr: 2.498 ± 0.021
2.856GlnVal: 2.856 ± 0.026
0.554GlnTrp: 0.554 ± 0.012
1.374GlnTyr: 1.374 ± 0.017
0.001GlnXaa: 0.001 ± 0.0
Arg
2.623ArgAla: 2.623 ± 0.027
1.61ArgCys: 1.61 ± 0.023
3.28ArgAsp: 3.28 ± 0.035
3.386ArgGlu: 3.386 ± 0.03
1.763ArgPhe: 1.763 ± 0.017
3.068ArgGly: 3.068 ± 0.04
1.49ArgHis: 1.49 ± 0.018
2.58ArgIle: 2.58 ± 0.022
3.612ArgLys: 3.612 ± 0.029
4.65ArgLeu: 4.65 ± 0.033
1.3ArgMet: 1.3 ± 0.016
2.309ArgAsn: 2.309 ± 0.022
2.95ArgPro: 2.95 ± 0.028
2.202ArgGln: 2.202 ± 0.018
3.698ArgArg: 3.698 ± 0.038
4.534ArgSer: 4.534 ± 0.04
2.713ArgThr: 2.713 ± 0.018
3.019ArgVal: 3.019 ± 0.032
0.701ArgTrp: 0.701 ± 0.012
1.619ArgTyr: 1.619 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
5.48SerAla: 5.48 ± 0.034
2.297SerCys: 2.297 ± 0.031
4.267SerAsp: 4.267 ± 0.034
5.572SerGlu: 5.572 ± 0.04
3.11SerPhe: 3.11 ± 0.024
5.997SerGly: 5.997 ± 0.061
2.631SerHis: 2.631 ± 0.034
3.67SerIle: 3.67 ± 0.029
3.926SerLys: 3.926 ± 0.036
8.091SerLeu: 8.091 ± 0.045
2.003SerMet: 2.003 ± 0.019
3.519SerAsn: 3.519 ± 0.029
6.002SerPro: 6.002 ± 0.06
3.831SerGln: 3.831 ± 0.031
4.291SerArg: 4.291 ± 0.036
9.814SerSer: 9.814 ± 0.074
5.537SerThr: 5.537 ± 0.057
5.132SerVal: 5.132 ± 0.037
0.877SerTrp: 0.877 ± 0.011
2.294SerTyr: 2.294 ± 0.023
0.002SerXaa: 0.002 ± 0.0
Thr
3.303ThrAla: 3.303 ± 0.031
1.353ThrCys: 1.353 ± 0.021
2.846ThrAsp: 2.846 ± 0.023
3.663ThrGlu: 3.663 ± 0.041
2.224ThrPhe: 2.224 ± 0.024
4.184ThrGly: 4.184 ± 0.031
1.551ThrHis: 1.551 ± 0.015
2.855ThrIle: 2.855 ± 0.024
2.8ThrLys: 2.8 ± 0.031
4.958ThrLeu: 4.958 ± 0.032
1.335ThrMet: 1.335 ± 0.018
1.968ThrAsn: 1.968 ± 0.017
3.766ThrPro: 3.766 ± 0.031
3.049ThrGln: 3.049 ± 0.036
2.639ThrArg: 2.639 ± 0.021
6.121ThrSer: 6.121 ± 0.05
4.257ThrThr: 4.257 ± 0.064
3.858ThrVal: 3.858 ± 0.033
0.627ThrTrp: 0.627 ± 0.009
1.693ThrTyr: 1.693 ± 0.017
0.0ThrXaa: 0.0 ± 0.0
Val
3.125ValAla: 3.125 ± 0.031
1.576ValCys: 1.576 ± 0.016
3.041ValAsp: 3.041 ± 0.025
4.644ValGlu: 4.644 ± 0.036
2.17ValPhe: 2.17 ± 0.022
2.895ValGly: 2.895 ± 0.025
1.743ValHis: 1.743 ± 0.017
3.013ValIle: 3.013 ± 0.028
3.607ValLys: 3.607 ± 0.027
6.215ValLeu: 6.215 ± 0.105
1.575ValMet: 1.575 ± 0.017
2.274ValAsn: 2.274 ± 0.025
3.959ValPro: 3.959 ± 0.043
2.955ValGln: 2.955 ± 0.03
2.973ValArg: 2.973 ± 0.022
5.074ValSer: 5.074 ± 0.037
4.123ValThr: 4.123 ± 0.034
4.421ValVal: 4.421 ± 0.101
0.649ValTrp: 0.649 ± 0.013
1.761ValTyr: 1.761 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
0.548TrpAla: 0.548 ± 0.01
0.256TrpCys: 0.256 ± 0.007
0.558TrpAsp: 0.558 ± 0.011
0.716TrpGlu: 0.716 ± 0.01
0.354TrpPhe: 0.354 ± 0.007
0.593TrpGly: 0.593 ± 0.012
0.254TrpHis: 0.254 ± 0.007
0.525TrpIle: 0.525 ± 0.009
0.792TrpLys: 0.792 ± 0.012
0.907TrpLeu: 0.907 ± 0.012
0.351TrpMet: 0.351 ± 0.007
0.547TrpAsn: 0.547 ± 0.01
0.371TrpPro: 0.371 ± 0.008
0.436TrpGln: 0.436 ± 0.009
0.639TrpArg: 0.639 ± 0.011
0.923TrpSer: 0.923 ± 0.015
0.595TrpThr: 0.595 ± 0.011
0.606TrpVal: 0.606 ± 0.01
0.185TrpTrp: 0.185 ± 0.007
0.393TrpTyr: 0.393 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.2TyrAla: 1.2 ± 0.015
0.891TyrCys: 0.891 ± 0.021
1.332TyrAsp: 1.332 ± 0.018
1.461TyrGlu: 1.461 ± 0.017
1.298TyrPhe: 1.298 ± 0.015
1.658TyrGly: 1.658 ± 0.02
0.717TyrHis: 0.717 ± 0.011
1.967TyrIle: 1.967 ± 0.025
1.658TyrLys: 1.658 ± 0.018
2.785TyrLeu: 2.785 ± 0.022
0.656TyrMet: 0.656 ± 0.01
1.277TyrAsn: 1.277 ± 0.014
1.38TyrPro: 1.38 ± 0.015
1.198TyrGln: 1.198 ± 0.015
1.698TyrArg: 1.698 ± 0.021
3.243TyrSer: 3.243 ± 0.037
1.973TyrThr: 1.973 ± 0.02
1.677TyrVal: 1.677 ± 0.017
0.321TyrTrp: 0.321 ± 0.006
1.036TyrTyr: 1.036 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.059XaaXaa: 0.059 ± 0.013
Statistics based on 28218 proteins (7230904 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski