Amino acid dipeptide frequency for Ursus arctos horribilis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
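As a rough illustration of the idea above, the following Python sketch counts dipeptides in a set of sequences and compares each frequency with the uniform expectation of 2.5‰. It assumes one-letter amino acid codes and overlapping windows (residues i and i+1 form one dipeptide); the page does not state how the database counts pairs, and the function names and toy sequences are hypothetical.

```python
from collections import Counter

# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_EXPECTED = 1000.0 / 400  # 2.5


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: overlapping windows, pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_EXPECTED:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTED:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["AAAL", "LL"])
for dp, f in sorted(freqs.items()):
    print(dp, f, classify(f))
```

With real proteome data the same classification would correspond to the red (over) and blue (under) marking described above, although the actual colour thresholds used by the site are not given here.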

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
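A short sketch of that unit conversion, using two values from the table (LeuLeu at 10.822‰ and TrpTrp at 0.176‰). The helper names are hypothetical, and the comparison against 2.5‰ assumes the uniform 400-dipeptide expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value / expected


for name, value in [("LeuLeu", 10.822), ("TrpTrp", 0.176)]:
    print(name, per_mille_to_fraction(value), round(fold_vs_uniform(value), 2))
```

On these figures LeuLeu occurs roughly four times more often than the uniform expectation, while TrpTrp occurs far less often.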

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.276AlaAla: 7.276 ± 0.039
1.408AlaCys: 1.408 ± 0.01
2.971AlaAsp: 2.971 ± 0.013
4.983AlaGlu: 4.983 ± 0.023
2.595AlaPhe: 2.595 ± 0.015
4.947AlaGly: 4.947 ± 0.024
1.586AlaHis: 1.586 ± 0.01
2.64AlaIle: 2.64 ± 0.011
3.377AlaLys: 3.377 ± 0.019
7.331AlaLeu: 7.331 ± 0.032
1.442AlaMet: 1.442 ± 0.009
1.982AlaAsn: 1.982 ± 0.01
4.583AlaPro: 4.583 ± 0.026
3.405AlaGln: 3.405 ± 0.021
3.888AlaArg: 3.888 ± 0.018
6.017AlaSer: 6.017 ± 0.023
3.53AlaThr: 3.53 ± 0.017
4.786AlaVal: 4.786 ± 0.015
0.793AlaTrp: 0.793 ± 0.008
1.477AlaTyr: 1.477 ± 0.008
0.003AlaXaa: 0.003 ± 0.0
Cys
1.272CysAla: 1.272 ± 0.01
0.6CysCys: 0.6 ± 0.01
0.993CysAsp: 0.993 ± 0.009
1.253CysGlu: 1.253 ± 0.011
0.806CysPhe: 0.806 ± 0.006
1.784CysGly: 1.784 ± 0.019
0.652CysHis: 0.652 ± 0.007
0.882CysIle: 0.882 ± 0.008
1.097CysLys: 1.097 ± 0.009
2.078CysLeu: 2.078 ± 0.013
0.391CysMet: 0.391 ± 0.005
0.768CysAsn: 0.768 ± 0.008
1.361CysPro: 1.361 ± 0.013
1.059CysGln: 1.059 ± 0.01
1.299CysArg: 1.299 ± 0.01
1.979CysSer: 1.979 ± 0.011
1.059CysThr: 1.059 ± 0.009
1.275CysVal: 1.275 ± 0.01
0.277CysTrp: 0.277 ± 0.004
0.546CysTyr: 0.546 ± 0.005
0.001CysXaa: 0.001 ± 0.0
Asp
2.869AspAla: 2.869 ± 0.012
1.014AspCys: 1.014 ± 0.009
2.587AspAsp: 2.587 ± 0.016
3.41AspGlu: 3.41 ± 0.016
2.05AspPhe: 2.05 ± 0.01
3.266AspGly: 3.266 ± 0.021
1.118AspHis: 1.118 ± 0.008
2.457AspIle: 2.457 ± 0.013
2.485AspLys: 2.485 ± 0.013
4.991AspLeu: 4.991 ± 0.018
1.076AspMet: 1.076 ± 0.008
1.58AspAsn: 1.58 ± 0.011
2.958AspPro: 2.958 ± 0.015
1.851AspGln: 1.851 ± 0.01
2.494AspArg: 2.494 ± 0.013
4.243AspSer: 4.243 ± 0.02
2.449AspThr: 2.449 ± 0.014
3.026AspVal: 3.026 ± 0.013
0.607AspTrp: 0.607 ± 0.006
1.41AspTyr: 1.41 ± 0.009
0.002AspXaa: 0.002 ± 0.0
Glu
5.436GluAla: 5.436 ± 0.023
1.452GluCys: 1.452 ± 0.018
4.457GluAsp: 4.457 ± 0.018
8.165GluGlu: 8.165 ± 0.04
2.011GluPhe: 2.011 ± 0.01
4.3GluGly: 4.3 ± 0.018
1.518GluHis: 1.518 ± 0.01
3.091GluIle: 3.091 ± 0.016
5.525GluLys: 5.525 ± 0.028
6.592GluLeu: 6.592 ± 0.032
1.655GluMet: 1.655 ± 0.01
3.095GluAsn: 3.095 ± 0.018
3.447GluPro: 3.447 ± 0.022
3.276GluGln: 3.276 ± 0.019
4.241GluArg: 4.241 ± 0.025
4.515GluSer: 4.515 ± 0.019
3.4GluThr: 3.4 ± 0.016
4.152GluVal: 4.152 ± 0.023
0.679GluTrp: 0.679 ± 0.007
1.516GluTyr: 1.516 ± 0.014
0.002GluXaa: 0.002 ± 0.0
Phe
1.904PheAla: 1.904 ± 0.01
0.918PheCys: 0.918 ± 0.008
1.587PheAsp: 1.587 ± 0.01
1.942PheGlu: 1.942 ± 0.011
1.569PhePhe: 1.569 ± 0.011
2.135PheGly: 2.135 ± 0.014
1.004PheHis: 1.004 ± 0.008
1.74PheIle: 1.74 ± 0.012
1.669PheLys: 1.669 ± 0.01
3.985PheLeu: 3.985 ± 0.021
0.73PheMet: 0.73 ± 0.007
1.243PheAsn: 1.243 ± 0.009
1.993PhePro: 1.993 ± 0.012
1.759PheGln: 1.759 ± 0.01
1.966PheArg: 1.966 ± 0.015
3.37PheSer: 3.37 ± 0.017
1.937PheThr: 1.937 ± 0.011
2.047PheVal: 2.047 ± 0.011
0.47PheTrp: 0.47 ± 0.005
1.125PheTyr: 1.125 ± 0.01
0.002PheXaa: 0.002 ± 0.0
Gly
4.73GlyAla: 4.73 ± 0.025
1.272GlyCys: 1.272 ± 0.01
3.114GlyAsp: 3.114 ± 0.014
4.191GlyGlu: 4.191 ± 0.023
2.315GlyPhe: 2.315 ± 0.014
5.257GlyGly: 5.257 ± 0.033
1.703GlyHis: 1.703 ± 0.011
2.586GlyIle: 2.586 ± 0.014
3.705GlyLys: 3.705 ± 0.021
5.896GlyLeu: 5.896 ± 0.023
1.259GlyMet: 1.259 ± 0.008
2.28GlyAsn: 2.28 ± 0.014
4.563GlyPro: 4.563 ± 0.037
2.84GlyGln: 2.84 ± 0.016
3.955GlyArg: 3.955 ± 0.019
6.006GlySer: 6.006 ± 0.024
3.551GlyThr: 3.551 ± 0.016
3.54GlyVal: 3.54 ± 0.017
0.76GlyTrp: 0.76 ± 0.007
1.634GlyTyr: 1.634 ± 0.013
0.004GlyXaa: 0.004 ± 0.0
His
1.352HisAla: 1.352 ± 0.009
0.688HisCys: 0.688 ± 0.007
0.859HisAsp: 0.859 ± 0.006
1.32HisGlu: 1.32 ± 0.008
1.059HisPhe: 1.059 ± 0.008
1.552HisGly: 1.552 ± 0.009
0.886HisHis: 0.886 ± 0.009
1.192HisIle: 1.192 ± 0.008
1.259HisLys: 1.259 ± 0.007
2.922HisLeu: 2.922 ± 0.014
0.578HisMet: 0.578 ± 0.006
0.815HisAsn: 0.815 ± 0.007
1.69HisPro: 1.69 ± 0.011
1.422HisGln: 1.422 ± 0.014
1.651HisArg: 1.651 ± 0.01
2.323HisSer: 2.323 ± 0.014
1.567HisThr: 1.567 ± 0.014
1.512HisVal: 1.512 ± 0.008
0.339HisTrp: 0.339 ± 0.004
0.789HisTyr: 0.789 ± 0.006
0.001HisXaa: 0.001 ± 0.0
Ile
2.468IleAla: 2.468 ± 0.012
1.02IleCys: 1.02 ± 0.008
1.903IleAsp: 1.903 ± 0.011
2.528IleGlu: 2.528 ± 0.017
1.777IlePhe: 1.777 ± 0.012
2.068IleGly: 2.068 ± 0.01
1.324IleHis: 1.324 ± 0.011
2.209IleIle: 2.209 ± 0.014
2.49IleLys: 2.49 ± 0.016
4.366IleLeu: 4.366 ± 0.018
0.933IleMet: 0.933 ± 0.008
1.685IleAsn: 1.685 ± 0.011
2.552IlePro: 2.552 ± 0.013
2.243IleGln: 2.243 ± 0.013
2.311IleArg: 2.311 ± 0.011
3.593IleSer: 3.593 ± 0.016
2.447IleThr: 2.447 ± 0.021
2.378IleVal: 2.378 ± 0.016
0.482IleTrp: 0.482 ± 0.005
1.293IleTyr: 1.293 ± 0.007
0.002IleXaa: 0.002 ± 0.0
Lys
4.024LysAla: 4.024 ± 0.02
1.094LysCys: 1.094 ± 0.01
3.11LysAsp: 3.11 ± 0.02
5.153LysGlu: 5.153 ± 0.024
1.624LysPhe: 1.624 ± 0.01
3.135LysGly: 3.135 ± 0.018
1.38LysHis: 1.38 ± 0.008
2.64LysIle: 2.64 ± 0.017
4.589LysLys: 4.589 ± 0.027
5.137LysLeu: 5.137 ± 0.021
1.4LysMet: 1.4 ± 0.009
2.295LysAsn: 2.295 ± 0.014
3.251LysPro: 3.251 ± 0.027
2.632LysGln: 2.632 ± 0.015
3.291LysArg: 3.291 ± 0.014
3.976LysSer: 3.976 ± 0.021
3.091LysThr: 3.091 ± 0.016
3.393LysVal: 3.393 ± 0.021
0.591LysTrp: 0.591 ± 0.008
1.455LysTyr: 1.455 ± 0.012
0.002LysXaa: 0.002 ± 0.0
Leu
6.916LeuAla: 6.916 ± 0.028
2.14LeuCys: 2.14 ± 0.014
4.68LeuAsp: 4.68 ± 0.019
7.412LeuGlu: 7.412 ± 0.035
3.243LeuPhe: 3.243 ± 0.018
5.892LeuGly: 5.892 ± 0.022
2.729LeuHis: 2.729 ± 0.014
3.741LeuIle: 3.741 ± 0.017
5.761LeuLys: 5.761 ± 0.021
10.822LeuLeu: 10.822 ± 0.049
1.96LeuMet: 1.96 ± 0.011
3.418LeuAsn: 3.418 ± 0.016
6.159LeuPro: 6.159 ± 0.025
5.924LeuGln: 5.924 ± 0.031
6.146LeuArg: 6.146 ± 0.026
8.08LeuSer: 8.08 ± 0.028
5.046LeuThr: 5.046 ± 0.015
5.414LeuVal: 5.414 ± 0.021
1.11LeuTrp: 1.11 ± 0.009
2.433LeuTyr: 2.433 ± 0.012
0.004LeuXaa: 0.004 ± 0.001
Met
1.879MetAla: 1.879 ± 0.01
0.371MetCys: 0.371 ± 0.005
1.213MetAsp: 1.213 ± 0.009
1.836MetGlu: 1.836 ± 0.012
0.687MetPhe: 0.687 ± 0.006
1.252MetGly: 1.252 ± 0.01
0.45MetHis: 0.45 ± 0.005
0.784MetIle: 0.784 ± 0.006
1.391MetLys: 1.391 ± 0.009
1.914MetLeu: 1.914 ± 0.009
0.55MetMet: 0.55 ± 0.005
0.867MetAsn: 0.867 ± 0.007
1.049MetPro: 1.049 ± 0.008
0.927MetGln: 0.927 ± 0.007
1.029MetArg: 1.029 ± 0.007
1.533MetSer: 1.533 ± 0.008
1.091MetThr: 1.091 ± 0.008
1.342MetVal: 1.342 ± 0.009
0.232MetTrp: 0.232 ± 0.004
0.574MetTyr: 0.574 ± 0.006
0.001MetXaa: 0.001 ± 0.0
Asn
2.003AsnAla: 2.003 ± 0.011
0.776AsnCys: 0.776 ± 0.007
1.439AsnAsp: 1.439 ± 0.01
2.187AsnGlu: 2.187 ± 0.013
1.384AsnPhe: 1.384 ± 0.009
2.32AsnGly: 2.32 ± 0.014
0.931AsnHis: 0.931 ± 0.007
1.983AsnIle: 1.983 ± 0.012
2.142AsnLys: 2.142 ± 0.013
3.594AsnLeu: 3.594 ± 0.014
0.861AsnMet: 0.861 ± 0.006
1.421AsnAsn: 1.421 ± 0.01
2.137AsnPro: 2.137 ± 0.011
1.641AsnGln: 1.641 ± 0.012
1.784AsnArg: 1.784 ± 0.01
3.064AsnSer: 3.064 ± 0.015
1.904AsnThr: 1.904 ± 0.011
2.123AsnVal: 2.123 ± 0.012
0.428AsnTrp: 0.428 ± 0.004
1.029AsnTyr: 1.029 ± 0.007
0.001AsnXaa: 0.001 ± 0.0
Pro
5.357ProAla: 5.357 ± 0.029
1.152ProCys: 1.152 ± 0.012
2.815ProAsp: 2.815 ± 0.014
4.613ProGlu: 4.613 ± 0.021
1.941ProPhe: 1.941 ± 0.012
5.635ProGly: 5.635 ± 0.049
1.502ProHis: 1.502 ± 0.011
1.82ProIle: 1.82 ± 0.014
2.823ProLys: 2.823 ± 0.021
5.444ProLeu: 5.444 ± 0.022
1.07ProMet: 1.07 ± 0.008
1.786ProAsn: 1.786 ± 0.012
6.681ProPro: 6.681 ± 0.051
3.01ProGln: 3.01 ± 0.021
3.614ProArg: 3.614 ± 0.022
6.07ProSer: 6.07 ± 0.025
3.187ProThr: 3.187 ± 0.019
3.965ProVal: 3.965 ± 0.024
0.706ProTrp: 0.706 ± 0.006
1.525ProTyr: 1.525 ± 0.012
0.003ProXaa: 0.003 ± 0.0
Gln
3.659GlnAla: 3.659 ± 0.02
0.92GlnCys: 0.92 ± 0.01
2.375GlnAsp: 2.375 ± 0.012
4.071GlnGlu: 4.071 ± 0.023
1.312GlnPhe: 1.312 ± 0.009
2.87GlnGly: 2.87 ± 0.015
1.349GlnHis: 1.349 ± 0.01
1.959GlnIle: 1.959 ± 0.011
3.001GlnLys: 3.001 ± 0.017
4.871GlnLeu: 4.871 ± 0.024
1.118GlnMet: 1.118 ± 0.008
1.848GlnAsn: 1.848 ± 0.012
2.978GlnPro: 2.978 ± 0.023
3.257GlnGln: 3.257 ± 0.032
3.152GlnArg: 3.152 ± 0.021
3.246GlnSer: 3.246 ± 0.017
2.359GlnThr: 2.359 ± 0.012
2.84GlnVal: 2.84 ± 0.013
0.533GlnTrp: 0.533 ± 0.005
1.12GlnTyr: 1.12 ± 0.008
0.001GlnXaa: 0.001 ± 0.0
Arg
4.103ArgAla: 4.103 ± 0.019
1.204ArgCys: 1.204 ± 0.011
2.797ArgAsp: 2.797 ± 0.015
4.218ArgGlu: 4.218 ± 0.021
1.81ArgPhe: 1.81 ± 0.01
3.79ArgGly: 3.79 ± 0.026
1.582ArgHis: 1.582 ± 0.01
2.384ArgIle: 2.384 ± 0.012
3.74ArgLys: 3.74 ± 0.017
5.593ArgLeu: 5.593 ± 0.024
1.169ArgMet: 1.169 ± 0.009
2.087ArgAsn: 2.087 ± 0.01
3.517ArgPro: 3.517 ± 0.018
2.751ArgGln: 2.751 ± 0.016
4.631ArgArg: 4.631 ± 0.03
4.504ArgSer: 4.504 ± 0.029
2.931ArgThr: 2.931 ± 0.014
3.214ArgVal: 3.214 ± 0.02
0.711ArgTrp: 0.711 ± 0.007
1.434ArgTyr: 1.434 ± 0.008
0.002ArgXaa: 0.002 ± 0.0
Ser
5.579SerAla: 5.579 ± 0.018
1.817SerCys: 1.817 ± 0.013
3.912SerAsp: 3.912 ± 0.02
5.37SerGlu: 5.37 ± 0.021
2.99SerPhe: 2.99 ± 0.014
5.754SerGly: 5.754 ± 0.023
2.17SerHis: 2.17 ± 0.013
3.08SerIle: 3.08 ± 0.013
4.109SerLys: 4.109 ± 0.018
8.342SerLeu: 8.342 ± 0.028
1.573SerMet: 1.573 ± 0.009
2.622SerAsn: 2.622 ± 0.015
6.45SerPro: 6.45 ± 0.037
4.008SerGln: 4.008 ± 0.022
4.793SerArg: 4.793 ± 0.025
9.764SerSer: 9.764 ± 0.051
4.49SerThr: 4.49 ± 0.021
4.98SerVal: 4.98 ± 0.017
1.069SerTrp: 1.069 ± 0.009
2.024SerTyr: 2.024 ± 0.013
0.004SerXaa: 0.004 ± 0.0
Thr
3.778ThrAla: 3.778 ± 0.016
1.27ThrCys: 1.27 ± 0.011
2.417ThrAsp: 2.417 ± 0.013
3.583ThrGlu: 3.583 ± 0.017
2.045ThrPhe: 2.045 ± 0.011
3.515ThrGly: 3.515 ± 0.02
1.302ThrHis: 1.302 ± 0.01
2.263ThrIle: 2.263 ± 0.015
2.582ThrLys: 2.582 ± 0.019
5.219ThrLeu: 5.219 ± 0.017
1.056ThrMet: 1.056 ± 0.008
1.657ThrAsn: 1.657 ± 0.01
3.733ThrPro: 3.733 ± 0.02
2.331ThrGln: 2.331 ± 0.013
2.537ThrArg: 2.537 ± 0.012
4.673ThrSer: 4.673 ± 0.024
2.933ThrThr: 2.933 ± 0.027
3.839ThrVal: 3.839 ± 0.024
0.68ThrTrp: 0.68 ± 0.01
1.353ThrTyr: 1.353 ± 0.008
0.002ThrXaa: 0.002 ± 0.0
Val
4.28ValAla: 4.28 ± 0.016
1.429ValCys: 1.429 ± 0.01
2.853ValAsp: 2.853 ± 0.012
3.82ValGlu: 3.82 ± 0.021
2.302ValPhe: 2.302 ± 0.012
3.35ValGly: 3.35 ± 0.016
1.573ValHis: 1.573 ± 0.009
2.803ValIle: 2.803 ± 0.015
3.35ValLys: 3.35 ± 0.022
6.198ValLeu: 6.198 ± 0.024
1.295ValMet: 1.295 ± 0.009
2.192ValAsn: 2.192 ± 0.013
3.818ValPro: 3.818 ± 0.023
2.748ValGln: 2.748 ± 0.012
3.095ValArg: 3.095 ± 0.016
4.941ValSer: 4.941 ± 0.022
3.739ValThr: 3.739 ± 0.029
3.906ValVal: 3.906 ± 0.019
0.697ValTrp: 0.697 ± 0.006
1.555ValTyr: 1.555 ± 0.009
0.002ValXaa: 0.002 ± 0.0
Trp
0.801TrpAla: 0.801 ± 0.007
0.238TrpCys: 0.238 ± 0.004
0.638TrpAsp: 0.638 ± 0.006
0.802TrpGlu: 0.802 ± 0.006
0.421TrpPhe: 0.421 ± 0.005
0.722TrpGly: 0.722 ± 0.007
0.297TrpHis: 0.297 ± 0.004
0.52TrpIle: 0.52 ± 0.006
0.766TrpLys: 0.766 ± 0.006
1.202TrpLeu: 1.202 ± 0.01
0.294TrpMet: 0.294 ± 0.004
0.505TrpAsn: 0.505 ± 0.005
0.531TrpPro: 0.531 ± 0.006
0.521TrpGln: 0.521 ± 0.005
0.738TrpArg: 0.738 ± 0.007
0.858TrpSer: 0.858 ± 0.008
0.655TrpThr: 0.655 ± 0.008
0.666TrpVal: 0.666 ± 0.007
0.176TrpTrp: 0.176 ± 0.003
0.314TrpTyr: 0.314 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.334TyrAla: 1.334 ± 0.009
0.648TyrCys: 0.648 ± 0.007
1.217TyrAsp: 1.217 ± 0.01
1.663TyrGlu: 1.663 ± 0.011
1.154TyrPhe: 1.154 ± 0.008
1.612TyrGly: 1.612 ± 0.01
0.718TyrHis: 0.718 ± 0.006
1.3TyrIle: 1.3 ± 0.011
1.399TyrLys: 1.399 ± 0.011
2.529TyrLeu: 2.529 ± 0.013
0.561TyrMet: 0.561 ± 0.005
1.005TyrAsn: 1.005 ± 0.007
1.264TyrPro: 1.264 ± 0.008
1.234TyrGln: 1.234 ± 0.008
1.555TyrArg: 1.555 ± 0.01
2.14TyrSer: 2.14 ± 0.01
1.402TyrThr: 1.402 ± 0.012
1.547TyrVal: 1.547 ± 0.01
0.325TyrTrp: 0.325 ± 0.005
0.892TyrTyr: 0.892 ± 0.008
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.0
0.002XaaCys: 0.002 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.003XaaGlu: 0.003 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.003XaaGly: 0.003 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.003XaaSer: 0.003 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.008XaaXaa: 0.008 ± 0.002
Statistics based on 35419 proteins (23023345 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
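A minimal sketch of what protein-level bootstrapping could look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is an illustration under assumptions, not the database's actual code: it assumes the reported error is the standard deviation across replicates, uses overlapping dipeptide windows, and all names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Each replicate resamples proteins (not residues) with replacement,
    so the correlation of dipeptides within a protein is preserved.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
mean, err = bootstrap_error(["MKLLA", "MALLLG", "MKT"], "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling at the protein level rather than the residue level tends to give wider, more realistic error estimates when some proteins are rich in a particular dipeptide.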

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski