Amino acid dipeptide frequency for Cryobacterium arcticum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.
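As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count overlapping pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_permille(["MAAL", "AAK"])
```

In the toy input there are five pairs in total (MA, AA, AL, AA, AK), so `freqs["AA"]` is two fifths of the total expressed in per mille.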

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
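The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the rule used here (a plain comparison against the uniform expectation, ignoring the error estimate) is an assumption; the page does not state the exact threshold used for the colouring.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values taken from the table: AlaAla and CysCys.
ala_ala = classify(20.811)
cys_cys = classify(0.057)
```

With 20 amino acids the expectation is 1000/400 = 2.5‰, so AlaAla (20.811‰) falls in the overrepresented group and CysCys (0.057‰) in the underrepresented one. Passing `alphabet_size=21` gives the corresponding value for the 441-dipeptide case.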

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
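For readers who want to work with the numbers directly, the unit conversion is shown below as a minimal sketch. The function names are illustrative only.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table, 20.811 per mille: roughly 0.0208 as a fraction, about 2.08%.
ala_ala_fraction = permille_to_fraction(20.811)
ala_ala_percent = permille_to_percent(20.811)
```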

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.811 ± 0.224
AlaCys: 0.684 ± 0.028
AlaAsp: 8.258 ± 0.095
AlaGlu: 7.643 ± 0.096
AlaPhe: 3.735 ± 0.062
AlaGly: 12.953 ± 0.12
AlaHis: 2.367 ± 0.046
AlaIle: 5.666 ± 0.071
AlaLys: 2.631 ± 0.052
AlaLeu: 14.185 ± 0.132
AlaMet: 2.526 ± 0.05
AlaAsn: 2.505 ± 0.049
AlaPro: 6.551 ± 0.09
AlaGln: 3.942 ± 0.059
AlaArg: 8.231 ± 0.107
AlaSer: 7.153 ± 0.085
AlaThr: 8.156 ± 0.098
AlaVal: 11.404 ± 0.125
AlaTrp: 1.842 ± 0.042
AlaTyr: 2.262 ± 0.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.677 ± 0.025
CysCys: 0.057 ± 0.007
CysAsp: 0.298 ± 0.016
CysGlu: 0.222 ± 0.013
CysPhe: 0.17 ± 0.012
CysGly: 0.557 ± 0.023
CysHis: 0.112 ± 0.01
CysIle: 0.18 ± 0.012
CysLys: 0.063 ± 0.007
CysLeu: 0.48 ± 0.019
CysMet: 0.083 ± 0.008
CysAsn: 0.115 ± 0.009
CysPro: 0.281 ± 0.018
CysGln: 0.114 ± 0.01
CysArg: 0.301 ± 0.016
CysSer: 0.365 ± 0.016
CysThr: 0.346 ± 0.018
CysVal: 0.462 ± 0.021
CysTrp: 0.078 ± 0.008
CysTyr: 0.117 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.24 ± 0.109
AspCys: 0.28 ± 0.014
AspAsp: 3.724 ± 0.068
AspGlu: 3.627 ± 0.063
AspPhe: 1.868 ± 0.036
AspGly: 5.515 ± 0.082
AspHis: 1.126 ± 0.033
AspIle: 2.447 ± 0.045
AspLys: 1.096 ± 0.03
AspLeu: 6.417 ± 0.074
AspMet: 0.777 ± 0.025
AspAsn: 1.158 ± 0.031
AspPro: 3.953 ± 0.065
AspGln: 1.784 ± 0.036
AspArg: 4.085 ± 0.077
AspSer: 3.12 ± 0.057
AspThr: 3.509 ± 0.06
AspVal: 4.777 ± 0.071
AspTrp: 0.955 ± 0.024
AspTyr: 1.429 ± 0.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.251 ± 0.081
GluCys: 0.206 ± 0.014
GluAsp: 2.226 ± 0.042
GluGlu: 2.349 ± 0.055
GluPhe: 1.696 ± 0.034
GluGly: 3.197 ± 0.055
GluHis: 1.371 ± 0.036
GluIle: 2.558 ± 0.044
GluLys: 1.327 ± 0.04
GluLeu: 6.146 ± 0.081
GluMet: 0.924 ± 0.027
GluAsn: 1.299 ± 0.036
GluPro: 2.884 ± 0.058
GluGln: 1.917 ± 0.041
GluArg: 4.195 ± 0.066
GluSer: 2.977 ± 0.057
GluThr: 3.016 ± 0.046
GluVal: 4.04 ± 0.064
GluTrp: 0.787 ± 0.029
GluTyr: 1.083 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.171 ± 0.064
PheCys: 0.161 ± 0.011
PheAsp: 2.311 ± 0.046
PheGlu: 1.615 ± 0.039
PhePhe: 1.102 ± 0.036
PheGly: 3.501 ± 0.07
PheHis: 0.54 ± 0.022
PheIle: 1.338 ± 0.037
PheLys: 0.5 ± 0.021
PheLeu: 3.009 ± 0.056
PheMet: 0.48 ± 0.021
PheAsn: 0.748 ± 0.024
PhePro: 1.443 ± 0.035
PheGln: 0.786 ± 0.024
PheArg: 1.711 ± 0.04
PheSer: 2.0 ± 0.039
PheThr: 2.304 ± 0.052
PheVal: 2.897 ± 0.054
PheTrp: 0.505 ± 0.022
PheTyr: 0.712 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.446 ± 0.125
GlyCys: 0.614 ± 0.024
GlyAsp: 4.796 ± 0.069
GlyGlu: 4.285 ± 0.066
GlyPhe: 3.273 ± 0.057
GlyGly: 7.183 ± 0.093
GlyHis: 1.823 ± 0.044
GlyIle: 4.838 ± 0.067
GlyLys: 2.053 ± 0.049
GlyLeu: 9.532 ± 0.111
GlyMet: 1.801 ± 0.038
GlyAsn: 1.898 ± 0.051
GlyPro: 3.961 ± 0.067
GlyGln: 2.719 ± 0.049
GlyArg: 5.966 ± 0.081
GlySer: 5.855 ± 0.067
GlyThr: 6.298 ± 0.091
GlyVal: 7.562 ± 0.07
GlyTrp: 1.612 ± 0.039
GlyTyr: 2.339 ± 0.04
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.154 ± 0.042
HisCys: 0.134 ± 0.01
HisAsp: 1.27 ± 0.035
HisGlu: 0.989 ± 0.029
HisPhe: 0.547 ± 0.021
HisGly: 1.886 ± 0.046
HisHis: 0.484 ± 0.021
HisIle: 0.718 ± 0.027
HisLys: 0.304 ± 0.017
HisLeu: 2.019 ± 0.05
HisMet: 0.28 ± 0.017
HisAsn: 0.431 ± 0.018
HisPro: 1.469 ± 0.034
HisGln: 0.548 ± 0.021
HisArg: 1.531 ± 0.035
HisSer: 1.148 ± 0.03
HisThr: 1.134 ± 0.032
HisVal: 1.446 ± 0.034
HisTrp: 0.289 ± 0.014
HisTyr: 0.458 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.156 ± 0.075
IleCys: 0.264 ± 0.016
IleAsp: 3.43 ± 0.052
IleGlu: 2.635 ± 0.053
IlePhe: 1.33 ± 0.037
IleGly: 4.592 ± 0.073
IleHis: 0.687 ± 0.028
IleIle: 1.964 ± 0.047
IleLys: 0.905 ± 0.031
IleLeu: 4.202 ± 0.068
IleMet: 0.664 ± 0.022
IleAsn: 1.099 ± 0.029
IlePro: 2.438 ± 0.045
IleGln: 1.069 ± 0.024
IleArg: 2.666 ± 0.046
IleSer: 2.578 ± 0.048
IleThr: 3.083 ± 0.051
IleVal: 4.69 ± 0.073
IleTrp: 0.541 ± 0.019
IleTyr: 0.81 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.522 ± 0.054
LysCys: 0.06 ± 0.006
LysAsp: 1.081 ± 0.032
LysGlu: 0.87 ± 0.032
LysPhe: 0.58 ± 0.021
LysGly: 1.422 ± 0.041
LysHis: 0.439 ± 0.019
LysIle: 0.996 ± 0.033
LysLys: 0.798 ± 0.029
LysLeu: 2.067 ± 0.043
LysMet: 0.414 ± 0.021
LysAsn: 0.624 ± 0.024
LysPro: 1.278 ± 0.036
LysGln: 0.687 ± 0.024
LysArg: 1.488 ± 0.039
LysSer: 1.2 ± 0.036
LysThr: 1.45 ± 0.035
LysVal: 1.632 ± 0.041
LysTrp: 0.266 ± 0.015
LysTyr: 0.47 ± 0.019
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.387 ± 0.152
LeuCys: 0.547 ± 0.019
LeuAsp: 6.585 ± 0.095
LeuGlu: 4.768 ± 0.063
LeuPhe: 3.105 ± 0.056
LeuGly: 9.717 ± 0.11
LeuHis: 1.903 ± 0.04
LeuIle: 4.865 ± 0.067
LeuLys: 1.894 ± 0.042
LeuLeu: 10.972 ± 0.149
LeuMet: 1.772 ± 0.04
LeuAsn: 2.187 ± 0.043
LeuPro: 5.766 ± 0.064
LeuGln: 2.62 ± 0.044
LeuArg: 6.786 ± 0.084
LeuSer: 6.294 ± 0.079
LeuThr: 7.289 ± 0.075
LeuVal: 9.78 ± 0.11
LeuTrp: 1.316 ± 0.034
LeuTyr: 1.759 ± 0.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.099 ± 0.042
MetCys: 0.083 ± 0.007
MetAsp: 0.783 ± 0.025
MetGlu: 0.608 ± 0.02
MetPhe: 0.509 ± 0.021
MetGly: 1.298 ± 0.035
MetHis: 0.334 ± 0.015
MetIle: 0.889 ± 0.028
MetLys: 0.48 ± 0.019
MetLeu: 2.004 ± 0.04
MetMet: 0.305 ± 0.018
MetAsn: 0.53 ± 0.021
MetPro: 1.051 ± 0.029
MetGln: 0.526 ± 0.022
MetArg: 1.23 ± 0.033
MetSer: 1.423 ± 0.036
MetThr: 1.712 ± 0.039
MetVal: 1.362 ± 0.037
MetTrp: 0.167 ± 0.012
MetTyr: 0.272 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.633 ± 0.05
AsnCys: 0.122 ± 0.011
AsnAsp: 1.336 ± 0.036
AsnGlu: 1.028 ± 0.029
AsnPhe: 0.729 ± 0.024
AsnGly: 2.191 ± 0.041
AsnHis: 0.396 ± 0.017
AsnIle: 0.958 ± 0.027
AsnLys: 0.47 ± 0.022
AsnLeu: 2.277 ± 0.044
AsnMet: 0.366 ± 0.021
AsnAsn: 0.589 ± 0.022
AsnPro: 1.786 ± 0.04
AsnGln: 0.666 ± 0.026
AsnArg: 1.431 ± 0.034
AsnSer: 1.187 ± 0.042
AsnThr: 1.463 ± 0.04
AsnVal: 1.827 ± 0.038
AsnTrp: 0.37 ± 0.019
AsnTyr: 0.55 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.258 ± 0.104
ProCys: 0.178 ± 0.011
ProAsp: 3.837 ± 0.068
ProGlu: 3.432 ± 0.054
ProPhe: 1.664 ± 0.039
ProGly: 5.28 ± 0.083
ProHis: 1.081 ± 0.029
ProIle: 2.072 ± 0.043
ProLys: 0.979 ± 0.033
ProLeu: 4.986 ± 0.064
ProMet: 0.9 ± 0.027
ProAsn: 1.179 ± 0.033
ProPro: 2.436 ± 0.06
ProGln: 1.511 ± 0.039
ProArg: 3.178 ± 0.056
ProSer: 3.116 ± 0.052
ProThr: 3.988 ± 0.061
ProVal: 5.26 ± 0.07
ProTrp: 0.773 ± 0.022
ProTyr: 1.051 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.845 ± 0.064
GlnCys: 0.124 ± 0.011
GlnAsp: 1.327 ± 0.035
GlnGlu: 1.197 ± 0.039
GlnPhe: 0.905 ± 0.026
GlnGly: 2.088 ± 0.041
GlnHis: 0.591 ± 0.023
GlnIle: 1.479 ± 0.036
GlnLys: 0.724 ± 0.025
GlnLeu: 3.201 ± 0.05
GlnMet: 0.554 ± 0.018
GlnAsn: 0.771 ± 0.028
GlnPro: 1.683 ± 0.038
GlnGln: 1.098 ± 0.032
GlnArg: 2.173 ± 0.045
GlnSer: 1.687 ± 0.037
GlnThr: 1.702 ± 0.035
GlnVal: 2.611 ± 0.05
GlnTrp: 0.444 ± 0.02
GlnTyr: 0.628 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.925 ± 0.101
ArgCys: 0.29 ± 0.016
ArgAsp: 3.748 ± 0.055
ArgGlu: 3.478 ± 0.061
ArgPhe: 2.357 ± 0.046
ArgGly: 4.805 ± 0.077
ArgHis: 1.459 ± 0.035
ArgIle: 3.202 ± 0.046
ArgLys: 1.339 ± 0.038
ArgLeu: 7.129 ± 0.089
ArgMet: 1.611 ± 0.037
ArgAsn: 1.423 ± 0.035
ArgPro: 3.714 ± 0.063
ArgGln: 2.046 ± 0.045
ArgArg: 5.566 ± 0.086
ArgSer: 4.006 ± 0.055
ArgThr: 4.198 ± 0.067
ArgVal: 5.418 ± 0.071
ArgTrp: 1.028 ± 0.028
ArgTyr: 1.533 ± 0.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.766 ± 0.088
SerCys: 0.276 ± 0.015
SerAsp: 3.222 ± 0.064
SerGlu: 2.547 ± 0.047
SerPhe: 2.061 ± 0.045
SerGly: 6.034 ± 0.074
SerHis: 1.081 ± 0.032
SerIle: 2.829 ± 0.05
SerLys: 1.128 ± 0.034
SerLeu: 5.742 ± 0.079
SerMet: 1.204 ± 0.034
SerAsn: 1.285 ± 0.033
SerPro: 3.333 ± 0.052
SerGln: 1.492 ± 0.033
SerArg: 3.718 ± 0.057
SerSer: 3.837 ± 0.065
SerThr: 4.397 ± 0.061
SerVal: 5.237 ± 0.063
SerTrp: 0.958 ± 0.028
SerTyr: 1.369 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.762 ± 0.102
ThrCys: 0.302 ± 0.016
ThrAsp: 4.254 ± 0.059
ThrGlu: 3.356 ± 0.052
ThrPhe: 1.936 ± 0.04
ThrGly: 6.606 ± 0.095
ThrHis: 1.176 ± 0.032
ThrIle: 3.014 ± 0.046
ThrLys: 1.297 ± 0.036
ThrLeu: 6.777 ± 0.073
ThrMet: 0.98 ± 0.028
ThrAsn: 1.45 ± 0.043
ThrPro: 4.541 ± 0.066
ThrGln: 1.626 ± 0.039
ThrArg: 3.915 ± 0.065
ThrSer: 3.889 ± 0.065
ThrThr: 4.259 ± 0.068
ThrVal: 6.652 ± 0.088
ThrTrp: 0.926 ± 0.027
ThrTyr: 1.128 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.351 ± 0.113
ValCys: 0.473 ± 0.02
ValAsp: 5.408 ± 0.069
ValGlu: 4.215 ± 0.057
ValPhe: 2.957 ± 0.049
ValGly: 7.13 ± 0.085
ValHis: 1.643 ± 0.038
ValIle: 4.41 ± 0.063
ValLys: 1.649 ± 0.038
ValLeu: 9.932 ± 0.134
ValMet: 1.446 ± 0.035
ValAsn: 2.067 ± 0.039
ValPro: 4.704 ± 0.067
ValGln: 2.413 ± 0.048
ValArg: 5.425 ± 0.075
ValSer: 5.449 ± 0.07
ValThr: 6.315 ± 0.086
ValVal: 8.257 ± 0.087
ValTrp: 1.132 ± 0.028
ValTyr: 1.58 ± 0.038
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.627 ± 0.044
TrpCys: 0.09 ± 0.008
TrpAsp: 0.709 ± 0.025
TrpGlu: 0.562 ± 0.018
TrpPhe: 0.557 ± 0.022
TrpGly: 1.032 ± 0.029
TrpHis: 0.334 ± 0.017
TrpIle: 0.673 ± 0.026
TrpLys: 0.313 ± 0.018
TrpLeu: 1.824 ± 0.039
TrpMet: 0.315 ± 0.017
TrpAsn: 0.474 ± 0.019
TrpPro: 0.781 ± 0.025
TrpGln: 0.624 ± 0.019
TrpArg: 1.088 ± 0.032
TrpSer: 0.949 ± 0.028
TrpThr: 0.949 ± 0.03
TrpVal: 1.122 ± 0.033
TrpTrp: 0.345 ± 0.017
TrpTyr: 0.318 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.424 ± 0.037
TyrCys: 0.138 ± 0.009
TyrAsp: 1.282 ± 0.036
TyrGlu: 1.022 ± 0.029
TyrPhe: 0.754 ± 0.023
TyrGly: 1.868 ± 0.039
TyrHis: 0.293 ± 0.016
TyrIle: 0.708 ± 0.023
TyrLys: 0.395 ± 0.022
TyrLeu: 2.409 ± 0.05
TyrMet: 0.259 ± 0.015
TyrAsn: 0.542 ± 0.022
TyrPro: 1.105 ± 0.031
TyrGln: 0.641 ± 0.021
TyrArg: 1.591 ± 0.033
TyrSer: 1.285 ± 0.033
TyrThr: 1.28 ± 0.032
TyrVal: 1.521 ± 0.032
TyrTrp: 0.353 ± 0.02
TyrTyr: 0.507 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3936 proteins (1284498 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
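A protein-level bootstrap of this kind can be sketched roughly as follows. This is an illustration under stated assumptions rather than the procedure actually used for the table: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The function names, the fixed seed, and the use of the standard deviation as the reported "±" value are all assumptions.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse below
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input, not real proteome data.
m, s = bootstrap_permille(["AAAA", "ACAC", "CCCC", "AACC"], "AA")
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, which is what "at the protein level" suggests.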

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski