Amino acid dipepetide frequency for Aneurinibacillus soli

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.904AlaAla: 7.904 ± 0.121
0.845AlaCys: 0.845 ± 0.028
4.392AlaAsp: 4.392 ± 0.077
5.69AlaGlu: 5.69 ± 0.09
3.244AlaPhe: 3.244 ± 0.062
6.515AlaGly: 6.515 ± 0.087
1.656AlaHis: 1.656 ± 0.041
5.707AlaIle: 5.707 ± 0.068
4.367AlaLys: 4.367 ± 0.071
8.018AlaLeu: 8.018 ± 0.088
2.247AlaMet: 2.247 ± 0.046
2.736AlaAsn: 2.736 ± 0.053
2.683AlaPro: 2.683 ± 0.061
3.104AlaGln: 3.104 ± 0.06
4.268AlaArg: 4.268 ± 0.072
4.982AlaSer: 4.982 ± 0.093
4.097AlaThr: 4.097 ± 0.07
6.557AlaVal: 6.557 ± 0.103
0.739AlaTrp: 0.739 ± 0.028
2.602AlaTyr: 2.602 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.619CysAla: 0.619 ± 0.027
0.13CysCys: 0.13 ± 0.011
0.49CysAsp: 0.49 ± 0.023
0.505CysGlu: 0.505 ± 0.021
0.355CysPhe: 0.355 ± 0.017
0.85CysGly: 0.85 ± 0.03
0.19CysHis: 0.19 ± 0.012
0.58CysIle: 0.58 ± 0.023
0.405CysLys: 0.405 ± 0.021
0.802CysLeu: 0.802 ± 0.026
0.25CysMet: 0.25 ± 0.015
0.294CysAsn: 0.294 ± 0.017
0.428CysPro: 0.428 ± 0.021
0.251CysGln: 0.251 ± 0.015
0.515CysArg: 0.515 ± 0.021
0.539CysSer: 0.539 ± 0.023
0.542CysThr: 0.542 ± 0.019
0.586CysVal: 0.586 ± 0.024
0.096CysTrp: 0.096 ± 0.01
0.24CysTyr: 0.24 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.871AspAla: 3.871 ± 0.063
0.429AspCys: 0.429 ± 0.022
2.285AspAsp: 2.285 ± 0.054
4.069AspGlu: 4.069 ± 0.069
1.988AspPhe: 1.988 ± 0.045
3.656AspGly: 3.656 ± 0.08
1.021AspHis: 1.021 ± 0.028
3.815AspIle: 3.815 ± 0.061
2.713AspLys: 2.713 ± 0.051
4.271AspLeu: 4.271 ± 0.057
1.669AspMet: 1.669 ± 0.038
1.565AspAsn: 1.565 ± 0.043
1.873AspPro: 1.873 ± 0.039
1.718AspGln: 1.718 ± 0.042
2.592AspArg: 2.592 ± 0.051
2.496AspSer: 2.496 ± 0.051
2.836AspThr: 2.836 ± 0.052
4.242AspVal: 4.242 ± 0.076
0.642AspTrp: 0.642 ± 0.026
1.682AspTyr: 1.682 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
6.227GluAla: 6.227 ± 0.088
0.42GluCys: 0.42 ± 0.019
3.111GluAsp: 3.111 ± 0.066
6.315GluGlu: 6.315 ± 0.115
2.184GluPhe: 2.184 ± 0.05
4.233GluGly: 4.233 ± 0.062
1.509GluHis: 1.509 ± 0.036
4.883GluIle: 4.883 ± 0.07
5.389GluLys: 5.389 ± 0.084
6.743GluLeu: 6.743 ± 0.086
2.336GluMet: 2.336 ± 0.057
2.8GluAsn: 2.8 ± 0.052
1.95GluPro: 1.95 ± 0.047
3.929GluGln: 3.929 ± 0.068
4.231GluArg: 4.231 ± 0.075
3.202GluSer: 3.202 ± 0.052
3.537GluThr: 3.537 ± 0.056
4.912GluVal: 4.912 ± 0.073
0.848GluTrp: 0.848 ± 0.024
2.037GluTyr: 2.037 ± 0.043
0.0GluXaa: 0.0 ± 0.0
Phe
3.309PheAla: 3.309 ± 0.054
0.389PheCys: 0.389 ± 0.018
2.058PheAsp: 2.058 ± 0.046
2.235PheGlu: 2.235 ± 0.042
1.901PhePhe: 1.901 ± 0.051
3.064PheGly: 3.064 ± 0.048
0.893PheHis: 0.893 ± 0.027
2.803PheIle: 2.803 ± 0.06
1.653PheLys: 1.653 ± 0.042
4.029PheLeu: 4.029 ± 0.081
1.045PheMet: 1.045 ± 0.03
1.372PheAsn: 1.372 ± 0.04
1.502PhePro: 1.502 ± 0.039
1.271PheGln: 1.271 ± 0.037
1.796PheArg: 1.796 ± 0.044
2.687PheSer: 2.687 ± 0.061
2.627PheThr: 2.627 ± 0.05
3.082PheVal: 3.082 ± 0.066
0.446PheTrp: 0.446 ± 0.021
1.436PheTyr: 1.436 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.402GlyAla: 5.402 ± 0.086
0.749GlyCys: 0.749 ± 0.025
3.367GlyAsp: 3.367 ± 0.061
4.571GlyGlu: 4.571 ± 0.076
3.01GlyPhe: 3.01 ± 0.059
5.227GlyGly: 5.227 ± 0.084
1.498GlyHis: 1.498 ± 0.038
5.628GlyIle: 5.628 ± 0.071
4.65GlyLys: 4.65 ± 0.069
6.572GlyLeu: 6.572 ± 0.089
2.385GlyMet: 2.385 ± 0.047
2.511GlyAsn: 2.511 ± 0.058
1.716GlyPro: 1.716 ± 0.039
2.7GlyGln: 2.7 ± 0.053
3.518GlyArg: 3.518 ± 0.063
4.045GlySer: 4.045 ± 0.071
4.679GlyThr: 4.679 ± 0.09
5.541GlyVal: 5.541 ± 0.073
0.901GlyTrp: 0.901 ± 0.034
2.761GlyTyr: 2.761 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
1.554HisAla: 1.554 ± 0.038
0.195HisCys: 0.195 ± 0.012
1.119HisAsp: 1.119 ± 0.032
1.428HisGlu: 1.428 ± 0.038
0.901HisPhe: 0.901 ± 0.026
1.533HisGly: 1.533 ± 0.042
0.68HisHis: 0.68 ± 0.026
1.537HisIle: 1.537 ± 0.04
1.034HisLys: 1.034 ± 0.027
2.082HisLeu: 2.082 ± 0.051
0.615HisMet: 0.615 ± 0.023
0.757HisAsn: 0.757 ± 0.025
1.255HisPro: 1.255 ± 0.032
0.838HisGln: 0.838 ± 0.027
1.017HisArg: 1.017 ± 0.032
1.214HisSer: 1.214 ± 0.036
1.37HisThr: 1.37 ± 0.033
1.74HisVal: 1.74 ± 0.044
0.225HisTrp: 0.225 ± 0.013
0.768HisTyr: 0.768 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
6.051IleAla: 6.051 ± 0.076
0.676IleCys: 0.676 ± 0.029
3.695IleAsp: 3.695 ± 0.059
5.26IleGlu: 5.26 ± 0.077
2.468IlePhe: 2.468 ± 0.059
5.538IleGly: 5.538 ± 0.081
1.604IleHis: 1.604 ± 0.036
4.637IleIle: 4.637 ± 0.069
3.347IleLys: 3.347 ± 0.06
5.897IleLeu: 5.897 ± 0.083
1.833IleMet: 1.833 ± 0.04
2.459IleAsn: 2.459 ± 0.048
3.185IlePro: 3.185 ± 0.054
2.81IleGln: 2.81 ± 0.046
3.713IleArg: 3.713 ± 0.06
4.343IleSer: 4.343 ± 0.07
4.426IleThr: 4.426 ± 0.062
5.35IleVal: 5.35 ± 0.077
0.624IleTrp: 0.624 ± 0.024
1.832IleTyr: 1.832 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
4.424LysAla: 4.424 ± 0.076
0.319LysCys: 0.319 ± 0.018
2.777LysAsp: 2.777 ± 0.05
4.965LysGlu: 4.965 ± 0.09
1.578LysPhe: 1.578 ± 0.038
3.646LysGly: 3.646 ± 0.064
1.095LysHis: 1.095 ± 0.033
3.633LysIle: 3.633 ± 0.056
4.479LysLys: 4.479 ± 0.083
5.116LysLeu: 5.116 ± 0.065
1.86LysMet: 1.86 ± 0.04
2.501LysAsn: 2.501 ± 0.05
2.254LysPro: 2.254 ± 0.043
3.171LysGln: 3.171 ± 0.059
3.075LysArg: 3.075 ± 0.061
2.858LysSer: 2.858 ± 0.055
3.125LysThr: 3.125 ± 0.056
3.99LysVal: 3.99 ± 0.069
0.696LysTrp: 0.696 ± 0.025
1.751LysTyr: 1.751 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
8.798LeuAla: 8.798 ± 0.106
0.895LeuCys: 0.895 ± 0.03
4.849LeuAsp: 4.849 ± 0.077
5.847LeuGlu: 5.847 ± 0.09
4.233LeuPhe: 4.233 ± 0.083
6.394LeuGly: 6.394 ± 0.077
2.342LeuHis: 2.342 ± 0.046
6.059LeuIle: 6.059 ± 0.085
4.704LeuLys: 4.704 ± 0.069
10.232LeuLeu: 10.232 ± 0.139
2.37LeuMet: 2.37 ± 0.047
3.406LeuAsn: 3.406 ± 0.051
4.17LeuPro: 4.17 ± 0.065
3.57LeuGln: 3.57 ± 0.058
4.696LeuArg: 4.696 ± 0.072
6.475LeuSer: 6.475 ± 0.094
5.867LeuThr: 5.867 ± 0.083
6.613LeuVal: 6.613 ± 0.096
0.795LeuTrp: 0.795 ± 0.028
3.224LeuTyr: 3.224 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
2.349MetAla: 2.349 ± 0.051
0.191MetCys: 0.191 ± 0.012
1.417MetAsp: 1.417 ± 0.038
2.104MetGlu: 2.104 ± 0.045
0.98MetPhe: 0.98 ± 0.032
1.886MetGly: 1.886 ± 0.041
0.509MetHis: 0.509 ± 0.024
2.007MetIle: 2.007 ± 0.044
2.185MetLys: 2.185 ± 0.044
2.873MetLeu: 2.873 ± 0.049
1.037MetMet: 1.037 ± 0.038
1.533MetAsn: 1.533 ± 0.039
1.179MetPro: 1.179 ± 0.034
1.255MetGln: 1.255 ± 0.035
1.539MetArg: 1.539 ± 0.039
1.782MetSer: 1.782 ± 0.041
1.638MetThr: 1.638 ± 0.04
1.873MetVal: 1.873 ± 0.035
0.218MetTrp: 0.218 ± 0.015
0.905MetTyr: 0.905 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.757AsnAla: 2.757 ± 0.054
0.286AsnCys: 0.286 ± 0.014
1.68AsnAsp: 1.68 ± 0.045
2.502AsnGlu: 2.502 ± 0.051
1.248AsnPhe: 1.248 ± 0.034
3.101AsnGly: 3.101 ± 0.074
0.884AsnHis: 0.884 ± 0.028
2.576AsnIle: 2.576 ± 0.047
2.199AsnLys: 2.199 ± 0.048
3.186AsnLeu: 3.186 ± 0.058
1.138AsnMet: 1.138 ± 0.027
1.43AsnAsn: 1.43 ± 0.045
2.071AsnPro: 2.071 ± 0.045
1.647AsnGln: 1.647 ± 0.04
2.106AsnArg: 2.106 ± 0.049
1.883AsnSer: 1.883 ± 0.047
2.041AsnThr: 2.041 ± 0.046
2.93AsnVal: 2.93 ± 0.059
0.417AsnTrp: 0.417 ± 0.021
1.124AsnTyr: 1.124 ± 0.032
0.0AsnXaa: 0.0 ± 0.0
Pro
3.028ProAla: 3.028 ± 0.062
0.272ProCys: 0.272 ± 0.016
2.464ProAsp: 2.464 ± 0.051
2.975ProGlu: 2.975 ± 0.057
1.786ProPhe: 1.786 ± 0.046
2.765ProGly: 2.765 ± 0.053
0.937ProHis: 0.937 ± 0.029
2.453ProIle: 2.453 ± 0.055
1.92ProLys: 1.92 ± 0.046
3.822ProLeu: 3.822 ± 0.07
0.875ProMet: 0.875 ± 0.026
1.457ProAsn: 1.457 ± 0.034
1.276ProPro: 1.276 ± 0.043
1.398ProGln: 1.398 ± 0.035
1.433ProArg: 1.433 ± 0.032
2.282ProSer: 2.282 ± 0.05
2.159ProThr: 2.159 ± 0.053
3.328ProVal: 3.328 ± 0.057
0.376ProTrp: 0.376 ± 0.019
1.493ProTyr: 1.493 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
3.814GlnAla: 3.814 ± 0.072
0.226GlnCys: 0.226 ± 0.015
1.706GlnAsp: 1.706 ± 0.034
3.213GlnGlu: 3.213 ± 0.061
1.433GlnPhe: 1.433 ± 0.037
2.489GlnGly: 2.489 ± 0.045
0.87GlnHis: 0.87 ± 0.023
2.684GlnIle: 2.684 ± 0.047
2.715GlnLys: 2.715 ± 0.057
3.929GlnLeu: 3.929 ± 0.061
1.341GlnMet: 1.341 ± 0.035
1.545GlnAsn: 1.545 ± 0.04
1.464GlnPro: 1.464 ± 0.037
2.201GlnGln: 2.201 ± 0.058
1.957GlnArg: 1.957 ± 0.042
2.19GlnSer: 2.19 ± 0.048
2.426GlnThr: 2.426 ± 0.048
2.952GlnVal: 2.952 ± 0.05
0.429GlnTrp: 0.429 ± 0.02
1.322GlnTyr: 1.322 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
3.705ArgAla: 3.705 ± 0.067
0.399ArgCys: 0.399 ± 0.019
2.584ArgAsp: 2.584 ± 0.054
3.875ArgGlu: 3.875 ± 0.067
2.13ArgPhe: 2.13 ± 0.049
2.967ArgGly: 2.967 ± 0.055
1.117ArgHis: 1.117 ± 0.032
3.905ArgIle: 3.905 ± 0.061
3.207ArgLys: 3.207 ± 0.056
5.177ArgLeu: 5.177 ± 0.079
1.768ArgMet: 1.768 ± 0.036
1.9ArgAsn: 1.9 ± 0.041
1.7ArgPro: 1.7 ± 0.039
2.216ArgGln: 2.216 ± 0.053
2.994ArgArg: 2.994 ± 0.062
2.754ArgSer: 2.754 ± 0.052
2.909ArgThr: 2.909 ± 0.05
3.615ArgVal: 3.615 ± 0.054
0.56ArgTrp: 0.56 ± 0.023
1.88ArgTyr: 1.88 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
4.306SerAla: 4.306 ± 0.08
0.541SerCys: 0.541 ± 0.021
2.627SerAsp: 2.627 ± 0.058
3.45SerGlu: 3.45 ± 0.057
2.77SerPhe: 2.77 ± 0.056
4.751SerGly: 4.751 ± 0.078
1.299SerHis: 1.299 ± 0.036
4.401SerIle: 4.401 ± 0.071
2.892SerLys: 2.892 ± 0.059
5.826SerLeu: 5.826 ± 0.078
1.78SerMet: 1.78 ± 0.038
2.112SerAsn: 2.112 ± 0.056
2.217SerPro: 2.217 ± 0.048
2.115SerGln: 2.115 ± 0.045
2.917SerArg: 2.917 ± 0.041
3.876SerSer: 3.876 ± 0.083
3.268SerThr: 3.268 ± 0.064
4.354SerVal: 4.354 ± 0.073
0.613SerTrp: 0.613 ± 0.023
2.19SerTyr: 2.19 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
4.909ThrAla: 4.909 ± 0.086
0.477ThrCys: 0.477 ± 0.02
2.838ThrAsp: 2.838 ± 0.055
3.686ThrGlu: 3.686 ± 0.056
2.565ThrPhe: 2.565 ± 0.054
4.833ThrGly: 4.833 ± 0.064
1.171ThrHis: 1.171 ± 0.035
4.252ThrIle: 4.252 ± 0.062
2.969ThrLys: 2.969 ± 0.059
5.424ThrLeu: 5.424 ± 0.079
1.475ThrMet: 1.475 ± 0.034
2.216ThrAsn: 2.216 ± 0.056
2.684ThrPro: 2.684 ± 0.058
1.848ThrGln: 1.848 ± 0.043
2.591ThrArg: 2.591 ± 0.052
3.445ThrSer: 3.445 ± 0.068
3.245ThrThr: 3.245 ± 0.07
4.922ThrVal: 4.922 ± 0.088
0.631ThrTrp: 0.631 ± 0.027
2.03ThrTyr: 2.03 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
6.102ValAla: 6.102 ± 0.087
0.791ValCys: 0.791 ± 0.026
3.704ValAsp: 3.704 ± 0.065
5.109ValGlu: 5.109 ± 0.085
2.886ValPhe: 2.886 ± 0.053
4.822ValGly: 4.822 ± 0.079
1.578ValHis: 1.578 ± 0.035
5.228ValIle: 5.228 ± 0.071
4.138ValLys: 4.138 ± 0.067
7.457ValLeu: 7.457 ± 0.101
2.113ValMet: 2.113 ± 0.043
2.849ValAsn: 2.849 ± 0.056
3.186ValPro: 3.186 ± 0.056
3.101ValGln: 3.101 ± 0.054
3.962ValArg: 3.962 ± 0.064
4.744ValSer: 4.744 ± 0.073
4.793ValThr: 4.793 ± 0.09
5.642ValVal: 5.642 ± 0.088
0.75ValTrp: 0.75 ± 0.026
2.474ValTyr: 2.474 ± 0.045
0.0ValXaa: 0.0 ± 0.0
Trp
0.717TrpAla: 0.717 ± 0.028
0.084TrpCys: 0.084 ± 0.008
0.544TrpAsp: 0.544 ± 0.025
0.656TrpGlu: 0.656 ± 0.024
0.495TrpPhe: 0.495 ± 0.021
0.667TrpGly: 0.667 ± 0.031
0.231TrpHis: 0.231 ± 0.015
0.736TrpIle: 0.736 ± 0.027
0.646TrpLys: 0.646 ± 0.026
1.201TrpLeu: 1.201 ± 0.041
0.41TrpMet: 0.41 ± 0.019
0.51TrpAsn: 0.51 ± 0.024
0.281TrpPro: 0.281 ± 0.015
0.433TrpGln: 0.433 ± 0.017
0.509TrpArg: 0.509 ± 0.024
0.597TrpSer: 0.597 ± 0.024
0.576TrpThr: 0.576 ± 0.031
0.661TrpVal: 0.661 ± 0.022
0.139TrpTrp: 0.139 ± 0.012
0.387TrpTyr: 0.387 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.509TyrAla: 2.509 ± 0.057
0.355TyrCys: 0.355 ± 0.02
1.727TyrAsp: 1.727 ± 0.037
2.339TyrGlu: 2.339 ± 0.053
1.445TyrPhe: 1.445 ± 0.04
2.494TyrGly: 2.494 ± 0.051
0.774TyrHis: 0.774 ± 0.029
2.225TyrIle: 2.225 ± 0.048
1.701TyrLys: 1.701 ± 0.042
2.881TyrLeu: 2.881 ± 0.052
0.914TyrMet: 0.914 ± 0.027
1.252TyrAsn: 1.252 ± 0.038
1.424TyrPro: 1.424 ± 0.038
1.331TyrGln: 1.331 ± 0.039
1.933TyrArg: 1.933 ± 0.042
1.912TyrSer: 1.912 ± 0.041
2.049TyrThr: 2.049 ± 0.045
2.538TyrVal: 2.538 ± 0.052
0.334TyrTrp: 0.334 ± 0.017
1.267TyrTyr: 1.267 ± 0.039
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3962 proteins (1158177 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski