Amino acid dipepetide frequency for Archaeoglobus fulgidus (strain ATCC 49558 / VC-16 / DSM 4304 / JCM 9628 / NBRC 100126)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.74AlaAla: 6.74 ± 0.141
0.932AlaCys: 0.932 ± 0.045
3.582AlaAsp: 3.582 ± 0.078
7.619AlaGlu: 7.619 ± 0.122
3.582AlaPhe: 3.582 ± 0.082
5.968AlaGly: 5.968 ± 0.099
0.952AlaHis: 0.952 ± 0.041
6.133AlaIle: 6.133 ± 0.092
5.54AlaLys: 5.54 ± 0.101
7.709AlaLeu: 7.709 ± 0.126
2.402AlaMet: 2.402 ± 0.067
2.108AlaAsn: 2.108 ± 0.056
2.019AlaPro: 2.019 ± 0.057
1.094AlaGln: 1.094 ± 0.043
4.101AlaArg: 4.101 ± 0.089
3.804AlaSer: 3.804 ± 0.08
3.278AlaThr: 3.278 ± 0.076
7.883AlaVal: 7.883 ± 0.115
0.627AlaTrp: 0.627 ± 0.035
2.585AlaTyr: 2.585 ± 0.062
0.0AlaXaa: 0.0 ± 0.0
Cys
0.627CysAla: 0.627 ± 0.034
0.206CysCys: 0.206 ± 0.02
0.614CysAsp: 0.614 ± 0.03
0.802CysGlu: 0.802 ± 0.034
0.424CysPhe: 0.424 ± 0.025
1.333CysGly: 1.333 ± 0.054
0.281CysHis: 0.281 ± 0.021
0.69CysIle: 0.69 ± 0.034
0.619CysLys: 0.619 ± 0.033
0.917CysLeu: 0.917 ± 0.042
0.325CysMet: 0.325 ± 0.023
0.443CysAsn: 0.443 ± 0.028
0.811CysPro: 0.811 ± 0.046
0.206CysGln: 0.206 ± 0.019
0.726CysArg: 0.726 ± 0.036
0.807CysSer: 0.807 ± 0.045
0.515CysThr: 0.515 ± 0.032
0.844CysVal: 0.844 ± 0.041
0.129CysTrp: 0.129 ± 0.013
0.465CysTyr: 0.465 ± 0.026
0.0CysXaa: 0.0 ± 0.0
Asp
3.842AspAla: 3.842 ± 0.08
0.598AspCys: 0.598 ± 0.029
2.274AspAsp: 2.274 ± 0.073
5.106AspGlu: 5.106 ± 0.095
2.893AspPhe: 2.893 ± 0.067
3.697AspGly: 3.697 ± 0.09
0.63AspHis: 0.63 ± 0.03
3.605AspIle: 3.605 ± 0.079
2.285AspLys: 2.285 ± 0.059
4.354AspLeu: 4.354 ± 0.082
1.182AspMet: 1.182 ± 0.046
1.112AspAsn: 1.112 ± 0.046
1.792AspPro: 1.792 ± 0.062
0.521AspGln: 0.521 ± 0.028
2.703AspArg: 2.703 ± 0.076
2.48AspSer: 2.48 ± 0.057
1.707AspThr: 1.707 ± 0.052
5.133AspVal: 5.133 ± 0.098
0.616AspTrp: 0.616 ± 0.031
2.438AspTyr: 2.438 ± 0.067
0.0AspXaa: 0.0 ± 0.0
Glu
6.121GluAla: 6.121 ± 0.11
0.798GluCys: 0.798 ± 0.04
4.086GluAsp: 4.086 ± 0.086
9.14GluGlu: 9.14 ± 0.171
3.854GluPhe: 3.854 ± 0.078
5.698GluGly: 5.698 ± 0.097
1.176GluHis: 1.176 ± 0.038
7.756GluIle: 7.756 ± 0.117
8.225GluLys: 8.225 ± 0.119
8.273GluLeu: 8.273 ± 0.129
2.654GluMet: 2.654 ± 0.07
3.291GluAsn: 3.291 ± 0.071
2.158GluPro: 2.158 ± 0.057
1.466GluGln: 1.466 ± 0.055
5.947GluArg: 5.947 ± 0.104
3.915GluSer: 3.915 ± 0.082
2.818GluThr: 2.818 ± 0.071
8.316GluVal: 8.316 ± 0.142
0.953GluTrp: 0.953 ± 0.04
2.57GluTyr: 2.57 ± 0.076
0.0GluXaa: 0.0 ± 0.0
Phe
3.774PheAla: 3.774 ± 0.09
0.592PheCys: 0.592 ± 0.031
2.556PheAsp: 2.556 ± 0.067
3.9PheGlu: 3.9 ± 0.084
2.218PhePhe: 2.218 ± 0.061
3.767PheGly: 3.767 ± 0.084
0.645PheHis: 0.645 ± 0.033
2.906PheIle: 2.906 ± 0.071
2.282PheLys: 2.282 ± 0.056
4.301PheLeu: 4.301 ± 0.104
1.011PheMet: 1.011 ± 0.041
1.413PheAsn: 1.413 ± 0.041
1.693PhePro: 1.693 ± 0.051
0.831PheGln: 0.831 ± 0.036
2.707PheArg: 2.707 ± 0.066
3.043PheSer: 3.043 ± 0.069
2.091PheThr: 2.091 ± 0.067
3.797PheVal: 3.797 ± 0.077
0.515PheTrp: 0.515 ± 0.031
1.701PheTyr: 1.701 ± 0.056
0.0PheXaa: 0.0 ± 0.0
Gly
4.7GlyAla: 4.7 ± 0.095
1.126GlyCys: 1.126 ± 0.05
3.706GlyAsp: 3.706 ± 0.085
6.342GlyGlu: 6.342 ± 0.12
3.671GlyPhe: 3.671 ± 0.079
5.44GlyGly: 5.44 ± 0.125
1.053GlyHis: 1.053 ± 0.038
5.849GlyIle: 5.849 ± 0.093
5.725GlyLys: 5.725 ± 0.105
6.274GlyLeu: 6.274 ± 0.109
2.205GlyMet: 2.205 ± 0.058
2.234GlyAsn: 2.234 ± 0.068
1.426GlyPro: 1.426 ± 0.051
1.167GlyGln: 1.167 ± 0.045
4.098GlyArg: 4.098 ± 0.094
3.804GlySer: 3.804 ± 0.086
2.934GlyThr: 2.934 ± 0.076
6.677GlyVal: 6.677 ± 0.11
0.932GlyTrp: 0.932 ± 0.036
3.192GlyTyr: 3.192 ± 0.067
0.0GlyXaa: 0.0 ± 0.0
His
1.189HisAla: 1.189 ± 0.041
0.242HisCys: 0.242 ± 0.02
0.69HisAsp: 0.69 ± 0.036
0.934HisGlu: 0.934 ± 0.042
0.749HisPhe: 0.749 ± 0.033
1.244HisGly: 1.244 ± 0.05
0.369HisHis: 0.369 ± 0.026
1.009HisIle: 1.009 ± 0.04
0.581HisLys: 0.581 ± 0.027
1.512HisLeu: 1.512 ± 0.05
0.348HisMet: 0.348 ± 0.021
0.445HisAsn: 0.445 ± 0.026
1.102HisPro: 1.102 ± 0.041
0.322HisGln: 0.322 ± 0.024
0.863HisArg: 0.863 ± 0.035
0.911HisSer: 0.911 ± 0.044
0.675HisThr: 0.675 ± 0.033
1.114HisVal: 1.114 ± 0.043
0.174HisTrp: 0.174 ± 0.017
0.634HisTyr: 0.634 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
7.125IleAla: 7.125 ± 0.124
0.782IleCys: 0.782 ± 0.036
3.82IleAsp: 3.82 ± 0.086
6.292IleGlu: 6.292 ± 0.093
3.264IlePhe: 3.264 ± 0.085
4.859IleGly: 4.859 ± 0.103
1.102IleHis: 1.102 ± 0.041
4.529IleIle: 4.529 ± 0.1
4.352IleLys: 4.352 ± 0.083
6.801IleLeu: 6.801 ± 0.111
1.642IleMet: 1.642 ± 0.058
2.255IleAsn: 2.255 ± 0.059
3.396IlePro: 3.396 ± 0.076
1.268IleGln: 1.268 ± 0.049
3.647IleArg: 3.647 ± 0.081
4.538IleSer: 4.538 ± 0.091
3.453IleThr: 3.453 ± 0.076
5.991IleVal: 5.991 ± 0.097
0.552IleTrp: 0.552 ± 0.03
2.62IleTyr: 2.62 ± 0.064
0.0IleXaa: 0.0 ± 0.0
Lys
5.737LysAla: 5.737 ± 0.089
0.761LysCys: 0.761 ± 0.04
3.219LysAsp: 3.219 ± 0.071
6.264LysGlu: 6.264 ± 0.116
2.657LysPhe: 2.657 ± 0.069
4.625LysGly: 4.625 ± 0.082
1.079LysHis: 1.079 ± 0.036
5.349LysIle: 5.349 ± 0.103
5.094LysLys: 5.094 ± 0.095
6.61LysLeu: 6.61 ± 0.107
1.867LysMet: 1.867 ± 0.054
2.34LysAsn: 2.34 ± 0.061
2.86LysPro: 2.86 ± 0.068
1.082LysGln: 1.082 ± 0.038
3.907LysArg: 3.907 ± 0.079
3.3LysSer: 3.3 ± 0.065
2.81LysThr: 2.81 ± 0.074
5.981LysVal: 5.981 ± 0.107
0.696LysTrp: 0.696 ± 0.033
2.408LysTyr: 2.408 ± 0.066
0.0LysXaa: 0.0 ± 0.0
Leu
7.78LeuAla: 7.78 ± 0.128
0.996LeuCys: 0.996 ± 0.042
4.429LeuAsp: 4.429 ± 0.077
7.7LeuGlu: 7.7 ± 0.115
3.758LeuPhe: 3.758 ± 0.099
6.484LeuGly: 6.484 ± 0.11
1.339LeuHis: 1.339 ± 0.047
6.506LeuIle: 6.506 ± 0.113
7.65LeuLys: 7.65 ± 0.112
9.408LeuLeu: 9.408 ± 0.18
2.344LeuMet: 2.344 ± 0.068
3.437LeuAsn: 3.437 ± 0.074
4.04LeuPro: 4.04 ± 0.086
1.943LeuGln: 1.943 ± 0.058
6.13LeuArg: 6.13 ± 0.115
6.454LeuSer: 6.454 ± 0.109
4.379LeuThr: 4.379 ± 0.079
6.326LeuVal: 6.326 ± 0.105
0.788LeuTrp: 0.788 ± 0.037
3.09LeuTyr: 3.09 ± 0.076
0.0LeuXaa: 0.0 ± 0.0
Met
2.116MetAla: 2.116 ± 0.064
0.201MetCys: 0.201 ± 0.018
1.388MetAsp: 1.388 ± 0.051
2.122MetGlu: 2.122 ± 0.065
0.846MetPhe: 0.846 ± 0.038
1.911MetGly: 1.911 ± 0.06
0.486MetHis: 0.486 ± 0.029
1.613MetIle: 1.613 ± 0.053
2.085MetLys: 2.085 ± 0.057
2.943MetLeu: 2.943 ± 0.064
0.695MetMet: 0.695 ± 0.032
0.885MetAsn: 0.885 ± 0.037
1.224MetPro: 1.224 ± 0.047
0.617MetGln: 0.617 ± 0.03
1.807MetArg: 1.807 ± 0.054
1.36MetSer: 1.36 ± 0.049
1.006MetThr: 1.006 ± 0.041
2.005MetVal: 2.005 ± 0.05
0.216MetTrp: 0.216 ± 0.018
0.663MetTyr: 0.663 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
2.721AsnAla: 2.721 ± 0.065
0.474AsnCys: 0.474 ± 0.027
1.221AsnAsp: 1.221 ± 0.051
2.114AsnGlu: 2.114 ± 0.06
1.713AsnPhe: 1.713 ± 0.055
2.432AsnGly: 2.432 ± 0.073
0.502AsnHis: 0.502 ± 0.024
2.064AsnIle: 2.064 ± 0.062
1.285AsnLys: 1.285 ± 0.049
3.697AsnLeu: 3.697 ± 0.075
0.66AsnMet: 0.66 ± 0.03
0.867AsnAsn: 0.867 ± 0.048
2.341AsnPro: 2.341 ± 0.058
0.625AsnGln: 0.625 ± 0.033
1.64AsnArg: 1.64 ± 0.05
1.805AsnSer: 1.805 ± 0.057
1.359AsnThr: 1.359 ± 0.053
2.825AsnVal: 2.825 ± 0.062
0.422AsnTrp: 0.422 ± 0.023
1.356AsnTyr: 1.356 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
2.6ProAla: 2.6 ± 0.073
0.396ProCys: 0.396 ± 0.028
2.291ProAsp: 2.291 ± 0.058
4.225ProGlu: 4.225 ± 0.078
1.896ProPhe: 1.896 ± 0.054
2.344ProGly: 2.344 ± 0.064
0.716ProHis: 0.716 ± 0.033
2.403ProIle: 2.403 ± 0.057
2.476ProLys: 2.476 ± 0.066
3.65ProLeu: 3.65 ± 0.081
0.916ProMet: 0.916 ± 0.044
1.255ProAsn: 1.255 ± 0.051
1.833ProPro: 1.833 ± 0.057
0.908ProGln: 0.908 ± 0.036
1.551ProArg: 1.551 ± 0.048
2.2ProSer: 2.2 ± 0.054
1.674ProThr: 1.674 ± 0.054
3.311ProVal: 3.311 ± 0.074
0.413ProTrp: 0.413 ± 0.026
1.539ProTyr: 1.539 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
1.241GlnAla: 1.241 ± 0.046
0.197GlnCys: 0.197 ± 0.019
0.672GlnAsp: 0.672 ± 0.034
1.218GlnGlu: 1.218 ± 0.047
0.748GlnPhe: 0.748 ± 0.038
1.052GlnGly: 1.052 ± 0.04
0.316GlnHis: 0.316 ± 0.022
1.548GlnIle: 1.548 ± 0.045
1.494GlnLys: 1.494 ± 0.049
1.792GlnLeu: 1.792 ± 0.063
0.578GlnMet: 0.578 ± 0.026
0.764GlnAsn: 0.764 ± 0.038
0.822GlnPro: 0.822 ± 0.039
0.522GlnGln: 0.522 ± 0.033
1.179GlnArg: 1.179 ± 0.043
0.869GlnSer: 0.869 ± 0.043
0.804GlnThr: 0.804 ± 0.035
1.256GlnVal: 1.256 ± 0.044
0.201GlnTrp: 0.201 ± 0.017
0.537GlnTyr: 0.537 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
4.072ArgAla: 4.072 ± 0.092
0.711ArgCys: 0.711 ± 0.043
2.856ArgAsp: 2.856 ± 0.065
5.834ArgGlu: 5.834 ± 0.116
2.529ArgPhe: 2.529 ± 0.066
3.953ArgGly: 3.953 ± 0.087
0.87ArgHis: 0.87 ± 0.036
4.746ArgIle: 4.746 ± 0.093
4.788ArgLys: 4.788 ± 0.077
5.11ArgLeu: 5.11 ± 0.101
1.736ArgMet: 1.736 ± 0.047
1.954ArgAsn: 1.954 ± 0.052
1.53ArgPro: 1.53 ± 0.053
1.082ArgGln: 1.082 ± 0.045
4.025ArgArg: 4.025 ± 0.09
2.255ArgSer: 2.255 ± 0.056
1.928ArgThr: 1.928 ± 0.06
4.951ArgVal: 4.951 ± 0.09
0.631ArgTrp: 0.631 ± 0.03
2.088ArgTyr: 2.088 ± 0.059
0.0ArgXaa: 0.0 ± 0.0
Ser
4.139SerAla: 4.139 ± 0.085
0.658SerCys: 0.658 ± 0.035
2.679SerAsp: 2.679 ± 0.057
4.274SerGlu: 4.274 ± 0.091
2.772SerPhe: 2.772 ± 0.073
4.656SerGly: 4.656 ± 0.098
0.791SerHis: 0.791 ± 0.036
3.68SerIle: 3.68 ± 0.089
3.36SerLys: 3.36 ± 0.067
5.437SerLeu: 5.437 ± 0.097
1.489SerMet: 1.489 ± 0.043
1.755SerAsn: 1.755 ± 0.059
2.445SerPro: 2.445 ± 0.068
1.1SerGln: 1.1 ± 0.042
3.027SerArg: 3.027 ± 0.065
3.34SerSer: 3.34 ± 0.094
2.452SerThr: 2.452 ± 0.073
4.227SerVal: 4.227 ± 0.098
0.564SerTrp: 0.564 ± 0.029
2.099SerTyr: 2.099 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
3.826ThrAla: 3.826 ± 0.082
0.462ThrCys: 0.462 ± 0.024
1.811ThrAsp: 1.811 ± 0.06
2.765ThrGlu: 2.765 ± 0.063
2.029ThrPhe: 2.029 ± 0.055
3.577ThrGly: 3.577 ± 0.08
0.808ThrHis: 0.808 ± 0.041
3.04ThrIle: 3.04 ± 0.076
2.24ThrLys: 2.24 ± 0.067
4.242ThrLeu: 4.242 ± 0.089
0.97ThrMet: 0.97 ± 0.042
1.326ThrAsn: 1.326 ± 0.052
2.305ThrPro: 2.305 ± 0.06
0.761ThrGln: 0.761 ± 0.034
1.731ThrArg: 1.731 ± 0.048
2.396ThrSer: 2.396 ± 0.053
2.259ThrThr: 2.259 ± 0.077
3.282ThrVal: 3.282 ± 0.08
0.387ThrTrp: 0.387 ± 0.024
1.442ThrTyr: 1.442 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
6.819ValAla: 6.819 ± 0.101
0.988ValCys: 0.988 ± 0.043
4.532ValAsp: 4.532 ± 0.088
8.954ValGlu: 8.954 ± 0.118
3.826ValPhe: 3.826 ± 0.085
5.849ValGly: 5.849 ± 0.106
1.156ValHis: 1.156 ± 0.044
5.784ValIle: 5.784 ± 0.095
6.136ValLys: 6.136 ± 0.111
7.344ValLeu: 7.344 ± 0.118
2.169ValMet: 2.169 ± 0.058
2.71ValAsn: 2.71 ± 0.059
2.851ValPro: 2.851 ± 0.072
1.359ValGln: 1.359 ± 0.047
4.908ValArg: 4.908 ± 0.101
4.788ValSer: 4.788 ± 0.1
3.411ValThr: 3.411 ± 0.076
8.741ValVal: 8.741 ± 0.149
0.843ValTrp: 0.843 ± 0.039
3.058ValTyr: 3.058 ± 0.068
0.0ValXaa: 0.0 ± 0.0
Trp
0.652TrpAla: 0.652 ± 0.034
0.13TrpCys: 0.13 ± 0.015
0.586TrpAsp: 0.586 ± 0.036
0.754TrpGlu: 0.754 ± 0.033
0.465TrpPhe: 0.465 ± 0.029
0.735TrpGly: 0.735 ± 0.033
0.186TrpHis: 0.186 ± 0.017
0.752TrpIle: 0.752 ± 0.032
0.802TrpLys: 0.802 ± 0.039
1.07TrpLeu: 1.07 ± 0.048
0.327TrpMet: 0.327 ± 0.023
0.422TrpAsn: 0.422 ± 0.028
0.236TrpPro: 0.236 ± 0.019
0.259TrpGln: 0.259 ± 0.022
0.643TrpArg: 0.643 ± 0.033
0.537TrpSer: 0.537 ± 0.029
0.375TrpThr: 0.375 ± 0.027
0.838TrpVal: 0.838 ± 0.034
0.238TrpTrp: 0.238 ± 0.019
0.381TrpTyr: 0.381 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.837TyrAla: 2.837 ± 0.061
0.534TyrCys: 0.534 ± 0.032
1.955TyrAsp: 1.955 ± 0.05
2.857TyrGlu: 2.857 ± 0.07
1.848TyrPhe: 1.848 ± 0.059
2.93TyrGly: 2.93 ± 0.063
0.649TyrHis: 0.649 ± 0.03
2.326TyrIle: 2.326 ± 0.061
1.628TyrLys: 1.628 ± 0.055
3.538TyrLeu: 3.538 ± 0.088
0.67TyrMet: 0.67 ± 0.03
1.108TyrAsn: 1.108 ± 0.046
1.713TyrPro: 1.713 ± 0.054
0.675TyrGln: 0.675 ± 0.03
2.308TyrArg: 2.308 ± 0.059
2.337TyrSer: 2.337 ± 0.065
1.731TyrThr: 1.731 ± 0.055
2.728TyrVal: 2.728 ± 0.067
0.492TyrTrp: 0.492 ± 0.032
1.604TyrTyr: 1.604 ± 0.06
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2394 proteins (660812 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski