Amino acid dipeptide frequency for Methanophagales archaeon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
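As an illustration of the idea, the following minimal sketch counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function names, the toy sequences, and the rule that any non-standard letter maps to Xaa are assumptions made for this example; this is not necessarily how the values on this page were computed.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters is Xaa.
            a = ONE_TO_THREE.get(first, "Xaa")
            b = ONE_TO_THREE.get(second, "Xaa")
            counts[a + b] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


toy_proteome = ["MAAL", "MKX"]  # hypothetical sequences, not real data
freqs = dipeptide_per_mille(toy_proteome)
print(freqs["AlaAla"], freqs["LysXaa"])
```

Pairs are counted only inside a protein, so the last residue of one sequence is never paired with the first residue of the next.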

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
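The arithmetic behind the expected value can be sketched as follows. The `classify` helper is hypothetical, and it assumes that the colouring compares each value with the uniform baseline of 1/400; the page does not state the exact rule used.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 expressed in per mille: 2.5 per mille, i.e. 0.25%.
expected_per_mille = 1000.0 / N_STANDARD ** 2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values copied from the table on this page.
examples = {"GluLeu": 8.686, "AlaAla": 5.83, "TrpPro": 0.147}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_per_mille, labels)
```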

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
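A short sketch of the unit conversion, using the AlaAla value from the table as the example; the helper names are chosen here for illustration only.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 5.83  # per mille, taken from the table
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value in percent
```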

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.83 ± 0.149
AlaCys: 1.041 ± 0.048
AlaAsp: 4.115 ± 0.094
AlaGlu: 5.689 ± 0.121
AlaPhe: 2.915 ± 0.085
AlaGly: 6.006 ± 0.128
AlaHis: 1.366 ± 0.053
AlaIle: 6.351 ± 0.125
AlaLys: 4.604 ± 0.106
AlaLeu: 7.564 ± 0.155
AlaMet: 2.107 ± 0.062
AlaAsn: 2.534 ± 0.09
AlaPro: 2.355 ± 0.066
AlaGln: 1.48 ± 0.058
AlaArg: 4.401 ± 0.127
AlaSer: 4.688 ± 0.088
AlaThr: 3.764 ± 0.087
AlaVal: 5.916 ± 0.12
AlaTrp: 0.717 ± 0.043
AlaTyr: 2.297 ± 0.071
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.907 ± 0.039
CysCys: 0.23 ± 0.022
CysAsp: 0.745 ± 0.038
CysGlu: 0.907 ± 0.048
CysPhe: 0.439 ± 0.036
CysGly: 1.366 ± 0.067
CysHis: 0.373 ± 0.043
CysIle: 1.012 ± 0.045
CysLys: 0.687 ± 0.036
CysLeu: 0.894 ± 0.039
CysMet: 0.338 ± 0.028
CysAsn: 0.636 ± 0.038
CysPro: 0.747 ± 0.045
CysGln: 0.216 ± 0.018
CysArg: 0.614 ± 0.033
CysSer: 0.817 ± 0.043
CysThr: 0.691 ± 0.04
CysVal: 0.86 ± 0.04
CysTrp: 0.153 ± 0.017
CysTyr: 0.48 ± 0.029
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.698 ± 0.101
AspCys: 0.597 ± 0.035
AspAsp: 2.428 ± 0.087
AspGlu: 4.263 ± 0.095
AspPhe: 2.38 ± 0.072
AspGly: 3.835 ± 0.128
AspHis: 0.575 ± 0.033
AspIle: 4.686 ± 0.092
AspLys: 3.344 ± 0.08
AspLeu: 4.449 ± 0.09
AspMet: 1.467 ± 0.058
AspAsn: 1.896 ± 0.085
AspPro: 2.055 ± 0.082
AspGln: 0.53 ± 0.028
AspArg: 2.917 ± 0.077
AspSer: 2.695 ± 0.081
AspThr: 2.887 ± 0.099
AspVal: 4.293 ± 0.095
AspTrp: 0.732 ± 0.046
AspTyr: 2.087 ± 0.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.942 ± 0.142
GluCys: 0.84 ± 0.033
GluAsp: 4.098 ± 0.094
GluGlu: 8.329 ± 0.189
GluPhe: 2.71 ± 0.065
GluGly: 5.552 ± 0.13
GluHis: 1.68 ± 0.069
GluIle: 6.973 ± 0.12
GluLys: 5.373 ± 0.136
GluLeu: 8.686 ± 0.196
GluMet: 2.243 ± 0.069
GluAsn: 2.82 ± 0.087
GluPro: 2.697 ± 0.076
GluGln: 1.984 ± 0.076
GluArg: 5.842 ± 0.13
GluSer: 3.735 ± 0.099
GluThr: 3.115 ± 0.087
GluVal: 6.185 ± 0.113
GluTrp: 0.855 ± 0.041
GluTyr: 2.699 ± 0.074
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.794 ± 0.077
PheCys: 0.58 ± 0.036
PheAsp: 2.225 ± 0.073
PheGlu: 2.755 ± 0.081
PhePhe: 1.823 ± 0.07
PheGly: 3.007 ± 0.085
PheHis: 0.687 ± 0.038
PheIle: 3.022 ± 0.081
PheLys: 2.003 ± 0.059
PheLeu: 3.531 ± 0.11
PheMet: 1.041 ± 0.05
PheAsn: 1.474 ± 0.06
PhePro: 1.334 ± 0.053
PheGln: 0.754 ± 0.039
PheArg: 1.81 ± 0.061
PheSer: 2.742 ± 0.076
PheThr: 2.062 ± 0.062
PheVal: 2.893 ± 0.07
PheTrp: 0.388 ± 0.028
PheTyr: 1.497 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.618 ± 0.121
GlyCys: 1.135 ± 0.055
GlyAsp: 4.317 ± 0.111
GlyGlu: 6.01 ± 0.122
GlyPhe: 3.003 ± 0.079
GlyGly: 5.726 ± 0.155
GlyHis: 1.234 ± 0.054
GlyIle: 6.924 ± 0.129
GlyLys: 5.222 ± 0.115
GlyLeu: 6.211 ± 0.119
GlyMet: 2.322 ± 0.081
GlyAsn: 2.76 ± 0.088
GlyPro: 1.605 ± 0.064
GlyGln: 1.069 ± 0.049
GlyArg: 4.065 ± 0.09
GlySer: 4.253 ± 0.118
GlyThr: 4.3 ± 0.1
GlyVal: 6.114 ± 0.134
GlyTrp: 0.853 ± 0.039
GlyTyr: 3.068 ± 0.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.398 ± 0.057
HisCys: 0.282 ± 0.023
HisAsp: 0.965 ± 0.042
HisGlu: 1.331 ± 0.049
HisPhe: 0.799 ± 0.039
HisGly: 1.48 ± 0.056
HisHis: 0.487 ± 0.036
HisIle: 1.405 ± 0.056
HisLys: 0.881 ± 0.038
HisLeu: 1.592 ± 0.062
HisMet: 0.269 ± 0.021
HisAsn: 0.709 ± 0.036
HisPro: 1.034 ± 0.055
HisGln: 0.444 ± 0.028
HisArg: 1.137 ± 0.042
HisSer: 1.084 ± 0.053
HisThr: 1.019 ± 0.044
HisVal: 1.321 ± 0.048
HisTrp: 0.215 ± 0.027
HisTyr: 0.608 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.21 ± 0.122
IleCys: 1.051 ± 0.047
IleAsp: 4.188 ± 0.114
IleGlu: 6.971 ± 0.14
IlePhe: 2.902 ± 0.087
IleGly: 5.969 ± 0.109
IleHis: 1.305 ± 0.047
IleIle: 5.621 ± 0.139
IleLys: 5.097 ± 0.115
IleLeu: 6.709 ± 0.155
IleMet: 1.68 ± 0.057
IleAsn: 2.686 ± 0.102
IlePro: 3.951 ± 0.093
IleGln: 1.469 ± 0.054
IleArg: 4.199 ± 0.094
IleSer: 4.892 ± 0.1
IleThr: 4.559 ± 0.108
IleVal: 5.265 ± 0.109
IleTrp: 0.633 ± 0.036
IleTyr: 2.551 ± 0.085
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.5 ± 0.107
LysCys: 0.625 ± 0.033
LysAsp: 3.221 ± 0.089
LysGlu: 6.605 ± 0.165
LysPhe: 1.932 ± 0.067
LysGly: 4.503 ± 0.104
LysHis: 1.006 ± 0.042
LysIle: 4.535 ± 0.1
LysLys: 4.578 ± 0.11
LysLeu: 5.341 ± 0.099
LysMet: 1.717 ± 0.061
LysAsn: 2.299 ± 0.074
LysPro: 2.437 ± 0.073
LysGln: 1.45 ± 0.053
LysArg: 5.015 ± 0.116
LysSer: 3.137 ± 0.083
LysThr: 2.936 ± 0.066
LysVal: 4.266 ± 0.097
LysTrp: 0.593 ± 0.034
LysTyr: 1.937 ± 0.072
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.816 ± 0.137
LeuCys: 1.125 ± 0.054
LeuAsp: 4.431 ± 0.095
LeuGlu: 6.504 ± 0.122
LeuPhe: 3.639 ± 0.092
LeuGly: 6.258 ± 0.141
LeuHis: 1.82 ± 0.057
LeuIle: 6.694 ± 0.132
LeuLys: 6.831 ± 0.148
LeuLeu: 8.313 ± 0.182
LeuMet: 2.434 ± 0.072
LeuAsn: 3.733 ± 0.094
LeuPro: 3.552 ± 0.093
LeuGln: 1.822 ± 0.056
LeuArg: 6.054 ± 0.128
LeuSer: 6.181 ± 0.112
LeuThr: 4.369 ± 0.105
LeuVal: 5.771 ± 0.113
LeuTrp: 0.875 ± 0.04
LeuTyr: 2.88 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.719 ± 0.063
MetCys: 0.25 ± 0.023
MetAsp: 1.437 ± 0.056
MetGlu: 2.124 ± 0.067
MetPhe: 0.7 ± 0.037
MetGly: 1.907 ± 0.063
MetHis: 0.569 ± 0.037
MetIle: 1.624 ± 0.057
MetLys: 2.047 ± 0.057
MetLeu: 2.503 ± 0.072
MetMet: 0.689 ± 0.04
MetAsn: 1.124 ± 0.051
MetPro: 1.148 ± 0.046
MetGln: 0.616 ± 0.04
MetArg: 1.924 ± 0.064
MetSer: 1.459 ± 0.051
MetThr: 1.14 ± 0.051
MetVal: 2.001 ± 0.06
MetTrp: 0.203 ± 0.021
MetTyr: 0.648 ± 0.036
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.171 ± 0.091
AsnCys: 0.571 ± 0.035
AsnAsp: 1.646 ± 0.063
AsnGlu: 2.76 ± 0.099
AsnPhe: 1.396 ± 0.053
AsnGly: 2.798 ± 0.102
AsnHis: 0.593 ± 0.035
AsnIle: 3.025 ± 0.103
AsnLys: 2.096 ± 0.073
AsnLeu: 3.356 ± 0.088
AsnMet: 0.868 ± 0.043
AsnAsn: 1.969 ± 0.181
AsnPro: 2.023 ± 0.069
AsnGln: 0.709 ± 0.04
AsnArg: 2.245 ± 0.077
AsnSer: 2.081 ± 0.086
AsnThr: 2.144 ± 0.098
AsnVal: 2.704 ± 0.106
AsnTrp: 0.549 ± 0.038
AsnTyr: 1.461 ± 0.062
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.844 ± 0.083
ProCys: 0.543 ± 0.035
ProAsp: 2.492 ± 0.093
ProGlu: 3.751 ± 0.105
ProPhe: 1.754 ± 0.058
ProGly: 3.188 ± 0.082
ProHis: 0.825 ± 0.042
ProIle: 2.521 ± 0.067
ProLys: 1.855 ± 0.06
ProLeu: 3.432 ± 0.083
ProMet: 0.836 ± 0.041
ProAsn: 1.2 ± 0.056
ProPro: 1.915 ± 0.077
ProGln: 0.875 ± 0.041
ProArg: 1.711 ± 0.055
ProSer: 2.225 ± 0.066
ProThr: 2.029 ± 0.088
ProVal: 3.406 ± 0.074
ProTrp: 0.467 ± 0.036
ProTyr: 1.53 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.312 ± 0.059
GlnCys: 0.231 ± 0.021
GlnAsp: 0.89 ± 0.039
GlnGlu: 1.364 ± 0.045
GlnPhe: 0.737 ± 0.042
GlnGly: 1.208 ± 0.056
GlnHis: 0.474 ± 0.033
GlnIle: 1.64 ± 0.061
GlnLys: 1.206 ± 0.055
GlnLeu: 2.059 ± 0.068
GlnMet: 0.61 ± 0.043
GlnAsn: 0.849 ± 0.041
GlnPro: 0.804 ± 0.049
GlnGln: 0.67 ± 0.037
GlnArg: 1.4 ± 0.06
GlnSer: 1.051 ± 0.048
GlnThr: 0.89 ± 0.046
GlnVal: 1.288 ± 0.051
GlnTrp: 0.252 ± 0.027
GlnTyr: 0.689 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.539 ± 0.092
ArgCys: 0.84 ± 0.046
ArgAsp: 3.455 ± 0.086
ArgGlu: 5.756 ± 0.133
ArgPhe: 2.286 ± 0.075
ArgGly: 4.744 ± 0.106
ArgHis: 1.004 ± 0.046
ArgIle: 5.27 ± 0.114
ArgLys: 3.652 ± 0.096
ArgLeu: 5.108 ± 0.108
ArgMet: 1.836 ± 0.066
ArgAsn: 2.131 ± 0.063
ArgPro: 1.482 ± 0.057
ArgGln: 1.112 ± 0.055
ArgArg: 4.337 ± 0.117
ArgSer: 2.867 ± 0.071
ArgThr: 2.492 ± 0.067
ArgVal: 4.481 ± 0.116
ArgTrp: 0.719 ± 0.043
ArgTyr: 2.473 ± 0.076
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.214 ± 0.104
SerCys: 0.747 ± 0.041
SerAsp: 3.227 ± 0.083
SerGlu: 4.442 ± 0.096
SerPhe: 2.437 ± 0.066
SerGly: 5.209 ± 0.103
SerHis: 1.058 ± 0.051
SerIle: 4.304 ± 0.096
SerLys: 3.005 ± 0.086
SerLeu: 5.27 ± 0.103
SerMet: 1.331 ± 0.051
SerAsn: 2.202 ± 0.118
SerPro: 2.499 ± 0.079
SerGln: 1.043 ± 0.043
SerArg: 3.171 ± 0.089
SerSer: 3.766 ± 0.116
SerThr: 2.917 ± 0.078
SerVal: 4.289 ± 0.112
SerTrp: 0.635 ± 0.037
SerTyr: 2.199 ± 0.072
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.908 ± 0.087
ThrCys: 0.629 ± 0.038
ThrAsp: 2.518 ± 0.081
ThrGlu: 3.488 ± 0.076
ThrPhe: 1.874 ± 0.066
ThrGly: 4.75 ± 0.118
ThrHis: 0.97 ± 0.044
ThrIle: 3.919 ± 0.114
ThrLys: 2.49 ± 0.067
ThrLeu: 4.349 ± 0.09
ThrMet: 1.116 ± 0.044
ThrAsn: 1.993 ± 0.096
ThrPro: 2.818 ± 0.101
ThrGln: 0.941 ± 0.043
ThrArg: 2.641 ± 0.072
ThrSer: 2.915 ± 0.094
ThrThr: 2.757 ± 0.094
ThrVal: 3.897 ± 0.159
ThrTrp: 0.538 ± 0.041
ThrTyr: 1.689 ± 0.076
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.448 ± 0.12
ValCys: 1.058 ± 0.048
ValAsp: 3.953 ± 0.075
ValGlu: 5.924 ± 0.112
ValPhe: 2.833 ± 0.085
ValGly: 4.763 ± 0.113
ValHis: 1.364 ± 0.053
ValIle: 6.073 ± 0.127
ValLys: 4.981 ± 0.102
ValLeu: 6.431 ± 0.113
ValMet: 1.861 ± 0.065
ValAsn: 2.908 ± 0.101
ValPro: 2.982 ± 0.088
ValGln: 1.34 ± 0.053
ValArg: 4.3 ± 0.102
ValSer: 4.686 ± 0.11
ValThr: 3.805 ± 0.122
ValVal: 5.97 ± 0.107
ValTrp: 0.681 ± 0.043
ValTyr: 2.576 ± 0.077
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.567 ± 0.036
TrpCys: 0.157 ± 0.02
TrpAsp: 0.661 ± 0.042
TrpGlu: 0.672 ± 0.041
TrpPhe: 0.418 ± 0.029
TrpGly: 0.739 ± 0.041
TrpHis: 0.261 ± 0.022
TrpIle: 0.847 ± 0.042
TrpLys: 0.592 ± 0.037
TrpLeu: 1.099 ± 0.048
TrpMet: 0.314 ± 0.025
TrpAsn: 0.644 ± 0.046
TrpPro: 0.147 ± 0.017
TrpGln: 0.317 ± 0.028
TrpArg: 0.676 ± 0.035
TrpSer: 0.631 ± 0.039
TrpThr: 0.534 ± 0.051
TrpVal: 0.78 ± 0.036
TrpTrp: 0.196 ± 0.022
TrpTyr: 0.45 ± 0.036
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.309 ± 0.069
TyrCys: 0.579 ± 0.036
TyrAsp: 1.803 ± 0.067
TyrGlu: 2.714 ± 0.076
TyrPhe: 1.445 ± 0.056
TyrGly: 2.803 ± 0.08
TyrHis: 0.808 ± 0.042
TyrIle: 2.458 ± 0.071
TyrLys: 2.092 ± 0.072
TyrLeu: 3.203 ± 0.09
TyrMet: 0.752 ± 0.042
TyrAsn: 1.598 ± 0.069
TyrPro: 1.814 ± 0.057
TyrGln: 0.782 ± 0.035
TyrArg: 2.126 ± 0.061
TyrSer: 2.059 ± 0.068
TyrThr: 1.788 ± 0.064
TyrVal: 2.238 ± 0.062
TyrTrp: 0.45 ± 0.035
TyrTyr: 1.417 ± 0.062
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1797 proteins (535814 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
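A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names and toy sequences are hypothetical, and using the standard deviation across replicates as the reported error is an assumption; the note above does not specify which statistic is shown.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide given as two letters, e.g. 'AA'."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement; return (estimate, spread)."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_replicates):
        # Each replicate has as many proteins as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates.
    return pair_per_mille(proteins, pair), statistics.stdev(replicates)


toy_proteome = ["MAAL", "MKAA", "AAAA", "MKLV"]  # hypothetical sequences
estimate, error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {estimate:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.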

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski