Amino acid dipepetide frequency for Candidatus Bathyarchaeota archaeon

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.633AlaAla: 5.633 ± 0.121
0.895AlaCys: 0.895 ± 0.039
3.755AlaAsp: 3.755 ± 0.083
4.516AlaGlu: 4.516 ± 0.108
2.943AlaPhe: 2.943 ± 0.083
4.995AlaGly: 4.995 ± 0.101
1.107AlaHis: 1.107 ± 0.038
5.529AlaIle: 5.529 ± 0.1
5.211AlaLys: 5.211 ± 0.109
7.259AlaLeu: 7.259 ± 0.126
1.652AlaMet: 1.652 ± 0.044
2.668AlaAsn: 2.668 ± 0.06
2.496AlaPro: 2.496 ± 0.081
2.264AlaGln: 2.264 ± 0.056
2.665AlaArg: 2.665 ± 0.06
4.409AlaSer: 4.409 ± 0.085
4.485AlaThr: 4.485 ± 0.087
6.374AlaVal: 6.374 ± 0.111
0.734AlaTrp: 0.734 ± 0.036
2.461AlaTyr: 2.461 ± 0.066
0.0AlaXaa: 0.0 ± 0.0
Cys
0.755CysAla: 0.755 ± 0.035
0.199CysCys: 0.199 ± 0.019
0.698CysAsp: 0.698 ± 0.031
0.724CysGlu: 0.724 ± 0.041
0.506CysPhe: 0.506 ± 0.029
1.33CysGly: 1.33 ± 0.055
0.246CysHis: 0.246 ± 0.022
0.766CysIle: 0.766 ± 0.039
0.738CysLys: 0.738 ± 0.038
1.085CysLeu: 1.085 ± 0.046
0.282CysMet: 0.282 ± 0.02
0.635CysAsn: 0.635 ± 0.038
0.886CysPro: 0.886 ± 0.046
0.324CysGln: 0.324 ± 0.023
0.503CysArg: 0.503 ± 0.029
0.889CysSer: 0.889 ± 0.046
0.642CysThr: 0.642 ± 0.029
0.917CysVal: 0.917 ± 0.04
0.178CysTrp: 0.178 ± 0.017
0.547CysTyr: 0.547 ± 0.03
0.0CysXaa: 0.0 ± 0.0
Asp
4.116AspAla: 4.116 ± 0.08
0.69AspCys: 0.69 ± 0.037
2.436AspAsp: 2.436 ± 0.078
3.478AspGlu: 3.478 ± 0.074
2.387AspPhe: 2.387 ± 0.072
3.992AspGly: 3.992 ± 0.13
0.788AspHis: 0.788 ± 0.035
3.722AspIle: 3.722 ± 0.07
2.912AspLys: 2.912 ± 0.077
4.814AspLeu: 4.814 ± 0.083
1.227AspMet: 1.227 ± 0.038
2.106AspAsn: 2.106 ± 0.067
2.437AspPro: 2.437 ± 0.065
1.282AspGln: 1.282 ± 0.041
1.774AspArg: 1.774 ± 0.053
3.275AspSer: 3.275 ± 0.092
2.713AspThr: 2.713 ± 0.078
4.577AspVal: 4.577 ± 0.099
0.783AspTrp: 0.783 ± 0.034
2.411AspTyr: 2.411 ± 0.067
0.0AspXaa: 0.0 ± 0.0
Glu
5.167GluAla: 5.167 ± 0.114
0.642GluCys: 0.642 ± 0.032
3.116GluAsp: 3.116 ± 0.077
4.826GluGlu: 4.826 ± 0.116
2.352GluPhe: 2.352 ± 0.054
3.682GluGly: 3.682 ± 0.093
1.227GluHis: 1.227 ± 0.043
4.92GluIle: 4.92 ± 0.1
5.607GluLys: 5.607 ± 0.155
6.406GluLeu: 6.406 ± 0.12
1.603GluMet: 1.603 ± 0.05
3.429GluAsn: 3.429 ± 0.087
2.221GluPro: 2.221 ± 0.066
2.585GluGln: 2.585 ± 0.062
2.676GluArg: 2.676 ± 0.073
3.361GluSer: 3.361 ± 0.08
4.722GluThr: 4.722 ± 0.1
4.403GluVal: 4.403 ± 0.085
0.808GluTrp: 0.808 ± 0.035
2.127GluTyr: 2.127 ± 0.063
0.0GluXaa: 0.0 ± 0.0
Phe
2.827PheAla: 2.827 ± 0.066
0.582PheCys: 0.582 ± 0.032
2.456PheAsp: 2.456 ± 0.06
2.638PheGlu: 2.638 ± 0.066
1.975PhePhe: 1.975 ± 0.062
3.476PheGly: 3.476 ± 0.085
0.696PheHis: 0.696 ± 0.03
2.518PheIle: 2.518 ± 0.065
2.205PheLys: 2.205 ± 0.062
4.082PheLeu: 4.082 ± 0.095
0.983PheMet: 0.983 ± 0.037
1.668PheAsn: 1.668 ± 0.043
1.705PhePro: 1.705 ± 0.052
1.093PheGln: 1.093 ± 0.047
1.726PheArg: 1.726 ± 0.059
3.362PheSer: 3.362 ± 0.074
2.526PheThr: 2.526 ± 0.071
3.412PheVal: 3.412 ± 0.083
0.614PheTrp: 0.614 ± 0.032
1.366PheTyr: 1.366 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.94GlyAla: 4.94 ± 0.114
0.984GlyCys: 0.984 ± 0.05
3.369GlyAsp: 3.369 ± 0.077
4.115GlyGlu: 4.115 ± 0.09
3.196GlyPhe: 3.196 ± 0.072
4.99GlyGly: 4.99 ± 0.129
1.16GlyHis: 1.16 ± 0.044
5.315GlyIle: 5.315 ± 0.099
5.118GlyLys: 5.118 ± 0.11
6.03GlyLeu: 6.03 ± 0.105
1.564GlyMet: 1.564 ± 0.048
3.129GlyAsn: 3.129 ± 0.088
2.046GlyPro: 2.046 ± 0.055
1.649GlyGln: 1.649 ± 0.051
2.71GlyArg: 2.71 ± 0.063
4.635GlySer: 4.635 ± 0.115
4.806GlyThr: 4.806 ± 0.126
5.663GlyVal: 5.663 ± 0.103
1.076GlyTrp: 1.076 ± 0.041
3.048GlyTyr: 3.048 ± 0.086
0.0GlyXaa: 0.0 ± 0.0
His
1.079HisAla: 1.079 ± 0.039
0.269HisCys: 0.269 ± 0.022
0.925HisAsp: 0.925 ± 0.038
1.045HisGlu: 1.045 ± 0.037
0.758HisPhe: 0.758 ± 0.034
1.342HisGly: 1.342 ± 0.047
0.34HisHis: 0.34 ± 0.024
1.174HisIle: 1.174 ± 0.044
1.109HisLys: 1.109 ± 0.043
1.554HisLeu: 1.554 ± 0.049
0.414HisMet: 0.414 ± 0.023
0.702HisAsn: 0.702 ± 0.033
0.911HisPro: 0.911 ± 0.039
0.455HisGln: 0.455 ± 0.024
0.705HisArg: 0.705 ± 0.031
1.143HisSer: 1.143 ± 0.041
0.92HisThr: 0.92 ± 0.041
1.269HisVal: 1.269 ± 0.048
0.206HisTrp: 0.206 ± 0.02
0.592HisTyr: 0.592 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
5.392IleAla: 5.392 ± 0.102
0.846IleCys: 0.846 ± 0.035
3.967IleAsp: 3.967 ± 0.087
4.73IleGlu: 4.73 ± 0.096
2.922IlePhe: 2.922 ± 0.066
4.976IleGly: 4.976 ± 0.101
1.271IleHis: 1.271 ± 0.05
5.398IleIle: 5.398 ± 0.116
4.382IleLys: 4.382 ± 0.096
6.848IleLeu: 6.848 ± 0.113
1.534IleMet: 1.534 ± 0.044
2.892IleAsn: 2.892 ± 0.069
3.157IlePro: 3.157 ± 0.072
2.3IleGln: 2.3 ± 0.062
2.979IleArg: 2.979 ± 0.067
4.962IleSer: 4.962 ± 0.095
4.47IleThr: 4.47 ± 0.09
5.755IleVal: 5.755 ± 0.102
0.76IleTrp: 0.76 ± 0.042
2.264IleTyr: 2.264 ± 0.07
0.0IleXaa: 0.0 ± 0.0
Lys
4.792LysAla: 4.792 ± 0.095
0.824LysCys: 0.824 ± 0.048
3.283LysAsp: 3.283 ± 0.089
5.138LysGlu: 5.138 ± 0.124
2.127LysPhe: 2.127 ± 0.06
3.809LysGly: 3.809 ± 0.091
1.26LysHis: 1.26 ± 0.048
5.322LysIle: 5.322 ± 0.111
6.231LysLys: 6.231 ± 0.131
6.42LysLeu: 6.42 ± 0.115
2.046LysMet: 2.046 ± 0.058
3.367LysAsn: 3.367 ± 0.076
2.722LysPro: 2.722 ± 0.073
2.715LysGln: 2.715 ± 0.07
3.596LysArg: 3.596 ± 0.084
3.453LysSer: 3.453 ± 0.083
4.917LysThr: 4.917 ± 0.095
4.635LysVal: 4.635 ± 0.108
0.642LysTrp: 0.642 ± 0.03
1.948LysTyr: 1.948 ± 0.056
0.0LysXaa: 0.0 ± 0.0
Leu
7.081LeuAla: 7.081 ± 0.129
1.118LeuCys: 1.118 ± 0.054
5.055LeuAsp: 5.055 ± 0.086
6.536LeuGlu: 6.536 ± 0.101
3.96LeuPhe: 3.96 ± 0.098
6.544LeuGly: 6.544 ± 0.115
1.536LeuHis: 1.536 ± 0.051
6.105LeuIle: 6.105 ± 0.117
6.614LeuLys: 6.614 ± 0.122
8.651LeuLeu: 8.651 ± 0.166
1.997LeuMet: 1.997 ± 0.058
3.982LeuAsn: 3.982 ± 0.087
3.599LeuPro: 3.599 ± 0.073
2.923LeuGln: 2.923 ± 0.078
4.18LeuArg: 4.18 ± 0.081
6.335LeuSer: 6.335 ± 0.105
5.915LeuThr: 5.915 ± 0.115
7.131LeuVal: 7.131 ± 0.12
1.177LeuTrp: 1.177 ± 0.056
2.591LeuTyr: 2.591 ± 0.062
0.0LeuXaa: 0.0 ± 0.0
Met
1.638MetAla: 1.638 ± 0.054
0.252MetCys: 0.252 ± 0.022
1.268MetAsp: 1.268 ± 0.046
1.411MetGlu: 1.411 ± 0.048
0.914MetPhe: 0.914 ± 0.037
1.722MetGly: 1.722 ± 0.045
0.417MetHis: 0.417 ± 0.021
1.492MetIle: 1.492 ± 0.049
1.941MetLys: 1.941 ± 0.059
2.23MetLeu: 2.23 ± 0.055
0.607MetMet: 0.607 ± 0.029
1.14MetAsn: 1.14 ± 0.048
1.1MetPro: 1.1 ± 0.039
0.796MetGln: 0.796 ± 0.036
0.976MetArg: 0.976 ± 0.042
1.617MetSer: 1.617 ± 0.054
1.478MetThr: 1.478 ± 0.048
1.73MetVal: 1.73 ± 0.055
0.274MetTrp: 0.274 ± 0.02
0.582MetTyr: 0.582 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
2.774AsnAla: 2.774 ± 0.068
0.66AsnCys: 0.66 ± 0.035
1.775AsnAsp: 1.775 ± 0.07
2.718AsnGlu: 2.718 ± 0.071
1.816AsnPhe: 1.816 ± 0.059
3.066AsnGly: 3.066 ± 0.075
0.676AsnHis: 0.676 ± 0.03
3.39AsnIle: 3.39 ± 0.084
2.886AsnLys: 2.886 ± 0.082
4.199AsnLeu: 4.199 ± 0.087
1.146AsnMet: 1.146 ± 0.044
2.367AsnAsn: 2.367 ± 0.114
2.336AsnPro: 2.336 ± 0.06
1.54AsnGln: 1.54 ± 0.055
1.819AsnArg: 1.819 ± 0.05
2.81AsnSer: 2.81 ± 0.07
2.554AsnThr: 2.554 ± 0.083
3.666AsnVal: 3.666 ± 0.083
0.671AsnTrp: 0.671 ± 0.034
1.749AsnTyr: 1.749 ± 0.059
0.0AsnXaa: 0.0 ± 0.0
Pro
2.701ProAla: 2.701 ± 0.072
0.458ProCys: 0.458 ± 0.027
2.403ProAsp: 2.403 ± 0.069
3.735ProGlu: 3.735 ± 0.089
1.814ProPhe: 1.814 ± 0.054
1.984ProGly: 1.984 ± 0.066
0.754ProHis: 0.754 ± 0.037
2.805ProIle: 2.805 ± 0.066
2.73ProLys: 2.73 ± 0.065
3.498ProLeu: 3.498 ± 0.077
0.883ProMet: 0.883 ± 0.039
1.786ProAsn: 1.786 ± 0.054
1.536ProPro: 1.536 ± 0.065
1.322ProGln: 1.322 ± 0.056
1.484ProArg: 1.484 ± 0.049
2.643ProSer: 2.643 ± 0.065
2.668ProThr: 2.668 ± 0.065
3.295ProVal: 3.295 ± 0.071
0.455ProTrp: 0.455 ± 0.032
1.35ProTyr: 1.35 ± 0.045
0.0ProXaa: 0.0 ± 0.0
Gln
2.096GlnAla: 2.096 ± 0.061
0.385GlnCys: 0.385 ± 0.027
1.31GlnAsp: 1.31 ± 0.051
1.969GlnGlu: 1.969 ± 0.056
1.154GlnPhe: 1.154 ± 0.043
1.659GlnGly: 1.659 ± 0.054
0.525GlnHis: 0.525 ± 0.029
2.444GlnIle: 2.444 ± 0.058
2.576GlnLys: 2.576 ± 0.069
2.887GlnLeu: 2.887 ± 0.071
0.796GlnMet: 0.796 ± 0.034
1.64GlnAsn: 1.64 ± 0.055
1.057GlnPro: 1.057 ± 0.042
1.216GlnGln: 1.216 ± 0.049
1.439GlnArg: 1.439 ± 0.05
1.777GlnSer: 1.777 ± 0.052
2.302GlnThr: 2.302 ± 0.065
2.207GlnVal: 2.207 ± 0.062
0.357GlnTrp: 0.357 ± 0.029
0.916GlnTyr: 0.916 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.686ArgAla: 2.686 ± 0.069
0.575ArgCys: 0.575 ± 0.033
1.995ArgAsp: 1.995 ± 0.067
2.637ArgGlu: 2.637 ± 0.082
1.93ArgPhe: 1.93 ± 0.054
2.68ArgGly: 2.68 ± 0.071
0.783ArgHis: 0.783 ± 0.037
3.034ArgIle: 3.034 ± 0.065
3.456ArgLys: 3.456 ± 0.073
4.031ArgLeu: 4.031 ± 0.089
1.064ArgMet: 1.064 ± 0.04
1.861ArgAsn: 1.861 ± 0.062
1.469ArgPro: 1.469 ± 0.047
1.275ArgGln: 1.275 ± 0.049
2.188ArgArg: 2.188 ± 0.067
2.141ArgSer: 2.141 ± 0.057
2.179ArgThr: 2.179 ± 0.063
3.023ArgVal: 3.023 ± 0.067
0.531ArgTrp: 0.531 ± 0.026
1.395ArgTyr: 1.395 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
4.314SerAla: 4.314 ± 0.089
0.813SerCys: 0.813 ± 0.037
3.333SerAsp: 3.333 ± 0.075
3.981SerGlu: 3.981 ± 0.083
3.062SerPhe: 3.062 ± 0.071
5.297SerGly: 5.297 ± 0.119
1.087SerHis: 1.087 ± 0.036
4.35SerIle: 4.35 ± 0.09
4.007SerLys: 4.007 ± 0.093
6.066SerLeu: 6.066 ± 0.106
1.515SerMet: 1.515 ± 0.048
2.701SerAsn: 2.701 ± 0.084
2.634SerPro: 2.634 ± 0.07
1.821SerGln: 1.821 ± 0.053
2.509SerArg: 2.509 ± 0.068
4.914SerSer: 4.914 ± 0.12
4.002SerThr: 4.002 ± 0.092
5.065SerVal: 5.065 ± 0.097
0.925SerTrp: 0.925 ± 0.039
2.355SerTyr: 2.355 ± 0.078
0.0SerXaa: 0.0 ± 0.0
Thr
4.509ThrAla: 4.509 ± 0.1
0.673ThrCys: 0.673 ± 0.034
3.218ThrAsp: 3.218 ± 0.086
4.171ThrGlu: 4.171 ± 0.104
2.599ThrPhe: 2.599 ± 0.069
5.009ThrGly: 5.009 ± 0.098
1.015ThrHis: 1.015 ± 0.043
5.138ThrIle: 5.138 ± 0.111
4.216ThrLys: 4.216 ± 0.083
5.857ThrLeu: 5.857 ± 0.107
1.395ThrMet: 1.395 ± 0.043
2.507ThrAsn: 2.507 ± 0.071
2.853ThrPro: 2.853 ± 0.091
1.948ThrGln: 1.948 ± 0.055
2.299ThrArg: 2.299 ± 0.066
3.76ThrSer: 3.76 ± 0.084
4.059ThrThr: 4.059 ± 0.102
5.744ThrVal: 5.744 ± 0.133
0.716ThrTrp: 0.716 ± 0.035
2.182ThrTyr: 2.182 ± 0.071
0.0ThrXaa: 0.0 ± 0.0
Val
6.34ValAla: 6.34 ± 0.092
1.209ValCys: 1.209 ± 0.052
4.666ValAsp: 4.666 ± 0.095
4.842ValGlu: 4.842 ± 0.108
3.425ValPhe: 3.425 ± 0.076
5.376ValGly: 5.376 ± 0.099
1.227ValHis: 1.227 ± 0.046
5.484ValIle: 5.484 ± 0.096
4.82ValLys: 4.82 ± 0.096
6.977ValLeu: 6.977 ± 0.12
1.718ValMet: 1.718 ± 0.06
3.543ValAsn: 3.543 ± 0.066
3.188ValPro: 3.188 ± 0.076
1.955ValGln: 1.955 ± 0.058
2.613ValArg: 2.613 ± 0.068
5.692ValSer: 5.692 ± 0.105
5.396ValThr: 5.396 ± 0.119
6.395ValVal: 6.395 ± 0.113
0.914ValTrp: 0.914 ± 0.043
2.996ValTyr: 2.996 ± 0.078
0.0ValXaa: 0.0 ± 0.0
Trp
0.766TrpAla: 0.766 ± 0.039
0.164TrpCys: 0.164 ± 0.017
0.744TrpAsp: 0.744 ± 0.039
0.699TrpGlu: 0.699 ± 0.034
0.562TrpPhe: 0.562 ± 0.032
0.93TrpGly: 0.93 ± 0.044
0.204TrpHis: 0.204 ± 0.02
0.843TrpIle: 0.843 ± 0.045
0.819TrpLys: 0.819 ± 0.033
1.02TrpLeu: 1.02 ± 0.039
0.396TrpMet: 0.396 ± 0.025
0.832TrpAsn: 0.832 ± 0.044
0.368TrpPro: 0.368 ± 0.029
0.378TrpGln: 0.378 ± 0.024
0.604TrpArg: 0.604 ± 0.029
0.93TrpSer: 0.93 ± 0.048
1.028TrpThr: 1.028 ± 0.051
0.804TrpVal: 0.804 ± 0.04
0.201TrpTrp: 0.201 ± 0.018
0.392TrpTyr: 0.392 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.445TyrAla: 2.445 ± 0.082
0.614TyrCys: 0.614 ± 0.029
2.149TyrAsp: 2.149 ± 0.059
1.972TyrGlu: 1.972 ± 0.058
1.508TyrPhe: 1.508 ± 0.048
2.683TyrGly: 2.683 ± 0.076
0.595TyrHis: 0.595 ± 0.03
2.056TyrIle: 2.056 ± 0.06
1.668TyrLys: 1.668 ± 0.051
3.119TyrLeu: 3.119 ± 0.067
0.763TyrMet: 0.763 ± 0.035
1.663TyrAsn: 1.663 ± 0.056
1.582TyrPro: 1.582 ± 0.049
0.867TyrGln: 0.867 ± 0.036
1.487TyrArg: 1.487 ± 0.055
2.643TyrSer: 2.643 ± 0.069
2.085TyrThr: 2.085 ± 0.077
2.711TyrVal: 2.711 ± 0.067
0.662TyrTrp: 0.662 ± 0.036
1.347TyrTyr: 1.347 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2329 proteins (642103 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski