Amino acid dipepetide frequency for Candidatus Nitrosarchaeum limnium BG20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.037AlaAla: 4.037 ± 0.124
0.658AlaCys: 0.658 ± 0.032
2.937AlaAsp: 2.937 ± 0.078
3.418AlaGlu: 3.418 ± 0.084
2.354AlaPhe: 2.354 ± 0.069
3.886AlaGly: 3.886 ± 0.104
0.961AlaHis: 0.961 ± 0.037
5.651AlaIle: 5.651 ± 0.099
5.412AlaLys: 5.412 ± 0.115
5.161AlaLeu: 5.161 ± 0.108
1.448AlaMet: 1.448 ± 0.059
2.544AlaAsn: 2.544 ± 0.078
1.725AlaPro: 1.725 ± 0.067
1.671AlaGln: 1.671 ± 0.058
2.091AlaArg: 2.091 ± 0.066
3.924AlaSer: 3.924 ± 0.095
3.286AlaThr: 3.286 ± 0.096
3.668AlaVal: 3.668 ± 0.092
0.457AlaTrp: 0.457 ± 0.032
1.773AlaTyr: 1.773 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.552CysAla: 0.552 ± 0.033
0.154CysCys: 0.154 ± 0.019
0.636CysAsp: 0.636 ± 0.035
0.587CysGlu: 0.587 ± 0.037
0.417CysPhe: 0.417 ± 0.029
0.945CysGly: 0.945 ± 0.046
0.238CysHis: 0.238 ± 0.023
0.74CysIle: 0.74 ± 0.034
0.989CysLys: 0.989 ± 0.045
0.645CysLeu: 0.645 ± 0.031
0.263CysMet: 0.263 ± 0.023
0.616CysAsn: 0.616 ± 0.044
0.711CysPro: 0.711 ± 0.093
0.28CysGln: 0.28 ± 0.023
0.38CysArg: 0.38 ± 0.022
0.78CysSer: 0.78 ± 0.041
0.568CysThr: 0.568 ± 0.037
0.636CysVal: 0.636 ± 0.039
0.088CysTrp: 0.088 ± 0.012
0.336CysTyr: 0.336 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
3.506AspAla: 3.506 ± 0.079
0.552AspCys: 0.552 ± 0.036
3.255AspAsp: 3.255 ± 0.094
4.067AspGlu: 4.067 ± 0.101
2.692AspPhe: 2.692 ± 0.068
3.878AspGly: 3.878 ± 0.141
0.901AspHis: 0.901 ± 0.044
4.924AspIle: 4.924 ± 0.097
4.483AspLys: 4.483 ± 0.101
4.82AspLeu: 4.82 ± 0.097
1.309AspMet: 1.309 ± 0.05
2.583AspAsn: 2.583 ± 0.08
2.281AspPro: 2.281 ± 0.066
1.566AspGln: 1.566 ± 0.054
1.718AspArg: 1.718 ± 0.059
4.386AspSer: 4.386 ± 0.109
2.762AspThr: 2.762 ± 0.082
4.253AspVal: 4.253 ± 0.081
0.574AspTrp: 0.574 ± 0.035
2.122AspTyr: 2.122 ± 0.058
0.0AspXaa: 0.0 ± 0.0
Glu
3.047GluAla: 3.047 ± 0.081
0.579GluCys: 0.579 ± 0.036
3.038GluAsp: 3.038 ± 0.08
4.368GluGlu: 4.368 ± 0.088
3.336GluPhe: 3.336 ± 0.089
2.999GluGly: 2.999 ± 0.074
1.289GluHis: 1.289 ± 0.051
7.26GluIle: 7.26 ± 0.119
7.444GluLys: 7.444 ± 0.134
6.031GluLeu: 6.031 ± 0.123
1.793GluMet: 1.793 ± 0.055
4.074GluAsn: 4.074 ± 0.089
1.928GluPro: 1.928 ± 0.072
2.275GluGln: 2.275 ± 0.067
2.347GluArg: 2.347 ± 0.068
4.664GluSer: 4.664 ± 0.085
3.202GluThr: 3.202 ± 0.094
3.266GluVal: 3.266 ± 0.075
0.59GluTrp: 0.59 ± 0.034
2.188GluTyr: 2.188 ± 0.069
0.0GluXaa: 0.0 ± 0.0
Phe
2.948PheAla: 2.948 ± 0.085
0.605PheCys: 0.605 ± 0.035
3.169PheAsp: 3.169 ± 0.068
3.045PheGlu: 3.045 ± 0.07
1.956PhePhe: 1.956 ± 0.075
3.222PheGly: 3.222 ± 0.079
0.848PheHis: 0.848 ± 0.04
2.954PheIle: 2.954 ± 0.085
2.769PheLys: 2.769 ± 0.08
4.074PheLeu: 4.074 ± 0.098
1.04PheMet: 1.04 ± 0.051
2.005PheAsn: 2.005 ± 0.056
1.537PhePro: 1.537 ± 0.063
1.321PheGln: 1.321 ± 0.046
1.329PheArg: 1.329 ± 0.046
3.763PheSer: 3.763 ± 0.088
2.705PheThr: 2.705 ± 0.072
3.374PheVal: 3.374 ± 0.076
0.439PheTrp: 0.439 ± 0.028
1.497PheTyr: 1.497 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
3.749GlyAla: 3.749 ± 0.113
0.76GlyCys: 0.76 ± 0.057
3.29GlyAsp: 3.29 ± 0.099
3.156GlyGlu: 3.156 ± 0.081
3.219GlyPhe: 3.219 ± 0.09
4.215GlyGly: 4.215 ± 0.112
1.206GlyHis: 1.206 ± 0.052
6.96GlyIle: 6.96 ± 0.122
5.558GlyLys: 5.558 ± 0.097
5.447GlyLeu: 5.447 ± 0.112
1.846GlyMet: 1.846 ± 0.067
2.93GlyAsn: 2.93 ± 0.083
1.729GlyPro: 1.729 ± 0.052
1.811GlyGln: 1.811 ± 0.06
2.173GlyArg: 2.173 ± 0.064
4.204GlySer: 4.204 ± 0.096
3.891GlyThr: 3.891 ± 0.102
4.123GlyVal: 4.123 ± 0.095
0.669GlyTrp: 0.669 ± 0.037
2.18GlyTyr: 2.18 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.098HisAla: 1.098 ± 0.048
0.223HisCys: 0.223 ± 0.018
1.122HisAsp: 1.122 ± 0.045
1.183HisGlu: 1.183 ± 0.047
0.766HisPhe: 0.766 ± 0.043
1.314HisGly: 1.314 ± 0.05
0.431HisHis: 0.431 ± 0.03
1.385HisIle: 1.385 ± 0.055
1.23HisLys: 1.23 ± 0.046
1.554HisLeu: 1.554 ± 0.052
0.475HisMet: 0.475 ± 0.028
0.793HisAsn: 0.793 ± 0.04
0.952HisPro: 0.952 ± 0.044
0.599HisGln: 0.599 ± 0.035
0.623HisArg: 0.623 ± 0.032
1.157HisSer: 1.157 ± 0.045
1.003HisThr: 1.003 ± 0.044
1.281HisVal: 1.281 ± 0.047
0.194HisTrp: 0.194 ± 0.019
0.616HisTyr: 0.616 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.71IleAla: 5.71 ± 0.12
0.896IleCys: 0.896 ± 0.039
5.401IleAsp: 5.401 ± 0.107
6.165IleGlu: 6.165 ± 0.115
3.973IlePhe: 3.973 ± 0.113
6.011IleGly: 6.011 ± 0.137
1.681IleHis: 1.681 ± 0.055
8.824IleIle: 8.824 ± 0.166
7.713IleLys: 7.713 ± 0.132
8.325IleLeu: 8.325 ± 0.15
2.188IleMet: 2.188 ± 0.063
4.509IleAsn: 4.509 ± 0.086
4.672IlePro: 4.672 ± 0.099
3.133IleGln: 3.133 ± 0.078
3.153IleArg: 3.153 ± 0.075
7.623IleSer: 7.623 ± 0.136
5.821IleThr: 5.821 ± 0.102
5.967IleVal: 5.967 ± 0.116
0.674IleTrp: 0.674 ± 0.038
2.272IleTyr: 2.272 ± 0.063
0.0IleXaa: 0.0 ± 0.0
Lys
4.026LysAla: 4.026 ± 0.088
0.963LysCys: 0.963 ± 0.048
4.456LysAsp: 4.456 ± 0.096
6.282LysGlu: 6.282 ± 0.123
3.911LysPhe: 3.911 ± 0.11
4.123LysGly: 4.123 ± 0.09
1.411LysHis: 1.411 ± 0.056
10.57LysIle: 10.57 ± 0.143
9.36LysLys: 9.36 ± 0.153
7.344LysLeu: 7.344 ± 0.119
2.579LysMet: 2.579 ± 0.064
6.417LysAsn: 6.417 ± 0.125
2.752LysPro: 2.752 ± 0.083
2.924LysGln: 2.924 ± 0.088
2.926LysArg: 2.926 ± 0.085
5.843LysSer: 5.843 ± 0.113
5.092LysThr: 5.092 ± 0.096
4.617LysVal: 4.617 ± 0.103
0.735LysTrp: 0.735 ± 0.044
2.853LysTyr: 2.853 ± 0.078
0.0LysXaa: 0.0 ± 0.0
Leu
5.521LeuAla: 5.521 ± 0.119
0.786LeuCys: 0.786 ± 0.036
5.3LeuAsp: 5.3 ± 0.1
6.541LeuGlu: 6.541 ± 0.118
3.599LeuPhe: 3.599 ± 0.099
5.638LeuGly: 5.638 ± 0.093
1.442LeuHis: 1.442 ± 0.051
6.963LeuIle: 6.963 ± 0.138
7.916LeuLys: 7.916 ± 0.128
6.879LeuLeu: 6.879 ± 0.139
1.989LeuMet: 1.989 ± 0.058
4.343LeuAsn: 4.343 ± 0.105
3.195LeuPro: 3.195 ± 0.071
2.91LeuGln: 2.91 ± 0.07
3.293LeuArg: 3.293 ± 0.094
6.682LeuSer: 6.682 ± 0.143
4.556LeuThr: 4.556 ± 0.09
5.651LeuVal: 5.651 ± 0.11
0.632LeuTrp: 0.632 ± 0.043
2.365LeuTyr: 2.365 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
1.557MetAla: 1.557 ± 0.065
0.192MetCys: 0.192 ± 0.019
1.323MetAsp: 1.323 ± 0.046
1.475MetGlu: 1.475 ± 0.045
0.96MetPhe: 0.96 ± 0.043
1.661MetGly: 1.661 ± 0.06
0.483MetHis: 0.483 ± 0.028
2.645MetIle: 2.645 ± 0.069
2.555MetLys: 2.555 ± 0.071
2.127MetLeu: 2.127 ± 0.063
0.808MetMet: 0.808 ± 0.047
1.327MetAsn: 1.327 ± 0.05
0.987MetPro: 0.987 ± 0.047
0.832MetGln: 0.832 ± 0.042
0.857MetArg: 0.857 ± 0.037
1.901MetSer: 1.901 ± 0.062
1.528MetThr: 1.528 ± 0.055
1.497MetVal: 1.497 ± 0.051
0.207MetTrp: 0.207 ± 0.021
0.746MetTyr: 0.746 ± 0.042
0.0MetXaa: 0.0 ± 0.0
Asn
3.038AsnAla: 3.038 ± 0.086
0.621AsnCys: 0.621 ± 0.034
2.871AsnAsp: 2.871 ± 0.09
3.52AsnGlu: 3.52 ± 0.083
2.471AsnPhe: 2.471 ± 0.072
3.109AsnGly: 3.109 ± 0.08
1.073AsnHis: 1.073 ± 0.05
4.595AsnIle: 4.595 ± 0.096
3.993AsnLys: 3.993 ± 0.083
4.821AsnLeu: 4.821 ± 0.096
1.464AsnMet: 1.464 ± 0.055
3.032AsnAsn: 3.032 ± 0.085
2.636AsnPro: 2.636 ± 0.07
2.012AsnGln: 2.012 ± 0.068
1.57AsnArg: 1.57 ± 0.055
4.156AsnSer: 4.156 ± 0.116
2.871AsnThr: 2.871 ± 0.082
3.153AsnVal: 3.153 ± 0.081
0.528AsnTrp: 0.528 ± 0.035
1.824AsnTyr: 1.824 ± 0.055
0.0AsnXaa: 0.0 ± 0.0
Pro
1.815ProAla: 1.815 ± 0.059
0.256ProCys: 0.256 ± 0.021
2.489ProAsp: 2.489 ± 0.088
2.941ProGlu: 2.941 ± 0.075
1.603ProPhe: 1.603 ± 0.046
2.233ProGly: 2.233 ± 0.07
0.724ProHis: 0.724 ± 0.041
3.447ProIle: 3.447 ± 0.086
3.213ProLys: 3.213 ± 0.075
2.922ProLeu: 2.922 ± 0.067
0.841ProMet: 0.841 ± 0.04
2.054ProAsn: 2.054 ± 0.069
1.131ProPro: 1.131 ± 0.049
1.217ProGln: 1.217 ± 0.047
1.162ProArg: 1.162 ± 0.049
2.548ProSer: 2.548 ± 0.083
2.321ProThr: 2.321 ± 0.076
2.418ProVal: 2.418 ± 0.071
0.367ProTrp: 0.367 ± 0.026
1.172ProTyr: 1.172 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
1.612GlnAla: 1.612 ± 0.053
0.285GlnCys: 0.285 ± 0.024
1.493GlnAsp: 1.493 ± 0.056
2.018GlnGlu: 2.018 ± 0.062
1.331GlnPhe: 1.331 ± 0.05
1.605GlnGly: 1.605 ± 0.058
0.543GlnHis: 0.543 ± 0.028
3.657GlnIle: 3.657 ± 0.087
3.537GlnLys: 3.537 ± 0.078
2.544GlnLeu: 2.544 ± 0.062
0.905GlnMet: 0.905 ± 0.043
2.049GlnAsn: 2.049 ± 0.062
0.782GlnPro: 0.782 ± 0.042
1.024GlnGln: 1.024 ± 0.047
1.161GlnArg: 1.161 ± 0.047
2.126GlnSer: 2.126 ± 0.063
1.87GlnThr: 1.87 ± 0.057
1.875GlnVal: 1.875 ± 0.061
0.249GlnTrp: 0.249 ± 0.024
1.053GlnTyr: 1.053 ± 0.041
0.0GlnXaa: 0.0 ± 0.0
Arg
1.733ArgAla: 1.733 ± 0.063
0.42ArgCys: 0.42 ± 0.029
1.939ArgAsp: 1.939 ± 0.065
2.347ArgGlu: 2.347 ± 0.071
1.466ArgPhe: 1.466 ± 0.047
2.034ArgGly: 2.034 ± 0.065
0.59ArgHis: 0.59 ± 0.029
3.336ArgIle: 3.336 ± 0.084
3.367ArgLys: 3.367 ± 0.095
3.125ArgLeu: 3.125 ± 0.071
0.938ArgMet: 0.938 ± 0.043
1.837ArgAsn: 1.837 ± 0.058
1.122ArgPro: 1.122 ± 0.044
1.069ArgGln: 1.069 ± 0.045
1.371ArgArg: 1.371 ± 0.054
2.052ArgSer: 2.052 ± 0.064
1.724ArgThr: 1.724 ± 0.055
1.957ArgVal: 1.957 ± 0.058
0.318ArgTrp: 0.318 ± 0.026
1.203ArgTyr: 1.203 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
3.889SerAla: 3.889 ± 0.094
0.813SerCys: 0.813 ± 0.045
4.288SerAsp: 4.288 ± 0.107
4.911SerGlu: 4.911 ± 0.096
3.283SerPhe: 3.283 ± 0.079
5.158SerGly: 5.158 ± 0.115
1.239SerHis: 1.239 ± 0.042
6.783SerIle: 6.783 ± 0.141
6.865SerLys: 6.865 ± 0.13
6.291SerLeu: 6.291 ± 0.105
1.928SerMet: 1.928 ± 0.055
3.904SerAsn: 3.904 ± 0.112
2.429SerPro: 2.429 ± 0.07
2.307SerGln: 2.307 ± 0.067
2.296SerArg: 2.296 ± 0.066
5.214SerSer: 5.214 ± 0.133
4.045SerThr: 4.045 ± 0.117
4.566SerVal: 4.566 ± 0.1
0.733SerTrp: 0.733 ± 0.043
2.146SerTyr: 2.146 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
3.261ThrAla: 3.261 ± 0.079
0.548ThrCys: 0.548 ± 0.034
2.977ThrAsp: 2.977 ± 0.076
3.262ThrGlu: 3.262 ± 0.089
2.46ThrPhe: 2.46 ± 0.068
4.043ThrGly: 4.043 ± 0.095
0.989ThrHis: 0.989 ± 0.046
5.414ThrIle: 5.414 ± 0.113
4.785ThrLys: 4.785 ± 0.106
4.748ThrLeu: 4.748 ± 0.091
1.279ThrMet: 1.279 ± 0.051
3.021ThrAsn: 3.021 ± 0.084
2.531ThrPro: 2.531 ± 0.076
1.689ThrGln: 1.689 ± 0.06
1.831ThrArg: 1.831 ± 0.061
4.151ThrSer: 4.151 ± 0.097
3.248ThrThr: 3.248 ± 0.104
3.824ThrVal: 3.824 ± 0.091
0.472ThrTrp: 0.472 ± 0.032
1.753ThrTyr: 1.753 ± 0.062
0.0ThrXaa: 0.0 ± 0.0
Val
3.725ValAla: 3.725 ± 0.097
0.724ValCys: 0.724 ± 0.041
3.781ValAsp: 3.781 ± 0.089
4.014ValGlu: 4.014 ± 0.089
2.88ValPhe: 2.88 ± 0.08
4.337ValGly: 4.337 ± 0.09
1.084ValHis: 1.084 ± 0.048
5.942ValIle: 5.942 ± 0.115
4.99ValLys: 4.99 ± 0.097
5.62ValLeu: 5.62 ± 0.098
1.544ValMet: 1.544 ± 0.047
3.182ValAsn: 3.182 ± 0.084
2.188ValPro: 2.188 ± 0.069
1.596ValGln: 1.596 ± 0.059
2.173ValArg: 2.173 ± 0.069
4.823ValSer: 4.823 ± 0.093
3.767ValThr: 3.767 ± 0.077
4.231ValVal: 4.231 ± 0.103
0.492ValTrp: 0.492 ± 0.033
1.873ValTyr: 1.873 ± 0.058
0.0ValXaa: 0.0 ± 0.0
Trp
0.484TrpAla: 0.484 ± 0.031
0.071TrpCys: 0.071 ± 0.01
0.503TrpAsp: 0.503 ± 0.032
0.519TrpGlu: 0.519 ± 0.035
0.415TrpPhe: 0.415 ± 0.028
0.514TrpGly: 0.514 ± 0.031
0.185TrpHis: 0.185 ± 0.017
0.843TrpIle: 0.843 ± 0.038
0.791TrpLys: 0.791 ± 0.046
0.757TrpLeu: 0.757 ± 0.036
0.285TrpMet: 0.285 ± 0.027
0.589TrpAsn: 0.589 ± 0.036
0.28TrpPro: 0.28 ± 0.023
0.322TrpGln: 0.322 ± 0.023
0.318TrpArg: 0.318 ± 0.025
0.568TrpSer: 0.568 ± 0.035
0.506TrpThr: 0.506 ± 0.03
0.521TrpVal: 0.521 ± 0.034
0.15TrpTrp: 0.15 ± 0.02
0.296TrpTyr: 0.296 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.755TyrAla: 1.755 ± 0.058
0.453TyrCys: 0.453 ± 0.031
2.361TyrAsp: 2.361 ± 0.076
1.901TyrGlu: 1.901 ± 0.063
1.51TyrPhe: 1.51 ± 0.051
2.19TyrGly: 2.19 ± 0.063
0.684TyrHis: 0.684 ± 0.034
1.82TyrIle: 1.82 ± 0.053
2.217TyrLys: 2.217 ± 0.073
2.966TyrLeu: 2.966 ± 0.079
0.744TyrMet: 0.744 ± 0.037
1.552TyrAsn: 1.552 ± 0.057
1.307TyrPro: 1.307 ± 0.048
1.159TyrGln: 1.159 ± 0.047
1.175TyrArg: 1.175 ± 0.045
2.519TyrSer: 2.519 ± 0.061
1.519TyrThr: 1.519 ± 0.055
2.087TyrVal: 2.087 ± 0.066
0.349TyrTrp: 0.349 ± 0.024
1.133TyrTyr: 1.133 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2621 proteins (547141 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski