Amino acid dipeptide frequency for Polaribacter sp. Hel1_85

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
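A minimal sketch of how such a frequency could be computed is given here. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries), that counts are pooled over the whole proteome, and that any letter outside the 20 standard one-letter codes is treated as Xaa ('X'). The function name dipeptide_per_mille and the toy sequences are illustrative only and are not taken from Proteome-pI; results are scaled to per mille to match the table further down.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything non-standard or unknown is mapped to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MKKLLA", "MKA"])
```

With this toy input there are seven dipeptides in total, two of which are MK, so freqs["MK"] is 2/7 expressed in per mille.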

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random, with all 20 amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
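The arithmetic behind the expected value can be sketched as follows. The classify helper is hypothetical, and the simple "above or below the uniform expectation" rule is an assumption about how the red/blue marking is decided; the page itself does not state the exact threshold. The two example values are taken from the table (LysLys and CysCys), which reports frequencies in per mille.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation: one out of 20 * 20 = 400 ordered pairs.
expected_percent = 100.0 / N_STANDARD ** 2     # 0.25 %
expected_per_mille = 1000.0 / N_STANDARD ** 2  # 2.5 per mille


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


lys_lys = classify(9.097)  # LysLys from the table
cys_cys = classify(0.081)  # CysCys from the table
```

Under this assumed rule LysLys comes out as overrepresented and CysCys as underrepresented.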

All values are given in per mille (‰), so each must be multiplied by 10⁻³ to obtain a fraction.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.754 ± 0.078 | 0.473 ± 0.022 | 3.122 ± 0.063 | 3.518 ± 0.065 | 3.136 ± 0.06 | 3.765 ± 0.078 | 0.954 ± 0.028 | 5.248 ± 0.072 | 4.673 ± 0.077 | 5.313 ± 0.072 | 1.229 ± 0.038 | 3.404 ± 0.061 | 1.667 ± 0.041 | 1.895 ± 0.041 | 1.71 ± 0.042 | 4.051 ± 0.068 | 3.707 ± 0.077 | 3.596 ± 0.062 | 0.579 ± 0.021 | 2.131 ± 0.041 | 0.0 ± 0.0 |
| Cys | 0.431 ± 0.02 | 0.081 ± 0.009 | 0.382 ± 0.021 | 0.418 ± 0.021 | 0.396 ± 0.021 | 0.563 ± 0.026 | 0.174 ± 0.019 | 0.542 ± 0.023 | 0.557 ± 0.024 | 0.571 ± 0.024 | 0.133 ± 0.011 | 0.431 ± 0.02 | 0.266 ± 0.016 | 0.182 ± 0.012 | 0.156 ± 0.011 | 0.518 ± 0.023 | 0.391 ± 0.021 | 0.397 ± 0.02 | 0.066 ± 0.009 | 0.269 ± 0.015 | 0.0 ± 0.0 |
| Asp | 3.553 ± 0.057 | 0.404 ± 0.019 | 2.984 ± 0.082 | 3.643 ± 0.06 | 3.824 ± 0.057 | 3.767 ± 0.111 | 0.694 ± 0.028 | 4.562 ± 0.069 | 4.597 ± 0.08 | 5.093 ± 0.078 | 0.889 ± 0.035 | 3.511 ± 0.077 | 1.348 ± 0.039 | 1.166 ± 0.035 | 1.559 ± 0.037 | 3.214 ± 0.071 | 2.708 ± 0.065 | 3.774 ± 0.066 | 0.734 ± 0.028 | 2.695 ± 0.056 | 0.0 ± 0.0 |
| Glu | 3.844 ± 0.064 | 0.328 ± 0.019 | 3.419 ± 0.052 | 4.842 ± 0.084 | 3.154 ± 0.055 | 3.496 ± 0.065 | 1.004 ± 0.034 | 6.101 ± 0.088 | 6.595 ± 0.103 | 6.043 ± 0.073 | 1.499 ± 0.04 | 5.542 ± 0.081 | 1.329 ± 0.034 | 1.853 ± 0.04 | 2.195 ± 0.05 | 3.267 ± 0.053 | 3.856 ± 0.062 | 4.374 ± 0.064 | 0.587 ± 0.024 | 2.452 ± 0.049 | 0.0 ± 0.0 |
| Phe | 2.822 ± 0.058 | 0.4 ± 0.019 | 3.311 ± 0.061 | 3.322 ± 0.055 | 2.957 ± 0.068 | 3.753 ± 0.074 | 0.798 ± 0.024 | 4.437 ± 0.075 | 4.34 ± 0.07 | 5.073 ± 0.086 | 1.129 ± 0.033 | 3.719 ± 0.075 | 1.645 ± 0.04 | 1.468 ± 0.037 | 1.452 ± 0.038 | 4.623 ± 0.081 | 3.429 ± 0.061 | 3.111 ± 0.058 | 0.618 ± 0.024 | 2.419 ± 0.055 | 0.0 ± 0.0 |
| Gly | 3.853 ± 0.079 | 0.507 ± 0.022 | 3.349 ± 0.07 | 3.433 ± 0.053 | 3.926 ± 0.066 | 4.426 ± 0.095 | 1.03 ± 0.03 | 5.587 ± 0.079 | 5.228 ± 0.078 | 5.345 ± 0.069 | 1.433 ± 0.035 | 3.935 ± 0.085 | 1.17 ± 0.038 | 1.616 ± 0.041 | 1.875 ± 0.047 | 4.056 ± 0.082 | 4.07 ± 0.103 | 4.304 ± 0.08 | 0.715 ± 0.028 | 2.713 ± 0.056 | 0.0 ± 0.0 |
| His | 0.798 ± 0.027 | 0.14 ± 0.011 | 0.737 ± 0.026 | 0.811 ± 0.027 | 1.078 ± 0.03 | 0.944 ± 0.027 | 0.462 ± 0.025 | 1.318 ± 0.036 | 1.325 ± 0.033 | 1.688 ± 0.047 | 0.303 ± 0.016 | 0.983 ± 0.032 | 0.791 ± 0.032 | 0.726 ± 0.025 | 0.599 ± 0.022 | 0.995 ± 0.031 | 0.922 ± 0.033 | 0.803 ± 0.024 | 0.192 ± 0.013 | 0.714 ± 0.023 | 0.0 ± 0.0 |
| Ile | 5.347 ± 0.075 | 0.599 ± 0.023 | 5.292 ± 0.088 | 5.789 ± 0.086 | 4.062 ± 0.065 | 5.251 ± 0.08 | 1.393 ± 0.037 | 7.105 ± 0.1 | 7.108 ± 0.095 | 7.365 ± 0.101 | 1.305 ± 0.029 | 5.532 ± 0.075 | 3.319 ± 0.06 | 2.551 ± 0.058 | 2.415 ± 0.043 | 6.54 ± 0.083 | 5.378 ± 0.091 | 4.762 ± 0.065 | 0.697 ± 0.027 | 3.103 ± 0.059 | 0.0 ± 0.0 |
| Lys | 4.767 ± 0.089 | 0.372 ± 0.022 | 4.7 ± 0.072 | 7.57 ± 0.114 | 3.326 ± 0.053 | 5.065 ± 0.076 | 1.515 ± 0.039 | 7.487 ± 0.098 | 9.097 ± 0.144 | 7.009 ± 0.087 | 2.234 ± 0.053 | 6.611 ± 0.111 | 2.501 ± 0.059 | 2.8 ± 0.057 | 2.887 ± 0.046 | 5.142 ± 0.08 | 5.202 ± 0.081 | 5.203 ± 0.069 | 0.877 ± 0.028 | 3.449 ± 0.065 | 0.0 ± 0.0 |
| Leu | 5.188 ± 0.074 | 0.567 ± 0.022 | 5.002 ± 0.079 | 6.305 ± 0.078 | 5.078 ± 0.098 | 5.712 ± 0.078 | 1.403 ± 0.038 | 7.268 ± 0.101 | 8.415 ± 0.104 | 8.365 ± 0.118 | 1.872 ± 0.049 | 5.942 ± 0.082 | 3.194 ± 0.057 | 2.913 ± 0.058 | 2.647 ± 0.048 | 6.523 ± 0.088 | 5.12 ± 0.088 | 5.066 ± 0.06 | 0.765 ± 0.029 | 3.097 ± 0.061 | 0.0 ± 0.0 |
| Met | 1.299 ± 0.039 | 0.133 ± 0.012 | 0.988 ± 0.03 | 1.134 ± 0.035 | 0.907 ± 0.029 | 1.172 ± 0.037 | 0.369 ± 0.019 | 1.596 ± 0.034 | 2.221 ± 0.046 | 1.776 ± 0.044 | 0.541 ± 0.026 | 1.284 ± 0.038 | 0.736 ± 0.025 | 0.711 ± 0.025 | 0.673 ± 0.023 | 1.379 ± 0.031 | 0.967 ± 0.034 | 1.187 ± 0.035 | 0.164 ± 0.011 | 0.66 ± 0.021 | 0.0 ± 0.0 |
| Asn | 3.734 ± 0.052 | 0.494 ± 0.027 | 3.486 ± 0.073 | 3.877 ± 0.061 | 3.551 ± 0.064 | 4.261 ± 0.093 | 1.119 ± 0.03 | 5.676 ± 0.096 | 5.436 ± 0.085 | 6.211 ± 0.091 | 1.197 ± 0.032 | 4.911 ± 0.117 | 2.882 ± 0.06 | 2.306 ± 0.047 | 2.056 ± 0.045 | 4.724 ± 0.084 | 4.211 ± 0.088 | 3.756 ± 0.069 | 0.888 ± 0.029 | 3.241 ± 0.069 | 0.0 ± 0.0 |
| Pro | 1.666 ± 0.041 | 0.198 ± 0.013 | 1.671 ± 0.038 | 2.439 ± 0.053 | 1.892 ± 0.043 | 1.61 ± 0.041 | 0.58 ± 0.024 | 2.734 ± 0.05 | 2.785 ± 0.051 | 2.642 ± 0.05 | 0.551 ± 0.023 | 2.275 ± 0.045 | 0.66 ± 0.028 | 0.966 ± 0.028 | 0.802 ± 0.03 | 2.073 ± 0.046 | 2.119 ± 0.05 | 1.919 ± 0.041 | 0.334 ± 0.02 | 1.21 ± 0.037 | 0.0 ± 0.0 |
| Gln | 1.504 ± 0.04 | 0.153 ± 0.013 | 1.416 ± 0.034 | 2.187 ± 0.043 | 1.697 ± 0.041 | 1.591 ± 0.038 | 0.498 ± 0.021 | 2.658 ± 0.047 | 2.906 ± 0.064 | 3.102 ± 0.056 | 0.668 ± 0.026 | 2.121 ± 0.044 | 0.941 ± 0.027 | 1.276 ± 0.039 | 1.039 ± 0.029 | 1.737 ± 0.036 | 1.808 ± 0.033 | 1.694 ± 0.038 | 0.277 ± 0.015 | 1.153 ± 0.035 | 0.0 ± 0.0 |
| Arg | 1.755 ± 0.048 | 0.177 ± 0.013 | 1.495 ± 0.033 | 1.91 ± 0.043 | 1.762 ± 0.039 | 1.745 ± 0.04 | 0.485 ± 0.02 | 2.689 ± 0.052 | 2.845 ± 0.058 | 2.799 ± 0.057 | 0.689 ± 0.023 | 2.015 ± 0.04 | 0.922 ± 0.03 | 0.872 ± 0.027 | 1.107 ± 0.039 | 1.654 ± 0.044 | 1.746 ± 0.035 | 1.85 ± 0.043 | 0.314 ± 0.014 | 1.242 ± 0.032 | 0.0 ± 0.0 |
| Ser | 3.491 ± 0.06 | 0.647 ± 0.025 | 3.632 ± 0.064 | 4.354 ± 0.067 | 4.211 ± 0.062 | 4.663 ± 0.09 | 1.003 ± 0.035 | 5.844 ± 0.069 | 5.916 ± 0.08 | 6.226 ± 0.094 | 1.248 ± 0.033 | 4.413 ± 0.089 | 1.96 ± 0.046 | 1.963 ± 0.042 | 1.82 ± 0.045 | 4.857 ± 0.098 | 3.76 ± 0.072 | 4.065 ± 0.065 | 0.749 ± 0.03 | 2.856 ± 0.052 | 0.0 ± 0.0 |
| Thr | 3.53 ± 0.07 | 0.392 ± 0.023 | 3.408 ± 0.082 | 3.719 ± 0.061 | 3.257 ± 0.066 | 4.007 ± 0.086 | 0.954 ± 0.028 | 5.562 ± 0.085 | 4.526 ± 0.066 | 5.159 ± 0.08 | 0.874 ± 0.027 | 3.914 ± 0.092 | 2.251 ± 0.051 | 1.737 ± 0.036 | 1.551 ± 0.039 | 4.378 ± 0.091 | 3.835 ± 0.1 | 3.598 ± 0.061 | 0.629 ± 0.026 | 2.365 ± 0.055 | 0.0 ± 0.0 |
| Val | 3.852 ± 0.069 | 0.47 ± 0.019 | 3.602 ± 0.066 | 3.752 ± 0.062 | 3.459 ± 0.065 | 3.681 ± 0.064 | 0.924 ± 0.03 | 4.815 ± 0.072 | 4.765 ± 0.073 | 5.933 ± 0.085 | 1.134 ± 0.031 | 3.789 ± 0.058 | 1.866 ± 0.044 | 1.532 ± 0.035 | 1.738 ± 0.043 | 4.56 ± 0.078 | 3.493 ± 0.071 | 3.986 ± 0.065 | 0.585 ± 0.022 | 2.271 ± 0.047 | 0.0 ± 0.0 |
| Trp | 0.563 ± 0.026 | 0.09 ± 0.009 | 0.54 ± 0.022 | 0.616 ± 0.024 | 0.636 ± 0.024 | 0.661 ± 0.027 | 0.216 ± 0.014 | 0.759 ± 0.028 | 0.893 ± 0.031 | 0.907 ± 0.03 | 0.292 ± 0.016 | 0.753 ± 0.026 | 0.223 ± 0.016 | 0.395 ± 0.018 | 0.357 ± 0.018 | 0.675 ± 0.025 | 0.539 ± 0.023 | 0.67 ± 0.029 | 0.138 ± 0.011 | 0.441 ± 0.022 | 0.0 ± 0.0 |
| Tyr | 2.175 ± 0.043 | 0.299 ± 0.018 | 2.183 ± 0.052 | 2.142 ± 0.046 | 2.516 ± 0.048 | 2.437 ± 0.053 | 0.752 ± 0.027 | 2.925 ± 0.05 | 3.583 ± 0.061 | 3.824 ± 0.058 | 0.674 ± 0.024 | 2.935 ± 0.052 | 1.452 ± 0.038 | 1.5 ± 0.037 | 1.435 ± 0.04 | 2.71 ± 0.048 | 2.388 ± 0.049 | 2.091 ± 0.042 | 0.457 ± 0.024 | 1.832 ± 0.049 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3382 proteins (1146162 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
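One way such a protein-level bootstrap could look is sketched below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation of those estimates; the source states only "bootstrapping (x100) at the protein level", so these details, the function names, and the toy sequences are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    # Per protein: (occurrences of `pair`, total number of dipeptides).
    stats = []
    for seq in proteins:
        counts = pair_counts(seq)
        stats.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual dipeptides.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_mk, err_mk = bootstrap_error(["MKKLLA", "MKA", "AAAA"], "MK")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between residues of the same protein intact, which is presumably why the error is estimated at the protein level.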

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski