Amino acid dipeptide frequency for Sulfitobacter sp. HI0023

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones in blue.
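As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. This is an assumption: the page does not state how Proteome-pI normalizes the counts. The function name, the one-letter codes (the table uses three-letter codes), and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["MAAL", "AAG"]  # hypothetical toy sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], EXPECTED_PER_MILLE)
```

In the toy input the pairs are MA, AA, AL, AA and AG, so AA accounts for two of five pairs.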

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
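The unit conversion, and a comparison against the uniform expectation of 1/400, can be sketched as follows. The helper names are hypothetical, and using exactly 2.5‰ as the threshold between over- and underrepresented is an assumption; the page does not say which cutoff drives the colouring.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table on this page.
for name, value in [("AlaAla", 16.431), ("CysCys", 0.116)]:
    print(name, per_mille_to_fraction(value), representation(value))
```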

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.431 ± 0.18
AlaCys: 1.069 ± 0.033
AlaAsp: 6.896 ± 0.086
AlaGlu: 8.795 ± 0.107
AlaPhe: 4.252 ± 0.062
AlaGly: 10.408 ± 0.1
AlaHis: 2.322 ± 0.051
AlaIle: 5.703 ± 0.071
AlaLys: 3.729 ± 0.071
AlaLeu: 13.469 ± 0.124
AlaMet: 3.752 ± 0.057
AlaAsn: 2.631 ± 0.052
AlaPro: 5.779 ± 0.08
AlaGln: 4.566 ± 0.069
AlaArg: 9.203 ± 0.093
AlaSer: 5.445 ± 0.063
AlaThr: 5.828 ± 0.076
AlaVal: 8.175 ± 0.104
AlaTrp: 1.364 ± 0.038
AlaTyr: 2.532 ± 0.048
AlaXaa: 0.002 ± 0.001
Cys
CysAla: 1.066 ± 0.033
CysCys: 0.116 ± 0.01
CysAsp: 0.612 ± 0.023
CysGlu: 0.491 ± 0.022
CysPhe: 0.327 ± 0.015
CysGly: 0.944 ± 0.03
CysHis: 0.255 ± 0.014
CysIle: 0.452 ± 0.02
CysLys: 0.217 ± 0.011
CysLeu: 0.784 ± 0.024
CysMet: 0.179 ± 0.012
CysAsn: 0.22 ± 0.012
CysPro: 0.511 ± 0.022
CysGln: 0.217 ± 0.014
CysArg: 0.536 ± 0.022
CysSer: 0.433 ± 0.02
CysThr: 0.453 ± 0.021
CysVal: 0.613 ± 0.024
CysTrp: 0.113 ± 0.011
CysTyr: 0.218 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.603 ± 0.108
AspCys: 0.545 ± 0.022
AspAsp: 3.583 ± 0.07
AspGlu: 3.62 ± 0.056
AspPhe: 2.211 ± 0.045
AspGly: 5.468 ± 0.068
AspHis: 1.369 ± 0.034
AspIle: 3.231 ± 0.049
AspLys: 1.738 ± 0.046
AspLeu: 6.349 ± 0.076
AspMet: 1.775 ± 0.038
AspAsn: 1.296 ± 0.033
AspPro: 3.694 ± 0.055
AspGln: 1.703 ± 0.042
AspArg: 4.692 ± 0.071
AspSer: 2.319 ± 0.045
AspThr: 3.1 ± 0.055
AspVal: 4.377 ± 0.057
AspTrp: 1.207 ± 0.036
AspTyr: 1.596 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.248 ± 0.099
GluCys: 0.376 ± 0.02
GluAsp: 3.767 ± 0.058
GluGlu: 4.159 ± 0.069
GluPhe: 1.82 ± 0.041
GluGly: 5.403 ± 0.078
GluHis: 1.195 ± 0.029
GluIle: 3.475 ± 0.057
GluLys: 2.357 ± 0.055
GluLeu: 5.344 ± 0.082
GluMet: 1.957 ± 0.042
GluAsn: 1.843 ± 0.039
GluPro: 2.626 ± 0.049
GluGln: 2.092 ± 0.048
GluArg: 4.65 ± 0.064
GluSer: 2.311 ± 0.042
GluThr: 3.988 ± 0.056
GluVal: 4.63 ± 0.062
GluTrp: 0.786 ± 0.03
GluTyr: 1.089 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.641 ± 0.075
PheCys: 0.43 ± 0.018
PheAsp: 3.008 ± 0.053
PheGlu: 2.306 ± 0.045
PhePhe: 1.407 ± 0.04
PheGly: 3.746 ± 0.063
PheHis: 0.716 ± 0.025
PheIle: 1.628 ± 0.04
PheLys: 0.938 ± 0.028
PheLeu: 3.268 ± 0.056
PheMet: 0.884 ± 0.027
PheAsn: 1.009 ± 0.029
PhePro: 1.491 ± 0.034
PheGln: 0.951 ± 0.027
PheArg: 2.16 ± 0.042
PheSer: 1.943 ± 0.045
PheThr: 2.059 ± 0.039
PheVal: 2.625 ± 0.052
PheTrp: 0.555 ± 0.024
PheTyr: 0.926 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.81 ± 0.105
GlyCys: 0.842 ± 0.028
GlyAsp: 4.742 ± 0.067
GlyGlu: 4.846 ± 0.07
GlyPhe: 3.655 ± 0.063
GlyGly: 7.616 ± 0.116
GlyHis: 1.931 ± 0.042
GlyIle: 4.598 ± 0.067
GlyLys: 3.23 ± 0.059
GlyLeu: 9.078 ± 0.091
GlyMet: 2.679 ± 0.055
GlyAsn: 2.164 ± 0.047
GlyPro: 3.835 ± 0.059
GlyGln: 3.086 ± 0.05
GlyArg: 5.928 ± 0.079
GlySer: 4.456 ± 0.082
GlyThr: 4.687 ± 0.074
GlyVal: 6.391 ± 0.093
GlyTrp: 1.477 ± 0.04
GlyTyr: 2.3 ± 0.043
GlyXaa: 0.001 ± 0.001
His
HisAla: 2.352 ± 0.054
HisCys: 0.235 ± 0.016
HisAsp: 1.322 ± 0.039
HisGlu: 1.149 ± 0.035
HisPhe: 0.797 ± 0.031
HisGly: 1.99 ± 0.045
HisHis: 0.565 ± 0.024
HisIle: 0.97 ± 0.03
HisLys: 0.502 ± 0.021
HisLeu: 2.054 ± 0.04
HisMet: 0.584 ± 0.021
HisAsn: 0.419 ± 0.018
HisPro: 1.357 ± 0.041
HisGln: 0.534 ± 0.025
HisArg: 1.433 ± 0.033
HisSer: 0.913 ± 0.027
HisThr: 0.855 ± 0.025
HisVal: 1.594 ± 0.037
HisTrp: 0.33 ± 0.016
HisTyr: 0.58 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.875 ± 0.072
IleCys: 0.623 ± 0.021
IleAsp: 3.532 ± 0.053
IleGlu: 3.652 ± 0.066
IlePhe: 1.796 ± 0.045
IleGly: 4.662 ± 0.072
IleHis: 0.918 ± 0.026
IleIle: 2.2 ± 0.046
IleLys: 1.344 ± 0.042
IleLeu: 4.581 ± 0.068
IleMet: 1.093 ± 0.03
IleAsn: 1.352 ± 0.035
IlePro: 2.332 ± 0.042
IleGln: 1.212 ± 0.035
IleArg: 3.179 ± 0.057
IleSer: 2.767 ± 0.045
IleThr: 2.886 ± 0.052
IleVal: 3.922 ± 0.058
IleTrp: 0.714 ± 0.027
IleTyr: 1.186 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.899 ± 0.067
LysCys: 0.169 ± 0.013
LysAsp: 1.821 ± 0.045
LysGlu: 1.701 ± 0.047
LysPhe: 0.931 ± 0.031
LysGly: 2.726 ± 0.054
LysHis: 0.575 ± 0.022
LysIle: 1.595 ± 0.04
LysLys: 1.222 ± 0.038
LysLeu: 2.986 ± 0.054
LysMet: 0.859 ± 0.025
LysAsn: 0.776 ± 0.027
LysPro: 1.742 ± 0.044
LysGln: 0.914 ± 0.025
LysArg: 2.3 ± 0.05
LysSer: 1.765 ± 0.046
LysThr: 1.976 ± 0.044
LysVal: 2.364 ± 0.05
LysTrp: 0.366 ± 0.015
LysTyr: 0.645 ± 0.026
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.47 ± 0.12
LeuCys: 0.965 ± 0.03
LeuAsp: 5.965 ± 0.079
LeuGlu: 5.515 ± 0.075
LeuPhe: 3.474 ± 0.07
LeuGly: 8.369 ± 0.111
LeuHis: 1.94 ± 0.044
LeuIle: 5.022 ± 0.067
LeuLys: 3.176 ± 0.063
LeuLeu: 9.188 ± 0.132
LeuMet: 2.629 ± 0.048
LeuAsn: 2.657 ± 0.041
LeuPro: 5.564 ± 0.087
LeuGln: 2.835 ± 0.047
LeuArg: 7.503 ± 0.075
LeuSer: 6.54 ± 0.084
LeuThr: 5.819 ± 0.066
LeuVal: 6.623 ± 0.078
LeuTrp: 1.259 ± 0.039
LeuTyr: 1.932 ± 0.042
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.509 ± 0.051
MetCys: 0.187 ± 0.012
MetAsp: 1.418 ± 0.033
MetGlu: 1.559 ± 0.035
MetPhe: 0.806 ± 0.026
MetGly: 2.348 ± 0.05
MetHis: 0.481 ± 0.02
MetIle: 1.515 ± 0.034
MetLys: 1.134 ± 0.031
MetLeu: 2.659 ± 0.051
MetMet: 0.831 ± 0.032
MetAsn: 0.94 ± 0.027
MetPro: 1.549 ± 0.037
MetGln: 1.033 ± 0.03
MetArg: 2.014 ± 0.039
MetSer: 1.742 ± 0.04
MetThr: 2.053 ± 0.041
MetVal: 1.812 ± 0.039
MetTrp: 0.237 ± 0.015
MetTyr: 0.379 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.197 ± 0.057
AsnCys: 0.26 ± 0.016
AsnAsp: 1.493 ± 0.041
AsnGlu: 1.309 ± 0.04
AsnPhe: 0.956 ± 0.03
AsnGly: 2.303 ± 0.051
AsnHis: 0.505 ± 0.022
AsnIle: 1.329 ± 0.037
AsnLys: 0.651 ± 0.025
AsnLeu: 2.492 ± 0.05
AsnMet: 0.723 ± 0.027
AsnAsn: 0.639 ± 0.025
AsnPro: 1.767 ± 0.043
AsnGln: 0.692 ± 0.023
AsnArg: 1.854 ± 0.039
AsnSer: 1.139 ± 0.034
AsnThr: 1.324 ± 0.032
AsnVal: 1.801 ± 0.038
AsnTrp: 0.411 ± 0.021
AsnTyr: 0.657 ± 0.024
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 5.762 ± 0.078
ProCys: 0.358 ± 0.017
ProAsp: 3.907 ± 0.064
ProGlu: 4.321 ± 0.066
ProPhe: 2.017 ± 0.046
ProGly: 4.53 ± 0.066
ProHis: 1.096 ± 0.033
ProIle: 2.198 ± 0.041
ProLys: 1.584 ± 0.041
ProLeu: 4.708 ± 0.065
ProMet: 1.241 ± 0.034
ProAsn: 1.239 ± 0.035
ProPro: 2.414 ± 0.051
ProGln: 1.672 ± 0.036
ProArg: 3.1 ± 0.057
ProSer: 2.41 ± 0.04
ProThr: 2.298 ± 0.05
ProVal: 4.178 ± 0.066
ProTrp: 0.637 ± 0.027
ProTyr: 1.204 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.881 ± 0.062
GlnCys: 0.204 ± 0.013
GlnAsp: 1.784 ± 0.039
GlnGlu: 1.726 ± 0.043
GlnPhe: 1.07 ± 0.031
GlnGly: 2.512 ± 0.047
GlnHis: 0.597 ± 0.024
GlnIle: 1.936 ± 0.039
GlnLys: 1.138 ± 0.032
GlnLeu: 2.804 ± 0.055
GlnMet: 1.101 ± 0.029
GlnAsn: 0.933 ± 0.029
GlnPro: 1.591 ± 0.041
GlnGln: 1.133 ± 0.038
GlnArg: 2.264 ± 0.048
GlnSer: 1.768 ± 0.039
GlnThr: 1.83 ± 0.039
GlnVal: 2.328 ± 0.042
GlnTrp: 0.399 ± 0.016
GlnTyr: 0.574 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.376 ± 0.095
ArgCys: 0.485 ± 0.021
ArgAsp: 4.618 ± 0.06
ArgGlu: 4.224 ± 0.06
ArgPhe: 2.778 ± 0.055
ArgGly: 4.835 ± 0.061
ArgHis: 1.65 ± 0.04
ArgIle: 4.041 ± 0.06
ArgLys: 2.452 ± 0.049
ArgLeu: 7.498 ± 0.084
ArgMet: 2.087 ± 0.036
ArgAsn: 1.851 ± 0.039
ArgPro: 3.353 ± 0.061
ArgGln: 2.474 ± 0.054
ArgArg: 5.392 ± 0.082
ArgSer: 3.391 ± 0.052
ArgThr: 3.21 ± 0.046
ArgVal: 4.829 ± 0.068
ArgTrp: 0.949 ± 0.029
ArgTyr: 1.722 ± 0.043
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 5.736 ± 0.068
SerCys: 0.399 ± 0.018
SerAsp: 3.443 ± 0.055
SerGlu: 2.974 ± 0.058
SerPhe: 2.19 ± 0.037
SerGly: 5.302 ± 0.087
SerHis: 1.056 ± 0.029
SerIle: 2.483 ± 0.044
SerLys: 1.431 ± 0.039
SerLeu: 4.825 ± 0.064
SerMet: 1.444 ± 0.037
SerAsn: 1.36 ± 0.032
SerPro: 2.445 ± 0.045
SerGln: 1.496 ± 0.033
SerArg: 3.31 ± 0.052
SerSer: 2.494 ± 0.05
SerThr: 2.392 ± 0.042
SerVal: 3.748 ± 0.056
SerTrp: 0.635 ± 0.022
SerTyr: 1.353 ± 0.034
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.181 ± 0.08
ThrCys: 0.497 ± 0.023
ThrAsp: 3.115 ± 0.052
ThrGlu: 3.1 ± 0.053
ThrPhe: 1.975 ± 0.041
ThrGly: 5.518 ± 0.086
ThrHis: 1.066 ± 0.031
ThrIle: 2.653 ± 0.048
ThrLys: 1.36 ± 0.035
ThrLeu: 5.988 ± 0.07
ThrMet: 1.366 ± 0.032
ThrAsn: 1.272 ± 0.031
ThrPro: 3.419 ± 0.055
ThrGln: 1.636 ± 0.04
ThrArg: 3.573 ± 0.054
ThrSer: 2.621 ± 0.044
ThrThr: 2.782 ± 0.053
ThrVal: 4.051 ± 0.058
ThrTrp: 0.643 ± 0.02
ThrTyr: 1.251 ± 0.035
ThrXaa: 0.001 ± 0.001
Val
ValAla: 8.471 ± 0.1
ValCys: 0.637 ± 0.021
ValAsp: 4.052 ± 0.058
ValGlu: 4.694 ± 0.065
ValPhe: 2.764 ± 0.054
ValGly: 5.347 ± 0.071
ValHis: 1.408 ± 0.038
ValIle: 4.117 ± 0.075
ValLys: 2.09 ± 0.048
ValLeu: 7.456 ± 0.091
ValMet: 2.097 ± 0.042
ValAsn: 1.869 ± 0.041
ValPro: 3.673 ± 0.059
ValGln: 2.131 ± 0.046
ValArg: 4.32 ± 0.064
ValSer: 4.219 ± 0.057
ValThr: 4.656 ± 0.064
ValVal: 5.513 ± 0.074
ValTrp: 0.945 ± 0.027
ValTyr: 1.542 ± 0.039
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 1.286 ± 0.034
TrpCys: 0.144 ± 0.01
TrpAsp: 0.731 ± 0.024
TrpGlu: 0.655 ± 0.029
TrpPhe: 0.565 ± 0.025
TrpGly: 1.081 ± 0.036
TrpHis: 0.348 ± 0.018
TrpIle: 0.7 ± 0.026
TrpLys: 0.469 ± 0.019
TrpLeu: 1.581 ± 0.036
TrpMet: 0.428 ± 0.023
TrpAsn: 0.446 ± 0.023
TrpPro: 0.649 ± 0.024
TrpGln: 0.585 ± 0.024
TrpArg: 1.11 ± 0.031
TrpSer: 0.763 ± 0.027
TrpThr: 0.736 ± 0.024
TrpVal: 0.857 ± 0.028
TrpTrp: 0.216 ± 0.013
TrpTyr: 0.274 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.554 ± 0.047
TyrCys: 0.216 ± 0.012
TyrAsp: 1.665 ± 0.041
TyrGlu: 1.318 ± 0.039
TyrPhe: 0.898 ± 0.028
TyrGly: 2.244 ± 0.043
TyrHis: 0.542 ± 0.023
TyrIle: 0.981 ± 0.026
TyrLys: 0.571 ± 0.023
TyrLeu: 2.334 ± 0.043
TyrMet: 0.496 ± 0.023
TyrAsn: 0.589 ± 0.022
TyrPro: 1.083 ± 0.031
TyrGln: 0.612 ± 0.02
TyrArg: 1.695 ± 0.039
TyrSer: 1.077 ± 0.031
TyrThr: 1.135 ± 0.027
TyrVal: 1.572 ± 0.038
TyrTrp: 0.376 ± 0.019
TyrTyr: 0.58 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.002 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.001 ± 0.001
Statistics based on 3900 proteins (1190183 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
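A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the spread of those estimates is taken as the error. Reporting the sample standard deviation is an assumption, as the page does not say which spread measure is used. The function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency over resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


toy = ["MAAL", "AAG", "MKV", "GAAAT"]  # hypothetical toy sequences
print(bootstrap_error(toy, "AA"))
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.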

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski