Amino acid dipepetide frequency for Pedobacter sp. AR-3-17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.239AlaAla: 5.239 ± 0.109
0.609AlaCys: 0.609 ± 0.024
3.99AlaAsp: 3.99 ± 0.06
3.993AlaGlu: 3.993 ± 0.063
3.466AlaPhe: 3.466 ± 0.052
4.896AlaGly: 4.896 ± 0.096
1.097AlaHis: 1.097 ± 0.035
5.657AlaIle: 5.657 ± 0.073
5.275AlaLys: 5.275 ± 0.096
6.568AlaLeu: 6.568 ± 0.09
1.496AlaMet: 1.496 ± 0.04
4.179AlaAsn: 4.179 ± 0.1
2.041AlaPro: 2.041 ± 0.046
2.891AlaGln: 2.891 ± 0.052
1.924AlaArg: 1.924 ± 0.047
4.336AlaSer: 4.336 ± 0.072
4.019AlaThr: 4.019 ± 0.096
4.49AlaVal: 4.49 ± 0.069
0.68AlaTrp: 0.68 ± 0.028
2.712AlaTyr: 2.712 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.509CysAla: 0.509 ± 0.02
0.102CysCys: 0.102 ± 0.011
0.398CysAsp: 0.398 ± 0.018
0.448CysGlu: 0.448 ± 0.022
0.464CysPhe: 0.464 ± 0.023
0.62CysGly: 0.62 ± 0.026
0.173CysHis: 0.173 ± 0.014
0.598CysIle: 0.598 ± 0.022
0.504CysLys: 0.504 ± 0.023
0.735CysLeu: 0.735 ± 0.025
0.147CysMet: 0.147 ± 0.011
0.409CysAsn: 0.409 ± 0.02
0.33CysPro: 0.33 ± 0.019
0.199CysGln: 0.199 ± 0.011
0.234CysArg: 0.234 ± 0.014
0.51CysSer: 0.51 ± 0.019
0.4CysThr: 0.4 ± 0.022
0.428CysVal: 0.428 ± 0.021
0.075CysTrp: 0.075 ± 0.008
0.284CysTyr: 0.284 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.958AspAla: 3.958 ± 0.075
0.394AspCys: 0.394 ± 0.018
2.719AspAsp: 2.719 ± 0.058
3.475AspGlu: 3.475 ± 0.056
3.521AspPhe: 3.521 ± 0.065
3.717AspGly: 3.717 ± 0.076
0.932AspHis: 0.932 ± 0.026
4.049AspIle: 4.049 ± 0.069
4.048AspLys: 4.048 ± 0.06
5.505AspLeu: 5.505 ± 0.071
1.0AspMet: 1.0 ± 0.033
2.52AspAsn: 2.52 ± 0.052
1.776AspPro: 1.776 ± 0.037
1.99AspGln: 1.99 ± 0.042
1.812AspArg: 1.812 ± 0.04
2.663AspSer: 2.663 ± 0.047
2.27AspThr: 2.27 ± 0.044
3.294AspVal: 3.294 ± 0.058
0.694AspTrp: 0.694 ± 0.026
2.412AspTyr: 2.412 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
4.031GluAla: 4.031 ± 0.068
0.314GluCys: 0.314 ± 0.019
2.936GluAsp: 2.936 ± 0.061
4.313GluGlu: 4.313 ± 0.089
2.748GluPhe: 2.748 ± 0.047
3.364GluGly: 3.364 ± 0.056
0.926GluHis: 0.926 ± 0.029
5.378GluIle: 5.378 ± 0.076
5.43GluLys: 5.43 ± 0.089
5.706GluLeu: 5.706 ± 0.079
1.481GluMet: 1.481 ± 0.039
3.989GluAsn: 3.989 ± 0.063
1.525GluPro: 1.525 ± 0.048
2.242GluGln: 2.242 ± 0.055
2.204GluArg: 2.204 ± 0.049
3.041GluSer: 3.041 ± 0.054
3.094GluThr: 3.094 ± 0.054
4.014GluVal: 4.014 ± 0.055
0.61GluTrp: 0.61 ± 0.023
2.011GluTyr: 2.011 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.418PheAla: 3.418 ± 0.064
0.503PheCys: 0.503 ± 0.02
3.096PheAsp: 3.096 ± 0.054
3.13PheGlu: 3.13 ± 0.058
2.718PhePhe: 2.718 ± 0.053
3.632PheGly: 3.632 ± 0.063
0.845PheHis: 0.845 ± 0.027
4.075PheIle: 4.075 ± 0.065
4.097PheLys: 4.097 ± 0.07
4.797PheLeu: 4.797 ± 0.077
1.053PheMet: 1.053 ± 0.027
3.547PheAsn: 3.547 ± 0.054
1.714PhePro: 1.714 ± 0.035
1.462PheGln: 1.462 ± 0.04
1.603PheArg: 1.603 ± 0.039
4.115PheSer: 4.115 ± 0.066
3.203PheThr: 3.203 ± 0.052
2.898PheVal: 2.898 ± 0.062
0.597PheTrp: 0.597 ± 0.024
2.218PheTyr: 2.218 ± 0.046
0.0PheXaa: 0.0 ± 0.0
Gly
4.498GlyAla: 4.498 ± 0.109
0.628GlyCys: 0.628 ± 0.026
3.262GlyAsp: 3.262 ± 0.075
3.452GlyGlu: 3.452 ± 0.054
3.898GlyPhe: 3.898 ± 0.063
4.728GlyGly: 4.728 ± 0.102
1.031GlyHis: 1.031 ± 0.035
5.444GlyIle: 5.444 ± 0.084
5.359GlyLys: 5.359 ± 0.076
6.296GlyLeu: 6.296 ± 0.096
1.527GlyMet: 1.527 ± 0.046
3.809GlyAsn: 3.809 ± 0.077
1.406GlyPro: 1.406 ± 0.038
1.825GlyGln: 1.825 ± 0.039
2.155GlyArg: 2.155 ± 0.05
4.283GlySer: 4.283 ± 0.088
3.956GlyThr: 3.956 ± 0.091
4.329GlyVal: 4.329 ± 0.067
0.805GlyTrp: 0.805 ± 0.028
2.771GlyTyr: 2.771 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
0.932HisAla: 0.932 ± 0.029
0.183HisCys: 0.183 ± 0.012
0.72HisAsp: 0.72 ± 0.022
0.911HisGlu: 0.911 ± 0.029
1.072HisPhe: 1.072 ± 0.031
0.957HisGly: 0.957 ± 0.033
0.574HisHis: 0.574 ± 0.025
1.333HisIle: 1.333 ± 0.035
1.03HisLys: 1.03 ± 0.028
1.824HisLeu: 1.824 ± 0.052
0.275HisMet: 0.275 ± 0.017
0.769HisAsn: 0.769 ± 0.024
0.978HisPro: 0.978 ± 0.032
1.094HisGln: 1.094 ± 0.033
0.586HisArg: 0.586 ± 0.021
0.991HisSer: 0.991 ± 0.032
0.834HisThr: 0.834 ± 0.029
0.785HisVal: 0.785 ± 0.026
0.217HisTrp: 0.217 ± 0.014
0.762HisTyr: 0.762 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.926IleAla: 5.926 ± 0.078
0.711IleCys: 0.711 ± 0.024
4.473IleAsp: 4.473 ± 0.058
4.721IleGlu: 4.721 ± 0.071
3.675IlePhe: 3.675 ± 0.076
5.018IleGly: 5.018 ± 0.077
1.318IleHis: 1.318 ± 0.034
6.499IleIle: 6.499 ± 0.108
6.542IleLys: 6.542 ± 0.094
7.243IleLeu: 7.243 ± 0.095
1.377IleMet: 1.377 ± 0.033
5.222IleAsn: 5.222 ± 0.075
3.276IlePro: 3.276 ± 0.061
2.575IleGln: 2.575 ± 0.043
2.523IleArg: 2.523 ± 0.051
5.84IleSer: 5.84 ± 0.081
4.795IleThr: 4.795 ± 0.085
4.354IleVal: 4.354 ± 0.067
0.762IleTrp: 0.762 ± 0.026
3.051IleTyr: 3.051 ± 0.057
0.0IleXaa: 0.0 ± 0.0
Lys
5.465LysAla: 5.465 ± 0.078
0.361LysCys: 0.361 ± 0.019
4.473LysAsp: 4.473 ± 0.077
5.695LysGlu: 5.695 ± 0.086
3.174LysPhe: 3.174 ± 0.053
4.891LysGly: 4.891 ± 0.086
1.232LysHis: 1.232 ± 0.034
6.515LysIle: 6.515 ± 0.081
6.491LysLys: 6.491 ± 0.101
6.926LysLeu: 6.926 ± 0.093
2.054LysMet: 2.054 ± 0.046
5.43LysAsn: 5.43 ± 0.087
2.763LysPro: 2.763 ± 0.055
2.843LysGln: 2.843 ± 0.054
2.675LysArg: 2.675 ± 0.052
4.74LysSer: 4.74 ± 0.075
4.75LysThr: 4.75 ± 0.072
5.003LysVal: 5.003 ± 0.082
0.81LysTrp: 0.81 ± 0.028
3.069LysTyr: 3.069 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
6.593LeuAla: 6.593 ± 0.083
0.696LeuCys: 0.696 ± 0.029
4.881LeuAsp: 4.881 ± 0.067
5.303LeuGlu: 5.303 ± 0.072
4.903LeuPhe: 4.903 ± 0.079
5.762LeuGly: 5.762 ± 0.088
1.516LeuHis: 1.516 ± 0.041
7.557LeuIle: 7.557 ± 0.118
8.61LeuLys: 8.61 ± 0.109
8.939LeuLeu: 8.939 ± 0.127
2.141LeuMet: 2.141 ± 0.043
6.542LeuAsn: 6.542 ± 0.094
3.609LeuPro: 3.609 ± 0.057
3.28LeuGln: 3.28 ± 0.051
3.179LeuArg: 3.179 ± 0.057
7.03LeuSer: 7.03 ± 0.088
5.412LeuThr: 5.412 ± 0.081
5.436LeuVal: 5.436 ± 0.068
0.877LeuTrp: 0.877 ± 0.033
3.006LeuTyr: 3.006 ± 0.055
0.001LeuXaa: 0.001 ± 0.001
Met
1.659MetAla: 1.659 ± 0.042
0.119MetCys: 0.119 ± 0.01
1.11MetAsp: 1.11 ± 0.033
1.184MetGlu: 1.184 ± 0.036
0.779MetPhe: 0.779 ± 0.028
1.446MetGly: 1.446 ± 0.039
0.337MetHis: 0.337 ± 0.017
1.658MetIle: 1.658 ± 0.047
2.068MetLys: 2.068 ± 0.044
1.889MetLeu: 1.889 ± 0.043
0.666MetMet: 0.666 ± 0.026
1.261MetAsn: 1.261 ± 0.034
0.906MetPro: 0.906 ± 0.032
0.826MetGln: 0.826 ± 0.026
0.752MetArg: 0.752 ± 0.026
1.288MetSer: 1.288 ± 0.036
0.949MetThr: 0.949 ± 0.03
1.396MetVal: 1.396 ± 0.036
0.178MetTrp: 0.178 ± 0.015
0.568MetTyr: 0.568 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
4.123AsnAla: 4.123 ± 0.073
0.475AsnCys: 0.475 ± 0.025
2.891AsnAsp: 2.891 ± 0.05
3.339AsnGlu: 3.339 ± 0.056
3.319AsnPhe: 3.319 ± 0.062
4.357AsnGly: 4.357 ± 0.092
1.131AsnHis: 1.131 ± 0.028
4.871AsnIle: 4.871 ± 0.069
4.49AsnLys: 4.49 ± 0.075
5.945AsnLeu: 5.945 ± 0.072
1.202AsnMet: 1.202 ± 0.036
3.994AsnAsn: 3.994 ± 0.075
2.896AsnPro: 2.896 ± 0.054
2.83AsnGln: 2.83 ± 0.056
2.1AsnArg: 2.1 ± 0.048
3.919AsnSer: 3.919 ± 0.069
3.456AsnThr: 3.456 ± 0.086
3.312AsnVal: 3.312 ± 0.06
0.841AsnTrp: 0.841 ± 0.031
3.023AsnTyr: 3.023 ± 0.055
0.0AsnXaa: 0.0 ± 0.0
Pro
2.437ProAla: 2.437 ± 0.051
0.204ProCys: 0.204 ± 0.015
2.119ProAsp: 2.119 ± 0.043
2.437ProGlu: 2.437 ± 0.041
1.995ProPhe: 1.995 ± 0.046
2.004ProGly: 2.004 ± 0.048
0.637ProHis: 0.637 ± 0.024
2.702ProIle: 2.702 ± 0.055
2.521ProLys: 2.521 ± 0.051
3.155ProLeu: 3.155 ± 0.062
0.665ProMet: 0.665 ± 0.025
2.264ProAsn: 2.264 ± 0.049
0.789ProPro: 0.789 ± 0.028
1.407ProGln: 1.407 ± 0.037
0.92ProArg: 0.92 ± 0.033
2.263ProSer: 2.263 ± 0.048
2.044ProThr: 2.044 ± 0.056
2.575ProVal: 2.575 ± 0.053
0.353ProTrp: 0.353 ± 0.017
1.387ProTyr: 1.387 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
2.316GlnAla: 2.316 ± 0.048
0.165GlnCys: 0.165 ± 0.012
1.704GlnAsp: 1.704 ± 0.036
2.264GlnGlu: 2.264 ± 0.054
1.82GlnPhe: 1.82 ± 0.038
1.806GlnGly: 1.806 ± 0.045
0.705GlnHis: 0.705 ± 0.026
3.016GlnIle: 3.016 ± 0.045
3.234GlnLys: 3.234 ± 0.062
3.781GlnLeu: 3.781 ± 0.065
0.818GlnMet: 0.818 ± 0.031
2.535GlnAsn: 2.535 ± 0.051
1.276GlnPro: 1.276 ± 0.034
1.852GlnGln: 1.852 ± 0.047
1.22GlnArg: 1.22 ± 0.036
2.183GlnSer: 2.183 ± 0.047
2.238GlnThr: 2.238 ± 0.047
2.071GlnVal: 2.071 ± 0.042
0.375GlnTrp: 0.375 ± 0.019
1.357GlnTyr: 1.357 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
1.965ArgAla: 1.965 ± 0.044
0.194ArgCys: 0.194 ± 0.012
1.66ArgAsp: 1.66 ± 0.038
1.987ArgGlu: 1.987 ± 0.045
1.86ArgPhe: 1.86 ± 0.039
1.977ArgGly: 1.977 ± 0.046
0.471ArgHis: 0.471 ± 0.02
2.736ArgIle: 2.736 ± 0.049
2.602ArgLys: 2.602 ± 0.047
3.163ArgLeu: 3.163 ± 0.061
0.825ArgMet: 0.825 ± 0.031
2.039ArgAsn: 2.039 ± 0.046
1.058ArgPro: 1.058 ± 0.034
1.024ArgGln: 1.024 ± 0.036
1.246ArgArg: 1.246 ± 0.04
1.947ArgSer: 1.947 ± 0.045
1.819ArgThr: 1.819 ± 0.044
2.146ArgVal: 2.146 ± 0.042
0.415ArgTrp: 0.415 ± 0.02
1.477ArgTyr: 1.477 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
4.488SerAla: 4.488 ± 0.078
0.58SerCys: 0.58 ± 0.022
3.206SerAsp: 3.206 ± 0.057
3.365SerGlu: 3.365 ± 0.051
4.048SerPhe: 4.048 ± 0.062
4.668SerGly: 4.668 ± 0.093
0.976SerHis: 0.976 ± 0.029
5.241SerIle: 5.241 ± 0.079
4.838SerLys: 4.838 ± 0.063
6.591SerLeu: 6.591 ± 0.083
1.189SerMet: 1.189 ± 0.028
3.808SerAsn: 3.808 ± 0.07
2.257SerPro: 2.257 ± 0.049
2.126SerGln: 2.126 ± 0.051
2.091SerArg: 2.091 ± 0.047
4.571SerSer: 4.571 ± 0.092
3.694SerThr: 3.694 ± 0.082
4.01SerVal: 4.01 ± 0.068
0.725SerTrp: 0.725 ± 0.028
2.786SerTyr: 2.786 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
4.343ThrAla: 4.343 ± 0.121
0.365ThrCys: 0.365 ± 0.019
3.147ThrAsp: 3.147 ± 0.079
3.116ThrGlu: 3.116 ± 0.06
2.933ThrPhe: 2.933 ± 0.059
4.386ThrGly: 4.386 ± 0.116
1.03ThrHis: 1.03 ± 0.034
4.39ThrIle: 4.39 ± 0.077
3.659ThrLys: 3.659 ± 0.059
5.372ThrLeu: 5.372 ± 0.078
0.861ThrMet: 0.861 ± 0.029
3.283ThrAsn: 3.283 ± 0.075
2.383ThrPro: 2.383 ± 0.043
2.108ThrGln: 2.108 ± 0.048
1.527ThrArg: 1.527 ± 0.037
3.831ThrSer: 3.831 ± 0.088
3.325ThrThr: 3.325 ± 0.082
3.675ThrVal: 3.675 ± 0.081
0.557ThrTrp: 0.557 ± 0.024
2.363ThrTyr: 2.363 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
4.381ValAla: 4.381 ± 0.07
0.522ValCys: 0.522 ± 0.02
3.204ValAsp: 3.204 ± 0.051
3.544ValGlu: 3.544 ± 0.068
3.342ValPhe: 3.342 ± 0.063
3.852ValGly: 3.852 ± 0.078
0.823ValHis: 0.823 ± 0.023
4.817ValIle: 4.817 ± 0.065
4.921ValLys: 4.921 ± 0.065
5.661ValLeu: 5.661 ± 0.081
1.331ValMet: 1.331 ± 0.034
3.939ValAsn: 3.939 ± 0.065
2.018ValPro: 2.018 ± 0.047
1.709ValGln: 1.709 ± 0.041
1.862ValArg: 1.862 ± 0.044
4.393ValSer: 4.393 ± 0.069
3.487ValThr: 3.487 ± 0.083
4.014ValVal: 4.014 ± 0.071
0.643ValTrp: 0.643 ± 0.024
2.402ValTyr: 2.402 ± 0.049
0.0ValXaa: 0.0 ± 0.0
Trp
0.668TrpAla: 0.668 ± 0.024
0.103TrpCys: 0.103 ± 0.009
0.631TrpAsp: 0.631 ± 0.029
0.691TrpGlu: 0.691 ± 0.027
0.555TrpPhe: 0.555 ± 0.023
0.736TrpGly: 0.736 ± 0.027
0.212TrpHis: 0.212 ± 0.012
0.752TrpIle: 0.752 ± 0.024
0.817TrpLys: 0.817 ± 0.033
1.082TrpLeu: 1.082 ± 0.037
0.304TrpMet: 0.304 ± 0.015
0.679TrpAsn: 0.679 ± 0.022
0.299TrpPro: 0.299 ± 0.016
0.419TrpGln: 0.419 ± 0.018
0.438TrpArg: 0.438 ± 0.022
0.646TrpSer: 0.646 ± 0.023
0.58TrpThr: 0.58 ± 0.025
0.643TrpVal: 0.643 ± 0.026
0.157TrpTrp: 0.157 ± 0.012
0.443TrpTyr: 0.443 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.608TyrAla: 2.608 ± 0.048
0.337TyrCys: 0.337 ± 0.019
2.127TyrAsp: 2.127 ± 0.045
1.991TyrGlu: 1.991 ± 0.045
2.431TyrPhe: 2.431 ± 0.048
2.647TyrGly: 2.647 ± 0.06
0.86TyrHis: 0.86 ± 0.029
2.61TyrIle: 2.61 ± 0.052
2.826TyrLys: 2.826 ± 0.05
4.094TyrLeu: 4.094 ± 0.069
0.626TyrMet: 0.626 ± 0.024
2.294TyrAsn: 2.294 ± 0.047
1.579TyrPro: 1.579 ± 0.041
2.031TyrGln: 2.031 ± 0.048
1.56TyrArg: 1.56 ± 0.041
2.667TyrSer: 2.667 ± 0.054
2.333TyrThr: 2.333 ± 0.055
1.996TyrVal: 1.996 ± 0.04
0.484TyrTrp: 0.484 ± 0.023
1.715TyrTyr: 1.715 ± 0.039
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3357 proteins (1173386 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski