Amino acid dipeptide frequency for Sphingomonas sp. MM-1

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
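The counting described above can be illustrated with a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that pairs never span two proteins, and that any non-standard symbol is mapped to X (Xaa); the source does not spell out these details, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard symbol is treated as X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation, in per mille, for 400 and for 441 possible dipeptides.
uniform_400 = 1000.0 / 400
uniform_441 = 1000.0 / 441

# Toy input (not real proteome data): 6 dipeptides in total, 2 of them "AA".
toy = ["MAAL", "AAXA"]
freqs = dipeptide_per_mille(toy)
```

On real data, a value above `uniform_400` would correspond to a red cell and a value below it to a blue cell, if the colouring uses the uniform expectation as its threshold.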

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
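As a worked illustration of the unit conversion, the short sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the uniform expectation of 2.5‰ for 400 dipeptides. The helper names are hypothetical and only serve this example.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # uniform expectation for 400 dipeptides: 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per-mille frequency to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 21.84  # AlaAla, in per mille, taken from the table
cys_cys = 0.089  # CysCys, in per mille, taken from the table

print(per_mille_to_fraction(ala_ala))      # about 0.02184, i.e. roughly 2.18% of all dipeptides
print(round(fold_vs_uniform(ala_ala), 2))  # about 8.74 times the uniform expectation
print(round(fold_vs_uniform(cys_cys), 3))  # about 0.036 times the uniform expectation
```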

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Each cell is the dipeptide formed by the row label followed by the column label (e.g. row Ala, column Cys is AlaCys).

| First ↓ / Second → | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 21.84 ± 0.206 | 1.061 ± 0.03 | 8.708 ± 0.085 | 8.378 ± 0.106 | 4.487 ± 0.066 | 12.869 ± 0.158 | 2.395 ± 0.046 | 7.581 ± 0.096 | 3.771 ± 0.066 | 14.47 ± 0.136 | 4.091 ± 0.066 | 3.066 ± 0.055 | 6.8 ± 0.085 | 4.123 ± 0.057 | 11.133 ± 0.131 | 6.327 ± 0.074 | 6.692 ± 0.077 | 9.098 ± 0.101 | 1.778 ± 0.041 | 2.769 ± 0.048 | 0.0 ± 0.0 |
| Cys | 0.953 ± 0.026 | 0.089 ± 0.009 | 0.444 ± 0.019 | 0.346 ± 0.015 | 0.263 ± 0.013 | 0.82 ± 0.03 | 0.221 ± 0.015 | 0.327 ± 0.015 | 0.134 ± 0.01 | 0.667 ± 0.023 | 0.136 ± 0.011 | 0.154 ± 0.011 | 0.452 ± 0.019 | 0.184 ± 0.012 | 0.574 ± 0.024 | 0.368 ± 0.018 | 0.339 ± 0.018 | 0.48 ± 0.021 | 0.109 ± 0.008 | 0.163 ± 0.013 | 0.0 ± 0.0 |
| Asp | 8.285 ± 0.096 | 0.458 ± 0.017 | 3.32 ± 0.056 | 3.353 ± 0.063 | 2.156 ± 0.038 | 5.993 ± 0.089 | 1.423 ± 0.034 | 3.008 ± 0.046 | 1.498 ± 0.037 | 5.778 ± 0.073 | 1.381 ± 0.032 | 1.325 ± 0.033 | 4.0 ± 0.068 | 1.794 ± 0.036 | 5.506 ± 0.065 | 2.184 ± 0.047 | 2.341 ± 0.042 | 4.139 ± 0.053 | 1.079 ± 0.03 | 1.609 ± 0.039 | 0.0 ± 0.0 |
| Glu | 8.326 ± 0.106 | 0.303 ± 0.016 | 2.69 ± 0.046 | 2.963 ± 0.058 | 1.442 ± 0.035 | 4.769 ± 0.065 | 1.049 ± 0.028 | 2.931 ± 0.052 | 1.886 ± 0.048 | 4.796 ± 0.069 | 1.452 ± 0.036 | 1.325 ± 0.032 | 2.634 ± 0.052 | 1.843 ± 0.037 | 5.036 ± 0.075 | 2.149 ± 0.042 | 3.086 ± 0.045 | 3.44 ± 0.053 | 0.756 ± 0.023 | 1.006 ± 0.028 | 0.0 ± 0.0 |
| Phe | 4.777 ± 0.067 | 0.276 ± 0.014 | 2.643 ± 0.052 | 1.871 ± 0.038 | 1.187 ± 0.038 | 3.552 ± 0.059 | 0.744 ± 0.027 | 1.387 ± 0.033 | 0.765 ± 0.028 | 3.024 ± 0.058 | 0.686 ± 0.024 | 1.002 ± 0.029 | 1.484 ± 0.036 | 0.934 ± 0.027 | 2.384 ± 0.046 | 1.879 ± 0.039 | 1.99 ± 0.042 | 2.507 ± 0.045 | 0.529 ± 0.018 | 0.844 ± 0.026 | 0.0 ± 0.0 |
| Gly | 10.527 ± 0.131 | 0.782 ± 0.027 | 5.266 ± 0.069 | 4.99 ± 0.071 | 3.597 ± 0.056 | 8.777 ± 0.166 | 1.975 ± 0.04 | 4.636 ± 0.06 | 3.016 ± 0.048 | 9.152 ± 0.09 | 2.378 ± 0.044 | 2.197 ± 0.052 | 3.849 ± 0.053 | 3.022 ± 0.051 | 7.519 ± 0.085 | 4.598 ± 0.087 | 4.889 ± 0.154 | 6.235 ± 0.078 | 1.719 ± 0.038 | 2.279 ± 0.04 | 0.0 ± 0.0 |
| His | 2.517 ± 0.049 | 0.211 ± 0.013 | 1.176 ± 0.031 | 0.931 ± 0.024 | 0.82 ± 0.027 | 2.038 ± 0.042 | 0.55 ± 0.023 | 0.979 ± 0.026 | 0.412 ± 0.018 | 1.898 ± 0.036 | 0.454 ± 0.02 | 0.427 ± 0.018 | 1.312 ± 0.037 | 0.543 ± 0.022 | 1.566 ± 0.034 | 0.841 ± 0.024 | 0.611 ± 0.024 | 1.476 ± 0.038 | 0.336 ± 0.017 | 0.576 ± 0.022 | 0.0 ± 0.0 |
| Ile | 8.544 ± 0.096 | 0.36 ± 0.02 | 4.028 ± 0.062 | 3.462 ± 0.053 | 1.576 ± 0.034 | 5.446 ± 0.066 | 0.922 ± 0.026 | 1.988 ± 0.044 | 1.039 ± 0.031 | 4.132 ± 0.07 | 0.765 ± 0.027 | 1.271 ± 0.041 | 2.181 ± 0.037 | 1.16 ± 0.03 | 3.559 ± 0.052 | 2.523 ± 0.044 | 2.416 ± 0.047 | 4.251 ± 0.063 | 0.598 ± 0.023 | 0.97 ± 0.027 | 0.0 ± 0.0 |
| Lys | 4.123 ± 0.071 | 0.118 ± 0.01 | 1.455 ± 0.044 | 1.159 ± 0.035 | 0.7 ± 0.026 | 2.486 ± 0.047 | 0.454 ± 0.019 | 1.285 ± 0.034 | 0.873 ± 0.03 | 2.902 ± 0.057 | 0.617 ± 0.025 | 0.623 ± 0.024 | 1.836 ± 0.042 | 0.806 ± 0.023 | 2.067 ± 0.045 | 1.431 ± 0.035 | 1.472 ± 0.041 | 2.036 ± 0.044 | 0.338 ± 0.016 | 0.527 ± 0.02 | 0.0 ± 0.0 |
| Leu | 15.196 ± 0.15 | 0.729 ± 0.027 | 6.237 ± 0.07 | 4.739 ± 0.067 | 3.451 ± 0.066 | 8.483 ± 0.094 | 1.769 ± 0.043 | 5.025 ± 0.068 | 3.005 ± 0.049 | 9.364 ± 0.116 | 2.092 ± 0.044 | 2.383 ± 0.045 | 5.685 ± 0.076 | 2.325 ± 0.049 | 7.114 ± 0.078 | 5.652 ± 0.065 | 5.092 ± 0.078 | 7.138 ± 0.08 | 1.281 ± 0.035 | 1.91 ± 0.034 | 0.0 ± 0.0 |
| Met | 3.539 ± 0.061 | 0.154 ± 0.011 | 1.067 ± 0.028 | 1.078 ± 0.031 | 0.67 ± 0.023 | 1.819 ± 0.043 | 0.417 ± 0.016 | 1.332 ± 0.034 | 0.857 ± 0.025 | 2.591 ± 0.051 | 0.618 ± 0.026 | 0.634 ± 0.023 | 1.452 ± 0.032 | 0.647 ± 0.022 | 1.854 ± 0.038 | 1.301 ± 0.03 | 1.604 ± 0.036 | 1.523 ± 0.035 | 0.225 ± 0.013 | 0.236 ± 0.014 | 0.0 ± 0.0 |
| Asn | 3.12 ± 0.053 | 0.198 ± 0.012 | 1.288 ± 0.038 | 1.102 ± 0.029 | 0.873 ± 0.027 | 2.455 ± 0.067 | 0.43 ± 0.018 | 1.296 ± 0.036 | 0.562 ± 0.021 | 2.429 ± 0.047 | 0.536 ± 0.018 | 0.648 ± 0.023 | 1.672 ± 0.036 | 0.732 ± 0.021 | 1.831 ± 0.037 | 1.152 ± 0.038 | 0.985 ± 0.029 | 1.772 ± 0.043 | 0.367 ± 0.021 | 0.657 ± 0.026 | 0.0 ± 0.0 |
| Pro | 8.185 ± 0.102 | 0.3 ± 0.015 | 3.881 ± 0.058 | 3.538 ± 0.061 | 1.922 ± 0.038 | 5.097 ± 0.074 | 1.025 ± 0.032 | 2.591 ± 0.043 | 1.316 ± 0.034 | 4.95 ± 0.073 | 1.246 ± 0.033 | 1.18 ± 0.029 | 3.133 ± 0.077 | 1.584 ± 0.035 | 3.306 ± 0.059 | 2.602 ± 0.048 | 2.466 ± 0.044 | 4.247 ± 0.064 | 0.735 ± 0.024 | 1.08 ± 0.027 | 0.0 ± 0.0 |
| Gln | 4.388 ± 0.067 | 0.186 ± 0.013 | 1.358 ± 0.028 | 1.258 ± 0.03 | 1.017 ± 0.027 | 2.447 ± 0.05 | 0.533 ± 0.023 | 1.645 ± 0.03 | 0.82 ± 0.025 | 2.757 ± 0.053 | 0.744 ± 0.029 | 0.68 ± 0.023 | 1.814 ± 0.042 | 1.085 ± 0.034 | 2.425 ± 0.047 | 1.551 ± 0.034 | 1.438 ± 0.034 | 2.137 ± 0.039 | 0.395 ± 0.019 | 0.575 ± 0.02 | 0.0 ± 0.0 |
| Arg | 10.049 ± 0.113 | 0.498 ± 0.02 | 4.661 ± 0.066 | 4.113 ± 0.065 | 3.167 ± 0.052 | 5.498 ± 0.073 | 1.866 ± 0.041 | 4.643 ± 0.065 | 2.05 ± 0.035 | 8.957 ± 0.112 | 2.022 ± 0.043 | 1.867 ± 0.043 | 4.377 ± 0.066 | 2.558 ± 0.043 | 6.567 ± 0.088 | 3.67 ± 0.058 | 3.563 ± 0.051 | 4.809 ± 0.069 | 1.323 ± 0.034 | 1.89 ± 0.044 | 0.0 ± 0.0 |
| Ser | 6.16 ± 0.07 | 0.358 ± 0.017 | 2.803 ± 0.047 | 2.218 ± 0.049 | 2.094 ± 0.041 | 5.24 ± 0.102 | 0.876 ± 0.026 | 2.629 ± 0.047 | 1.157 ± 0.031 | 4.913 ± 0.06 | 1.04 ± 0.029 | 1.193 ± 0.029 | 2.751 ± 0.04 | 1.421 ± 0.037 | 3.541 ± 0.053 | 2.46 ± 0.053 | 2.458 ± 0.049 | 3.336 ± 0.051 | 0.753 ± 0.02 | 1.287 ± 0.034 | 0.0 ± 0.0 |
| Thr | 6.534 ± 0.077 | 0.332 ± 0.018 | 2.767 ± 0.049 | 2.212 ± 0.04 | 1.696 ± 0.042 | 5.215 ± 0.09 | 0.853 ± 0.024 | 2.932 ± 0.06 | 1.14 ± 0.034 | 5.703 ± 0.098 | 1.064 ± 0.03 | 1.189 ± 0.038 | 3.247 ± 0.048 | 1.271 ± 0.032 | 3.403 ± 0.056 | 2.37 ± 0.055 | 2.505 ± 0.053 | 3.984 ± 0.06 | 0.555 ± 0.021 | 1.118 ± 0.03 | 0.0 ± 0.0 |
| Val | 10.137 ± 0.103 | 0.477 ± 0.021 | 4.54 ± 0.062 | 4.447 ± 0.071 | 1.957 ± 0.037 | 5.359 ± 0.071 | 1.37 ± 0.029 | 3.495 ± 0.055 | 1.989 ± 0.04 | 6.298 ± 0.077 | 1.555 ± 0.031 | 1.911 ± 0.052 | 3.965 ± 0.055 | 1.963 ± 0.034 | 5.411 ± 0.066 | 3.735 ± 0.061 | 4.33 ± 0.061 | 5.084 ± 0.063 | 0.742 ± 0.026 | 1.264 ± 0.033 | 0.0 ± 0.0 |
| Trp | 1.487 ± 0.036 | 0.119 ± 0.009 | 0.748 ± 0.025 | 0.603 ± 0.022 | 0.577 ± 0.019 | 0.94 ± 0.026 | 0.333 ± 0.015 | 0.685 ± 0.026 | 0.487 ± 0.019 | 1.798 ± 0.041 | 0.355 ± 0.014 | 0.452 ± 0.023 | 0.724 ± 0.028 | 0.579 ± 0.022 | 1.371 ± 0.033 | 0.782 ± 0.025 | 0.816 ± 0.027 | 0.79 ± 0.028 | 0.256 ± 0.016 | 0.28 ± 0.016 | 0.0 ± 0.0 |
| Tyr | 2.75 ± 0.047 | 0.214 ± 0.013 | 1.552 ± 0.039 | 1.12 ± 0.031 | 0.814 ± 0.025 | 2.1 ± 0.044 | 0.466 ± 0.018 | 0.798 ± 0.022 | 0.53 ± 0.02 | 2.094 ± 0.043 | 0.382 ± 0.016 | 0.58 ± 0.021 | 1.015 ± 0.033 | 0.679 ± 0.022 | 1.982 ± 0.047 | 1.112 ± 0.034 | 0.995 ± 0.028 | 1.548 ± 0.033 | 0.31 ± 0.014 | 0.571 ± 0.023 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 4252 proteins (1340665 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
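A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's actual code: it assumes that whole proteins are resampled with replacement, that "×100" means 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide among all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (not real proteome data).
toy = ["AAAA", "CCCC", "AACC", "ACAC"]
print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the error is estimated at the protein level.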

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski