Amino acid dipeptide frequency for Novosphingobium sp. ST904

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
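The calculation described above can be sketched in Python. This is a minimal illustration and not the code used by Proteome-pI: the function names, the toy sequences, and the choice to count overlapping residue pairs within each protein are assumptions made here. The uniform expectation of 1/400 (2.5‰) is the one stated in the text.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein, and pairs
    never span two proteins. Non-standard residues are mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # shown in red in the original table
    if freq_permille < expected:
        return "under"  # shown in blue in the original table
    return "as expected"


# Hypothetical toy input, not real proteome data.
toy = ["MAALAG", "AAXL"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Applied to the values listed below, `classify(18.204)` labels AlaAla as overrepresented and `classify(0.106)` labels CysCys as underrepresented.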

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.204 ± 0.157
AlaCys: 1.153 ± 0.03
AlaAsp: 7.104 ± 0.076
AlaGlu: 7.638 ± 0.101
AlaPhe: 4.263 ± 0.055
AlaGly: 11.348 ± 0.096
AlaHis: 2.376 ± 0.038
AlaIle: 6.369 ± 0.077
AlaLys: 3.804 ± 0.063
AlaLeu: 13.622 ± 0.117
AlaMet: 4.016 ± 0.057
AlaAsn: 3.005 ± 0.053
AlaPro: 6.228 ± 0.08
AlaGln: 4.444 ± 0.065
AlaArg: 9.91 ± 0.11
AlaSer: 6.986 ± 0.086
AlaThr: 6.02 ± 0.075
AlaVal: 8.601 ± 0.099
AlaTrp: 1.751 ± 0.044
AlaTyr: 2.498 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.05 ± 0.027
CysCys: 0.106 ± 0.009
CysAsp: 0.526 ± 0.024
CysGlu: 0.513 ± 0.019
CysPhe: 0.305 ± 0.014
CysGly: 0.92 ± 0.028
CysHis: 0.225 ± 0.014
CysIle: 0.399 ± 0.016
CysLys: 0.187 ± 0.013
CysLeu: 0.778 ± 0.027
CysMet: 0.159 ± 0.012
CysAsn: 0.242 ± 0.014
CysPro: 0.478 ± 0.019
CysGln: 0.231 ± 0.013
CysArg: 0.632 ± 0.022
CysSer: 0.486 ± 0.02
CysThr: 0.415 ± 0.018
CysVal: 0.543 ± 0.021
CysTrp: 0.142 ± 0.011
CysTyr: 0.179 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.242 ± 0.084
AspCys: 0.513 ± 0.023
AspAsp: 3.113 ± 0.055
AspGlu: 3.44 ± 0.055
AspPhe: 2.153 ± 0.038
AspGly: 5.407 ± 0.077
AspHis: 1.312 ± 0.034
AspIle: 2.763 ± 0.043
AspLys: 1.678 ± 0.045
AspLeu: 5.752 ± 0.066
AspMet: 1.393 ± 0.03
AspAsn: 1.311 ± 0.034
AspPro: 3.6 ± 0.054
AspGln: 1.735 ± 0.038
AspArg: 4.7 ± 0.07
AspSer: 2.382 ± 0.041
AspThr: 2.545 ± 0.044
AspVal: 3.935 ± 0.058
AspTrp: 1.173 ± 0.03
AspTyr: 1.681 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.988 ± 0.087
GluCys: 0.401 ± 0.017
GluAsp: 2.945 ± 0.056
GluGlu: 3.103 ± 0.054
GluPhe: 1.704 ± 0.038
GluGly: 4.689 ± 0.061
GluHis: 1.257 ± 0.031
GluIle: 3.063 ± 0.052
GluLys: 2.038 ± 0.045
GluLeu: 5.29 ± 0.086
GluMet: 1.468 ± 0.037
GluAsn: 1.5 ± 0.035
GluPro: 2.632 ± 0.054
GluGln: 2.273 ± 0.037
GluArg: 4.872 ± 0.074
GluSer: 2.385 ± 0.044
GluThr: 3.387 ± 0.054
GluVal: 3.815 ± 0.061
GluTrp: 0.859 ± 0.027
GluTyr: 1.043 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.77 ± 0.064
PheCys: 0.346 ± 0.014
PheAsp: 2.694 ± 0.048
PheGlu: 2.109 ± 0.037
PhePhe: 1.251 ± 0.035
PheGly: 3.617 ± 0.061
PheHis: 0.746 ± 0.024
PheIle: 1.339 ± 0.032
PheLys: 0.846 ± 0.024
PheLeu: 3.129 ± 0.047
PheMet: 0.775 ± 0.023
PheAsn: 1.01 ± 0.027
PhePro: 1.548 ± 0.033
PheGln: 0.874 ± 0.027
PheArg: 2.31 ± 0.04
PheSer: 2.196 ± 0.042
PheThr: 2.053 ± 0.043
PheVal: 2.646 ± 0.046
PheTrp: 0.535 ± 0.02
PheTyr: 0.852 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.632 ± 0.101
GlyCys: 0.89 ± 0.026
GlyAsp: 4.758 ± 0.068
GlyGlu: 5.183 ± 0.064
GlyPhe: 3.592 ± 0.051
GlyGly: 8.003 ± 0.128
GlyHis: 1.909 ± 0.044
GlyIle: 4.477 ± 0.068
GlyLys: 3.598 ± 0.053
GlyLeu: 8.8 ± 0.096
GlyMet: 2.489 ± 0.047
GlyAsn: 2.458 ± 0.047
GlyPro: 3.712 ± 0.055
GlyGln: 3.003 ± 0.054
GlyArg: 6.364 ± 0.073
GlySer: 5.178 ± 0.073
GlyThr: 5.098 ± 0.071
GlyVal: 6.023 ± 0.076
GlyTrp: 1.613 ± 0.034
GlyTyr: 2.392 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.523 ± 0.046
HisCys: 0.239 ± 0.014
HisAsp: 1.243 ± 0.035
HisGlu: 1.139 ± 0.029
HisPhe: 0.885 ± 0.03
HisGly: 2.035 ± 0.042
HisHis: 0.571 ± 0.022
HisIle: 0.84 ± 0.025
HisLys: 0.466 ± 0.02
HisLeu: 1.893 ± 0.035
HisMet: 0.461 ± 0.02
HisAsn: 0.459 ± 0.02
HisPro: 1.271 ± 0.031
HisGln: 0.567 ± 0.023
HisArg: 1.501 ± 0.032
HisSer: 1.007 ± 0.027
HisThr: 0.772 ± 0.025
HisVal: 1.527 ± 0.037
HisTrp: 0.365 ± 0.017
HisTyr: 0.595 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.394 ± 0.084
IleCys: 0.45 ± 0.019
IleAsp: 3.693 ± 0.061
IleGlu: 3.424 ± 0.054
IlePhe: 1.385 ± 0.035
IleGly: 4.998 ± 0.069
IleHis: 0.87 ± 0.028
IleIle: 1.634 ± 0.038
IleLys: 1.121 ± 0.034
IleLeu: 3.722 ± 0.065
IleMet: 0.876 ± 0.028
IleAsn: 1.228 ± 0.034
IlePro: 2.297 ± 0.046
IleGln: 1.194 ± 0.029
IleArg: 3.07 ± 0.053
IleSer: 2.738 ± 0.043
IleThr: 2.429 ± 0.045
IleVal: 3.77 ± 0.058
IleTrp: 0.617 ± 0.022
IleTyr: 1.06 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.197 ± 0.07
LysCys: 0.168 ± 0.011
LysAsp: 1.599 ± 0.038
LysGlu: 1.288 ± 0.037
LysPhe: 0.876 ± 0.026
LysGly: 2.76 ± 0.046
LysHis: 0.511 ± 0.019
LysIle: 1.42 ± 0.033
LysLys: 0.956 ± 0.028
LysLeu: 3.104 ± 0.05
LysMet: 0.723 ± 0.027
LysAsn: 0.729 ± 0.024
LysPro: 1.985 ± 0.045
LysGln: 0.962 ± 0.029
LysArg: 2.163 ± 0.041
LysSer: 1.672 ± 0.041
LysThr: 1.638 ± 0.038
LysVal: 2.409 ± 0.044
LysTrp: 0.423 ± 0.017
LysTyr: 0.586 ± 0.019
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.133 ± 0.11
LeuCys: 0.85 ± 0.025
LeuAsp: 6.204 ± 0.074
LeuGlu: 5.188 ± 0.068
LeuPhe: 3.446 ± 0.056
LeuGly: 8.552 ± 0.096
LeuHis: 1.893 ± 0.039
LeuIle: 4.379 ± 0.065
LeuLys: 2.945 ± 0.05
LeuLeu: 9.449 ± 0.116
LeuMet: 2.131 ± 0.042
LeuAsn: 2.309 ± 0.046
LeuPro: 5.637 ± 0.069
LeuGln: 2.682 ± 0.045
LeuArg: 7.094 ± 0.085
LeuSer: 6.145 ± 0.071
LeuThr: 5.561 ± 0.064
LeuVal: 7.378 ± 0.083
LeuTrp: 1.206 ± 0.036
LeuTyr: 1.937 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.441 ± 0.045
MetCys: 0.156 ± 0.011
MetAsp: 1.16 ± 0.03
MetGlu: 1.129 ± 0.029
MetPhe: 0.745 ± 0.024
MetGly: 1.921 ± 0.045
MetHis: 0.415 ± 0.017
MetIle: 1.326 ± 0.03
MetLys: 0.955 ± 0.024
MetLeu: 2.68 ± 0.045
MetMet: 0.655 ± 0.025
MetAsn: 0.747 ± 0.022
MetPro: 1.547 ± 0.035
MetGln: 0.791 ± 0.026
MetArg: 1.836 ± 0.038
MetSer: 1.524 ± 0.036
MetThr: 1.789 ± 0.042
MetVal: 1.673 ± 0.036
MetTrp: 0.213 ± 0.013
MetTyr: 0.251 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.14 ± 0.05
AsnCys: 0.248 ± 0.013
AsnAsp: 1.457 ± 0.034
AsnGlu: 1.141 ± 0.03
AsnPhe: 0.935 ± 0.029
AsnGly: 2.501 ± 0.05
AsnHis: 0.512 ± 0.02
AsnIle: 1.132 ± 0.035
AsnLys: 0.617 ± 0.023
AsnLeu: 2.622 ± 0.046
AsnMet: 0.558 ± 0.024
AsnAsn: 0.674 ± 0.022
AsnPro: 1.776 ± 0.042
AsnGln: 0.699 ± 0.026
AsnArg: 1.927 ± 0.037
AsnSer: 1.282 ± 0.035
AsnThr: 1.189 ± 0.032
AsnVal: 1.876 ± 0.034
AsnTrp: 0.453 ± 0.019
AsnTyr: 0.713 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.03 ± 0.088
ProCys: 0.384 ± 0.017
ProAsp: 3.535 ± 0.06
ProGlu: 3.908 ± 0.055
ProPhe: 1.919 ± 0.037
ProGly: 4.924 ± 0.065
ProHis: 1.025 ± 0.026
ProIle: 2.283 ± 0.042
ProLys: 1.426 ± 0.034
ProLeu: 4.902 ± 0.06
ProMet: 1.315 ± 0.03
ProAsn: 1.224 ± 0.03
ProPro: 2.592 ± 0.062
ProGln: 1.778 ± 0.04
ProArg: 3.219 ± 0.052
ProSer: 2.897 ± 0.056
ProThr: 2.339 ± 0.049
ProVal: 4.361 ± 0.061
ProTrp: 0.697 ± 0.024
ProTyr: 1.116 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.222 ± 0.069
GlnCys: 0.247 ± 0.016
GlnAsp: 1.59 ± 0.035
GlnGlu: 1.407 ± 0.032
GlnPhe: 1.183 ± 0.03
GlnGly: 2.633 ± 0.044
GlnHis: 0.586 ± 0.022
GlnIle: 1.744 ± 0.039
GlnLys: 0.955 ± 0.024
GlnLeu: 3.019 ± 0.049
GlnMet: 0.857 ± 0.025
GlnAsn: 0.81 ± 0.027
GlnPro: 1.798 ± 0.037
GlnGln: 1.312 ± 0.039
GlnArg: 2.432 ± 0.045
GlnSer: 1.899 ± 0.038
GlnThr: 1.636 ± 0.034
GlnVal: 2.442 ± 0.047
GlnTrp: 0.485 ± 0.02
GlnTyr: 0.639 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.298 ± 0.081
ArgCys: 0.507 ± 0.019
ArgAsp: 4.097 ± 0.056
ArgGlu: 4.482 ± 0.065
ArgPhe: 3.162 ± 0.049
ArgGly: 5.204 ± 0.061
ArgHis: 1.805 ± 0.041
ArgIle: 4.168 ± 0.061
ArgLys: 2.534 ± 0.044
ArgLeu: 7.942 ± 0.107
ArgMet: 2.015 ± 0.035
ArgAsn: 1.914 ± 0.037
ArgPro: 3.687 ± 0.059
ArgGln: 2.596 ± 0.053
ArgArg: 5.9 ± 0.08
ArgSer: 4.018 ± 0.06
ArgThr: 3.542 ± 0.055
ArgVal: 4.685 ± 0.06
ArgTrp: 1.214 ± 0.03
ArgTyr: 1.937 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.839 ± 0.082
SerCys: 0.45 ± 0.019
SerAsp: 3.084 ± 0.049
SerGlu: 2.862 ± 0.044
SerPhe: 2.144 ± 0.043
SerGly: 5.751 ± 0.076
SerHis: 1.093 ± 0.03
SerIle: 2.602 ± 0.047
SerLys: 1.517 ± 0.041
SerLeu: 5.603 ± 0.069
SerMet: 1.35 ± 0.031
SerAsn: 1.391 ± 0.03
SerPro: 2.983 ± 0.043
SerGln: 1.734 ± 0.038
SerArg: 3.951 ± 0.063
SerSer: 3.132 ± 0.055
SerThr: 2.782 ± 0.052
SerVal: 3.835 ± 0.052
SerTrp: 0.85 ± 0.029
SerTyr: 1.423 ± 0.039
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.298 ± 0.073
ThrCys: 0.425 ± 0.019
ThrAsp: 2.718 ± 0.05
ThrGlu: 2.461 ± 0.047
ThrPhe: 1.902 ± 0.041
ThrGly: 5.421 ± 0.073
ThrHis: 0.95 ± 0.032
ThrIle: 2.816 ± 0.051
ThrLys: 1.311 ± 0.036
ThrLeu: 5.583 ± 0.07
ThrMet: 1.271 ± 0.03
ThrAsn: 1.274 ± 0.031
ThrPro: 3.257 ± 0.05
ThrGln: 1.498 ± 0.042
ThrArg: 3.492 ± 0.052
ThrSer: 2.783 ± 0.048
ThrThr: 2.773 ± 0.057
ThrVal: 4.176 ± 0.064
ThrTrp: 0.697 ± 0.024
ThrTyr: 1.281 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.846 ± 0.083
ValCys: 0.591 ± 0.02
ValAsp: 4.081 ± 0.056
ValGlu: 4.401 ± 0.064
ValPhe: 2.286 ± 0.039
ValGly: 5.292 ± 0.081
ValHis: 1.414 ± 0.031
ValIle: 3.681 ± 0.055
ValLys: 2.047 ± 0.044
ValLeu: 7.205 ± 0.09
ValMet: 1.685 ± 0.039
ValAsn: 2.011 ± 0.044
ValPro: 4.048 ± 0.067
ValGln: 2.133 ± 0.036
ValArg: 5.006 ± 0.067
ValSer: 4.496 ± 0.061
ValThr: 4.491 ± 0.077
ValVal: 5.14 ± 0.08
ValTrp: 0.912 ± 0.029
ValTyr: 1.385 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.432 ± 0.033
TrpCys: 0.144 ± 0.011
TrpAsp: 0.742 ± 0.028
TrpGlu: 0.649 ± 0.023
TrpPhe: 0.618 ± 0.022
TrpGly: 1.029 ± 0.031
TrpHis: 0.4 ± 0.018
TrpIle: 0.706 ± 0.024
TrpLys: 0.569 ± 0.022
TrpLeu: 1.809 ± 0.045
TrpMet: 0.354 ± 0.016
TrpAsn: 0.526 ± 0.019
TrpPro: 0.712 ± 0.028
TrpGln: 0.666 ± 0.023
TrpArg: 1.359 ± 0.036
TrpSer: 0.934 ± 0.025
TrpThr: 0.784 ± 0.024
TrpVal: 0.812 ± 0.025
TrpTrp: 0.267 ± 0.017
TrpTyr: 0.321 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.66 ± 0.04
TyrCys: 0.25 ± 0.013
TyrAsp: 1.486 ± 0.036
TyrGlu: 1.246 ± 0.028
TyrPhe: 0.891 ± 0.026
TyrGly: 2.168 ± 0.042
TyrHis: 0.492 ± 0.019
TyrIle: 0.826 ± 0.029
TyrLys: 0.597 ± 0.023
TyrLeu: 2.213 ± 0.04
TyrMet: 0.403 ± 0.016
TyrAsn: 0.631 ± 0.026
TyrPro: 1.089 ± 0.032
TyrGln: 0.721 ± 0.027
TyrArg: 1.969 ± 0.047
TyrSer: 1.234 ± 0.037
TyrThr: 1.146 ± 0.031
TyrVal: 1.506 ± 0.035
TyrTrp: 0.362 ± 0.017
TyrTyr: 0.641 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4594 proteins (1316690 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
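A protein-level bootstrap of this kind could look like the following Python sketch. It is an illustration under stated assumptions, not the Proteome-pI implementation: the function name `bootstrap_error`, the fixed random seed, the toy sequences, and the use of the standard deviation across replicates as the reported error are all assumptions made here. Whole proteins are resampled with replacement, and the per mille frequency of one dipeptide is recomputed for each replicate.

```python
import random
from statistics import stdev


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumption: the error is the standard deviation of the frequency over
    bootstrap replicates, each drawn by resampling whole proteins.
    """
    rng = random.Random(seed)
    # Per protein: (occurrences of `pair`, total number of overlapping pairs).
    per_protein = [
        (
            sum(1 for i in range(len(s) - 1) if s[i:i + 2] == pair),
            max(len(s) - 1, 0),
        )
        for s in proteins
    ]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy input, not real proteome data.
toy = ["MAALAG", "AAXL", "GGGG"]
print(round(bootstrap_error(toy, "AA"), 3))
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so the spread reflects variation between proteins.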

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski