Amino acid dipepetide frequency for Azospirillum lipoferum (strain 4B)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.122AlaAla: 20.122 ± 0.166
1.193AlaCys: 1.193 ± 0.028
7.821AlaAsp: 7.821 ± 0.074
8.244AlaGlu: 8.244 ± 0.08
3.994AlaPhe: 3.994 ± 0.055
12.291AlaGly: 12.291 ± 0.131
2.201AlaHis: 2.201 ± 0.04
5.507AlaIle: 5.507 ± 0.056
3.71AlaLys: 3.71 ± 0.054
14.802AlaLeu: 14.802 ± 0.131
3.717AlaMet: 3.717 ± 0.049
2.746AlaAsn: 2.746 ± 0.052
6.183AlaPro: 6.183 ± 0.079
3.79AlaGln: 3.79 ± 0.053
9.095AlaArg: 9.095 ± 0.093
5.847AlaSer: 5.847 ± 0.067
6.445AlaThr: 6.445 ± 0.085
10.438AlaVal: 10.438 ± 0.088
1.427AlaTrp: 1.427 ± 0.031
2.362AlaTyr: 2.362 ± 0.035
0.0AlaXaa: 0.0 ± 0.0
Cys
1.026CysAla: 1.026 ± 0.026
0.116CysCys: 0.116 ± 0.008
0.558CysAsp: 0.558 ± 0.018
0.39CysGlu: 0.39 ± 0.015
0.292CysPhe: 0.292 ± 0.012
0.987CysGly: 0.987 ± 0.027
0.256CysHis: 0.256 ± 0.013
0.369CysIle: 0.369 ± 0.014
0.175CysLys: 0.175 ± 0.009
0.856CysLeu: 0.856 ± 0.02
0.169CysMet: 0.169 ± 0.008
0.198CysAsn: 0.198 ± 0.01
0.504CysPro: 0.504 ± 0.018
0.221CysGln: 0.221 ± 0.011
0.768CysArg: 0.768 ± 0.021
0.462CysSer: 0.462 ± 0.02
0.424CysThr: 0.424 ± 0.016
0.571CysVal: 0.571 ± 0.018
0.119CysTrp: 0.119 ± 0.006
0.215CysTyr: 0.215 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.304AspAla: 7.304 ± 0.068
0.505AspCys: 0.505 ± 0.017
3.196AspAsp: 3.196 ± 0.058
3.237AspGlu: 3.237 ± 0.046
1.894AspPhe: 1.894 ± 0.035
6.162AspGly: 6.162 ± 0.103
1.301AspHis: 1.301 ± 0.03
2.687AspIle: 2.687 ± 0.039
1.361AspLys: 1.361 ± 0.031
6.333AspLeu: 6.333 ± 0.069
1.189AspMet: 1.189 ± 0.025
1.145AspAsn: 1.145 ± 0.026
3.873AspPro: 3.873 ± 0.049
1.582AspGln: 1.582 ± 0.029
5.495AspArg: 5.495 ± 0.059
2.626AspSer: 2.626 ± 0.04
2.691AspThr: 2.691 ± 0.072
3.854AspVal: 3.854 ± 0.062
0.996AspTrp: 0.996 ± 0.023
1.289AspTyr: 1.289 ± 0.027
0.0AspXaa: 0.0 ± 0.0
Glu
7.696GluAla: 7.696 ± 0.076
0.324GluCys: 0.324 ± 0.013
2.69GluAsp: 2.69 ± 0.041
3.419GluGlu: 3.419 ± 0.053
1.518GluPhe: 1.518 ± 0.029
4.045GluGly: 4.045 ± 0.064
1.045GluHis: 1.045 ± 0.026
2.722GluIle: 2.722 ± 0.041
1.72GluLys: 1.72 ± 0.036
5.45GluLeu: 5.45 ± 0.053
1.409GluMet: 1.409 ± 0.027
1.213GluAsn: 1.213 ± 0.025
2.828GluPro: 2.828 ± 0.044
2.07GluGln: 2.07 ± 0.039
5.504GluArg: 5.504 ± 0.063
2.355GluSer: 2.355 ± 0.035
3.211GluThr: 3.211 ± 0.056
3.939GluVal: 3.939 ± 0.054
0.627GluTrp: 0.627 ± 0.016
0.843GluTyr: 0.843 ± 0.022
0.0GluXaa: 0.0 ± 0.0
Phe
4.162PheAla: 4.162 ± 0.047
0.354PheCys: 0.354 ± 0.014
2.419PheAsp: 2.419 ± 0.038
1.733PheGlu: 1.733 ± 0.033
1.134PhePhe: 1.134 ± 0.027
3.313PheGly: 3.313 ± 0.048
0.734PheHis: 0.734 ± 0.02
1.331PheIle: 1.331 ± 0.027
0.841PheLys: 0.841 ± 0.023
3.256PheLeu: 3.256 ± 0.048
0.659PheMet: 0.659 ± 0.02
0.891PheAsn: 0.891 ± 0.021
1.544PhePro: 1.544 ± 0.03
1.026PheGln: 1.026 ± 0.024
2.246PheArg: 2.246 ± 0.035
1.774PheSer: 1.774 ± 0.039
1.952PheThr: 1.952 ± 0.033
2.415PheVal: 2.415 ± 0.037
0.442PheTrp: 0.442 ± 0.016
0.777PheTyr: 0.777 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
9.731GlyAla: 9.731 ± 0.091
0.958GlyCys: 0.958 ± 0.022
4.697GlyAsp: 4.697 ± 0.107
4.603GlyGlu: 4.603 ± 0.054
3.438GlyPhe: 3.438 ± 0.047
8.296GlyGly: 8.296 ± 0.166
1.968GlyHis: 1.968 ± 0.034
4.301GlyIle: 4.301 ± 0.052
2.939GlyLys: 2.939 ± 0.051
9.494GlyLeu: 9.494 ± 0.084
2.576GlyMet: 2.576 ± 0.04
2.203GlyAsn: 2.203 ± 0.077
3.873GlyPro: 3.873 ± 0.055
2.812GlyGln: 2.812 ± 0.043
7.167GlyArg: 7.167 ± 0.072
4.873GlySer: 4.873 ± 0.087
5.351GlyThr: 5.351 ± 0.144
6.413GlyVal: 6.413 ± 0.066
1.536GlyTrp: 1.536 ± 0.029
2.178GlyTyr: 2.178 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
2.437HisAla: 2.437 ± 0.04
0.231HisCys: 0.231 ± 0.011
1.125HisAsp: 1.125 ± 0.027
0.901HisGlu: 0.901 ± 0.021
0.764HisPhe: 0.764 ± 0.022
1.97HisGly: 1.97 ± 0.034
0.57HisHis: 0.57 ± 0.019
0.86HisIle: 0.86 ± 0.022
0.457HisLys: 0.457 ± 0.015
2.057HisLeu: 2.057 ± 0.037
0.456HisMet: 0.456 ± 0.015
0.421HisAsn: 0.421 ± 0.016
1.445HisPro: 1.445 ± 0.024
0.584HisGln: 0.584 ± 0.021
1.737HisArg: 1.737 ± 0.036
0.98HisSer: 0.98 ± 0.025
0.818HisThr: 0.818 ± 0.022
1.377HisVal: 1.377 ± 0.027
0.334HisTrp: 0.334 ± 0.014
0.496HisTyr: 0.496 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.601IleAla: 6.601 ± 0.062
0.378IleCys: 0.378 ± 0.013
3.396IleAsp: 3.396 ± 0.04
2.695IleGlu: 2.695 ± 0.041
1.229IlePhe: 1.229 ± 0.029
4.691IleGly: 4.691 ± 0.058
0.916IleHis: 0.916 ± 0.022
1.444IleIle: 1.444 ± 0.028
1.073IleLys: 1.073 ± 0.03
4.327IleLeu: 4.327 ± 0.06
0.751IleMet: 0.751 ± 0.025
1.103IleAsn: 1.103 ± 0.027
2.261IlePro: 2.261 ± 0.036
1.199IleGln: 1.199 ± 0.026
3.166IleArg: 3.166 ± 0.043
1.996IleSer: 1.996 ± 0.039
2.188IleThr: 2.188 ± 0.053
3.662IleVal: 3.662 ± 0.049
0.397IleTrp: 0.397 ± 0.015
0.858IleTyr: 0.858 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
3.923LysAla: 3.923 ± 0.054
0.13LysCys: 0.13 ± 0.008
1.526LysAsp: 1.526 ± 0.033
1.374LysGlu: 1.374 ± 0.032
0.658LysPhe: 0.658 ± 0.019
2.373LysGly: 2.373 ± 0.043
0.485LysHis: 0.485 ± 0.019
1.232LysIle: 1.232 ± 0.032
0.912LysLys: 0.912 ± 0.028
2.841LysLeu: 2.841 ± 0.049
0.626LysMet: 0.626 ± 0.018
0.645LysAsn: 0.645 ± 0.021
2.008LysPro: 2.008 ± 0.038
0.831LysGln: 0.831 ± 0.022
2.112LysArg: 2.112 ± 0.039
1.514LysSer: 1.514 ± 0.033
1.639LysThr: 1.639 ± 0.032
2.019LysVal: 2.019 ± 0.038
0.258LysTrp: 0.258 ± 0.01
0.471LysTyr: 0.471 ± 0.017
0.0LysXaa: 0.0 ± 0.0
Leu
14.61LeuAla: 14.61 ± 0.116
1.038LeuCys: 1.038 ± 0.022
6.502LeuAsp: 6.502 ± 0.055
5.441LeuGlu: 5.441 ± 0.058
3.751LeuPhe: 3.751 ± 0.046
8.497LeuGly: 8.497 ± 0.078
2.097LeuHis: 2.097 ± 0.033
4.344LeuIle: 4.344 ± 0.06
2.972LeuLys: 2.972 ± 0.045
10.928LeuLeu: 10.928 ± 0.102
2.339LeuMet: 2.339 ± 0.035
2.442LeuAsn: 2.442 ± 0.038
6.313LeuPro: 6.313 ± 0.072
2.561LeuGln: 2.561 ± 0.043
8.11LeuArg: 8.11 ± 0.073
6.709LeuSer: 6.709 ± 0.083
6.42LeuThr: 6.42 ± 0.126
7.804LeuVal: 7.804 ± 0.075
1.241LeuTrp: 1.241 ± 0.032
2.052LeuTyr: 2.052 ± 0.037
0.0LeuXaa: 0.0 ± 0.0
Met
3.401MetAla: 3.401 ± 0.042
0.134MetCys: 0.134 ± 0.009
1.242MetAsp: 1.242 ± 0.026
1.24MetGlu: 1.24 ± 0.025
0.594MetPhe: 0.594 ± 0.019
1.835MetGly: 1.835 ± 0.033
0.386MetHis: 0.386 ± 0.015
1.128MetIle: 1.128 ± 0.027
0.785MetLys: 0.785 ± 0.021
2.521MetLeu: 2.521 ± 0.041
0.653MetMet: 0.653 ± 0.023
0.632MetAsn: 0.632 ± 0.02
1.59MetPro: 1.59 ± 0.027
0.718MetGln: 0.718 ± 0.02
1.701MetArg: 1.701 ± 0.031
1.454MetSer: 1.454 ± 0.029
1.962MetThr: 1.962 ± 0.03
1.869MetVal: 1.869 ± 0.032
0.18MetTrp: 0.18 ± 0.01
0.254MetTyr: 0.254 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
2.961AsnAla: 2.961 ± 0.048
0.217AsnCys: 0.217 ± 0.011
1.348AsnAsp: 1.348 ± 0.059
1.031AsnGlu: 1.031 ± 0.023
0.722AsnPhe: 0.722 ± 0.018
2.288AsnGly: 2.288 ± 0.042
0.476AsnHis: 0.476 ± 0.018
1.047AsnIle: 1.047 ± 0.024
0.55AsnLys: 0.55 ± 0.018
2.518AsnLeu: 2.518 ± 0.044
0.454AsnMet: 0.454 ± 0.018
0.597AsnAsn: 0.597 ± 0.02
1.74AsnPro: 1.74 ± 0.038
0.656AsnGln: 0.656 ± 0.022
1.936AsnArg: 1.936 ± 0.035
1.165AsnSer: 1.165 ± 0.032
1.149AsnThr: 1.149 ± 0.05
1.6AsnVal: 1.6 ± 0.035
0.329AsnTrp: 0.329 ± 0.013
0.498AsnTyr: 0.498 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
7.912ProAla: 7.912 ± 0.094
0.372ProCys: 0.372 ± 0.014
4.185ProAsp: 4.185 ± 0.052
3.466ProGlu: 3.466 ± 0.047
1.903ProPhe: 1.903 ± 0.034
4.996ProGly: 4.996 ± 0.062
1.041ProHis: 1.041 ± 0.023
2.017ProIle: 2.017 ± 0.035
1.51ProLys: 1.51 ± 0.031
5.392ProLeu: 5.392 ± 0.063
1.304ProMet: 1.304 ± 0.025
1.298ProAsn: 1.298 ± 0.029
3.635ProPro: 3.635 ± 0.076
1.691ProGln: 1.691 ± 0.036
3.2ProArg: 3.2 ± 0.044
2.875ProSer: 2.875 ± 0.044
2.803ProThr: 2.803 ± 0.045
4.853ProVal: 4.853 ± 0.058
0.76ProTrp: 0.76 ± 0.02
1.111ProTyr: 1.111 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
4.216GlnAla: 4.216 ± 0.046
0.187GlnCys: 0.187 ± 0.009
1.438GlnAsp: 1.438 ± 0.027
1.513GlnGlu: 1.513 ± 0.027
0.844GlnPhe: 0.844 ± 0.024
2.495GlnGly: 2.495 ± 0.04
0.587GlnHis: 0.587 ± 0.017
1.446GlnIle: 1.446 ± 0.031
0.855GlnLys: 0.855 ± 0.024
2.683GlnLeu: 2.683 ± 0.042
0.784GlnMet: 0.784 ± 0.021
0.68GlnAsn: 0.68 ± 0.021
1.936GlnPro: 1.936 ± 0.039
1.183GlnGln: 1.183 ± 0.038
2.422GlnArg: 2.422 ± 0.04
1.693GlnSer: 1.693 ± 0.031
1.761GlnThr: 1.761 ± 0.034
2.174GlnVal: 2.174 ± 0.037
0.348GlnTrp: 0.348 ± 0.013
0.527GlnTyr: 0.527 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
8.613ArgAla: 8.613 ± 0.089
0.654ArgCys: 0.654 ± 0.019
4.465ArgAsp: 4.465 ± 0.063
4.019ArgGlu: 4.019 ± 0.05
3.094ArgPhe: 3.094 ± 0.046
5.16ArgGly: 5.16 ± 0.05
1.909ArgHis: 1.909 ± 0.036
3.996ArgIle: 3.996 ± 0.054
2.108ArgLys: 2.108 ± 0.038
9.315ArgLeu: 9.315 ± 0.09
2.083ArgMet: 2.083 ± 0.035
1.866ArgAsn: 1.866 ± 0.034
4.354ArgPro: 4.354 ± 0.053
2.704ArgGln: 2.704 ± 0.035
7.107ArgArg: 7.107 ± 0.093
3.952ArgSer: 3.952 ± 0.055
3.665ArgThr: 3.665 ± 0.045
5.302ArgVal: 5.302 ± 0.06
1.123ArgTrp: 1.123 ± 0.026
1.642ArgTyr: 1.642 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
6.506SerAla: 6.506 ± 0.084
0.455SerCys: 0.455 ± 0.018
2.77SerAsp: 2.77 ± 0.036
2.19SerGlu: 2.19 ± 0.039
2.028SerPhe: 2.028 ± 0.036
5.558SerGly: 5.558 ± 0.113
1.06SerHis: 1.06 ± 0.025
2.401SerIle: 2.401 ± 0.037
1.267SerLys: 1.267 ± 0.03
5.444SerLeu: 5.444 ± 0.055
1.275SerMet: 1.275 ± 0.026
1.205SerAsn: 1.205 ± 0.027
2.835SerPro: 2.835 ± 0.043
1.442SerGln: 1.442 ± 0.03
3.544SerArg: 3.544 ± 0.049
2.76SerSer: 2.76 ± 0.061
2.608SerThr: 2.608 ± 0.049
3.937SerVal: 3.937 ± 0.05
0.758SerTrp: 0.758 ± 0.022
1.166SerTyr: 1.166 ± 0.028
0.0SerXaa: 0.0 ± 0.0
Thr
7.255ThrAla: 7.255 ± 0.09
0.394ThrCys: 0.394 ± 0.016
3.179ThrAsp: 3.179 ± 0.078
2.614ThrGlu: 2.614 ± 0.036
1.532ThrPhe: 1.532 ± 0.031
5.545ThrGly: 5.545 ± 0.102
0.958ThrHis: 0.958 ± 0.019
2.658ThrIle: 2.658 ± 0.063
1.265ThrLys: 1.265 ± 0.029
6.474ThrLeu: 6.474 ± 0.159
1.251ThrMet: 1.251 ± 0.024
1.207ThrAsn: 1.207 ± 0.031
3.489ThrPro: 3.489 ± 0.057
1.374ThrGln: 1.374 ± 0.029
3.404ThrArg: 3.404 ± 0.043
2.41ThrSer: 2.41 ± 0.049
2.803ThrThr: 2.803 ± 0.067
5.303ThrVal: 5.303 ± 0.091
0.505ThrTrp: 0.505 ± 0.016
1.025ThrTyr: 1.025 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
9.937ValAla: 9.937 ± 0.08
0.684ValCys: 0.684 ± 0.018
4.186ValAsp: 4.186 ± 0.053
4.902ValGlu: 4.902 ± 0.051
2.432ValPhe: 2.432 ± 0.04
5.995ValGly: 5.995 ± 0.066
1.318ValHis: 1.318 ± 0.026
3.474ValIle: 3.474 ± 0.05
2.176ValLys: 2.176 ± 0.039
7.816ValLeu: 7.816 ± 0.067
1.849ValMet: 1.849 ± 0.037
1.904ValAsn: 1.904 ± 0.038
4.199ValPro: 4.199 ± 0.053
2.195ValGln: 2.195 ± 0.038
5.429ValArg: 5.429 ± 0.061
4.017ValSer: 4.017 ± 0.059
4.871ValThr: 4.871 ± 0.091
6.306ValVal: 6.306 ± 0.069
0.973ValTrp: 0.973 ± 0.022
1.407ValTyr: 1.407 ± 0.026
0.0ValXaa: 0.0 ± 0.0
Trp
1.254TrpAla: 1.254 ± 0.026
0.132TrpCys: 0.132 ± 0.009
0.645TrpAsp: 0.645 ± 0.021
0.57TrpGlu: 0.57 ± 0.017
0.468TrpPhe: 0.468 ± 0.019
0.898TrpGly: 0.898 ± 0.024
0.285TrpHis: 0.285 ± 0.014
0.603TrpIle: 0.603 ± 0.018
0.386TrpLys: 0.386 ± 0.015
1.628TrpLeu: 1.628 ± 0.037
0.363TrpMet: 0.363 ± 0.013
0.375TrpAsn: 0.375 ± 0.014
0.696TrpPro: 0.696 ± 0.02
0.519TrpGln: 0.519 ± 0.017
1.177TrpArg: 1.177 ± 0.029
0.743TrpSer: 0.743 ± 0.019
0.843TrpThr: 0.843 ± 0.023
0.804TrpVal: 0.804 ± 0.02
0.212TrpTrp: 0.212 ± 0.012
0.286TrpTyr: 0.286 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.267TyrAla: 2.267 ± 0.04
0.218TyrCys: 0.218 ± 0.011
1.334TyrAsp: 1.334 ± 0.03
1.047TyrGlu: 1.047 ± 0.028
0.714TyrPhe: 0.714 ± 0.019
2.01TyrGly: 2.01 ± 0.039
0.428TyrHis: 0.428 ± 0.015
0.764TyrIle: 0.764 ± 0.022
0.517TyrLys: 0.517 ± 0.019
2.035TyrLeu: 2.035 ± 0.033
0.372TyrMet: 0.372 ± 0.015
0.51TyrAsn: 0.51 ± 0.018
1.019TyrPro: 1.019 ± 0.026
0.611TyrGln: 0.611 ± 0.02
1.809TyrArg: 1.809 ± 0.031
1.004TyrSer: 1.004 ± 0.024
1.042TyrThr: 1.042 ± 0.029
1.432TyrVal: 1.432 ± 0.027
0.325TyrTrp: 0.325 ± 0.014
0.475TyrTyr: 0.475 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6062 proteins (1974684 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski