Amino acid dipepetide frequency for Nitrosomonas sp. (strain Is79A3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.32AlaAla: 9.32 ± 0.14
1.0AlaCys: 1.0 ± 0.033
4.683AlaAsp: 4.683 ± 0.064
5.641AlaGlu: 5.641 ± 0.083
3.27AlaPhe: 3.27 ± 0.056
6.561AlaGly: 6.561 ± 0.106
1.986AlaHis: 1.986 ± 0.049
6.456AlaIle: 6.456 ± 0.088
4.453AlaLys: 4.453 ± 0.09
9.673AlaLeu: 9.673 ± 0.131
2.378AlaMet: 2.378 ± 0.05
3.427AlaAsn: 3.427 ± 0.068
3.026AlaPro: 3.026 ± 0.069
3.862AlaGln: 3.862 ± 0.075
4.766AlaArg: 4.766 ± 0.086
5.241AlaSer: 5.241 ± 0.085
4.546AlaThr: 4.546 ± 0.083
5.892AlaVal: 5.892 ± 0.09
1.136AlaTrp: 1.136 ± 0.043
2.383AlaTyr: 2.383 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.905CysAla: 0.905 ± 0.034
0.167CysCys: 0.167 ± 0.013
0.586CysAsp: 0.586 ± 0.025
0.568CysGlu: 0.568 ± 0.025
0.431CysPhe: 0.431 ± 0.025
0.878CysGly: 0.878 ± 0.034
0.372CysHis: 0.372 ± 0.024
0.634CysIle: 0.634 ± 0.026
0.446CysLys: 0.446 ± 0.025
0.958CysLeu: 0.958 ± 0.031
0.244CysMet: 0.244 ± 0.016
0.418CysAsn: 0.418 ± 0.024
0.451CysPro: 0.451 ± 0.022
0.376CysGln: 0.376 ± 0.018
0.547CysArg: 0.547 ± 0.022
0.54CysSer: 0.54 ± 0.027
0.501CysThr: 0.501 ± 0.027
0.658CysVal: 0.658 ± 0.025
0.125CysTrp: 0.125 ± 0.011
0.321CysTyr: 0.321 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
4.524AspAla: 4.524 ± 0.068
0.541AspCys: 0.541 ± 0.024
2.75AspAsp: 2.75 ± 0.061
3.532AspGlu: 3.532 ± 0.071
2.35AspPhe: 2.35 ± 0.047
3.67AspGly: 3.67 ± 0.068
1.292AspHis: 1.292 ± 0.042
3.662AspIle: 3.662 ± 0.066
2.853AspLys: 2.853 ± 0.058
5.214AspLeu: 5.214 ± 0.077
1.257AspMet: 1.257 ± 0.036
2.117AspAsn: 2.117 ± 0.046
2.461AspPro: 2.461 ± 0.05
2.141AspGln: 2.141 ± 0.05
2.657AspArg: 2.657 ± 0.055
3.239AspSer: 3.239 ± 0.058
2.884AspThr: 2.884 ± 0.06
3.222AspVal: 3.222 ± 0.061
0.897AspTrp: 0.897 ± 0.031
1.884AspTyr: 1.884 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
5.191GluAla: 5.191 ± 0.086
0.506GluCys: 0.506 ± 0.025
2.653GluAsp: 2.653 ± 0.059
3.41GluGlu: 3.41 ± 0.074
2.381GluPhe: 2.381 ± 0.054
3.164GluGly: 3.164 ± 0.052
1.411GluHis: 1.411 ± 0.036
5.045GluIle: 5.045 ± 0.077
3.973GluLys: 3.973 ± 0.081
6.358GluLeu: 6.358 ± 0.087
1.605GluMet: 1.605 ± 0.043
2.786GluAsn: 2.786 ± 0.047
2.043GluPro: 2.043 ± 0.051
3.082GluGln: 3.082 ± 0.063
3.586GluArg: 3.586 ± 0.073
3.725GluSer: 3.725 ± 0.068
3.311GluThr: 3.311 ± 0.059
3.575GluVal: 3.575 ± 0.061
0.785GluTrp: 0.785 ± 0.029
1.604GluTyr: 1.604 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
3.42PheAla: 3.42 ± 0.059
0.484PheCys: 0.484 ± 0.024
2.623PheAsp: 2.623 ± 0.05
2.327PheGlu: 2.327 ± 0.044
1.823PhePhe: 1.823 ± 0.047
2.955PheGly: 2.955 ± 0.062
0.851PheHis: 0.851 ± 0.031
2.69PheIle: 2.69 ± 0.06
1.723PheLys: 1.723 ± 0.037
3.844PheLeu: 3.844 ± 0.067
0.97PheMet: 0.97 ± 0.03
1.863PheAsn: 1.863 ± 0.045
1.708PhePro: 1.708 ± 0.046
1.334PheGln: 1.334 ± 0.041
1.911PheArg: 1.911 ± 0.047
3.14PheSer: 3.14 ± 0.061
2.287PheThr: 2.287 ± 0.056
2.539PheVal: 2.539 ± 0.055
0.58PheTrp: 0.58 ± 0.027
1.289PheTyr: 1.289 ± 0.035
0.0PheXaa: 0.0 ± 0.0
Gly
5.346GlyAla: 5.346 ± 0.087
0.872GlyCys: 0.872 ± 0.032
3.462GlyAsp: 3.462 ± 0.062
3.849GlyGlu: 3.849 ± 0.059
3.185GlyPhe: 3.185 ± 0.056
5.09GlyGly: 5.09 ± 0.116
1.733GlyHis: 1.733 ± 0.04
5.284GlyIle: 5.284 ± 0.079
4.219GlyLys: 4.219 ± 0.07
6.824GlyLeu: 6.824 ± 0.087
1.935GlyMet: 1.935 ± 0.05
2.962GlyAsn: 2.962 ± 0.065
1.746GlyPro: 1.746 ± 0.046
2.485GlyGln: 2.485 ± 0.056
3.411GlyArg: 3.411 ± 0.067
4.223GlySer: 4.223 ± 0.077
3.716GlyThr: 3.716 ± 0.077
4.579GlyVal: 4.579 ± 0.077
1.049GlyTrp: 1.049 ± 0.037
2.385GlyTyr: 2.385 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
2.198HisAla: 2.198 ± 0.054
0.336HisCys: 0.336 ± 0.019
1.275HisAsp: 1.275 ± 0.039
1.33HisGlu: 1.33 ± 0.035
1.133HisPhe: 1.133 ± 0.036
1.775HisGly: 1.775 ± 0.049
0.779HisHis: 0.779 ± 0.028
1.522HisIle: 1.522 ± 0.039
1.01HisLys: 1.01 ± 0.033
2.447HisLeu: 2.447 ± 0.05
0.507HisMet: 0.507 ± 0.022
0.902HisAsn: 0.902 ± 0.029
1.318HisPro: 1.318 ± 0.041
1.122HisGln: 1.122 ± 0.035
1.208HisArg: 1.208 ± 0.034
1.364HisSer: 1.364 ± 0.038
1.235HisThr: 1.235 ± 0.04
1.441HisVal: 1.441 ± 0.038
0.364HisTrp: 0.364 ± 0.018
0.882HisTyr: 0.882 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
6.952IleAla: 6.952 ± 0.094
0.712IleCys: 0.712 ± 0.026
4.274IleAsp: 4.274 ± 0.078
4.751IleGlu: 4.751 ± 0.076
2.486IlePhe: 2.486 ± 0.057
4.965IleGly: 4.965 ± 0.078
1.62IleHis: 1.62 ± 0.038
4.298IleIle: 4.298 ± 0.075
3.729IleLys: 3.729 ± 0.076
6.502IleLeu: 6.502 ± 0.097
1.372IleMet: 1.372 ± 0.038
3.211IleAsn: 3.211 ± 0.06
3.309IlePro: 3.309 ± 0.055
2.584IleGln: 2.584 ± 0.053
3.543IleArg: 3.543 ± 0.061
4.63IleSer: 4.63 ± 0.086
4.271IleThr: 4.271 ± 0.079
4.207IleVal: 4.207 ± 0.068
0.701IleTrp: 0.701 ± 0.03
1.897IleTyr: 1.897 ± 0.045
0.0IleXaa: 0.0 ± 0.0
Lys
4.095LysAla: 4.095 ± 0.081
0.356LysCys: 0.356 ± 0.019
2.526LysAsp: 2.526 ± 0.055
3.254LysGlu: 3.254 ± 0.061
1.664LysPhe: 1.664 ± 0.042
2.767LysGly: 2.767 ± 0.053
1.218LysHis: 1.218 ± 0.037
3.876LysIle: 3.876 ± 0.064
3.149LysLys: 3.149 ± 0.07
5.489LysLeu: 5.489 ± 0.081
1.314LysMet: 1.314 ± 0.041
2.598LysAsn: 2.598 ± 0.059
2.404LysPro: 2.404 ± 0.048
2.813LysGln: 2.813 ± 0.064
2.982LysArg: 2.982 ± 0.067
3.148LysSer: 3.148 ± 0.062
2.945LysThr: 2.945 ± 0.05
2.918LysVal: 2.918 ± 0.062
0.57LysTrp: 0.57 ± 0.027
1.277LysTyr: 1.277 ± 0.037
0.0LysXaa: 0.0 ± 0.0
Leu
9.88LeuAla: 9.88 ± 0.124
1.05LeuCys: 1.05 ± 0.035
5.567LeuAsp: 5.567 ± 0.073
5.969LeuGlu: 5.969 ± 0.091
4.104LeuPhe: 4.104 ± 0.068
6.668LeuGly: 6.668 ± 0.083
2.42LeuHis: 2.42 ± 0.05
7.047LeuIle: 7.047 ± 0.107
5.281LeuLys: 5.281 ± 0.08
11.4LeuLeu: 11.4 ± 0.15
2.421LeuMet: 2.421 ± 0.049
4.431LeuAsn: 4.431 ± 0.072
5.112LeuPro: 5.112 ± 0.085
4.38LeuGln: 4.38 ± 0.08
5.858LeuArg: 5.858 ± 0.079
7.161LeuSer: 7.161 ± 0.097
5.931LeuThr: 5.931 ± 0.101
6.256LeuVal: 6.256 ± 0.083
1.169LeuTrp: 1.169 ± 0.039
2.554LeuTyr: 2.554 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
2.161MetAla: 2.161 ± 0.058
0.169MetCys: 0.169 ± 0.012
1.169MetAsp: 1.169 ± 0.035
1.301MetGlu: 1.301 ± 0.035
0.744MetPhe: 0.744 ± 0.029
1.555MetGly: 1.555 ± 0.043
0.661MetHis: 0.661 ± 0.027
1.563MetIle: 1.563 ± 0.039
1.388MetLys: 1.388 ± 0.036
2.698MetLeu: 2.698 ± 0.056
0.618MetMet: 0.618 ± 0.027
1.131MetAsn: 1.131 ± 0.032
1.236MetPro: 1.236 ± 0.039
1.238MetGln: 1.238 ± 0.034
1.444MetArg: 1.444 ± 0.037
1.525MetSer: 1.525 ± 0.041
1.421MetThr: 1.421 ± 0.041
1.532MetVal: 1.532 ± 0.045
0.177MetTrp: 0.177 ± 0.014
0.419MetTyr: 0.419 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
3.535AsnAla: 3.535 ± 0.067
0.403AsnCys: 0.403 ± 0.021
2.28AsnAsp: 2.28 ± 0.059
2.411AsnGlu: 2.411 ± 0.053
1.72AsnPhe: 1.72 ± 0.042
2.933AsnGly: 2.933 ± 0.067
0.989AsnHis: 0.989 ± 0.033
2.914AsnIle: 2.914 ± 0.061
2.255AsnLys: 2.255 ± 0.053
4.54AsnLeu: 4.54 ± 0.075
0.88AsnMet: 0.88 ± 0.028
1.976AsnAsn: 1.976 ± 0.05
2.548AsnPro: 2.548 ± 0.055
2.058AsnGln: 2.058 ± 0.051
2.287AsnArg: 2.287 ± 0.047
2.439AsnSer: 2.439 ± 0.054
2.295AsnThr: 2.295 ± 0.052
2.422AsnVal: 2.422 ± 0.055
0.628AsnTrp: 0.628 ± 0.026
1.279AsnTyr: 1.279 ± 0.036
0.0AsnXaa: 0.0 ± 0.0
Pro
3.923ProAla: 3.923 ± 0.064
0.372ProCys: 0.372 ± 0.02
2.846ProAsp: 2.846 ± 0.054
3.486ProGlu: 3.486 ± 0.058
1.783ProPhe: 1.783 ± 0.04
3.192ProGly: 3.192 ± 0.062
0.972ProHis: 0.972 ± 0.034
2.767ProIle: 2.767 ± 0.055
1.818ProLys: 1.818 ± 0.042
4.081ProLeu: 4.081 ± 0.06
0.992ProMet: 0.992 ± 0.032
1.688ProAsn: 1.688 ± 0.038
1.848ProPro: 1.848 ± 0.055
1.721ProGln: 1.721 ± 0.041
1.717ProArg: 1.717 ± 0.049
2.416ProSer: 2.416 ± 0.046
1.898ProThr: 1.898 ± 0.045
3.376ProVal: 3.376 ± 0.066
0.598ProTrp: 0.598 ± 0.025
1.266ProTyr: 1.266 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
4.252GlnAla: 4.252 ± 0.068
0.372GlnCys: 0.372 ± 0.019
1.819GlnAsp: 1.819 ± 0.044
2.414GlnGlu: 2.414 ± 0.06
1.668GlnPhe: 1.668 ± 0.035
2.607GlnGly: 2.607 ± 0.054
1.233GlnHis: 1.233 ± 0.037
3.094GlnIle: 3.094 ± 0.049
2.264GlnLys: 2.264 ± 0.054
4.791GlnLeu: 4.791 ± 0.068
1.004GlnMet: 1.004 ± 0.032
1.741GlnAsn: 1.741 ± 0.045
1.805GlnPro: 1.805 ± 0.046
2.487GlnGln: 2.487 ± 0.059
2.556GlnArg: 2.556 ± 0.055
2.625GlnSer: 2.625 ± 0.06
2.292GlnThr: 2.292 ± 0.049
2.689GlnVal: 2.689 ± 0.055
0.614GlnTrp: 0.614 ± 0.023
1.176GlnTyr: 1.176 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
4.166ArgAla: 4.166 ± 0.074
0.484ArgCys: 0.484 ± 0.024
2.707ArgAsp: 2.707 ± 0.056
3.389ArgGlu: 3.389 ± 0.059
2.407ArgPhe: 2.407 ± 0.047
3.089ArgGly: 3.089 ± 0.058
1.332ArgHis: 1.332 ± 0.044
4.206ArgIle: 4.206 ± 0.067
2.722ArgLys: 2.722 ± 0.059
5.904ArgLeu: 5.904 ± 0.09
1.409ArgMet: 1.409 ± 0.036
2.501ArgAsn: 2.501 ± 0.051
1.892ArgPro: 1.892 ± 0.048
2.333ArgGln: 2.333 ± 0.056
3.119ArgArg: 3.119 ± 0.059
2.935ArgSer: 2.935 ± 0.052
2.533ArgThr: 2.533 ± 0.057
3.241ArgVal: 3.241 ± 0.066
0.828ArgTrp: 0.828 ± 0.032
1.952ArgTyr: 1.952 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
5.551SerAla: 5.551 ± 0.083
0.66SerCys: 0.66 ± 0.028
3.41SerAsp: 3.41 ± 0.066
3.513SerGlu: 3.513 ± 0.074
2.613SerPhe: 2.613 ± 0.053
5.36SerGly: 5.36 ± 0.089
1.48SerHis: 1.48 ± 0.042
4.256SerIle: 4.256 ± 0.07
2.888SerLys: 2.888 ± 0.065
6.235SerLeu: 6.235 ± 0.081
1.539SerMet: 1.539 ± 0.041
2.737SerAsn: 2.737 ± 0.06
2.58SerPro: 2.58 ± 0.051
2.539SerGln: 2.539 ± 0.052
3.029SerArg: 3.029 ± 0.053
4.074SerSer: 4.074 ± 0.085
3.236SerThr: 3.236 ± 0.061
4.028SerVal: 4.028 ± 0.068
0.757SerTrp: 0.757 ± 0.023
1.719SerTyr: 1.719 ± 0.042
0.0SerXaa: 0.0 ± 0.0
Thr
4.959ThrAla: 4.959 ± 0.077
0.501ThrCys: 0.501 ± 0.022
2.904ThrAsp: 2.904 ± 0.06
3.174ThrGlu: 3.174 ± 0.06
2.046ThrPhe: 2.046 ± 0.047
4.538ThrGly: 4.538 ± 0.092
1.342ThrHis: 1.342 ± 0.038
3.546ThrIle: 3.546 ± 0.065
2.196ThrLys: 2.196 ± 0.049
6.111ThrLeu: 6.111 ± 0.1
1.08ThrMet: 1.08 ± 0.034
2.135ThrAsn: 2.135 ± 0.052
2.82ThrPro: 2.82 ± 0.056
2.375ThrGln: 2.375 ± 0.05
2.635ThrArg: 2.635 ± 0.05
3.149ThrSer: 3.149 ± 0.068
2.992ThrThr: 2.992 ± 0.072
3.71ThrVal: 3.71 ± 0.065
0.65ThrTrp: 0.65 ± 0.029
1.508ThrTyr: 1.508 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
5.82ValAla: 5.82 ± 0.086
0.647ValCys: 0.647 ± 0.027
3.456ValAsp: 3.456 ± 0.063
3.654ValGlu: 3.654 ± 0.065
2.601ValPhe: 2.601 ± 0.055
3.806ValGly: 3.806 ± 0.071
1.327ValHis: 1.327 ± 0.038
4.746ValIle: 4.746 ± 0.066
3.223ValLys: 3.223 ± 0.06
6.656ValLeu: 6.656 ± 0.112
1.785ValMet: 1.785 ± 0.041
2.632ValAsn: 2.632 ± 0.05
2.593ValPro: 2.593 ± 0.055
2.258ValGln: 2.258 ± 0.053
3.169ValArg: 3.169 ± 0.061
4.104ValSer: 4.104 ± 0.071
3.976ValThr: 3.976 ± 0.067
4.404ValVal: 4.404 ± 0.081
0.753ValTrp: 0.753 ± 0.027
1.633ValTyr: 1.633 ± 0.044
0.0ValXaa: 0.0 ± 0.0
Trp
0.877TrpAla: 0.877 ± 0.033
0.155TrpCys: 0.155 ± 0.013
0.643TrpAsp: 0.643 ± 0.027
0.644TrpGlu: 0.644 ± 0.025
0.564TrpPhe: 0.564 ± 0.022
0.778TrpGly: 0.778 ± 0.031
0.357TrpHis: 0.357 ± 0.018
0.913TrpIle: 0.913 ± 0.032
0.636TrpLys: 0.636 ± 0.023
1.738TrpLeu: 1.738 ± 0.049
0.312TrpMet: 0.312 ± 0.018
0.566TrpAsn: 0.566 ± 0.026
0.48TrpPro: 0.48 ± 0.023
0.796TrpGln: 0.796 ± 0.032
0.835TrpArg: 0.835 ± 0.028
0.739TrpSer: 0.739 ± 0.033
0.562TrpThr: 0.562 ± 0.028
0.868TrpVal: 0.868 ± 0.027
0.206TrpTrp: 0.206 ± 0.016
0.345TrpTyr: 0.345 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.628TyrAla: 2.628 ± 0.055
0.34TyrCys: 0.34 ± 0.019
1.518TyrAsp: 1.518 ± 0.046
1.577TyrGlu: 1.577 ± 0.042
1.384TyrPhe: 1.384 ± 0.034
2.004TyrGly: 2.004 ± 0.049
0.772TyrHis: 0.772 ± 0.032
1.496TyrIle: 1.496 ± 0.047
1.134TyrLys: 1.134 ± 0.036
3.2TyrLeu: 3.2 ± 0.066
0.524TyrMet: 0.524 ± 0.023
0.974TyrAsn: 0.974 ± 0.035
1.378TyrPro: 1.378 ± 0.041
1.52TyrGln: 1.52 ± 0.043
1.89TyrArg: 1.89 ± 0.047
1.787TyrSer: 1.787 ± 0.039
1.539TyrThr: 1.539 ± 0.041
1.686TyrVal: 1.686 ± 0.033
0.424TyrTrp: 0.424 ± 0.02
0.972TyrTyr: 0.972 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3273 proteins (1023771 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski