Amino acid dipeptide frequency for Lachnospiraceae bacterium RM5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
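As an illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter sequence strings and a sliding window of width 2 that never spans two proteins; the function name and the toy sequences are hypothetical, and the exact counting procedure used by Proteome-pI is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence; a protein of
        # length n contributes n - 1 dipeptides.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

Because the frequencies are normalized over all counted dipeptides, they sum to 1000‰.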

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
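The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and treating the uniform expectation as the threshold for over- and underrepresentation is an assumption drawn from the paragraph above; the three observed values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 20 standard amino acids: 1000 / 400
observed = {"LysLys": 9.34, "AlaPhe": 2.434, "TrpTrp": 0.09}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
print(expected, labels)
```

With Xaa included, the alphabet has 21 letters and the uniform expectation drops to 1000 / 441, about 2.27‰.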

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
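A minimal sketch of the unit conversion, using hypothetical helper names: dividing a per-mille value by 1000 gives a fraction, and dividing by 10 gives a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage."""
    return value / 10.0


# AlaAla is listed as 3.873 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(3.873)
ala_ala_percent = per_mille_to_percent(3.873)
print(ala_ala_fraction, ala_ala_percent)
```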

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.873 ± 0.115
AlaCys: 0.745 ± 0.034
AlaAsp: 3.828 ± 0.085
AlaGlu: 3.51 ± 0.077
AlaPhe: 2.434 ± 0.063
AlaGly: 4.295 ± 0.092
AlaHis: 0.827 ± 0.034
AlaIle: 5.103 ± 0.096
AlaLys: 5.231 ± 0.102
AlaLeu: 5.161 ± 0.093
AlaMet: 1.703 ± 0.047
AlaAsn: 2.874 ± 0.063
AlaPro: 1.35 ± 0.046
AlaGln: 1.056 ± 0.043
AlaArg: 2.077 ± 0.056
AlaSer: 3.354 ± 0.083
AlaThr: 3.311 ± 0.102
AlaVal: 4.069 ± 0.091
AlaTrp: 0.415 ± 0.025
AlaTyr: 2.619 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.708 ± 0.033
CysCys: 0.16 ± 0.016
CysAsp: 0.861 ± 0.036
CysGlu: 0.838 ± 0.038
CysPhe: 0.678 ± 0.034
CysGly: 1.162 ± 0.047
CysHis: 0.238 ± 0.02
CysIle: 1.219 ± 0.044
CysLys: 0.966 ± 0.038
CysLeu: 0.947 ± 0.035
CysMet: 0.4 ± 0.026
CysAsn: 0.749 ± 0.033
CysPro: 0.434 ± 0.025
CysGln: 0.231 ± 0.016
CysArg: 0.391 ± 0.025
CysSer: 0.792 ± 0.034
CysThr: 0.559 ± 0.03
CysVal: 0.815 ± 0.038
CysTrp: 0.076 ± 0.012
CysTyr: 0.559 ± 0.025
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.774 ± 0.075
AspCys: 0.712 ± 0.037
AspAsp: 4.668 ± 0.115
AspGlu: 6.566 ± 0.124
AspPhe: 3.302 ± 0.08
AspGly: 4.524 ± 0.104
AspHis: 0.639 ± 0.028
AspIle: 6.616 ± 0.1
AspLys: 5.782 ± 0.099
AspLeu: 4.897 ± 0.083
AspMet: 1.887 ± 0.056
AspAsn: 4.123 ± 0.094
AspPro: 1.312 ± 0.047
AspGln: 0.764 ± 0.033
AspArg: 2.003 ± 0.059
AspSer: 3.817 ± 0.087
AspThr: 3.245 ± 0.079
AspVal: 4.615 ± 0.097
AspTrp: 0.458 ± 0.029
AspTyr: 3.739 ± 0.075
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.347 ± 0.104
GluCys: 0.88 ± 0.038
GluAsp: 4.801 ± 0.102
GluGlu: 7.318 ± 0.121
GluPhe: 3.188 ± 0.071
GluGly: 3.946 ± 0.086
GluHis: 1.014 ± 0.05
GluIle: 6.934 ± 0.115
GluLys: 8.355 ± 0.133
GluLeu: 6.424 ± 0.105
GluMet: 2.295 ± 0.059
GluAsn: 6.052 ± 0.107
GluPro: 1.291 ± 0.046
GluGln: 1.605 ± 0.052
GluArg: 2.692 ± 0.077
GluSer: 3.937 ± 0.09
GluThr: 3.448 ± 0.083
GluVal: 4.592 ± 0.095
GluTrp: 0.539 ± 0.028
GluTyr: 3.959 ± 0.082
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.448 ± 0.06
PheCys: 0.628 ± 0.033
PheAsp: 3.226 ± 0.072
PheGlu: 3.211 ± 0.073
PhePhe: 2.04 ± 0.066
PheGly: 2.792 ± 0.067
PheHis: 0.532 ± 0.029
PheIle: 3.844 ± 0.095
PheLys: 3.214 ± 0.066
PheLeu: 3.659 ± 0.092
PheMet: 1.136 ± 0.035
PheAsn: 2.51 ± 0.067
PhePro: 1.064 ± 0.037
PheGln: 0.689 ± 0.03
PheArg: 1.395 ± 0.048
PheSer: 3.086 ± 0.078
PheThr: 2.075 ± 0.062
PheVal: 3.075 ± 0.077
PheTrp: 0.264 ± 0.02
PheTyr: 2.063 ± 0.06
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.587 ± 0.089
GlyCys: 1.043 ± 0.042
GlyAsp: 3.53 ± 0.083
GlyGlu: 4.317 ± 0.088
GlyPhe: 2.967 ± 0.063
GlyGly: 4.016 ± 0.112
GlyHis: 1.01 ± 0.046
GlyIle: 6.255 ± 0.11
GlyLys: 5.939 ± 0.106
GlyLeu: 4.841 ± 0.093
GlyMet: 1.873 ± 0.066
GlyAsn: 3.742 ± 0.12
GlyPro: 0.984 ± 0.041
GlyGln: 1.252 ± 0.04
GlyArg: 2.311 ± 0.062
GlySer: 3.745 ± 0.097
GlyThr: 3.596 ± 0.108
GlyVal: 4.398 ± 0.089
GlyTrp: 0.768 ± 0.056
GlyTyr: 3.504 ± 0.082
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.718 ± 0.028
HisCys: 0.181 ± 0.017
HisAsp: 0.794 ± 0.04
HisGlu: 0.913 ± 0.045
HisPhe: 0.662 ± 0.037
HisGly: 1.0 ± 0.036
HisHis: 0.292 ± 0.033
HisIle: 1.255 ± 0.047
HisLys: 0.946 ± 0.038
HisLeu: 1.125 ± 0.047
HisMet: 0.384 ± 0.023
HisAsn: 0.789 ± 0.033
HisPro: 0.612 ± 0.032
HisGln: 0.281 ± 0.021
HisArg: 0.504 ± 0.031
HisSer: 0.798 ± 0.03
HisThr: 0.712 ± 0.029
HisVal: 0.865 ± 0.04
HisTrp: 0.115 ± 0.013
HisTyr: 0.638 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.298 ± 0.107
IleCys: 1.331 ± 0.048
IleAsp: 6.156 ± 0.11
IleGlu: 6.454 ± 0.111
IlePhe: 3.719 ± 0.086
IleGly: 5.045 ± 0.102
IleHis: 1.176 ± 0.043
IleIle: 7.927 ± 0.152
IleLys: 7.804 ± 0.13
IleLeu: 7.317 ± 0.132
IleMet: 2.166 ± 0.065
IleAsn: 5.539 ± 0.108
IlePro: 2.92 ± 0.064
IleGln: 1.738 ± 0.056
IleArg: 3.149 ± 0.072
IleSer: 6.449 ± 0.113
IleThr: 4.701 ± 0.103
IleVal: 5.678 ± 0.099
IleTrp: 0.549 ± 0.028
IleTyr: 3.961 ± 0.094
IleXaa: 0.001 ± 0.001
Lys
LysAla: 4.871 ± 0.088
LysCys: 0.9 ± 0.04
LysAsp: 6.208 ± 0.112
LysGlu: 8.583 ± 0.134
LysPhe: 2.837 ± 0.07
LysGly: 4.772 ± 0.075
LysHis: 1.11 ± 0.039
LysIle: 7.715 ± 0.14
LysLys: 9.34 ± 0.147
LysLeu: 6.575 ± 0.108
LysMet: 2.397 ± 0.068
LysAsn: 6.725 ± 0.107
LysPro: 1.905 ± 0.053
LysGln: 1.807 ± 0.051
LysArg: 3.325 ± 0.075
LysSer: 4.559 ± 0.089
LysThr: 4.4 ± 0.09
LysVal: 5.33 ± 0.095
LysTrp: 0.569 ± 0.026
LysTyr: 4.602 ± 0.082
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.626 ± 0.101
LeuCys: 1.186 ± 0.04
LeuAsp: 5.503 ± 0.101
LeuGlu: 6.06 ± 0.117
LeuPhe: 3.509 ± 0.091
LeuGly: 4.871 ± 0.088
LeuHis: 1.1 ± 0.043
LeuIle: 6.778 ± 0.116
LeuLys: 7.29 ± 0.128
LeuLeu: 6.688 ± 0.121
LeuMet: 2.201 ± 0.063
LeuAsn: 4.956 ± 0.088
LeuPro: 2.397 ± 0.07
LeuGln: 1.639 ± 0.049
LeuArg: 2.963 ± 0.07
LeuSer: 6.301 ± 0.104
LeuThr: 4.196 ± 0.085
LeuVal: 5.045 ± 0.092
LeuTrp: 0.499 ± 0.027
LeuTyr: 3.45 ± 0.079
LeuXaa: 0.001 ± 0.001
Met
MetAla: 1.857 ± 0.062
MetCys: 0.355 ± 0.023
MetAsp: 1.905 ± 0.058
MetGlu: 2.116 ± 0.067
MetPhe: 1.126 ± 0.041
MetGly: 1.716 ± 0.051
MetHis: 0.415 ± 0.024
MetIle: 2.259 ± 0.055
MetLys: 2.291 ± 0.056
MetLeu: 2.281 ± 0.07
MetMet: 0.758 ± 0.035
MetAsn: 1.645 ± 0.051
MetPro: 1.0 ± 0.039
MetGln: 0.607 ± 0.031
MetArg: 0.905 ± 0.035
MetSer: 1.881 ± 0.059
MetThr: 1.321 ± 0.038
MetVal: 1.701 ± 0.057
MetTrp: 0.14 ± 0.014
MetTyr: 1.05 ± 0.042
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.544 ± 0.09
AsnCys: 0.695 ± 0.036
AsnAsp: 3.941 ± 0.077
AsnGlu: 4.659 ± 0.087
AsnPhe: 2.201 ± 0.053
AsnGly: 4.41 ± 0.117
AsnHis: 0.851 ± 0.037
AsnIle: 6.235 ± 0.097
AsnLys: 5.361 ± 0.099
AsnLeu: 4.787 ± 0.109
AsnMet: 1.672 ± 0.052
AsnAsn: 4.314 ± 0.112
AsnPro: 2.044 ± 0.049
AsnGln: 1.304 ± 0.053
AsnArg: 1.855 ± 0.055
AsnSer: 3.513 ± 0.085
AsnThr: 3.146 ± 0.087
AsnVal: 4.222 ± 0.084
AsnTrp: 0.395 ± 0.027
AsnTyr: 2.841 ± 0.086
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.447 ± 0.054
ProCys: 0.325 ± 0.018
ProAsp: 2.112 ± 0.057
ProGlu: 2.384 ± 0.065
ProPhe: 1.264 ± 0.045
ProGly: 1.705 ± 0.057
ProHis: 0.401 ± 0.026
ProIle: 1.93 ± 0.054
ProLys: 1.805 ± 0.048
ProLeu: 2.112 ± 0.061
ProMet: 0.708 ± 0.037
ProAsn: 1.199 ± 0.045
ProPro: 0.517 ± 0.029
ProGln: 0.547 ± 0.029
ProArg: 0.745 ± 0.036
ProSer: 1.473 ± 0.043
ProThr: 1.315 ± 0.052
ProVal: 2.256 ± 0.062
ProTrp: 0.216 ± 0.019
ProTyr: 1.292 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.301 ± 0.043
GlnCys: 0.196 ± 0.017
GlnAsp: 1.069 ± 0.039
GlnGlu: 1.255 ± 0.043
GlnPhe: 0.834 ± 0.039
GlnGly: 1.183 ± 0.042
GlnHis: 0.259 ± 0.019
GlnIle: 1.924 ± 0.057
GlnLys: 1.871 ± 0.062
GlnLeu: 1.529 ± 0.056
GlnMet: 0.692 ± 0.032
GlnAsn: 1.152 ± 0.044
GlnPro: 0.487 ± 0.027
GlnGln: 0.467 ± 0.029
GlnArg: 0.789 ± 0.037
GlnSer: 1.171 ± 0.038
GlnThr: 1.162 ± 0.044
GlnVal: 1.245 ± 0.045
GlnTrp: 0.168 ± 0.015
GlnTyr: 0.888 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.82 ± 0.055
ArgCys: 0.436 ± 0.025
ArgAsp: 2.12 ± 0.064
ArgGlu: 2.903 ± 0.079
ArgPhe: 1.56 ± 0.047
ArgGly: 1.983 ± 0.065
ArgHis: 0.519 ± 0.029
ArgIle: 3.14 ± 0.076
ArgLys: 3.262 ± 0.073
ArgLeu: 2.96 ± 0.076
ArgMet: 1.032 ± 0.041
ArgAsn: 2.116 ± 0.055
ArgPro: 0.861 ± 0.037
ArgGln: 0.854 ± 0.034
ArgArg: 1.54 ± 0.054
ArgSer: 1.632 ± 0.05
ArgThr: 1.596 ± 0.047
ArgVal: 2.325 ± 0.06
ArgTrp: 0.256 ± 0.02
ArgTyr: 1.695 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.532 ± 0.079
SerCys: 0.659 ± 0.03
SerAsp: 4.632 ± 0.096
SerGlu: 4.45 ± 0.096
SerPhe: 2.954 ± 0.068
SerGly: 5.002 ± 0.119
SerHis: 0.842 ± 0.036
SerIle: 5.062 ± 0.091
SerLys: 5.053 ± 0.082
SerLeu: 5.269 ± 0.101
SerMet: 1.682 ± 0.049
SerAsn: 3.464 ± 0.089
SerPro: 1.476 ± 0.041
SerGln: 1.249 ± 0.048
SerArg: 2.007 ± 0.056
SerSer: 4.195 ± 0.129
SerThr: 3.119 ± 0.09
SerVal: 4.229 ± 0.082
SerTrp: 0.499 ± 0.028
SerTyr: 2.99 ± 0.086
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.086 ± 0.108
ThrCys: 0.544 ± 0.033
ThrAsp: 3.517 ± 0.1
ThrGlu: 3.259 ± 0.08
ThrPhe: 2.189 ± 0.061
ThrGly: 3.974 ± 0.089
ThrHis: 0.811 ± 0.037
ThrIle: 4.391 ± 0.089
ThrLys: 4.138 ± 0.079
ThrLeu: 4.258 ± 0.1
ThrMet: 1.178 ± 0.043
ThrAsn: 2.721 ± 0.078
ThrPro: 1.731 ± 0.057
ThrGln: 1.013 ± 0.042
ThrArg: 1.595 ± 0.054
ThrSer: 3.46 ± 0.097
ThrThr: 3.056 ± 0.117
ThrVal: 3.61 ± 0.109
ThrTrp: 0.358 ± 0.021
ThrTyr: 2.587 ± 0.096
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.076 ± 0.096
ValCys: 1.076 ± 0.041
ValAsp: 4.31 ± 0.074
ValGlu: 4.493 ± 0.091
ValPhe: 2.969 ± 0.081
ValGly: 3.893 ± 0.094
ValHis: 0.785 ± 0.032
ValIle: 5.851 ± 0.11
ValLys: 5.447 ± 0.099
ValLeu: 5.815 ± 0.094
ValMet: 1.752 ± 0.056
ValAsn: 3.847 ± 0.08
ValPro: 1.907 ± 0.055
ValGln: 1.309 ± 0.053
ValArg: 2.226 ± 0.058
ValSer: 4.735 ± 0.092
ValThr: 3.646 ± 0.099
ValVal: 4.747 ± 0.101
ValTrp: 0.426 ± 0.025
ValTyr: 3.076 ± 0.069
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.43 ± 0.024
TrpCys: 0.107 ± 0.013
TrpAsp: 0.47 ± 0.025
TrpGlu: 0.533 ± 0.032
TrpPhe: 0.307 ± 0.022
TrpGly: 0.405 ± 0.027
TrpHis: 0.125 ± 0.015
TrpIle: 0.566 ± 0.03
TrpLys: 0.599 ± 0.028
TrpLeu: 0.649 ± 0.032
TrpMet: 0.234 ± 0.022
TrpAsn: 0.456 ± 0.025
TrpPro: 0.169 ± 0.018
TrpGln: 0.274 ± 0.022
TrpArg: 0.228 ± 0.019
TrpSer: 0.413 ± 0.025
TrpThr: 0.312 ± 0.022
TrpVal: 0.38 ± 0.024
TrpTrp: 0.09 ± 0.011
TrpTyr: 0.358 ± 0.03
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.493 ± 0.055
TyrCys: 0.623 ± 0.034
TyrAsp: 3.792 ± 0.092
TyrGlu: 3.794 ± 0.084
TyrPhe: 2.212 ± 0.059
TyrGly: 2.984 ± 0.075
TyrHis: 0.635 ± 0.034
TyrIle: 4.072 ± 0.087
TyrLys: 3.835 ± 0.075
TyrLeu: 3.966 ± 0.093
TyrMet: 1.182 ± 0.046
TyrAsn: 3.019 ± 0.085
TyrPro: 1.292 ± 0.048
TyrGln: 0.957 ± 0.036
TyrArg: 1.905 ± 0.057
TyrSer: 3.055 ± 0.077
TyrThr: 2.56 ± 0.083
TyrVal: 3.192 ± 0.066
TyrTrp: 0.304 ± 0.022
TyrTyr: 2.669 ± 0.084
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.04 ± 0.022
Statistics based on 1991 proteins (697987 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
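One way to read "bootstrapping (x100) at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Taking the sample standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical one-letter sequences).
toy = ["MKKLA", "AKK", "GGKKA", "MLLK"]
error = bootstrap_error(toy, "KK")
print(f"KK: {dipeptide_per_mille(toy, 'KK'):.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute correspondingly larger errors.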

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski