Amino acid dipeptide frequency for Beijerinckiaceae bacterium RH AL1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
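
A minimal sketch of how such frequencies could be computed is shown below. It is not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify`, the toy sequences, the use of one-letter codes, and the choice to count overlapping adjacent pairs are all assumptions made for illustration. Each frequency is compared against the uniform expectation of 1/400 (2.5‰) described above.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
# Uniform expectation for 400 dipeptides, expressed in per mille.
UNIFORM_PERMILLE = 1000.0 / (len(STANDARD) ** 2)  # 2.5


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy proteome: pairs are AA, AA, AL from the first
# sequence and AL, LA from the second.
toy = ["AAAL", "ALA"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(21.813)` (AlaAla) gives `"over"` and `classify(0.126)` (CysCys) gives `"under"`.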

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
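
As a small worked example of this unit conversion, the sketch below converts the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


# AlaAla is listed as 21.813 per mille in the table.
ala_ala = permille_to_fraction(21.813)   # about 0.021813, i.e. about 2.18%
uniform = permille_to_fraction(2.5)      # 1/400, i.e. 0.25%

# Ratio of observed to uniformly expected frequency (roughly 8.7 for AlaAla).
enrichment = ala_ala / uniform
```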

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
21.813AlaAla: 21.813 ± 0.241
1.283AlaCys: 1.283 ± 0.036
7.576AlaAsp: 7.576 ± 0.092
7.662AlaGlu: 7.662 ± 0.087
4.997AlaPhe: 4.997 ± 0.075
11.263AlaGly: 11.263 ± 0.118
2.769AlaHis: 2.769 ± 0.049
6.733AlaIle: 6.733 ± 0.086
4.571AlaLys: 4.571 ± 0.074
14.848AlaLeu: 14.848 ± 0.124
3.677AlaMet: 3.677 ± 0.064
2.713AlaAsn: 2.713 ± 0.056
7.444AlaPro: 7.444 ± 0.127
4.091AlaGln: 4.091 ± 0.067
10.244AlaArg: 10.244 ± 0.115
7.001AlaSer: 7.001 ± 0.078
7.519AlaThr: 7.519 ± 0.094
9.127AlaVal: 9.127 ± 0.093
1.668AlaTrp: 1.668 ± 0.038
2.706AlaTyr: 2.706 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
1.138CysAla: 1.138 ± 0.039
0.126CysCys: 0.126 ± 0.011
0.556CysAsp: 0.556 ± 0.021
0.477CysGlu: 0.477 ± 0.021
0.362CysPhe: 0.362 ± 0.015
0.966CysGly: 0.966 ± 0.03
0.225CysHis: 0.225 ± 0.013
0.349CysIle: 0.349 ± 0.017
0.212CysLys: 0.212 ± 0.013
0.876CysLeu: 0.876 ± 0.025
0.159CysMet: 0.159 ± 0.012
0.17CysAsn: 0.17 ± 0.01
0.466CysPro: 0.466 ± 0.021
0.215CysGln: 0.215 ± 0.013
0.756CysArg: 0.756 ± 0.028
0.443CysSer: 0.443 ± 0.018
0.428CysThr: 0.428 ± 0.018
0.692CysVal: 0.692 ± 0.024
0.112CysTrp: 0.112 ± 0.008
0.19CysTyr: 0.19 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
8.176AspAla: 8.176 ± 0.096
0.506AspCys: 0.506 ± 0.022
3.469AspAsp: 3.469 ± 0.065
3.49AspGlu: 3.49 ± 0.06
2.04AspPhe: 2.04 ± 0.045
5.408AspGly: 5.408 ± 0.077
1.286AspHis: 1.286 ± 0.027
2.772AspIle: 2.772 ± 0.046
1.912AspLys: 1.912 ± 0.047
6.068AspLeu: 6.068 ± 0.081
1.155AspMet: 1.155 ± 0.029
1.09AspAsn: 1.09 ± 0.031
3.71AspPro: 3.71 ± 0.063
1.459AspGln: 1.459 ± 0.038
4.304AspArg: 4.304 ± 0.063
1.967AspSer: 1.967 ± 0.043
2.646AspThr: 2.646 ± 0.048
4.449AspVal: 4.449 ± 0.062
0.877AspTrp: 0.877 ± 0.027
1.412AspTyr: 1.412 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
8.288GluAla: 8.288 ± 0.112
0.309GluCys: 0.309 ± 0.017
2.614GluAsp: 2.614 ± 0.055
2.518GluGlu: 2.518 ± 0.062
1.311GluPhe: 1.311 ± 0.031
3.977GluGly: 3.977 ± 0.062
1.216GluHis: 1.216 ± 0.034
3.011GluIle: 3.011 ± 0.05
2.024GluLys: 2.024 ± 0.049
4.611GluLeu: 4.611 ± 0.061
1.417GluMet: 1.417 ± 0.033
1.158GluAsn: 1.158 ± 0.031
3.016GluPro: 3.016 ± 0.056
1.669GluGln: 1.669 ± 0.041
4.927GluArg: 4.927 ± 0.073
2.207GluSer: 2.207 ± 0.041
3.436GluThr: 3.436 ± 0.051
3.789GluVal: 3.789 ± 0.058
0.542GluTrp: 0.542 ± 0.022
0.741GluTyr: 0.741 ± 0.026
0.0GluXaa: 0.0 ± 0.0
Phe
5.17PheAla: 5.17 ± 0.062
0.421PheCys: 0.421 ± 0.019
2.595PheAsp: 2.595 ± 0.049
2.118PheGlu: 2.118 ± 0.038
1.388PhePhe: 1.388 ± 0.039
3.592PheGly: 3.592 ± 0.057
0.768PheHis: 0.768 ± 0.024
1.387PheIle: 1.387 ± 0.035
1.074PheLys: 1.074 ± 0.033
3.221PheLeu: 3.221 ± 0.068
0.705PheMet: 0.705 ± 0.028
0.931PheAsn: 0.931 ± 0.029
1.546PhePro: 1.546 ± 0.034
0.887PheGln: 0.887 ± 0.027
2.105PheArg: 2.105 ± 0.042
1.995PheSer: 1.995 ± 0.041
1.933PheThr: 1.933 ± 0.034
3.004PheVal: 3.004 ± 0.056
0.466PheTrp: 0.466 ± 0.021
0.876PheTyr: 0.876 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
10.264GlyAla: 10.264 ± 0.106
0.908GlyCys: 0.908 ± 0.023
4.566GlyAsp: 4.566 ± 0.066
4.271GlyGlu: 4.271 ± 0.065
3.67GlyPhe: 3.67 ± 0.064
7.554GlyGly: 7.554 ± 0.123
1.974GlyHis: 1.974 ± 0.046
4.168GlyIle: 4.168 ± 0.064
3.089GlyLys: 3.089 ± 0.062
9.022GlyLeu: 9.022 ± 0.092
2.076GlyMet: 2.076 ± 0.041
1.817GlyAsn: 1.817 ± 0.041
3.833GlyPro: 3.833 ± 0.065
2.482GlyGln: 2.482 ± 0.049
6.442GlyArg: 6.442 ± 0.075
4.432GlySer: 4.432 ± 0.068
4.796GlyThr: 4.796 ± 0.078
5.949GlyVal: 5.949 ± 0.079
1.293GlyTrp: 1.293 ± 0.034
2.259GlyTyr: 2.259 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
2.781HisAla: 2.781 ± 0.059
0.214HisCys: 0.214 ± 0.014
1.418HisAsp: 1.418 ± 0.036
1.172HisGlu: 1.172 ± 0.035
0.818HisPhe: 0.818 ± 0.03
2.116HisGly: 2.116 ± 0.044
0.651HisHis: 0.651 ± 0.032
0.86HisIle: 0.86 ± 0.029
0.51HisLys: 0.51 ± 0.017
2.118HisLeu: 2.118 ± 0.04
0.473HisMet: 0.473 ± 0.018
0.412HisAsn: 0.412 ± 0.02
1.403HisPro: 1.403 ± 0.037
0.527HisGln: 0.527 ± 0.019
1.577HisArg: 1.577 ± 0.037
0.825HisSer: 0.825 ± 0.027
0.801HisThr: 0.801 ± 0.023
1.72HisVal: 1.72 ± 0.038
0.323HisTrp: 0.323 ± 0.016
0.576HisTyr: 0.576 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
7.481IleAla: 7.481 ± 0.074
0.474IleCys: 0.474 ± 0.022
3.48IleAsp: 3.48 ± 0.056
3.394IleGlu: 3.394 ± 0.059
1.615IlePhe: 1.615 ± 0.044
4.643IleGly: 4.643 ± 0.063
0.81IleHis: 0.81 ± 0.025
1.851IleIle: 1.851 ± 0.04
1.381IleLys: 1.381 ± 0.034
3.968IleLeu: 3.968 ± 0.058
0.849IleMet: 0.849 ± 0.027
1.132IleAsn: 1.132 ± 0.03
2.083IlePro: 2.083 ± 0.046
1.003IleGln: 1.003 ± 0.034
2.719IleArg: 2.719 ± 0.054
2.259IleSer: 2.259 ± 0.054
2.29IleThr: 2.29 ± 0.049
4.63IleVal: 4.63 ± 0.075
0.518IleTrp: 0.518 ± 0.02
1.032IleTyr: 1.032 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
4.561LysAla: 4.561 ± 0.074
0.161LysCys: 0.161 ± 0.011
1.698LysAsp: 1.698 ± 0.042
1.296LysGlu: 1.296 ± 0.035
0.812LysPhe: 0.812 ± 0.025
2.633LysGly: 2.633 ± 0.058
0.591LysHis: 0.591 ± 0.021
1.728LysIle: 1.728 ± 0.04
1.187LysLys: 1.187 ± 0.041
3.547LysLeu: 3.547 ± 0.054
0.739LysMet: 0.739 ± 0.028
0.719LysAsn: 0.719 ± 0.025
2.39LysPro: 2.39 ± 0.05
0.927LysGln: 0.927 ± 0.028
2.338LysArg: 2.338 ± 0.042
1.68LysSer: 1.68 ± 0.039
1.96LysThr: 1.96 ± 0.047
2.584LysVal: 2.584 ± 0.051
0.291LysTrp: 0.291 ± 0.014
0.532LysTyr: 0.532 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
15.531LeuAla: 15.531 ± 0.152
0.923LeuCys: 0.923 ± 0.028
6.336LeuAsp: 6.336 ± 0.08
5.077LeuGlu: 5.077 ± 0.081
3.513LeuPhe: 3.513 ± 0.068
8.309LeuGly: 8.309 ± 0.106
1.892LeuHis: 1.892 ± 0.045
4.387LeuIle: 4.387 ± 0.067
3.276LeuLys: 3.276 ± 0.056
8.957LeuLeu: 8.957 ± 0.099
2.116LeuMet: 2.116 ± 0.044
2.054LeuAsn: 2.054 ± 0.041
5.671LeuPro: 5.671 ± 0.073
2.549LeuGln: 2.549 ± 0.048
7.059LeuArg: 7.059 ± 0.078
5.518LeuSer: 5.518 ± 0.07
5.254LeuThr: 5.254 ± 0.067
8.567LeuVal: 8.567 ± 0.086
1.113LeuTrp: 1.113 ± 0.032
2.029LeuTyr: 2.029 ± 0.037
0.001LeuXaa: 0.001 ± 0.001
Met
2.829MetAla: 2.829 ± 0.049
0.158MetCys: 0.158 ± 0.012
1.025MetAsp: 1.025 ± 0.03
0.854MetGlu: 0.854 ± 0.025
0.65MetPhe: 0.65 ± 0.022
1.512MetGly: 1.512 ± 0.038
0.441MetHis: 0.441 ± 0.016
1.261MetIle: 1.261 ± 0.035
0.864MetLys: 0.864 ± 0.028
2.338MetLeu: 2.338 ± 0.045
0.598MetMet: 0.598 ± 0.026
0.656MetAsn: 0.656 ± 0.026
1.609MetPro: 1.609 ± 0.036
0.752MetGln: 0.752 ± 0.024
1.969MetArg: 1.969 ± 0.037
1.581MetSer: 1.581 ± 0.037
1.812MetThr: 1.812 ± 0.036
1.497MetVal: 1.497 ± 0.029
0.168MetTrp: 0.168 ± 0.011
0.306MetTyr: 0.306 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.891AsnAla: 2.891 ± 0.052
0.207AsnCys: 0.207 ± 0.013
1.261AsnAsp: 1.261 ± 0.034
1.118AsnGlu: 1.118 ± 0.031
0.831AsnPhe: 0.831 ± 0.027
2.115AsnGly: 2.115 ± 0.055
0.459AsnHis: 0.459 ± 0.022
1.098AsnIle: 1.098 ± 0.032
0.624AsnLys: 0.624 ± 0.024
2.194AsnLeu: 2.194 ± 0.042
0.475AsnMet: 0.475 ± 0.02
0.575AsnAsn: 0.575 ± 0.031
1.517AsnPro: 1.517 ± 0.039
0.614AsnGln: 0.614 ± 0.025
1.366AsnArg: 1.366 ± 0.03
0.912AsnSer: 0.912 ± 0.03
1.051AsnThr: 1.051 ± 0.036
1.845AsnVal: 1.845 ± 0.042
0.302AsnTrp: 0.302 ± 0.016
0.56AsnTyr: 0.56 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
7.735ProAla: 7.735 ± 0.097
0.393ProCys: 0.393 ± 0.019
3.812ProAsp: 3.812 ± 0.063
3.33ProGlu: 3.33 ± 0.051
2.064ProPhe: 2.064 ± 0.042
4.88ProGly: 4.88 ± 0.071
1.226ProHis: 1.226 ± 0.033
2.517ProIle: 2.517 ± 0.045
1.929ProLys: 1.929 ± 0.047
5.029ProLeu: 5.029 ± 0.069
1.254ProMet: 1.254 ± 0.03
1.24ProAsn: 1.24 ± 0.036
3.509ProPro: 3.509 ± 0.075
1.789ProGln: 1.789 ± 0.038
3.717ProArg: 3.717 ± 0.073
3.159ProSer: 3.159 ± 0.056
2.975ProThr: 2.975 ± 0.059
4.208ProVal: 4.208 ± 0.064
0.671ProTrp: 0.671 ± 0.024
1.194ProTyr: 1.194 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
4.128GlnAla: 4.128 ± 0.058
0.188GlnCys: 0.188 ± 0.011
1.362GlnAsp: 1.362 ± 0.034
1.198GlnGlu: 1.198 ± 0.029
0.805GlnPhe: 0.805 ± 0.027
2.268GlnGly: 2.268 ± 0.039
0.646GlnHis: 0.646 ± 0.024
1.521GlnIle: 1.521 ± 0.033
1.019GlnLys: 1.019 ± 0.03
2.576GlnLeu: 2.576 ± 0.045
0.671GlnMet: 0.671 ± 0.023
0.717GlnAsn: 0.717 ± 0.026
1.668GlnPro: 1.668 ± 0.04
1.022GlnGln: 1.022 ± 0.037
2.247GlnArg: 2.247 ± 0.041
1.59GlnSer: 1.59 ± 0.034
1.648GlnThr: 1.648 ± 0.033
2.039GlnVal: 2.039 ± 0.043
0.303GlnTrp: 0.303 ± 0.019
0.514GlnTyr: 0.514 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
9.189ArgAla: 9.189 ± 0.099
0.613ArgCys: 0.613 ± 0.021
4.479ArgAsp: 4.479 ± 0.062
4.207ArgGlu: 4.207 ± 0.064
3.061ArgPhe: 3.061 ± 0.051
5.091ArgGly: 5.091 ± 0.067
1.846ArgHis: 1.846 ± 0.042
3.547ArgIle: 3.547 ± 0.059
2.278ArgLys: 2.278 ± 0.045
8.163ArgLeu: 8.163 ± 0.081
1.717ArgMet: 1.717 ± 0.036
1.594ArgAsn: 1.594 ± 0.034
4.074ArgPro: 4.074 ± 0.063
2.3ArgGln: 2.3 ± 0.046
6.727ArgArg: 6.727 ± 0.096
3.571ArgSer: 3.571 ± 0.053
3.523ArgThr: 3.523 ± 0.051
4.934ArgVal: 4.934 ± 0.067
1.03ArgTrp: 1.03 ± 0.031
1.758ArgTyr: 1.758 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
5.984SerAla: 5.984 ± 0.068
0.463SerCys: 0.463 ± 0.019
2.849SerAsp: 2.849 ± 0.047
2.401SerGlu: 2.401 ± 0.045
2.19SerPhe: 2.19 ± 0.042
4.813SerGly: 4.813 ± 0.073
1.068SerHis: 1.068 ± 0.026
2.534SerIle: 2.534 ± 0.049
1.526SerLys: 1.526 ± 0.037
5.359SerLeu: 5.359 ± 0.068
1.241SerMet: 1.241 ± 0.034
1.238SerAsn: 1.238 ± 0.038
2.982SerPro: 2.982 ± 0.052
1.464SerGln: 1.464 ± 0.038
3.646SerArg: 3.646 ± 0.06
2.968SerSer: 2.968 ± 0.062
2.66SerThr: 2.66 ± 0.062
3.628SerVal: 3.628 ± 0.053
0.668SerTrp: 0.668 ± 0.022
1.245SerTyr: 1.245 ± 0.032
0.001SerXaa: 0.001 ± 0.001
Thr
6.474ThrAla: 6.474 ± 0.092
0.473ThrCys: 0.473 ± 0.017
2.632ThrAsp: 2.632 ± 0.054
2.18ThrGlu: 2.18 ± 0.05
2.14ThrPhe: 2.14 ± 0.044
4.782ThrGly: 4.782 ± 0.075
1.14ThrHis: 1.14 ± 0.031
3.204ThrIle: 3.204 ± 0.053
1.673ThrLys: 1.673 ± 0.043
6.069ThrLeu: 6.069 ± 0.071
1.186ThrMet: 1.186 ± 0.031
1.238ThrAsn: 1.238 ± 0.033
3.739ThrPro: 3.739 ± 0.059
1.49ThrGln: 1.49 ± 0.041
3.668ThrArg: 3.668 ± 0.055
3.012ThrSer: 3.012 ± 0.065
3.149ThrThr: 3.149 ± 0.06
3.97ThrVal: 3.97 ± 0.067
0.682ThrTrp: 0.682 ± 0.023
1.289ThrTyr: 1.289 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
11.327ValAla: 11.327 ± 0.101
0.707ValCys: 0.707 ± 0.023
4.402ValAsp: 4.402 ± 0.063
4.342ValGlu: 4.342 ± 0.056
2.678ValPhe: 2.678 ± 0.046
6.101ValGly: 6.101 ± 0.076
1.441ValHis: 1.441 ± 0.034
3.325ValIle: 3.325 ± 0.059
2.196ValLys: 2.196 ± 0.049
7.407ValLeu: 7.407 ± 0.075
1.67ValMet: 1.67 ± 0.037
1.622ValAsn: 1.622 ± 0.045
4.181ValPro: 4.181 ± 0.054
1.81ValGln: 1.81 ± 0.04
4.985ValArg: 4.985 ± 0.06
3.974ValSer: 3.974 ± 0.063
4.464ValThr: 4.464 ± 0.078
6.508ValVal: 6.508 ± 0.084
0.853ValTrp: 0.853 ± 0.025
1.49ValTyr: 1.49 ± 0.036
0.001ValXaa: 0.001 ± 0.001
Trp
1.212TrpAla: 1.212 ± 0.032
0.136TrpCys: 0.136 ± 0.01
0.56TrpAsp: 0.56 ± 0.022
0.45TrpGlu: 0.45 ± 0.021
0.469TrpPhe: 0.469 ± 0.02
0.788TrpGly: 0.788 ± 0.025
0.4TrpHis: 0.4 ± 0.02
0.561TrpIle: 0.561 ± 0.022
0.372TrpLys: 0.372 ± 0.016
1.573TrpLeu: 1.573 ± 0.037
0.305TrpMet: 0.305 ± 0.016
0.375TrpAsn: 0.375 ± 0.018
0.678TrpPro: 0.678 ± 0.021
0.455TrpGln: 0.455 ± 0.017
1.219TrpArg: 1.219 ± 0.033
0.802TrpSer: 0.802 ± 0.027
0.797TrpThr: 0.797 ± 0.027
0.755TrpVal: 0.755 ± 0.024
0.208TrpTrp: 0.208 ± 0.015
0.287TrpTyr: 0.287 ± 0.016
0.001TrpXaa: 0.001 ± 0.001
Tyr
2.737TyrAla: 2.737 ± 0.051
0.253TyrCys: 0.253 ± 0.015
1.507TyrAsp: 1.507 ± 0.032
1.225TyrGlu: 1.225 ± 0.03
0.769TyrPhe: 0.769 ± 0.029
2.054TyrGly: 2.054 ± 0.044
0.449TyrHis: 0.449 ± 0.02
0.795TyrIle: 0.795 ± 0.027
0.656TyrLys: 0.656 ± 0.025
2.188TyrLeu: 2.188 ± 0.048
0.393TyrMet: 0.393 ± 0.019
0.561TyrAsn: 0.561 ± 0.022
1.111TyrPro: 1.111 ± 0.032
0.625TyrGln: 0.625 ± 0.022
1.688TyrArg: 1.688 ± 0.041
1.03TyrSer: 1.03 ± 0.032
1.049TyrThr: 1.049 ± 0.032
1.59TyrVal: 1.59 ± 0.034
0.315TyrTrp: 0.315 ± 0.018
0.581TyrTyr: 0.581 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.002XaaLeu: 0.002 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4238 proteins (1273430 amino acids)
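
Each table entry above has the form `<value><First><Second>: <value> ± <error>`, for example `21.813AlaAla: 21.813 ± 0.241`. A minimal sketch for reading such entries could look as follows; the names `ENTRY` and `parse_entry` are hypothetical, and the pattern is only an assumption based on the entries shown here.

```python
import re

# value, two three-letter residue codes, then "value ± error"
ENTRY = re.compile(
    r"^\d+(?:\.\d+)?"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_entry(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )


example = parse_entry("21.813AlaAla: 21.813 ± 0.241")
```

Row labels such as `Ala` do not match the pattern and yield `None`, so they can be skipped when iterating over the table lines.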

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
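
One way such a protein-level bootstrap could be carried out is sketched below. The source states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and taking the standard deviation of the replicate estimates as the error is an assumption, as are the function names `pair_permille` and `bootstrap_error` and the toy sequences.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
proteins = ["AAAL", "ALA", "LLAA", "GAAG"]
err = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact within each replicate.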

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski