Amino acid dipeptide frequency for Pseudomonas benzenivorans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
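As a rough illustration of how such a frequency can be computed, the sketch below counts dipeptides in a list of protein sequences and reports them in per mille. It assumes overlapping windows (a protein of length n contributes n - 1 dipeptides) and maps any letter outside the 20 standard one-letter codes to X (Xaa); the function name `dipeptide_per_mille` and both choices are assumptions made for this example, not a description of how this page was produced.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Hypothetical helper: overlapping windows, non-standard letters become X.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows: a protein of length n contributes n - 1 dipeptides.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input: "MAAL" gives MA, AA, AL and "AALK" gives AA, AL, LK (6 windows).
freqs = dipeptide_per_mille(["MAAL", "AALK"])
print(round(freqs["AA"], 1))  # AA fills 2 of the 6 windows
```

The result always has 441 entries, so dipeptides that are absent from the input appear with a frequency of 0.0, as the Xaa entries do in the table below.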

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides (more frequent than expected) are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
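The arithmetic above can be sketched in a few lines. The example converts per mille values to fractions and compares them with the uniform expectation of 1/400. It assumes that "more than expected" means strictly above 2.5‰; the exact rule used for the colouring on this page is not stated, so `classify` is only a hypothetical helper. The four sample values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation.

    Assumption: a simple threshold comparison that ignores the error estimate.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
sample = {"AlaAla": 15.114, "LeuAla": 16.658, "CysCys": 0.155, "TrpTrp": 0.247}
for name, value in sample.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

With this threshold, AlaAla and LeuAla fall on the overrepresented side, while CysCys and TrpTrp fall on the underrepresented side.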

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.114 ± 0.145
AlaCys: 1.355 ± 0.032
AlaAsp: 6.063 ± 0.058
AlaGlu: 8.365 ± 0.087
AlaPhe: 3.972 ± 0.061
AlaGly: 10.061 ± 0.101
AlaHis: 2.296 ± 0.041
AlaIle: 5.328 ± 0.072
AlaLys: 3.559 ± 0.062
AlaLeu: 15.347 ± 0.144
AlaMet: 2.87 ± 0.046
AlaAsn: 2.795 ± 0.048
AlaPro: 5.11 ± 0.066
AlaGln: 5.669 ± 0.064
AlaArg: 7.997 ± 0.085
AlaSer: 6.449 ± 0.072
AlaThr: 4.76 ± 0.059
AlaVal: 7.622 ± 0.075
AlaTrp: 1.801 ± 0.035
AlaTyr: 2.629 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.266 ± 0.029
CysCys: 0.155 ± 0.01
CysAsp: 0.533 ± 0.018
CysGlu: 0.558 ± 0.019
CysPhe: 0.343 ± 0.012
CysGly: 1.047 ± 0.023
CysHis: 0.302 ± 0.014
CysIle: 0.45 ± 0.016
CysLys: 0.27 ± 0.013
CysLeu: 1.235 ± 0.03
CysMet: 0.202 ± 0.011
CysAsn: 0.265 ± 0.013
CysPro: 0.562 ± 0.018
CysGln: 0.421 ± 0.017
CysArg: 0.731 ± 0.022
CysSer: 0.63 ± 0.02
CysThr: 0.468 ± 0.018
CysVal: 0.702 ± 0.021
CysTrp: 0.152 ± 0.01
CysTyr: 0.264 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.846 ± 0.061
AspCys: 0.613 ± 0.023
AspAsp: 2.677 ± 0.048
AspGlu: 3.489 ± 0.046
AspPhe: 2.092 ± 0.038
AspGly: 4.585 ± 0.058
AspHis: 1.032 ± 0.026
AspIle: 2.38 ± 0.035
AspLys: 1.762 ± 0.037
AspLeu: 5.883 ± 0.064
AspMet: 1.049 ± 0.027
AspAsn: 1.453 ± 0.037
AspPro: 2.742 ± 0.042
AspGln: 2.007 ± 0.037
AspArg: 2.947 ± 0.039
AspSer: 3.044 ± 0.044
AspThr: 1.992 ± 0.033
AspVal: 3.036 ± 0.052
AspTrp: 1.074 ± 0.025
AspTyr: 1.735 ± 0.036
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.003 ± 0.066
GluCys: 0.462 ± 0.017
GluAsp: 2.537 ± 0.041
GluGlu: 3.213 ± 0.044
GluPhe: 1.871 ± 0.046
GluGly: 3.934 ± 0.053
GluHis: 1.654 ± 0.033
GluIle: 2.719 ± 0.046
GluLys: 1.933 ± 0.038
GluLeu: 7.936 ± 0.078
GluMet: 1.302 ± 0.031
GluAsn: 1.307 ± 0.03
GluPro: 2.633 ± 0.042
GluGln: 4.088 ± 0.063
GluArg: 5.395 ± 0.073
GluSer: 2.602 ± 0.044
GluThr: 2.373 ± 0.039
GluVal: 4.446 ± 0.052
GluTrp: 0.729 ± 0.025
GluTyr: 1.137 ± 0.028
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.565 ± 0.063
PheCys: 0.466 ± 0.015
PheAsp: 2.405 ± 0.041
PheGlu: 2.061 ± 0.034
PhePhe: 1.398 ± 0.034
PheGly: 3.227 ± 0.052
PheHis: 0.739 ± 0.022
PheIle: 1.71 ± 0.039
PheLys: 1.134 ± 0.031
PheLeu: 3.291 ± 0.052
PheMet: 0.758 ± 0.022
PheAsn: 1.226 ± 0.027
PhePro: 1.438 ± 0.029
PheGln: 1.223 ± 0.028
PheArg: 1.955 ± 0.036
PheSer: 2.44 ± 0.042
PheThr: 1.642 ± 0.03
PheVal: 2.513 ± 0.042
PheTrp: 0.55 ± 0.024
PheTyr: 1.029 ± 0.024
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.163 ± 0.086
GlyCys: 1.017 ± 0.029
GlyAsp: 3.926 ± 0.054
GlyGlu: 5.304 ± 0.059
GlyPhe: 3.302 ± 0.045
GlyGly: 6.412 ± 0.101
GlyHis: 1.901 ± 0.034
GlyIle: 4.02 ± 0.051
GlyLys: 3.121 ± 0.054
GlyLeu: 9.985 ± 0.095
GlyMet: 2.173 ± 0.041
GlyAsn: 2.232 ± 0.045
GlyPro: 2.682 ± 0.041
GlyGln: 3.858 ± 0.052
GlyArg: 5.335 ± 0.07
GlySer: 4.746 ± 0.071
GlyThr: 3.45 ± 0.062
GlyVal: 5.9 ± 0.071
GlyTrp: 1.353 ± 0.033
GlyTyr: 2.485 ± 0.04
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.425 ± 0.036
HisCys: 0.381 ± 0.013
HisAsp: 1.148 ± 0.025
HisGlu: 1.222 ± 0.032
HisPhe: 1.001 ± 0.024
HisGly: 2.051 ± 0.037
HisHis: 0.584 ± 0.02
HisIle: 0.965 ± 0.026
HisLys: 0.634 ± 0.021
HisLeu: 2.677 ± 0.039
HisMet: 0.459 ± 0.017
HisAsn: 0.636 ± 0.019
HisPro: 1.413 ± 0.032
HisGln: 0.932 ± 0.026
HisArg: 1.405 ± 0.031
HisSer: 1.271 ± 0.028
HisThr: 0.862 ± 0.024
HisVal: 1.24 ± 0.028
HisTrp: 0.46 ± 0.019
HisTyr: 0.74 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.849 ± 0.069
IleCys: 0.486 ± 0.017
IleAsp: 3.042 ± 0.048
IleGlu: 3.289 ± 0.044
IlePhe: 1.313 ± 0.034
IleGly: 4.447 ± 0.055
IleHis: 0.937 ± 0.023
IleIle: 1.738 ± 0.036
IleLys: 1.505 ± 0.038
IleLeu: 4.101 ± 0.056
IleMet: 0.763 ± 0.021
IleAsn: 1.46 ± 0.032
IlePro: 2.211 ± 0.04
IleGln: 1.583 ± 0.034
IleArg: 2.885 ± 0.045
IleSer: 2.6 ± 0.045
IleThr: 2.102 ± 0.04
IleVal: 3.009 ± 0.046
IleTrp: 0.471 ± 0.017
IleTyr: 1.065 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.745 ± 0.056
LysCys: 0.188 ± 0.011
LysAsp: 1.537 ± 0.03
LysGlu: 1.48 ± 0.033
LysPhe: 0.82 ± 0.021
LysGly: 2.335 ± 0.046
LysHis: 0.676 ± 0.022
LysIle: 1.401 ± 0.033
LysLys: 1.116 ± 0.034
LysLeu: 3.64 ± 0.047
LysMet: 0.631 ± 0.02
LysAsn: 0.839 ± 0.026
LysPro: 1.894 ± 0.038
LysGln: 1.537 ± 0.035
LysArg: 2.31 ± 0.043
LysSer: 1.611 ± 0.034
LysThr: 1.59 ± 0.032
LysVal: 2.445 ± 0.046
LysTrp: 0.326 ± 0.015
LysTyr: 0.65 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.658 ± 0.147
LeuCys: 1.367 ± 0.029
LeuAsp: 6.973 ± 0.083
LeuGlu: 6.918 ± 0.082
LeuPhe: 4.435 ± 0.071
LeuGly: 10.179 ± 0.095
LeuHis: 2.658 ± 0.044
LeuIle: 5.489 ± 0.069
LeuLys: 4.083 ± 0.053
LeuLeu: 15.916 ± 0.181
LeuMet: 2.365 ± 0.039
LeuAsn: 3.385 ± 0.048
LeuPro: 6.759 ± 0.07
LeuGln: 5.31 ± 0.075
LeuArg: 8.252 ± 0.081
LeuSer: 7.269 ± 0.089
LeuThr: 5.173 ± 0.062
LeuVal: 7.981 ± 0.072
LeuTrp: 1.535 ± 0.035
LeuTyr: 2.694 ± 0.038
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.528 ± 0.044
MetCys: 0.152 ± 0.01
MetAsp: 0.94 ± 0.027
MetGlu: 0.877 ± 0.025
MetPhe: 0.604 ± 0.02
MetGly: 1.57 ± 0.034
MetHis: 0.511 ± 0.017
MetIle: 0.984 ± 0.025
MetLys: 0.756 ± 0.021
MetLeu: 2.634 ± 0.044
MetMet: 0.416 ± 0.02
MetAsn: 0.799 ± 0.023
MetPro: 1.347 ± 0.028
MetGln: 1.016 ± 0.026
MetArg: 1.487 ± 0.026
MetSer: 1.584 ± 0.032
MetThr: 1.237 ± 0.033
MetVal: 1.361 ± 0.033
MetTrp: 0.167 ± 0.01
MetTyr: 0.316 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.924 ± 0.049
AsnCys: 0.313 ± 0.015
AsnAsp: 1.3 ± 0.028
AsnGlu: 1.265 ± 0.031
AsnPhe: 0.955 ± 0.027
AsnGly: 2.283 ± 0.044
AsnHis: 0.547 ± 0.018
AsnIle: 1.267 ± 0.029
AsnLys: 0.787 ± 0.024
AsnLeu: 3.303 ± 0.046
AsnMet: 0.507 ± 0.019
AsnAsn: 0.768 ± 0.023
AsnPro: 1.935 ± 0.037
AsnGln: 1.262 ± 0.032
AsnArg: 1.843 ± 0.038
AsnSer: 1.36 ± 0.031
AsnThr: 1.196 ± 0.027
AsnVal: 1.665 ± 0.037
AsnTrp: 0.408 ± 0.016
AsnTyr: 0.722 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.404 ± 0.08
ProCys: 0.395 ± 0.016
ProAsp: 2.589 ± 0.042
ProGlu: 3.237 ± 0.041
ProPhe: 1.786 ± 0.034
ProGly: 4.214 ± 0.058
ProHis: 1.098 ± 0.027
ProIle: 1.916 ± 0.034
ProLys: 1.388 ± 0.03
ProLeu: 6.384 ± 0.07
ProMet: 1.055 ± 0.026
ProAsn: 1.237 ± 0.027
ProPro: 2.168 ± 0.045
ProGln: 2.422 ± 0.035
ProArg: 2.868 ± 0.051
ProSer: 2.659 ± 0.043
ProThr: 2.062 ± 0.031
ProVal: 3.498 ± 0.047
ProTrp: 0.803 ± 0.023
ProTyr: 1.232 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.705 ± 0.083
GlnCys: 0.356 ± 0.017
GlnAsp: 1.941 ± 0.035
GlnGlu: 2.086 ± 0.037
GlnPhe: 1.399 ± 0.031
GlnGly: 3.644 ± 0.049
GlnHis: 1.206 ± 0.031
GlnIle: 1.965 ± 0.033
GlnLys: 1.105 ± 0.025
GlnLeu: 6.152 ± 0.087
GlnMet: 0.959 ± 0.028
GlnAsn: 0.947 ± 0.026
GlnPro: 2.67 ± 0.043
GlnGln: 2.987 ± 0.056
GlnArg: 4.244 ± 0.058
GlnSer: 2.184 ± 0.036
GlnThr: 1.764 ± 0.034
GlnVal: 3.567 ± 0.045
GlnTrp: 0.647 ± 0.021
GlnTyr: 0.923 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.967 ± 0.072
ArgCys: 0.656 ± 0.023
ArgAsp: 3.526 ± 0.048
ArgGlu: 4.521 ± 0.062
ArgPhe: 2.892 ± 0.043
ArgGly: 4.466 ± 0.06
ArgHis: 1.81 ± 0.035
ArgIle: 3.36 ± 0.047
ArgLys: 1.946 ± 0.033
ArgLeu: 9.609 ± 0.097
ArgMet: 1.582 ± 0.032
ArgAsn: 1.762 ± 0.038
ArgPro: 3.008 ± 0.045
ArgGln: 3.829 ± 0.056
ArgArg: 5.088 ± 0.066
ArgSer: 3.591 ± 0.046
ArgThr: 2.478 ± 0.038
ArgVal: 4.728 ± 0.056
ArgTrp: 1.086 ± 0.033
ArgTyr: 2.135 ± 0.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.398 ± 0.07
SerCys: 0.547 ± 0.019
SerAsp: 2.658 ± 0.043
SerGlu: 3.087 ± 0.046
SerPhe: 2.089 ± 0.037
SerGly: 5.208 ± 0.079
SerHis: 1.286 ± 0.031
SerIle: 2.502 ± 0.043
SerLys: 1.565 ± 0.032
SerLeu: 7.368 ± 0.079
SerMet: 1.244 ± 0.031
SerAsn: 1.589 ± 0.041
SerPro: 2.629 ± 0.042
SerGln: 2.4 ± 0.039
SerArg: 3.599 ± 0.045
SerSer: 3.304 ± 0.056
SerThr: 2.446 ± 0.043
SerVal: 3.585 ± 0.053
SerTrp: 0.816 ± 0.025
SerTyr: 1.431 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.819 ± 0.064
ThrCys: 0.446 ± 0.017
ThrAsp: 1.953 ± 0.036
ThrGlu: 2.107 ± 0.036
ThrPhe: 1.563 ± 0.034
ThrGly: 3.668 ± 0.065
ThrHis: 0.929 ± 0.023
ThrIle: 1.714 ± 0.041
ThrLys: 0.904 ± 0.029
ThrLeu: 6.098 ± 0.078
ThrMet: 0.672 ± 0.02
ThrAsn: 0.97 ± 0.026
ThrPro: 2.916 ± 0.045
ThrGln: 1.753 ± 0.029
ThrArg: 2.861 ± 0.044
ThrSer: 2.308 ± 0.041
ThrThr: 2.071 ± 0.05
ThrVal: 2.953 ± 0.054
ThrTrp: 0.607 ± 0.02
ThrTyr: 1.081 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.91 ± 0.069
ValCys: 0.731 ± 0.025
ValAsp: 3.777 ± 0.053
ValGlu: 4.517 ± 0.051
ValPhe: 2.429 ± 0.035
ValGly: 5.092 ± 0.068
ValHis: 1.365 ± 0.03
ValIle: 3.41 ± 0.053
ValLys: 2.149 ± 0.042
ValLeu: 8.207 ± 0.078
ValMet: 1.534 ± 0.035
ValAsn: 1.898 ± 0.035
ValPro: 3.229 ± 0.047
ValGln: 2.706 ± 0.042
ValArg: 4.418 ± 0.057
ValSer: 3.841 ± 0.052
ValThr: 3.128 ± 0.06
ValVal: 4.962 ± 0.066
ValTrp: 0.798 ± 0.025
ValTyr: 1.529 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.248 ± 0.029
TrpCys: 0.166 ± 0.01
TrpAsp: 0.611 ± 0.02
TrpGlu: 0.564 ± 0.018
TrpPhe: 0.511 ± 0.018
TrpGly: 0.892 ± 0.025
TrpHis: 0.393 ± 0.014
TrpIle: 0.635 ± 0.024
TrpLys: 0.398 ± 0.016
TrpLeu: 2.456 ± 0.054
TrpMet: 0.339 ± 0.015
TrpAsn: 0.407 ± 0.016
TrpPro: 0.737 ± 0.022
TrpGln: 1.056 ± 0.028
TrpArg: 1.207 ± 0.031
TrpSer: 0.765 ± 0.021
TrpThr: 0.565 ± 0.02
TrpVal: 0.883 ± 0.023
TrpTrp: 0.247 ± 0.013
TrpTyr: 0.363 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.625 ± 0.037
TyrCys: 0.302 ± 0.014
TyrAsp: 1.301 ± 0.036
TyrGlu: 1.113 ± 0.03
TyrPhe: 0.997 ± 0.022
TyrGly: 2.052 ± 0.037
TyrHis: 0.584 ± 0.023
TyrIle: 0.903 ± 0.023
TyrLys: 0.658 ± 0.022
TyrLeu: 3.177 ± 0.046
TyrMet: 0.406 ± 0.017
TyrAsn: 0.635 ± 0.021
TyrPro: 1.332 ± 0.031
TyrGln: 1.394 ± 0.029
TyrArg: 2.212 ± 0.035
TyrSer: 1.489 ± 0.033
TyrThr: 1.034 ± 0.027
TyrVal: 1.536 ± 0.03
TyrTrp: 0.412 ± 0.018
TyrTyr: 0.722 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5186 proteins (1669955 amino acids)
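To relate the per mille values to absolute counts, the sketch below derives the number of dipeptide positions from the totals above. It assumes overlapping windows that do not span protein boundaries, so each protein of length n contributes n - 1 dipeptides, and that no protein is empty. Both assumptions, and the helper name `approx_count`, are illustrative and not confirmed by the source.

```python
N_PROTEINS = 5186         # from the statistics line above
N_AMINO_ACIDS = 1669955   # from the statistics line above

# Assumption: one dipeptide window per adjacent residue pair within a protein,
# so every protein loses exactly one position relative to its length.
n_windows = N_AMINO_ACIDS - N_PROTEINS


def approx_count(per_mille):
    """Approximate absolute count for a frequency given in per mille."""
    return per_mille * 1e-3 * n_windows


print(n_windows)
print(round(approx_count(15.114)))  # AlaAla, value taken from the table
```

Because the table values are rounded to three decimals, counts recovered this way are only approximate.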

Note: The error was estimated by bootstrapping (×100) at the protein level.
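A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled estimates is reported. This is a minimal sketch; that the reported ± value is the standard deviation of 100 resampled estimates is an assumption, and `bootstrap_error` is a hypothetical helper rather than the code used for this page.

```python
import random
import statistics


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Spread (sample standard deviation) of a per mille dipeptide frequency
    under protein-level resampling with replacement."""
    rng = random.Random(seed)
    # Precompute (hits, windows) once per protein so resampling stays cheap.
    per_protein = []
    for seq in sequences:
        windows = [seq[i:i + 2] for i in range(len(seq) - 1)]
        per_protein.append((windows.count(dipeptide), len(windows)))
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, n in sample)
        total = sum(n for h, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


toy = ["MAAL", "AALK", "MKLV", "AAAA"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring positions inside each protein, which is presumably why the page does it at the protein level.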

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski