Amino acid dipepetide frequency for Pandoraea terrae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.158AlaAla: 17.158 ± 0.147
1.322AlaCys: 1.322 ± 0.028
6.494AlaAsp: 6.494 ± 0.059
6.104AlaGlu: 6.104 ± 0.068
4.316AlaPhe: 4.316 ± 0.05
11.078AlaGly: 11.078 ± 0.1
2.734AlaHis: 2.734 ± 0.043
5.809AlaIle: 5.809 ± 0.062
3.853AlaLys: 3.853 ± 0.057
14.101AlaLeu: 14.101 ± 0.107
3.522AlaMet: 3.522 ± 0.049
3.239AlaAsn: 3.239 ± 0.045
6.034AlaPro: 6.034 ± 0.075
5.304AlaGln: 5.304 ± 0.069
8.681AlaArg: 8.681 ± 0.098
6.937AlaSer: 6.937 ± 0.068
6.117AlaThr: 6.117 ± 0.066
8.967AlaVal: 8.967 ± 0.074
1.68AlaTrp: 1.68 ± 0.033
2.801AlaTyr: 2.801 ± 0.036
0.0AlaXaa: 0.0 ± 0.0
Cys
1.289CysAla: 1.289 ± 0.028
0.135CysCys: 0.135 ± 0.011
0.567CysAsp: 0.567 ± 0.018
0.528CysGlu: 0.528 ± 0.017
0.327CysPhe: 0.327 ± 0.013
1.076CysGly: 1.076 ± 0.026
0.299CysHis: 0.299 ± 0.015
0.422CysIle: 0.422 ± 0.015
0.252CysLys: 0.252 ± 0.011
0.913CysLeu: 0.913 ± 0.023
0.207CysMet: 0.207 ± 0.011
0.244CysAsn: 0.244 ± 0.01
0.49CysPro: 0.49 ± 0.02
0.273CysGln: 0.273 ± 0.012
0.664CysArg: 0.664 ± 0.02
0.507CysSer: 0.507 ± 0.019
0.47CysThr: 0.47 ± 0.018
0.838CysVal: 0.838 ± 0.022
0.12CysTrp: 0.12 ± 0.009
0.226CysTyr: 0.226 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.528AspAla: 7.528 ± 0.077
0.473AspCys: 0.473 ± 0.016
3.068AspAsp: 3.068 ± 0.052
3.149AspGlu: 3.149 ± 0.046
2.024AspPhe: 2.024 ± 0.036
4.759AspGly: 4.759 ± 0.056
1.074AspHis: 1.074 ± 0.024
2.767AspIle: 2.767 ± 0.04
1.629AspLys: 1.629 ± 0.03
5.081AspLeu: 5.081 ± 0.062
1.228AspMet: 1.228 ± 0.026
1.35AspAsn: 1.35 ± 0.029
2.882AspPro: 2.882 ± 0.045
1.531AspGln: 1.531 ± 0.033
3.479AspArg: 3.479 ± 0.049
2.232AspSer: 2.232 ± 0.031
2.851AspThr: 2.851 ± 0.045
4.471AspVal: 4.471 ± 0.06
0.878AspTrp: 0.878 ± 0.024
1.411AspTyr: 1.411 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
6.649GluAla: 6.649 ± 0.07
0.446GluCys: 0.446 ± 0.015
2.236GluAsp: 2.236 ± 0.041
2.156GluGlu: 2.156 ± 0.039
1.782GluPhe: 1.782 ± 0.034
3.417GluGly: 3.417 ± 0.049
1.345GluHis: 1.345 ± 0.032
2.914GluIle: 2.914 ± 0.044
1.857GluLys: 1.857 ± 0.038
5.1GluLeu: 5.1 ± 0.063
1.327GluMet: 1.327 ± 0.028
1.492GluAsn: 1.492 ± 0.03
2.157GluPro: 2.157 ± 0.034
2.221GluGln: 2.221 ± 0.043
4.537GluArg: 4.537 ± 0.057
2.49GluSer: 2.49 ± 0.038
2.854GluThr: 2.854 ± 0.038
3.636GluVal: 3.636 ± 0.043
0.712GluTrp: 0.712 ± 0.021
1.201GluTyr: 1.201 ± 0.026
0.0GluXaa: 0.0 ± 0.0
Phe
4.455PheAla: 4.455 ± 0.058
0.438PheCys: 0.438 ± 0.014
2.584PheAsp: 2.584 ± 0.04
2.088PheGlu: 2.088 ± 0.036
1.384PhePhe: 1.384 ± 0.032
3.739PheGly: 3.739 ± 0.046
0.796PheHis: 0.796 ± 0.024
1.624PheIle: 1.624 ± 0.033
1.112PheLys: 1.112 ± 0.03
3.154PheLeu: 3.154 ± 0.046
0.81PheMet: 0.81 ± 0.02
1.167PheAsn: 1.167 ± 0.025
1.642PhePro: 1.642 ± 0.031
1.029PheGln: 1.029 ± 0.026
2.012PheArg: 2.012 ± 0.036
2.308PheSer: 2.308 ± 0.039
1.906PheThr: 1.906 ± 0.036
3.024PheVal: 3.024 ± 0.045
0.513PheTrp: 0.513 ± 0.018
0.973PheTyr: 0.973 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
9.752GlyAla: 9.752 ± 0.096
0.911GlyCys: 0.911 ± 0.02
4.223GlyAsp: 4.223 ± 0.06
4.396GlyGlu: 4.396 ± 0.055
3.409GlyPhe: 3.409 ± 0.049
7.275GlyGly: 7.275 ± 0.104
1.943GlyHis: 1.943 ± 0.035
4.387GlyIle: 4.387 ± 0.049
3.353GlyLys: 3.353 ± 0.055
8.279GlyLeu: 8.279 ± 0.079
2.478GlyMet: 2.478 ± 0.039
2.514GlyAsn: 2.514 ± 0.053
3.068GlyPro: 3.068 ± 0.041
3.047GlyGln: 3.047 ± 0.048
5.562GlyArg: 5.562 ± 0.056
4.225GlySer: 4.225 ± 0.052
4.69GlyThr: 4.69 ± 0.075
7.069GlyVal: 7.069 ± 0.059
1.343GlyTrp: 1.343 ± 0.03
2.341GlyTyr: 2.341 ± 0.036
0.0GlyXaa: 0.0 ± 0.0
His
2.991HisAla: 2.991 ± 0.038
0.292HisCys: 0.292 ± 0.011
1.365HisAsp: 1.365 ± 0.031
1.149HisGlu: 1.149 ± 0.027
0.971HisPhe: 0.971 ± 0.023
2.193HisGly: 2.193 ± 0.039
0.646HisHis: 0.646 ± 0.021
0.96HisIle: 0.96 ± 0.022
0.572HisLys: 0.572 ± 0.016
2.246HisLeu: 2.246 ± 0.042
0.528HisMet: 0.528 ± 0.016
0.541HisAsn: 0.541 ± 0.019
1.49HisPro: 1.49 ± 0.031
0.697HisGln: 0.697 ± 0.019
1.568HisArg: 1.568 ± 0.028
1.002HisSer: 1.002 ± 0.023
1.073HisThr: 1.073 ± 0.025
1.751HisVal: 1.751 ± 0.031
0.411HisTrp: 0.411 ± 0.015
0.674HisTyr: 0.674 ± 0.017
0.0HisXaa: 0.0 ± 0.0
Ile
6.605IleAla: 6.605 ± 0.062
0.486IleCys: 0.486 ± 0.018
3.387IleAsp: 3.387 ± 0.047
3.179IleGlu: 3.179 ± 0.044
1.507IlePhe: 1.507 ± 0.03
4.663IleGly: 4.663 ± 0.062
0.981IleHis: 0.981 ± 0.024
1.753IleIle: 1.753 ± 0.034
1.546IleLys: 1.546 ± 0.031
3.744IleLeu: 3.744 ± 0.049
0.841IleMet: 0.841 ± 0.021
1.469IleAsn: 1.469 ± 0.028
2.141IlePro: 2.141 ± 0.036
1.261IleGln: 1.261 ± 0.027
2.904IleArg: 2.904 ± 0.045
2.653IleSer: 2.653 ± 0.041
2.522IleThr: 2.522 ± 0.043
4.224IleVal: 4.224 ± 0.056
0.503IleTrp: 0.503 ± 0.02
1.074IleTyr: 1.074 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
3.588LysAla: 3.588 ± 0.048
0.192LysCys: 0.192 ± 0.01
1.532LysAsp: 1.532 ± 0.029
1.417LysGlu: 1.417 ± 0.034
1.019LysPhe: 1.019 ± 0.024
2.202LysGly: 2.202 ± 0.037
0.753LysHis: 0.753 ± 0.019
1.609LysIle: 1.609 ± 0.03
1.197LysLys: 1.197 ± 0.036
3.436LysLeu: 3.436 ± 0.045
0.82LysMet: 0.82 ± 0.021
0.888LysAsn: 0.888 ± 0.026
1.953LysPro: 1.953 ± 0.035
1.298LysGln: 1.298 ± 0.029
2.395LysArg: 2.395 ± 0.038
1.781LysSer: 1.781 ± 0.034
1.962LysThr: 1.962 ± 0.037
2.432LysVal: 2.432 ± 0.046
0.389LysTrp: 0.389 ± 0.015
0.766LysTyr: 0.766 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
13.917LeuAla: 13.917 ± 0.113
1.074LeuCys: 1.074 ± 0.023
5.871LeuAsp: 5.871 ± 0.062
4.86LeuGlu: 4.86 ± 0.06
3.539LeuPhe: 3.539 ± 0.052
8.488LeuGly: 8.488 ± 0.084
2.232LeuHis: 2.232 ± 0.036
4.567LeuIle: 4.567 ± 0.057
3.351LeuLys: 3.351 ± 0.045
10.081LeuLeu: 10.081 ± 0.1
2.409LeuMet: 2.409 ± 0.043
2.709LeuAsn: 2.709 ± 0.039
5.877LeuPro: 5.877 ± 0.065
3.421LeuGln: 3.421 ± 0.051
7.304LeuArg: 7.304 ± 0.071
6.338LeuSer: 6.338 ± 0.066
5.897LeuThr: 5.897 ± 0.061
7.275LeuVal: 7.275 ± 0.071
1.196LeuTrp: 1.196 ± 0.028
2.109LeuTyr: 2.109 ± 0.036
0.0LeuXaa: 0.0 ± 0.0
Met
2.787MetAla: 2.787 ± 0.036
0.217MetCys: 0.217 ± 0.01
1.003MetAsp: 1.003 ± 0.022
0.988MetGlu: 0.988 ± 0.027
0.783MetPhe: 0.783 ± 0.021
1.759MetGly: 1.759 ± 0.033
0.571MetHis: 0.571 ± 0.016
1.164MetIle: 1.164 ± 0.027
1.001MetLys: 1.001 ± 0.023
2.776MetLeu: 2.776 ± 0.042
0.679MetMet: 0.679 ± 0.021
0.887MetAsn: 0.887 ± 0.022
1.557MetPro: 1.557 ± 0.032
1.029MetGln: 1.029 ± 0.026
1.928MetArg: 1.928 ± 0.035
1.777MetSer: 1.777 ± 0.031
1.868MetThr: 1.868 ± 0.035
1.652MetVal: 1.652 ± 0.028
0.228MetTrp: 0.228 ± 0.011
0.438MetTyr: 0.438 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.502AsnAla: 3.502 ± 0.04
0.272AsnCys: 0.272 ± 0.012
1.458AsnAsp: 1.458 ± 0.029
1.298AsnGlu: 1.298 ± 0.025
1.028AsnPhe: 1.028 ± 0.025
2.541AsnGly: 2.541 ± 0.049
0.57AsnHis: 0.57 ± 0.018
1.308AsnIle: 1.308 ± 0.027
0.758AsnLys: 0.758 ± 0.019
2.736AsnLeu: 2.736 ± 0.042
0.596AsnMet: 0.596 ± 0.021
0.794AsnAsn: 0.794 ± 0.027
1.855AsnPro: 1.855 ± 0.037
0.904AsnGln: 0.904 ± 0.021
1.891AsnArg: 1.891 ± 0.039
1.3AsnSer: 1.3 ± 0.028
1.536AsnThr: 1.536 ± 0.033
2.308AsnVal: 2.308 ± 0.037
0.384AsnTrp: 0.384 ± 0.016
0.769AsnTyr: 0.769 ± 0.021
0.0AsnXaa: 0.0 ± 0.0
Pro
6.892ProAla: 6.892 ± 0.076
0.393ProCys: 0.393 ± 0.015
3.247ProAsp: 3.247 ± 0.044
3.069ProGlu: 3.069 ± 0.047
1.879ProPhe: 1.879 ± 0.033
4.462ProGly: 4.462 ± 0.052
1.215ProHis: 1.215 ± 0.029
2.125ProIle: 2.125 ± 0.033
1.567ProLys: 1.567 ± 0.027
5.052ProLeu: 5.052 ± 0.055
1.234ProMet: 1.234 ± 0.024
1.476ProAsn: 1.476 ± 0.03
2.53ProPro: 2.53 ± 0.049
1.813ProGln: 1.813 ± 0.04
2.876ProArg: 2.876 ± 0.044
2.817ProSer: 2.817 ± 0.043
2.513ProThr: 2.513 ± 0.036
4.274ProVal: 4.274 ± 0.049
0.682ProTrp: 0.682 ± 0.019
1.256ProTyr: 1.256 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.711GlnAla: 4.711 ± 0.06
0.327GlnCys: 0.327 ± 0.014
1.479GlnAsp: 1.479 ± 0.029
1.403GlnGlu: 1.403 ± 0.03
1.294GlnPhe: 1.294 ± 0.027
2.682GlnGly: 2.682 ± 0.038
0.872GlnHis: 0.872 ± 0.023
1.923GlnIle: 1.923 ± 0.033
1.076GlnLys: 1.076 ± 0.025
3.789GlnLeu: 3.789 ± 0.052
1.031GlnMet: 1.031 ± 0.026
0.928GlnAsn: 0.928 ± 0.023
1.882GlnPro: 1.882 ± 0.039
1.672GlnGln: 1.672 ± 0.034
2.953GlnArg: 2.953 ± 0.046
1.951GlnSer: 1.951 ± 0.035
2.011GlnThr: 2.011 ± 0.034
2.609GlnVal: 2.609 ± 0.043
0.603GlnTrp: 0.603 ± 0.02
0.908GlnTyr: 0.908 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
8.11ArgAla: 8.11 ± 0.072
0.619ArgCys: 0.619 ± 0.022
3.907ArgAsp: 3.907 ± 0.051
4.12ArgGlu: 4.12 ± 0.048
2.923ArgPhe: 2.923 ± 0.046
4.766ArgGly: 4.766 ± 0.056
1.978ArgHis: 1.978 ± 0.034
3.611ArgIle: 3.611 ± 0.046
2.082ArgLys: 2.082 ± 0.035
7.366ArgLeu: 7.366 ± 0.063
1.933ArgMet: 1.933 ± 0.031
1.978ArgAsn: 1.978 ± 0.03
3.114ArgPro: 3.114 ± 0.041
2.809ArgGln: 2.809 ± 0.043
5.274ArgArg: 5.274 ± 0.061
3.274ArgSer: 3.274 ± 0.05
3.361ArgThr: 3.361 ± 0.044
5.267ArgVal: 5.267 ± 0.049
1.052ArgTrp: 1.052 ± 0.024
1.95ArgTyr: 1.95 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
6.599SerAla: 6.599 ± 0.069
0.498SerCys: 0.498 ± 0.019
2.725SerAsp: 2.725 ± 0.039
2.574SerGlu: 2.574 ± 0.042
1.993SerPhe: 1.993 ± 0.034
5.216SerGly: 5.216 ± 0.066
1.252SerHis: 1.252 ± 0.022
2.596SerIle: 2.596 ± 0.04
1.518SerLys: 1.518 ± 0.028
5.794SerLeu: 5.794 ± 0.068
1.399SerMet: 1.399 ± 0.027
1.471SerAsn: 1.471 ± 0.031
2.966SerPro: 2.966 ± 0.041
1.808SerGln: 1.808 ± 0.028
3.628SerArg: 3.628 ± 0.048
3.06SerSer: 3.06 ± 0.048
2.835SerThr: 2.835 ± 0.042
4.292SerVal: 4.292 ± 0.048
0.685SerTrp: 0.685 ± 0.022
1.289SerTyr: 1.289 ± 0.028
0.0SerXaa: 0.0 ± 0.0
Thr
6.076ThrAla: 6.076 ± 0.065
0.497ThrCys: 0.497 ± 0.018
2.683ThrAsp: 2.683 ± 0.041
2.39ThrGlu: 2.39 ± 0.04
2.05ThrPhe: 2.05 ± 0.036
4.876ThrGly: 4.876 ± 0.058
1.278ThrHis: 1.278 ± 0.027
2.536ThrIle: 2.536 ± 0.037
1.279ThrLys: 1.279 ± 0.031
6.406ThrLeu: 6.406 ± 0.067
1.229ThrMet: 1.229 ± 0.026
1.4ThrAsn: 1.4 ± 0.031
3.622ThrPro: 3.622 ± 0.047
1.962ThrGln: 1.962 ± 0.031
3.479ThrArg: 3.479 ± 0.049
2.949ThrSer: 2.949 ± 0.053
2.908ThrThr: 2.908 ± 0.052
4.463ThrVal: 4.463 ± 0.057
0.701ThrTrp: 0.701 ± 0.022
1.294ThrTyr: 1.294 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
9.666ValAla: 9.666 ± 0.094
0.845ValCys: 0.845 ± 0.022
4.109ValAsp: 4.109 ± 0.051
3.943ValGlu: 3.943 ± 0.044
2.897ValPhe: 2.897 ± 0.044
5.908ValGly: 5.908 ± 0.064
1.596ValHis: 1.596 ± 0.032
3.789ValIle: 3.789 ± 0.048
2.564ValLys: 2.564 ± 0.045
7.898ValLeu: 7.898 ± 0.078
1.996ValMet: 1.996 ± 0.04
2.26ValAsn: 2.26 ± 0.045
4.218ValPro: 4.218 ± 0.048
2.438ValGln: 2.438 ± 0.037
5.167ValArg: 5.167 ± 0.056
4.643ValSer: 4.643 ± 0.046
4.788ValThr: 4.788 ± 0.057
6.449ValVal: 6.449 ± 0.074
0.986ValTrp: 0.986 ± 0.025
1.646ValTyr: 1.646 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.204TrpAla: 1.204 ± 0.024
0.146TrpCys: 0.146 ± 0.01
0.593TrpAsp: 0.593 ± 0.018
0.556TrpGlu: 0.556 ± 0.02
0.539TrpPhe: 0.539 ± 0.018
0.96TrpGly: 0.96 ± 0.028
0.437TrpHis: 0.437 ± 0.016
0.684TrpIle: 0.684 ± 0.02
0.394TrpLys: 0.394 ± 0.016
1.946TrpLeu: 1.946 ± 0.04
0.37TrpMet: 0.37 ± 0.015
0.393TrpAsn: 0.393 ± 0.015
0.652TrpPro: 0.652 ± 0.019
0.71TrpGln: 0.71 ± 0.019
1.254TrpArg: 1.254 ± 0.03
0.688TrpSer: 0.688 ± 0.022
0.648TrpThr: 0.648 ± 0.019
0.948TrpVal: 0.948 ± 0.026
0.223TrpTrp: 0.223 ± 0.012
0.328TrpTyr: 0.328 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.772TyrAla: 2.772 ± 0.041
0.266TyrCys: 0.266 ± 0.012
1.332TyrAsp: 1.332 ± 0.031
1.162TyrGlu: 1.162 ± 0.031
1.095TyrPhe: 1.095 ± 0.026
2.204TyrGly: 2.204 ± 0.036
0.549TyrHis: 0.549 ± 0.016
0.896TyrIle: 0.896 ± 0.026
0.668TyrLys: 0.668 ± 0.022
2.62TyrLeu: 2.62 ± 0.04
0.453TyrMet: 0.453 ± 0.016
0.608TyrAsn: 0.608 ± 0.019
1.242TyrPro: 1.242 ± 0.029
0.875TyrGln: 0.875 ± 0.023
1.938TyrArg: 1.938 ± 0.032
1.266TyrSer: 1.266 ± 0.027
1.268TyrThr: 1.268 ± 0.029
1.856TyrVal: 1.856 ± 0.038
0.384TyrTrp: 0.384 ± 0.015
0.701TyrTyr: 0.701 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5574 proteins (1800002 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski