Amino acid dipepetide frequency for Paenibacillus antarcticus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.56AlaAla: 5.56 ± 0.074
0.62AlaCys: 0.62 ± 0.022
3.372AlaAsp: 3.372 ± 0.052
4.365AlaGlu: 4.365 ± 0.059
2.908AlaPhe: 2.908 ± 0.046
5.059AlaGly: 5.059 ± 0.071
1.244AlaHis: 1.244 ± 0.033
5.62AlaIle: 5.62 ± 0.066
3.968AlaLys: 3.968 ± 0.057
7.069AlaLeu: 7.069 ± 0.064
2.009AlaMet: 2.009 ± 0.04
2.823AlaAsn: 2.823 ± 0.051
2.128AlaPro: 2.128 ± 0.041
2.448AlaGln: 2.448 ± 0.039
2.695AlaArg: 2.695 ± 0.049
4.591AlaSer: 4.591 ± 0.067
3.938AlaThr: 3.938 ± 0.068
5.212AlaVal: 5.212 ± 0.061
0.735AlaTrp: 0.735 ± 0.022
2.346AlaTyr: 2.346 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.449CysAla: 0.449 ± 0.018
0.089CysCys: 0.089 ± 0.007
0.412CysAsp: 0.412 ± 0.018
0.428CysGlu: 0.428 ± 0.019
0.296CysPhe: 0.296 ± 0.015
0.716CysGly: 0.716 ± 0.023
0.193CysHis: 0.193 ± 0.011
0.563CysIle: 0.563 ± 0.019
0.398CysLys: 0.398 ± 0.016
0.703CysLeu: 0.703 ± 0.025
0.22CysMet: 0.22 ± 0.012
0.298CysAsn: 0.298 ± 0.015
0.319CysPro: 0.319 ± 0.016
0.233CysGln: 0.233 ± 0.015
0.355CysArg: 0.355 ± 0.015
0.567CysSer: 0.567 ± 0.018
0.406CysThr: 0.406 ± 0.014
0.489CysVal: 0.489 ± 0.02
0.08CysTrp: 0.08 ± 0.007
0.283CysTyr: 0.283 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.379AspAla: 3.379 ± 0.053
0.374AspCys: 0.374 ± 0.018
2.587AspAsp: 2.587 ± 0.048
3.818AspGlu: 3.818 ± 0.051
2.133AspPhe: 2.133 ± 0.042
3.775AspGly: 3.775 ± 0.067
1.258AspHis: 1.258 ± 0.035
4.363AspIle: 4.363 ± 0.053
3.079AspLys: 3.079 ± 0.048
5.032AspLeu: 5.032 ± 0.061
1.525AspMet: 1.525 ± 0.032
2.229AspAsn: 2.229 ± 0.043
2.176AspPro: 2.176 ± 0.043
2.101AspGln: 2.101 ± 0.037
2.4AspArg: 2.4 ± 0.042
2.997AspSer: 2.997 ± 0.047
2.864AspThr: 2.864 ± 0.048
3.848AspVal: 3.848 ± 0.052
0.721AspTrp: 0.721 ± 0.028
2.158AspTyr: 2.158 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
4.893GluAla: 4.893 ± 0.067
0.365GluCys: 0.365 ± 0.015
3.441GluAsp: 3.441 ± 0.049
5.215GluGlu: 5.215 ± 0.076
2.317GluPhe: 2.317 ± 0.043
4.363GluGly: 4.363 ± 0.058
1.568GluHis: 1.568 ± 0.034
5.016GluIle: 5.016 ± 0.065
3.95GluLys: 3.95 ± 0.062
6.967GluLeu: 6.967 ± 0.075
2.175GluMet: 2.175 ± 0.037
2.709GluAsn: 2.709 ± 0.042
1.916GluPro: 1.916 ± 0.04
3.45GluGln: 3.45 ± 0.052
3.368GluArg: 3.368 ± 0.058
3.819GluSer: 3.819 ± 0.057
3.064GluThr: 3.064 ± 0.05
4.862GluVal: 4.862 ± 0.061
0.868GluTrp: 0.868 ± 0.025
2.178GluTyr: 2.178 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
2.848PheAla: 2.848 ± 0.041
0.359PheCys: 0.359 ± 0.017
2.352PheAsp: 2.352 ± 0.051
2.468PheGlu: 2.468 ± 0.041
1.818PhePhe: 1.818 ± 0.044
3.039PheGly: 3.039 ± 0.046
0.894PheHis: 0.894 ± 0.023
3.446PheIle: 3.446 ± 0.069
2.241PheLys: 2.241 ± 0.039
3.759PheLeu: 3.759 ± 0.058
1.25PheMet: 1.25 ± 0.028
1.923PheAsn: 1.923 ± 0.039
1.446PhePro: 1.446 ± 0.028
1.506PheGln: 1.506 ± 0.034
1.724PheArg: 1.724 ± 0.038
2.963PheSer: 2.963 ± 0.048
2.593PheThr: 2.593 ± 0.045
2.906PheVal: 2.906 ± 0.046
0.48PheTrp: 0.48 ± 0.016
1.442PheTyr: 1.442 ± 0.037
0.0PheXaa: 0.0 ± 0.0
Gly
4.744GlyAla: 4.744 ± 0.065
0.662GlyCys: 0.662 ± 0.02
3.358GlyAsp: 3.358 ± 0.057
4.197GlyGlu: 4.197 ± 0.063
3.167GlyPhe: 3.167 ± 0.053
4.934GlyGly: 4.934 ± 0.077
1.399GlyHis: 1.399 ± 0.035
6.054GlyIle: 6.054 ± 0.069
4.426GlyLys: 4.426 ± 0.059
6.792GlyLeu: 6.792 ± 0.072
2.319GlyMet: 2.319 ± 0.042
2.968GlyAsn: 2.968 ± 0.051
1.635GlyPro: 1.635 ± 0.036
2.47GlyGln: 2.47 ± 0.041
2.9GlyArg: 2.9 ± 0.048
4.766GlySer: 4.766 ± 0.064
4.298GlyThr: 4.298 ± 0.061
5.123GlyVal: 5.123 ± 0.067
0.891GlyTrp: 0.891 ± 0.024
2.947GlyTyr: 2.947 ± 0.048
0.001GlyXaa: 0.001 ± 0.001
His
1.342HisAla: 1.342 ± 0.031
0.209HisCys: 0.209 ± 0.011
1.045HisAsp: 1.045 ± 0.031
1.273HisGlu: 1.273 ± 0.031
0.984HisPhe: 0.984 ± 0.026
1.422HisGly: 1.422 ± 0.034
0.636HisHis: 0.636 ± 0.023
1.65HisIle: 1.65 ± 0.043
1.035HisLys: 1.035 ± 0.03
2.1HisLeu: 2.1 ± 0.05
0.632HisMet: 0.632 ± 0.021
0.857HisAsn: 0.857 ± 0.022
1.086HisPro: 1.086 ± 0.03
0.757HisGln: 0.757 ± 0.022
0.964HisArg: 0.964 ± 0.027
1.295HisSer: 1.295 ± 0.029
1.159HisThr: 1.159 ± 0.028
1.443HisVal: 1.443 ± 0.033
0.299HisTrp: 0.299 ± 0.016
0.907HisTyr: 0.907 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.948IleAla: 5.948 ± 0.069
0.724IleCys: 0.724 ± 0.022
4.239IleAsp: 4.239 ± 0.048
4.933IleGlu: 4.933 ± 0.066
2.919IlePhe: 2.919 ± 0.057
5.72IleGly: 5.72 ± 0.075
1.792IleHis: 1.792 ± 0.04
5.909IleIle: 5.909 ± 0.083
3.801IleLys: 3.801 ± 0.06
7.065IleLeu: 7.065 ± 0.078
2.071IleMet: 2.071 ± 0.037
3.228IleAsn: 3.228 ± 0.048
3.474IlePro: 3.474 ± 0.048
3.12IleGln: 3.12 ± 0.051
3.632IleArg: 3.632 ± 0.054
5.848IleSer: 5.848 ± 0.066
4.842IleThr: 4.842 ± 0.06
5.861IleVal: 5.861 ± 0.07
0.735IleTrp: 0.735 ± 0.025
2.481IleTyr: 2.481 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
3.891LysAla: 3.891 ± 0.057
0.306LysCys: 0.306 ± 0.016
3.48LysAsp: 3.48 ± 0.057
4.933LysGlu: 4.933 ± 0.058
1.786LysPhe: 1.786 ± 0.031
3.937LysGly: 3.937 ± 0.054
1.114LysHis: 1.114 ± 0.027
3.742LysIle: 3.742 ± 0.054
3.869LysLys: 3.869 ± 0.059
5.552LysLeu: 5.552 ± 0.064
1.861LysMet: 1.861 ± 0.036
2.502LysAsn: 2.502 ± 0.047
2.084LysPro: 2.084 ± 0.038
2.512LysGln: 2.512 ± 0.04
2.67LysArg: 2.67 ± 0.043
3.547LysSer: 3.547 ± 0.055
2.884LysThr: 2.884 ± 0.051
4.282LysVal: 4.282 ± 0.059
0.748LysTrp: 0.748 ± 0.022
2.082LysTyr: 2.082 ± 0.04
0.0LysXaa: 0.0 ± 0.0
Leu
6.777LeuAla: 6.777 ± 0.081
0.809LeuCys: 0.809 ± 0.023
5.32LeuAsp: 5.32 ± 0.069
6.109LeuGlu: 6.109 ± 0.08
4.476LeuPhe: 4.476 ± 0.061
6.529LeuGly: 6.529 ± 0.069
2.067LeuHis: 2.067 ± 0.043
7.452LeuIle: 7.452 ± 0.085
5.673LeuLys: 5.673 ± 0.063
10.211LeuLeu: 10.211 ± 0.108
2.784LeuMet: 2.784 ± 0.043
4.514LeuAsn: 4.514 ± 0.057
4.01LeuPro: 4.01 ± 0.06
3.979LeuGln: 3.979 ± 0.057
4.233LeuArg: 4.233 ± 0.051
7.579LeuSer: 7.579 ± 0.087
5.646LeuThr: 5.646 ± 0.065
6.237LeuVal: 6.237 ± 0.074
0.922LeuTrp: 0.922 ± 0.026
3.14LeuTyr: 3.14 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
2.061MetAla: 2.061 ± 0.041
0.17MetCys: 0.17 ± 0.01
1.706MetAsp: 1.706 ± 0.038
1.943MetGlu: 1.943 ± 0.039
1.116MetPhe: 1.116 ± 0.029
2.002MetGly: 2.002 ± 0.039
0.451MetHis: 0.451 ± 0.017
2.407MetIle: 2.407 ± 0.043
2.325MetLys: 2.325 ± 0.037
2.898MetLeu: 2.898 ± 0.043
1.052MetMet: 1.052 ± 0.032
1.828MetAsn: 1.828 ± 0.036
1.079MetPro: 1.079 ± 0.028
0.989MetGln: 0.989 ± 0.03
1.218MetArg: 1.218 ± 0.028
2.008MetSer: 2.008 ± 0.037
1.772MetThr: 1.772 ± 0.031
1.975MetVal: 1.975 ± 0.039
0.243MetTrp: 0.243 ± 0.013
0.819MetTyr: 0.819 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
2.921AsnAla: 2.921 ± 0.049
0.249AsnCys: 0.249 ± 0.014
2.316AsnAsp: 2.316 ± 0.042
3.019AsnGlu: 3.019 ± 0.05
1.623AsnPhe: 1.623 ± 0.037
3.459AsnGly: 3.459 ± 0.068
1.02AsnHis: 1.02 ± 0.025
3.496AsnIle: 3.496 ± 0.046
2.827AsnLys: 2.827 ± 0.048
3.861AsnLeu: 3.861 ± 0.054
1.317AsnMet: 1.317 ± 0.031
2.392AsnAsn: 2.392 ± 0.061
2.118AsnPro: 2.118 ± 0.036
1.787AsnGln: 1.787 ± 0.034
2.037AsnArg: 2.037 ± 0.041
2.716AsnSer: 2.716 ± 0.05
2.521AsnThr: 2.521 ± 0.045
3.159AsnVal: 3.159 ± 0.052
0.569AsnTrp: 0.569 ± 0.02
1.574AsnTyr: 1.574 ± 0.037
0.001AsnXaa: 0.001 ± 0.001
Pro
2.267ProAla: 2.267 ± 0.046
0.231ProCys: 0.231 ± 0.013
2.15ProAsp: 2.15 ± 0.036
2.895ProGlu: 2.895 ± 0.051
1.789ProPhe: 1.789 ± 0.036
2.277ProGly: 2.277 ± 0.046
0.803ProHis: 0.803 ± 0.024
2.866ProIle: 2.866 ± 0.05
1.818ProLys: 1.818 ± 0.035
3.649ProLeu: 3.649 ± 0.052
1.018ProMet: 1.018 ± 0.026
1.743ProAsn: 1.743 ± 0.039
0.931ProPro: 0.931 ± 0.029
1.333ProGln: 1.333 ± 0.031
1.212ProArg: 1.212 ± 0.028
2.506ProSer: 2.506 ± 0.044
2.204ProThr: 2.204 ± 0.043
2.81ProVal: 2.81 ± 0.044
0.461ProTrp: 0.461 ± 0.017
1.381ProTyr: 1.381 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
2.796GlnAla: 2.796 ± 0.044
0.223GlnCys: 0.223 ± 0.012
1.855GlnAsp: 1.855 ± 0.031
2.772GlnGlu: 2.772 ± 0.044
1.605GlnPhe: 1.605 ± 0.038
2.656GlnGly: 2.656 ± 0.043
0.853GlnHis: 0.853 ± 0.024
2.776GlnIle: 2.776 ± 0.043
2.074GlnLys: 2.074 ± 0.036
4.244GlnLeu: 4.244 ± 0.062
1.289GlnMet: 1.289 ± 0.03
1.45GlnAsn: 1.45 ± 0.037
1.33GlnPro: 1.33 ± 0.032
1.96GlnGln: 1.96 ± 0.041
1.765GlnArg: 1.765 ± 0.04
2.602GlnSer: 2.602 ± 0.046
1.874GlnThr: 1.874 ± 0.035
2.663GlnVal: 2.663 ± 0.047
0.529GlnTrp: 0.529 ± 0.02
1.492GlnTyr: 1.492 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.449ArgAla: 2.449 ± 0.039
0.298ArgCys: 0.298 ± 0.014
2.251ArgAsp: 2.251 ± 0.037
3.072ArgGlu: 3.072 ± 0.047
1.87ArgPhe: 1.87 ± 0.04
2.581ArgGly: 2.581 ± 0.048
0.913ArgHis: 0.913 ± 0.028
3.595ArgIle: 3.595 ± 0.054
2.762ArgLys: 2.762 ± 0.043
4.353ArgLeu: 4.353 ± 0.06
1.519ArgMet: 1.519 ± 0.033
2.168ArgAsn: 2.168 ± 0.034
1.384ArgPro: 1.384 ± 0.031
1.646ArgGln: 1.646 ± 0.031
2.155ArgArg: 2.155 ± 0.039
2.801ArgSer: 2.801 ± 0.045
2.444ArgThr: 2.444 ± 0.04
2.857ArgVal: 2.857 ± 0.043
0.561ArgTrp: 0.561 ± 0.02
1.747ArgTyr: 1.747 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
4.235SerAla: 4.235 ± 0.058
0.424SerCys: 0.424 ± 0.019
3.452SerAsp: 3.452 ± 0.053
4.194SerGlu: 4.194 ± 0.059
3.13SerPhe: 3.13 ± 0.049
5.126SerGly: 5.126 ± 0.069
1.376SerHis: 1.376 ± 0.031
5.513SerIle: 5.513 ± 0.074
3.862SerLys: 3.862 ± 0.053
6.812SerLeu: 6.812 ± 0.071
2.015SerMet: 2.015 ± 0.036
3.148SerAsn: 3.148 ± 0.055
2.344SerPro: 2.344 ± 0.038
2.389SerGln: 2.389 ± 0.04
2.784SerArg: 2.784 ± 0.044
5.047SerSer: 5.047 ± 0.076
3.802SerThr: 3.802 ± 0.047
4.815SerVal: 4.815 ± 0.072
0.813SerTrp: 0.813 ± 0.023
2.506SerTyr: 2.506 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
4.091ThrAla: 4.091 ± 0.07
0.369ThrCys: 0.369 ± 0.017
2.923ThrAsp: 2.923 ± 0.046
3.352ThrGlu: 3.352 ± 0.051
2.51ThrPhe: 2.51 ± 0.045
4.315ThrGly: 4.315 ± 0.065
1.087ThrHis: 1.087 ± 0.028
4.557ThrIle: 4.557 ± 0.059
2.941ThrLys: 2.941 ± 0.043
5.817ThrLeu: 5.817 ± 0.071
1.525ThrMet: 1.525 ± 0.032
2.536ThrAsn: 2.536 ± 0.045
2.468ThrPro: 2.468 ± 0.048
1.854ThrGln: 1.854 ± 0.04
2.143ThrArg: 2.143 ± 0.04
3.884ThrSer: 3.884 ± 0.053
3.527ThrThr: 3.527 ± 0.059
4.471ThrVal: 4.471 ± 0.073
0.666ThrTrp: 0.666 ± 0.021
2.06ThrTyr: 2.06 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
4.983ValAla: 4.983 ± 0.069
0.619ValCys: 0.619 ± 0.021
3.86ValAsp: 3.86 ± 0.052
4.499ValGlu: 4.499 ± 0.06
2.869ValPhe: 2.869 ± 0.048
4.762ValGly: 4.762 ± 0.066
1.424ValHis: 1.424 ± 0.033
5.848ValIle: 5.848 ± 0.063
4.083ValLys: 4.083 ± 0.058
6.897ValLeu: 6.897 ± 0.069
2.057ValMet: 2.057 ± 0.04
3.297ValAsn: 3.297 ± 0.052
2.665ValPro: 2.665 ± 0.047
2.563ValGln: 2.563 ± 0.042
2.872ValArg: 2.872 ± 0.045
5.122ValSer: 5.122 ± 0.069
4.628ValThr: 4.628 ± 0.08
5.313ValVal: 5.313 ± 0.066
0.793ValTrp: 0.793 ± 0.022
2.285ValTyr: 2.285 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.705TrpAla: 0.705 ± 0.021
0.082TrpCys: 0.082 ± 0.007
0.684TrpAsp: 0.684 ± 0.023
0.683TrpGlu: 0.683 ± 0.023
0.53TrpPhe: 0.53 ± 0.019
0.774TrpGly: 0.774 ± 0.022
0.231TrpHis: 0.231 ± 0.012
1.01TrpIle: 1.01 ± 0.026
0.702TrpLys: 0.702 ± 0.021
1.228TrpLeu: 1.228 ± 0.033
0.442TrpMet: 0.442 ± 0.019
0.725TrpAsn: 0.725 ± 0.022
0.294TrpPro: 0.294 ± 0.015
0.406TrpGln: 0.406 ± 0.018
0.496TrpArg: 0.496 ± 0.019
0.784TrpSer: 0.784 ± 0.027
0.612TrpThr: 0.612 ± 0.024
0.75TrpVal: 0.75 ± 0.024
0.178TrpTrp: 0.178 ± 0.01
0.39TrpTyr: 0.39 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.369TyrAla: 2.369 ± 0.045
0.312TyrCys: 0.312 ± 0.015
2.014TyrAsp: 2.014 ± 0.04
2.333TyrGlu: 2.333 ± 0.043
1.613TyrPhe: 1.613 ± 0.036
2.605TyrGly: 2.605 ± 0.045
0.772TyrHis: 0.772 ± 0.025
2.461TyrIle: 2.461 ± 0.035
1.95TyrLys: 1.95 ± 0.04
3.458TyrLeu: 3.458 ± 0.055
0.981TyrMet: 0.981 ± 0.028
1.641TyrAsn: 1.641 ± 0.037
1.458TyrPro: 1.458 ± 0.033
1.305TyrGln: 1.305 ± 0.032
1.802TyrArg: 1.802 ± 0.038
2.347TyrSer: 2.347 ± 0.044
2.02TyrThr: 2.02 ± 0.034
2.364TyrVal: 2.364 ± 0.047
0.413TyrTrp: 0.413 ± 0.016
1.421TyrTyr: 1.421 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.001
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4663 proteins (1491509 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski