Amino acid dipepetide frequency for Cyclobacterium marinum (strain ATCC 25205 / DSM 745 / LMG 13164 / NCIMB 1802) (Flectobacillus marinus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.641AlaAla: 4.641 ± 0.062
0.583AlaCys: 0.583 ± 0.018
3.36AlaAsp: 3.36 ± 0.049
4.103AlaGlu: 4.103 ± 0.053
3.414AlaPhe: 3.414 ± 0.046
4.633AlaGly: 4.633 ± 0.058
1.128AlaHis: 1.128 ± 0.027
5.259AlaIle: 5.259 ± 0.058
4.314AlaLys: 4.314 ± 0.056
6.47AlaLeu: 6.47 ± 0.076
1.685AlaMet: 1.685 ± 0.033
3.239AlaAsn: 3.239 ± 0.05
2.227AlaPro: 2.227 ± 0.042
2.264AlaGln: 2.264 ± 0.035
2.12AlaArg: 2.12 ± 0.037
4.364AlaSer: 4.364 ± 0.045
3.339AlaThr: 3.339 ± 0.054
4.002AlaVal: 4.002 ± 0.052
0.767AlaTrp: 0.767 ± 0.02
2.775AlaTyr: 2.775 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.388CysAla: 0.388 ± 0.015
0.103CysCys: 0.103 ± 0.008
0.357CysAsp: 0.357 ± 0.016
0.436CysGlu: 0.436 ± 0.018
0.395CysPhe: 0.395 ± 0.016
0.537CysGly: 0.537 ± 0.021
0.208CysHis: 0.208 ± 0.014
0.483CysIle: 0.483 ± 0.018
0.439CysLys: 0.439 ± 0.017
0.694CysLeu: 0.694 ± 0.021
0.171CysMet: 0.171 ± 0.01
0.337CysAsn: 0.337 ± 0.015
0.323CysPro: 0.323 ± 0.014
0.272CysGln: 0.272 ± 0.011
0.263CysArg: 0.263 ± 0.012
0.478CysSer: 0.478 ± 0.018
0.361CysThr: 0.361 ± 0.014
0.355CysVal: 0.355 ± 0.013
0.088CysTrp: 0.088 ± 0.007
0.26CysTyr: 0.26 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.24AspAla: 3.24 ± 0.048
0.375AspCys: 0.375 ± 0.013
2.461AspAsp: 2.461 ± 0.044
3.705AspGlu: 3.705 ± 0.05
3.489AspPhe: 3.489 ± 0.043
3.956AspGly: 3.956 ± 0.068
1.153AspHis: 1.153 ± 0.027
3.915AspIle: 3.915 ± 0.054
3.92AspLys: 3.92 ± 0.054
5.773AspLeu: 5.773 ± 0.054
1.246AspMet: 1.246 ± 0.028
2.825AspAsn: 2.825 ± 0.041
2.493AspPro: 2.493 ± 0.041
2.255AspGln: 2.255 ± 0.034
2.175AspArg: 2.175 ± 0.036
2.916AspSer: 2.916 ± 0.048
2.143AspThr: 2.143 ± 0.034
2.927AspVal: 2.927 ± 0.046
0.969AspTrp: 0.969 ± 0.027
2.493AspTyr: 2.493 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
4.779GluAla: 4.779 ± 0.063
0.306GluCys: 0.306 ± 0.014
3.772GluAsp: 3.772 ± 0.048
5.619GluGlu: 5.619 ± 0.066
2.696GluPhe: 2.696 ± 0.042
4.794GluGly: 4.794 ± 0.057
0.988GluHis: 0.988 ± 0.03
5.519GluIle: 5.519 ± 0.058
6.2GluLys: 6.2 ± 0.077
6.268GluLeu: 6.268 ± 0.065
1.911GluMet: 1.911 ± 0.038
4.701GluAsn: 4.701 ± 0.055
1.79GluPro: 1.79 ± 0.04
2.223GluGln: 2.223 ± 0.037
2.647GluArg: 2.647 ± 0.041
3.625GluSer: 3.625 ± 0.045
3.27GluThr: 3.27 ± 0.048
4.576GluVal: 4.576 ± 0.058
0.808GluTrp: 0.808 ± 0.023
2.2GluTyr: 2.2 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
2.847PheAla: 2.847 ± 0.044
0.398PheCys: 0.398 ± 0.014
3.185PheAsp: 3.185 ± 0.044
3.351PheGlu: 3.351 ± 0.044
2.738PhePhe: 2.738 ± 0.043
3.716PheGly: 3.716 ± 0.047
0.938PheHis: 0.938 ± 0.026
3.678PheIle: 3.678 ± 0.051
3.14PheLys: 3.14 ± 0.045
4.97PheLeu: 4.97 ± 0.068
1.157PheMet: 1.157 ± 0.021
2.932PheAsn: 2.932 ± 0.043
2.007PhePro: 2.007 ± 0.033
1.745PheGln: 1.745 ± 0.03
1.896PheArg: 1.896 ± 0.037
4.131PheSer: 4.131 ± 0.05
2.882PheThr: 2.882 ± 0.047
2.84PheVal: 2.84 ± 0.043
0.686PheTrp: 0.686 ± 0.023
2.139PheTyr: 2.139 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
4.42GlyAla: 4.42 ± 0.06
0.518GlyCys: 0.518 ± 0.02
3.654GlyAsp: 3.654 ± 0.054
4.561GlyGlu: 4.561 ± 0.057
3.833GlyPhe: 3.833 ± 0.056
4.895GlyGly: 4.895 ± 0.076
1.304GlyHis: 1.304 ± 0.031
5.549GlyIle: 5.549 ± 0.067
5.44GlyLys: 5.44 ± 0.058
6.895GlyLeu: 6.895 ± 0.077
1.962GlyMet: 1.962 ± 0.034
3.862GlyAsn: 3.862 ± 0.054
1.872GlyPro: 1.872 ± 0.036
2.359GlyGln: 2.359 ± 0.041
2.645GlyArg: 2.645 ± 0.039
4.248GlySer: 4.248 ± 0.06
3.76GlyThr: 3.76 ± 0.05
4.541GlyVal: 4.541 ± 0.059
0.998GlyTrp: 0.998 ± 0.021
3.048GlyTyr: 3.048 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
1.12HisAla: 1.12 ± 0.028
0.187HisCys: 0.187 ± 0.011
0.819HisAsp: 0.819 ± 0.023
1.037HisGlu: 1.037 ± 0.028
1.136HisPhe: 1.136 ± 0.026
1.266HisGly: 1.266 ± 0.029
0.522HisHis: 0.522 ± 0.022
1.225HisIle: 1.225 ± 0.031
1.049HisLys: 1.049 ± 0.026
2.004HisLeu: 2.004 ± 0.033
0.399HisMet: 0.399 ± 0.016
0.856HisAsn: 0.856 ± 0.022
1.058HisPro: 1.058 ± 0.025
0.841HisGln: 0.841 ± 0.022
0.724HisArg: 0.724 ± 0.023
1.156HisSer: 1.156 ± 0.029
0.925HisThr: 0.925 ± 0.026
0.986HisVal: 0.986 ± 0.026
0.32HisTrp: 0.32 ± 0.015
0.831HisTyr: 0.831 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
5.14IleAla: 5.14 ± 0.059
0.592IleCys: 0.592 ± 0.02
4.339IleAsp: 4.339 ± 0.052
4.82IleGlu: 4.82 ± 0.066
3.352IlePhe: 3.352 ± 0.049
5.278IleGly: 5.278 ± 0.07
1.448IleHis: 1.448 ± 0.029
5.157IleIle: 5.157 ± 0.065
5.116IleLys: 5.116 ± 0.063
6.764IleLeu: 6.764 ± 0.067
1.416IleMet: 1.416 ± 0.029
4.453IleAsn: 4.453 ± 0.058
3.464IlePro: 3.464 ± 0.044
2.637IleGln: 2.637 ± 0.037
2.926IleArg: 2.926 ± 0.043
5.619IleSer: 5.619 ± 0.06
3.957IleThr: 3.957 ± 0.049
4.094IleVal: 4.094 ± 0.055
0.806IleTrp: 0.806 ± 0.022
2.594IleTyr: 2.594 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
4.921LysAla: 4.921 ± 0.062
0.333LysCys: 0.333 ± 0.017
4.07LysAsp: 4.07 ± 0.054
5.936LysGlu: 5.936 ± 0.063
2.623LysPhe: 2.623 ± 0.04
4.787LysGly: 4.787 ± 0.049
1.245LysHis: 1.245 ± 0.029
5.491LysIle: 5.491 ± 0.063
5.686LysLys: 5.686 ± 0.078
6.133LysLeu: 6.133 ± 0.064
1.918LysMet: 1.918 ± 0.041
4.354LysAsn: 4.354 ± 0.056
2.64LysPro: 2.64 ± 0.041
2.18LysGln: 2.18 ± 0.042
2.721LysArg: 2.721 ± 0.042
4.305LysSer: 4.305 ± 0.052
3.641LysThr: 3.641 ± 0.048
4.693LysVal: 4.693 ± 0.055
0.901LysTrp: 0.901 ± 0.026
2.594LysTyr: 2.594 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
6.565LeuAla: 6.565 ± 0.072
0.665LeuCys: 0.665 ± 0.021
5.361LeuAsp: 5.361 ± 0.062
6.613LeuGlu: 6.613 ± 0.071
4.921LeuPhe: 4.921 ± 0.061
6.531LeuGly: 6.531 ± 0.078
1.595LeuHis: 1.595 ± 0.03
7.131LeuIle: 7.131 ± 0.075
7.169LeuLys: 7.169 ± 0.079
9.632LeuLeu: 9.632 ± 0.103
2.435LeuMet: 2.435 ± 0.042
5.409LeuAsn: 5.409 ± 0.067
4.339LeuPro: 4.339 ± 0.05
3.134LeuGln: 3.134 ± 0.042
3.446LeuArg: 3.446 ± 0.045
7.214LeuSer: 7.214 ± 0.069
5.063LeuThr: 5.063 ± 0.051
5.819LeuVal: 5.819 ± 0.069
1.021LeuTrp: 1.021 ± 0.024
3.252LeuTyr: 3.252 ± 0.04
0.0LeuXaa: 0.0 ± 0.0
Met
2.046MetAla: 2.046 ± 0.035
0.126MetCys: 0.126 ± 0.009
1.593MetAsp: 1.593 ± 0.03
1.883MetGlu: 1.883 ± 0.039
0.813MetPhe: 0.813 ± 0.021
1.767MetGly: 1.767 ± 0.033
0.455MetHis: 0.455 ± 0.015
1.551MetIle: 1.551 ± 0.031
2.057MetLys: 2.057 ± 0.037
2.039MetLeu: 2.039 ± 0.041
0.648MetMet: 0.648 ± 0.025
1.334MetAsn: 1.334 ± 0.026
1.011MetPro: 1.011 ± 0.025
0.779MetGln: 0.779 ± 0.023
0.894MetArg: 0.894 ± 0.023
1.35MetSer: 1.35 ± 0.027
1.147MetThr: 1.147 ± 0.025
1.831MetVal: 1.831 ± 0.038
0.196MetTrp: 0.196 ± 0.01
0.654MetTyr: 0.654 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.377AsnAla: 3.377 ± 0.042
0.36AsnCys: 0.36 ± 0.015
2.785AsnAsp: 2.785 ± 0.041
3.599AsnGlu: 3.599 ± 0.048
2.954AsnPhe: 2.954 ± 0.049
3.996AsnGly: 3.996 ± 0.061
1.104AsnHis: 1.104 ± 0.026
4.14AsnIle: 4.14 ± 0.058
3.824AsnLys: 3.824 ± 0.052
5.546AsnLeu: 5.546 ± 0.063
1.226AsnMet: 1.226 ± 0.026
3.241AsnAsn: 3.241 ± 0.058
2.934AsnPro: 2.934 ± 0.039
2.43AsnGln: 2.43 ± 0.041
2.371AsnArg: 2.371 ± 0.039
3.544AsnSer: 3.544 ± 0.055
2.895AsnThr: 2.895 ± 0.045
2.961AsnVal: 2.961 ± 0.04
0.889AsnTrp: 0.889 ± 0.028
2.52AsnTyr: 2.52 ± 0.043
0.0AsnXaa: 0.0 ± 0.0
Pro
2.256ProAla: 2.256 ± 0.036
0.212ProCys: 0.212 ± 0.012
2.598ProAsp: 2.598 ± 0.045
3.714ProGlu: 3.714 ± 0.053
2.109ProPhe: 2.109 ± 0.034
2.733ProGly: 2.733 ± 0.039
0.715ProHis: 0.715 ± 0.022
2.849ProIle: 2.849 ± 0.042
2.494ProLys: 2.494 ± 0.04
3.622ProLeu: 3.622 ± 0.05
0.893ProMet: 0.893 ± 0.025
2.282ProAsn: 2.282 ± 0.041
1.091ProPro: 1.091 ± 0.026
1.224ProGln: 1.224 ± 0.026
1.168ProArg: 1.168 ± 0.023
2.65ProSer: 2.65 ± 0.04
1.961ProThr: 1.961 ± 0.037
2.724ProVal: 2.724 ± 0.041
0.511ProTrp: 0.511 ± 0.018
1.646ProTyr: 1.646 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
2.357GlnAla: 2.357 ± 0.037
0.176GlnCys: 0.176 ± 0.011
1.644GlnAsp: 1.644 ± 0.032
2.554GlnGlu: 2.554 ± 0.039
1.673GlnPhe: 1.673 ± 0.031
2.174GlnGly: 2.174 ± 0.031
0.574GlnHis: 0.574 ± 0.019
2.444GlnIle: 2.444 ± 0.043
2.581GlnLys: 2.581 ± 0.038
3.684GlnLeu: 3.684 ± 0.049
0.935GlnMet: 0.935 ± 0.022
2.061GlnAsn: 2.061 ± 0.043
1.247GlnPro: 1.247 ± 0.025
1.394GlnGln: 1.394 ± 0.037
1.337GlnArg: 1.337 ± 0.028
2.267GlnSer: 2.267 ± 0.033
1.783GlnThr: 1.783 ± 0.03
2.386GlnVal: 2.386 ± 0.035
0.462GlnTrp: 0.462 ± 0.016
1.306GlnTyr: 1.306 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
2.175ArgAla: 2.175 ± 0.044
0.204ArgCys: 0.204 ± 0.011
1.967ArgAsp: 1.967 ± 0.032
2.564ArgGlu: 2.564 ± 0.041
2.063ArgPhe: 2.063 ± 0.033
2.238ArgGly: 2.238 ± 0.042
0.66ArgHis: 0.66 ± 0.02
2.93ArgIle: 2.93 ± 0.043
2.916ArgLys: 2.916 ± 0.044
3.679ArgLeu: 3.679 ± 0.049
1.069ArgMet: 1.069 ± 0.026
2.238ArgAsn: 2.238 ± 0.035
1.385ArgPro: 1.385 ± 0.031
1.301ArgGln: 1.301 ± 0.028
1.547ArgArg: 1.547 ± 0.033
2.295ArgSer: 2.295 ± 0.04
1.835ArgThr: 1.835 ± 0.036
2.361ArgVal: 2.361 ± 0.033
0.571ArgTrp: 0.571 ± 0.021
1.627ArgTyr: 1.627 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
3.745SerAla: 3.745 ± 0.045
0.606SerCys: 0.606 ± 0.021
3.304SerAsp: 3.304 ± 0.042
3.981SerGlu: 3.981 ± 0.047
3.929SerPhe: 3.929 ± 0.046
5.218SerGly: 5.218 ± 0.07
1.19SerHis: 1.19 ± 0.029
5.103SerIle: 5.103 ± 0.06
4.363SerLys: 4.363 ± 0.051
6.826SerLeu: 6.826 ± 0.07
1.481SerMet: 1.481 ± 0.028
3.581SerAsn: 3.581 ± 0.061
2.744SerPro: 2.744 ± 0.038
2.321SerGln: 2.321 ± 0.039
2.539SerArg: 2.539 ± 0.038
4.537SerSer: 4.537 ± 0.056
3.381SerThr: 3.381 ± 0.046
3.6SerVal: 3.6 ± 0.046
0.853SerTrp: 0.853 ± 0.023
2.638SerTyr: 2.638 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
3.478ThrAla: 3.478 ± 0.044
0.355ThrCys: 0.355 ± 0.017
2.906ThrAsp: 2.906 ± 0.042
3.103ThrGlu: 3.103 ± 0.043
2.773ThrPhe: 2.773 ± 0.042
4.13ThrGly: 4.13 ± 0.057
0.957ThrHis: 0.957 ± 0.023
3.976ThrIle: 3.976 ± 0.05
2.961ThrLys: 2.961 ± 0.039
4.972ThrLeu: 4.972 ± 0.053
0.974ThrMet: 0.974 ± 0.027
2.61ThrAsn: 2.61 ± 0.04
2.161ThrPro: 2.161 ± 0.033
1.637ThrGln: 1.637 ± 0.034
1.633ThrArg: 1.633 ± 0.029
3.459ThrSer: 3.459 ± 0.046
2.574ThrThr: 2.574 ± 0.045
3.25ThrVal: 3.25 ± 0.048
0.653ThrTrp: 0.653 ± 0.024
2.187ThrTyr: 2.187 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
4.013ValAla: 4.013 ± 0.057
0.449ValCys: 0.449 ± 0.017
3.316ValAsp: 3.316 ± 0.05
3.764ValGlu: 3.764 ± 0.053
3.467ValPhe: 3.467 ± 0.046
3.983ValGly: 3.983 ± 0.055
1.089ValHis: 1.089 ± 0.023
4.45ValIle: 4.45 ± 0.053
4.145ValLys: 4.145 ± 0.058
6.019ValLeu: 6.019 ± 0.063
1.477ValMet: 1.477 ± 0.027
3.519ValAsn: 3.519 ± 0.052
2.505ValPro: 2.505 ± 0.034
1.802ValGln: 1.802 ± 0.032
2.222ValArg: 2.222 ± 0.043
4.293ValSer: 4.293 ± 0.053
3.122ValThr: 3.122 ± 0.046
3.842ValVal: 3.842 ± 0.055
0.754ValTrp: 0.754 ± 0.022
2.299ValTyr: 2.299 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
0.807TrpAla: 0.807 ± 0.022
0.081TrpCys: 0.081 ± 0.007
0.771TrpAsp: 0.771 ± 0.022
0.999TrpGlu: 0.999 ± 0.024
0.592TrpPhe: 0.592 ± 0.02
0.868TrpGly: 0.868 ± 0.027
0.291TrpHis: 0.291 ± 0.016
0.834TrpIle: 0.834 ± 0.022
0.935TrpLys: 0.935 ± 0.025
1.225TrpLeu: 1.225 ± 0.031
0.402TrpMet: 0.402 ± 0.015
0.698TrpAsn: 0.698 ± 0.022
0.449TrpPro: 0.449 ± 0.017
0.54TrpGln: 0.54 ± 0.018
0.559TrpArg: 0.559 ± 0.017
0.741TrpSer: 0.741 ± 0.02
0.677TrpThr: 0.677 ± 0.023
0.866TrpVal: 0.866 ± 0.023
0.2TrpTrp: 0.2 ± 0.011
0.498TrpTyr: 0.498 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.37TyrAla: 2.37 ± 0.04
0.318TyrCys: 0.318 ± 0.014
2.167TyrAsp: 2.167 ± 0.038
2.35TyrGlu: 2.35 ± 0.036
2.405TyrPhe: 2.405 ± 0.038
2.864TyrGly: 2.864 ± 0.041
0.894TyrHis: 0.894 ± 0.026
2.287TyrIle: 2.287 ± 0.04
2.424TyrLys: 2.424 ± 0.039
4.104TyrLeu: 4.104 ± 0.049
0.769TyrMet: 0.769 ± 0.022
2.162TyrAsn: 2.162 ± 0.045
1.713TyrPro: 1.713 ± 0.032
1.699TyrGln: 1.699 ± 0.034
1.793TyrArg: 1.793 ± 0.032
2.749TyrSer: 2.749 ± 0.036
2.035TyrThr: 2.035 ± 0.036
1.877TyrVal: 1.877 ± 0.032
0.58TyrTrp: 0.58 ± 0.017
1.788TyrTyr: 1.788 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4983 proteins (1776705 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski