Amino acid dipepetide frequency for Paenibacillus methanolicus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.434AlaAla: 12.434 ± 0.144
0.839AlaCys: 0.839 ± 0.019
5.727AlaAsp: 5.727 ± 0.064
6.841AlaGlu: 6.841 ± 0.075
4.075AlaPhe: 4.075 ± 0.045
8.546AlaGly: 8.546 ± 0.077
1.753AlaHis: 1.753 ± 0.03
6.001AlaIle: 6.001 ± 0.057
4.692AlaLys: 4.692 ± 0.049
9.732AlaLeu: 9.732 ± 0.07
2.796AlaMet: 2.796 ± 0.038
3.253AlaAsn: 3.253 ± 0.048
3.555AlaPro: 3.555 ± 0.049
3.199AlaGln: 3.199 ± 0.049
4.952AlaArg: 4.952 ± 0.054
6.594AlaSer: 6.594 ± 0.064
4.398AlaThr: 4.398 ± 0.059
7.365AlaVal: 7.365 ± 0.069
1.273AlaTrp: 1.273 ± 0.023
3.342AlaTyr: 3.342 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.65CysAla: 0.65 ± 0.016
0.101CysCys: 0.101 ± 0.008
0.352CysAsp: 0.352 ± 0.013
0.416CysGlu: 0.416 ± 0.014
0.25CysPhe: 0.25 ± 0.011
0.756CysGly: 0.756 ± 0.02
0.157CysHis: 0.157 ± 0.008
0.36CysIle: 0.36 ± 0.014
0.246CysLys: 0.246 ± 0.011
0.675CysLeu: 0.675 ± 0.018
0.192CysMet: 0.192 ± 0.01
0.199CysAsn: 0.199 ± 0.009
0.314CysPro: 0.314 ± 0.013
0.179CysGln: 0.179 ± 0.008
0.459CysArg: 0.459 ± 0.015
0.469CysSer: 0.469 ± 0.014
0.326CysThr: 0.326 ± 0.01
0.434CysVal: 0.434 ± 0.014
0.093CysTrp: 0.093 ± 0.007
0.238CysTyr: 0.238 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
5.259AspAla: 5.259 ± 0.057
0.332AspCys: 0.332 ± 0.011
2.692AspAsp: 2.692 ± 0.039
3.857AspGlu: 3.857 ± 0.051
2.052AspPhe: 2.052 ± 0.033
4.9AspGly: 4.9 ± 0.061
1.052AspHis: 1.052 ± 0.023
3.104AspIle: 3.104 ± 0.038
2.227AspLys: 2.227 ± 0.039
4.672AspLeu: 4.672 ± 0.046
1.336AspMet: 1.336 ± 0.022
1.762AspAsn: 1.762 ± 0.028
2.664AspPro: 2.664 ± 0.034
1.734AspGln: 1.734 ± 0.027
3.34AspArg: 3.34 ± 0.041
2.486AspSer: 2.486 ± 0.035
2.479AspThr: 2.479 ± 0.036
3.702AspVal: 3.702 ± 0.041
0.988AspTrp: 0.988 ± 0.023
2.179AspTyr: 2.179 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
7.484GluAla: 7.484 ± 0.077
0.319GluCys: 0.319 ± 0.012
2.926GluAsp: 2.926 ± 0.037
5.009GluGlu: 5.009 ± 0.059
1.957GluPhe: 1.957 ± 0.034
4.743GluGly: 4.743 ± 0.045
1.394GluHis: 1.394 ± 0.031
3.537GluIle: 3.537 ± 0.051
2.977GluLys: 2.977 ± 0.047
7.257GluLeu: 7.257 ± 0.084
1.761GluMet: 1.761 ± 0.025
1.976GluAsn: 1.976 ± 0.029
2.576GluPro: 2.576 ± 0.037
3.402GluGln: 3.402 ± 0.041
4.817GluArg: 4.817 ± 0.067
3.412GluSer: 3.412 ± 0.044
3.548GluThr: 3.548 ± 0.041
3.776GluVal: 3.776 ± 0.051
0.948GluTrp: 0.948 ± 0.023
1.692GluTyr: 1.692 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
3.984PheAla: 3.984 ± 0.046
0.302PheCys: 0.302 ± 0.011
2.388PheAsp: 2.388 ± 0.034
2.509PheGlu: 2.509 ± 0.034
1.653PhePhe: 1.653 ± 0.032
3.381PheGly: 3.381 ± 0.041
0.814PheHis: 0.814 ± 0.018
2.361PheIle: 2.361 ± 0.036
1.753PheLys: 1.753 ± 0.031
3.513PheLeu: 3.513 ± 0.05
1.065PheMet: 1.065 ± 0.023
1.453PheAsn: 1.453 ± 0.027
1.532PhePro: 1.532 ± 0.029
1.252PheGln: 1.252 ± 0.022
2.321PheArg: 2.321 ± 0.028
2.343PheSer: 2.343 ± 0.034
2.232PheThr: 2.232 ± 0.034
3.018PheVal: 3.018 ± 0.038
0.556PheTrp: 0.556 ± 0.016
1.413PheTyr: 1.413 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
7.206GlyAla: 7.206 ± 0.083
0.72GlyCys: 0.72 ± 0.02
3.973GlyAsp: 3.973 ± 0.046
5.264GlyGlu: 5.264 ± 0.054
3.265GlyPhe: 3.265 ± 0.041
6.76GlyGly: 6.76 ± 0.079
1.558GlyHis: 1.558 ± 0.03
5.287GlyIle: 5.287 ± 0.055
4.148GlyLys: 4.148 ± 0.045
7.407GlyLeu: 7.407 ± 0.065
2.501GlyMet: 2.501 ± 0.032
2.784GlyAsn: 2.784 ± 0.046
2.138GlyPro: 2.138 ± 0.034
2.866GlyGln: 2.866 ± 0.035
4.361GlyArg: 4.361 ± 0.056
5.039GlySer: 5.039 ± 0.063
4.835GlyThr: 4.835 ± 0.064
5.352GlyVal: 5.352 ± 0.052
1.276GlyTrp: 1.276 ± 0.029
3.032GlyTyr: 3.032 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
2.05HisAla: 2.05 ± 0.03
0.158HisCys: 0.158 ± 0.007
1.114HisAsp: 1.114 ± 0.024
1.269HisGlu: 1.269 ± 0.025
0.923HisPhe: 0.923 ± 0.019
1.659HisGly: 1.659 ± 0.028
0.575HisHis: 0.575 ± 0.02
1.08HisIle: 1.08 ± 0.019
0.655HisLys: 0.655 ± 0.018
1.909HisLeu: 1.909 ± 0.036
0.517HisMet: 0.517 ± 0.014
0.592HisAsn: 0.592 ± 0.017
1.195HisPro: 1.195 ± 0.022
0.667HisGln: 0.667 ± 0.02
1.168HisArg: 1.168 ± 0.024
0.927HisSer: 0.927 ± 0.019
0.927HisThr: 0.927 ± 0.018
1.422HisVal: 1.422 ± 0.027
0.318HisTrp: 0.318 ± 0.011
0.894HisTyr: 0.894 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
6.663IleAla: 6.663 ± 0.066
0.461IleCys: 0.461 ± 0.015
3.518IleAsp: 3.518 ± 0.033
3.991IleGlu: 3.991 ± 0.053
1.895IlePhe: 1.895 ± 0.029
5.288IleGly: 5.288 ± 0.06
1.211IleHis: 1.211 ± 0.025
3.122IleIle: 3.122 ± 0.044
2.118IleLys: 2.118 ± 0.032
4.685IleLeu: 4.685 ± 0.053
1.399IleMet: 1.399 ± 0.023
1.928IleAsn: 1.928 ± 0.033
2.795IlePro: 2.795 ± 0.037
2.019IleGln: 2.019 ± 0.031
3.909IleArg: 3.909 ± 0.045
3.381IleSer: 3.381 ± 0.038
3.209IleThr: 3.209 ± 0.043
4.939IleVal: 4.939 ± 0.049
0.702IleTrp: 0.702 ± 0.015
1.816IleTyr: 1.816 ± 0.032
0.0IleXaa: 0.0 ± 0.0
Lys
4.198LysAla: 4.198 ± 0.048
0.164LysCys: 0.164 ± 0.008
2.37LysAsp: 2.37 ± 0.037
3.18LysGlu: 3.18 ± 0.048
1.314LysPhe: 1.314 ± 0.024
3.134LysGly: 3.134 ± 0.04
0.972LysHis: 0.972 ± 0.022
2.223LysIle: 2.223 ± 0.035
2.263LysLys: 2.263 ± 0.039
5.017LysLeu: 5.017 ± 0.043
1.207LysMet: 1.207 ± 0.023
1.478LysAsn: 1.478 ± 0.028
2.252LysPro: 2.252 ± 0.033
2.083LysGln: 2.083 ± 0.035
2.844LysArg: 2.844 ± 0.038
2.509LysSer: 2.509 ± 0.031
2.513LysThr: 2.513 ± 0.035
2.82LysVal: 2.82 ± 0.041
0.65LysTrp: 0.65 ± 0.017
1.375LysTyr: 1.375 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
10.544LeuAla: 10.544 ± 0.088
0.701LeuCys: 0.701 ± 0.017
5.144LeuAsp: 5.144 ± 0.047
5.698LeuGlu: 5.698 ± 0.06
4.459LeuPhe: 4.459 ± 0.055
6.878LeuGly: 6.878 ± 0.065
2.052LeuHis: 2.052 ± 0.03
5.7LeuIle: 5.7 ± 0.06
4.316LeuLys: 4.316 ± 0.046
11.082LeuLeu: 11.082 ± 0.103
2.493LeuMet: 2.493 ± 0.033
3.365LeuAsn: 3.365 ± 0.045
4.725LeuPro: 4.725 ± 0.051
3.645LeuGln: 3.645 ± 0.042
5.824LeuArg: 5.824 ± 0.057
6.495LeuSer: 6.495 ± 0.06
5.872LeuThr: 5.872 ± 0.053
6.418LeuVal: 6.418 ± 0.064
1.039LeuTrp: 1.039 ± 0.024
3.254LeuTyr: 3.254 ± 0.043
0.0LeuXaa: 0.0 ± 0.0
Met
2.532MetAla: 2.532 ± 0.034
0.136MetCys: 0.136 ± 0.008
1.393MetAsp: 1.393 ± 0.026
1.703MetGlu: 1.703 ± 0.029
0.963MetPhe: 0.963 ± 0.023
1.594MetGly: 1.594 ± 0.031
0.522MetHis: 0.522 ± 0.016
1.668MetIle: 1.668 ± 0.029
1.696MetLys: 1.696 ± 0.027
3.034MetLeu: 3.034 ± 0.042
0.859MetMet: 0.859 ± 0.021
1.417MetAsn: 1.417 ± 0.029
1.278MetPro: 1.278 ± 0.023
1.005MetGln: 1.005 ± 0.022
1.629MetArg: 1.629 ± 0.029
1.707MetSer: 1.707 ± 0.03
1.812MetThr: 1.812 ± 0.029
1.534MetVal: 1.534 ± 0.026
0.25MetTrp: 0.25 ± 0.01
0.738MetTyr: 0.738 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.469AsnAla: 3.469 ± 0.05
0.195AsnCys: 0.195 ± 0.01
1.748AsnAsp: 1.748 ± 0.031
2.27AsnGlu: 2.27 ± 0.034
1.188AsnPhe: 1.188 ± 0.028
3.442AsnGly: 3.442 ± 0.054
0.674AsnHis: 0.674 ± 0.018
1.817AsnIle: 1.817 ± 0.032
1.544AsnLys: 1.544 ± 0.031
2.909AsnLeu: 2.909 ± 0.037
0.899AsnMet: 0.899 ± 0.021
1.399AsnAsn: 1.399 ± 0.032
1.922AsnPro: 1.922 ± 0.032
1.267AsnGln: 1.267 ± 0.026
2.17AsnArg: 2.17 ± 0.031
1.636AsnSer: 1.636 ± 0.031
1.81AsnThr: 1.81 ± 0.034
2.589AsnVal: 2.589 ± 0.038
0.531AsnTrp: 0.531 ± 0.016
1.257AsnTyr: 1.257 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
4.355ProAla: 4.355 ± 0.053
0.243ProCys: 0.243 ± 0.01
2.859ProAsp: 2.859 ± 0.036
3.247ProGlu: 3.247 ± 0.038
1.942ProPhe: 1.942 ± 0.029
3.249ProGly: 3.249 ± 0.045
0.89ProHis: 0.89 ± 0.02
2.484ProIle: 2.484 ± 0.026
1.561ProLys: 1.561 ± 0.026
4.142ProLeu: 4.142 ± 0.05
1.054ProMet: 1.054 ± 0.023
1.535ProAsn: 1.535 ± 0.027
1.466ProPro: 1.466 ± 0.031
1.398ProGln: 1.398 ± 0.024
1.671ProArg: 1.671 ± 0.032
2.765ProSer: 2.765 ± 0.042
2.056ProThr: 2.056 ± 0.03
3.372ProVal: 3.372 ± 0.04
0.566ProTrp: 0.566 ± 0.015
1.529ProTyr: 1.529 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
3.986GlnAla: 3.986 ± 0.054
0.19GlnCys: 0.19 ± 0.009
1.594GlnAsp: 1.594 ± 0.028
2.197GlnGlu: 2.197 ± 0.033
1.404GlnPhe: 1.404 ± 0.027
2.597GlnGly: 2.597 ± 0.038
0.708GlnHis: 0.708 ± 0.018
2.034GlnIle: 2.034 ± 0.029
1.35GlnLys: 1.35 ± 0.025
4.109GlnLeu: 4.109 ± 0.051
0.975GlnMet: 0.975 ± 0.023
1.073GlnAsn: 1.073 ± 0.023
1.713GlnPro: 1.713 ± 0.027
1.581GlnGln: 1.581 ± 0.034
1.97GlnArg: 1.97 ± 0.035
2.235GlnSer: 2.235 ± 0.032
2.168GlnThr: 2.168 ± 0.032
2.309GlnVal: 2.309 ± 0.035
0.627GlnTrp: 0.627 ± 0.016
1.21GlnTyr: 1.21 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
4.645ArgAla: 4.645 ± 0.051
0.372ArgCys: 0.372 ± 0.014
2.84ArgAsp: 2.84 ± 0.038
4.264ArgGlu: 4.264 ± 0.055
2.573ArgPhe: 2.573 ± 0.033
3.669ArgGly: 3.669 ± 0.043
1.284ArgHis: 1.284 ± 0.026
4.116ArgIle: 4.116 ± 0.046
2.929ArgLys: 2.929 ± 0.038
6.283ArgLeu: 6.283 ± 0.062
2.012ArgMet: 2.012 ± 0.033
1.938ArgAsn: 1.938 ± 0.029
2.022ArgPro: 2.022 ± 0.032
2.352ArgGln: 2.352 ± 0.037
3.536ArgArg: 3.536 ± 0.052
3.311ArgSer: 3.311 ± 0.042
3.159ArgThr: 3.159 ± 0.045
3.597ArgVal: 3.597 ± 0.041
0.819ArgTrp: 0.819 ± 0.019
2.182ArgTyr: 2.182 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
5.958SerAla: 5.958 ± 0.07
0.397SerCys: 0.397 ± 0.014
3.009SerAsp: 3.009 ± 0.039
3.495SerGlu: 3.495 ± 0.043
2.704SerPhe: 2.704 ± 0.032
5.898SerGly: 5.898 ± 0.06
1.074SerHis: 1.074 ± 0.018
3.587SerIle: 3.587 ± 0.042
2.451SerLys: 2.451 ± 0.038
5.849SerLeu: 5.849 ± 0.052
1.641SerMet: 1.641 ± 0.028
1.918SerAsn: 1.918 ± 0.03
2.473SerPro: 2.473 ± 0.036
1.796SerGln: 1.796 ± 0.028
3.237SerArg: 3.237 ± 0.041
3.707SerSer: 3.707 ± 0.047
2.874SerThr: 2.874 ± 0.042
4.399SerVal: 4.399 ± 0.053
0.852SerTrp: 0.852 ± 0.018
2.108SerTyr: 2.108 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
5.455ThrAla: 5.455 ± 0.059
0.303ThrCys: 0.303 ± 0.012
2.927ThrAsp: 2.927 ± 0.037
3.134ThrGlu: 3.134 ± 0.04
2.323ThrPhe: 2.323 ± 0.037
4.731ThrGly: 4.731 ± 0.056
0.917ThrHis: 0.917 ± 0.022
3.628ThrIle: 3.628 ± 0.054
2.21ThrLys: 2.21 ± 0.035
5.522ThrLeu: 5.522 ± 0.056
1.419ThrMet: 1.419 ± 0.025
1.958ThrAsn: 1.958 ± 0.043
2.597ThrPro: 2.597 ± 0.039
1.464ThrGln: 1.464 ± 0.028
2.423ThrArg: 2.423 ± 0.029
3.163ThrSer: 3.163 ± 0.045
2.966ThrThr: 2.966 ± 0.051
4.571ThrVal: 4.571 ± 0.055
0.668ThrTrp: 0.668 ± 0.019
1.916ThrTyr: 1.916 ± 0.035
0.0ThrXaa: 0.0 ± 0.0
Val
6.084ValAla: 6.084 ± 0.045
0.571ValCys: 0.571 ± 0.016
3.595ValAsp: 3.595 ± 0.042
4.068ValGlu: 4.068 ± 0.046
2.835ValPhe: 2.835 ± 0.034
4.717ValGly: 4.717 ± 0.057
1.442ValHis: 1.442 ± 0.028
4.333ValIle: 4.333 ± 0.05
3.361ValLys: 3.361 ± 0.044
7.109ValLeu: 7.109 ± 0.057
1.951ValMet: 1.951 ± 0.033
2.683ValAsn: 2.683 ± 0.037
3.151ValPro: 3.151 ± 0.038
2.48ValGln: 2.48 ± 0.034
4.115ValArg: 4.115 ± 0.046
4.567ValSer: 4.567 ± 0.052
4.5ValThr: 4.5 ± 0.059
5.077ValVal: 5.077 ± 0.054
0.888ValTrp: 0.888 ± 0.018
2.448ValTyr: 2.448 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
1.021TrpAla: 1.021 ± 0.022
0.101TrpCys: 0.101 ± 0.006
0.73TrpAsp: 0.73 ± 0.018
0.828TrpGlu: 0.828 ± 0.019
0.574TrpPhe: 0.574 ± 0.018
0.928TrpGly: 0.928 ± 0.022
0.302TrpHis: 0.302 ± 0.012
0.818TrpIle: 0.818 ± 0.02
0.683TrpLys: 0.683 ± 0.019
1.555TrpLeu: 1.555 ± 0.03
0.468TrpMet: 0.468 ± 0.015
0.683TrpAsn: 0.683 ± 0.018
0.469TrpPro: 0.469 ± 0.017
0.577TrpGln: 0.577 ± 0.018
0.828TrpArg: 0.828 ± 0.021
0.889TrpSer: 0.889 ± 0.021
0.855TrpThr: 0.855 ± 0.019
0.818TrpVal: 0.818 ± 0.016
0.227TrpTrp: 0.227 ± 0.011
0.452TrpTyr: 0.452 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.39TyrAla: 3.39 ± 0.039
0.263TyrCys: 0.263 ± 0.01
1.916TyrAsp: 1.916 ± 0.036
2.271TyrGlu: 2.271 ± 0.034
1.49TyrPhe: 1.49 ± 0.027
2.904TyrGly: 2.904 ± 0.041
0.669TyrHis: 0.669 ± 0.017
1.792TyrIle: 1.792 ± 0.03
1.386TyrLys: 1.386 ± 0.023
3.242TyrLeu: 3.242 ± 0.04
0.921TyrMet: 0.921 ± 0.021
1.343TyrAsn: 1.343 ± 0.025
1.546TyrPro: 1.546 ± 0.028
1.066TyrGln: 1.066 ± 0.022
2.331TyrArg: 2.331 ± 0.032
1.791TyrSer: 1.791 ± 0.029
1.76TyrThr: 1.76 ± 0.035
2.463TyrVal: 2.463 ± 0.039
0.531TyrTrp: 0.531 ± 0.015
1.374TyrTyr: 1.374 ± 0.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 6989 proteins (2386252 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski