Amino acid dipepetide frequency for Paenibacillus sp. HGF7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.886AlaAla: 10.886 ± 0.135
0.774AlaCys: 0.774 ± 0.024
4.883AlaAsp: 4.883 ± 0.065
6.185AlaGlu: 6.185 ± 0.089
3.568AlaPhe: 3.568 ± 0.048
8.167AlaGly: 8.167 ± 0.089
1.428AlaHis: 1.428 ± 0.029
4.569AlaIle: 4.569 ± 0.057
4.371AlaLys: 4.371 ± 0.055
8.765AlaLeu: 8.765 ± 0.092
2.222AlaMet: 2.222 ± 0.038
2.494AlaAsn: 2.494 ± 0.041
3.213AlaPro: 3.213 ± 0.055
2.769AlaGln: 2.769 ± 0.042
4.12AlaArg: 4.12 ± 0.056
5.494AlaSer: 5.494 ± 0.075
3.224AlaThr: 3.224 ± 0.053
7.273AlaVal: 7.273 ± 0.084
0.941AlaTrp: 0.941 ± 0.025
2.927AlaTyr: 2.927 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.565CysAla: 0.565 ± 0.018
0.104CysCys: 0.104 ± 0.008
0.358CysAsp: 0.358 ± 0.013
0.432CysGlu: 0.432 ± 0.016
0.331CysPhe: 0.331 ± 0.014
0.786CysGly: 0.786 ± 0.022
0.159CysHis: 0.159 ± 0.01
0.475CysIle: 0.475 ± 0.016
0.319CysLys: 0.319 ± 0.013
0.784CysLeu: 0.784 ± 0.023
0.209CysMet: 0.209 ± 0.011
0.233CysAsn: 0.233 ± 0.012
0.377CysPro: 0.377 ± 0.018
0.18CysGln: 0.18 ± 0.011
0.496CysArg: 0.496 ± 0.017
0.524CysSer: 0.524 ± 0.017
0.425CysThr: 0.425 ± 0.017
0.48CysVal: 0.48 ± 0.017
0.093CysTrp: 0.093 ± 0.007
0.246CysTyr: 0.246 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
4.045AspAla: 4.045 ± 0.059
0.399AspCys: 0.399 ± 0.017
2.244AspAsp: 2.244 ± 0.04
3.703AspGlu: 3.703 ± 0.053
2.116AspPhe: 2.116 ± 0.033
4.153AspGly: 4.153 ± 0.056
1.027AspHis: 1.027 ± 0.027
3.238AspIle: 3.238 ± 0.044
2.971AspLys: 2.971 ± 0.044
4.996AspLeu: 4.996 ± 0.06
1.27AspMet: 1.27 ± 0.027
1.652AspAsn: 1.652 ± 0.035
2.49AspPro: 2.49 ± 0.044
1.591AspGln: 1.591 ± 0.035
2.932AspArg: 2.932 ± 0.041
2.788AspSer: 2.788 ± 0.039
2.598AspThr: 2.598 ± 0.045
3.485AspVal: 3.485 ± 0.048
0.795AspTrp: 0.795 ± 0.022
2.019AspTyr: 2.019 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
6.665GluAla: 6.665 ± 0.095
0.362GluCys: 0.362 ± 0.013
3.152GluAsp: 3.152 ± 0.048
5.78GluGlu: 5.78 ± 0.082
2.204GluPhe: 2.204 ± 0.034
4.485GluGly: 4.485 ± 0.055
1.468GluHis: 1.468 ± 0.029
4.214GluIle: 4.214 ± 0.058
4.175GluLys: 4.175 ± 0.052
7.261GluLeu: 7.261 ± 0.1
1.885GluMet: 1.885 ± 0.035
2.542GluAsn: 2.542 ± 0.039
2.512GluPro: 2.512 ± 0.035
3.272GluGln: 3.272 ± 0.057
4.078GluArg: 4.078 ± 0.069
3.549GluSer: 3.549 ± 0.048
3.684GluThr: 3.684 ± 0.049
4.15GluVal: 4.15 ± 0.054
0.882GluTrp: 0.882 ± 0.026
1.891GluTyr: 1.891 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.378PheAla: 3.378 ± 0.048
0.357PheCys: 0.357 ± 0.015
2.242PheAsp: 2.242 ± 0.036
2.37PheGlu: 2.37 ± 0.044
1.902PhePhe: 1.902 ± 0.043
3.237PheGly: 3.237 ± 0.047
0.913PheHis: 0.913 ± 0.024
2.669PheIle: 2.669 ± 0.041
1.921PheLys: 1.921 ± 0.041
4.092PheLeu: 4.092 ± 0.053
1.153PheMet: 1.153 ± 0.027
1.481PheAsn: 1.481 ± 0.033
1.691PhePro: 1.691 ± 0.033
1.319PheGln: 1.319 ± 0.027
2.173PheArg: 2.173 ± 0.04
2.74PheSer: 2.74 ± 0.044
2.546PheThr: 2.546 ± 0.036
3.06PheVal: 3.06 ± 0.045
0.515PheTrp: 0.515 ± 0.018
1.517PheTyr: 1.517 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
6.283GlyAla: 6.283 ± 0.081
0.724GlyCys: 0.724 ± 0.019
3.577GlyAsp: 3.577 ± 0.047
4.89GlyGlu: 4.89 ± 0.06
3.277GlyPhe: 3.277 ± 0.048
6.096GlyGly: 6.096 ± 0.079
1.497GlyHis: 1.497 ± 0.029
5.586GlyIle: 5.586 ± 0.07
4.656GlyLys: 4.656 ± 0.058
7.597GlyLeu: 7.597 ± 0.076
2.316GlyMet: 2.316 ± 0.039
2.581GlyAsn: 2.581 ± 0.047
2.275GlyPro: 2.275 ± 0.043
2.683GlyGln: 2.683 ± 0.048
4.196GlyArg: 4.196 ± 0.057
5.401GlySer: 5.401 ± 0.068
4.916GlyThr: 4.916 ± 0.065
5.266GlyVal: 5.266 ± 0.053
1.113GlyTrp: 1.113 ± 0.027
2.925GlyTyr: 2.925 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.623HisAla: 1.623 ± 0.034
0.185HisCys: 0.185 ± 0.01
0.987HisAsp: 0.987 ± 0.02
1.302HisGlu: 1.302 ± 0.03
0.962HisPhe: 0.962 ± 0.026
1.54HisGly: 1.54 ± 0.031
0.586HisHis: 0.586 ± 0.025
1.286HisIle: 1.286 ± 0.031
0.865HisLys: 0.865 ± 0.023
2.105HisLeu: 2.105 ± 0.041
0.527HisMet: 0.527 ± 0.017
0.57HisAsn: 0.57 ± 0.017
1.198HisPro: 1.198 ± 0.025
0.68HisGln: 0.68 ± 0.019
1.093HisArg: 1.093 ± 0.026
1.137HisSer: 1.137 ± 0.024
1.108HisThr: 1.108 ± 0.026
1.382HisVal: 1.382 ± 0.028
0.276HisTrp: 0.276 ± 0.011
0.796HisTyr: 0.796 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
5.574IleAla: 5.574 ± 0.061
0.527IleCys: 0.527 ± 0.02
3.166IleAsp: 3.166 ± 0.047
3.91IleGlu: 3.91 ± 0.048
2.189IlePhe: 2.189 ± 0.04
5.332IleGly: 5.332 ± 0.066
1.346IleHis: 1.346 ± 0.028
3.386IleIle: 3.386 ± 0.051
2.674IleLys: 2.674 ± 0.041
5.652IleLeu: 5.652 ± 0.068
1.445IleMet: 1.445 ± 0.031
2.025IleAsn: 2.025 ± 0.038
3.048IlePro: 3.048 ± 0.05
2.223IleGln: 2.223 ± 0.037
3.99IleArg: 3.99 ± 0.049
4.01IleSer: 4.01 ± 0.055
3.335IleThr: 3.335 ± 0.046
4.817IleVal: 4.817 ± 0.059
0.639IleTrp: 0.639 ± 0.02
1.86IleTyr: 1.86 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
4.527LysAla: 4.527 ± 0.061
0.228LysCys: 0.228 ± 0.011
2.873LysAsp: 2.873 ± 0.043
4.515LysGlu: 4.515 ± 0.056
1.635LysPhe: 1.635 ± 0.034
3.628LysGly: 3.628 ± 0.047
1.049LysHis: 1.049 ± 0.02
3.169LysIle: 3.169 ± 0.049
3.535LysLys: 3.535 ± 0.055
5.416LysLeu: 5.416 ± 0.064
1.528LysMet: 1.528 ± 0.033
2.11LysAsn: 2.11 ± 0.039
2.504LysPro: 2.504 ± 0.039
2.36LysGln: 2.36 ± 0.038
2.885LysArg: 2.885 ± 0.037
2.987LysSer: 2.987 ± 0.046
3.153LysThr: 3.153 ± 0.043
3.427LysVal: 3.427 ± 0.052
0.701LysTrp: 0.701 ± 0.02
1.639LysTyr: 1.639 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
8.981LeuAla: 8.981 ± 0.091
0.86LeuCys: 0.86 ± 0.028
5.181LeuAsp: 5.181 ± 0.063
6.387LeuGlu: 6.387 ± 0.08
4.616LeuPhe: 4.616 ± 0.057
7.099LeuGly: 7.099 ± 0.07
2.193LeuHis: 2.193 ± 0.036
6.286LeuIle: 6.286 ± 0.078
5.384LeuLys: 5.384 ± 0.062
11.521LeuLeu: 11.521 ± 0.112
2.479LeuMet: 2.479 ± 0.042
3.736LeuAsn: 3.736 ± 0.044
4.896LeuPro: 4.896 ± 0.056
4.041LeuGln: 4.041 ± 0.057
5.249LeuArg: 5.249 ± 0.059
6.891LeuSer: 6.891 ± 0.063
5.973LeuThr: 5.973 ± 0.064
6.564LeuVal: 6.564 ± 0.074
0.975LeuTrp: 0.975 ± 0.025
3.21LeuTyr: 3.21 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
2.12MetAla: 2.12 ± 0.037
0.142MetCys: 0.142 ± 0.01
1.391MetAsp: 1.391 ± 0.026
1.783MetGlu: 1.783 ± 0.034
1.026MetPhe: 1.026 ± 0.027
1.627MetGly: 1.627 ± 0.033
0.437MetHis: 0.437 ± 0.015
1.743MetIle: 1.743 ± 0.037
1.969MetLys: 1.969 ± 0.036
2.699MetLeu: 2.699 ± 0.039
0.8MetMet: 0.8 ± 0.025
1.424MetAsn: 1.424 ± 0.031
1.168MetPro: 1.168 ± 0.026
0.953MetGln: 0.953 ± 0.026
1.341MetArg: 1.341 ± 0.031
1.705MetSer: 1.705 ± 0.036
1.665MetThr: 1.665 ± 0.032
1.537MetVal: 1.537 ± 0.033
0.213MetTrp: 0.213 ± 0.012
0.735MetTyr: 0.735 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.724AsnAla: 2.724 ± 0.04
0.237AsnCys: 0.237 ± 0.012
1.642AsnAsp: 1.642 ± 0.03
2.303AsnGlu: 2.303 ± 0.037
1.318AsnPhe: 1.318 ± 0.027
3.079AsnGly: 3.079 ± 0.057
0.74AsnHis: 0.74 ± 0.023
2.063AsnIle: 2.063 ± 0.035
1.988AsnLys: 1.988 ± 0.039
3.271AsnLeu: 3.271 ± 0.047
0.941AsnMet: 0.941 ± 0.021
1.357AsnAsn: 1.357 ± 0.036
2.119AsnPro: 2.119 ± 0.035
1.291AsnGln: 1.291 ± 0.027
2.205AsnArg: 2.205 ± 0.036
1.91AsnSer: 1.91 ± 0.04
1.883AsnThr: 1.883 ± 0.04
2.466AsnVal: 2.466 ± 0.038
0.494AsnTrp: 0.494 ± 0.017
1.209AsnTyr: 1.209 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
4.379ProAla: 4.379 ± 0.068
0.243ProCys: 0.243 ± 0.014
2.947ProAsp: 2.947 ± 0.047
3.569ProGlu: 3.569 ± 0.048
1.921ProPhe: 1.921 ± 0.034
3.676ProGly: 3.676 ± 0.065
0.925ProHis: 0.925 ± 0.021
2.106ProIle: 2.106 ± 0.041
1.828ProLys: 1.828 ± 0.03
4.286ProLeu: 4.286 ± 0.056
0.922ProMet: 0.922 ± 0.022
1.393ProAsn: 1.393 ± 0.03
1.531ProPro: 1.531 ± 0.043
1.505ProGln: 1.505 ± 0.032
1.574ProArg: 1.574 ± 0.029
2.786ProSer: 2.786 ± 0.043
1.883ProThr: 1.883 ± 0.04
3.813ProVal: 3.813 ± 0.06
0.498ProTrp: 0.498 ± 0.018
1.563ProTyr: 1.563 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.545GlnAla: 3.545 ± 0.046
0.19GlnCys: 0.19 ± 0.011
1.59GlnAsp: 1.59 ± 0.028
2.493GlnGlu: 2.493 ± 0.046
1.383GlnPhe: 1.383 ± 0.033
2.542GlnGly: 2.542 ± 0.043
0.69GlnHis: 0.69 ± 0.021
2.264GlnIle: 2.264 ± 0.04
1.959GlnLys: 1.959 ± 0.034
3.833GlnLeu: 3.833 ± 0.053
1.01GlnMet: 1.01 ± 0.026
1.295GlnAsn: 1.295 ± 0.03
1.633GlnPro: 1.633 ± 0.033
1.509GlnGln: 1.509 ± 0.036
1.813GlnArg: 1.813 ± 0.034
2.14GlnSer: 2.14 ± 0.036
2.1GlnThr: 2.1 ± 0.037
2.349GlnVal: 2.349 ± 0.041
0.456GlnTrp: 0.456 ± 0.017
1.105GlnTyr: 1.105 ± 0.027
0.0GlnXaa: 0.0 ± 0.0
Arg
3.608ArgAla: 3.608 ± 0.052
0.379ArgCys: 0.379 ± 0.014
2.6ArgAsp: 2.6 ± 0.045
4.358ArgGlu: 4.358 ± 0.066
2.312ArgPhe: 2.312 ± 0.037
3.313ArgGly: 3.313 ± 0.054
1.242ArgHis: 1.242 ± 0.026
3.779ArgIle: 3.779 ± 0.051
3.399ArgLys: 3.399 ± 0.043
5.793ArgLeu: 5.793 ± 0.069
1.662ArgMet: 1.662 ± 0.031
1.928ArgAsn: 1.928 ± 0.032
1.935ArgPro: 1.935 ± 0.035
2.203ArgGln: 2.203 ± 0.036
3.172ArgArg: 3.172 ± 0.06
3.385ArgSer: 3.385 ± 0.044
3.031ArgThr: 3.031 ± 0.04
3.304ArgVal: 3.304 ± 0.041
0.642ArgTrp: 0.642 ± 0.021
1.906ArgTyr: 1.906 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
5.515SerAla: 5.515 ± 0.065
0.465SerCys: 0.465 ± 0.016
2.963SerAsp: 2.963 ± 0.047
3.669SerGlu: 3.669 ± 0.052
3.03SerPhe: 3.03 ± 0.044
6.048SerGly: 6.048 ± 0.061
1.146SerHis: 1.146 ± 0.027
3.637SerIle: 3.637 ± 0.045
3.025SerLys: 3.025 ± 0.043
6.49SerLeu: 6.49 ± 0.069
1.691SerMet: 1.691 ± 0.035
1.885SerAsn: 1.885 ± 0.042
2.757SerPro: 2.757 ± 0.038
1.814SerGln: 1.814 ± 0.037
3.521SerArg: 3.521 ± 0.049
4.175SerSer: 4.175 ± 0.057
2.954SerThr: 2.954 ± 0.043
4.712SerVal: 4.712 ± 0.076
0.765SerTrp: 0.765 ± 0.021
2.222SerTyr: 2.222 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
5.357ThrAla: 5.357 ± 0.071
0.37ThrCys: 0.37 ± 0.015
2.926ThrAsp: 2.926 ± 0.046
3.418ThrGlu: 3.418 ± 0.044
2.43ThrPhe: 2.43 ± 0.038
5.094ThrGly: 5.094 ± 0.067
0.96ThrHis: 0.96 ± 0.025
3.307ThrIle: 3.307 ± 0.049
2.393ThrLys: 2.393 ± 0.042
5.544ThrLeu: 5.544 ± 0.061
1.232ThrMet: 1.232 ± 0.027
1.868ThrAsn: 1.868 ± 0.041
2.694ThrPro: 2.694 ± 0.05
1.49ThrGln: 1.49 ± 0.029
2.421ThrArg: 2.421 ± 0.038
3.116ThrSer: 3.116 ± 0.048
2.699ThrThr: 2.699 ± 0.045
4.694ThrVal: 4.694 ± 0.082
0.625ThrTrp: 0.625 ± 0.022
1.922ThrTyr: 1.922 ± 0.039
0.0ThrXaa: 0.0 ± 0.0
Val
4.956ValAla: 4.956 ± 0.061
0.658ValCys: 0.658 ± 0.021
3.375ValAsp: 3.375 ± 0.049
4.094ValGlu: 4.094 ± 0.05
3.034ValPhe: 3.034 ± 0.052
4.447ValGly: 4.447 ± 0.06
1.51ValHis: 1.51 ± 0.026
4.597ValIle: 4.597 ± 0.057
3.989ValLys: 3.989 ± 0.05
7.735ValLeu: 7.735 ± 0.077
1.88ValMet: 1.88 ± 0.031
2.751ValAsn: 2.751 ± 0.045
3.336ValPro: 3.336 ± 0.05
2.508ValGln: 2.508 ± 0.035
3.984ValArg: 3.984 ± 0.055
4.941ValSer: 4.941 ± 0.064
4.682ValThr: 4.682 ± 0.095
4.914ValVal: 4.914 ± 0.069
0.857ValTrp: 0.857 ± 0.026
2.478ValTyr: 2.478 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.823TrpAla: 0.823 ± 0.022
0.089TrpCys: 0.089 ± 0.007
0.626TrpAsp: 0.626 ± 0.017
0.759TrpGlu: 0.759 ± 0.021
0.54TrpPhe: 0.54 ± 0.017
0.786TrpGly: 0.786 ± 0.024
0.23TrpHis: 0.23 ± 0.011
0.867TrpIle: 0.867 ± 0.022
0.755TrpLys: 0.755 ± 0.022
1.423TrpLeu: 1.423 ± 0.033
0.401TrpMet: 0.401 ± 0.016
0.661TrpAsn: 0.661 ± 0.022
0.358TrpPro: 0.358 ± 0.015
0.456TrpGln: 0.456 ± 0.017
0.576TrpArg: 0.576 ± 0.019
0.746TrpSer: 0.746 ± 0.026
0.714TrpThr: 0.714 ± 0.022
0.725TrpVal: 0.725 ± 0.019
0.174TrpTrp: 0.174 ± 0.01
0.372TrpTyr: 0.372 ± 0.014
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.719TyrAla: 2.719 ± 0.034
0.281TyrCys: 0.281 ± 0.013
1.787TyrAsp: 1.787 ± 0.038
2.289TyrGlu: 2.289 ± 0.039
1.485TyrPhe: 1.485 ± 0.033
2.718TyrGly: 2.718 ± 0.043
0.663TyrHis: 0.663 ± 0.02
1.907TyrIle: 1.907 ± 0.035
1.693TyrLys: 1.693 ± 0.034
3.261TyrLeu: 3.261 ± 0.045
0.906TyrMet: 0.906 ± 0.022
1.256TyrAsn: 1.256 ± 0.03
1.613TyrPro: 1.613 ± 0.031
1.051TyrGln: 1.051 ± 0.026
2.134TyrArg: 2.134 ± 0.036
2.057TyrSer: 2.057 ± 0.032
1.99TyrThr: 1.99 ± 0.039
2.305TyrVal: 2.305 ± 0.037
0.428TyrTrp: 0.428 ± 0.017
1.236TyrTyr: 1.236 ± 0.028
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5992 proteins (1790891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski