Amino acid dipeptide frequency for Herbaspirillum sp. SJZ107

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
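As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency (in per mille) with the uniform expectation of 1/400, i.e. 2.5‰. The function names `dipeptide_frequencies` and `classify`, the toy sequences, and the choice to map every non-standard letter to `X` are assumptions made for this example; it is not the pipeline that produced the table.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000 / 400    # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- or underrepresented relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["MAALAG", "AALLA"]  # hypothetical sequences, for illustration only
    for pair, freq in sorted(dipeptide_frequencies(toy).items()):
        print(pair, round(freq, 1), classify(freq))
```

With this labelling, the AlaAla value from the table (19.421‰) would count as overrepresented and CysCys (0.108‰) as underrepresented.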

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
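As a small worked example of this unit conversion, the Python sketch below converts the AlaAla entry (19.421‰) to a fraction and to percent. The helper names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


ala_ala = 19.421  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # about 0.019421
print(per_mille_to_percent(ala_ala))   # about 1.9421 (%)
print(per_mille_to_percent(2.5))       # 0.25 (%), the uniform expectation of 1/400
```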

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.421 ± 0.181
AlaCys: 1.323 ± 0.033
AlaAsp: 6.832 ± 0.07
AlaGlu: 6.292 ± 0.069
AlaPhe: 4.086 ± 0.045
AlaGly: 11.794 ± 0.1
AlaHis: 2.625 ± 0.038
AlaIle: 5.52 ± 0.06
AlaLys: 3.691 ± 0.065
AlaLeu: 14.39 ± 0.131
AlaMet: 3.362 ± 0.04
AlaAsn: 3.323 ± 0.047
AlaPro: 6.714 ± 0.083
AlaGln: 5.264 ± 0.07
AlaArg: 9.42 ± 0.076
AlaSer: 6.888 ± 0.067
AlaThr: 6.229 ± 0.06
AlaVal: 8.786 ± 0.08
AlaTrp: 1.879 ± 0.039
AlaTyr: 2.704 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.172 ± 0.026
CysCys: 0.108 ± 0.008
CysAsp: 0.503 ± 0.015
CysGlu: 0.397 ± 0.015
CysPhe: 0.276 ± 0.012
CysGly: 0.868 ± 0.022
CysHis: 0.217 ± 0.01
CysIle: 0.397 ± 0.014
CysLys: 0.217 ± 0.01
CysLeu: 0.798 ± 0.02
CysMet: 0.191 ± 0.01
CysAsn: 0.213 ± 0.01
CysPro: 0.372 ± 0.015
CysGln: 0.226 ± 0.01
CysArg: 0.562 ± 0.019
CysSer: 0.485 ± 0.015
CysThr: 0.462 ± 0.018
CysVal: 0.584 ± 0.02
CysTrp: 0.134 ± 0.008
CysTyr: 0.208 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.487 ± 0.064
AspCys: 0.428 ± 0.014
AspAsp: 3.094 ± 0.044
AspGlu: 3.094 ± 0.042
AspPhe: 2.07 ± 0.033
AspGly: 5.292 ± 0.062
AspHis: 1.027 ± 0.025
AspIle: 2.722 ± 0.036
AspLys: 1.992 ± 0.039
AspLeu: 5.352 ± 0.054
AspMet: 1.253 ± 0.026
AspAsn: 1.522 ± 0.036
AspPro: 3.288 ± 0.048
AspGln: 1.778 ± 0.033
AspArg: 3.339 ± 0.039
AspSer: 2.503 ± 0.036
AspThr: 2.897 ± 0.053
AspVal: 3.98 ± 0.049
AspTrp: 0.96 ± 0.024
AspTyr: 1.672 ± 0.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.367 ± 0.066
GluCys: 0.349 ± 0.014
GluAsp: 2.234 ± 0.038
GluGlu: 2.608 ± 0.044
GluPhe: 1.743 ± 0.03
GluGly: 3.617 ± 0.05
GluHis: 1.28 ± 0.026
GluIle: 2.551 ± 0.035
GluLys: 1.853 ± 0.035
GluLeu: 5.593 ± 0.063
GluMet: 1.286 ± 0.025
GluAsn: 1.414 ± 0.028
GluPro: 2.206 ± 0.042
GluGln: 2.658 ± 0.039
GluArg: 4.679 ± 0.061
GluSer: 2.316 ± 0.031
GluThr: 2.527 ± 0.036
GluVal: 3.641 ± 0.05
GluTrp: 0.707 ± 0.021
GluTyr: 1.132 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.102 ± 0.054
PheCys: 0.359 ± 0.014
PheAsp: 2.559 ± 0.037
PheGlu: 1.888 ± 0.033
PhePhe: 1.275 ± 0.031
PheGly: 3.405 ± 0.048
PheHis: 0.703 ± 0.019
PheIle: 1.501 ± 0.033
PheLys: 1.107 ± 0.024
PheLeu: 3.004 ± 0.045
PheMet: 0.81 ± 0.02
PheAsn: 1.2 ± 0.024
PhePro: 1.496 ± 0.025
PheGln: 1.057 ± 0.024
PheArg: 1.99 ± 0.031
PheSer: 2.199 ± 0.035
PheThr: 2.027 ± 0.035
PheVal: 2.587 ± 0.043
PheTrp: 0.477 ± 0.015
PheTyr: 0.962 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.1 ± 0.092
GlyCys: 0.799 ± 0.021
GlyAsp: 4.252 ± 0.057
GlyGlu: 4.123 ± 0.053
GlyPhe: 3.289 ± 0.044
GlyGly: 7.319 ± 0.101
GlyHis: 1.906 ± 0.034
GlyIle: 4.253 ± 0.05
GlyLys: 3.624 ± 0.05
GlyLeu: 8.418 ± 0.082
GlyMet: 2.436 ± 0.036
GlyAsn: 2.587 ± 0.066
GlyPro: 3.126 ± 0.045
GlyGln: 3.306 ± 0.041
GlyArg: 5.521 ± 0.063
GlySer: 5.005 ± 0.063
GlyThr: 4.873 ± 0.056
GlyVal: 6.275 ± 0.057
GlyTrp: 1.489 ± 0.029
GlyTyr: 2.494 ± 0.04
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.912 ± 0.042
HisCys: 0.232 ± 0.01
HisAsp: 1.272 ± 0.026
HisGlu: 1.134 ± 0.025
HisPhe: 0.859 ± 0.023
HisGly: 2.182 ± 0.032
HisHis: 0.587 ± 0.02
HisIle: 0.936 ± 0.02
HisLys: 0.6 ± 0.018
HisLeu: 2.078 ± 0.037
HisMet: 0.532 ± 0.016
HisAsn: 0.515 ± 0.016
HisPro: 1.413 ± 0.029
HisGln: 0.677 ± 0.018
HisArg: 1.28 ± 0.026
HisSer: 0.963 ± 0.02
HisThr: 1.047 ± 0.023
HisVal: 1.509 ± 0.029
HisTrp: 0.347 ± 0.013
HisTyr: 0.684 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.142 ± 0.058
IleCys: 0.356 ± 0.012
IleAsp: 3.357 ± 0.045
IleGlu: 3.002 ± 0.041
IlePhe: 1.226 ± 0.028
IleGly: 4.441 ± 0.057
IleHis: 0.87 ± 0.021
IleIle: 1.599 ± 0.032
IleLys: 1.559 ± 0.029
IleLeu: 3.679 ± 0.044
IleMet: 0.844 ± 0.02
IleAsn: 1.467 ± 0.03
IlePro: 2.117 ± 0.034
IleGln: 1.224 ± 0.022
IleArg: 2.682 ± 0.037
IleSer: 2.302 ± 0.036
IleThr: 2.279 ± 0.036
IleVal: 3.776 ± 0.044
IleTrp: 0.462 ± 0.015
IleTyr: 0.976 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.93 ± 0.059
LysCys: 0.17 ± 0.008
LysAsp: 1.745 ± 0.03
LysGlu: 1.62 ± 0.034
LysPhe: 0.94 ± 0.021
LysGly: 2.368 ± 0.046
LysHis: 0.7 ± 0.02
LysIle: 1.54 ± 0.032
LysLys: 1.384 ± 0.034
LysLeu: 3.561 ± 0.05
LysMet: 0.891 ± 0.021
LysAsn: 1.129 ± 0.027
LysPro: 2.152 ± 0.037
LysGln: 1.309 ± 0.031
LysArg: 2.259 ± 0.039
LysSer: 1.738 ± 0.037
LysThr: 1.979 ± 0.042
LysVal: 2.485 ± 0.039
LysTrp: 0.36 ± 0.014
LysTyr: 0.786 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.142 ± 0.131
LeuCys: 0.937 ± 0.023
LeuAsp: 6.305 ± 0.055
LeuGlu: 5.156 ± 0.056
LeuPhe: 3.441 ± 0.05
LeuGly: 8.333 ± 0.08
LeuHis: 2.297 ± 0.03
LeuIle: 3.846 ± 0.047
LeuLys: 3.376 ± 0.043
LeuLeu: 10.799 ± 0.123
LeuMet: 2.228 ± 0.038
LeuAsn: 2.831 ± 0.041
LeuPro: 5.911 ± 0.059
LeuGln: 3.945 ± 0.039
LeuArg: 7.569 ± 0.077
LeuSer: 5.947 ± 0.064
LeuThr: 4.932 ± 0.058
LeuVal: 7.674 ± 0.074
LeuTrp: 1.158 ± 0.028
LeuTyr: 2.255 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.012 ± 0.037
MetCys: 0.171 ± 0.01
MetAsp: 1.179 ± 0.024
MetGlu: 1.027 ± 0.025
MetPhe: 0.742 ± 0.019
MetGly: 1.711 ± 0.03
MetHis: 0.592 ± 0.019
MetIle: 1.032 ± 0.024
MetLys: 1.065 ± 0.026
MetLeu: 2.756 ± 0.035
MetMet: 0.609 ± 0.017
MetAsn: 0.946 ± 0.022
MetPro: 1.522 ± 0.029
MetGln: 1.077 ± 0.022
MetArg: 1.785 ± 0.031
MetSer: 1.525 ± 0.025
MetThr: 1.519 ± 0.027
MetVal: 1.628 ± 0.03
MetTrp: 0.206 ± 0.009
MetTyr: 0.466 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.6 ± 0.05
AsnCys: 0.253 ± 0.012
AsnAsp: 1.568 ± 0.04
AsnGlu: 1.334 ± 0.026
AsnPhe: 1.047 ± 0.026
AsnGly: 2.827 ± 0.048
AsnHis: 0.579 ± 0.017
AsnIle: 1.366 ± 0.025
AsnLys: 0.956 ± 0.025
AsnLeu: 2.959 ± 0.046
AsnMet: 0.658 ± 0.017
AsnAsn: 1.016 ± 0.023
AsnPro: 1.922 ± 0.035
AsnGln: 1.006 ± 0.027
AsnArg: 1.85 ± 0.033
AsnSer: 1.374 ± 0.03
AsnThr: 1.573 ± 0.032
AsnVal: 2.221 ± 0.042
AsnTrp: 0.432 ± 0.015
AsnTyr: 0.856 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.849 ± 0.091
ProCys: 0.329 ± 0.013
ProAsp: 3.444 ± 0.048
ProGlu: 3.079 ± 0.044
ProPhe: 1.786 ± 0.032
ProGly: 4.836 ± 0.06
ProHis: 1.08 ± 0.024
ProIle: 1.83 ± 0.032
ProLys: 1.466 ± 0.034
ProLeu: 5.269 ± 0.069
ProMet: 1.158 ± 0.026
ProAsn: 1.447 ± 0.032
ProPro: 2.545 ± 0.055
ProGln: 2.056 ± 0.032
ProArg: 2.782 ± 0.042
ProSer: 2.687 ± 0.037
ProThr: 2.361 ± 0.036
ProVal: 4.211 ± 0.053
ProTrp: 0.66 ± 0.02
ProTyr: 1.218 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.481 ± 0.07
GlnCys: 0.267 ± 0.012
GlnAsp: 1.868 ± 0.033
GlnGlu: 1.672 ± 0.032
GlnPhe: 1.241 ± 0.026
GlnGly: 2.858 ± 0.041
GlnHis: 0.899 ± 0.02
GlnIle: 1.634 ± 0.033
GlnLys: 1.126 ± 0.026
GlnLeu: 4.095 ± 0.049
GlnMet: 1.006 ± 0.023
GlnAsn: 1.035 ± 0.026
GlnPro: 2.207 ± 0.033
GlnGln: 1.95 ± 0.047
GlnArg: 2.97 ± 0.044
GlnSer: 1.853 ± 0.031
GlnThr: 1.779 ± 0.036
GlnVal: 2.999 ± 0.042
GlnTrp: 0.497 ± 0.017
GlnTyr: 0.885 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.974 ± 0.071
ArgCys: 0.493 ± 0.017
ArgAsp: 3.841 ± 0.047
ArgGlu: 3.803 ± 0.05
ArgPhe: 2.799 ± 0.039
ArgGly: 4.555 ± 0.051
ArgHis: 1.792 ± 0.034
ArgIle: 3.733 ± 0.048
ArgLys: 2.194 ± 0.038
ArgLeu: 7.58 ± 0.073
ArgMet: 2.009 ± 0.031
ArgAsn: 2.044 ± 0.036
ArgPro: 3.22 ± 0.046
ArgGln: 2.951 ± 0.041
ArgArg: 5.172 ± 0.067
ArgSer: 3.489 ± 0.042
ArgThr: 3.47 ± 0.048
ArgVal: 4.788 ± 0.05
ArgTrp: 1.047 ± 0.024
ArgTyr: 1.966 ± 0.031
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.479 ± 0.067
SerCys: 0.398 ± 0.017
SerAsp: 2.684 ± 0.039
SerGlu: 2.409 ± 0.04
SerPhe: 2.014 ± 0.034
SerGly: 5.456 ± 0.054
SerHis: 1.128 ± 0.026
SerIle: 2.546 ± 0.039
SerLys: 1.721 ± 0.031
SerLeu: 5.446 ± 0.061
SerMet: 1.415 ± 0.032
SerAsn: 1.669 ± 0.03
SerPro: 2.669 ± 0.032
SerGln: 1.793 ± 0.031
SerArg: 3.27 ± 0.041
SerSer: 3.189 ± 0.053
SerThr: 3.048 ± 0.05
SerVal: 3.898 ± 0.051
SerTrp: 0.784 ± 0.02
SerTyr: 1.404 ± 0.026
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.095 ± 0.061
ThrCys: 0.399 ± 0.015
ThrAsp: 2.597 ± 0.036
ThrGlu: 2.323 ± 0.037
ThrPhe: 1.768 ± 0.033
ThrGly: 4.835 ± 0.064
ThrHis: 1.071 ± 0.02
ThrIle: 2.476 ± 0.035
ThrLys: 1.324 ± 0.039
ThrLeu: 6.061 ± 0.064
ThrMet: 1.244 ± 0.026
ThrAsn: 1.418 ± 0.026
ThrPro: 3.424 ± 0.042
ThrGln: 1.719 ± 0.033
ThrArg: 3.468 ± 0.037
ThrSer: 2.737 ± 0.04
ThrThr: 2.837 ± 0.044
ThrVal: 4.452 ± 0.052
ThrTrp: 0.713 ± 0.02
ThrTyr: 1.257 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.179 ± 0.071
ValCys: 0.65 ± 0.018
ValAsp: 4.253 ± 0.05
ValGlu: 4.029 ± 0.054
ValPhe: 2.596 ± 0.034
ValGly: 5.407 ± 0.057
ValHis: 1.49 ± 0.028
ValIle: 3.281 ± 0.045
ValLys: 2.486 ± 0.043
ValLeu: 7.915 ± 0.079
ValMet: 1.736 ± 0.031
ValAsn: 2.345 ± 0.037
ValPro: 3.912 ± 0.054
ValGln: 2.702 ± 0.04
ValArg: 5.152 ± 0.05
ValSer: 4.119 ± 0.048
ValThr: 4.236 ± 0.045
ValVal: 5.784 ± 0.069
ValTrp: 0.908 ± 0.024
ValTyr: 1.706 ± 0.028
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.213 ± 0.029
TrpCys: 0.135 ± 0.008
TrpAsp: 0.706 ± 0.02
TrpGlu: 0.587 ± 0.017
TrpPhe: 0.566 ± 0.019
TrpGly: 0.869 ± 0.02
TrpHis: 0.388 ± 0.013
TrpIle: 0.698 ± 0.019
TrpLys: 0.504 ± 0.017
TrpLeu: 1.778 ± 0.036
TrpMet: 0.408 ± 0.015
TrpAsn: 0.505 ± 0.018
TrpPro: 0.631 ± 0.018
TrpGln: 0.682 ± 0.019
TrpArg: 1.145 ± 0.021
TrpSer: 0.794 ± 0.021
TrpThr: 0.781 ± 0.023
TrpVal: 0.868 ± 0.021
TrpTrp: 0.248 ± 0.011
TrpTyr: 0.357 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.887 ± 0.042
TyrCys: 0.235 ± 0.011
TyrAsp: 1.454 ± 0.029
TyrGlu: 1.182 ± 0.027
TyrPhe: 0.946 ± 0.024
TyrGly: 2.224 ± 0.037
TyrHis: 0.53 ± 0.018
TyrIle: 0.901 ± 0.023
TyrLys: 0.801 ± 0.018
TyrLeu: 2.55 ± 0.035
TyrMet: 0.493 ± 0.017
TyrAsn: 0.774 ± 0.021
TyrPro: 1.244 ± 0.027
TyrGln: 0.939 ± 0.022
TyrArg: 2.026 ± 0.035
TyrSer: 1.305 ± 0.031
TyrThr: 1.362 ± 0.034
TyrVal: 1.739 ± 0.028
TyrTrp: 0.396 ± 0.016
TyrTyr: 0.736 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5733 proteins (2016752 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
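The exact bootstrap procedure is not described here, so the following Python sketch is only one plausible reading of the note: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over the resamples is reported as the error. Reading "×100" as 100 replicates, the function names, the fixed seed, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. 'AA') in per mille."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the number of proteins.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALAG", "AALLA", "MGAAK", "MLLAAR"]  # hypothetical proteome
    print(dipeptide_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so the spread reflects variation between proteins.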

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski