Amino acid dipepetide frequency for Planctomicrobium piriforme

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.759AlaAla: 12.759 ± 0.123
1.259AlaCys: 1.259 ± 0.034
5.55AlaAsp: 5.55 ± 0.066
6.273AlaGlu: 6.273 ± 0.075
3.418AlaPhe: 3.418 ± 0.051
8.154AlaGly: 8.154 ± 0.089
1.719AlaHis: 1.719 ± 0.033
5.078AlaIle: 5.078 ± 0.058
3.678AlaLys: 3.678 ± 0.054
9.233AlaLeu: 9.233 ± 0.079
2.328AlaMet: 2.328 ± 0.046
3.038AlaAsn: 3.038 ± 0.055
4.683AlaPro: 4.683 ± 0.069
3.76AlaGln: 3.76 ± 0.055
5.786AlaArg: 5.786 ± 0.062
5.684AlaSer: 5.684 ± 0.07
5.44AlaThr: 5.44 ± 0.07
7.58AlaVal: 7.58 ± 0.081
1.463AlaTrp: 1.463 ± 0.03
2.157AlaTyr: 2.157 ± 0.032
0.0AlaXaa: 0.0 ± 0.0
Cys
0.891CysAla: 0.891 ± 0.03
0.249CysCys: 0.249 ± 0.014
0.68CysAsp: 0.68 ± 0.022
0.684CysGlu: 0.684 ± 0.02
0.439CysPhe: 0.439 ± 0.018
1.149CysGly: 1.149 ± 0.029
0.389CysHis: 0.389 ± 0.018
0.485CysIle: 0.485 ± 0.015
0.341CysLys: 0.341 ± 0.016
1.361CysLeu: 1.361 ± 0.035
0.193CysMet: 0.193 ± 0.011
0.301CysAsn: 0.301 ± 0.012
0.692CysPro: 0.692 ± 0.021
0.483CysGln: 0.483 ± 0.016
0.877CysArg: 0.877 ± 0.024
0.677CysSer: 0.677 ± 0.02
0.554CysThr: 0.554 ± 0.018
0.846CysVal: 0.846 ± 0.025
0.213CysTrp: 0.213 ± 0.01
0.297CysTyr: 0.297 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.609AspAla: 5.609 ± 0.071
0.547AspCys: 0.547 ± 0.017
3.124AspAsp: 3.124 ± 0.05
3.601AspGlu: 3.601 ± 0.054
2.259AspPhe: 2.259 ± 0.045
4.958AspGly: 4.958 ± 0.088
1.254AspHis: 1.254 ± 0.028
2.316AspIle: 2.316 ± 0.039
1.639AspLys: 1.639 ± 0.038
5.778AspLeu: 5.778 ± 0.064
0.944AspMet: 0.944 ± 0.025
1.383AspAsn: 1.383 ± 0.029
3.408AspPro: 3.408 ± 0.05
2.343AspGln: 2.343 ± 0.039
3.754AspArg: 3.754 ± 0.048
3.099AspSer: 3.099 ± 0.064
2.067AspThr: 2.067 ± 0.049
4.212AspVal: 4.212 ± 0.048
1.063AspTrp: 1.063 ± 0.026
1.464AspTyr: 1.464 ± 0.029
0.0AspXaa: 0.0 ± 0.0
Glu
5.325GluAla: 5.325 ± 0.066
0.539GluCys: 0.539 ± 0.018
2.5GluAsp: 2.5 ± 0.039
3.475GluGlu: 3.475 ± 0.06
2.624GluPhe: 2.624 ± 0.046
3.419GluGly: 3.419 ± 0.044
1.34GluHis: 1.34 ± 0.027
3.508GluIle: 3.508 ± 0.047
2.69GluLys: 2.69 ± 0.054
6.88GluLeu: 6.88 ± 0.08
1.534GluMet: 1.534 ± 0.03
1.869GluAsn: 1.869 ± 0.038
2.754GluPro: 2.754 ± 0.04
3.222GluGln: 3.222 ± 0.053
4.178GluArg: 4.178 ± 0.067
3.51GluSer: 3.51 ± 0.048
3.524GluThr: 3.524 ± 0.056
3.948GluVal: 3.948 ± 0.046
0.83GluTrp: 0.83 ± 0.024
1.396GluTyr: 1.396 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.725PheAla: 3.725 ± 0.048
0.526PheCys: 0.526 ± 0.018
2.595PheAsp: 2.595 ± 0.044
2.279PheGlu: 2.279 ± 0.041
1.394PhePhe: 1.394 ± 0.031
3.186PheGly: 3.186 ± 0.05
0.863PheHis: 0.863 ± 0.022
1.52PheIle: 1.52 ± 0.03
1.098PheLys: 1.098 ± 0.026
3.794PheLeu: 3.794 ± 0.054
0.694PheMet: 0.694 ± 0.019
1.305PheAsn: 1.305 ± 0.035
1.903PhePro: 1.903 ± 0.039
1.577PheGln: 1.577 ± 0.032
2.425PheArg: 2.425 ± 0.044
2.609PheSer: 2.609 ± 0.043
2.126PheThr: 2.126 ± 0.052
2.843PheVal: 2.843 ± 0.044
0.606PheTrp: 0.606 ± 0.019
0.968PheTyr: 0.968 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
6.107GlyAla: 6.107 ± 0.081
1.012GlyCys: 1.012 ± 0.027
4.205GlyAsp: 4.205 ± 0.065
4.564GlyGlu: 4.564 ± 0.058
3.048GlyPhe: 3.048 ± 0.044
6.43GlyGly: 6.43 ± 0.105
1.661GlyHis: 1.661 ± 0.036
4.029GlyIle: 4.029 ± 0.055
3.531GlyLys: 3.531 ± 0.059
7.46GlyLeu: 7.46 ± 0.07
1.951GlyMet: 1.951 ± 0.04
2.851GlyAsn: 2.851 ± 0.076
3.291GlyPro: 3.291 ± 0.054
3.48GlyGln: 3.48 ± 0.052
4.871GlyArg: 4.871 ± 0.053
4.984GlySer: 4.984 ± 0.089
4.953GlyThr: 4.953 ± 0.12
5.54GlyVal: 5.54 ± 0.06
1.39GlyTrp: 1.39 ± 0.034
2.166GlyTyr: 2.166 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.996HisAla: 1.996 ± 0.04
0.322HisCys: 0.322 ± 0.014
1.203HisAsp: 1.203 ± 0.029
1.21HisGlu: 1.21 ± 0.026
0.944HisPhe: 0.944 ± 0.025
1.841HisGly: 1.841 ± 0.038
0.619HisHis: 0.619 ± 0.02
0.847HisIle: 0.847 ± 0.024
0.568HisLys: 0.568 ± 0.02
2.249HisLeu: 2.249 ± 0.044
0.389HisMet: 0.389 ± 0.017
0.602HisAsn: 0.602 ± 0.02
1.512HisPro: 1.512 ± 0.032
0.879HisGln: 0.879 ± 0.02
1.514HisArg: 1.514 ± 0.029
1.241HisSer: 1.241 ± 0.03
0.884HisThr: 0.884 ± 0.025
1.542HisVal: 1.542 ± 0.032
0.412HisTrp: 0.412 ± 0.016
0.588HisTyr: 0.588 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
5.432IleAla: 5.432 ± 0.053
0.693IleCys: 0.693 ± 0.023
3.412IleAsp: 3.412 ± 0.046
3.379IleGlu: 3.379 ± 0.049
1.545IlePhe: 1.545 ± 0.036
4.026IleGly: 4.026 ± 0.061
1.018IleHis: 1.018 ± 0.023
1.919IleIle: 1.919 ± 0.04
1.385IleLys: 1.385 ± 0.03
4.413IleLeu: 4.413 ± 0.057
0.715IleMet: 0.715 ± 0.02
1.51IleAsn: 1.51 ± 0.038
2.824IlePro: 2.824 ± 0.041
1.883IleGln: 1.883 ± 0.031
3.316IleArg: 3.316 ± 0.049
3.154IleSer: 3.154 ± 0.044
2.645IleThr: 2.645 ± 0.054
3.865IleVal: 3.865 ± 0.056
0.675IleTrp: 0.675 ± 0.021
1.141IleTyr: 1.141 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
3.225LysAla: 3.225 ± 0.056
0.329LysCys: 0.329 ± 0.016
1.852LysAsp: 1.852 ± 0.041
2.166LysGlu: 2.166 ± 0.047
1.324LysPhe: 1.324 ± 0.032
2.207LysGly: 2.207 ± 0.039
0.751LysHis: 0.751 ± 0.021
1.931LysIle: 1.931 ± 0.033
1.829LysLys: 1.829 ± 0.05
3.862LysLeu: 3.862 ± 0.052
0.991LysMet: 0.991 ± 0.027
1.289LysAsn: 1.289 ± 0.025
2.283LysPro: 2.283 ± 0.047
1.777LysGln: 1.777 ± 0.033
2.146LysArg: 2.146 ± 0.043
2.471LysSer: 2.471 ± 0.039
2.3LysThr: 2.3 ± 0.046
2.51LysVal: 2.51 ± 0.045
0.558LysTrp: 0.558 ± 0.017
0.924LysTyr: 0.924 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
10.86LeuAla: 10.86 ± 0.099
1.313LeuCys: 1.313 ± 0.033
5.535LeuAsp: 5.535 ± 0.066
5.639LeuGlu: 5.639 ± 0.066
3.724LeuPhe: 3.724 ± 0.05
7.311LeuGly: 7.311 ± 0.068
2.102LeuHis: 2.102 ± 0.043
4.971LeuIle: 4.971 ± 0.056
4.442LeuLys: 4.442 ± 0.064
11.373LeuLeu: 11.373 ± 0.113
2.057LeuMet: 2.057 ± 0.037
3.393LeuAsn: 3.393 ± 0.051
5.997LeuPro: 5.997 ± 0.065
4.623LeuGln: 4.623 ± 0.054
6.55LeuArg: 6.55 ± 0.066
6.838LeuSer: 6.838 ± 0.071
6.406LeuThr: 6.406 ± 0.097
6.879LeuVal: 6.879 ± 0.075
1.389LeuTrp: 1.389 ± 0.035
2.049LeuTyr: 2.049 ± 0.039
0.0LeuXaa: 0.0 ± 0.0
Met
2.107MetAla: 2.107 ± 0.039
0.235MetCys: 0.235 ± 0.013
0.944MetAsp: 0.944 ± 0.023
1.042MetGlu: 1.042 ± 0.024
0.741MetPhe: 0.741 ± 0.021
1.403MetGly: 1.403 ± 0.031
0.458MetHis: 0.458 ± 0.018
1.082MetIle: 1.082 ± 0.028
0.881MetLys: 0.881 ± 0.026
2.345MetLeu: 2.345 ± 0.039
0.491MetMet: 0.491 ± 0.016
0.825MetAsn: 0.825 ± 0.021
1.375MetPro: 1.375 ± 0.026
1.055MetGln: 1.055 ± 0.025
1.337MetArg: 1.337 ± 0.028
1.586MetSer: 1.586 ± 0.029
1.547MetThr: 1.547 ± 0.03
1.297MetVal: 1.297 ± 0.028
0.229MetTrp: 0.229 ± 0.011
0.367MetTyr: 0.367 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.048AsnAla: 3.048 ± 0.053
0.396AsnCys: 0.396 ± 0.015
1.828AsnAsp: 1.828 ± 0.055
1.636AsnGlu: 1.636 ± 0.03
1.261AsnPhe: 1.261 ± 0.029
3.054AsnGly: 3.054 ± 0.076
0.682AsnHis: 0.682 ± 0.02
1.374AsnIle: 1.374 ± 0.033
0.872AsnLys: 0.872 ± 0.019
3.065AsnLeu: 3.065 ± 0.051
0.599AsnMet: 0.599 ± 0.017
1.09AsnAsn: 1.09 ± 0.036
2.116AsnPro: 2.116 ± 0.038
1.317AsnGln: 1.317 ± 0.029
2.106AsnArg: 2.106 ± 0.037
2.059AsnSer: 2.059 ± 0.051
1.608AsnThr: 1.608 ± 0.04
2.39AsnVal: 2.39 ± 0.04
0.667AsnTrp: 0.667 ± 0.02
0.926AsnTyr: 0.926 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
6.203ProAla: 6.203 ± 0.075
0.461ProCys: 0.461 ± 0.019
3.384ProAsp: 3.384 ± 0.049
4.09ProGlu: 4.09 ± 0.057
1.978ProPhe: 1.978 ± 0.032
4.497ProGly: 4.497 ± 0.07
1.174ProHis: 1.174 ± 0.03
2.412ProIle: 2.412 ± 0.036
1.929ProLys: 1.929 ± 0.041
5.063ProLeu: 5.063 ± 0.054
1.038ProMet: 1.038 ± 0.027
1.763ProAsn: 1.763 ± 0.035
3.213ProPro: 3.213 ± 0.058
2.36ProGln: 2.36 ± 0.039
2.908ProArg: 2.908 ± 0.04
3.172ProSer: 3.172 ± 0.047
3.025ProThr: 3.025 ± 0.04
4.606ProVal: 4.606 ± 0.06
0.737ProTrp: 0.737 ± 0.021
1.208ProTyr: 1.208 ± 0.03
0.0ProXaa: 0.0 ± 0.0
Gln
4.702GlnAla: 4.702 ± 0.066
0.44GlnCys: 0.44 ± 0.017
1.717GlnAsp: 1.717 ± 0.034
2.037GlnGlu: 2.037 ± 0.036
1.753GlnPhe: 1.753 ± 0.032
2.923GlnGly: 2.923 ± 0.047
1.008GlnHis: 1.008 ± 0.023
2.355GlnIle: 2.355 ± 0.041
1.698GlnLys: 1.698 ± 0.039
5.035GlnLeu: 5.035 ± 0.058
1.052GlnMet: 1.052 ± 0.026
1.374GlnAsn: 1.374 ± 0.031
2.537GlnPro: 2.537 ± 0.042
2.75GlnGln: 2.75 ± 0.071
3.179GlnArg: 3.179 ± 0.055
2.792GlnSer: 2.792 ± 0.049
2.577GlnThr: 2.577 ± 0.048
3.021GlnVal: 3.021 ± 0.043
0.561GlnTrp: 0.561 ± 0.018
0.955GlnTyr: 0.955 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
4.901ArgAla: 4.901 ± 0.069
0.773ArgCys: 0.773 ± 0.024
3.533ArgAsp: 3.533 ± 0.052
4.098ArgGlu: 4.098 ± 0.065
2.755ArgPhe: 2.755 ± 0.042
4.09ArgGly: 4.09 ± 0.052
1.549ArgHis: 1.549 ± 0.031
3.564ArgIle: 3.564 ± 0.041
2.48ArgLys: 2.48 ± 0.042
7.177ArgLeu: 7.177 ± 0.082
1.682ArgMet: 1.682 ± 0.031
2.025ArgAsn: 2.025 ± 0.039
3.232ArgPro: 3.232 ± 0.05
3.265ArgGln: 3.265 ± 0.051
4.971ArgArg: 4.971 ± 0.07
3.902ArgSer: 3.902 ± 0.05
3.483ArgThr: 3.483 ± 0.044
4.306ArgVal: 4.306 ± 0.051
1.139ArgTrp: 1.139 ± 0.029
1.701ArgTyr: 1.701 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
5.905SerAla: 5.905 ± 0.067
0.586SerCys: 0.586 ± 0.019
3.36SerAsp: 3.36 ± 0.052
3.474SerGlu: 3.474 ± 0.042
2.304SerPhe: 2.304 ± 0.049
5.632SerGly: 5.632 ± 0.096
1.331SerHis: 1.331 ± 0.031
2.987SerIle: 2.987 ± 0.046
2.0SerLys: 2.0 ± 0.035
6.595SerLeu: 6.595 ± 0.073
1.313SerMet: 1.313 ± 0.033
2.036SerAsn: 2.036 ± 0.052
3.85SerPro: 3.85 ± 0.047
2.818SerGln: 2.818 ± 0.046
3.993SerArg: 3.993 ± 0.058
4.357SerSer: 4.357 ± 0.092
3.64SerThr: 3.64 ± 0.057
4.311SerVal: 4.311 ± 0.062
0.896SerTrp: 0.896 ± 0.023
1.43SerTyr: 1.43 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
6.046ThrAla: 6.046 ± 0.077
0.563ThrCys: 0.563 ± 0.023
3.076ThrAsp: 3.076 ± 0.06
2.837ThrGlu: 2.837 ± 0.042
2.282ThrPhe: 2.282 ± 0.049
5.214ThrGly: 5.214 ± 0.086
1.027ThrHis: 1.027 ± 0.024
2.947ThrIle: 2.947 ± 0.059
1.754ThrLys: 1.754 ± 0.031
6.04ThrLeu: 6.04 ± 0.087
1.071ThrMet: 1.071 ± 0.025
1.672ThrAsn: 1.672 ± 0.037
3.701ThrPro: 3.701 ± 0.052
2.072ThrGln: 2.072 ± 0.035
3.108ThrArg: 3.108 ± 0.043
3.49ThrSer: 3.49 ± 0.06
3.287ThrThr: 3.287 ± 0.067
4.488ThrVal: 4.488 ± 0.088
0.852ThrTrp: 0.852 ± 0.024
1.412ThrTyr: 1.412 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
6.999ValAla: 6.999 ± 0.072
1.007ValCys: 1.007 ± 0.025
4.08ValAsp: 4.08 ± 0.055
4.433ValGlu: 4.433 ± 0.053
2.592ValPhe: 2.592 ± 0.047
4.922ValGly: 4.922 ± 0.061
1.421ValHis: 1.421 ± 0.03
3.946ValIle: 3.946 ± 0.049
2.48ValLys: 2.48 ± 0.045
7.389ValLeu: 7.389 ± 0.069
1.502ValMet: 1.502 ± 0.032
2.409ValAsn: 2.409 ± 0.055
3.912ValPro: 3.912 ± 0.052
2.861ValGln: 2.861 ± 0.042
4.738ValArg: 4.738 ± 0.059
4.662ValSer: 4.662 ± 0.068
4.732ValThr: 4.732 ± 0.088
5.633ValVal: 5.633 ± 0.07
1.067ValTrp: 1.067 ± 0.026
1.62ValTyr: 1.62 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
1.096TrpAla: 1.096 ± 0.026
0.213TrpCys: 0.213 ± 0.011
0.706TrpAsp: 0.706 ± 0.024
0.784TrpGlu: 0.784 ± 0.025
0.583TrpPhe: 0.583 ± 0.019
1.043TrpGly: 1.043 ± 0.028
0.389TrpHis: 0.389 ± 0.014
0.882TrpIle: 0.882 ± 0.025
0.755TrpLys: 0.755 ± 0.021
1.864TrpLeu: 1.864 ± 0.037
0.426TrpMet: 0.426 ± 0.017
0.648TrpAsn: 0.648 ± 0.02
0.722TrpPro: 0.722 ± 0.021
0.83TrpGln: 0.83 ± 0.02
0.97TrpArg: 0.97 ± 0.024
1.006TrpSer: 1.006 ± 0.024
0.966TrpThr: 0.966 ± 0.03
0.901TrpVal: 0.901 ± 0.026
0.273TrpTrp: 0.273 ± 0.014
0.378TrpTyr: 0.378 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.102TyrAla: 2.102 ± 0.035
0.339TyrCys: 0.339 ± 0.014
1.538TyrAsp: 1.538 ± 0.047
1.363TyrGlu: 1.363 ± 0.027
1.065TyrPhe: 1.065 ± 0.028
2.064TyrGly: 2.064 ± 0.043
0.608TyrHis: 0.608 ± 0.02
0.795TyrIle: 0.795 ± 0.022
0.676TyrLys: 0.676 ± 0.018
2.475TyrLeu: 2.475 ± 0.043
0.376TyrMet: 0.376 ± 0.016
0.714TyrAsn: 0.714 ± 0.023
1.218TyrPro: 1.218 ± 0.032
1.11TyrGln: 1.11 ± 0.023
1.901TyrArg: 1.901 ± 0.034
1.53TyrSer: 1.53 ± 0.036
1.174TyrThr: 1.174 ± 0.034
1.684TyrVal: 1.684 ± 0.027
0.413TyrTrp: 0.413 ± 0.015
0.729TyrTyr: 0.729 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5057 proteins (1801768 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski