Amino acid dipeptide frequency for Pantoea cypripedii (Pectobacterium cypripedii) (Erwinia cypripedii)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
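As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, and the choice of overlapping windows that never span two proteins are all assumptions made for this example, and the input sequences are toy data.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a pair never spans two proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences in one-letter code), not proteome data.
toy = ["MAAL", "MALA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

By construction the returned values sum to 1000, which is a quick sanity check for any table of this kind.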

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
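The baseline can be worked out in a few lines. The sketch below derives the uniform expectation and labels a handful of values copied from the table. The exact rule the site uses for colouring is not stated here, so the plain comparison against 2.5‰ and the helper name `classify` are assumptions.

```python
# Uniform baseline: 20 x 20 = 400 equally likely dipeptides.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%


def classify(observed_permille, expected=expected_permille):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected:
        return "over (red)"
    if observed_permille < expected:
        return "under (blue)"
    return "as expected"


# A few values copied from the table, in per mille.
observed = {"AlaAla": 11.508, "LeuLeu": 12.704, "CysCys": 0.177, "TrpTrp": 0.24}
labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, observed[pair], label)
```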

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 11.508‰ corresponds to 0.011508 (about 1.15%).

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error); rows give the first residue, columns the second.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.508 ± 0.117 | 1.053 ± 0.025 | 5.275 ± 0.051 | 6.004 ± 0.064 | 3.643 ± 0.05 | 8.161 ± 0.088 | 1.915 ± 0.035 | 5.918 ± 0.06 | 3.451 ± 0.051 | 12.767 ± 0.112 | 3.025 ± 0.041 | 2.912 ± 0.038 | 3.838 ± 0.053 | 5.118 ± 0.058 | 5.919 ± 0.067 | 6.089 ± 0.07 | 5.065 ± 0.066 | 7.007 ± 0.07 | 1.705 ± 0.035 | 2.003 ± 0.032 | 0.0 ± 0.0 |
| Cys | 0.977 ± 0.024 | 0.177 ± 0.011 | 0.558 ± 0.017 | 0.5 ± 0.015 | 0.404 ± 0.014 | 0.995 ± 0.023 | 0.309 ± 0.015 | 0.522 ± 0.018 | 0.264 ± 0.01 | 1.014 ± 0.024 | 0.231 ± 0.012 | 0.302 ± 0.013 | 0.449 ± 0.015 | 0.493 ± 0.016 | 0.598 ± 0.017 | 0.616 ± 0.017 | 0.443 ± 0.016 | 0.689 ± 0.019 | 0.166 ± 0.01 | 0.304 ± 0.013 | 0.0 ± 0.0 |
| Asp | 5.448 ± 0.059 | 0.488 ± 0.019 | 2.945 ± 0.045 | 3.35 ± 0.05 | 2.2 ± 0.037 | 3.758 ± 0.064 | 1.101 ± 0.028 | 3.251 ± 0.042 | 2.239 ± 0.043 | 4.849 ± 0.049 | 1.289 ± 0.027 | 2.126 ± 0.038 | 2.296 ± 0.033 | 1.924 ± 0.037 | 2.906 ± 0.044 | 2.895 ± 0.041 | 2.473 ± 0.049 | 3.791 ± 0.044 | 0.858 ± 0.024 | 1.831 ± 0.029 | 0.001 ± 0.001 |
| Glu | 5.247 ± 0.064 | 0.397 ± 0.016 | 2.283 ± 0.04 | 3.049 ± 0.044 | 1.781 ± 0.031 | 3.355 ± 0.049 | 1.282 ± 0.029 | 3.129 ± 0.052 | 2.865 ± 0.051 | 5.71 ± 0.063 | 1.643 ± 0.031 | 2.211 ± 0.041 | 2.039 ± 0.035 | 3.379 ± 0.046 | 3.364 ± 0.048 | 2.829 ± 0.043 | 2.731 ± 0.042 | 3.644 ± 0.047 | 0.757 ± 0.023 | 1.276 ± 0.028 | 0.0 ± 0.0 |
| Phe | 3.843 ± 0.049 | 0.488 ± 0.019 | 2.316 ± 0.04 | 1.571 ± 0.031 | 1.652 ± 0.036 | 3.101 ± 0.046 | 0.836 ± 0.024 | 2.438 ± 0.037 | 1.124 ± 0.03 | 3.395 ± 0.055 | 0.946 ± 0.025 | 1.701 ± 0.029 | 1.655 ± 0.03 | 1.271 ± 0.024 | 1.985 ± 0.034 | 3.215 ± 0.037 | 2.348 ± 0.043 | 2.465 ± 0.04 | 0.644 ± 0.019 | 1.157 ± 0.026 | 0.0 ± 0.0 |
| Gly | 6.462 ± 0.075 | 0.951 ± 0.022 | 3.706 ± 0.052 | 4.403 ± 0.052 | 3.327 ± 0.044 | 5.388 ± 0.074 | 1.704 ± 0.034 | 4.825 ± 0.06 | 3.65 ± 0.055 | 7.667 ± 0.068 | 2.402 ± 0.041 | 2.747 ± 0.055 | 2.132 ± 0.037 | 3.002 ± 0.047 | 3.892 ± 0.046 | 4.475 ± 0.072 | 3.812 ± 0.079 | 5.714 ± 0.055 | 1.38 ± 0.03 | 2.549 ± 0.04 | 0.0 ± 0.0 |
| His | 2.127 ± 0.039 | 0.332 ± 0.014 | 1.262 ± 0.029 | 1.087 ± 0.027 | 1.084 ± 0.028 | 1.759 ± 0.031 | 0.799 ± 0.02 | 1.288 ± 0.027 | 0.72 ± 0.02 | 2.418 ± 0.036 | 0.546 ± 0.017 | 0.823 ± 0.021 | 1.415 ± 0.027 | 1.365 ± 0.028 | 1.291 ± 0.027 | 1.389 ± 0.028 | 1.073 ± 0.022 | 1.155 ± 0.023 | 0.467 ± 0.017 | 0.896 ± 0.023 | 0.0 ± 0.0 |
| Ile | 6.531 ± 0.059 | 0.641 ± 0.019 | 3.468 ± 0.046 | 3.114 ± 0.043 | 1.89 ± 0.032 | 4.481 ± 0.053 | 1.157 ± 0.028 | 3.163 ± 0.044 | 2.12 ± 0.035 | 4.713 ± 0.052 | 1.209 ± 0.026 | 2.496 ± 0.038 | 2.627 ± 0.035 | 1.873 ± 0.031 | 3.02 ± 0.047 | 3.94 ± 0.055 | 3.506 ± 0.049 | 3.631 ± 0.045 | 0.7 ± 0.018 | 1.474 ± 0.024 | 0.0 ± 0.0 |
| Lys | 3.852 ± 0.052 | 0.216 ± 0.011 | 1.843 ± 0.038 | 2.0 ± 0.035 | 1.004 ± 0.028 | 2.663 ± 0.043 | 0.78 ± 0.021 | 2.175 ± 0.038 | 1.983 ± 0.039 | 3.821 ± 0.053 | 1.091 ± 0.026 | 1.559 ± 0.031 | 1.943 ± 0.036 | 1.865 ± 0.037 | 2.296 ± 0.031 | 2.217 ± 0.037 | 2.332 ± 0.04 | 2.713 ± 0.049 | 0.442 ± 0.015 | 1.003 ± 0.03 | 0.0 ± 0.0 |
| Leu | 12.363 ± 0.106 | 1.21 ± 0.026 | 5.369 ± 0.058 | 5.216 ± 0.066 | 4.27 ± 0.053 | 7.415 ± 0.074 | 2.452 ± 0.04 | 5.888 ± 0.061 | 4.225 ± 0.05 | 12.704 ± 0.133 | 3.029 ± 0.038 | 4.247 ± 0.055 | 6.014 ± 0.064 | 4.88 ± 0.061 | 6.478 ± 0.071 | 7.742 ± 0.073 | 6.567 ± 0.07 | 7.166 ± 0.076 | 1.432 ± 0.036 | 2.517 ± 0.043 | 0.0 ± 0.0 |
| Met | 2.875 ± 0.04 | 0.187 ± 0.009 | 1.14 ± 0.024 | 1.116 ± 0.022 | 0.832 ± 0.024 | 1.792 ± 0.033 | 0.516 ± 0.016 | 1.436 ± 0.03 | 1.392 ± 0.03 | 3.177 ± 0.05 | 0.87 ± 0.021 | 1.041 ± 0.022 | 1.343 ± 0.026 | 1.331 ± 0.027 | 1.544 ± 0.028 | 1.909 ± 0.034 | 1.765 ± 0.03 | 1.891 ± 0.031 | 0.273 ± 0.013 | 0.478 ± 0.018 | 0.001 ± 0.0 |
| Asn | 3.538 ± 0.041 | 0.346 ± 0.013 | 2.033 ± 0.045 | 1.705 ± 0.036 | 1.333 ± 0.029 | 2.906 ± 0.047 | 0.842 ± 0.019 | 2.192 ± 0.034 | 1.348 ± 0.029 | 3.575 ± 0.045 | 0.829 ± 0.022 | 1.555 ± 0.036 | 2.094 ± 0.035 | 1.722 ± 0.04 | 1.983 ± 0.032 | 2.026 ± 0.043 | 1.897 ± 0.038 | 2.436 ± 0.038 | 0.588 ± 0.019 | 1.089 ± 0.028 | 0.0 ± 0.0 |
| Pro | 5.15 ± 0.058 | 0.357 ± 0.014 | 2.946 ± 0.045 | 3.28 ± 0.047 | 1.821 ± 0.036 | 3.725 ± 0.046 | 1.134 ± 0.025 | 1.864 ± 0.033 | 1.272 ± 0.03 | 5.322 ± 0.06 | 1.112 ± 0.021 | 1.263 ± 0.026 | 1.781 ± 0.032 | 2.532 ± 0.042 | 2.092 ± 0.039 | 2.253 ± 0.035 | 2.129 ± 0.035 | 3.912 ± 0.051 | 0.782 ± 0.021 | 1.184 ± 0.028 | 0.0 ± 0.0 |
| Gln | 5.136 ± 0.068 | 0.358 ± 0.014 | 2.016 ± 0.033 | 2.063 ± 0.038 | 1.545 ± 0.029 | 3.442 ± 0.057 | 1.529 ± 0.03 | 2.316 ± 0.038 | 1.647 ± 0.034 | 5.815 ± 0.068 | 1.275 ± 0.029 | 1.462 ± 0.028 | 2.681 ± 0.042 | 4.414 ± 0.108 | 3.621 ± 0.056 | 2.507 ± 0.043 | 2.299 ± 0.036 | 3.199 ± 0.043 | 0.728 ± 0.021 | 1.145 ± 0.027 | 0.0 ± 0.0 |
| Arg | 4.967 ± 0.051 | 0.575 ± 0.019 | 3.14 ± 0.046 | 3.649 ± 0.053 | 2.592 ± 0.035 | 3.459 ± 0.045 | 1.669 ± 0.03 | 3.326 ± 0.044 | 2.178 ± 0.041 | 6.767 ± 0.073 | 1.57 ± 0.031 | 2.105 ± 0.038 | 2.264 ± 0.036 | 3.371 ± 0.05 | 3.585 ± 0.053 | 2.967 ± 0.042 | 2.578 ± 0.039 | 3.954 ± 0.046 | 1.06 ± 0.021 | 2.132 ± 0.033 | 0.0 ± 0.0 |
| Ser | 6.234 ± 0.064 | 0.557 ± 0.02 | 3.317 ± 0.045 | 3.149 ± 0.042 | 2.392 ± 0.04 | 5.634 ± 0.066 | 1.499 ± 0.029 | 2.906 ± 0.046 | 1.893 ± 0.036 | 7.1 ± 0.063 | 1.559 ± 0.025 | 1.964 ± 0.041 | 2.742 ± 0.042 | 2.764 ± 0.044 | 3.494 ± 0.038 | 3.768 ± 0.057 | 3.091 ± 0.052 | 4.342 ± 0.058 | 1.124 ± 0.023 | 1.661 ± 0.031 | 0.0 ± 0.0 |
| Thr | 5.261 ± 0.073 | 0.443 ± 0.015 | 2.666 ± 0.054 | 2.461 ± 0.037 | 1.99 ± 0.035 | 4.486 ± 0.065 | 1.274 ± 0.028 | 2.625 ± 0.043 | 1.379 ± 0.03 | 7.646 ± 0.077 | 1.066 ± 0.026 | 1.465 ± 0.037 | 3.392 ± 0.053 | 2.333 ± 0.042 | 3.126 ± 0.034 | 3.049 ± 0.054 | 2.905 ± 0.079 | 3.886 ± 0.077 | 0.77 ± 0.021 | 1.133 ± 0.025 | 0.0 ± 0.0 |
| Val | 7.417 ± 0.078 | 0.708 ± 0.02 | 3.559 ± 0.045 | 3.535 ± 0.042 | 2.533 ± 0.044 | 4.602 ± 0.058 | 1.264 ± 0.027 | 4.527 ± 0.051 | 2.746 ± 0.041 | 7.35 ± 0.078 | 2.246 ± 0.037 | 2.601 ± 0.04 | 3.109 ± 0.039 | 2.536 ± 0.037 | 3.773 ± 0.046 | 4.736 ± 0.059 | 4.384 ± 0.069 | 5.355 ± 0.06 | 0.94 ± 0.025 | 1.613 ± 0.03 | 0.0 ± 0.0 |
| Trp | 0.979 ± 0.023 | 0.183 ± 0.01 | 0.654 ± 0.02 | 0.58 ± 0.018 | 0.703 ± 0.02 | 0.95 ± 0.024 | 0.505 ± 0.018 | 0.73 ± 0.024 | 0.473 ± 0.016 | 2.431 ± 0.042 | 0.422 ± 0.017 | 0.498 ± 0.019 | 0.748 ± 0.019 | 1.395 ± 0.033 | 1.125 ± 0.025 | 0.915 ± 0.028 | 0.608 ± 0.018 | 0.939 ± 0.022 | 0.24 ± 0.013 | 0.422 ± 0.015 | 0.0 ± 0.0 |
| Tyr | 2.462 ± 0.035 | 0.346 ± 0.014 | 1.524 ± 0.034 | 1.139 ± 0.025 | 1.151 ± 0.028 | 2.121 ± 0.037 | 0.726 ± 0.023 | 1.234 ± 0.028 | 0.826 ± 0.024 | 2.945 ± 0.038 | 0.55 ± 0.017 | 0.956 ± 0.024 | 1.346 ± 0.027 | 1.631 ± 0.032 | 1.818 ± 0.031 | 1.653 ± 0.032 | 1.35 ± 0.029 | 1.647 ± 0.027 | 0.443 ± 0.017 | 0.835 ± 0.025 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5848 proteins (1873892 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
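One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only an interpretation. The helper names `pair_counts` and `bootstrap_error`, the use of the standard deviation as the error measure, and the toy sequences are assumptions, not details taken from Proteome-pI.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    keeping the number of proteins fixed.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences in one-letter code), not proteome data.
toy = ["MAAL", "MALA", "AAAA", "MKLV"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.1f} ± {err:.1f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.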

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski