Amino acid dipeptide frequency for Pseudomonas pharmacofabricae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
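
As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own pipeline: the function name `dipeptide_per_mille`, the one-letter input alphabet, the mapping of unknown residues to `X` (standing in for Xaa), and the toy sequences are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Pairs are counted within each protein only, so no pair spans two proteins.
    Any residue outside the standard 20 is mapped to 'X' (standing in for Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "ALB"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

In the toy input the pair `AL` occurs twice among five pairs in total, and the non-standard `B` is counted as `X`.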

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
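
The comparison against the uniform expectation can be sketched as follows. The observed values are copied from the table on this page. The function `classify` and the plain greater/less rule are assumptions: the page does not state whether its colouring also takes the error margin into account.

```python
# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or underrepresented (assumed simple rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Observed frequencies in per mille, taken from the table on this page.
examples = {"AlaAla": 14.128, "LeuLeu": 17.828, "CysCys": 0.162, "TrpTrp": 0.235}

for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE  # fold change relative to 2.5 per mille
    print(f"{name}: {classify(value)} (observed/expected = {ratio:.2f})")
```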

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
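
A minimal sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


ala_ala = 14.128  # AlaAla frequency in per mille, from the table on this page
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```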

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.128 ± 0.162
AlaCys: 1.545 ± 0.044
AlaAsp: 6.035 ± 0.087
AlaGlu: 8.443 ± 0.113
AlaPhe: 3.554 ± 0.066
AlaGly: 9.619 ± 0.106
AlaHis: 2.325 ± 0.05
AlaIle: 4.913 ± 0.078
AlaLys: 3.276 ± 0.078
AlaLeu: 15.386 ± 0.156
AlaMet: 2.837 ± 0.052
AlaAsn: 2.811 ± 0.054
AlaPro: 4.671 ± 0.084
AlaGln: 6.33 ± 0.109
AlaArg: 8.385 ± 0.108
AlaSer: 6.594 ± 0.085
AlaThr: 4.207 ± 0.074
AlaVal: 7.418 ± 0.096
AlaTrp: 1.863 ± 0.046
AlaTyr: 2.522 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.32 ± 0.043
CysCys: 0.162 ± 0.013
CysAsp: 0.506 ± 0.024
CysGlu: 0.582 ± 0.026
CysPhe: 0.365 ± 0.021
CysGly: 1.008 ± 0.036
CysHis: 0.309 ± 0.019
CysIle: 0.463 ± 0.022
CysLys: 0.293 ± 0.016
CysLeu: 1.459 ± 0.043
CysMet: 0.201 ± 0.014
CysAsn: 0.298 ± 0.017
CysPro: 0.58 ± 0.025
CysGln: 0.652 ± 0.03
CysArg: 0.75 ± 0.027
CysSer: 0.693 ± 0.029
CysThr: 0.434 ± 0.024
CysVal: 0.708 ± 0.03
CysTrp: 0.179 ± 0.014
CysTyr: 0.268 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.385 ± 0.076
AspCys: 0.659 ± 0.029
AspAsp: 2.671 ± 0.059
AspGlu: 3.649 ± 0.07
AspPhe: 1.909 ± 0.043
AspGly: 4.106 ± 0.066
AspHis: 0.986 ± 0.033
AspIle: 2.479 ± 0.057
AspLys: 1.774 ± 0.055
AspLeu: 5.846 ± 0.085
AspMet: 1.23 ± 0.036
AspAsn: 1.649 ± 0.049
AspPro: 2.571 ± 0.05
AspGln: 2.074 ± 0.044
AspArg: 2.731 ± 0.053
AspSer: 3.234 ± 0.061
AspThr: 1.982 ± 0.045
AspVal: 3.083 ± 0.064
AspTrp: 1.046 ± 0.035
AspTyr: 1.731 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.108 ± 0.112
GluCys: 0.444 ± 0.022
GluAsp: 2.45 ± 0.052
GluGlu: 3.531 ± 0.076
GluPhe: 1.758 ± 0.048
GluGly: 3.813 ± 0.074
GluHis: 1.62 ± 0.039
GluIle: 2.693 ± 0.055
GluLys: 1.974 ± 0.051
GluLeu: 7.845 ± 0.098
GluMet: 1.291 ± 0.038
GluAsn: 1.345 ± 0.039
GluPro: 2.385 ± 0.061
GluGln: 5.29 ± 0.086
GluArg: 5.152 ± 0.089
GluSer: 2.48 ± 0.059
GluThr: 2.183 ± 0.053
GluVal: 4.223 ± 0.067
GluTrp: 0.717 ± 0.024
GluTyr: 1.19 ± 0.034
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.1 ± 0.073
PheCys: 0.47 ± 0.023
PheAsp: 2.177 ± 0.051
PheGlu: 1.877 ± 0.043
PhePhe: 1.299 ± 0.043
PheGly: 2.75 ± 0.051
PheHis: 0.664 ± 0.027
PheIle: 1.592 ± 0.041
PheLys: 1.051 ± 0.036
PheLeu: 3.174 ± 0.063
PheMet: 0.731 ± 0.03
PheAsn: 1.206 ± 0.036
PhePro: 1.284 ± 0.037
PheGln: 1.254 ± 0.037
PheArg: 1.853 ± 0.044
PheSer: 2.399 ± 0.058
PheThr: 1.652 ± 0.042
PheVal: 2.263 ± 0.05
PheTrp: 0.459 ± 0.023
PheTyr: 0.947 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.914 ± 0.092
GlyCys: 1.036 ± 0.033
GlyAsp: 3.645 ± 0.064
GlyGlu: 5.15 ± 0.081
GlyPhe: 2.922 ± 0.058
GlyGly: 5.528 ± 0.097
GlyHis: 1.864 ± 0.048
GlyIle: 3.778 ± 0.065
GlyLys: 3.136 ± 0.063
GlyLeu: 9.863 ± 0.117
GlyMet: 2.136 ± 0.047
GlyAsn: 2.283 ± 0.058
GlyPro: 2.331 ± 0.053
GlyGln: 4.445 ± 0.084
GlyArg: 5.099 ± 0.074
GlySer: 4.445 ± 0.079
GlyThr: 3.104 ± 0.061
GlyVal: 5.431 ± 0.087
GlyTrp: 1.318 ± 0.039
GlyTyr: 2.372 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.541 ± 0.062
HisCys: 0.373 ± 0.018
HisAsp: 1.141 ± 0.031
HisGlu: 1.194 ± 0.038
HisPhe: 0.947 ± 0.031
HisGly: 2.018 ± 0.053
HisHis: 0.548 ± 0.023
HisIle: 0.999 ± 0.036
HisLys: 0.665 ± 0.027
HisLeu: 2.732 ± 0.056
HisMet: 0.515 ± 0.022
HisAsn: 0.685 ± 0.03
HisPro: 1.368 ± 0.04
HisGln: 1.011 ± 0.036
HisArg: 1.306 ± 0.041
HisSer: 1.488 ± 0.042
HisThr: 0.948 ± 0.03
HisVal: 1.286 ± 0.036
HisTrp: 0.537 ± 0.026
HisTyr: 0.817 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.373 ± 0.08
IleCys: 0.433 ± 0.021
IleAsp: 3.007 ± 0.059
IleGlu: 3.27 ± 0.063
IlePhe: 1.178 ± 0.038
IleGly: 3.835 ± 0.063
IleHis: 0.93 ± 0.032
IleIle: 1.875 ± 0.053
IleLys: 1.501 ± 0.05
IleLeu: 3.994 ± 0.066
IleMet: 0.736 ± 0.03
IleAsn: 1.556 ± 0.047
IlePro: 1.997 ± 0.045
IleGln: 1.596 ± 0.04
IleArg: 2.909 ± 0.05
IleSer: 2.77 ± 0.055
IleThr: 2.141 ± 0.053
IleVal: 2.689 ± 0.064
IleTrp: 0.402 ± 0.021
IleTyr: 0.996 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.656 ± 0.087
LysCys: 0.196 ± 0.014
LysAsp: 1.426 ± 0.043
LysGlu: 1.461 ± 0.044
LysPhe: 0.745 ± 0.03
LysGly: 2.253 ± 0.057
LysHis: 0.657 ± 0.026
LysIle: 1.344 ± 0.044
LysLys: 1.169 ± 0.052
LysLeu: 3.34 ± 0.07
LysMet: 0.591 ± 0.025
LysAsn: 0.884 ± 0.034
LysPro: 1.821 ± 0.045
LysGln: 1.531 ± 0.042
LysArg: 2.291 ± 0.053
LysSer: 1.543 ± 0.054
LysThr: 1.546 ± 0.047
LysVal: 2.425 ± 0.054
LysTrp: 0.271 ± 0.016
LysTyr: 0.597 ± 0.026
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 17.099 ± 0.21
LeuCys: 1.546 ± 0.049
LeuAsp: 7.18 ± 0.1
LeuGlu: 6.61 ± 0.095
LeuPhe: 4.336 ± 0.073
LeuGly: 9.75 ± 0.108
LeuHis: 3.009 ± 0.063
LeuIle: 5.129 ± 0.072
LeuLys: 3.838 ± 0.064
LeuLeu: 17.828 ± 0.275
LeuMet: 2.304 ± 0.058
LeuAsn: 3.369 ± 0.059
LeuPro: 7.112 ± 0.1
LeuGln: 7.684 ± 0.136
LeuArg: 9.041 ± 0.105
LeuSer: 7.194 ± 0.085
LeuThr: 4.993 ± 0.069
LeuVal: 7.849 ± 0.103
LeuTrp: 1.681 ± 0.051
LeuTyr: 2.813 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.654 ± 0.056
MetCys: 0.152 ± 0.011
MetAsp: 0.922 ± 0.03
MetGlu: 0.884 ± 0.035
MetPhe: 0.554 ± 0.024
MetGly: 1.534 ± 0.04
MetHis: 0.517 ± 0.022
MetIle: 0.88 ± 0.033
MetLys: 0.737 ± 0.032
MetLeu: 2.604 ± 0.058
MetMet: 0.381 ± 0.02
MetAsn: 0.696 ± 0.029
MetPro: 1.301 ± 0.037
MetGln: 1.266 ± 0.037
MetArg: 1.555 ± 0.04
MetSer: 1.582 ± 0.043
MetThr: 1.156 ± 0.032
MetVal: 1.316 ± 0.036
MetTrp: 0.165 ± 0.014
MetTyr: 0.336 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.894 ± 0.064
AsnCys: 0.283 ± 0.017
AsnAsp: 1.369 ± 0.045
AsnGlu: 1.324 ± 0.042
AsnPhe: 0.89 ± 0.033
AsnGly: 2.169 ± 0.055
AsnHis: 0.594 ± 0.023
AsnIle: 1.241 ± 0.04
AsnLys: 0.802 ± 0.031
AsnLeu: 3.453 ± 0.06
AsnMet: 0.519 ± 0.027
AsnAsn: 0.745 ± 0.031
AsnPro: 1.914 ± 0.045
AsnGln: 1.372 ± 0.042
AsnArg: 1.798 ± 0.051
AsnSer: 1.464 ± 0.042
AsnThr: 1.19 ± 0.034
AsnVal: 1.621 ± 0.046
AsnTrp: 0.365 ± 0.019
AsnTyr: 0.719 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.272 ± 0.096
ProCys: 0.461 ± 0.024
ProAsp: 2.388 ± 0.048
ProGlu: 3.038 ± 0.056
ProPhe: 1.658 ± 0.044
ProGly: 4.096 ± 0.065
ProHis: 1.03 ± 0.032
ProIle: 1.659 ± 0.045
ProLys: 1.165 ± 0.037
ProLeu: 6.403 ± 0.098
ProMet: 1.094 ± 0.031
ProAsn: 1.125 ± 0.035
ProPro: 2.051 ± 0.065
ProGln: 2.731 ± 0.059
ProArg: 3.002 ± 0.069
ProSer: 2.523 ± 0.062
ProThr: 1.758 ± 0.042
ProVal: 3.507 ± 0.066
ProTrp: 0.791 ± 0.029
ProTyr: 1.154 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 8.763 ± 0.148
GlnCys: 0.447 ± 0.021
GlnAsp: 2.046 ± 0.05
GlnGlu: 2.342 ± 0.052
GlnPhe: 1.504 ± 0.038
GlnGly: 4.322 ± 0.082
GlnHis: 1.582 ± 0.045
GlnIle: 2.108 ± 0.048
GlnLys: 1.059 ± 0.039
GlnLeu: 7.967 ± 0.141
GlnMet: 1.062 ± 0.034
GlnAsn: 0.953 ± 0.033
GlnPro: 3.169 ± 0.066
GlnGln: 4.519 ± 0.119
GlnArg: 5.344 ± 0.11
GlnSer: 2.276 ± 0.057
GlnThr: 1.895 ± 0.044
GlnVal: 4.306 ± 0.065
GlnTrp: 0.86 ± 0.028
GlnTyr: 1.079 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.809 ± 0.098
ArgCys: 0.733 ± 0.027
ArgAsp: 3.695 ± 0.059
ArgGlu: 4.806 ± 0.076
ArgPhe: 2.678 ± 0.062
ArgGly: 4.331 ± 0.062
ArgHis: 1.98 ± 0.044
ArgIle: 3.32 ± 0.063
ArgLys: 1.857 ± 0.053
ArgLeu: 10.31 ± 0.143
ArgMet: 1.553 ± 0.041
ArgAsn: 1.829 ± 0.047
ArgPro: 2.983 ± 0.059
ArgGln: 4.935 ± 0.101
ArgArg: 5.008 ± 0.084
ArgSer: 3.553 ± 0.056
ArgThr: 2.402 ± 0.046
ArgVal: 4.535 ± 0.071
ArgTrp: 1.172 ± 0.04
ArgTyr: 2.148 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.006 ± 0.086
SerCys: 0.569 ± 0.028
SerAsp: 2.67 ± 0.051
SerGlu: 3.329 ± 0.059
SerPhe: 1.867 ± 0.048
SerGly: 4.932 ± 0.068
SerHis: 1.331 ± 0.047
SerIle: 2.354 ± 0.049
SerLys: 1.613 ± 0.045
SerLeu: 7.769 ± 0.104
SerMet: 1.22 ± 0.037
SerAsn: 1.56 ± 0.046
SerPro: 2.531 ± 0.051
SerGln: 3.039 ± 0.065
SerArg: 3.668 ± 0.064
SerSer: 3.372 ± 0.061
SerThr: 2.393 ± 0.061
SerVal: 3.39 ± 0.072
SerTrp: 0.794 ± 0.03
SerTyr: 1.43 ± 0.045
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.482 ± 0.069
ThrCys: 0.471 ± 0.022
ThrAsp: 1.936 ± 0.044
ThrGlu: 1.913 ± 0.052
ThrPhe: 1.417 ± 0.041
ThrGly: 3.525 ± 0.063
ThrHis: 0.863 ± 0.034
ThrIle: 1.566 ± 0.045
ThrLys: 0.784 ± 0.029
ThrLeu: 6.241 ± 0.084
ThrMet: 0.562 ± 0.022
ThrAsn: 0.844 ± 0.031
ThrPro: 2.738 ± 0.053
ThrGln: 1.818 ± 0.046
ThrArg: 2.847 ± 0.051
ThrSer: 2.154 ± 0.048
ThrThr: 1.873 ± 0.051
ThrVal: 2.679 ± 0.066
ThrTrp: 0.541 ± 0.025
ThrTyr: 0.945 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.477 ± 0.105
ValCys: 0.755 ± 0.03
ValAsp: 3.754 ± 0.059
ValGlu: 4.496 ± 0.083
ValPhe: 2.273 ± 0.053
ValGly: 4.482 ± 0.071
ValHis: 1.324 ± 0.039
ValIle: 3.352 ± 0.066
ValLys: 2.026 ± 0.052
ValLeu: 8.093 ± 0.116
ValMet: 1.488 ± 0.035
ValAsn: 1.911 ± 0.049
ValPro: 2.998 ± 0.07
ValGln: 3.08 ± 0.052
ValArg: 4.494 ± 0.067
ValSer: 3.929 ± 0.072
ValThr: 2.89 ± 0.062
ValVal: 4.799 ± 0.081
ValTrp: 0.789 ± 0.031
ValTyr: 1.468 ± 0.036
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.158 ± 0.038
TrpCys: 0.171 ± 0.014
TrpAsp: 0.578 ± 0.02
TrpGlu: 0.522 ± 0.024
TrpPhe: 0.456 ± 0.02
TrpGly: 0.834 ± 0.031
TrpHis: 0.411 ± 0.021
TrpIle: 0.538 ± 0.025
TrpLys: 0.351 ± 0.017
TrpLeu: 2.825 ± 0.073
TrpMet: 0.327 ± 0.017
TrpAsn: 0.344 ± 0.02
TrpPro: 0.802 ± 0.037
TrpGln: 1.427 ± 0.05
TrpArg: 1.286 ± 0.039
TrpSer: 0.701 ± 0.025
TrpThr: 0.474 ± 0.022
TrpVal: 0.855 ± 0.033
TrpTrp: 0.235 ± 0.019
TrpTyr: 0.348 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.722 ± 0.055
TyrCys: 0.324 ± 0.022
TyrAsp: 1.199 ± 0.037
TyrGlu: 1.069 ± 0.028
TyrPhe: 0.893 ± 0.031
TyrGly: 1.895 ± 0.041
TyrHis: 0.576 ± 0.027
TyrIle: 0.905 ± 0.028
TyrLys: 0.673 ± 0.026
TyrLeu: 3.233 ± 0.062
TyrMet: 0.413 ± 0.02
TyrAsn: 0.633 ± 0.028
TyrPro: 1.299 ± 0.037
TyrGln: 1.547 ± 0.046
TyrArg: 2.119 ± 0.043
TyrSer: 1.445 ± 0.042
TyrThr: 0.978 ± 0.034
TyrVal: 1.497 ± 0.041
TyrTrp: 0.459 ± 0.02
TyrTyr: 0.685 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3166 proteins (1018675 amino acids)
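
To give a feel for the scale behind these numbers, the sketch below turns a per mille value back into an approximate count. It assumes that dipeptides are counted as overlapping pairs within each protein (so a protein of length n contributes n - 1 pairs) and that no pair spans two proteins. The page does not state this explicitly, so the result is only an estimate.

```python
N_PROTEINS = 3166        # from the statistics line on this page
N_RESIDUES = 1018675     # from the statistics line on this page

# Assumption: each protein of length n contributes n - 1 overlapping pairs.
n_pairs = N_RESIDUES - N_PROTEINS

ala_ala_per_mille = 14.128  # AlaAla frequency from the table on this page
estimated_count = ala_ala_per_mille * 1e-3 * n_pairs

print(n_pairs)
print(round(estimated_count))
```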

Note: The error has been estimated by bootstrapping (×100) at the protein level.
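
A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. Reporting the standard deviation as the error, the function names, the fixed seed, and the toy sequences are assumptions for this example. The page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a pair's per mille frequency.

    Each round draws len(sequences) proteins with replacement, so whole
    proteins (not single residues) are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "ALAAK", "GGAL", "AAAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins keeps residues from the same protein together, so the error reflects variation between proteins and not only between individual positions.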

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski